BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 001655
         (1036 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|225443990|ref|XP_002280735.1| PREDICTED: DNA damage-binding protein 1 isoform 1 [Vitis vinifera]
          Length = 1089

 Score = 2034 bits (5270), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 993/1037 (95%), Positives = 1020/1037 (98%), Gaps = 1/1037 (0%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAE+SE+ITRAMGDVSDRIG
Sbjct: 53   MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAENSEVITRAMGDVSDRIG 112

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC
Sbjct: 113  RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 172

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
            +KPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPW+QNNLDNGADLLIPVPPPLCGVLII
Sbjct: 173  SKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWAQNNLDNGADLLIPVPPPLCGVLII 232

Query: 181  GEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL 240
            GEETIVYCSA+AFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL
Sbjct: 233  GEETIVYCSASAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL 292

Query: 241  KIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIV 300
            KIELLGETSIASTISYLDNA VY+GSSYGDSQLIK++LQPDAKGSYVEVLERYVNLGPIV
Sbjct: 293  KIELLGETSIASTISYLDNAFVYVGSSYGDSQLIKIHLQPDAKGSYVEVLERYVNLGPIV 352

Query: 301  DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP 360
            DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP
Sbjct: 353  DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP 412

Query: 361  FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLV 420
             DTFLVVSFISETRILAMNLEDELEETEIEGFCSQ QTLFCHDA+Y+QLVQVTS SVRLV
Sbjct: 413  HDTFLVVSFISETRILAMNLEDELEETEIEGFCSQVQTLFCHDAVYDQLVQVTSSSVRLV 472

Query: 421  SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEI 480
             STSRELRNEWK+P GYSVNVATANA+QVLLATGGGHLVYLEIGDG LTEVKHAQLEY+I
Sbjct: 473  GSTSRELRNEWKAPSGYSVNVATANATQVLLATGGGHLVYLEIGDGTLTEVKHAQLEYDI 532

Query: 481  SCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEG 540
            SCLDINPIGENP++SQ+AAVGMWTDISVRIFSLPDLNLITKE+LGGEIIPRSVLLC+FEG
Sbjct: 533  SCLDINPIGENPNFSQLAAVGMWTDISVRIFSLPDLNLITKEYLGGEIIPRSVLLCSFEG 592

Query: 541  ISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV 600
            I YLLCALGDGHLLNFLLNM TGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV
Sbjct: 593  IPYLLCALGDGHLLNFLLNMSTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV 652

Query: 601  IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG 660
            IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEG+LTIGTIDDIQKLHIRSIPLG
Sbjct: 653  IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGDLTIGTIDDIQKLHIRSIPLG 712

Query: 661  EHPRRICHQEQSRTFAICSLK-NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSI 719
            EH RRICHQEQSRTFAICSLK NQS  E+SEMHF+RLLDDQTFEFISTYPLDTFEYGCSI
Sbjct: 713  EHARRICHQEQSRTFAICSLKYNQSSTEDSEMHFIRLLDDQTFEFISTYPLDTFEYGCSI 772

Query: 720  LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAF 779
            LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAF
Sbjct: 773  LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAF 832

Query: 780  NGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISL 839
            NGKLLAAINQKIQLYKWMLRDDGTRELQSE GHHGHILALYVQTRGDFIVVGDLMKSISL
Sbjct: 833  NGKLLAAINQKIQLYKWMLRDDGTRELQSESGHHGHILALYVQTRGDFIVVGDLMKSISL 892

Query: 840  LIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRL 899
            LIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFN+FTVRKNSEGATDEERGRL
Sbjct: 893  LIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNIFTVRKNSEGATDEERGRL 952

Query: 900  EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEK 959
            EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPH+QY+FLEK
Sbjct: 953  EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHDQYVFLEK 1012

Query: 960  LQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN 1019
            LQ NLRKVIKGVGGL+HEQWRSFNNEKKTVDAKNFLDGDLIE+FLDL+RTRMDEISK M 
Sbjct: 1013 LQANLRKVIKGVGGLSHEQWRSFNNEKKTVDAKNFLDGDLIETFLDLNRTRMDEISKAMA 1072

Query: 1020 VSVEELCKRVEELTRLH 1036
            VSVEELCKRVEELTRLH
Sbjct: 1073 VSVEELCKRVEELTRLH 1089


>gi|224061051|ref|XP_002300334.1| predicted protein [Populus trichocarpa]
 gi|222847592|gb|EEE85139.1| predicted protein [Populus trichocarpa]
          Length = 1088

 Score = 2023 bits (5242), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 984/1036 (94%), Positives = 1014/1036 (97%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAE+SELITRAMGDVSDRIG
Sbjct: 53   MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAETSELITRAMGDVSDRIG 112

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL+GC
Sbjct: 113  RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLHGC 172

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
            +KPTIVVLYQDNKDARHVKTYEVALKDKDF+EGPWSQNNLDNGADLLIPVPPP CGVLII
Sbjct: 173  SKPTIVVLYQDNKDARHVKTYEVALKDKDFIEGPWSQNNLDNGADLLIPVPPPFCGVLII 232

Query: 181  GEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL 240
            GEETIVYCSAN F+AIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL
Sbjct: 233  GEETIVYCSANVFRAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL 292

Query: 241  KIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIV 300
            KIELLGETSIASTISYLDNA V+IGSSYGDSQL+KLNL PDAKG+YVEVL+RYVNLGPIV
Sbjct: 293  KIELLGETSIASTISYLDNAFVFIGSSYGDSQLVKLNLHPDAKGTYVEVLDRYVNLGPIV 352

Query: 301  DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP 360
            DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRS TDDP
Sbjct: 353  DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSLTDDP 412

Query: 361  FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLV 420
            FDTFLVVSFISETRILAMN+EDELEETEIEGFCSQ QTLFCH A++NQLVQVTS SVRLV
Sbjct: 413  FDTFLVVSFISETRILAMNIEDELEETEIEGFCSQVQTLFCHCAVFNQLVQVTSSSVRLV 472

Query: 421  SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEI 480
            SST+RELR EW +P GYS+NVATANA+QVLLATGGGHLVYLEIGDG LT+ KHAQLE EI
Sbjct: 473  SSTTRELRQEWNAPSGYSINVATANATQVLLATGGGHLVYLEIGDGTLTQAKHAQLECEI 532

Query: 481  SCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEG 540
            SCLDINPIGENP+YSQ+AAVGMWTDISVRIFSLPDLNLITKE LGGEIIPRSVLLC+FEG
Sbjct: 533  SCLDINPIGENPNYSQLAAVGMWTDISVRIFSLPDLNLITKEPLGGEIIPRSVLLCSFEG 592

Query: 541  ISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV 600
            I+YLLCALGDGHLLNFLLN+ TG L DRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV
Sbjct: 593  IAYLLCALGDGHLLNFLLNLSTGGLKDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV 652

Query: 601  IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG 660
            IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGEL+IGTIDDIQKLHIRSIPLG
Sbjct: 653  IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELSIGTIDDIQKLHIRSIPLG 712

Query: 661  EHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSIL 720
            EH RRICHQEQSRTF+ICS+KNQS AEESEMHF+RLLDDQTFEFISTYPLDTFEYGCSIL
Sbjct: 713  EHARRICHQEQSRTFSICSMKNQSNAEESEMHFIRLLDDQTFEFISTYPLDTFEYGCSIL 772

Query: 721  SCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFN 780
            SCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFN
Sbjct: 773  SCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFN 832

Query: 781  GKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLL 840
            GKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLL
Sbjct: 833  GKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLL 892

Query: 841  IYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLE 900
            IYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLE
Sbjct: 893  IYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLE 952

Query: 901  VVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKL 960
            VVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKL
Sbjct: 953  VVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKL 1012

Query: 961  QTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV 1020
            Q+NLRKVIKGVGGL+HEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSR+RMDEISK M +
Sbjct: 1013 QSNLRKVIKGVGGLSHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRSRMDEISKAMEI 1072

Query: 1021 SVEELCKRVEELTRLH 1036
            SVEELCKRVEELTRLH
Sbjct: 1073 SVEELCKRVEELTRLH 1088


>gi|356512636|ref|XP_003525024.1| PREDICTED: DNA damage-binding protein 1a-like isoform 1 [Glycine max]
          Length = 1089

 Score = 1996 bits (5170), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 973/1037 (93%), Positives = 1010/1037 (97%), Gaps = 1/1037 (0%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            MLDVPIYGRIATLELFRPHGEAQD+LFIATERYKFCVLQWD+E++EL+TRAMGDVSDRIG
Sbjct: 53   MLDVPIYGRIATLELFRPHGEAQDYLFIATERYKFCVLQWDSETAELVTRAMGDVSDRIG 112

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC
Sbjct: 113  RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 172

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
            +KPTIVVLYQDNKDARHVKTYEVALKDKDF+EGPWSQNNLDNGADLLIPVPPPLCGVLII
Sbjct: 173  SKPTIVVLYQDNKDARHVKTYEVALKDKDFLEGPWSQNNLDNGADLLIPVPPPLCGVLII 232

Query: 181  GEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL 240
            GEETIVYCSANAFKAIPIRPSITKAYGRVD DGSRYLLGDH GLL LLVITHEKEKVTGL
Sbjct: 233  GEETIVYCSANAFKAIPIRPSITKAYGRVDPDGSRYLLGDHTGLLSLLVITHEKEKVTGL 292

Query: 241  KIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIV 300
            KIE LGETSIASTISYLDNA VYIGSSYGDSQLIKLNLQPDAKGSYVE LERYVNLGPIV
Sbjct: 293  KIEPLGETSIASTISYLDNAFVYIGSSYGDSQLIKLNLQPDAKGSYVEGLERYVNLGPIV 352

Query: 301  DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP 360
            DFCVVDLERQGQGQVVTCSGAYKDGSLR+VRNGIGINEQASVELQGIKGMWSLRSSTDDP
Sbjct: 353  DFCVVDLERQGQGQVVTCSGAYKDGSLRVVRNGIGINEQASVELQGIKGMWSLRSSTDDP 412

Query: 361  FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLV 420
            FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQ QTLFCHDA++NQLVQVTS SVRLV
Sbjct: 413  FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQVQTLFCHDAVHNQLVQVTSSSVRLV 472

Query: 421  SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEI 480
            SST+RELRNEW +P GYSVNVATANA+QVLLATGGGHLVYLEIGDGIL EVKHAQLEYEI
Sbjct: 473  SSTTRELRNEWHAPSGYSVNVATANATQVLLATGGGHLVYLEIGDGILQEVKHAQLEYEI 532

Query: 481  SCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEG 540
            SCLDINPIGENP++S +AAVGMWTDISVRIFSLPDL+LITKE LGGEIIPRSVLLCAFEG
Sbjct: 533  SCLDINPIGENPNHSHLAAVGMWTDISVRIFSLPDLSLITKEQLGGEIIPRSVLLCAFEG 592

Query: 541  ISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV 600
            ISYLLCALGDGHLLNF+LN  TGEL DRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV
Sbjct: 593  ISYLLCALGDGHLLNFMLNTSTGELADRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV 652

Query: 601  IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG 660
            IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG
Sbjct: 653  IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG 712

Query: 661  EHPRRICHQEQSRTFAICSLK-NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSI 719
            EH RRICHQEQSRTFAICSLK N +  E+SEMHFVRLLDDQTFEFISTY LDT+EYGC I
Sbjct: 713  EHARRICHQEQSRTFAICSLKYNPASGEDSEMHFVRLLDDQTFEFISTYSLDTYEYGCFI 772

Query: 720  LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAF 779
            +SCSFSDD+NVYYCVGTAYVLPEENEPTKGRILVF VEDGKLQLIAEKETKGAVY LNAF
Sbjct: 773  ISCSFSDDNNVYYCVGTAYVLPEENEPTKGRILVFAVEDGKLQLIAEKETKGAVYCLNAF 832

Query: 780  NGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISL 839
            NGKLLAAINQKIQLYKW+LRDDGT ELQSECGHHGHILALYVQTRGDFIVVGDLMKSISL
Sbjct: 833  NGKLLAAINQKIQLYKWVLRDDGTHELQSECGHHGHILALYVQTRGDFIVVGDLMKSISL 892

Query: 840  LIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRL 899
            LIYKHEEGAIEERARDYNANWMSAVEI+DDDIYLGAEN+FNLFTVRKNSEGATDEERGRL
Sbjct: 893  LIYKHEEGAIEERARDYNANWMSAVEIVDDDIYLGAENSFNLFTVRKNSEGATDEERGRL 952

Query: 900  EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEK 959
            EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGT+NGVIGVIASLPHEQY+FLEK
Sbjct: 953  EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTINGVIGVIASLPHEQYVFLEK 1012

Query: 960  LQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN 1019
            LQ+NLRKVIKGVGGL+HEQWRSFNNEKKTV+A+NFLDGDLIESFLDL+R++MDEISK ++
Sbjct: 1013 LQSNLRKVIKGVGGLSHEQWRSFNNEKKTVEARNFLDGDLIESFLDLNRSKMDEISKALD 1072

Query: 1020 VSVEELCKRVEELTRLH 1036
            VSVEELCKRVEELTRLH
Sbjct: 1073 VSVEELCKRVEELTRLH 1089


>gi|356525401|ref|XP_003531313.1| PREDICTED: DNA damage-binding protein 1-like isoform 1 [Glycine max]
          Length = 1089

 Score = 1994 bits (5165), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 971/1037 (93%), Positives = 1010/1037 (97%), Gaps = 1/1037 (0%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            MLDVPIYGRIATLELFRPHGEAQD+LFIATERYKFCVLQWD+E+ EL+TRAMGDVSDRIG
Sbjct: 53   MLDVPIYGRIATLELFRPHGEAQDYLFIATERYKFCVLQWDSETGELVTRAMGDVSDRIG 112

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC
Sbjct: 113  RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 172

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
            +KPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII
Sbjct: 173  SKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 232

Query: 181  GEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL 240
            GEETIVYCSANAFKAIPIRPSITKAYGRVD DGSRYLLGDH GL+ LLVI HEKEKVTGL
Sbjct: 233  GEETIVYCSANAFKAIPIRPSITKAYGRVDPDGSRYLLGDHTGLVSLLVIIHEKEKVTGL 292

Query: 241  KIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIV 300
            KIE LGETSIASTISYLDNA VY+GSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIV
Sbjct: 293  KIEPLGETSIASTISYLDNAFVYVGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIV 352

Query: 301  DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP 360
            DFCVVDLERQGQGQVVTCSGAYKDGSLR+VRNGIGINEQASVELQGIKGMWSLRSSTDDP
Sbjct: 353  DFCVVDLERQGQGQVVTCSGAYKDGSLRVVRNGIGINEQASVELQGIKGMWSLRSSTDDP 412

Query: 361  FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLV 420
            FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQ QTLFCHDA++NQLVQVTS SVRLV
Sbjct: 413  FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQVQTLFCHDAVHNQLVQVTSSSVRLV 472

Query: 421  SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEI 480
            SST+R+LRNEW +P GYSVNVATANA+QVLLATGGGHLVYLEIGDGIL EVKHAQLEYEI
Sbjct: 473  SSTTRDLRNEWHAPSGYSVNVATANATQVLLATGGGHLVYLEIGDGILQEVKHAQLEYEI 532

Query: 481  SCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEG 540
            SCLDINPIGENP++S +AAVGMWTDISVRIFSLPDL+LITKE LGGEIIPRSVLLCAFEG
Sbjct: 533  SCLDINPIGENPNHSNLAAVGMWTDISVRIFSLPDLSLITKEQLGGEIIPRSVLLCAFEG 592

Query: 541  ISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV 600
            ISYLLCALGDGHLLNF+LN  TGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV
Sbjct: 593  ISYLLCALGDGHLLNFMLNTSTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV 652

Query: 601  IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG 660
            IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG
Sbjct: 653  IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG 712

Query: 661  EHPRRICHQEQSRTFAICSLK-NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSI 719
            EH RRICHQEQSRTFAICSLK N +  E+SEMHFVRLLDDQTFEFISTY LDT+EYGC I
Sbjct: 713  EHARRICHQEQSRTFAICSLKYNPASGEDSEMHFVRLLDDQTFEFISTYSLDTYEYGCFI 772

Query: 720  LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAF 779
            +SCSFSDD+NVYYCVGTAYVLPEENEPTKGRI+VF VEDGKLQLIAEKETKGAVY LNAF
Sbjct: 773  ISCSFSDDNNVYYCVGTAYVLPEENEPTKGRIIVFAVEDGKLQLIAEKETKGAVYCLNAF 832

Query: 780  NGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISL 839
            NGKLLAAINQKIQLYKW+LRDDGT ELQSECGHHGHILALYVQTRGDFIVVGDLMKSISL
Sbjct: 833  NGKLLAAINQKIQLYKWVLRDDGTHELQSECGHHGHILALYVQTRGDFIVVGDLMKSISL 892

Query: 840  LIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRL 899
            LIYKHEEGAIEERARDYNANWMSAVEI+DDDIYLGAEN+FNLFTVRKNSEGATDEERGRL
Sbjct: 893  LIYKHEEGAIEERARDYNANWMSAVEIVDDDIYLGAENSFNLFTVRKNSEGATDEERGRL 952

Query: 900  EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEK 959
            EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGT+NGVIGVIASLPHEQY+FLEK
Sbjct: 953  EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTINGVIGVIASLPHEQYVFLEK 1012

Query: 960  LQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN 1019
            LQ+NLRKVIKGVGGL+HEQWRSFNNEKKTV+A+NFLDGDLIESFLDL+R++MDEISK ++
Sbjct: 1013 LQSNLRKVIKGVGGLSHEQWRSFNNEKKTVEARNFLDGDLIESFLDLNRSKMDEISKAVD 1072

Query: 1020 VSVEELCKRVEELTRLH 1036
            VSVEELCKRVEELTRLH
Sbjct: 1073 VSVEELCKRVEELTRLH 1089


>gi|312283457|dbj|BAJ34594.1| unnamed protein product [Thellungiella halophila]
          Length = 1088

 Score = 1983 bits (5137), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 958/1036 (92%), Positives = 1006/1036 (97%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            MLDVP+YGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG
Sbjct: 53   MLDVPMYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 112

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL+GC
Sbjct: 113  RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLFGC 172

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
            AKPTI VLYQDNKDARHVKTYEV+LKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII
Sbjct: 173  AKPTIAVLYQDNKDARHVKTYEVSLKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 232

Query: 181  GEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL 240
            GEETIVYCSANAFKAIPIRPSITKAYGRVD DGSRYLLGDHAGL+HLLVITHEKEKVTGL
Sbjct: 233  GEETIVYCSANAFKAIPIRPSITKAYGRVDVDGSRYLLGDHAGLIHLLVITHEKEKVTGL 292

Query: 241  KIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIV 300
            KIELLGETSIASTISYLDNAVV++GSSYGDSQL+KLNL PDAKGSYVEVLERYVNLGPIV
Sbjct: 293  KIELLGETSIASTISYLDNAVVFVGSSYGDSQLVKLNLHPDAKGSYVEVLERYVNLGPIV 352

Query: 301  DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP 360
            DFCVVDLERQGQGQVVTCSGA+KDGSLRIVRNGIGINEQASVELQGIKGMWSL+SS D+ 
Sbjct: 353  DFCVVDLERQGQGQVVTCSGAFKDGSLRIVRNGIGINEQASVELQGIKGMWSLKSSIDEA 412

Query: 361  FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLV 420
            FDTFLVVSFISETR+LAMNLEDELEETEIEGF SQ QTLFCHDA+YNQLVQVTS SVRLV
Sbjct: 413  FDTFLVVSFISETRVLAMNLEDELEETEIEGFLSQVQTLFCHDAVYNQLVQVTSNSVRLV 472

Query: 421  SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEI 480
            SST+RELR+EW +P G++VNVATANASQVLLATGGGHLVYLEIGDG LTEV+HA LEYE+
Sbjct: 473  SSTTRELRDEWHAPAGFTVNVATANASQVLLATGGGHLVYLEIGDGKLTEVQHAVLEYEV 532

Query: 481  SCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEG 540
            SCLDINPIG+NP+YSQ+A+VGMWTDISVRIFSLP+L LITKE LGGEIIPRSVLLCAFEG
Sbjct: 533  SCLDINPIGDNPNYSQLASVGMWTDISVRIFSLPELTLITKEQLGGEIIPRSVLLCAFEG 592

Query: 541  ISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV 600
            ISYLLCALGDGHLLNF L+  TG+L DRKKVSLGTQPITLRTFSSKN THVFAASDRPTV
Sbjct: 593  ISYLLCALGDGHLLNFQLDTTTGQLKDRKKVSLGTQPITLRTFSSKNATHVFAASDRPTV 652

Query: 601  IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG 660
            IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIA+EGELTIGTIDDIQKLHIR+IPLG
Sbjct: 653  IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAREGELTIGTIDDIQKLHIRTIPLG 712

Query: 661  EHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSIL 720
            EH RRICHQEQ+RTF ICSL NQ+ AEESEMHFVRLLDDQ+FEF+STYPLD FEYGCSIL
Sbjct: 713  EHARRICHQEQTRTFGICSLGNQTNAEESEMHFVRLLDDQSFEFVSTYPLDAFEYGCSIL 772

Query: 721  SCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFN 780
            SCSF+DD NVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKG+VYSLNAFN
Sbjct: 773  SCSFADDKNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGSVYSLNAFN 832

Query: 781  GKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLL 840
            GKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLL
Sbjct: 833  GKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLL 892

Query: 841  IYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLE 900
            IYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNL TV+KNSEGATDEERGRLE
Sbjct: 893  IYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLLTVKKNSEGATDEERGRLE 952

Query: 901  VVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKL 960
            VVGEYHLGEFVNRFRHGSLVMRLPDS++GQIPTVIFGTVNGVIGVIASLP EQY+FLEKL
Sbjct: 953  VVGEYHLGEFVNRFRHGSLVMRLPDSEIGQIPTVIFGTVNGVIGVIASLPQEQYMFLEKL 1012

Query: 961  QTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV 1020
            Q++LRKVIKGVGGL+HEQWRSFNNEK+T +A+NFLDGDLIESFLDLSR +M++ISK+MNV
Sbjct: 1013 QSSLRKVIKGVGGLSHEQWRSFNNEKRTAEARNFLDGDLIESFLDLSRNKMEDISKSMNV 1072

Query: 1021 SVEELCKRVEELTRLH 1036
             VEELCKRVEELTRLH
Sbjct: 1073 QVEELCKRVEELTRLH 1088


>gi|297809743|ref|XP_002872755.1| UV-damaged DNA-binding protein 1A [Arabidopsis lyrata subsp. lyrata]
 gi|297318592|gb|EFH49014.1| UV-damaged DNA-binding protein 1A [Arabidopsis lyrata subsp. lyrata]
          Length = 1088

 Score = 1982 bits (5134), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 958/1036 (92%), Positives = 1006/1036 (97%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG
Sbjct: 53   MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 112

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL+GC
Sbjct: 113  RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLFGC 172

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
            AKPTI VLYQDNKDARHVKTYEV+LKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII
Sbjct: 173  AKPTIAVLYQDNKDARHVKTYEVSLKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 232

Query: 181  GEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL 240
            GEETIVYCSANAFKAIPIRPSITKAYGRVD DGSRYLLGDHAGL+HLLVITHEKEKVTGL
Sbjct: 233  GEETIVYCSANAFKAIPIRPSITKAYGRVDVDGSRYLLGDHAGLIHLLVITHEKEKVTGL 292

Query: 241  KIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIV 300
            KIELLGETSIASTISYLDNAVV++GSSYGDSQL+KLNL PDAKGSYVEVLERY+NLGPIV
Sbjct: 293  KIELLGETSIASTISYLDNAVVFVGSSYGDSQLVKLNLHPDAKGSYVEVLERYINLGPIV 352

Query: 301  DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP 360
            DFCVVDLERQGQGQVVTCSGA+KDGSLR+VRNGIGINEQASVELQGIKGMWSL+SS D+ 
Sbjct: 353  DFCVVDLERQGQGQVVTCSGAFKDGSLRVVRNGIGINEQASVELQGIKGMWSLKSSIDEA 412

Query: 361  FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLV 420
            FDTFLVVSFISETRILAMNLEDELEETEIEGF SQ QTLFCHDA+YNQLVQVTS SVRLV
Sbjct: 413  FDTFLVVSFISETRILAMNLEDELEETEIEGFLSQVQTLFCHDAVYNQLVQVTSNSVRLV 472

Query: 421  SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEI 480
            SST+RELR+EW +P G++VNVATANASQVLLATGGGHLVYLEIGDG LTEV+HA LEYE+
Sbjct: 473  SSTTRELRDEWHAPAGFTVNVATANASQVLLATGGGHLVYLEIGDGKLTEVQHALLEYEV 532

Query: 481  SCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEG 540
            SCLDINPIG+NP+YSQ+A+VGMWTDISVRIFSLP+L LITKE LGGEIIPRSVLLCAFEG
Sbjct: 533  SCLDINPIGDNPNYSQLASVGMWTDISVRIFSLPELTLITKEQLGGEIIPRSVLLCAFEG 592

Query: 541  ISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV 600
            ISYLLCALGDGHLLNF L+  TG+L DRKKVSLGTQPITLRTFSSK+ THVFAASDRPTV
Sbjct: 593  ISYLLCALGDGHLLNFQLDTTTGQLKDRKKVSLGTQPITLRTFSSKSATHVFAASDRPTV 652

Query: 601  IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG 660
            IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIA+EGELTIGTIDDIQKLHIR+IPLG
Sbjct: 653  IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAREGELTIGTIDDIQKLHIRTIPLG 712

Query: 661  EHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSIL 720
            EH RRICHQEQ+RTF ICSL NQS AEESEMHFVRLLDDQTFEF+STYPLD+FEYGCSIL
Sbjct: 713  EHARRICHQEQTRTFGICSLGNQSNAEESEMHFVRLLDDQTFEFMSTYPLDSFEYGCSIL 772

Query: 721  SCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFN 780
            SCSF+DD NVYYCVGTAYVLPEENEPTKGRILVFIVEDG+LQLIAEKETKGAVYSLNAFN
Sbjct: 773  SCSFTDDKNVYYCVGTAYVLPEENEPTKGRILVFIVEDGRLQLIAEKETKGAVYSLNAFN 832

Query: 781  GKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLL 840
            GKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLL
Sbjct: 833  GKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLL 892

Query: 841  IYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLE 900
            +YKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNL TV+KNSEGATDEERGRLE
Sbjct: 893  LYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLVTVKKNSEGATDEERGRLE 952

Query: 901  VVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKL 960
            VVGEYHLGEFVNRFRHGSLVMRLPDS++GQIPTVIFGTVNGVIGVIASLP EQY FLEKL
Sbjct: 953  VVGEYHLGEFVNRFRHGSLVMRLPDSEIGQIPTVIFGTVNGVIGVIASLPQEQYTFLEKL 1012

Query: 961  QTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV 1020
            Q++LRKVIKGVGGL+HEQWRSFNNEK+T +A+NFLDGDLIESFLDLSR +M++ISK+MNV
Sbjct: 1013 QSSLRKVIKGVGGLSHEQWRSFNNEKRTAEARNFLDGDLIESFLDLSRNKMEDISKSMNV 1072

Query: 1021 SVEELCKRVEELTRLH 1036
             VEELCKRVEELTRLH
Sbjct: 1073 QVEELCKRVEELTRLH 1088


>gi|225443992|ref|XP_002280744.1| PREDICTED: DNA damage-binding protein 1 isoform 2 [Vitis vinifera]
          Length = 1068

 Score = 1977 bits (5121), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 972/1037 (93%), Positives = 999/1037 (96%), Gaps = 22/1037 (2%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAE+SE+ITRAMGDVSDRIG
Sbjct: 53   MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAENSEVITRAMGDVSDRIG 112

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RPTDNGQ                     VIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC
Sbjct: 113  RPTDNGQ---------------------VIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 151

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
            +KPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPW+QNNLDNGADLLIPVPPPLCGVLII
Sbjct: 152  SKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWAQNNLDNGADLLIPVPPPLCGVLII 211

Query: 181  GEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL 240
            GEETIVYCSA+AFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL
Sbjct: 212  GEETIVYCSASAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL 271

Query: 241  KIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIV 300
            KIELLGETSIASTISYLDNA VY+GSSYGDSQLIK++LQPDAKGSYVEVLERYVNLGPIV
Sbjct: 272  KIELLGETSIASTISYLDNAFVYVGSSYGDSQLIKIHLQPDAKGSYVEVLERYVNLGPIV 331

Query: 301  DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP 360
            DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP
Sbjct: 332  DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP 391

Query: 361  FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLV 420
             DTFLVVSFISETRILAMNLEDELEETEIEGFCSQ QTLFCHDA+Y+QLVQVTS SVRLV
Sbjct: 392  HDTFLVVSFISETRILAMNLEDELEETEIEGFCSQVQTLFCHDAVYDQLVQVTSSSVRLV 451

Query: 421  SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEI 480
             STSRELRNEWK+P GYSVNVATANA+QVLLATGGGHLVYLEIGDG LTEVKHAQLEY+I
Sbjct: 452  GSTSRELRNEWKAPSGYSVNVATANATQVLLATGGGHLVYLEIGDGTLTEVKHAQLEYDI 511

Query: 481  SCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEG 540
            SCLDINPIGENP++SQ+AAVGMWTDISVRIFSLPDLNLITKE+LGGEIIPRSVLLC+FEG
Sbjct: 512  SCLDINPIGENPNFSQLAAVGMWTDISVRIFSLPDLNLITKEYLGGEIIPRSVLLCSFEG 571

Query: 541  ISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV 600
            I YLLCALGDGHLLNFLLNM TGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV
Sbjct: 572  IPYLLCALGDGHLLNFLLNMSTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV 631

Query: 601  IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG 660
            IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEG+LTIGTIDDIQKLHIRSIPLG
Sbjct: 632  IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGDLTIGTIDDIQKLHIRSIPLG 691

Query: 661  EHPRRICHQEQSRTFAICSLK-NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSI 719
            EH RRICHQEQSRTFAICSLK NQS  E+SEMHF+RLLDDQTFEFISTYPLDTFEYGCSI
Sbjct: 692  EHARRICHQEQSRTFAICSLKYNQSSTEDSEMHFIRLLDDQTFEFISTYPLDTFEYGCSI 751

Query: 720  LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAF 779
            LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAF
Sbjct: 752  LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAF 811

Query: 780  NGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISL 839
            NGKLLAAINQKIQLYKWMLRDDGTRELQSE GHHGHILALYVQTRGDFIVVGDLMKSISL
Sbjct: 812  NGKLLAAINQKIQLYKWMLRDDGTRELQSESGHHGHILALYVQTRGDFIVVGDLMKSISL 871

Query: 840  LIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRL 899
            LIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFN+FTVRKNSEGATDEERGRL
Sbjct: 872  LIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNIFTVRKNSEGATDEERGRL 931

Query: 900  EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEK 959
            EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPH+QY+FLEK
Sbjct: 932  EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHDQYVFLEK 991

Query: 960  LQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN 1019
            LQ NLRKVIKGVGGL+HEQWRSFNNEKKTVDAKNFLDGDLIE+FLDL+RTRMDEISK M 
Sbjct: 992  LQANLRKVIKGVGGLSHEQWRSFNNEKKTVDAKNFLDGDLIETFLDLNRTRMDEISKAMA 1051

Query: 1020 VSVEELCKRVEELTRLH 1036
            VSVEELCKRVEELTRLH
Sbjct: 1052 VSVEELCKRVEELTRLH 1068


>gi|15235577|ref|NP_192451.1| DNA damage-binding protein 1a [Arabidopsis thaliana]
 gi|55976605|sp|Q9M0V3.1|DDB1A_ARATH RecName: Full=DNA damage-binding protein 1a; AltName: Full=UV-damaged
            DNA-binding protein 1a; Short=DDB1a
 gi|7267302|emb|CAB81084.1| UV-damaged DNA binding factor-like protein [Arabidopsis thaliana]
 gi|25054828|gb|AAN71904.1| putative UV-damaged DNA binding factor [Arabidopsis thaliana]
 gi|332657117|gb|AEE82517.1| DNA damage-binding protein 1a [Arabidopsis thaliana]
          Length = 1088

 Score = 1974 bits (5114), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 952/1036 (91%), Positives = 1005/1036 (97%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWD ESSELITRAMGDVSDRIG
Sbjct: 53   MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDPESSELITRAMGDVSDRIG 112

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL+GC
Sbjct: 113  RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLFGC 172

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
            AKPTI VLYQDNKDARHVKTYEV+LKDKDFVEGPWSQN+LDNGADLLIPVPPPLCGVLII
Sbjct: 173  AKPTIAVLYQDNKDARHVKTYEVSLKDKDFVEGPWSQNSLDNGADLLIPVPPPLCGVLII 232

Query: 181  GEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL 240
            GEETIVYCSA+AFKAIPIRPSITKAYGRVD DGSRYLLGDHAG++HLLVITHEKEKVTGL
Sbjct: 233  GEETIVYCSASAFKAIPIRPSITKAYGRVDVDGSRYLLGDHAGMIHLLVITHEKEKVTGL 292

Query: 241  KIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIV 300
            KIELLGETSIASTISYLDNAVV++GSSYGDSQL+KLNL PDAKGSYVEVLERY+NLGPIV
Sbjct: 293  KIELLGETSIASTISYLDNAVVFVGSSYGDSQLVKLNLHPDAKGSYVEVLERYINLGPIV 352

Query: 301  DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP 360
            DFCVVDLERQGQGQVVTCSGA+KDGSLR+VRNGIGINEQASVELQGIKGMWSL+SS D+ 
Sbjct: 353  DFCVVDLERQGQGQVVTCSGAFKDGSLRVVRNGIGINEQASVELQGIKGMWSLKSSIDEA 412

Query: 361  FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLV 420
            FDTFLVVSFISETRILAMNLEDELEETEIEGF SQ QTLFCHDA+YNQLVQVTS SVRLV
Sbjct: 413  FDTFLVVSFISETRILAMNLEDELEETEIEGFLSQVQTLFCHDAVYNQLVQVTSNSVRLV 472

Query: 421  SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEI 480
            SST+RELR+EW +P G++VNVATANASQVLLATGGGHLVYLEIGDG LTEV+HA LEYE+
Sbjct: 473  SSTTRELRDEWHAPAGFTVNVATANASQVLLATGGGHLVYLEIGDGKLTEVQHALLEYEV 532

Query: 481  SCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEG 540
            SCLDINPIG+NP+YSQ+AAVGMWTDISVRIFSLP+L LITKE LGGEIIPRSVLLCAFEG
Sbjct: 533  SCLDINPIGDNPNYSQLAAVGMWTDISVRIFSLPELTLITKEQLGGEIIPRSVLLCAFEG 592

Query: 541  ISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV 600
            ISYLLCALGDGHLLNF ++  TG+L DRKKVSLGTQPITLRTFSSK+ THVFAASDRPTV
Sbjct: 593  ISYLLCALGDGHLLNFQMDTTTGQLKDRKKVSLGTQPITLRTFSSKSATHVFAASDRPTV 652

Query: 601  IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG 660
            IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIA+EGELTIGTIDDIQKLHIR+IPLG
Sbjct: 653  IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAREGELTIGTIDDIQKLHIRTIPLG 712

Query: 661  EHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSIL 720
            EH RRICHQEQ+RTF ICSL NQS +EESEMHFVRLLDDQTFEF+STYPLD+FEYGCSIL
Sbjct: 713  EHARRICHQEQTRTFGICSLGNQSNSEESEMHFVRLLDDQTFEFMSTYPLDSFEYGCSIL 772

Query: 721  SCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFN 780
            SCSF++D NVYYCVGTAYVLPEENEPTKGRILVFIVEDG+LQLIAEKETKGAVYSLNAFN
Sbjct: 773  SCSFTEDKNVYYCVGTAYVLPEENEPTKGRILVFIVEDGRLQLIAEKETKGAVYSLNAFN 832

Query: 781  GKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLL 840
            GKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLL
Sbjct: 833  GKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLL 892

Query: 841  IYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLE 900
            +YKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNL TV+KNSEGATDEERGRLE
Sbjct: 893  LYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLLTVKKNSEGATDEERGRLE 952

Query: 901  VVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKL 960
            VVGEYHLGEFVNRFRHGSLVMRLPDS++GQIPTVIFGTVNGVIGVIASLP EQY FLEKL
Sbjct: 953  VVGEYHLGEFVNRFRHGSLVMRLPDSEIGQIPTVIFGTVNGVIGVIASLPQEQYTFLEKL 1012

Query: 961  QTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV 1020
            Q++LRKVIKGVGGL+HEQWRSFNNEK+T +A+NFLDGDLIESFLDLSR +M++ISK+MNV
Sbjct: 1013 QSSLRKVIKGVGGLSHEQWRSFNNEKRTAEARNFLDGDLIESFLDLSRNKMEDISKSMNV 1072

Query: 1021 SVEELCKRVEELTRLH 1036
             VEELCKRVEELTRLH
Sbjct: 1073 QVEELCKRVEELTRLH 1088


>gi|350537001|ref|NP_001234275.1| DNA damage-binding protein 1 [Solanum lycopersicum]
 gi|350539125|ref|NP_001233864.1| UV damaged DNA binding protein 1 [Solanum lycopersicum]
 gi|55976440|sp|Q6QNU4.1|DDB1_SOLLC RecName: Full=DNA damage-binding protein 1; AltName: Full=High
            pigmentation protein 1; AltName: Full=UV-damaged
            DNA-binding protein 1
 gi|38455768|gb|AAR20885.1| UV damaged DNA binding protein 1 [Solanum lycopersicum]
 gi|42602165|gb|AAS21683.1| UV-damaged DNA binding protein 1 [Solanum lycopersicum]
          Length = 1090

 Score = 1967 bits (5095), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 959/1038 (92%), Positives = 1004/1038 (96%), Gaps = 2/1038 (0%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            MLDVPIYGRIATLELFRPHGE QD LFIATERYKFCVLQWD E+SE+ITRAMGDVSDRIG
Sbjct: 53   MLDVPIYGRIATLELFRPHGETQDLLFIATERYKFCVLQWDTEASEVITRAMGDVSDRIG 112

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC
Sbjct: 113  RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 172

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
             KPTIVVLYQDNKDARHVKTYEV+LKDKDF+EGPW+QNNLDNGA LLIPVPPPLCGVLII
Sbjct: 173  PKPTIVVLYQDNKDARHVKTYEVSLKDKDFIEGPWAQNNLDNGASLLIPVPPPLCGVLII 232

Query: 181  GEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL 240
            GEETIVYCSA+AFKAIPIRPSIT+AYGRVDADGSRYLLGDH GLLHLLVITHEKEKVTGL
Sbjct: 233  GEETIVYCSASAFKAIPIRPSITRAYGRVDADGSRYLLGDHNGLLHLLVITHEKEKVTGL 292

Query: 241  KIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIV 300
            KIELLGETSIASTISYLDNA V+IGSSYGDSQL+KLNLQPD KGSYVEVLERYVNLGPIV
Sbjct: 293  KIELLGETSIASTISYLDNAFVFIGSSYGDSQLVKLNLQPDTKGSYVEVLERYVNLGPIV 352

Query: 301  DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP 360
            DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRS+TDDP
Sbjct: 353  DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSATDDP 412

Query: 361  FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLV 420
            +DTFLVVSFISETR+LAMNLEDELEETEIEGF SQ QTLFCHDA+YNQLVQVTS SVRLV
Sbjct: 413  YDTFLVVSFISETRVLAMNLEDELEETEIEGFNSQVQTLFCHDAVYNQLVQVTSNSVRLV 472

Query: 421  SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEI 480
            SSTSR+L+NEW +P GYSVNVATANA+QVLLATGGGHLVYLEIGDG+L EVK+A+L+Y+I
Sbjct: 473  SSTSRDLKNEWFAPVGYSVNVATANATQVLLATGGGHLVYLEIGDGVLNEVKYAKLDYDI 532

Query: 481  SCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEG 540
            SCLDINPIGENP+YS IAAVGMWTDISVRI+SLPDLNLITKE LGGEIIPRSVL+C+FEG
Sbjct: 533  SCLDINPIGENPNYSNIAAVGMWTDISVRIYSLPDLNLITKEQLGGEIIPRSVLMCSFEG 592

Query: 541  ISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV 600
            ISYLLCALGDGHLLNF+L+M TGELTDRKKVSLGTQPITLRTFSSK+TTHVFAASDRPTV
Sbjct: 593  ISYLLCALGDGHLLNFVLSMSTGELTDRKKVSLGTQPITLRTFSSKDTTHVFAASDRPTV 652

Query: 601  IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG 660
            IYSSNKKLLYSNVNLKEVSHMCPFN AAFPDSLAIAKEGELTIGTID+IQKLHIRSIPLG
Sbjct: 653  IYSSNKKLLYSNVNLKEVSHMCPFNVAAFPDSLAIAKEGELTIGTIDEIQKLHIRSIPLG 712

Query: 661  EHPRRICHQEQSRTFAICSLK-NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSI 719
            EH RRI HQEQ+RTFA+CS+K  QS A++ EMHFVRLLDDQTFEFISTYPLD FEYGCSI
Sbjct: 713  EHARRISHQEQTRTFALCSVKYTQSNADDPEMHFVRLLDDQTFEFISTYPLDQFEYGCSI 772

Query: 720  LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAF 779
            LSCSFSDDSNVYYC+GTAYV+PEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAF
Sbjct: 773  LSCSFSDDSNVYYCIGTAYVMPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAF 832

Query: 780  NGKLLAAINQKIQLYKWMLRDD-GTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSIS 838
            NGKLLAAINQKIQLYKW  R+D G+RELQ+ECGHHGHILALYVQTRGDFIVVGDLMKSIS
Sbjct: 833  NGKLLAAINQKIQLYKWASREDGGSRELQTECGHHGHILALYVQTRGDFIVVGDLMKSIS 892

Query: 839  LLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGR 898
            LLI+KHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEER R
Sbjct: 893  LLIFKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERSR 952

Query: 899  LEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLE 958
            LEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPH+QYLFLE
Sbjct: 953  LEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHDQYLFLE 1012

Query: 959  KLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTM 1018
            KLQTNLRKVIKGVGGL+HEQWRSF NEKKTVDAKNFLDGDLIESFLDLSR RM+EISK M
Sbjct: 1013 KLQTNLRKVIKGVGGLSHEQWRSFYNEKKTVDAKNFLDGDLIESFLDLSRNRMEEISKAM 1072

Query: 1019 NVSVEELCKRVEELTRLH 1036
            +V VEEL KRVEELTRLH
Sbjct: 1073 SVPVEELMKRVEELTRLH 1090


>gi|55976392|sp|Q6E7D1.1|DDB1_SOLCE RecName: Full=DNA damage-binding protein 1; AltName: Full=UV-damaged
            DNA-binding protein 1
 gi|49484911|gb|AAT66742.1| UV-damaged DNA binding protein 1 [Solanum cheesmaniae]
          Length = 1095

 Score = 1964 bits (5088), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 959/1038 (92%), Positives = 1004/1038 (96%), Gaps = 2/1038 (0%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            MLDVPIYGRIATLELFRPHGE QD LFIATERYKFCVLQWD E+SE+ITRAMGDVSDRIG
Sbjct: 58   MLDVPIYGRIATLELFRPHGETQDLLFIATERYKFCVLQWDTEASEVITRAMGDVSDRIG 117

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC
Sbjct: 118  RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 177

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
             KPTIVVLYQDNKDARHVKTYEV+LKDKDF+EGPW+QNNLDNGA LLIPVPPPLCGVLII
Sbjct: 178  PKPTIVVLYQDNKDARHVKTYEVSLKDKDFIEGPWAQNNLDNGASLLIPVPPPLCGVLII 237

Query: 181  GEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL 240
            GEETIVYCSA+AFKAIPIRPSIT+AYGRVDADGSRYLLGDH GLLHLLVITHEKEKVTGL
Sbjct: 238  GEETIVYCSASAFKAIPIRPSITRAYGRVDADGSRYLLGDHNGLLHLLVITHEKEKVTGL 297

Query: 241  KIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIV 300
            KIELLGETSIASTISYLDNA V+IGSSYGDSQL+KLNLQPD KGSYVEVLERYVNLGPIV
Sbjct: 298  KIELLGETSIASTISYLDNAFVFIGSSYGDSQLVKLNLQPDTKGSYVEVLERYVNLGPIV 357

Query: 301  DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP 360
            DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRS+TDDP
Sbjct: 358  DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSATDDP 417

Query: 361  FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLV 420
            +DTFLVVSFISETR+LAMNLEDELEETEIEGF SQ QTLFCHDA+YNQLVQVTS SVRLV
Sbjct: 418  YDTFLVVSFISETRVLAMNLEDELEETEIEGFNSQVQTLFCHDAVYNQLVQVTSNSVRLV 477

Query: 421  SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEI 480
            SSTSR+L+NEW +P GYSVNVATANA+QVLLATGGGHLVYLEIGDG+L EVK+A+L+Y+I
Sbjct: 478  SSTSRDLKNEWFAPVGYSVNVATANATQVLLATGGGHLVYLEIGDGVLNEVKYAKLDYDI 537

Query: 481  SCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEG 540
            SCLDINPIGENP+YS IAAVGMWTDISVRI+SLPDLNLITKE LGGEIIPRSVL+C+FEG
Sbjct: 538  SCLDINPIGENPNYSNIAAVGMWTDISVRIYSLPDLNLITKEQLGGEIIPRSVLMCSFEG 597

Query: 541  ISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV 600
            ISYLLCALGDGHLLNF+L+M TGELTDRKKVSLGTQPITLRTFSSK+TTHVFAASDRPTV
Sbjct: 598  ISYLLCALGDGHLLNFVLSMSTGELTDRKKVSLGTQPITLRTFSSKDTTHVFAASDRPTV 657

Query: 601  IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG 660
            IYSSNKKLLYSNVNLKEVSHMCPFN AAFPDSLAIAKEGELTIGTID+IQKLHIRSIPLG
Sbjct: 658  IYSSNKKLLYSNVNLKEVSHMCPFNVAAFPDSLAIAKEGELTIGTIDEIQKLHIRSIPLG 717

Query: 661  EHPRRICHQEQSRTFAICSLK-NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSI 719
            EH RRI HQEQ+RTFA+CS+K  QS A++ EMHFVRLLDDQTFEFISTYPLD FEYGCSI
Sbjct: 718  EHARRISHQEQTRTFALCSVKYTQSNADDPEMHFVRLLDDQTFEFISTYPLDQFEYGCSI 777

Query: 720  LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAF 779
            LSCSFSDDSNVYYC+GTAYV+PEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAF
Sbjct: 778  LSCSFSDDSNVYYCIGTAYVMPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAF 837

Query: 780  NGKLLAAINQKIQLYKWMLRDD-GTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSIS 838
            NGKLLAAINQKIQLYKW  R+D G+RELQ+ECGHHGHILALYVQTRGDFIVVGDLMKSIS
Sbjct: 838  NGKLLAAINQKIQLYKWASREDGGSRELQTECGHHGHILALYVQTRGDFIVVGDLMKSIS 897

Query: 839  LLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGR 898
            LLI+KHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEER R
Sbjct: 898  LLIFKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERSR 957

Query: 899  LEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLE 958
            LEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPH+QYLFLE
Sbjct: 958  LEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHDQYLFLE 1017

Query: 959  KLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTM 1018
            KLQTNLRKVIKGVGGL+HEQWRSF NEKKTVDAKNFLDGDLIESFLDLSR RM+EISK M
Sbjct: 1018 KLQTNLRKVIKGVGGLSHEQWRSFYNEKKTVDAKNFLDGDLIESFLDLSRNRMEEISKAM 1077

Query: 1019 NVSVEELCKRVEELTRLH 1036
            +V VEEL KRVEELTRLH
Sbjct: 1078 SVPVEELMKRVEELTRLH 1095


>gi|449435512|ref|XP_004135539.1| PREDICTED: DNA damage-binding protein 1-like [Cucumis sativus]
          Length = 1093

 Score = 1952 bits (5057), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 950/1041 (91%), Positives = 1000/1041 (96%), Gaps = 5/1041 (0%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWD ESSELITRAMGDVSDRIG
Sbjct: 53   MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDTESSELITRAMGDVSDRIG 112

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RPTD+GQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC
Sbjct: 113  RPTDSGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 172

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
            ++PTIVVLYQDNKDARHVKTYEV LKDKDFVEGPWSQNNLDNGA +LIPVPPPLCGV+II
Sbjct: 173  SRPTIVVLYQDNKDARHVKTYEVVLKDKDFVEGPWSQNNLDNGAAVLIPVPPPLCGVIII 232

Query: 181  GEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL 240
            GEETIVYCSA AFKAIP+RPSIT+AYGRVDADGSRYLLGDHAGLLHLLVITHEKE+VTGL
Sbjct: 233  GEETIVYCSATAFKAIPVRPSITRAYGRVDADGSRYLLGDHAGLLHLLVITHEKERVTGL 292

Query: 241  KIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIV 300
            KIELLGETSIASTISYLDNA VYIGSSYGDSQL+KLN+QPDAKGSYVEVLERYVNLGPIV
Sbjct: 293  KIELLGETSIASTISYLDNAFVYIGSSYGDSQLVKLNVQPDAKGSYVEVLERYVNLGPIV 352

Query: 301  DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP 360
            DFCVVDLERQGQGQVVTCSGAYKDGSLR+VRNGIGINEQASVELQGIKGMWSLRSSTDDP
Sbjct: 353  DFCVVDLERQGQGQVVTCSGAYKDGSLRVVRNGIGINEQASVELQGIKGMWSLRSSTDDP 412

Query: 361  FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLV 420
            FDTFLVVSFISETRILAMNLEDELEETEIEGF SQ QTLFCHDA++NQLVQVTS SVRLV
Sbjct: 413  FDTFLVVSFISETRILAMNLEDELEETEIEGFNSQVQTLFCHDALFNQLVQVTSSSVRLV 472

Query: 421  SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEI 480
            SST+REL NEW +P  YS+NVATANASQVLLATGGG LV+LEI DG+L E KH QLE+EI
Sbjct: 473  SSTTRELLNEWNAPSNYSINVATANASQVLLATGGGVLVHLEICDGLLVEKKHIQLEHEI 532

Query: 481  SCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEG 540
            SCLDINPIG+NP+ SQ+AAVGMWTDISVRIFSLPDLNL+TKE LGGEIIPRSVLLC FEG
Sbjct: 533  SCLDINPIGDNPNCSQLAAVGMWTDISVRIFSLPDLNLLTKEQLGGEIIPRSVLLCTFEG 592

Query: 541  ISYLLCALGDGHLLNFLLNMKTG----ELTDRKKVSLGTQPITLRTFSSKNTTHVFAASD 596
            ISYLLCALGDGHLLNF+LN  +     EL DRKKVSLGTQPITLRTFSSKN THVFAASD
Sbjct: 593  ISYLLCALGDGHLLNFILNTNSNSNSCELMDRKKVSLGTQPITLRTFSSKNATHVFAASD 652

Query: 597  RPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRS 656
            RPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRS
Sbjct: 653  RPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRS 712

Query: 657  IPLGEHPRRICHQEQSRTFAICSLK-NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEY 715
            IPLGEH RRICHQEQSRTFAICSL+ NQS  E++EMHF+RLLDDQTFE ISTY LDT+EY
Sbjct: 713  IPLGEHARRICHQEQSRTFAICSLRYNQSGTEDTEMHFIRLLDDQTFESISTYALDTYEY 772

Query: 716  GCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYS 775
            GCSILSCSFSDD+NVYYCVGTAYV+PEENEPTKGRILVF+VE+GKLQLIAEKETKG+VYS
Sbjct: 773  GCSILSCSFSDDNNVYYCVGTAYVMPEENEPTKGRILVFVVEEGKLQLIAEKETKGSVYS 832

Query: 776  LNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMK 835
            LNAFNGKLLAAINQKIQLYKW LRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMK
Sbjct: 833  LNAFNGKLLAAINQKIQLYKWTLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMK 892

Query: 836  SISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEE 895
            SISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAEN FNLFTVRKNSEGATDEE
Sbjct: 893  SISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENYFNLFTVRKNSEGATDEE 952

Query: 896  RGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYL 955
            R RLEVVGEYHLGEFVNRF+HGSLVMRLPDSDVGQIPTVIFG+VNGVIGVIASLPH+QY+
Sbjct: 953  RSRLEVVGEYHLGEFVNRFQHGSLVMRLPDSDVGQIPTVIFGSVNGVIGVIASLPHDQYV 1012

Query: 956  FLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEIS 1015
            FLE+LQ+NLRKVIKGVGGL+HEQWRSFNNEK+T +AKNFLDGDLIESFLDL+R++M+EIS
Sbjct: 1013 FLERLQSNLRKVIKGVGGLSHEQWRSFNNEKRTAEAKNFLDGDLIESFLDLNRSKMEEIS 1072

Query: 1016 KTMNVSVEELCKRVEELTRLH 1036
            + M+VS EELCKRVEELTRLH
Sbjct: 1073 RAMSVSAEELCKRVEELTRLH 1093


>gi|297799958|ref|XP_002867863.1| hypothetical protein ARALYDRAFT_492777 [Arabidopsis lyrata subsp.
            lyrata]
 gi|297313699|gb|EFH44122.1| hypothetical protein ARALYDRAFT_492777 [Arabidopsis lyrata subsp.
            lyrata]
          Length = 1088

 Score = 1941 bits (5029), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 935/1036 (90%), Positives = 995/1036 (96%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            +LDVP+YGRIATLELFRPHGEAQDFLF+ATERYKFCVLQWD ESSELITRAMGDVSDRIG
Sbjct: 53   ILDVPLYGRIATLELFRPHGEAQDFLFVATERYKFCVLQWDYESSELITRAMGDVSDRIG 112

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC
Sbjct: 113  RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 172

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
             KPTI VLYQDNKDARHVKTYEV+LK+KDFVEGPWSQNNLDNGADLLIPVP PLCGVLII
Sbjct: 173  TKPTIAVLYQDNKDARHVKTYEVSLKEKDFVEGPWSQNNLDNGADLLIPVPSPLCGVLII 232

Query: 181  GEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL 240
            GEETIVYCSANAFKAIPIRPSITKAYGRVD DGSRYLLGDH+GL+HLLVITHEKEKVTGL
Sbjct: 233  GEETIVYCSANAFKAIPIRPSITKAYGRVDLDGSRYLLGDHSGLIHLLVITHEKEKVTGL 292

Query: 241  KIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIV 300
            KIELLGETSIAS+ISYLDNAVV++GSSYGDSQLIKLNLQPDA GSYVE+LE+YVNLGPIV
Sbjct: 293  KIELLGETSIASSISYLDNAVVFVGSSYGDSQLIKLNLQPDATGSYVEILEKYVNLGPIV 352

Query: 301  DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP 360
            DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL+SS D+ 
Sbjct: 353  DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLKSSIDEA 412

Query: 361  FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLV 420
            FDTFLVVSFISETRILAMN+EDELEETEIEGF SQ QTLFCHDA+YNQLVQVTS SVRLV
Sbjct: 413  FDTFLVVSFISETRILAMNIEDELEETEIEGFLSQVQTLFCHDAVYNQLVQVTSNSVRLV 472

Query: 421  SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEI 480
            SST+RELRN+W +P G++VNVATANASQVLLATGGGHLVYLEIGDG LTEVKH  LEYE+
Sbjct: 473  SSTTRELRNKWDAPAGFAVNVATANASQVLLATGGGHLVYLEIGDGTLTEVKHVLLEYEV 532

Query: 481  SCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEG 540
            SCLDINPIG+NP+YSQ+AAVG+WTDISVRIF LPDL LITKE LGGEIIPRSVLLCAFEG
Sbjct: 533  SCLDINPIGDNPNYSQLAAVGLWTDISVRIFVLPDLTLITKEQLGGEIIPRSVLLCAFEG 592

Query: 541  ISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV 600
            ISYLLCALGDGHLLNF L+  +G+L DRKKVSLGTQPITLRTFSSK+ THVFAASDRP V
Sbjct: 593  ISYLLCALGDGHLLNFQLDTSSGKLRDRKKVSLGTQPITLRTFSSKSATHVFAASDRPAV 652

Query: 601  IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG 660
            IYS+NKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIA+EGELTIGTIDDIQKLHIR+IP+G
Sbjct: 653  IYSNNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAREGELTIGTIDDIQKLHIRTIPIG 712

Query: 661  EHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSIL 720
            EH RRICHQEQ+RTFAIC L+NQ  AEESEMHFVRLLD Q+FEF+STYPLD FEYGCSIL
Sbjct: 713  EHARRICHQEQTRTFAICCLRNQPSAEESEMHFVRLLDAQSFEFLSTYPLDAFEYGCSIL 772

Query: 721  SCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFN 780
            SCSF+DD NVYYCVGTAYVLPEENEPTKGRILVFIVE+G+LQLI EKETKGAVYSLNAFN
Sbjct: 773  SCSFTDDKNVYYCVGTAYVLPEENEPTKGRILVFIVEEGRLQLITEKETKGAVYSLNAFN 832

Query: 781  GKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLL 840
            GKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLL
Sbjct: 833  GKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLL 892

Query: 841  IYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLE 900
            IYKHEEGAIEERARDYNANWM+AVEILDDDIYLGA+N FNLFTV+KN+EGATDEER R+E
Sbjct: 893  IYKHEEGAIEERARDYNANWMAAVEILDDDIYLGADNCFNLFTVKKNNEGATDEERARME 952

Query: 901  VVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKL 960
            VVGEYH+GEFVNRFRHGSLVMRLPDS++GQIPTVIFGTV+G+IGVIASLP EQY FLEKL
Sbjct: 953  VVGEYHIGEFVNRFRHGSLVMRLPDSEIGQIPTVIFGTVSGMIGVIASLPQEQYAFLEKL 1012

Query: 961  QTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV 1020
            QT+LRKVIKGVGGL+HEQWRSFNNEK+T +AK++LDGDLIESFLDLSR +M+EISK M+V
Sbjct: 1013 QTSLRKVIKGVGGLSHEQWRSFNNEKRTAEAKSYLDGDLIESFLDLSRGKMEEISKGMDV 1072

Query: 1021 SVEELCKRVEELTRLH 1036
             VEELCKRVEELTRLH
Sbjct: 1073 QVEELCKRVEELTRLH 1088


>gi|356512638|ref|XP_003525025.1| PREDICTED: DNA damage-binding protein 1a-like isoform 2 [Glycine max]
          Length = 1068

 Score = 1939 bits (5022), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 952/1037 (91%), Positives = 989/1037 (95%), Gaps = 22/1037 (2%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            MLDVPIYGRIATLELFRPHGEAQD+LFIATERYKFCVLQWD+E++EL+TRAMGDVSDRIG
Sbjct: 53   MLDVPIYGRIATLELFRPHGEAQDYLFIATERYKFCVLQWDSETAELVTRAMGDVSDRIG 112

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RPTDNGQ                     VIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC
Sbjct: 113  RPTDNGQ---------------------VIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 151

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
            +KPTIVVLYQDNKDARHVKTYEVALKDKDF+EGPWSQNNLDNGADLLIPVPPPLCGVLII
Sbjct: 152  SKPTIVVLYQDNKDARHVKTYEVALKDKDFLEGPWSQNNLDNGADLLIPVPPPLCGVLII 211

Query: 181  GEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL 240
            GEETIVYCSANAFKAIPIRPSITKAYGRVD DGSRYLLGDH GLL LLVITHEKEKVTGL
Sbjct: 212  GEETIVYCSANAFKAIPIRPSITKAYGRVDPDGSRYLLGDHTGLLSLLVITHEKEKVTGL 271

Query: 241  KIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIV 300
            KIE LGETSIASTISYLDNA VYIGSSYGDSQLIKLNLQPDAKGSYVE LERYVNLGPIV
Sbjct: 272  KIEPLGETSIASTISYLDNAFVYIGSSYGDSQLIKLNLQPDAKGSYVEGLERYVNLGPIV 331

Query: 301  DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP 360
            DFCVVDLERQGQGQVVTCSGAYKDGSLR+VRNGIGINEQASVELQGIKGMWSLRSSTDDP
Sbjct: 332  DFCVVDLERQGQGQVVTCSGAYKDGSLRVVRNGIGINEQASVELQGIKGMWSLRSSTDDP 391

Query: 361  FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLV 420
            FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQ QTLFCHDA++NQLVQVTS SVRLV
Sbjct: 392  FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQVQTLFCHDAVHNQLVQVTSSSVRLV 451

Query: 421  SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEI 480
            SST+RELRNEW +P GYSVNVATANA+QVLLATGGGHLVYLEIGDGIL EVKHAQLEYEI
Sbjct: 452  SSTTRELRNEWHAPSGYSVNVATANATQVLLATGGGHLVYLEIGDGILQEVKHAQLEYEI 511

Query: 481  SCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEG 540
            SCLDINPIGENP++S +AAVGMWTDISVRIFSLPDL+LITKE LGGEIIPRSVLLCAFEG
Sbjct: 512  SCLDINPIGENPNHSHLAAVGMWTDISVRIFSLPDLSLITKEQLGGEIIPRSVLLCAFEG 571

Query: 541  ISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV 600
            ISYLLCALGDGHLLNF+LN  TGEL DRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV
Sbjct: 572  ISYLLCALGDGHLLNFMLNTSTGELADRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV 631

Query: 601  IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG 660
            IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG
Sbjct: 632  IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG 691

Query: 661  EHPRRICHQEQSRTFAICSLK-NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSI 719
            EH RRICHQEQSRTFAICSLK N +  E+SEMHFVRLLDDQTFEFISTY LDT+EYGC I
Sbjct: 692  EHARRICHQEQSRTFAICSLKYNPASGEDSEMHFVRLLDDQTFEFISTYSLDTYEYGCFI 751

Query: 720  LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAF 779
            +SCSFSDD+NVYYCVGTAYVLPEENEPTKGRILVF VEDGKLQLIAEKETKGAVY LNAF
Sbjct: 752  ISCSFSDDNNVYYCVGTAYVLPEENEPTKGRILVFAVEDGKLQLIAEKETKGAVYCLNAF 811

Query: 780  NGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISL 839
            NGKLLAAINQKIQLYKW+LRDDGT ELQSECGHHGHILALYVQTRGDFIVVGDLMKSISL
Sbjct: 812  NGKLLAAINQKIQLYKWVLRDDGTHELQSECGHHGHILALYVQTRGDFIVVGDLMKSISL 871

Query: 840  LIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRL 899
            LIYKHEEGAIEERARDYNANWMSAVEI+DDDIYLGAEN+FNLFTVRKNSEGATDEERGRL
Sbjct: 872  LIYKHEEGAIEERARDYNANWMSAVEIVDDDIYLGAENSFNLFTVRKNSEGATDEERGRL 931

Query: 900  EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEK 959
            EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGT+NGVIGVIASLPHEQY+FLEK
Sbjct: 932  EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTINGVIGVIASLPHEQYVFLEK 991

Query: 960  LQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN 1019
            LQ+NLRKVIKGVGGL+HEQWRSFNNEKKTV+A+NFLDGDLIESFLDL+R++MDEISK ++
Sbjct: 992  LQSNLRKVIKGVGGLSHEQWRSFNNEKKTVEARNFLDGDLIESFLDLNRSKMDEISKALD 1051

Query: 1020 VSVEELCKRVEELTRLH 1036
            VSVEELCKRVEELTRLH
Sbjct: 1052 VSVEELCKRVEELTRLH 1068


>gi|356525403|ref|XP_003531314.1| PREDICTED: DNA damage-binding protein 1-like isoform 2 [Glycine max]
          Length = 1068

 Score = 1938 bits (5020), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 950/1037 (91%), Positives = 989/1037 (95%), Gaps = 22/1037 (2%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            MLDVPIYGRIATLELFRPHGEAQD+LFIATERYKFCVLQWD+E+ EL+TRAMGDVSDRIG
Sbjct: 53   MLDVPIYGRIATLELFRPHGEAQDYLFIATERYKFCVLQWDSETGELVTRAMGDVSDRIG 112

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RPTDNGQ                     VIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC
Sbjct: 113  RPTDNGQ---------------------VIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 151

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
            +KPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII
Sbjct: 152  SKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 211

Query: 181  GEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL 240
            GEETIVYCSANAFKAIPIRPSITKAYGRVD DGSRYLLGDH GL+ LLVI HEKEKVTGL
Sbjct: 212  GEETIVYCSANAFKAIPIRPSITKAYGRVDPDGSRYLLGDHTGLVSLLVIIHEKEKVTGL 271

Query: 241  KIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIV 300
            KIE LGETSIASTISYLDNA VY+GSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIV
Sbjct: 272  KIEPLGETSIASTISYLDNAFVYVGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIV 331

Query: 301  DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP 360
            DFCVVDLERQGQGQVVTCSGAYKDGSLR+VRNGIGINEQASVELQGIKGMWSLRSSTDDP
Sbjct: 332  DFCVVDLERQGQGQVVTCSGAYKDGSLRVVRNGIGINEQASVELQGIKGMWSLRSSTDDP 391

Query: 361  FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLV 420
            FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQ QTLFCHDA++NQLVQVTS SVRLV
Sbjct: 392  FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQVQTLFCHDAVHNQLVQVTSSSVRLV 451

Query: 421  SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEI 480
            SST+R+LRNEW +P GYSVNVATANA+QVLLATGGGHLVYLEIGDGIL EVKHAQLEYEI
Sbjct: 452  SSTTRDLRNEWHAPSGYSVNVATANATQVLLATGGGHLVYLEIGDGILQEVKHAQLEYEI 511

Query: 481  SCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEG 540
            SCLDINPIGENP++S +AAVGMWTDISVRIFSLPDL+LITKE LGGEIIPRSVLLCAFEG
Sbjct: 512  SCLDINPIGENPNHSNLAAVGMWTDISVRIFSLPDLSLITKEQLGGEIIPRSVLLCAFEG 571

Query: 541  ISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV 600
            ISYLLCALGDGHLLNF+LN  TGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV
Sbjct: 572  ISYLLCALGDGHLLNFMLNTSTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV 631

Query: 601  IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG 660
            IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG
Sbjct: 632  IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG 691

Query: 661  EHPRRICHQEQSRTFAICSLK-NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSI 719
            EH RRICHQEQSRTFAICSLK N +  E+SEMHFVRLLDDQTFEFISTY LDT+EYGC I
Sbjct: 692  EHARRICHQEQSRTFAICSLKYNPASGEDSEMHFVRLLDDQTFEFISTYSLDTYEYGCFI 751

Query: 720  LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAF 779
            +SCSFSDD+NVYYCVGTAYVLPEENEPTKGRI+VF VEDGKLQLIAEKETKGAVY LNAF
Sbjct: 752  ISCSFSDDNNVYYCVGTAYVLPEENEPTKGRIIVFAVEDGKLQLIAEKETKGAVYCLNAF 811

Query: 780  NGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISL 839
            NGKLLAAINQKIQLYKW+LRDDGT ELQSECGHHGHILALYVQTRGDFIVVGDLMKSISL
Sbjct: 812  NGKLLAAINQKIQLYKWVLRDDGTHELQSECGHHGHILALYVQTRGDFIVVGDLMKSISL 871

Query: 840  LIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRL 899
            LIYKHEEGAIEERARDYNANWMSAVEI+DDDIYLGAEN+FNLFTVRKNSEGATDEERGRL
Sbjct: 872  LIYKHEEGAIEERARDYNANWMSAVEIVDDDIYLGAENSFNLFTVRKNSEGATDEERGRL 931

Query: 900  EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEK 959
            EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGT+NGVIGVIASLPHEQY+FLEK
Sbjct: 932  EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTINGVIGVIASLPHEQYVFLEK 991

Query: 960  LQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN 1019
            LQ+NLRKVIKGVGGL+HEQWRSFNNEKKTV+A+NFLDGDLIESFLDL+R++MDEISK ++
Sbjct: 992  LQSNLRKVIKGVGGLSHEQWRSFNNEKKTVEARNFLDGDLIESFLDLNRSKMDEISKAVD 1051

Query: 1020 VSVEELCKRVEELTRLH 1036
            VSVEELCKRVEELTRLH
Sbjct: 1052 VSVEELCKRVEELTRLH 1068


>gi|357519461|ref|XP_003630019.1| DNA damage-binding protein [Medicago truncatula]
 gi|355524041|gb|AET04495.1| DNA damage-binding protein [Medicago truncatula]
          Length = 1171

 Score = 1936 bits (5016), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 960/1119 (85%), Positives = 1001/1119 (89%), Gaps = 83/1119 (7%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            +LDVP+YGRIATLELFRPHGE QDFLFIATERYKFCVLQWD E SEL+TR+MGDVSDRIG
Sbjct: 53   ILDVPLYGRIATLELFRPHGETQDFLFIATERYKFCVLQWDTEKSELVTRSMGDVSDRIG 112

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC
Sbjct: 113  RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 172

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
             KPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQN+LDNGADLLIPVPPPLCGVLII
Sbjct: 173  PKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNSLDNGADLLIPVPPPLCGVLII 232

Query: 181  GEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL 240
            GEETIVYCSAN FKAIPIR +ITKAYGRVD DGSRYLLGDH GLL LLVITHEKEKVTGL
Sbjct: 233  GEETIVYCSANGFKAIPIRAAITKAYGRVDPDGSRYLLGDHTGLLSLLVITHEKEKVTGL 292

Query: 241  KIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIV 300
            KIE LGETSIASTISYLDNA VYIGSSYGDSQLIKLNLQPD KGSYVEVLERYVNLGPIV
Sbjct: 293  KIEPLGETSIASTISYLDNAFVYIGSSYGDSQLIKLNLQPDPKGSYVEVLERYVNLGPIV 352

Query: 301  DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQAS------------------- 341
            DFCVVDLERQGQGQVVTCSGAYKDGSLR+VRNGIGINEQ S                   
Sbjct: 353  DFCVVDLERQGQGQVVTCSGAYKDGSLRVVRNGIGINEQVSPLFSCLSVGSFTQDTTKRK 412

Query: 342  -----------VELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIE 390
                       +ELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIE
Sbjct: 413  RFFFFLIINNTLELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIE 472

Query: 391  GFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQ-- 448
            GFCSQ QTLFCHDA++NQLVQVTS SVRLVSS++REL NEW +P  YSVNVATANA+Q  
Sbjct: 473  GFCSQVQTLFCHDAVHNQLVQVTSNSVRLVSSSTRELLNEWHAPSDYSVNVATANATQSQ 532

Query: 449  --------------------------------------------------VLLATGGGHL 458
                                                              VLLATGGGHL
Sbjct: 533  NHFIFQISSPSLSHGNIMICLSCKDVDNWSSTSSSLSLIYRVLFLLVAASVLLATGGGHL 592

Query: 459  VYLEIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNL 518
            VYLEIGDGIL EVKHAQLEYEISCLDINPIGENP+ SQ+AAVGMWTDISVR+FSLP+LNL
Sbjct: 593  VYLEIGDGILQEVKHAQLEYEISCLDINPIGENPNQSQLAAVGMWTDISVRLFSLPELNL 652

Query: 519  ITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPI 578
            ITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNF+LN  TGELTDRKKVSLGTQPI
Sbjct: 653  ITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFMLNTSTGELTDRKKVSLGTQPI 712

Query: 579  TLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKE 638
            TLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKE
Sbjct: 713  TLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKE 772

Query: 639  GELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLK-NQSCAEESEMHFVRLL 697
            GELTIGTIDDIQKLHIR+IPLGEH RRICHQEQ+RTFAICSLK N + AEESEMHFVRLL
Sbjct: 773  GELTIGTIDDIQKLHIRTIPLGEHARRICHQEQTRTFAICSLKYNSASAEESEMHFVRLL 832

Query: 698  DDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVE 757
            DDQTF+FIS YPLDT+EYGC I+SCSFSDD+NVYYCVGTAYVLPEENEPTKGRILVF VE
Sbjct: 833  DDQTFDFISVYPLDTYEYGCFIISCSFSDDNNVYYCVGTAYVLPEENEPTKGRILVFSVE 892

Query: 758  DGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHIL 817
            +GKLQL+AEKETKGAVY LNAFNGKLLAAINQKIQLYKW+LR+DGTRELQSECGHHGHIL
Sbjct: 893  EGKLQLVAEKETKGAVYCLNAFNGKLLAAINQKIQLYKWVLREDGTRELQSECGHHGHIL 952

Query: 818  ALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAEN 877
            ALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDD+YLGAEN
Sbjct: 953  ALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDVYLGAEN 1012

Query: 878  NFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFG 937
            +FNLFTVRKNSEGATDEERGRLEV GEYHLGEF+NRFRHGSLVMRLPDSDVGQIPTVIFG
Sbjct: 1013 SFNLFTVRKNSEGATDEERGRLEVAGEYHLGEFINRFRHGSLVMRLPDSDVGQIPTVIFG 1072

Query: 938  TVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDG 997
            T+NGVIGVIASLPHEQY+FLEKLQ+NLRKVIKGVGGL+HEQWRSFNNEKKTV+A+NFLDG
Sbjct: 1073 TINGVIGVIASLPHEQYVFLEKLQSNLRKVIKGVGGLSHEQWRSFNNEKKTVEARNFLDG 1132

Query: 998  DLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRLH 1036
            DLIESFLDL R++MDEISK M VSVEEL KRVEELTRLH
Sbjct: 1133 DLIESFLDLKRSKMDEISKAMEVSVEELAKRVEELTRLH 1171


>gi|297740793|emb|CBI30975.3| unnamed protein product [Vitis vinifera]
          Length = 1043

 Score = 1920 bits (4974), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 950/1037 (91%), Positives = 974/1037 (93%), Gaps = 47/1037 (4%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAE+SE+ITRAMGDVSDRIG
Sbjct: 53   MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAENSEVITRAMGDVSDRIG 112

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC
Sbjct: 113  RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 172

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
            +KPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPW+QNNLDNGADLLIPVPPPLCGVLII
Sbjct: 173  SKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWAQNNLDNGADLLIPVPPPLCGVLII 232

Query: 181  GEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL 240
            GEETIVYCSA+AFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL
Sbjct: 233  GEETIVYCSASAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL 292

Query: 241  KIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIV 300
            KIELLGETSIASTISYLDNA VY+GSSYGDSQLIK++LQPDAKGSYVEVLERYVNLGPIV
Sbjct: 293  KIELLGETSIASTISYLDNAFVYVGSSYGDSQLIKIHLQPDAKGSYVEVLERYVNLGPIV 352

Query: 301  DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP 360
            DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP
Sbjct: 353  DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP 412

Query: 361  FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLV 420
             DTFLVVSFISETRILAMNLEDELEETEIEGFCSQ QTLFCHDA+Y+QLVQVTS SVRLV
Sbjct: 413  HDTFLVVSFISETRILAMNLEDELEETEIEGFCSQVQTLFCHDAVYDQLVQVTSSSVRLV 472

Query: 421  SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEI 480
             STSRELRNEWK+P GYSVNVATANA+QVLLATGGGHLVYLEIGDG LTEVKHAQLEY+I
Sbjct: 473  GSTSRELRNEWKAPSGYSVNVATANATQVLLATGGGHLVYLEIGDGTLTEVKHAQLEYDI 532

Query: 481  SCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEG 540
            SCLDINPIGENP++SQ+AAVGMWTDISVRIFSLPDLNLITKE+LGGEIIPRSVLLC+FEG
Sbjct: 533  SCLDINPIGENPNFSQLAAVGMWTDISVRIFSLPDLNLITKEYLGGEIIPRSVLLCSFEG 592

Query: 541  ISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV 600
            I YLLCALGDGHLLNFLLNM TGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV
Sbjct: 593  IPYLLCALGDGHLLNFLLNMSTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV 652

Query: 601  IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG 660
            IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEG+LTIGTIDDIQKLHIRSIPLG
Sbjct: 653  IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGDLTIGTIDDIQKLHIRSIPLG 712

Query: 661  EHPRRICHQEQSRTFAICSLK-NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSI 719
            EH RRICHQEQSRTFAICSLK NQS  E+SEMHF+RLLDDQTFEFISTYPLDTFEYGCSI
Sbjct: 713  EHARRICHQEQSRTFAICSLKYNQSSTEDSEMHFIRLLDDQTFEFISTYPLDTFEYGCSI 772

Query: 720  LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAF 779
            LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAF
Sbjct: 773  LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAF 832

Query: 780  NGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISL 839
            NGKLLAAINQKIQLYKWMLRDDGTRELQSE GHHGHILALYVQTRGDFIVVGDLMKSISL
Sbjct: 833  NGKLLAAINQKIQLYKWMLRDDGTRELQSESGHHGHILALYVQTRGDFIVVGDLMKSISL 892

Query: 840  LIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRL 899
            LIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFN+FTVRKNSEGATDEERGRL
Sbjct: 893  LIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNIFTVRKNSEGATDEERGRL 952

Query: 900  EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEK 959
            EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPH+QY+FLEK
Sbjct: 953  EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHDQYVFLEK 1012

Query: 960  LQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN 1019
            LQ NLRKVIKG                                               M 
Sbjct: 1013 LQANLRKVIKG----------------------------------------------AMA 1026

Query: 1020 VSVEELCKRVEELTRLH 1036
            VSVEELCKRVEELTRLH
Sbjct: 1027 VSVEELCKRVEELTRLH 1043


>gi|15233515|ref|NP_193842.1| DNA damage-binding protein 1b [Arabidopsis thaliana]
 gi|73620956|sp|O49552.2|DDB1B_ARATH RecName: Full=DNA damage-binding protein 1b; AltName: Full=UV-damaged
            DNA-binding protein 1b; Short=DDB1b
 gi|110739453|dbj|BAF01636.1| UV-damaged DNA-binding protein- like [Arabidopsis thaliana]
 gi|332659001|gb|AEE84401.1| DNA damage-binding protein 1b [Arabidopsis thaliana]
          Length = 1088

 Score = 1920 bits (4973), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 925/1036 (89%), Positives = 989/1036 (95%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            +LDVP+YGRIAT+ELFRPHGEAQDFLF+ATERYKFCVLQWD ESSELITRAMGDVSDRIG
Sbjct: 53   ILDVPLYGRIATMELFRPHGEAQDFLFVATERYKFCVLQWDYESSELITRAMGDVSDRIG 112

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RPTDNGQIGIIDPDCR+IGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC
Sbjct: 113  RPTDNGQIGIIDPDCRVIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 172

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
             KPTI VLYQDNKDARHVKTYEV+LKDK+FVEGPWSQNNLDNGADLLIPVP PLCGVLII
Sbjct: 173  TKPTIAVLYQDNKDARHVKTYEVSLKDKNFVEGPWSQNNLDNGADLLIPVPSPLCGVLII 232

Query: 181  GEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL 240
            GEETIVYCSANAFKAIPIRPSITKAYGRVD DGSRYLLGDHAGL+HLLVITHEKEKVTGL
Sbjct: 233  GEETIVYCSANAFKAIPIRPSITKAYGRVDLDGSRYLLGDHAGLIHLLVITHEKEKVTGL 292

Query: 241  KIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIV 300
            KIELLGETSIAS+ISYLDNAVV++GSSYGDSQLIKLNLQPDAKGSYVE+LE+YVNLGPIV
Sbjct: 293  KIELLGETSIASSISYLDNAVVFVGSSYGDSQLIKLNLQPDAKGSYVEILEKYVNLGPIV 352

Query: 301  DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP 360
            DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL+SS D+ 
Sbjct: 353  DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLKSSIDEA 412

Query: 361  FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLV 420
            FDTFLVVSFISETRILAMN+EDELEETEIEGF S+ QTLFCHDA+YNQLVQVTS SVRLV
Sbjct: 413  FDTFLVVSFISETRILAMNIEDELEETEIEGFLSEVQTLFCHDAVYNQLVQVTSNSVRLV 472

Query: 421  SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEI 480
            SST+RELRN+W +P G+SVNVATANASQVLLATGGGHLVYLEIGDG LTEVKH  LEYE+
Sbjct: 473  SSTTRELRNKWDAPAGFSVNVATANASQVLLATGGGHLVYLEIGDGTLTEVKHVLLEYEV 532

Query: 481  SCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEG 540
            SCLDINPIG+NP+YSQ+AAVGMWTDISVRIF LPDL LITKE LGGEIIPRSVLLCAFEG
Sbjct: 533  SCLDINPIGDNPNYSQLAAVGMWTDISVRIFVLPDLTLITKEELGGEIIPRSVLLCAFEG 592

Query: 541  ISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV 600
            ISYLLCALGDGHLLNF L+   G+L DRKKVSLGT+PITLRTFSSK+ THVFAASDRP V
Sbjct: 593  ISYLLCALGDGHLLNFQLDTSCGKLRDRKKVSLGTRPITLRTFSSKSATHVFAASDRPAV 652

Query: 601  IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG 660
            IYS+NKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIA+EGELTIGTIDDIQKLHIR+IP+G
Sbjct: 653  IYSNNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAREGELTIGTIDDIQKLHIRTIPIG 712

Query: 661  EHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSIL 720
            EH RRICHQEQ+RTFAI  L+N+  AEESE HFVRLLD Q+FEF+S+YPLD FE GCSIL
Sbjct: 713  EHARRICHQEQTRTFAISCLRNEPSAEESESHFVRLLDAQSFEFLSSYPLDAFECGCSIL 772

Query: 721  SCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFN 780
            SCSF+DD NVYYCVGTAYVLPEENEPTKGRILVFIVE+G+LQLI EKETKGAVYSLNAFN
Sbjct: 773  SCSFTDDKNVYYCVGTAYVLPEENEPTKGRILVFIVEEGRLQLITEKETKGAVYSLNAFN 832

Query: 781  GKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLL 840
            GKLLA+INQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFI VGDLMKSISLL
Sbjct: 833  GKLLASINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIAVGDLMKSISLL 892

Query: 841  IYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLE 900
            IYKHEEGAIEERARDYNANWM+AVEIL+DDIYLG +N FN+FTV+KN+EGATDEER R+E
Sbjct: 893  IYKHEEGAIEERARDYNANWMTAVEILNDDIYLGTDNCFNIFTVKKNNEGATDEERARME 952

Query: 901  VVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKL 960
            VVGEYH+GEFVNRFRHGSLVM+LPDSD+GQIPTVIFGTV+G+IGVIASLP EQY FLEKL
Sbjct: 953  VVGEYHIGEFVNRFRHGSLVMKLPDSDIGQIPTVIFGTVSGMIGVIASLPQEQYAFLEKL 1012

Query: 961  QTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV 1020
            QT+LRKVIKGVGGL+HEQWRSFNNEK+T +AK +LDGDLIESFLDLSR +M+EISK M+V
Sbjct: 1013 QTSLRKVIKGVGGLSHEQWRSFNNEKRTAEAKGYLDGDLIESFLDLSRGKMEEISKGMDV 1072

Query: 1021 SVEELCKRVEELTRLH 1036
             VEELCKRVEELTRLH
Sbjct: 1073 QVEELCKRVEELTRLH 1088


>gi|62318656|dbj|BAD95136.1| UV-damaged DNA-binding protein- like [Arabidopsis thaliana]
          Length = 1088

 Score = 1919 bits (4971), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 924/1036 (89%), Positives = 989/1036 (95%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            +LDVP+YGRIAT+ELFRPHGEAQDFLF+ATERYKFCVLQWD ESSELITRAMGDVSDRIG
Sbjct: 53   ILDVPLYGRIATMELFRPHGEAQDFLFVATERYKFCVLQWDYESSELITRAMGDVSDRIG 112

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RPTDNGQIGIIDPDCR+IGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC
Sbjct: 113  RPTDNGQIGIIDPDCRVIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 172

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
             KPTI VLYQDNKDARHVKTYEV+LKDK+FVEGPWSQNNLDNGADLLIPVP PLCGVLII
Sbjct: 173  TKPTIAVLYQDNKDARHVKTYEVSLKDKNFVEGPWSQNNLDNGADLLIPVPSPLCGVLII 232

Query: 181  GEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL 240
            GEETIVYCSANAFKAIPIRPSITKAYGRVD DGSRYLLGDHAGL+HLLVITHEKEKVTGL
Sbjct: 233  GEETIVYCSANAFKAIPIRPSITKAYGRVDLDGSRYLLGDHAGLIHLLVITHEKEKVTGL 292

Query: 241  KIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIV 300
            KIELLGETSIAS+ISYLDNAVV++GSSYGDSQLIKLNLQPDAKGSYVE+LE+YVNLGPIV
Sbjct: 293  KIELLGETSIASSISYLDNAVVFVGSSYGDSQLIKLNLQPDAKGSYVEILEKYVNLGPIV 352

Query: 301  DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP 360
            DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL+SS D+ 
Sbjct: 353  DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLKSSIDEA 412

Query: 361  FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLV 420
            FDTFLVVSFISETRILAMN+EDELEETEIEGF S+ QTLFCHDA+YNQLVQVTS SVRLV
Sbjct: 413  FDTFLVVSFISETRILAMNIEDELEETEIEGFLSEVQTLFCHDAVYNQLVQVTSNSVRLV 472

Query: 421  SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEI 480
            SST+RELRN+W +P G+SVNVATANASQVLLATGGGHLVYLEIGDG LTEVKH  LEYE+
Sbjct: 473  SSTTRELRNKWDAPAGFSVNVATANASQVLLATGGGHLVYLEIGDGTLTEVKHVLLEYEV 532

Query: 481  SCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEG 540
            SCLDINPIG+NP+YSQ+AAVGMWTDISVRIF LPDL LITKE LGGEIIPRSVLLCAFEG
Sbjct: 533  SCLDINPIGDNPNYSQLAAVGMWTDISVRIFVLPDLTLITKEELGGEIIPRSVLLCAFEG 592

Query: 541  ISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV 600
            ISYLLCALGDGHLLNF L+   G+L DRKKVSLGT+PITLRTFSSK+ THVFAASDRP V
Sbjct: 593  ISYLLCALGDGHLLNFQLDTSCGKLRDRKKVSLGTRPITLRTFSSKSATHVFAASDRPAV 652

Query: 601  IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG 660
            IYS+NKKLLYSNV+LKEVSHMCPFNSAAFPDSLAIA+EGELTIGTIDDIQKLHIR+IP+G
Sbjct: 653  IYSNNKKLLYSNVSLKEVSHMCPFNSAAFPDSLAIAREGELTIGTIDDIQKLHIRTIPIG 712

Query: 661  EHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSIL 720
            EH RRICHQEQ+RTFAI  L+N+  AEESE HFVRLLD Q+FEF+S+YPLD FE GCSIL
Sbjct: 713  EHARRICHQEQTRTFAISCLRNEPSAEESESHFVRLLDAQSFEFLSSYPLDAFECGCSIL 772

Query: 721  SCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFN 780
            SCSF+DD NVYYCVGTAYVLPEENEPTKGRILVFIVE+G+LQLI EKETKGAVYSLNAFN
Sbjct: 773  SCSFTDDKNVYYCVGTAYVLPEENEPTKGRILVFIVEEGRLQLITEKETKGAVYSLNAFN 832

Query: 781  GKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLL 840
            GKLLA+INQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFI VGDLMKSISLL
Sbjct: 833  GKLLASINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIAVGDLMKSISLL 892

Query: 841  IYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLE 900
            IYKHEEGAIEERARDYNANWM+AVEIL+DDIYLG +N FN+FTV+KN+EGATDEER R+E
Sbjct: 893  IYKHEEGAIEERARDYNANWMAAVEILNDDIYLGTDNCFNIFTVKKNNEGATDEERARME 952

Query: 901  VVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKL 960
            VVGEYH+GEFVNRFRHGSLVM+LPDSD+GQIPTVIFGTV+G+IGVIASLP EQY FLEKL
Sbjct: 953  VVGEYHIGEFVNRFRHGSLVMKLPDSDIGQIPTVIFGTVSGMIGVIASLPQEQYAFLEKL 1012

Query: 961  QTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV 1020
            QT+LRKVIKGVGGL+HEQWRSFNNEK+T +AK +LDGDLIESFLDLSR +M+EISK M+V
Sbjct: 1013 QTSLRKVIKGVGGLSHEQWRSFNNEKRTAEAKGYLDGDLIESFLDLSRGKMEEISKGMDV 1072

Query: 1021 SVEELCKRVEELTRLH 1036
             VEELCKRVEELTRLH
Sbjct: 1073 QVEELCKRVEELTRLH 1088


>gi|186511557|ref|NP_001118940.1| DNA damage-binding protein 1a [Arabidopsis thaliana]
 gi|332657118|gb|AEE82518.1| DNA damage-binding protein 1a [Arabidopsis thaliana]
          Length = 1067

 Score = 1919 bits (4970), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 931/1036 (89%), Positives = 984/1036 (94%), Gaps = 21/1036 (2%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWD ESSELITRAMGDVSDRIG
Sbjct: 53   MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDPESSELITRAMGDVSDRIG 112

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RPTDNGQ                     VIPFDNKGQLKEAFNIRLEELQVLDIKFL+GC
Sbjct: 113  RPTDNGQ---------------------VIPFDNKGQLKEAFNIRLEELQVLDIKFLFGC 151

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
            AKPTI VLYQDNKDARHVKTYEV+LKDKDFVEGPWSQN+LDNGADLLIPVPPPLCGVLII
Sbjct: 152  AKPTIAVLYQDNKDARHVKTYEVSLKDKDFVEGPWSQNSLDNGADLLIPVPPPLCGVLII 211

Query: 181  GEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL 240
            GEETIVYCSA+AFKAIPIRPSITKAYGRVD DGSRYLLGDHAG++HLLVITHEKEKVTGL
Sbjct: 212  GEETIVYCSASAFKAIPIRPSITKAYGRVDVDGSRYLLGDHAGMIHLLVITHEKEKVTGL 271

Query: 241  KIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIV 300
            KIELLGETSIASTISYLDNAVV++GSSYGDSQL+KLNL PDAKGSYVEVLERY+NLGPIV
Sbjct: 272  KIELLGETSIASTISYLDNAVVFVGSSYGDSQLVKLNLHPDAKGSYVEVLERYINLGPIV 331

Query: 301  DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP 360
            DFCVVDLERQGQGQVVTCSGA+KDGSLR+VRNGIGINEQASVELQGIKGMWSL+SS D+ 
Sbjct: 332  DFCVVDLERQGQGQVVTCSGAFKDGSLRVVRNGIGINEQASVELQGIKGMWSLKSSIDEA 391

Query: 361  FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLV 420
            FDTFLVVSFISETRILAMNLEDELEETEIEGF SQ QTLFCHDA+YNQLVQVTS SVRLV
Sbjct: 392  FDTFLVVSFISETRILAMNLEDELEETEIEGFLSQVQTLFCHDAVYNQLVQVTSNSVRLV 451

Query: 421  SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEI 480
            SST+RELR+EW +P G++VNVATANASQVLLATGGGHLVYLEIGDG LTEV+HA LEYE+
Sbjct: 452  SSTTRELRDEWHAPAGFTVNVATANASQVLLATGGGHLVYLEIGDGKLTEVQHALLEYEV 511

Query: 481  SCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEG 540
            SCLDINPIG+NP+YSQ+AAVGMWTDISVRIFSLP+L LITKE LGGEIIPRSVLLCAFEG
Sbjct: 512  SCLDINPIGDNPNYSQLAAVGMWTDISVRIFSLPELTLITKEQLGGEIIPRSVLLCAFEG 571

Query: 541  ISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV 600
            ISYLLCALGDGHLLNF ++  TG+L DRKKVSLGTQPITLRTFSSK+ THVFAASDRPTV
Sbjct: 572  ISYLLCALGDGHLLNFQMDTTTGQLKDRKKVSLGTQPITLRTFSSKSATHVFAASDRPTV 631

Query: 601  IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG 660
            IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIA+EGELTIGTIDDIQKLHIR+IPLG
Sbjct: 632  IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAREGELTIGTIDDIQKLHIRTIPLG 691

Query: 661  EHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSIL 720
            EH RRICHQEQ+RTF ICSL NQS +EESEMHFVRLLDDQTFEF+STYPLD+FEYGCSIL
Sbjct: 692  EHARRICHQEQTRTFGICSLGNQSNSEESEMHFVRLLDDQTFEFMSTYPLDSFEYGCSIL 751

Query: 721  SCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFN 780
            SCSF++D NVYYCVGTAYVLPEENEPTKGRILVFIVEDG+LQLIAEKETKGAVYSLNAFN
Sbjct: 752  SCSFTEDKNVYYCVGTAYVLPEENEPTKGRILVFIVEDGRLQLIAEKETKGAVYSLNAFN 811

Query: 781  GKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLL 840
            GKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLL
Sbjct: 812  GKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLL 871

Query: 841  IYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLE 900
            +YKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNL TV+KNSEGATDEERGRLE
Sbjct: 872  LYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLLTVKKNSEGATDEERGRLE 931

Query: 901  VVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKL 960
            VVGEYHLGEFVNRFRHGSLVMRLPDS++GQIPTVIFGTVNGVIGVIASLP EQY FLEKL
Sbjct: 932  VVGEYHLGEFVNRFRHGSLVMRLPDSEIGQIPTVIFGTVNGVIGVIASLPQEQYTFLEKL 991

Query: 961  QTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV 1020
            Q++LRKVIKGVGGL+HEQWRSFNNEK+T +A+NFLDGDLIESFLDLSR +M++ISK+MNV
Sbjct: 992  QSSLRKVIKGVGGLSHEQWRSFNNEKRTAEARNFLDGDLIESFLDLSRNKMEDISKSMNV 1051

Query: 1021 SVEELCKRVEELTRLH 1036
             VEELCKRVEELTRLH
Sbjct: 1052 QVEELCKRVEELTRLH 1067


>gi|2911067|emb|CAA17529.1| UV-damaged DNA-binding protein-like [Arabidopsis thaliana]
 gi|7268907|emb|CAB79110.1| UV-damaged DNA-binding protein-like [Arabidopsis thaliana]
          Length = 1102

 Score = 1856 bits (4807), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 904/1050 (86%), Positives = 972/1050 (92%), Gaps = 14/1050 (1%)

Query: 1    MLDVPIYGRIATLELFRPHGEA----QDFLFIATERYKFCVLQWDAESS----------E 46
            +LDVP+YGRIAT+ELFRPH         FL +  +   FC+LQ    +S           
Sbjct: 53   ILDVPLYGRIATMELFRPHVSIVFLLNTFLCLRVKHKTFCLLQLKDINSVFFNGIMSPLS 112

Query: 47   LITRAMGDVSDRIGRPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRL 106
            L+  AMGDVSDRIGRPTDNGQIGIIDPDCR+IGLHLYDGLFKVIPFDNKGQLKEAFNIRL
Sbjct: 113  LLQGAMGDVSDRIGRPTDNGQIGIIDPDCRVIGLHLYDGLFKVIPFDNKGQLKEAFNIRL 172

Query: 107  EELQVLDIKFLYGCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADL 166
            EELQVLDIKFLYGC KPTI VLYQDNKDARHVKTYEV+LKDK+FVEGPWSQNNLDNGADL
Sbjct: 173  EELQVLDIKFLYGCTKPTIAVLYQDNKDARHVKTYEVSLKDKNFVEGPWSQNNLDNGADL 232

Query: 167  LIPVPPPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLH 226
            LIPVP PLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVD DGSRYLLGDHAGL+H
Sbjct: 233  LIPVPSPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDLDGSRYLLGDHAGLIH 292

Query: 227  LLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSY 286
            LLVITHEKEKVTGLKIELLGETSIAS+ISYLDNAVV++GSSYGDSQLIKLNLQPDAKGSY
Sbjct: 293  LLVITHEKEKVTGLKIELLGETSIASSISYLDNAVVFVGSSYGDSQLIKLNLQPDAKGSY 352

Query: 287  VEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG 346
            VE+LE+YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG
Sbjct: 353  VEILEKYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG 412

Query: 347  IKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIY 406
            IKGMWSL+SS D+ FDTFLVVSFISETRILAMN+EDELEETEIEGF S+ QTLFCHDA+Y
Sbjct: 413  IKGMWSLKSSIDEAFDTFLVVSFISETRILAMNIEDELEETEIEGFLSEVQTLFCHDAVY 472

Query: 407  NQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDG 466
            NQLVQVTS SVRLVSST+RELRN+W +P G+SVNVATANASQVLLATGGGHLVYLEIGDG
Sbjct: 473  NQLVQVTSNSVRLVSSTTRELRNKWDAPAGFSVNVATANASQVLLATGGGHLVYLEIGDG 532

Query: 467  ILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGG 526
             LTEVKH  LEYE+SCLDINPIG+NP+YSQ+AAVGMWTDISVRIF LPDL LITKE LGG
Sbjct: 533  TLTEVKHVLLEYEVSCLDINPIGDNPNYSQLAAVGMWTDISVRIFVLPDLTLITKEELGG 592

Query: 527  EIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSK 586
            EIIPRSVLLCAFEGISYLLCALGDGHLLNF L+   G+L DRKKVSLGT+PITLRTFSSK
Sbjct: 593  EIIPRSVLLCAFEGISYLLCALGDGHLLNFQLDTSCGKLRDRKKVSLGTRPITLRTFSSK 652

Query: 587  NTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTI 646
            + THVFAASDRP VIYS+NKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIA+EGELTIGTI
Sbjct: 653  SATHVFAASDRPAVIYSNNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAREGELTIGTI 712

Query: 647  DDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFIS 706
            DDIQKLHIR+IP+GEH RRICHQEQ+RTFAI  L+N+  AEESE HFVRLLD Q+FEF+S
Sbjct: 713  DDIQKLHIRTIPIGEHARRICHQEQTRTFAISCLRNEPSAEESESHFVRLLDAQSFEFLS 772

Query: 707  TYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAE 766
            +YPLD FE GCSILSCSF+DD NVYYCVGTAYVLPEENEPTKGRILVFIVE+G+LQLI E
Sbjct: 773  SYPLDAFECGCSILSCSFTDDKNVYYCVGTAYVLPEENEPTKGRILVFIVEEGRLQLITE 832

Query: 767  KETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGD 826
            KETKGAVYSLNAFNGKLLA+INQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGD
Sbjct: 833  KETKGAVYSLNAFNGKLLASINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGD 892

Query: 827  FIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRK 886
            FI VGDLMKSISLLIYKHEEGAIEERARDYNANWM+AVEIL+DDIYLG +N FN+FTV+K
Sbjct: 893  FIAVGDLMKSISLLIYKHEEGAIEERARDYNANWMTAVEILNDDIYLGTDNCFNIFTVKK 952

Query: 887  NSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVI 946
            N+EGATDEER R+EVVGEYH+GEFVNRFRHGSLVM+LPDSD+GQIPTVIFGTV+G+IGVI
Sbjct: 953  NNEGATDEERARMEVVGEYHIGEFVNRFRHGSLVMKLPDSDIGQIPTVIFGTVSGMIGVI 1012

Query: 947  ASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDL 1006
            ASLP EQY FLEKLQT+LRKVIKGVGGL+HEQWRSFNNEK+T +AK +LDGDLIESFLDL
Sbjct: 1013 ASLPQEQYAFLEKLQTSLRKVIKGVGGLSHEQWRSFNNEKRTAEAKGYLDGDLIESFLDL 1072

Query: 1007 SRTRMDEISKTMNVSVEELCKRVEELTRLH 1036
            SR +M+EISK M+V VEELCKRVEELTRLH
Sbjct: 1073 SRGKMEEISKGMDVQVEELCKRVEELTRLH 1102


>gi|115465791|ref|NP_001056495.1| Os05g0592400 [Oryza sativa Japonica Group]
 gi|48475231|gb|AAT44300.1| putative DNA damage binding protein 1 [Oryza sativa Japonica Group]
 gi|113580046|dbj|BAF18409.1| Os05g0592400 [Oryza sativa Japonica Group]
 gi|215694552|dbj|BAG89545.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222632766|gb|EEE64898.1| hypothetical protein OsJ_19757 [Oryza sativa Japonica Group]
          Length = 1090

 Score = 1815 bits (4701), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 877/1038 (84%), Positives = 959/1038 (92%), Gaps = 2/1038 (0%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            M+DVPIYGRIATLELFRPH E QDFLFIATERYKFCVLQWD E SEL+TRAMGDVSDRIG
Sbjct: 53   MIDVPIYGRIATLELFRPHNETQDFLFIATERYKFCVLQWDGEKSELLTRAMGDVSDRIG 112

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC
Sbjct: 113  RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 172

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
             KPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGA LLIPVP PL GV+II
Sbjct: 173  VKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGAGLLIPVPAPLGGVIII 232

Query: 181  GEETIVYCSANA-FKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTG 239
            GEETIVYC+AN+ F+AIPI+ SI +AYGRVD DGSRYLLGD+AG+LHLLV+THE+E+VTG
Sbjct: 233  GEETIVYCNANSTFRAIPIKQSIIRAYGRVDPDGSRYLLGDNAGILHLLVLTHERERVTG 292

Query: 240  LKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPI 299
            LKIE LGETSIAS+ISYLDN VVY+GS +GDSQL+KLNLQ D  GSYVEVLERYVNLGPI
Sbjct: 293  LKIEYLGETSIASSISYLDNGVVYVGSRFGDSQLVKLNLQADPNGSYVEVLERYVNLGPI 352

Query: 300  VDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDD 359
            VDFCVVDL+RQGQGQVVTCSGA+KDGSLR+VRNGIGINEQASVELQGIKG+WSL+SS +D
Sbjct: 353  VDFCVVDLDRQGQGQVVTCSGAFKDGSLRVVRNGIGINEQASVELQGIKGLWSLKSSFND 412

Query: 360  PFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRL 419
            P+D +LVVSFISETR LAMN+EDELEETEIEGF +QTQTLFC +AI + L+QVT+ SVRL
Sbjct: 413  PYDMYLVVSFISETRFLAMNMEDELEETEIEGFDAQTQTLFCQNAINDLLIQVTANSVRL 472

Query: 420  VSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYE 479
            VS TSREL ++W +P G+SVNVA+ANASQVLLATGGGHLVYLEI D  L EVKH QLE+E
Sbjct: 473  VSCTSRELVDQWNAPEGFSVNVASANASQVLLATGGGHLVYLEIKDSKLVEVKHIQLEHE 532

Query: 480  ISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFE 539
            ISC+D+NPIGENP YS +AAVGMWTDISVRI SLPDL LI KE+LGGEI+PRSVLLC  E
Sbjct: 533  ISCVDLNPIGENPQYSSLAAVGMWTDISVRILSLPDLELIRKENLGGEIVPRSVLLCTLE 592

Query: 540  GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPT 599
            G+SYLLCALGDGHL +FLLN  TGELTDRKKVSLGTQPI+LRTFSSK TTHVFA+SDRPT
Sbjct: 593  GVSYLLCALGDGHLFSFLLNASTGELTDRKKVSLGTQPISLRTFSSKGTTHVFASSDRPT 652

Query: 600  VIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPL 659
            VIYSSNKKLLYSNVNLKEV+HMCPFN+AA PDSLAIAKEGEL+IGTIDDIQKLHIR+IPL
Sbjct: 653  VIYSSNKKLLYSNVNLKEVNHMCPFNTAAIPDSLAIAKEGELSIGTIDDIQKLHIRTIPL 712

Query: 660  GEHPRRICHQEQSRTFAICSLK-NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCS 718
             E  RRICHQEQSRT A CS K NQ+  EESE HFVRLLD QTFEF+S Y LD +E+GCS
Sbjct: 713  NEQARRICHQEQSRTLAFCSFKHNQTSIEESETHFVRLLDHQTFEFLSIYQLDQYEHGCS 772

Query: 719  ILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNA 778
            I+SCSFSDD+NVYYCVGTAYVLPEENEP+KGRILVF VEDG+LQLI EKETKGAVYSLNA
Sbjct: 773  IISCSFSDDNNVYYCVGTAYVLPEENEPSKGRILVFAVEDGRLQLIVEKETKGAVYSLNA 832

Query: 779  FNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSIS 838
            FNGKLLAAINQKIQLYKWMLR+DG+ ELQSECGHHGHILALY QTRGDFIVVGDLMKSIS
Sbjct: 833  FNGKLLAAINQKIQLYKWMLREDGSHELQSECGHHGHILALYTQTRGDFIVVGDLMKSIS 892

Query: 839  LLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGR 898
            LL+YKHEE AIEE ARDYNANWMSAVE+LDD+IY+GAENN+N+FTVRKNS+ ATDEERGR
Sbjct: 893  LLVYKHEESAIEELARDYNANWMSAVEMLDDEIYIGAENNYNIFTVRKNSDAATDEERGR 952

Query: 899  LEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLE 958
            LEVVGEYHLGEFVNR RHGSLVMRLPDS++GQIPTVIFGT+NGVIG+IASLPHEQY+FLE
Sbjct: 953  LEVVGEYHLGEFVNRLRHGSLVMRLPDSEMGQIPTVIFGTINGVIGIIASLPHEQYVFLE 1012

Query: 959  KLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTM 1018
            KLQ+ L K IKGVG L+HEQWRSF+N+KKT +A+NFLDGDLIESFLDLSR +M+E++K M
Sbjct: 1013 KLQSTLVKFIKGVGNLSHEQWRSFHNDKKTSEARNFLDGDLIESFLDLSRNKMEEVAKGM 1072

Query: 1019 NVSVEELCKRVEELTRLH 1036
             V VEEL KRVEELTRLH
Sbjct: 1073 GVPVEELSKRVEELTRLH 1090


>gi|413946716|gb|AFW79365.1| hypothetical protein ZEAMMB73_562969 [Zea mays]
          Length = 1089

 Score = 1814 bits (4698), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 872/1037 (84%), Positives = 956/1037 (92%), Gaps = 1/1037 (0%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            MLDVPIYGRIAT+ELFRPH E QDFLFIATERYKFCVLQWDAE SELITRAMGDVSDRIG
Sbjct: 53   MLDVPIYGRIATIELFRPHNETQDFLFIATERYKFCVLQWDAEKSELITRAMGDVSDRIG 112

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL+GC
Sbjct: 113  RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLHGC 172

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
            AKPTIVVLYQDNKD RHVKTYEVALKDKDFVEGPWSQNN+DNGA LLIPVP PL GV+II
Sbjct: 173  AKPTIVVLYQDNKDVRHVKTYEVALKDKDFVEGPWSQNNVDNGAGLLIPVPAPLGGVIII 232

Query: 181  GEETIVYCSANA-FKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTG 239
            GEE IVYC+AN+ FKAIPI+ SI +AYGRVD DGSRYLLGD+ G+LHLLV+THE+E+VTG
Sbjct: 233  GEEQIVYCNANSTFKAIPIKQSIIRAYGRVDPDGSRYLLGDNTGILHLLVLTHERERVTG 292

Query: 240  LKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPI 299
            LKIE LGETSIAS+ISYLDN VVY+GS +GDSQL+KLNLQ DA GS+VE+LERYVNLGPI
Sbjct: 293  LKIEYLGETSIASSISYLDNGVVYVGSRFGDSQLVKLNLQADASGSFVEILERYVNLGPI 352

Query: 300  VDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDD 359
            VDFCVVDL+RQGQGQVVTCSGA+KDGSLR+VRNGIGINEQASVELQGIKG+WSL+SS +D
Sbjct: 353  VDFCVVDLDRQGQGQVVTCSGAFKDGSLRVVRNGIGINEQASVELQGIKGLWSLKSSIND 412

Query: 360  PFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRL 419
            PFD +LVVSFISETR LAMN+EDELEETEIEGF +QTQTLFC +AI + L+QVT+ SVRL
Sbjct: 413  PFDMYLVVSFISETRFLAMNMEDELEETEIEGFDAQTQTLFCQNAINDLLIQVTANSVRL 472

Query: 420  VSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYE 479
            VS TSREL N+W +P G+SVNVA+ANASQVLLATGGGHLVYLEI D  L EVKHAQLE+E
Sbjct: 473  VSCTSRELVNQWNAPAGFSVNVASANASQVLLATGGGHLVYLEIRDAKLVEVKHAQLEHE 532

Query: 480  ISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFE 539
            ISCLD+NPIGENP YS +AAVGMWTDISVRIFSLPDL LI KE+LGGEI+PRSVLLC  E
Sbjct: 533  ISCLDLNPIGENPQYSSLAAVGMWTDISVRIFSLPDLELIRKENLGGEIVPRSVLLCTLE 592

Query: 540  GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPT 599
            G+SYLLCALGDG+L +FLLN  TGELTDRKKV+LGTQPI+LRTFSSK TTHVFA+SDRPT
Sbjct: 593  GVSYLLCALGDGNLFSFLLNASTGELTDRKKVTLGTQPISLRTFSSKGTTHVFASSDRPT 652

Query: 600  VIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPL 659
            VIYSSNKKLLYSNVNLKEV+HMCPFN+AAFPDSLAIAKEGEL+IGTIDDIQKLHIR+IPL
Sbjct: 653  VIYSSNKKLLYSNVNLKEVNHMCPFNTAAFPDSLAIAKEGELSIGTIDDIQKLHIRTIPL 712

Query: 660  GEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSI 719
             E  RRICHQEQSRT A CS K     EESE H +RLLD QTFE +  YPLD +E GCSI
Sbjct: 713  NEQARRICHQEQSRTLAFCSFKYNQSVEESETHLIRLLDHQTFESLCVYPLDQYECGCSI 772

Query: 720  LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAF 779
            +SCSF+DDSNVYYCVGTAYV+PEENEPTKGRILVF VEDG LQLI EKETKGAVYSLNAF
Sbjct: 773  ISCSFADDSNVYYCVGTAYVIPEENEPTKGRILVFAVEDGSLQLIVEKETKGAVYSLNAF 832

Query: 780  NGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISL 839
            NGKLLAAINQKIQLYKWM R+DG+ ELQSECGHHGHILALY QTRGDFIVVGDLMKSISL
Sbjct: 833  NGKLLAAINQKIQLYKWMSREDGSHELQSECGHHGHILALYTQTRGDFIVVGDLMKSISL 892

Query: 840  LIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRL 899
            L+YKHEE AIEERARDYNANWM+AVE+LDD++Y+GAEN++NLFTVRKNS+ ATD+ER RL
Sbjct: 893  LVYKHEESAIEERARDYNANWMTAVEMLDDEVYVGAENSYNLFTVRKNSDAATDDERARL 952

Query: 900  EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEK 959
            EVVGEYHLGEFVNRFRHGSLVMRLPDSD+GQIPTVIFGT+NGVIG+IASLPH+QY+FLEK
Sbjct: 953  EVVGEYHLGEFVNRFRHGSLVMRLPDSDIGQIPTVIFGTINGVIGIIASLPHDQYIFLEK 1012

Query: 960  LQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN 1019
            LQ+ L K IKGVG L+HEQWRSF+N+KKT +A+NFLDGDLIESFLDLSR++M+E+SK M 
Sbjct: 1013 LQSTLVKYIKGVGNLSHEQWRSFHNDKKTAEARNFLDGDLIESFLDLSRSKMEEVSKAMG 1072

Query: 1020 VSVEELCKRVEELTRLH 1036
            V VEEL KRVEELTRLH
Sbjct: 1073 VPVEELSKRVEELTRLH 1089


>gi|12082087|dbj|BAB20761.1| UV-damaged DNA binding protein [Oryza sativa Japonica Group]
          Length = 1090

 Score = 1808 bits (4683), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 875/1038 (84%), Positives = 957/1038 (92%), Gaps = 2/1038 (0%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            M+DVPIYGRIATLELFRPH E QDFLFIATERYKFCVLQWD E SEL+TRAMGDVSDRIG
Sbjct: 53   MIDVPIYGRIATLELFRPHNETQDFLFIATERYKFCVLQWDGEKSELLTRAMGDVSDRIG 112

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC
Sbjct: 113  RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 172

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
             KPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGA LLIPVP PL GV+II
Sbjct: 173  VKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGAGLLIPVPAPLGGVIII 232

Query: 181  GEETIVYCSANA-FKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTG 239
            GEETIVYC+AN+ F+AIPI+ SI +AYGRVD DGSRYLLGD+AG+LHLLV+THE+E+VTG
Sbjct: 233  GEETIVYCNANSTFRAIPIKQSIIRAYGRVDPDGSRYLLGDNAGILHLLVLTHERERVTG 292

Query: 240  LKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPI 299
            LKIE LGETSIAS+ISYLDN VVY+GS +GDSQL+KLNLQ D  GSYVEVLERYVNLGPI
Sbjct: 293  LKIEYLGETSIASSISYLDNGVVYVGSRFGDSQLVKLNLQADPNGSYVEVLERYVNLGPI 352

Query: 300  VDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDD 359
            VDFCVVDL+RQGQGQVVTCSGA+KDGSLR+VRNGIGINEQASVELQGIKG+WSL+SS +D
Sbjct: 353  VDFCVVDLDRQGQGQVVTCSGAFKDGSLRVVRNGIGINEQASVELQGIKGLWSLKSSFND 412

Query: 360  PFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRL 419
            P+D +LVVSFISETR LAMN+EDELEETEIEGF +QTQTLFC +AI + L+QVT+ SVRL
Sbjct: 413  PYDMYLVVSFISETRFLAMNMEDELEETEIEGFDAQTQTLFCQNAINDLLIQVTANSVRL 472

Query: 420  VSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYE 479
            VS TSREL ++W +P G+SVNVA+ANASQVLLATGGGHLVYLEI D  L EVKH QLE+E
Sbjct: 473  VSCTSRELVDQWNAPEGFSVNVASANASQVLLATGGGHLVYLEIKDSKLVEVKHIQLEHE 532

Query: 480  ISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFE 539
            ISC+D+NPIGENP YS +AAVGMWTDISVRI S PDL LI KE+LGGEI+PRSVLLC  E
Sbjct: 533  ISCVDLNPIGENPQYSSLAAVGMWTDISVRILSFPDLELIRKENLGGEIVPRSVLLCTLE 592

Query: 540  GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPT 599
            G+SYLLCALGDGHL +FLLN  TGELTDRKKVSLGTQPI+LRTFSSK TTHVFA+SDRPT
Sbjct: 593  GVSYLLCALGDGHLFSFLLNASTGELTDRKKVSLGTQPISLRTFSSKGTTHVFASSDRPT 652

Query: 600  VIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPL 659
            VIYSSNKKLLYSNVNLKEV+HMCPFN+AA PDSLAIAKEGEL+IGTIDDIQKLHIR+IPL
Sbjct: 653  VIYSSNKKLLYSNVNLKEVNHMCPFNTAAIPDSLAIAKEGELSIGTIDDIQKLHIRTIPL 712

Query: 660  GEHPRRICHQEQSRTFAICSLK-NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCS 718
                RRICHQEQSRT A CS K NQ+  EESE HFVRLLD QTFEF+S Y LD +E+GCS
Sbjct: 713  DHQTRRICHQEQSRTLAFCSFKHNQTSIEESETHFVRLLDHQTFEFLSIYQLDQYEHGCS 772

Query: 719  ILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNA 778
            I+SCSFSDD+NVYYCVGTAYVLPEENEP+KGRILVF VEDG+LQLI EKETKGAVYSLNA
Sbjct: 773  IISCSFSDDNNVYYCVGTAYVLPEENEPSKGRILVFAVEDGRLQLIVEKETKGAVYSLNA 832

Query: 779  FNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSIS 838
            FNGKLLAAINQKIQLYKWMLR+DG+ ELQSECGHHGHILALY QTRGDFIVVGDLMKSIS
Sbjct: 833  FNGKLLAAINQKIQLYKWMLREDGSHELQSECGHHGHILALYTQTRGDFIVVGDLMKSIS 892

Query: 839  LLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGR 898
            LL+YKHEE AIEE ARDYNANWMSAVE+LDD+IY+GAENN+N+FTVRKNS+ ATDEERGR
Sbjct: 893  LLVYKHEESAIEELARDYNANWMSAVEMLDDEIYIGAENNYNIFTVRKNSDAATDEERGR 952

Query: 899  LEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLE 958
            LEVVGEYHLGEF NRFRHGSLVMRLPDS++GQIPTVIFGT+NGVIG+IASLPHEQY+FLE
Sbjct: 953  LEVVGEYHLGEFGNRFRHGSLVMRLPDSEMGQIPTVIFGTINGVIGIIASLPHEQYVFLE 1012

Query: 959  KLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTM 1018
            KLQ+ L K IKGVG L+HEQWRSF+N+KKT +A+NFLDGDLIESFLDLSR +M+E++K M
Sbjct: 1013 KLQSTLVKFIKGVGNLSHEQWRSFHNDKKTSEARNFLDGDLIESFLDLSRNKMEEVAKGM 1072

Query: 1019 NVSVEELCKRVEELTRLH 1036
             V VEEL KRVEELTRLH
Sbjct: 1073 GVPVEELSKRVEELTRLH 1090


>gi|255571318|ref|XP_002526608.1| DNA repair protein xp-E, putative [Ricinus communis]
 gi|223534048|gb|EEF35767.1| DNA repair protein xp-E, putative [Ricinus communis]
          Length = 1033

 Score = 1796 bits (4651), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 906/1036 (87%), Positives = 939/1036 (90%), Gaps = 55/1036 (5%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            MLDVPIYGRIATLELF                                            
Sbjct: 53   MLDVPIYGRIATLELF-------------------------------------------- 68

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RP    Q      D   I    Y   F V+ +D +         RLEELQVLDIKFLYGC
Sbjct: 69   RPHGEAQ------DFLFIATERYK--FCVLQWDAE---TSELITRLEELQVLDIKFLYGC 117

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
            +KPTIVVLYQDNKDARHVKTYEVALKDKDF EGPW+QNNLDNGADLLIPVPPPLCGVLII
Sbjct: 118  SKPTIVVLYQDNKDARHVKTYEVALKDKDFGEGPWAQNNLDNGADLLIPVPPPLCGVLII 177

Query: 181  GEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL 240
            GEETIVYCSANAFKAIPIRPSIT+AYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL
Sbjct: 178  GEETIVYCSANAFKAIPIRPSITRAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL 237

Query: 241  KIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIV 300
            KIELLGETSIASTISYLDNAVVYIGSSYGDSQL+KLNLQPDAKGSYVEVLE YVNLGPIV
Sbjct: 238  KIELLGETSIASTISYLDNAVVYIGSSYGDSQLVKLNLQPDAKGSYVEVLESYVNLGPIV 297

Query: 301  DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP 360
            DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP
Sbjct: 298  DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP 357

Query: 361  FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLV 420
            FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQ QTLFCH A+YNQLVQVTS SVRLV
Sbjct: 358  FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQVQTLFCHYAVYNQLVQVTSSSVRLV 417

Query: 421  SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEI 480
            SST+REL+NEW +P GYS+NVATANA+QVLLATGGGHLVYLEIGDG LT  KHAQLE EI
Sbjct: 418  SSTTRELQNEWHAPAGYSINVATANATQVLLATGGGHLVYLEIGDGTLTHTKHAQLECEI 477

Query: 481  SCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEG 540
            SCLDINPIGENP+YSQ+AAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLC+FEG
Sbjct: 478  SCLDINPIGENPNYSQLAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCSFEG 537

Query: 541  ISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV 600
            ISYLLCALGDGHLLNFLLN+ TGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV
Sbjct: 538  ISYLLCALGDGHLLNFLLNLNTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV 597

Query: 601  IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG 660
            IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG
Sbjct: 598  IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG 657

Query: 661  EHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSIL 720
            EH RRICHQEQSRTFA+CSLKNQ+ AEESE HF+RLLDDQTFEFISTY LD FEYGCSIL
Sbjct: 658  EHARRICHQEQSRTFAVCSLKNQASAEESETHFIRLLDDQTFEFISTYQLDPFEYGCSIL 717

Query: 721  SCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFN 780
            SCSFSDD+N+YYCVGTAYV+PEENEPTKGRILVF+VEDGKLQ+I EKETKGAVYSLN+FN
Sbjct: 718  SCSFSDDNNLYYCVGTAYVMPEENEPTKGRILVFLVEDGKLQVITEKETKGAVYSLNSFN 777

Query: 781  GKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLL 840
            GKLLAAINQKIQLYKWMLRDDG+RELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLL
Sbjct: 778  GKLLAAINQKIQLYKWMLRDDGSRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLL 837

Query: 841  IYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLE 900
            IYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLE
Sbjct: 838  IYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLE 897

Query: 901  VVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKL 960
            VVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQY+FLEKL
Sbjct: 898  VVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYIFLEKL 957

Query: 961  QTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV 1020
            Q+NLR+VIKGVGGL+HEQWRSFNNEKKTV+AKNFLDGDLIESFLDLSR RMDEISK + V
Sbjct: 958  QSNLRRVIKGVGGLSHEQWRSFNNEKKTVEAKNFLDGDLIESFLDLSRNRMDEISKAIGV 1017

Query: 1021 SVEELCKRVEELTRLH 1036
            SVEELCKRVEELTRLH
Sbjct: 1018 SVEELCKRVEELTRLH 1033


>gi|357132340|ref|XP_003567788.1| PREDICTED: DNA damage-binding protein 1a-like [Brachypodium
            distachyon]
          Length = 1090

 Score = 1795 bits (4649), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 864/1038 (83%), Positives = 959/1038 (92%), Gaps = 2/1038 (0%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            +LDVPIYGRIAT+ELFRP  EAQDFLFIATERYKFCVLQWDAE SELITR++GDVSDRIG
Sbjct: 53   LLDVPIYGRIATIELFRPPTEAQDFLFIATERYKFCVLQWDAEKSELITRSVGDVSDRIG 112

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC
Sbjct: 113  RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 172

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
             +PTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGA LLIPVP PL GV+II
Sbjct: 173  LRPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGAGLLIPVPAPLGGVIII 232

Query: 181  GEETIVYCSANA-FKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTG 239
            GEETIVYC+AN+ FKAIPI+ SI +AYGRVD DGSRYLLGD+ G+LHLLV+T E+E+VTG
Sbjct: 233  GEETIVYCNANSTFKAIPIKQSIIRAYGRVDPDGSRYLLGDNTGILHLLVLTQERERVTG 292

Query: 240  LKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPI 299
            LKIE LGETS+AS+ISYLDN VVY+GS +GDSQL+KLNLQ DA GS+VEVLERYVNLGPI
Sbjct: 293  LKIEHLGETSVASSISYLDNGVVYVGSRFGDSQLVKLNLQADATGSFVEVLERYVNLGPI 352

Query: 300  VDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDD 359
            VDFCVVDL+RQGQGQVVTCSGA+KDGS+R+VRNGIGINEQASVELQGIKG+WSL+SS +D
Sbjct: 353  VDFCVVDLDRQGQGQVVTCSGAFKDGSIRVVRNGIGINEQASVELQGIKGLWSLKSSFND 412

Query: 360  PFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRL 419
            P+DTFLVVSFISETR LAMN+EDELEETEIEGF +Q QTLFC +AI + L+QVT+ SVRL
Sbjct: 413  PYDTFLVVSFISETRFLAMNMEDELEETEIEGFDAQIQTLFCQNAISDLLIQVTANSVRL 472

Query: 420  VSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYE 479
            VS  SREL + W +P G+SVNVA+ANASQVLLATGGGHLVYLEI D  L EVKHAQLE+E
Sbjct: 473  VSCASRELVDHWNAPDGFSVNVASANASQVLLATGGGHLVYLEIRDAKLVEVKHAQLEHE 532

Query: 480  ISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFE 539
            ISC+D+NPIGENP YS +AA+GMWTDISV + SLPDL LI KE+LGGEI+PRSVLLC  E
Sbjct: 533  ISCVDLNPIGENPQYSSLAAIGMWTDISVSLLSLPDLELIRKENLGGEIVPRSVLLCTLE 592

Query: 540  GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPT 599
            G+SYLLCALGDGHL +FLLN+ TGELTD+KKVSLGTQPI+LRTFSSK TTHVFA+SDRPT
Sbjct: 593  GVSYLLCALGDGHLFSFLLNVSTGELTDKKKVSLGTQPISLRTFSSKGTTHVFASSDRPT 652

Query: 600  VIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPL 659
            VIYSSNKKLLYSNVNLKEV+HMCPFN+AAFPDSLAIAKEGEL+IGTIDDIQKLHIR+IPL
Sbjct: 653  VIYSSNKKLLYSNVNLKEVNHMCPFNTAAFPDSLAIAKEGELSIGTIDDIQKLHIRTIPL 712

Query: 660  GEHPRRICHQEQSRTFAICSLK-NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCS 718
             E  RRICHQEQSRT A CS K N +  EESE HF+RLLD QTFEF+ST+PLD +E GCS
Sbjct: 713  NEQARRICHQEQSRTLAFCSFKYNPNSMEESEAHFIRLLDHQTFEFLSTHPLDQYECGCS 772

Query: 719  ILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNA 778
            ++SCSFSDD+N YYCVGTAYVLPEENEPTKGRILVF VEDG+LQLI EKETKGAVYSLNA
Sbjct: 773  MISCSFSDDNNFYYCVGTAYVLPEENEPTKGRILVFAVEDGRLQLIVEKETKGAVYSLNA 832

Query: 779  FNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSIS 838
            FNGKLLAAINQKIQLYKWM R+DG+ ELQSECGHHGHILAL+ QTRGDFIVVGDLMKSIS
Sbjct: 833  FNGKLLAAINQKIQLYKWMTREDGSHELQSECGHHGHILALFTQTRGDFIVVGDLMKSIS 892

Query: 839  LLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGR 898
            LL+YKHEE AIEE ARDYNANWM+AVE++DDDIY+GAEN++NLFTVRKNS+ ATDEERGR
Sbjct: 893  LLVYKHEESAIEELARDYNANWMTAVEMIDDDIYVGAENSYNLFTVRKNSDAATDEERGR 952

Query: 899  LEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLE 958
            LEVVGEYHLGEFVNRFRHGSLVMRLPD+++GQIPTVIFGT+NGVIG+IASLPH+QY+FLE
Sbjct: 953  LEVVGEYHLGEFVNRFRHGSLVMRLPDTEMGQIPTVIFGTINGVIGIIASLPHDQYVFLE 1012

Query: 959  KLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTM 1018
            KLQ+ L K IKGVG L+H+QWRSF+NEKKT +A+NFLDGDLIESFLDL+R++M+E+SK M
Sbjct: 1013 KLQSILGKFIKGVGSLSHDQWRSFHNEKKTAEARNFLDGDLIESFLDLNRSKMEEVSKGM 1072

Query: 1019 NVSVEELCKRVEELTRLH 1036
             VSVE L KRVEELTRLH
Sbjct: 1073 GVSVENLSKRVEELTRLH 1090


>gi|218197365|gb|EEC79792.1| hypothetical protein OsI_21216 [Oryza sativa Indica Group]
          Length = 1089

 Score = 1779 bits (4609), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 863/1037 (83%), Positives = 947/1037 (91%), Gaps = 1/1037 (0%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            M+DVPIYGRIATLELFRPH E QDFLFIATERYKFCVLQWD E SEL+TRAMGDVSDRIG
Sbjct: 53   MIDVPIYGRIATLELFRPHNETQDFLFIATERYKFCVLQWDGEKSELLTRAMGDVSDRIG 112

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC
Sbjct: 113  RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 172

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
             KPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGA LLIPVP PL GV+II
Sbjct: 173  VKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGAGLLIPVPAPLGGVIII 232

Query: 181  GEETIVYCSANA-FKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTG 239
            GEETIVYC+AN+ F+AIPI+ SI +AYGRVD DGSRYLLGD+AG+LHLLV+THE+E+VTG
Sbjct: 233  GEETIVYCNANSTFRAIPIKQSIIRAYGRVDPDGSRYLLGDNAGILHLLVLTHERERVTG 292

Query: 240  LKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPI 299
            LKIE LGETSIAS+ISYLDN VVY+GS +GDSQL+KLNLQ D  GSYVEVLERYVNLGPI
Sbjct: 293  LKIEYLGETSIASSISYLDNGVVYVGSRFGDSQLVKLNLQADPNGSYVEVLERYVNLGPI 352

Query: 300  VDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDD 359
            VDFCVVDL+RQGQGQVVTCSGA+KDGSLR+VRNGIGINEQASVELQGIKG+WSL+SS +D
Sbjct: 353  VDFCVVDLDRQGQGQVVTCSGAFKDGSLRVVRNGIGINEQASVELQGIKGLWSLKSSFND 412

Query: 360  PFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRL 419
            P+D +LVVSFISETR LAMN+EDELEETEIEGF +QTQTLFC +AI + L+QVT+ SVRL
Sbjct: 413  PYDMYLVVSFISETRFLAMNMEDELEETEIEGFDAQTQTLFCQNAINDLLIQVTANSVRL 472

Query: 420  VSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYE 479
            VS TSREL ++W +P G+SVNVA+ANASQVLLATGGGHLVYLEI D  L EVKH QLE+E
Sbjct: 473  VSCTSRELVDQWNAPEGFSVNVASANASQVLLATGGGHLVYLEIKDSKLVEVKHIQLEHE 532

Query: 480  ISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFE 539
            ISC+D+NPIGENP YS +AAVGMWTDISVRI SLPDL LI KE+LGGEI+PRSVLLC  E
Sbjct: 533  ISCVDLNPIGENPQYSSLAAVGMWTDISVRILSLPDLELIRKENLGGEIVPRSVLLCTLE 592

Query: 540  GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPT 599
            G+SYLLCALGDGHL +FLLN  TGELTDRKKVSLGTQPI+LRTFSSK TTHVFA+SDRPT
Sbjct: 593  GVSYLLCALGDGHLFSFLLNASTGELTDRKKVSLGTQPISLRTFSSKGTTHVFASSDRPT 652

Query: 600  VIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPL 659
            VIYSSNKKLLYSNVNLKEV+HMCPFN+AA PDSLAIAKEGEL+IGTIDDIQKLHIR+IPL
Sbjct: 653  VIYSSNKKLLYSNVNLKEVNHMCPFNTAAIPDSLAIAKEGELSIGTIDDIQKLHIRTIPL 712

Query: 660  GEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSI 719
             E       +    T  +    NQ+  EESE HFVRLLD QTFEF+S Y LD +E+GCSI
Sbjct: 713  NEQHVAFAIRNVQDTSVLQLQHNQTSIEESETHFVRLLDHQTFEFLSIYQLDQYEHGCSI 772

Query: 720  LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAF 779
            +SCSFSDD+NVYYCVGTAYVLPEENEP+KGRILVF VEDG+LQLI EKETKGAVYSLNAF
Sbjct: 773  ISCSFSDDNNVYYCVGTAYVLPEENEPSKGRILVFAVEDGRLQLIVEKETKGAVYSLNAF 832

Query: 780  NGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISL 839
            NGKLLAAINQKIQLYKWMLR+DG+ ELQSECGHHGHILALY QTRGDFIVVGDLMKSISL
Sbjct: 833  NGKLLAAINQKIQLYKWMLREDGSHELQSECGHHGHILALYTQTRGDFIVVGDLMKSISL 892

Query: 840  LIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRL 899
            L+YKHEE AIEE ARDYNANWMSAVE+LDD+IY+GAENN+N+FTVRKNS+ ATDEERGRL
Sbjct: 893  LVYKHEESAIEELARDYNANWMSAVEMLDDEIYIGAENNYNIFTVRKNSDAATDEERGRL 952

Query: 900  EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEK 959
            EVVGEYHLGEFVNR RHGSLVMRLPDS++GQIPTVIFGT+NGVIG+IASLPHEQY+FLEK
Sbjct: 953  EVVGEYHLGEFVNRLRHGSLVMRLPDSEMGQIPTVIFGTINGVIGIIASLPHEQYVFLEK 1012

Query: 960  LQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN 1019
            LQ+ L K IKGVG L+HEQWRSF+N+KKT +A+NFLDGDLIESFLDLSR +M+E++K M 
Sbjct: 1013 LQSTLVKFIKGVGNLSHEQWRSFHNDKKTSEARNFLDGDLIESFLDLSRNKMEEVAKGMG 1072

Query: 1020 VSVEELCKRVEELTRLH 1036
            V VEEL KRVEELTRLH
Sbjct: 1073 VPVEELSKRVEELTRLH 1089


>gi|242089089|ref|XP_002440377.1| hypothetical protein SORBIDRAFT_09g030580 [Sorghum bicolor]
 gi|241945662|gb|EES18807.1| hypothetical protein SORBIDRAFT_09g030580 [Sorghum bicolor]
          Length = 1783

 Score = 1776 bits (4600), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 856/1056 (81%), Positives = 947/1056 (89%), Gaps = 24/1056 (2%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            MLDVPIYGRIAT+ELFRPH E QDFLFIATERYKFCVLQWDAE SELITRAMGDVSDRIG
Sbjct: 53   MLDVPIYGRIATIELFRPHNETQDFLFIATERYKFCVLQWDAEKSELITRAMGDVSDRIG 112

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL+GC
Sbjct: 113  RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLHGC 172

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
             KPTIVVLYQDNKD RHVKTYEVALKDKDFVEGPWSQNN+DNGA LLIPVP PL GV+II
Sbjct: 173  VKPTIVVLYQDNKDVRHVKTYEVALKDKDFVEGPWSQNNVDNGAGLLIPVPAPLGGVIII 232

Query: 181  GEETIVYCSANA-FKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTG 239
            GEE IVYC+AN+ FKAIPI+ SI +AYGRVD DGSRYLLGD+ G+LHLLV+THE+E+VTG
Sbjct: 233  GEEQIVYCNANSTFKAIPIKQSIIRAYGRVDPDGSRYLLGDNTGILHLLVLTHERERVTG 292

Query: 240  LKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPI 299
            LKIE LGETSIAS+ISYLDN VVY+GS +GDSQL+KLNLQ DA GS+VE+LERYVNLGPI
Sbjct: 293  LKIEYLGETSIASSISYLDNGVVYVGSRFGDSQLVKLNLQADASGSFVEILERYVNLGPI 352

Query: 300  VDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDD 359
            VDFCVVDL+RQGQGQVVTCSGA+KDGSLR+VRNGIGINEQASVELQGIKG+WSL+SS +D
Sbjct: 353  VDFCVVDLDRQGQGQVVTCSGAFKDGSLRVVRNGIGINEQASVELQGIKGLWSLKSSFND 412

Query: 360  PFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRL 419
            P+D +LVVSFISETR LAMN+EDELEETEIEGF +QTQTLFC +A  + L+QVT+ SVRL
Sbjct: 413  PYDMYLVVSFISETRFLAMNMEDELEETEIEGFDAQTQTLFCQNATNDLLIQVTANSVRL 472

Query: 420  VSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYE 479
            VS TSREL ++W +P G+SVNVA+ANASQVLLATGGGHLVYLEI D  L EVKHAQLE+E
Sbjct: 473  VSCTSRELVDQWNAPAGFSVNVASANASQVLLATGGGHLVYLEIRDSKLVEVKHAQLEHE 532

Query: 480  ISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFE 539
            ISCLD+NPIGENP YS +AAVGMWTDISVRIFSLPDL LI KE+LGGEI+PRSVLLC  E
Sbjct: 533  ISCLDLNPIGENPQYSSLAAVGMWTDISVRIFSLPDLELIRKENLGGEIVPRSVLLCTLE 592

Query: 540  GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPT 599
            G+SYLLCALGDG+L +FLLN  TGELTDRKKV+LGTQPI+LRTFSSK TTHVFA+SDRPT
Sbjct: 593  GVSYLLCALGDGNLFSFLLNASTGELTDRKKVTLGTQPISLRTFSSKGTTHVFASSDRPT 652

Query: 600  VIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPL 659
            VIYSSNKKLLYSNVNLKEV+HMCPFN+AAFPDSLAIAKEGEL+IGTIDDIQKLHIR+IPL
Sbjct: 653  VIYSSNKKLLYSNVNLKEVNHMCPFNTAAFPDSLAIAKEGELSIGTIDDIQKLHIRTIPL 712

Query: 660  GEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSI 719
             E  RRICHQEQS+T A CS K     EESE H +RLLD QTFE +  YPLD +E+GCSI
Sbjct: 713  NEQARRICHQEQSKTLAFCSFKYNQSVEESETHLIRLLDHQTFESLCVYPLDQYEFGCSI 772

Query: 720  LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAF 779
            +SCSF+DD+NVYYCVGTAYV+PEENEPTKGRILVF VEDG LQLI EKETKGAVYSLNAF
Sbjct: 773  ISCSFADDNNVYYCVGTAYVIPEENEPTKGRILVFAVEDGSLQLIVEKETKGAVYSLNAF 832

Query: 780  NGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISL 839
            NGKLLAAINQKIQLYKWM R+DG+ ELQSECGHHGHILALY QTRGDFIVVGDLMKSISL
Sbjct: 833  NGKLLAAINQKIQLYKWMSREDGSHELQSECGHHGHILALYTQTRGDFIVVGDLMKSISL 892

Query: 840  LIYKHE-----------------------EGAIEERARDYNANWMSAVEILDDDIYLGAE 876
            L+YK                         E AIEERARDYNANWM+AVE+LDD++Y+GAE
Sbjct: 893  LVYKVVPLTVCLTHIVLSVIFFVSLFVVLESAIEERARDYNANWMTAVEMLDDEVYVGAE 952

Query: 877  NNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIF 936
            N +NLFTVRKNS+ ATD+ER RLEVVGEYHLGEFVNRFRHGSLVMRLPDS++GQIPTVIF
Sbjct: 953  NGYNLFTVRKNSDAATDDERARLEVVGEYHLGEFVNRFRHGSLVMRLPDSEIGQIPTVIF 1012

Query: 937  GTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLD 996
            GT+NGVIG+IASLPH+QY+FLEKLQ+ L K IKGVG L+HEQWRSF+N+KKT +A+NFLD
Sbjct: 1013 GTINGVIGIIASLPHDQYVFLEKLQSTLVKYIKGVGNLSHEQWRSFHNDKKTAEARNFLD 1072

Query: 997  GDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            GDLIESFLDLSR++M+E+SK M V VEEL KRV+ +
Sbjct: 1073 GDLIESFLDLSRSKMEEVSKAMGVPVEELSKRVDTI 1108


>gi|168066745|ref|XP_001785293.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162663100|gb|EDQ49884.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 1090

 Score = 1771 bits (4588), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 847/1038 (81%), Positives = 937/1038 (90%), Gaps = 2/1038 (0%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            MLDVPIYGRIATLELFRP GE+QD LFI+ ERYKFCVLQWDAE+  L+TRAMGDVSDRIG
Sbjct: 53   MLDVPIYGRIATLELFRPPGESQDVLFISFERYKFCVLQWDAETGLLVTRAMGDVSDRIG 112

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RPTDNGQIGI+DPDCRLIGLHLYDGLFKVIP DNKGQLKEAFNIRLEELQVLDIKFLYGC
Sbjct: 113  RPTDNGQIGIVDPDCRLIGLHLYDGLFKVIPIDNKGQLKEAFNIRLEELQVLDIKFLYGC 172

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
            AKPTI VLYQDNKDARHVKTYEV LK+KDF EGPW QNNLDNGA LLIPVP PL G +II
Sbjct: 173  AKPTIAVLYQDNKDARHVKTYEVQLKEKDFGEGPWLQNNLDNGAGLLIPVPLPLGGAIII 232

Query: 181  GEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL 240
            GE+TIVY + + FKAIPIRPSITKAYGRVD+DGSRYLL DH G+L+LLVI+H+KE+V+ L
Sbjct: 233  GEQTIVYYNGSVFKAIPIRPSITKAYGRVDSDGSRYLLSDHNGMLYLLVISHDKERVSAL 292

Query: 241  KIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIV 300
             +E LGETS AST+SYLDN VV++GSSYGDSQLI+LN Q DAK SYVEVLE YVNLGPIV
Sbjct: 293  NVEPLGETSAASTLSYLDNGVVFVGSSYGDSQLIRLNHQADAKNSYVEVLESYVNLGPIV 352

Query: 301  DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP 360
            D CVVDLERQGQGQVVTCSGA+KDGSLRIVRNGIGINEQAS ELQGIKGMWSLR+S+ D 
Sbjct: 353  DLCVVDLERQGQGQVVTCSGAFKDGSLRIVRNGIGINEQASAELQGIKGMWSLRASSSDV 412

Query: 361  FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLV 420
            +DTFLVVSFISETRILAMN +DELEETEI+GF S+ QTLFC++A+++QLVQVT+GS+RLV
Sbjct: 413  YDTFLVVSFISETRILAMNTDDELEETEIDGFDSEAQTLFCYNAVHDQLVQVTAGSLRLV 472

Query: 421  SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEI 480
             + +R    EWK+P   ++NVATANASQVLLATGGG+LVY+EIG G L EV H QLEYEI
Sbjct: 473  DAKTRRQLTEWKAPAPMTINVATANASQVLLATGGGNLVYIEIGQGTLVEVAHVQLEYEI 532

Query: 481  SCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEG 540
            SCLDINP+G+NP  S + AVGMWTDIS+RIF+LP L LITKE LGGEIIPRSVL C+F+G
Sbjct: 533  SCLDINPVGDNPERSNLVAVGMWTDISIRIFALPSLTLITKEMLGGEIIPRSVLFCSFDG 592

Query: 541  ISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV 600
            I+YLLCALGDGHL NF LN  TGEL+DRKK+SLGTQPI LRTF SKNTTHVFAASDRPTV
Sbjct: 593  IAYLLCALGDGHLFNFTLNPATGELSDRKKISLGTQPIALRTFRSKNTTHVFAASDRPTV 652

Query: 601  IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG 660
            IYSSNKKLLYSNVNLKEV+HMCPFNSA+FPDSLAI KEGELTIGTID+IQKLHIR++PLG
Sbjct: 653  IYSSNKKLLYSNVNLKEVNHMCPFNSASFPDSLAIGKEGELTIGTIDEIQKLHIRTVPLG 712

Query: 661  EHPRRICHQEQSRTFAICSLKNQ--SCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCS 718
            EHPRRI HQEQSRTFAICS K    S  E+ E H+VRL++DQTFE IS +PLD +E GCS
Sbjct: 713  EHPRRIAHQEQSRTFAICSAKYAPGSNGEDMETHYVRLIEDQTFEIISGFPLDPYENGCS 772

Query: 719  ILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNA 778
            I++CSF+DDSNVYYCVGTAY LPEE+EP+KGRILVF VEDGK+QL+AEKE KGAVY+LNA
Sbjct: 773  IITCSFTDDSNVYYCVGTAYALPEESEPSKGRILVFSVEDGKIQLVAEKEVKGAVYNLNA 832

Query: 779  FNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSIS 838
            FNGKLLA INQKI LYKW LRDDGTRELQ E  HHGHILALYVQ+RGDFIVVGDLMKSIS
Sbjct: 833  FNGKLLAGINQKIALYKWTLRDDGTRELQYESSHHGHILALYVQSRGDFIVVGDLMKSIS 892

Query: 839  LLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGR 898
            LLIYK EEGAIEERARDYNANWM+AVEILDDD YLGAEN+FNLFTVRKN++ ATDEERGR
Sbjct: 893  LLIYKPEEGAIEERARDYNANWMTAVEILDDDTYLGAENSFNLFTVRKNNDAATDEERGR 952

Query: 899  LEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLE 958
            LEVVGEYHLGEFVNRFRHGSLVMRLPDS+  QIPTVIFGTVNGVIGVIASLP +Q+LFL+
Sbjct: 953  LEVVGEYHLGEFVNRFRHGSLVMRLPDSEASQIPTVIFGTVNGVIGVIASLPQDQFLFLQ 1012

Query: 959  KLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTM 1018
            KLQ  L KVIKGVGGL+HEQWRSF+NE+KTVDA+NFLDGDLIESFLDLSR +M+EI+ ++
Sbjct: 1013 KLQQALVKVIKGVGGLSHEQWRSFSNERKTVDARNFLDGDLIESFLDLSRNKMEEIATSL 1072

Query: 1019 NVSVEELCKRVEELTRLH 1036
             VSVEEL K VEELTRLH
Sbjct: 1073 EVSVEELHKAVEELTRLH 1090


>gi|302788810|ref|XP_002976174.1| hypothetical protein SELMODRAFT_151061 [Selaginella moellendorffii]
 gi|300156450|gb|EFJ23079.1| hypothetical protein SELMODRAFT_151061 [Selaginella moellendorffii]
          Length = 1089

 Score = 1771 bits (4586), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 839/1038 (80%), Positives = 940/1038 (90%), Gaps = 3/1038 (0%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            +LDVPIYGRIATLELFRP GE QD LF++TERYKFCVLQWD+E++EL+TRAMGDVSDRIG
Sbjct: 53   LLDVPIYGRIATLELFRPPGETQDVLFVSTERYKFCVLQWDSETTELVTRAMGDVSDRIG 112

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RPTDNGQIGI+DP+CRLIGLHLYDGLFKVIP DNKGQLKEAFNIRLEELQVLDIKFLYGC
Sbjct: 113  RPTDNGQIGIVDPECRLIGLHLYDGLFKVIPIDNKGQLKEAFNIRLEELQVLDIKFLYGC 172

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
            +KPTI VLYQDNKDARHVKTYE+ LK+KDF EGPW QNNLDNGA +LIPVP PL GV+II
Sbjct: 173  SKPTIAVLYQDNKDARHVKTYEIQLKEKDFGEGPWLQNNLDNGAGMLIPVPTPLGGVIII 232

Query: 181  GEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL 240
            GE+TIVY S +AFKAIPIRPSITKAYG+VDADGSRYLL DH G LHLLVITHE+++V GL
Sbjct: 233  GEQTIVYYSGSAFKAIPIRPSITKAYGKVDADGSRYLLSDHTGSLHLLVITHERDRVLGL 292

Query: 241  KIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIV 300
            K+ELLGETS AS++SYLDN VVY+GSSYGDSQLIKLN Q D++ SYVEVLE +VNLGPIV
Sbjct: 293  KVELLGETSAASSLSYLDNGVVYVGSSYGDSQLIKLNAQVDSRNSYVEVLESFVNLGPIV 352

Query: 301  DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP 360
            D CVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQAS ELQGIKGMWSLR+++ D 
Sbjct: 353  DLCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASAELQGIKGMWSLRATSKDV 412

Query: 361  FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLV 420
            FD FLVVSFISETRILAMN++DELEETEIEGF S+ QTLFCH+AI++Q++QVTS S+RLV
Sbjct: 413  FDIFLVVSFISETRILAMNMDDELEETEIEGFDSEAQTLFCHNAIHDQIIQVTSTSLRLV 472

Query: 421  SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEI 480
             +TSR    EW++P G ++NVATANASQVLLATGGG+LVY+EIG G LTEVKH QL+ EI
Sbjct: 473  DATSRRQLTEWRTPSGVAINVATANASQVLLATGGGNLVYIEIGRGSLTEVKHVQLQNEI 532

Query: 481  SCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEG 540
            SCLDINPIG +P  S IAAVGMWTDISV++FSLP+L ++ KE LGGEIIPRSVL+CAFEG
Sbjct: 533  SCLDINPIGTDPDRSNIAAVGMWTDISVQVFSLPNLEVLAKESLGGEIIPRSVLICAFEG 592

Query: 541  ISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV 600
            +SYLLCALGDGHL NFL++  TG+L+DRKK+SLGTQPI LRTF SKN THVFAASDRPTV
Sbjct: 593  VSYLLCALGDGHLFNFLIDTSTGQLSDRKKISLGTQPIMLRTFRSKNATHVFAASDRPTV 652

Query: 601  IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG 660
            IYSSNKKLLYSNVNLKEV+HMCPFNSA+FPDSLAI KEGELTIGTIDDIQKLHIR++ LG
Sbjct: 653  IYSSNKKLLYSNVNLKEVNHMCPFNSASFPDSLAIGKEGELTIGTIDDIQKLHIRTVALG 712

Query: 661  EHPRRICHQEQSRTFAICSLK--NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCS 718
            EHPRRICHQEQ+RTF +C+ +  +    E+ E HFV+LLDDQTFE + +Y LDTFE GC+
Sbjct: 713  EHPRRICHQEQTRTFGLCTARFYSNPNGEDHESHFVKLLDDQTFEVLGSYNLDTFENGCT 772

Query: 719  ILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNA 778
            I++CSF+DD   YYCVGTAY LPEENEP+KGRIL+F VEDGK QL+ EKETKGAVY+LNA
Sbjct: 773  IITCSFTDDPATYYCVGTAYALPEENEPSKGRILIFTVEDGKFQLVTEKETKGAVYNLNA 832

Query: 779  FNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSIS 838
            FNGKLLA INQKIQLYKW  R D TRELQSECGHHGHILALYVQ+RGDFIVVGDLMKSIS
Sbjct: 833  FNGKLLAGINQKIQLYKWTQR-DSTRELQSECGHHGHILALYVQSRGDFIVVGDLMKSIS 891

Query: 839  LLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGR 898
            LL+YK EEGAIEERARDYNANWM+AVEILDDDIYLGAEN+FNLFTVRKNS+ ATDEERGR
Sbjct: 892  LLLYKPEEGAIEERARDYNANWMTAVEILDDDIYLGAENSFNLFTVRKNSDAATDEERGR 951

Query: 899  LEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLE 958
            LEVVGEYHLGEFVNRFRHGSLVMRLPD++  QIPTVIFGTVNGVIGV+ASL  EQ+ FL+
Sbjct: 952  LEVVGEYHLGEFVNRFRHGSLVMRLPDNETSQIPTVIFGTVNGVIGVVASLQQEQFNFLQ 1011

Query: 959  KLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTM 1018
            +LQ  L KVIKGVGGL+HEQWRSF++E+K  DAKNFLDGDLIESFLDL+R +MDE+S  M
Sbjct: 1012 RLQHCLAKVIKGVGGLSHEQWRSFSSERKNADAKNFLDGDLIESFLDLNRAKMDEVSAAM 1071

Query: 1019 NVSVEELCKRVEELTRLH 1036
            +VSVEELCKRVEE+TRLH
Sbjct: 1072 SVSVEELCKRVEEMTRLH 1089


>gi|302769568|ref|XP_002968203.1| hypothetical protein SELMODRAFT_145521 [Selaginella moellendorffii]
 gi|300163847|gb|EFJ30457.1| hypothetical protein SELMODRAFT_145521 [Selaginella moellendorffii]
          Length = 1089

 Score = 1770 bits (4584), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 838/1038 (80%), Positives = 939/1038 (90%), Gaps = 3/1038 (0%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            +LDVPIYGRIATLELFRP GE QD LF++TERYKFCVLQWD+E++EL+TRAMGDVSDRIG
Sbjct: 53   LLDVPIYGRIATLELFRPPGETQDVLFVSTERYKFCVLQWDSETTELVTRAMGDVSDRIG 112

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RPTDNGQIGI+DP+CRLIGLHLYDGLFKVIP DNKGQLKEAFNIRLEELQVLDIKFLYGC
Sbjct: 113  RPTDNGQIGIVDPECRLIGLHLYDGLFKVIPIDNKGQLKEAFNIRLEELQVLDIKFLYGC 172

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
            +KPTI VLYQDNKDARHVKTYE+ LK+KDF EGPWSQNNLDNGA +LIPVP PL GV+II
Sbjct: 173  SKPTIAVLYQDNKDARHVKTYEIQLKEKDFGEGPWSQNNLDNGAGMLIPVPTPLGGVIII 232

Query: 181  GEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL 240
            GE+TIVY S +AFKAIPIRPSITKAYG+VDADGSRYLL DH G LHLLVITHE+++V GL
Sbjct: 233  GEQTIVYYSGSAFKAIPIRPSITKAYGKVDADGSRYLLSDHTGSLHLLVITHERDRVLGL 292

Query: 241  KIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIV 300
            K+ELLGETS AS++SYLDN VVY+GSSYGDSQLIKLN Q D++ SYVEVLE +VNLGPIV
Sbjct: 293  KVELLGETSAASSLSYLDNGVVYVGSSYGDSQLIKLNAQVDSRNSYVEVLESFVNLGPIV 352

Query: 301  DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP 360
            D CVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQAS ELQGIKGMWSLR+++ D 
Sbjct: 353  DLCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASAELQGIKGMWSLRATSKDV 412

Query: 361  FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLV 420
            FD FLVVSFISETRILAMN++DELEETEIEGF S+ QTLFCH+AI++Q++QVTS S+RLV
Sbjct: 413  FDIFLVVSFISETRILAMNMDDELEETEIEGFDSEAQTLFCHNAIHDQIIQVTSTSLRLV 472

Query: 421  SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEI 480
             +TSR    EW++P   ++NVATANASQVLLATGGG+LVY+EIG G LTEVKH QL+ EI
Sbjct: 473  DATSRRQLTEWRTPSAVAINVATANASQVLLATGGGNLVYIEIGRGSLTEVKHVQLQNEI 532

Query: 481  SCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEG 540
            SCLDINPIG +P  S IAAVGMWTDISV++FSLP+L ++ KE LGGEIIPRSVL+CAFEG
Sbjct: 533  SCLDINPIGTDPDRSNIAAVGMWTDISVQVFSLPNLEVLAKESLGGEIIPRSVLICAFEG 592

Query: 541  ISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV 600
            +SYLLCALGDGHL NFL++   G+L+DRKK+SLGTQPI LRTF SKN THVFAASDRPTV
Sbjct: 593  VSYLLCALGDGHLFNFLIDTSNGQLSDRKKISLGTQPIMLRTFRSKNATHVFAASDRPTV 652

Query: 601  IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG 660
            IYSSNKKLLYSNVNLKEV+HMCPFNSA+FPDSLAI KEGELTIGTIDDIQKLHIR++ LG
Sbjct: 653  IYSSNKKLLYSNVNLKEVNHMCPFNSASFPDSLAIGKEGELTIGTIDDIQKLHIRTVALG 712

Query: 661  EHPRRICHQEQSRTFAICSLK--NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCS 718
            EHPRRICHQEQ+RTF +C+ +  +    E+ E HFV+LLDDQTFE + +Y LDTFE GC+
Sbjct: 713  EHPRRICHQEQTRTFGLCTARFYSNPNGEDHESHFVKLLDDQTFEVLGSYNLDTFENGCT 772

Query: 719  ILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNA 778
            I++CSF+DD   YYCVGTAY LPEENEP+KGRIL+F VEDGK QL+ EKETKGAVY+LNA
Sbjct: 773  IITCSFTDDPATYYCVGTAYALPEENEPSKGRILIFTVEDGKFQLVTEKETKGAVYNLNA 832

Query: 779  FNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSIS 838
            FNGKLLA INQKIQLYKW  R D TRELQSECGHHGHILALYVQ+RGDFIVVGDLMKSIS
Sbjct: 833  FNGKLLAGINQKIQLYKWTQR-DSTRELQSECGHHGHILALYVQSRGDFIVVGDLMKSIS 891

Query: 839  LLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGR 898
            LL+YK EEGAIEERARDYNANWM+AVEILDDDIYLGAEN+FNLFTVRKNS+ ATDEERGR
Sbjct: 892  LLLYKPEEGAIEERARDYNANWMTAVEILDDDIYLGAENSFNLFTVRKNSDAATDEERGR 951

Query: 899  LEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLE 958
            LEVVGEYHLGEFVNRFRHGSLVMRLPD++  QIPTVIFGTVNGVIGV+ASL  EQ+ FL+
Sbjct: 952  LEVVGEYHLGEFVNRFRHGSLVMRLPDNETSQIPTVIFGTVNGVIGVVASLQQEQFNFLQ 1011

Query: 959  KLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTM 1018
            +LQ  L KVIKGVGGL+HEQWRSF++E+K  DAKNFLDGDLIESFLDL+R +MDE+S  M
Sbjct: 1012 RLQHCLAKVIKGVGGLSHEQWRSFSSERKNADAKNFLDGDLIESFLDLNRAKMDEVSAAM 1071

Query: 1019 NVSVEELCKRVEELTRLH 1036
            +VSVEELCKRVEE+TRLH
Sbjct: 1072 SVSVEELCKRVEEMTRLH 1089


>gi|168047617|ref|XP_001776266.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162672361|gb|EDQ58899.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 1089

 Score = 1747 bits (4525), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 836/1038 (80%), Positives = 932/1038 (89%), Gaps = 3/1038 (0%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            MLDVP+YGRIATLELFRP GE+QD LFI+ ERYKFCVLQWDAE+   ITRAMGDVSDR G
Sbjct: 53   MLDVPLYGRIATLELFRPPGESQDVLFISFERYKFCVLQWDAETGSPITRAMGDVSDRTG 112

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RPTDNGQIGI+DPDCRLIGLHLYDG+FKVIP DNKGQLKEAFNIRLEELQVLDIKFLYGC
Sbjct: 113  RPTDNGQIGIVDPDCRLIGLHLYDGMFKVIPIDNKGQLKEAFNIRLEELQVLDIKFLYGC 172

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
            A PTI VLYQDNKDARHVKTYEV LK+KDF EGPW QNNLDNGA LLIPVP PL G +II
Sbjct: 173  ANPTIAVLYQDNKDARHVKTYEVNLKEKDFGEGPWLQNNLDNGAGLLIPVPLPLGGAIII 232

Query: 181  GEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL 240
            GE+TIVY + + FKAIPIRPSITKAYGRVD+DGSRYLL DH G+L+LLVI+H+KE+V+ L
Sbjct: 233  GEQTIVYYNGSVFKAIPIRPSITKAYGRVDSDGSRYLLSDHNGMLYLLVISHDKERVSAL 292

Query: 241  KIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIV 300
             +E LGETS AST+SYLDN VV++GSSYGDSQLI+LN Q D KGSYVEVLE +VNLGPIV
Sbjct: 293  NVEPLGETSAASTLSYLDNGVVFVGSSYGDSQLIRLNHQADVKGSYVEVLESFVNLGPIV 352

Query: 301  DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP 360
            D CVVDLERQGQGQVVTCSGA+KDGSLRIVRNGIGINEQASVELQGIKGMWSLR+S+ D 
Sbjct: 353  DLCVVDLERQGQGQVVTCSGAFKDGSLRIVRNGIGINEQASVELQGIKGMWSLRASSSDV 412

Query: 361  FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLV 420
            +DTFLVVSFISETRILAMN +DELEETEI+GF S+ QTLFCH+A+++QLVQVT+GS+RLV
Sbjct: 413  YDTFLVVSFISETRILAMNTDDELEETEIDGFDSEAQTLFCHNAVHDQLVQVTAGSLRLV 472

Query: 421  SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEI 480
            ++ +R+   EWK+P   ++NVATANASQVLLATGGG+LVY+EIG G LT V H+QLEYEI
Sbjct: 473  NAKTRKQLTEWKAPAPMTINVATANASQVLLATGGGNLVYIEIGQGTLTGVAHSQLEYEI 532

Query: 481  SCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEG 540
            SCLDINP+GENP  S + AVGMWTDISVRIF+LP L LI KE LGGEIIPRSVL C+F+G
Sbjct: 533  SCLDINPVGENPERSNLVAVGMWTDISVRIFALPSLTLINKEMLGGEIIPRSVLFCSFDG 592

Query: 541  ISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV 600
            ++YLLCA+GDGHL NF+LN  TGEL+DRKK+SLGTQPI LRTF SKNTTHVFAASDRPTV
Sbjct: 593  LAYLLCAVGDGHLFNFMLNPSTGELSDRKKISLGTQPIALRTFRSKNTTHVFAASDRPTV 652

Query: 601  IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG 660
            IYSSNKKLLYSNVNLKEV+HMCPFNSA+FPDSLAI KEGELTIGTIDDIQKLHIR++PLG
Sbjct: 653  IYSSNKKLLYSNVNLKEVNHMCPFNSASFPDSLAIGKEGELTIGTIDDIQKLHIRTVPLG 712

Query: 661  EHPRRICHQEQSRTFAICSLKNQSCA--EESEMHFVRLLDDQTFEFISTYPLDTFEYGCS 718
            E P RI HQEQSR+FAICS K       E+ E H+VRL++DQTFE  S + LD +E GCS
Sbjct: 713  ERPCRIAHQEQSRSFAICSAKYSQGPNNEDIETHYVRLIEDQTFEITSGFALDLYEIGCS 772

Query: 719  ILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNA 778
            I++CSF+DDSNVYYCVGTAY LPEE+EPTKGRILVF+VEDGKLQL+AEKE KGAVY+LNA
Sbjct: 773  IITCSFTDDSNVYYCVGTAYALPEESEPTKGRILVFLVEDGKLQLVAEKEMKGAVYNLNA 832

Query: 779  FNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSIS 838
            FNGKLLA INQKI LYKW LR DGTR L+ E  HHGHILALYVQ+RGDFIVVGDLMKSIS
Sbjct: 833  FNGKLLAGINQKIALYKWTLR-DGTRVLEIESSHHGHILALYVQSRGDFIVVGDLMKSIS 891

Query: 839  LLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGR 898
            LLIYK EEGAIEERARDYNANWM+AVEILDDD YLGAEN+FNLFTVRKN++ ATDEERGR
Sbjct: 892  LLIYKPEEGAIEERARDYNANWMTAVEILDDDTYLGAENSFNLFTVRKNNDAATDEERGR 951

Query: 899  LEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLE 958
            LEVVGEYHLGEFVNRFRHGSLVMRLPDS+   IPTVIFGTVNGVIGVIASLP +++LFL+
Sbjct: 952  LEVVGEYHLGEFVNRFRHGSLVMRLPDSEASLIPTVIFGTVNGVIGVIASLPQDKFLFLQ 1011

Query: 959  KLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTM 1018
            KLQ  L KVIKGVGGL+HEQWRSF+NE+KTVDA+NFLDGDLIESFLDLSR +M+EI+  +
Sbjct: 1012 KLQQALVKVIKGVGGLSHEQWRSFSNERKTVDARNFLDGDLIESFLDLSRNKMEEIAAPL 1071

Query: 1019 NVSVEELCKRVEELTRLH 1036
             +SVEELCKRVEELTRLH
Sbjct: 1072 EISVEELCKRVEELTRLH 1089


>gi|357135348|ref|XP_003569272.1| PREDICTED: DNA damage-binding protein 1a-like [Brachypodium
            distachyon]
          Length = 1074

 Score = 1460 bits (3779), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 707/1036 (68%), Positives = 846/1036 (81%), Gaps = 14/1036 (1%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            M+DVP+YG IATLELFR   E QDFLFI+ ERY+  VL WD  +SELITR+ GDVSD IG
Sbjct: 53   MVDVPLYGTIATLELFRSRSETQDFLFISMERYRCIVLHWDGRNSELITRSGGDVSDFIG 112

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RPTDNGQIG+IDP  RLIGL LYDGLFKVIPFDNKG LKEA NIRL+E  VLDIKFLYGC
Sbjct: 113  RPTDNGQIGVIDPQNRLIGLSLYDGLFKVIPFDNKGNLKEALNIRLQEFLVLDIKFLYGC 172

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
            A+PT+VVL+QDNKD+RHVKTYEVAL+DKDFVEG WSQ+NLDN A LLIPVP  L GV+II
Sbjct: 173  ARPTVVVLHQDNKDSRHVKTYEVALEDKDFVEGSWSQSNLDNSAHLLIPVP--LGGVIII 230

Query: 181  GEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL 240
            GE TIVYCSA  FKA+ I+ SI +A GRVD DGSRYL GD+ G LHL+VITHE  +VT L
Sbjct: 231  GEHTIVYCSATTFKALSIKQSIIRAVGRVDPDGSRYLYGDNTGALHLIVITHEWGRVTDL 290

Query: 241  KIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIV 300
            K   +GETSIASTISYLD+ +VYIGS +GDSQLIKLN+Q DA  S+VE+LE+++N GPIV
Sbjct: 291  KTHYMGETSIASTISYLDSGLVYIGSRFGDSQLIKLNIQADASASFVEILEQFMNTGPIV 350

Query: 301  DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP 360
            DFCVVD ER+GQGQV+TCSGAYKDGS+R VRNG+ I +QASVEL+G+KG+WS++SS +DP
Sbjct: 351  DFCVVDTERRGQGQVITCSGAYKDGSIRAVRNGVVITDQASVELRGMKGLWSMKSSLNDP 410

Query: 361  FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLV 420
            +DTFLVV+FI+ET  LAMN+E+ELEE +I+GF S+TQTL C  AI+NQL+QVTS SVRLV
Sbjct: 411  YDTFLVVTFINETHFLAMNMENELEEVDIKGFDSETQTLACGSAIHNQLIQVTSRSVRLV 470

Query: 421  SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEI 480
            SS S EL ++W +P  +SVNVA ANA+QVLLATG  HLVYLEI    +  VKH QLE+EI
Sbjct: 471  SSVSLELLDQWFAPARFSVNVAAANANQVLLATGNCHLVYLEITSSKIVPVKHIQLEHEI 530

Query: 481  SCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEG 540
            SCLDINPIGENP YS +AAVGMWTDISVRIFSLP L LI KEHL GE++PRSVLLC  E 
Sbjct: 531  SCLDINPIGENPQYSSLAAVGMWTDISVRIFSLPGLKLIRKEHL-GEVVPRSVLLCTIEA 589

Query: 541  ISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV 600
            +SYL C LGDGHL +F+LN  T EL+DR++VSLG QPI+L  FSS+N THVFAASDRP V
Sbjct: 590  VSYLFCGLGDGHLFSFVLNSSTCELSDRRRVSLGAQPISLHIFSSQNRTHVFAASDRPAV 649

Query: 601  IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG 660
            IYSS++KLLYS  NLKEV+H+CPFN+A FP+S+ +AKE EL+IG I+DI++LHIR+IPL 
Sbjct: 650  IYSSDQKLLYSYANLKEVNHVCPFNTAVFPESIVLAKESELSIGEINDIRQLHIRTIPLK 709

Query: 661  EHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSIL 720
            E  RRICHQEQS+T A+CS K +    ES  HFVRLLD QTF  +ST+ LD FE GCSI+
Sbjct: 710  EQARRICHQEQSQTLALCSFKPKFIHAESGKHFVRLLDYQTFWVLSTHTLDEFECGCSIV 769

Query: 721  SCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFN 780
            SCSFSDD N YYCVGTAY+LP E EPTKGRIL+F+VE+ KL+L+AE+ETKGAVYSLNA  
Sbjct: 770  SCSFSDDDNFYYCVGTAYILPYEIEPTKGRILIFLVEERKLRLVAERETKGAVYSLNALT 829

Query: 781  GKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLL 840
            GKLLAA+NQKI +YKW+ RD+   +LQSEC + G +LAL+ QT G FIVVGD+++S+SLL
Sbjct: 830  GKLLAAVNQKIIVYKWVRRDN-RHQLQSECSYRGCVLALHTQTHGHFIVVGDMVRSVSLL 888

Query: 841  IYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLE 900
             YK+EEG IE   RD+N  W++AV +LDDDIY+GA+N  NLFT+            GR  
Sbjct: 889  RYKYEEGLIEVVTRDFNTKWITAVAMLDDDIYIGADNCCNLFTLHS----------GRPG 938

Query: 901  VVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKL 960
            VVGEYHLG+ VNR  HGSLVM   DS++GQIPTVIFGT++G IGVIAS P++QY+FLEKL
Sbjct: 939  VVGEYHLGDLVNRMHHGSLVMHHTDSEIGQIPTVIFGTISGAIGVIASFPYDQYVFLEKL 998

Query: 961  QTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV 1020
            Q+ L K IK VG L+H +WRSF N  +T +A+NF+DGDLIESFL LS ++M+E+S+ M +
Sbjct: 999  QSVLVKFIKSVGNLSHVEWRSFYNVSRTAEARNFVDGDLIESFLSLSPSKMEEVSQVMGL 1058

Query: 1021 SVEELCKRVEELTRLH 1036
              +ELCK VEEL +LH
Sbjct: 1059 RADELCKIVEELRKLH 1074


>gi|110741229|dbj|BAF02165.1| UV-damaged DNA binding factor - like protein [Arabidopsis thaliana]
          Length = 727

 Score = 1358 bits (3515), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 657/727 (90%), Positives = 700/727 (96%)

Query: 310  QGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDPFDTFLVVSF 369
            QGQGQVVTCSGA+KDGSLR+VRNGIGINEQASVELQGIKGMWSL+SS D+ FDTFLVVSF
Sbjct: 1    QGQGQVVTCSGAFKDGSLRVVRNGIGINEQASVELQGIKGMWSLKSSIDEAFDTFLVVSF 60

Query: 370  ISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRN 429
            ISETRILAMNLEDELEETEIEGF SQ QTLFCHDA+YNQLVQVTS SVRLVSST+RELR+
Sbjct: 61   ISETRILAMNLEDELEETEIEGFLSQVQTLFCHDAVYNQLVQVTSNSVRLVSSTTRELRD 120

Query: 430  EWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDINPIG 489
            EW +P G++VNVATANASQVLLATGGGHLVYLEIGDG LTEV+HA LEYE+SCLDINPIG
Sbjct: 121  EWHAPAGFTVNVATANASQVLLATGGGHLVYLEIGDGKLTEVQHALLEYEVSCLDINPIG 180

Query: 490  ENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEGISYLLCALG 549
            +NP+YSQ+AAVGMWTDISVRIFSLP+L LITKE LGGEIIPRSVLLCAFEGISYLLCALG
Sbjct: 181  DNPNYSQLAAVGMWTDISVRIFSLPELTLITKEQLGGEIIPRSVLLCAFEGISYLLCALG 240

Query: 550  DGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLL 609
            DGHLLNF ++  TG+L DRKKVSLGTQPITLRTFSSK+ THVFAASDRPTVIYSSNKKLL
Sbjct: 241  DGHLLNFQMDTTTGQLKDRKKVSLGTQPITLRTFSSKSATHVFAASDRPTVIYSSNKKLL 300

Query: 610  YSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQ 669
            YSNVNLKEVSHMCPFNSAAFPDSLAIA+EGELTIGTIDDIQKLHIR+IPLGEH RRICHQ
Sbjct: 301  YSNVNLKEVSHMCPFNSAAFPDSLAIAREGELTIGTIDDIQKLHIRTIPLGEHARRICHQ 360

Query: 670  EQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSN 729
            EQ+RTF ICSL NQS +EESEMHFVRLLDDQTFEF+STYPLD+FEYGCSILSCSF++D N
Sbjct: 361  EQTRTFGICSLGNQSNSEESEMHFVRLLDDQTFEFMSTYPLDSFEYGCSILSCSFTEDKN 420

Query: 730  VYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQ 789
            VYYCVGTAYVLPEENEPTKGRILVFIVEDG+LQLIAEKETKGAVYSLNAFNGKLLAAINQ
Sbjct: 421  VYYCVGTAYVLPEENEPTKGRILVFIVEDGRLQLIAEKETKGAVYSLNAFNGKLLAAINQ 480

Query: 790  KIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAI 849
            KIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLL+YKHEEGAI
Sbjct: 481  KIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLLYKHEEGAI 540

Query: 850  EERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGE 909
            EERARDYNANWMSAVEILDDDIYLGAENNFNL TV+KNSEGATDEERGRLEVVGEYHLGE
Sbjct: 541  EERARDYNANWMSAVEILDDDIYLGAENNFNLLTVKKNSEGATDEERGRLEVVGEYHLGE 600

Query: 910  FVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIK 969
            FVNRFRHGSLVMRLPDS++GQIPTVIFGTVNGVIGVIASLP EQY FLEKLQ++LRKVIK
Sbjct: 601  FVNRFRHGSLVMRLPDSEIGQIPTVIFGTVNGVIGVIASLPQEQYTFLEKLQSSLRKVIK 660

Query: 970  GVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRV 1029
            GVGGL+HEQWRSFNNEK+T +A+NFLDGDLIESFLDLSR +M++ISK+MNV VEELCKRV
Sbjct: 661  GVGGLSHEQWRSFNNEKRTAEARNFLDGDLIESFLDLSRNKMEDISKSMNVQVEELCKRV 720

Query: 1030 EELTRLH 1036
            EELTRLH
Sbjct: 721  EELTRLH 727


>gi|384250802|gb|EIE24281.1| hypothetical protein COCSUDRAFT_28729 [Coccomyxa subellipsoidea
            C-169]
          Length = 1101

 Score = 1320 bits (3417), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 625/1040 (60%), Positives = 800/1040 (76%), Gaps = 9/1040 (0%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRP 62
            DV IYGR+AT+ELFRP GE++D LF++TERYKFCVL++D+E+ EL+TRA GD+ D++GRP
Sbjct: 60   DVAIYGRVATMELFRPVGESKDLLFLSTERYKFCVLEYDSETGELVTRANGDIEDQVGRP 119

Query: 63   TDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGCAK 122
             DNGQIGI+DP CR+IGLHLYDGLFKVIP D+KGQL EAFN+R++EL V+D+ FL GCAK
Sbjct: 120  CDNGQIGIVDPGCRMIGLHLYDGLFKVIPIDDKGQLHEAFNMRIDELNVIDMIFLEGCAK 179

Query: 123  PTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGE 182
            PTI VLYQDNKDARH+KTYEV LK+KD  EGPW Q+NLD GA  +I VP PL G L++GE
Sbjct: 180  PTIAVLYQDNKDARHIKTYEVVLKEKDLTEGPWRQSNLDAGASRVIAVPEPLGGALVVGE 239

Query: 183  ETIVYCS-ANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLK 241
              I Y     A K  PI+ +I +A+GRVD DGSRYLLGD+ G L+LLV+ H+ E V GLK
Sbjct: 240  SVIAYMGQGQAMKCTPIKATIIRAHGRVDEDGSRYLLGDYVGNLYLLVLQHDGEHVAGLK 299

Query: 242  IELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLN---LQPDAKGSYVEVLERYVNLGP 298
            +E LG TS  ST++YLDN VV++GSS GDSQL++L+   + P    ++VEVLE   NLGP
Sbjct: 300  VEPLGRTSAPSTLTYLDNGVVFVGSSGGDSQLVRLHPTPVTPQEPSNFVEVLETMTNLGP 359

Query: 299  IVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTD 358
            I+DF VVDLERQGQGQVV CSG   DGSLRIVRNGIG+ EQA+VEL GIKGMW+LR+S  
Sbjct: 360  IIDFVVVDLERQGQGQVVMCSGIMADGSLRIVRNGIGMIEQATVELPGIKGMWALRASHM 419

Query: 359  DPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVR 418
            D FDTFLV+SF+ ETRILA+N +DEL+E E+ GF +  QTL C + + + LVQV    VR
Sbjct: 420  DAFDTFLVISFVGETRILAINADDELDEAELPGFSADAQTLCCGNTVSDHLVQVAGADVR 479

Query: 419  LVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEY 478
            LV +++R+L ++W+ P G ++NVA+ + +Q   +T  G+LVYLE+G+  + +  H +L+ 
Sbjct: 480  LVDASTRQLTHQWRPPAGLNINVASVSPTQASPSTAHGNLVYLELGESGIEQKGHVKLDA 539

Query: 479  EISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAF 538
            E++C+DI+P+ E+   + + AVG W D+ V IFSLP +  +  E LGGEIIPRSVL  AF
Sbjct: 540  EVACVDISPLSEDGEAASLLAVGTW-DMRVHIFSLPAMAPLVSEPLGGEIIPRSVLFAAF 598

Query: 539  EGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRP 598
            EG+ YLLCA+GDG L NF ++  TG L DRKK+ LGT+PI LR+F S   +HVFAASDRP
Sbjct: 599  EGVPYLLCAMGDGQLYNFHVDEATGALADRKKICLGTKPIMLRSFRSNGQSHVFAASDRP 658

Query: 599  TVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIP 658
            TVIYS+NKKLLYSNVN  EV+ M  FNS++FPDSLA+AKEG +TIG+ID IQKLHIR++P
Sbjct: 659  TVIYSANKKLLYSNVNENEVNFMTSFNSSSFPDSLALAKEGAMTIGSIDQIQKLHIRTVP 718

Query: 659  LGEHPRRICHQEQSRTFAICSLKNQSCA--EESEMHFVRLLDDQTFEFISTYPLDTFEYG 716
            LGE PRR+ HQE SR+F + +  N      +++    VRLLDDQTFE +  + L+T E  
Sbjct: 719  LGEQPRRLAHQEASRSFLVLTSPNNGATGMDDAGPDSVRLLDDQTFETLDRFGLETNEVC 778

Query: 717  CSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSL 776
            C+  S SFSDD   YY VGTA  + EE EPTKGRILVF  + GKL L+ EKE KGA Y+L
Sbjct: 779  CAAASMSFSDDPCPYYVVGTAITVAEEPEPTKGRILVFGAKGGKLSLVCEKEVKGAAYNL 838

Query: 777  NAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKS 836
            + F GKL+A IN ++QL+KW   +DG+REL +EC H GH+LALY+ TRGDF++VGDLM+S
Sbjct: 839  HPFQGKLIAGINSRVQLFKWTQSEDGSRELTNECSHVGHVLALYIVTRGDFVIVGDLMRS 898

Query: 837  ISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEER 896
            + LLIY+ +EG +E RARDY  +WM+AVE+LDDD YLGAEN+ N+FT+RKN++ A DE+R
Sbjct: 899  LQLLIYRADEGILEVRARDYKTHWMTAVEVLDDDTYLGAENSNNIFTLRKNTDAAADEDR 958

Query: 897  GRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLF 956
             RLE VG+YHLG FVNRFRHGSLVM+LPDS+  +IPTV+F T+NG IGVIASLP +Q+ F
Sbjct: 959  NRLETVGQYHLGVFVNRFRHGSLVMKLPDSEAAKIPTVLFVTINGSIGVIASLPQQQFQF 1018

Query: 957  LEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISK 1016
            L +LQ  LRKVIKGVGGL+H  WR+F +E   + ++NF+DGDLIE FLDL R  M+ +++
Sbjct: 1019 LSRLQDCLRKVIKGVGGLSHVAWRTFQDEHTKMPSQNFVDGDLIEQFLDLKRDSMERVAR 1078

Query: 1017 TM--NVSVEELCKRVEELTR 1034
             M   V+ E+L + VEEL+R
Sbjct: 1079 EMGEGVTSEDLLRMVEELSR 1098


>gi|428164905|gb|EKX33915.1| hypothetical protein GUITHDRAFT_158867 [Guillardia theta CCMP2712]
          Length = 1092

 Score = 1223 bits (3165), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 602/1046 (57%), Positives = 775/1046 (74%), Gaps = 14/1046 (1%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            +LD  IYGRIA +EL+   G  ++ L+I TER KFC++++D+ + ELIT+AMGDV D +G
Sbjct: 51   VLDTGIYGRIAAIELYTVAGAERESLYILTERLKFCIVEYDSSTGELITKAMGDVQDSVG 110

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNK-GQLKEAFNIRLEELQVLDIKFLYG 119
            RP D G I  IDP+ R+IG  LYDGLFKVIP D + GQL+EAFNIRLEELQVLD++FLYG
Sbjct: 111  RPVDGGPIAHIDPERRMIGFLLYDGLFKVIPIDTRNGQLREAFNIRLEELQVLDVQFLYG 170

Query: 120  CAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLI 179
             A+PTIV+LYQD K+ RH+KTY+V+++DKDF+ GPWSQ  ++ GA ++IPVP P+ G ++
Sbjct: 171  YAQPTIVLLYQDPKEMRHLKTYQVSIRDKDFIAGPWSQTGVEIGATMIIPVPTPIGGCIL 230

Query: 180  IGEETIVYCSANA--FKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKV 237
            +GE+TI Y + +    K I +  ++ +A+G++D DG RYLLGDH G L++LV+  +  KV
Sbjct: 231  LGEQTISYLNGDKGDTKTIHMDMTVIRAWGKIDEDGRRYLLGDHLGQLYVLVLEFDGNKV 290

Query: 238  TGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLG 297
             GLK++ LGETS A TI+YLD+ VV+IGS +GDSQLI+L+   D   S +EVLE + NLG
Sbjct: 291  LGLKLDTLGETSSAKTITYLDSGVVFIGSCFGDSQLIRLHPDKDENDSNIEVLESFTNLG 350

Query: 298  PIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSST 357
            PI DFCVVDLERQGQGQVVTCSG  KDGSLR+VRNGIGINEQA+VEL GIKG+WSLR S 
Sbjct: 351  PIQDFCVVDLERQGQGQVVTCSGTLKDGSLRVVRNGIGINEQAAVELPGIKGLWSLRESI 410

Query: 358  DDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSV 417
            D  +D +L+ SF++ETR+L +  ++EL ETEI+GF    QT+FC + + + L+Q+T  S+
Sbjct: 411  DAQYDKYLIQSFVNETRVLEIA-DEELSETEIDGFDHNAQTIFCSNVLGDCLLQITEVSL 469

Query: 418  RLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLE 477
            RLVS+ S++L  EW  P G  + VA  N  QV+L +G   L+YL++ +G +TEVK  +++
Sbjct: 470  RLVSTKSKQLLKEWFPPNGERITVAGGNVQQVVLTSGKRTLIYLDVSNGDVTEVKRIEMD 529

Query: 478  YEISCLDINPIGENPSY--SQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLL 535
             EI+CL++NP+GE   +  S   AVG W ++S+ +  LP + ++  E +GG+ IPRS+LL
Sbjct: 530  QEIACLNLNPLGEKSDHNKSDFVAVGHW-NLSLSMLRLPSMEVVCTESIGGDAIPRSLLL 588

Query: 536  CAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAAS 595
               EG+ YLLCALGDG+L  F ++  T ++ +RKK+SLGT P+ L  F S+  THVFAAS
Sbjct: 589  VTLEGVDYLLCALGDGYLFTFAIDASTAQIGERKKISLGTHPMILSKFMSRGATHVFAAS 648

Query: 596  DRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIR 655
            DRPTVIYS+N+KLL+SNVNLKEV+ M PFNS  FPDSLAIA E  L IG IDDIQKLHIR
Sbjct: 649  DRPTVIYSNNRKLLFSNVNLKEVTQMAPFNSEGFPDSLAIATETSLRIGVIDDIQKLHIR 708

Query: 656  SIPLGEHPRRICHQEQSRTFAIC--SLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTF 713
            ++ L E PRRICHQE S+TF +   S++     EE E  F++L DDQTFE + TY L  F
Sbjct: 709  TVYLREQPRRICHQESSKTFCVATLSIRINRDGEEVEEQFIKLFDDQTFEILDTYQLQEF 768

Query: 714  EYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAV 773
            E  CS+   SFSDD  +YY VGTA  +P+E+EP +GR+LVF V D KL L A KE KGA 
Sbjct: 769  ENTCSVECASFSDDPTLYYIVGTATAVPQESEPKEGRLLVFEVIDRKLHLKASKEIKGAP 828

Query: 774  YSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDL 833
            Y +  FNGKLLA IN KI+L++    D G  EL SEC H GHIL LY+QTRGDFIV GDL
Sbjct: 829  YQIKPFNGKLLAGINSKIELFRLSDSDTGHMELVSECCHRGHILVLYLQTRGDFIVAGDL 888

Query: 834  MKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATD 893
            M+SISLL YK  +G IEE ARD+NANWM+AV+ILDDD +LGAE  FNLFTVRKN++  +D
Sbjct: 889  MRSISLLTYKQVDGQIEEIARDFNANWMTAVDILDDDTFLGAEGYFNLFTVRKNTDATSD 948

Query: 894  EERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQ 953
            EER RLEVVGEYHLG+ VNRF+ GSLV+R   SD     T+IFGTVNG+IGVIA L  E+
Sbjct: 949  EERARLEVVGEYHLGDMVNRFQRGSLVLR--SSDTPTTDTIIFGTVNGMIGVIAVLSKEE 1006

Query: 954  YLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKK--TVDAKNFLDGDLIESFLDLSRTRM 1011
            Y FL K+Q  L  VIKGVGGL HE WRSF NE+       K F+DGDLIESFLDL R +M
Sbjct: 1007 YEFLLKVQDALNFVIKGVGGLRHEDWRSFENERTQGARAPKGFIDGDLIESFLDLRREKM 1066

Query: 1012 DEISKTM-NVSVEELCKRVEELTRLH 1036
            +E+   + +++VEEL +++EEL RLH
Sbjct: 1067 EEVCHAIGSITVEELSRKIEELQRLH 1092


>gi|255080490|ref|XP_002503825.1| predicted protein [Micromonas sp. RCC299]
 gi|226519092|gb|ACO65083.1| predicted protein [Micromonas sp. RCC299]
          Length = 1114

 Score = 1219 bits (3153), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 614/1067 (57%), Positives = 774/1067 (72%), Gaps = 38/1067 (3%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            M DVP+YGRIA ++L+RP GE +  L++ATER  FCVL +D  S  + TRAMGD+S+ IG
Sbjct: 52   MHDVPVYGRIAVMKLYRPAGEKRQLLYVATERLMFCVLAYDQTSGAIATRAMGDLSNTIG 111

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RP ++G IG +DP+CRLIG   YDGLFKV+P D  GQL+EAF++RLEEL V+D+KF++GC
Sbjct: 112  RPCEHGLIGEVDPECRLIGSQAYDGLFKVVPMDRAGQLREAFDVRLEELNVVDVKFMHGC 171

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
            A PTI VLY+D K+ARHVKTYEV +K+K   +GPWSQ++++ G+ L+IPVP PL G +++
Sbjct: 172  ATPTICVLYEDTKEARHVKTYEVDVKEKTLRDGPWSQSDVEGGSSLIIPVPAPLGGAIVV 231

Query: 181  GEETIVYCSANAFKAIPIRPSITK----AYGRVDADGSRYLLGDHAGLLHLLVITHEKEK 236
            GE  IVY + +         +       A+G VDADGSRYLL D  G+LHLLV+ H++ +
Sbjct: 232  GESVIVYLNKDGGNGAGGAIATKSVNVMAHGVVDADGSRYLLSDSTGMLHLLVLVHDRRR 291

Query: 237  VTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQP----------DAKG-S 285
            V  LK+E LG+TSIAST+SYLDN VVY+GS+YGDSQL++L+ QP             G +
Sbjct: 292  VHALKLESLGQTSIASTLSYLDNGVVYVGSAYGDSQLVRLHAQPVRCAADQVPATPDGLT 351

Query: 286  YVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQ 345
            YVE LE + NLGPIVDF VVDL+R GQGQVVTCSG  KDGSLR+VRNG+GI+E+A++EL 
Sbjct: 352  YVECLESFTNLGPIVDFAVVDLDRHGQGQVVTCSGVNKDGSLRVVRNGVGIHERAAIELP 411

Query: 346  GIKGMWSLR--SSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHD 403
            G+KG WSLR   ++  P DT LVVSF  ETRILA++ +DEL E E  GF +  QTL   +
Sbjct: 412  GVKGCWSLRRGDASTHPSDTHLVVSFAGETRILAIDDDDELAECEFRGFSANEQTLCVCN 471

Query: 404  AIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI 463
                 +VQ  +  VRLV++   E R  W  P G +V+VA+AN +Q L+AT GG L  L +
Sbjct: 472  VDGGFVVQCVASGVRLVNAADGEPRATWSPPGGATVSVASANRTQALVATTGGSLYSLAL 531

Query: 464  GDGILT-EVKHAQLE-YEISCLDINPIGE-NPSYSQIAAVGMWTDISVRIFSLPDLNLIT 520
            G   L  E   A L+  EI+CLD  P+ +   + +++ AVG WT   V + ++PDL L+T
Sbjct: 532  GSAALIRETASASLDGKEIACLDCTPLSDPGDAAARLCAVGTWT-AEVFLLTMPDLRLVT 590

Query: 521  KEHLGGE---IIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQP 577
               LGG    +IPR+VLLC+FEG  +LL  LGDG L  F ++ + G L D K +SLGTQP
Sbjct: 591  TSPLGGGGGGVIPRAVLLCSFEGTPHLLAGLGDGALHTFGVDPEAGTLRDGKSLSLGTQP 650

Query: 578  ITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAK 637
            ITLRTF SK  THVFA SDRPTVIY +N KL+YSNVNL+EV H CPFN  AFPDSLA+A 
Sbjct: 651  ITLRTFRSKGATHVFAGSDRPTVIYGNNGKLIYSNVNLREVLHACPFNCDAFPDSLALAS 710

Query: 638  EGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLL 697
            E +LTIG IDDIQKLHIR++PLGE PRRI HQ ++RT+A  +        +   +FVRL 
Sbjct: 711  ESDLTIGGIDDIQKLHIRTVPLGEQPRRIAHQPETRTYAALTENF-----DENGYFVRLF 765

Query: 698  DDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVE 757
            DD TFE +  + L+  E   S++SC+F+DD  VYY VGT Y LPEE EPT+GRILVF  E
Sbjct: 766  DDVTFETLCKFRLEPDEQDSSVISCAFADDPRVYYVVGTGYSLPEEPEPTRGRILVFRAE 825

Query: 758  DGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKW--MLRDDG----TRELQSECG 811
            DGKLQL+AEKE KGAVY+LNAFNGKLLA IN K++L++    +  DG    T EL  EC 
Sbjct: 826  DGKLQLVAEKEVKGAVYNLNAFNGKLLAGINSKVELFRGGDPVGADGAGGSTYELAKECS 885

Query: 812  HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI 871
            HHGHI+ALYV  RG+FIVVGDLMKS+SLL YK EE  IEERARDYNANWM+AV+ILDDD 
Sbjct: 886  HHGHIVALYVAVRGEFIVVGDLMKSVSLLAYKPEESVIEERARDYNANWMTAVDILDDDT 945

Query: 872  YLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQI 931
            YLGAENNFNLFT+R+ S+ ATDEER RLEVVGEYH+GEFVNRFR GSLVMRLPD +   +
Sbjct: 946  YLGAENNFNLFTLRRQSDAATDEERSRLEVVGEYHVGEFVNRFRRGSLVMRLPDQENADV 1005

Query: 932  PTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNE---KKT 988
            PT++FGTV+GVIGV+A+LP EQ+ FL  LQ  L K + GVGGL+H+ WRSF NE   +  
Sbjct: 1006 PTLLFGTVSGVIGVLATLPREQFEFLSALQAALNKTVSGVGGLSHDAWRSFQNEHRHRAK 1065

Query: 989  VDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRL 1035
              A+ F+DGDLIESFLDL   +  E++  + +SV+EL +RVE+L RL
Sbjct: 1066 DGARGFVDGDLIESFLDLRPEKAREVAAAVKLSVDELTRRVEDLQRL 1112


>gi|303271531|ref|XP_003055127.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226463101|gb|EEH60379.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 1223

 Score = 1215 bits (3144), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 634/1160 (54%), Positives = 784/1160 (67%), Gaps = 128/1160 (11%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            M+DVPIYGRIAT+EL RP  E +  LFI TER  FCVL +DA   ELITRAMGD++DR+G
Sbjct: 65   MMDVPIYGRIATMELVRPPFEKKPMLFILTERNMFCVLSYDAAKGELITRAMGDLTDRVG 124

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RP++ G IG +DP+CR+ GLHLYDGLFKVIP D  GQL+EAF++RLEELQV D+KFL G 
Sbjct: 125  RPSECGPIGAVDPECRMYGLHLYDGLFKVIPMDQTGQLREAFSVRLEELQVFDVKFLAGT 184

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
             KPTI VLYQD K+ RH+KTYEV LKDKDF  GPW+QN++++G+  LI VP PL GV+++
Sbjct: 185  PKPTIAVLYQDTKEGRHIKTYEVCLKDKDFNPGPWAQNDVESGSRFLIAVPAPLGGVVVV 244

Query: 181  GEETIVYCSANA-------------------FKAIPIRPSIT-KAYGRVDADGSRYLLGD 220
            GE+ I Y +                       KAI ++   T   YG VD DGSRYLL D
Sbjct: 245  GEKVIAYLNKETTHGVGDGGGGGGGGGGGMIVKAIAMQSDATIMTYGAVDKDGSRYLLSD 304

Query: 221  HAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQP 280
             AG LHLLV+ H+K +V  LK+E LG+TSIAS++SYLDN VVY+GS+YGDSQL++L+  P
Sbjct: 305  SAGRLHLLVLMHDKTRVRALKLESLGQTSIASSLSYLDNGVVYVGSAYGDSQLVRLHSTP 364

Query: 281  ------------------------------DAKGSYVEVLERYVNLGPIVDFCVVDLERQ 310
                                          DA  +YVEVLE + +LGPIVDF VVDL+R 
Sbjct: 365  IPIAGGGGGDGDGGGGGGGEIVPVDSGAVTDAP-NYVEVLESFTSLGPIVDFVVVDLDRH 423

Query: 311  GQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDPFDTFLVVSFI 370
            GQGQVVTCSG +KDGSLR+VRNG+GI+EQA++EL G+KG W+L+++ D   DTFLVV+FI
Sbjct: 424  GQGQVVTCSGVHKDGSLRVVRNGVGIHEQAAIELPGVKGCWALKNADDAVSDTFLVVAFI 483

Query: 371  SETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNE 430
             ETRILA+N +DEL+ETE EGF    + L C +       QVTSG +RLV   +  LR  
Sbjct: 484  GETRILAINDKDELDETEFEGFAGDERALACANVDGGYACQVTSGGIRLVDVATGALRAR 543

Query: 431  WKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDG-----------ILTEVKHAQLEYE 479
            W   PG  V+VA AN +QV++A  GG LV +  G G           +L E+    + +E
Sbjct: 544  WTPEPGERVSVAAANRTQVVVALEGGTLVSVAAGGGGDAMDVDDASPLLRELARVNVGHE 603

Query: 480  ISCLDINPIGE-NPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHL-GGEIIPRSVLLCA 537
            I+CLD+ P+ +   + S+I AVG+W+   VR+ +   L  ++   L   E+IPR+VLLC+
Sbjct: 604  IACLDVTPLADPRAASSEICAVGLWS-AEVRVLATATLEELSSAPLTDAEVIPRAVLLCS 662

Query: 538  FEGISYLLCALGDGHLLNF-LLNMKTGELTDRKKVSLGTQ--------------PITLRT 582
            FEGI YLL  LGDG L  F L+  ++G + D KK+S+GTQ              PITL+T
Sbjct: 663  FEGIPYLLAGLGDGQLFTFALMGERSGIIGDGKKLSVGTQARSIHWFPYDRVGVPITLKT 722

Query: 583  FSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELT 642
            F +KNTTHVFA SDRPTVIYS NKKL+YSNVNL+EV HMC FN  AFPDSLA+A E ELT
Sbjct: 723  FRNKNTTHVFAGSDRPTVIYSQNKKLIYSNVNLREVLHMCAFNCDAFPDSLALASESELT 782

Query: 643  IGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTF 702
            IG IDDIQKLHIR++PLGE PRRI HQ  SRTFA+ +        E E  +VRL DD TF
Sbjct: 783  IGGIDDIQKLHIRTVPLGEQPRRIAHQPASRTFAVLTSHVSDVTNE-ESFYVRLFDDVTF 841

Query: 703  EFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQ 762
            E +  Y LD  E   SI+SCSF+DD   YY VGTA+ LPEE EP++GRILV   ++G+L 
Sbjct: 842  ETLFKYRLDVGETDSSIISCSFADDPASYYVVGTAFSLPEEVEPSRGRILVLRADEGRLS 901

Query: 763  LIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDG-------------------- 802
            L+AEKE KGAVY+LNAFNGKLLA IN K+QL+KW+ R  G                    
Sbjct: 902  LVAEKEVKGAVYNLNAFNGKLLAGINSKVQLFKWVSRGAGAGAGAGGGAEGGAVAMADGG 961

Query: 803  --------------TRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGA 848
                          T EL SEC HHGHI+ALYV  RGDFIVVGDLMKSISLL+YK +EG 
Sbjct: 962  GGGGGGGGAPAAATTCELASECSHHGHIVALYVDVRGDFIVVGDLMKSISLLVYKPDEGV 1021

Query: 849  IEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLG 908
            IEERARD+N NWM+AV  LDD+ YLGAEN+FNLFTVRKNS+ A DEER RL+V+GEYHLG
Sbjct: 1022 IEERARDFNPNWMTAVCALDDETYLGAENSFNLFTVRKNSDAAADEERSRLDVIGEYHLG 1081

Query: 909  EFVNRFRHGSLVMRLPDSDV-----------GQIPTVIFGTVNGVIGVIASLPHEQYLFL 957
            EFVNRFR GSLVMRLP                + PT +FGTVNG IGV+ASLP   + FL
Sbjct: 1082 EFVNRFRAGSLVMRLPGDGDGAGLGLGLDASNEAPTQLFGTVNGAIGVVASLPESTHTFL 1141

Query: 958  EKLQTNLRKVIKGVGGLNHEQWRSFNNEKKT--VDAKNFLDGDLIESFLDLSRTRMDEIS 1015
              LQ  + KV+ GVGG +H+ WRSF+NE ++  V+A+ F+DGDLIESFLDL   +  E++
Sbjct: 1142 AALQKAMNKVVSGVGGFSHDAWRSFHNEHRSRLVEARGFVDGDLIESFLDLRPEKASEVA 1201

Query: 1016 KTMNVSVEELCKRVEELTRL 1035
              + V VEEL KR+EEL R+
Sbjct: 1202 SVVGVGVEELTKRIEELVRI 1221


>gi|307111604|gb|EFN59838.1| hypothetical protein CHLNCDRAFT_29381 [Chlorella variabilis]
          Length = 1108

 Score = 1204 bits (3114), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 595/1056 (56%), Positives = 758/1056 (71%), Gaps = 28/1056 (2%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            +LDVP+YGR+A L+LFRP  E +D L + TER KFCVL +D ES EL+TRA GD SDR+G
Sbjct: 61   VLDVPLYGRVAALQLFRPASEPRDLLLLLTERNKFCVLGFDEESGELVTRANGDASDRVG 120

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            R  + GQ GI+DP CRLIGLHLYDGLFKVIP D +G L+EAFN+RL+EL+V+DI FL GC
Sbjct: 121  RQVELGQRGIVDPQCRLIGLHLYDGLFKVIPMDERGGLQEAFNMRLDELKVVDIAFLDGC 180

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
            A PTI VLY+D K+ RHVKTYEV+L++K+ VEGPW Q+NLD GA +LIPVP    G +++
Sbjct: 181  AAPTIAVLYEDTKEQRHVKTYEVSLREKELVEGPWQQSNLDAGASMLIPVPAS-GGAMVV 239

Query: 181  GEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTG- 239
            GE  + + +    ++  I+P++  AYG+VD DGSR+LL D  G L+LL++  E+      
Sbjct: 240  GESVVTFIAPGVVRSAAIKPTL--AYGQVDDDGSRFLLSDFLGNLYLLLLLREEGAAGAS 297

Query: 240  ------------LKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQP---DAKG 284
                        LK+E LG T  ASTI+YLD+ VV++GSS+GDSQLI+L+  P       
Sbjct: 298  SSSGAGAGAVAGLKLEPLGRTPAASTIAYLDSGVVFVGSSFGDSQLIRLHSSPPDPSQPT 357

Query: 285  SYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVEL 344
            S+VEVL+   NLGPIVDF VVDLERQGQGQVVTCSG   DGSLRIVRNGIG+ EQA VEL
Sbjct: 358  SFVEVLDSQPNLGPIVDFAVVDLERQGQGQVVTCSGTGVDGSLRIVRNGIGVVEQALVEL 417

Query: 345  QGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDA 404
             GIK +WSLR +  D  DT+LV++F  ETR+L MN EDEL+E EI GF S   TL C + 
Sbjct: 418  PGIKDLWSLRKTFMDAHDTYLVLTFSGETRVLGMNAEDELDEAEIPGFNSTALTLCCANT 477

Query: 405  IYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG 464
            +++QL+QVT+ + RLV   +++L  +W+      + +A A+ +Q++ A GG  LVYLE+G
Sbjct: 478  VHDQLLQVTATAFRLVDCMTQQLVTQWEPGADGRITIAAASPTQLVAAVGGRTLVYLELG 537

Query: 465  DGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHL 524
            DG + E    QL+ +I+CLDI P+GE    S++ AVG W  +   IF+LP L  + +E L
Sbjct: 538  DGQIEEKGRVQLDADIACLDITPVGELSEASEVVAVGSWA-LEAHIFALPSLAPLFREKL 596

Query: 525  GGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFS 584
              ++IPRS L   FEG +YL+  LGDG L+N+ L+      TDRKK++LGT+PI+LRTF 
Sbjct: 597  PTDVIPRSTLFAQFEGDTYLMYGLGDGQLVNYRLDADGP--TDRKKIALGTKPISLRTFR 654

Query: 585  SKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIG 644
            S+    VFAASDRPTVIYSSN+KLLYSN+N  EVSHM  FN+AAFPDSLA+ K+  L IG
Sbjct: 655  SRGAAAVFAASDRPTVIYSSNRKLLYSNLNENEVSHMTSFNTAAFPDSLALGKQEALLIG 714

Query: 645  TIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEF 704
            TID+IQKLH+R++PLGE PRRI HQE SRTFA+   +     E  +   VRL+D+QTFE 
Sbjct: 715  TIDEIQKLHVRTVPLGEQPRRIAHQETSRTFAVTCTQATISGEGGDS--VRLVDEQTFEL 772

Query: 705  ISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLI 764
            +    L   E  CS+ S    DD   YY VGTA+  P E EPTKGRI V     GKL ++
Sbjct: 773  LDRLQLQQHELACSLCSTQLGDDPATYYVVGTAFAPPNEPEPTKGRIFVLAAAGGKLCVV 832

Query: 765  AEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTR 824
             EKET+GAVYSL  F G+LLA IN ++Q+YKW+ + +G R L  EC H GH+LALY+ TR
Sbjct: 833  CEKETRGAVYSLAEFQGRLLAGINSRVQMYKWLEQGEGGRALVPECSHAGHVLALYLATR 892

Query: 825  GDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTV 884
            GD +VVGDLMKSI LL +  EEGA+E RARD++ NWMSAV +LDDD Y+GAEN++NLFTV
Sbjct: 893  GDLVVVGDLMKSIQLLAWGEEEGALELRARDFHPNWMSAVTVLDDDTYMGAENSYNLFTV 952

Query: 885  RKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIG 944
            R+N++ ATDEER RLE VG YHLGEFVNRF+ GSLVMRLPDS++ QIPTV+FGT+NGVIG
Sbjct: 953  RRNADAATDEERSRLETVGRYHLGEFVNRFQPGSLVMRLPDSELSQIPTVLFGTINGVIG 1012

Query: 945  VIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEK-KTVDAKNFLDGDLIESF 1003
            V+ASLPH QY  LE LQ  +RKV+KGVGG +H QWR+F+N+      A+ F+DGDLIE F
Sbjct: 1013 VVASLPHAQYQLLESLQEAMRKVVKGVGGFDHAQWRAFSNQHMPATPARQFVDGDLIEQF 1072

Query: 1004 LDLSRTRMDEISKTM---NVSVEELCKRVEELTRLH 1036
            LDL R   + +   M     +VE + + VEEL+RLH
Sbjct: 1073 LDLKRDSAEAVIAAMAGGGATVESVTQLVEELSRLH 1108


>gi|348681092|gb|EGZ20908.1| hypothetical protein PHYSODRAFT_259403 [Phytophthora sojae]
          Length = 1137

 Score = 1165 bits (3014), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 586/1085 (54%), Positives = 767/1085 (70%), Gaps = 52/1085 (4%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRP 62
            DV +YGRIA  E+FR   E QD+LFI T+R++FCVL +DA + + +T+A G + D IGR 
Sbjct: 54   DVNLYGRIAIFEVFRALNEPQDWLFIVTQRFQFCVLAYDAAAQQFVTKAHGSIRDSIGRS 113

Query: 63   TDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDN-KGQLKEAFNIRLEELQVLDIKFLYGCA 121
            ++    G IDP+ RLIG++LY+G FKVIP D+ KG LK+ FNIRL+EL+V+DIKFL+G  
Sbjct: 114  SEIVTSGNIDPEGRLIGMNLYEGYFKVIPIDSGKGILKDTFNIRLDELRVIDIKFLHGYT 173

Query: 122  KPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIG 181
            KPTI VLY+D K ARH+KTY + LK+KDF EGPWSQ+N+++GA LLIPVP P+ GVLI+ 
Sbjct: 174  KPTICVLYEDYKAARHIKTYHILLKEKDFAEGPWSQSNVESGASLLIPVPAPVGGVLIVS 233

Query: 182  EETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLK 241
             +TIVY + + F AIP++ ++ + YG VD DGSR+LL D  G L ++ + H  ++VTG+ 
Sbjct: 234  NQTIVYHNGSTFHAIPMQSTVIQVYGAVDKDGSRFLLADQYGTLSVVALQHTGKEVTGVH 293

Query: 242  IELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVD 301
            +E+LGET+IAS +SYLDN VV+IGS++GDSQLIKLN   D  GSY+EVL+ YVN+GPI+D
Sbjct: 294  LEVLGETNIASCLSYLDNGVVFIGSTFGDSQLIKLNADRDENGSYIEVLDTYVNVGPIID 353

Query: 302  FCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDPF 361
            FCV+DL+RQGQGQ+VTCSGA KDG+LR++RNGIGINEQAS EL GIKGMW+LR +     
Sbjct: 354  FCVMDLDRQGQGQIVTCSGADKDGTLRVIRNGIGINEQASAELPGIKGMWALRETFAAEH 413

Query: 362  DTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVS 421
            D +L+ S++SE RILA+  EDE+EE EI  F +  +TL C +   +  +QVT   VRL+S
Sbjct: 414  DKYLLQSYVSEIRILAIGDEDEMEEKEIPAF-TNVKTLLCRNMYGDVWLQVTESEVRLIS 472

Query: 422  STSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEIS 481
             TS  L + W  P G  + VA AN +QV +AT GG LVYLE+ +G +TE    ++E+EI+
Sbjct: 473  CTSLSLSSTWSPPLGSRITVAAANPTQVAVATSGGVLVYLEVENGQVTEKTKVKMEHEIA 532

Query: 482  CLDINPIGENPSY-------------------SQIAAVGMWTDISVRIFSLPDLNLITKE 522
            C+DI P+  + +                    S I  VG+WT+ SV +  LP L  +T E
Sbjct: 533  CVDITPLARSQATDGDVAMTGSSTHWDMAALNSSICVVGLWTNFSVSVLKLPTLEKLTTE 592

Query: 523  HLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRT 582
             LG +++PRSVL   FEG  YLL  LGDG L+N+ LN+  G L  RK+VSLG+QP++L T
Sbjct: 593  SLGTDLLPRSVLCNTFEGKDYLLVGLGDGSLMNYELNVAQGTLGTRKRVSLGSQPLSLST 652

Query: 583  FSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELT 642
            F SKN THVFAA DRPTVIYSSN KLLYSN+N KEV+ MCPF+S +FP+ LA++ E ELT
Sbjct: 653  FRSKNMTHVFAACDRPTVIYSSNNKLLYSNINSKEVNVMCPFDSESFPECLALSSEEELT 712

Query: 643  IGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAI--CSLKNQSCAEESEMHFVRLLDDQ 700
            IGT+DDIQKLHI++  L E  RRI H  +S T  +   S       EE +  FVRL DDQ
Sbjct: 713  IGTVDDIQKLHIQTFHLNEWARRIAHDPESHTLGVLTVSFTVDDTGEEVDQGFVRLFDDQ 772

Query: 701  TFEFISTYPLDTFEYGCSILSCSFSDDSN--VYYCVGTAYVLPEENEPTKGRILVFIVE- 757
            TFE + +Y LD FE   S++ C FS DS+   Y+ VGTAY+  +E EP +GRILVF V  
Sbjct: 773  TFEVLHSYRLDPFETPSSVVVCPFSGDSSSGSYFVVGTAYIHEDEAEPHQGRILVFAVTG 832

Query: 758  ---DGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHG 814
               + KLQL+ EKE KGAVY LNAFNGK+LA +N K QLYKW    D  +EL SECGH+G
Sbjct: 833  IHGERKLQLVTEKEVKGAVYCLNAFNGKVLAGVNSKAQLYKWSENTDNEKELVSECGHYG 892

Query: 815  HILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG 874
            H L LY+++RGDFIVVGDLMKS+SLL YK  +G IEE A+D N+NWMSA+ I+DDD Y+G
Sbjct: 893  HTLVLYMESRGDFIVVGDLMKSVSLLSYKQLDGTIEEIAKDLNSNWMSALGIVDDDTYIG 952

Query: 875  AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSL-------------VM 921
            +E +FNLFTV++NS  A+DEERGRLE VGE+HLGEFVNRFR+GSL             V 
Sbjct: 953  SETDFNLFTVQRNSGAASDEERGRLETVGEFHLGEFVNRFRYGSLTPAAAGPTDMVDVVE 1012

Query: 922  RLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRS 981
            + P     Q  +++FGTV+G+IGVI  L  +QY FL ++Q  L +V+KGVGG +H+ WR 
Sbjct: 1013 QAPIVPAAQNQSMLFGTVSGMIGVILPLTKDQYSFLLRVQQALTQVVKGVGGFSHKDWRM 1072

Query: 982  FNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN----------VSVEELCKRVEE 1031
            F N +   +A+NF+DGDL+ESFLDL + +M ++   +N           +VE+L  R+EE
Sbjct: 1073 FENRRSVSEARNFIDGDLVESFLDLPKAQMTKVVDKLNSDGMLDGTDQFTVEDLTLRIEE 1132

Query: 1032 LTRLH 1036
            L +LH
Sbjct: 1133 LAQLH 1137


>gi|145351726|ref|XP_001420218.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144580451|gb|ABO98511.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 1120

 Score = 1161 bits (3004), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 592/1059 (55%), Positives = 772/1059 (72%), Gaps = 30/1059 (2%)

Query: 1    MLDVPIYGRIATLELFRP-HGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRI 59
            +LDVPI GRIAT+ L +   G+ +  L++ TERY F VL +D  + EL T A GDV D I
Sbjct: 66   VLDVPINGRIATMSLCQTGSGDGKARLYLTTERYGFTVLSYDEANEELKTEAFGDVQDNI 125

Query: 60   GRPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYG 119
            GRP D+GQIGI+D  CR IGL LYDGLFKVIP D KG +KEAFNIRLEEL+V DIKFL+G
Sbjct: 126  GRPADDGQIGIVDDTCRAIGLRLYDGLFKVIPCDEKGGVKEAFNIRLEELRVEDIKFLHG 185

Query: 120  CAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLI 179
              KPTI VLY+D KDA H+KTYE+ +++K+FV  PW+QN+L+ G++ +IPVP P+ GV++
Sbjct: 186  TPKPTIAVLYRDTKDAVHIKTYEIGIREKEFVSSPWAQNDLEGGSNKIIPVPAPIGGVVV 245

Query: 180  IGEETIVYCS-----ANAF-KA-----IPIRPSITKAYGRVDADGSRYLLGDHAGLLHLL 228
            +G+E IVY +     A+ F KA     IP R +IT  YG +D DGSRYLLGD  G+L+LL
Sbjct: 246  LGQEIIVYLNKFEDDADVFLKAINIPNIPDRTNIT-CYGAIDPDGSRYLLGDADGMLYLL 304

Query: 229  VITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKL-----NLQPDAK 283
            VI H+ ++V  LKIE LG+TSIAST+SYLDN VV++GS+YGDSQLIKL     ++  D  
Sbjct: 305  VILHDGKRVRELKIERLGDTSIASTLSYLDNGVVFVGSTYGDSQLIKLHAEKTSIDKDGN 364

Query: 284  GSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVE 343
             +YV++LE + NLGPIVDF  VDLER GQGQVVTCSGA KDGSLR+VRNGIGI+EQA ++
Sbjct: 365  PTYVQILEEFTNLGPIVDFAFVDLERHGQGQVVTCSGALKDGSLRVVRNGIGIDEQAVIQ 424

Query: 344  LQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAM--NLEDELEETEIEGFCSQTQTLFC 401
            L G+KG++SLR S D   D +LVV+FI+ETRIL    +  D L+ETEI GF ++ QTL C
Sbjct: 425  LPGVKGLFSLRDSDDSQMDKYLVVTFINETRILGFVGDEGDTLDETEIAGFDAEAQTLCC 484

Query: 402  HDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYL 461
             +   N  +QVT   VRLVS    +L +EWK   G  +  A  N +Q+L+A  GG L  L
Sbjct: 485  GNMQGNVFLQVTHRGVRLVSRGG-DLLDEWKPKDGAEILSAKCNPTQILVAAAGGQLHCL 543

Query: 462  EIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITK 521
             +  G +  +     E EI+CLD  P+G+  S S + AVG+W+ + + + S+ DL++ITK
Sbjct: 544  NVAKGKIVLLASKTFENEIACLDCTPMGDGMS-SPVCAVGLWS-MDIVLASMSDLSVITK 601

Query: 522  EHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLR 581
            E    +IIPRS LLC+FE I YL   LGDG L+ ++L+  TG L+ RKK+SLGT+PITL+
Sbjct: 602  ESTDEDIIPRSTLLCSFEDIPYLFVGLGDGQLITYVLDQNTGALSGRKKLSLGTKPITLQ 661

Query: 582  TFSS--KNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEG 639
            TF S   N + VFAASDRPTVI+S+NKKL+YSNVN++EV H+CPF+S AFPD+LA+A + 
Sbjct: 662  TFKSHATNVSSVFAASDRPTVIFSNNKKLIYSNVNVQEVLHVCPFSSEAFPDALALAGDE 721

Query: 640  ELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDD 699
            +LTIG IDDIQKLHIR+IPLG HPRRI HQ  + TFA+ ++++     + E+ F+RL+DD
Sbjct: 722  DLTIGGIDDIQKLHIRTIPLGGHPRRIAHQVDTNTFAV-AVEHLMSKGDQEL-FIRLIDD 779

Query: 700  QTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDG 759
             +F+ +  + L+  E   S++SCSF+ DS  YY VGT +   +E+EP++GRILV  VE  
Sbjct: 780  GSFDTLHQFRLEEHELASSLMSCSFAGDSREYYVVGTGFAYEQEDEPSRGRILVLRVEAD 839

Query: 760  KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILAL 819
             L+L++EKE +GAVY+LNAF GKLLA IN K++L+KW  R+D   EL SEC HHG I+  
Sbjct: 840  ALELVSEKEVRGAVYNLNAFKGKLLAGINSKLELFKWTPREDDAHELVSECSHHGQIITF 899

Query: 820  YVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDD-IYLGAENN 878
             V+TRGD+I+VGDL+KS+SLL YK EEGAI+E ARD+NANWM+AV +LDDD  YLGAEN+
Sbjct: 900  SVKTRGDWILVGDLLKSMSLLQYKPEEGAIDEIARDFNANWMTAVAMLDDDETYLGAENS 959

Query: 879  FNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGT 938
             NLFTV +N    TDEER RLE+ GEYHLGEFVN F  GSLVM L D D  ++PT++FGT
Sbjct: 960  LNLFTVARNMNAMTDEERSRLEITGEYHLGEFVNVFSPGSLVMSLKDGDSLEVPTLLFGT 1019

Query: 939  VNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNN--EKKTVDAKNFLD 996
             NGVIGV+ASLP + Y F E+LQT++ K I+GVGGL H +WRSF +   +K+  ++NF+D
Sbjct: 1020 GNGVIGVLASLPKDAYDFAERLQTSMNKHIQGVGGLKHAEWRSFRHTLRRKSDPSRNFVD 1079

Query: 997  GDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRL 1035
            GDL+ESFLDL   + D ++  M     E+ +RVEEL RL
Sbjct: 1080 GDLVESFLDLKVEQADVVAADMKCDRAEIIRRVEELQRL 1118


>gi|413081953|ref|NP_741992.2| DNA damage-binding protein 1 [Rattus norvegicus]
 gi|293344614|ref|XP_002725831.1| PREDICTED: DNA damage-binding protein 1 [Rattus norvegicus]
 gi|293356422|ref|XP_002728912.1| PREDICTED: DNA damage-binding protein 1 [Rattus norvegicus]
 gi|149062405|gb|EDM12828.1| damage-specific DNA binding protein 1 [Rattus norvegicus]
          Length = 1140

 Score = 1152 bits (2980), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 580/1094 (53%), Positives = 759/1094 (69%), Gaps = 67/1094 (6%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
            +V +YG+IA +ELFRP GE++D LFI T +Y  C+L++    ES ++ITRA G+V DRIG
Sbjct: 54   EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 113

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RP++ G IGIIDP+CR+IGL LYDGLFKVIP D   +  +AFNIRLEEL V+D+KFLYGC
Sbjct: 114  RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 173

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
              PTI  +YQD +  RHVKTYEV+L++K+F +GPW Q N++  A ++I VP P  G +II
Sbjct: 174  QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232

Query: 181  GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
            G+E+I Y + + + AI    I+ S    + RVD +GSRYLLGD  G L +L++  E++  
Sbjct: 233  GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 292

Query: 236  ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
                +  L++ELLGETSIA  ++YLDN VV++GS  GDSQL+KLN+  + +GSYV  +E 
Sbjct: 293  GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 352

Query: 293  YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
            + NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W 
Sbjct: 353  FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412

Query: 353  LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
            LRS      D  LV+SF+ +TR+L +N E E+EETE+ GF    QT FC +  + QL+Q+
Sbjct: 413  LRSDPSRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQI 471

Query: 413  TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
            TS SVRLVS   + L +EWK P G +++VA+ N+SQV++A G   L YL+I    L ++ 
Sbjct: 472  TSASVRLVSQEPKALVSEWKEPQGKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 530

Query: 473  HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
            H ++E+E++CLDI P+GE+   S + A+G+WTDIS RI  LP   L+ KE LGGEIIPRS
Sbjct: 531  HTEMEHEVACLDITPLGESNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 590

Query: 533  VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
            +L+  FE   YLLCALGDG L  F LN++TG L+DRKKV+LGTQP  LRTF S +TT+VF
Sbjct: 591  ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650

Query: 593  AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
            A SDRPTVIYSSN KL++SNVNLKEV++MCP NS  +PDSLA+A    LTIGTID+IQKL
Sbjct: 651  ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 710

Query: 653  HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
            HIR++PL E PR+IC+QE S+ F + S +                               
Sbjct: 711  HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGTTALRPSASTQALSSSVSSSKL 770

Query: 682  --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
                      S  EE E+H + ++D  TFE +  +     EY  S++SC    D N Y+ 
Sbjct: 771  FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 830

Query: 734  VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
            VGTA V PEE EP +GRI+VF   DGKLQ +AEKE KGAVYS+  FNGKLLA+IN  ++L
Sbjct: 831  VGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 890

Query: 794  YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
            Y+W       +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK  EG  EE A
Sbjct: 891  YEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 946

Query: 854  RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
            RD+N NWMSAVEILDDD +LGAEN FNLF  +K+S   TDEER  L+ VG +HLGEFVN 
Sbjct: 947  RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 1006

Query: 914  FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
            F HGSLVM+ L ++      +V+FGTVNG+IG++ SL    Y  L  +Q  L KVIK VG
Sbjct: 1007 FCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066

Query: 973  GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN----------VSV 1022
             + H  WRSF+ E+KT  A  F+DGDLIESFLD+SR +M E+   +            + 
Sbjct: 1067 KIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATA 1126

Query: 1023 EELCKRVEELTRLH 1036
            ++L K VEELTR+H
Sbjct: 1127 DDLIKVVEELTRIH 1140


>gi|149725200|ref|XP_001502072.1| PREDICTED: DNA damage-binding protein 1 [Equus caballus]
          Length = 1140

 Score = 1151 bits (2978), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 579/1094 (52%), Positives = 760/1094 (69%), Gaps = 67/1094 (6%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
            +V +YG+IA +ELFRP GE++D LFI T +Y  C+L++    ES ++ITRA G+V DRIG
Sbjct: 54   EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 113

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RP++ G IGIIDP+CR+IGL LYDGLFKVIP D   +  +AFNIRLEEL V+D+KFLYGC
Sbjct: 114  RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 173

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
              PTI  +YQD +  RHVKTYEV+L++K+F +GPW Q N++  A ++I VP P  G +II
Sbjct: 174  QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232

Query: 181  GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
            G+E+I Y + + + AI    I+ S    + RVD +GSRYLLGD  G L +L++  E++  
Sbjct: 233  GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 292

Query: 236  ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
                +  L++ELLGETSIA  ++YLDN VV++GS  GDSQL+KLN+  + +GSYV  +E 
Sbjct: 293  GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 352

Query: 293  YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
            + NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W 
Sbjct: 353  FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412

Query: 353  LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
            LRS  +   D  LV+SF+ +TR+L +N E E+EETE+ GF    QT FC +  + QL+Q+
Sbjct: 413  LRSDPNRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQI 471

Query: 413  TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
            TS SVRLVS   + L +EWK P G +++VA+ N+SQV++A G   L YL+I    L ++ 
Sbjct: 472  TSASVRLVSQEPKALVSEWKEPQGKNISVASCNSSQVVVAVGRA-LYYLQIHAQELRQIS 530

Query: 473  HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
            H ++E+E++CLDI P+G++   S + A+G+WTDIS RI  LP   L+ KE LGGEIIPRS
Sbjct: 531  HTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 590

Query: 533  VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
            +L+  FE   YLLCALGDG L  F LN++TG L+DRKKV+LGTQP  LRTF S +TT+VF
Sbjct: 591  ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650

Query: 593  AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
            A SDRPTVIYSSN KL++SNVNLKEV++MCP NS  +PDSLA+A    LTIGTID+IQKL
Sbjct: 651  ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 710

Query: 653  HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
            HIR++PL E PR+IC+QE S+ F + S +                               
Sbjct: 711  HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDASGGTTALRPSASTQALSSSVSSSKL 770

Query: 682  --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
                      S  EE E+H + ++D  TFE +  +     EY  S++SC    D N Y+ 
Sbjct: 771  FSSSTATHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 830

Query: 734  VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
            VGTA V PEE EP +GRI+VF   DGKLQ +AEKE KGAVYS+  FNGKLLA+IN  ++L
Sbjct: 831  VGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 890

Query: 794  YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
            Y+W       +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK  EG  EE A
Sbjct: 891  YEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 946

Query: 854  RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
            RD+N NWMSAVEILDDD +LGAEN FNLF  +K+S   TDEER  L+ VG +HLGEFVN 
Sbjct: 947  RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 1006

Query: 914  FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
            F HGSLVM+ L ++      +V+FGTVNG+IG++ SL    Y  L  +Q  L KVIK VG
Sbjct: 1007 FCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066

Query: 973  GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN----------VSV 1022
             + H  WRSF+ E+KT  A  F+DGDLIESFLD+SR +M E+   +            + 
Sbjct: 1067 KIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATA 1126

Query: 1023 EELCKRVEELTRLH 1036
            ++L K VEELTR+H
Sbjct: 1127 DDLIKVVEELTRIH 1140


>gi|344295432|ref|XP_003419416.1| PREDICTED: DNA damage-binding protein 1 [Loxodonta africana]
          Length = 1140

 Score = 1151 bits (2978), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 580/1094 (53%), Positives = 759/1094 (69%), Gaps = 67/1094 (6%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
            +V +YG+IA +ELFRP GE++D LFI T +Y  C+L++    ES ++ITRA G+V DRIG
Sbjct: 54   EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 113

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RP++ G IGIIDP+CR+IGL LYDGLFKVIP D   +  +AFNIRLEEL V+D+KFLYGC
Sbjct: 114  RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 173

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
              PTI  +YQD +  RHVKTYEV+L++K+F +GPW Q N++  A ++I VP P  G +II
Sbjct: 174  QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232

Query: 181  GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
            G+E+I Y + + + AI    I+ S    + RVD +GSRYLLGD  G L +L++  E++  
Sbjct: 233  GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 292

Query: 236  ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
                +  L++ELLGETSIA  ++YLDN VV++GS  GDSQL+KLN+  + +GSYV  +E 
Sbjct: 293  GTVSLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 352

Query: 293  YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
            + NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W 
Sbjct: 353  FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412

Query: 353  LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
            LRS  +   D  LV+SF+ +TR+L +N E E+EETE+ GF    QT FC +    QL+Q+
Sbjct: 413  LRSDPNRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAQQQLIQI 471

Query: 413  TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
            TS SVRLVS   + L +EWK P G +V+VA+ N+SQV++A G   L YL+I    L ++ 
Sbjct: 472  TSASVRLVSQEPKALVSEWKEPQGKNVSVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 530

Query: 473  HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
            H ++E+E++CLDI P+G++   S + A+G+WTDIS RI  LP   L+ KE LGGEIIPRS
Sbjct: 531  HTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 590

Query: 533  VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
            +L+  FE   YLLCALGDG L  F LN++TG L+DRKKV+LGTQP  LRTF S +TT+VF
Sbjct: 591  ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650

Query: 593  AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
            A SDRPTVIYSSN KL++SNVNLKEV++MCP NS  +PDSLA+A    LTIGTID+IQKL
Sbjct: 651  ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 710

Query: 653  HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
            HIR++PL E PR+IC+QE S+ F + S +                               
Sbjct: 711  HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGTTALRPSASTQALSSSVSSSKL 770

Query: 682  --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
                      S  EE E+H + ++D  TFE +  +     EY  S++SC    D N Y+ 
Sbjct: 771  FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 830

Query: 734  VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
            VGTA V PEE EP +GRI+VF   DGKLQ +AEKE KGAVYS+  FNGKLLA+IN  ++L
Sbjct: 831  VGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 890

Query: 794  YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
            Y+W       +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK  EG  EE A
Sbjct: 891  YEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 946

Query: 854  RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
            RD+N NWMSAVEILDDD +LGAEN FNLF  +K+S   TDEER  L+ VG +HLGEFVN 
Sbjct: 947  RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 1006

Query: 914  FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
            F HGSLVM+ L ++      +V+FGTVNG+IG++ SL    Y  L  +Q  L KVIK VG
Sbjct: 1007 FCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066

Query: 973  GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN----------VSV 1022
             + H  WRSF+ E+KT  A  F+DGDLIESFLD+SR +M E+   +            + 
Sbjct: 1067 KIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATA 1126

Query: 1023 EELCKRVEELTRLH 1036
            ++L K VEELTR+H
Sbjct: 1127 DDLIKVVEELTRIH 1140


>gi|348526664|ref|XP_003450839.1| PREDICTED: DNA damage-binding protein 1-like [Oreochromis niloticus]
          Length = 1140

 Score = 1151 bits (2977), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 580/1094 (53%), Positives = 759/1094 (69%), Gaps = 67/1094 (6%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWD--AESSELITRAMGDVSDRIG 60
            +V +YGRIA +ELFRP GE++D LFI T +Y  C+L++    ES ++ITRA G+V DRIG
Sbjct: 54   EVGMYGRIAVMELFRPKGESKDLLFILTSKYNACILEYKQTGESIDIITRAHGNVQDRIG 113

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RP++ G IGI+DP+CR+IGL LYDGLFKVIP D   +  +AFNIRLEELQV+D+ FLYGC
Sbjct: 114  RPSETGIIGIVDPECRMIGLRLYDGLFKVIPLDRDNRELKAFNIRLEELQVIDVHFLYGC 173

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
              PT+  +YQD +  RHVKTYEV+L++K+F +GPW Q N++  A ++IPVP P  G +II
Sbjct: 174  QAPTVCFIYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIPVPEPFGGAIII 232

Query: 181  GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEK--- 234
            G+E+I Y + + + AI    I+ S    + RVD +GSRYLLGD  G L +L++  E+   
Sbjct: 233  GQESITYHNGDKYLAIAPPTIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEELMD 292

Query: 235  --EKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
                +  L +ELLGETSIA  ++YLDN VV++GS  GDSQL+KLN+  + +GSYV V+E 
Sbjct: 293  GTVALKDLHVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVAVMET 352

Query: 293  YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
            + NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W 
Sbjct: 353  FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412

Query: 353  LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
            LRS      D  LV+SF+ +TR+L ++ E E+EETE+ GF    QT +C +  + QL+Q+
Sbjct: 413  LRSEAGRETDDMLVLSFVGQTRVLMLSGE-EVEETELPGFVDNQQTFYCGNVAHQQLIQI 471

Query: 413  TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
            TSGSVRLV   S+ L +EWK P G +++VA  N +QV+LA G   L YL+I  G L ++ 
Sbjct: 472  TSGSVRLVLQDSKTLVSEWKEPQGRNISVAACNHTQVVLAVGRA-LYYLQILAGELKQIS 530

Query: 473  HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
              ++E+E++CLDI P+GE    S + AVG+WTDIS R+  LP    + KE LGGEIIPRS
Sbjct: 531  TTEMEHEVACLDITPLGEGGGESPLCAVGLWTDISARVLKLPCFTALHKEMLGGEIIPRS 590

Query: 533  VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
            +L+  FEG  YLLCALGDG L  F L+++TG L++RKKV+LGTQP  LRTF S +T++VF
Sbjct: 591  ILMTTFEGSYYLLCALGDGALFYFGLDLQTGALSERKKVTLGTQPTVLRTFRSLSTSNVF 650

Query: 593  AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
            A SDRPTVIYSSN KL++SNVNLKEV++MCP NS  +PDSLA+A    LTIGTID+IQKL
Sbjct: 651  ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSEGYPDSLALANNSTLTIGTIDEIQKL 710

Query: 653  HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
            HIR++PL E PRRIC+QE S+ F + S +                               
Sbjct: 711  HIRTVPLYESPRRICYQEVSQCFGVLSSRVEIQDVSGTTSAVRPSASTQALSSSVSSSKL 770

Query: 682  --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
                      S  EE E+H + ++D  TFE +  +     EY  S++SC    D +VY+ 
Sbjct: 771  FPSSTSPHETSFGEEVEVHNLLVVDQHTFEVLHAHQFLPSEYALSLVSCRLGKDPSVYFI 830

Query: 734  VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
            VGTA V PEE EP +GRI+VF   DGKLQ +AEKE KGAVYS+  FNGK LA+IN  ++L
Sbjct: 831  VGTAMVYPEEAEPKQGRIIVFHYTDGKLQTVAEKEVKGAVYSMVEFNGKFLASINSTVRL 890

Query: 794  YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
            Y+W       +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK  EG  EE A
Sbjct: 891  YEWT----AEKELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKSMEGNFEEIA 946

Query: 854  RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
            RD+N NWMSAVEILDDD +LGAEN FNLF  +K+S   TDEER  L+ VG +HLGEFVN 
Sbjct: 947  RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 1006

Query: 914  FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
            F HGSLV++ L +S      +V+FGTVNG+IG++ SL    Y  L  LQ  L KVIK VG
Sbjct: 1007 FCHGSLVLQNLGESSTPTQGSVLFGTVNGMIGLVTSLSEGWYSLLLDLQNRLNKVIKSVG 1066

Query: 973  GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV----------SV 1022
             + H  WRSF+ E+KT  A  F+DGDLIESFLDL R +M E+  T+ +          +V
Sbjct: 1067 KIEHSFWRSFHTERKTEQATGFIDGDLIESFLDLGRAKMQEVVSTLQIDDGSGMKREATV 1126

Query: 1023 EELCKRVEELTRLH 1036
            +E+ K VEELTR+H
Sbjct: 1127 DEVIKIVEELTRIH 1140


>gi|73983859|ref|XP_533275.2| PREDICTED: DNA damage-binding protein 1 [Canis lupus familiaris]
 gi|291409601|ref|XP_002721069.1| PREDICTED: damage-specific DNA binding protein 1 [Oryctolagus
            cuniculus]
 gi|301781686|ref|XP_002926259.1| PREDICTED: DNA damage-binding protein 1-like [Ailuropoda melanoleuca]
          Length = 1140

 Score = 1150 bits (2976), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 579/1094 (52%), Positives = 760/1094 (69%), Gaps = 67/1094 (6%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
            +V +YG+IA +ELFRP GE++D LFI T +Y  C+L++    ES ++ITRA G+V DRIG
Sbjct: 54   EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 113

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RP++ G IGIIDP+CR+IGL LYDGLFKVIP D   +  +AFNIRLEEL V+D+KFLYGC
Sbjct: 114  RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 173

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
              PTI  +YQD +  RHVKTYEV+L++K+F +GPW Q N++  A ++I VP P  G +II
Sbjct: 174  QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232

Query: 181  GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
            G+E+I Y + + + AI    I+ S    + RVD +GSRYLLGD  G L +L++  E++  
Sbjct: 233  GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 292

Query: 236  ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
                +  L++ELLGETSIA  ++YLDN VV++GS  GDSQL+KLN+  + +GSYV  +E 
Sbjct: 293  GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 352

Query: 293  YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
            + NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W 
Sbjct: 353  FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412

Query: 353  LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
            LRS  +   D  LV+SF+ +TR+L +N E E+EETE+ GF    QT FC +  + QL+Q+
Sbjct: 413  LRSDPNRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQI 471

Query: 413  TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
            TS SVRLVS   + L +EWK P G +++VA+ N+SQV++A G   L YL+I    L ++ 
Sbjct: 472  TSASVRLVSQEPKALVSEWKEPQGKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 530

Query: 473  HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
            H ++E+E++CLDI P+G++   S + A+G+WTDIS RI  LP   L+ KE LGGEIIPRS
Sbjct: 531  HTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 590

Query: 533  VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
            +L+  FE   YLLCALGDG L  F LN++TG L+DRKKV+LGTQP  LRTF S +TT+VF
Sbjct: 591  ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650

Query: 593  AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
            A SDRPTVIYSSN KL++SNVNLKEV++MCP NS  +PDSLA+A    LTIGTID+IQKL
Sbjct: 651  ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 710

Query: 653  HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
            HIR++PL E PR+IC+QE S+ F + S +                               
Sbjct: 711  HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGTTALRPSASTQALSSSVSSSKL 770

Query: 682  --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
                      S  EE E+H + ++D  TFE +  +     EY  S++SC    D N Y+ 
Sbjct: 771  FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 830

Query: 734  VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
            VGTA V PEE EP +GRI+VF   DGKLQ +AEKE KGAVYS+  FNGKLLA+IN  ++L
Sbjct: 831  VGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 890

Query: 794  YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
            Y+W       +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK  EG  EE A
Sbjct: 891  YEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 946

Query: 854  RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
            RD+N NWMSAVEILDDD +LGAEN FNLF  +K+S   TDEER  L+ VG +HLGEFVN 
Sbjct: 947  RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 1006

Query: 914  FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
            F HGSLVM+ L ++      +V+FGTVNG+IG++ SL    Y  L  +Q  L KVIK VG
Sbjct: 1007 FCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066

Query: 973  GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN----------VSV 1022
             + H  WRSF+ E+KT  A  F+DGDLIESFLD+SR +M E+   +            + 
Sbjct: 1067 KIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATA 1126

Query: 1023 EELCKRVEELTRLH 1036
            ++L K VEELTR+H
Sbjct: 1127 DDLIKVVEELTRIH 1140


>gi|385865228|gb|AFI92852.1| DNA damage-binding protein 1 [Danio rerio]
          Length = 1140

 Score = 1150 bits (2976), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 580/1096 (52%), Positives = 768/1096 (70%), Gaps = 71/1096 (6%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
            +V +YG+IA +ELFRP GE++D LFI T +Y  C+L++    +S ++ITRA G+V DRIG
Sbjct: 54   EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGDSIDIITRAHGNVQDRIG 113

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RP++ G IGI+DP+CR+IGL LYDGLFKVIP D + +  +AFNIRLEELQV+D++FLYGC
Sbjct: 114  RPSETGIIGIVDPECRMIGLRLYDGLFKVIPLDRENRELKAFNIRLEELQVIDVQFLYGC 173

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
              PT+  +YQD +  RHVKTYEV+L++K+F +GPW Q N++  A ++IPVP P  G +II
Sbjct: 174  QAPTVCFIYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIPVPEPFGGAIII 232

Query: 181  GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKV 237
            G+E+I Y + + + A+    I+ S    + RVD +GSRYLLGD  G L +L++  EKE++
Sbjct: 233  GQESITYHNGDKYLAVAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLL--EKEEL 290

Query: 238  TG-------LKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVL 290
                     L +ELLGETSIA  ++YLDN VV++GS  GDSQL+KLN+  + +GSYV V+
Sbjct: 291  MDGAVVLKDLHVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNDQGSYVGVM 350

Query: 291  ERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGM 350
            E + NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+
Sbjct: 351  ETFTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGL 410

Query: 351  WSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLV 410
            W LRS +    D  LV+SF+ +TR+L ++ E E+EETE++GF    QT FC +  + QL+
Sbjct: 411  WPLRSESSRDTDDMLVLSFVGQTRVLMLSGE-EVEETELQGFVDNQQTFFCGNVAHQQLI 469

Query: 411  QVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTE 470
            Q+TS SVRLV+  S+ L +EWK P G +++VA+ N +QV+LA G   L YL+I  G L +
Sbjct: 470  QITSVSVRLVTQDSKALVSEWKEPQGRNISVASCNNTQVVLAVGR-VLYYLQILSGELKQ 528

Query: 471  VKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIP 530
            +   ++E+E++CLDI P+GE  + S I AVG+WTDIS R+  LP    + KE LGGEIIP
Sbjct: 529  ISSTEMEHEVACLDITPLGERTADSCICAVGLWTDISARLLKLPCFTPLHKEMLGGEIIP 588

Query: 531  RSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTH 590
            RS+L+  FEG  YLLCALGDG L  F L+++TG L++RKKV+LGTQP  LRTF S +T++
Sbjct: 589  RSILMTTFEGSHYLLCALGDGALFYFGLDIQTGVLSERKKVTLGTQPTVLRTFRSLSTSN 648

Query: 591  VFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQ 650
            VFA SDRPTVIYSSN KL++SNVNLKEV++MCP NS  +PDSLA+A    LTIGTID+IQ
Sbjct: 649  VFACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSEGYPDSLALANNSTLTIGTIDEIQ 708

Query: 651  KLHIRSIPLGEHPRRICHQEQSRTFAICSLK----------------------------- 681
            KLHIR++PL E P+RIC+QE S+ F + S +                             
Sbjct: 709  KLHIRTVPLYESPKRICYQEVSQCFGVLSSRVEMQDASGTTAAVRPSASTQALSSSVSSS 768

Query: 682  ----------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVY 731
                        S  EE E+H + ++D  TFE +  +     EY  S++SC    D  VY
Sbjct: 769  KLFPSSTSPHETSFGEEVEVHSLLVVDQHTFEVLHAHQFLQNEYALSMVSCKLGRDPAVY 828

Query: 732  YCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKI 791
            + VGTA V PEE EP +GRI+VF   DGKLQ +AEKE KGAVYS+  FNGKLLA+IN  +
Sbjct: 829  FIVGTAMVYPEEAEPKQGRIIVFHYTDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTV 888

Query: 792  QLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEE 851
            +LY+W       +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK  EG+ EE
Sbjct: 889  RLYEWT----AEKELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGSFEE 944

Query: 852  RARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFV 911
             ARD+N NWMSAVEILDDD +LGAEN FNLF  +K+S   TDEER  L+ VG +HLGEFV
Sbjct: 945  IARDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFV 1004

Query: 912  NRFRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKG 970
            N F HGSLV++ L +S      +V+FGTVNG+IG++ SL    Y  L  LQ  L KVIK 
Sbjct: 1005 NVFSHGSLVLQNLGESSTPTQGSVLFGTVNGMIGLVTSLSEGWYSLLLDLQNRLNKVIKS 1064

Query: 971  VGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV---------- 1020
            VG + H  WRSF+ E+KT  A  F+DGDLIESFLDL R +M E+  T+ +          
Sbjct: 1065 VGKIEHSFWRSFHTERKTEQATGFIDGDLIESFLDLGRAKMQEVVSTLQIDDGSGMKREA 1124

Query: 1021 SVEELCKRVEELTRLH 1036
            +V+E+ K VEELTR+H
Sbjct: 1125 TVDEVIKIVEELTRIH 1140


>gi|7657011|ref|NP_056550.1| DNA damage-binding protein 1 [Mus musculus]
 gi|134034087|sp|Q3U1J4.2|DDB1_MOUSE RecName: Full=DNA damage-binding protein 1; AltName: Full=DDB p127
            subunit; AltName: Full=Damage-specific DNA-binding
            protein 1; AltName: Full=UV-damaged DNA-binding factor
 gi|5931596|dbj|BAA84699.1| XPE UV-damaged DNA binding factor [Mus musculus]
 gi|16307148|gb|AAH09661.1| Damage specific DNA binding protein 1 [Mus musculus]
 gi|74182145|dbj|BAE34102.1| unnamed protein product [Mus musculus]
 gi|74196166|dbj|BAE32993.1| unnamed protein product [Mus musculus]
          Length = 1140

 Score = 1150 bits (2975), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 579/1094 (52%), Positives = 759/1094 (69%), Gaps = 67/1094 (6%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
            +V +YG+IA +ELFRP GE++D LFI T +Y  C+L++    ES ++ITRA G+V DRIG
Sbjct: 54   EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 113

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RP++ G IGIIDP+CR+IGL LYDGLFKVIP D   +  +AFNIRLEEL V+D+KFLYGC
Sbjct: 114  RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 173

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
              PTI  +YQD +  RHVKTYEV+L++K+F +GPW Q N++  A ++I VP P  G +II
Sbjct: 174  QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232

Query: 181  GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
            G+E+I Y + + + AI    I+ S    + RVD +GSRYLLGD  G L +L++  E++  
Sbjct: 233  GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 292

Query: 236  ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
                +  L++ELLGETSIA  ++YLDN VV++GS  GDSQL+KLN+  + +GSYV  +E 
Sbjct: 293  GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 352

Query: 293  YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
            + NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W 
Sbjct: 353  FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412

Query: 353  LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
            LRS      D  LV+SF+ +TR+L +N E E+EETE+ GF    QT FC +  + QL+Q+
Sbjct: 413  LRSDPGRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQI 471

Query: 413  TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
            TS SVRLVS   + L +EWK P G +++VA+ N+SQV++A G   L YL+I    L ++ 
Sbjct: 472  TSASVRLVSQEPKALVSEWKEPQGKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 530

Query: 473  HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
            H ++E+E++CLDI P+G++   S + A+G+WTDIS RI  LP   L+ KE LGGEIIPRS
Sbjct: 531  HTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 590

Query: 533  VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
            +L+  FE   YLLCALGDG L  F LN++TG L+DRKKV+LGTQP  LRTF S +TT+VF
Sbjct: 591  ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650

Query: 593  AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
            A SDRPTVIYSSN KL++SNVNLKEV++MCP NS  +PDSLA+A    LTIGTID+IQKL
Sbjct: 651  ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 710

Query: 653  HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
            HIR++PL E PR+IC+QE S+ F + S +                               
Sbjct: 711  HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDSSGGTTALRPSASTQALSSSVSSSKL 770

Query: 682  --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
                      S  EE E+H + ++D  TFE +  +     EY  S++SC    D N Y+ 
Sbjct: 771  FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 830

Query: 734  VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
            VGTA V PEE EP +GRI+VF   DGKLQ +AEKE KGAVYS+  FNGKLLA+IN  ++L
Sbjct: 831  VGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 890

Query: 794  YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
            Y+W       +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK  EG  EE A
Sbjct: 891  YEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 946

Query: 854  RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
            RD+N NWMSAVEILDDD +LGAEN FNLF  +K+S   TDEER  L+ VG +HLGEFVN 
Sbjct: 947  RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 1006

Query: 914  FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
            F HGSLVM+ L ++      +V+FGTVNG+IG++ SL    Y  L  +Q  L KVIK VG
Sbjct: 1007 FCHGSLVMQNLGEASTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066

Query: 973  GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN----------VSV 1022
             + H  WRSF+ E+KT  A  F+DGDLIESFLD+SR +M E+   +            + 
Sbjct: 1067 KIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATA 1126

Query: 1023 EELCKRVEELTRLH 1036
            ++L K VEELTR+H
Sbjct: 1127 DDLIKVVEELTRIH 1140


>gi|354504619|ref|XP_003514371.1| PREDICTED: DNA damage-binding protein 1-like [Cricetulus griseus]
 gi|344258340|gb|EGW14444.1| DNA damage-binding protein 1 [Cricetulus griseus]
          Length = 1140

 Score = 1150 bits (2975), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 579/1094 (52%), Positives = 759/1094 (69%), Gaps = 67/1094 (6%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
            +V +YG+IA +ELFRP GE++D LFI T +Y  C+L++    ES ++ITRA G+V DRIG
Sbjct: 54   EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 113

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RP++ G IGIIDP+CR+IGL LYDGLFKVIP D   +  +AFNIRLEEL V+D+KFLYGC
Sbjct: 114  RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 173

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
              PTI  +YQD +  RHVKTYEV+L++K+F +GPW Q N++  A ++I VP P  G +II
Sbjct: 174  QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232

Query: 181  GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
            G+E+I Y + + + AI    I+ S    + RVD +GSRYLLGD  G L +L++  E++  
Sbjct: 233  GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 292

Query: 236  ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
                +  L++ELLGETSIA  ++YLDN VV++GS  GDSQL+KLN+  + +GSYV  +E 
Sbjct: 293  GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 352

Query: 293  YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
            + NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W 
Sbjct: 353  FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412

Query: 353  LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
            LRS      D  LV+SF+ +TR+L +N E E+EETE+ GF    QT FC +  + QL+Q+
Sbjct: 413  LRSDPSRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQI 471

Query: 413  TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
            TS SVRLVS   + L +EWK P G +++VA+ N+SQV++A G   L YL+I    L ++ 
Sbjct: 472  TSASVRLVSQEPKALVSEWKEPQGKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 530

Query: 473  HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
            H ++E+E++CLDI P+G++   S + A+G+WTDIS RI  LP   L+ KE LGGEIIPRS
Sbjct: 531  HTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 590

Query: 533  VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
            +L+  FE   YLLCALGDG L  F LN++TG L+DRKKV+LGTQP  LRTF S +TT+VF
Sbjct: 591  ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650

Query: 593  AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
            A SDRPTVIYSSN KL++SNVNLKEV++MCP NS  +PDSLA+A    LTIGTID+IQKL
Sbjct: 651  ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 710

Query: 653  HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
            HIR++PL E PR+IC+QE S+ F + S +                               
Sbjct: 711  HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGTTALRPSASTQALSSSVSSSKL 770

Query: 682  --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
                      S  EE E+H + ++D  TFE +  +     EY  S++SC    D N Y+ 
Sbjct: 771  FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 830

Query: 734  VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
            VGTA V PEE EP +GRI+VF   DGKLQ +AEKE KGAVYS+  FNGKLLA+IN  ++L
Sbjct: 831  VGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 890

Query: 794  YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
            Y+W       +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK  EG  EE A
Sbjct: 891  YEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 946

Query: 854  RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
            RD+N NWMSAVEILDDD +LGAEN FNLF  +K+S   TDEER  L+ VG +HLGEFVN 
Sbjct: 947  RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 1006

Query: 914  FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
            F HGSLVM+ L ++      +V+FGTVNG+IG++ SL    Y  L  +Q  L KVIK VG
Sbjct: 1007 FCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066

Query: 973  GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN----------VSV 1022
             + H  WRSF+ E+KT  A  F+DGDLIESFLD+SR +M E+   +            + 
Sbjct: 1067 KIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATA 1126

Query: 1023 EELCKRVEELTRLH 1036
            ++L K VEELTR+H
Sbjct: 1127 DDLIKVVEELTRIH 1140


>gi|410974071|ref|XP_003993471.1| PREDICTED: DNA damage-binding protein 1 [Felis catus]
          Length = 1193

 Score = 1150 bits (2975), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 579/1094 (52%), Positives = 760/1094 (69%), Gaps = 67/1094 (6%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
            +V +YG+IA +ELFRP GE++D LFI T +Y  C+L++    ES ++ITRA G+V DRIG
Sbjct: 107  EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 166

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RP++ G IGIIDP+CR+IGL LYDGLFKVIP D   +  +AFNIRLEEL V+D+KFLYGC
Sbjct: 167  RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 226

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
              PTI  +YQD +  RHVKTYEV+L++K+F +GPW Q N++  A ++I VP P  G +II
Sbjct: 227  QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 285

Query: 181  GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
            G+E+I Y + + + AI    I+ S    + RVD +GSRYLLGD  G L +L++  E++  
Sbjct: 286  GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 345

Query: 236  ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
                +  L++ELLGETSIA  ++YLDN VV++GS  GDSQL+KLN+  + +GSYV  +E 
Sbjct: 346  GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 405

Query: 293  YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
            + NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W 
Sbjct: 406  FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 465

Query: 353  LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
            LRS  +   D  LV+SF+ +TR+L +N E E+EETE+ GF    QT FC +  + QL+Q+
Sbjct: 466  LRSDPNRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQI 524

Query: 413  TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
            TS SVRLVS   + L +EWK P G +++VA+ N+SQV++A G   L YL+I    L ++ 
Sbjct: 525  TSASVRLVSQEPKALVSEWKEPQGKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 583

Query: 473  HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
            H ++E+E++CLDI P+G++   S + A+G+WTDIS RI  LP   L+ KE LGGEIIPRS
Sbjct: 584  HTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 643

Query: 533  VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
            +L+  FE   YLLCALGDG L  F LN++TG L+DRKKV+LGTQP  LRTF S +TT+VF
Sbjct: 644  ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 703

Query: 593  AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
            A SDRPTVIYSSN KL++SNVNLKEV++MCP NS  +PDSLA+A    LTIGTID+IQKL
Sbjct: 704  ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 763

Query: 653  HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
            HIR++PL E PR+IC+QE S+ F + S +                               
Sbjct: 764  HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGTTALRPSASTQALSSSVSSSKL 823

Query: 682  --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
                      S  EE E+H + ++D  TFE +  +     EY  S++SC    D N Y+ 
Sbjct: 824  FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 883

Query: 734  VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
            VGTA V PEE EP +GRI+VF   DGKLQ +AEKE KGAVYS+  FNGKLLA+IN  ++L
Sbjct: 884  VGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 943

Query: 794  YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
            Y+W       +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK  EG  EE A
Sbjct: 944  YEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 999

Query: 854  RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
            RD+N NWMSAVEILDDD +LGAEN FNLF  +K+S   TDEER  L+ VG +HLGEFVN 
Sbjct: 1000 RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 1059

Query: 914  FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
            F HGSLVM+ L ++      +V+FGTVNG+IG++ SL    Y  L  +Q  L KVIK VG
Sbjct: 1060 FCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1119

Query: 973  GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN----------VSV 1022
             + H  WRSF+ E+KT  A  F+DGDLIESFLD+SR +M E+   +            + 
Sbjct: 1120 KIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATA 1179

Query: 1023 EELCKRVEELTRLH 1036
            ++L K VEELTR+H
Sbjct: 1180 DDLIKVVEELTRIH 1193


>gi|418316|sp|P33194.1|DDB1_CERAE RecName: Full=DNA damage-binding protein 1; AltName: Full=DDB p127
            subunit; AltName: Full=DDBa; AltName:
            Full=Damage-specific DNA-binding protein 1; AltName:
            Full=UV-damaged DNA-binding protein 1; Short=UV-DDB 1
 gi|304026|gb|AAA03021.1| UV-damaged DNA-binding protein [Chlorocebus aethiops]
          Length = 1140

 Score = 1150 bits (2974), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 578/1094 (52%), Positives = 759/1094 (69%), Gaps = 67/1094 (6%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
            +V +YG+IA +ELFRP GE++D LFI T +Y  C+L++    ES ++ITRA G+V DRIG
Sbjct: 54   EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 113

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RP++ G IGIIDP+CR+IGL LYDGLFKVIP D   +  +AFNIRLEEL V+D+KFLYGC
Sbjct: 114  RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 173

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
              PTI  +YQD +  RHVKTYEV+L++K+F +GPW Q N++  A ++I VP P  G +II
Sbjct: 174  QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232

Query: 181  GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
            G+E+I Y + + + AI    I+ S    + RVD +GSRYLLGD  G L +L++  E++  
Sbjct: 233  GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 292

Query: 236  ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
                +  L++ELLGETSIA  ++YLDN VV++GS  GDSQL+KLN+  + +GSYV  +E 
Sbjct: 293  GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 352

Query: 293  YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
            + NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W 
Sbjct: 353  FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412

Query: 353  LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
            LRS  +   D  LV+SF+ +TR+L +N E E+EETE+ GF    QT FC +  + QL+Q+
Sbjct: 413  LRSDPNRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQI 471

Query: 413  TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
            TS SVRLVS   + L +EWK P   +++VA+ N+SQV++A G   L YL+I    L ++ 
Sbjct: 472  TSASVRLVSQEPKALVSEWKEPQAKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 530

Query: 473  HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
            H ++E+E++CLDI P+G++   S + A+G+WTDIS RI  LP   L+ KE LGGEIIPRS
Sbjct: 531  HTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 590

Query: 533  VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
            +L+  FE   YLLCALGDG L  F LN++TG L+DRKKV+LGTQP  LRTF S +TT+VF
Sbjct: 591  ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650

Query: 593  AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
            A SDRPTVIYSSN KL++SNVNLKEV++MCP NS  +PDSLA+A    LTIGTID+IQKL
Sbjct: 651  ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 710

Query: 653  HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
            HIR++PL E PR+IC+QE S+ F + S +                               
Sbjct: 711  HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGTTALRPSASTQALSSSVSSSKL 770

Query: 682  --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
                      S  EE E+H + ++D  TFE +  +     EY  S++SC    D N Y+ 
Sbjct: 771  FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 830

Query: 734  VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
            VGTA V PEE EP +GRI+VF   DGKLQ +AEKE KGAVYS+  FNGKLLA+IN  ++L
Sbjct: 831  VGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 890

Query: 794  YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
            Y+W       +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK  EG  EE A
Sbjct: 891  YEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 946

Query: 854  RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
            RD+N NWMSAVEILDDD +LGAEN FNLF  +K+S   TDEER  L+ VG +HLGEFVN 
Sbjct: 947  RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 1006

Query: 914  FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
            F HGSLVM+ L ++      +V+FGTVNG+IG++ SL    Y  L  +Q  L KVIK VG
Sbjct: 1007 FCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066

Query: 973  GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN----------VSV 1022
             + H  WRSF+ E+KT  A  F+DGDLIESFLD+SR +M E+   +            + 
Sbjct: 1067 KIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATA 1126

Query: 1023 EELCKRVEELTRLH 1036
            ++L K VEELTR+H
Sbjct: 1127 DDLIKVVEELTRIH 1140


>gi|403255013|ref|XP_003920244.1| PREDICTED: DNA damage-binding protein 1 [Saimiri boliviensis
            boliviensis]
          Length = 1140

 Score = 1150 bits (2974), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 578/1094 (52%), Positives = 759/1094 (69%), Gaps = 67/1094 (6%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
            +V +YG+IA +ELFRP GE++D LFI T +Y  C+L++    ES ++ITRA G+V DRIG
Sbjct: 54   EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 113

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RP++ G IGIIDP+CR+IGL LYDGLFKVIP D   +  +AFNIRLEEL V+D+KFLYGC
Sbjct: 114  RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 173

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
              PTI  +YQD +  RHVKTYEV+L++K+F +GPW Q N++  A ++I VP P  G +II
Sbjct: 174  QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232

Query: 181  GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
            G+E+I Y + + + AI    I+ S    + RVD +GSRYLLGD  G L +L++  E++  
Sbjct: 233  GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 292

Query: 236  ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
                +  L++ELLGETSIA  ++YLDN VV++GS  GDSQL+KLN+  + +GSYV  +E 
Sbjct: 293  GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 352

Query: 293  YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
            + NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W 
Sbjct: 353  FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412

Query: 353  LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
            LRS  +   D  LV+SF+ +TR+L +N E E+EETE+ GF    QT FC +  + QL+Q+
Sbjct: 413  LRSDPNRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQI 471

Query: 413  TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
            TS SVRLVS   + L +EWK P   +++VA+ N+SQV++A G   L YL+I    L ++ 
Sbjct: 472  TSASVRLVSQEPKALVSEWKEPQAKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 530

Query: 473  HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
            H ++E+E++CLDI P+G++   S + A+G+WTDIS RI  LP   L+ KE LGGEIIPRS
Sbjct: 531  HTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 590

Query: 533  VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
            +L+  FE   YLLCALGDG L  F LN++TG L+DRKKV+LGTQP  LRTF S +TT+VF
Sbjct: 591  ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650

Query: 593  AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
            A SDRPTVIYSSN KL++SNVNLKEV++MCP NS  +PDSLA+A    LTIGTID+IQKL
Sbjct: 651  ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 710

Query: 653  HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
            HIR++PL E PR+IC+QE S+ F + S +                               
Sbjct: 711  HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGTTALRPSASTQALSSSVSSSKL 770

Query: 682  --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
                      S  EE E+H + ++D  TFE +  +     EY  S++SC    D N Y+ 
Sbjct: 771  FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 830

Query: 734  VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
            VGTA V PEE EP +GRI+VF   DGKLQ +AEKE KGAVYS+  FNGKLLA+IN  ++L
Sbjct: 831  VGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 890

Query: 794  YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
            Y+W       +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK  EG  EE A
Sbjct: 891  YEWT----AEKELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 946

Query: 854  RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
            RD+N NWMSAVEILDDD +LGAEN FNLF  +K+S   TDEER  L+ VG +HLGEFVN 
Sbjct: 947  RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 1006

Query: 914  FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
            F HGSLVM+ L ++      +V+FGTVNG+IG++ SL    Y  L  +Q  L KVIK VG
Sbjct: 1007 FCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066

Query: 973  GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN----------VSV 1022
             + H  WRSF+ E+KT  A  F+DGDLIESFLD+SR +M E+   +            + 
Sbjct: 1067 KIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATA 1126

Query: 1023 EELCKRVEELTRLH 1036
            ++L K VEELTR+H
Sbjct: 1127 DDLIKVVEELTRIH 1140


>gi|122692537|ref|NP_001073731.1| DNA damage-binding protein 1 [Bos taurus]
 gi|426251842|ref|XP_004019630.1| PREDICTED: DNA damage-binding protein 1 [Ovis aries]
 gi|134034086|sp|A1A4K3.1|DDB1_BOVIN RecName: Full=DNA damage-binding protein 1; AltName:
            Full=Damage-specific DNA-binding protein 1
 gi|119223918|gb|AAI26630.1| Damage-specific DNA binding protein 1, 127kDa [Bos taurus]
 gi|296471644|tpg|DAA13759.1| TPA: DNA damage-binding protein 1 [Bos taurus]
          Length = 1140

 Score = 1150 bits (2974), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 579/1094 (52%), Positives = 760/1094 (69%), Gaps = 67/1094 (6%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
            +V +YG+IA +ELFRP GE++D LFI T +Y  C+L++    ES ++ITRA G+V DRIG
Sbjct: 54   EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 113

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RP++ G IGIIDP+CR+IGL LYDGLFKVIP D   +  +AFNIRLEEL V+D+KFLYGC
Sbjct: 114  RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 173

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
              PTI  +YQD +  RHVKTYEV+L++K+F +GPW Q N++  A ++I VP P  G +II
Sbjct: 174  QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232

Query: 181  GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
            G+E+I Y + + + AI    I+ S    + RVD +GSRYLLGD  G L +L++  E++  
Sbjct: 233  GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 292

Query: 236  ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
                +  L++ELLGETSIA  ++YLDN VV++GS  GDSQL+KLN+  + +GSYV  +E 
Sbjct: 293  GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 352

Query: 293  YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
            + NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W 
Sbjct: 353  FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412

Query: 353  LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
            LRS  +   D  LV+SF+ +TR+L +N E E+EETE+ GF    QT FC +  + QL+Q+
Sbjct: 413  LRSDPNRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQI 471

Query: 413  TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
            TS SVRLVS   + L +EWK P G +++VA+ N+SQV++A G   L YL+I    L ++ 
Sbjct: 472  TSASVRLVSQEPKALVSEWKEPQGKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 530

Query: 473  HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
            H ++E+E++CLDI P+G++   S + A+G+WTDIS RI  LP   L+ KE LGGEIIPRS
Sbjct: 531  HTEMEHEVACLDITPLGDSNGMSPLCAIGLWTDISARIAKLPSFELLHKEMLGGEIIPRS 590

Query: 533  VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
            +L+  FE   YLLCALGDG L  F LN++TG L+DRKKV+LGTQP  LRTF S +TT+VF
Sbjct: 591  ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650

Query: 593  AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
            A SDRPTVIYSSN KL++SNVNLKEV++MCP NS  +PDSLA+A    LTIGTID+IQKL
Sbjct: 651  ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 710

Query: 653  HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
            HIR++PL E PR+IC+QE S+ F + S +                               
Sbjct: 711  HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGTTALRPSASTQALSSSVSSSKL 770

Query: 682  --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
                      S  EE E+H + ++D  TFE +  +     EY  S++SC    D N Y+ 
Sbjct: 771  FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 830

Query: 734  VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
            VGTA V PEE EP +GRI+VF   DGKLQ +AEKE KGAVYS+  FNGKLLA+IN  ++L
Sbjct: 831  VGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 890

Query: 794  YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
            Y+W       +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK  EG  EE A
Sbjct: 891  YEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 946

Query: 854  RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
            RD+N NWMSAVEILDDD +LGAEN FNLF  +K+S   TDEER  L+ VG +HLGEFVN 
Sbjct: 947  RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 1006

Query: 914  FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
            F HGSLVM+ L ++      +V+FGTVNG+IG++ SL    Y  L  +Q  L KVIK VG
Sbjct: 1007 FCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066

Query: 973  GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN----------VSV 1022
             + H  WRSF+ E+KT  A  F+DGDLIESFLD+SR +M E+   +            + 
Sbjct: 1067 KIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATA 1126

Query: 1023 EELCKRVEELTRLH 1036
            ++L K VEELTR+H
Sbjct: 1127 DDLIKVVEELTRIH 1140


>gi|359546285|pdb|4A11|A Chain A, Structure Of The Hsddb1-Hscsa Complex
 gi|361132519|pdb|4A0K|C Chain C, Structure Of Ddb1-Ddb2-Cul4a-Rbx1 Bound To A 12 Bp Abasic
            Site Containing Dna-Duplex
          Length = 1159

 Score = 1150 bits (2974), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 578/1094 (52%), Positives = 759/1094 (69%), Gaps = 67/1094 (6%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
            +V +YG+IA +ELFRP GE++D LFI T +Y  C+L++    ES ++ITRA G+V DRIG
Sbjct: 73   EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 132

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RP++ G IGIIDP+CR+IGL LYDGLFKVIP D   +  +AFNIRLEEL V+D+KFLYGC
Sbjct: 133  RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 192

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
              PTI  +YQD +  RHVKTYEV+L++K+F +GPW Q N++  A ++I VP P  G +II
Sbjct: 193  QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 251

Query: 181  GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
            G+E+I Y + + + AI    I+ S    + RVD +GSRYLLGD  G L +L++  E++  
Sbjct: 252  GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 311

Query: 236  ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
                +  L++ELLGETSIA  ++YLDN VV++GS  GDSQL+KLN+  + +GSYV  +E 
Sbjct: 312  GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 371

Query: 293  YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
            + NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W 
Sbjct: 372  FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 431

Query: 353  LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
            LRS  +   D  LV+SF+ +TR+L +N E E+EETE+ GF    QT FC +  + QL+Q+
Sbjct: 432  LRSDPNRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQI 490

Query: 413  TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
            TS SVRLVS   + L +EWK P   +++VA+ N+SQV++A G   L YL+I    L ++ 
Sbjct: 491  TSASVRLVSQEPKALVSEWKEPQAKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 549

Query: 473  HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
            H ++E+E++CLDI P+G++   S + A+G+WTDIS RI  LP   L+ KE LGGEIIPRS
Sbjct: 550  HTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 609

Query: 533  VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
            +L+  FE   YLLCALGDG L  F LN++TG L+DRKKV+LGTQP  LRTF S +TT+VF
Sbjct: 610  ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 669

Query: 593  AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
            A SDRPTVIYSSN KL++SNVNLKEV++MCP NS  +PDSLA+A    LTIGTID+IQKL
Sbjct: 670  ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 729

Query: 653  HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
            HIR++PL E PR+IC+QE S+ F + S +                               
Sbjct: 730  HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGTTALRPSASTQALSSSVSSSKL 789

Query: 682  --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
                      S  EE E+H + ++D  TFE +  +     EY  S++SC    D N Y+ 
Sbjct: 790  FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 849

Query: 734  VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
            VGTA V PEE EP +GRI+VF   DGKLQ +AEKE KGAVYS+  FNGKLLA+IN  ++L
Sbjct: 850  VGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 909

Query: 794  YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
            Y+W       +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK  EG  EE A
Sbjct: 910  YEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 965

Query: 854  RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
            RD+N NWMSAVEILDDD +LGAEN FNLF  +K+S   TDEER  L+ VG +HLGEFVN 
Sbjct: 966  RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 1025

Query: 914  FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
            F HGSLVM+ L ++      +V+FGTVNG+IG++ SL    Y  L  +Q  L KVIK VG
Sbjct: 1026 FCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1085

Query: 973  GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN----------VSV 1022
             + H  WRSF+ E+KT  A  F+DGDLIESFLD+SR +M E+   +            + 
Sbjct: 1086 KIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATA 1145

Query: 1023 EELCKRVEELTRLH 1036
            ++L K VEELTR+H
Sbjct: 1146 DDLIKVVEELTRIH 1159


>gi|348560393|ref|XP_003465998.1| PREDICTED: DNA damage-binding protein 1-like [Cavia porcellus]
          Length = 1140

 Score = 1150 bits (2974), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 579/1094 (52%), Positives = 758/1094 (69%), Gaps = 67/1094 (6%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
            +V +YG+IA +ELFRP GE +D LFI T +Y  C+L++    ES ++ITRA G+V DRIG
Sbjct: 54   EVGMYGKIAVMELFRPKGENKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 113

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RP++ G IGIIDP+CR+IGL LYDGLFKVIP D   +  +AFNIRLEEL V+D+KFLYGC
Sbjct: 114  RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 173

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
              PTI  +YQD +  RHVKTYEV+L++K+F +GPW Q N++  A ++I VP P  G +II
Sbjct: 174  QAPTICFVYQDPQ-GRHVKTYEVSLREKEFSKGPWKQENVEAEASMVIAVPEPFGGAIII 232

Query: 181  GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
            G+E+I Y + + + AI    I+ S    + RVD +GSRYLLGD  G L +L++  E++  
Sbjct: 233  GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 292

Query: 236  ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
                +  L++ELLGETSIA  ++YLDN VV++GS  GDSQL+KLN+  + +GSYV  +E 
Sbjct: 293  GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 352

Query: 293  YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
            + NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W 
Sbjct: 353  FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412

Query: 353  LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
            LRS      D  LV+SF+ +TR+L +N E E+EETE+ GF    QT FC +  + QL+Q+
Sbjct: 413  LRSDPSRETDDTLVLSFVGQTRVLMLNGE-EVEETELVGFVDDQQTFFCGNVAHQQLIQI 471

Query: 413  TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
            TS SVRLVS   + L +EWK P G +++VA+ N+SQV++A G   L YL+I    L ++ 
Sbjct: 472  TSASVRLVSQEPKALVSEWKEPQGKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 530

Query: 473  HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
            H ++E+E++CLDI P+G++   S + A+G+WTDIS RI  LP   L+ KE LGGEIIPRS
Sbjct: 531  HTEMEHEVACLDITPLGDSCGLSPLCAIGLWTDISARILQLPSFELLHKEMLGGEIIPRS 590

Query: 533  VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
            +L+  FE   YLLCALGDG L  F LN++TG L+DRKKV+LGTQP  LRTF S +TT+VF
Sbjct: 591  ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650

Query: 593  AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
            A SDRPTVIYSSN KL++SNVNLKEV++MCP NS  +PDSLA+A    LTIGTID+IQKL
Sbjct: 651  ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 710

Query: 653  HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
            HIR++PL E PR+IC+QE S+ F + S +                               
Sbjct: 711  HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGTTVLRPSASTQALSSSVSSSKL 770

Query: 682  --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
                      S  EE E+H + ++D  TFE +  +     EY  S++SC    D N Y+ 
Sbjct: 771  FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCRLGKDPNTYFI 830

Query: 734  VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
            VGTA V PEE EP +GRI+VF   DGKLQ +AEKE KGAVYS+  FNGKLLA+IN  ++L
Sbjct: 831  VGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 890

Query: 794  YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
            Y+W       +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK  EG  EE A
Sbjct: 891  YEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 946

Query: 854  RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
            RD+N NWMSAVEILDDD +LGAEN FNLF  +K+S   TDEER  L+ VG +HLGEFVN 
Sbjct: 947  RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 1006

Query: 914  FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
            F HGSLVM+ L ++      +V+FGTVNG+IG++ SL    Y  L  +Q  L KVIK VG
Sbjct: 1007 FCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066

Query: 973  GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN----------VSV 1022
             + H  WRSF+ E+KT  A  F+DGDLIESFLD+SR +M E+   +            + 
Sbjct: 1067 KIEHSFWRSFHTERKTEQATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATA 1126

Query: 1023 EELCKRVEELTRLH 1036
            ++L K VEELTR+H
Sbjct: 1127 DDLIKVVEELTRIH 1140


>gi|432851195|ref|XP_004066902.1| PREDICTED: DNA damage-binding protein 1-like [Oryzias latipes]
          Length = 1140

 Score = 1149 bits (2973), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 579/1094 (52%), Positives = 764/1094 (69%), Gaps = 67/1094 (6%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
            +V +YGRI+ +ELFRP GE++D LFI T +Y  C+L++  + +S ++ITRA G+V DRIG
Sbjct: 54   EVGMYGRISVMELFRPKGESKDLLFILTSKYNACILEYKQNGDSIDIITRAHGNVQDRIG 113

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RP++ G IGI+DP+CR+IGL LYDGLFKVIP D + +  +AFNIRLEELQV+D+ FLYGC
Sbjct: 114  RPSETGIIGIVDPECRMIGLRLYDGLFKVIPLDRENRELKAFNIRLEELQVIDVHFLYGC 173

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
              PT+  +YQD +  RHVKTYEV+L++K+F +GPW Q N++  A ++IPVP P  G +II
Sbjct: 174  QAPTVCFIYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIPVPEPFGGAIII 232

Query: 181  GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEK--- 234
            G+E+I Y + + + AI    I+ S    + RVD +GSRYLLGD  G L +L++  E+   
Sbjct: 233  GQESITYHNGDKYLAIAPPTIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEELMD 292

Query: 235  --EKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
                +  L +ELLGETSIA  ++YLDN VV++GS  GDSQL+KLN+  + +GSYV V+E 
Sbjct: 293  GTVALKDLHVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVTVMET 352

Query: 293  YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
            + NLGPI+D CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W 
Sbjct: 353  FTNLGPILDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412

Query: 353  LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
            LRS      D  LV+SF+ +TR+L ++ E E+EETE+ GF    QT +C +  + QL+Q+
Sbjct: 413  LRSEAGRESDDMLVLSFVGQTRVLMLSGE-EVEETELPGFVDNQQTFYCGNVAHQQLIQI 471

Query: 413  TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
            TSGSVRLV   S+ L +EWK P G +++VA  N +QV+LA G   L YL+I  G L ++ 
Sbjct: 472  TSGSVRLVLQDSKALVSEWKEPQGRNISVAACNHTQVVLAVGRA-LYYLQILAGELKQIS 530

Query: 473  HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
              ++E+E++CLDI P+GE+ S S + AVG+WTDIS R+  LP    + KE LGGEIIPRS
Sbjct: 531  TTEMEHEVACLDITPLGESSSESPLCAVGLWTDISARMLKLPCFTPLHKEMLGGEIIPRS 590

Query: 533  VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
            +L+  FEG  YLLCALGDG L  F L+++TG L++RKKV+LGTQP  LRTF S +T++VF
Sbjct: 591  ILMTTFEGSYYLLCALGDGALFYFGLDLQTGALSERKKVTLGTQPTVLRTFRSLSTSNVF 650

Query: 593  AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
            A SDRPTVIYSSN KL++SNVNLKEV++MCP NS  +PDSLA+A    LTIGTID+IQKL
Sbjct: 651  ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSEGYPDSLALANNSTLTIGTIDEIQKL 710

Query: 653  HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
            HIR++PL E PRRIC+QE S+ F + S +                               
Sbjct: 711  HIRTVPLYESPRRICYQEVSQCFGVLSSRVEIQDASGAISAVRPSASTQALSSSVSSSKL 770

Query: 682  --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
                      S  EE E+H + ++D  TFE +  +     EYG S++SC    D +VY+ 
Sbjct: 771  FPSSTSPHETSFGEEVEVHSLLVVDQHTFEVLHAHQFLPSEYGLSLVSCRLGKDPSVYFI 830

Query: 734  VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
            VGTA V PEE EP +GRI+VF   DGKLQ +AEKE KGAVYS+  FNGKLLA+IN  ++L
Sbjct: 831  VGTAMVYPEEPEPKQGRIIVFHYTDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 890

Query: 794  YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
            Y+W       +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK  EG  EE A
Sbjct: 891  YEWT----AEKELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 946

Query: 854  RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
            RD+N NWMSAVEILDDD +LGAEN FNLF  +K+S   TDEER  L+ VG +HLGEFVN 
Sbjct: 947  RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGVFHLGEFVNV 1006

Query: 914  FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
            F HGSLV++ L +S      +V+FGTVNG+IG++ SL    +  L  LQ  L KVIK VG
Sbjct: 1007 FCHGSLVLQNLGESSTPTQGSVLFGTVNGMIGLVTSLSEGWHSLLLDLQNRLNKVIKSVG 1066

Query: 973  GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV----------SV 1022
             + H  WRSF  E+KT  A  F+DGDLIESFLDL R +M E+  T+ +          +V
Sbjct: 1067 KIEHSFWRSFYTERKTEQATGFIDGDLIESFLDLGRAKMQEVVSTLQIDDGGGMKREATV 1126

Query: 1023 EELCKRVEELTRLH 1036
            +E+ K VEELTR+H
Sbjct: 1127 DEVIKIVEELTRIH 1140


>gi|221046711|pdb|3EI1|A Chain A, Structure Of Hsddb1-Drddb2 Bound To A 14 Bp 6-4 Photoproduct
            Containing Dna-Duplex
 gi|221046715|pdb|3EI2|A Chain A, Structure Of Hsddb1-Drddb2 Bound To A 16 Bp Abasic Site
            Containing Dna-Duplex
 gi|221046719|pdb|3EI3|A Chain A, Structure Of The Hsddb1-Drddb2 Complex
          Length = 1158

 Score = 1149 bits (2973), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 578/1094 (52%), Positives = 759/1094 (69%), Gaps = 67/1094 (6%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
            +V +YG+IA +ELFRP GE++D LFI T +Y  C+L++    ES ++ITRA G+V DRIG
Sbjct: 72   EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 131

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RP++ G IGIIDP+CR+IGL LYDGLFKVIP D   +  +AFNIRLEEL V+D+KFLYGC
Sbjct: 132  RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 191

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
              PTI  +YQD +  RHVKTYEV+L++K+F +GPW Q N++  A ++I VP P  G +II
Sbjct: 192  QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 250

Query: 181  GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
            G+E+I Y + + + AI    I+ S    + RVD +GSRYLLGD  G L +L++  E++  
Sbjct: 251  GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 310

Query: 236  ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
                +  L++ELLGETSIA  ++YLDN VV++GS  GDSQL+KLN+  + +GSYV  +E 
Sbjct: 311  GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 370

Query: 293  YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
            + NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W 
Sbjct: 371  FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 430

Query: 353  LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
            LRS  +   D  LV+SF+ +TR+L +N E E+EETE+ GF    QT FC +  + QL+Q+
Sbjct: 431  LRSDPNRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQI 489

Query: 413  TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
            TS SVRLVS   + L +EWK P   +++VA+ N+SQV++A G   L YL+I    L ++ 
Sbjct: 490  TSASVRLVSQEPKALVSEWKEPQAKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 548

Query: 473  HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
            H ++E+E++CLDI P+G++   S + A+G+WTDIS RI  LP   L+ KE LGGEIIPRS
Sbjct: 549  HTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 608

Query: 533  VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
            +L+  FE   YLLCALGDG L  F LN++TG L+DRKKV+LGTQP  LRTF S +TT+VF
Sbjct: 609  ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 668

Query: 593  AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
            A SDRPTVIYSSN KL++SNVNLKEV++MCP NS  +PDSLA+A    LTIGTID+IQKL
Sbjct: 669  ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 728

Query: 653  HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
            HIR++PL E PR+IC+QE S+ F + S +                               
Sbjct: 729  HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGTTALRPSASTQALSSSVSSSKL 788

Query: 682  --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
                      S  EE E+H + ++D  TFE +  +     EY  S++SC    D N Y+ 
Sbjct: 789  FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 848

Query: 734  VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
            VGTA V PEE EP +GRI+VF   DGKLQ +AEKE KGAVYS+  FNGKLLA+IN  ++L
Sbjct: 849  VGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 908

Query: 794  YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
            Y+W       +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK  EG  EE A
Sbjct: 909  YEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 964

Query: 854  RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
            RD+N NWMSAVEILDDD +LGAEN FNLF  +K+S   TDEER  L+ VG +HLGEFVN 
Sbjct: 965  RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 1024

Query: 914  FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
            F HGSLVM+ L ++      +V+FGTVNG+IG++ SL    Y  L  +Q  L KVIK VG
Sbjct: 1025 FCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1084

Query: 973  GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN----------VSV 1022
             + H  WRSF+ E+KT  A  F+DGDLIESFLD+SR +M E+   +            + 
Sbjct: 1085 KIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATA 1144

Query: 1023 EELCKRVEELTRLH 1036
            ++L K VEELTR+H
Sbjct: 1145 DDLIKVVEELTRIH 1158


>gi|358440066|pdb|4A0A|A Chain A, Structure Of Hsddb1-Drddb2 Bound To A 16 Bp Cpd-Duplex (
            Pyrimidine At D-1 Position) At 3.6 A Resolution (Cpd 3)
          Length = 1159

 Score = 1149 bits (2973), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 578/1094 (52%), Positives = 759/1094 (69%), Gaps = 67/1094 (6%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
            +V +YG+IA +ELFRP GE++D LFI T +Y  C+L++    ES ++ITRA G+V DRIG
Sbjct: 73   EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 132

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RP++ G IGIIDP+CR+IGL LYDGLFKVIP D   +  +AFNIRLEEL V+D+KFLYGC
Sbjct: 133  RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 192

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
              PTI  +YQD +  RHVKTYEV+L++K+F +GPW Q N++  A ++I VP P  G +II
Sbjct: 193  QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPKPFGGAIII 251

Query: 181  GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
            G+E+I Y + + + AI    I+ S    + RVD +GSRYLLGD  G L +L++  E++  
Sbjct: 252  GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 311

Query: 236  ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
                +  L++ELLGETSIA  ++YLDN VV++GS  GDSQL+KLN+  + +GSYV  +E 
Sbjct: 312  GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 371

Query: 293  YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
            + NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W 
Sbjct: 372  FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 431

Query: 353  LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
            LRS  +   D  LV+SF+ +TR+L +N E E+EETE+ GF    QT FC +  + QL+Q+
Sbjct: 432  LRSDPNRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQI 490

Query: 413  TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
            TS SVRLVS   + L +EWK P   +++VA+ N+SQV++A G   L YL+I    L ++ 
Sbjct: 491  TSASVRLVSQEPKALVSEWKEPQAKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 549

Query: 473  HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
            H ++E+E++CLDI P+G++   S + A+G+WTDIS RI  LP   L+ KE LGGEIIPRS
Sbjct: 550  HTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 609

Query: 533  VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
            +L+  FE   YLLCALGDG L  F LN++TG L+DRKKV+LGTQP  LRTF S +TT+VF
Sbjct: 610  ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 669

Query: 593  AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
            A SDRPTVIYSSN KL++SNVNLKEV++MCP NS  +PDSLA+A    LTIGTID+IQKL
Sbjct: 670  ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 729

Query: 653  HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
            HIR++PL E PR+IC+QE S+ F + S +                               
Sbjct: 730  HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGTTALRPSASTQALSSSVSSSKL 789

Query: 682  --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
                      S  EE E+H + ++D  TFE +  +     EY  S++SC    D N Y+ 
Sbjct: 790  FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 849

Query: 734  VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
            VGTA V PEE EP +GRI+VF   DGKLQ +AEKE KGAVYS+  FNGKLLA+IN  ++L
Sbjct: 850  VGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 909

Query: 794  YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
            Y+W       +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK  EG  EE A
Sbjct: 910  YEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 965

Query: 854  RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
            RD+N NWMSAVEILDDD +LGAEN FNLF  +K+S   TDEER  L+ VG +HLGEFVN 
Sbjct: 966  RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 1025

Query: 914  FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
            F HGSLVM+ L ++      +V+FGTVNG+IG++ SL    Y  L  +Q  L KVIK VG
Sbjct: 1026 FCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1085

Query: 973  GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN----------VSV 1022
             + H  WRSF+ E+KT  A  F+DGDLIESFLD+SR +M E+   +            + 
Sbjct: 1086 KIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATA 1145

Query: 1023 EELCKRVEELTRLH 1036
            ++L K VEELTR+H
Sbjct: 1146 DDLIKVVEELTRIH 1159


>gi|358440070|pdb|4A0B|A Chain A, Structure Of Hsddb1-Drddb2 Bound To A 16 Bp Cpd-Duplex (
            Pyrimidine At D-1 Position) At 3.8 A Resolution (Cpd 4)
 gi|358440072|pdb|4A0B|C Chain C, Structure Of Hsddb1-Drddb2 Bound To A 16 Bp Cpd-Duplex (
            Pyrimidine At D-1 Position) At 3.8 A Resolution (Cpd 4)
          Length = 1159

 Score = 1149 bits (2973), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 578/1094 (52%), Positives = 759/1094 (69%), Gaps = 67/1094 (6%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
            +V +YG+IA +ELFRP GE++D LFI T +Y  C+L++    ES ++ITRA G+V DRIG
Sbjct: 73   EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 132

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RP++ G IGIIDP+CR+IGL LYDGLFKVIP D   +  +AFNIRLEEL V+D+KFLYGC
Sbjct: 133  RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 192

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
              PTI  +YQD +  RHVKTYEV+L++K+F +GPW Q N++  A ++I VP P  G +II
Sbjct: 193  QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPSPFGGAIII 251

Query: 181  GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
            G+E+I Y + + + AI    I+ S    + RVD +GSRYLLGD  G L +L++  E++  
Sbjct: 252  GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 311

Query: 236  ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
                +  L++ELLGETSIA  ++YLDN VV++GS  GDSQL+KLN+  + +GSYV  +E 
Sbjct: 312  GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 371

Query: 293  YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
            + NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W 
Sbjct: 372  FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 431

Query: 353  LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
            LRS  +   D  LV+SF+ +TR+L +N E E+EETE+ GF    QT FC +  + QL+Q+
Sbjct: 432  LRSDPNRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQI 490

Query: 413  TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
            TS SVRLVS   + L +EWK P   +++VA+ N+SQV++A G   L YL+I    L ++ 
Sbjct: 491  TSASVRLVSQEPKALVSEWKEPQAKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 549

Query: 473  HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
            H ++E+E++CLDI P+G++   S + A+G+WTDIS RI  LP   L+ KE LGGEIIPRS
Sbjct: 550  HTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 609

Query: 533  VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
            +L+  FE   YLLCALGDG L  F LN++TG L+DRKKV+LGTQP  LRTF S +TT+VF
Sbjct: 610  ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 669

Query: 593  AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
            A SDRPTVIYSSN KL++SNVNLKEV++MCP NS  +PDSLA+A    LTIGTID+IQKL
Sbjct: 670  ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 729

Query: 653  HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
            HIR++PL E PR+IC+QE S+ F + S +                               
Sbjct: 730  HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGTTALRPSASTQALSSSVSSSKL 789

Query: 682  --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
                      S  EE E+H + ++D  TFE +  +     EY  S++SC    D N Y+ 
Sbjct: 790  FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 849

Query: 734  VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
            VGTA V PEE EP +GRI+VF   DGKLQ +AEKE KGAVYS+  FNGKLLA+IN  ++L
Sbjct: 850  VGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 909

Query: 794  YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
            Y+W       +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK  EG  EE A
Sbjct: 910  YEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 965

Query: 854  RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
            RD+N NWMSAVEILDDD +LGAEN FNLF  +K+S   TDEER  L+ VG +HLGEFVN 
Sbjct: 966  RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 1025

Query: 914  FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
            F HGSLVM+ L ++      +V+FGTVNG+IG++ SL    Y  L  +Q  L KVIK VG
Sbjct: 1026 FCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1085

Query: 973  GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN----------VSV 1022
             + H  WRSF+ E+KT  A  F+DGDLIESFLD+SR +M E+   +            + 
Sbjct: 1086 KIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATA 1145

Query: 1023 EELCKRVEELTRLH 1036
            ++L K VEELTR+H
Sbjct: 1146 DDLIKVVEELTRIH 1159


>gi|301121252|ref|XP_002908353.1| DNA damage-binding protein, putative [Phytophthora infestans T30-4]
 gi|262103384|gb|EEY61436.1| DNA damage-binding protein, putative [Phytophthora infestans T30-4]
          Length = 1150

 Score = 1149 bits (2973), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 580/1098 (52%), Positives = 767/1098 (69%), Gaps = 65/1098 (5%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRP 62
            D+ +YGRIA  E+FR   E QD++FI T+R++FCVL +D+ + +++T+A G + D IGR 
Sbjct: 54   DINLYGRIAIFEVFRAANEPQDWIFIVTQRFQFCVLAYDSSAQQVVTKAHGSIRDSIGRS 113

Query: 63   TDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDN-KGQLKEAFNIRLEELQVLDIKFLYGCA 121
            ++    G IDP+ RLIG++LY+G FKVIP D+ KG L++ FNIRL+EL+V+DIKFL+G  
Sbjct: 114  SEIVTSGNIDPEGRLIGMNLYEGYFKVIPIDSGKGILRDTFNIRLDELRVIDIKFLHGYN 173

Query: 122  KPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIG 181
            KPTI VLY+D K ARHVKTY + LK+KDF EGPWSQ+N+++GA LLIPVP P  GVLI+ 
Sbjct: 174  KPTICVLYEDYKAARHVKTYHILLKEKDFAEGPWSQSNVESGASLLIPVPAPTGGVLIVS 233

Query: 182  EETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLK 241
             +TIVY + + F AIP++ ++ + YG VD DGSR+LL D  G L ++ + H  ++V+G+ 
Sbjct: 234  NQTIVYHNGSTFHAIPMQSTVIQVYGAVDKDGSRFLLADQYGTLSVVALQHTGKEVSGVH 293

Query: 242  IELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVD 301
            +E+LGET+IAS +SYLDN VV+IGS++GDSQLIKLN   D  GSY+EVL+ YVN+GPI+D
Sbjct: 294  LEVLGETNIASCLSYLDNGVVFIGSTFGDSQLIKLNADRDETGSYIEVLDSYVNVGPIID 353

Query: 302  FCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDPF 361
            FCV+DL+RQGQGQ+VTCSGA KDG+LR++RNGIGINEQAS EL GIKGMW+LR +     
Sbjct: 354  FCVMDLDRQGQGQIVTCSGADKDGTLRVIRNGIGINEQASAELPGIKGMWALRETFAAEH 413

Query: 362  DTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVS 421
            D FL+ S++SE RILA+  EDE+EE EI  F +  +TL C +   +  +QVT   VRL+S
Sbjct: 414  DKFLLQSYVSEVRILAIGDEDEMEEKEIPAF-TNVKTLLCRNMYGDYWLQVTESEVRLIS 472

Query: 422  STSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEIS 481
             +S  L + W    G  + VA AN +QV +AT GG LVYLEI +G + E    ++E+E++
Sbjct: 473  CSSFSLSSTWSPASGSRITVAAANPTQVAVATSGGVLVYLEIENGQVVEKTTVKMEHELA 532

Query: 482  CLDINPI-------------GENPSY------SQIAAVGMWTDISVRIFSLPDLNLITKE 522
            C+DI P+             G +  +      S +  VG+WT  SV +  LP L   T E
Sbjct: 533  CVDITPLTSSGAGDGDVAMTGSSTHWDMAALRSTLCVVGLWTSFSVCVLQLPTLEKRTTE 592

Query: 523  HLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRT 582
             LG +++PRSVL   FEG  YLL  LGDG L+N+ L+++ G L  RK+VSLG+QP++L T
Sbjct: 593  ALGTDLLPRSVLCNTFEGKDYLLVGLGDGSLMNYELDVQQGALGTRKRVSLGSQPLSLST 652

Query: 583  FSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELT 642
            F SKN THVFAA DRPTVIYS+N KLLYSN+N KEV+ MCPF+S +FP+ LA++ E ELT
Sbjct: 653  FRSKNMTHVFAACDRPTVIYSNNNKLLYSNINSKEVNVMCPFDSESFPECLALSSEEELT 712

Query: 643  IGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAI--CSLKNQSCAEESEMHFVRLLDDQ 700
            IGT+DDIQKLHI++  L E  RRI H  +S T  +   S       EE +  FVRL DDQ
Sbjct: 713  IGTVDDIQKLHIQTFHLNEWARRIAHDPESHTLGVLTVSFTVDDTGEEVDQGFVRLFDDQ 772

Query: 701  TFEFISTYPLDTFEYGCSILSCSFSDDS--NVYYCVGTAYVLPEENEPTKGRILVFIVE- 757
            TFE + +Y LD FE  CS++ C  + DS    Y+ VGTAY+  EE EP +GRILVF V  
Sbjct: 773  TFEVLHSYRLDPFETPCSVVVCPLAGDSVNASYFVVGTAYIHEEEAEPHQGRILVFAVTG 832

Query: 758  ---DGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHG 814
               + KLQL+ EKE KGAVY LN+FNGK+LA +N K QLYKW    D  +EL SECGH+G
Sbjct: 833  IHGERKLQLVTEKEVKGAVYCLNSFNGKVLAGVNSKAQLYKWSENTDNEKELVSECGHYG 892

Query: 815  HILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG 874
            H L LY+++RGDFIVVGDLMKSISLL YK  +G IEE A+D N+NWMSAV I+DDD Y+G
Sbjct: 893  HTLVLYMESRGDFIVVGDLMKSISLLSYKQLDGTIEEIAKDLNSNWMSAVGIVDDDTYIG 952

Query: 875  AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMR------------ 922
            +E +FNLFTV++NS  A+DEERGRLE VGE+HLGEFVNRFR+GSLVM+            
Sbjct: 953  SETDFNLFTVQRNSGAASDEERGRLETVGEFHLGEFVNRFRYGSLVMQNSSSTSQTPSGV 1012

Query: 923  --------------LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVI 968
                           P + V Q  +++FGTV+G+IGVI  +  +QY FL ++Q  L  V+
Sbjct: 1013 VSTGPTAMVDVGESAPAAPVVQNQSMLFGTVSGMIGVILPISKDQYSFLLRVQQALTHVV 1072

Query: 969  KGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN--------- 1019
            KGVGG +H+ WR+F N +   +A+NF+DGDL+ESFLDL + +M ++   +N         
Sbjct: 1073 KGVGGFSHKDWRTFENRRSVSEARNFIDGDLVESFLDLPKPQMTKVVDKLNSDGMLDGTD 1132

Query: 1020 -VSVEELCKRVEELTRLH 1036
              +VE+L  R+EEL +LH
Sbjct: 1133 QFTVEDLTLRIEELVQLH 1150


>gi|395852550|ref|XP_003798801.1| PREDICTED: DNA damage-binding protein 1 [Otolemur garnettii]
          Length = 1140

 Score = 1149 bits (2973), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 578/1094 (52%), Positives = 759/1094 (69%), Gaps = 67/1094 (6%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
            +V +YG+IA +ELFRP GE++D LFI T +Y  C+L++    ES ++ITRA G+V DRIG
Sbjct: 54   EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 113

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RP++ G IGIIDP+CR+IGL LYDGLFKVIP D   +  +AFNIRLEEL V+D+KFLYGC
Sbjct: 114  RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 173

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
              PTI  +YQD +  RHVKTYEV+L++K+F +GPW Q N++  A ++I VP P  G +II
Sbjct: 174  QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232

Query: 181  GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
            G+E+I Y + + + AI    I+ S    + RVD +GSRYLLGD  G L +L++  E++  
Sbjct: 233  GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 292

Query: 236  ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
                +  L++ELLGETSIA  ++YLDN VV++GS  GDSQL+KLN+  + +GSYV  +E 
Sbjct: 293  GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 352

Query: 293  YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
            + NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W 
Sbjct: 353  FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412

Query: 353  LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
            LRS  +   D  LV+SF+ +TR+L +N E E+EETE+ GF    QT FC +  + QL+Q+
Sbjct: 413  LRSEPNRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQI 471

Query: 413  TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
            TS SVRLV    + L +EWK P G +++VA+ N+SQV++A G   L YL+I    L ++ 
Sbjct: 472  TSASVRLVCQEPKALVSEWKEPQGKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 530

Query: 473  HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
            H ++E+E++CLDI P+G++   S + A+G+WTDIS RI  LP   L+ KE LGGEIIPRS
Sbjct: 531  HTEMEHEVACLDITPLGDSNGMSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 590

Query: 533  VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
            +L+  FE   YLLCALGDG L  F LN++TG L+DRKKV+LGTQP  LRTF S +TT+VF
Sbjct: 591  ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650

Query: 593  AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
            A SDRPTVIYSSN KL++SNVNLKEV++MCP NS  +PDSLA+A    LTIGTID+IQKL
Sbjct: 651  ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 710

Query: 653  HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
            HIR++PL E PR+IC+QE S+ F + S +                               
Sbjct: 711  HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGTTALRPSASTQALSSSVSSSKL 770

Query: 682  --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
                      S  EE E+H + ++D  TFE +  +     EY  S++SC    D N Y+ 
Sbjct: 771  FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 830

Query: 734  VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
            VGTA V PEE EP +GRI+VF   DGKLQ +AEKE KGAVYS+  FNGKLLA+IN  ++L
Sbjct: 831  VGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 890

Query: 794  YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
            Y+W       +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK  EG  EE A
Sbjct: 891  YEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 946

Query: 854  RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
            RD+N NWMSAVEILDDD +LGAEN FNLF  +K+S   TDEER  L+ VG +HLGEFVN 
Sbjct: 947  RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 1006

Query: 914  FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
            F HGSLVM+ L ++      +V+FGTVNG+IG++ SL    Y  L  +Q  L KVIK VG
Sbjct: 1007 FCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066

Query: 973  GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN----------VSV 1022
             + H  WRSF+ E+KT  A  F+DGDLIESFLD+SR +M E+   +            + 
Sbjct: 1067 KIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATA 1126

Query: 1023 EELCKRVEELTRLH 1036
            ++L K VEELTR+H
Sbjct: 1127 DDLIKVVEELTRIH 1140


>gi|74215029|dbj|BAE33503.1| unnamed protein product [Mus musculus]
          Length = 1140

 Score = 1149 bits (2973), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 578/1094 (52%), Positives = 759/1094 (69%), Gaps = 67/1094 (6%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
            +V +YG+IA +ELFRP GE++D LFI T +Y  C+L++    ES ++ITRA G+V DRIG
Sbjct: 54   EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 113

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RP++ G IGIIDP+CR+IGL LYDGLFKVIP D   +  +AFNIRLEEL V+D+KFLYGC
Sbjct: 114  RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 173

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
              PTI  +YQD +  RHVKTYEV+L++K+F +GPW Q N++  A ++I VP P  G +II
Sbjct: 174  QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232

Query: 181  GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
            G+E+I Y + + + AI    I+ S    + RVD +GSRYLLGD  G L +L++  E++  
Sbjct: 233  GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 292

Query: 236  ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
                +  L++ELLGETSIA  ++YLDN VV++GS  GDSQL+KLN+  + +GSYV  +E 
Sbjct: 293  GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 352

Query: 293  YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
            + NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W 
Sbjct: 353  FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412

Query: 353  LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
            LRS      D  LV+SF+ +TR+L +N E E+EETE+ GF    QT FC +  + QL+Q+
Sbjct: 413  LRSDPGRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQI 471

Query: 413  TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
            TS SVRLVS   + L +EWK P G +++VA+ N+SQV++A G   L YL+I    L ++ 
Sbjct: 472  TSASVRLVSQEPKALVSEWKEPQGKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 530

Query: 473  HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
            H ++E+E++CLDI P+G++   S + A+G+WTDIS RI  LP   L+ KE LGGEIIPRS
Sbjct: 531  HTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 590

Query: 533  VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
            +L+  FE   YLLCALGDG L  F LN++TG L+DRKKV+LGTQP  LRTF S +TT+VF
Sbjct: 591  ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650

Query: 593  AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
            A SDRPTVIYSSN KL++SNVNLKEV++MCP NS  +PDSLA+A    LTIGTID+IQKL
Sbjct: 651  ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 710

Query: 653  HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
            HIR++PL E PR+IC+QE S+ F + S +                               
Sbjct: 711  HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDSSGGTTALRPSASTQALSSSVSSSKL 770

Query: 682  --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
                      S  EE E+H + ++D  TFE +  +     EY  S++SC    D N Y+ 
Sbjct: 771  FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 830

Query: 734  VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
            VGTA V PEE EP +GRI+VF   DGKLQ +AEKE KGAVYS+  FNGKLLA+IN  ++L
Sbjct: 831  VGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 890

Query: 794  YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
            Y+W       +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK  EG  EE A
Sbjct: 891  YEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 946

Query: 854  RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
            RD+N NWMSAVEILDDD +LGAEN FNLF  +++S   TDEER  L+ VG +HLGEFVN 
Sbjct: 947  RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQRDSAATTDEERQHLQEVGLFHLGEFVNV 1006

Query: 914  FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
            F HGSLVM+ L ++      +V+FGTVNG+IG++ SL    Y  L  +Q  L KVIK VG
Sbjct: 1007 FCHGSLVMQNLGEASTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066

Query: 973  GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN----------VSV 1022
             + H  WRSF+ E+KT  A  F+DGDLIESFLD+SR +M E+   +            + 
Sbjct: 1067 KIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATA 1126

Query: 1023 EELCKRVEELTRLH 1036
            ++L K VEELTR+H
Sbjct: 1127 DDLIKVVEELTRIH 1140


>gi|361132523|pdb|4A0L|A Chain A, Structure Of Ddb1-Ddb2-Cul4b-Rbx1 Bound To A 12 Bp Abasic
            Site Containing Dna-Duplex
 gi|361132525|pdb|4A0L|C Chain C, Structure Of Ddb1-Ddb2-Cul4b-Rbx1 Bound To A 12 Bp Abasic
            Site Containing Dna-Duplex
          Length = 1144

 Score = 1149 bits (2972), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 578/1094 (52%), Positives = 759/1094 (69%), Gaps = 67/1094 (6%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
            +V +YG+IA +ELFRP GE++D LFI T +Y  C+L++    ES ++ITRA G+V DRIG
Sbjct: 58   EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 117

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RP++ G IGIIDP+CR+IGL LYDGLFKVIP D   +  +AFNIRLEEL V+D+KFLYGC
Sbjct: 118  RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 177

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
              PTI  +YQD +  RHVKTYEV+L++K+F +GPW Q N++  A ++I VP P  G +II
Sbjct: 178  QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 236

Query: 181  GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
            G+E+I Y + + + AI    I+ S    + RVD +GSRYLLGD  G L +L++  E++  
Sbjct: 237  GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 296

Query: 236  ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
                +  L++ELLGETSIA  ++YLDN VV++GS  GDSQL+KLN+  + +GSYV  +E 
Sbjct: 297  GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 356

Query: 293  YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
            + NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W 
Sbjct: 357  FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 416

Query: 353  LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
            LRS  +   D  LV+SF+ +TR+L +N E E+EETE+ GF    QT FC +  + QL+Q+
Sbjct: 417  LRSDPNRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQI 475

Query: 413  TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
            TS SVRLVS   + L +EWK P   +++VA+ N+SQV++A G   L YL+I    L ++ 
Sbjct: 476  TSASVRLVSQEPKALVSEWKEPQAKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 534

Query: 473  HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
            H ++E+E++CLDI P+G++   S + A+G+WTDIS RI  LP   L+ KE LGGEIIPRS
Sbjct: 535  HTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 594

Query: 533  VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
            +L+  FE   YLLCALGDG L  F LN++TG L+DRKKV+LGTQP  LRTF S +TT+VF
Sbjct: 595  ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 654

Query: 593  AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
            A SDRPTVIYSSN KL++SNVNLKEV++MCP NS  +PDSLA+A    LTIGTID+IQKL
Sbjct: 655  ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 714

Query: 653  HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
            HIR++PL E PR+IC+QE S+ F + S +                               
Sbjct: 715  HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGTTALRPSASTQALSSSVSSSKL 774

Query: 682  --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
                      S  EE E+H + ++D  TFE +  +     EY  S++SC    D N Y+ 
Sbjct: 775  FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 834

Query: 734  VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
            VGTA V PEE EP +GRI+VF   DGKLQ +AEKE KGAVYS+  FNGKLLA+IN  ++L
Sbjct: 835  VGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 894

Query: 794  YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
            Y+W       +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK  EG  EE A
Sbjct: 895  YEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 950

Query: 854  RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
            RD+N NWMSAVEILDDD +LGAEN FNLF  +K+S   TDEER  L+ VG +HLGEFVN 
Sbjct: 951  RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 1010

Query: 914  FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
            F HGSLVM+ L ++      +V+FGTVNG+IG++ SL    Y  L  +Q  L KVIK VG
Sbjct: 1011 FCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1070

Query: 973  GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN----------VSV 1022
             + H  WRSF+ E+KT  A  F+DGDLIESFLD+SR +M E+   +            + 
Sbjct: 1071 KIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATA 1130

Query: 1023 EELCKRVEELTRLH 1036
            ++L K VEELTR+H
Sbjct: 1131 DDLIKVVEELTRIH 1144


>gi|148529014|ref|NP_001914.3| DNA damage-binding protein 1 [Homo sapiens]
 gi|296218432|ref|XP_002807395.1| PREDICTED: LOW QUALITY PROTEIN: DNA damage-binding protein 1
            [Callithrix jacchus]
 gi|397516558|ref|XP_003828491.1| PREDICTED: DNA damage-binding protein 1 [Pan paniscus]
 gi|402893195|ref|XP_003909786.1| PREDICTED: DNA damage-binding protein 1 [Papio anubis]
 gi|426368721|ref|XP_004051351.1| PREDICTED: DNA damage-binding protein 1 [Gorilla gorilla gorilla]
 gi|12643730|sp|Q16531.1|DDB1_HUMAN RecName: Full=DNA damage-binding protein 1; AltName: Full=DDB p127
            subunit; AltName: Full=DNA damage-binding protein a;
            Short=DDBa; AltName: Full=Damage-specific DNA-binding
            protein 1; AltName: Full=HBV X-associated protein 1;
            Short=XAP-1; AltName: Full=UV-damaged DNA-binding factor;
            AltName: Full=UV-damaged DNA-binding protein 1;
            Short=UV-DDB 1; AltName: Full=XPE-binding factor;
            Short=XPE-BF; AltName: Full=Xeroderma pigmentosum group
            E-complementing protein; Short=XPCe
 gi|203282525|pdb|3E0C|A Chain A, Crystal Structure Of Dna Damage-Binding Protein 1(Ddb1)
 gi|695362|gb|AAA62838.1| X-associated protein 1, partial [Homo sapiens]
 gi|1052865|gb|AAC50349.1| DDBa p127 [Homo sapiens]
 gi|15079750|gb|AAH11686.1| Damage-specific DNA binding protein 1, 127kDa [Homo sapiens]
 gi|29792243|gb|AAH50530.1| Damage-specific DNA binding protein 1, 127kDa [Homo sapiens]
 gi|30354567|gb|AAH51764.1| Damage-specific DNA binding protein 1, 127kDa [Homo sapiens]
 gi|61354161|gb|AAX44048.1| damage-specific DNA binding protein 1, 127kDa [Homo sapiens]
 gi|119594341|gb|EAW73935.1| damage-specific DNA binding protein 1, 127kDa, isoform CRA_c [Homo
            sapiens]
 gi|168275638|dbj|BAG10539.1| DNA damage-binding protein 1 [synthetic construct]
 gi|189065506|dbj|BAG35345.1| unnamed protein product [Homo sapiens]
 gi|355566436|gb|EHH22815.1| Damage-specific DNA-binding protein 1 [Macaca mulatta]
 gi|380784123|gb|AFE63937.1| DNA damage-binding protein 1 [Macaca mulatta]
 gi|380808126|gb|AFE75938.1| DNA damage-binding protein 1 [Macaca mulatta]
 gi|380810144|gb|AFE76947.1| DNA damage-binding protein 1 [Macaca mulatta]
 gi|383408123|gb|AFH27275.1| DNA damage-binding protein 1 [Macaca mulatta]
 gi|410305600|gb|JAA31400.1| damage-specific DNA binding protein 1, 127kDa [Pan troglodytes]
 gi|410352015|gb|JAA42611.1| damage-specific DNA binding protein 1, 127kDa [Pan troglodytes]
          Length = 1140

 Score = 1149 bits (2972), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 578/1094 (52%), Positives = 759/1094 (69%), Gaps = 67/1094 (6%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
            +V +YG+IA +ELFRP GE++D LFI T +Y  C+L++    ES ++ITRA G+V DRIG
Sbjct: 54   EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 113

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RP++ G IGIIDP+CR+IGL LYDGLFKVIP D   +  +AFNIRLEEL V+D+KFLYGC
Sbjct: 114  RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 173

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
              PTI  +YQD +  RHVKTYEV+L++K+F +GPW Q N++  A ++I VP P  G +II
Sbjct: 174  QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232

Query: 181  GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
            G+E+I Y + + + AI    I+ S    + RVD +GSRYLLGD  G L +L++  E++  
Sbjct: 233  GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 292

Query: 236  ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
                +  L++ELLGETSIA  ++YLDN VV++GS  GDSQL+KLN+  + +GSYV  +E 
Sbjct: 293  GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 352

Query: 293  YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
            + NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W 
Sbjct: 353  FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412

Query: 353  LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
            LRS  +   D  LV+SF+ +TR+L +N E E+EETE+ GF    QT FC +  + QL+Q+
Sbjct: 413  LRSDPNRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQI 471

Query: 413  TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
            TS SVRLVS   + L +EWK P   +++VA+ N+SQV++A G   L YL+I    L ++ 
Sbjct: 472  TSASVRLVSQEPKALVSEWKEPQAKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 530

Query: 473  HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
            H ++E+E++CLDI P+G++   S + A+G+WTDIS RI  LP   L+ KE LGGEIIPRS
Sbjct: 531  HTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 590

Query: 533  VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
            +L+  FE   YLLCALGDG L  F LN++TG L+DRKKV+LGTQP  LRTF S +TT+VF
Sbjct: 591  ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650

Query: 593  AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
            A SDRPTVIYSSN KL++SNVNLKEV++MCP NS  +PDSLA+A    LTIGTID+IQKL
Sbjct: 651  ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 710

Query: 653  HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
            HIR++PL E PR+IC+QE S+ F + S +                               
Sbjct: 711  HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGTTALRPSASTQALSSSVSSSKL 770

Query: 682  --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
                      S  EE E+H + ++D  TFE +  +     EY  S++SC    D N Y+ 
Sbjct: 771  FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 830

Query: 734  VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
            VGTA V PEE EP +GRI+VF   DGKLQ +AEKE KGAVYS+  FNGKLLA+IN  ++L
Sbjct: 831  VGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 890

Query: 794  YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
            Y+W       +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK  EG  EE A
Sbjct: 891  YEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 946

Query: 854  RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
            RD+N NWMSAVEILDDD +LGAEN FNLF  +K+S   TDEER  L+ VG +HLGEFVN 
Sbjct: 947  RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 1006

Query: 914  FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
            F HGSLVM+ L ++      +V+FGTVNG+IG++ SL    Y  L  +Q  L KVIK VG
Sbjct: 1007 FCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066

Query: 973  GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN----------VSV 1022
             + H  WRSF+ E+KT  A  F+DGDLIESFLD+SR +M E+   +            + 
Sbjct: 1067 KIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATA 1126

Query: 1023 EELCKRVEELTRLH 1036
            ++L K VEELTR+H
Sbjct: 1127 DDLIKVVEELTRIH 1140


>gi|400260815|pdb|4E54|A Chain A, Damaged Dna Induced Uv-Damaged Dna-Binding Protein (Uv-Ddb)
            Dimerization And Its Roles In Chromatinized Dna Repair
 gi|401871507|pdb|4E5Z|A Chain A, Damaged Dna Induced Uv-Damaged Dna-Binding Protein (Uv-Ddb)
            Dimerization And Its Roles In Chromatinized Dna Repair
          Length = 1150

 Score = 1149 bits (2972), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 578/1094 (52%), Positives = 759/1094 (69%), Gaps = 67/1094 (6%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
            +V +YG+IA +ELFRP GE++D LFI T +Y  C+L++    ES ++ITRA G+V DRIG
Sbjct: 64   EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 123

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RP++ G IGIIDP+CR+IGL LYDGLFKVIP D   +  +AFNIRLEEL V+D+KFLYGC
Sbjct: 124  RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 183

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
              PTI  +YQD +  RHVKTYEV+L++K+F +GPW Q N++  A ++I VP P  G +II
Sbjct: 184  QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 242

Query: 181  GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
            G+E+I Y + + + AI    I+ S    + RVD +GSRYLLGD  G L +L++  E++  
Sbjct: 243  GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 302

Query: 236  ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
                +  L++ELLGETSIA  ++YLDN VV++GS  GDSQL+KLN+  + +GSYV  +E 
Sbjct: 303  GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 362

Query: 293  YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
            + NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W 
Sbjct: 363  FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 422

Query: 353  LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
            LRS  +   D  LV+SF+ +TR+L +N E E+EETE+ GF    QT FC +  + QL+Q+
Sbjct: 423  LRSDPNRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQI 481

Query: 413  TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
            TS SVRLVS   + L +EWK P   +++VA+ N+SQV++A G   L YL+I    L ++ 
Sbjct: 482  TSASVRLVSQEPKALVSEWKEPQAKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 540

Query: 473  HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
            H ++E+E++CLDI P+G++   S + A+G+WTDIS RI  LP   L+ KE LGGEIIPRS
Sbjct: 541  HTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 600

Query: 533  VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
            +L+  FE   YLLCALGDG L  F LN++TG L+DRKKV+LGTQP  LRTF S +TT+VF
Sbjct: 601  ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 660

Query: 593  AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
            A SDRPTVIYSSN KL++SNVNLKEV++MCP NS  +PDSLA+A    LTIGTID+IQKL
Sbjct: 661  ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 720

Query: 653  HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
            HIR++PL E PR+IC+QE S+ F + S +                               
Sbjct: 721  HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGTTALRPSASTQALSSSVSSSKL 780

Query: 682  --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
                      S  EE E+H + ++D  TFE +  +     EY  S++SC    D N Y+ 
Sbjct: 781  FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 840

Query: 734  VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
            VGTA V PEE EP +GRI+VF   DGKLQ +AEKE KGAVYS+  FNGKLLA+IN  ++L
Sbjct: 841  VGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 900

Query: 794  YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
            Y+W       +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK  EG  EE A
Sbjct: 901  YEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 956

Query: 854  RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
            RD+N NWMSAVEILDDD +LGAEN FNLF  +K+S   TDEER  L+ VG +HLGEFVN 
Sbjct: 957  RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 1016

Query: 914  FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
            F HGSLVM+ L ++      +V+FGTVNG+IG++ SL    Y  L  +Q  L KVIK VG
Sbjct: 1017 FCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1076

Query: 973  GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN----------VSV 1022
             + H  WRSF+ E+KT  A  F+DGDLIESFLD+SR +M E+   +            + 
Sbjct: 1077 KIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATA 1136

Query: 1023 EELCKRVEELTRLH 1036
            ++L K VEELTR+H
Sbjct: 1137 DDLIKVVEELTRIH 1150


>gi|384941436|gb|AFI34323.1| DNA damage-binding protein 1 [Macaca mulatta]
          Length = 1140

 Score = 1149 bits (2972), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 578/1094 (52%), Positives = 759/1094 (69%), Gaps = 67/1094 (6%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
            +V +YG+IA +ELFRP GE++D LFI T +Y  C+L++    ES ++ITRA G+V DRIG
Sbjct: 54   EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 113

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RP++ G IGIIDP+CR+IGL LYDGLFKVIP D   +  +AFNIRLEEL V+D+KFLYGC
Sbjct: 114  RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 173

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
              PTI  +YQD +  RHVKTYEV+L++K+F +GPW Q N++  A ++I VP P  G +II
Sbjct: 174  QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232

Query: 181  GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
            G+E+I Y + + + AI    I+ S    + RVD +GSRYLLGD  G L +L++  E++  
Sbjct: 233  GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 292

Query: 236  ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
                +  L++ELLGETSIA  ++YLDN VV++GS  GDSQL+KLN+  + +GSYV  +E 
Sbjct: 293  GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 352

Query: 293  YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
            + NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W 
Sbjct: 353  FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412

Query: 353  LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
            LRS  +   D  LV+SF+ +TR+L +N E E+EETE+ GF    QT FC +  + QL+Q+
Sbjct: 413  LRSDPNRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQI 471

Query: 413  TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
            TS SVRLVS   + L +EWK P   +++VA+ N+SQV++A G   L YL+I    L ++ 
Sbjct: 472  TSASVRLVSQEPKALVSEWKEPQAKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 530

Query: 473  HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
            H ++E+E++CLDI P+G++   S + A+G+WTDIS RI  LP   L+ KE LGGEIIPRS
Sbjct: 531  HTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 590

Query: 533  VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
            +L+  FE   YLLCALGDG L  F LN++TG L+DRKKV+LGTQP  LRTF S +TT+VF
Sbjct: 591  ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650

Query: 593  AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
            A SDRPTVIYSSN KL++SNVNLKEV++MCP NS  +PDSLA+A    LTIGTID+IQKL
Sbjct: 651  ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 710

Query: 653  HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
            HIR++PL E PR+IC+QE S+ F + S +                               
Sbjct: 711  HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGTTALRPSASTQALSSSVSSSKL 770

Query: 682  --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
                      S  EE E+H + ++D  TFE +  +     EY  S++SC    D N Y+ 
Sbjct: 771  FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 830

Query: 734  VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
            VGTA V PEE EP +GRI+VF   DGKLQ +AEKE KGAVYS+  FNGKLLA+IN  ++L
Sbjct: 831  VGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 890

Query: 794  YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
            Y+W       +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK  EG  EE A
Sbjct: 891  YEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 946

Query: 854  RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
            RD+N NWMSAVEILDDD +LGAEN FNLF  +K+S   TDEER  L+ VG +HLGEFVN 
Sbjct: 947  RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 1006

Query: 914  FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
            F HGSLVM+ L ++      +V+FGTVNG+IG++ SL    Y  L  +Q  L KVIK VG
Sbjct: 1007 FCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066

Query: 973  GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN----------VSV 1022
             + H  WRSF+ E+KT  A  F+DGDLIESFLD+SR +M E+   +            + 
Sbjct: 1067 KIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATA 1126

Query: 1023 EELCKRVEELTRLH 1036
            ++L K VEELTR+H
Sbjct: 1127 DDLIKVVEELTRIH 1140


>gi|355752055|gb|EHH56175.1| Damage-specific DNA-binding protein 1, partial [Macaca fascicularis]
          Length = 1125

 Score = 1149 bits (2972), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 578/1094 (52%), Positives = 759/1094 (69%), Gaps = 67/1094 (6%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
            +V +YG+IA +ELFRP GE++D LFI T +Y  C+L++    ES ++ITRA G+V DRIG
Sbjct: 39   EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 98

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RP++ G IGIIDP+CR+IGL LYDGLFKVIP D   +  +AFNIRLEEL V+D+KFLYGC
Sbjct: 99   RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 158

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
              PTI  +YQD +  RHVKTYEV+L++K+F +GPW Q N++  A ++I VP P  G +II
Sbjct: 159  QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 217

Query: 181  GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
            G+E+I Y + + + AI    I+ S    + RVD +GSRYLLGD  G L +L++  E++  
Sbjct: 218  GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 277

Query: 236  ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
                +  L++ELLGETSIA  ++YLDN VV++GS  GDSQL+KLN+  + +GSYV  +E 
Sbjct: 278  GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 337

Query: 293  YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
            + NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W 
Sbjct: 338  FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 397

Query: 353  LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
            LRS  +   D  LV+SF+ +TR+L +N E E+EETE+ GF    QT FC +  + QL+Q+
Sbjct: 398  LRSDPNRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQI 456

Query: 413  TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
            TS SVRLVS   + L +EWK P   +++VA+ N+SQV++A G   L YL+I    L ++ 
Sbjct: 457  TSASVRLVSQEPKALVSEWKEPQAKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 515

Query: 473  HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
            H ++E+E++CLDI P+G++   S + A+G+WTDIS RI  LP   L+ KE LGGEIIPRS
Sbjct: 516  HTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 575

Query: 533  VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
            +L+  FE   YLLCALGDG L  F LN++TG L+DRKKV+LGTQP  LRTF S +TT+VF
Sbjct: 576  ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 635

Query: 593  AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
            A SDRPTVIYSSN KL++SNVNLKEV++MCP NS  +PDSLA+A    LTIGTID+IQKL
Sbjct: 636  ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 695

Query: 653  HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
            HIR++PL E PR+IC+QE S+ F + S +                               
Sbjct: 696  HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGTTALRPSASTQALSSSVSSSKL 755

Query: 682  --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
                      S  EE E+H + ++D  TFE +  +     EY  S++SC    D N Y+ 
Sbjct: 756  FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 815

Query: 734  VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
            VGTA V PEE EP +GRI+VF   DGKLQ +AEKE KGAVYS+  FNGKLLA+IN  ++L
Sbjct: 816  VGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 875

Query: 794  YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
            Y+W       +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK  EG  EE A
Sbjct: 876  YEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 931

Query: 854  RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
            RD+N NWMSAVEILDDD +LGAEN FNLF  +K+S   TDEER  L+ VG +HLGEFVN 
Sbjct: 932  RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 991

Query: 914  FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
            F HGSLVM+ L ++      +V+FGTVNG+IG++ SL    Y  L  +Q  L KVIK VG
Sbjct: 992  FCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1051

Query: 973  GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN----------VSV 1022
             + H  WRSF+ E+KT  A  F+DGDLIESFLD+SR +M E+   +            + 
Sbjct: 1052 KIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATA 1111

Query: 1023 EELCKRVEELTRLH 1036
            ++L K VEELTR+H
Sbjct: 1112 DDLIKVVEELTRIH 1125


>gi|74138855|dbj|BAE27231.1| unnamed protein product [Mus musculus]
          Length = 1140

 Score = 1149 bits (2971), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 578/1094 (52%), Positives = 759/1094 (69%), Gaps = 67/1094 (6%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
            +V +YG+IA +ELFRP GE++D LFI T +Y  C+L++    ES ++ITRA G+V DRIG
Sbjct: 54   EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 113

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RP++ G IGIIDP+CR+IGL LYDGLFKVIP D   +  +AFNIRLEEL V+D+KFLYGC
Sbjct: 114  RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 173

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
              PTI  +YQD +  RHVKTYEV+L++K+F +GPW Q N++  A ++I VP P  G +II
Sbjct: 174  QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232

Query: 181  GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
            G+E+I Y + + + AI    I+ S    + RVD +GSRYLLGD  G L +L++  E++  
Sbjct: 233  GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 292

Query: 236  ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
                +  L++ELLGETSIA  ++YLDN VV++GS  GDSQL+KLN+  + +GSYV  +E 
Sbjct: 293  GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 352

Query: 293  YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
            + NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W 
Sbjct: 353  FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412

Query: 353  LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
            LRS      D  LV+SF+ +TR+L +N E E+EETE+ GF    QT FC +  + QL+Q+
Sbjct: 413  LRSDPGRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQI 471

Query: 413  TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
            TS SVRLVS   + L +EWK P G +++VA+ N+SQV++A G   L YL+I    L ++ 
Sbjct: 472  TSASVRLVSQEPKALVSEWKEPQGKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 530

Query: 473  HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
            H ++E+E++CLDI P+G++   S + A+G+WTDIS RI  LP   L+ KE LGGEIIPRS
Sbjct: 531  HTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 590

Query: 533  VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
            +L+  FE   YLLCALGDG L  F LN++TG L+DRKKV+LGTQP  LRTF S +TT+VF
Sbjct: 591  ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650

Query: 593  AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
            A SDRPTVIYSSN KL++SNVNLKEV++MCP NS  +P+SLA+A    LTIGTID+IQKL
Sbjct: 651  ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPNSLALANNSTLTIGTIDEIQKL 710

Query: 653  HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
            HIR++PL E PR+IC+QE S+ F + S +                               
Sbjct: 711  HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDSSGGTTALRPSASTQALSSSVSSSKL 770

Query: 682  --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
                      S  EE E+H + ++D  TFE +  +     EY  S++SC    D N Y+ 
Sbjct: 771  FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 830

Query: 734  VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
            VGTA V PEE EP +GRI+VF   DGKLQ +AEKE KGAVYS+  FNGKLLA+IN  ++L
Sbjct: 831  VGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 890

Query: 794  YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
            Y+W       +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK  EG  EE A
Sbjct: 891  YEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 946

Query: 854  RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
            RD+N NWMSAVEILDDD +LGAEN FNLF  +K+S   TDEER  L+ VG +HLGEFVN 
Sbjct: 947  RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 1006

Query: 914  FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
            F HGSLVM+ L ++      +V+FGTVNG+IG++ SL    Y  L  +Q  L KVIK VG
Sbjct: 1007 FCHGSLVMQNLGEASTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066

Query: 973  GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN----------VSV 1022
             + H  WRSF+ E+KT  A  F+DGDLIESFLD+SR +M E+   +            + 
Sbjct: 1067 KIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATA 1126

Query: 1023 EELCKRVEELTRLH 1036
            ++L K VEELTR+H
Sbjct: 1127 DDLIKVVEELTRIH 1140


>gi|441604084|ref|XP_004087862.1| PREDICTED: LOW QUALITY PROTEIN: DNA damage-binding protein 1
            [Nomascus leucogenys]
          Length = 1140

 Score = 1148 bits (2970), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 577/1094 (52%), Positives = 758/1094 (69%), Gaps = 67/1094 (6%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
            +V +YG+IA +ELFRP GE++D LFI T +Y  C+L++    ES ++ITRA G+V DRIG
Sbjct: 54   EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 113

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RP++ G IGIIDP+CR+IGL LYDGLFKVIP D   +  +AFNIRLEEL V+D+KFLYGC
Sbjct: 114  RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 173

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
              PTI  +YQD +  RHVKTYEV+L++K+F +GPW Q N++  A ++I VP P  G +II
Sbjct: 174  QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232

Query: 181  GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
            G+E+I Y + + + AI    I+ S    + RVD +GSRYLLGD  G L +L++  E++  
Sbjct: 233  GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 292

Query: 236  ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
                +  L++ELLGETSIA  ++YLDN VV++GS  GDSQL+KLN+  + +GSYV  +E 
Sbjct: 293  GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVAAMET 352

Query: 293  YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
            + NLGP VD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W 
Sbjct: 353  FTNLGPXVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412

Query: 353  LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
            LRS  +   D  LV+SF+ +TR+L +N E E+EETE+ GF    QT FC +  + QL+Q+
Sbjct: 413  LRSDPNRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQI 471

Query: 413  TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
            TS SVRLVS   + L +EWK P   +++VA+ N+SQV++A G   L YL+I    L ++ 
Sbjct: 472  TSASVRLVSQEPKALVSEWKEPQAKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 530

Query: 473  HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
            H ++E+E++CLDI P+G++   S + A+G+WTDIS RI  LP   L+ KE LGGEIIPRS
Sbjct: 531  HTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 590

Query: 533  VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
            +L+  FE   YLLCALGDG L  F LN++TG L+DRKKV+LGTQP  LRTF S +TT+VF
Sbjct: 591  ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650

Query: 593  AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
            A SDRPTVIYSSN KL++SNVNLKEV++MCP NS  +PDSLA+A    LTIGTID+IQKL
Sbjct: 651  ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 710

Query: 653  HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
            HIR++PL E PR+IC+QE S+ F + S +                               
Sbjct: 711  HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGTTALRPSASTQALSSSVSSSKL 770

Query: 682  --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
                      S  EE E+H + ++D  TFE +  +     EY  S++SC    D N Y+ 
Sbjct: 771  FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 830

Query: 734  VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
            VGTA V PEE EP +GRI+VF   DGKLQ +AEKE KGAVYS+  FNGKLLA+IN  ++L
Sbjct: 831  VGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 890

Query: 794  YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
            Y+W       +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK  EG  EE A
Sbjct: 891  YEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 946

Query: 854  RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
            RD+N NWMSAVEILDDD +LGAEN FNLF  +K+S   TDEER  L+ VG +HLGEFVN 
Sbjct: 947  RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 1006

Query: 914  FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
            F HGSLVM+ L ++      +V+FGTVNG+IG++ SL    Y  L  +Q  L KVIK VG
Sbjct: 1007 FCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066

Query: 973  GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN----------VSV 1022
             + H  WRSF+ E+KT  A  F+DGDLIESFLD+SR +M E+   +            + 
Sbjct: 1067 KIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATA 1126

Query: 1023 EELCKRVEELTRLH 1036
            ++L K VEELTR+H
Sbjct: 1127 DDLIKVVEELTRIH 1140


>gi|259155222|ref|NP_001158852.1| DNA damage-binding protein 1 [Salmo salar]
 gi|223647700|gb|ACN10608.1| DNA damage-binding protein 1 [Salmo salar]
          Length = 1139

 Score = 1148 bits (2969), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 582/1096 (53%), Positives = 762/1096 (69%), Gaps = 72/1096 (6%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
            +V +YG+IA +ELFRP GE++D LFI T +Y  C+L++  + ES ++ITRA G+V DRIG
Sbjct: 54   EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQNGESIDIITRAHGNVQDRIG 113

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RP++ G IGI+DP+CR+IGL LYDGLFKVIP D + +  +AFNIRLEELQV+D+ FLYGC
Sbjct: 114  RPSETGIIGIVDPECRMIGLRLYDGLFKVIPLDRENRELKAFNIRLEELQVIDVHFLYGC 173

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
              PT+  +YQD +  RHVKTYEV+L++K+F +GPW Q N++  A ++IPVP P  G +II
Sbjct: 174  QAPTVCFIYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIPVPEPFGGAIII 232

Query: 181  GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKV 237
            G+E+I Y + + + AI    I+ S    + RVD +GSRYLLGD  G L +L++  EKE++
Sbjct: 233  GQESITYHNGDKYLAIAPPTIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLL--EKEEL 290

Query: 238  TG-------LKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVL 290
                     L++ELLGETSIA  ++YLDN VV++GS  GDSQL+KLN+  +  GSYV V+
Sbjct: 291  MDGAVVLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNDSGSYVAVM 350

Query: 291  ERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGM 350
            E + NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+
Sbjct: 351  ETFTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGL 410

Query: 351  WSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLV 410
            W LRS      D  LV+SF+ +TR+L ++ E E+EETE+ GF    QT +C +  + QL+
Sbjct: 411  WPLRSEAGRETDDMLVLSFVGQTRVLMLSGE-EVEETELPGFVDNLQTFYCGNVAHQQLI 469

Query: 411  QVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTE 470
            Q+TSG VRLV   S+ L +EWK P G +++VA  N+SQV+LA G   L YL+I  G L +
Sbjct: 470  QITSGGVRLVMQDSKALVSEWKEPQGRNISVAACNSSQVVLAVGRA-LYYLQILSGELKQ 528

Query: 471  VKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIP 530
            +   ++E+E++CLDI P+GE    S + AVG+WTDIS R+  LP    + KE LGGEIIP
Sbjct: 529  ISTVEMEHEVACLDITPLGEGGE-SPLCAVGLWTDISARVLKLPCFTALHKEMLGGEIIP 587

Query: 531  RSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTH 590
            RS+L+  FEG  YLLCALGDG L  F L++ TG L++RKKV+LGTQP  LRTF S +T++
Sbjct: 588  RSILMTTFEGGYYLLCALGDGALFYFGLDLTTGVLSERKKVTLGTQPTVLRTFRSLSTSN 647

Query: 591  VFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQ 650
            VFA SDRPTVIYSSN KL++SNVNLKEV++MCP NS  +PDSLA+A    LTIGTID+IQ
Sbjct: 648  VFACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSEGYPDSLALANNSTLTIGTIDEIQ 707

Query: 651  KLHIRSIPLGEHPRRICHQEQSRTFAICSLK----------------------------- 681
            KLHIR++PL E PRRIC+QE S+ F + S +                             
Sbjct: 708  KLHIRTVPLYESPRRICYQEVSQCFGVLSSRVEMQDASGTTAAVRPSASTQALSSSVSSS 767

Query: 682  ----------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVY 731
                        S  EE E+H + ++D  TFE +  +     EY  S++SC    D  VY
Sbjct: 768  KLFPSSTSPHETSFGEEVEVHSLLVVDQHTFEVLHAHQFLQSEYALSMVSCRLGRDPAVY 827

Query: 732  YCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKI 791
            + VGTA V PEE EP +GRI+VF   DGKLQ +AEKE KGAVYS+  FNGKLLA+IN  +
Sbjct: 828  FIVGTAMVYPEEAEPKQGRIIVFHYTDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTV 887

Query: 792  QLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEE 851
            +LY+W       +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK  EG  EE
Sbjct: 888  RLYEWT----AEKELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEE 943

Query: 852  RARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFV 911
             ARD+N NWMSAVEILDDD +LGAEN FNLF  +K+S   TDEER  L+ VG +HLGEFV
Sbjct: 944  IARDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGVFHLGEFV 1003

Query: 912  NRFRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKG 970
            N F HGSLV++ L +S      +V+FGTVNG+IG++ SL    Y  L  LQ  L KVIK 
Sbjct: 1004 NVFSHGSLVLQNLGESSTPTQGSVLFGTVNGMIGLVTSLSEGWYSLLLDLQNRLNKVIKS 1063

Query: 971  VGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV---------- 1020
            VG + H  WRSF+ E+KT  A  F+DGDLIESFLDL R +M E+  T+ +          
Sbjct: 1064 VGKIEHSFWRSFHTERKTEQATGFIDGDLIESFLDLGRAKMQEVVSTLQIDDGSGMKREA 1123

Query: 1021 SVEELCKRVEELTRLH 1036
            +V+E+ K VEELTR+H
Sbjct: 1124 TVDEVIKIVEELTRIH 1139


>gi|311247551|ref|XP_003122699.1| PREDICTED: DNA damage-binding protein 1-like isoform 1 [Sus scrofa]
          Length = 1140

 Score = 1148 bits (2969), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 578/1094 (52%), Positives = 760/1094 (69%), Gaps = 67/1094 (6%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
            +V +YG+IA +ELFRP GE++D LFI T +Y  C+L++    ES ++ITRA G+V DRIG
Sbjct: 54   EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 113

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RP++ G IGIIDP+CR+IGL LYDGLFKVIP D   +  +AFNIRLEEL V+D+KFLYGC
Sbjct: 114  RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 173

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
              PTI  +YQD +  RHVKTYEV+L++K+F +GPW Q N++  A ++I VP P  G +II
Sbjct: 174  QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232

Query: 181  GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
            G+E+I Y + + + AI    I+ S    + RVD +GSRYLLGD  G L +L++  E++  
Sbjct: 233  GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 292

Query: 236  ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
                +  L++ELLGETSIA  ++YLDN VV++GS  GDSQL+KLN+  + +GSYV  +E 
Sbjct: 293  GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 352

Query: 293  YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
            + NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W 
Sbjct: 353  FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412

Query: 353  LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
            LRS  +   D  LV+SF+ +TR+L +N E E+EETE+ GF    QT FC +  + QL+Q+
Sbjct: 413  LRSDPNRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQI 471

Query: 413  TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
            TS SVRLVS   + L +EWK P G +++VA+ N++QV++A G   L YL+I    L ++ 
Sbjct: 472  TSASVRLVSQEPKALVSEWKEPQGKNISVASCNSNQVVVAVGRA-LYYLQIHPQELRQIS 530

Query: 473  HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
            H ++E+E++CLDI P+G++   S + A+G+WTDIS RI  LP   L+ KE LGGEIIPRS
Sbjct: 531  HTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARISKLPSFELLHKEMLGGEIIPRS 590

Query: 533  VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
            +L+  FE   YLLCALGDG L  F LN++TG L+DRKKV+LGTQP  LRTF S +TT+VF
Sbjct: 591  ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650

Query: 593  AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
            A SDRPTVIYSSN KL++SNVNLKEV++MCP NS  +PDSLA+A    LTIGTID+IQKL
Sbjct: 651  ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 710

Query: 653  HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
            HIR++PL E PR+IC+QE S+ F + S +                               
Sbjct: 711  HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGTTALRPSASTQALSSSVSSSKL 770

Query: 682  --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
                      S  EE E+H + ++D  TFE +  +     EY  S++SC    D N Y+ 
Sbjct: 771  FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 830

Query: 734  VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
            VGTA V PEE EP +GRI+VF   DGKLQ +AEKE KGAVYS+  FNGKLLA+IN  ++L
Sbjct: 831  VGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 890

Query: 794  YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
            Y+W       +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK  EG  EE A
Sbjct: 891  YEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 946

Query: 854  RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
            RD+N NWMSAVEILDDD +LGAEN FNLF  +K+S   TDEER  L+ VG +HLGEFVN 
Sbjct: 947  RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 1006

Query: 914  FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
            F HGSLVM+ L ++      +V+FGTVNG+IG++ SL    Y  L  +Q  L KVIK VG
Sbjct: 1007 FCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066

Query: 973  GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN----------VSV 1022
             + H  WRSF+ E+KT  A  F+DGDLIESFLD+SR +M E+   +            + 
Sbjct: 1067 KIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATA 1126

Query: 1023 EELCKRVEELTRLH 1036
            ++L K VEELTR+H
Sbjct: 1127 DDLIKVVEELTRIH 1140


>gi|197097564|ref|NP_001126613.1| DNA damage-binding protein 1 [Pongo abelii]
 gi|75041202|sp|Q5R649.1|DDB1_PONAB RecName: Full=DNA damage-binding protein 1; AltName:
            Full=Damage-specific DNA-binding protein 1
 gi|55732122|emb|CAH92767.1| hypothetical protein [Pongo abelii]
          Length = 1140

 Score = 1147 bits (2968), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 577/1094 (52%), Positives = 758/1094 (69%), Gaps = 67/1094 (6%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
            +V +YG+IA +ELFRP GE++D LFI T +Y  C+L++    ES ++ITRA G+V DRIG
Sbjct: 54   EVGMYGKIAVMELFRPKGESKDLLFILTAKYNVCILEYKQSGESIDIITRAHGNVQDRIG 113

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RP++ G IGIIDP+CR+IGL LYDGLFKVIP D   +  +AFNIRLEEL V+D+KFLYGC
Sbjct: 114  RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 173

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
              PTI  +YQD +  RHVKTYEV+L++K+F +GPW Q N++  A ++I VP P  G +II
Sbjct: 174  QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232

Query: 181  GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
            G+E+I Y + + + AI    I+ S    + RVD +GSRYLLGD  G L +L++  E++  
Sbjct: 233  GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 292

Query: 236  ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
                +  L++ELLGETSIA  ++YLDN VV++GS  GDSQL+KLN+  + +GSYV  +E 
Sbjct: 293  GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 352

Query: 293  YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
            + NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W 
Sbjct: 353  FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412

Query: 353  LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
            LRS  +   D  LV+SF+ +TR+L +N E E+EETE+ GF    QT FC +  + QL+Q+
Sbjct: 413  LRSDPNRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQI 471

Query: 413  TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
            TS SVRLVS   + L +EWK P   +++VA+ N+SQV++A G   L YL+I    L ++ 
Sbjct: 472  TSASVRLVSQEPKALVSEWKEPQAKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 530

Query: 473  HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
            H ++E+E++CLDI P+G++   S + A+G+WTDIS RI  LP   L+ KE LGGEIIPRS
Sbjct: 531  HTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 590

Query: 533  VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
            +L+  FE   YLLCALGDG L  F LN++TG L+DRKKV+LGTQP  LRTF S +TT+VF
Sbjct: 591  ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650

Query: 593  AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
            A SDRPTVIYSSN KL++SNVNLKEV++MCP NS  +PDSLA+A    LTIGTID+IQKL
Sbjct: 651  ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 710

Query: 653  HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
            HIR++PL E PR+IC+QE S+ F + S +                               
Sbjct: 711  HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGTTALRPSASTQALSSSVSSSKL 770

Query: 682  --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
                      S  EE E+H + ++D  TFE +  +     EY  S++SC    D N Y+ 
Sbjct: 771  FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 830

Query: 734  VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
            VGTA V PEE EP +GRI+VF   DGKLQ +AEKE KGAVY +  FNGKLLA+IN  ++L
Sbjct: 831  VGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYPMVEFNGKLLASINSTVRL 890

Query: 794  YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
            Y+W       +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK  EG  EE A
Sbjct: 891  YEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 946

Query: 854  RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
            RD+N NWMSAVEILDDD +LGAEN FNLF  +K+S   TDEER  L+ VG +HLGEFVN 
Sbjct: 947  RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 1006

Query: 914  FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
            F HGSLVM+ L ++      +V+FGTVNG+IG++ SL    Y  L  +Q  L KVIK VG
Sbjct: 1007 FCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066

Query: 973  GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN----------VSV 1022
             + H  WRSF+ E+KT  A  F+DGDLIESFLD+SR +M E+   +            + 
Sbjct: 1067 KIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATA 1126

Query: 1023 EELCKRVEELTRLH 1036
            ++L K VEELTR+H
Sbjct: 1127 DDLIKVVEELTRIH 1140


>gi|327278830|ref|XP_003224163.1| PREDICTED: DNA damage-binding protein 1-like [Anolis carolinensis]
          Length = 1140

 Score = 1147 bits (2968), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 579/1094 (52%), Positives = 760/1094 (69%), Gaps = 67/1094 (6%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
            +V +YG+IA +ELFRP GE +D LFI T +Y  C+L++    ES ++ITRA G+V DRIG
Sbjct: 54   EVGMYGKIAVMELFRPKGENKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 113

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RP++ G IGIIDP+CR+IGL LYDGLFKVIP + + +  +AFNIRLEELQV+D+KFLYGC
Sbjct: 114  RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLERENKELKAFNIRLEELQVIDVKFLYGC 173

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
              PTI  +YQD +  RHVKTYEV+L++K+F +GPW Q N++  A ++I VP P  G +II
Sbjct: 174  QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232

Query: 181  GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKV 237
            G+E+I Y + + + AI    I+ S    + RVD +GSRYLLGD  G L +L++  E++  
Sbjct: 233  GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 292

Query: 238  TG-----LKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
             G     L++ELLGETSIA  ++YLDN VV++GS  GDSQL+KLN+  + +GSYV  +E 
Sbjct: 293  GGVSLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 352

Query: 293  YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
            + NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W 
Sbjct: 353  FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412

Query: 353  LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
            LRS      D  LV+SF+ +TR+L +N E E+EETE+ GF    QT FC +  + QL+Q+
Sbjct: 413  LRSDPHHETDNTLVLSFVGQTRVLMLNGE-EVEETELTGFVDDQQTFFCGNVAHQQLIQI 471

Query: 413  TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
            TS SVRLV+   + L +EWK P G +++VA+ N+SQV++A G   L YLEI    L ++ 
Sbjct: 472  TSASVRLVTQEPKALVSEWKEPNGKNISVASCNSSQVVVAVGRV-LYYLEIHPQELKQIS 530

Query: 473  HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
              ++E+E++CLDI P+G+    S + A+G+WTDIS RI  LP  +L+ KE LGGEIIPRS
Sbjct: 531  CTEMEHEVACLDITPLGDASGMSPLCAIGLWTDISARILKLPSFDLLHKEMLGGEIIPRS 590

Query: 533  VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
            +L+  FE   YLLCALGDG L  F LN++TG L+DRKKV+LGTQP  LRTF S +TT+VF
Sbjct: 591  ILMTTFENSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650

Query: 593  AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
            A SDRPTVIYSSN KL++SNVNLKEV++MCP NS  +PDSLA+A    LTIGTID+IQKL
Sbjct: 651  ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 710

Query: 653  HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
            HIR++PL E PR+IC+QE S+ F + S +                               
Sbjct: 711  HIRTVPLSESPRKICYQEVSQCFGVLSSRIEVQDASGGTTALRPSASTQALSSSVSSSKL 770

Query: 682  --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
                      S  EE E+H + ++D  TFE +  +     EY  S++SC    D N Y+ 
Sbjct: 771  FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 830

Query: 734  VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
            VGTA V P+E EP +GRI+VF   DGKLQ +AEKE KGAVYS+  FNGKLLA+IN  ++L
Sbjct: 831  VGTAMVYPDEAEPKQGRIVVFHYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 890

Query: 794  YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
            Y+W       +EL++EC H+ +I+ALYV+T+GDFI+VGDLM+S+ LL YK  EG  EE A
Sbjct: 891  YEWT----AEKELRTECNHYNNIMALYVKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 946

Query: 854  RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
            RD+N NWMSAVEILDDD +LGAEN FNLF  +K+S   TDEER  L+  G +HLGEFVN 
Sbjct: 947  RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEFGLFHLGEFVNV 1006

Query: 914  FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
            F HGSLVM+ L ++      +V+FGTVNG+IG++ SL    Y  L  +Q  L KVIK VG
Sbjct: 1007 FCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDVQNRLNKVIKSVG 1066

Query: 973  GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV----------SV 1022
             + H  WRSF+ E+KT  A  F+DGDLIESFLD+SR +M E+   + +          +V
Sbjct: 1067 KIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQIDDGSGMKREATV 1126

Query: 1023 EELCKRVEELTRLH 1036
            ++L K VEELTR+H
Sbjct: 1127 DDLIKIVEELTRIH 1140


>gi|5353754|gb|AAD42230.1|AF159853_1 damage-specific DNA binding protein 1 [Mus musculus]
          Length = 1140

 Score = 1147 bits (2968), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 580/1095 (52%), Positives = 759/1095 (69%), Gaps = 69/1095 (6%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
            +V +YG+IA +ELFRP GE++D LFI T +Y  C+L++    ES ++ITRA G+V DRIG
Sbjct: 54   EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 113

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RP++ G IGIIDP+CR+IGL LYDGLFKVIP D   +  +AFNIRLEEL V+D+KFLYGC
Sbjct: 114  RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 173

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
              PTI  +YQD +  RHVKTYEV+L++K+F +GPW Q N++  A ++I VP P  G +II
Sbjct: 174  QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232

Query: 181  GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKV 237
            G+E+I Y + + + AI    I+ S    + RVD +GSRYLLGD  G L +L++  E E++
Sbjct: 233  GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKE-EQM 291

Query: 238  TG------LKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLE 291
             G      L++ELLGETSIA  ++YLDN VV++GS  GDSQL+KLN+  + +GSYV  +E
Sbjct: 292  DGTVTPKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAME 351

Query: 292  RYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMW 351
             + NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W
Sbjct: 352  TFTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLW 411

Query: 352  SLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQ 411
             LRS      D  LV+SF+ +TR+L +N E E+EETE+ GF    QT FC +  + QL+Q
Sbjct: 412  PLRSDPGRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQ 470

Query: 412  VTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEV 471
            +TS SVRLVS   + L +EWK P G +++VA+ N+SQV++A G   L YL+I    L ++
Sbjct: 471  ITSASVRLVSQEPKALVSEWKEPQGKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQI 529

Query: 472  KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPR 531
             H ++E+E++CLDI P+G++   S + A+G+WTDIS RI  LP   L+ KE LGGEIIPR
Sbjct: 530  SHTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPR 589

Query: 532  SVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHV 591
            S+L+  FE   YLLCALGDG L  F LN++TG L+DRKKV+LGTQP  LRTF S +TT+V
Sbjct: 590  SILMTTFESSRYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNV 649

Query: 592  FAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQK 651
            FA SDRPTVIYS N KL++SNVNLKEV++MCP NS  +PDSLA+A    LTIGTID+IQK
Sbjct: 650  FACSDRPTVIYSXNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQK 709

Query: 652  LHIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------ 681
            LHIR++PL E PR+IC+QE S+ F + S +                              
Sbjct: 710  LHIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDSSGGTTALRPSASTQALSSSVSSSK 769

Query: 682  ---------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYY 732
                       S  EE E+H + ++D  TFE +  +     EY  S++SC    D N Y+
Sbjct: 770  LFSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYF 829

Query: 733  CVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQ 792
             VGTA V PEE EP +GRI+VF   DGKLQ +AEKE KGAVYS+  FNGKLLA+IN  ++
Sbjct: 830  IVGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVR 889

Query: 793  LYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEER 852
            LY+W       +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK  EG  EE 
Sbjct: 890  LYEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEI 945

Query: 853  ARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVN 912
            ARD+N NWMSAVEILDDD +LGAEN FNLF  +K+S   TDEER  L+ VG +HLGEFVN
Sbjct: 946  ARDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVN 1005

Query: 913  RFRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGV 971
             F HGSLVM+ L ++      +V+FGTVNG+IG++ SL    Y  L  +Q  L KVIK V
Sbjct: 1006 VFCHGSLVMQNLGEASTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSV 1065

Query: 972  GGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN----------VS 1021
            G + H  WRSF+ E+KT  A  F+DGDLIESFLD+SR +M E+   +            +
Sbjct: 1066 GKIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREAT 1125

Query: 1022 VEELCKRVEELTRLH 1036
             ++L K VEELTR+H
Sbjct: 1126 ADDLIKVVEELTRIH 1140


>gi|358440058|pdb|4A08|A Chain A, Structure Of Hsddb1-Drddb2 Bound To A 13 Bp Cpd-Duplex (
            Purine At D-1 Position) At 3.0 A Resolution (Cpd 1)
 gi|358440062|pdb|4A09|A Chain A, Structure Of Hsddb1-Drddb2 Bound To A 15 Bp Cpd-Duplex
            (Purine At D-1 Position) At 3.1 A Resolution (Cpd 2)
          Length = 1159

 Score = 1147 bits (2967), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 577/1094 (52%), Positives = 758/1094 (69%), Gaps = 67/1094 (6%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
            +V +YG+IA +ELFRP GE++D LFI T +Y  C+L++    ES ++ITRA G+V DRIG
Sbjct: 73   EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 132

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RP++ G IGIIDP+CR+IGL LYDGLFKVIP D   +  +AFNIRLEEL V+D+KFLYGC
Sbjct: 133  RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 192

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
              PTI  +YQD +  RHVKTY V+L++K+F +GPW Q N++  A ++I VP P  G +II
Sbjct: 193  QAPTICFVYQDPQ-GRHVKTYAVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 251

Query: 181  GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
            G+E+I Y + + + AI    I+ S    + RVD +GSRYLLGD  G L +L++  E++  
Sbjct: 252  GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 311

Query: 236  ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
                +  L++ELLGETSIA  ++YLDN VV++GS  GDSQL+KLN+  + +GSYV  +E 
Sbjct: 312  GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 371

Query: 293  YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
            + NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W 
Sbjct: 372  FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 431

Query: 353  LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
            LRS  +   D  LV+SF+ +TR+L +N E E+EETE+ GF    QT FC +  + QL+Q+
Sbjct: 432  LRSDPNRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQI 490

Query: 413  TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
            TS SVRLVS   + L +EWK P   +++VA+ N+SQV++A G   L YL+I    L ++ 
Sbjct: 491  TSASVRLVSQEPKALVSEWKEPQAKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 549

Query: 473  HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
            H ++E+E++CLDI P+G++   S + A+G+WTDIS RI  LP   L+ KE LGGEIIPRS
Sbjct: 550  HTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 609

Query: 533  VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
            +L+  FE   YLLCALGDG L  F LN++TG L+DRKKV+LGTQP  LRTF S +TT+VF
Sbjct: 610  ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 669

Query: 593  AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
            A SDRPTVIYSSN KL++SNVNLKEV++MCP NS  +PDSLA+A    LTIGTID+IQKL
Sbjct: 670  ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 729

Query: 653  HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
            HIR++PL E PR+IC+QE S+ F + S +                               
Sbjct: 730  HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGTTALRPSASTQALSSSVSSSKL 789

Query: 682  --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
                      S  EE E+H + ++D  TFE +  +     EY  S++SC    D N Y+ 
Sbjct: 790  FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 849

Query: 734  VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
            VGTA V PEE EP +GRI+VF   DGKLQ +AEKE KGAVYS+  FNGKLLA+IN  ++L
Sbjct: 850  VGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 909

Query: 794  YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
            Y+W       +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK  EG  EE A
Sbjct: 910  YEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 965

Query: 854  RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
            RD+N NWMSAVEILDDD +LGAEN FNLF  +K+S   TDEER  L+ VG +HLGEFVN 
Sbjct: 966  RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 1025

Query: 914  FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
            F HGSLVM+ L ++      +V+FGTVNG+IG++ SL    Y  L  +Q  L KVIK VG
Sbjct: 1026 FCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1085

Query: 973  GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN----------VSV 1022
             + H  WRSF+ E+KT  A  F+DGDLIESFLD+SR +M E+   +            + 
Sbjct: 1086 KIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATA 1145

Query: 1023 EELCKRVEELTRLH 1036
            ++L K VEELTR+H
Sbjct: 1146 DDLIKVVEELTRIH 1159


>gi|45383688|ref|NP_989547.1| DNA damage-binding protein 1 [Gallus gallus]
 gi|82098863|sp|Q805F9.1|DDB1_CHICK RecName: Full=DNA damage-binding protein 1; AltName: Full=DDB p127
            subunit; AltName: Full=Damage-specific DNA-binding
            protein 1; AltName: Full=UV-damaged DNA-binding factor
 gi|28375613|dbj|BAC56999.1| damaged-DNA binding protein DDB p127 subunit [Gallus gallus]
 gi|53130071|emb|CAG31438.1| hypothetical protein RCJMB04_6h2 [Gallus gallus]
          Length = 1140

 Score = 1147 bits (2967), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 576/1094 (52%), Positives = 761/1094 (69%), Gaps = 67/1094 (6%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
            +V +YG+ A +ELFRP GE++D LFI T +Y  C+L++  + ++ ++ITRA G+V DRIG
Sbjct: 54   EVGMYGKTAVMELFRPKGESKDLLFILTAKYNACILEYKQNGDNIDIITRAHGNVQDRIG 113

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RP++ G IGIIDP+CR+IGL LYDGLFKVIP D + +  +AFNIRLEELQV+D+KFLYGC
Sbjct: 114  RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRENKELKAFNIRLEELQVIDVKFLYGC 173

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
              PTI  +YQD +  RHVKTYEV+L++K+F +GPW Q N++  A ++I VP P  G +II
Sbjct: 174  QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232

Query: 181  GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
            G+E+I Y + + + AI    I+ S    + RVD +GSRYLLGD  G L +L++  E++  
Sbjct: 233  GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 292

Query: 236  ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
                +  L++ELLGETSIA  ++YLDN VV++GS  GDSQL+KLN+  + +GSYV  +E 
Sbjct: 293  GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 352

Query: 293  YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
            + NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W 
Sbjct: 353  FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412

Query: 353  LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
            LRS +    D  LV+SF+ +TR+L +N E E+EETE+ GF    QT FC +  + QL+Q+
Sbjct: 413  LRSDSHREMDNMLVLSFVGQTRVLMLNGE-EVEETELTGFVDDQQTFFCGNVAHQQLIQI 471

Query: 413  TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
            TS SVRLVS   + L +EWK P G +++VA+ N++QV++A G   L YLEI    L ++ 
Sbjct: 472  TSASVRLVSQEPKALVSEWKEPNGKNISVASCNSNQVVVAVGRA-LYYLEIRPQELRQIN 530

Query: 473  HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
              ++E+E++CLDI P+G+    S + A+G+WTDIS RI  LP   L+ KE LGGEIIPRS
Sbjct: 531  CTEMEHEVACLDITPLGDTNGMSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 590

Query: 533  VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
            +L+  FE   YLLCALGDG L  F L+++TG L+DRKKV+LGTQP  LRTF S +TT+VF
Sbjct: 591  ILMTTFESSHYLLCALGDGALFYFGLSLETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650

Query: 593  AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
            A SDRPTVIYSSN KL++SNVNLKEV++MCP NS  +PDSLA+A    LTIGTID+IQKL
Sbjct: 651  ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 710

Query: 653  HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
            HIR++PL E PR+IC+QE S+ F + S +                               
Sbjct: 711  HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDASGGTTALRPSASTQALSSSVSTSKL 770

Query: 682  --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
                      S  EE E+H + ++D  TFE +  +     EY  S++SC    D N Y+ 
Sbjct: 771  FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 830

Query: 734  VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
            VGTA V PEE EP +GRI+VF   DGKLQ +AEKE KGAVYS+  FNGKLLA+IN  ++L
Sbjct: 831  VGTAMVYPEEAEPKQGRIVVFHYSDGKLQSLAEKEVKGAVYSMVEFNGKLLASINSTVRL 890

Query: 794  YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
            Y+W       +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK  EG  EE A
Sbjct: 891  YEWT----AEKELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 946

Query: 854  RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
            RD+N NWMSAVEILDDD +LGAEN FNLF  +K+S   TDEER  L+ VG  HLGEFVN 
Sbjct: 947  RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLSHLGEFVNV 1006

Query: 914  FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
            F HGSLVM+ L ++      +V+FGTVNG+IG++ SL    Y  L  +Q  L KVIK VG
Sbjct: 1007 FCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066

Query: 973  GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV----------SV 1022
             + H  WRSF+ E+KT  A  F+DGDLIESFLD+SR +M E+   + +          +V
Sbjct: 1067 KIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQIDDGSGMKREATV 1126

Query: 1023 EELCKRVEELTRLH 1036
            ++L K VEELTR+H
Sbjct: 1127 DDLIKIVEELTRIH 1140


>gi|223647932|gb|ACN10724.1| DNA damage-binding protein 1 [Salmo salar]
          Length = 1139

 Score = 1146 bits (2964), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 580/1096 (52%), Positives = 763/1096 (69%), Gaps = 72/1096 (6%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
            +V +YG+IA +ELFRP GE++D LFI T +Y  C+L++  + ES ++ITRA G+V DRIG
Sbjct: 54   EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQNGESIDIITRAHGNVQDRIG 113

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RP++ G IGI+DP+CR+IGL LYDGLFKVIP D + +  +AFNIRLEELQV+D+ FLYGC
Sbjct: 114  RPSETGIIGIVDPECRMIGLRLYDGLFKVIPLDRENRELKAFNIRLEELQVIDVHFLYGC 173

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
              PT+  +YQD +  RHVKTYEV+L++K+F +GPW Q N++  A ++IPVP P  G +II
Sbjct: 174  QAPTVCFIYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIPVPEPFGGAIII 232

Query: 181  GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKV 237
            G+E+I Y + + + AI    I+ S    + RVD +GSRYLLGD  G L +L++  EKE++
Sbjct: 233  GQESITYHNGDKYLAIAPPTIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLL--EKEEL 290

Query: 238  TG-------LKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVL 290
                     L++ELLGETSIA  ++YLDN VV++GS  GDSQL+KLN+  +  GSYV V+
Sbjct: 291  MDGAVVLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNDSGSYVAVM 350

Query: 291  ERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGM 350
            E + NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+
Sbjct: 351  ETFTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGL 410

Query: 351  WSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLV 410
            W LRS      D  LV+SF+ +TR+L ++ E E+EETE+ GF    QT +C +  + QL+
Sbjct: 411  WPLRSEAGRETDDMLVLSFVGQTRVLMLSGE-EVEETELPGFVDNLQTFYCGNVAHQQLI 469

Query: 411  QVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTE 470
            Q+TSG VRLV   S+ L +EWK P G +++VA  N+SQV+LA G   L YL+I  G L +
Sbjct: 470  QITSGGVRLVMQDSKALVSEWKEPQGRNISVAACNSSQVVLAVGRA-LYYLQILSGELKQ 528

Query: 471  VKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIP 530
            +   ++E+E++CLDI P+GE+   S + AVG+WTDIS R+  LP    + KE LGGEIIP
Sbjct: 529  ISTVEMEHEVACLDITPLGEDGD-SPLCAVGLWTDISARVLKLPCFTALHKEMLGGEIIP 587

Query: 531  RSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTH 590
            RS+L+  FE   YLLCALGDG L  F L++ +G L++RKKV+LGTQP  LRTF S +T++
Sbjct: 588  RSILMTTFEASYYLLCALGDGALFYFGLDLTSGVLSERKKVTLGTQPTVLRTFRSLSTSN 647

Query: 591  VFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQ 650
            VFA SDRPTVIYSSN KL++SNVNLKEV++MCP NS  +PDSLA+A    LTIGTID+IQ
Sbjct: 648  VFACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSEGYPDSLALANNSTLTIGTIDEIQ 707

Query: 651  KLHIRSIPLGEHPRRICHQEQSRTFAICSLK----------------------------- 681
            KLHIR++PL E PRRIC+QE S+ F + S +                             
Sbjct: 708  KLHIRTVPLYESPRRICYQEVSQCFGVLSSRVEMQDASGTTAAVRPSASTQALSSSVSSS 767

Query: 682  ----------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVY 731
                        S  EE E+H + ++D  TFE +  +     EY  S++SC    D +VY
Sbjct: 768  KLFPSSTSPHETSFGEEVEVHSLLVVDQHTFEVLHAHQFLQSEYALSMVSCRLGRDLSVY 827

Query: 732  YCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKI 791
            + VGTA V PEE EP +GRI+VF   DGKLQ +AEKE KGAVYS+  FNGKLLA+IN  +
Sbjct: 828  FIVGTAMVYPEEAEPKQGRIIVFHYTDGKLQTVAEKEVKGAVYSMMEFNGKLLASINSTV 887

Query: 792  QLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEE 851
            +LY+W       +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK  EG  EE
Sbjct: 888  RLYEWT----AEKELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEE 943

Query: 852  RARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFV 911
             ARD+N NWMSAVEILDDD +LGAEN FNLF  +K+S   TDEER  L+ VG +HLGEFV
Sbjct: 944  IARDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGVFHLGEFV 1003

Query: 912  NRFRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKG 970
            N F HGSLV++ L +S      +V+FGTVNG+IG++ SL    Y  L  LQ  L KVIK 
Sbjct: 1004 NVFSHGSLVLQNLGESSTPTQGSVLFGTVNGMIGLVTSLSEGWYSLLLDLQNRLNKVIKS 1063

Query: 971  VGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV---------- 1020
            VG + H  WRSF+ E+KT  A  F+DGDLIESFLDL R +M E+  T+ +          
Sbjct: 1064 VGKIEHSFWRSFHTERKTEQATGFIDGDLIESFLDLGRAKMQEVVSTLQIDDGSGMKREA 1123

Query: 1021 SVEELCKRVEELTRLH 1036
            +V+E+ K VEELTR+H
Sbjct: 1124 TVDEVIKIVEELTRIH 1139


>gi|224050582|ref|XP_002191856.1| PREDICTED: DNA damage-binding protein 1 [Taeniopygia guttata]
          Length = 1140

 Score = 1145 bits (2961), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 577/1094 (52%), Positives = 760/1094 (69%), Gaps = 67/1094 (6%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
            +V +YG+ A +ELFRP GE++D LFI T +Y  C+L++  + +S ++ITRA G+V DRIG
Sbjct: 54   EVGMYGKTAVMELFRPKGESKDLLFILTAKYNACILEYKQNGDSIDIITRAHGNVQDRIG 113

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RP++ G IGIIDP+CR+IGL LYDGLFKVIP D   +  +AFNIRLEELQV+D+KFLYGC
Sbjct: 114  RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELQVIDVKFLYGC 173

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
              PTI  +YQD +  RHVKTYEV+L++K+F +GPW Q N++  A ++I VP P  G +II
Sbjct: 174  QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232

Query: 181  GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
            G+E+I Y + + + AI    I+ S    + RVD +GSRYLLGD  G L +L++  E++  
Sbjct: 233  GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 292

Query: 236  ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
                +  L++ELLGETSIA  ++YLDN VV++GS  GDSQL+KLN+  + +GSYV  +E 
Sbjct: 293  GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 352

Query: 293  YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
            + NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W 
Sbjct: 353  FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412

Query: 353  LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
            LRS      D  LV+SF+ +TR+L +N E E+EETE+ GF    QT FC +  + QL+Q+
Sbjct: 413  LRSDPHRETDNTLVLSFVGQTRVLMLNGE-EVEETELTGFVDDQQTFFCGNVAHQQLIQI 471

Query: 413  TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
            TS SVRLVS   + L +EWK P G +++VA+ N++QV++A G   L YLEI    L ++ 
Sbjct: 472  TSASVRLVSQEPKSLVSEWKEPNGKNISVASCNSNQVVVAVGRA-LYYLEIRPQELRQIS 530

Query: 473  HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
              ++E+E++CLDI P+G++   S + A+G+WTDIS RI  LP   L+ KE LGGEIIPRS
Sbjct: 531  CTEMEHEVACLDITPLGDSNGMSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 590

Query: 533  VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
            +L+  FE   YLLCALGDG L  F L+++TG L+DRKKV+LGTQP  LRTF S +TT+VF
Sbjct: 591  ILMTTFESSHYLLCALGDGALFYFGLSLETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650

Query: 593  AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
            A SDRPTVIYSSN KL++SNVNLKEV++MCP NS  +PDSLA+A    LTIGTID+IQKL
Sbjct: 651  ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 710

Query: 653  HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
            HIR++PL E PR+IC+QE S+ F + S +                               
Sbjct: 711  HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDASGGTTALRPSASTQALSSSVSTSKL 770

Query: 682  --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
                      S  EE E+H + ++D  TFE +  +     EY  S++SC    D N Y+ 
Sbjct: 771  FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 830

Query: 734  VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
            VGTA V PEE EP +GRI+VF   DGKLQ +AEKE KGAVYS+  FNGKLLA+IN  ++L
Sbjct: 831  VGTAMVYPEEAEPKQGRIVVFHYSDGKLQSLAEKEVKGAVYSMVEFNGKLLASINSTVRL 890

Query: 794  YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
            Y+W       +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK  EG  EE A
Sbjct: 891  YEWT----AEKELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 946

Query: 854  RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
            RD+N NWMSAVEILDDD +LGAEN FNLF  +K+S   TDEER  L+ VG  HLGEFVN 
Sbjct: 947  RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLSHLGEFVNV 1006

Query: 914  FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
            F HGSLVM+ L ++      +V+FGTVNG+IG++ SL    Y  L  +Q  L KVIK VG
Sbjct: 1007 FCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066

Query: 973  GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV----------SV 1022
             + H  WRSF+ E+KT  A  F+DGDLIESFLD+SR +M E+   + +          +V
Sbjct: 1067 KIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQIDDGSGMKREATV 1126

Query: 1023 EELCKRVEELTRLH 1036
            ++L K VEELTR+H
Sbjct: 1127 DDLIKIVEELTRIH 1140


>gi|90108797|pdb|2B5L|A Chain A, Crystal Structure Of Ddb1 In Complex With Simian Virus 5 V
            Protein
 gi|90108798|pdb|2B5L|B Chain B, Crystal Structure Of Ddb1 In Complex With Simian Virus 5 V
            Protein
 gi|90108801|pdb|2B5M|A Chain A, Crystal Structure Of Ddb1
 gi|116667897|pdb|2HYE|A Chain A, Crystal Structure Of The Ddb1-cul4a-rbx1-sv5v Complex
 gi|1136228|gb|AAA88883.1| UV-damaged DNA binding factor [Homo sapiens]
 gi|1588524|prf||2208446A xeroderma pigmentosum group E-binding factor
          Length = 1140

 Score = 1145 bits (2961), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 578/1095 (52%), Positives = 761/1095 (69%), Gaps = 69/1095 (6%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
            +V +YG+IA +ELFRP GE++D LFI T +Y  C+L++    ES ++ITRA G+V DRIG
Sbjct: 54   EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 113

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RP++ G IGIIDP+CR+IGL LYDGLFKVIP D   +  +AFNIRLEEL V+D+KFLYGC
Sbjct: 114  RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 173

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
              PTI  +YQD +  RHVKTYEV+L++K+F +GPW Q N++  A ++I VP P  G +II
Sbjct: 174  QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232

Query: 181  GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
            G+E+I Y + + + AI    I+ S    + RVD +GSRYLLGD  G L +L++  E++  
Sbjct: 233  GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 292

Query: 236  ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
                +  L++ELLGETSIA  ++YLDN VV++GS  GDSQL+KLN+  + +GSYV  +E 
Sbjct: 293  GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 352

Query: 293  YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
            + NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W 
Sbjct: 353  FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412

Query: 353  LRSSTD-DPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQ 411
            LRS  + + +DT LV+SF+ +TR+L +N E E+EETE+ GF    QT FC +  + QL+Q
Sbjct: 413  LRSDPNRETYDT-LVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQ 470

Query: 412  VTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEV 471
            +TS SVRLVS   + L +EWK P   +++VA+ N+SQV++A G   L YL+I    L ++
Sbjct: 471  ITSASVRLVSQEPKALVSEWKEPQAKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQI 529

Query: 472  KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPR 531
             H ++E+E++CLDI P+G++   S + A+G+WTDIS RI  LP   L+ KE LGGEIIPR
Sbjct: 530  SHTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPR 589

Query: 532  SVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHV 591
            S+L+  FE   YLLCALGDG L  F LN++TG L+DRKKV+LGTQP  LRTF S +TT+V
Sbjct: 590  SILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNV 649

Query: 592  FAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQK 651
            FA SDRPTVIYSSN KL++SNVNLKEV++MCP NS  +PDSLA+A    LTIGTID+IQK
Sbjct: 650  FACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQK 709

Query: 652  LHIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------ 681
            LHIR++PL E PR+IC+QE S+ F + S +                              
Sbjct: 710  LHIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGTTALRPSASTQALSSSVSSSK 769

Query: 682  ---------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYY 732
                       S  EE E+H + ++D  TFE +  +     EY  S++SC    D N Y+
Sbjct: 770  LFSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYF 829

Query: 733  CVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQ 792
             VGTA V PEE EP +GRI+VF   DGKLQ +AEKE KGAVYS+  FNGKLLA+IN  ++
Sbjct: 830  IVGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVR 889

Query: 793  LYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEER 852
            LY+W    D    +++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK  EG  EE 
Sbjct: 890  LYEWTTEKD----VRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEI 945

Query: 853  ARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVN 912
            ARD+N NWMSAVEILDDD +LGAEN FNLF  +K+S   TDEER  L+ VG +HLGEFVN
Sbjct: 946  ARDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVN 1005

Query: 913  RFRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGV 971
             F HGSLVM+ L ++      +V+FGTVNG+IG++ SL    Y  L  +Q  L KVIK V
Sbjct: 1006 VFCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSV 1065

Query: 972  GGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN----------VS 1021
            G + H  WRSF+ E+KT  A  F+DGDLIESFLD+SR +M E+   +            +
Sbjct: 1066 GKIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREAT 1125

Query: 1022 VEELCKRVEELTRLH 1036
             ++L K VEELTR+H
Sbjct: 1126 ADDLIKVVEELTRIH 1140


>gi|270346571|pdb|3I7H|A Chain A, Crystal Structure Of Ddb1 In Complex With The H-Box Motif Of
            Hbx
 gi|270346573|pdb|3I7K|A Chain A, Crystal Structure Of Ddb1 In Complex With The H-Box Motif Of
            Whx
 gi|270346575|pdb|3I7L|A Chain A, Crystal Structure Of Ddb1 In Complex With The H-Box Motif Of
            Ddb2
 gi|270346577|pdb|3I7N|A Chain A, Crystal Structure Of Ddb1 In Complex With The H-Box Motif Of
            Wdtc1
 gi|270346579|pdb|3I7O|A Chain A, Crystal Structure Of Ddb1 In Complex With The H-Box Motif Of
            Iqwd1
 gi|270346581|pdb|3I7P|A Chain A, Crystal Structure Of Ddb1 In Complex With The H-Box Motif Of
            Wdr40a
 gi|270346583|pdb|3I89|A Chain A, Crystal Structure Of Ddb1 In Complex With The H-Box Motif Of
            Wdr22
 gi|270346585|pdb|3I8C|A Chain A, Crystal Structure Of Ddb1 In Complex With The H-Box Motif Of
            Wdr21a
 gi|270346587|pdb|3I8E|A Chain A, Crystal Structure Of Ddb1 In Complex With The H-Box Motif Of
            Wdr42a
 gi|270346588|pdb|3I8E|B Chain B, Crystal Structure Of Ddb1 In Complex With The H-Box Motif Of
            Wdr42a
          Length = 1143

 Score = 1144 bits (2960), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 578/1095 (52%), Positives = 761/1095 (69%), Gaps = 69/1095 (6%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
            +V +YG+IA +ELFRP GE++D LFI T +Y  C+L++    ES ++ITRA G+V DRIG
Sbjct: 57   EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 116

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RP++ G IGIIDP+CR+IGL LYDGLFKVIP D   +  +AFNIRLEEL V+D+KFLYGC
Sbjct: 117  RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 176

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
              PTI  +YQD +  RHVKTYEV+L++K+F +GPW Q N++  A ++I VP P  G +II
Sbjct: 177  QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 235

Query: 181  GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
            G+E+I Y + + + AI    I+ S    + RVD +GSRYLLGD  G L +L++  E++  
Sbjct: 236  GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 295

Query: 236  ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
                +  L++ELLGETSIA  ++YLDN VV++GS  GDSQL+KLN+  + +GSYV  +E 
Sbjct: 296  GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 355

Query: 293  YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
            + NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W 
Sbjct: 356  FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 415

Query: 353  LRSSTD-DPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQ 411
            LRS  + + +DT LV+SF+ +TR+L +N E E+EETE+ GF    QT FC +  + QL+Q
Sbjct: 416  LRSDPNRETYDT-LVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQ 473

Query: 412  VTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEV 471
            +TS SVRLVS   + L +EWK P   +++VA+ N+SQV++A G   L YL+I    L ++
Sbjct: 474  ITSASVRLVSQEPKALVSEWKEPQAKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQI 532

Query: 472  KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPR 531
             H ++E+E++CLDI P+G++   S + A+G+WTDIS RI  LP   L+ KE LGGEIIPR
Sbjct: 533  SHTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPR 592

Query: 532  SVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHV 591
            S+L+  FE   YLLCALGDG L  F LN++TG L+DRKKV+LGTQP  LRTF S +TT+V
Sbjct: 593  SILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNV 652

Query: 592  FAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQK 651
            FA SDRPTVIYSSN KL++SNVNLKEV++MCP NS  +PDSLA+A    LTIGTID+IQK
Sbjct: 653  FACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQK 712

Query: 652  LHIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------ 681
            LHIR++PL E PR+IC+QE S+ F + S +                              
Sbjct: 713  LHIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGTTALRPSASTQALSSSVSSSK 772

Query: 682  ---------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYY 732
                       S  EE E+H + ++D  TFE +  +     EY  S++SC    D N Y+
Sbjct: 773  LFSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYF 832

Query: 733  CVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQ 792
             VGTA V PEE EP +GRI+VF   DGKLQ +AEKE KGAVYS+  FNGKLLA+IN  ++
Sbjct: 833  IVGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVR 892

Query: 793  LYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEER 852
            LY+W    D    +++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK  EG  EE 
Sbjct: 893  LYEWTTEKD----VRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEI 948

Query: 853  ARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVN 912
            ARD+N NWMSAVEILDDD +LGAEN FNLF  +K+S   TDEER  L+ VG +HLGEFVN
Sbjct: 949  ARDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVN 1008

Query: 913  RFRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGV 971
             F HGSLVM+ L ++      +V+FGTVNG+IG++ SL    Y  L  +Q  L KVIK V
Sbjct: 1009 VFCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSV 1068

Query: 972  GGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN----------VS 1021
            G + H  WRSF+ E+KT  A  F+DGDLIESFLD+SR +M E+   +            +
Sbjct: 1069 GKIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREAT 1128

Query: 1022 VEELCKRVEELTRLH 1036
             ++L K VEELTR+H
Sbjct: 1129 ADDLIKVVEELTRIH 1143


>gi|221046721|pdb|3EI4|A Chain A, Structure Of The Hsddb1-Hsddb2 Complex
 gi|221046723|pdb|3EI4|C Chain C, Structure Of The Hsddb1-Hsddb2 Complex
 gi|221046725|pdb|3EI4|E Chain E, Structure Of The Hsddb1-Hsddb2 Complex
          Length = 1158

 Score = 1144 bits (2960), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 578/1095 (52%), Positives = 761/1095 (69%), Gaps = 69/1095 (6%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
            +V +YG+IA +ELFRP GE++D LFI T +Y  C+L++    ES ++ITRA G+V DRIG
Sbjct: 72   EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 131

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RP++ G IGIIDP+CR+IGL LYDGLFKVIP D   +  +AFNIRLEEL V+D+KFLYGC
Sbjct: 132  RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 191

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
              PTI  +YQD +  RHVKTYEV+L++K+F +GPW Q N++  A ++I VP P  G +II
Sbjct: 192  QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 250

Query: 181  GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
            G+E+I Y + + + AI    I+ S    + RVD +GSRYLLGD  G L +L++  E++  
Sbjct: 251  GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 310

Query: 236  ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
                +  L++ELLGETSIA  ++YLDN VV++GS  GDSQL+KLN+  + +GSYV  +E 
Sbjct: 311  GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 370

Query: 293  YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
            + NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W 
Sbjct: 371  FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 430

Query: 353  LRSSTD-DPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQ 411
            LRS  + + +DT LV+SF+ +TR+L +N E E+EETE+ GF    QT FC +  + QL+Q
Sbjct: 431  LRSDPNRETYDT-LVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQ 488

Query: 412  VTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEV 471
            +TS SVRLVS   + L +EWK P   +++VA+ N+SQV++A G   L YL+I    L ++
Sbjct: 489  ITSASVRLVSQEPKALVSEWKEPQAKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQI 547

Query: 472  KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPR 531
             H ++E+E++CLDI P+G++   S + A+G+WTDIS RI  LP   L+ KE LGGEIIPR
Sbjct: 548  SHTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPR 607

Query: 532  SVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHV 591
            S+L+  FE   YLLCALGDG L  F LN++TG L+DRKKV+LGTQP  LRTF S +TT+V
Sbjct: 608  SILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNV 667

Query: 592  FAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQK 651
            FA SDRPTVIYSSN KL++SNVNLKEV++MCP NS  +PDSLA+A    LTIGTID+IQK
Sbjct: 668  FACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQK 727

Query: 652  LHIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------ 681
            LHIR++PL E PR+IC+QE S+ F + S +                              
Sbjct: 728  LHIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGTTALRPSASTQALSSSVSSSK 787

Query: 682  ---------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYY 732
                       S  EE E+H + ++D  TFE +  +     EY  S++SC    D N Y+
Sbjct: 788  LFSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYF 847

Query: 733  CVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQ 792
             VGTA V PEE EP +GRI+VF   DGKLQ +AEKE KGAVYS+  FNGKLLA+IN  ++
Sbjct: 848  IVGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVR 907

Query: 793  LYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEER 852
            LY+W    D    +++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK  EG  EE 
Sbjct: 908  LYEWTTEKD----VRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEI 963

Query: 853  ARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVN 912
            ARD+N NWMSAVEILDDD +LGAEN FNLF  +K+S   TDEER  L+ VG +HLGEFVN
Sbjct: 964  ARDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVN 1023

Query: 913  RFRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGV 971
             F HGSLVM+ L ++      +V+FGTVNG+IG++ SL    Y  L  +Q  L KVIK V
Sbjct: 1024 VFCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSV 1083

Query: 972  GGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN----------VS 1021
            G + H  WRSF+ E+KT  A  F+DGDLIESFLD+SR +M E+   +            +
Sbjct: 1084 GKIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREAT 1143

Query: 1022 VEELCKRVEELTRLH 1036
             ++L K VEELTR+H
Sbjct: 1144 ADDLIKVVEELTRIH 1158


>gi|147906138|ref|NP_001083624.1| DNA damage-binding protein 1 [Xenopus laevis]
 gi|82186503|sp|Q6P6Z0.1|DDB1_XENLA RecName: Full=DNA damage-binding protein 1; AltName:
            Full=Damage-specific DNA-binding protein 1
 gi|38303806|gb|AAH61946.1| Ddb1 protein [Xenopus laevis]
          Length = 1140

 Score = 1144 bits (2958), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 579/1094 (52%), Positives = 759/1094 (69%), Gaps = 67/1094 (6%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
            +V +YG+IA +ELFRP GE++D LFI T +Y  C+L++    +S ++ITRA G+V DRIG
Sbjct: 54   EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGDSIDIITRAHGNVQDRIG 113

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RP++ G IGIIDPDCR+IGL LYDGLFKVIP +   +  +AFNIRLEEL V+D+KFLY C
Sbjct: 114  RPSETGIIGIIDPDCRMIGLRLYDGLFKVIPLERDNKELKAFNIRLEELHVIDVKFLYSC 173

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
              PTI  +YQD +  RHVKTYEV+L++K+F +GPW Q N++  A ++I VP P  G +II
Sbjct: 174  QAPTICFVYQDPQ-GRHVKTYEVSLREKEFSKGPWKQENVEAEASMVIAVPEPFGGAIII 232

Query: 181  GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
            G+E+I Y + + + AI    I+ S    + RVD +GSRYLLGD  G L +L++  E++  
Sbjct: 233  GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDVNGSRYLLGDMEGRLFMLLLEKEEQMD 292

Query: 236  ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
                +  L++ELLGETSIA  ++YLDN VV++GS  GDSQL+KL  + + +GSYV V+E 
Sbjct: 293  GSVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLTTESNEQGSYVVVMET 352

Query: 293  YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
            + NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W 
Sbjct: 353  FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412

Query: 353  LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
            LR + D   D  LV+SF+ +TR+L +  E E+EET++ GF    QT FC +  + QL+Q+
Sbjct: 413  LRVAADRDTDDTLVLSFVGQTRVLTLTGE-EVEETDLAGFVDDQQTFFCGNVAHQQLIQI 471

Query: 413  TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
            TS SVRLVS   + L +EWK P G  V+V + N+ QVLLA G   L YLEI  G L +  
Sbjct: 472  TSASVRLVSQNPQNLVSEWKEPQGRKVSVCSCNSRQVLLAVGR-VLYYLEIHPGELRQTS 530

Query: 473  HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
              ++E+E++CLD+ P+G N + S + A+G+WTDIS RI SLP   L+ KE LGGEIIPRS
Sbjct: 531  CTEMEHEVACLDVTPLGGNDTLSSLCAIGLWTDISARILSLPGFQLLHKEMLGGEIIPRS 590

Query: 533  VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
            +L+ +FE   YLLCALGDG L  F LN  TG L+DRKKV+LGTQP  LRTF S +TT+VF
Sbjct: 591  ILMTSFESSHYLLCALGDGALFYFSLNTDTGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650

Query: 593  AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
            A SDRPTVIYSSN KL++SNVNLKEV++MCP NS  +PDSLA+A    LTIGTID+IQKL
Sbjct: 651  ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSEGYPDSLALANNSTLTIGTIDEIQKL 710

Query: 653  HIRSIPLGEHPRRICHQEQSRTFAICSLKNQ------------------------SCA-- 686
            HIR++PL E PR+IC+QE S+ F + S + +                        SC+  
Sbjct: 711  HIRTVPLFESPRKICYQEVSQCFGVLSSRIEVQDASGGSSPLRPSASTQALSSSVSCSKL 770

Query: 687  -------------EESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
                         EE E+H + ++D  TFE + T+     EY  S++SC    D   Y+ 
Sbjct: 771  FSGSTSPHETSFGEEVEVHNLLIIDQHTFEVLHTHQFLQNEYTLSLVSCKLGKDPTTYFV 830

Query: 734  VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
            VGTA V P+E EP +GRI+VF   DGKLQ +AEKE KGAVYS+  FNGKLLA+IN  ++L
Sbjct: 831  VGTAMVYPDEAEPKQGRIVVFQYNDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 890

Query: 794  YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
            Y+W       +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK  EG  EE A
Sbjct: 891  YEWT----AEKELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 946

Query: 854  RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
            RD+N NWMSAVEILDDD +LGAEN FNLF  +K+S   TDEER  L+ VG +HLGEFVN 
Sbjct: 947  RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 1006

Query: 914  FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
            F HGSLVM+ L ++      +V+FGTVNG+IG++ SL    Y  L  +Q  L KVIK VG
Sbjct: 1007 FCHGSLVMQNLGETSPPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDVQNRLNKVIKSVG 1066

Query: 973  GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV----------SV 1022
             + H  WRSF+ E+KT  A  F+DGDLIESFLD+SR +M E+   + +          +V
Sbjct: 1067 KIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVIANLQIDDGSGMKRETTV 1126

Query: 1023 EELCKRVEELTRLH 1036
            ++L K VEELTR+H
Sbjct: 1127 DDLIKVVEELTRIH 1140


>gi|2632123|emb|CAA05770.1| Xeroderma Pigmentosum Group E Complementing protein [Homo sapiens]
          Length = 1140

 Score = 1143 bits (2957), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 576/1094 (52%), Positives = 756/1094 (69%), Gaps = 67/1094 (6%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
            +V +YG+IA +ELFRP GE++D LFI T +Y  C+L++    ES ++ITRA G+V DRIG
Sbjct: 54   EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 113

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RP++ G IGIIDP+CR+IGL LYDGLFKVIP D   +  +AFNIRLEEL V+D+KFLYGC
Sbjct: 114  RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 173

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
              PTI  +YQD +  RHVKTYEV+L++K+F +GPW Q N++  A ++I VP P  G +II
Sbjct: 174  QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232

Query: 181  GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
            G+E+I Y + + + AI    I+ S    + RVD +GSRYLLGD  G L +L++  E++  
Sbjct: 233  GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 292

Query: 236  ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
                +  L++ELLGETSIA  ++YLDN VV++GS  GDSQL+KLN+  + +GSYV  +E 
Sbjct: 293  GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 352

Query: 293  YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
            + NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W 
Sbjct: 353  FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412

Query: 353  LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
            LRS  +   D  LV+SF+ +TR+L +N E E+EETE+ GF    QT FC +  + QL+Q+
Sbjct: 413  LRSDPNRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQI 471

Query: 413  TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
            TS SVRLVS   + L +EWK P   +++VA+ N+SQV++A G   L YL+I    L ++ 
Sbjct: 472  TSASVRLVSQEPKALVSEWKEPQAKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 530

Query: 473  HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
            H  +E+E++CLDI P+G++   S + A+G+WTDIS RI  LP   L+ KE LGGEIIPRS
Sbjct: 531  HTGMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 590

Query: 533  VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
            +L+  FE   YLLCALGDG L  F LN++TG L+DRKKV+LGTQP  LRTF S +TT+VF
Sbjct: 591  ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650

Query: 593  AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
            A SDRPTVIYSSN KL++SNVNLKEV++MCP NS  +PDSLA+A    LTIGTID+IQKL
Sbjct: 651  ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 710

Query: 653  HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
            HIR++PL E PR+IC+QE S+ F + S +                               
Sbjct: 711  HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGTTALRPSASTQALSSSVSSSKL 770

Query: 682  --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
                      S  EE E+H + ++D  TFE +  +     EY  S++SC    D N Y+ 
Sbjct: 771  FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 830

Query: 734  VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
            VGTA V PEE EP +GRI+VF   DGKLQ +AEKE KG VYS+  FNGKLLA+IN  ++L
Sbjct: 831  VGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGDVYSMVEFNGKLLASINSTVRL 890

Query: 794  YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
            Y+W    D    +++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK  EG  EE A
Sbjct: 891  YEWTTEKD----VRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 946

Query: 854  RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
            RD+N NWMSAVEILDDD +LGAEN FNLF  +K+S   TDEER  L+ VG +HLGEFVN 
Sbjct: 947  RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 1006

Query: 914  FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
            F HGSLVM+ L ++      +V+FGTVNG+IG++ SL    Y  L  +Q  L KVIK VG
Sbjct: 1007 FCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066

Query: 973  GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN----------VSV 1022
             + H  WRSF+ E+KT  A  F+DGDLIESFLD+SR +M E+   +            + 
Sbjct: 1067 KIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATA 1126

Query: 1023 EELCKRVEELTRLH 1036
            ++L K VEELTR+H
Sbjct: 1127 DDLIKVVEELTRIH 1140


>gi|431910407|gb|ELK13480.1| DNA damage-binding protein 1 [Pteropus alecto]
          Length = 1143

 Score = 1143 bits (2957), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 578/1097 (52%), Positives = 760/1097 (69%), Gaps = 70/1097 (6%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
            +V +YG+IA +ELFRP GE++D LFI T +Y  C+L++    ES ++ITRA G+V DRIG
Sbjct: 54   EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 113

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RP++ G IGIIDP+CR+IGL LYDGLFKVIP D   +  +AFNIRLEEL V+D+KFLYGC
Sbjct: 114  RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 173

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
              PTI  +YQD +  RHVKTYEV+L++K+F +GPW Q N++  A ++I VP P  G +II
Sbjct: 174  QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232

Query: 181  GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
            G+E+I Y + + + AI    I+ S    + RVD +GSRYLLGD  G L +L++  E++  
Sbjct: 233  GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 292

Query: 236  ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
                +  L++ELLGETSIA  ++YLDN VV++GS  GDSQL+KLN+  + +GSYV  +E 
Sbjct: 293  GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 352

Query: 293  YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
            + NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W 
Sbjct: 353  FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412

Query: 353  LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
            LRS  +   D  LV+SF+ +TR+L +N E E+EETE+ GF    QT FC +  + QL+Q+
Sbjct: 413  LRSDPNRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQI 471

Query: 413  TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
            TS SVRLVS   + L +EWK P G +++VA+ N+SQV++A G   L YL+I    L ++ 
Sbjct: 472  TSASVRLVSQEPKALVSEWKEPQGKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 530

Query: 473  HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
            H ++E+E++CLDI P+G++   S + A+G+WTDIS RI  LP   L+ KE LGGEIIPRS
Sbjct: 531  HTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 590

Query: 533  VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
            +L+  FE   YLLCALGDG L  F LN++TG L+DRKKV+LGTQP  LRTF S +TT+VF
Sbjct: 591  ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650

Query: 593  AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
            A SDRPTVIYSSN KL++SNVNLKEV++MCP NS  +PDSLA+A    LTIGTID+IQKL
Sbjct: 651  ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 710

Query: 653  HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
            HIR++PL E PR+IC+QE S+ F + S +                               
Sbjct: 711  HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGTTALRPSASTQALSSSVSSSKL 770

Query: 682  --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
                      S  EE E+H + ++D  TFE +  +     EY  S++SC    D N Y+ 
Sbjct: 771  FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 830

Query: 734  VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
            VGTA V PEE EP +GRI+VF   DGKLQ +AEKE KGAVYS+  FNGKLLA+IN  ++L
Sbjct: 831  VGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 890

Query: 794  YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
            Y+W       +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK  EG  EE A
Sbjct: 891  YEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 946

Query: 854  RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
            RD+N NWMSAVEILDDD +LGAEN FNLF  +K+S   TDEER  L+ VG +HLGEFVN 
Sbjct: 947  RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 1006

Query: 914  FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
            F HGSLVM+ L ++      +V+FGTVNG+IG++ SL    Y  L  +Q  L KVIK VG
Sbjct: 1007 FCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066

Query: 973  GLNHEQW---RSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN---------- 1019
             + H  +   RSF+ E+KT  A  F+DGDLIESFLD+SR +M E+   +           
Sbjct: 1067 KIEHSLYPSQRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKRE 1126

Query: 1020 VSVEELCKRVEELTRLH 1036
             + ++L K VEELTR+H
Sbjct: 1127 ATADDLIKVVEELTRIH 1143


>gi|355683071|gb|AER97036.1| damage-specific DNA binding protein 1, 127kDa [Mustela putorius furo]
          Length = 1122

 Score = 1142 bits (2954), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 570/1062 (53%), Positives = 746/1062 (70%), Gaps = 57/1062 (5%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
            +V +YG+IA +ELFRP GE++D LFI T +Y  C+L++    ES ++ITRA G+V DRIG
Sbjct: 54   EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 113

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RP++ G IGIIDP+CR+IGL LYDGLFKVIP D   +  +AFNIRLEEL V+D+KFLYGC
Sbjct: 114  RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 173

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
              PTI  +YQD +  RHVKTYEV+L++K+F +GPW Q N++  A ++I VP P  G +II
Sbjct: 174  QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232

Query: 181  GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
            G+E+I Y + + + AI    I+ S    + RVD +GSRYLLGD  G L +L++  E++  
Sbjct: 233  GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 292

Query: 236  ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
                +  L++ELLGETSIA  ++YLDN VV++GS  GDSQL+KLN+  + +GSYV  +E 
Sbjct: 293  GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 352

Query: 293  YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
            + NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W 
Sbjct: 353  FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412

Query: 353  LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
            LRS  +   D  LV+SF+ +TR+L +N E E+EETE+ GF    QT FC +  + QL+Q+
Sbjct: 413  LRSDPNRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQI 471

Query: 413  TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
            TS SVRLVS   + L +EWK P G +++VA+ N+SQV++A G   L YL+I    L ++ 
Sbjct: 472  TSASVRLVSQEPKALVSEWKEPQGKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 530

Query: 473  HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
            H ++E+E++CLDI P+G++   S + A+G+WTDIS RI  LP   L+ KE LGGEIIPRS
Sbjct: 531  HTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 590

Query: 533  VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
            +L+  FE   YLLCALGDG L  F LN++TG L+DRKKV+LGTQP  LRTF S +TT+VF
Sbjct: 591  ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650

Query: 593  AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
            A SDRPTVIYSSN KL++SNVNLKEV++MCP NS  +PDSLA+A    LTIGTID+IQKL
Sbjct: 651  ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 710

Query: 653  HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
            HIR++PL E PR+IC+QE S+ F + S +                               
Sbjct: 711  HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGTTALRPSASTQALSSSVSSSKL 770

Query: 682  --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
                      S  EE E+H + ++D  TFE +  +     EY  S++SC    D N Y+ 
Sbjct: 771  FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 830

Query: 734  VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
            VGTA V PEE EP +GRI+VF   DGKLQ +AEKE KGAVYS+  FNGKLLA+IN  ++L
Sbjct: 831  VGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 890

Query: 794  YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
            Y+W       +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK  EG  EE A
Sbjct: 891  YEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 946

Query: 854  RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
            RD+N NWMSAVEILDDD +LGAEN FNLF  +K+S   TDEER  L+ VG +HLGEFVN 
Sbjct: 947  RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 1006

Query: 914  FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
            F HGSLVM+ L ++      +V+FGTVNG+IG++ SL    Y  L  +Q  L KVIK VG
Sbjct: 1007 FCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066

Query: 973  GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEI 1014
             + H  WRSF+ E+KT  A  F+DGDLIESFLD+SR +M E+
Sbjct: 1067 KIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEV 1108


>gi|301616502|ref|XP_002937687.1| PREDICTED: DNA damage-binding protein 1-like [Xenopus (Silurana)
            tropicalis]
          Length = 1140

 Score = 1142 bits (2954), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 577/1094 (52%), Positives = 761/1094 (69%), Gaps = 67/1094 (6%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
            +V +YG+IA +ELFRP GE++D LFI T +Y  C+L++  + +S ++ITRA G+V DRIG
Sbjct: 54   EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQNGDSIDIITRAHGNVQDRIG 113

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RP++ G IGIIDPDCR+IGL LYDGLFKVIP +   +  +AFNIRLEEL V+D+KFLY C
Sbjct: 114  RPSETGIIGIIDPDCRMIGLRLYDGLFKVIPLERDNKELKAFNIRLEELHVIDVKFLYSC 173

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
              PTI  +YQD +  RHVKTYEV+L++K+F +GPW Q N++  A ++I VP P  G +II
Sbjct: 174  QAPTICFVYQDPQ-GRHVKTYEVSLREKEFSKGPWKQENVEAEASMVIAVPEPFGGAIII 232

Query: 181  GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
            G+E+I Y + + + AI    I+ S    + RVD +GSRYLLGD  G L +L++  E++  
Sbjct: 233  GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDVNGSRYLLGDMEGRLFMLLLEKEEQMD 292

Query: 236  ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
                +  L++ELLGETSIA  ++YLDN VV++GS  GDSQL+KL  + + +GSYV V+E 
Sbjct: 293  GSVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLTAESNEQGSYVVVMET 352

Query: 293  YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
            + NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W 
Sbjct: 353  FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412

Query: 353  LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
            LR + D   D  LV+SF+ +TR+L +  E E+EET++ GF    QT FC +  + QL+Q+
Sbjct: 413  LRVAADRDTDDTLVLSFVGQTRVLTLTGE-EVEETDLAGFVDDQQTFFCGNVAHQQLIQI 471

Query: 413  TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
            TS SVRLV+   + L +EWK P G  V+V + N+ QVLLA G   L YLEI  G L ++ 
Sbjct: 472  TSASVRLVAQNPQSLVSEWKEPQGRKVSVCSCNSRQVLLAVGR-VLYYLEIHPGELRQIS 530

Query: 473  HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
              ++E+E++CLD+ P+G + + S + A+G+WTDIS RI SLP   L+ KE LGGEIIPRS
Sbjct: 531  CTEMEHEVACLDVTPLGGSDTLSSLCAIGLWTDISARILSLPGFQLLHKEMLGGEIIPRS 590

Query: 533  VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
            +L+ +FE   YLLCALGDG L  F LN  TG L+DRKKV+LGTQP  LRTF S +TT+VF
Sbjct: 591  ILMTSFESSHYLLCALGDGALFYFSLNTDTGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650

Query: 593  AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
            A SDRPTVIYSSN KL++SNVNLKEV++MCP NS  +PDSLA+A    LTIGTID+IQKL
Sbjct: 651  ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSEGYPDSLALANNSTLTIGTIDEIQKL 710

Query: 653  HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------NQSCA-- 686
            HIR++PL E PR+IC+QE S+ F + S +                        + SC+  
Sbjct: 711  HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDASGGSSPLRPSASTQALSSSVSCSKL 770

Query: 687  -------------EESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
                         EE E+H + ++D  TFE + T+     EY  S++SC    D   Y+ 
Sbjct: 771  FSGSTSPHETSFGEEVEVHNLLIIDQHTFEVLHTHQFLQNEYTLSLVSCKLGKDPTTYFV 830

Query: 734  VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
            VGTA V P+E EP +GRI+VF   DGKLQ +AEKE KGAVYS+  FNGKLLA+IN  ++L
Sbjct: 831  VGTAMVYPDEAEPKQGRIVVFQYNDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 890

Query: 794  YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
            Y+W       +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK  EG  EE A
Sbjct: 891  YEWT----AEKELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 946

Query: 854  RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
            RD+N NWMSAVEILDDD +LGAEN FNLF  +K+S   TDEER  L+ VG +HLGEFVN 
Sbjct: 947  RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 1006

Query: 914  FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
            F HGSLVM+ L ++      +V+FGTVNG+IG++ SL    Y  L  +Q  L KVIK VG
Sbjct: 1007 FCHGSLVMQNLGETSPPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDVQNRLNKVIKSVG 1066

Query: 973  GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV----------SV 1022
             + H  WRSF+ E+KT  A  F+DGDLIESFLD+SR +M E+   + +          +V
Sbjct: 1067 KIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVIANLQIDDGSGMKRETTV 1126

Query: 1023 EELCKRVEELTRLH 1036
            ++L K VEELTR+H
Sbjct: 1127 DDLIKVVEELTRIH 1140


>gi|440893607|gb|ELR46310.1| DNA damage-binding protein 1 [Bos grunniens mutus]
          Length = 1143

 Score = 1142 bits (2953), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 578/1097 (52%), Positives = 760/1097 (69%), Gaps = 70/1097 (6%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
            +V +YG+IA +ELFRP GE++D LFI T +Y  C+L++    ES ++ITRA G+V DRIG
Sbjct: 54   EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 113

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RP++ G IGIIDP+CR+IGL LYDGLFKVIP D   +  +AFNIRLEEL V+D+KFLYGC
Sbjct: 114  RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 173

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
              PTI  +YQD +  RHVKTYEV+L++K+F +GPW Q N++  A ++I VP P  G +II
Sbjct: 174  QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232

Query: 181  GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
            G+E+I Y + + + AI    I+ S    + RVD +GSRYLLGD  G L +L++  E++  
Sbjct: 233  GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 292

Query: 236  ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
                +  L++ELLGETSIA  ++YLDN VV++GS  GDSQL+KLN+  + +GSYV  +E 
Sbjct: 293  GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 352

Query: 293  YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
            + NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W 
Sbjct: 353  FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412

Query: 353  LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
            LRS  +   D  LV+SF+ +TR+L +N E E+EETE+ GF    QT FC +  + QL+Q+
Sbjct: 413  LRSDPNRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQI 471

Query: 413  TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
            TS SVRLVS   + L +EWK P G +++VA+ N+SQV++A G   L YL+I    L ++ 
Sbjct: 472  TSASVRLVSQEPKALVSEWKEPQGKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 530

Query: 473  HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
            H ++E+E++CLDI P+G++   S + A+G+WTDIS RI  LP   L+ KE LGGEIIPRS
Sbjct: 531  HTEMEHEVACLDITPLGDSNGMSPLCAIGLWTDISARIAKLPSFELLHKEMLGGEIIPRS 590

Query: 533  VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
            +L+  FE   YLLCALGDG L  F LN++TG L+DRKKV+LGTQP  LRTF S +TT+VF
Sbjct: 591  ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650

Query: 593  AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
            A SDRPTVIYSSN KL++SNVNLKEV++MCP NS  +PDSLA+A    LTIGTID+IQKL
Sbjct: 651  ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 710

Query: 653  HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
            HIR++PL E PR+IC+QE S+ F + S +                               
Sbjct: 711  HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGTTALRPSASTQALSSSVSSSKL 770

Query: 682  --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
                      S  EE E+H + ++D  TFE +  +     EY  S++SC    D N Y+ 
Sbjct: 771  FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 830

Query: 734  VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
            VGTA V PEE EP +GRI+VF   DGKLQ +AEKE KGAVYS+  FNGKLLA+IN  ++L
Sbjct: 831  VGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 890

Query: 794  YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
            Y+W       +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK  EG  EE A
Sbjct: 891  YEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 946

Query: 854  RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
            RD+N NWMSAVEILDDD +LGAEN FNLF  +K+S   TDEER  L+ VG +HLGEFVN 
Sbjct: 947  RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 1006

Query: 914  FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
            F HGSLVM+ L ++      +V+FGTVNG+IG++ SL    Y  L  +Q  L KVIK VG
Sbjct: 1007 FCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066

Query: 973  GLNHEQW---RSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN---------- 1019
             + H  +   RSF+ E+KT  A  F+DGDLIESFLD+SR +M E+   +           
Sbjct: 1067 KIEHSLYPSQRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKRE 1126

Query: 1020 VSVEELCKRVEELTRLH 1036
             + ++L K VEELTR+H
Sbjct: 1127 ATADDLIKVVEELTRIH 1143


>gi|281345356|gb|EFB20940.1| hypothetical protein PANDA_015888 [Ailuropoda melanoleuca]
          Length = 1124

 Score = 1140 bits (2950), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 578/1096 (52%), Positives = 759/1096 (69%), Gaps = 69/1096 (6%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
            +V +YG+IA +ELFRP GE++D LFI T +Y  C+L++    ES ++ITRA G+V DRIG
Sbjct: 36   EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 95

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RP++ G IGIIDP+CR+IGL LYDGLFKVIP D   +  +AFNIRLEEL V+D+KFLYGC
Sbjct: 96   RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 155

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
              PTI  +YQD +  RHVKTYEV+L++K+F +GPW Q N++  A ++I VP P  G +II
Sbjct: 156  QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 214

Query: 181  GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
            G+E+I Y + + + AI    I+ S    + RVD +GSRYLLGD  G L +L++  E++  
Sbjct: 215  GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 274

Query: 236  ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
                +  L++ELLGETSIA  ++YLDN VV++GS  GDSQL+KLN+  + +GSYV  +E 
Sbjct: 275  GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 334

Query: 293  YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
            + NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W 
Sbjct: 335  FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 394

Query: 353  LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
            LRS  +   D  LV+SF+ +TR+L +N E E+EETE+ GF    QT FC +  + QL+Q+
Sbjct: 395  LRSDPNRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQI 453

Query: 413  TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
            TS SVRLVS   + L +EWK P G +++VA+ N+SQV++A G   L YL+I    L ++ 
Sbjct: 454  TSASVRLVSQEPKALVSEWKEPQGKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 512

Query: 473  HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
            H ++E+E++CLDI P+G++   S + A+G+WTDIS RI  LP   L+ KE LGGEIIPRS
Sbjct: 513  HTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 572

Query: 533  VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
            +L+  FE   YLLCALGDG L  F LN++TG L+DRKKV+LGTQP  LRTF S +TT+VF
Sbjct: 573  ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 632

Query: 593  AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
            A SDRPTVIYSSN KL++SNVNLKEV++MCP NS  +PDSLA+A    LTIGTID+IQKL
Sbjct: 633  ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 692

Query: 653  HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
            HIR++PL E PR+IC+QE S+ F + S +                               
Sbjct: 693  HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGTTALRPSASTQALSSSVSSSKL 752

Query: 682  --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
                      S  EE E+H + ++D  TFE +  +     EY  S++SC    D N Y+ 
Sbjct: 753  FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 812

Query: 734  VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
            VGTA V PEE EP +GRI+VF   DGKLQ +AEKE KGAVYS+  FNGKLLA+IN  ++L
Sbjct: 813  VGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 872

Query: 794  YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
            Y+W       +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK  EG  EE A
Sbjct: 873  YEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 928

Query: 854  RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
            RD+N NWMSAVEILDDD +LGAEN FNLF  +K+S   TDEER  L+ VG +HLGEFVN 
Sbjct: 929  RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 988

Query: 914  FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
            F HGSLVM+ L ++      +V+FGTVNG+IG++ SL    Y  L  +Q  L KVIK + 
Sbjct: 989  FCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKNIT 1048

Query: 973  -GLNH-EQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN----------V 1020
              L H   WRSF+ E+KT  A  F+DGDLIESFLD+SR +M E+   +            
Sbjct: 1049 HSLTHLSTWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREA 1108

Query: 1021 SVEELCKRVEELTRLH 1036
            + ++L K VEELTR+H
Sbjct: 1109 TADDLIKVVEELTRIH 1124


>gi|74178494|dbj|BAE32502.1| unnamed protein product [Mus musculus]
          Length = 1140

 Score = 1139 bits (2947), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 577/1094 (52%), Positives = 755/1094 (69%), Gaps = 67/1094 (6%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
            +V +YG+IA +ELFRP GE++D LFI T +Y  C+L++    ES ++ITRA G+V DRIG
Sbjct: 54   EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 113

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RP++ G IGIIDP+CR+IGL LYDGLFKVIP D   +  +AFNIRLEEL V+D+KFLYGC
Sbjct: 114  RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 173

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
              PTI  +YQD +  RHVKTYEV+L++K+F +GPW Q N++  A ++I VP P  G +II
Sbjct: 174  QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232

Query: 181  GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
            G+E+I Y + + + AI    I+ S    + RVD +GSRYLLGD  G L +L++  E++  
Sbjct: 233  GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 292

Query: 236  ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
                +  L++ELLGETSIA  ++YLDN VV++GS  GDSQL+KLN+  + +GSYV  +E 
Sbjct: 293  GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 352

Query: 293  YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
            + NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W 
Sbjct: 353  FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412

Query: 353  LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
            LRS      D  LV+SF+ +TR+L +N E E+EETE+ GF    QT FC +  + QL+Q+
Sbjct: 413  LRSDPGRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQI 471

Query: 413  TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
            TS SVRLVS   + L +EWK P G +++VA+ N+SQV++A G   L YL+I    L ++ 
Sbjct: 472  TSASVRLVSQEPKALVSEWKEPQGKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 530

Query: 473  HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
            H ++E+E++CLDI P+G++   S + A+G+WTDIS RI  LP   L+ KE LGGEIIPRS
Sbjct: 531  HTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 590

Query: 533  VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
            +L+  FE    LLCALGDG L  F LN++TG L+DRKKV+LGTQP  LRTF S +TT+VF
Sbjct: 591  ILMTTFESSHCLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650

Query: 593  AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
            A SDRPTVIYSSN KL++SNVNLKEV++MCP NS  +PDSLA+A    LTIGTID+IQKL
Sbjct: 651  ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 710

Query: 653  HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
            HIR+ PL E PR+IC+QE S+ F + S +                               
Sbjct: 711  HIRTDPLYESPRKICYQEVSQCFGVLSSRIEVQDSSGGTTALRPSASTQALSSSVSSSKL 770

Query: 682  --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
                      S  EE E+H + ++   TFE +  +     EY  S++SC    D N Y+ 
Sbjct: 771  FSSSTAPHETSFGEEVEVHNLLIIGQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 830

Query: 734  VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
            VGTA V PEE EP  GRI+VF   DGKLQ +AEKE KGAVYS+  FNGKLLA+IN  ++L
Sbjct: 831  VGTAMVYPEEAEPKLGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 890

Query: 794  YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
            Y+W       +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK  EG  EE A
Sbjct: 891  YEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 946

Query: 854  RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
            RD+N NWMSAVEILDDD +LGAEN FNLF  +K+S   TDEER  L+ VG +HLGEFVN 
Sbjct: 947  RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 1006

Query: 914  FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
            F HGSLVM+ L ++      +V+FGTVNG+IG++ SL    Y  L  +Q  L KVIK VG
Sbjct: 1007 FCHGSLVMQNLGEASTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066

Query: 973  GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN----------VSV 1022
             + H  WRSF+ E+KT  A  F+DGDLIESFLD+SR +M E+   +            + 
Sbjct: 1067 KIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATA 1126

Query: 1023 EELCKRVEELTRLH 1036
            ++L K VEELTR+H
Sbjct: 1127 DDLIKVVEELTRIH 1140


>gi|260790329|ref|XP_002590195.1| hypothetical protein BRAFLDRAFT_128289 [Branchiostoma floridae]
 gi|229275385|gb|EEN46206.1| hypothetical protein BRAFLDRAFT_128289 [Branchiostoma floridae]
          Length = 1152

 Score = 1139 bits (2945), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 579/1109 (52%), Positives = 767/1109 (69%), Gaps = 85/1109 (7%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
            +V  YGRIA +ELFRP GE +D LFI T RY  C+L++  D +   ++TRA G+V DRIG
Sbjct: 54   EVGCYGRIAVMELFRPPGETKDLLFILTARYNACILEYKQDGDDVSIVTRAHGNVQDRIG 113

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RP++ G IGIIDP+CR+IGL LYDGLFKVIP D   +  +AFNIRLEEL V+D+KFLYGC
Sbjct: 114  RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNRELKAFNIRLEELNVIDVKFLYGC 173

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
              PT+V +YQD    RHVKTYE++++DK+F +GPW Q+N++  A ++I VP P CG LII
Sbjct: 174  QVPTVVFVYQD-PHGRHVKTYEISVRDKEFSKGPWKQDNVETEASMVIAVPEPFCGSLII 232

Query: 181  GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEK- 236
            G+E+I Y + + + A+    I+ S    +GRVDA+GSRYLLGD +G L +L++  EKE+ 
Sbjct: 233  GQESITYHNGDKYVAVAPPAIKQSTLICHGRVDANGSRYLLGDMSGRLFMLLL--EKEEL 290

Query: 237  ------VTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVL 290
                  V  LK+ELLGETSIA  ++YLDN VVY+GS  GDSQLIKLN+  D  GSYV V+
Sbjct: 291  IDGSVTVKDLKVELLGETSIAECLTYLDNGVVYLGSRLGDSQLIKLNVDADDSGSYVHVM 350

Query: 291  ERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGM 350
            E + NLGPIVD CVVDLERQGQGQ+VTCSGAYK+GSLRI+RNGIGI+E AS++L GIKG+
Sbjct: 351  ETFTNLGPIVDMCVVDLERQGQGQLVTCSGAYKEGSLRIIRNGIGIHEHASIDLPGIKGI 410

Query: 351  WSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLV 410
            W L    +   D  LV++F+ +TR+L ++ E E+EETE+ GF    QT FC + +  QL+
Sbjct: 411  WPLCVDPNGTMDDTLVLAFVGQTRVLLLSGE-EVEETELPGF-EDKQTFFCGNVMGGQLL 468

Query: 411  QVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTE 470
            Q+T+ SVRLVS  +++L +EWK P   +++VA++N +QV+ A  G  + Y+EI +G L +
Sbjct: 469  QITAASVRLVSRQTKQLVSEWKPPSEKNISVASSNTTQVVCAV-GQVVYYIEIHEGELKQ 527

Query: 471  VKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIP 530
            +    +  E++CLD+ P+ E    + + AVG+WTDIS  +  LP+L  +  E LGGEIIP
Sbjct: 528  LGETVMAREVACLDVTPLVEGSDRAHLCAVGLWTDISAHMLQLPNLEPMHVEMLGGEIIP 587

Query: 531  RSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGE------LTDRKKVSLGTQPITLRTFS 584
            RS+L+  FEGI YLLCALGDG L  F LN +TG+      LT + KV+LGTQP  LRTF 
Sbjct: 588  RSILMTTFEGIHYLLCALGDGSLFYFNLNPETGKNFSLHILTIKSKVTLGTQPTVLRTFR 647

Query: 585  SKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIG 644
            S +TT+VFA SDRPTVIYSSN KL++SNVNLKEV++MCP NS  +PDSLA+  +  LTIG
Sbjct: 648  SLSTTNVFACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSQGYPDSLALTNDSTLTIG 707

Query: 645  TIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQ--------------------- 683
            TID+IQKLHIR++PL E PRRI +QE S+TF + S + +                     
Sbjct: 708  TIDEIQKLHIRTVPLYESPRRIAYQETSQTFGVLSTRTEVEDASGGSGTQPVRPSASTTA 767

Query: 684  --------------------SCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCS 723
                                +  EE E+H + ++D  TFE +  + L   E+  S++SC 
Sbjct: 768  LSTSVSPNKNALRSSNDSDGAFGEEIEIHSLLVVDQHTFEILHAHQLLKDEFALSMVSCK 827

Query: 724  FSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKL 783
              DD N Y+ +GTA V PEE+EP  GRI+VF   DGKLQ +AEKE KGAVYSL  FN KL
Sbjct: 828  LGDDPNTYFIIGTAMVYPEESEPKSGRIIVFQYTDGKLQQVAEKEVKGAVYSLVQFNNKL 887

Query: 784  LAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYK 843
            LA+IN  ++L++W       +EL+ EC H+ +ILALY++T+GDFI+VGDLM+S++LL YK
Sbjct: 888  LASINSTVRLFEWT----AEKELRVECNHYNNILALYLKTKGDFILVGDLMRSVTLLAYK 943

Query: 844  HEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVG 903
              EG  EE ARD+N NWMSAVEILDDD +LGAEN+FN FT +K+S   TDEER  L+ VG
Sbjct: 944  PMEGCFEEIARDFNPNWMSAVEILDDDNFLGAENSFNFFTCQKDSAATTDEERQHLQEVG 1003

Query: 904  EYHLGEFVNRFRHGSLVMRLP-DSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQT 962
             +HLGEFVN FRHGSLVM+ P ++      +V+FGTVNG +G++  LP + + FL+++Q+
Sbjct: 1004 HFHLGEFVNVFRHGSLVMQHPGETSTPTQGSVLFGTVNGAVGLVTQLPADFFNFLQEVQS 1063

Query: 963  NLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV-- 1020
             L +VIK VG + H  WRSFN E+KT   + F+DGDLIESFLDLSR +M E+ + + V  
Sbjct: 1064 KLTRVIKSVGKIEHSFWRSFNTERKTEACQGFIDGDLIESFLDLSRDKMQEVVQGLQVGG 1123

Query: 1021 -------------SVEELCKRVEELTRLH 1036
                         +VE+L K VEELTR+H
Sbjct: 1124 AIMDDGSGMKRECTVEDLIKLVEELTRIH 1152


>gi|427788481|gb|JAA59692.1| Putative dna damage-binding protein 1 [Rhipicephalus pulchellus]
          Length = 1156

 Score = 1138 bits (2944), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 575/1110 (51%), Positives = 775/1110 (69%), Gaps = 83/1110 (7%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQ--WDAESSELITRAMGDVSDRIG 60
            ++ IYGRI+ ++L+RP GE +D LF  T +Y   +L+     +S E+IT+A G+++D   
Sbjct: 54   EIGIYGRISIMKLYRPAGEKKDLLFFLTAKYNAAILECVQYGDSIEIITKAHGNIADTFS 113

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RP++ G IGIIDP+CR+IGL LYDGLFKVIP D   +  +AFNIR+EEL V D++FL+GC
Sbjct: 114  RPSETGNIGIIDPECRVIGLRLYDGLFKVIPLDKDNRELKAFNIRMEELTVQDMEFLHGC 173

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
              PTIV+L+QD++ ARH+KTYEV+LKDK+FV+GPW Q+++++ A+L+I VP P CG LII
Sbjct: 174  KTPTIVLLHQDSQ-ARHMKTYEVSLKDKEFVKGPWKQDHVESEANLVIAVPEPFCGALII 232

Query: 181  GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
            G+E+I Y + + +  I    IR S    YG+VDA+GSRYLLGD AG L +L++  E +  
Sbjct: 233  GQESITYHNGDQYVVITPHLIRQSTIVCYGKVDANGSRYLLGDMAGRLFMLLLEREDKMD 292

Query: 236  ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
                V  LK+E LGE +IA  I+YLDN VVY+GS  GDSQLIKL+ + + +GS+VE++E 
Sbjct: 293  GTTTVKDLKLEFLGEITIAECITYLDNGVVYVGSRLGDSQLIKLHAERNDQGSFVEIMEV 352

Query: 293  YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
            + NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKGMW 
Sbjct: 353  FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGMWP 412

Query: 353  LR---------SSTDDPFDTF-----LVVSFISETRILAMNLEDELEETEIEGFCSQTQT 398
            LR             DP D+      LV+SF+ +TR+L ++ E E+EETE+ GF +  QT
Sbjct: 413  LRVGPGVAPHGGDGRDPGDSAERDNTLVLSFVRQTRVLMLSGE-EVEETELAGFDTSQQT 471

Query: 399  LFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHL 458
             FC +    QL+QVT+ +VRLV S +++L NEWK P   +++V T N SQV+ A     +
Sbjct: 472  FFCGNVRNKQLIQVTAAAVRLVDSQTKQLLNEWKPPGARNISVVTCNQSQVVCAVRK-EV 530

Query: 459  VYLEIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNL 518
              LEIGDG+L ++ +A+LE E++CLDI P+ E    + + AVG+WTDIS+RI SLP L  
Sbjct: 531  FCLEIGDGVLNQISNAELENEVACLDITPLSEKAEKATLCAVGLWTDISIRILSLPSLQQ 590

Query: 519  ITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPI 578
            + KE++GGEIIPRS+L+  FEGI YLLCALGDG L  FLL   TG LTDRKKV+LGTQP 
Sbjct: 591  LQKENIGGEIIPRSILITTFEGIHYLLCALGDGSLFYFLLEATTGALTDRKKVTLGTQPT 650

Query: 579  TLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKE 638
             L+TF S +T++VFA SDRPTVIYSSN KL++SNVNLKEV+HMCP NS  +PDSLA+A +
Sbjct: 651  VLKTFKSLSTSNVFACSDRPTVIYSSNHKLVFSNVNLKEVNHMCPLNSEGYPDSLALAND 710

Query: 639  GELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLK----------------- 681
              L IGTID+IQKLHIR++PLGE PRRI +QE ++TF + +++                 
Sbjct: 711  NTLLIGTIDEIQKLHIRTVPLGELPRRIAYQEATQTFGVITIRNDILGSSGLTPVRPSAS 770

Query: 682  ------------------------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGC 717
                                    N    +E E+H + ++D  TFE +  +     EY  
Sbjct: 771  TQAQNVTHSAQMSSIFKPGSVSTGNDQLGQEVEIHNLLIIDQHTFEVLHAHQFMQTEYAM 830

Query: 718  SILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLN 777
            SI+S    +D N YY VGTA VLP+E++P +GRI+VF   DGKL+ +AE+E KGA YS+ 
Sbjct: 831  SIVSTRLGNDPNTYYIVGTANVLPDESDPKQGRIVVFHWVDGKLEHVAEQEIKGAPYSML 890

Query: 778  AFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSI 837
             FNGKLLAAIN  ++L++W    +  REL++EC H  +ILALY++ +GDF++VGDLM+S+
Sbjct: 891  EFNGKLLAAINSTVRLFEW----NAERELRNECSHFNNILALYLRAKGDFVLVGDLMRSM 946

Query: 838  SLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERG 897
            SLL YK  EG  EE ARDY  NWMS+VEILDDD +LGAE+  NLF  +K+S   TDEER 
Sbjct: 947  SLLAYKPLEGNFEEIARDYQTNWMSSVEILDDDTFLGAESTTNLFVCQKDSAATTDEERQ 1006

Query: 898  RLEVVGEYHLGEFVNRFRHGSLVMRLP-DSDVGQIPTVIFGTVNGVIGVIASLPHEQYLF 956
             L+ VG++HLGEFVN FRHGSLVM+ P ++      +V+FGT++G IG+++ LP + Y F
Sbjct: 1007 HLQEVGQFHLGEFVNVFRHGSLVMQHPGETSSPTQGSVLFGTIHGAIGLVSQLPADFYTF 1066

Query: 957  LEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISK 1016
            L ++Q  L KVIK VG ++H  WRSF+ E+KT  A  F+DGDLIESFLDLSR +M E+ +
Sbjct: 1067 LSEVQEKLTKVIKSVGKIDHAFWRSFSTERKTEPAVGFIDGDLIESFLDLSRDKMQEVVQ 1126

Query: 1017 TM----------NVSVEELCKRVEELTRLH 1036
             +          + SV++L K +EEL+R+H
Sbjct: 1127 GIQMDDGSGMKRDASVDDLIKIIEELSRIH 1156


>gi|444513057|gb|ELV10249.1| DNA damage-binding protein 1 [Tupaia chinensis]
          Length = 1146

 Score = 1138 bits (2944), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 576/1100 (52%), Positives = 758/1100 (68%), Gaps = 73/1100 (6%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
            +V +YG+IA +ELFRP GE++D LFI T +Y  C+L++    ES ++ITRA G+V DRIG
Sbjct: 54   EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 113

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RP++ G IGIIDP+CR+IGL LYDGLFKVIP D   +  +AFNIRLEEL V+D+KFLYGC
Sbjct: 114  RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 173

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
              PTI  +YQD +  RHVKTYEV+L++K+F +GPW Q N++  A ++I VP P  G +II
Sbjct: 174  QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232

Query: 181  GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
            G+E+I Y + + + AI    I+ S    + RVD +GSRYLLGD  G L +L++  E++  
Sbjct: 233  GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 292

Query: 236  ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
                +  L++ELLGETSIA  ++YLDN VV++GS  GDSQL+KLN+  + +GSYV  +E 
Sbjct: 293  GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 352

Query: 293  YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
            + NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W 
Sbjct: 353  FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412

Query: 353  LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
            LRS  +   D  LV+SF+ +TR+L +N E E+EETE+ GF    QT FC +  + QL+Q+
Sbjct: 413  LRSDPNRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQI 471

Query: 413  TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
            TS SVRLVS   + L +EWK P G +++VA+ N+SQV++A G   L YL+I    L ++ 
Sbjct: 472  TSASVRLVSQEPKALVSEWKEPQGKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 530

Query: 473  HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
            H ++E+E++CLDI P+G++   S + A+G+WTDIS RI  LP   L+ KE LGGEIIPRS
Sbjct: 531  HTEMEHEVACLDITPLGDSSGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 590

Query: 533  VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
            +L+  FE   YLLCALGDG L  F LN++TG L+DRKKV+LGTQP  LRTF S +TT+VF
Sbjct: 591  ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650

Query: 593  AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
            A SDRPTVIYSSN KL++SNVNLKEV++MCP NS  +PDSLA+A    LTIGTID+IQKL
Sbjct: 651  ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 710

Query: 653  HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
            HIR++PL E PR+IC+QE S+ F + S +                               
Sbjct: 711  HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGTTALRPSASTQALSSSVSSSKL 770

Query: 682  --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
                      S  EE E+H + ++D  TFE +  +     EY  S++SC    D N Y+ 
Sbjct: 771  FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 830

Query: 734  VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
            VGTA V PEE EP +GRI+VF   DGKLQ +AEKE KGAVYS+  FNGKLLA+IN  ++L
Sbjct: 831  VGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 890

Query: 794  YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
            Y+W       +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK  EG  EE A
Sbjct: 891  YEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 946

Query: 854  RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
            RD+N NWMSAVEILDDD +LGAEN FNLF  +K+S   TDEER  L+ VG +HLGEFVN 
Sbjct: 947  RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 1006

Query: 914  FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
            F HGSLVM+ L ++      +V+FGTVNG+IG++ SL    Y  L  +Q  L KVIK   
Sbjct: 1007 FCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKRCF 1066

Query: 973  GL------NHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN------- 1019
             +      +   WRSF+ E+KT  A  F+DGDLIESFLD+SR +M E+   +        
Sbjct: 1067 QISPNSLTDMSTWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGM 1126

Query: 1020 ---VSVEELCKRVEELTRLH 1036
                + ++L K VEELTR+H
Sbjct: 1127 KREATADDLIKVVEELTRIH 1146


>gi|119594342|gb|EAW73936.1| damage-specific DNA binding protein 1, 127kDa, isoform CRA_d [Homo
            sapiens]
          Length = 1146

 Score = 1137 bits (2941), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 577/1100 (52%), Positives = 757/1100 (68%), Gaps = 73/1100 (6%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
            +V +YG+IA +ELFRP GE++D LFI T +Y  C+L++    ES ++ITRA G+V DRIG
Sbjct: 54   EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 113

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RP++ G IGIIDP+CR+IGL LYDGLFKVIP D   +  +AFNIRLEEL V+D+KFLYGC
Sbjct: 114  RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 173

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
              PTI  +YQD +  RHVKTYEV+L++K+F +GPW Q N++  A ++I VP P  G +II
Sbjct: 174  QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232

Query: 181  GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
            G+E+I Y + + + AI    I+ S    + RVD +GSRYLLGD  G L +L++  E++  
Sbjct: 233  GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 292

Query: 236  ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
                +  L++ELLGETSIA  ++YLDN VV++GS  GDSQL+KLN+  + +GSYV  +E 
Sbjct: 293  GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 352

Query: 293  YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
            + NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W 
Sbjct: 353  FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412

Query: 353  LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
            LRS  +   D  LV+SF+ +TR+L +N E E+EETE+ GF    QT FC +  + QL+Q+
Sbjct: 413  LRSDPNRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQI 471

Query: 413  TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
            TS SVRLVS   + L +EWK P   +++VA+ N+SQV++A G   L YL+I    L ++ 
Sbjct: 472  TSASVRLVSQEPKALVSEWKEPQAKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 530

Query: 473  HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
            H ++E+E++CLDI P+G++   S + A+G+WTDIS RI  LP   L+ KE LGGEIIPRS
Sbjct: 531  HTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 590

Query: 533  VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
            +L+  FE   YLLCALGDG L  F LN++TG L+DRKKV+LGTQP  LRTF S +TT+VF
Sbjct: 591  ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650

Query: 593  AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
            A SDRPTVIYSSN KL++SNVNLKEV++MCP NS  +PDSLA+A    LTIGTID+IQKL
Sbjct: 651  ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 710

Query: 653  HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
            HIR++PL E PR+IC+QE S+ F + S +                               
Sbjct: 711  HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGTTALRPSASTQALSSSVSSSKL 770

Query: 682  --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
                      S  EE E+H + ++D  TFE +  +     EY  S++SC    D N Y+ 
Sbjct: 771  FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 830

Query: 734  VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
            VGTA V PEE EP +GRI+VF   DGKLQ +AEKE KGAVYS+  FNGKLLA+IN  ++L
Sbjct: 831  VGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 890

Query: 794  YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
            Y+W       +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK  EG  EE A
Sbjct: 891  YEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 946

Query: 854  RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
            RD+N NWMSAVEILDDD +LGAEN FNLF  +K+S   TDEER  L+ VG +HLGEFVN 
Sbjct: 947  RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 1006

Query: 914  FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIK--- 969
            F HGSLVM+ L ++      +V+FGTVNG+IG++ SL    Y  L  +Q  L KVIK   
Sbjct: 1007 FCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKRCF 1066

Query: 970  --GVGGLNH-EQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN------- 1019
                  L H   WRSF+ E+KT  A  F+DGDLIESFLD+SR +M E+   +        
Sbjct: 1067 LISTCSLTHPSTWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGM 1126

Query: 1020 ---VSVEELCKRVEELTRLH 1036
                + ++L K VEELTR+H
Sbjct: 1127 KREATADDLIKVVEELTRIH 1146


>gi|410912407|ref|XP_003969681.1| PREDICTED: DNA damage-binding protein 1-like [Takifugu rubripes]
          Length = 1140

 Score = 1137 bits (2941), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 573/1094 (52%), Positives = 760/1094 (69%), Gaps = 67/1094 (6%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
            +V +YG+IA +ELFRP GE++D LFI T +Y  C+L++  + ES ++ITRA G+V D IG
Sbjct: 54   EVGMYGKIAVMELFRPKGESKDLLFILTSKYNACILEYKQNGESIDIITRAHGNVKDPIG 113

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RP++ G IGI+DP+CR+IGL LYDGLFKVIP D   +  +A+NIRLEELQV+D+ FLYGC
Sbjct: 114  RPSETGIIGIVDPECRMIGLRLYDGLFKVIPLDRDNRELKAYNIRLEELQVIDVHFLYGC 173

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
              PT+  +YQD +  RHVKTYEV+L++K+F +GPW Q N++  A ++IPVP P  G +II
Sbjct: 174  QAPTVCFIYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIPVPEPFGGAIII 232

Query: 181  GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEK--- 234
            G+E+I Y + + + AI    I+ S    + RVD +GSRYLLGD  G L +L++  E+   
Sbjct: 233  GQESITYHNGDKYLAIAPPTIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEELMD 292

Query: 235  --EKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
                +  L +ELLGETSIA  ++YLDN VV++GS  GD QL+KLN+  + +GS+V V+E 
Sbjct: 293  GTVALKDLHVELLGETSIAECLTYLDNGVVFVGSRLGDPQLVKLNVDSNDQGSFVTVMET 352

Query: 293  YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
            + NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W 
Sbjct: 353  FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412

Query: 353  LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
            LRS      D  LV+SF+ +TR+L ++ E E+EETE+ GF    QT +C +  +NQL+Q+
Sbjct: 413  LRSEAGRETDDMLVLSFVGQTRVLMLSGE-EVEETELPGFVDNQQTFYCGNVAHNQLIQI 471

Query: 413  TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
            TSGSVRLV   S+ L +EWK P G +++VA  N +QV+LA G   L YL+I  G L ++ 
Sbjct: 472  TSGSVRLVLQDSKALVSEWKEPQGRNISVAACNHTQVVLAVGRA-LYYLQILAGELKQIS 530

Query: 473  HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
              ++E+E++CLDI P+GE  + S + AVG+WTDIS R+  LP    + KE LGGEIIPRS
Sbjct: 531  TTEMEHEVACLDITPLGEAGAESPLCAVGLWTDISARVLKLPCFTPLHKEMLGGEIIPRS 590

Query: 533  VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
            +L+  FEG  YLLCALGDG L  F L+++TG L++ KKV+LGTQP  LRTF S +T++VF
Sbjct: 591  ILMTTFEGSYYLLCALGDGALFYFGLDLQTGALSECKKVTLGTQPTVLRTFRSLSTSNVF 650

Query: 593  AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
            A SDRPTVIYSSN KL++SNVNLKEV++MCP NS  +PDSLA+A    LTIGTID+IQKL
Sbjct: 651  ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSEGYPDSLALANNSTLTIGTIDEIQKL 710

Query: 653  HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
            HIR++PL E PRRIC+QE S+ F + S +                               
Sbjct: 711  HIRTVPLYESPRRICYQEVSQCFGVLSSRVEIQDVSGTTSPVRPSASTQALSSSMSSSKL 770

Query: 682  --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
                      S  EE E+H + ++D  TFE +  +     EY  S++SC    D +VY+ 
Sbjct: 771  FPSSTSPHESSFGEEVEIHSLLVVDQHTFEVLHAHQFLPSEYALSMVSCRLGKDPSVYFV 830

Query: 734  VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
            VGTA V PEE EP +GRI+VF   DGKLQ +AEKE KGAVYS+  FNGKLLA+IN  ++L
Sbjct: 831  VGTAMVYPEEAEPKQGRIIVFHYTDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 890

Query: 794  YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
            Y+W       +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK  EG  EE A
Sbjct: 891  YEWT----AEKELRTECSHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 946

Query: 854  RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
            RD+N NWMSAVEILDDD +LGAEN FNLF  +K+S   TDE+R  L+ VG +HLGEFVN 
Sbjct: 947  RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEDRQHLQEVGVFHLGEFVNV 1006

Query: 914  FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
            F HGSLV++ L ++      +V+FGTV G+IG++ SL    +  L  LQ  L KVIK VG
Sbjct: 1007 FCHGSLVLQNLGETSTPTQGSVLFGTVTGMIGLVTSLSEGWHSLLLDLQNRLNKVIKSVG 1066

Query: 973  GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV----------SV 1022
             + H  WRSF+ E+KT  AK F+DGDLIESFLDL R +M E+  T+ +          +V
Sbjct: 1067 KIEHSFWRSFHTERKTEQAKGFIDGDLIESFLDLGRAKMQEVVSTLQIDDGSGMKREATV 1126

Query: 1023 EELCKRVEELTRLH 1036
            +E+ K VEELTR+H
Sbjct: 1127 DEVIKIVEELTRIH 1140


>gi|81868411|sp|Q9ESW0.1|DDB1_RAT RecName: Full=DNA damage-binding protein 1; AltName:
            Full=Damage-specific DNA-binding protein 1
 gi|9843869|emb|CAB89874.2| damage-specific DNA binding protein 1 [Rattus norvegicus]
          Length = 1140

 Score = 1137 bits (2940), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 571/1094 (52%), Positives = 756/1094 (69%), Gaps = 67/1094 (6%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
            +V +YG+IA +ELFRP GE++D LFI T +Y  C+L++    ES ++ITRA G+V DRIG
Sbjct: 54   EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 113

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RP++ G IGIIDP+CR+IGL LYDGLFKVIP D   +  +AFNIRLEEL V+D+KFLYGC
Sbjct: 114  RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 173

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
              PTI  +YQD +  RHVKTYEV+L++K+F +GPW Q N++  A ++I VP P  G +II
Sbjct: 174  QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232

Query: 181  GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
            G+E+I Y + + + AI    I+ S    + RVD +GSRYLLGD  G L +L++  E++  
Sbjct: 233  GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 292

Query: 236  ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
                +  L++ELLGETSIA  ++YLDN VV++GS  GDSQ +KLN+  + +GSYV  +E 
Sbjct: 293  GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQPVKLNVDSNEQGSYVVAMET 352

Query: 293  YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
            + NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W 
Sbjct: 353  FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412

Query: 353  LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
            LRS  +   D  LV+SF+ +TR+L +N E E+EETE+ GF    QT FC +  + QL+Q+
Sbjct: 413  LRSDPNRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQI 471

Query: 413  TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
            TS SVRLVS   + L +EWK P   +++VA+ N+SQV++A G   L YL+I    L ++ 
Sbjct: 472  TSASVRLVSQEPKALVSEWKEPRAKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 530

Query: 473  HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
            H ++E+E++CLD+ P+G++   S + A+G+WTDIS RI  LP   L+ KE LGGEIIPRS
Sbjct: 531  HTEMEHEVACLDVTPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 590

Query: 533  VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
            +L+  FE   YLLCALGDG L  F LN++TG L+DRKKV+LGTQP  LRTF S +TT+VF
Sbjct: 591  ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650

Query: 593  AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
            A SDRPTVIYSSN KL++SNVNLKEV++MCP NS  +PDSLA+A    LTIGT+++IQKL
Sbjct: 651  ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANTSTLTIGTMNEIQKL 710

Query: 653  HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
            HIR++P+ E PR+IC+QE S+ F + S +                               
Sbjct: 711  HIRTVPIYESPRKICYQEVSQCFGVLSTRIEVQDTSGGTTALRPSASTQALSSSVSSSKL 770

Query: 682  --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
                      S  EE E+H + ++D  TFE +  +     EY  S++SC    D N Y+ 
Sbjct: 771  FSSSAAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 830

Query: 734  VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
            VGTA V PEE EP +GRI+VF    GKLQ +AEKE KGAVYS+  FNGKLLA+IN  ++L
Sbjct: 831  VGTAMVYPEEAEPKQGRIVVFQYSGGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 890

Query: 794  YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
            Y+W       +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK  EG  EE A
Sbjct: 891  YEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 946

Query: 854  RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
            RD+N NWMSAVEILDDD +LGAEN FNLF  +K+S   TDEER  L+ VG +HLGEFVN 
Sbjct: 947  RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 1006

Query: 914  FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
            F HGSLVM+ L ++      +V+ GTVNG+IG++ SL    Y  L  +Q  L KVIK VG
Sbjct: 1007 FCHGSLVMQNLGETSTPTQGSVLLGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066

Query: 973  GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN----------VSV 1022
             + H  WRSF+ E+KT  A  F+DGDLIESFLD+SR +M E+   +            + 
Sbjct: 1067 KIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATA 1126

Query: 1023 EELCKRVEELTRLH 1036
            ++L K VEELTR+H
Sbjct: 1127 DDLIKVVEELTRIH 1140


>gi|449283451|gb|EMC90093.1| DNA damage-binding protein 1 [Columba livia]
          Length = 1140

 Score = 1135 bits (2937), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 578/1105 (52%), Positives = 761/1105 (68%), Gaps = 78/1105 (7%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
            +V +YG+IA +ELFRP GE++D LFI T +Y  C+L++  + +S ++ITRA G+V DRIG
Sbjct: 43   EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQNGDSIDIITRAHGNVQDRIG 102

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RP++ G IGIIDP+CR+IGL LYDGLFKVIP D + +  +AFNIRLEELQV+D+KFLYGC
Sbjct: 103  RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRENKELKAFNIRLEELQVIDVKFLYGC 162

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
              PTI  +YQD +  RHVKTYEV+L++K+F +GPW Q N++  A ++I VP P  G +II
Sbjct: 163  QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 221

Query: 181  GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
            G+E+I Y + + + AI    I+ S    + RVD +GSRYLLGD  G L +L++  E++  
Sbjct: 222  GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 281

Query: 236  ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
                +  L++ELLGETSIA  ++YLDN VV++GS  GDSQL+KLN+  + +GSYV  +E 
Sbjct: 282  GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 341

Query: 293  YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
            + NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W 
Sbjct: 342  FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 401

Query: 353  LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
            LRS      D  LV+SF+ +TR+L +N E E+EETE+ GF    QT FC +  + QL+Q+
Sbjct: 402  LRSDAHRETDNMLVLSFVGQTRVLMLNGE-EVEETELTGFVDDQQTFFCGNVAHQQLIQI 460

Query: 413  TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
            TS SVRLVS   + L +EWK P G +++VA+ N+SQV++A G   L YLEI    L ++ 
Sbjct: 461  TSASVRLVSQEPKALVSEWKEPNGKNISVASCNSSQVVVAVGRA-LYYLEIRPQELRQIS 519

Query: 473  HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
              ++E+E++CLDI P+G+    S + A+G+WTDIS RI  LP   L+ KE LGGEIIPRS
Sbjct: 520  CTEMEHEVACLDITPLGDTNGMSPLCAIGLWTDISARILRLPSFELLHKEMLGGEIIPRS 579

Query: 533  VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
            +L+  FE   YLLCALGDG L  F L+++TG L+DRKKV+LGTQP  LRTF S +TT+VF
Sbjct: 580  ILMTTFESSHYLLCALGDGALFYFGLSLETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 639

Query: 593  AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
            A SDRPTVIYSSN KL++SNVNLKEV++MCP NS  +PDSLA+A    LTIGTID+IQKL
Sbjct: 640  ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 699

Query: 653  HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
            HIR++PL E PR+IC+QE S+ F + S +                               
Sbjct: 700  HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDASGGTTALRPSASTQALSSSVSTSKL 759

Query: 682  --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
                      S  EE E+H + ++D  TFE +  +     EY  S++SC    D N Y+ 
Sbjct: 760  FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 819

Query: 734  VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
            VGTA V PEE EP +GRI+VF   DGKLQ +AEKE KGAVYS+  FNGKLLA+IN  ++L
Sbjct: 820  VGTAMVYPEEAEPKQGRIVVFHYSDGKLQSLAEKEVKGAVYSMVEFNGKLLASINSTVRL 879

Query: 794  YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
            Y+W       +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK  EG  EE A
Sbjct: 880  YEWT----AEKELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 935

Query: 854  RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
            RD+N NWMSAVEILDDD +LGAEN FNLF  +K+S   TDEER  L+ VG  HLGEFVN 
Sbjct: 936  RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLSHLGEFVNV 995

Query: 914  FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
            F HGSLVM+ L ++      +V+FGTVNG+IG++ SL    Y  L  +Q  L KVIK VG
Sbjct: 996  FCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1055

Query: 973  GLNH---------EQW--RSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV- 1020
             + H           W  +SF+ E+KT  A  F+DGDLIESFLD+SR +M E+   + + 
Sbjct: 1056 KIEHSLYPSLVQLRAWASQSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQID 1115

Query: 1021 ---------SVEELCKRVEELTRLH 1036
                     +V++L K VEELTR+H
Sbjct: 1116 DGSGMKREATVDDLIKIVEELTRIH 1140


>gi|325186344|emb|CCA20849.1| predicted protein putative [Albugo laibachii Nc14]
          Length = 1148

 Score = 1132 bits (2927), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 566/1096 (51%), Positives = 762/1096 (69%), Gaps = 63/1096 (5%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRP 62
            D+ +YGRIA  E FRP GE QD++F+ T+R++FCVL +D    ++IT+A G + D IGR 
Sbjct: 54   DIKMYGRIAIFEKFRPSGETQDWIFLVTQRFQFCVLAYDTTLQQIITKANGSLRDTIGRN 113

Query: 63   TDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQ-LKEAFNIRLEELQVLDIKFLYGCA 121
            ++    G IDPD RLIG+++Y+G FKVIP DN  + LK AFNIRL+EL++LDIKFLYG  
Sbjct: 114  SEILTNGNIDPDGRLIGMNIYEGYFKVIPIDNHSKSLKAAFNIRLDELRILDIKFLYGYN 173

Query: 122  KPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIG 181
            KPTI VLY+D K ARHVKTY + LK+KDF EGPWSQ+N++ GA+LLIPVP P  GVLII 
Sbjct: 174  KPTICVLYEDFKAARHVKTYFILLKEKDFAEGPWSQSNVEAGANLLIPVPMPYGGVLIIS 233

Query: 182  EETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLK 241
             +TIVY +   F AIP++ ++ + YG V  DGSR+LL D  G LH++ +  E ++V  + 
Sbjct: 234  NQTIVYHNGTYFHAIPMQNTMIQVYGAVGDDGSRFLLADQYGALHVVALQTEGKEVLDVY 293

Query: 242  IELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVD 301
            +E+LG+TSIAS +SYLDN VV++GS++GDSQL+KLN + D  GSY+EVL+ YVN+GPI+D
Sbjct: 294  LEVLGQTSIASCVSYLDNGVVFVGSTFGDSQLVKLNSKRDESGSYIEVLDSYVNIGPIID 353

Query: 302  FCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDPF 361
            FCV+DL+RQGQGQ+VTCSGA KDGSLR++RNGIGINEQAS EL GIKGMW+LR S    +
Sbjct: 354  FCVMDLDRQGQGQIVTCSGADKDGSLRVIRNGIGINEQASAELPGIKGMWALRESLASEY 413

Query: 362  DTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVS 421
            D +LV S+++E RI+ +   DE+EE EIE F    +TL+C +   +  +QVT   VR++ 
Sbjct: 414  DKYLVQSYLNEIRIMTIGDSDEMEEVEIEAFL-DAKTLYCRNVNEDGWLQVTETEVRIID 472

Query: 422  STSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEIS 481
            + +  +   W  PP   + VA+AN +Q++LAT    L+Y+EI D  L E    ++E+EI+
Sbjct: 473  AQTTSICCGWIPPPSTRITVASANPTQIVLATSSKVLIYMEILDKQLMEKARKEMEFEIA 532

Query: 482  CLDINPI---------------------------GENPSYSQIAAVGMWTDISVRIFSLP 514
            C+D++P                              N   + I AVG WT  SV + SLP
Sbjct: 533  CIDLSPRLDQIDPPALTRSSQVMDEAMEDAIVSRDANALRTSICAVGFWTSSSVCVLSLP 592

Query: 515  DLNLITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLG 574
             L+ ++KE    +++PRSVL  +FE   YLL  LGDG L+   L+ + G L++RK++SLG
Sbjct: 593  SLDELSKETFDKDLLPRSVLCASFEDSDYLLIGLGDGSLVTCHLDREFGTLSERKRISLG 652

Query: 575  TQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLA 634
            TQPI+L  F SK+  HVFA+ DRPTVIY + KKLLYSNVN KEV+ M  F+S  FP+ LA
Sbjct: 653  TQPISLSAFWSKDKLHVFASCDRPTVIYYNKKKLLYSNVNAKEVAVMSTFHSEVFPECLA 712

Query: 635  IAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLK--NQSCAEESEMH 692
            +A   ELTIGT+DDIQKLHI+S  L E  RRI H  +SRT  + ++K       +E E  
Sbjct: 713  LASGSELTIGTVDDIQKLHIQSFHLNEWARRIAHDSRSRTLGVATVKLSKDDGGDEVEQG 772

Query: 693  FVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSN--VYYCVGTAYVLPEENEPTKGR 750
            ++RL DDQTFE + ++ LD FE  CSI++C F+ DS+   YY VGTA+V  EE EP +GR
Sbjct: 773  YIRLFDDQTFECLKSFRLDPFESPCSIITCIFTGDSSGGTYYVVGTAFVHEEEAEPHQGR 832

Query: 751  ILVFIVE----DGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTREL 806
            ILVF V     D +LQL+ EKE KG+VY LNAFNGKLLA +N K+ L+KW   ++   EL
Sbjct: 833  ILVFTVSGIHGDRRLQLVTEKEVKGSVYCLNAFNGKLLAGVNSKVYLFKWSESEENGEEL 892

Query: 807  QSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEI 866
             SECGHHGH L LY+++RGDFIVVGDLMKSISLL +K  +G+IEE ARD N+NWM+AV I
Sbjct: 893  VSECGHHGHTLVLYMESRGDFIVVGDLMKSISLLNHKQLDGSIEEIARDLNSNWMTAVGI 952

Query: 867  LDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDS 926
            +DDD Y+G+E +FNLFTV++NS  A+DEERGRLE +GEYHLGEFVNRFR+GSLVM+   S
Sbjct: 953  IDDDNYVGSETDFNLFTVQRNSGAASDEERGRLETIGEYHLGEFVNRFRYGSLVMQHNLS 1012

Query: 927  DVGQIP-------------------TVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKV 967
               + P                   +++FGTV+G+IGVI  +  E++ FL ++Q+ L +V
Sbjct: 1013 IGAEAPGISLSDDRPESLSPLSVQRSMLFGTVSGMIGVILPISKEKHEFLMRVQSALNQV 1072

Query: 968  IKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN-------V 1020
            I+GVGG +H +WR+F N + +++A NF+DGDLIESFLDLS+  M ++   +N        
Sbjct: 1073 IQGVGGFSHSEWRTFENRRSSIEAHNFIDGDLIESFLDLSKDEMKQVVDELNRDQLEGKT 1132

Query: 1021 SVEELCKRVEELTRLH 1036
            ++E L  R+EEL+++H
Sbjct: 1133 TLEALAARIEELSQMH 1148


>gi|405970039|gb|EKC34976.1| DNA damage-binding protein 1 [Crassostrea gigas]
          Length = 1160

 Score = 1131 bits (2926), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 574/1118 (51%), Positives = 772/1118 (69%), Gaps = 95/1118 (8%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQ--WDAESSELITRAMGDVSDRIG 60
            +V +YGRIA +ELFRP GE++D +F+ T+RY   +L+   + E+ ++ITRA G+V D+IG
Sbjct: 54   EVGVYGRIAVMELFRPPGESKDLMFLLTQRYNAMILECNQNGENIDIITRAHGNVQDKIG 113

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            R ++ G IGIIDP CR+IGL LYDGLFKVIP +   +  +AFNIRLEEL V+DI+FL+GC
Sbjct: 114  RASETGIIGIIDPLCRVIGLRLYDGLFKVIPLERDNKELKAFNIRLEELTVIDIQFLHGC 173

Query: 121  AKPTIVVLYQDN--------------------KDARHVKTYEVALKDKDFVEGPWSQNNL 160
              PT+++++Q N                    +  RHVKTYE++L+DK+F +GPW Q+N+
Sbjct: 174  TTPTLILIHQANLNCYHLMTLCITNLLSFKQDQHGRHVKTYEISLRDKEFQKGPWKQDNV 233

Query: 161  DNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIRPSITK-----AYGRVDADGSR 215
            +  A +LI VP P  G LIIG+E+I Y   + F  IPI P   K      YG+VDA+GSR
Sbjct: 234  ETEACMLIAVPEPFGGALIIGQESITYHKGDNF--IPIAPPAIKQSTLTCYGKVDANGSR 291

Query: 216  YLLGDHAGLLHLLVITHEKE-----KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGD 270
            YLLGD  G L +L++  E++      V  LK+ELLGET+IA  I+YLDNAVVYIGS  GD
Sbjct: 292  YLLGDMMGRLFMLMLEKEEKMDSTVTVKDLKVELLGETTIAECITYLDNAVVYIGSRLGD 351

Query: 271  SQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIV 330
            SQL+KLN++PD  GSYV+ +ER+ NLGPI+D CVVDLERQGQGQ+VTCSGAYK+GSLRI+
Sbjct: 352  SQLVKLNVEPDENGSYVQEMERFTNLGPILDMCVVDLERQGQGQLVTCSGAYKEGSLRII 411

Query: 331  RNGIGINEQASVELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIE 390
            RNGIGI+E AS++L GIKG+WSLR ++ + +D  +V+SF+ +TR+L +N E E+EETE+ 
Sbjct: 412  RNGIGIHEHASIDLPGIKGIWSLRVNSPE-YDNMIVLSFVGQTRVLMLNGE-EVEETELS 469

Query: 391  GFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVL 450
            G  S  QT  C + ++NQL+Q+T  SVRL+S  +++L  EWK   G ++++A++N  QV+
Sbjct: 470  GIESDQQTFLCANVVHNQLLQITPQSVRLISCDNQKLLKEWKHSGGKNISLASSNTCQVV 529

Query: 451  LATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRI 510
             A  G  L YLE+  G + +V  + +E+E++C+D+ P+ E    S + A+G+WTDIS R+
Sbjct: 530  -ACVGSELYYLELLQGDIKQVSTSTMEHEVACVDLTPLREGEEKSHLCAIGLWTDISARV 588

Query: 511  FSLPDLNLITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKK 570
             SLPD N +  E LGGEIIPRS+L+  FEGI YLLCALGDG L  F  N+ TG  T+++K
Sbjct: 589  LSLPDFNSLHVEMLGGEIIPRSILMTTFEGIHYLLCALGDGSLFYFNFNIDTGYFTEKRK 648

Query: 571  VSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFP 630
            V+LGTQP  LRTF S +TT+VFA SDRPTVIYSSN KL++SNVNLKEV+HMCP NS  +P
Sbjct: 649  VTLGTQPTVLRTFRSLSTTNVFACSDRPTVIYSSNHKLVFSNVNLKEVNHMCPLNSEGYP 708

Query: 631  DSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLK--------- 681
            DSLA+A +G LTIGTID+IQKLHIR+IPLGE PRRI +QE S+TF + S++         
Sbjct: 709  DSLALANDGTLTIGTIDEIQKLHIRTIPLGESPRRIAYQESSQTFGVISMRMDLQDSNGL 768

Query: 682  ------------------------------NQSCAEESEMHFVRLLDDQTFEFISTYPLD 711
                                            S  +E E+H + ++D  TFE + ++ L 
Sbjct: 769  NPTRPSASTHAAMMSSSSSGKVTMGTSTMGEHSAGDEVEVHSLLIIDQHTFEVLHSHQLM 828

Query: 712  TFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKG 771
              E+  S++S    +D   YY VGTA V PEE EP +GRI++F   +GKL  IAEKE KG
Sbjct: 829  PNEFATSLISARLGEDPCNYYIVGTALVHPEEAEPKQGRIVIFHFHEGKLNQIAEKEIKG 888

Query: 772  AVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVG 831
            A Y+L  FNGKLLA+IN  ++L++W       +EL+ EC +   I+ALY++T+GDFI+VG
Sbjct: 889  AAYTLVEFNGKLLASINSTVRLFEWTT----DKELRLECNYFNSIVALYLKTKGDFILVG 944

Query: 832  DLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGA 891
            DLM+SI+LL+YK  EG  EE ARD N NW +AVEILDDD +LGAEN+FNLFT +K+S   
Sbjct: 945  DLMRSITLLLYKPMEGTFEEIARDCNPNWTTAVEILDDDNFLGAENSFNLFTCQKDSAST 1004

Query: 892  TDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPT---VIFGTVNGVIGVIAS 948
            TDE+R  L+ VG +HLGEFVN FRHGSLVM+   S     PT   V++GTVNG +G++  
Sbjct: 1005 TDEDRQNLQEVGMFHLGEFVNVFRHGSLVMQ--HSGETSTPTQGSVLYGTVNGAVGLVTQ 1062

Query: 949  LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSR 1008
            +P E Y FL+ +Q+ L KVIK VG + H  WRSF+ E+KT   + F+DGDLIESFLDL+R
Sbjct: 1063 VPQEFYSFLQDIQSRLAKVIKSVGKIEHSFWRSFHTERKTEACEGFIDGDLIESFLDLNR 1122

Query: 1009 TRMDEISKTMNV----------SVEELCKRVEELTRLH 1036
             +M E  K + +          +V++L K +EELTR+H
Sbjct: 1123 DKMQETVKGLQIDDGSGMKREATVDDLVKTIEELTRIH 1160


>gi|427780151|gb|JAA55527.1| Putative dna damage-binding protein 1 [Rhipicephalus pulchellus]
          Length = 1181

 Score = 1129 bits (2920), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 576/1135 (50%), Positives = 778/1135 (68%), Gaps = 108/1135 (9%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQ--WDAESSELITRAMGDVSDRIG 60
            ++ IYGRI+ ++L+RP GE +D LF  T +Y   +L+     +S E+IT+A G+++D   
Sbjct: 54   EIGIYGRISIMKLYRPAGEKKDLLFFLTAKYNAAILECVQYGDSIEIITKAHGNIADTFS 113

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RP++ G IGIIDP+CR+IGL LYDGLFKVIP D   +  +AFNIR+EEL V D++FL+GC
Sbjct: 114  RPSETGNIGIIDPECRVIGLRLYDGLFKVIPLDKDNRELKAFNIRMEELTVQDMEFLHGC 173

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
              PTIV+L+QD++ ARH+KTYEV+LKDK+FV+GPW Q+++++ A+L+I VP P CG LII
Sbjct: 174  KTPTIVLLHQDSQ-ARHMKTYEVSLKDKEFVKGPWKQDHVESEANLVIAVPEPFCGALII 232

Query: 181  GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHE---- 233
            G+E+I Y + + +  I    IR S    YG+VDA+GSRYLLGD AG L +L++  E    
Sbjct: 233  GQESITYHNGDQYVVITPHLIRQSTIVCYGKVDANGSRYLLGDMAGRLFMLLLEREDKMD 292

Query: 234  --------------------KEKVTG------LKIELLGETSIASTISYLDNAVVYIGSS 267
                                ++K+ G      LK+E LGE +IA  I+YLDN VVY+GS 
Sbjct: 293  GTXYLLGDMAGRLFMLLLEREDKMDGTTTVKDLKLEFLGEITIAECITYLDNGVVYVGSR 352

Query: 268  YGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSL 327
             GDSQLIKL+ + + +GS+VE++E + NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSL
Sbjct: 353  LGDSQLIKLHAERNDQGSFVEIMEVFTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSL 412

Query: 328  RIVRNGIGINEQASVELQGIKGMWSLR---------SSTDDPFDTF-----LVVSFISET 373
            RI+RNGIGI+E AS++L GIKGMW LR             DP D+      LV+SF+ +T
Sbjct: 413  RIIRNGIGIHEHASIDLPGIKGMWPLRVGPGVAPHGGDGRDPGDSAERDNTLVLSFVRQT 472

Query: 374  RILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKS 433
            R+L ++ E E+EETE+ GF +  QT FC +    QL+QVT+ +VRLV S +++L NEWK 
Sbjct: 473  RVLMLSGE-EVEETELAGFDTSQQTFFCGNVRNKQLIQVTAAAVRLVDSQTKQLLNEWKP 531

Query: 434  PPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDINPIGENPS 493
            P   +++V T N SQV+ A     +  LEIGDG+L ++ +A+LE E++CLDI P+ E   
Sbjct: 532  PGARNISVVTCNQSQVVCAVRK-EVFCLEIGDGVLNQISNAELENEVACLDITPLSEKAE 590

Query: 494  YSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHL 553
             + + AVG+WTDIS+RI SLP L  + KE++GGEIIPRS+L+  FEGI YLLCALGDG L
Sbjct: 591  KATLCAVGLWTDISIRILSLPSLQQLQKENIGGEIIPRSILITTFEGIHYLLCALGDGSL 650

Query: 554  LNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNV 613
              FLL   TG LTDRKKV+LGTQP  L+TF S +T++VFA SDRPTVIYSSN KL++SNV
Sbjct: 651  FYFLLEATTGALTDRKKVTLGTQPTVLKTFKSLSTSNVFACSDRPTVIYSSNHKLVFSNV 710

Query: 614  NLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSR 673
            NLKEV+HMCP NS  +PDSLA+A +  L IGTID+IQKLHIR++PLGE PRRI +QE ++
Sbjct: 711  NLKEVNHMCPLNSEGYPDSLALANDNTLLIGTIDEIQKLHIRTVPLGELPRRIAYQEATQ 770

Query: 674  TFAICSLK-----------------------------------------NQSCAEESEMH 692
            TF + +++                                         N    +E E+H
Sbjct: 771  TFGVITIRNDILGSSGLTPVRPSASTQAQNVTHSAQMSSIFKPGSVSTGNDQLGQEVEIH 830

Query: 693  FVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRIL 752
             + ++D  TFE +  +     EY  SI+S    +D N YY VGTA VLP+E++P +GRI+
Sbjct: 831  NLLIIDQHTFEVLHAHQFMQTEYAMSIVSTRLGNDPNTYYIVGTANVLPDESDPKQGRIV 890

Query: 753  VFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGH 812
            VF   DGKL+ +AE+E KGA YS+  FNGKLLAAIN  ++L++W    +  REL++EC H
Sbjct: 891  VFHWVDGKLEHVAEQEIKGAPYSMLEFNGKLLAAINSTVRLFEW----NAERELRNECSH 946

Query: 813  HGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIY 872
              +ILALY++ +GDF++VGDLM+S+SLL YK  EG  EE ARDY  NWMS+VEILDDD +
Sbjct: 947  FNNILALYLRAKGDFVLVGDLMRSMSLLAYKPLEGNFEEIARDYQTNWMSSVEILDDDTF 1006

Query: 873  LGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLP-DSDVGQI 931
            LGAE+  NLF  +K+S   TDEER  L+ VG++HLGEFVN FRHGSLVM+ P ++     
Sbjct: 1007 LGAESTTNLFVCQKDSAATTDEERQHLQEVGQFHLGEFVNVFRHGSLVMQHPGETSSPTQ 1066

Query: 932  PTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDA 991
             +V+FGT++G IG+++ LP + Y FL ++Q  L KVIK VG ++H  WRSF+ E+KT  A
Sbjct: 1067 GSVLFGTIHGAIGLVSQLPADFYTFLSEVQEKLTKVIKSVGKIDHAFWRSFSTERKTEPA 1126

Query: 992  KNFLDGDLIESFLDLSRTRMDEISKTM----------NVSVEELCKRVEELTRLH 1036
              F+DGDLIESFLDLSR +M E+ + +          + SV++L K +EEL+R+H
Sbjct: 1127 VGFIDGDLIESFLDLSRDKMQEVVQGIQMDDGSGMKRDASVDDLIKIIEELSRIH 1181


>gi|326919947|ref|XP_003206238.1| PREDICTED: DNA damage-binding protein 1-like [Meleagris gallopavo]
          Length = 1079

 Score = 1125 bits (2909), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 568/1081 (52%), Positives = 750/1081 (69%), Gaps = 69/1081 (6%)

Query: 18   PH--GEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIGRPTDNGQIGIIDP 73
            PH  GE++D LFI T +Y  C+L++  + ++ ++ITRA G+V DRIGRP++ G IGIIDP
Sbjct: 6    PHTRGESKDLLFILTAKYNACILEYKQNGDNIDIITRAHGNVQDRIGRPSETGIIGIIDP 65

Query: 74   DCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGCAKPTIVVLYQDNK 133
            +CR+IGL LYDGLFKVIP D + +  +AFNIRLEELQV+D+KFLYGC  PTI  +YQD +
Sbjct: 66   ECRMIGLRLYDGLFKVIPLDRENKELKAFNIRLEELQVIDVKFLYGCQAPTICFVYQDPQ 125

Query: 134  DARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAF 193
              RHVKTYEV+L++K+F +GPW Q N++  A ++I VP P  G +IIG+E+I Y + + +
Sbjct: 126  -GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIIIGQESITYHNGDKY 184

Query: 194  KAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-----KVTGLKIELL 245
             AI    I+ S    + RVD +GSRYLLGD  G L +L++  E++      +  L++ELL
Sbjct: 185  LAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMDGTVTLKDLRVELL 244

Query: 246  GETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVV 305
            GETSIA  ++YLDN VV++GS  GDSQL+KLN+  + +GSYV  +E + NLGPIVD CVV
Sbjct: 245  GETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMETFTNLGPIVDMCVV 304

Query: 306  DLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDPFDTFL 365
            DLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W LRS +    D  L
Sbjct: 305  DLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWPLRSDSHHEMDNML 364

Query: 366  VVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSR 425
            V+SF+ +TR+L +N E E+EETE+ GF    QT FC +  + QL+Q+TS SVRLVS   +
Sbjct: 365  VLSFVGQTRVLMLNGE-EVEETELTGFVDDQQTFFCGNVAHQQLIQITSASVRLVSQEPK 423

Query: 426  ELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDI 485
             L +EWK P G +++VA+ N++QV++A G   L YLEI    L ++   ++E+E++CLDI
Sbjct: 424  ALVSEWKEPNGKNISVASCNSNQVVVAVGRA-LYYLEIRPQELRQINCTEMEHEVACLDI 482

Query: 486  NPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEGISYLL 545
             P+G+    S + A+G+WTDIS RI  LP   ++ KE LGGEIIPRS+L+  FE   YLL
Sbjct: 483  TPLGDTNGMSPLCAIGLWTDISARILKLPSFEMLHKEMLGGEIIPRSILMTTFESSHYLL 542

Query: 546  CALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSN 605
            CALGDG L  F L+++TG L+DRKKV+LGTQP  LRTF S +TT+VFA SDRPTVIYSSN
Sbjct: 543  CALGDGALFYFGLSLETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVFACSDRPTVIYSSN 602

Query: 606  KKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRR 665
             KL++SNVNLKEV++MCP NS  +PDSLA+A    LTIGTID+IQKLHIR++PL E PR+
Sbjct: 603  HKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKLHIRTVPLYESPRK 662

Query: 666  ICHQEQSRTFAICSLK---------------------------------------NQSCA 686
            IC+QE S+ F + S +                                         S  
Sbjct: 663  ICYQEVSQCFGVLSSRIEVQDASGGTTALRPSASTQALSSSVSTSKLFSSSTAPHETSFG 722

Query: 687  EESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEP 746
            EE E+H + ++D  TFE +  +     EY  S++SC    D N Y+ VGTA V PEE EP
Sbjct: 723  EEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFIVGTAMVYPEEAEP 782

Query: 747  TKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTREL 806
             +GRI+VF   DGKLQ +AEKE KGAVYS+  FNGKLLA+IN  ++LY+W       +EL
Sbjct: 783  KQGRIVVFHYSDGKLQSLAEKEVKGAVYSMVEFNGKLLASINSTVRLYEWT----AEKEL 838

Query: 807  QSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEI 866
            ++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK  EG  EE ARD+N NWMSAVEI
Sbjct: 839  RTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIARDFNPNWMSAVEI 898

Query: 867  LDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMR-LPD 925
            LDDD +LGAEN FNLF  +K+S   TDEER  L+ VG  HLGEFVN F HGSLVM+ L +
Sbjct: 899  LDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLSHLGEFVNVFCHGSLVMQNLGE 958

Query: 926  SDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNE 985
            +      +V+FGTVNG+IG++ SL    Y  L  +Q  L KVIK VG + H  WRSF+ E
Sbjct: 959  TSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTE 1018

Query: 986  KKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV----------SVEELCKRVEELTRL 1035
            +KT  A  F+DGDLIESFLD+SR +M E+   + +          +V++L K VEELTR+
Sbjct: 1019 RKTEPATGFIDGDLIESFLDISRPKMQEVVANLQIDDGSGMKREATVDDLIKIVEELTRI 1078

Query: 1036 H 1036
            H
Sbjct: 1079 H 1079


>gi|413948669|gb|AFW81318.1| hypothetical protein ZEAMMB73_456332 [Zea mays]
          Length = 674

 Score = 1118 bits (2891), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 540/673 (80%), Positives = 605/673 (89%)

Query: 364  FLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSST 423
            +LVVSFISETR LAMN+EDELEETEIEGF +QTQTLFC +AI + L+QVT+ SVRLVS T
Sbjct: 2    YLVVSFISETRFLAMNMEDELEETEIEGFDAQTQTLFCQNAINDLLIQVTANSVRLVSCT 61

Query: 424  SRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEISCL 483
            SREL ++W +P G+SVNVA+ANASQVLLATGGGHLVYLEI D  L EVKHAQLE+EISCL
Sbjct: 62   SRELVDQWNAPAGFSVNVASANASQVLLATGGGHLVYLEIRDAKLVEVKHAQLEHEISCL 121

Query: 484  DINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEGISY 543
            D+NPIGE+  YS +AAVGMWTDISVRIFSLPDL LI KE+LGGEI+PRSVLLC  EG+SY
Sbjct: 122  DLNPIGEHLEYSSLAAVGMWTDISVRIFSLPDLVLIRKENLGGEIVPRSVLLCTLEGVSY 181

Query: 544  LLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYS 603
            LLCALGDG+L +FLLN  TGELTDRKKV+LGTQPI+LRTFSSK TTHVFA+SDRPTVIYS
Sbjct: 182  LLCALGDGNLFSFLLNASTGELTDRKKVTLGTQPISLRTFSSKGTTHVFASSDRPTVIYS 241

Query: 604  SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHP 663
            SNKKLLYSNVNLKEV+HMCPFN+AAFPDSLAIAKEGEL+IGTIDDIQKLHIR+IPL E  
Sbjct: 242  SNKKLLYSNVNLKEVNHMCPFNTAAFPDSLAIAKEGELSIGTIDDIQKLHIRTIPLNEQA 301

Query: 664  RRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCS 723
            RRICHQEQS+T A CS K     EESE H +RLLD QTFE +  YPLD +E GCSI+SCS
Sbjct: 302  RRICHQEQSKTLAFCSFKYNQSVEESETHLIRLLDHQTFESLCVYPLDQYECGCSIISCS 361

Query: 724  FSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKL 783
            F DD+NVYYCVGTAYV+PEENEPTKGRILVF VEDG LQLI EKETKGAVYSLNAFNGKL
Sbjct: 362  FVDDNNVYYCVGTAYVIPEENEPTKGRILVFAVEDGSLQLIVEKETKGAVYSLNAFNGKL 421

Query: 784  LAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYK 843
            LAAINQKIQLYKWM R+DG+ ELQSECGHHGHILALY QTRGDFIVVGDLMKSISLL+YK
Sbjct: 422  LAAINQKIQLYKWMSREDGSHELQSECGHHGHILALYTQTRGDFIVVGDLMKSISLLVYK 481

Query: 844  HEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVG 903
            HEE AIEERARDYNANWM+AVE+LDD++Y+GAEN +NLFTVRKNS+ ATD+ER +LEVVG
Sbjct: 482  HEESAIEERARDYNANWMTAVEMLDDEVYVGAENGYNLFTVRKNSDAATDDERAKLEVVG 541

Query: 904  EYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTN 963
            EYHLGEFVNRFRHGSLVMRLPDS++G+IPTVIFGT+NGVIG+IASLPH+ Y FLEK Q+ 
Sbjct: 542  EYHLGEFVNRFRHGSLVMRLPDSEIGKIPTVIFGTINGVIGIIASLPHDHYTFLEKFQST 601

Query: 964  LRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVE 1023
            L K IKGVG ++HEQWRSF+N+KKT +A+NFLDGDLIESFLDLSR++M+ +SK M VSVE
Sbjct: 602  LVKYIKGVGNMSHEQWRSFHNDKKTAEARNFLDGDLIESFLDLSRSKMEVVSKAMGVSVE 661

Query: 1024 ELCKRVEELTRLH 1036
            +L K VEELTRLH
Sbjct: 662  DLSKIVEELTRLH 674


>gi|241260143|ref|XP_002404926.1| DNA repair protein xp-E, putative [Ixodes scapularis]
 gi|215496735|gb|EEC06375.1| DNA repair protein xp-E, putative [Ixodes scapularis]
          Length = 1148

 Score = 1117 bits (2889), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 566/1105 (51%), Positives = 766/1105 (69%), Gaps = 81/1105 (7%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESS----ELITRAMGDVSDR 58
            ++ IYGRI+ ++LFRP GE +D LF  T +Y   +L+   +      E+IT+A G+++D 
Sbjct: 54   EIGIYGRISIMKLFRPPGEKKDLLFFLTAKYNAAILECIQDGGDGGVEIITKAHGNIADS 113

Query: 59   IGRPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLY 118
              RP++ G IGIIDP+CR+IGL LYDGLFKVIP D   +  +AFNIR+EEL V D++FL+
Sbjct: 114  FSRPSETGNIGIIDPECRVIGLRLYDGLFKVIPLDRDNRELKAFNIRMEELTVQDMEFLH 173

Query: 119  GCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVL 178
            GC  PTIV+L+QD++ ARH+KTYE++LKDK+FV+GPW Q+++++ A ++I VP P C   
Sbjct: 174  GCKTPTIVLLHQDSQ-ARHMKTYEISLKDKEFVKGPWKQDHVESEATIVIAVPEPFCDAR 232

Query: 179  IIGEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE 235
             IG+E+I Y + +    I    IR S    YG+VDA+GSRYLLGD AG L +L++  E +
Sbjct: 233  CIGQESITYHNGDQDVVITPHLIRQSTIVCYGKVDANGSRYLLGDMAGRLFMLLLEREDK 292

Query: 236  -----KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVL 290
                  V  LK+E LGE +IA  ++YLDN VVY+GS  GDSQLIKLN + + +GSYVEV+
Sbjct: 293  MDGTTTVKDLKLEFLGEITIAECMTYLDNGVVYVGSRLGDSQLIKLNSERNEQGSYVEVM 352

Query: 291  ERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGM 350
            E + NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+
Sbjct: 353  EVFTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGI 412

Query: 351  WSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLV 410
            W LR +TD   D  LV+SF+ +TR+L ++ E E+EETE+ GF    QT FC +   NQL+
Sbjct: 413  WPLRVNTDSSRDNTLVLSFVGQTRVLMLSGE-EVEETELAGFDISQQTFFCGNVRNNQLI 471

Query: 411  QVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTE 470
            QVT+ +VRLV   ++ L NEWK P G +++V T N +QV+ A     +  LEI DG L +
Sbjct: 472  QVTAAAVRLVDGKTKLLLNEWKPPGGKNISVVTCNQNQVVCAVRQ-EVFCLEIADGTLKQ 530

Query: 471  VKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIP 530
              +A++E E++CLDI P+ +    S + AVG+WTDIS+R+  LP L  + KE++GGEIIP
Sbjct: 531  TSNAEMENEVACLDITPLNDGSGKSDLCAVGLWTDISIRVLRLPSLEQLQKENIGGEIIP 590

Query: 531  RSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTH 590
            RS+L+  FEG+ YLLCALGDG L  FLL+  TG ++DRKKV+LGTQP  L+TF S +T++
Sbjct: 591  RSILMTTFEGVHYLLCALGDGSLFYFLLDASTGAVSDRKKVTLGTQPTVLKTFKSLSTSN 650

Query: 591  VFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQ 650
            VFA SDRPTVIYSSN KL++SNVNLKEV+HMCP NS  +PDSLA+A +G L IGTID+IQ
Sbjct: 651  VFACSDRPTVIYSSNHKLVFSNVNLKEVNHMCPLNSEGYPDSLALANDGTLLIGTIDEIQ 710

Query: 651  KLHIRSIPLGEHPRRICHQEQSRTFAICSLK----------------------------- 681
            KLHIR++PLGE   RI +QE ++TF + +++                             
Sbjct: 711  KLHIRTVPLGE---RIAYQEATQTFGVITIRNDIQGTGGLTPVRPSASTQAQNVTYSSTM 767

Query: 682  -----------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNV 730
                       N    +E E+H + ++D  TFE +  +     EY  SI+S     D NV
Sbjct: 768  SSVFKPGTGSGNDQLGQEVEVHNLLIVDQHTFEVLHAHQFMQTEYAMSIVSTRLGSDPNV 827

Query: 731  YYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQK 790
            Y+ VGTA VLP+E++P +GRI++F   DGKLQ +AEKE KGA YSL  FNGKLLA+IN  
Sbjct: 828  YFIVGTAIVLPDESDPKQGRIIIFHWVDGKLQQVAEKEIKGAPYSLLEFNGKLLASINST 887

Query: 791  IQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIE 850
            ++L++W    +  REL +EC H  +ILALY++T+GDFI+VGDLM+S+SLL YK  EG+ E
Sbjct: 888  VRLFEW----NAERELHNECSHFNNILALYLKTKGDFILVGDLMRSMSLLAYKPLEGSFE 943

Query: 851  ERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEF 910
            E ARDY  NWM AVEILDDD +LGAE+  NLF  +K+S   TDE+R  L+ VG++HLGEF
Sbjct: 944  EIARDYQTNWMCAVEILDDDTFLGAESTTNLFVCQKDSAATTDEDRQHLQEVGQFHLGEF 1003

Query: 911  VNRFRHGSLVMRLP-DSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIK 969
            VN FRHGSLVM+ P ++      +V+FGT++G IG++A LP + Y FL ++Q NL KVIK
Sbjct: 1004 VNIFRHGSLVMQHPGEASSPTQGSVLFGTIHGAIGLVAQLPSDFYNFLLEVQGNLTKVIK 1063

Query: 970  GVGGLNHE--------QWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTM--- 1018
             VG ++H          WRSF+ E+KT  A+ F+DGDLIESFLDLSR +M E+ + +   
Sbjct: 1064 SVGKIDHTLYPFVRLFTWRSFSTERKTEQAQGFIDGDLIESFLDLSRDKMQEVLQGIQMD 1123

Query: 1019 -------NVSVEELCKRVEELTRLH 1036
                   + +V++L K +EEL+R+H
Sbjct: 1124 DGSGMKRDATVDDLIKIIEELSRVH 1148


>gi|351699158|gb|EHB02077.1| DNA damage-binding protein 1 [Heterocephalus glaber]
          Length = 1144

 Score = 1115 bits (2884), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 567/1098 (51%), Positives = 752/1098 (68%), Gaps = 71/1098 (6%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
            +V +YG+IA +ELFRP GE++D LFI T +Y  C+L++    ES ++ITRA G+V DRIG
Sbjct: 54   EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 113

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RP++ G IGIIDP+CR+IGL LYDGLFKVIP D   +  +AFNIRLEEL V+D+KFLYGC
Sbjct: 114  RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 173

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
              PTI  +YQD +  RHVKTYEV+L++K+F +GPW Q N++  A ++I VP P  G +II
Sbjct: 174  QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232

Query: 181  GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
            G+E+I Y + + + AI    I+ S    + RVD +GSRYLLGD  G L +L++  E++  
Sbjct: 233  GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 292

Query: 236  ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
                +  L++ELLGETSIA  ++YLDN VV++GS  GDSQL+KLN+  + +GSYV  +E 
Sbjct: 293  GTVALKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 352

Query: 293  YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
            + NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W 
Sbjct: 353  FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412

Query: 353  LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
            LRS+     D  LV+SF+ +TR+L +N E E+EETE+ GF    QT FC +  + QL+Q+
Sbjct: 413  LRSAPSRETDDTLVLSFVGQTRVLMLNGE-EVEETELVGFVDDQQTFFCGNVAHQQLIQI 471

Query: 413  TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
            TS SVRLVS   + L +EWK P G + +VA+ ++SQV++A G   L YL+I    L ++ 
Sbjct: 472  TSASVRLVSQEPKALVSEWKEPQGKNNSVASGSSSQVVVAVGRA-LYYLQIHPRELRQIS 530

Query: 473  HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
              ++E+E++CLDI P+G++   S + A+G+WTDIS RI  LP   L+ KE LGGEIIPRS
Sbjct: 531  QTEMEHEVACLDITPLGDSSGLSPLCAIGLWTDISARILQLPSFELLHKEMLGGEIIPRS 590

Query: 533  VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
            +L+  FE   YLLCALGDG L  F LN++TG L+DRKKV+LGTQP  LRTF S +TT+VF
Sbjct: 591  ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650

Query: 593  AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
            A SDRPTVIYSSN KL++SNVNLKEV++MCP NS  +PDSLA+A    LTIGTID+IQKL
Sbjct: 651  ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 710

Query: 653  HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
            HIR++PL E PR+IC+QE S+ F + S +                               
Sbjct: 711  HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGTTALRPSASTQALSSSVSSSKL 770

Query: 682  --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
                      S  EE E+H + ++D  TFE +  +     EY  S++SC    D N Y+ 
Sbjct: 771  FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 830

Query: 734  VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
            VGTA V PEE EP +GRI+VF   D + ++ +    KGAVYS+  FNGKLLA+IN  ++L
Sbjct: 831  VGTAMVYPEEAEPKQGRIVVFQYSDEEREVSSRGLVKGAVYSMVEFNGKLLASINSTVRL 890

Query: 794  YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
            Y+W       +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK  EG  EE A
Sbjct: 891  YEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 946

Query: 854  RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
            RD+N NWMSAVEILDDD +LGAEN FNLF  +K+S   TDEER  L+ VG +HLGEFVN 
Sbjct: 947  RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 1006

Query: 914  FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
            F HGSLVM+ L ++      +V+FGTVNG+IG++ SL    Y  L  +Q  L KVIK VG
Sbjct: 1007 FCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066

Query: 973  GLNHEQWR----SFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN--------- 1019
             + H  +     SF+ E+KT  A  F+DGDLIESFLD+SR +M E+   +          
Sbjct: 1067 KIEHSLYPSRAVSFHTERKTEQATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKR 1126

Query: 1020 -VSVEELCKRVEELTRLH 1036
              + ++L K VEELTR+H
Sbjct: 1127 EATADDLIKVVEELTRIH 1144


>gi|320163506|gb|EFW40405.1| UV-damaged DNA binding protein [Capsaspora owczarzaki ATCC 30864]
          Length = 1123

 Score = 1114 bits (2881), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 562/1049 (53%), Positives = 730/1049 (69%), Gaps = 76/1049 (7%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            +LD  +YGRIA ++LFRP    QD LFI TERY F VL +DA++ EL+T+A GD+ D+ G
Sbjct: 76   LLDAGMYGRIAIMQLFRPPNAQQDLLFICTERYAFTVLAYDAQTGELVTKANGDLQDKSG 135

Query: 61   RPTDNGQIGIIDPDCRL------------------------------IGLHLYDGLFKVI 90
             P D G IG+IDPDCRL                              IGL LY+G+FKVI
Sbjct: 136  NPADLGPIGVIDPDCRLIGENAALDLWGGGHPPFYFGCSCSLRLVVWIGLRLYNGMFKVI 195

Query: 91   PFDNKGQLKEAFNIRLEELQVLDIKFLYGCAKPTIVVLYQDNKDARHVKTYEVALKDKDF 150
            P D  GQ K+AFNIRLEELQV DIKFL G  +PTI+VLYQD K+ RHVKTY+V LK+K+F
Sbjct: 196  PIDPHGQFKDAFNIRLEELQVFDIKFLRGYDRPTILVLYQDTKETRHVKTYQVLLKEKEF 255

Query: 151  VEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVD 210
             EGPW+QNN++ GA LLIPV  PL GVLI+GE+TI Y S +AF+++ +RP+I K Y  +D
Sbjct: 256  AEGPWAQNNVEGGASLLIPVLMPLGGVLIVGEQTITYHSGSAFRSVAMRPAIIKCYSVID 315

Query: 211  ADGSRYLLGDHAGLLHLLVITHEKE-KVTGLKIELLGETSIASTISYLDNAVVYIGSSYG 269
             +  R+LL D  G L  +++TH+++ KVT +KI+ LG TSI S ++YLDN VV+ GS +G
Sbjct: 316  TN--RFLLADSEGNLLSVLLTHDRQDKVTAIKIDRLGVTSILSCLTYLDNGVVFGGSQFG 373

Query: 270  DSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRI 329
            DSQL++L  + D  GS+V VLE + NLGPI D  VVDLERQGQ QVVTCSGA+KDGSLR+
Sbjct: 374  DSQLLRLATERDETGSFVRVLESFSNLGPICDMAVVDLERQGQCQVVTCSGAFKDGSLRV 433

Query: 330  VRNGIGINEQASVELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEI 389
            VRNG+GI EQA++EL GIKG+WSL+ +    + + LVVSFI ETR+L M+  +ELEE +I
Sbjct: 434  VRNGVGIEEQATIELPGIKGIWSLKPTEAALYRSILVVSFIGETRLLGMSSGEELEEMQI 493

Query: 390  EGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEW-------KSPPGYSVNVA 442
             G    +QTL C +   +Q +QVT+  VRLV+ +++ L   W       +  PG  + +A
Sbjct: 494  PGLDQNSQTLHCANVSGDQFLQVTATEVRLVNCSTQALVASWSPASVPDRYAPGTRITMA 553

Query: 443  TANASQVLLATGGGHLVYLEI-GDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVG 501
            ++N  QVL+A GGGHLV L +   G L  + HA++++EI+C+DI PIG  P  SQ+ AVG
Sbjct: 554  SSNDFQVLVACGGGHLVCLSVEASGNLVPIGHARMDHEIACVDITPIGGQP-LSQVCAVG 612

Query: 502  MWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMK 561
            +WTDI+VR+ S+P L  +  + L G+IIPRS+L+  FEG   LLCALGDG +  +  ++ 
Sbjct: 613  LWTDITVRVLSVPTLEQVLVQPLEGQIIPRSILMATFEGQPRLLCALGDGSMHTYSFDVL 672

Query: 562  TGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHM 621
            + +LTD K+VSLGTQPI L  F S+  THVFA SDRPTVIYSSN+KLLYSNVNL+EV+H+
Sbjct: 673  SQQLTDHKRVSLGTQPILLSAFVSRGQTHVFACSDRPTVIYSSNRKLLYSNVNLREVTHV 732

Query: 622  CPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLK 681
            CPF S +F D LA+     LTIGTID+IQKLH+R+IPLGE PRRI + E +RT+ + ++ 
Sbjct: 733  CPFTSESFADCLAVVSSTSLTIGTIDEIQKLHVRAIPLGEMPRRIAYHEPTRTYGVATVT 792

Query: 682  ------------NQSCAEES------------------EMHFVRLLDDQTFEFISTYPLD 711
                        N +   ++                  +  FVRL D QTFE   ++ L 
Sbjct: 793  LAEPLPVGSNSGNVAARAQNVRPMAFDDGPRSPSDVLEDTSFVRLFDGQTFEIRDSFQLP 852

Query: 712  TFEYGCSILSCSF---SDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKE 768
            + E   S +SCSF   S DS VY  VGTA+V+P E+EP +GRILVF V  G L L+  K+
Sbjct: 853  STETIMSFISCSFANDSSDSTVYLVVGTAFVIPSEDEPKRGRILVFDVAGGALHLVTAKD 912

Query: 769  TKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFI 828
             KG VYSLNAFNGKLLA IN K+ L+KW L  DG REL SEC HHGHIL LY+++RGDFI
Sbjct: 913  VKGCVYSLNAFNGKLLAGINSKVNLFKWNLTGDGIRELVSECSHHGHILTLYLKSRGDFI 972

Query: 829  VVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNS 888
            +VGDLM+SISLL+YK    +IEE A+D   NW++AV++LDDD+++G E++FN+FT R+N 
Sbjct: 973  IVGDLMRSISLLMYKSGTSSIEEIAQDTCPNWVTAVDMLDDDVFIGGESSFNIFTCRRNL 1032

Query: 889  EGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQI-PTVIFGTVNGVIGVIA 947
            E +TDEER RLEVVGE+H+GEF+N+FR GSLVM+LPD     I P+ +FGT NGVIGVIA
Sbjct: 1033 EASTDEERKRLEVVGEFHVGEFINQFRAGSLVMKLPDEQEQPIQPSTLFGTGNGVIGVIA 1092

Query: 948  SLPHEQYLFLEKLQTNLRKVIKGVGGLNH 976
             L   QY FL+ +Q  + KVIKGVGGLNH
Sbjct: 1093 RLTRSQYEFLQLVQAAMAKVIKGVGGLNH 1121


>gi|410045300|ref|XP_508472.4| PREDICTED: DNA damage-binding protein 1 [Pan troglodytes]
          Length = 1107

 Score = 1109 bits (2869), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 564/1093 (51%), Positives = 736/1093 (67%), Gaps = 105/1093 (9%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
            +V +YG+IA +ELFRP GE++D LFI T +Y  C+L++    ES ++ITRA G+V DRIG
Sbjct: 61   EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 120

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RP++ G IGIIDP+CR+IGL LYDGLFKVIP D   +  +AFNIRLEEL V+D+KFLYGC
Sbjct: 121  RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 180

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
              PTI  +YQD +  RHVKTYEV+L++K+F +GPW Q N++  A ++I VP P  G +II
Sbjct: 181  QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 239

Query: 181  GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
            G+E+I Y + + + AI    I+ S    + RVD +GSRYLLGD  G L +L++  E++  
Sbjct: 240  GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 299

Query: 236  ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
                +  L++ELLGETSIA  ++YLDN VV++GS  GDSQL+KLN+  + +GSYV  +E 
Sbjct: 300  GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 359

Query: 293  YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
            + NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W 
Sbjct: 360  FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 419

Query: 353  LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
            LRS  +   D  LV+SF+ +TR+L +N E E+EETE+ GF    QT FC +  + QL+Q+
Sbjct: 420  LRSDPNRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQI 478

Query: 413  TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
            TS SVRLVS   + L +EWK P   +++VA+ N+SQV++A G   L YL+I    L ++ 
Sbjct: 479  TSASVRLVSQEPKALVSEWKEPQAKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 537

Query: 473  HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
            H ++E+E++CLDI P+G++   S + A+G+WTDIS RI  LP   L+ KE LGGEIIPRS
Sbjct: 538  HTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 597

Query: 533  VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
            +L+  FE   YLLCALGDG L  F LN++TG L+DRKKV+LGTQP  LRTF S +TT+VF
Sbjct: 598  ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 657

Query: 593  AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
            A SDRPTVIYSSN KL++SNVNLKEV++MCP NS  +PDSLA+A    LTIGTID+IQKL
Sbjct: 658  ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 717

Query: 653  HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
            HIR++PL E PR+IC+QE S+ F + S +                               
Sbjct: 718  HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGTTALRPSASTQALSSSVSSSKL 777

Query: 682  --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
                      S  EE E+H + ++D  TFE +  +     EY  S++SC    D N Y+ 
Sbjct: 778  FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 837

Query: 734  VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
            VGTA V PEE EP +GRI+VF   DGKLQ +AEKE KGAVYS+  FNGKLLA+IN  ++L
Sbjct: 838  VGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 897

Query: 794  YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
            Y+W       +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK  EG  EE A
Sbjct: 898  YEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 953

Query: 854  RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
            RD+N NWMSAVEILDDD +LGAEN FNLF  +K+S   TDEER  L+ VG +HLGEFVN 
Sbjct: 954  RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 1013

Query: 914  FRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGG 973
            F HGSLVM                                       Q  L KVIK VG 
Sbjct: 1014 FCHGSLVM---------------------------------------QNRLNKVIKSVGK 1034

Query: 974  LNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN----------VSVE 1023
            + H  WRSF+ E+KT  A  F+DGDLIESFLD+SR +M E+   +            + +
Sbjct: 1035 IEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATAD 1094

Query: 1024 ELCKRVEELTRLH 1036
            +L K VEELTR+H
Sbjct: 1095 DLIKVVEELTRIH 1107


>gi|298711490|emb|CBJ26578.1| n/a [Ectocarpus siliculosus]
          Length = 1135

 Score = 1105 bits (2857), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 560/1065 (52%), Positives = 740/1065 (69%), Gaps = 47/1065 (4%)

Query: 2    LDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGR 61
            LDVP+YG IAT+E +R   E  + +F+ TERY+FCVLQ+D    E+ TR+ G V DRIGR
Sbjct: 53   LDVPLYGAIATMEAYRLADENCERIFVLTERYQFCVLQYDVSRQEIRTRSSGSVKDRIGR 112

Query: 62   PTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGCA 121
              DN +IG++DP  R+IGLHLY+G FKVIP D KGQLK+AFNIRLEEL+VLDI+FL GC 
Sbjct: 113  AIDNSKIGVMDPQSRMIGLHLYEGYFKVIPMDAKGQLKDAFNIRLEELEVLDIQFLSGCP 172

Query: 122  KPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIG 181
            K TI VLYQD ++ARH+KTY ++ +DK+F  GPW+Q N+++ A  LIPVP P  GVLI+G
Sbjct: 173  KATIAVLYQDQRNARHIKTYTISTRDKEFDTGPWAQLNVEHNASELIPVPAPFGGVLILG 232

Query: 182  EETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHE--KEKVTG 239
             +TI Y S  AF  IPI+ +   AYG VDADGSR L+ DH+G LH++++T +     V  
Sbjct: 233  HQTICYHSGKAFITIPIQNTRMCAYGWVDADGSRLLVSDHSGGLHVVILTPDATNTAVET 292

Query: 240  LKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPI 299
              IE LGETS AS+ISYLDN VV+IGS+ GDSQLIKLN + DA+G+Y++VLE Y NLGPI
Sbjct: 293  AHIEALGETSCASSISYLDNGVVFIGSASGDSQLIKLNPEKDAQGTYIQVLETYDNLGPI 352

Query: 300  VDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDD 359
            +D CV DL+RQGQGQ VTCSG  KDGSLRI+RNGIGINE A++EL GIKGMWSLR S  +
Sbjct: 353  LDMCVADLDRQGQGQAVTCSGCSKDGSLRIIRNGIGINEHAAIELAGIKGMWSLRPSNTN 412

Query: 360  PFDTFLVVSFISETRILAMNLEDE-----LEETEIEGFCSQTQTLFCHDAIYNQLVQVTS 414
              D +LV +FISETR+LA   EDE     L E EI GF  +  TLFC     N  VQVT 
Sbjct: 413  -HDKYLVQAFISETRVLAFE-EDEDGDHQLAEGEIAGF-QEGCTLFCGCVGGNMAVQVTK 469

Query: 415  GSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHA 474
              V L+     +  + W  P   ++ VA+ NA++V+LA GGG+LV+LE+       V+ A
Sbjct: 470  RGVVLICCDGLQEIDRWDPPTDLNITVASGNATRVVLALGGGNLVHLEVDATAKKLVQKA 529

Query: 475  --QLEYEISCLDINPIGENP-----------------SYSQIAAVGMWTDISVRIFSLPD 515
              QL+ EI+C+ +NP    P                     + AVGMWTD++VR+ SLPD
Sbjct: 530  RVQLDNEIACISLNPPSNQPVSNAEPATTAMECDEESKLDSLVAVGMWTDMTVRLLSLPD 589

Query: 516  LNLITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNM---KTGELTDRKKVS 572
            L  ++ + LGG+   RSV+L     + YL   LGDGH+++F L +    T  L   KKV+
Sbjct: 590  LQGVSSQPLGGDTQARSVILATIADVHYLFVGLGDGHVVSFPLEVTAESTLALGTPKKVA 649

Query: 573  LGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDS 632
            LGTQP+ L  F +     VF ASDRP VIY S  KLLY+NVN+ EV+ +C F+S+  P  
Sbjct: 650  LGTQPVGLACFRNNGMVCVFVASDRPAVIYCSGGKLLYANVNMGEVNSVCSFDSSELPHC 709

Query: 633  LAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCA---EES 689
            LA+A E  LTIGTIDDIQKLHI+ + LGE P+RI H +  R F I +   ++     EE 
Sbjct: 710  LALASENSLTIGTIDDIQKLHIQKVSLGEAPQRITHHDSGRMFGIITTSYRAVENSDEEE 769

Query: 690  EMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKG 749
            E +FV+ LDD  FE +  +PLD FE G S++SC F++D   Y  VGT YV  +E EP  G
Sbjct: 770  EHNFVKFLDDTNFEELYCHPLDAFENGSSMVSCVFANDKKEYLVVGTGYVREDECEPAVG 829

Query: 750  RILVFIVE----DGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRE 805
            R+LVF VE    + K+ L AE ET+GAVY LN FNGKLLA IN K+QL++W+ +DDG +E
Sbjct: 830  RLLVFSVEGQGAERKVDLAAEVETRGAVYVLNGFNGKLLACINSKVQLFRWIEKDDGIQE 889

Query: 806  LQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVE 865
            LQ+ECG+HGHILAL++Q+RGDFI+VGDLM+S+SLL+YK  +GAIEE ARDY+ANWM+AVE
Sbjct: 890  LQTECGYHGHILALHMQSRGDFIIVGDLMRSVSLLVYKAVDGAIEEVARDYHANWMTAVE 949

Query: 866  ILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLP- 924
            +L+DD+Y+G E + N+FT+R+N++ AT+EER RLE+ GE+HLGEFVN+F  GSL+M+   
Sbjct: 950  MLNDDVYIGGEADCNIFTLRRNADAATEEERARLEIQGEFHLGEFVNKFCRGSLLMQSSE 1009

Query: 925  -------DSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHE 977
                   DS + +   ++FGTVNG++G I +L  + + FL +LQT + KV+KGVGG +H+
Sbjct: 1010 VNSPGGMDSPLVKGQPLLFGTVNGMVGTILTLTEDNHRFLAQLQTAMTKVVKGVGGFSHD 1069

Query: 978  QWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSV 1022
            +WRSF N ++T  + NF+DGDL+ES+LD+ R   +E+ + ++  V
Sbjct: 1070 EWRSFTNGRRTSPSSNFIDGDLVESYLDMPRHNQEEVLRHVDTPV 1114


>gi|156389050|ref|XP_001634805.1| predicted protein [Nematostella vectensis]
 gi|156221892|gb|EDO42742.1| predicted protein [Nematostella vectensis]
          Length = 1157

 Score = 1098 bits (2841), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 566/1114 (50%), Positives = 756/1114 (67%), Gaps = 86/1114 (7%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            MLDV +YG +  +ELFRP  E QD LFI T +Y+ C+L +  E+ +++TRA GDV DRIG
Sbjct: 52   MLDVGLYGSVGVMELFRPPNEPQDLLFILTAKYRVCILGYRKETGDIVTRACGDVQDRIG 111

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPF--DNKGQLKEAFNIRLEELQVLDIKFLY 118
            RP+D GQIGIIDP CR+IGL LYDGLFKVIP   D+  +LK AFNIRLEEL V+DI+FLY
Sbjct: 112  RPSDTGQIGIIDPSCRVIGLRLYDGLFKVIPLELDSDKELK-AFNIRLEELHVVDIQFLY 170

Query: 119  GCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVL 178
            GCA PTIV +YQD    RHVKTYE+ L+D +F +GPW Q+N++  A  +I VP PL G L
Sbjct: 171  GCANPTIVFIYQD-PHGRHVKTYEINLRDHEFAKGPWKQDNVEVEACRVIAVPNPLGGAL 229

Query: 179  IIGEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEK- 234
            IIG+E+I Y   + + AI    ++ S    +G++D +GSRYLLGD  G L++L++  ++ 
Sbjct: 230  IIGQESITYHKGSNYHAIAPPALKQSSLTCHGKIDTNGSRYLLGDMNGRLYMLLLERQEL 289

Query: 235  ----EKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVL 290
                 +V  LK+E+LGETSIA  + YLDN VV+IGS  GDSQL KL+ +PDA GSYV+V+
Sbjct: 290  IDGTYEVKDLKLEMLGETSIAHCLVYLDNGVVFIGSMLGDSQLAKLSTEPDADGSYVQVM 349

Query: 291  ERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGM 350
            E + NLGPIVD  VVDLERQGQGQ+VTCSGA K+GSLRI+RNGIGI+E A+++L GI G+
Sbjct: 350  ETFTNLGPIVDMVVVDLERQGQGQLVTCSGAKKEGSLRIIRNGIGIHEHATIDLAGIMGI 409

Query: 351  WSLR-SSTDDPFDTFLVVSFISETR---------ILAMNLEDELEETEIEGFCSQTQTLF 400
            W+L+   T   +D  LV+SF+ ++R         +L+++ E E+EETEI GF    QT +
Sbjct: 410  WALKLRKTQQEYDDTLVLSFVGQSRRVLCTIHSRVLSLSGE-EVEETEIPGFSDDQQTYY 468

Query: 401  CHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVY 460
              +    QL+QVT+ SVRLV+  +R+L +EWK P   +++VA+ N  QV+ A G   L Y
Sbjct: 469  SGNVTGAQLIQVTAASVRLVNCETRQLVSEWKHPSAKNISVASCNTEQVVAAVGS-ELYY 527

Query: 461  LEIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLIT 520
            +EI  G L ++    LEYE++CLDI P  E  S + + AVG+WTDISVR+  LP LN + 
Sbjct: 528  IEIMPGELRQISQVTLEYEVACLDITPTVEGKSRADMVAVGLWTDISVRVLQLPKLNQLH 587

Query: 521  KEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITL 580
             + LGGEIIPRS+L  AFEGI YLLCALGDG L  F ++  TG L + KKV+LGTQP  L
Sbjct: 588  VQMLGGEIIPRSILKTAFEGIHYLLCALGDGTLFYFTMDPSTGALAECKKVTLGTQPTML 647

Query: 581  RTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGE 640
            RTF S +T +VFA SDRPTVIYSSN KL++SNVNLKEV+ MCP NS  +PDSLA+A +G 
Sbjct: 648  RTFKSLSTVNVFACSDRPTVIYSSNHKLVFSNVNLKEVNFMCPLNSQGYPDSLALANDGS 707

Query: 641  LTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQ----------------- 683
            LTIGTID+IQKLHIR++PLGE PRRI +QE ++TF + S++ +                 
Sbjct: 708  LTIGTIDEIQKLHIRTVPLGESPRRIAYQEATQTFGVISVRIEISEPGTSGTVPLHPSAS 767

Query: 684  --------------------------SCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGC 717
                                      +  +E E+  + ++D  TFE    + L   E   
Sbjct: 768  TTAHNVSSSVGTGVSGSASSSAPDGITFGDEMEIGSLLIIDQHTFEVTHAHQLHDNEQAT 827

Query: 718  SILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLN 777
            S++SC+ SDD + YYCVGTAYV PEE EP  GR+L+F + +GKL  +AEKE KGAVYSL 
Sbjct: 828  SLMSCTLSDDPHTYYCVGTAYVFPEEPEPKAGRLLLFHLSEGKLVQVAEKEVKGAVYSLV 887

Query: 778  AFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSI 837
             FNGK+LA IN  + +++W       +E + EC ++ +ILALY++T+GDFI+VGDLM+S+
Sbjct: 888  EFNGKVLAGINSTVSIFEWT----ADKEFRYECSYYDNILALYLKTKGDFILVGDLMRSM 943

Query: 838  SLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERG 897
            +LL+Y   EG+ +E A D++  WM+A+EILDDD +LGAEN++NLFT  K+S   TDEER 
Sbjct: 944  TLLVYLPLEGSFQEIAHDFSPKWMTAIEILDDDTFLGAENSYNLFTCTKDSGATTDEERY 1003

Query: 898  RLEVVGEYHLGEFVNRFRHGSLVMRLP-DSDVGQIPTVIFGTVNGVIGVIASLPHEQYLF 956
             L+  G+YHLGEFVN FRHGSLVM  P D+       V+FGTVNG IG++A +  + + F
Sbjct: 1004 HLQDAGQYHLGEFVNVFRHGSLVMEHPGDASTPFQGCVLFGTVNGRIGIVAQIAQDLFNF 1063

Query: 957  LEKLQTNLRKVIKGVGGLNHEQW-----RSFNNEKKTVDAKNFLDGDLIESFLDLSRTRM 1011
            L ++Q  L KVIK VG ++H  +      + ++ +K   A  F+DGDLIESFLDL R RM
Sbjct: 1064 LIQVQKKLNKVIKSVGKIDHSLYPFPHCSNLSHSRKMEPAHGFIDGDLIESFLDLPRARM 1123

Query: 1012 DEISKTMNV---------SVEELCKRVEELTRLH 1036
            +E+   + +         +V++L K VEELTR+H
Sbjct: 1124 EEVVTGLQIDDGGMKKECTVDDLVKTVEELTRIH 1157


>gi|345328202|ref|XP_003431248.1| PREDICTED: DNA damage-binding protein 1-like [Ornithorhynchus
            anatinus]
          Length = 1045

 Score = 1097 bits (2837), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 551/1040 (52%), Positives = 726/1040 (69%), Gaps = 65/1040 (6%)

Query: 55   VSDRIGRPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDI 114
            + DRIGRP++ G IGIIDP+CR+IGL LYDGLFKVIP D   +  +AFNIRLEELQV+D+
Sbjct: 13   LKDRIGRPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELQVIDV 72

Query: 115  KFLYGCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPL 174
            KFLYGC  PTI  +YQD +  RHVKTYEV+L++K+F +GPW Q N++  A ++I VP P 
Sbjct: 73   KFLYGCQAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPF 131

Query: 175  CGVLIIGEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVIT 231
             G +IIG+E+I Y + + + AI    I+ S    + RVD +GSRYLLGD  G L +L++ 
Sbjct: 132  GGAIIIGQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLE 191

Query: 232  HEKE-----KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSY 286
             E++      +  L++ELLGETSIA  ++YLDN VV++GS  GDSQL+KLN+  + +GSY
Sbjct: 192  KEEQMDGAVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSY 251

Query: 287  VEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG 346
            V  +E + NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L G
Sbjct: 252  VVAMETFTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPG 311

Query: 347  IKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIY 406
            IKG+W LRS+++   D  LV+SF+ +TR+L +N E E+EETE+ GF    QT FC +  +
Sbjct: 312  IKGLWPLRSNSNRETDDTLVLSFVGQTRVLMLNGE-EVEETELTGFVDNQQTFFCGNVAH 370

Query: 407  NQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDG 466
             QL+Q+TS SVRLV+   + L +EWK P G +++VA+ N+SQV++A G   L YL+I   
Sbjct: 371  QQLIQITSASVRLVTQEPKALVSEWKEPQGKNISVASCNSSQVVVAVGRA-LYYLQIHPR 429

Query: 467  ILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGG 526
             L ++ H ++E+E++CLDI P+G++   S + A+G+WTDIS RI  LP   L+ KE LGG
Sbjct: 430  ELRQISHTEMEHEVACLDITPLGDSHGMSPLCAIGLWTDISARILKLPSFELLHKEMLGG 489

Query: 527  EIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSK 586
            EIIPRS+L+  FE   YLLCALGDG L  F L+++TG L+DRKKV+LGTQP  LRTF S 
Sbjct: 490  EIIPRSILMTTFESSHYLLCALGDGALFYFGLSIETGLLSDRKKVTLGTQPTVLRTFRSL 549

Query: 587  NTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTI 646
            +TT+VFA SDRPTVIYSSN KL++SNVNLKEV++MCP NS  +PDSLA+A    LTIGTI
Sbjct: 550  STTNVFACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTI 609

Query: 647  DDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------- 681
            D+IQKLHIR++PL E PR+IC+QE S+ F + S +                         
Sbjct: 610  DEIQKLHIRTVPLYESPRKICYQEGSQCFGVLSSRIEVQDASGGTTALRPSGSTQALSSS 669

Query: 682  --------------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDD 727
                            S  EE E+H + ++D  TFE +  +     EY  S++SC    D
Sbjct: 670  VSSSKLFSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKD 729

Query: 728  SNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAI 787
            +N Y+ VGTA V PEE EP +GRI+VF   DGKLQ +AEKE KGAVYS+  FNGKLLA+I
Sbjct: 730  ANTYFIVGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASI 789

Query: 788  NQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEG 847
            N  ++LY+W       +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK  EG
Sbjct: 790  NSTVRLYEWT----AEKELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEG 845

Query: 848  AIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHL 907
              EE ARD+N NWMSAVEILDDD +LGAEN FNLF  +K+S   TDEER  L+ VG +HL
Sbjct: 846  NFEEIARDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHL 905

Query: 908  GEFVNRFRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRK 966
            GEFVN F HGSLVM+ L ++      +V+FGTVNG+IG++ SL    Y  L  +Q  L K
Sbjct: 906  GEFVNVFCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNK 965

Query: 967  VIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV------ 1020
            VIK VG + H  WRSF+ E+KT  A  F+DGDLIESFLD+SR +M E+   + +      
Sbjct: 966  VIKSVGKIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQIDDGSGM 1025

Query: 1021 ----SVEELCKRVEELTRLH 1036
                +V++L K VEELTR+H
Sbjct: 1026 KREATVDDLIKIVEELTRIH 1045


>gi|452824087|gb|EME31092.1| DNA damage-binding protein 1 isoform 1 [Galdieria sulphuraria]
          Length = 1128

 Score = 1094 bits (2829), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 548/1081 (50%), Positives = 756/1081 (69%), Gaps = 48/1081 (4%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            ++++PIYGRI+ ++L R   + QD LF  TE+YKF VL+W+ ++ E  T A GDV DRIG
Sbjct: 51   VIELPIYGRISVMKLCRFPDDLQDSLFFLTEKYKFAVLRWNTQTGECDTIAGGDVHDRIG 110

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
             PT  G IGI DP     GLHLYDGLFKVIP D K   KEAFNIRLEEL VLDI+FLYG 
Sbjct: 111  HPTSAGHIGICDPSMTCFGLHLYDGLFKVIPTDFK---KEAFNIRLEELDVLDIQFLYGH 167

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
            +KPTI VLY D+++ RH+KTY V+LKDKDF  GP  Q NL++GA +LIPVP P+ GV+++
Sbjct: 168  SKPTIAVLYTDSEENRHLKTYTVSLKDKDFGNGPLFQGNLESGASMLIPVPTPIGGVVVL 227

Query: 181  GEETIVYCSANAFK---AIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVI------- 230
            G+ET+ Y S +  +   +IP+  +I +AYGR+D DG+RYLLGD  G+L+LLV+       
Sbjct: 228  GQETVTYISGSGLRGYHSIPVSATIFRAYGRIDKDGTRYLLGDEKGILYLLVLEQSTSLS 287

Query: 231  --THEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAK-GSYV 287
              T  + K+TGLKI+ LGETS+ STI YLDN  VYIGS +GDSQLI+L  +P  + GS++
Sbjct: 288  TFTETETKITGLKIQTLGETSLPSTIDYLDNGFVYIGSCHGDSQLIRLLSEPHPETGSFL 347

Query: 288  EVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI 347
            EV+E Y NL PIVDFCV+D ERQGQGQVVTCSGA KDGSLRI+RNGIGI+EQASVE+ G+
Sbjct: 348  EVMETYPNLSPIVDFCVMDAERQGQGQVVTCSGAAKDGSLRIIRNGIGIHEQASVEVPGV 407

Query: 348  KGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYN 407
            K ++SL+ S+     + L++SF SE+R+L +   + + E     F  Q  TL+C + + +
Sbjct: 408  KELFSLKRSSLSSQHSLLLLSFASESRVLELVSTELMAEANFPVFEMQEPTLYCGNVVGD 467

Query: 408  QLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGI 467
             +VQ+T    RL+S     + NEW  P G+ ++VA+ N+ Q+++AT GG+L+Y +I    
Sbjct: 468  CIVQITPSKARLISCEDMSIVNEWHPPSGFRISVASGNSMQLIIATTGGNLIYFDIDANP 527

Query: 468  --LTEVKHAQLEYEISCLDINPIGE-------NPSYSQIAAVGMWTDISVRIFSLPDLNL 518
              + E  +  LEYEI  LDI+P+G+           S   AVGMWT++S+R +SLP L+L
Sbjct: 528  KRIMEKSYKSLEYEICSLDISPLGQAGMNLASQAIPSSFVAVGMWTEVSIRFYSLPSLDL 587

Query: 519  ITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGE-------LTDRKKV 571
            I  E LG ++I RS+L    +G  Y L ALGDG LL + L+    +       L D++++
Sbjct: 588  IHTEKLGLDVIARSLLFVTMDGEDYFLAALGDGRLLTYRLDKSAKDTDSEKKFLYDQRQM 647

Query: 572  SLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNK--KLLYSNVNLKEVSHMCPFNSAAF 629
            S+GTQP +L  F ++N  HVFAA DRPTVI+SS+   KLL SNVNL+EV+ +C F+S AF
Sbjct: 648  SIGTQPASLSIFETQNALHVFAACDRPTVIHSSSGGGKLLCSNVNLREVTRVCSFSSEAF 707

Query: 630  PDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLK-------- 681
            PD LA+  EG L +GT+D+IQKLHIR+IPLGE PRRI H +    FA+ + K        
Sbjct: 708  PDCLALVTEGSLLLGTVDNIQKLHIRTIPLGEQPRRIAHLDTHHVFAVLTTKQVVTISED 767

Query: 682  -NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNV-----YYCVG 735
             N++ +E +E  +VRL+DD   E + +Y L+ FE  CS+++ +F DD+       Y+ VG
Sbjct: 768  GNEALSETTEEGYVRLIDDTMMEIVHSYKLEQFETPCSVITVNFGDDAAAKDNQDYFVVG 827

Query: 736  TAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYK 795
            TAY   +E EP++GR+LVF V + +L L+AE+  KGA+YS++AFNGK+LA++N  ++L +
Sbjct: 828  TAYSYADEPEPSRGRMLVFAVREQRLTLVAERTFKGALYSMDAFNGKILASVNSMLKLVR 887

Query: 796  WMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARD 855
            W   + G R L  EC +HG I  L ++  GDFI++GDL++S+SLL YK   G IE+ ARD
Sbjct: 888  WSETESGARTLTEECTYHGSIFILQIKCLGDFILIGDLVRSVSLLAYKPMNGTIEDVARD 947

Query: 856  YNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFR 915
             + +W++ +E+LD D Y+ AEN FNLFT+++NS+ +T+EER RLE VGEYHLGE VNR R
Sbjct: 948  IDPSWITVIEMLDLDYYISAENCFNLFTLKRNSDASTEEERSRLEKVGEYHLGELVNRIR 1007

Query: 916  HGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLN 975
            HG LV+++P+S +  + ++++GT NG +GVIAS+  + + FL  LQT L +VIKGVGG+ 
Sbjct: 1008 HGRLVLQIPESGISILKSLLYGTANGALGVIASIDEKTFQFLHSLQTALNEVIKGVGGIQ 1067

Query: 976  HEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRL 1035
            HE WR F +E++  D+KNFLDGDLIE FLDLSR +M+ ++K +NV VEEL K+VEELTR+
Sbjct: 1068 HEDWRRFTSERRIGDSKNFLDGDLIERFLDLSRDKMELVAKKVNVPVEELAKQVEELTRI 1127

Query: 1036 H 1036
            H
Sbjct: 1128 H 1128


>gi|449519304|ref|XP_004166675.1| PREDICTED: DNA damage-binding protein 1a-like [Cucumis sativus]
          Length = 596

 Score = 1091 bits (2822), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 526/593 (88%), Positives = 562/593 (94%), Gaps = 5/593 (0%)

Query: 449  VLLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISV 508
            VLLATGGG LV+LEI DG+L E KH QLE+EISCLDINPIG+NP+ SQ+AAVGMWTDISV
Sbjct: 4    VLLATGGGVLVHLEICDGLLVEKKHIQLEHEISCLDINPIGDNPNCSQLAAVGMWTDISV 63

Query: 509  RIFSLPDLNLITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTG----E 564
            RIFSLPDLNL+TKE LGGEIIPRSVLLC FEG+ + LCALGDGHLLNF+LN  +     E
Sbjct: 64   RIFSLPDLNLLTKEQLGGEIIPRSVLLCTFEGVCFCLCALGDGHLLNFILNTNSNSNSCE 123

Query: 565  LTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPF 624
            L DRKKVSLGTQPITLRTFSSKN THVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPF
Sbjct: 124  LMDRKKVSLGTQPITLRTFSSKNATHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPF 183

Query: 625  NSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLK-NQ 683
            NSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEH RRICHQEQSRTFAICSL+ NQ
Sbjct: 184  NSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHARRICHQEQSRTFAICSLRYNQ 243

Query: 684  SCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEE 743
            S  E++EMHF+RLLDDQTFE ISTY LDT+EYGCSILSCSFSDD+NVYYCVGTAYV+PEE
Sbjct: 244  SGTEDTEMHFIRLLDDQTFESISTYALDTYEYGCSILSCSFSDDNNVYYCVGTAYVMPEE 303

Query: 744  NEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGT 803
            NEPTKGRILVF+VE+GKLQLIAEKETKG+VYSLNAFNGKLLAAINQKIQLYKW LRDDGT
Sbjct: 304  NEPTKGRILVFVVEEGKLQLIAEKETKGSVYSLNAFNGKLLAAINQKIQLYKWTLRDDGT 363

Query: 804  RELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSA 863
            RELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSA
Sbjct: 364  RELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSA 423

Query: 864  VEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRL 923
            VEILDDDIYLGAEN FNLFTVRKNSEGATDEER RLEVVGEYHLGEFVNRF+HGSLVMRL
Sbjct: 424  VEILDDDIYLGAENYFNLFTVRKNSEGATDEERSRLEVVGEYHLGEFVNRFQHGSLVMRL 483

Query: 924  PDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFN 983
            PDSDVGQIPTVIFG+VNGVIGVIASLPH+QY+FLE+LQ+NLRKVIKGVGGL+HEQWRSFN
Sbjct: 484  PDSDVGQIPTVIFGSVNGVIGVIASLPHDQYVFLERLQSNLRKVIKGVGGLSHEQWRSFN 543

Query: 984  NEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRLH 1036
            NEK+T +AKNFLDGDLIESFLDL+R++M+EIS+ M+VS EELCKRVEELTRLH
Sbjct: 544  NEKRTAEAKNFLDGDLIESFLDLNRSKMEEISRAMSVSAEELCKRVEELTRLH 596


>gi|297267724|ref|XP_001082958.2| PREDICTED: DNA damage-binding protein 1 [Macaca mulatta]
          Length = 1092

 Score = 1090 bits (2818), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 546/1024 (53%), Positives = 716/1024 (69%), Gaps = 57/1024 (5%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
            +V +YG+IA +ELFRP GE++D LFI T +Y  C+L++    ES ++ITRA G+V DRIG
Sbjct: 54   EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 113

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RP++ G IGIIDP+CR+IGL LYDGLFKVIP D   +  +AFNIRLEEL V+D+KFLYGC
Sbjct: 114  RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 173

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
              PTI  +YQD +  RHVKTYEV+L++K+F +GPW Q N++  A ++I VP P  G +II
Sbjct: 174  QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232

Query: 181  GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
            G+E+I Y + + + AI    I+ S    + RVD +GSRYLLGD  G L +L++  E++  
Sbjct: 233  GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 292

Query: 236  ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
                +  L++ELLGETSIA  ++YLDN VV++GS  GDSQL+KLN+  + +GSYV  +E 
Sbjct: 293  GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 352

Query: 293  YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
            + NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W 
Sbjct: 353  FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412

Query: 353  LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
            LRS  +   D  LV+SF+ +TR+L +N E E+EETE+ GF    QT FC +  + QL+Q+
Sbjct: 413  LRSDPNRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQI 471

Query: 413  TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
            TS SVRLVS   + L +EWK P   +++VA+ N+SQV++A G   L YL+I    L ++ 
Sbjct: 472  TSASVRLVSQEPKALVSEWKEPQAKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 530

Query: 473  HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
            H ++E+E++CLDI P+G++   S + A+G+WTDIS RI  LP   L+ KE LGGEIIPRS
Sbjct: 531  HTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 590

Query: 533  VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
            +L+  FE   YLLCALGDG L  F LN++TG L+DRKKV+LGTQP  LRTF S +TT+VF
Sbjct: 591  ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650

Query: 593  AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
            A SDRPTVIYSSN KL++SNVNLKEV++MCP NS  +PDSLA+A    LTIGTID+IQKL
Sbjct: 651  ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 710

Query: 653  HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
            HIR++PL E PR+IC+QE S+ F + S +                               
Sbjct: 711  HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGTTALRPSASTQALSSSVSSSKL 770

Query: 682  --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
                      S  EE E+H + ++D  TFE +  +     EY  S++SC    D N Y+ 
Sbjct: 771  FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 830

Query: 734  VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
            VGTA V PEE EP +GRI+VF   DGKLQ +AEKE KGAVYS+  FNGKLLA+IN  ++L
Sbjct: 831  VGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 890

Query: 794  YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
            Y+W       +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK  EG  EE A
Sbjct: 891  YEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 946

Query: 854  RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
            RD+N NWMSAVEILDDD +LGAEN FNLF  +K+S   TDEER  L+ VG +HLGEFVN 
Sbjct: 947  RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 1006

Query: 914  FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
            F HGSLVM+ L ++      +V+FGTVNG+IG++ SL    Y  L  +Q  L KVIK VG
Sbjct: 1007 FCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066

Query: 973  GLNH 976
             + H
Sbjct: 1067 KIEH 1070


>gi|119594339|gb|EAW73933.1| damage-specific DNA binding protein 1, 127kDa, isoform CRA_a [Homo
            sapiens]
          Length = 1094

 Score = 1089 bits (2817), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 546/1024 (53%), Positives = 716/1024 (69%), Gaps = 57/1024 (5%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
            +V +YG+IA +ELFRP GE++D LFI T +Y  C+L++    ES ++ITRA G+V DRIG
Sbjct: 54   EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 113

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RP++ G IGIIDP+CR+IGL LYDGLFKVIP D   +  +AFNIRLEEL V+D+KFLYGC
Sbjct: 114  RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 173

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
              PTI  +YQD +  RHVKTYEV+L++K+F +GPW Q N++  A ++I VP P  G +II
Sbjct: 174  QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232

Query: 181  GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
            G+E+I Y + + + AI    I+ S    + RVD +GSRYLLGD  G L +L++  E++  
Sbjct: 233  GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 292

Query: 236  ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
                +  L++ELLGETSIA  ++YLDN VV++GS  GDSQL+KLN+  + +GSYV  +E 
Sbjct: 293  GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 352

Query: 293  YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
            + NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W 
Sbjct: 353  FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412

Query: 353  LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
            LRS  +   D  LV+SF+ +TR+L +N E E+EETE+ GF    QT FC +  + QL+Q+
Sbjct: 413  LRSDPNRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQI 471

Query: 413  TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
            TS SVRLVS   + L +EWK P   +++VA+ N+SQV++A G   L YL+I    L ++ 
Sbjct: 472  TSASVRLVSQEPKALVSEWKEPQAKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 530

Query: 473  HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
            H ++E+E++CLDI P+G++   S + A+G+WTDIS RI  LP   L+ KE LGGEIIPRS
Sbjct: 531  HTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 590

Query: 533  VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
            +L+  FE   YLLCALGDG L  F LN++TG L+DRKKV+LGTQP  LRTF S +TT+VF
Sbjct: 591  ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650

Query: 593  AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
            A SDRPTVIYSSN KL++SNVNLKEV++MCP NS  +PDSLA+A    LTIGTID+IQKL
Sbjct: 651  ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 710

Query: 653  HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
            HIR++PL E PR+IC+QE S+ F + S +                               
Sbjct: 711  HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGTTALRPSASTQALSSSVSSSKL 770

Query: 682  --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
                      S  EE E+H + ++D  TFE +  +     EY  S++SC    D N Y+ 
Sbjct: 771  FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 830

Query: 734  VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
            VGTA V PEE EP +GRI+VF   DGKLQ +AEKE KGAVYS+  FNGKLLA+IN  ++L
Sbjct: 831  VGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 890

Query: 794  YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
            Y+W       +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK  EG  EE A
Sbjct: 891  YEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 946

Query: 854  RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
            RD+N NWMSAVEILDDD +LGAEN FNLF  +K+S   TDEER  L+ VG +HLGEFVN 
Sbjct: 947  RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 1006

Query: 914  FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
            F HGSLVM+ L ++      +V+FGTVNG+IG++ SL    Y  L  +Q  L KVIK VG
Sbjct: 1007 FCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066

Query: 973  GLNH 976
             + H
Sbjct: 1067 KIEH 1070


>gi|221040048|dbj|BAH11787.1| unnamed protein product [Homo sapiens]
          Length = 1092

 Score = 1089 bits (2817), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 546/1024 (53%), Positives = 715/1024 (69%), Gaps = 57/1024 (5%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
            +V +YG+IA +ELFRP GE++D LFI T +Y  C+L++    ES ++ITRA G+V DRIG
Sbjct: 54   EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 113

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RP++ G IGIIDP+CR+IGL LYDGLFKVIP D   +  +AFNIRLEEL V+D+KFLYGC
Sbjct: 114  RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 173

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
              PTI  +YQD +  RHVKTYEV+L++K+F +GPW Q N++  A ++I VP P  G +II
Sbjct: 174  QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232

Query: 181  GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
            G+E+I Y + + + AI    I+ S    + RVD  GSRYLLGD  G L +L++  E++  
Sbjct: 233  GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPHGSRYLLGDMEGRLFMLLLEKEEQMD 292

Query: 236  ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
                +  L++ELLGETSIA  ++YLDN VV++GS  GDSQL+KLN+  + +GSYV  +E 
Sbjct: 293  GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 352

Query: 293  YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
            + NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W 
Sbjct: 353  FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412

Query: 353  LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
            LRS  +   D  LV+SF+ +TR+L +N E E+EETE+ GF    QT FC +  + QL+Q+
Sbjct: 413  LRSDPNRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQI 471

Query: 413  TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
            TS SVRLVS   + L +EWK P   +++VA+ N+SQV++A G   L YL+I    L ++ 
Sbjct: 472  TSASVRLVSQEPKALVSEWKEPQAKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 530

Query: 473  HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
            H ++E+E++CLDI P+G++   S + A+G+WTDIS RI  LP   L+ KE LGGEIIPRS
Sbjct: 531  HTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 590

Query: 533  VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
            +L+  FE   YLLCALGDG L  F LN++TG L+DRKKV+LGTQP  LRTF S +TT+VF
Sbjct: 591  ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650

Query: 593  AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
            A SDRPTVIYSSN KL++SNVNLKEV++MCP NS  +PDSLA+A    LTIGTID+IQKL
Sbjct: 651  ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 710

Query: 653  HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
            HIR++PL E PR+IC+QE S+ F + S +                               
Sbjct: 711  HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGTTALRPSASTQALSSSVSSSKL 770

Query: 682  --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
                      S  EE E+H + ++D  TFE +  +     EY  S++SC    D N Y+ 
Sbjct: 771  FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 830

Query: 734  VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
            VGTA V PEE EP +GRI+VF   DGKLQ +AEKE KGAVYS+  FNGKLLA+IN  ++L
Sbjct: 831  VGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 890

Query: 794  YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
            Y+W       +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK  EG  EE A
Sbjct: 891  YEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 946

Query: 854  RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
            RD+N NWMSAVEILDDD +LGAEN FNLF  +K+S   TDEER  L+ VG +HLGEFVN 
Sbjct: 947  RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 1006

Query: 914  FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
            F HGSLVM+ L ++      +V+FGTVNG+IG++ SL    Y  L  +Q  L KVIK VG
Sbjct: 1007 FCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066

Query: 973  GLNH 976
             + H
Sbjct: 1067 KIEH 1070


>gi|452824086|gb|EME31091.1| DNA damage-binding protein 1 isoform 2 [Galdieria sulphuraria]
          Length = 1150

 Score = 1081 bits (2796), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 548/1103 (49%), Positives = 756/1103 (68%), Gaps = 70/1103 (6%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            ++++PIYGRI+ ++L R   + QD LF  TE+YKF VL+W+ ++ E  T A GDV DRIG
Sbjct: 51   VIELPIYGRISVMKLCRFPDDLQDSLFFLTEKYKFAVLRWNTQTGECDTIAGGDVHDRIG 110

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
             PT  G IGI DP     GLHLYDGLFKVIP D K   KEAFNIRLEEL VLDI+FLYG 
Sbjct: 111  HPTSAGHIGICDPSMTCFGLHLYDGLFKVIPTDFK---KEAFNIRLEELDVLDIQFLYGH 167

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
            +KPTI VLY D+++ RH+KTY V+LKDKDF  GP  Q NL++GA +LIPVP P+ GV+++
Sbjct: 168  SKPTIAVLYTDSEENRHLKTYTVSLKDKDFGNGPLFQGNLESGASMLIPVPTPIGGVVVL 227

Query: 181  GEETIVYCSANAFK---AIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVI------- 230
            G+ET+ Y S +  +   +IP+  +I +AYGR+D DG+RYLLGD  G+L+LLV+       
Sbjct: 228  GQETVTYISGSGLRGYHSIPVSATIFRAYGRIDKDGTRYLLGDEKGILYLLVLEQSTSLS 287

Query: 231  --THEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAK-GSYV 287
              T  + K+TGLKI+ LGETS+ STI YLDN  VYIGS +GDSQLI+L  +P  + GS++
Sbjct: 288  TFTETETKITGLKIQTLGETSLPSTIDYLDNGFVYIGSCHGDSQLIRLLSEPHPETGSFL 347

Query: 288  EVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI 347
            EV+E Y NL PIVDFCV+D ERQGQGQVVTCSGA KDGSLRI+RNGIGI+EQASVE+ G+
Sbjct: 348  EVMETYPNLSPIVDFCVMDAERQGQGQVVTCSGAAKDGSLRIIRNGIGIHEQASVEVPGV 407

Query: 348  KGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYN 407
            K ++SL+ S+     + L++SF SE+R+L +   + + E     F  Q  TL+C + + +
Sbjct: 408  KELFSLKRSSLSSQHSLLLLSFASESRVLELVSTELMAEANFPVFEMQEPTLYCGNVVGD 467

Query: 408  QLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGI 467
             +VQ+T    RL+S     + NEW  P G+ ++VA+ N+ Q+++AT GG+L+Y +I    
Sbjct: 468  CIVQITPSKARLISCEDMSIVNEWHPPSGFRISVASGNSMQLIIATTGGNLIYFDIDANP 527

Query: 468  --LTEVKHAQLEYEISCLDINPIGE-------NPSYSQIAAVGMWTDISVRIFSLPDLNL 518
              + E  +  LEYEI  LDI+P+G+           S   AVGMWT++S+R +SLP L+L
Sbjct: 528  KRIMEKSYKSLEYEICSLDISPLGQAGMNLASQAIPSSFVAVGMWTEVSIRFYSLPSLDL 587

Query: 519  ITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGE-------LTDRKKV 571
            I  E LG ++I RS+L    +G  Y L ALGDG LL + L+    +       L D++++
Sbjct: 588  IHTEKLGLDVIARSLLFVTMDGEDYFLAALGDGRLLTYRLDKSAKDTDSEKKFLYDQRQM 647

Query: 572  SLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNK--KLLYSNVNLKEVSHMCPFNSAAF 629
            S+GTQP +L  F ++N  HVFAA DRPTVI+SS+   KLL SNVNL+EV+ +C F+S AF
Sbjct: 648  SIGTQPASLSIFETQNALHVFAACDRPTVIHSSSGGGKLLCSNVNLREVTRVCSFSSEAF 707

Query: 630  PDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLK-------- 681
            PD LA+  EG L +GT+D+IQKLHIR+IPLGE PRRI H +    FA+ + K        
Sbjct: 708  PDCLALVTEGSLLLGTVDNIQKLHIRTIPLGEQPRRIAHLDTHHVFAVLTTKQVVTISED 767

Query: 682  -NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNV-----YYCVG 735
             N++ +E +E  +VRL+DD   E + +Y L+ FE  CS+++ +F DD+       Y+ VG
Sbjct: 768  GNEALSETTEEGYVRLIDDTMMEIVHSYKLEQFETPCSVITVNFGDDAAAKDNQDYFVVG 827

Query: 736  TAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYK 795
            TAY   +E EP++GR+LVF V + +L L+AE+  KGA+YS++AFNGK+LA++N  ++L +
Sbjct: 828  TAYSYADEPEPSRGRMLVFAVREQRLTLVAERTFKGALYSMDAFNGKILASVNSMLKLVR 887

Query: 796  WMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARD 855
            W   + G R L  EC +HG I  L ++  GDFI++GDL++S+SLL YK   G IE+ ARD
Sbjct: 888  WSETESGARTLTEECTYHGSIFILQIKCLGDFILIGDLVRSVSLLAYKPMNGTIEDVARD 947

Query: 856  YNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFR 915
             + +W++ +E+LD D Y+ AEN FNLFT+++NS+ +T+EER RLE VGEYHLGE VNR R
Sbjct: 948  IDPSWITVIEMLDLDYYISAENCFNLFTLKRNSDASTEEERSRLEKVGEYHLGELVNRIR 1007

Query: 916  HGSLVMRLPDSDVGQIPTVIF----------------------GTVNGVIGVIASLPHEQ 953
            HG LV+++P+S +  + ++++                      GT NG +GVIAS+  + 
Sbjct: 1008 HGRLVLQIPESGISILKSLLYGMYICFDDNLKELFMHKYRFNLGTANGALGVIASIDEKT 1067

Query: 954  YLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDE 1013
            + FL  LQT L +VIKGVGG+ HE WR F +E++  D+KNFLDGDLIE FLDLSR +M+ 
Sbjct: 1068 FQFLHSLQTALNEVIKGVGGIQHEDWRRFTSERRIGDSKNFLDGDLIERFLDLSRDKMEL 1127

Query: 1014 ISKTMNVSVEELCKRVEELTRLH 1036
            ++K +NV VEEL K+VEELTR+H
Sbjct: 1128 VAKKVNVPVEELAKQVEELTRIH 1150


>gi|242010743|ref|XP_002426118.1| DNA damage-binding protein, putative [Pediculus humanus corporis]
 gi|212510165|gb|EEB13380.1| DNA damage-binding protein, putative [Pediculus humanus corporis]
          Length = 1148

 Score = 1075 bits (2779), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 540/1106 (48%), Positives = 748/1106 (67%), Gaps = 83/1106 (7%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
            ++ +Y RI+ ++ FRP  E +D LFI T RY   +L+   D E+ E+ T+A G+V+DRIG
Sbjct: 54   EIGLYDRISVVKFFRPPHEHKDLLFIVTTRYNAMILECIGDGENIEIKTKAHGNVADRIG 113

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            + ++ G I +IDP+ R+IGL LYDGL K+IP        +A +IR+EE++V D+ FL+GC
Sbjct: 114  KQSETGIIAVIDPEARVIGLRLYDGLLKIIPLGKDNSELKASSIRMEEVEVQDLNFLHGC 173

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
              PTI++++QD  + RHVKT+E++L+DK+FV+ PW Q+N++  A ++IPVP PLCG +II
Sbjct: 174  QNPTIILIHQD-INGRHVKTHEISLRDKEFVKMPWKQDNVEPDASIVIPVPEPLCGAIII 232

Query: 181  GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
            G+E+I+Y +   + A+    I  S    Y +VD++GSRYLLGD AG L +L++  E++  
Sbjct: 233  GQESILYHNGAGYVAVAPPVINQSTITCYTQVDSNGSRYLLGDMAGHLFMLLLETEEKID 292

Query: 236  -----KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVL 290
                 K  GLK+ELLGE SI   I+YLDN V++IGS  GDSQL+KLN  PD  G YV ++
Sbjct: 293  GTPCVKENGLKVELLGEISIPEAITYLDNGVLFIGSRCGDSQLVKLNTSPDENGYYVTIM 352

Query: 291  ERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGM 350
            E + NL PI+D  VVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI E A+++L GIKGM
Sbjct: 353  EAFTNLAPILDMVVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIQEHATIDLLGIKGM 412

Query: 351  WSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLV 410
            W+LR+S D  +D  +V++F+ +TRIL +N E E+EETEI GF S  Q+ +C +   N ++
Sbjct: 413  WALRASLDSVYDNTIVLAFVGQTRILTLNGE-EVEETEIPGFLSDQQSFYCGNVENNNMI 471

Query: 411  QVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTE 470
            Q+T    RL+S  +++L +EW+ P G +++V   N  Q + A  G  L YLEI    L +
Sbjct: 472  QLTPTCARLISVETKQLVSEWRPPAGKTISVVACNTVQAICA-AGSDLYYLEILKNELVQ 530

Query: 471  VKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIP 530
              +  LEYE++CLDI P+ E    + I AVG+WTDIS RI  LPDL  + +E+LGGEIIP
Sbjct: 531  KGNTTLEYEVACLDITPLSEGGKTADIIAVGLWTDISARILKLPDLEELNREYLGGEIIP 590

Query: 531  RSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTH 590
            RS+L+  FE I+YLLCALGDG +  F LN + G L+D+KKV+LGTQP  LRTF S +TT+
Sbjct: 591  RSILMTCFENINYLLCALGDGSMFYFSLNNQNGILSDKKKVTLGTQPTVLRTFRSLSTTN 650

Query: 591  VFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQ 650
            VFA SDRPTVIYSSN KL++SNVNLKEV+HMC  NS A+PDSLA+A +  +TIGTID+IQ
Sbjct: 651  VFACSDRPTVIYSSNHKLVFSNVNLKEVNHMCSLNSEAYPDSLALATDSTVTIGTIDEIQ 710

Query: 651  KLHIRSIPLGEHPRRICHQEQSRTFAICSLK------------NQSCA------------ 686
            KLHIR++PLGE PRRI +QE ++TF + +++             QS +            
Sbjct: 711  KLHIRTVPLGESPRRIAYQETTQTFGVITMRMDVHQRSGLIPVKQSASTQAQSISSSSNI 770

Query: 687  ---------------------EESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFS 725
                                 +E E+H + ++D  TFE +  +     EY  S++S    
Sbjct: 771  GGAHNLKSGPAASINSFAEFGQEVEVHNLLVIDQHTFEVLHAHQFMQSEYALSLISTKLG 830

Query: 726  DDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLA 785
            DD N YY VGTA V P+E+E  +GRIL+F  ++GKL  +AEKE KGA YSL  FNGKLLA
Sbjct: 831  DDPNTYYIVGTAMVNPDESESKQGRILIFQFQEGKLYQVAEKEIKGAAYSLVEFNGKLLA 890

Query: 786  AINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHE 845
            +IN  ++L++W       +EL+ EC H  +I++LY++T+GDFI+VGDL++S++LL YK  
Sbjct: 891  SINSTVRLFEWT----AEQELRLECSHFNNIISLYLKTKGDFILVGDLIRSMTLLQYKTM 946

Query: 846  EGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEY 905
            EG  EE ARD+N NWM+AVEI+DDD +LGAEN+FNLF  +K+S  ATDEER ++  VG +
Sbjct: 947  EGCFEEMARDHNPNWMTAVEIIDDDTFLGAENSFNLFVCQKDSAAATDEERQQMHAVGMF 1006

Query: 906  HLGEFVNRFRHGSLVMRLPDSDVGQIPT-----VIFGTVNGVIGVIASLPHEQYLFLEKL 960
            HLG+ VN FRHGSLVM+    +VG+  T     ++FGTV+G IG++  +    Y FL +L
Sbjct: 1007 HLGDMVNVFRHGSLVMQ----NVGETSTPTTGCILFGTVSGAIGLVTQISANFYNFLHEL 1062

Query: 961  QTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV 1020
            +  L +VIK VG + H  WRSF  E KT     F+DGDLIESFLDLS  +M E++  + +
Sbjct: 1063 ECKLTEVIKSVGKIKHSFWRSFTTEIKTEPCDGFIDGDLIESFLDLSHEKMKEVAAGLQI 1122

Query: 1021 ----------SVEELCKRVEELTRLH 1036
                      +V++L K VE+LTR+H
Sbjct: 1123 DNGSGMKQEATVDDLVKMVEDLTRIH 1148


>gi|91087281|ref|XP_975549.1| PREDICTED: similar to conserved hypothetical protein [Tribolium
            castaneum]
 gi|270010588|gb|EFA07036.1| hypothetical protein TcasGA2_TC010010 [Tribolium castaneum]
          Length = 1149

 Score = 1070 bits (2766), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 533/1105 (48%), Positives = 755/1105 (68%), Gaps = 80/1105 (7%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
            +V +YG++A ++LFRP  E +D LFI T RY   +L+   DA++ ++IT+A G+V+D+IG
Sbjct: 54   EVALYGKVAVMKLFRPQHEKKDLLFIVTMRYNAMILECVNDADNLDIITKAHGNVADKIG 113

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            +P++ G + +IDP  R+IGL LYDGLFK+IP +      +A NIR++ELQV D++FL+GC
Sbjct: 114  KPSETGILAVIDPKARVIGLRLYDGLFKIIPLEKDNSELKASNIRIDELQVHDVEFLHGC 173

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
            A PT+++++QD  + RHVKT+E++L++K+FV+ PW Q+N++  A ++IPVP PL G +II
Sbjct: 174  ANPTLILIHQD-VNGRHVKTHEISLREKEFVKVPWRQDNVETEASMIIPVPSPLGGAIII 232

Query: 181  GEETIVYCSANAFKAIPIRPSITK-----AYGRVDADGSRYLLGDHAGLLHLLVITHEK- 234
            G+E I+Y   +    + + P++ K      Y +VD  G RYLLGD AG L +L +  +  
Sbjct: 233  GQENILY--HDGITPVVVAPAVIKQSTIVCYAKVDPGGLRYLLGDMAGHLFMLFLEVDNR 290

Query: 235  ----EKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVL 290
                + V  LK+ELLGE +    I+YLDN V++IGS  GDSQL+KL  +P+  GSYV V+
Sbjct: 291  GDGNDVVKDLKVELLGEIATPECITYLDNGVLFIGSRLGDSQLVKLTTKPNESGSYVTVM 350

Query: 291  ERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGM 350
            E + NL PI+D CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI E AS++L GIKGM
Sbjct: 351  ESFTNLAPILDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIQEHASIDLPGIKGM 410

Query: 351  WSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLV 410
            W+L+ ++D  +D  LV++F+ +TR+L++N E E+EET+I GF S  QT FC + I+ Q+V
Sbjct: 411  WALQVASDGRYDNTLVLAFVGQTRVLSLNGE-EVEETDIAGFASDQQTFFCGNVIHEQIV 469

Query: 411  QVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTE 470
            Q+T  S RL+S+ ++ L  EWK P   ++ V   N  Q++++TG   L Y+EI    L  
Sbjct: 470  QITPISARLISAQNKTLLAEWKPPSDKNIAVVACNTCQIVVSTGS-ILYYIEIHQNELIM 528

Query: 471  VKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIP 530
                 L+ E++CLDI+P+G+  + S+  AVG+WTDI+ RI  +PDL+  TKE+LGGEIIP
Sbjct: 529  KGVTTLDVEVACLDISPLGDGVTTSEYIAVGLWTDITARILRIPDLSEATKEYLGGEIIP 588

Query: 531  RSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTH 590
            RSVL+  FEG SYLLCALGDG +  F+ +  TG L+D+K+V+LGTQP  LRTF S +TT+
Sbjct: 589  RSVLMTCFEGHSYLLCALGDGSMFYFVFHKDTGTLSDKKRVTLGTQPTVLRTFRSLSTTN 648

Query: 591  VFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQ 650
            VFA SDRPTVIYSSN KL++SNVN+KEV+HMC  N+ A+PDSLA+A +  +TIGTID+IQ
Sbjct: 649  VFACSDRPTVIYSSNHKLVFSNVNMKEVNHMCSLNAEAYPDSLALATDSSVTIGTIDEIQ 708

Query: 651  KLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQ--------------------------- 683
            KLHIR++PL E P+RI +QE+S+TF + + +                             
Sbjct: 709  KLHIRTVPLQESPKRIAYQEESQTFGVLTARIDIQDSTGLNPARPSASTMAQSTTSSSSV 768

Query: 684  --------------------SCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCS 723
                                   +E E+H + ++D  TFE +  + L   EY  SI+S +
Sbjct: 769  GSLSMSKSGSSSLIGGNIIPDYGQEVEVHNLLIIDQHTFEVLHAHQLMQQEYAMSIISTN 828

Query: 724  -FSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGK 782
                D N YY VGTA V PEE+EP +GRIL+F   D KL  ++EKE KGA YSL  FNGK
Sbjct: 829  RLGGDMNEYYIVGTATVNPEESEPKQGRILIFQWNDNKLTQVSEKEIKGACYSLAEFNGK 888

Query: 783  LLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIY 842
            LLA+IN  ++L++W +     +EL+ EC H  +IL L+++T+GDFI++GDLM+S++LL Y
Sbjct: 889  LLASINSTVRLFEWTVE----KELRLECSHFNNILTLFLKTKGDFILLGDLMRSMTLLQY 944

Query: 843  KHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVV 902
            K  EG+ EE ARDYN NWM+AVEILDDDI+LGAEN+FN+F  +K+S   TDEER ++  V
Sbjct: 945  KTMEGSFEEIARDYNPNWMTAVEILDDDIFLGAENSFNIFVCQKDSAATTDEERSQMHEV 1004

Query: 903  GEYHLGEFVNRFRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQ 961
            G +H+G+ +N FRHGSLVM+ L ++       V+FGTV+G IG++  +  + Y FL +LQ
Sbjct: 1005 GRFHVGDMINVFRHGSLVMQNLGETSTPTTGCVLFGTVSGAIGLVTQITQDFYDFLLELQ 1064

Query: 962  TNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV- 1020
              L  VIK VG ++H QWR+FN + KT  ++ F+DGDLIESFLDLS  +M E++  + + 
Sbjct: 1065 NKLSTVIKSVGKIDHSQWRAFNTDIKTEPSEGFIDGDLIESFLDLSHDKMKEVADGLQIT 1124

Query: 1021 ---------SVEELCKRVEELTRLH 1036
                     +V++L K VE+LTR+H
Sbjct: 1125 GEGGMKQDCTVDDLVKMVEDLTRIH 1149


>gi|308808936|ref|XP_003081778.1| putative UV-damaged DNA binding factor (ISS) [Ostreococcus tauri]
 gi|116060244|emb|CAL56303.1| putative UV-damaged DNA binding factor (ISS) [Ostreococcus tauri]
          Length = 1282

 Score = 1069 bits (2765), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 564/1007 (56%), Positives = 711/1007 (70%), Gaps = 83/1007 (8%)

Query: 1    MLDVPIYGRIATLELF--RPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDR 58
            ++DVPIYGR+AT+ L   R  G ++  LFI TERY F  L +D ES EL T A GDV D 
Sbjct: 276  IMDVPIYGRVATMSLCGGRERG-SKGRLFITTERYGFTALSYDEESEELKTEAFGDVRDN 334

Query: 59   IGRPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLY 118
            IGRP +NGQIGI+D DCR IGL LYDGLFKVIP D KG++KEAFNIRLEEL+V DI+FL+
Sbjct: 335  IGRPAENGQIGIVDEDCRSIGLQLYDGLFKVIPCDEKGKVKEAFNIRLEELRVEDIQFLH 394

Query: 119  GCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVL 178
            G AKPTI VLY+D K+A H+KTYE+ +++K+FV  PW+QN+L+ G+  +IPVP P+ GV+
Sbjct: 395  GTAKPTIAVLYRDMKEAVHIKTYEIGVREKEFVSSPWAQNDLEGGSSKIIPVPAPVGGVV 454

Query: 179  IIGEETIVYCSANA------FKAI--PIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVI 230
            ++GEETIVY +  +       KAI  P R SI   YG +D DGSRYLLGDH G L+LLV+
Sbjct: 455  VLGEETIVYLNKTSDDTDVFLKAINIPERSSIV-CYGAIDPDGSRYLLGDHDGTLYLLVL 513

Query: 231  THEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKL-----NLQPDAKGS 285
             H+ ++V  LKIE LGETSI ST+SYLDN VV++GS+YGDSQLIKL     N+  D   S
Sbjct: 514  VHDGKRVNELKIERLGETSIPSTVSYLDNGVVFVGSAYGDSQLIKLHSEKTNVDKDGNLS 573

Query: 286  YVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQ 345
            Y+++LE + NLGPIVDF  VDLER GQGQVVTCSGAYKDGSLR+VRNGIGI+EQA ++L 
Sbjct: 574  YIQILEEFTNLGPIVDFAFVDLERHGQGQVVTCSGAYKDGSLRVVRNGIGIDEQAVIQLP 633

Query: 346  GIKGMWSLRSSTDDPFDTFLVVSFISETRILAM--NLEDELEETEIEGFCSQTQTLFCHD 403
            G+KG++SLR   +   D +LVV+FI+ETRIL    +  D L+ETEI GF ++ QTL C +
Sbjct: 634  GVKGLFSLRDDDESQVDKYLVVTFINETRILGFVGDEGDTLDETEISGFNAEAQTLCCGN 693

Query: 404  AIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI 463
             + +                                               GG L  L  
Sbjct: 694  MLES-----------------------------------------------GGKLHCLSA 706

Query: 464  GDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEH 523
              G L  V  A  E EI+CLD  P+GE  S S + AVG+W+ + + + S+ DL++I +E 
Sbjct: 707  MKGKLALVASATFEDEIACLDCTPMGEATS-SPVCAVGLWS-MEIVLASMSDLSVIKRES 764

Query: 524  LGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTF 583
             G +IIPRS LLC+FE I YLL  LGDG L+ +LL+ K+GEL+ RKK+SLGT+PITL+TF
Sbjct: 765  TGEDIIPRSTLLCSFESIPYLLVGLGDGQLITYLLDEKSGELSVRKKLSLGTKPITLQTF 824

Query: 584  SS--KNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGEL 641
             S   N   VFAASDRPTVI+S+NKKL+YSNVN++EV H+CPF+SAAFPD+LA+A E +L
Sbjct: 825  KSHATNVHSVFAASDRPTVIFSNNKKLIYSNVNVQEVLHVCPFSSAAFPDALALAGEEDL 884

Query: 642  TIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAIC-----SLKNQSCAEESEMHFVRL 696
            TIG IDDIQKLHIR+IPLG  PRRI HQ ++ TFA+      S  +Q C       FVRL
Sbjct: 885  TIGGIDDIQKLHIRTIPLGGQPRRIAHQPETNTFAVVVEHLWSKSSQDC-------FVRL 937

Query: 697  LDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIV 756
            +DD +FE +S + L+  E   S+ SC+F+ DS  YY VGT   L  E+EP++GRILVF V
Sbjct: 938  VDDGSFETLSQFQLEDQELTSSLTSCTFAGDSTTYYVVGTGIALETEDEPSRGRILVFKV 997

Query: 757  EDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHI 816
            +D +L L++EKE +GAVY+LNAF GKLLA IN K++L+KW  R+D   EL SEC HHG I
Sbjct: 998  DDDQLVLVSEKEVRGAVYNLNAFKGKLLAGINSKLELFKWTPREDEVHELVSECSHHGQI 1057

Query: 817  LALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDD-IYLGA 875
            +   V+TRGD+I+VGDLMKS+SLL+YK EEGAI+E ARD+NANWM+AV +LDDD  YLGA
Sbjct: 1058 VTFAVKTRGDWILVGDLMKSMSLLLYKPEEGAIDEVARDFNANWMTAVAMLDDDETYLGA 1117

Query: 876  ENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVI 935
            EN+ NLFTV +N    TDEER RLE+ GEYHLGE VN F  GSLVM L D +   +PT++
Sbjct: 1118 ENSLNLFTVSRNVNAVTDEERSRLEITGEYHLGELVNAFAPGSLVMSLRDGESLSVPTLL 1177

Query: 936  FGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSF 982
            FGT NGVIGV+ASLP + Y F E+LQ ++ K I+GVGGL H  WRSF
Sbjct: 1178 FGTANGVIGVLASLPKDVYEFTERLQASINKHIQGVGGLKHADWRSF 1224


>gi|321478515|gb|EFX89472.1| hypothetical protein DAPPUDRAFT_303245 [Daphnia pulex]
          Length = 1158

 Score = 1063 bits (2749), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 553/1114 (49%), Positives = 741/1114 (66%), Gaps = 89/1114 (7%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
            +V IYG+IA ++L++   + +D LFI T RY   +L+   + E+ E+ITR  G+V+D+IG
Sbjct: 54   EVGIYGKIAVMKLYKSGNDKKDSLFILTARYNAMILECSGEGENMEIITRTQGNVADKIG 113

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            +P++ G I IIDPDCR+IGL LYDGL K+IP +      +A+NIR+EEL + DI FLYGC
Sbjct: 114  KPSETGIIAIIDPDCRVIGLRLYDGLLKIIPLEKDSSEIKAYNIRIEELCIQDIAFLYGC 173

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
            A PT+V+++QD    RHVKT E++L+DK+F +  W Q+N++  A +L+PVP P  G LII
Sbjct: 174  ANPTVVIIHQD-AHGRHVKTREISLRDKEFAKTSWKQDNVETEAAMLLPVPEPYGGALII 232

Query: 181  GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKV 237
            G+E+I Y +   +  I    I+ S    YG+VD +GSRYLLGD AG L +LV+  E EK+
Sbjct: 233  GQESITYHNGQNYVTIAPPIIKQSTVTCYGKVDPNGSRYLLGDLAGHLFMLVLEKE-EKM 291

Query: 238  TG------LKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLE 291
             G      +KIELLGE SI   ++YLDN VV+IGS +GDSQL+KLN+ PD   SYV V+E
Sbjct: 292  DGTVTVRDIKIELLGEVSIPECLTYLDNGVVFIGSRFGDSQLVKLNVTPDDNNSYVTVME 351

Query: 292  RYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMW 351
             + NL PIVD  +VDL+RQGQGQ+VTCSGAYK+GSLRI+RNGIGI+EQAS++L GIKG+W
Sbjct: 352  TFTNLAPIVDMTIVDLDRQGQGQLVTCSGAYKEGSLRIIRNGIGIHEQASIDLPGIKGIW 411

Query: 352  SLR--SSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQL 409
            +L+  SS +   D  +V+SF+ +TR+L +N E E+EETEI G  +  QT FC +   + +
Sbjct: 412  ALKMGSSGNPSVDDTVVLSFVGQTRVLMLNGE-EMEETEIPGLTADQQTFFCGNVGKDSV 470

Query: 410  VQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILT 469
            +Q+T+GSVRL+S  +++  +EW  P G  +NV   N  QVL A  G  L YLE+ D    
Sbjct: 471  LQITTGSVRLISVNTKQKLSEWTPPDGKMLNVVACNHGQVLCA-AGRELYYLEMEDDTQV 529

Query: 470  EVK-HAQLEYEISCLDINPIG------ENPSYSQIAAVGMWTDISVRIFSLPDLNLITKE 522
             +K H  L+YE++CLD+ PI          S +++ AVG+WTDIS R+  LP L    KE
Sbjct: 530  VLKTHVTLDYEVACLDLTPISIGSQQNTTVSKAEVCAVGLWTDISARLLKLPTLEEFHKE 589

Query: 523  HLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRT 582
             LGGEIIPRS+L+  FEG  YLL ALGDG L  F +N  T  L DRKKV+LGTQP  LR 
Sbjct: 590  PLGGEIIPRSILMALFEGTPYLLVALGDGSLFYFSMNPVTKLLGDRKKVTLGTQPTVLRP 649

Query: 583  FSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELT 642
            F S++T +VFA SDRPTVIYSSN KL++SNVNLKEV+HMCP NS A+PDSL +A +  +T
Sbjct: 650  FRSQSTVNVFACSDRPTVIYSSNHKLVFSNVNLKEVAHMCPLNSEAYPDSLVLATDTAVT 709

Query: 643  IGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLK--------------------- 681
            IGTID+IQKLHIR++PLGE PRRI +QE ++TF + + +                     
Sbjct: 710  IGTIDEIQKLHIRTVPLGESPRRIAYQENTQTFGVITSRVDIVESSGLTPARQSASTQAQ 769

Query: 682  ------------------NQSC--------AEESEMHFVRLLDDQTFEFISTYPLDTFEY 715
                                SC         +E E++ + ++D  +F  +  +     EY
Sbjct: 770  SVSSSSSLGSAIKAPVSTRDSCGSGGGAEIGQEIEVYNLLIVDQHSFAVLHAHQFMQNEY 829

Query: 716  GCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYS 775
              SI+S    DD N YY VGTA V+PEE+EP +GRI++F   DGKL  +AEKE KGA YS
Sbjct: 830  ALSIVSTRLGDDVNPYYVVGTALVVPEESEPKQGRIVLFQWADGKLTTVAEKEVKGACYS 889

Query: 776  LNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMK 835
            L  FN K+LAAIN  ++LY+W       +EL+ EC +  HI+ALY++ +GDFI+VGDLM+
Sbjct: 890  LVDFNSKILAAINNVVRLYEWT----AEKELRLECSNFNHIIALYLKRKGDFILVGDLMR 945

Query: 836  SISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEE 895
            SI+LL YK  EG+ EE ARD N NWMSAVEILDDD +LGAEN+FNLF  +K+S   T+EE
Sbjct: 946  SITLLQYKTMEGSFEEMARDSNPNWMSAVEILDDDTFLGAENSFNLFVCQKDSAATTEEE 1005

Query: 896  RGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPT---VIFGTVNGVIGVIASLPHE 952
            R +L  VG +HLG+ VN FRHGSLVM    ++    PT   V+FGTV+G IGV+  LP E
Sbjct: 1006 RQQLTEVGRFHLGDMVNVFRHGSLVMDHA-AETLTTPTQGCVLFGTVHGAIGVVTQLPSE 1064

Query: 953  QYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMD 1012
             Y FL ++QT + +VIK VG + H  WRSF  E+K    + F+DGDLIESFLDLS  +M 
Sbjct: 1065 FYHFLSEVQTRMARVIKPVGKIEHSFWRSFATERKVEPCEGFIDGDLIESFLDLSSDKMK 1124

Query: 1013 EISKTMNV----------SVEELCKRVEELTRLH 1036
            E++  + +          +VE++ K +E+LTR+H
Sbjct: 1125 EVATGLQMDDGSGMKREATVEDIIKLIEDLTRIH 1158


>gi|443707495|gb|ELU03057.1| hypothetical protein CAPTEDRAFT_148808 [Capitella teleta]
          Length = 1084

 Score = 1048 bits (2711), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 540/1092 (49%), Positives = 723/1092 (66%), Gaps = 119/1092 (10%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
            ++ IYGRIA LELFRP GE++D LF+ T RY   +L+   D +  E+ITRA G+V DRIG
Sbjct: 54   EIGIYGRIAVLELFRPPGESKDLLFLLTARYNAMILECSQDGDHIEIITRAHGNVQDRIG 113

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RP++ G IGI+DP CR+IGL LYDGLFKVIP D   +  +AFNIRLEEL V+D++FL+GC
Sbjct: 114  RPSETGIIGIVDPLCRIIGLRLYDGLFKVIPLDRDMRELKAFNIRLEELNVIDVQFLHGC 173

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
            + PTI++    +K                               D  + V PP     II
Sbjct: 174  STPTIILSITYHK------------------------------GDNYLAVAPP-----II 198

Query: 181  GEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE----- 235
             + TI  C                  G+VDA+G RY+LGD AG L +L++  E+      
Sbjct: 199  KQSTITCC------------------GKVDANGCRYMLGDMAGRLFMLLLEKEERIDGSM 240

Query: 236  KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVN 295
             V  +K+ELLGE++IA  I+YLDN VVY+GS  GDSQL+KLN++ D  GSYV+V+E + N
Sbjct: 241  AVKDIKVELLGESAIAECITYLDNGVVYVGSRLGDSQLVKLNVEADDSGSYVQVMETFTN 300

Query: 296  LGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRS 355
            LGP+VD CVVDLERQGQGQ+VTCSG YK+GSLRI+RNGIGI+E A+++L GIKGMW LR 
Sbjct: 301  LGPVVDMCVVDLERQGQGQLVTCSGGYKEGSLRIIRNGIGIHEHATIDLPGIKGMWPLR- 359

Query: 356  STDDPF-DTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTS 414
              D P  D  +V+SF+ +TR+L ++ E E+EETE+ GF +  QT FC +  ++QL+Q+T+
Sbjct: 360  -VDSPTQDDMIVLSFVGQTRVLLLSGE-EVEETELGGFEADQQTFFCGNVAHSQLLQITA 417

Query: 415  GSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHA 474
             SVRLV   +++L +EW+ P G +++VA  N  QV+ A     +  L+I  G L      
Sbjct: 418  ASVRLVDPRTKKLLSEWRPPSGKNLSVAGCNHCQVVCAVSR-EVFCLDIQQGALILQGST 476

Query: 475  QLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVL 534
             +E+E++C+DI P+ +    SQ+ AVG+WTDIS RI  LP L  + +E LGGEIIPRS+L
Sbjct: 477  TMEHEVACVDITPLNDESGPSQLCAVGLWTDISARILQLPSLEQLHQEMLGGEIIPRSIL 536

Query: 535  LCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAA 594
            L  FEG++Y+LCALGDG L  F L   TG L+DRKKV+LGTQP  LRTF S  TT+VFA 
Sbjct: 537  LATFEGVNYVLCALGDGALFYFHLQPSTGVLSDRKKVTLGTQPTVLRTFKSPGTTNVFAC 596

Query: 595  SDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHI 654
            SDRPTV+YSSN KL++SNVNLKEV++MCP NS  +PDSLA+A +  LTIGTID+IQKLHI
Sbjct: 597  SDRPTVVYSSNHKLVFSNVNLKEVNYMCPLNSEGYPDSLALANDCTLTIGTIDEIQKLHI 656

Query: 655  RSIPLGEHPRRICHQEQSRTFAICSLKN-------------------------------- 682
            R++PLGE PRRI +QE S+TF + +L++                                
Sbjct: 657  RNVPLGETPRRIAYQEASQTFGVITLRSDLQDSNGSTPARPSASTQALSTSSSSNVKVMA 716

Query: 683  -------QSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVG 735
                    +  +E E+H + +LD  TFE + ++ L  +E+  +++S  F +D   YY VG
Sbjct: 717  ASNANTEHTFGDEVEVHSLLVLDQHTFEVLHSHQLMQYEFATALMSGRFGEDPTTYYVVG 776

Query: 736  TAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYK 795
            TA V PEE EP +GRI+VF   DGKL  +AEKE KGA Y+L  FNGKLLA+IN  ++L++
Sbjct: 777  TAMVYPEEAEPKQGRIIVFRFHDGKLTQVAEKEIKGAAYTLTEFNGKLLASINSTVRLFE 836

Query: 796  WMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARD 855
            W       +EL+ EC +  +I+ALY++T+GDFI+VGDLM+S++LL YK  EG  EE ARD
Sbjct: 837  WT----AEKELRVECSYFNNIIALYLKTKGDFILVGDLMRSVTLLSYKPMEGCFEEIARD 892

Query: 856  YNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFR 915
            YN NWM+++++LDDD +LGAEN+FN+FT +K+S   TDEER  L+ VG YHLGEFVN FR
Sbjct: 893  YNPNWMTSIDVLDDDTFLGAENSFNIFTCQKDSAATTDEERQHLQEVGLYHLGEFVNVFR 952

Query: 916  HGSLVMRLPDSDVGQIP-TVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGL 974
            HGSLVM+ P         +V+FGTVNG +G++  LP E YLFL ++Q  L K IK VG +
Sbjct: 953  HGSLVMQHPGECTSPTQGSVLFGTVNGALGLVTQLPQEFYLFLLEVQNKLAKTIKSVGKV 1012

Query: 975  NHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV----------SVEE 1024
             H  WRSF+ E+KT  A  F+DGDLIESFLDLSR +M E+ + + +          +V++
Sbjct: 1013 EHAFWRSFHTERKTEPATGFIDGDLIESFLDLSRDKMQEVVQGLQMDDGSGMKREAAVDD 1072

Query: 1025 LCKRVEELTRLH 1036
            L K +EELTR+H
Sbjct: 1073 LVKMIEELTRIH 1084


>gi|345498295|ref|XP_001607743.2| PREDICTED: DNA damage-binding protein 1-like [Nasonia vitripennis]
          Length = 1140

 Score = 1044 bits (2700), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 530/1093 (48%), Positives = 737/1093 (67%), Gaps = 68/1093 (6%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
            +V IYG+IA ++ FRP  E +D LF+ T RY   +L+   + E  E+IT+A G+V+DRIG
Sbjct: 57   EVGIYGKIAVVKFFRPQHEKKDLLFLLTTRYNAMILECVGEGEDIEIITKAHGNVADRIG 116

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            + ++ G   +IDP  R+IGL LYDGLFK+IP D      +A +IR++E  V D+ FL+GC
Sbjct: 117  KASETGIKAVIDPKARVIGLRLYDGLFKIIPLDKDNPELKASSIRMDEQNVQDVNFLHGC 176

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
              PT+++++QD  + RHVKT+E++L+DK+FV+ PW Q+N++  A ++IPVP P+CG +II
Sbjct: 177  TNPTLILIHQD-INGRHVKTHEISLRDKEFVKIPWRQDNVEREAMMVIPVPSPICGAIII 235

Query: 181  GEETIVYCSANAF-KAIP--IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEK- 236
            G+E+I+Y     +   +P  I+ S    Y +VD  G RYLLGD AG L +L +  +K+  
Sbjct: 236  GQESILYHDGTTYVTVVPPIIKQSTISCYAKVDNQGLRYLLGDLAGHLFMLFLEQDKKAD 295

Query: 237  ----VTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
                +  LK+ELLGE SI   I+YLDN V++IGS  GDSQLIKLN +PD  GSY   +E 
Sbjct: 296  GSMVIKDLKVELLGEVSIPECITYLDNGVIFIGSRLGDSQLIKLNTKPDENGSYCSTMET 355

Query: 293  YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
            + NL PIVD  VVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI E AS++L GIKGMW+
Sbjct: 356  FTNLAPIVDMAVVDLERQGQGQIVTCSGAFKEGSLRIIRNGIGIQEHASIDLPGIKGMWA 415

Query: 353  LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
            L+  + + FD  LV+SF+ +TRIL +N E E+EETEI GF +  QT    +   + ++Q+
Sbjct: 416  LKVDSVN-FDNTLVLSFVGQTRILMLNGE-EVEETEIPGFVADEQTFHTGNVTNDVIIQI 473

Query: 413  TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
            T  S RL+S+ S  + +EW+     +++V   N +QVL ATG   L YLEI D  +    
Sbjct: 474  TPTSARLISNKSSSVISEWEPDNKRTISVVACNGTQVLCATGND-LFYLEINDNQIVSKG 532

Query: 473  HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
            +  L++E++C+DI+P+ +  S ++I AVG+WTDISVRI +LP L  I KE LGGEIIPRS
Sbjct: 533  YTTLQHEVACVDISPL-DGSSEAKIVAVGLWTDISVRILTLPKLEEINKELLGGEIIPRS 591

Query: 533  VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
            +L+  FEG +YLLCALGDG +  F LN + G L D+KKV+LGTQP  LRTF S +TT+VF
Sbjct: 592  ILMTCFEGNTYLLCALGDGSMYYFTLNKQNGMLADKKKVTLGTQPTVLRTFRSLSTTNVF 651

Query: 593  AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
            A SDRPTVIYSSN KL++SNVNLKEV+HMC  N+ ++ DSLA+A +  +TIGTID+IQKL
Sbjct: 652  ACSDRPTVIYSSNHKLVFSNVNLKEVNHMCSLNAESYSDSLALATDNTVTIGTIDEIQKL 711

Query: 653  HIRSIPLGEHPRRICHQEQSRTFAICSLK-----------------NQSCA--------- 686
            HIR++PL E PRRI +QE ++TF + +++                  Q+ +         
Sbjct: 712  HIRTVPLYESPRRIAYQESTQTFGVITMRVDIQESNGVNIARPSASTQAASISNSNHIPT 771

Query: 687  ------------EESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCV 734
                        +E E+H + ++D  TFE +  + L   EY  S++S    +D   YY V
Sbjct: 772  HNKPSNTASEIGQEVEIHNLLIVDQHTFEVLHAHTLVPTEYAMSLISTKLGEDPTPYYIV 831

Query: 735  GTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLY 794
            GTA + P+E+EP  GRIL++   DGKL  +AEKE KG+ YSL  FNGKLLA+IN  ++L+
Sbjct: 832  GTAMINPDESEPKSGRILLYHWNDGKLTQVAEKEIKGSCYSLVEFNGKLLASINSTVRLF 891

Query: 795  KWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERAR 854
            +W       +EL+ EC H  +I+ALY++T+GDF++VGDLM+S++LL YK  EG+ EE AR
Sbjct: 892  EWT----AEKELRLECSHFNNIIALYLKTKGDFVLVGDLMRSVTLLQYKTMEGSFEEIAR 947

Query: 855  DYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRF 914
            DYN NWM+++EILDDD +LGAEN FNLF  +K+S   ++EER +++ VG++HLG+ VN F
Sbjct: 948  DYNPNWMTSIEILDDDTFLGAENCFNLFVCQKDSAATSEEERQQMQEVGQFHLGDMVNVF 1007

Query: 915  RHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGG 973
            RHGSLVM+ L +S       V+FGTV G IG++  +P   Y FL  L+  L  VIK VG 
Sbjct: 1008 RHGSLVMQHLGESSTPTHGCVLFGTVCGAIGLVTQIPSTFYEFLRNLEDRLTSVIKSVGK 1067

Query: 974  LNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV----------SVE 1023
            + H  WRSFN + K    + F+DGDLIESFLDLS  +M E++  + +          +V+
Sbjct: 1068 IEHNFWRSFNTDLKIEQCEGFIDGDLIESFLDLSHEKMAEVAMGIVIDDGSGMKKEATVD 1127

Query: 1024 ELCKRVEELTRLH 1036
            +L K VE+LTR+H
Sbjct: 1128 DLVKIVEDLTRIH 1140


>gi|340714589|ref|XP_003395809.1| PREDICTED: DNA damage-binding protein 1-like [Bombus terrestris]
          Length = 1141

 Score = 1043 bits (2698), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 533/1094 (48%), Positives = 736/1094 (67%), Gaps = 69/1094 (6%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
            +V IYG+IA ++ FRP  E +D LF+ T RY   +L+   + E+ E+IT+A G+V+DRIG
Sbjct: 57   EVGIYGKIAVVKFFRPPHEKKDLLFLLTTRYNAMILECIGEGENIEIITKAHGNVADRIG 116

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            + ++ G   +IDP  R+IGL LYDGLFK+IP D      +A +IR+EE QV D+ FL+GC
Sbjct: 117  KASETGIKAVIDPKARVIGLRLYDGLFKIIPLDKDNPELKASSIRMEEHQVQDVNFLHGC 176

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
            A PT+++++QD  + RHVKT+E++L+DK+FV+ PW Q+N++  A ++IPVP P+CG +II
Sbjct: 177  ANPTLILIHQD-INGRHVKTHEISLRDKEFVKVPWRQDNVEREAMIVIPVPSPICGAIII 235

Query: 181  GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEK- 236
            G+E+I+Y   N + A+    I+ S    Y +VD  G RYLLGD AG L +L +  EK+  
Sbjct: 236  GQESILYHDGNTYVAVVPPIIKQSTITCYAKVDNQGLRYLLGDMAGHLFMLFVEQEKKPD 295

Query: 237  ----VTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
                V  LK+ELLGE SI   I+YLDN V+++GS +GDSQL+KL  + D  GSY   +E 
Sbjct: 296  GTQVVKDLKVELLGEISIPECITYLDNGVIFVGSRFGDSQLVKLITKADENGSYCVPMET 355

Query: 293  YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
            + NL PI+D  VVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI E AS++L GIKGMW+
Sbjct: 356  FTNLAPIIDMAVVDLERQGQGQMVTCSGAFKEGSLRIIRNGIGIEEHASIDLPGIKGMWA 415

Query: 353  LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
            L+    + FD  LV+SF+ +TRIL +N E E+EET+I GF +  QT    +   +  +Q+
Sbjct: 416  LKVGGGN-FDNTLVLSFVGQTRILTLNGE-EVEETDIPGFVADEQTFHTGNVTNDLFIQI 473

Query: 413  TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
            T  S RL+S  ++ + +EW+     +++V   N +QVL ATG   L Y+EI  G +    
Sbjct: 474  TPTSARLISHETKTVVSEWEPENKRTISVVACNGTQVLCATGN-DLFYMEISYGQIVPKG 532

Query: 473  HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
             A L+YE++CLDI+P+ +  + ++IAAVG+WT ISVRI +LP L  I KE LGGEIIPRS
Sbjct: 533  FATLQYEVACLDISPL-DGHTEAKIAAVGLWTHISVRILTLPALEEINKELLGGEIIPRS 591

Query: 533  VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
            +L+  FEG +YLLCALGDG +  F L+ + G L+D+KKV+LGTQP  LRTF S  TT+VF
Sbjct: 592  ILMTCFEGNTYLLCALGDGSMYYFTLHKQNGILSDKKKVTLGTQPTVLRTFRSLFTTNVF 651

Query: 593  AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
            A SDRPTVIYSSN KL++SNVNLKEV+HMC  N+ ++PDSLA+A +  +TIGTID+IQKL
Sbjct: 652  ACSDRPTVIYSSNHKLVFSNVNLKEVNHMCSLNAESYPDSLALATDSTVTIGTIDEIQKL 711

Query: 653  HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
            HIR++PLGE PRRI +QE S+TF + +++                               
Sbjct: 712  HIRTVPLGESPRRIAYQESSQTFGVITMRVDIQDSSGVSIVRHSASTQAASTSSSSHIAS 771

Query: 682  --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
                         +E E+H + ++D  TFE +  + L   EY  S++S    +D   YY 
Sbjct: 772  YNKPTGHTASDICQEIEVHNLLIIDQHTFEVLHAHMLMPTEYALSLISTKLGEDPTSYYI 831

Query: 734  VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
            VGTA V P+E EP  GRIL++   DGKL  +AEKE KG+ YSL  FNGKLLA+IN  ++L
Sbjct: 832  VGTALVHPDETEPKMGRILLYHWSDGKLTQVAEKEIKGSCYSLTEFNGKLLASINSTVRL 891

Query: 794  YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
            ++W       +EL+ EC H  +I+ALY++T+GDFI+VGDLM+S++LL YK  EG  EE A
Sbjct: 892  FEWT----AEKELRLECSHFNNIIALYLKTKGDFILVGDLMRSLTLLQYKTMEGCFEEIA 947

Query: 854  RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
            RDYN NWM+A+EILDDD +LGAEN FNLF  +K+S   +++ER +++ VG++HLG+ VN 
Sbjct: 948  RDYNPNWMTAIEILDDDTFLGAENCFNLFVCQKDSAATSEDERQQMQEVGQFHLGDMVNV 1007

Query: 914  FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
            FRHGSLVM+ L +S       V+FGTV+G IG++  +P   Y FL  L+  L  VIK VG
Sbjct: 1008 FRHGSLVMQNLGESSTPTQGCVLFGTVSGAIGLVTQIPFTFYEFLRNLEERLTGVIKSVG 1067

Query: 973  GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV----------SV 1022
             + H  WRSFN E K    + F+DGDLIESFLDLS  +M +++  + +          +V
Sbjct: 1068 KIEHNFWRSFNTELKIEQCEGFIDGDLIESFLDLSPNKMADVASGLMIDDPSGMKKEATV 1127

Query: 1023 EELCKRVEELTRLH 1036
            ++L K VE+LTR+H
Sbjct: 1128 DDLVKIVEDLTRIH 1141


>gi|159489018|ref|XP_001702494.1| UV-damaged DNA binding complex subunit 1 protein [Chlamydomonas
            reinhardtii]
 gi|158280516|gb|EDP06273.1| UV-damaged DNA binding complex subunit 1 protein [Chlamydomonas
            reinhardtii]
          Length = 1147

 Score = 1043 bits (2698), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 530/1081 (49%), Positives = 728/1081 (67%), Gaps = 79/1081 (7%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            +LDVPIYG +++L+ FRP     D LFI TE+YKFCVLQ+D+   +L+TR+ GDV+DR G
Sbjct: 71   VLDVPIYGNVSSLQAFRPRDLQTDLLFILTEKYKFCVLQYDSAKGQLVTRSNGDVADRTG 130

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNK-GQLKEAFNIRLEELQVLDIKFLYG 119
            RPTDNGQ+G +DP CR+IGLHLYDG+ KVIP D + G L EA+N+RLEEL VLD+ +L  
Sbjct: 131  RPTDNGQLGFVDPACRVIGLHLYDGMLKVIPMDERTGALSEAYNVRLEELCVLDMAWLAP 190

Query: 120  CAK-----------------PTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDN 162
             +K                 P + VL+QD K ARHVKTYE AL  K+ VEGPW Q ++D 
Sbjct: 191  TSKGSAAGPSGSSAAAASGRPLLCVLHQDPKGARHVKTYEAALAAKELVEGPWQQQHVDA 250

Query: 163  GADLLIPVPPPLCGVLIIGEETIVYCSANAFK-------AIPIRPSITKAYGRVDADGSR 215
            GA +LIPVP PL GV+++GE  + YC             + P+R +I  A+  VD DGSR
Sbjct: 251  GAGILIPVPAPLGGVVVVGENVLSYCGGPGAGPGGGAPVSAPLRQTIITAWCPVDPDGSR 310

Query: 216  YLLGDHAGLLHLLVITHEKE-KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLI 274
            YLLGD  G LHLLV+ H+   +V+GL++E LG TS  S +SYLD+ + ++GS  GD QL+
Sbjct: 311  YLLGDRLGGLHLLVLAHDGAGRVSGLRVEPLGHTSTPSCLSYLDSGLTFVGSRSGDCQLV 370

Query: 275  KLNLQP-DAKGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNG 333
            +++  P +   SYVE+++ + NLGPI+DF V+DLERQGQGQ+VTCSG   DGSLR++RNG
Sbjct: 371  RISPTPVNQPPSYVELVDSFPNLGPILDFVVMDLERQGQGQLVTCSGVDGDGSLRVIRNG 430

Query: 334  IGINEQASVELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFC 393
            IGIN QA+VEL GIKG+WSLR+   D  D +LV++F+ ETR+LA+N E+EL+E EI GF 
Sbjct: 431  IGINRQATVELAGIKGVWSLRAHYGDEHDAYLVLTFVGETRLLALNAEEELDEAEIPGFD 490

Query: 394  SQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLAT 453
            +  QTL+C +   + L+QVTS SVRLV S+S  L +EW+ P G+S+NVA  + +QV++AT
Sbjct: 491  AAAQTLWCGNTACDHLLQVTSSSVRLVDSSSLALVSEWRPPAGFSINVAAGSPTQVVVAT 550

Query: 454  GGGHLVYLEI----GDGILTEVKHAQLEYEISCLDINPI------GENPSYSQIAAVGMW 503
            GGGHLVYLE+    G   + EV +  L+ E++C+D++P+      G +   S + AVG W
Sbjct: 551  GGGHLVYLEVQAQAGGAAVVEVANVTLDSEVACVDVSPLLLTAGPGGSAERSGLVAVGRW 610

Query: 504  TDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTG 563
             D ++++ ++P L  ++   LGGE+IPRSVL    EG+ Y +  LGDG L  + L   TG
Sbjct: 611  -DQTLQLLAVPGLTPLSVTPLGGEVIPRSVLCVGLEGVPYCMVGLGDGALHTWRLEPATG 669

Query: 564  ELTDRKKVSLGTQPITLRTFSS-------------------KNTTHVFAASDRPTVIYSS 604
             LTDRK+V LGT+PITLRTF +                        VFAASDRPTV+YSS
Sbjct: 670  ALTDRKRVVLGTKPITLRTFRTAAAAAAAAGGDNGGARGGGGGGVSVFAASDRPTVVYSS 729

Query: 605  NKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPR 664
            N+KL+YSN+N  +V+H+  F++AAFP SLA+A E  LTIGT D +QKLH+R++ LGE PR
Sbjct: 730  NRKLMYSNLNENDVAHLSSFHAAAFPHSLAVASEAALTIGTADQLQKLHVRTVVLGEQPR 789

Query: 665  RICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSF 724
            RI H E  R   + + ++    E  ++   RLLDD TF+ +++Y L   E  C   + + 
Sbjct: 790  RIAHHEAGRLLGVLTQRDPDGEERGQL---RLLDDTTFDVVASYGLAPGEMPCGGSASAA 846

Query: 725  SDDSNVYYCVGTAYVLPEENEPTKGRILVF----------IVEDGKLQLIAEKETKGAVY 774
            S      + VGT  + P+E EP++GRILV               G ++L+ EKE KGA Y
Sbjct: 847  SAAGPATFIVGTVLLRPDEPEPSRGRILVLEYLPGGAGGAAAGGGAVRLVTEKEVKGAAY 906

Query: 775  SLNAFNG-KLLAAINQKIQLYKWMLRDDGTR-------ELQSECGHHGHILALYVQTRGD 826
            ++  F G K+LA++N K+ +Y+W++R+           EL SEC H G++LALY+  RG 
Sbjct: 907  NVRPFAGDKILASVNNKVTVYRWVVREGSGGPGGCGAYELASECHHLGNVLALYLAARGG 966

Query: 827  FIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRK 886
             +VVGDLM+S+SLL Y  E+G +E RA DYN+ W +AVEILDDD Y+ A+N+ NL+ VR+
Sbjct: 967  LVVVGDLMRSVSLLSYNAEQGVLEHRAADYNSGWTTAVEILDDDNYIAADNHCNLYVVRR 1026

Query: 887  NSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSD-VGQIPTVIFGTVNGVIGV 945
            N++ ATDEER RL+VVGE+H G F+N+ R+GSLVMRLPDS+  G  P ++F   +G +GV
Sbjct: 1027 NADSATDEERARLQVVGEFHTGTFINQMRNGSLVMRLPDSEHAGLPPPLLFAGTDGRLGV 1086

Query: 946  IASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLD 1005
            +A LP   Y +  KLQT +R V++GVGGL+HEQWR+F N+++  +A+ F+DGDLIES LD
Sbjct: 1087 VARLPPALYEWATKLQTAMRSVVRGVGGLDHEQWRAFANDRRCGEARGFVDGDLIESLLD 1146

Query: 1006 L 1006
            L
Sbjct: 1147 L 1147


>gi|302837243|ref|XP_002950181.1| UV-damaged DNA binding complex subunit 1 protein [Volvox carteri f.
            nagariensis]
 gi|300264654|gb|EFJ48849.1| UV-damaged DNA binding complex subunit 1 protein [Volvox carteri f.
            nagariensis]
          Length = 1104

 Score = 1043 bits (2698), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 535/1064 (50%), Positives = 739/1064 (69%), Gaps = 67/1064 (6%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            +LDVPIYG I+ L++FRP     D LFI TE+YKFCVLQ+DA   +L+TRA GDV+DRIG
Sbjct: 50   VLDVPIYGNISALQVFRPRDLHADLLFILTEKYKFCVLQYDAAKGQLLTRANGDVADRIG 109

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNK-GQLKEAFNIRLEELQVLDIKFLYG 119
            RP +NGQ+G++DP CRLIGLHLYDG+ KVIP D++ GQL EAF++RLEEL VLD+ +L+ 
Sbjct: 110  RPAENGQLGVVDPACRLIGLHLYDGMLKVIPMDDRSGQLSEAFSVRLEELSVLDMAWLHP 169

Query: 120  CAKPT-----------IVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLI 168
                            + VL+QD K ARHVKTYE+ L   D ++GPW Q ++D+GA LLI
Sbjct: 170  QPPGGGGGGGSPGRPLLCVLHQDPKGARHVKTYEMQLGANDLLDGPWQQQHVDSGAALLI 229

Query: 169  PVPPPLCGVLIIGEETIVYCSANAFKA---IPIRPSITKAYGRVDADGSRYLLGDHAGLL 225
            PVP PL GV+++GE  + Y      +A    P+R +I  A+  VD DGSR+LLG+  G +
Sbjct: 230  PVPSPLGGVVVVGENVVSYLGGPGGQAPVSAPLRQTIVTAWCPVDPDGSRFLLGNRQGGM 289

Query: 226  HLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQP-DAKG 284
             LLV+ H+  +V+GL+ E LG T   S ++YLD+ + ++GS  GDSQL++++ QP +   
Sbjct: 290  QLLVLAHDGSRVSGLRTEPLGYTCAPSCLAYLDSGLTFVGSRSGDSQLVRISAQPVNQPP 349

Query: 285  SYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVEL 344
            +Y+E+++ + +L PIVDF V+DLERQGQGQ+V CSG   DGSLR+VRNGIGIN QA+VEL
Sbjct: 350  TYLELVDSFPSLAPIVDFVVMDLERQGQGQLVMCSGIDSDGSLRVVRNGIGINRQATVEL 409

Query: 345  QGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDA 404
             GIKG+WSLRS  DD +D +L+++F+ ETR+LA+N E+EL+E E+ GF S +QTL+C + 
Sbjct: 410  PGIKGVWSLRSHYDDEYDKYLLLTFVGETRLLALNTEEELDEAELPGFDSGSQTLWCGNM 469

Query: 405  IYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI- 463
              + L+Q +  SVRLV + S +L +EW+  PG++++VA  + +QV++ATGGGHLVYLE+ 
Sbjct: 470  ATDHLLQASGSSVRLVDTASLQLVSEWRPAPGFAIHVAAGSPTQVVVATGGGHLVYLEVV 529

Query: 464  --GDGI--LTEVKHAQLEYEISCLDINPI--------GENP-SYSQIAAVGMWTDISVRI 510
               +G+  + E+ +  L+ E++C+D++P+        GE P   S + AVG W D ++++
Sbjct: 530  RRPEGVVEVVEISNVVLDSEVACVDVSPLMLQQQPLAGEWPGGRSSVVAVGRW-DQTMQL 588

Query: 511  FSLPDLNLITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKK 570
             S+P L  ++   LGGE+IPRSVL  A EG+ Y +  LGDG L  + L+  TG L+D+K+
Sbjct: 589  LSVPSLAPLSSTPLGGEVIPRSVLCTALEGVPYCMVGLGDGALHTWRLDPATGGLSDKKR 648

Query: 571  VSLGTQPITLRTFSSKNT------------------THVFAASDRPTVIYSSNKKLLYSN 612
            + LGT+PI LRTF +                       VFAASDRPTV+YSSNKKLLYSN
Sbjct: 649  LVLGTKPIMLRTFRTAAAVSGGGGGSLRGSGGGHGGVSVFAASDRPTVVYSSNKKLLYSN 708

Query: 613  VNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQS 672
            +N  +V+ +  F+SAAFP SLA+A EG LTIGT D+IQKLH+R++PLGE+PRRI H E +
Sbjct: 709  LNENDVAFLASFHSAAFPRSLAVASEGALTIGTADEIQKLHVRAVPLGENPRRIAHHEGA 768

Query: 673  RTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDS---- 728
            R   + +++  S  + SE  F+RLLDD TF+ +++Y L   E  CS+ +   S +     
Sbjct: 769  RMLGVLTMRLDS--DGSERSFLRLLDDTTFDVVASYALAPGEMPCSLAAWPGSSNGTAAV 826

Query: 729  ---NVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAF-NGKLL 784
               N  + VGTA+++PEE EPTKGRILV       ++L+ EKE KGA Y++  F   K+L
Sbjct: 827  GALNACFLVGTAFIVPEEPEPTKGRILVL----EHVRLVTEKEVKGAAYNVLPFVKDKIL 882

Query: 785  AAINQKIQLYKWMLRDDGTR-ELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYK 843
            A++N K+      L   G R EL SEC + G+ILALY+ TRG+ +VVGDLM+S+SLL Y 
Sbjct: 883  ASVNSKVPASGCDL--GGVRVELASECSYLGNILALYLATRGNLVVVGDLMRSVSLLSYN 940

Query: 844  HEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVG 903
             E+G +E RA DYN+ W ++VE LDDD YL  +N+ NL  +R+N++ ATDEER RL+VVG
Sbjct: 941  VEQGVLEHRAADYNSGWTTSVEALDDDTYLEGDNHLNLVVLRRNADSATDEERARLQVVG 1000

Query: 904  EYHLGEFVNRFRHGSLVMRLPDSDVGQIPT-VIFGTVNGVIGVIASLPHEQYLFLEKLQT 962
            EYH G FVNRFRHGSLVMR PDS+   +P  ++FG  +G +GVIA LP   Y  L KLQ+
Sbjct: 1001 EYHTGTFVNRFRHGSLVMRPPDSEFVSLPVPLLFGGTDGRLGVIARLPPGLYEMLTKLQS 1060

Query: 963  NLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDL 1006
             LR+V++GVGGL+HE W +F+NE++T DAK F+DGDLIE+FLDL
Sbjct: 1061 ALRQVVRGVGGLSHEAWIAFSNERRTADAKGFVDGDLIETFLDL 1104


>gi|58383228|ref|XP_312466.2| AGAP002472-PA [Anopheles gambiae str. PEST]
 gi|55242305|gb|EAA08181.2| AGAP002472-PA [Anopheles gambiae str. PEST]
          Length = 1138

 Score = 1043 bits (2696), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 519/1092 (47%), Positives = 729/1092 (66%), Gaps = 65/1092 (5%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESS--ELITRAMGDVSDRIG 60
            ++ I G+IA ++LFRP  E +D +FI T RY   +LQ   +    E+IT+A G+V+DR+G
Sbjct: 54   EIGINGKIAVMKLFRPAEETKDLIFILTHRYNAMILQCAVQGDDIEIITKAHGNVADRVG 113

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            +P + G + +IDP  R+IG+ LY+GLFK+IP D      +A ++R+EE+ V D++FLYG 
Sbjct: 114  KPAETGILAVIDPKARVIGMRLYEGLFKIIPLDKDTNELKATSLRMEEMHVQDVEFLYGT 173

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
              PT++V++QD  + RH+KT+E++LKDK+F +  W Q+N++  A +LI VP PL G ++I
Sbjct: 174  THPTLIVIHQD-INGRHIKTHEISLKDKEFTKIAWKQDNVETEATMLIAVPMPLGGAIVI 232

Query: 181  GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVI-THEKEK 236
            G+E+IVY   +++ A+    I+ S    Y R+D+ G RYLLG+ AG L ++ + T E  K
Sbjct: 233  GQESIVYHDGDSYVAVAPAIIKQSTINCYARIDSKGLRYLLGNMAGNLFMMFLETEENAK 292

Query: 237  ----VTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
                V  +K+ELLGE +I   I+YLDN V++IGS +GDSQL+KLN      G+YV ++E 
Sbjct: 293  GQTTVRDIKVELLGEITIPECITYLDNGVLFIGSRHGDSQLVKLNTTAGDNGAYVMLMET 352

Query: 293  YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
            + NL PIVD CVVDLERQGQGQ++TCSG++K+GSLRI+RNGIGI E A ++L GIKGMW+
Sbjct: 353  FTNLAPIVDMCVVDLERQGQGQMITCSGSFKEGSLRIIRNGIGIQEHACIDLPGIKGMWA 412

Query: 353  LRSSTDD-PFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQ 411
            LR   DD P+D  L++SF+  TR+L ++  DE+EETEI G     QT +C +  + Q++Q
Sbjct: 413  LRVGIDDSPYDNTLILSFVGHTRVLMLS-GDEVEETEIAGILGDQQTFYCANVSHGQILQ 471

Query: 412  VTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEV 471
            VT  S RL+S  ++ +  EWK P    + V  AN +Q++ A+    + Y+EIGDG L   
Sbjct: 472  VTPSSARLISCDNKAMICEWKPPDNKRIGVVGANTTQIVCASAQ-DVYYVEIGDGKLEYK 530

Query: 472  KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPR 531
                L YE++CLD++P+ +N + S+  AVG+WTDIS  +  +P L L+ KE LGGEIIPR
Sbjct: 531  GQTTLSYEVACLDVSPLEDNATRSEFVAVGLWTDISACVLKVPTLELLHKEKLGGEIIPR 590

Query: 532  SVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHV 591
            S+L+ AFEGI+YLLCALGDG +  F+LN  TG LT++KKV+LGTQP  L+TF S +TT+V
Sbjct: 591  SILMAAFEGINYLLCALGDGSMYYFVLNKATGCLTEQKKVTLGTQPTILKTFRSLSTTNV 650

Query: 592  FAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQK 651
            FA SDRPTVIYSSN KL++SNVNLKEV+HMC  N+ A+ DSLA+A    +  GTID+IQK
Sbjct: 651  FACSDRPTVIYSSNHKLVFSNVNLKEVNHMCSLNAEAYQDSLALATRNSVIFGTIDEIQK 710

Query: 652  LHIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------ 681
            LHIR++PLGE PRRI +QE S+TF + + +                              
Sbjct: 711  LHIRTVPLGESPRRIAYQEASQTFGVITFRMDVQDSSGLTPARQSASTQTNNITQSSGMG 770

Query: 682  -------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCV 734
                   N    +E E+H + ++D  TFE +  +     EY  S++S    +D N Y+ V
Sbjct: 771  LLKPGASNTEFGQEVEVHNLLIIDQNTFEVLHAHQFMQTEYALSLMSAKLGNDPNTYFIV 830

Query: 735  GTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLY 794
            GT  V PEE EP  GRI+++   D +L+++++KE KGA YSL  FNG++LA IN  ++LY
Sbjct: 831  GTGLVNPEEPEPKTGRIIIYRYADNELKMVSDKEVKGACYSLVEFNGRVLACINSTVRLY 890

Query: 795  KWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERAR 854
            +W   DD  ++L+ EC H  ++LALY +T+GDFI+VGDLM+SI+LL YK  EG+ EE AR
Sbjct: 891  EWT--DD--KDLRLECSHFNNVLALYCKTKGDFILVGDLMRSITLLQYKQMEGSFEEIAR 946

Query: 855  DYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRF 914
            DY  NWM+AVEILDDD +LGA+N+ NLF   K+S   TDEER ++  V ++HLG+ VN F
Sbjct: 947  DYQPNWMTAVEILDDDAFLGADNSNNLFVCLKDSAATTDEERQQMPEVAQFHLGDMVNVF 1006

Query: 915  RHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGG 973
            RHGSLVM+ + +        V+FGTV+G IG++  +  + Y FL KLQ NL   IK VG 
Sbjct: 1007 RHGSLVMQNISERSTPTTGCVLFGTVSGAIGLVTQIQSDFYEFLRKLQENLTNTIKSVGK 1066

Query: 974  LNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVE---------E 1024
            ++H  WRSF+ E K    + F+DGDL+ESFLDLSR +M E S  + + VE         +
Sbjct: 1067 IDHSYWRSFHTETKMERCEGFIDGDLVESFLDLSREKMREASLGLEIDVEGTKREATVDD 1126

Query: 1025 LCKRVEELTRLH 1036
            + K VE+LTR+H
Sbjct: 1127 IIKIVEDLTRIH 1138


>gi|350410909|ref|XP_003489174.1| PREDICTED: DNA damage-binding protein 1-like [Bombus impatiens]
          Length = 1141

 Score = 1042 bits (2694), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 534/1094 (48%), Positives = 735/1094 (67%), Gaps = 69/1094 (6%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
            +V IYG+IA ++ FRP  E +D LF+ T RY   +L+   + E+ E+IT+A G+V+DRIG
Sbjct: 57   EVGIYGKIAVVKFFRPPHEKKDLLFLLTTRYNAMILECIGEGENIEIITKAHGNVADRIG 116

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            + ++ G   +IDP  R+IGL LYDGLFK+IP D      +A +IR+EE QV D+ FL+GC
Sbjct: 117  KASETGIKAVIDPKARVIGLRLYDGLFKIIPLDKDNPELKASSIRMEEHQVQDVNFLHGC 176

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
            A PT+++++QD  + RHVKT+E++L+DK+FV+ PW Q+N++  A ++IPVP P+CG +II
Sbjct: 177  ANPTLILIHQD-INGRHVKTHEISLRDKEFVKVPWRQDNVEREAMIVIPVPSPICGAIII 235

Query: 181  GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEK- 236
            G+E+I+Y   N + A+    I+ S    Y +VD  G RYLLGD AG L +L +  EK+  
Sbjct: 236  GQESILYHDGNTYVAVVPPIIKQSTITCYAKVDNQGLRYLLGDMAGHLFMLFVEQEKKPD 295

Query: 237  ----VTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
                V  LK+ELLGE SI   I+YLDN V+++GS  GDSQL+KL  + D  GSY   +E 
Sbjct: 296  GTQVVKDLKVELLGEISIPECITYLDNGVIFVGSRLGDSQLVKLITKADENGSYCVPMET 355

Query: 293  YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
            + NL PIVD  VVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI E AS++L GIKGMW+
Sbjct: 356  FTNLAPIVDMAVVDLERQGQGQMVTCSGAFKEGSLRIIRNGIGIEEHASIDLPGIKGMWA 415

Query: 353  LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
            L+    + FD  LV+SF+ +TRIL +N E E+EET+I GF +  QT    +   +  +Q+
Sbjct: 416  LKVGGGN-FDNTLVLSFVGQTRILTLNGE-EVEETDIPGFVADEQTFHTGNVTNDLFIQI 473

Query: 413  TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
            T  S RL+S  ++ + +EW+     +++V   N +QVL ATG   L Y+EI  G +    
Sbjct: 474  TPTSARLISHETKTVVSEWEPENKRTISVVACNGTQVLCATGN-DLFYMEISCGQIVPKG 532

Query: 473  HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
             A L+YE++CLDI+P+ +  + ++IAAVG+WT ISVRI +LP L  I KE LGGEIIPRS
Sbjct: 533  FATLQYEVACLDISPL-DGHTEAKIAAVGLWTHISVRILTLPALEEINKELLGGEIIPRS 591

Query: 533  VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
            +L+  FEG +YLLCALGDG +  F L+ + G L+D+KKV+LGTQP  LRTF S  TT+VF
Sbjct: 592  ILMTCFEGNTYLLCALGDGSMYYFTLHKQNGILSDKKKVTLGTQPTVLRTFRSLFTTNVF 651

Query: 593  AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
            A SDRPTVIYSSN KL++SNVNLKEV+HMC  N+ ++PDSLA+A +  +TIGTID+IQKL
Sbjct: 652  ACSDRPTVIYSSNHKLVFSNVNLKEVNHMCSLNAESYPDSLALATDSTVTIGTIDEIQKL 711

Query: 653  HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
            HIR++PLGE PRRI +QE S+TF + +++                               
Sbjct: 712  HIRTVPLGESPRRIAYQESSQTFGVITMRVDIQDSSGVSIVRHSASTQAASTSSSSHIAS 771

Query: 682  --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
                         +E E+H + ++D  TFE +  + L   EY  S++S    +D   YY 
Sbjct: 772  YNKPTGHTASDICQEIEVHNLLIIDQHTFEVLHAHMLMPTEYALSLISTKLGEDPTSYYI 831

Query: 734  VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
            VGTA V P+E EP  GRIL++   DGKL  +AEKE KG+ YSL  FNGKLLA+IN  ++L
Sbjct: 832  VGTALVHPDETEPKMGRILLYHWSDGKLTQVAEKEIKGSCYSLTEFNGKLLASINSTVRL 891

Query: 794  YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
            ++W       +EL+ EC H  +I+ALY++T+GDFI+VGDLM+S++LL YK  EG  EE A
Sbjct: 892  FEWT----AEKELRLECSHFNNIIALYLKTKGDFILVGDLMRSLTLLQYKTMEGCFEEIA 947

Query: 854  RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
            RDYN NWM+A+EILDDD +LGAEN FNLF  +K+S   +++ER +++ VG++HLG+ VN 
Sbjct: 948  RDYNPNWMTAIEILDDDTFLGAENCFNLFVCQKDSAATSEDERQQMQEVGQFHLGDMVNV 1007

Query: 914  FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
            FRHGSLVM+ L +S       V+FGTV+G IG++  +P   Y FL  L+  L  VIK VG
Sbjct: 1008 FRHGSLVMQNLGESSTPTQGCVLFGTVSGAIGLVTQIPFTFYEFLRNLEERLTGVIKSVG 1067

Query: 973  GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV----------SV 1022
             + H  WRSFN E K    + F+DGDLIESFLDLS  +M +++  + +          +V
Sbjct: 1068 KIEHNFWRSFNTELKIEQCEGFIDGDLIESFLDLSPNKMADVASGLMIDDPSGMKKEATV 1127

Query: 1023 EELCKRVEELTRLH 1036
            ++L K VE+LTR+H
Sbjct: 1128 DDLVKIVEDLTRIH 1141


>gi|170057515|ref|XP_001864517.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167876915|gb|EDS40298.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 1138

 Score = 1042 bits (2694), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 524/1096 (47%), Positives = 733/1096 (66%), Gaps = 73/1096 (6%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVL--QWDAESSELITRAMGDVSDRIG 60
            +V I G+IA ++LFRP    +D LFI T RY   +L  Q   +  E+IT+A G+V+DR+G
Sbjct: 54   EVGINGKIAVMKLFRPAEAQKDLLFILTHRYNAMILACQVQGDDIEIITKAHGNVADRVG 113

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            +P + G + +IDP  R+IG+ LY+GLFK+IP D      +A ++R+EE+ V D++FLYG 
Sbjct: 114  KPAETGILAVIDPKARVIGMRLYEGLFKIIPLDRDTHELKATSLRMEEMHVQDVEFLYGT 173

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
            A PT++V++QD  + RH+KT+E+ LKDKDF +  W Q+N++  A +LIPVP PL G ++I
Sbjct: 174  AHPTLIVIHQD-LNGRHIKTHEINLKDKDFTKIAWKQDNVETEATMLIPVPTPLGGAIVI 232

Query: 181  GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEK- 236
            G+E++VY   +++ A+    I+ S    Y RVD+ G RYLLG+  G L ++ +  E+   
Sbjct: 233  GQESVVYHDGDSYVAVAPAIIKQSTINCYARVDSRGFRYLLGNMIGHLFMMFLETEENTR 292

Query: 237  ----VTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
                V  +K+ELLGE +I   I+YLDN V++IGS +GDSQL+KLN    A G+YV V+E 
Sbjct: 293  GQLTVKDIKVELLGEITIPECITYLDNGVLFIGSRHGDSQLVKLNTTAAASGAYVTVMET 352

Query: 293  YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
            + NL PI+D C+VDLERQGQGQ++TCSG+YK+GSLRI+RNGIGI E A ++L GIKGMW+
Sbjct: 353  FTNLAPIIDMCIVDLERQGQGQMITCSGSYKEGSLRIIRNGIGIQEHACIDLPGIKGMWA 412

Query: 353  LRSSTDD-PFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQ 411
            LR   DD P+D  LV+SF+  TRIL ++ E E+EETEI GF S  QT +C +  + Q++Q
Sbjct: 413  LRVGIDDSPYDNTLVLSFVGHTRILMLSGE-EVEETEIPGFLSDQQTFYCANVDFGQIIQ 471

Query: 412  VTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEV 471
            VT  + RL+S  ++ +  EWK P    + V   N+ Q++ AT    + Y+EI  G L   
Sbjct: 472  VTPMTARLISCDNKSMICEWKPPDDKRIGVVACNSCQMVCATARD-IYYIEIEAGKLVHK 530

Query: 472  KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPR 531
                L+YE++CLDI+P+ E  + S++ AVG+WT+IS  I  LP+L  + KE LGGEIIPR
Sbjct: 531  STVTLDYEVACLDISPLDEPATRSELVAVGLWTEISACILRLPNLEFVHKEKLGGEIIPR 590

Query: 532  SVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHV 591
            S+L+  FEGI YLLCALGDG +  F+++  T  LTD+KKV+LGTQP  L+TF S +TT+V
Sbjct: 591  SILMACFEGIIYLLCALGDGSMFYFVVDKTTHRLTDQKKVTLGTQPTILKTFRSLSTTNV 650

Query: 592  FAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQK 651
            FA SDRPTVIYSSN KL++SNVNLKEV+HMC  N+ ++ DSLA+A +  + +GTID+IQK
Sbjct: 651  FACSDRPTVIYSSNHKLVFSNVNLKEVNHMCSLNAESYQDSLALATKNSVILGTIDEIQK 710

Query: 652  LHIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------ 681
            LHIR++PLGE PRRI +QE S+TF + +++                              
Sbjct: 711  LHIRTVPLGESPRRIAYQEASQTFGVITVRMDIQDSSGLTPSRQSASTQTSNVTSSSNMG 770

Query: 682  -------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCV 734
                   N    +E E+H + ++D  TFE +  +     EY  S++S    +D   YY V
Sbjct: 771  LLKPGASNTEFGQEVEVHNLLIIDQNTFEVLHAHQFMQTEYVLSLISAKLGNDPATYYIV 830

Query: 735  GTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLY 794
            GTA V PEE EP  GRI+++   DG L  ++EKE KGA YSL  FNG++LA IN  ++LY
Sbjct: 831  GTAMVNPEEREPKVGRIIIYHYADGALTQVSEKEIKGACYSLVEFNGRVLATINSTVRLY 890

Query: 795  KWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERAR 854
            +W   DD  ++L+ EC H  ++LALY +T+GDFI+VGDLM+SI+LL YK  EG+ EE AR
Sbjct: 891  EWT--DD--KDLRLECSHFNNVLALYCKTKGDFILVGDLMRSITLLQYKQMEGSFEEIAR 946

Query: 855  DYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRF 914
            DY   WM+AVEILDDD +LGAEN+ NLF   K+S   TD+ER ++  V ++HLG+ VN F
Sbjct: 947  DYQPKWMTAVEILDDDAFLGAENSNNLFVCLKDSAATTDDERQQMPEVAQFHLGDMVNVF 1006

Query: 915  RHGSLVMRLPDSDVGQ--IPT---VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIK 969
            RHGSLVM+    ++G+   PT   V+FGTV+G IG++  +P + Y FL KLQ NL   IK
Sbjct: 1007 RHGSLVMQ----NIGERTTPTSGCVLFGTVSGAIGLVTQIPPDYYEFLRKLQENLTNTIK 1062

Query: 970  GVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVE------ 1023
             VG ++H  WRSF+ E KT +++ F+DGDL+ESFLDL+R +M E +  + + VE      
Sbjct: 1063 SVGRIDHTYWRSFHTEMKTENSEGFIDGDLVESFLDLTREKMHEAALGLQIDVEGTKKEA 1122

Query: 1024 ---ELCKRVEELTRLH 1036
               ++ K VE+LTR+H
Sbjct: 1123 NVDDIIKIVEDLTRIH 1138


>gi|198432471|ref|XP_002129229.1| PREDICTED: similar to DNA damage-binding protein 1 (Damage-specific
            DNA-binding protein 1) (UV-damaged DNA-binding factor)
            (DDB p127 subunit) (DNA damage-binding protein a) (DDBa)
            (UV-damaged DNA-binding protein 1) (UV-DDB 1) (Xeroderma
            pigmentosum group E-co... isoform 2 [Ciona intestinalis]
          Length = 1142

 Score = 1041 bits (2693), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 550/1100 (50%), Positives = 724/1100 (65%), Gaps = 77/1100 (7%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESS--ELITRAMGDVSDRIG 60
            ++ IYGRIA L+ FRP GE +D LFI TERY  C+LQ+ A     E+IT+A GD+SD +G
Sbjct: 54   EINIYGRIAVLKFFRPEGEDRDLLFILTERYHGCILQYKATDGGCEIITKASGDLSDTVG 113

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RP + G IGIIDP  +LIGL LY+G+FK +P+D   +    FNIR+EEL V+D KFL+G 
Sbjct: 114  RPPETGIIGIIDPTSKLIGLRLYEGVFKFLPYDPTSEELRPFNIRIEELSVIDAKFLHGY 173

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
              PT+V++YQ N   RHVKTY V ++DK+ V GPW Q N+D  A+ +I VP PL G +II
Sbjct: 174  TTPTLVIIYQ-NSQGRHVKTYIVDVRDKEVVAGPWKQENIDAEANFIINVPKPLAGSIII 232

Query: 181  GEETIVYCSANAFKAIPIRP-----SITKAYGRVDADGSRYLLGDHAGLLHLLVITHEK- 234
            G+E+I Y   N  K IPI P          Y  VD DGSRYLLGD AG L +L++  ++ 
Sbjct: 233  GQESITY--HNGDKYIPIAPLCFFQDTINCYAPVDKDGSRYLLGDLAGHLFILLLESDEM 290

Query: 235  ----EKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVL 290
                  V  LKIELLGE SI   ISYLDN VVYIGS  GDSQLI+L+  P  +   + VL
Sbjct: 291  MDGTNTVRDLKIELLGEVSIPEAISYLDNGVVYIGSRLGDSQLIRLHTTP-VEVHLISVL 349

Query: 291  ERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGM 350
            + Y NLGPI+D CVVDL+RQGQGQVVTCSGA+K+GSLRI+RNGIGI E AS++L GIKG+
Sbjct: 350  DTYTNLGPIIDMCVVDLDRQGQGQVVTCSGAFKEGSLRIIRNGIGIQEHASIDLPGIKGL 409

Query: 351  WSLRS-STDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQL 409
            W LR   T   +DT LV+SF+  +RIL ++ E E+EET++ GF  ++QT +C +  +NQL
Sbjct: 410  WPLRVFDTSRSYDT-LVISFVGHSRILQLSGE-EVEETDLPGFDDESQTFYCSNVCHNQL 467

Query: 410  VQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILT 469
            VQ+T  S+RL+S T R   +EWK      ++VAT N SQVLLA G   L Y EI  G + 
Sbjct: 468  VQITEKSIRLISHTERRQVHEWKPKNDRHISVATCNKSQVLLAIGSS-LHYFEIQPGEVI 526

Query: 470  EVKHAQLEYEISCLDINPIGENPS--------YSQIAAVGMWTDISVRIFSLPDLNLITK 521
            E     L +E++CL I P+  +PS         + I AVG+W D + R+  LP L  + +
Sbjct: 527  ERACVDLPHEVACLTIEPLVSDPSELEGPDFVTASICAVGLWNDNTARVLKLPTLEEMHQ 586

Query: 522  EHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLR 581
            + L  EIIPRS+LL  F+GI+YLL  LGDG L  F LN +TG ++DRKKV LGTQP +L 
Sbjct: 587  QKLADEIIPRSILLVQFDGINYLLVTLGDGTLFYFTLNPETGYISDRKKVPLGTQPTSLS 646

Query: 582  TFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGEL 641
             F+S  +  VFA SDRPTV+YSSNKKL++SNVNLKEVSHMCP +S  +PDSLA+A +  L
Sbjct: 647  VFTSGGSRTVFACSDRPTVVYSSNKKLVFSNVNLKEVSHMCPLDSDGYPDSLALANDNTL 706

Query: 642  TIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSC-AEESEMHFVR----- 695
             IGTID+IQKLHIR++PL E PRRI +QE+S+ F + +L+  S  A   +M   R     
Sbjct: 707  LIGTIDEIQKLHIRTVPLYESPRRIAYQEESQCFGLVTLRTDSVDATGDKMKITRPSAST 766

Query: 696  -----------------------------LLDDQTFEFISTYPLDTFEYGCSILSCSFSD 726
                                         ++D  TFE    Y LDT E   SI+SC    
Sbjct: 767  QASVCTKSPPVDGRSVEGFSATADIGSLLIIDQHTFEVHHAYQLDTNEEPLSIMSCKLGS 826

Query: 727  DSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAA 786
            D N Y+ VGTA+V  EE EP  GRILVF   D KL L+AEKE KGAV+ L  FNG +LAA
Sbjct: 827  DPNSYFVVGTAFVYMEETEPKHGRILVFHYIDNKLTLVAEKEVKGAVFCLCQFNGHVLAA 886

Query: 787  INQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEE 846
            IN  + +Y+W       +EL++EC +  +ILALY++ +GDF++VGDLM+S+S+L YKH E
Sbjct: 887  INTSVSIYQWTTE----KELRAECSNQSNILALYLKCKGDFVLVGDLMRSMSILNYKHVE 942

Query: 847  GAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYH 906
            G ++E A+DY+ NWM+AVEILDDD +LGAEN +N+F  +K+S   TDEER +L     +H
Sbjct: 943  GNLDEIAKDYSPNWMTAVEILDDDNFLGAENFYNVFICQKDSGATTDEERSKLREAALFH 1002

Query: 907  LGEFVNRFRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLR 965
            +G+ +N FRHGSLVM+ + ++ V     ++FGTV+G IGVI ++  + Y FL  +Q  L 
Sbjct: 1003 VGDSINTFRHGSLVMQNVGETAVSSKGHILFGTVHGSIGVITTVDEDLYAFLHSIQNRLA 1062

Query: 966  KVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV----- 1020
            KVIK VG ++HE WRSF   +KT   + F+DGDLIE FLDL+R +M E++K + V     
Sbjct: 1063 KVIKSVGNIDHESWRSFCTNEKTEAHRGFVDGDLIECFLDLNREKMAEVAKGLMVKEHGT 1122

Query: 1021 ----SVEELCKRVEELTRLH 1036
                +V++L K VEE+ R+H
Sbjct: 1123 KREATVDDLIKAVEEMNRIH 1142


>gi|198432469|ref|XP_002129207.1| PREDICTED: similar to DNA damage-binding protein 1 (Damage-specific
            DNA-binding protein 1) (UV-damaged DNA-binding factor)
            (DDB p127 subunit) (DNA damage-binding protein a) (DDBa)
            (UV-damaged DNA-binding protein 1) (UV-DDB 1) (Xeroderma
            pigmentosum group E-co... isoform 1 [Ciona intestinalis]
          Length = 1150

 Score = 1041 bits (2692), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 552/1107 (49%), Positives = 727/1107 (65%), Gaps = 83/1107 (7%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESS--ELITRAMGDVSDRIG 60
            ++ IYGRIA L+ FRP GE +D LFI TERY  C+LQ+ A     E+IT+A GD+SD +G
Sbjct: 54   EINIYGRIAVLKFFRPEGEDRDLLFILTERYHGCILQYKATDGGCEIITKASGDLSDTVG 113

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RP + G IGIIDP  +LIGL LY+G+FK +P+D   +    FNIR+EEL V+D KFL+G 
Sbjct: 114  RPPETGIIGIIDPTSKLIGLRLYEGVFKFLPYDPTSEELRPFNIRIEELSVIDAKFLHGY 173

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
              PT+V++YQ N   RHVKTY V ++DK+ V GPW Q N+D  A+ +I VP PL G +II
Sbjct: 174  TTPTLVIIYQ-NSQGRHVKTYIVDVRDKEVVAGPWKQENIDAEANFIINVPKPLAGSIII 232

Query: 181  GEETIVYCSANAFKAIPIRPSITK----AYGRVDADGSRYLLGDHAGLLHLLVITHEK-- 234
            G+E+I Y   N  K IPI P   K     Y  VD DGSRYLLGD AG L +L++  ++  
Sbjct: 233  GQESITY--HNGDKYIPIAPPQIKDTINCYAPVDKDGSRYLLGDLAGHLFILLLESDEMM 290

Query: 235  ---EKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKL----NLQPDAKGSYV 287
                 V  LKIELLGE SI   ISYLDN VVYIGS  GDSQLI+L    +++   K S +
Sbjct: 291  DGTNTVRDLKIELLGEVSIPEAISYLDNGVVYIGSRLGDSQLIRLPTDSSMEGRPKPSLI 350

Query: 288  EVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI 347
             VL+ Y NLGPI+D CVVDL+RQGQGQVVTCSGA+K+GSLRI+RNGIGI E AS++L GI
Sbjct: 351  SVLDTYTNLGPIIDMCVVDLDRQGQGQVVTCSGAFKEGSLRIIRNGIGIQEHASIDLPGI 410

Query: 348  KGMWSLRS-STDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIY 406
            KG+W LR   T   +DT LV+SF+  +RIL ++ E E+EET++ GF  ++QT +C +  +
Sbjct: 411  KGLWPLRVFDTSRSYDT-LVISFVGHSRILQLSGE-EVEETDLPGFDDESQTFYCSNVCH 468

Query: 407  NQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDG 466
            NQLVQ+T  S+RL+S T R   +EWK      ++VAT N SQVLLA G   L Y EI  G
Sbjct: 469  NQLVQITEKSIRLISHTERRQVHEWKPKNDRHISVATCNKSQVLLAIGSS-LHYFEIQPG 527

Query: 467  ILTEVKHAQLEYEISCLDINPIGENPS--------YSQIAAVGMWTDISVRIFSLPDLNL 518
             + E     L +E++CL I P+  +PS         + I AVG+W D + R+  LP L  
Sbjct: 528  EVIERACVDLPHEVACLTIEPLVSDPSELEGPDFVTASICAVGLWNDNTARVLKLPTLEE 587

Query: 519  ITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPI 578
            + ++ L  EIIPRS+LL  F+GI+YLL  LGDG L  F LN +TG ++DRKKV LGTQP 
Sbjct: 588  MHQQKLADEIIPRSILLVQFDGINYLLVTLGDGTLFYFTLNPETGYISDRKKVPLGTQPT 647

Query: 579  TLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKE 638
            +L  F+S  +  VFA SDRPTV+YSSNKKL++SNVNLKEVSHMCP +S  +PDSLA+A +
Sbjct: 648  SLSVFTSGGSRTVFACSDRPTVVYSSNKKLVFSNVNLKEVSHMCPLDSDGYPDSLALAND 707

Query: 639  GELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSC-AEESEMHFVR-- 695
              L IGTID+IQKLHIR++PL E PRRI +QE+S+ F + +L+  S  A   +M   R  
Sbjct: 708  NTLLIGTIDEIQKLHIRTVPLYESPRRIAYQEESQCFGLVTLRTDSVDATGDKMKITRPS 767

Query: 696  --------------------------------LLDDQTFEFISTYPLDTFEYGCSILSCS 723
                                            ++D  TFE    Y LDT E   SI+SC 
Sbjct: 768  ASTQASVCTKSPPVDGRSVEGFSATADIGSLLIIDQHTFEVHHAYQLDTNEEPLSIMSCK 827

Query: 724  FSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKL 783
               D N Y+ VGTA+V  EE EP  GRILVF   D KL L+AEKE KGAV+ L  FNG +
Sbjct: 828  LGSDPNSYFVVGTAFVYMEETEPKHGRILVFHYIDNKLTLVAEKEVKGAVFCLCQFNGHV 887

Query: 784  LAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYK 843
            LAAIN  + +Y+W       +EL++EC +  +ILALY++ +GDF++VGDLM+S+S+L YK
Sbjct: 888  LAAINTSVSIYQWTTE----KELRAECSNQSNILALYLKCKGDFVLVGDLMRSMSILNYK 943

Query: 844  HEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVG 903
            H EG ++E A+DY+ NWM+AVEILDDD +LGAEN +N+F  +K+S   TDEER +L    
Sbjct: 944  HVEGNLDEIAKDYSPNWMTAVEILDDDNFLGAENFYNVFICQKDSGATTDEERSKLREAA 1003

Query: 904  EYHLGEFVNRFRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQT 962
             +H+G+ +N FRHGSLVM+ + ++ V     ++FGTV+G IGVI ++  + Y FL  +Q 
Sbjct: 1004 LFHVGDSINTFRHGSLVMQNVGETAVSSKGHILFGTVHGSIGVITTVDEDLYAFLHSIQN 1063

Query: 963  NLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV-- 1020
             L KVIK VG ++HE WRSF   +KT   + F+DGDLIE FLDL+R +M E++K + V  
Sbjct: 1064 RLAKVIKSVGNIDHESWRSFCTNEKTEAHRGFVDGDLIECFLDLNREKMAEVAKGLMVKN 1123

Query: 1021 -----------SVEELCKRVEELTRLH 1036
                       +V++L K VEE+ R+H
Sbjct: 1124 FNDQHGTKREATVDDLIKAVEEMNRIH 1150


>gi|383863765|ref|XP_003707350.1| PREDICTED: DNA damage-binding protein 1-like [Megachile rotundata]
          Length = 1138

 Score = 1039 bits (2686), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 536/1091 (49%), Positives = 737/1091 (67%), Gaps = 66/1091 (6%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELI--TRAMGDVSDRIG 60
            +V IYG+IA ++ FRP  E +D LF+ T RY   +L+   E  E+   T+A G+V+DRIG
Sbjct: 57   EVGIYGKIAVVKFFRPPHEKKDLLFLLTTRYNAMILECIGEGEEIEIITKAHGNVADRIG 116

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            + ++ G   +IDP  R+IGL LYDGLFK+IP D      +A +IR++E QV D+ FL+GC
Sbjct: 117  KASETGIKAVIDPKARVIGLRLYDGLFKIIPLDKDNPELKASSIRMDEQQVQDVNFLHGC 176

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
            A PT+++++QD  + RHVKT+E++L+DK+FV+ PW Q+N++  A ++IPVP P+CG +II
Sbjct: 177  ANPTLILIHQD-INGRHVKTHEISLRDKEFVKIPWRQDNVEREATMVIPVPSPICGAIII 235

Query: 181  GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEK--- 234
            G+E+I+Y     + A+    I+ S    Y +VD  G RYLLGD AG L +L +  EK   
Sbjct: 236  GQESILYHDGTTYVAVVPPIIKQSTITCYAKVDNQGLRYLLGDMAGHLFMLFLEQEKNPD 295

Query: 235  --EKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
              + V  LK+ELLGE SI   I+YLDN V+++GS  GDSQLIKL  + D  GSY   +E 
Sbjct: 296  GTQVVKDLKVELLGEISIPECITYLDNGVIFVGSRLGDSQLIKLITKADENGSYCVPMET 355

Query: 293  YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
            + NL PIVD  VVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI E AS++L GIKGMW+
Sbjct: 356  FTNLAPIVDMAVVDLERQGQGQMVTCSGAFKEGSLRIIRNGIGIEEHASIDLPGIKGMWA 415

Query: 353  LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
            LR    + FD  LV+SF+ +TRIL +N E E+EET+I GF +  QT    +   +  +Q+
Sbjct: 416  LRIGGGN-FDNTLVLSFVGQTRILTLNGE-EVEETDIPGFVADEQTFHTGNVTNDLFIQI 473

Query: 413  TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
            T  S RL+S  ++ + +EW+     +++V   N +QVL ATG   L Y+EI  G +    
Sbjct: 474  TPTSARLISHETKTVVSEWEPENKRTISVVACNGTQVLCATGN-DLFYMEISCGQIVPKG 532

Query: 473  HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
             A L+YE++CLDI+P+  N + ++IAAVG+WTDISVRI +LP L  I KE LGGEIIPRS
Sbjct: 533  FATLQYEVACLDISPLDGN-TEAKIAAVGLWTDISVRILTLPALEEINKELLGGEIIPRS 591

Query: 533  VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
            +L+  FEG +YLLCALGDG +  F L+ + G L+D+KKV+LGTQP  LRTF S +T +VF
Sbjct: 592  ILMTCFEGNTYLLCALGDGSMYYFTLHKQNGILSDKKKVTLGTQPTVLRTFRSLSTINVF 651

Query: 593  AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
            A SDRPTVIYSSN KL++SNVNLKEV+HMC  N+ ++PDSLA+A +  +TIGTID+IQKL
Sbjct: 652  ACSDRPTVIYSSNHKLVFSNVNLKEVNHMCSLNAESYPDSLALATDSTVTIGTIDEIQKL 711

Query: 653  HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
            HIR++PLGE PRRI +QE S+TF + +++                               
Sbjct: 712  HIRTVPLGESPRRIAYQENSQTFGVITMRVDIQDSSGVSIVRHSASTQAASTSSSSHIAS 771

Query: 682  -NQSCA----EESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGT 736
             N+  A    +E E+H + ++D  TFE +  + L   EY  S++S    +D   YY VGT
Sbjct: 772  HNKPTASDISQEIEVHNLLIIDQHTFEVLHAHMLMPNEYALSLISTKLGEDPTFYYVVGT 831

Query: 737  AYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKW 796
            A V P+E EP  GRIL++   DGKL  +AEKE KG+ YSL  FNGKLLA+IN  ++L++W
Sbjct: 832  ALVNPDETEPKMGRILLYHWNDGKLTQVAEKEIKGSCYSLVEFNGKLLASINSTVRLFEW 891

Query: 797  MLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDY 856
                   +EL+ EC H  +I+ALY++T+GDF++VGDLM+S++LL YK  EG+ EE ARDY
Sbjct: 892  T----AEKELRLECSHFNNIIALYLKTKGDFVLVGDLMRSLTLLQYKTMEGSFEEIARDY 947

Query: 857  NANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRH 916
            N NWM+AVEILDDD +LGAEN FNLF  +K+S   +++ER +++ +G++HLG+ VN FRH
Sbjct: 948  NPNWMTAVEILDDDTFLGAENCFNLFVCQKDSAATSEDERQQMQEIGQFHLGDMVNVFRH 1007

Query: 917  GSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLN 975
            GSLVM+ L +S       V+FGTV+G IG++  +P   Y FL  L+  L +VIK VG + 
Sbjct: 1008 GSLVMQNLGESSTPTQGCVLFGTVSGAIGLVTQIPFTFYEFLRHLEYRLTEVIKSVGKIE 1067

Query: 976  HEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTM----------NVSVEEL 1025
            H  WRSFN E K  + + F+DGDLIESFLDLS  +M E++  +            +V++L
Sbjct: 1068 HRFWRSFNTELKVENCEGFIDGDLIESFLDLSPDKMAEVAVDLMMDDSSGMRKEATVDDL 1127

Query: 1026 CKRVEELTRLH 1036
             K VE+LTR+H
Sbjct: 1128 VKIVEDLTRIH 1138


>gi|157128864|ref|XP_001655231.1| DNA repair protein xp-e [Aedes aegypti]
 gi|108882186|gb|EAT46411.1| AAEL002407-PB [Aedes aegypti]
          Length = 1138

 Score = 1039 bits (2686), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 518/1096 (47%), Positives = 735/1096 (67%), Gaps = 73/1096 (6%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESS--ELITRAMGDVSDRIG 60
            ++ I G+IA ++LFRP    +D +FI T RY   +L+   +    E+IT+A G+V+DR+G
Sbjct: 54   EIGINGKIAVMKLFRPAEAQKDLIFILTHRYNAMILECAVQGDDIEIITKAHGNVADRVG 113

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            +P + G + +IDP  R+IG+ LY+GLFK+IP D      +A ++R+EE+ V D++FLYG 
Sbjct: 114  KPAETGILAVIDPKARVIGMRLYEGLFKIIPLDRDTHELKATSLRMEEVHVQDVEFLYGT 173

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
              PT++V++QD  + RH+KT+E+ LKDKDF +  W Q+N++  A +LIPVP PL G ++I
Sbjct: 174  QHPTLIVIHQD-LNGRHIKTHEINLKDKDFTKIAWKQDNVETEATMLIPVPTPLGGAIVI 232

Query: 181  GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVI-THEKEK 236
            G+E++VY   +++ A+    I+ S    Y RVD+ G RYLLG+ +G L ++ + T E  K
Sbjct: 233  GQESVVYHDGDSYVAVAPAIIKQSTINCYARVDSKGFRYLLGNMSGHLFMMFLETEENSK 292

Query: 237  ----VTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
                V  +K+ELLG+ +I   I+YLDN V++IGS +GDSQL+KLN      G+YV V+E 
Sbjct: 293  GLLSVKDIKVELLGDITIPECITYLDNGVLFIGSRHGDSQLVKLNTTAGDNGAYVTVMET 352

Query: 293  YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
            + NL PI+D C+VDLE+QGQGQ++TCSG+YK+GSLRI+RNGIGI E A ++L GIKGMW+
Sbjct: 353  FTNLAPIIDMCIVDLEKQGQGQMITCSGSYKEGSLRIIRNGIGIQEHACIDLPGIKGMWA 412

Query: 353  LRSSTDD-PFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQ 411
            LR   DD P+D  LV+SF+  TRIL ++ E E+EETEI GF S  QT +C +  + Q++Q
Sbjct: 413  LRVGIDDSPYDNTLVLSFVGHTRILTLSGE-EVEETEIPGFLSDQQTFYCANVDFGQIIQ 471

Query: 412  VTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEV 471
            VT  + RL+   ++ +  EWK P    ++V   N+ Q++ AT    + Y+EIG+  L   
Sbjct: 472  VTPTTARLIQCDNKSMICEWKPPDDKRISVVACNSCQMVCATACD-IYYIEIGESKLVHK 530

Query: 472  KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPR 531
                L+YE++CLDI+P+ +N +++++ AVG+WTDIS  I  LP+L ++  E LGGEIIPR
Sbjct: 531  STVTLDYEVACLDISPLEDNATHAELVAVGLWTDISACILRLPNLEVVHTEKLGGEIIPR 590

Query: 532  SVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHV 591
            S+L+  FEGI YLLCALGDG +  F+L+  T  LTD+KKV+LGTQP  L+TF S +TT+V
Sbjct: 591  SILMAHFEGIVYLLCALGDGSMFYFVLDKNTNRLTDQKKVTLGTQPTILKTFRSLSTTNV 650

Query: 592  FAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQK 651
            FA SDRPTVIYSSN KL++SNVNLKEV+HMC  N+ A+ DSLA+A +  + +GTID+IQK
Sbjct: 651  FACSDRPTVIYSSNHKLVFSNVNLKEVNHMCSLNAEAYQDSLALATKNSVILGTIDEIQK 710

Query: 652  LHIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------ 681
            LHIR++PLGE PRRI +QE S+TF + +++                              
Sbjct: 711  LHIRTVPLGESPRRIAYQEASQTFGVITVRTDIQDSSGLTPSRQSASTQTTNVTLSTNMG 770

Query: 682  -------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCV 734
                   N    +E E+H + ++D  TFE +  +     EY  S++S    +D N YY V
Sbjct: 771  LLKAGASNAEFGQEVEVHNLLIIDQNTFEVLHAHQFMQTEYAMSLISAKLGNDPNTYYIV 830

Query: 735  GTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLY 794
            GTA V PEE EP  GRI+++   DG L  ++EKE KG+ YSL  FNG++LA+IN  ++LY
Sbjct: 831  GTALVNPEEPEPKVGRIIIYHYADGNLTQVSEKEIKGSCYSLVEFNGRVLASINSTVRLY 890

Query: 795  KWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERAR 854
            +W   DD  ++L+ EC H  ++LALY +T+GDFI+VGDLM+SI+LL YK  EG+ EE AR
Sbjct: 891  EWT--DD--KDLRLECSHFNNVLALYCKTKGDFILVGDLMRSITLLQYKQMEGSFEEIAR 946

Query: 855  DYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRF 914
            DY  NWM+AVEILDDD +LGA+N+ NLF   K+    TD+ER ++  V + HLG+ VN F
Sbjct: 947  DYQPNWMTAVEILDDDAFLGADNSNNLFVCLKDGAATTDDERQQMPEVAQVHLGDMVNVF 1006

Query: 915  RHGSLVMRLPDSDVGQ--IPT---VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIK 969
            RHGSLVM     ++G+   PT   V+FGTV+G IG++  +P + Y FL KLQ NL   IK
Sbjct: 1007 RHGSLVME----NIGERTTPTSGCVLFGTVSGAIGLVTQIPADYYEFLRKLQENLTDTIK 1062

Query: 970  GVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV--------- 1020
             VG ++H  WRSF+ E KT   + F+DGDL+ESFLDLSR +M E +  + +         
Sbjct: 1063 SVGKIDHAYWRSFHTEMKTERCEGFIDGDLVESFLDLSREKMHEAALGLQIDVDGTKKEA 1122

Query: 1021 SVEELCKRVEELTRLH 1036
            +V+++ K VE+LTR+H
Sbjct: 1123 TVDDIIKIVEDLTRIH 1138


>gi|328788389|ref|XP_396048.3| PREDICTED: DNA damage-binding protein 1-like isoform 1 [Apis
            mellifera]
          Length = 1141

 Score = 1038 bits (2683), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 534/1094 (48%), Positives = 733/1094 (67%), Gaps = 69/1094 (6%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
            +V IYG+IA ++ FRP  E +D LF+ T RY   +L+   + E+ E+IT+A G+V+DRIG
Sbjct: 57   EVGIYGKIAVVKFFRPPHEKKDLLFLLTTRYNAMILECIGEGENIEIITKAHGNVADRIG 116

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            + ++ G   +IDP  R+IGL LYDGLFK+IP D      +A +IR+EE QV D+ FL+GC
Sbjct: 117  KASETGIKAVIDPKARVIGLRLYDGLFKIIPLDKDNPELKASSIRMEEHQVQDVNFLHGC 176

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
            A PT+++++QD  + RHVKT+E++L+DK+FV+ PW Q+N++  A ++IPVP P+CG +II
Sbjct: 177  ANPTLILIHQD-INGRHVKTHEISLRDKEFVKIPWRQDNVEREAMIVIPVPSPICGAIII 235

Query: 181  GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEK- 236
            G+E+I+Y   N + A+    I+ S    Y +VD  G RYLLGD AG L +L +  EK+  
Sbjct: 236  GQESILYHDGNTYVAVVPPIIKQSTITCYAKVDNQGLRYLLGDMAGHLFMLFVEQEKKAD 295

Query: 237  ----VTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
                V  LK+ELLGE SI   I+YLDN V+++GS  GDSQL+KL  + D  GSY   +E 
Sbjct: 296  GTQVVKDLKVELLGEISIPECITYLDNGVIFVGSRLGDSQLVKLITKADENGSYCVPMET 355

Query: 293  YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
            + NL PIVD  VVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI E AS++L GIKGMW+
Sbjct: 356  FTNLAPIVDMAVVDLERQGQGQMVTCSGAFKEGSLRIIRNGIGIEEHASIDLPGIKGMWA 415

Query: 353  LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
            L+    + FD  LV+SF+ +TRIL +N E E+EET+I GF +  QT    +   +  +Q+
Sbjct: 416  LKIGGGN-FDNTLVLSFVGQTRILTLNGE-EVEETDIPGFVADEQTFHTGNVTNDLFIQI 473

Query: 413  TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
            T  S RL+S  ++ + +EW+     +++V   N +QVL ATG   L YLEI  G +    
Sbjct: 474  TPTSARLISYETKTVVSEWEPENKRTISVVACNGTQVLCATGN-DLFYLEISCGQILPKG 532

Query: 473  HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
               L+YE++CLDI+P+  N + ++IAAVG+WT ISV I +LP L  I KE LGGEIIPRS
Sbjct: 533  FTTLQYEVACLDISPLDGN-TEAKIAAVGLWTHISVHILTLPALEEINKELLGGEIIPRS 591

Query: 533  VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
            +L+  FEG +YLLCALGDG +  F L+ + G L+D+KKV+LGTQP  LRTF S  TT+VF
Sbjct: 592  ILMTCFEGNTYLLCALGDGSMYYFTLHKQNGILSDKKKVTLGTQPTVLRTFRSLFTTNVF 651

Query: 593  AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
            A SDRPTVIYSSN KL++SNVNLKEV+HMC  N+ ++PDSLA+A +  +TIGTID+IQKL
Sbjct: 652  ACSDRPTVIYSSNHKLVFSNVNLKEVNHMCSLNAESYPDSLALATDSTVTIGTIDEIQKL 711

Query: 653  HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
            HIR++PLGE PRRI +QE S+TF + +++                               
Sbjct: 712  HIRTVPLGESPRRIAYQESSQTFGVITMRVDIQDSSGVSIVRHSASTQAASTSSSSHIAS 771

Query: 682  --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
                         +E E+H + ++D  TFE +  + L   EY  S++S    +D   YY 
Sbjct: 772  YNKPTGHTASDICQEIEVHNLLIIDQHTFEVLHAHMLMPTEYALSLISTKLGEDPTSYYI 831

Query: 734  VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
            VGTA V P+E EP  GRIL++   DGKL  +AEKE KG+ YSL  FNGKLLA+IN  ++L
Sbjct: 832  VGTALVHPDETEPKMGRILLYHWSDGKLTQVAEKEIKGSCYSLTEFNGKLLASINSTVRL 891

Query: 794  YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
            ++W       +EL+ EC H  +I+ALY++++GDFI+VGDLM+S++LL YK  EG  EE A
Sbjct: 892  FEWT----AEKELRLECSHFNNIIALYLKSKGDFILVGDLMRSLTLLQYKTMEGCFEEIA 947

Query: 854  RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
            RDYN NWM+A+EILDDD +LGAEN FNLF  +K+S   +++ER +++ VG++HLG+ VN 
Sbjct: 948  RDYNPNWMTAIEILDDDTFLGAENCFNLFVCQKDSAATSEDERQQMQEVGQFHLGDMVNV 1007

Query: 914  FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
            FRHGSLVM+ L +S       V+FGTV+G IG++  +P   Y FL  L+  L  VIK VG
Sbjct: 1008 FRHGSLVMQNLGESSTPTQGCVLFGTVSGAIGLVTQIPFIFYEFLRNLEDRLTSVIKSVG 1067

Query: 973  GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV----------SV 1022
             + H  WRSFN E K    + F+DGDLIESFLDLS  +M E++  + +          +V
Sbjct: 1068 KIEHNFWRSFNTELKIEQCEGFIDGDLIESFLDLSPDKMAEVASGLMIDDPSGMKKEATV 1127

Query: 1023 EELCKRVEELTRLH 1036
            ++L K VE+LTR+H
Sbjct: 1128 DDLVKIVEDLTRIH 1141


>gi|307205760|gb|EFN83990.1| DNA damage-binding protein 1 [Harpegnathos saltator]
          Length = 1138

 Score = 1035 bits (2676), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 529/1094 (48%), Positives = 735/1094 (67%), Gaps = 69/1094 (6%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
            +V IYGRIA ++ FRP  E +D LF+ T RY   +L+   + E  E+IT+A G+V+DRIG
Sbjct: 54   EVGIYGRIAVVKFFRPPHEKKDLLFLLTTRYNAMILECIGEGEDIEIITKAHGNVADRIG 113

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            + ++ G   +IDP  R+IGL LYDGLFK+IP D      +A +IR++E QV D+ FL+GC
Sbjct: 114  KASETGIKAVIDPKARVIGLRLYDGLFKIIPLDKDNPELKASSIRMDEQQVQDVNFLHGC 173

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
            A PT+++++QD  + RHVKT+E++L+DK+FV+ PW Q+N++  A ++IPVP P+CG +II
Sbjct: 174  ANPTLILIHQD-INGRHVKTHEISLRDKEFVKIPWRQDNVEREAMMVIPVPSPICGAIII 232

Query: 181  GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEK- 236
            G+E+I+Y     + A+    I+ S    Y +VD  G RYLLGD AG L +L +  EK+  
Sbjct: 233  GQESILYHDGTTYIAVVPPIIKQSTITCYAKVDNQGLRYLLGDMAGHLFMLFLEQEKKPD 292

Query: 237  ----VTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
                V  LK+ELLGE SI   I+YLDN V+++GS  GDSQLIKL  + D  GSY   +E 
Sbjct: 293  GTQVVKDLKVELLGEISIPECITYLDNGVIFVGSRLGDSQLIKLITKADENGSYCVPMET 352

Query: 293  YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
            + NL PIVD  VVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI E AS++L GIKGMW+
Sbjct: 353  FTNLAPIVDMAVVDLERQGQGQMVTCSGAFKEGSLRIIRNGIGIQEHASIDLPGIKGMWA 412

Query: 353  LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
            L+    + FD  LV+SF+ +TRIL +N E E+EET+I GF +  QT    +   +  +Q+
Sbjct: 413  LKVGGGN-FDNTLVLSFVGQTRILTLNGE-EVEETDIPGFVADEQTFHTGNVTNDLFIQI 470

Query: 413  TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
            T  S RL+S  ++ + +EW+     +++V   N +QVL ATG   L Y+EI    +    
Sbjct: 471  TPTSARLISHETKIVVSEWEPENKRTISVVACNGTQVLCATGN-DLFYMEIICNQIVPKG 529

Query: 473  HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
             A L++E++CLDI+P+ +  S ++I AVG+WTDISVRI +LP L  I KE LGGEIIPRS
Sbjct: 530  FATLQHEVACLDISPL-DGISEAKIVAVGLWTDISVRILTLPGLEEINKELLGGEIIPRS 588

Query: 533  VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
            +L+  FEG +YLLCALGDG +  F L+ + G L+D+KKV+LGTQP  LRTF S +TT+VF
Sbjct: 589  ILMTCFEGNTYLLCALGDGSMYYFTLHKQNGILSDKKKVTLGTQPTVLRTFRSLSTTNVF 648

Query: 593  AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
            A SDRPTVIYSSN KL++SNVNLKEV+HMC  N+ ++PDSLA+A +  +TIGTID+IQKL
Sbjct: 649  ACSDRPTVIYSSNHKLVFSNVNLKEVNHMCSLNAESYPDSLALATDSTVTIGTIDEIQKL 708

Query: 653  HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
            HIR++PLGE PRRI +QE S+TF + +++                               
Sbjct: 709  HIRTVPLGESPRRIAYQESSQTFGVITMRVDVQESSGVSIVRHSASTQAASTSSSSHIAS 768

Query: 682  --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
                         +E E+H + ++D  TFE +  + L   EY  S++S    +D   Y+ 
Sbjct: 769  HNKPSGHTASEIGQEIEVHNLLIIDQHTFEVLHAHTLMPTEYALSLISTRLGEDPTSYFV 828

Query: 734  VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
            VGTA + P+E EP  GRIL++   DGKL  +AEKE KG+ YSL  FNGKLLA+IN  ++L
Sbjct: 829  VGTALINPDETEPKMGRILLYHWSDGKLTQVAEKEIKGSCYSLVEFNGKLLASINSTVRL 888

Query: 794  YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
            ++W       +EL+ EC H  +I+ALY++T+GDF++VGDLM+S++LL YK  EG+ EE A
Sbjct: 889  FEWT----AEKELRLECSHFNNIIALYLKTKGDFVLVGDLMRSLTLLQYKTMEGSFEEIA 944

Query: 854  RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
            RDYN NWM+++EILDDD +LGAEN FNLF  +K+S   +++ER +++ VG++HLG+ VN 
Sbjct: 945  RDYNPNWMTSIEILDDDTFLGAENCFNLFVCQKDSAATSEDERQQMQEVGQFHLGDMVNV 1004

Query: 914  FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
            FRHGSLVM+ L +S    +  V+FGTV+G IG++  +P   Y FL  L+  L  VIK VG
Sbjct: 1005 FRHGSLVMQNLGESSTPTLGCVLFGTVSGAIGLVTQIPFAFYEFLRNLEDRLNSVIKSVG 1064

Query: 973  GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV----------SV 1022
             + H  WRSFN E K    + F+DGDLIESFLDL+  +M E++  + +          +V
Sbjct: 1065 KIEHNFWRSFNTELKIEQCEGFIDGDLIESFLDLNHDKMAEVAMGLMIDDGSGMKKEATV 1124

Query: 1023 EELCKRVEELTRLH 1036
            ++L K VE+LTR+H
Sbjct: 1125 DDLVKVVEDLTRIH 1138


>gi|380025901|ref|XP_003696702.1| PREDICTED: LOW QUALITY PROTEIN: DNA damage-binding protein 1-like
            [Apis florea]
          Length = 1141

 Score = 1035 bits (2676), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 533/1094 (48%), Positives = 732/1094 (66%), Gaps = 69/1094 (6%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
            +V IYG+IA ++ FRP  E +D LF+ T RY   +L+   + E+ E+IT+A G+V+DRIG
Sbjct: 57   EVGIYGKIAVVKFFRPPHEKKDLLFLLTTRYNAMILECIGEGENIEIITKAHGNVADRIG 116

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            + ++ G   +IDP  R+IGL LYDGLFK+IP D      +A +IR+EE QV D+ FL+GC
Sbjct: 117  KASETGIKAVIDPKARVIGLRLYDGLFKIIPLDKDNPELKASSIRMEEHQVQDVNFLHGC 176

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
            A PT+++++QD  + RHVKT+E++L+DK+FV+ PW Q+N++  A ++IPVP P+CG +II
Sbjct: 177  ANPTLILIHQD-INGRHVKTHEISLRDKEFVKIPWRQDNVEREAMIVIPVPSPICGAIII 235

Query: 181  GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEK- 236
            G+E+I+Y   N + A+    I+ S    Y +VD  G RYLLGD AG L +L +  EK+  
Sbjct: 236  GQESILYHDGNTYVAVVPPIIKQSTITCYAKVDNQGLRYLLGDMAGHLFMLFVEQEKKTD 295

Query: 237  ----VTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
                V  LK+ELLGE SI   I+YLDN V+++GS  GDSQL+KL  + D  GSY   +E 
Sbjct: 296  GTQVVKDLKVELLGEISIPECITYLDNGVIFVGSRLGDSQLVKLITKADENGSYCVPMET 355

Query: 293  YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
            + NL PIVD  VVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI E AS++L GIKGMW+
Sbjct: 356  FTNLAPIVDMAVVDLERQGQGQMVTCSGAFKEGSLRIIRNGIGIEEHASIDLPGIKGMWA 415

Query: 353  LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
            L+    + FD  LV+SF+ +TRIL +N E E+EET+I GF +  QT    +   +  +Q+
Sbjct: 416  LKIGGGN-FDNTLVLSFVGQTRILTLNGE-EVEETDIPGFVADEQTFHTGNVTNDLFIQI 473

Query: 413  TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
            T  S RL+S  ++ + +EW+     +++V   N +QVL ATG   L YLEI  G +    
Sbjct: 474  TPTSARLISYETKTVVSEWEPENKRTISVVACNGTQVLCATGN-DLFYLEISCGQILPKG 532

Query: 473  HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
               L+YE++CLDI+P+  N + ++IAAVG+WT ISV I +LP L  I KE LGGEIIPRS
Sbjct: 533  FTTLQYEVACLDISPLDGN-TEAKIAAVGLWTHISVHILTLPALEEINKELLGGEIIPRS 591

Query: 533  VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
            +L+  FEG +YLLCALGDG +  F L+ + G L+D+KKV+LGTQP  LRTF S  TT+VF
Sbjct: 592  ILMTCFEGNTYLLCALGDGSMYYFTLHKQNGILSDKKKVTLGTQPTVLRTFRSLFTTNVF 651

Query: 593  AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
            A SDRPTVIYSSN KL++SNVNLKEV+HMC  N+ ++PDSLA+A +  +TIGTID+IQKL
Sbjct: 652  ACSDRPTVIYSSNHKLVFSNVNLKEVNHMCSLNAESYPDSLALATDSTVTIGTIDEIQKL 711

Query: 653  HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
            HIR++PLGE PRRI +QE S+TF + +++                               
Sbjct: 712  HIRTVPLGESPRRIAYQESSQTFGVITMRVDIQDSSGVSIVRHSASTQAASTSSSSHIAS 771

Query: 682  --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
                         +E E+H + ++D  TFE +  + L   EY  S++S    +D   YY 
Sbjct: 772  YNKPTGHTASDICQEIEVHNLLIIDQHTFEVLHAHMLMPTEYALSLISTKLGEDPTSYYI 831

Query: 734  VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
            VGTA V P+E EP  GRIL++   DGKL  +AEKE KG+ YSL  FNGKLLA+IN  ++L
Sbjct: 832  VGTALVHPDETEPKMGRILLYHWSDGKLTQVAEKEXKGSCYSLTEFNGKLLASINSTVRL 891

Query: 794  YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
            ++W       +EL+ EC H  +I+ALY++++GDFI+VGDLM+S++LL YK  EG  EE A
Sbjct: 892  FEWT----AEKELRLECSHFNNIIALYLKSKGDFILVGDLMRSLTLLQYKTMEGCFEEIA 947

Query: 854  RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
            RDYN NWM+A+EILDDD +LGAEN FNLF  +K+S   +++ER +++ VG++HLG+ VN 
Sbjct: 948  RDYNPNWMTAIEILDDDTFLGAENCFNLFVCQKDSAATSEDERQQMQEVGQFHLGDMVNV 1007

Query: 914  FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
            FRHGSLVM+ L +S       V+ GTV+G IG++  +P   Y FL  L+  L  VIK VG
Sbjct: 1008 FRHGSLVMQNLGESSTPTQGCVLXGTVSGAIGLVTQIPFIFYEFLRNLEDRLTSVIKSVG 1067

Query: 973  GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV----------SV 1022
             + H  WRSFN E K    + F+DGDLIESFLDLS  +M E++  + +          +V
Sbjct: 1068 KIEHNFWRSFNTELKIEQCEGFIDGDLIESFLDLSPDKMAEVASGLMIDDPSGMKKEATV 1127

Query: 1023 EELCKRVEELTRLH 1036
            ++L K VE+LTR+H
Sbjct: 1128 DDLVKIVEDLTRIH 1141


>gi|332030156|gb|EGI69950.1| DNA damage-binding protein 1 [Acromyrmex echinatior]
          Length = 1138

 Score = 1031 bits (2665), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 523/1094 (47%), Positives = 736/1094 (67%), Gaps = 69/1094 (6%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
            +V IYG+I+ ++ FRP  E +D LF+ T RY   +L+   + E  E+IT+A G+V+DRIG
Sbjct: 54   EVGIYGKISVIKFFRPPHEKKDLLFLLTTRYNAMILECIGEGEDIEIITKAHGNVADRIG 113

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            + ++ G   +IDP  R+IGL LYDGLFK+IP D      +A +IR++E QV D+ FL+GC
Sbjct: 114  KASETGIKAVIDPKARVIGLRLYDGLFKIIPLDKDNPELKASSIRMDEQQVQDVNFLHGC 173

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
            A PT+++++QD  + RHVKT+E+ L+DK+F + PW Q+N++  A ++IPVP P+CG +II
Sbjct: 174  ANPTLILIHQD-INGRHVKTHEINLRDKEFAKIPWRQDNVEREAMMVIPVPSPICGAIII 232

Query: 181  GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEK- 236
            G+E+I+Y     + A+    I+ S    Y +VD  G RYLLGD AG L +L +  EK+  
Sbjct: 233  GQESILYHDGTTYVAVVPPIIKQSTITCYAKVDNQGLRYLLGDMAGHLFMLFLEQEKKPD 292

Query: 237  ----VTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
                V  LK+ELLGE SI   I+YLDN V+Y+GS  GDSQLIKL  + D  GSY   +E 
Sbjct: 293  GSQVVKDLKVELLGEISIPECITYLDNGVIYVGSRLGDSQLIKLITKADENGSYCVPMET 352

Query: 293  YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
            + NL PIVD  VVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI E AS++L GIKGMW+
Sbjct: 353  FTNLAPIVDMAVVDLERQGQGQMVTCSGAFKEGSLRIIRNGIGIQEHASMDLPGIKGMWA 412

Query: 353  LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
            L+      FD  LV+SF+ +TRIL +N E E+EET+I GF +  QT    +   +  +Q+
Sbjct: 413  LKVGGSH-FDNTLVLSFVGQTRILTLNGE-EVEETDIPGFVADEQTFHTGNVTDDLFIQI 470

Query: 413  TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
            TS SVRL+S  ++ + +EW+     +++V   N +QVL ATG   L Y+EI    +    
Sbjct: 471  TSTSVRLISHENKIVVSEWEPQNKRTISVVACNGTQVLCATGND-LFYIEIICNQIVSKG 529

Query: 473  HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
               L++E++CLDI+P+ +  + ++I AVG+WTDISVRI +LP+L  I KE LGGEIIPRS
Sbjct: 530  FVTLQHEVACLDISPL-DGVNEAKIVAVGLWTDISVRILTLPNLEEINKELLGGEIIPRS 588

Query: 533  VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
            +L+  FEG +YLLCALGDG +  F+L+ ++G L+D+KKV+LGTQP  LRTF S +TT+VF
Sbjct: 589  ILMTCFEGNTYLLCALGDGSMYYFILHRQSGMLSDKKKVTLGTQPTVLRTFRSLSTTNVF 648

Query: 593  AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
            A SDRPTVIYSSN KL++SNVNLKEV+HMC  N+ ++PDSLA+A +  +TIGTID+IQKL
Sbjct: 649  ACSDRPTVIYSSNHKLVFSNVNLKEVNHMCSLNAESYPDSLALATDSTVTIGTIDEIQKL 708

Query: 653  HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
            HIR++PLGE PRRI +QE S+TF + +++                               
Sbjct: 709  HIRTVPLGESPRRIAYQESSQTFGVITMRVDVQESSGVSIVRHSASTQAASTSSSSHVVS 768

Query: 682  --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
                         +E E+H + ++D  TFE +  + L   EY  S++S    +D   Y+ 
Sbjct: 769  HNKPSGHTASEIGQEIEVHNLLIIDQHTFEVLHAHTLMATEYALSLISTKLGEDPTSYFV 828

Query: 734  VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
            VGTA++ P+E EP  GRIL++   +GK   +AEKE KG+ YSL  FNGKLLA+IN  ++L
Sbjct: 829  VGTAFINPDETEPKMGRILLYHWSEGKFTQVAEKEIKGSCYSLVEFNGKLLASINSTVRL 888

Query: 794  YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
            ++W       +EL+ EC H  +I+ALY++T+GDF++VGDLM+S++LL YK  EG+ EE A
Sbjct: 889  FEWT----AEKELRLECSHFNNIIALYLKTKGDFVLVGDLMRSLTLLQYKTMEGSFEEIA 944

Query: 854  RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
            RDYN NWM+++EILDDD +LGAEN FNLF  +K+S   +++ER +++ +G++HLG+ VN 
Sbjct: 945  RDYNPNWMTSIEILDDDTFLGAENCFNLFVCQKDSAATSEDERQQMQEIGQFHLGDMVNV 1004

Query: 914  FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
            FRHGSLVM+ L +S    +  V+FGTV+G IG++  +P   Y FL  ++  L  VIK VG
Sbjct: 1005 FRHGSLVMQNLGESSTPTLGCVLFGTVSGAIGLVTQIPVTFYEFLRNMEDRLNSVIKSVG 1064

Query: 973  GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV----------SV 1022
             + H  WRSFN E K    + F+DGDLIESFLDL+  +M E++  + +          +V
Sbjct: 1065 KIEHNFWRSFNTELKIEQCEGFIDGDLIESFLDLNHDKMAEVAMGLMIDDGSGMKKEATV 1124

Query: 1023 EELCKRVEELTRLH 1036
            ++L K VE+LTR+H
Sbjct: 1125 DDLVKIVEDLTRIH 1138


>gi|193644722|ref|XP_001942922.1| PREDICTED: DNA damage-binding protein 1-like [Acyrthosiphon pisum]
          Length = 1156

 Score = 1030 bits (2664), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 523/1110 (47%), Positives = 744/1110 (67%), Gaps = 85/1110 (7%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVL---QWDAESSELITRAMGDVSDRI 59
            ++ +YG+I  +++FRP  + +D LF+ T RY   +L   Q ++   ++IT+A G++SD+I
Sbjct: 56   EIALYGKIEIMKVFRPKHKDKDLLFVVTARYNTMILECVQTESGDIDIITKAHGNISDQI 115

Query: 60   GRPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYG 119
            G+ ++ G + +IDP  R+IGL LYDGLFK+IP D +G+LK A+ +R+EE++V DI FLYG
Sbjct: 116  GKISEIGAMAVIDPSARVIGLKLYDGLFKIIPLDKEGELK-AYCLRMEEVEVQDIDFLYG 174

Query: 120  CAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLI 179
            CA PTI++++QD    RH+K  E+++KDK+FV+ PW Q N++  A ++IPVP PLCG +I
Sbjct: 175  CANPTIIIIHQDTM-GRHIKAKELSIKDKEFVKTPWKQENVETEASMIIPVPEPLCGAII 233

Query: 180  IGEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE- 235
            IG E+++Y + ++F AI    I+ S    Y R+D +G+RYLLGD AG L +L++ +EK  
Sbjct: 234  IGRESVLYHNGSSFIAISPPVIKQSTIVCYARIDPEGTRYLLGDMAGHLFMLLLNYEKNP 293

Query: 236  ----KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLE 291
                K+   K++LLGE SI  +++YLDN ++Y+ S  GDSQLIKLN +PD  GS++ VL+
Sbjct: 294  DGTFKIKDPKVDLLGEISIPESLTYLDNKIIYVASRVGDSQLIKLNKKPDQFGSHITVLD 353

Query: 292  RYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMW 351
             ++NLGPIVD CV+DLERQGQGQVVTCSGAYK+GSLRI+RNGIGI E A+++L GIKGMW
Sbjct: 354  TFMNLGPIVDMCVIDLERQGQGQVVTCSGAYKEGSLRIIRNGIGIQEVATIDLVGIKGMW 413

Query: 352  SLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQ 411
             LR +TD   D  LV+SF+  +R+LA + E E+EE ++EGF S+ QT +C +   N++VQ
Sbjct: 414  PLRITTDSLLDDTLVLSFVGHSRVLAYSGE-EVEEIDLEGFQSELQTFYCGNTSDNKMVQ 472

Query: 412  VTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEV 471
            +TS SVRL+   S+ L +EW  P G S+NV + N  Q + ATG   L Y+EIG   + + 
Sbjct: 473  ITSASVRLICLESKCLVSEWNVPDGKSINVVSCNGHQAVCATGNS-LYYIEIGTDKVDQK 531

Query: 472  KHAQLEYEISCLDINPIGENPSY--SQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEII 529
                LE+E+SCLD+    +N  Y  S + AVG+W DISV+I  LPD   + +E LG EII
Sbjct: 532  GFITLEHEVSCLDVCSF-KNEFYKNSSLVAVGLWMDISVKILQLPDFVELVREPLGEEII 590

Query: 530  PRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTT 589
            PRS+L+  FE I YLLCALGDG L  F LN + G L+D++KV+LGTQP  +R F S  TT
Sbjct: 591  PRSILMVTFENIDYLLCALGDGSLCYFHLNPENGVLSDKRKVNLGTQPTLIRKFQSLATT 650

Query: 590  HVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDI 649
             VFA SD PTVIYSSN KL++SNVNL++V+HMC  N+ ++PDSLA+A +  + IGTID++
Sbjct: 651  SVFACSDHPTVIYSSNNKLIFSNVNLRKVNHMCSLNTKSYPDSLAMATDTAIIIGTIDEM 710

Query: 650  QKLHIRSIPLGEHPRRICHQEQSRTFAICSLK---------------------------- 681
            QKLHIR+IPLGE PRRI HQE S++F I +++                            
Sbjct: 711  QKLHIRTIPLGEAPRRIAHQESSKSFGIITMRIDVHEGINLVPARPSASTSAQNISGAIN 770

Query: 682  -----NQSCAE------------ESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSF 724
                 N S               E E+H + +LD  TFE +  + L++ EY  SI+S   
Sbjct: 771  NRMPNNASAVNSNQGPLSSEYGLEVEIHNMLVLDQNTFEVLHAHQLNSNEYALSIISAKL 830

Query: 725  SDDSNVYYCVGTAYVLPEENEPTKGRILVFIVED--GKLQLIAEKETKGAVYSLNAFNGK 782
             DD   YY +GTA V PE+ +P  GRIL+F  +D   KL  I EKE KGA Y +  FNGK
Sbjct: 831  GDDPATYYILGTAVVNPEDQDPKLGRILIFHWDDSSSKLTPITEKEVKGACYGMAEFNGK 890

Query: 783  LLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIY 842
            LLAA+N  ++L++W       +EL+ EC H  +I+AL+V+T+GDFIV GDLM+S++LL Y
Sbjct: 891  LLAAVNCTVRLFEWT----AEKELRLECSHFNNIVALFVKTKGDFIVCGDLMRSLTLLQY 946

Query: 843  KHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVV 902
            K  EG+ EE ARDYN  W +A+EI+DDD++LGAEN+ NLF + K+S   +DE R +L+ +
Sbjct: 947  KTMEGSFEEIARDYNPKWSTAIEIIDDDVFLGAENDKNLFIIHKDSTLTSDEARHQLQEI 1006

Query: 903  GEYHLGEFVNRFRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQ 961
            G++H G+ +N FRHGSLVM+   D+ V     +++GT +G +G++  L  + + FL  L+
Sbjct: 1007 GQFHCGDLINVFRHGSLVMQHFTDTYVSVQGGILYGTCSGALGLVTQLTPKMFDFLSDLE 1066

Query: 962  TNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRM---------- 1011
             +L  V+KGVG +NH+ WRS++ E +T  +++F+DGDLIESFLDLS+  M          
Sbjct: 1067 KSLATVVKGVGKINHQFWRSYHTEIRTEPSESFVDGDLIESFLDLSKREMIAVVDALQGA 1126

Query: 1012 -----DEISKTMNVSVEELCKRVEELTRLH 1036
                  +I K   +S++++ K VE+LTR+H
Sbjct: 1127 YDHEFKKIPKDTKISLDDVIKLVEDLTRIH 1156


>gi|307186138|gb|EFN71863.1| DNA damage-binding protein 1 [Camponotus floridanus]
          Length = 1136

 Score = 1025 bits (2649), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 525/1092 (48%), Positives = 728/1092 (66%), Gaps = 67/1092 (6%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
            +V IYG+IA ++ FRP  E +D LF+ T  Y   +L+   + E  E+IT+A G+V+DRIG
Sbjct: 54   EVGIYGKIAVIKFFRPPHEKKDLLFLLTTSYNAMILECMGEGEDIEIITKAHGNVADRIG 113

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            + ++ G   +IDP  R+IGL LYDGLFK+IP D      +A +IR++E QV D+ FL+GC
Sbjct: 114  KASETGIKAVIDPKARVIGLRLYDGLFKIIPLDKDNPELKASSIRMDEQQVQDVNFLHGC 173

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
              PT+++++QD  + RHVKT+E+ L++K+F + PW Q+N++  A ++IPVP P+CG +II
Sbjct: 174  TNPTLILIHQD-INGRHVKTHEINLREKEFSKIPWRQDNVEREAMMVIPVPSPICGAIII 232

Query: 181  GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEK- 236
            G+E+I+Y     + A+    I+ S    Y +VD  G RYLLGD AG L +L +  EK+  
Sbjct: 233  GQESILYHDGTTYVAVVPPIIKQSTITCYAKVDNQGLRYLLGDMAGHLFMLFLELEKKPD 292

Query: 237  ----VTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
                V  LK+ELLGE SI   I+YLDN V+Y+GS  GDSQLIKL  + D  GSY   +E 
Sbjct: 293  GTQVVKDLKVELLGEISIPECITYLDNGVIYVGSRLGDSQLIKLITKADENGSYCVPMET 352

Query: 293  YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
            + NL PIVD  VVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI E AS++L GIKGMW+
Sbjct: 353  FTNLAPIVDMAVVDLERQGQGQMVTCSGAFKEGSLRIIRNGIGIQEHASIDLPGIKGMWA 412

Query: 353  LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
            L+    + FD  LV+SF+ +TRIL +N E E+EET+I GF +  QT    +   +  +Q+
Sbjct: 413  LKVGGGN-FDNTLVLSFVGQTRILTLNGE-EVEETDIPGFVADEQTFHTGNVTNDLFIQI 470

Query: 413  TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
            T  S RL+S  S+ + +EW+     +++V   N +QVL ATG   L Y+EI    +    
Sbjct: 471  TPTSARLISHESKMVVSEWEPQNKRTISVVACNGTQVLCATGN-DLFYMEISCNQIVPKG 529

Query: 473  HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
             A L++E++CLDI+P+ +  + ++I AVG+WTDISVRI +LP L  I KE LGGEIIPRS
Sbjct: 530  FATLQHEVACLDISPL-DGVNEAKIVAVGLWTDISVRILTLPGLEEINKELLGGEIIPRS 588

Query: 533  VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
            +L+  FEG +YLLCALGDG +  F L  + G L+D+K+V+LGTQP  LRTF S +TT+VF
Sbjct: 589  ILMTCFEGNTYLLCALGDGSMYYFTLYKQNGVLSDKKRVTLGTQPTVLRTFRSLSTTNVF 648

Query: 593  AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
            A SDRPTVIYSSN KL++SNVNLKEV+HMC  N+ ++PDSLA+A +  +TIGTID+IQKL
Sbjct: 649  ACSDRPTVIYSSNHKLVFSNVNLKEVNHMCSLNAESYPDSLALATDSTVTIGTIDEIQKL 708

Query: 653  HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
            HIR++PLGE PRRI +QE S+TF + +++                               
Sbjct: 709  HIRTVPLGESPRRIAYQESSQTFGVITMRVDVQESSGVSIVRHSASTQAASLSSGIASHN 768

Query: 682  ------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVG 735
                       +E E+H + ++D  TFE +  + L   EY  S++S    +DS  YY VG
Sbjct: 769  KPSGHTASEIGQEIEVHNLLIIDQHTFEVLHAHTLMPTEYALSLISTRLGEDSTSYYVVG 828

Query: 736  TAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYK 795
            TA++ P+E EP  GRIL+F   DGKL  +AEKE KG+ YSL  FNGKLLA+IN  ++L++
Sbjct: 829  TAFINPDETEPKMGRILLFHWSDGKLSQVAEKEIKGSCYSLVEFNGKLLASINSTVRLFE 888

Query: 796  WMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARD 855
            W       +EL+ EC H  +I+ALY++T+ DF++VGDLM+S++LL YK  EG+ EE ARD
Sbjct: 889  WT----AEKELRLECSHFNNIIALYLKTKSDFVLVGDLMRSLTLLQYKTMEGSFEEIARD 944

Query: 856  YNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFR 915
            YN NWM+++EILDDD +LGAEN FNLF  +K+S   +++ER +++ VG++HLG+ VN FR
Sbjct: 945  YNPNWMTSIEILDDDTFLGAENCFNLFICQKDSAATSEDERQQMQEVGQFHLGDMVNVFR 1004

Query: 916  HGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGL 974
            HGSLVM+ L +S       V+FGTV+G IG++  +P   Y FL  L+  L  VIK VG +
Sbjct: 1005 HGSLVMQNLGESSTPTQGCVLFGTVSGAIGLVTQIPFGFYEFLRNLEDKLTSVIKSVGKI 1064

Query: 975  NHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTM----------NVSVEE 1024
             H  WRSF  + K    + F+DGDLIESFLDLS  +M E++  +            +V++
Sbjct: 1065 EHNFWRSFKTDLKIEQCEGFIDGDLIESFLDLSHDKMAEVAMGLMMDDGSGMKKEATVDD 1124

Query: 1025 LCKRVEELTRLH 1036
            L K VE+LTR+H
Sbjct: 1125 LVKIVEDLTRIH 1136


>gi|432089478|gb|ELK23419.1| DNA damage-binding protein 1 [Myotis davidii]
          Length = 1047

 Score = 1008 bits (2606), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 501/942 (53%), Positives = 661/942 (70%), Gaps = 56/942 (5%)

Query: 3   DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
           +V +YG+IA +ELFRP GE++D LFI T +Y  C+L++    ES ++ITRA G+V DRIG
Sbjct: 54  EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 113

Query: 61  RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
           RP++ G IGIIDP+CR+IGL LYDGLFKVIP D   +  +AFNIRLEEL V+D+KFLYGC
Sbjct: 114 RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 173

Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
             PTI  +YQD +  RHVKTYEV+L++K+F +GPW Q N++  A ++I VP P  G +II
Sbjct: 174 QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232

Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
           G+E+I Y + + + AI    I+ S    + RVD +GSRYLLGD  G L +L++  E++  
Sbjct: 233 GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 292

Query: 236 ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
               +  L++ELLGETSIA  ++YLDN VV++GS  GDSQL+KLN+  + +GSYV  +E 
Sbjct: 293 GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 352

Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
           + NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W 
Sbjct: 353 FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412

Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
           LRS      D  LV+SF+ +TR+L +N E E+EETE+ GF  + QT FC +  + QL+Q+
Sbjct: 413 LRSDPTRETDDTLVLSFVGQTRVLMLNGE-EVEETELTGFVDEQQTFFCGNVAHQQLIQI 471

Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
           TS SVRLVS   + L +EWK P G +++VA+ N+SQV++A G   L  L+I    L ++ 
Sbjct: 472 TSASVRLVSQDPKALVSEWKEPQGKNISVASCNSSQVVVAVGRA-LYCLQIHPQELRQIS 530

Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
           H ++E+E++CLDI P+G++   S + A+G+WTDIS RI  LP   L+ KE LGGEIIPRS
Sbjct: 531 HTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 590

Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
           +L+  FE   YLLCALGDG L  F LN++TG L+DRKKV+LGTQP  LRTF S +TT+VF
Sbjct: 591 ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650

Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
           A SDRPTVIYSSN KL++SNVNLKEV++MCP NS  +PDSLA+A    LTIGTID+IQKL
Sbjct: 651 ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 710

Query: 653 HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
           HIR++PL E PR+IC+QE S+ F + S +                               
Sbjct: 711 HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDSSGGTTALRPSASTQALSSSVSSSKL 770

Query: 682 --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
                     S  EE E+H + ++D  TFE +  +     EY  S++SC    D N Y+ 
Sbjct: 771 FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 830

Query: 734 VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
           VGTA V PEE EP +GRI+VF   DGKLQ +AEKE KGAVYS+  FNGKLLA+IN  ++L
Sbjct: 831 VGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 890

Query: 794 YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
           Y+W       +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK  EG  EE A
Sbjct: 891 YEWT----AEKELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 946

Query: 854 RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEE 895
           RD+N NWMSAVEILDDD +LGAEN FNLF  +K+    T+ +
Sbjct: 947 RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDRSFHTERK 988



 Score = 60.5 bits (145), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 31/67 (46%), Positives = 42/67 (62%), Gaps = 10/67 (14%)

Query: 980  RSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN----------VSVEELCKRV 1029
            RSF+ E+KT  A  F+DGDLIESFLD+SR +M E+   +            + ++L K V
Sbjct: 981  RSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATADDLIKVV 1040

Query: 1030 EELTRLH 1036
            EELTR+H
Sbjct: 1041 EELTRIH 1047


>gi|16197726|emb|CAC94909.1| damaged-DNA recognition protein 1 [Mus musculus]
          Length = 994

 Score = 1008 bits (2605), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 499/935 (53%), Positives = 658/935 (70%), Gaps = 56/935 (5%)

Query: 3   DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
           +V +YG+IA +ELFRP GE++D LFI T +Y  C+L++    ES ++ITRA G+V DRIG
Sbjct: 54  EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 113

Query: 61  RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
           RP++ G IGIIDP+CR+IGL LYDGLFKVIP D   +  +AFNIRLEEL V+D+KFLYGC
Sbjct: 114 RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 173

Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
             PTI  +YQD +  RHVKTYEV+L++K+F +GPW Q N++  A ++I VP P  G +II
Sbjct: 174 QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232

Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
           G+E+I Y + + + AI    I+ S    + RVD +GSRYLLGD  G L +L++  E++  
Sbjct: 233 GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 292

Query: 236 ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
               +  L++ELLGETSIA  ++YLDN VV++GS  GDSQL+KLN+  + +GSYV  +E 
Sbjct: 293 GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 352

Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
           + NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W 
Sbjct: 353 FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412

Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
           LRS  +   D  LV+SF+ +TR+L +N E E+EETE+ GF    QT FC +  + QL+Q+
Sbjct: 413 LRSDPNRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQI 471

Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
           TS SVRLVS   + L +EWK P   +++VA+ N+SQV++A G   L YL+I    L ++ 
Sbjct: 472 TSASVRLVSQEPKALVSEWKEPRAKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 530

Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
           H ++E+E++CLDI P+G++   S + A+G+WTDIS RI  LP   L+ KE LGGEIIPRS
Sbjct: 531 HTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 590

Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
           +L+  FE   YLLCALGDG L  F LN++TG L+DRKK++LGTQP  LRTF S +TT+VF
Sbjct: 591 ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKMTLGTQPTVLRTFRSLSTTNVF 650

Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
           A SDRPTVIYSSN KL++SNVN KEV++MCP NS  +PDSLA+A    LTIGTID+IQKL
Sbjct: 651 ACSDRPTVIYSSNHKLVFSNVNFKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 710

Query: 653 HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
           HIR++PL E PR+IC+QE S+ F + S +                               
Sbjct: 711 HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGTTALRPSASTQALSSSVSSSKL 770

Query: 682 --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
                     S  EE E+H + ++D  TFE +  +     EY  S++SC    D N Y+ 
Sbjct: 771 FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 830

Query: 734 VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
           VGTA V PEE EP +GRI+VF   DGKLQ +AEKE KGAVYS+  FNGKLLA+IN  ++L
Sbjct: 831 VGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 890

Query: 794 YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
           Y+W       +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK  EG  EE A
Sbjct: 891 YEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 946

Query: 854 RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNS 888
           RD+N NWMSAVEILDDD +LGAEN FNLF  +K+S
Sbjct: 947 RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDS 981


>gi|357623954|gb|EHJ74904.1| putative DNA repair protein xp-e [Danaus plexippus]
          Length = 1128

 Score =  998 bits (2579), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 511/1090 (46%), Positives = 719/1090 (65%), Gaps = 67/1090 (6%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESS---ELITRAMGDVSD 57
            M +V +YGR+A ++LFRP  E +D +FI T RY   +L+W   ++   E++TRA G+V+D
Sbjct: 52   MKEVGLYGRVAKMKLFRPPYEQKDLVFILTARYNAMILEWRTGANGELEVVTRAHGNVAD 111

Query: 58   RIGRPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
            RIG+P++NG + +IDP  R+IGL LYDGLFK+IP D      +A ++RLEEL V D++FL
Sbjct: 112  RIGKPSENGILAVIDPQARVIGLRLYDGLFKIIPLDKDSTELKAASLRLEELNVYDLEFL 171

Query: 118  YGCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGV 177
            +GC+ PT+++++QD  + RH+KT+E+ L+DK+F++ PW Q+N++  A +LIPVP PL G 
Sbjct: 172  HGCSNPTLILIHQD-LNGRHIKTHEINLRDKEFMKIPWKQDNVETEASILIPVPSPLGGA 230

Query: 178  LIIGEETIVYCSANAFKAI--PIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE 235
            ++IG+E+IVY    ++ A+  P   +    Y RVD  G RYLLGD AG L +L++   + 
Sbjct: 231  IVIGQESIVYHDGQSYVAVAPPQIKTPINCYCRVDVRGLRYLLGDIAGRLFMLLLELSER 290

Query: 236  ----KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLE 291
                 V  LK+ELLG+  I   ++YLDN VV++GS  GDS L++L    D    YV+ +E
Sbjct: 291  DGTASVRDLKVELLGDIPIPECMTYLDNGVVFVGSRLGDSALVRLAAVRDDASQYVQPME 350

Query: 292  RYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMW 351
             + +L PIVD CVVDLERQGQ Q++TCSGA+K GSLRI+RNGIGI EQAS++L GIKGMW
Sbjct: 351  TFTSLAPIVDMCVVDLERQGQNQLITCSGAFKMGSLRIIRNGIGIQEQASIDLPGIKGMW 410

Query: 352  SLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQ 411
            +L        DT LV+SF+ +TR+L +N E E+EETEI+GF S  QT F  +  ++QL+Q
Sbjct: 411  ALTLGQGPHHDT-LVLSFVGQTRVLTLNGE-EVEETEIKGFVSDRQTFFTGNVCHDQLIQ 468

Query: 412  VTSGSVRLVSSTSRELRN--EWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILT 469
            VT   +RL+            W +P G +V+V +   ++ + A  G  +  + I  G L 
Sbjct: 469  VTDEGIRLIGRGPGGWNGVAAW-APAGRAVSVVSCGETRAV-AAAGLRIYLVAIKQGALE 526

Query: 470  EVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEII 529
             +    +  E++CLD+ P GE      +  VG+WTDISVR+  LPDL  +  E L GEII
Sbjct: 527  LISEVCMNEEVACLDLGPGGEE----ALLGVGLWTDISVRVLKLPDLRPLHTEKLSGEII 582

Query: 530  PRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTT 589
            PRS+L+C  EG+ YLLCALGDG +  F ++  +G LT++KKV+LGTQP  LR+F S +TT
Sbjct: 583  PRSLLICVLEGVCYLLCALGDGSMFYFTVDPDSGVLTNKKKVTLGTQPTVLRSFRSLSTT 642

Query: 590  HVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDI 649
            ++FA SDRPTVI+SSN KL++SNVNLKEV+HMC  N+ A+PDSLA+A +  +TIGTID+I
Sbjct: 643  NIFACSDRPTVIFSSNHKLVFSNVNLKEVAHMCSLNAVAYPDSLALATDSTVTIGTIDEI 702

Query: 650  QKLHIRSIPLGEHPRRICHQEQSRTFAICSL----------------------------- 680
            QKLHIR++PLGE PRRI +QE S+TF + ++                             
Sbjct: 703  QKLHIRTVPLGETPRRIAYQEASQTFGVITMRVDKVEWTGGCGSLVRPSASTAAASASAA 762

Query: 681  ----KNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGT 736
                K+     + E+H + +LD  TFE +  + L   E+  S++SC  +DD N YY VGT
Sbjct: 763  APPSKHAPAPLDLELHNLLILDHHTFEVLHAHQLLANEFAMSLVSCKLADDPNHYYAVGT 822

Query: 737  AYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKW 796
            A + PEE+EP +GRIL+F   +GKL  +AEKE KG  Y+L  FNGKLLA+IN  ++L++W
Sbjct: 823  AILNPEESEPKQGRILLFHWCEGKLTQVAEKEIKGGCYTLVEFNGKLLASINSTVRLFEW 882

Query: 797  MLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDY 856
                   +EL+ EC H  +I+ALY++ +GDFI+VGDLM+S+SLL YK  EG+ EE ARDY
Sbjct: 883  T----SEKELRLECSHFNNIVALYLKVKGDFILVGDLMRSMSLLQYKQMEGSFEEIARDY 938

Query: 857  NANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRH 916
            + NWM+AVEILDDD +LGAEN+FNLF  +K+S   TDEER ++  +G++H+G+ VN  R 
Sbjct: 939  SPNWMTAVEILDDDTFLGAENSFNLFVCQKDSAATTDEERQQMGYMGQFHVGDMVNVMRR 998

Query: 917  GSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNH 976
            G+LV +L D+       V+  TV+G I ++  L  E + FL +L+  L   IK VG + H
Sbjct: 999  GALVAQLADTAAPVARPVLLATVSGAICLVVQLSQELFDFLHQLEERLTHTIKSVGKIPH 1058

Query: 977  EQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV----------SVEELC 1026
              WRSFN + KT  A+ F+DGDLIESFLDLSR    E  + + +          +V++L 
Sbjct: 1059 SFWRSFNTDIKTEPAEGFIDGDLIESFLDLSRDMQQETLQGLQIDDGGGMMRDATVDDLI 1118

Query: 1027 KRVEELTRLH 1036
            K VE+LTR+H
Sbjct: 1119 KIVEDLTRIH 1128


>gi|148709424|gb|EDL41370.1| damage specific DNA binding protein 1 [Mus musculus]
          Length = 968

 Score =  993 bits (2566), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 492/922 (53%), Positives = 648/922 (70%), Gaps = 56/922 (6%)

Query: 3   DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
           +V +YG+IA +ELFRP GE++D LFI T +Y  C+L++    ES ++ITRA G+V DRIG
Sbjct: 54  EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 113

Query: 61  RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
           RP++ G IGIIDP+CR+IGL LYDGLFKVIP D   +  +AFNIRLEEL V+D+KFLYGC
Sbjct: 114 RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 173

Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
             PTI  +YQD +  RHVKTYEV+L++K+F +GPW Q N++  A ++I VP P  G +II
Sbjct: 174 QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232

Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
           G+E+I Y + + + AI    I+ S    + RVD +GSRYLLGD  G L +L++  E++  
Sbjct: 233 GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 292

Query: 236 ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
               +  L++ELLGETSIA  ++YLDN VV++GS  GDSQL+KLN+  + +GSYV  +E 
Sbjct: 293 GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 352

Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
           + NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W 
Sbjct: 353 FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412

Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
           LRS      D  LV+SF+ +TR+L +N E E+EETE+ GF    QT FC +  + QL+Q+
Sbjct: 413 LRSDPGRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQI 471

Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
           TS SVRLVS   + L +EWK P G +++VA+ N+SQV++A G   L YL+I    L ++ 
Sbjct: 472 TSASVRLVSQEPKALVSEWKEPQGKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 530

Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
           H ++E+E++CLDI P+G++   S + A+G+WTDIS RI  LP   L+ KE LGGEIIPRS
Sbjct: 531 HTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 590

Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
           +L+  FE   YLLCALGDG L  F LN++TG L+DRKKV+LGTQP  LRTF S +TT+VF
Sbjct: 591 ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650

Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
           A SDRPTVIYSSN KL++SNVNLKEV++MCP NS  +PDSLA+A    LTIGTID+IQKL
Sbjct: 651 ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 710

Query: 653 HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
           HIR++PL E PR+IC+QE S+ F + S +                               
Sbjct: 711 HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDSSGGTTALRPSASTQALSSSVSSSKL 770

Query: 682 --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
                     S  EE E+H + ++D  TFE +  +     EY  S++SC    D N Y+ 
Sbjct: 771 FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 830

Query: 734 VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
           VGTA V PEE EP +GRI+VF   DGKLQ +AEKE KGAVYS+  FNGKLLA+IN  ++L
Sbjct: 831 VGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 890

Query: 794 YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
           Y+W       +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK  EG  EE A
Sbjct: 891 YEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 946

Query: 854 RDYNANWMSAVEILDDDIYLGA 875
           RD+N NWMSAVEILDDD + G+
Sbjct: 947 RDFNPNWMSAVEILDDDNFTGS 968


>gi|194901554|ref|XP_001980317.1| GG19434 [Drosophila erecta]
 gi|190652020|gb|EDV49275.1| GG19434 [Drosophila erecta]
          Length = 1140

 Score =  968 bits (2502), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 511/1099 (46%), Positives = 710/1099 (64%), Gaps = 77/1099 (7%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDA--ESSELITRAMGDVSDRIG 60
            ++ I G IA +  FRP    +D LFI T RY   +L+     ++  ++T+A G+VSD +G
Sbjct: 54   EININGTIAVMRHFRPPDSNKDLLFILTRRYNVMILEARMVNDAITVVTKANGNVSDSVG 113

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
             P++ G I  IDP  R+IG+ LY GLF +IP D      +A N+R++EL V D++FL+GC
Sbjct: 114  VPSEGGVIAAIDPKARVIGMCLYQGLFTIIPLDKDASELKATNLRMDELNVYDVEFLHGC 173

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
              PT++V+++DN D RHVK++E+ L++K+F++  W Q+N++  A +LIPVP P+ GV++I
Sbjct: 174  MNPTVIVIHKDN-DGRHVKSHEINLREKEFMKIAWKQDNVETEATMLIPVPSPIGGVIVI 232

Query: 181  GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVI-THEKEK 236
            G E+IVY   + + A+     R S    Y RV ++G RYLLG+  G L++L + T E  K
Sbjct: 233  GRESIVYHDGSNYHAVAPLTFRQSTINCYARVSSNGLRYLLGNMDGQLYMLFLGTAETSK 292

Query: 237  ---VTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERY 293
               V  +K+E LGE SI   I+YLDN  +YIG+ +GDSQL++LN +    GSYV  +E +
Sbjct: 293  GVTVKDIKVEQLGEISIPECITYLDNGFLYIGARHGDSQLVRLNSEA-IDGSYVVPVENF 351

Query: 294  VNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL 353
             NL PI+D  VVDL+RQGQGQ++TCSG++KDGSLRI+R GIGI E A ++L GIKGMWSL
Sbjct: 352  TNLAPILDIAVVDLDRQGQGQIITCSGSFKDGSLRIIRIGIGIQEHACIDLPGIKGMWSL 411

Query: 354  RSSTDD-PFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
            +   D+ P++  LV++F+  TRIL ++ E E+EETEI GF S  QT  C +  Y+QL+QV
Sbjct: 412  KVGVDESPYENTLVLAFVGHTRILTLSGE-EVEETEIPGFASDLQTFLCSNVDYDQLIQV 470

Query: 413  TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
            TS SVRLVSS ++ L  EW+     S+ V + N +Q+++A+    L Y+ I DG L E  
Sbjct: 471  TSDSVRLVSSATKALVAEWRPTGDRSIGVVSCNTTQIVVASAC-DLFYIVIEDGTLREQS 529

Query: 473  HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
               LEYE++CLDI P+ E    S + AVG+WTDIS  I SLPDL  I  E L GEIIPRS
Sbjct: 530  RRTLEYEVACLDITPLDETQKKSDMVAVGLWTDISAVIMSLPDLETIYTEKLSGEIIPRS 589

Query: 533  VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
            +L+  FEGI YLLCALGDG +  F+++  TG+LTD+KKV+LGTQP TLRTF S +TT+VF
Sbjct: 590  ILMTTFEGIHYLLCALGDGSMYYFIMDQTTGQLTDKKKVTLGTQPTTLRTFRSLSTTNVF 649

Query: 593  AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
            A SDRPTVIYSSN KL++SNVNLKEV+HMC  N+ A+PDSLA+A +  + +GTID+IQKL
Sbjct: 650  ACSDRPTVIYSSNHKLVFSNVNLKEVNHMCSLNAQAYPDSLALANKNAVILGTIDEIQKL 709

Query: 653  HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
            HIR++PLGE PRRI +QE S+TFA+ +L+                               
Sbjct: 710  HIRTVPLGEGPRRIAYQEASQTFAVSTLRIDVHGRGGAKPLRNSASTQAQNITCSSNFLP 769

Query: 682  ----------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVY 731
                      N    +E ++H + ++D  TFE +  +   + E   S++S    DD N Y
Sbjct: 770  KPGGGNSTAANAEVGQEIDVHNLLVIDQNTFEVLHAHQFVSPETISSLMSAKLGDDPNTY 829

Query: 732  YCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKI 791
            Y V T+ V+PEE EP  GRI++F   + KL  +AE +  G  Y+L  FNGK+LA I   +
Sbjct: 830  YVVATSLVIPEEPEPKVGRIIIFHYHENKLTQVAETKVDGTCYALVEFNGKVLAGIGSFV 889

Query: 792  QLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEE 851
            +LY+W       +EL+ EC     I ALY++ +GDFI+VGDLM+SI+LL +K  EG   E
Sbjct: 890  RLYEWT----NEKELRMECNIQNMIAALYLKAKGDFILVGDLMRSITLLQHKQMEGIFVE 945

Query: 852  RARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFV 911
             ARD    WM AVEILDDD +LG+E N NLF  +K+S   TDEER  L  +  +HLG+ V
Sbjct: 946  IARDCEPKWMRAVEILDDDTFLGSETNGNLFVCQKDSAATTDEERQLLPELARFHLGDTV 1005

Query: 912  NRFRHGSLVMRLPDSDVGQIPT-----VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRK 966
            N FRHGSLVM+    +VG+  T     V++GT NG IG++  +P + Y FL  L+  L+K
Sbjct: 1006 NVFRHGSLVMQ----NVGERTTPINGCVLYGTCNGAIGIVTQIPQDFYDFLHGLEERLKK 1061

Query: 967  VIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVS----- 1021
            +IK VG + H  +R+F    K   ++ F+DGDLIESFLDL R +M +  + + ++     
Sbjct: 1062 IIKSVGKIEHTYYRNFQINTKVEPSEGFIDGDLIESFLDLGREKMRDAVQGLELTLNGER 1121

Query: 1022 ----VEELCKRVEELTRLH 1036
                VE++ K VE+LTR+H
Sbjct: 1122 KSADVEDVIKIVEDLTRMH 1140


>gi|195500686|ref|XP_002097479.1| GE26244 [Drosophila yakuba]
 gi|194183580|gb|EDW97191.1| GE26244 [Drosophila yakuba]
          Length = 1140

 Score =  965 bits (2495), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 511/1099 (46%), Positives = 712/1099 (64%), Gaps = 77/1099 (7%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDA--ESSELITRAMGDVSDRIG 60
            ++ I G IA +  FRP    +D +FI T RY   +L+     ++  ++T+A G+VSD +G
Sbjct: 54   EININGTIAVMRHFRPPDSNKDLIFILTRRYNVMILEARMVNDAITVVTKANGNVSDSVG 113

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
             P++ G +  IDP  R+IG+ LY GLF +IP D      +A N+R++EL V D++FL+GC
Sbjct: 114  IPSEGGVMAAIDPKARVIGMCLYQGLFTIIPLDKDASELKATNLRMDELTVYDVEFLHGC 173

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
              PT++V+++DN D RHVK++E+ L++K+F++  W Q+N++  A +LIPVP P+ GV++I
Sbjct: 174  LNPTVIVIHKDN-DGRHVKSHEINLREKEFMKIAWKQDNVETEATMLIPVPSPIGGVIVI 232

Query: 181  GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVI-THEKEK 236
            G E+IVY   + + A+     R S    Y RV ++G RYLLG+  G L++L + T E  K
Sbjct: 233  GRESIVYHDGSNYHAVAPLTFRQSTINCYARVSSNGLRYLLGNMDGQLYMLFLGTSETSK 292

Query: 237  ---VTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERY 293
               V  +K+E LGE SI   I+YLDN  +YIG+ +GDSQL++LN +    GSYV  +E +
Sbjct: 293  GVTVKDIKVEQLGEISIPECITYLDNGFLYIGARHGDSQLVRLNSEA-IDGSYVVPVENF 351

Query: 294  VNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL 353
             NL PI+D  VVDL+RQGQGQ++TCSG++KDGSLRI+R GIGI E A ++L GIKGMWSL
Sbjct: 352  TNLAPILDIAVVDLDRQGQGQIITCSGSFKDGSLRIIRIGIGIQEHACIDLPGIKGMWSL 411

Query: 354  RSSTDD-PFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
            +   D+ P++  LV++F+  TRIL ++ E E+EETEI GF S  QT  C +  Y+Q++QV
Sbjct: 412  KVGVDESPYENTLVLAFVGHTRILTLSGE-EVEETEIPGFASDLQTFLCSNVDYDQVIQV 470

Query: 413  TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
            TS SVRLVSS ++ L  EW+     S+ V + N +Q+++A+    + Y+ I DG L E  
Sbjct: 471  TSDSVRLVSSATKALVAEWRPTGDRSIGVVSCNTTQIVVASAC-DIFYIVIEDGSLREQS 529

Query: 473  HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
               LEYE++CLDI P+ E  + S + AVG+WTDIS  I SLPDL  I  E L GEIIPRS
Sbjct: 530  RRTLEYEVACLDITPLDETQTKSDLVAVGLWTDISAVIMSLPDLKTIYTEKLSGEIIPRS 589

Query: 533  VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
            +L+  FEGI YLLCALGDG +  F+++  TG+LTD+KKV+LGTQP TLRTF S +TT+VF
Sbjct: 590  ILMTTFEGIHYLLCALGDGSMYYFIMDQTTGQLTDKKKVTLGTQPTTLRTFRSLSTTNVF 649

Query: 593  AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
            A SDRPTVIYSSN KL++SNVNLKEV+HMC  N+ A+PDSLA+A +  + +GTID+IQKL
Sbjct: 650  ACSDRPTVIYSSNHKLVFSNVNLKEVNHMCSLNAQAYPDSLALANKNAVILGTIDEIQKL 709

Query: 653  HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
            HIR++PLGE PRRI +QE S+TFA+ +L+                               
Sbjct: 710  HIRTVPLGEGPRRIAYQEASQTFAVSTLRIDVHGRGGAKPLRNSASTQAQNITCSSNFLP 769

Query: 682  ----------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVY 731
                      N    +E ++H + ++D  TFE +  +   + E   S++S    DD N Y
Sbjct: 770  KPGGGNSTAANAEVGQEIDVHNLLVIDQNTFEVLHAHHFVSPETISSLMSAKLGDDPNTY 829

Query: 732  YCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKI 791
            Y V T+ V+PEE EP  GRI++F   + KL  +AE +  G  Y+L  FNGK+LA I   +
Sbjct: 830  YVVATSLVIPEEPEPKVGRIIIFHYHENKLTQVAETKVDGTCYALVEFNGKVLAGIGSFV 889

Query: 792  QLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEE 851
            +LY+W       +EL+ EC     I ALY++ +GDFI+VGDLM+SI+LL +K  EG   E
Sbjct: 890  RLYEWT----NEKELRMECNIQNMIAALYLKAKGDFILVGDLMRSITLLQHKQMEGIFVE 945

Query: 852  RARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFV 911
             ARD    WM AVEILDDD +LG+E N NLF  +K+S   TDEER  L  +  +HLG+ V
Sbjct: 946  IARDCEPKWMRAVEILDDDTFLGSETNGNLFVCQKDSAATTDEERQLLPELARFHLGDTV 1005

Query: 912  NRFRHGSLVMRLPDSDVGQIPT-----VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRK 966
            N FRHGSLVM+    +VG+  T     V++GT NG IG++  +P + Y FL  LQ  L+K
Sbjct: 1006 NVFRHGSLVMQ----NVGERTTPINGCVLYGTCNGAIGIVTQIPQDFYDFLHGLQERLKK 1061

Query: 967  VIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDE----ISKTMN--- 1019
            +IK VG + H  +R+F    K   ++ F+DGDLIESFLDLSR +M +    +  T+N   
Sbjct: 1062 IIKSVGKIEHTYYRNFQINNKVEPSEGFIDGDLIESFLDLSREKMRDSVQGLELTLNGER 1121

Query: 1020 --VSVEELCKRVEELTRLH 1036
                VE++ K VE+LTR+H
Sbjct: 1122 KGADVEDVIKIVEDLTRMH 1140


>gi|21357503|ref|NP_650257.1| piccolo [Drosophila melanogaster]
 gi|74872881|sp|Q9XYZ5.1|DDB1_DROME RecName: Full=DNA damage-binding protein 1; Short=D-DDB1; AltName:
            Full=Damage-specific DNA-binding protein 1; AltName:
            Full=Protein piccolo
 gi|4928452|gb|AAD33592.1|AF132145_1 damage-specific DNA binding protein DDBa p127 subunit [Drosophila
            melanogaster]
 gi|7299719|gb|AAF54901.1| piccolo [Drosophila melanogaster]
 gi|220942640|gb|ACL83863.1| DDB1-PA [synthetic construct]
          Length = 1140

 Score =  962 bits (2486), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 509/1099 (46%), Positives = 708/1099 (64%), Gaps = 77/1099 (7%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESS--ELITRAMGDVSDRIG 60
            ++ I G IA +  FRP    +D LFI T RY   +L+    +    ++T+A G+VSD +G
Sbjct: 54   EININGTIAVMRHFRPPDSNKDLLFILTRRYNVMILEARMVNDVITVVTKANGNVSDSVG 113

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
             P++ G I  IDP  R+IG+ LY GLF +IP D      +A N+R++EL V D++FL+GC
Sbjct: 114  IPSEGGVIAAIDPKARVIGMCLYQGLFTIIPMDKDASELKATNLRMDELNVYDVEFLHGC 173

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
              PT++V+++D+ D RHVK++E+ L+DK+F++  W Q+N++  A +LIPVP P+ GV++I
Sbjct: 174  LNPTVIVIHKDS-DGRHVKSHEINLRDKEFMKIAWKQDNVETEATMLIPVPSPIGGVIVI 232

Query: 181  GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVI-THEKEK 236
            G E+IVY   + + A+     R S    Y RV ++G RYLLG+  G L++L + T E  K
Sbjct: 233  GRESIVYHDGSNYHAVAPLTFRQSTINCYARVSSNGLRYLLGNMDGQLYMLFLGTAETSK 292

Query: 237  ---VTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERY 293
               V  +K+E LGE SI   I+YLDN  +YIG+ +GDSQL++LN +    GSYV  +E +
Sbjct: 293  GVTVKDIKVEQLGEISIPECITYLDNGFLYIGARHGDSQLVRLNSEA-IDGSYVVPVENF 351

Query: 294  VNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL 353
             NL PI+D  VVDL+RQGQGQ++TCSG++KDGSLRI+R GIGI E A ++L GIKGMWSL
Sbjct: 352  TNLAPILDIAVVDLDRQGQGQIITCSGSFKDGSLRIIRIGIGIQEHACIDLPGIKGMWSL 411

Query: 354  RSSTDD-PFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
            +   D+ P++  LV++F+  TRIL ++ E E+EETEI GF S  QT  C +  Y+QL+QV
Sbjct: 412  KVGVDESPYENTLVLAFVGHTRILTLSGE-EVEETEIPGFASDLQTFLCSNVDYDQLIQV 470

Query: 413  TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
            TS SVRLVSS ++ L  EW+     ++ V + N +Q+L+A+    + Y+ I DG L E  
Sbjct: 471  TSDSVRLVSSATKALVAEWRPTGDRTIGVVSCNTTQILVASAC-DIFYIVIEDGSLREQS 529

Query: 473  HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
               L YE++CLDI P+ E    S + AVG+WTDIS  I SLPDL  I  E L GEIIPRS
Sbjct: 530  RRTLAYEVACLDITPLDETQKKSDLVAVGLWTDISAVILSLPDLETIYTEKLSGEIIPRS 589

Query: 533  VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
            +L+  FEGI YLLCALGDG +  F+++  TG+LTD+KKV+LGTQP TLRTF S +TT+VF
Sbjct: 590  ILMTTFEGIHYLLCALGDGSMYYFIMDQTTGQLTDKKKVTLGTQPTTLRTFRSLSTTNVF 649

Query: 593  AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
            A SDRPTVIYSSN KL++SNVNLKEV+HMC  N+ A+PDSLA+A +  + +GTID+IQKL
Sbjct: 650  ACSDRPTVIYSSNHKLVFSNVNLKEVNHMCSLNAQAYPDSLALANKNAVILGTIDEIQKL 709

Query: 653  HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
            HIR++PLGE PRRI +QE S+TFA+ +L+                               
Sbjct: 710  HIRTVPLGEGPRRIAYQESSQTFAVSTLRIDVHGRGGAKPLRNSASTQAQNITCSSNFLP 769

Query: 682  ----------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVY 731
                      N    +E ++H + ++D  TFE +  +     E   S++S    DD N Y
Sbjct: 770  KPGGGNSTAANAEVGQEIDVHNLLVIDQNTFEVLHAHQFVAPETISSLMSAKLGDDPNTY 829

Query: 732  YCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKI 791
            Y V T+ V+PEE EP  GRI++F   + KL  +AE +  G  Y+L  FNGK+LA I   +
Sbjct: 830  YVVATSLVIPEEPEPKVGRIIIFHYHENKLTQVAETKVDGTCYALVEFNGKVLAGIGSFV 889

Query: 792  QLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEE 851
            +LY+W       +EL+ EC     I AL+++ +GDFI+VGDLM+SI+LL +K  EG   E
Sbjct: 890  RLYEWT----NEKELRMECNIQNMIAALFLKAKGDFILVGDLMRSITLLQHKQMEGIFVE 945

Query: 852  RARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFV 911
             ARD    WM AVEILDDD +LG+E N NLF  +K+S   TDEER  L  +  +HLG+ V
Sbjct: 946  IARDCEPKWMRAVEILDDDTFLGSETNGNLFVCQKDSAATTDEERQLLPELARFHLGDTV 1005

Query: 912  NRFRHGSLVMRLPDSDVGQIPT-----VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRK 966
            N FRHGSLVM+    +VG+  T     V++GT NG IG++  +P + Y FL  L+  L+K
Sbjct: 1006 NVFRHGSLVMQ----NVGERTTPINGCVLYGTCNGAIGIVTQIPQDFYDFLHGLEERLKK 1061

Query: 967  VIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVS----- 1021
            +IK VG + H  +R+F    K   ++ F+DGDLIESFLDLSR +M +  + + ++     
Sbjct: 1062 IIKSVGKIEHTYYRNFQINSKVEPSEGFIDGDLIESFLDLSRDKMRDAVQGLELTLNGER 1121

Query: 1022 ----VEELCKRVEELTRLH 1036
                VE++ K VE+LTR+H
Sbjct: 1122 KSADVEDVIKIVEDLTRMH 1140


>gi|195449948|ref|XP_002072297.1| GK22405 [Drosophila willistoni]
 gi|194168382|gb|EDW83283.1| GK22405 [Drosophila willistoni]
          Length = 1140

 Score =  961 bits (2485), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 510/1099 (46%), Positives = 712/1099 (64%), Gaps = 77/1099 (7%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWD--AESSELITRAMGDVSDRIG 60
            ++ I G IA ++ FRP    +D LFI T RY   +L+     ++  +IT+A G+VSD +G
Sbjct: 54   EININGTIAVMKHFRPPDSNKDLLFILTRRYNVMILEARMVGDTITVITKANGNVSDSVG 113

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
             P++ G I  IDP  R+IG+ LY GLF +IP +      +A N+R++EL V D++FL+GC
Sbjct: 114  IPSEGGVIAAIDPKARVIGMCLYQGLFTIIPMEKDASELKATNLRMDELMVYDVEFLHGC 173

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
              PT++V+++DN D RHVK++E+ L+DK+F++  W Q+N++  A +LIPVP P+ GV++I
Sbjct: 174  LNPTVIVIHKDN-DGRHVKSHEINLRDKEFMKIAWKQDNVETEATMLIPVPSPIGGVIVI 232

Query: 181  GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVI-THEKEK 236
            G E+IVY   + + A+     R S    Y RVD+ G RYLLG+  G L++L + T E  K
Sbjct: 233  GRESIVYHDGSNYHAVAPLTFRQSTINCYARVDSKGLRYLLGNMHGQLYMLFLGTSESSK 292

Query: 237  ---VTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERY 293
               V  +K+E LGE SI   I+YLDN  +YIG+ +GDSQL++L+ +   +GSYV  +E +
Sbjct: 293  GITVKDIKVEQLGEISIPECITYLDNGFLYIGARHGDSQLVRLSSEA-IEGSYVIPVENF 351

Query: 294  VNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL 353
             NL PI+D  VVDL+RQGQGQ++TCSG++KDGSLRI+R GIGI E A ++L GIKGMWSL
Sbjct: 352  TNLAPILDIAVVDLDRQGQGQIITCSGSFKDGSLRIIRIGIGIQEHACIDLPGIKGMWSL 411

Query: 354  RSSTDD-PFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
            +  TDD P++  LV++F+  TRIL ++ E E+EETEI GF S  QT  C +  ++QL+QV
Sbjct: 412  KVGTDDSPYENTLVLAFVGHTRILTLSGE-EVEETEIPGFASDLQTFLCANVEHDQLIQV 470

Query: 413  TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
            TS SVRLV S+S+ L  EW+     S+ V + N +Q+++A+    + Y+ I DG L E  
Sbjct: 471  TSDSVRLVKSSSKSLVAEWRPEGDRSIGVVSCNGTQIVVASAR-DIFYIVIEDGNLQEKC 529

Query: 473  HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
               L YE++CLDI P+ E  + S++ AVG+WTDIS  + SLP L+ I  E L GEIIPRS
Sbjct: 530  RKTLAYEVACLDITPLDEKQNKSELVAVGLWTDISAVMLSLPSLDTIYTEKLSGEIIPRS 589

Query: 533  VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
            +L+  FEGI YLLCALGDG +  F+++  TG+LTD+KKV+LGTQP TLRTF S  TT+VF
Sbjct: 590  ILMTTFEGIHYLLCALGDGSMYYFIMDPITGQLTDKKKVTLGTQPTTLRTFRSFATTNVF 649

Query: 593  AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
            A SDRPTVIYSSN KL++SNVNLKEV+HMC  N+ A+PDSLA+A +  + +GTID+IQKL
Sbjct: 650  ACSDRPTVIYSSNHKLVFSNVNLKEVNHMCSLNAQAYPDSLALATKNSVILGTIDEIQKL 709

Query: 653  HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
            HIR++PLGE PRRI +QE S+TFA+ +L+                               
Sbjct: 710  HIRTVPLGEGPRRIAYQEASQTFAVSTLRIDVHGRGGPKPLRSSASTRAQNITCSSNILP 769

Query: 682  ----------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVY 731
                      N    +E ++H + ++D  TFE +  +   + E   +++S    DD N Y
Sbjct: 770  KPGGGNSTAANAEVGQEIDVHNLLVIDQNTFEVLHAHQFVSPETISALMSAKLGDDPNTY 829

Query: 732  YCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKI 791
            Y V T+ V+P+E EP  GRI++F   D KL  +AE +  G  Y+L  FNGK+LA I   +
Sbjct: 830  YVVATSLVIPDEPEPKVGRIIIFHYHDNKLTQVAETKVDGTCYALVEFNGKVLAGIGSFV 889

Query: 792  QLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEE 851
            +LY+W       +EL+ EC     I ALY++ +GDFI+VGDLM+SI+LL +K  EG   E
Sbjct: 890  RLYEWT----NEKELRMECNIQNMIAALYLKAKGDFILVGDLMRSITLLQHKQMEGIFVE 945

Query: 852  RARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFV 911
             ARD    WM AVEILDDD +LG+E N NLF  +K+S   TDEER  L  +  +HLG+ V
Sbjct: 946  IARDCEPKWMRAVEILDDDTFLGSETNGNLFVCQKDSAATTDEERQLLPELARFHLGDTV 1005

Query: 912  NRFRHGSLVMRLPDSDVGQIPT-----VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRK 966
            N FRHGSLVM+    +VG+  T     V++GT NG IG++  +P + Y FL  L+  L+K
Sbjct: 1006 NVFRHGSLVMQ----NVGERTTPINGCVLYGTCNGAIGIVTQIPQDFYDFLHGLEERLKK 1061

Query: 967  VIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEI---------SKT 1017
            +IK VG + H  +R+F    K   ++ F+DGDLIESFLDLSR +M E           + 
Sbjct: 1062 IIKSVGKIEHTYYRNFQINTKVEPSEGFIDGDLIESFLDLSRDKMREAVLGLELTLYGER 1121

Query: 1018 MNVSVEELCKRVEELTRLH 1036
             +  VE++ K VE+LTR+H
Sbjct: 1122 KSADVEDVIKIVEDLTRMH 1140


>gi|195329354|ref|XP_002031376.1| GM24084 [Drosophila sechellia]
 gi|194120319|gb|EDW42362.1| GM24084 [Drosophila sechellia]
          Length = 1140

 Score =  961 bits (2484), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 510/1099 (46%), Positives = 709/1099 (64%), Gaps = 77/1099 (7%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESS--ELITRAMGDVSDRIG 60
            ++ I G IA +  FRP    +D LFI T RY   +L+    +    ++T+A G+VSD +G
Sbjct: 54   EININGTIAVMRHFRPPDSNKDLLFILTRRYNVMILEARMVNDVITVVTKANGNVSDSVG 113

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
             P++ G I  IDP  R+IG+ LY GLF +IP D      +A N+R++EL V D++FL+GC
Sbjct: 114  IPSEGGVIAAIDPKARVIGMCLYQGLFTIIPMDKDASELKATNLRMDELNVYDVEFLHGC 173

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
              PT++V+++DN D RHVK++E+ L+DK+F++  W Q+N++  A +LIPVP P+ GV++I
Sbjct: 174  LNPTVIVIHKDN-DGRHVKSHEINLRDKEFMKIAWKQDNVETEATMLIPVPSPIGGVIVI 232

Query: 181  GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVI-THEKEK 236
            G E+IVY   + + A+     R S    Y RV ++G RYLLG+  G L++L + T E  K
Sbjct: 233  GRESIVYHDGSNYHAVAPLTFRQSTINCYARVSSNGLRYLLGNMDGQLYMLFLGTAETSK 292

Query: 237  ---VTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERY 293
               V  +K+E LGE SI   I+YLDN  +YIG+ +GDSQL++LN +    GSYV  +E +
Sbjct: 293  GVTVKDIKVEQLGEISIPECITYLDNGFLYIGARHGDSQLVRLNSEA-IDGSYVVPVENF 351

Query: 294  VNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL 353
             NL PI+D  VVDL+RQGQGQ++TCSG++KDGSLRI+R GIGI E A ++L GIKGMWSL
Sbjct: 352  TNLAPILDIAVVDLDRQGQGQIITCSGSFKDGSLRIIRIGIGIQEHACIDLPGIKGMWSL 411

Query: 354  RSSTDDP-FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
            +   D+  ++  LV++F+  TRIL ++ E E+EETEI GF S  QT  C +  Y+QL+QV
Sbjct: 412  KVGVDESSYENTLVLAFVGHTRILTLSGE-EVEETEIPGFASDLQTFLCSNVDYDQLIQV 470

Query: 413  TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
            TS SVRLVSS ++ L  EW+     S+ V + N +Q+L+A+    + Y+ I DG L E  
Sbjct: 471  TSDSVRLVSSATKALVAEWRPTGDRSIGVVSCNTTQILVASAC-DIFYIVIEDGSLREQS 529

Query: 473  HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
               L YE++CLDI P+ E    S + AVG+WTDIS  I SLPDL  I  E L GEIIPRS
Sbjct: 530  RRTLAYEVACLDITPLDETQKKSDLVAVGLWTDISAVILSLPDLETIYTEKLSGEIIPRS 589

Query: 533  VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
            +L+  FEGI YLLCALGDG +  F+++  TG+LTD+KKV+LGTQP TLRTF S +TT+VF
Sbjct: 590  ILMTTFEGIHYLLCALGDGSMYYFIMDQTTGQLTDKKKVTLGTQPTTLRTFRSLSTTNVF 649

Query: 593  AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
            A SDRPTVIYSSN KL++SNVNLKEV+HMC  N+ A+PDSLA+A +  + +GTID+IQKL
Sbjct: 650  ACSDRPTVIYSSNHKLVFSNVNLKEVNHMCSLNAQAYPDSLALANKNAVILGTIDEIQKL 709

Query: 653  HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
            HIR++PLGE PRRI +QE S+TFA+ +L+                               
Sbjct: 710  HIRTVPLGEGPRRIAYQESSQTFAVSTLRIDVHGRGGAKPLRNSASTQAQNITCSSNFLP 769

Query: 682  ----------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVY 731
                      N    +E ++H + ++D  TFE +  +   + E   S++S    DD N Y
Sbjct: 770  KPGGGNSTAANAEVGQEIDVHNLLVIDQNTFEVLHAHQFVSPETISSLMSAQLGDDPNTY 829

Query: 732  YCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKI 791
            Y V T+ V+PEE EP  GRI++F   + KL  +AE +  G  Y+L  FNGK+LA I   +
Sbjct: 830  YVVATSLVIPEEPEPKVGRIIIFHYNENKLTQVAETKVDGTCYALVEFNGKVLAGIGSFV 889

Query: 792  QLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEE 851
            +LY+W       +EL+ EC     I ALY++ +GDFI+VGDLM+SI+LL +K  EG   E
Sbjct: 890  RLYEWT----NEKELRMECNIQNMIAALYLKAKGDFILVGDLMRSITLLQHKQMEGIFVE 945

Query: 852  RARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFV 911
             ARD    WM AVEILDDD +LG+E N NLF  +K+S   TDEER  L  +  +HLG+ V
Sbjct: 946  IARDCEPKWMRAVEILDDDTFLGSETNGNLFVCQKDSAATTDEERQLLPELARFHLGDTV 1005

Query: 912  NRFRHGSLVMRLPDSDVGQIPT-----VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRK 966
            N FRHGSLVM+    +VG+  T     V++GT NG IG++  +P + Y FL  L+  L+K
Sbjct: 1006 NVFRHGSLVMQ----NVGERTTPINGCVLYGTCNGAIGIVTQIPQDFYDFLHGLEERLKK 1061

Query: 967  VIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVS----- 1021
            +IK VG + H+ +R+F    +   ++ F+DGDLIESFLDLSR +M +  + + ++     
Sbjct: 1062 IIKLVGKIGHKFYRNFRIHTQVEPSQGFIDGDLIESFLDLSRDKMRDAVQGLELTLNGER 1121

Query: 1022 ----VEELCKRVEELTRLH 1036
                VE++ K VE+LTR+H
Sbjct: 1122 KSADVEDVIKIVEDLTRMH 1140


>gi|195145844|ref|XP_002013900.1| GL24391 [Drosophila persimilis]
 gi|194102843|gb|EDW24886.1| GL24391 [Drosophila persimilis]
          Length = 1140

 Score =  960 bits (2481), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 513/1099 (46%), Positives = 709/1099 (64%), Gaps = 77/1099 (7%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDA--ESSELITRAMGDVSDRIG 60
            ++ I G +A +  FRP    +D LFI T RY   +L+     +S  +IT+A G+VSD  G
Sbjct: 54   EININGTVAVMRHFRPPDSNKDLLFILTRRYNVMILEARMVNDSITVITKANGNVSDSAG 113

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
             P++ G I  IDP  R+IG+ LY GLF +IP D +    +A N+R++EL V D++FL+GC
Sbjct: 114  IPSEGGVIAAIDPKARVIGMVLYQGLFTIIPMDKEASELKATNLRMDELNVYDVEFLHGC 173

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
              PTI+V+++DN D RHVK++E+ L++K+F++  W Q+N++  A +LIPVP P+ GV++I
Sbjct: 174  LNPTIIVIHKDN-DGRHVKSHEINLREKEFMKIAWKQDNVETEATMLIPVPSPIGGVIVI 232

Query: 181  GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVI-THEKEK 236
            G E+IVY   + + A+     R S    Y RVD  G RYLLG+  G L++L + T E  K
Sbjct: 233  GRESIVYHDGSNYHAVAPLTFRQSTINCYARVDGKGLRYLLGNMDGQLYMLFLGTSETSK 292

Query: 237  ---VTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERY 293
               V  +K+E LGE SI   I+YLDN  +YIG+ +GDSQL++L+ +   +GSYV  +E +
Sbjct: 293  GVTVKDIKVEKLGEISIPECITYLDNGFLYIGARHGDSQLVRLSSE-SIEGSYVIPVENF 351

Query: 294  VNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL 353
             NL PI+D  VVDL+RQGQGQ++TCSG++KDGSLRI+R GIGI E A ++L GIKGMWSL
Sbjct: 352  TNLAPILDIAVVDLDRQGQGQIITCSGSFKDGSLRIIRIGIGIQEHACIDLPGIKGMWSL 411

Query: 354  RSSTDD-PFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
            +   D+ P++  LV++F+  TRIL ++ E E+EETEI GF S  QT  C +  ++QL+QV
Sbjct: 412  KVGIDESPYENTLVLAFVGLTRILTLSGE-EVEETEIPGFASDLQTFLCANVEHDQLIQV 470

Query: 413  TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
            TS SVRLV S ++ L  EW+     S+ V + N++Q++LA+    + Y+ I DG L E  
Sbjct: 471  TSDSVRLVKSATKALICEWRPEGDRSIGVVSCNSTQIVLASAR-DIFYIVIEDGRLVEKS 529

Query: 473  HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
               L YE++CLDI P+ E  + S + AVG+WTDIS  I SLPDL  I  E L GEIIPRS
Sbjct: 530  RKTLAYEVACLDITPLDETQNKSDLIAVGLWTDISAVILSLPDLETIYTEKLSGEIIPRS 589

Query: 533  VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
            +L+  FEGI YLLCALGDG +  F+++  TG+LTD+KKV+LGTQP TLRTF S  TT+VF
Sbjct: 590  ILMTTFEGIHYLLCALGDGSMYYFIMDQTTGQLTDKKKVTLGTQPTTLRTFRSFATTNVF 649

Query: 593  AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
            A SDRPTVIYSSN KL++SNVNLKEV+HMC  N+ A+PDSLA+A +  + +GTID+IQKL
Sbjct: 650  ACSDRPTVIYSSNHKLVFSNVNLKEVNHMCSLNAQAYPDSLALATKNSVILGTIDEIQKL 709

Query: 653  HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
            HIR++PLGE PRRI +QE S+TFA+ +L+                               
Sbjct: 710  HIRTVPLGEGPRRIAYQEASQTFAVSTLRIDVHGRGGAKPLRNSASTQAQNITCSSNILP 769

Query: 682  ----------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVY 731
                      N    +E ++H + ++D  TFE +  +     E   S++S    DD N Y
Sbjct: 770  KPGGNNSTAANAEVGQEIDVHNLLVIDQNTFEVLHAHQFVAPETISSLMSAKLGDDPNTY 829

Query: 732  YCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKI 791
            Y V T+ V+P+E EP  GRI++F   D KL  +AE +  G  Y+L  FNGK+LA I   +
Sbjct: 830  YVVATSLVIPDEPEPKVGRIIIFHYHDSKLTQVAETKVDGTCYALVEFNGKVLAGIGSFV 889

Query: 792  QLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEE 851
            +LY+W       +EL+ EC     I ALY++ +GDFI+VGDLM+SI+LL +K  EG   E
Sbjct: 890  RLYEWT----NEKELRMECNIQNMIAALYLKAKGDFILVGDLMRSITLLQHKQMEGIFVE 945

Query: 852  RARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFV 911
             ARD    WM AVEILDDD +LG+E N NLF  +K+S   TDEER  L  +  +HLG+ V
Sbjct: 946  IARDCEPKWMRAVEILDDDTFLGSETNGNLFVCQKDSAATTDEERQLLPELARFHLGDTV 1005

Query: 912  NRFRHGSLVMRLPDSDVGQIPT-----VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRK 966
            N FRHGSLVM+    +VG+  T     V++GT NG IG++  +P + Y FL  L+  L+K
Sbjct: 1006 NVFRHGSLVMQ----NVGERTTPINGCVLYGTCNGAIGIVTQIPQDFYDFLHGLEERLKK 1061

Query: 967  VIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDE----ISKTMN--- 1019
            +IK VG + H  +R+F    K   ++ F+DGDLIESFLDLSR +M +    +  T+N   
Sbjct: 1062 IIKSVGKIEHTYYRNFQINTKVEPSEGFIDGDLIESFLDLSRDKMRDAVLGLELTLNGER 1121

Query: 1020 --VSVEELCKRVEELTRLH 1036
                VE++ K VE+LTR+H
Sbjct: 1122 KGADVEDVIKIVEDLTRMH 1140


>gi|125774475|ref|XP_001358496.1| GA20574 [Drosophila pseudoobscura pseudoobscura]
 gi|54638233|gb|EAL27635.1| GA20574 [Drosophila pseudoobscura pseudoobscura]
          Length = 1140

 Score =  960 bits (2481), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 513/1099 (46%), Positives = 709/1099 (64%), Gaps = 77/1099 (7%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDA--ESSELITRAMGDVSDRIG 60
            ++ I G +A +  FRP    +D LFI T RY   +L+     +S  +IT+A G+VSD  G
Sbjct: 54   EININGTVAVMRHFRPPDSNKDLLFILTRRYNVMILEARMVNDSITVITKANGNVSDSAG 113

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
             P++ G I  IDP  R+IG+ LY GLF +IP D +    +A N+R++EL V D++FL+GC
Sbjct: 114  IPSEGGVIAAIDPKARVIGMVLYQGLFTIIPMDKEASELKATNLRMDELSVYDVEFLHGC 173

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
              PTI+V+++DN D RHVK++E+ L++K+F++  W Q+N++  A +LIPVP P+ GV++I
Sbjct: 174  LNPTIIVIHKDN-DGRHVKSHEINLREKEFMKIAWKQDNVETEATMLIPVPSPIGGVIVI 232

Query: 181  GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVI-THEKEK 236
            G E+IVY   + + A+     R S    Y RVD  G RYLLG+  G L++L + T E  K
Sbjct: 233  GRESIVYHDGSNYHAVAPLTFRQSTINCYARVDGKGLRYLLGNMDGQLYMLFLGTSETSK 292

Query: 237  ---VTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERY 293
               V  +K+E LGE SI   I+YLDN  +YIG+ +GDSQL++L+ +   +GSYV  +E +
Sbjct: 293  GVTVKDIKVEKLGEISIPECITYLDNGFLYIGARHGDSQLVRLSSE-SIEGSYVIPVENF 351

Query: 294  VNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL 353
             NL PI+D  VVDL+RQGQGQ++TCSG++KDGSLRI+R GIGI E A ++L GIKGMWSL
Sbjct: 352  TNLAPILDIAVVDLDRQGQGQIITCSGSFKDGSLRIIRIGIGIQEHACIDLPGIKGMWSL 411

Query: 354  RSSTDD-PFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
            +   D+ P++  LV++F+  TRIL ++ E E+EETEI GF S  QT  C +  ++QL+QV
Sbjct: 412  KVGIDESPYENTLVLAFVGLTRILTLSGE-EVEETEIPGFASDLQTFLCANVEHDQLIQV 470

Query: 413  TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
            TS SVRLV S ++ L  EW+     S+ V + N++Q++LA+    + Y+ I DG L E  
Sbjct: 471  TSDSVRLVKSATKALICEWRPEGDRSIGVVSCNSTQIVLASAR-DIFYIVIEDGRLVEKS 529

Query: 473  HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
               L YE++CLDI P+ E  + S + AVG+WTDIS  I SLPDL  I  E L GEIIPRS
Sbjct: 530  RKTLAYEVACLDITPLDETQNKSDLIAVGLWTDISAVILSLPDLETIYTEKLSGEIIPRS 589

Query: 533  VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
            +L+  FEGI YLLCALGDG +  F+++  TG+LTD+KKV+LGTQP TLRTF S  TT+VF
Sbjct: 590  ILMTTFEGIHYLLCALGDGSMYYFIMDQTTGQLTDKKKVTLGTQPTTLRTFRSFATTNVF 649

Query: 593  AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
            A SDRPTVIYSSN KL++SNVNLKEV+HMC  N+ A+PDSLA+A +  + +GTID+IQKL
Sbjct: 650  ACSDRPTVIYSSNHKLVFSNVNLKEVNHMCSLNAQAYPDSLALATKNSVILGTIDEIQKL 709

Query: 653  HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
            HIR++PLGE PRRI +QE S+TFA+ +L+                               
Sbjct: 710  HIRTVPLGEGPRRIAYQEASQTFAVSTLRIDVHGRGGAKPLRNSASTQAQNITCSSNILP 769

Query: 682  ----------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVY 731
                      N    +E ++H + ++D  TFE +  +     E   S++S    DD N Y
Sbjct: 770  KPGGNNSTAANAEVGQEIDVHNLLVIDQNTFEVLHAHQFVAPETISSLMSAKLGDDPNTY 829

Query: 732  YCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKI 791
            Y V T+ V+P+E EP  GRI++F   D KL  +AE +  G  Y+L  FNGK+LA I   +
Sbjct: 830  YVVATSLVIPDEPEPKVGRIIIFHYHDNKLTQVAETKVDGTCYALVEFNGKVLAGIGSFV 889

Query: 792  QLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEE 851
            +LY+W       +EL+ EC     I ALY++ +GDFI+VGDLM+SI+LL +K  EG   E
Sbjct: 890  RLYEWT----NEKELRMECNIQNMIAALYLKAKGDFILVGDLMRSITLLQHKQMEGIFVE 945

Query: 852  RARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFV 911
             ARD    WM AVEILDDD +LG+E N NLF  +K+S   TDEER  L  +  +HLG+ V
Sbjct: 946  IARDCEPKWMRAVEILDDDTFLGSETNGNLFVCQKDSAATTDEERQLLPELARFHLGDTV 1005

Query: 912  NRFRHGSLVMRLPDSDVGQIPT-----VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRK 966
            N FRHGSLVM+    +VG+  T     V++GT NG IG++  +P + Y FL  L+  L+K
Sbjct: 1006 NVFRHGSLVMQ----NVGERTTPINGCVLYGTCNGAIGIVTQIPQDFYDFLHGLEERLKK 1061

Query: 967  VIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDE----ISKTMN--- 1019
            +IK VG + H  +R+F    K   ++ F+DGDLIESFLDLSR +M +    +  T+N   
Sbjct: 1062 IIKSVGKIEHTYYRNFQINTKVEPSEGFIDGDLIESFLDLSRDKMRDAVLGLELTLNGER 1121

Query: 1020 --VSVEELCKRVEELTRLH 1036
                VE++ K VE+LTR+H
Sbjct: 1122 KGADVEDVIKIVEDLTRMH 1140


>gi|330792580|ref|XP_003284366.1| hypothetical protein DICPUDRAFT_86223 [Dictyostelium purpureum]
 gi|325085712|gb|EGC39114.1| hypothetical protein DICPUDRAFT_86223 [Dictyostelium purpureum]
          Length = 1064

 Score =  955 bits (2469), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 505/1062 (47%), Positives = 722/1062 (67%), Gaps = 37/1062 (3%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            M DV IYGRI+ L+LF   G+ QD+LFI+TE +KFC+L +D+E  E++T+A G+  D IG
Sbjct: 9    MFDVNIYGRISVLKLFSAGGK-QDYLFISTESFKFCILAYDSEKKEIVTKASGNAEDTIG 67

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFD---NKGQLKEAFNIRLEELQVLDIKFL 117
            RPT+ GQ+GIIDPD RLI LHLY+GL K+I  +   N    K A N RLEELQV+D+ FL
Sbjct: 68   RPTEAGQLGIIDPDGRLIALHLYEGLLKLINIEKGLNNPIQKTAANTRLEELQVMDMTFL 127

Query: 118  YGCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGV 177
            YGC  PTI VL++D KD +H+ TYEV+ KD++   GPWSQ+N+   + +L+ VP  L GV
Sbjct: 128  YGCKIPTIAVLFKDTKDEKHIVTYEVSQKDQELCPGPWSQSNVGVYSSMLVAVP--LGGV 185

Query: 178  LIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKV 237
            L++ +  I Y +    ++I I  +   AY RVD DGSRYL GDH G L +LV+ + +++V
Sbjct: 186  LVVADNGITYMNGRTTRSIAIPYTKFLAYDRVDKDGSRYLFGDHFGRLSVLVLLNHQQRV 245

Query: 238  TGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPD-AKGSYVEVLERYVNL 296
            T LK E LG TSI S+ISYLD+ VV+IGSS GDSQLI+LN + D A  SY+  LE + N+
Sbjct: 246  TELKFETLGRTSIPSSISYLDSGVVFIGSSSGDSQLIRLNTEKDPATDSYISHLENFTNI 305

Query: 297  GPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLR-- 354
            GPIVDFC+VD E+QGQ Q+VTCSG Y+DG+LR++RNGIGI E+A +EL+G+KG+W ++  
Sbjct: 306  GPIVDFCLVDTEKQGQAQIVTCSGTYRDGTLRVIRNGIGIAEKALIELEGVKGLWPIKEN 365

Query: 355  --SSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDA-IYNQLVQ 411
              S   +P D +L+VSFI  T++L     +E+EETE EG  S + T+ C +    N +VQ
Sbjct: 366  DPSDPLNPKDQYLIVSFIGYTKVLQFQ-GEEIEETEFEGLDSNSSTILCSNIDKENVIVQ 424

Query: 412  VTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTE- 470
            VT+ ++ L++  + +  ++WKSP G  +N+ ++N SQ+ L+ G   L Y EI +    E 
Sbjct: 425  VTNQAINLINPITFKRVDQWKSPSGSPINLVSSNQSQIALSIGKS-LYYFEINEQSRIEL 483

Query: 471  VKHAQLEYEISCLDINPIGE-NPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEII 529
            +K  +L +EISC+DI+P+     S SQI AVG+WTDI++R+F LP L  I KE LGGEII
Sbjct: 484  IKEIELPHEISCIDISPLDSFMDSRSQICAVGLWTDITLRLFKLPTLEEIHKEPLGGEII 543

Query: 530  PRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTG-ELTDRKKVSLGTQPITLRTFSSKNT 588
            PRS+L+ +FEGI Y+ C+LGDGHL  F +++    +L D++K++LGTQPI L+ F  KNT
Sbjct: 544  PRSILMISFEGIDYIFCSLGDGHLFKFKIDIANNWKLFDKRKLTLGTQPIILKKFKLKNT 603

Query: 589  THVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDD 648
             ++FA SDRPTVIYS+NKKL YS VNLKEV+++  FNS AFP S+AI+ E  L IGTID+
Sbjct: 604  INIFALSDRPTVIYSNNKKLFYSVVNLKEVTNVTSFNSDAFPGSMAISSESSLIIGTIDE 663

Query: 649  IQKLHIRSIPL-GEHPRRICHQEQSRTFAICSLK-----------NQSCAEE--SEMHFV 694
            IQKLHI++I L GE  RRI H E+   +A+ ++K           N +  +E   E+ +V
Sbjct: 664  IQKLHIKTISLNGEMARRIVHLEEYSCYAVITIKTNEDIISGNGENATTIDEVEEEVSYV 723

Query: 695  RLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVF 754
            RL DDQTFE +S++ L+ +E G S+ S  F DD   Y  VGT+  +P+    T GR+L+F
Sbjct: 724  RLFDDQTFEPLSSFRLEHYEMGWSLTSTKFDDDPCTYLAVGTSINIPDRQ--TSGRVLLF 781

Query: 755  -IVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWML-RDDGTRELQSECGH 812
             I E  KL L+ E   +  V  L+ FNG+L+AA+ +++   ++   ++   + + SE  H
Sbjct: 782  NINEAKKLVLLEEISFRSGVLYLHQFNGRLIAAVLKRLYSIRYSYSKEKNCKVISSENVH 841

Query: 813  HGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIY 872
             GH + L + +RG F++VGD+MKS+SLL  + E G++ + A++    W+ ++ +++DD +
Sbjct: 842  KGHTMILKLASRGHFMLVGDMMKSMSLL-GQSENGSLVQIAKNPQPIWIRSIAMINDDYF 900

Query: 873  LGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIP 932
            +G+E + N   V+KN++   + ER  L+ VG YH+GE +N    GSLV RLPDSD   IP
Sbjct: 901  IGSETSNNFVVVKKNNDSTNELERELLDSVGHYHIGESINSMLCGSLV-RLPDSDAPPIP 959

Query: 933  TVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAK 992
            T+++ +VNG IGVIAS+  E Y F  KLQ  L +V+ G+GG  HE WR+F+N+  TV+++
Sbjct: 960  TILYASVNGSIGVIASISKEDYEFFSKLQKGLNRVVNGIGGFTHESWRAFSNDHHTVESR 1019

Query: 993  NFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTR 1034
            NF+DGDLIE F DL    M ++ + MNV+++E  KR+E L +
Sbjct: 1020 NFIDGDLIEMFPDLKIESMAKVIQDMNVTLDETLKRIESLMQ 1061


>gi|195571247|ref|XP_002103615.1| GD18880 [Drosophila simulans]
 gi|194199542|gb|EDX13118.1| GD18880 [Drosophila simulans]
          Length = 1140

 Score =  955 bits (2469), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 510/1099 (46%), Positives = 706/1099 (64%), Gaps = 77/1099 (7%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESS--ELITRAMGDVSDRIG 60
            ++ I G IA +  FRP    +D LFI T RY   +L+    +    ++T+A G+VSD +G
Sbjct: 54   EININGTIAVMRHFRPPDSNKDLLFILTRRYNVMILEARMVNDVITVVTKANGNVSDSVG 113

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
              +  G I  ID   R+IG+ LY GLF +IP D      +A N+R++EL V D++FL+GC
Sbjct: 114  ILSAGGVIAAIDLKARVIGMCLYQGLFTIIPMDKDASELKATNLRMDELNVYDVEFLHGC 173

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
              PT++V+++DN D RHVK++E+ L+DK+F++  W Q+N++  A +LIPVP P+ GV++I
Sbjct: 174  LNPTVIVIHKDN-DGRHVKSHEINLRDKEFMKIAWKQDNVETEATMLIPVPSPIGGVIVI 232

Query: 181  GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVI-THEKEK 236
            G E+IVY   + + A+     R S    Y RV ++G RYLLG+  G L++L + T E  K
Sbjct: 233  GRESIVYHDGSNYHAVAPLTFRQSTINCYARVSSNGLRYLLGNMDGQLYMLFLGTAETSK 292

Query: 237  ---VTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERY 293
               V  +K+E LGE SI   I+YLDN  +YIG+ +GDSQL++LN +    GSYV  +E +
Sbjct: 293  GVTVKDIKVEQLGEISIPECITYLDNGFLYIGARHGDSQLVRLNSEA-IDGSYVVPVENF 351

Query: 294  VNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL 353
             NL PI+D  VVDL+RQGQGQ++TCSG++KDGSLRI+R GIGI E A ++L GIKGMWSL
Sbjct: 352  TNLAPILDIAVVDLDRQGQGQIITCSGSFKDGSLRIIRIGIGIQEHACIDLPGIKGMWSL 411

Query: 354  RSSTDD-PFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
            +   D+ P++  LV++F+  TRIL ++ E E+EETEI GF S  QT  C +  Y+QL+QV
Sbjct: 412  KVGVDESPYENTLVLAFVGHTRILTLSGE-EVEETEIPGFASDLQTFLCSNVDYDQLIQV 470

Query: 413  TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
            TS SVRLVSS ++ L  EW+     S+ V + N +Q+L+A+    + Y+ I DG L E  
Sbjct: 471  TSDSVRLVSSATKALVAEWRPTGDRSIGVVSCNTTQILVASAC-DIFYIVIEDGNLREQC 529

Query: 473  HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
               L YE++CLDI P+ E    S + AVG+WTDIS  I SLPDL  I  E L GEIIPRS
Sbjct: 530  RRTLAYEVACLDITPLDETQKKSDLVAVGLWTDISAVILSLPDLETIYTEKLSGEIIPRS 589

Query: 533  VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
            +L+  FEGI YLLCALGDG +  F+++  TG+LTD+KKV+LGTQP TLRTF S +TT+VF
Sbjct: 590  ILMTTFEGIHYLLCALGDGSMYYFIMDQTTGQLTDKKKVTLGTQPTTLRTFRSLSTTNVF 649

Query: 593  AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
            A SDRPTVIYSSN KL++SNVNLKEV+HMC  N+ A+PDSLA+A +  + +GTID+IQKL
Sbjct: 650  ACSDRPTVIYSSNHKLVFSNVNLKEVNHMCSLNAQAYPDSLALANKNAVILGTIDEIQKL 709

Query: 653  HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
            HIR++PLGE PRRI +QE S+TFA+ +L+                               
Sbjct: 710  HIRTVPLGEGPRRIAYQESSQTFAVSTLRIDVHGRGGAKPLRNSASTQAQNITCSSNFLP 769

Query: 682  ----------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVY 731
                      N    +E ++H + ++D  TFE +  +   + E   S++S    DD N Y
Sbjct: 770  KPGGGNSTAANAEVGQEIDVHNLLVIDQNTFEVLHAHQFVSPETISSLMSAKLGDDPNTY 829

Query: 732  YCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKI 791
            Y V T+ V+PEE EP  GRI++F   + KL  +AE +  G  Y+L  FNGK+LA I   +
Sbjct: 830  YVVATSLVIPEEPEPKVGRIIIFHYNENKLTQVAETKVDGTCYALVEFNGKVLAGIGSFV 889

Query: 792  QLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEE 851
            +LY+W       +EL+ EC     I ALY++ +GDFI+VGDLM+SI+LL +K  EG   E
Sbjct: 890  RLYEWT----NEKELRMECNIQNMIAALYLKAKGDFILVGDLMRSITLLQHKQMEGIFVE 945

Query: 852  RARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFV 911
             ARD    WM AVEILDDD +LG+E N NLF  +K+S   TDEER  L  +  +HLG+ V
Sbjct: 946  IARDCEPKWMRAVEILDDDTFLGSETNGNLFVCQKDSAATTDEERQLLPELARFHLGDTV 1005

Query: 912  NRFRHGSLVMRLPDSDVGQIPT-----VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRK 966
            N FRHGSLVM+    +VG+  T     V++GT NG IG++  +P + Y FL  L+  L+K
Sbjct: 1006 NVFRHGSLVMQ----NVGERTTPINGCVLYGTCNGAIGIVTQIPQDFYDFLHGLEERLKK 1061

Query: 967  VIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVS----- 1021
            +IK VG + H  +R+F    K   ++ F+DGDLIESFLDLSR +M +  + + ++     
Sbjct: 1062 IIKSVGKIEHTYYRNFQINTKVEPSEGFIDGDLIESFLDLSRDKMRDAVQGLELTLNGER 1121

Query: 1022 ----VEELCKRVEELTRLH 1036
                VE++ K VE+LTR+H
Sbjct: 1122 KSADVEDVIKIVEDLTRMH 1140


>gi|194741158|ref|XP_001953056.1| GF17579 [Drosophila ananassae]
 gi|190626115|gb|EDV41639.1| GF17579 [Drosophila ananassae]
          Length = 1140

 Score =  954 bits (2465), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 506/1100 (46%), Positives = 710/1100 (64%), Gaps = 79/1100 (7%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESS--ELITRAMGDVSDRIG 60
            ++ I G IA +  FRP    +D LFI T RY   +L+    +    ++T+A G+VSD +G
Sbjct: 54   EININGTIAVMRHFRPPDSNKDLLFILTRRYNVMILEARMVNDVITVVTKANGNVSDSVG 113

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
             P++ G I  IDP  R+IG+ LY GLF +IP D +    +A N+R++EL V D++FL+GC
Sbjct: 114  IPSEGGVIAAIDPKARVIGMCLYQGLFTIIPMDKEASELKATNLRMDELNVYDVEFLHGC 173

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
              PT++V+++DN D RHVK++E+ L++K+F++  W Q+N++  A +LI VP P+ GV++I
Sbjct: 174  LNPTVIVIHKDN-DGRHVKSHEINLREKEFMKIAWKQDNVETEATMLITVPSPIGGVIVI 232

Query: 181  GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVI-THEKEK 236
            G E+IVY   + + A+     R S    Y RVD+ G RYLLG+  G L++L + T E  K
Sbjct: 233  GRESIVYHDGSNYHAVAPLTFRQSTINCYARVDSKGFRYLLGNMDGQLYMLFLGTSETSK 292

Query: 237  ---VTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDA-KGSYVEVLER 292
               V  +K+E LGE SI   I+YLDN  +YIG+ +GDSQL++L+   DA  GSYV  +E 
Sbjct: 293  GITVKDIKVEQLGEISIPECITYLDNGFLYIGARHGDSQLVRLS--SDAIDGSYVIPVEN 350

Query: 293  YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
            + NL PI+D  VVDL+RQGQGQ++TCSG++KDGSLRI+R GIGI E A ++L GIKGMWS
Sbjct: 351  FTNLAPILDIAVVDLDRQGQGQIITCSGSFKDGSLRIIRIGIGIQEHACIDLPGIKGMWS 410

Query: 353  LRSSTDD-PFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQ 411
            L+   DD P++  LV++F+  TRIL ++ E E+EETEI GF S  QT  C +  ++Q++Q
Sbjct: 411  LKVGIDDSPYENTLVLAFVGHTRILTLSGE-EVEETEIPGFASDLQTFLCANVEHDQIIQ 469

Query: 412  VTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEV 471
            VTS SVRLV S +++L  EW+     S+ V + N++Q+++A+    + Y+ I DG L E 
Sbjct: 470  VTSDSVRLVKSATKDLVAEWRPEGDRSIGVVSCNSTQIVVASAR-DIFYIVIEDGKLVEK 528

Query: 472  KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPR 531
                L YE++CLDI P+ E+ + S + AVG+WTDIS  I SLP+L  I  E L GEIIPR
Sbjct: 529  SRKTLAYEVACLDITPLDESQNKSDLVAVGLWTDISAVILSLPELETIYTEKLSGEIIPR 588

Query: 532  SVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHV 591
            S+L+  FEGI YLLCALGDG +  F+++  TG+LTD+KKV+LGTQP TLRTF S  TT+V
Sbjct: 589  SILMTTFEGIHYLLCALGDGSMYYFIMDQTTGQLTDKKKVTLGTQPTTLRTFRSFATTNV 648

Query: 592  FAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQK 651
            FA SDRPTVIYSSN KL++SNVNLKEV+HMC  N+ A+PDSLA+A +  + +GTID+IQK
Sbjct: 649  FACSDRPTVIYSSNHKLVFSNVNLKEVNHMCSLNAQAYPDSLALATKNSVILGTIDEIQK 708

Query: 652  LHIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------ 681
            LHIR++PLGE PRRI +QE S+TFA+ +L+                              
Sbjct: 709  LHIRTVPLGEGPRRIAYQEASQTFAVSTLRIDVHGRGGTKPLRNSASTQAQNITYSSNIL 768

Query: 682  -----------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNV 730
                       N    +E ++H + ++D  TFE +  +     E   S++S    DD N 
Sbjct: 769  PKPGTGTSTATNAEVGQEIDVHNLLVIDQNTFEVLHAHQFVAPETISSLMSAKLGDDPNT 828

Query: 731  YYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQK 790
            YY V T+ V+P+E EP  GRI++F   D KL  +AE +  G  Y+L  FNGK+LA I   
Sbjct: 829  YYVVATSLVIPDEPEPKVGRIIIFHYHDNKLTQVAETKVDGTCYALVEFNGKVLAGIGSF 888

Query: 791  IQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIE 850
            ++LY+W       +EL+ EC     I AL+++ +GDFI+VGDLM+SI+LL +K  EG   
Sbjct: 889  VRLYEWT----NEKELRMECNIQNMIAALFLKAKGDFILVGDLMRSITLLQHKQMEGIFV 944

Query: 851  ERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEF 910
            E ARD    WM AVEILDDD +LG+E N NLF  +K+S   TDEER  L  +  +HLG+ 
Sbjct: 945  EIARDCEPKWMRAVEILDDDTFLGSETNGNLFVCQKDSAATTDEERQLLPELARFHLGDT 1004

Query: 911  VNRFRHGSLVMRLPDSDVGQIPT-----VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLR 965
            VN FRHGSLVM+    +VG+  T     V++GT NG IG++  +P + Y FL  L+  L+
Sbjct: 1005 VNVFRHGSLVMQ----NVGERTTPINGCVLYGTCNGAIGIVTQIPQDFYDFLHGLEERLK 1060

Query: 966  KVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVS---- 1021
            K+IK VG + H  +R+F    K   ++ F+DGDLIESFLDL R +M +  + + ++    
Sbjct: 1061 KIIKSVGKIEHTYYRNFQINTKVEPSEGFIDGDLIESFLDLGRDKMRDAVQGLEITLNGE 1120

Query: 1022 -----VEELCKRVEELTRLH 1036
                 VE++ K VE+LTR+H
Sbjct: 1121 RKSADVEDVIKIVEDLTRMH 1140


>gi|195395112|ref|XP_002056180.1| GJ10363 [Drosophila virilis]
 gi|194142889|gb|EDW59292.1| GJ10363 [Drosophila virilis]
          Length = 1140

 Score =  945 bits (2443), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 502/1099 (45%), Positives = 708/1099 (64%), Gaps = 77/1099 (7%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWD--AESSELITRAMGDVSDRIG 60
            ++ I G+I  +  FRP    +D LFI T R+   +L+     ++  ++T+A G+VSD +G
Sbjct: 54   EININGKITVMRHFRPPDSNKDLLFILTRRFNVMILEARMVGDAVTVVTKANGNVSDSVG 113

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
              ++ G I  IDP  R+IG+ LY GLF +IP D      +A N+R++EL V D++FL+GC
Sbjct: 114  ILSEGGFIAAIDPKARVIGMCLYQGLFTIIPLDKDASELKATNLRMDELTVYDVEFLHGC 173

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
              PT++V+++DN D RHVK++E+ L+DK+F++  W Q+N++  A +LIPVP  + GV++I
Sbjct: 174  QNPTVIVIHKDN-DGRHVKSHEINLRDKEFIKVAWKQDNVETEATMLIPVPSSIGGVIVI 232

Query: 181  GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVI----THE 233
            G E+IVY   + + A+     R S    Y RVD+ G RYLLG+  G L++L +    T +
Sbjct: 233  GRESIVYHDGSNYHAVAPLTFRQSTINCYARVDSKGLRYLLGNMDGQLYMLFLGTTETSK 292

Query: 234  KEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERY 293
               V  +K+E LGE SI   I+YLDN  +YIGS +GDSQL++L+ +    GSYV  +E +
Sbjct: 293  GTTVKDIKVEQLGEISIPECITYLDNGFLYIGSRHGDSQLVRLSSEA-IDGSYVIPVENF 351

Query: 294  VNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL 353
             NL PI+D  VVDL+RQGQGQ++TCSG++KDGSLRI+R GIGI E A ++L GIKGMWSL
Sbjct: 352  TNLAPILDIAVVDLDRQGQGQIITCSGSFKDGSLRIIRIGIGIQEHACIDLPGIKGMWSL 411

Query: 354  RSSTDD-PFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
            +   DD  ++  LV++F+  TRIL ++ E E+EETEI GF S  QT  C +  Y+QL+QV
Sbjct: 412  KVGIDDSAYENTLVLAFVGHTRILTLSGE-EVEETEIPGFASDLQTFLCANVDYDQLIQV 470

Query: 413  TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
            T+ SVRLV S ++ L  EWK     S+ V + N++Q++ A+    + Y+ I DG L E  
Sbjct: 471  TAESVRLVKSATKTLVGEWKPEGDRSIGVVSCNSTQIVAASAR-EIFYISIEDGSLVEKC 529

Query: 473  HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
               L YE++CLD+ P+ E  + S++ AVG+WTDIS  I  LPDL  I  E L GEIIPRS
Sbjct: 530  RKTLPYEVACLDVTPLDEKQTKSELVAVGLWTDISAVILRLPDLETIYTEKLSGEIIPRS 589

Query: 533  VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
            +L+  FE I+YLLCALGDG +  F+L+  TG+LTD+KKV+LGTQP TLRTF S +TT+VF
Sbjct: 590  ILMTTFEDINYLLCALGDGSMYYFILDRTTGQLTDKKKVTLGTQPTTLRTFRSFSTTNVF 649

Query: 593  AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
            A SDRPTVIYSSN KL++SNVNLKEV+HMC  N+ A+PDSLA+A +  + +GTID+IQKL
Sbjct: 650  ACSDRPTVIYSSNHKLVFSNVNLKEVNHMCSLNAQAYPDSLALATKNSVILGTIDEIQKL 709

Query: 653  HIRSIPLGEHPRRICHQEQSRTFAICSLK---------------------NQSCA----- 686
            HIR++PLGE PRRI +QE S+TFA+ +L+                     N SC+     
Sbjct: 710  HIRTVPLGEGPRRIAYQEASQTFAVSTLRIDVHGRGGAKPLRNSASTQAQNSSCSSNILP 769

Query: 687  ---------------EESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVY 731
                           +E ++H + ++D  TFE + ++     E   S++S    DD N Y
Sbjct: 770  KPGGGNSTAANAEVGQEIDVHNLLVIDQNTFEVLHSHQFVPPETISSLMSAKLGDDPNTY 829

Query: 732  YCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKI 791
            Y V T+ V P+E EP  GRI++F   + KL  +AE +  G  Y+L  FNGK+LA I   +
Sbjct: 830  YVVATSLVFPDEPEPKVGRIIIFHYNENKLTQVAETKVDGTCYALVEFNGKVLAGIGSFV 889

Query: 792  QLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEE 851
            +LY+W       +EL+ EC     I AL+++ +GDFI+VGDLM+SI+LL +K  EG   E
Sbjct: 890  RLYEWT----NEKELRMECNIQNMIAALFLKAKGDFILVGDLMRSITLLQHKQMEGIFVE 945

Query: 852  RARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFV 911
             ARD    WM AVEILDDD +LG E + NLF  +K+S   TDEER  L  +  +HLG+ +
Sbjct: 946  IARDCEPKWMRAVEILDDDTFLGCETHDNLFVCQKDSAATTDEERQLLPELARFHLGDTI 1005

Query: 912  NRFRHGSLVMRLPDSDVGQIPT-----VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRK 966
            N FRHGSLVM+    +VG+  T     V++GT NG IG++  +P + Y FL  L+  L+K
Sbjct: 1006 NVFRHGSLVMQ----NVGERTTPINGCVLYGTCNGAIGIVTQIPQDFYDFLHGLEERLKK 1061

Query: 967  VIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDE----ISKTMN--- 1019
            +IK VG ++H  +R++    K   ++ F+DGDLIESFLDL+R +M E    +  T+N   
Sbjct: 1062 IIKSVGKIDHTYYRNYQINTKVEPSEGFIDGDLIESFLDLNRDKMREAVSGLELTLNGER 1121

Query: 1020 --VSVEELCKRVEELTRLH 1036
                VE++ K VE+LTR+H
Sbjct: 1122 KSADVEDVIKIVEDLTRMH 1140


>gi|119594340|gb|EAW73934.1| damage-specific DNA binding protein 1, 127kDa, isoform CRA_b [Homo
            sapiens]
          Length = 923

 Score =  942 bits (2435), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 481/929 (51%), Positives = 632/929 (68%), Gaps = 64/929 (6%)

Query: 166  LLIPVPPPLCGVLIIGEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHA 222
            ++I VP P  G +IIG+E+I Y + + + AI    I+ S    + RVD +GSRYLLGD  
Sbjct: 1    MVIAVPEPFGGAIIIGQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDME 60

Query: 223  GLLHLLVITHEKE-----KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLN 277
            G L +L++  E++      +  L++ELLGETSIA  ++YLDN VV++GS  GDSQL+KLN
Sbjct: 61   GRLFMLLLEKEEQMDGTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLN 120

Query: 278  LQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGIN 337
            +  + +GSYV  +E + NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+
Sbjct: 121  VDSNEQGSYVVAMETFTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIH 180

Query: 338  EQASVELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQ 397
            E AS++L GIKG+W LRS  +   D  LV+SF+ +TR+L +N E E+EETE+ GF    Q
Sbjct: 181  EHASIDLPGIKGLWPLRSDPNRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQ 239

Query: 398  TLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGH 457
            T FC +  + QL+Q+TS SVRLVS   + L +EWK P   +++VA+ N+SQV++A G   
Sbjct: 240  TFFCGNVAHQQLIQITSASVRLVSQEPKALVSEWKEPQAKNISVASCNSSQVVVAVGRA- 298

Query: 458  LVYLEIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLN 517
            L YL+I    L ++ H ++E+E++CLDI P+G++   S + A+G+WTDIS RI  LP   
Sbjct: 299  LYYLQIHPQELRQISHTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFE 358

Query: 518  LITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQP 577
            L+ KE LGGEIIPRS+L+  FE   YLLCALGDG L  F LN++TG L+DRKKV+LGTQP
Sbjct: 359  LLHKEMLGGEIIPRSILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQP 418

Query: 578  ITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAK 637
              LRTF S +TT+VFA SDRPTVIYSSN KL++SNVNLKEV++MCP NS  +PDSLA+A 
Sbjct: 419  TVLRTFRSLSTTNVFACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALAN 478

Query: 638  EGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLK---------------- 681
               LTIGTID+IQKLHIR++PL E PR+IC+QE S+ F + S +                
Sbjct: 479  NSTLTIGTIDEIQKLHIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGTTALRPS 538

Query: 682  -----------------------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCS 718
                                     S  EE E+H + ++D  TFE +  +     EY  S
Sbjct: 539  ASTQALSSSVSSSKLFSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALS 598

Query: 719  ILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNA 778
            ++SC    D N Y+ VGTA V PEE EP +GRI+VF   DGKLQ +AEKE KGAVYS+  
Sbjct: 599  LVSCKLGKDPNTYFIVGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVE 658

Query: 779  FNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSIS 838
            FNGKLLA+IN  ++LY+W       +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ 
Sbjct: 659  FNGKLLASINSTVRLYEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVL 714

Query: 839  LLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGR 898
            LL YK  EG  EE ARD+N NWMSAVEILDDD +LGAEN FNLF  +K+S   TDEER  
Sbjct: 715  LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQH 774

Query: 899  LEVVGEYHLGEFVNRFRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFL 957
            L+ VG +HLGEFVN F HGSLVM+ L ++      +V+FGTVNG+IG++ SL    Y  L
Sbjct: 775  LQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLL 834

Query: 958  EKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKT 1017
              +Q  L KVIK VG + H  WRSF+ E+KT  A  F+DGDLIESFLD+SR +M E+   
Sbjct: 835  LDMQNRLNKVIKSVGKIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVAN 894

Query: 1018 MN----------VSVEELCKRVEELTRLH 1036
            +            + ++L K VEELTR+H
Sbjct: 895  LQYDDGSGMKREATADDLIKVVEELTRIH 923


>gi|195108657|ref|XP_001998909.1| GI23368 [Drosophila mojavensis]
 gi|193915503|gb|EDW14370.1| GI23368 [Drosophila mojavensis]
          Length = 1140

 Score =  941 bits (2432), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 501/1099 (45%), Positives = 704/1099 (64%), Gaps = 77/1099 (7%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWD--AESSELITRAMGDVSDRIG 60
            ++ I G+I  +  FRP    +D LFI T R+   +L+     ++  ++T+A G+VSD +G
Sbjct: 54   EININGKITVMRHFRPPDSNKDLLFILTRRFNVMILEARMVGDNITVVTKANGNVSDSVG 113

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
              ++ G I  IDP  R+IG+ LY GLF +IP D      +A ++R++EL V D++FL+GC
Sbjct: 114  ILSEGGFIAAIDPKARVIGMCLYQGLFTIIPLDKDASELKATSLRMDELIVYDVEFLHGC 173

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
              PT++V+++DN D RHVK +E+ L+DK+F++  W Q+N++  A +LIPVP P+ GV++I
Sbjct: 174  LNPTVIVIHKDN-DGRHVKCHEINLRDKEFMKLAWKQDNVETEATMLIPVPSPIGGVIVI 232

Query: 181  GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVI----THE 233
            G E+IVY   + + A+     R S    Y RVD+ G RYLLG+  G L++L +    T +
Sbjct: 233  GRESIVYHDGSNYHAVAPLTFRQSTINCYARVDSKGLRYLLGNMDGQLYMLFLGINETGK 292

Query: 234  KEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERY 293
               V  +K+E LGE SI   I+YLDN  +YIGS +GDSQL++L+ +P   GSYV  +E +
Sbjct: 293  VPTVKDIKVEQLGEISIPECITYLDNGFLYIGSRHGDSQLVRLSSEP-IDGSYVIPVENF 351

Query: 294  VNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL 353
             NL PI+D  VVDL+RQGQGQ++TCSG++KDGSLRI+R GIGI E A ++L GIKGMWSL
Sbjct: 352  TNLAPILDIAVVDLDRQGQGQIITCSGSFKDGSLRIIRIGIGIQEHACIDLPGIKGMWSL 411

Query: 354  RSSTDDP-FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
            +   DD  ++  LV++F+  TRIL ++ E E+EETEI GF S  Q+  C +  Y+QL+QV
Sbjct: 412  KVGIDDSVYENTLVLAFVGHTRILTLSGE-EVEETEIPGFASDLQSFLCANVEYDQLIQV 470

Query: 413  TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
            T+ SVRLV S+++ L  EWK     S+ V + N++Q++ A+    + Y+ I DG L E  
Sbjct: 471  TAESVRLVKSSTKALVGEWKPEGDRSIGVVSCNSTQIVAASAR-EIFYIAIEDGSLVEKC 529

Query: 473  HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
               L YE++CLD+ P+ E  + S++ AVG+WTDIS  I SLPDL  I  E L GEIIPRS
Sbjct: 530  RKTLAYEVACLDVTPLDEKQNKSELVAVGLWTDISAVILSLPDLETIYTEKLSGEIIPRS 589

Query: 533  VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
            +L+  FE I YLLCALGDG +  F+L+  TG LTD+KKV+LGTQP TLRTF S  TT+VF
Sbjct: 590  ILMTTFEDIHYLLCALGDGSMYYFILDRTTGFLTDKKKVTLGTQPTTLRTFRSFATTNVF 649

Query: 593  AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
            A SDRPTVIYSSN KL++SNVNLKEV+HMC  N+ A+PDSLA+A +  + +GTID+IQKL
Sbjct: 650  ACSDRPTVIYSSNHKLVFSNVNLKEVNHMCSLNAQAYPDSLALATKNSVILGTIDEIQKL 709

Query: 653  HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
            HIR++PLGE PRRI +QE S+TFA+ +L+                               
Sbjct: 710  HIRTVPLGEGPRRIAYQEASQTFAVSTLRIDVHGRGGAKPLRNSASTQAQNITCSSNMLP 769

Query: 682  ----------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVY 731
                      N    +E ++H + ++D  TFE +  +     E   +++S    DD N Y
Sbjct: 770  KPGGGNSTAANAEVGQEIDVHNLLIIDQNTFEVLHAHQFVPPETISALMSAKLGDDPNTY 829

Query: 732  YCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKI 791
            Y V T+ V P+E EP  GRI++F   + KL  +AE +  G  Y+L  FNGK+LA I   +
Sbjct: 830  YVVATSLVFPDEPEPKVGRIIIFHYHENKLTQVAETKVDGTCYALVEFNGKVLAGIGSFV 889

Query: 792  QLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEE 851
            +LY+W       +EL+ EC     I AL+++ +GDFI+VGDLM+SI+LL +K  EG   E
Sbjct: 890  RLYEWT----NEKELRMECNIQNMIAALFLKAKGDFILVGDLMRSITLLQHKQMEGIFVE 945

Query: 852  RARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFV 911
             ARD    WM AVEILDDD +LG E + NLF  +K+S   TDEER  L  +  +HLG+ +
Sbjct: 946  IARDCEPKWMRAVEILDDDTFLGCETHDNLFVCQKDSAATTDEERQLLPELARFHLGDTI 1005

Query: 912  NRFRHGSLVMRLPDSDVGQIPT-----VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRK 966
            N FRHGSLVM+    +VG+  T     V++GT NG IG++  +P + Y FL  L+  L+K
Sbjct: 1006 NVFRHGSLVMQ----NVGERTTPINGCVLYGTCNGAIGIVTQIPQDFYDFLHGLEERLKK 1061

Query: 967  VIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDE----ISKTMN--- 1019
            +IK VG ++H  +R++    K   ++ F+DGDLIESFLDLSR +M E    +  T+N   
Sbjct: 1062 IIKSVGKIDHTYYRNYQINTKVEPSEGFIDGDLIESFLDLSRDKMHEAVTGLELTLNGER 1121

Query: 1020 --VSVEELCKRVEELTRLH 1036
                VE++ K VE+LTR+H
Sbjct: 1122 KSADVEDVIKIVEDLTRMH 1140


>gi|195037449|ref|XP_001990173.1| GH18378 [Drosophila grimshawi]
 gi|193894369|gb|EDV93235.1| GH18378 [Drosophila grimshawi]
          Length = 1140

 Score =  937 bits (2422), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 499/1099 (45%), Positives = 702/1099 (63%), Gaps = 77/1099 (7%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWD--AESSELITRAMGDVSDRIG 60
            ++ I G+I  +  FRP    +D LFI T R+   +L+     ++  ++T+A G+VSD +G
Sbjct: 54   EININGKITVMRHFRPPDTKKDLLFILTRRFNVMILEARMVGDTITVVTKANGNVSDSVG 113

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
              ++ G I  IDP  R+IG+ LY GLF +IP D      +A N+R++EL V D++FL+GC
Sbjct: 114  ILSEGGFIAAIDPKARVIGMCLYQGLFTIIPLDKDASELKATNLRMDELTVYDVEFLHGC 173

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
              PT++V+++DN D RHVK++E+ L+DK+F++  W Q+N++  A +LIPVP P+CGV++I
Sbjct: 174  LNPTVIVIHRDN-DGRHVKSHEINLRDKEFMKIAWKQDNVETEATMLIPVPSPICGVIVI 232

Query: 181  GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVI-THEKEK 236
            G E+IVY   + + A+     R S    Y R+D  G RYLLG+  G L++L + T E  K
Sbjct: 233  GRESIVYHDGSNYHAVAPLTFRQSTINCYARIDEKGLRYLLGNMDGQLYMLFLGTTETSK 292

Query: 237  ---VTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERY 293
               V  +K+E LGE SI   I+YLDN  +YIGS +GDSQL++L+ +    G++V  +E +
Sbjct: 293  GITVKDIKVEQLGEISIPECITYLDNGFLYIGSRHGDSQLVRLSSEA-IDGTFVIPVENF 351

Query: 294  VNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL 353
             NL PI+D  VVDL+RQGQGQ++TCSG++KDGSLRI+R GIGI E A ++L GIKGMWSL
Sbjct: 352  TNLAPILDIAVVDLDRQGQGQIITCSGSFKDGSLRIIRIGIGIQEHACIDLPGIKGMWSL 411

Query: 354  RSSTDDP-FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
            +   D+  ++  LV++F+  TRIL ++ E E+EET+I GF S  Q+  C +  Y+QL+QV
Sbjct: 412  KVGIDESSYENTLVLAFVGHTRILTLSGE-EVEETDIPGFASDLQSFLCANVDYDQLIQV 470

Query: 413  TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
            T+ SVRLV S ++ L  EWK     S+ V + N +Q++ A+    + Y+ I DG L E  
Sbjct: 471  TAESVRLVKSATKALVGEWKPEGDRSIGVVSCNTTQIVAASAR-EIFYISIEDGSLVEKC 529

Query: 473  HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
               L YE++CLD+ P+ E  + S + AVG+WTDIS  I SLPDL  I  E L GEIIPRS
Sbjct: 530  RKILPYEVACLDVTPLDEKQNKSDLVAVGLWTDISAVILSLPDLETIYTEKLSGEIIPRS 589

Query: 533  VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
            +L+  FE I YLLCALGDG +  F+L+  TG+LT++KKV+LGTQP TLRTF S  TT+VF
Sbjct: 590  ILMTTFEDIHYLLCALGDGSMYYFILDKTTGQLTEKKKVTLGTQPTTLRTFRSFATTNVF 649

Query: 593  AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
            A SDRPTVIYSSN KL++SNVNLKEV+HMC  N+ A+PDSLA+A +  + +GTID+IQKL
Sbjct: 650  ACSDRPTVIYSSNHKLVFSNVNLKEVNHMCSLNAQAYPDSLALATKNSVILGTIDEIQKL 709

Query: 653  HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
            HIR++PLGE PRRI +QE S+TFA+ +L+                               
Sbjct: 710  HIRTVPLGEGPRRIAYQEASQTFAVSTLRIDVHGRGGAKPLRNSASTQAQNITCGSNILP 769

Query: 682  ----------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVY 731
                      N    +E ++H + ++D  TFE +  +     E   +++S    DD N Y
Sbjct: 770  KPGGGNSTAANAEVGQEIDVHNLLIIDQNTFEVLHAHQFVPPETISTLMSAKLGDDPNTY 829

Query: 732  YCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKI 791
            Y V T+ V PEE EP  GRI++F   D KL  +AE +  G  Y+L  FNGK+LA I   +
Sbjct: 830  YVVATSLVYPEEPEPKVGRIIIFHYNDNKLTQVAETKVDGTCYALVEFNGKVLAGIGSFV 889

Query: 792  QLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEE 851
            +LY+W       +EL+ EC     I AL+++ +GDFI+VGDLM+SI+LL +K  EG   E
Sbjct: 890  RLYEWT----NEKELRMECNIQNMIAALFLKAKGDFILVGDLMRSITLLQHKQMEGIFVE 945

Query: 852  RARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFV 911
             ARD    WM AVEILDDD +LG E + NLF  +K+S   TDEER  L  +  +HLG+ +
Sbjct: 946  IARDCEPKWMRAVEILDDDTFLGCETHDNLFVCQKDSAATTDEERQLLPELARFHLGDTI 1005

Query: 912  NRFRHGSLVMRLPDSDVGQIPT-----VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRK 966
            N FRHGSLVM+    +VG+  T     V++GT NG IG++  +P + Y FL  L+  L+K
Sbjct: 1006 NVFRHGSLVMQ----NVGERTTPINGCVLYGTCNGAIGIVTQIPQDFYDFLHGLEERLKK 1061

Query: 967  VIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDE----ISKTMN--- 1019
            +IK VG ++H  +R++    K   ++ F+DGDLIESFLDL+R +M E    +  TM    
Sbjct: 1062 IIKSVGKIDHTYYRNYQINTKVEPSEGFIDGDLIESFLDLNREKMREAVLGLELTMGGER 1121

Query: 1020 --VSVEELCKRVEELTRLH 1036
                VE++ K VE+LTR+H
Sbjct: 1122 KAADVEDVIKIVEDLTRMH 1140


>gi|281208174|gb|EFA82352.1| UV-damaged DNA binding protein1 [Polysphondylium pallidum PN500]
          Length = 1054

 Score =  932 bits (2410), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 487/1054 (46%), Positives = 706/1054 (66%), Gaps = 77/1054 (7%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            +LDV IYGRI+ + +    G+ QD LFI TE+YK+C+L +++ES EL+T A GD    IG
Sbjct: 51   VLDVNIYGRISDMRILTATGDKQDRLFILTEKYKYCILAFNSESRELVTIATGDAEGTIG 110

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RP + GQIGI+DP+CR+IG+HLY+GLF+V+P ++   ++E+F++R+E+LQ++D+ FL  C
Sbjct: 111  RPAEAGQIGIVDPECRMIGMHLYEGLFRVVPLEHGQPVRESFSMRIEQLQIVDMVFLKQC 170

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
            AKPT+ +L++D +DARH+ TY + +  K+ +EG  SQ++++  + +L+P+      +LI+
Sbjct: 171  AKPTLALLFKDTRDARHIVTYSIDVVTKELIEGA-SQDSVEENSTMLVPLDNG--AMLIV 227

Query: 181  GEETIVYCSANAFKAIPIRPSITK----AYGRVDADGSRYLLGDHAGLLHLLVITHEKEK 236
            GE  I Y +     + P+  SI      AY ++D D  R+LL D  G             
Sbjct: 228  GEMAITYMNLKG-NSQPVTISIDHTHIVAYEQIDRD--RFLLADDCGSF----------- 273

Query: 237  VTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAK-GSYVEVLERYVN 295
                                    VVY+GSS GDSQLI+LN   D   GSY+ V++++ N
Sbjct: 274  -----------------------GVVYVGSSSGDSQLIRLNSHIDPNTGSYISVIDQFTN 310

Query: 296  LGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRS 355
            LGPI DFCVVD+E+QGQGQ+VTCSG ++DGSLRI+RNGIGI EQAS+EL GI+G+WSL +
Sbjct: 311  LGPITDFCVVDVEKQGQGQLVTCSGTFQDGSLRIIRNGIGIAEQASIELPGIRGLWSLSN 370

Query: 356  STDDPF-DTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAI-YNQLVQVT 413
            +++       L+VSFI+ T++L  + E E+EETEI GF S   TL+C +    N  +Q+ 
Sbjct: 371  NSNPSSLHRHLIVSFINSTKVLTFSGE-EIEETEIAGFDSNATTLYCGNTTENNHFIQIA 429

Query: 414  SGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKH 473
            +  + LV S+S  +R +  +P   S+N+A+ N SQ+L++ G  +L YLEI D  L   K 
Sbjct: 430  TSGIYLVDSSSL-MRLDQYTPEKGSINLASCNGSQILISQGS-NLTYLEISDSKLIIKKE 487

Query: 474  AQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSV 533
            AQL+YEISCLDI+ + +  + S + AVG+WTDISVRI  LP+LN + KE LGGEI+PRS+
Sbjct: 488  AQLQYEISCLDISLL-DGFTSSPVCAVGLWTDISVRILQLPNLNEVCKETLGGEILPRSI 546

Query: 534  LLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFA 593
            L   FEG +YLLC+LGDGHL NF  ++    L +RKK+SLGT PI L      N+T+VFA
Sbjct: 547  LFITFEGTNYLLCSLGDGHLFNFTFDVVENLLQERKKLSLGTTPILL------NSTNVFA 600

Query: 594  ASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLH 653
            +SDRPTVIYS+NKKLLYS +N+K VSH+C FNS AF DS+AIA E  L IGTID+IQKLH
Sbjct: 601  SSDRPTVIYSNNKKLLYSAINMKVVSHVCSFNSEAFRDSIAIATESSLVIGTIDEIQKLH 660

Query: 654  IRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESE---------------MHFVRLLD 698
            IR++PLGE  RRI + E+  ++A+ +++       +                 ++V+LL+
Sbjct: 661  IRNVPLGEMARRITYVEEYHSYAVITIQRNDGNNNNNDNDNFNNNNNNGVPLTNYVKLLN 720

Query: 699  DQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVED 758
            +QTFE  S Y L +FE+G SI++C F +D  +Y  VGTA+    E++ +KGRILVF +ED
Sbjct: 721  EQTFETTSKYALKSFEFGWSIVTCRFKNDDALYVVVGTAFHNEVESQQSKGRILVFRIED 780

Query: 759  GKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILA 818
             +L L+ E      VY L  FNG+LLA IN+++Q + W +    T +L     + GH L+
Sbjct: 781  NRLILLDEVALPACVYCLLPFNGRLLAGINKRVQAFNWGV---DTNKLTKAESYSGHTLS 837

Query: 819  LYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENN 878
              + +RG F++V DLMKS++LL+ + ++GAI+E AR+    W+S +E++DD+ ++G +N+
Sbjct: 838  HSMVSRGHFVLVADLMKSMTLLV-EDQQGAIKELARNPLPIWLSRIEMIDDETFIGGDNS 896

Query: 879  FNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGT 938
            +NL  V+KN+E +++ +   L+ VG++HLGE +N+F+HGSLV   PD D  ++PT++FGT
Sbjct: 897  YNLIVVQKNAEASSEIDNELLDTVGQFHLGETINKFKHGSLVTS-PDMDSPKLPTILFGT 955

Query: 939  VNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGD 998
            V+G IGVI S+  + Y F EKLQ  L +V+ GVGGL  E WRSF+ E  T+ +KNF+DGD
Sbjct: 956  VSGAIGVIVSISKDDYEFFEKLQKGLNRVVHGVGGLPFENWRSFSTEHMTIPSKNFIDGD 1015

Query: 999  LIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            LIE+FLDL   +M E  K MN+S+E+  +R+E L
Sbjct: 1016 LIETFLDLRHDKMLEAIKDMNISIEDTYRRIESL 1049


>gi|166240328|ref|XP_637896.2| UV-damaged DNA binding protein1 [Dictyostelium discoideum AX4]
 gi|238064940|sp|B0M0P5.1|DDB1_DICDI RecName: Full=DNA damage-binding protein 1; AltName: Full=DNA repair
            protein E; AltName: Full=UV-damaged DNA-binding protein 1
 gi|165988543|gb|EAL64385.2| UV-damaged DNA binding protein1 [Dictyostelium discoideum AX4]
          Length = 1181

 Score =  929 bits (2402), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 499/1138 (43%), Positives = 719/1138 (63%), Gaps = 114/1138 (10%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            M DV IYGRI+ L+LF   G  QD+LFI+TE +KFC+L +D E  E+IT+A G+  D IG
Sbjct: 51   MFDVNIYGRISVLKLFSVAGSKQDYLFISTESFKFCILAYDYEKKEIITKASGNAEDTIG 110

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQL---------------------- 98
            RPT+ GQ+GIIDPD R++ LHLY+GL K+I  DN                          
Sbjct: 111  RPTEAGQLGIIDPDGRIVALHLYEGLLKLITLDNNNTPNKINNNNNNNNNNNNNNNNNNN 170

Query: 99   --------------------KEAFNIRLEELQVLDIKFLYGCAKPTIVVLYQDNKDARHV 138
                                K   N+RLEELQVLD+ FLYGC  PTI VL++D KD +H+
Sbjct: 171  NNINNNNFNINNNNNNSPIQKNVNNVRLEELQVLDMTFLYGCKVPTIAVLFKDTKDEKHI 230

Query: 139  KTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPI 198
             TYE++ KD + V GPWSQ+N+   + LL+PVP  L GVL++ +  I Y +    +++ +
Sbjct: 231  STYEISSKDTELVVGPWSQSNVGVYSSLLVPVP--LGGVLVVADNGITYLNGKVTRSVAV 288

Query: 199  RPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLD 258
              +   A+ RVD DGSR+L GDH G L +LV+ H+++KV  LK E LG  SI S+ISYLD
Sbjct: 289  SYTKFLAFTRVDKDGSRFLFGDHFGRLSVLVLIHQQQKVMELKFEQLGRISIPSSISYLD 348

Query: 259  NAVVYIGSSYGDSQLIKLNLQPD-AKGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVT 317
            + VVYIGSS GDSQLI+LN + D    SYV  LE + N+GP+VDFCVVD E+QGQ Q+VT
Sbjct: 349  SGVVYIGSSSGDSQLIRLNTEKDQTTDSYVTYLEAFTNIGPVVDFCVVDAEKQGQAQIVT 408

Query: 318  CSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL---------------------RSS 356
            CSG Y+DGSLRI+RNGIGI EQAS+EL+GIKG++ +                      + 
Sbjct: 409  CSGTYRDGSLRIIRNGIGIAEQASIELEGIKGIFPINNNNNNNNNNNNNNNNNNNNNSNG 468

Query: 357  TDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCH--DAIYNQLVQVTS 414
              D  D +L+ SFI  T++L+    +E+EETE EG  S   TL+C   D + N L+Q+T+
Sbjct: 469  ITDSKDRYLITSFIECTKVLSFQ-GEEIEETEFEGLESNCSTLYCGTIDKL-NLLIQITN 526

Query: 415  GSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI--GDGILTEVK 472
             S+ L+ S + +  ++W   P   +N+ + N  Q++L+     L+Y +I   +  +  VK
Sbjct: 527  VSINLIDSNTFKRVSQWNVEPSRRINLVSTNQDQIVLSIDKS-LLYFQINSSNKSIQLVK 585

Query: 473  HAQLEYEISCLDINPIGE-NPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPR 531
              +L +EISC+DI+P      + SQ+ +VG+W DI++RIF LP L  I KE LGGEI+PR
Sbjct: 586  EIELPHEISCIDISPFDSFMDTKSQLVSVGLWNDITLRIFKLPTLEEIWKEPLGGEILPR 645

Query: 532  SVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHV 591
            S+L+ +F+ I Y+ C+LGDGHL  F  +  + +L D++K++LGTQPI L+ F  KNT ++
Sbjct: 646  SILMISFDSIDYIFCSLGDGHLFKFQFDFSSFKLFDKRKLTLGTQPIILKKFKLKNTINI 705

Query: 592  FAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQK 651
            FA SDRPTVIYS NKKL YS VNLK+V+++  FNS  FP+S+AIA    LTIGTID+IQK
Sbjct: 706  FAISDRPTVIYSHNKKLFYSVVNLKDVTNVTSFNSDGFPNSMAIATTNSLTIGTIDEIQK 765

Query: 652  LHIRSIPLGEH-PRRICHQEQSRTFAICSLKNQS---------CAEESEMHFVRLLDDQT 701
            LHI++IPL E   RRI H E    +A+ ++KN           C E+ E+ ++R+ +DQT
Sbjct: 766  LHIKTIPLNEEMGRRIVHLEDHSCYAVITVKNNEGLLGGAQDLCEEDEEVSYIRIYNDQT 825

Query: 702  FEFISTYPLDTFEYGCSILSCSFS-DDSNVYYCVGTAYVLPEENEPTKGRILVFIV---- 756
            FE IS+Y LD +E G SI  C F+ DD N Y  VGT+   P +   + GR+L+F +    
Sbjct: 826  FELISSYKLDPYEMGWSITPCKFAGDDVNTYLAVGTSINTPIK---SSGRVLLFSLSSSS 882

Query: 757  -------------------EDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWM 797
                                +GKL L+ E + + +VY L +FNG+L+AA+++++   ++ 
Sbjct: 883  SSNDKDSLDNNNNNNNNSGANGKLTLLEEIKFRSSVYFLLSFNGRLIAAVHKRLFSIRYT 942

Query: 798  -LRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDY 856
              ++   + + SE  H GH + L + +RG FI+VGD+MKS+SLL+ +  +G++E+ AR+ 
Sbjct: 943  HSKEKNCKVISSESVHKGHTMILKLASRGHFILVGDMMKSMSLLV-EQSDGSLEQIARNP 1001

Query: 857  NANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRH 916
               W+ +V +++DD ++GAE + N   V+KN++   + ER  L+ VG YH+GE +N  RH
Sbjct: 1002 QPIWIRSVAMINDDYFIGAEASNNFIVVKKNNDSTNELERELLDSVGHYHIGESINSMRH 1061

Query: 917  GSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNH 976
            GSLV RLPDSD   IPT+++ +VNG IGV+AS+  E ++F  KLQ  L +V++GVGG +H
Sbjct: 1062 GSLV-RLPDSDQPIIPTILYASVNGSIGVVASISEEDFIFFSKLQKGLNQVVRGVGGFSH 1120

Query: 977  EQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTR 1034
            E WR+F+N+  T+D+KNF+DGDLIE+FLDL      +    + ++ ++  +R+E L +
Sbjct: 1121 ETWRAFSNDHHTIDSKNFIDGDLIETFLDLKYESQLKAVADLGITPDDAFRRIESLMQ 1178


>gi|1399512|gb|AAC47162.1| repE [Dictyostelium discoideum]
          Length = 1139

 Score =  929 bits (2401), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 499/1138 (43%), Positives = 719/1138 (63%), Gaps = 114/1138 (10%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            M DV IYGRI+ L+LF   G  QD+LFI+TE +KFC+L +D E  E+IT+A G+  D IG
Sbjct: 9    MFDVNIYGRISVLKLFSVAGSKQDYLFISTESFKFCILAYDYEKKEIITKASGNAEDTIG 68

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQL---------------------- 98
            RPT+ GQ+GIIDPD R++ LHLY+GL K+I  DN                          
Sbjct: 69   RPTEAGQLGIIDPDGRIVALHLYEGLLKLITLDNNNTPNKINNNNNNNNNNNNNNNNNNN 128

Query: 99   --------------------KEAFNIRLEELQVLDIKFLYGCAKPTIVVLYQDNKDARHV 138
                                K   N+RLEELQVLD+ FLYGC  PTI VL++D KD +H+
Sbjct: 129  NNINNNNFNINNNNNNSPIQKNVNNVRLEELQVLDMTFLYGCKVPTIAVLFKDTKDEKHI 188

Query: 139  KTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPI 198
             TYE++ KD + V GPWSQ+N+   + LL+PVP  L GVL++ +  I Y +    +++ +
Sbjct: 189  STYEISSKDTELVVGPWSQSNVGVYSSLLVPVP--LGGVLVVADNGITYLNGKVTRSVAV 246

Query: 199  RPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLD 258
              +   A+ RVD DGSR+L GDH G L +LV+ H+++KV  LK E LG  SI S+ISYLD
Sbjct: 247  SYTKFLAFTRVDKDGSRFLFGDHFGRLSVLVLIHQQQKVMELKFEQLGRISIPSSISYLD 306

Query: 259  NAVVYIGSSYGDSQLIKLNLQPD-AKGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVT 317
            + VVYIGSS GDSQLI+LN + D    SYV  LE + N+GP+VDFCVVD E+QGQ Q+VT
Sbjct: 307  SGVVYIGSSSGDSQLIRLNTEKDQTTDSYVTYLEAFTNIGPVVDFCVVDAEKQGQAQIVT 366

Query: 318  CSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL---------------------RSS 356
            CSG Y+DGSLRI+RNGIGI EQAS+EL+GIKG++ +                      + 
Sbjct: 367  CSGTYRDGSLRIIRNGIGIAEQASIELEGIKGIFPINNNNNNNNNNNNNNNNNNNNNSNG 426

Query: 357  TDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCH--DAIYNQLVQVTS 414
              D  D +L+ SFI  T++L+    +E+EETE EG  S   TL+C   D + N L+Q+T+
Sbjct: 427  ITDSKDRYLITSFIECTKVLSFQ-GEEIEETEFEGLESNCSTLYCGTIDKL-NLLIQITN 484

Query: 415  GSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI--GDGILTEVK 472
             S+ L+ S + +  ++W   P   +N+ + N  Q++L+     L+Y +I   +  +  VK
Sbjct: 485  VSINLIDSNTFKRVSQWNVEPSRRINLVSTNQDQIVLSIDKS-LLYFQINSSNKSIQLVK 543

Query: 473  HAQLEYEISCLDINPIGE-NPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPR 531
              +L +EISC+DI+P      + SQ+ +VG+W DI++RIF LP L  I KE LGGEI+PR
Sbjct: 544  EIELPHEISCIDISPFDSFMDTKSQLVSVGLWNDITLRIFKLPTLEEIWKEPLGGEILPR 603

Query: 532  SVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHV 591
            S+L+ +F+ I Y+ C+LGDGHL  F  +  + +L D++K++LGTQPI L+ F  KNT ++
Sbjct: 604  SILMISFDSIDYIFCSLGDGHLFKFQFDFSSFKLFDKRKLTLGTQPIILKKFKLKNTINI 663

Query: 592  FAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQK 651
            FA SDRPTVIYS NKKL YS VNLK+V+++  FNS  FP+S+AIA    LTIGTID+IQK
Sbjct: 664  FAISDRPTVIYSHNKKLFYSVVNLKDVTNVTSFNSDGFPNSMAIATTNSLTIGTIDEIQK 723

Query: 652  LHIRSIPLGEH-PRRICHQEQSRTFAICSLKNQS---------CAEESEMHFVRLLDDQT 701
            LHI++IPL E   RRI H E    +A+ ++KN           C E+ E+ ++R+ +DQT
Sbjct: 724  LHIKTIPLNEEMGRRIVHLEDHSCYAVITVKNNEGLLGGAQDLCEEDEEVSYIRIYNDQT 783

Query: 702  FEFISTYPLDTFEYGCSILSCSFS-DDSNVYYCVGTAYVLPEENEPTKGRILVFIV---- 756
            FE IS+Y LD +E G SI  C F+ DD N Y  VGT+   P +   + GR+L+F +    
Sbjct: 784  FELISSYKLDPYEMGWSITPCKFAGDDVNTYLAVGTSINTPIK---SSGRVLLFSLSSSS 840

Query: 757  -------------------EDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWM 797
                                +GKL L+ E + + +VY L +FNG+L+AA+++++   ++ 
Sbjct: 841  SSNDKDSLDNNNNNNNNSGANGKLTLLEEIKFRSSVYFLLSFNGRLIAAVHKRLFSIRYT 900

Query: 798  -LRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDY 856
              ++   + + SE  H GH + L + +RG FI+VGD+MKS+SLL+ +  +G++E+ AR+ 
Sbjct: 901  HSKEKNCKVISSESVHKGHTMILKLASRGHFILVGDMMKSMSLLV-EQSDGSLEQIARNP 959

Query: 857  NANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRH 916
               W+ +V +++DD ++GAE + N   V+KN++   + ER  L+ VG YH+GE +N  RH
Sbjct: 960  QPIWIRSVAMINDDYFIGAEASNNFIVVKKNNDSTNELERELLDSVGHYHIGESINSMRH 1019

Query: 917  GSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNH 976
            GSLV RLPDSD   IPT+++ +VNG IGV+AS+  E ++F  KLQ  L +V++GVGG +H
Sbjct: 1020 GSLV-RLPDSDQPIIPTILYASVNGSIGVVASISEEDFIFFSKLQKGLNQVVRGVGGFSH 1078

Query: 977  EQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTR 1034
            E WR+F+N+  T+D+KNF+DGDLIE+FLDL      +    + ++ ++  +R+E L +
Sbjct: 1079 ETWRAFSNDHHTIDSKNFIDGDLIETFLDLKYESQLKAVADLGITPDDAFRRIESLMQ 1136


>gi|194389106|dbj|BAG61570.1| unnamed protein product [Homo sapiens]
          Length = 1009

 Score =  928 bits (2399), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 475/916 (51%), Positives = 623/916 (68%), Gaps = 64/916 (6%)

Query: 179  IIGEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE 235
            IIG+E+I Y + + + AI    I+ S    + RVD +GSRYLLGD  G L +L++  E++
Sbjct: 100  IIGQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQ 159

Query: 236  -----KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVL 290
                  +  L++ELLGETSIA  ++YLDN VV++GS  GDSQL+KLN+  + +GSYV  +
Sbjct: 160  MDGTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAM 219

Query: 291  ERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGM 350
            E + NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+
Sbjct: 220  ETFTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGL 279

Query: 351  WSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLV 410
            W LRS  +   D  LV+SF+ +TR+L +N E E+EETE+ GF    QT FC +  + QL+
Sbjct: 280  WPLRSDPNRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLI 338

Query: 411  QVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTE 470
            Q+TS SVRLVS   + L +EWK P   +++VA+ N+SQV++A G   L YL+I    L +
Sbjct: 339  QITSASVRLVSQEPKALVSEWKEPQAKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQ 397

Query: 471  VKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIP 530
            + H ++E+E++CLDI P+G++   S + A+G+WTDIS RI  LP   L+ KE LGGEIIP
Sbjct: 398  ISHTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIP 457

Query: 531  RSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTH 590
            RS+L+  FE   YLLCALGDG L  F LN++TG L+DRKKV+LGTQP  LRTF S +TT+
Sbjct: 458  RSILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTN 517

Query: 591  VFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQ 650
            VFA SDRPTVIYSSN KL++SNVNLKEV++MCP NS  +PDSLA+A    LTIGTID+IQ
Sbjct: 518  VFACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQ 577

Query: 651  KLHIRSIPLGEHPRRICHQEQSRTFAICSLK----------------------------- 681
            KLHIR++PL E PR+IC+QE S+ F + S +                             
Sbjct: 578  KLHIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQGTSGGTTALRPSASTQALSSSVSSS 637

Query: 682  ----------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVY 731
                        S  EE E+H + ++D  TFE +  +     EY  S++SC    D N Y
Sbjct: 638  KLFSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTY 697

Query: 732  YCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKI 791
            + VGTA V PEE EP +GRI+VF   DGKLQ +AEKE KGAVYS+  FNGKLLA+IN  +
Sbjct: 698  FIVGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTV 757

Query: 792  QLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEE 851
            +LY+W       +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK  EG  EE
Sbjct: 758  RLYEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEE 813

Query: 852  RARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFV 911
             ARD+N NWMSAVEILDDD +LGAEN FN F  +K+S   TDEER  L+ VG +HLGEFV
Sbjct: 814  IARDFNPNWMSAVEILDDDNFLGAENAFNSFVCQKDSAATTDEERQHLQEVGLFHLGEFV 873

Query: 912  NRFRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKG 970
            N F HGSLVM+ L ++      +V+FGTVNG+IG++ SL    Y  L  +Q  L KVIK 
Sbjct: 874  NVFCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKS 933

Query: 971  VGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN----------V 1020
            VG + H  WRSF+ E+KT  A  F+DGDLIESFLD+SR +M E+   +            
Sbjct: 934  VGKIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREA 993

Query: 1021 SVEELCKRVEELTRLH 1036
            + ++L K VEELTR+H
Sbjct: 994  TADDLIKVVEELTRIH 1009



 Score = 51.2 bits (121), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 27/67 (40%), Positives = 37/67 (55%), Gaps = 7/67 (10%)

Query: 3   DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW-------DAESSELITRAMGDV 55
           +V +YG+IA +ELFRP GE++  LFI T +Y  C+L++       D    E IT   GD 
Sbjct: 54  EVGMYGKIAVMELFRPKGESKGLLFILTAKYNACILEYKQSGESIDIIGQESITYHNGDK 113

Query: 56  SDRIGRP 62
              I  P
Sbjct: 114 YLAIAPP 120


>gi|412992547|emb|CCO18527.1| predicted protein [Bathycoccus prasinos]
          Length = 1275

 Score =  922 bits (2382), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 514/1203 (42%), Positives = 718/1203 (59%), Gaps = 172/1203 (14%)

Query: 1    MLDVPIYGRIATLELF--RPHGEAQDF---LFIATE-RYKFCVLQWDAESSELITRAMGD 54
            +L+  +YGRIA L+L   R +G+ ++    L + TE RY   VL +DA ++ L T + GD
Sbjct: 75   VLETTVYGRIAMLKLCEGRKNGDRENIPGRLVVVTEDRYSLAVLSYDAATNTLRTVSSGD 134

Query: 55   VSDRIGRPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDI 114
            VSD  GRP ++G +G ID + R+I LH+YDGL K IPFD  G L +AFN  L EL+VLD 
Sbjct: 135  VSDLTGRPVEHGMLGDIDRNARVIALHIYDGLIKFIPFDENGNLLQAFNANLNELKVLDF 194

Query: 115  KFL----------------YG--------------------------------------C 120
             FL                YG                                       
Sbjct: 195  VFLPSTSSSTSGIAIDASEYGNNNNNNNNNNASDMNITLSNVSSLTRSINRENAGGGGLS 254

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
            + P I VL++D KD RHV TY V L+D+D   GP+ Q  ++ G+  L+ +  P+ G +++
Sbjct: 255  SMPVIAVLHKDVKDERHVSTYRVNLRDRDLEPGPFEQREVEIGSKKLLALSSPIGGCVVV 314

Query: 181  GEETIVY-----------------------------CSANAFKAIPIR---PSITKAYGR 208
            GEETI Y                              S NA   +  R    S T A  R
Sbjct: 315  GEETIAYLNEPAGIMDDHENNNNKNNNTATKNATSTLSNNANSNVMFRGENKSSTSALVR 374

Query: 209  V-----------------DADGSRYL-----LGDHAGLLHLLVI------THEKEKVTGL 240
                              D + S  L     + D  G L+LL++      T     V+ L
Sbjct: 375  AITTPDLLMFSSACLIPSDENVSNQLVDRFLIADEDGDLYLLMLSKGTTSTMSTSIVSKL 434

Query: 241  KIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQ--------PDAKGSYVEVLER 292
             IE LG+TS AS +SYLDN+VV+IGS+YGDSQ+IKL+ +         +   SYVEVLE 
Sbjct: 435  TIERLGKTSSASAMSYLDNSVVFIGSAYGDSQIIKLHTEQVNTMMSASNELPSYVEVLEE 494

Query: 293  YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
            + NLGPIVDFCV+DLER GQGQ+VTCSG    GSLR++RNGIGI EQA + L GIKG++S
Sbjct: 495  FTNLGPIVDFCVMDLERHGQGQLVTCSGVGTSGSLRVIRNGIGIREQAQISLSGIKGLYS 554

Query: 353  LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
             +     P D++L+VSFI+ETRILA+N  DELEE    GF + +QT+ C +   N + QV
Sbjct: 555  CKRDETMPLDSYLIVSFIAETRILAINDNDELEEAVFPGFDASSQTIECANISGNVICQV 614

Query: 413  TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
            TS  V +  ++S EL   W       +  ++A+ + + +AT GG+L YL    G ++E  
Sbjct: 615  TSKGVYVCDASSGELVASWVPSDESPITASSASENTIAVATVGGNLHYLSFQGGKISETG 674

Query: 473  HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
                + EISC+D    G+      + AVG+W+   + +     + +   E+L  +++PRS
Sbjct: 675  KMTFDAEISCIDKTQCGDR----NVCAVGLWSTKVLLVELGSGMKVCHTENLSLDVVPRS 730

Query: 533  VLLCAFEGISYLLCALGDGHLLNFLLNMKTG----ELTDRKKVSLGTQPITLRTFSSKNT 588
             L C FE   YLL  LGDGHL+  +++  +      L+DRK VSLGTQP+TL+ F S+++
Sbjct: 731  TLFCFFEDTIYLLTGLGDGHLITNVVDSSSSAGGFALSDRKSVSLGTQPVTLKLFKSQHS 790

Query: 589  THVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDD 648
             HVFA SDRPT+IYS++KKL+YSNVNL EV H+ PFN  AFPD+LA+A    LTIG +DD
Sbjct: 791  MHVFAGSDRPTIIYSNSKKLVYSNVNLNEVLHVAPFNCDAFPDALALASGEHLTIGAVDD 850

Query: 649  IQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTY 708
            IQKLHIR++PL E PRRI HQ +++T A+ ++K      + E  FVRL D++TFE ++ Y
Sbjct: 851  IQKLHIRTVPLREQPRRIAHQPETKTLAVLTMKESDVPGQEEEFFVRLFDNKTFETLAKY 910

Query: 709  PLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLP-EENEPTKGRILVFIVE---------- 757
            PL+  E   SI+SCSF  D ++Y+ VGTA+  P  E E ++GRILVF V           
Sbjct: 911  PLEPNENDASIISCSFDGDDDIYFVVGTAFADPHSEPESSRGRILVFKVSNTSSSGGGNA 970

Query: 758  --------DGK-----------LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWML 798
                    DG+           L L+ EKET+GAVY+LNAF GKLLA IN  ++L+ W +
Sbjct: 971  VVNGNDHGDGRASASSSVLQKSLTLVCEKETRGAVYNLNAFCGKLLAGINSLVKLFNWGV 1030

Query: 799  RDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNA 858
              +  REL  EC H GHI+AL V+T+ + IVVGDLMKSI+LL Y+ E G IEE A D+++
Sbjct: 1031 SKENKRELVHECSHMGHIIALKVETKDNLIVVGDLMKSITLLQYQRESGRIEEVAHDFSS 1090

Query: 859  NWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGS 918
            NWM+AVEILDD+ YLGAE+++NLFTV++N++  T+++RG LE+ G +HLG+ VNRFR GS
Sbjct: 1091 NWMTAVEILDDNTYLGAESSYNLFTVQRNADADTEDKRGTLELCGAFHLGDSVNRFRRGS 1150

Query: 919  LVMRLPD-----SDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGG 973
            LVMR+PD     S + +I T +FGT++G +GV+A+LP   ++ L K+Q  ++KV+ GVG 
Sbjct: 1151 LVMRMPDLSDDTSSLSEISTWLFGTISGGLGVVATLPKRDFMLLNKVQEAMQKVVTGVGN 1210

Query: 974  LNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVS-VEELCKRVEEL 1032
             +H  +RSF+N +++V+ +NF+DGDL+E FLDLS+     +S+   VS  E+L K++EE+
Sbjct: 1211 FSHSDFRSFHNVQRSVEMRNFIDGDLVEIFLDLSKEDQVAVSELSGVSNSEDLVKKIEEI 1270

Query: 1033 TRL 1035
            +RL
Sbjct: 1271 SRL 1273


>gi|119594343|gb|EAW73937.1| damage-specific DNA binding protein 1, 127kDa, isoform CRA_e [Homo
            sapiens]
          Length = 896

 Score =  917 bits (2371), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 466/886 (52%), Positives = 608/886 (68%), Gaps = 61/886 (6%)

Query: 206  YGRVDADGSRYLLGDHAGLLHLLVITHEKE-----KVTGLKIELLGETSIASTISYLDNA 260
            + RVD +GSRYLLGD  G L +L++  E++      +  L++ELLGETSIA  ++YLDN 
Sbjct: 17   HNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMDGTVTLKDLRVELLGETSIAECLTYLDNG 76

Query: 261  VVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSG 320
            VV++GS  GDSQL+KLN+  + +GSYV  +E + NLGPIVD CVVDLERQGQGQ+VTCSG
Sbjct: 77   VVFVGSRLGDSQLVKLNVDSNEQGSYVVAMETFTNLGPIVDMCVVDLERQGQGQLVTCSG 136

Query: 321  AYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNL 380
            A+K+GSLRI+RNGIGI+E AS++L GIKG+W LRS  +   D  LV+SF+ +TR+L +N 
Sbjct: 137  AFKEGSLRIIRNGIGIHEHASIDLPGIKGLWPLRSDPNRETDDTLVLSFVGQTRVLMLNG 196

Query: 381  EDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVN 440
            E E+EETE+ GF    QT FC +  + QL+Q+TS SVRLVS   + L +EWK P   +++
Sbjct: 197  E-EVEETELMGFVDDQQTFFCGNVAHQQLIQITSASVRLVSQEPKALVSEWKEPQAKNIS 255

Query: 441  VATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAV 500
            VA+ N+SQV++A G   L YL+I    L ++ H ++E+E++CLDI P+G++   S + A+
Sbjct: 256  VASCNSSQVVVAVGRA-LYYLQIHPQELRQISHTEMEHEVACLDITPLGDSNGLSPLCAI 314

Query: 501  GMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNM 560
            G+WTDIS RI  LP   L+ KE LGGEIIPRS+L+  FE   YLLCALGDG L  F LN+
Sbjct: 315  GLWTDISARILKLPSFELLHKEMLGGEIIPRSILMTTFESSHYLLCALGDGALFYFGLNI 374

Query: 561  KTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSH 620
            +TG L+DRKKV+LGTQP  LRTF S +TT+VFA SDRPTVIYSSN KL++SNVNLKEV++
Sbjct: 375  ETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVFACSDRPTVIYSSNHKLVFSNVNLKEVNY 434

Query: 621  MCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSL 680
            MCP NS  +PDSLA+A    LTIGTID+IQKLHIR++PL E PR+IC+QE S+ F + S 
Sbjct: 435  MCPLNSDGYPDSLALANNSTLTIGTIDEIQKLHIRTVPLYESPRKICYQEVSQCFGVLSS 494

Query: 681  K---------------------------------------NQSCAEESEMHFVRLLDDQT 701
            +                                         S  EE E+H + ++D  T
Sbjct: 495  RIEVQDTSGGTTALRPSASTQALSSSVSSSKLFSSSTAPHETSFGEEVEVHNLLIIDQHT 554

Query: 702  FEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKL 761
            FE +  +     EY  S++SC    D N Y+ VGTA V PEE EP +GRI+VF   DGKL
Sbjct: 555  FEVLHAHQFLQNEYALSLVSCKLGKDPNTYFIVGTAMVYPEEAEPKQGRIVVFQYSDGKL 614

Query: 762  QLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYV 821
            Q +AEKE KGAVYS+  FNGKLLA+IN  ++LY+W       +EL++EC H+ +I+ALY+
Sbjct: 615  QTVAEKEVKGAVYSMVEFNGKLLASINSTVRLYEWTTE----KELRTECNHYNNIMALYL 670

Query: 822  QTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNL 881
            +T+GDFI+VGDLM+S+ LL YK  EG  EE ARD+N NWMSAVEILDDD +LGAEN FNL
Sbjct: 671  KTKGDFILVGDLMRSVLLLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLGAENAFNL 730

Query: 882  FTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMR-LPDSDVGQIPTVIFGTVN 940
            F  +K+S   TDEER  L+ VG +HLGEFVN F HGSLVM+ L ++      +V+FGTVN
Sbjct: 731  FVCQKDSAATTDEERQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGSVLFGTVN 790

Query: 941  GVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLI 1000
            G+IG++ SL    Y  L  +Q  L KVIK VG + H  WRSF+ E+KT  A  F+DGDLI
Sbjct: 791  GMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERKTEPATGFIDGDLI 850

Query: 1001 ESFLDLSRTRMDEISKTMN----------VSVEELCKRVEELTRLH 1036
            ESFLD+SR +M E+   +            + ++L K VEELTR+H
Sbjct: 851  ESFLDISRPKMQEVVANLQYDDGSGMKREATADDLIKVVEELTRIH 896


>gi|322787057|gb|EFZ13281.1| hypothetical protein SINV_13198 [Solenopsis invicta]
          Length = 986

 Score =  914 bits (2361), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 462/943 (48%), Positives = 639/943 (67%), Gaps = 59/943 (6%)

Query: 3   DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
           +V IYG+IA ++ FRP  E +D LF+ T RY   +L+   + E  E+IT+A G+V+DRIG
Sbjct: 54  EVGIYGKIAIIKFFRPPHEKKDLLFLLTTRYNAMILECIGEGEDIEIITKAHGNVADRIG 113

Query: 61  RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
           + ++ G   +IDP  R+IGL LYDGLFK+IP D      +A +IR++E QV D+ FL+GC
Sbjct: 114 KASETGIKAVIDPKARVIGLRLYDGLFKIIPLDKDNPELKASSIRMDEQQVQDVNFLHGC 173

Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
           A PT+++++QD  + RHVKT+E+ L+DK+F + PW Q+N++  A ++IPVP P+CG +II
Sbjct: 174 ANPTLILIHQD-INGRHVKTHEINLRDKEFSKIPWRQDNVEREAMMVIPVPSPMCGAIII 232

Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
           G+E+I+Y   N++ A+    I+ S    Y +VD  G RYLLGD AG L +L +  EK+  
Sbjct: 233 GQESILYHDGNSYVAVVPPIIKQSTITCYAKVDNQGLRYLLGDMAGHLFMLFLEQEKKPD 292

Query: 236 ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
               V  LK+ELLGE SI   I+YLDN V+Y+GS  GDSQLIKL  + D  GSY   +E 
Sbjct: 293 GTLSVKDLKVELLGEISIPECITYLDNGVIYVGSRLGDSQLIKLITKADENGSYCVPMET 352

Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
           + NL PIVD  VVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI E AS++L GIKGMW+
Sbjct: 353 FTNLAPIVDMAVVDLERQGQGQMVTCSGAFKEGSLRIIRNGIGIQEHASIDLPGIKGMWA 412

Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
           L+      FD  LV+SF+ +TRIL +N E E+EET+I GF +  QT    +   +  +Q+
Sbjct: 413 LKVGGSH-FDNTLVLSFVGQTRILTLNGE-EVEETDIPGFVADEQTFHTGNVTNDLFIQI 470

Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
           T  SVRL+S  S+ + +EW+     +++V   N +QVL ATG   L Y+EI    +    
Sbjct: 471 TPTSVRLISYESKMVISEWEPQNKRTISVVACNGTQVLCATGND-LFYIEISCNQIVSKG 529

Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
              L++E++CLDI+P+  N +  +I AVG+WTDISVRI  LP+L  I KE LGGEIIPRS
Sbjct: 530 FVTLQHEVACLDISPLDGNEA--KIVAVGLWTDISVRILILPNLEEINKELLGGEIIPRS 587

Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
           +L+  FEG +YLLCALGDG +  F+L+ +TG L+D+KKV+LGTQP  LRTF S +TT+VF
Sbjct: 588 ILMTCFEGNTYLLCALGDGSMYYFILHKQTGILSDKKKVTLGTQPTVLRTFRSLSTTNVF 647

Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
           A SDRPTVIYSSN KL++SNVNLKEV+HMC  N+ ++PDSLA+A +  +TIGTID+IQKL
Sbjct: 648 ACSDRPTVIYSSNHKLVFSNVNLKEVNHMCSLNAESYPDSLALATDSTVTIGTIDEIQKL 707

Query: 653 HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
           HIR++PLGE PRRI +QE S+TF + +++                               
Sbjct: 708 HIRTVPLGESPRRIAYQESSQTFGVITMRVDVQESSGVSIVRHSASTQATSTSSSSHIAS 767

Query: 682 --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
                        +E E+H + ++D  TFE +  + L   EY  S++S    +D   Y+ 
Sbjct: 768 HNKPSGHTASEIGQEIEVHNLLIIDQHTFEVLHAHTLMATEYALSLISTRLGEDPTSYFV 827

Query: 734 VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
           VGTA++ P+E EP  GRIL++   +GK   +AEKE KG+ YSL  FNGKLLA+IN  ++L
Sbjct: 828 VGTAFINPDETEPKMGRILLYHWSEGKFTQVAEKEIKGSCYSLVEFNGKLLASINSTVRL 887

Query: 794 YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
           ++W       +EL+ EC H  +I+ALY++T+GDF++VGDLM+S++LL YK  EG+ EE A
Sbjct: 888 FEWT----AEKELRLECSHFNNIIALYLKTKGDFVLVGDLMRSLTLLQYKTMEGSFEEIA 943

Query: 854 RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEER 896
           RDYN NWM+++EILDDD +LGAEN FNLF  +K+S   +++ER
Sbjct: 944 RDYNPNWMTSIEILDDDTFLGAENCFNLFVCQKDSAATSEDER 986


>gi|195996153|ref|XP_002107945.1| hypothetical protein TRIADDRAFT_18324 [Trichoplax adhaerens]
 gi|190588721|gb|EDV28743.1| hypothetical protein TRIADDRAFT_18324 [Trichoplax adhaerens]
          Length = 1134

 Score =  912 bits (2357), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 490/1101 (44%), Positives = 715/1101 (64%), Gaps = 83/1101 (7%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            +LDV +YGRIA+++L RP  E  D LFI T RY+ C+LQ+  E+  +ITRA GD+ +R+ 
Sbjct: 52   LLDVGVYGRIASMQLIRPENENCDLLFILTCRYRVCILQYKPETKSIITRAYGDMKNRVS 111

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFD-NKGQLKEAFNIRLEELQVLDIKFLYG 119
            RP++ G IGI+DPDC++I L LYDG  K+IP + +  +   A ++RLEELQVLD+KFLYG
Sbjct: 112  RPSETGLIGIVDPDCKVICLKLYDGWLKLIPLELDTDKEMSAEDVRLEELQVLDVKFLYG 171

Query: 120  CAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLI 179
              +PTI ++Y+  ++ R++KTYE++L++ D    PW+   ++  A +++PVPPP CG+++
Sbjct: 172  FTEPTIALIYESGQN-RYLKTYEISLQNADIHRQPWNIGKVEEEAFMILPVPPPSCGMVV 230

Query: 180  IGEETIVYCSANAFKAIPIRPSITK----AYGRVDADGSRYLLGDHAGLLHLLVITHEKE 235
            IG  +I Y       ++ I P+  K     +GRVD++G RYLLGD++G L +L++  E  
Sbjct: 231  IGAGSISYYKGQ--DSLHITPASLKDRITCFGRVDSNGCRYLLGDYSGRLFMLILVQEHS 288

Query: 236  ----KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIK-LNLQPDAK-GSYVEV 289
                KV  L +E LGETSI S I+YLDNA  YIGSS GDSQLIK LN  PD++  SY++V
Sbjct: 289  QSGIKVKDLCLEYLGETSIPSCITYLDNAFAYIGSSCGDSQLIKVLNTSPDSETDSYIDV 348

Query: 290  LERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKG 349
            ++ + NLGPI+D   VDL++QGQ Q+VTCSG  K+ SLR++RNGIGI+E A+++L  I G
Sbjct: 349  IDNFTNLGPIIDMVSVDLDKQGQSQLVTCSGFGKNASLRVLRNGIGIHELANIDLDHICG 408

Query: 350  MWSLR--SSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYN 407
            +W LR  S +   +D  LV+SF   +R L  +   E+EET+I GF    +T F  +  ++
Sbjct: 409  IWRLRTVSRSISEYDDVLVLSFAGHSRFLKFDGR-EVEETDISGFDDYKETDFAANVAFD 467

Query: 408  QLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGI 467
            Q+VQ+++ SVRL     R L  EWK P G +++ +TA  +Q+++A+G   L YLEIG+G 
Sbjct: 468  QIVQISNESVRLAGCDGRGLLQEWKPPNGKTISKSTAGNTQIMVASGC-ELFYLEIGEGE 526

Query: 468  LTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGE 527
            L +V +  LE++I+C+DI+ + ++   +QI AVG+W D+S R+  LP+L L+  E LGG+
Sbjct: 527  LKQVSNISLEHDIACIDIS-LKDDNERAQICAVGLWVDMSARLLLLPNLQLMLTESLGGD 585

Query: 528  IIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKN 587
            IIPRS++L  F+   YLL A+GDG L  +L+N  T  LT+RK V+LG     L TF S +
Sbjct: 586  IIPRSIMLNRFDNEIYLLVAMGDGTLAYYLVNTTTCSLTNRKSVNLGVVHSNLYTFKSGS 645

Query: 588  TTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTID 647
             ++VFA SDRPTVIY +N KL++SNVNLK+V+ M PF+S +FP+SLA+  +    IGTID
Sbjct: 646  ISNVFACSDRPTVIYINNHKLVFSNVNLKKVNFMSPFHSESFPNSLALVNDSGFIIGTID 705

Query: 648  DIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQ------------------------ 683
            +IQKLHIR+ PLGE  R    QE+S++F I + + +                        
Sbjct: 706  EIQKLHIRTKPLGETTR----QEESQSFGIITCRTEVPSEDDKNFVPTHQSASLLVSNRT 761

Query: 684  SCAEESE---------------MHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDS 728
             C E+S+               +  V ++D  + +      L   E+G S++SC+F +D 
Sbjct: 762  MCPEQSDNSSSTFDSDTLSEKNIDSVLIIDQHSLDAQCALQLQDCEWGMSLISCTFENDP 821

Query: 729  NVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAIN 788
              YYCVGTA+V  E+ EPTKG I +    +GK+Q +  KE  GAVY + AFNG+LLA++N
Sbjct: 822  EAYYCVGTAFVNLEDKEPTKGNIRILKYFEGKIQQVHSKEVSGAVYCMVAFNGRLLASVN 881

Query: 789  QKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGA 848
              + +Y+W       +EL  E   H ++LALY++T+GDFI++GDLM+SISL  Y+     
Sbjct: 882  STVSVYEWT----SNKELVEETSFHNNVLALYLKTKGDFILIGDLMRSISLCAYRPMNNE 937

Query: 849  IEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLG 908
            IE   ++ + NWM+AVEI+DDD YLG EN+ NLFT +KNS  +++EE+  L  VG YH+G
Sbjct: 938  IELICKNNDPNWMTAVEIIDDDSYLGGENSHNLFTCQKNS-SSSEEEQKHLPTVGVYHVG 996

Query: 909  EFVNRFRHGSLVMR----LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNL 964
            EFVN FR GSLVM+    +PDS  G I   +FGTV+G +GV+ +L    + F+  +   L
Sbjct: 997  EFVNVFRQGSLVMQNTVDIPDSVQGSI---LFGTVSGAVGVVVTLAPAMFEFVSAIANKL 1053

Query: 965  RKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVS--- 1021
              V+KGVG + H+ WRSF+N++KT   ++F+DGDL+ESFLDLS   M  ++  + +    
Sbjct: 1054 STVVKGVGKIEHQFWRSFSNDRKTEPCQSFVDGDLVESFLDLSPEDMQRVANGLTIQTAD 1113

Query: 1022 ------VEELCKRVEELTRLH 1036
                  VE++ K VEEL+R+H
Sbjct: 1114 GTRPAMVEDVLKTVEELSRIH 1134


>gi|47230701|emb|CAF99894.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 953

 Score =  910 bits (2351), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 487/1018 (47%), Positives = 651/1018 (63%), Gaps = 135/1018 (13%)

Query: 88   KVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGCAKPTIVVLYQDNKDARHVKTYEVALKD 147
            K+IP D   +  +A+NIRLEELQV+D+ FLYGC  PT+  +YQD +  RHVKTYEV+L D
Sbjct: 2    KMIPLDGDNREWKAYNIRLEELQVIDVHFLYGCQAPTVCFIYQDPQ-GRHVKTYEVSLAD 60

Query: 148  KDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIP---IRPSITK 204
            K+F +GPW Q N++  A ++IPVP P  G +IIG+E+I Y + + + AI    I+ S   
Sbjct: 61   KEFNKGPWKQENVEAEASMVIPVPEPFGGAIIIGQESITYHNGDKYLAIAPPTIKQSTIV 120

Query: 205  AYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTG-------LKIELLGETSIASTISYL 257
             + RVD +GSRYLLGD  G L +L++  EKE++         L +ELLGETSIA  ++YL
Sbjct: 121  CHNRVDPNGSRYLLGDMEGRLFMLLL--EKEELMDGTVALKDLHVELLGETSIAECLTYL 178

Query: 258  DNAVVYIGSSYGDSQLIK---------LNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLE 308
            DN VV++GS  GDSQL+K         LN+  + +GS+V V+E + NLGPIVD CVVDLE
Sbjct: 179  DNGVVFVGSRLGDSQLVKVRVTHSLSELNVDSNDQGSFVTVMETFTNLGPIVDMCVVDLE 238

Query: 309  RQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDPFDTFLVVS 368
            RQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W LRS      D  LV+S
Sbjct: 239  RQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWPLRSEAGRETDDMLVLS 298

Query: 369  FISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELR 428
            F+ +TR+L ++ E E+EETE+ GF    QT +C +  +NQL+Q+TSGSVRLV   S+ L 
Sbjct: 299  FVGQTRVLMLSGE-EVEETELPGFVDNQQTFYCGNVAHNQLIQITSGSVRLVLQDSKALV 357

Query: 429  NEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDINPI 488
            +EW+ P G +++VA  N +QV+LA G   L YL+I  G L ++   ++E+E++CLDI P+
Sbjct: 358  SEWREPQGRNISVAACNHTQVVLAVGRA-LYYLQILAGELKQISTTEMEHEVACLDITPL 416

Query: 489  GENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEGISYLLCAL 548
            GE  + S + AVG+WTDIS R+  LP    + KE LGGEIIPRS+L+  FEG  YLLCAL
Sbjct: 417  GEAGAESPLCAVGLWTDISARVLKLPCFTALHKEMLGGEIIPRSILMTTFEGSYYLLCAL 476

Query: 549  GDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKL 608
            GDG L  F L+++TG L++RKKV+LGTQP  LRTF S +T++VFA SDRPTVIYSSN KL
Sbjct: 477  GDGALFYFGLDLQTGALSERKKVTLGTQPTVLRTFRSLSTSNVFACSDRPTVIYSSNHKL 536

Query: 609  LYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICH 668
            ++SNVNLKEV++MCP NS  +PDSLA+A    LTIGTID+IQKLHIR++PL E PRRIC+
Sbjct: 537  VFSNVNLKEVNYMCPLNSEGYPDSLALANNSTLTIGTIDEIQKLHIRTVPLYESPRRICY 596

Query: 669  QEQSRTFAICSLKNQ---------------------------------------SCAEES 689
            QE S+ F + S + +                                       S  EE 
Sbjct: 597  QEVSQCFGVLSSRMEIQDVSGTTSPVRPSASTQALSSSVSSSKLFPSSTSPHESSFGEEV 656

Query: 690  EMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKG 749
            E+H + ++D  TFE +  +     EY  S++SC    D ++         +P E      
Sbjct: 657  EVHNLLVVDQHTFEVLHAHQFLPSEYALSLVSCRLGKDPSL-------QTIPNE------ 703

Query: 750  RILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSE 809
               VF                                   +++LY+W       +EL++E
Sbjct: 704  --CVF-----------------------------------QVRLYEWT----AEKELRTE 722

Query: 810  CGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDD 869
            C H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK  EG  EE ARD+N NWMSA+EILDD
Sbjct: 723  CNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIARDFNPNWMSAIEILDD 782

Query: 870  DIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMR-LPDSDV 928
            D +LGAEN FNLF  +K+S   TDEER  L+ VG +HLGEFVN F HGSLV++ L ++  
Sbjct: 783  DNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGVFHLGEFVNVFCHGSLVLQNLGETST 842

Query: 929  GQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKT 988
                +V+FGTVNG+IG++ SL    Y  L  LQ  L KVIK         WRSF+ E+KT
Sbjct: 843  PTQGSVLFGTVNGMIGLVTSLSEGWYSLLLDLQNRLNKVIKTT-------WRSFHTERKT 895

Query: 989  VDAKNFLDGDLIESFLDLSRTRMDEISKTMNV----------SVEELCKRVEELTRLH 1036
              A  F+DGDLIESFLDL R +M E+  T+ +          +V+E+ K VEELTR+H
Sbjct: 896  EQATGFIDGDLIESFLDLGRAKMQEVVSTLQIDDGSGMKREATVDEVIKIVEELTRIH 953


>gi|328874742|gb|EGG23107.1| UV-damaged DNA binding protein1 [Dictyostelium fasciculatum]
          Length = 1116

 Score =  907 bits (2344), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 492/1084 (45%), Positives = 714/1084 (65%), Gaps = 80/1084 (7%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            +L+  IYGRI+ ++LF  HGE Q+ LFI TER+K+CVL +DAE  ELITRA G+  D IG
Sbjct: 52   ILETNIYGRISNMKLFSVHGEKQELLFILTERFKYCVLSYDAEKKELITRASGEADDSIG 111

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RP D+GQ+ IIDPD +++ LHLY+GL K+IP + KG LK A+N+RL+ELQV+DI F+   
Sbjct: 112  RPADSGQLAIIDPDSKVLALHLYEGLLKIIPIE-KGTLKTAYNVRLQELQVIDIVFIAPT 170

Query: 121  A-KPTIVVLYQDNKDARHVKTYEVALKDKDFVEGP--WSQNNLDNGADLLIPVPPPLCGV 177
              +PT+ VL +D +D+++V TY ++LKDK+ +E    W+Q+N+++GA  LI       G+
Sbjct: 171  NNQPTLAVLSKDTRDSKNVNTYILSLKDKELIESANTWAQSNVEDGASKLIAHQN---GL 227

Query: 178  LIIGEETIVYCS-ANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVIT-HEKE 235
            L++GE  + +    ++ + IPI  +   A GRVD D  ++L GDH G L+ L+++ +EKE
Sbjct: 228  LVVGETLVSFLKPTSSPRTIPIAGTRITAVGRVDKD--KFLFGDHLGQLYFLLLSQNEKE 285

Query: 236  KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAK-GSYVEVLERYV 294
                L+ E LGET  ASTISYLD+ VV+IGS+ GDSQLI+L    D    SYV VL+ + 
Sbjct: 286  ----LRFEKLGETCTASTISYLDSGVVFIGSALGDSQLIRLLSDRDPNTNSYVTVLDTFA 341

Query: 295  NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLR 354
            NLGPI DFC+VD+E+QGQ Q+V CSG +K+GSLR++RNGIGI EQAS++L GIK +WSL 
Sbjct: 342  NLGPIPDFCLVDIEKQGQNQIVACSGGFKEGSLRVIRNGIGITEQASIDLPGIKAIWSLA 401

Query: 355  SSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTS 414
              +D     +L++SFIS T++L    ED +EETEI GF  Q+ TL+C +    Q++Q+++
Sbjct: 402  RGSD----RYLILSFISSTKVLEFQGED-IEETEIAGFDLQSPTLYCGNVADKQILQIST 456

Query: 415  GSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHA 474
              + LV   +    + WK P   S+N+A+   +Q+L++ G   L+Y EI D  + ++K  
Sbjct: 457  SGIYLVDHETNLNYDVWK-PSSGSINLASHQGNQILISFGKT-LIYFEIKDQKIIKLKEL 514

Query: 475  QLEYEISCLDINP--IGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
            ++E+EISCLDI+   +GE    S+I  VG+WTDIS+RI SLP L  + KE+LGGE+IPRS
Sbjct: 515  EMEFEISCLDISSFQVGER---SKICVVGLWTDISLRILSLPTLEQVHKENLGGEVIPRS 571

Query: 533  VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
            VL+ AFEGI+YLLC+LGDGHL NF+++     L +RKKVSLG QPI L  F  K T +VF
Sbjct: 572  VLMIAFEGINYLLCSLGDGHLFNFIVDTINNTLHERKKVSLGNQPIILSKFQLKGTVNVF 631

Query: 593  AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
            A+SDRPTVIYSSN+KLLYS VNLK V  +C F+S  F D +AIA +  L+IG IDDIQ+L
Sbjct: 632  ASSDRPTVIYSSNRKLLYSVVNLKVVVGVCSFDSEVFRDCIAIATDKTLSIGHIDDIQRL 691

Query: 653  HIRSIPL-GEHPRRICHQEQSRTFAICSLK-NQSCAEES--EMHFVRLLDDQTFEFISTY 708
            HI+ I L GE  RRI H E  + +A+ +++ ++S  EES  E +F+R+++ QTFE I  Y
Sbjct: 692  HIKKIDLGGEFARRIAHVEAHQCYALITIRPDESGLEESETEQNFIRIVNQQTFETIVKY 751

Query: 709  PLDTFEYGCSILSCSFS-----------------------------------------DD 727
            PL   EYG +ILS +F                                          ++
Sbjct: 752  PLKENEYGWTILSTNFKETLTSNSSSTSSSTSTTSTTSTSGRRRLPTTTTTTTTSNDEEE 811

Query: 728  SNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAI 787
             + Y  VGT Y   +  E   GRILVF + D +L L+ E   +G+++ + AFNG+LL AI
Sbjct: 812  QSEYIVVGTTYHCHDRKEC--GRILVFKMIDSRLILLDETTVRGSIFCMIAFNGQLLVAI 869

Query: 788  NQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEG 847
            N+ +  Y W   D  + +L  E  + GH  +LY+  RGDF++VGD+MKS++LL    ++ 
Sbjct: 870  NKSVHRYTWS-GDSSSGKLTGEEIYGGHTASLYLAGRGDFVLVGDMMKSMALLQASGKD- 927

Query: 848  AIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHL 907
             ++E +R     W++ +  +DDD YLG++N++NL  ++KN+E A + +   L+ +G  H 
Sbjct: 928  -VKELSRSSQPFWLTGLTFIDDDTYLGSDNSYNLILMKKNTETANEVDSQLLDNIGHIHT 986

Query: 908  GEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKV 967
            GEF+NRF HG+L   L D D  +  ++IF T++G IGVI+++  + Y F  KLQ  L +V
Sbjct: 987  GEFINRFHHGTLAT-LTDVDSPKPNSIIFATISGCIGVISTISKQDYDFFSKLQVGLNRV 1045

Query: 968  IKGVGGLNHEQWRSFNNEK-KTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELC 1026
            I+G+GG +H++WRSF NE    ++++NF+DGDL+E FL L   +M E++K M++S+E+  
Sbjct: 1046 IRGIGGFSHDRWRSFQNEHISNIESRNFIDGDLVEQFLHLRHDKMLEVTKDMDISIEDTY 1105

Query: 1027 KRVE 1030
            K++E
Sbjct: 1106 KKIE 1109


>gi|157128866|ref|XP_001655232.1| DNA repair protein xp-e [Aedes aegypti]
 gi|108882187|gb|EAT46412.1| AAEL002407-PA [Aedes aegypti]
          Length = 980

 Score =  905 bits (2340), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 445/933 (47%), Positives = 632/933 (67%), Gaps = 55/933 (5%)

Query: 3   DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESS--ELITRAMGDVSDRIG 60
           ++ I G+IA ++LFRP    +D +FI T RY   +L+   +    E+IT+A G+V+DR+G
Sbjct: 54  EIGINGKIAVMKLFRPAEAQKDLIFILTHRYNAMILECAVQGDDIEIITKAHGNVADRVG 113

Query: 61  RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
           +P + G + +IDP  R+IG+ LY+GLFK+IP D      +A ++R+EE+ V D++FLYG 
Sbjct: 114 KPAETGILAVIDPKARVIGMRLYEGLFKIIPLDRDTHELKATSLRMEEVHVQDVEFLYGT 173

Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
             PT++V++QD  + RH+KT+E+ LKDKDF +  W Q+N++  A +LIPVP PL G ++I
Sbjct: 174 QHPTLIVIHQD-LNGRHIKTHEINLKDKDFTKIAWKQDNVETEATMLIPVPTPLGGAIVI 232

Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVI-THEKEK 236
           G+E++VY   +++ A+    I+ S    Y RVD+ G RYLLG+ +G L ++ + T E  K
Sbjct: 233 GQESVVYHDGDSYVAVAPAIIKQSTINCYARVDSKGFRYLLGNMSGHLFMMFLETEENSK 292

Query: 237 ----VTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
               V  +K+ELLG+ +I   I+YLDN V++IGS +GDSQL+KLN      G+YV V+E 
Sbjct: 293 GLLSVKDIKVELLGDITIPECITYLDNGVLFIGSRHGDSQLVKLNTTAGDNGAYVTVMET 352

Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
           + NL PI+D C+VDLE+QGQGQ++TCSG+YK+GSLRI+RNGIGI E A ++L GIKGMW+
Sbjct: 353 FTNLAPIIDMCIVDLEKQGQGQMITCSGSYKEGSLRIIRNGIGIQEHACIDLPGIKGMWA 412

Query: 353 LRSSTDD-PFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQ 411
           LR   DD P+D  LV+SF+  TRIL ++ E E+EETEI GF S  QT +C +  + Q++Q
Sbjct: 413 LRVGIDDSPYDNTLVLSFVGHTRILTLSGE-EVEETEIPGFLSDQQTFYCANVDFGQIIQ 471

Query: 412 VTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEV 471
           VT  + RL+   ++ +  EWK P    ++V   N+ Q++ AT    + Y+EIG+  L   
Sbjct: 472 VTPTTARLIQCDNKSMICEWKPPDDKRISVVACNSCQMVCATAC-DIYYIEIGESKLVHK 530

Query: 472 KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPR 531
               L+YE++CLDI+P+ +N +++++ AVG+WTDIS  I  LP+L ++  E LGGEIIPR
Sbjct: 531 STVTLDYEVACLDISPLEDNATHAELVAVGLWTDISACILRLPNLEVVHTEKLGGEIIPR 590

Query: 532 SVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHV 591
           S+L+  FEGI YLLCALGDG +  F+L+  T  LTD+KKV+LGTQP  L+TF S +TT+V
Sbjct: 591 SILMAHFEGIVYLLCALGDGSMFYFVLDKNTNRLTDQKKVTLGTQPTILKTFRSLSTTNV 650

Query: 592 FAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQK 651
           FA SDRPTVIYSSN KL++SNVNLKEV+HMC  N+ A+ DSLA+A +  + +GTID+IQK
Sbjct: 651 FACSDRPTVIYSSNHKLVFSNVNLKEVNHMCSLNAEAYQDSLALATKNSVILGTIDEIQK 710

Query: 652 LHIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------ 681
           LHIR++PLGE PRRI +QE S+TF + +++                              
Sbjct: 711 LHIRTVPLGESPRRIAYQEASQTFGVITVRTDIQDSSGLTPSRQSASTQTTNVTLSTNMG 770

Query: 682 -------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCV 734
                  N    +E E+H + ++D  TFE +  +     EY  S++S    +D N YY V
Sbjct: 771 LLKAGASNAEFGQEVEVHNLLIIDQNTFEVLHAHQFMQTEYAMSLISAKLGNDPNTYYIV 830

Query: 735 GTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLY 794
           GTA V PEE EP  GRI+++   DG L  ++EKE KG+ YSL  FNG++LA+IN  ++LY
Sbjct: 831 GTALVNPEEPEPKVGRIIIYHYADGNLTQVSEKEIKGSCYSLVEFNGRVLASINSTVRLY 890

Query: 795 KWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERAR 854
           +W   DD  ++L+ EC H  ++LALY +T+GDFI+VGDLM+SI+LL YK  EG+ EE AR
Sbjct: 891 EWT--DD--KDLRLECSHFNNVLALYCKTKGDFILVGDLMRSITLLQYKQMEGSFEEIAR 946

Query: 855 DYNANWMSAVEILDDDIYLGAENNFNLFTVRKN 887
           DY  NWM+AVEILDDD +LGA+N+ NLF   K+
Sbjct: 947 DYQPNWMTAVEILDDDAFLGADNSNNLFVCLKD 979


>gi|41055225|ref|NP_956920.1| DNA damage-binding protein 1 [Danio rerio]
 gi|34784892|gb|AAH56837.1| Zgc:63840 [Danio rerio]
          Length = 897

 Score =  893 bits (2308), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 445/845 (52%), Positives = 594/845 (70%), Gaps = 56/845 (6%)

Query: 3   DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
           +V +YG+IA +ELFRP GE++D LFI T +Y  C+L++    +S ++ITRA G+V DRIG
Sbjct: 54  EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGDSIDIITRAHGNVQDRIG 113

Query: 61  RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
           RP++ G IGI+DP+CR+IGL LYDGLFKVIP D + +  +AFNIRLEELQV+D++FLYGC
Sbjct: 114 RPSETGIIGIVDPECRMIGLRLYDGLFKVIPLDRENRELKAFNIRLEELQVIDVQFLYGC 173

Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
             PT+  +YQD +  RHVKTYEV+L++K+F +GPW Q N++  A ++IPVP P  G +II
Sbjct: 174 QAPTVCFIYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIPVPEPFGGAIII 232

Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKV 237
           G+E+I Y + + + A+    I+ S    + RVD +GSRYLLGD  G L +L++  EKE++
Sbjct: 233 GQESITYHNGDKYLAVAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLL--EKEEL 290

Query: 238 TG-------LKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVL 290
                    L +ELLGETSIA  ++YLDN VV++GS  GDSQL+KLN+  + +GSYV V+
Sbjct: 291 MDGAVVLKDLHVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNDQGSYVGVM 350

Query: 291 ERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGM 350
           E + NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+
Sbjct: 351 ETFTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGL 410

Query: 351 WSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLV 410
           W LRS +    D  LV+SF+ +TR+L ++ E E+EETE++GF    QT FC +  + QL+
Sbjct: 411 WPLRSESSRDTDDMLVLSFVGQTRVLMLSGE-EVEETELQGFVDNQQTFFCGNVAHQQLI 469

Query: 411 QVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTE 470
           Q+TS SVRLV+  S+ L +EWK P G +++VA+ N +QV+LA G   L YL+I  G L +
Sbjct: 470 QITSVSVRLVTQDSKALVSEWKEPQGRNISVASCNNTQVVLAVGR-VLYYLQILSGELKQ 528

Query: 471 VKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIP 530
           +   ++E+E++CLDI P+GE  + S I AVG+WTDIS R+  LP    + KE LGGEIIP
Sbjct: 529 ISSTEMEHEVACLDITPLGERTADSCICAVGLWTDISARLLKLPCFTPLHKEMLGGEIIP 588

Query: 531 RSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTH 590
           RS+L+  FEG  YLLCALGDG L  F L+++TG L++RKKV+LGTQP  LRTF S +T++
Sbjct: 589 RSILMTTFEGSHYLLCALGDGALFYFGLDIQTGVLSERKKVTLGTQPTVLRTFRSLSTSN 648

Query: 591 VFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQ 650
           VFA SDRPTVIYSSN KL++SNVNLKEV++MCP NS  +PDSLA+A    LTIGTID+IQ
Sbjct: 649 VFACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSEGYPDSLALANNSTLTIGTIDEIQ 708

Query: 651 KLHIRSIPLGEHPRRICHQEQSRTFAICSLK----------------------------- 681
           KLHIR++PL E P+RIC+QE S+ F + S +                             
Sbjct: 709 KLHIRTVPLYESPKRICYQEVSQCFGVLSSRVEMQDASGTTAAVRPSASTQALSSSVSSS 768

Query: 682 ----------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVY 731
                       S  EE E+H + ++D  TFE +  +     EY  S++SC    D  VY
Sbjct: 769 KLLPSSTSPHETSFGEEVEVHSLLVVDQHTFEVLHAHQFLQNEYALSMVSCKLGRDPAVY 828

Query: 732 YCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKI 791
           + VGTA V PEE EP +GRI+VF   DGKLQ +AEKE KGAVYS+  FNGKLLA+IN  +
Sbjct: 829 FIVGTAMVYPEEAEPKQGRIIVFHYTDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTV 888

Query: 792 QLYKW 796
           +LY+W
Sbjct: 889 RLYEW 893


>gi|290998415|ref|XP_002681776.1| damage-specific DNA binding protein 1 [Naegleria gruberi]
 gi|284095401|gb|EFC49032.1| damage-specific DNA binding protein 1 [Naegleria gruberi]
          Length = 1103

 Score =  879 bits (2272), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 469/1058 (44%), Positives = 689/1058 (65%), Gaps = 45/1058 (4%)

Query: 8    GRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDNGQ 67
            GRI T+ LFRP GE QD LFI  E   F +   D +   + T + G + D +GR +++G 
Sbjct: 59   GRIDTMSLFRPSGEKQDLLFITIEDTFFTLGFIDGK---IETLSSGSIDDPVGRRSESGS 115

Query: 68   IGIIDPDCRLIGLHLYDGLFKVIPFDN-KGQLKEAFNIRLEELQVLDIKFLYGCAK---- 122
            I  IDP CR + L +Y+GL K+IPF+N K Q KEAFN+RLEEL V+DI FL         
Sbjct: 116  ITTIDPLCRAVALSIYEGLLKIIPFENNKHQFKEAFNVRLEELNVIDIAFLESLGSKSKS 175

Query: 123  -PTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIG 181
             PT  +LYQD+  +RHVKTYEV   DKD  E   +Q N+D+GA++LIPVP PL GV+ +G
Sbjct: 176  GPTFALLYQDHVGSRHVKTYEVKTLDKDMEESSLNQLNVDHGANILIPVPAPLGGVICVG 235

Query: 182  EETIVY---CSANAFKAIPIRPSIT-KAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKV 237
            E  + Y    + N   A P    +  ++YG++D   +R+ LGD +G L+LL +     +V
Sbjct: 236  EAQVSYINESNKNHSVASPANSRMAIRSYGKLD--NTRWFLGDQSGQLYLLSLQVSDSEV 293

Query: 238  TGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLN--LQPDAKGSYVEVLERYVN 295
            TGL ++ LG TSI+S ISYLDN  V+IGS+YGDSQ+I+++  L P+   SY EV + Y N
Sbjct: 294  TGLTLKELGVTSISSCISYLDNGYVFIGSNYGDSQVIRISDELNPETN-SYFEVFQTYSN 352

Query: 296  LGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLR 354
            +GPIVDFC VD ++QGQGQ+VTCSGA+KDGSLRI++NGIGI E  ++ +L G+  +WSL+
Sbjct: 353  IGPIVDFCFVDADKQGQGQIVTCSGAFKDGSLRIIKNGIGIEELTTISDLVGLNRIWSLK 412

Query: 355  SSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTS 414
            + T +  + +LV+SF  +T I +++ E E+ E +I GF   + T+ C   I N  +QVT 
Sbjct: 413  TETGE--EKYLVMSFTGQTLISSVDNE-EIGEAKIPGFDVDSTTVLCDTVIGNNYLQVTD 469

Query: 415  GSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHA 474
             + RLVSS + EL +EWK P   ++++A +N +Q++++ G G LVY+EI    L +++  
Sbjct: 470  KTARLVSSHTLELIDEWK-PSSGTISLAASNPTQLVVSLGEGKLVYIEISAQSLKQIQQT 528

Query: 475  QLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVL 534
            +L+YE+SC+DI+P  E    S + AVGMWT+ISVR+ +LP  +++T + LGGEI+PRS+L
Sbjct: 529  KLDYEVSCIDISP-EEGKISSTVCAVGMWTEISVRVLTLPSFDILTVQELGGEILPRSIL 587

Query: 535  LCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTT-HVFA 593
            +  FEGI+YL+C LGDGHL  F ++   G LTD+K++  G++PI LR F SK+   +VFA
Sbjct: 588  MPTFEGINYLMCGLGDGHLFTFKMDRGMGILTDKKRIVAGSKPIMLRPFISKDRQLNVFA 647

Query: 594  ASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLH 653
            AS+RPT+IYS N+KL+YSNVNL EV+ MC FN  +FP  +AIA E  + IG ID+IQKLH
Sbjct: 648  ASERPTIIYSRNQKLVYSNVNLGEVNDMCSFNHESFPFHIAIANENSILIGQIDEIQKLH 707

Query: 654  IRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTF 713
            I+++ L   P +ICHQ  ++ + +   +     E  E   +  LD+  FE  S   L+  
Sbjct: 708  IKTVQLHAQPTKICHQPSTKAYGLLITQ----FENPETSAIFSLDENNFEKKSEIRLEGN 763

Query: 714  EYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAV 773
            E G SI+SC F+DD N Y+ VGTA    +E EP+KGRILV  V+D KL L AEK+ KGAV
Sbjct: 764  ELGQSIISCKFTDDDNEYFIVGTAITEGDEEEPSKGRILVLQVQDDKLVLKAEKDVKGAV 823

Query: 774  YSLNAFNGKLLAAINQKIQLYKWMLRDDG-TRELQSECGHHGHILALYVQTRGDFIVVGD 832
              L++FNGKLLA ++ ++ L+KW   DDG  ++L  EC   G I  L + + GDFI++GD
Sbjct: 824  MVLHSFNGKLLAGVSGRLMLFKWAESDDGDNKDLVQECSCSGGIYILDIDSHGDFILIGD 883

Query: 833  LMKSISLLIY-----KHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKN 887
            +MKS+ L +Y     +H  G +   ++DY  +W+S   +L++  Y+  +   N+ T++KN
Sbjct: 884  MMKSVHLFVYENPEEQHVSGNLRLISKDYQYSWLSCSLMLNESEYVAVDQQGNMITLKKN 943

Query: 888  SEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPD--SDVGQIP--TVIFGTVNGVI 943
             E A++EER +L  VG+Y+  + VNR + G + MR  +  SD+   P  T +FGT++G I
Sbjct: 944  DEAASEEERKQLVRVGKYYCSDRVNRIQPGFIGMRFANSSSDINTQPVKTALFGTISGGI 1003

Query: 944  GVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESF 1003
            GV+A LP E + F+ K+Q  +  V+ G+  ++ E +R + +E+   D+  F+DGD +ESF
Sbjct: 1004 GVLAQLPPETFAFVTKIQKAMSSVVTGLANISRETYRQYRSERTREDSVGFIDGDFVESF 1063

Query: 1004 LDL----SRTRMDEISKTM--NVSVEELCKRVEELTRL 1035
            L+      +  ++E+S      +++EEL K +E+L+ L
Sbjct: 1064 LEFDFETQQRVIEELSNNHQEQITLEELVKNIEDLSHL 1101


>gi|391335522|ref|XP_003742140.1| PREDICTED: DNA damage-binding protein 1-like [Metaseiulus
            occidentalis]
          Length = 1154

 Score =  873 bits (2256), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 484/1118 (43%), Positives = 677/1118 (60%), Gaps = 101/1118 (9%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSE---LITRAMGDVSDRI 59
            +V IYGRI+ ++ FR   E++D LF  T++Y   +L++ AESSE   +ITR+ G VSD  
Sbjct: 54   EVNIYGRISVMKKFRYPNESKDLLFFLTDKYNVAILEFSAESSENFEVITRSHGCVSDPY 113

Query: 60   GRPTDNGQIGIID-PDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLY 118
             RP++ G + ++D P  R+I L LYDGL K+IP + + +   ++NIR+EE Q+ D+ FL 
Sbjct: 114  ARPSEAGNLVVVDQPKARVIALRLYDGLLKMIPLNREAKELRSYNIRVEEAQITDMCFLS 173

Query: 119  GCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVL 178
              +   ++ +  + +  RH+KT+ +AL+DK+ ++GPW Q NLD  AD+LIPV     GV+
Sbjct: 174  SSSSDPVLAIVYEEQQTRHMKTHVIALRDKELMKGPWGQRNLDLEADMLIPVEDTETGVI 233

Query: 179  IIGEETIVYCSANAFKAIPIRPSITKA-----YGRVDADGSRYLLGDHAGLLHLLVITHE 233
            I+G ETIVY     +  I I+PS  +      Y R+D +   ++LG   G L +L +  E
Sbjct: 234  IVGGETIVYHYGQDY--ICIQPSFLRTTKISCYCRIDNNRLVFILGGICGRLFILTLRRE 291

Query: 234  KEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERY 293
             +KV    ++LLG  SI   +SYLDN VV++GS  GDSQLI+++    A+  ++EVLE Y
Sbjct: 292  NKKVVSHSLDLLGSVSIPECLSYLDNGVVFVGSRLGDSQLIRMH----AQEPFIEVLESY 347

Query: 294  VNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL 353
             NLG I+D  VVDLE+QGQ Q++TCSG    GSLRI+RNGIGI+E A VEL GIKG+W+L
Sbjct: 348  TNLGAILDMIVVDLEKQGQDQLITCSGQGACGSLRIIRNGIGIHELACVELSGIKGIWAL 407

Query: 354  RSST-----DDPFDTFLVVSFISETRILAMNLEDELEETEI-EGFCSQTQTLFCHDAIYN 407
            R +T     D P D  LV+SF+ +TR+   +  +ELE+  +   F   +QT    + + N
Sbjct: 408  RMNTAQLEEDTPTDDTLVLSFVGQTRVFNCSSTEELEQVTLPAAFDIDSQTFCARNVLGN 467

Query: 408  QLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGI 467
            Q++QVT   V L+S TS+   ++W  P G  +     N  QV LA     LVYLEI DG 
Sbjct: 468  QVIQVTDKRVNLISVTSKTRVDQWFPPEGEIITQCACNDVQVALALKNV-LVYLEIRDGS 526

Query: 468  LTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGE 527
            LTE+K  +LEYEI+C+D+N + +    + I  VG+WTD+S+ + SLPDL  + ++ L  +
Sbjct: 527  LTEIKRTRLEYEIACMDLNTLDKEGDQTSIITVGLWTDVSILVLSLPDLEQLFRQELPKD 586

Query: 528  IIPRSVLLCAFEGIS-YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSK 586
            +IPRSVL   FEG + YLLC L DG L  + L   TGELT +K+V+LGTQP TLR F S+
Sbjct: 587  VIPRSVLKITFEGSTDYLLCTLADGSLFYYHL-ASTGELTGQKRVTLGTQPTTLRKFRSQ 645

Query: 587  NTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTI 646
             T  VFA SDRPTVIYSS  KL++S VNL+EV HMC F+S AFPDSLA+A E E  IGTI
Sbjct: 646  RTWSVFACSDRPTVIYSSTSKLVFSKVNLREVKHMCSFSSEAFPDSLALASEDEFVIGTI 705

Query: 647  DDIQKLHIRSIPLGEHPRRICHQEQSRTFAI--------CSLKNQS-------------- 684
            D IQKLHIR++PLGE PRRI +QE++ TF +        CS +  S              
Sbjct: 706  DAIQKLHIRTVPLGESPRRIAYQEETGTFGVIVSRSDMACSTRCASLDAPNKSNASPYAW 765

Query: 685  -----------CAEE--------------------SEMHFVRLLDDQTFEFISTYPLDTF 713
                       CA+                      E   + ++D  TFE +        
Sbjct: 766  HKDFSSFGHTQCADRVDSGIPSCSSTSLQRPPSGCDETFSLLIIDQNTFEVLHAMQFCPN 825

Query: 714  EYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAV 773
            EYG SI S     D N YY VGTA++  EE+EP  GRI V    DGKL+ IAEKE  GA 
Sbjct: 826  EYGVSICSAKLGSDPNPYYIVGTAFINQEESEPKVGRIFVLRWHDGKLETIAEKEAAGAP 885

Query: 774  YSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDL 833
            YS+  F+ KL  AIN  ++LY W    +  ++LQSEC    +I+ L+++  GD+I+VGDL
Sbjct: 886  YSIREFHQKLAIAINSTVRLYSW----NAEKDLQSECTPFFNIVILHLKCLGDYILVGDL 941

Query: 834  MKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATD 893
            M+S++LL Y  +  ++EE  RDY  NW +AVEILD+D +L AE+N NL+  +++   A D
Sbjct: 942  MRSMTLLNYNADITSLEEIGRDYQTNWTTAVEILDEDTFLAAESNLNLYVCKRDPSAA-D 1000

Query: 894  EERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIP-----TVIFGTVNGVIGVIAS 948
            + R  +  V  YHLGE VN    GSLVM  P    G +P     + ++G+++G +GVI  
Sbjct: 1001 DTRQHMHEVALYHLGEMVNVIVKGSLVMAQP----GDMPLPLNKSFLYGSLHGAVGVIVP 1056

Query: 949  LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSR 1008
            +  E Y  L ++QTNL K IK VG + H  WR+F  E+K   A  F+DGDLIE  LDL +
Sbjct: 1057 IKQELYAILNQIQTNLAKTIKSVGKIEHGFWRTFLAERKIEPATGFIDGDLIEQLLDLPK 1116

Query: 1009 TRMDEISKTM----------NVSVEELCKRVEELTRLH 1036
              ++ +S+++          N++ E+L K VE+LTR+H
Sbjct: 1117 EALESVSQSIKVDEEGGHQRNMTPEDLVKLVEDLTRIH 1154


>gi|449488592|ref|XP_004158102.1| PREDICTED: LOW QUALITY PROTEIN: DNA damage-binding protein 1-like
           [Cucumis sativus]
          Length = 570

 Score =  853 bits (2205), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 420/450 (93%), Positives = 437/450 (97%)

Query: 1   MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
           MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWD ESSELITRAMGDVSDRIG
Sbjct: 53  MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDTESSELITRAMGDVSDRIG 112

Query: 61  RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
           RPTD+GQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC
Sbjct: 113 RPTDSGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 172

Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
           ++PTIVVLYQDNKDARHVKTYEV LKDKDFVEGPWSQNNLDNGA +LIPVPPPLCGV+II
Sbjct: 173 SRPTIVVLYQDNKDARHVKTYEVVLKDKDFVEGPWSQNNLDNGAAVLIPVPPPLCGVIII 232

Query: 181 GEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL 240
           GEETIVYCSA AFKAIP+RPSIT+AYGRVDADGSRYLLGDHAGLLHLLVITHEKE+VTGL
Sbjct: 233 GEETIVYCSATAFKAIPVRPSITRAYGRVDADGSRYLLGDHAGLLHLLVITHEKERVTGL 292

Query: 241 KIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIV 300
           KIELLGETSIASTISYLDNA VYIGSSYGDSQL+KLN+QPDAKGSYVEVLERYVNLGPIV
Sbjct: 293 KIELLGETSIASTISYLDNAFVYIGSSYGDSQLVKLNVQPDAKGSYVEVLERYVNLGPIV 352

Query: 301 DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP 360
           DFCVVDLERQGQGQVVTCSGAYKDGSLR+VRNGIGINEQASVELQGIKGMWSLRSSTDDP
Sbjct: 353 DFCVVDLERQGQGQVVTCSGAYKDGSLRVVRNGIGINEQASVELQGIKGMWSLRSSTDDP 412

Query: 361 FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLV 420
           FDTFLVVSFISETRILAMNLEDELEETEIEGF SQ QTLFCHDA++NQLVQVTS SVRLV
Sbjct: 413 FDTFLVVSFISETRILAMNLEDELEETEIEGFNSQVQTLFCHDALFNQLVQVTSSSVRLV 472

Query: 421 SSTSRELRNEWKSPPGYSVNVATANASQVL 450
           SST+REL NEW +P  YS+NVATANASQ +
Sbjct: 473 SSTTRELLNEWNAPSNYSINVATANASQCI 502


>gi|340381612|ref|XP_003389315.1| PREDICTED: DNA damage-binding protein 1-like [Amphimedon
            queenslandica]
          Length = 1142

 Score =  823 bits (2126), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 455/1104 (41%), Positives = 679/1104 (61%), Gaps = 81/1104 (7%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            +LDV ++GRI  ++LFRP GE  D LFI T RY   ++ +D  +++++TRA GD+ +R+G
Sbjct: 52   VLDVDVFGRILAIQLFRPQGERMDLLFILTARYHVAIVGYDPIANDVLTRAYGDIKERVG 111

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIP--FDNKGQLKEAFNIRLEELQVLDIKFLY 118
            +PT    + ++DP C+LI LHLY G  K++P  FD+   LK AFNIRLE+L + DI+FL+
Sbjct: 112  KPTLGKNVAMVDPSCQLIALHLYTGQLKIVPLQFDSGSPLK-AFNIRLEDLYITDIQFLH 170

Query: 119  GCAKPTIVVLYQDNKDA--RHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCG 176
            G   PTI  + ++   A  R +KT+ ++ +DK+ + GPW  N ++  A LL  VP P  G
Sbjct: 171  GTENPTIAYISEEPSVATGRVLKTFVISQRDKELLPGPWKPNTIEGQASLLCSVPSPYNG 230

Query: 177  VLIIGEETIVYC--SANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEK 234
            ++++G +++ Y   +++    I I+ S+      +D   SRYLLGD  G L  L +   +
Sbjct: 231  LIVVGADSVAYFNDTSHTVDPIVIKESVISCIEPLDH--SRYLLGDFRGRLLTLFLEFSE 288

Query: 235  EKVTGL------KIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVE 288
            E  +G+      K+E+LGE SI  T+SYLDN VV++GS+ GDSQL+KL+  P   G Y++
Sbjct: 289  EMESGMTNIVNMKLEVLGEISIPHTLSYLDNGVVFVGSTKGDSQLVKLSSSPLENGGYID 348

Query: 289  VLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIK 348
            VLE   N+GPI+D  VVDL++QG+  +V CSG  KDG+LRIV++GIGINE AS++L GIK
Sbjct: 349  VLESMTNIGPILDMSVVDLDKQGRDVLVCCSGLGKDGALRIVKSGIGINEAASIDLPGIK 408

Query: 349  GMWSLR-SSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYN 407
            G+WSL+ +  +D  D  +V++F+ +T  L +  E E+EETE+    +  QT +C +   N
Sbjct: 409  GIWSLKCAGREDELDDTVVLTFVGQTMALRLAGE-EVEETELPALVTDQQTFYCSNVTGN 467

Query: 408  QLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGI 467
             ++Q+T+ SVRL+   + EL  +W  P G  ++ A  N+SQV++A G   L YLE+  G 
Sbjct: 468  AIIQITTKSVRLMDDKAMELICDWSPPDGRGISTAACNSSQVMVAVGC-DLYYLEVKPGS 526

Query: 468  LTE---VKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHL 524
              E   + H  + +E++CLDINP+ E  + S + AVG+WTDISV+I ++P    +  + L
Sbjct: 527  PGELLLISHTTMSHEVACLDINPLSEAGT-SSLCAVGLWTDISVQILNVPQFEHLFTQPL 585

Query: 525  GGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFS 584
            GG+IIPRSVL+    G  YLLCALGDG L  + ++ +TG+L   K+V LGT+PI L+ F 
Sbjct: 586  GGDIIPRSVLMVELGGACYLLCALGDGCLHYYTMDSETGQLRGGKRVVLGTKPIVLKQFK 645

Query: 585  SKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIG 644
            S   T VFA SD PT+I+ SN+KLL+SNVN+KEV+++C  NS AF +SLA+     L  G
Sbjct: 646  SDGVTSVFACSDHPTIIHFSNQKLLFSNVNVKEVNYICTLNSEAFQNSLALVDSSTLIFG 705

Query: 645  TIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLK----------------------N 682
             +D IQ LHI +IPLGE PR I +QE S+TF +   +                      N
Sbjct: 706  CVDQIQMLHIETIPLGESPRCIAYQESSQTFLVGGYRTDKSGPDNTYTPSRQSVSTRTSN 765

Query: 683  QSCA-------------EESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFS--DD 727
             S A              + EMH + L D  TF+    Y L   E+   + SC+ +  D+
Sbjct: 766  VSVAVVPPQLNIEEFKCPQVEMHSLILFDQTTFDVSHVYQLCPQEHILCVTSCNLTTNDE 825

Query: 728  SNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAI 787
                Y VGTA V PEE E + GRILVF V  GKL+L+ EK   GAV+ +  FNGK+L ++
Sbjct: 826  ERSVYVVGTALVKPEEKESSTGRILVFAVNSGKLELLHEKLENGAVFQVLGFNGKILNSV 885

Query: 788  NQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEG 847
            N  + +   +   DG   L+ EC +  +ILALY++T+GDFI+VGD+++S+ LL+YK E G
Sbjct: 886  NSGVFVNALV---DGA--LKEECAYKNNILALYLKTKGDFILVGDILRSLKLLVYKEELG 940

Query: 848  AIEERARDYNAN--WMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEY 905
             +EE   D+N +  + +A+E++DD+ YLGA+   ++F  +KN+E  ++ +   +      
Sbjct: 941  -LEEIGVDHNISPCFCTAIEMIDDENYLGADGR-HIFICQKNTEATSEADLLYMVQPSRM 998

Query: 906  HLGEFVNRFRHGSLVMRLPD---SDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQT 962
            + G+ VN F  GS VM  P    S + Q   ++FGTV+G IG+I +L  + Y  L KLQ 
Sbjct: 999  YFGDNVNVFSRGSFVMDHPGAGASSLLQGKPILFGTVHGAIGLIGTLNMDTYTLLSKLQQ 1058

Query: 963  NLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTM---- 1018
             +   IK VG + HE +RSF+NE ++     F+DGDL+E FL+L R +M +I + +    
Sbjct: 1059 KMAANIKSVGNIEHEIYRSFSNEHRSKPFAGFIDGDLVEKFLELPRPQMSQIVQGIKTTD 1118

Query: 1019 ------NVSVEELCKRVEELTRLH 1036
                  +VSV+++ K +E+L+RLH
Sbjct: 1119 VTGTEVDVSVDDILKLIEDLSRLH 1142


>gi|328770638|gb|EGF80679.1| hypothetical protein BATDEDRAFT_11194 [Batrachochytrium dendrobatidis
            JAM81]
          Length = 1098

 Score =  813 bits (2101), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 428/1051 (40%), Positives = 656/1051 (62%), Gaps = 25/1051 (2%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRP 62
            DVPIYGRIA +  FRP     D +F++TER    VL +D ++  L+  A GD +D   R 
Sbjct: 56   DVPIYGRIACIHAFRPEHRDTDLIFVSTERGSVIVLSFDQQTQRLVPEATGDFNDPSSRL 115

Query: 63   TDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGCAK 122
             D GQIG++DP  R+IGLH++ G FK+IP  +      ++N+ +   Q +  K +     
Sbjct: 116  ADPGQIGLVDPQNRMIGLHIHQGNFKIIPMFHSQISLSSWNLLVNSSQSIKGKGVARPPH 175

Query: 123  PTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGE 182
            PT+ VL+ D+K++R+V  YEV +K+K  V    +   ++NG+ LLIPVP    GVL++GE
Sbjct: 176  PTLAVLHMDSKESRYVTQYEVVIKEKALVLKS-AATKVENGSSLLIPVPLAAGGVLVVGE 234

Query: 183  ETIVYCSANAFKAI--PIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL 240
            ++I   S +  K I   I+ ++ K + +VD  G RYLLGD+ G L++L I         L
Sbjct: 235  QSIALHSPSLQKPIMLSIKATLIKCFNQVDV-GFRYLLGDYEGQLYILAIIFANGTAREL 293

Query: 241  KIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIV 300
            K+  +G+T  AS+++YL +  +++ S +GDS+L  +  +    G+ + + + + NL PI 
Sbjct: 294  KMTPVGQTVQASSLTYLTDGYLFVASHFGDSELYLIVPEDANTGNVLTLCKTFSNLAPIS 353

Query: 301  DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVE-LQGIKGMWSLRSSTDD 359
            DFCVVD+E+QGQ Q+V CSGA +DGS+RI+RNGIG+ E   ++ ++ + G+W+L+  +  
Sbjct: 354  DFCVVDIEKQGQAQIVACSGAQRDGSIRIIRNGIGVEEIGQLDDMEELTGVWALKPYSAA 413

Query: 360  PFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRL 419
              D  LV+SFI ETR+  ++ +   E   +  F +  +TL+C +   + +VQ+TS S+ +
Sbjct: 414  RHDNVLVLSFIGETRLQKLDGDSMAEMDMLGNFKTAERTLWCQNLSSDMVVQITSQSITI 473

Query: 420  VSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYE 479
            ++        EW    G SV  A+   + +L++ GGG +   E  D  L   +  Q++ E
Sbjct: 474  LTIEGWTTVAEWCFDLGASVTHASVYQNMILVSLGGGMIHLFEFNDRELVMKRSIQIQVE 533

Query: 480  ISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFE 539
            +SCL I  + E      + AVG W D SVR+  +PDL+ I KE L G+ IPRS+LL  F+
Sbjct: 534  VSCLHICKMEELNVC--LCAVGCWEDNSVRLLKIPDLSEIQKEILPGDTIPRSILLVEFD 591

Query: 540  GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPT 599
             + YLL +LGDG L NF +  K+ +L DRKK++L TQPITLRTF S   THVFAASDRPT
Sbjct: 592  NLPYLLVSLGDGQLFNFRIG-KSLKLADRKKITLATQPITLRTFQSHGRTHVFAASDRPT 650

Query: 600  VIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPL 659
            VI+  + +LLYSNVN++E+SH+ PFNS     +LA A +G L IGTI+ +QKLHI++I L
Sbjct: 651  VIFVKSGQLLYSNVNVREISHVSPFNSHMAEGALAFASDGALKIGTIETVQKLHIKTIKL 710

Query: 660  GEHPRRICHQEQSRTFAICSL--KNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGC 717
            GE PRRI + + S TF + ++  +N    + +++  +RLLD Q +E + +  L  FE   
Sbjct: 711  GETPRRIAYHDVSHTFGVLTVFSRNLPNGDLADISCLRLLDGQGYEVLDSIELQPFEIAS 770

Query: 718  SILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGK-LQLIAEKETKGAVYSL 776
            S+++  F+DD  +YY VGT +  P E+EP +GRILVF V D + LQL+ E + +G+ YS 
Sbjct: 771  SLITIRFTDDDTLYYTVGTGFAFPHEDEPVRGRILVFKVNDMRLLQLVHEYDIRGSAYSF 830

Query: 777  NAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKS 836
             + +G+L+A +N  + + +W   D    ELQS   +HGH+LAL +  RGDFI+V DL+KS
Sbjct: 831  VSVHGRLVAGVNSNVMVLRWN-SDTSLLELQSM--NHGHVLALSLAVRGDFILVADLIKS 887

Query: 837  ISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEER 896
            I+LL +     +++E A D ++NWM+A E++DDD +LGA+++ N+F + K  +  ++EER
Sbjct: 888  ITLLQFDLATDSLKELAYDADSNWMTAAELIDDDTFLGADSSMNIFALSKQGDQVSEEER 947

Query: 897  GRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVG--QIPTVIFGTVNGVIGVIASLPHEQY 954
             RL   G +H GE +NRFR GSL +   D  +    IP +++ TV+G IGV+A +P ++ 
Sbjct: 948  QRLRPKGWFHTGELINRFRKGSLTLHATDETLALPAIPEILYCTVHGAIGVVARIPSDET 1007

Query: 955  L-FLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMD- 1012
               L  LQ  L+ V++GVGGL H  WR +  E++++ +   +DGDLIESFL+L R+  D 
Sbjct: 1008 AKILSTLQEALKSVVQGVGGLIHSDWRRYRTERRSIKSAGIIDGDLIESFLELDRSMQDH 1067

Query: 1013 -------EISKTMNVSVEELCKRVEELTRLH 1036
                   +++ +  V++E L K VE+LTR+H
Sbjct: 1068 VFTQVATQVAGSTPVTLETLTKMVEDLTRIH 1098


>gi|324502823|gb|ADY41238.1| DNA damage-binding protein 1, partial [Ascaris suum]
          Length = 1129

 Score =  805 bits (2080), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 458/1098 (41%), Positives = 662/1098 (60%), Gaps = 86/1098 (7%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRP 62
            +VP++GRIA ++ FR   E+ D + + T+++   +L WD+ + EL TRA G ++DR+GR 
Sbjct: 54   EVPVFGRIAAVKAFRVKNESVDSILVLTQKHHLAILCWDS-NGELRTRASGHIADRVGRH 112

Query: 63   TDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGCAK 122
            ++ G I  +     L+   LYDGL KV+ +     L+  FNIR E+L ++D+ FL    K
Sbjct: 113  SETGIIACVHSS-GLMAFRLYDGLIKVVQWTEGSDLR-GFNIRCEDLYIVDLDFLSDPEK 170

Query: 123  PTIVVLYQDNKDARHVKTYEVALKDKDFVEGP-WSQNNLDNGADLLIPVPPPLCGVLIIG 181
            PT+V +Y+D+ + RH+K   + L+DK+    P W Q+N++  A ++IP+P P  GV+++G
Sbjct: 171  PTVVYIYRDH-NGRHLKAAAINLEDKELSSPPLWKQDNIEAEACMVIPIPQPYGGVIVVG 229

Query: 182  EETIVYCS-ANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAG-----LLHLLVITH 232
             E I Y   ANA+ AI    I  S    YG++D DG RYLLGD +G     LL L V T 
Sbjct: 230  HEAISYHKDANAYSAIAPPLIHQSQISCYGKIDRDGQRYLLGDLSGRIFMLLLDLDVATD 289

Query: 233  EKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
                V  LK+ELLGETSI   + YLDN VV+IGS +GDSQL++L  +P   GSY+ +++ 
Sbjct: 290  GTASVKDLKVELLGETSIPECVVYLDNGVVFIGSRFGDSQLVRLRSEPCPDGSYISLMDT 349

Query: 293  YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
            YVNL PI D  V++ +  GQ QV+TCSGA+KDGSLRI+RNGIGI E ASVEL G+K +++
Sbjct: 350  YVNLAPIRDMVVINAD--GQQQVITCSGAFKDGSLRIIRNGIGIEELASVELPGVKALFT 407

Query: 353  LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
            L   ++   D +LVV F+ ET IL ++ E ELE+T++ GF +  QTL+        + Q+
Sbjct: 408  LNVESE--LDDYLVVGFVDETHILKISGE-ELEDTQLPGFSTTEQTLWAGRVGSGGIAQI 464

Query: 413  TSGSVRLVSSTSRELRNE---WKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILT 469
            T   V L+      LR     W+ P   SV      + QV+LA G   L YL + D I T
Sbjct: 465  TPLKVVLI------LRGNTLTWEPPSRISVVSVNELSGQVVLACGN-QLHYLLLTDKI-T 516

Query: 470  EVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEII 529
             + + + E+EI+C+D+  +G+    S++ AV  WTD+SV + SLPDLN I +E  GGE++
Sbjct: 517  PITNIECEFEIACIDVGCVGDEIE-SKLCAVAYWTDMSVALRSLPDLNEIVREKCGGEML 575

Query: 530  PRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTT 589
             RS+L+C  EGI YLL ALGDG L  + ++M +G LT  KK +LGTQP TL+ F S+   
Sbjct: 576  ARSLLICMMEGIVYLLVALGDGTLYYYQIDMNSGALTQPKKATLGTQPTTLKKFMSRGAR 635

Query: 590  HVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDI 649
            +VFA SDRPTVIYSS++KL++SNVNLK V+HMC  NSA + DSL +     L IG IDDI
Sbjct: 636  NVFACSDRPTVIYSSSQKLVFSNVNLKLVAHMCALNSATYTDSLVMTDGQTLVIGRIDDI 695

Query: 650  QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQ-----------SCA------------ 686
            QKLHIR++PLGE   RI +Q ++ T AI   +N+            CA            
Sbjct: 696  QKLHIRTVPLGESVSRIAYQPETGTIAILVQRNEFVDADGKHHCGHCASKMAVNASSSHP 755

Query: 687  -------------EESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
                         EE E+  V + D  T E + ++ L   E   SI SC   DD   YY 
Sbjct: 756  SVVTSATTPPIEPEEIEVSSVVVFDANTLEILHSHELGKNELAMSIKSCVLGDDPQPYYA 815

Query: 734  VGTAYVLPEENEPTKGRILVFIV----EDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQ 789
            VGTA VL +E E   GR+L+F V    E G+++L+ +KE KGA YS+    GKL+ AIN 
Sbjct: 816  VGTAVVLTDETESKSGRLLIFQVAPSSEGGRMRLVHDKEIKGAAYSIQVLMGKLVVAINS 875

Query: 790  KIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAI 849
             ++L++W       +EL+ EC    ++ ALY++T+ D ++VGDLM+S+S+L YK  E + 
Sbjct: 876  CVRLFEWT----AEKELRLECSDFDNVTALYLRTKNDVVLVGDLMRSLSVLAYKPMESSF 931

Query: 850  EERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGE 909
            E+ ARD+  NWM+A EI+D + +LGAE  FNLFTV K+     +  R +L+  G Y+LGE
Sbjct: 932  EKIARDFVTNWMTACEIIDMETFLGAEIMFNLFTVVKDCSSKDEGIRLQLQETGMYYLGE 991

Query: 910  FVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIK 969
             VN F HGSL+    D        +++GT +G +GVI  L  + Y F+ +L+T +  V K
Sbjct: 992  SVNAFCHGSLIATHIDLTPSFTTPILYGTSDGGLGVIVQLTPQFYDFVHELETRIAAVTK 1051

Query: 970  GVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV--------- 1020
                + H Q+R+F ++ +T  +  F+DGDL+E  LD+SR  ++++   + +         
Sbjct: 1052 NCMRIEHGQYRTFESDGRTEQSVGFIDGDLVEGLLDMSRDSVEKLIDGLTLPAAAGQEQK 1111

Query: 1021 --SVEELCKRVEELTRLH 1036
              ++EE+ K VE+L R+H
Sbjct: 1112 VATIEEVLKVVEDLARIH 1129


>gi|393905247|gb|EJD73911.1| CPSF A subunit region family protein [Loa loa]
          Length = 1145

 Score =  770 bits (1989), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 445/1108 (40%), Positives = 659/1108 (59%), Gaps = 90/1108 (8%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRP 62
            + P++GRIAT++LFR  GE  D L I T +Y   +++W   +SEL TRA G + DR+GRP
Sbjct: 54   ECPVFGRIATIKLFRAPGEDVDSLLILTAKYHLAIIRW-TPTSELRTRASGHIVDRVGRP 112

Query: 63   TDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGCAK 122
            ++ G I  +     ++   LYDGL KV+ + N+G+    FN+R ++L ++DI F+    +
Sbjct: 113  SETGMIATVHSSGLMV-FRLYDGLLKVVQW-NEGKDLRGFNVRCDDLYIIDITFMSDPDR 170

Query: 123  PTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGE 182
            PT+  +YQD+ + RH+K   + ++DK+     W  +NL+  A ++I VP P  G LI G 
Sbjct: 171  PTLAYIYQDD-NGRHIKVVTLNIEDKELSSPLWKHDNLEGEASMVIGVPEPAGGCLIAGP 229

Query: 183  ETIVYCS----ANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE 235
            + I Y      A  +  +P   +  +    Y  VD DG RYLL D AG L++L++   K 
Sbjct: 230  DAISYHKGGDDALRYAGVPGSRLHNTHPNCYAPVDRDGQRYLLADLAGNLYMLLLEFGKG 289

Query: 236  K---------VTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGS- 285
            +         V  +K+E LG T IA  + YLDN V +IGS +GDSQLI+L+ +P A G+ 
Sbjct: 290  QEQDESSTVSVKDMKVESLGNTCIAECMCYLDNGVCFIGSRFGDSQLIRLSTEPRADGTG 349

Query: 286  YVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQ 345
            Y+ +L+ Y NL PI D  V  +   GQ Q++TCSGAYKDG++RI+RNGIGI E ASVEL+
Sbjct: 350  YISLLDSYTNLAPIRDMTV--MRCNGQQQILTCSGAYKDGTIRIIRNGIGIEELASVELK 407

Query: 346  GIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAI 405
            GIK M++LR+  D  FD +L++SF S+T +L +N E ELE+T+I GF     TL+     
Sbjct: 408  GIKNMFTLRTR-DHEFDDYLILSFDSDTHVLLINGE-ELEDTQITGFVVDGATLWAGCLF 465

Query: 406  YNQ-LVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG 464
             +  ++QVT G V L+   + ++   WK+    ++        Q+++A G   L+YLE  
Sbjct: 466  QSTTILQVTHGEVILIDGDNIQI---WKASKWITLVAVNEITGQLVIACGA-LLIYLEAD 521

Query: 465  DGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHL 524
                  +   + E+EISC+DI PIG     S+I AVG WTD+SV + +LP L  + +E +
Sbjct: 522  SAGFKLISELECEFEISCIDITPIGNETLRSEICAVGYWTDLSVALRTLPQLMEVVREKI 581

Query: 525  GGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFS 584
             G+++ RS++L   EG  YLL ALGDG +  F ++MKTG L D KK +LGTQPI LR F 
Sbjct: 582  PGDMLSRSIMLSPMEGHVYLLVALGDGTVHYFQIDMKTGALLDPKKATLGTQPIHLRKFR 641

Query: 585  SKNTT--HVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELT 642
            S+ ++  ++F  SDRP VIYSSN+KLL+SNVNL+ VS M P  + A+PD+L +     L 
Sbjct: 642  SRCSSVHNIFVCSDRPAVIYSSNQKLLFSNVNLRMVSTMTPLYAEAYPDALVLTDGNSLV 701

Query: 643  IGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAI------------------CSLKN-- 682
            IG IDDIQKLHIR++PLGE P RI +Q ++ T A+                  C+ KN  
Sbjct: 702  IGRIDDIQKLHIRTVPLGESPSRIAYQPETNTIAVTVERLEFVDAMGKHHFGQCASKNAM 761

Query: 683  ----------------QSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSD 726
                            +  AEE E+  + LLD  TFE + ++ L+  E   S+ SC   +
Sbjct: 762  ETSSSRLSSMRREPTPECLAEEMEVSSILLLDSNTFEILHSHELEGSEMAMSLASCQLGN 821

Query: 727  DSNVYYCVGTAYVLPEENEPTKGRILVFIVEDG--KLQLIAEKETKGAVYSLNAFNGKLL 784
            DS  Y+ VGTA ++ +E E   GRI++F   +G  +++L+ EKE KGA YS+ + +GKL+
Sbjct: 822  DSQPYFVVGTAVIMSDETESKMGRIMMFQASEGPERMRLVYEKEIKGAAYSIQSMDGKLV 881

Query: 785  AAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKH 844
             A+N  ++L++W       +EL+ EC    ++ ALY++T+ D I+VGDLM+S+SLL YK 
Sbjct: 882  VAVNSCVRLFEWT----ADKELRLECSDFDNVTALYLKTKNDLILVGDLMRSLSLLSYKS 937

Query: 845  EEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGE 904
             E   E+ ARD+  NWMSA EI+D D +LGAEN++NLFTV K+S     EE  RL+ +G 
Sbjct: 938  VESTFEKVARDFMTNWMSACEIIDSDSFLGAENSYNLFTVVKDSFTVFKEEGTRLQELGL 997

Query: 905  YHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNL 964
            ++LGE VN F HGSL     D       ++++GT +G IGVI  +P   Y FL  +Q  L
Sbjct: 998  FYLGEMVNVFCHGSLTATQVDVAPLYHSSILYGTSDGGIGVIVQMPPVLYTFLHDVQKRL 1057

Query: 965  RKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEI---------- 1014
                +    ++H Q+R+F  EK++     F+DGDLIES LD+ +  + +I          
Sbjct: 1058 ADYTENCMRISHTQYRTFETEKRSEVPNGFIDGDLIESLLDMGKDSVGQIVNGLKMPLLN 1117

Query: 1015 ------SKTMNVSVEELCKRVEELTRLH 1036
                  ++ ++ + E++ K VE+L+R+H
Sbjct: 1118 IPSSETTELVDATAEDVLKLVEDLSRIH 1145


>gi|194381178|dbj|BAG64157.1| unnamed protein product [Homo sapiens]
          Length = 826

 Score =  746 bits (1926), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 382/747 (51%), Positives = 499/747 (66%), Gaps = 56/747 (7%)

Query: 340  ASVELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTL 399
            AS++L GIKG+W LRS  +   D  LV+SF+ +TR+L +N E E+EETE+ GF    QT 
Sbjct: 86   ASIDLPGIKGLWPLRSDPNRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTF 144

Query: 400  FCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLV 459
            FC +  + QL+Q+TS SVRLVS   + L +EWK P   +++VA+ N+SQV++A G   L 
Sbjct: 145  FCGNVAHQQLIQITSASVRLVSQEPKALVSEWKEPQAKNISVASCNSSQVVVAVGRA-LY 203

Query: 460  YLEIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLI 519
            YL+I    L ++ H ++E+E++CLDI P+G++   S + A+G+WTDIS RI  LP   L+
Sbjct: 204  YLQIHPQELRQISHTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELL 263

Query: 520  TKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPIT 579
             KE LGGEIIPRS+L+  FE   YLLCALGDG L  F LN++TG L+DRKKV+LGTQP  
Sbjct: 264  HKEMLGGEIIPRSILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTV 323

Query: 580  LRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEG 639
            LRTF S +TT+VFA SDRPTVIYSSN KL++SNVNLKEV++MCP NS  +PDSLA+A   
Sbjct: 324  LRTFRSLSTTNVFACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNS 383

Query: 640  ELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLK------------------ 681
             LTIGTID+IQKLHIR++PL E PR+IC+QE S+ F + S +                  
Sbjct: 384  TLTIGTIDEIQKLHIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGTTALRPSAS 443

Query: 682  ---------------------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSIL 720
                                   S  EE E+H + ++D  TFE +  +     EY  S++
Sbjct: 444  TQALSSSVSSSKLFSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLV 503

Query: 721  SCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFN 780
            SC    D N Y+ VGTA V PEE EP +GRI+VF   DGKLQ +AEKE KGAVYS+  FN
Sbjct: 504  SCKLGKDPNTYFIVGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFN 563

Query: 781  GKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLL 840
            GKLLA+IN  ++LY+W       +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL
Sbjct: 564  GKLLASINSTVRLYEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLL 619

Query: 841  IYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLE 900
             YK  EG  EE ARD+N NWMSAVEILDDD +LGAEN FNLF  +K+S   TDEER  L+
Sbjct: 620  AYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQ 679

Query: 901  VVGEYHLGEFVNRFRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEK 959
             VG +HLGEFVN F HGSLVM+ L ++      +V+FGTVNG+IG++ SL    Y  L  
Sbjct: 680  EVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLD 739

Query: 960  LQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN 1019
            +Q  L KVIK VG + H  WRSF+ E+KT  A  F+DGDLIESFLD+SR +M E+   + 
Sbjct: 740  MQNRLNKVIKSVGKIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQ 799

Query: 1020 ----------VSVEELCKRVEELTRLH 1036
                       + ++L K VEELTR+H
Sbjct: 800  YDDGSGMKREATADDLIKVVEELTRIH 826



 Score = 45.8 bits (107), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 18/31 (58%), Positives = 25/31 (80%)

Query: 3  DVPIYGRIATLELFRPHGEAQDFLFIATERY 33
          +V +YG+IA +ELFRP GE++D LFI T +Y
Sbjct: 54 EVGMYGKIAVMELFRPKGESKDLLFILTAKY 84


>gi|226510488|ref|NP_001145925.1| uncharacterized protein LOC100279448 [Zea mays]
 gi|219884971|gb|ACL52860.1| unknown [Zea mays]
          Length = 416

 Score =  736 bits (1901), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 343/416 (82%), Positives = 379/416 (91%)

Query: 621  MCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSL 680
            MCPFN+AAFPDSLAIAKEGEL+IGTIDDIQKLHIR+IPL E  RRICHQEQSRT A CS 
Sbjct: 1    MCPFNTAAFPDSLAIAKEGELSIGTIDDIQKLHIRTIPLNEQARRICHQEQSRTLAFCSF 60

Query: 681  KNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVL 740
            K     EESE H +RLLD QTFE +  YPLD +E GCSI+SCSF+DDSNVYYCVGTAYV+
Sbjct: 61   KYNQSVEESETHLIRLLDHQTFESLCVYPLDQYECGCSIISCSFADDSNVYYCVGTAYVI 120

Query: 741  PEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRD 800
            PEENEPTKGRILVF VEDG LQLI EKETKGAVYSLNAFNGKLLAAINQKIQLYKWM R+
Sbjct: 121  PEENEPTKGRILVFAVEDGSLQLIVEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMSRE 180

Query: 801  DGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANW 860
            DG+ ELQSECGHHGHILALY QTRGDFIVVGDLMKSISLL+YKHEE AIEERARDYNANW
Sbjct: 181  DGSHELQSECGHHGHILALYTQTRGDFIVVGDLMKSISLLVYKHEESAIEERARDYNANW 240

Query: 861  MSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLV 920
            M+AVE+LDD++Y+GAEN++NLFTVRKNS+ ATD+ER RLEVVGEYHLGEFVNRFRHGSLV
Sbjct: 241  MTAVEMLDDEVYVGAENSYNLFTVRKNSDAATDDERARLEVVGEYHLGEFVNRFRHGSLV 300

Query: 921  MRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWR 980
            MRLPDSD+GQIPTVIFGT+NGVIG+IASLPH+QY+FLEKLQ+ L K IKGVG L+HEQWR
Sbjct: 301  MRLPDSDIGQIPTVIFGTINGVIGIIASLPHDQYIFLEKLQSTLVKYIKGVGNLSHEQWR 360

Query: 981  SFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRLH 1036
            SF+N+KKT +A+NFLDGDLIESFLDLSR++M+E+SK M V VEEL KRVEELTRLH
Sbjct: 361  SFHNDKKTAEARNFLDGDLIESFLDLSRSKMEEVSKAMGVPVEELSKRVEELTRLH 416


>gi|395544366|ref|XP_003774082.1| PREDICTED: DNA damage-binding protein 1 [Sarcophilus harrisii]
          Length = 1239

 Score =  711 bits (1834), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 365/713 (51%), Positives = 476/713 (66%), Gaps = 56/713 (7%)

Query: 374  RILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKS 433
            R+L +N E E+EETE+ GF    QT FC +  + QL+Q+TS SVRLVS   + L +EWK 
Sbjct: 533  RVLMLNGE-EVEETELTGFVDDQQTFFCGNVAHQQLIQITSASVRLVSQEPKALVSEWKE 591

Query: 434  PPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDINPIGENPS 493
            P G +++VA+ N+SQV++A G   L YL+I    L ++ H ++E+E++CLDI P+G++  
Sbjct: 592  PQGKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQISHTEMEHEVACLDITPLGDSNG 650

Query: 494  YSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHL 553
             S + A+G+WTDIS RI  LP   L+ KE LGGEIIPRS+L+  FE   YLLCALGDG L
Sbjct: 651  MSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRSILMTTFESSHYLLCALGDGAL 710

Query: 554  LNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNV 613
              F L+++TG L+DRKKV+LGTQP  LRTF S +TT+VFA SDRPTVIYSSN KL++SNV
Sbjct: 711  FYFGLSIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVFACSDRPTVIYSSNHKLVFSNV 770

Query: 614  NLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSR 673
            NLKEV++MCP NS  +PDSLA+A    LTIGTID+IQKLHIR++PL E PR+IC+QE S+
Sbjct: 771  NLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKLHIRTVPLYESPRKICYQEVSQ 830

Query: 674  TFAICSLK---------------------------------------NQSCAEESEMHFV 694
             F + S +                                         S  EE E+H +
Sbjct: 831  CFGVLSSRIEVQDASGGTTALRPSASTQALSSSVSSSKLFSSSTAPHETSFGEEVEVHNL 890

Query: 695  RLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVF 754
             ++D  TFE +  +     EY  S++SC    D N Y+ VGTA V PEE EP +GRI+VF
Sbjct: 891  LIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFIVGTAMVYPEEAEPKQGRIVVF 950

Query: 755  IVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHG 814
               DGKLQ +AEKE KGAVYS+  FNGKLLA+IN  ++LY+W       +EL++EC H+ 
Sbjct: 951  QYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRLYEWT----AEKELRTECNHYN 1006

Query: 815  HILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG 874
            +I+ALY++T+GDFI+VGDLM+S+ LL YK  EG  EE ARD+N NWMSAVEILDDD +LG
Sbjct: 1007 NIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG 1066

Query: 875  AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMR-LPDSDVGQIPT 933
            AEN FNLF  +K+S   TDEER  L+ VG +HLGEFVN F HGSLVM+ L ++      +
Sbjct: 1067 AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS 1126

Query: 934  VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKN 993
            V+FGTVNG+IG++ SL    Y  L  +Q  L KVIK VG + H  WRSF+ E+KT  A  
Sbjct: 1127 VLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERKTEPATG 1186

Query: 994  FLDGDLIESFLDLSRTRMDEISKTMN----------VSVEELCKRVEELTRLH 1036
            F+DGDLIESFLD+SR +M E+   +            + ++L K VEELTR+H
Sbjct: 1187 FIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATADDLIKVVEELTRIH 1239



 Score =  410 bits (1054), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 198/348 (56%), Positives = 261/348 (75%), Gaps = 11/348 (3%)

Query: 17  RPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIGRPTDNGQIGIIDPD 74
           R  GE++D LFI T +Y  C+L++    ES ++ITRA G+V DRIGRP++ G IGIIDP+
Sbjct: 73  RQQGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIGRPSETGIIGIIDPE 132

Query: 75  CRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGCAKPTIVVLYQDNKD 134
           CR+IGL LYDGLFKVIP D   +  +AFNIRLEELQV+D+KFLYGC  PTI  +YQD + 
Sbjct: 133 CRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELQVIDVKFLYGCQAPTICFVYQDPQ- 191

Query: 135 ARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFK 194
            RHVKTYEV+L++K+F +GPW Q N++  A ++I VP P  G +IIG+E+I Y + + + 
Sbjct: 192 GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIIIGQESITYHNGDKYL 251

Query: 195 AIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-----KVTGLKIELLG 246
           AI    I+ S    + RVD +GSRYLLGD  G L +L++  E++      +  L++ELLG
Sbjct: 252 AIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMDGTVTLKDLRVELLG 311

Query: 247 ETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVD 306
           ETSIA  ++YLDN VV++GS  GDSQL+KLN+  + +GSYV  +E + NLGPIVD CVVD
Sbjct: 312 ETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMETFTNLGPIVDMCVVD 371

Query: 307 LERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLR 354
           LERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W LR
Sbjct: 372 LERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWPLR 419


>gi|328700785|ref|XP_001945395.2| PREDICTED: DNA damage-binding protein 1-like [Acyrthosiphon pisum]
          Length = 1072

 Score =  699 bits (1803), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 387/1003 (38%), Positives = 601/1003 (59%), Gaps = 86/1003 (8%)

Query: 8    GRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESS--ELITRAMGDVSDRIGRPTDN 65
            G+I  L++FRP  +  D +F+ T +Y   +L++   S   E++T+   +V D  G  T+ 
Sbjct: 91   GKIEILKVFRPKSKTTDLIFVVTAQYNAMILEYTQGSDKIEMMTKGHCNVFDHFGVATE- 149

Query: 66   GQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGCAKPTI 125
              +GIIDP+ +L+ L L++ +FK+IP D +G+L + ++I +EE  + DI FLYG   PTI
Sbjct: 150  -FMGIIDPNAKLVMLKLFEKMFKIIPLDKEGEL-DVYSITMEETNIQDIGFLYGFTNPTI 207

Query: 126  VVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETI 185
            +++Y+ N   R +K  ++               +++  A ++IPVP PLCG +IIGE +I
Sbjct: 208  IIIYE-NAMGRTIKIKKIIDSK--------KYKSIEKEASMVIPVPSPLCGAIIIGENSI 258

Query: 186  VY--CSANAFKAIPIRPSI-TKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-----KV 237
             Y   S N  + +PIR  I    Y RVD +G+RYLLGDH+G L +L + +EK      KV
Sbjct: 259  FYHNGSCNIIR-LPIRQKIEIVCYTRVDLEGTRYLLGDHSGCLLMLFLKYEKTLNGKFKV 317

Query: 238  TGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLG 297
            T L +   GE SI  +++YLDN V+Y+ S +GDSQLIKL+ + +  GS++ +L++Y+NLG
Sbjct: 318  TDLYLRYFGEISIPISLTYLDNKVIYVASKFGDSQLIKLHYELNETGSHLTILDQYLNLG 377

Query: 298  PIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSST 357
            PIVD C+VD++++GQ Q+VTCSGAYKDGSLRI+ NG+GI E A+++L GIKG+WSL  +T
Sbjct: 378  PIVDMCLVDIDQRGQEQIVTCSGAYKDGSLRIINNGVGIQEIATIDLLGIKGIWSLSFNT 437

Query: 358  DDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSV 417
                D  LV+SF+  +++LA + E E EE  +EGF S+ QT +C   + N++VQVTS SV
Sbjct: 438  KSDLDDTLVLSFVWHSKVLAYDSE-EAEEIYVEGFESELQTFYCGKTLDNKMVQVTSASV 496

Query: 418  RLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLE 477
            RL+   S++L +EWK P   ++N  + N  Q + ++G   L Y+EIG   + + KH  LE
Sbjct: 497  RLICMESKKLISEWKVPYFRNINAVSCNGHQAVCSSGHD-LYYIEIGSQKIFQNKHITLE 555

Query: 478  YEISCLDINPIGENPSYSQI-AAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLC 536
            +E+SCLDI    +    + I  A+G+WTD S+++  LPD   + KE+L   I+PRSV   
Sbjct: 556  HEVSCLDICLFKDKFGETIILLAIGLWTDTSIKVLKLPDFVELEKENLFEGIVPRSVSFV 615

Query: 537  AFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASD 596
              E I YL CALG+G L  F LN++TG+L    K+ LG +P  ++ F + ++  +F  SD
Sbjct: 616  TLENIHYLFCALGNGSLCYFYLNIETGKLYKNGKIKLGNRPALIKKFQTASS--IFICSD 673

Query: 597  RPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRS 656
             P +I+SSNKKL+++NVN  +V+++C  N+  +P+SL +A +  + IG ID  +K H+R+
Sbjct: 674  FPIIIHSSNKKLVFTNVNSIKVNNICMVNTDNYPNSLILATDTAIIIGVIDMEKKHHVRT 733

Query: 657  IPLGEHPRRICHQEQSRTFAICSLKNQSCAE----------------------------- 687
            IPLGE PRRI +QE S+TF I ++K     E                             
Sbjct: 734  IPLGESPRRIAYQEASKTFGIITIKKNIKDEMIKGPVHPSASTRTQNITSAMGNRFLIDQ 793

Query: 688  ------------------ESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSN 729
                              E E+  + +L   TFE +  Y L + EY  SI+S   ++D N
Sbjct: 794  HLSSSSEASSLSDSDFNPEVEISSMIILHQDTFEILHVYQLYSNEYALSIISTKLANDPN 853

Query: 730  VYYCVGTAYVLPEENEPTKGRILVFIVED--GKLQLIAEKETKGAVYSLNAFNGKLLAAI 787
             YY +GTA +  E  +P +GRI+VF  +    KL  I+EK   G  +S+  F+  L+A +
Sbjct: 854  TYYVLGTALMTEECQDPKEGRIVVFHYDTSLSKLTQISEKIVYGGCFSMVTFHDMLIATV 913

Query: 788  NQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEG 847
            N  +QLY W       ++    C  + + L+ +V+T G +I+ GDLMKS++L  YK +E 
Sbjct: 914  NSSVQLYIWT----HEKKFVLRCTQNNNSLSQHVKTNGKYILCGDLMKSLALFKYKTDEP 969

Query: 848  AIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGAT-DEERGRLEVVGEYH 906
             +E+   D    W SA+EI+DDD+++GAEN+  L+   K+S   + D +    + +G +H
Sbjct: 970  NLEKIVTDDCLKWSSAIEIIDDDLFIGAENDKYLYVFYKHSNFVSGDYQHNNFKEIGRFH 1029

Query: 907  LGEFVNRFRHGSLVMRLPDSD----VGQIPTVIFGTVNGVIGV 945
            LG+ VN FRHGSLVM+  +++    +    ++++GT++G +G+
Sbjct: 1030 LGDLVNVFRHGSLVMKQFENEYETQLSVQGSILYGTISGALGL 1072


>gi|449684814|ref|XP_004210722.1| PREDICTED: DNA damage-binding protein 1-like, partial [Hydra
            magnipapillata]
          Length = 725

 Score =  699 bits (1803), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 374/801 (46%), Positives = 514/801 (64%), Gaps = 88/801 (10%)

Query: 248  TSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDL 307
            TSI   ++YLDN VV+IGS  GDSQ++KLN +PD KGS++ +L  + NLGPI+D CVVDL
Sbjct: 1    TSIPHCLTYLDNGVVFIGSCLGDSQIVKLNTEPDKKGSFITILRSFTNLGPILDMCVVDL 60

Query: 308  ERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLR--SSTDDPFDTFL 365
            ERQGQ Q+VTCSGA+KDGSLRI+RNGIGINE AS++L GI G+W L+  S   D  DT +
Sbjct: 61   ERQGQDQLVTCSGAFKDGSLRIIRNGIGINELASIDLAGIMGLWCLKVNSINSDLHDT-M 119

Query: 366  VVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSR 425
            V+SF+ ++R+L+++  +E+EE EIEGF S  QT +C +  +NQL+Q              
Sbjct: 120  VLSFVGQSRVLSLS-TEEVEEIEIEGFSSDKQTTYCANVNFNQLIQ------------KH 166

Query: 426  ELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDI 485
               + W  P    ++VA +N+ Q++++ G   L+YLE+ D  + ++ H  LEYE++CLD+
Sbjct: 167  FFFSNWLPPDNKHISVAVSNSFQIVVSLGK-ELIYLEVEDSNIKQISHTVLEYEVACLDL 225

Query: 486  NPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEGISYLL 545
            +P G N + S    VG+WTDISVRI +LP+L  +  E L GE+IPRS+L+  FE   YLL
Sbjct: 226  SPKGSNETTSDRLCVGLWTDISVRILALPNLEELYVEKLSGEMIPRSILMITFEDKEYLL 285

Query: 546  CALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSN 605
            CALGDG L  FLLN  TG L+D+KKVSLGT+P  +++F S ++THVFA SDRPTVIYSSN
Sbjct: 286  CALGDGSLFYFLLNRLTGVLSDQKKVSLGTKPTVIQSFKSGSSTHVFACSDRPTVIYSSN 345

Query: 606  KKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRR 665
             KL++SNVNLKEV +M P N+ A+P+S           G ID I K+      L  H  +
Sbjct: 346  NKLVFSNVNLKEVCYMSPLNTQAYPNSFY--------SGHIDSIYKMFF----LVTHSHQ 393

Query: 666  ICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFS 725
                E + +   CS                                            FS
Sbjct: 394  FLENEWATSLTSCS--------------------------------------------FS 409

Query: 726  DDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLA 785
            +D N YYCVGT+ V PEE+EP +G+I++F + +GKL  I  K   GAVY L  FNGKLLA
Sbjct: 410  NDPNTYYCVGTSMVYPEESEPKEGKIILFQLFEGKLVQIGSKTVNGAVYVLQGFNGKLLA 469

Query: 786  AINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHE 845
             +N  + +Y+W       +EL+ EC +H  ILALY++++GDFI+VGDLM+S++LL YK  
Sbjct: 470  GVNSLVSVYEWT----SDKELKQECCYHNTILALYLKSKGDFILVGDLMRSMTLLAYK-P 524

Query: 846  EGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEY 905
             G +EE A D++ NWM+AVEI+DDD +LGAEN+FNLF  +K++    DEER  L+ +G+Y
Sbjct: 525  LGRLEEIAHDFSPNWMTAVEIIDDDTFLGAENSFNLFICQKDNSSVNDEERHHLQTIGKY 584

Query: 906  HLGEFVNRFRHGSLVMRLPDSDVGQI-PTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNL 964
            HLG+FVN F+HGSLVM      +  I  ++++GTV G IG++A LP   + FL ++Q  L
Sbjct: 585  HLGDFVNVFKHGSLVMHHSTEQLTPISSSILYGTVRGAIGLVAGLPKNTFDFLSQVQEKL 644

Query: 965  RKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV---- 1020
             K IK VG + HE WRSF N+KKT  A   +DGDLIES LDL+RT++ E+   + +    
Sbjct: 645  SKTIKSVGKIEHEFWRSFYNDKKTDLAVGCVDGDLIESCLDLTRTQLHEVVSGLEIEEAG 704

Query: 1021 -----SVEELCKRVEELTRLH 1036
                 +V++L K VEEL+R+H
Sbjct: 705  IKRECTVDDLIKVVEELSRIH 725


>gi|313238818|emb|CBY20011.1| unnamed protein product [Oikopleura dioica]
 gi|313245836|emb|CBY34826.1| unnamed protein product [Oikopleura dioica]
          Length = 1135

 Score =  697 bits (1799), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 397/1088 (36%), Positives = 622/1088 (57%), Gaps = 71/1088 (6%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
            +YGRIA +E+FR   E +D LFI TE    C+L++     ++ITRA GD+ D+    + +
Sbjct: 62   LYGRIAVIEVFRYKNEKKDCLFILTESCYACILEY--VDGKIITRAYGDMRDKNYSVSQS 119

Query: 66   GQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGCAKPTI 125
            G    +DP+ R I L LYDG+ K+I  ++  +   +   R+EE+ V+D+ FL+   KPT+
Sbjct: 120  GMHACVDPEARCIALRLYDGVLKIINLNSSSKHLTSAEQRIEEILVVDMCFLHTANKPTL 179

Query: 126  VVLYQDNKDARHVKTYEVALKDK----DFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIG 181
             +LY DN  +RH+ T  + L +        +GP+    ++    L++ VP PL G+L++G
Sbjct: 180  ALLYDDNS-SRHLSTIAITLDNSGSGASIHKGPFRHTQVEQDTILIVAVPEPLAGILLLG 238

Query: 182  EETIVYCSANAFKAIPIRPSITKAYGRVDA-DGSRYLLGDHAGLLHLLVITHEKEKVTGL 240
               I Y  +       I   + +    V   D  RYL GD  G L LL++ + + ++   
Sbjct: 239  HVNITYHDSKNRSTCSIENIVKRTIECVTPIDKHRYLCGDSNGELFLLLLDYNENRIPEE 298

Query: 241  KIEL----LGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNL 296
            ++ L    LG T++ +T+SY+DN VV++GS++GDS+LI++ +  +  G +   L +Y NL
Sbjct: 299  RMRLATKYLGRTTLPNTLSYIDNYVVFVGSTFGDSELIRIEVSDNNSGQHFTSLHQYDNL 358

Query: 297  GPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSS 356
            GPI D C+VD E+QGQGQ+VT SG    GSLRI+RNG+GI+E AS++L+G+KG+W+L+  
Sbjct: 359  GPIKDMCIVDFEKQGQGQLVTASGVGTGGSLRIIRNGVGIHEYASIDLEGVKGLWALKYL 418

Query: 357  TDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGS 416
            +       L++SF+ +T  L +  +D  E  EI GF +  QT++  +    Q +Q+T   
Sbjct: 419  SSSTKQDSLLLSFVGQTIFLRLEGQDVTEVEEIPGFTNGEQTMYAGNVTDQQFLQITEKQ 478

Query: 417  VRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQL 476
            VRL++  S  L+  W+      +N+ + N +QVLL  G    +YLEI D  + E      
Sbjct: 479  VRLIADES--LKGSWEPEENTQINLCSVNKNQVLLGVGST-AIYLEINDCEIVEKSRHVF 535

Query: 477  EYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLC 536
            + EI+C+DI+P+ +  S S    +G+W +++V +  LP + +I K  LGG IIPRSVLL 
Sbjct: 536  DSEIACVDISPLQKEMS-SDFFTIGLW-NVTVSVNKLPSMEVIAKMELGGNIIPRSVLLN 593

Query: 537  AF--EGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAA 594
            +F      YLL ++GDG L    LN +    + +K++ LGTQP +L  F + N + VFA 
Sbjct: 594  SFGENNTPYLLVSIGDGALFYIKLN-EDHSFSSKKRIQLGTQPTSLNKFQTSNGSTVFAC 652

Query: 595  SDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHI 654
            SDRP VI+S+N+K+ +SNVNLK+V+HMC  ++  +P++LA+  E  L IG IDDIQKLH 
Sbjct: 653  SDRPAVIHSTNEKIFFSNVNLKQVNHMCVLDTEGYPNALALVNENALLIGKIDDIQKLHT 712

Query: 655  RSIPLGEHPRRICHQEQSRTFAI-----------------------------CSLK---- 681
             +I L E P  + H E+   FA                              C  K    
Sbjct: 713  STIRLNETPHSVLHYEEREVFAYLGEFDEEDLRDTRPDQESTKKLFTPLSIQCPYKSGVV 772

Query: 682  ----NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTA 737
                + S    + ++ + + D+ TF     +  D  E    +           +  VGTA
Sbjct: 773  ERDDSNSLTHYTMVNTLVICDEITFTPKWAHFFDVGEISSCMCIAKLGKKDEQFIVVGTA 832

Query: 738  YVLPEENEPTKGRILVF-IVEDGKLQLIAEKETKGAVYSLNAFNG-KLLAAINQKIQLYK 795
             +  +E E   GRI VF   ++ KL L++ K+  GAVYS+ A NG K++ AINQ++++++
Sbjct: 833  -ITADEQECKNGRICVFSYSKEEKLTLVSTKQVNGAVYSVKALNGNKIICAINQQLKVFE 891

Query: 796  WMLRDDGTRELQSECGHHGHILALYVQ-TRGDFIVVGDLMKSISLLIYKHEEGAIEERAR 854
                 +    LQSE     HI  + V  ++  FI+  DLM+SIS+  YK  EGA+EE AR
Sbjct: 892  M----NEQTTLQSEAPIANHITCVAVDVSKNGFILSADLMRSISVFSYKPLEGALEEIAR 947

Query: 855  DYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRF 914
            DY+ NWM+A++++DDD Y+GAEN+ N+F   +N+E   +E+R +L   G YH+GE +N  
Sbjct: 948  DYHPNWMTAIKMIDDDNYIGAENSENIFICTRNTEAPDEEDRQQLLPTGYYHVGEHINTI 1007

Query: 915  RHGSLVMRLP-DSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGG 973
              G+LVM +  +S +    T + G+V+G +G++A  P +Q+ FL KL+  +RKVI+GVG 
Sbjct: 1008 VEGNLVMDVHVESSITPTRTFLMGSVSGYVGLLAIFPEKQWQFLSKLEAKMRKVIRGVGK 1067

Query: 974  LNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVE-----ELCKR 1028
            ++HE WR F ++ +  D K F+DGDLIE F DL   +  E+   + +  E     ++ + 
Sbjct: 1068 IDHESWRRFESDSRMEDCKGFVDGDLIEMFQDLRPEKQKEVISELTMDGEPATHDDVVRL 1127

Query: 1029 VEELTRLH 1036
            V++L RLH
Sbjct: 1128 VDDLCRLH 1135


>gi|255316764|gb|ACU01763.1| putative DNA damage binding protein [Brachypodium distachyon]
          Length = 384

 Score =  667 bits (1721), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 313/384 (81%), Positives = 352/384 (91%), Gaps = 1/384 (0%)

Query: 654  IRSIPLGEHPRRICHQEQSRTFAICSLK-NQSCAEESEMHFVRLLDDQTFEFISTYPLDT 712
            IR+IPL E  RRICHQEQSRT A CS K N +  EESE HF+RLLD QTFEF+ST+PLD 
Sbjct: 1    IRTIPLNEQARRICHQEQSRTLAFCSFKYNPNSMEESEAHFIRLLDHQTFEFLSTHPLDQ 60

Query: 713  FEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGA 772
            +E GCS++SCSFSDD+N YYCVGTAYVLPEENEPTKGRILVF VEDG+LQLI EKETKGA
Sbjct: 61   YECGCSMISCSFSDDNNFYYCVGTAYVLPEENEPTKGRILVFAVEDGRLQLIVEKETKGA 120

Query: 773  VYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGD 832
            VYSLNAFNGKLLAAINQKIQLYKWM R+DG+ ELQSECGHHGHILAL+ QTRGDFIVVGD
Sbjct: 121  VYSLNAFNGKLLAAINQKIQLYKWMTREDGSHELQSECGHHGHILALFTQTRGDFIVVGD 180

Query: 833  LMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGAT 892
            LMKSISLL+YKHEE AIEE ARDYNANWM+AVE++DDDIY+GAEN++NLFTVRKNS+ AT
Sbjct: 181  LMKSISLLVYKHEESAIEELARDYNANWMTAVEMIDDDIYVGAENSYNLFTVRKNSDAAT 240

Query: 893  DEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHE 952
            DEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPD+++GQIPTVIFGT+NGVIG+IASLPH+
Sbjct: 241  DEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDTEMGQIPTVIFGTINGVIGIIASLPHD 300

Query: 953  QYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMD 1012
            QY+FLEKLQ+ L K IKGVG L+H+QWRSF+NEKKT +A+NFLDGDLIESFLDL+R++M+
Sbjct: 301  QYVFLEKLQSILGKFIKGVGSLSHDQWRSFHNEKKTAEARNFLDGDLIESFLDLNRSKME 360

Query: 1013 EISKTMNVSVEELCKRVEELTRLH 1036
            E+SK M VSVE L KRVEELTRLH
Sbjct: 361  EVSKGMGVSVENLSKRVEELTRLH 384


>gi|219125301|ref|XP_002182922.1| damage-specific DNA binding protein 1 [Phaeodactylum tricornutum CCAP
            1055/1]
 gi|217405716|gb|EEC45658.1| damage-specific DNA binding protein 1 [Phaeodactylum tricornutum CCAP
            1055/1]
          Length = 1284

 Score =  607 bits (1564), Expect = e-170,   Method: Compositional matrix adjust.
 Identities = 411/1176 (34%), Positives = 621/1176 (52%), Gaps = 164/1176 (13%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESS-----ELITRAMGDV 55
            +  VPI GRI  L  F+ HG    ++F+ T R ++ VL +D  +S      L+T A G +
Sbjct: 64   LASVPINGRIVGLVPFKVHGSDTSYVFVLTARQQYAVLAYDRTNSGSAAYPLVTLASGTL 123

Query: 56   SDR----IGRPTDNGQIGIIDPDCRLIGLHLYDGLFKVIP----FDNKGQLKEAFNIRLE 107
              +    +G+  ++G I  ID   R I LH+YDGL  +IP      ++  L   F+ R+E
Sbjct: 124  QSQEHAVLGQEAESGPIVAIDHFHRCIALHVYDGLLTIIPSTRTLASQQLLGTPFHCRIE 183

Query: 108  ELQVLDIKFLYGC--AKPTIVVLYQDNKDARHVKTYEVALKDKD-FVEGP--------WS 156
            E  +L + FL     A P + VL+QD + A+H+ ++ +  K K+ F+ G         W 
Sbjct: 184  ERTILHLAFLQIPFEALPQLAVLHQDARGAQHITSHVINWKRKNIFLYGSSSAPAATEWL 243

Query: 157  Q-NNLDNGADLLIPVP---PP--------LCGVLIIGEETIVYCSANAFKAIPIRPSITK 204
            Q +N+D G+ L+IPVP   PP          GVL++G+  + + + N  K +PI  ++  
Sbjct: 244  QKSNVDGGSSLIIPVPAEAPPDFAPAKHRSGGVLVVGQRQLTFINNNVTKVVPIPQALHL 303

Query: 205  AYGRVDADGS----RYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNA 260
                + AD +    RYLL D  G LH++ I    +KV  L+I+ LG  ++A++I+YL   
Sbjct: 304  CVEELPADPNGGLPRYLLADEFGNLHMVTIVLVVDKVVALQIDTLGSCTLATSIAYLREG 363

Query: 261  VVYIGSSYGDSQLIKLNLQP-----------DAKGSYVEVLERYVNLGPIVDFCVVD--- 306
            +V++GS+ GD QLI+++ +P            A+ SY+ V+E Y +LGPI+DF +V    
Sbjct: 364  LVFVGSTLGDPQLIQIHDEPIVDVEDEEDMVGAESSYLSVVEEYTHLGPILDFDLVPTAP 423

Query: 307  ----------LERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSS 356
                      +    Q QVVT SG+ K GSLR++RNGIG+NE A+VE+ GI+ +WSLR S
Sbjct: 424  GGGGLGQTEGIHGPSQSQVVTASGSSKSGSLRLIRNGIGMNESAAVEIPGIQNVWSLRRS 483

Query: 357  TDDPFDTFLVVSFISETRILAMNLEDELEETEIEG---------------FCSQTQTLFC 401
              D  DT+LV SF+ ETR+L +   D++ + E EG                 S   TL+ 
Sbjct: 484  FADVDDTYLVQSFVHETRVLGVTTMDDMSQDEKEGDVAPGGTLEEVFLIGLKSSCATLYV 543

Query: 402  HD--AIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATAN-ASQVLLATGGGHL 458
             +  A  N L+Q+T G VR   +T   + + W  P G ++ V TAN A Q+ +A  GG +
Sbjct: 544  GNVQAHQNGLLQITEGEVRF--ATMEAVLDTWLVPSGAAITVGTANEAGQIAVALNGGKV 601

Query: 459  VYLEIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNL 518
            +YL+I +G + E    Q+E E+SCL++NP       S   AVG+W D +VR+F    L L
Sbjct: 602  LYLKIEEGKIRECSGQQMEREVSCLNLNPFAATSHTSSFLAVGLWDDFTVRLF----LCL 657

Query: 519  ITKEHLGGEIIPRSVLLCAFEG----ISYLLCALGDGHLLNFLLNMKTGEL--TDRKKVS 572
            IT +   G     +    +       ++ L   LGDG L++F +  +   +    +K+V 
Sbjct: 658  ITLDFSSGTSGNTTSTSTSLSSTGSGVNMLFVGLGDGTLISFAVVERGASIFVQSKKEVC 717

Query: 573  LGTQPITLRTF-SSKNTTHVFAASDRPTVIYSS----------NKKLLYSNVNLKE---- 617
            LGTQ I L    + +  T V A  DRPTVIY +          N KL YSNVNL      
Sbjct: 718  LGTQRIDLVPLCTEQGGTCVLATGDRPTVIYLAGVGGISANQFNPKLCYSNVNLSAGDDE 777

Query: 618  ---------------VSHMCPFNSAAFPDS---------LAIAKEGELTIGTIDDIQKLH 653
                           V+   PF+S+   DS         L +A +  L +G IDDIQKLH
Sbjct: 778  EEDDVSRPPSQQSIVVNVATPFSSSLLFDSATGGSQRYSLCVADDSFLRMGIIDDIQKLH 837

Query: 654  IRSIPLGEHPRRICHQEQSRTFAICSLKN-----QSCAEESEM-HFVRLLDDQTFEFIST 707
            + +  LG  P RI H    R FA+  +++         +E+ M + +R +DD  F+ I  
Sbjct: 838  VTTCRLGMAPCRIVHCADGRLFAVGCIESGIKQFSLGGDEANMGNCIRFMDDANFDDIHR 897

Query: 708  YPLDTFEYGCSILSCSFSDDSN----------VYYCVGTAYVLPEENEPTKGRILVFIVE 757
              L+ FE   S++  +    S+           +  VGTAY +P+E+EP++GRILV+  +
Sbjct: 898  VDLEPFEMILSMVYATLRIPSDGDQSDQPVHRPFLLVGTAYAMPDEDEPSRGRILVYSCQ 957

Query: 758  DGK----------LQLIAEKETKGAVYSLNAF-NGKLLAAINQKIQLYKWMLRDDGTREL 806
              +          ++ I E  T+G VYS+  F +G  L  +N K  + + ++ D G   L
Sbjct: 958  ADEASGTPTSTRAVRQITEMSTQGGVYSICQFYDGNFLCTVNSKTHVVQ-IVADCGVLRL 1016

Query: 807  Q-SECGHHGHILALYVQTRGD-FIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAV 864
            +    GHHGHI++L+V++R     +VGDLM+S+SL+ Y  +   +EE ARD+N NW +AV
Sbjct: 1017 EYVGIGHHGHIVSLFVKSRAKPLAIVGDLMRSVSLMQYYPQHETLEEVARDFNPNWTTAV 1076

Query: 865  EILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRL- 923
            E+L DD+Y+GAEN  NLF +R+N    ++E R RL+ +GE+HLGE  N+F  GSLVM + 
Sbjct: 1077 EMLTDDVYIGAENWNNLFCLRRNKAATSEEIRCRLDNIGEFHLGEMCNKFMSGSLVMPVS 1136

Query: 924  PDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFN 983
             +S        +FGTV G +GVI  L      F   L+  + K I+ VGG +H+ +RS  
Sbjct: 1137 SNSTTSSRRATLFGTVEGSLGVILGLDGRTAAFFITLERAIAKTIQPVGGFSHQLYRSCQ 1196

Query: 984  NEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN 1019
             E +   A  F+DGDL+E+FLDL R  M+ +   MN
Sbjct: 1197 AELRVHPAHGFVDGDLVETFLDLDRRTMEAVVAEMN 1232


>gi|384500266|gb|EIE90757.1| hypothetical protein RO3G_15468 [Rhizopus delemar RA 99-880]
          Length = 1057

 Score =  604 bits (1558), Expect = e-170,   Method: Compositional matrix adjust.
 Identities = 363/957 (37%), Positives = 559/957 (58%), Gaps = 51/957 (5%)

Query: 99   KEAFNIRLEELQVLDIKFLYGCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQN 158
            ++  NI L + +V+ + FL     PT+++LY+D  + R ++ + +  KD+  V G    +
Sbjct: 132  QDHINISLPDKKVISLAFLQDTLDPTLLILYEDALEQRLLQMFTI--KDRQLVPGDIILD 189

Query: 159  NLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFK-AIPIRPSITKAYGRVDADGSRYL 217
            + ++ A LLI +PP + GVL++  + I Y   N    AI IR S   ++  ++ +GSR L
Sbjct: 190  HFESDASLLIAMPPAVGGVLLVASKFIRYLKPNQPPIAIGIRSSTINSHCIMNEEGSRVL 249

Query: 218  LGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLN 277
            LGD  GLL+LL +    + V  L    LG  SI S ++YLDN +V++GS+  DSQL+ + 
Sbjct: 250  LGDAEGLLYLLALNTTNQCVESLSFIYLGSISIPSCLAYLDNDIVFVGSNLADSQLVYIQ 309

Query: 278  LQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGIN 337
                     ++++E + NLGPI DFCV D  + GQ QV+TCSGA K+GSLRI+RNG+G+N
Sbjct: 310  RTTGESEDILQIIETFANLGPITDFCVAD--KGGQTQVITCSGAGKEGSLRIIRNGVGLN 367

Query: 338  EQASVELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCS--- 394
            E A + + G+KG+W+L       +D  L++SF+ +TR+L +  +  +   +++ F +   
Sbjct: 368  ELAMIPISGVKGIWALGE-----YD-LLLMSFVHQTRLLQLQKDHTI--VQLDTFSAIDL 419

Query: 395  QTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATG 454
              +TL     +   ++QVT  SVRL+ + S  L + W S     + VA+ N +Q +++ G
Sbjct: 420  NARTLVAGCVVDGMIIQVTDHSVRLMDTMS--LLDVWSSDE--LITVASVNPTQCVISLG 475

Query: 455  GGHLVYLEIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMW-TDISVRIFSL 513
             G LV L++ +  L  +   +L +EISC+DI+PIG   + S   A+G W ++ +V + SL
Sbjct: 476  FGKLVALQVLNRKLNVIGETRLSFEISCIDIHPIGSR-TESAFVALGTWNSNTNVCLLSL 534

Query: 514  PDLNLITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSL 573
             DL  I ++ LGG ++PRS+L+  FE   YLL ALGDG   NF L+  +G+L+D+K+  L
Sbjct: 535  SDLQPIAQKSLGGTVVPRSILISQFENTVYLLVALGDGQFYNFKLDSISGQLSDKKRTFL 594

Query: 574  GTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSL 633
            G  PI L  FS    THVFAASD+P+V++S N+KL+YSNVNLKEV     F   AFPD++
Sbjct: 595  GKLPIHLSNFSLNGVTHVFAASDKPSVVHSRNQKLVYSNVNLKEVRCATSFGCHAFPDAV 654

Query: 634  A-IAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICS--LKNQSCAEESE 690
            A I KEG L IG +++IQKLHI  IP  + PRRI +QE +++F I +  + +      + 
Sbjct: 655  ALITKEG-LIIGQMEEIQKLHITKIPTIDTPRRIVYQESTKSFGIITERMISDRYRPSTV 713

Query: 691  MHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGR 750
                 +LDDQ+F  +       FE   S+ S  F DD N YY V +     +EN+   GR
Sbjct: 714  TGGFEVLDDQSFTVLDRIYFKEFERPLSVTSVLFEDDPNEYYIVASG----KENDGL-GR 768

Query: 751  ILVF-IVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWM---LRDDGTREL 806
            ILV  +  D KL+LI++ +T G +  +    GKLLA+I   + LY+W    L    +R L
Sbjct: 769  ILVLQLASDRKLRLISQLKTGGMIDCVRPIEGKLLASIQGTLYLYRWQSQRLVKVSSRRL 828

Query: 807  QSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEI 866
             S        +   + T  +FI+ GDL  S+ +  Y  +   + E A       + A++ 
Sbjct: 829  PS--------VTRCMTTHENFIMTGDLAYSVVMFQYDRQSDQLLEVAAHEKTKEVLAMKA 880

Query: 867  LDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDS 926
            +D ++ +GAE   +LF +    +  + +E   L+V+  +HLG+ V+RFR GSL M   D 
Sbjct: 881  IDSNLVIGAEREGHLFVLEHCQDEVSADE-PLLDVISTWHLGDVVSRFRFGSLGMNNVDP 939

Query: 927  DVGQI-PTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNE 985
            D   I P++IF T +G IGVIA L  E+Y  L ++Q N+ +V+KG+G L+H  WR+ N  
Sbjct: 940  DSSPIAPSLIFATASGAIGVIADLSPERYKLLYQMQCNMCRVVKGIGELSHTDWRNVNIM 999

Query: 986  KKTVDAKNFLDGDLIESFLDLSRTRMDEI------SKTMNVSVEELCKRVEELTRLH 1036
             +  +A NF+DGDLIESFLDLS  +M  +       + ++++VE+LCK VEEL  +H
Sbjct: 1000 YRKEEAMNFIDGDLIESFLDLSSQQMQNVVDGLHGGRKLDLTVEDLCKVVEELMSIH 1056


>gi|17541566|ref|NP_502299.1| Protein DDB-1 [Caenorhabditis elegans]
 gi|74965443|sp|Q21554.2|DDB1_CAEEL RecName: Full=DNA damage-binding protein 1; AltName:
            Full=Damage-specific DNA-binding protein 1
 gi|5824558|emb|CAA92824.2| Protein DDB-1 [Caenorhabditis elegans]
          Length = 1134

 Score =  596 bits (1536), Expect = e-167,   Method: Compositional matrix adjust.
 Identities = 368/1098 (33%), Positives = 603/1098 (54%), Gaps = 81/1098 (7%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRP 62
            ++PIYG++ T+ L +   + +  L + TE++   +L +     +++TRA G ++D  GR 
Sbjct: 54   EIPIYGQVLTIALVKCKRDKRHSLIVVTEKWHMAILAY--RDGKVVTRAAGCIADPTGRA 111

Query: 63   TDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL-YGCA 121
            TDN    +      LI +  ++G  K+I +++   L+  FN+R +   V D KF+  G  
Sbjct: 112  TDN-LFSLTIHRNGLIAIRAFEGSVKMIQWESGTDLRH-FNVRFDYPNVSDFKFVDTGED 169

Query: 122  KPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIG 181
                V    D+   +H++  ++ + DK+F      Q ++   + +LIPVP  + GV+++G
Sbjct: 170  DVYRVAFIYDDDHGKHLQFSDLNMHDKEF-RTYSRQASIAADSSVLIPVPHAIGGVIVLG 228

Query: 182  EETIVY-CSANAFKAIPIRPSITK-----AYGRVDADGSRYLLGDHAG-LLHLLVITHEK 234
              +++Y  + N  + +P   S+ +      +G VDA G R+LL D  G LL LL+   E 
Sbjct: 229  SNSVLYKPNDNLGEVVPYTCSLLENTTFTCHGIVDASGERFLLSDTDGRLLMLLLNVTES 288

Query: 235  EK---VTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLE 291
            +    V  ++I+ LGETSIA +I+Y+DN VV++GS  GDSQLI+L  +P+  GSY  +LE
Sbjct: 289  QSGYTVKEMRIDYLGETSIADSINYIDNGVVFVGSRLGDSQLIRLMTEPNG-GSYSVILE 347

Query: 292  RYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMW 351
             Y N+GPI D  +V  E  GQ Q+VTC+GA KDGSLR++RNGIGI+E ASV+L G+ G++
Sbjct: 348  TYSNIGPIRDMVMV--ESDGQPQLVTCTGADKDGSLRVIRNGIGIDELASVDLAGVVGIF 405

Query: 352  SLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQ--- 408
             +R   D   D +++VS   ET +L +  E ELE+ ++    +   T+F           
Sbjct: 406  PIR--LDSNADNYVIVSLSDETHVLQITGE-ELEDVKLLEINTDLPTIFASTLFGPNDSG 462

Query: 409  -LVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYL------ 461
             ++Q T   +RL+SS+   L   W+   G  ++  + NA+   +       VYL      
Sbjct: 463  IILQATEKQIRLMSSSG--LSKFWEPTNGEIISKVSVNAANGQIVLAARDTVYLLTCIVD 520

Query: 462  EIGDGILTEVKHAQLEYEISCLDINPIGENPS-YSQIAAVGMWTDISVRIFSLPDLNLIT 520
            E+G   +      + E EI+CLD++  G++P+  +    +  W+  ++ +  LPDL  + 
Sbjct: 521  EMGALDIQLTAEKKFENEIACLDLSNEGDDPNNKATFLVLAFWSTFAMEVIQLPDLITVC 580

Query: 521  KEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITL 580
               L  +IIPRS++    E + YLL A GDG L+ ++ ++KTG   + KK ++GT+P +L
Sbjct: 581  HTDLPTKIIPRSIIATCIEEVHYLLVAFGDGALVYYVFDIKTGTHGEPKKSNVGTRPPSL 640

Query: 581  RTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGE 640
                +KN  H+F  SDRP +I+S++KKL++SNVN+K V  +C  +S+A+ D L I+    
Sbjct: 641  HRVRNKNRQHLFVCSDRPVIIFSASKKLVFSNVNVKLVDTVCSLSSSAYRDCLVISDGNS 700

Query: 641  LTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEE------------ 688
            +  GT+DDIQK+H+RSIP+GE   RI +Q+ + T+ +CS + +S AE             
Sbjct: 701  MVFGTVDDIQKIHVRSIPMGESVLRIAYQKSTSTYGVCSNRTESKAERVFASKNALVTSQ 760

Query: 689  -----------------SEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVY 731
                             +      +LD  TF+ + ++    +E   S +S  F++DS+ Y
Sbjct: 761  SRPKVASTRADMDESPPNTTSSFMVLDQNTFQVLHSHEFGPWETALSCISGQFTNDSSTY 820

Query: 732  YCVGTAYVLPEENEPTKGRILVFIVED---GKLQLIAEKETKGAVYSLNAFNGKLLAAIN 788
            Y VGT  + P+E E   GRI+VF V+D    KL+ + E   +G+  ++   NGKL+AAIN
Sbjct: 821  YVVGTGLIYPDETETKIGRIVVFEVDDVERSKLRRVHELVVRGSPLAIRILNGKLVAAIN 880

Query: 789  QKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGA 848
              I+L++W       +EL+ EC    H++AL ++   + + V D+M+S+SLL Y+  EG 
Sbjct: 881  SSIRLFEWTT----DKELRLECSSFNHVIALDLKVMNEEVAVADVMRSVSLLSYRMLEGN 936

Query: 849  IEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR-KNSEGATDEERGRLEVVGEYHL 907
             EE A+D+N+ WM   E +  +  LG E + NLFTV    +   TD+ R  LE  G ++L
Sbjct: 937  FEEVAKDWNSQWMVTCEFITAESILGGEAHLNLFTVEVDKTRPITDDGRYVLEPTGYWYL 996

Query: 908  GEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKV 967
            GE        +LV++  DS +     ++FGT  G IG+I  +  +   FL  ++  +   
Sbjct: 997  GELPKVMTRSTLVIQPEDSIIQYSQPIMFGTNQGTIGMIVQIDDKWKKFLIAIEKAIADS 1056

Query: 968  IKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRT-RMDEISKTMNVSVE--- 1023
            +K    + H  +R+F  +K+      F+DGDL+ES LD+ R+  MD +SK  +   +   
Sbjct: 1057 VKNCMHIEHSSYRTFVFQKRAEPPSGFVDGDLVESILDMDRSVAMDILSKVSDKGWDPSL 1116

Query: 1024 -----ELCKRVEELTRLH 1036
                 E+ K +E+L R+H
Sbjct: 1117 PRDPVEILKVIEDLARMH 1134


>gi|268536658|ref|XP_002633464.1| C. briggsae CBR-DDB-1 protein [Caenorhabditis briggsae]
          Length = 1134

 Score =  578 bits (1491), Expect = e-162,   Method: Compositional matrix adjust.
 Identities = 374/1102 (33%), Positives = 598/1102 (54%), Gaps = 85/1102 (7%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            + ++PIYG+++T+ L +   E Q  L + T+++   VL +     ++ITRA G +++  G
Sbjct: 52   ICEIPIYGQVSTISLIKFRREKQHSLVVVTDKFNLAVLGY--RDGKVITRAAGSIAEYSG 109

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            R  D   + +      LI +  Y+G  K++ ++    ++  FN+R +   V D KF+   
Sbjct: 110  RSADTA-VAMTIHRSGLIAIIGYEGSVKMVHWEPGADVRH-FNVRFDYPNVSDFKFVDTG 167

Query: 121  AKPTIVVLY-QDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLI 179
               T+ V +  D+   +H++  ++ + DK+F      Q+ +   + +LIPVP P+ GV++
Sbjct: 168  DDDTVRVAFIYDDDHGKHLQFSDLNMHDKEF-HTFSKQSCIAADSQVLIPVPAPVGGVIV 226

Query: 180  IGEETIVYCSANA------FKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVIT-- 231
            +G  + +Y +++       +    ++ +I   +G VDA G R+LL D  G L +L++   
Sbjct: 227  LGANSALYKASDVNGDVVPYSCSLLKNTIFTCHGIVDASGDRFLLADTDGRLLMLLLNIG 286

Query: 232  --HEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEV 289
                   V  ++IE LGETS+A +++Y+DN VV++GS  GDSQLI+L   P+  GSY  V
Sbjct: 287  EGRSGTTVKEMRIEYLGETSVADSVNYVDNGVVFVGSRLGDSQLIRLMTAPNG-GSYSVV 345

Query: 290  LERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKG 349
            LE Y N GPI D  +V  E  GQ Q+VTCSGA KDGSLR++RNGIGI E ASV+L  + G
Sbjct: 346  LETYTNTGPIRDMVLV--ESDGQPQLVTCSGADKDGSLRVIRNGIGIEELASVDLAKVIG 403

Query: 350  MWS--LRSSTDDPFDTFLVVSFISETRILAMN---LED-ELEETEIEGFCSQTQTLFCHD 403
            M+   LRS+TD+    F++VS   ET +L +    LED +L E E E       +LF  D
Sbjct: 404  MFPIRLRSTTDN----FVIVSLPDETHVLKITGEELEDVQLLEIETERTTMYASSLFGPD 459

Query: 404  AIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYL-- 461
                 ++QVT   +R +S   +     W+   G SV+  + NA    +       VY   
Sbjct: 460  D-SELILQVTEEEIRFMSFQKQV--KIWRPTNGESVSKVSVNAIHGQIVVAARDTVYYLK 516

Query: 462  ----EIGDGILTEVKHAQLEYEISCLDINPIGEN-PSYSQIAAVGMWTDISVRIFSLPDL 516
                E G   +  V   + E EI+CLDI+  G++         + +W   S+ +  L DL
Sbjct: 517  CMVDEAGALDIQIVAERKFEAEIACLDISNEGDDYKKPGTFMVLALWGSFSMEVVQLSDL 576

Query: 517  NLITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQ 576
                + +L  +I+PRS++    E + YLL A GDG L+ ++ ++KTG   + KK S+GT+
Sbjct: 577  KTACQTNLPSKIVPRSIVATCIEDVHYLLVAFGDGALIYYVFDIKTGTHGEAKKSSVGTR 636

Query: 577  PITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIA 636
            P +L    +KN  H+F  SDRP +I+S+NKKL++SNVN+K V+ +C  +S ++ D L I+
Sbjct: 637  PPSLHRVRNKNRQHLFVCSDRPVIIFSANKKLVFSNVNVKVVNTVCSLSSTSYRDCLVIS 696

Query: 637  KEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAE--------- 687
                +  GT+DDIQK+HIRSIP+GE   RI +Q+ + T+ +CS ++++  E         
Sbjct: 697  DGNSMVFGTVDDIQKIHIRSIPMGESVMRIAYQKSTGTYGVCSSRSETKPERVYASKNAL 756

Query: 688  ---ESEMHFVR-----------------LLDDQTFEFISTYPLDTFEYGCSILSCSFSDD 727
               +S    V                  +LD  TF+ +  +    FE   S +S  FSDD
Sbjct: 757  ANSQSRAKIVATRNEVNDGPPASTSSFMILDQNTFQVLHAHEFGPFEAAVSCISGQFSDD 816

Query: 728  SNVYYCVGTAYVLPEENEPTKGRILVFIVED---GKLQLIAEKETKGAVYSLNAFNGKLL 784
            +  YY VGT  + P+E++   GRI+VF V+D    KL+ + E   +G+  +L   NGKL+
Sbjct: 817  ARQYYIVGTGLIYPDESDTKLGRIIVFEVDDVERTKLRRVHELVVRGSPLALRILNGKLV 876

Query: 785  AAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKH 844
            AAIN  ++L++W       + L+ EC +  HI+AL ++   + + V DLM+S+SLL Y+ 
Sbjct: 877  AAINSSVRLFEWT----ADKVLRLECSNFNHIVALDLKVMNEEVAVADLMRSVSLLSYRM 932

Query: 845  EEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR-KNSEGATDEERGRLEVVG 903
             EG  EE A+D+N+ WM   E +  +  LG E + N+FTV    S   TD+ R  LE  G
Sbjct: 933  MEGNFEEVAKDWNSEWMVTCEFITAESILGGEAHLNMFTVEVDKSRPITDDGRYVLEPTG 992

Query: 904  EYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTN 963
             ++LGE        SLV++  DS +     ++FGT  G IG++  +  +   FL  ++  
Sbjct: 993  YWYLGELPKVMVRASLVVQPEDSTIEYSHPIMFGTNQGTIGMLVQIDDKWKKFLVSIEKA 1052

Query: 964  LRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTM----- 1018
            +   +K    + H  +RSF  +K+      F+DGDL+ES LD+ R+   +I K +     
Sbjct: 1053 ISDSVKNCMQIEHSTYRSFIFQKRIEPPSGFIDGDLVESILDMDRSVAIDILKKVSDKGW 1112

Query: 1019 NVSVE----ELCKRVEELTRLH 1036
            + S+     E+ K +E+L R+H
Sbjct: 1113 DASLPRDPVEMLKVIEDLARMH 1134


>gi|308477185|ref|XP_003100807.1| CRE-DDB-1 protein [Caenorhabditis remanei]
 gi|308264619|gb|EFP08572.1| CRE-DDB-1 protein [Caenorhabditis remanei]
          Length = 1154

 Score =  570 bits (1469), Expect = e-159,   Method: Compositional matrix adjust.
 Identities = 378/1123 (33%), Positives = 602/1123 (53%), Gaps = 111/1123 (9%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRP 62
            ++PIYG++ T+ L R   E +  L + TE+++  VL +     ++ITR  G ++D+ GR 
Sbjct: 54   EIPIYGQVLTMALVRCKREKRQSLVVVTEKWQMAVLTY--RDGKVITRTAGALADQSGRA 111

Query: 63   TDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL-YGCA 121
            +DN    +      L+ +  Y+G  K+I ++  G    +FN+R +   V D KF+  G  
Sbjct: 112  SDN-LFSLTIHRSGLVAIRAYEGSVKMIQWE-PGTDVRSFNVRFDYPNVSDFKFIDTGVD 169

Query: 122  KPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIG 181
                +    D+   +H++  ++ + DK+       Q ++   A +LIPVP P+ GVL++ 
Sbjct: 170  DTYRIAFIYDDDHGKHLQFSDLNMHDKEL-HTFSRQASIAADASVLIPVPAPISGVLVLA 228

Query: 182  EETIVYCSANA------FKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE 235
              +I+Y S++       + +  +  ++   +G VD  G R++L D  G L +L++   + 
Sbjct: 229  ANSILYKSSDVNGDVVPYASPLLDNTVFTCHGLVDPSGERFILSDTEGRLLMLILNIGEG 288

Query: 236  K----VTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLE 291
            +    V  ++IE LGETSIA +I+Y+D  VV++GS  GDSQLI+L   P   GSY  VLE
Sbjct: 289  RSGITVKDMRIEYLGETSIADSINYIDAGVVFVGSRLGDSQLIRLMPTPSG-GSYSVVLE 347

Query: 292  RYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMW 351
             Y N+GPI D  +V  E  GQ Q+VTCSGA KDGSLR++RNGIGI E ASVEL G+ G++
Sbjct: 348  TYSNIGPIRDMIMV--ESDGQAQLVTCSGAEKDGSLRVIRNGIGIEELASVELAGVIGIF 405

Query: 352  SLR--SSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQ- 408
             +R  S+TD+    +++VS   ET +L +N E ELE+ ++   C++  T+F    I+   
Sbjct: 406  PIRLNSTTDN----YVIVSLAEETHVLQINGE-ELEDVQLLQICTEMPTIFA-STIFGPD 459

Query: 409  ----LVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYL--- 461
                L+QVT   VR ++ +   L   W+ P G  ++  + NA    +       VY    
Sbjct: 460  NSEVLLQVTEKHVRFMAFSG--LSKIWEPPNGELISKVSVNAIHGQIVVAARDTVYFLLC 517

Query: 462  ---EIGDGILTEVKHAQLEYEISCLDINPIGENPSY-SQIAAVGMWTDISVRIFSLPDLN 517
               E+G   +  V   + E EI+CLDI+  G++ +       + +W+   + +  LPDL 
Sbjct: 518  VIEEMGGLDINLVAERKFEDEIACLDISNEGDDHTKPGTFMVLALWSTFCMEVVQLPDLK 577

Query: 518  LITKE------------------HLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLN 559
             ++ E                  +L  +I+PRS++    E + YLL A GDG L+ ++ +
Sbjct: 578  TVSCERRSLLETVLSYLSQVCQTNLPSKIVPRSIVATCIEEVHYLLIAFGDGALVYYVFD 637

Query: 560  MKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVS 619
            +KTG   + KK S+GT+P TL    +KN  H+F  SDRP +I+SS+KKL++SNVN+K V+
Sbjct: 638  IKTGTHGEAKKSSVGTRPPTLYRVRNKNRQHLFVCSDRPVIIFSSSKKLVFSNVNVKVVN 697

Query: 620  HMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICS 679
             +C  +S+++ D L I+    +  GT+DDIQK+H+RSIP+GE   RI +Q  S T+ +CS
Sbjct: 698  TVCSLSSSSYCDCLVISDGISMVFGTVDDIQKIHVRSIPMGESVLRIAYQRSSGTYGVCS 757

Query: 680  LKNQSCAE------------------------------ESEMHFVRLLDDQTFEFISTYP 709
             + +S  E                              +S   F+ +LD  TF+   ++ 
Sbjct: 758  SRTESKRERIYASKNAIYTSNSRPKITSTRNEANDNPPKSTSSFM-VLDQNTFQRSGSFS 816

Query: 710  LDTFEYGCSILSC---SFSDDSNVYYCVGTAYVLPEENEPTKGRILVF---IVEDGKLQL 763
             +       I SC    F++DS VYY VGT  + PEE +   GRI+VF    VE  KL+ 
Sbjct: 817  -NNIRKTYMIFSCISGQFTNDSKVYYIVGTGLIYPEETDTKFGRIVVFEVDEVERSKLRR 875

Query: 764  IAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQT 823
            + +   +G+  +L   NGKL+AAIN  ++L++W +     +EL+ EC +  HI+AL ++ 
Sbjct: 876  VHDLVCRGSPLALRILNGKLVAAINSSVRLFEWTM----DKELRLECSNFNHIMALDLKV 931

Query: 824  RGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFT 883
              + + V D+M+S+SLL Y+  EG  EE A+D+N+ WM   E +  +  LG E + NLFT
Sbjct: 932  MNEEVAVADVMRSVSLLSYRMLEGNFEEVAKDWNSEWMVTCEFITAEQILGGEAHLNLFT 991

Query: 884  VR-KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGV 942
            V    S   TD+ R  LE  G Y+LGE        SLV +  D  +     ++FGT  G 
Sbjct: 992  VEVDKSRPITDDGRYVLEPTGYYYLGELPRVMVRSSLVAQPDDCSIQYSQPIMFGTNQGS 1051

Query: 943  IGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIES 1002
            IG++  +  +   FL  ++  +   +K    + H  +RSF  +K+      F+DGDL+ES
Sbjct: 1052 IGMVVQIDDKWKKFLIAVEKAIADSVKNCMHIEHTTYRSFIFQKRLESPTGFIDGDLVES 1111

Query: 1003 FLDLSRTRMDEI-----SKTMNVSVE----ELCKRVEELTRLH 1036
             LD+ R+    I      K  + S+     E+ K +E+L R+H
Sbjct: 1112 ILDMDRSAAIAILYKVSDKGWDASLPRDPIEILKVIEDLARMH 1154


>gi|341884150|gb|EGT40085.1| CBN-DDB-1 protein [Caenorhabditis brenneri]
          Length = 1134

 Score =  567 bits (1462), Expect = e-159,   Method: Compositional matrix adjust.
 Identities = 364/1100 (33%), Positives = 598/1100 (54%), Gaps = 87/1100 (7%)

Query: 4    VPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPT 63
            +PIYG++ +L L +   +++  L + TE++   +L +      ++TRA G + D  GR  
Sbjct: 55   IPIYGQVLSLALVKAKRDSRQSLVVLTEKWHMSILAY--RDGAIVTRAAGCILDPTGRTV 112

Query: 64   DNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGCAKP 123
            DN    +      LI +  ++G+ K+I ++    L+  FN+R +   V + KF+      
Sbjct: 113  DNF-FTLSVHRSGLIAIKAFEGIVKLIQWEAGADLRH-FNVRFDFPNVSEFKFVDTTEDD 170

Query: 124  T--IVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIG 181
                  +++D+ + +H++  ++ + DK+F      Q ++   + +LIPVP P+ GV+++G
Sbjct: 171  VYRAAFIFEDD-NGKHLQFSDLNMHDKEF-RPHLRQTSIAADSSMLIPVPSPISGVVVLG 228

Query: 182  EETIVYCSA-NAFKAIP-----IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE 235
              +++Y S+ N  + +P     +  +I  ++  VD  G R+++ D  G L +L++   + 
Sbjct: 229  THSLLYKSSENDGEVVPYSSPLLENTIFTSHSIVDPTGERFIVSDTDGRLLMLLLNAVEN 288

Query: 236  K----VTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLE 291
            +    V  ++I+LLG+TS+A +I+Y+DN VV+IGS +GDSQLI+L L      SY+ VL+
Sbjct: 289  QSGLSVKEIRIDLLGDTSVAESINYIDNGVVFIGSRFGDSQLIRL-LSEKTNSSYISVLD 347

Query: 292  RYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMW 351
             Y N+GPI D  +V  E  GQ Q+VTCSGA KDGSLR++RNGIGI E A+V+L G+ G++
Sbjct: 348  TYYNIGPIRDMIMV--ESDGQPQLVTCSGAEKDGSLRVIRNGIGIEELATVDLPGVVGIF 405

Query: 352  SLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAI----YN 407
             +R   D   D +++VS + ET +L +  E ELE+ +     +   T+F           
Sbjct: 406  PIR--LDSSADNYVIVSLVEETHVLQITGE-ELEDVQFLQIDTALPTMFAGTLFGPNDSG 462

Query: 408  QLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANA--SQVLLATGGGHLVYL---- 461
             +VQVT   VRL+S+    L   W+   G  ++    NA   QV +A    H+ +L    
Sbjct: 463  LVVQVTERQVRLMSNGG--LSKFWEPANGEMISKVAVNAVSGQVCVA-ARDHVYFLSCIV 519

Query: 462  -EIGDGILTEVKHAQLEYEISCLDINPIGENPSY-SQIAAVGMWTDISVRIFSLPDLNLI 519
             E+G   ++ +   Q E EI+CLDI+  G++    +    +  W   S+ +  LPDL  +
Sbjct: 520  DEMGALDISVIAEKQFEDEIACLDISNEGDDADKPATFMVLAFWRTFSMEVVQLPDLKTV 579

Query: 520  TKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPIT 579
             +  L   I+PRS++    E + YLL A GDG L  ++ ++KTG   + KK S+GT+P +
Sbjct: 580  CQTDLPSRIVPRSIIATCIEDVHYLLVAFGDGALTYYVFDIKTGTHGEPKKSSVGTRPPS 639

Query: 580  LRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEG 639
            L    +KN  H+F  SDRP +I+SS+KKL++SNVN+K V+ +C  +S  + D L I+   
Sbjct: 640  LHRVRNKNRQHLFVCSDRPMIIFSSSKKLVFSNVNVKVVNTVCSMSSRVYRDCLVISDGN 699

Query: 640  ELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAE------------ 687
             +  GT+DDIQK+H+R+IP+GE   R+ HQ+ S T+ +CS +  +  E            
Sbjct: 700  CMVFGTVDDIQKVHVRTIPMGESVLRVAHQKSSGTYGVCSSRTDTRFERITASKSAIGKS 759

Query: 688  ------------------ESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSN 729
                              +S   F+ +LD  TF+ + ++     E   S++S  F+DD+N
Sbjct: 760  LSKPKVTATRNDMNETPPKSTSSFI-VLDHNTFQVLHSHTFGPHETAVSVISGQFNDDNN 818

Query: 730  VYYCVGTAYVLPEENEPTKGRILVFIVED---GKLQLIAEKETKGAVYSLNAFNGKLLAA 786
             YY VGTA V P+E+E   GRI+VF V++    KL+ + E   +GA   +   NGKL+AA
Sbjct: 819  SYYIVGTALVYPDESETKIGRIIVFEVDETDKTKLRFMTEIVVRGAPMGIRILNGKLVAA 878

Query: 787  INQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEE 846
            IN  +++++W       +EL+ EC    HI A+ ++   + I V D+M+S+SLL Y+  E
Sbjct: 879  INSSVRMFEWT----AEKELRVECSTFNHIAAVDLKVLNEEIAVADVMRSVSLLSYRTLE 934

Query: 847  GAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR-KNSEGATDEERGRLEVVGEY 905
            G  EE A+D+N+ WM   E +  +  LG E + N+FTV    S   TD+ R  LE  G +
Sbjct: 935  GNFEEVAKDWNSEWMVTCEFITAESILGGEAHLNMFTVEVDKSRPVTDDGRYVLEPTGYW 994

Query: 906  HLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLR 965
            +LGE         LV +  D+ +     +++GT  G +G++  +      FL  ++  + 
Sbjct: 995  YLGELTKVMIRAVLVPQPDDNSIRYTQPIMYGTNQGSLGLVVQIDDMYKKFLLSIEKAIS 1054

Query: 966  KVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEI-----SKTMNV 1020
               K    + H  +RSF   K+      F+DGDLIES LD+ R+R  EI     +K  + 
Sbjct: 1055 DAEKNCMQIEHSTYRSFTYNKRIEPPSGFIDGDLIESILDMDRSRAIEILEKANTKGWDP 1114

Query: 1021 SVE----ELCKRVEELTRLH 1036
            S+     E+ K +++L+R H
Sbjct: 1115 SIPKDPVEILKIIDDLSRTH 1134


>gi|390366809|ref|XP_780126.3| PREDICTED: DNA damage-binding protein 1-like isoform 1
            [Strongylocentrotus purpuratus]
          Length = 630

 Score =  559 bits (1441), Expect = e-156,   Method: Compositional matrix adjust.
 Identities = 293/565 (51%), Positives = 375/565 (66%), Gaps = 59/565 (10%)

Query: 527  EIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSK 586
            EIIPRS+LL  FEG +Y+LCALGDG L  F LN +TG +TDRKKV LGTQP  L+TF S 
Sbjct: 70   EIIPRSILLTTFEGQNYILCALGDGSLFYFQLNAETGYMTDRKKVILGTQPTVLKTFKSL 129

Query: 587  NTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTI 646
            +T +VFA SDRPTVIYSSN KL++SNVNLKEVS+MCP NS  +PDSLA+  +  L IG+I
Sbjct: 130  STVNVFACSDRPTVIYSSNHKLVFSNVNLKEVSYMCPLNSDGYPDSLALCNDTTLMIGSI 189

Query: 647  DDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKN----------------QSCA---- 686
            D+IQKLHIR++PLGE P RI +QE S+TF I S +                 QS +    
Sbjct: 190  DEIQKLHIRTVPLGETPLRITYQEPSQTFGIISTRTDVVDSSGTTASMGQTRQSASTSAL 249

Query: 687  -------------------------EESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILS 721
                                     +E E+H + ++D  TFE +  +   + EY  S++S
Sbjct: 250  NITKSGGNKGMAGQAGGSGEGSSFGDEVEVHSLLVIDQHTFEVLHAHHFGSSEYATSLIS 309

Query: 722  CSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNG 781
            C   +D N YY VG A V P+E EP  GRI+VF   DGKLQ IAEKE KGA YSL  FNG
Sbjct: 310  CKLCNDPNWYYIVGLANVHPDEAEPKSGRIVVFQYSDGKLQEIAEKEIKGAPYSLVEFNG 369

Query: 782  KLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLI 841
            KLLA++N  ++L++W         L+ EC H+ ++LALY++T+GDFIVVGDLM+SI+LL 
Sbjct: 370  KLLASVNSVVRLFEWTPE----HSLRVECSHYNNVLALYLKTKGDFIVVGDLMRSITLLA 425

Query: 842  YKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEV 901
            YK  EG +EE ARDY+ NWMSAVEILDDD +LGAEN+ NLFT +K+S   TDEER  L+ 
Sbjct: 426  YKPMEGCLEEIARDYSPNWMSAVEILDDDTFLGAENSSNLFTCQKDSAATTDEERRHLQE 485

Query: 902  VGEYHLGEFVNRFRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKL 960
            VG +HLGEFVN FRHGSLVM+ + +S +    +V+FGTV+G +G++  L  E Y FL ++
Sbjct: 486  VGLFHLGEFVNVFRHGSLVMQNIGESTIPTTGSVLFGTVSGSVGLVTQLNEEFYRFLLEV 545

Query: 961  QTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV 1020
            Q  L KVIK VG + H  WRSF +E+KT    NF+DGDL+ESFLDLSR  MDE+++ + +
Sbjct: 546  QNKLTKVIKSVGKIKHSFWRSFYSERKTEPMDNFIDGDLLESFLDLSRDTMDEVAQGLQI 605

Query: 1021 S---------VEELCKRVEELTRLH 1036
                        +L K VEELTR+H
Sbjct: 606  DDGGMKRDCMANDLIKIVEELTRIH 630



 Score =  105 bits (261), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 49/71 (69%), Positives = 57/71 (80%)

Query: 565 LTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPF 624
           +TDRKKV LGTQP  L+TF S +T +VFA SDRPTVIYSSN KL++SNVNLKEVS+MCP 
Sbjct: 1   MTDRKKVILGTQPTVLKTFKSLSTVNVFACSDRPTVIYSSNHKLVFSNVNLKEVSYMCPL 60

Query: 625 NSAAFPDSLAI 635
           NS  +PD   I
Sbjct: 61  NSDGYPDRYEI 71


>gi|74208347|dbj|BAE26370.1| unnamed protein product [Mus musculus]
          Length = 599

 Score =  549 bits (1415), Expect = e-153,   Method: Compositional matrix adjust.
 Identities = 286/548 (52%), Positives = 361/548 (65%), Gaps = 54/548 (9%)

Query: 539  EGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRP 598
            E   YLLCALGDG L  F LN++TG L+DRKKV+LGTQP  LRTF S +TT+VFA SDRP
Sbjct: 56   ESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVFACSDRP 115

Query: 599  TVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIP 658
            TVIYSSN KL++SNVNLKEV++MCP NS  +PDSLA+A    LTIGTID+IQKLHIR++P
Sbjct: 116  TVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKLHIRTVP 175

Query: 659  LGEHPRRICHQEQSRTFAICSLK------------------------------------- 681
            L E PR+IC+QE S+ F + S +                                     
Sbjct: 176  LYESPRKICYQEVSQCFGVLSSRIEVQDSSGGTTALRPSASTQALSSSVSSSKLFSSSTA 235

Query: 682  --NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYV 739
                S  EE E+H + ++D  TFE +  +     EY  S++SC    D N Y+ VGTA V
Sbjct: 236  PHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFIVGTAMV 295

Query: 740  LPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLR 799
             PEE EP +GRI VF   DGKLQ +AEKE KGAVYS+  FNGKLLA+IN  ++LY+W   
Sbjct: 296  YPEEAEPKQGRIAVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRLYEWTTE 355

Query: 800  DDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNAN 859
                +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK  EG  EE ARD+N N
Sbjct: 356  ----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIARDFNPN 411

Query: 860  WMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSL 919
            WMSAVEILDDD +LGAEN FNLF  +K+S   TDEER  L+ VG +HLGEFVN F HGSL
Sbjct: 412  WMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVFCHGSL 471

Query: 920  VMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQ 978
            VM+ L ++      +V+FGTVNG+IG++ SL    Y  L  +Q  L KVIK VG + H  
Sbjct: 472  VMQNLGEASTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSF 531

Query: 979  WRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN----------VSVEELCKR 1028
            WRSF+ E+KT  A  F+DGDLIESFLD+SR +M E+   +            + ++L K 
Sbjct: 532  WRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATADDLIKV 591

Query: 1029 VEELTRLH 1036
            VEELTR+H
Sbjct: 592  VEELTRIH 599


>gi|339235331|ref|XP_003379220.1| DNA damage-binding protein 1 [Trichinella spiralis]
 gi|316978142|gb|EFV61158.1| DNA damage-binding protein 1 [Trichinella spiralis]
          Length = 1329

 Score =  548 bits (1411), Expect = e-153,   Method: Compositional matrix adjust.
 Identities = 335/1117 (29%), Positives = 585/1117 (52%), Gaps = 97/1117 (8%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
            ++GRI   +LF P GE +  + I T +    ++++D  +  + T A  ++S+  GRP  N
Sbjct: 224  MFGRIGAAKLFTPKGENKALMVIVTLKQDVAIVEYD--NGRIKTLASRNISENFGRPASN 281

Query: 66   GQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGCAKPTI 125
            G +  + PD  +IGL +    FK I ++        +++      + D  FL+G   P I
Sbjct: 282  GILLSVHPDGEVIGLRIMSSTFKCITWNRATSKLSTYSLNYSLTHLSDFVFLHGFQFPVI 341

Query: 126  VVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETI 185
             ++Y D    RHV T  ++L +++F  GPWS+ +++  A  LI VPPPLCGV+++G  ++
Sbjct: 342  ALIYGD-LVGRHVITCRISLDEQEFENGPWSRGHIEWEAHTLIAVPPPLCGVIVVGCSSL 400

Query: 186  VYCSANAFKAIPIRPSITKAYGRV-DA--DGSRYLLGDHAGLLHLLVITHE-----KEKV 237
            +Y   N+  +    P ++K+     DA  DG  Y LG   G L LL +  E     K  +
Sbjct: 401  LYIRDNSTISTVSPPFLSKSIVNCYDAAPDGLTYFLGQLDGTLSLLKLDIETDAEGKVTL 460

Query: 238  TGLKIELLGETSIASTISYLDN-AVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNL 296
            + ++  +LG TS   ++SY+   +++++GS   DS+L++LN       ++ EV E + NL
Sbjct: 461  SRMRATILGVTSPPDSLSYMHKESLLFVGSRIADSKLLRLNSLAICDETWTEV-ESFPNL 519

Query: 297  GPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSS 356
             PIVD  +VD+ +QGQG++++CSG  KDGSL+++R+GIG+ E A +++  +  +W+LR  
Sbjct: 520  APIVDMVLVDMAKQGQGEIISCSGYGKDGSLKVIRSGIGLYELARLDIPFVNRVWALRYI 579

Query: 357  TDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLF---CHDAIYN-QLVQV 412
            T++P+D   ++  +  + ++    E++ E+  ++G  +  QT     CH A     +VQV
Sbjct: 580  TNEPYDNLFILGVVGNSTMVIKFQENQAEQISVDGLETAEQTFVAANCHMADGALAIVQV 639

Query: 413  TSGSVRLVSST--SRELRNEWKSPPGYSVNVATANASQVLLATGGGHLV-YLEIGDGILT 469
                ++L  ++   R+L +EWK P G  V++A    +   L     H++ Y  + D  L 
Sbjct: 640  VRSKIQLADASLDGRKL-DEWKFPDGSEVSLAACEGTSGRLLVACRHILHYFNVRDDRLQ 698

Query: 470  EVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEII 529
             V    +E E SC++   + +      +  VG WT I +++  +P L  +   +L    +
Sbjct: 699  LVTTRTMENEASCVEFGCLSDEGV--GVCIVGHWTKICLQLLFVPSLEPVQTVYLEDSSV 756

Query: 530  PRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTT 589
             RS L+C FE   YL  +L DG+L+ + ++  +  L D KKVSLGT  I+L+ F +KNT 
Sbjct: 757  IRSALMCKFESNMYLFVSLADGNLIFYSIDENSFALIDGKKVSLGTDSISLKMFKTKNTL 816

Query: 590  HVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDI 649
             + A +++PTVIY +N KL +S+++   + +M P  ++A+ + +       LTIG +D+I
Sbjct: 817  SILACTEKPTVIYMNNNKLQFSSLDSSAIYYMTPLFTSAYDNGVLFTNGQCLTIGVVDEI 876

Query: 650  QKLHIRSIPLGEHPRRICHQEQSRTFAICSLK---------------NQSCAEES----- 689
            +KLHIRS+ LGE PRRI  Q +++   + +++               N++    S     
Sbjct: 877  KKLHIRSVVLGETPRRIVWQPENKLVGVLTIRIMDLPGNSSSESSASNRALHSASAPPLR 936

Query: 690  ------------------------------EMHFVRLLDDQTFEFISTYPLDTFEYGCSI 719
                                          E H   LLD  +F+ +  Y ++  E   SI
Sbjct: 937  RTTLTVDPNSAINEGAVNIGAEDTGVDRSIETHSFLLLDQNSFDVLHAYSMNPLEECTSI 996

Query: 720  LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVE-DGK----LQLIAEKETKGAVY 774
            LSC+  +D N ++ +  A +  +E EP +GR+L+   E DG+    L L+ EKE  G VY
Sbjct: 997  LSCTMGEDQNPFFILSAAVITADETEPLQGRLLMLRYERDGQGNSSLNLVHEKEVNGCVY 1056

Query: 775  SLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLM 834
            ++ +F  KLL A+N  + L++W   D    +L S C     + A++++ R + I+VGD+ 
Sbjct: 1057 AMASFKSKLLVAMNSSVLLFEW--SDVTGLQLVSSCSL--FVTAMHLKVRDEVILVGDIQ 1112

Query: 835  KSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDE 894
            +SI++L Y   E +  E ARDY+ NW+SA+E++D+D ++ AEN+ N+   +K+ +     
Sbjct: 1113 RSIAVLRYVPSESSFVEEARDYHPNWISAIEVIDNDYFMAAENSLNITVSQKDLQQQPVS 1172

Query: 895  ERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPT-VIFGTVNGVIGVIASLPHEQ 953
            E   ++  G  HLGE++N F+HG+L M         +   ++ GT  G I +   +    
Sbjct: 1173 ESQVVKSAGRLHLGEYINVFKHGALSMYSYAGISSLVSNPIMIGTAEGSILIYCQIHDSH 1232

Query: 954  YLFLEKLQTNLRKVIK-GVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMD 1012
            +  L  LQ     ++   VG + ++ +R +   +K   A  F+DGDLIE  L++ R    
Sbjct: 1233 FRVLNDLQRCFSDIVPDNVGCIAYDSYRRYVVYEKNAPAFGFIDGDLIEQLLEMPRQEAI 1292

Query: 1013 EI------------SKTMNV-SVEELCKRVEELTRLH 1036
             +             +  N+ + +++   +EEL+R+H
Sbjct: 1293 RLINGWIAAGRISPERCSNITTAQQIIDLIEELSRIH 1329


>gi|223994993|ref|XP_002287180.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220976296|gb|EED94623.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 1517

 Score =  541 bits (1393), Expect = e-150,   Method: Compositional matrix adjust.
 Identities = 446/1380 (32%), Positives = 628/1380 (45%), Gaps = 369/1380 (26%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG----R 61
            I GRI +L   +      D LF  TER  + ++ +D + ++L   A+   +   G    +
Sbjct: 84   INGRITSLAPIKYSSSNTDCLFFTTERGCYALISYDEQLAQLQFNAVTSTTSSTGGEETK 143

Query: 62   PTDN----------------------------------GQIGIIDPDCRLIGLHLYDGLF 87
              D+                                  G I  +DP  R I LH+YDG  
Sbjct: 144  DHDDSAIIGLHYPIQTHATGSFASYSSSVIAGGCQAECGPIMTVDPLNRCIALHVYDGWV 203

Query: 88   KVIPFD-----------------------NKGQLKEAFNIRLEELQVLDIKFLYGCAK-- 122
             VIP +                       + G   +AF+ R+EE  +L + FL   +   
Sbjct: 204  TVIPINRGYDVGSARASSSSSRGGKPRGLSTGPFGDAFHARVEERTLLSMTFLRPKSNAG 263

Query: 123  --------------PTIVVLYQDNKDARHVKTYEVALKDKDFV----------------- 151
                          P +VVL+QD +  +HV T+ V L  K  V                 
Sbjct: 264  GRGNSGGGVSSYYHPQLVVLHQDARGFQHVITHGVDLAKKGLVLHGSPNDSAVATMMGVK 323

Query: 152  --EG--PWS----------QNNLDNGADLLIPVPP----PLCGVLIIGEETIVY--CSAN 191
              EG  P S          ++ +D G+  LI +PP     L GVLI+G+  I Y      
Sbjct: 324  SDEGKMPSSTMPPANERLKKSRIDGGSAALIAIPPSKTSALGGVLILGQRQITYHHTGEG 383

Query: 192  AFKAIPIRPSITKAYGRV-DADGSR----------YLLGDHAGLLHLLVI----THEKEK 236
              + +PI  S+  +Y  V D D  R          Y+LGD  G +HLL I    T E + 
Sbjct: 384  ITRTLPIGSSLLYSYCLVADKDADRNINNNAPVLKYILGDSTGRIHLLTILLNVTPEGKG 443

Query: 237  ---VTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQP-------DAKG-- 284
               VT L +E LG  S AS + YL    +++GS +GDSQL+K+   P       D +G  
Sbjct: 444  DGCVTTLLMETLGTASSASAVVYLGKGCLFVGSQFGDSQLLKILDAPVPLSGGRDDEGEE 503

Query: 285  -------SYVEVLERYVNLGPIVDFCV---VDLERQGQGQ---------------VVTCS 319
                   +Y+ +L+ Y NLGPIVDF +    D    G G                VVTCS
Sbjct: 504  KNPLEETTYLRLLDEYTNLGPIVDFDLRPCSDATSGGHGNAKNDSKSRDQHRQSLVVTCS 563

Query: 320  GAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMN 379
            G  KDG++R+VRNG+G+ E A VE+ GIKGMWSLR +  D  D+FLV SF+ ETRIL + 
Sbjct: 564  GVGKDGTVRLVRNGVGMREHAEVEMPGIKGMWSLRRTFADEDDSFLVQSFVRETRILGVQ 623

Query: 380  ----------------LEDELEETEIEGFCSQTQTLFCHDAIYNQ---LVQVTSGSVRLV 420
                                L E  I GF S   TLF  + +      L+QV   SVRLV
Sbjct: 624  SGGEAEMEEEDDEDQEEGGALAEVTIPGFSSSKSTLFAGNLLVGSSDLLLQVVEDSVRLV 683

Query: 421  SSTSRELRNEWK------------SPPGYSVNVATANAS-QVLLATGGGHLVYLEI-GDG 466
            +S + EL  EW              P G+ + VA+AN+S Q+++A  GG LVYL + GD 
Sbjct: 684  NSETLELVTEWSPFSNDNEGSDDDEPLGF-ITVASANSSGQIVVALRGGTLVYLVVEGDA 742

Query: 467  I--LTEVKHAQLEYEISCLDINP-----IGENP------SYSQIAAVGMWTDISVRIFSL 513
               +  +K   L+ EISC+D+NP     +G N        +S++ AVG+W D SVR+  L
Sbjct: 743  SPSIRRLKRVTLDREISCIDLNPFDDTAVGGNTMDVDGGHHSKLVAVGLWDDFSVRLLDL 802

Query: 514  PD----------LNLITKEHLGG----------------EIIPRSVLLC----------- 536
                        +NL      GG                 ++ RS+ L            
Sbjct: 803  SSSHSVLDQVLQINLGVGNDNGGGKSAEEDDDELGESSQHMMARSLCLVTLDSHSSNSSS 862

Query: 537  -------AFEG---ISYLLCALGDGHLLNFLL-----NMKTGELTDRKKVSLGTQPITLR 581
                   +F G   +  LL  LGDG L++F +     + K   +  RK+VSLGTQ I L 
Sbjct: 863  LINKKSDSFTGSNKVDMLLVGLGDGKLISFAVVHSAASSKKWFVHSRKEVSLGTQGIHLI 922

Query: 582  TFSS-KNTTHVFAASDRPTVIY--------SSNKKLLYSNVNLK------EVSHMCPFNS 626
              SS  N T V A  DRPTV+Y        +SN KL YS ++L       E  H    N 
Sbjct: 923  PMSSGSNGTCVLATGDRPTVVYLTGGNGGGNSNPKLCYSTISLTVEDDDDEDGHASHRNV 982

Query: 627  AA--------------------FPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRI 666
            +                        SL +A E  L +G IDDIQKLH+ S  LG  PRRI
Sbjct: 983  SVNVASSFRSSLLFSSSNSTSNHNYSLCVADETTLRLGMIDDIQKLHVTSYKLGMTPRRI 1042

Query: 667  CHQEQSRTFAICSL--------KNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCS 718
             + E  R + +  +         NQ  AE +  + VR  DD TFE I+   L+ FE   S
Sbjct: 1043 AYHEAGRVYCVGCIDGNAKGGNNNQVGAEINMGNCVRFFDDSTFEEINQIDLEPFETILS 1102

Query: 719  ILSCSFSDDSNV---------------YYCVGTAYVLPEENEPTKGRILVFIVEDGK--- 760
            ++S S    S                 Y  +GTAY  P+E+EPT+GRILV     G+   
Sbjct: 1103 LVSVSLCTSSQTLTQSNSKQDTSEYKPYILIGTAYAYPDEDEPTQGRILVVECNSGEAEP 1162

Query: 761  ---------------LQLIAEKETKGAVYSLNAF-NGKLLAAINQKIQLYKWMLRDDGTR 804
                           ++ + +  T+G VYS++ F  G +LA +N K  L +  +  D   
Sbjct: 1163 HLKSDDDMEDTYSRYVRHVTQMPTRGGVYSISPFYGGTVLATVNSKTHLCRLSIGCDQIG 1222

Query: 805  ELQ-SECGHHGHILALYVQTRG----------------DFIVVGDLMKSISLLIYKHEEG 847
            EL+    GHHGH+L+L+V++                     +VGDLM+SISL+ Y+ +  
Sbjct: 1223 ELKFVGAGHHGHMLSLFVKSLAGSESESESSGTNRQAKQLAIVGDLMRSISLVEYQPKHN 1282

Query: 848  AIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHL 907
             IEE ARDYNAN+ +AVE+L +  YLG+E   NLF +R N+  +++E R RL+ VGEYHL
Sbjct: 1283 VIEELARDYNANFCTAVEMLTNGTYLGSEGFNNLFVLRHNANASSEEARVRLDTVGEYHL 1342

Query: 908  GEFVNRFRHGSLVMRLPDSDVGQI--------PTVIFGTVNGVIGVIASLPHEQYLFLEK 959
            GE  N+F  GSL+M  P +  G +           +FGTV+G IG +  L    + FL  
Sbjct: 1343 GEMTNKFMGGSLIM--PSNSGGIMGAQNAYVGSQTLFGTVDGSIGSVLGLDGPTFAFLAC 1400

Query: 960  LQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN 1019
            LQ  +  ++K VG ++HE++R+F  E++   ++ F+DGDLIE+FLDL+R  M+ I K MN
Sbjct: 1401 LQRAILSIVKTVGDISHEEYRAFRAERQVRPSRGFIDGDLIETFLDLNRPTMERIVKYMN 1460


>gi|345570887|gb|EGX53705.1| hypothetical protein AOL_s00006g33 [Arthrobotrys oligospora ATCC
            24927]
          Length = 1133

 Score =  532 bits (1370), Expect = e-148,   Method: Compositional matrix adjust.
 Identities = 353/1101 (32%), Positives = 582/1101 (52%), Gaps = 92/1101 (8%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPTD 64
            +YGR+  L   RP     D LF+ T+RY++  + WD+++  +   R   DV+DR  RP  
Sbjct: 55   VYGRVTGLLSLRPQQSTLDHLFLGTDRYEYFTVSWDSQTGTIRNERKAHDVTDRFQRPAQ 114

Query: 65   NGQIGIIDPDCRLIGLHLYDGLFKVIPFDNK---------------GQLKEAFNIRLEEL 109
             G + + DP  RL+GL+LY+G+F  IP   +               G L +   IR+ EL
Sbjct: 115  VGHLYLADPGGRLLGLYLYEGIFTAIPIKRQSKGRGRHAQLPEAEIGNLDDPCPIRMNEL 174

Query: 110  QVLDIKFLYGCAKPTIVVLYQDNKDARHVKTYEV-----ALKDKDFVEGPWSQNNLDNGA 164
            +V+++ FLYG + P I VLY D+K   H+ TYE+     A+KD +F +     NNLD+GA
Sbjct: 175  KVINMVFLYGTSVPVIAVLYTDSKKLVHLITYELNVAKRAVKDPEFAQWGIKANNLDHGA 234

Query: 165  DLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIR-----PSITKAYGRVDADGSRYLLG 219
             LLIPV  P  G+L+IGE+ + Y      + +P++     P+    +G++D +  RYLL 
Sbjct: 235  KLLIPVDNPTGGILVIGEQVVSYFHPE--RTVPMKKPLHEPTSFVTHGKIDPE--RYLLS 290

Query: 220  DHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQ 279
            D  G L+LL++  E  K+  ++IE LGE   A  I YLDN  V++GS +GDS L++++  
Sbjct: 291  DELGHLYLLLLIIENNKLINMRIENLGEVCQARAIVYLDNGYVFLGSHFGDSTLVRIS-- 348

Query: 280  PDAKGSYVEVLERYVNLGPIVDFCVVDLE---------RQGQGQVVTCSGAYKDGSLRIV 330
              +KG  +EV++   NL PI DF V+  E           GQ  ++TCSG + DG LR V
Sbjct: 349  --SKGPRIEVVQSLPNLAPISDFIVLGTEVGGVEIHQYSAGQTMILTCSGGFYDGGLRSV 406

Query: 331  RNGIGINEQASV-ELQGIKGMWSL-RSSTDDPFDTFLVVSFISETRILAMNLEDELEETE 388
            R+G+GI +   + E+ G++ MW+L R+  D+ FD  L+ SF +E+R  A   + E+EE +
Sbjct: 407  RSGVGIRDIGLLGEMSGVQNMWALKRAILDNGFDDTLLFSFANESRAFAFGADGEVEEVD 466

Query: 389  -IEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANAS 447
              E F   T TL   +   ++LVQVT   V +V   + +L N W  P G  + +A+ + +
Sbjct: 467  TFENFFLDTTTLEAGNVGNDKLVQVTPFKVIVVEKATSKLWN-WAPPVGAKIVMASLSGA 525

Query: 448  QVLLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDIS 507
            ++++   G   +  ++    + ++ +   E EISC+ I P  +    S    VG W   S
Sbjct: 526  RLVVVLNGRICLLFDLSSEPIKQIANRTFENEISCIHI-PTKQ----SDFLVVGFWMPAS 580

Query: 508  VRIFSLPDLNLITKEHLG---GEIIPRSVLLCAFEG--ISYLLCALGDGHLLNFLLNMKT 562
            + +  + DL  + +EHL    G + PRSV++   EG   S L   + DG ++++ +    
Sbjct: 581  LALLRIADLETMKEEHLAVFEGSV-PRSVMVANMEGDGPSSLFVGMADGEVISYTITEGP 639

Query: 563  GELTDRKKVSLGTQPITLRTFSSK---NTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVS 619
            G L D+K++ LGTQ +T      K   +++ V A  +RPT++Y    + +YS + L + S
Sbjct: 640  GILDDQKRIRLGTQTVTFEALPRKTGDDSSCVIATGERPTMVYGEEGRTVYSAITLNQAS 699

Query: 620  HMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICS 679
             +  FN+ AFPD++ +A +  + I  ID+ +  H R  PL +  RR+   ++ + + + +
Sbjct: 700  SVVAFNAEAFPDTVVVATDESVFIAKIDEARTTHTRMSPLCQFARRVAFSKEKKAYGVAT 759

Query: 680  LKNQ---SCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVY---YC 733
            ++N    +   ES   ++ ++D+  ++ I  Y L   E   S+L  S ++        + 
Sbjct: 760  IRNSIDTTTGMESSSCYIHVIDENFYDKIDAYELYPNELVESLLCASLANPDGTISEKFV 819

Query: 734  VGTAYVLPEENEPTKGRILVF-IVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQ 792
            VGTA +  + +E   GR+L   +  D  L+LI E E  GA +SL    G +LA +++ I 
Sbjct: 820  VGTA-IGNDSDESEHGRLLFLELGADKMLRLITELELPGACHSLAIVKGYILAGLSKSID 878

Query: 793  LYKW-MLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLL--IYKHEEG-- 847
            LY++   R      +Q         L + +   G  + VGDL+K + +L  +    EG  
Sbjct: 879  LYRFSYTRGSLGASIQQISSIRAATLPVSLSVYGKRVFVGDLVKGVMVLEVVEGGGEGND 938

Query: 848  AIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHL 907
             + E  R Y  +W++A+E LD+D  + A+++ NL  +R+ S GATDE+  R+  + E  L
Sbjct: 939  KLVEVCRQYGVSWVTALEALDEDTCISADSDGNLVLLRRESTGATDEDTRRMRPLSEIRL 998

Query: 908  GEFVNRFRHGSLVMRLPDSDV-GQI--PTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNL 964
            GE VN  R      R+ D    G +  P    GTV+G + ++  +  + +  L K Q N+
Sbjct: 999  GEMVNCIR------RVNDPITQGYVVQPKAYLGTVDGGLFMLGLIHPDYFDILMKCQVNM 1052

Query: 965  RKVIKGVGGLNHEQWRSFNNEK-KTVDAKNFLDGDLIESFLDLSRTRM--------DEIS 1015
             KVIKG+G L+  ++R++N +  +  +   F+DG+L+E FLDL    M        D+ +
Sbjct: 1053 AKVIKGIGDLDFNRYRAYNTKGIQPEEPFRFVDGELVEKFLDLDEDAMRMVIDGANDDDN 1112

Query: 1016 KTMNVSVEELCKRVEELTRLH 1036
              +  +V E+   VE L RLH
Sbjct: 1113 SQIECTVGEMKNIVETLKRLH 1133


>gi|18377609|gb|AAL66955.1| putative UV-damaged DNA binding factor [Arabidopsis thaliana]
          Length = 270

 Score =  527 bits (1358), Expect = e-146,   Method: Compositional matrix adjust.
 Identities = 250/270 (92%), Positives = 264/270 (97%)

Query: 767  KETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGD 826
            KETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGD
Sbjct: 1    KETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGD 60

Query: 827  FIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRK 886
            FIVVGDLMKSISLL+YKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNL TV+K
Sbjct: 61   FIVVGDLMKSISLLLYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLLTVKK 120

Query: 887  NSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVI 946
            NSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDS++GQIPTVIFGTVNGVIGVI
Sbjct: 121  NSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSEIGQIPTVIFGTVNGVIGVI 180

Query: 947  ASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDL 1006
            ASLP EQY FLEKLQ++LRKVIKGVGGL+HEQWRSFNNEK+T +A+NFLDGDLIESFLDL
Sbjct: 181  ASLPQEQYTFLEKLQSSLRKVIKGVGGLSHEQWRSFNNEKRTAEARNFLDGDLIESFLDL 240

Query: 1007 SRTRMDEISKTMNVSVEELCKRVEELTRLH 1036
            SR +M++ISK+MNV VEELCKRVEELTRLH
Sbjct: 241  SRNKMEDISKSMNVQVEELCKRVEELTRLH 270


>gi|297815870|ref|XP_002875818.1| hypothetical protein ARALYDRAFT_323319 [Arabidopsis lyrata subsp.
            lyrata]
 gi|297321656|gb|EFH52077.1| hypothetical protein ARALYDRAFT_323319 [Arabidopsis lyrata subsp.
            lyrata]
          Length = 987

 Score =  523 bits (1347), Expect = e-145,   Method: Compositional matrix adjust.
 Identities = 357/1050 (34%), Positives = 552/1050 (52%), Gaps = 131/1050 (12%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRP 62
            + P+YGR+  +EL   +   +D LFIATE  K C+  WD ++S + +             
Sbjct: 53   EFPMYGRVVFMELSHLNNLRKDVLFIATETEKLCMFDWDVQTSNVFS------------- 99

Query: 63   TDNGQIGIIDPDCRLIGLHLY---DGLFKVIPFDNKGQLKEAFNI-RLEELQVLDIKFLY 118
                    +D    +  L +Y   D  ++   F   G + +   I  L    ++D+KFL 
Sbjct: 100  --------LDTIQYVPNLEMYQIEDESYEYYDFCQGGTIFDGKTIVLLSSTNIVDVKFLK 151

Query: 119  GCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVL 178
               +P + +LY+                                   LLIP    L   L
Sbjct: 152  SYLRPGLALLYRTIMPT------------------------------LLIP---NLFSYL 178

Query: 179  IIGEETIVYCSANAFKAIPIRPSITKAYGRVDADG--SRYLLGDHAGLLHLLVIT-HEKE 235
            IIG++ +VY S N  K   +     + YGRV+      RY+LG H G + +L I  +  E
Sbjct: 179  IIGKKGVVYYSDNEIKTKLMGSLDIQTYGRVEETRHMCRYILGGHDGHISVLEIKINTAE 238

Query: 236  KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNL----QPDAKGSYVEVLE 291
            +++GL  + LG+ S+ S+I+YL + +V++GS Y DSQL+K++       +     ++VL 
Sbjct: 239  RLSGLTFKPLGKISVPSSITYLRDDIVFVGSRYADSQLVKMHQLHGPSLNTNTPALQVLT 298

Query: 292  RYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMW 351
             Y N GP++DF + + +  G  ++V C G ++D SL  V  G+GI+ +   E+  I+ +W
Sbjct: 299  TYTNSGPVIDFSLCN-KPIGVRKLVMCCGTHEDSSLEGVSTGVGISVKFFAEIPNIQRIW 357

Query: 352  SLRSSTDDPFDTFLVVSFISETRILAMNLED--ELEETEIEGFCSQTQTLFCHDAIYNQL 409
            SL+SS+ +    FLV+  I+    L   ++D        + GF   + T+ CHDA     
Sbjct: 358  SLKSSSLNDNHPFLVLQPIAVAPALVFTIDDTGAFMSHTMVGFAYDSVTICCHDAADRLF 417

Query: 410  VQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILT 469
            +QVT  SVRL+ S +  LR+EW +     +  A+A   QVLLAT    LVYL+I +GIL 
Sbjct: 418  IQVTEKSVRLIDSKTGVLRSEWLASS--KILFASATIKQVLLATLDKQLVYLDIRNGILV 475

Query: 470  EVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEII 529
            + K  Q  Y++SCLDI P+G N +YS++AAVG+W + S+ +  LP+L ++T EH+   + 
Sbjct: 476  QAK-VQKGYDVSCLDIGPVGSNTTYSKLAAVGLW-EKSILVLKLPELEVLTVEHVHAHV- 532

Query: 530  PRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTT 589
            PR +LLCAFE +SYLLCAL DG LL F LN   G+L ++    L   P TLRTF S  + 
Sbjct: 533  PRCILLCAFEQVSYLLCALADGSLLTFKLNEDDGQLQEKNTKELEGYPKTLRTFISNGSV 592

Query: 590  HV-FAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDD 648
               F+ S  PT+I++  +KL Y+ +NL ++  MCPF  A   + LAI  +  + I T+ D
Sbjct: 593  RACFSTSKVPTLIFTMKQKLQYNRLNLTDIEDMCPFQRADISEGLAIVTKDGVVICTMGD 652

Query: 649  IQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTY 708
            ++  +I+ IP+  + RR+CHQE+S+T+A+         E+   H ++LLD +     S +
Sbjct: 653  MET-NIK-IPMFSYGRRLCHQEESQTYAVL------VEEKDGRHSLQLLDKELNTLFS-F 703

Query: 709  PLDTFEYGCSILSCSFSDDSNVYYCVGTAY-VLPEENEPTKGRILVFIVEDGKLQLIAEK 767
             L+  E+G SI+SCSF+ D   YYCVGT+  VL EEN  T+GRILVF+V DG+L +I + 
Sbjct: 704  ELNEHEHGLSIVSCSFAGDDQAYYCVGTSVNVLDEEN--TQGRILVFLVRDGELNIIFQY 761

Query: 768  ETKGAVYSLNAFNGKLLAAIN-QKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGD 826
             T GAVY L   NG LL  ++   I  ++WML DD  REL                    
Sbjct: 762  PTNGAVYYLKPLNGGLLVTVSVHTIHYFEWMLHDDA-RELH------------------- 801

Query: 827  FIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRK 886
                     SI+  +   E+G  E+R    ++    +        Y  A+  +++     
Sbjct: 802  ---------SINPEVVGIEDGGWEQRFAVTSSRIQKS--------YFYADGLYSI----H 840

Query: 887  NSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVI 946
            +++G  D   G    VG+++LG+ +   + G+L  RL D D   +P+V+F TV+G +G+I
Sbjct: 841  SADGVND--HGYPNTVGQWYLGQSITHMKAGTL-SRLSDEDSAIVPSVMFSTVSGAVGMI 897

Query: 947  ASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDL 1006
             SLP   Y FL KL+  +R+ I+G+G  N +  R     +   ++  FLDGD IESF  L
Sbjct: 898  VSLPPVIYEFLHKLEQYMREQIRGIGCPNEDFPRMCVELQSLSESTEFLDGDFIESFQIL 957

Query: 1007 SRTRMDEISKTMNVSVEELCKRVEELTRLH 1036
                 +E+++ M V+VE++ K VE L  LH
Sbjct: 958  DIDDQEEVAEEMEVTVEDIRKIVEYLKLLH 987


>gi|358338734|dbj|GAA31211.2| DNA damage-binding protein 1, partial [Clonorchis sinensis]
          Length = 1515

 Score =  511 bits (1315), Expect = e-141,   Method: Compositional matrix adjust.
 Identities = 337/1035 (32%), Positives = 539/1035 (52%), Gaps = 151/1035 (14%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
            DVPI  +I    LFR      D LFI T +  F ++      +  + +T   G V D   
Sbjct: 33   DVPINAKIVAACLFRRKERQTDSLFILTHKAGFALIDCARKGDDVQFVTVVSGSVEDCGA 92

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQ------------LKEAFNIRLEE 108
            R  D G   ++DP    + + LY GL ++IP +  G+                +N+R+EE
Sbjct: 93   RLIDQGFDVLVDPGANCVVVRLYHGLLRIIPLNGIGEKLTTDSLEVNQYAANTYNVRIEE 152

Query: 109  LQVLDIKFLYGCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLI 168
              ++D+ FL+G   PT  ++Y+D     H+KTYE++ ++        + ++++  + LLI
Sbjct: 153  GNIVDMAFLHGYTLPTFAMIYEDEL-VLHMKTYEISGREPALRNVQLTLDSIEPDSKLLI 211

Query: 169  PVPPPLCGVLIIGEETIVYCSANA---FKAIP-IRPSITKAYGRVDADGSRYLLGDHAG- 223
            PVP P  GV+++G+  I Y + +     + IP  + S    Y  VDA   RYLLGD AG 
Sbjct: 212  PVPKPFGGVILVGDNIIYYHTKDGPHISQYIPQAKASQVLCYAAVDAQ--RYLLGDMAGR 269

Query: 224  --LLHLLVITHEKE----------------KVTGLKIELLGETSIASTISYLDNAVVYIG 265
              ++HLL   H                   ++  ++IELLGET+   +I+Y+DN VV+IG
Sbjct: 270  LYMVHLLAEDHTPSGNGLLGSTSSAAVPSARIGSIRIELLGETATPESIAYVDNGVVFIG 329

Query: 266  SSYGDSQLIKLNLQPDA-KGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKD 324
             + GDSQLI+LN  PD  + SY+ VLE Y N+GPIVD  +V LE +GQ Q++TCSGAYK+
Sbjct: 330  CTLGDSQLIRLNPDPDPERNSYITVLENYTNIGPIVD--MVLLESKGQNQLITCSGAYKE 387

Query: 325  GSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDEL 384
            G+LRI+RNGIGI+E A+++   IKG W     +D  +D  +VVS + +T++L +  +D++
Sbjct: 388  GTLRIIRNGIGIHEHATIDQDLIKGAWCFPLESDR-YDDSIVVSMVGQTQLLRLT-DDDI 445

Query: 385  EETEIEGFCSQTQTLFCH-----DAI----------------YNQLVQVTSGSVRLVSS- 422
                +EGF +  QT++C      DA                 ++ L+Q T+  +RL+   
Sbjct: 446  TALHLEGFKTDEQTVYCATLSPMDACTSDNPESSSRSYSVFQHSLLMQATTSGIRLIGIH 505

Query: 423  --TSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDG---ILTEVKHAQLE 477
                     EW+SP G  ++  +++ + +++A+G    V   +G        +  H  + 
Sbjct: 506  HLNGSGCLAEWRSPSGRGISCLSSHGALIVVASGPELYVLRVVGPANQPSFEQTAHRTMS 565

Query: 478  YEISCLDINP---------------IGENPSYS--QIAAVGMWTDISVRIFSLPDLNLIT 520
             E++C+D+ P               I E   Y+  Q+ AVG+W    + +  LP+L L+ 
Sbjct: 566  NEVACIDLTPFDHKRAAYAASQTSTIDEPVDYTVPQLVAVGLWLGYGLALLRLPNLELVH 625

Query: 521  KEHL-------GGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGE--LTDRKKV 571
            +E L       G  ++PRSVLL   E ++YL  A+GDG L  + +        L D K+V
Sbjct: 626  EEPLPETTASTGTALLPRSVLLAQLEDMAYLFAAMGDGTLYFYTVCPSADGIVLRDAKRV 685

Query: 572  SLGTQP-ITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFP 630
            + GT P + L+ + S+   +VF  S+ P VIYS   KL+++N+N+KEV+ M P N A + 
Sbjct: 686  NAGTGPSMFLKQWRSQCKVNVFVCSNHPCVIYSIKNKLIFANLNMKEVNFMAPLNGAFYR 745

Query: 631  DSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESE 690
            D +A+     L IG++D+IQKLH+R++PL E P+R+  Q+++ +  + + + +   E S 
Sbjct: 746  DCIALVTPTALVIGSVDEIQKLHVRTVPLEETPKRLALQDETGSLGVITYRQEVFQEGSG 805

Query: 691  MHFVR----------------------------------------LLDDQTFEFI---ST 707
               VR                                        + +  T E +   S 
Sbjct: 806  FKPVRSSISLSQKVPKSTSRLPKTAPSSVSATERKFREVEVSSLLIFNKSTMELMFAHSF 865

Query: 708  YPLDTF-EYGCSILSCSFSDDS-NVYYCVGTAYVLPEENEPTKGRILVFI--VEDGKLQL 763
            Y   T  E   SI S   +D S ++ Y VGTA+++ EE EP+KGRI +F    E  +L+ 
Sbjct: 866  YFSQTLVEVAVSIASIEPTDGSKSMLYAVGTAFLVEEEVEPSKGRIHLFHWDPETARLET 925

Query: 764  IAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQT 823
            +   +  GAVY L  FNG+LLAAIN  ++L+   +++D    L+  C  + +I+AL+++ 
Sbjct: 926  VLVHDVNGAVYRLLDFNGRLLAAINSSVRLFD--IKEDS---LRLACSFNENIIALFLRR 980

Query: 824  RGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFT 883
            +GDF++VGDLM+S++LL+Y+      E   R  N  W + +EILDD+ +L AE   +LF 
Sbjct: 981  KGDFVLVGDLMRSLTLLLYRPNVNNFEAIGRHRNPRWTTCIEILDDEHFLAAEVENSLFV 1040

Query: 884  VRKNSEGATDEERGR 898
            V ++S   T E   R
Sbjct: 1041 VSRDSPENTQEPTVR 1055



 Score = 89.0 bits (219), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 48/130 (36%), Positives = 76/130 (58%), Gaps = 2/130 (1%)

Query: 895  ERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQI--PTVIFGTVNGVIGVIASLPHE 952
            E  RL      H G+F+N F  G+LVM+  +     I  P+ ++GTV+G IG+I  +   
Sbjct: 1149 EVQRLADCAYIHTGDFINVFVRGNLVMQNNEERWQAIGHPSHLYGTVSGSIGLIIQVSPV 1208

Query: 953  QYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMD 1012
             + FL+++++ L K++  VGG  H+ WR+F  E+    A NF+DGDLIE+ LDLS     
Sbjct: 1209 LFAFLKEIESRLAKLVNPVGGFAHDMWRAFKAERIVRMAHNFVDGDLIETLLDLSSEDRA 1268

Query: 1013 EISKTMNVSV 1022
            ++ K + + V
Sbjct: 1269 KLVKGLRIPV 1278


>gi|353232348|emb|CCD79703.1| putative dna repair protein xp-E [Schistosoma mansoni]
          Length = 1329

 Score =  504 bits (1297), Expect = e-139,   Method: Compositional matrix adjust.
 Identities = 331/1028 (32%), Positives = 531/1028 (51%), Gaps = 144/1028 (14%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
            DVPI  +I    LFR      D LF+ T +    +++   + +S E +T A G V DR  
Sbjct: 54   DVPINAKIVAASLFRRKDRETDSLFLLTHKAGVAIIECVRNNDSVEFVTVASGSVEDRSA 113

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQ-LKEAF--NIRLEELQVLDIKFL 117
            R  D G   +IDP    I + LY GL K+I     G+ +   F    ++EE  ++D+ F+
Sbjct: 114  RIIDQGFDVLIDPGANYIVVRLYHGLLKIILLQCIGEKIGTDFLDTNQIEEGNIVDMAFI 173

Query: 118  YGCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGV 177
            YG + PT  ++Y+D     H+KTYE+  ++        + ++++  + LLIPVP P  GV
Sbjct: 174  YGYSLPTFAMIYEDEL-VLHMKTYEIYGREPVLRNVQLTLDSIEPDSKLLIPVPKPYGGV 232

Query: 178  LIIGEETIVYCSANA---FKAIP-IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHE 233
            +++G+  I Y + +     + IP  + S    Y  VDA   RYLLGD AG L+++ +  E
Sbjct: 233  ILVGDNIICYHTKDGPHISQYIPQAKASQVLCYAAVDA--QRYLLGDMAGRLYMVHLLSE 290

Query: 234  -----------------KEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKL 276
                               ++  ++IELLGET+   +I+YLDN VV+IGS+ GDSQLI+L
Sbjct: 291  DISAAANNGTSNSDSLSAVRIGSIRIELLGETATPESIAYLDNGVVFIGSTLGDSQLIRL 350

Query: 277  NLQPDA-KGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIG 335
            N  PD  + SY+ +LE Y N+GPIVD  +V LE +GQ Q++TCSGAYK+GSLR++RNGIG
Sbjct: 351  NPDPDPERNSYITILETYTNIGPIVD--MVLLETKGQNQLITCSGAYKEGSLRVIRNGIG 408

Query: 336  INEQASVELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQ 395
            I+E A+++   IKG W     +D  FD  +VVS + +T++L +  +D++    +EGF + 
Sbjct: 409  IHEHATIDQDLIKGAWCFPIESDR-FDDTIVVSMVGQTQLLHL-ADDDITALHLEGFKTD 466

Query: 396  TQTLFC--------------------HDAIYNQLVQVTSGSVRLV---SSTSRELRNEWK 432
             QT++C                    ++ +   L+Q T+  +RL+   S  S+    EWK
Sbjct: 467  EQTVYCATLSPANDSPRPESMMIDENNNTLDPLLLQATTSGLRLIGIQSLCSKGCLTEWK 526

Query: 433  SPPGYSVNVATANASQVLLATGGGHLVYLEIGDG---ILTEVKHAQLEYEISCLDINPIG 489
             P G  ++  ++    +++A+G    V   +G+       +V H Q+ +E++C+D+ P  
Sbjct: 527  PPTGRGISCLSSFRHTIVVASGTELYVLKVVGESNNPKFEQVSHRQMSHEVACIDLTPFN 586

Query: 490  ENPSYSQI-----------------------AAVGMWTDISVRIFSLPDLNLITKEHL-- 524
             + + + I                        AVG+W    + +  LP+L L+ +E L  
Sbjct: 587  RDRAIAAINAVSSNHLDSTSSSTPDKSVPYLVAVGLWLGHGLALLKLPNLELVHEEPLPE 646

Query: 525  -----GGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGE--LTDRKKVSLGTQP 577
                 G  ++PRSVL+   E I+YL  A+GDG L  + ++       L D K+VS GT P
Sbjct: 647  TTASTGTALLPRSVLIAQLEDIAYLFAAMGDGTLYFYTIDPSEDHVCLRDPKRVSAGTGP 706

Query: 578  -ITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIA 636
             + LR + S+   +VF  S+ P VIYS   KL+++N+NLKEV+ M P N   + D +A+ 
Sbjct: 707  SMFLRQWRSQRKVNVFVCSNHPCVIYSIKNKLIFANLNLKEVNFMTPLNGLFYSDCIALV 766

Query: 637  KEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVR- 695
                L IG++D+IQKLH+R++PL E P+R+  Q ++ +  + + + +   E      VR 
Sbjct: 767  TPTALIIGSVDEIQKLHVRTLPLEETPKRLALQSETSSLGVITYRQEMFQEGMGFKPVRS 826

Query: 696  -----------------------LLDDQTFEFISTYPLDTF------------------- 713
                                      ++ F  I    L  F                   
Sbjct: 827  SISLSQKIPKSASRLPKTAPSSVSATERKFREIEVSSLLIFNQSNLEIQFAHNFYFSQTL 886

Query: 714  -EYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFI--VEDGKLQLIAEKETK 770
             E   SI S      +   + VGTA+++ +E EP+KGRI +F    E  +L  +   +  
Sbjct: 887  VEVAVSIASVQSDIHNGPLFAVGTAFLVEDEVEPSKGRIHLFRWDPESSRLDTVLVHDVN 946

Query: 771  GAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVV 830
            G+VY +  FNG+LLAAIN  ++L+   +++D    L+  C  + +I+ L+++ +GDF++V
Sbjct: 947  GSVYRIVDFNGRLLAAINSSVRLFD--IKEDS---LRLACSFNENIIVLFLRRKGDFVLV 1001

Query: 831  GDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEG 890
            GDLM+S++LL++K      E   R  +  W + +EILDD+ +L AE   NLF V ++   
Sbjct: 1002 GDLMRSLTLLLFKSNVNNFEAIGRHRHPRWTTCIEILDDEHFLAAEVENNLFVVSRDLPE 1061

Query: 891  ATDEERGR 898
             T E   R
Sbjct: 1062 NTKEPSFR 1069



 Score = 91.3 bits (225), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 58/180 (32%), Positives = 95/180 (52%), Gaps = 32/180 (17%)

Query: 885  RKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQI--PTVIFGTVNGV 942
            + NS G + E + RL     +H G+FVN F  G+LV++  +     +  P  ++GTVNG 
Sbjct: 1154 KPNSSGVSGEMQ-RLVDCAYFHTGDFVNVFVRGNLVLQNAEERWTAVGYPGHLYGTVNGG 1212

Query: 943  IGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIES 1002
            +G++  +    + FL++++  L  ++  VGG +H+ WR+F  +++   A NF+DGDLIE+
Sbjct: 1213 LGLLVQVSPVLFAFLKEVEFRLSNLVVPVGGFSHDTWRAFKADREIKMAHNFVDGDLIET 1272

Query: 1003 FLDLSRTRMDEISK--------------------------TMNVSVEELCKRVEELTRLH 1036
              DLS   MD+ +K                          T   +VE+L K VEE++RLH
Sbjct: 1273 VTDLS---MDDKAKLVKGLRIPVNMNEFGTAGSTCTTDPETRECTVEDLVKVVEEMSRLH 1329


>gi|169611218|ref|XP_001799027.1| hypothetical protein SNOG_08717 [Phaeosphaeria nodorum SN15]
 gi|160702249|gb|EAT83885.2| hypothetical protein SNOG_08717 [Phaeosphaeria nodorum SN15]
          Length = 1140

 Score =  504 bits (1297), Expect = e-139,   Method: Compositional matrix adjust.
 Identities = 336/1081 (31%), Positives = 567/1081 (52%), Gaps = 69/1081 (6%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPTD 64
            IYG++  L+  RP     D LF+ T+R+ +  L W+AE  +L T ++   V+D   R + 
Sbjct: 79   IYGKVTMLQKLRPALSPTDHLFVGTDRFMYFTLSWNAEKKQLQTEKSFDSVADNAARESQ 138

Query: 65   NGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQ----------LKEAFNIRLEELQVLDI 114
             G+   +DP  R + + +Y+G+  VIP   +G+          L E   +RL E+ V   
Sbjct: 139  TGERCHVDPSGRFMTVEVYEGIITVIPLVQRGKKRKQEPDIAHLGEPQPVRLPEMFVRSS 198

Query: 115  KFLYGCA---KPTIVVLYQDNKDARHVKTYEVALKDKDFVE---GPWSQNNLDNGADLLI 168
             FL   A   KP I +L++D      ++  E+     D VE   G   +  L+ GA  LI
Sbjct: 199  AFLRPKAFDDKPKIALLFEDTHSQVKLRLREILFAGGDSVELEDGEACRAELELGASHLI 258

Query: 169  PVPPPLCGVLIIGEETIVYC--SANAFKAIPI-RPSITKAYGRVDADGSRYLLGDHAGLL 225
            P+  P  G+++IGE +I Y    +N  +  P+   +I  A+ R+DA   R++L D  G L
Sbjct: 259  PIEAPSHGLIVIGETSIAYYDEESNDLQTEPLDEATIFVAWERIDA--QRFVLADDYGRL 316

Query: 226  HL-LVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKG 284
            +L ++I  EK+KV   K++++G+TS AS + YLD   V++GS  GDSQ+I++  Q     
Sbjct: 317  YLFMLILDEKKKVQSWKLDIIGQTSRASVLVYLDAGYVFVGSHQGDSQVIRITEQS---- 372

Query: 285  SYVEVLERYVNLGPIVDFCVVDL-ERQGQGQ----------VVTCSGAYKDGSLRIVRNG 333
              +E+++ + N+ P++DF ++D+  R G+GQ          +VT SGAY+DGSLR VR+G
Sbjct: 373  --MEIVQTFANIAPVLDFTIMDMGNRSGEGQTNEYSSGQARIVTGSGAYQDGSLRSVRSG 430

Query: 334  IGINEQASV-ELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEET-EIEG 391
            +G+ +   + E++ I  ++ L+S+    +   L+V+F++ETRI   + + E+EE  E   
Sbjct: 431  VGLEDLGVLGEMEHISDLFGLKSNASAEYADTLLVTFVNETRIFRFDPQGEVEEVDEFAS 490

Query: 392  FCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLL 451
                  TL   +    ++VQVT G VR+       + +EW  P G ++  A+ N S VLL
Sbjct: 491  VALDETTLAAANISQGRVVQVTGGRVRVSDLDGGMITSEWVPPSGETITAASVNDSHVLL 550

Query: 452  ATGGGHLVYLEI-GDGILTEVKHAQLEYEISCLDINPIGENPSYSQ-IAAVGMWTDISVR 509
            + GG  ++ L + G  +LTE K   +E +++C+ +      PS S  +  +G W +  + 
Sbjct: 551  SLGGVSVITLNMDGLKVLTEKKFG-VESQVACIAL------PSTSSSMCFIGFWKNSQLA 603

Query: 510  IFSLPDLNLITKEHLGGEIIPRSVLLCAF--EGISYLLCALGDGHLLNFLLNMKTGELTD 567
            I SL  L  +    +  + +PRS+LL     +    L  A+ DG+++ +  +   GEL+ 
Sbjct: 604  ICSLDTLEAVKTVQVSEDSVPRSLLLTQIFPDQPPTLFAAMADGNVVTYSFDTSNGELSG 663

Query: 568  RKKVSLGTQPITLRTFSSKNTT-HVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNS 626
            RK + LGT+  T R     N   +VFA  + P++IY+S  +L+YS V  +  + +CPF+S
Sbjct: 664  RKSIVLGTREATFRALPRGNGLFNVFATCEHPSLIYASEGRLVYSAVTAENATTVCPFDS 723

Query: 627  AAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCA 686
             A+P S+AIA   +L I  +D  +  H++++ + E  RRI +    + F + ++K    A
Sbjct: 724  EAYPGSVAIATSDDLRIALVDTERTTHVQTLKVDETVRRIAYSPGLKAFGLGTVKRILKA 783

Query: 687  EESEM--HFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSN---VYYCVGTAYVLP 741
             E  M  HF +L+D+  F+ + TY L+  E    ++ C  +D S      + +GTAY+  
Sbjct: 784  GEEIMLSHF-KLVDEIQFKELDTYALNEEELVECVMRCDLADGSGGTAERFVIGTAYLDD 842

Query: 742  EENEPTKGRILVF-IVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRD 800
            + +   +GRIL+  +  +  L+L+ E   KG    L    GK++AA+ + I +Y    R 
Sbjct: 843  QNSTVERGRILILEVTPERVLKLVTEIAVKGGCRCLAMCEGKIVAALIKTIVVYDIEYRT 902

Query: 801  DGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEER----ARDY 856
                +L            + +   G  I + DLMKS+ ++ Y+  E  + ++    AR +
Sbjct: 903  QSKPDLVKAATFRCSTAPIDITVNGTQIAIADLMKSMVVVEYQRGETGLPDKLVEVARHF 962

Query: 857  NANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRH 916
               W +AV  +D++ YL ++   NL  + ++ +G TD+++ RL V  E  LGE VNR R 
Sbjct: 963  QVTWATAVAEVDENTYLESDAEGNLLVLYRDPKGVTDDDKRRLNVSSEMLLGEMVNRIRR 1022

Query: 917  GSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNH 976
               V   PD+ V  +P    GTV G I + A +       L  LQ+NL  ++   G ++ 
Sbjct: 1023 ID-VATAPDAVV--VPRAFMGTVEGSIYLFALISQNYLDLLITLQSNLGNLVVSPGNMDF 1079

Query: 977  EQWRSFNNEKKTVDAKN-FLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRL 1035
             ++R+F N+ +T +  N F+DG+LIE FLD       +  + + V +E++   VE L RL
Sbjct: 1080 AKFRAFKNQVRTEEEPNRFVDGELIERFLDCEEDVQRKAIEGLGVELEDIRSLVEGLRRL 1139

Query: 1036 H 1036
            H
Sbjct: 1140 H 1140


>gi|256088964|ref|XP_002580590.1| DNA repair protein xp-E [Schistosoma mansoni]
          Length = 1329

 Score =  503 bits (1296), Expect = e-139,   Method: Compositional matrix adjust.
 Identities = 331/1028 (32%), Positives = 531/1028 (51%), Gaps = 144/1028 (14%)

Query: 3    DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
            DVPI  +I    LFR      D LF+ T +    +++   + +S E +T A G V DR  
Sbjct: 54   DVPINAKIVAASLFRRKDRETDSLFLLTHKAGVAIIECVRNNDSVEFVTVASGSVEDRSA 113

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQ-LKEAF--NIRLEELQVLDIKFL 117
            R  D G   +IDP    I + LY GL K+I     G+ +   F    ++EE  ++D+ F+
Sbjct: 114  RIIDQGFDVLIDPGANYIVVRLYHGLLKIILLQCIGEKIGTDFLDTNQIEEGNIVDMAFI 173

Query: 118  YGCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGV 177
            YG + PT  ++Y+D     H+KTYE+  ++        + ++++  + LLIPVP P  GV
Sbjct: 174  YGYSLPTFAMIYEDEL-VLHMKTYEIYGREPVLRNVQLTLDSIEPDSKLLIPVPKPYGGV 232

Query: 178  LIIGEETIVYCSANA---FKAIP-IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHE 233
            +++G+  I Y + +     + IP  + S    Y  VDA   RYLLGD AG L+++ +  E
Sbjct: 233  ILVGDNIICYHTKDGPHISQYIPQAKASQVLCYAAVDA--QRYLLGDMAGRLYMVHLLSE 290

Query: 234  -----------------KEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKL 276
                               ++  ++IELLGET+   +I+YLDN VV+IGS+ GDSQLI+L
Sbjct: 291  DISAAANNGTSNSDSLSAVRIGSIRIELLGETATPESIAYLDNGVVFIGSTLGDSQLIRL 350

Query: 277  NLQPDA-KGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIG 335
            N  PD  + SY+ +LE Y N+GPIVD  +V LE +GQ Q++TCSGAYK+GSLR++RNGIG
Sbjct: 351  NPDPDPERNSYITILETYTNIGPIVD--MVLLETKGQNQLITCSGAYKEGSLRVIRNGIG 408

Query: 336  INEQASVELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQ 395
            I+E A+++   IKG W     +D  FD  +VVS + +T++L +  +D++    +EGF + 
Sbjct: 409  IHEHATIDQDLIKGAWCFPIESDR-FDDTIVVSMVGQTQLLHL-ADDDITALHLEGFKTD 466

Query: 396  TQTLFC--------------------HDAIYNQLVQVTSGSVRLV---SSTSRELRNEWK 432
             QT++C                    ++ +   L+Q T+  +RL+   S  S+    EWK
Sbjct: 467  EQTVYCATLSPANDSPRPESMMIDENNNTLDPLLLQATTSGLRLIGIQSLCSKGCLTEWK 526

Query: 433  SPPGYSVNVATANASQVLLATGGGHLVYLEIGDG---ILTEVKHAQLEYEISCLDINPIG 489
             P G  ++  ++    +++A+G    V   +G+       +V H Q+ +E++C+D+ P  
Sbjct: 527  PPTGRGISCLSSFRHTIVVASGTELYVLKVVGESNNPKFEQVSHRQMSHEVACIDLTPFN 586

Query: 490  ENPSYSQI-----------------------AAVGMWTDISVRIFSLPDLNLITKEHL-- 524
             + + + I                        AVG+W    + +  LP+L L+ +E L  
Sbjct: 587  RDRAIAAINAVSSNHLDSTSSSTPDKSVPYLVAVGLWLGHGLALLKLPNLELVHEEPLPE 646

Query: 525  -----GGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGE--LTDRKKVSLGTQP 577
                 G  ++PRSVL+   E I+YL  A+GDG L  + ++       L D K+VS GT P
Sbjct: 647  TTASTGTALLPRSVLIAQLEDIAYLFAAMGDGTLYFYTIDPSEDHVCLRDPKRVSAGTGP 706

Query: 578  -ITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIA 636
             + LR + S+   +VF  S+ P VIYS   KL+++N+NLKEV+ M P N   + D +A+ 
Sbjct: 707  SMFLRQWRSQRKVNVFVCSNHPCVIYSIKNKLIFANLNLKEVNFMTPLNGLFYSDCIALV 766

Query: 637  KEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVR- 695
                L IG++D+IQKLH+R++PL E P+R+  Q ++ +  + + + +   E      VR 
Sbjct: 767  TPTALIIGSVDEIQKLHVRTLPLEETPKRLALQSETSSLGVITYRQEMFQEGMGFKPVRS 826

Query: 696  -----------------------LLDDQTFEFISTYPLDTF------------------- 713
                                      ++ F  I    L  F                   
Sbjct: 827  SISLSQKIPKSASRLPKTAPSSVSATERKFREIEVSSLLIFNQSNLEIQFAHNFYFSQTL 886

Query: 714  -EYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFI--VEDGKLQLIAEKETK 770
             E   SI S      +   + VGTA+++ +E EP+KGRI +F    E  +L  +   +  
Sbjct: 887  VEVAVSIASVQSDIHNGPLFAVGTAFLVEDEVEPSKGRIHLFRWDPESSRLDTVLVHDVN 946

Query: 771  GAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVV 830
            G+VY +  FNG+LLAAIN  ++L+   +++D    L+  C  + +I+ L+++ +GDF++V
Sbjct: 947  GSVYRIVDFNGRLLAAINSSVRLFD--IKEDS---LRLACSFNENIIVLFLRRKGDFVLV 1001

Query: 831  GDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEG 890
            GDLM+S++LL++K      E   R  +  W + +EILDD+ +L AE   NLF V ++   
Sbjct: 1002 GDLMRSLTLLLFKSNVNNFEAIGRHRHPRWTTCIEILDDEHFLAAEVENNLFVVSRDLPE 1061

Query: 891  ATDEERGR 898
             T E   R
Sbjct: 1062 NTKEPSFR 1069



 Score = 91.3 bits (225), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 58/180 (32%), Positives = 95/180 (52%), Gaps = 32/180 (17%)

Query: 885  RKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQI--PTVIFGTVNGV 942
            + NS G + E + RL     +H G+FVN F  G+LV++  +     +  P  ++GTVNG 
Sbjct: 1154 KPNSSGVSGEMQ-RLVDCAYFHTGDFVNVFVRGNLVLQNAEERWTAVGYPGHLYGTVNGG 1212

Query: 943  IGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIES 1002
            +G++  +    + FL++++  L  ++  VGG +H+ WR+F  +++   A NF+DGDLIE+
Sbjct: 1213 LGLLVQVSPVLFAFLKEVEFRLSNLVVPVGGFSHDTWRAFKADREIKMAHNFVDGDLIET 1272

Query: 1003 FLDLSRTRMDEISK--------------------------TMNVSVEELCKRVEELTRLH 1036
              DLS   MD+ +K                          T   +VE+L K VEE++RLH
Sbjct: 1273 VTDLS---MDDKAKLVKGLRIPVNMNEFGTAGSTCTTDPETRECTVEDLVKVVEEMSRLH 1329


>gi|407923753|gb|EKG16818.1| Cleavage/polyadenylation specificity factor A subunit [Macrophomina
            phaseolina MS6]
          Length = 1129

 Score =  501 bits (1290), Expect = e-139,   Method: Compositional matrix adjust.
 Identities = 346/1095 (31%), Positives = 574/1095 (52%), Gaps = 84/1095 (7%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPTD 64
            +YG++  L   RP  E  + LF+ T+RY +  + W  E  +L T +   D++D+  R   
Sbjct: 55   VYGKVTMLNRLRPMNEPTEHLFVGTDRYYYFTMAWSPEEKQLKTVKTYQDLADKAARDAQ 114

Query: 65   NGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLK---------EAFNIRLEELQVLDIK 115
             G   ++DP  R + L LY+G+  V+P   KG+ K         E    R+EE+ V    
Sbjct: 115  TGDRSLLDPTGRFMTLELYEGIVTVVPLTEKGKRKGDPEVSALGEPVPSRIEEMFVRSSA 174

Query: 116  FLYGCA----KPTIVVLYQDNKDAR-----------HVKTYEVALKDKDFVEGPWSQNNL 160
            FL+  +    KP + +LY++++D++              T E ++   + VEG   +  L
Sbjct: 175  FLHRKSPESEKPLVALLYEEDEDSKIRLRLRQLAFQTAGTEEQSVAALEPVEG--LKEEL 232

Query: 161  DNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIRP----SITKAYGRVDADGSRY 216
            D GA  LIPVP P  GVL++GE  I Y + +  KA+  +P    +I  A+ ++D    R+
Sbjct: 233  DLGASHLIPVPGPCYGVLVLGETCITYFN-DYTKALVKKPLQDSTIFVAWEQID--NQRF 289

Query: 217  LLGDHAGLLHLLVITHEKEK--VTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLI 274
            LL D  G L+L ++  +     V G +++ +GETS AS + YLD   V++GS  GDSQ+I
Sbjct: 290  LLADDFGGLYLFMLLLDDNSGVVEGWRLDKIGETSRASVLVYLDAGHVFVGSHEGDSQVI 349

Query: 275  KLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDL-ERQGQGQ----------VVTCSGAYK 323
            ++      +GS +EV++ + N+ PI+DF ++D+  R G+GQ          +VT SGA+K
Sbjct: 350  RIT-----EGS-IEVVQTFHNIAPILDFTIMDMGNRSGEGQSNEYSSGQARIVTGSGAFK 403

Query: 324  DGSLRIVRNGIGINEQASV-ELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLED 382
            DGSLR VR+G+G+ +Q ++ +L  I  +++  S+    +D  L+VS ++E+RI  ++ + 
Sbjct: 404  DGSLRSVRSGVGLEDQGAIGDLGSISNIFAFSSTASGEYDDTLLVSLVNESRIFCVDADG 463

Query: 383  ELEETE-IEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNV 441
            EL+E E  +G      TL   +    +L+QVT  +VR++   +  + +EW +P G  +  
Sbjct: 464  ELDERESFKGLQLDESTLMTKNVPNYRLLQVTQFAVRIMDLENDMVVSEWHAPEGQVITD 523

Query: 442  ATANASQVLLATGGGHLVYLEIGDGI-LTEVKHAQLEYEISCLDINPIGENPSYSQIAAV 500
              +N   ++L+ GG   + L I D I     K    + +ISC+ +    +N     I  V
Sbjct: 524  VDSNDRFIILSIGGVRAIILSIADTIHAVAEKDFGADSQISCVAMPQAADN-----ICFV 578

Query: 501  GMWTDISVRIFSLPDLNLITKEHLG--GEIIPRSVLLCAF--EGISYLLCALGDGHLLNF 556
            G W + +V I  L  L  I    L   G  +PRS+LL          L  A  DG+++ F
Sbjct: 579  GFWQNCTVAILGLDGLQTIQTISLSEDGSSVPRSLLLTQIFANQPPTLFIATADGYVVTF 638

Query: 557  LLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTH-VFAASDRPTVIYSSNKKLLYSNVNL 615
             ++  T  L+ RK   LGTQ    R     N  + VFA  + P++IY S  +L++S V  
Sbjct: 639  SMDPSTFALSGRKSTVLGTQQANFRALPRGNGLYNVFATCEHPSLIYGSEGRLVFSAVTA 698

Query: 616  KEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTF 675
            ++ + +CPFN+ A+P S+AIA  GEL +  +D+ ++ H++++ + E  RRI +  Q + F
Sbjct: 699  EKATCVCPFNAEAYPRSIAIAASGELHLAVVDEERRTHVQTLHVNETVRRIAYSPQLKAF 758

Query: 676  AICSLKN--QSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSD---DSNV 730
             + ++K   +   E  + HF RL D+  F+ +  + ++ +E     +     D   ++  
Sbjct: 759  GLGTIKRVLRDREEVVQGHF-RLADEVIFKELDNFEMNEYEIVECAIRAELDDGDGETAE 817

Query: 731  YYCVGTAYVLPEENE-PTKGRILVF-IVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAIN 788
             + VGT++++ EE +  T+GRILVF + ED +L++IAE  TKGA   L   + K++A + 
Sbjct: 818  RFIVGTSHLVEEEEQGSTRGRILVFEVTEDRRLKVIAEISTKGACRCLAMVDNKIVAGLI 877

Query: 789  QKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEG- 847
            + + +Y +      T  L  +         + +   G+ I V DL+KS+S+L YK   G 
Sbjct: 878  KTVVIYSFEYSTPSTPFLVKKASFRTSTAPIDITVTGNQIAVADLIKSVSVLEYKPGAGD 937

Query: 848  ---AIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGE 904
                ++E AR    +W  A+  +D++ YL A+   NL  + ++  G T+E+R RL + G+
Sbjct: 938  QSDELKEVARHVQVSWSMALAEVDENTYLQADAEGNLILLERDVSGVTEEDRKRLMLRGD 997

Query: 905  YHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNL 964
              LGE VNR R   +      SD   IP   F TV G I + A +   +   L +LQ+ L
Sbjct: 998  MLLGEQVNRIRRIDMATV---SDAPVIPRAFFATVEGSIYLFALIAPAKVDLLIRLQSQL 1054

Query: 965  RKVIKGVGGLNHEQWRSFNNEKKTVDAKN-FLDGDLIESFLDLSRTRMDEISKTM--NVS 1021
               ++  G     ++R+F N+ +  D  N F+DGDLIE FLDL     +E+ K +  ++ 
Sbjct: 1055 ADFVRSPGHYPFLRYRAFRNQVREEDEPNRFVDGDLIERFLDLKPREQEEVVKGVYADLD 1114

Query: 1022 VEELCKRVEELTRLH 1036
            VEE+   VE L RLH
Sbjct: 1115 VEEVRSLVETLRRLH 1129


>gi|449019082|dbj|BAM82484.1| UV-damaged DNA binding protein [Cyanidioschyzon merolae strain 10D]
          Length = 1372

 Score =  498 bits (1281), Expect = e-137,   Method: Compositional matrix adjust.
 Identities = 380/1216 (31%), Positives = 577/1216 (47%), Gaps = 228/1216 (18%)

Query: 21   EAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDNGQIGIIDPDCRLIGL 80
            ++ D +FI TER  F V+++D     + T   G ++ ++GR    G+   +DP  R + +
Sbjct: 185  QSTDAVFILTERLDFAVIKYDTRLQGVATLVAGSLASKLGRRCPLGEFVALDPSARCLAV 244

Query: 81   HLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGCAKPTIVVLYQDNKDARHVKT 140
            + YDGL KV+P    G   EAF +RL+ +QV  ++FL G   PT+ +++ D    RH+ +
Sbjct: 245  YAYDGLLKVVPGSLMG---EAFEVRLDVVQVQSMEFLIGTEHPTLAIIHTDYLHNRHLVS 301

Query: 141  YEVALKDKDFVEGPWSQNNLDNGADLLIPVPPP--------LCGVLIIGEET-------- 184
            YE+ L  +D  EGP +Q  LD  A+L+IPVPP          CG  ++  +T        
Sbjct: 302  YEILLGRQDVREGPIAQQYLDASAELIIPVPPAPNDSANDGACGGFVVACDTSLALHPGR 361

Query: 185  ------IVYCSANAFKAI----PIR------PSITKAYGRVD----------ADG--SRY 216
                  +++     F  +    PI        S +    R D          A G  + Y
Sbjct: 362  SGVAKQVIHFHGGQFAPMRCFCPISGIGGSSKSASVEANRQDHFETGLEMLAAAGYHAHY 421

Query: 217  LLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKL 276
            LLGD  G L++L +  E     G+++E + E+S+AS I+YL + +V++GS+ GDSQL++L
Sbjct: 422  LLGDRDGFLYVLSVGAE----VGMRLECVCESSVASAIAYLGSDLVFVGSALGDSQLVRL 477

Query: 277  -------------NLQPDAKGS-----------------YVEVLERYVNLGPIVDFCVVD 306
                          L   A  S                 + +V++ Y N+GPI D  V  
Sbjct: 478  LPSQTESPELVQTTLFATASASSHERRESRAERAECQLWFGQVVQCYTNIGPIQDLLVTG 537

Query: 307  LERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDPFDTFLV 366
             +    G ++TCSG  + GSLRI+RNGIG  E A+VEL GIK +++L S T   +D FLV
Sbjct: 538  DDSFSDGHIITCSGVSRMGSLRIIRNGIGFVEHAAVELDGIKALFTLPSLTSPEWDEFLV 597

Query: 367  VSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQL-VQVTSGSVRLVSSTSR 425
            VSF +ETR+L +   DEL   E+E       T+        QL + VT+  V LV  T  
Sbjct: 598  VSFTAETRVLRLAAHDEL--VEVESLAVDEATILAMRLPAEQLALWVTASYVGLVDLTLL 655

Query: 426  ELRN-EWKSPPGYSVNVATANASQ--VLLATGGGHLVYLEIGDGILTEVKHAQLEYEISC 482
            E     W  P    +  A  +     VL++T    L  L+     L  V    L  E++C
Sbjct: 656  ERPAVAWTPPSSEQITNAVFDELHKLVLVSTSNAQLYVLQQRKTSLVPVGSQTLPAEVAC 715

Query: 483  LDINPIGENPSYSQIAAVGMWTDISVRIFSLP-------DLNLITKEHLGGEIIPRSVLL 535
            +       + +Y  I A+G W +  +R+F L        +L  I +  L    +PRSVLL
Sbjct: 716  I-------HAAYG-IVALGTWAESRIRLFRLHEDTGNRWELECIRESALPSTSVPRSVLL 767

Query: 536  CAFE----------GISY--LLCALGDGHLLNFLLNM----KTG-----------ELTDR 568
               +          G ++  LL A+GDG L  F ++     ++G           +L   
Sbjct: 768  TYLDDHGGLMADHGGRAHLCLLVAVGDGRLFAFNVSQPDAKRSGPEPEDLSKCELQLQHP 827

Query: 569  KKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAA 628
            +++ LG++P  L         +VFAA  R +VI++ +  L   NVNL++V+    F++  
Sbjct: 828  RQLRLGSRPAALNNLQLHGMRYVFAACGRASVIHAHHGTLFCGNVNLRDVTRAVRFHTKG 887

Query: 629  FPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEE 688
            FPDS+A+A E  L +G I+ IQ+LHIR   L E PRRI H    R + IC L   S    
Sbjct: 888  FPDSIAVATEQGLALGGIEHIQQLHIRRHDLREQPRRIAHL---RGY-IC-LLTVSIVFG 942

Query: 689  SEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTK 748
             E H VRLL+D+T E I+++ L   E+G S+++    D     + VGTAYVLP E EP++
Sbjct: 943  EERHHVRLLNDETLETITSHDLAMNEHGLSLVAIPERD----VFVVGTAYVLPSEMEPSR 998

Query: 749  GRILVFIVEDGKLQLIAEKETKGAVYSLNAF-------------NGKLLAA-INQKIQLY 794
            GRILVF     +L L+ E  T GAVY+++A                + LAA +N  + LY
Sbjct: 999  GRILVF--SREELLLLNELYTPGAVYTMSALADPSDRTCRFPASAARFLAAGVNNVVILY 1056

Query: 795  KWMLRDDG-TRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEG------ 847
             W     G   EL+    H GH+L L ++ RGD ++VGDLMKS+ +L     EG      
Sbjct: 1057 DWGQSGHGDDYELREVARHLGHVLVLRLEARGDQLLVGDLMKSLCVLQLVLPEGETSDGA 1116

Query: 848  --AIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEY 905
               ++  A DY   W++A   L++D YL A+N++NL ++++N      E R  L   G +
Sbjct: 1117 SPCLKAVAWDYETAWITACAFLNEDTYLAADNSYNLLSLQRNPHETRSEFRHALNRAGAF 1176

Query: 906  HLGEFVNRFRHGSLVMRLPDS------------------DVGQIP------------TVI 935
            HLG+ VN FR G LV     +                  DV +              T++
Sbjct: 1177 HLGDLVNVFRRGKLVTEASGNEEAGTGNGHSTIDTESTRDVARASTGTTTADNVSRQTLL 1236

Query: 936  FGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIK-------------------------G 970
            F T  G IG+I  L   Q+  L +++  LR +                           G
Sbjct: 1237 FATTAGAIGIIVPLDPAQHRMLSRVEKALRSLTDHPAEAGAADNGHERVRRSPLTSIRLG 1296

Query: 971  VGGLNHEQWRSFNNEKK----------TVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV 1020
            V G  H  WR+  +E+            +  + F+DGDL+E F +L  T M+ ++  +  
Sbjct: 1297 VEGFAHADWRTPLSERALNSLHSMTCFDLPRRGFVDGDLVERFRELEPTEMEFVAAQVGA 1356

Query: 1021 SVEELCKRVEELTRLH 1036
            S+E +   V++L RLH
Sbjct: 1357 SIEHIKLLVDDLARLH 1372


>gi|390342012|ref|XP_793599.3| PREDICTED: uncharacterized protein LOC588842 [Strongylocentrotus
           purpuratus]
          Length = 1161

 Score =  494 bits (1271), Expect = e-136,   Method: Compositional matrix adjust.
 Identities = 254/513 (49%), Positives = 347/513 (67%), Gaps = 42/513 (8%)

Query: 44  SSELITRAMGDVSDRIGRPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFN 103
           + E+I +A G+V DRIGRP++ G IGIIDP+CR+IGL LYDGLFK+IP D   +  +AFN
Sbjct: 28  TEEIIPKAQGNVQDRIGRPSETGPIGIIDPECRMIGLRLYDGLFKIIPLDRDNKELKAFN 87

Query: 104 IRLEELQVLDIKFLYGCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNG 163
           IRLEEL V+D++FLYGC +PTIV L+QD    RHVKTYEV L++K+F  GPW Q+N++  
Sbjct: 88  IRLEELNVIDVQFLYGCHQPTIVFLHQD-PHGRHVKTYEVNLREKEFNRGPWKQDNVETE 146

Query: 164 ADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGD 220
           A ++I VP P  G LIIG+E+I Y   + + AI    I+ S    YGR+D +GSRYLLGD
Sbjct: 147 ATMVIAVPQPYGGALIIGQESITYHKGDNYVAIAPPTIKNSTLVCYGRLDNNGSRYLLGD 206

Query: 221 HAGLLHLLVITHEKE-----KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIK 275
             G L LL++  E+       V  LK+E LGETSIA  ++YLDN VV+IGS  GDSQL++
Sbjct: 207 LTGRLFLLLLDKEESMDGAATVKDLKLEFLGETSIAECLTYLDNGVVFIGSRLGDSQLVR 266

Query: 276 LNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIG 335
           LN + D  GSYV ++E + NLGPIVD  VVDL+RQGQGQ+VTCSGAYK+GSLRI+RNGIG
Sbjct: 267 LNTESDESGSYVTMMETFTNLGPIVDMAVVDLDRQGQGQLVTCSGAYKEGSLRIIRNGIG 326

Query: 336 INEQASVELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQ 395
           I+E AS++L GIKG+W L+   +  FD  LV+SF+ +TR+ +                  
Sbjct: 327 IHEHASIDLPGIKGIWPLKVDMNSQFDDTLVLSFVGQTRVKSF----------------- 369

Query: 396 TQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGG 455
                        +  +T  SVRLVS+ ++ + +EW+   G +++VA+ N+ QV+ A  G
Sbjct: 370 -------------INYITGASVRLVSTPTKRMVSEWRPQSGKNISVASCNSKQVVCA-AG 415

Query: 456 GHLVYLEIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPD 515
             + YLEI +G + +V    +E+E++CLDI P+  +  ++ + AVG+WTDIS RI  +P 
Sbjct: 416 SDIFYLEIFEGEVRQVSTVTMEHEVACLDITPLTGD--FTDLCAVGLWTDISARILKIPT 473

Query: 516 LNLITKEHLGGEIIPRSVLLCAFEGISYLLCAL 548
           +  +  E LGGEIIPRS+LL  FEG +Y+LCAL
Sbjct: 474 MESMHVELLGGEIIPRSILLMTFEGQNYILCAL 506


>gi|430814207|emb|CCJ28534.1| unnamed protein product [Pneumocystis jirovecii]
          Length = 904

 Score =  491 bits (1263), Expect = e-135,   Method: Compositional matrix adjust.
 Identities = 309/909 (33%), Positives = 511/909 (56%), Gaps = 69/909 (7%)

Query: 122  KPTIVVLYQDNKDARHVKTYEVAL--KDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLI 179
            +PTI VLYQD+K A+H+ +YE+ +  ++K   EGP    +LD G+ LLIP+     G++I
Sbjct: 8    RPTIAVLYQDSKYAKHLSSYEIIVQSREKRMKEGPLKSRDLDIGSSLLIPLLDS--GIII 65

Query: 180  IGEETIVYCSANA-FKA---IPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE 235
            +GE+T++Y   N+  K+   IPI P++  +Y  ++   ++++L D  G + +L +++  +
Sbjct: 66   VGEQTLMYIHPNSEIKSKFIIPI-PTVFSSYTTIN--NNKHILADDYGRIFMLTLSNSYK 122

Query: 236  KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVN 295
                +KI  +G TSIAS + YL N+ ++IGS YGDSQL+ +   PD       VL+ + N
Sbjct: 123  DSNFMKISQIGITSIASVLVYLPNSYLFIGSHYGDSQLVNI---PDCL-----VLQSFPN 174

Query: 296  LGPIVDFCVVDLERQGQGQ-VVTCSGAYKDGSLRIVRNGIGINEQASVE-LQGIKGMWSL 353
            + PI DFC V   R+G+ + +VTCSGAYKDGSLRI+R  + +N+   +  L GI G+W L
Sbjct: 175  ISPISDFCFV--TREGRNEFIVTCSGAYKDGSLRILRYNVEMNKTLEISNLNGIYGIWGL 232

Query: 354  RSSTDDPFDTFLVVSFISETRILAM--NLEDELEETEIEGFC---SQTQTLFCHDAIYNQ 408
                +  + T LV+SF++ETRIL +  N+ +E E  E + F    ++  TL   +   N 
Sbjct: 233  YLQNEFEY-TALVLSFVNETRILKVFHNILNEPEIEEWDNFAIPNAKLPTLVAKNVNNNL 291

Query: 409  LVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGIL 468
               +++ S+ L+   S  +  EW       +  A  +     ++   G +V   + +  +
Sbjct: 292  FCFISNKSICLIDWKSNLVLKEWIPTADDIITCACLDTEFASVSLTKGKIVVFSLKNMTI 351

Query: 469  TEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEI 528
             E+   +  YE+SC+DI       S + + +VG+W   S+ I S+P + L+    L G +
Sbjct: 352  VEIGEYKFNYEVSCIDI-------SNNALISVGLWIVPSIHILSIPSMELLLSHSLLGTV 404

Query: 529  IPRSVLLCAFEGIS--YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSK 586
            +PRS+ + +   ++   +L  +GDG LL++ L+           +  GT PI L  F + 
Sbjct: 405  VPRSICIVSLASMNKPVILVGMGDGTLLSYGLD----------GLDKGTLPINLSKFITP 454

Query: 587  NTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTI 646
               +VFA SDR  +IY S+ KL +S++NLKE++ M  F S+ F  ++ +  E  + IG+I
Sbjct: 455  IGMNVFAISDRSIIIYGSSGKLSFSSINLKEINCMSSFISSIFSSTIVVVSENIIKIGSI 514

Query: 647  DDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLK---NQSCAEESEMHFVRLLDDQTFE 703
            D +Q+L I++IPLGE PRRIC+ ++ + F + ++K     S   E +  ++++LD  +F+
Sbjct: 515  DSLQQLQIQTIPLGELPRRICYHDKQKVFGVLTIKLSLEASNGNEVQTSYLKILDVTSFD 574

Query: 704  FISTYPLDTFEY---GCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGK 760
             I    LD+F+     C     S + D+   + VGT + LPEE E +KGRI++F V + K
Sbjct: 575  GI----LDSFQLELNECVQCITSVTIDNQDIFVVGTGFSLPEEEESSKGRIILFGVTNKK 630

Query: 761  LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY 820
            + + +E +   AVY +   + K++A IN  + +Y +   D   +       +    L L 
Sbjct: 631  IWVFSEIQVNDAVYCIGIIDNKIIAGINALVHIYAY---DSSLKNFNVIATYRSTTLCLS 687

Query: 821  VQTRGDFIVVGDLMKSISLLIYKHEEGA--IEERARDYNANWMSAVEILDDDIYLGAENN 878
            +   G  +++GDLMKS+SLL + + E    ++E A+D N  WM+ V  LD+D+Y+GAE  
Sbjct: 688  LAVHGTHVIIGDLMKSVSLLAFINTENGPRLKEVAKDCNPLWMTCVAALDNDLYIGAEAE 747

Query: 879  FNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGT 938
             NL    K+    T  E  +L+++ E   GE VN+ + G++   L   +   IP   F T
Sbjct: 748  GNLSLFWKDFN--TTFEENKLQIISEIKWGELVNQIKPGTI---LYSENSIIIPKATFVT 802

Query: 939  VNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSF-NNEKKTVDAKNFLDG 997
            V+G IG+I ++  E   FL  LQ+N+ K+I G+G LNH  WR+F N  KK+ + K F+DG
Sbjct: 803  VDGSIGIIFTVKREYLEFLVNLQSNMGKIISGIGCLNHSNWRAFCNRRKKSNEPKCFIDG 862

Query: 998  DLIESFLDL 1006
            D +E F++L
Sbjct: 863  DFVEIFINL 871


>gi|170589357|ref|XP_001899440.1| CPSF A subunit region family protein [Brugia malayi]
 gi|158593653|gb|EDP32248.1| CPSF A subunit region family protein [Brugia malayi]
          Length = 655

 Score =  483 bits (1244), Expect = e-133,   Method: Compositional matrix adjust.
 Identities = 264/658 (40%), Positives = 387/658 (58%), Gaps = 71/658 (10%)

Query: 445  NASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWT 504
            N  Q+++A G   L+YLE        +   + E+EISC+DI PIG+    S+I AVG WT
Sbjct: 3    NLGQLVVACGA-LLIYLEANSAGFKVITEIECEFEISCIDITPIGKGTLRSEICAVGYWT 61

Query: 505  DISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGE 564
            D+SV + +LP L  + +E + G+++ RS++L   EG  YLL ALGDG +  F ++MKTG 
Sbjct: 62   DLSVALRALPQLVEVVREKIVGDMLSRSIMLSPMEGHVYLLVALGDGTVHYFQIDMKTGA 121

Query: 565  LTDRKKVSLGTQPITLRTFSSKNTT--HVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMC 622
            L D KK +LGTQPI LR F S+ +   ++F  SDRP VIYSSN+KLL+SNVNL+ VS M 
Sbjct: 122  LLDPKKATLGTQPIHLRKFRSRCSPVHNIFVCSDRPAVIYSSNQKLLFSNVNLRMVSTMT 181

Query: 623  PFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAI----- 677
            P  + A+PD+L +     L IG IDDIQKLHIR++PLGE P RI +Q ++ T A+     
Sbjct: 182  PLYAEAYPDALVLTDGHSLVIGRIDDIQKLHIRTVPLGESPSRIAYQPETNTIAVIVERL 241

Query: 678  ----------------------CSLKN------------------QSCAEESEMHFVRLL 697
                                  C+ KN                  +  AEE E+  V LL
Sbjct: 242  EVILFLFFYVFVDAMGKHHFGQCASKNAMETSSSRLSSMRREPTPECLAEEMEVSSVLLL 301

Query: 698  DDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVE 757
            D  TFE + ++ L+  E   S+ SC   DDS  Y+ VGTA ++ +E E   GRI++F   
Sbjct: 302  DSNTFEILHSHELEGSEMAMSLASCQLGDDSQPYFVVGTAVIMSDETESKMGRIMMFQAS 361

Query: 758  DG--KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGH 815
            +G  +++L+ EKE KGA YS+ + +GKL+ A+N  ++L++W       +EL+ EC    +
Sbjct: 362  EGPERMRLVYEKEIKGAAYSIQSMDGKLVVAVNSCVRLFEW----TADKELRLECSDFDN 417

Query: 816  ILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGA 875
            + ALY++T+ D I+VGDLM+S+SLL YK  E   E+ ARD+  NWMSA EI+D D +LGA
Sbjct: 418  VTALYLKTKNDLILVGDLMRSLSLLSYKSMESTFEKVARDFMTNWMSACEIIDSDNFLGA 477

Query: 876  ENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVI 935
            EN++NLFTV K+S     EE  RL+ +G ++LGE VN F HGSL     D       +++
Sbjct: 478  ENSYNLFTVMKDSFTVFKEEGTRLQELGLFYLGEMVNVFCHGSLTATQVDVAPLYHSSIL 537

Query: 936  FGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFL 995
            +GT +G IGVI  +P   Y FL+ +Q  L +  +    ++H Q+R+F  EK++     F+
Sbjct: 538  YGTSDGGIGVIVQMPPVLYTFLQDVQKRLAEYAENCMRISHTQYRTFETEKRSEAPNGFI 597

Query: 996  DGDLIESFLDLSRTRMDEI-----------------SKTMNVSVEELCKRVEELTRLH 1036
            DGDLIES LD+ +  ++++                 ++ ++   E++ K VE+L+R+H
Sbjct: 598  DGDLIESLLDMGKDSVEQVVNGLKMPLLNSISSSETTELVDALAEDVLKLVEDLSRIH 655


>gi|452003044|gb|EMD95501.1| hypothetical protein COCHEDRAFT_1165632 [Cochliobolus heterostrophus
            C5]
          Length = 1116

 Score =  483 bits (1243), Expect = e-133,   Method: Compositional matrix adjust.
 Identities = 325/1078 (30%), Positives = 558/1078 (51%), Gaps = 63/1078 (5%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPTD 64
            IYG++  L+  RP     D LFI T+R+ +  L W+AE  +L T ++   V+D   R + 
Sbjct: 55   IYGKVTMLQKLRPALSPTDHLFIGTDRFMYFTLSWNAEKKQLQTEKSFASVADNAARESQ 114

Query: 65   NGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQ----------LKEAFNIRLEELQVLDI 114
             G+   IDP  R + + +Y+G+  VIP   +G+          L +   +RL E+ V   
Sbjct: 115  TGERCHIDPTGRFMTIEVYEGILTVIPLVQRGKKRKQEADIAHLGDPQPVRLPEMFVRSS 174

Query: 115  KFLYGCA---KPTIVVLYQDNKDARHVKTYEVAL--KDKDFVEGPWSQNNLDNGADLLIP 169
             FL   +   KP + +LY+D      ++  E+     + D  EG   ++ L+ G+  LIP
Sbjct: 175  AFLRPRSTDDKPKMALLYEDTHAQVKLRLRELTYVGDEVDLQEGEACKSELELGSSHLIP 234

Query: 170  VPPPLCGVLIIGEETIVYC--SANAFKAIPI-RPSITKAYGRVDADGSRYLLGDHAGLLH 226
            +  P  G+++IGE +I Y    +   +  P+   +I  A+ R+DA   R++L D  G L+
Sbjct: 235  LEEPSHGLVVIGETSIGYFDDESGDLQTEPLDEATIFVAWERIDA--QRFVLADDYGRLY 292

Query: 227  L-LVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGS 285
            + +++ + + +V   K++++G+TS AST+ YLD   V++GS  GDSQ+I++     A+ S
Sbjct: 293  MFMLVLNAQGRVQSWKLDVIGQTSRASTLVYLDAGYVFVGSHQGDSQVIRI-----AEKS 347

Query: 286  YVEVLERYVNLGPIVDFCVVDL-ERQGQGQ----------VVTCSGAYKDGSLRIVRNGI 334
             +E+++ + N+ PI+DF ++D+  R G+GQ          +VT SGAY+DGSLR VR+G+
Sbjct: 348  -MEIVQTFSNIAPILDFTIMDMGNRSGEGQTNEYSSGQARIVTGSGAYQDGSLRSVRSGV 406

Query: 335  GINEQASV-ELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEET-EIEGF 392
            G+ +   + E++ I  ++ LRS+    +   L+VSF+ E+R+   + + E+EE  E    
Sbjct: 407  GLEDLGVLGEMEHISELFCLRSAPSAQYTDTLLVSFVGESRVFRFDSQGEVEEVDEFVSL 466

Query: 393  CSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLA 452
                 TL   +    ++VQVT+G  R+       + +EW    G ++  A+ NA+ VL++
Sbjct: 467  ALDETTLAAANIPQGRIVQVTNGRARICDLDGGMMTSEWVPADGKTITAASVNATHVLVS 526

Query: 453  TGGGHLVYLEIGDGI-LTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIF 511
             GG  +V L + DG+ + + K    E +++C+ + P G     S    VG W +  + I 
Sbjct: 527  LGGVTIVSLSMADGLQVVKEKTFGAESQVACVTV-PSGT----SSTCFVGFWNNSQLAIC 581

Query: 512  SLPDLNLITKEHLGGEIIPRSVLLCAF--EGISYLLCALGDGHLLNFLLNMKTGELTDRK 569
            SL  L  +    +  + +PRS+LL     +    L  AL DG+++ +  +    EL+ RK
Sbjct: 582  SLDTLEAVKTVQISDDSVPRSLLLTQIFPDQPPSLFVALADGNVVTYTFDPSNHELSGRK 641

Query: 570  KVSLGTQPITLRTFSSKNTT-HVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAA 628
             + LGT+  T R     +   +VFA  + P++IY+S  +L+YS V  ++ + +CPF+S A
Sbjct: 642  SIVLGTREATFRALPRGDGLFNVFATCEHPSLIYASEGRLVYSAVTAEKATTVCPFDSEA 701

Query: 629  FPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKN-QSCAE 687
            +P S+AIA   +L I  +D  +  H++++ + E  RRI +    + F + ++K      E
Sbjct: 702  YPSSVAIATSQDLRIALVDTERTTHVQTLKVDETVRRIAYSPSLKAFGLGTIKRILKHGE 761

Query: 688  ESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSN---VYYCVGTAYVLPEEN 744
            E  +   +L+D+  F+ + TY L+  E    ++ C   D S      + VGTAYV  +  
Sbjct: 762  EIMLSHFKLVDEIQFKELDTYALNEEELVECVMRCELPDGSGGLAERFVVGTAYVDDQNT 821

Query: 745  EPTKGRILVFIVEDGK-LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGT 803
               +GRI+V  V   + L+L+ E   KG    L    GK++AA+ + I +Y +  R   T
Sbjct: 822  TSERGRIIVLEVTPERILKLVTEIAVKGGCRCLAICQGKIVAALIKTIVVYDFEYRAPST 881

Query: 804  RELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEER----ARDYNAN 859
             +L            + V   G  I + DLMKS+ ++ Y   E  + ++    AR Y   
Sbjct: 882  PDLVKLASFRCSTAPIDVTVNGSLIAIADLMKSLVIVEYTKGETGLPDKLVEVARHYQIT 941

Query: 860  WMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSL 919
            W +AV  +D + YL ++   NL  + ++  G TD+++ RL +  E  LGE VNR R    
Sbjct: 942  WATAVAEVDTNTYLESDAEGNLVVLYRDPNGVTDDDKRRLNISSEMLLGEMVNRIRR--- 998

Query: 920  VMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQW 979
            V  L  +D   +P    GTV G I +   +  +    L  LQ+NL  ++   G +   ++
Sbjct: 999  VDVLTAADAVIVPKAFVGTVEGSIYLFGLISPDYQNLLMTLQSNLGSLVAAPGDMQFAKF 1058

Query: 980  RSFNNE-KKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRLH 1036
            R+F N  ++  +   F+DG+L+E FLD+S     +    + V +EE+   +E L RLH
Sbjct: 1059 RAFKNSVREEEEPMRFVDGELVERFLDVSEEVQHKAIDGLGVELEEVKGLIEALRRLH 1116


>gi|312076590|ref|XP_003140929.1| CPSF A subunit region family protein [Loa loa]
          Length = 655

 Score =  481 bits (1239), Expect = e-133,   Method: Compositional matrix adjust.
 Identities = 261/652 (40%), Positives = 381/652 (58%), Gaps = 69/652 (10%)

Query: 450  LLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVR 509
            L+   G  L+YLE        +   + E+EISC+DI PIG     S+I AVG WTD+SV 
Sbjct: 8    LVIACGALLIYLEADSAGFKLISELECEFEISCIDITPIGNETLRSEICAVGYWTDLSVA 67

Query: 510  IFSLPDLNLITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRK 569
            + +LP L  + +E + G+++ RS++L   EG  YLL ALGDG +  F ++MKTG L D K
Sbjct: 68   LRTLPQLMEVVREKIPGDMLSRSIMLSPMEGHVYLLVALGDGTVHYFQIDMKTGALLDPK 127

Query: 570  KVSLGTQPITLRTFSSKNTT--HVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSA 627
            K +LGTQPI LR F S+ ++  ++F  SDRP VIYSSN+KLL+SNVNL+ VS M P  + 
Sbjct: 128  KATLGTQPIHLRKFRSRCSSVHNIFVCSDRPAVIYSSNQKLLFSNVNLRMVSTMTPLYAE 187

Query: 628  AFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAI---------- 677
            A+PD+L +     L IG IDDIQKLHIR++PLGE P RI +Q ++ T A+          
Sbjct: 188  AYPDALVLTDGNSLVIGRIDDIQKLHIRTVPLGESPSRIAYQPETNTIAVTVERLEFVDA 247

Query: 678  --------CSLKN------------------QSCAEESEMHFVRLLDDQTFEFISTYPLD 711
                    C+ KN                  +  AEE E+  + LLD  TFE + ++ L+
Sbjct: 248  MGKHHFGQCASKNAMETSSSRLSSMRREPTPECLAEEMEVSSILLLDSNTFEILHSHELE 307

Query: 712  TFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDG--KLQLIAEKET 769
              E   S+ SC   +DS  Y+ VGTA ++ +E E   GRI++F   +G  +++L+ EKE 
Sbjct: 308  GSEMAMSLASCQLGNDSQPYFVVGTAVIMSDETESKMGRIMMFQASEGPERMRLVYEKEI 367

Query: 770  KGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIV 829
            KGA YS+ + +GKL+ A+N  ++L++W       +EL+ EC    ++ ALY++T+ D I+
Sbjct: 368  KGAAYSIQSMDGKLVVAVNSCVRLFEWT----ADKELRLECSDFDNVTALYLKTKNDLIL 423

Query: 830  VGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSE 889
            VGDLM+S+SLL YK  E   E+ ARD+  NWMSA EI+D D +LGAEN++NLFTV K+S 
Sbjct: 424  VGDLMRSLSLLSYKSVESTFEKVARDFMTNWMSACEIIDSDSFLGAENSYNLFTVVKDSF 483

Query: 890  GATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASL 949
                EE  RL+ +G ++LGE VN F HGSL     D       ++++GT +G IGVI  +
Sbjct: 484  TVFKEEGTRLQELGLFYLGEMVNVFCHGSLTATQVDVAPLYHSSILYGTSDGGIGVIVQM 543

Query: 950  PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRT 1009
            P   Y FL  +Q  L    +    ++H Q+R+F  EK++     F+DGDLIES LD+ + 
Sbjct: 544  PPVLYTFLHDVQKRLADYTENCMRISHTQYRTFETEKRSEVPNGFIDGDLIESLLDMGKD 603

Query: 1010 RMDEI-------------------------SKTMNVSVEELCKRVEELTRLH 1036
             + +I                         ++ ++ + E++ K VE+L+R+H
Sbjct: 604  SVGQIVNGLKMPLLNIIRVITLLPVPSSETTELVDATAEDVLKLVEDLSRIH 655


>gi|452838792|gb|EME40732.1| hypothetical protein DOTSEDRAFT_177898 [Dothistroma septosporum
            NZE10]
          Length = 1138

 Score =  481 bits (1238), Expect = e-133,   Method: Compositional matrix adjust.
 Identities = 332/1105 (30%), Positives = 561/1105 (50%), Gaps = 98/1105 (8%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPTD 64
            +YG++  L   +P     D LF+ T+RY +  + W+ ++++L T +A  D++++  R + 
Sbjct: 58   LYGKVTLLHKLKPATSQTDHLFVGTDRYHYFTISWNLKTNDLNTEKAYVDIAEKAARDSQ 117

Query: 65   NGQIGIIDPDCRLIGLHLYDGLFKVIPF-----------DNK-GQLKEAFNIRLEELQVL 112
             G    IDP  R + +  Y+G+  V+P            DN+ G+L E   +R+ EL V 
Sbjct: 118  TGDRVHIDPTSRFMTIECYEGVVNVLPIAHAGKGKRKAADNEIGELHEPIPVRIPELFVR 177

Query: 113  DIKFLY------GCAKPTIVVLYQDNKDARHVKT----YEVALKDKD------FVEGPWS 156
               FL+          P   +L++D+ +   +K     Y  +L+  D        +G   
Sbjct: 178  SSCFLHRRQPTTKAVDPLFAILHEDSTNKVRLKIRELEYSPSLRPNDEPAVAELEKGHDV 237

Query: 157  QNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIRP--SITKAYGRVDADGS 214
            ++ L+ GA +LIP+P P+ G+L++GE +I Y     ++     P    T        D  
Sbjct: 238  EDTLELGASILIPLPSPMHGLLVVGETSISYIEEWQYRVANTEPLDEATIFVAWCSLDDQ 297

Query: 215  RYLLGDHAGLLHLLVITHEKE-KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQL 273
            RY+L D  G L+LL++   +E +  G +I++LGETS AST+ YLD   V++GS  GDSQ+
Sbjct: 298  RYVLADDYGKLYLLMVQQNREGEYHGHQIDILGETSRASTLVYLDGGRVFVGSHQGDSQI 357

Query: 274  IKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQ-----------GQGQVVTCSGAY 322
            I+++ Q       +EVL+ + N+ PI+DF V+D+  +           GQ ++VT SGA+
Sbjct: 358  IQISEQS------IEVLQTFPNIAPILDFTVMDMGNRSSDAPVNEFSSGQARIVTGSGAF 411

Query: 323  KDGSLRIVRNGIGINEQASVELQG--IKGMWSLRSSTDDPFDTFLVVSFISETRILAMNL 380
            KDGSLR VR+G+G+ ++  +   G  I  ++SLRSS        LVVSF+ +TR+   + 
Sbjct: 412  KDGSLRSVRSGVGLEDRGDLGSLGAPISEVFSLRSSAAAKLVDTLVVSFVDQTRVFVFDQ 471

Query: 381  EDELEET-EIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSV 439
            E ++EE  E  GF     TLF  +    ++VQV+S S+ L  + S  + + W +P G ++
Sbjct: 472  EGDVEERDEFRGFSLSDATLFAGNLPDGRVVQVSSTSILLADAESGMITDTWAAPGGANI 531

Query: 440  NVATANASQVLLATGGGHLVYLEI-GDGILTEVKHA-QLEYEISCLDINPIGENPSYSQI 497
                A+ S++LL+  G  LV L++ G+ I  + +       E+SC+ ++      S    
Sbjct: 532  TAVAADRSKLLLSLSGKALVALDLSGESIAVQARREFSSTEEVSCIALSA-----SIPDT 586

Query: 498  AAVGMWTDISVRIFSLPDLNLITKEHLGGE--IIPRSVLLCAF--EGISYLLCALGDGHL 553
               G W D  V   SL +L  IT E +  +   +PRS+ +     +  + L   L DG++
Sbjct: 587  CVAGFWKDGKVAFLSLHNLEAITTERVADDDLAVPRSLSIATILRDQPATLFVGLADGNV 646

Query: 554  LNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKN-TTHVFAASDRPTVIYSSNKKLLYSN 612
            + + +         RK + LGTQ          +   +VFA  + P++IY S  +++YS 
Sbjct: 647  VTYTIESLQKPFAARKSIILGTQQANFAVLPRGDGLQNVFATCEHPSLIYGSEGRMVYSA 706

Query: 613  VNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQS 672
            V  +  + +C FNS ++ +++AIA   EL I  +D+ +  H++ + + E  RR  +  + 
Sbjct: 707  VTAESATSICSFNSNSYGNAIAIASNDELRIAAVDEERTTHVQDLFIHETVRRTAYSAEL 766

Query: 673  RTFAICSLKNQSCA--EESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSD---D 727
            + F +  ++    A  EE + HF +L+D+  F+ + +Y L+  E   S++ C   D   D
Sbjct: 767  KAFGLGCIQRTLTAGQEEVKSHF-KLVDEVAFKELDSYELNEDELVESVIRCKLDDGSGD 825

Query: 728  SNVYYCVGTAYVLPEENEPTKGRILVF-IVEDGKLQLIAEKETKGAVYSLNAFNGKLLAA 786
                + VGTAY+  +++   +GRIL+  + ED +L+L+ E   KGA   L    GK++AA
Sbjct: 826  GAERFAVGTAYLDDQDSNTARGRILILEVTEDRRLKLVTELSVKGACRCLAVCEGKIVAA 885

Query: 787  INQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEE 846
            + + + +Y +         L  +  +      + V   G+ I V DLMKS+SL+ YK   
Sbjct: 886  LIKTVIIYDFEFAASKA-TLTKKASYRTATAPIDVCVTGNVIAVTDLMKSMSLVEYKKGR 944

Query: 847  GA----IEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVV 902
                  + E AR +   W +AV  + D+ YL ++   NL  ++ ++ G ++E+R RL V 
Sbjct: 945  TGMPDTLTEIARHFETLWGTAVANVADNTYLQSDAEGNLIVLQHDTNGFSEEDRRRLRVT 1004

Query: 903  GEYHLGEFVNRFR-------HGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQY 954
             E  LGE VNR R       HG+LV          IP     TV G I + A + P +Q 
Sbjct: 1005 SELLLGEMVNRIRRIDVTPTHGALV----------IPRAFLATVEGSIYLFALIVPGKQD 1054

Query: 955  LFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDA---KNFLDGDLIESFLDLSRTRM 1011
            L + ++Q N+  ++K  G +    +R F N+ +   A     F+DG+LIE FLD  +   
Sbjct: 1055 LLM-RMQNNMASLVKSPGHVEFATFRGFKNQVRDEGANGPSRFVDGELIERFLDCGQDIQ 1113

Query: 1012 DEISKTMNVSVEELCKRVEELTRLH 1036
            +EI + + + +EE    VE L R+H
Sbjct: 1114 EEIIRDLGIELEEARGMVESLRRIH 1138


>gi|451856466|gb|EMD69757.1| hypothetical protein COCSADRAFT_32431 [Cochliobolus sativus ND90Pr]
          Length = 1116

 Score =  479 bits (1233), Expect = e-132,   Method: Compositional matrix adjust.
 Identities = 323/1078 (29%), Positives = 559/1078 (51%), Gaps = 63/1078 (5%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPTD 64
            IYG++  L+  RP     D LF+ T+R+ +  L W+ E  +L T ++   V+D   R + 
Sbjct: 55   IYGKVTMLQKLRPALSPTDHLFVGTDRFMYFTLSWNTEKKQLQTEKSFASVADNAARESQ 114

Query: 65   NGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQ----------LKEAFNIRLEELQVLDI 114
             G+   IDP  R + + +Y+G+  VIP   +G+          L +   +RL E+ V   
Sbjct: 115  TGERCHIDPTGRFMTIEVYEGILTVIPLVQRGKKRKQEADIAHLGDPQPVRLPEMFVRSS 174

Query: 115  KFLYGCA---KPTIVVLYQDNKDARHVKTYEV--ALKDKDFVEGPWSQNNLDNGADLLIP 169
             FL   +   KP + +LY+D      ++  E+  A  + D  EG   ++ L+ G+  LIP
Sbjct: 175  AFLRPRSTDDKPKMALLYEDTHAQVKLRLRELTYAGDEVDLQEGETCKSELELGSSHLIP 234

Query: 170  VPPPLCGVLIIGEETIVYC--SANAFKAIPI-RPSITKAYGRVDADGSRYLLGDHAGLLH 226
            +  P  G++IIGE +I Y    +   +  P+   +I  A+ R+DA   R++L D  G L+
Sbjct: 235  LEEPSHGLVIIGETSIGYFDDESGELQTEPLDEATIFVAWERIDA--QRFVLADDYGRLY 292

Query: 227  L-LVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGS 285
            + +++ + + +V   K++++G+TS AST+ YLD   V++GS  GDSQ+I++     A+ S
Sbjct: 293  MFMLVLNAQGRVQSWKLDVIGQTSRASTLVYLDAGYVFVGSHQGDSQVIRI-----AEKS 347

Query: 286  YVEVLERYVNLGPIVDFCVVDL-ERQGQGQ----------VVTCSGAYKDGSLRIVRNGI 334
             +E+++ + N+ PI+DF ++D+  R G+GQ          +VT SGAY+DGSLR VR+G+
Sbjct: 348  -MEIVQTFSNIAPILDFTIMDMGNRSGEGQTNEYSSGQARIVTGSGAYQDGSLRSVRSGV 406

Query: 335  GINEQASV-ELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEET-EIEGF 392
            G+ +   + E++ I  ++SL+S+    +   L+VSF+ E+R+   + + E+EE  E    
Sbjct: 407  GLEDLGVLGEMEHISELFSLKSTASAQYIDTLLVSFVDESRVFRFDSQGEVEEVDEFVSL 466

Query: 393  CSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLA 452
                 TL   +    ++VQVT+G  R+       + +EW    G ++  A+ N + VL++
Sbjct: 467  ALDETTLAAANIPQGRIVQVTNGRARICDLDGGMITSEWVPTDGKTITAASVNETHVLVS 526

Query: 453  TGGGHLVYLEIGDGI-LTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIF 511
             GG  +V L + +G+ + + K    E +++C+ + P G     S    VG W +  + I 
Sbjct: 527  LGGVTIVSLSMANGLQVVKEKTFGAESQVACVTL-PSGT----SSTCFVGFWNNSQLAIC 581

Query: 512  SLPDLNLITKEHLGGEIIPRSVLLCAF--EGISYLLCALGDGHLLNFLLNMKTGELTDRK 569
            SL  L  +    +  + +PRS+LL     +    L  AL DG+++ +  +    EL+ RK
Sbjct: 582  SLDTLEAVKTVQISDDSVPRSLLLTQIFPDQPPSLFVALADGNVVTYTFDQSNHELSGRK 641

Query: 570  KVSLGTQPITLRTFSSKNTT-HVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAA 628
             + LGT+  T R     +   +VFA  + P++IY+S  +L+YS V  ++ + +CPF+S A
Sbjct: 642  SIVLGTREATFRALPRGDGLFNVFATCEHPSLIYASEGRLVYSAVTAEKATTVCPFDSEA 701

Query: 629  FPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKN-QSCAE 687
            +P S+AIA   +L I  +D  +  H++++ + E  RRI +    + F + ++K      E
Sbjct: 702  YPGSVAIATSQDLRIALVDTERTTHVQTLKVDETVRRIAYSPSLKAFGLGTIKRILKHGE 761

Query: 688  ESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSN---VYYCVGTAYVLPEEN 744
            E  +   +L+D+  F+ + TY L+  E    ++ C   D S      + VGTAYV  +  
Sbjct: 762  EIMLSHFKLVDEIQFKELDTYALNEEELVECVMRCELPDGSGGLAERFVVGTAYVDDQNT 821

Query: 745  EPTKGRILVFIVEDGK-LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGT 803
               +GRI+V  V   + L+L+ E   KG    L    GK++AA+ + I +Y +  R   T
Sbjct: 822  TSERGRIIVLEVTPERILKLVTEVAVKGGCRCLAICQGKIVAALIKTIVVYDFEYRTPST 881

Query: 804  RELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEER----ARDYNAN 859
             +L            + +   G  I + DLMKS+ ++ Y   E  + ++    AR Y   
Sbjct: 882  PDLVKLASFRCSTAPIDLTVNGSLIAIADLMKSLVVVEYTKGETGLPDKLVEVARHYQIT 941

Query: 860  WMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSL 919
            W +AV  +D ++YL ++   NL  + ++  G TDE++ RL +  E  LGE VNR R    
Sbjct: 942  WATAVAEVDTNMYLESDAEGNLVVLYRDPNGVTDEDKRRLNISSEMLLGEMVNRIRR--- 998

Query: 920  VMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQW 979
            +  L  +D   +P    GTV G I +   +  +    L  LQ+NL  ++   G +   ++
Sbjct: 999  IDVLTAADAVIVPKAFVGTVEGSIYLFGLISPDYQNLLMTLQSNLGSLVAAPGDMQFAKF 1058

Query: 980  RSFNNE-KKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRLH 1036
            R+F N  ++  +   F+DG+LIE FLD+S     +    + V +EE+   +E L RLH
Sbjct: 1059 RAFKNSVREEEEPMRFVDGELIERFLDVSEEVQSKAIDGLGVELEEVKGLIEALRRLH 1116


>gi|330935579|ref|XP_003305038.1| hypothetical protein PTT_17772 [Pyrenophora teres f. teres 0-1]
 gi|311318228|gb|EFQ86975.1| hypothetical protein PTT_17772 [Pyrenophora teres f. teres 0-1]
          Length = 1115

 Score =  479 bits (1232), Expect = e-132,   Method: Compositional matrix adjust.
 Identities = 336/1083 (31%), Positives = 565/1083 (52%), Gaps = 74/1083 (6%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPTD 64
            IYG++  L+  RP   + D LF+ T+R+ +  L WDAE  +L T ++   V+D   R + 
Sbjct: 55   IYGKVTMLQKLRPALSSTDHLFVGTDRFMYFTLSWDAEQGQLQTEKSFASVADNAARESQ 114

Query: 65   NGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQ----------LKEAFNIRLEELQVLDI 114
             G+   IDP  R + + +Y+G+  VIP   +G+          L E   +RL E+ +   
Sbjct: 115  TGERCHIDPLGRFMTVEVYEGIITVIPLVQRGKKRKQEADIAHLGEPQPVRLPEMFIRSS 174

Query: 115  KFLYGCA---KPTIVVLYQDNKDARHVKTYEV--ALKDKDFVEGPWSQNNLDNGADLLIP 169
             FL   +   KP + +LY+D      +K  E+  A  + D  EG   ++ L+ G+  LIP
Sbjct: 175  AFLRPRSPDEKPKMALLYEDTHSQVKLKLRELTYAGDEVDLQEGETCKSELELGSSHLIP 234

Query: 170  VPPPLCGVLIIGEETIVYC--SANAFKAIPI-RPSITKAYGRVDADGSRYLLGDHAGLLH 226
            V  P  G+++IGE +I Y    +    + P+   +I  A+ R+DA   R++L D  G L+
Sbjct: 235  VEKPSHGLVVIGETSIGYYDDESGEIHSEPLDEATIFVAWERIDA--QRFVLADDYGRLY 292

Query: 227  L-LVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGS 285
            + +++   + KV   K++++G+TS AST+ YLD   VY+GS  GDSQ+IK+     A+ S
Sbjct: 293  MFMLVLSAQGKVQSWKLDIIGQTSRASTLVYLDAGYVYVGSHQGDSQVIKI-----AEKS 347

Query: 286  YVEVLERYVNLGPIVDFCVVDLERQ-----------GQGQVVTCSGAYKDGSLRIVRNGI 334
             +E+++ + N+ PI+DF ++D+  +           GQ ++VT SGAY+DGSLR VR+G+
Sbjct: 348  -MEIVQTFSNIAPILDFTIMDMGNRSAEGQTNEYSSGQARIVTASGAYQDGSLRSVRSGV 406

Query: 335  GINEQASV-ELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEET-EIEGF 392
            G+ +   + E++ I  ++SL+S+    +   L+VSF++E+RI   + + E+EE  E    
Sbjct: 407  GLEDLGVLSEMEHISDLFSLKSTASAQYADTLLVSFVNESRIFRFDPQGEVEEVDEFASL 466

Query: 393  CSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLA 452
                 TL   +    ++VQVT+G  R+       + +EW    G ++  A+ N S VL++
Sbjct: 467  ALDETTLAAANISQGRIVQVTNGRARICDLDGGMIISEWLPMGGPTITAASINESHVLVS 526

Query: 453  TGGGHLVYLEIGDGI-LTEVKHAQLEYEISCLDINPIGENPS-YSQIAAVGMWTDISVRI 510
             GG  +V + + DG+ + + K    E +++C+ +      PS  S I  VG W +  + I
Sbjct: 527  LGGVTVVSISMADGLQVVKEKTFNAESQVACIAL------PSDSSSICFVGFWNNSGLAI 580

Query: 511  FSLPDLNLITKEHLGGEIIPRSVLLCAF--EGISYLLCALGDGHLLNFLLNMKTGELTDR 568
             SL  L  +    +  + +PRS+LL          L  AL DG+++ +  +  T EL+ R
Sbjct: 581  CSLDTLEPVKTVQISDDSVPRSLLLTQIFPAQPPSLFVALADGNVVTYAFDPSTHELSGR 640

Query: 569  KKVSLGTQPITLRTFSSKNTT-HVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSA 627
            K + LGT+  T R     N   +VFA  + P++IYSS  +L+YS V  ++ + +CPF+S 
Sbjct: 641  KSIVLGTREATFRALPRGNGLFNVFATCEHPSLIYSSEGRLVYSAVTAEKATTVCPFDSE 700

Query: 628  AFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAE 687
            A+P S+AIA   +L I  +D  +  H++++ + E  RRI +    + F + ++K     +
Sbjct: 701  AYPGSVAIATSEDLRIALVDTERTTHVQTLKVDETVRRIAYSPSLKAFGLGTIKRILKND 760

Query: 688  ESEM--HFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVY---YCVGTAYVLPE 742
            E  M  HF +L+D+  F+ + +Y L+  E    ++ C   D S      + +GTAY+  +
Sbjct: 761  EELMASHF-KLVDEIQFKELDSYALNEEELIECVMRCELPDGSGGLAECFVIGTAYLDDQ 819

Query: 743  ENEPTKGRILVFIVEDGK-LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDD 801
                 +GRIL+  V   + L+L+ E   KG    L    GK++AA+ + I +Y  +    
Sbjct: 820  NTTSERGRILILEVTPERILKLVTEIAVKGGCRCLATCEGKIVAALIKTIVIYD-VEYPT 878

Query: 802  GTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEER----ARDYN 857
             T  L            + +   G  I + DLMKS+ ++ Y   E  + ++    AR Y 
Sbjct: 879  QTPFLTKLATFRCSTAPIDITVNGSKIAIADLMKSLVVVEYTKGEAGLPDKLVEVARHYQ 938

Query: 858  ANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHG 917
              W +AV  +D ++YL ++   NL  + ++  G TD+++ RL V  E  LGE VNR R  
Sbjct: 939  ITWATAVAEVDTNMYLESDAEGNLMVLYRDPNGVTDDDKRRLNVSSEMLLGEMVNRIRR- 997

Query: 918  SLVMRLPDSDVGQIPTVIFGTVNGVI---GVIASLPHEQYLFLEKLQTNLRKVIKGVGGL 974
              +  L  SD   IP    GTV G I   G+I+  P  Q L +  LQ+NL  ++   G +
Sbjct: 998  --IDVLTASDAVVIPRAFVGTVEGSIYLFGLIS--PAHQNLLM-TLQSNLGALVPAPGDM 1052

Query: 975  NHEQWRSFNNE-KKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELT 1033
            +  ++R+F N  ++  +   F+DG+ +E FLD S     +  + + V +EE+   VE L 
Sbjct: 1053 DFAKFRAFKNGVREEEEPMRFVDGEFVERFLDQSEEVQTKAIEGLGVGLEEVRGLVEGLR 1112

Query: 1034 RLH 1036
            RLH
Sbjct: 1113 RLH 1115


>gi|189205943|ref|XP_001939306.1| DNA damage-binding protein 1 [Pyrenophora tritici-repentis Pt-1C-BFP]
 gi|187975399|gb|EDU42025.1| DNA damage-binding protein 1 [Pyrenophora tritici-repentis Pt-1C-BFP]
          Length = 1115

 Score =  477 bits (1227), Expect = e-131,   Method: Compositional matrix adjust.
 Identities = 336/1083 (31%), Positives = 569/1083 (52%), Gaps = 74/1083 (6%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPTD 64
            IYG++  L+  RP   + D LF+ T+R+ +  L WDAE  +L T ++   V+D   R + 
Sbjct: 55   IYGKVTMLQKLRPALSSTDHLFVGTDRFMYFTLSWDAEKRQLQTEKSFASVADNAARESQ 114

Query: 65   NGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQ----------LKEAFNIRLEELQVLDI 114
             G+   IDP  R + + +Y+G+  VIP   +G+          L E   +RL E+ +   
Sbjct: 115  TGERCHIDPSGRFMTVEVYEGIITVIPLVQRGKKRKQEADIAHLGEPQPVRLPEMFIRSS 174

Query: 115  KFLYGCA---KPTIVVLYQDNKDARHVKTYEV--ALKDKDFVEGPWSQNNLDNGADLLIP 169
             FL   +   KP + +LY+D      +K  E+  A  + D  EG   +++L+ G+  LIP
Sbjct: 175  AFLRPRSPDEKPKMALLYEDTHSQVKLKLRELTYAGDEVDLQEGEICESDLELGSSHLIP 234

Query: 170  VPPPLCGVLIIGEETIVYC--SANAFKAIPI-RPSITKAYGRVDADGSRYLLGDHAGLLH 226
            V  P  G+++IGE +I Y    +    + P+   +I  A+ R+DA   R++L D  G L+
Sbjct: 235  VEKPSHGLVVIGETSIGYYDDESGEIHSEPLDEATIFVAWERIDA--QRFVLADDYGRLY 292

Query: 227  L-LVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGS 285
            + +++   + KV   K++++G+TS AST+ YLD   VY+GS  GDSQ+IK+     A+ S
Sbjct: 293  MFMLVLSAQGKVQSWKLDIIGQTSRASTLVYLDAGYVYVGSHQGDSQVIKI-----AEKS 347

Query: 286  YVEVLERYVNLGPIVDFCVVDL-ERQGQGQ----------VVTCSGAYKDGSLRIVRNGI 334
             +E+++ + N+ PI+DF ++D+  R G+GQ          +VT SGAY+DGSLR VR+G+
Sbjct: 348  -MEIVQTFSNIAPILDFTIMDMGNRSGEGQTNEYSSGQARIVTASGAYQDGSLRSVRSGV 406

Query: 335  GINEQASV-ELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEET-EIEGF 392
            G+ +   + E++ I  ++SL+S+    +   L+VSF++E+RI   + + E+EE  E    
Sbjct: 407  GLEDLGVLGEMEHISDLFSLKSTASAQYADTLLVSFVNESRIFRFDPQGEVEEVDEFASL 466

Query: 393  CSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLA 452
                 TL   +    +++QVT+G  R+       + +EW    G ++  A+ N S VL++
Sbjct: 467  ALDETTLAAANISQGRIIQVTNGRARICDLDGGMITSEWLPMGGPTITAASINESHVLVS 526

Query: 453  TGGGHLVYLEIGDGI-LTEVKHAQLEYEISCLDINPIGENPS-YSQIAAVGMWTDISVRI 510
             GG  +V + + +G+ + + K    E +++C+ +      PS  S I  VG W +  + I
Sbjct: 527  LGGVTVVSISMANGLQVVKEKTFGAESQVACIAL------PSDSSSICFVGFWNNSGLAI 580

Query: 511  FSLPDLNLITKEHLGGEIIPRSVLLCA-FEG-ISYLLCALGDGHLLNFLLNMKTGELTDR 568
             SL  L  +    +  + +PRS+LL   F G    L  AL DG+++ +  +  T EL+ R
Sbjct: 581  CSLDTLEPVKTVQISDDSVPRSLLLTQIFPGQPPSLFVALADGNVVTYAFDPSTHELSGR 640

Query: 569  KKVSLGTQPITLRTFSSKNTT-HVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSA 627
            K + LGT+  T R     +   +VFA  + P++IYSS  +L+YS V  ++ + +CPF+S 
Sbjct: 641  KSIVLGTREATFRALPRGDGLFNVFATCEHPSLIYSSEGRLVYSAVTAEKATTVCPFDSE 700

Query: 628  AFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKN--QSC 685
            A+P S+AIA   +L I  +D  +  H++++ + E  RRI +    + F + ++K   ++ 
Sbjct: 701  AYPGSVAIATPEDLRIALVDTERTTHVQTLKVDETVRRIAYSPSLKAFGLGTIKRILKNG 760

Query: 686  AEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSN---VYYCVGTAYVLPE 742
             E    HF +L+D+  F+ + +Y L+  E    ++ C   D S      + +GTAY+  +
Sbjct: 761  EEVMASHF-KLVDEIQFKELDSYALNEEELIECVMRCELPDGSGGVAERFVIGTAYLDDQ 819

Query: 743  ENEPTKGRILVFIVEDGK-LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDD 801
                 +GRIL+  V   + L+L+ E   KG    L    GK++AA+ + I +Y  +    
Sbjct: 820  STTSERGRILILEVTPERILKLVMEIAVKGGCRCLATCEGKIVAALIKTIVIYD-VEYPT 878

Query: 802  GTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEER----ARDYN 857
             T  L            + +   G  IV+ DLMKS+ ++ Y   E  + ++    AR Y 
Sbjct: 879  QTPFLTKLATFRCSTAPIDITVNGPKIVIADLMKSLVVVEYTKGEAGLPDKLVEVARHYQ 938

Query: 858  ANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHG 917
              W +AV  +D ++YL ++   NL  + ++  G TD+++ RL V  E  LGE VNR R  
Sbjct: 939  ITWATAVAEVDTNMYLESDAEGNLMVLYRDPNGVTDDDKRRLNVSSEMLLGEMVNRIRR- 997

Query: 918  SLVMRLPDSDVGQIPTVIFGTVNGVI---GVIASLPHEQYLFLEKLQTNLRKVIKGVGGL 974
              +  L  SD   IP    GTV G I   G+I+  P  Q L +  LQ+NL  +I   G +
Sbjct: 998  --IDVLTASDAVVIPRAFVGTVEGSIYLFGLIS--PAHQNLLM-TLQSNLGALIPAPGDM 1052

Query: 975  NHEQWRSFNNE-KKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELT 1033
            +  ++R+F N  ++  +   F+DG+ +E FLD       +  + + V +EE+   +E L 
Sbjct: 1053 DFAKFRAFKNGVRQEEEPMRFVDGEFVERFLDQGEEVQVKAIEGLGVGLEEVRGLIEGLR 1112

Query: 1034 RLH 1036
            RLH
Sbjct: 1113 RLH 1115


>gi|449295711|gb|EMC91732.1| hypothetical protein BAUCODRAFT_116696 [Baudoinia compniacensis UAMH
            10762]
          Length = 1148

 Score =  476 bits (1226), Expect = e-131,   Method: Compositional matrix adjust.
 Identities = 327/1109 (29%), Positives = 559/1109 (50%), Gaps = 96/1109 (8%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPTD 64
            +YG++  L   RP     D LF+ T+ Y +  L WDA + +L T ++  D++++  R + 
Sbjct: 58   LYGKVTLLHKLRPATSQTDHLFVGTDHYHYFTLSWDASTKQLRTEKSYVDIAEKSARDSQ 117

Query: 65   NGQIGIIDPDCRLIGLHLYDGLFKVIPF-----------DNK-GQLKEAFNIRLEELQVL 112
             G    IDP  R + L  Y+G+  V+P            DN+ G+L++   +R+ EL V 
Sbjct: 118  TGDRVHIDPTSRFLSLECYEGVINVLPIAHAGKGKRKAADNEIGELQDPIPVRIPELFVR 177

Query: 113  DIKFLYG------CAKPTIVVLYQDNKDARHVKTYEVALKDK----------DFVEGPWS 156
               F++        A P + VL++D+ +   +K  E+               +  +G  +
Sbjct: 178  STCFVHKRQAGSKLANPELAVLWEDSTNKVRLKVRELEFTPSLRPAEEPPTAELEKGKDA 237

Query: 157  QNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIRP----SITKAYGRVDAD 212
            Q  ++ GA  LIP+PPP+ G+L++GE  I Y     ++    +P    +I  A+  VD  
Sbjct: 238  QGEIELGASHLIPLPPPMYGMLVVGETRIAYVDEWEYRITDTQPLDEATIFVAWCAVDEQ 297

Query: 213  GSRYLLGDHAGLLHLL-VITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDS 271
              RY+L D  G L+LL V+ +   +  G ++++LG+TS A+T+ YLD   +++GS  GDS
Sbjct: 298  --RYVLADDYGKLYLLFVLQNNTGEYAGHRLDILGQTSRANTLVYLDAGRIFVGSHQGDS 355

Query: 272  QLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQ-----------GQGQVVTCSG 320
            Q+I+++ Q       +EV++ + N+ PI+DF ++D+  +           GQ ++VT SG
Sbjct: 356  QVIQISEQS------MEVVQTFANIAPILDFTIMDMGNRSSDAPVNEFSSGQARIVTGSG 409

Query: 321  AYKDGSLRIVRNGIGINEQASVELQG--IKGMWSLRSSTDDPFDTFLVVSFISETRILAM 378
            AYKDGSLR VR+G+G+ +  S+   G  +  M+SLRSS +  F   LVV F+S TR+   
Sbjct: 410  AYKDGSLRSVRSGVGLEDVGSIGEMGAPVSAMFSLRSSPESHFVDTLVVGFVSYTRVFRF 469

Query: 379  NLEDELEETE-IEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGY 437
            +++ E+EE + + GF     +L+  +    ++VQVT  +V +  + +R + + W  P G 
Sbjct: 470  DVDGEVEEVDHLGGFELAAASLYAGNTSDGRIVQVTGSAVIVSHTETRTITSSWSLPDGR 529

Query: 438  SVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQL------EYEISCLDINPIGEN 491
            S+    A    +L++ GG  LV L++ + + +E   A+       E ++SC+ +     +
Sbjct: 530  SITAVAAEGDSLLVSIGGAELVVLDLSN-VSSERLEARTRRTFESEEQVSCIAL-----S 583

Query: 492  PSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEI----IPRSVLLCAF--EGISYLL 545
             +   +  VG W +  V   SL DL  I  E +        +PRSV+L     +    L 
Sbjct: 584  KAVRDVCVVGFWQESRVAFLSLHDLQPIATERVADSFDTSAVPRSVVLANILQDAPPTLF 643

Query: 546  CALGDGHLLNFLLNMKTGELTDRKKVSLGTQP--ITLRTFSSKNTTHVFAASDRPTVIYS 603
              L DG+++ + +       T RK   LGTQ    TL         +VFA  + P++IY 
Sbjct: 644  VGLADGNVVTYTVQSPQQPFTSRKSTILGTQQANFTLLPRGDGVLDNVFATCEHPSLIYG 703

Query: 604  SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIA-KEGELTIGTIDDIQKLHIRSIPLGEH 662
               + +YS V  +    +C F+S A+  ++AIA +EGEL +  +D+ +  H++++ +GE 
Sbjct: 704  QEGRTVYSAVTAETAQSICSFDSEAYSGAIAIATEEGELKLAMVDEERTTHVQTLRVGET 763

Query: 663  PRRICHQEQSRTFAICSLKN--QSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSIL 720
             RRI +  + + F + ++K   ++  EE    F +L+D+  F+ + T  L+  E    ++
Sbjct: 764  VRRIAYSTELKAFGLGTIKRVLRAGVEEVTSSF-KLVDEVAFQELHTVALNEDELVECVM 822

Query: 721  SCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-IVEDGKLQLIAEKETKGAVY 774
             C   D S        + VGTAY+     + TKGRILV  + E+ +L+++AE   KGA  
Sbjct: 823  RCQLDDGSGTGETAERFVVGTAYLDDAPQQQTKGRILVLEVTEERRLKVVAELGLKGACR 882

Query: 775  SLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLM 834
             L    G+++AA+ + + +Y    +      L  +  +      + +   G  I V DLM
Sbjct: 883  CLAVVLGRIVAALVKTVVIYALEYQTPSHPFLVKKAAYRTSTAPIDICVTGSTIAVTDLM 942

Query: 835  KSISLLIYKHEEGAI----EERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEG 890
            KS+SL+ YK   G +     E AR Y   W +A+  + ++ YL A+   NL  ++    G
Sbjct: 943  KSVSLVSYKPGRGGVPDTLSEIARHYETLWGTAIANVAENTYLEADAEGNLVVLQHEVNG 1002

Query: 891  ATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLP 950
             +DE+R RL  V E  LGE VNR R  S+    P +    +P     TV G I + A + 
Sbjct: 1003 YSDEDRRRLRPVSEMLLGEMVNRIRSISV---QPTATAVVVPRAFLATVEGSIYLFALIS 1059

Query: 951  HEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAK---NFLDGDLIESFLDLS 1007
              +   L +LQ  L + +K  G +   +WR F ++ + +  +    F+DG+L+E +L+  
Sbjct: 1060 PGKQDLLMRLQALLAERVKSPGHVPFAKWRGFRSQVRDMGGEGPTRFVDGELVERYLEAP 1119

Query: 1008 RTRMDEISKTMNVSVEELCKRVEELTRLH 1036
                 +++  +   VEEL   VE L R+H
Sbjct: 1120 VEVQVDVASELGREVEELRGMVEGLRRMH 1148


>gi|398391687|ref|XP_003849303.1| hypothetical protein MYCGRDRAFT_87400 [Zymoseptoria tritici IPO323]
 gi|339469180|gb|EGP84279.1| hypothetical protein MYCGRDRAFT_87400 [Zymoseptoria tritici IPO323]
          Length = 1143

 Score =  473 bits (1218), Expect = e-130,   Method: Compositional matrix adjust.
 Identities = 326/1101 (29%), Positives = 547/1101 (49%), Gaps = 87/1101 (7%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPTD 64
            +YG+   L   RP     D LF+ T+RY +  L WDAE  EL T ++  D++++  R + 
Sbjct: 60   LYGKATLLHKLRPATSPTDHLFVGTDRYNYLTLSWDAELKELKTEKSYVDIAEKAARDSQ 119

Query: 65   NGQIGIIDPDCRLIGLHLYDGLFKVIPF------------DNKGQLKEAFNIRLEELQVL 112
             G    IDP  R + L  Y+G+  ++P             D  G+L +   +R+ EL V 
Sbjct: 120  TGDRVHIDPTSRFMTLECYEGVVNILPIAHAGKGKRKAADDEIGELADPIQVRIPELFVR 179

Query: 113  DIKFLY------GCAKPTIVVLYQDNKDARHVKTYEVAL----------KDKDFVEGPWS 156
               FL+        AKP   ++Y+D+++   +K  E+               D  EG   
Sbjct: 180  STCFLHRRQAGAKHAKPAFALMYEDSQNKVRIKVRELEYGQSLRPSEEPSTADLDEGKEV 239

Query: 157  QNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVD---ADG 213
               L+ GA  LIP+PPP+ G+L++GE +I Y      +    +P + +A   VD    D 
Sbjct: 240  SGQLELGASHLIPLPPPMYGMLVVGETSISYVEEFEHQVKETQP-LEEATVFVDWCQIDD 298

Query: 214  SRYLLGDHAGLLHLLVITHEKE-KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQ 272
             RY L D  G L+LL+I      + +  +I++LGETS AST+ YLD   V++GS  GDSQ
Sbjct: 299  QRYALADDYGKLYLLMIQQTSNGEYSSHQIDILGETSRASTLVYLDEGRVFVGSHQGDSQ 358

Query: 273  LIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQ-----------GQGQVVTCSGA 321
            +I++  Q       +EVL+ + N+ PI+DF V+D+  +           GQ ++VT SGA
Sbjct: 359  IIQILPQK------IEVLQTFPNIAPILDFAVMDMGNRSSDAPVNEFSSGQARIVTGSGA 412

Query: 322  YKDGSLRIVRNGIGINEQASV-ELQG-IKGMWSLRSSTDDPFDTFLVVSFISETRILAMN 379
            +KDGSLR VR+G+G+ ++  +  L   I  +++LRS         LVVSF+S TR+    
Sbjct: 413  FKDGSLRSVRSGVGLEDRGDLGSLDAPISSLFALRSRPAPAVVNTLVVSFVSHTRVFVFT 472

Query: 380  LEDELEETE-IEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYS 438
             E ++EE E   GF     TL   +    +++Q+T+ SV L       + + W+SP G S
Sbjct: 473  AEGDVEEQEQYRGFELGESTLHAGNLADGRVIQITNSSVVLADQEGDMVTDRWQSPSGTS 532

Query: 439  VNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEY----EISCLDINPIGENPSY 494
            +   +AN + +L++  G  L+ L++    +    HAQ E+    ++SC+ ++P     S 
Sbjct: 533  ITAVSANGNNILVSLKGASLLVLDVSGSTIK--VHAQREFGSDEQVSCIALSP-----SI 585

Query: 495  SQIAAVGMWTDISVRIFSLPDLNLITKEHLGGE---IIPRSVLLCA-FEG-ISYLLCALG 549
                  G W D  V    L  L     E +  E    +PRS+ +    EG  + L   L 
Sbjct: 586  PTQCVAGFWKDGKVAFLDLETLKTQASETVTEEDSLAVPRSLTVAKILEGQPATLFVGLA 645

Query: 550  DGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKN-TTHVFAASDRPTVIYSSNKKL 608
            DG+++ + +   +   T RK + LGTQ          N   +VFA  + P++IY S  ++
Sbjct: 646  DGNVVTYSIQSASNPFTARKSIILGTQQANFAVLPRDNGLENVFATCEHPSLIYGSEGRI 705

Query: 609  LYSNVNLKEVSHMCPFNS-AAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRIC 667
            ++S V  +  + +C F+S   + +++AIA   EL I  +D+ +  H++ + L +  RRI 
Sbjct: 706  VFSAVTAENATCICSFDSFEPYGNAIAIASNDELKIAVVDEERTTHVQDLFLNQTVRRIA 765

Query: 668  HQEQSRTFAI-CSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSD 726
            +    + F + C  +  S   E  +   +L+D+  F+ + +Y L+  E   S++ C   D
Sbjct: 766  YSADMKAFGLGCIRRTLSAGREEVVSCFKLVDEIAFKELDSYQLNEDELVESVIRCKLDD 825

Query: 727  DSN---VYYCVGTAYVLPEENEPTKGRILVF-IVEDGKLQLIAEKETKGAVYSLNAFNGK 782
             S      + +GTAY+  ++    KGRILV  + ED +L+L+ E   +GA   L   +G+
Sbjct: 826  GSGGDAERFVIGTAYLDDQDASNAKGRILVLEVTEDRRLKLVTEISVRGACRCLAVSHGR 885

Query: 783  LLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIY 842
            ++AA+ + + +Y +      +  +  +  +      + +   GD I V DLMKS+SL+ +
Sbjct: 886  IVAALIKTVIIYSFEYETPSSPAMVKKAAYRTSTAPIDMCVTGDIIAVTDLMKSMSLVQH 945

Query: 843  KHEEGA----IEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGR 898
               +      + E AR ++  W +AV  +D++IYL ++   NL  +  + +G ++E+R R
Sbjct: 946  TLGQAGGPDNLTEVARHFDTLWGTAVANVDENIYLESDAEGNLVVLEHDVKGFSEEDRRR 1005

Query: 899  LEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLE 958
            L V  E  LGE VNR R   +    P  +   IP     TV G I + A +   +   L 
Sbjct: 1006 LRVTSEILLGEMVNRIRRIDVS---PTPNATVIPRAFLATVEGSIYLFALIAEGKQDLLI 1062

Query: 959  KLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAK---NFLDGDLIESFLDLSRTRMDEIS 1015
            ++Q  + ++++  G +   ++R F  + + +  +    F+DG+LIE FLD       E++
Sbjct: 1063 RMQNKMAEMVQSPGHVPFAKFRGFKTQVRDMGEEGPSRFVDGELIERFLDCDEDVQAEVA 1122

Query: 1016 KTMNVSVEELCKRVEELTRLH 1036
            K +   VEEL   VE L R+H
Sbjct: 1123 KELGGDVEELKIMVEGLKRIH 1143


>gi|242803623|ref|XP_002484212.1| UV-damaged DNA binding protein, putative [Talaromyces stipitatus ATCC
            10500]
 gi|218717557|gb|EED16978.1| UV-damaged DNA binding protein, putative [Talaromyces stipitatus ATCC
            10500]
          Length = 1140

 Score =  459 bits (1182), Expect = e-126,   Method: Compositional matrix adjust.
 Identities = 341/1104 (30%), Positives = 549/1104 (49%), Gaps = 91/1104 (8%)

Query: 6    IYGRIATL-ELFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPT 63
            +Y +I+ L  L  P     D L + T++  +  L+WD+E   + T R+  D++D+  RP 
Sbjct: 55   LYAKISMLARLPTPTNSTTDHLLVGTDQNTYFTLKWDSEKQRVRTERSYVDLADKASRPC 114

Query: 64   DNGQIGIIDPDCRLIGLHLYDGLFKVIP------------------FDNK-------GQL 98
             NG   +IDP  R + L +++G+  VIP                  + N        G+L
Sbjct: 115  QNGDRCLIDPSGRFMTLEMFEGIITVIPIIQPHKKRGKPPVLKTSHYSNPDEPVPQIGEL 174

Query: 99   KEAFNIRLEELQVLDIKFLY--GCAKPTIVVLYQDNKDARHVKTYEV---ALKDKDFVEG 153
             E    R++EL V    FL+    A P + +L++DN+    +K  E+   A  +  F E 
Sbjct: 175  GEPMPTRIDELMVRSSAFLHVESKAAPRLALLHEDNQRKVRLKIRELHFEASTEVVFQET 234

Query: 154  PWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIRP--SITKAYGRVDA 211
                  LD GA  LIPVP PL G+L++GE  I Y      + I  RP    T     V  
Sbjct: 235  EDFTEELDLGASHLIPVPAPLGGLLVLGETCIKYIDDANNETIS-RPLDEATIFVAWVQV 293

Query: 212  DGSRYLLGDHAG-LLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGD 270
            DG R+LL D  G L  L+++   + +V G KI+ LG  S AS + YL   + +IGS  GD
Sbjct: 294  DGQRWLLADDYGRLFFLMLVLDSRNEVEGWKIDYLGSASRASVLIYLGAGMTFIGSHQGD 353

Query: 271  SQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQ-----------GQGQVVTCS 319
            SQ+I++     ++GS+ E+++   N+ PI+DF ++DL  +           GQ ++VT S
Sbjct: 354  SQVIRI-----SEGSF-EIIQTISNIAPILDFTIMDLGTREGDNYTHEFSSGQARIVTGS 407

Query: 320  GAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAM 378
            GA+ DG+LR VR+G+G+ E   + E++ I  MW+L+ S+ D F   LVV+F++ETRI   
Sbjct: 408  GAFNDGTLRSVRSGVGMEELGVLGEMEHITDMWALQVSSTDEFSDTLVVTFVNETRIFQF 467

Query: 379  NLEDELEE-TEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGY 437
            + + E+EE  E  G      TL  ++    +++ VT   V +  + S  + ++W SP   
Sbjct: 468  SSDGEVEELDEFLGLNLAENTLLSNNLPGGRIIHVTERGVSIADTDSGMVISKW-SPDEQ 526

Query: 438  SVNVATANASQVLLATGGGHLVYLEI-GDGILTEVKHAQLEYEISCLDINPIGENPSYSQ 496
            ++  A  N  ++++ TGG  +V L+I GD  +   K    + ++S + I P     + +Q
Sbjct: 527  TITSAACNDERLVVVTGGQVIVVLDITGDLKVLSQKDFDKDNQVSGVTIPP-----ASTQ 581

Query: 497  IAAVGMWTDISVRIFSLPDLNLITKEHLG--GEIIPRSVLLCAF--EGISYLLCALGDGH 552
                       V + S  D   +  + LG   E  PR+VLL     +  S L  ++ DG 
Sbjct: 582  AFIAAFPQKAQVAVLSFQDFKELHSQSLGIASEAFPRAVLLAEILEDSPSTLFVSMADGS 641

Query: 553  LLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTH-VFAASDRPTVIYSSNKKLLYS 611
            ++ F  +     LT + K+ LG++  T +     N  + VFA  + P++IY S  +++YS
Sbjct: 642  VVTFFYDSDNHSLTSKNKLILGSEQPTFKKLPRGNGLYNVFATCEHPSLIYGSEGRIIYS 701

Query: 612  NVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQ 671
             VN +  S +C FN+ A+PD++A+A   +L I  +D  +   I+++P+    RR+ +   
Sbjct: 702  AVNSEGASRVCHFNAEAYPDAIAVATSKDLKIALVDKERTTQIQTLPIEATVRRVAYSPT 761

Query: 672  SRTFAICSLKN--QSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSN 729
             + F I ++K   Q  AE  E  FV L D+  F  + ++ L   E   S++       S+
Sbjct: 762  EKAFGIGTIKRRLQDGAEIVESQFV-LADEIMFRKLDSFDLKPDELVESVIRAQMVVGSD 820

Query: 730  VY--------YCVGTAYVLPEENEPTKGRILVFIVEDG-KLQLIAEKETKGAVYSLNAFN 780
             Y        + VGTAY+  E  E  +GRIL+F V+   KL L  E   KGA  +L    
Sbjct: 821  AYNKPIYKDRFIVGTAYLDDETAESIRGRILLFEVDSNRKLSLFLEHPVKGACRALAMMG 880

Query: 781  GKLLAAINQKIQLYKWMLRDD-GTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISL 839
             K++AA+ + + ++    +   G   L+    +      + +      IVV DLMKSIS 
Sbjct: 881  NKIVAALVKTVVIFDVERKSQLGKHALKKVAAYRTSTAPVDIAVTDSTIVVADLMKSIS- 939

Query: 840  LIYKHEEGAI----EERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEE 895
            ++  H+  A+    +E AR +   W +AV  +  + +L ++   NL  +R+N +G T+E+
Sbjct: 940  IVESHKTDALTVEAKEVARHFATVWTTAVADIGSNQWLVSDAEGNLIVLRRNVDGVTEED 999

Query: 896  RGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYL 955
            R RLEV  E  LGE VNR R  ++   L  S V   P    GTV G I + A +  E   
Sbjct: 1000 RRRLEVTSELLLGEMVNRIRPVNI---LQTSTVAVNPKAFLGTVEGSIYLFALINPEHQD 1056

Query: 956  FLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAK-NFLDGDLIESFLDLSRTRMDEI 1014
            FL +LQT +   +   G +   ++R+F +  +  D    F+DG+LIE FLD  R   +EI
Sbjct: 1057 FLMRLQTAITAYVDSPGYMPFSKFRAFRSSVREGDEPFRFVDGELIERFLDCDRPVQEEI 1116

Query: 1015 SKTM--NVSVEELCKRVEELTRLH 1036
               +     +E + K +E L RLH
Sbjct: 1117 LGVVGSGYDLESVQKMIEALRRLH 1140


>gi|453081643|gb|EMF09692.1| DNA damage-binding protein 1 [Mycosphaerella populorum SO2202]
          Length = 1151

 Score =  457 bits (1175), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 318/1090 (29%), Positives = 555/1090 (50%), Gaps = 91/1090 (8%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPTD 64
            IYG++  L+  RP     D LF+ T+RY +  L WDA++  L T ++M D++D+  R + 
Sbjct: 60   IYGKVTMLQKLRPATSPTDHLFVGTDRYNYFTLSWDADTRRLKTEKSMVDIADKSARDSQ 119

Query: 65   NGQIGIIDPDCRLIGLHLYDGLFKVIPF-----------DNK-GQLKEAFNIRLEELQVL 112
             G    IDP  R + L  Y+G+  V+P            DN+ G+L +   +R+ EL V 
Sbjct: 120  TGDRVHIDPTARFMTLECYEGVVNVLPIAHAGKGKRRAADNEIGELGDPIPVRIPELYVR 179

Query: 113  DIKFLYG------CAKPTIVVLYQDNKDARHVKT----YEVALKDKD------FVEGPWS 156
               FL+        A P   VL++D+++   +K     Y+ +L+  +         G   
Sbjct: 180  SSCFLHKRHGGTKIADPVFAVLHEDSQNMPRIKIRELEYQPSLRPNEEPATAELERGQEV 239

Query: 157  QNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIRP----SITKAYGRVDAD 212
            Q  ++ GA  LIP+P P  G LIIGE +I Y     ++ +  +P    +I  A+ ++D  
Sbjct: 240  QGKIEMGASHLIPLPAPTYGFLIIGETSISYVDEWKYEIVDTQPLDEATIFVAWCQIDE- 298

Query: 213  GSRYLLGDHAGLLHLLVITHEKE--KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGD 270
              R++L D  G L+LL++ H+K   +    +I++LGETS AST+ YLD   V++GS  GD
Sbjct: 299  -QRFVLADDYGKLYLLMV-HQKADGEYQSQQIDVLGETSRASTLVYLDEGRVFVGSHQGD 356

Query: 271  SQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQ-----------GQGQVVTCS 319
            SQ+I++  Q       +EVL+ + N+ PI+DF V+D+  +           GQ ++VT S
Sbjct: 357  SQIIQILPQ------RIEVLQTFSNIAPILDFTVMDMGNRSADAPVNEFSSGQARIVTGS 410

Query: 320  GAYKDGSLRIVRNGIGINEQASVELQG--IKGMWSLRSSTDDPFDTFLVVSFISETRILA 377
            GA+KDGSLR VR+G+G+ ++ S+   G  I  ++SLRS         L+ SF+S +  + 
Sbjct: 411  GAFKDGSLRSVRSGVGLEDKGSLGDLGEPISAVFSLRSGPGVQAVDTLIASFVSHSSAII 470

Query: 378  MNLEDELE-ETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPG 436
               + ++E   E  GF     TL+  +    + VQVTS +V L  +    + + W++P G
Sbjct: 471  FGSDGDIEARDEFRGFDLTQSTLYAGELPNGRAVQVTSSTVLLTDTEGDMITDRWEAPDG 530

Query: 437  YSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQL--EYEISCLDINPIGENPSY 494
             S+   +A+   VL++ G   L+ L++    +T   H QL  E +ISCL +     + S 
Sbjct: 531  SSITSVSADGDHVLVSLGSAALIVLDLSGSSITVGAHRQLGNEEQISCLSL-----SRSL 585

Query: 495  SQIAAVGMWTDISVRIFSLPDLNLITKEHLGGE---IIPRSVLLCAF--EGISYLLCALG 549
                 VG W D  V + SL DL  I  E +  +    +PRS+++ +   +  + L   L 
Sbjct: 586  PGACVVGFWKDSKVSVLSLSDLQPIASESVAEDDNLAVPRSLIVASILRDQPATLFVGLA 645

Query: 550  DGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFS-SKNTTHVFAASDRPTVIYSSNKKL 608
            DG+++ + +       + RK + LGTQ         S    +VFA  + P++IY S  ++
Sbjct: 646  DGNVVTYSVESLQRPFSARKSIILGTQQANFTVLPRSDGLQNVFATCEHPSLIYGSEGRM 705

Query: 609  LYSNVNLKEVSHMCPFNS-AAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRIC 667
            +YS V     + +C F+S   + +S+AIA   EL + ++D+ +  H++ +P+ E  RRI 
Sbjct: 706  VYSAVTADSATSICAFDSFGDYANSIAIATGSELKLSSVDEERTTHVQDLPVYETVRRIA 765

Query: 668  HQEQSRTFAICSLKNQSCA--EESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFS 725
            +  + + F +  +K    A  EE   HF +L+D+  F+ + ++ L+  E   S++ C   
Sbjct: 766  YSSELKAFGLGCIKRTLAAGVEEVRSHF-KLVDEVAFKALDSWALNEDELVESVIRCPLD 824

Query: 726  DDSNV---YYCVGTAYVLPEENEPTKGRILVF-IVEDGKLQLIAEKETKGAVYSLNAFNG 781
            D + +    + VGTAY+  ++    +GR+LVF + ED +++L+ E   KGA   L    G
Sbjct: 825  DGTGLDAERFVVGTAYLDDQDANTARGRVLVFEVTEDRRIKLVTEMAVKGACRCLAVCKG 884

Query: 782  KLLAAINQKIQLYKWMLR-DDGTRELQSECGHHGHI--LALYVQTRGDFIVVGDLMKSIS 838
            +++AA+ + + +  +       + +L  +  +      + ++  +    I + DLMKS++
Sbjct: 885  RIVAALVKTVVILAYEFSPPKSSPQLIKKASYRTSTAPIDIFASSLDGLIAISDLMKSLT 944

Query: 839  LLIYKH----EEGAIEERARDYNANWMSAVE-ILDDDIYLGAENNFNLFTVRKNSEGATD 893
            L+ Y      +  ++ E AR ++  W +AV  I     Y+ ++   NL  +  +  G + 
Sbjct: 945  LVKYTPGRTGQPDSLVEIARHFDTLWGTAVAPIPGTHSYIQSDAEGNLVVLEHDPTGFSA 1004

Query: 894  EERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQ 953
            E+R RL V  E  LGE VNR R  + V+  P ++   IP     TV G + V  ++  + 
Sbjct: 1005 EDRRRLRVTSEMCLGEMVNRIRPITTVI-TPSANAVVIPKAFIATVEGSVYVFGTIAQQY 1063

Query: 954  YLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAK---NFLDGDLIESFLDLSRTR 1010
               L +LQ ++ +++K  G +   ++R F  + + +  +    F+DG++IE FL LS   
Sbjct: 1064 QDLLIRLQGSMAEMVKSPGFVRFNRFRGFKTQVRDMGEEGPVRFVDGEIIEGFLGLSAEV 1123

Query: 1011 MDEISKTMNV 1020
             + ++K + V
Sbjct: 1124 QESVAKDLGV 1133


>gi|396465264|ref|XP_003837240.1| similar to DDB1B (Damaged DNA Binding Protein 1 B); damaged DNA
            binding / protein binding [Leptosphaeria maculans JN3]
 gi|312213798|emb|CBX93800.1| similar to DDB1B (Damaged DNA Binding Protein 1 B); damaged DNA
            binding / protein binding [Leptosphaeria maculans JN3]
          Length = 1089

 Score =  456 bits (1173), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 321/1107 (28%), Positives = 555/1107 (50%), Gaps = 102/1107 (9%)

Query: 13   LELFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPTDNGQIGII 71
            L+  RP     D LF+ T+R+ +  L WD++  +L T ++   V+D   R +  G+   +
Sbjct: 2    LQKLRPALSPTDHLFVGTDRFMYFTLSWDSDKKQLQTEKSFASVADNAARESQTGERCHV 61

Query: 72   DPDCRLIGLHLYDGLFKVIPFDNKGQ----------LKEAFNIRLEELQVLDIKFLYGCA 121
            DP  R + + +Y+G+  VIP   +G+          L E   +RL E+ V    FL   +
Sbjct: 62   DPTGRFMTVEVYEGIMTVIPLVQRGKKRKQEPDIAHLGEPQPVRLSEMFVRSSAFLRPRS 121

Query: 122  ---KPTIVVLYQDNKDARHVKTYEVALKDKDFVE---GPWSQNNLDNGADLLIPVPPPLC 175
               KP I +LY+D+     +K  E+     D VE   G   +  L+ G+  LIP+  P  
Sbjct: 122  FDDKPKIALLYEDSHSQVKLKLRELTFAGGDAVELEEGETCRAELELGSSHLIPLEEPTH 181

Query: 176  GVLIIGEETIVYC--SANAFKAIPIR-PSITKAYGRVDADGSRYLLGDHAGLLHL-LVIT 231
            G+++I E +I Y    +   +  P+   +I  A+ R+DA   R++L D  G L++ ++I 
Sbjct: 182  GLIVIAETSIGYYDDESGELQTEPLEEATIFVAWERIDA--QRFVLADDYGRLYMFMLIL 239

Query: 232  HEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLE 291
              + KV   K++++G+TS AS + YLD   V++GS  GDSQ+IK+  +       +E+++
Sbjct: 240  DARHKVRSWKLDIIGKTSRASVLVYLDAGYVFVGSHQGDSQVIKIAERS------MEIVQ 293

Query: 292  RYVNLGPIVDFCVVDL-ERQGQGQ----------VVTCSGAYKDGSLRIVRNGIGINEQA 340
             + N+ PI+DF ++D+  R G+GQ          +VT SGAY+DGSLR VR+G+G+ +  
Sbjct: 294  TFSNIAPILDFTIMDMGNRSGEGQTNEYSSGQARIVTGSGAYQDGSLRSVRSGVGLEDLG 353

Query: 341  SV-ELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCS---QT 396
             + E++ I  ++SL+S     +   L+V+F++E+R+   + + E+EE  +E F S   + 
Sbjct: 354  VLGEMEHISNLFSLKSDATAQYADTLLVTFVNESRVFKFDPQGEVEE--VEEFASLALEE 411

Query: 397  QTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGG 456
             TL   +    ++VQVTSG  R+       + +EW      ++  A+AN + VL++ GG 
Sbjct: 412  TTLVAANISQGRVVQVTSGRARICDLDGGMIVSEWMPMGDQTITAASANDTHVLISLGGV 471

Query: 457  HLVYLEIGDGI-LTEVKHAQLEYEISCLDINPIGENPS-YSQIAAVGMWTDISVRIFSLP 514
             +V L + DG+ + + K    E +++C+ +      PS  S +  +G W +  + I SL 
Sbjct: 472  TVVSLNMTDGLQVAQQKTFGTESQVACVAL------PSDSSSVCFLGFWMNSQLAICSLD 525

Query: 515  DLNLITKEHLGGEIIPRSVLLCAF--EGISYLLCALGDGHLLNFLLNMKTGELTDRKKVS 572
             L  I    +  E +PRS+LL     +    L  A+ DG+++ +  +  + EL  +K + 
Sbjct: 526  TLETIKTVTISDESVPRSLLLTQVFPDQPPTLFAAMADGNVITYTFDPSSYELLGKKSIV 585

Query: 573  LGTQPITLRTFSSKNTTH-VFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPD 631
            LGT+  T R     +  + VFA  + P++IY+S  +L+YS V  ++ + +CPF+S A+P 
Sbjct: 586  LGTREATFRALPRGDGIYNVFATCEHPSLIYASEGRLVYSAVTAEKATAVCPFDSEAYPG 645

Query: 632  SLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKN-QSCAEESE 690
            S+AIA   +L I  +D  +  H++++ + E  RRI +    + F + ++K      EE  
Sbjct: 646  SVAIATSEDLRIALVDTERTTHVQTLKVDETVRRIAYSPSLKAFGLGTIKRILKNGEEIM 705

Query: 691  MHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSN---VYYCVGTAYVLPEENEPT 747
            +   +L+D+  F+ + TY L+  E    ++ C  +D S      + VGTAY+  +     
Sbjct: 706  LSHFKLVDEIQFKELDTYALNEEELVECVMRCELADGSGGLAERFVVGTAYLDDQNATAE 765

Query: 748  KGRILVF-IVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTREL 806
            +GRIL+  +  +  L+L+ E   KG    L    GK++AA+ + I LY    +      L
Sbjct: 766  RGRILILEVTPERVLKLVTELAVKGGCRCLAMCEGKIVAALIKTIVLYDVEFKTQSKPTL 825

Query: 807  QSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEER----ARDYNANWMS 862
                        + +   G  I + DLMKS++++ Y+  EG + ++    AR Y   W +
Sbjct: 826  VKAATFRCSTAPIDITVNGPVITIADLMKSLAMVKYQKGEGGLPDKLVELARHYQVTWAT 885

Query: 863  AVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMR 922
            AV  +D + +L ++   NL  + +N EG TD+++  + +  E  LGE VNR R    V  
Sbjct: 886  AVAEIDTETFLESDAEGNLMVLSRNVEGVTDDDKRHMLISSEMLLGEMVNRIRRID-VQT 944

Query: 923  LPDSDVGQIPTVIFGTVN--------------------------GVIGVIASLPHEQYLF 956
             PD+ V  IP    GTV                           G I +   +       
Sbjct: 945  APDAVV--IPRAFVGTVRISPLLSSLPSSYPNNPPTNHQHTQVEGSIYLFGLIAPSHLHL 1002

Query: 957  LEKLQTNLRKVIKGVGGLNHEQWRSFNNE--KKTVDAKNFLDGDLIESFLDLSRTRMDEI 1014
            L  LQ+NL  ++   G ++  ++R+F N+  +   + K F+DG+L+E FLDL      + 
Sbjct: 1003 LMTLQSNLAALVPAPGNMDFAKFRAFRNQVRQDEEEPKRFVDGELVERFLDLGAEAQAKA 1062

Query: 1015 SKTMNVS-----VEELCKRVEELTRLH 1036
             + + +      VE +   VE L RLH
Sbjct: 1063 VQGLRLGEAELDVEAVRGLVEGLRRLH 1089


>gi|350629921|gb|EHA18294.1| damage-specific DNA binding protein [Aspergillus niger ATCC 1015]
          Length = 1140

 Score =  450 bits (1158), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 334/1105 (30%), Positives = 554/1105 (50%), Gaps = 93/1105 (8%)

Query: 6    IYGRIATL-ELFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPT 63
            ++ ++  L  L  P     D LF+ T+RY +C L WD E +++ T R   D+SD   R  
Sbjct: 55   LFAKVTMLARLPAPANSPTDHLFVGTDRYTYCTLSWDGERNQIRTERNYVDISDPSSREA 114

Query: 64   DNGQIGIIDPDCRLIGLHLYDGLFKVIPF--------------------DNKGQLKEAFN 103
              G   +IDP  R + L +Y+G+  V+P                        G+L E   
Sbjct: 115  QTGNRCLIDPSGRFMTLEVYEGVIAVVPIVQLPSKKRGRQVAPPSGPDAPRVGELGEPTT 174

Query: 104  IRLEELQVLDIKFLY-GCAKPTIVVLYQDNKDARHVKT----YEVALK----DKDFVEG- 153
             R++EL V    FL+     P + +LY+DN+    +K     Y  A      D  F E  
Sbjct: 175  ARIDELFVRSSAFLHVQSGPPRLALLYEDNQKKVRLKVRALHYSAATASTGADAAFEESL 234

Query: 154  -PWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIRP----SITKAYGR 208
              +SQ  LD GA  LIPVP PL G+L++GE +I Y   ++ + +  RP    +I  A+ +
Sbjct: 235  DGFSQE-LDLGASHLIPVPAPLGGLLVLGETSIKYVDTDSNEIVS-RPLDEATIFVAWEQ 292

Query: 209  VDADGSRYLLGDHAG-LLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSS 267
            VD+   R+LL D  G L  L+++     +V   K++ LG T+ AS + YL   V+++GS 
Sbjct: 293  VDS--QRWLLADDYGRLFFLMLVLDSNNQVQSWKLDHLGNTARASVLIYLGGGVIFVGSH 350

Query: 268  YGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDL-----ERQ------GQGQVV 316
             GDSQ++++       GS +EV++   N+ PI+DF ++DL     E Q      GQ ++V
Sbjct: 351  QGDSQVLRI-----GNGS-LEVIQTLSNIAPILDFTIMDLGNRTSESQTHEFSSGQARIV 404

Query: 317  TCSGAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLRSSTDDPFDTFLVVSFISETRI 375
            T SGA+ DG+LR VR+G+G+ E   + E++ I  ++ L+ +T   +   L+V+F+ ETR+
Sbjct: 405  TGSGAFDDGTLRSVRSGVGMEELGVLGEMELITDLFGLQLATKGGYLDTLLVTFVDETRV 464

Query: 376  LAMNLEDELEETE-IEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSP 434
               N + E+EE +   G      TL   +    +++QVT   V +       + NEW  P
Sbjct: 465  FQFNFDGEVEELDSFLGLSLSENTLLAMNLPGGRILQVTEQRVLIADIEGGMVTNEWTPP 524

Query: 435  PGYSVNVATANASQVLLATGGGHLVYLEIGDGI-LTEVKHAQLEYEISCLDINPIGENPS 493
                +  A+AN   ++L +GG  +  L+I + + +   K    + +IS + + P+    S
Sbjct: 525  DNLVITSASANNDSIVLVSGGQLMTVLDINNDVRVISQKDFGADSQISGVTV-PL----S 579

Query: 494  YSQIAAVGMWTDISVRIFSLPDLNLITKEHLG--GEIIPRSVLLCAF--EGISYLLCALG 549
             + +  VG      V +  L  L+ +    LG  GE  PRSVL+        S L  ++ 
Sbjct: 580  SAGVCIVGFPQLAKVSVLDLGRLSELHTTSLGPAGEAFPRSVLVADVLANSPSTLFISMA 639

Query: 550  DGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTH-VFAASDRPTVIYSSNKKL 608
            DG ++ +  +     LT   ++ LG++  T +     +  + VFA  + P++IY S  ++
Sbjct: 640  DGSVITYSFDASNYSLTGMNRLILGSEQPTFKKLPRGDGLYNVFATCENPSLIYGSEGRI 699

Query: 609  LYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICH 668
            +YS VN +  S +C FNS A+P S+A+A   +L I  +D  +   I+++P+GE  RR+ +
Sbjct: 700  IYSAVNSEGASRVCHFNSEAYPGSIAVATRHDLKIALVDKERTTQIQTLPMGETARRVAY 759

Query: 669  QEQSRTFAICSL--KNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFS- 725
                + F I ++  K +  AE  +  FV L D+  F  +  + L   E   S++   FS 
Sbjct: 760  SPSEKAFGIGTIERKLKDGAEMVQSRFV-LADEIMFRRLDAFDLRPEELVESVIRAEFSA 818

Query: 726  ----DDSNVY---YCVGTAYVLPEENEPTKGRILVFIVEDG-KLQLIAEKETKGAVYSLN 777
                +  +V+   + VGTAY+  E  E  +GRILVF +++G KL  +AE   KGA  +L 
Sbjct: 819  GKDENGRDVFKDRFVVGTAYLDDENEESIRGRILVFEIDNGRKLTKVAELPVKGACRALA 878

Query: 778  AFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSI 837
                K++AA+ + + +Y  +  D G  +L+    +      + V   G+ I V DLMKS+
Sbjct: 879  MLGEKIVAALVKTVVIYGVVNNDFGAMKLEKLASYRTSTAPVDVTVTGNVIAVADLMKSV 938

Query: 838  SLLIYKHEEG----AIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATD 893
             L+ Y   E     ++ E AR +   W + V  +  D +L  +   NL  +R+N  G  +
Sbjct: 939  CLVEYSEGENGMPDSLTEVARHFQTVWATGVSCIAKDTFLETDAEGNLIVLRRNLTGVEE 998

Query: 894  EERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQ 953
            +++ RLEV GE  LGE VNR R  ++      + V   P    GTV G I + A +  E 
Sbjct: 999  DDKRRLEVTGEISLGEMVNRIRPVNIQQL---ASVTVTPRAFLGTVEGSIYLFAIINPEH 1055

Query: 954  YLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNE-KKTVDAKNFLDGDLIESFLDLSRTRMD 1012
              FL +LQ  +   ++ +G +   ++R F +  ++T +   F+DG+LIE FL    +  +
Sbjct: 1056 QDFLMRLQATMAGKVESLGNIPFNEFRGFRSMVRETKEPYRFVDGELIERFLTCEPSLQE 1115

Query: 1013 EISKTMN-VSVEELCKRVEELTRLH 1036
            EI  ++  ++V+E+   +E L RLH
Sbjct: 1116 EIVDSVGMMNVDEVKIMIEALRRLH 1140


>gi|452979181|gb|EME78944.1| hypothetical protein MYCFIDRAFT_43692 [Pseudocercospora fijiensis
            CIRAD86]
          Length = 1149

 Score =  449 bits (1156), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 325/1111 (29%), Positives = 547/1111 (49%), Gaps = 98/1111 (8%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPTD 64
            IYG++  L   RP     D LF+ T+RY +  L WD E  +L T ++M D++D+  R + 
Sbjct: 57   IYGKVTLLHKLRPATSQTDHLFVGTDRYNYFTLSWDGERQDLKTEKSMVDIADKAARDSQ 116

Query: 65   NGQIGIIDPDCRLIGLHLYDGLFKVIPF-----------DNK-GQLKEAFNIRLEELQVL 112
             G    IDP  R + L +Y+G+  V+P            DN+ G+L +   +R+ EL V 
Sbjct: 117  TGDRVHIDPTSRFMTLEVYEGVVNVLPIAHAGKGKRKAADNEIGELGDPIQVRIPELFVR 176

Query: 113  DIKFLY-------GCAKPTIVVLYQDNKDARHVKTYEVALKDK----------DFVEGPW 155
               FLY       G   P   VLY+D+K+   +K  E+               +  +G  
Sbjct: 177  SSCFLYRRDTGPRGKTDPMFAVLYEDSKNQVRIKIRELEYSPSLRPNEEPATAELEKGVD 236

Query: 156  SQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIRP----SITKAYGRVDA 211
               +L+ G+  +IP+P P  G+L++GE  I Y      K    +P    ++  A+ ++D 
Sbjct: 237  VTGSLEMGSSFMIPLPAPTYGLLVVGETRITYVDEWEHKVQDSQPLEDATVFVAWCKLDE 296

Query: 212  DGSRYLLGDHAGLLHLLVI-THEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGD 270
               RY+L D    L+LL++ T        L+I++LGETS AST+ YLD   V++GS  GD
Sbjct: 297  --QRYILADDYKKLYLLMVNTSPHGDYQSLEIDVLGETSRASTLVYLDEGRVFVGSHQGD 354

Query: 271  SQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQ-----------GQGQVVTCS 319
            SQ+I+++ +       ++VL+ + N+ PI+DF V+D+  +           GQ ++VT S
Sbjct: 355  SQVIQISPK------RIDVLQSFPNIAPILDFTVMDMGNRSLDAPVNEFSSGQARIVTGS 408

Query: 320  GAYKDGSLRIVRNGIGINEQASVELQG--IKGMWSLRSSTDDPFDTFLVVSFISETRILA 377
            GA+ DGSLR VR+G+G+ ++ S+   G  I  ++SLR+S    +   L+VS +S T ++ 
Sbjct: 409  GAFNDGSLRSVRSGVGLEDRGSLGDLGAPISSIFSLRTSAAAQYVDTLIVSHVSHTNVVV 468

Query: 378  MNLEDELEETE-IEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPG 436
               + ++E  +   GF  +  TLF  D    + VQVTS SV L  S    + + W++P G
Sbjct: 469  FTEDGDIEARDSFRGFELKDATLFAGDLRDGRAVQVTSSSVLLTDSEGDMVTDRWQAPSG 528

Query: 437  YSVNVATANASQVLLATGGGHLVYLEIGDG---ILTEVKHAQLEYEISCLDINPIGENPS 493
             S+    A+  ++L++  G  LV L++      +  E K    E ++SC+ +     +PS
Sbjct: 529  TSITAVAADGEKILVSLQGAVLVVLDLSASNIQVQAERKFGS-EEQVSCIAL-----SPS 582

Query: 494  YSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGE---IIPRSVLLCAF--EGISYLLCAL 548
                   G W D  V   SL DL     E +  E    +PRS+++     +  + L   L
Sbjct: 583  IPNACVAGFWKDSKVSFLSLQDLQPTASERVAEEESLAVPRSLIVANILKDQPATLFVGL 642

Query: 549  GDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFS-SKNTTHVFAASDRPTVIYSSNKK 607
             DG+++ + +       + RK + LGTQ         +    +VFA  + P++IY S+ +
Sbjct: 643  ADGNVVTYSVTSAQQPFSARKCIILGTQQANFAALPRADGLQNVFATCEHPSLIYGSDGR 702

Query: 608  LLYSNVNLKEVSHMCPFNS-AAFPDSLAIAK-----EGELTIGTIDDIQKLHIRSIPLGE 661
            L+YS V  +  + +  F+S   +  ++AIA      E EL +  +D+ +  H++ + + E
Sbjct: 703  LVYSAVTAENATCIASFDSFGDYAGAIAIATTDENGENELKLAVVDEERTTHVQDLFIHE 762

Query: 662  HPRRICHQEQSRTFAICSLKNQSCA--EESEMHFVRLLDDQTFEFISTYPLDTFEYGCSI 719
              RRI +  + + F +  +K    A  EE   HF +L+D+  F+ + T+ L+  E    +
Sbjct: 763  TVRRIAYSAELKAFGLGCIKRTLSAGNEEVASHF-KLVDEVAFKELDTWALNEDELVECV 821

Query: 720  LSCSFSDDSN---VYYCVGTAYVLPEENEPTKGRILVF-IVEDGKLQLIAEKETKGAVYS 775
            + C   D S      + VGTAY+  ++    KGRILV  I ED +++L+ E   +GA   
Sbjct: 822  IRCYLDDGSGEEAERFVVGTAYLDDQDANNAKGRILVLEITEDRRIKLVTELAVRGACRC 881

Query: 776  LNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMK 835
            L    G+++AA+ + I +Y +  +   T  L  +  +      + +    + I V DLMK
Sbjct: 882  LAVCQGRIVAALVKTIVVYDFEYQTPSTPALTKKASYRTATAPIDICVTNNTIAVTDLMK 941

Query: 836  SISLLIYKH----EEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGA 891
            S+SLL +K     +   + E AR +   W +A   + ++ YL ++   NL  ++ +  G 
Sbjct: 942  SLSLLEFKAGRQGQPDTLIEIARHFETLWGTACARVSENTYLESDAEGNLIVLQHDINGF 1001

Query: 892  TDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPH 951
            + E+R RL V  E+ LGE VNR R    +   P       P     T +G I V   +  
Sbjct: 1002 SQEDRRRLRVTSEFLLGEMVNRIRP---ITVQPSPGAVVTPQAFLATTDGSIYVYCEIGK 1058

Query: 952  EQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAK---NFLDGDLIESFLDLSR 1008
             +   L ++QT +  ++K  GG+   ++R F    + +  +    F+DG+LIE FLD+  
Sbjct: 1059 PRQDLLMRMQTLMADMVKSPGGVRFAKFRGFKTLVRDMGEEGPVRFVDGELIERFLDMPE 1118

Query: 1009 TRMDEISKTMN---VSVEELCKRVEELTRLH 1036
               +E+ K ++   V +E L   VE L R+H
Sbjct: 1119 VLQNEVVKGLDGTGVDLEGLRGMVEGLRRIH 1149


>gi|167517245|ref|XP_001742963.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163778062|gb|EDQ91677.1| predicted protein [Monosiga brevicollis MX1]
          Length = 1670

 Score =  448 bits (1152), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 317/1091 (29%), Positives = 542/1091 (49%), Gaps = 77/1091 (7%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            ++D  +YG +  + + R  G  +D +F+ T  ++F +L  D E + ++T   G+      
Sbjct: 95   VMDKDLYGSVLAMNVIRLPGWERDAVFLLTSTFRFFILASD-EDNGVVTVIKGNALSSCQ 153

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
            RP D G    + P    I + +Y G   +IPFD  G+  EA+++ +    +LD  F+ G 
Sbjct: 154  RPADCGVHVAVHPKGNCIFVSVYPGNALIIPFDASGEPMEAYSVFVPVSSLLDATFVNGA 213

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
               T+ +L  D+ +   +K + + ++++  VE   + + ++  +  L+P+     GVL++
Sbjct: 214  HDFTLALLSDDDTNFTSLKMFHLDVEERTLVEEQLADSTINTYSSRLLPLWNLDSGVLVL 273

Query: 181  GEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-KVTG 239
            GEE     + +   +  +  S+  A G +D DGSR L+GD  G LHLLV+    E +VT 
Sbjct: 274  GEELCHVVTPSGIISSNLSESLPVAAGIIDTDGSRILIGDELGDLHLLVLEGIAERRVTS 333

Query: 240  LKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKL--NLQPDAKGSYVEVLERYVNLG 297
            +  + LG  S  S I YLDN VVYIGS   D QLI+L  ++Q +A    V+VL+ Y N+G
Sbjct: 334  IVRQHLGRISTPSAIVYLDNGVVYIGSDQADCQLIQLLSHVQAEADNK-VKVLQEYPNIG 392

Query: 298  PIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSST 357
            PIVDF +VDL+  GQ QVV+C G+ +DG LRI+R G+GI+  AS++L+G++ +W LRS++
Sbjct: 393  PIVDFEMVDLDGHGQQQVVSCCGSNQDGCLRILRKGVGIDVLASLDLEGLQDLWCLRSAS 452

Query: 358  ---DDPFDTFLVVSFISETRILAMNLED--------------ELEETEIEGFCSQTQTLF 400
               +D  D  L + F+ +T  L++  ++              EL+  ++ G  ++   L 
Sbjct: 453  NLGEDQHDV-LALKFLEQTAFLSLAGDEVCLLYSTPTSHSYTELDGVDVAGANTELPALH 511

Query: 401  CHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGG-HLV 459
            C +      + VTS   RL+ +  R     W  P G  ++V  +    + +A+G   + +
Sbjct: 512  CGNVRDGMWLVVTSQDARLLDAVDRTEVTRWSPPNGKGIDVCASTGDLLAVASGSDLYAL 571

Query: 460  YLEIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLI 519
             L   +G L ++K+A L++EI+CL I   G +     I A G+WTD S+R FS   L   
Sbjct: 572  SLSRTEG-LHDMKNATLDHEIACLSIRASGPDQGAGTILA-GLWTDFSLRAFSTRTLEEQ 629

Query: 520  TKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPIT 579
             K  +  +++  SV     EG  Y     GDG L   + +  +        V +G+ P+ 
Sbjct: 630  AKVEVPTQVVSSSVASVTMEGTCYFFIGHGDGKLAYGVFDPLSSTFGAPHVVQVGSLPVK 689

Query: 580  LRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEG 639
            LR         VF A+DRP V+ S   KLL+ NV+          N+ A+ D LA  ++ 
Sbjct: 690  LRACKRGKDEFVFVATDRPMVVSSRRGKLLFCNVSAGACRTADVLNAEAYVDCLAYVEQD 749

Query: 640  ELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESE-MHFVRLLD 698
             L  G ++++Q L IR IPL E P  + + + S  F + +   ++C    E + ++RL+D
Sbjct: 750  RLVFGKMENMQNLQIRKIPLDETPLGVTYHKSSGAFCVATDAARACPTPQEPICYLRLID 809

Query: 699  DQTFEFISTYPLDTFE--YGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIV 756
             Q+FE   ++ L+  E  +G S+ +    +DS  Y  VGTA   P    P +GRILV  V
Sbjct: 810  AQSFEVRDSFKLEQAESLFGHSLHTMQLRNDSTEYIVVGTAMHDPNRPLPKQGRILVLRV 869

Query: 757  -EDGKLQLIAEKETK-GAVYSLNAFNGKLLAAINQKIQLYKWMLRD-DGTRELQSECGHH 813
             +DGKL+L+       G ++SL AF   ++A IN +++ +       +   E+ S+    
Sbjct: 870  NDDGKLELVVSHAIHDGGIFSLQAFRDGVVAGINGRLEYFSLESTPLERKVEVASQTVFR 929

Query: 814  GHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYL 873
            G      +   G+ ++VGD+++S++ + Y  +         D  + ++    +  +D +L
Sbjct: 930  GMQTVSCLGVCGNTVLVGDILQSVTAVNYSEQRNRFVVGPGDPESRYLLTCFLPAEDRFL 989

Query: 874  GAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVN--------------------- 912
              +++ NL  +        + +   + + G  H+G+ +N                     
Sbjct: 990  FCDSDQNL-VLGMPPVDTVENDASLMHLAGRIHIGDNINSYVICACIHVWTPYLLCPDST 1048

Query: 913  ----------RFRHGSLVMRLP-DSDVGQI----------PTVIFGTVNGVIGVIASLPH 951
                      RF  GSL +     ++ G+           P ++F TV G +G+I  +  
Sbjct: 1049 FCFAALFVTSRFAFGSLSLSYERPAEAGEAGEDGAKQQSSPPIVFTTVLGGVGMILEVQQ 1108

Query: 952  EQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKK--TVDAKNFLDGDLIESFLDLSRT 1009
            +   F+ ++Q  L  +   VGGL HE +RS  N K+     A+ F+DG+LIESFL+L+  
Sbjct: 1109 KHLWFMHEMQRRLADMGNAVGGLTHEDYRSTKNGKRESVTPARCFVDGNLIESFLELTPE 1168

Query: 1010 RMDEISKTMNV 1020
             M+E+ K  ++
Sbjct: 1169 EMEEVMKEFHI 1179


>gi|367044684|ref|XP_003652722.1| hypothetical protein THITE_2114471 [Thielavia terrestris NRRL 8126]
 gi|346999984|gb|AEO66386.1| hypothetical protein THITE_2114471 [Thielavia terrestris NRRL 8126]
          Length = 1187

 Score =  446 bits (1147), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 330/1146 (28%), Positives = 552/1146 (48%), Gaps = 128/1146 (11%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPTD 64
            + G IA L+  RP     D LF+ T+R+++  L W+ E+S+L T     D  +R  R + 
Sbjct: 55   VNGTIAILQKLRPKDARTDLLFVGTDRFEYFTLAWNPETSQLDTINPFNDPGERHMRDSQ 114

Query: 65   NGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFN----IRLEELQVLDIKFLYG- 119
            +    ++DP  R + +HL++G+  ++   N+       +    +RL EL +    FLY  
Sbjct: 115  SQDRCLVDPSGRFLAMHLWEGVLTILRLGNRKNTATVLDWMGQVRLSELFIKASTFLYTE 174

Query: 120  CAKPTIVVLYQDNKDAR--HVKTYEVALKDKDFVEGPWSQNN--------LDNGADLLIP 169
               P I +LYQ   D+    + TY +   D++     +  N          D  A +LIP
Sbjct: 175  TGHPKIALLYQSRADSSDAQLATYRLTSDDRNTELSRFDPNRDREIDAEIHDPSASMLIP 234

Query: 170  V------------------PPPLCGVLIIGEETIVY---CSANAFKAIPIRPSITKAYGR 208
            V                     + G++++GE  ++Y    +    ++     SI  A+  
Sbjct: 235  VRKVEEQVKRHNVRNVESAKAHIGGLIVVGETRLLYIDEVTKTTVESALTEASIFVAWAE 294

Query: 209  VDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSY 268
             D     Y L D  G LHLL +  E   VTG+ +  +G+TS AS + YL + ++++ S Y
Sbjct: 295  YDV--KHYFLADDYGNLHLLTLETEDVVVTGMIVNRIGKTSRASNLVYLGDNLLFVASHY 352

Query: 269  GDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQ------------GQGQVV 316
            GDSQL +L+L+ D     V++++   N+GPI+DF ++DL  +            GQ ++V
Sbjct: 353  GDSQLFRLDLENDDARQLVQLVQTLPNIGPILDFEIMDLGNRGDEGQLANEYSSGQARIV 412

Query: 317  TCSGAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLRSSTDDPFDTFLVVSFISETRI 375
            TCSG +KDG+LR VR+G+G+ +   + +L+  +G++ L SS   P    L VSF++ETR+
Sbjct: 413  TCSGVHKDGTLRSVRSGVGLEDVGILADLEHCRGLFPL-SSYGSPKTDTLAVSFLTETRV 471

Query: 376  LAMNLEDELEETE-IEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSP 434
               +   ++EE E   G     QTL   +    QL+QVT  +  L+ + S      W   
Sbjct: 472  FKFDSHGDVEEVESFSGMTFDQQTLLAMNLPKGQLLQVTPAAASLLDAESGVTIASWAPE 531

Query: 435  PGYSVNVATANASQVLLATGGGHLVYLEIGDGILT-EVKHAQLEYEISCLDINPIGENPS 493
               ++  A+AN   +LL+ GG  LV L I +   T + K    + +++C+ +      P 
Sbjct: 532  GERTIISASANPRWLLLSVGGTELVSLSIANDFQTVQAKDMNQQDQVACIHV-----APG 586

Query: 494  YSQIAAVGMWTDISVRIFSLPDLNLITKEHL----GGEIIPRSVLLCAFEGISY----LL 545
               +  VG WT  +V I  L  L  I  E L        IPR + L      S     L 
Sbjct: 587  LDDVGVVGFWTSGTVSIIDLHTLEPIHGESLRTSKDDASIPRDLALVQMLPPSASGPTLF 646

Query: 546  CALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTT-HVFAASDRPTVIYSS 604
             A+ DG+++ F ++ K   L+ RK+V LG +          +    +FA ++ P++IY S
Sbjct: 647  VAMQDGNVVTFNVS-KDLALSGRKRVILGMRQARFHLLPQPDGIFSIFATTEHPSLIYGS 705

Query: 605  NKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPR 664
              +++YS V  +E +++CPF++ AFPD + +A + ++ I  ID  ++ H++ + +GE  R
Sbjct: 706  EGRIVYSAVTAEEATYICPFDTEAFPDCIVLATDAQIRISQIDRERRTHVKPLQMGEMVR 765

Query: 665  RICHQEQSRTFAI-CSLKNQSCAEESEMHFVRLLDDQTFEFIS-TYPLDTFEYGCSILSC 722
            RI +  + + F + C  ++    EE      +L+D+  F+ +  ++PL +  Y   +   
Sbjct: 766  RIAYSPREKVFGLGCIKRDLVAGEEVVQSSFKLVDEIIFDRVGRSFPLGSPSYTELVECV 825

Query: 723  SFSDDSNVY------YCVGTAYVL-PEENEPT--KGRILVFIVE-DGKLQLIAEKETKGA 772
              ++  + Y      + VGT+Y+  P+    T  +GRILVF V+ D    L+   E KGA
Sbjct: 826  VRAELPDSYGAPAERFLVGTSYLADPDLGAGTDARGRILVFGVDADRNPYLVLSHELKGA 885

Query: 773  VYSLNAF-NGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVG 831
               L    +GK++A + + + + ++      T EL     +      + +  RG+ I V 
Sbjct: 886  CRCLAVMDDGKIVAGLTKTVVVCRYEETSSTTAELTRLASYRPSTYPVELCVRGNTIAVA 945

Query: 832  DLMKSISLLIY---------------KHEEGAIEERARDYNANWMSAVEILDDDIYLGAE 876
            DLMKS++L+ +               K  E  + E+AR + + W +AV  + DD +L A+
Sbjct: 946  DLMKSVALVEFVPAGADETGAAGPSSKRGEAKLVEKARHFGSVWATAVSHVQDDSWLEAD 1005

Query: 877  NNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIF 936
               NL  +R+N EG T E++ R+EV  E +LGE VNR R G  V   P + V  +P    
Sbjct: 1006 AQGNLMVLRQNLEGVTAEDKKRMEVTSEMNLGEMVNRIR-GIEVETTPGAIV--VPKAFL 1062

Query: 937  GTVN-------------------------GVIGVIASLPHEQYLFLEKLQTNLRKVIKGV 971
            GTV+                         G+      +PH Q L L + Q  L  VIK  
Sbjct: 1063 GTVSLRCGESLFGMNERRFANTGQWQVEGGIYMFATVVPHAQDLLL-RFQAKLADVIKTA 1121

Query: 972  GGLNHEQWRSFNNEKKTVDAK-NFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVE 1030
            GG+    +R+F N ++  D    F+DG+L+E FLD+     + I + +  +VE++   VE
Sbjct: 1122 GGIEFRTYRAFRNAEREGDGPFRFIDGELLERFLDVDEATQEVICQGLGPTVEDMRNLVE 1181

Query: 1031 ELTRLH 1036
            EL R+H
Sbjct: 1182 ELRRMH 1187


>gi|326426696|gb|EGD72266.1| hypothetical protein PTSG_00286 [Salpingoeca sp. ATCC 50818]
          Length = 1104

 Score =  444 bits (1143), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 302/1068 (28%), Positives = 543/1068 (50%), Gaps = 51/1068 (4%)

Query: 3    DVPIYGRIATL-ELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGR 61
            D P+YG++A + ++  P  +  D L + T+  +F VL ++  + +L T   G++ +    
Sbjct: 54   DYPVYGKVACIRKMTFPQMKGLDSLLVMTDDMRFFVLTFNEHTKKLETHTNGNLLNPSSF 113

Query: 62   PTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGCA 121
            P + G +  +DPDCR+I + +Y GL K++        +EA   RL+  QVL +  L+GCA
Sbjct: 114  PMETGPLVAVDPDCRVIVMAMYPGLVKILRIHGCKFEEEASEARLDADQVLSMAMLHGCA 173

Query: 122  KPTIVVLYQDNKDARHVKTYEVALKDKDF-VEGPWS---QNNLDNGADLLIPVPPPLCGV 177
            +PT+ VL Q   D R VK   V  +++   V   W    +  + + A  L PVP    G 
Sbjct: 174  EPTLAVLCQ-RADQRFVKLMRVTSRNELVEVSSVWKTRLEGAVPDSAHFLHPVPNTSEGC 232

Query: 178  LIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDH-AGLLHLLVITHEKEK 236
            L+ G + +VY  AN +KA  I   + +A   VD  G+R+L+GD   G L LLV+  E+++
Sbjct: 233  LVFGADAVVYADANGYKAASIPEMMVQAVADVDDSGARFLIGDSLRGGLALLVLEREEDQ 292

Query: 237  VTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNL 296
            V  +  E  GET   ST++YLDN+VV++GS  GDSQ++KL    D +   + VL+ + N+
Sbjct: 293  VQQIVYEPFGETVAPSTLAYLDNSVVFVGSVVGDSQVVKLETDDDNQNKII-VLDTHENI 351

Query: 297  GPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSS 356
            G ++D C +     G+ ++VT SGA KDGSLR+++ G+ I   A+V+L  ++ +W L   
Sbjct: 352  GSVLDMCSLPQASFGETRLVTASGAGKDGSLRVIQRGVNITSSATVQLDDLQRVWVLTKP 411

Query: 357  TDDP----FDTFLVVSFISETRILAMNLEDELEETEIEGF----CSQTQTLFCHDAIYNQ 408
            + +      ++FL +SF      L      + E  E+ GF          L C +   NQ
Sbjct: 412  SGEAEAATEESFLALSFAGGLAFL------QFEGEELCGFEVPSAPSDPALLCSNVAENQ 465

Query: 409  LVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGIL 468
             + VT   V LV + +     EWK+  G ++  A A + + L+ + G  L+  EI  G L
Sbjct: 466  WLFVTEDEVVLVCAETLARVAEWKAAEGQAIG-ACACSEKQLVVSSGRQLLIFEIAKGKL 524

Query: 469  TEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEI 528
            T+ K   LE+E+SCLDI P+ ++ +   +A  G W ++  +++ LP + + +       +
Sbjct: 525  TQTKDTTLEHELSCLDILPVNDDGTSVMVA--GTW-NVEAKVYHLPSMRVASSAPFKAGV 581

Query: 529  IPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNT 588
            I  S  +      +     +GDG ++ ++    +  +T+++ V  G QP++L +    + 
Sbjct: 582  IATSCAITRLGDQNVAFFGMGDGSVVRYIFAEGSWHMTNQRHVHAGKQPVSLVSCKFASG 641

Query: 589  THVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDD 648
              V A S+   + Y+ + ++ +S +N   ++ + P ++ A+PDSL  +    L IG I  
Sbjct: 642  PAVVALSNTSLLFYADSGRVTFSTLNEANLTCVAPLSTPAYPDSLVFSTPASLGIGQIGR 701

Query: 649  IQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTY 708
            +  LHI  + LG  P  +     + ++ +  + +    +      +R+ D  T E ++++
Sbjct: 702  MNNLHINKVALGFSPVSLTTISANPSYVVV-VGHVDQEDGGIASAIRVFDGTTLEMVASH 760

Query: 709  PLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGK-LQLIAEK 767
             L   E    ++   F DD+  Y+ +GTA+V P E +P++GRIL+  +E+ K + ++ E 
Sbjct: 761  ELPAPEAVNCVIQHKFKDDNTEYFIIGTAFVDPTETQPSRGRILISKLENKKEIAIVHEC 820

Query: 768  ETKGAVYSLNAFNGK----LLAAINQKIQLYKW-MLRDDGTRELQSECGHHGHILALYVQ 822
            E  G+VY L    GK    L+A IN ++  +K+     D  ++L++  G+      + + 
Sbjct: 821  EAAGSVYCLTKMCGKDTDDLVAGINNQVVHFKYDATGQDAAKKLRAVSGNQNFGAVVSLD 880

Query: 823  TRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLF 882
            +  D ++VGD++ ++   + +  +  ++  A    ANW+S+  ++++ ++L A +  +L 
Sbjct: 881  SCDDIVLVGDMLNAV--FVMQKAQDKLQLVAGSQTANWVSSCALVNETVFLVASHAHSLS 938

Query: 883  TVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSL-VMRLPDSDVGQIPT-VIFGTVN 940
              ++  E  +  +   L    E +LGE V  F   +L      DS +    T  +FGT+ 
Sbjct: 939  VCQREFEPGSTMQ--TLNAKFEIYLGETVTSFVRAALGSAAAVDSSMPLRNTFFVFGTMG 996

Query: 941  GVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKN--FLDGD 998
            G +  +  L   Q   L  L+  + + I G+GGL+H ++R+  +E++     N   +DGD
Sbjct: 997  GGLACLLPLTPPQTELLTALECRMEEKIGGLGGLDHREFRTARDEQRMAQQVNPRLVDGD 1056

Query: 999  LIESFLDLSRTRMDEISKTMN----------VSVEELCKRVEELTRLH 1036
            L+E+FL L      E+   M+          V+ E     +E + RLH
Sbjct: 1057 LVETFLQLPEEEQKELVAGMSLMGEDGRPYVVTAEVAKAALEAMARLH 1104


>gi|70992271|ref|XP_750984.1| UV-damaged DNA binding protein [Aspergillus fumigatus Af293]
 gi|66848617|gb|EAL88946.1| UV-damaged DNA binding protein, putative [Aspergillus fumigatus
            Af293]
 gi|159124553|gb|EDP49671.1| UV-damaged DNA binding protein, putative [Aspergillus fumigatus
            A1163]
          Length = 1140

 Score =  441 bits (1134), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 334/1104 (30%), Positives = 544/1104 (49%), Gaps = 91/1104 (8%)

Query: 6    IYGRIATL-ELFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPT 63
            IY R+  L  L  P     D LF+ T+RY +  L WD+    + T R   D+SD   R +
Sbjct: 55   IYARVTMLARLPAPANSPTDHLFVGTDRYTYFTLSWDSSEKRVRTERDYVDMSDPSSRES 114

Query: 64   DNGQIGIIDPDCRLIGLHLYDGLFKVIPF--------------------DNKGQLKEAFN 103
              G   +IDP  R + L +Y+G+  VIP                        G+L E   
Sbjct: 115  QTGNRCLIDPSGRFMTLEIYEGVIAVIPIIQLPSKKRGRAVALPSGPDAPRVGELGEPTT 174

Query: 104  IRLEELQVLDIKFLY-GCAKPTIVVLYQDN-KDAR--------HVKTYEVALKDKDFVEG 153
             R+EEL V    FL+     P + +LY+DN K  R        H  T   +  D  F E 
Sbjct: 175  ARIEELFVRSSAFLHVQEGLPRLALLYEDNQKKVRLRVRELQYHAATGSNSTADATFGEP 234

Query: 154  PWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIRP----SITKAYGRV 209
                 +L+ G+  LIPVP PL G+LI+GE +I Y  A+  + I  RP    +I  A+ +V
Sbjct: 235  ADFTQDLELGSSHLIPVPAPLGGLLILGEMSIKYVDADNNEIIS-RPLDEATIFVAWEQV 293

Query: 210  DADGSRYLLGDHAG-LLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSY 268
            D+   R+LL D  G L  L+++     +V   K++ LG TS AS + YL   ++++GS  
Sbjct: 294  DS--QRWLLADDYGRLFFLMLVLDSDSQVESWKLDHLGNTSRASVLVYLGGGILFLGSHQ 351

Query: 269  GDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDL-----ERQ------GQGQVVT 317
            GDSQ+++++  P      +EV++   N+ PI+DF ++DL     E Q      GQ ++VT
Sbjct: 352  GDSQVLRISNGP------LEVVQTLSNIAPILDFTIMDLGNRSSESQTHEFSSGQARIVT 405

Query: 318  CSGAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLRSSTDDPFDTFLVVSFISETRIL 376
             SGA+ DG+LR VR+G+G+ E   + ++  I  +W L+  +   F   L+V+F+ ETR+ 
Sbjct: 406  GSGAFDDGTLRSVRSGVGMEELGVLGDMDHITDLWGLQVGSSGDFLDTLLVTFVDETRVF 465

Query: 377  AMNLEDELEETE-IEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPP 435
              + + E+EE +   G      TL   +    +++QVT   V +       +  EW  P 
Sbjct: 466  RFSSDGEVEEMDHFLGLSLSESTLLATNLPGGRILQVTEQRVLIAEIEGGMVIYEWTPPN 525

Query: 436  GYSVNVATANASQVLLATGGGHLVYLEIGDGI-LTEVKHAQLEYEISCLDINPIGENPSY 494
             + +  A+AN   ++L  GG  +  L I + + +   K    + +IS + +     +P+ 
Sbjct: 526  QFIITAASANDDSIVLVAGGELVTVLNITNEVQVVTQKDFGADSQISGVTVP---SSPTG 582

Query: 495  SQIAAVGMWTDISVRIFSLPDLNLI--TKEHLGGEIIPRSVLLCAF--EGISYLLCALGD 550
              IA       +SV    L DL+ +  T   + GE  PRSVL+     +    L  ++ D
Sbjct: 583  VCIAGFPQLAKVSV--LKLQDLSELHTTSVGMAGEAFPRSVLVANVLTDSPPTLFVSMAD 640

Query: 551  GHLLNFLLNMKTGELTDRKKVSLGT-QPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLL 609
            G ++ +  N     LT   K+ LG+ QP   +        +VFA  + P++IY S  +++
Sbjct: 641  GSVITYSFNTNNFSLTGMTKLILGSEQPTFKKLPRGSGLFNVFATCENPSMIYGSEGRII 700

Query: 610  YSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQ 669
            YS VN +  S +C FNS A+P+S+A+A   +L I  +D  +   I+++P+G   RR+ + 
Sbjct: 701  YSAVNSEGASRICHFNSEAYPESIAVATSHDLKIALVDKERTTQIQTLPIGATVRRVAYS 760

Query: 670  EQSRTFAICSLKNQ--SCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFS-- 725
               + F I +++ +     E  +  FV L D+  F  +  + L   E   S++   F   
Sbjct: 761  PSEKAFGIGTIERKLVDGTEIVKSRFV-LADEILFRRLDAFELRPEELVESVIRAEFPAG 819

Query: 726  ------DDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDG-KLQLIAEKETKGAVYSLNA 778
                  D+    + VGTAY+  E +E  +GRIL+F V++G KL  +AE   KGA  +L  
Sbjct: 820  KGANDRDEVKDRFIVGTAYLDDEGDESIRGRILIFEVDNGRKLTQVAELPVKGACRALAM 879

Query: 779  FNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSIS 838
               K++AA+ + + +YK +  + G   L+    +      + V   G+ I V DLMKS+ 
Sbjct: 880  LGDKIVAALVKTVVVYKVINNNFGAMRLEKLASYRTSTAPVDVTVTGNLIAVSDLMKSMC 939

Query: 839  LLIYKHEEGA----IEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDE 894
            L+ YK  E      + E AR +   W + V  +  D +L ++   NL  + +N+ G  ++
Sbjct: 940  LVEYKEGENGTPDTMTEVARHFQTVWATGVANIAPDTFLESDAEGNLIVLHRNTTGVEED 999

Query: 895  ERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQY 954
            ++ RLEV GE  LGE VNR R  + + +L  + V   P    GTV G I + A +  +  
Sbjct: 1000 DKRRLEVTGEISLGEMVNRIRPVN-IQQL--ASVAVTPRAFLGTVEGSIYLFAIINPDHQ 1056

Query: 955  LFLEKLQTNLRKVIKGVGGLNHEQWRSFNNE-KKTVDAKNFLDGDLIESFLDLSRTRMDE 1013
             FL +LQ  +   ++ VG +   ++R F +  ++  +   F+DG+LIE FL    +  +E
Sbjct: 1057 DFLMRLQATIAGKVELVGNIPFNEFRGFRSMVREAKEPYRFVDGELIERFLTCEPSLQEE 1116

Query: 1014 ISKTM-NVSVEELCKRVEELTRLH 1036
            I  T+  +SV+E+   +E L RLH
Sbjct: 1117 IVSTVGKMSVDEVKGMIEALRRLH 1140


>gi|296411833|ref|XP_002835634.1| hypothetical protein [Tuber melanosporum Mel28]
 gi|295629420|emb|CAZ79791.1| unnamed protein product [Tuber melanosporum]
          Length = 1053

 Score =  440 bits (1131), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 310/977 (31%), Positives = 491/977 (50%), Gaps = 76/977 (7%)

Query: 5    PIYGRIATLELFRPHGEAQDFLFIATER--YKFCVLQWDAESSELITRAMGDVSDRIGRP 62
            PIYGR+  L  FRP   A D L I   +  Y++  + WD  + +  T  M         P
Sbjct: 54   PIYGRVIALAAFRPVKSATDHLLIVIGKVNYQYFTISWDPVAKKPKTEHMAVNFSEYNAP 113

Query: 63   TDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNK--------------GQLKEAFNIRLEE 108
              +    + DP   ++G+H+Y G+F VIP   +              G +     +RL+E
Sbjct: 114  LSDSFQCLADPGKNMLGIHVYKGIFLVIPQIQQSIKGSRRSRADLDVGNIGNPCVVRLKE 173

Query: 109  LQVLDIKFLYGCAKPTIVVLYQDN-KDARHVKTYEVALKDKDFVEGPWSQNNLDNG--AD 165
            L++LD+KFL+G   P + VLY+ +  D   V TYE+++K  +     W   +L  G  A 
Sbjct: 174  LEILDLKFLFGTISPVLAVLYKPSGADEMAVNTYELSVKSGEVKLLDWRIRDLKGGREAL 233

Query: 166  LLIPVPPPLCGVLIIGEETIVYCSANAFKA-IPIRPSITKAYGRVDADGSRYLLGDHAGL 224
             LIPV PP  G+L+IG   I Y      K  +P+ P +      + +   RY+LGD AG 
Sbjct: 234  FLIPVRPPSNGLLLIGVTKIQYFDNYGNKTFLPVDPPMVWVTWEMLSP-ERYILGDEAGG 292

Query: 225  LHLLVITHE-KEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAK 283
            LH+L ++    +   GL ++L+G  SI   + +L+  ++++GS  GDSQL  L L  +  
Sbjct: 293  LHMLTLSAGLMDTKVGLHLKLVGNASIPEILVHLNQGLLFLGSHSGDSQL--LQLYQNGI 350

Query: 284  GSYVEVLERYVNLGPIVDFCVVDLE--------RQ---GQGQVVTCSGAYKDGSLRIVRN 332
               V V +   N+GPIVDF V+DL+        RQ   G  + ++ SG +  G LR +R+
Sbjct: 351  KPMVRVQQVLRNIGPIVDFRVMDLDYSRSDEVMRQYSPGHIRFLSASGGHTQGHLRTIRS 410

Query: 333  GIGINEQASV-ELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEET-EIE 390
            G+G+ +   + E+ GI+G+WSLRS     FD  LVVSFI ETRI   +   E+EE  E  
Sbjct: 411  GVGLYDLGFLGEMSGIRGLWSLRSIPGSSFDDVLVVSFIEETRIFKFDNSGEIEELYEFM 470

Query: 391  GFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQ-V 449
            GF    +T+  H  +  + +QVT+ +V+LV   S  L  E  S P  S+ +  A+A+Q +
Sbjct: 471  GFALNQRTILAHSVVGGRFLQVTATAVKLVDVRSNTLIAE--SYPDKSLTITIASANQDL 528

Query: 450  LLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVR 509
            L+   G  LV L +    L E      E EISCL++       S S I AVG WT   V 
Sbjct: 529  LIYAMGPTLVLLNLARD-LEEHIRTTFENEISCLNMPS-----SPSTICAVGFWTVSLVL 582

Query: 510  IFSLPDLNLITKEHLGGE---IIPRSVLLCAF--EGISYLLCALGDGHLLNFLLNMKTGE 564
            I S+   +++++E L  E     PRS+L       G   LL ALGDG +  F LN  T  
Sbjct: 583  ILSVQSFSILSQEILSQEDSAATPRSLLFARLLENGPPTLLVALGDGSMFTFALNETTCG 642

Query: 565  LTDRKKVSLGTQPITLRTFSSKNT-THVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCP 623
            L++RK + LG QPI  ++    N    VFA  D P+VIY S+ +++Y++V   + +++  
Sbjct: 643  LSERKHIILGAQPIRFQSIPGGNGGVTVFATCDHPSVIYGSDGRIVYASVTADKPTYVTS 702

Query: 624  FNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQ 683
            FNS +FPD++ IA E +L +  +D ++ +H++S+P+G+  RRI + ++    AI ++   
Sbjct: 703  FNSPSFPDAVVIASEDDLKLSVVDPVRTMHVQSLPVGDVVRRIAYSKEKNIIAIVTVSKV 762

Query: 684  SCAEESEMHF---VRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVY---YCVGTA 737
                  +  +   +RL+D+  F  + +Y L+  E   S+ S      + +    +  GT 
Sbjct: 763  PDTRTGDYLYSSCIRLVDNTAFSVVDSYELNQLELVESLASGKICGGNGLLSEGFLAGTV 822

Query: 738  YVLPEENEPTKGRILVF-IVEDGKLQLIAEKETKGAVYSLNAFN-GKLLAAINQKIQLYK 795
            Y     +E  KGRI+VF   E  +++LI   +T G+V  +     GK +AAI ++I LY 
Sbjct: 823  YPGGGRDESEKGRIIVFNASETKRIKLIVSYDTPGSVNGIQIVGEGKFVAAIGREIHLYS 882

Query: 796  WMLRDDGTRELQSE------------CGHHGHILALYVQTRGDFIVVGDLMKSISLLIY- 842
                D  ++   SE                 H   L V    D I V D M   S+L + 
Sbjct: 883  LKYTDIQSKTAASESVGATGCTITKLASFKAHSTPLDVAVYNDIIAVCDFMHGPSILQHI 942

Query: 843  ---KHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRL 899
               +++     E AR    +W++A+E+LD+     A+ + NL   ++   G T+++R +L
Sbjct: 943  EDKENKSSEFVEVARAPKPSWLTALELLDEKTVFCADTDGNLVVWQRQLSGVTEDDRKQL 1002

Query: 900  EVVGEYHLGEFVNRFRH 916
            + +    +GE ++R R 
Sbjct: 1003 QQIASMKIGEDIDRIRR 1019


>gi|317031116|ref|XP_001392900.2| UV-damaged DNA binding protein [Aspergillus niger CBS 513.88]
          Length = 1124

 Score =  439 bits (1130), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 329/1104 (29%), Positives = 549/1104 (49%), Gaps = 107/1104 (9%)

Query: 6    IYGRIATL-ELFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPT 63
            ++ ++  L  L  P     D LF+ T+RY +C L WD E +++ T R   D+SD   R  
Sbjct: 55   LFAKVTMLARLPAPANSPTDHLFVGTDRYTYCTLSWDGERNQIRTERNYVDISDPSSREA 114

Query: 64   DNGQIGIIDPDCRLIGLHLYDGLFKVIPF--------------------DNKGQLKEAFN 103
              G   +IDP  R + L +Y+G+  V+P                        G+L E   
Sbjct: 115  QTGNRCLIDPSGRFMTLEVYEGVIAVVPIVQLPSKKRGRQVAPPSGPDAPRVGELGEPTT 174

Query: 104  IRLEELQVLDIKFLYGCAKPTIVVLYQDNKDARHVKT----YEVALK----DKDFVEG-- 153
             R++EL               + +LY+DN+    +K     Y  A      D  F E   
Sbjct: 175  ARIDEL---------------LALLYEDNQKKVRLKVRALHYSAATASTGADAAFEESLD 219

Query: 154  PWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIRP----SITKAYGRV 209
             +SQ  LD GA  LIPVP PL G+L++GE +I Y   ++ + +  RP    +I  A+ +V
Sbjct: 220  GFSQE-LDLGASHLIPVPAPLGGLLVLGETSIKYVDTDSNEIVS-RPLDEATIFVAWEQV 277

Query: 210  DADGSRYLLGDHAG-LLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSY 268
            D+   R+LL D  G L  L+++     +V   K++ LG T+ AS + YL   V+++GS  
Sbjct: 278  DS--QRWLLADDYGRLFFLMLVLDSNNQVQSWKLDHLGNTARASVLIYLGGGVIFVGSHQ 335

Query: 269  GDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDL-----ERQ------GQGQVVT 317
            GDSQ++++       GS +EV++   N+ PI+DF ++DL     E Q      GQ ++VT
Sbjct: 336  GDSQVLRI-----GNGS-LEVIQTLSNIAPILDFTIMDLGNRTSESQTHEFSSGQARIVT 389

Query: 318  CSGAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLRSSTDDPFDTFLVVSFISETRIL 376
             SGA+ DG+LR VR+G+G+ E   + E++ I  ++ L+ +T   +   L+V+F+ ETR+ 
Sbjct: 390  GSGAFDDGTLRSVRSGVGMEELGVLGEMELITDLFGLQLATKGGYLDTLLVTFVDETRVF 449

Query: 377  AMNLEDELEETE-IEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPP 435
              N + E+EE +   G      TL   +    +++QVT   V +       + NEW  P 
Sbjct: 450  QFNFDGEVEELDSFLGLSLSENTLLAMNLPGGRILQVTEQRVLIADIEGGMVTNEWTPPD 509

Query: 436  GYSVNVATANASQVLLATGGGHLVYLEIGDGI-LTEVKHAQLEYEISCLDINPIGENPSY 494
               +  A+AN   ++L +GG  +  L+I + + +   K    + +IS + + P+    S 
Sbjct: 510  NLVITSASANNDSIVLVSGGQLMTVLDINNDVRVISQKDFGADSQISGVTV-PL----SS 564

Query: 495  SQIAAVGMWTDISVRIFSLPDLNLITKEHLG--GEIIPRSVLLCAF--EGISYLLCALGD 550
            + +  VG      V +  L  L+ +    LG  GE  PRSVL+        S L  ++ D
Sbjct: 565  AGVCIVGFPQLAKVSVLDLGRLSELHTTSLGPAGEAFPRSVLVADVLANSPSTLFISMAD 624

Query: 551  GHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTH-VFAASDRPTVIYSSNKKLL 609
            G ++ +  + +   LT   ++ LG++  T +     +  + VFA  + P++IY S  +++
Sbjct: 625  GSVITYSFDARNYSLTGMNRLILGSEQPTFKKLPRGDGLYNVFATCENPSLIYGSEGRII 684

Query: 610  YSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQ 669
            YS VN +  S +C FNS A+P S+A+A   +L I  +D  +   I+++P+GE  RR+ + 
Sbjct: 685  YSAVNSEGASRVCHFNSEAYPGSIAVATRHDLKIALVDKERTTQIQTLPMGETARRVAYS 744

Query: 670  EQSRTFAICSL--KNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFS-- 725
               + F I ++  K +  AE  +  FV L D+  F  +  + L   E   S++   FS  
Sbjct: 745  PSEKAFGIGTIERKLKDGAEMVQSRFV-LADEIMFRRLDAFDLRPEELVESVIRAEFSAG 803

Query: 726  ---DDSNVY---YCVGTAYVLPEENEPTKGRILVFIVEDG-KLQLIAEKETKGAVYSLNA 778
               +  +V+   + VGTAY+  E  E  +GRILVF +++G KL  +AE   KGA  +L  
Sbjct: 804  KDENGRDVFKDRFVVGTAYLDDENEESIRGRILVFEIDNGRKLTKVAELPVKGACRALAM 863

Query: 779  FNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSIS 838
               K++AA+ + + +Y  +  D G  +L+    +      + V   G+ I V DLMKS+ 
Sbjct: 864  LGEKIVAALVKTVVIYGVVNNDFGAMKLEKLASYRTSTAPVDVTVTGNVIAVADLMKSVC 923

Query: 839  LLIYKHEEG----AIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDE 894
            L+ Y   E     ++ E AR +   W + V  +  D +L  +   NL  +R+N  G  ++
Sbjct: 924  LVEYSEGENGSPDSLTEVARHFQTVWATGVSCIAKDTFLETDAEGNLIVLRRNLTGVEED 983

Query: 895  ERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQY 954
            ++ RLEV GE  LGE VNR R  ++      + V   P    GTV G I + A +  E  
Sbjct: 984  DKRRLEVTGEISLGEMVNRIRPVNIQQL---ASVTVTPRAFLGTVEGSIYLFAIINPEHQ 1040

Query: 955  LFLEKLQTNLRKVIKGVGGLNHEQWRSFNNE-KKTVDAKNFLDGDLIESFLDLSRTRMDE 1013
             FL +LQ  +   ++ +G +   ++R F +  ++  +   F+DG+LIE FL    +  +E
Sbjct: 1041 DFLMRLQATMAGKVESLGNIPFNEFRGFRSMVREAKEPYRFVDGELIERFLTCEPSLQEE 1100

Query: 1014 ISKTMN-VSVEELCKRVEELTRLH 1036
            I  ++  ++V+E+   +E L RLH
Sbjct: 1101 IVDSVGMMNVDEVKIMIEALRRLH 1124


>gi|119471789|ref|XP_001258220.1| UV-damaged DNA binding protein, putative [Neosartorya fischeri NRRL
            181]
 gi|119406372|gb|EAW16323.1| UV-damaged DNA binding protein, putative [Neosartorya fischeri NRRL
            181]
          Length = 1140

 Score =  439 bits (1130), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 335/1104 (30%), Positives = 545/1104 (49%), Gaps = 91/1104 (8%)

Query: 6    IYGRIATL-ELFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPT 63
            IY R+  L  L  P     D LF+ T+RY +  L WD+    + T R   D+SD   R +
Sbjct: 55   IYARVTMLARLPAPANSPTDHLFVGTDRYTYFTLSWDSSEKRVRTERDYVDMSDPSSRES 114

Query: 64   DNGQIGIIDPDCRLIGLHLYDGLFKVIPF--------------------DNKGQLKEAFN 103
              G   +IDP  R + L +Y+G+  VIP                        G+L E   
Sbjct: 115  QTGNRCLIDPSGRFMTLEIYEGVIAVIPIIQLPSKKRGRAVALPSGPDAPRVGELGEPTT 174

Query: 104  IRLEELQVLDIKFLY-GCAKPTIVVLYQDN-KDAR--------HVKTYEVALKDKDFVEG 153
             R+EEL V    FL+     P + +LY+DN K  R        H  T   +  D  F E 
Sbjct: 175  ARIEELFVRSSAFLHVQEGLPRLALLYEDNQKKVRLRVRELQYHAATGSNSTADATFGEP 234

Query: 154  PWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIRP----SITKAYGRV 209
                 +L+ G+  LIPVP PL G+LI+GE +I Y  A+  + I  RP    +I  A+ +V
Sbjct: 235  ADFTQDLELGSSHLIPVPAPLGGLLILGEMSIKYVDADNNEIIS-RPLDEATIFVAWEQV 293

Query: 210  DADGSRYLLGDHAG-LLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSY 268
            D+   R+LL D  G L  L+++     +V   K++ LG TS AS + YL   V+++GS  
Sbjct: 294  DS--RRWLLADDYGRLFFLMLVLDSDSQVESWKLDHLGNTSRASVLVYLGGGVLFLGSHQ 351

Query: 269  GDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDL-----ERQ------GQGQVVT 317
            GDSQ+++++  P      +EV++   N+ PI+DF ++DL     E Q      GQ ++VT
Sbjct: 352  GDSQVLRISNGP------LEVVQTLSNIAPILDFTIMDLGNRSSESQTHEFSSGQARIVT 405

Query: 318  CSGAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLRSSTDDPFDTFLVVSFISETRIL 376
             SGA+ DG+LR VR+G+G+ E   + +++ I  +W L+  +   F   L+V+F+ ETR+ 
Sbjct: 406  GSGAFDDGTLRSVRSGVGMEELGVLGDMEHITDLWGLQVGSIGDFLDTLLVTFVDETRVF 465

Query: 377  AMNLEDELEETE-IEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPP 435
              + + E+EE +   G      TL   +    +++QVT   V +       +  EW  P 
Sbjct: 466  RFSSDGEVEEMDHFLGLSLSESTLLATNLPGGRILQVTEQRVLIAEVEGGMVIYEWTPPN 525

Query: 436  GYSVNVATANASQVLLATGGGHLVYLEIGDGI-LTEVKHAQLEYEISCLDINPIGENPSY 494
               +  A+AN   ++L  GG  +  L I + + +   K    + +IS + +     +P+ 
Sbjct: 526  QLIITAASANDDSIVLVAGGELVTVLNITNEVQIVTQKDFGADSQISGVTVP---SSPTG 582

Query: 495  SQIAAVGMWTDISVRIFSLPDLNLI--TKEHLGGEIIPRSVLLCAF--EGISYLLCALGD 550
              IA       +SV    L DL+ +  T   L GE  PRSVL+     +    L  ++ D
Sbjct: 583  VCIAGFPQLAKVSV--LKLQDLSELHTTSVGLAGEAFPRSVLVANVLTDSPPTLFVSMAD 640

Query: 551  GHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSK-NTTHVFAASDRPTVIYSSNKKLL 609
            G ++ +  N     LT   K+ LG++  T +         +VFA  + P++IY S  +++
Sbjct: 641  GSVITYSFNTDDYSLTGMTKLILGSEQPTFKKLPRGIGLFNVFATCENPSMIYGSEGRII 700

Query: 610  YSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQ 669
            YS VN +  S +C FNS A+P+S+A+A   +L I  +D  +   I+++P+G   RR+ + 
Sbjct: 701  YSAVNSEGASRICHFNSEAYPESIAVATSHDLKIALVDKERTTQIQTLPIGATVRRVAYS 760

Query: 670  EQSRTFAICSLKNQ--SCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFS-- 725
               + F I +++ +    AE  +  FV L D+  F  +  + L   E   S++   F   
Sbjct: 761  PSEKAFGIGTIERKLVDGAEIVKSRFV-LADEILFRRLDAFELRPEELVESVIRAEFPAG 819

Query: 726  ------DDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDG-KLQLIAEKETKGAVYSLNA 778
                  D+    + VGTAY+  E +E  +GRIL+F V++G KL  +AE   KGA  +L  
Sbjct: 820  KGANDRDEVKDRFIVGTAYLDDEGDESIRGRILIFEVDNGRKLTQVAELPVKGACRALAM 879

Query: 779  FNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSIS 838
               K++AA+ + + +Y+ +  + G   L+    +      + V   G+ I V DLMKS+ 
Sbjct: 880  LGDKIVAALVKTVVVYRVINNNFGAMRLEKLASYRTSTAPVDVTVTGNLIAVSDLMKSMC 939

Query: 839  LLIYKHEEGA----IEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDE 894
            L+ YK  E      + E AR +   W + V  +  D +L ++   NL  + +N+ G  ++
Sbjct: 940  LVEYKEGENGTPDTMTEVARHFQTVWATGVANIAPDTFLESDAEGNLIVLHRNTTGVEED 999

Query: 895  ERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQY 954
            ++ RLEV GE  LGE VNR R  + + +L  + V   P    GTV G I + A +  +  
Sbjct: 1000 DKRRLEVTGEISLGEMVNRIRPVN-IQQL--ASVAVTPRAFLGTVEGSIYLFAIINPDHQ 1056

Query: 955  LFLEKLQTNLRKVIKGVGGLNHEQWRSFNNE-KKTVDAKNFLDGDLIESFLDLSRTRMDE 1013
             FL +LQ  +   ++ VG +   ++R F +  ++  +   F+DG+LIE FL    +  +E
Sbjct: 1057 DFLMRLQATIAGKVELVGNMPLNEFRGFRSMVREAKEPYRFVDGELIERFLTCEPSLQEE 1116

Query: 1014 ISKTM-NVSVEELCKRVEELTRLH 1036
            I  T+  +SV+E+   +E L RLH
Sbjct: 1117 IVSTVGKMSVDEVKGMIEALRRLH 1140


>gi|212539802|ref|XP_002150056.1| UV-damaged DNA binding protein, putative [Talaromyces marneffei ATCC
            18224]
 gi|210067355|gb|EEA21447.1| UV-damaged DNA binding protein, putative [Talaromyces marneffei ATCC
            18224]
          Length = 1139

 Score =  438 bits (1127), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 326/1103 (29%), Positives = 551/1103 (49%), Gaps = 90/1103 (8%)

Query: 6    IYGRIATL-ELFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPT 63
            +Y +++ L  L  P     D LF+ T++  +  L+WD+E   + T R+  D++D+  R  
Sbjct: 55   LYAKVSMLARLPAPANSTTDHLFVGTDQNTYFTLKWDSEKRRVRTERSYVDLADKASRAC 114

Query: 64   DNGQIGIIDPDCRLIGLHLYDGLFKVIP------------------FDNK-------GQL 98
             NG   +IDP  + + L +++G+  V+P                  + N        G+L
Sbjct: 115  QNGDRCLIDPSGKYMTLEIFEGIITVLPIIQLHKKRGKPPVLKTSHYSNPDEPTPQIGEL 174

Query: 99   KEAFNIRLEELQVLDIKFLY--GCAKPTIVVLYQDNKDARHVKTYEVALK---DKDFVEG 153
             E    R++EL V    FL+    A P + +L++DN+    +K  E++ +   +  F E 
Sbjct: 175  GEPMMTRIDELMVRSSAFLHVESKAHPRLALLHEDNQRKVKLKIRELSFEASAEPVFQET 234

Query: 154  PWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYC--SANAFKAIPIRPSITKAYGRVDA 211
                  LD GA  LIPVP PL G+L++GE  I Y   + N   + P+  + T     V  
Sbjct: 235  EDFTEELDLGASHLIPVPAPLGGLLVLGETCIKYIDDAKNETISNPLDEA-TIFVAWVQV 293

Query: 212  DGSRYLLGDHAG-LLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGD 270
            DG R+LL D  G L  L+++   + +V G K++ LGE S AS + YL   + +IGS  GD
Sbjct: 294  DGQRWLLADDYGRLFFLMLVLDSQNEVEGWKLDYLGEASRASVLIYLGAGMTFIGSHQGD 353

Query: 271  SQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQ-----------GQGQVVTCS 319
            SQ+I++     ++GS+ E+++   N+ PI+DF ++DL  +           GQ ++VT S
Sbjct: 354  SQVIRI-----SEGSF-EIIQTISNIAPILDFTIMDLGAREGENYTHEFSSGQARIVTGS 407

Query: 320  GAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAM 378
            GA+ DG+LR VR+G+G++E   + E++ I  MW+L+ S+   F   LVV+F++ETR+   
Sbjct: 408  GAFNDGTLRSVRSGVGMDELGVLGEMEHITDMWALQISSPGDFSDTLVVTFVNETRVFRF 467

Query: 379  NLEDELEE-TEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGY 437
            + + E+EE  E  G      TL   +    +++ VT   V +  + S  + ++W SP G+
Sbjct: 468  SSDGEVEELDEFLGLNLAENTLLSSNLPGGRIIHVTESGVSIADTDSGMVTSKW-SPDGH 526

Query: 438  SVNVATANASQVLLATGGGHLVYLEI-GDGILTEVKHAQLEYEISCLDINPIGENPSYSQ 496
            ++     N  ++++ TGG  +  L+I GD  +   K    + ++S + I P     + SQ
Sbjct: 527  TITSVACNDERLVVVTGGQIIATLDITGDLKVLSQKDFGKDNQVSGVTIPP-----APSQ 581

Query: 497  IAAVGMWTDISVRIFSLPDLNLITKEHLG--GEIIPRSVLLCAF--EGISYLLCALGDGH 552
            +       +  V +  L     +  + LG   E  PR+VLL     +  S L  ++ DG 
Sbjct: 582  VCIAAFPQNALVAVLDLHRFEELHSQSLGVASEAFPRTVLLAEILADSSSTLFVSMADGS 641

Query: 553  LLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTH-VFAASDRPTVIYSSNKKLLYS 611
            ++ F  + ++  LT   K+ LG++  T +     +  + VF   + P++IY+S  +++YS
Sbjct: 642  VVTFSYDAESHSLTGTNKLILGSEQPTFKKLPRGDGLYNVFTTCEHPSLIYASEGRIIYS 701

Query: 612  NVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQ 671
             VN +  S +C FN+ A+PD++A+A   +L I  +D  +   I+++P+G   RR+ +   
Sbjct: 702  AVNSEGASRVCHFNTEAYPDAIAVATSRDLKIALVDKERTTQIQTLPIGATVRRVAYSPT 761

Query: 672  SRTFAICSLKN--QSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSN 729
             + F I ++K   +  AE  E  FV L D+  F  + ++ L   E   S++        +
Sbjct: 762  EKAFGIGTIKRRLEDGAEIVESQFV-LADEIMFRKLDSFDLKPDELVESVIRAQMVVGKD 820

Query: 730  VY--------YCVGTAYVLPEENEPTKGRILVFIVEDG-KLQLIAEKETKGAVYSLNAFN 780
             Y        + VGTAY+  E  E  +GRIL+F V+   KL L  E   KGA  +L    
Sbjct: 821  RYDEPIYKDRFIVGTAYLDDETAESIRGRILLFEVDSNRKLSLFLEHPVKGACRALAMMG 880

Query: 781  GKLLAAINQKIQLYKWMLR-DDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISL 839
              ++AA+ + + +++   +   G   LQ    +      + +      IVV DLMKSIS+
Sbjct: 881  DYIVAALVKTVVIFEVTGQPQTGKYSLQKAAVYRTSTAPVDIAVTDKTIVVADLMKSISI 940

Query: 840  LIYKHEEG---AIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEER 896
            +     +      +E AR +   W +AV  +  + +L ++   NL  +R+N +G T+E+R
Sbjct: 941  VESNKTDALTMEAKEVARHFATVWTTAVADIGSNQWLVSDAEGNLIVLRRNVDGMTEEDR 1000

Query: 897  GRLEVVGEYHLGEFVNRFRHGSLVMRLPD-SDVGQIPTVIFGTVNGVIGVIASLPHEQYL 955
             RLEV  E  LGE VNR R     + +P  S +   P    GTV G I + A +  E   
Sbjct: 1001 RRLEVTSELLLGEMVNRIRP----VNIPQTSTMAVTPKAFLGTVEGSIYLFALINPEHQD 1056

Query: 956  FLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAK-NFLDGDLIESFLDLSRTRMDEI 1014
            FL +LQT +   +   G +   ++R+F +  +  +    F+DG+LIE FLD  R   +EI
Sbjct: 1057 FLMRLQTAISAYVDSPGLMPFNKFRAFRSTVREAEEPFRFVDGELIERFLDCDRAVQEEI 1116

Query: 1015 SKTMNVS-VEELCKRVEELTRLH 1036
               +    +E + K +E L RLH
Sbjct: 1117 LGVVGSGDLESVQKMIEALRRLH 1139


>gi|358366432|dbj|GAA83053.1| UV-damaged DNA binding protein [Aspergillus kawachii IFO 4308]
          Length = 1643

 Score =  436 bits (1121), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 326/1081 (30%), Positives = 538/1081 (49%), Gaps = 91/1081 (8%)

Query: 18   PHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPTDNGQIGIIDPDCR 76
            P     D LF+ T+RY +C L WD E +++ T R   D+SD   R    G   +IDP  R
Sbjct: 8    PANSPTDHLFVGTDRYTYCTLSWDGERNQIRTERNYVDISDPSSREAQTGNRCLIDPSGR 67

Query: 77   LIGLHLYDGLFKVIPF--------------------DNKGQLKEAFNIRLEELQVLDIKF 116
             + L +Y+G+  V+P                        G+L E    R++EL V    F
Sbjct: 68   FMTLEVYEGVIAVVPIVQLPSKKRGRQVAPPSGPDAPRVGELGEPTTARIDELFVRSSAF 127

Query: 117  LY-GCAKPTIVVLYQDNKDARHVKT----YEVALK----DKDFVEG--PWSQNNLDNGAD 165
            L+     P + +LY+DN+    +K     Y  A      D  F E    +SQ  LD GA 
Sbjct: 128  LHVQSGPPRLALLYEDNQKKVRLKVRALHYSAATSSTGADAAFEESLDGFSQE-LDLGAS 186

Query: 166  LLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIRP----SITKAYGRVDADGSRYLLGDH 221
             LIPVP PL G+L++GE +I Y   ++   I  RP    +I  A+ +VD+   R+LL D 
Sbjct: 187  HLIPVPAPLGGLLVLGETSIKYVDTDS-NEIVSRPLDEATIFVAWEQVDSQ--RWLLADD 243

Query: 222  AG-LLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQP 280
             G L  L+++     +V   K++ LG T+ AS + YL   V+++GS  GDSQ++++    
Sbjct: 244  YGRLFFLMLVLDSNNQVQSWKLDHLGNTARASVLIYLGGGVIFVGSHQGDSQVLRI---- 299

Query: 281  DAKGSYVEVLERYVNLGPIVDFCVVDL-----ERQ------GQGQVVTCSGAYKDGSLRI 329
               GS +EV++   N+ PI+DF ++DL     E Q      GQ ++VT SGA+ DG+LR 
Sbjct: 300  -GNGS-LEVIQTLSNIAPILDFTIMDLGNRTSESQTHEFSSGQARIVTGSGAFDDGTLRS 357

Query: 330  VRNGIGINEQASV-ELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETE 388
            VR+G+G+ E   + E++ I  ++ L+ +T   +   L+V+F+ ETR+   N + E+EE +
Sbjct: 358  VRSGVGMEELGVLGEMELITDLFGLQLATKGGYLDTLLVTFVDETRVFQFNFDGEVEELD 417

Query: 389  -IEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANAS 447
               G      TL   +    +++QVT   V +       + NEW  P    +  A+AN  
Sbjct: 418  SFLGLSLSENTLLAMNLPGGRILQVTEQRVLIADIEGGMVTNEWTPPDRLVITSASANND 477

Query: 448  QVLLATGGGHLVYLEIGDGI-LTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDI 506
             ++L +GG  L  L+I + + +   K    + +IS + + P+    S + +  VG     
Sbjct: 478  SIVLVSGGQILTVLDINNDVRVVAQKDFGADSQISGVTV-PL----SSAGVCIVGFPQLA 532

Query: 507  SVRIFSLPDLNLITKEHLG--GEIIPRSVLLCAF--EGISYLLCALGDGHLLNFLLNMKT 562
             V +  L  L+ +    LG  GE  PRSVL+        S L  ++ DG ++ +  +   
Sbjct: 533  KVSVLDLGRLSELHTTSLGPAGEAFPRSVLVADVLANSPSTLFISMADGSVITYAFDASD 592

Query: 563  GELTDRKKVSLGTQPITLRTFSSKNTTH-VFAASDRPTVIYSSNKKLLYSNVNLKEVSHM 621
              LT   ++ LG++  T +     +  + VFA  + P++IY S  +++YS VN +  S +
Sbjct: 593  YSLTGMNRLILGSEQPTFKKLPRGDGLYNVFATCENPSLIYGSEGRIIYSAVNSEGASRV 652

Query: 622  CPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSL- 680
            C FNS A+P S+A+A + +L I  +D  +   I+++P+GE  RR+ +    + F I ++ 
Sbjct: 653  CHFNSEAYPGSIAVATKHDLKIALVDKERTTQIQTLPMGETARRVAYSPSEKAFGIGTIE 712

Query: 681  -KNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFS-----DDSNVY--- 731
             K +  AE  +  FV L D+  F  +  + L   E   S++   FS     +  +V+   
Sbjct: 713  RKLKDGAEMVQSRFV-LADEIMFRRLDAFDLRPEELVESVIRAEFSAGKDENGRDVFKDR 771

Query: 732  YCVGTAYVLPEENEPTKGRILVFIVEDG-KLQLIAEKETKGAVYSLNAFNGKLLAAINQK 790
            + VGTAY+  E  E  +GRILVF +++G KL  +AE   KGA  +L     K++AA+ + 
Sbjct: 772  FVVGTAYLDDENEESIRGRILVFEIDNGRKLTKVAELPVKGACRALAMLGEKIVAALVKT 831

Query: 791  IQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEG--- 847
            + +Y  +  D G  +L+    +      + V   G+ I + DLMKS+ L+ Y   E    
Sbjct: 832  VVIYGVVNNDFGAMKLEKLASYRTSTAPVDVTVTGNVIAIADLMKSVCLVEYSEGENGMP 891

Query: 848  -AIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYH 906
             ++ E AR +   W + V  +  D +L  +   NL  +R+N  G  ++++ RLEV GE  
Sbjct: 892  DSLTEVARHFQTVWATGVVCIAKDTFLETDAEGNLIVLRRNLTGVEEDDKRRLEVTGEIS 951

Query: 907  LGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRK 966
            LGE VNR R  ++      + V   P    GTV G I + A +  E   FL +LQ  +  
Sbjct: 952  LGEMVNRIRPVNIQQL---ASVTVTPRAFLGTVEGSIYLFAIINPEHQDFLMRLQATMAG 1008

Query: 967  VIKGVGGLNHEQWRSFNNE-KKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEEL 1025
             ++ +G +   ++R F +  ++  +   F+DG+LIE FL    +  +EI  ++ +  +  
Sbjct: 1009 KVESLGNIPFNEFRGFRSMVREAKEPYRFVDGELIERFLTCEPSLQEEIVDSVGMMNKGS 1068

Query: 1026 C 1026
            C
Sbjct: 1069 C 1069


>gi|121699866|ref|XP_001268198.1| UV-damaged DNA binding protein, putative [Aspergillus clavatus NRRL
            1]
 gi|119396340|gb|EAW06772.1| UV-damaged DNA binding protein, putative [Aspergillus clavatus NRRL
            1]
          Length = 1140

 Score =  435 bits (1119), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 331/1103 (30%), Positives = 544/1103 (49%), Gaps = 91/1103 (8%)

Query: 7    YGRIATL-ELFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPTD 64
            Y R+  L  L  P     D LF+ T+RY +  L WD+  +++ T R   D+SD   R + 
Sbjct: 56   YARVTMLARLPAPANSPTDHLFVGTDRYTYFTLSWDSSKNQVRTEREYVDISDPSSRESQ 115

Query: 65   NGQIGIIDPDCRLIGLHLYDGLFKVIPF--------------------DNKGQLKEAFNI 104
             G   +IDP  R + L +Y+G+  V+P                        G+L E    
Sbjct: 116  TGNRCLIDPSGRFMTLEIYEGVIAVVPIIQLPSKKRGRNVALPSGPDAPRVGELGEPATA 175

Query: 105  RLEELQVLDIKFLY-GCAKPTIVVLYQDN-KDAR--------HVKTYEVALKDKDFVEGP 154
            R+EEL V    FL+     P + +LY+DN K  R        H  T    + D  F E  
Sbjct: 176  RIEELFVRSSAFLHVQEGLPRLALLYEDNQKKVRLRVRELQYHAATGSTNIADATFGEPA 235

Query: 155  WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIRP----SITKAYGRVD 210
                 L+ G+  LIPVP PL G+LI+GE +I Y  A+  + I  RP    +I  A+ +VD
Sbjct: 236  DFTQELELGSSHLIPVPAPLGGLLILGETSIKYVDADNNEIIS-RPLDEATIFVAWEQVD 294

Query: 211  ADGSRYLLGDHAG-LLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYG 269
            +   R+LL D  G L  L+++     +V   K++LLG+TS AS + YL   V+++GS  G
Sbjct: 295  S--QRWLLADDYGRLFFLMLVLDSDNQVESWKLDLLGKTSRASVLVYLGGGVLFLGSHQG 352

Query: 270  DSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDL-----ERQ------GQGQVVTC 318
            DSQ++++     + GS V+V++   N+ PI+DF ++DL     E Q      GQ ++VT 
Sbjct: 353  DSQVLRI-----SNGS-VDVVQTLPNIAPILDFTIMDLGSRTSESQTHEFSSGQARIVTG 406

Query: 319  SGAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILA 377
            SGA+ DG+LR VR+G+G+ E   + E+  I  +W L+  +   F   L+V+F+ ETR+  
Sbjct: 407  SGAFDDGTLRSVRSGVGMEELGVLGEMDHITDLWGLQVGSKGDFLDILLVTFVDETRVFH 466

Query: 378  MNLEDELEETE-IEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPG 436
             + + E+EE +   G      TL   +    +++QVT   V +       +  EW  P  
Sbjct: 467  FSPDGEVEEMDHFLGLSLSENTLLAANLPGGRILQVTERRVLIAEVEGGMVTYEWTPPNQ 526

Query: 437  YSVNVATANASQVLLATGGGHLVYLEIGDGI-LTEVKHAQLEYEISCLDINPIGENPSYS 495
              +  A+ N   ++L  GG  +  L+IG+ + L   K      +IS + +     +P+  
Sbjct: 527  LIITAASTNNDMIVLVAGGELVTVLDIGNEVRLVTQKDFGANSQISGVTVP---ASPTGV 583

Query: 496  QIAAVGMWTDISVRIFSLPDLNLITKEHLG--GEIIPRSVLLCAF--EGISYLLCALGDG 551
             IA       +SV    L DL  +    +G  GE  PRSVL+     +    L  ++ DG
Sbjct: 584  FIAGFPQLAKVSV--LKLQDLAEVQSTSVGPAGEAFPRSVLVADVLSDSPPTLFISMADG 641

Query: 552  HLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTT-HVFAASDRPTVIYSSNKKLLY 610
             ++ +  N     LT   K+ LG++  T +     +   +VFA  + P++IY S  +++Y
Sbjct: 642  SVITYSFNPDDVSLTGMTKLILGSEQPTFKKLPRGDGLFNVFATCENPSMIYGSEGRIIY 701

Query: 611  SNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQE 670
            S VN +  S +C  NS A+P S+A+A   +L I  +D  +   I+++P+G   RR+ +  
Sbjct: 702  SAVNSEGASRICHLNSEAYPGSIAVATLHDLKIALVDKERTTQIQTLPIGATVRRVAYSP 761

Query: 671  QSRTFAICSLKNQ--SCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSF---S 725
              + F I +++ +    AE  +  F+ L D+  F  +  + L   E   S++   F    
Sbjct: 762  SEKAFGIGTIERKLVDGAEIVKSEFM-LADEILFRRLDAFELRPEELVESVIRAEFPVGK 820

Query: 726  DDSNVY-----YCVGTAYVLPEENEPTKGRILVFIVEDG-KLQLIAEKETKGAVYSLNAF 779
            D+ N       + VGTA++  E +E  +GRIL+F V++G KL  +AE   KGA  +L   
Sbjct: 821  DEKNGATVKDRFIVGTAFLDDEGDESIRGRILIFEVDNGRKLTQVAELPVKGACRALAML 880

Query: 780  NGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISL 839
              +++AA+ + + +YK +  + G  +L+    +      + V   G+ I V DLMKS+ L
Sbjct: 881  GNRIVAALVKTVVVYKAVSNNFGAMKLEKLASYRTSTAPVDVTVTGNLIAVSDLMKSVCL 940

Query: 840  LIYKHEEGAIE----ERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEE 895
            + YK  E  +     E AR +   W + V  +  D +L ++   NL  + +N+ G  +++
Sbjct: 941  VEYKEGEDGLPDTLTEVARHFQTVWATGVACIAQDTFLESDAEGNLIILCRNTTGVEEDD 1000

Query: 896  RGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYL 955
            + RLEV GE  LGE VNR R  ++      + V   P     TV G I + A +  +   
Sbjct: 1001 KRRLEVTGEISLGEMVNRIRPVNIQQL---TSVAVTPRAFLATVEGSIYLFAMINPDHQD 1057

Query: 956  FLEKLQTNLRKVIKGVGGLNHEQWRSFNNE-KKTVDAKNFLDGDLIESFLDLSRTRMDEI 1014
            FL +LQ  +   ++ VG +   ++R F++  ++  +   F+DG+LIE FL    +  +EI
Sbjct: 1058 FLMRLQATIAGKVELVGNMPFNEFRGFHSMVREAQEPYRFVDGELIERFLACEPSVQEEI 1117

Query: 1015 -SKTMNVSVEELCKRVEELTRLH 1036
             S    ++V+E+   +E L RLH
Sbjct: 1118 VSIVGKMTVDEVKSMIEALRRLH 1140


>gi|406865227|gb|EKD18269.1| CPSF A subunit region [Marssonina brunnea f. sp. 'multigermtubi'
            MB_m1]
          Length = 1146

 Score =  435 bits (1118), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 336/1106 (30%), Positives = 552/1106 (49%), Gaps = 88/1106 (7%)

Query: 5    PIYGRIATLELFRPH-GEAQDFLFIATERYKFCVLQWDAESSELITRA-MGDVSDRIGRP 62
            PIYG I  L   RP    + D LF+ T R+++  L ++ E+  L TR    DVS+R    
Sbjct: 55   PIYGTITMLAKIRPEIAHSADQLFVGTSRFQYFTLAYNQETQALETRQNFVDVSERHMID 114

Query: 63   TDNGQIGIIDPDCRLIGLHLYDGLFKVI-PF-DNKGQ---LKEAFNIRLEELQVLDIKFL 117
            + +  + ++DP  + + L L++G+     P    KG+   L +   +R+ EL+V    FL
Sbjct: 115  SQSRDLVLVDPAGKYVVLELFEGILSCFKPLRPRKGREDVLDKPEQVRITELRVRATTFL 174

Query: 118  YG-CAKPTIVVLYQDNKDAR-HVKTYEVALKDKDFVEGPWSQN------NLDNGADLLIP 169
            Y    +P I +L++D       + TY +  +         S++      +LD GA  LIP
Sbjct: 175  YTETQQPKIALLFEDGSGGEVRMATYRLVDEKGTLSRFDPSKDRENEISDLDPGASHLIP 234

Query: 170  VP-----------------PPLCGVLIIGEETIVYC---SANAFKAIPIRPSITKAYGRV 209
            +P                   L GV+++GE    Y    S  AF       +I  A+ +V
Sbjct: 235  IPKNAGQKRYIVRNSTVAKAHLGGVVVVGETKFTYLDDESKAAFDYPLNDAAIFVAWVKV 294

Query: 210  DADGSRYLLGDHAGLLHLL-VITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSY 268
            D    RYLLGD  G LHLL ++++   +V G+ + LLG  S A+T+  L + V ++GS  
Sbjct: 295  D--DQRYLLGDDYGNLHLLSILSNHDGEVMGMDLMLLGTISKATTMVNLGDGVFFVGSHE 352

Query: 269  GDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDL-----ERQ------GQGQVVT 317
             +SQ+I+++L  D K  Y+ +++   N+ PI+D  V+D+     E Q      GQ ++VT
Sbjct: 353  AESQVIRVDL--DRKDHYITIIQTMQNIAPILDLAVMDMGNREGESQSNEYSTGQTRLVT 410

Query: 318  CSGAYKDGSLRIVRNGIGINEQASV--ELQGIKGMWSLRSSTDDPFDTFLVVSFISETRI 375
             SGA++ GSLR VR+G+G+ +   +  E+  I+ ++S+RS+    FD  LVVS  +ETR+
Sbjct: 411  GSGAFQSGSLRSVRSGVGLEDIGILVDEIGDIRDVYSMRSTAGTHFDDILVVSLPTETRV 470

Query: 376  LAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPP 435
                 E E E  E  G     QTL         ++QVT  SV+++      +  +W  P 
Sbjct: 471  FTFLGEIE-EVAEFRGLELNCQTLLASGLSNGMMLQVTESSVKILGPGPSYVAAKWTPPA 529

Query: 436  GYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEY--EISCLDINPIGENPS 493
            G  +  A+AN S VL++  G  LV L+IG G L EV    LE   +++C+ +       +
Sbjct: 530  GEYITDASANDSYVLVSVSGTTLVSLDIGQG-LKEVAVQPLEAADQVACVYVPR-----N 583

Query: 494  YSQIAAVGMWTDISVRIFSLPDLNLITKEHL---GGEIIPRSVLLC----AFEGISYLLC 546
             S I  VG W   S+ I +L +L +I  E L       IPR ++L     A      L  
Sbjct: 584  LSDIGVVGFWKSGSISILNLSNLEIILSEDLRRKNNASIPRHIILAQLLPAVAAGPTLFV 643

Query: 547  ALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKN-TTHVFAASDRPTVIYSSN 605
            A+ DG +L F ++  T  L+ RK + LGT+         +    +V A  + PT+IY+S 
Sbjct: 644  AMEDGVVLTFNVDKSTFSLSGRKSLVLGTEHAKFHLLPREGGLNNVLATCEHPTLIYASE 703

Query: 606  KKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRR 665
             +++YS V   +    CPFNS  +P SL +A +  L I  IDD ++ H+R++ +G+  RR
Sbjct: 704  GRIVYSAVTADDAKCACPFNSEEYPGSLVVATQRSLKISKIDDERQTHVRTVHIGKTVRR 763

Query: 666  ICHQEQSRTFAI-CSLKNQSCAEESEMHFVRLLDDQTF-EFISTYPLDT------FEYGC 717
            I +    R F I C  +  +  EE       L++D  F E  +  PLD        EY  
Sbjct: 764  IAYSGAERAFGIGCIERTMADKEERYTSTFSLVEDVKFAEVGNPVPLDDENGTELIEYII 823

Query: 718  SILSCSFSDDSNVYYCVGTAYVLPEENEPT-KGRILVFIVEDGK-LQLIAEKETKGAVYS 775
                 +  +D    + VGT+++  E  +P  KGRILVF ++  K   L+A    K A   
Sbjct: 824  RTEVLNAREDLAERFIVGTSFLDEESADPNIKGRILVFGIDPKKNPYLVASLNLKCACRR 883

Query: 776  LNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMK 835
            +   +GK++A +N+ + ++K++   +   E +         + + +    + I + D+M+
Sbjct: 884  VAMLDGKIVAVLNKTVAMFKYVEITEKAGEFKKLATFRSSTVPIDIAITENIIAITDMMQ 943

Query: 836  SISLLIYKH-EEG---AIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGA 891
            S+S++ Y   +EG    +E+ ARDY   W +AV  + D+ +L ++++ NL  +++N +G 
Sbjct: 944  SVSIVQYTPGKEGMPDKLEQVARDYQTCWGTAVTDIGDNSWLESDHHGNLLVLQRNIDGI 1003

Query: 892  TDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPH 951
            T E++ RL + GE +LGE VN  R  ++    P      +P     T  G I + +++  
Sbjct: 1004 TLEDKQRLRITGEMNLGEQVNMIRKIAID---PSPTAMVVPKAFLATTEGSIYLFSTILD 1060

Query: 952  EQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNN-EKKTVDAKNFLDGDLIESFLDLSRTR 1010
                 L +LQ N+ + +  +G L+ + +RSF + E+ T +   F+DG+LIE FLD S   
Sbjct: 1061 GSQDLLLRLQENITECVDTLGRLDFKTYRSFKSAERTTEEPYRFVDGELIERFLDESEDM 1120

Query: 1011 MDEISKTMNVSVEELCKRVEELTRLH 1036
              +I + +  +VE +   VE L RLH
Sbjct: 1121 QQQICEGLGYTVEAIRDVVENLKRLH 1146


>gi|296803967|ref|XP_002842836.1| DNA damage-binding protein 1a [Arthroderma otae CBS 113480]
 gi|238846186|gb|EEQ35848.1| DNA damage-binding protein 1a [Arthroderma otae CBS 113480]
          Length = 1143

 Score =  435 bits (1118), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 331/1113 (29%), Positives = 559/1113 (50%), Gaps = 106/1113 (9%)

Query: 6    IYGRIATLE-LFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPT 63
            +YGRI  L+ L RP     D LF+ T++Y +  L WDA  ++L T R   D++D   R  
Sbjct: 55   VYGRITALQKLPRPDPALTDVLFVGTDQYAYFSLTWDAVHNQLRTERKYIDLADGSLREA 114

Query: 64   DNGQIGIIDPDCRLIGLHLYDGLFKVIPF-----DNKG---------------QLKEAFN 103
             +    ++DP    + L +Y+G+  V+P       NK                QL E   
Sbjct: 115  HSDDRCLLDPSGSFLTLEVYEGVVSVVPLVSADTHNKRSRSAAAHVTPSATLEQLGEPLQ 174

Query: 104  IRLEELQVLDIKFLYGCAK--PTIVVLYQDNKDARHVKTYEVA---------------LK 146
            +R+EEL V    FL   A   P + +LY+D++    +K  ++                LK
Sbjct: 175  VRIEELMVRSSAFLDQEASHAPRLALLYEDSQGKARLKLRDLKYTHAALSGDGGSAAELK 234

Query: 147  DKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYC--SANAFKAIPIRPSITK 204
            D   + G      LD GA LLIP+P PL G+LI+GE TI Y   S N   + P+  S T 
Sbjct: 235  DVTTLSG-----ELDLGASLLIPIPRPLGGLLILGESTITYVDVSQNEIISRPLAES-TV 288

Query: 205  AYGRVDADGSRYLLGDHAG-LLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVY 263
                V  DG R+LL D  G L  L+++      V   K++LLG+TS AS + YLD  +V+
Sbjct: 289  FVAWVQVDGQRWLLADDYGRLFFLMLVLDPDNAVEAWKVDLLGQTSRASVLVYLDGGLVF 348

Query: 264  IGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQ---------GQGQ 314
            +GS  GDSQ+I++      +G + ++++   N+ PI+DF V+DL  +         GQ +
Sbjct: 349  VGSHQGDSQVIQIR-----EGGF-DLVQTIANIAPILDFTVMDLGDRSGEAREFSSGQTR 402

Query: 315  VVTCSGAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLRSSTDDPFDTFLVVSFISET 373
            +VT SGA+ DGSLR VR+G+GI +   +  ++ I  +W LR++  +PF   L+VSF++E+
Sbjct: 403  IVTGSGAFGDGSLRSVRSGVGIEDLGVLASMEHITDLWGLRAACPEPFSDTLLVSFVNES 462

Query: 374  RILAMNLEDELEETEIEGFCS---QTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNE 430
            R+   + E ++EE E EGF        TL   +   N+++QVT    +++   S      
Sbjct: 463  RVFHFSPEGDVEEKE-EGFLGLVFSQSTLLAANLPGNRIIQVTENMAKIIDLDSS--MTT 519

Query: 431  WKSP-PGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDINPIG 489
            W+S     ++  A+AN   ++L  GG  L+ + +        K  + + ++S + I    
Sbjct: 520  WQSSHEDSAITSASANDDYLVLVFGGIRLICVSLSSYEEVGSKDFEADNQVSGMTIPA-- 577

Query: 490  ENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLG--GEIIPRSVLLCA--FEGISYLL 545
               S +Q   V +     + I  LPDL +  K+ LG  GE IPRSV++    ++    L 
Sbjct: 578  ---SPAQACIVCLPQSAEIVILDLPDLKVQNKQTLGEPGEAIPRSVIVAEILYDQSPTLF 634

Query: 546  CALGDGHLLNFLLNMKTGELTDRKKVSLGT-QPITLRTFSSKNTTHVFAASDRPTVIYSS 604
             ++ DG + +F  +++   +++  K++LG+ QP+  +        +VFA  D P++IY+S
Sbjct: 635  VSMADGTVFSFSFSLEAFTISNSSKITLGSEQPLFKKLPRGNGQYNVFATCDHPSLIYAS 694

Query: 605  NKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPR 664
              +++YS V+    S +C  N+ A+P S+A++ + EL I  +D+ +   I ++P+    R
Sbjct: 695  EGRIVYSAVDSDSASRICNLNTQAYPGSIALSNQRELKIAIVDEERTTQIHTLPMHASVR 754

Query: 665  RICHQEQSRTFAICSLKN--QSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSC 722
            R+ +    + F + ++     +  E+    FV L D+  F  +ST+ L + E   S++  
Sbjct: 755  RLAYSPVEKAFGLGTVTRTISNGVEKVSSSFV-LADEILFRPLSTHELRSDELVESVIRS 813

Query: 723  SFSDDSN-----VY---YCVGTAYVLPEENEPTKGRILVFIVEDGK-LQLIAEKETKGAV 773
               D  +     V+   + VGTA++    ++  +GRIL F V   + L L+ +K   GA 
Sbjct: 814  QIPDGEDEVGNTVFRDLFFVGTAFLDDVGDDNVRGRILAFEVNRSRELALLVDKPVLGAC 873

Query: 774  YSLNAF-NGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGD 832
             +L    N KL+A + + + +++ +    G  EL  +  +      + V    + I V D
Sbjct: 874  RTLAVMDNDKLVAGLVKSVSIFRIVRDSFGNIELLKQTAYRTSTAPIDVSVTENTIAVAD 933

Query: 833  LMKSISLLIY-KHEEGAI----EERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKN 887
            +MKS+SL+ Y   EEGA+    EE AR Y   W +AV  +++++YL AE   NL  +++N
Sbjct: 934  VMKSVSLVQYTPAEEGALEPKFEEIARHYQTLWSTAVGHIEENVYLLAEAEGNLVVLQRN 993

Query: 888  SEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIA 947
            + G T+ +R RL+   E  LGE VNR     + ++ P +        +  TV+G I +  
Sbjct: 994  TTGVTESDRKRLQPTSEMRLGEMVNRIH--PITVQAP-AKAAVSARALLATVDGSIYLFG 1050

Query: 948  SLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAK-NFLDGDLIESFLDL 1006
             +       L +LQ  +  V    G +   ++R+F    +  D    F+DG+LIE FL  
Sbjct: 1051 LINPTYIDLLLRLQAIMASVTVSPGEIPFTKYRAFRTTVRQSDEPFRFVDGELIERFLGC 1110

Query: 1007 SRTRMDEISKTM---NVSVEELCKRVEELTRLH 1036
            + +  +EI+  +   N++V  L + ++EL R+H
Sbjct: 1111 APSTQEEIASRLDDQNITVASLKEMIDELRRMH 1143


>gi|440639387|gb|ELR09306.1| hypothetical protein GMDG_03874 [Geomyces destructans 20631-21]
          Length = 1138

 Score =  434 bits (1117), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 338/1103 (30%), Positives = 553/1103 (50%), Gaps = 91/1103 (8%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPTD 64
            ++ RI  L   RP       LF+ T R  +  L WDA +  L T  A  D S++  R   
Sbjct: 55   LHARITMLAAIRPPTSCTAHLFVGTTRAHYFTLAWDAATRRLETVHAFVDASEKHMRDAA 114

Query: 65   NGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFN-------IRLEELQVLDIKFL 117
             G+   +DP  R + L L++G+   +      ++  A +       IR+ EL V    FL
Sbjct: 115  GGERCAVDPSGRQLCLSLFEGVLSFVKVMKPRKVAAAGSYLDDPEQIRITELFVRATVFL 174

Query: 118  YG-CAKPTIVVLYQDNKDARHVKTYEVA-----LKDKDFVEGPWSQNNLDNGADLLIPVP 171
            +     P + +LYQD +D   + TY V          D  +    +  ++ GA  LIPVP
Sbjct: 175  HTESTSPKVALLYQDGRDRVGLATYRVTDGRGQYGGFDPRKAREDELGVEVGASHLIPVP 234

Query: 172  ------------------PPLCGVLIIGEETIVYC---SANAFKAIPIRPSITKAYGRVD 210
                                L GV+++GE  ++Y    S    + +    SI  A+    
Sbjct: 235  KGEGVQRRYVVRNNASLKAQLGGVIVVGETRMLYLDDESKATVEHVLDEASIFVAW--TA 292

Query: 211  ADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGD 270
             DG RYLLGD  G LHLL +  + E VTG+ I+     S  ST+  L++ +++IGS  GD
Sbjct: 293  YDGLRYLLGDEYGWLHLLTLVVDAEVVTGMTIKKFVRISRPSTMVCLEDDLLFIGSHDGD 352

Query: 271  SQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQ-----------GQGQVVTCS 319
            SQ++KL+L  DAK    EV++   N+ PIVDF V+D+  +           GQ ++VT S
Sbjct: 353  SQVLKLDL--DAK--VAEVVQTLDNIAPIVDFTVMDMGSRSEEARANEFSSGQARIVTGS 408

Query: 320  GAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAM 378
            GA+++GSLR VR+G+G+ +   + E+  IKG+++L+++  +  DT LV+SF +ETR+   
Sbjct: 409  GAFQEGSLRSVRSGVGLEDIGQLGEMDNIKGLYTLQTNNSEFHDT-LVISFSTETRVFRF 467

Query: 379  N-LEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGY 437
            +   +  E  E  G   +  TL   +    +++Q+T     L+ S S      W+  PG 
Sbjct: 468  DSEGEVEEVEEFLGLSFEEHTLLAANVSNGRILQITPSKALLIDSESGVAVASWQPAPGE 527

Query: 438  SVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQL--EYEISCLDINPIGENPSYS 495
             +  A+ N    LL+  G  L+ L + D  L+E+         +++C+D+ P  E P   
Sbjct: 528  IITAASTNEDYALLSADGKSLISLNL-DNDLSEIARQDFGDTDQVACVDV-PNTETP--- 582

Query: 496  QIAAVGMWTDISVRIFSLPDLNLITKEHLGGE---IIPRSVLLCAFEGISY---LLCALG 549
             I  VG+W   SV I  L  L+ I  + L  +    +PRS++L      ++   LL A+ 
Sbjct: 583  -IGLVGLWQSGSVSIIDLRTLHPIQGDTLRNDDTAAVPRSMVLAQILPKTFGPTLLVAMS 641

Query: 550  DGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFS-SKNTTHVFAASDRPTVIYSSNKKL 608
            DG + ++ +   T  LT+RK V LGTQ   LR    +    +VFA  +  ++IYSS  ++
Sbjct: 642  DGVVHSYSVTPCTFALTNRKSVVLGTQQANLRVLPRAGGLMNVFATCEHSSLIYSSEGRI 701

Query: 609  LYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICH 668
            +YS V  ++ + +CPFN+AA+PD++ +A   E+ I  ID  ++ H+R++P+GE  RR+ +
Sbjct: 702  IYSAVTAEDATFICPFNAAAYPDAIVVATASEIKISQIDTERRTHVRTLPMGETVRRVTY 761

Query: 669  QEQSRTFAICSLKNQSC-AEESEMHFVRLLDDQTF-EFISTYPLDTFEYGCSILSCSFSD 726
                + F + ++K +    EE      RL+D+  F E    + L T +    + +   + 
Sbjct: 762  SPSEKVFGLGAIKRELIDGEEVIESSFRLVDEIVFSELGKPFQLGTSQGEELVEAVIRAP 821

Query: 727  DSNVY------YCVGTAYVLPEENEPTKGRILVFIVEDGKLQL-IAEKETKGAVYSLNAF 779
              N Y      + VGT+++        +GRILVF V+  +    IAE + KGA   L   
Sbjct: 822  LPNTYGTPQERFIVGTSFLDDNPGLSYRGRILVFGVDSSRNPYKIAEYKVKGACRCLGVI 881

Query: 780  NGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISL 839
            +GK++AA+ + I ++++      +  ++    +      + +   G+ I V DLMKS+SL
Sbjct: 882  DGKIVAALVKTIVVFEYTELSGTSARIEKVASYRTSTCPVDLAIEGNTIAVADLMKSVSL 941

Query: 840  LIYKH----EEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEE 895
            + Y+     E   + E AR + + W +AV  +D+  +L A+ + NL  +R+N    T E+
Sbjct: 942  VEYRAGTSGEAPTLVEVARHFQSVWATAVAHVDEG-WLEADADGNLIVLRRNEAAVTFED 1000

Query: 896  RGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQ-IPTVIFGTVNGVIGVIASLPHEQY 954
            R ++EV GE+HLGE VNR R     +R+  S+    +P     T  G + +  S+     
Sbjct: 1001 RKKMEVTGEFHLGEQVNRIRK----IRVDASEGATVVPRAFLATTEGSLFLYGSVAPASQ 1056

Query: 955  LFLEKLQTNLRKVIKGVGGLNHEQWRSFNN-EKKTVDAKNFLDGDLIESFLDLSRTRMDE 1013
              L +LQ  L + ++  G +    +RSF N E++T +   F+DG+LIE FLDL   R + 
Sbjct: 1057 DLLLRLQQRLAENVETPGNIPFTTYRSFRNAERETEEPYRFIDGELIERFLDLDEERQEV 1116

Query: 1014 ISKTMNVSVEELCKRVEELTRLH 1036
            + K +   VEE+   VEEL R+H
Sbjct: 1117 VCKGL-AKVEEVRDLVEELRRMH 1138


>gi|115397303|ref|XP_001214243.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
 gi|114192434|gb|EAU34134.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
          Length = 1140

 Score =  434 bits (1116), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 329/1105 (29%), Positives = 545/1105 (49%), Gaps = 93/1105 (8%)

Query: 6    IYGRIATL-ELFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPT 63
            +Y R+  L  L  P     D LF+ T+RY +C L WD+  +++ T R   DVSD   R +
Sbjct: 55   LYARVTMLARLPAPANSPTDHLFVGTDRYTYCTLSWDSSQNQIRTERNYVDVSDPSSRES 114

Query: 64   DNGQIGIIDPDCRLIGLHLYDGLFKVIPF--------------------DNKGQLKEAFN 103
              G   +IDP  R + L +Y+GL  VIP                        G+L E   
Sbjct: 115  QTGNRCLIDPSGRFMTLEVYEGLVAVIPIVQLPGKKRGRAPAMPSGPDAPQVGELGELTT 174

Query: 104  IRLEELQVLDIKFLY-GCAKPTIVVLYQDNKDARHVKTYEVAL--------KDKDFVEGP 154
             R++EL V    FL+     P + +LY+DN+    +K  E+           D    +  
Sbjct: 175  ARIDELFVRSSAFLHVQSGLPRLALLYEDNQKKVRLKVRELQYTAATSSSGADATLADLA 234

Query: 155  WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPI---RPSITKAYGRVDA 211
                 LD GA  LIPVP PL G+LI+GE +I Y   +  + +       +I  A+ +VD+
Sbjct: 235  DFAQELDLGASHLIPVPAPLGGLLILGETSIKYVDDDNNEIVSRLLDEATIFVAWEQVDS 294

Query: 212  DGSRYLLGDHAGLLHLLVITHEKE-KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGD 270
               R+LL D  G L  L++  + E +V G +++ LG TS AST+ YL   V+++GS  GD
Sbjct: 295  --QRWLLADDYGRLFFLMLVLDSENQVQGWQLDHLGNTSRASTLVYLGGGVIFVGSHQGD 352

Query: 271  SQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDL-----ERQ------GQGQVVTCS 319
            SQ++++       GS+ E+++   N+ PI+DF ++DL     E Q      GQ ++VT S
Sbjct: 353  SQVLRV-----GDGSF-EIIQTLPNIAPILDFTIMDLGNRTSESQTHEFSSGQARIVTGS 406

Query: 320  GAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAM 378
            GA+ DG+LR VR+G+G+ E   + E++ I  +W L+      F   L+V+FI ETR+   
Sbjct: 407  GAFDDGTLRSVRSGVGMEELGVLGEMEHITDLWGLQFKAKGDFLDTLLVTFIDETRVFHF 466

Query: 379  NLEDELEE-TEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGY 437
            + + E+EE  +  G      TL   +    +++QVT   V +    S  +  EW      
Sbjct: 467  SSDGEVEELDQFLGLSLSENTLLAANLPGGRILQVTERRVLIADMESEMVTYEWTPSDQL 526

Query: 438  SVNVATANASQVLLATGGGHLVYLEIGDGI-LTEVKHAQLEYEISCLDI--NPIGENPSY 494
             +  A+AN   V+L  GG  +   +I + + +   K+   + +IS + +  +P G     
Sbjct: 527  IITSASANEDSVVLVAGGELMTVFDIRNNVQIVTQKNFGADSQISGVTVPSSPTG----- 581

Query: 495  SQIAAVGMWTDISVRIFSLPDLNLITKEHLG--GEIIPRSVLLC--AFEGISYLLCALGD 550
              +  VG      V +  L DL  +    LG  GE  PRSVL+     +    +  ++ D
Sbjct: 582  --VCIVGFPQSAKVSVLKLQDLTELHATSLGPEGEAFPRSVLVANVLVDSPPTIFISMAD 639

Query: 551  GHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTH-VFAASDRPTVIYSSNKKLL 609
            G ++ +  N     LT   K+ LG++  T +     +  + VFA  + P++IY S  +++
Sbjct: 640  GSVITYSFNANDYSLTGMNKLILGSEQPTFKKLPRGDGLYNVFATCENPSLIYGSEGRII 699

Query: 610  YSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQ 669
            YS VN +  S +C FNS A+P+S+A+A   EL I  +D  +   I+++P+G   RR+ + 
Sbjct: 700  YSAVNSEGASRVCHFNSEAYPESIAVATSHELKIALVDKERTTQIQTLPMGATVRRVAYS 759

Query: 670  EQSRTFAICSL--KNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFS-- 725
               + F I ++  K +  AE  +  FV L D+  F  +  + L + E   S++   F   
Sbjct: 760  PSEKAFGIGTIERKLEEGAEIVKSQFV-LADEILFRRLDAFDLRSEELVESVIRAEFPVG 818

Query: 726  ------DDSNVYYCVGTAYVLPEEN-EPTKGRILVFIVEDG-KLQLIAEKETKGAVYSLN 777
                  D     + VGTAY+  +E+ +  +GRIL+F V++G KL  +AE   KGA  +L 
Sbjct: 819  KDEKGRDMFKDRFVVGTAYLDEDEDRDSIRGRILMFEVDNGRKLTKVAELAVKGACRALA 878

Query: 778  AFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSI 837
                K++AA+ + + +YK    + G  +L+    +      + +    + I V DLMKS 
Sbjct: 879  MLGDKVVAALVKTVVIYKVTGNNFGAMKLEKLASYRTSTAPVDITVTDNVIAVSDLMKSS 938

Query: 838  SLLIY-KHEEG---AIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATD 893
             L+ Y + E+G   +++E AR +   W + +  +    YL ++   NL  +R+N  G  +
Sbjct: 939  CLVEYIEGEDGLPDSLKEVARHFQTVWATGIACIAPHTYLESDAEGNLIILRRNLSGVEE 998

Query: 894  EERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQ 953
            +++ RLEV GE  LGE VNR R  ++      + V   P    GTV G I + A +  E 
Sbjct: 999  DDKRRLEVTGEISLGEMVNRIRPVNIQQL---ASVTVTPRAFLGTVEGSIYLYAIINPEH 1055

Query: 954  YLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNE-KKTVDAKNFLDGDLIESFLDLSRTRMD 1012
              FL +LQ  +   I+ +G +   ++R F +  ++  +   F+DG+LIE FL    +  +
Sbjct: 1056 QDFLMRLQATMAGKIESLGDMPFNEFRGFRSMVREAKEPYRFVDGELIERFLTCEPSVQE 1115

Query: 1013 EISKTMN-VSVEELCKRVEELTRLH 1036
            +I  ++  ++V ++   +E L RLH
Sbjct: 1116 DIVNSVGMMNVHDVKVMIEALRRLH 1140


>gi|303313681|ref|XP_003066852.1| CPSF A subunit region family protein [Coccidioides posadasii C735
            delta SOWgp]
 gi|240106514|gb|EER24707.1| CPSF A subunit region family protein [Coccidioides posadasii C735
            delta SOWgp]
 gi|320031496|gb|EFW13458.1| UV-damaged DNA binding protein [Coccidioides posadasii str. Silveira]
          Length = 1144

 Score =  432 bits (1110), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 328/1115 (29%), Positives = 555/1115 (49%), Gaps = 109/1115 (9%)

Query: 6    IYGRIATLE-LFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPT 63
            IYG+I+ L+ + R H  A D LF+ T+RY +  L WD  + +L T +   D++D   R  
Sbjct: 55   IYGKISVLQKVPRSHSSATDLLFVGTDRYAYFTLSWDPSTCQLHTEQKYLDIADPSLRDN 114

Query: 64   DNGQIGIIDPDCRLIGLHLYDGLFKVIPF-------------------DNKGQLKEAFNI 104
             +G    +DP  + + + +Y+G+  VIP                    + +  L E    
Sbjct: 115  QSGDRSWVDPSGKFLTMEIYEGIISVIPIAQEPLKRPSPSASRSSGTSEQREYLGEPLQT 174

Query: 105  RLEELQVLDIKFLY--GCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEG-PWSQ---- 157
            R+EEL V    FL+      P I +LY++ +    +K     L+D  +  G P  +    
Sbjct: 175  RIEELIVRSTAFLHHDPTKPPRIAILYENTQGKVKLK-----LRDLIYSRGIPGGEASAA 229

Query: 158  ---------NNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIRP----SITK 204
                     ++L+ GAD+L+PVP PL GVLI+GE+ I Y      + I  RP    +I  
Sbjct: 230  EFRDVDDLYDDLELGADILVPVPLPLGGVLILGEKFIKYIDTVKNETI-TRPLENNTIFV 288

Query: 205  AYGRVDADGSRYLLGDHAG-LLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVY 263
            A+ ++D    R+LL D  G L   ++I +    V   K+ LLGETS AS + +L   VV+
Sbjct: 289  AWEQLD--NQRWLLADDYGRLFFFMLILNSANAVQSWKVGLLGETSRASALVHLGGGVVF 346

Query: 264  IGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQG----------QG 313
            +GS  GDS +I++      +GS+ E+++   N+GPI+DF V+DL  +G          Q 
Sbjct: 347  LGSHQGDSHVIRIT-----EGSF-EIIQTLSNIGPILDFTVMDLGNRGETPTHEFSSGQA 400

Query: 314  QVVTCSGAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLRSSTDDPFDTFLVVSFISE 372
            ++VT SGA++DGSLR VR+G+G+ +   +  ++ I  +W + +   + F   L++SF+ E
Sbjct: 401  RIVTGSGAFRDGSLRSVRSGVGMEDLGVLGAMEHITDLWGVSAFCPEGFCDTLLLSFVDE 460

Query: 373  TRILAMNLEDELEETE-IEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEW 431
            +R+   + + E+EE +   G      T+   +    +++QVT    R+    SR    EW
Sbjct: 461  SRVFHFSPDGEVEEKDDFLGLLLGEPTIHAANLPSRRILQVTEHGARVTDVESRMTLWEW 520

Query: 432  KSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGI-LTEVKHAQLEYEIS--CLDINPI 488
             +     +  A++N   ++L  GG  L+  +IGD I ++  K  + + ++S   L  +PI
Sbjct: 521  SAVESRKITAASSNDRHLVLMVGGQKLMVFDIGDDIKMSSTKTFEADKQVSGVALTSSPI 580

Query: 489  GENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLG--GEIIPRSVLL-CAF-EGISYL 544
                   Q   +       V I  L DLN+   E LG  G+ +PRSVL+ C F +    L
Sbjct: 581  -------QACILCFPQSAEVTIIDLTDLNIRHTETLGEPGDAVPRSVLVACMFSDRAPTL 633

Query: 545  LCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTH-VFAASDRPTVIYS 603
              A+ DG + +F LN+    L+D  K+ LG++    +     +  + +FA  D P++IY+
Sbjct: 634  FVAMADGSVFSFSLNVANYSLSDANKLVLGSEAPVFKLLPRADGLYNIFATCDHPSLIYA 693

Query: 604  SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHP 663
            S  +++YS VN  + + +C FN+ A+P ++A+A   EL I  +D  +   I+++ + E  
Sbjct: 694  SEDRIVYSAVNSDKATRICHFNAEAYPGAIAVATPDELKIALVDAERTTQIQTLMINETV 753

Query: 664  RRICHQEQSRTFAICSLKNQ--SCAEESEMHFVRLLDDQTFEFISTYPLDTFEY-GCSIL 720
            RR  +    R F + +++       EE + HF+ L D+  F  +S + L+  E   C I 
Sbjct: 754  RRTSYSSTERAFGLGTIQRTLVQNVEEVKSHFI-LADEIMFRQLSVFDLNPNELVECVIR 812

Query: 721  SCSFSDDSNV-------YYCVGTAYV-LPEENEP-TKGRILVFIVEDGK-LQLIAEKETK 770
            +     ++ +        + VGT+ +  PEE E  TKGRIL+F V+  + L+ I +   +
Sbjct: 813  TEHPGSNAQMGSSRPRDIFIVGTSVLDTPEEAEARTKGRILIFDVDTNRELRKICDFPVR 872

Query: 771  GAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVV 830
            GA  +L   N K++AA+ + + +      +    E++ E  +      + +   G+ I V
Sbjct: 873  GACRALAMINNKIVAALMKTVVVLNIKKGNLYNFEIEKEASYRTSTAPVDISVTGNIIAV 932

Query: 831  GDLMKSISLLIYKHEEGA----IEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRK 886
             DLMKSISL+ Y   EG     ++E AR Y   W +A   + ++ +L A+   NL  + +
Sbjct: 933  ADLMKSISLVEYHAGEGGQPDTLKEVARHYQTLWTTAAAPVAENEFLVADAEGNLVVLNR 992

Query: 887  NSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVI 946
            N+ G T+++R R++V  E  LGE VNR  H   +   P+S V  IP     TV+G I + 
Sbjct: 993  NTTGVTEDDRRRMQVTSELRLGEMVNRI-HPMDLQTSPESPV--IPKAFLATVDGSIYLF 1049

Query: 947  ASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAK-NFLDGDLIESFL- 1004
              +       L +LQ+ L   +   G +   ++R+F +  +  +    F+DG+LIE FL 
Sbjct: 1050 GLISPSAQDTLMRLQSALADFVASPGEIPFNKYRAFKSSVRQAEEPFRFVDGELIEQFLT 1109

Query: 1005 ---DLSRTRMDEISKTMNVSVEELCKRVEELTRLH 1036
               D+    +  +     V+V E+   +E L R+H
Sbjct: 1110 FPPDIQEAVLARMDGGGRVNVIEIKDMIEGLKRMH 1144


>gi|392864500|gb|EAS34654.2| UV-damaged DNA binding protein [Coccidioides immitis RS]
          Length = 1144

 Score =  429 bits (1102), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 327/1115 (29%), Positives = 556/1115 (49%), Gaps = 109/1115 (9%)

Query: 6    IYGRIATLE-LFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPT 63
            IYG+I+ L+ + R H  A D LF+ T+RY +  L WD  + +L T +   D++D   R  
Sbjct: 55   IYGKISVLQKVPRSHSSATDLLFVGTDRYAYFTLSWDPSTCQLHTEQKYLDIADPSLRDN 114

Query: 64   DNGQIGIIDPDCRLIGLHLYDGLFKVIPF-------------------DNKGQLKEAFNI 104
             +G    +DP  + + + +Y+G+  VIP                    + +  L E    
Sbjct: 115  QSGDRSWVDPSGKFLTMEIYEGIISVIPIAQEPLKRPSPSAGRSSGTSEQREYLGEPLQT 174

Query: 105  RLEELQVLDIKFLY--GCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEG-PWSQ---- 157
            R+EEL V    FL+      P I +LY++ +    +K     L+D  +  G P  +    
Sbjct: 175  RIEELIVRSTAFLHHDPTKPPRIAILYENTQGKVKLK-----LRDLIYSRGIPGGEASAA 229

Query: 158  ---------NNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIRP----SITK 204
                     ++L+ GAD+L+PVP PL GVLI+GE+ I Y      + I  RP    +I  
Sbjct: 230  EFRDVDDLYDDLELGADILVPVPLPLGGVLILGEKFIKYIDTVKNETI-TRPLEHNTIFV 288

Query: 205  AYGRVDADGSRYLLGDHAG-LLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVY 263
            A+ ++D    R+LL D  G L   ++I +    V   K+ LLGETS AS + +L   VV+
Sbjct: 289  AWEQLD--NQRWLLADDYGRLFFFMLILNSANAVQSWKVGLLGETSRASALVHLGGGVVF 346

Query: 264  IGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQG----------QG 313
            +GS  GDS +I++      +GS+ E+++   N+GPI+DF V+DL  +G          Q 
Sbjct: 347  LGSHQGDSHVIRIT-----EGSF-EIIQTLSNIGPILDFTVMDLGNRGETPTHEFSSGQA 400

Query: 314  QVVTCSGAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLRSSTDDPFDTFLVVSFISE 372
            ++VT SGA++DGSLR VR+G+G+ +   +  ++ I  +W L +   + F   L++SF+ E
Sbjct: 401  RIVTGSGAFRDGSLRSVRSGVGMEDLGVLGAMEHITDLWGLSAFCPEGFCDTLLLSFVDE 460

Query: 373  TRILAMNLEDELEETE-IEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEW 431
            +R+   + + E+EE +   G      T+   +    +++QVT    R+    SR    EW
Sbjct: 461  SRVFHFSPDGEVEEKDDFLGLLLGEPTIHAANLPSRRILQVTEHGARVTDVESRMTLWEW 520

Query: 432  KSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGI-LTEVKHAQLEYEIS--CLDINPI 488
             +     +  A++N   ++L  GG  L+  +IGD I ++  K  + + ++S   L  +PI
Sbjct: 521  SAVESRKITAASSNDRHLVLMVGGQKLMVFDIGDDIKMSSTKTFEADKQVSGVALTSSPI 580

Query: 489  GENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLG--GEIIPRSVLL-CAF-EGISYL 544
                   Q   +       V I  L  LN+   E LG  G+ +PRSVL+ C F +    L
Sbjct: 581  -------QACILCFPQSAEVTIIDLTGLNIRHTETLGEPGDAVPRSVLVACMFSDRAPTL 633

Query: 545  LCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTH-VFAASDRPTVIYS 603
              A+ DG + +F LN+    L+D  K+ LG++    +     +  + +FA  D P++IY+
Sbjct: 634  FVAMADGSVFSFSLNVANYSLSDANKLVLGSEAPVFKLLPRADGLYNIFATCDHPSLIYA 693

Query: 604  SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHP 663
            S  +++YS VN  + + +C FN+ A+P ++A+A   ++ I  +D  +   I+++ + E  
Sbjct: 694  SEDRIVYSAVNSDKATRICHFNAEAYPGAIAVATPDDIKIALVDAERTTQIQTLMINETV 753

Query: 664  RRICHQEQSRTFAICSLKNQ--SCAEESEMHFVRLLDDQTFEFISTYPLDTFEY-GCSIL 720
            RR  +    R F + +++       EE + HF+ L D+  F  +S + L+  E   C I 
Sbjct: 754  RRTSYSSTERAFGLGTIQRTLVQNVEEVKSHFI-LADEIMFRQLSVFDLNPNELVECVIR 812

Query: 721  SCSFSDDSNV-------YYCVGTAYV-LPEENEP-TKGRILVFIVEDGK-LQLIAEKETK 770
            +   S ++ +        + VGT+ +  PEE E  TKGRILVF V+  + L+ I +   +
Sbjct: 813  TEHPSSNAQMGSSRPRDIFIVGTSVLDTPEEAEARTKGRILVFDVDTNRELRKICDFPVR 872

Query: 771  GAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVV 830
            GA  +L   N K++AA+ + + +      +    E++ E  +      + +   G+ I V
Sbjct: 873  GACRALAMINNKIVAALMKTVVVLNIKKGNLYNFEIEKEASYRTSTAPVDISVTGNIIAV 932

Query: 831  GDLMKSISLLIYKHEEGA----IEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRK 886
             DLMKSISL+ Y   EG     ++E AR Y   W +A   + ++ +L A+   NL  + +
Sbjct: 933  ADLMKSISLVEYHAGEGGQPDTLKEVARHYQTLWTTAAAPVAENEFLVADAEGNLVVLNR 992

Query: 887  NSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVI 946
            ++ G T+++R R++V  E  LGE VNR  H   +   P+S V  IP     TV+G I + 
Sbjct: 993  DTTGVTEDDRRRMQVTSELRLGEMVNRI-HPMDLQTSPESPV--IPKAFLATVDGSIYLF 1049

Query: 947  ASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNE-KKTVDAKNFLDGDLIESFL- 1004
              +       L +LQ+ L   +   G +   ++R+F +  ++  +   F+DG+LIE FL 
Sbjct: 1050 GLISPSAQDTLMRLQSALADFVASPGEIPFNKYRAFKSSVRQAEEPFRFVDGELIEQFLT 1109

Query: 1005 ---DLSRTRMDEISKTMNVSVEELCKRVEELTRLH 1036
               D+    +  +     V+V E+   +E L R+H
Sbjct: 1110 FPPDIQEAALARMDGGGRVNVIEIKGMIEGLKRMH 1144


>gi|255956643|ref|XP_002569074.1| Pc21g20880 [Penicillium chrysogenum Wisconsin 54-1255]
 gi|211590785|emb|CAP96985.1| Pc21g20880 [Penicillium chrysogenum Wisconsin 54-1255]
          Length = 1140

 Score =  428 bits (1100), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 324/1097 (29%), Positives = 540/1097 (49%), Gaps = 99/1097 (9%)

Query: 18   PHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPTDNGQIGIIDPDCR 76
            P     D LF+ T++Y +  + WD ES+++ T R+  D+++   R +      +IDP  R
Sbjct: 65   PTNSPTDHLFVGTDQYNYFTITWDRESNQIKTARSCVDIAEPSSRESQCAPRCLIDPTGR 124

Query: 77   LIGLHLYDGLFKV---------------------IPFDNKGQLKEAFNIRLEELQVLDIK 115
             + L +Y+G+  V                     +P    G+L +    R++EL V    
Sbjct: 125  FMTLEVYEGVIVVVPIVQPTKKRGRMSMGGPQADLPL-QVGELDKPTTSRIDELFVRSSA 183

Query: 116  FLYGCAKPTIVVLYQDNKDARHVKTYEV--------ALKDKDF-----VEGPWSQNNLDN 162
            FL+  + P + +LY+DN+    ++  E+         L D  F     +EG      LD 
Sbjct: 184  FLHSESNPWLALLYEDNQQKVRLRIRELDFTPGTAGTLADATFKEVQKLEGGEFGQELDL 243

Query: 163  GADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPI---RPSITKAYGRVDADGSRYLLG 219
            G+  LIP+P PL G++++GE +I Y   NA   I       ++  A+ +VD+   R+LL 
Sbjct: 244  GSSHLIPIPAPLGGLIVLGETSIKYIDDNANDVITRNLDEATVFVAWEKVDS--QRWLLA 301

Query: 220  DHAGLLHLL-VITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNL 278
            D  G L  L  I +   ++   K+E LG+TS AS + YL   ++++GS +GDSQ+++L+ 
Sbjct: 302  DDYGRLFFLSFILNNMGEIDDWKLEYLGKTSRASVLVYLGGGMLFVGSHHGDSQVLRLD- 360

Query: 279  QPDAKGSYVEVLERYVNLGPIVDFCVVDL-----ERQ------GQGQVVTCSGAYKDGSL 327
                 GS  EV++   N+ PI+DF ++DL     E Q      GQ ++VT SGA+ DG+L
Sbjct: 361  -----GSSFEVIQTLSNIAPILDFTIMDLGNRTSESQTHEFSSGQARIVTGSGAFDDGTL 415

Query: 328  RIVRNGIGINEQASV-ELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEE 386
            R VR+G+G+ E   + E++ I   W L++ + D F   L+V+F+ ETR+   + + E+EE
Sbjct: 416  RSVRSGVGMEELGVLGEMEHITDFWGLQTRSKDDFLNTLIVTFVDETRVFQFSTDGEVEE 475

Query: 387  TE-IEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATAN 445
             +   G      TL       ++++QVT   V +    S  +  EW       +  A+AN
Sbjct: 476  LDNFLGLSLTECTLLVTRLQGDRILQVTEQRVLVADLESGMVTFEWAPQDQKLITAASAN 535

Query: 446  ASQVLLATGGGHLVYLEIGDGI-LTEVKHAQLEYEISCLDI--NPIGENPSYSQIAAVGM 502
               ++L   G  +   +I D + +   K    + +IS L +  +P G       +   G 
Sbjct: 536  EDHLVLVISGQIVASFDIRDNVQIITQKDLGADQQISGLTVPSSPTG-------VFIAGF 588

Query: 503  WTDISVRIFSLPDLNLITKEHLG--GEIIPRSVLLCAFEGISY--LLCALGDGHLLNFLL 558
                 V I ++ D  ++  + LG  GE  PRSVL+      S+  L  ++ DG ++ F L
Sbjct: 589  PQSAKVSIMAIKDFAILQTKSLGPTGESFPRSVLVAEVLADSHPTLFISMADGCVITFSL 648

Query: 559  NMKTGELTDRKKVSLGTQPITLRTFSSKNTTH-VFAASDRPTVIYSSNKKLLYSNVNLKE 617
            N +   L++  K+ LG++  T +     +  + VFA  + P++IY S  +++YS VN + 
Sbjct: 649  NPQDYSLSEMNKLILGSEQPTFKKLPKGDGLYNVFATCENPSLIYGSEGRIIYSAVNSEG 708

Query: 618  VSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAI 677
             S +C  N+ AFPDS+A+A E EL I  +D  +   I+++P+G   RR+ +    + F I
Sbjct: 709  ASRVCHLNAEAFPDSIAVATEKELKIALVDKERTTQIQTLPMGSTVRRVAYSPSEKAFGI 768

Query: 678  CSLKNQ--SCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSF--SDDSNV--- 730
             ++  +  + AE  + HFV L D+  F  +    L   E   S++        D N    
Sbjct: 769  GTIDRKLVNGAEVVKSHFV-LADEIMFRRLDALELGPDELVESVVRAELPAGKDENGKEI 827

Query: 731  ---YYCVGTAYVLPEENEPTKGRILVFIVEDG-KLQLIAEKETKGAVYSLNAFNGKLLAA 786
                + VGTA+   E++E  +GRIL+  V+ G KL  +AE    GA  +L      ++AA
Sbjct: 828  MKDRFVVGTAFADEEQDESIRGRILILEVDHGRKLSQVAELPVMGACRALAMMGDCVVAA 887

Query: 787  INQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKH-- 844
            + + + +Y+  + + G  +L+    +      + V    D I V DLMKS+ L+ Y    
Sbjct: 888  LVKTVVVYRVKINNVGPMKLEKLAAYRTSTAPVDVIVVDDLIAVADLMKSLCLVRYTPGH 947

Query: 845  --EEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVV 902
              E   + E  R Y   W +A+  + D+ +L ++   NL  + +N  G T +++ RL   
Sbjct: 948  AGEPAKLTEVGRHYQTVWSTAIACVGDETFLQSDAEGNLIVLSRNMNGVTAQDKHRLMPT 1007

Query: 903  GEYHLGEFVNRFRHGSLVMRLPD-SDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQ 961
             E  LGE VNR R     + +P  S V   P     TV G I + A +  E   FL  LQ
Sbjct: 1008 SEISLGEMVNRIRP----VNIPQLSSVMVTPRAFMATVEGSIFLFAVINPEHQDFLMTLQ 1063

Query: 962  TNLRKVIKGVGGLNHEQWRSFNNEKKTVDAK-NFLDGDLIESFLDLSRTRMDEISKTMNV 1020
             +L   I  +G L+ +++RSF    ++ +A   F+DG+LIE FL+ S +  +EI + +  
Sbjct: 1064 ASLSTKINSLGNLSFDKFRSFRTMVRSAEAPYRFVDGELIEQFLNCSPSMQEEIVQEIGS 1123

Query: 1021 S-VEELCKRVEELTRLH 1036
            S V E+ + +E L RLH
Sbjct: 1124 SDVVEVKRMIEALRRLH 1140


>gi|320593036|gb|EFX05445.1| uv-damaged DNA-binding protein [Grosmannia clavigera kw1407]
          Length = 1504

 Score =  428 bits (1100), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 337/1157 (29%), Positives = 562/1157 (48%), Gaps = 146/1157 (12%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPTD 64
            I+G I+ L+  RP     + LF+ T+R+++  L WD ES  L T     DV ++  R + 
Sbjct: 368  IFGTISMLQSLRPIDSKTELLFVGTDRFEYFTLTWDMESHSLRTVDTFQDVGEKHLRDSQ 427

Query: 65   NGQIGIIDPDCRLIGLHLYDGLFKVIPF---------DNKGQLKEAFNIRLEELQVLDIK 115
            +    ++DP  R + L L++G+  V+           D+   L     +RL EL +    
Sbjct: 428  SQDRCLVDPSGRFMALLLWEGVLTVLRLQTRRDRARPDHMRHLVPMDQVRLSELFIKAST 487

Query: 116  FLYG-CAKPTIVVLYQDNKDARHVK--TYEVALKDKDFVEGPWSQNNL--------DNGA 164
            FL+    +P I  LYQ   D    K   Y +   D++   G +  N          D GA
Sbjct: 488  FLHSETGQPQIAFLYQSQGDRAEAKLSAYRLTAGDRNVDTGRFDANRDREISMDIDDPGA 547

Query: 165  DLLIPV-----------------PPPLCGVLIIGEETIVY--CSANAFKAIPIRP-SITK 204
             LLIPV                    L G+L++GE  ++Y   +      +P+R  SI  
Sbjct: 548  ALLIPVEKVEPAKRHNVRNTATATANLGGLLVVGETRLLYIDSTTKCTVEVPLRAASIFV 607

Query: 205  AYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYL-DNAVVY 263
            A+ R DA  + YLL D  G LHLL I      V  L +  +G+TS AS + YL D  +++
Sbjct: 608  AWARYDA--THYLLADEYGTLHLLTILVSGAVVDNLDVSPIGKTSRASCLVYLPDRRLLF 665

Query: 264  IGSSYGDSQLIKLNLQPDAKGSY-VEVLERYVNLGPIVDFCVVDLERQ------------ 310
            +GS  GDSQL +L+L     G + ++VL+   N+ P++DF V+D+  +            
Sbjct: 666  VGSHNGDSQLFRLDLAASPVGLHELQVLQ---NIAPVLDFTVMDMGNREDDQQLANEYAS 722

Query: 311  GQGQVVTCSGAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLRS---STDDPFDTFLV 366
            GQ ++VT SG +KDGSLR VR+G+G+ +   + +L G++G++SLRS         DT LV
Sbjct: 723  GQARIVTGSGVHKDGSLRSVRSGVGLEDIGILGDLGGVRGLFSLRSPQTQQQQQVDT-LV 781

Query: 367  VSFISETRILAMNLEDELEETE-IEGFCSQTQTLFCHD-----AIYNQLVQVTSGSVRLV 420
             SF++ETR+   + + E+EE E   G    TQTL   +     +  ++L+Q+T GSV L 
Sbjct: 782  ASFLTETRVFLFDGDGEIEEVEAFPGLNLGTQTLLATNLSSGSSSSSRLLQITPGSVTLA 841

Query: 421  SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEV-KHAQLEYE 479
             + S  +   W  P   ++  A+AN+  VLL+  G  LV L + + +     K      +
Sbjct: 842  ETASGTIVASWTPPDDRTIVSASANSRWVLLSVEGTTLVSLSLDNSLAVAAQKEVGTTDQ 901

Query: 480  ISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEI--------IPR 531
            I+CL   P         +  VG W    V   S+ DL  +  +H G  +        +PR
Sbjct: 902  IACLHAAP-----QLLDVGVVGQWASGMV---SVVDLATLEPKHGGKSLRRRDDNASVPR 953

Query: 532  SVLLCAF--EGIS--YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTF--SS 585
            S++L      G++   L  A+ DG+++ F ++     L+ RK V LGT+   L     S 
Sbjct: 954  SIVLAQVLPPGMAGPTLFVAMDDGNVITFAVSPSDLSLSGRKSVVLGTRHARLHPLPQSD 1013

Query: 586  KNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGT 645
            + T  +FA ++ P++IY S  ++ Y+ V  ++ + +C F++AAFP ++A+A + ++ +  
Sbjct: 1014 EATYSIFATTEHPSLIYGSEGRIAYAAVTAEDANFVCHFDAAAFPGAIAVATDSQIKLSR 1073

Query: 646  IDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQ-SCAEESEMHFVRLLDDQTFEF 704
             D  ++ H+R+I +GE  RRI +    R FA+  +K + +   E     ++L+D+  F+ 
Sbjct: 1074 TDTTKQTHVRAIDMGETIRRIAYSATERVFALGCIKRELTQGREVVTSSLKLVDEVAFQP 1133

Query: 705  ISTYPLDTFEYGCSILSCSFS-------DDSNVYYCVGTAYVLP--EENEPTKGRILVFI 755
            +   PL     G  ++ C           +    + VGT+++     ENE T GRILV  
Sbjct: 1134 LGK-PLALEVEGTELVECVVRAELRDALGNPAERFLVGTSFMAAGSSENEHTLGRILVVG 1192

Query: 756  VE-DGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHG 814
            V+ D    +++    +G    L   +  ++A +++ + L ++      + EL+    +  
Sbjct: 1193 VDSDHSPYIVSSHRVRGPCRCLAMVDDLIVAGLSKTVVLSRYTETSSMSGELKKVASYRT 1252

Query: 815  HILALYVQTRGDFIVVGDLMKSISLLIY-------------------KHEEG-------- 847
                + +   G  I VGD+MKS +L+ Y                   K  +G        
Sbjct: 1253 ATYVVDLAVDGHMIAVGDMMKSTALVEYIPATSGDGEDEEDDGAGDNKKGKGKTADRSKT 1312

Query: 848  -----AIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVV 902
                  + ERAR Y A+W +AV  ++ D++L A+   NL  + ++ +G T +++ RL  V
Sbjct: 1313 IAEGPKLVERARGYQASWATAVCHVEGDLWLEADGFGNLTMLERDVQGVTADDKRRLRTV 1372

Query: 903  GEYHLGEFVNRFRHGSLVMRLPDSDVGQI--PTVIFGTVNGVIGVIASLPHEQYLFLEKL 960
            GE +LGE VNR R  ++     ++  G +  P     TV G I ++ ++  E    L  L
Sbjct: 1373 GEMYLGEMVNRIRPIAV-----ETSPGAMVHPRAFLATVEGSIYMVGTIAPEAQDLLMNL 1427

Query: 961  QTNLRKVIKGVGGLNHEQWRSFNN-EKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN 1019
            QT L  ++KG G  +   +RSF N E+++ +   F+DG+L+E FLD+      E+++ + 
Sbjct: 1428 QTKLAAIVKGPGNTSFSAYRSFRNAERESTEPFRFVDGELLERFLDVGEDVQKEVAQGLG 1487

Query: 1020 VSVEELCKRVEELTRLH 1036
             SVE+L   +EEL RLH
Sbjct: 1488 PSVEDLRNIIEELKRLH 1504


>gi|340960602|gb|EGS21783.1| hypothetical protein CTHT_0036510 [Chaetomium thermophilum var.
            thermophilum DSM 1495]
          Length = 1100

 Score =  427 bits (1097), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 331/1114 (29%), Positives = 553/1114 (49%), Gaps = 105/1114 (9%)

Query: 13   LELFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPTDNGQIGII 71
            L+  RP     D LF+AT+R+ +  L W+ ++  L T  +  D  +R  R + +    ++
Sbjct: 2    LQKLRPRDSKTDLLFVATDRFDYFTLLWNPQTFRLETVNSYKDPGERYMRDSQSQDRCLV 61

Query: 72   DPDCRLIGLHLYDGLFKVIPF----DNKGQLKEAFNIRLEELQVLDIKFLYG-CAKPTIV 126
            DP  R + +HL++G+  V       +N   L+    +RL EL +    FLY     P + 
Sbjct: 62   DPSGRFMIMHLWEGVLNVWRMGHRKNNAMVLEWMEQVRLSELFIKASTFLYTETGHPKVA 121

Query: 127  VLYQDNKDARHVK--TYEVALKDKDFVEG---PWSQNNLD-----NGADLLIPV------ 170
             LYQ   DA   K  TY +   D+D V     P     LD     + A +LIPV      
Sbjct: 122  FLYQTRADATDAKLATYRLTSDDRDTVASQFNPEKDRELDMEVRDSTASMLIPVRKVEEE 181

Query: 171  ------------PPPLCGVLIIGEETIVY---CSANAFKAIPIRPSITKAYGRVDADGSR 215
                           L G++++GE  ++Y    +    +++   PSI  A+   +   + 
Sbjct: 182  VKRHNVRHVELAKAHLGGLIVVGETRLLYIDEVTKAKVESVLKEPSIFVAWAEYNV--TH 239

Query: 216  YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIK 275
            Y L D  G LHLL I      VTG+++  +GETS AST+ YL +  +++GS YG+SQL++
Sbjct: 240  YFLADDYGHLHLLRIKTNGVVVTGMEVSRIGETSRASTLVYLGDNYLFVGSHYGNSQLLR 299

Query: 276  LNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQ------------GQGQVVTCSGAYK 323
            L+L+       ++++E + N+GPI+DF ++D+  +            GQ ++VTCSG +K
Sbjct: 300  LDLENQNPKQRLQLIESFQNIGPILDFAIMDMGNRGDSGQPGNEYSSGQARIVTCSGVHK 359

Query: 324  DGSLRIVRNGIGINEQASV-ELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLED 382
            DG+LR VR+G+G+ +   + +L+  +G++SLRS      +  LV+SF++ETR+   + + 
Sbjct: 360  DGTLRSVRSGVGLEDIGILADLELCRGLFSLRSHGSLKTN-ILVMSFLTETRVFKFDHQG 418

Query: 383  ELEETEIEGFCSQT---QTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSV 439
            ++E  E+  FC  T   QTL   D    Q++ VT  +  L+ + S      W    G  +
Sbjct: 419  DIE--ELSSFCGMTLDQQTLLAVDLPSGQILHVTPAAATLLDTESGVAITSWTPEEGRCI 476

Query: 440  NVATANASQVLLATGGGHLVYLEIGDGI-LTEVKHAQLEYEISCLDINPIGENPSYSQIA 498
              A+ANA  +LL+  G  LV L + + + + + K+     +I+CL +      P  S I 
Sbjct: 477  INASANAEWLLLSVDGVGLVSLSLSNDLRILKEKNLNQSDQIACLHV-----PPQSSGIG 531

Query: 499  AVGMWTDISVRIFSLPDLNLITKEHL----GGEIIPRSV----LLCAFEGISYLLCALGD 550
             VG WT  +V I  L  L  I  E L        IP+ +    LL        LL A  D
Sbjct: 532  VVGFWTSGTVSIIDLHTLEPIHGESLRTSKDDTSIPQDLALVQLLPPEVSGPLLLIATQD 591

Query: 551  GHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTH-VFAASDRPTVIYSSNKKLL 609
            G++++F ++      + +++V LG           +N  + V A ++ P++IY S  +++
Sbjct: 592  GNVVSFNISSDYS-FSGKRRVVLGVTQAKFHLLPQENNLYSVLATTEHPSLIYGSEGRIV 650

Query: 610  YSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQ 669
            YS V  +E +++CPF+S AFPD +A+A + ++ +  +D  +K ++RS+PL E  RRI + 
Sbjct: 651  YSAVTAEEAAYVCPFDSEAFPDCVALATDSQIKLARLDRERKTYVRSLPLNEMVRRIAYS 710

Query: 670  EQSRTFAICSLKNQSC-AEESEMHFVRLLDDQTFEFI-STYPLDTFEYGCSILSCSFSDD 727
             + + F +  +K +    EE      +L+D+  F+ +  +  L +  Y   +   + ++ 
Sbjct: 711  PKEKVFGLGCIKRELVKGEEIVQSSFKLVDEVLFDRVGKSIELGSPSYTELVECVTRAEF 770

Query: 728  SNVY------YCVGTAYV-------LPEENEP-TKGRILVF-IVEDGKLQLIAEKETKGA 772
             + Y      + VGT+Y+        P    P  +GRILV  I  D     I   + KGA
Sbjct: 771  IDSYGNPAERFLVGTSYLPDPDYSPAPSHGNPEARGRILVLGIDSDRNPYQILSYQLKGA 830

Query: 773  VYSLNAF-NGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVG 831
               L    +GK++  + + + + ++      T +L     +      + +   G  I V 
Sbjct: 831  CRCLAVMDDGKVVVGLTKAVTVCEYKETSSTTAQLTKLASYRPSTYPVEIAIHGRTIAVA 890

Query: 832  DLMKSISLLIY-KHEEGA---IEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKN 887
            DLMKSISL+ Y   EEG    + ERAR Y + W +AV  + + ++L A+   NL  +R+N
Sbjct: 891  DLMKSISLVDYIPAEEGGQAKLVERARHYQSAWSTAVGYVQNGLWLEADAQGNLQVLRQN 950

Query: 888  SEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIA 947
             +G T+++R R+E+  E +LGE VNR R  + V   P++ +  IP    GTV G I +  
Sbjct: 951  VDGITEDDRKRMELTAEINLGEMVNRIRSIT-VETSPEALI--IPRAFLGTVEGGIYMFG 1007

Query: 948  SL-PHEQYLFLEKLQTNLRKVIKGVG---GLNHEQWRSFNNEKKTVDAK-NFLDGDLIES 1002
            ++ PH   L L + Q  +  VIK VG     N   +R+F N ++       FLDG+L+E 
Sbjct: 1008 TIAPHALDLLL-RFQEKVADVIKAVGDSDNANFRSYRAFKNAERVGHGPFRFLDGELLER 1066

Query: 1003 FLDLSRTRMDEISKTMNVSVEELCKRVEELTRLH 1036
            FLD+     + I   +  +VE++   VEEL R+H
Sbjct: 1067 FLDVDEAIQEVICSGLGPTVEDMRNIVEELRRMH 1100


>gi|134077422|emb|CAK45676.1| unnamed protein product [Aspergillus niger]
          Length = 1133

 Score =  426 bits (1095), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 326/1103 (29%), Positives = 546/1103 (49%), Gaps = 116/1103 (10%)

Query: 6    IYGRIATL-ELFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPT 63
            ++ ++  L  L  P     D LF+ T+RY +C L WD E +++ T R   D+SD   R  
Sbjct: 55   LFAKVTMLARLPAPANSPTDHLFVGTDRYTYCTLSWDGERNQIRTERNYVDISDPSSREA 114

Query: 64   DNGQIGIIDPDCRLIGLHLYDGLFKVIPF--------------------DNKGQLKEAFN 103
              G   +IDP  R + L +Y+G+  V+P                        G+L E   
Sbjct: 115  QTGNRCLIDPSGRFMTLEVYEGVIAVVPIVQLPSKKRGRQVAPPSGPDAPRVGELGEPTT 174

Query: 104  IRLEELQVLDIKFLYGCAKPTIVVLYQDNKDARHVKT----YEVALK----DKDFVEG-- 153
             R++EL               + +LY+DN+    +K     Y  A      D  F E   
Sbjct: 175  ARIDEL---------------LALLYEDNQKKVRLKVRALHYSAATASTGADAAFEESLD 219

Query: 154  PWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIRP----SITKAYGRV 209
             +SQ  LD GA  LIPVP PL G+L++GE +I Y   ++ + +  RP    +I  A+ +V
Sbjct: 220  GFSQE-LDLGASHLIPVPAPLGGLLVLGETSIKYVDTDSNEIVS-RPLDEATIFVAWEQV 277

Query: 210  DADGSRYLLGDHAG-LLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSY 268
            D+   R+LL D  G L  L+++     +V   K++ LG T+ AS + YL   V+++GS  
Sbjct: 278  DS--QRWLLADDYGRLFFLMLVLDSNNQVQSWKLDHLGNTARASVLIYLGGGVIFVGSHQ 335

Query: 269  GDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDL-----ERQ------GQGQVVT 317
            GDSQ++++       GS +EV++   N+ PI+DF ++DL     E Q      GQ ++VT
Sbjct: 336  GDSQVLRI-----GNGS-LEVIQTLSNIAPILDFTIMDLGNRTSESQTHEFSSGQARIVT 389

Query: 318  CSGAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLRSSTDDPFDTFLVVSFISETRIL 376
             SGA+ DG+LR VR+G+G+ E   + E++ I  ++ L+ +T   +   L+V+F+ ETR+ 
Sbjct: 390  GSGAFDDGTLRSVRSGVGMEELGVLGEMELITDLFGLQLATKGGYLDTLLVTFVDETRVF 449

Query: 377  AMNLEDELEETE-IEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPP 435
              N + E+EE +   G      TL   +    +++QVT   V +       + NEW  P 
Sbjct: 450  QFNFDGEVEELDSFLGLSLSENTLLAMNLPGGRILQVTEQRVLIADIEGGMVTNEWTPPD 509

Query: 436  GYSVNVATANASQVLLATGGGHLVYLEIGDGI-LTEVKHAQLEYEISCLDINPIGENPSY 494
               +  A+AN   ++L +GG  +  L+I + + +   K    + +IS + + P+    S 
Sbjct: 510  NLVITSASANNDSIVLVSGGQLMTVLDINNDVRVISQKDFGADSQISGVTV-PL----SS 564

Query: 495  SQIAAVGMWTDISVRIFSLPDLNLITKEHLG--GEIIPRSVLLCAF--EGISYLLCALGD 550
            + +  VG      V +  L  L+ +    LG  GE  PRSVL+        S L  ++ D
Sbjct: 565  AGVCIVGFPQLAKVSVLDLGRLSELHTTSLGPAGEAFPRSVLVADVLANSPSTLFISMAD 624

Query: 551  GHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTH-VFAASDRPTVIYSSNKKLL 609
            G ++ +  + +   LT   ++ LG++  T +     +  + VFA  + P++IY S  +++
Sbjct: 625  GSVITYSFDARNYSLTGMNRLILGSEQPTFKKLPRGDGLYNVFATCENPSLIYGSEGRII 684

Query: 610  YSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQ 669
            YS VN +  S +C FNS A+P S+A+A   +L I  +D  +   I+++P+GE  RR+ + 
Sbjct: 685  YSAVNSEGASRVCHFNSEAYPGSIAVATRHDLKIALVDKERTTQIQTLPMGETARRVAYS 744

Query: 670  EQSRTFAICSL--KNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFS-- 725
               + F I ++  K +  AE  +  FV L D+  F  +  + L   E   S++   FS  
Sbjct: 745  PSEKAFGIGTIERKLKDGAEMVQSRFV-LADEIMFRRLDAFDLRPEELVESVIRAEFSAG 803

Query: 726  ---DDSNVY---YCVGTAYVLPEENEPTKGRILVFIVEDG-KLQLIAEKETKGAVYSLNA 778
               +  +V+   + VGTAY+  E  E  +GRILVF +++G KL  +AE   KGA  +L  
Sbjct: 804  KDENGRDVFKDRFVVGTAYLDDENEESIRGRILVFEIDNGRKLTKVAELPVKGACRALAM 863

Query: 779  FNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSIS 838
               K++AA+ + + +Y  +  D G  +L+    +      + V   G+ I V DLMKS+ 
Sbjct: 864  LGEKIVAALVKTVVIYGVVNNDFGAMKLEKLASYRTSTAPVDVTVTGNVIAVADLMKSVC 923

Query: 839  LLIYKHEEG----AIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDE 894
            L+ Y   E     ++ E AR +   W + V  +  D +L  +   NL  +R+N  G  ++
Sbjct: 924  LVEYSEGENGSPDSLTEVARHFQTVWATGVSCIAKDTFLETDAEGNLIVLRRNLTGVEED 983

Query: 895  ERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQY 954
            ++ RLEV GE  LGE VNR R           ++ Q+ +V   TV G I + A +  E  
Sbjct: 984  DKRRLEVTGEISLGEMVNRIRP---------VNIQQLASV---TVEGSIYLFAIINPEHQ 1031

Query: 955  LFLEKLQTNLRKVIKGVGGLNHEQWRSFNNE-KKTVDAKNFLDGDLIESFLDLSRTRMDE 1013
             FL +LQ  +   ++ +G +   ++R F +  ++  +   F+DG+LIE FL    +  +E
Sbjct: 1032 DFLMRLQATMAGKVESLGNIPFNEFRGFRSMVREAKEPYRFVDGELIERFLTCEPSLQEE 1091

Query: 1014 ISKTMN-VSVEELCKRVEELTRL 1035
            I  ++  ++V+E  +    LT L
Sbjct: 1092 IVDSVGMMNVDERIRPTVYLTSL 1114


>gi|67516629|ref|XP_658200.1| hypothetical protein AN0596.2 [Aspergillus nidulans FGSC A4]
 gi|40747539|gb|EAA66695.1| hypothetical protein AN0596.2 [Aspergillus nidulans FGSC A4]
 gi|259489136|tpe|CBF89158.1| TPA: damaged DNA binding protein (Eurofung) [Aspergillus nidulans
            FGSC A4]
          Length = 1132

 Score =  426 bits (1094), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 327/1100 (29%), Positives = 539/1100 (49%), Gaps = 91/1100 (8%)

Query: 6    IYGRIATLE-LFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPT 63
            I+ R+  L  L  P     D LF+ T+RY +  L WD+  +++ T R   D++D   R  
Sbjct: 55   IFARVTMLACLPAPANSPTDHLFVGTDRYSYFTLSWDSARNQVRTERDYVDIADPSSRDA 114

Query: 64   DNGQIGIIDPDCRLIGLHLYDGLFKVIPF--------------------DNKGQLKEAFN 103
              G   +IDP  R + L +YDG+  VIP                        G+L E   
Sbjct: 115  RTGSRCMIDPSGRFMTLEIYDGMIVVIPIIQLPSKRRGRQVALPTGPDAPRIGELGEPII 174

Query: 104  IRLEELQVLDIKFLY-GCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDFVEGPWSQ 157
             R++EL V    FL+     P + +LY+DN+       R +K    A  + +F       
Sbjct: 175  TRIDELFVRSSAFLHVQAGSPRLALLYEDNQKKVKLKVRELKYSTAAGAESEFTSIADYA 234

Query: 158  NNLDNGADLLIPVPPPLC---GVLIIGEETIVYCSA--NAFKAIPIR-PSITKAYGRVDA 211
              LD GA  LIPVP PL    G+LI+GE +I Y  A  N   + P+   +I  A+ +VD+
Sbjct: 235  QELDLGASHLIPVPAPLAAAGGLLILGETSIKYVDADNNEIVSQPLEEATIFVAWEQVDS 294

Query: 212  DGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDS 271
               R+LL D  G L  L++     +V   ++  LG TS AS + YL   VV++GS  GDS
Sbjct: 295  --QRWLLADDYGRLFFLMLVLRNSEVERWELHSLGNTSRASVLVYLGGGVVFVGSHQGDS 352

Query: 272  QLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDL-----ERQ------GQGQVVTCSG 320
            Q+I++  Q        +V++   N+ P++DF ++DL     E Q      GQ ++VT SG
Sbjct: 353  QVIRIGDQS------FQVIQTLSNIAPVLDFTIMDLGNRTSENQMHEFSSGQARIVTGSG 406

Query: 321  AYKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMN 379
            A+ DG+LR VR+G+G+ E   + +++ I  +W L+  +   F   L+V+F++ETR+   +
Sbjct: 407  AFDDGTLRSVRSGVGLEELGVLGDMEHITDLWGLQVGSRGDFLDTLLVTFVNETRVFRFS 466

Query: 380  LEDELEETE-IEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYS 438
             + E EE E   G      TL   +   ++++QVT   V +          EW       
Sbjct: 467  PDGEAEELESFLGLSLSENTLLAANLPGSRILQVTEQRVLIADIECGMTIFEWTPKNQLI 526

Query: 439  VNVATANASQVLLATGGGHLVYLEI-GDGILTEVKHAQLEYEISCLDINPIGENPSYSQI 497
            +  A+AN   ++L  GG H+  L+I  +  +   K    + +IS + +       + + +
Sbjct: 527  ITAASANDDTIVLVAGGKHVTVLDIQSEARVVSEKDFGADNQISGVTLPT-----TPTDV 581

Query: 498  AAVGMWTDISVRIFSLPDLNLITKEHLG--GEIIPRSVLLCAF--EGISYLLCALGDGHL 553
              VG      V +  L DL+ I+   LG  GE  PRSVL+ +   E    L  ++ DG +
Sbjct: 582  CIVGFPQLAKVSVLKLQDLSHISSTSLGPAGEAFPRSVLVASVLAENAPTLFISMADGSV 641

Query: 554  LNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKN-TTHVFAASDRPTVIYSSNKKLLYSN 612
            + +  N +   L+   K+ LG++  T +     N  ++VFA  + P++IY S  +++YS 
Sbjct: 642  ITYDYNDQDHSLSGMNKLVLGSEQPTFKKLPRGNGLSNVFATCENPSLIYGSEGRIIYSA 701

Query: 613  VNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQS 672
            VN +  S +C FNS A+P+S+A+A   EL IG +D  +   I+++P+    RR+ +    
Sbjct: 702  VNSEGASRICHFNSEAYPESIAVATAQELKIGLVDKERTTQIQTLPIKATVRRVAYSPSE 761

Query: 673  RTFAICSLKNQ--SCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSC-------- 722
            + F + +++ +  S  E  +  FV L D+  F  +  + L+    G  I+ C        
Sbjct: 762  KAFGMGTIERKLVSGEEIVKSQFV-LADEILFRRLDAFDLE----GEEIVECVIRAEAPE 816

Query: 723  SFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDG-KLQLIAEKETKGAVYSLNAFNG 781
            S   ++   + VG+AY+  ++ + T G I VF V++G KL  +A++  KGA  +L     
Sbjct: 817  SKDGEAKDRFVVGSAYLGEDDGDSTLGYIRVFEVDNGRKLAKVAQERVKGACRALAVMGD 876

Query: 782  KLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLI 841
            K++AA+ + + +++ + R  G  +LQ    +      + +    + I + DLMKS+ ++ 
Sbjct: 877  KIVAALVKTVVVFQVVPRSGGL-QLQRLASYRTSTAPVDITVTRNVIAIADLMKSVCVVE 935

Query: 842  YKH-EEGA---IEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERG 897
            Y   E GA   + E AR +   W + V  +  D YL ++   NL  +R+N  G  +++R 
Sbjct: 936  YHEGENGAPDKLVEVARHFQTVWATGVTSVAPDTYLESDAEGNLIVLRRNRSGVEEDDRR 995

Query: 898  RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFL 957
            RLEV GE  L E VNR R  + + +LP + V  +P     TV G I + A +  +   FL
Sbjct: 996  RLEVTGEICLNEMVNRIRPVN-IQQLPSATV--VPRAFLATVEGSIYLYAIINPDYQDFL 1052

Query: 958  EKLQTNLRKVIKGVGGLNHEQWRSFNN-EKKTVDAKNFLDGDLIESFLDLSRTRMDEISK 1016
             +LQ  +      +GG+    +R+F    ++  +   F+DG+LIE FL        EI  
Sbjct: 1053 MRLQATMASRADSLGGIPFTDYRAFRTMTRQATEPYRFVDGELIERFLTCEPAVQKEIVD 1112

Query: 1017 TMNVSVEELCKRVEELTRLH 1036
             +  S+EE+   VE L RLH
Sbjct: 1113 IVGSSLEEVRAIVEALRRLH 1132


>gi|238491136|ref|XP_002376805.1| UV-damaged DNA binding protein, putative [Aspergillus flavus
            NRRL3357]
 gi|220697218|gb|EED53559.1| UV-damaged DNA binding protein, putative [Aspergillus flavus
            NRRL3357]
          Length = 1117

 Score =  425 bits (1092), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 332/1110 (29%), Positives = 551/1110 (49%), Gaps = 104/1110 (9%)

Query: 6    IYGRIATL-ELFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPT 63
            +Y +++ L  L  P     D LF+ T+RY +C L WD+  + + T R   D++D   R +
Sbjct: 33   LYAKVSMLARLPAPAHSPTDHLFVGTDRYTYCTLSWDSAQNRIRTERNYVDIADPSSRES 92

Query: 64   DNGQIGIIDPDCRLIGLHLYDGLFKVIPF--------------------DNKGQLKEAFN 103
              G   +IDP  R + L +Y+GL  V+P                        G+L E   
Sbjct: 93   QTGNRCLIDPSGRFMTLEVYEGLVAVVPIVQLPARKRGRAPAVPTGPDAPKVGELGELTT 152

Query: 104  IRLEELQVLDIKFLY-GCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQN---- 158
             R++EL V    FL+     P + +LY+DN+     K   + +++ ++     S +    
Sbjct: 153  ARIDELFVRSSAFLHVQSGLPRLALLYEDNQ-----KKVRLMVRELNYTSATASTSADAT 207

Query: 159  ---------NLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIRP----SITKA 205
                      LD GA  LIPVP PL G+LI+GE +I Y   +  + I  RP    +I  A
Sbjct: 208  LTHIADFAQELDLGASHLIPVPAPLGGLLILGETSIKYVDDDNNEIIS-RPLDEATIFVA 266

Query: 206  YGRVDADGSRYLLGDHAG-LLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYI 264
            +  VD+   R+LL D  G L  L+++     +V G K++ LG TS AS + YL   +V++
Sbjct: 267  WEGVDS--QRWLLADDYGRLFFLMLVLDSDNQVQGWKLDHLGNTSRASALVYLGGGIVFV 324

Query: 265  GSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDL-----ERQ------GQG 313
            GS  GDSQ++++       GS+ E+++   N+ PI+DF ++DL     E Q      GQ 
Sbjct: 325  GSHQGDSQVLRI-----GNGSF-EIIQALSNIAPILDFTIMDLGNRTSESQTHEFSSGQA 378

Query: 314  QVVTCSGAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLR-SSTDDPFDTFLVVSFIS 371
            ++VT SGA+ DG+LR VR+G+G+ E   + +++ I  +W L+  +  D  DT L+V+FI 
Sbjct: 379  RIVTGSGAFDDGTLRSVRSGVGMEELGVLGDMEHITDLWGLQVQAGGDTLDT-LLVTFID 437

Query: 372  ETRILAMNLEDELEETE-IEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNE 430
            ETR+   + + E+EE +   G      TL   +    +++QVT   V +       +  E
Sbjct: 438  ETRVFHFSPDGEVEELDHFLGLSLSENTLLAANLPRGRILQVTEQRVLIADLEGGMVVYE 497

Query: 431  WKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD--GILTEVKHAQLEYEISCLDINPI 488
            W  P    +  A+AN   ++L  GG  +  L+IG    ++TE K    + ++S + +   
Sbjct: 498  WTPPNELVITAASANDDSLVLVIGGELMTVLDIGTEAQVITEKKFGA-DSQVSGVTVPA- 555

Query: 489  GENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLG--GEIIPRSVLLCAF--EGISYL 544
                S +++  VG      V +  L DL  +    LG  GE  PRSVL+     +    L
Sbjct: 556  ----SPTEVCVVGFPQLAKVSVLRLRDLTEVHTTSLGPAGEAFPRSVLVADVLADSPPTL 611

Query: 545  LCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTH-VFAASDRPTVIYS 603
              ++ DG ++ +        L+   K+ LG++  T +     +  + VFA  + P++IY 
Sbjct: 612  FISMADGSVITYSFKTDDYSLSHMNKLILGSEQPTFKKLPRGDGLYNVFATCENPSLIYG 671

Query: 604  SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHP 663
            S  +++YS VN +  S +C FNS A+P S+A+A   EL I  +D  +   I+++ +G   
Sbjct: 672  SEGRIIYSAVNSEGASRVCHFNSEAYPGSIAVATLHELKIALVDRERTTQIQTLQIGATV 731

Query: 664  RRICHQEQSRTFAICSLKNQSCAEESEMHFVRLL--DDQTFEFISTYPLDTFEYGCSILS 721
            RR+ +    + F I +++ +  A+ +E+   R +  D+  F  + ++ L   E   S++ 
Sbjct: 732  RRVAYSPSEKAFGIGTIERK-LADGAEIVTSRFMLADEVLFRQLDSFELRPEEIVESVIR 790

Query: 722  CSF--SDDSNV------YYCVGTAYVLPEENEPTKGRILVFIVEDG-KLQLIAEKETKGA 772
              F    D N        + VGTAY+  E  E  +GRIL+F +++G KL  +AE   KGA
Sbjct: 791  AEFPAGKDENGREMTKDRFVVGTAYLDDEGEESIRGRILMFEIDNGRKLTKVAELPVKGA 850

Query: 773  VYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGD 832
              +L     K++AA+ + I +YK +  + GT +L+           + V   G+ IVV D
Sbjct: 851  CRALAMLGDKIVAALVKTIVIYKVVNNNFGTMKLEKLASFRTSTAPVDVTVVGNVIVVSD 910

Query: 833  LMKSISLLIYKHEEG----AIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNS 888
            LMKS+ LL +K  E     ++ E AR +   W + V  +D D +L ++   NL  +R+N 
Sbjct: 911  LMKSVCLLEFKEGENGLPDSLTEVARHFQTVWATGVACIDKDTFLESDAEGNLIVLRRNL 970

Query: 889  EGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIAS 948
             G  +++R RLEV  E  LGE VNR R  ++      + V   P    GTV G I + A 
Sbjct: 971  AGVEEDDRRRLEVTSEISLGEMVNRIRPVNIQQL---ASVTVTPRAFLGTVEGSIYLFAI 1027

Query: 949  LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNE-KKTVDAKNFLDGDLIESFLDLS 1007
            +  E   FL +LQ  +   ++ +G +   ++R F +  ++  +   F+DG+LIE FL+  
Sbjct: 1028 INPEHQDFLMRLQATMAGKVESLGEMPFNEFRGFRSMVREATEPYRFVDGELIEQFLNCE 1087

Query: 1008 RTRMDEISKTMN-VSVEELCKRVEELTRLH 1036
                +EI  ++  ++V E+   +E L RLH
Sbjct: 1088 PELQEEIVNSVGMMNVHEVKVMIEALRRLH 1117


>gi|169773185|ref|XP_001821061.1| UV-damaged DNA binding protein [Aspergillus oryzae RIB40]
 gi|83768922|dbj|BAE59059.1| unnamed protein product [Aspergillus oryzae RIB40]
          Length = 1139

 Score =  425 bits (1092), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 332/1110 (29%), Positives = 551/1110 (49%), Gaps = 104/1110 (9%)

Query: 6    IYGRIATL-ELFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPT 63
            +Y +++ L  L  P     D LF+ T+RY +C L WD+  + + T R   D++D   R +
Sbjct: 55   LYAKVSMLARLPAPAHSPTDHLFVGTDRYTYCTLSWDSAQNRIRTERNYVDIADPSSRES 114

Query: 64   DNGQIGIIDPDCRLIGLHLYDGLFKVIPF--------------------DNKGQLKEAFN 103
              G   +IDP  R + L +Y+GL  V+P                        G+L E   
Sbjct: 115  QTGNRCLIDPSGRFMTLEVYEGLVAVVPIVQLPARKRGRAPAVPTGPDAPKVGELGELTT 174

Query: 104  IRLEELQVLDIKFLY-GCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQN---- 158
             R++EL V    FL+     P + +LY+DN+     K   + +++ ++     S +    
Sbjct: 175  ARIDELFVRSSAFLHVQSGLPRLALLYEDNQ-----KKVRLMVRELNYTSATASTSADAT 229

Query: 159  ---------NLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIRP----SITKA 205
                      LD GA  LIPVP PL G+LI+GE +I Y   +  + I  RP    +I  A
Sbjct: 230  LTHIADFAQELDLGASHLIPVPAPLGGLLILGETSIKYVDDDNNEIIS-RPLDEATIFVA 288

Query: 206  YGRVDADGSRYLLGDHAG-LLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYI 264
            +  VD+   R+LL D  G L  L+++     +V G K++ LG TS AS + YL   +V++
Sbjct: 289  WEGVDS--QRWLLADDYGRLFFLMLVLDSDNQVQGWKLDHLGNTSRASALVYLGGGIVFV 346

Query: 265  GSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDL-----ERQ------GQG 313
            GS  GDSQ++++       GS+ E+++   N+ PI+DF ++DL     E Q      GQ 
Sbjct: 347  GSHQGDSQVLRI-----GNGSF-EIIQALSNIAPILDFTIMDLGNRTSESQTHEFSSGQA 400

Query: 314  QVVTCSGAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLR-SSTDDPFDTFLVVSFIS 371
            ++VT SGA+ DG+LR VR+G+G+ E   + +++ I  +W L+  +  D  DT L+V+FI 
Sbjct: 401  RIVTGSGAFDDGTLRSVRSGVGMEELGVLGDMEHITDLWGLQVQAGGDTLDT-LLVTFID 459

Query: 372  ETRILAMNLEDELEETE-IEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNE 430
            ETR+   + + E+EE +   G      TL   +    +++QVT   V +       +  E
Sbjct: 460  ETRVFHFSPDGEVEELDHFLGLSLSENTLLAANLPRGRILQVTEQRVLIADLEGGMVVYE 519

Query: 431  WKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD--GILTEVKHAQLEYEISCLDINPI 488
            W  P    +  A+AN   ++L  GG  +  L+IG    ++TE K    + ++S + +   
Sbjct: 520  WTPPNELVITAASANDDSLVLVIGGELMTVLDIGTEAQVITEKKFGA-DSQVSGVTVPA- 577

Query: 489  GENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLG--GEIIPRSVLLCAF--EGISYL 544
                S +++  VG      V +  L DL  +    LG  GE  PRSVL+     +    L
Sbjct: 578  ----SPTEVCVVGFPQLAKVSVLRLRDLTEVHTTSLGPAGEAFPRSVLVADVLADSPPTL 633

Query: 545  LCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTH-VFAASDRPTVIYS 603
              ++ DG ++ +        L+   K+ LG++  T +     +  + VFA  + P++IY 
Sbjct: 634  FISMADGSVITYSFKTDDYSLSHMNKLILGSEQPTFKKLPRGDGLYNVFATCENPSLIYG 693

Query: 604  SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHP 663
            S  +++YS VN +  S +C FNS A+P S+A+A   EL I  +D  +   I+++ +G   
Sbjct: 694  SEGRIIYSAVNSEGASRVCHFNSEAYPGSIAVATLHELKIALVDRERTTQIQTLQIGATV 753

Query: 664  RRICHQEQSRTFAICSLKNQSCAEESEMHFVRLL--DDQTFEFISTYPLDTFEYGCSILS 721
            RR+ +    + F I +++ +  A+ +E+   R +  D+  F  + ++ L   E   S++ 
Sbjct: 754  RRVAYSPSEKAFGIGTIERK-LADGAEIVTSRFMLADEVLFRQLDSFELRPEEIVESVIR 812

Query: 722  CSF--SDDSNV------YYCVGTAYVLPEENEPTKGRILVFIVEDG-KLQLIAEKETKGA 772
              F    D N        + VGTAY+  E  E  +GRIL+F +++G KL  +AE   KGA
Sbjct: 813  AEFPAGKDENGREMTKDRFVVGTAYLDDEGEESIRGRILMFEIDNGRKLTKVAELPVKGA 872

Query: 773  VYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGD 832
              +L     K++AA+ + I +YK +  + GT +L+           + V   G+ IVV D
Sbjct: 873  CRALAMLGDKIVAALVKTIVMYKVVNNNFGTMKLEKLASFRTSTAPVDVTVVGNVIVVSD 932

Query: 833  LMKSISLLIYKHEEG----AIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNS 888
            LMKS+ LL +K  E     ++ E AR +   W + V  +D D +L ++   NL  +R+N 
Sbjct: 933  LMKSVCLLEFKEGENGLPDSLTEVARHFQTVWATGVACIDKDTFLESDAEGNLIVLRRNL 992

Query: 889  EGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIAS 948
             G  +++R RLEV  E  LGE VNR R  ++      + V   P    GTV G I + A 
Sbjct: 993  AGVEEDDRRRLEVTSEISLGEMVNRIRPVNIQQL---ASVTVTPRAFLGTVEGSIYLFAI 1049

Query: 949  LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNE-KKTVDAKNFLDGDLIESFLDLS 1007
            +  E   FL +LQ  +   ++ +G +   ++R F +  ++  +   F+DG+LIE FL+  
Sbjct: 1050 INPEHQDFLMRLQATMAGKVESLGEMPFNEFRGFRSMVREATEPYRFVDGELIEQFLNCE 1109

Query: 1008 RTRMDEISKTMN-VSVEELCKRVEELTRLH 1036
                +EI  ++  ++V E+   +E L RLH
Sbjct: 1110 PELQEEIVNSVGMMNVHEVKVMIEALRRLH 1139


>gi|116195210|ref|XP_001223417.1| hypothetical protein CHGG_04203 [Chaetomium globosum CBS 148.51]
 gi|88180116|gb|EAQ87584.1| hypothetical protein CHGG_04203 [Chaetomium globosum CBS 148.51]
          Length = 1127

 Score =  424 bits (1090), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 322/1141 (28%), Positives = 549/1141 (48%), Gaps = 125/1141 (10%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPTD 64
            + G I+ L+  RP     D LF+ TER+++  L W+ E+S+L T     D  +R  R + 
Sbjct: 2    VNGTISILQKLRPKDSKTDLLFVGTERFEYFTLAWNPETSQLHTIDPFTDPGERHMRDSQ 61

Query: 65   NGQIGIIDPDCRLIGLHLYDGLFKVIPF---DNKGQLKEAFNIRLEELQVLDIKFLYG-C 120
            +    ++DP  R + +HL++G+  ++      NK  L     +RL EL +    FLY   
Sbjct: 62   SQDRCLVDPSGRYLAMHLWEGVLTMLRLGERKNKRGLAWMDQVRLSELFIKASTFLYTET 121

Query: 121  AKPTIVVLYQDNKDARHVK--TYEVALKDKDFVEGPWSQNN--------LDNGADLLIPV 170
              P I  LYQ   D+   K  TY + + D++     +  +          D  A +LIPV
Sbjct: 122  GHPKIAFLYQSRADSADAKLATYRMTIDDRNTEASKFDPDRDREIDTDIPDPSASILIPV 181

Query: 171  ------------------PPPLCGVLIIGEETIVY---CSANAFKAIPIRPSITKAYGRV 209
                                 + G+L++GE  ++Y    +    ++   + SI  A+   
Sbjct: 182  RKVEEEVKRHNVRNVESAKAHIGGLLVVGETRLLYIDEVTKATVQSALKQASIFVAWAEY 241

Query: 210  DADGSRYLLGDHAGLLHLLVITHEKEK---VTGLKIELLGETSIASTISYLDNAVVYIGS 266
            +   + Y L D  G LHLL +  E      VT +++  +G+TS AS + +L   ++++ S
Sbjct: 242  NE--TNYFLADDYGNLHLLTLVRESPDSVVVTSMEVNRIGKTSRASHLVFLGGDMLFVAS 299

Query: 267  SYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLER-----------QGQGQV 315
             YGDSQL +L+ Q +     +++++   N+GPI+DF ++D+ R            GQ ++
Sbjct: 300  HYGDSQLFRLDFQ-NEDVPPIQLVQTLANIGPILDFAIMDMGRGDEGQQGNEYSSGQARI 358

Query: 316  VTCSGAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLRSSTDDPFDTFLVVSFISETR 374
            VTCSG +KDGSLR VR+G+G+ +   + +L+  +G++SL++  + P    L  SF++ETR
Sbjct: 359  VTCSGVHKDGSLRSVRSGVGLEDVGILADLEHCRGLFSLKT-YEAPKTNILAASFLTETR 417

Query: 375  ILAMNLEDELEETE-IEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKS 433
            +   + + E+EE +   G     QTL        Q++QVT  +V L+ + S    + W  
Sbjct: 418  VFKFDPQGEVEELDSFAGMTFNQQTLLARCLSTAQILQVTPAAVTLLDAESGLTIDSWTP 477

Query: 434  PPGYSVNVATANASQVLLATGGGHLVYLEIGDGI-LTEVKHAQLEYEISCLDINPIGENP 492
                S+  A+ N   +LL+  G  LV   I   + L   K    + +I+C+ + P+    
Sbjct: 478  GAQKSIISASGNNKSLLLSVDGTELVLFSIDINLRLVRTKEIGYQDQIACIHVPPL---- 533

Query: 493  SYSQIAAVGMWTDISVRIFSLPDLNLITKEHL----GGEIIPRSVLLCAF---EGISYLL 545
              + +A VG W+  +V I  LP L  +  E L        IPR + L       G   L 
Sbjct: 534  -RNDLAVVGFWSSGTVSIIDLPSLEPMHGEQLRTSPDDASIPRDLALVKLLDNVGGGTLF 592

Query: 546  CALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHV---FAASDRPTVIY 602
             A+ DG+++ F L+ +   LT RK+V LG +          +  H+   FA ++ P++IY
Sbjct: 593  VAMQDGNVITFNLS-ENFRLTGRKRVILGMRQARFHLLPQPDAPHIYSIFATTEHPSLIY 651

Query: 603  SSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEH 662
             S  +++YS V  +E +++CPF++ AFPD + +A + +L I  ID  ++ H++ +P+ E 
Sbjct: 652  GSEGRIVYSAVTAEEATYICPFDTEAFPDCIVLATDNQLKISHIDRERRTHVKPLPMNEM 711

Query: 663  PRRICHQEQSRTFAI-CSLKNQSCAEESEMHFVRLLDDQTFEFI-STYPLD----TFEYG 716
             RRI +  + + F + C  +     EE      +L+D+  F+ +  T+PL     T    
Sbjct: 712  VRRIAYSPKEKVFGLGCIRRELVNGEEIVESSFKLVDEVIFDRVGKTFPLGVPSRTELVE 771

Query: 717  CSI---LSCSFSDDSNVYYCVGTAYVLPEENEP-------TKGRILVFIVE-DGKLQLIA 765
            C +   L  S+ + +   + VGT+++   +  P        +GR+LV  V+ D    L+ 
Sbjct: 772  CVVRAELRDSYGNPAE-RFLVGTSFLPDPDYGPGPGPAADARGRLLVLGVDADRNPYLVL 830

Query: 766  EKETKGAVYSLNAFN--------GKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHIL 817
              + KGA   L            G ++A + + + + ++      T EL     +     
Sbjct: 831  SHDLKGACRCLAVLGEDAGPNAAGLIVAGLTKTVVVCRYDETSSTTAELTRLASYRPSSY 890

Query: 818  ALYVQTRGDFIVVGDLMKSISLLIY--------------------KHEEGAIEERARDYN 857
               +  RG  I V DLMKSISL+ Y                      +   + E AR ++
Sbjct: 891  PAEIAVRGSTIAVADLMKSISLVEYIPAGSSSGSGGGGSGSGESGSDDGPRLVEHARHFS 950

Query: 858  ANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHG 917
            + W +AV  +++  +L A+   NL  +R+N EG T E++ R+EV  E +L E VNR R  
Sbjct: 951  SVWATAVGFVEEGSWLEADAQGNLMVLRRNVEGVTAEDKRRMEVTSEINLNEMVNRIRTI 1010

Query: 918  SLVMRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNH 976
              V   P + +  +P    GTV G I +  ++ PH Q L L + Q+ L  V+K  G +  
Sbjct: 1011 D-VETTPGAMI--VPKAFLGTVEGGIYMFGTVAPHVQDLLL-RFQSRLADVLKTAGDIEF 1066

Query: 977  EQWRSFNNEKKTVDAK-NFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRL 1035
              +R+F N ++  D    F+DG+L+E FLD+  T  + + K +  +VE++   VEEL R+
Sbjct: 1067 RTYRAFRNAEREGDGPFRFVDGELLEKFLDVDETTQEAVCKGLGPTVEDMRNLVEELRRM 1126

Query: 1036 H 1036
            H
Sbjct: 1127 H 1127


>gi|85107301|ref|XP_962347.1| hypothetical protein NCU06605 [Neurospora crassa OR74A]
 gi|28923951|gb|EAA33111.1| conserved hypothetical protein [Neurospora crassa OR74A]
          Length = 1158

 Score =  423 bits (1087), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 323/1122 (28%), Positives = 551/1122 (49%), Gaps = 109/1122 (9%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPTD 64
            I G I  L+  +P     D LF+ T+++++   +WD E+ +L T     D  +R  R + 
Sbjct: 55   INGTITILQKLQPKDHPTDLLFVGTDQFEYFTAEWDRETQQLKTLNRFSDPGERHMRDSQ 114

Query: 65   NGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNI----RLEELQVLDIKFLYG- 119
            +    I+DP  R + +HL++G+  V    N+       +I    RL EL +    FLY  
Sbjct: 115  SQNKCIVDPSGRFMAMHLWEGVLSVWRLGNRKNTATTLDILVQVRLSELFIKGSTFLYTE 174

Query: 120  CAKPTIVVLYQDNKDARHVK--TYEVALKDKDFVEGPWSQNN--------LDNGADLLIP 169
               P +  LY++  ++   K  TY +   D+      +             D GA +LIP
Sbjct: 175  TGIPKVAFLYRNQANSNETKLATYRLTSDDRHTEISKFDPTRDREIDADVEDPGAGILIP 234

Query: 170  VP------------------PPLCGVLIIGEETIVY---CSANAFKAIPIRPSITKAYGR 208
            V                   P + G++++GE  ++Y    +    ++    PSI  A+  
Sbjct: 235  VKKVEEEVKRHHFRNTEQAKPHVGGLIVVGETRLLYIDEVTKTQVESALREPSIFVAWAE 294

Query: 209  VDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSY 268
             D   + Y L D  G LHLL I  E   VTG+ +  +G+T+ A  ++YL + ++++GS Y
Sbjct: 295  YDP--THYFLSDDYGNLHLLTILTEGAVVTGMDVSNIGKTARAHVLTYLGDDMLFVGSHY 352

Query: 269  GDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQ------------GQGQVV 316
            G+SQL +LNL  +     +++++   N+GP+ DF V+D+  +            GQ ++V
Sbjct: 353  GNSQLYRLNLLSEDLSEILQLVQVLENIGPVTDFTVMDMGNRENDSQLGNEYSSGQARIV 412

Query: 317  TCSGAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLRSSTDDPFDTFLVVSFISETRI 375
            T SG +KDG+LR VR+G+G+ + A + ELQ  + ++SL+S      DT LV SF+++TRI
Sbjct: 413  TASGVFKDGTLRSVRSGVGLQDIAILGELQHTRALFSLQSYNSPRVDT-LVASFLTDTRI 471

Query: 376  LAMNLEDELEETE-IEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSP 434
               +   E+EE +   G   Q QTL   +    QL+QVT+ +  L+ + S      W   
Sbjct: 472  FKFDPHGEIEEVDNYYGMDLQHQTLLATNLDNGQLLQVTTAAATLLDAESGVTIASWAPE 531

Query: 435  PGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQL--EYEISCLDINPIGENP 492
                +  A+AN   +LL+  G  LV + I D  LT V+   +  + +I+C+ +      P
Sbjct: 532  GDRQIINASANKHWLLLSVQGTTLVSINI-DNDLTVVQEKDISEQDQIACIHV-----AP 585

Query: 493  SYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEI----IPRSVLLCAF----EGISYL 544
              S +  VG WT  +V I  +  L  I  E L        IPR ++L        G++ L
Sbjct: 586  QLSDVGVVGFWTSGTVSIIDMSTLEPIHGESLRRSADDASIPRDLVLAKVLPNAPGMT-L 644

Query: 545  LCALGDGHLLNFLLNMKTGE---LTDRKKVSLGTQPITLRTFSSKNTTH-VFAASDRPTV 600
              A+ DG+++ F      GE    + RK V LGT+         ++  + +FA ++ P++
Sbjct: 645  FIAMEDGNVVTF----NIGEDLTFSGRKSVILGTREARFHLLPQQDGIYSIFATTEHPSL 700

Query: 601  IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG 660
            IY S  +++YS V  ++ + +CPF+S AFP ++ ++ E E+ I  ID  ++ H+RS+ LG
Sbjct: 701  IYGSEGRIIYSAVTAEDATCVCPFDSEAFPGAVILSTENEIRISEIDTARQTHVRSLELG 760

Query: 661  EHPRRICHQEQSRTFAICSLKNQSCAEESEMH-FVRLLDDQTFEFIST-YPLDTFEYGCS 718
            E  RRI +    + F +  ++ +    E  +H   +L+D+  F  +   + L T  Y   
Sbjct: 761  EMVRRIAYSPSEKGFGLGCIRREVVNGEEIIHSSFKLVDEILFARVGKEFMLGTSSYSEL 820

Query: 719  ILSCSFSDDSNVY------YCVGTAYVL-PEENEPT--KGRILVFIVEDGK-LQLIAEKE 768
            +     ++  + Y      + VGT+++  P+    T  +GRILVF ++  +   L+ + E
Sbjct: 821  VEDVIRAELPDSYGNLVERFIVGTSFLEDPDRGAGTDKRGRILVFGIDSNRDPYLVLKHE 880

Query: 769  TKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFI 828
             KG   +L     K++AA+++ + + ++         L     +      + +   G+ I
Sbjct: 881  LKGGCRALAVMGSKIVAALHKTVVISQYEETSSTEAHLVKLASYRCTTYPVDIAVHGNMI 940

Query: 829  VVGDLMKSISLLIY--------KHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFN 880
             V D+MKS +L+ Y        K E   + E AR  ++ W +AV  ++ + +L A+ N N
Sbjct: 941  AVADMMKSATLVEYVPAKTGGEKSEAPKLVECARHRHSAWATAVAHVEGESWLEADANGN 1000

Query: 881  LFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVN 940
            L  +++N+EG T E++ +L +  E +LGE VN+ R    V   P++ +  IP     T  
Sbjct: 1001 LIVLQRNAEGVTVEDQRQLRITSELNLGEQVNKIRPIK-VETSPNAII--IPRAFLATAE 1057

Query: 941  GVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAK------NF 994
            G I +  ++  EQ L L + Q  L  VIK VG L+   +R+F N ++  +A        F
Sbjct: 1058 GGIYMFGTIAREQDLLL-RFQDKLAAVIKTVGELDFNSYRAFRNAERGPEADGTTGPVRF 1116

Query: 995  LDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRLH 1036
            LDG+L+E FLD+      EI + +  SVE++   VEEL R+H
Sbjct: 1117 LDGELLERFLDVDEKTQKEICEGLGPSVEQMRNMVEELRRMH 1158


>gi|358056808|dbj|GAA97158.1| hypothetical protein E5Q_03834 [Mixia osmundae IAM 14324]
          Length = 1243

 Score =  423 bits (1087), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 334/1182 (28%), Positives = 553/1182 (46%), Gaps = 169/1182 (14%)

Query: 2    LDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAE--SSELITRAMG------ 53
            L++P++G I  L+  R    +   L + T    F VL ++ +  +    TR++G      
Sbjct: 84   LEIPLFGTILDLQPIRFKDRSTSSLLLLTTSLHFSVLTYEPDFLTPASGTRSLGWSNARV 143

Query: 54   ------DVSDRIGRPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKG----------- 96
                   VS+R GR ++  Q  ++DP  R + LH+Y GL +V+P  +             
Sbjct: 144  QDEATLSVSERAGRQSEEAQTILVDPHNRCVLLHIYCGLIRVLPIKSSAGAASKPSRRRS 203

Query: 97   ---------------------------------QLKEAFNIRLEELQVLDIKFLY--GCA 121
                                              +  ++NIRL  L V  I FL     +
Sbjct: 204  SAAAPVGELSSSLKKSSGKARASAAASAESDGIDVFRSYNIRLPYLNVKHIAFLPLPESS 263

Query: 122  KPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNL---DNGADLLIPVPPPLC--- 175
             PTI +L+ +      +  + ++LKDK+ V        L   D   DL IP+   L    
Sbjct: 264  LPTIALLHTNFLGQNVLSIHAISLKDKELVNSAKPDFELVLDDETVDLFIPLERNLATGQ 323

Query: 176  --GVLIIGEETI----------------------------VYCSANAFKAIPIRPS---- 201
              G+L++G   I                            V  +A+  ++ P  P+    
Sbjct: 324  PLGLLLVGSGQINWLPLPLLDTIASPPISPTTARRASISSVSTNASGRRSAPPSPTNTRK 383

Query: 202  -------------------ITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKI 242
                               I + Y  +D D S  L+GD  G L  + I  + E+VT L +
Sbjct: 384  SKGKERASDRRVTSKVPVGIYRTYCAIDHDPSLLLVGDEEGYLVSVRIGLQGERVTSLDV 443

Query: 243  ELLGETSIASTISYLDNAVVYIGSSYGDSQLIKL-NLQPDA-KGSYVEVLERYVNLGPIV 300
              LG+    ++++++ +  V++GS YGDS L+ +  L  D  +   V V+    NL PIV
Sbjct: 444  VDLGKVPSPTSLTHIADEYVFVGSYYGDSSLVAIPALTSDGMECDSVSVISAMANLAPIV 503

Query: 301  DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP 360
            DFCVV  +  GQ  +VTCSGA   GSLR+VR G+G++  A+++L  ++  W L+ ++   
Sbjct: 504  DFCVVT-DDVGQSHLVTCSGAKNSGSLRLVRQGVGLSILATIDLPAVQNAWPLKLASSSI 562

Query: 361  FDTFLVVSFISETRILAM---NLEDELEETEIEGFCSQTQTLFCH---DAIYNQLVQVTS 414
                ++VSF+  +++L++   NL      +E+        TL+     DA     +  T+
Sbjct: 563  KHDAILVSFLDRSQLLSLCNGNL------SELASPALTEPTLYAGTLGDAGAQVAICATA 616

Query: 415  GSV-RLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKH 473
             ++ RL      EL   W SP G  +  A+++ S +LLAT    LV + +    +     
Sbjct: 617  KAILRLSIEKPSEL---WTSPTGDLITAASSDGSSLLLATSSKALVLMTLSPNGIALTTT 673

Query: 474  AQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSV 533
             Q   E+SCL         + SQIAAV  W+  ++ ++SLP L  +T   L  + +P+S+
Sbjct: 674  KQTISEVSCL---AAWTTTTGSQIAAVASWSTNAILLYSLPHLEPVTGAELSFDHLPQSM 730

Query: 534  LLCAFEGIS-YLLCALGDGHLLNFLLNMKTGELT--DRKKVSLGTQPITLRTFSSKNTTH 590
            L   FE  + +L   LG G L++F ++  +G +    RK V+LG +P+ L   S+     
Sbjct: 731  LFQKFEDDAVHLFVGLGSGDLISFGIDAASGAVLPLSRKSVTLGKKPVLLSQCSAAGQPA 790

Query: 591  VFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQ 650
            VFA +DRPTV+  S  +L Y++ N + +  +   ++  F  SL +     +     D  +
Sbjct: 791  VFAVTDRPTVVSRSAGRLSYASDNRRTLVAINQIDALRFEQSLMLVSPEGIQFARADGNE 850

Query: 651  KLHIRSIPLGE-HPRRICHQEQSRTFAICSLK---NQSCAEESEMHFVRLLDDQTFEFIS 706
             LH+RS+ LGE  PR+I H  + R +A+  L+   ++S    +    V+ +D   F  + 
Sbjct: 851  SLHVRSLSLGELQPRKIAHSAELRAYAVLCLQETIDRSTGHLTRAGSVQFVDASDFALLD 910

Query: 707  TYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYV--LPEENEPTKGRILVFIVEDGKLQLI 764
            ++ L + E+G ++ + S       ++ VGTA+     +  EP KGR+L F+ +  K +  
Sbjct: 911  SFDLQSDEHGTALETVSL--HGAAHFAVGTAFSDRTVDAREPKKGRVLTFMRDGDKFEQH 968

Query: 765  AEKETKGAVYSLNAFNGKLLAAI-NQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQT 823
                 +G V+ L       LAAI N +++++      +     Q  C   G  LA  + +
Sbjct: 969  VHAVLEGGVFGLCQLPNSFLAAIANAQVKVFHVT---EQAHIDQMTC-WAGTFLAQSISS 1024

Query: 824  RGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEIL--DDDIYLGAENNFNL 881
            R   I+VGDL +S+ LL +   +  + E AR+++ N MSAVE L   DD Y+G E   N+
Sbjct: 1025 RDSQIIVGDLYRSVVLLQWDEAKDTLSEVAREHHVNGMSAVEFLGFTDDRYIGTEQELNI 1084

Query: 882  FTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNG 941
            FT+ K     T E    LE  G +H+GE+V R R G+LV    D+  G  P ++FGT +G
Sbjct: 1085 FTLTKTK---TRERIDILETEGMFHIGEYVTRIRKGALVPGYTDTSFGAAPQLLFGTSDG 1141

Query: 942  VIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIE 1001
             +GVI +   E  L L  L+ N+R VI+  GGL    WR+F    +  +   F+DGD+I 
Sbjct: 1142 SLGVIVNCTPEVSLKLFALERNMRAVIRAFGGLEQVDWRAFRAPHRVHEPVGFVDGDMIG 1201

Query: 1002 SFLDLSRTRMDEI-------SKTMNVSVEELCKRVEELTRLH 1036
             F +L+ T+++++       S   N + E+L + V+EL R+H
Sbjct: 1202 RFAELNETQVNQVLQGASEHSALTNTTAEDLYRLVDELQRMH 1243


>gi|336263557|ref|XP_003346558.1| hypothetical protein SMAC_04731 [Sordaria macrospora k-hell]
 gi|380090453|emb|CCC11749.1| unnamed protein product [Sordaria macrospora k-hell]
          Length = 1149

 Score =  422 bits (1085), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 324/1111 (29%), Positives = 543/1111 (48%), Gaps = 110/1111 (9%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPTD 64
            I G I  L+  +P     D LF+ T+++++   +WD E+ +L T     D  +R  R + 
Sbjct: 55   INGTITILQKLQPKDHPTDLLFVGTDQFEYFTAEWDHETQQLKTLNRFSDPGERHMRDSQ 114

Query: 65   NGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNI----RLEELQVLDIKFLYG- 119
            +    I+DP  R + +HL++G+  V    N+       +I    RL EL +    FLY  
Sbjct: 115  SQDKCIVDPSGRFMAMHLWEGVLSVWRLGNRKSTATTLDILVQVRLSELFIKGSTFLYTE 174

Query: 120  CAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVP-------- 171
               P +  LY++  ++   K       D D VE P        GA +LIPV         
Sbjct: 175  TGIPKVAFLYRNQANSNETKLATDREIDAD-VEDP--------GAGILIPVKKVEEEVKR 225

Query: 172  ----------PPLCGVLIIGEETIVY---CSANAFKAIPIRPSITKAYGRVDADGSRYLL 218
                      P + G+++IGE  ++Y    +    ++    PSI  A+   D   + Y L
Sbjct: 226  HHFRNTEQAKPHVGGLIVIGETRLLYIDEVTKTQVESALKEPSIFVAWAEYDP--THYFL 283

Query: 219  GDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNL 278
             D  G LHLL I  E   VTG+ +  +G T+ A  ++YL + ++++GS YG+SQL +LNL
Sbjct: 284  ADDYGNLHLLTILTEGAVVTGMDVSNIGRTARAHVLTYLGDDMLFVGSHYGNSQLYRLNL 343

Query: 279  QPDAKGSYVEVLERYVNLGPIVDFCVVDLERQ------------GQGQVVTCSGAYKDGS 326
              +     +++++   N+GPI DF ++D+  +            GQ ++VT SG +KDG+
Sbjct: 344  LNEDLNEILQLVQVLENIGPITDFTIMDMGNRENDSQLGNEYSSGQARIVTASGIFKDGT 403

Query: 327  LRIVRNGIGINEQASV-ELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELE 385
            LR VR+G+G+ + A + ELQ  + ++SL+S      DT LV SF+++TRI   +   E+E
Sbjct: 404  LRSVRSGVGLQDIAILGELQHTRALFSLQSYNSSRADT-LVASFLTDTRIFRFDPHGEIE 462

Query: 386  ETEIEGFCS---QTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVA 442
              E+  +C    Q QTL   +    QL+QVT+ +  L+ + S      W       +  A
Sbjct: 463  --EVADYCGMDLQHQTLLTTNLDNGQLLQVTTAAATLLDAESGVTIASWAPEGDRQIINA 520

Query: 443  TANASQVLLATGGGHLVYLEI-GDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVG 501
            +AN   +LL+  G  LV + I  D  + + K    + +I+C+ +      P  S +  VG
Sbjct: 521  SANKHWLLLSVQGTTLVSINIDNDLTVVQEKDVSEQDQIACIHV-----APQLSDVGVVG 575

Query: 502  MWTDISVRIFSLPDLNLITKEHLGGEI----IPRSVLLCAF----EGISYLLCALGDGHL 553
             WT  +V I  +  L  I  E L        IPR ++L        G++ L  A+ DG++
Sbjct: 576  FWTSGTVSIIDMSTLEPIHGESLRRSADDASIPRDIVLAKVLPNTPGMT-LFIAMEDGNV 634

Query: 554  LNFLLNMKTGE---LTDRKKVSLGTQPITLRTFSSKNTTH-VFAASDRPTVIYSSNKKLL 609
            + F      GE    + RK V LGT+         ++  + +FA ++ P++IY S  +++
Sbjct: 635  VTF----NIGEDLTFSGRKSVILGTREARFHLLPQQDGIYSIFATTEHPSLIYGSEGRII 690

Query: 610  YSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQ 669
            YS V  ++ + +CPF+S AFP ++ ++ E E+ I  ID  ++ H+RS+ LGE  RRI + 
Sbjct: 691  YSAVTAEDATCVCPFDSEAFPGAVVLSTETEIKISEIDTARRTHVRSLELGEMVRRIAYS 750

Query: 670  EQSRTFAI-CSLKNQSCAEESEMHFVRLLDDQTFEFIST-YPLDTFEYGCSILSCSFSDD 727
               + F + C  +     EE      +L+D+  F      + L T  Y   +     ++ 
Sbjct: 751  PSEKGFGLGCIRREMVNGEEIIQSSFKLVDEILFARAGREFRLGTSSYSELVEDVIRAEL 810

Query: 728  SNVY------YCVGTAYVL-PEENEPT--KGRILVFIVEDGK-LQLIAEKETKGAVYSLN 777
             + Y      + VGT+++  P+    T  +GRILVF ++  +   L+ + E +GA  +L 
Sbjct: 811  PDSYGNLLERFIVGTSFLEDPDRGAGTDKRGRILVFGIDSNRDPYLVLKHELRGACRALA 870

Query: 778  AFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSI 837
                K++AA+++ + + ++         L     +      + +   G+ I V D+MKS 
Sbjct: 871  VMGSKIVAALHKTVVISQYEETSSTEARLVKLASYRCTTYPIDIAVHGNIIAVADMMKSA 930

Query: 838  SLLIY--------KHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSE 889
            +L+ Y        K+E   + E AR  ++ W +AV  ++ + +L A+ N NL  +++N E
Sbjct: 931  TLVEYVQAKTEEEKYEPAKLVECARHRHSAWATAVAHVEGESWLEADANGNLVVLQRNVE 990

Query: 890  GATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASL 949
            G T E++ +L +  E +LGE VN+ R    V   P++ +  IP     T  G I +  ++
Sbjct: 991  GVTAEDQRQLRITSELNLGEQVNKIRPIK-VETSPNTII--IPRAFLATAEGGIYLFGTI 1047

Query: 950  PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKK------TVDAKNFLDGDLIESF 1003
              EQ L L + Q  L  VIK VG L+   +R+F N ++      T     FLDG+L+E F
Sbjct: 1048 AREQDLLL-RFQDKLAAVIKTVGELDFNSYRAFRNAERGPETDGTTGPVRFLDGELLERF 1106

Query: 1004 LDLSRTRMDEISKTMNVSVEELCKRVEELTR 1034
            LD+  T   EI + +  SVE++   VEEL R
Sbjct: 1107 LDVDETTQKEICEGLGPSVEQMRNMVEELRR 1137


>gi|347838030|emb|CCD52602.1| similar to DDB1B (Damaged DNA Binding protein 1 B); damaged DNA
            binding / protein binding [Botryotinia fuckeliana]
          Length = 1157

 Score =  420 bits (1079), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 334/1125 (29%), Positives = 568/1125 (50%), Gaps = 116/1125 (10%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPTD 64
            IYGRI+ L+  +P G   D LF+ T R ++  + W+ ++ +L T ++  DVS    R ++
Sbjct: 55   IYGRISMLQKIQPAGSKTDHLFVGTVRAQYFTVMWNPQTHKLDTMQSFVDVSQEHMRDSE 114

Query: 65   NGQIGIIDPDCRLIGLHLYDGLFKVI----PFDNKGQ-LKEAFNIRLEELQVLDIKFLYG 119
            +    ++DP  RL+ + LY+G+  ++    P   K   L+    +R+ E++V    FLY 
Sbjct: 115  SRDRCLVDPTGRLLVMELYEGVLNLVKIVKPRGGKTDYLENPEQVRISEMKVRASAFLYT 174

Query: 120  CAK-PTIVVLYQDNKDARHVKTYEVALKDK-----DFVEGPWSQNNLDN---GADLLIPV 170
              K P + +LYQD ++   + TY + L DK      F      +N++D+   GA  +IPV
Sbjct: 175  DTKQPKLALLYQDARENVKLATYRM-LDDKGQLILQFDPKKNRENDVDDLCVGAMHIIPV 233

Query: 171  PPP-------------------LCGVLIIGEETIVYCSANAFKAIPIRPSITKAY---GR 208
            P                     L G++++GE    Y    + KAI +  ++ +A      
Sbjct: 234  PKARDEASKRYIVRNATTAKANLGGLVVLGETKFTYLDDES-KAI-VEYALDEAVLWAAW 291

Query: 209  VDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSY 268
               D   YLLGD  G L++L I  +   VTGL++  LG+ S  +++  L N V YI S  
Sbjct: 292  EPIDERTYLLGDDYGFLYILTILVDGATVTGLQVVKLGQVSKPTSLENLGNGVFYIASHE 351

Query: 269  GDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDL-----ERQ------GQGQVVT 317
             D+Q+++++L+    G  V +L+   N+ PI+DF V+D+     E Q      GQ ++VT
Sbjct: 352  ADNQVVQIDLESPEHG--VTLLQTLPNIAPILDFTVMDMGGREGETQLNEYSSGQARLVT 409

Query: 318  CSGAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLRSSTDDPFDTFLVVSFISETRIL 376
             SG ++ GSLR VR+G+G+++ A + E++GI+ +++L S    P DT LVVSF +ETR  
Sbjct: 410  GSGGFEGGSLRSVRSGVGLDDTAILAEMEGIRKVFALHSGPTLPNDT-LVVSFSTETRFF 468

Query: 377  AMNLEDELEETE-IEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPP 435
              + + ++EE E ++   S ++TL  ++     ++QVT   V +   +     + W+ P 
Sbjct: 469  KFDTQGDIEEVETVKNLSSTSETLLTYNLDEGCILQVTQHEVSIYGKSPG---HRWQPPN 525

Query: 436  GYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQL-EYEISCLDINPIGENPSY 494
            G  +  A+ N + +LL++ G  L  L I    L EV   +L + +++C+ +  +      
Sbjct: 526  GQIITAASGNQNYILLSSNGRTLFTLSIQQN-LAEVAFQELGDDQVACIHVPQV-----M 579

Query: 495  SQIAAVGMWTDISVRIFSLPDLNLITKEHL---GGEIIPRSVLLCA-----FEGISYLLC 546
              I  VG+W   SV +  L  LN I  E L    G  IPR + L         G + L  
Sbjct: 580  GDIGVVGLWKSGSVSLLDLATLNTIVSEDLRRADGASIPRDIALTQILPPELSGPT-LFV 638

Query: 547  ALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTT-HVFAASDRPTVIYSSN 605
            ++ DG +L+F ++     L+ RK + LGTQ   L+     NTT +VFA  + P++IY S 
Sbjct: 639  SMEDGIVLSFNVDKSDCSLSGRKSIVLGTQQAQLQILPRDNTTFNVFATCEHPSLIYGSE 698

Query: 606  KKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRR 665
             + +YS V  ++   +C  NS A+P S+ +A   EL +  ID+ ++ H+R++P+GE  RR
Sbjct: 699  GRTVYSAVTAEDAIAVCSLNSVAYPGSVVVATINELKLAVIDNERRTHVRTLPIGETVRR 758

Query: 666  ICHQEQSRTFAICSLKNQ-SCAEESEMHFVRLLDDQTF-EFISTYPL----DTFEYGCSI 719
            + +  + ++FAI ++K + +  +E      RL+D+  F E    Y L    +  E     
Sbjct: 759  VAYSAKEKSFAIGAIKRELTKGQEVVTTSFRLVDEVVFGELGEPYYLPPNNEIIETVVRA 818

Query: 720  LSCSFSDDSNVY--YCVGTAYVLPEENEPTKGRILVFIVE-DGKLQLIAEKETKGAVYSL 776
               +   D  +   + VGT++ L EE    +GR+L+F V  D    +IA    KG+   +
Sbjct: 819  ELPTRHGDGELVERFLVGTSF-LHEEEANVRGRLLIFGVNADRAPYMIASHNLKGSCRCI 877

Query: 777  NAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKS 836
               +GK++AA+N+ + +Y +      +  L+    +      + +    + I V D+MKS
Sbjct: 878  GVLDGKIVAALNKTVVMYDYEETSSTSATLKKLATYRCSTCPIDIDITDNIIAVADIMKS 937

Query: 837  ISLLIYKHEEGA------IEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEG 890
            I+L+ Y    GA      +EE AR     + ++V  +D D YL  +++ NL  +++N EG
Sbjct: 938  IALVEYT--PGADGLPDKLEEVARHAQQVFSTSVAEVDTDTYLETDHDGNLILLKRNREG 995

Query: 891  ATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLP 950
             T E++ R+EV  E +LGE VNR +  ++       D   IP    GT  G I + + +P
Sbjct: 996  VTREDKTRMEVTCEMNLGEMVNRVKRINVET---SKDALLIPRAFLGTTEGSIYLFSLIP 1052

Query: 951  HEQYLFLEKLQTNLRKV-----IKGV-------------GGLNHEQWRSF-NNEKKTVDA 991
             +    L +LQ+ L  +     I+G              G L+  ++RS+ +  ++T + 
Sbjct: 1053 PQNQDLLMRLQSRLASLPSASSIRGSSDSTSPHQIELSPGNLDFNKYRSYISATRETSEP 1112

Query: 992  KNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRLH 1036
              F+DG+LIE FLDL     + +++ + V  E+L   VE L RLH
Sbjct: 1113 FRFVDGELIERFLDLEVEVQEHVAEGLGVKAEDLRGMVEGLRRLH 1157


>gi|336470789|gb|EGO58950.1| hypothetical protein NEUTE1DRAFT_145059 [Neurospora tetrasperma FGSC
            2508]
 gi|350291855|gb|EGZ73050.1| hypothetical protein NEUTE2DRAFT_156600 [Neurospora tetrasperma FGSC
            2509]
          Length = 1158

 Score =  419 bits (1078), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 323/1122 (28%), Positives = 550/1122 (49%), Gaps = 109/1122 (9%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPTD 64
            I G I  L+  +P     D LF+ T+++++   +WD E+ +L T     D  +R  R + 
Sbjct: 55   INGTITILQKLQPKDHPTDLLFVGTDQFEYFTAEWDRETQQLKTLNRFSDPGERHMRDSQ 114

Query: 65   NGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNI----RLEELQVLDIKFLYG- 119
            +    I+DP  R + +HL++G+  V    N+       +I    RL EL +    FLY  
Sbjct: 115  SQNKCIVDPSGRFMAMHLWEGVLSVWRLGNRKNTATTLDILVQVRLSELFIKGSTFLYTE 174

Query: 120  CAKPTIVVLYQDNKDARHVK--TYEVALKDKDFVEGPWSQNN--------LDNGADLLIP 169
               P +  LY++  ++   K  TY +   D+      +             D GA +LIP
Sbjct: 175  TGIPKVAFLYRNQANSNETKLATYRLTSDDRHTEISKFDPTRDREIDADVEDPGAGILIP 234

Query: 170  VP------------------PPLCGVLIIGEETIVY---CSANAFKAIPIRPSITKAYGR 208
            V                   P + G++++GE  ++Y    +    ++     SI  A+  
Sbjct: 235  VKKVEEEVKRHHFRNTEQAKPHVGGLIVVGETRLLYIDEVTKTQVESALREASIFVAWAE 294

Query: 209  VDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSY 268
             D   + Y L D  G LHLL I  E   VTG+ +  +G+T+ A  ++YL + ++++GS Y
Sbjct: 295  YDP--THYFLSDDYGNLHLLTILTEGAVVTGMDVSNIGKTARAHVLTYLGDDMLFVGSHY 352

Query: 269  GDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQ------------GQGQVV 316
            G+SQL +LNL  +     +++++   N+GPI DF V+D+  +            GQ ++V
Sbjct: 353  GNSQLYRLNLLSEDLSEILQLVQVLENIGPITDFTVMDMGNRENDSQLGNEYSSGQARIV 412

Query: 317  TCSGAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLRSSTDDPFDTFLVVSFISETRI 375
            T SG +KDG+LR VR+G+G+ + A + ELQ  + ++SL+S      DT LV SF+++TRI
Sbjct: 413  TASGVFKDGTLRSVRSGVGLQDIAILGELQHTRALFSLQSYNSPRVDT-LVASFLTDTRI 471

Query: 376  LAMNLEDELEETE-IEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSP 434
               +   E+EE +   G   Q QTL   +    QL+QVT+ +  L+ + S      W   
Sbjct: 472  FKFDPHGEIEEVDNYYGMDLQHQTLLATNLDNGQLLQVTTAAATLLDAESGVTIASWAPE 531

Query: 435  PGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQL--EYEISCLDINPIGENP 492
                +  A+AN   +LL+  G  LV + I D  LT V+   +  + +I+C+ +      P
Sbjct: 532  GDRQIINASANKHWLLLSVQGTTLVSINI-DNDLTVVQEKDISEQDQIACIHV-----AP 585

Query: 493  SYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEI----IPRSVLLCAF----EGISYL 544
              S +  VG WT  +V I  +  L  I  E L        IPR ++L        G++ L
Sbjct: 586  QLSDVGVVGFWTSGTVSIIDMSTLEPIHGESLRRSADDASIPRDLVLAKVLPNAPGMT-L 644

Query: 545  LCALGDGHLLNFLLNMKTGE---LTDRKKVSLGTQPITLRTFSSKNTTH-VFAASDRPTV 600
              A+ DG+++ F      GE    + RK V LGT+         ++  + +FA ++ P++
Sbjct: 645  FIAMEDGNVVTF----NIGEDLTFSGRKSVILGTREARFHLLPQQDGIYSIFATTEHPSL 700

Query: 601  IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG 660
            IY S  +++YS V  ++ + +CPF+S AFP ++ ++ E E+ I  ID  ++ H+RS+ LG
Sbjct: 701  IYGSEGRIIYSAVTAEDATCVCPFDSEAFPGAVILSTENEIRISEIDTARQTHVRSLELG 760

Query: 661  EHPRRICHQEQSRTFAICSLKNQSCAEESEMH-FVRLLDDQTFEFIST-YPLDTFEYGCS 718
            E  RRI +    + F +  ++ +    E  +H   +L+D+  F  +   + L T  Y   
Sbjct: 761  EMVRRIAYSPSEKGFGLGCIRREVVNGEEIIHSSFKLVDEILFARVGKEFMLGTSSYSEL 820

Query: 719  ILSCSFSDDSNVY------YCVGTAYVL-PEENEPT--KGRILVFIVEDGK-LQLIAEKE 768
            +     ++  + Y      + VGT+++  P+    T  +GRILVF ++  +   L+ + E
Sbjct: 821  VEDVIRAELPDSYGNLVERFIVGTSFLEDPDRGAGTDKRGRILVFGIDSNRDPYLVLKHE 880

Query: 769  TKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFI 828
             KG   +L     K++AA+++ + + ++         L     +      + +   G+ I
Sbjct: 881  LKGGCRALAVMGSKIVAALHKTVVISQYEETSSTEAHLVKLASYRCTTYPVDIAVHGNII 940

Query: 829  VVGDLMKSISLLIY--------KHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFN 880
             V D+MKS +L+ Y        K E   + E AR  ++ W +AV  ++ + +L A+ N N
Sbjct: 941  AVADMMKSATLVEYVPAKTGGEKSEAPKLVECARHRHSAWATAVAHVEGESWLEADANGN 1000

Query: 881  LFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVN 940
            L  +++N+EG T E++ +L +  E +LGE VN+ R    V   P++ +  IP     T  
Sbjct: 1001 LIVLQRNAEGVTVEDQRQLRITSELNLGEQVNKIRPIK-VETSPNAII--IPRAFLATAE 1057

Query: 941  GVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAK------NF 994
            G I +  ++  EQ L L + Q  L  VIK VG L+   +R+F N ++  +A        F
Sbjct: 1058 GGIYMFGTIVREQDLLL-RFQDKLAAVIKTVGELDFNSYRAFRNAERGPEADGTTGPVRF 1116

Query: 995  LDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRLH 1036
            LDG+L+E FLD+      EI + +  SVE++   VEEL R+H
Sbjct: 1117 LDGELLERFLDVDEKTQKEICEGLGPSVEQMRNMVEELRRMH 1158


>gi|171691144|ref|XP_001910497.1| hypothetical protein [Podospora anserina S mat+]
 gi|170945520|emb|CAP71632.1| unnamed protein product [Podospora anserina S mat+]
          Length = 1158

 Score =  419 bits (1077), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 326/1117 (29%), Positives = 550/1117 (49%), Gaps = 100/1117 (8%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSEL-ITRAMGDVSDRIGRPTD 64
            + G IA L+  RP     D LF+ T+R+++  L W+ E+S++  T A  D  +   R + 
Sbjct: 56   VNGTIAMLQKLRPKDAETDLLFVGTDRFEYFTLYWNPETSQMETTNATRDPGEHFMRNSQ 115

Query: 65   NGQIGIIDPDCRLIGLHLYDGLFKVIPF----DNKGQLKEAFNIRLEELQVLDIKFLYG- 119
            +    I+DP  R I +HL++G+  +        N  QL     IRL EL +    FL+  
Sbjct: 116  SLDRAIVDPSGRFIAMHLWEGVMTIARLGTRKTNAAQLDWMGQIRLAELFIKASTFLHNE 175

Query: 120  CAKPTIVVLYQDNKDARHVK--TYEVALKDKDFVEGPWSQNN--------LDNGADLLIP 169
               PT+  LYQ + +A+  K  TY +   D++ V   ++            D GA++LIP
Sbjct: 176  TGHPTVAFLYQTSANAQDSKLATYRLTSDDRNTVASEFNAQKHRIIDITIADAGANMLIP 235

Query: 170  V------------------PPPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRV-- 209
            V                   P L GV+++GE  ++Y   +  KA  +   + KA   V  
Sbjct: 236  VRKVEEEVKRHNFRNTGSAKPHLGGVIVVGETRLLYID-DVTKAT-VESKLDKASIFVKW 293

Query: 210  -DADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSY 268
             + +   Y L D  G LHLL I  +  +V G+ +  +G TS AS + YL N ++++ S +
Sbjct: 294  AEYNVQTYFLADDYGSLHLLTINTDGAEVKGMVLTKIGVTSRASELVYLGNEMLFVASHH 353

Query: 269  GDSQLIKLNLQPD--AKGSYVEVLERYVNLGPIVDFCVVDLERQG-------------QG 313
            GDS+L +L+L  D  A   ++ +++   N+GPI+DF V+D+  +G             Q 
Sbjct: 354  GDSRLFQLDLSADKPADKPFLTLIQTISNIGPIMDFAVMDMGNRGGEDSQLGNEYSSGQA 413

Query: 314  QVVTCSGAYKDGSLRIVRNGIGINEQASVEL----QGIKGMWSLRSSTDDPFDTFLVVSF 369
            ++V  SG YKDGSLR VR+G+G+ E   + L    Q ++G++SLR +  +     L VSF
Sbjct: 414  RIVCGSGVYKDGSLRSVRSGVGL-EDVGLLLEDLGQHVRGVFSLRGAVGEGKMDTLAVSF 472

Query: 370  ISETRILAMNLEDELEET-EIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELR 428
            ++ETR+   + E  +EE  +  GF    QTL   +     ++QVT+  V L+ + S    
Sbjct: 473  LTETRVFKFDSEGGVEEVGDFMGFGLDCQTLLARNLGGGMILQVTTRGVVLIDAESGVTV 532

Query: 429  NEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQL--EYEISCLDIN 486
              W      ++  A+AN   +LL+  G  LV +      L  VK   +  + +++C+ + 
Sbjct: 533  ATWVPRDENTIINASANGEWLLLSVEGTGLVSISTAGNELRLVKEKDISQQDQVACIHVA 592

Query: 487  PIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHL----GGEIIPRSVLLC--AFEG 540
            P         I  VG WT  +V I  L  L  +  E L        IPR V+L   A   
Sbjct: 593  P-----QLQGIGVVGFWTSGTVSIIDLNTLEPMHGESLRQSQDDASIPREVVLVQVASPK 647

Query: 541  IS--YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTT--HVFAASD 596
            +S   L  A+ DGH++ F ++    E + +K+V LGT+   L      N +   + A ++
Sbjct: 648  VSGPTLFIAMEDGHVVTFNISADF-EFSGKKQVILGTRQARLHLLPQDNDSIYSILATTE 706

Query: 597  RPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRS 656
             P++IY    +++YS V  +E   +CPF++ AFPDS+ +A + ++ I  ID  ++ H+R 
Sbjct: 707  HPSLIYGEENRIVYSAVTAEEAMFICPFDTEAFPDSIIVATDTQIKISKIDRTRRTHVRE 766

Query: 657  IPLGEHPRRICHQEQSRTFAI-CSLKNQSCAEESEMHFVRLLDDQTFEFI-STYPLDTFE 714
            +P+GE  RRI +  + + F + C  ++    +E      RL+D+  F+ +  T+ L+   
Sbjct: 767  LPMGEMVRRIAYSPKEKVFGLGCIKRSLVDGDEVVQSSFRLVDEVIFQPVGKTFQLERTN 826

Query: 715  YGCSILSCSFSDDSNVY------YCVGTAYVLPEENEPT----KGRILVFIVEDGK-LQL 763
            Y   + +   ++  + Y      + VGT++ LP+ +       +GRILVF ++D K   L
Sbjct: 827  YVELVEAVVRAELPDSYGNPAERFIVGTSF-LPDPDYAMTGEHRGRILVFGIDDNKDPYL 885

Query: 764  IAEKETKGAVYSLNAFNG-KLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQ 822
            I    TKG    L   +G K++A + + + + ++      T  L     +      + + 
Sbjct: 886  ILSHLTKGVCRCLEVLDGNKIVAGLAKTVAIARYDETSTTTATLTRLASYKPSTHPIQIA 945

Query: 823  TRGDFIVVGDLMKSISLLIY-KHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNL 881
             +G+ I V D+MKS++L+ Y   ++  + E AR + +   +A+  +D D +L A++  NL
Sbjct: 946  AQGNIIGVADVMKSMTLVEYMPGDKDRLVEVARHWQSAAGTALCHVDGDDWLEADDQGNL 1005

Query: 882  FTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQ-IPTVIFGTVN 940
              +R+N++    E+R  + V  E +LGE VNR R     +R+  S     +P    GTVN
Sbjct: 1006 MMLRRNADAVVMEDRKIMSVTAEMNLGEMVNRIR----AVRVETSRGAMVVPRAFLGTVN 1061

Query: 941  GVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNN-EKKTVDAKNFLDGDL 999
            G I +  ++  E    L + Q  L +V+   G +    +R+F N E++  +   FLDG+L
Sbjct: 1062 GGIYMFGTVAPEAQDLLLRFQEKLARVVHTAGEIEFNCYRAFRNAEREGSEPVRFLDGEL 1121

Query: 1000 IESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRLH 1036
            +E FLD       EI + +  S+E +   VEEL R+H
Sbjct: 1122 LERFLDQDEATQREICEGLGPSLEHMRNVVEELRRMH 1158


>gi|224587439|gb|ACN58665.1| DNA damage-binding protein 1 [Salmo salar]
          Length = 444

 Score =  419 bits (1076), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 221/446 (49%), Positives = 284/446 (63%), Gaps = 54/446 (12%)

Query: 641  LTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLK------------------- 681
            LTIGTID+IQKLHIR++PL E PRRIC+QE S+ F + S +                   
Sbjct: 3    LTIGTIDEIQKLHIRTVPLYESPRRICYQEVSQCFGVLSSRVEMQDASGTTAAVRPSAST 62

Query: 682  --------------------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILS 721
                                  S  EE E+H + ++D  TFE +  +     EY  S++S
Sbjct: 63   QALSSSVSSSKLFPSSTSPHETSFGEEVEVHSLLVVDQHTFEVLHAHQFLQSEYALSMVS 122

Query: 722  CSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNG 781
            C    D +VY+ VGTA V PEE EP +GRI+VF   DGKLQ +AEKE KGAVYS+  FNG
Sbjct: 123  CRLGRDLSVYFIVGTAMVYPEEAEPKQGRIIVFHYTDGKLQTVAEKEVKGAVYSMMEFNG 182

Query: 782  KLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLI 841
            KLLA+IN  ++LY+W       +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL 
Sbjct: 183  KLLASINSTVRLYEWT----AEKELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLA 238

Query: 842  YKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEV 901
            YK  EG  EE ARD+N NWMSAVEILDDD +LGAEN FNLF  +K+S   TDEER  L+ 
Sbjct: 239  YKPMEGNFEEIARDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQE 298

Query: 902  VGEYHLGEFVNRFRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKL 960
            VG +HLGEFVN F HGSLV++ L +S      +V+FGTVNG+IG++ SL    Y  L  L
Sbjct: 299  VGVFHLGEFVNVFSHGSLVLQNLGESSTPTQGSVLFGTVNGMIGLVTSLSEGWYSLLLDL 358

Query: 961  QTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV 1020
            Q  L KVIK VG + H  WRSF+ E+KT  A  F+DGDLIESFLDL R +M E+  T+ +
Sbjct: 359  QNRLNKVIKSVGKIEHSFWRSFHTERKTEQATGFIDGDLIESFLDLGRAKMQEVVSTLQI 418

Query: 1021 ----------SVEELCKRVEELTRLH 1036
                      +V+E+ K VEELTR+H
Sbjct: 419  DDGSGMKREATVDEVIKIVEELTRIH 444


>gi|358380497|gb|EHK18175.1| hypothetical protein TRIVIDRAFT_80808 [Trichoderma virens Gv29-8]
          Length = 1161

 Score =  417 bits (1073), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 317/1124 (28%), Positives = 555/1124 (49%), Gaps = 110/1124 (9%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSEL--ITRAMGDVSDRIGRPT 63
            I+G I  L+  +P   A D LFI T+R ++  + W+ E+++L  + + + D ++   R +
Sbjct: 55   IHGTIDMLQRLQPKDSATDLLFIGTDRLQYFNIAWNPETNQLDAVEQTIHDSAEPYMRQS 114

Query: 64   DNGQIGIIDPDCRLIGLHLYDGLFKVIPFD-NKG---QLKEAFNIRLEELQVLDIKFLYG 119
             +    ++DP  + + +HL++G+  V      KG   +L+    +RL EL +    FL+ 
Sbjct: 115  QSQNRCLVDPTGKFMAMHLWEGVLNVFRLRIRKGLTTRLEVLDQVRLTELWMKSSVFLHS 174

Query: 120  -CAKPTIVVLYQ---DNKDARHVKTYEVALKDKDFVEG---PWSQNNLDN-----GADLL 167
                P I  LY+   D ++AR V  Y +   DK  V     P     LD       A +L
Sbjct: 175  RTGHPRIAFLYKNQLDREEAR-VAVYRLTEDDKGGVSSKFDPLKDRELDQVIPDPYASML 233

Query: 168  IPVP----------------PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAY-GRVD 210
            IPVP                  L G+L++GE  + Y  +  + ++       K Y    +
Sbjct: 234  IPVPVYEEKRYHVRNNEGARAHLGGLLVVGETLLTYFDSLTYSSVCSSIEDPKIYVAWAE 293

Query: 211  ADGSRYLLGDHAGLLHLLVITHEKEK----VTGLKIELLG------ETSIASTISYLDNA 260
             DG+ Y L D  G L LL I    E     VTG+++  +        TS AS++ Y+ N 
Sbjct: 294  YDGTHYFLADDYGRLDLLEIKTTNESTGVVVTGMEVHPITFQDSSRYTSRASSLVYMGNN 353

Query: 261  VVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQ---------- 310
            +++IGS +GDSQL+ ++++   + S V+VL    N  PI+DF ++DL  +          
Sbjct: 354  LLFIGSHHGDSQLLHIDIETQ-QMSLVKVLS---NNAPIMDFTIMDLGNREGDAQSGNTF 409

Query: 311  --GQGQVVTCSGAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLRSSTDDPFDTFLVV 367
              GQ ++V   GAY+DGSLR +R+G+G+ ++  + E QG +G+++LRS   +  DT +++
Sbjct: 410  SSGQARIVAGCGAYQDGSLRSIRSGVGLEDRGLLDEFQGTRGLFTLRSVESEKVDT-VII 468

Query: 368  SFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSREL 427
            S ++ TR+L    ++  E    +G   +++TL   +    QL+Q+T   V ++   S   
Sbjct: 469  STLTGTRVLRFEPDNIEELYSFQGIDLESETLLAANLPNGQLLQITPRVVNVLDPDSGTS 528

Query: 428  RNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEY------EIS 481
               W+SP G  +  A+AN    LL+  G  LV L + DG+   VK+A  +       +IS
Sbjct: 529  LCSWQSPEGKVITAASANTKWALLSIDGSILVSLNLLDGLKAVVKNASQDSVSGQPDQIS 588

Query: 482  CLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHL----GGEIIPRSVLLCA 537
            CL       +P       +G WT  ++ +  L  L  I  E L        +PR V L  
Sbjct: 589  CLHA---ARDPQ--DFGVIGWWTSGTISVVDLATLTPIHGEPLRQTDDSSSVPRDVALVQ 643

Query: 538  FE--GIS--YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKN-TTHVF 592
                 IS   +L AL DG++++F +++K   ++ RK V+LG+ P  L      +   +VF
Sbjct: 644  LHPPDISGPTMLVALEDGNVISFNVSVKGFSISGRKTVTLGSGPARLHVLPQADGICNVF 703

Query: 593  AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
            A ++  ++IYSS  +++YS     + + + PF+S AFPDS+ ++ E  + I  +D+ +  
Sbjct: 704  ATTEHASLIYSSEGRVVYSATTADDATFVAPFDSEAFPDSIVLSTEEHIRICHVDNERLT 763

Query: 653  HIRSIPLGEHPRRICHQEQSRTFAI-CSLKNQSCAEESEMHFVRLLDD-------QTFEF 704
            H++++P+ E  RR+ +    + F + C  K     EE     +RL+D+       + FE 
Sbjct: 764  HVKALPMSETVRRVAYSPGLKAFGLGCIRKELIENEEVITSTIRLVDEIIFQELGKPFEL 823

Query: 705  ISTYPLDTFEYGCSILSCSFSDDSNV---YYCVGTAYVL---PEENEPTKGRILVFIVED 758
              +  L+  E  C I+     D + V    + VGT++V     EE   T+GRILV  V++
Sbjct: 824  NGSASLELVE--C-IIRAELPDSNGVPAERFLVGTSFVADPGTEEAGETRGRILVLGVDE 880

Query: 759  GK-LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHIL 817
             + L  IA    KG    L   +  ++A + + + +Y +         L           
Sbjct: 881  SRQLYQIASHNLKGVCRCLAIMDDYIVAGLTKTVVVYGYTQETSTAASLNKLASFRPASF 940

Query: 818  ALYVQTRGDFIVVGDLMKSISLLIYKH----EEGAIEERARDYNANWMSAVEILDDDIYL 873
             + +   G+ I +GDLM+S++L+ +      ++  +EE+AR Y   W ++V  LD+  +L
Sbjct: 941  PIDLDVSGNIIGIGDLMQSLTLVEFTPGQDGKKPTLEEKARHYQQAWTTSVSALDESRWL 1000

Query: 874  GAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPT 933
             A+   N+  +R+N E  T+++R +LE+  E ++GE +NR R    +   P  +   IP 
Sbjct: 1001 EADAQGNIIVLRQNQEAPTEQDRSQLEITSELNIGEQINRIRK---IQVAPAENAIVIPK 1057

Query: 934  VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAK- 992
               G++ G + +   +  +    L   Q+ L++ I+  G L+ + WR+F N+ +  +A  
Sbjct: 1058 AFLGSIEGTLYLYGDIAPKYQDLLMTFQSRLQEYIQTPGNLSFDTWRAFRNQARDGEAPF 1117

Query: 993  NFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRLH 1036
             F+DG++IE FLDL   + + + + +  SVE++   +EEL R+H
Sbjct: 1118 RFVDGEMIERFLDLDEKQQELVCEGLGPSVEDMRNMIEELRRMH 1161


>gi|325094412|gb|EGC47722.1| DNA damage-binding protein 1a [Ajellomyces capsulatus H88]
          Length = 1201

 Score =  417 bits (1073), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 328/1133 (28%), Positives = 544/1133 (48%), Gaps = 135/1133 (11%)

Query: 16   FRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPTDNGQIGIIDPD 74
            FRP     D LFI T+R  +  + WD ++  L T R   D++D   R +  G   +IDP 
Sbjct: 92   FRPQ---TDILFIGTDRASYFTVSWDQKTKNLRTERKYVDLADPSARASQLGDRCLIDPS 148

Query: 75   CRLIGLHLYDGLFKVIP----FDNKG--------------------------------QL 98
             R + L LY+G+  VIP      ++G                                +L
Sbjct: 149  GRFLTLELYEGIVTVIPIGQPLRSRGSGRRLGKRAANASASAAYANQDAAAGAAAADLEL 208

Query: 99   KEAFNIRLEELQVLDIKFLYGCAK--PTIVVLYQDNKDARHVKTYEV------------- 143
             E    R+EEL V    FL+  A   P + +LY+D      +K  E+             
Sbjct: 209  GEPCQARVEELLVRSSTFLHTQADTLPRMALLYEDTMGKVRLKVRELEVMHGGMGMGGGT 268

Query: 144  --------------ALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYC- 188
                           LK  D +     +  L+ GA  L+PVP PL G+L++GE +I Y  
Sbjct: 269  GSGGGIGADGGIVAVLKGLDLL-----KEELEMGASFLVPVPAPLGGLLVLGETSIRYLD 323

Query: 189  -SANAFKAIPIR-PSITKAYGRVDADGSRYLLGDHAG-LLHLLVITHEKEKVTGLKIELL 245
             ++N   + P++  +I  A+ +VD  G R+LL D  G L  L+++      V   K++LL
Sbjct: 324  DASNECISQPLKEATIFVAWEQVD--GQRWLLADDYGRLFFLMLVLDTDNAVQSWKLDLL 381

Query: 246  GETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVV 305
            G+   AS + Y+   + +IGS  GDS+LI++      +GS+ EV++ + N+ PI+DF ++
Sbjct: 382  GDIPRASVLVYMGGGITFIGSHQGDSELIRIT-----EGSF-EVIQTFANIAPILDFTIM 435

Query: 306  DL------ERQ------GQGQVVTCSGAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWS 352
            DL      E Q      GQ ++VT SGA+ DGSLR VR+G+G+ E   +  ++ I  +W+
Sbjct: 436  DLGGRAIGESQTHEFSSGQARIVTGSGAFNDGSLRSVRSGVGMEEVGVLGAMKHITDLWA 495

Query: 353  LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETE-IEGFCSQTQTLFCHDAIYNQLVQ 411
            LR +    F   L+VSF+ ETR+     + E+EE E   G      TL   +  + +++Q
Sbjct: 496  LRVACPQEFSDTLLVSFVDETRVFYFTPDGEVEEKEEFMGLGLAESTLLAANLPHGRILQ 555

Query: 412  VTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTE 470
            VT  +VR+       +  EW      ++  A++N   V+L  GG  L+  +I GD  +T 
Sbjct: 556  VTEWNVRVAELDGGMVIWEWSPEQQKAITAASSNDDHVVLMVGGQVLMIFDISGDINITG 615

Query: 471  VKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLG--GEI 528
             K   ++ ++S + +     +P+ + I  +    ++ V   SLPDL +     LG  G+ 
Sbjct: 616  EKDFGVDTQVSGVTVT---TSPARACILCLPQTAEVVV--MSLPDLAIRRSTSLGEPGDA 670

Query: 529  IPRSVLLCAF--EGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGT-QPITLRTFSS 585
            +PRSVL+        + L  ++ DG + +F  N +   LT   K++LG+ QP   +    
Sbjct: 671  VPRSVLVAEVLPNNPATLFVSMADGSVFSFSFNSEDFSLTSMSKLTLGSEQPSFKKLPRG 730

Query: 586  KNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGT 645
                +VFA  ++P++IY+   +++YS VN  + S +C FNS A+P S+A+A   EL I  
Sbjct: 731  DGLYNVFATCEQPSLIYAVEGRIVYSAVNSDQASRICHFNSEAYPGSIALATPSELKITL 790

Query: 646  IDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKN--QSCAEESEMHFVRLLDDQTFE 703
            +D  +   I+++ +GE  RR+ +    R F I ++K   +  AE     F+ L D+  F 
Sbjct: 791  VDAERTTQIQTLEVGETVRRVAYSAPERAFGIGTIKRTLEDGAEVIASRFM-LADEIMFR 849

Query: 704  FISTYPLDTFEYGCSILSCSFSD-----DSNVY---YCVGTAYVLPEENEPTKGRILVF- 754
             +  Y L+  E   S++   F D      ++++   + VGT+Y+        +GRIL F 
Sbjct: 850  ELDIYDLNKDELVESVIRAQFPDGIGSEGNDLFKDLFVVGTSYLDDFGEGSIRGRILAFE 909

Query: 755  IVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHG 814
            +  + +L  +AE   KGA  +L     K++AA+ + + +Y           L     +  
Sbjct: 910  VTANRQLAKVAEMPVKGACRALAIVQDKIVAALMKTVVVYTLSKGQFADYTLSKTASYRT 969

Query: 815  HILALYVQTRGDFIVVGDLMKSISLLIYKHEEG----AIEERARDYNANWMSAVEILDDD 870
                + +   G+ I V DLMKS+S++ Y+        ++ E AR +   W +AV  + +D
Sbjct: 970  STAPVDIAVTGNLIAVADLMKSVSIVEYQQGANGLPDSLTEVARHFQTLWSTAVAPVAED 1029

Query: 871  IYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQ 930
             +L ++   NL  + +N  G TD++R RLEV  E  LGE VNR R  + +     ++   
Sbjct: 1030 TWLESDAEGNLVMLHRNVNGVTDDDRRRLEVTSEILLGEMVNRIRPVN-IQGSQGAEAAI 1088

Query: 931  IPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNE-KKTV 989
             P    GTV G I +   +       L +LQ+ +  ++   GG+   ++R+F N  ++T 
Sbjct: 1089 SPRAFLGTVEGSIYLFGIINPTYQDLLMRLQSAMAGMVVTPGGMPFNKFRAFRNTIRQTE 1148

Query: 990  DAKNFLDGDLIESFLDLSRTRMDEI------SKTMNVSVEELCKRVEELTRLH 1036
            +   F+DG+LIE FL+      +EI           V+VE +   VEEL R+H
Sbjct: 1149 EPYRFVDGELIERFLNCGVELQEEIVGKVIADGVAGVTVESVKGLVEELRRMH 1201


>gi|225558618|gb|EEH06902.1| DNA damage-binding protein 1a [Ajellomyces capsulatus G186AR]
          Length = 1201

 Score =  416 bits (1070), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 329/1133 (29%), Positives = 540/1133 (47%), Gaps = 135/1133 (11%)

Query: 16   FRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPTDNGQIGIIDPD 74
            FRP     D LFI T+R  +  + WD ++  L T R   D++D   R +  G   +IDP 
Sbjct: 92   FRPQ---TDILFIGTDRASYFTVSWDQKTKNLRTERKYVDLADPSARASQLGDRCLIDPS 148

Query: 75   CRLIGLHLYDGLFKVIP----FDNKG--------------------------------QL 98
             R + L LY+G+  VIP      ++G                                +L
Sbjct: 149  GRFLTLELYEGIVTVIPIGQPLRSRGSGRRLGKRAANASASAAYANQDAAAGAAAADLEL 208

Query: 99   KEAFNIRLEELQVLDIKFLYGCAK--PTIVVLYQDNKDARHVKTYEV------------- 143
             E    R+EEL V    FL+  A   P + +LY+D      +K  E+             
Sbjct: 209  GEPCQARVEELLVRSSTFLHTQADTLPRMALLYEDTMGKVRLKVRELEVMHGGMGMGGGT 268

Query: 144  --------------ALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYC- 188
                           LK  D +     +  L+ GA  L+PVP PL G+L++GE +I Y  
Sbjct: 269  GSGGGIGADGGIVAVLKGLDLL-----KEELEMGASFLVPVPAPLGGLLVLGETSIRYLD 323

Query: 189  -SANAFKAIPIR-PSITKAYGRVDADGSRYLLGDHAG-LLHLLVITHEKEKVTGLKIELL 245
             ++N   + P++  +I  A+ +VD  G R+LL D  G L  L+++      V   K++LL
Sbjct: 324  DASNECISQPLKEATIFVAWEQVD--GQRWLLADDYGRLFFLMLVLDTDNAVQSWKLDLL 381

Query: 246  GETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVV 305
            G+   AS + Y+   + +IGS  GDS+LI++      +GS+ EV++ + N+ PI+DF ++
Sbjct: 382  GDIPRASVLVYMGGGITFIGSHQGDSELIRIT-----EGSF-EVIQTFANIAPILDFTIM 435

Query: 306  DL------ERQ------GQGQVVTCSGAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWS 352
            DL      E Q      GQ ++VT SGA+ DGSLR VR+G+G+ E   +  ++ I  +W+
Sbjct: 436  DLGGRAIGESQTHEFSSGQARIVTGSGAFNDGSLRSVRSGVGMEEVGVLGAMKHITDLWA 495

Query: 353  LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETE-IEGFCSQTQTLFCHDAIYNQLVQ 411
            LR +    F   L+VSF+ ETR+     + E+EE E   G      TL   +  + +++Q
Sbjct: 496  LRVACPQEFSDTLLVSFVDETRVFYFTPDGEVEEKEEFMGLGLAESTLLAANLPHGRILQ 555

Query: 412  VTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTE 470
            VT  +VR+       +  EW      ++  A++N   ++L  GG  L+  +I GD  +T 
Sbjct: 556  VTEWNVRVAELDGGMVIWEWSPEQTKAITAASSNDDHLVLMVGGQVLMIFDISGDINITG 615

Query: 471  VKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLG--GEI 528
             K   ++ ++S + +     +P+ + I  +    ++ V   SLPDL +     LG  G+ 
Sbjct: 616  EKDFGVDTQVSGVTVT---TSPARACILCLPQTAEVVV--MSLPDLAIRRSTSLGEPGDA 670

Query: 529  IPRSVLLCAF--EGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGT-QPITLRTFSS 585
            +PRSVL+        + L  ++ DG + +F  N +   LT   K++LG+ QP   +    
Sbjct: 671  VPRSVLVAEVLPNNPATLFVSMADGSVFSFSFNSEDFSLTSMSKLTLGSEQPSFKKLPRG 730

Query: 586  KNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGT 645
                +VFA  ++P++IY+   +++YS VN  + S +C FNS A+P S+A+A   EL I  
Sbjct: 731  DGLYNVFATCEQPSLIYAVEGRIVYSAVNSDQASRICHFNSEAYPGSIALATPSELKITL 790

Query: 646  IDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKN--QSCAEESEMHFVRLLDDQTFE 703
            +D  +   I+++ +GE  RR+ +    R F I ++K   +  AE     F+ L D+  F 
Sbjct: 791  VDAERTTQIQTLEVGETVRRVAYSAPERAFGIGTIKRTLEDGAEVIASRFM-LADEIMFR 849

Query: 704  FISTYPLDTFEYGCSILSCSFSD--DS------NVYYCVGTAYVLPEENEPTKGRILVF- 754
             +  Y L+  E   S++   F D  DS         + VGT+Y+        +GRIL F 
Sbjct: 850  ELDIYDLNKDELVESVIRAQFPDGIDSEGNDLFKDLFVVGTSYLDDFGEGSIRGRILAFE 909

Query: 755  IVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHG 814
            +  + +L  +AE   KGA  +L     K++AA+ + + +Y           L     +  
Sbjct: 910  VTANRQLAKVAEMPVKGACRALAIVQDKIVAALMKTVVVYTLSKGQFADYTLSKTASYRT 969

Query: 815  HILALYVQTRGDFIVVGDLMKSISLLIYKHEEG----AIEERARDYNANWMSAVEILDDD 870
                + +   G+ I V DLMKS+S++ Y+        ++ E AR +   W +AV  + +D
Sbjct: 970  STAPVDIAVTGNLIAVADLMKSVSIVEYQQGANGLPDSLTEVARHFQTLWSTAVAPVAED 1029

Query: 871  IYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQ 930
             +L ++   NL  + +N  G TD++R RLEV  E  LGE VNR R  + +     ++   
Sbjct: 1030 TWLESDAEGNLVMLHRNVNGVTDDDRRRLEVTSEISLGEMVNRIRPVN-IQGSQGAEAAI 1088

Query: 931  IPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNE-KKTV 989
             P    GTV G I +   +       L +LQ+ +  ++   GG+   ++R+F N  ++  
Sbjct: 1089 SPRAFLGTVEGSIYLFGIINPTYQDLLMRLQSAMAGMVVTPGGMPFNKFRAFRNTIRQAE 1148

Query: 990  DAKNFLDGDLIESFLDLSRTRMDEI------SKTMNVSVEELCKRVEELTRLH 1036
            +   F+DG+LIE FL  S    +EI           V+VE     VEEL R+H
Sbjct: 1149 EPYRFVDGELIERFLGCSVELQEEIVGKVIADGVAGVTVESAKGLVEELRRMH 1201


>gi|239613967|gb|EEQ90954.1| UV-damaged DNA binding protein [Ajellomyces dermatitidis ER-3]
 gi|327353314|gb|EGE82171.1| UV-damaged DNA binding protein [Ajellomyces dermatitidis ATCC 18188]
          Length = 1199

 Score =  416 bits (1069), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 328/1129 (29%), Positives = 536/1129 (47%), Gaps = 131/1129 (11%)

Query: 16   FRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPTDNGQIGIIDPD 74
            FRP     D LF+ T+R  +  + WD ++  L T R   D++D   R +  G   +IDP 
Sbjct: 94   FRPQ---TDILFVGTDRSTYFTVSWDQKTKNLHTERKYVDLADGSARESQLGDRCLIDPS 150

Query: 75   CRLIGLHLYDGLFKVIP----FDNKG----------------------------QLKEAF 102
             R + L LY+G+  VIP      ++G                            +L E  
Sbjct: 151  GRFLTLELYEGIVTVIPIGQPLRSRGSGRKLGKRAANASASTMQGNQDVGAADLELGEPC 210

Query: 103  NIRLEELQVLDIKFLYGCAK--PTIVVLYQDNKDARHVKTYEV----------------- 143
              R+EEL V    FL+  A   P + +LY+D      +K  E+                 
Sbjct: 211  QARVEELLVRSSTFLHTQADMLPRMALLYEDTMGKVRLKVRELEVTYGGMGIGSGSGGGS 270

Query: 144  ----------ALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYC--SAN 191
                       LK  D +     +  L+ GA  L+PVP PL G+L++GE +I Y   ++N
Sbjct: 271  GVGADGGSAAVLKGLDML-----KEELEMGASFLVPVPAPLGGLLVLGETSIRYLDDASN 325

Query: 192  AFKAIPIR-PSITKAYGRVDADGSRYLLGDHAG-LLHLLVITHEKEKVTGLKIELLGETS 249
               + P+   +I  A+ +VD  G R+LL D  G L  L++I      V   K++ LG   
Sbjct: 326  ECISQPLEEATIFVAWEQVD--GQRWLLADDYGRLFFLMLILDSDNAVQSWKLDRLGNIP 383

Query: 250  IASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDL-- 307
             AS + Y+   V +IGS  GDSQLI++      +GS+ EV++ + N+ PI+DF ++DL  
Sbjct: 384  RASVLVYMGGGVTFIGSHQGDSQLIRIT-----EGSF-EVIQTFANIAPILDFTIMDLGG 437

Query: 308  ----ERQ------GQGQVVTCSGAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLRSS 356
                E Q      GQ ++VT SGA+ DGSLR VR+G+G+ E   +  ++ I  +W+LR +
Sbjct: 438  RAMGESQTHEFSSGQARIVTGSGAFNDGSLRSVRSGVGMEEVGVLGSMEHITDLWALRVA 497

Query: 357  TDDPFDTFLVVSFISETRILAMNLEDELEET-EIEGFCSQTQTLFCHDAIYNQLVQVTSG 415
              + F   L+VSF+ ETR+     +  +EE  E  G      TL   +    +++QVT  
Sbjct: 498  CPEEFSDTLLVSFVDETRVFYFTPDGGVEEKDEFMGLGLTESTLIAANLPNGRILQVTER 557

Query: 416  SVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTEVKHA 474
            +VR+       +  EW      ++  A++N   ++L  GG  L+  +I G+  +   K  
Sbjct: 558  NVRVAELDGGMVMWEWSPVSQKAITAASSNDDHLVLMVGGQVLMIFDIRGEIKVAGEKDF 617

Query: 475  QLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLG--GEIIPRS 532
             ++ ++S + +     +P+ S I  +    ++ V   SL DL +     LG  G+ +PRS
Sbjct: 618  GVDTQVSGVTVT---ASPARSCILCLPQTAEVIV--MSLADLTIRHSTSLGEPGDAVPRS 672

Query: 533  VLLCAF--EGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGT-QPITLRTFSSKNTT 589
            VL+        + L  ++ DG + +F  N     LT   K++LG+ QP   +        
Sbjct: 673  VLVAEVLPNNPATLFVSMADGSVFSFSFNATDFSLTSMSKITLGSEQPSFKKLPRGDGLY 732

Query: 590  HVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDI 649
            +VFA  ++P++IYS+  +++YS VN  + S +C FNS A+P S+A+A   EL I  +D  
Sbjct: 733  NVFATCEQPSLIYSAEGRIVYSAVNSDQASRICHFNSEAYPASIALATPSELKIALVDAE 792

Query: 650  QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKN--QSCAEESEMHFVRLLDDQTFEFIST 707
            +   I+++ +GE  RR+ +    R F I ++K   +  AE     F+ L D+  F  +  
Sbjct: 793  RTTQIQTLEVGETVRRVAYSAPERAFGIGTIKRTLEDGAEVIASRFM-LADEIMFRELDV 851

Query: 708  YPLDTFEYGCSILSCSFSDDSNV--------YYCVGTAYVLPEENEPTKGRILVF-IVED 758
            Y L+  E   S++   F DD +          + VGT+Y+        +GRIL F +  +
Sbjct: 852  YDLNKDELVESVIRAQFPDDKDSEGEELMKDLFIVGTSYLDDFGEGSIRGRILAFEVTPN 911

Query: 759  GKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILA 818
             +L  +AE   KGA  +L     K++AA+ + + +Y           L     +      
Sbjct: 912  RQLGKVAEMPVKGACRALAIVQDKIVAALMKTVVVYTLSKGQFADYILTKTASYRTSTAP 971

Query: 819  LYVQTRGDFIVVGDLMKSISLLIYKHE----EGAIEERARDYNANWMSAVEILDDDIYLG 874
            + +   G+ I V DLMKS+S++ Y+       G++ E AR +   W +AV  +  D +L 
Sbjct: 972  IDIAVTGNLIAVADLMKSVSIVEYQQGTDGLSGSLTEVARHFQTLWSTAVAPVAQDTWLE 1031

Query: 875  AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTV 934
            ++   NL  +R+N  G T+++R RLEV  E  LGE VNR R  ++   L  ++    P  
Sbjct: 1032 SDAEGNLVVLRRNVNGVTEDDRRRLEVTSEVLLGEMVNRIRPVNIQASL-GTEAAISPRA 1090

Query: 935  IFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNE-KKTVDAKN 993
              GTV G I +   +       L +LQ+ +  ++   GG+   ++R+F N  ++  +   
Sbjct: 1091 FLGTVEGSIYLFGIINPTYQDLLMRLQSAMAGMVVTPGGMPFNKFRAFRNTVRQAEEPYR 1150

Query: 994  FLDGDLIESFLDLSRTRMDEI------SKTMNVSVEELCKRVEELTRLH 1036
            F+DG+LIE FL       +EI           V+VE +   VEEL R+H
Sbjct: 1151 FVDGELIERFLGCGAELQEEIVGKVIADGVAGVTVERVKAIVEELKRMH 1199


>gi|340521192|gb|EGR51427.1| predicted protein [Trichoderma reesei QM6a]
          Length = 1161

 Score =  414 bits (1064), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 314/1125 (27%), Positives = 555/1125 (49%), Gaps = 112/1125 (9%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSEL--ITRAMGDVSDRIGRPT 63
            I+G I  L+  +P   A D LFI T+R ++  L W+ E+++L  + + + D ++   R +
Sbjct: 55   IHGTIDMLQRLQPKDSATDLLFIGTDRLQYFNLAWNPETNQLDAVEQTIEDTAEPYMRQS 114

Query: 64   DNGQIGIIDPDCRLIGLHLYDGLFKVIPFD-NKG---QLKEAFNIRLEELQVLDIKFLYG 119
             +    ++DP  + + +HL++G+  V      KG   +L+    +RL EL +    FLY 
Sbjct: 115  QSQNRCLVDPTGKFMAMHLWEGVLNVFRLRIRKGLTTKLEGLDQVRLTELWMKASTFLYS 174

Query: 120  -CAKPTIVVLYQ---DNKDARHVKTYEVALKDKDFVEGPWS-QNNLD-------NGADLL 167
                PTI  LY+   D ++AR +  Y +   DK  V   +  Q N D         A +L
Sbjct: 175  RTGHPTIAFLYKNQLDREEAR-IAVYRLTEDDKPGVSSKFDPQKNRDFEEVIRDPYASML 233

Query: 168  IPVP----------------PPLCGVLIIGEETIVYCSANAFKAIP---IRPSITKAYGR 208
            IPVP                  L G+L++GE  + Y  +  + ++      P I  A+  
Sbjct: 234  IPVPVYEEKRYHVRNNEGARAHLGGLLVVGETLLTYFDSLTYSSVCSSLAEPKIYVAWA- 292

Query: 209  VDADGSRYLLGDHAGLLHLLVITHEKEK----VTGLKI------ELLGETSIASTISYLD 258
             + DG+ YLL D  G L +L I    E     VTG+++      +L   TS AS++ Y+ 
Sbjct: 293  -EYDGTHYLLADDYGRLDMLEIKTTNESTGVVVTGMEVHPFAFEDLSRYTSRASSLVYMG 351

Query: 259  NAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQ-------- 310
            N +++IGS +GDSQL+++    D +   + +L+   N  PI+DF ++DL  +        
Sbjct: 352  NNLLFIGSHHGDSQLLRI----DIETQQMTLLKVLPNNAPIMDFTIMDLGNREGDAQAGN 407

Query: 311  ----GQGQVVTCSGAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLRSSTDDPFDTFL 365
                GQ ++V   GAY+DGSLR +R+G+G+ ++  + E  G +G+++LRS   +  DT +
Sbjct: 408  TFSSGQARIVAGCGAYQDGSLRSIRSGVGLEDRGLLDEFPGTRGLFTLRSVDSEKVDT-V 466

Query: 366  VVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSR 425
            VVS ++ TR+L    E   E    +G   +++TL   +    QL+Q+T   V L+   S 
Sbjct: 467  VVSTLAGTRVLRFEPESIEEVYSFQGMDLESETLLAANLPNGQLLQITPRVVTLLDPESG 526

Query: 426  ELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEY------E 479
                 W+SP G  +  A+AN    LL+  G  LV L + DG+   V+ A          +
Sbjct: 527  ASLCSWQSPDGKVITAASANTKWALLSIDGSLLVSLNLLDGLKAVVQDASENPVSGQPDQ 586

Query: 480  ISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHL----GGEIIPRSVLL 535
            ISC  I+   + P +     VG W   ++ +  L  L  +  E L        +PR V L
Sbjct: 587  ISC--IHAARDPPDF---GVVGWWASGTISVVDLATLTPLHGESLRQTDDSASVPRDVAL 641

Query: 536  CAFEGISY----LLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKN-TTH 590
                        LL AL DG++++F +++K   ++ RK V+LG+ P  L      +   +
Sbjct: 642  VQLHPPDMSGPTLLVALEDGNVISFNVSLKGFSISGRKTVTLGSGPARLHVLPRPDGICN 701

Query: 591  VFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQ 650
            VFA ++  ++IYSS  +++YS     + + + PF+S AFPDS+ ++ +  + I  +D  +
Sbjct: 702  VFATTEHASLIYSSEGRIVYSATTADDATFVAPFDSEAFPDSIVLSTDEHIRICHVDSER 761

Query: 651  KLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCA-EESEMHFVRLLDD-------QTF 702
              H++S+P+ E  RR+ +    + F +  +K +    EE     VRL+D+       Q F
Sbjct: 762  LTHVKSLPMHETVRRVAYSPGLKAFGLGCIKKELVENEEVVTSTVRLVDEIIFQELGQPF 821

Query: 703  EFISTYPLDTFEYGCSILSCSFSDDSNV--YYCVGTAYVL---PEENEPTKGRILVFIVE 757
            E  ++  L+  E  C I +     + N+   + VGT++V     +E   T+GRI+V  V+
Sbjct: 822  ELNASASLELVE--CVIRAELPDSNGNMTERFLVGTSFVADPGTDEAGETRGRIVVLGVD 879

Query: 758  DGK-LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHI 816
            + + L  IA    KG    L   +  ++A +++ + +Y +         L     +    
Sbjct: 880  ESRQLYQIASHNLKGVCRCLAMLDDYIVAGLSKTVVVYSYAQETSTAASLTKVASYRPAS 939

Query: 817  LALYVQTRGDFIVVGDLMKSISLLIYKHEE----GAIEERARDYNANWMSAVEILDDDIY 872
              + +   G+ I VGDLM+S++L+ +   +      +EE+AR Y   W ++V  LD+  +
Sbjct: 940  FPVDLDVSGNMIGVGDLMQSLTLIEFTPPQDGKMAKLEEKARHYQQAWTTSVCALDETRW 999

Query: 873  LGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIP 932
            L A+   N+  +R+  E  T+++R +LE+  E ++GE +NR R   +    P  +   +P
Sbjct: 1000 LEADAQGNVIVLRQRQEAPTEQDRSQLEITSELNIGEQINRIRKLQVA---PGENAVVVP 1056

Query: 933  TVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAK 992
                G++ G + +   +  +    L   Q+ L+  I+  G L+ + WR+F N+ +  ++ 
Sbjct: 1057 KAFLGSIEGTLYLYGDIAPKYQDLLMTFQSRLQGYIQTPGNLSFDLWRAFRNQAREGESP 1116

Query: 993  -NFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRLH 1036
              F+DG++IE FLDL  ++ + + + +  +VE++   +EEL R+H
Sbjct: 1117 YRFVDGEMIERFLDLDESQQELVCEGLGPNVEDMRNLIEELRRMH 1161


>gi|156049323|ref|XP_001590628.1| hypothetical protein SS1G_08368 [Sclerotinia sclerotiorum 1980]
 gi|154692767|gb|EDN92505.1| hypothetical protein SS1G_08368 [Sclerotinia sclerotiorum 1980 UF-70]
          Length = 1153

 Score =  414 bits (1064), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 331/1118 (29%), Positives = 561/1118 (50%), Gaps = 108/1118 (9%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPTD 64
            IYGRI+ L+  +P G   D LF+ T R ++  + W+  + +L T ++  D+S    R ++
Sbjct: 55   IYGRISMLQKIQPAGSKTDHLFVGTVRAQYFTVMWNPNTHKLDTMQSFVDISQEHMRDSE 114

Query: 65   NGQIGIIDPDCRLIGLHLYDGLFKVI----PFDNKGQ-LKEAFNIRLEELQVLDIKFLYG 119
            +    ++DP  RL+ + LY+G+  ++    P   K   L++   +R+ E++V    FLY 
Sbjct: 115  SRDRCLVDPTGRLLVMELYEGVLNLVKIVKPRGGKTDYLEKPEQVRISEMKVRASAFLYT 174

Query: 120  CAK-PTIVVLYQDNKDARHVKTYEVALKDK-----DFVEGPWSQNNLDN---GADLLIPV 170
              K P + +LYQD ++  ++ TY + L DK      F      +N+LD+   GA  +IPV
Sbjct: 175  HTKQPKLALLYQDARENVNLATYRM-LDDKGHPNLQFDPKKNRENDLDDLCVGAMHIIPV 233

Query: 171  P-------------------PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAY---GR 208
            P                     L GV+++GE    Y    + KAI +  ++ +A      
Sbjct: 234  PKGEDEGSKRYIVRNATTAKANLGGVVVLGETKFTYLDDES-KAI-VEYALDEAVLWAAW 291

Query: 209  VDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSY 268
               D   YLLGD  G L+L  I  +   VTG+K+  LG+ S  +++  L N V Y+ S  
Sbjct: 292  EPIDDRNYLLGDDYGFLYLFTILVDGATVTGMKVLKLGQVSKPTSLENLGNGVFYVASHE 351

Query: 269  GDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDL---ERQ--------GQGQVVT 317
             D+Q+I+++L   +    V VL+   N+ PI+DF ++D+   E Q        GQ ++VT
Sbjct: 352  ADNQVIQIDL--GSPNYSVTVLQVLPNIAPILDFTIMDMGGREGQTQLNEYSSGQARLVT 409

Query: 318  CSGAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLRSSTDDPFDTFLVVSFISETRIL 376
             SG ++ GSLR VR+G+G+++ A + E++GI  +++L S    P DT LVVSF +ETR  
Sbjct: 410  GSGGFEGGSLRSVRSGVGLDDIAILAEMEGIHKVFALHSGPSLPNDT-LVVSFSTETRFF 468

Query: 377  AMNLEDELEETE-IEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPP 435
              +   ++EE   I    + ++TL  ++     ++QVT   V +     +   + W+ P 
Sbjct: 469  KFDSLGDIEEVNTISNLSAASETLLTYNLDNGCILQVTQHEVAI---HGKSPGHRWQPPD 525

Query: 436  GYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQL-EYEISCLDINPIGENPSY 494
            G  +  A+ N + +LL++ G  LV L I    L EV   +L + +++C+ +  +      
Sbjct: 526  GQIITAASGNQNYILLSSNGRTLVTLSIEQN-LAEVAFQELGDDQVACIHVPQL-----L 579

Query: 495  SQIAAVGMWTDISVRIFSLPDLNLITKEHL---GGEIIPR-----SVLLCAFEGISYLLC 546
              I  +G+W   S+ +  L  L  I  E L    G  IPR      +LL    G + L  
Sbjct: 580  GDIGVIGLWKSGSISLLDLKTLKTIVSEDLRRNDGASIPRDIALTQILLPELSGPT-LFV 638

Query: 547  ALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTT-HVFAASDRPTVIYSSN 605
            ++ DG +L+F ++ +   L+ RK + LGTQ   L+     NTT +VFA  + P++IY S 
Sbjct: 639  SMEDGIVLSFDVDSENCSLSGRKSIVLGTQQARLQILPRDNTTFNVFATCEHPSLIYGSE 698

Query: 606  KKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRR 665
             + +YS V  ++   +C FNS A+PDS+ +A   EL +  ID+ ++ H+R++P+GE  RR
Sbjct: 699  GRTVYSAVTAEDAITVCSFNSRAYPDSVVVATTNELKLSAIDNERRTHVRTLPIGETVRR 758

Query: 666  ICHQEQSRTFAICSLKNQ-SCAEESEMHFVRLLDDQTFEFIST--YPLDTFEYGCSI--- 719
            I +  + R FAI ++K + +  +E      RL+D+  F  +    Y     E   S+   
Sbjct: 759  IAYSAKERAFAIGAIKRELTKGQEIVTTSFRLVDEVVFGELGEPFYLPANDEIIESVIRA 818

Query: 720  -LSCSFSDDSNV-YYCVGTAYVLPEENEPTKGRILVFIV-EDGKLQLIAEKETKGAVYSL 776
             L   + D   V  + VGT++ L +     +GR+L+F V  D    +IA    KG+   +
Sbjct: 819  ELPTKYGDGELVERFLVGTSF-LHDGEVNIRGRLLIFGVNSDRTPYIIASHTLKGSCRCI 877

Query: 777  NAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKS 836
               NGK++AA+N+ + +Y +      T  L+    +      + +  RG+ I V D+MKS
Sbjct: 878  GVLNGKIVAALNKTVVMYDYEETSRTTANLRKVATYRCATCPIDIDIRGNIIAVADIMKS 937

Query: 837  ISLLIYKHE----EGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGAT 892
            ++L+ Y          +EE  R     + +++  +D D YL ++++ NL  +++N EG T
Sbjct: 938  VALVEYTPGVDGLPDKLEEVGRHAQQVFATSIAEVDTDTYLESDHDGNLIVLKRNREGVT 997

Query: 893  DEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHE 952
             E++ RLEV+ E +LGE VN+ +  ++       D   IP     T  G I + + +P +
Sbjct: 998  REDKLRLEVLCEMNLGEMVNKIKRINVET---SKDALLIPRAFVATTEGSIYLFSLIPPQ 1054

Query: 953  QYLFLEKLQTNLRKVIK--------------GVGGLNHEQWRSFNNE-KKTVDAKNFLDG 997
                L +LQ+ L  +                  G L+ +++RS+ +  ++T +   F+DG
Sbjct: 1055 NQDLLMRLQSRLASLPARSLTDSSFSAPIEFSPGNLDFDKYRSYVSAVRETNEPFRFVDG 1114

Query: 998  DLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRL 1035
            +LIE FLDL     + I   + V  E+L   VE L RL
Sbjct: 1115 ELIERFLDLDGAIQENICDGLGVRAEDLRGVVEGLRRL 1152


>gi|391865638|gb|EIT74917.1| damage-specific DNA binding complex, subunit DDB1 [Aspergillus oryzae
            3.042]
          Length = 1135

 Score =  414 bits (1063), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 329/1110 (29%), Positives = 548/1110 (49%), Gaps = 108/1110 (9%)

Query: 6    IYGRIATL-ELFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPT 63
            +Y +++ L  L  P     D LF+ T+RY +C L WD+  + + T R   D++D   R +
Sbjct: 55   LYAKVSMLARLPAPAHSPTDHLFVGTDRYTYCTLSWDSAQNRIRTERNYVDIADPSSRES 114

Query: 64   DNGQIGIIDPDCRLIGLHLYDGLFKVIPF--------------------DNKGQLKEAFN 103
              G   +IDP  R + L +Y+GL  V+P                        G+L E   
Sbjct: 115  QTGNRCLIDPSGRFMTLEVYEGLVAVVPIVQLPARKRGRAPAVPTGPDAPKVGELGELTT 174

Query: 104  IRLEELQVLDIKFLY-GCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQN---- 158
             R++EL V    FL+     P + +LY+DN+     K   + +++ ++     S +    
Sbjct: 175  ARIDELFVRSSAFLHVQSGLPRLALLYEDNQ-----KKVRLMVRELNYTSATASTSADAT 229

Query: 159  ---------NLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIRP----SITKA 205
                      LD GA  LIPVP     +LI+GE +I Y   +  + I  RP    +I  A
Sbjct: 230  LTHIADFAQELDLGASHLIPVP----DLLILGETSIKYVDDDNNEIIS-RPLDEATIFVA 284

Query: 206  YGRVDADGSRYLLGDHAG-LLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYI 264
            +  VD+   R+LL D  G L  L+++     +V G K++ LG TS AS + YL   +V++
Sbjct: 285  WEGVDS--QRWLLADDYGRLFFLMLVLDSDNQVQGWKLDHLGNTSRASALVYLGGGIVFV 342

Query: 265  GSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDL-----ERQ------GQG 313
            GS  GDSQ++++       GS+ E+++   N+ PI+DF ++DL     E Q      GQ 
Sbjct: 343  GSHQGDSQVLRI-----GNGSF-EIIQALSNIAPILDFTIMDLGNRTSESQTHEFSSGQA 396

Query: 314  QVVTCSGAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLR-SSTDDPFDTFLVVSFIS 371
            ++VT SGA+ DG+LR VR+G+G+ E   + +++ I  +W L+  +  D  DT L+V+FI 
Sbjct: 397  RIVTGSGAFDDGTLRSVRSGVGMEELGVLGDMEHITDLWGLQVQAGGDTLDT-LLVTFID 455

Query: 372  ETRILAMNLEDELEETE-IEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNE 430
            ETR+   + + E+EE +   G      TL   +    +++QVT   V +       +  E
Sbjct: 456  ETRVFHFSPDGEVEELDHFLGLSLSENTLLAANLPRGRILQVTEQRVLIADLEGEMVVYE 515

Query: 431  WKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD--GILTEVKHAQLEYEISCLDINPI 488
            W  P    +  A+AN   ++L  GG  +  L+IG    ++TE K    + ++S + +   
Sbjct: 516  WTPPNELVITAASANDDSLVLVIGGELMTVLDIGTEAQVITEKKFGA-DSQVSGVTVPA- 573

Query: 489  GENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLG--GEIIPRSVLLCAF--EGISYL 544
                S +++  VG      V +  L DL  +    LG  GE  PRSVL+     +    L
Sbjct: 574  ----SPTEVCVVGFPQLAKVSVLRLRDLTEVHTTSLGPAGEAFPRSVLVADVLADSPPTL 629

Query: 545  LCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTH-VFAASDRPTVIYS 603
              ++ DG ++ +        L+   K+ LG++  T +     +  + VFA  + P++IY 
Sbjct: 630  FISMADGSVITYSFKTDDYSLSHMNKLILGSEQPTFKKLPRGDGLYNVFATCENPSLIYG 689

Query: 604  SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHP 663
            S  +++YS VN +  S +C FNS A+P S+A+A   EL I  +D  +   I+++ +G   
Sbjct: 690  SEGRIIYSAVNSEGASRVCHFNSEAYPGSIAVATLHELKIALVDRERTTQIQTLQIGATV 749

Query: 664  RRICHQEQSRTFAICSLKNQSCAEESEMHFVRLL--DDQTFEFISTYPLDTFEYGCSILS 721
            RR+ +    + F I +++ +  A+ +E+   R +  D+  F  + ++ L   E   S++ 
Sbjct: 750  RRVAYSPSEKAFGIGTIERK-LADGAEIVTSRFMLADEVLFRQLDSFELRPEEIVESVIR 808

Query: 722  CSF--SDDSNV------YYCVGTAYVLPEENEPTKGRILVFIVEDG-KLQLIAEKETKGA 772
              F    D N        + VGTAY+  E  E  +GRIL+F +++G KL  +AE   KGA
Sbjct: 809  AEFPAGKDENGREMTKDRFVVGTAYLDDEGEESIRGRILMFEIDNGRKLTKVAELPVKGA 868

Query: 773  VYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGD 832
              +L     K++AA+ + I +YK +  + GT +L+           + V   G+ IVV D
Sbjct: 869  CRALAMLGDKIVAALVKTIVIYKVVNNNFGTMKLEKLASFRTSTAPVDVTVVGNVIVVSD 928

Query: 833  LMKSISLLIYKHEEG----AIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNS 888
            LMKS+ LL +K  E     ++ E AR +   W + V  +D D +L ++   NL  +R+N 
Sbjct: 929  LMKSVCLLEFKEGENGLPDSLTEVARHFQTVWATGVACIDKDTFLESDAEGNLIVLRRNL 988

Query: 889  EGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIAS 948
             G  +++R RLEV  E  LGE VNR R  ++      + V   P    GTV G I + A 
Sbjct: 989  AGVEEDDRRRLEVTSEISLGEMVNRIRPVNIQQL---ASVTVTPRAFLGTVEGSIYLFAI 1045

Query: 949  LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNE-KKTVDAKNFLDGDLIESFLDLS 1007
            +  E   FL +LQ  +   ++ +G +   ++R F +  ++  +   F+DG+LIE FL+  
Sbjct: 1046 INPEHQDFLMRLQATMAGKVESLGEMPFNEFRGFRSMVREATEPYRFVDGELIEQFLNCE 1105

Query: 1008 RTRMDEISKTMN-VSVEELCKRVEELTRLH 1036
                +EI  ++  ++V E+   +E L RLH
Sbjct: 1106 PELQEEIVNSVGMMNVHEVKVMIEALRRLH 1135


>gi|367033707|ref|XP_003666136.1| hypothetical protein MYCTH_2310607 [Myceliophthora thermophila ATCC
            42464]
 gi|347013408|gb|AEO60891.1| hypothetical protein MYCTH_2310607 [Myceliophthora thermophila ATCC
            42464]
          Length = 1043

 Score =  414 bits (1063), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 310/1047 (29%), Positives = 525/1047 (50%), Gaps = 94/1047 (8%)

Query: 70   IIDPDCRLIGLHLYDGLFKVIPF---DNKGQLKEAFNIRLEELQVLDIKFLYG-CAKPTI 125
            ++DP  R + +HL++G+  ++      NK  L     +RL EL +    FLY     P I
Sbjct: 11   LVDPTGRFLAMHLWEGVLTMMRLGDRKNKMVLTWMDQVRLSELFIRASTFLYTETGHPKI 70

Query: 126  VVLYQDNKDARHVK--TYEVALKDKDFVEGPWSQNN--------LDNGADLLIPV----- 170
              LYQ   D+   K  TY +   D++     ++            D+ A +LIPV     
Sbjct: 71   AFLYQSRADSGDAKLATYRMTADDRNTEASKFNPERDREIDMDVPDSSASMLIPVRKVEE 130

Query: 171  -------------PPPLCGVLIIGEETIVYCSANAFKAIPIRPSITKA---YGRVDADGS 214
                            + G++++GE  ++Y   +    + ++ ++ +A       + +  
Sbjct: 131  GVKRHNVRNVESAKAHIGGLVVVGETRLLYI--DEVTKVTVQSALKEASIFVAWAEYNAV 188

Query: 215  RYLLGDHAGLLHLLVITHEKEK---VTGLKIELLGETSIASTISYLDNAVVYIGSSYGDS 271
             Y L D  G LHLL +  E      VT L +  +G+TS AS + YL N ++++GS YGDS
Sbjct: 189  NYFLADDYGNLHLLTLVTENADSVVVTSLDVNWIGKTSRASHLVYLGNDLLFVGSHYGDS 248

Query: 272  QLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQG------------QGQVVTCS 319
            QL +L+LQ +   + +++++   N+GPI+DF ++D+  +G            Q ++VTCS
Sbjct: 249  QLFRLDLQNEDPKALLQLVQTLHNIGPILDFAIMDMGNRGDEGQLGNEYSSGQARIVTCS 308

Query: 320  GAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAM 378
            G +KDGSLR VR+G+G+ +   + +L+  +G++SL+S    P    L VSF++ETR+   
Sbjct: 309  GVHKDGSLRSVRSGVGLEDVGILADLEHCRGLFSLKS-YGAPKTDILAVSFLTETRVFKF 367

Query: 379  NLEDELEETE-IEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGY 437
            + + ++EE E   G     QTL   D    +L+QVT  +  ++ + S      WK     
Sbjct: 368  DAQGDVEELESFAGLAFDQQTLLATDLQGGRLLQVTPAAATVLDTESGLTVASWKPGDQR 427

Query: 438  SVNVATANASQVLLATGGGHLVYLEIGDGI-LTEVKHAQLEYEISCLDINPIGENPSYSQ 496
            S+  A+AN+  +LL+  G  LV L I D + +   +    + +I+C+ +    +N     
Sbjct: 428  SIINASANSKSLLLSVDGTELVSLSISDDLQVVRTRDISKQDQIACIHVPAQPDN----- 482

Query: 497  IAAVGMWTDISVRIFSLPDLNLITKEHL----GGEIIPRSVLLCAF--EGIS--YLLCAL 548
            +  VG W+  +V I  L  L  I  E L        IPR + L      G+S   L  A+
Sbjct: 483  VGVVGFWSSGTVSIIDLHTLEPIHGESLRTSQDDASIPRDLALVQILPPGVSGPTLFVAM 542

Query: 549  GDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTH-VFAASDRPTVIYSSNKK 607
             DG+++ F +  K   L+ RK+V LG +   L      +  + +FA ++ P++IY S  +
Sbjct: 543  QDGNVVTFNIT-KDLALSGRKRVILGMRQARLHLLPQPDGIYSIFATTEHPSLIYGSEGR 601

Query: 608  LLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRIC 667
            ++YS V  +E +++CPF++ AFP+ +A+A + +L I  ID  ++ H++ + + E  RRI 
Sbjct: 602  IVYSAVTAEEATYICPFDAQAFPECIALATDKQLKICHIDRERRTHVKPLAMNEMVRRIA 661

Query: 668  HQEQSRTFAICSLKNQSCA-EESEMHFVRLLDDQTFEFIS-TYPLDTFEYG----CSI-- 719
            +  + + F +  +K +  A EE      +L+D+  F+ +  ++PL    Y     C +  
Sbjct: 662  YSPKEKVFGLGCIKRELVAGEEVVQSSFKLVDEVIFDRVGRSFPLGAPSYTELVECVVRA 721

Query: 720  -LSCSFSDDSNVYYCVGTAYVLPEENEP---TKGRILVFIVE-DGKLQLIAEKETKGAVY 774
             L  S+ + +   + VGT+++   +  P    +GR+L F V+ D    LI   E KGA  
Sbjct: 722  ELRDSYGNPAE-RFIVGTSFLPDPDYGPGTDVRGRMLFFGVDADRNPYLILSHELKGACR 780

Query: 775  SLNAF-NGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDL 833
             +    +G ++A + + + + ++      T EL     +        +  RG+ I V DL
Sbjct: 781  CVAVMEDGTIVAGLTKTVVVCRYEETSSTTGELTRLASYRPSSYPAEICVRGNTIAVADL 840

Query: 834  MKSISLLIYKHEEGA--IEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGA 891
            MKS+SL+ Y   +G   + ERAR + + W +AV  + +  +L A+   NL  +R+N EG 
Sbjct: 841  MKSVSLVEYVPGDGTPRLVERARHFGSVWATAVGFVREGSWLEADAQGNLMVLRRNVEGV 900

Query: 892  TDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASL-P 950
            T E+R R+E+  E +LGE VNR R    V   P + V  +P    GTV G I +  ++ P
Sbjct: 901  TAEDRRRMEITSEMNLGEMVNRIREVE-VETTPGALV--VPRAFLGTVEGGIYMFGTVAP 957

Query: 951  HEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAK-NFLDGDLIESFLDLSRT 1009
            H Q L L + Q  L  V+K  G +    +R+F N ++  D    FLDG+L+E FLD+   
Sbjct: 958  HVQDLLL-RFQGKLADVLKTAGDIPFRSYRAFRNAEREGDGPFRFLDGELLERFLDVDEA 1016

Query: 1010 RMDEISKTMNVSVEELCKRVEELTRLH 1036
              + I K +  SVE++   VEEL R+H
Sbjct: 1017 TQEIICKGLGPSVEDMRNMVEELRRMH 1043


>gi|302503607|ref|XP_003013763.1| hypothetical protein ARB_07875 [Arthroderma benhamiae CBS 112371]
 gi|291177329|gb|EFE33123.1| hypothetical protein ARB_07875 [Arthroderma benhamiae CBS 112371]
          Length = 1148

 Score =  412 bits (1059), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 326/1117 (29%), Positives = 544/1117 (48%), Gaps = 109/1117 (9%)

Query: 6    IYGRIATLELFRPHGEA-----QDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRI 59
            IYGRI  L+   P   A      D LF+ T++Y +  + WD+   +L T R   D++D  
Sbjct: 55   IYGRITLLKKL-PRSTAGGLALTDALFVGTDQYAYFSITWDSVHEQLRTERKYIDLADGS 113

Query: 60   GRPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKG---------------------QL 98
             R   +     IDP    + L +Y+G+  + P                          QL
Sbjct: 114  LREAHSDDRCQIDPSGSFLTLEVYEGVVSIFPLVTADSHKRAKSAVSASASSASTPVEQL 173

Query: 99   KEAFNIRLEELQVLDIKFLYGCAK--PTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWS 156
             E   +R+EEL V    FL   A   P   +LY+D +    +K  ++          P S
Sbjct: 174  GEPLQVRIEELMVRSSAFLDQEASNTPRFALLYEDTQGKVKLKLRDLKYTHAIITSDPGS 233

Query: 157  QNNL----------DNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIRP----SI 202
               L          D GA +LIPVP PL G+LIIGE +I Y   +  + I  RP    ++
Sbjct: 234  AAELKDVTTLSDEVDLGASILIPVPRPLGGLLIIGESSIKYVDVSRNETIS-RPLAESTV 292

Query: 203  TKAYGRVDADGSRYLLGDHAG-LLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAV 261
              A+ +VD  G R+LL D  G L  L+++      V   K++ LG TS AS + YLD  +
Sbjct: 293  FVAWEQVD--GQRWLLADDYGRLFFLMLVLDADNAVDTWKVDFLGVTSRASVLVYLDGGI 350

Query: 262  VYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQ---------GQ 312
            V++GS  GDSQ+I++      +G + ++++   N+ PI+DF V+D+  +         GQ
Sbjct: 351  VFVGSHQGDSQVIQIK-----EGGF-DLVQTISNIAPILDFTVMDMGDRSGATREFSSGQ 404

Query: 313  GQVVTCSGAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLRSSTDDPFDTFLVVSFIS 371
             ++VT SGA+ DGSLR VR+G+GI E   +  ++ I  +W+LRS+  +PF   L+VSF++
Sbjct: 405  TRIVTGSGAFGDGSLRSVRSGVGIEELGVLASMEHITDLWTLRSACPEPFFDTLLVSFVN 464

Query: 372  ETRILAMNLEDELEETE--IEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRN 429
            ETR+   + + ++EE E    G      TL   +   N+++QVT  + R +   S  +  
Sbjct: 465  ETRVFHFSADGDVEEKEDGFLGLVFSQSTLLATNIPGNRILQVTESTSRAIDVDSGMII- 523

Query: 430  EWKSP-PGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDINPI 488
             W+S    +++  A+AN   ++L  GG  LV + +    L   +  + + ++S + I   
Sbjct: 524  -WRSSYEEFTITSASANDDYLVLVLGGVRLVCISLSTFELVGSRDFEADNQVSGMTIPA- 581

Query: 489  GENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLG--GEIIPRSVLLCAF--EGISYL 544
                S +Q   V +     + I +LP+L +  K+ LG  GE IPRSV++          L
Sbjct: 582  ----SPTQACIVCLPQSAEIIILNLPELEVKNKQALGEPGEAIPRSVIVAEILPNKPPTL 637

Query: 545  LCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTH-VFAASDRPTVIYS 603
              ++ DG + +F  ++    + +  K++LG++  + +     N  + VFA  D P++I++
Sbjct: 638  FVSMADGTVFSFSFDVHAFAIFNSSKITLGSEQPSFKKLPRGNGQYNVFATCDHPSLIHA 697

Query: 604  SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHP 663
            S  +++YS V+    S +C  N+ A+P S+A++ + EL I  +D+ +   I ++P+    
Sbjct: 698  SEGRIVYSAVDSASASRICSLNTQAYPGSIALSSQNELKIAIVDEERTTQIHTLPMHASV 757

Query: 664  RRICHQEQSRTFAICSLKNQ--SCAEESEMHFVRLLDDQTFEFISTYPL----------- 710
            RR+ +    + F + ++K +  +  EE    FV L D+  F  +STY L           
Sbjct: 758  RRLAYSPMEKAFGLGTVKRKISNGVEEVSSSFV-LADEILFRPLSTYDLRPDELVECVIR 816

Query: 711  DTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGK-LQLIAEKET 769
                YG   +  S S D    + VGTA++    ++  +GRIL+F V   + L LI EK  
Sbjct: 817  SQLNYGKDEVGNSISKD---LFFVGTAFLDDVGDDHIRGRILIFEVNRSRELSLIVEKSL 873

Query: 770  KGAVYSLNAFNGKLL-AAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFI 828
             GA  +L   +  LL A + + + ++K      G   L+    +      + +   GD +
Sbjct: 874  MGACRTLAVMDHTLLVAGLVKSVSVFKLARDRFGNILLEKHTAYRTSTAPIDISVVGDTV 933

Query: 829  VVGDLMKSISLLIY-KHEEG----AIEERARDYNANWMSAVEILDDDIYLGAENNFNLFT 883
             V D+MKS+SL+ Y + EEG      EE AR Y   W +AV  +++++YL A+   NL  
Sbjct: 934  AVADVMKSMSLVQYTQAEEGEQEPKFEEVARHYQTLWSTAVAPIEENVYLLADAEGNLVV 993

Query: 884  VRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVI 943
            +++N  G T+ +R RL+   E  LGE VNR  H  +V    ++ V     ++   V+G I
Sbjct: 994  LQQNITGVTESDRKRLQPTSEIRLGEMVNRI-HPIVVQTHTETAVSA-RALLATQVDGSI 1051

Query: 944  GVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAK-NFLDGDLIES 1002
             +   +       L +LQT +  +    G +   ++R+F    +  D    F+DG+LIE 
Sbjct: 1052 YLFGLINPAYIDLLLRLQTAMGSITISPGEIPFPKYRAFRTTVRQSDEPFRFVDGELIER 1111

Query: 1003 FLDLSRTRMDEISKTMN---VSVEELCKRVEELTRLH 1036
            FL  +    +EI   ++   V+V  L + +EEL R+H
Sbjct: 1112 FLSCTPAMQEEIVGRLDDSTVTVSSLKEMIEELRRMH 1148


>gi|310796681|gb|EFQ32142.1| CPSF A subunit region [Glomerella graminicola M1.001]
          Length = 1163

 Score =  411 bits (1056), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 320/1128 (28%), Positives = 546/1128 (48%), Gaps = 117/1128 (10%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPTD 64
            +YG I  L+  RP     D LF+ T+R+++   +W   +  L T + + D ++   R   
Sbjct: 56   VYGTILLLQRLRPKDSNADLLFVGTDRFQYFTARWGPGTQRLHTEQVIEDAAEPHMRDAQ 115

Query: 65   NGQIGIIDPDCRLIGLHLYDGLFKVIPFDN-KGQ---LKEAF-NIRLEELQVLDIKFL-Y 118
            +    ++DP  R + +HL++G+  V+   N KGQ   L  A+  +RL EL V    F+  
Sbjct: 116  SQDKCLVDPTGRFMAMHLWEGVMNVMRLGNRKGQFARLDGAWEQVRLSELFVKASAFVPT 175

Query: 119  GCAKPTIVVLYQ---DNKDARHVKTYEVALKDKDF--------VEGPWSQNNLDNGADLL 167
                PTI  LYQ   D +DAR +  Y +   DK+          +  +  +  D  A ++
Sbjct: 176  DTGNPTIAFLYQSNIDKEDAR-LAIYRLMGDDKNTNVSRFDPSKDREYELDIKDPYARII 234

Query: 168  IPVP------------------PPLCGVLIIGEETIVYCSANAFKAIP---IRPSITKAY 206
            IPVP                    L G++++GE  +VY        +      P+I  A+
Sbjct: 235  IPVPIIEDDVKRYHKRDTTGSKAQLGGLVVVGETLLVYVDTLTGTVVEEGLDSPAIFVAW 294

Query: 207  GRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGS 266
               D   + Y L D  G LHLL I  E   VTGL + +LG TS AS + Y+ + ++++GS
Sbjct: 295  AAYDH--TNYFLSDDYGNLHLLTIQTEGVVVTGLFMRMLGVTSRASCLVYMGDGMLFLGS 352

Query: 267  SYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLER-----------QGQGQV 315
             YGDSQL+ ++++ +      ++++   N+ PI+DF ++DL              GQ ++
Sbjct: 353  HYGDSQLLSVSVETET----TKLVQTIPNIAPILDFSIMDLGNAGDSQVGNAFSSGQARI 408

Query: 316  VTCSGAYKDGSLRIVRNGIGINEQASVE-LQGIKGMWSLRSSTDDPFDTFLVVSFISETR 374
            V   G +++GSLR +R+ +G+ +   ++ L+ ++G++SLRS      DT LVVSFI+ETR
Sbjct: 409  VAGCGVHQNGSLRSIRSSVGLEDIGVLDDLRDVRGLFSLRSYGSAKVDT-LVVSFITETR 467

Query: 375  ILAMNLEDELEET-EIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKS 433
            +   + E  +EE  + +G      TL         L+Q+T+ +V ++ S S    + W  
Sbjct: 468  VFRFDPEGGIEEVFDFQGLVLDRPTLVATTLPSGHLLQITATAVTVLDSESGVTLSTWAV 527

Query: 434  PPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKH-----AQLEYEISCL----D 484
            P G ++  A+AN    LL+  G  LV L + D +  + +         E +ISC+    D
Sbjct: 528  PDGKTIVNASANNKWALLSISGTTLVSLNLSDNLSAQEQVLCRPIGGQEDQISCIHAARD 587

Query: 485  INPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGE----IIPRSVLLCAFEG 540
            ++ IG          VG W   SV I  L  L+ +  E +        +PR + L     
Sbjct: 588  LDDIG---------VVGFWATGSVSIIHLGTLHALQGETVKQTDDSVSVPRDLALVQLHP 638

Query: 541  ISYL----LCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTF---SSKNTTHVFA 593
               L      A+ DG +++F ++ +   L+ RK V+LG+Q   L        +  +++FA
Sbjct: 639  PHLLGPTLFVAMEDGQVVSFNVSKEDFSLSSRKSVTLGSQQAGLHILPRPGDEGISNIFA 698

Query: 594  ASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLH 653
             ++  ++IYSS  +++YS    ++V+++ PF+S AFPD++ +A    + I  ID  ++ H
Sbjct: 699  TTEHSSLIYSSEARVMYSAATAEDVTYIAPFDSEAFPDAIFLATNRNIRIAHIDPERRTH 758

Query: 654  IRSIPLGEHPRRICHQEQSRTFAICSLKNQ-SCAEESEMHFVRLLDDQTFEFIST-YPLD 711
            +  + L E  RR+ +    R F I ++  + +  EE      +L+D+     +   + LD
Sbjct: 759  VNPLQLRETVRRVAYSPALRAFGIGTIHRELTNNEEVVTSSFQLVDEIVLGVVGKPFHLD 818

Query: 712  ---TFEYGCSILSCSFSDDSNV---YYCVGTAYVLP---EENEPTKGRILVFIVE-DGKL 761
               + E   S++     D        + VGT+Y+     +E+  TKGRILV  V+ D   
Sbjct: 819  GSISAEMVESVIRAELPDSMGQPAERFIVGTSYLADPDIDESGDTKGRILVLGVDSDKNP 878

Query: 762  QLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYV 821
             LI   E KGA  SL     KL+A +++ + +Y ++     +  L+           + +
Sbjct: 879  YLIVSHELKGACRSLGVMGEKLVAGLSKTVVVYDYVEESTTSGALRKLATFRPSTFPVDI 938

Query: 822  QTRGDFIVVGDLMKSISLLIY----KHEEGAIEERARDYNANWMSAVEILDDDIYLGAEN 877
               G+ I + DLM+S++L+ +       +  + ERAR +   W ++V  L+   +L A+ 
Sbjct: 939  DVHGNMIGIADLMQSLTLVEFVPAQDGNKAKLVERARHFQYIWATSVCHLEGHSWLEADA 998

Query: 878  NFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFG 937
              NL  +R+N    T+ +R ++EV GE+HLGE VN+ R   +    P+ +   IP     
Sbjct: 999  QGNLMVLRRNPNAPTEHDRKQMEVTGEFHLGEQVNKIRSLDIT---PNENDPIIPKAFLA 1055

Query: 938  TVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG--------GLNHEQWRSFNNEKKTV 989
            TV G + V A +  E    L + Q  L +V++ +G        GL+   WR F N K+  
Sbjct: 1056 TVEGSLYVFADIKSEYQSLLIQFQERLAEVVRALGQADGEPGSGLSFTTWRGFRNAKRAA 1115

Query: 990  DAK-NFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRLH 1036
            +    F+DG+LIE FLDL   + + + + +  +VE +   VEEL R+H
Sbjct: 1116 EGPFRFVDGELIERFLDLDEAKQEAVVQGLGPTVESMRNLVEELKRMH 1163


>gi|302894051|ref|XP_003045906.1| predicted protein [Nectria haematococca mpVI 77-13-4]
 gi|256726833|gb|EEU40193.1| predicted protein [Nectria haematococca mpVI 77-13-4]
          Length = 1162

 Score =  410 bits (1055), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 315/1125 (28%), Positives = 549/1125 (48%), Gaps = 111/1125 (9%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSEL--ITRAMGDVSDRIGRPT 63
            I+G I+ L+  RP G   D LFI T+R ++  L W+ E  +L  I R + D+S+   R +
Sbjct: 55   IHGTISMLQRLRPKGSETDLLFIGTDRLQYFNLAWNPEMKQLDTIERVIEDLSEPYMRHS 114

Query: 64   DNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNK----GQLKEAFNIRLEELQVLDIKFLYG 119
             +    ++DP  R + +HL++G+  V     +     +L+    +RL EL +    F++ 
Sbjct: 115  QSQNKCLVDPTARFLAMHLWEGVLNVFRLPTRKGSTNKLEILDQVRLTELFMKASTFIHS 174

Query: 120  -CAKPTIVVLYQ---DNKDARHVKTYEVALKDKDFVEG---PWSQNNLD-----NGADLL 167
                PTI  LY+   + ++AR +  Y +   DK  V     P     LD       A +L
Sbjct: 175  RTGHPTIAFLYKSQMEQEEAR-LAIYRLTHDDKGGVVSKFDPHKDRELDVVIPDPYASML 233

Query: 168  IPVP----------------PPLCGVLIIGEETIVYCSANAFKAIPIR---PSITKAYGR 208
            IPVP                  L G+LI+GE  + Y      +++      P I  A+  
Sbjct: 234  IPVPLDEEKRYHVRNTEGAKAHLGGLLIVGETLLTYYDGLTHRSVSSSLKDPRIFVAWA- 292

Query: 209  VDADGSRYLLGDHAGLLHLLVITHEKEK----VTGLKI--ELLGE----TSIASTISYLD 258
             + DG+ Y L D  G L +L I    E     VTG+ +    LGE    TS AS++ Y+ 
Sbjct: 293  -EYDGTHYFLADDYGRLDMLTIETNVEATGVVVTGMTLVPMKLGESPALTSRASSLVYMG 351

Query: 259  NAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQ-------- 310
            N  +++ S +GDSQL +++ + +A    + +++   N  PI+DF ++D+  +        
Sbjct: 352  NNTLFVASHHGDSQLYQIDPETNA----MLLIKSLSNNAPILDFSIMDMGNREGDAQAGN 407

Query: 311  ----GQGQVVTCSGAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLRSSTDDPFDTFL 365
                GQ ++V   GAY+DGSLR +R+G+G+ E+  + EL G +G+++LRS   D  DT L
Sbjct: 408  AFSSGQSRIVAGCGAYQDGSLRSIRSGVGLEERGILDELDGTRGLFTLRSYNSDLVDT-L 466

Query: 366  VVSFISETRILAMNLEDELEET-EIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTS 424
            VVS I+ETRIL+ + +  +EE    +G    T+TL   +    QL+Q+T  SV L+   S
Sbjct: 467  VVSSITETRILSFDTDGGIEEIYSFQGMEQDTETLLASNLPNGQLLQITPKSVVLLDPES 526

Query: 425  RELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEY------ 478
                + W  P G S+  A+AN    LL+  G  LV L +   +   V+  Q E       
Sbjct: 527  GVAVSRWDVPTGKSITRASANTKWALLSVDGTSLVSLNLLQNLAVNVQQTQAEPGSQQPD 586

Query: 479  EISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHL----GGEIIPRSVL 534
            +ISC  I+   + P    I  VG W+   + +  +     +  E +        +PR + 
Sbjct: 587  QISC--IHAARDPP---DIGVVGWWSSGRISLIDMATFQPLHGESMRQTDDSATVPRDIA 641

Query: 535  LCAFEG--IS--YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKN-TT 589
            L       IS   LL A+ DG ++ F +++K   ++ RK V+LG+ P  L      + T+
Sbjct: 642  LVQLHPPEISGPTLLVAMEDGTVVTFNVSIKGFAVSGRKSVTLGSNPARLHVLPQDDGTS 701

Query: 590  HVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDI 649
             VF  ++  ++IYSS  ++++S     + + + PF+S AFPDS+ ++ +  + I  +D  
Sbjct: 702  SVFVTTEHASLIYSSEGRIIFSATTADDATFVAPFDSHAFPDSIVLSTDSHIRICHVDKE 761

Query: 650  QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQ-SCAEESEMHFVRLLDDQTFEFISTY 708
            +  H++++P+ E  RR+ +    + F + S+K +    EE      RL+D+  F+ +   
Sbjct: 762  RLTHVKALPVHETVRRVAYSPGLKAFGLGSIKKELVLNEEVVTSSFRLVDEIVFKELGEP 821

Query: 709  PLDTFEYGCSILSCSFSDD-----SNV--YYCVGTAYV---LPEENEPTKGRILVFIVED 758
             L        I+ C    +      N+   + VGT+++     +E   T GRILV  V++
Sbjct: 822  FLLNASSTLEIVECVIRAELPDAGGNLTERFIVGTSFINDGQVQEANGTLGRILVLGVDE 881

Query: 759  GK-LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHIL 817
             + +  I     KG    L   +  ++A +++ + +Y +      +  L+    +    L
Sbjct: 882  HRQVYQIVSHNLKGPCRCLGMMDDYIVAGLSKTVVVYNYSQDTSSSGSLEKLAAYRPAAL 941

Query: 818  ALYVQTRGDFIVVGDLMKSISLLIY----KHEEGAIEERARDYNANWMSAVEILDDDIYL 873
             + +   G+ I VGDLM+S+SL+ +       +  +EERAR Y   W +++  LD++ +L
Sbjct: 942  PVDLDISGNMIGVGDLMQSLSLVEFIPAQDGRKAKLEERARHYEPIWTTSLCHLDEERWL 1001

Query: 874  GAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQI-P 932
             A++  NL  +++N++  T+++R RLEV  E  +GE +NR R     + +P  D   + P
Sbjct: 1002 EADSQGNLIVLQRNADAPTEQDRSRLEVTSEIGIGEQINRIRK----LHVPAGDNSIVHP 1057

Query: 933  TVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAK 992
                 +  G + +   +  +    L   Q+ + + I   G +  + WRSF NE +  D  
Sbjct: 1058 RAFLASAEGSLYLYGDIAPQYQDLLMTFQSKMEEYIHAPGNIEFKLWRSFRNENRESDGP 1117

Query: 993  -NFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRLH 1036
              F+DG+++E FLD+   + + + + +  SVE++   +EEL R+H
Sbjct: 1118 YRFIDGEMVERFLDMDEGKQELVCEGLGPSVEDMRNLIEELRRMH 1162


>gi|358400469|gb|EHK49795.1| hypothetical protein TRIATDRAFT_146031 [Trichoderma atroviride IMI
            206040]
          Length = 1161

 Score =  410 bits (1054), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 306/1123 (27%), Positives = 550/1123 (48%), Gaps = 108/1123 (9%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSEL--ITRAMGDVSDRIGRPT 63
            I+G I  L+  +P   A D LFI T+R ++  + W+ E+++L  + +A+ D ++   R +
Sbjct: 55   IHGTIDMLQRLQPKDSATDLLFIGTDRLQYFNVAWNPETNQLDAVEQAIEDTAEPYMRQS 114

Query: 64   DNGQIGIIDPDCRLIGLHLYDGLFKVIPFD-NKG---QLKEAFNIRLEELQVLDIKFLYG 119
             +    ++DP  + + +HL++G+  V      KG   +L+    +RL EL +    FL+ 
Sbjct: 115  QSQNRCLVDPTGKFMAMHLWEGVLNVFRLRIRKGSTTRLEGLDQVRLTELWMKTSTFLHS 174

Query: 120  -CAKPTIVVLYQ---DNKDARHVKTYEVALKDKDFVEG---PWSQNNLDNG-----ADLL 167
                P I  LY+   D ++AR +  Y +   DK  V     P     LD       A +L
Sbjct: 175  RTGHPRIAFLYKNQLDREEAR-IAVYRLTEDDKGGVSSKFDPQKDRELDQVIPDPYASML 233

Query: 168  IPVP----------------PPLCGVLIIGEETIVYCSANAFKAIPIR---PSITKAYGR 208
            IPVP                  L G+L++GE  + Y  +  +  +      P I  A+  
Sbjct: 234  IPVPVYEEKRYHVRNNEGARAHLGGLLVVGETLLTYFDSLTYSNVCSTLDDPKIYVAWA- 292

Query: 209  VDADGSRYLLGDHAGLLHLLVITHEKEK----VTGLKIELLG------ETSIASTISYLD 258
             + DG+RY L D  G L LL I    E     VTG+ +  +        TS AS++ Y+ 
Sbjct: 293  -ECDGTRYFLADDYGRLDLLEIKTTNESTGVVVTGMTVHPMAFGDSSRYTSRASSLVYMG 351

Query: 259  NAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQ-------- 310
            N +++IGS +GDSQL+ +    D +   + +++   N  PI+DF ++DL  +        
Sbjct: 352  NNLLFIGSHHGDSQLLHV----DIESQQMTLIKVISNNAPIMDFTIMDLGNREGDAQSGN 407

Query: 311  ----GQGQVVTCSGAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLRSSTDDPFDTFL 365
                GQ ++V  SGAY+DGSLR +R+G+G+ ++  + E++G +G+++LRS      DT +
Sbjct: 408  TFSSGQARIVAGSGAYQDGSLRSIRSGVGLEDRGLLDEIEGTRGLFTLRSVDSAKADT-V 466

Query: 366  VVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSR 425
            V+S ++ TR+L    ++  E    +G   ++ TL   +    Q++Q+T  +V L+   S 
Sbjct: 467  VISTLAGTRVLRFEPDNIEELFSFQGMDLESPTLLAANLPSGQILQITPQAVNLLDPDSG 526

Query: 426  ELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEY------E 479
               + W++P G ++  A+AN    LL+  G  LV L + D +    K A  +       +
Sbjct: 527  VSVSSWQAPEGKTITAASANTKWALLSVDGSILVSLNLLDNLKATTKDASQDSVSGRPDQ 586

Query: 480  ISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHL----GGEIIPRSVLL 535
            ISC+       +P       +G WT  ++ +  +  L  +  E L        +PR V L
Sbjct: 587  ISCIHA---ARDPQ--DFGVIGWWTSGTISVVDMATLTPLHGEPLRQTDDSASVPRDVAL 641

Query: 536  CAF----EGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKN-TTH 590
                        +L AL DG++++F +++K   ++ RK V+LG+ P  L     ++   +
Sbjct: 642  VQLHPPESSGPTMLVALEDGNVISFNVSVKGFAISGRKTVTLGSGPARLHVLPREDGICN 701

Query: 591  VFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQ 650
            VFA ++  ++IYSS  +++YS     + + + PF+S AFPDS+ ++ E  + I  +D+ +
Sbjct: 702  VFATTEHASLIYSSQGRIVYSATTAADATFVAPFDSEAFPDSIVLSTEDHIRICHVDNER 761

Query: 651  KLHIRSIPLGEHPRRICHQEQSRTFAI-CSLKNQSCAEESEMHFVRLLDDQTFEFISTYP 709
              H++++P+ E  RR+ +    R F + C  K     EE     V+L+D+  F+ + T  
Sbjct: 762  LTHVKALPMYETVRRVAYSPGLRAFGLGCIKKELVNNEEVITSTVKLVDEIIFQELGTPF 821

Query: 710  LDTFEYGCSILSCSFS---DDSN----VYYCVGTAYVLPE---ENEPTKGRILVFIVEDG 759
            L        ++ C       D+N      + VGT++V  +   E+  TKGRILV  V++ 
Sbjct: 822  LLNASSSLELVECVIRAELPDANGNLAERFLVGTSFVADQGTAESGETKGRILVLGVDES 881

Query: 760  K-LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILA 818
            + L  I     KG    L   N  ++A + + +  Y +      +  L            
Sbjct: 882  RQLYQITSHSLKGVCRCLVMMNNYIVAGLTKTVVAYSYNQDTSTSASLTKLASFRPASFP 941

Query: 819  LYVQTRGDFIVVGDLMKSISLLIYKH----EEGAIEERARDYNANWMSAVEILDDDIYLG 874
            + +   G+ I +GDLM+S++L+ +      ++  +EE+AR Y   W +++  LDD  +L 
Sbjct: 942  VDLDVSGNIIGIGDLMQSLTLVEFTPPQDGKKAKLEEKARHYQQAWTTSISALDDSRWLE 1001

Query: 875  AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTV 934
            A+   N+  +R+N E  T++++ +L+V+ E ++GE +NR R   +    P  +   +P  
Sbjct: 1002 ADAQGNVIVLRQNLEAPTEQDQSQLQVISELNIGEQINRIRKLQVA---PGENAIVVPKA 1058

Query: 935  IFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAK-N 993
              G+  G + +   +  +    L   Q+ L++ I   G L+ + WR+F N+ +  +A   
Sbjct: 1059 FLGSTEGTLYLYGDIAPKYQDLLMTFQSRLQEYISTPGNLSFDLWRAFRNQSREGEAPFR 1118

Query: 994  FLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRLH 1036
            F+DG++IE FLDL   + + + + +  SVE++   +EEL R+H
Sbjct: 1119 FVDGEMIERFLDLDEGKQELVCEGLGPSVEDMRNLIEELRRMH 1161


>gi|342885673|gb|EGU85655.1| hypothetical protein FOXB_03801 [Fusarium oxysporum Fo5176]
          Length = 1160

 Score =  410 bits (1053), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 308/1118 (27%), Positives = 552/1118 (49%), Gaps = 99/1118 (8%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSEL--ITRAMGDVSDRIGRPT 63
            I+G IA L+  RP G   D LFI T+R  +  L W+  + +L  I R + D+++   R +
Sbjct: 55   IHGSIAMLQCLRPKGSETDLLFIGTDRLHYFNLVWNPLTKQLETIERVIEDLAEPYMRHS 114

Query: 64   DNGQIGIIDPDCRLIGLHLYDGLFKVIPFD-NKG---QLKEAFNIRLEELQVLDIKFLYG 119
             +    ++DP  R + +HL++G+  V      KG   +L+    +RL EL +    F++ 
Sbjct: 115  SSQNKCLVDPTGRFLAMHLWEGVLNVFKLPIRKGSTNKLERLDQVRLTELFMKASTFIHS 174

Query: 120  -CAKPTIVVLYQ---DNKDARHVKTYEVALKDKDFVEG---PWSQNNLD-----NGADLL 167
                PTI  LY+   + ++AR V  Y +   DK        P     LD       A +L
Sbjct: 175  RTGHPTIAFLYKTQLEQEEARLV-IYRLTHDDKGNTVSKFDPHKDRELDVVIPDPYASML 233

Query: 168  IPVP----------------PPLCGVLIIGEETIVY---CSANAFKAIPIRPSITKAYGR 208
            IPVP                  L G+L+IGE  + Y    +  +  ++   P I  ++  
Sbjct: 234  IPVPLDEEKRYHVRNTEGAKAHLGGLLVIGETLLTYFDGLTHRSVSSVLQDPRIFVSWA- 292

Query: 209  VDADGSRYLLGDHAGLLHLLVITHEKEK----VTGLKIELL------GETSIASTISYLD 258
             + DG+ YLL D  G L LL I    E     VTG+ +E L        TS AS + YL 
Sbjct: 293  -EYDGTHYLLADDYGRLDLLTIDTNLETTGVVVTGMTLEPLKIGRSPAITSRASNLVYLG 351

Query: 259  NAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQ-------- 310
            ++ +++ S +GDSQL ++    D + + V +++ + N  PI+DF ++D+  +        
Sbjct: 352  DSTLFVASHHGDSQLYQI----DVESATVTLVQSFSNNAPILDFSIMDMGNREGDAQAGN 407

Query: 311  ----GQGQVVTCSGAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLRSSTDDPFDTFL 365
                GQ ++V   GAY+DGSLR +R+G+G+ ++  + EL+G +G+++LRS   D  DT L
Sbjct: 408  AFSSGQSRIVAGCGAYRDGSLRSIRSGVGLEDRGVLDELEGTRGLFTLRSYGSDLVDT-L 466

Query: 366  VVSFISETRILAMNLEDELEET-EIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTS 424
            VVS I+ETR+L+ + E  +EE    +G    T+TL   +    QL+Q+T  SV L+    
Sbjct: 467  VVSAITETRVLSFDREGGIEEIYSFQGMSLDTETLLASNLPNGQLLQITPRSVVLLDPEG 526

Query: 425  RELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLD 484
              + ++W  P G S+  A+AN+   LL+  G  LV L +   +   V+ +Q         
Sbjct: 527  GTVTSKWDVPSGKSITRASANSKWALLSVDGTSLVSLNLLQNLAVNVQQSQNNSGSQADQ 586

Query: 485  INPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHL----GGEIIPRSVLLCAFEG 540
            I+ I        +  VG W+   + +  +  L  +  E +        +PR + L     
Sbjct: 587  ISCIHAARDPPDLGVVGWWSSGQISLIDMASLKPLHGESMRQTEDSATVPRDIALVQLHP 646

Query: 541  --IS--YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKN-TTHVFAAS 595
              IS   LL A+ DG+++ F ++ K   ++ RK V+LG+ P  L     ++ T++VF  +
Sbjct: 647  PEISGPTLLVAMEDGNVVTFNVSTKDFAVSGRKSVTLGSNPARLHILPQQDGTSNVFVTT 706

Query: 596  DRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIR 655
            +  ++IYS+  ++++S     + + + PF+S AFPDS+ ++ +  + I  +D  +  H++
Sbjct: 707  EHASLIYSAEGRVIFSATTADDATFVAPFDSHAFPDSVILSTDQHIRICHVDKERLTHVK 766

Query: 656  SIPLGEHPRRICHQEQSRTFAICSLKNQSCA-EESEMHFVRLLDDQTF-EFISTYPLD-- 711
            ++P+ E  RR+ +    + F + S+K +    EE      RL+D+  F E  S +PLD  
Sbjct: 767  ALPVNETVRRVAYSPGLKAFGLGSIKKELVGNEEVVSSSFRLVDEIVFKELGSPFPLDAS 826

Query: 712  -TFEYGCSILSCSFSDDSNVY---YCVGTAYVLPEENEP--TKGRILVFIVEDGK-LQLI 764
             + E    ++     D    +   + VGT+++     +P  T GRILV  V+  + +  I
Sbjct: 827  SSLEIVECVIRAELPDVGGNHVERFIVGTSFISDGVEDPNGTGGRILVLGVDSNRQVYQI 886

Query: 765  AEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTR 824
                 KG    L   +  ++A +++ +  Y ++     +  LQ    +    L + +   
Sbjct: 887  VSHNLKGPCRCLGMIDDNIIAGLSKTVVAYSFLQETSSSGSLQKLAVYRPAALPVDLDIS 946

Query: 825  GDFIVVGDLMKSISLLIY----KHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFN 880
            G+ I V DLM+S+SL+ +       +  +EERAR +   W ++V  ++ + +L A++  N
Sbjct: 947  GNMIGVVDLMQSLSLVEFIPAQDGNKAKLEERARHFEPLWATSVCHIEGERWLEADSKGN 1006

Query: 881  LFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQI-PTVIFGTV 939
            L  +++N +  T+++R RLE+  E ++GE +NR R     + +P ++ G + P     + 
Sbjct: 1007 LVVLQRNVDAPTEQDRSRLEITSEMNIGEQINRIRK----LHVPMAENGIVHPRAFLASA 1062

Query: 940  NGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAK-NFLDGD 998
             G + +   +  +    L   Q+ + + I   G +  + WRSF NE +  +    F+DG+
Sbjct: 1063 EGSLYLYGDIAPQYQDLLMTFQSKMEEYIHVPGSVEFKLWRSFRNENRESEGPFRFIDGE 1122

Query: 999  LIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRLH 1036
            ++E FLD+   + + + + +  S+E++   +EEL R+H
Sbjct: 1123 MVERFLDMDEGKQELVCEGLGPSIEDMRNLIEELRRMH 1160


>gi|19114492|ref|NP_593580.1| damaged DNA binding protein Ddb1 [Schizosaccharomyces pombe 972h-]
 gi|46395602|sp|O13807.1|DDB1_SCHPO RecName: Full=DNA damage-binding protein 1; AltName:
            Full=Damage-specific DNA-binding protein 1
 gi|2330717|emb|CAB11219.1| damaged DNA binding protein Ddb1 [Schizosaccharomyces pombe]
          Length = 1072

 Score =  410 bits (1053), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 295/1072 (27%), Positives = 549/1072 (51%), Gaps = 96/1072 (8%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITR-AMGDVSDRIGRPTD 64
            I+ +I  ++ F+P     D + +AT+ +++  L WDA  + +     + D S+R  R + 
Sbjct: 55   IFAKIVNVKAFKPVSSPTDHIIVATDSFRYFTLFWDANDNTVSNGIKIQDCSERSLRESQ 114

Query: 65   NGQIGIIDPDCRLIGLHLYDGLFKVIPF----------DNKGQLKEAFNIRLEELQVLDI 114
            +G + ++DP  R+I LH+Y GL  +IP            N   L + F++R++EL V+DI
Sbjct: 115  SGPLLLVDPFQRVICLHVYQGLLTIIPIFKSKKRFMTSHNNPSLHDNFSVRIQELNVVDI 174

Query: 115  KFLYGCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPL 174
              LY  ++P++ VLY+D+K   H+ TY++ +++++  E     ++++ G      +P   
Sbjct: 175  AMLYNSSRPSLAVLYKDSKSIVHLSTYKINVREQEIDEDDVVCHDIEEGK----LIPSEN 230

Query: 175  CGVLIIGEETIVYCSAN--------AFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLH 226
             GV + GE  + Y S +         +      PSI+        D S Y++ D +G+L+
Sbjct: 231  GGVFVFGEMYVYYISKDIQVSKLLLTYPITAFSPSISND-PETGLDSSIYIVADESGMLY 289

Query: 227  LLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSY 286
                    E V+ +++E LGE+SIAS +  L +  +++GS + +S L++L          
Sbjct: 290  KFKALFTDETVS-MELEKLGESSIASCLIALPDNHLFVGSHFNNSVLLQLP-SITKNNHK 347

Query: 287  VEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG 346
            +E+L+ +VN+ PI DF + D   Q    ++TCSGAYKDG+LRI+RN I I   A +E++G
Sbjct: 348  LEILQNFVNIAPISDFIIDD--DQTGSSIITCSGAYKDGTLRIIRNSINIENVALIEMEG 405

Query: 347  IKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDA-- 404
            IK  +S+    +  +D ++ +S I ETR + ++          EG  S    L C ++  
Sbjct: 406  IKDFFSVSFRAN--YDNYIFLSLICETRAIIVSP---------EGVFSANHDLSCEESTI 454

Query: 405  ----IY--NQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHL 458
                IY  +Q++Q+T+  +RL         + W SP   S+   ++ A  V +A  GG +
Sbjct: 455  FVSTIYGNSQILQITTKEIRLFDGKK---LHSWISP--MSITCGSSFADNVCVAVAGGLI 509

Query: 459  VYLEIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWT-DISVRIFSLPDLN 517
            ++ E   GI TEV   Q + E+S L      EN  Y     VG+W+ DI +  +    ++
Sbjct: 510  LFFE---GI-TEVGRYQCDTEVSSLCFTE--ENVVY-----VGLWSADIIMLTYCQDGIS 558

Query: 518  LITKEHLGGEIIPRSVLLCAFEGIS--YLLCALGDGHLLNFLLNMKTGELTDR--KKVSL 573
            L     L    IPRS++     G     L  +  +G++L F  N + G++ +   ++  L
Sbjct: 559  LTHSLKLTD--IPRSIVYSQKYGDDGGTLYVSTNNGYVLMF--NFQNGQVIEHSLRRNQL 614

Query: 574  GTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSL 633
            G  PI L+ F SK    +FA  ++P ++Y  + KL+ + ++  E+ ++  + + +   ++
Sbjct: 615  GVAPIILKHFDSKEKNAIFALGEKPQLMYYESDKLVITPLSCTEMLNISSYVNPSLGVNM 674

Query: 634  AIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCA--EESEM 691
                   +++  + +I+ L+++++ +   PRRIC       F +C    +S    E+  +
Sbjct: 675  LYCTNSYISLAKMSEIRSLNVQTVSVKGFPRRICSNSLFY-FVLCMQLEESIGTQEQRLL 733

Query: 692  HFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRI 751
             F+R+ +  T   I+ +  + +E   SI+    +DD  V   VGT +  P+++ P  GR+
Sbjct: 734  SFLRVYEKNTLSEIAHHKFNEYEMVESII--LMNDDKRV--VVGTGFNFPDQDAPDSGRL 789

Query: 752  LVF-IVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSEC 810
            +VF +  D  +++ AE + +G+V +L  +   ++A IN  + ++++   + GT  +++  
Sbjct: 790  MVFEMTSDNNIEMQAEHKVQGSVNTLVLYKHLIVAGINASVCIFEY---EHGTMHVRNSI 846

Query: 811  GHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDD 870
                + + + V    D I+  DLMKSI++L +  ++  + E ARDY+  W ++VEIL + 
Sbjct: 847  RTPTYTIDISVNQ--DEIIAADLMKSITVLQFIDDQ--LIEVARDYHPLWATSVEILSER 902

Query: 871  IYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQ 930
             Y   E + N   + +++      +R +L    +++LGE +N+ RH + +   P      
Sbjct: 903  KYFVTEADGNAVILLRDNVSPQLSDRKKLRWYKKFYLGELINKTRHCTFIE--PQDKSLV 960

Query: 931  IPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVD 990
             P ++  TV+G + ++          L +LQ N+RKVI   GGL+H++W+ +  E +T  
Sbjct: 961  TPQLLCATVDGSLMIVGDAGMSNTPLLLQLQDNIRKVIPSFGGLSHKEWKEYRGENET-S 1019

Query: 991  AKNFLDGDLIESFLDLSRTRMDEI------SKTMNVSVEELCKRVEELTRLH 1036
              + +DG LIES L L    ++EI         +++SV++L   +E L +LH
Sbjct: 1020 PSDLIDGSLIESILGLREPILNEIVNGGHEGTKLDISVQDLKSIIENLEKLH 1071


>gi|327301962|ref|XP_003235673.1| UV-damaged DNA binding protein [Trichophyton rubrum CBS 118892]
 gi|326461015|gb|EGD86468.1| UV-damaged DNA binding protein [Trichophyton rubrum CBS 118892]
          Length = 1147

 Score =  407 bits (1047), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 325/1119 (29%), Positives = 540/1119 (48%), Gaps = 114/1119 (10%)

Query: 6    IYGRIATLELFRPHGEAQ-----DFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRI 59
            IYGR+  L+   P   A      D LF+ T++Y +  + WD    +L T R   D++D  
Sbjct: 55   IYGRVTLLKKL-PRSTAAGLALTDALFVGTDQYAYFSITWDPVHEQLRTERKYIDLADGS 113

Query: 60   GRPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKG---------------------QL 98
             R   +     IDP    + L +Y+G+  + P                          QL
Sbjct: 114  LREAHSDDRCQIDPSGSFLTLEVYEGVVSIFPLVTADSHKRAKSAVSASAFSASTSVEQL 173

Query: 99   KEAFNIRLEELQVLDIKFLYGCAK--PTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWS 156
             E   +R+EEL V    FL   A   P   +LY+D +    +K  ++          P S
Sbjct: 174  GEPLQVRIEELMVRSSAFLDQEASNTPRFALLYEDTQGKVRLKLRDLKYTHAIITSDPGS 233

Query: 157  QNNL----------DNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIRP----SI 202
               L          D GA +LIPVP PL G+LIIGE +I Y   +  + I  RP    ++
Sbjct: 234  AAELKDVTTLSDEVDLGASILIPVPRPLGGLLIIGESSIKYVDVSRNETIS-RPLAESTV 292

Query: 203  TKAYGRVDADGSRYLLGDHAG-LLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAV 261
              A+ +VD  G R+LL D  G L  L+++      V   K++ LG TS AS + YLD  +
Sbjct: 293  FVAWEQVD--GQRWLLADDYGRLFFLMLVLDADNAVESWKVDFLGVTSRASVLVYLDGGI 350

Query: 262  VYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQ---------GQ 312
            V++GS  GDSQ+I++      +G + ++++   N+ PI+DF ++D+  +         GQ
Sbjct: 351  VFVGSHQGDSQVIQIK-----EGGF-DLVQTISNIAPILDFTIMDMGDRSGATREFSSGQ 404

Query: 313  GQVVTCSGAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLRSSTDDPFDTFLVVSFIS 371
             ++VT SGA+ DGSLR VR+G+GI E   +  ++ I  +W+LRS+  +PF   L+VSF++
Sbjct: 405  TRIVTGSGAFGDGSLRSVRSGVGIEELGVLASMEHITDLWALRSACPEPFFDTLLVSFVN 464

Query: 372  ETRILAMNLEDELEETE--IEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRN 429
            ETR+   + + ++EE E    G      TL   +   N+++QVT    R +   S  +  
Sbjct: 465  ETRVFHFSADGDVEEKEDGFLGLVFSESTLLATNIPGNRILQVTESISRAIDVDSGMII- 523

Query: 430  EWKSP-PGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDI--N 486
             W+S    +++  A+AN   ++L  GG  LV + +    L   +  + + ++S + I  +
Sbjct: 524  -WRSSYEEFTITSASANDDYLVLVLGGTRLVCISLSTFELVGSRDFEADNQVSGMTIPAS 582

Query: 487  PIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLG--GEIIPRSVLLCAF--EGIS 542
            PI       Q   V +     + I  LP L +  K+ LG  GE IPRSV++         
Sbjct: 583  PI-------QACIVCLPQSAEIIILDLPGLEVKNKQTLGEPGEAIPRSVIVAEILPNKPP 635

Query: 543  YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTH-VFAASDRPTVI 601
             L  ++ DG + +F  ++    + +  K++LG++  + +     N  + VFA  D P++I
Sbjct: 636  TLFVSMADGTVFSFSFDVHAFAIFNSSKITLGSEQPSFKKLPRGNGQYNVFATCDHPSLI 695

Query: 602  YSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGE 661
            ++S  +++YS V+    S +C  N+ A+P S+A++ + EL I  +D+ +   I ++P+  
Sbjct: 696  HASEGRIVYSAVDSASASRICSLNTQAYPGSIALSSQHELKIAIVDEERTTQIHTLPMHA 755

Query: 662  HPRRICHQEQSRTFAICSLKNQ--SCAEESEMHFVRLLDDQTFEFISTYPL--------- 710
              RR+ +    + F + ++K +  +  EE    FV L D+  F  +STY L         
Sbjct: 756  SVRRLAYSPTEKAFGLGTVKRKISNGVEEVSSSFV-LADEILFRPLSTYDLRPDELVECV 814

Query: 711  --DTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGK-LQLIAEK 767
                  +G   +  S S D    + VGTA++    ++  +GRILVF V   + L LI EK
Sbjct: 815  IRSQLNHGKDEVGNSISKD---LFFVGTAFLDDVGDDHIRGRILVFEVNRSRELSLIVEK 871

Query: 768  ETKGAVYSLNAFNGKLL-AAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGD 826
               GA  +L   +  LL A + + + ++K      G   L+    +      + +   GD
Sbjct: 872  SLMGACRTLAVMDHTLLVAGLVKSVSVFKLARDRFGNILLEKHTAYRTSTAPIDISVVGD 931

Query: 827  FIVVGDLMKSISLLIY-KHEEG----AIEERARDYNANWMSAVEILDDDIYLGAENNFNL 881
             + V D+MKS+SL+ Y + EEG      EE AR Y   W +AV  +++++YL A+   NL
Sbjct: 932  TVAVADVMKSMSLVQYTQAEEGEREPKFEEIARHYQTLWSTAVAPIEENVYLLADAEGNL 991

Query: 882  FTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNG 941
              +++N  G T+ +R RL+   E  LGE VNR  H  ++    ++ V      +  TV+G
Sbjct: 992  VVLQQNITGVTESDRKRLQPTSEIRLGEMVNRI-HPIVIQTYTETAVSA--RALLATVDG 1048

Query: 942  VIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAK-NFLDGDLI 1000
             I +   +       L +LQT +  +    G +   ++R+F       D    F+DG+LI
Sbjct: 1049 SIYLFGLINPTYIDLLLRLQTAMGSITISPGEIPFSKYRAFRTTVHQSDEPFRFVDGELI 1108

Query: 1001 ESFLDLSRTRMDEISKTMN---VSVEELCKRVEELTRLH 1036
            E FL  +    +EI   ++   V+V  L + +EEL R+H
Sbjct: 1109 ERFLSCTPGMQEEIVSRLDDSTVTVSSLKEMIEELRRMH 1147


>gi|295667673|ref|XP_002794386.1| DNA damage-binding protein 1a [Paracoccidioides sp. 'lutzii' Pb01]
 gi|226286492|gb|EEH42058.1| DNA damage-binding protein 1a [Paracoccidioides sp. 'lutzii' Pb01]
          Length = 1195

 Score =  405 bits (1041), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 326/1131 (28%), Positives = 534/1131 (47%), Gaps = 131/1131 (11%)

Query: 15   LFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPTDNGQIGIIDP 73
            LF P     D LFI T+R  +  + W+  +S+L T R   D++D   R +  G   +IDP
Sbjct: 87   LFLPQ---TDVLFIGTDRATYFTVSWNPVTSQLRTERKYVDLADPSSRESQLGDRCLIDP 143

Query: 74   DCRLIGLHLYDGLFKVIPFDNKG-----------------------------------QL 98
              + I L LY+G+  VIP                                        +L
Sbjct: 144  SGKFITLELYEGIITVIPIGQPQRTARQGGRKYGKRAVTAQNQHDSSHIGNTNAAGEVEL 203

Query: 99   KEAFNIRLEELQVLDIKFLYGCAK--PTIVVLYQDNKDARHVKTYEV------------- 143
             E    R++EL V    FL+  A   P +  LY+D      +K  E+             
Sbjct: 204  GEPCQARVDELLVRSSAFLHTQADMLPRMAFLYEDTMGQVRLKVRELEFTYGGMGIGTGG 263

Query: 144  ----------ALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYC--SAN 191
                       LK  D +     +  L+ GA  LIPVP PL G+L++GE +I Y   + N
Sbjct: 264  GAGQDTGCIAVLKALDLL-----KEELEMGASFLIPVPAPLGGLLVLGETSIRYLDDATN 318

Query: 192  AFKAIPI-RPSITKAYGRVDADGSRYLLGDHAG-LLHLLVITHEKEKVTGLKIELLGETS 249
               ++P+   +I  A+ +VD  G R+LL D  G L  L++I  E   V   K++LLG   
Sbjct: 319  ECISLPLDEATIFVAWEQVD--GQRWLLADDYGRLFFLMLILDEDNAVQSWKLDLLGNIP 376

Query: 250  IASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDL-- 307
             AS + YL   V +IGS  GDSQLI++      +GS+ EV++ + N+ PI+DF ++DL  
Sbjct: 377  RASVLVYLGGGVTFIGSHQGDSQLIRIT-----EGSF-EVIQTFSNIAPILDFTIMDLGG 430

Query: 308  ---ERQ------GQGQVVTCSGAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLRSST 357
               E Q      GQ ++VT SGA+ DGSLR VR+G+G+ E   +  ++ I  +W+LR + 
Sbjct: 431  RAGENQTHDFSSGQARIVTGSGAFDDGSLRSVRSGVGMEEVGVLGAMEHITDLWALRVAC 490

Query: 358  DDPFDTFLVVSFISETRILAMNLEDELEET-EIEGFCSQTQTLFCHDAIYNQLVQVTSGS 416
             + F   L+VSF+ E+R+     + E+EE  E  G      TL   +    +++QVT  +
Sbjct: 491  QEGFSDTLLVSFVDESRVFHFTQDGEVEEKDEFMGLGLAESTLLAANLPNGRILQVTERN 550

Query: 417  VRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTEVKHAQ 475
            VR+       L   W  P   ++  A++N   ++L  GG  L+  +I G+  L   K   
Sbjct: 551  VRVADLDDGMLLWNWSPPSQKAITAASSNDDHLVLVVGGQVLMCFDIQGEIKLAGKKDFG 610

Query: 476  LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLG--GEIIPRSV 533
             + ++S + +     +P+   I  +    +++V   SL D+ +     +G  GE  PRSV
Sbjct: 611  DDTQVSGVTV---ASSPATYCILCLPQTAEVAV--MSLADMTIRHSTSVGEPGEAFPRSV 665

Query: 534  LLCAF--EGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGT-QPITLRTFSSKNTTH 590
            L+        + L  ++ DG + +F  N     LT   K+ LG+ QP   +        +
Sbjct: 666  LVAEVLPNQPATLFVSMADGGVFSFSFNADEFTLTKMSKLVLGSEQPSFKKLPRGDGLYN 725

Query: 591  VFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQ 650
            +FA  ++P++IY++  +++YS V+  + S +C FNS A+P S+A+A   EL I  +D  +
Sbjct: 726  IFATCEQPSLIYATEGRIIYSAVHSDQASRICHFNSEAYPGSIALATPTELKIAHVDSER 785

Query: 651  KLHIRSIPLGEHPRRICHQEQSRTFAICSLKN--QSCAEESEMHFVRLLDDQTFEFISTY 708
               I ++ +GE  RR+ +    + F I ++K   ++ AE     F+ L D+  F  +  Y
Sbjct: 786  TTQIHTLEIGETVRRVAYSAAEKAFGIGTIKRTLENGAEVITSRFM-LADEIMFRELDDY 844

Query: 709  PLDTFEYGCSILSCSF-----SDDSNVY---YCVGTAYVLPEENEPTKGRILVFIVEDGK 760
             L   E   S++   F     SD +  +   + VGT+Y+        +GRIL F V   +
Sbjct: 845  SLRPDELVESVIQAQFPEGKDSDGNESFKDMFVVGTSYLDDVGEGSIRGRILAFEVTGSR 904

Query: 761  -LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILAL 819
             L  +AE   KGA  +L     K++AA+ + + +Y     +     L     +      +
Sbjct: 905  QLAKVAELPVKGACRALAVMQDKIVAALMKTVVIYSIAKGELSDYTLNKTASYRTSTAPI 964

Query: 820  YVQTRGDFIVVGDLMKSISLLIYKHEEG----AIEERARDYNANWMSAVEILDDDIYLGA 875
             +   G+ I V DLMKS+S++ +K  E     ++ E AR +   W +AV  + ++++L +
Sbjct: 965  DIAVTGNLIAVADLMKSVSIIEFKQGENDQPDSLTEVARHFQTLWSTAVAPIAENMFLES 1024

Query: 876  ENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQI---P 932
            +   NL  + +N  G TD+++ RLEV  E  LGE VNR R  S+   LP +   +    P
Sbjct: 1025 DAEGNLVVLNQNVNGVTDDDKRRLEVTSEILLGEMVNRIRPVSIQGSLPATGPREAVISP 1084

Query: 933  TVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNE-KKTVDA 991
                GTV G I +   +       L +LQ+ +  ++   G +   ++R+F N  ++  + 
Sbjct: 1085 KAFLGTVEGSIYLFGLINPAYQDLLMRLQSAMAGLVVTPGAMPFNKFRAFKNAVRQAEEP 1144

Query: 992  KNFLDGDLIESFLDLSRTRMDEI------SKTMNVSVEELCKRVEELTRLH 1036
              F+DG+LIE F        +EI           V+VE+  + VEEL R+H
Sbjct: 1145 YRFVDGELIERFWTCEAALQEEIVGLVVAGGVAGVTVEKTKRIVEELRRMH 1195


>gi|46121747|ref|XP_385428.1| hypothetical protein FG05252.1 [Gibberella zeae PH-1]
          Length = 1161

 Score =  404 bits (1039), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 302/1123 (26%), Positives = 548/1123 (48%), Gaps = 108/1123 (9%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSEL--ITRAMGDVSDRIGRPT 63
            I+G I+ L+  RP G   D LFI T+R ++  L W+  + +L  I R + D+S+   R +
Sbjct: 55   IHGTISMLQRLRPKGSETDLLFIGTDRLQYFNLIWNDATKQLETIERVIEDLSEPYMRQS 114

Query: 64   DNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNK----GQLKEAFNIRLEELQVLDIKFLYG 119
             +    ++DP  R + +HL++G+  V     +     +L+    +RL EL +    F Y 
Sbjct: 115  QSQNKCLVDPTGRFLAMHLWEGVLNVFKLPTRKGSTNKLEVLDQVRLTELFMKASTFTYS 174

Query: 120  CAK-PTIVVLYQDNKDARHVK--TYEVALKDKDFVEG---PWSQNNLD-----NGADLLI 168
                PTI  LY+   D    +   Y +   DK        P     LD       A +LI
Sbjct: 175  YTNHPTIAFLYKTQMDQEETRLVIYRLTHDDKGNTVSKFDPHKDRELDIEIPDPYASMLI 234

Query: 169  PVP----------------PPLCGVLIIGEETIVYCSANAFKAIPI---RPSITKAYGRV 209
            PVP                P L G++++GE  + Y      +++      P I  A+   
Sbjct: 235  PVPLDEEKRYNVRNNQGAKPHLGGLVVVGETLLTYYDGQTHRSVSAGLKDPRIFVAWAEY 294

Query: 210  DADGSRYLLGDHAGLLHLLVITHEKEK----VTGLKIELL------GETSIASTISYLDN 259
            D+  +RYLL D  G L LL I    E     VTG+ +E          TS AS++ YL N
Sbjct: 295  DS--TRYLLADDYGRLDLLTIRTNLEPTGVVVTGMTVEPFEFDNSPAITSRASSLVYLGN 352

Query: 260  AVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQ--------- 310
              +++ S +GDSQL ++N+  D K + +  ++ + N  PI+DF ++D+  +         
Sbjct: 353  GNLFVASHHGDSQLYQINI--DTKAAML--VQSFSNNAPILDFSIMDMGNREGDTQSGNV 408

Query: 311  ---GQGQVVTCSGAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLRSSTDDPFDTFLV 366
               GQ ++V   GAY+DG+LR +R+G+G+ ++  + EL+G +G+++L S   +  DT LV
Sbjct: 409  FSSGQSRIVAGCGAYRDGTLRSIRSGVGLEDRGVLDELEGTRGLFTLHSYGSEMVDT-LV 467

Query: 367  VSFISETRILAMNLEDELEET-EIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSR 425
            VS I+ETRIL+ + E  +EE    +G    ++TL   +    QL+Q+T  SV L+     
Sbjct: 468  VSSITETRILSFDFEGGIEEIYSFQGMSLDSETLLASNLPSGQLLQITPKSVVLLDPEGG 527

Query: 426  ELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEY-----EI 480
               ++W  P G ++  A+AN+   LL+  G  LV L +   +    +    +      +I
Sbjct: 528  TTISKWDVPDGKTITRASANSKWALLSVDGTSLVSLNLLQNLAVNAQQINNDSTSQPDQI 587

Query: 481  SCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHL----GGEIIPRSVLLC 536
            SC  I+   ++P    +  VG W+   + +  +  L  +  E +        +PR V L 
Sbjct: 588  SC--IHAARDSP---DLGVVGWWSSGQISLIDMASLKPLHGESMRQTEDSATVPRDVALV 642

Query: 537  AFE--GIS--YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKN-TTHV 591
                  IS   LL A+ DG+++ F ++ K   ++ RK V+LG+ P  L     ++ T++V
Sbjct: 643  QLHPPDISGPTLLVAMEDGNIVTFNVSTKGFSVSGRKSVTLGSNPARLHILPQEDGTSNV 702

Query: 592  FAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQK 651
            F  ++  ++IYSS  ++++S     + + + PF+S AFPDS+ ++ +  + I  +D  + 
Sbjct: 703  FVTTEHASLIYSSEGRIIFSATTADDATFVAPFSSHAFPDSVVLSTDQHIRICHVDRERL 762

Query: 652  LHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSC-AEESEMHFVRLLDDQTF-EFISTYP 709
             H++++P+ E  RR+ +    + F + S+K +    EE      RL+D+  F E  S +P
Sbjct: 763  THVKALPVNETVRRVAYSPGLKAFGLGSIKRELVNNEEVVTSSFRLVDEIVFKELGSPFP 822

Query: 710  LD---TFEYGCSILSCSFSD---DSNVYYCVGTAYVLP---EENEPTKGRILVFIVE-DG 759
            L+   T E    ++     D   +    + VGT+++     E+   T GRILV  V+ + 
Sbjct: 823  LNGSSTLELVECVIRAELPDVGGNPTERFIVGTSFISDGGVEDPNGTLGRILVLGVDANR 882

Query: 760  KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILAL 819
            ++  I     KG    L      ++A +++ + +Y +      +  LQ    +    + +
Sbjct: 883  QVYQIVSHNLKGPCRCLGMIEDNIVAGLSKTVVVYSFSQETSSSGSLQKLAAYRPAAIPI 942

Query: 820  YVQTRGDFIVVGDLMKSISLLIY----KHEEGAIEERARDYNANWMSAVEILDDDIYLGA 875
             +   G+ I VGDLM+S+SL+ +       +  ++ERAR + + W +AV  ++ + +L A
Sbjct: 943  DIDVSGNMIGVGDLMQSLSLVEFIPAQDGNKAQLQERARHFESLWTTAVCHIEGERWLEA 1002

Query: 876  ENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQI-PTV 934
            ++  NL  +++N    T+++R R+E+  E  +GE +NR R     + +P ++   + P  
Sbjct: 1003 DSRGNLVVLQRNVNAPTEQDRRRMEITSEMGIGEQINRIRK----LNVPAAENNIVHPRA 1058

Query: 935  IFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAK-N 993
               +  G + +   +  +    L   Q+ + + +  +G +  + WRSF N+ +  +    
Sbjct: 1059 FLASAEGSLYLYGDIAPQYQDLLMTFQSRMEECVLALGNVEFKLWRSFRNDNRESEGPYR 1118

Query: 994  FLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRLH 1036
            F+DG++IE FLD+   + + +   +  +VE++   +EEL R+H
Sbjct: 1119 FIDGEMIERFLDMGEEQQELVCDGLGPTVEDMRNMIEELRRMH 1161


>gi|378732825|gb|EHY59284.1| DNA damage-binding protein 1 [Exophiala dermatitidis NIH/UT8656]
          Length = 1185

 Score =  402 bits (1032), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 335/1150 (29%), Positives = 554/1150 (48%), Gaps = 138/1150 (12%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPTD 64
            +YG I  LE  RP     D LF+ T+RY +    W+AE  +L T ++  D++D + R + 
Sbjct: 55   VYGYITILEKLRPASSQTDHLFVGTDRYHYFTCSWNAEIKQLRTEQSYVDLADNVLRNSK 114

Query: 65   NGQIGIIDPDCRLIGLHLYDGLFKVIPF---DNK------------GQLKEAFNIRLEEL 109
                  IDP  R + L LYDG+  V+PF    NK            G L E   +R+EEL
Sbjct: 115  EVDRCHIDPTGRYMTLELYDGVISVLPFVQPSNKRVKRETTTNSPVGSLGEPVQVRVEEL 174

Query: 110  QVLDIKFLYGCA----KPTIVVLYQDNKDARHVKTYEVAL---KDKDFVE----GPWSQN 158
                  FL         P + +L++DN +   +K  E+      D+   E         +
Sbjct: 175  LTRSSAFLETDPDTRENPKLAILWEDNLENPQLKLRELKYYPGGDQATAELETVAELHGD 234

Query: 159  NLDNGADLLIPVPPPLCGVLIIGEETIVYCS---ANAF-KAIPIRPSITKAYGRVDADGS 214
             LD G   LIPV  P  G LI+G+  I Y      N   +A+    +I   + ++D    
Sbjct: 235  QLDKGVSHLIPVSAPYGGFLILGDHAIRYVDRGLTNVITQALEDNATIWTCWTKID--DR 292

Query: 215  RYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLI 274
            R+LL D  G L  L+I     +V+  +++ +G  S A+ + YLD   V++GS  GDSQ++
Sbjct: 293  RWLLADDFGRLFFLMIEVSGGEVSSWRLDTVGVASKATCLVYLDEGYVFVGSHSGDSQVV 352

Query: 275  KLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLER-------------QGQGQVVTCSGA 321
             +    + +G  V V++ + N+ PI+DF ++DL R              GQ ++VT SGA
Sbjct: 353  HI----EEEG--VRVVQSFANIAPILDFTIMDLGRGAEAGGQQALEFSSGQARIVTASGA 406

Query: 322  YKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNL 380
            ++DG++R VR+G+G+ E  ++ EL  I  +W L S+        LVVSF++ETR+   + 
Sbjct: 407  WQDGTIRSVRSGVGLEELGTIGELSHITDLWGLSSTGQSDVQDVLVVSFVTETRVFKFDS 466

Query: 381  EDELEET-EIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSP-PGYS 438
            E  +EE  E         TL   +    +L Q+    +R+    S  L  EWK P     
Sbjct: 467  EAAVEEVDEFHHLEFSQPTLLAANLPDRKLAQIYETGLRITDLESNMLTLEWKPPDSAAK 526

Query: 439  VNVATANASQVLLATGGG--HLVYLEIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQ 496
            +  A AN+  +L+  GG   H+ +    +   T  K   L+ +IS + +       S S 
Sbjct: 527  ITAACANSVHLLMVEGGHTLHVFHASNNEAKPTVSKSFPLDSQISSVTVAD-----SQSN 581

Query: 497  IAAVGMWTDISVRIFSLPDLNLITKEHLG--GEIIPRSVLLCAF--EGISYLLCALGDGH 552
            +  V  W   SV I  L  L+ +  + LG  G  IPRSVL+     +    L  A+ DG 
Sbjct: 582  VCIVSFWQTASVAILDLHSLDTLYTQTLGTPGTDIPRSVLVAHVLPDAPPTLFIAMADGT 641

Query: 553  LLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNT-----THVFAASDRPTVIYSSNKK 607
            +L++  ++    L+   ++ LGT+P+  +     N+     ++VFA+ ++P++IYSS  +
Sbjct: 642  VLSYSFDLAKHSLSGMTRILLGTEPVVFKQLPKDNSPDSNLSNVFASCEQPSLIYSSEGR 701

Query: 608  LLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRIC 667
            ++YS VN +  S +C FN  A+P+++A+A   EL +  I   +   ++++ +GE  R + 
Sbjct: 702  IIYSAVNAESASRVCNFNCEAYPEAIAMATPTELKLALIGRERTTQLQTLSIGETVRCLT 761

Query: 668  HQEQSRTFAI-CSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEY-GCSILSCSF- 724
            ++  ++ FA+ C  +      E+ +  V++ D+ +F+ + +  L   E   C + + SF 
Sbjct: 762  YEPTAKMFAMGCIRRIMEGGTEALLSSVKIADEVSFKELDSVELQDRELVECIVSTGSFD 821

Query: 725  -SDDSNVY---YCVGTAYVLPEEN---EPTKGRILVFIV-EDGKLQLIAEKETKGAVYSL 776
              D+S  Y   + VGT+ +L E++   E TKGRILVF V ++ KL+ + + + KGA  SL
Sbjct: 822  TEDESKGYGDMFVVGTS-ILEEDDAGEEVTKGRILVFEVNKEKKLKAVTQIKVKGACRSL 880

Query: 777  NAFNGKLLAAINQKIQLYKWM---LRDDGTRELQSECGHHGHI--LALYV----QTRGDF 827
               +GK++A + + + LY  +    R + + ELQ    +      L+L V    Q     
Sbjct: 881  AMCDGKIVAGLVKTVVLYGLIPSSTRGEHSLELQKLATYRTSTNPLSLAVTPATQDSPAL 940

Query: 828  IVVGDLMKSISLLIY-----KHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLF 882
            I V DLMKS+S+L        + E  + E +R +   W SA  +L  D ++ A+   NL 
Sbjct: 941  IAVADLMKSLSVLQVLPPDSSNSEYRLHETSRHFATLWSSATAVLRHDEWVVADMEGNLI 1000

Query: 883  TVRKNSEGATDE----ERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDS----------DV 928
            T+R+ S  A +      R RLEV GE+ LGE VN+     +V  LP            DV
Sbjct: 1001 TLRQGSLDAENNGAVAARRRLEVTGEFRLGEVVNK-----IVSILPPGSQRQSDQKMVDV 1055

Query: 929  G-------------QIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLN 975
            G               P    GTV G I ++A++       L  LQ++L   ++  G + 
Sbjct: 1056 GLDGGKKPARTGPLVTPRTFLGTVEGAIYMLATINPAYVNVLLLLQSSLATRVQAPGYMP 1115

Query: 976  HEQWRSFNNEKKTVDAK-NFLDGDLIES-FLDLSRTRMDEI-------SKTMNVSVEELC 1026
              ++R++  E    D     +DG+++E   + L    ++++        + +NVSVEE+ 
Sbjct: 1116 WAKYRAWKTEVLEKDEPFRVVDGEMLEQGLVALDDRELEDVLREAGLMEEALNVSVEEVR 1175

Query: 1027 KRVEELTRLH 1036
               +EL RL+
Sbjct: 1176 GWADELRRLY 1185


>gi|326474493|gb|EGD98502.1| UV-damaged DNA binding protein [Trichophyton tonsurans CBS 112818]
 gi|326481558|gb|EGE05568.1| UV-damaged DNA binding protein [Trichophyton equinum CBS 127.97]
          Length = 1147

 Score =  400 bits (1029), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 323/1116 (28%), Positives = 541/1116 (48%), Gaps = 108/1116 (9%)

Query: 6    IYGRIATLELFRPHGEAQ-----DFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRI 59
            IYGRI  L+   P   A      D LF+ T++Y +  + WD +   L T R   D++D  
Sbjct: 55   IYGRITLLKRL-PRSTAAGLALTDALFVGTDQYAYFSITWDPDHELLRTERKYIDLADGS 113

Query: 60   GRPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKG---------------------QL 98
             R   +     IDP    + L +Y+G+  + P                          QL
Sbjct: 114  LREAHSDDRCQIDPSGNFLTLEVYEGVVSIFPLVTTDSHKRAKSAVSASASAASTSIEQL 173

Query: 99   KEAFNIRLEELQVLDIKFLYGCAK--PTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWS 156
             E   +R EEL V    FL   A   P   +LY+D +    +   ++          P S
Sbjct: 174  GEPLQVRTEELMVRSSAFLDQEASNTPRFALLYEDTQGQVRLTLRDLKYTHAIVTSDPGS 233

Query: 157  ----------QNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIRP----SI 202
                       + ++ GA +LIPVP PL G+L+IGE +I Y   +  + I  RP    ++
Sbjct: 234  AAELKYVTTLSDEIELGASILIPVPRPLGGLLVIGESSIKYVDVSKNETIS-RPLAESTV 292

Query: 203  TKAYGRVDADGSRYLLGDHAG-LLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAV 261
              A+ +VD  G R+LL D  G L  L+++      V   K++ LG TS AS + YLD  +
Sbjct: 293  FVAWEQVD--GQRWLLADDYGRLFFLMLVLDTDNAVESWKVDYLGVTSRASVLVYLDGGI 350

Query: 262  VYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQ---------GQ 312
            V++GS  GDSQ+I++      +G + ++++   N+ PI+DF ++D+  +         GQ
Sbjct: 351  VFVGSHQGDSQVIQIK-----EGGF-DLIQTISNIAPILDFTIMDMGDRLGATREFSSGQ 404

Query: 313  GQVVTCSGAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLRSSTDDPFDTFLVVSFIS 371
             ++VT SGA+ DGSLR VR+G+GI E   +  ++ I  +W+LRS+  +PF   L+VSF++
Sbjct: 405  TRIVTGSGAFGDGSLRSVRSGVGIEELGVLASMEHITDLWALRSACPEPFSDTLLVSFVN 464

Query: 372  ETRILAMNLEDELEETE--IEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRN 429
            ETR+   + + ++EE E    G      TL   +   N+++QVT  S R++      +  
Sbjct: 465  ETRVFHFSADGDVEEKEDGFLGLVFSQSTLLATNIPGNRILQVTESSSRVIDIDGGMII- 523

Query: 430  EWKSP-PGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDI--N 486
             W+S    +++  A+AN   ++L  GG  LV + +    L   +  + + ++S + I  +
Sbjct: 524  -WRSTYEEFTITSASANDDYLVLVLGGIRLVCISLSTFELIGSREFEADNQVSGMTIPAS 582

Query: 487  PIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLG--GEIIPRSVLLCAF--EGIS 542
            PI       Q   V +     + I  LP L +  K+ LG  GE IPRSV++         
Sbjct: 583  PI-------QACIVCLPQSAEIVILGLPGLEVKNKQTLGEPGEAIPRSVIVAEILPNNPP 635

Query: 543  YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTH-VFAASDRPTVI 601
             L  ++ DG + +F  ++    + +  K++LG++  + +     N  + VFA  D P++I
Sbjct: 636  TLFVSMADGTVFSFSFDVHAFAIFNSSKITLGSEQPSFKKLPRGNGQYNVFATCDHPSLI 695

Query: 602  YSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGE 661
            ++S  +++YS V+    S +C  N+ A+P S+A++ + EL I  +D+ +   I ++P+  
Sbjct: 696  HASEGRIVYSAVDSASASRICSLNTQAYPGSIALSSQHELKIAIVDEERTTQIHTLPMHA 755

Query: 662  HPRRICHQEQSRTFAICSLKNQ--SCAEESEMHFVRLLDDQTFEFISTYPLDTFEY-GCS 718
              RR+ +    + F + ++K +  +  EE    FV L D+  F  +STY L   E   C 
Sbjct: 756  SVRRLAYSPVEKAFGLGTVKRKISNGVEEVSSSFV-LADEILFRPLSTYDLRPDELVECV 814

Query: 719  ILS-CSFSDD------SNVYYCVGTAYVLPEENEPTKGRILVFIVEDGK-LQLIAEKETK 770
            I S  ++  D      S   + VGTA++    ++  +GRILVF V   + L LI EK   
Sbjct: 815  IRSQLNYGKDEVGNLISKDLFFVGTAFLDDVGDDHIRGRILVFEVNRSRELSLIVEKSLM 874

Query: 771  GAVYSLNAFNGKLL-AAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIV 829
            GA  +L   +  LL A + + + ++K      G   L+    +      + +    + + 
Sbjct: 875  GACRTLAVMDQTLLVAGLVKSVSVFKLARGRFGNILLEKHAAYRTSTAPIDISVVDNTVA 934

Query: 830  VGDLMKSISLLIY-KHEEGA----IEERARDYNANWMSAVEILDDDIYLGAENNFNLFTV 884
            V D+MKS+SL+ Y + EEG      EE AR Y   W +AV  +++++YL A+   NL  +
Sbjct: 935  VADVMKSMSLVQYTQAEEGGQEPKFEEIARHYQTLWSTAVAPIEENVYLLADAEGNLVVL 994

Query: 885  RKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIG 944
            ++N  G T+ +R RL+   E  LGE VNR  H  +V    ++ V      +  TV+G I 
Sbjct: 995  QQNITGVTESDRRRLQPTSEIRLGEMVNRI-HPIVVQTHTETAVSA--RALLATVDGSIY 1051

Query: 945  VIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAK-NFLDGDLIESF 1003
            +   +       L +LQT +  +    G +   ++R+F    +  D    F+DG+LIE F
Sbjct: 1052 LFGLINPNYVDLLLRLQTAMGSIATSPGEIPFSKYRAFRTTVRQSDEPFRFVDGELIERF 1111

Query: 1004 LDLSRTRMDEISKTMN---VSVEELCKRVEELTRLH 1036
            L  +     EI   ++   V+V  L + +EEL R+H
Sbjct: 1112 LSCTLAMQQEIVGRLDDSTVTVSSLKEMIEELRRMH 1147


>gi|226291941|gb|EEH47369.1| DNA damage-binding protein 1a [Paracoccidioides brasiliensis Pb18]
          Length = 1209

 Score =  400 bits (1028), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 323/1126 (28%), Positives = 533/1126 (47%), Gaps = 121/1126 (10%)

Query: 15   LFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPTDNGQIGIIDP 73
            LF P     D LFI T+R  +  + W+  +S+L T R   D++D   R +  G   +IDP
Sbjct: 101  LFLPQ---TDVLFIGTDRATYFTVSWNPVTSQLRTERKYVDLADPSSRESQLGDRCLIDP 157

Query: 74   DCRLIGLHLYDGLFKVIPFDNKG-----------------------------------QL 98
              + I L LY+G+  VIP                                        +L
Sbjct: 158  SGKFITLELYEGIITVIPIGQPQRTARQSGRKYGKRAVTAQNQHDSSHIGNTNAAGEVEL 217

Query: 99   KEAFNIRLEELQVLDIKFLYGCAK--PTIVVLYQDNKDARHVKTYEV------------- 143
             E    R++EL V    FL+  A   P +  LY+D      +K  E+             
Sbjct: 218  GEPCQARVDELLVRSSAFLHTQADMLPRMAFLYEDTMGQVRLKVRELEFTYGGMGIGTGG 277

Query: 144  -ALKDKD----FVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYC--SANAFKAI 196
             A +D      F      +  L+ GA  LIPVP PL G+L++GE +I Y   + N   ++
Sbjct: 278  GAWQDTGCIAVFKALDLLKEELEMGASFLIPVPAPLGGLLVLGETSIRYLDDATNECISL 337

Query: 197  PI-RPSITKAYGRVDADGSRYLLGDHAG-LLHLLVITHEKEKVTGLKIELLGETSIASTI 254
            P+   +I  A+ +VD  G R+LL D  G L  L++I  E   V   K++LLG    AS +
Sbjct: 338  PLDEATIFVAWEQVD--GQRWLLADDYGRLFFLMLILDEDNAVQSWKLDLLGNIPRASVL 395

Query: 255  SYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDL-----ER 309
             YL   V +IGS  GDSQLI++      +GS+ E+++ + N+ PI+DF ++DL     E 
Sbjct: 396  VYLGGGVTFIGSHQGDSQLIRIT-----EGSF-EIIQTFSNIAPILDFTIMDLGGRAGEN 449

Query: 310  Q------GQGQVVTCSGAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLRSSTDDPFD 362
            Q      GQ ++VT SGA+ DGSLR VR+G+G+ E   +  ++ I  +W+LR +  + F 
Sbjct: 450  QTHDFSSGQARIVTGSGAFDDGSLRSVRSGVGMEEVGVLGAMEHITDLWALRVACQEGFS 509

Query: 363  TFLVVSFISETRILAMNLEDELEET-EIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVS 421
              L+VSF+ ETR+     + E+EE  E  G      TL   +    +++QVT  +VR+  
Sbjct: 510  DTLLVSFVDETRVFHFTQDGEVEEKDEFMGLGLAESTLLAANLPNGRILQVTERNVRVAD 569

Query: 422  STSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTEVKHAQLEYEI 480
                 L   W  P   ++  A++N   ++L  GG  L+  +I G+  L   K    + ++
Sbjct: 570  LDDGMLLWNWSPPSQKAITAASSNDDHLVLVVGGQVLMCFDIQGEVKLAGKKDFGDDTQV 629

Query: 481  SCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLG--GEIIPRSVLLCAF 538
            S + +     +P+   I  +    ++ V   +L D+ +     +G  GE  PRSVL+   
Sbjct: 630  SGVTVT---SSPATYCILCLPQTAEVVV--MNLEDMTIRHSTSVGEPGEAFPRSVLVAEV 684

Query: 539  --EGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGT-QPITLRTFSSKNTTHVFAAS 595
                 + L  ++ DG + +F  N     L    K+ LG+ QP   +        ++FA  
Sbjct: 685  LPNQPATLFVSMADGRVFSFSFNADEFTLKKMSKLVLGSEQPSFKKLPRGDGLYNIFATC 744

Query: 596  DRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIR 655
            ++P++IY++  +++YS V+  + S +C FNS A+P S+A+A   EL I  +D  +   I 
Sbjct: 745  EQPSLIYATEGRIIYSAVHSDQASRICHFNSEAYPGSIALATPTELKIAHVDSERTTQIH 804

Query: 656  SIPLGEHPRRICHQEQSRTFAICSLKN--QSCAEESEMHFVRLLDDQTFEFISTYPLDTF 713
            ++ + E  RR+ +    + F I ++K   ++ AE     F+ L D+  F  +  Y L   
Sbjct: 805  TLEIDETVRRVAYSAAEKAFGIGTIKRTLENGAEVITSRFM-LADEIMFRELDDYSLRPD 863

Query: 714  EYGCSILSCSF-----SDDSNVY---YCVGTAYVLPEENEPTKGRILVFIVEDGK-LQLI 764
            E   S++   F     SD +  +   + VGT+Y+        +GRIL F V   + L  +
Sbjct: 864  ELVESVIQAQFPEGKDSDGNESFKDIFVVGTSYLDDVGEGSIRGRILAFEVTGSRQLAKV 923

Query: 765  AEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTR 824
            AE   KGA  +L     K++AA+ + + +Y     +     L     +      + +   
Sbjct: 924  AELPVKGACRALAVVQDKIVAALMKTVVIYSIAKGELSDYTLNKTASYRTSTAPIDIAVT 983

Query: 825  GDFIVVGDLMKSISLLIYKHEEG----AIEERARDYNANWMSAVEILDDDIYLGAENNFN 880
            G+ I V DLMKS+S++ +K  E     ++ E AR +   W +AV  + ++++L ++   N
Sbjct: 984  GNLIAVADLMKSVSIIEFKQGENDQPDSLTEVARHFQTLWSTAVAPIAENMFLESDAEGN 1043

Query: 881  LFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQI---PTVIFG 937
            L  + +N  G TD+++ RLEV  E  LGE VNR R  S+   LP +   +    P    G
Sbjct: 1044 LVVLNRNVNGVTDDDKRRLEVTSEILLGEMVNRIRPVSIQGSLPATGPREAVISPKAFLG 1103

Query: 938  TVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNE-KKTVDAKNFLD 996
            TV G I +   +       L +LQ+ +  ++   G +   ++R+F N  ++  +   F+D
Sbjct: 1104 TVEGSIYLFGLINPAYQDLLMRLQSAMAGLVVTPGAMPFNKFRAFKNAVRQAEEPYRFVD 1163

Query: 997  GDLIESFLDLSRTRMDEI------SKTMNVSVEELCKRVEELTRLH 1036
            G+LIE F        +EI          +V+VE++ + VEEL R+H
Sbjct: 1164 GELIERFWACETALQEEIVGLVVAGGVADVTVEKVKRIVEELRRMH 1209


>gi|402083318|gb|EJT78336.1| hypothetical protein GGTG_03437 [Gaeumannomyces graminis var. tritici
            R3-111a-1]
          Length = 1155

 Score =  400 bits (1028), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 302/1115 (27%), Positives = 545/1115 (48%), Gaps = 98/1115 (8%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPTD 64
            I+G I  L+  +P     + + + T+R+++  + W+ E+ EL T +   D  ++  R   
Sbjct: 55   IHGTIVMLQALQPKDSPTEMILVGTDRFRYFSVLWNQETRELETVQDFLDHGEQFVRDVT 114

Query: 65   NGQIGIIDPDCRLIGLHLYDGLFKVIPFDN-KGQLKEAFNI---RLEELQVLDIKFLYG- 119
                 ++DP  R I + ++ G+  V+     KG+ ++   +   R+ EL + D  FLY  
Sbjct: 115  TRHRCLVDPSGRFIVMLVWSGVINVMCLQRRKGKQRDLLLLDQSRISELYIKDAAFLYSE 174

Query: 120  CAKPTIVVLYQDNKDARHVK--TYEVALKDKDFVEGPWS--------QNNLDNGADLLIP 169
             A P +  LYQ   +    K   Y +   DKD     +          +  D GA +LIP
Sbjct: 175  DAHPRVAFLYQTKPNVPESKLVAYRLCSDDKDLEVSRFDPLKDRIIDIDIPDPGAVMLIP 234

Query: 170  VP---------------PPLCGVLIIGEETIVYC---SANAFKAIPIRPSITKAYGRVDA 211
            V                  L G+L++GE  ++Y    +    +   + P +  A+ + DA
Sbjct: 235  VERVESKRHHFRNPGADARLGGLLVVGETQVLYIDDMTQATVETALMSPMVFVAWAKCDA 294

Query: 212  DGSRYLLGDHAGLLHLLVITHEKEKVT--GLKIELLGETSIASTISYLDNAVVYIGSSYG 269
              S Y+L D  G LHLL +  +++ VT   + + L+G+ S A+ +  LD+  +++GS YG
Sbjct: 295  --SHYILADDYGGLHLLTLDVKEDGVTVKDIIVCLIGKASRANELVCLDSGYLFVGSHYG 352

Query: 270  DSQLIKLNL-QPDAKGSYVEVLERYVNLGPIVDFCVVDLERQ------------GQGQVV 316
            DS+L+ LNL + D     VE ++   ++ PI+DF V+D+  +            GQ ++V
Sbjct: 353  DSELLCLNLVREDTDRPLVETVQTLPSIAPILDFTVMDMGNREADNTVGNEYASGQARIV 412

Query: 317  TCSGAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLRSSTDDPFDTFLVVSFISETRI 375
            T SGA++DGSLR VR+G+ +++   + ++ G++G++ L S+ +      LV SF++ETR+
Sbjct: 413  TGSGAHRDGSLRSVRSGVRLDDIGILADMPGVRGLFPLESNGET---NALVASFLTETRV 469

Query: 376  LAMNLEDELEETE-IEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSP 434
               + E ++EE E  +G     ++L   +   + ++Q+T     ++   +    + W+ P
Sbjct: 470  FTFDAEGDIEEVEAFQGLDLSVRSLLVTNLGDDSILQITDHGASVIGPEAGITISSWEPP 529

Query: 435  PGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDINPIGENPSY 494
             G  +   + N   +LL+  G  LV L I D  ++  + +    +ISCL       +P  
Sbjct: 530  TGRVITGCSTNGKWLLLSIDGKTLVSLRIPDLTVSTQRESSPNDQISCL-----AASPHL 584

Query: 495  SQIAAVGMWTDISVRIFSLPDLNLITKEHL----GGEIIPRSVLLC----AFEGISYLLC 546
              I AVG W+D SV I +L  L++I  E L       I+PR ++L     A      L  
Sbjct: 585  PDIGAVGFWSDGSVSIINLCTLDVIHSETLRRTKDAAIVPRDIVLARVLPAEVAGPTLFV 644

Query: 547  ALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSK-NTTHVFAASDRPTVIYSSN 605
            ++ DG ++ F LN   G+L+ RK V LGT+           +  ++F  S+  ++IY S+
Sbjct: 645  SMDDGEVVTFTLN-DVGKLSARKSVVLGTRQARFHVLPQPGDVCNIFVTSEHSSLIYGSD 703

Query: 606  KKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRR 665
            K+++YS V  +  +++CPF++A FP SLA+A + EL I  ID  ++ H++S+P+GE+ R 
Sbjct: 704  KRIIYSAVTAESATYVCPFDTAVFPGSLAVATDSELKISKIDPQRQSHVQSLPMGENVRS 763

Query: 666  ICHQEQSRTFAI-CSLKNQSCAEESEMHFVRLLDDQTFEFI-STYPLDTFEYGCSILSCS 723
            I +   +R F + C  +  S   E      RL+D+   + + + + L+  E   +++   
Sbjct: 764  IAYSAPTRVFGLGCIRREISKGVEKASSTFRLVDEVVLQPLGNPFELNEGEVVETVIRAQ 823

Query: 724  FSDDSNVY---YCVGTAYVLPEENEP---TKGRILVFIVEDGKLQL-IAEKETKGAVYSL 776
              D        + VGT +++ E   P   +KGR+LVF V++ +    I     K     L
Sbjct: 824  LRDTFGRLAERFIVGTRFLVDENLVPGSNSKGRVLVFGVDEERSPFQIVSHPLKSGCRRL 883

Query: 777  NAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKS 836
                  ++ A+ + + + ++      + +L     +     A+ V   G  I VGD+MKS
Sbjct: 884  AVMEEMIVVALTKTVVVARYEELTSTSGKLIKVASYQTTSYAIDVAVEGRLIAVGDIMKS 943

Query: 837  ISLLIY--------------KHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLF 882
            +SL+ +                +   + E  R Y ++W +AV   + + +L A+ + N+ 
Sbjct: 944  MSLVEFVPPTTVAGDGKAGETKKPAQLIEVCRHYQSSWSTAVAHFEGESWLEADADGNVM 1003

Query: 883  TVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGV 942
             + +N+ G T E+R R+E+  E +LGE +NR +  S V   P++ +   P     T  G 
Sbjct: 1004 VLGRNTTGVTLEDRRRMEITSEINLGENINRIQKIS-VETGPNAPIH--PKAFLSTTEGS 1060

Query: 943  IGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAK-NFLDGDLIE 1001
            I ++ ++  +    L  LQ  L   +  +G +  + +RSF N ++  D    F+DG+ IE
Sbjct: 1061 IYLVGAIAPQMRDLLLNLQDRLEDYVGTLGNIPFKNFRSFRNAEREADGPVRFIDGEYIE 1120

Query: 1002 SFLDLSRTRMDEISKTMNVSVEELCKRVEELTRLH 1036
             FLD++     ++ + +  SVE++   VEEL  +H
Sbjct: 1121 RFLDMNEETQSQVCRDLGPSVEDMRNLVEELKNMH 1155


>gi|154286506|ref|XP_001544048.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
 gi|150407689|gb|EDN03230.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
          Length = 1158

 Score =  400 bits (1027), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 317/1090 (29%), Positives = 523/1090 (47%), Gaps = 92/1090 (8%)

Query: 16   FRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPTDNGQIGIIDPD 74
            FRP     D LFI T+R  +  + WD ++  L T R   D++D   R +  G   +IDP 
Sbjct: 92   FRPQ---TDILFIGTDRASYFTVSWDQKTKNLRTERKYVDLADPSARASQLGDRCLIDPS 148

Query: 75   CRLIGLHLYDGLFKVIP---------------FDNKGQLKEAFNIRLEELQ------VLD 113
             R + L LY+G+  VIP                    +L +    R+EEL        L 
Sbjct: 149  GRFLTLELYEGIVTVIPTLRPHDPQGLGGLGAAAAGQELGDPCQARVEELSDTMGKVRLK 208

Query: 114  IKFLYGCAKPTIVVLYQDNKDARHVKTYEVA-LKDKDFVEGPWSQNNLDNGADLLIPVPP 172
            ++ L        +                VA LK  D +     +  L+ GA  L+PVP 
Sbjct: 209  VRELEVMHGGMRMGGGTGGGGGIGADGGIVAVLKGLDLL-----KEELEMGASFLVPVPA 263

Query: 173  PLCGVLIIGEETIVYC--SANAFKAIPIR-PSITKAYGRVDADGSRYLLGDHAG-LLHLL 228
            PL G+L++GE +I Y   ++N   + P++  +I  A+ +VD  G R+LL D  G L  L+
Sbjct: 264  PLGGLLVLGETSIRYLDDASNECISQPLKEATIFVAWEQVD--GQRWLLADDYGRLFFLM 321

Query: 229  VITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVE 288
            ++      V   K++LLG+   AS + Y+   + +IGS  GD +LI++      +GS+ E
Sbjct: 322  LVLDTDNAVQSWKLDLLGDIPRASVLVYMGGGITFIGSHQGDPELIRIT-----EGSF-E 375

Query: 289  VLERYVNLGPIVDFCVVDL------ERQ------GQGQVVTCSGAYKDGSLRIVRNGIGI 336
            V++ + N+ PI+DF ++DL      E Q      GQ ++VT SGA+ DGSLR VR+G+G+
Sbjct: 376  VIQTFANIAPILDFTIMDLGGRAIGESQTHEFSSGQARIVTGSGAFNDGSLRTVRSGVGM 435

Query: 337  NEQASV-ELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETE-IEGFCS 394
             E   +  ++ I  +W+LR +    F   L+VSF+ ETR+     + E+EE E   G   
Sbjct: 436  EEVGVLGAMKHITDLWALRVACPQEFSDTLLVSFVDETRVFYFTPDGEVEEKEEFMGLGL 495

Query: 395  QTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATG 454
               TL   +  + +++QVT  +VR+       +  EW      ++  A++N   ++L  G
Sbjct: 496  AESTLLAANLPHGRILQVTEWNVRVAELDGGMVIWEWSPEQQKAITAASSNDDHLVLMVG 555

Query: 455  GGHLVYLEI-GDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL 513
            G  L+  +I GD  +T  K    + ++S + +     +P  + I  +    ++ V   SL
Sbjct: 556  GQVLMIFDIRGDINITGEKDFGFDTQVSGVTVT---TSPVRACILCLPQTAEVVV--MSL 610

Query: 514  PDLNLITKEHLG--GEIIPRSVLLCAF--EGISYLLCALGDGHLLNFLLNMKTGELTDRK 569
             DL +     LG  G+ +PRSVL+        + L  ++ DG + +F  N++   LT   
Sbjct: 611  SDLAIRRSTSLGEPGDAVPRSVLVAEVLSNNPATLFVSMADGSVFSFSFNLEDFSLTSMS 670

Query: 570  KVSLGT-QPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAA 628
            K++LG+ QP   +        +VFA  ++P +IY+   +++YS VN    S +C FNS A
Sbjct: 671  KLTLGSEQPSFKKLPRGDGLYNVFATCEQPRLIYAVEGRIVYSAVNSGLASRICHFNSEA 730

Query: 629  FPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKN--QSCA 686
            +P S+A+A   EL I  +D  +   I+++ +GE  RR+ +    R F I ++K   +  A
Sbjct: 731  YPGSIALATPSELKITLVDAERTTQIQTLEVGETVRRVAYSAPERAFGIGTIKRTLEDGA 790

Query: 687  EESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNV--------YYCVGTAY 738
            E     F+ L D+  F  +  Y L+  E   S++   F D  +          + VGT+Y
Sbjct: 791  EVIASRFM-LADEIMFRELDIYDLNKDELVESVIRAQFPDGIDREGNDLFKDLFVVGTSY 849

Query: 739  VLPEENEPTKGRILVF-IVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWM 797
            +        +GRIL F +  + +L  +AE   KGA  +L     K++AA+ + + +Y   
Sbjct: 850  LDDFGEGSIRGRILAFEVTANRQLAKVAEMPVKGACRALAIVQDKIVAALMKTVVVYTIS 909

Query: 798  LRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEG----AIEERA 853
                    L     +      + +   G+ I V DLMKS+S++ Y+        ++ E A
Sbjct: 910  KGQFADYTLSKTASYRTSTAPIDIAVTGNLIAVADLMKSVSIVEYQQGSNGLPDSLTEVA 969

Query: 854  RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
            R +   W +AV  + +D +L ++   NL  + +N  G TD++R RLEV  E  LGE VNR
Sbjct: 970  RHFQTLWSTAVAHVAEDTWLESDAEGNLVMLHRNVNGVTDDDRRRLEVTSEILLGEMVNR 1029

Query: 914  FRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGG 973
             R  + +     ++    P    GTV G I +   +       L +LQ+ +  ++   GG
Sbjct: 1030 IRPVN-IQGSQGAEAAISPRAFLGTVEGSIYLFGIINPTYQDLLMRLQSAMAGMVVTPGG 1088

Query: 974  LNHEQWRSFNNE-KKTVDAKNFLDGDLIESFLDLSRTRMDEI------SKTMNVSVEELC 1026
            +   ++R+F N  ++  +   F+DG+LIE FL  S    +EI           V+VE + 
Sbjct: 1089 MPFNKFRAFRNTIRQAEEPYRFVDGELIERFLSCSVELQEEIVGKVIADGVTGVTVESVK 1148

Query: 1027 KRVEELTRLH 1036
              VEEL R+H
Sbjct: 1149 GLVEELRRMH 1158


>gi|261193401|ref|XP_002623106.1| UV-damaged DNA binding protein [Ajellomyces dermatitidis SLH14081]
 gi|239588711|gb|EEQ71354.1| UV-damaged DNA binding protein [Ajellomyces dermatitidis SLH14081]
          Length = 1168

 Score =  399 bits (1026), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 321/1120 (28%), Positives = 526/1120 (46%), Gaps = 144/1120 (12%)

Query: 16   FRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPTDNGQIGIIDPD 74
            FRP     D LF+ T+R  +  + WD ++  L T R   D++D   R +  G   +IDP 
Sbjct: 94   FRPQ---TDILFVGTDRSTYFTVSWDQKTKNLHTERKYVDLADGSARESQLGDRCLIDPS 150

Query: 75   CRLIGLHLYDGLFKVIP----FDNKG----------------------------QLKEAF 102
             R + L LY+G+  VIP      ++G                            +L E  
Sbjct: 151  GRFLTLELYEGIVTVIPIGQPLRSRGSGRKLGKRAANASASTMQGNQDVGAADLELGEPC 210

Query: 103  NIRLEELQVLDIKFLYGCAK--PTIVVLYQDNKDARHVKTYEV----------------- 143
              R+EEL V    FL+  A   P + +LY+D      +K  E+                 
Sbjct: 211  QARVEELLVRSSTFLHTQADMLPRMALLYEDTMGKVRLKVRELEVTYGGMGIGSGSGGGS 270

Query: 144  ----------ALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYC--SAN 191
                       LK  D +     +  L+ GA  L+PVP PL G+L++GE +I Y   ++N
Sbjct: 271  GVGADGGSAAVLKGLDML-----KEELEMGASFLVPVPAPLGGLLVLGETSIRYLDDASN 325

Query: 192  AFKAIPIR-PSITKAYGRVDADGSRYLLGDHAG-LLHLLVITHEKEKVTGLKIELLGETS 249
               + P+   +I  A+ +VD  G R+LL D  G L  L++I      V   K++ LG   
Sbjct: 326  ECISQPLEEATIFVAWEQVD--GQRWLLADDYGRLFFLMLILDSDNAVQSWKLDRLGNIP 383

Query: 250  IASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDL-- 307
             AS + Y+   V +IGS  GDSQLI++      +GS+ EV++ + N+ PI+DF ++DL  
Sbjct: 384  RASVLVYMGGGVTFIGSHQGDSQLIRIT-----EGSF-EVIQTFANIAPILDFTIMDLGG 437

Query: 308  ----ERQ------GQGQVVTCSGAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLRSS 356
                E Q      GQ ++VT SGA+ DGSLR VR+G+G+ E   +  ++ I  +W+LR +
Sbjct: 438  RAMGESQTHEFSSGQARIVTGSGAFNDGSLRSVRSGVGMEEVGVLGSMEHITDLWALRVA 497

Query: 357  TDDPFDTFLVVSFISETRILAMNLEDELEET-EIEGFCSQTQTLFCHDAIYNQLVQVTSG 415
              + F   L+VSF+ ETR+     +  +EE  E  G      TL   +    +++QVT  
Sbjct: 498  CPEEFSDTLLVSFVDETRVFYFTPDGGVEEKDEFMGLGLTESTLIAANLPNGRILQVTER 557

Query: 416  SVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTEVKHA 474
            +VR+       +  EW      ++  A++N   ++L  GG  L+  +I G+  +   K  
Sbjct: 558  NVRVAELDGGMVMWEWSPVSQKAITAASSNDDHLVLMVGGQVLMIFDIRGEIKVAGEKDF 617

Query: 475  QLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLG--GEIIPRS 532
             ++ ++S + +     +P+ S I  +    ++ V   SL DL +     LG  G+ +PRS
Sbjct: 618  GVDTQVSGVTVT---ASPARSCILCLPQTAEVIV--MSLADLTIRHSTSLGEPGDAVPRS 672

Query: 533  VLLCAF--EGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGT-QPITLRTFSSKNTT 589
            VL+        + L  ++ DG + +F  N     LT   K++LG+ QP   +        
Sbjct: 673  VLVAEVLPNNPATLFVSMADGSVFSFSFNATDFSLTSMSKITLGSEQPSFKKLPRGDGLY 732

Query: 590  HVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDI 649
            +VFA  ++P++IYS+  +++YS VN  + S +C FNS A+P S+A+A   EL I  +D  
Sbjct: 733  NVFATCEQPSLIYSAEGRIVYSAVNSDQASRICHFNSEAYPASIALATPSELKIALVDAE 792

Query: 650  QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKN--QSCAEESEMHFVRLLDDQTFEFIST 707
            +   I+++ +GE  RR+ +    R F I ++K   +  AE     F+ L D+  F  +  
Sbjct: 793  RTTQIQTLEVGETVRRVAYSAPERAFGIGTIKRTLEDGAEVIASRFM-LADEIMFRELDV 851

Query: 708  YPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEK 767
            Y L+  E   S++   F DD +                 ++G  L+       L ++AE 
Sbjct: 852  YDLNKDELVESVIRAQFPDDKD-----------------SEGEELM-----KDLFIVAEM 889

Query: 768  ETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDF 827
              KGA  +L     K++AA+ + + +Y           L     +      + +   G+ 
Sbjct: 890  PVKGACRALAIVQDKIVAALMKTVVVYTLSKGQFADYILTKTASYRTSTAPIDIAVTGNL 949

Query: 828  IVVGDLMKSISLLIYKHE----EGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFT 883
            I V DLMKS+S++ Y+       G++ E AR +   W +AV  +  D +L ++   NL  
Sbjct: 950  IAVADLMKSVSIVEYQQGTDGLSGSLTEVARHFQTLWSTAVAPVAQDTWLESDAEGNLVV 1009

Query: 884  VRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVI 943
            +R+N  G T+++R RLEV  E  LGE VNR R  ++   L  ++    P    GTV G I
Sbjct: 1010 LRRNVNGVTEDDRRRLEVTSEVLLGEMVNRIRPVNIQASL-GTEAAISPRAFLGTVEGSI 1068

Query: 944  GVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNE-KKTVDAKNFLDGDLIES 1002
             +   +       L +LQ+ +  ++   GG+   ++R+F N  ++  +   F+DG+LIE 
Sbjct: 1069 YLFGIINPTYQDLLMRLQSAMAGMVVTPGGMPFNKFRAFRNTVRQAEEPYRFVDGELIER 1128

Query: 1003 FLDLSRTRMDEI------SKTMNVSVEELCKRVEELTRLH 1036
            FL       +EI           V+VE +   VEEL R+H
Sbjct: 1129 FLGCGAELQEEIVGKVIADGVAGVTVERVKAIVEELKRMH 1168


>gi|342318986|gb|EGU10938.1| DNA damage-binding protein 1b [Rhodotorula glutinis ATCC 204091]
          Length = 1102

 Score =  398 bits (1023), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 316/1121 (28%), Positives = 533/1121 (47%), Gaps = 153/1121 (13%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESS-ELITRAMGDVSDRI 59
            +L+VP++  IA+++     G+A   L + T   +   L  D  S   + T A   +++  
Sbjct: 50   LLEVPLFSTIASMQ----SGQATASLVVLTTCLRLFALVHDPTSPYNVKTVASVSIAEPF 105

Query: 60   GRPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPF---DNKGQ------------------- 97
            GR  +   I ++DP  R + +H Y GL +V+PF   D   Q                   
Sbjct: 106  GRLAEYQDI-LVDPANRCLLVHAYAGLVRVVPFFIGDTPAQRGSRTATRLAGSPHGSPLD 164

Query: 98   LKEAFNIRLEELQVLDIKFLYGCA--KPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPW 155
               ++N+RL  L V  +  +   A   P +  ++ ++   R + T+ ++LKDKD  EGP 
Sbjct: 165  FSYSYNVRLPILNVASLAIVASSADDPPILAGIHIEHTGTRTLSTFAISLKDKDLCEGPV 224

Query: 156  SQNNL-DNGADLLIPVPPP-LCGVLIIGEETIVY--------CSANAFKA---------I 196
            +   L D G++L++PV      G+L+ GEET+ +         S+ A K          +
Sbjct: 225  AAFALADPGSELVLPVNEANAPGILVAGEETVTWVSLEERNGASSTASKGKGRAASQMLV 284

Query: 197  PIRPSITKAYGRVDADGSRYLLGDHAG-LLHLLVITHEKEKVTGLKIELLGETSIASTIS 255
              R  + +       +  R LLGD  G L  + +   E   V+G+  + +G+ + AS I 
Sbjct: 285  SCRLPVARITAWSWRNNDRLLLGDIYGKLFEVNLRRAENGVVSGISAQDVGDAASASAIV 344

Query: 256  YLDNAVVYIGSSYGDSQLIKLNLQPD---AKGSYVEVLERYVNLGPIVDFCVVDLERQGQ 312
             L    VY+ S +GDSQ+++     D   A  + + +++   N+ PIVD C+V    Q  
Sbjct: 345  PLGATTVYLASRFGDSQIVRFVSGEDEGMAGPAELALVDSISNIAPIVDLCIVSAHGQAA 404

Query: 313  GQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDPFDTFLVVSFISE 372
               VTCSGAYK GSLR++R G+G++E A++E  G++  WSL S      +  LV+ F SE
Sbjct: 405  NYAVTCSGAYKTGSLRVIRRGVGLSELAALEADGVQQAWSLTSPAPST-EPILVLGFFSE 463

Query: 373  TRILAMNLED-------ELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSR 425
            TR+  M+  D       +++E +++ F S   T+F  + + + LV+VT+ +V   S    
Sbjct: 464  TRVFRMSAGDPAGAEAVQIDEMDLQLFASPGPTIFAGN-VGSMLVRVTAEAVMYASDRGE 522

Query: 426  ELRNEWKS-PPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLD 484
            E    WK+   G     A A    ++LA  GG L  L+  +G  TE  +   E++I+ + 
Sbjct: 523  E---SWKATSAGKITAAAAAGGDHLVLAFDGGFLQLLKAKNGTFTESGNIAFEHDIASVS 579

Query: 485  INPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAF-EGISY 543
            +       S    A VG+WT  +V +  +PDL +   + +    + RS  L  F +G   
Sbjct: 580  L----AETSAGAFAVVGLWTSQAVHLVGIPDLAVYASQKITSTFLIRSAALINFGDGDYT 635

Query: 544  LLCALGDGHLLNFLLNMKTGELTDR--KKVSLGTQPITLRTFSSKNTTHVFAASDRPTVI 601
            L   LGDG L ++ +++    + D   K ++LG +P+ L     ++   + A SDRPTVI
Sbjct: 636  LFVGLGDGTLASYRVDLAAPAVIDSTGKMIALGRRPLLLSEIGPESAKVLLAVSDRPTVI 695

Query: 602  YSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGE 661
              +  +L YS+     VS +   +  A  D +A+     + IG +D +QK+ +R++PL E
Sbjct: 696  SKARDRLNYSS----GVSSVANISHPALGDLVALVSREGVQIGRMDTVQKVDVRTVPLAE 751

Query: 662  -HPRRICHQEQSRTFAI-CSLK--NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGC 717
              PRRI +   SR F + CS +  ++   +++    VRL+ + +FE  S + L+  E   
Sbjct: 752  DEPRRIVYDPVSRKFCVACSRRDIDRHTGQQAVTSVVRLVSEDSFETSSVFELERGE--- 808

Query: 718  SILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAV-YSL 776
                                                      + Q +A  E++G V +S 
Sbjct: 809  ------------------------------------------EAQSLALVESQGTVAWSS 826

Query: 777  NAFNG-----KLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRG--DFIV 829
            + F       K  A ++  + +  W +  +G   L+S     G  +A Y   R   D ++
Sbjct: 827  SPFQKLETCVKFRAPVSAAVAV--WTIDSEGALALKSTW--SGAFVA-YTLARAPDDTLI 881

Query: 830  VGDLMKSISLLIY-KHEEGAIEERARDYNANWMSAVEILDDDI-----YLGAENNFNLFT 883
            VGD ++S+++L Y    +  + E A+DY + +M  VE L  D      Y+GAE + NLF 
Sbjct: 882  VGDALRSLTVLRYTASPQPKLLEVAKDYRSRYMVGVEDLGRDSTGADRYIGAETDLNLFA 941

Query: 884  VRKNSEGATDE--ERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNG 941
            V ++ + A         L+  G +HLGE V RFR GSL   + D++   +P +++ T  G
Sbjct: 942  VSRHPQQAAGNLANAATLQDAGAFHLGELVTRFRTGSLGDLIGDANTTVVPRLVYSTSAG 1001

Query: 942  VIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIE 1001
             IGVIA L       L  L+ N+R+ +KGVGGL  E+  +   +K    +  F+DGD ++
Sbjct: 1002 TIGVIADLDAASSRILSDLERNMREFVKGVGGLEQEELVAGVADKVKTPSTGFIDGDFVQ 1061

Query: 1002 SFLDLSRTRMDEISK------TMNVSVEELCKRVEELTRLH 1036
            SFLDLS+   +++ +       + V   E+ + +EE+ R+H
Sbjct: 1062 SFLDLSKDVQEQVMQGKSEHERLTVDKAEIVRLLEEVARVH 1102


>gi|429850956|gb|ELA26181.1| DNA damage-binding protein 1 [Colletotrichum gloeosporioides Nara
            gc5]
          Length = 1409

 Score =  397 bits (1020), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 321/1127 (28%), Positives = 533/1127 (47%), Gaps = 139/1127 (12%)

Query: 4    VPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRP 62
            V IYG +  L+   P     D LF+ T+R+++   +WD++   L+T + + D  +   R 
Sbjct: 52   VSIYGTVLLLQRLTPKDSKTDLLFVGTDRFQYFTARWDSDLQRLVTEQVIEDSGEPHMRE 111

Query: 63   TDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFN-----IRLEELQVLDIKFL 117
              +    ++DP  R + +HL++G+  V+    +       +     +RL EL +    F+
Sbjct: 112  AQSQDKCLVDPSGRFMAMHLWEGVMNVMRLGARAPHASRLDPLWEQVRLSELSMKASTFV 171

Query: 118  -YGCAKPTIVVLYQ---DNKDAR-----------HVKTYEV-ALKDKDFVEGPWSQNNLD 161
                  PTI  LYQ   D +DAR           + KT    A KD++F         LD
Sbjct: 172  PTETGHPTIAFLYQSKIDKEDARLAIYRLMSDDRNTKTSNFDAEKDREF-----DMEILD 226

Query: 162  NGADLLIPVP------------------PPLCGVLIIGEETIVYC---SANAFKAIPIRP 200
              A +LIPV                     L G++++GE  +VY    +     A    P
Sbjct: 227  PYARILIPVSVEEDQVKRYHKRDTSAAKAQLGGLIVVGETLLVYVDTLTKTKVSAGIDNP 286

Query: 201  SITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNA 260
            +I  AY   D     Y L D  G L LL I  +   VTGL++  LG TS AS + Y+ + 
Sbjct: 287  TIFVAYAEYDV--RNYFLSDDYGNLQLLTIDTDGVVVTGLRVTHLGLTSRASCLVYMGDG 344

Query: 261  VVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLER----------- 309
            ++++GS YGDSQL+K+++   A     ++++   N+ PI+DF ++DL             
Sbjct: 345  MLFLGSHYGDSQLLKVDVT--AGSETTQIVQTIPNIAPILDFAIMDLGNAGDSQVGNAFS 402

Query: 310  QGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVE-LQGIKGMWSLRSSTDDPFDTFLVVS 368
             GQ ++V   G +++GSLR +R+ +G+ +   +E L  ++G++SLRS+  D  DT LVVS
Sbjct: 403  SGQARIVAGCGVHQNGSLRSIRSSVGLEDIGVLEDLSDVRGLFSLRSNGSDKDDT-LVVS 461

Query: 369  FISETRILAMNLEDELEET-EIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSREL 427
            FI++TR+   + E+ +EE  + +G      TL        QL+QVT+ S  L+       
Sbjct: 462  FITDTRVFRFDAEEGIEEVFQFQGLGLDKPTLIASTLPNGQLLQVTANSAALLDPEGGIT 521

Query: 428  RNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-----------GDGILTEVKHAQL 476
             + W +P G  +  A++NA  VLL+  G  LV L +           G G+  E      
Sbjct: 522  LSTWSAPEGKVIVNASSNAKWVLLSIDGTTLVSLSLLENLAAREQVLGSGVGGEGD---- 577

Query: 477  EYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGE----IIPRS 532
              +ISC+      +N     +  VG W   S+ I  L  L+ +  E +        +PR+
Sbjct: 578  --QISCIHAATDLDN-----VGVVGFWATGSISIIDLQTLDAMHGETIKQTDDSVSVPRN 630

Query: 533  VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNT---T 589
            + L                 +++F ++     L+ RK V LG++   L      +T   +
Sbjct: 631  IAL-----------------VVSFDVSKDNYSLSGRKSVILGSKQSGLHILPRPDTDGIS 673

Query: 590  HVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDI 649
            +VFA ++  ++IYSS  +++YS    ++V+++ PF+S AFPD++ +A +  + I  ID  
Sbjct: 674  NVFATTEHSSLIYSSEGRIIYSAATAEDVTYIAPFDSEAFPDAIVLATDKNVRIAHIDVE 733

Query: 650  QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSC-AEESEMHFVRLLDDQTFEFI-ST 707
            ++ H+  +PL +  RR+ +    + F I +++ +    EE      +L+D+     +   
Sbjct: 734  RRTHVNPLPLRQTVRRVAYSPALKAFGIGTIRRELFNNEEMVTSSFQLVDEIVLGVVGKP 793

Query: 708  YPLD---TFEYGCSILSCSFSDDSNV---YYCVGTAYVL-PE--ENEPTKGRILVFIVE- 757
            + LD   T E   S++     D S      + VGT+Y+  PE  EN   KGRILV  V+ 
Sbjct: 794  FHLDGAATTELVESVIRAELPDSSGQPAERFIVGTSYLADPEMDENSEVKGRILVLGVDS 853

Query: 758  DGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHIL 817
            D     I   E KGA  SL     KL+A +++ + +Y +      +  L           
Sbjct: 854  DKNPYQIVSHELKGACRSLAVMGDKLVAGLSKTVVVYDYAEESSTSGSLLKLATFRPSTF 913

Query: 818  ALYVQTRGDFIVVGDLMKSISLLIY----KHEEGAIEERARDYNANWMSAVEILDDDIYL 873
             + +   G+ I V DLM+S++L+ +       +  + ERAR +   W +AV  L+ D+++
Sbjct: 914  PVDLDVNGNMIGVADLMQSMTLIEFIPAQDGNKARLVERARHFQYIWATAVCHLEQDLWI 973

Query: 874  GAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPT 933
             A+   NL  +R+N    T+ ++ ++EV+ E+HLGE +N+ R   +V    D      P 
Sbjct: 974  EADAQGNLMVLRRNPNAPTEHDKKQMEVISEFHLGEQINKIRPLDVVSGENDPIE---PK 1030

Query: 934  VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG-------GLNHEQWRSFNNEK 986
                T+ G I V A +  E    L + Q  L  VIK +G       GL+   WR F N K
Sbjct: 1031 AFLATIEGSIYVFADIKPEYQSLLLQFQERLAGVIKTLGQADEPGAGLSFMSWRGFRNAK 1090

Query: 987  KTVDAK-NFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            ++ D    F+DG+LIE FLDL   R + + + +  +VE +   VEEL
Sbjct: 1091 RSADGPFRFVDGELIERFLDLDAGRQEAVVQGLGPTVERMRDLVEEL 1137


>gi|408393339|gb|EKJ72604.1| hypothetical protein FPSE_07241 [Fusarium pseudograminearum CS3096]
          Length = 1161

 Score =  397 bits (1019), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 297/1118 (26%), Positives = 540/1118 (48%), Gaps = 98/1118 (8%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSEL--ITRAMGDVSDRIGRPT 63
            I+G I+ L+  RP G   D LFI T+R ++  L W+  + +L  I R + D+S+   R +
Sbjct: 55   IHGTISMLQRLRPKGSETDLLFIGTDRLQYFNLIWNDATKQLETIERVIEDLSEPYMRQS 114

Query: 64   DNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNK----GQLKEAFNIRLEELQVLDIKFLYG 119
             +    ++DP  R + +HL++G+  V     +     +L+    +RL EL +    F Y 
Sbjct: 115  QSQNKCLVDPTGRFLAMHLWEGVLNVFKLPTRKGSTNKLEVLDQVRLTELFMKASTFTYS 174

Query: 120  CAK-PTIVVLYQDNKDARHVK--TYEVALKDKDFVEG---PWSQNNLD-----NGADLLI 168
                PTI  LY+   D    +   Y +   DK        P     LD       A +LI
Sbjct: 175  YTNHPTIAFLYKTQMDQEETRLVIYRLTHDDKGNTVSKFDPHKDRELDIEIPDPYASMLI 234

Query: 169  PVP----------------PPLCGVLIIGEETIVYCSANAFKAIPI---RPSITKAYGRV 209
            PVP                P L G++++GE  + Y      +++      P I  A+   
Sbjct: 235  PVPLDEEKRYHVRNNQGAKPHLGGLVVVGETLLTYYDGQTHRSVSAGLKDPRIFVAWAEY 294

Query: 210  DADGSRYLLGDHAGLLHLLVITHEKEK----VTGLKIELL------GETSIASTISYLDN 259
            D+  +RYLL D  G L LL I    E     VTG+ +E          TS AS++ YL +
Sbjct: 295  DS--TRYLLADDYGRLDLLTIRTNLEPTGVVVTGMTLEPFEFDKSPAITSRASSLVYLGD 352

Query: 260  AVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQ--------- 310
              +++ S +GDSQL ++N+   A    V +++ + N  PI+DF ++D+  +         
Sbjct: 353  GNLFVASHHGDSQLYQINIDTKA----VMLVQSFSNNAPILDFSIMDMGNREGDTQSGNV 408

Query: 311  ---GQGQVVTCSGAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLRSSTDDPFDTFLV 366
               GQ ++V   GAY+DG+LR +R+G+G+ ++  + EL+G +G+++L S      DT LV
Sbjct: 409  FSSGQSRIVAGCGAYRDGTLRSIRSGVGLEDRGVLDELEGTRGLFTLHSYGSKMVDT-LV 467

Query: 367  VSFISETRILAMNLEDELEET-EIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSR 425
            VS I+ETRIL+ +LE  +EE    +G    ++TL   +    QL+Q+T  SV L+     
Sbjct: 468  VSSITETRILSFDLEGGIEEIYSFQGMSLDSETLLASNLPSGQLLQITPKSVVLLDPEGG 527

Query: 426  ELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDI 485
               ++W  P G ++  A+AN+   LL+  G  LV L +   +    +    +       I
Sbjct: 528  TTISKWDVPDGKTITRASANSKWALLSVDGTSLVSLNLLQNLAVNAQQINNDSTSQPDQI 587

Query: 486  NPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHL----GGEIIPRSVLLCAFE-- 539
            + I        +   G W+   + +  +  L  +  E +        +PR V L      
Sbjct: 588  SCIHAARDPPDLGVAGWWSSGQISLIDMASLKPLHGESMRQTEDSATVPRDVALVQLHPP 647

Query: 540  GIS--YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKN-TTHVFAASD 596
             IS   LL A+ DG+++ F ++ K   ++ RK V+LG+ P  L     ++ T++VF  ++
Sbjct: 648  NISGPTLLVAMEDGNVVTFNVSTKGFSVSGRKSVTLGSNPARLHILPQEDGTSNVFVTTE 707

Query: 597  RPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRS 656
              ++IYSS  ++++S     + + + PF+S AFPDS+ ++ +  + I  +D  +  H+++
Sbjct: 708  HASLIYSSEGRIIFSATTADDATFVAPFSSHAFPDSVVLSTDQHIRICHVDRERLTHVKA 767

Query: 657  IPLGEHPRRICHQEQSRTFAICSLKNQSC-AEESEMHFVRLLDDQTF-EFISTYPLD--- 711
            +P+ E  RR+ +    + F + S+K +    EE      RL+D+  F E  S + L+   
Sbjct: 768  LPVNETVRRVAYSPGLKAFGLGSIKRELVNNEEVVTSSFRLVDEIVFKELGSPFALNGSS 827

Query: 712  TFEYGCSILSCSFSD---DSNVYYCVGTAYVLP---EENEPTKGRILVFIVE-DGKLQLI 764
            + E    ++     D   +    + VGT+++     E+   T GRILV  V+ + ++  I
Sbjct: 828  SLELVECVIRAELPDVGGNPTERFIVGTSFISDGGVEDPNGTLGRILVLGVDANRQVYQI 887

Query: 765  AEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTR 824
                 KG    L      ++A +++ + +Y +      +  LQ    +    + + +   
Sbjct: 888  VSHNLKGPCRCLGMIEDNIVAGLSKTVVVYSFSQETSSSGSLQKLAAYRPAAIPIDIDVS 947

Query: 825  GDFIVVGDLMKSISLLIY----KHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFN 880
            G+ I VGDLM+S+SL+ +       +  ++ERAR + + W +AV  ++ + +L A++  N
Sbjct: 948  GNMIGVGDLMQSLSLVEFIPAQDGNKAQLQERARHFESLWTTAVCHIEGERWLEADSRGN 1007

Query: 881  LFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQI-PTVIFGTV 939
            L  +++N +  T+++R RLE+  E  +GE +NR R     + +P ++   + P     + 
Sbjct: 1008 LVVLQRNVDAPTEQDRRRLEITSEMGIGEQINRIRK----LNVPAAENNIVHPRAFLASA 1063

Query: 940  NGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAK-NFLDGD 998
             G + +   +  +    L   Q+ + + +  +G +  + WRSF N+ +  +    F+DG+
Sbjct: 1064 EGSLYLYGDVAPQYQDLLMTFQSRMEECVLALGNVEFKLWRSFRNDNRESEGPYRFIDGE 1123

Query: 999  LIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRLH 1036
            +IE FLD+   + + +   +  +VE++   +EEL R+H
Sbjct: 1124 MIERFLDMDEEQQELVCDGLGPTVEDMRNLIEELRRMH 1161


>gi|425777692|gb|EKV15851.1| UV-damaged DNA binding protein, putative [Penicillium digitatum Pd1]
 gi|425779888|gb|EKV17916.1| UV-damaged DNA binding protein, putative [Penicillium digitatum
            PHI26]
          Length = 1140

 Score =  396 bits (1018), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 314/1096 (28%), Positives = 533/1096 (48%), Gaps = 100/1096 (9%)

Query: 18   PHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPTDNGQIGIIDPDCR 76
            P     D +F+ T++Y +  + WD E++++ T R   D+++   R +      +IDP  R
Sbjct: 68   PTNSPTDHIFVGTDQYNYFTITWDRETNDIKTARTCVDIAEPSSRESQCAPRCLIDPTGR 127

Query: 77   LIGLHLYDGLFKV---------------------IPFDNKGQLKEAFNIRLEELQVLDIK 115
             + L +Y+G+  V                      P    G+L +    R++EL V    
Sbjct: 128  FMTLEVYEGVIVVVPIVQPTKKRGRMSMAGSQPDFPL-QVGELDKPTTSRIDELFVRSSA 186

Query: 116  FLYGCAKPTIVVLYQDNKDARHVKTYEV--------ALKDKDF-----VEGPWSQNNLDN 162
            FL+  + P + +LY+DN+    ++  E+         L D  F     +EG      LD 
Sbjct: 187  FLHSESSPWLALLYEDNQQKVRLRIRELDFTPGSSGTLADATFKEVQKLEGGEFGQELDL 246

Query: 163  GADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPI---RPSITKAYGRVDADGSRYLLG 219
            G+  LIP+P PL G++++GE +I Y   NA   I       ++  A+ +VD+   R+LL 
Sbjct: 247  GSSHLIPIPAPLGGLIVLGETSIKYIDDNANDVITRHLEEATVFVAWEKVDS--QRWLLA 304

Query: 220  DHAGLLHLL-VITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNL 278
            D  G L  L  I +   ++   K+E LG+T+ AS + YL   ++++GS +GDSQ+++L+ 
Sbjct: 305  DDYGRLFFLSFILNNLGEIDDWKLESLGKTARASVLVYLGGGMLFVGSHHGDSQVLRLD- 363

Query: 279  QPDAKGSYVEVLERYVNLGPIVDFCVVDL-----ERQ------GQGQVVTCSGAYKDGSL 327
                 G+  EV++   N+ PI+DF ++DL     E Q      GQ ++VT SGA+ DG+L
Sbjct: 364  -----GTSFEVIQTLSNIAPILDFTIMDLGNRTNESQTHEFSSGQARIVTGSGAFDDGTL 418

Query: 328  RIVRNGIGINEQASV-ELQGIKGMWSLRS-STDDPFDTFLVVSFISETRILAMNLEDELE 385
            R +R+G+G+ +   + E++ I  +W L++ ST D  DT L+V+F+ ETR+   + + E+E
Sbjct: 419  RSLRSGVGMEDLGVLGEMEHISDLWGLQTRSTGDYLDT-LIVTFVDETRVFQFSPDGEVE 477

Query: 386  ETE-IEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATA 444
            E +   G      TL        +++QVT     +    S  +  EW       +   + 
Sbjct: 478  ELDNFLGLSLTESTLLATRLQGGRILQVTEQRALVADLESGMVIFEWVPQDQKLITAVST 537

Query: 445  NASQVLLATGGGHLVYLEIGDGI-LTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMW 503
            N   ++L   G  +   +I D + + ++K    + +IS L +      P+   IA  G  
Sbjct: 538  NEDHLVLVISGQIVASFDIRDNVQIIKLKDLGADQQISGLTVP---STPTGVFIA--GFP 592

Query: 504  TDISVRIFSLPDLNLITKEHLG--GEIIPRSVLLCAFEGISY--LLCALGDGHLLNFLLN 559
                V I  + D  ++  + LG  GE  PRSVL+      S   L  ++ DG ++ F L+
Sbjct: 593  QSAKVSILDIKDFAVLQTKSLGPPGESFPRSVLVAEVLANSQPTLFISMADGCVITFSLD 652

Query: 560  MKTGELTDRKKVSLGT-QPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEV 618
             +   L++  K+ LG+ QPI  +        +VFA  + P++IY S  +++YS VN    
Sbjct: 653  SRDCSLSEMNKLILGSEQPIFKKLPKGDGLYNVFATCENPSLIYGSEGRIIYSAVNSDGA 712

Query: 619  SHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAIC 678
            S +C  N+ A+PDS+A+A E EL I  +D  +   I+++P+G   RR+ +    + F I 
Sbjct: 713  SRVCHLNAEAYPDSIAVATEKELKIALVDRERTTQIQTLPMGSTVRRVAYSPSEKAFGIG 772

Query: 679  SL--KNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSF--SDDSNV---- 730
            ++  K ++ AE  + HFV L D+  F  +    L   E   S++        D N     
Sbjct: 773  TIDRKLENGAEVVKSHFV-LADEIMFRRLDALELGPDELVESVVRAELPAGKDENGKEIV 831

Query: 731  --YYCVGTAYVLPEENEPTKGRILVFIVEDG-KLQLIAEKETKGAVYSLNAFNGKLLAAI 787
               + VGTA+   ++ E  +GRIL+  V+ G KL  +AE    GA  +L      ++AA+
Sbjct: 832  KDRFVVGTAFADEDQEESIRGRILILEVDHGRKLSQVAELPVMGACRALAMMGDCIVAAL 891

Query: 788  NQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKH--- 844
               + +Y+  + + G  +L+    +      + V    + I V DLMKS+ L+ Y     
Sbjct: 892  ---VVVYRVKINNVGPMKLEKLAAYRTSTAPVDVTVVDNLIAVADLMKSLCLIRYTPGHT 948

Query: 845  -EEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVG 903
             E   + E  R Y   W +A+  + D+ +L ++   NL  + +N+ G T +++ RL    
Sbjct: 949  GEPAKLTEVGRHYQTVWSTAIACVGDETFLQSDAEGNLIVLSRNTNGVTAQDKHRLIPTS 1008

Query: 904  EYHLGEFVNRFRHGSLVMRLPD-SDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQT 962
            E  LGE VNR R     + +P    V   P     TV G I + A +  E   FL  LQ 
Sbjct: 1009 EISLGEMVNRIRP----VHIPQLCSVMVTPRAFMATVEGSIFLFAVINPEHQDFLMTLQA 1064

Query: 963  NLRKVIKGVGGLNHEQWRSFNNEKKTVDAK-NFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
             L + +  +G L+ +++R F    ++  A   F+DG+LIE FL  + +  +EI++ +  S
Sbjct: 1065 ALSQKLNSLGNLSFDKFRGFRTMVRSAAAPYRFVDGELIEQFLKCTPSMQEEIAQEVGSS 1124

Query: 1022 -VEELCKRVEELTRLH 1036
             V E+ + +E L RLH
Sbjct: 1125 DVGEVKRLIEALRRLH 1140


>gi|402595041|gb|EJW88967.1| hypothetical protein WUBG_00126 [Wuchereria bancrofti]
          Length = 621

 Score =  395 bits (1016), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 236/579 (40%), Positives = 355/579 (61%), Gaps = 29/579 (5%)

Query: 3   DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRP 62
           + P++GRIAT++LFR  GE  D L I T +Y   +++W   +SEL TRA G + DR+GRP
Sbjct: 54  ECPVFGRIATIKLFRAPGEDVDSLLILTAKYHLAIIRW-TPTSELRTRASGHIVDRVGRP 112

Query: 63  TDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGCAK 122
           ++ G I  +     ++   LYDGL K++ + N+G+    FN+R ++L ++DI F+    +
Sbjct: 113 SETGMIATVHSSGLMV-FRLYDGLLKIVQW-NEGKDLRGFNVRCDDLYIVDITFMSDPDR 170

Query: 123 PTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGE 182
           PT+  +YQD+ + RH+K   + + DK+     W  +NL+  A+++I VP P+ G LI G 
Sbjct: 171 PTLAYIYQDD-NGRHIKVVTLNIDDKELSSPLWKHDNLEGEANIVISVPEPVGGCLIAGP 229

Query: 183 ETIVYCS----ANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE 235
           + I Y      A  +  +P   +  +    Y  VD DG RYLL D AG L++L++   K+
Sbjct: 230 DAISYHKGGDDALRYAGVPGSRLHNTHPNCYAPVDRDGQRYLLADLAGNLYMLLLELGKD 289

Query: 236 K---------VTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGS- 285
           +         V  +K+E LGET IA  + YLDN V +IGS +GDSQLI+L+ +P A G+ 
Sbjct: 290 QEQDENSAVIVRDMKVESLGETCIAECMCYLDNGVCFIGSRFGDSQLIRLSTEPRADGTG 349

Query: 286 YVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQ 345
           Y+ +L+ Y NL PI D  V  +   GQ Q++TCSGAYKDG++RI+RNGIGI E ASVEL+
Sbjct: 350 YISLLDSYTNLAPIRDMTV--MRCNGQQQILTCSGAYKDGTIRIIRNGIGIEELASVELK 407

Query: 346 GIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAI 405
           GIK M++LR+  D+ FD +L++SF SET +L +N E ELE+TEI GF     TL+     
Sbjct: 408 GIKNMFTLRTRGDE-FDDYLILSFDSETHVLFINGE-ELEDTEITGFAVDGATLWAGCLF 465

Query: 406 YNQ-LVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG 464
           +++ ++QVT G V L+   + ++   WKSP   ++      ++  L+   G  L+YLE  
Sbjct: 466 HSKTILQVTHGEVILIDGDNIQV---WKSPKWITLLNYDERSTGQLVVACGALLIYLEAN 522

Query: 465 DGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHL 524
                 +   + E+EISC+DI PIG+    S+I AVG WTD+SV + +LP L  + +E +
Sbjct: 523 SAGFKVITQIECEFEISCIDITPIGKGTLRSEICAVGYWTDLSVALRALPQLVEVVREKI 582

Query: 525 GGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTG 563
            G+++ RS++L   EG  YLL ALGDG +  F ++MKTG
Sbjct: 583 AGDMLSRSIMLSPMEGHVYLLVALGDGTVHYFQIDMKTG 621


>gi|258572939|ref|XP_002540651.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
 gi|237900917|gb|EEP75318.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
          Length = 1144

 Score =  395 bits (1016), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 314/1113 (28%), Positives = 555/1113 (49%), Gaps = 105/1113 (9%)

Query: 6    IYGRIATLE-LFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPT 63
            +YG+I+ L+ L RP+    D LF+ T+R  +  L W++ + +L T +   D++D   R +
Sbjct: 55   VYGKISVLQKLPRPNPSETDLLFVGTDRDAYFTLSWNSATGQLHTEQKYVDMADPSLRDS 114

Query: 64   DNGQIGIIDPDCRLIGLHLYDGLFKVIPF------------------DNKGQLKEAFNIR 105
             +G    +DP  + + L +Y+G+  VIP                     +  L E    R
Sbjct: 115  QSGDRSWVDPSGKFLTLEIYEGIITVIPIAQEPLKRSSLSGPSLGPPKERAHLGEPVQAR 174

Query: 106  LEELQVLDIKFLYGCAK--PTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWS------- 156
            +EEL +    FL+  +   P I +LY+       +K  ++ +  +  V G  S       
Sbjct: 175  IEELAIRSTAFLHQDSSRVPRIAILYESTDGRVKLKLRDL-IYTRGVVNGEASVAEFHNV 233

Query: 157  ---QNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS--ANAFKAIPIR-PSITKAYGRVD 210
                +NL+ GA++L+PVP PL G+LI+GE+ I Y    +N    +P+   ++  A+ ++D
Sbjct: 234  DELYDNLELGAEILVPVPLPLGGILILGEKCIKYVDTISNETITLPLEYNTVFVAWEQLD 293

Query: 211  ADGSRYLLGDHAG-LLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYG 269
                R+LL D  G L  L+++      V   K++LLGETS AS + +L   VV++GS  G
Sbjct: 294  --NQRWLLADDYGRLFFLMLVLDSANAVRTWKVDLLGETSRASVLVHLGGGVVFLGSHQG 351

Query: 270  DSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQG----------QGQVVTCS 319
            DS +I++      +GS+ E+++   N+ PI+DF V+DL  +G          Q ++VT S
Sbjct: 352  DSHVIRIT-----EGSF-EIIQTLSNIAPILDFTVMDLGHRGDTLTHEFSSGQARIVTGS 405

Query: 320  GAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAM 378
            GA+ DGSLR VR+G+G+ +   +  ++ I  +W L +   +     L+++F++E+R+   
Sbjct: 406  GAFHDGSLRSVRSGVGMEDLGVLGAMEHITDLWGLSAFCAEENCDTLLLAFVNESRVFHF 465

Query: 379  NLEDELEET-EIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGY 437
            + + E+EE  E  G      TL   +    +++QVT  + R+    S  +   W SP G+
Sbjct: 466  SPDGEVEEKDEFLGLLLGETTLHASNLAGCRILQVTERTARITDVESELVIWHW-SPSGH 524

Query: 438  S-VNVATANASQVLLATGGGHLVYLEIGDGI-LTEVKHAQLEYEISCLDINPIGENPSYS 495
              +  A  N   ++L  GG   V  +I   I ++  K  +   ++S + +     +P+ +
Sbjct: 525  QKITAAAVNEQYLVLMIGGQEAVIFDIASDIQVSGPKTFKANKQVSGVTLT---SSPAQA 581

Query: 496  QIAAVGMWTDISVRIFSLPDLNLITKEHLG--GEIIPRSVLLCAF--EGISYLLCALGDG 551
             I       +IS  I +L DL +   E LG  G+ +PRSVL+          L  ++ DG
Sbjct: 582  CIFCFPQSAEIS--IVNLTDLTIRHTETLGEPGDAVPRSVLVANMIPSKPPSLFVSMADG 639

Query: 552  HLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTT-HVFAASDRPTVIYSSNKKLLY 610
             + +F LN +   L++  K+ LG++    +     +   +VFA  D P++IY+S  +++Y
Sbjct: 640  SVFSFSLNAEDYSLSNANKLVLGSEAPVFKLLPRGDGLFNVFATCDHPSLIYASEDRIVY 699

Query: 611  SNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQE 670
            S VN  + + +C FN+ A+P ++A+A   E+ I  +D  +   I+++ +    RRI +  
Sbjct: 700  SAVNSDKATRICHFNAEAYPGAIAVATPDEIKIALVDAERTTQIQTLMINGTVRRIAYSA 759

Query: 671  QSRTFAICSLKNQ--SCAEESEMHFVRLLDDQTFEFISTYPLDTFEY-GCSILS--CSFS 725
              R F + +++      AEE + +F+ L D+  F  +S + L++ E   C I +   +F+
Sbjct: 760  AERAFGLGTVRRSLVQNAEEVKSYFI-LSDEIMFRQLSVFDLNSNELVECVIRTEHPAFN 818

Query: 726  DDSNV-----YYCVGTAYVLPEE--NEPTKGRILVFIVE-DGKLQLIAEKETKGAVYSLN 777
               N       + VGT+ + P E     TKGRIL+F V  + +L++++E   +GA  +L 
Sbjct: 819  GQQNNGRPKDIFIVGTSVLDPAEAPESQTKGRILIFDVGVNRELRMVSEFPVRGACRALA 878

Query: 778  AFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSI 837
              NGK++AA+ + + +      +  + ++  E  +      + +    + IVV DLMKSI
Sbjct: 879  MVNGKIVAALMKSVVILSMKKGNSYSIDIGKESSYRTSTAPVDLSVTDNIIVVADLMKSI 938

Query: 838  SLLIYKHEEG----AIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATD 893
            SLL Y+  E     +++E AR Y   W +    + ++ +L ++   NL  + +N+ G T+
Sbjct: 939  SLLEYQAGEAGQPDSLKEVARHYQTLWTTTAAPIAENAFLVSDAEGNLVVLNRNTTGVTE 998

Query: 894  EERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQ-IPTVIFGTVNGVI---GVIASL 949
            +++ R+++  E  LG  VNR R     M L  S     IP     T +G I   GVIA  
Sbjct: 999  DDKRRMQITSELRLGTMVNRIRR----MDLQASQSSPVIPKAFLATTDGSIYLFGVIAQF 1054

Query: 950  PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAK-NFLDGDLIESFLDLSR 1008
              +    L +LQ+ L   +   GG+    +R+F +  +  D    F+DG+L+E FLD   
Sbjct: 1055 AQD---LLMRLQSALASFVASPGGIPFSGYRAFKSATRQADEPFRFVDGELVEQFLDCPL 1111

Query: 1009 TRMDEISKTM-----NVSVEELCKRVEELTRLH 1036
               + +   M     +V++ +L   VE L R+H
Sbjct: 1112 EVQEAVLAKMDGGGRDVTLSQLKDIVERLKRMH 1144


>gi|346970653|gb|EGY14105.1| hypothetical protein VDAG_00787 [Verticillium dahliae VdLs.17]
          Length = 1160

 Score =  387 bits (993), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 308/1131 (27%), Positives = 533/1131 (47%), Gaps = 124/1131 (10%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPTD 64
            + G I  L+  RP     D LF+ T+ +++   +WD    +L+T + + D+++   R   
Sbjct: 54   VCGTITLLQTLRPKASETDILFVGTDLFQYFTARWDPAQGKLVTEQVIQDIAEPHMREAQ 113

Query: 65   NGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQL-----KEAFNIRLEELQVLDIKFL-Y 118
            +    ++DP  R + +HL++G+  ++    +        K+   +RL EL +    F+  
Sbjct: 114  SQDKCLVDPAGRFLAMHLWEGVMNIMQLGTRKGANFRLDKDWAQVRLSELFMKASTFVPT 173

Query: 119  GCAKPTIVVLYQ---DNKDARHVKTYEVALKDKDFVE--GPWSQNNL-----DNGADLLI 168
                PTI  LYQ   D +DA H+  Y +       V    P     L     D  A  LI
Sbjct: 174  ETGHPTIAFLYQSSIDREDA-HLAVYRLMEDGNTNVSKFDPLKNRELELEIPDPFARTLI 232

Query: 169  PVP------------------PPLCGVLIIGEETIVYC----SANAFKAIPIRPSITKAY 206
            PV                     L G+++ GE  ++Y          KA+   P I  ++
Sbjct: 233  PVSIVESDVKRYHRRDTTNASAQLGGLIVAGETMLIYVDTLTKVKISKALD-EPRIFVSW 291

Query: 207  GRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGS 266
             + D   +RYLL D  G LHLL +  +   VTGL ++ +G+TS AS + Y+ N ++++GS
Sbjct: 292  AKYDV--TRYLLADDYGNLHLLTLEVDGVIVTGLSLKTIGKTSRASCLVYMGNEILFLGS 349

Query: 267  SYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLER-----------QGQGQV 315
             +GDSQL  L+L        + +++   N+ PI+DF ++DL              GQ ++
Sbjct: 350  HHGDSQLFTLDL----CAHTIRLIQTIPNIAPILDFSIMDLGNAGDSGVGNAFSSGQARI 405

Query: 316  VTCSGAYKDGSLRIVRNGIGINEQASVE-LQGIKGMWSLRSSTDDPFDTFLVVSFISETR 374
            V   G + +GSLR +R+ +G+ +   ++ +Q ++G++SL+S   +  DT LVVSF++ETR
Sbjct: 406  VAGCGVHHNGSLRSIRSSVGLEDIGILDDIQDVRGLFSLKSYGSEKVDT-LVVSFLTETR 464

Query: 375  ILAMNLEDELEE-TEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKS 433
            +   + +  +EE T  +G      TLF         +Q+T+ S  L  S S    N W S
Sbjct: 465  VFKFDADGGVEELTAFQGLTLDQPTLFAGSLANGHTLQITASSALLQDSESGVTINSWTS 524

Query: 434  PPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLE-------YEISCLDIN 486
            P G S+  A+ N    LL+ GG  LV L +      +V+   L         +ISC    
Sbjct: 525  PNGGSIVNASVNEKYALLSVGGSTLVSLNLTQNF--DVREQALGNGIGGDGSQISC---- 578

Query: 487  PIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEII---------PRSVLLCA 537
             I  +  +  I A+G W+  SV +     +NL T + L GE I         PR +    
Sbjct: 579  -IHASRDFPDIGAIGFWSTGSVSV-----INLGTLQALHGESIKQNDDNVSVPRDLAFVQ 632

Query: 538  FE----GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTF---SSKNTTH 590
                      L  +L DG +++  ++     LT RK V+LG++   L       +   ++
Sbjct: 633  LHPPQLAGPTLFVSLEDGQVVSINVSKDDYSLTARKSVTLGSREAGLHVLPRPGAPGQSN 692

Query: 591  VFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQ 650
            VFA ++  ++IYSS  +++YS    ++V+++ PF+S  FPD++ +A +  + I  +D  +
Sbjct: 693  VFATTEHSSLIYSSEGRIIYSAATAEDVTYIAPFDSEGFPDAIFLATDKNVRIANVDTER 752

Query: 651  KLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPL 710
            + H+  + +GE  RR+ +    + F I +++ +   +E  +     L D+        P 
Sbjct: 753  RTHVNPLHIGETVRRVAYSPALKAFGIGTIRKELLHDEEVVSSAFQLVDEIVLGKVGRPF 812

Query: 711  -----DTFEYGCSILSCSFSDDSNV---YYCVGTAYVL-PEENEP--TKGRILVFIVE-D 758
                  + E   +++     D +      + +GT+Y+  P+ NE    KGRILV  V+ D
Sbjct: 813  ALGGEASVELVEAVIRAELDDSTGQPAERFIIGTSYLADPDVNESGDVKGRILVLGVDSD 872

Query: 759  GKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILA 818
                LI   E KGA   L     KL+A +++ + +Y ++     + +L+           
Sbjct: 873  RNPYLIVSHELKGACRCLGVMGDKLVAGLSKTVVVYDYIEDSTTSGKLEKLTTFRPSTFP 932

Query: 819  LYVQTRGDFIVVGDLMKSISLLIY----KHEEGAIEERARDYNANWMSAVEILDDDIYLG 874
            + +   G+ I V DLM+S++L+ +       +  + ERAR +   W +AV  L ++ +L 
Sbjct: 933  VDLDISGNMIGVADLMQSMTLVEFLPAKDGRKAKLIERARHFEYIWATAVCSLGEESWLE 992

Query: 875  AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTV 934
            A+   NL  + +  +  T+ ++ ++    E HLGE VN+ R   L +   ++D+  +P  
Sbjct: 993  ADAQGNLMILERQPDAPTEHDQKQMRTTSEMHLGEQVNKIR--PLQITATENDI-IVPKA 1049

Query: 935  IFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG--------GLNHEQWRSFNNEK 986
               TV+G + V+A++  E    L   Q  L  +++ +G        G +  QWR F N K
Sbjct: 1050 FLATVDGSLYVLANISAEYQSILLPFQERLAGIVRYLGQAAPEDNEGPSFSQWRGFRNAK 1109

Query: 987  KTVDAK-NFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRLH 1036
            +   A   F+DG+LIE FLDL   R + + + +  SVE +   VEEL R+H
Sbjct: 1110 RMAAAPFRFVDGELIERFLDLDELRQEAVVEGLGPSVEAMRNMVEELRRMH 1160


>gi|393212467|gb|EJC97967.1| hypothetical protein FOMMEDRAFT_162310 [Fomitiporia mediterranea
            MF3/22]
          Length = 1161

 Score =  382 bits (981), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 308/1085 (28%), Positives = 518/1085 (47%), Gaps = 103/1085 (9%)

Query: 6    IYGRIATLELFRPHGE-AQDFLFIATERY--KFCVLQW----DAESSELITRAMGDVSDR 58
            I+GRI +L+      E +Q  L + T+    K  +L++          L T     + +R
Sbjct: 56   IWGRITSLQAVPTDDENSQHHLLVLTDHPDPKLFLLEYVQSNAGSGPSLKTLKTVSLHER 115

Query: 59   IGRPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLY 118
              RPT+     ++D   ++     Y G  +V+  + KG +   F+  + EL ++ + FL 
Sbjct: 116  NARPTEYVSKCLVDHKGKVAVACSYTGKLRVLELE-KGLINSEFDTAVRELNIVSLCFLR 174

Query: 119  GC-AKPT-IVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQ----NNLDNGADLLIPVPP 172
               A+ T + +LY+D      V +++++L D +    P S     +  D  ++LLIPVPP
Sbjct: 175  TTNARATALAILYKDYMQKLSVTSHDLSLADLELSPSPSSFLPEFHVPDEDSNLLIPVPP 234

Query: 173  PL-------CGVLIIGEETIVYCSAN--------------AFKAIPIRP------SITKA 205
             +        GVL++G  TI + S +              +   IP          IT A
Sbjct: 235  QIKSSWNVNGGVLVLGGSTIAFYSIDRKQKKKNSSSQSKSSTSKIPQAEVNWPYFDIT-A 293

Query: 206  YGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIG 265
            + ++D DG RYLLGD  G L LL I     +   L I LLGE S  ++++ L +  +Y+G
Sbjct: 294  WAQIDEDGLRYLLGDSFGRLALLAI---NPQYAYLDIVLLGEVSPPTSLTPLASQYIYVG 350

Query: 266  SSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDG 325
            S +GDSQLI++  +  + GSY+E+ + + N+ PI+D    D +  GQ  ++TCSG    G
Sbjct: 351  SHFGDSQLIRVTSERSSNGSYLEISDTFKNIAPIMDAVFEDTDDSGQPTIITCSGGESTG 410

Query: 326  SLRIVRNGIGINEQASVE-LQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAM-NLEDE 383
            SLR++RNG   NE A +E +  I GMW +R   DD F  +++V+  + T +L + N + E
Sbjct: 411  SLRVIRNGANFNEDARIEGIANITGMWPIRRQYDDTFHHYMLVTTDTNTHLLELPNSQQE 470

Query: 384  LEETEIEGFCSQT---QTLFCHDAIYNQLVQ--------------VTSGSVRLVSSTSRE 426
               +    F   T   +TL   + +   + +              V+  SV L+++ +  
Sbjct: 471  TAVSRSNDFSDLTIDSRTLVAGNMLTRLMSESGKSEYVSSSYVVQVSRDSVILLNTRTGL 530

Query: 427  LRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQL------EYEI 480
              ++W   PG  + +A  + SQ+ +A  GG +V L +    + E    Q         EI
Sbjct: 531  REDQWSPGPGNKIVLADISPSQICVAISGGTVVLLNLFADKINEQSRKQFYSPDGSSSEI 590

Query: 481  SCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAF-- 538
            S L I+P+    ++S   A+G W+   V++  LP    I    +    +PRS+LLC F  
Sbjct: 591  SALSISPMKRGANFSSFVALGFWSSHEVKVLRLPTFEQIDAVPVVMPHLPRSLLLCDFSE 650

Query: 539  ---EGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAAS 595
               +   YL+  L +G +++     K G L ++K   LG  P++L      +   VFA+ 
Sbjct: 651  EESKPHRYLMVGLANGTVVSMPFAEK-GVLGEKKFFGLGGAPVSLSRCEVNDKPAVFASG 709

Query: 596  DRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIR 655
             R  + Y S   L +S V +K+V+     +S +F  S+A++    L +G + ++ KLHIR
Sbjct: 710  ARSALFYRSKDTLSHSPVLIKDVTFASAIHSESFTSSMALSTLDGLVVGRVLELDKLHIR 769

Query: 656  SIPLG-EHPRRICHQEQSRTFAICSLK------NQSCAEESEMHFVRLLDDQTFEFISTY 708
            ++ +G E+P ++ +   ++   +  LK        +    S    V  +D +T   IS  
Sbjct: 770  TVFMGKENPTKLAYHSAAKLLGVGCLKLDRSTFKGAPVAASSFKLVDTVDFRTTHEISLE 829

Query: 709  PLDTFEYGCSI-LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQ---LI 764
            P +       + L C    D+  ++ VG+ Y    E EP++GRIL+      + Q   ++
Sbjct: 830  PNEEITAVALVSLDCGIGWDT--FFAVGSVYFDETEREPSRGRILIISTGSKRNQTPHIL 887

Query: 765  AEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTR 824
            A  E KGAV +L    GKL+ AIN  + +++    D+      +   H+  ++   V   
Sbjct: 888  ASTEVKGAVNALTCIQGKLVVAINTSVDVFRLKHGDNTVLTAVTSWNHNYLVITAVVM-- 945

Query: 825  GDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTV 884
             D IV+GD + S+++L  K E+  +   ARDY+  W   +   D+   +GA N+ NLF+ 
Sbjct: 946  DDLIVIGDAVSSLAVL--KLEDDKLTTFARDYSPLWPLCIGAFDNKTVIGANNDNNLFSY 1003

Query: 885  RKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIG 944
            R   +G+    +  L+  G Y++ E VN+   G      P+    Q   ++F T  G I 
Sbjct: 1004 RLQKQGS----KMLLDQDGLYNIEEIVNKCVPGCF-DSTPNKFGLQTKQLLF-TSTGRIA 1057

Query: 945  VIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAK----NFLDGDLI 1000
            V+ ++  +    L  L+ N+  VIKG G L+H  WR+ +N +   DA      FLDGDL+
Sbjct: 1058 VVVNVEDDLSRLLSSLERNMAGVIKGPGDLDHTLWRAPSNSRGRSDANGAATGFLDGDLL 1117

Query: 1001 ESFLD 1005
            E FLD
Sbjct: 1118 ERFLD 1122


>gi|400600376|gb|EJP68050.1| CPSF A subunit region [Beauveria bassiana ARSEF 2860]
          Length = 1174

 Score =  382 bits (980), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 304/1129 (26%), Positives = 545/1129 (48%), Gaps = 111/1129 (9%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELI--TRAMGDVSDRIGRPT 63
            ++G I  L+  RP     D LF+ TER ++  + W+ +S+E++     + D+++   R +
Sbjct: 59   VHGTICMLQRLRPRDATTDLLFVGTERLQYFNVSWNPKSNEMVLTNETIHDMAEPYMRES 118

Query: 64   DNGQIGIIDPDCRLIGLHLYDGLFKVIPFDN----KGQLKEAFNIRLEELQVLDIKFLYG 119
             +    ++DP  + + +HL++G+  +    N      +LK+   IRL EL + D  F++ 
Sbjct: 119  QSQNKCLVDPTAKYMVMHLWEGVLNIFRLPNTPRASKKLKKMEQIRLTELWMKDSAFIHS 178

Query: 120  -CAKPTIVVLYQ---DNKDARHVKTYEVALKDK--DFVE-GPWSQNNLDN-----GADLL 167
                P I  LY+   D ++AR V  Y +   DK  D  +  P     LD       A +L
Sbjct: 179  RTGHPRIAFLYKTQLDQEEAR-VAVYRLTTDDKGTDVSKFDPHKDRELDQVIADPYASML 237

Query: 168  IPVP----------------PPLCGVLIIGEETIVYCSANAFKAIPI---RPSITKAYGR 208
            IPVP                  L G+L++GE  + Y  +  + ++      P I  A+  
Sbjct: 238  IPVPVAEEKRYHVRNTEGTRAHLGGLLVVGETLLTYFDSLTYSSVSSALDEPKIYVAWAM 297

Query: 209  VDADGSRYLLGDHAGLLHLLVI-THEKEK---VTGLKIELLGET------SIASTISYLD 258
             D   + Y L D  G L L+ + T+ + K   VTG+ +E L  +      S AS + Y+ 
Sbjct: 298  YD--DTHYFLADDYGRLDLMTLKTYTEPKGVIVTGMTVEPLKISDSRELISRASNLVYMG 355

Query: 259  NAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQ-------- 310
            + ++++ S +G+SQL++++L  D++  Y+  ++   N GPI+DF ++D+  +        
Sbjct: 356  DGMLFLASHHGNSQLLRIDL--DSRTLYI--IKSLSNNGPILDFAIMDMGNRDGDNHQGN 411

Query: 311  ----GQGQVVTCSGAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLRSSTDDPFDTFL 365
                GQ ++V   GAY+DGSLR +R+G+G+ +   + EL+  +G++ ++S      D  L
Sbjct: 412  VFSSGQTRIVAGCGAYRDGSLRSIRSGVGLEDCVILDELENTRGLFMIKSRGSRLVDV-L 470

Query: 366  VVSFISETRILAMNLEDELEET-EIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTS 424
            V SF++ETRI  M LEDE+EE  +  G   + +TL         L+QVT   V +     
Sbjct: 471  VASFVTETRIFLM-LEDEVEELYDWCGLNLEVETLIAATLPGYDLLQVTRTDVAIFHFDD 529

Query: 425  RELRNEWKSPPGY------SVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEY 478
            R   + W     +       +  A+ N   VLL   G  +V L I   +   V+  Q   
Sbjct: 530  RPEIDSWVKTKSWVNESEGEITSASYNDKWVLLCIDGTTIVSLNIEKNLEACVQRYQGTD 589

Query: 479  EISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEI----IPRSVL 534
                  ++ +  +P       VG W    + I  L  LN    E+L   +    +PR + 
Sbjct: 590  PSQSDQVSCLHASPDVMDFGVVGWWNSGKITIVDLATLNAKHGENLQQTMDAASVPRDIA 649

Query: 535  LCAFE--GIS--YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTF-----SS 585
            L       IS   LL AL DG + +F ++++  EL+ +K V+LG+ P  L        +S
Sbjct: 650  LVQLHPPDISGPTLLVALEDGTVTSFDVSVENLELSGKKTVTLGSSPARLHILPEPDDTS 709

Query: 586  KNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGT 645
              T+ +FA ++  ++IYSS  +L+YS       + + PF+S  FP  + ++ +  + I  
Sbjct: 710  DGTSSIFATTEHSSLIYSSEGRLVYSATTADNATCVVPFDSYGFPHCILVSTDKNVRICR 769

Query: 646  IDDIQKLHIRSIPLGEHPRRICHQEQSRTFAI-CSLKNQSCAEESEMHFVRLLDDQTFEF 704
            +D  +  H++S+P+ E  RR+ +   ++ FA+ C  K+     E     V+L+D+  F+ 
Sbjct: 770  VDKERLTHVKSLPVHETVRRVAYAPGAKAFALGCIKKDLIQNAEVITSSVKLVDEIMFQE 829

Query: 705  IST-YPL---DTFEYGCSILSCSFSDDSNVY---YCVGTAYV----LPEENEPTKGRILV 753
            + T  PL    T E   S++     D +      + VGT++V    + E  E T+GRILV
Sbjct: 830  LGTPLPLAASSTLEMVESVIRAELPDPTGALVERFVVGTSFVNDAEVGEAGE-TRGRILV 888

Query: 754  FIVEDGK-LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGH 812
              V++ + L  I     KGA   L   +  ++A + + + +Y +   +     LQ    +
Sbjct: 889  LGVDEKRQLYTIVSHNLKGACRCLGILDEYIVACLAKTVVVYSYTEENSTEGYLQKLAAY 948

Query: 813  HGHILALYVQTRGDFIVVGDLMKSISLLIYKH----EEGAIEERARDYNANWMSAVEILD 868
                  + +   G+ I V D+M+S+SL+ +      E G +EE+AR + + W ++V  L 
Sbjct: 949  RPASFPVALDISGNMIGVADIMQSLSLVEFTPPKDGEPGKLEEKARHFQSAWATSVCHLG 1008

Query: 869  DDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV 928
             + +L  +   N+  + +N +  T+ +R RLE+  E +LGE +N+ +  ++    P  +V
Sbjct: 1009 GERWLETDAQGNIIVLARNPDAPTEHDRSRLEITSEMNLGEQINKIQRLNVA---PADNV 1065

Query: 929  GQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKT 988
               P     ++ G + +   +  +    L  LQT + K +K  GG++ + WR+F N+ + 
Sbjct: 1066 VVSPKAFLASIEGTLYLYGDIAPKYQDLLITLQTTIEKYVKTTGGISFDAWRAFRNQARE 1125

Query: 989  VDAK-NFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRLH 1036
             D    F+DG+++E FLDL +     + + + ++VE++   +EEL R+H
Sbjct: 1126 ADGPFRFVDGEMVERFLDLRKQTQAALCQDLGLNVEDVRNMIEELRRMH 1174


>gi|302655264|ref|XP_003019424.1| hypothetical protein TRV_06556 [Trichophyton verrucosum HKI 0517]
 gi|291183145|gb|EFE38779.1| hypothetical protein TRV_06556 [Trichophyton verrucosum HKI 0517]
          Length = 1027

 Score =  377 bits (968), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 291/993 (29%), Positives = 482/993 (48%), Gaps = 103/993 (10%)

Query: 6    IYGRIATLELFRPHGEA-----QDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRI 59
            IYGR+  L+   P   A      D LF+ T++Y +  + WD    +L T R   D++D  
Sbjct: 55   IYGRVTLLKKL-PRSTAGGLALTDALFVGTDQYAYFSITWDPVHEQLRTERKYIDLADGS 113

Query: 60   GRPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKG---------------------QL 98
             R   +     IDP    + L +Y+G+  + P                          QL
Sbjct: 114  LREAHSDDRCQIDPSGSFLTLEVYEGVVSIFPLVTADSHKRAKSAVSASASSASTSVEQL 173

Query: 99   KEAFNIRLEELQVLDIKFLYGCAK--PTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWS 156
             E   +R+EEL V    FL   A   P   +LY+D +    +K  ++          P S
Sbjct: 174  GEPLQVRIEELMVRSSAFLDQEASNTPRFALLYEDTQGKVKLKLRDLKYTHAIITGDPGS 233

Query: 157  QNNL----------DNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIRP----SI 202
               L          D GA +LIPVP PL G+LIIGE +I Y   +  + I  RP    ++
Sbjct: 234  AAELKDVTTLSDEVDLGASILIPVPRPLGGLLIIGESSIKYVDVSRNETIS-RPLAESTV 292

Query: 203  TKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEK-VTGLKIELLGETSIASTISYLDNAV 261
              A+ +VD  G R+LL D  G L  L++  + E  V   K++ LG TS AS + YLD  +
Sbjct: 293  FVAWEQVD--GQRWLLADDYGRLFFLMLVLDAENAVDTWKVDFLGVTSRASVLVYLDGGI 350

Query: 262  VYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQ---------GQ 312
            V++GS  GDSQ+I++      +G + ++++   N+ PI+DF V+D+  +         GQ
Sbjct: 351  VFVGSHQGDSQVIQIK-----EGGF-DLVQTISNIAPILDFTVMDMGDRSGATREFSSGQ 404

Query: 313  GQVVTCSGAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLRSSTDDPFDTFLVVSFIS 371
             ++VT SGA+ DGSLR VR+G+GI E   +  ++ I  +W+LRS+  +PF   L+VSF++
Sbjct: 405  TRIVTGSGAFGDGSLRSVRSGVGIEELGVLASMEHITDLWTLRSACPEPFLDTLLVSFVN 464

Query: 372  ETRILAMNLEDELEETE--IEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRN 429
            ETR+   + + ++EE E    G      TL   +   N+++QVT  + R +   S  +  
Sbjct: 465  ETRVFHFSADGDVEEKEDGFLGLVFSQSTLLATNIPGNRIIQVTESTSRAIDVDSGMII- 523

Query: 430  EWKSP-PGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDINPI 488
             W+S    +++  A+AN   ++L  GG  LV + +    L   +  + + ++S + I   
Sbjct: 524  -WRSSYEEFTITSASANDDYLVLVLGGIRLVCMSLSTFELVGSRDFEADNQVSGMTIPA- 581

Query: 489  GENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLG--GEIIPRSVLLCAF--EGISYL 544
                S +Q   V +     + I  LP+L +  K+ LG  GE IPRSV++          L
Sbjct: 582  ----SPTQACIVCLPQSAEIIILDLPELEVKNKQALGEPGEAIPRSVIVAEILPNKPPTL 637

Query: 545  LCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTH-VFAASDRPTVIYS 603
              ++ DG + +F  ++    L +  K++LG++  + +     N  + VFA  D P++I++
Sbjct: 638  FVSMADGTVFSFSFDVHAFSLFNSSKITLGSEQPSFKELPRGNGQYNVFATCDHPSLIHA 697

Query: 604  SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHP 663
            S  +++YS V+    S +C  N+ A+P S+A++ + EL I  +D+ +   I ++P+    
Sbjct: 698  SEGRIVYSAVDSASASRICSLNTQAYPGSIALSSQHELKIAIVDEERTTQIHTLPMHASV 757

Query: 664  RRICHQEQSRTFAICSLKNQ--SCAEESEMHFVRLLDDQTFEFISTYPL----------- 710
            RR+ +    + F + ++K +  +  EE    FV L D+  F  +STY L           
Sbjct: 758  RRLAYSPMEKAFGLGTVKRKISNGVEEVSSSFV-LADEMLFRPLSTYDLRPDELVECVIR 816

Query: 711  DTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGK-LQLIAEKET 769
                YG   +  S S D    + VGTA++    ++  +GRIL+F V   + L LI EK  
Sbjct: 817  SQLNYGKDEVGNSISKD---LFFVGTAFLDDVGDDHIRGRILIFEVNRSRELSLIVEKSL 873

Query: 770  KGAVYSLNAFNGKLL-AAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFI 828
             GA  +L   +  LL A + + + ++K      G   L+    +      + +   GD +
Sbjct: 874  MGACRTLAVMDHTLLVAGLVKSVSVFKLARDRFGNILLEKHTAYRTSTAPIDISVVGDTV 933

Query: 829  VVGDLMKSISLLIYKH-----EEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFT 883
             V D+MKS+SL+ Y       +E   EE AR Y   W +AV  +++++YL A+   NL  
Sbjct: 934  AVADVMKSMSLVQYTQPEEGEQEPKFEEVARHYQTLWSTAVAPIEENVYLLADAEGNLVV 993

Query: 884  VRKNSEGATDEERGRLEVVGEYHLGEFVNRFRH 916
            +++N  G T+ +R RL+   E  LGE +    H
Sbjct: 994  LQQNITGVTESDRKRLQPTSEIRLGEMLFLHEH 1026


>gi|440487047|gb|ELQ66855.1| DNA damage-binding protein 1a [Magnaporthe oryzae P131]
          Length = 1213

 Score =  376 bits (965), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 321/1142 (28%), Positives = 528/1142 (46%), Gaps = 125/1142 (10%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITR-AMGDVSDRIGRPTD 64
            ++G+I  L+  RP     D LF+ T+R+K+   ++D ++ EL+TR A+ D+ ++  R   
Sbjct: 86   VFGKIVMLQAVRPKDSETDMLFVGTDRFKYFTAEYDPDTRELVTRQAISDLGEQFVREVS 145

Query: 65   NGQIGIIDPDCRLIGLHLYDGLFKVIPF-DNKGQLKEAFNIRLE--------ELQVLDIK 115
            +    I+DP  R + L L+ G+  V      KGQ K     RLE        EL + D  
Sbjct: 146  SRNRCIVDPSGRYMVLLLWSGIMHVWRLHKRKGQQKGQLQTRLELMDQARISELYIKDAV 205

Query: 116  FLYG-CAKPTIVVLYQDNKDARHVK--TYEVALKDKDFVEGPWSQ-------NNLDNGAD 165
            FL+   A P I  LYQ   +    K  +Y +   D+D     + Q       N  D G+ 
Sbjct: 206  FLHSETAHPRIAFLYQPRPNEPDCKFASYRLCTDDRDRELSRFEQKDREFTFNVPDPGST 265

Query: 166  LLIPVP------------PP-----LCGVLIIGEETIVYCSANAFKAIPI---RPSITKA 205
            ++IPV             P      L GV+++GE  ++Y    ++           +  A
Sbjct: 266  MMIPVERVETERRHNFRNPARDECHLGGVIVVGESRMLYIDDQSWTWTETALKNAMVFVA 325

Query: 206  YGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTG---LKIELLGETSIASTISYLD-NAV 261
            + + D   + YLL D  G LHLL I  ++   T    +    +G TS A+ + Y + N  
Sbjct: 326  WAKFD--NTHYLLADDYGGLHLLTIQVKQNSDTAVDHMSTVQIGTTSRATKLVYSETNRT 383

Query: 262  VYIGSSYGDSQLIKLNLQPDA-KG-SYVEVLERYVNLGPIVDFCVVDLERQ--------- 310
            +++ S YGDSQ   +NL  DA KG S++E+ +   N+ PI+DF V+D+  +         
Sbjct: 384  LFVASHYGDSQFYDVNLFADAAKGESFLELRQTIENIAPILDFAVMDMGNREGDSQLGNE 443

Query: 311  ---GQGQVVTCSGAYKDGSLRIVRNGIGINEQASV--ELQGIKGMWSLRSSTDDPFDTFL 365
               GQ ++VT SGA KDGSLR VR+G+G+ +   +  E+ G+ G++SL+S   D  DT L
Sbjct: 444  YSSGQARIVTASGAQKDGSLRSVRSGVGLEDIGVITDEISGVTGLFSLKSYGSDVEDT-L 502

Query: 366  VVSFISETRILAMNLEDELEE-TEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTS 424
            VVSF++ETR+   + + E+EE ++++G      TL         ++ VT     L  +  
Sbjct: 503  VVSFLTETRVFRFDKQGEVEELSQLQGLDISQPTLLVLGLDNGHVLYVTEEKATLFDAEG 562

Query: 425  RELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTEVKHAQLEYEISCL 483
                + W    G  +  A++N   VLL+  G  LV L IG D  ++     + E +ISC 
Sbjct: 563  GVTISSWSPTSGKPITHASSNGRWVLLSVDGRKLVSLNIGLDLKVSAESEERDEDQISC- 621

Query: 484  DINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHL----GGEIIPRSVLLC--- 536
                +  +P    + AVG W+  ++ I  L  L     E L       ++ R V+L    
Sbjct: 622  ----VNASPHLLDVGAVGFWSSGTISIIDLKTLEATQTEKLRRNEDDAVVAREVVLARVL 677

Query: 537  -AFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKN-TTHVFAA 594
             A      L  +  DG ++ F+ N   G L+ RK V LGT+    R     N    +F  
Sbjct: 678  PAEVANPTLFVSKDDGEVMTFVYN-DNGTLSSRKSVVLGTREARFRVLPQPNGLCSIFVT 736

Query: 595  SDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEG------ELTIGTIDD 648
             +  ++I+ + ++++YS V     +++CPF++AAF D LA+A E       EL I  ID 
Sbjct: 737  CEHSSLIHGAERRIVYSAVTAHSAAYVCPFDTAAFRDCLAVATESAIDRRMELKISRIDR 796

Query: 649  IQKLHIRSIPLGEHPRRICHQEQSRTFAI-CSLKNQSCAEESEMHFVRLLDDQTFEFI-S 706
             ++  + + P+GE+ R I +    + F + C  +  S   E      +L D+  FE   +
Sbjct: 797  QRQCQMMTRPMGENVRSIAYSSADKVFGLGCIRRVLSRGIEKVYGTFKLFDEVIFEPKGN 856

Query: 707  TYPLDTFEYGCSILSCSFSD---DSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQ- 762
             + L+  E    +      D   +    + VGT Y L        GR+LVF V++ +   
Sbjct: 857  VFALEDGEVPECVTRAPLLDSYGEQAERFIVGTRY-LSGTGSGHGGRVLVFGVDESRSPY 915

Query: 763  LIAEKETKGAVYSLNAFNGKLLA-AINQKIQLYKWMLRDDGTRELQSECGHHGHILALYV 821
            LI    TK     +   +  LL  A+ + + L ++      + +            A+ V
Sbjct: 916  LIHAHSTKSGCRRIATMDDDLLVIALTKTVVLVRYSETSTTSAKFLKVAAFQTSSYAVDV 975

Query: 822  QTRGDFIVVGDLMKSISLLIY--------------------KHEEGAIE----ERARDYN 857
               G  I V D+MKSI+LL Y                    K  EG+ +    E  RDY 
Sbjct: 976  TVHGKLIAVADIMKSITLLEYIPGVGKSAKTGGKDKATRSDKEVEGSKQAKLVEVCRDYQ 1035

Query: 858  ANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHG 917
            A W +AV  L+ D ++ A+ + NL  + +N+ G T E++ R+++  E+ LGE VN+ +  
Sbjct: 1036 AMWSTAVSHLEGDSWIVADGDGNLVVLLRNTAGVTLEDKRRMQMTSEFGLGECVNKIQK- 1094

Query: 918  SLVMRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRK-VIKGVGGLN 975
              VM    ++   +      T  G I +  ++ P  Q L ++  Q N+   V   +G L 
Sbjct: 1095 --VMVETSANAPIVAKAFLSTTEGSIYLFGTVAPKFQSLLMD-FQANMEAHVSSPLGELQ 1151

Query: 976  HEQWRSFNN-EKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTR 1034
              QWRSF N E++    + FLDG+ +E FLD+      +I + ++ + E++   + E+  
Sbjct: 1152 FNQWRSFRNPEREGAGPERFLDGEFLEMFLDMEENTQIDICQGLSYTAEDMRNLIGEMKN 1211

Query: 1035 LH 1036
            +H
Sbjct: 1212 MH 1213


>gi|389629928|ref|XP_003712617.1| hypothetical protein MGG_16867 [Magnaporthe oryzae 70-15]
 gi|351644949|gb|EHA52810.1| hypothetical protein MGG_16867 [Magnaporthe oryzae 70-15]
 gi|440464739|gb|ELQ34110.1| DNA damage-binding protein 1a [Magnaporthe oryzae Y34]
          Length = 1183

 Score =  375 bits (964), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 319/1141 (27%), Positives = 525/1141 (46%), Gaps = 123/1141 (10%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITR-AMGDVSDRIGRPTD 64
            ++G+I  L+  RP     D LF+ T+R+K+   ++D ++ EL+TR A+ D+ ++  R   
Sbjct: 56   VFGKIVMLQAVRPKDSETDMLFVGTDRFKYFTAEYDPDTRELVTRQAISDLGEQFVREVS 115

Query: 65   NGQIGIIDPDCRLIGLHLYDGLFKVIPF-DNKGQLKEAFNIRLE--------ELQVLDIK 115
            +    I+DP  R + L L+ G+  V      KGQ K     RLE        EL + D  
Sbjct: 116  SRNRCIVDPSGRYMVLLLWSGIMHVWRLHKRKGQQKGQLQTRLELMDQARISELYIKDAV 175

Query: 116  FLYG-CAKPTIVVLYQDNKDARHVK--TYEVALKDKDFVEGPWSQ-------NNLDNGAD 165
            FL+   A P I  LYQ   +    K  +Y +   D+D     + Q       N  D G+ 
Sbjct: 176  FLHSETAHPRIAFLYQPRPNEPDCKFASYRLCTDDRDRELSRFEQKDREFTFNVPDPGST 235

Query: 166  LLIPVP------------PP-----LCGVLIIGEETIVYCSANAFKAIPI---RPSITKA 205
            ++IPV             P      L GV+++GE  ++Y    ++           +  A
Sbjct: 236  MMIPVERVETERRHNFRNPARDECHLGGVIVVGESRMLYIDDQSWTWTETALKNAMVFVA 295

Query: 206  YGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTG---LKIELLGETSIASTISYLD-NAV 261
            + + D   + YLL D  G LHLL I  ++   T    +    +G TS A+ + Y + N  
Sbjct: 296  WAKFD--NTHYLLADDYGGLHLLTIQVKQNSDTAVDHMSTVQIGTTSRATKLVYSETNRT 353

Query: 262  VYIGSSYGDSQLIKLNLQPDA-KG-SYVEVLERYVNLGPIVDFCVVDLERQ--------- 310
            +++ S YGDSQ   +NL  DA KG S++E+ +   N+ PI+DF V+D+  +         
Sbjct: 354  LFVASHYGDSQFYDVNLFADAAKGESFLELRQTIENIAPILDFAVMDMGNREGDSQLGNE 413

Query: 311  ---GQGQVVTCSGAYKDGSLRIVRNGIGINEQASV--ELQGIKGMWSLRSSTDDPFDTFL 365
               GQ ++VT SGA KDGSLR VR+G+G+ +   +  E+ G+ G++SL+S   D  DT L
Sbjct: 414  YSSGQARIVTASGAQKDGSLRSVRSGVGLEDIGVITDEISGVTGLFSLKSYGSDVEDT-L 472

Query: 366  VVSFISETRILAMNLEDELEE-TEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTS 424
            VVSF++ETR+   + + E+EE ++++G      TL         ++ VT     L  +  
Sbjct: 473  VVSFLTETRVFRFDKQGEVEELSQLQGLDISQPTLLVLGLDNGHVLYVTEEKATLFDAEG 532

Query: 425  RELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTEVKHAQLEYEISCL 483
                + W    G  +  A++N   VLL+  G  LV L IG D  ++     + E +ISC 
Sbjct: 533  GVTISSWSPTSGKPITHASSNGRWVLLSVDGRKLVSLNIGLDLKVSAESEERDEDQISC- 591

Query: 484  DINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHL----GGEIIPRSVLLC--- 536
                +  +P    + AVG W+  ++ I  L  L     E L       ++ R V+L    
Sbjct: 592  ----VNASPHLLDVGAVGFWSSGTISIIDLKTLEATQTEKLRRNEDDAVVAREVVLARVL 647

Query: 537  -AFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKN-TTHVFAA 594
             A      L  +  DG ++ F+ N   G L+ RK V LGT+    R     N    +F  
Sbjct: 648  PAEVANPTLFVSKDDGEVMTFVYN-DNGTLSSRKSVVLGTREARFRVLPQPNGLCSIFVT 706

Query: 595  SDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEG------ELTIGTIDD 648
             +  ++I+ + ++++YS V     +++CPF++AAF D LA+A E       EL I  ID 
Sbjct: 707  CEHSSLIHGAERRIVYSAVTAHSAAYVCPFDTAAFRDCLAVATESAIDRRMELKISRIDR 766

Query: 649  IQKLHIRSIPLGEHPRRICHQEQSRTFAI-CSLKNQSCAEESEMHFVRLLDDQTFEFI-S 706
             ++  + + P+GE+ R I +    + F + C  +  S   E      +L D+  FE   +
Sbjct: 767  QRQCQMMTRPMGENVRSIAYSSADKVFGLGCIRRVLSRGIEKVYGTFKLFDEVIFEPKGN 826

Query: 707  TYPLDTFEYGCSILSCSFSD---DSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQ- 762
             + L+  E    +      D   +    + VGT Y L        GR+LVF V++ +   
Sbjct: 827  VFALEDGEVPECVTRAPLLDSYGEQAERFIVGTRY-LSGTGSGHGGRVLVFGVDESRSPY 885

Query: 763  LIAEKETKGAVYSLNAFNGKLLA-AINQKIQLYKWMLRDDGTRELQSECGHHGHILALYV 821
            LI    TK     +   +  LL  A+ + + L ++      + +            A+ V
Sbjct: 886  LIHAHSTKSGCRRIATMDDDLLVIALTKTVVLVRYSETSTTSAKFLKVAAFQTSSYAVDV 945

Query: 822  QTRGDFIVVGDLMKSISLLIY--------------------KHEEGAIE----ERARDYN 857
               G  I V D+MKSI+LL Y                    K  EG+ +    E  RDY 
Sbjct: 946  TVHGKLIAVADIMKSITLLEYIPGVGKSAKTGGKDKATRSDKEVEGSKQAKLVEVCRDYQ 1005

Query: 858  ANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHG 917
            A W +AV  L+ D ++ A+ + NL  + +N+ G T E++ R+++  E+ LGE VN+ +  
Sbjct: 1006 AMWSTAVSHLEGDSWIVADGDGNLVVLLRNTAGVTLEDKRRMQMTSEFGLGECVNKIQK- 1064

Query: 918  SLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRK-VIKGVGGLNH 976
              VM    ++   +      T  G I +  ++  +    L   Q N+   V   +G L  
Sbjct: 1065 --VMVETSANAPIVAKAFLSTTEGSIYLFGTVAPKFQSLLMDFQANMEAHVSSPLGELQF 1122

Query: 977  EQWRSFNN-EKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRL 1035
             QWRSF N E++    + FLDG+ +E FLD+      +I + ++ + E++   + E+  +
Sbjct: 1123 NQWRSFRNPEREGAGPERFLDGEFLEMFLDMEENTQIDICQGLSYTAEDMRNLIGEMKNM 1182

Query: 1036 H 1036
            H
Sbjct: 1183 H 1183


>gi|194377326|dbj|BAG57611.1| unnamed protein product [Homo sapiens]
          Length = 451

 Score =  375 bits (962), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 191/364 (52%), Positives = 247/364 (67%), Gaps = 15/364 (4%)

Query: 684  SCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEE 743
            S  EE E+H + ++D  TFE +  +     EY  S++SC    D N Y+ VGTA V PEE
Sbjct: 92   SFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFIVGTAMVYPEE 151

Query: 744  NEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGT 803
             EP +GRI+VF   DGKLQ +AEKE KGAVYS+  FNGKLLA+IN  ++LY+W       
Sbjct: 152  AEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRLYEWTTE---- 207

Query: 804  RELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSA 863
            +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK  EG  EE ARD+N NWMSA
Sbjct: 208  KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIARDFNPNWMSA 267

Query: 864  VEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMR- 922
            VEILDDD +LGAEN FNLF  +K+S   TDEER  L+ VG +HLGEFVN F HGSLVM+ 
Sbjct: 268  VEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVFCHGSLVMQN 327

Query: 923  LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSF 982
            L ++      +V+FGTVNG+IG++ SL    Y  L  +Q  L KVIK VG + H  WRSF
Sbjct: 328  LGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSF 387

Query: 983  NNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN----------VSVEELCKRVEEL 1032
            + E+KT  A  F+DGDLIESFLD+SR +M E+   +            + ++L K VEEL
Sbjct: 388  HTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATADDLIKVVEEL 447

Query: 1033 TRLH 1036
            TR+H
Sbjct: 448  TRIH 451


>gi|322706594|gb|EFY98174.1| DNA damage-binding protein 1 [Metarhizium anisopliae ARSEF 23]
          Length = 1121

 Score =  373 bits (958), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 307/1137 (27%), Positives = 552/1137 (48%), Gaps = 130/1137 (11%)

Query: 13   LELFRPHGEAQDFLFIATERYKFCVLQWDAESSEL--ITRAMGDVSDRIGRPTDNGQIGI 70
            L+  +P     D LFI T+R+++  + W+ E+++L  + + + DV+++  R + +    +
Sbjct: 2    LQRLKPKDSETDLLFIGTDRFQYFNVAWNPETNQLDTVEQTIEDVAEQYMRHSQSQNKCL 61

Query: 71   IDPDCRLIGLHLYDGLFKV--IPFDNKG---QLKEAFNIRLEELQVLDIKFLYG-CAKPT 124
            +DP  + + +HL++G+  V  +P   KG   +L+    +RL EL +    FL+     P 
Sbjct: 62   VDPTGKFMAMHLWEGVLNVFRLPM-RKGMTTKLEALDQVRLTELWMKASTFLHSQTGHPK 120

Query: 125  IVVLYQDNKDARHVKTYEVALKDKDFVEG------PWSQNNLDN-----GADLLIPVP-- 171
            I  LY+   D    +     L  KD   G      P  +  LD       A +LIPVP  
Sbjct: 121  IAFLYKTQTDQEEARIAVYRLT-KDDSRGNVARFDPHRERELDQIISDPYASMLIPVPFR 179

Query: 172  --------------PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAY-GRVDADGSRY 216
                            L G+L++GE  I Y  +  +  +       K Y    + D   Y
Sbjct: 180  EEKRYHVRHNEGAKAHLGGLLVVGETLITYFDSLTYSRVSSTLQDPKIYVAWTEYDDVHY 239

Query: 217  LLGDHAGLLHLLVITHEKEK----VTGLKIELL------GETSIASTISYLDNAVVYIGS 266
            LL D  G L +L I    E     VTG+ I  +      G TS AS++ Y+ N ++++ S
Sbjct: 240  LLADDYGRLDILTIETATESTGIVVTGMAISPMRFPDSSGCTSRASSLVYMGNDMLFLAS 299

Query: 267  SYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQ------------GQGQ 314
             +GDSQL+++++  DA+   + V +   N  PI+DF ++D+  +            GQ +
Sbjct: 300  HHGDSQLLRIDI--DAQ--VMVVAKALSNNAPILDFAIMDMGNREGDSQFGNAFSSGQAR 355

Query: 315  VVTCSGAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLRSSTDDPFDTFLVVSFISET 373
            +V   GAY DGSLR +R+G+G+ +Q  + E+Q  KG+++LRS      DT LVVS +++T
Sbjct: 356  IVAGCGAYHDGSLRSIRSGVGLEDQGILDEIQDTKGLFTLRSHQSSHVDT-LVVSSVADT 414

Query: 374  RILAMNLEDELEET-EIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWK 432
            R+L  +    +EE    +G     +TL   +    QL+QVT  S  L+ S S      W 
Sbjct: 415  RVLRFDSAGGIEEVYAFQGLTLDMETLLAVNISDGQLLQVTPKSAVLLDSESGVNLCSWD 474

Query: 433  SPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEY------EISCLDIN 486
            +P G ++  A+AN    LL+  G  LV L + D +   ++    +       +ISCL  +
Sbjct: 475  APSGKAITAASANKGWALLSIDGSLLVSLNLHDNLAAVLRDTSGDESSTQPDQISCL--H 532

Query: 487  PIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHL----GGEIIPRSVLLCAFEGIS 542
               ++P    I  VG W   ++ I  L  L  +  E L        +PR + L       
Sbjct: 533  AARDSP---DIGVVGWWASGTISIVDLATLQPLHGEPLRQTEDSASVPRDIALVQLHPPQ 589

Query: 543  Y----LLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKN-TTHVFAASDR 597
                 LL AL DG+++ F ++++   ++ RK V+LG+ P  L     ++ T +VFA ++ 
Sbjct: 590  VSGPTLLIALEDGNVVTFDMSIQGYTISGRKSVTLGSSPARLHVLPQEDGTCNVFATTEH 649

Query: 598  PTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSI 657
             ++IYS+  +++YS     + +++ PF+S AFP+S+ ++ +  + +  ID  +  H++++
Sbjct: 650  ASLIYSAEGRIIYSATTADDATYVAPFDSEAFPNSIVLSTDSHIRLSHIDKERLTHVKTL 709

Query: 658  PLGEHPRRICHQEQSRTFAICSLKNQSCA-EESEMHFVRLLDDQTFE-------FISTYP 709
             + E  RR+ +    + F +  +K +    EE      R++D+  F+       F ++  
Sbjct: 710  SVKETVRRVAYSPTLKVFGLGCIKKELIQNEEVITSSFRIVDEIIFQELGKPFIFNTSTS 769

Query: 710  LDTFEYGCSILSCSFSDD-SNV--YYCVGTAYVLP----EENEPTKGRILVFIV-EDGKL 761
            L+  E   +++     D   N+   + +GT+++      EEN+ T+GRILV  V E+ ++
Sbjct: 770  LEMVE---TVIRAELPDSMGNLAERFIIGTSFITDDDAIEEND-TRGRILVLGVDENRQV 825

Query: 762  QLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYV 821
              I     KGA   L      ++A +++ + +Y ++        LQ    +      L +
Sbjct: 826  YQIVSHNLKGACRCLGTLGEHIVAGLSKTVVVYHYVEETTVFGSLQKLAAYRPASFPLSL 885

Query: 822  QTRGDFIVVGDLMKSISLLIY-KHEEGA---IEERARDYNANWMSAVEILDDDIYLGAEN 877
               G+ I V DLM+S++L+ +   E+G+   +EE AR Y   W ++V  LD + +L A+ 
Sbjct: 886  DISGNIIGVVDLMQSLTLVEFIPSEDGSRAKLEETARHYQPGWATSVAHLDGERWLEADA 945

Query: 878  NFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSL-----VMRLPDSDVGQI- 931
              N+  +++N E  T+++R +LEV  E ++GE +N+ R   +      +  P + +G + 
Sbjct: 946  QGNIIVLQRNPEAPTEQDRSKLEVTSEMNIGEQINQIRKLHVASNENAVVSPKAFLGSVG 1005

Query: 932  --PTVI--------FGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWR 980
               T+I           + G + +   + P+ Q L L   Q+ L+  I   G ++   WR
Sbjct: 1006 LSETIITCWNQLLMLVQIEGTLYLFGEIAPNYQDLLL-TFQSRLQDYIYAPGNVSFNLWR 1064

Query: 981  SFNNEKKTVDAK-NFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRLH 1036
            +F N+ +  D    F+DG+++E FLDL   + + + + +  SVE++   +EEL R+H
Sbjct: 1065 AFRNKAREGDGPFRFVDGEMVERFLDLDEAKQELVCEGLGPSVEDMRNMIEELRRMH 1121


>gi|346321204|gb|EGX90804.1| DNA damage-binding protein 1 [Cordyceps militaris CM01]
          Length = 1160

 Score =  370 bits (951), Expect = 2e-99,   Method: Compositional matrix adjust.
 Identities = 294/1125 (26%), Positives = 533/1125 (47%), Gaps = 113/1125 (10%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSE--LITRAMGDVSDRIGRPT 63
            ++G I  L+  RP     D LF+ TER ++  + W+++ +E  L    + D ++   R +
Sbjct: 55   VHGTICMLQRLRPKDANTDLLFVGTERLQYFNVSWNSQRNEMELTHETIHDTAEPYMRES 114

Query: 64   DNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNK----GQLKEAFNIRLEELQVLDIKFLYG 119
             +    ++DP  + + +HL++G+  V     +     +L+E   IRL EL + D  F++ 
Sbjct: 115  QSQNKCLVDPTAKYMVMHLWEGVLNVFRLPARRGVSDKLEEMDQIRLTELWMKDSAFIHS 174

Query: 120  -CAKPTIVVLYQ---DNKDARHVKTYEVALKDK--DFVE-GPWSQNNLDN-----GADLL 167
                P I  LY+   D ++AR V  Y +   DK  D  +  P     LD       A +L
Sbjct: 175  RTGHPRIAFLYKTQLDEEEAR-VAVYRLTKDDKGSDVSKFDPHKDRELDQVIADPCASML 233

Query: 168  IPVP----------------PPLCGVLIIGEETIVYCSANAFKAIPI---RPSITKAYGR 208
            IPVP                  L G+L++GE  + Y  +  + ++      P I  A+  
Sbjct: 234  IPVPVAEEKRYHVRNNEGTRAHLGGLLVVGETLLTYFDSLTYSSVSSVLDEPKIYVAWAM 293

Query: 209  VDADGSRYLLGDHAGLLHLLVI-THEKEK---VTGLKIELL------GETSIASTISYLD 258
             D   + Y L D  G L LL + TH ++    V  + +E L        TS AS++ Y+ 
Sbjct: 294  YDE--THYFLADDYGRLDLLTLQTHSEQTGVVVNRMTVEPLKFPDSKNLTSRASSLVYMG 351

Query: 259  NAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQ-------- 310
              ++ + S +GDSQL    LQ D     +  ++   N GPI+DF ++D+  +        
Sbjct: 352  GGMLVVASHHGDSQL----LQIDIDSRTMHPIKLLSNNGPILDFAIMDMGNRDGDNLQGN 407

Query: 311  ----GQGQVVTCSGAYKDGSLRIVRNGIGINEQASVE-LQGIKGMWSLRSSTDDPFDTFL 365
                GQ ++V   GAY+DGSLR +R+G+G+ +   +E ++  + +++++S      DT L
Sbjct: 408  LFSSGQARIVAGCGAYRDGSLRSIRSGVGLEDLGILEEIKNTRRLFTIKSRRSKKVDT-L 466

Query: 366  VVSFISETRILAMNLEDELEETE-IEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTS 424
            V  +I +TR+     + +++E     G   + ++L   +    +L+QVT   V +     
Sbjct: 467  VACYILDTRVFLFGADGDIQEVNNFPGLNLRVESLLVVNLPNGRLLQVTRMDVTIKDPNE 526

Query: 425  RELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQ------LEY 478
            +     W +P G ++  A+ N   +LL   G  +V L I   +   V+  Q         
Sbjct: 527  QAKALRWVAP-GANITSASCNGKWILLCVDGSKIVSLNIDKNLEACVQQYQGADDSNQSN 585

Query: 479  EISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHL----GGEIIPRSVL 534
            ++SCL       + ++     +G W    + I  L  L+    E L        +PR + 
Sbjct: 586  QVSCLH-----ASTTFQDYGVIGWWYPGRITIIDLATLSAKHVETLQQTTDAASVPRDIS 640

Query: 535  LCAFE----GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKN-TT 589
            L            LL AL DG ++ F ++ +   L+ +K+V+LG+ P  L      + T+
Sbjct: 641  LVQLHPRHASGPTLLVALEDGTVITFNVS-EDLTLSGKKRVTLGSSPACLHVLPEADGTS 699

Query: 590  HVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDI 649
             +FA ++  ++IYSS  +++YS     + + + PF+S AFP+S+ ++ +  + +  ID  
Sbjct: 700  KIFATTENASLIYSSEGRIIYSATTADDATCVVPFDSEAFPNSILLSTDKHVRLCHIDKE 759

Query: 650  QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCA-EESEMHFVRLLDDQTFEFISTY 708
            +  H++S+P+ E  RR+ +    + FA+  +K +    EE     V+L+D+  F+ +   
Sbjct: 760  RLTHVKSLPVYETVRRVAYAPGVKAFALGCIKKELIRNEEVITSSVKLVDEIIFQELGA- 818

Query: 709  PL-----DTFEYGCSILSCSFSDDSNVY---YCVGTAYVLPE---ENEPTKGRILVFIV- 756
            PL      T E   S++     D +      + VGT+++      E   T+GRILV  V 
Sbjct: 819  PLQLNASSTLEMVESVIRAELPDHTGALVERFIVGTSFITDADVGEASETRGRILVLGVD 878

Query: 757  EDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHI 816
            E+ +L  I     KGA   L+  +  ++A +++ + +Y++         LQ    +    
Sbjct: 879  EERQLYTIVTHNLKGACRCLSVLDEYIVAGLSKTVVVYRYTEETSTEGSLQKLAAYRPAS 938

Query: 817  LALYVQTRGDFIVVGDLMKSISLLIYKH----EEGAIEERARDYNANWMSAVEILDDDIY 872
              + +   G+ I VGDLM+S+SL+ +      E   ++E+AR + + W ++V  LD + +
Sbjct: 939  FPVALDVSGNMIGVGDLMQSLSLVEFTPPKDGEPAKLQEKARHFQSAWATSVCHLDGERW 998

Query: 873  LGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIP 932
            L  +   N+  + +N E  T+++RGRLE+  E +LGE +N+ R  ++    P  +    P
Sbjct: 999  LETDAQGNIMVLARNPEAPTEQDRGRLEITSEMNLGEQINKIRKLNVA---PADNAVVSP 1055

Query: 933  TVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAK 992
                 ++ G + +   +  +    L  LQ+N+ + +K  G ++   WRSF N+ +  D  
Sbjct: 1056 KAFLASIEGTLYLYGDIAPKYQDLLITLQSNIEQYVKTTGDISFNAWRSFRNQTREADGP 1115

Query: 993  -NFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRLH 1036
              F+DG+++E FLDL      E+ K +  SVE++   +EEL R+H
Sbjct: 1116 FRFVDGEMVERFLDLDELTQVELCKDLGPSVEDVRNMIEELRRMH 1160


>gi|90108802|pdb|2B5N|A Chain A, Crystal Structure Of The Ddb1 Bpb Domain
 gi|90108803|pdb|2B5N|B Chain B, Crystal Structure Of The Ddb1 Bpb Domain
 gi|90108804|pdb|2B5N|C Chain C, Crystal Structure Of The Ddb1 Bpb Domain
 gi|90108805|pdb|2B5N|D Chain D, Crystal Structure Of The Ddb1 Bpb Domain
          Length = 323

 Score =  369 bits (946), Expect = 6e-99,   Method: Compositional matrix adjust.
 Identities = 183/323 (56%), Positives = 241/323 (74%), Gaps = 4/323 (1%)

Query: 330 VRNGIGINEQASVELQGIKGMWSLRSSTD-DPFDTFLVVSFISETRILAMNLEDELEETE 388
           +RNGIGI+E AS++L GIKG+W LRS  + + +DT LV+SF+ +TR+L +N E E+EETE
Sbjct: 4   MRNGIGIHEHASIDLPGIKGLWPLRSDPNRETYDT-LVLSFVGQTRVLMLNGE-EVEETE 61

Query: 389 IEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQ 448
           + GF    QT FC +  + QL+Q+TS SVRLVS   + L +EWK P   +++VA+ N+SQ
Sbjct: 62  LMGFVDDQQTFFCGNVAHQQLIQITSASVRLVSQEPKALVSEWKEPQAKNISVASCNSSQ 121

Query: 449 VLLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISV 508
           V++A G   L YL+I    L ++ H ++E+E++CLDI P+G++   S + A+G+WTDIS 
Sbjct: 122 VVVAVGRA-LYYLQIHPQELRQISHTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISA 180

Query: 509 RIFSLPDLNLITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDR 568
           RI  LP   L+ KE LGGEIIPRS+L+  FE   YLLCALGDG L  F LN++TG L+DR
Sbjct: 181 RILKLPSFELLHKEMLGGEIIPRSILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDR 240

Query: 569 KKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAA 628
           KKV+LGTQP  LRTF S +TT+VFA SDRPTVIYSSN KL++SNVNLKEV++MCP NS  
Sbjct: 241 KKVTLGTQPTVLRTFRSLSTTNVFACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDG 300

Query: 629 FPDSLAIAKEGELTIGTIDDIQK 651
           +PDSLA+A    LTIGTID+IQK
Sbjct: 301 YPDSLALANNSTLTIGTIDEIQK 323


>gi|322700871|gb|EFY92623.1| DNA damage-binding protein 1 [Metarhizium acridum CQMa 102]
          Length = 1121

 Score =  367 bits (942), Expect = 2e-98,   Method: Compositional matrix adjust.
 Identities = 301/1137 (26%), Positives = 548/1137 (48%), Gaps = 130/1137 (11%)

Query: 13   LELFRPHGEAQDFLFIATERYKFCVLQWDAESSEL--ITRAMGDVSDRIGRPTDNGQIGI 70
            L+  +P     D LFI T+R  +  + W++E+++L  + + + DV+++  R + +    +
Sbjct: 2    LQRLQPKDSETDLLFIGTDRLNYFNVAWNSETNQLDTVEQTIEDVAEQYMRHSQSQNKCL 61

Query: 71   IDPDCRLIGLHLYDGLFKV--IPFDNKG---QLKEAFNIRLEELQVLDIKFLYG-CAKPT 124
            +DP  + + +HL++G+  V  +P   KG   +L     +RL EL +    FL+     P 
Sbjct: 62   VDPTGKFMAMHLWEGVLNVFRLPM-RKGITTKLVALDQVRLTELWMKASTFLHSQTGHPK 120

Query: 125  IVVLYQDNKDARHVKTYEVALKDKDFVEG------PWSQNNLDNG-----ADLLIPVP-- 171
            I  LY+   D    +     L  KD   G      P  +  LD       A +LIPVP  
Sbjct: 121  IAFLYKTQTDQEEARIAVYRLT-KDDNRGNVASFDPHRERELDQVISDPYASMLIPVPFR 179

Query: 172  --------------PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAY-GRVDADGSRY 216
                            L G+L++GE  I Y  +  +  +       K Y    + D   Y
Sbjct: 180  EEKRYHVRNNEGAKAHLGGLLVVGETLITYFDSLTYSRVSSTLQDPKIYVAWTEYDDVHY 239

Query: 217  LLGDHAGLLHLLVITHEKEK----VTGLKIELL------GETSIASTISYLDNAVVYIGS 266
            LL D  G L +L +    E     VTG+ +  +        TS AS++ Y+ + ++++ S
Sbjct: 240  LLADDYGRLDILTVETTTESTGIVVTGMAVSPMRFPDSSAYTSRASSLVYMGDDMLFLAS 299

Query: 267  SYGDSQLIKLNLQPDAKGSYVEVLERYV-NLGPIVDFCVVDLERQ------------GQG 313
             +GDSQL+++++  DA+   V VL + + N  PI+DF ++D+  +            GQ 
Sbjct: 300  HHGDSQLLRIDI--DAQ---VMVLVKTLSNNAPILDFAIMDMGNREGDSQFGNAFSSGQA 354

Query: 314  QVVTCSGAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLRSSTDDPFDTFLVVSFISE 372
            ++V   GAY DGSLR +R+G+G+ +Q  + E+Q  KG+++LRS      DT LVVS +++
Sbjct: 355  RIVAGCGAYHDGSLRSIRSGVGLEDQGILDEIQDTKGLFTLRSHESSHVDT-LVVSSVAD 413

Query: 373  TRILAMNLEDELEET-EIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEW 431
            TR+L  +   ++EE    +G     +TL   +    +L+QVT  S  L+ S S      W
Sbjct: 414  TRVLRFDSAGDIEEVYAFQGLTLDMETLLAVNISDGRLLQVTPKSAVLLDSESGVNLCSW 473

Query: 432  KSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEY------EISCLDI 485
             +P G ++  A+AN    LL+  G  LV L + D +   ++    +       +ISCL  
Sbjct: 474  DAPSGKAITAASANKGWALLSIDGSSLVSLNLHDNLAAVLRDTSGDESSGQPDQISCL-- 531

Query: 486  NPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHL----GGEIIPRSVLLCAFEGI 541
            +   ++P    I  VG WT  ++ I  L  L  +  E L        +PR + L      
Sbjct: 532  HAARDSP---DIGVVGWWTSGTISIVDLATLQPLHGEPLRQTEDSASVPRDIALVQLHPP 588

Query: 542  SY----LLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKN-TTHVFAASD 596
                  LL AL DG+++ F ++M+   ++ RK V+LG+ P  L     ++ T +VFA ++
Sbjct: 589  QVSGPTLLIALEDGNVVTFDMSMQGYTISGRKSVTLGSSPARLHVLPQEDGTCNVFATTE 648

Query: 597  RPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRS 656
              ++IYS+  +++YS     + +++ PF+S AFP+S+ ++ +  + +  ID  +  H+++
Sbjct: 649  HASLIYSAEGRIIYSATTADDATYVAPFDSEAFPNSIVLSTDSHIRLSHIDKERLTHVKT 708

Query: 657  IPLGEHPRRICHQEQSRTFAICSLKNQSC-AEESEMHFVRLLDDQTFE-------FISTY 708
            + + E  RR+ +    + F +  +K +    EE      R++D+  F+       F ++ 
Sbjct: 709  LSVKETVRRVAYSPTLKVFGLGCIKKELIHNEEVITSSFRIVDEIIFQELGKPFIFNTST 768

Query: 709  PLDTFEYGCSILSCSFSDDSNVY---YCVGTAYVLP----EENEPTKGRILVFIV-EDGK 760
             L+  E   +++     D +      + +GT++V      EEN+ T+GRILV  V E+ +
Sbjct: 769  SLEMVE---TVIRAELPDSTGNLAERFIIGTSFVTDDDAIEEND-TRGRILVLGVDENRQ 824

Query: 761  LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY 820
            +  I     KGA   L+     ++A +++ + +Y ++        LQ    +      L 
Sbjct: 825  VYQIVSHNLKGACRCLSTLGEHIVAGLSKTVVVYNYVEETTVFGSLQKLAAYRPASFPLG 884

Query: 821  VQTRGDFIVVGDLMKSISLLIY-KHEEGA---IEERARDYNANWMSAVEILDDDIYLGAE 876
            +   G+ I V DLM+S++L+ +   ++G+   +EE AR Y   W ++V  LD + +L A+
Sbjct: 885  LDISGNIIGVVDLMQSLTLVEFIPSKDGSRAKLEEVARHYQPGWATSVTNLDGERWLEAD 944

Query: 877  NNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSL-----VMRLPDSDVGQI 931
               N+  +++N E  T+++R +LEV  E ++GE +N+ R   +      +  P + +G +
Sbjct: 945  AQGNIIVLQRNPEAPTEQDRSKLEVTSEINIGEQINQIRRLHVASNENAVVSPKAFLGSV 1004

Query: 932  -----------PTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWR 980
                         +I   + G + +   +  +    L   Q  L+  I   G ++   WR
Sbjct: 1005 GLSETTINCWTQLLILVQIEGTLYLFGEIAPKYQDLLLTFQARLQDYIYAPGNVSFNLWR 1064

Query: 981  SFNNEKKTVDAK-NFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRLH 1036
            +F N+ +  D    F+DG+++E FLDL   + + + + +  SVE++   +EEL R+H
Sbjct: 1065 AFRNKAREGDGPFRFVDGEMVERFLDLDEAKQELVCEGLGPSVEDMRNMIEELRRMH 1121


>gi|384080885|dbj|BAM11105.1| damage-specific DNA binding protein 1, 127kDa, partial
           [Siebenrockiella crassicollis]
          Length = 364

 Score =  359 bits (922), Expect = 4e-96,   Method: Compositional matrix adjust.
 Identities = 186/367 (50%), Positives = 237/367 (64%), Gaps = 44/367 (11%)

Query: 615 LKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRT 674
           LKEV++MCP NS  +PDSLA+A    LTIGTID+IQKLHIR++PL E PR+IC+QE S+ 
Sbjct: 2   LKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKLHIRTVPLYESPRKICYQEVSQC 61

Query: 675 FAICSLK---------------------------------------NQSCAEESEMHFVR 695
           F + S +                                         S  EE E+H + 
Sbjct: 62  FGVLSSRIEVQDASGGTTALRPSASTQALSSSVSSSKLFSSSTAPHETSFGEEVEVHNLL 121

Query: 696 LLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFI 755
           ++D  TFE +  +     EY  S++SC    D N Y+ VGTA V PEE EP +GRI+VF 
Sbjct: 122 IIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFIVGTAMVYPEEAEPKQGRIVVFH 181

Query: 756 VEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGH 815
             DGKLQ +AEKE KGAVYS+  FNGKLLA+IN  ++LY+W       +EL++EC H+ +
Sbjct: 182 YSDGKLQSLAEKEVKGAVYSMVEFNGKLLASINSTVRLYEWT----AEKELRTECNHYNN 237

Query: 816 ILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGA 875
           I+ALYV+T+GDFI+VGDLM+S+ LL YK  EG  EE ARD+N NWMSAVEILDDD +LGA
Sbjct: 238 IMALYVKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLGA 297

Query: 876 ENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMR-LPDSDVGQIPTV 934
           EN FNLF  +K+S   TDEER  L+ VG +HLGEFVN F HGSLVM+ L ++      +V
Sbjct: 298 ENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQDSV 357

Query: 935 IFGTVNG 941
           +FGTVNG
Sbjct: 358 LFGTVNG 364


>gi|154303693|ref|XP_001552253.1| hypothetical protein BC1G_08731 [Botryotinia fuckeliana B05.10]
          Length = 1087

 Score =  357 bits (915), Expect = 2e-95,   Method: Compositional matrix adjust.
 Identities = 269/882 (30%), Positives = 454/882 (51%), Gaps = 76/882 (8%)

Query: 212  DGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDS 271
            D   YLLGD  G L++L I  +   VTGL++  LG+ S  +++  L N V YI S   D+
Sbjct: 225  DERTYLLGDDYGFLYILTILVDGATVTGLQVVKLGQVSKPTSLENLGNGVFYIASHEADN 284

Query: 272  QLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDL-----ERQ------GQGQVVTCSG 320
            Q+++++L+    G  V +L+   N+ PI+DF V+D+     E Q      GQ ++VT SG
Sbjct: 285  QVVQIDLESPEHG--VTLLQTLPNIAPILDFTVMDMGGREGETQLNEYSSGQARLVTGSG 342

Query: 321  AYKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMN 379
             ++ GSLR VR+G+G+++ A + E++GI+ +++L S    P DT LVVSF +ETR    +
Sbjct: 343  GFEGGSLRSVRSGVGLDDTAILAEMEGIRKVFALHSGPTLPNDT-LVVSFSTETRFFKFD 401

Query: 380  LEDELEETE-IEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYS 438
             + ++EE E ++   S ++TL  ++     ++QVT   V +   +     + W+ P G  
Sbjct: 402  TQGDIEEVETVKNLSSTSETLLTYNLDEGCILQVTQHEVSIYGKSPG---HRWQPPNGQI 458

Query: 439  VNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQL-EYEISCLDINPIGENPSYSQI 497
            +  A+ N + +LL++ G  L  L I    L EV   +L + +++C+ +  +        I
Sbjct: 459  ITAASGNQNYILLSSNGRTLFTLSIQQN-LAEVAFQELGDDQVACIHVPQV-----MGDI 512

Query: 498  AAVGMWTDISVRIFSLPDLNLITKEHL---GGEIIPRSVLLCA-----FEGISYLLCALG 549
              VG+W   SV +  L  LN I  E L    G  IPR + L         G + L  ++ 
Sbjct: 513  GVVGLWKSGSVSLLDLATLNTIVSEDLRRADGASIPRDIALTQILPPELSGPT-LFVSME 571

Query: 550  DGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTT-HVFAASDRPTVIYSSNKKL 608
            DG +L+F ++     L+ RK + LGTQ   L+     NTT +VFA  + P++IY S  + 
Sbjct: 572  DGIVLSFNVDKSDCSLSGRKSIVLGTQQAQLQILPRDNTTFNVFATCEHPSLIYGSEGRT 631

Query: 609  LYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICH 668
            +YS V  ++   +C  NS A+P S+ +A   EL +  ID+ ++ H+R++P+GE  RR+ +
Sbjct: 632  VYSAVTAEDAIAVCSLNSVAYPGSVVVATINELKLAVIDNERRTHVRTLPIGETVRRVAY 691

Query: 669  QEQSRTFAICSLKNQ-SCAEESEMHFVRLLDDQTF-EFISTYPL----DTFEYGCSILSC 722
              + ++FAI ++K + +  +E      RL+D+  F E    Y L    +  E        
Sbjct: 692  SAKEKSFAIGAIKRELTKGQEVVTTSFRLVDEVVFGELGEPYYLPPNNEIIETVVRAELP 751

Query: 723  SFSDDSNVY--YCVGTAYVLPEENEPTKGRILVFIVE-DGKLQLIAEKETKGAVYSLNAF 779
            +   D  +   + VGT++ L EE    +GR+L+F V  D    +IA    KG+   +   
Sbjct: 752  TRHGDGELVERFLVGTSF-LHEEEANVRGRLLIFGVNADRAPYMIASHNLKGSCRCIGVL 810

Query: 780  NGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISL 839
            +GK++AA+N+ + +Y +      +  L+    +      + +    + I V D+MKSI+L
Sbjct: 811  DGKIVAALNKTVVMYDYEETSSTSATLKKLATYRCSTCPIDIDITDNIIAVADIMKSIAL 870

Query: 840  LIYKHEEGA------IEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATD 893
            + Y    GA      +EE AR     + ++V  +D D YL  +++ NL  +++N EG T 
Sbjct: 871  VEYT--PGADGLPDKLEEVARHAQQVFSTSVAEVDTDTYLETDHDGNLILLKRNREGVTR 928

Query: 894  EERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQ 953
            E++ R+EV  E +LGE VNR +  ++       D   IP    GT  G I + + +P + 
Sbjct: 929  EDKTRMEVTCEMNLGEMVNRVKRINVET---SKDALLIPRAFLGTTEGSIYLFSLIPPQN 985

Query: 954  YLFLEKLQTNLRKV-----IKGV-------------GGLNHEQWRSF-NNEKKTVDAKNF 994
               L +LQ+ L  +     I+G              G L+  ++RS+ +  ++T +   F
Sbjct: 986  QDLLMRLQSRLASLPSASSIRGSSDSTSPHQIELSPGNLDFNKYRSYISATRETSEPFRF 1045

Query: 995  LDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRLH 1036
            +DG+LIE FLDL     + +++ + V  E+L   VE L RLH
Sbjct: 1046 VDGELIERFLDLEVEVQEHVAEGLGVKAEDLRGMVEGLRRLH 1087



 Score = 69.7 bits (169), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 49/172 (28%), Positives = 90/172 (52%), Gaps = 10/172 (5%)

Query: 629 FPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQ-SCAE 687
           +P S+ +A   EL +  ID+ ++ H+R++P+GE  RR+ +  + ++FAI ++K + +  +
Sbjct: 5   YPGSVVVATINELKLAVIDNERRTHVRTLPIGETVRRVAYSAKEKSFAIGAIKRELTKGQ 64

Query: 688 ESEMHFVRLLDDQTF-EFISTYPL----DTFEYGCSILSCSFSDDSNVY--YCVGTAYVL 740
           E      RL+D+  F E    Y L    +  E        +   D  +   + VGT++ L
Sbjct: 65  EVVTTSFRLVDEVVFGELGEPYYLPPNNEIIETVVRAELPTRHGDGELVERFLVGTSF-L 123

Query: 741 PEENEPTKGRILVFIVE-DGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKI 791
            EE    +GR+L+F V  D    +IA    KG+   +   +GK++AA+N+ +
Sbjct: 124 HEEEANVRGRLLIFGVNADRAPYMIASHNLKGSCRCIGVLDGKIVAALNKTV 175


>gi|393243160|gb|EJD50676.1| hypothetical protein AURDEDRAFT_112250 [Auricularia delicata
            TFB-10046 SS5]
          Length = 1140

 Score =  354 bits (909), Expect = 1e-94,   Method: Compositional matrix adjust.
 Identities = 294/1007 (29%), Positives = 488/1007 (48%), Gaps = 113/1007 (11%)

Query: 83   YDGLFKVIPFDNK-GQL-KEAFNIRLEELQVLDIKFL-YGCAKPTIVVLYQDNKDARHVK 139
            Y G    +  D++ G L +E F+  L EL VL    +      P + +L+ D+ D R V 
Sbjct: 126  YKGKLHALRLDDRRGVLGRECFDCNLRELTVLSACVVPVESDLPVLTILHTDSDDKRRVT 185

Query: 140  TYEVALKDKDF---VEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANA---- 192
            T  V L + D     E  +++   DN    L+P+    C + + G    VY  A+A    
Sbjct: 186  TRPVLLDESDVGERTEHLFTKVISDN-THTLVPLRNHGCILTMGGTSLRVYKVAHADVRL 244

Query: 193  ----------------FKAIPIRPSITKA-------YGRVDA----DGSRYLLGDHAGLL 225
                             KA    P+IT+A       +G + A    D  R L+GD  G L
Sbjct: 245  SPAHKKSRKRQSTTGKGKAAADEPAITEAAVSEPWPWGEIVAWTMVDDERILIGDAYGRL 304

Query: 226  HLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGS 285
             L+ ++  +     +   LLG+TS A+T +YLDN ++Y+GS +GDSQL++L+   DA GS
Sbjct: 305  VLVTVSLNENAAFTISPVLLGQTSPATTFTYLDNGILYVGSHFGDSQLVRLSTAADASGS 364

Query: 286  YVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVE-L 344
            Y+ V++ Y NL PIVD  +VD+    Q QVVTCSG Y  GSLR+V+    + + A ++  
Sbjct: 365  YISVVKAYSNLAPIVDAVIVDINDSNQPQVVTCSGGYNTGSLRVVQKSAELEQLAIIDAF 424

Query: 345  QGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDA 404
               + ++ LR+S+D   DT ++VS   +T+ +  NLE E    E   F ++  TL   + 
Sbjct: 425  PHTENIFPLRASSDAKEDTRMLVSSHMDTKFV--NLEGEDLSVEESPFVTREPTLAASNF 482

Query: 405  IYNQLV------QVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHL 458
            ++   V      Q+T+  V LV++      +  + P G  + +A+ + +Q LLA   G L
Sbjct: 483  VHKDQVETPYVLQITTKRVVLVNTVVDMEIHSHEYPEG--IMLASCSRTQALLALSDGAL 540

Query: 459  VYLEIGDGILTEVKHAQLEYEI----SCLDINPIGENPSYSQIAAVGMWTDISVRIFSLP 514
            VY+ + +  +   K  +LE+ +    S L I P       +    +  W +       L 
Sbjct: 541  VYIRLENDNINATKGPKLEWTLRGQWSALCIEPADARGGPTDAVVIAKWDN----TVQLR 596

Query: 515  DLNLITKEHLGGEIIPRSVLLCAFEGISYLLCALGDG----HLLNFLLN--------MKT 562
             +  + ++ +    +  + L C     S LL   G G    H+L    N         K 
Sbjct: 597  RMGALFRDDIAHATLQLASLAC-----SVLLVDFGQGRENRHVLAGCANGTVVAARVTKD 651

Query: 563  GELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMC 622
              L   + + LG+ P+ LR    ++   V A     TV+Y  + +L +S + +K++++  
Sbjct: 652  WNLEAPRTIGLGSLPVRLR----RDGDRVVACGSLVTVLYWDSGRLQHSALAVKDITNAV 707

Query: 623  PFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG-EHPRRICHQEQSRTFAICSLK 681
            P ++ AF ++      G L I  +  +Q+LHI++IP+G E+PRRI +   ++ F + ++ 
Sbjct: 708  PLHTPAFENTTVFISPGSLLIARVKQVQRLHIKTIPMGSENPRRIAYNSHAKAFGVGTMH 767

Query: 682  NQSCA---EESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNV-YYCVGTA 737
            +Q       ES     ++L+  TF+ +S+  LD  E   S+ S     +S    + VGTA
Sbjct: 768  SQPVGVGEPESVTSSFKILNQDTFDTLSSMQLDDNEEITSVASLPIMPESRTEMFVVGTA 827

Query: 738  YVLPEENEPTKGRILVF-IVED---GKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
            Y+   E EP++GRILVF  +ED   G   L A  +  GAV SL + +G ++A +N  + L
Sbjct: 828  YIKDSEMEPSRGRILVFGSLEDSGTGGSWLTAFLQVTGAVLSLTSVDGLIVAGVNTAVIL 887

Query: 794  YKWMLRDDGTREL---------QSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKH 844
            Y+  LR +   E          Q +  +H +++   +  RGD I +GD + SI++L +KH
Sbjct: 888  YE--LRRNTLSEAERASHLTLRQKKEWNHNYVVT-SLAARGDTIYIGDSVASIAILRWKH 944

Query: 845  EEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGE 904
            E   +   AR +   +  A++++     + A  ++NL T  +  E  TD    +LE+ G 
Sbjct: 945  E--TLHTIARHFGPIFPLALDVMSSGSVITANIDYNLHTFHQ--ESPTDR---KLEIDGS 997

Query: 905  YHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNL 964
            YHLG+ VN+F  G L    P      +   +F T  G IG++A    +    L  L+ N+
Sbjct: 998  YHLGDQVNKFIPGRL--SAPTVGASIVLEQVFVTSLGRIGIVAEADKDASWALSALERNI 1055

Query: 965  RKVIKGVGGLNHEQWRSFNNEKKTVDAK----NFLDGDLIESFLDLS 1007
             KV+   G   H+ WR+ ++E    DA+    NF+DGD++E F D++
Sbjct: 1056 EKVLDQ-GAPKHDLWRAPHSEHGVSDAQRSAANFIDGDMLERFYDVA 1101


>gi|315039795|ref|XP_003169275.1| hypothetical protein MGYG_08821 [Arthroderma gypseum CBS 118893]
 gi|311337696|gb|EFQ96898.1| hypothetical protein MGYG_08821 [Arthroderma gypseum CBS 118893]
          Length = 1076

 Score =  351 bits (900), Expect = 1e-93,   Method: Compositional matrix adjust.
 Identities = 309/1094 (28%), Positives = 510/1094 (46%), Gaps = 154/1094 (14%)

Query: 6    IYGRIATLE-LFRPHGEA---QDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIG 60
            IYGRI  L+ L RP        D LF+ T++Y +  L WDA   +L T R   D++D   
Sbjct: 74   IYGRITLLQKLSRPTTAGPVLTDVLFVGTDQYAYFSLTWDAAHDQLRTERKYIDLADGSL 133

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPF----------------------DNKGQL 98
            R   +     IDP    + L +Y+G+  VIP                        N+   
Sbjct: 134  REAHSDDRCQIDPSGSFLTLEVYEGVVSVIPLATADSHKRAKSAASTSASTAPTSNRTAR 193

Query: 99   KEAFNIRLE-ELQVLDIKFLYGCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQ 157
            + A N + +  L++ D+K  +        V+  D   A  +K     L D+         
Sbjct: 194  RAATNSQGKVRLKLRDLKHTHA-------VVTGDGGSAAELKDV-TTLSDE--------- 236

Query: 158  NNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIRP----SITKAYGRVDADG 213
              LD GA +LIPVP PL G+LIIGE +I Y   +  + I  RP    ++  A+ +VD  G
Sbjct: 237  --LDLGASILIPVPRPLGGLLIIGESSIKYVDVSRNETIS-RPLAESTVFVAWEQVD--G 291

Query: 214  SRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQL 273
             R+LL D  G L  L++  +                        DNAV          + 
Sbjct: 292  QRWLLADDYGRLFFLMLVLDA-----------------------DNAV----------ES 318

Query: 274  IKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNG 333
             K++L                 LG   +F        GQ ++VT SGA+ DGSLR VR+G
Sbjct: 319  WKVDL-----------------LGTTREFS------SGQTRIVTGSGAFGDGSLRSVRSG 355

Query: 334  IGINEQASV-ELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIE-- 390
            +GI +   +  ++ I  +W LRS+  +PF   L+VSF++ETR+   + + E+EE E +  
Sbjct: 356  VGIEDLGVLASMEHITDLWGLRSACPEPFYDTLLVSFVNETRVFHFSPDGEVEEKEDDFL 415

Query: 391  GFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVL 450
            G      TL   +   N+++QVT  + +++   S  +   W+S    S+  A+AN   ++
Sbjct: 416  GLVFSRSTLLATNIPENRILQVTESTAKVIDLDSGMVI--WQSSHNESITSASANDDYLV 473

Query: 451  LATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDI--NPIGENPSYSQIAAVGMWTDISV 508
            L  GG  LV + +        +  +++ ++S + I  +PI       Q   V +     +
Sbjct: 474  LVLGGIRLVCISLSTFEQVGSRDFEVDNQVSGMTIPASPI-------QACIVCLPQSAEI 526

Query: 509  RIFSLPDLNLITKEHLG--GEIIPRSVLLCAFEG--ISYLLCALGDGHLLNFLLNMKTGE 564
             I  LP L +  K+ LG  GE IPRSV++          L  ++ DG + +F  ++ T  
Sbjct: 527  VILDLPGLEVKNKQALGEPGEAIPRSVIVAEILAGKPPTLFVSMADGTVFSFSFDINTFT 586

Query: 565  LTDRKKVSLGTQPITLRTFSSKNTTH-VFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCP 623
            ++   K++LG++  + +     N  + VFA  D P++I++S  +++YS V+    S +C 
Sbjct: 587  ISGSSKITLGSEQPSFKKLPRGNGQYNVFATCDHPSLIHASEGRIVYSAVDSASSSRICS 646

Query: 624  FNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKN- 682
             N+ A+P S+A++ + EL I  +D+ +   I ++P+    RR+ +    + F + ++K  
Sbjct: 647  LNTQAYPGSIALSNQNELKIALVDEERTTQIHTLPMHASVRRLAYSPLEKAFGLGTVKRT 706

Query: 683  -QSCAEESEMHFVRLLDDQTFEFISTYPLDTFEY-GCSILS---CSFSDDSNVY----YC 733
              +  EE    FV L D+  F  +STY L   E   C I S   C   +  N+     + 
Sbjct: 707  ISNGVEEVSSSFV-LADEIHFRPLSTYDLRPDELVECVIRSQVNCGKDEVGNLMSKDLFF 765

Query: 734  VGTAYVLPEENEPTKGRILVFIVEDGK-LQLIAEKETKGAVYSLNAFNGKLL-AAINQKI 791
            VGTA++    ++  +GRIL+F V   + L LI +K   GA  +L   +  LL A + + +
Sbjct: 766  VGTAFLDDVGDDHIRGRILIFEVNKSRELSLIVDKSLMGACRTLAVMDPSLLVAGLVKSV 825

Query: 792  QLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYK-----HEE 846
             ++K      G   L+    +      + +   GD + V D+MKS+SL+ Y       +E
Sbjct: 826  SVFKLARDRFGNIFLEKHTAYRTSTAPIDISVTGDTVAVADVMKSMSLVQYTPAEKDEQE 885

Query: 847  GAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYH 906
               EE AR Y   W +AV ++++ +YL A+   NL  +++N+ G T+ +R RL+   E  
Sbjct: 886  PKFEEIARHYQTLWSTAVTLIEEHVYLLADAEGNLVVLQQNTTGVTESDRKRLQPTSEIR 945

Query: 907  LGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRK 966
            LGE VNR  H   V    ++ V      +  TV+G I +   +       L +LQT +  
Sbjct: 946  LGEMVNRI-HPITVQTHTETAVSA--RALLATVDGSIYLFGLINPAYIDLLLRLQTAMAS 1002

Query: 967  VIKGVGGLNHEQWRSFNNEKKTVDAK-NFLDGDLIESFLDLSRTRMDEISKTM---NVSV 1022
            +    G +   ++R+F    +  D    F+DG+LIE FL  + T  +EI+  +   NV+V
Sbjct: 1003 ITVSPGEIPFSKYRAFRTTVRQSDEPFRFVDGELIERFLSCTPTMQEEIANRLDDSNVTV 1062

Query: 1023 EELCKRVEELTRLH 1036
              L + +EEL R+H
Sbjct: 1063 SSLKEMIEELRRMH 1076


>gi|402223178|gb|EJU03243.1| hypothetical protein DACRYDRAFT_115454 [Dacryopinax sp. DJM-731 SS1]
          Length = 1175

 Score =  346 bits (888), Expect = 3e-92,   Method: Compositional matrix adjust.
 Identities = 315/1087 (28%), Positives = 512/1087 (47%), Gaps = 128/1087 (11%)

Query: 58   RIGRPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
            R GR  +  Q  I+ P  R+     Y G  KV  F   G+ K+  ++R+ E  +L   FL
Sbjct: 109  RAGRIAEYCQTSIVHPSGRVAVTSAYTGSLKVTVFGEDGKGKD-IDVRIREQNLLSFAFL 167

Query: 118  -YGCAKP-TIVVLYQDNKDARHVKTYEV----------ALKDKDFVEGPWSQN---NLDN 162
                + P T+ +L+ D    RH+ T  +          +L   D  E P +     NLD+
Sbjct: 168  PLSTSSPFTLALLHIDQNGKRHLVTRSLKHVPEPGSTKSLASLDISE-PTAHIPDINLDD 226

Query: 163  G--ADLLIPVPPPL----CGVLIIGEETIVYCSAN-----------------------AF 193
               A  ++ VPPP      GV++ GE    + +                           
Sbjct: 227  EWEATSVVGVPPPRPGARAGVMVFGEGQARWYAVKEEGAPDPKGKRRSSTAGSPTQEKKG 286

Query: 194  KAIPIRPSITKA---YGRVDA----DGSRYLLGDHAGLLHLLVITHEKEK--VTGLKIEL 244
            K+ P R    K    +GR+ A    D  + L+ D  G L+LL +   + +  V  ++ E 
Sbjct: 287  KSKPPRVEEAKVGLPWGRIAATCAIDEEKLLVSDEYGKLYLLALHRRQSEGPVVSIQREA 346

Query: 245  LGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCV 304
            LGE S  S+I++L ++  ++ S  GDSQLI L          + VL+ ++NL PI DF V
Sbjct: 347  LGEISSPSSITHLTSSYFFVTSKCGDSQLISLTPSAICGTGNIAVLDTHLNLAPISDFVV 406

Query: 305  VDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVE-LQGIKGMWSLRSSTDDPFDT 363
             D E+ GQ Q+VT SG    GSLR+VR+G+   E   +E +   + ++ L+ + D  +  
Sbjct: 407  TDPEKIGQQQIVTASGRMNTGSLRVVRSGVAFEELGLLEQVMNTRSIFPLKINFDAEYHL 466

Query: 364  FLVVSFISETRILAMNLED------ELEETEIEGFCSQTQTLFCHDAIYNQL-VQVTSGS 416
            +LV S +  T + +M  +D      E++E+E  G   Q  TL   +   + + VQVT+GS
Sbjct: 467  YLVTSSLHRTTLYSMRGDDSSPQFEEVDESEARGIVRQVPTLAAGNIREDDIFVQVTAGS 526

Query: 417  VRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYL--EIGDGILTEVKHA 474
            V  +   + E R+ WK  P   +  A  N  +V+    GG+LV+L  +    I+     +
Sbjct: 527  VVAIDLITWEQRSIWK--PHVEIVCAAVNGQRVVAGLKGGYLVHLIPQRDQFIVANDWKS 584

Query: 475  QLEY---EISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEII-- 529
               +   EISC+ ++P   +      AA+G W    V+IF L D     +E    E I  
Sbjct: 585  PGSWGLTEISCIALDPT-LSAELGNYAAIGFWGTNQVKIFYLGDETHRFEELKLAEPISP 643

Query: 530  ----PRSVLLCAFEGIS---YLLCALGDGHLLNFLLNMKT--GE----LTDRKKVSLGTQ 576
                P S +L  F G S   +LL  LG+G + ++ L  +   GE     TD+K  SLGT+
Sbjct: 644  EDHLPVSTILTTF-GTSNRPHLLVGLGNGTISSYALKTEIVLGEPSVRATDKKIFSLGTK 702

Query: 577  PITLRTFSS-KNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAI 635
            PI L   +     +++FA  DRP +++  N +L  S + L+++      +   FP S   
Sbjct: 703  PIMLNACTDLGRESNIFACGDRPALLFLKNDRLTASPIKLRDIHAGSVLHIPQFPSSFIF 762

Query: 636  AKEGELTIGTIDDIQKLHIRSIPLG-EHPRRICHQEQSRTFA-ICSLKNQSCAEESEMH- 692
            A    L IG I + QK+ +R+I LG + P R+ +    R +  +C  K  +  ++ E++ 
Sbjct: 763  ASASTLLIGQIRESQKIDVRTISLGLDTPIRLTYHRGLRAYGVVCQRKELNREDDREIYS 822

Query: 693  -FVRLLDDQTFEFISTY---PLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTK 748
               +L DD TFE+++ +   P +      +I   +  +DS+ +  VGT      E + +K
Sbjct: 823  SSFKLFDDITFEYLNNFTARPDEQMMCVTTIPDSTGEEDSD-FLVVGTYEATGAEEDVSK 881

Query: 749  GRILVF-IVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQ 807
            GRIL+F  V + KL+L+   +  G VY++      L AAIN  +Q++      D  R ++
Sbjct: 882  GRILIFEEVPNRKLKLVVSHDVGGCVYAVTNVGANLAAAINGTLQVFSLHRSHDDIR-IE 940

Query: 808  SECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEIL 867
            S        +A  +  RG+ ++VGD M+++ +L +   +  +E    DY + W+  +E +
Sbjct: 941  SVAKWSSAYVASSLICRGNTLLVGDAMRAVCILRWTGAK--LETLYHDYASLWIQTLESI 998

Query: 868  DDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLP--D 925
            D+   +GAE N N+ T RK+         G+LE  G ++ GE +NRFR  SL    P   
Sbjct: 999  DEGGVIGAELNNNIVTWRKD---------GKLERDGMWYFGEGINRFRRASLNAAAPGAG 1049

Query: 926  SDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNE 985
             +      ++F T  G IG++ASL  +  + L  LQ N+  VI+G G + H + R+  + 
Sbjct: 1050 GNNAGRGNLVFCTNTGRIGIVASLDEDLSMQLSNLQRNIGSVIQGPGKVEHSRRRAPQST 1109

Query: 986  K-------KTVDAKNFLDGDLIESFLDLSRTRMDEISKTM--NVSVEELCKR-------V 1029
                    +   A  F+DGD +E FL L  +  DE+ + +      E+L  R       +
Sbjct: 1110 PGLPTILTQNTHAVGFIDGDFVEKFLYLDPSS-DEVMRILEGKNKYEQLGNRYHDIVQML 1168

Query: 1030 EELTRLH 1036
            EEL RLH
Sbjct: 1169 EELQRLH 1175


>gi|328858656|gb|EGG07768.1| hypothetical protein MELLADRAFT_105631 [Melampsora larici-populina
            98AG31]
          Length = 1216

 Score =  344 bits (883), Expect = 1e-91,   Method: Compositional matrix adjust.
 Identities = 323/1194 (27%), Positives = 535/1194 (44%), Gaps = 191/1194 (15%)

Query: 4    VPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPT 63
            +P++G I  ++ +         L + T       L++  +S  +IT A   ++    RP 
Sbjct: 53   LPVFGLIDHIQSYIKQPNQTATLLVLTSELNLFTLRYCPQSKSVITTATISLNQPGARPA 112

Query: 64   DNGQIGIIDPDCRLIGLHLYDGLFKVIPF------------------------------- 92
            D  Q  I+DP  R + LH  +G+  +IP                                
Sbjct: 113  DYLQTSIVDPHSRCLVLHALNGILHLIPLEPSSKPKPKHLEPSLGGRRKRSTISLKSNHP 172

Query: 93   ---------DNKGQLKEAFNIRLEELQV--LDIKFLYGCAKPTIVVLYQDNKDARHVKTY 141
                     D   +L +   +RL E+ V  LD   L     PT+++LY ++   R +++ 
Sbjct: 173  PNPDLDTNIDPDSELYQIVQLRLNEVNVHALDFAALPAHLPPTLLILYSNHMGQRVLRSR 232

Query: 142  EVALK----DKDFVEGPWSQNNL--DNGADLLIPVPPPLCGVLIIGEETI--VYCS---- 189
             + L     D+D        N+L  D    L+IP+P     VL++GE+++  VY +    
Sbjct: 233  SIDLITANCDQDICP-----NHLCPDLDTSLIIPIPDDQGSVLLVGEDSVELVYLTDRLP 287

Query: 190  ------ANAFKAIPIRPSITKA----------------------YGRVDADGSRYLLGDH 221
                   +  +++  R   T+A                      + +V+     +L+GD 
Sbjct: 288  NPTGKGKSRDRSVDSRTGATRAPESDKLLKHVTSINIPLGSYTCFCKVEDQPDVWLIGDL 347

Query: 222  AGLLHLLVITHEKEKVTG--LKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQ 279
             G  H+LVI  E+ +     L+ +  G+ S    + Y+ +  +Y+ S YGDSQL++++  
Sbjct: 348  YG--HVLVIRLERPEGGKPLLRYQQAGQVSSPEALVYISDRFIYLASHYGDSQLLRISAL 405

Query: 280  PDA--KGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGIN 337
              A  +    EV+  Y NL PI D CVVD       Q+VTCSGAY+DGSLRI+ +GI + 
Sbjct: 406  DSALAQDCQPEVVANYPNLAPISDVCVVDQSEGFDHQLVTCSGAYQDGSLRIITHGITLT 465

Query: 338  EQASVELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAM---NLEDELEETE-IEGFC 393
            +   + + G + +WS+   T       L+V F +ETR L +       ++EE +   GF 
Sbjct: 466  DLGMLPIAGAEHIWSI--DTPHSPQVTLIVGFRNETRFLIIENDQFSQDVEELDSFSGFK 523

Query: 394  SQTQTLFCHDAIYNQ------LVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANAS 447
            S  +T+     I          +QVT   V +V    R     W+      + VA   AS
Sbjct: 524  SDRRTILAGQIISQGHHVQAFPIQVTQEEV-IVGEMFR-----WEPTSNDLITVAAIGAS 577

Query: 448  QVLLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDIS 507
              ++A     ++ L + D +L + +  +   E+SCL I+P        ++ AVG W   S
Sbjct: 578  LTVVAL-QREVLLLHVKDNLLVQSESIRFPNEVSCLAIDP------SQKVLAVGQWISNS 630

Query: 508  VRIFSLPDLNLITKEHLGGEIIPRSVLLCAF-EGIS-YLLCALGDGHLLNFLLN-MKTGE 564
            + + SL  +N+ T    G +    SVL   F EGI  +LL  + DG L++  L  ++ G 
Sbjct: 631  IALVSLDQMNVSTTIETGSDFGVHSVLFTNFGEGIDPHLLAGMDDGELISIKLQGLENGS 690

Query: 565  LT---DRKKVSLGTQPITLR-------TFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVN 614
            L     R+ V LG +PIT+        T        V+  S+ PT++   + +L Y  V 
Sbjct: 691  LVQEQSRRTVVLGHRPITMSRIVLDIPTDQQLGKPAVWIHSEHPTLMIEVDGRLKYRPVA 750

Query: 615  LKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPL-GEHPRRICHQEQSR 673
            L + +      +        +  EG + +G ID ++K+HI  IPL GE P+RI H +  R
Sbjct: 751  LGDATDAIKIGTRGVG---VVTSEG-IRLGRIDSLEKIHISKIPLGGEQPKRIAHSKSMR 806

Query: 674  TFAICSLK---NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNV 730
             + +  +    NQ   E  +   V+++DD TFE +  + L + E G SI +     D   
Sbjct: 807  AYGVVCVSQKVNQQTGELDKGSSVKIIDDVTFELLFDFQLLSIEQGTSIAAIELGKDLIE 866

Query: 731  YYCVGTAYVLPEENEPTKGRILVFIV-----EDGKLQLIAEK---ETKGAVYSLNAF-NG 781
             + +GT +V P E++   GRIL   +     ++G L+    K   + KG V+ L     G
Sbjct: 867  DFIIGTGFVNPNESQSNTGRILTIGLSSKHDQEGNLREFKLKRMTKVKGTVHGLGGLPGG 926

Query: 782  KLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLI 841
            K +A+ N  +  +     ++       +    G + +  V T  ++I+VGDL KSI +L 
Sbjct: 927  KFVASANAFVHAFGINEEEEDEGFEVLDTWGGGFV-SQTVLTEKNWIIVGDLYKSIVVLE 985

Query: 842  YKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTV--RKNSEGATDEER--- 896
            +  ++ +++   RDY+A  +  + ++ D +++ A+  FNLFTV  R+  +G  +E+    
Sbjct: 986  FDLKKFSLKVLGRDYSAMSVRPIGMISDRVFVAADTEFNLFTVEMRERQKGLKEEDEDEE 1045

Query: 897  -----------------------------GRLEVVGEYHLGEFVNRFRHGSLVMRLPD-- 925
                                           L+ VG +HLGE VN F+ GSLV  L    
Sbjct: 1046 GLSVEEEKGDDDEWEEEERRMRVEKVFNDDHLDTVGGFHLGENVNHFKAGSLVKSLKHFY 1105

Query: 926  -SDVGQIPTVIFGTVNGVIGVIASLPHEQ-YLFLEKLQTNLRKVIKGVGGLNHEQWRSFN 983
              D+     +IF +  G IGVI  L   + Y  L+ L+  L+K I  +GGL+  ++R F 
Sbjct: 1106 GQDLKYGGKLIFVSSTGGIGVIIKLEDLKIYKHLKALEDRLKKEILSIGGLDSTEFRKFK 1165

Query: 984  NEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN-VSVEELCKRVEELTRLH 1036
            N+ K VD   FLDGDLI+   + S  R   I +  + V ++E+   VEEL+  H
Sbjct: 1166 NKWKKVDGCEFLDGDLIQRLYNDSFKRKSLIKEDQDEVGIDEV---VEELSWFH 1216


>gi|403415203|emb|CCM01903.1| predicted protein [Fibroporia radiculosa]
          Length = 1267

 Score =  341 bits (874), Expect = 2e-90,   Method: Compositional matrix adjust.
 Identities = 278/967 (28%), Positives = 450/967 (46%), Gaps = 157/967 (16%)

Query: 205  AYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYI 264
            A+   D  G R+++GD  G L +L      +   GL +  LGETS A +++YL + V+Y+
Sbjct: 323  AWCPADDQGRRFIIGDTFGRLAMLAF----DDANGLLLIPLGETSSAVSLTYLSSQVLYL 378

Query: 265  GSSYGDSQLIKLNLQP----DA-------------------------------------- 282
            GS +G+SQL++++  P    DA                                      
Sbjct: 379  GSHFGESQLLRIHQSPFTERDAETLPIPDHVKSISAVSLPGGKGKERLDDSLEPSKKERG 438

Query: 283  ------KGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGI 336
                  KG +VEVL+ Y N+ PI+D  +VDL+  GQ Q++ CSG    G+L+I+R G   
Sbjct: 439  GQVVLGKGKFVEVLDSYENIAPIMDAVMVDLDGSGQPQIIACSGGRNSGALKIIRTGADF 498

Query: 337  NEQASVE-LQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEI--EGFC 393
             EQA +  ++ +  +W +RS  +D  D+ L+ + + ET + +++  + +   +    GF 
Sbjct: 499  QEQAVIRGIENVTDIWPIRSHYEDIIDSHLIATTLHETLVFSLDGRNAVTHMDPSDHGFI 558

Query: 394  SQTQTL----------------FCHDAIYNQLV-QVTSGSVRLVSSTSR-----ELRNEW 431
            +++ TL                     + + LV QV+   VR++   +       + + W
Sbjct: 559  TRSPTLAVGNIPRRATTQNGGRVVSSYVDSSLVLQVSQEKVRVIEHDAALGLFVPVGDGW 618

Query: 432  -KSPPGYSVNVATANASQVLLATGGGHLVYLEIGDG-ILTEVKHAQL------EYEISCL 483
              +  G ++  A  N+SQ +L   GG L+ L +G+   + E+K+           EIS +
Sbjct: 619  DATKEGRTIVAAAINSSQFVLGLSGGRLILLNLGENEQIQELKYRDFGDRTYGMLEISAV 678

Query: 484  DINPIGENPSYSQIAAVGMWTDISVRIFSLPD----LNLITK-------------EHLGG 526
              +P      Y+    V  W    + I SL      L  I +              + G 
Sbjct: 679  SCSPFDTTKKYATCIGVSFWGTNRIAILSLESATSYLTTICESDPLPSLPRSLLLHNFGQ 738

Query: 527  EIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSK 586
            +  P+       +   ++L  L DG L++F    K  +L D+K  +LG  P+ L   +  
Sbjct: 739  DHNPKGA-----DYHPHVLAGLVDGTLVSFSFREK--DLKDKKTFALGDVPVALAKSTVD 791

Query: 587  NTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTI 646
                +FA+  R +V+Y   ++L  S V +K+       +S+AFP  L +A    L IG +
Sbjct: 792  GKPAIFASGSRASVLYWDRQRLHQSPVMIKDAVRGASISSSAFPSCLILATRSTLMIGNV 851

Query: 647  DDIQKLHIRSIPLG-EHPRRICHQEQSRTFAICSLK---NQSCAEESEMHFVRLLDDQTF 702
              + K+ I S+ +G ++PRRI H    R FA+  ++   N+    +     ++LLDD TF
Sbjct: 852  RGVDKMQINSVRMGLDNPRRIAHHPTLRVFAVACIQTKPNRIGEPQETTSSLKLLDDTTF 911

Query: 703  EFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDG--- 759
              I+++  ++ E   S+L+ S SD S+  +CVGT    P E EP+ GRIL+F +  G   
Sbjct: 912  NRIASFACESDEEVTSVLTLSSSDVSSARFCVGTVQFKPGETEPSSGRILLFSLNTGPES 971

Query: 760  KLQLIAEKETKGAVYSLNAFNGKLLAAIN------------------QKIQLYKWMLRDD 801
              QL++     G VY L +  G + AA+N                   K+ L+K    ++
Sbjct: 972  SFQLVSSTPVSGCVYQLVSIQGMIAAAVNTSVRTAYVFIAFDPNMTIHKVILFKPEKLNN 1031

Query: 802  GTRELQ--SECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNAN 859
             T  L   SE  H+  +  L V   G  ++VGD + SIS +  K ++  +E  ARDY+  
Sbjct: 1032 STVVLTKVSEWNHNYSVTGLVVH--GCMLIVGDAISSISFV--KVDDTTLESIARDYSPL 1087

Query: 860  WMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSL 919
            W  +VE +D D  +GA ++ NLFT      G     R  LE  G Y+LG+ VN+F  GSL
Sbjct: 1088 WPVSVEAMDGDGVIGANSDCNLFTFALQRSG----HRSTLERNGSYYLGDMVNKFLRGSL 1143

Query: 920  VMRLPDSDVGQI-PTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQ 978
               +  S+   I P  +F T  G IGVI  +  +  L +  LQ N+ K I G GG++H  
Sbjct: 1144 T-NIDISERKSIEPKHLFFTSTGRIGVILEMNDKISLHMTGLQRNMGKRIIGPGGVHHAT 1202

Query: 979  WRSFNNEKKTVDAK---NFLDGDLIESFL------DLSRTRMDEISKTMNVSVEELCKRV 1029
            +R+  N K   DA+    FLDGD +E +L      DL +  MD    TM+ S  ++   +
Sbjct: 1203 FRAPANSKGHSDAEAAFGFLDGDFLEQYLGYADYTDLLQGDMDAEQVTMSHS--QIQDVL 1260

Query: 1030 EELTRLH 1036
            E+L  LH
Sbjct: 1261 EQLQSLH 1267



 Score = 47.0 bits (110), Expect = 0.051,   Method: Compositional matrix adjust.
 Identities = 41/179 (22%), Positives = 76/179 (42%), Gaps = 12/179 (6%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATER-----YKFCVLQWDAESSELITRAMGDVSDRIG 60
           I+GRI +L+        Q  L + T+          ++  D  ++ L+++   D+ DR  
Sbjct: 54  IWGRIVSLKAIPSKDSKQSNLLVLTDHPDPKLIPLTLVLDDTGNASLLSKETIDLHDRYA 113

Query: 61  RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
           RP +      +DP   +  +  Y G  KVI F N+ +     +I + EL +L +  LY  
Sbjct: 114 RPAEFVTDVYVDPTGEVAVVSCYTGRLKVIEFKNRERTGIPLDISIPELYILAMTLLYTT 173

Query: 121 AKP-TIVVLYQDNKDARHVKTYEVALKDKD------FVEGPWSQNNLDNGADLLIPVPP 172
               T+ +L+ D++    + + ++ L++              S N   +    LIPV P
Sbjct: 174 KDLYTLSILHYDHQGRLQLLSRDLDLRELSASPSTLLFSTILSSNVPSDSTPFLIPVLP 232


>gi|392591958|gb|EIW81285.1| hypothetical protein CONPUDRAFT_56293 [Coniophora puteana RWD-64-598
            SS2]
          Length = 1245

 Score =  339 bits (869), Expect = 6e-90,   Method: Compositional matrix adjust.
 Identities = 327/1158 (28%), Positives = 532/1158 (45%), Gaps = 185/1158 (15%)

Query: 48   ITRAMGDVSDRIGRPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLE 107
            IT+ +  +++R GR  +     ++DP  +L  +  Y G  KV+  D+ G+    F++   
Sbjct: 104  ITKKLS-LAERTGRIAEFFVDLLVDPTGQLAIVSTYAGKLKVVQLDD-GEYDSDFDVSTT 161

Query: 108  ELQVLDIKFLYGCAKP-TIVVLYQDNKDARHVKTYEVALKDKDFVEGPWS---------- 156
            EL +L +  +       T+ +L+ D +    + T +++LK+      P +          
Sbjct: 162  ELNLLSLATILPSPDDLTVALLHVDYQQRLQLLTRDLSLKELQLSPLPSAVLPSVPLAAK 221

Query: 157  QNNLDNGADLLIPVPPP-------LCGVLIIGEETI-VYCSANAFK-------------- 194
              ++D     LI VP           GVL+IG   I +Y  A A K              
Sbjct: 222  HFSVDEEVPKLIHVPISEKDEESFAGGVLVIGGSKIFLYDLAPAEKQRKQKEKQRRTQKK 281

Query: 195  -----------------AIPIRPSI------TKAYGRVDADGSRYLLGDHAGLLHLLVIT 231
                                 R S+        A+ +VD  G R+LLGD  G L +L + 
Sbjct: 282  KASADSDEVMKDEGKEKKKKARASVEWPWAEVSAWCQVDDVGRRFLLGDKFGRLVMLCV- 340

Query: 232  HEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQP----------- 280
             +    + L +  LGETS A+T++YL N VVY+GS YGDSQL++++              
Sbjct: 341  -DISSNSSLTLIALGETSPATTLTYLSNQVVYLGSHYGDSQLLQISPSRLSDSDTPTLPV 399

Query: 281  ----------------------------DAKGSYVEVLERYVNLGPIVDFCVVDLERQGQ 312
                                        + +GS++  ++R+ N+ PI D  +VD +  GQ
Sbjct: 400  PGDIHTVTHIPHSSPGSDYDRRHRGFIVNGRGSHLTEVDRFQNIAPIYDAALVDPDNSGQ 459

Query: 313  GQVVTCSGAYKDGSLRIVRNGIGINEQASVE-LQGIKGMWSLRSSTDDPFDTFLVVSFIS 371
             +VVTCSG    GS+R+VR+G    E A++  +  I  +W +RS   DP  T +V S   
Sbjct: 460  YEVVTCSGGQNTGSIRLVRSGADFQEAAAIGGIPNITNIWPVRSLYSDPIHTHIVASTPQ 519

Query: 372  ETRILAMNLEDELEETEIEGFCS-QTQTLFCHDA--------------IYNQLV-QVTS- 414
            ET +  ++ +  +      GF + + +TL   +               + + LV QVTS 
Sbjct: 520  ETYVFRIDNKHTVSHVADSGFVTGKKRTLVVQNLQKKVPKEAGQGLTYMDSSLVLQVTSD 579

Query: 415  -----------GSVRLVSSTSRELRNEWKSPPGY---SVNVATANASQVLLATGGGHLVY 460
                       G    V     ++       PG+    +  A+ANASQV+L    G L  
Sbjct: 580  ELVLLNLDKGLGEFTRVGDGVCKMGQLAGGKPGWIDREIVAASANASQVVLGLSFGRLAV 639

Query: 461  LEIGDGILTEVKHAQLE------YEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL- 513
            +   D     V +           EIS +   P  ++ S+S +AAV  W    V I SL 
Sbjct: 640  VNFADNKFKLVGYRDFTNPSGGIAEISAVSCTPADQSKSFSPMAAVAFWQTHRVEIISLG 699

Query: 514  -PDLNLITKEHLGGEIIPRSVLLCAFEGIS-----YLLCALGDGHLLNFLLNMKTGELTD 567
             P   L     L    +PRS+L+  F G S     +LL    DG +  F+L  K   L +
Sbjct: 700  SPFPTLCASVSLPS--LPRSLLMHRFAGESANAPPHLLVGRADGVVATFVLKDKA--LVE 755

Query: 568  RKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSA 627
            +K++ +G  P+T  T  +K  T VFA   R +V++    +L +S + LKEV+     ++ 
Sbjct: 756  QKQIPVGNLPVTFHTCKAKGRTAVFACGSRTSVLFWEKDRLRHSPLILKEVAAAASLHTH 815

Query: 628  AFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG-EHPRRICHQEQSRTFAICSLKNQSC- 685
             +  SL +A    L IG + +++KLHIRSI  G ++PRRI H    +  A+  +++    
Sbjct: 816  DYRSSLVLATSEGLVIGDVQNLEKLHIRSIHTGLDNPRRISHSPVHKALAVGCVRHTPVR 875

Query: 686  --AEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSI--LSCSFSDDSNVYYCVGTAYVLP 741
                E     V+L +D T + +    LD  E   +I  LS   ++++   + VGT  +  
Sbjct: 876  VGEPEISRGSVQLYNDTTLDKLGQVVLDHDEEPMAIKALSVRVAEEAKDCFVVGTVIIDS 935

Query: 742  EENEPTKGRILVFIVEDGK----LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWM 797
             ENE + GR+L+   +  +    + + A ++ KG VY++ A +G ++AA+N  + +Y  +
Sbjct: 936  LENESSSGRLLLVEPDYSRGESFVAVSASEKVKGCVYAVAAVDGLVVAAVNSAVVIYS-I 994

Query: 798  LRDDGTREL---QSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERAR 854
              DD TR L   +    +H +++A  V +RG+ ++VGD + S++LL Y  E GA++  AR
Sbjct: 995  EADDHTRALSFVKKVEWNHNYVVANLV-SRGNLLLVGDAISSVTLLQY--ERGALQNVAR 1051

Query: 855  DYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRF 914
            DY+  W ++VE+LD+   +GA+N+ NLF      +GA   ER  LE  G Y+ G+ VN+F
Sbjct: 1052 DYSPLWPTSVEMLDERNVIGADNDCNLFMFTLQ-DGA---ERKVLERNGHYYFGDMVNKF 1107

Query: 915  RHGSLVMRLPD---SDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKG- 970
              G +   L     SD+   P  +F T  G IGV+  +  E  L +  LQ NL       
Sbjct: 1108 IPGEIYRALSSFEASDIEVEPKQLFFTTTGSIGVVIDMSDELSLHMSSLQRNLSTYFAAQ 1167

Query: 971  VGGLNHEQWRSFNNEKKTVDAKN----FLDGDLIESFL------DLSRTRMDEISKTMNV 1020
             GG +H ++R+  N +   DA N    FLDGDL+E FL      +  R  ++  ++   +
Sbjct: 1168 PGGASHTKYRAPKNARGRSDADNSSFGFLDGDLLERFLLFGDDEEAVRKVLEGSTEAEQL 1227

Query: 1021 SV--EELCKRVEELTRLH 1036
            S+  E + K +E L  +H
Sbjct: 1228 SIAPERIIKVLERLQSMH 1245


>gi|170090007|ref|XP_001876226.1| predicted protein [Laccaria bicolor S238N-H82]
 gi|164649486|gb|EDR13728.1| predicted protein [Laccaria bicolor S238N-H82]
          Length = 1275

 Score =  337 bits (865), Expect = 1e-89,   Method: Compositional matrix adjust.
 Identities = 335/1235 (27%), Positives = 532/1235 (43%), Gaps = 214/1235 (17%)

Query: 4    VPIYGRIATLELFRPHGEAQD--FLFIA---TERYKFCVLQWDAESSELITRAMGDVSDR 58
            V I+G++ T++     G A+    L IA    E   F  +    ++  L       + +R
Sbjct: 53   VEIWGKVLTVKAIPQIGSARSTIVLMIAHPDPELVFFTYIDRQGDNPALEVSKKLSLFER 112

Query: 59   IGRPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLY 118
            I RP +     ++ P  +L  +  Y G  K+I     G   + ++++L E+ V  + FL 
Sbjct: 113  IARPAEFFNDVLVHPSGKLALVSCYSGKLKIIRLKC-GTYDKDYDVQLPEVNVFSVTFLS 171

Query: 119  GC-AKPTIVVLYQDNKDARHVKTYEVALKDKDFV----------EGPWSQNNLD---NGA 164
                +  + +LY D ++   +   ++ +   D              P S   L       
Sbjct: 172  SFNGEYAVAILYLDYQERVQLVARDILVNGDDITVSEEPSTLLHPTPISNKTLPFPTESV 231

Query: 165  DLLIPVPPP------------LCGVLIIGEETIVYCSANAFKA----------IPIRPS- 201
              L+ VPP             L GVL++G   I+    ++ K+          +  R S 
Sbjct: 232  PQLVSVPPEELDEDTDVDDAFLGGVLVVGGRQILLFELSSEKSQEKQRGKRKRLESRKSS 291

Query: 202  -----ITKAY-------GR--------------------VDADGSRYLLGDHAGLLHLLV 229
                 I KA        GR                    +D    RYL+GD  G L +L 
Sbjct: 292  KDVTEIAKAREKEKEREGRRRKPKSSIEWPFSELAAWCAIDKAPYRYLIGDSFGRLSILS 351

Query: 230  ITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIK-------------L 276
            +  +K +  GL +   GE S  ++++YL N +VY+GS  GDSQLI+             L
Sbjct: 352  L--DKVQRIGLVLIPFGEASSPTSLTYLTNQLVYVGSHSGDSQLIRLSPLPISSSESPTL 409

Query: 277  NLQPDAK--------------------------------------------GSYVEVLER 292
             + P+ K                                            GS++EVL  
Sbjct: 410  PIPPEIKTVSPNLLEALGHRKGKSKSVAEDHMDEDHDDEDDASQGYIVETHGSFIEVLSS 469

Query: 293  YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVE-LQGIKGMW 351
            Y NL PI+D  +VD +  GQ  +VTCSGA   GS+ IVRNG    E   V  L G+ G+W
Sbjct: 470  YKNLAPILDAILVDTDGSGQQHIVTCSGARSTGSINIVRNGADFQEIGHVPGLTGVVGVW 529

Query: 352  SLRSSTDDPFDTFLVVSFISETRILAMNLEDE------LEETEIEGFCSQTQTLFCHD-- 403
            S+R+  +D  D +++VS    T +  ++          ++ T I    +   TL   +  
Sbjct: 530  SVRTMLEDTTDRYILVSTNRSTHLFEIDDSGSTSTITPVDSTTIRSLVTTEATLAFSNLA 589

Query: 404  --------AIYNQ---LVQVTSGSVRLVSSTSR----ELRNEWKS----PPG---YSVNV 441
                    ++Y      VQV +   RL+ S +     EL  ++      P G     V  
Sbjct: 590  RRSSVGGSSVYKNSPLAVQVVASGARLLKSNTAFGGYELVAQYSMLSSVPYGQRPLEVVA 649

Query: 442  ATANASQVLLATGGGHLVYLEIGDGI--LTEVKHAQLEY--EISCLDINPIGENPSYSQI 497
            A+ANASQ+++A  GG L    +G+ +  L  +   Q +   EIS +   P+      S  
Sbjct: 650  ASANASQLVMAASGGKLTLWRLGENVDALEMIAGCQRKEGPEISAVSCAPLNTTKRTSPT 709

Query: 498  AAVGMWTDISVRIF--SLPDLNLITKEHLGGEIIPRSVLLCAFEGIS---------YLLC 546
              V  W   ++ I   SL  L  + K      ++ RSVLL  F   +         YLL 
Sbjct: 710  IIVSYWQSNTIEILQVSLKGLESVYKSPTLPALV-RSVLLYNFGSDTNPKGTDHHPYLLA 768

Query: 547  ALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNK 606
             L +G + +F    K  +LTD+K + LG  P+ L     +    VFAA +R TV+   NK
Sbjct: 769  GLANGTVASF--RWKDKQLTDKKIIPLGHAPVNLMPCQVEGRHAVFAAGNRATVLSFENK 826

Query: 607  KLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG-EHPRR 665
            +L++S + LK++S     N+  F  SL ++    L IG +  + KLHIRSIP G ++PR+
Sbjct: 827  RLVHSPIMLKDISSAARLNTPTFASSLILSTPTGLFIGRVQGLGKLHIRSIPFGFDNPRK 886

Query: 666  ICHQEQSRTFAICSL---KNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSC 722
            I H+   + F +       N+    E      +LLDD +F  +  +  D  E   +++S 
Sbjct: 887  IAHEPSIKAFGVAFTTMEPNRVNDPEISRSSFKLLDDTSFANLCQFNCDPDEETTAVVSF 946

Query: 723  S--FSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDG-----KLQLIAEKETKGAVYS 775
            S   +     ++CVGT      E EP+ GR+++F           L L+A  +  G VY+
Sbjct: 947  SQKIAGKPMPFFCVGTYVYKAGEVEPSAGRLMIFTATTSTSSNLALSLMASTKVPGCVYA 1006

Query: 776  LNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGH-HGHILALYVQTRGDFIVVGDLM 834
            L     +++AA+N  + L++     D       +    H + L   + +  D +VVGD  
Sbjct: 1007 LTVVQNQIVAAVNSSVMLFRLESSSDSLSPSLIKVSEWHHNYLVTSLGSYADRVVVGDQP 1066

Query: 835  KSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDE 894
             SISLL  +  +  +  +ARDY   W   VE LD+   +GA ++ NLFT   + E A   
Sbjct: 1067 SSISLL--QVTQSKLISQARDYGPLWPVCVEALDERHIIGANDSLNLFTF--SLEKAMG- 1121

Query: 895  ERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQY 954
             R RLE  G YH+ + V +F  GSL      +        +F T +G IGV+  +  E+ 
Sbjct: 1122 -RSRLERDGCYHVADLVTKFLRGSLSSSDASTTSPLTSEAMFFTSSGRIGVVVDVKDEEL 1180

Query: 955  -LFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKN----FLDGDLIESFLD--LS 1007
             L L  +Q NL  VI+GVGG +H ++R+    + T DA +    FLDGD +E FL   LS
Sbjct: 1181 SLQLTNMQRNLANVIQGVGGSSHSKYRAPKTTRGTSDADSGAIGFLDGDFLEQFLTHVLS 1240

Query: 1008 RTRMDEI------SKTMNVSVEELCKRVEELTRLH 1036
              + +++       + + ++ E L   +E+L  LH
Sbjct: 1241 PQQSEKVIQGQSPPERLTITREALQMVIEDLQSLH 1275


>gi|166158025|ref|NP_001107422.1| damage-specific DNA binding protein 1, 127kDa [Xenopus (Silurana)
            tropicalis]
 gi|157422734|gb|AAI53474.1| Zgc:63840 protein [Danio rerio]
 gi|163916541|gb|AAI57552.1| LOC100135265 protein [Xenopus (Silurana) tropicalis]
          Length = 306

 Score =  337 bits (864), Expect = 2e-89,   Method: Compositional matrix adjust.
 Identities = 172/309 (55%), Positives = 220/309 (71%), Gaps = 15/309 (4%)

Query: 739  VLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWML 798
            V PEE EP +GRI+VF   DGKLQ +AEKE KGAVYS+  FNGKLLA+IN  ++LY+W  
Sbjct: 2    VCPEEAEPKQGRIIVFHYTDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRLYEWT- 60

Query: 799  RDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNA 858
                 +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK  EG+ EE ARD+N 
Sbjct: 61   ---AEKELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGSFEEIARDFNP 117

Query: 859  NWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGS 918
            NWMSAVEILDDD +LGAEN FNLF  +K+S   TDEER  L+ VG +HLGEFVN F HGS
Sbjct: 118  NWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVFSHGS 177

Query: 919  LVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHE 977
            LV++ L +S      +V+FGTVNG+IG++ SL    Y  L  LQ  L KVIK VG + H 
Sbjct: 178  LVLQNLGESSTPTQGSVLFGTVNGMIGLVTSLSEGWYSLLLDLQNRLNKVIKSVGKIEHS 237

Query: 978  QWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV----------SVEELCK 1027
             WRSF+ E+KT  A  F+DGDLIESFLDL + +M E+  T+ +          +V+E+ K
Sbjct: 238  FWRSFHTERKTEQATGFIDGDLIESFLDLGQAKMQEVVSTLQIDDGSGMKREATVDEVIK 297

Query: 1028 RVEELTRLH 1036
             VEELTR+H
Sbjct: 298  IVEELTRIH 306


>gi|392566425|gb|EIW59601.1| hypothetical protein TRAVEDRAFT_167065 [Trametes versicolor FP-101664
            SS1]
          Length = 1263

 Score =  337 bits (864), Expect = 2e-89,   Method: Compositional matrix adjust.
 Identities = 272/916 (29%), Positives = 418/916 (45%), Gaps = 133/916 (14%)

Query: 205  AYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYI 264
            A+  VDA   R LLGD  G L +L   ++    T L    LGETS A++++YL + V+Y+
Sbjct: 329  AWSAVDARCRRVLLGDAFGRLAMLAFNNDAVAFTLLP---LGETSPATSLTYLSSQVLYV 385

Query: 265  GSSYGDSQLIKLNLQPDA------------------------------------------ 282
            GS +GDSQ ++++  P A                                          
Sbjct: 386  GSHFGDSQQLRIHPAPIANASVDTLPIPKGVSTVSPSALASSPSKGKGRSMDLDDGLVRE 445

Query: 283  ------KGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGI 336
                  KG+Y+EVL+ + N+ PI+D  + D++  GQ QV+T SGA   GSLR++R     
Sbjct: 446  GRVIQTKGTYLEVLQTHDNVAPIMDAVLADIDGSGQPQVITASGARNTGSLRVIRTEADF 505

Query: 337  NEQASVE-LQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETE--IEGFC 393
             EQA ++ L GI  +W +R   +D   T LVVS   ET +L+   +D +   E  + GF 
Sbjct: 506  QEQAKLDGLPGITDIWPVRPRFNDATHTHLVVSTSRETLVLSFAGQDTITHVEPSVAGFA 565

Query: 394  SQTQTLFCHDAIYNQ-----------------LVQVTSGSVRLVS-----STSRELRNEW 431
            +   T    +    Q                 + Q+TS  VRLV      +        W
Sbjct: 566  THVPTYAIGNVPRRQTTTSGGRTTSSYVDSPLVAQITSEGVRLVEYDPTLNGFTPFGAGW 625

Query: 432  -------KSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTEVKHAQLEYEISCL 483
                    S  G  +  A  + SQ ++   GG L  L +G +  L  VK      EI  +
Sbjct: 626  YPKKSGDASMAGRDIVAAAMSPSQFVVGLSGGRLALLNLGANDALQVVKTRDFPDEICAI 685

Query: 484  DINPIGENPSYSQIAAVGMWTDISVRIFSL--PDLNLITKEHLGGEIIPRSVLLCAFEGI 541
               P     +Y+   AV  W+   + + SL  P   L     +    +PRS+LL  F   
Sbjct: 686  SCTPFDPTKNYATHIAVSFWSSNKIAVLSLDSPASYLGPICEVSLPTLPRSILLHNFGAG 745

Query: 542  S---------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
            +         +LL  L DG+++ F L  + GEL D K  SLG  P +L   +    T V 
Sbjct: 746  ARPKDADFRPHLLAGLADGNVMTFAL--RDGELHDSKASSLGNAPASLSLCAVDGRTVVL 803

Query: 593  AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
            A   R  V+Y   +++  S V +K ++     N+AAFP  LAIA    L IG +  + K+
Sbjct: 804  AGGARSNVLYWDRQRIRPSPVGVKYMARGATLNTAAFPSCLAIATSSALLIGNVRGVDKM 863

Query: 653  HIRSIPLG-EHPRRICHQEQSRTFAICSLKNQSCA---EESEMHFVRLLDDQTFEFISTY 708
             IR+IPLG ++PR++ +    + F +   +         E      +LLD  +F+ +  +
Sbjct: 864  QIRTIPLGMDNPRKMAYHAGQQVFGVACARTAPGRVGDNEDITGSFKLLDAHSFQQLHHF 923

Query: 709  PLDTFEYGCSILSCSFSDD-SNVYYCVGTAYVLPEENEPTKGRILVFIV--EDG--KLQL 763
              D  E   S+L+       S+  +C+GTA + PEE EP+ GRIL+F +  E+G   L  
Sbjct: 924  QCDPDEEPASVLALPGEGSASSPAFCLGTAVIRPEEREPSNGRILLFSLSSENGVRSLTT 983

Query: 764  IAEKETKGAVYSLNAFN-GKLLAAINQKIQLYKWMLRD-------DGTRELQSECGHHGH 815
            +A  + +G VY+L   + G + AAIN  + LYK  +R+       D   +  +E  H+  
Sbjct: 984  VASHKVRGCVYALQHVSEGVIAAAINTSVLLYK--IREGNLGEGFDRVLDKAAEWNHNHF 1041

Query: 816  ILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGA 875
            + +L     G F++VGD + S+S+L    +   +E  ARDY   W  A+E   +   +GA
Sbjct: 1042 VTSLVWD--GQFLLVGDAISSVSVLRVADDATKLESVARDYAPLWPVAIESTGNGGVIGA 1099

Query: 876  ENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTV- 934
             ++ NLF+           +R  LE  G YH+ + VN+   G+    L  +DV Q   V 
Sbjct: 1100 NSDCNLFSFALQR----GPQRNGLEKNGVYHIDDVVNKLIKGA----LSSADVSQDQAVK 1151

Query: 935  ---IFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDA 991
               +F T  G IG I  +     L +  LQ N+ K + G GG+NH + R+    +   DA
Sbjct: 1152 AGHVFFTSTGRIGAILDMNDTMSLHMTALQRNMAKSLIGPGGVNHTKRRAPATPRGHTDA 1211

Query: 992  K---NFLDGDLIESFL 1004
            +    FLDGD +E+FL
Sbjct: 1212 EASYGFLDGDFLETFL 1227



 Score = 44.3 bits (103), Expect = 0.41,   Method: Compositional matrix adjust.
 Identities = 35/148 (23%), Positives = 72/148 (48%), Gaps = 7/148 (4%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATE--RYKFCVLQWDAES--SELITRAMGDVSDRIGR 61
           ++GRI  L+      + Q  L + T+    K  ++ +  E   + L +     + D+  R
Sbjct: 56  MWGRIVGLQAVPARTQGQSNLLVMTDLPDAKLVLMTYSTEGGVAALTSGTSHSLMDQAAR 115

Query: 62  PTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGCA 121
           P +      +DP  +++    Y G  KV+  ++ G+L +AF++ L E+ +L   FLY   
Sbjct: 116 PAEFVTDFQVDPTGQVVIASCYAGKLKVVRIED-GELGQAFDVSLPEINLLAFTFLYTDD 174

Query: 122 KP--TIVVLYQDNKDARHVKTYEVALKD 147
           +   T+ +L+ ++K    + + +V L+D
Sbjct: 175 EDRFTLALLHLNHKRQIQLLSRDVLLED 202


>gi|336369683|gb|EGN98024.1| hypothetical protein SERLA73DRAFT_109335 [Serpula lacrymans var.
            lacrymans S7.3]
 gi|336382464|gb|EGO23614.1| hypothetical protein SERLADRAFT_449959 [Serpula lacrymans var.
            lacrymans S7.9]
          Length = 1257

 Score =  336 bits (862), Expect = 3e-89,   Method: Compositional matrix adjust.
 Identities = 303/1150 (26%), Positives = 511/1150 (44%), Gaps = 200/1150 (17%)

Query: 70   IIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGCAKP---TIV 126
            ++DP  RL  +  Y G  K++    +G     F++ + EL +L ++F+   ++P   T+ 
Sbjct: 125  LVDPTGRLAVVSPYTGKLKIVLITPEGTYDTDFDVSIAELNLLALEFI--SSEPDLYTLA 182

Query: 127  VLYQDNKDARHVKTYEVALKDKDFV--------EGPWSQNNLDNG--ADLLIPVPPPL-- 174
            +++ D+     + + +++L +              P S  +   G     L+ +PP +  
Sbjct: 183  IMHIDHLKRLQLLSRDISLDEYQLSPTPSPALPSTPLSAKSFPIGDKPPSLVYIPPFIDD 242

Query: 175  -----CGVLIIGEETIVYCSANA------------------------------------F 193
                  G+L+IG   I++   ++                                    +
Sbjct: 243  DEDFIGGMLVIGGTKILFYDTSSRESQGNRKQKQRRMDKGKDSTVKELAKAKQKEDEREW 302

Query: 194  KAIPIRPSITKAYGRV-------DADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLG 246
            +    R S+   +  +       D  G  +L+GD  G L +L I   +   T L +  LG
Sbjct: 303  RKKKARASVDWPWSEITAWCAVGDELGRSFLIGDKYGRLAMLSIGINEG--TTLTLVALG 360

Query: 247  ETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQP-------------------------- 280
            ETS A+T++YL + V+Y+GS +GDSQL+++N  P                          
Sbjct: 361  ETSSATTLTYLTSQVLYVGSHFGDSQLLRINTTPVQSLESPTLPVPSDINTTTSSSLAAK 420

Query: 281  ---------------DAKGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDG 325
                           + +GS++ V+E Y N+ PI+D  +VD++  GQ  VVTCSG    G
Sbjct: 421  GKMEESSDRSGGCIINGRGSFLSVIESYKNIAPIIDAALVDVDNSGQHAVVTCSGGQNTG 480

Query: 326  SLRIVRNGIGINEQASVE-LQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAM---NLE 381
            S+ IVRNG      A++E +  +  +W LR++  D  DT + VS    T IL+    N+ 
Sbjct: 481  SISIVRNGADFKAMANMEGIVDVTNIWPLRANYYDTVDTHVAVSTFKATHILSFDGRNVA 540

Query: 382  DELEETEIEGFCSQTQTLFCHDAI---------------YNQLVQVTSGSVRLVSSTSR- 425
              +     +GF + + TL   + +                + +VQ+TS  + L+      
Sbjct: 541  SHVNPVS-KGFITTSPTLVIANVLGRPKAPGQATNSYVDSSLVVQITSRGIALLEYAVEL 599

Query: 426  ----ELRNEWKSP-------PGYS---VNVATANASQVLLATGGGHLVYLEIGDGILTEV 471
                 + + W          PG+    +  A+AN SQ +LA   G +V L + D    + 
Sbjct: 600  GEYARVGDIWTPAKLSSTNGPGWENREIVAASANGSQFVLALNSGRIVLLNLDDRNCFDC 659

Query: 472  KH-------AQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL---PDLNLITK 521
             +       +    EIS +   P+  +  +S    V  W+   + I S         I K
Sbjct: 660  LNWRDLDGSSNYPTEISAISCVPLNPSKKFSMYIVVSFWSSNHIEILSANAASHFASICK 719

Query: 522  EHLGGEI--------IPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSL 573
              +   +          RS +    E   YL+  LGDG ++++    K  EL  +K  S+
Sbjct: 720  TPVLPSLPRSLLLFNFTRSDIPKQTEYNQYLIVGLGDGSVVSY--EFKKEELHGQKIFSV 777

Query: 574  GTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSL 633
            G  P+TL     +    VFA   R ++++   ++L +S + L EVS +   N+  F  S+
Sbjct: 778  GNVPVTLHPCDVEGRRTVFACGSRTSILFWDKERLHHSPLMLNEVSAVSRLNTTVFDSSV 837

Query: 634  AIAKEGELTIGTIDDIQKLHIRSIPLG-EHPRRICHQEQSRTFAI-CSLKNQSCAEESEM 691
             +A    L IG++ ++ KL+IRSIPLG ++PRRI H    R +A+ C     +  +++E 
Sbjct: 838  ILATSTGLVIGSVKNLDKLYIRSIPLGYDNPRRILHVPSLRAYAVGCISITPTRIDKAEY 897

Query: 692  --HFVRLLDDQTFEFISTYPLDTFE--YGCSILSCSFSDDSNVYYCVGTAYVLPEENEPT 747
                ++L D  TF  +  +  D  E      ++S +       Y C GT Y   +E EP+
Sbjct: 898  TSSSMQLFDHTTFARLGQFRCDPNEEITALGVVSVTLERSIGTYICAGT-YKYVDEVEPS 956

Query: 748  KGRILVFIVEDG-----KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDG 802
            +GR+LVF  EDG     K+ +    E +G VY++ + NG ++AAIN  + +Y+  + D  
Sbjct: 957  QGRLLVFDAEDGSLLREKITMAVSLEVRGCVYAVGSVNGMIIAAINSSVVVYRPEI-DAS 1015

Query: 803  TRELQ----SECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNA 858
            T+ L     +E  H  + L   +  RGD I+VGD + SIS L  +  E  I+  ARDY +
Sbjct: 1016 TQLLALHKITEWNH--NYLVTNLVCRGDKILVGDAINSISFL--RMVESQIQCLARDYGS 1071

Query: 859  NWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGS 918
             W   VE+LD    +GA +++NLFT          E R  LE  G Y++G+ VN+F  G+
Sbjct: 1072 LWPVCVEMLDQSSIIGANSDYNLFTFALQET----ELRKSLERDGSYYIGDMVNKFIPGA 1127

Query: 919  LVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQ 978
            L       D+   P  +F T  G IGVI  +  E  L +  LQ N+   +    G+ H +
Sbjct: 1128 LTAHDVSVDMPLEPKQLFFTSTGCIGVIVDMGDELSLHMTALQRNMSTYLSQTKGVTHTK 1187

Query: 979  WRSFNNEKKTVDAK----NFLDGDLIESFLDL------SRTRMDEI--SKTMNVSVEELC 1026
            +R+  N     DA+     FLDGD +E F+         ++ MD    ++ + +SV+ + 
Sbjct: 1188 FRAPKNAYGRSDAEATSFGFLDGDFLEKFMQFLNDPGPLKSIMDGQGEAERLTISVDRIH 1247

Query: 1027 KRVEELTRLH 1036
            + +E L  +H
Sbjct: 1248 RVLERLQSMH 1257


>gi|302423344|ref|XP_003009502.1| DNA damage-binding protein 1b [Verticillium albo-atrum VaMs.102]
 gi|261352648|gb|EEY15076.1| DNA damage-binding protein 1b [Verticillium albo-atrum VaMs.102]
          Length = 1119

 Score =  328 bits (842), Expect = 7e-87,   Method: Compositional matrix adjust.
 Identities = 284/1119 (25%), Positives = 504/1119 (45%), Gaps = 141/1119 (12%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPTD 64
            + G I  L+  RP     D LF+ T+ + +   +WD    +L+T + + D+++   R   
Sbjct: 54   VCGTITLLQTLRPKASETDILFVGTDLFHYFTARWDPTQGKLVTEQVIQDIAEPHMREAQ 113

Query: 65   NGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQL-----KEAFNIRLEELQVLDIKFL-Y 118
            +    ++DP  R + +HL++G+  ++    +        K+   +RL EL +    F+  
Sbjct: 114  SQDKCLVDPAGRFVAMHLWEGVMNIMQLGTRKGANLRLDKDWAQVRLSELFMKASTFVPT 173

Query: 119  GCAKPTIVVLYQ---DNKDARHVKTYEV------------ALKDKDF-VEGPWSQNNLDN 162
                PTI  LYQ   D +DA H+  Y +             LK+++  +E P      D 
Sbjct: 174  ETGHPTIAFLYQNSIDREDA-HLAVYRLMEDGNTNVSKFDPLKNRELELEIP------DP 226

Query: 163  GADLLIPVP------------------PPLCGVLIIGEETIVYCSANAFKAIPI-----R 199
             A  LIPV                     L G+++ GE  ++Y   +    + I      
Sbjct: 227  FARTLIPVSIVESDVKRYHRRDTTNASAQLGGLIVAGETMLIYV--DTLTKVKISKALDE 284

Query: 200  PSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDN 259
            P I  ++ + D   +RYLL D  G LHLL +  +   VTGL ++ +G+TS AS + Y+ N
Sbjct: 285  PRIFVSWAKYDV--TRYLLADDYGNLHLLTLEVDGVIVTGLSLKTIGKTSRASCLVYMGN 342

Query: 260  AVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLER---------- 309
             ++++GS +GDSQL  L+L        + +++   N+ PI+DF ++DL            
Sbjct: 343  EILFLGSHHGDSQLFTLDLC----AQTIRLIQTIPNIAPILDFSIMDLGNAGDSGVGNAF 398

Query: 310  -QGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVE-LQGIKGMWSLRSSTDDPFDTFLVV 367
              GQ ++V   G + +GSLR +R+ +G+ +   ++ +Q ++G++SL+S   +  DT LVV
Sbjct: 399  SSGQARIVAGCGVHHNGSLRSIRSSVGLEDIGILDGIQDVRGLFSLKSYGSEKVDT-LVV 457

Query: 368  SFISETRILAMNLEDELEE-TEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRE 426
            SFI+ETR+   +    +EE  + +G      TLF         +Q+T+ S  L+ S S  
Sbjct: 458  SFITETRVFKFDAYGSVEELADFQGLTLDQPTLFAGSLANGHTLQITASSALLLDSESSV 517

Query: 427  LRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDIN 486
              N W S  G S+  A+ N      +TG  +                    +  +     
Sbjct: 518  TINSWTSSNGGSIVNASVNEKA---STGRRY--------------------WGATAARSR 554

Query: 487  PIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEGIS-YLL 545
                  ++  I A+G W+  SV      +    T     G+   R+     F GIS +  
Sbjct: 555  AFTHRETFPDIGAIGFWSTGSVCGHQPRN----TASTAWGKASNRTTTTSLFLGISLWSS 610

Query: 546  CALGDGHLLNFLLNMKTGEL---TDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIY 602
            C         +   ++T +    T +K   L  +    R  + +         DR  +  
Sbjct: 611  CTRLSSQAPRYSFLLRTAKWFQSTSQKMTILSRRGKVSRWGAGRQA--CMCCRDRVPL-- 666

Query: 603  SSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEH 662
                +++YS    ++V+H+ PF+S  FPD++ +A +  + I  +D  ++ H+  + +GE 
Sbjct: 667  ---GRIIYSAATAEDVTHIAPFDSEGFPDAIFLATDKNVRIANVDTERRTHVNPLHIGET 723

Query: 663  PRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPL-----DTFEYGC 717
             RR+ +    + F I +++ +   +E  +     L D+        P       + E   
Sbjct: 724  VRRVAYSPALKAFGIGTIRKELLHDEEIVSSAFQLVDEIVLGKVGRPFALGGEASVELVE 783

Query: 718  SILSCSFSDDSNV---YYCVGTAYVL-PEENEP--TKGRILVFIVE-DGKLQLIAEKETK 770
            +++     D +      + +GT+Y+  P+ N+    KGRILV  V+ D    LI   E K
Sbjct: 784  AVIRAELDDSTGQPAERFIIGTSYLADPDVNDSGDVKGRILVLGVDSDRNPYLIVSHELK 843

Query: 771  GAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVV 830
            GA   L     KL+A +++ + +Y ++     + +L+           + +   G+ I V
Sbjct: 844  GACRCLGVMGDKLVAGLSKTVVVYDYIEDSTTSGKLEKLTTFRPSTFPVDLDISGNMIGV 903

Query: 831  GDLMKSISLLIY----KHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRK 886
             DLM+S++L+ +       +  + ERAR +   W +AV  L ++ +L A+   NL  + +
Sbjct: 904  ADLMQSMTLVEFVPAKDGRKAKLIERARHFEYIWATAVCSLGEESWLEADAQGNLMILER 963

Query: 887  NSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVI 946
              +  T+ ++ ++    E HLGE VN+ R   L +   +SD+  +P     TV+G + V+
Sbjct: 964  QPDAPTEHDQKQMRTTSEMHLGEQVNKIR--PLQITATESDI-IVPKAFLATVDGSLYVL 1020

Query: 947  ASLPHEQYLFLEKLQTNLRKVIKGVG--------GLNHEQWRSFNNEKKTVDAK-NFLDG 997
            A++  E    L   Q  L  +++ +G        G +  QWR F N K+   A   F+DG
Sbjct: 1021 ANISAEYQSILLPFQERLAGIVRYLGQAAPEDNEGPSFSQWRGFRNAKRMAGAPFRFVDG 1080

Query: 998  DLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRLH 1036
            +LIE FLDL   R + + + +  SVE +   VEEL R+H
Sbjct: 1081 ELIERFLDLDELRQEAVVEGLGPSVEAMRNMVEELRRMH 1119


>gi|213407660|ref|XP_002174601.1| damaged DNA binding protein Ddb1 [Schizosaccharomyces japonicus
            yFS275]
 gi|212002648|gb|EEB08308.1| damaged DNA binding protein Ddb1 [Schizosaccharomyces japonicus
            yFS275]
          Length = 1078

 Score =  328 bits (841), Expect = 1e-86,   Method: Compositional matrix adjust.
 Identities = 274/1073 (25%), Positives = 501/1073 (46%), Gaps = 90/1073 (8%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITR--AMGDVSDRIGRPT 63
            ++ ++ +++ FRP    +D L +    + +  + WD +  + I    A    S       
Sbjct: 54   VFSKVRSIKSFRPPLSTKDHLIVTVSPFSYFTVTWDPDKKQFINEVTAQPPASTFFLNKQ 113

Query: 64   DNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNK-----------GQLKEAFNIRLEELQVL 112
             N ++ +IDP  R+I  H + GL   IP   +             +++ F +R+ EL VL
Sbjct: 114  RNPKL-LIDPSNRVICSHAFQGLLTFIPITQRITKKRQKDVPSQSVQDNFTVRIMELDVL 172

Query: 113  DIKFLYGCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEG-PWSQNNLDNGADLLIPVP 171
            DI FLY    PT+ VLY+D++   H+K Y V L +K+  E   ++ ++++ G      + 
Sbjct: 173  DIVFLYTSKTPTLAVLYKDSRSVIHLKAYTVNLAEKELNEDESFACHDIEEGR----LIA 228

Query: 172  PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYG----------RVDADGSRYLLGDH 221
                GV + GE  + Y S     +  +      AY            V  +  +Y++ D 
Sbjct: 229  HYDGGVFVFGEVYVYYVSREKKSSKMLVSYPVTAYSAVFQSFQNGTEVCINERKYIVSDE 288

Query: 222  AGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPD 281
            +G +++  I +       + +E +GE  IAS I  L N  +++GS +GDS+L +   +P 
Sbjct: 289  SGAVYMFNIGNN----FNVTLEKIGEAPIASCIVALPNDQIFLGSHFGDSKLYRFQNKPG 344

Query: 282  AKGSYVEVLE--RYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQ 339
              GS VE+LE   + NL PI DFC+      G   +V C  AY +GSLR +++GI I++ 
Sbjct: 345  VAGS-VELLEVQSFDNLAPISDFCID--HGNGGSFMVACCNAYNEGSLREMKSGIDISDY 401

Query: 340  ASVELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTL 399
              +E+  +  ++S++  +      +L V  ISET +  ++   E++   ++  C +  T+
Sbjct: 402  GVIEMPNVCSLYSVQLQSS--VTKYLFVGSISETSVFEISQSGEMDL--VDSLCLEEPTI 457

Query: 400  FC-HDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHL 458
            F    +  + L  V   SV L     ++L + W +  G ++  A    + V L+     +
Sbjct: 458  FVGATSDSSCLYHVARSSVCLFDG--KQL-SYWFADGG-AITCAAVYDNAVCLSMTNNQI 513

Query: 459  VYLEIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNL 518
            V+L      L  V   Q++ +++ L         + ++   VG+W    + +    +   
Sbjct: 514  VFLHK----LQVVSKLQVDSDVTALSF------LNDARTICVGLWNQ-KLLLLDSDNTTC 562

Query: 519  ITKEHLGGEIIPRSVLLCAFEGIS--YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQ 576
              +E L      +S++     GI+   L  +   G L+ F ++  T  +  R    L   
Sbjct: 563  TVRETLEVNEKAQSLVCVKLAGIASVALYMSTESGKLITFDIDPVTKGIKQRLTYPLTAV 622

Query: 577  PITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIA 636
            P+ + TF + N   +    + P  +Y  N+ L ++ +   ++  +        P +   A
Sbjct: 623  PLNMNTFQTPNGPVLITLGEHPYAVYGENQHLSFAYIGNSDIVCLSHLQHPGIPANTVYA 682

Query: 637  KEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESE-MHFVR 695
                L +  ++ +QKLHIR IP+   PRRI   ++        L+++  ++ S  M  + 
Sbjct: 683  TRNALKLSNVNMLQKLHIRRIPVAGIPRRIAATKEHYFVLSVDLQDKLASQGSSIMSSLH 742

Query: 696  LLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVF- 754
            +    T+E +  +  + +E    I+ C+ +   N    VGT +  P+ +EP  GR++VF 
Sbjct: 743  VFKKLTYETVLQHEFEDYE----IVECALTLPDNERVVVGTGFNYPDRDEPDGGRLIVFR 798

Query: 755  IVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKW---MLRDDGTRELQSECG 811
            + E  KL   A  +T+GA++S+    GKLL  +N  +  +++    LR  G+    + C 
Sbjct: 799  LDEQEKLVTEAVYKTQGAIFSVEYQEGKLLVGMNAVLCTFRYENKTLRVVGSTRTPTYC- 857

Query: 812  HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI 871
                 L +   ++ D +VVGD+MKS++L  Y  E+   EE ARD+ A W+++V+ L + +
Sbjct: 858  -----LNIAASSK-DIVVVGDMMKSLTL--YNTEKDTAEEVARDFGALWVTSVQPLSETL 909

Query: 872  YLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQI 931
            +     +    T+  +++     ER +L     Y LG+ VNR R G  V+  P   V   
Sbjct: 910  FFCTTADGEAVTMLWDTKAPQSVERKKLRWKSCYRLGDMVNRTRRGCFVLSSPSRLVK-- 967

Query: 932  PTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQW--RSFNNEKKTV 989
            P ++  TV G I +I        L L+ +Q N  + +  +GGL+  +W  R F   + + 
Sbjct: 968  PELMCVTVEGGILLIGDASQHADLLLQ-IQHNFLEAVPPLGGLDFYKWHERLF-PARASA 1025

Query: 990  DAKNFLDGDLIESFLDLSRTRMDEI------SKTMNVSVEELCKRVEELTRLH 1036
              K+F+DGDL+ES  DL  + + +I       +++N+SV +L   + +L RLH
Sbjct: 1026 ANKDFIDGDLLESIEDLPESTLQKIVQGTNGGQSLNISVPDLLGIISDLKRLH 1078


>gi|389744702|gb|EIM85884.1| hypothetical protein STEHIDRAFT_121882 [Stereum hirsutum FP-91666
            SS1]
          Length = 1255

 Score =  327 bits (839), Expect = 1e-86,   Method: Compositional matrix adjust.
 Identities = 313/1201 (26%), Positives = 522/1201 (43%), Gaps = 208/1201 (17%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERY-KFCVL--QWDAESSELITRAMGDVSDRIGRP 62
            +YGRI+ +       +  + L +      K  VL  +W +    L+T     + ++I RP
Sbjct: 54   LYGRISVIRTITRKDDPDNILLMTDHPAPKLVVLSPKWISNRLSLVTVQTESLYEQIARP 113

Query: 63   TDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGCAK 122
            ++      +DP   +   H Y G  K     N G L++  ++ + EL +L +  +    +
Sbjct: 114  SEFVHNVFLDPYGYVGLAHCYTGKAKAFTI-NDGLLEDFCDVLVPELNILSLTLVSSPEQ 172

Query: 123  P-------TIVVLYQDNKDARHVKTYEVALKDKDFVEGP---WSQNNLDNGADLLIPVP- 171
                     + +L+ D++    +   ++ + + +    P   ++   + +    + PVP 
Sbjct: 173  AIDTPDPSALAILHLDHRQNIQLLLRDLNISEYELSPSPSLMFTPATISSKVFEMAPVPE 232

Query: 172  --------PPL---------------CGVLIIGEETIVY--------------------- 187
                    PPL                G+L+IG + ++                      
Sbjct: 233  NPPIIVNIPPLQLPNPEAMEGTSDFNGGLLVIGGKKLLLFELTSKEFQRKHHKRLRKQES 292

Query: 188  ----CSANAFKAIP---------IRP--SITKAYGRVDA------DGSRYLLGDHAGLLH 226
                   +  KA P          +P  S+   +  V A      +G+RYL+GD  G L 
Sbjct: 293  RRSTSGKDKAKAEPKEEKEGKSRSKPIASVAWPWSEVTACCQANDEGTRYLVGDAYGRLA 352

Query: 227  LLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQP------ 280
            +L +    ++  GL +  LGETS A+T++YLD+  +Y+GS +GDSQLIK++ QP      
Sbjct: 353  MLSLDSLYDQ--GLILIPLGETSPATTLTYLDSQFLYVGSHFGDSQLIKIHRQPYGYTNT 410

Query: 281  --------------------------------------DAKGSYVEVLERYVNLGPIVDF 302
                                                    KG Y+EV+  + N+ PI+D 
Sbjct: 411  DTLPLLGILPLEPTNLASSSGKGKAKAAAGRKSDGRILGMKGKYLEVVHSWRNIAPILDA 470

Query: 303  CVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVE-LQGIKGMWSLRSSTDDPF 361
               D +  G   +VTCSG    GSLR++RNG    E+A V+ L  + G+W+LRS   D  
Sbjct: 471  VKADTDGSGLPHIVTCSGGKSTGSLRVIRNGADFQEKAVVDGLDNMTGLWALRSHNSDKD 530

Query: 362  DTFLVVSFISETRILAMNLEDELEETEIEG---FCSQTQTLFCH---DAIYNQLVQVTSG 415
             ++LV S   ET +      + L   E      F     TL      D   + ++QVTS 
Sbjct: 531  HSYLVASDKDETHLFRFEGPESLRRVESSSESYFRMNKPTLAVANVGDLDSSLILQVTSD 590

Query: 416  SVRLVSSTSR----ELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDG----- 466
             V+L+   +      L + W    G  + +A+ + +QVLLA  GG +V L +        
Sbjct: 591  KVQLLEFDAVLEIFHLSSHWSPESGKQITLASISPTQVLLALDGGTVVLLGLDKSRNMTV 650

Query: 467  ILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL----PDLNLITKE 522
            + T + +A++   +SC   N +G         AV  W    + + SL    P L+ +T  
Sbjct: 651  LSTRLYNAEIS-ALSCALPNALG---------AVSFWDSDRIELVSLGGSTPLLSTVTSR 700

Query: 523  HLGGEIIPRSVLLCAFEGIS---YLLCALGDGHLLNFLL---NMKTGELTDRKKVSLGTQ 576
             +G     RS+LL  F+  S    LL  L +G ++ F++     KT EL DRK  ++G  
Sbjct: 701  EIGSSARARSILLHKFDSTSSEMQLLVGLTNGIMVAFVVASSETKTPELQDRKTFAMGNM 760

Query: 577  PITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIA 636
            P+ L        T V A+  R  V++    +L  S V L  +    P ++   P S+ +A
Sbjct: 761  PVLLVKVMVDGHTSVLASGSRSVVVFWEKGRLKTSPVMLNNIFAGAPVDTPWSPSSVVLA 820

Query: 637  KEGELTIGTIDDIQKLHIRSI---PLG-EHPRRICHQEQSRTFAICSLKNQSC---AEES 689
               +L IG++  + K+HI  +   P G ++PRRI +   +  FA+   + +      EE 
Sbjct: 821  TGSKLIIGSVHGLDKMHIVPVTATPFGLDNPRRIVYDSHASVFAVACTRIEPAPVFGEEV 880

Query: 690  EMHFVRLLDDQTFEFISTYPL----DTFEYGCSILSCSFSDDSNV--YYCVGTAYVLPEE 743
                 +LLDD TFE I   P     D  +   +I +    + +    +YCVG      +E
Sbjct: 881  SRGSFKLLDDITFELIGEVPFAENTDQMQEITAIHAFQHVEGNIAISHYCVGLVTYQLDE 940

Query: 744  NEPTKGRILVF----IVEDGKLQL----IAEKETKGAVYSLNAFNGKLLAAINQKIQLYK 795
             EP+ G + +F       DG ++     + ++E  G VY+L +F G ++AAIN  +  +K
Sbjct: 941  KEPSHGELHIFSKGGTEPDGTVKADLLEVVKQEVSGCVYALASFEGYIVAAINSTVSFFK 1000

Query: 796  WMLRDDGTREL---QSECGHHGHILALYVQTRGDFIVVGDLMKSISLL--IYKHEEGAIE 850
              L   GT      + E  H+  + +L V   G ++++GD + S+S+L  I   E G I 
Sbjct: 1001 --LDTSGTEATLVKKHEWNHNYLVTSLVVS--GSYLLIGDAISSVSVLQVIQVDENGEIT 1056

Query: 851  ER----ARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYH 906
            E+    ARDY   W  +++    +  +GA ++ NLF+           ++  L+  G +H
Sbjct: 1057 EKLKTVARDYGPLWPVSLQGWGKEGVIGANSDCNLFSFTLQ---PVTPQKTVLDRDGHFH 1113

Query: 907  LGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRK 966
            L + VN+F HG++       D+      +F T +G IG++  +  E  L +  LQ NL  
Sbjct: 1114 LDDHVNKFLHGTVHSSEKAEDLDIEARSLFFTASGRIGLVLDMGKELSLHMTALQRNLNG 1173

Query: 967  VIKGVGGLNHEQWRSFNNEKKTVDAK----NFLDGDLIESFLDLSRTRMDEISKTMNVSV 1022
            V+K V G  H++WR+  + +   DA      FLDGD IE FL+      DE+S  M   +
Sbjct: 1174 VVKDVSGTTHKRWRAPVSGRGPSDADAEAYGFLDGDFIEQFLN-----YDELSPEMKRIM 1228

Query: 1023 E 1023
            E
Sbjct: 1229 E 1229


>gi|384490729|gb|EIE81951.1| hypothetical protein RO3G_06656 [Rhizopus delemar RA 99-880]
          Length = 967

 Score =  327 bits (837), Expect = 3e-86,   Method: Compositional matrix adjust.
 Identities = 276/1034 (26%), Positives = 468/1034 (45%), Gaps = 152/1034 (14%)

Query: 2   LDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGR 61
           L V IYG I  L++ +   ++   LFI T    + ++ +  ++  ++T   G ++    R
Sbjct: 32  LIVCIYGTIRNLQVIQLPDKSTCSLFILTVHQCYTIITYSLKTQSIVTEFSGQLNITNAR 91

Query: 62  PTDNGQIGIIDPDCRLIGLHLYDGLFKVIPF--------------DNKGQLKEAFNIRLE 107
            TD   +  +D    +I +  + G    IPF              D K        IR  
Sbjct: 92  ETDQQVVVTVDKTSEMIFVSAFTGYVIAIPFGKPAPGAKLTSKNRDTKASRFMQIPIRTN 151

Query: 108 ELQVLDIKFLYGCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLL 167
           E   + +  L       + VL  + +D + +KT++   + KD +E   S   ++     L
Sbjct: 152 EFDFISVAALQQGGY--LSVLVGEMEDLKTIKTFKYRDEYKDLLERNKSTIKVEASTHAL 209

Query: 168 IPVPPPLCGVLIIGEETIVYCS--ANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLL 225
           +PVP PL G+L+IGE  I Y     N  + + I P+   A+  +  + +RYLLGD  G L
Sbjct: 210 VPVPEPLGGLLVIGEYIITYFDPLTNTNRELSIDPARVTAWEFMKDESNRYLLGDEEGYL 269

Query: 226 HLLVITHEKEKVTGLKIELLGET----------------SIASTISYLDNAVVYIGSSYG 269
           ++  I     KV  L    +G+                 S  S I  L N + YIGS++G
Sbjct: 270 YVFSIETSHNKVVNLSSTFIGQVPSFNQNIESKANHPQVSRPSCIVDLGNLMFYIGSTHG 329

Query: 270 DSQLIKLNLQPDAKGSY-VEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLR 328
           DS LI+L ++   K  Y V+VL  Y  LGPIVDFC+ D  +QG+  +  C+G  KD S+R
Sbjct: 330 DSCLIQL-IKGQEKSKYTVKVLSTYSCLGPIVDFCLYDYNKQGKQTMACCAGVEKDASIR 388

Query: 329 IVRNGIGINEQASVELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETE 388
           IV NGIG +++ +++   +  MW+L  S D   D+ L+ + + +T +L  + ++ELE T+
Sbjct: 389 IVENGIGFSKKYALDFPLVYAMWTL--SLDGDRDSLLISTAL-DTVLLKPSDQEELEVTQ 445

Query: 389 IEGFCS--QTQTLFCHDAIYNQLVQVTSGSVRLVSSTSR-ELRNEWKSPPGYSVNVATAN 445
              + +   +Q     D   + +VQ TS  VR++++    +L  EWK P G S+ +A   
Sbjct: 446 HTSYSALDTSQMTLAADMFNSFIVQATSSFVRMMTNDEYGQLIGEWKPPTGTSIAIAKIK 505

Query: 446 ASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDINPIGENPS-YSQIAAVGMWT 504
            S  ++   G  ++YLE+ +    E    QL+   SC+ I+   EN + Y  + A    +
Sbjct: 506 DSHCVVCCEGDMIIYLEMTNKGFIEKSKRQLK-NASCISISTRKENETLYDYVVAGTCGS 564

Query: 505 DISVRIFSLPDLNLITK-EHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTG 563
           + SV    LPDL ++ + + +     P  +L+   E + YL+  LGDG + ++ + + + 
Sbjct: 565 NPSVVFLQLPDLEVVLEHKDMPSTTGPNDLLVVTMEKVLYLMVLLGDGQMFSYHMEVGSE 624

Query: 564 E--LTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHM 621
           +  L    ++ +GT    +  +       VF A  RPTVI S ++ L   +VNL      
Sbjct: 625 DVILESETEIMVGTYCTAMYPYQHGQEKRVFVAGQRPTVISSFHQTLFVYSVNL------ 678

Query: 622 CPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLK 681
                     +  +  E  L I  I DI+ L + S      P +   +   +        
Sbjct: 679 ----------TYELPGEMPLRIEYISDIKALAVASCTNVHDPNKNIIERTGK-------- 720

Query: 682 NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLP 741
                       VRLLD QTF+                                      
Sbjct: 721 ------------VRLLDAQTFQ-------------------------------------- 730

Query: 742 EENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQ-LYKWMLRD 800
                       F +E+ + +LI   +  G VY + +    ++AA++ KI  LY +    
Sbjct: 731 -----------AFTIENSRCELIDAVDMPGVVYRMESIKNTIIAAVDGKIYGLYNFKPDL 779

Query: 801 DGTRELQSECGHHGHILALYVQT-RGDFIVVGDLMKSISLLIYKHEEGAIE--ERARDYN 857
                ++ +   H +++AL + T   D ++VGDLM+S+SLL  + +E +++    A D  
Sbjct: 780 LKGERIEFKFLLHNNVVALDMDTDNNDTLLVGDLMESMSLLKVEKDEESLKLSLEAVDNK 839

Query: 858 ANWMSAVEILDDDIYLGAENNFNLFTVRK---NSEGATDEERGRLEVVGEYHLGEFVNRF 914
             WM+AV+ +++++ +GA++  NLFT+ K     EG T     +LE+ G YHLG  VNRF
Sbjct: 840 QVWMTAVKFVNENVLIGADDRHNLFTMIKPEIRQEGKT----CKLELEGGYHLGTLVNRF 895

Query: 915 RHGSLVMRLPDSDVGQIPTV-------IFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKV 967
           R    ++R  ++    I ++        F TVNG IG + ++  E + F + +Q  +  +
Sbjct: 896 RKD--ILRDVENASDNIDSISKYESEFTFATVNGSIGTVKTISRESFEFFKGIQEGILNI 953

Query: 968 IKGVGGLNHEQWRS 981
           +   G L+H  W S
Sbjct: 954 LPNNGNLDHGLWLS 967


>gi|449704103|gb|EMD44407.1| DNA-repair binding protein, putative [Entamoeba histolytica KU27]
          Length = 1088

 Score =  324 bits (831), Expect = 2e-85,   Method: Compositional matrix adjust.
 Identities = 280/1050 (26%), Positives = 509/1050 (48%), Gaps = 75/1050 (7%)

Query: 22   AQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDNGQIGIIDPDCRLIGLH 81
            ++D+L   T   +  + Q + ++ EL      ++ DR+GR    G  G++  +  L+   
Sbjct: 76   SKDYLIFVTNNQQLIIAQLN-DNKELSLVYSNNIEDRLGRTAFYGISGVVIGNSYLLLHL 134

Query: 82   LYDGLFKVIPFDNKGQ-LKEA-FNIRLEELQVLDIKFLYGCAKPTIVVLYQDNKDARHVK 139
                + K++P   + + L EA FN+R++E +V+ +  L    K  I + ++   D + + 
Sbjct: 135  YNHLI-KIVPLPREEENLPEAPFNLRIDEQRVVGLYCLQLDQKQVIAIHHEIKNDVKVIT 193

Query: 140  TYEVALKDKDFVEGPWSQNNLDNGA-DLLIPVPPPLCGVLIIGEETIVYC-SANAFKAIP 197
             YE  +  K      +++   D    DL IP      G   I  + + +  +++   ++ 
Sbjct: 194  FYEFNIDSKTL--SLYNKQIEDTSTIDLFIPYLGNE-GYFKINSQIVSFVPNSSQISSVS 250

Query: 198  IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVI----THEKE-KVTGLKIELLGETSIAS 252
            +RP   + Y  +D    R LL D+ G +HL+ +     HE E     L+ + L   +I S
Sbjct: 251  MRPIHLETYCFLDE--HRLLLSDNQGKMHLITLKENNNHEIELYYYPLEYQAL---TIPS 305

Query: 253  TISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQGQ 312
            T+ YLDN+V++ GS  GDS LIK+N     KG   E+LE + N GPI+D   +  E   +
Sbjct: 306  TVLYLDNSVLFWGSKGGDSHLIKIN----EKGC--EILETFENRGPILDMTAIHDEITNK 359

Query: 313  GQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDPFDTFLVVSFISE 372
              ++ C   Y+ G+L+++ +G+GI+     E++GI  ++       +    +L++S+   
Sbjct: 360  DNLLMCCNTYQQGTLKLISSGVGIDIICQNEMKGITHLYQAEMGHKE----YLIISYGDG 415

Query: 373  TRILAMNLED---ELEETEIEGFCSQTQTLFCHDAIY------NQLVQVTSGSVRLVSST 423
            +++     E       E EI+GF  + +T+ C   I       +  VQVT   + +  S 
Sbjct: 416  SKVFESQQEVNQLRFNEIEIKGFNRKEETI-CSGIIEIGEMKESCFVQVTREEINIFDSE 474

Query: 424  SRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEY----E 479
              E   + +     S    +    + L  +    +        +L E K  +  Y    +
Sbjct: 475  QLEFIQQIRFNKFIS---HSCIYDEKLYISQSNEI-------NVLNENKEIKQVYSGEND 524

Query: 480  ISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFE 539
            IS   I   G       +  +  W   ++ I ++ + N      + G I  +S+ +   +
Sbjct: 525  ISAFTIGKEGT----EDVIGICYWDSNTMEIINMENGNKNHIREINGNIYGKSIEIINNQ 580

Query: 540  GISYLLCALGDGHLLNF----LLNMKTGELTDRKKV-SLGTQPITLRTFSSKNTTHVFAA 594
               Y++  +GDG +L +    L+ ++ GE+    KV  +G   I L+T       ++   
Sbjct: 581  EGMYVVIGMGDGRVLVYQMEELMEIE-GEIPKNYKVLDIGLSGIELKTLEINGMKYIMCL 639

Query: 595  SDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHI 654
             DRPT++     K+   +VN  E+  + PF      +SL +A +  + IG ID+IQ +H 
Sbjct: 640  CDRPTLVSICKGKIKTLSVNCSEIVSIVPFKMKECMNSLVVATKENIIIGNIDEIQSIHT 699

Query: 655  RSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEES-EMHFVRLLDDQTFEFISTYPLDTF 713
            +S+ +G    RI   +Q     + S + +   E+  E H V+L+D +  E   ++ L   
Sbjct: 700  KSLSIGVFVSRIVVSKQENKALLLSSEIKELKEKRVETHSVKLIDFRKMEIEDSHELKEN 759

Query: 714  EYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAV 773
            E+  SI       D N  + +GTA+  P E EP+ GRIL+  ++DG+L++I EK+  GAV
Sbjct: 760  EHALSIEQIVI--DENEMFVIGTAFAKPNEVEPSSGRILIVQIKDGRLEIIFEKDVNGAV 817

Query: 774  YSLNAFNGKLLA-AINQKIQLYKWM-LRDDGTRE--LQSECGHHGHILALYVQTRGDFIV 829
            YS+     K LA +I +K+ ++++  +  +G  E  LQ +   +  ++ LYV+T G+ I+
Sbjct: 818  YSMKTLLKKYLAMSIEKKLVVFEYQRVITNGEFEVKLQEKGSCNVKLIGLYVKTLGNKIL 877

Query: 830  VGDLMKSISLLIYK---HEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRK 886
            VGDLMKSIS+  +    + +  + E +RD+ A++ +A+E +D+D YL +++N N+     
Sbjct: 878  VGDLMKSISVYSFDNNGNNKNCLTEVSRDFYASYTTAIEFVDEDCYLSSDSNSNILIFNT 937

Query: 887  NSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVI 946
            NS G  + ER RL      H+GE +N    GS+       +  Q   ++FG V G IG I
Sbjct: 938  NSTG-NESERFRLNNCAHIHVGECINVMCKGSIAPTHSTYETVQKKCILFGGVTGYIGGI 996

Query: 947  ASLPHEQYLFLEKLQTNLRKVIKG-VGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLD 1005
              +P+E Y  L K+Q  +   +KG V     + W+   ++ K + + N +DG ++ES+L+
Sbjct: 997  CEIPNEIYDVLIKVQNQILLQMKGIVECTTPDNWKKVIDDWKRMPSSNIIDGSIVESYLE 1056

Query: 1006 LSRTRMDEISKTMNVSVEELCKRVEELTRL 1035
            +S+ +  EI+    V+ E++   +E +  L
Sbjct: 1057 MSKEKQCEIAHLSGVNEEQISDIIENMISL 1086


>gi|183232997|ref|XP_653855.2| damaged DNA binding protein [Entamoeba histolytica HM-1:IMSS]
 gi|169801778|gb|EAL48469.2| damaged DNA binding protein, putative [Entamoeba histolytica
            HM-1:IMSS]
          Length = 1088

 Score =  323 bits (829), Expect = 2e-85,   Method: Compositional matrix adjust.
 Identities = 280/1050 (26%), Positives = 509/1050 (48%), Gaps = 75/1050 (7%)

Query: 22   AQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDNGQIGIIDPDCRLIGLH 81
            ++D+L   T   +  + Q + ++ EL      ++ DR+GR    G  G++  +  L+   
Sbjct: 76   SKDYLIFVTNNQQLIIAQLN-DNKELSLVYSNNIEDRLGRTAFYGISGVVIGNSYLLLHL 134

Query: 82   LYDGLFKVIPFDNKGQ-LKEA-FNIRLEELQVLDIKFLYGCAKPTIVVLYQDNKDARHVK 139
                + K++P   + + L EA FN+R++E +V+ +  L    K  I + ++   D + + 
Sbjct: 135  YNHLI-KIVPLPREEENLPEAPFNLRIDEQRVVGLYCLQLDQKQVIAIHHEIKNDVKVIT 193

Query: 140  TYEVALKDKDFVEGPWSQNNLDNGA-DLLIPVPPPLCGVLIIGEETIVYC-SANAFKAIP 197
             YE  +  K      +++   D    DL IP      G   I  + + +  +++   ++ 
Sbjct: 194  FYEFNIDSKTL--SLYNKQIEDTSTIDLFIPYLGNE-GYFKINSQIVSFVPNSSQISSVS 250

Query: 198  IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVI----THEKE-KVTGLKIELLGETSIAS 252
            +RP   + Y  +D    R LL D+ G +HL+ +     HE E     L+ + L   +I S
Sbjct: 251  MRPIHLETYCFLDE--HRLLLSDNQGKMHLITLKENNNHEIELYYYPLEYQAL---TIPS 305

Query: 253  TISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQGQ 312
            T+ YLDN+V++ GS  GDS LIK+N     KG   E+LE + N GPI+D   +  E   +
Sbjct: 306  TVLYLDNSVLFWGSKGGDSHLIKIN----EKGC--EILETFENRGPILDMTAIHDEITNK 359

Query: 313  GQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDPFDTFLVVSFISE 372
              ++ C   Y+ G+L+++ +G+GI+     E++GI  ++       +    +L++S+   
Sbjct: 360  DNLLMCCNTYQQGTLKLISSGVGIDIICQNEMKGITHLYQAEMGHKE----YLIISYGDG 415

Query: 373  TRILAMNLED---ELEETEIEGFCSQTQTLFCHDAIY------NQLVQVTSGSVRLVSST 423
            +++     E       E EI+GF  + +T+ C   I       +  VQVT   + +  S 
Sbjct: 416  SKVFESQQEVNQLRFNEIEIKGFNRKEETI-CSGIIEIGEMKESCFVQVTREEINIFDSE 474

Query: 424  SRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEY----E 479
              E   + +     S    +    + L  +    +        +L E K  +  Y    +
Sbjct: 475  QLEFIQQIRFNKFIS---HSCIYDEKLYISQSNEI-------NVLNENKEIKQVYSGEND 524

Query: 480  ISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFE 539
            IS   I   G       +  +  W   ++ I ++ + N      + G I  +S+ +   +
Sbjct: 525  ISAFTIGKEGT----EDVIGICYWDSNTMEIINMENGNKNHIREINGNIYGKSIEIINNQ 580

Query: 540  GISYLLCALGDGHLLNF----LLNMKTGELTDRKKV-SLGTQPITLRTFSSKNTTHVFAA 594
               Y++  +GDG +L +    L+ ++ GE+    KV  +G   I L+T       ++   
Sbjct: 581  EGMYVVIGMGDGRVLVYQMEELMEIE-GEIPKNYKVLDIGLSGIELKTLEINGMKYIMCL 639

Query: 595  SDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHI 654
             DRPT++     K+   +VN  E+  + PF      +SL +A +  + IG ID+IQ +H 
Sbjct: 640  CDRPTLVSICKGKIKTLSVNCSEIVSIVPFKMKECMNSLVVATKENIIIGNIDEIQSIHT 699

Query: 655  RSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEES-EMHFVRLLDDQTFEFISTYPLDTF 713
            +S+ +G    RI   +Q     + S + +   E+  E H V+L+D +  E   ++ L   
Sbjct: 700  KSLSIGVFVSRIVVSKQENKALLLSSEIKELKEKRVETHSVKLIDFRKMEIEDSHELKEN 759

Query: 714  EYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAV 773
            E+  SI       D N  + +GTA+  P E EP+ GRIL+  ++DG+L++I EK+  GAV
Sbjct: 760  EHALSIEQIVI--DENEMFVIGTAFAKPNEVEPSSGRILIVQIKDGRLEIIFEKDVNGAV 817

Query: 774  YSLNAFNGKLLA-AINQKIQLYKWM-LRDDGTRE--LQSECGHHGHILALYVQTRGDFIV 829
            YS+     K LA +I +K+ ++++  +  +G  E  LQ +   +  ++ LYV+T G+ I+
Sbjct: 818  YSMKTLLKKYLAMSIEKKLVVFEYQRVITNGEFEVKLQEKGSCNVKLIGLYVKTLGNKIL 877

Query: 830  VGDLMKSISLLIYK---HEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRK 886
            VGDLMKSIS+  +    + +  + E +RD+ A++ +A+E +D+D YL +++N N+     
Sbjct: 878  VGDLMKSISVYSFDNNGNNKNCLTEVSRDFYASYTTAIEFVDEDCYLSSDSNSNILIFNT 937

Query: 887  NSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVI 946
            NS G  + ER RL      H+GE +N    GS+       +  Q   ++FG V G IG I
Sbjct: 938  NSTG-NESERFRLNNCAHIHVGECINVMCKGSIAPTHSTYETVQKKCILFGGVTGYIGGI 996

Query: 947  ASLPHEQYLFLEKLQTNLRKVIKG-VGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLD 1005
              +P+E Y  L K+Q  +   +KG V     + W+   ++ K + + N +DG ++ES+L+
Sbjct: 997  CEIPNEIYDVLIKVQNQILLQMKGIVECTTPDDWKKVIDDWKRMPSSNIIDGSIVESYLE 1056

Query: 1006 LSRTRMDEISKTMNVSVEELCKRVEELTRL 1035
            +S+ +  EI+    V+ E++   +E +  L
Sbjct: 1057 MSKEKQCEIAHLSGVNEEQISGIIENMISL 1086


>gi|407044103|gb|EKE42371.1| DNA damage-binding protein, putative [Entamoeba nuttalli P19]
          Length = 1088

 Score =  323 bits (827), Expect = 4e-85,   Method: Compositional matrix adjust.
 Identities = 277/1051 (26%), Positives = 507/1051 (48%), Gaps = 77/1051 (7%)

Query: 22   AQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDNGQIGIIDPDCRLIGLH 81
            ++D+L   T   +  + Q + ++ EL      ++ DR+GR    G  G++  +  L+   
Sbjct: 76   SKDYLIFVTNNQQLIIAQLN-DNKELSLVYSNNIEDRLGRTAFYGISGVVIGNSYLLLHL 134

Query: 82   LYDGLFKVIPFDNKGQ-LKEA-FNIRLEELQVLDIKFLYGCAKPTIVVLYQDNKDARHVK 139
                + K++P   + + L EA +N+R++E +V+ +  L    K  I + ++   D + + 
Sbjct: 135  YNHLI-KIVPLPREEENLPEAPYNLRIDEQRVVGLYCLQLDQKQVIAIHHEIKNDVKVIT 193

Query: 140  TYEVALKDKDFVEGPWSQNNLDNGA-DLLIPVPPPLCGVLIIGEETIVYC-SANAFKAIP 197
             YE  +  K      +++   D    DL IP      G   I  + + +  +++   ++ 
Sbjct: 194  FYEFNIDSKTL--SLYNKQIEDTSTIDLFIPYLGNE-GYFKINSQIVSFVPNSSQISSVS 250

Query: 198  IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELL------GETSIA 251
            +RP   + Y  +D    R LL D+ G +HL+ +    +     +IEL          +I 
Sbjct: 251  MRPIHLETYCFLDE--HRLLLSDNQGKMHLITLKENNKH----EIELYYYPLEYQALTIP 304

Query: 252  STISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQG 311
            ST+ YLDN+V++ GS  GDS LIK+N     KG   E+LE + N GPI+D   +  E   
Sbjct: 305  STVLYLDNSVLFWGSKGGDSHLIKIN----EKGC--EILETFENRGPILDMTAIHDEITN 358

Query: 312  QGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDPFDTFLVVSFIS 371
            +  ++ C   Y+ G+L+++ +G+GI+     E++GI  ++       +    +L++S+  
Sbjct: 359  KDNLLMCCNTYQQGTLKLISSGVGIDIICQNEMKGITHLYQAEMGHKE----YLIISYGD 414

Query: 372  ETRILAMNLED---ELEETEIEGFCSQTQTLFCHDAIY------NQLVQVTSGSVRLVSS 422
             +++     E       E EI+GF  + +T+ C   I       +  VQVT   + +  S
Sbjct: 415  GSKVFESQQEVNQLRFNEIEIKGFNRKEETI-CSGIIEIGEMKESCFVQVTREEINIFDS 473

Query: 423  TSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEY---- 478
               EL  + +     S    +    + L  +    +        +L E K  +  Y    
Sbjct: 474  EQLELIQQIRLNKFIS---HSCIYDEKLYISQSNEI-------NVLNENKEIKQVYSGEN 523

Query: 479  EISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAF 538
            +IS   I   G       +  +  W   ++ I  + + N      + G I  +S+ +   
Sbjct: 524  DISAFTIGKEGT----EDVIGICYWDSNTMEIIKMENGNKNHIREINGNIYGKSIEIINN 579

Query: 539  EGISYLLCALGDGHLLNF----LLNMKTGELTDRKKV-SLGTQPITLRTFSSKNTTHVFA 593
            +   Y++  +GDG +L +    L+ ++ GE+    KV  +G   I L+T       ++  
Sbjct: 580  QEGMYVVIGMGDGRVLVYQMEELMEIE-GEIPKNYKVLDIGLSGIELKTLEINGMKYIMC 638

Query: 594  ASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLH 653
              DRPT++     K+   +VN  E+  + PF      +SL +A +  + IG ID+IQ +H
Sbjct: 639  LCDRPTLVSICKGKIKTLSVNCSEIVSIVPFKMKECMNSLVVATKENIIIGNIDEIQSIH 698

Query: 654  IRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEES-EMHFVRLLDDQTFEFISTYPLDT 712
             +S+ +G    RI   +Q     + S + +   E+  E H V+L+D +  E   ++ L  
Sbjct: 699  TKSLSIGVFVSRIVVSKQENKALLLSSEIKELKEKRVETHSVKLIDFRKMEIEDSHELKE 758

Query: 713  FEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGA 772
             E+  SI       D N  + +GTA+  P E EP+ GRIL+  ++DG+L+++ EK+  GA
Sbjct: 759  NEHALSIEQIVI--DENEMFVIGTAFAKPNEVEPSSGRILIVQIKDGRLEIVFEKDVNGA 816

Query: 773  VYSLNAFNGKLLA-AINQKIQLYKWM-LRDDGTRE--LQSECGHHGHILALYVQTRGDFI 828
            VYS+     K LA +I +K+ ++++  +  +G  E  LQ +   +  ++ LYV+T G+ I
Sbjct: 817  VYSMKTLLKKYLAISIEKKLVVFEYQRVITNGEFEVKLQEKGSCNVKLIGLYVKTLGNKI 876

Query: 829  VVGDLMKSISLLIYK---HEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR 885
            +VGDLMKSIS+  +    + +  + E +RD+ A++ +A+E +D+D YL +++N N+    
Sbjct: 877  LVGDLMKSISVYSFDNNGNNKNCLTEVSRDFYASYTTAIEFVDEDCYLSSDSNSNILIFN 936

Query: 886  KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGV 945
             NS G  + ER RL      H+GE +N    GS+       +  Q   ++FG V G IG 
Sbjct: 937  TNSTG-NESERFRLNNCAHIHVGECINVMCKGSIAPTHSTYETVQKKCILFGGVTGYIGG 995

Query: 946  IASLPHEQYLFLEKLQTNLRKVIKG-VGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFL 1004
            I  +P+E Y  L K+Q  +   +KG V     + W+   ++ K + + N +DG ++ES+L
Sbjct: 996  ICEIPNEIYDILIKVQNQILLQMKGIVECTTPDDWKKVIDDWKRMPSSNIIDGSIVESYL 1055

Query: 1005 DLSRTRMDEISKTMNVSVEELCKRVEELTRL 1035
            ++S+ +  EI+    V+ E++   +E +  L
Sbjct: 1056 EMSKEKQCEIAHLSGVNEEKISDIIENMISL 1086


>gi|390603312|gb|EIN12704.1| hypothetical protein PUNSTDRAFT_97523 [Punctularia strigosozonata
            HHB-11173 SS5]
          Length = 1268

 Score =  319 bits (817), Expect = 6e-84,   Method: Compositional matrix adjust.
 Identities = 273/925 (29%), Positives = 417/925 (45%), Gaps = 141/925 (15%)

Query: 217  LLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKL 276
            L+GD  G + +L I  +  +  GL +  LGETS  ++I+YL + V Y+GS  GDSQL+++
Sbjct: 344  LIGDAYGRMSMLSI--DPIRGNGLTLLPLGETSPPTSIAYLTSQVFYLGSHMGDSQLLRI 401

Query: 277  --------------------------------------------------NLQPDAK--- 283
                                                              ++ PDA+   
Sbjct: 402  SPTPHSLIDKPTLPIDPRISTVLPSQLSSYNAGESVKGKGKRRADLDSDIDVDPDAQSGR 461

Query: 284  --------GSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIG 335
                    GSY+EVL  Y NL P+ D  +++ +  GQ Q+VTCSG    GSL++VR G  
Sbjct: 462  KGKVVRSDGSYLEVLATYQNLAPVTDALLMNADGSGQPQIVTCSGGANAGSLKVVRKGAD 521

Query: 336  INEQASVE-LQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEE--TEIEGF 392
                A VE L G   +W +R    D  D+++V S +  T++L +   D +         F
Sbjct: 522  FKTAAVVESLPGTVSVWPVRKRYYDNTDSYIVASTLRCTQVLLLEEHDTVNPLVAASTDF 581

Query: 393  CSQTQTLFCHDAIYNQLV-------------QVTSGSVRL-----VSSTSRELRNEWK-S 433
             +   T+   + +  +LV             QVT   +RL     +      L + WK S
Sbjct: 582  ATSGPTISVANILRRRLVNGKSEYEDSSLVVQVTPSKMRLLEHDMIGPVEFRLVDTWKPS 641

Query: 434  PPG-YSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQ-LEYEISCLDINPIGEN 491
             PG   +  A+ N +Q+ L   GG L    +      +++  Q    +IS +   P+   
Sbjct: 642  GPGPQEIVTASINPTQIALGLRGGRLCVFRLAPNDHFDLRFTQHFSNDISAVSCLPLNPG 701

Query: 492  PSYSQIAAVGMWTDISVRIF----SLPDLNLITKEHLGGEIIPRSVLLCAFEGISY---- 543
               S   AVG W   +V I     ++ +L L T        +PRS+LL  F G S+    
Sbjct: 702  NLISAFIAVGFWGSNNVMILCQKGNILELELQTDPL---PALPRSLLLYNF-GTSFGKKD 757

Query: 544  ------LLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDR 597
                  LL  L DG L+++    K  EL D+K V LG  P++L    +     +F    R
Sbjct: 758  PNYHAHLLIGLADGSLVSYAYARK--ELKDKKVVPLGASPVSLVPCEANGKKAIFCCGTR 815

Query: 598  PTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSI 657
              V+Y    +L  +   LK V   CP NS +F  S  +      TIG   ++  LH+RS+
Sbjct: 816  AAVVYWDRDRLQNAPALLKNVVTACPLNSTSFEQSTILVTGSGFTIGHFGNVAGLHVRSV 875

Query: 658  PLG-EHPRRICHQEQSRTFAICSLKNQSCA----EESEMHFVRLLDDQTFEFISTYPLDT 712
            PLG + P+RI +  +S    +  ++ +       E +     RLLDD +F  +  + L+ 
Sbjct: 876  PLGVDVPKRITYNNESHLLGVACIRKEPHRIGDDEGTIRSSFRLLDDTSFGELDRFDLEA 935

Query: 713  FE--YGCSILSCSFSDDSNVYYCVGTA-YVLPEENEPTKGRILVFIVEDGKLQLIAEKET 769
             E      +LS   ++    ++C+GTA +   ++ E +KGR++VF      L  +A  + 
Sbjct: 936  DEDITSAVVLSLGTAEAYTSHFCIGTADFTSDDQLEVSKGRLVVFDPSTKVLSPVATLDV 995

Query: 770  KGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGT-----RELQSECGHHGHILALYVQTR 824
             G VY+L +  G + AA+N  + +Y+  L  DG      R +Q    +H + +   V TR
Sbjct: 996  NGCVYALASIQGLVAAAVNSAVIVYR--LETDGPTFSSKRLVQLANWNHNYFVTNLV-TR 1052

Query: 825  GDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTV 884
            G  I VGD + S+S+L    +  A++  ARDY   W  A+E    D  +GA+  FNLFT 
Sbjct: 1053 GSRIFVGDAISSVSILELTGQ--ALQTVARDYGPLWPVAIESTGPDSVIGADGEFNLFTF 1110

Query: 885  RKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQI--PTVIFGTVNGV 942
            + +         G+LE  G YHLGE VN+F  G L+   P         P     T +G 
Sbjct: 1111 KLS--------EGKLERDGSYHLGEQVNKFVPGGLLAADPAHTTQTTCKPIQTLFTSSGR 1162

Query: 943  IG-VIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDA---KNFLDGD 998
            I  VI  L  E  L L  L  N+ KVI G+ G+ +E WRS    +   DA   + FLDGD
Sbjct: 1163 IAIVIDVLDPEVSLHLTNLHRNMSKVITGLTGIQNETWRSPATSRGRTDAETSRGFLDGD 1222

Query: 999  LIESFLDLSRT--RMDEISKTMNVS 1021
             +E+FLD+ RT   + EI +  NV+
Sbjct: 1223 FLEAFLDVPRTSPSLAEILEGENVA 1247


>gi|395330962|gb|EJF63344.1| hypothetical protein DICSQDRAFT_153890 [Dichomitus squalens LYAD-421
            SS1]
          Length = 1263

 Score =  318 bits (816), Expect = 8e-84,   Method: Compositional matrix adjust.
 Identities = 309/1199 (25%), Positives = 506/1199 (42%), Gaps = 227/1199 (18%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERY--KFCVLQWDAES--SELITRAMGDVSDRIGR 61
            ++GR+  L+      E +  +   T+    +  +L +D +   + L       ++DR  R
Sbjct: 56   MWGRVVGLQAVPAEEEGRSHILAMTDHPDPELILLSYDVKDGVASLTPAGHHSLADRAAR 115

Query: 62   PTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGCA 121
              +      +DP   +     Y G  KV+   + G+L  AF++ L EL +L   FL+   
Sbjct: 116  QAEFVTDFQVDPSGTVAVASCYTGKLKVVTVSD-GKLDTAFDVSLPELNLLAFTFLHTGQ 174

Query: 122  --KPTIVVLYQDNKD-----ARHVKTYE-------------VALKDKDF---------VE 152
              + ++ +L+ +++       R V   E               L DK F         V 
Sbjct: 175  EDRYSLALLHLNHRRQIQLLCRDVDLAEFELSPVHSNILLTATLSDKTFPSLESPLMLVP 234

Query: 153  GPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVY------------------------- 187
             P  + + D G D     P    GVLI+G   I++                         
Sbjct: 235  IPQHETDADEGDD---DAPAHRGGVLILGGRKILFYEHSTEEQQETRKEKHRRLSKRLAS 291

Query: 188  ------------CSANAFKAIPIRPSI------TKAYGRVDADGSRYLLGDHAGLLHLLV 229
                              + I  R S+        A+  VDA+  R LLGD  G L ++ 
Sbjct: 292  EDEAKAQEAKKKEKERESRRIKPRASVKWPWAAVTAWTAVDAERRRVLLGDAYGRLAMIA 351

Query: 230  ITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQP--------- 280
                K     L +  LGE S  + +SYL + V+Y+GS YGDSQL++++  P         
Sbjct: 352  FDDGK---MCLNLIPLGEASSPTCLSYLSSQVLYLGSHYGDSQLLRIHPTPFANATVDTL 408

Query: 281  ------------------------------------------DAKGSYVEVLERYVNLGP 298
                                                      + +G+++EVL+ + N+ P
Sbjct: 409  PIPRGVSSVPPSSLAGSGKGKAKATSGVGDAESGSAREGRVVNTQGTFLEVLQNFDNIAP 468

Query: 299  IVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVE-LQGIKGMWSLRSST 357
            I+D  + D++  GQ QV+T SG    GSLR++R      EQA ++ L G+  +W +++ +
Sbjct: 469  IMDAALADIDGSGQPQVITSSGGRNTGSLRVIRTEADFQEQARLDGLIGVTDIWPVKTHS 528

Query: 358  DDPFDTFLVVSFISETRILAMNLEDELE--ETEIEGFCSQTQTL---------------- 399
             +P  T LVVS + ET + A   +D +   +  + GF +   T                 
Sbjct: 529  AEPIHTHLVVSTLRETHVFAFEGKDAIAHLDPSVAGFTTHAPTFVLGNIPRRVVSASGTS 588

Query: 400  -FCHDAIYNQLVQVTSGSVRLVSSTS--------------RELRNEWKSPPGYSVNVATA 444
             + H ++   +VQ+TS  ++LV                  +++  E+    G  +  A  
Sbjct: 589  SYEHSSL---VVQITSEGIQLVEYEPTLFAFGKVGPGWYPKQVGGEYA---GREIVAAAM 642

Query: 445  NASQVLLATGGGHLVYLEIGDGILTEVKHAQ-LEYEISCLDINPIGENPSYSQIAAVGMW 503
            + SQ ++   GG L    +G     ++   +    EI  +   P   + +Y+   AV  W
Sbjct: 643  SPSQFVVGLSGGRLALFNLGQKDTVQLLTTRNFPDEICAISCQPYDSSKNYASAIAVSFW 702

Query: 504  TDISVRIFSL-PDLNLITKEHLGGEIIPRSVLLCAF---------EGISYLLCALGDGHL 553
                V + +L P   L T        +PR VLL  F         +   +LL  L DG L
Sbjct: 703  GSNKVAVLALDPSSPLATVCDTALPTLPRVVLLHNFGTGGRPKDPDFHPHLLVGLADGTL 762

Query: 554  LNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNV 613
            + + L  + G+L DRK+  LG  P +L        T VFA+  R +V++   +++  S V
Sbjct: 763  VTYAL--RDGKLHDRKQSGLGNAPASLSVCDVDGRTVVFASGARSSVLFWDRQRVRPSPV 820

Query: 614  NLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG-EHPRRICHQEQS 672
            ++K++      N+AAFP  LAIA    L IGT+  + K+ IR+IPLG + PRR+ +    
Sbjct: 821  SVKDMIKGVTLNTAAFPSCLAIATSSALLIGTVRGLDKMQIRTIPLGLDSPRRLGYHAGQ 880

Query: 673  RTFAICSLKNQSCA----EESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSC-SFSDD 727
              F +   +         EE    F +LLD  +F+ +  +     E   S+L+  +    
Sbjct: 881  GVFGVACTRTTPDRIGDYEEVTGSF-KLLDAHSFQQLHNFLCQADEEPSSVLTLPAHGSG 939

Query: 728  SNVYYCVGTAYVLPEENEPTKGRILVFIVEDGK---------LQLIAEKETKGAVYSLNA 778
            ++  + +GT Y+ PEE EP+KGRIL+F V   +         L  +A     G VY+L  
Sbjct: 940  ASPAFALGTVYIRPEEREPSKGRILLFSVSSTEGARGANVRSLHTLASVNVGGCVYALAN 999

Query: 779  FNGKLL-AAINQKIQLYKWMLRDDG-----TRELQSECGHHGHILALYVQTRGDFIVVGD 832
             +  L+ AAIN  + L+K    + G     + E  +E  H+  +  + V   G+ I+VGD
Sbjct: 1000 LSENLIVAAINTSVVLFKSTENEAGESTPLSLEKVTEWNHNHFVTNVVVD--GERILVGD 1057

Query: 833  LMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGAT 892
             + S+S+L +      +E  ARDY   W  A+E   + + +GA  + NLF+    S    
Sbjct: 1058 AISSVSVLKWNERLERLESIARDYGPLWPIAIEGTGNGL-IGANADCNLFSFSLQSV--- 1113

Query: 893  DEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTV----IFGTVNGVIGVIAS 948
               R  LE  G YHL +  N+F  G+L      +DV +   V    +F T  G IG I  
Sbjct: 1114 -PHRTYLEKDGVYHLNDVTNKFVRGALT----STDVAEDQVVKASHVFFTSTGCIGAILD 1168

Query: 949  LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAK---NFLDGDLIESFL 1004
            +     L +  LQ N+ K + G GG NH + R+ +  +   DA+    FLDGD +E +L
Sbjct: 1169 MNDVTSLHMTALQRNMAKTLTGPGGDNHTKLRAPSTPRGHTDAEASYGFLDGDFLEQYL 1227


>gi|167390599|ref|XP_001739420.1| DNA damage-binding protein [Entamoeba dispar SAW760]
 gi|165896898|gb|EDR24200.1| DNA damage-binding protein, putative [Entamoeba dispar SAW760]
          Length = 1088

 Score =  315 bits (807), Expect = 9e-83,   Method: Compositional matrix adjust.
 Identities = 282/1056 (26%), Positives = 513/1056 (48%), Gaps = 87/1056 (8%)

Query: 22   AQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDNGQIGIIDPDCRLIGLH 81
            ++D+L   T   +  + Q + ++ EL      ++ DR+GR    G  G++  +  L+   
Sbjct: 76   SKDYLIFVTNNQQLVIAQLN-DNKELSLVYSNNIEDRLGRTAFYGISGVVISNSHLLLHL 134

Query: 82   LYDGLFKVIPFDNKGQ-LKEA-FNIRLEELQVLDIKFLYGCAKPTIVVLYQDNKDARHVK 139
                + K+IP   + + L EA FN+R++E +V+ +  L    K  I + ++   D + + 
Sbjct: 135  YNHLI-KIIPLPREEENLPEAPFNLRIDEQRVVGLYCLQLDQKQVIGIHHEIKNDIKVIT 193

Query: 140  TYEVALKDKDFVEGPWSQNNLD-NGADLLIPVPPPLCGVLIIGEETIVYC-SANAFKAIP 197
             YE  ++ K      +++   D +  DL IP      G   I  +T+ +  ++N   ++ 
Sbjct: 194  FYEFNIESKTL--NLYNKQIEDTSNIDLFIPYLGNE-GYFKINSQTVSFVPNSNQISSVS 250

Query: 198  IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELL------GETSIA 251
            +RP   + Y  +D    R LL D+ G +HL+ +    +     +IEL          +I 
Sbjct: 251  MRPIHLETYCFLDE--KRLLLSDNKGKMHLITLKENNKH----EIELYYYPLEYQALTIP 304

Query: 252  STISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQG 311
            STI YLDN+V++ GS  GDS LIK+N +        E+LE + N GPI+D   +  E   
Sbjct: 305  STILYLDNSVLFWGSKGGDSHLIKINEKR------CEILETFENRGPILDMTAIHDEITN 358

Query: 312  QGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDPFDTFLVVSFIS 371
            +  ++ C   Y+ G+L+++ +G+GI+     E++GI  ++ +     +    +L++S+  
Sbjct: 359  KDDLLMCCNTYQQGTLKLISSGVGIDIICQNEMKGITHLYQVEMGNKE----YLIISYSD 414

Query: 372  ETRILAMNLED---ELEETEIEGFCSQTQTLFCHDAIYNQ------LVQVTSGSVRLVSS 422
             +++     E+   +  E EI+GF  + +T+ C   I          VQVT   V +  S
Sbjct: 415  NSKVFESQQENNQLQFNEIEIKGFNRKEETI-CSGIIEIGEMKEICFVQVTREEVNIFDS 473

Query: 423  TSRELRNEWKSPPGYS--------VNVATANASQVLLATGGGHLVYLEIGDGILTEVKHA 474
               +L  + +     S        + ++ +N   VL                   E+KH 
Sbjct: 474  EQLKLIQQIRVNKFISHSCIYDEKLYISQSNEINVLNEK---------------KEIKHI 518

Query: 475  QL-EYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSV 533
               E +IS   I   G       +  +  W    + I  L + N      + G I  +S+
Sbjct: 519  YSGENDISAFTIGKEGT----EDVIGICYWDSNKMEIIKLENGNKNHIREINGNIYGKSI 574

Query: 534  LLCAFEGISYLLCALGDGHLLNF----LLNMKTGELTDRKKV-SLGTQPITLRTFSSKNT 588
             +   +   Y++  +GDG +L +    L+ ++ GE+T   KV  +G   I L+T      
Sbjct: 575  EIINNKEGMYIVIGMGDGRVLVYQMEELMEIE-GEITKNYKVLDIGLSGIELKTLEINGM 633

Query: 589  THVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDD 648
             ++    DRPT++     K+   +VN  E+  + PF      +SL +A +  + IG I++
Sbjct: 634  KYIMCLCDRPTLVSIYKGKIKTLSVNCSEIVSIVPFKMKECMNSLIVATKENIIIGNIEE 693

Query: 649  IQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEES-EMHFVRLLDDQTFEFIST 707
            IQ +H +S+ +G    RI   +Q     + S + +   E+  E H ++L+D +  E   +
Sbjct: 694  IQSIHTKSLSIGVFISRIVISKQENKALLLSSEIKELKEKRIESHSIKLIDFRKMEIEDS 753

Query: 708  YPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEK 767
            + L   E+  SI       D    + +GTA+  P E EP+ GRIL+  ++DGKL+++ EK
Sbjct: 754  HELKENEHALSIEEIVV--DEKEMFVIGTAFAKPNEVEPSSGRILIVQIKDGKLEIVFEK 811

Query: 768  ETKGAVYSLNAFNGKLLA-AINQKIQLYKWM-LRDDGTRE--LQSECGHHGHILALYVQT 823
            +  GAVYS+     K LA +I +K+ ++++  +  +G  E  LQ +   +  ++ LYV+T
Sbjct: 812  DVNGAVYSIKTLLKKYLAMSIEKKLVIFEYQRIITNGEFEVKLQEKGSCNVKLIGLYVKT 871

Query: 824  RGDFIVVGDLMKSISLLIY---KHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFN 880
             G+ I+VGDLMKSIS+  +    + +  + E +RD+ A++ +A+E +D++ YL +++N N
Sbjct: 872  MGNKILVGDLMKSISVYSFDNNGNNKNCLNEVSRDFYASYTTAIEFVDENCYLSSDSNSN 931

Query: 881  LFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVN 940
            L     NS G  + ER RL      H+GE +N    GS+       +  Q   ++FG V 
Sbjct: 932  LLVFNTNSTG-NESERFRLNNCAHIHVGECINVMCKGSIAPTHSTYETIQKKCILFGGVT 990

Query: 941  GVIGVIASLPHEQYLFLEKLQTNLRKVIKG-VGGLNHEQWRSFNNEKKTVDAKNFLDGDL 999
            G IG I  +P+E Y  L K+Q  +   +KG V     ++W+   ++ K + + N +DG++
Sbjct: 991  GYIGGICEIPNEIYDILIKVQNQILLQMKGIVECTTPDEWKKVIDDWKRMPSSNIIDGNI 1050

Query: 1000 IESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRL 1035
            +ES+L++S+ +  EI+    V+ E++   +E +  L
Sbjct: 1051 VESYLEMSKEKQCEIAHLSGVNEEQINDIIENMISL 1086


>gi|119191318|ref|XP_001246265.1| hypothetical protein CIMG_00036 [Coccidioides immitis RS]
          Length = 1072

 Score =  312 bits (799), Expect = 8e-82,   Method: Compositional matrix adjust.
 Identities = 228/773 (29%), Positives = 388/773 (50%), Gaps = 86/773 (11%)

Query: 6   IYGRIATLE-LFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPT 63
           IYG+I+ L+ + R H  A D LF+ T+RY +  L WD  + +L T +   D++D   R  
Sbjct: 55  IYGKISVLQKVPRSHSSATDLLFVGTDRYAYFTLSWDPSTCQLHTEQKYLDIADPSLRDN 114

Query: 64  DNGQIGIIDPDCRLIGLHLYDGLFKVIPF-------------------DNKGQLKEAFNI 104
            +G    +DP  + + + +Y+G+  VIP                    + +  L E    
Sbjct: 115 QSGDRSWVDPSGKFLTMEIYEGIISVIPIAQEPLKRPSPSAGRSSGTSEQREYLGEPLQT 174

Query: 105 RLEELQVLDIKFLY--GCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEG-PWSQ---- 157
           R+EEL V    FL+      P I +LY++ +    +K     L+D  +  G P  +    
Sbjct: 175 RIEELIVRSTAFLHHDPTKPPRIAILYENTQGKVKLK-----LRDLIYSRGIPGGEASAA 229

Query: 158 ---------NNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIRP----SITK 204
                    ++L+ GAD+L+PVP PL GVLI+GE+ I Y      + I  RP    +I  
Sbjct: 230 EFRDVDDLYDDLELGADILVPVPLPLGGVLILGEKFIKYIDTVKNETI-TRPLEHNTIFV 288

Query: 205 AYGRVDADGSRYLLGDHAG-LLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVY 263
           A+ ++D    R+LL D  G L   ++I +    V   K+ LLGETS AS + +L   VV+
Sbjct: 289 AWEQLD--NQRWLLADDYGRLFFFMLILNSANAVQSWKVGLLGETSRASALVHLGGGVVF 346

Query: 264 IGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQG----------QG 313
           +GS  GDS +I++      +GS+ E+++   N+GPI+DF V+DL  +G          Q 
Sbjct: 347 LGSHQGDSHVIRIT-----EGSF-EIIQTLSNIGPILDFTVMDLGNRGETPTHEFSSGQA 400

Query: 314 QVVTCSGAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLRSSTDDPFDTFLVVSFISE 372
           ++VT SGA++DGSLR VR+G+G+ +   +  ++ I  +W L +   + F   L++SF+ E
Sbjct: 401 RIVTGSGAFRDGSLRSVRSGVGMEDLGVLGAMEHITDLWGLSAFCPEGFCDTLLLSFVDE 460

Query: 373 TRILAMNLEDELEETE-IEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEW 431
           +R+   + + E+EE +   G      T+   +    +++QVT    R+    SR    EW
Sbjct: 461 SRVFHFSPDGEVEEKDDFLGLLLGEPTIHAANLPSRRILQVTEHGARVTDVESRMTLWEW 520

Query: 432 KSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGI-LTEVKHAQLEYEIS--CLDINPI 488
            +     +  A++N   ++L  GG  L+  +IGD I ++  K  + + ++S   L  +PI
Sbjct: 521 SAVESRKITAASSNDRHLVLMVGGQKLMVFDIGDDIKMSSTKTFEADKQVSGVALTSSPI 580

Query: 489 GENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLG--GEIIPRSVLL-CAF-EGISYL 544
                  Q   +       V I  L  LN+   E LG  G+ +PRSVL+ C F +    L
Sbjct: 581 -------QACILCFPQSAEVTIIDLTGLNIRHTETLGEPGDAVPRSVLVACMFSDRAPTL 633

Query: 545 LCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTH-VFAASDRPTVIYS 603
             A+ DG + +F LN+    L+D  K+ LG++    +     +  + +FA  D P++IY+
Sbjct: 634 FVAMADGSVFSFSLNVANYSLSDANKLVLGSEAPVFKLLPRADGLYNIFATCDHPSLIYA 693

Query: 604 SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHP 663
           S  +++YS VN  + + +C FN+ A+P ++A+A   ++ I  +D  +   I+++ + E  
Sbjct: 694 SEDRIVYSAVNSDKATRICHFNAEAYPGAIAVATPDDIKIALVDAERTTQIQTLMINETV 753

Query: 664 RRICHQEQSRTFAICSLKNQ--SCAEESEMHFVRLLDDQTFEFISTYPLDTFE 714
           RR  +    R F + +++       EE + HF+ L D+  F  +S + L+  E
Sbjct: 754 RRTSYSSTERAFGLGTIQRTLVQNVEEVKSHFI-LADEIMFRQLSVFDLNPNE 805



 Score =  105 bits (261), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 71/241 (29%), Positives = 120/241 (49%), Gaps = 12/241 (4%)

Query: 805  ELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGA----IEERARDYNANW 860
            E++ E  +      + +   G+ I V DLMKSISL+ Y   EG     ++E AR Y   W
Sbjct: 835  EIEKEASYRTSTAPVDISVTGNIIAVADLMKSISLVEYHAGEGGQPDTLKEVARHYQTLW 894

Query: 861  MSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLV 920
             +A   + ++ +L A+   NL  + +++ G T+++R R++V  E  LGE VNR  H   +
Sbjct: 895  TTAAAPVAENEFLVADAEGNLVVLNRDTTGVTEDDRRRMQVTSELRLGEMVNRI-HPMDL 953

Query: 921  MRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWR 980
               P+S V  IP     TV+G I +   +       L +LQ+ L   +   G +   ++R
Sbjct: 954  QTSPESPV--IPKAFLATVDGSIYLFGLISPSAQDTLMRLQSALADFVASPGEIPFNKYR 1011

Query: 981  SFNNE-KKTVDAKNFLDGDLIESFL----DLSRTRMDEISKTMNVSVEELCKRVEELTRL 1035
            +F +  ++  +   F+DG+LIE FL    D+    +  +     V+V E+   +E L R+
Sbjct: 1012 AFKSSVRQAEEPFRFVDGELIEQFLTFPPDIQEAALARMDGGGRVNVIEIKGMIEGLKRM 1071

Query: 1036 H 1036
            H
Sbjct: 1072 H 1072


>gi|380481704|emb|CCF41690.1| CPSF A subunit region, partial [Colletotrichum higginsianum]
          Length = 932

 Score =  308 bits (788), Expect = 1e-80,   Method: Compositional matrix adjust.
 Identities = 256/968 (26%), Positives = 449/968 (46%), Gaps = 115/968 (11%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPTD 64
           +YG I  L+  RP     D LF+ T+R+++   +WD  +  L T + + D ++   R   
Sbjct: 7   VYGTILFLQRLRPKDTKADLLFVGTDRFQYFTARWDPATQRLQTEQVIEDAAEPHMRDAQ 66

Query: 65  NGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKG----QLKEAF-NIRLEELQVLDIKFL-Y 118
           +    ++DP  + + +HL++G+  V+    +     +L +++  +RL EL V    F+  
Sbjct: 67  SQDRCLVDPTGKFMAMHLWEGVLNVMRLGTRKGTFTRLDDSWERVRLSELFVKASTFVPT 126

Query: 119 GCAKPTIVVLYQDN--KDARHVKTYEVALKDKDFVEGPWSQNN--------LDNGADLLI 168
               PT+  LYQ    K+  H+  Y +   DK+     +  +          D  A ++I
Sbjct: 127 ETGHPTVAYLYQSQIEKEDAHLAIYRLMSDDKNTTVSRFDPSRDREFELEIKDPYARIVI 186

Query: 169 PVP------------------PPLCGVLIIGEETIVYCSANAFKAIPI---RPSITKAYG 207
           PVP                    L G++++GE  +VY        +      P+I  A+ 
Sbjct: 187 PVPIVEDEVKRYHKRDTTGAKAQLGGLIVVGETLLVYVDTLTRTVVESGLNSPAIFVAWA 246

Query: 208 RVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSS 267
             D   + Y L D  G LHLL I  E   VT L + LLG TS AS + ++ N ++++GS 
Sbjct: 247 AYD--DTNYFLSDDYGNLHLLTIETEGVVVTNLSLRLLGVTSRASCLVHMGNGLLFLGSH 304

Query: 268 YGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLER-----------QGQGQVV 316
           YGDSQL+++N++   K   V+ +    ++ PI+DF ++DL              GQ ++V
Sbjct: 305 YGDSQLLQINME-SLKTRLVQTIP---SIAPILDFSIMDLGNAGDSQVGNAFSSGQARIV 360

Query: 317 TCSGAYKDGSLRIVRNGIGINEQASVE-LQGIKGMWSLRSSTDDPFDTFLVVSFISETRI 375
              G +++GSLR +R+ +G+ +   +E LQ ++G++SLRS      DT LVVSFI+ETRI
Sbjct: 361 AGCGVHQNGSLRSIRSSVGLEDIGVLEDLQDVRGLFSLRSHGSPKVDT-LVVSFITETRI 419

Query: 376 LAMNLEDELEET-EIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSP 434
            + + E  +EE  E +G      TL        +L+QVTS +V L+ S      N W  P
Sbjct: 420 FSFDPEGGIEEVFEFQGLALDRPTLVATTLPDGRLLQVTSTTVTLLESERGITLNTWAVP 479

Query: 435 PGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKH-----AQLEYEISCLDINPIG 489
            G ++  A+AN   VLL+  G  LV L + + +  + +         E +ISC     I 
Sbjct: 480 DGKAIVNASANNKWVLLSINGTTLVSLNLLNNLSAQEQVLGRDIGGHEDQISC-----IH 534

Query: 490 ENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGE----IIPRSVLLCAFEGISYL- 544
                  +  VG W   SV I  L  L+ +  E +        +PR + L        L 
Sbjct: 535 AASDLDDVGVVGFWATGSVSIIDLRTLDALHGETIKQTDDSVSVPRDLALVQLHPPHLLG 594

Query: 545 ---LCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTF---SSKNTTHVFAASDRP 598
                A+ DG +++F ++ +   L+ RK V+LG+Q   L        +  +++FA ++  
Sbjct: 595 PTLFVAMEDGQVVSFNVSKQDFSLSSRKSVTLGSQQAGLHVLPRPGGEGISNIFATTEHS 654

Query: 599 TVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIP 658
           ++IYSS  +++YS    ++V+++ PF+S AFPD++ +A +  + I  ID  ++ H+  +P
Sbjct: 655 SLIYSSEGRIIYSAATAEDVTYIAPFDSEAFPDAIFLATDQNIRIAHIDAERRTHVNPLP 714

Query: 659 LGEHPRRICHQEQSRTFAICSLKNQSC-AEESEMHFVRLLDDQTFEFIST-YPLDTFEYG 716
           L E  RR+ +    R F I ++  +    EE      RL+D+     +   + LD     
Sbjct: 715 LRETVRRVAYSPALRAFGIGTISRELVNNEEVVTSSFRLVDEIVLGVVGKPFHLDG---- 770

Query: 717 CSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSL 776
                   S  + +   V  A +     +P                  AE+   GA  SL
Sbjct: 771 --------STSTEMVESVIRAELPDSMGQP------------------AERFIIGACRSL 804

Query: 777 NAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKS 836
                KL+A +++ + +Y++      +  L+           + +   G+ I + DLM+S
Sbjct: 805 GIMGEKLVAGLSKTVVVYEYAEESSTSGALRKLATFRPSTFPVDIDVNGNMIGIADLMQS 864

Query: 837 ISLLIY----KHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGAT 892
           ++L+ +       +  + ERAR +   W ++V  L+   +L A+   NL  +R+N +  T
Sbjct: 865 MTLVEFIPAQDGNKAKLVERARHFQYIWATSVCHLEGHSWLEADAQGNLMVLRRNPDAPT 924

Query: 893 DEERGRLE 900
           + ++ ++E
Sbjct: 925 EHDQKQME 932


>gi|449549048|gb|EMD40014.1| hypothetical protein CERSUDRAFT_63520 [Ceriporiopsis subvermispora B]
          Length = 1265

 Score =  303 bits (776), Expect = 4e-79,   Method: Compositional matrix adjust.
 Identities = 239/797 (29%), Positives = 383/797 (48%), Gaps = 84/797 (10%)

Query: 283  KGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASV 342
            KG ++EVL+ + N+ PIVD  + DL+  GQ Q++TCSG    G+L++VR G    E A V
Sbjct: 450  KGQFIEVLDTFQNIAPIVDAVLADLDDSGQSQIITCSGGRNSGALKVVRTGADFQELARV 509

Query: 343  E-LQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEE--TEIEGFCSQTQTL 399
              + GI  +W +RS  +   D+ LV S  +ET +   +  D + +  +  +GF +   TL
Sbjct: 510  NGITGITSLWPVRSRFEHSTDSHLVASTETETYVFKFDSSDVITQLDSSADGFITTAPTL 569

Query: 400  FCHD--------------AIY---NQLVQVTSGSVRLVSSTSR----ELRNEWKSPPGYS 438
               +              + Y   + ++QVT   + L+   +      L  E   P    
Sbjct: 570  AAANIPRRVSTNAGGRVSSSYVDSSLVIQVTPEMITLLEYDAALGLFSLVGEGWDPKSQG 629

Query: 439  -------VNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQL-------EYEISCLD 484
                   +  A  NASQ+++   GG +  L + D    +V+ ++          ++S + 
Sbjct: 630  AIGGRRRIVAADVNASQIVVGLDGGKVCLLNLADNSRFQVQRSRGFADPVYGPLDVSAVS 689

Query: 485  INPIGENPSYSQIAAVGMWTDISVRIFSLP--DLNLITKEHLGG-EIIPRSVLLCAF--- 538
              P     +++   AV  W    V+I S+   D  L T   + G   +PRSVLL  F   
Sbjct: 690  CVPFDRTKNFATNIAVAFWGTNKVQILSISSQDATLATVCEVSGLPSLPRSVLLHNFGTG 749

Query: 539  ------EGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
                  +   ++L  L DG +++F  ++   EL ++K  SLG  P++L          VF
Sbjct: 750  RTKKEPDFHPHVLVGLVDGSVISF--SVVENELKEKKVFSLGIAPVSLSRCEVDGKITVF 807

Query: 593  AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
            A   R +V+Y   ++L  S V +K+++     N++ F  SL +A    L IGTI  + K+
Sbjct: 808  AVGSRTSVLYWDKQRLTNSPVMIKDMAVGTSLNTSYFRSSLVLAASSGLIIGTIRGVDKM 867

Query: 653  HIRSIPLG-EHPRRICHQEQSRTFAI-C--SLKNQSCAEESEMHFVRLLDDQTFEFISTY 708
             IRSIP G   PRRI +  + + F + C  ++  +    +      +++D  TF  +  +
Sbjct: 868  QIRSIPFGLSDPRRIAYHSRLKLFGVGCNRTMPLRPGEFQGTTSSFKIIDATTFNGLWDF 927

Query: 709  PLDTFEYGCSILSCS-FSDDSNVYYCVGTAYVLPEENEPTKGRILVFIV--------EDG 759
             L   E   S+++    +D  +  +CVGT +   EE EPT GR+L+F +         DG
Sbjct: 928  ELQANEEVSSVMALPDGTDGRSPCFCVGTVFFEVEETEPTSGRLLLFAIGSDGATSSADG 987

Query: 760  KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTR-ELQSECGHHGHILA 818
            +L+L+  ++ KG V+ + + N  + AAIN  + L+   LRD   +  LQ     + +   
Sbjct: 988  ELRLVTTQDVKGCVFQITSVNNFIAAAINSNVVLFA--LRDTNKQYALQQVADWNHNYFV 1045

Query: 819  LYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENN 878
              + + GD ++VGD + S+SLL  +     IE  +RDY+  W  AVE   ++  +GA ++
Sbjct: 1046 TNLASHGDRLIVGDAISSVSLL--RVSVARIECLSRDYSPLWPVAVEATAENQIIGANSD 1103

Query: 879  FNLFTVR-KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQI--PTVI 935
             NLF+   ++ +G     R  LE  G YHL + VN+F  G LV    DS  G    P  +
Sbjct: 1104 CNLFSFALQHIDG-----RKVLERDGSYHLDDIVNKFAPGGLVA--ADSSTGYTLRPRQL 1156

Query: 936  FGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAK--- 992
            F + +G IGVI  +  E  L L  LQ N+ K IKG G  NH +WR+  N +   DA+   
Sbjct: 1157 FFSSSGRIGVIIDVDDELSLPLTSLQMNMAKRIKGPGDTNHTEWRAPTNARGRTDAEASA 1216

Query: 993  -NFLDGDLIESFLDLSR 1008
              FLDGD IE FL   R
Sbjct: 1217 FGFLDGDFIEQFLTHPR 1233



 Score = 63.2 bits (152), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 34/78 (43%), Positives = 50/78 (64%), Gaps = 5/78 (6%)

Query: 205 AYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYI 264
           A+  +D +  R+L+GD  G L LLV+       TGL +  LGETS  +T+SYLD+ VVY+
Sbjct: 318 AWCPLDEESRRFLVGDAYGRLALLVL-----DTTGLIVSPLGETSPPTTLSYLDSQVVYV 372

Query: 265 GSSYGDSQLIKLNLQPDA 282
           GS  G+ QL++++  P A
Sbjct: 373 GSHMGNPQLLRIHTSPIA 390


>gi|170589359|ref|XP_001899441.1| Xeroderma Pigmentosum Group E Complementing protein [Brugia malayi]
 gi|158593654|gb|EDP32249.1| Xeroderma Pigmentosum Group E Complementing protein, putative
           [Brugia malayi]
          Length = 521

 Score =  303 bits (775), Expect = 4e-79,   Method: Compositional matrix adjust.
 Identities = 186/446 (41%), Positives = 277/446 (62%), Gaps = 29/446 (6%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
           ++GRIAT++LFR  GE  D L I T +Y   +++W   +SEL TRA G + DR+GRP++ 
Sbjct: 57  VFGRIATIKLFRAPGENVDSLLILTAKYHLAIIRW-TPTSELRTRASGHIVDRVGRPSET 115

Query: 66  GQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGCAKPTI 125
           G I  +     ++   LYDGL KV+ + N+G+    FN+R ++L ++DI F+    +PT+
Sbjct: 116 GMIATVHSSGLMV-FRLYDGLLKVVQW-NEGKDLRGFNVRCDDLYIVDITFMSDPDRPTL 173

Query: 126 VVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETI 185
             +YQD+ + RH+K   + + DK+     W  +NL+  A+++I VP P+ G LI G + I
Sbjct: 174 AYIYQDD-NGRHIKVVALNIDDKELSSPLWKHDNLEGEANIVIGVPEPVGGCLIAGPDAI 232

Query: 186 VYCS----ANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEK-- 236
            Y      A  +  +P   +  +    Y  VD DG RYLL D AG L++L++   K++  
Sbjct: 233 SYHKGGDDALRYAGVPGSRLHNTHPNCYAPVDRDGQRYLLADLAGNLYMLLLELGKDQEQ 292

Query: 237 -------VTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGS-YVE 288
                  V  +K+E LGET IA  + YLBN V +IGS +GDSQLI+L+ +P A G+ Y+ 
Sbjct: 293 DENSAMYVRDMKVESLGETCIAECMCYLBNGVCFIGSRFGDSQLIRLSTEPRADGTGYIS 352

Query: 289 VLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIK 348
           +L+ Y NL PI D  V  +   GQ Q++TCSGAYKDG++RI+RNGIGI E ASVEL+GIK
Sbjct: 353 LLDSYTNLAPIRDMTV--MRCNGQQQILTCSGAYKDGTIRIIRNGIGIEELASVELKGIK 410

Query: 349 GMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQ 408
            M++LR+  DD FD +L++SF SET +L +N E ELE+TEI GF     TL+     +++
Sbjct: 411 NMFTLRTR-DDEFDDYLILSFDSETHVLLINGE-ELEDTEITGFTVDGATLWAGCLFHSK 468

Query: 409 -LVQVTSGSVRLVSSTSRELRNEWKS 433
            ++QVT G V L+   + ++   WKS
Sbjct: 469 TILQVTHGEVILIDGDNIQV---WKS 491


>gi|312076588|ref|XP_003140928.1| xeroderma Pigmentosum Group E Complementing protein [Loa loa]
          Length = 516

 Score =  295 bits (754), Expect = 1e-76,   Method: Compositional matrix adjust.
 Identities = 183/449 (40%), Positives = 273/449 (60%), Gaps = 29/449 (6%)

Query: 3   DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRP 62
           + P++GRIAT++LFR  GE  D L I T +Y   +++W   +SEL TRA G + DR+GRP
Sbjct: 54  ECPVFGRIATIKLFRAPGEDVDSLLILTAKYHLAIIRW-TPTSELRTRASGHIVDRVGRP 112

Query: 63  TDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGCAK 122
           ++ G I  +     ++   LYDGL KV+ + N+G+    FN+R ++L ++DI F+    +
Sbjct: 113 SETGMIATVHSSGLMV-FRLYDGLLKVVQW-NEGKDLRGFNVRCDDLYIIDITFMSDPDR 170

Query: 123 PTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGE 182
           PT+  +YQD+ + RH+K   + ++DK+     W  +NL+  A ++I VP P  G LI G 
Sbjct: 171 PTLAYIYQDD-NGRHIKVVTLNIEDKELSSPLWKHDNLEGEASMVIGVPEPAGGCLIAGP 229

Query: 183 ETIVYCS----ANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE 235
           + I Y      A  +  +P   +  +    Y  VD DG RYLL D AG L++L++   K 
Sbjct: 230 DAISYHKGGDDALRYAGVPGSRLHNTHPNCYAPVDRDGQRYLLADLAGNLYMLLLEFGKG 289

Query: 236 K---------VTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGS- 285
           +         V  +K+E LG T IA  + YLDN V +IGS +GDSQLI+L+ +P A G+ 
Sbjct: 290 QEQDESSTVSVKDMKVESLGNTCIAECMCYLDNGVCFIGSRFGDSQLIRLSTEPRADGTG 349

Query: 286 YVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQ 345
           Y+ +L+ Y NL PI D  V  +   GQ Q++TCSGAYKDG++RI+RNGIGI E ASVEL+
Sbjct: 350 YISLLDSYTNLAPIRDMTV--MRCNGQQQILTCSGAYKDGTIRIIRNGIGIEELASVELK 407

Query: 346 GIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAI 405
           GIK M++LR+  D  FD +L++SF S+T +L +N E ELE+T+I GF     TL+     
Sbjct: 408 GIKNMFTLRTR-DHEFDDYLILSFDSDTHVLLINGE-ELEDTQITGFVVDGATLWAGCLF 465

Query: 406 YN-QLVQVTSGSVRLVSSTSRELRNEWKS 433
            +  ++QVT G V L+   + ++   WK+
Sbjct: 466 QSTTILQVTHGEVILIDGDNIQI---WKA 491


>gi|449540702|gb|EMD31691.1| hypothetical protein CERSUDRAFT_109269 [Ceriporiopsis subvermispora
            B]
          Length = 1265

 Score =  288 bits (737), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 234/801 (29%), Positives = 383/801 (47%), Gaps = 92/801 (11%)

Query: 283  KGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASV 342
            KG ++EVL+ + N+ PI+D  + DL+  GQ Q++TCSG    G+L++VR G    E A V
Sbjct: 450  KGQFIEVLDTFQNMAPIMDAVLADLDDSGQSQIITCSGGRNSGALKVVRAGADFQELARV 509

Query: 343  E-LQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEE--TEIEGFCSQTQTL 399
              + GI  +W +RS ++   D+ LV S  +ET +   +  D + +  +  + F +   TL
Sbjct: 510  NGITGITSLWPVRSRSEHSTDSHLVASTETETYVFKFDSSDVITQLDSSADEFITTAPTL 569

Query: 400  FCHDAIYNQLVQVTSGSVRLVSSTSRE-------------------------LRNEWKSP 434
                A+ N   +V++ +   VSS+  +                         L  E   P
Sbjct: 570  ----AVANIPRRVSTNASGRVSSSYVDSSLVIQVTPEKITLLEYGAALGLFSLVGEGWDP 625

Query: 435  PGYS-------VNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQ-------LEYEI 480
                       +  A  NASQ+++   GG +  L + D    +V+ ++          ++
Sbjct: 626  KSQGAIGGRRRIVAADVNASQIVVGLDGGKVCLLNLADNSRFQVQRSRDFADPVYGPLDV 685

Query: 481  SCLDINPIGENPSYSQIAAVGMWTDISVRIFSLP--DLNLITKEHLGG-EIIPRSVLLCA 537
            S +   P     +++   AV  W    V+I S+   D  L T   + G   +PRSVLL  
Sbjct: 686  SAVSCVPFDRTKNFATNIAVAFWGTNKVQILSMSSQDATLATVCEVSGLPSLPRSVLLHN 745

Query: 538  F---------EGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNT 588
            F         +   ++L  L DG +++F   +   EL ++K  SLG  P++L      + 
Sbjct: 746  FGTGRTKKEPDYHPHVLVGLVDGSVISF--KVVENELKEKKVFSLGIAPVSLSRCEVDDK 803

Query: 589  THVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDD 648
              VFA   R +V+Y   ++L  S V +K+++     N++ F  SL +A    L IGTI  
Sbjct: 804  ITVFAVGSRTSVLYWDKQRLTNSPVMIKDMTVGASLNTSYFRSSLVLAASSGLIIGTIRG 863

Query: 649  IQKLHIRSIPLG-EHPRRICHQEQSRTFAI-C--SLKNQSCAEESEMHFVRLLDDQTFEF 704
            + K+ IRSIP G  +PR I +  + + F + C  ++  +    +      +++D  TF+ 
Sbjct: 864  VDKMQIRSIPFGLTNPRWITYHSRLKLFGVGCNHTMPLRLGEFQGTTSSFKIVDATTFDG 923

Query: 705  ISTYPLDTFEYGCSILSCS-FSDDSNVYYCVGTAYVLPEENEPTKGRILVFIV------- 756
            +  + L   E   S+++    +D  +  +CVGTA++  EE EP  GR+L+F +       
Sbjct: 924  LWDFELQANEEVTSVMALPDGTDGRSPSFCVGTAFLEVEETEPRSGRLLLFAIGSDGATS 983

Query: 757  -EDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTR-ELQSECGHHG 814
              DG+L+L+A ++ KG V+ + + N  + AAI+  + L+   LR+   +  LQ     + 
Sbjct: 984  SADGELRLVATQDVKGCVFQITSVNSFIAAAISSNVVLFA--LRNTNKQYALQQVADWNH 1041

Query: 815  HILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG 874
            +     + + GD ++VGD + S+SLL  +  +  IE  +RDY      AVE   ++  +G
Sbjct: 1042 NYFVTNLASHGDLLIVGDAISSVSLL--RVSDSRIECLSRDYGPLRPVAVEATAENQIIG 1099

Query: 875  AENNFNLFTVR-KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQI-- 931
            A +  NLF+   ++ +G     R  LE  G YHL + V +F  G LV    DS  G    
Sbjct: 1100 ANSYCNLFSFALQHIDG-----RKVLERDGSYHLDDIVKKFVPGGLVA--ADSSTGYTLR 1152

Query: 932  PTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDA 991
            P  +F + +G IGVI  +  E  L L  LQ N+ K IKG G  NH +WR+  N +   DA
Sbjct: 1153 PRQLFFSSSGRIGVIIDVDDELSLPLTSLQMNMAKRIKGPGDTNHTEWRAPTNARGRTDA 1212

Query: 992  K----NFLDGDLIESFLDLSR 1008
            +     FLDGD IE FL   R
Sbjct: 1213 EASAFGFLDGDFIEQFLTHPR 1233



 Score = 63.9 bits (154), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 70/296 (23%), Positives = 127/296 (42%), Gaps = 65/296 (21%)

Query: 47  LITRAMGDVSDRIGRPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEA-FNIR 105
           L+T+    + +R GR  +      + P   +  +  Y G+ KV+ F        A F++ 
Sbjct: 100 LVTKGHVSLQERGGRVAEFVTDVFVHPGGEVAVVSCYAGMLKVVQFKEGNIDTHAHFDVS 159

Query: 106 LEELQVLDIKFLYGCAKPTIVVLYQ-DNKDARHVKTYEVALKDKDFVEGP--------WS 156
           + E+ +L + FL+      I+ +   D++    + + ++ L++ +    P         S
Sbjct: 160 IPEMNILALNFLHSDDNQYILAIVNYDHQRRVQLLSRKLDLENYELDPSPSVVLRSTQLS 219

Query: 157 QNNLDN--GADLLIPVPPP-----LCGVLIIGEETIVY---CSANAFK------------ 194
            ++  +     LL+P+ P      L GVL++G   I++    S N  +            
Sbjct: 220 SSHFPSIETPPLLVPILPEGDGGHLGGVLVLGGRKILFFEHTSKNRQQIKRWKQIRLESR 279

Query: 195 --------------------AIPIRPSIT--------KAYGRVDADGSRYLLGDHAGLLH 226
                               A  ++P  +         A+  +D +  R+L+GD  G L 
Sbjct: 280 LSSKDPMEVAKAREKEKERDARKVKPKFSVKWPWSDVTAWCPLDEECRRFLVGDAYGRLA 339

Query: 227 LLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDA 282
           LLV+       TGL +  LGETS  +T+SYLD+ VVY+GS  G+ QL++++  P A
Sbjct: 340 LLVL-----DTTGLIVSPLGETSPPTTLSYLDSQVVYVGSHMGNPQLLRIHTSPIA 390


>gi|387219217|gb|AFJ69317.1| dna damage-binding protein 1a, partial [Nannochloropsis gaditana
           CCMP526]
          Length = 383

 Score =  288 bits (736), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 169/380 (44%), Positives = 224/380 (58%), Gaps = 31/380 (8%)

Query: 405 IYNQLVQVTSGSVRLV--SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLE 462
           +   LVQVT  SVRL+  +S S  L +EW S  G  + VA  NA QV+LA  GG +VYLE
Sbjct: 4   VGEMLVQVTDASVRLLDLASASSPLLSEWISAHGTRITVAAGNAMQVVLALSGGEVVYLE 63

Query: 463 I--GDGILTEVKHAQLEYEISCLDINPI------GEN---------------PSYSQIAA 499
           +      L E     LE+E+SCL ++P+      GE+               P  + + A
Sbjct: 64  LKANQRALEEKARIHLEHEVSCLSVHPLIPGPVPGEDGAEEAMQVEDGREEAPPSAFLVA 123

Query: 500 VGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEG-ISYLLCALGDGHLLNFLL 558
           VG WTD+SVR+ +LP L  + +  LG +   RSVLL   +  I YLL  LGDG L++F +
Sbjct: 124 VGTWTDLSVRLLALPSLQSLHRCELGSDTQARSVLLITLQADIHYLLVGLGDGFLVSFAV 183

Query: 559 NM--KTGELTDRKKVSLGTQPITLRTFSSKNTTH-VFAASDRPTVIYSSNK-KLLYSNVN 614
            +  KT  L  RKKVSLGTQP++L  FSS      VF  S+RPTVI+ S   KLLYSNVN
Sbjct: 184 ALEGKTPALGPRKKVSLGTQPLSLTPFSSTAAEPCVFVCSERPTVIHVSKADKLLYSNVN 243

Query: 615 LKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRT 674
             EV+ M PF+SA FPD LA+A E  L IGT+DDIQKL I+++PLGE PRRI H  Q+  
Sbjct: 244 TSEVTLMAPFHSALFPDCLALASETGLRIGTVDDIQKLRIQTVPLGESPRRIAHIPQAGV 303

Query: 675 FAICSLKNQSCAE-ESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
            A+ + K       E E ++VR LDD +FE +  + L   E  CS+ +C+F+ D   Y  
Sbjct: 304 LAVLTAKYAVGENGEEETNYVRFLDDASFEAVGAFELRPMELACSVAACTFAKDPREYLV 363

Query: 734 VGTAYVLPEENEPTKGRILV 753
           VGT   L +E+EP +GR++V
Sbjct: 364 VGTCMALEDEDEPREGRLIV 383


>gi|225680146|gb|EEH18430.1| DNA damage-binding protein [Paracoccidioides brasiliensis Pb03]
          Length = 1138

 Score =  282 bits (721), Expect = 8e-73,   Method: Compositional matrix adjust.
 Identities = 235/819 (28%), Positives = 383/819 (46%), Gaps = 116/819 (14%)

Query: 15  LFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPTDNGQIGIIDP 73
           LF P     D LFI T+R  +  + W+  +S+L T R   D++D   R +  G   +IDP
Sbjct: 113 LFLPQ---TDVLFIGTDRATYFTVSWNPVTSQLRTERKYVDLADPSSRESQLGDRCLIDP 169

Query: 74  DCRLIGLHLYDGLFKVIPFDNKG-----------------------------------QL 98
             + I L LY+G+  VIP                                        +L
Sbjct: 170 SGKFITLELYEGIITVIPIGQPQRTARQSGRKYGKRAVTAQNQHDSSHIGNTNAAGEVEL 229

Query: 99  KEAFNIRLEELQVLDIKFLYGCAK--PTIVVLYQDNKDARHVKTYEV------------- 143
            E    R++EL V    FL+  A   P +  LY+D      +K  E+             
Sbjct: 230 GEPCQARVDELLVRSSAFLHTQADMLPRMAFLYEDTMGQVRLKVRELEFTYGGMGIGTGG 289

Query: 144 ----------ALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYC--SAN 191
                       K  D +     +  L+ GA  LIPVP PL G+L++GE +I Y   + N
Sbjct: 290 GAGQDTGCIAVFKALDLL-----KEELEMGASFLIPVPAPLGGLLVLGETSIRYLDDATN 344

Query: 192 AFKAIPI-RPSITKAYGRVDADGSRYLLGDHAG-LLHLLVITHEKEKVTGLKIELLGETS 249
              ++P+   +I  A+ +VD  G R+LL D  G L  L++I  E   V   K++LLG   
Sbjct: 345 ECISLPLDEATIFVAWEQVD--GQRWLLADDYGRLFFLMLILDEDNAVQSWKLDLLGNIP 402

Query: 250 IASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDL-- 307
            AS + YL   V +IGS  GDSQLI++      +GS+ E+++ + N+ PI+DF ++DL  
Sbjct: 403 RASVLVYLGGGVTFIGSHQGDSQLIRIT-----EGSF-EIIQTFSNIAPILDFTIMDLGG 456

Query: 308 ---ERQ------GQGQVVTCSGAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLRSST 357
              E Q      GQ ++VT SGA+ DGSLR VR+G+G+ E   +  ++ I  +W+LR + 
Sbjct: 457 RAGENQTHDFSSGQARIVTGSGAFDDGSLRSVRSGVGMEEVGVLGAMEHITDLWALRVAC 516

Query: 358 DDPFDTFLVVSFISETRILAMNLEDELEET-EIEGFCSQTQTLFCHDAIYNQLVQVTSGS 416
            + F   L+VSF+ ETR+     + E+EE  E  G      TL   +    +++QVT  +
Sbjct: 517 QEGFSDTLLVSFVDETRVFHFTQDGEVEEKDEFMGLGLAESTLLAANLPNGRILQVTERN 576

Query: 417 VRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTEVKHAQ 475
           VR+       L   W  P   ++  A++N   ++L  GG  L+  +I G+  L   K   
Sbjct: 577 VRVADLDDGMLLWNWSPPSQKAITAASSNDDHLVLVVGGQVLMCFDIQGEVKLAGKKDFG 636

Query: 476 LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLG--GEIIPRSV 533
            + ++S + +     +P+   I  +    ++ V   +L D+ +     +G  GE  PRSV
Sbjct: 637 DDTQVSGVTVT---SSPATYCILCLPQTAEVVV--MNLEDMTIRHSTSVGEPGEAFPRSV 691

Query: 534 LLCAF--EGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGT-QPITLRTFSSKNTTH 590
           L+        + L  ++ DG + +F  N     L    K+ LG+ QP   +        +
Sbjct: 692 LVAEVLPNQPATLFVSMADGRVFSFSFNADEFTLKKMSKLVLGSEQPSFKKLPRGDGLYN 751

Query: 591 VFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQ 650
           +FA  ++P++IY++  +++YS V+  + S +C FNS A+P S+A+A   EL I  +D  +
Sbjct: 752 IFATCEQPSLIYATEGRIIYSAVHSDQASRICHFNSEAYPGSIALATPTELKIAHVDSER 811

Query: 651 KLHIRSIPLGEHPRRICHQEQSRTFAICSLKN--QSCAEESEMHFVRLLDDQTFEFISTY 708
              I ++ + E  RR+ +    + F I ++K   ++ AE     F+ L D+  F  +  Y
Sbjct: 812 TTQIHTLEIDETVRRVAYSAAEKAFGIGTIKRTLENGAEVITSRFM-LADEIMFRELDDY 870

Query: 709 PLDTFEYGCSILSCSFSD--DSNV------YYCVGTAYV 739
            L   E   S++   F +  DSN        + VGT+Y+
Sbjct: 871 SLRPDELVESVIQAQFPEGKDSNGNESFKDIFVVGTSYL 909



 Score =  109 bits (273), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 71/231 (30%), Positives = 119/231 (51%), Gaps = 14/231 (6%)

Query: 820  YVQTRGDFIVVGDLMKSISLLIYKHEEG----AIEERARDYNANWMSAVEILDDDIYLGA 875
            Y+   G+ I V DLMKS+S++ +K  E     ++ E AR +   W +AV  + ++++L +
Sbjct: 908  YLDDVGNLIAVADLMKSVSIIEFKQGENDQPDSLTEVARHFQTLWSTAVAPIAENMFLES 967

Query: 876  ENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQI---P 932
            +   NL  + +N  G TD+++ RLEV  E  LGE VNR R  S+   LP +   +    P
Sbjct: 968  DAEGNLVVLNRNVNGVTDDDKRRLEVTSEILLGEMVNRIRPVSIQGSLPATGPREAVISP 1027

Query: 933  TVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNE-KKTVDA 991
                GTV G I +   +       L +LQ+ +  ++   G +   ++R+F N  ++  + 
Sbjct: 1028 KAFLGTVEGSIYLFGLINPAYQDLLMRLQSAMAGLVVTPGAMPFNKFRAFKNAVRQAEEP 1087

Query: 992  KNFLDGDLIESFLDLSRTRMDEI------SKTMNVSVEELCKRVEELTRLH 1036
              F+DG+LIE F        +EI          +V+VE++ + VEEL R+H
Sbjct: 1088 YRFVDGELIERFWTCETALQEEIVGLVVAGGVADVTVEKVKRIVEELRRMH 1138


>gi|409049568|gb|EKM59045.1| hypothetical protein PHACADRAFT_181065 [Phanerochaete carnosa
            HHB-10118-sp]
          Length = 1268

 Score =  277 bits (709), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 267/947 (28%), Positives = 412/947 (43%), Gaps = 155/947 (16%)

Query: 189  SANAFKAIPIRPSITKAYGRVDADGSR----------YLLGDHAGLLHLLVITHEKEKVT 238
            +A   K +  R ++   +  V A  SR          + +GD  G L LL I  + E   
Sbjct: 308  AARELKKVKPRATVKWPWSEVTASASRWCPADDEMRRFFVGDVFGRLSLLTINDDPE--- 364

Query: 239  GLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQP------------------ 280
             L I  LGE S  +T+SYL + V+Y+GS  G+SQL++++  P                  
Sbjct: 365  -LIIIPLGEISSPTTLSYLSSQVLYVGSVLGNSQLLRISPSPVGDIDSDTLPIPGGIHTI 423

Query: 281  ----------------------------------DAKGSYVEVLERYVNLGPIVDFCVVD 306
                                              + KG ++E L +Y N+ PIVD  + D
Sbjct: 424  KPAELSTLRAESPDEDYDMRDAFDTPEGRGGKIVNCKGRFIEELTQYANIAPIVDAVMAD 483

Query: 307  LERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLRSSTDDPFDTFL 365
             +  GQ Q++TCSG    GSL +VR G    E A + E+  +  +W +R+S D P DT++
Sbjct: 484  PDESGQPQIITCSGGANTGSLNVVRTGADFQELAVLNEIPNVTNIWPIRTSFDGPADTYV 543

Query: 366  VVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYN-----------------Q 408
            + + + E+ +L  +  D +  T I+   ++  T     A+ N                  
Sbjct: 544  LATTLYESFVLRFDDGDSV--TRIDPSATELVTNRPTIAVANIPRRTTQQNASTYINSSL 601

Query: 409  LVQVTSGSVRLVS---------------STSRELRNEWKSPPGYSVNVATANASQVLLAT 453
            +VQVT   + LV                S  ++    W++     +  A+ N SQ  +A 
Sbjct: 602  VVQVTPQGLNLVEYDVALGAFNKVGDGWSLQKQENPLWRA---KEIVAASINPSQFAVAL 658

Query: 454  GGGHLVYLEIG-DGILTEVKHAQLE-YEISCLDINPIGENPSYSQIAAVGMWTDISVRIF 511
             GG  +   +  DG L  ++  +    EI+ L   P+  + ++S   AV  W   +V + 
Sbjct: 659  NGGTFLLFNLSPDGQLNLLQTREFHGKEIAALSCAPLDPSKNFSPFVAVSFWGSNTVTLL 718

Query: 512  SLPDLNLITKEHLGGEIIP---RSVLLCAF-EGIS--------YLLCALGDGHLLNFLLN 559
            S  D  L T  H     +P    S+LL  F  G S        Y++  L DG +    ++
Sbjct: 719  STKDPALGT--HTESAPLPALAHSLLLHNFGAGRSTRDADFQPYVVAGLVDGTVA--CVS 774

Query: 560  MKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVS 619
             +  EL D+K  +LG  P++L   +      V A   R  V Y   ++L  S V LK V+
Sbjct: 775  FRNNELRDQKLFALGAAPVSLAVSTVDGARMVLATGSRAAVFYWDRQRLRQSPVMLKNVA 834

Query: 620  HMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG-EHPRRICHQEQSRTFAIC 678
                 N+AAFP    +A    L +G I  + K+ IRS  LG E PRRI +      F + 
Sbjct: 835  VGAGLNTAAFPACQILATPSSLVVGQIRGVDKMQIRSFGLGHESPRRIAYHSDLNLFGVS 894

Query: 679  SLKNQSC----AEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCV 734
              K+       AE  E  F  + D  TF  IS +  +  E   ++L+      +   + V
Sbjct: 895  IAKSAPARVGEAELQESTF-EIKDSVTFSTISLFTAEPDEEITAVLA--LPAGAPGCFVV 951

Query: 735  GTAYVLPEENEPTKGRILVFIVE---DG--KLQLIAEKETKGAVYSLNAFNGKLLAAINQ 789
            GT      E EP+ GR+++F V    DG  +L+ +AE E  G VY+L A    + AA+N 
Sbjct: 952  GTVKHQHGEFEPSAGRLILFGVVPSVDGGRELKKLAEAEANGCVYALAAVENGVAAAVNT 1011

Query: 790  KIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLL-IYKHEEGA 848
             + LY  M+  +G   L+     + +     +  RG  ++ GD + S+S+L + +     
Sbjct: 1012 SVDLYG-MVEHEGAHALEKVAAWNHNYFVTSLVARGGRLIAGDAISSVSVLEVLRGSH-- 1068

Query: 849  IEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLG 908
            +   ARDY   W  AVE   D   +GA  + NLFT         +  R  LE  G YHLG
Sbjct: 1069 LRTIARDYGPVWPVAVEATKDGGVIGANTDGNLFTF-----ALPEGPRAVLERNGHYHLG 1123

Query: 909  EFVNRFRHGSLVMRLPDSDVGQIPTV-----IFGTVNGVIGVIASLPHEQY-LFLEKLQT 962
            E VN+F  G+LV        G   ++     +F T +G +G++  +  +   L L  LQ 
Sbjct: 1124 ELVNKFIPGALVGNNSGGGSGGEGSLFETEQLFVTSSGRVGLVHHVCDDGVALALTALQR 1183

Query: 963  NLRKVIKGVGGLN-HEQWRSFNNEKKTVDAKN----FLDGDLIESFL 1004
            NL  V+    G   H QWR+  N +   DA++    FLDGDL+ESFL
Sbjct: 1184 NLGGVLGDGPGGTAHGQWRAPANARGRSDAEDGATGFLDGDLLESFL 1230



 Score = 47.4 bits (111), Expect = 0.041,   Method: Compositional matrix adjust.
 Identities = 34/133 (25%), Positives = 61/133 (45%), Gaps = 6/133 (4%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERY--KFCVLQWDAESSELITRAMGDVSDRIGRPT 63
           I+GR+ ++       E    L + T+    K  +  ++ E+ EL  +   D+ DR  R  
Sbjct: 56  IWGRVLSVCAVPAEAEGVSNLLVLTDHPYPKLILFAFNGETRELDEKWFADLHDRNARHA 115

Query: 64  DNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYG---C 120
           +     ++ P  R   +  Y G  KV+ F  KG + + F + L EL +L + FLY     
Sbjct: 116 EYLNDIVVHPAGRAAAVSCYAGKLKVVTF-KKGGVDKHFEVILPELNLLALSFLYSEPNA 174

Query: 121 AKPTIVVLYQDNK 133
              T+ +++ D+K
Sbjct: 175 HTHTLAIMHIDHK 187


>gi|299743623|ref|XP_001835883.2| hypothetical protein CC1G_02971 [Coprinopsis cinerea okayama7#130]
 gi|298405741|gb|EAU85948.2| hypothetical protein CC1G_02971 [Coprinopsis cinerea okayama7#130]
          Length = 1282

 Score =  277 bits (708), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 273/980 (27%), Positives = 441/980 (45%), Gaps = 178/980 (18%)

Query: 212  DGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDS 271
            D SR+L+ D  G L LL +  +  K  G+ +  LG TS  ++++YL N V+++GS  GDS
Sbjct: 326  DDSRFLISDVCGGLSLLGV--DNVKTNGMTLLPLGMTSPPTSLTYLTNQVIFVGSHLGDS 383

Query: 272  QLIK------------LNLQPDAK------------------------------------ 283
            QL++            L++ P+ K                                    
Sbjct: 384  QLVQVSSTPNNQDGPMLDILPEIKTVARNMPAPSRDKGKGRASDEAMDVDDEDDDDISRG 443

Query: 284  ------GSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGIN 337
                  GS++ VL  + N+ PI D  VVD+E  GQ ++VTCSG Y  GSL IVR+G   +
Sbjct: 444  RIIKPEGSHLHVLHSFKNIAPINDAVVVDVEGNGQNEIVTCSGGYTSGSLNIVRSGAEYH 503

Query: 338  EQASVE-LQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDE--------LEETE 388
            E A++  +  +  +W+++S+ +D   + +V S  +  R L   ++D+        L+ T 
Sbjct: 504  EAATLPGVCNVNSLWTIKSNFEDTIHSHIVAS--THDRTLLFRIKDDGRNTTFTLLDSTA 561

Query: 389  IEGFCSQTQTLFCHD----------AIY---NQLVQVTSGSVRLVSST----SRELRNEW 431
               F +   T+   +          ++Y   N +VQVT   V L+           R  W
Sbjct: 562  ARDFITDQPTVALANVRKRVSVERKSVYRDCNWVVQVTDNVVNLLEHDVVLGGFNKRASW 621

Query: 432  KSPPGYS-----VNVATANASQVLLATGGGHLVYLEIGD-GILTE-VKHAQLEYEISCLD 484
              P   +     +  A  N +QV+LA  GG LV L   + G   E V       EIS + 
Sbjct: 622  SPPSSVAPRPVEIVAADINPTQVVLALSGGRLVVLRHNEEGTAFELVAEKNTLREISAVS 681

Query: 485  INPIGENPSYSQIAAVGMWTDIS---------VRIFSL----------PDLNL---ITKE 522
                     Y+++  VG W  ++         V I  L          P L     ++K+
Sbjct: 682  CQAADTKTPYTKVFLVGYWEQVAEADTDRDTVVEILELERRSNPGSGTPSLTCLVKVSKK 741

Query: 523  HLGGEIIPRSVLLCAFEGI--------------SYLLCALGDGHLLNFLLNMKTG-ELTD 567
            ++    +PRS+LL +F G+              ++L C L DG + +F++    G  +TD
Sbjct: 742  YVPA--LPRSLLLYSF-GVPDSEPTNLKPNSQPTHLFCGLADGSVAHFVVWKDGGLNVTD 798

Query: 568  RKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSA 627
             K V LGT P+            V A  +R ++     K++ +S V LK+++   P N+ 
Sbjct: 799  SKIVPLGTTPVKFSACVVDGKRCVLAVGNRASIFSYERKRMAHSPVMLKDLNAAYPLNTH 858

Query: 628  AFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG-EHPRRICHQEQSRTFAI-CSLKNQSC 685
             FP S  +A    LTIG++ +I K+ IR+IPLG ++P+RI H    R +A+ C+      
Sbjct: 859  TFPTSFILANHQGLTIGSVKEIGKISIRTIPLGYDNPQRIVHIPLLRAYAVACATYTPVR 918

Query: 686  AEESEMH--FVRLLDDQTFEFISTYPLDTFEYGCSILSCS--FSDDSNVYYCVGTAYVLP 741
              ++E     ++LLDD TF+ +S Y  D+ E   ++ + +   S        VGT+    
Sbjct: 919  VGDAEAFKGSLKLLDDLTFKQLSQYNCDSDEVISALTTFTEEISGKETPLLVVGTS---- 974

Query: 742  EENEPTKGRILVFIVEDG----KLQLIAEKETKGA-VYSLNAFNGKLLAAINQKIQLYKW 796
                 ++ R+LVF V       +L LI   E  G  V SL      +LAA++  +  YK+
Sbjct: 975  ---SSSQARLLVFSVASSEACQELTLITSLEVNGQHVNSLCVMGNYVLAAVDCAVFSYKF 1031

Query: 797  ----MLRDDGTRELQSECGHHGHILALYVQTRGDF---IVVGDLMKSISLLIYKHEEGAI 849
                   D  + EL+ E G   H     V++ G F   +V+GD+  S+SL+     +G  
Sbjct: 1032 KGSSDDTDSQSSELK-EVGEWNH--NYIVRSLGSFNNSLVIGDIASSVSLV--NVNKGQF 1086

Query: 850  EERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGE 909
               ARDY   +  A+E L ++  +G  +  NLFT    S G     R  LE  G + LG+
Sbjct: 1087 TPIARDYAPLFPYALEALSENALIGGNDASNLFTF---SLGQGGMGRKVLERDGSFFLGD 1143

Query: 910  FVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQY-LFLEKLQTNLRKVI 968
               +F  GS+       +  + P  IF T +G IG +  +  +Q  L L  LQ NL  ++
Sbjct: 1144 LATKFIRGSITTDYTAIEALE-PIAIFFTGSGRIGAVIDIKDQQLALHLSGLQRNLSALV 1202

Query: 969  KGVGGLNHEQWRSFNNEKKTVDAK----NFLDGDLIESFLDL--SRTRMDEI------SK 1016
            +GVG   H ++R+  N +   DA+     F+DGD +E+FL +   +  +D++       +
Sbjct: 1203 QGVGASTHTKFRAPRNNRGRTDAEAAGFGFIDGDFVETFLGMLGDKGMVDKVMTGQSAPE 1262

Query: 1017 TMNVSVEELCKRVEELTRLH 1036
             +  SV+E  K +E L  LH
Sbjct: 1263 KLEFSVDEYQKTLETLQGLH 1282


>gi|343429611|emb|CBQ73184.1| related to UV-damaged DNA-binding protein [Sporisorium reilianum
            SRZ2]
          Length = 1505

 Score =  275 bits (703), Expect = 9e-71,   Method: Compositional matrix adjust.
 Identities = 246/978 (25%), Positives = 426/978 (43%), Gaps = 174/978 (17%)

Query: 217  LLGDHAGLLHLLVITHEKE---------KVTGLKIELLGETSIAS---TISYLDNAVVYI 264
            L    +G L+LL IT             +   +++E LG TS  +    +SYL + ++ +
Sbjct: 515  LFACSSGALNLLRITMPSGGQITPSSPLQPKSMRVETLGTTSQPAGPQALSYLGDGLICV 574

Query: 265  GSSYGDSQLIK---------LNLQPDAK-------------------------------G 284
            GS+ GDS L K         + L P  +                                
Sbjct: 575  GSATGDSCLYKILQQGAEQDMPLSPKEQLLTPPSSPTQSRRRRSSQVVGASLDNTELPTA 634

Query: 285  SYVEVLERYVNLGPIVDFCVVD-------LERQGQGQVVTCSGAYKDGSLRIVRNGIGIN 337
              + V+E + NLGP+V+F V D            Q ++VTCSGA   GS+R  R+G  + 
Sbjct: 635  GSLAVVETWQNLGPVVEFVVDDGAGGDPTCSSGAQARIVTCSGAGPSGSIREARSGASVQ 694

Query: 338  EQASVELQGIKGMWSLRSSTD-DPFDTFLVVSFISETRILAMNLEDELEE---------T 387
            + +S+ +   + +W + +  D   +   L++ F + T  L  + + +L +         T
Sbjct: 695  DVSSLSIPNAQQIWPVHAGNDASKYSVGLLLGFATSTAYLHFDAKGDLADATDRLAATGT 754

Query: 388  EIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVS--STSRELRNEWKSPPGYSVNVATAN 445
            ++        T++  D    QL+++T     LVS    S  L ++W  P G  V  A+AN
Sbjct: 755  DLTLPTLAASTVWSSDH-QPQLLRITRSEACLVSLNDESASLLHQWVPPKGLEVTAASAN 813

Query: 446  A-SQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWT 504
               QV+LA     L++L +  G L E    QLE+E+SC+D++P+      +Q+AA G W 
Sbjct: 814  VHGQVVLALSDKTLLHLSVESGALVEKGKVQLEHEVSCVDVSPLVAG-KAAQLAACGFWQ 872

Query: 505  DISVRIFSLPDLNLITKEHLGGE---IIPRSVLLCAFEG--------------------- 540
              +++I+SLP+L  + +  +  +    +PRS+LL  F                       
Sbjct: 873  TRTIQIYSLPELAAVGQSSVVQQRFPAVPRSILLHRFASKQTDGGSDTQSKGSLSNRDAL 932

Query: 541  ISYLLCALGDGHLLNFLLNM-------KTGELTDRKKVSLGTQPITLRTFSSKNTTHVFA 593
              +LL  LGDG L+++ L++       KT  L+D K VSLGTQ + L    +     V A
Sbjct: 933  TPHLLIGLGDGTLVSYSLSLPTHDSYSKTVGLSDAKTVSLGTQALKLDALETAAGARVVA 992

Query: 594  AS-DRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
             S  RPT++Y+ +K+  Y+ +  K+   +   ++       A A    + + +I  +++ 
Sbjct: 993  VSGSRPTLVYADSKRFSYNALKYKDQRTVATLHAGQERVFAAFALSDSVELASIGALRQR 1052

Query: 653  HIRSIPLG-EHPRRICHQEQSRTFAICSL----KNQSCAEESEMHFVRLLDDQTFEFIST 707
             IR+ PLG   P  I        FA+C+     +  +   +     VR+LD  TFE +  
Sbjct: 1053 DIRTFPLGLNQPLAITQWANRGVFAVCTWAFLPRGTASKSDGPRGAVRILDQTTFETLDE 1112

Query: 708  YPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIV---------ED 758
              L+  E    I         +    VGT +V  + +E T+GR++ F V         E 
Sbjct: 1113 IRLEPDERPNCI--TVLQAQGHEILVVGTGFVSEQASETTQGRLVGFDVSNGSSRTKEER 1170

Query: 759  GKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWM-LRDDGTRELQSECGHHGHIL 817
            G+L+ + E    G VYS+ + N +L AA+N ++++Y  +  R       +      G   
Sbjct: 1171 GRLRKLFEHSETGNVYSVQSINNRLAAAVNSEVKIYSVVDPRPSEVPAPRIRVRQRGSWA 1230

Query: 818  ALYVQT-----RGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDD-- 870
              ++         D IVVGD ++S+++L        + E ARD +  W SA ++LDD+  
Sbjct: 1231 CSFIACNLSVIEPDRIVVGDALRSMNVLHVHPYTARLTELARDCDPFWTSATDLLDDESQ 1290

Query: 871  IYLGAENNFNLFTVR------------------------------KNSEGA--TDEERGR 898
             Y+GA+ +FNL+T +                              + + G    D     
Sbjct: 1291 TYIGADISFNLYTTQRVPLSEEVKARIRRARERESERTVVQTIDPRTTRGPDMVDRYAHV 1350

Query: 899  LEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLE 958
            ++    +H G+ +N+F   SLV   P S+    P ++F T  G IG++A +  ++   L 
Sbjct: 1351 MQRNAVWHYGDMINKFCRRSLVPD-PGSNAAVRPRLLFCTAAGAIGIVAHVRDDEAHLLA 1409

Query: 959  KLQTNLRKVIKG---------VGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLD--LS 1007
            K++ N+  +++          VG + H  WR+   + +      FLD ++++ F+D  L 
Sbjct: 1410 KVERNILSLLESSSTAAAAGVVGNIAHSDWRTLRTDHRVQAPAGFLDANVLQMFVDGRLD 1469

Query: 1008 RTRMDEISKTMNVSVEEL 1025
            R + D++    N   E L
Sbjct: 1470 RQQRDKVLAGPNSETEAL 1487


>gi|409078983|gb|EKM79345.1| hypothetical protein AGABI1DRAFT_39860 [Agaricus bisporus var.
            burnettii JB137-S8]
          Length = 1236

 Score =  275 bits (702), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 254/925 (27%), Positives = 404/925 (43%), Gaps = 160/925 (17%)

Query: 199  RPSITKAY--GRVDA----DG-SRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIA 251
            RP  T A+    + A    DG SR++LGD  G   +++++ +     GL +  LGE S  
Sbjct: 307  RPKATVAWPWSEISAWCCIDGLSRFILGDAYG--KMVLLSLDNLSDLGLLLIPLGEVSPP 364

Query: 252  STISYLDNAVVYIGSSYGDSQLIKL----------------------------------- 276
            +T++YL N V+Y+GS  GDSQL++L                                   
Sbjct: 365  TTLTYLTNQVIYVGSHLGDSQLVQLTSTPTTSTDKPTLPIPCDVHVIPSSSFDISAFKKG 424

Query: 277  ------------------NLQPDAKGSYVEVLERYV-------NLGPIVDFCVVDLERQG 311
                               + P   G+ VE    Y+       N+ PI+D C+VD +  G
Sbjct: 425  KARATSPEFDAMDLDGCDTVSPQESGNIVETRGSYLNVLERFKNIAPILDACLVDPD-SG 483

Query: 312  QGQVVTCSGAYKDGSLRIVRNGIGINEQASVE-LQGIKGMWSLRSSTDDPFDTFLVVSFI 370
            Q Q+VTCSG    GS+  VRNG    E  ++  +  +  +W+L+S  ++  D+FL++SF 
Sbjct: 484  QRQIVTCSGGKNTGSINAVRNGADFEEIVNIPGVPHVLKIWALKSRLEEAEDSFLLLSFC 543

Query: 371  SETRILAMN--------------LEDELEETEIE-GFCSQTQTLFCHDA----IYNQLVQ 411
            + + ++ +N              +   L   E    F +  Q +   D     + + LV 
Sbjct: 544  NSSSLIKINDSGDNISFAPFDNAIASGLVTVEPTIAFANVAQRVKGQDGKARYMNSSLVV 603

Query: 412  VTSGSVRLVSSTSRELR-----NEWKSPPGYS------VNVATANASQVLLATGGGHLVY 460
              +GS   +    + L+     +EW +    +      +  A+ N+SQV LA  GG L  
Sbjct: 604  QVTGSGAFLLELDQGLQTYIRIDEWNAKKNVADGITPEIVAASINSSQVALAISGGKLAL 663

Query: 461  LEIGDG-----ILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPD 515
            L + +      +++ +     + EIS +  +P+    ++SQ   V  W    + +F+  D
Sbjct: 664  LSVAEDKKLRVVVSTLNDCNRQPEISAISCSPLNPARAFSQHIVVSYWESNIIEVFTPSD 723

Query: 516  LNLITKEHLGGEIIPRSVLLCAFEGI------------SYLLCALGDGHLLNFLLNMKTG 563
                 K       +P  V    F                YLL  L DG +  F    +  
Sbjct: 724  SGF--KSVCKSSPLPSLVSSLRFYNFGSDQSSKGADYHPYLLAGLSDGSVATF--RWQDQ 779

Query: 564  ELTDRKKVSLGTQPITLRTFSS-KNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMC 622
            EL +RK VSLG  P++L   S   +   V AA DR  V      +L+YS + L+ ++   
Sbjct: 780  ELKERKIVSLGHAPVSLAVCSILGDAKTVLAAGDRAVVFAYERGRLVYSPILLRNIAAAS 839

Query: 623  PFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG-EHPRRICHQEQSRTFAICSLK 681
            P N+++ P SL  A    L IG I D+ KLHIRSIP G ++PR+I HQ     F +  + 
Sbjct: 840  PLNTSSLPMSLVAATAEGLHIGKIKDLNKLHIRSIPFGLDNPRKITHQPLLSAFGVALVY 899

Query: 682  NQSC---AEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDS--NVYYCVGT 736
             +     A E    F  LLDD +FE ++ +  D  E   SI+  S   D     ++ +GT
Sbjct: 900  TEPAEIGAHEVRKSFFSLLDDASFEVLAKFDCDADEEIVSIIPFSAVVDGILTPFFVLGT 959

Query: 737  AYVLPEENEPTKGRILVFIV----------EDGKLQLIAEKETKGAVYSLNAF-----NG 781
                PEE EP KGRIL+  +          +  +L L A  E +G VY+L          
Sbjct: 960  FIFRPEEFEPDKGRILLISISTTNNPRNPRQGYQLSLAASIEVRGCVYALTPIVDDKPVA 1019

Query: 782  KLLAAINQKIQLYKWMLRDDGTR------ELQSECGHHGHILALYVQTRGDFIVVGDLMK 835
            +++AA+N  + L+     D  T+       L+     + + L   +   G+ + VGD + 
Sbjct: 1020 RIVAAVNSSVNLFSL---DIDTKIYPAGLHLRKMAEWNHNYLVTGLGAVGNHVFVGDQIS 1076

Query: 836  SISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEE 895
            S+SLL  K  E   +  ARDY   W  +VE +D+   + A +  N+FT       + +  
Sbjct: 1077 SVSLL--KCTEEKFQTVARDYGPRWPVSVEAIDEKNVIAANDALNIFTFTL----SRNLG 1130

Query: 896  RGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQY- 954
            R  L+  G YH+ + V +F  GSLV           P  +F T +G IGVI ++ +E   
Sbjct: 1131 RSVLDCTGNYHIADLVTKFIRGSLVTIEQSEKADLEPEELFFTSSGCIGVIVNVKNEDVS 1190

Query: 955  LFLEKLQTNLRKVIKGVGGLNHEQW 979
            + L  LQ N+  +I  VGG +H ++
Sbjct: 1191 IHLSGLQRNMSAIIPNVGGTSHARY 1215


>gi|426195893|gb|EKV45822.1| hypothetical protein AGABI2DRAFT_72896 [Agaricus bisporus var.
            bisporus H97]
          Length = 1229

 Score =  273 bits (699), Expect = 3e-70,   Method: Compositional matrix adjust.
 Identities = 246/916 (26%), Positives = 396/916 (43%), Gaps = 174/916 (18%)

Query: 212  DG-SRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGD 270
            DG SR++LGD  G   +++++ +     GL +  LGE S  +T++YL N V+Y+GS  GD
Sbjct: 319  DGFSRFILGDAYG--KMVLLSLDNLSDLGLLLIPLGEVSPPTTLTYLTNQVIYVGSHLGD 376

Query: 271  SQLIKL-----------------------------------------------------N 277
            SQL++L                                                      
Sbjct: 377  SQLVQLTSTPTTSTDKPTLPIPCDVHVIPSSSFDISAFKKGKARATSPEFDAMDLDGSDT 436

Query: 278  LQPDAKGSYVEVLERYV-------NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIV 330
            + P   G+ VE    Y+       N+ PI+D C+VD +  GQ Q+VTCSG    GS+  V
Sbjct: 437  VSPQESGNIVETRGSYLNVLERFKNIAPILDACLVDPD-SGQRQIVTCSGGKNTGSINAV 495

Query: 331  RNGIGINEQASVE-LQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMN---------- 379
            RNG    E  ++  +  +  +W+++S  ++  D+FL++SF + + ++ +N          
Sbjct: 496  RNGADFEEIVNIPGVPHVLKIWAVKSRLEEAEDSFLLLSFCNSSSLIKINDSGDNISFAP 555

Query: 380  ----LEDELEETEIE-GFCSQTQTLFCHDA----IYNQLVQVTSGSVRLVSSTSRELR-- 428
                +   L   E    F +  Q +   D     + + LV   +GS   +    + L+  
Sbjct: 556  FDNAIASGLVTVEPTIAFANVAQRVKGQDGKARYMNSSLVVQVTGSGAFLLELDQGLQTY 615

Query: 429  ---NEWKSPPGYS------VNVATANASQVLLATGGGHLVYLEIGDG-----ILTEVKHA 474
               +EW +    +      +  A+ N+SQV LA  GG L  L + +      +++ +   
Sbjct: 616  IRIDEWNAKKNVADGITPEIVAASINSSQVALAISGGKLALLSVAEDKKLRVVVSTLNDC 675

Query: 475  QLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLIT-------------- 520
              + EIS +  +P+    ++SQ   V  W    + +F+  D    +              
Sbjct: 676  NRQPEISAISCSPLNPARAFSQHIVVSYWESNIIEVFTPSDSGFRSVCKSSPLPSLVSSL 735

Query: 521  --------KEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVS 572
                    +   G +  P            YLL  L DG +  F    +  EL +RK VS
Sbjct: 736  RFYNFGSDQSSKGADYHP------------YLLAGLSDGSVATF--RWQDQELKERKIVS 781

Query: 573  LGTQPITLRTFSS-KNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPD 631
            LG  P++L   S   +   V AA DR  V      +L+YS + L+ ++   P N+++ P 
Sbjct: 782  LGHAPVSLAVCSILGDAKTVLAAGDRAVVFAYERGRLVYSPILLRNIAAASPLNTSSLPM 841

Query: 632  SLAIAKEGELTIGTIDDIQKLHIRSIPLG-EHPRRICHQEQSRTFAICSLKNQSC---AE 687
            SL  A    L IG I D+ KLHIRSIP G ++PR+I HQ     F +  +  +     A 
Sbjct: 842  SLVAATAEGLHIGKIKDLNKLHIRSIPFGLDNPRKITHQPLLNAFGVAFVYTEPAEIGAH 901

Query: 688  ESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDS--NVYYCVGTAYVLPEENE 745
            E    F  LLDD +FE ++ +  D  E   SI+  S   D     ++ +GT    PEE E
Sbjct: 902  EVRKSFFSLLDDASFEVLAKFDCDADEEIVSIIPFSAVVDGILTPFFVLGTFIFRPEEFE 961

Query: 746  PTKGRILVFIV----------EDGKLQLIAEKETKGAVYSLNAF-----NGKLLAAINQK 790
            P KGRIL+  +          +  +L L A  E +G VY+L          +++AA+N  
Sbjct: 962  PDKGRILLISISTTNNPRNPRQGYQLSLAASIEVRGCVYALTPIVDDKPVARIVAAVNSS 1021

Query: 791  IQLYKWMLRDDGTR------ELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKH 844
            + L+     D  T+       L+     + + L   +   G  + VGD + S+SLL  K 
Sbjct: 1022 VNLFSL---DIDTKIYPAGLHLRKMAEWNHNYLVTGLGAVGSHVFVGDQISSVSLL--KC 1076

Query: 845  EEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGE 904
             E   +  ARDY   W  +VE +D+   + A +  N+FT       + +  R  L+  G 
Sbjct: 1077 TEEKFQTVARDYGPRWPVSVEAIDEKNVIAANDALNIFTFTL----SRNLGRSVLDCTGN 1132

Query: 905  YHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQY-LFLEKLQTN 963
            YH+ + V +F  GSLV           P  +F T +G IGVI ++ +E   + L  LQ N
Sbjct: 1133 YHIADLVTKFIRGSLVTIEQSEKADLEPEELFFTSSGCIGVIVNVKNEDVSIHLSGLQRN 1192

Query: 964  LRKVIKGVGGLNHEQW 979
            +  +I  VGG +H ++
Sbjct: 1193 MSAIIPNVGGTSHARY 1208


>gi|224104491|ref|XP_002333932.1| predicted protein [Populus trichocarpa]
 gi|222839243|gb|EEE77594.1| predicted protein [Populus trichocarpa]
          Length = 221

 Score =  270 bits (689), Expect = 4e-69,   Method: Compositional matrix adjust.
 Identities = 147/223 (65%), Positives = 163/223 (73%), Gaps = 28/223 (12%)

Query: 52  MGDVSDRIGRPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQV 111
           M DVS RIGRPT+NGQIGIIDPDCRLIGLHLYDGLFKVIP DNKGQLK+AF++R E L  
Sbjct: 1   MEDVSGRIGRPTNNGQIGIIDPDCRLIGLHLYDGLFKVIPSDNKGQLKQAFSMRHENLDG 60

Query: 112 LDIKFLYGCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVP 171
             I FL            QDNKDAR+VKTYEVALKDKDF+EG WSQNNLDNGA+LLIPVP
Sbjct: 61  FFILFLS----------LQDNKDARYVKTYEVALKDKDFIEGRWSQNNLDNGANLLIPVP 110

Query: 172 PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVIT 231
            P CGVLIIGEET+VYC+AN F+AIPI+P                L   H   L L+V+ 
Sbjct: 111 LPFCGVLIIGEETVVYCNANVFRAIPIKPM---------GHWWNCLSPKHMEELMLMVL- 160

Query: 232 HEKEKVTGLKIELLGETSIAST-ISYLDNAVVYIGSSYGDSQL 273
                  GLKI+LLGETSIAST ISYLDNA V+IGSSY DSQ+
Sbjct: 161 -------GLKIKLLGETSIASTIISYLDNASVFIGSSYKDSQV 196


>gi|443894313|dbj|GAC71661.1| hypothetical protein PANT_5d00006 [Pseudozyma antarctica T-34]
          Length = 1625

 Score =  265 bits (677), Expect = 9e-68,   Method: Compositional matrix adjust.
 Identities = 257/979 (26%), Positives = 425/979 (43%), Gaps = 196/979 (20%)

Query: 217  LLGDHAGLLHLLVITHEKEKVT----------GLKIELLGETSIAS---TISYLDNAVVY 263
            L    +G L++L +T   E V+           +K+   G +S  +    ISYL   VV 
Sbjct: 633  LFACSSGALNVLTLTLPAEAVSESSCIPIQPRSMKVATFGYSSQPAGPQAISYLGEGVVC 692

Query: 264  IGSSYGDS-------------------QLIKLNLQPDAK----GSY-------------- 286
            IGSS GD+                   QLI     P       GS+              
Sbjct: 693  IGSSTGDTLLHQVVPRDVDEAADPTGAQLISPPSSPTQSRRRSGSFQLASSSQGRTELPS 752

Query: 287  ---VEVLERYVNLGPIVDFCVVD-------LERQGQGQVVTCSGAYKDGSLRIVRNGIGI 336
               +  +E + NLGPIVDF V D            Q ++VTCSGA   GS+R VR G  +
Sbjct: 753  AGTLHTIESWQNLGPIVDFVVDDGAGGDPSASSGAQARIVTCSGAGPSGSIREVRTGASV 812

Query: 337  NEQASVELQGIKGMWSLRS-STDDPFDTFLVVSFISETRILAMNLEDELEET--EIEGFC 393
             E  ++ +   + +W + +  T     + ++V F + T  L  +++  L +    +   C
Sbjct: 813  QEVCTLPIPNAQHIWPVHAGPTGSKQTSGILVGFATSTAYLYFDVDGNLVDATDRLAAVC 872

Query: 394  SQTQ--TLFCHDAIYNQ----LVQV--TSGSVRLVSSTSRELRNEWKSPPGYSVNVATAN 445
              T   TL     +       L++V  T  S+      S  L ++W  P G  +  A+AN
Sbjct: 873  RSTANPTLAASSLVDESGSALLLRVAHTDFSLFRCQGGSIALLHQWSPPSGCEITTASAN 932

Query: 446  -ASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWT 504
               Q ++A     LVYL      LTE     LE+E+SCLDI+ +      +QIAA+G W 
Sbjct: 933  DLGQCVIALSDKSLVYLTAEAESLTERGKTSLEHEVSCLDISCLSPG-EPAQIAAIGFWQ 991

Query: 505  DISVRIFSLPDLNLITKEHLGGE---IIPRSVLLCAF------EGIS------------- 542
              +++I SLPDL+ + +  +  +    +PRS+LL  F      +G S             
Sbjct: 992  ARTIQILSLPDLSPVGQSSVVQQQLPAVPRSLLLHRFASKQTRQGASRQSRTSLRNRDAL 1051

Query: 543  --YLLCALGDGHLLNFLLNMKTGE-------LTDRKKVSLGTQPITLRTFSSKNTTHVFA 593
              +LL  LGDG L+ + +++ T E       L D K +SLGTQ + L    + +   V A
Sbjct: 1052 SPHLLIGLGDGTLITYSVSLPTDESFSRTVGLFDCKTLSLGTQALKLDAIETSSGARVVA 1111

Query: 594  AS-DRPTVIYSSNKKLLYSNVNLKEVSHMCP----------FNSAAFPDSLAIAKEGELT 642
             S  RPT++++ +K+  Y+   L+     C            N+ +  DS+ +A  G L 
Sbjct: 1112 VSGSRPTLVFADSKRFSYNA--LQHGDQRCVATLYVGAEQVLNAFSLGDSVKLASVGTL- 1168

Query: 643  IGTIDDIQKLHIRSIPLG-EHPRRICHQEQSRTFAICSL----KNQSCAEESEMHFVRLL 697
                   Q+  +R+ PLG + P  I      + FA+C+     +  +   +     VR+L
Sbjct: 1169 -------QQRDVRTFPLGLDQPLAIAQWPARQAFAVCTWAFLPRGSASGGDKPRGAVRIL 1221

Query: 698  DDQTFEFISTYPLDTFEY-GCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIV 756
            D  TFE +    L+  E   C  +  ++  +      +GT Y+  +  E   GR++ F V
Sbjct: 1222 DQTTFEVLDEIRLEPDERPNCIAVLHAYGTE---MLVIGTGYIDSQSQETVSGRLVGFDV 1278

Query: 757  ---------EDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWM--LRDDGTR- 804
                     E G+L+ + E +  G VYS+ +   +L AA+N ++++Y  +   R D +  
Sbjct: 1279 SPGSSRTKEERGRLRRLFEHDENGNVYSVQSIGNRLAAAVNSEVKIYSVIDPRRGDASSP 1338

Query: 805  --ELQSECGHHGHILALYVQT-RGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWM 861
              +++         +A  +     D IVVGD ++S+++L    +   + E ARD +  W 
Sbjct: 1339 KIKIKQRGSWASSFIACSLSVVEPDRIVVGDALRSMNVLHVHPQTARVSEIARDCDPFWT 1398

Query: 862  SAVEILDD--DIYLGAENNFNLFTVRK--------------------NSEGATDEERGR- 898
            SA E+LDD    Y+GA+ +FNL+T ++                     +  A+D + G  
Sbjct: 1399 SATELLDDASQTYIGADISFNLYTTQRVLLSDEVKTRMRRAREREQERTVSASDSQGGGR 1458

Query: 899  ---------LEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASL 949
                     ++    +H G+ +++F   SLV     ++    P +IF T  G IGVIA +
Sbjct: 1459 DDGDQYAHVMQRSAVWHYGDMISKFCRKSLVPSA-GANAAIQPKLIFCTAAGAIGVIAHV 1517

Query: 950  PHEQYLFLEKLQTNLRKVIKG------------VGGLNHEQWRSFNNEKKTVDAKNFLDG 997
              E+   L K++ N   +I+             VGG+ HE+WR+   + +      FLD 
Sbjct: 1518 RDEEAQLLAKVERNTLSLIESGGGESNAASAGVVGGIRHEEWRTLRTDHRVQSPAGFLDA 1577

Query: 998  DLIESFLD--LSRTRMDEI 1014
             +++ F+D  L R + D I
Sbjct: 1578 GVLKMFIDGRLDRAQRDRI 1596


>gi|402582698|gb|EJW76643.1| CPSF A subunit region family protein [Wuchereria bancrofti]
          Length = 418

 Score =  264 bits (674), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 147/352 (41%), Positives = 206/352 (58%), Gaps = 46/352 (13%)

Query: 470 EVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEII 529
           E+K A   Y I  +D    G+    S+I AVG WTD+SV + +LP L  + +E + G+++
Sbjct: 56  EIKGAA--YSIQSMD----GKGTLRSEICAVGYWTDLSVALRALPQLVEVVREKIAGDML 109

Query: 530 PRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTT 589
            RS++L   EG  YLL ALGDG +  F ++MKTG L D KK +LGTQPI LR F S+ + 
Sbjct: 110 SRSIMLSPMEGHVYLLVALGDGTVHYFQIDMKTGALLDPKKATLGTQPIHLRKFRSRCSP 169

Query: 590 --HVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTID 647
             ++F  SDRP VIYSSN+KLL+SNVNL+ VS M P  + A+PD+L +     L IG ID
Sbjct: 170 VHNIFVCSDRPAVIYSSNQKLLFSNVNLRMVSTMTPLYAEAYPDALVLTDGHSLVIGRID 229

Query: 648 DIQKLHIRSIPLGEHPRRICHQEQSRTFAI------------------CSLKN------- 682
           DIQKLHIR++PLGE P RI +Q ++ T A+                  C+ KN       
Sbjct: 230 DIQKLHIRTVPLGESPSRIAYQPETNTIAVIVERLEFVDAMGKHHFGQCASKNAMETSSS 289

Query: 683 -----------QSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVY 731
                      +  AEE E+  V LLD  TFE + ++ L+  E   S+ SC   +DS  Y
Sbjct: 290 RLSSMRREPTPECLAEEMEVSSVLLLDSNTFEILHSHELEGSEMAMSLTSCQLGNDSQPY 349

Query: 732 YCVGTAYVLPEENEPTKGRILVFIVEDG--KLQLIAEKETKGAVYSLNAFNG 781
           + VGTA ++ +E E   GRI++F   +G  +++L+ EKE KGA YS+ + +G
Sbjct: 350 FVVGTAVIMSDETESKMGRIMMFQASEGPERMRLVYEKEIKGAAYSIQSMDG 401



 Score = 63.5 bits (153), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 28/67 (41%), Positives = 44/67 (65%), Gaps = 2/67 (2%)

Query: 718 SILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDG--KLQLIAEKETKGAVYS 775
           S+ SC   +DS  Y+ VGTA ++ +E E   GRI++F   +G  +++L+ EKE KGA YS
Sbjct: 4   SLTSCQLGNDSQPYFVVGTAVIMSDETESKMGRIMMFQASEGPERMRLVYEKEIKGAAYS 63

Query: 776 LNAFNGK 782
           + + +GK
Sbjct: 64  IQSMDGK 70


>gi|353235645|emb|CCA67655.1| hypothetical protein PIIN_01483 [Piriformospora indica DSM 11827]
          Length = 1145

 Score =  262 bits (670), Expect = 6e-67,   Method: Compositional matrix adjust.
 Identities = 273/1051 (25%), Positives = 465/1051 (44%), Gaps = 107/1051 (10%)

Query: 37   VLQWDAESSELITRAMGDVSDRIGRPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNK- 95
            VL++  +   L+  +   +    GRP        I PD R   + L+ G   ++ F  + 
Sbjct: 84   VLEYSPKQRNLVVSSSQKLVLPYGRPLSGCVNCAISPDGRHAAIALFQGTMTILSFTREK 143

Query: 96   --GQLKEAFNIRLEELQVLDIKFLYGCAKPTIVV--LYQDNKDARHVKTYEVALKDKDF- 150
               ++    +  + EL +L   FL   A+  I +  L++    +RH+ T  V L   D  
Sbjct: 144  GVARIARKVDSAIRELLLLSFAFLPLPAQDGIAISLLHKSYTGSRHISTRGVDLDTLDIS 203

Query: 151  VEGPWSQNNLDNGADLLIPVPPPLC---GVLIIGEETIVYCSANAFKAI--PIR------ 199
             EG   +   ++    +I VP       G+L+ G +  ++  A+    +  P +      
Sbjct: 204  AEGSSIELREEDEPTRMICVPRAASKPGGLLLFGPDCTIFYQADKLHKMQSPAKSRRTST 263

Query: 200  --PSITKAYGRVDADG------SRYLLGDHAGLLHLLVITHEKEKVTG--LKIELLGETS 249
              P + + +   D  G      SR LL D  G L LL + ++ +K+    + I LLGE S
Sbjct: 264  RQPDLYEEWKYSDVSGYGFIDESRLLLSDKYGKLVLLALDNDPKKIPAGAINIHLLGEAS 323

Query: 250  IASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLER 309
              S I+YL+  V++IGS  GDSQL+K+          +EV++ + N  PI D  + DL+ 
Sbjct: 324  AGSCIAYLNAGVIFIGSETGDSQLMKIT-----SSGKLEVIDTFSNTAPIADAVLADLDN 378

Query: 310  QGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQ-GIKGMWSLRSSTDDPFDTFLVVS 368
             G   VVTCSG  + GSLR +R+G  I E AS+E    IK ++ L+  T      ++++S
Sbjct: 379  TGDHVVVTCSGNGRTGSLRTIRSGANIEELASMETSIPIKNIFPLQE-TSGTSHLYMLIS 437

Query: 369  FISETRIL----AMNLEDELEETEIEGFCSQTQTLFCHD---AIYNQL---VQVTSGSVR 418
            +  ET+++    A  L  EL   +  G   +  TL   +    I++     VQVT+ +  
Sbjct: 438  YDQETKLVDAREAPRLS-ELSAHQFPGVAREFPTLAAGNVRRTIFDTTTLAVQVTTRAAI 496

Query: 419  LVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEY 478
            L    S      W      S+  A+ +   V L   GG ++ L++       V  A+  +
Sbjct: 497  LFDVQSGAEYCRWSG----SITTASVSGDAVCLGLRGGKVIALKVDIEAAKLVTQAERNF 552

Query: 479  --EISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLC 536
              EIS L I PI      S +  +G W D  V+I  L +L  +  E +     P SVL  
Sbjct: 553  DKEISTLSIEPIQSGEVTSNVVVIGFWEDFLVKICQLHNLAQV-GEDIETPHTPHSVLAW 611

Query: 537  AF----EGISYLLCALGDGHLLNFLLNMKTGEL--TDRKKVSLGTQPITLRTFSSKNTTH 590
             F    EG+ YLL   G+GH+L+  L      +  T R+ V LG +P+ L   S      
Sbjct: 612  NFGDRKEGL-YLLVGTGNGHILSVKLKETKNRVLATSRRTVVLGDRPVLLHRCSIAGAEV 670

Query: 591  VFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQ 650
            + A   R  ++  SN ++   +VN+K +  + PF S+AF D+L       L+I  I  ++
Sbjct: 671  IMATGSRAMLLSWSNGRIAQHHVNIKNIESVAPFTSSAFGDALIFKLTKGLSIARIGKLE 730

Query: 651  KLHIRSIPLG-EHPRRICHQEQSRTFAICSLKNQSC---AEESEMHFVRLLDDQTFEFIS 706
            KL I S+ LG + P  + +    + FA+  L+ +     A ++     +L+D  TFEF++
Sbjct: 731  KLKIDSVSLGYDVPNTLAYHPDIKAFAVGCLRTEPSLNGASDNIGSSFKLIDALTFEFLN 790

Query: 707  TYPLDTFEYGCSIL--SCSFSDDSNVYYCVGTAYVLPEE-NEPTKGRILVFIVEDGK--- 760
            ++     E    ++       D    Y  VGTA    EE +EPTKGRIL+F     K   
Sbjct: 791  SHSFPPNEEVTKVIVGDLVMGDKQERYIIVGTAIWEDEEGSEPTKGRILLFRASLSKGMQ 850

Query: 761  ------------LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQS 808
                        L L+ E++  G+   L   + +L   +N  + +Y+ + R    + L+ 
Sbjct: 851  VGSGAANAPPPVLTLVLEQDIPGSAVGLAVVDHRLAIIVNTIVVVYE-LRRTQTAQGLEL 909

Query: 809  ECGH---HGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVE 865
            +      H + +   V      +V+GD ++S + L +   +  +E  A+D+    ++++ 
Sbjct: 910  KAVDQWIHNYAIWSIVPAGDSRVVIGDALQSATTLRWNGTK--LEVVAKDWTT--VNSLN 965

Query: 866  ILDDDIY-LGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLP 924
            +  D+ Y + ++ + NL + +         E   L+  G YH GE ++    GS+  R  
Sbjct: 966  VTADETYVIQSDIDGNLMSYKP--------EPPILQQTGHYHFGETISCLVPGSIRSRSN 1017

Query: 925  DSDVGQIPTVIFGTVNGVIGVIAS-LPHEQYLFLEKLQTNLRKVIKG------VGGLN-- 975
             +        +  T  G I +I   L  E+ + L  ++ N+   +K       VG     
Sbjct: 1018 QNGSIVAAKHVLLTPGGRISLIQEILDEEKEMTLLAIERNMSAALKAERDHYDVGSWRAP 1077

Query: 976  HEQWRSFNNEKKTVDAKNFLDGDLIESFLDL 1006
            H   R     ++ + +  FLDGD++   L+L
Sbjct: 1078 HVNTRRVITSEEPLQSYGFLDGDILSHALEL 1108


>gi|388853409|emb|CCF53029.1| related to UV-damaged DNA-binding protein [Ustilago hordei]
          Length = 1508

 Score =  262 bits (670), Expect = 7e-67,   Method: Compositional matrix adjust.
 Identities = 260/985 (26%), Positives = 423/985 (42%), Gaps = 182/985 (18%)

Query: 217  LLGDHAGLLHLLVITHEKEKVT--------------GLKIELLGETSIAS---TISYLDN 259
            L G ++G L++L +T  ++  +               +++E LG TS  +    +SYL +
Sbjct: 512  LFGCNSGALNVLQLTLPQDAASEVVPSHSAAAPQPKSMRVETLGTTSQPAGPQALSYLGD 571

Query: 260  AVVYIGSSYGDSQLIKLNLQ---------PDAK--------------------------- 283
             +V +GS+ GDS L K+  Q         P A+                           
Sbjct: 572  GLVCVGSATGDSCLYKILQQDTSEEMDTGPGAQLLSPPSSPTQSRRRRSSLAANTTPKST 631

Query: 284  ----GSYVEVLERYVNLGPIVDFCVVD-------LERQGQGQVVTCSGAYKDGSLRIVRN 332
                G  +  +E + NLGP+VDF V D            Q ++VTCSGA   GS+R VR+
Sbjct: 632  ELPCGGSLVTIETWQNLGPVVDFTVDDGAGGDPTTSASAQARIVTCSGAGPSGSIREVRS 691

Query: 333  GIGINEQASVELQGIKGMWSLRSSTDDPFDTF-LVVSFISETRILAM----NLEDELEET 387
            G  + + +S+ +   + +WS+ +       T  L++ F + T  L      NL D  +  
Sbjct: 692  GASVQDVSSLPIPNAQQIWSVEAGDATSKQTAGLLIGFATSTAYLHFDANGNLADATDRL 751

Query: 388  EIEGFCSQTQTLFCH---DAIYNQ-LVQVTSGSVRLVS--STSRELRNEWKSPPGYSVNV 441
               G  +   TL      DA     LV+V   +  LV+    +  L  +WK   G  +  
Sbjct: 752  SAVGVDTTLPTLTASTVFDASQGPLLVRVAKDAASLVNLQDEAATLVQQWKPSAGLEITT 811

Query: 442  ATANA-SQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAV 500
            A+ N   Q+++A+    L YL + +G L E    QL++E+SCLD + +      +Q AA 
Sbjct: 812  ASVNPYGQLVIASSDKSLSYLVVEEGALIERNKIQLDHEVSCLDTSSVTAG-QAAQFAAC 870

Query: 501  GMWTDISVRIFSLPDLNLITKE---HLGGEIIPRSVLLCAFEG----------------- 540
            G W   S++IF+LP+L  + +      G   +PRS+LL  F                   
Sbjct: 871  GFWQTRSIQIFALPELAPVGESLVVQQGFAAVPRSILLHRFASKQSEASSQTKSSVGNRD 930

Query: 541  --ISYLLCALGDGHLLNFLLNMKTGE-------LTDRKKVSLGTQPITLRTFSSKNTTHV 591
              + +LL  LGDG L++F L++   +       L+D K VSLG Q + L    S     V
Sbjct: 931  ALMPHLLIGLGDGTLVSFSLSLPRDDSYSKIVGLSDCKTVSLGKQALKLDAIESWAGARV 990

Query: 592  FAAS-DRPTVIYSSNKKLLYSNVNLKEVSH--MCPFNSAAFPDSLAIAKEGELTIGTIDD 648
             A S  RPT++Y+ +K+  Y+ +  K      M          + A+A+  EL   +I  
Sbjct: 991  VAVSGSRPTLVYADSKRFSYNALKHKSQRSVTMLHVGPGRVLGAFALAESVEL--ASIGA 1048

Query: 649  IQKLHIRSIPLG-EHPRRICHQEQSRTFAICS---LKNQSCAEESEMH-FVRLLDDQTFE 703
            +++  IR+ PLG + P  I      + FA+C+   L   S  + S+    +R+LD  TFE
Sbjct: 1049 LRQRDIRTFPLGLDQPLAIAQWPNRKVFAVCTWAFLPRGSATKASKSRGAIRILDQSTFE 1108

Query: 704  FISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIV------- 756
             +  + L+  E    I         +    VGT Y+   E+E   GR+L F V       
Sbjct: 1109 TLDEFRLEVDERPNCI--TVLRAQGHEMLVVGTGYISDGEHEVISGRLLGFDVSAGSIRG 1166

Query: 757  --EDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWM--LRDDGTRELQSECGH 812
              E G+L+ +  KE  G VYS+ + N +L  A+N ++++Y  +     D     +     
Sbjct: 1167 KEERGRLRKLFVKEQAGNVYSVQSINNRLATAVNSEVKIYSVVDPRASDEVSAPRINVVQ 1226

Query: 813  HGHILALYVQ-----TRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEIL 867
             G     ++         D IVVGD ++SI++L        + E ARD +  W SA E+L
Sbjct: 1227 RGSWACSFIACNLSVVEPDQIVVGDALRSINVLHVHPYTARLTEIARDCDPFWTSATELL 1286

Query: 868  DD--DIYLGAENNFNLFTVRK-------NSEGATDEERGRLEVVGE-------------- 904
            D+    Y+GA+ +FNL+T ++        +      ER     VGE              
Sbjct: 1287 DEASQTYIGADISFNLYTTQRVPLSEQVKARIRRTREREIERSVGEVNRLTTRDPNQVDR 1346

Query: 905  ----------YHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQY 954
                      +H G+ +N+F   SLV   P +     P ++F T  G IGVIA +  EQ 
Sbjct: 1347 YAHVMQRNAVWHYGDMINKFYRKSLVAD-PGTAAEVHPRLLFCTAAGAIGVIAHVQEEQA 1405

Query: 955  LFLEKLQTN------------LRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIES 1002
              L K++ N                   V  ++H+ WR+   + +      FLD ++++ 
Sbjct: 1406 RLLAKVERNILSLIELSSSISTSNSSSVVTSISHQDWRTLRTDHRVQAPAGFLDANILQM 1465

Query: 1003 FLD--LSRTRMDEISKTMNVSVEEL 1025
            F+D  L   + D++ +  N  +E L
Sbjct: 1466 FVDGRLDSKQRDKVVQGPNSEMEAL 1490


>gi|403177920|ref|XP_003888718.1| hypothetical protein PGTG_22575 [Puccinia graminis f. sp. tritici CRL
            75-36-700-3]
 gi|375173263|gb|EHS64847.1| hypothetical protein PGTG_22575 [Puccinia graminis f. sp. tritici CRL
            75-36-700-3]
          Length = 1326

 Score =  262 bits (669), Expect = 9e-67,   Method: Compositional matrix adjust.
 Identities = 268/1016 (26%), Positives = 444/1016 (43%), Gaps = 175/1016 (17%)

Query: 26   LFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDNGQIGIIDPDCRLIGLHLYDG 85
            L + T       L++  +S+ +IT A   +     RP D  Q  I+DP  R + LH  +G
Sbjct: 75   LLVLTADLNLFTLRFCPKSATIITTASISLHQIGARPADYVQTSIVDPHGRCVVLHALNG 134

Query: 86   LFKVIPF---------------------------------------DNKGQLKEAFNIRL 106
            +  VIP                                        D   ++  +F +RL
Sbjct: 135  ILHVIPLVPGCLSNLRNLDPVLSKRKKANVSTGLASHSRNLHPSHNDPDCEVYRSFQLRL 194

Query: 107  EEL--QVLDIKFLYGCAKPTIVVLYQDNKDARHVKTYEVALK----DKDFVEGPWSQNNL 160
             E+  Q L+   L     PT++V+Y ++   R +++ ++ L+    +++F  G    +  
Sbjct: 195  NEVNVQALNFATLSPNYPPTLLVVYSNHLGDRVLRSRKIDLQAAHCEQEFFRG---YHCC 251

Query: 161  DNGADLLIPVPPP----------LCGVLIIGEET--IVYCSANAFKAIPIRPSITKA--- 205
            D    L+IP                G ++IGEET  +V      + A   R +I +A   
Sbjct: 252  DPATALIIPFSLTGEAGEDSDGVNNGAILIGEETAQLVRFGLVQYSASDDRGNIREAEAT 311

Query: 206  ---------------------------YGRVDADGSRYLLGDHAGLLHLLVITHEKEKVT 238
                                       +  VD   + +LLGD  G L  L +   K+  T
Sbjct: 312  AGQSNQLTEQGVIGGHVLRLPLGHYTCFCPVDGVPNCWLLGDLYGNLIFLFL---KQSTT 368

Query: 239  GLKIEL----LGETSIASTISYLDNAVVYIGSSYGDSQLIKL-----NLQPDAKGS-YVE 288
            G    L     G       + Y+ +  V++ S YGDSQL+KL     +L PD   S   E
Sbjct: 369  GTPPSLHYFHAGHVPSPEALVYITSGFVFLASHYGDSQLLKLPSPTSSLNPDNTASAQPE 428

Query: 289  VLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIK 348
            V+  + NL PI DFCV +  +    Q+VTCSG+++DGSLR++++GIGI E  S+E+ G++
Sbjct: 429  VITTFPNLAPISDFCVTEDRKSLVNQIVTCSGSHRDGSLRVIKHGIGIRESGSLEVGGVQ 488

Query: 349  GMWSLRSSTD----DPFDTFLVVSFISETRILAMNLEDELEETEI-EGFCSQTQTLFC-- 401
             +W+LRSST     + FD  LV+S    TR LA+N +  +EE  +  GF S   T+    
Sbjct: 489  RLWALRSSTHVNSVEDFDDRLVLSCADCTRFLALNEDGTIEEIGLFNGFESDVPTILAGN 548

Query: 402  ----HDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGH 457
                 D+     +QVT  S ++++  +      W+     S+  A  + +   ++     
Sbjct: 549  LLDGSDSTTRYSIQVT--SRKIIAGDAL----VWEPDDAKSITRAALSVTTCAVSL-KEQ 601

Query: 458  LVYLEIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLN 517
            +V L I DG   E    +L  +IS L I+   EN       A   W    + I S+   +
Sbjct: 602  VVVLCIKDGKFVEKGTYKLLNDISSLAIDQ-SEN-----FVAAAQWVTNCIEIISVSSSS 655

Query: 518  LITKEHLGGEIIPRSVLLCAFEGIS----YLLCALGDGHLLNFLLNMKTG---ELTDRKK 570
             I + +   + +  S+ +  FEG       LL  LGDG ++N  L   TG   E    + 
Sbjct: 656  TICRVNTDSDFMVNSLKMTNFEGTESDGCRLLIGLGDGKIMNVALG-PTGMHVEGDSPRF 714

Query: 571  VSLGTQPITLRTFSSKNTTHVFAASDRPTVI--YSSNKKLLYSNVNLK--EVSHMCPFNS 626
             +LG +PI   +  +     ++A SD+PT+I    +N +  Y+ V ++   VS     ++
Sbjct: 715  TTLGIRPIEFVSMRNATGEFLWANSDQPTIIDRIQNNGRFAYTPVTVQGGSVSSATGLHA 774

Query: 627  AAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG-EHPRRICHQEQSRTFAICSLKNQSC 685
              F DSL +A   E+ IG ++  +K++I  I LG E PRRI H E  + + +   + +  
Sbjct: 775  RFFQDSLVLASNDEIRIGKLNTTEKMNILKISLGNEQPRRIAHSEDMKAYGVVCARLELD 834

Query: 686  AEESEMHFV---RLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPE 742
             +   +H +   ++ DD+TF+ + TY     E G SI +    ++   ++ +GT  +   
Sbjct: 835  QDTGTIHRIGTFKVFDDETFQLLYTYNFGPMEQGSSIAAVKLGEEMIEHFVIGTGVIKST 894

Query: 743  ENEPTKGRILVFIVEDGKLQLIAEK---------ETKGAVYSLNAF-NGKLLAAINQKIQ 792
            E E T GRIL     + K  L A+K         +  GAV  +    NG  +A+ N  + 
Sbjct: 895  EAEATIGRILAIRELNSKQDLTAKKRHFELTNVGKLSGAVGGVGGLPNGMFVASANAFVH 954

Query: 793  LYKWMLRD--------------------DGTRELQSECGHHGHILALYVQTRGDFIVVGD 832
             +     D                    DG   L    G  G  ++  V T G  ++VGD
Sbjct: 955  AFGLKKGDSGRAFPSGTDTVLAGSVPEMDGGFRLLDTWG--GGFVSQTVVTDGTKVLVGD 1012

Query: 833  LMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNS 888
            L KS+ LL +  E   +  +ARD++A  +  +  + +  ++ A++ FN+FTV+ +S
Sbjct: 1013 LYKSVVLLEFDLEHLELAVKARDFSAMSVRPIGAISEREFVAADSEFNMFTVQYDS 1068



 Score = 62.4 bits (150), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 39/117 (33%), Positives = 61/117 (52%), Gaps = 2/117 (1%)

Query: 917  GSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLN 975
            GSLV R  +S +     +IF T  G IG+IA +   ++   L + Q++L K+   VG L 
Sbjct: 1151 GSLVPRFIESSLIGETKLIFVTSTGGIGLIAKIHSKKKTKQLARFQSDLSKISTSVGNLA 1210

Query: 976  HEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDE-ISKTMNVSVEELCKRVEE 1031
            H  +R F  E + + +  FLDGD +E  LDL+   ++  + K M +  EE   +  E
Sbjct: 1211 HSAYRMFKTESRKIPSMGFLDGDFLEGCLDLTPDEVENLVKKMMALKAEEAQAKATE 1267


>gi|384253371|gb|EIE26846.1| hypothetical protein COCSUDRAFT_52476 [Coccomyxa subellipsoidea
            C-169]
          Length = 1205

 Score =  260 bits (664), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 291/1171 (24%), Positives = 482/1171 (41%), Gaps = 171/1171 (14%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
            ++G I +L  FR  G ++D++ + ++  +  +L++  E +               R    
Sbjct: 57   VFGCIRSLAPFRLTGASRDYVIVGSDSGRIVILEYIKEQNYFRKIHQETYGKSGCRRIVP 116

Query: 66   GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL-----Y 118
            GQ   +DP  R  LIG      L  V+  D    L    +  LE  +  +I F       
Sbjct: 117  GQYLAVDPKGRACLIGAMEKQKLVYVLNRDQAANL--TISSPLEAHKSHNIVFSVIGLDM 174

Query: 119  GCAKPTIVVLYQDNKDARHVKTYEVA--------LKDKDF----VEGPWSQNNLDNGADL 166
            G   P    +  D  DA    T E A        L + D     V   WSQ  +DNGA+L
Sbjct: 175  GFDNPVFAAIELDYADADQDPTGEAAGNAQKQLTLYELDLGLNHVTRKWSQE-VDNGANL 233

Query: 167  LIPVPPPL---CGVLIIGEETIVYCS---ANAFKAIPIRPSITKAYGRVDADGSR----- 215
            L+PVP       GVL+  E  I+Y +   A+    IP R S+    G +    +      
Sbjct: 234  LVPVPGGADGPGGVLVCAENFIIYSNQDHADVRAVIPRRTSLLIDRGVLIVSYAMHKQKS 293

Query: 216  ---YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDS- 271
               +L+    G L+ + + +  + VT LKI+         +I  L    ++  S +G+  
Sbjct: 294  LFFFLVQSEYGDLYKVTLAYSGDTVTELKIKYFDTIPPCISICVLKTGFLFAASEFGNHA 353

Query: 272  --QLIKLNLQPDA-KGSYVEVLE------------------RYVN----LGPIVDFCVVD 306
              Q + +    DA + S  E+ E                  R V+    L PI+DF V +
Sbjct: 354  LYQFLSIGDDDDAVEASSAELTETQEGFQPVFFDPRPLRNLRLVDEVESLSPIMDFKVAN 413

Query: 307  LERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTDDPFDTFL 365
            L ++   Q+    G     +LR++R G+   E A   L G    +W+L+ S +DPFD ++
Sbjct: 414  LLKEEIPQLYAMCGRGARSTLRVLRPGLAATEIAVSPLPGNPTAVWTLKRSVNDPFDAYI 473

Query: 366  VVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSR 425
            +VSF + T +L++   + + E    GF     TL       + ++QV    +R + +  R
Sbjct: 474  IVSFTNATLVLSIG--ETVVEVNDSGFLGTVPTLRTQLLTDDSMLQVHPSGLRHIRADRR 531

Query: 426  ELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD-GILTEVKHAQLEYEISCLD 484
               NEW++P   ++  A  N  QV++A  GG L+Y E+   G L EV+  ++  ++ CLD
Sbjct: 532  V--NEWRAPGRRTIVKAATNEQQVVIALSGGELIYFELSPTGQLMEVEKKEMAGDVVCLD 589

Query: 485  INPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGG-EIIPRSVLL-------- 535
            I P+ E    S+  AV  + D SVRI SL   ++++   +     +P S+L         
Sbjct: 590  IAPVPEGRQRSRFLAVASY-DSSVRILSLDPEDMMSALAVQMVSAVPESLLFIDSPAADI 648

Query: 536  ------CAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTT 589
                   +  G  +L   L +G LL   ++  TG+L+D +   LGT+P  L   S +   
Sbjct: 649  AGKGEDASGAGGLFLNIGLLNGVLLRTEVDKVTGQLSDTRTRFLGTRPPKLFAVSVRGKR 708

Query: 590  HVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDI 649
             + A S RP + YS   +   + ++ + + +   F S   P+      +  L I T++ +
Sbjct: 709  SMLALSSRPWLGYSDMGRYTLAPLSYEALDYASGFASDQCPEGFCAVSKSMLRILTLERL 768

Query: 650  -QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSC----------------------- 685
             +  + +   L   PR+     +S    +    + +                        
Sbjct: 769  GEAFNQQVTRLRYTPRKFVVHPESNMLIVAEADHAAVPLAERRAVEDGMEMDAALTEGIE 828

Query: 686  -----AEESEMH------------FVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSD-- 726
                 A E E H             +R+LD  + +  S   LD  E   S+    FS+  
Sbjct: 829  FDEERAAEEEQHGAPKNSTGRWASCIRVLDPTSLQTSSVLELDGNEAAVSLCLLRFSNWP 888

Query: 727  DSNVYYCVGTAYVL---PEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKL 783
            +  +   VGT   L   P   +    R+  F     +L+LI +  T G   +L AF G+L
Sbjct: 889  EEGMVLAVGTVQGLAFYPRTADEGYIRLYRFRDSGRQLELIHKTPTGGIPGALAAFKGRL 948

Query: 784  LAAINQKIQLYKWMLRDDGTRELQSECGHHG---HILALYVQTRGDFIVVGDLMKSISLL 840
            LA +   +++Y     + G ++L  +C H     HI  L   T GD I VGDL +S+   
Sbjct: 949  LAGVGPTLRIY-----EAGKKKLLRKCEHRKLPTHIATL--ATSGDRIFVGDLQESMHYF 1001

Query: 841  IYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERG--- 897
             YK  E A+ E A D     ++A   LD D   GA+   N+F  R   + +T  E     
Sbjct: 1002 RYKANENALYEYADDIAPRHLTAALPLDYDTVAGADKFCNIFVTRLPRDVSTQVEEDPTG 1061

Query: 898  ---------------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGV 942
                           +LE V  +H+G+ V   +   L         G    +++ TV G 
Sbjct: 1062 GKFAGAAGLLNGAPHKLEDVVNFHVGDLVTSLQRAVL-------QPGGREVLLYATVMGA 1114

Query: 943  IGVIASLP-HEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIE 1001
            IG +   P  E   F   L+ +LR+    +GG +H  +R           K+ +DGDL E
Sbjct: 1115 IGAMLPFPSREDVDFFSHLEMHLRQEHPPMGGRDHMSYRG-----SYFPVKDVIDGDLCE 1169

Query: 1002 SFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
             F  L   +   I+  +  +  E+ K++E++
Sbjct: 1170 HFSQLPAAKQKSIADELERTPGEILKKLEDI 1200


>gi|440300137|gb|ELP92626.1| DNA damage-binding protein, putative [Entamoeba invadens IP1]
          Length = 1086

 Score =  259 bits (662), Expect = 6e-66,   Method: Compositional matrix adjust.
 Identities = 246/1037 (23%), Positives = 492/1037 (47%), Gaps = 53/1037 (5%)

Query: 23   QDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDNGQIGIIDPDCRLIGLHL 82
            +DF+ I T  ++  + + +  + +++      V DR+GR    G +G++  +  LI L+L
Sbjct: 77   RDFVIILTSNHQLTITRLEGNTLKVVFGI--SVEDRLGRKAFYGALGVVMQNKYLI-LYL 133

Query: 83   YDGLFKVI--PFDNKGQLKEAF-NIRLEELQVLDIKFLYGCAKPTIVVLYQDNKDARHVK 139
            Y+   K+I  P  ++  L ++  ++R+++ +++ +  +        + L+ D    + V 
Sbjct: 134  YNHQIKIISLPQGDEELLSQSVKSVRIDQERIVGMHII----GDNQIALHSDQNTKKTVV 189

Query: 140  TYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIR 199
             YEV  K  DF     ++       D  +       G  ++   ++ Y      K+I + 
Sbjct: 190  FYEV--KTDDFSVSNSTEYQPIENVDFFVAHNN---GFFVVQNGSLQYHYNTNTKSINVA 244

Query: 200  -PSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLD 258
             P IT A      D +R LL D+ G  HL+ +  + E+V    +    + SI S + YLD
Sbjct: 245  LPQITIA-TTCFLDNNRLLLCDNTGKSHLITLRLDVEQVHYYPLPF-AKLSIPSKVVYLD 302

Query: 259  NAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTC 318
            N V++ GSS G+S LIK+  +       +EVLE + N GPI+D   +  E   +  ++ C
Sbjct: 303  NGVLFWGSSGGNSYLIKIGEER------MEVLETFENRGPILDMITLHDEISKKDDLLIC 356

Query: 319  SGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSS----TDDPFDTFLVVSFISETR 374
            S  Y  G+L+++R+G+G+N    VE +GI+ MW    +    +DD  + +++        
Sbjct: 357  SNTYHQGTLKLLRSGVGVNILGEVEYRGIEKMWKCFENAGDMSDDYKEMYVIEGAFGSNF 416

Query: 375  ILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSP 434
            +     + +++  EI    +    +        +LV VT+G + +       +  ++ S 
Sbjct: 417  VHIEGRKGDVQIVEIANSVTVKGRIVGVGDFDGKLVVVTNGGISIARVHDTIVEEQFVST 476

Query: 435  PGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDINPIGENPSY 494
               +  V+       ++ +G   +   +    +++++K  + E EIS +    +    + 
Sbjct: 477  EEITHCVSEGKD---VVYSGRNTVKLFDAEKCVMSDIK--RYEEEISSIGCYTVSSENTR 531

Query: 495  SQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLL 554
             +   VG W    + I  + D   +      G+++ +S+          ++  LGDG ++
Sbjct: 532  ERYIGVGRWEHKEIEI--IDDQGDVLDRVYIGDVVSKSIKFVGTPSTLKVIIGLGDGRVV 589

Query: 555  NFLLNMKTGELTDR-----KKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLL 609
               +  K     +      K + +G   +++   +     +     +RPT++   + K+ 
Sbjct: 590  VSNIRKKNSTQMEEEHKVGKVIEVGMGGVSIDEMAIDGKMYEICVCERPTLMSLDDDKIK 649

Query: 610  YSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQ 669
              +VN+ E       +    P+S+ IA +  L IG+I++++ ++ +S+ LG    R+   
Sbjct: 650  MMSVNIGESVGFLGVHITGIPNSVLIASKESLMIGSIEEVKAINTKSLELGVFVSRVVVS 709

Query: 670  EQSRTFAICSLK-NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDS 728
               R+  + S +  ++ + ++E+HF+ L+D +  E I    LD  E+G +I      +  
Sbjct: 710  SDGRSGVLLSSEIEETRSGKNEVHFINLIDLRKMEIIDKVRLDKDEHGMAIDVKEIEEKE 769

Query: 729  NVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFN---GKLLA 785
               Y VGTAY    E EP++GR ++F + + K+  ++ +   GAVYS+  F    G  +A
Sbjct: 770  --LYIVGTAYAKLGEVEPSRGRFIIFEIHEEKIIEVSNRYVDGAVYSVKRFENDVGNYIA 827

Query: 786  AINQK----IQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLI 841
            A  QK     Q+ + ++       ++ + G +  ++ L+V+T G  I+VGDLMKSIS+  
Sbjct: 828  ATIQKKVVVYQIERKIVDGKFAVTIEEKGGANVKLIGLFVKTLGHEILVGDLMKSISVFK 887

Query: 842  Y--KHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRL 899
            +  K    A+ E  RD+ A++ +AVE +D+  ++ +++  NL    +N+    + E+ +L
Sbjct: 888  FDEKATRNAVVETCRDFYASYTTAVEFMDEHCFMSSDSQGNLLVFTENTTTTNENEKFKL 947

Query: 900  EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEK 959
            +     H+GE +N    GS+ +        Q   ++FG + G IG I  +  E Y  L  
Sbjct: 948  QNEAHIHVGECINVMCKGSIAVMNNAMWETQKKCMLFGGICGSIGGITEINLETYKKLFA 1007

Query: 960  LQTNLRKVIKGVGGL-NHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTM 1018
            L++ + + +KGV    +  QW+   ++ K ++A+N +DG+++E FLDL +     I++ +
Sbjct: 1008 LESEMLREMKGVIECESFGQWKMVFDDWKRMEAQNVIDGNVVELFLDLPKESQKHIAEKI 1067

Query: 1019 NVSVEELCKRVEELTRL 1035
              + EEL   +E +  +
Sbjct: 1068 GYAGEELVTVLESMNTI 1084


>gi|71016864|ref|XP_758932.1| hypothetical protein UM02785.1 [Ustilago maydis 521]
 gi|46098463|gb|EAK83696.1| hypothetical protein UM02785.1 [Ustilago maydis 521]
          Length = 1549

 Score =  253 bits (646), Expect = 4e-64,   Method: Compositional matrix adjust.
 Identities = 254/1002 (25%), Positives = 407/1002 (40%), Gaps = 217/1002 (21%)

Query: 240  LKIELLGETSIAS---TISYLDNAVVYIGSSYGDSQLIKL----------------NLQP 280
            L++E LG TS  +    +SYL   +V +GS+ GD+ L ++                 L P
Sbjct: 551  LRVEALGTTSQPAGPQALSYLGQGLVCVGSATGDNCLYRIIGQNASQKMPASSSEQVLTP 610

Query: 281  DAK------------------------GSYVEVLERYVNLGPIVDFCVVD-------LER 309
             +                         GS V  LE + NLGP+VDF V D          
Sbjct: 611  PSSPTQSRSMLSHAASSSRDHSELSTGGSLVN-LETWQNLGPVVDFVVDDGAGGDPTYAS 669

Query: 310  QGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP--FDTFLVV 367
              Q ++VTCSG    GS+R  R+G  + + AS+ +   + +WS+ +  DD       L+V
Sbjct: 670  SAQARIVTCSGLGPTGSVREARSGASVRDIASLPIPNAEQIWSVDAGVDDASKLTIGLLV 729

Query: 368  SFISETRILAMNLEDELEETEIEGFCS---------QTQTLFCHDAIYNQLVQV--TSGS 416
             F + T  L  N + +L +   +   +            T+  HD    QL++V  T  S
Sbjct: 730  GFATSTAYLHFNADGDLTDATDQLVAAGANVSLPTIAATTVLFHDR-SPQLLRVDRTGAS 788

Query: 417  VRLVSSTSRELRNEWKSPPGYSVNVATANA-SQVLLATGGGHLVYLEIGDGILTEVKHAQ 475
            +  V   S  + ++W+ P    V  A+ N+  Q +LA     L+YL   DG L E     
Sbjct: 789  LFSVQDASISILDQWRPPNELEVTSASVNSVGQAILALSDKSLLYLTDEDGALIERNKKM 848

Query: 476  LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEI--IPRSV 533
            LE E+SC+DI+P+    + +Q+ A G W       ++LPDL ++ +     E   +PRS+
Sbjct: 849  LEDEVSCVDISPLIAGKA-AQLVACGFWGRDVFDFYNLPDLEVVPR-GFAAEFSSVPRSI 906

Query: 534  LLCAFEGI---------------------SYLLCALGDGHLLNFLLNMKTGE-------- 564
            LL  FE                       +YLL  LGDG L++F L     +        
Sbjct: 907  LLHRFESSQPEKASDAEFEFWGFNPNPLDAYLLIGLGDGTLVSFRLGGIVADGNAYVPVS 966

Query: 565  LTDRKKVSLGTQPITLRTFSSKNTTHVFAAS-DRPTVIYSSNKKLLYSNVNLKE---VSH 620
            L D K ++LGTQ + L    +     V A S  RPT++Y  +K+  YS +  K+   V+ 
Sbjct: 967  LYDAKTITLGTQALKLDAIKTSTGARVVAISGSRPTLVYHDSKRFSYSALKYKDPRSVAT 1026

Query: 621  MCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG-EHPRRICHQEQSRTFAICS 679
            +C      F     +     L + +I  + +  IR+ PLG   P  I      + FA+C+
Sbjct: 1027 VCAGPGRVF---AVVVLTDSLELTSISALGQRDIRTFPLGLNQPLAIAQWADRKVFAVCT 1083

Query: 680  L----KNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVG 735
                 +  +  +E     VR+LD  TFE +    L+  E    I         +    VG
Sbjct: 1084 WTFLPRGSASKQEGSRGAVRILDHTTFELLDEIRLEPDERPNCITLLDLP--GHKMLVVG 1141

Query: 736  TAYVLPEENEPTKGRILVFIV---------EDGKLQLIAEKETKGAVYSLNAFNGKLLAA 786
            T YV  + +E  +GR++ F V         E G+L+ + E    G VY + +    L AA
Sbjct: 1142 TGYVSKQSSETVRGRLVAFDVSTGSSRTKEERGRLRQLFECSENGNVYQVQSTRYHLCAA 1201

Query: 787  INQKIQLYKWM-------LRDDGTRELQSECGHHGHILALYVQT-----RGDFIVVGDLM 834
            +N +I+ Y           R  G      E    G     ++         D +VVGD +
Sbjct: 1202 VNSEIKTYSITNCKDMSDKRPSGGPRRPYEVRQEGSWACSFIACNLSAIEPDRLVVGDAL 1261

Query: 835  KSISLLIYKHEEGAIEERARDYNANWMSAVEILDDD--IYLGAENNFNLFTVRKNSEGAT 892
            +S+++L      G + E ARD + +W SA E+LD +   Y+GA+ +FNL+T ++     +
Sbjct: 1262 RSMNVLNVDRYTGRLTEIARDCDPSWTSATELLDSETQTYIGADISFNLYTTQRMP--MS 1319

Query: 893  DEERGR-------------------------------LEVVGEYHLGEFVNRFRHGSL-V 920
            +E R                                 ++    +H G+ +N+FR     V
Sbjct: 1320 EEVRTSIRRARERETERSVSLAYPRTLRDTDDCYAHVMQRKAVWHYGDMINKFRQSEFWV 1379

Query: 921  MRL------------------------------------PDSDVGQIPTVIFGTVNGVIG 944
            +RL                                    P  D    P ++F T  G IG
Sbjct: 1380 VRLVPRHPIPSHPILSNLTLTLIFSFARRLISTASLVSDPGPDAAVRPKLVFCTAAGAIG 1439

Query: 945  VIASLPHEQYLFLEKLQTNLRKVIKG---------VGGLNHEQWRSFNNEKKTVDAKNFL 995
            VIA +  ++   L K++ N+  +I+          +G + H  WR+   + +      FL
Sbjct: 1440 VIAHVRDDEAQILAKVERNILSLIESPTEAASAGVIGNIAHSDWRTLRTDHRVQAPAGFL 1499

Query: 996  DGDLIESFLD--LSRTRMDEISKTMNVSVEELCKRVEELTRL 1035
            D D+++ FLD  L   +   +    N   E L  R E + +L
Sbjct: 1500 DADVLQMFLDGRLDHNQRYRVVHGPNSEAEALGVRSEVVEQL 1541


>gi|328869269|gb|EGG17647.1| CPSF domain-containing protein [Dictyostelium fasciculatum]
          Length = 1194

 Score =  249 bits (637), Expect = 5e-63,   Method: Compositional matrix adjust.
 Identities = 275/1165 (23%), Positives = 503/1165 (43%), Gaps = 173/1165 (14%)

Query: 7    YGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTDN 65
            +G I ++  FR  G ++D+L + ++  +  +L+++  S  +  +   +   R G R    
Sbjct: 58   FGVIRSIAPFRLTGGSKDYLIVGSDSGRVVILEYNP-SKNVFEKVHQETFGRSGCRRIVP 116

Query: 66   GQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLY-----GC 120
            GQ    DP  R   +   +    V   +   Q K + +  LE  +   I F       G 
Sbjct: 117  GQYISTDPKGRAFMIGAIEKQKLVYILNRDSQAKLSISSPLEAHKAHTIVFSMCGVDVGF 176

Query: 121  AKP---TIVVLYQDNKDARHVKT---------YEVALKDKDFVEGPWSQNNLDNGADLLI 168
              P   TI V Y +  +   V+          YE+ L   + V   WS+  +D  A+L++
Sbjct: 177  ENPIFATISVDYSEETNIEDVEETHNTKVLTFYELDLGLNNVVR-KWSEE-VDRSANLVV 234

Query: 169  PVP-----PPLCGVLIIGEETIVYCS---ANAFKAIPIRPSITKAYGRV--------DAD 212
             VP     P   GVL+  +  + Y +   A+   +IP R  +T+    +          D
Sbjct: 235  SVPGGSDGP--GGVLVCAQGRVYYRNIGHADISVSIPRRNGMTEEKSLMIVSHASHKQRD 292

Query: 213  GSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQ 272
               +L+    G L+ + + +  E V+G++I        A+ I+ L N  +++ S +GD  
Sbjct: 293  MFFFLVQSEYGDLYKITLDYSGEMVSGMQIAYFDTFPTANCITMLKNGFLFVASEFGDHG 352

Query: 273  L-----IKLNLQPDAKGS-------YVEVLE--------RYVNLGPIVDFCVVDLERQGQ 312
            L     + L+  P A  +       +  V E           +L PIVDF V DL ++G 
Sbjct: 353  LYLFKSLGLDDAPTASSAGNTEMVFFEPVFEPRNLVLTATISSLSPIVDFKVADLAQEGT 412

Query: 313  GQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSLRSS--------------T 357
             Q+   SG  +  +LR++R+G+ I +    +L G   G+W++  S               
Sbjct: 413  PQMYALSGVSERANLRVLRHGLPITQMVDSQLPGTPAGIWTIPQSLTTMRNPQYQGIGTV 472

Query: 358  DDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHD--AIYNQLVQVTSG 415
            + P D ++VVSF+  T +L +   + +EE +  G  S T T+      A  + +VQ+ + 
Sbjct: 473  ESPADRYIVVSFVGSTLVLGVG--ETVEEVQDSGILSTTTTILIRSMGANLDSIVQIFAQ 530

Query: 416  SVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD-GILTEVKHA 474
             +R +++  R   +EW++P   ++++A AN  QV++  GG  ++Y E+   G LTEV   
Sbjct: 531  GIRHINAERRV--SEWRAPGRKTISIAAANQQQVIICIGGSEIIYFELDPAGNLTEVFKK 588

Query: 475  QLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVL 534
             +  +I+C++  PI    + S+  A+  + D  VRI SL   N++ +  +      +   
Sbjct: 589  DMRKDINCIEFAPIPRGRTMSRFVAISDY-DGPVRILSLERDNMLNQVSMVDTDRQQVEQ 647

Query: 535  LCAFE------------------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQ 576
            LC  E                  G  +L   L +G L   +L+  TGE++D +   LG +
Sbjct: 648  LCVAELMVHEPGVDQQIVNTQRQGTLFLHIGLKNGVLKRAVLDGLTGEISDMRTRILGRR 707

Query: 577  PITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIA 636
            PI       K +  V A S R  + YS+  +   + ++++ + H    +S   P+ +   
Sbjct: 708  PIKFFRVKIKGSPAVLALSTRVWMCYSNLGRYEITPLSVEPLDHAASLSSDQCPEGIVAT 767

Query: 637  KEGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQEQSRTFAICSLKNQSCAEESEM--- 691
             E  L I +I+ +  L  +  IPL   P+R + H + +   +I +  N S   E E+   
Sbjct: 768  SENNLKIFSIEKLGDLFNQVQIPLSCTPKRFVVHPQTNYIVSIETEHNFSTQFEPELAQD 827

Query: 692  ------------HFVRLLD---DQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGT 736
                          +R+LD   +QT + +S   L   E   SI +  F  +  V   VG 
Sbjct: 828  LEMPRAGKGKWKSVIRILDPVSNQTLDLVS---LTNNEAAFSIGTVMF-QEGEVMLAVGC 883

Query: 737  A---YVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
            A    + P         +  F+    KL L+   ET    +++ +F GKL+A +   ++L
Sbjct: 884  AKDVQLQPRGFSSASIHLYQFVENGQKLALVHTTETDLIPHAVGSFQGKLIAGVGNVLRL 943

Query: 794  Y----KWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAI 849
            Y    K +LR    R++ ++           +Q++GD IVV D+ +S+    Y+  E  +
Sbjct: 944  YELGKKKLLRKCENRKVPNQITS--------IQSQGDRIVVSDVQESVHFFTYRRSENVL 995

Query: 850  EERARDYNANWMSAVEILD---------------------DDIYLGAENNFNLFTVRKNS 888
               A D    ++++  +LD                      D  +  EN  ++ T+   S
Sbjct: 996  VLFADDTAPRFVTSCVMLDFDTVAIADKFGTIAVVRMPPSSDTDIDIENELDIGTLTTKS 1055

Query: 889  EGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIAS 948
            + A +    +LE+V  +++GE        SL+       VG    +I+ T+ G IG +  
Sbjct: 1056 KLALNGAANKLELVASFYVGETCCNLTRTSLM-------VGGSEVIIYTTLGGAIGALIP 1108

Query: 949  L-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLS 1007
                E+  F  +L+  +R+  + + G +H  +RS+         KN +DGDL E +  L 
Sbjct: 1109 FSSREEVDFFAQLEMQMRQEKESLCGRDHLSYRSY-----YFPVKNVIDGDLCEQYSTLD 1163

Query: 1008 RTRMDEISKTMNVSVEELCKRVEEL 1032
             T+   +++ +  +  E+ K++E++
Sbjct: 1164 ATKQRSVAEELIRNPSEVLKKLEDI 1188


>gi|331247470|ref|XP_003336363.1| DNA damage-binding protein 1 [Puccinia graminis f. sp. tritici CRL
            75-36-700-3]
          Length = 962

 Score =  246 bits (627), Expect = 6e-62,   Method: Compositional matrix adjust.
 Identities = 249/928 (26%), Positives = 411/928 (44%), Gaps = 136/928 (14%)

Query: 209  VDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIEL----LGETSIASTISYLDNAVVYI 264
            VD   + +LLGD  G L  L +   K+  TG    L     G       + Y+ +  V++
Sbjct: 7    VDGVPNCWLLGDLYGNLIFLFL---KQSTTGTPPSLHYFHAGHVPSPEALVYITSGFVFL 63

Query: 265  GSSYGDSQLIKL-----NLQPDAKGS-YVEVLERYVNLGPIVDFCVVDLERQGQGQVVTC 318
             S YGDSQL+KL     +L PD   S   EV+  + NL PI DFCV +  +    Q+VTC
Sbjct: 64   ASHYGDSQLLKLPSPTSSLNPDNTASAQPEVITTFPNLAPISDFCVTEDRKSLVNQIVTC 123

Query: 319  SGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTD----DPFDTFLVVSFISETR 374
            SG+++DGSLR++++GIGI E  S+E+ G++ +W+LRSST     + FD  LV+S    TR
Sbjct: 124  SGSHRDGSLRVIKHGIGIRESGSLEVGGVQRLWALRSSTHVNSVEDFDDRLVLSCADCTR 183

Query: 375  ILAMNLEDELEETEI-EGFCSQTQTLFC------HDAIYNQLVQVTSGSVRLVSSTSREL 427
             LA+N +  +EE  +  GF S   T+         D+     +QVT  S ++++  +   
Sbjct: 184  FLALNEDGTIEEIGLFNGFESDVPTILAGNLLDGSDSTTRYSIQVT--SRKIIAGDAL-- 239

Query: 428  RNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDINP 487
               W+     S+  A  + +   ++     +V L I DG   E    +L  +IS L I+ 
Sbjct: 240  --VWEPDDAKSITRAALSVTTCAVSL-KEQVVVLCIKDGKFVEKGTYKLLNDISSLAID- 295

Query: 488  IGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEGIS----Y 543
                       A   W    + I S+   + I + +   + +  S+ +  FEG       
Sbjct: 296  -----QSENFVAAAQWVTNCIEIISVSSSSTICRVNTDSDFMVNSLKMTNFEGTESDGCR 350

Query: 544  LLCALGDGHLLNFLLNMKTG---ELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV 600
            LL  LGDG ++N  L   TG   E    +  +LG +PI   +  +     ++A SD+PT+
Sbjct: 351  LLIGLGDGKIMNVALG-PTGMHVEGDSPRFTTLGIRPIEFVSMRNATGEFLWANSDQPTI 409

Query: 601  I--YSSNKKLLYSNVNLK--EVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRS 656
            I    +N +  Y+ V ++   VS     ++  F DSL +A   E+ IG ++  +K++I  
Sbjct: 410  IDRIQNNGRFAYTPVTVQGGSVSSATGLHARFFQDSLVLASNDEIRIGKLNTTEKMNILK 469

Query: 657  IPLG-EHPRRICHQEQSRTFAICSLKNQSCAEESEMHFV---RLLDDQTFEFISTYPLDT 712
            I LG E PRRI H E  + + +   + +   +   +H +   ++ DD+TF+         
Sbjct: 470  ISLGNEQPRRIAHSEDMKAYGVVCARLELDQDTGTIHRIGTFKVFDDETFQC-------- 521

Query: 713  FEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEK----- 767
                 SI +    ++   ++ +GT  +   E E T GRIL     + K  L A+K     
Sbjct: 522  ----SSIAAVKLGEEMIEHFVIGTGVIKSTEAEATIGRILAIRELNSKQDLTAKKRHFEL 577

Query: 768  ----ETKGAVYSLNAF-NGKLLAAINQKIQLYKWMLRD--------------------DG 802
                +  GAV  +    NG  +A+ N  +  +     D                    DG
Sbjct: 578  TNVGKLSGAVGGVGGLPNGMFVASANAFVHAFGLKKGDSGRAFPSGTDTVLAGSVPEMDG 637

Query: 803  TRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMS 862
               L    G  G  ++  V T G  ++VGDL KS+ LL +  E   +  +ARD++A  + 
Sbjct: 638  GFRLLDTWG--GGFVSQTVVTDGTKVLVGDLYKSVVLLEFDLEHLELAVKARDFSAMSVR 695

Query: 863  AVEILD--DDIYLGAENNFNL---------FTVRKNSEGATDEERGRLEVVG-------- 903
             +  +   D    G++   +            V  +  G +DE+  +   +         
Sbjct: 696  PIGAISERDPTSPGSDGKSDRPNQDAHREDHGVHLDDGGGSDEDEEQALELEQEEEEEED 755

Query: 904  -EYHLGEFVNRFRH-----------------GSLVMRLPDSDVGQIPTVIFGTVNGVIGV 945
             +Y+   + +  +                  GSLV R  +S +     +IF T  G IG+
Sbjct: 756  QDYNQSSYHDHLQPPHSPTFDQDQNPAAYGGGSLVPRFIESSLIGETKLIFVTSTGGIGL 815

Query: 946  IASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFL 1004
            IA +   ++   L + Q++L K+   VG L H  +R F  E + + +  FLDGD +E  L
Sbjct: 816  IAKIHSKKKTKQLARFQSDLSKISTSVGNLAHSAYRMFKTESRKIPSMGFLDGDFLEGCL 875

Query: 1005 DLSRTRMDE-ISKTMNVSVEELCKRVEE 1031
            DL+   ++  + K M +  EE   +  E
Sbjct: 876  DLTPDEVENLVKKMMALKAEEAQAKATE 903


>gi|18410222|ref|NP_567015.1| splicing factor 3B subunit 3 [Arabidopsis thaliana]
 gi|18410226|ref|NP_567016.1| putative splicing factor [Arabidopsis thaliana]
 gi|7019653|emb|CAB75754.1| spliceosomal-like protein [Arabidopsis thaliana]
 gi|7019655|emb|CAB75756.1| spliceosomal-like protein [Arabidopsis thaliana]
 gi|332645831|gb|AEE79352.1| splicing factor 3B subunit 3 [Arabidopsis thaliana]
 gi|332645833|gb|AEE79354.1| putative splicing factor [Arabidopsis thaliana]
          Length = 1214

 Score =  244 bits (622), Expect = 3e-61,   Method: Compositional matrix adjust.
 Identities = 277/1187 (23%), Positives = 499/1187 (42%), Gaps = 192/1187 (16%)

Query: 4    VPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RP 62
            V ++G I +L  FR  G  +D++ + ++  +  +L+++ E + +  +   +   + G R 
Sbjct: 55   VEVFGAIRSLAQFRLTGAQKDYIVVGSDSGRIVILEYNKEKN-VFDKVHQETFGKSGCRR 113

Query: 63   TDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL--- 117
               GQ   +DP  R  +IG      L  V+  D   +L  +  +   +   +        
Sbjct: 114  IVPGQYVAVDPKGRAVMIGACEKQKLVYVLNRDTTARLTISSPLEAHKSHTICYSLCGVD 173

Query: 118  YGCAKPTIVVLYQDNKDA-------------RHVKTYEVALKDKDFVEGPWSQNNLDNGA 164
             G   P    +  D  +A             +H+  YE+ L   + V   WS N +DNGA
Sbjct: 174  CGFDNPIFAAIELDYSEADQDPTGQAASEAQKHLTFYELDLG-LNHVSRKWS-NPVDNGA 231

Query: 165  DLLIPVPPPL---CGVLIIGEETIVYCS---ANAFKAIPIRPSITKAYGRVDADGSR--- 215
            ++L+ VP       GVL+  E  ++Y +    +    IP R  +    G +    +    
Sbjct: 232  NMLVTVPGGADGPSGVLVCAENFVIYMNQGHPDVRAVIPRRTDLPAERGVLVVSAAVHKQ 291

Query: 216  -----YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGD 270
                 +L+    G +  + + H  + V+ LK++      +AS+I  L    ++  S +G+
Sbjct: 292  KTMFFFLIQTEYGDVFKVTLDHNGDHVSELKVKYFDTIPVASSICVLKLGFLFSASEFGN 351

Query: 271  SQLIKLNL---QPDAKGSYVEVLE----------------------RYVNLGPIVDFCVV 305
              L +      +PD + S   ++E                      +  +L P++D  V+
Sbjct: 352  HGLYQFQAIGEEPDVESSSSNLMETEEGFQPVFFQPRRLKNLVRIDQVESLMPLMDMKVL 411

Query: 306  DLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTDDPFDTF 364
            ++  +   Q+ +  G     SLRI+R G+ I E A  +L G    +W+++ +  D FD +
Sbjct: 412  NIFEEETPQIFSLCGRGPRSSLRILRPGLAITEMAVSQLPGQPSAVWTVKKNVSDEFDAY 471

Query: 365  LVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTS 424
            +VVSF + T +L++   +++EE    GF   T +L       + L+QV    +R +    
Sbjct: 472  IVVSFTNATLVLSIG--EQVEEVNDSGFLDTTPSLAVSLIGDDSLMQVHPNGIRHIREDG 529

Query: 425  RELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTEVKHAQLEYEISCL 483
            R   NEW++P   S+     N  QV++A  GG L+Y E    G L EV+  ++  +++CL
Sbjct: 530  R--INEWRTPGKRSIVKVGYNRLQVVIALSGGELIYFEADMTGQLMEVEKHEMSGDVACL 587

Query: 484  DINPIGENPSYSQIAAVGMWTDISVRIFSL-PD--LNLITKEHLGGEIIPRSVLLCAFEG 540
            DI P+ E    S+  AVG + D +VRI SL PD  L +++ + +     P S+L    + 
Sbjct: 588  DIAPVPEGRKRSRFLAVGSY-DNTVRILSLDPDDCLQILSVQSVSSA--PESLLFLEVQA 644

Query: 541  IS------------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNT 588
                          +L   L +G L   +++M TG+L+D +   LG +P  L + S +  
Sbjct: 645  SIGGDDGADHPANLFLNSGLQNGVLFRTVVDMVTGQLSDSRSRFLGLKPPKLFSISVRGR 704

Query: 589  THVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDD 648
            + +   S RP + Y        + ++ + +    PF+S    + +       L I  ID 
Sbjct: 705  SAMLCLSSRPWLGYIHRGHFHLTPLSYETLEFAAPFSSDQCAEGVVSVAGDALRIFMIDR 764

Query: 649  I-QKLHIRSIPLGEHPRR-ICHQEQSRTFAICSLKNQSCAEESE---------------- 690
            + +  +   +PL   PR+ + H ++     I S +    AEE E                
Sbjct: 765  LGETFNETVVPLRYTPRKFVLHPKRKLLVIIESDQGAFTAEEREAARKECFEAGGVGENG 824

Query: 691  -----------------------------------MHFVRLLDDQTFEFISTYPLDTFEY 715
                                               +  +R+LD +T        L   E 
Sbjct: 825  NGNADQMENGADDEDKEDPLSDEQYGYPKAESEKWVSCIRVLDPKTATTTCLLELQDNEA 884

Query: 716  GCSILSCSFSD-DSNVYYCVGTAYVLPEENEPTKGRILVFI-----VEDGK-LQLIAEKE 768
              S+ + +F D +      VGT   +  +  P K  +  FI     VEDGK L+L+ + +
Sbjct: 885  AYSVCTVNFHDKEYGTLLAVGTVKGM--QFWPKKNLVAGFIHIYRFVEDGKSLELLHKTQ 942

Query: 769  TKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHI-LALYVQTRGDF 827
             +G   +L  F G+LLA I   ++LY     D G + L  +C +       + +QT  D 
Sbjct: 943  VEGVPLALCQFQGRLLAGIGPVLRLY-----DLGKKRLLRKCENKLFPNTIISIQTYRDR 997

Query: 828  IVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR-- 885
            I VGD+ +S     Y+ +E  +   A D    W++A   +D D   GA+   N++ VR  
Sbjct: 998  IYVGDIQESFHYCKYRRDENQLYIFADDCVPRWLTASHHVDFDTMAGADKFGNVYFVRLP 1057

Query: 886  ----------------KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVG 929
                            K  +G  +    +++ + ++H+G+ V   +  S++        G
Sbjct: 1058 QDLSEEIEEDPTGGKIKWEQGKLNGAPNKVDEIVQFHVGDVVTCLQKASMI-------PG 1110

Query: 930  QIPTVIFGTVNGVIGVI-ASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKT 988
               ++++GTV G IG + A    +   F   L+ ++R+    + G +H  +RS       
Sbjct: 1111 GSESIMYGTVMGSIGALHAFTSRDDVDFFSHLEMHMRQEYPPLCGRDHMAYRS-----AY 1165

Query: 989  VDAKNFLDGDLIESF----LDLSRTRMDEISKTMNVSVEELCKRVEE 1031
               K+ +DGDL E F    +DL R   DE+ +T      E+ K++E+
Sbjct: 1166 FPVKDVIDGDLCEQFPTLPMDLQRKIADELDRT----PAEILKKLED 1208


>gi|291000406|ref|XP_002682770.1| predicted protein [Naegleria gruberi]
 gi|284096398|gb|EFC50026.1| predicted protein [Naegleria gruberi]
          Length = 1216

 Score =  243 bits (621), Expect = 3e-61,   Method: Compositional matrix adjust.
 Identities = 257/1033 (24%), Positives = 453/1033 (43%), Gaps = 155/1033 (15%)

Query: 127  VLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIV 186
            V YQ  K  + +  YE+ L     V+   S   +D  A+LLI +P PL GVL+  E  I 
Sbjct: 206  VAYQSLK--KMLTIYELDLGKNTIVKK--SSEEIDRSANLLIQIPEPLGGVLVCSENYIT 261

Query: 187  YCSA---NAFKAIPIRPSITKAYG--------RVDADGSRYLLG-DHAGLLHLLVITHEK 234
            Y      +    IP R  + +  G        +V   G  +LL  +  G L+ +    + 
Sbjct: 262  YKKEGQDDVRSPIPRRSGMPQERGLMIINSNLKVKKGGDIFLLAQNELGDLYRVSFNKDG 321

Query: 235  EKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKL------------------ 276
             +V+ + ++      ++++   L N  ++  S +GD +  K+                  
Sbjct: 322  NRVSSVSLQYFDTIPVSNSFLALKNGNLFCASEFGDHKCYKITKIEPNEYPETYESIGSE 381

Query: 277  ------------------NLQPDAKG-SYVEVLERYVNLGPIVDFCVVDLERQGQGQVVT 317
                              N QP     + +E++    +L P+ D  V DL  +G  Q+  
Sbjct: 382  TFPTFTPPMNRFEALMSDNDQPSTTNVTCLELVNEIKSLAPVTDMKVTDLMGEGTPQIYC 441

Query: 318  CSGAYKDGSLRIVRNGIGINEQASVEL-QGIKGMWSLRSSTDDPFDTFLVVSFISETRIL 376
             +G     SLR +R G+ +NE+ +  L Q    +++++ S +D FD ++++SF   T +L
Sbjct: 442  LNGRGPTSSLRSLRYGLPVNEEVAAPLDQQATAIFTVKESMNDTFDKYIILSFSEFTMVL 501

Query: 377  AMNLEDELEETEIEGFCSQTQTLFCHD-AIYNQLVQVTSGSVRLVSSTSRELRNEWKSPP 435
            ++  E+  E TE  GF + T+T++  +     + VQV    +R +     +  NEW S  
Sbjct: 502  SVG-ENVAEVTE-SGFLTTTKTIYASNIGESGEFVQVHPKGIRHIHP---DRVNEWNSG- 555

Query: 436  GYSVNVATANASQVLLATGGGHLVYLE--IGDGILTEVKHAQLEYEISCLDINPIGENPS 493
               +  A  N  Q++++  GG ++Y E     G L E +   L  +++CL ++PI +  +
Sbjct: 556  NKIIEKAAVNGYQIVVSLSGGEIIYFEYDTSSGNLIETERNDLSQDVACLALSPIQDGRT 615

Query: 494  YSQIAAVGMWTDISVRIFSLPD---LNLITKEHLGGEIIPRSVLLCAF-------EGISY 543
              +  AVG + D +VR+ SL +   +++++++ L  +  P S+ L          E   Y
Sbjct: 616  RGRFLAVGFY-DKTVRLISLGEYDMMSILSRQALPAD--PESLSLIELQSGHSRDETSLY 672

Query: 544  LLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYS 603
            L   L +G LL   ++  TGEL+D +   LGT+ + LR    +    + A S +P +  S
Sbjct: 673  LNIGLSNGILLRSTVDSSTGELSDTRSRFLGTKGVKLRNVKVRGDNAILALSSKPWLGNS 732

Query: 604  SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDD-IQKLHIRSIPLGEH 662
             N K+  + ++   ++  C F+S    D L       L I TI   + K     IPL   
Sbjct: 733  INGKIEMTPLSYPALNSACNFSSEQIRDGLVSITAEHLRIITISSLLDKFTPEIIPLNYT 792

Query: 663  PRR-ICHQEQSRTFAICSLKNQSCAEESEM-HFVR---------LLDDQTFEFISTYP-- 709
            PR+ + H E S    + +  N S    ++   +++         L  D  +  I T P  
Sbjct: 793  PRKFVVHDETSHMIILQTDHNVSKENSTDRPDYIKVDESQKDPSLSKDVEYGAIKTKPKS 852

Query: 710  -----------------LDTFEY-----GCSILSCSFS------DDSNVYYCVGTAYVLP 741
                             LD FE        SI +C FS        +     VGTA  + 
Sbjct: 853  NLWASYIRVYSPKKQANLDEFEIEDNEAAFSITTCKFSTSLSGAKSNESLIIVGTAKNM- 911

Query: 742  EENEPTK----GRILVF-IVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKW 796
             +  PT+    G I VF I EDGKLQLI + E +   Y+L+AF G+LL  +   +++Y  
Sbjct: 912  -KLYPTRTCDCGYINVFQISEDGKLQLIHKTEVEDVPYALHAFRGRLLVGVKNMLRIY-- 968

Query: 797  MLRDDGTRELQSECGHHGHI-LALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARD 855
               D G ++L  +C +         +   G+ I VGD+ +S   + +   E ++   A +
Sbjct: 969  ---DLGKKKLLRKCENKSFPNFITSIAVDGNRIFVGDITESFHFVKFNSSENSLTIFADN 1025

Query: 856  YNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDE-------------ERGRLEVV 902
                W++A  ++D +   G +   N F  R  S+  +DE             ERG L   
Sbjct: 1026 TTPRWLTASALVDHNTIAGGDKFGNFFISRLPSD-VSDELEDSSTGKEKWIWERGLLN-- 1082

Query: 903  GEYHLGEFVNRFRHGSLVMRLPDSDV--GQIPTVIFGTVNGVIGVIASLPHEQYL-FLEK 959
            G       + +F  GS++  +  + +  G    +I+ T+ G +GV      ++ + F  +
Sbjct: 1083 GAPQKATEIVKFYVGSMITSIYKTSLIAGGPSILIYTTITGAVGVFFPFTSKKDIEFFTQ 1142

Query: 960  LQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN 1019
            L+ +LR+    + G +H  +RS+    K+V     +DGDLIE F D+      +IS+ + 
Sbjct: 1143 LEMHLREKNPPLCGRDHLAYRSYYFPVKSV-----VDGDLIEQFNDVDLQTKTKISEDLQ 1197

Query: 1020 VSVEELCKRVEEL 1032
             ++ E+ K++E++
Sbjct: 1198 RTINEIAKKIEDM 1210


>gi|374095609|gb|AEY85032.1| spliceosomal-like protein [Camellia sinensis]
          Length = 1212

 Score =  243 bits (621), Expect = 3e-61,   Method: Compositional matrix adjust.
 Identities = 275/1187 (23%), Positives = 502/1187 (42%), Gaps = 186/1187 (15%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            +L V I+G I +L  FR  G  +D++ + ++  +  +L+++ E + +  +   +   + G
Sbjct: 52   ILSVEIFGAIRSLAQFRLTGSQKDYIVVGSDSGRIVILEYNKEKN-VFDKVHQETFGKSG 110

Query: 61   -RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
             R    GQ   IDP  R  +IG      L  V+  D   +L  +  +   +   +     
Sbjct: 111  CRRIVPGQYLAIDPKGRAVMIGACEKQKLVYVLNRDTAARLTISSPLEAHKSHTIFYSIC 170

Query: 118  ---YGCAKPTIVVLYQDNKDA-------------RHVKTYEVALKDKDFVEGPWSQNNLD 161
                G   P    +  D  +A             +H+  YE+ L   + V   WS+  +D
Sbjct: 171  GVDCGFDNPIFASIELDYSEADQDSTGQAAAEAQKHLTFYELDLG-LNHVSRKWSEQ-VD 228

Query: 162  NGADLLIPVP---PPLCGVLIIGEETIVYCS---ANAFKAIPIRPSITKAYGRVDADGSR 215
            NGA++L+ VP       GVL+  E  ++Y +    +    IP R  +    G +    + 
Sbjct: 229  NGANMLVTVPGGGDGPSGVLVCAENFVIYKNQGHPDVRAVIPRRYDLPAERGVLIVSAAM 288

Query: 216  --------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSS 267
                    +LL    G +  + + H+ ++VT L+I+      + +++  L +  ++  S 
Sbjct: 289  HKQKSMFFFLLQTEYGDIFKVTLDHDNDRVTELRIKYFDTIPVTASLCVLKSGFLFAASE 348

Query: 268  YGDSQLIKLNL---QPDAKGSYVEVLE----------------------RYVNLGPIVDF 302
            +G+  L +      +PD + S   ++E                      +  +L PI+D 
Sbjct: 349  FGNHALYQFQAIGDEPDVESSSATLMETEEGFQPVFFQPRRLKNLVRIDQVESLMPIMDM 408

Query: 303  CVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSLRSSTDDPF 361
             +++L  +   Q+ T  G     SLRI+R G+ I+E A  +L G+   +W+++ + +D F
Sbjct: 409  KIINLFEEETPQIFTLCGRGPRSSLRILRPGLAISEMAVSQLPGVPSAVWTVKKNVNDEF 468

Query: 362  DTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVS 421
            D ++VVSF + T +L++   + +EE    GF   T +L       + L+QV    +R + 
Sbjct: 469  DAYIVVSFANATLVLSIG--ETVEEVSDSGFLDTTPSLAVSLIGDDSLMQVHPSGIRHIR 526

Query: 422  STSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTEVKHAQLEYEI 480
               R   NEW++P   ++    +N  QV++A  GG ++Y E+   G L EV+  ++  ++
Sbjct: 527  EDGR--INEWRTPGKRTIVKVGSNRLQVVIALSGGEIIYFEVDMTGQLMEVEKQEMSGDV 584

Query: 481  SCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PD--LNLITKEHLGGEIIPRSVLLCA 537
            +CLDI P+ E    S+  AVG + D  +RI SL PD  + +++ + +     P S+L   
Sbjct: 585  ACLDIAPVPEGRQRSRFLAVGSY-DNCIRILSLDPDDCMQVLSLQSVSSP--PESLLFLE 641

Query: 538  FEGIS------------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSS 585
             +               +L   L +G L   +++M TG+L+D +   LG +   L +   
Sbjct: 642  VQASVGGEDGADHPASLFLNAGLQNGVLFRTVVDMVTGQLSDARSRFLGLRAPKLFSVII 701

Query: 586  KNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGT 645
            +    +   S RP + Y      L + ++ + +     F+S    + +       L + T
Sbjct: 702  RGRRAMLCLSSRPWLGYIHQGHFLLTPLSYETLEFAASFSSDQCAEGVVAVAGDALRVFT 761

Query: 646  IDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAICSL------------KNQSCAEESEM- 691
            I+ + +  +  +IPL   PR+   Q + +   I                 + C E++ M 
Sbjct: 762  IERLGETFNETAIPLRYTPRKFVLQPKRKLLVIIESDQGAYAAEQRENAKKECFEDAGMG 821

Query: 692  -------------------------------------HFVRLLDDQTFEFISTYPLDTFE 714
                                                   +R+LD +T        L   E
Sbjct: 822  ENGKVEQMENGGDDEDKEDPLSDEQYGYPKVESDRWVSCIRVLDPRTANTTCLLELQDNE 881

Query: 715  YGCSILSCSFSD-DSNVYYCVGTAYVLP--EENEPTKGRILVF-IVEDGK-LQLIAEKET 769
               SI   +F D +      VGTA  L    +   + G I ++  VEDGK L+L+ + + 
Sbjct: 882  AAFSICLVNFHDKEYGTLLAVGTAKGLQFWPKRSISSGYIHIYRFVEDGKSLELLHKTQV 941

Query: 770  KGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY-VQTRGDFI 828
                 +L  F GKLLA +   ++LY     D G R+L  +C +      +  + T  D I
Sbjct: 942  DDVPLALCQFQGKLLAGVGSVLRLY-----DLGKRKLLRKCENKLFPNTITSIHTYRDRI 996

Query: 829  VVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR--- 885
             VGD+ +S     Y+ +E  +   A D    W++A   +D D   GA+   N++ VR   
Sbjct: 997  YVGDIQESFHYCKYRRDENQLYIFADDCVPRWLTASYHIDFDTMAGADKFGNIYFVRLAQ 1056

Query: 886  ---------------KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQ 930
                           K  +G  +    ++E + ++H+G+ V   +  SL+   P      
Sbjct: 1057 DVSDEIEEDPTGGKIKWEQGKLNGAPNKVEEIVQFHVGDVVTCLQKASLI---PSGG--- 1110

Query: 931  IPTVIFGTVNGVIG-VIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTV 989
               VI+GTV G +G ++A    +   F   L+ ++R+    + G +H  +RS        
Sbjct: 1111 -ECVIYGTVMGSLGALLAFTSRDDVDFFSHLEMHMRQENPPLCGRDHMAYRS-----AYF 1164

Query: 990  DAKNFLDGDLIESF----LDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
              K+ +DGDL E F    +D+ R   DE+ +T      E+ K++EE+
Sbjct: 1165 PVKDVIDGDLCEQFPTLPMDMQRKIADELDRTPG----EILKKLEEV 1207


>gi|356576847|ref|XP_003556541.1| PREDICTED: splicing factor 3B subunit 3-like [Glycine max]
          Length = 1214

 Score =  243 bits (619), Expect = 5e-61,   Method: Compositional matrix adjust.
 Identities = 282/1190 (23%), Positives = 498/1190 (41%), Gaps = 190/1190 (15%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            +L V I+G I +L  FR  G  +D++ + ++  +  +L+++ E + +  +   +   + G
Sbjct: 52   ILSVEIFGAIRSLAQFRLMGAQKDYIVVGSDSGRIVILEYNKEKN-VFDKIHQETFGKSG 110

Query: 61   -RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
             R    GQ   IDP  R  +IG      L  V+  D   +L  +  +   +   L     
Sbjct: 111  CRRIVPGQYLAIDPKGRAVMIGACEKQKLVYVLNRDTAARLTISSPLEAHKSHTLVYSIC 170

Query: 118  ---YGCAKPTIVVLYQDNKDA-------------RHVKTYEVALKDKDFVEGPWSQNNLD 161
                G   P    +  D  +A             +H+  YE+ L   + V   WS+  +D
Sbjct: 171  GVDCGFENPIFAAIELDYSEADQDSTGLAASEAQKHLTFYELDL-GLNHVSRKWSEQ-VD 228

Query: 162  NGADLLIPVP---PPLCGVLIIGEETIVY---CSANAFKAIPIRPSITKAYGRVDADGSR 215
            NGA+LL+ VP       GVL+  E  ++Y           IP R  +    G +    + 
Sbjct: 229  NGANLLVTVPGGGDGPSGVLVCAENFVIYKNQAHPEVRAVIPRRTDLPAERGVLIVSAAM 288

Query: 216  --------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSS 267
                    +LL    G +  + + H  ++V+ LKI+      + +++  L +  ++  S 
Sbjct: 289  HKLKNMFFFLLQTEYGDIFKVTLEHNNDRVSELKIKYFDTIPVTASMCVLKSGFLFAASE 348

Query: 268  YGDSQLIKLNL---QPDAKGSYVEVLE----------------------RYVNLGPIVDF 302
            +G+  L +      + D + S   ++E                      +  +L PI+D 
Sbjct: 349  FGNHALYQFKSIGDEDDVEASSATLMETEQGFQPVFFQPRRLKNLVRIDQVESLMPIMDM 408

Query: 303  CVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSLRSSTDDPF 361
             V +L  +   Q+ T  G     SLRI+R G+ ++E A  +L GI   +W+++ +  D F
Sbjct: 409  KVSNLFEEETPQIYTLCGRGPRSSLRILRTGLAVSEMAVSKLPGIPSAVWTVKKNAIDEF 468

Query: 362  DTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVS 421
            D ++VVSF + T +L++   + +EE    GF   T +L       + L+QV    +R + 
Sbjct: 469  DAYIVVSFTNATLVLSIG--ETVEEVSDSGFLDTTPSLAVSLIGDDSLMQVHPNGIRHIR 526

Query: 422  STSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTEVKHAQLEYEI 480
               R   NEW++P   +++   +N  QV++A  GG L+Y E+   G L EV+  ++  ++
Sbjct: 527  EDGR--INEWRTPGKRTISKVGSNRLQVVIALSGGELIYFEVDVTGQLMEVEKHEMSGDV 584

Query: 481  SCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPRSVLLCAFE 539
            +CLDI P+ E    S+  AVG + D ++RI SL PD  +           P S+L    +
Sbjct: 585  ACLDIAPVPEGRQRSRFLAVGSY-DKTIRILSLDPDDCMQALSVQSVSSAPESLLFLEVQ 643

Query: 540  GIS------------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKN 587
                           +L   L +G +   +++M TG+L+D +   LG +   L     + 
Sbjct: 644  ASVGGEDGADHPASLFLNAGLQNGVMFRTVVDMVTGQLSDSRSRFLGLRAPKLFPIIVRG 703

Query: 588  TTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTID 647
               +   S RP + Y      L + ++ + + +   F+S    + +       L I TI+
Sbjct: 704  KRAMLCLSSRPWLGYIHQGHFLLTPLSYETLEYAASFSSDQCVEGVVAVAGEALRIFTIE 763

Query: 648  DI-QKLHIRSIPLGEHPRRICHQEQSRTFA-ICSLKNQSCAEESEM-------------- 691
             + +  +   IPL   PR+   Q + +    I S +    AEE E               
Sbjct: 764  RLGETFNETVIPLRYTPRKFVLQPKRKLLVMIESDQGALTAEEREAARKECFESAQAGEN 823

Query: 692  ------------------------HF-------------VRLLDDQTFEFISTYPLDTFE 714
                                    H+             +R+LD +T        L   E
Sbjct: 824  GTESADQMENGGEDEDKDDPLSDEHYGYPKAESDKWASCIRVLDPRTGNTTCLLELQENE 883

Query: 715  YGCSILSCSFSD-DSNVYYCVGTA---YVLPEENEPTKGRILVF-IVEDGK-LQLIAEKE 768
               SI + +F D +      VGTA     LP+    T G I ++  VEDG+ L+L+ + +
Sbjct: 884  AAFSICTINFHDKEYGTLLAVGTAKGLQFLPKRT-ITAGFIHIYRFVEDGRSLELLHKTQ 942

Query: 769  TKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHH---GHILALYVQTRG 825
             +G   +L  F G+LLA I   ++LY     D G R L  +C +      I++++     
Sbjct: 943  VEGVPLALCQFQGRLLAGIGPVLRLY-----DLGKRRLLRKCENKLFPNTIVSIHAYR-- 995

Query: 826  DFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR 885
            D I VGD+ +S     Y+ +E  +   A D    W++A   +D D   GA+   N++ VR
Sbjct: 996  DRIYVGDVQESFHYCKYRRDENQLYIFADDCVPRWLTASYHIDFDTMAGADKFGNIYFVR 1055

Query: 886  ------------------KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSD 927
                              K  +G  +    ++E + ++H+G+ V   +  SL+       
Sbjct: 1056 LPQDVSDEIEEDPTGGRIKWEQGKLNGAPNKVEEIVQFHIGDVVTCLQKASLI------- 1108

Query: 928  VGQIPTVIFGTVNGVIGVI-ASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEK 986
             G    ++FGTV G +G + A    +   F   L+ ++R+    + G +H  +RS     
Sbjct: 1109 PGGGECIVFGTVMGSVGALHAFTSRDDVDFFSHLEMHMRQDHPPLCGRDHMAYRS----- 1163

Query: 987  KTVDAKNFLDGDLIESF----LDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
                 K+ +DGDL E +    +DL R   DE+ +T      E+ K++EE+
Sbjct: 1164 AYFPVKDVIDGDLCEQYPTLPMDLQRKIADELDRTPG----EILKKLEEV 1209


>gi|255588145|ref|XP_002534515.1| spliceosomal protein sap, putative [Ricinus communis]
 gi|223525135|gb|EEF27867.1| spliceosomal protein sap, putative [Ricinus communis]
          Length = 1214

 Score =  242 bits (618), Expect = 6e-61,   Method: Compositional matrix adjust.
 Identities = 278/1187 (23%), Positives = 499/1187 (42%), Gaps = 185/1187 (15%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            +L V I+G I +L  FR  G  +D++ + ++  +  +L+++ E + +  +   +   + G
Sbjct: 53   ILSVEIFGAIRSLAQFRLTGSQKDYIVVGSDSGRIVILEYNKERN-VFDKIHQETFGKSG 111

Query: 61   -RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
             R    GQ   IDP  R  +IG      L  V+  D   +L  +  +   +   +     
Sbjct: 112  CRRIVPGQYLAIDPKGRAVMIGACEKQKLVYVLNRDTAARLTISSPLEAHKSHTIVYSIC 171

Query: 118  ---YGCAKPTIVVLYQDNKDARHVKTYEVALKDK------------DFVEGPWSQNNLDN 162
                G   P    +  D  +A    T + A + +            + V   WS+  +DN
Sbjct: 172  GVDCGFDNPIFAAIELDYSEADLDSTGQAANEAQKVLTFYELDLGLNHVSRKWSE-QVDN 230

Query: 163  GADLLIPVP---PPLCGVLIIGEETIVYCS---ANAFKAIPIRPSITKAYGR--VDADGS 214
            GA++L+ VP       GVL+  E  ++Y +    +    IP R  +    G   V A   
Sbjct: 231  GANMLVTVPGGGDGPSGVLVCAENFVIYKNEGHPDVRAVIPRRADLPAERGVLIVSAATH 290

Query: 215  R------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSY 268
            R      +LL    G +  + + HE +KV  LKI+      + S++  +    ++  S +
Sbjct: 291  RQKSMFFFLLQTEYGDIFKVTLDHENDKVKELKIKYFDTIPVTSSMCVMKLGFLFAASEF 350

Query: 269  GDSQLIKLNL---QPDAKGSYVEVLE----------------------RYVNLGPIVDFC 303
            G+  L +      + D + S   ++E                      +  +L PI+D  
Sbjct: 351  GNHALYQFQAIGEEADVEASSATLMETEEGFQPVFFQPRGLKNLVRIDQVESLMPIMDMK 410

Query: 304  VVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSLRSSTDDPFD 362
            + +L  +   Q+ +  G     SLRI+R G+ I+E A  +L G+   +W+++ + +D FD
Sbjct: 411  IANLFDEETPQIFSLCGRGPRSSLRILRPGLAISEMAVSQLPGVPSAVWTVKKNVNDEFD 470

Query: 363  TFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSS 422
             +++VSF + T +L++   + +EE    GF   T +L       + L+QV    +R +  
Sbjct: 471  AYIIVSFNNATLVLSIG--ETVEEVNNSGFLDTTPSLAVSLIGDDSLMQVHPNGIRHIRE 528

Query: 423  TSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTEVKHAQLEYEIS 481
              R   NEW++P   ++    +N  QV++A  GG L+Y E+   G L EV+  ++  +++
Sbjct: 529  DGR--INEWRTPGKRTIVKVGSNRVQVVIALSGGELIYFEVDITGQLMEVEKHEMSGDVA 586

Query: 482  CLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PD--LNLITKEHLGGEIIPRSVLLCAF 538
            CLDI P+ E    S+  AVG + D ++RI SL PD  + +++ + +     P S+L    
Sbjct: 587  CLDIAPVPEGRQRSRFLAVGSF-DNTIRILSLDPDDCMQILSVQSVSSP--PESLLFLEV 643

Query: 539  EGIS------------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSK 586
            +               +L   L  G L   L++M TG+L+D +   LG +   L +   +
Sbjct: 644  QASVGREDVADHPASLFLNAGLQSGVLFRTLVDMVTGQLSDSRSRFLGLRAPKLFSILVR 703

Query: 587  NTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTI 646
                +   S RP + Y      L + ++ + +     F+S    + +       L I TI
Sbjct: 704  GRRAMLCLSSRPWLGYIHQGHFLLTPLSYETLEFAASFSSDQCAEGVVAVAGDALRIFTI 763

Query: 647  DDI-QKLHIRSIPLGEHPRRICHQEQSRTFAIC-SLKNQSCAEESE-------------- 690
            + + +  +  +IPL   PR+   Q + +   I  S +    AEE E              
Sbjct: 764  ERLGETFNETAIPLRYTPRKFVLQPKKKLLVIVESDQGAYTAEEREAAKKECFEAAGMGE 823

Query: 691  ------------------------------------MHFVRLLDDQTFEFISTYPLDTFE 714
                                                +  +R+LD +T        L   E
Sbjct: 824  NGSANAEQMENGDDEDKDDPLSDEQYGYPKAEAEKWVSCIRVLDPRTAATTCLLELQDNE 883

Query: 715  YGCSILSCSFSD-DSNVYYCVGTAYVLP--EENEPTKGRILVF-IVEDGK-LQLIAEKET 769
               S+ + +F D +      VGTA  L    +   + G I ++  V+DG+ L+L+ + + 
Sbjct: 884  AAFSVCTVNFHDKEHGTLLAVGTAKGLQFWPKRSLSAGFIHIYKFVDDGRALELLHKTQV 943

Query: 770  KGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY-VQTRGDFI 828
            +G   +L+ F G+LLA I   ++LY     D G + L  +C +     ++  +QT  D I
Sbjct: 944  EGVPLALSQFQGRLLAGIGPVLRLY-----DLGKKRLLRKCENKLFPNSIVSIQTYRDRI 998

Query: 829  VVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR--- 885
             VGD+ +S     Y+ +E  +   A D    W++A   +D D   GA+   N++ VR   
Sbjct: 999  YVGDIQESFHFCKYRRDENQLYIFADDCVPRWLTASHHVDFDTMAGADKFGNIYFVRLPQ 1058

Query: 886  ---------------KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQ 930
                           K  +G  +    ++E + ++H+G+ V      SL+        G 
Sbjct: 1059 DVSDEIEEDPTGGKIKWEQGKLNGAPNKVEEIVQFHIGDVVTSLSKASLI-------PGG 1111

Query: 931  IPTVIFGTVNGVIGVIASLPHEQYL-FLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTV 989
               +I+GTV G +G +        + F   L+ +LR+    + G +H  +RS        
Sbjct: 1112 GECIIYGTVMGSVGALLPFTSRDDVDFFSHLEMHLRQDHPPLCGRDHMAYRS-----AYF 1166

Query: 990  DAKNFLDGDLIESF----LDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
              K+ +DGDL E F    LD  R   DE+ +T      E+ K++EE+
Sbjct: 1167 PVKDVIDGDLCEQFPTLPLDAQRKIADELDRTPG----EILKKLEEV 1209


>gi|168064351|ref|XP_001784126.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162664326|gb|EDQ51050.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 1214

 Score =  242 bits (617), Expect = 9e-61,   Method: Compositional matrix adjust.
 Identities = 278/1189 (23%), Positives = 509/1189 (42%), Gaps = 188/1189 (15%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            +L V ++G I +L  FR  G  +D++ + ++  +  +L+++ E + L  +   +   + G
Sbjct: 52   LLSVEVFGAIRSLAQFRLTGSQKDYIVVGSDSGRIVILEYNKEKN-LFEKVHQETFGKSG 110

Query: 61   -RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
             R    GQ   +DP  R  +IG      L  V+  DN  +L  +  +   +   +     
Sbjct: 111  CRRIVPGQYLAVDPKGRAVMIGACEKQKLVYVLNRDNAARLTISSPLEAHKSHTIVYSIT 170

Query: 118  ---YGCAKPTIVVLYQD-------------NKDARHVKTYEVALKDKDFVEGPWSQNNLD 161
                G   P    +  D             N+  +H+  YE+ L     V   W++  +D
Sbjct: 171  GVDCGFDNPIFAAIELDYSEADQDSTGQAANEAQKHLTFYELDLGLNHVVR-KWTEP-ID 228

Query: 162  NGADLLIPVP-----PPLCGVLIIGEETIVYCSAN---AFKAIPIRPSITKAYGRVDADG 213
            NGA++L+ VP     P   GVL+  E  ++Y + N       IP R  +    G +    
Sbjct: 229  NGANMLVTVPGGGDGP--SGVLVCAENFVIYKNQNQPDVRAVIPRRQDLPPNRGVLIVSA 286

Query: 214  SR--------YLLGDHAGLLHLLVITHEKE-KVTGLKIELLGETSIASTISYLDNAVVYI 264
            +         +LL    G +  + + ++K+ +VT LKI+      + S +  L    ++ 
Sbjct: 287  ATHKQKALFFFLLQTEYGDIFKVTLDYDKDDQVTELKIKYFDTIPVTSAMCVLKAGFLFA 346

Query: 265  GSSYGDSQLIK---LNLQPDAKGS---YVEVLERYV-------------------NLGPI 299
             S +G+  L +   +   P+ + S    VE  E Y                    +L PI
Sbjct: 347  ASEFGNHSLYQFQSIGEDPEVESSSSTLVETEEGYQPVFFQPRKLKNLVQIDDIESLMPI 406

Query: 300  VDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSLRSSTD 358
            +D  V +L  +   Q+ +  G     SLRI+R G+ + E A   L G+   +W+++   +
Sbjct: 407  MDMKVANLFEEETPQIFSLCGRGPRSSLRILRPGLAVTEMAVSPLPGVPSAVWTVKKHAN 466

Query: 359  DPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVR 418
            D FD ++VVSF++ T +L++   + +EE    GF   T +L       + L+QV    +R
Sbjct: 467  DEFDAYIVVSFVNATLVLSIG--ETVEEVSDSGFLDTTPSLAISLLGDDSLMQVHPSGIR 524

Query: 419  LVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTEVKHAQLE 477
             + +  R   NEWK+P   ++     N  QV++A  GG L+Y E+   G L E++   + 
Sbjct: 525  HIRADGR--INEWKTPGKKTIVKVGYNRMQVVIALSGGELIYFEMDMSGQLMEIEKRDMT 582

Query: 478  YEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PD--LNLITKEHLGGEIIPRSVL 534
             +++CLDI P+ E    S+  AVG + D ++RI SL PD  + +++ + +     P S+L
Sbjct: 583  GDVACLDIAPVPEGRQRSRFLAVGSY-DSTIRILSLDPDDCMQILSVQAVSSP--PESLL 639

Query: 535  LCAFEGIS------------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRT 582
            L   +  +            +L   L +G LL   ++M TG+L+D +   LG +   L +
Sbjct: 640  LLEVQASTGGEDGADHPASVFLNAGLQNGVLLRTEVDMVTGQLSDTRTRFLGLRAPKLFS 699

Query: 583  FSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELT 642
               +    +   S RP + Y      L + ++ + + +   F+S    + +       L 
Sbjct: 700  ALVRGRRAMLCLSSRPWLGYIHQGHFLLTPLSYETLEYAASFSSDQCAEGVVAVAGDALR 759

Query: 643  IGTIDDI-QKLHIRSIPLGEHPRR-ICHQEQSRTFAICSLKNQSCAEESEMH-------- 692
            + TI+ + +  +   +PL   PR+ I H +Q+    + S +     E+ E +        
Sbjct: 760  VFTIERLGETFNQTVVPLRYTPRKFILHPKQNSLIILESDQGAFNTEDREANKKEAVEAA 819

Query: 693  -----------------------------------------FVRLLDDQTFEFISTYPLD 711
                                                      +R+LD +T        L 
Sbjct: 820  GYHENGKADGEDEDMADGEDEEPLPDEQYGYPKAESNKWASCIRVLDPKTSTTTCLLELQ 879

Query: 712  TFEYGCSILSCSFSDDSNV--YYCVGTAYVLP--EENEPTKGRILVF-IVEDGK-LQLIA 765
              E   S+ + +F D+  +     VGTA  L    + E + G I ++   ++GK L+L+ 
Sbjct: 880  ENEAAFSLCAVNFHDNKELGTLIAVGTAKDLQFMPKKEASGGFIHIYRFADEGKVLELVH 939

Query: 766  EKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGH---ILALYVQ 822
            +    G   +L  F G+LL  + Q +++Y     D G R+L  +C +      I+A++  
Sbjct: 940  KTPVDGVPTALCQFQGRLLVGVGQVLRIY-----DLGKRKLLRKCENKNFPNTIIAIH-- 992

Query: 823  TRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLF 882
            T GD I VGD+ +S   + Y+ +E  +   A D    W++A   +D D   GA+   N++
Sbjct: 993  TYGDRIYVGDIQESFHYVKYRRDENQLYTFADDSCPRWLTASLHIDFDTMAGADKFGNIY 1052

Query: 883  TVRKNSE-----------GATDEERGRL-------EVVGEYHLGEFVNRFRHGSLVMRLP 924
             +R   +           G    E+GRL       E + ++H+GE V   +  SL+    
Sbjct: 1053 VMRLPQDVSEEIEDDPTGGKIKWEQGRLNGAPNKVEEIIQFHVGEVVTSLQKASLI---- 1108

Query: 925  DSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFN 983
                G   +V++GT+ G +G +      E   F   L+ +LR+    + G +H  +RS  
Sbjct: 1109 ---PGGGESVLYGTIMGSMGALLPFSSREDVDFFSHLEMHLRQENPPLCGRDHMGFRS-- 1163

Query: 984  NEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
                    K+ +DGDL E +  L+     +I+  ++ +  E+ K++E++
Sbjct: 1164 ---AYFPVKDVIDGDLCEQYPMLTSELQKKIADDLDRTPGEILKKLEDI 1209


>gi|449459948|ref|XP_004147708.1| PREDICTED: splicing factor 3B subunit 3-like [Cucumis sativus]
 gi|449513493|ref|XP_004164340.1| PREDICTED: splicing factor 3B subunit 3-like [Cucumis sativus]
          Length = 1214

 Score =  241 bits (614), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 277/1189 (23%), Positives = 503/1189 (42%), Gaps = 188/1189 (15%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            +L V I+G I +L  FR  G  +D++ + ++  +  +L+++ + + +  +   +   + G
Sbjct: 52   LLSVEIFGAIRSLAQFRLTGSQKDYIVVGSDSGRIVILEYNKDKN-VFDKIHQETFGKSG 110

Query: 61   -RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
             R    GQ   IDP  R  +IG      L  V+  D   +L  +  +   +   +     
Sbjct: 111  CRRIVPGQYLAIDPKGRAVMIGACEKQKLVYVLNRDTAARLTISSPLEAHKSHTIVYSIC 170

Query: 118  ---YGCAKPTIVVLYQDNKDA-------------RHVKTYEVALKDKDFVEGPWSQNNLD 161
                G   P    +  D  +A             +H+  YE+ L   + V   WS+  +D
Sbjct: 171  GIDCGFDNPIFAAIELDYSEADQDSTGVAASEAQKHLTFYELDLG-LNHVSRKWSEP-VD 228

Query: 162  NGADLLIPVP---PPLCGVLIIGEETIVYCS---ANAFKAIPIRPSITKAYGRVDADGSR 215
            NGA++L+ VP       GVL+  E  ++Y +    +    IP R  +    G +    + 
Sbjct: 229  NGANMLVTVPGGGDGPSGVLVCAENFVIYKNQGHPDVRAVIPRRADLPAERGVLIVSAAM 288

Query: 216  --------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSS 267
                    +LL    G +  + + H  + V  LKI+      + +++  L +  ++  S 
Sbjct: 289  HKQKTMFFFLLQTEYGDIFKVTLEHNNDSVKELKIKYFDTIPVTASMCVLKSGFLFAASE 348

Query: 268  YGDSQLIKLNL---QPDAKGSYVEVLE----------------------RYVNLGPIVDF 302
            +G+  L +        D + S   ++E                      +  +L PI+D 
Sbjct: 349  FGNHSLYQFQAIGEDADVESSSATLMETEEGFQPVFFQPRRLKNLMRIDQVESLMPIMDM 408

Query: 303  CVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSLRSSTDDPF 361
             +++L  +   Q+ T  G     SLRI+R G+ I+E A  EL G+   +W+++ + +D F
Sbjct: 409  KIINLFEEETPQIFTLCGRGPRSSLRILRPGLAISEMAVSELPGVPSAVWTVKKNINDEF 468

Query: 362  DTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVS 421
            D ++VVSF + T +L++   + +EE    GF   T +L       + L+QV    +R + 
Sbjct: 469  DAYIVVSFANATLVLSIG--ETVEEVSDSGFLDTTPSLAVSLIGDDSLMQVHPNGIRHIR 526

Query: 422  STSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTEVKHAQLEYEI 480
               R   NEW++P   ++    +N  QV++A  GG L+Y E+   G L EV+  ++  ++
Sbjct: 527  EDGR--INEWRTPGKRTIVKVGSNRLQVVIALSGGELIYFEVDMTGQLMEVEKHEMSGDV 584

Query: 481  SCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PD--LNLITKEHLGGEIIPRSVL--- 534
            +CLDI P+ E    S+  AVG + D ++RI SL PD  + +++ + +     P S+L   
Sbjct: 585  ACLDIAPVPEGRQRSRFLAVGSY-DNTIRILSLDPDDCMQILSVQSVSAA--PESLLFLE 641

Query: 535  -LCAFEGIS--------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSS 585
             L +  G          +L  AL  G L   +++M TG+L+D +   LG +   L +   
Sbjct: 642  VLASVGGEDGADHPASLFLNAALHSGVLFRTVVDMVTGQLSDSRSRFLGLRAPKLFSVVL 701

Query: 586  KNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGT 645
            +    +   S RP + Y      L + ++ + + +   F+S    + +       L + T
Sbjct: 702  RGRRAILCLSSRPWLGYIHQGHFLLTPLSYETLEYASSFSSDQCAEGVVAVAGNFLRVFT 761

Query: 646  IDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAIC-SLKNQSCAEESEM------------ 691
            I+ + +  +   IPL   PR+   Q + +   +  S +    AEE E             
Sbjct: 762  IERLGETFNETVIPLRYTPRKFVLQPRRKLLVVIESDQGAFTAEEREAAKKECFEAAGAG 821

Query: 692  --------------------------HF-------------VRLLDDQTFEFISTYPLDT 712
                                      H+             +R+LD ++        L  
Sbjct: 822  ENGNGTMDQMENGGDDEDKDDPLSDEHYGYPKAESEKWVSCIRVLDPRSATTTCLLELQD 881

Query: 713  FEYGCSILSCSFSD-DSNVYYCVGTAYVLP--EENEPTKGRILVF-IVEDGK-LQLIAEK 767
             E   S+ + +F D +      VGTA  L    +     G I ++  +EDGK L+L+ + 
Sbjct: 882  NEAAFSVCTVNFHDKEYGTLLAVGTAKGLQFFPKRSLVAGYIHIYRFLEDGKSLELLHKT 941

Query: 768  ETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY-VQTRGD 826
            + +G   +L  F G+LLA +   ++LY     D G R L  +C +      +  +QT  D
Sbjct: 942  QVEGVPLALAQFQGRLLAGLGSVLRLY-----DLGKRRLLRKCENKLFPNTIVSIQTYRD 996

Query: 827  FIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR- 885
             I VGD+ +S     Y+ +E  +   A D    W++A   +D D   GA+   N++ VR 
Sbjct: 997  RIYVGDIQESFHYCKYRRDENQLYIFADDSVPRWLTASYHVDFDTMAGADKFGNIYFVRL 1056

Query: 886  -----------------KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV 928
                             K  +G  +    ++E + ++H+G+ V   +  SL+        
Sbjct: 1057 PQDVSDEIEEDPTGGKIKWEQGKLNGAPNKVEEIIQFHIGDVVTSLQKASLI-------P 1109

Query: 929  GQIPTVIFGTVNGVIGVI-ASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKK 987
            G    +++GTV G +G + A    +   F   L+ ++R+    + G +H  +RS      
Sbjct: 1110 GGGECILYGTVMGSLGALHAFTSRDDVDFFSHLEMHMRQEHPPLCGRDHMGYRS-----A 1164

Query: 988  TVDAKNFLDGDLIESF----LDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
                K+ +DGDL E F    LD+ R   DE+ +T      E+ K++EE+
Sbjct: 1165 YFPVKDVIDGDLCEQFPSLPLDMQRKIADELDRTPG----EILKKLEEV 1209


>gi|224100909|ref|XP_002312063.1| predicted protein [Populus trichocarpa]
 gi|222851883|gb|EEE89430.1| predicted protein [Populus trichocarpa]
          Length = 1213

 Score =  241 bits (614), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 273/1189 (22%), Positives = 500/1189 (42%), Gaps = 189/1189 (15%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            +L V I+G I +L  FR  G  +D++ + ++  +  +L+++ E + +  +   +   + G
Sbjct: 52   VLSVEIFGAIRSLAQFRLTGAQKDYIVVGSDSGRIVILEYNKERN-VFDKIHQETFGKSG 110

Query: 61   -RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
             R    GQ   +DP  R  +IG      L  V+  D   +L  +  +   +   +     
Sbjct: 111  CRRIVPGQYLAVDPKGRAVMIGACEKQKLVYVLNRDTVARLTISSPLEAHKSHTICYSVC 170

Query: 118  ---YGCAKPTIVVLYQDNKDARHVKTYEVALKDK------------DFVEGPWSQNNLDN 162
                G   P    +  D  +A    T + A + +            + V   WS+  +DN
Sbjct: 171  GVDCGFDNPIFAAIELDYSEADQDSTGQAAGEAQKNLTFYELDLGLNHVSRKWSE-QVDN 229

Query: 163  GADLLIPVP---PPLCGVLIIGEETIVYCS---ANAFKAIPIRPSITKAYGRVDADGSR- 215
            GA++L+ VP       G+L+  E  ++Y +    +    IP R  +    G +    +  
Sbjct: 230  GANMLVTVPGGGDGPSGILVCAENFVIYKNQGHPDVRAVIPRRADLPAERGVLIVSAATH 289

Query: 216  -------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSY 268
                   +LL    G +  + + HE +KV  LKI+      + S+I  L +  ++  S +
Sbjct: 290  KQKSMFFFLLQTEYGDIFKVTLDHENDKVKELKIKYFDTIPVTSSICVLKSGFLFAASEF 349

Query: 269  GDSQLIKLNL---QPDAKGSYVEVLE----------------------RYVNLGPIVDFC 303
            G+  L +      + D + S   ++E                      +  +L P++D  
Sbjct: 350  GNHALYQFQAIGEEEDVEASSATLMETEEGFQPVFFQPRGLKNLVRIDQVESLMPVMDMK 409

Query: 304  VVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSLRSSTDDPFD 362
            V ++  +   Q+ +  G     SLRI+R G+ I+E A  +L G+   +W+++ + +D FD
Sbjct: 410  VANIFDEETPQIFSLCGRGPRSSLRILRPGLAISEMAVSQLPGVPSAVWTVKRNANDEFD 469

Query: 363  TFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSS 422
             ++VVSF + T +L++   + +EE    GF   T +L       + L+Q+    +R +  
Sbjct: 470  AYIVVSFNNATLVLSIG--ETVEEVSDSGFLDTTPSLAVSLIGDDSLMQIHPNGIRHIRE 527

Query: 423  TSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTEVKHAQLEYEIS 481
              R   NEW++P   ++    +N  QV++A  GG L+Y E+   G L EV+  ++  +++
Sbjct: 528  DGR--INEWRTPGKRTIVKVGSNRLQVVIALSGGELIYFEVDMTGQLMEVEKHEMSGDVA 585

Query: 482  CLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PD--LNLITKEHLGGEIIPRSVLLCAF 538
            CLDI P+ E    S+  AVG + D ++R+ SL PD  + +++ + +     P S+L    
Sbjct: 586  CLDIAPVPEGRQRSRFLAVGSY-DNTIRVLSLDPDDCMQILSVQSVSAP--PESLLFLEV 642

Query: 539  EGIS------------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSK 586
            +               +L   L  G L   +++M TG+L+D +   LG +   L   + +
Sbjct: 643  QASIGGEDGADHPASLFLNAGLQTGVLFRTVVDMVTGQLSDSRSRFLGLRAPKLFAINVR 702

Query: 587  NTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTI 646
                +   S RP + Y      L + ++ + + +   F+S    + +       L I TI
Sbjct: 703  GRRAMLCLSSRPWLGYIHQGHFLLTPLSYETLEYAASFSSDQCAEGVVAVAGDALRIFTI 762

Query: 647  DDI-QKLHIRSIPLGEHPRRICHQEQSRTFAIC-SLKNQSCAEESE-------------- 690
            + + +  +  +IPL   PR+   Q + +   I  S +    AEE E              
Sbjct: 763  ERLGETFNETAIPLRYTPRKFVLQPKRKLLVIIESDQGAYTAEEREAAKKECFEAAGMGE 822

Query: 691  ------------------------------------MHFVRLLDDQTFEFISTYPLDTFE 714
                                                +  +R+LD ++        L   E
Sbjct: 823  NGSANAEKMENGDDDDKDDPLSDEQYGYPKAEADRWVSCIRVLDPRSATTTCLLELQDNE 882

Query: 715  YGCSILSCSFSD-DSNVYYCVGTAYVLPEENEPTKGRILVFI-----VEDGK-LQLIAEK 767
               S+ + +F D +      VGTA  L  +  P +  I  FI     V+DGK L+L+ + 
Sbjct: 883  AAFSVCTVNFHDKEHGTLLAVGTAKGL--QFWPKRSLIAGFIHIYKFVDDGKSLELLHKT 940

Query: 768  ETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY-VQTRGD 826
            + +G   +L  F G+LLA I   ++LY     D G + L  +C +     ++  + T  D
Sbjct: 941  QVEGVPLALCQFQGRLLAGIGSVLRLY-----DLGKKRLLRKCENKLFPNSIVSIHTYRD 995

Query: 827  FIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR- 885
             I VGD+ +S     Y+ +E  +   A D    W++A   +D D   GA+   N++ VR 
Sbjct: 996  RIYVGDIQESFHFCKYRRDENQLYIFADDSVPRWLTASYHVDFDTMAGADKFGNIYFVRL 1055

Query: 886  -----------------KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV 928
                             K  +G  +    ++E + ++H+G+ VN  +  SL+        
Sbjct: 1056 PQDVSDEIEEDPTGGKIKWEQGKLNGAPNKVEEIVQFHIGDVVNSLQKASLI-------P 1108

Query: 929  GQIPTVIFGTVNGVIGVIASLPHEQYL-FLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKK 987
            G    +++GTV G +G +        + F   L+ +LR+    + G +H  +RS      
Sbjct: 1109 GGGECIMYGTVMGSVGALLPFTSRDDVDFFSHLEMHLRQDHPPLCGRDHMAYRS-----A 1163

Query: 988  TVDAKNFLDGDLIESF----LDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
                K+ +DGDL E F    LD  R   DE+ +T      E+ K++EE+
Sbjct: 1164 YFPVKDVIDGDLCEQFPTLPLDAQRKIADELDRTPG----EILKKLEEV 1208


>gi|168046759|ref|XP_001775840.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162672847|gb|EDQ59379.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 1214

 Score =  241 bits (614), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 275/1191 (23%), Positives = 508/1191 (42%), Gaps = 192/1191 (16%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            +L V I+G I +L  FR  G  +D++ + ++  +  +L+++ E + +  +   +   + G
Sbjct: 52   LLSVEIFGAIRSLAQFRLTGSQKDYIVVGSDSGRIVILEYNKEKN-IFEKVHQETFGKSG 110

Query: 61   -RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
             R    GQ   +DP  R  +IG      L  V+  DN  +L  +  +   +   +     
Sbjct: 111  CRRIVPGQYLAVDPKGRAVMIGACEKQKLVYVLNRDNAARLTISSPLEAHKSHTIVYSIT 170

Query: 118  ---YGCAKPTIVVLYQD-------------NKDARHVKTYEVALKDKDFVEGPWSQNNLD 161
                G   P    +  D             N+  +H+  YE+ L     V   W++  +D
Sbjct: 171  GVDCGFDNPIFAAIELDYSEADQDSTGQAANEAQKHLTFYELDLGLNHVVR-KWTEP-ID 228

Query: 162  NGADLLIPVP-----PPLCGVLIIGEETIVYCSANA---FKAIPIRPSITKAYGRVDADG 213
            NGA++L+ VP     P   GVL+  E  ++Y + N       IP R  +    G +    
Sbjct: 229  NGANMLVTVPGGGDGP--SGVLVCAENFVIYKNQNQPDIRAVIPRRQDLPPNRGVLIVSA 286

Query: 214  SR--------YLLGDHAGLLHLLVITHEKE-KVTGLKIELLGETSIASTISYLDNAVVYI 264
            +         +LL    G +  + + ++K+ +VT LKI+      + S +  L +  ++ 
Sbjct: 287  ATHKQKALFFFLLQTEYGDIFKVTLDYDKDDQVTELKIKYFDTIPVTSAMCVLKSGFLFA 346

Query: 265  GSSYGDSQLIKLN--------------------------LQPDAKGSYVEVLERYVNLGP 298
             S +G+  L +                             QP    + V++ +   +L P
Sbjct: 347  ASEFGNHSLYQFQSIGEEPEVESSSSTLVETEEGYQPIFFQPRKLKNLVQI-DDIESLMP 405

Query: 299  IVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSLRSST 357
            I+D  V ++  +   Q+ +  G     SLRI+R G+ + E A   L G+   +W+++   
Sbjct: 406  IMDMKVANIFEEETPQIFSLCGRGPRSSLRILRPGLAVTEMAVSPLPGVPSAVWTVKKHA 465

Query: 358  DDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSV 417
            +D FD ++VVSF++ T +L++   + +EE    GF   T +L       + L+QV    +
Sbjct: 466  NDEFDAYIVVSFVNATLVLSIG--ETVEEVSDSGFLDTTPSLAISLLGDDSLMQVHPSGI 523

Query: 418  RLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTEVKHAQL 476
            R + +  R   NEWK+P   ++     N  QV++A  GG L+Y E+   G L E++   +
Sbjct: 524  RHIRADGR--INEWKTPGKKTIVKVGYNRMQVVIALSGGELIYFEMDMSGQLMEIEKRDM 581

Query: 477  EYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PD--LNLITKEHLGGEIIPRSV 533
              +++CL I P+ E    S+  AVG + D ++RI SL PD  + +++ + +     P S+
Sbjct: 582  TGDVACLHIAPVPEGRQRSRFLAVGSY-DSTIRILSLDPDDCMQILSVQAVSSP--PESL 638

Query: 534  LLCAFEGIS------------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLR 581
            LL   +  +            +L   L +G LL   ++M TG+L+D +   LG +   L 
Sbjct: 639  LLLEVQASTGGEDGADHPASVFLNAGLQNGVLLRTEVDMVTGQLSDTRTRFLGLRAPKLF 698

Query: 582  TFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGEL 641
            +   +    +   S RP + Y      L + ++ + + +   F+S    + +       L
Sbjct: 699  SARVRGRRAMLCLSSRPWLGYIHQGHFLLTPLSYETLEYAASFSSDQCAEGVVAVAGDAL 758

Query: 642  TIGTIDDI-QKLHIRSIPLGEHPRR-ICHQEQSRTFAICSLKNQSCAEESEMH------- 692
             + TI+ + +  +   +PL   PR+ I H +Q     + S       EE E +       
Sbjct: 759  RVFTIERLGETFNQTVVPLRYTPRKFILHPKQKSLIILESDLGAFSTEEREANKKEAVKA 818

Query: 693  ------------------------------------------FVRLLDDQTFEFISTYPL 710
                                                       +R+LD +T        L
Sbjct: 819  TGGRDDEKANGEDEEMADGENEDPLPDEQYGYPKAESNKWASCIRVLDPKTSTTTCLLEL 878

Query: 711  DTFEYGCSILSCSFSDDSNV--YYCVGTA---YVLPEENEPTKGRILVF-IVEDGK-LQL 763
               E   S+ + +F D+  +     VGTA     +P++ E + G I ++  VE+G+ L+L
Sbjct: 879  QENEAAFSLCAVNFHDNKELGTLIAVGTAKNMQFMPKK-ESSGGFIHIYRFVEEGRILEL 937

Query: 764  IAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGH---ILALY 820
            + +    G   +L  F G+LL  + Q +++Y     D G R+L  +C +      I+A++
Sbjct: 938  VHKTPVDGVPTALCQFQGRLLVGVGQVLRIY-----DLGKRKLLRKCENKNFPNTIIAIH 992

Query: 821  VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFN 880
              T GD I VGD+ +S   + Y+ +E  +   A D    W++A   +D D   GA+   N
Sbjct: 993  --TYGDRIYVGDIQESFHYVKYRRDENQLYTFADDSCPRWLTASLHIDFDTMAGADKFGN 1050

Query: 881  LFTVRKNSE-----------GATDEERGRL-------EVVGEYHLGEFVNRFRHGSLVMR 922
            ++ +R   +           G    E+GRL       E + ++H+GE V   +  SL+  
Sbjct: 1051 VYVMRLPQDVSEEIEDDPTGGKIKWEQGRLNGAPNKVEEIIQFHVGEVVTSLQKASLI-- 1108

Query: 923  LPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRS 981
                  G   +V++GT+ G +G +      E   F   L+ +LR+    + G +H  +RS
Sbjct: 1109 -----PGGGESVLYGTIMGSVGALLPFSSREDVDFFSHLEMHLRQENPPLCGRDHMAFRS 1163

Query: 982  FNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
                      K+ +DGDL E +  L+     +I+  ++ +  E+ K++E++
Sbjct: 1164 -----AYFPVKDVIDGDLCEQYPMLTSELQRKIADDLDRTPGEVLKKLEDI 1209


>gi|380490733|emb|CCF35810.1| pre-mRNA-splicing factor rse-1 [Colletotrichum higginsianum]
          Length = 1212

 Score =  240 bits (612), Expect = 4e-60,   Method: Compositional matrix adjust.
 Identities = 280/1168 (23%), Positives = 493/1168 (42%), Gaps = 166/1168 (14%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
            I+G I ++  FR  G  +D+L +AT+  +  ++++    +      +        R    
Sbjct: 64   IFGIIRSMAAFRLAGSNKDYLILATDSGRITIIEYIPAQNRFQRLHLETFGKSGVRRVIP 123

Query: 66   GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YGC 120
            G+    DP  R  LI     + L  V+  +++ +L  +  +   +  VL +  +    G 
Sbjct: 124  GEYLACDPKGRACLIASVEKNKLVYVLNRNSQAELTISSPLEAHKPGVLVLSMVALDVGY 183

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDK------------DFVEGPWSQNNLDNGADLLI 168
            A P    L  +  +A    T E A + +            + V   WS++ +D  A +L 
Sbjct: 184  ANPVFAALEIEYTEADQDPTGEAAREAETQLVYYELDLGLNHVVRKWSES-VDPTASMLF 242

Query: 169  PVP-----PPLCGVLIIGEETIVYCSAN--AFKA-IPIRPSITK--AYGRVDADGSRYLL 218
             VP     P   GVL+ GEE I Y  +N  AF+  IP R   T+  +  R    G  + L
Sbjct: 243  QVPGGQDGP--SGVLVCGEENITYRHSNQEAFRVPIPRRRGATEDPSRKRHAVSGVMHKL 300

Query: 219  GDHAGLLHLLVITHEKE------------------KVTGLKIELLGETSIASTISYLDNA 260
               AG    L+ T + +                  +V  LKI+      ++S++  L + 
Sbjct: 301  KGSAGAFFFLLQTEDGDLFKATLDMVEDTDGNPTGEVKRLKIKYFDTIPVSSSLCILKSG 360

Query: 261  VVYIGSSYGDSQLIKL---------------NLQPDAKGSY------------VEVLERY 293
             +Y  S +G+ Q  +                +   D K  Y            + ++E  
Sbjct: 361  FLYAASQFGNHQFYQFEKLGDDDDELEFSSDDFPTDPKAGYDAVYFHPRPLENLALVESI 420

Query: 294  VNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWS 352
             ++ P++D  V +L  +   Q+ T  G     + R++++G+ +NE  + EL GI   +W+
Sbjct: 421  DSMNPLLDCKVANLTGEDAPQIYTACGNGARSTFRMLKHGLEVNEIVASELPGIPSAVWT 480

Query: 353  LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
            L+ +  D +D ++V+SF + T +L++   + +EE    GF +   TL       + L+QV
Sbjct: 481  LKLNRGDQYDAYIVLSFTNGTLVLSIG--ETVEEVSDSGFLTSVPTLAAQLLGEDGLIQV 538

Query: 413  TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTEV 471
                +R +        NEW +P   S+  AT N  QV +A   G +VY E+ GDG L E 
Sbjct: 539  HPKGIRHIRHGHV---NEWAAPQHRSIVAATTNEHQVAVALSSGEIVYFEMDGDGSLAEY 595

Query: 472  KHAQLEY-EISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEII 529
               +  +  ++CL +  + E    S   AVG   D +VR+ SL P+  L +K        
Sbjct: 596  DEKKEMFGTVTCLSLGEVPEGRLRSSFLAVGC-DDSTVRVLSLDPESTLESKSVQALTAP 654

Query: 530  PRSVLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTF 583
            P S+ + A +  S      YL   L  G  L  +L+  TGELTD ++  LG +P+ L   
Sbjct: 655  PSSLAIIAMDDSSSGGSTLYLHIGLHSGVYLRTVLDEITGELTDTRQKFLGPKPVRLFQV 714

Query: 584  SSKNTTHVFAASDRPTVIYSS--NKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGEL 641
            + +  T V   S RP + YS    +  L + +N  ++  +  F+S    + +   +   L
Sbjct: 715  TVQGRTCVIGLSSRPWLGYSDPITRGFLVTPLNYVDLEWVWNFSSEQCEEGVVGIQGQSL 774

Query: 642  TIGTIDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFV------ 694
             I  I+++   +  +SIPL   PRR+    +   F      N +   +     +      
Sbjct: 775  RIFAIENLGDTITQKSIPLSYTPRRLLKHPEHPMFYTIEADNNTLPPDLRAKLIAEPGVV 834

Query: 695  ----RLLDDQTFEF-----------------------ISTYPLDTFEYGCSILSCSF-SD 726
                R+L    F +                       + T  LD  E   S    SF S 
Sbjct: 835  NGDARILPPDEFGYPKGKGRWASCISVIDPLSEEQRVLQTVDLDNNEAAVSAAIVSFASQ 894

Query: 727  DSNVYYCVGTAY-VLPEENEPTKGRILVF-IVEDG-KLQLIAEKETKGAVYSLNAFNGKL 783
            D+  +  VGT   ++    + ++G I V+   EDG +L+ I + + +    +L  F G+L
Sbjct: 895  DNESFLIVGTGKDMIVNPRQFSEGYIHVYRFSEDGHELEFIHKTKVEEPPSALLGFQGRL 954

Query: 784  LAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYK 843
            LA I Q +++Y   LR    R+ Q++       L + + T+G  I+VGD+   I+ ++YK
Sbjct: 955  LAGIGQTLRIYDLGLR-QMLRKAQADVAPQ---LIVSLSTQGSRIIVGDVQHGITYVVYK 1010

Query: 844  HEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDEERG--- 897
                 +     D  + W++   ++D +  +G +   N+F VR   K S+ A +E  G   
Sbjct: 1011 PTTNKLIPFVDDTISRWVTCTTMVDYESVVGGDKFGNIFLVRCPEKASQEADEESGGLHL 1070

Query: 898  ------------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGV 945
                        RL ++G  +  +        SLV       VG    +++  +NG IGV
Sbjct: 1071 LNTRDYLHGTPHRLSLLGHSYTQDVPTSITKTSLV-------VGGQDVLLWSGINGTIGV 1123

Query: 946  -IASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFL 1004
             I  +  E   F + L+ ++R     + G +H  +RS+      V  K  +DGDL E + 
Sbjct: 1124 FIPFVTREDVDFFQNLEQHMRTEDAPLAGRDHLMYRSY-----YVPVKGVIDGDLCERYT 1178

Query: 1005 DLSRTRMDEISKTMNVSVEELCKRVEEL 1032
             L   +   I+  ++ SV E+ +++ ++
Sbjct: 1179 LLPSEKKQMIAGELDRSVREIERKISDI 1206


>gi|302773427|ref|XP_002970131.1| hypothetical protein SELMODRAFT_171237 [Selaginella moellendorffii]
 gi|300162642|gb|EFJ29255.1| hypothetical protein SELMODRAFT_171237 [Selaginella moellendorffii]
          Length = 1207

 Score =  239 bits (611), Expect = 4e-60,   Method: Compositional matrix adjust.
 Identities = 280/1179 (23%), Positives = 500/1179 (42%), Gaps = 175/1179 (14%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            +L V I+G + +L  FR  G  +D+L + ++  K  +L+++ + +               
Sbjct: 52   LLAVEIFGVVRSLAQFRLTGSHKDYLVVGSDSGKIVILEYNRDRNAFDPVHKETFGKSGC 111

Query: 61   RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL- 117
            R    GQ   +DP  R  +I       L  V+  DN  +L  +  +   +   +    + 
Sbjct: 112  RRIVPGQFLSVDPKGRAVMIAATEKQKLVYVLNRDNAARLTISSPLEAHKSHTVVYSLVG 171

Query: 118  --YGCAKPTIVVLYQDNKDA-------------RHVKTYEVALKDKDFVEGPWSQNNLDN 162
               G   P    +  D  DA             +H+  YE+ L     V   WS+  +DN
Sbjct: 172  VDCGFENPMFAAIELDYSDADQDATGQAALEAQKHLTFYELDLGLNHVVR-KWSEP-IDN 229

Query: 163  GADLLIPVPPPL---CGVLIIGEETIVYCS-ANAFKA-IPIRPSITKAYGRVDADGSR-- 215
            GA++L+ VP       GVL+  +  ++Y + +   +A IP R  +    G +    S   
Sbjct: 230  GANMLVTVPGGADGPSGVLVCADNFVIYKNQSQELRALIPRRADLPADRGVLVVSASTHR 289

Query: 216  ------YLLGDHAGLLHLLVITHEKEK--VTGLKIELLGETSIASTISYLDNAVVYIGSS 267
                  +LL    G +  + ++       VT LKI+      + S +  L +  ++  S 
Sbjct: 290  QKSVFFFLLQTEYGDVFKVTLSFNDTDMVVTELKIKYFDTFPVTSAMCVLKSGFLFAASE 349

Query: 268  YGDSQLIK---LNLQPDAKGSYVEV----------------------LERYVNLGPIVDF 302
            +G+  L +   +   PD + S   +                      ++   +L PI+D 
Sbjct: 350  FGNHALYQFQGIGDDPDVESSSATLEETEDGFKPIFFQPRKLKNLIQIDEGESLMPIMDM 409

Query: 303  CVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSLRSSTDDPF 361
             + +L  +   Q+    G     +LRI+R G+ ++E A  +L G+   +W+++ S  D F
Sbjct: 410  KIANLFEEETPQIFVACGRGPRSTLRILRPGLAVSEMAVSQLPGVPSAVWTVKKSVTDEF 469

Query: 362  DTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVS 421
            D ++VVSF++ T +L++   + +EE    GF   T +L       + L+QV    +R + 
Sbjct: 470  DAYIVVSFLNATLVLSIG--ETVEEVSDSGFLDTTPSLQVSLLGDDSLMQVYPNGIRHIR 527

Query: 422  STSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD-GILTEVKHAQLEYEI 480
            S  R   NEWK+P   ++    +N  QV++   GG L+Y E+   G L E++  ++  ++
Sbjct: 528  SLVR--INEWKTPGKKTIVKVGSNRMQVVIGLSGGELIYFEMDPTGQLMEIEKREMSGDV 585

Query: 481  SCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PD--LNLITKEHLGGEIIPRSVLL-- 535
            +CL I P+ E    S+  AVG + D ++RI SL P+  +++++ + +     P S+LL  
Sbjct: 586  ACLAIGPVPEGRQRSRFLAVGSY-DNTIRILSLDPNDCMHILSVQMVASP--PESLLLLE 642

Query: 536  ----------CAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSS 585
                        +    +L   L +G LL   ++M TG+L+  +   LG +   L   S 
Sbjct: 643  VQASTGGEDGAVYPASVFLNAGLQNGVLLRTEVDMVTGQLSGTRTRFLGLRAPKLFATSV 702

Query: 586  KNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGT 645
            +    +   S RP + Y      L + ++ + + +   F+S    + +       L + T
Sbjct: 703  RGRQALLCLSSRPWLGYIHQGHFLLTPLSYETLEYAASFSSDQCAEGVVAVAGEALRVFT 762

Query: 646  IDDI-QKLHIRSIPLGEHPRR-ICH---------------------QEQSRTFA-----I 677
            I+ + +  +  SI L   PR+ I H                     +  SR+ A     +
Sbjct: 763  IERLGETFNQTSIQLRYTPRKFILHPKRKFLVLIETDQGTFTTEEREHMSRSAAAQDNGM 822

Query: 678  CSLKNQSCAEESEM----------------HFVRLLDDQTFEFISTYPLDTFEYGCSILS 721
              + NQ  +EE  M                  +R+LD +         L   E   SI  
Sbjct: 823  MEIDNQEKSEEEPMPDEQYGYPKTDSRRWVSCIRVLDPKAAATTCLLELQENEAAFSICC 882

Query: 722  CSFSDDSNV--YYCVGTAYVLPEENEPTKGRILVFI-----VEDGK-LQLIAEKETKGAV 773
             +F D+ N+     VGTA  L  E  P +  +  FI     VEDG+ L+L+ +    G  
Sbjct: 883  VNFHDNKNLGTVLAVGTAKDL--EWWPKRRSMGGFIHIYRFVEDGRSLELVHKTPIDGVP 940

Query: 774  YSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY-VQTRGDFIVVGD 832
             +L  F G+LLA I Q +++Y     D G R+L  +C +      +  + + GD I VGD
Sbjct: 941  TALCQFQGRLLAGIGQILRIY-----DLGKRKLLRKCENKNFPNTITSIHSYGDRIYVGD 995

Query: 833  LMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSE--- 889
            + +S   + Y+ +E  +   A D +  W++A   +D D     +   NLF VR   +   
Sbjct: 996  IQESFHYVKYRRDENQLYAFADDSSPRWLTASLHIDFDTMAAGDKFGNLFFVRLPQDLSE 1055

Query: 890  --------GATDEERGRL-------EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTV 934
                    G    E+GRL       E + ++H+GE V   +  SL+        G   +V
Sbjct: 1056 EIEDDPTGGKIKWEQGRLNGAPNKVEEIIQFHVGEVVTCMQKASLI-------PGGGESV 1108

Query: 935  IFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKN 993
            I+GTV G +G +      E   F   L+ ++R+    + G +H  +RS          K+
Sbjct: 1109 IYGTVMGSVGALLPFSSREDVDFFSHLEMHMRQEHPPLCGRDHMAFRS-----AYFPVKD 1163

Query: 994  FLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
             +DGDL E +  L      +I++ ++ +  E+ K++E++
Sbjct: 1164 VIDGDLCEQYPTLPPDLQRKIAEELDRTPGEVMKKLEDI 1202


>gi|356536504|ref|XP_003536777.1| PREDICTED: splicing factor 3B subunit 3-like [Glycine max]
          Length = 1214

 Score =  239 bits (611), Expect = 5e-60,   Method: Compositional matrix adjust.
 Identities = 281/1191 (23%), Positives = 497/1191 (41%), Gaps = 192/1191 (16%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            +L V I+G I +L  FR  G  +D++ + ++  +  +L+++ E + +  +   +   + G
Sbjct: 52   ILSVEIFGAIRSLAQFRLMGAQKDYIVVGSDSGRIIILEYNKEKN-VFDKIHQETFGKSG 110

Query: 61   -RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
             R    GQ   IDP  R  +IG      L  V+  D   +L  +  +   +   L     
Sbjct: 111  CRRIVPGQYLAIDPKGRAVMIGACEKQKLVYVLNRDTAARLTISSPLEAHKSHTLVYSIC 170

Query: 118  ---YGCAKPTIVVLYQDNKDA-------------RHVKTYEVALKDKDFVEGPWSQNNLD 161
                G   P    +  D  +A             +H+  YE+ L   + V   WS+  +D
Sbjct: 171  GVDCGFENPIFAAIELDYSEADQDSTGQAASEAQKHLTFYELDL-GLNHVSRKWSEQ-VD 228

Query: 162  NGADLLIPVP---PPLCGVLIIGEETIVYCS---ANAFKAIPIRPSITKAYGRVDADGSR 215
            NGA+LL+ VP       GVL+  E  ++Y +         IP R  +    G +    + 
Sbjct: 229  NGANLLVTVPGGGDGPSGVLVCAENFVIYKNQGHPEVRAVIPRRADLPAERGVLIVSAAM 288

Query: 216  --------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSS 267
                    +LL    G +  + + H  ++V+ LKI+      + +++  L +  ++  S 
Sbjct: 289  HKLKNMFFFLLQTEYGDIFKVTLEHNNDRVSELKIKYFDTIPVTASMCVLKSGFLFAASE 348

Query: 268  YGDSQLIKLN--------------------------LQPDAKGSYVEVLERYVNLGPIVD 301
            +G+  L +                             QP    + V + ++  +L PI+D
Sbjct: 349  FGNHALYQFKSIGDDDDVEASSATLMETEEGFQPVFFQPRRLKNLVRI-DQVESLMPIMD 407

Query: 302  FCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSLRSSTDDP 360
              V +L  +   Q+ T  G     SLRI+R G+ ++E A  +L GI   +W+++ +  D 
Sbjct: 408  MKVSNLFEEETPQIYTLCGRGPRSSLRILRTGLAVSEMAVSKLPGIPSAVWTVKKNVIDE 467

Query: 361  FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLV 420
            FD ++VVSF + T +L++   + +EE    GF   T +L       + L+QV    +R +
Sbjct: 468  FDAYIVVSFTNATLVLSIG--ETVEEVSDSGFLDTTPSLAVSLIGDDSLMQVHPNGIRHI 525

Query: 421  SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTEVKHAQLEYE 479
                R   NEW++P   S++   +N  QV++A  GG L+Y E+   G L EV+  ++  +
Sbjct: 526  REDGR--INEWRTPGKRSISKVGSNTLQVVIALSGGELIYFEMDVTGQLMEVEKHEMSGD 583

Query: 480  ISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPRSVLLCAF 538
            ++CLDI P+ E    S+  AVG + D ++RI SL PD  +           P S+L    
Sbjct: 584  VACLDIAPVPEGRQRSRFLAVGSY-DKTIRILSLDPDDCMQALSVQSVSSAPESLLFLEV 642

Query: 539  EGIS------------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSK 586
            +               +L   L +G +   +++M TG+L+D +   LG +   L     +
Sbjct: 643  QASVGGEDGADHPASLFLNAGLQNGVMFRTVVDMVTGQLSDSRSRFLGLRAPKLFPIIVR 702

Query: 587  NTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTI 646
                +   S RP + Y      L + ++ + + +   F+S    + +       L I TI
Sbjct: 703  GKRAMLCLSSRPWLGYIHQGHFLLTPLSYETLEYAASFSSDQCVEGVVAVAGEALRIFTI 762

Query: 647  DDI-QKLHIRSIPLGEHPRRICHQEQSRTFA-ICSLKNQSCAEESEM------------- 691
            + + +  +   IPL   PR+   Q + +    I S +    AEE E              
Sbjct: 763  ERLGETFNETVIPLRYTPRKFVLQPKRKLLVMIESDQGALTAEEREAARKECFEAAQAGE 822

Query: 692  -------------------------HF-------------VRLLDDQTFEFISTYPLDTF 713
                                     H+             +R+LD +T        L   
Sbjct: 823  NGTGSADQMENGGDDEDKDDPLSDEHYGYPKAESDKWASCIRVLDPRTSNTTCLLELQEN 882

Query: 714  EYGCSILSCSFSD-DSNVYYCVGTA---YVLPEENEPTKGRILVF-IVEDGK-LQLIAEK 767
            E   SI + +F D +      VGTA     LP+    T G I ++  VEDG+ L+L+ + 
Sbjct: 883  EAAFSICTVNFHDKEYGTLLAVGTAKGLQFLPKRTV-TAGFIHIYRFVEDGRSLELLHKT 941

Query: 768  ETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHH---GHILALYVQTR 824
            + +G   +L  F G+LLA I   ++LY     D G + L  +C +      I++++    
Sbjct: 942  QVEGVPLALCQFQGRLLAGIGPVLRLY-----DLGKKRLLRKCENKLFPNTIISIHAYR- 995

Query: 825  GDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTV 884
             D I VGD+ +S     Y+ +E  +   A D    W++A   +D D   G +   N++ V
Sbjct: 996  -DRIYVGDVQESFHYCKYRRDENQLYIFADDCVPRWLTASYHIDFDTMAGTDKFGNIYFV 1054

Query: 885  R------------------KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDS 926
            R                  K  +G  +    ++E + ++H+G+ V   +  SL+      
Sbjct: 1055 RLPQDVSDEIEEDPTGGRIKWEQGKLNGAPNKVEEIVQFHVGDVVTCLQKASLI------ 1108

Query: 927  DVGQIPTVIFGTVNGVIGVI-ASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNE 985
              G    ++FGTV G +G + A    +   F   L+ ++R+    + G +H  +RS    
Sbjct: 1109 -PGGGECIVFGTVMGSVGALHAFTSRDDVDFFSHLEMHMRQDHPPLCGRDHMAYRS---- 1163

Query: 986  KKTVDAKNFLDGDLIESF----LDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
                  K+ +DGDL E +    +DL R   DE+ +T      E+ K++EE+
Sbjct: 1164 -AYFPVKDVIDGDLCEQYPTLPMDLQRKIADELDRTPG----EILKKLEEV 1209


>gi|302807210|ref|XP_002985318.1| hypothetical protein SELMODRAFT_181612 [Selaginella moellendorffii]
 gi|300147146|gb|EFJ13812.1| hypothetical protein SELMODRAFT_181612 [Selaginella moellendorffii]
          Length = 1207

 Score =  239 bits (610), Expect = 6e-60,   Method: Compositional matrix adjust.
 Identities = 282/1179 (23%), Positives = 501/1179 (42%), Gaps = 175/1179 (14%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            +L V I+G + +L  FR  G  +D+L + ++  K  +L+++ + +               
Sbjct: 52   LLAVEIFGVVRSLAQFRLTGSHKDYLVVGSDSGKIVILEYNRDRNAFDPVHKETFGKSGC 111

Query: 61   RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL- 117
            R    GQ   +DP  R  +I       L  V+  DN  +L  +  +   +   +    + 
Sbjct: 112  RRIVPGQFLSVDPKGRAVMIAATEKQKLVYVLNRDNAARLTISSPLEAHKSHTVVYSLVG 171

Query: 118  --YGCAKPTIVVLYQDNKDA-------------RHVKTYEVALKDKDFVEGPWSQNNLDN 162
               G   P    +  D  DA             +H+  YE+ L     V   WS+  +DN
Sbjct: 172  VDCGFENPMFAAIELDYSDADQDATGQAALEAQKHLTFYELDLGLNHVVR-KWSEP-IDN 229

Query: 163  GADLLIPVPPPL---CGVLIIGEETIVYCS-ANAFKA-IPIRPSITKAYGR--VDADGSR 215
            GA++L+ VP       GVL+  +  ++Y + +   +A IP R  +    G   V A   R
Sbjct: 230  GANMLVTVPGGADGPSGVLVCADNFVIYKNQSQELRALIPRRADLPADRGVLVVSAATHR 289

Query: 216  ------YLLGDHAGLLHLLVITHEKEK--VTGLKIELLGETSIASTISYLDNAVVYIGSS 267
                  +LL    G +  + ++       VT LKI+      + S +  L +  ++  S 
Sbjct: 290  QKSVFFFLLQTEYGDVFKVTLSFNDTDMVVTELKIKYFDTFPVTSAMCVLKSGFLFAASE 349

Query: 268  YGDSQLIK---LNLQPDAKGSYVEV----------------------LERYVNLGPIVDF 302
            +G+  L +   +   PD + S   +                      ++   +L PI+D 
Sbjct: 350  FGNHALYQFQGIGDDPDVESSSATLEETEDGFKPIFFQPRKLKNLIQIDEGESLMPIMDM 409

Query: 303  CVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSLRSSTDDPF 361
             + +L  +   Q+    G     +LRI+R G+ ++E A  +L G+   +W+++ S  D F
Sbjct: 410  KIANLFEEETPQIFVACGRGPRSTLRILRPGLAVSEMAVSQLPGVPSAVWTVKKSVTDEF 469

Query: 362  DTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVS 421
            D ++VVSF++ T +L++   + +EE    GF   T +L       + L+QV    +R + 
Sbjct: 470  DAYIVVSFLNATLVLSIG--ETVEEVSDSGFLDTTPSLQVSLLGDDSLMQVYPNGIRHIR 527

Query: 422  STSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD-GILTEVKHAQLEYEI 480
            S  R   NEWK+P   ++    +N  QV++   GG L+Y E+   G L E++  ++  ++
Sbjct: 528  SLVR--INEWKTPGKKTIVKVGSNRMQVVIGLSGGELIYFEMDPTGQLMEIEKREMSGDV 585

Query: 481  SCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PD--LNLITKEHLGGEIIPRSVLL-- 535
            +CL I P+ E    S+  AVG + D ++RI SL P+  +++++ + +     P S+LL  
Sbjct: 586  ACLAIGPVPEGRQRSRFLAVGSY-DNTIRILSLDPNDCMHILSVQMVASP--PESLLLLE 642

Query: 536  ----------CAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSS 585
                        +    +L   L +G LL   ++M TG+L+  +   LG +   L   S 
Sbjct: 643  VQASTGGEDGAVYPASVFLNAGLQNGVLLRTEVDMVTGQLSGTRTRFLGLRAPKLFATSV 702

Query: 586  KNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGT 645
            +    +   S RP + Y      L + ++ + + +   F+S    + +       L + T
Sbjct: 703  RGRQALLCLSSRPWLGYIHQGHFLLTPLSYETLEYAASFSSDQCAEGVVAVAGEALRVFT 762

Query: 646  IDDI-QKLHIRSIPLGEHPRR-ICH---------------------QEQSRTFA-----I 677
            I+ + +  +  SI L   PR+ I H                     +  SR+ A     +
Sbjct: 763  IERLGETFNQTSIQLRYTPRKFILHPKRKFLVLIETDQGTFTTEEREHMSRSAAAQDNGM 822

Query: 678  CSLKNQSCAEESEM----------------HFVRLLDDQTFEFISTYPLDTFEYGCSILS 721
              + NQ  +EE  M                  +R+LD +         L   E   SI  
Sbjct: 823  MEIDNQEKSEEEPMPDEQYGYPKTDSRRWVSCIRVLDPKAAATTCLLELQENEAAFSICC 882

Query: 722  CSFSDDSNV--YYCVGTAYVLPEENEPTKGRILVFI-----VEDGK-LQLIAEKETKGAV 773
             +F D+ N+     VGTA  L  E  P +  +  FI     VEDG+ L+L+ +    G  
Sbjct: 883  VNFHDNKNLGTVLAVGTAKDL--EWWPKRRSMGGFIHIYRFVEDGRSLELVHKTPIDGVP 940

Query: 774  YSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY-VQTRGDFIVVGD 832
             +L  F G+LLA I Q +++Y     D G R+L  +C +      +  + + GD I VGD
Sbjct: 941  TALCQFQGRLLAGIGQILRIY-----DLGKRKLLRKCENKNFPNTITSIHSYGDRIYVGD 995

Query: 833  LMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSE--- 889
            + +S   + Y+ +E  +   A D +  W++A   +D D     +   NLF VR   +   
Sbjct: 996  IQESFHYVKYRRDENQLYAFADDSSPRWLTASLHIDFDTMAAGDKFGNLFFVRLPQDLSE 1055

Query: 890  --------GATDEERGRL-------EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTV 934
                    G    E+GRL       E + ++H+GE V   +  SL+        G   +V
Sbjct: 1056 EIEDDPTGGKIKWEQGRLNGAPNKVEEIIQFHVGEVVTCMQKASLI-------PGGGESV 1108

Query: 935  IFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKN 993
            I+GTV G +G +      E   F   L+ ++R+    + G +H  +RS          K+
Sbjct: 1109 IYGTVMGSVGALLPFSSREDVDFFSHLEMHMRQEHPPLCGRDHMAFRS-----AYFPVKD 1163

Query: 994  FLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
             +DGDL E +  L      +I++ ++ +  E+ K++E++
Sbjct: 1164 VIDGDLCEQYPTLPPDLQRKIAEELDRTPGEVMKKLEDI 1202


>gi|74212634|dbj|BAE31055.1| unnamed protein product [Mus musculus]
          Length = 261

 Score =  238 bits (608), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 114/204 (55%), Positives = 150/204 (73%), Gaps = 5/204 (2%)

Query: 3   DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
           +V +YG+IA +ELFRP GE++D LFI T +Y  C+L++    ES ++ITRA G+V DRIG
Sbjct: 54  EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 113

Query: 61  RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
           RP++ G IGIIDP+CR+IGL LYDGLFKVIP D   +  +AFNIRLEEL V+D+KFLYGC
Sbjct: 114 RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 173

Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
             PTI  +YQD +  RHVKTYEV+L++K+F +GPW Q N++  A ++I VP P  G +II
Sbjct: 174 QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232

Query: 181 GEETIVYCSANAFKAIPIRPSITK 204
           G+E+I Y   N  K + I P I K
Sbjct: 233 GQESITY--HNGDKYLAIAPPIIK 254


>gi|145348011|ref|XP_001418451.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144578680|gb|ABO96744.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 1196

 Score =  238 bits (607), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 282/1169 (24%), Positives = 482/1169 (41%), Gaps = 173/1169 (14%)

Query: 4    VPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPT 63
            V  +G   ++   R  G  +D+L + ++  +  VLQ+D E +  +            R  
Sbjct: 56   VEAFGIARSIATLRLAGANRDYLVLGSDSGRVVVLQFDKEKNAFVKVHQETYGKTGCRRI 115

Query: 64   DNGQIGIIDPDCRLIGLHLYDG--LFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGCA 121
              GQ    DP  R I L   +   L  V+  DN+  L    +  LE  +   I F + CA
Sbjct: 116  VPGQFLCADPKGRAICLGAVEKNKLVYVMNRDNEANL--TISSPLEANKSHTIVF-HMCA 172

Query: 122  ------KPTIVVLYQDNKDA-------------RHVKTYEVALKDKDFVEGPWSQNNLDN 162
                   P    +  D  DA             +H+  YE+ L     V   WS+  +DN
Sbjct: 173  LDCGLDNPIFAAIELDYGDADADPSGEAVAEAQKHLTYYELDLGLNHVVR-KWSEP-IDN 230

Query: 163  GADLLIPVP-----PPLCGVLIIGEETIVYCSAN---AFKAIPIRPSITKAYGRVDADGS 214
            GA+ LIPVP     P   GVL+  E  ++Y   N       IP R S+    G +    +
Sbjct: 231  GANHLIPVPGGSDGP--GGVLVCCENFVIYRHQNHDEVRAVIPRRNSLAADRGVLIVSSA 288

Query: 215  R--------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGS 266
                     +L     G ++ L I +  E V+ +K++         ++  L    ++  S
Sbjct: 289  SHRSKNSFFFLAQSEYGDIYKLTIDYSGETVSEVKVKYFDTIPPCVSLCVLKTGFLFAAS 348

Query: 267  SYGDSQLIKL------NLQPDAKGSYVEVLERYV-------------------NLGPIVD 301
             +G+  L +       +    +  S +E  E Y                    +L P++D
Sbjct: 349  EFGNHALYQFAGIGDDDAVESSSASLMETEEGYEPVFFDPRPLTNLYTIDTIDSLCPVLD 408

Query: 302  FCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTDDP 360
                +L  +   Q+ T  G     SLRI+R GI +NE A   L G    +++++ S+ + 
Sbjct: 409  MQAHNLTEEEMPQLYTLCGTGARSSLRILRQGIAMNELAMSPLPGQPNAIFTVKKSSSEE 468

Query: 361  FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLV 420
            +D++++VSF++ T +L++   D + E    G    T TL         L+Q+  G +R +
Sbjct: 469  YDSYIIVSFLNATLVLSIG--DTVSEVNDSGILGTTTTLQACLMSDESLLQIHPGGLRHI 526

Query: 421  SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD-GILTEVKHAQLEYE 479
             +  R   NEW++P    ++  T N+ QV++A  GG ++Y E+   G L EV+  +   +
Sbjct: 527  KADKR--INEWRTPGRKQISKCTCNSKQVVIALTGGEVIYFELDSAGQLIEVEKLETSGD 584

Query: 480  ISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PD--LNLITKEHLGGEIIPRSVLLC 536
            I+CLDI P+ E    ++  A+G + D +VR+ SL PD  L  +  + L G   P S+L+ 
Sbjct: 585  IACLDIGPVPEGALRNRFLAMGSY-DGTVRVMSLNPDDCLQTLAVQALKGSA-PSSLLVL 642

Query: 537  AFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTH 590
               G         L   + +G L+   ++  +G+L+D +   LG +   L   S +    
Sbjct: 643  QTAGTESTQGGLLLNVGMANGVLIRATIDQVSGQLSDMRMRFLGARAPKLVRTSVRGAPA 702

Query: 591  VFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDI- 649
            + A S RP + YS     + + ++   +  +C FNS   P+ +       L I +I+ + 
Sbjct: 703  LVALSSRPWLGYSEKGTFVLAPISYVPLEEVCSFNSEQCPEGVVAISNQTLRIASIERLG 762

Query: 650  QKLHIRSIPLGEHPRRICHQEQSRTFAI-----CSLK------------------NQSCA 686
            +  +  ++ L   PR +     ++  A+     C++                   N    
Sbjct: 763  ENFNQTTVKLRYTPRAMSANPDTKMVALIESDQCTVPVGEREGPEATPADEAPETNDEDE 822

Query: 687  EESEM-----------------HFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSN 729
            EE++M                   VR++D +  +      L   E   S+     +  + 
Sbjct: 823  EEAKMLPVEQFGAPKSSPGTWAACVRIVDPKEAKSTFVLELHKSEAALSLCHVFLTGPNE 882

Query: 730  VYYCVGTAYVLPEENEPTKGRI--LVFIVEDGK-LQLIAEKETKGAVYSLNAFNGKLLAA 786
            +   VGTA  L        G    L     DG+ L L+    T G V +L  + G LLA 
Sbjct: 883  LLLAVGTAVNLTFAPRNCDGGFIHLYRYGNDGRTLNLVHSTPTDGPVGALCGYKGHLLAG 942

Query: 787  INQKIQLY----KWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIY 842
            +N  +++Y    K +LR    R   +       I  L+    GD I VGD+ +SI  + Y
Sbjct: 943  VNNSLRIYDYGKKKLLRKVENRNFPN------FITTLHAA--GDRIYVGDVQESIHYVKY 994

Query: 843  KHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR----------------K 886
            K +EG+I   A D    +++A   LD D   GA+   N+F  R                K
Sbjct: 995  KADEGSIYIFADDTKPRYITATLPLDYDTLAGADKFGNIFVNRLPKDVSEDMDDDPTGGK 1054

Query: 887  N--SEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIG 944
            N  S+G  +    + E   + ++GE V     G+L         G I  +++GT  G IG
Sbjct: 1055 NIYSQGVLNGAPNKSETSAQTYIGETVCALTKGAL-------QPGGIEIIMYGTFMGGIG 1107

Query: 945  VIASLPHEQYL-FLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESF 1003
             +        + F   L+ ++R+    + G +H  +RS+         KN +DGDL E F
Sbjct: 1108 CLLPFSSRSEIEFFTHLEMHMRQEAPSIVGRDHMAFRSY-----YAPVKNVIDGDLCEQF 1162

Query: 1004 LDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
              L       I++ M+ +  E+ K++E++
Sbjct: 1163 GALPADVQRRIAEEMDRTPGEILKKLEQV 1191


>gi|224109600|ref|XP_002315251.1| predicted protein [Populus trichocarpa]
 gi|222864291|gb|EEF01422.1| predicted protein [Populus trichocarpa]
          Length = 1213

 Score =  238 bits (606), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 273/1190 (22%), Positives = 496/1190 (41%), Gaps = 191/1190 (16%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            +L V I+G I +L  FR  G  +D++ + ++  +  +L+++ E + L             
Sbjct: 52   VLSVEIFGAIRSLAQFRLTGAQKDYIVVGSDSGRIVILEYNKERNVLDKIHQETFGKSGC 111

Query: 61   RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL- 117
            R    GQ   +DP  R  +IG      L  V+  D   +L  +  +   +   +      
Sbjct: 112  RRIVPGQYLAVDPKGRAVMIGACEKQKLVYVLNRDTVARLTISSPLEAHKSHTICYSVCG 171

Query: 118  --YGCAKPTIVVLYQDNKDARHVKTYEVALKDK------------DFVEGPWSQNNLDNG 163
               G   P    +  D  +A    T + A + +            + V   WS+  +DNG
Sbjct: 172  VDCGFDNPIFAAIELDYSEADQDSTGQSASEAQKNLTFYELDLGLNHVSRKWSEQ-VDNG 230

Query: 164  ADLLIPVP---PPLCGVLIIGEETIVYCS---ANAFKAIPIRPSITKAYGRVDADGSR-- 215
            A++L+ VP       GVL+  E  ++Y +    +    IP R  +    G +    +   
Sbjct: 231  ANMLVTVPGGGDGPSGVLVCVENFVIYKNQGHPDVRAVIPRRADLPAERGVLIVSAATHK 290

Query: 216  ------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYG 269
                  +LL    G +  +++ HE +KV  LKI+      + S++  L +  ++  S +G
Sbjct: 291  QKSMFFFLLQTEYGDIFKVMLDHENDKVKELKIKYFDTIPVTSSMCVLKSGFLFAASEFG 350

Query: 270  DSQLIKLNL---QPDAKGSYVEVLE----------------------RYVNLGPIVDFCV 304
            +  L +      + D + S   ++E                      +  +L PI+D  V
Sbjct: 351  NHALYQFQAIGEEEDVEASSATLMETEEGFQPVFFQPRGLKNLVRIDQVESLMPIMDMKV 410

Query: 305  VDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSLRSSTDDPFDT 363
             +L  +   Q+ +  G     SLRI+R G+ I+E A  +L G+   +W+++ +  D FD 
Sbjct: 411  ANLFDEETPQIFSLCGRGPRSSLRILRPGLAISEMAVSQLPGVPSAVWTVKKNIYDEFDA 470

Query: 364  FLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSST 423
            ++VVSF + T +L++   + +EE    GF   T +L       + L+Q+    +R +   
Sbjct: 471  YIVVSFNNATLVLSIG--ETVEEVSDSGFLDTTPSLAVSLIGDDSLMQIHPNGIRHIRED 528

Query: 424  SRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTEVKHAQLEYEISC 482
             R   NEW++P   ++    +N  QV++A  GG L+Y E+   G L EV+  ++  +++C
Sbjct: 529  GR--INEWRTPAKRTIVKVGSNRLQVVIALSGGELIYFEVDMTGQLMEVEKHEMSGDVAC 586

Query: 483  LDINPIGENPSYSQIAAVGMWTDISVRIFSL-PD--LNLITKEHLGGEIIPRSVLLCAFE 539
            LDI P+ E    S+  AVG + D ++R+ SL PD  + +++ + +     P S+L    +
Sbjct: 587  LDIAPVPEGRQRSRFLAVGSY-DNTIRVLSLDPDDCMQILSVQSVSAP--PESLLFLEVQ 643

Query: 540  GIS------------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKN 587
                           +L   L  G L   +++M TG+L+D +   LG +   L + + + 
Sbjct: 644  ASIGGEDGADHPASLFLNAGLQTGVLFRTVVDMVTGQLSDSRSRFLGLRAPKLFSINVRG 703

Query: 588  TTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTID 647
               +   S RP + Y      L + ++ + + +   F+S    + +       L I TI+
Sbjct: 704  RRAMLCLSSRPWLGYIHQGHFLLTPLSYETLEYAASFSSDQCAEGVVSVAGDALRIFTIE 763

Query: 648  DI-QKLHIRSIPLGEHPRRICHQEQSRTFAIC------------SLKNQSCAEESEM--- 691
             + +  +  +IPL   PR+   Q + +   I                 + C E S M   
Sbjct: 764  RLGETFNETAIPLRYTPRKFVLQPKRKLLVIIESDQGAYTAEEREAAKKECFEASGMGEN 823

Query: 692  ------------------------------------HFVRLLDDQTFEFISTYPLDTFEY 715
                                                  +R+LD ++        L   E 
Sbjct: 824  GSASAEQMENGDDDDKDDPLSDEQYGYPKAESDKWVSCIRVLDPRSAATTCLLELQDNEA 883

Query: 716  GCSILSCSFSD-DSNVYYCVGTAYVLPEENEPTKGRILVFI-----VEDGK-LQLIAEKE 768
              S+ + +F D +      VGTA  L  +  P +  +  FI     V+DGK L+L+ + +
Sbjct: 884  AFSLCTVNFHDKEHGTLLAVGTAKGL--QFWPKRSLVTGFIHIYKFVDDGKSLELLHKTQ 941

Query: 769  TKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHH---GHILALYVQTRG 825
             +G   +L  F G+LLA I   ++LY     D G + L  +C +      I++++  T  
Sbjct: 942  VEGVPLALCQFQGRLLAGIGSVLRLY-----DLGKKRLLRKCENKLFPNTIVSIH--TYR 994

Query: 826  DFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR 885
            D I VGD+ +S     Y+ +E  +   A D    W+++   +D D   GA+   N++  R
Sbjct: 995  DRIYVGDIQESFHFCKYRRDENQLYIFADDSVPRWLTSSYHVDFDSMAGADKFGNIYFAR 1054

Query: 886  ------------------KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSD 927
                              K  +G  +    ++E + ++H+G+ VN  +  SL+       
Sbjct: 1055 LPQDVSDEIEEDPTGGKIKWEQGKLNGAPNKVEEIVQFHIGDVVNSLQKASLI------- 1107

Query: 928  VGQIPTVIFGTVNGVIGVIASLPHEQYL-FLEKLQTNLRKVIKGVGGLNHEQWRSFNNEK 986
             G    +I+GTV G +G +        + F   L+ +LR+    + G +H  +RS     
Sbjct: 1108 PGGGECIIYGTVMGSVGALLPFTSRDDVDFFSHLEMHLRQDHPPLCGRDHMSYRS----- 1162

Query: 987  KTVDAKNFLDGDLIESF----LDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
                 K+ +DGDL E F    LD  R   DE+ +T      E+ K++EE+
Sbjct: 1163 AYFPVKDVIDGDLCEQFPTLPLDAQRKIADELDRTPG----EILKKLEEV 1208


>gi|348667612|gb|EGZ07437.1| hypothetical protein PHYSODRAFT_565381 [Phytophthora sojae]
          Length = 1197

 Score =  235 bits (600), Expect = 9e-59,   Method: Compositional matrix adjust.
 Identities = 283/1177 (24%), Positives = 481/1177 (40%), Gaps = 181/1177 (15%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            ++   ++G +  L+ FR  G  +D+L + ++  K  VL+ +  S +   R          
Sbjct: 52   VISTQVFGVVRALQPFRLTGGERDYLVVGSDSGKIVVLEVNPSSGQFEARQSETYGKTGC 111

Query: 61   RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLK-----EAFNIRLEELQV-- 111
            R    GQ    DP  R  L+G      L  V+  D   +L      EA       L V  
Sbjct: 112  RRITPGQYLAADPKGRALLVGAVEKQQLVYVMNRDASSRLTISSPLEAHRSNAIHLGVVG 171

Query: 112  LDIKF---LYGC---------AKPTIVVLYQDNKDA-RHVKTYEVALKDKDFVEGPWSQN 158
            LD+ F   ++ C         A PT     Q  +DA + +  YE+ L   + V   WS+ 
Sbjct: 172  LDVGFENPIFACLELDYAEADADPT----GQAARDAVKSLVYYELDL-GLNHVTRRWSEQ 226

Query: 159  NLDNGADLLIPVPPPLCG---VLIIGEETIVY---------CSANAFKAIPIRPSITKAY 206
             +   A++L+ VP    G   VL++GE T+ Y         C+    +  P    I  A 
Sbjct: 227  VV-RSANMLVAVPGGGDGPGGVLVLGENTVQYKNEGHPELTCAIPRRQGEPRDIVIVSAA 285

Query: 207  GRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGS 266
                 D    LL    G L+ + + +    V  +KI+      +AS++      +++  S
Sbjct: 286  THKQRDLFFVLLQSELGDLYKISLDYSGNAVEEIKIQFFDTVPVASSMCITKTGLLFCAS 345

Query: 267  SYGDSQLIK----------------------LNLQPDAKGSYVEVLERYVNLGPIVDFCV 304
             + +  L +                      L+  P  K + +++     +L P+    V
Sbjct: 346  EFSNHYLFQFLSIGEGDDTAKCSSLAMDPTELSTFPLRKLTNLQLASSMPSLSPVTQLLV 405

Query: 305  VDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSLRSSTDDPFDT 363
             DL  +   Q+    G     SLR++R+G+ I E A+  L G+ K +W L+ S  DP+D 
Sbjct: 406  DDLANEQTPQMYALCGNSNRSSLRVLRHGLPITEMAASALPGVAKAVWCLKESYADPYDK 465

Query: 364  FLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSST 423
            ++VVSF  E   L + + + +EE    GF     +L       +  +Q+ +  +R V   
Sbjct: 466  YIVVSF--EDATLVLEVGETVEEVTQSGFLRDHGSLLVALLEDDSKLQIHANGIRHVPKF 523

Query: 424  SRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD-GILTEVKHAQLEYEISC 482
              +   EWK+P    +    AN+ QV+++  GG ++Y E+G  G L E     L +E+  
Sbjct: 524  --QPVTEWKAPGKKVIEHCAANSRQVVISLAGGEIIYFELGQSGELAEKGKLDLGFEVCS 581

Query: 483  LDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPRSVLLCAFEGI 541
            LD+  + E     Q  AVG W D +VRI SL P+     K  L     P ++ L   +  
Sbjct: 582  LDLGEVPEGRQRFQFMAVGSW-DNTVRILSLDPNDLFRQKSTLALTSHPHTLCLAQLQSE 640

Query: 542  S------------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTT 589
                         +L   L +G L   L++  T  LTD +   LGT P+ L   + +   
Sbjct: 641  PSTPDSEHSSQSLFLSIGLDNGVLQQSLIDPITANLTDSRSRFLGTNPVKLFRVAVEGKR 700

Query: 590  HVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSL-AIAKEGELTIGTIDD 648
             + A S R  + Y    +   + ++ + +S+   FNS   P  + A+  EG + I T+D 
Sbjct: 701  SILALSSRSWISYFHQTRRHLTPLSCELLSYASSFNSEQCPGGIVALTNEG-MKILTVDQ 759

Query: 649  I------QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQ-----------------SC 685
            +      QK ++R  P     + + H    R   I S  N+                 S 
Sbjct: 760  LGDTFNQQKCNLRYTPR----KAVVHAASRRLVVIESDHNEYGAAYKRQHGLRIPDIRSA 815

Query: 686  AEESEMH------------------------FVRLLDDQTFEFISTYPLDTFEYGCSILS 721
            AE+ + +                         VR++D  + + +    LD  E   SI S
Sbjct: 816  AEQDDEYEDEINEALLFPRGPLPAEKDKWASCVRIVDPVSCQTVVCEELDVDERARSIAS 875

Query: 722  CSFSD-DSNVYYCVGTA--YVLPEENEPTKGRILVF-IVEDGKLQLIAEKETKGAVYSLN 777
            C F D     +  VGT     L  +  P  G + V+ +VE  +L L+   E     +++ 
Sbjct: 876  CVFHDRGGEAFIIVGTVKKMQLHPQKAPAGGYLRVYRVVEGMQLVLVHTTEIDDIPHAMC 935

Query: 778  AFNGKLLAAINQKIQLY----KWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDL 833
             F G+LL ++ + +++Y    K MLR    R   S        + + ++  GD I   D+
Sbjct: 936  EFQGRLLVSVGRALRIYDLGKKKMLRKCENRNFPS--------ILVELKAAGDRIYASDM 987

Query: 834  MKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSE---- 889
             +S   + YK +E  +   A D    ++++  +LD D   GA+   N+F  R  SE    
Sbjct: 988  HESFHFVKYKKDENQLVIFADDCVPRFITSSVLLDYDTLCGADKFGNVFVSRLPSEVSDE 1047

Query: 890  -------------GATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIF 936
                         G  +    +LE V ++H+G+ V      SLV        G I  +I+
Sbjct: 1048 IDNPTANRILWDSGLLNGAPNKLEQVAQFHVGDVVTSMVRTSLVP-------GGIEAIIY 1100

Query: 937  GTVNGVIGVIASLPHEQYL-FLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFL 995
             T+ G IG +      Q + F   L+  +R+    + G +H  +RS+      +  KN  
Sbjct: 1101 ATIMGRIGALIPFTSRQDVDFYTHLEMYMRQEQPPLCGRDHLSYRSY-----YIPVKNVT 1155

Query: 996  DGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            DGDL E F  LS  +   +++ ++ +  E+ K++E++
Sbjct: 1156 DGDLCEQFSSLSVEKQASVAEDLDRTPAEVLKKLEDI 1192


>gi|310793065|gb|EFQ28526.1| CPSF A subunit region [Glomerella graminicola M1.001]
          Length = 1212

 Score =  234 bits (598), Expect = 1e-58,   Method: Compositional matrix adjust.
 Identities = 280/1173 (23%), Positives = 490/1173 (41%), Gaps = 166/1173 (14%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            +L   I+G I ++  FR  G  +D+L +AT+  +  ++++    +      +        
Sbjct: 59   LLSHDIFGIIRSMAAFRLAGSNKDYLILATDSGRITIIEYIPAQNRFQRLHLETFGKSGV 118

Query: 61   RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL- 117
            R    G+    DP  R  LI     + L  V+  +++ +L  +  +   +  VL +  + 
Sbjct: 119  RRVIPGEYLACDPKGRACLIASVEKNKLVYVLNRNSQAELTISSPLEAHKPGVLVLSMVA 178

Query: 118  --YGCAKPTIVVLYQDNKDARHVKTYEVALKDK------------DFVEGPWSQNNLDNG 163
               G A P    L  +  +A    T E A + +            + V   WS+  +D  
Sbjct: 179  LDVGYANPVFAALEIEYTEADQDPTGEAAREAETQLVYYELDLGLNHVVRKWSEP-VDPT 237

Query: 164  ADLLIPVP-----PPLCGVLIIGEETIVYCSAN--AFKA-IPIRPSITKAYGRVD--ADG 213
            A LL  VP     P   GVL+ GEE I Y  +N  AF+  IP R   T+   R      G
Sbjct: 238  ASLLFQVPGGQDGP--SGVLVCGEENITYRHSNQEAFRVPIPRRRGATEDPSRKRHVVSG 295

Query: 214  SRYLLGDHAGLLHLLVITHEKE------------------KVTGLKIELLGETSIASTIS 255
              + L   AG    L+ T + +                  +V  LKI+      ++S++ 
Sbjct: 296  VMHKLKGSAGAFFFLIQTEDGDLFKATIDMVEDADGNPTGEVKRLKIKYFDTIPVSSSLC 355

Query: 256  YLDNAVVYIGSSYGDSQLIKL---------------NLQPDAKGSY------------VE 288
             L +  +Y  S +G+ Q  +                +   D K  Y            + 
Sbjct: 356  ILKSGFLYAASQFGNHQFYQFEKLGDDDEELEFSSDDFPTDPKAGYDAVYFHPRPLENLA 415

Query: 289  VLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI- 347
            ++E   ++ P++D  V +L  +   Q+ T  G     + R++++G+ +NE  + EL GI 
Sbjct: 416  LVESIDSMNPLLDCKVANLTGEDAPQIYTACGNGARSTFRMLKHGLEVNEIVASELPGIP 475

Query: 348  KGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYN 407
              +W+L+ +  D +D ++V+SF + T +L++   + +EE    GF +   TL       +
Sbjct: 476  SAVWTLKLNRGDQYDAYIVLSFTNGTLVLSIG--ETVEEVSDSGFLTSVPTLAAQLLGED 533

Query: 408  QLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDG 466
             L+QV    +R +        NEW +P   S+  AT N  QV +A   G +VY E+ GDG
Sbjct: 534  GLIQVHPKGIRHIRQGQV---NEWAAPQHRSIVAATTNEHQVAVALSSGEIVYFEMDGDG 590

Query: 467  ILTEVKHAQLEY-EISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHL 524
             L E    +  +  ++CL +  + E    S   AVG   D +VR+ SL P+  L +K   
Sbjct: 591  SLAEYDEKKEMFGTVTCLSLGEVPEGRLRSSFLAVGC-DDSTVRVLSLDPESTLESKSVQ 649

Query: 525  GGEIIPRSVLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPI 578
                 P S+ + A +  S      YL   L  G  L  +L+  TGELTD ++  LG +P+
Sbjct: 650  ALTAPPSSLAIIAMDDSSSGGSTLYLHIGLHSGVYLRTVLDEVTGELTDTRQKFLGPKPV 709

Query: 579  TLRTFSSKNTTHVFAASDRPTVIYSS--NKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIA 636
             L   + +  T V   S RP + YS    +  L + +N  ++  +  F+S    + +   
Sbjct: 710  RLFQVTVQGRTCVIGLSSRPWLGYSDPITRGFLVTPLNYVDLEWVWNFSSEQCEEGVVGI 769

Query: 637  KEGELTIGTIDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFV- 694
            +   L I  I+++   +  +SIPL   PRR+    +   F      N +   +     + 
Sbjct: 770  QGQSLRIFAIENLGDTITQKSIPLSYTPRRLLKHPEYPMFYTIEADNNTLPPDLRAKLIA 829

Query: 695  ---------RLLDDQTFEF-----------------------ISTYPLDTFEYGCSILSC 722
                     R+L    F +                       + T  LD  E   S    
Sbjct: 830  EPGVVNGDARILPPDEFGYPKGKGRWASCISVIDPLAEDQRVLQTIDLDNNEAAVSAAIV 889

Query: 723  SF-SDDSNVYYCVGTAY-VLPEENEPTKGRILVF-IVEDG-KLQLIAEKETKGAVYSLNA 778
            SF S DS  +  VGT   ++    + ++G I V+   EDG +L+ I + + +    +L  
Sbjct: 890  SFASQDSESFLIVGTGKDMVVNPRQFSEGYIHVYRFGEDGHELEFIHKTKVEEPPSALLG 949

Query: 779  FNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSIS 838
            F G+LLA I + +++Y   LR    R+ Q++       L + + T+G  I+VGD+   I+
Sbjct: 950  FQGRLLAGIGKTLRIYDLGLR-QMLRKAQADVTPQ---LIVSLSTQGSRIIVGDVQHGIT 1005

Query: 839  LLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDEE 895
             ++YK     +     D  + W++   ++D +   G +   N+F VR   K ++ A DE 
Sbjct: 1006 YVVYKPTTNKLIPFVDDTVSRWVTCTTMVDYESVAGGDKFGNMFLVRCSEKATQEADDES 1065

Query: 896  RG---------------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVN 940
             G               RL ++   +  +        SLV       VG    +++  +N
Sbjct: 1066 GGLHLINTRDYLHGTPHRLSLLAHSYTQDVPTSITKTSLV-------VGGQDVLLWSGIN 1118

Query: 941  GVIGV-IASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDL 999
            G IGV I  +  E   F + L+ ++R     + G +H  +R +      V  K  +DGDL
Sbjct: 1119 GTIGVFIPFVTREDVDFFQNLEQHMRTEDAPLAGRDHLMYRGY-----YVPVKGVIDGDL 1173

Query: 1000 IESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
             E +  L   +   I+  ++ SV E+ +++ ++
Sbjct: 1174 CERYTLLPSEKKQMIAGELDRSVREIERKISDI 1206


>gi|308805058|ref|XP_003079841.1| spliceosomal-like protein (ISS) [Ostreococcus tauri]
 gi|116058298|emb|CAL53487.1| spliceosomal-like protein (ISS) [Ostreococcus tauri]
          Length = 1166

 Score =  234 bits (598), Expect = 1e-58,   Method: Compositional matrix adjust.
 Identities = 277/1161 (23%), Positives = 473/1161 (40%), Gaps = 193/1161 (16%)

Query: 7    YGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDNG 66
            +G + ++   R  G  +D+L + ++  +  +LQ+  E +  +            R    G
Sbjct: 59   FGVVRSICALRLAGANRDYLCVGSDSGRVVILQFKKERNAFVKVHQETFGKSGCRRVVPG 118

Query: 67   QIGIIDPDCRLIGLHLYD--GLFKVIPFDNKGQLK-----------------EAFNIRLE 107
            Q    DP  R I L   +   L  V+  DN+  L                   A +  L+
Sbjct: 119  QFVCADPKGRAICLGAMERSKLVYVMNRDNEANLTISSPLEANKSHAITYHMCALDCGLD 178

Query: 108  ELQVLDIKFLYGCAK--PTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGAD 165
                  I+  YG     PT  V+ +  K   H+  YE+ L     V   WS+  +DNGA+
Sbjct: 179  NPVFAAIELDYGEVDDDPTGEVVAETQK---HLTYYELDLGLNHVVR-KWSEP-IDNGAN 233

Query: 166  LLIPVP-----PPLCGVLIIGEETIVYCSAN---AFKAIPIRPS--------ITKAYGRV 209
             LIPVP     P   GVL+  E  I+Y   +       IP R S        I  + G  
Sbjct: 234  HLIPVPGGSDGPG--GVLVCCENFIIYRHQDHEEVRAVIPRRTSLPGDRGVLIVSSAGHK 291

Query: 210  DADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYG 269
                  +L     G ++ L I +  +KV+ +K++         ++  L    ++  S +G
Sbjct: 292  SKKSFFFLAQSEYGDIYKLSIEYSGDKVSEVKVKYFDTIPPCVSMCVLKTGFLFAASEFG 351

Query: 270  DSQLIKLNLQPD------AKGSYVEVLERYV-------------------NLGPIVDFCV 304
            +  L +     D      +  S +E  + Y                    +L PI+D   
Sbjct: 352  NHALYQFAGIGDDDAVESSSASLMETEQGYEPVFFDPRRLTNLYPIDKIDSLCPILDMQA 411

Query: 305  VDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTDDPFDT 363
             +L  +   Q+ T  G     SLRI+R GIG+NE A   L G    +++++  + D +D 
Sbjct: 412  HNLTEEDTPQLYTLCGTGARSSLRILRQGIGMNELAMSSLPGQPNAIFTVKKKSSDQYDG 471

Query: 364  FLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSST 423
            ++VVSF++ T +LA+   D + E    G    T TL       + L+QV +G +R + + 
Sbjct: 472  YIVVSFLNATLVLAIG--DTVTEVSDSGILGTTMTLQVCLMNDDSLLQVHAGGLRHIKAD 529

Query: 424  SRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD-GILTEVKHAQLEYEISC 482
             R   NEW++P    ++  T N+ QV++A  GG ++Y E+   G L EV+  +   +I+C
Sbjct: 530  KR--INEWRTPGRKQISKCTCNSKQVVIALTGGEVIYFELDSAGQLIEVEKLETNGDIAC 587

Query: 483  LDINPIGENPSYSQIAAVGMWTDISVRIFSLPD---LNLITKEHLGGEIIPRSVLLC--- 536
            +DI P+ E    S+  A+G + D +VR+ SL     L  +  + L G   P S+L+    
Sbjct: 588  VDIAPVPEGALRSRFLAMGSY-DGTVRVMSLNSDDCLQTLAVQALKGST-PSSLLILQTA 645

Query: 537  ---AFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFA 593
               + +G   L   + +G L+   L+  +G+L+D +   LGT+   L   S +  + + A
Sbjct: 646  GTESLQGSLLLNVGMANGVLMRATLDQVSGQLSDMRVRFLGTRAPKLVRTSVRGESALVA 705

Query: 594  ASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDI-QKL 652
             S RP + YS     + S ++   +  +C F+S A P+ +       L I +++ + +  
Sbjct: 706  LSSRPWLGYSEKGTFVLSPISYVALEEVCSFSSEACPEGVVAISNQTLRIASVERLGENF 765

Query: 653  HIRSIPLGEHPRRICHQEQSRTFAI-----CSL---------------KNQSCAEESEMH 692
            H  +I L   PR +     ++  A+     CS+                +++  +E E++
Sbjct: 766  HQTTIKLRYTPRAMSANPDTKMVALIESDQCSVPVGERESPEATSADEASEANGDEDELN 825

Query: 693  F-------------------VRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
                                VR++D +  +      L   E   SI     +  + +   
Sbjct: 826  MLPVEQYGAPKSSPGTWAACVRIIDPKDAKSQYVLELGKNESAISICHVYLTGPNELLLA 885

Query: 734  VGTAYVL---PEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQK 790
            VGTA  L   P   +     +  +  +   L L+    T+G V +L              
Sbjct: 886  VGTAQNLTFAPRNCDGGFIHLYRYGTDSKTLSLVHSTPTEGPVGAL-------------- 931

Query: 791  IQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIE 850
                               CG+ GH+LA       D I VGD+ +SI  + YK +EG++ 
Sbjct: 932  -------------------CGYRGHLLAXXXXXXXDRIYVGDVQESIHFVKYKADEGSMY 972

Query: 851  ERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR----------------KN--SEGAT 892
              A D    +M+A   LD D   GA+   N+F  R                KN  S+G  
Sbjct: 973  IFADDTKPAYMTAALPLDFDTLAGADKFGNIFVNRLPKDISEDMDEDPTGGKNIYSQGVL 1032

Query: 893  DEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHE 952
            +    + E      +GE V     G+L         G I  +++GT  G IG +      
Sbjct: 1033 NGAPNKSETCARTFIGETVCALTKGAL-------QPGGIDIIMYGTFLGGIGALVPFQTR 1085

Query: 953  QYL-FLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRM 1011
              + F   L+ ++R+    + G +H  +RS+         KN +DGDL E F  L+    
Sbjct: 1086 SEIDFFITLEMHMRQEASSIVGRDHMAFRSY-----YAPVKNVIDGDLCEQFGALAPEVQ 1140

Query: 1012 DEISKTMNVSVEELCKRVEEL 1032
              I++ M+ +  E+ KR+E++
Sbjct: 1141 RRIAEDMDRTPGEILKRLEQI 1161


>gi|409045147|gb|EKM54628.1| hypothetical protein PHACADRAFT_210427 [Phanerochaete carnosa
            HHB-10118-sp]
          Length = 1213

 Score =  234 bits (597), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 281/1182 (23%), Positives = 486/1182 (41%), Gaps = 188/1182 (15%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
            ++G I +L  FR  G  +D+  + ++  +  +L +D ++S  +            R    
Sbjct: 58   VFGSIRSLTAFRLTGGTKDYAIVGSDSGRIVILDYDPKTSSFVKVHQETYGKSGARRIVP 117

Query: 66   GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLK-----EAF--NIRLEELQVLDIKF 116
            GQ    DP  R  +I       L  ++  D    L      EA   N  +  +  LD+ F
Sbjct: 118  GQYLATDPKGRSVMIAATEKAKLVYILNRDAAANLTISSPLEAHKNNAIIHHIVGLDVGF 177

Query: 117  LYGCAKPTIVVLYQD-------------NKDARHVKTYEVALKDKDFVEGPWSQNNLDNG 163
                  PT   L  D             N   + +  YE+ L     V   WS+   D  
Sbjct: 178  ----ENPTFAALEVDYSESDQDPTGEAFNNAEKLLTYYELDLGLNHVVRK-WSEPT-DPR 231

Query: 164  ADLLIPVP------------PPLCGVLIIGEETIVYCSANAFK---AIPIRPS------- 201
            A+LL+ VP            P   GVL+  E+ I+Y   +A +    IP R         
Sbjct: 232  ANLLVQVPGGQLASSDRYDGP--SGVLVCCEDHIIYRHMDAPQHRVPIPRRRHPLEDKER 289

Query: 202  ---ITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLD 258
               IT A          +LL    G L+ + I HE++ V  LKI+      +AS++  L 
Sbjct: 290  GLIITAAVMHKMKGAFFFLLQSEEGDLYKVTIEHEEQDVKALKIKYFDTVPVASSLCILK 349

Query: 259  NAVVYIGSSYGDSQLIK---------------------------LNLQP---DAKG-SYV 287
            +  +++ + +G+  L +                           ++L P     +G   +
Sbjct: 350  SGFLFVAAEFGNHHLYQFQKLGDDDNEPEFSSTMYPNFGMANPAISLPPAYFSPRGLDNL 409

Query: 288  EVLERYVNLGPIVDFCVVDL-ERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG 346
             +++   +L P+VD  V++L       Q+    G     SLR++R+G+ + E  S +L G
Sbjct: 410  TLVDELESLDPVVDAKVLNLLPNSDSPQIFAACGRGGRSSLRMLRHGLEVEEVVSSDLPG 469

Query: 347  I-KGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAI 405
            I   +W+ +   DDPFD+++++SF++ T +L++   + +EE +  GF S + TL      
Sbjct: 470  IPNAVWTTKLREDDPFDSYIILSFVNGTLVLSIG--ETIEEVQDTGFLSSSPTLAVQQLG 527

Query: 406  YNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG- 464
             + L+QV    +R V    R   NEW+ P G ++  AT N  QV++A     LVY E+  
Sbjct: 528  ADALLQVHPHGIRHVLVDRRV--NEWRVPQGKTIVCATTNRRQVVVALSSAELVYFELDL 585

Query: 465  DGILTEVKHAQ-LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEH 523
            +G L E +  + +   +  L +  + E    +   AVG   D +VRI SL   + +    
Sbjct: 586  EGQLNEYQERKAMGSTVLALSVGEVPEGRQRTPFLAVGC-EDQTVRIVSLDPESTLETIS 644

Query: 524  LGGEIIPRSVLLCA---------FEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLG 574
            L     P S +  A          +  +++   L +G LL  +L+   G+LTD +   LG
Sbjct: 645  LQALTAPPSAICIADMLDAGINKVQPTTFVNIGLQNGVLLRTVLDPINGQLTDTRTRFLG 704

Query: 575  TQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLA 634
            T+PI L     +    + A S R  + Y+    + ++ +    + +   F++   P+ L 
Sbjct: 705  TRPIKLVRVQIQRNPGILALSSRSWLNYTHQNLMHFTPLIFDNLDYAWSFSAELCPEGLI 764

Query: 635  IAKEGELTIGTIDDI-QKLHIRSIPLGEHPRR-ICHQ----------------EQSRTFA 676
                  L I  I  +  KL   SIPL   PR+ I H                 E+++   
Sbjct: 765  GISGSVLRIFQIPKLGTKLKQDSIPLSYTPRKMIVHPTNGFFYMIEGDHRVMGEEAKQKK 824

Query: 677  ICSLKNQSCAEESEM----------------------HFVRLLDDQTFEFISTYPLDTFE 714
            I  L+ Q    + EM                        V  +D QT  FI   PLD  E
Sbjct: 825  IEELRQQGKQVDEEMLNLPPEVFGRPKAPAGTWASGIRIVSPVDAQTVNFI---PLDNNE 881

Query: 715  YGCSILSCSFS-DDSNVYYCVGTA---YVLPEENEPTKGRILVFIVEDGKLQLIAEKETK 770
               SI    F+     +   VGTA   ++ P        R   F+ +   ++L+ + ET 
Sbjct: 882  AAFSIAVVPFTARGGELTLVVGTAQDTFLAPRSCTSGFLRTYRFLDDGRDIELLHKTETN 941

Query: 771  GAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY-VQTRGDFIV 829
                ++ AF G+L A I + +++Y     D G ++L  +        A+  + T+G  I+
Sbjct: 942  DVPLAIMAFQGRLAAGIGKALRIY-----DIGKKKLLRKVESKNFSNAIVTLNTQGSRII 996

Query: 830  VGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---K 886
            VGD+ +SI   +YK  E  +   A D    W++AV ++D +     +   N+F  R   K
Sbjct: 997  VGDMQESIFYAVYKPPENRLLIFADDAQPRWITAVTMIDYNTVAAGDRFGNVFVNRLDPK 1056

Query: 887  NSEGATDEERG---------------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQI 931
             S+   D+  G               +  ++  +H+G+ V      SLV        G  
Sbjct: 1057 ISDQVDDDPTGAGILHEKGILMGAPHKTAMIAHFHVGDIVTSIHKVSLV-------AGGR 1109

Query: 932  PTVIFGTVNGVIGVIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVD 990
              +++  ++G +G++   +  E   F+  L+ ++R     + G +H  WR +      V 
Sbjct: 1110 ELLLYTGLHGTVGILVPFVSKEDVDFISTLEQHMRTEQLSLVGRDHLAWRGY-----YVP 1164

Query: 991  AKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
             K  +DGDL E F  L  ++   I+  ++ +V E+ K++E+L
Sbjct: 1165 VKAVVDGDLCEMFARLPASKQSSIATELDRTVGEVLKKLEQL 1206


>gi|389740093|gb|EIM81285.1| hypothetical protein STEHIDRAFT_86633 [Stereum hirsutum FP-91666 SS1]
          Length = 1213

 Score =  234 bits (596), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 270/1182 (22%), Positives = 485/1182 (41%), Gaps = 188/1182 (15%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
            ++G I +L  FR  G  +D+  + ++  +  +L++D ++S  +            R    
Sbjct: 58   VFGSIRSLAAFRLTGGTKDYAIVGSDSGRIVILEYDPKTSSFVKVHQETYGKSGARRIVP 117

Query: 66   GQIGIIDPDCRLIGLHLYDG--LFKVIPFDNKGQLKEAFNIR-------LEELQVLDIKF 116
            GQ    DP  R + +   +   L  ++  D    L  +  +        +  +  LD+ F
Sbjct: 118  GQYLATDPKGRSVMVAAVEKAKLVYILNRDAAANLTISSPLEAHKNAAIIHHIVGLDVGF 177

Query: 117  LYGCAKPTIVVLYQD-------------NKDARHVKTYEVALKDKDFVEGPWSQNNLDNG 163
                  P    L  D             N   + +  YE+ L     V   WS+   D  
Sbjct: 178  ----ENPMFAALEVDYTESDQDPTGEAFNNADKVLTYYELDLGLNHVVRK-WSEPT-DPR 231

Query: 164  ADLLIPVP------------PPLCGVLIIGEETIVYCSANAFK---AIPIR------PS- 201
            A+LL+ VP            P   GVL+  E+ I+Y   +  +    IP R      P  
Sbjct: 232  ANLLVQVPGGQLASSDRYDGP--SGVLVCCEDHIIYRHTDVPQHRVPIPRRRHPLEDPDR 289

Query: 202  ---ITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLD 258
               IT A          +LL    G L+ + I H++E+V  +KI+      +AS++  L 
Sbjct: 290  GLIITSAVMHKMKGAFFFLLQSEEGDLYKVTIEHQEEEVKAVKIKYFDTVPVASSLCILK 349

Query: 259  NAVVYIGSSYGDSQLIKL------NLQPDAKGS-------------------------YV 287
            +  +++ S +G+ QL +       + +P+   +                          +
Sbjct: 350  SGFLFVASEFGNHQLYQFQKLGDDDNEPEFSSTDYPSFGMADPSAALPRAHFRPHPLDNL 409

Query: 288  EVLERYVNLGPIVDFCVVDL-ERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG 346
             + +   +L P++D  V++L       Q+ T  G     + R +R+G+ + E  S +L G
Sbjct: 410  ALADELESLAPVIDSKVMNLLPNSDTPQIFTACGRGARSTFRTLRHGLEVEETVSSDLPG 469

Query: 347  I-KGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAI 405
            I   +W+ ++  D P+D+++++SF++ T +L++   + +EE +  GF S   TL      
Sbjct: 470  IPNAVWTTKTREDAPYDSYIILSFVNGTLVLSIG--ETIEEVQDTGFLSSAPTLAVQQIG 527

Query: 406  YNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG- 464
             + L+QV    +R V +  R   NEW+ P G ++  AT N  QV++A     LVY E+  
Sbjct: 528  ADALLQVHPHGIRHVLADRRV--NEWRVPQGKTIVSATTNKRQVVVALSSAELVYFELDL 585

Query: 465  DGILTEVKHAQ-LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEH 523
            DG L E +  + +   +  L I  + E    +   AVG   D +VRI SL   + +    
Sbjct: 586  DGQLNEYQDRKAMGSTVLALSIGEVPEGRQRTPYLAVGC-EDQTVRIISLDPESTLETIS 644

Query: 524  LGGEIIPRSVLLCA---------FEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLG 574
            L     P S L  A          +   ++   L +G LL  +L+   G+LTD +   LG
Sbjct: 645  LQALTAPPSALCIADMLDAGINKSQPTMFVNIGLQNGVLLRTVLDPVNGQLTDTRTRFLG 704

Query: 575  TQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLA 634
            T+P+ L     ++   + A S R  + Y+    + ++ +    +     F++   P+ L 
Sbjct: 705  TRPVRLIRVQIQHQPAILALSSRTWLNYTHQNLMHFTPLIFDNLDFAWSFSAELCPEGLI 764

Query: 635  IAKEGELTIGTIDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAI---------------- 677
                  L I  +  +  KL   SIPL   PR+      +  F I                
Sbjct: 765  GITGSVLRIFQMPKLGTKLKQDSIPLSYTPRKFASHPSNSYFYIIEADHRVTGPDATAKK 824

Query: 678  -CSLKNQSCAEESEM-------------------HFVRLLDDQTFEFISTYPLDTFEYGC 717
               L++Q    + EM                     +R++D    + ++  P+D  E   
Sbjct: 825  VAELRSQGKMVDDEMLELPAEVFGRSKAPAGTWASCIRIVDPAEGKSVAEIPIDNNEAAF 884

Query: 718  SILSCSFSDDSNVYY-CVGTA---YVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAV 773
            S+    FS  +N Y+  VGTA   ++ P        R   F+ +   L+L+ + ET    
Sbjct: 885  SLAIVPFSVRNNEYHLVVGTAQDTFLAPRSCTSGFLRTYKFVDDGAGLELLHKTETDDIP 944

Query: 774  YSLNAFNGKLLAAINQKIQLY----KWMLRDDGTRELQSECGHHGHILALYVQTRGDFIV 829
             SL AF G+L+A I + +++Y    K +LR   ++   S          + + T+G  I+
Sbjct: 945  MSLLAFQGRLVAGIGKALRIYDIGKKKLLRKAESKTFASAI--------ISLNTQGSRII 996

Query: 830  VGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---K 886
            VGD+ +SI+  +YK  E  +   A D  A W++   ++D       +   N+F  R   K
Sbjct: 997  VGDMQESIAYAVYKAPENKLLVFADDTQARWVTCSTMVDYTTVAAGDRFGNIFINRLDSK 1056

Query: 887  NSEGATDEERG---------------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQI 931
             S+   D+  G               +  ++  +H+G+ V      SLV        G  
Sbjct: 1057 VSDQVDDDPTGAGILHEKGILMGAPHKTAMLAHFHVGDLVTSIHKVSLV-------AGGR 1109

Query: 932  PTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVD 990
              +++  ++G IG++  L   E   F+  L+ ++R     + G +H  WR +      V 
Sbjct: 1110 EVLLYTGLHGTIGMLVPLVSKEDVDFISTLEQHIRTEQTSLVGRDHLAWRGY-----YVP 1164

Query: 991  AKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
             K  +DGDL E+F  L   +   I+  ++ +V E+ K++++L
Sbjct: 1165 VKAVVDGDLCETFARLPAAKQSMIAGELDRTVSEVLKKLDQL 1206


>gi|169848339|ref|XP_001830877.1| pre-mRNA-splicing factor RSE1 [Coprinopsis cinerea okayama7#130]
 gi|116508046|gb|EAU90941.1| pre-mRNA-splicing factor RSE1 [Coprinopsis cinerea okayama7#130]
          Length = 1213

 Score =  233 bits (595), Expect = 4e-58,   Method: Compositional matrix adjust.
 Identities = 265/1174 (22%), Positives = 486/1174 (41%), Gaps = 174/1174 (14%)

Query: 7    YGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDNG 66
            +G + +L  FR  G  +D+  + ++  +  +L +D +++  +            R    G
Sbjct: 59   FGSVRSLAAFRLTGGTKDYAILGSDSGRIVILDYDPKTTSFVKLHQETFGKSGARRIVPG 118

Query: 67   QIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YGCA 121
            Q    DP  R  +IG      L  ++  D    L  +  +   +   +    +    G  
Sbjct: 119  QYLATDPKGRAVMIGAMEKAKLVYILNRDAAANLTISSPLEAHKNSAIIHHIVGVDVGFE 178

Query: 122  KPTIVVLYQD-------------NKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLI 168
             P    L  D             N   + +  YE+ L     V   WS+   D  A+LL+
Sbjct: 179  NPLFAALEVDYSESDQDPTGEAFNNAEKMLTYYELDLGLNHVVRK-WSEPT-DPRANLLV 236

Query: 169  PVP------------PPLCGVLIIGEETIVYCSANAFK---AIPIR------PS----IT 203
             VP            P   GVL+  E+ I+Y   +A +    IP R      P     IT
Sbjct: 237  QVPGGQLASSDRYDGP--SGVLVCCEDHIIYRHMDAPQHRVPIPRRRHPLEDPERGVIIT 294

Query: 204  KAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVY 263
             A          +LL +  G L  + I HE+E V  LKI+      +AS++  L +  ++
Sbjct: 295  AAVMHKMKGAFFFLLQNEDGDLFKVTIEHEEEDVKALKIKYFDTVPVASSLCILKSGFLF 354

Query: 264  IGSSYGDSQLIKLNL------QPDAKG-SY------------------------VEVLER 292
            + S +G+  L +         +P+    SY                        + + + 
Sbjct: 355  VASEFGNHHLYQFQKLGDDDDEPEVTSTSYPSFGMADPTAALPRAYFKPRPLDNLALADE 414

Query: 293  YVNLGPIVDFCVVDL-ERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGM 350
              +L PI+D  V++L       Q+    G     +LR +R+G+ + E  S EL GI   +
Sbjct: 415  LESLDPILDSKVLNLLPNSDAPQIFAACGRGARSTLRTLRHGLEVEESVSSELPGIPNAV 474

Query: 351  WSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLV 410
            W+ + + +DP+D+++++SF++ T +L++   + +EE +  GF S + T+       + L+
Sbjct: 475  WTTKKTEEDPYDSYIILSFVNGTLVLSIG--ETIEEVQDTGFLSSSPTIAVQQIGADALL 532

Query: 411  QVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILT 469
            QV    +R V +  R   NEW+ P G ++  AT N  QV++A     LVY E+  DG L 
Sbjct: 533  QVHPHGIRHVLADRRV--NEWRVPQGKTIVTATTNKRQVVVALSSAELVYFELDLDGQLN 590

Query: 470  EVKHAQ-LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGE 527
            E +  + +   +  L I  + E    +   AVG   D +VRI SL P+  L T       
Sbjct: 591  EYQDRKAMGSTVLALSIGEVPEGRQRTPYLAVGC-EDQTVRIISLDPETTLETLSLQALT 649

Query: 528  IIPRSVLLCAFEGIS--------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPIT 579
              P S+ +      S        ++   L +G LL  +L+   G+LTD +   LGT+PI 
Sbjct: 650  APPSSICIADMLDASINKSQPTMFVNIGLQNGVLLRTVLDPINGQLTDTRTRFLGTRPIK 709

Query: 580  LRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEG 639
            L          + A S R  + Y+    + ++ +    + +   F++   P+ L      
Sbjct: 710  LIRVPVHKNPAILALSSRSWINYTHQNMMHFTPLIYDNLDYAWSFSAELSPEGLIGIAGN 769

Query: 640  ELTIGTIDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESE-------- 690
             L I TI  +  KL   ++PL   PR++    ++  F +    ++  +EE+         
Sbjct: 770  VLRIFTIPKLGSKLKQDTLPLSYTPRKLITHPENNYFYLIESDHRVYSEEATKAKLDELQ 829

Query: 691  ----------------------------MHFVRLLDDQTFEFISTYPLDTFEYGCSILSC 722
                                           +R++D    + ++ +PLD  E   SI   
Sbjct: 830  KKGKKIDEEIISLPPSEFGRPKAPAGTWASNIRIIDPVENKTVAVFPLDNNEAAFSIAIV 889

Query: 723  SFS-DDSNVYYCVGTA---YVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNA 778
             FS  +  ++  VGTA    V P   E    R   F      L+L+ + ET     +L A
Sbjct: 890  PFSARNGELHLVVGTAKDTTVSPRTCESGFLRTYKFTENGTGLELLHKTETDDVPMALLA 949

Query: 779  FNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY-VQTRGDFIVVGDLMKSI 837
            F G+L A + + +++Y     D G ++L  +  +     A+  + T+G  I+VGD+ +S+
Sbjct: 950  FQGRLAAGVGKALRIY-----DIGKKKLLRKVENKSFTTAIVTLTTQGSRILVGDMQESV 1004

Query: 838  SLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDE 894
              ++YK  E  +   A D    W++A+ ++D +  +  +   N+F  R   K S+   ++
Sbjct: 1005 QYVVYKQPENRLLTFADDTQPRWVTAITMVDYNTIVAGDRFGNIFVNRLDSKVSDQVDED 1064

Query: 895  ERG---------------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTV 939
              G               + +++  +H+G+ +      SLV        G    +++  +
Sbjct: 1065 PTGAGILHEKPILMGAPHKTKMIAHFHVGDIITSLHKVSLV-------AGGREVIVYTGL 1117

Query: 940  NGVIGVIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGD 998
            +G IG++   +  E   F+  L+ ++R     + G +   +R +      V  K  +DGD
Sbjct: 1118 HGTIGILMPFISKEDVDFISTLEQHMRTEQPSLVGRDQLAYRGY-----YVPVKAVVDGD 1172

Query: 999  LIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            L E++  L  ++   I+  ++ +V E+ K++E++
Sbjct: 1173 LCETYAHLPASKQSSIANELDRTVGEVLKKLEQM 1206


>gi|395333071|gb|EJF65449.1| hypothetical protein DICSQDRAFT_178021 [Dichomitus squalens LYAD-421
            SS1]
          Length = 1213

 Score =  233 bits (594), Expect = 4e-58,   Method: Compositional matrix adjust.
 Identities = 283/1185 (23%), Positives = 487/1185 (41%), Gaps = 194/1185 (16%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
            ++G I +L  FR  G  +D++ + ++  +  +L +D ++S  +            R    
Sbjct: 58   VFGSIRSLAAFRLTGGTKDYVIVGSDSGRIVILDYDPKTSSFVKVHQETFGKSGARRIVP 117

Query: 66   GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLK-----EAF--NIRLEELQVLDIKF 116
            GQ    DP  R  +I       L  ++  D    L      EA   N  +  +  LD+ F
Sbjct: 118  GQYLATDPKGRSVMIAAMEKAKLVYILNRDAAANLTISSPLEAHKNNAIIHHIVGLDVGF 177

Query: 117  LYGCAKPTIVVLYQD-------------NKDARHVKTYEVALKDKDFVEGPWSQNNLDNG 163
                  PT   L  D             N   + +  YE+ L     V   WS+   D  
Sbjct: 178  ----ENPTFAALEVDYSESDQDPSGEAFNNAEKMLTYYELDLGLNHVVRK-WSEPT-DPR 231

Query: 164  ADLLIPVP------------PPLCGVLIIGEETIVYCSANAFK---AIPIR------PS- 201
            A+LL+ VP            P   GVL+  E+ I+Y   +A +    IP R      P  
Sbjct: 232  ANLLVQVPGGQLATSDRYDGP--SGVLVCCEDHIIYRHMDAPQHRVPIPRRRHPLEDPER 289

Query: 202  ---ITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLD 258
               IT A          +LL    G L+ + I HE++ V  LKI+      +AS +  L 
Sbjct: 290  GLIITAAVMHKMKGAFFFLLQSEEGDLYKVTIEHEEQDVKALKIKYFDTVPVASGLCILK 349

Query: 259  NAVVYIGSSYGDS---QLIKL---NLQPD-AKGSY------------------------V 287
            +  +++ S +G+    Q  KL   + +P+ + G Y                        +
Sbjct: 350  SGFLFVASEFGNHYFYQFQKLGDDDNEPEFSSGDYPSYGMADPTAALPRAYFRPRALDNL 409

Query: 288  EVLERYVNLGPIVDFCVVDL-ERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG 346
             +++   +L PI+D  V++L       Q+    G     + R +R+G+ + E  S +L G
Sbjct: 410  TLVDELESLCPILDSKVMNLLPNSDTPQIFAACGRGARSTFRTLRHGLEVEEVVSSDLPG 469

Query: 347  I-KGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAI 405
            I   +W+ +   DDP+D+++++SF++ T +L +   + +EE +  GF S   TL      
Sbjct: 470  IPNAVWTTKLKEDDPYDSYIILSFVNGTLVLCIG--ETIEEVQDTGFLSSAPTLAVQQIG 527

Query: 406  YNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG- 464
             + L+QV    +R V S  R   NEW+ P G ++  AT N  QV++A     LVY E+  
Sbjct: 528  ADALLQVHPHGIRHVLSDRRV--NEWRVPSGKTIVCATTNKRQVVVALSSAELVYFELDL 585

Query: 465  DGILTEVKHAQ-LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEH 523
            DG L E +  + +   +  L I  + E    +   AVG   D +VRI SL   + +    
Sbjct: 586  DGQLNEYQDRKAMGSTVLALSIAEVPEGRQRTPYLAVGC-EDQTVRIVSLDPESTLETIS 644

Query: 524  LGGEIIPRSVLLCA---------FEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLG 574
            L     P S +  A          +  +++   L +G LL  +L+   G+LTD +   LG
Sbjct: 645  LQALTAPPSAICIADMLDAGINKTQPTTFVNIGLQNGVLLRTVLDPVNGQLTDTRTRFLG 704

Query: 575  TQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLA 634
            T+PI L   + +    + A S RP + Y+    + ++ +  + + +   F++    + L 
Sbjct: 705  TRPIRLLRVNIQRNPAILALSSRPWLNYTHQNFMHFTPLIFENLDYAWSFSAELCQEGLI 764

Query: 635  IAKEGELTIGTIDDI-QKLHIRSIPLGEHPRRI-----------------CHQEQSRTFA 676
                  L I  I  +  KL   SIPL   PR++                    E++    
Sbjct: 765  GISGSLLRIFQIPKLGTKLKQDSIPLSYTPRKLIPHPHNGLLYLIEGDHRVMSEEAAAKQ 824

Query: 677  ICSLKNQSCAEESEM----------------------HFVRLLDDQTFEFISTYPLDTFE 714
            +  L+    A + EM                        +  LD QT   I    LD  E
Sbjct: 825  LQQLRESGRAVDEEMVNLPPEQFGRPKAPAGTWASCIRIISPLDAQTVNVIH---LDNNE 881

Query: 715  YGCSILSCSFSDDSN-VYYCVGTA---YVLPEENEPTKGRILVFIVEDGKLQLIAEKETK 770
               S+    F+  +N ++  VGTA   ++ P        R   F  +   L+L+ + ET 
Sbjct: 882  AAFSLAIVPFAAKNNELHLVVGTAQDTFLAPRSCTSGFLRTYRFTDDGRNLELLHKTETN 941

Query: 771  GAVYSLNAFNGKLLAAINQKIQLY----KWMLRDDGTRELQSECGHHGHILALYVQTRGD 826
                ++ AF GKL+A + + ++LY    K +LR    + L S          + + T+G 
Sbjct: 942  DVPLAIMAFQGKLVAGVGKALRLYDIGKKKLLRKVENKTLGSTI--------VTLNTQGS 993

Query: 827  FIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR- 885
             I++GD+ +S+   +YK  E  +   A D    W++A  +LD +  + ++   N+F  R 
Sbjct: 994  RIIIGDMQESVFYAVYKPPENRLLVFADDVQPRWVTATTMLDYNTVVASDRFGNVFVNRL 1053

Query: 886  --KNSEGATDEERG---------------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV 928
              K S+   D+  G               +  ++  +H+G+ V      SLV        
Sbjct: 1054 DAKISDQIDDDPTGAGILHEKGVLFGAPHKTAMLAHFHIGDIVTSLNKISLV-------A 1106

Query: 929  GQIPTVIFGTVNGVIGVIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKK 987
            G    +++  ++G IG++   +  E    L  L+ ++R     + G +H  WR +     
Sbjct: 1107 GGREVILYTCLHGTIGILVPFVSKEDVDLLTTLEQHMRTEQLSLVGRDHLAWRGY----- 1161

Query: 988  TVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
             V  K  +DGDL ESF  L   +   I+  ++ +V E+ K++E+L
Sbjct: 1162 YVPVKAVVDGDLCESFAKLPANKQSSIAGELDRTVGEVLKKLEQL 1206


>gi|346971485|gb|EGY14937.1| pre-mRNA-splicing factor RSE1 [Verticillium dahliae VdLs.17]
          Length = 1230

 Score =  232 bits (592), Expect = 8e-58,   Method: Compositional matrix adjust.
 Identities = 269/1054 (25%), Positives = 453/1054 (42%), Gaps = 169/1054 (16%)

Query: 119  GCAKPTIVVLYQDNKDARHVKTYEVALKDK------------DFVEGPWSQNNLDNGADL 166
            G A P    L  D  +A    T + AL  +            + V   WS+  +DN A L
Sbjct: 200  GYANPVFAALETDYTEADQDPTGQAALDVETQLVYYELDLGLNHVVRKWSEP-VDNTASL 258

Query: 167  LIPVP-----PPLCGVLIIGEETIVYCSAN--AFKA-IPIRPSITK--AYGRVDADGSRY 216
            L  VP     P   GVL+ GEE I Y  +N  AF+  +P R   T+  +  R    G  +
Sbjct: 259  LFQVPGGNDGP--SGVLVCGEENITYRHSNQEAFRVPVPRRRGATEDPSRKRCIVAGVMH 316

Query: 217  LLGDHAGLLHLLVITHEKE------------------KVTGLKIELLGETSIASTISYLD 258
             L   AG    L+ T + +                  +V  LKI+      +AS++  L 
Sbjct: 317  KLKGSAGAFFFLLQTEDGDLFKITIDMIEDRDGNPTGEVKRLKIKYFDTIPVASSLCILK 376

Query: 259  NAVVYIGSSYGDSQLIKL---------------NLQPDAKGSY------------VEVLE 291
            +  +Y+ S +G+ Q  +                +   D K SY            + ++E
Sbjct: 377  SGFLYVASQFGNYQFYQFEKLGDDDEELEFSSDDFPTDPKQSYEAVFFHPRELENLALVE 436

Query: 292  RYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGM 350
               ++ P++D  V +L  +   Q+ T  G     + RI+++G+ +NE  + EL GI   +
Sbjct: 437  SIDSMNPLIDCKVANLTGEDAPQIYTACGNGARSTFRILKHGLEVNEIVASELPGIPSAV 496

Query: 351  WSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLV 410
            W+L+ S  D +D ++V+SF + T +L++   + +EE    GF +   TL         L+
Sbjct: 497  WTLKLSRGDQYDAYIVLSFTNATLVLSIG--ETVEEVNDSGFLTSVPTLAAQLLGGEGLI 554

Query: 411  QVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILT 469
            QV    +R + +      NEW +P   S+  AT N  QV +A   G +VY E+  DG L 
Sbjct: 555  QVHPKGIRHIRNGQV---NEWAAPQHRSIVAATTNEHQVAVALSSGEIVYFEMDSDGSLA 611

Query: 470  EVKHAQLEY-EISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGE 527
            E    +  +  ++CL +  + E    S   AVG   D +VR+ SL P+  L +K      
Sbjct: 612  EYDEKKEMFGTVTCLSLGEVPEGRLRSSFLAVGC-DDSTVRVLSLDPETTLESKSVQALT 670

Query: 528  IIPRSVLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLR 581
              P S+ + A E  S      YL   L  G  L  +L+  TGELTD ++  LG +P+ L 
Sbjct: 671  APPTSLAVIAMEDSSSGGSTLYLHIGLHSGVYLRTVLDEVTGELTDTRQKFLGPKPVKLF 730

Query: 582  TFSSKNTTHVFAASDRPTVIYSS--NKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEG 639
              + +  T V   S RP + Y+    K  + + +N  ++     F+S    + +   +  
Sbjct: 731  QVTVQGRTCVLGLSSRPWLGYADPITKGFVVTPLNYVDLEWGWNFSSEQCEEGVVGIQGQ 790

Query: 640  ELTIGTIDDI-QKLHIRSIPLGEHPRRIC-HQEQSRTFAICS--------LKNQSCAEES 689
             L I  I+ +   L  +SIPL   PRR+  H E    + I S        L+ Q  A+ S
Sbjct: 791  SLRIFAIEKLGDTLTQKSIPLTYTPRRMVKHPEHPMFYTIESDNNTLPPELRAQLLADPS 850

Query: 690  ------------EMHFVR----------LLD--DQTFEFISTYPLDTFEYGCSILSCSF- 724
                        E  + R          ++D   +  + + T  LD  E   S     F 
Sbjct: 851  VVNGDARTLPPAEFGYPRAKGRWASCISVIDPLSEELQTLQTVDLDNNEAAVSAAIVPFT 910

Query: 725  SDDSNVYYCVGTAY-VLPEENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVYSLNAFNG 781
            S D+  +  VGT   ++    + T+G I ++   EDG+ L+ I + + +    +L AF G
Sbjct: 911  SQDNESFLVVGTGKDMIVNPRQFTEGYIHIYRFSEDGRELEFIHKTKVEEPPTALLAFQG 970

Query: 782  KLLAAINQKIQLY----KWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSI 837
            +L+A + + +++Y    K ML     R+ Q++       L + + T+G  IVVGD+ + +
Sbjct: 971  RLVAGVGKTLRIYDLGQKQML-----RKAQADVAPQ---LIVSLSTQGSRIVVGDVQQGV 1022

Query: 838  SLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDE 894
            + ++YK     +     D  A WM+   ++D +   G +   N+F VR   K S+ A +E
Sbjct: 1023 TYVVYKPLSNKLIPFVDDTVARWMTCTTMVDYESVAGGDKFGNIFIVRAPEKASQEADEE 1082

Query: 895  ERG---------------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTV 939
              G               RL +V  ++  + +      SLV       VG    +++  +
Sbjct: 1083 GAGLHLTNTRDYLHGTPHRLSLVSHFYSQDVLTSITKTSLV-------VGGQDVLLWSGI 1135

Query: 940  NGVIGV-IASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGD 998
            +G IGV I  +  E   F + L+ +LR     + G +H  +R +    K V     +DGD
Sbjct: 1136 SGTIGVFIPFVTREDADFFQTLEQHLRTEDAPLAGRDHLMYRGYYAPVKGV-----IDGD 1190

Query: 999  LIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            L E +  L   +   I+  ++ SV E+ +++ ++
Sbjct: 1191 LCERYTLLPNDKKQMIAGELDRSVREIERKISDI 1224


>gi|322693432|gb|EFY85292.1| Pre-mRNA-splicing factor RSE1 [Metarhizium acridum CQMa 102]
          Length = 1221

 Score =  231 bits (588), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 289/1166 (24%), Positives = 496/1166 (42%), Gaps = 168/1166 (14%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
            ++G I +L  FR  G  +D+L IA++  +  ++++    +      +        R    
Sbjct: 64   VFGIIRSLAAFRLAGSNKDYLIIASDSGRITIVEYLPAQNRFSRLHLETFGKSGVRRVIP 123

Query: 66   GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YGC 120
            G+    DP  R  LI     + L  V+    + +L  +  +   +  VL I  +    G 
Sbjct: 124  GEYLACDPKGRACLIASTEKNKLVYVLNRSAQAELTISSPLEAHKPGVLVISMVALDVGY 183

Query: 121  AKPTIVVLYQDNKDA---------RHVKT----YEVALKDKDFVEGPWSQNNLDNGADLL 167
            A P    L  D  D            V+T    YE+ L     V   WS+  +D  A LL
Sbjct: 184  ANPVFAALEMDYSDVDQDSSGQALEEVETQLVYYELDLGLNHVVRK-WSEP-VDPTASLL 241

Query: 168  IPVP-----PPLCGVLIIGEETIVYCSAN--AFKA-IPIRPSITK--AYGRVDADGSRYL 217
              VP     P   GVL+ GEE+I Y  +N  AF+  IP R   T+  +  R+   G  + 
Sbjct: 242  FQVPGGNDGP--SGVLVCGEESITYRHSNQDAFRVPIPRRKGATEDPSRKRIIVSGVMHK 299

Query: 218  LGDHAGLLHLLVITHEKE------------------KVTGLKIELLGETSIASTISYLDN 259
            L   AG    L+ T + +                  +V  LKI+      +A+++  L +
Sbjct: 300  LKGSAGAFFFLLQTEDGDLFKVTIDMVEDEEGSPTGEVRRLKIKYFDTVPVATSLCILKS 359

Query: 260  AVVYIGSSYGD---SQLIKL------------NLQPDAKGSY------------VEVLER 292
              +YI S +G+    Q  KL            +   D + SY            + ++E 
Sbjct: 360  GFLYIASQFGNFSFYQFEKLGDDDEELEFSSDDFPVDPQASYDPVYFHPRPAENLALVES 419

Query: 293  YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMW 351
              ++ P++D  V +L  +   Q+ T  G     + R++R+G+ +NE  + EL GI   +W
Sbjct: 420  IPSMNPLLDCQVANLTGEDAPQIYTVCGNGARSTFRMLRHGLEVNEIVASELPGIPSAVW 479

Query: 352  SLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQ 411
            +L+ +  + +D ++V+SF + T +L++   + +EE    GF +   TL       + L+Q
Sbjct: 480  TLKLNRGEQYDAYIVLSFTNGTLVLSIG--ETVEEVSDSGFSTSVPTLAAQLLGDDGLIQ 537

Query: 412  VTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTE 470
            V    +R V +      NEW +P   S+  A+ NA QV +A   G +VY E+  DG L E
Sbjct: 538  VHPKGIRHVRNGKV---NEWDAPQHRSIVAASTNAHQVAIALSSGEIVYFEMDSDGSLAE 594

Query: 471  VKHAQLEY-EISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEI 528
                +  +  ++CL +  + E  + S   AVG   D +VRI SL P+  L +K       
Sbjct: 595  YDEKKEMFGTVTCLSLGEVPEGRARSSFLAVGC-DDCTVRILSLDPESTLESKSVQALTA 653

Query: 529  IPRSVLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRT 582
             P S+ + A +  S      YL   L  G  L  +L+  TGELTD ++  LG + + L  
Sbjct: 654  APSSLAIIAMDDSSSGGSTLYLHIGLHSGVYLRTVLDEVTGELTDTRQKFLGPKQVRLFQ 713

Query: 583  FSSKNTTHVFAASDRPTVIYSS--NKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGE 640
             + + TT V   S RP + YS    K  + + +N  ++     F+S    + +   +   
Sbjct: 714  VTVQGTTCVLGLSSRPWLGYSDPITKGFVVTPLNYVDLEWGWNFSSEQCEEGIVGIQGQS 773

Query: 641  LTIGTIDDI-QKLHIRSIPLGEHPRR-ICHQEQSRTFAICS--------LKNQSCAEESE 690
            L I +ID +   L  ++I L   P++ I H EQ   + I S        L+ Q  A+ + 
Sbjct: 774  LRIFSIDRLGDTLTQKAISLTYTPKKLIKHPEQPLFYTIESDNNTLPPDLRAQLIADPAV 833

Query: 691  MHF-VRLLDDQTF-----------------------EFISTYPLDTFEYGCSILSCSF-S 725
            ++   ++L  + F                       + + T  L+  E   S     F S
Sbjct: 834  VNGDTKVLPPEEFGYPKGNRRWASCINVIDPVSEEPQVVQTVDLENNEAAVSAAIVPFAS 893

Query: 726  DDSNVYYCVGTA---YVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGK 782
             D+  +  VGT     V P         +  F  E  +L+ I + + +    +L  F GK
Sbjct: 894  QDNESFLIVGTGKDIVVNPRNFSEAYIYVYRFQEEGRELEFIHKTKIEEPALALIPFQGK 953

Query: 783  LLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIY 842
            LLA + + +++Y   +R    R+ Q+E      I++L   T+G  I+VGD+ + ++ + Y
Sbjct: 954  LLAGVGKTLRVYDLGMR-QMLRKAQAEVAPQ-QIVSL--NTQGSRIIVGDVQQGVTYVTY 1009

Query: 843  KHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDEERG-- 897
            K     +   A D  A W++   ++D +   G +   N+F VR   K SE A +E+ G  
Sbjct: 1010 KPTTNKLIPFADDIIARWITCTTMVDYESVAGGDKFGNMFIVRCPPKASEEADEEQSGLH 1069

Query: 898  -------------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIG 944
                         RL+++  ++  +        SLV       VG    +++  + G IG
Sbjct: 1070 LMNARDYLHGTSQRLDLMCHFYTQDIPTSMAKTSLV-------VGGQDVLLWSGLMGTIG 1122

Query: 945  V-IASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESF 1003
            V I  +  E   F + L+++LR     + G +H  +RS+         K  +DGDL E +
Sbjct: 1123 VFIPLISREDADFFQSLESHLRTEDPPLAGRDHLMYRSY-----YAPVKGIIDGDLCERY 1177

Query: 1004 LDLSRTRMDEISKTMNVSVEELCKRV 1029
              L   +   I+  ++ SV E+ +++
Sbjct: 1178 TLLPNDKKQMIAGELDRSVREIERKI 1203


>gi|66811906|ref|XP_640132.1| CPSF domain-containing protein [Dictyostelium discoideum AX4]
 gi|74854972|sp|Q54SA7.1|SF3B3_DICDI RecName: Full=Probable splicing factor 3B subunit 3
 gi|60468134|gb|EAL66144.1| CPSF domain-containing protein [Dictyostelium discoideum AX4]
          Length = 1256

 Score =  231 bits (588), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 245/1065 (23%), Positives = 456/1065 (42%), Gaps = 192/1065 (18%)

Query: 133  KDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPL---CGVLIIGEETIVYCS 189
            K  + +  YE+ L   + V   WS + +D+ A++++ VP       GVL+  E+ IVY +
Sbjct: 213  KKKKLLTYYELDLGLNNVVR-KWS-DQVDDSANIVMTVPGGTEGPGGVLVASEDYIVYRN 270

Query: 190  ANAFKAIPIRPSITKAYGRVDADGSR--------------YLLGDHAGLLHLLVITHEKE 235
             +  +   +R  I + YG     G                +L+    G L+ + + ++ +
Sbjct: 271  QDHAE---VRSRIPRRYGSDPNKGVLIISHSSHKQKGMFFFLVQSEHGDLYKITLDYQGD 327

Query: 236  KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLI--------------------- 274
            +V+ + +       +A+ ++ L N  ++  S +GD  L                      
Sbjct: 328  QVSEVNVNYFDTIVLANCLTVLKNGFLFAASEFGDHTLYFFKSIGDEEEEGQAKRLEDKD 387

Query: 275  -KLNLQP-DAKGSYVEVLE------RYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGS 326
              L   P ++ G+ +E L+         +L PI+DF V+DL R+   Q+ +  G   + S
Sbjct: 388  GHLWFTPRNSCGTKMEELKNLEPTSHLSSLSPIIDFKVLDLVREENPQLYSLCGTGLNSS 447

Query: 327  LRIVRNGIGINEQASVELQGI-KGMWSLRSSTD----DPFDTFLVVSFISETRILAMNLE 381
            L+++R+G+ +    +  L G+  G+W++  ST     D  D ++VVSF+  T +L++   
Sbjct: 448  LKVLRHGLSVTTITTANLPGVPSGIWTVPKSTSPNAIDQTDKYIVVSFVGTTSVLSVG-- 505

Query: 382  DELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNV 441
            D ++E    G    T TL       + ++QV     R + S  R   NEW++P   ++  
Sbjct: 506  DTIQENHESGILETTTTLLVKSMGDDAIIQVFPTGFRHIKSDLR--INEWRAPGRKTIVR 563

Query: 442  ATANASQVLLATGGGHLVYLEIGDGI-LTEVKHAQLEYEISCLDINPIGENPSYSQIAAV 500
            A+AN SQ+ +A  GG ++Y E+     L E+    L  +I+C++I+PI +  + ++  AV
Sbjct: 564  ASANQSQLAIALSGGEIIYFELDQASNLIEIIKKDLRRDIACIEISPIPKGRNMARFIAV 623

Query: 501  GMWTDISVRIFSLPDLNLITKEHL--GGEIIPRSVLLCAFE----GIS------------ 542
              W +  +R+ SL   N + +  +    ++   S+ +   +    GI             
Sbjct: 624  SDW-EGPIRVLSLDRDNCLGQVSMLDTDKVYIESLSIIEMQLNEMGIETKKSQSQTGQTT 682

Query: 543  -------------------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTF 583
                               +L   L +G +    L+  TGEL+D +   LG +P+ L   
Sbjct: 683  TTTTSTSSASSSVTSGGSLFLFVGLKNGVVKRATLDSVTGELSDIRTRLLGRKPVKLFKV 742

Query: 584  SSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTI 643
              + +  + A S R  + Y +  KL    ++++ + +    +S    +S+    E ++ I
Sbjct: 743  KVRGSNAMLALSSRVWLNYINQGKLDIVPLSIEPLENASNLSSEQSAESIVATSENKIII 802

Query: 644  GTIDDIQKL-HIRSIPLGEHPRR-ICHQEQSRTFAICSLKNQSC--------AEESEM-- 691
             +ID +  L +  +I L   P+R I H + S    + +  N +          E+SE   
Sbjct: 803  FSIDKLGDLFNQETIKLNATPKRFIIHPQTSYIIILETETNYNTDNIDIDKINEQSEKLL 862

Query: 692  -----------------------------------------HFVRLLDDQTFEFISTYPL 710
                                                      +++++D  T E + +  L
Sbjct: 863  LEKQKELQQEMDIDDDDQNNNNEIEPFKKLFKPKAGKGKWKSYIKIMDPITHESLESLML 922

Query: 711  DTFEYGCSILSCSFSDDSNVYYCVG--TAYVL-PEENEPTKGRILVFIVEDGKLQLIAEK 767
            +  E G S+ +CSF +   ++  VG  T  VL P+ ++     +  FI    KL+L+ + 
Sbjct: 923  EDGEAGFSVCTCSFGESGEIFLVVGCVTDMVLNPKSHKSAHLNLYRFIDGGKKLELLYKT 982

Query: 768  ETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY-VQTRGD 826
            E +  VY++  F GKL+  + + I++Y     D G ++L  +C        +  + + GD
Sbjct: 983  EVEEPVYAMAQFQGKLVCGVGKSIRIY-----DMGKKKLLRKCETKNLPNTIVNIHSLGD 1037

Query: 827  FIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR- 885
             +VVGD+ +SI  + YK  E  +   A D    WM++  +LD D   GA+   N+F +R 
Sbjct: 1038 RLVVGDIQESIHFIKYKRSENMLYVFADDLAPRWMTSSVMLDYDTVAGADKFGNIFVLRL 1097

Query: 886  -----------------KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV 928
                             K   G  +    +L+ +  + +G+ V      SLV       V
Sbjct: 1098 PLLISDEVEEDPTGTKLKFESGTLNGAPHKLDHIANFFVGDTVTTLNKTSLV-------V 1150

Query: 929  GQIPTVIFGTVNGVIG-VIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKK 987
            G    +++ T++G IG +I     E   F   L+ N+R     + G +H  +RS+     
Sbjct: 1151 GGPEVILYTTISGAIGALIPFTSREDVDFFSTLEMNMRSDCLPLCGRDHLAYRSY----- 1205

Query: 988  TVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
                KN +DGDL E F  L+  +   IS+ ++ S  E+ K++EE+
Sbjct: 1206 YFPVKNIIDGDLCEQFSTLNYQKQLSISEELSRSPSEVIKKLEEI 1250


>gi|429859776|gb|ELA34542.1| pre-mRNA-splicing factor rse1 [Colletotrichum gloeosporioides Nara
            gc5]
          Length = 1212

 Score =  230 bits (587), Expect = 3e-57,   Method: Compositional matrix adjust.
 Identities = 281/1173 (23%), Positives = 489/1173 (41%), Gaps = 166/1173 (14%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            +L   I+G I ++  FR  G  +D+L +AT+  +  ++++    +      +        
Sbjct: 59   LLSHDIFGIIRSMAAFRLAGSNKDYLILATDSGRITIVEYIPAQNRFQRLHLETFGKSGV 118

Query: 61   RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL- 117
            R    G+    DP  R  LI     + L  V+  + + +L  +  +   +  VL +  + 
Sbjct: 119  RRVIPGEYLACDPKGRACLIASVEKNKLVYVLNRNAQAELTISSPLEAHKPGVLVLSMVA 178

Query: 118  --YGCAKPTIVVLYQDNKDARHVKTYEVALKDK------------DFVEGPWSQNNLDNG 163
               G A P    L  +  +A    T E A + +            + V   WS+  +D  
Sbjct: 179  LDVGYANPVFAALEIEYTEADQDPTGEAAREAETQLVYYELDLGLNHVVRKWSEP-VDPT 237

Query: 164  ADLLIPVP-----PPLCGVLIIGEETIVYCSAN--AFKA-IPIRPSITKAYGRVD--ADG 213
            A LL  VP     P   GVL+ GEE I Y  +N  AF+  IP R   T+   R      G
Sbjct: 238  ASLLFQVPGGQDGP--SGVLVCGEENITYRHSNQEAFRVPIPRRRGATEDPSRKRHIVSG 295

Query: 214  SRYLLGDHAGLLHLLVITHEKE------------------KVTGLKIELLGETSIASTIS 255
              + L   AG    L+ T + +                  +V  LKI+      ++S++ 
Sbjct: 296  VMHKLKGSAGAFFFLLQTEDGDLFKAVIDMVEDADGNPTGEVKRLKIKYFDTVPVSSSLC 355

Query: 256  YLDNAVVYIGSSYGDSQLIKL---------------NLQPDAKGSY------------VE 288
             L +  +Y  S +G+ Q  +                +   D K  Y            + 
Sbjct: 356  ILKSGFLYAASQFGNHQFYQFEKLGDDDEEKEFSSDDFPADPKAGYDAVYFYPRPLENLA 415

Query: 289  VLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI- 347
            ++E   ++ P++D  V +L  +   Q+ T  G     + R++++G+ +NE  + EL GI 
Sbjct: 416  LVESIDSMNPLLDCKVANLTGEDAPQIYTACGNGARSTFRMLKHGLEVNEIVASELPGIP 475

Query: 348  KGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYN 407
              +W+L+ S  D +D ++V+SF + T +L++   + +EE    GF +   TL       +
Sbjct: 476  SAVWTLKLSRGDQYDAYIVLSFTNGTLVLSIG--ETVEEVSDSGFLTSVPTLAAQLLGED 533

Query: 408  QLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDG 466
             L+QV    +R +        NEW +P   S+  AT N  QV +A   G +VY E+ GDG
Sbjct: 534  GLIQVHPKGIRHIRHGQI---NEWAAPQHRSIVAATTNEHQVAVALSSGEIVYFEMDGDG 590

Query: 467  ILTEVKHAQLEY-EISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHL 524
             L E    +  +  ++CL +  + E    S   AVG   D +VR+ SL PD  L +K   
Sbjct: 591  SLAEYDEKKEMFGTVTCLSLGEVPEGRLRSSFLAVGC-DDSTVRVLSLDPDSTLESKSVQ 649

Query: 525  GGEIIPRSVLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPI 578
                 P S+ + A +  S      YL   L  G  L  +L+  TGELTD ++  LG +P+
Sbjct: 650  ALTAPPSSLAIIAMDDSSSGGSALYLHIGLHSGVYLRTVLDEVTGELTDTRQKFLGPKPV 709

Query: 579  TLRTFSSKNTTHVFAASDRPTVIYSS--NKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIA 636
             L   + +  T V   S RP + YS    +  + + +N  ++  +  F+S    + +   
Sbjct: 710  RLFQVTVQGRTCVIGLSSRPWLGYSDPITRGFVVTPLNYVDLEWVWNFSSEQCEEGVVGI 769

Query: 637  KEGELTIGTIDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFV- 694
            +   L I  I+++   +  +SIPL   PRR+        F      N +   +     + 
Sbjct: 770  QGQSLRIFAIENLGDTITQKSIPLTYTPRRLLKHPDHPMFYTVESDNNTLPPDLRAKLIA 829

Query: 695  ---------RLLDDQTFEF-----------------------ISTYPLDTFEYGCSILSC 722
                      +L    F +                       + T  LD  E   S    
Sbjct: 830  EPGVVNGDATVLPPDEFGYPKGKGRWASCISVIDPLSEEQRVLQTVDLDDNEAAVSAAIV 889

Query: 723  SF-SDDSNVYYCVGTAY-VLPEENEPTKGRILVF-IVEDG-KLQLIAEKETKGAVYSLNA 778
            SF S D+  +  VGT   ++    + T+G I V+   EDG +L+ I + + +    +L A
Sbjct: 890  SFASQDNENFLIVGTGKDMIVNPRQFTEGYIHVYRFGEDGHELEFIHKTKVEEPPTALLA 949

Query: 779  FNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSIS 838
            F G+LLA + + +++Y   LR    R+ Q++       L + + T+G  IVVGD+   ++
Sbjct: 950  FQGRLLAGVGKTLRIYDLGLR-QMLRKSQADVAPQ---LIVSLSTQGSRIVVGDVQHGVT 1005

Query: 839  LLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDEE 895
             ++YK     +     D  A W++   ++D +   G +   N+F VR   K S+ A +E+
Sbjct: 1006 YVVYKPTTNKLIPFVDDTIARWVTCTTMVDYESVAGGDKFGNMFLVRCPEKASQEADEEQ 1065

Query: 896  RG---------------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVN 940
             G               RL ++   +  +        SLV       VG    +++  +N
Sbjct: 1066 AGLHLLNTRDYLHGAPHRLNLLSHSYTQDVPTSITKTSLV-------VGGQDVLLWSGIN 1118

Query: 941  GVIGV-IASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDL 999
            G IGV I  +  E   F + L+ ++R     + G +H  +R +      V  K  +DGDL
Sbjct: 1119 GTIGVFIPFVTREDVDFFQNLEQHMRTEDAPLAGRDHLMYRGY-----YVPVKGVIDGDL 1173

Query: 1000 IESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
             E +  L   +   I+  ++ SV E+ +++ ++
Sbjct: 1174 CERYTLLPNDKKLMIAGELDRSVREIERKISDI 1206


>gi|336371417|gb|EGN99756.1| hypothetical protein SERLA73DRAFT_88390 [Serpula lacrymans var.
            lacrymans S7.3]
 gi|336384183|gb|EGO25331.1| hypothetical protein SERLADRAFT_355643 [Serpula lacrymans var.
            lacrymans S7.9]
          Length = 1216

 Score =  230 bits (586), Expect = 3e-57,   Method: Compositional matrix adjust.
 Identities = 270/1174 (22%), Positives = 484/1174 (41%), Gaps = 169/1174 (14%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
            ++G I +L  FR  G  +D+  + ++  +  +L +D  +S  +            R    
Sbjct: 58   VFGSIRSLAAFRLTGGTKDYAIVGSDSGRIVILDYDPRTSSFVKLHQETFGKSGSRRIVP 117

Query: 66   GQIGIIDPDCRLIGLHLYDG--LFKVIPFDNKGQLKEAFNIR-------LEELQVLDIKF 116
            GQ    DP  R + +   +   L  ++  D    L  +  +        +  +  LD+ F
Sbjct: 118  GQYLATDPKGRSVMISAMEKAKLVYILNRDAAANLTISSPLEAHKNAAIIHHIVGLDVGF 177

Query: 117  ---LYGCAKPTIVVLYQD------NKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLL 167
               L+   +       QD      N   + +  YE+ L     V   WS+   D  A+LL
Sbjct: 178  ENPLFAALEVDYTESDQDPTGEAFNNAEKMLTYYELDLGLNHVVRK-WSEPT-DPRANLL 235

Query: 168  IPVP------------PPLCGVLIIGEETIVYCSANAFK-AIPI-----------RPSIT 203
            + VP            P   GVL+  E+ I+Y      +  IPI           R  I 
Sbjct: 236  VQVPGGQLASSERFDGP--SGVLVCCEDHIIYRHMGVPQHRIPIPRRRHPLEDPERGIII 293

Query: 204  KAYGRVDADGSRY-LLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVV 262
             A       G+ + LL    G L+ + I HE ++V  LKI+      +AS++  L +  +
Sbjct: 294  VAAVMHKMKGAFFFLLQSEDGDLYKVTIEHEDDEVKALKIKYFDTVPVASSLCILKSGFL 353

Query: 263  YIGSSYGDSQLIKLNLQPD-------AKGSY------------------------VEVLE 291
            ++ S +G+  L +     D       +  SY                        + + +
Sbjct: 354  FVASEFGNHYLYQFQKLGDDDNEPEFSSTSYPSFGMAEPSMPLPHAHFHPRALDNLALAD 413

Query: 292  RYVNLGPIVDFCVVD-LERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KG 349
               +L PI+D  V++ L      Q+    G     SLR++R+G+ + E  S EL GI   
Sbjct: 414  EMESLDPILDSKVMNILPNSDTPQIFAACGRGARSSLRMLRHGLDVEESVSSELPGIPNA 473

Query: 350  MWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQL 409
            +W+ +   DDPFD+++++SF++ T +L++   + +EE +  GF S   TL       + L
Sbjct: 474  VWTTKRKEDDPFDSYIILSFVNGTLVLSIG--ETIEEVQDTGFLSSAPTLAVQQIGADAL 531

Query: 410  VQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGIL 468
            +QV    +R V +  R   NEW+ P G ++  A  N  QV++A     LVY E+  DG L
Sbjct: 532  LQVHPQGIRHVLADRRV--NEWRVPQGKTIVSAATNKRQVVVALSSAELVYFELDLDGQL 589

Query: 469  TEVKHAQ-LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGE 527
             E +  + +   +  L +  + E    +   AVG   D +VRI SL   + +    L   
Sbjct: 590  NEYQDRKAMGSTVLALSVGEVPEGRQRTPYLAVGC-EDQTVRIISLDPESTLETISLQAL 648

Query: 528  IIPRSVLLCA---------FEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPI 578
              P S +  A          +   ++   L +G LL  +L+   G+LTD +   LGT+PI
Sbjct: 649  TAPPSAICIADMLDASINKTQPTMFVNIGLQNGVLLRTVLDPINGQLTDTRTRFLGTRPI 708

Query: 579  TLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKE 638
             L          + A S R  + Y+    + ++ +  + + +   F++   P+ L     
Sbjct: 709  RLVRVQIHGNPAILALSSRSWLNYTHQNLMHFTPLIFENLDYAWSFSAELSPEGLIGIAG 768

Query: 639  GELTIGTIDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEES------EM 691
              L I  I  +  KL   +IPL   PR+      +R F +    ++   E++      EM
Sbjct: 769  SVLRIFQIPKLGTKLKQDTIPLSYTPRKFISHPTNRYFYLIEGDHRVLGEDAAAKRLDEM 828

Query: 692  H------------------------------FVRLLDDQTFEFISTYPLDTFEYGCSILS 721
                                            +R++D    + +S   LD+ E   S+  
Sbjct: 829  RQRGKMVDDNMLDLPPEVFGRPKAPAGTWASAIRIIDPVEAKTLSMITLDSNECAFSLAI 888

Query: 722  CSFSDDSN-VYYCVGTA---YVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLN 777
              FS   N ++  VGTA   ++ P        R   F  +   L+L+ + ET     +L 
Sbjct: 889  VPFSARGNELHLVVGTAADTFLAPRSCSSGFLRTYKFTEDGTGLELLHKTETDDVPLALM 948

Query: 778  AFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSI 837
            AF G+L+A + + +++Y  + +    R+++++         + + T+G  I+VGD+ +SI
Sbjct: 949  AFQGRLVAGVGKALRIYD-IGKKKLLRKVENKARATFSTAIVTLNTQGSRIIVGDMQESI 1007

Query: 838  SLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDE 894
            S + YK  E  +   A D    W++A  ++D       +   N+F  R   K SE   D+
Sbjct: 1008 SFVAYKAPENRLLVFADDNQPRWITATTMVDYTTIAAGDRFGNIFVNRLDPKVSEQVDDD 1067

Query: 895  ERG---------------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTV 939
              G               + +++  +H+G+ V      SLV        G    +++  +
Sbjct: 1068 PTGAGILHEKGLLMGAPHKTKMLAHFHIGDLVTSINKVSLV-------AGGREVLLYTGL 1120

Query: 940  NGVIGVIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGD 998
            +G IG++   +  E   F+  L+ ++R     + G +   WR +    K+V     +DGD
Sbjct: 1121 HGTIGILVPFVSKEDVDFISTLEQHMRTEQGSLVGRDQLSWRGYYTPVKSV-----VDGD 1175

Query: 999  LIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            L E++  L  T+   I+  ++ +V E+ K++E+L
Sbjct: 1176 LCETYARLPGTKQSAIAGELDRTVGEVLKKLEQL 1209


>gi|403411971|emb|CCL98671.1| predicted protein [Fibroporia radiculosa]
          Length = 1212

 Score =  230 bits (586), Expect = 4e-57,   Method: Compositional matrix adjust.
 Identities = 282/1179 (23%), Positives = 487/1179 (41%), Gaps = 183/1179 (15%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
            ++G I +L  FR  G  +D+  + ++  +  +L +D ++S               R    
Sbjct: 58   VFGSIRSLAAFRLTGGTKDYAVVGSDSGRIVILDYDPKTSSFTKVHQETYGKSGARRIVP 117

Query: 66   GQIGIIDPDCRLIGLHLYDG--LFKVIPFDNKGQLK-----EAF--NIRLEELQVLDIKF 116
            GQ    DP  R + +   +   L  ++  D    L      EA   N  +  +  LD+ F
Sbjct: 118  GQYLATDPKGRSVMISAVEKAKLVYILNRDAAANLTISSPLEAHKNNSIIHHIVGLDVGF 177

Query: 117  LYGCAKPTIVVLYQDNKDARHVKT-------------YEVALKDKDFVEGPWSQNNLDNG 163
                  P    L  D  D+    T             YE+ L     V   WS+   D  
Sbjct: 178  ----DNPMFAALEVDYSDSDQDPTGEAFNNAEKMLTYYELDLGLNHVVRK-WSEPT-DPR 231

Query: 164  ADLLIPVP------------PPLCGVLIIGEETIVYCSANAFK---AIPIRPS------- 201
            A+LL+ VP            P   GVL+  E+ I+Y   +A +    IP R         
Sbjct: 232  ANLLVQVPGGQLASSDRYDGP--SGVLVCCEDHIIYRHMDAPQHRIPIPRRQHPLADTHR 289

Query: 202  ---ITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIAST----- 253
               IT A          +LL    G L+ + I HE + V  LKI+      IAS      
Sbjct: 290  GLIITAAVMHKMKGAFFFLLQSEEGDLYKVTIEHEDQDVRALKIKYFDTVPIASIEYNFN 349

Query: 254  ----ISYL---------------DNAVVYIGSSYGD-------SQLIKLNLQPDAKGSYV 287
                +SY+               DN   +  +S+         S L     +P    + +
Sbjct: 350  CRICVSYIILESYLYQFQKLGDDDNEPEFSSTSFPSFGMADPSSPLPHAYFKPHILDN-L 408

Query: 288  EVLERYVNLGPIVDFCVVDL-ERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG 346
             +++   +L P++D  V++L       QV    G     + R++R+G+ + E  S +L G
Sbjct: 409  TLVDEMESLSPVLDAKVMNLLPNSDTPQVFAACGRGGRSTFRMLRHGLEVEEVVSSDLPG 468

Query: 347  I-KGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAI 405
            I   +W+ +   DDP+D+++++SF++ T +L +   + +EE +  GF S   TL      
Sbjct: 469  IPNAVWTTKLKEDDPYDSYIILSFVNGTLVLCIG--ETIEEVQDTGFLSSAPTLAVQQIG 526

Query: 406  YNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG- 464
            Y+ L+QV    +R V S  R   NEW+ P G ++  AT N  QV++A     LVY E+  
Sbjct: 527  YDALLQVHPHGIRHVLSDKRV--NEWRVPQGKTIVAATTNKRQVVVALSSAELVYFELDL 584

Query: 465  DGILTEVKHAQ-LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEH 523
            DG L E +  + +   +  L I  + E    +   AVG   D +VRI SL   + +    
Sbjct: 585  DGQLNEYQDRKAMGSTVLALSIAEVPEGRQRTPYLAVGC-EDQTVRIVSLDPESTLETIS 643

Query: 524  LGGEIIPRSVLLCA---------FEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLG 574
            L     P S +  A          +  S++   L +G LL  +L+   G+LTD +   LG
Sbjct: 644  LQALTAPPSAICIADMLDAGINKTQPTSFVNIGLQNGVLLRTVLDPVNGQLTDTRTRFLG 703

Query: 575  TQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLA 634
            T+PI L     +    + A S R  + Y+    + ++ +  + + +   F++   P+ L 
Sbjct: 704  TRPIKLIRVLVQRNPGILALSSRSWLNYTYQNLMHFTPLIFENLDYAWSFSAELCPEGLI 763

Query: 635  IAKEGELTIGTIDDI-QKLHIRSIPLGEHPRR-ICH----------------QEQSRTFA 676
                  L I  I  +  KL   +IPL   PR+ I H                 +++   A
Sbjct: 764  GISGSVLRIFQIPKLGTKLKQDAIPLSYTPRKFISHPTNGLFYLIEGDHRVRSDEASAKA 823

Query: 677  ICSLKNQSCAEESEM-------------------HFVRLLDDQTFEFISTYPLDTFEYGC 717
            +  L+ Q    + E+                     +R+++    + ++  PLD  E   
Sbjct: 824  LGELRQQGKMVDDELVNLPPETFGRQKAPAGTWASCIRIINPVDAKTVNIIPLDNNEAAF 883

Query: 718  SILSCSFSDDSN-VYYCVGTA---YVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAV 773
            S+   SFS  S  ++  VGTA   ++ P        R   F  +   L+L+ + ET    
Sbjct: 884  SLAVVSFSARSGELHLVVGTAQDTFLAPRSCTSGFLRTYRFTDDGTNLELLHKTETNDVP 943

Query: 774  YSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY-VQTRGDFIVVGD 832
             ++  F G+L+A + + ++LY     D G ++L  +  +     A+  + T+G  I+VGD
Sbjct: 944  LAVLGFQGRLVAGVGKALRLY-----DMGKKKLLRKVENKTFASAIVSLATQGSRILVGD 998

Query: 833  LMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSE 889
            + +S+S  +YK  E  +   A D    W SA+ ++D +    A+   N++  R   K SE
Sbjct: 999  MQESVSFAVYKPPENKLLVFADDTQPRWTSAMTMVDYNTVASADRFGNIYVNRLDPKVSE 1058

Query: 890  GATDEERG---------------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTV 934
               D+  G               + E++  +H+G+ V      SLV        G    +
Sbjct: 1059 QVDDDPTGAGILHEKGLLAGAPHKTELLSHFHVGDIVTSINKVSLV-------AGGREVL 1111

Query: 935  IFGTVNGVIGVIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKN 993
            ++  ++G IG++   +  E   F+  L+ ++R     + G +H  WR +      V  K 
Sbjct: 1112 LYTGLHGTIGILVPFVSKEDVDFISTLEQHMRTEQLSLVGRDHLTWRGY-----YVPVKA 1166

Query: 994  FLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
             +DGDL E+F  L  ++   I+  ++ +V E+ K++E+L
Sbjct: 1167 VVDGDLCETFARLPASKQSAIAGELDRTVGEVLKKLEQL 1205


>gi|440636768|gb|ELR06687.1| pre-mRNA-splicing factor rse1 [Geomyces destructans 20631-21]
          Length = 1212

 Score =  230 bits (586), Expect = 4e-57,   Method: Compositional matrix adjust.
 Identities = 285/1168 (24%), Positives = 489/1168 (41%), Gaps = 166/1168 (14%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
            ++G I TL  FR  G  +D++ I ++  +  ++++    +      +        R    
Sbjct: 64   VFGIIRTLAAFRLAGSNKDYIIIGSDSGRITIVEFVPAQNRFNRLHLETFGKSGVRRVIP 123

Query: 66   GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YGC 120
            GQ   +DP  R  LI     + L  V+  +++ +L  +  +   + Q L    +    G 
Sbjct: 124  GQYLAVDPKGRACLIASVEKNKLVYVLNRNSQAELTISSPLEAHKAQTLVFALVALDVGY 183

Query: 121  AKPTIVVLYQDNKDARHVKT-------------YEVALKDKDFVEGPWSQNNLDNGADLL 167
            A P    L  D  D+    T             YE+ L     V   WS + +D  A +L
Sbjct: 184  ANPVFAALEVDYGDSDQDPTGQAFEEIEKTLVYYELDLGLNHVVR-KWS-DPVDRTASIL 241

Query: 168  IPVPPPL---CGVLIIGEETIVYCSAN--AFK-AIPIRPSITKAYGRV------------ 209
              VP       GVL+ GE+ I Y  +N  AF+ AIP R   T+   R             
Sbjct: 242  FQVPGGTDGPSGVLVCGEDNITYRHSNQEAFRVAIPRRKGATEDPQRKRSIVAGVMHKMR 301

Query: 210  DADGSRYLL--GDHAGLLHLLV--ITHEKEKVTG----LKIELLGETSIASTISYLDNAV 261
             A G+ + L   D   L  + +  I  +  + TG    LKI+      IA+++  L +  
Sbjct: 302  GAAGAFFFLLQSDDGDLFKITIEMIEDDNGQPTGEVRRLKIKYFDTVPIATSLCILKSGF 361

Query: 262  VYIGSSYGDSQLIKL---------------NLQPDAKGSYVEV------------LERYV 294
            +++ S +G+ Q  +                N   D    Y  V            +E   
Sbjct: 362  LFVASEFGNHQFYQFEKLGDDDDETEYISDNFPTDPAEPYTPVYFHPRPAENLNLVESID 421

Query: 295  NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSL 353
            ++ P++D  V +L  +   Q+ +  G     + R +++G+ +NE    EL G+   +W+ 
Sbjct: 422  SMNPLMDCKVANLTEEDAPQIYSICGTGARSTFRTLKHGLEVNEIVESELPGVPSAVWTT 481

Query: 354  RSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVT 413
            + +  D +D +++++F + T +L++   + +EE    GF S   TL       + L+QV 
Sbjct: 482  KLTRGDEYDAYIILAFSNGTLVLSIG--ETVEEVTDTGFLSSATTLAVQQLGEDGLIQVH 539

Query: 414  SGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTEV- 471
               +R + +  R   NEW +P   S+  AT NA QV +A   G +VY E+  DG L E  
Sbjct: 540  PKGIRHIRADRR--VNEWAAPQHRSIVAATTNAQQVAVALSSGEIVYFEMDSDGSLAEYD 597

Query: 472  KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIP 530
            +  ++   ++CL +  + E    S   AVG   D +VRI SL PD  L  K        P
Sbjct: 598  EKREMSGTVTCLSLGEVPEGRVRSNFLAVGC-DDSTVRILSLDPDSTLENKSVQALTSAP 656

Query: 531  RSVLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFS 584
             S+ + A    S      YL   L  G  L  +L+  TGEL+D +   LG +P  L   S
Sbjct: 657  SSLSIMAMTDSSSGGSTLYLHIGLYSGVYLRTVLDEITGELSDTRTRFLGPKPAKLFRVS 716

Query: 585  SKNTTHVFAASDRPTVIYSS--NKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELT 642
             +  T V A S RP + YS    K  + + ++   +     F+S   P+ +   +   L 
Sbjct: 717  VQGQTAVLALSSRPWLGYSDPVTKGFMLTPLDYAGLEWGWNFSSEQCPEGMVGIQGQNLR 776

Query: 643  IGTIDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQT 701
            I +I+ + + L   SIPL   PRR     +   F +    N   ++ ++    +LL+D +
Sbjct: 777  IFSIEKLTENLLQHSIPLSYTPRRFVKHPEHPYFYVIQSDNNILSKSTKQ---KLLEDPS 833

Query: 702  FEFISTYPLDTFEYG-----------------------------------CSILSCSF-S 725
             +   +  L   E+G                                    S+ + SF S
Sbjct: 834  LQNGDSAVLPAEEFGYPRGRGHWASCIQIVDPIREKKVLQQIDLEDNEAAVSMATVSFAS 893

Query: 726  DDSNVYYCVGTAY-VLPEENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVYSLNAFNGK 782
             +  V+  VGT   ++      + G I V+   EDGK ++ I + + +    +L  F G+
Sbjct: 894  QEDEVFLVVGTGKDMVASPRSSSGGFIHVYRFHEDGKEIEFIHKTKVEEPPLALLGFQGR 953

Query: 783  LLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIY 842
            LL  I +++++Y   +R    R+ Q+E       L + +QT+G  I+V D+ +SI+ ++Y
Sbjct: 954  LLVGIGRELRIYDLGMR-QLLRKAQTEIAAS---LIVGLQTQGSRIIVSDVQESITFVVY 1009

Query: 843  KHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDEERGRL 899
            K +E  +   A D  A W +   ++D +   G +   NL+ +R   K SE A +E  G  
Sbjct: 1010 KFQENKLIPFADDTIARWTTCTTMVDYETVAGGDKFGNLWLLRCPTKASEEADEEGSGAH 1069

Query: 900  EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGT-----------VNGVIGVIAS 948
             V    H  +++    H   V  +  +    IPT I  T            +G+ G IA 
Sbjct: 1070 LV----HERQYLQGAPH--RVALMAHNFANDIPTSIQKTNLVAGGRDCLLWSGLQGTIAI 1123

Query: 949  L----PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFL 1004
            +      E   F + L+ +LR     + G +H  +RS+      V  K  +DGDL E + 
Sbjct: 1124 MIPFVSREDVDFFQTLEQHLRTEDAPLAGRDHLIYRSY-----YVPVKGVIDGDLCERYT 1178

Query: 1005 DLSRTRMDEISKTMNVSVEELCKRVEEL 1032
             L   +   I+   + SV E+ +++ ++
Sbjct: 1179 LLPTDKKMMIAGEFDRSVREIERKISDM 1206


>gi|449546293|gb|EMD37262.1| hypothetical protein CERSUDRAFT_83017 [Ceriporiopsis subvermispora B]
          Length = 1213

 Score =  229 bits (585), Expect = 4e-57,   Method: Compositional matrix adjust.
 Identities = 270/1178 (22%), Positives = 490/1178 (41%), Gaps = 180/1178 (15%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
            ++G I +L  FR  G  +D+  + ++  +  +L+++ ++S               R    
Sbjct: 58   VFGSIRSLAAFRLTGGTKDYAIVGSDSGRIVILEYEPKTSSFTKVHQETFGKSGARRIVP 117

Query: 66   GQIGIIDPDCRLIGLHLYDG--LFKVIPFDNKGQLK-----EAF--NIRLEELQVLDIKF 116
            GQ    DP  R + +   +   L  ++  D    L      EA   N  +  +  LD+ F
Sbjct: 118  GQYLATDPKGRSVMIAAVEKAKLVYILNRDAAANLTISSPLEAHKNNAIIHHIVGLDVGF 177

Query: 117  LYGCAKPTIVVLYQDNKDARHVKT-------------YEVALKDKDFVEGPWSQNNLDNG 163
                  P    L  DN ++    T             YE+ L     V   WS+   D  
Sbjct: 178  ----DNPMFAALEVDNSESDQDPTGEAFNNAEKMLTYYELDLGLNHVVRK-WSEPT-DPR 231

Query: 164  ADLLIPVP------------PPLCGVLIIGEETIVYCSANAFK---AIPIRPS------- 201
            A+LL+ VP            P   GVL+  E+ I+Y   +A +    IP R         
Sbjct: 232  ANLLVQVPGGQLAASDRYDGP--SGVLVCCEDHIIYRHMDAPQHRVPIPRRRHPLQDADR 289

Query: 202  ---ITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLD 258
               IT A          +LL    G L+ + I HE++ V  LKI+      +AS++  L 
Sbjct: 290  GLIITAAVMHKMKGAFFFLLQSEEGDLYKVTIDHEEQDVKALKIKYFDTVPVASSLCILK 349

Query: 259  NAVVYIGSSYGDSQLIKL------NLQPD-AKGSY------------------------V 287
            +  +++ S +G+  L +       + +P+ +  SY                        +
Sbjct: 350  SGFLFVASEFGNHYLYQFQKLGDDDFEPEFSSASYPSFGMADPSIPLPPVYFRPRALDNL 409

Query: 288  EVLERYVNLGPIVDFCVVDL-ERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG 346
             +++   +L P+VD  V++L       Q+ T  G     S R++R+G+ + E  S +L G
Sbjct: 410  TLVDEMESLSPVVDSKVMNLLPNSDTPQIFTACGRGPRSSFRMLRHGLDVEEVVSSDLPG 469

Query: 347  I-KGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAI 405
            I   +W+ +   DDP+D+++++SF++ T +L +   + +EE +  GF S   TL      
Sbjct: 470  IPNAVWTTKLKEDDPYDSYIILSFVNGTLVLCIG--ETIEEVQDTGFLSSAPTLAVQQIG 527

Query: 406  YNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG- 464
             + L+QV    +R V +  R   NEW+ P G ++  AT N  QV++A     LVY E+  
Sbjct: 528  ADALLQVHPHGIRHVLADKRV--NEWRVPQGKTIVCATTNKRQVVVALSSAELVYFELDL 585

Query: 465  DGILTEVKHAQ-LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEH 523
            DG L E +  + +   +  L I  + E    +   AVG   D +VRI SL   + +    
Sbjct: 586  DGQLNEYQDRKAMGSTVLALSIAEVPEGRQRTPYLAVGC-EDQTVRIISLDPESTLDTIS 644

Query: 524  LGGEIIPRSVLLCA---------FEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLG 574
            L     P S +  A          +  +++   L +G LL  +L+   G+LTD +   LG
Sbjct: 645  LQALTAPPSAICIADMLDAGINKTQQTTFVNIGLQNGVLLRTVLDPINGQLTDTRTRFLG 704

Query: 575  TQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLA 634
            T+PI L     +    + A S RP + Y+    + ++ +  + + +   F++   P+ L 
Sbjct: 705  TRPIKLIRVLVQRNPGILALSSRPWLNYTHQNLMHFTPLIFENLDYAWSFSAELCPEGLI 764

Query: 635  IAKEGELTIGTIDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHF 693
                  L I  I  +  KL   +IPL   PR+      +    +    ++   E++    
Sbjct: 765  GISGSVLRIFHIPKLGTKLKQDAIPLLYTPRKFIPHPTNSLLYMIEGDHRVMGEDAAARK 824

Query: 694  V-------RLLDDQTFEF-----------------------------ISTYPLDTFEYGC 717
            +       R +DD+  +                              ++   LD  E   
Sbjct: 825  LDELRQKGREIDDEVVQLPPELFGRPKAPAGTWASCIRIINPVDAKTVNVIHLDNNEMAF 884

Query: 718  SILSCSF-SDDSNVYYCVGTA---YVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAV 773
            S+    F +    ++  VGTA   ++ P        R   F  +   L+L+ + ET    
Sbjct: 885  SLAVVPFAARGGELHLVVGTAQDTFLTPRSCTSGFLRTYRFSDDGQNLELLHKTETNDVP 944

Query: 774  YSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDL 833
             ++ AF GKL+A + + +++Y  M +    R+++++      I+AL+  T+G  I VGD+
Sbjct: 945  LAVMAFQGKLVAGVGKSLRIYD-MGKKKLLRKVENKT-FSAAIVALH--TQGSHIYVGDM 1000

Query: 834  MKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEG 890
             +S+   +YK  E  +   A D    W++A+ ++D +    A+   N+F  R   K SE 
Sbjct: 1001 QESVFYAVYKAPENRLLVFADDTQPRWITAMTMIDYNTVAAADRFGNVFVNRLDPKVSEQ 1060

Query: 891  ATDEERG---------------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVI 935
              D+  G               +  ++  +H+G+ +      +LV        G    ++
Sbjct: 1061 VDDDPTGAGILHEKSILFGAPHKTSMLAHFHVGDLITSINKVALV-------AGGREVLL 1113

Query: 936  FGTVNGVIGVIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNF 994
            +  ++G IG++   +  E   F+  L+ ++R     + G +H  WR +      V  K  
Sbjct: 1114 YTGLHGTIGILMPFVSKEDVDFISTLEQHMRTEQLSLVGRDHLTWRGY-----YVPVKAV 1168

Query: 995  LDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            +DGDL E+F  L  ++   I+  ++ +V E+ K++E+L
Sbjct: 1169 VDGDLCETFARLPASKQSAIAGELDRTVGEVLKKLEQL 1206


>gi|168045572|ref|XP_001775251.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162673464|gb|EDQ59987.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 1201

 Score =  229 bits (585), Expect = 5e-57,   Method: Compositional matrix adjust.
 Identities = 248/1039 (23%), Positives = 449/1039 (43%), Gaps = 168/1039 (16%)

Query: 132  NKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVP-----PPLCGVLIIGEETIV 186
            N+  +H+  YE+ L     V   W++  +DNGA++L+ VP     P   GVL+  E  ++
Sbjct: 188  NEAQKHLTFYELDLGLNHVVR-KWTEP-IDNGANMLVTVPGGGDGP--SGVLVCAENFVI 243

Query: 187  YCSAN---AFKAIPIRPSITKAYGRVDADGSR--------YLLGDHAGLLHLLVITHEKE 235
            Y + N       IP R  +    G +    +         +LL    G +  + + ++K+
Sbjct: 244  YKNQNHPDIRAVIPRRQDLPPNRGVLIVSAATHKQKALFFFLLQTEYGDIFKVTLDYDKD 303

Query: 236  -KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPD------AKGSYVE 288
             +VT LK++      + S +  L +  ++  S +G+  L +     D      +  + VE
Sbjct: 304  DQVTELKVKYFDTIPVTSAMCVLKSGFLFAASEFGNHSLYQFQSIGDDPEVESSSSTLVE 363

Query: 289  VLERYV-------------------NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRI 329
              E Y                    +L PI+D  V ++  +   Q+ +  G     SLRI
Sbjct: 364  TEEGYQPVFFQPRKLKNLVQIDDIESLMPIMDMKVANIFEEETPQIFSLCGRGPRSSLRI 423

Query: 330  VRNGIGINEQASVELQGI-KGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETE 388
            +R G+ + E A   L G+   +W+++   +D FD ++VVSF++ T +L++   + +EE  
Sbjct: 424  LRPGLAVTEMAVSPLPGVPSAVWTVKKHANDEFDAYIVVSFVNATLVLSIG--ETVEEVS 481

Query: 389  IEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQ 448
              GF   T +L       + L+QV    +R + S  R   NEWK+P   ++     N  Q
Sbjct: 482  DSGFLDTTPSLAISLLGDDSLMQVHPSGIRHIRSDGR--INEWKTPGKKTIVKVGYNRMQ 539

Query: 449  VLLATGGGHLVYLEIG-DGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDIS 507
            V++A  GG L+Y E+   G L E++   +  +++CLDI P+ E    S+  AVG + D +
Sbjct: 540  VVIALSGGELIYFEMDMSGQLMEIEKRDMTGDVACLDIAPVPEGRQRSRFLAVGSY-DST 598

Query: 508  VRIFSL-PD--LNLITKEHLGGEIIPRSVLLCAFEGIS------------YLLCALGDGH 552
            +RI SL PD  + +++ + +     P S+LL   +  +            +L   L +G 
Sbjct: 599  IRILSLDPDDCMQILSVQAVSSP--PESLLLLEVQASTGGEDGADHPASVFLNAGLQNGV 656

Query: 553  LLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSN 612
            LL   ++M TG+L+D +   LG +   L +   +    +   S RP + Y      L + 
Sbjct: 657  LLRTEVDMVTGQLSDTRTRFLGLRAPKLFSALVRGRRAMLCLSSRPWLGYIHQGHFLLTP 716

Query: 613  VNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDI-QKLHIRSIPLGEHPRR-ICHQE 670
            ++ + + +   F+S    + +       L + TI+ + +  +   +PL   PR+ I H +
Sbjct: 717  LSYETLEYAASFSSDQCAEGVVAVAGDALRVFTIERLGETFNQTVVPLRYTPRKFILHPK 776

Query: 671  QSRTFAICSLKNQSCAEESEMH-------------------------------------- 692
            Q     + S +     EE E +                                      
Sbjct: 777  QKTLIILESDEGAFSTEEREANKKEAVEATGGHENGKADGEDEEMADEESEDPLPDEQYG 836

Query: 693  -----------FVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNV--YYCVGTAYV 739
                        +R+LD +T        L   E   SI + +F D+  +     VGTA  
Sbjct: 837  YPKAEANKWVSCIRVLDPKTSTTTCLLELQENEAAFSICTVNFHDNKELGTLIAVGTAKD 896

Query: 740  LP--EENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYK 795
            L      E + G I ++   E+G+ L+L+ +    G   +L  F G+LL  + Q +++Y 
Sbjct: 897  LQFMPRKEASGGFIHIYRFAEEGRVLELVHKTPVDGVPTALCQFQGRLLVGVGQVLRIY- 955

Query: 796  WMLRDDGTRELQSECGHHGH---ILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEER 852
                D G R+L  +C +      I+A++  T GD I VGD+ +S   + Y+ +E  +   
Sbjct: 956  ----DLGKRKLLRKCENKNFPNTIIAIH--TYGDRIYVGDIQESFHYVKYRRDENQLYTF 1009

Query: 853  ARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSE-----------GATDEERGRL-- 899
            A D    W++A   +D D   GA+   N++ +R   +           G    E+GRL  
Sbjct: 1010 ADDSCPRWLTASLHIDFDTMAGADKFGNVYVMRLPQDVSEEIEDDPTGGKIKWEQGRLNG 1069

Query: 900  -----EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQ 953
                 + + ++H+GE V   +  SL+        G   ++++GTV G +G +      E 
Sbjct: 1070 APNKVDEIIQFHVGEVVTSLQKASLI-------PGGGESMLYGTVMGSMGALLPFSSRED 1122

Query: 954  YLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDE 1013
              F   L+ +LR+    + G +H  +RS          K+ +DGDL E +  L+     +
Sbjct: 1123 VDFFSHLEMHLRQENPPLCGRDHMAFRS-----AYFPVKDVIDGDLCEQYSMLTSELQKK 1177

Query: 1014 ISKTMNVSVEELCKRVEEL 1032
            I+  ++ +  E+ K++E++
Sbjct: 1178 IADDLDRTPGEIVKKLEDI 1196


>gi|170083859|ref|XP_001873153.1| predicted protein [Laccaria bicolor S238N-H82]
 gi|164650705|gb|EDR14945.1| predicted protein [Laccaria bicolor S238N-H82]
          Length = 1213

 Score =  228 bits (582), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 268/1174 (22%), Positives = 476/1174 (40%), Gaps = 172/1174 (14%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
            ++G I +L  FR  G  +D+  + ++  +  +L +D ++S  +            R    
Sbjct: 58   VFGSIRSLAAFRLTGGTKDYAIVGSDSGRIIILDYDPKTSSFVKLHQETYGKSGARRIVP 117

Query: 66   GQIGIIDPDCRLIGLHLYDG--LFKVIPFDNKGQLKEAFNIR-------LEELQVLDIKF 116
            GQ    DP  R + +   +   L  ++  D    L  +  +        +  +  LD+ F
Sbjct: 118  GQYLATDPKGRSVMISAMEKAKLVYILNRDAAANLTISSPLEAHKNSAIIHHIVGLDVGF 177

Query: 117  ---LYGCAKPTIVVLYQDN-----KDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLI 168
               ++   +       QD      K+A  + TY       + V   WS+   D  A+LL+
Sbjct: 178  ENPMFAALEVDYAESDQDPTGDAFKNAEKMLTYYELDLGLNHVVRKWSEPT-DPRANLLV 236

Query: 169  PVP------------PPLCGVLIIGEETIVYCSANAFK---AIPIRPS----------IT 203
             VP            P   GVL+  E  I+Y  A+  +    IP R            IT
Sbjct: 237  QVPGGQLASSERFDGP--SGVLVCCENHIIYRHADVPQHRVPIPRRRHPLEDLNRGLIIT 294

Query: 204  KAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVY 263
             A          +LL    G L+ + I HE E+V  LKI+      +AS +  L +  ++
Sbjct: 295  AAVMHKMKGAFFFLLQSEDGDLYKVTIEHEDEEVKSLKIKYFDTVPVASNLCILKSGFLF 354

Query: 264  IGSSYGDSQLIKLNLQPD-------AKGSY---------VEVLERYVNLGPIVDFCVVD- 306
            + S +G+  L +     D       +  SY         V +   +  L P+ +  V D 
Sbjct: 355  VASEFGNHYLYQFQKLGDDDDEPEFSSTSYPSFGMADPSVPLPHAFFRLHPLDNLVVADE 414

Query: 307  ---------------LERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGM 350
                           L      Q+    G     + R +R+G+ + E  S +L GI   +
Sbjct: 415  LESLDPILDSKVMNLLPNSDTPQIFAACGRGARSTFRTLRHGLEVEESVSSDLPGIPNAV 474

Query: 351  WSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLV 410
            W+ + + DDPFD+++++SF++ T +L++   + +EE +  GF S   TL       + L+
Sbjct: 475  WTTKRTEDDPFDSYIILSFVNGTLVLSIG--ETIEEVQDTGFLSSAPTLAVQQIGADALL 532

Query: 411  QVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILT 469
            QV    +R V +  R   NEW+ P G ++  AT N  QV++A     LVY E+  DG L 
Sbjct: 533  QVHPHGIRHVLADRRV--NEWRVPQGKTIVSATTNKRQVVVALSSAELVYFELDLDGQLN 590

Query: 470  EVKHAQ-LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEI 528
            E +  + +   +  L I  + E    +   AVG   D +VRI SL     +    L    
Sbjct: 591  EYQDRKAMGSTVLALSIGDVPEGRQRTPYLAVGC-EDQTVRIISLDPETTLETISLQALT 649

Query: 529  IPRSVLLCA---------FEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPIT 579
             P S +  A          +   ++   L +G LL  +L+   G+LTD +   LGT+PI 
Sbjct: 650  APPSAICIADMLDASINKSQPTMFVNIGLQNGVLLRTVLDPINGQLTDTRTRFLGTRPIK 709

Query: 580  LRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEG 639
            L     +    + A S R  + Y+    + ++ +    + +   F++   P+ L      
Sbjct: 710  LVRVQIQRNPGILALSSRSWLNYTHQNLMHFTPLIYDNLDYAWSFSAELSPEGLIGIAGS 769

Query: 640  ELTIGTIDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEES--------- 689
             L I  +  +  KL   SIPL   PR+      +  F +    +++  EE+         
Sbjct: 770  VLRIFQVPKLGTKLKQDSIPLSYTPRKFITHPGNNYFYLIEGDHRTLGEEAASTKLDEFR 829

Query: 690  ---------EMHF------------------VRLLDDQTFEFISTYPLDTFEYGCSILSC 722
                     E++                   +R++D    + ++  P+D+ E   S+   
Sbjct: 830  RQGKSFDDGEINIPPEVFGRPKAPAGTWASCIRIIDPVEAKTVAVIPMDSNEAAFSLAIV 889

Query: 723  SFS-DDSNVYYCVGTA---YVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNA 778
             FS  +  ++  VGTA    V P        R   F  E   L+   + E      +L A
Sbjct: 890  PFSARNGELHLVVGTAADTIVSPRSCTSGYLRTYKFTNEGAGLEFQHKTEVDDVPLALLA 949

Query: 779  FNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY-VQTRGDFIVVGDLMKSI 837
            F G+L+A + + +++Y     D G ++L  +  +     A+  + T+G  I+VGD+ +SI
Sbjct: 950  FQGRLVAGVGKALRIY-----DIGKKKLLRKVENKTFSSAIVTLSTQGSRIIVGDMQESI 1004

Query: 838  SLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDE 894
               +YK  E  +   A D    W++A  ++D +  +  +   N+F  R   K SE   D+
Sbjct: 1005 QFAVYKPPENRLLVFADDSQPRWITASSMVDYNTIVAGDRFGNIFVNRLDPKVSEQVDDD 1064

Query: 895  ERG---------------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTV 939
              G               + +++  +HLG+ V      SLV        G    +++  +
Sbjct: 1065 PTGAGILHEKGILMGAPHKTKMLAHFHLGDLVTSIHKVSLV-------AGGREVLMYTGL 1117

Query: 940  NGVIGVIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGD 998
            +G IG++   +  E   F+  L+ ++R     + G +   WR +    K V     +DGD
Sbjct: 1118 HGTIGILVPFVSKEDVDFISTLEQHMRTEQSSLVGRDQLSWRGYYAPVKAV-----VDGD 1172

Query: 999  LIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            L E++  L   +   I+  ++ +V E+ K++E+L
Sbjct: 1173 LCETYARLPAAKQSSIAGELDRTVGEVLKKLEQL 1206


>gi|347829304|emb|CCD45001.1| similar to pre-mRNA-splicing factor rse1 [Botryotinia fuckeliana]
          Length = 1212

 Score =  228 bits (580), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 275/1162 (23%), Positives = 491/1162 (42%), Gaps = 154/1162 (13%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
            ++G I  +  FR  G  +D++ I ++  +  ++++    ++     +        R    
Sbjct: 64   VFGIIRAIAAFRLAGSNKDYIIITSDSGRITIVEFVPAQNKFNRLHLETFGKSGVRRVVP 123

Query: 66   GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YGC 120
            GQ   +DP  R  L      + L  V+  +++ +L  +  +   + Q L    +    G 
Sbjct: 124  GQYLAVDPKGRACLTASVEKNKLVYVLNRNSQAELTISSPLEAHKAQTLVFALVALDVGY 183

Query: 121  AKPTIVVLYQDNKDARHVKT-------------YEVALKDKDFVEGPWSQNNLDNGADLL 167
            A P    L  D  ++    T             YE+ L     V   WS+  +D  A++L
Sbjct: 184  ANPVFAALEIDYGESDQDPTGQAYDEIEKQLVYYELDLGLNHVVRK-WSEP-VDRTANIL 241

Query: 168  IPVPPPL---CGVLIIGEETIVYCSAN--AFK-AIPIRPSITKAYGRVD--ADGSRYLLG 219
              VP       GVL+ GE+ I Y  +N  AF+ AIP R   T+   R      G  + L 
Sbjct: 242  FQVPGGTDGPSGVLVCGEDNITYRHSNQEAFRVAIPRRRGATEDPQRKRNIVAGVMHKLK 301

Query: 220  DHAGLLHLLVITHEKE------------------KVTGLKIELLGETSIASTISYLDNAV 261
              AG    L+ T + +                  +V  LKI+      +A+++  L +  
Sbjct: 302  GAAGAFFFLLQTDDGDLFKITIEMVEDDNGQPTGEVRRLKIKYFDTVPVATSLCILKSGF 361

Query: 262  VYIGSSYGDSQLIKL---------------NLQPDAKGSYVEV------------LERYV 294
            +++ S +G+ Q  +                +    A  SY  +            +E   
Sbjct: 362  LFVASEFGNHQFYQFEKLGDDDEETEFVSDDFPTGAHESYTPIYFHPRPAENLSLVESID 421

Query: 295  NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSL 353
            ++ P++D  V +L  +   Q+ +  G     + R +++G+ ++E    EL G+   +W+ 
Sbjct: 422  SMNPLMDCKVANLTDEDAPQIYSICGTGARSTFRTLKHGLEVSEIVESELPGVPSAVWTT 481

Query: 354  RSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVT 413
            + +  D +D ++++SF + T +L++   + +EE    GF S   TL       + L+QV 
Sbjct: 482  KLTRGDTYDAYIILSFSNGTLVLSIG--ETVEEVTDTGFLSSAPTLAVQQLGEDSLIQVH 539

Query: 414  SGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTEV- 471
               +R + +  R   NEW +P   S+  AT N  QV +A   G +VY E+  DG L E  
Sbjct: 540  PKGIRHIRADHR--VNEWAAPQHRSIVAATTNERQVAVALSSGEIVYFEMDSDGSLAEYD 597

Query: 472  KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIP 530
            +  ++   ++CL +  + E    SQ  AVG   D +VRI SL PD  L  K        P
Sbjct: 598  EKKEMSGTVTCLSLGEVPEGRQRSQFLAVGC-DDSTVRILSLDPDSTLENKSVQALTSPP 656

Query: 531  RSVLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFS 584
             ++ + A    S      YL   L  G  L  +L+  TGEL+D +   LG +P+ L   +
Sbjct: 657  NALSIMAMSDSSSGGSTLYLHIGLYSGVYLRTVLDEVTGELSDTRTRFLGPKPVKLFRVA 716

Query: 585  SKNTTHVFAASDRPTVIYSS--NKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELT 642
             +  T V A S RP + YS    K  + + ++   +     F+S    + +   +   L 
Sbjct: 717  VQGQTAVLALSSRPWLGYSDPVTKGFMLTPLDYPALEWGWNFSSEQCTEGMVGIQGQNLR 776

Query: 643  IGTIDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQT 701
            I +I+ +   L   SIPL   PRR     +   F +    N   +  ++    +LL+D +
Sbjct: 777  IFSIEKLTDNLLQESIPLTYTPRRFVRHPEHACFYVIEADNNILSPATKQ---KLLEDPS 833

Query: 702  F-----------EF------------------------ISTYPLDTFEYGCSILSCSF-S 725
                        EF                        +ST  L+  E   SI   +F S
Sbjct: 834  VTNGDATVLPPEEFGYPRGTNHWASCISVVDPVTEKKVLSTIHLEDNECAVSIAVVAFAS 893

Query: 726  DDSNVYYCVGTAY-VLPEENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVYSLNAFNGK 782
             +   + CVGT   ++      + G I V+   EDGK L+ I + + +    +L AF G+
Sbjct: 894  QEDETFLCVGTGKDMVVSPRSFSAGFIHVYRFHEDGKELEFIHKTKVEEPPMALLAFQGR 953

Query: 783  LLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIY 842
            LLA + + +++Y   +R    R+ QSE   +   + + +QT+G  I+V D+ +SI++++Y
Sbjct: 954  LLAGVGKDLRIYDLGMR-QLLRKAQSEVAPN---MIVGLQTQGSRIIVSDVQESITMVVY 1009

Query: 843  KHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDEERG-- 897
            K +E  +     D  A W S   ++D +   G +   NL+ +R   K SE A +E  G  
Sbjct: 1010 KFQENRLIPFVDDTIARWTSCTTMVDYETVAGGDKFGNLWLLRCPAKASEEADEEGSGAH 1069

Query: 898  ----RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV--GQIPTVIFGTVNGVIGV-IASLP 950
                R  + G  H    +       + M +  +++  G    +++  + G +G+ I  + 
Sbjct: 1070 LLHERQYLAGAPHRLTLMAHNFSQDIPMSIQKTNLVAGGRDCLLWSGLQGTLGILIPFVS 1129

Query: 951  HEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTR 1010
             E   F + L+ +LR     + G +H  +RS+      V  K  +DGDL E +  L   +
Sbjct: 1130 REDVDFFQTLEQHLRSEDPPLAGRDHLIYRSY-----YVPVKGVIDGDLCERYTLLPTDK 1184

Query: 1011 MDEISKTMNVSVEELCKRVEEL 1032
               I+  ++ SV E+ +++ ++
Sbjct: 1185 KQMIAGELDRSVREIERKISDI 1206


>gi|302680006|ref|XP_003029685.1| hypothetical protein SCHCODRAFT_58785 [Schizophyllum commune H4-8]
 gi|300103375|gb|EFI94782.1| hypothetical protein SCHCODRAFT_58785 [Schizophyllum commune H4-8]
          Length = 1213

 Score =  227 bits (579), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 273/1175 (23%), Positives = 471/1175 (40%), Gaps = 174/1175 (14%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
            ++G I ++  FR  G  +D L + ++  +  +L +D +S+  +            R    
Sbjct: 58   VFGSIRSIAAFRLTGGTKDHLILGSDSGRIVILDYDPKSTSFVKLHQETYGKSGARRIVP 117

Query: 66   GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YGC 120
            GQ    DP  R  +IG      L  ++  D    L  +  +   +   +    +    G 
Sbjct: 118  GQYLATDPKGRAVMIGAMEKSKLVYILNRDAAANLTISSPLEAHKNSAIIHHIVGVDVGF 177

Query: 121  AKPTIVVLYQD-------------NKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLL 167
              P    L  D             N  A+ +  YE+ L     V   WS+   D  A+LL
Sbjct: 178  ENPIFAALEVDYSESDQDPTGEAFNNAAKMLTYYELDLGLNHVVRK-WSEPT-DPRANLL 235

Query: 168  IPVP------------PPLCGVLIIGEETIVYCSANAFK---AIPIRPS----------I 202
            + VP            P   GVLI  E+ I+Y   +A +    IP R            I
Sbjct: 236  VQVPGGQLASSDRFDGP--SGVLICCEDHIIYRHMDAPQHRVPIPRRKHPLEDPERGIII 293

Query: 203  TKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVV 262
              A          +LL    G L  + I HE E V  +KI+      +AS +  L +  +
Sbjct: 294  VAAVMHKMKGAFFFLLQSEDGDLFKVTIEHEDEDVKEVKIKYFDTVPVASALCILKSGFL 353

Query: 263  YIGSSYGDSQLIKLNLQPD-------AKGSYVE------------------------VLE 291
            ++ S +G+  L +     D       +  SY +                        + +
Sbjct: 354  FVASEFGNHYLYQFQKLGDDDDEPEFSSSSYPQFGMADSSMPLPHVHFKPHPLDNLALAD 413

Query: 292  RYVNLGPIVDFCVVDL-ERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KG 349
               +L PI+D  V++L       Q+    G     SLR +R+G+ + E  S +L GI   
Sbjct: 414  EVESLDPIIDSKVLNLMPNSDTPQIFAACGRGPRSSLRTLRHGLEVEESVSSDLPGIPNA 473

Query: 350  MWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQL 409
            +W+ +   DD FD+++++SF++ T +L++   + +EE +  GF S   TL       + L
Sbjct: 474  VWTTKKKEDDAFDSYIILSFVNGTLVLSIG--ETIEEVQDTGFLSSAPTLAVQQIGADAL 531

Query: 410  VQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGIL 468
            +QV    +R V S  R   NEW+ P G S+  AT N  QV++A     LVY E+  DG L
Sbjct: 532  LQVHPQGIRHVLSDRRV--NEWRVPQGKSIVQATTNKRQVVVALSSAELVYFELDLDGQL 589

Query: 469  TEVKHAQ-LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGG 526
             E +  + +   +  L I  + E    +   AVG   D +VRI SL P+  L T      
Sbjct: 590  NEYQDRKAMGSTVLALSIGEVPEGRQRTPFLAVGC-EDQTVRIISLDPESTLDTISLQAL 648

Query: 527  EIIPRSVLLCAF--------EGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPI 578
               P S+ +               ++   L +G LL  +L+  TG+LTD +   LGT+P+
Sbjct: 649  TAPPSSICIAEMLDAAINKTHPTMFVNIGLQNGVLLRTVLDPMTGQLTDTRTRFLGTRPV 708

Query: 579  TLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKE 638
             L   + +    + A S R  + Y+    + ++ +  + +     F++   P+ L     
Sbjct: 709  KLVRVAIQRNPAIMALSSRSWLNYTHQNMVHFTPLIFENLDFAWSFSAELSPEGLIGITG 768

Query: 639  GELTIGTIDDIQ-KLHIRSIPLGEHPR------------------RICHQEQSRTFAICS 679
              L I  I  +  KL   S+PL   PR                  R+   E +R   +  
Sbjct: 769  SVLRIFQIPKLGVKLKQDSLPLSYTPRKFITHPNNHYFYMIESDHRVYGDETAREKILDQ 828

Query: 680  LKNQSCAEESEMHF------------------VRLLDDQTFEFISTYPLDTFEYGCSILS 721
            +      +E  ++                   +R++D      ++  PLD  E   SI  
Sbjct: 829  MSRGKAVDEDVVNLPVTEFGRVKAPAGTWGSCIRIIDPTQNSTVAVIPLDNNEAAFSIAV 888

Query: 722  CSFS-DDSNVYYCVGTA---YVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLN 777
              FS  +  ++  VGTA    V P        R   F  +   L+L  + ET     +L 
Sbjct: 889  VPFSARNGELFLVVGTAANTRVSPRTCSSGYLRTYQFTNDGAGLELHHKTETDDVPLALL 948

Query: 778  AFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGH-ILALYVQTRGDFIVVGDLMKS 836
            AF G+L A + + +++Y     D G ++L  +  + G     + + T+G  I+ GD+ +S
Sbjct: 949  AFQGRLAAGVGKALRIY-----DIGKKKLLRKAENKGFGTTIVTLNTQGSRIIAGDMQES 1003

Query: 837  ISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATD 893
            +   +YK  E  +   A D    W+SA  ++D       +   N+F  R   K SE   D
Sbjct: 1004 LFYAVYKAPENRLLVFADDSQPRWISAATMVDYYTVAAGDRFGNVFVNRLDYKVSEQVDD 1063

Query: 894  EERG---------------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGT 938
            +  G               + +++  +H+G+ +      +LV        G    +++  
Sbjct: 1064 DPTGAGILHEKGILMGAPHKTKLLCHFHVGDLITSIHKVALV-------AGGREVLLYTG 1116

Query: 939  VNGVIGVIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDG 997
            ++G IG++   +  E   F+  L+ ++R     + G +H  WR +      V  K  +DG
Sbjct: 1117 LHGTIGMLVPFVSKEDVDFISTLEQHMRSEQSSLVGRDHLSWRGY-----YVPVKAVVDG 1171

Query: 998  DLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            DL E+F  L  ++   I+  ++ +V E+ K+++ L
Sbjct: 1172 DLCETFAKLPASKQSAIANELDRTVGEVLKKLDSL 1206


>gi|302916981|ref|XP_003052301.1| predicted protein [Nectria haematococca mpVI 77-13-4]
 gi|256733240|gb|EEU46588.1| predicted protein [Nectria haematococca mpVI 77-13-4]
          Length = 1212

 Score =  226 bits (577), Expect = 4e-56,   Method: Compositional matrix adjust.
 Identities = 282/1174 (24%), Positives = 489/1174 (41%), Gaps = 168/1174 (14%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            +L   I+G I +L  FR  G  +D+L +AT+  +  ++++    +      +        
Sbjct: 59   LLSHDIFGIIRSLAAFRLAGSNKDYLILATDSGRITIIEYLPAQNRFQRLHLETFGKSGV 118

Query: 61   RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL- 117
            R    G+    DP  R  LI     + L  V+  +++ +L  +  +   +  VL I  + 
Sbjct: 119  RRVIPGEYLACDPKGRACLIASTEKNKLVYVLNRNSQAELTISSPLEAHKPGVLVISMVA 178

Query: 118  --YGCAKPTIVVLYQDNKDARHVKT-------------YEVALKDKDFVEGPWSQNNLDN 162
               G + P    L  D  +     +             YE+ L     V   WS + +D 
Sbjct: 179  LDVGYSNPVFAALEIDYSEVDQDSSGQAMEDLETQLVYYELDLGLNHVVRR-WS-DPVDP 236

Query: 163  GADLLIPVP-----PPLCGVLIIGEETIVYCSAN--AFK-AIPIRPSITK--AYGRVDAD 212
             A LL  VP     P   GVL+ GEE I Y  +N  AF+ AIP R   T+     R    
Sbjct: 237  TASLLFQVPGGNDGP--SGVLVCGEENITYRHSNQEAFRVAIPRRRGATEDPNRKRTIVS 294

Query: 213  GSRYLLGDHAGLLHLLVITHEKE------------------KVTGLKIELLGETSIASTI 254
            G  + L   AG    L+ T + +                  +V  LKI+      +AS++
Sbjct: 295  GIMHKLKGSAGAFFFLLQTEDGDLFKLSIDMVEDEDGNPTGEVKRLKIKYFDTVPVASSL 354

Query: 255  SYLDNAVVYIGSSYGDS---------------QLIKLNLQPDAKGSY------------V 287
              L +  +Y+ S +G+                + I  +   D + SY            +
Sbjct: 355  CILKSGFLYVASQFGNYSFYQFEKLGDDDEELEFISDDFPADPRASYDPVYFHPRPTENL 414

Query: 288  EVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI 347
             ++E    + P++D  V +L  +   Q+ T  G     + R++++G+ +NE  + EL GI
Sbjct: 415  ALVESIPAMNPLLDCKVANLTGEDAPQIYTICGNGARSTFRMLKHGLEVNEIVASELPGI 474

Query: 348  -KGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIY 406
               +W+L+ +  + +D ++V+SF + T +L++   + +EE    GF +   TL       
Sbjct: 475  PSAVWTLKLNRSEQYDAYIVLSFTNGTLVLSIG--ETVEEVSDSGFLTSVPTLAAQLLGE 532

Query: 407  NQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GD 465
            + L+QV    +R V +      NEW +P   S+  A+ANA QV +A   G +VY E+  D
Sbjct: 533  DGLIQVHPKGIRHVRNGHI---NEWAAPQHRSIVAASANAHQVAIALSSGEIVYFEMDAD 589

Query: 466  GILTEVKHAQLEY-EISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEH 523
            G L E    +  +  ++CL +  + E    S   AVG   D +VRI SL PD  L  K  
Sbjct: 590  GSLAEYDEKKEMFGTVTCLSLGDVPEGRLRSSFLAVGC-DDCTVRILSLDPDSTLENKSV 648

Query: 524  LGGEIIPRSVLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQP 577
                  P S+ + A E  S      YL   L  G  L  +L+  TGELTD ++  LG + 
Sbjct: 649  QALTAAPTSLAIIAMEDSSSGGSTLYLHIGLHSGVYLRTVLDEVTGELTDTRQKFLGPKE 708

Query: 578  ITLRTFSSKNTTHVFAASDRPTVIYSS--NKKLLYSNVNLKEVSHMCPFNSAAFPDSLAI 635
            + L   + +  T V   S RP + Y     K  + + +N  ++     F+S    + +  
Sbjct: 709  VRLFQVTVQGKTCVLGLSSRPWLGYPDPITKGFVVTPLNYVDLEWGWNFSSEQCEEGIVG 768

Query: 636  AKEGELTIGTIDDIQKLHI-RSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFV 694
             +   L I  ID +    I +SIPL   P+++    +   F      N +   E     +
Sbjct: 769  IQGQSLRIFNIDRLGDTLIQKSIPLTYTPKKLVKHPEQPLFYTIEADNNTLPPELRAQLL 828

Query: 695  ----------RLLDDQTF-----------------------EFISTYPLDTFEYGCSILS 721
                      ++L  + F                       + + T   +  E   S   
Sbjct: 829  ADPNVVNGDSQVLPPEDFGYPRANRRWASCINVVDPLSEEGQVLQTVHFENNEAAVSATV 888

Query: 722  CSF-SDDSNVYYCVGTAY-VLPEENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVYSLN 777
             SF S D+  +  VGT   ++      ++G + ++  VEDG+ L+ I + + +    +L 
Sbjct: 889  VSFASQDNENFLVVGTGKDMIVNPQSYSEGYLYIYRFVEDGRELEFIHKTKIEEPPLALL 948

Query: 778  AFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSI 837
             F GK+  A+  ++++Y   +R    R+ Q+E      I++L   T+G  IVVGD+ + +
Sbjct: 949  PFQGKVAVAVGTQLRIYDLGMR-QMLRKAQAEVSAQ-RIVSL--NTQGSRIVVGDVQQGV 1004

Query: 838  SLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDE 894
            +L++YK     +   A D  A W +   ++D +   G +   N+F VR   K SE A +E
Sbjct: 1005 TLVVYKSATNKLIPFADDTVARWTTCTTMVDYESIAGGDKFGNMFIVRCPEKASEEADEE 1064

Query: 895  ERG---------------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTV 939
            + G               RL ++  ++  +        SLV       VG    +++  +
Sbjct: 1065 QSGLHLINARDYLHGTPHRLGLMCHFYTQDVPTSITKTSLV-------VGGQEILLWSGI 1117

Query: 940  NGVIGV-IASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGD 998
             G IGV I  +  E   F + L+ +LR     + G +H  +R +         K  +DGD
Sbjct: 1118 MGTIGVFIPFVSREDADFFQNLEQHLRTEDPPLAGRDHLMYRGY-----YAPVKGVIDGD 1172

Query: 999  LIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            L E +  L   +   I+  ++ SV E+ +++ ++
Sbjct: 1173 LCERYNLLPNDKKQMIAGELDRSVREIERKISDI 1206


>gi|240275059|gb|EER38574.1| DNA damage-binding protein 1a [Ajellomyces capsulatus H143]
          Length = 1134

 Score =  226 bits (576), Expect = 5e-56,   Method: Compositional matrix adjust.
 Identities = 163/556 (29%), Positives = 275/556 (49%), Gaps = 29/556 (5%)

Query: 508  VRIFSLPDLNLITKEHLG--GEIIPRSVLLCAF--EGISYLLCALGDGHLLNFLLNMKTG 563
            V + SLPDL +     LG  G+ +PRSVL+        + L  ++ DG + +F  N +  
Sbjct: 581  VVVMSLPDLAIRRSTSLGEPGDAVPRSVLVAEVLPNNPATLFVSMADGSVFSFSFNSEDF 640

Query: 564  ELTDRKKVSLGT-QPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMC 622
             LT   K++LG+ QP   +        +VFA  ++P++IY+   +++YS VN  + S +C
Sbjct: 641  SLTSMSKLTLGSEQPSFKKLPRGDGLYNVFATCEQPSLIYAVEGRIVYSAVNSDQASRIC 700

Query: 623  PFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKN 682
             FNS A+P S+A+A   EL I  +D  +   I+++ +GE  RR+ +    R F I ++K 
Sbjct: 701  HFNSEAYPGSIALATPSELKITLVDAERTTQIQTLEVGETVRRVAYSAPERAFGIGTIKR 760

Query: 683  --QSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSF-----SDDSNVY---Y 732
              +  AE     F+ L D+  F  +  Y L+  E   S++   F     S+ ++++   +
Sbjct: 761  TLEDGAEVIASRFM-LADEIMFRELDIYDLNKDELVESVIRAQFPDGIGSEGNDLFKDLF 819

Query: 733  CVGTAYVLPEENEPTKGRILVF-IVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKI 791
             VGT+Y+        +GRIL F +  + +L  +AE   KGA  +L     K++AA+ + +
Sbjct: 820  VVGTSYLDDFGEGSIRGRILAFEVTANRQLAKVAEMPVKGACRALAIVQDKIVAALMKTV 879

Query: 792  QLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEG---- 847
             +Y           L     +      + +   G+ I V DLMKS+S++ Y+        
Sbjct: 880  VVYTLSKGQFADYTLSKTASYRTSTAPVDIAVTGNLIAVADLMKSVSIVEYQQGANGLPD 939

Query: 848  AIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHL 907
            ++ E AR +   W +AV  + +D +L ++   NL  + +N  G TD++R RLEV  E  L
Sbjct: 940  SLTEVARHFQTLWSTAVAPVAEDTWLESDAEGNLVMLHRNVNGVTDDDRRRLEVTSEILL 999

Query: 908  GEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKV 967
            GE VNR R  + +     ++    P    GTV G I +   +       L +LQ+ +  +
Sbjct: 1000 GEMVNRIRPVN-IQGSQGAEAAISPRAFLGTVEGSIYLFGIINPTYQDLLMRLQSAMAGM 1058

Query: 968  IKGVGGLNHEQWRSFNNE-KKTVDAKNFLDGDLIESFLDLSRTRMDEI------SKTMNV 1020
            +   GG+   ++R+F N  ++T +   F+DG+LIE FL+      +EI           V
Sbjct: 1059 VVTPGGMPFNKFRAFRNTIRQTEEPYRFVDGELIERFLNCGVELQEEIVGKVIADGVAGV 1118

Query: 1021 SVEELCKRVEELTRLH 1036
            +VE +   VEEL R+H
Sbjct: 1119 TVESVKGLVEELRRMH 1134



 Score =  182 bits (462), Expect = 8e-43,   Method: Compositional matrix adjust.
 Identities = 140/456 (30%), Positives = 218/456 (47%), Gaps = 99/456 (21%)

Query: 16  FRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPTDNGQIGIIDPD 74
           FRP     D LFI T+R  +  + WD ++  L T R   D++D   R +  G   +IDP 
Sbjct: 92  FRPQ---TDILFIGTDRASYFTVSWDQKTKNLRTERKYVDLADPSARASQLGDRCLIDPS 148

Query: 75  CRLIGLHLYDGLFKVIP----FDNKG--------------------------------QL 98
            R + L LY+G+  VIP      ++G                                +L
Sbjct: 149 GRFLTLELYEGIVTVIPIGQPLRSRGSGRRLGKRAANASASAAYANQDAAAGAAAADLEL 208

Query: 99  KEAFNIRLEELQVLDIKFLYGCAK--PTIVVLYQDNKDARHVKTYEV------------- 143
            E    R+EEL V    FL+  A   P + +LY+D      +K  E+             
Sbjct: 209 GEPCQARVEELLVRSSTFLHTQADTLPRMALLYEDTMGKVRLKVRELEVMHGGMGMGGGT 268

Query: 144 --------------ALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYC- 188
                          LK  D +     +  L+ GA  L+PVP PL G+L++GE +I Y  
Sbjct: 269 GSGGGIGADGGIVAVLKGLDLL-----KEELEMGASFLVPVPAPLGGLLVLGETSIRYLD 323

Query: 189 -SANAFKAIPIR-PSITKAYGRVDADGSRYLLGDHAG-LLHLLVITHEKEKVTGLKIELL 245
            ++N   + P++  +I  A+ +VD  G R+LL D  G L  L+++      V   K++LL
Sbjct: 324 DASNECISQPLKEATIFVAWEQVD--GQRWLLADDYGRLFFLMLVLDTDNAVQSWKLDLL 381

Query: 246 GETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVV 305
           G+   AS + Y+   + +IGS  GDS+LI++      +GS+ EV++ + N+ PI+DF ++
Sbjct: 382 GDIPRASVLVYMGGGITFIGSHQGDSELIRIT-----EGSF-EVIQTFANIAPILDFTIM 435

Query: 306 DL------ERQ------GQGQVVTCSGAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWS 352
           DL      E Q      GQ ++VT SGA+ DGSLR VR+G+G+ E   +  ++ I  +W+
Sbjct: 436 DLGGRAIGESQTHEFSSGQARIVTGSGAFNDGSLRSVRSGVGMEEVGVLGAMKHITDLWA 495

Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETE 388
           LR +    F   L+VSF+ ETR+     + E+EE E
Sbjct: 496 LRVACPQEFSDTLLVSFVDETRVFYFTPDGEVEEKE 531


>gi|392570042|gb|EIW63215.1| hypothetical protein TRAVEDRAFT_161375 [Trametes versicolor FP-101664
            SS1]
          Length = 1213

 Score =  225 bits (574), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 277/1179 (23%), Positives = 482/1179 (40%), Gaps = 182/1179 (15%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
            ++G I +L  FR  G  +D++ + ++  +  +L +D  +S  +            R    
Sbjct: 58   VFGSIRSLAAFRLTGGTKDYVIVGSDSGRIVILDYDPRTSTFVKIHQETYGKSGARRIVP 117

Query: 66   GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLK-----EAF--NIRLEELQVLDIKF 116
            GQ    DP  R  +IG      L  ++  D    L      EA   N  +  +  LD+ F
Sbjct: 118  GQYLATDPKGRSVMIGSVEKAKLVYILNRDAAANLTISSPLEAHKNNAIIHHIVGLDVGF 177

Query: 117  LYGCAKPTIVVLYQD-------------NKDARHVKTYEVALKDKDFVEGPWSQNNLDNG 163
                  PT   L  D             N   + +  YE+ L     V   WS+   D  
Sbjct: 178  ----ENPTFAALEVDYSESDQDPTGEAFNNAEKMLTYYELDLGLNHVVRK-WSEPT-DPR 231

Query: 164  ADLLIPVP------------PPLCGVLIIGEETIVYCSANAFK---AIPIRPS------- 201
            A+LL+ VP            P   GVL+  E+ I+Y   ++ +    IP R         
Sbjct: 232  ANLLVQVPGGQLAASERYDGP--SGVLVCCEDHIIYRHMDSPQHRVPIPRRKHPLEDTER 289

Query: 202  ---ITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLD 258
               IT A          +LL    G L+ + I HE++ V  +KI+      +AS++  L 
Sbjct: 290  GLIITAAVMHKMKGAFFFLLQSEEGDLYKVTIEHEEQDVKAVKIKYFDTVPVASSLCILK 349

Query: 259  NAVVYIGSSYGDS---QLIKLN---------------------LQPDAKGSY-------V 287
            +  +++ S +G+    Q  KL                        P  +G +       +
Sbjct: 350  SGFLFVASEFGNHYFYQFQKLGDDDNEPEWSSADYPSYGMADPSTPLPRGYFRPRALDNL 409

Query: 288  EVLERYVNLGPIVDFCVVDL-ERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG 346
             +++   +L PI+D  V++L       Q+ T  G     + R +R+G+ + E  S +L G
Sbjct: 410  TLVDELESLCPIIDSKVMNLLPNSDTPQIFTACGRGARSTFRTLRHGLEVEEVVSSDLPG 469

Query: 347  I-KGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAI 405
            I   +W+ +   DD +D+++++SF++ T +L++   + +EE +  GF S   TL      
Sbjct: 470  IPNAVWTTKLKEDDLYDSYIILSFVNGTLVLSIG--ETIEEVQDTGFLSSAPTLAVQQIG 527

Query: 406  YNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG- 464
             + L+QV    +R V +  R   NEWK P G ++  AT N  QV++A     LVY E+  
Sbjct: 528  ADALLQVYPHGIRHVLADRRV--NEWKVPSGKTIVCATTNKRQVVVALSSAELVYFELDL 585

Query: 465  DGILTEVKHAQ-LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEH 523
            DG L E +  + +   +  L +  + E    +   AVG   D +VRI SL   + +    
Sbjct: 586  DGQLNEYQDRKAMGSTVLALSVAEVPEGRQRTPYLAVGC-EDQTVRIVSLDPESTLETIS 644

Query: 524  LGGEIIPRSVLLCA---------FEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLG 574
            L     P S +  A          +  +++   L +G LL  +L+   G+LTD +   LG
Sbjct: 645  LQALTAPPSAICIADMLDASINKSQPTTFVNIGLQNGVLLRTVLDPVNGQLTDTRTRFLG 704

Query: 575  TQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLA 634
            T+PI L   + +    + A S RP + Y+    + ++ +  + + +   F++    + L 
Sbjct: 705  TRPIRLLRVNIQQNPAILALSSRPWLNYTYQNFMHFTPLIFENLDYAWSFSAELCTEGLI 764

Query: 635  IAKEGELTIGTIDDI-QKLHIRSIPLGEHPRRIC--------------HQEQSRTFAICS 679
                  L I  I  +  KL   S+PL   PR+                H+  S   A   
Sbjct: 765  GISGSLLRIFQIPKLGTKLKQDSLPLSYTPRKFMPHPTNGLLYLIEGDHRVMSEEAASKK 824

Query: 680  LKNQSCAEESEMHFVRLLDDQTF----------------------EFISTYPLDTFEYGC 717
            L+      E     V LL  + F                        +    LD  E   
Sbjct: 825  LQEMRARGERIDEEVLLLPPEQFGRPKAPAGTWASCIRIINPLESSTVKVIHLDNNEAAF 884

Query: 718  SILSCSFSDDSN-VYYCVGTA---YVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAV 773
            S+    F+   N ++  VGTA   ++ P        R   FI +   L+ + + ET    
Sbjct: 885  SMAIVPFAARGNELHLVVGTAQDTFLSPRSCTSGFLRTYRFIDDGRDLEFLHKTETSDVP 944

Query: 774  YSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY-VQTRGDFIVVGD 832
             ++ AF GKL+A + + ++LY     D G ++L  +  + G   A+  + T+G  I+VGD
Sbjct: 945  LAVMAFQGKLIAGVGKSLRLY-----DVGKKKLLRKVENKGFPAAIVTLNTQGSRIIVGD 999

Query: 833  LMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSE--- 889
            + +S+   +YK  E  +   A D    W++A  +LD +  +  +   N+F  R +S+   
Sbjct: 1000 MQESVFYAVYKAPENRLLVFADDAQPRWVTATTMLDYNTVVAGDRFGNVFVNRLDSKISD 1059

Query: 890  --------GATDEERGRL-------EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTV 934
                         E+G L        ++  +H+G+ V      +LV        G    +
Sbjct: 1060 QIDDDPTGAGILHEKGVLFGAPHKSVMLSHFHVGDIVTSLHKVALV-------AGGREVL 1112

Query: 935  IFGTVNGVIGVIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKN 993
            ++  ++G IG++   +  E    L  L+ ++R     + G +H  WR +      V  K+
Sbjct: 1113 LYTCLHGTIGILVPFVSKEDVDLLTTLEQHMRTEQLSLVGRDHLTWRGY-----YVPVKS 1167

Query: 994  FLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
             +DGDL ESF  L   +   I+  ++ +V E+ K++E+L
Sbjct: 1168 VVDGDLCESFAKLPANKQSTIAGELDRTVGEVLKKLEQL 1206


>gi|409075182|gb|EKM75565.1| hypothetical protein AGABI1DRAFT_64324 [Agaricus bisporus var.
            burnettii JB137-S8]
          Length = 1213

 Score =  225 bits (574), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 275/1177 (23%), Positives = 484/1177 (41%), Gaps = 178/1177 (15%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
            ++G I +L  FR  G  +D+  + ++  +  +L++DA++S               R    
Sbjct: 58   VFGSIRSLASFRLTGATKDYAILGSDSGRIIILEYDAKTSSFAKLHQETFGKSGARRIVP 117

Query: 66   GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIR-------LEELQVLDIKF 116
            GQ    DP  R  +I       L  ++  D    L  +  +        +  +  LD+ F
Sbjct: 118  GQYLATDPKGRSVMIAAMEKAKLVYILNRDAAANLTISSPLEAHKNSSIIHHIVGLDVGF 177

Query: 117  ---LYGCAKPTIVVLYQDN-----KDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLI 168
               +Y   +       QD      K A  + TY       + V   WS+   D  A+LL+
Sbjct: 178  ENPVYAALEVDYSESDQDPTGEAFKRAEKMLTYYELDLGLNHVVRKWSEPT-DPRANLLV 236

Query: 169  PVP------------PPLCGVLIIGEETIVYCSANAFK---AIPIRPS----------IT 203
             VP            P   GVL+  E+ I+Y   ++ +    IP R            IT
Sbjct: 237  QVPGGQLASSDRFDGP--SGVLVCCEDHIIYRHMDSPQHRVPIPRRQHPLEDKERGVIIT 294

Query: 204  KAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVY 263
             A          +LL    G L  + I HE E+V  LKI+      +AS++  L +  ++
Sbjct: 295  SAVMHKMKGAFFFLLQSEDGDLFKVTIEHEDEEVKALKIKYFDTVPVASSLCILKSGFLF 354

Query: 264  IGSSYGDSQLIKLN---------------------LQPDAKGSYVEVLERYVN------- 295
            + S +G+  L +                        +P A    V    R ++       
Sbjct: 355  VASEFGNHYLYQFQKLGDDDEEPEFSSTSFPSSGMAEPQAALPRVYFKPRPLDNLALADE 414

Query: 296  ---LGPIVDFCVVDL-ERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGM 350
               L PI+D  V++L       Q+    G     SLR +++G+ + E  S +L GI   +
Sbjct: 415  LESLDPIIDSKVLNLLPNSDTPQIFAACGRGARSSLRTLQHGLEVEESVSSDLPGIPNAV 474

Query: 351  WSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLV 410
            W+ + + DDP+D+++++SF++ T +L++   + +EE +  GF S   TL       + L+
Sbjct: 475  WTTKRNEDDPYDSYIILSFVNGTLVLSIG--ETIEEVQDTGFLSSAPTLAVQQIGSDALL 532

Query: 411  QVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILT 469
            QV    +R V +  R   NEW+ P   ++  AT N  QV++A     LVY E+  DG L 
Sbjct: 533  QVHPHGIRHVLADRRV--NEWRVPSNKTIVAATTNKRQVVVALSSAELVYFELDLDGQLN 590

Query: 470  EVKHAQ-LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEI 528
            E +  + +   +  L I  + E    +   AVG   D +VRI SL   + +    L    
Sbjct: 591  EYQDRKAMGSTVLALSIGDVPEGRQRTPYLAVGC-EDQTVRIISLDPESTLETISLQALT 649

Query: 529  IPRSVLLCA---------FEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPIT 579
             P S +  A          +   ++   L +G LL  +L+   G+LTD +   LGT+PI 
Sbjct: 650  APPSAICIADMLDASINKAQPTMFVNIGLQNGVLLRTVLDPINGQLTDTRTRFLGTRPIK 709

Query: 580  LRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEG 639
            L          + A S R  + Y+    + ++ +  + + +   F++   P+ L      
Sbjct: 710  LVRVLIHKHPAILALSSRSWLNYTYQNFMHFTPLIYENLDYAWSFSAELSPEGLIGISGS 769

Query: 640  ELTIGTIDDIQ-KLHIRSIPLGEHPRRICHQEQSRTFAIC-----------------SLK 681
             L I  +  +  KL   SIPL   PR+      +  F +                   L+
Sbjct: 770  VLRIFHVPRLGIKLKQDSIPLSYTPRKFITYPLNNLFYLIEGDHRVMGQDAVDKKLNELR 829

Query: 682  NQSCAEESEM-------------------HFVRLLDDQTFEFISTYPLDTFEYGCSILSC 722
             Q+ A + E+                     +R++D    + IS  PLD  E   S+   
Sbjct: 830  QQNRAIDQEVLNLSPEVFGRPKAANGTWASNIRIIDPVEGKTISVVPLDGNESAFSLAVV 889

Query: 723  SFSDDSN-VYYCVGTAY---VLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNA 778
             FS   N ++  VGTA    + P        R+  F+ +  +L+L+ + E      +L A
Sbjct: 890  PFSAKGNELHLVVGTAADTKLSPRTCSTGFLRVYKFLEDGRQLELVHKTEIDDVPLALMA 949

Query: 779  FNGKLLAAINQKIQLY----KWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLM 834
            F G+L+A + + +++Y    K MLR    ++  S          + + T+G  I+VGD+ 
Sbjct: 950  FQGRLVAGVGKALRIYDIGKKKMLRKVENKQFGSAI--------VTLSTQGSRILVGDMQ 1001

Query: 835  KSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGA--T 892
            +SI   +YK  E  +   A D    W+SA  ++D +  + A+   N+F  R +   +   
Sbjct: 1002 ESIFFAVYKAPENRLLIFADDSQPRWISAATMVDYNTVVAADRFGNIFVNRLDPRVSDQV 1061

Query: 893  DE---------ERG-------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIF 936
            DE         E+G       + +++  +H+G+ +      SLV        G    +++
Sbjct: 1062 DEDPTGAGILHEKGLYMGAPHKTKMICHFHVGDLITSIHKVSLV-------AGGREVLLY 1114

Query: 937  GTVNGVIGVIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFL 995
              ++G IG++   +  E   F+  L+ ++R     + G +H  WR +      V  K  +
Sbjct: 1115 TGLHGTIGILVPFVTKEDVDFISTLEQHMRTEQVSLVGRDHLGWRGY-----YVPVKAVV 1169

Query: 996  DGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            DGDL E +  L  ++   I+  ++ S+ E+ K++E+L
Sbjct: 1170 DGDLCEMYAKLPGSKQSAIAGELDRSIGEVLKKLEQL 1206


>gi|322707263|gb|EFY98842.1| Pre-mRNA-splicing factor rse-1 [Metarhizium anisopliae ARSEF 23]
          Length = 1212

 Score =  224 bits (571), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 288/1169 (24%), Positives = 497/1169 (42%), Gaps = 168/1169 (14%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
            ++G I +L  FR  G  +D+L IA++  +  V+++    +      +        R    
Sbjct: 64   VFGIIRSLAAFRLAGSNKDYLIIASDSGRITVVEYLPAQNRFSRLHLETFGKSGVRRVIP 123

Query: 66   GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YGC 120
            G+    DP  R  LI     + L  V+    + +L  +  +   +  VL I  +    G 
Sbjct: 124  GEYLACDPKGRACLIASTEKNKLVYVLNRSAQAELTISSPLEAHKPGVLVISMVALDVGY 183

Query: 121  AKPTIVVLYQD----NKDA-----RHVKT----YEVALKDKDFVEGPWSQNNLDNGADLL 167
            A P    L  D    ++D+       V+T    YE+ L     V   WS+  +D  A LL
Sbjct: 184  ANPVFAALEMDYSEVDQDSSGQALEEVETQLVYYELDLGLNHVVRK-WSEP-VDPTASLL 241

Query: 168  IPVP-----PPLCGVLIIGEETIVYCSAN--AFKA-IPIRPSITK--AYGRVDADGSRYL 217
              VP     P   GVL+ GEE+I Y  +N  AF+  IP R   T+  +  R    G  + 
Sbjct: 242  FQVPGGNDGP--SGVLVCGEESITYRHSNQDAFRVPIPRRKGATEDPSRKRTIVSGVMHK 299

Query: 218  LGDHAGLLHLLVITHEKE------------------KVTGLKIELLGETSIASTISYLDN 259
            L   AG    L+ T + +                  +V  LKI+      +A+++  L +
Sbjct: 300  LKGSAGAFFFLLQTEDGDLFKVTIDMVEDEEGSPTGEVKRLKIKYFDTVPVATSLCILKS 359

Query: 260  AVVYIGSSYGD---SQLIKL------------NLQPDAKGSY------------VEVLER 292
              +YI S +G+    Q  KL            +   D + SY            + ++E 
Sbjct: 360  GFLYIASQFGNFSFYQFEKLGDDDAELEFSSDDFPVDPQASYDPVYFHPRPAENLALVES 419

Query: 293  YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMW 351
              ++ P++D  V +L  +   Q+ T  G     + R++R+G+ +NE  + EL GI   +W
Sbjct: 420  IPSMNPLLDCQVANLTGEDAPQIYTVCGNGARSTFRMLRHGLEVNEIVASELPGIPSAVW 479

Query: 352  SLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQ 411
            +L+ +  + +D ++V+SF + T +L++   + +EE    GF +   TL       + L+Q
Sbjct: 480  TLKLNRGEQYDAYIVLSFTNGTLVLSIG--ETVEEVSDSGFSTSVPTLAAQLLGDDGLIQ 537

Query: 412  VTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTE 470
            V    +R + +      NEW +P   S+  A+ NA QV +A   G +VY E+  DG L E
Sbjct: 538  VHPKGIRHIRNGKV---NEWDAPQHRSIVAASTNAHQVAIALSSGEIVYFEMDSDGSLAE 594

Query: 471  VKHAQLEY-EISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEI 528
                +  +  ++CL +  + E    S   AVG   D +VRI SL P+  L +K       
Sbjct: 595  YDEKKEMFGTVTCLSLGEVPEGRVRSSFLAVGC-DDCTVRILSLDPESTLESKSVQALTA 653

Query: 529  IPRSVLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRT 582
             P S+ + A +  S      YL   L  G  L  +L+  TGELTD ++  LG + + L  
Sbjct: 654  APSSLAIIAMDDSSSGGSTLYLHIGLHSGVYLRTVLDEVTGELTDTRQKFLGPKQVRLFQ 713

Query: 583  FSSKNTTHVFAASDRPTVIYSS--NKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGE 640
             + + TT V   S RP + YS    K  + + +N  ++     F+S    + +   +   
Sbjct: 714  VTVQGTTCVLGLSSRPWLGYSDPITKGFVVTPLNYVDLEWGWNFSSEQCEEGVVGIQGQS 773

Query: 641  LTIGTIDDI-QKLHIRSIPLGEHPRR-ICHQEQSRTFAICS--------LKNQSCAEESE 690
            L I +ID +   L  +++ L   P++ I H EQ   + I S        L+ Q  A+ + 
Sbjct: 774  LRIFSIDRLGDTLTQKAVSLTYTPKKLIKHPEQPLFYTIESDNNTLPPDLRAQLMADPAV 833

Query: 691  MHF-VRLLDDQTF-----------------------EFISTYPLDTFEYGCSILSCSF-S 725
            ++    +L  + F                       + + T  L+  E   S     F S
Sbjct: 834  VNGDATVLPPEDFGYPKGNRRWASCINVIDPVSEEPQVVQTVDLENNEAAVSAAIVPFAS 893

Query: 726  DDSNVYYCVGTA---YVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGK 782
             D+  +  VGT     V P         +  F  E  +L+ I + + +    +L  F GK
Sbjct: 894  QDNESFLIVGTGKDVVVNPRNFSEAYIYVYRFQEEGRELEFIHKTKIEEPALALIPFQGK 953

Query: 783  LLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIY 842
            LLA + + +++Y   +R    R+ Q+E      I++L   T+G  I+VGD+ + ++ + Y
Sbjct: 954  LLAGVGKTLRVYDLGMR-QMLRKAQAEVAPQ-QIVSL--NTQGSRIIVGDIQQGVTYVTY 1009

Query: 843  KHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDEERG-- 897
            K     +   A D  A W +   ++D +   G +   N+F VR   K SE A +E+ G  
Sbjct: 1010 KPTTNKLIPFADDTIARWTTCTTMVDYESVAGGDKFGNMFIVRCPPKASEEADEEQSGLH 1069

Query: 898  -------------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIG 944
                         RL+++  ++  +        SLV       VG    +++  + G IG
Sbjct: 1070 LMNARDYLHGTSQRLDLMCHFYTQDIPTSMAKTSLV-------VGGQDVLLWSGLMGTIG 1122

Query: 945  V-IASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESF 1003
            V I  +  E   F + L+++LR     + G +H  +RS+         K  +DGDL E +
Sbjct: 1123 VFIPLISREDADFFQSLESHLRTEDPPLAGRDHLMYRSY-----YAPVKGIIDGDLCERY 1177

Query: 1004 LDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
              L   +   I+  ++ SV E+ +++ ++
Sbjct: 1178 TLLPNDKKQMIAGELDRSVREIERKISDI 1206


>gi|426192113|gb|EKV42051.1| hypothetical protein AGABI2DRAFT_229642 [Agaricus bisporus var.
            bisporus H97]
          Length = 1213

 Score =  224 bits (570), Expect = 3e-55,   Method: Compositional matrix adjust.
 Identities = 275/1177 (23%), Positives = 483/1177 (41%), Gaps = 178/1177 (15%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
            ++G I +L  FR  G  +D+  + ++  +  +L++DA++S               R    
Sbjct: 58   VFGSIRSLASFRLTGATKDYAILGSDSGRIIILEYDAKTSSFAKLHQETFGKSGARRIVP 117

Query: 66   GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIR-------LEELQVLDIKF 116
            GQ    DP  R  +I       L  ++  D    L  +  +        +  +  LD+ F
Sbjct: 118  GQYLATDPKGRSVMIAAMEKAKLVYILNRDAAANLTISSPLEAHKNSSIIHHIVGLDVGF 177

Query: 117  ---LYGCAKPTIVVLYQDN-----KDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLI 168
               +Y   +       QD      K A  + TY       + V   WS+   D  A+LL+
Sbjct: 178  ENPVYAALEVDYSESDQDPTGEAFKRAEKMLTYYELDLGLNHVVRKWSEPT-DPRANLLV 236

Query: 169  PVP------------PPLCGVLIIGEETIVYCSANAFK---AIPIRPS----------IT 203
             VP            P   GVL+  E+ I+Y   ++ +    IP R            IT
Sbjct: 237  QVPGGQLASSDRFDGP--SGVLVCCEDHIIYRHMDSPQHRVPIPRRQHPLEDKERGVIIT 294

Query: 204  KAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVY 263
             A          +LL    G L  + I HE E+V  LKI+      +AS++  L +  ++
Sbjct: 295  SAVMHKMKGAFFFLLQSEDGDLFKVTIEHEDEEVKALKIKYFDTVPVASSLCILKSGFLF 354

Query: 264  IGSSYGDSQLIKLN---------------------LQPDAKGSYVEVLERYVN------- 295
            + S +G+  L +                        +P A    V    R ++       
Sbjct: 355  VASEFGNHYLYQFQKLGDDDEEPEFSSTSFPSSGMAEPQAALPRVYFKPRPLDNLALADE 414

Query: 296  ---LGPIVDFCVVDL-ERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGM 350
               L PI+D  V++L       Q+    G     SLR +++G+ + E  S +L GI   +
Sbjct: 415  LESLDPIIDSKVLNLLPNSDTPQIFAACGRGARSSLRTLQHGLEVEESVSSDLPGIPNAV 474

Query: 351  WSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLV 410
            W+ + + DDP+D+++++SF++ T +L++   + +EE +  GF S   TL       + L+
Sbjct: 475  WTTKRNEDDPYDSYIILSFVNGTLVLSIG--ETIEEVQDTGFLSSAPTLAVQQIGSDALL 532

Query: 411  QVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILT 469
            QV    +R V +  R   NEW+ P    +  AT N  QV++A     LVY E+  DG L 
Sbjct: 533  QVHPHGIRHVLADRRV--NEWRVPSNKIIVAATTNKRQVVVALSSAELVYFELDLDGQLN 590

Query: 470  EVKHAQ-LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEI 528
            E +  + +   +  L I  + E    +   AVG   D +VRI SL   + +    L    
Sbjct: 591  EYQDRKAMGSTVLALSIGDVPEGRQRTPYLAVGC-EDQTVRIISLDPESTLETISLQALT 649

Query: 529  IPRSVLLCA---------FEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPIT 579
             P S +  A          +   ++   L +G LL  +L+   G+LTD +   LGT+PI 
Sbjct: 650  APPSAICIADMLDASINKAQPTMFVNIGLQNGVLLRTVLDPINGQLTDTRTRFLGTRPIK 709

Query: 580  LRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEG 639
            L          + A S R  + Y+    + ++ +  + + +   F++   P+ L      
Sbjct: 710  LVRVLIHKHPAILALSSRSWLNYTYQNFMHFTPLIYENLDYAWSFSAELSPEGLIGISGS 769

Query: 640  ELTIGTIDDIQ-KLHIRSIPLGEHPRRICHQEQSRTFAIC-----------------SLK 681
             L I  +  +  KL   SIPL   PR+      +  F +                   L+
Sbjct: 770  VLRIFHVPRLGIKLKQDSIPLSYTPRKFITYPLNNLFYLIEGDHRVMGQDAVDKKLNELR 829

Query: 682  NQSCAEESEM-------------------HFVRLLDDQTFEFISTYPLDTFEYGCSILSC 722
             Q+ A + E+                     +R++D    + IS  PLD  E   S+   
Sbjct: 830  QQNKAIDQEVLNLSPEVFGRPKAANGTWASNIRIIDPVEGKTISVVPLDGNESAFSLAVV 889

Query: 723  SFSDDSN-VYYCVGTAY---VLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNA 778
             FS   N ++  VGTA    + P        R+  F+ +  +L+L+ + E      +L A
Sbjct: 890  PFSAKGNELHLVVGTAADTKLSPRTCSTGFLRVYKFLEDGRQLELVHKTEIDDVPLALMA 949

Query: 779  FNGKLLAAINQKIQLY----KWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLM 834
            F G+L+A + + +++Y    K MLR    ++  S          + + T+G  I+VGD+ 
Sbjct: 950  FQGRLVAGVGKALRIYDIGKKKMLRKVENKQFGSAI--------VTLSTQGSRILVGDMQ 1001

Query: 835  KSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGA--T 892
            +SI   +YK  E  +   A D    W+SA  ++D +  + A+   N+F  R +   +   
Sbjct: 1002 ESIFFAVYKAPENRLLIFADDSQPRWISAATMVDYNTVVAADRFGNIFVNRLDPRVSDQV 1061

Query: 893  DE---------ERG-------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIF 936
            DE         E+G       + +++  +H+G+ +      SLV        G    +++
Sbjct: 1062 DEDPTGAGILHEKGLYMGAPHKTKMICHFHVGDLITSIHKVSLV-------AGGREVLLY 1114

Query: 937  GTVNGVIGVIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFL 995
              ++G IG++   +  E   F+  L+ ++R     + G +H  WR +      V  K  +
Sbjct: 1115 TGLHGTIGILVPFVTKEDVDFISTLEQHMRTEQVSLVGRDHLGWRGY-----YVPVKAVV 1169

Query: 996  DGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            DGDL E +  L  ++   I+  ++ S+ E+ K++E+L
Sbjct: 1170 DGDLCEMYAKLPGSKQSAIAGELDRSIGEVLKKLEQL 1206


>gi|221044336|dbj|BAH13845.1| unnamed protein product [Homo sapiens]
          Length = 222

 Score =  223 bits (569), Expect = 4e-55,   Method: Compositional matrix adjust.
 Identities = 113/208 (54%), Positives = 151/208 (72%), Gaps = 7/208 (3%)

Query: 223 GLLHLLVITHEKE-----KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLN 277
           G L +L++  E++      +  L++ELLGETSIA  ++YLDN VV++GS  GDSQL+KLN
Sbjct: 3   GRLFMLLLEKEEQMDGTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLN 62

Query: 278 LQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGIN 337
           +  + +GSYV  +E + NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+
Sbjct: 63  VDSNEQGSYVVAMETFTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIH 122

Query: 338 EQASVELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQ 397
           E AS++L GIKG+W LRS  +   D  LV+SF+ +TR+L +N E E+EETE+ GF    Q
Sbjct: 123 EHASIDLPGIKGLWPLRSDPNRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQ 181

Query: 398 TLFCHDAIYNQLVQVTSGSVRLVSSTSR 425
           T FC +  + QL+Q   GS RLV    +
Sbjct: 182 TFFCGNVAHQQLIQ-PPGSGRLVKPEGK 208


>gi|406868052|gb|EKD21089.1| pre-mRNA-splicing factor rse1 [Marssonina brunnea f. sp.
            'multigermtubi' MB_m1]
          Length = 1236

 Score =  223 bits (567), Expect = 6e-55,   Method: Compositional matrix adjust.
 Identities = 279/1163 (23%), Positives = 482/1163 (41%), Gaps = 162/1163 (13%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
            ++G I  L  FR  G  +D++ I ++  +  ++++    ++     +        R    
Sbjct: 64   VFGIIRALASFRLAGSNKDYIIITSDSGRITIVEFLPAQNKFNRLHLETFGKSGVRRVIP 123

Query: 66   GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YGC 120
            GQ   +DP  R  LI     + L  V+  +++ +L  +  +   + Q L         G 
Sbjct: 124  GQYLAVDPKGRACLIASVEKNKLVYVLNRNSQAELTISSPLEAHKPQTLVFAMCALDVGY 183

Query: 121  AKPTIVVLYQDNKDARHVKT-------------YEVALKDKDFVEGPWSQNNLDNGADLL 167
            A P    L  D  D+    T             YE+ L     V   WS + +D  A +L
Sbjct: 184  ANPVFAALEVDYGDSDQDPTGQAYDEIEKSLVYYELDLGLNHVVRK-WS-DPVDRTASVL 241

Query: 168  IPVPPPL---CGVLIIGEETIVYCSAN--AFK-AIPIRPSITK--AYGRVDADGSRYLLG 219
              VP       GVL+ G++ I Y  +N  AF+ AIP R   T+     RV   G  + L 
Sbjct: 242  FQVPGGTDGPSGVLVCGDDNITYRHSNQEAFRVAIPRRRGATEDPNRKRVIVGGVMHKLK 301

Query: 220  DHAGLLHLLVITHEKE------------------KVTGLKIELLGETSIASTISYLDNAV 261
              AG    L+ T + +                  +V  LKI+      IA+++  L +  
Sbjct: 302  GAAGAFFFLLQTDDGDLFKVTIEMVENDEGAPTGEVKRLKIKYFDTVPIATSLCILKSGF 361

Query: 262  VYIGSSYGDSQLIKL---------------NLQPDAKGSYVEV------------LERYV 294
            +++ S +G+ Q  +                N   D    Y  V            +E   
Sbjct: 362  LFVASEFGNHQFYQFEKLGDDDEETEFISDNFPTDPLEPYTPVYFHPRPAENLSLVESID 421

Query: 295  NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSL 353
            ++ P++D  V +L  +   Q+ +  G     + R +R+G+ ++E    EL G+   +W+ 
Sbjct: 422  SMNPLMDCKVANLTEEDAPQIYSICGTGARSTFRTLRHGLEVSEIVESELPGVPSAVWTT 481

Query: 354  RSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVT 413
            + + +D +D ++++SF + T +L++   + +EE    GF S   TL       + L+QV 
Sbjct: 482  KLTRNDTYDAYIILSFTNGTLVLSIG--ETVEEVTDTGFLSSAPTLAVQQLGDDGLIQVH 539

Query: 414  SGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTEV- 471
               +R + +  R   NEW +P   S+  A  NA QV +A   G +VY E+  DG L E  
Sbjct: 540  PKGIRHIRADRRV--NEWAAPQHRSIVAAATNARQVAVALSSGEIVYFEMDSDGSLAEYD 597

Query: 472  KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIP 530
            +  ++   ++CL +  + E    SQ  AVG   D +VRI SL PD  L  K        P
Sbjct: 598  EKKEMSGTVTCLSLGEVPEGRVRSQFLAVGC-DDSTVRILSLDPDSTLENKSVQALTSAP 656

Query: 531  RSVLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFS 584
             ++ + A    S      YL   L  G  L  +L+  TGEL+D +   LG +P  L   S
Sbjct: 657  SALSIMAMSDSSSGGSTLYLHIGLYSGVYLRTVLDEVTGELSDTRTRFLGPKPARLFRVS 716

Query: 585  SKNTTHVFAASDRPTVIYSS--NKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELT 642
             +    V A S RP + YS    K  + + +N   +     F+S    + +   +   L 
Sbjct: 717  VQGQAAVLALSSRPWLGYSDPVTKGFMLTPLNYPGLEWGWNFSSEQCTEGMVGIQGQNLR 776

Query: 643  IGTIDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQT 701
            I +I+ +   L   +I L   PRR     +   F      N   +  ++    +LL+D +
Sbjct: 777  IFSIEKLTDNLLQDTISLTYTPRRFLRHPEHPLFYTIESDNNILSPATK---AKLLEDPS 833

Query: 702  F-----------EFISTYP--------------------------LDTFEYGCSILSCSF 724
                        EF   YP                          LD  E   S+ + SF
Sbjct: 834  IVNGDAAVLPAEEF--GYPRGKNHWASCISVVDPVTEKRVLQKIDLDDNEAAVSMAAVSF 891

Query: 725  SD-DSNVYYCVGTAY-VLPEENEPTKGRILVFIVEDG--KLQLIAEKETKGAVYSLNAFN 780
            S  D  V+  +GT   ++      T G I V+   D   +++ I + + +    +L  F 
Sbjct: 892  SSQDDEVFLVIGTGKDMIVSPRSSTAGFIHVYRFHDNGKEIEFIHKTKVEEPPMALLGFQ 951

Query: 781  GKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLL 840
            G+LL  I + +++Y   +R    R+ Q+E   +   L + +QT+G  IVV D+ +SI ++
Sbjct: 952  GRLLVGIGKDLRIYDLGMR-QLLRKAQAEVAPN---LIVGLQTQGSRIVVSDVQESIIMI 1007

Query: 841  IYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDE--- 894
            +YK +E  +     D  + W S   ++D +   G +   NL+ +R   K SE A +E   
Sbjct: 1008 VYKFQENKLIPFVDDTISRWTSCTTMVDYETVAGGDKFGNLWLLRCPTKASEEADEEGSA 1067

Query: 895  -----ERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV--GQIPTVIFGTVNGVIGV-I 946
                 ER  L+  G  H    +  F    + M +  +++  G    +++  + G +G+ I
Sbjct: 1068 SHLVHERSYLQ--GSPHRLTLMAHFFTQDIPMSIQKTNLVAGGRDCILWSGIQGTLGILI 1125

Query: 947  ASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDL 1006
              +  E   F + L+ +LR     + G +H  +RS+      V  K  +DGDL E +  L
Sbjct: 1126 PFVSREDVDFFQTLEQHLRSEDAPLAGRDHLIYRSY-----YVPVKGVIDGDLCERYTLL 1180

Query: 1007 SRTRMDEISKTMNVSVEELCKRV 1029
               +   I+  ++ SV E+ +++
Sbjct: 1181 PTDKKQMIAGELDRSVREIERKI 1203


>gi|393217872|gb|EJD03361.1| hypothetical protein FOMMEDRAFT_108572 [Fomitiporia mediterranea
            MF3/22]
          Length = 1213

 Score =  222 bits (566), Expect = 7e-55,   Method: Compositional matrix adjust.
 Identities = 270/1177 (22%), Positives = 472/1177 (40%), Gaps = 180/1177 (15%)

Query: 7    YGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDNG 66
            +G I +L  FR  G  +D+  + ++  +  +L++D ++S  I            R    G
Sbjct: 59   FGSIRSLAAFRLTGGTKDYAILGSDSGRIVILEYDPKTSSFIKLHQETYGKSGARRIVPG 118

Query: 67   QIGIIDPDCRLIGLHLYDG--LFKVIPFDNKGQLKEAFNIR-------LEELQVLDIKF- 116
            Q    DP  R + +   +   L  ++  D    L  +  +        +  +  +D+ F 
Sbjct: 119  QYLATDPKGRSVMISAMEKAKLVYILNRDAAANLTISSPLEAHKNAAIIHHIVGVDVGFE 178

Query: 117  --LYGCAKPTIVVLYQD------NKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLI 168
              LY   +       QD      N   + +  YE+ L     V   WS+   D  A+LL+
Sbjct: 179  NPLYAALEVDYTESDQDPTGEAFNNAEKLLTFYELDLGLNHVVRK-WSEPT-DPRANLLV 236

Query: 169  PVP------------PPLCGVLIIGEETIVYCSANAFK---AIPIR--PSITKAYGRVDA 211
             VP            P   GVL+  E+ I+Y + ++ +    IP R  P   K  G +  
Sbjct: 237  QVPGGQLATSDRFDGP--SGVLVCCEDHIIYRNMDSPQHRVPIPRRRHPLEDKERGLIIV 294

Query: 212  DGSR--------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVY 263
                        +LL    G L+ + I HE E V  LKI+      +AS++  L +  ++
Sbjct: 295  SAVMHKMKGAFFFLLQSEEGDLYKVTIDHEDEDVRALKIKYFDTVPVASSLCILKSGFLF 354

Query: 264  IGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLG-------------------------- 297
            + S +G+  L +     D           Y N G                          
Sbjct: 355  VASEFGNHHLYQFQKLGDDDNEPEFSSTDYPNFGMADPTVPLPAAYFKPRPLDNLVLADE 414

Query: 298  -----PIVDFCVVD-LERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGM 350
                 PI+D  V++ L      Q+    G     S R +R+G+ + E  S +L GI   +
Sbjct: 415  IESLDPIIDSKVMNILPNSDTPQIFAACGRGARSSFRTLRHGLEVEEVVSSDLPGIPNAV 474

Query: 351  WSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLV 410
            W+ R   DDP+D+++++SF++ T +L++   + +EE +  GF S   TL       + L+
Sbjct: 475  WTTRIKEDDPYDSYIILSFVNGTLVLSIG--ETIEEVQDTGFLSSAPTLAVQQIGSDALL 532

Query: 411  QVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILT 469
            QV    +R V +  R   NEW+ P G ++  AT N  QV++A     LVY E+  +G L 
Sbjct: 533  QVHPQGIRHVLADRRV--NEWRVPQGRTIVAATTNKRQVVVALSSAELVYFELDLEGQLN 590

Query: 470  EVKHAQ-LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEI 528
            E +  + +   +  L +  +      +   AVG   D +VRI SL   + +    L    
Sbjct: 591  EYQDRKAMGSTVLALSVGEVPPGRQRTPYLAVGC-EDQTVRIVSLDPESTLETISLQALT 649

Query: 529  IPRSVLLCA---------FEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPIT 579
             P S +  A          +   ++   L +G LL  +L+   G+LTD +   LGT+P+ 
Sbjct: 650  APPSAICIADMLDSSIDKNQPTMFVNIGLQNGVLLRTVLDPVNGQLTDTRTRFLGTRPVK 709

Query: 580  LRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEG 639
            L     +  + + A S R  + Y+    L ++ +  + + H   F++   P+ L      
Sbjct: 710  LMRVQVQRNSSILALSSRSWLNYTHQGLLHFTPLIFETLDHAWEFSAELCPEGLIGISGS 769

Query: 640  ELTIGTIDDI-QKLHIRSIPLGEHPRRICHQEQSRTF-----------------AICSLK 681
             L I  I  +  KL   S+PL   PR+      ++ F                  +  L+
Sbjct: 770  VLRIFQIPRLGTKLKQDSMPLTYTPRKFIPHPMNQYFYMIEADHRVMGDDAAKEKLAELR 829

Query: 682  NQSCAEESEM-------------------HFVRLLDDQTFEFISTYPLDTFEYGCSILSC 722
             +    + E+                     +R+LD      +    LD  E   SI   
Sbjct: 830  QRGVKYDQEVVDLPPEVFGRPKAPAGTWGSCIRILDPINKATVKVVHLDNNEAAFSIAIV 889

Query: 723  SF-SDDSNVYYCVGTA---YVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNA 778
             F + +S ++ CVGTA   ++ P        R   F      L+L+ + E      +L A
Sbjct: 890  PFAARNSELFLCVGTASSTFLAPRSCSSGFIRTYAFTNGGADLELVHKTEADDVPMALMA 949

Query: 779  FNGKLLAAINQKIQLY----KWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLM 834
            F G+L A + + +++Y    K +LR   T+   S          + + T+G  I+VGD+ 
Sbjct: 950  FQGRLCAGVGKSLRIYEIGKKKLLRKVETKTYGSAI--------VTLNTQGSRIIVGDMQ 1001

Query: 835  KSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSE--GAT 892
            +SI   ++K  E  +   A D    W ++  ++D       +   N+F  R +S+     
Sbjct: 1002 ESIVYAVFKPPENRLLIFADDSQPRWTTSAVMVDYTTIAAGDKFGNVFINRLDSKISDQV 1061

Query: 893  DE---------ERGRLE-------VVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIF 936
            DE         E+G L        ++  +H+G+ V      SLV        G    +++
Sbjct: 1062 DEDPTGAGILHEKGLLMGAPHKTGMIAHFHVGDIVTSIHKISLV-------AGGREVLLY 1114

Query: 937  GTVNGVIGVIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFL 995
              ++G IG++   +  E   F+  L+ ++R     + G +H  WR +      V  K  +
Sbjct: 1115 TCLHGTIGILVPFVSKEDVDFISTLEQHMRSEKLSLVGRDHLAWRGY-----YVPVKAVV 1169

Query: 996  DGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            DGDL E F  L   +   I+  ++ +V E+ K++E+L
Sbjct: 1170 DGDLCEQFARLPANKQSAIAVELDRTVGEVLKKLEQL 1206


>gi|301110252|ref|XP_002904206.1| pre-mRNA-splicing factor RSE1 [Phytophthora infestans T30-4]
 gi|262096332|gb|EEY54384.1| pre-mRNA-splicing factor RSE1 [Phytophthora infestans T30-4]
          Length = 1197

 Score =  222 bits (565), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 284/1175 (24%), Positives = 468/1175 (39%), Gaps = 177/1175 (15%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            ++   ++G +  L+ FR  G  +D+L + ++  K  VL+ +  SS    R          
Sbjct: 52   VISTQVFGVVRALQPFRLTGGDRDYLVVGSDSGKIVVLEVNPTSSRFEARQSETYGKTGC 111

Query: 61   RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLK-----EAFNIRLEELQVLD 113
            R    GQ    DP  R  LIG      L  V+  D   +L      EA       L V+ 
Sbjct: 112  RRITPGQYLAADPKGRAVLIGAVEKQQLVYVMNRDASSRLTISSPLEAHRSNAIHLGVVG 171

Query: 114  IKFLYGCAKPTIVVLYQDNKDARH----------VKT---YEVALKDKDFVEGPWSQNNL 160
            +    G   P    L  D  +A            VKT   YE+ L   + V   WS+  +
Sbjct: 172  LDV--GFENPIFASLELDYAEADADPSGQAAREVVKTLVYYELDL-GLNHVTRRWSEQVV 228

Query: 161  DNGADLLIPVPPPLCG---VLIIGEETIVYCSANAFK---AIPIRPS------ITKAYGR 208
               A+ L+ VP    G   VL++GE T+ Y +    +   AIP R        I  A   
Sbjct: 229  -RSANRLVAVPGGGDGPGGVLVLGENTVQYKNEGHPELTCAIPRREGEHRDIIIVSAATH 287

Query: 209  VDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSY 268
               D    LL    G L+ + + +    V  +KI+      +AS++      +++  S +
Sbjct: 288  KQRDLFFVLLQSELGDLYKISLDYSGNVVEEIKIQFFDTIPVASSMCITKTGLLFCASEF 347

Query: 269  GDSQLIKL-------------NLQPDAKGSYVEVLERYVNLG---------PIVDFCVVD 306
             +  L +              +L  D        L +  NL          P+    V D
Sbjct: 348  SNHYLFQFLSIGEGDDAAKCSSLAMDPTEFSTFPLRKLTNLALASSSASLSPVTQLLVDD 407

Query: 307  LERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSLRSSTDDPFDTFL 365
            L  +   Q+    G     SLR++R+G+ I E A+  L G+ K +W L+ S  DP+D ++
Sbjct: 408  LANEQTPQMYALCGNNNRSSLRVLRHGLPITEMAASALPGVAKAVWCLKESYADPYDKYI 467

Query: 366  VVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSR 425
            VVSF  E   L + + + +EE    GF     +L       +  +Q+ +  +R V     
Sbjct: 468  VVSF--EDATLVLEVGETVEEVAQSGFLRDHGSLLVALLEDDSKLQIHTNGIRHVPKF-- 523

Query: 426  ELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD-GILTEVKHAQLEYEISCLD 484
            +   EWK+P    +    AN+ QV+++  GG ++Y E+G  G L E     L +E+  LD
Sbjct: 524  QPVTEWKAPGKKVIEHCAANSRQVVISLAGGEIIYFELGQSGELAEKGKLDLGFEVCSLD 583

Query: 485  INPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPRSVLLCAFEG--- 540
            +  + E     Q  AVG W D +VRI SL P+     K  L     P ++ L   +    
Sbjct: 584  LGEVPEGRQRFQFMAVGSW-DNTVRILSLDPNELFRQKSTLALTSHPHTLCLAQLQNEPS 642

Query: 541  ---------ISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHV 591
                       +L   L +G L   L++  T  L D +   LGT P+ L   + +    +
Sbjct: 643  TPDSEHSSQALFLSIGLDNGVLQQSLIDPITATLADSRSRFLGTNPVKLFRVAVEGKRSI 702

Query: 592  FAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSL-AIAKEGELTIGTIDDI- 649
             A S R  + Y    +   + ++ + +S+   FNS   P  + A+  EG + I T+D + 
Sbjct: 703  LALSSRAWISYFHQTRRHLTPLSCELLSYASSFNSEQCPGGIVALTNEG-MKILTVDQLG 761

Query: 650  -----QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMH------------ 692
                 QK ++R  P     + + H    R   I S  N+  A     H            
Sbjct: 762  DTFNQQKCNLRYTPR----KAVVHSPSRRLVVIESDHNEYGAAYKRQHGLQIPDIRSAND 817

Query: 693  -----------------------------FVRLLDDQTFEFISTYPLDTFEYGCSILSCS 723
                                          VR++D  + + +    LD  E   SI +C 
Sbjct: 818  LEDEAEDEINDALLFPRGPLPAEKDKWASCVRIIDPASCQTVVCEELDVDERARSIAACV 877

Query: 724  FSD-DSNVYYCVGTA--YVLPEENEPTKGRILVF-IVEDGKLQLIAEKETKGAVYSLNAF 779
            F D     +  VGT     L  +  P  G + V+ +VE  +L L+   E     +++  F
Sbjct: 878  FHDRGGEAFIIVGTVKKMQLHPQKAPAGGYLRVYRVVEGMQLVLVHTTEIDDIPHAMCEF 937

Query: 780  NGKLLAAINQKIQLY----KWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMK 835
             G+LL ++ + +++Y    K MLR    R   S        + + ++  GD I   D+ +
Sbjct: 938  QGRLLVSVGRALRIYDLGKKKMLRKCENRNFPS--------ILVELKAAGDRIYASDMHE 989

Query: 836  SISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSE------ 889
            S   + YK +E  +   A D    ++++  +LD D   GA+   N+F  R  SE      
Sbjct: 990  SFHFVKYKKDENQLVIFADDCVPRFITSSVLLDYDTLCGADKFGNVFVSRLPSEVSDEID 1049

Query: 890  -----------GATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGT 938
                       G  +    +LE V ++H+G+ V      SLV        G    VI+ T
Sbjct: 1050 NPTGNRILWDSGLLNGAPNKLEQVAQFHVGDVVTSMVRSSLVP-------GGTEAVIYAT 1102

Query: 939  VNGVIG-VIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDG 997
            + G IG +I     E   F   L+  +R+    + G +H  +RS+      +  KN  DG
Sbjct: 1103 IMGRIGALIPFTSREDVDFYTHLEMYMRQEQPPLCGRDHLSYRSY-----YIPVKNITDG 1157

Query: 998  DLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            DL E F  LS  +   +++ ++ +  E+ K++E++
Sbjct: 1158 DLCEQFSSLSVEKQASVAEDLDRTPAEVLKKLEDI 1192


>gi|392593521|gb|EIW82846.1| hypothetical protein CONPUDRAFT_81012 [Coniophora puteana RWD-64-598
            SS2]
          Length = 1213

 Score =  221 bits (564), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 266/1177 (22%), Positives = 481/1177 (40%), Gaps = 178/1177 (15%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
            ++G I +L  FR  G  +D+  + ++  +  +L +D ++S  +            R    
Sbjct: 58   VFGSIRSLAAFRLTGSTKDYAIVGSDSGRIVILDYDPKTSSFVKLHQETFGKSGARRIVP 117

Query: 66   GQIGIIDPDCRLIGLHLYDG--LFKVIPFDNKGQLKEAFNIRLEELQVLD---IKFLYGC 120
            GQ    DP  R + +   +   L  ++  D    L  +  +   +   +    +    G 
Sbjct: 118  GQYLATDPKGRSVMISAMEKAKLVYILNRDAAANLTISSPLEAHKSSAIIHHIVGLDVGF 177

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDK------------DFVEGPWSQNNLDNGADLLI 168
              P    L  D  +A    T E A   +            + V   WS+   D  A+LL+
Sbjct: 178  ENPLFAALEVDYAEADQDPTGEAAQNAEKMLTYYELDLGLNHVVRKWSEPT-DPRANLLV 236

Query: 169  PVP------------PPLCGVLIIGEETIVYCSANAFK---AIPIRPS----------IT 203
             VP            P   GVL+  E+ I+Y   +  +    IP R            I 
Sbjct: 237  QVPGGQVASSERFDGP--SGVLVCCEDHIIYRHMDKPQHRVPIPRRSHPLEDPKRGVIIV 294

Query: 204  KAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVY 263
             A          +LL    G L  + I H++++V  LKI+      +AS++  L +  ++
Sbjct: 295  AAVMHKMKGAFFFLLQSEDGDLFKVTIDHDEDEVKSLKIKYFDTVPVASSLCILKSGFLF 354

Query: 264  IGSSYGDSQLIKLNL------QPDAKGSYVE---VLERYV-------------------- 294
            + S +G+  L +         +P+   +      + E ++                    
Sbjct: 355  VASEFGNHYLYQFQKLGDDDDEPEFSSTSFPSFGMAESFIPLPHAHFRPRGLDNLALADE 414

Query: 295  --NLGPIVDFCVVD-LERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGM 350
              +L PI+D  V++ L      Q+ T  G     + R++R+G+ + E  S EL GI   +
Sbjct: 415  IESLDPILDAKVMNILPNSDTPQIFTACGRGSRSTFRMLRHGLEVEESVSSELPGIPNAV 474

Query: 351  WSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLV 410
            W+ + + DDP+D+++++SF++ T +L++   + +EE +  GF S   TL       + L+
Sbjct: 475  WTTKRTEDDPYDSYIILSFVNGTLVLSIG--ETIEEVQDTGFLSSAPTLAVQQIGSDALL 532

Query: 411  QVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILT 469
            QV    +R V S  R   NEW+ P G ++  AT N  QV++A     LVY E+  DG L 
Sbjct: 533  QVHPQGIRHVLSDRRV--NEWRVPQGKTIVCATTNKRQVVVALSSAELVYFELDLDGQLN 590

Query: 470  EVKHAQ-LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEI 528
            E +  + +   +  L +  + E    +   AVG   D +VRI SL   + +    L    
Sbjct: 591  EYQDWKAMGSTVLALSVGEVPEGRQRTPYLAVGC-EDQTVRIISLDPESTLETISLQALT 649

Query: 529  IPRSVLLCA---------FEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPIT 579
             P S +  A          +   ++   L +G LL  +L+   G+LTD +   LGT+PI 
Sbjct: 650  APPSAICIADMLDASINKSQPTMFVNIGLQNGVLLRTVLDPINGQLTDTRTRFLGTRPIR 709

Query: 580  LRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEG 639
            L   + +    + A S R  + Y+    + ++ +  + + +   F++   P+ L      
Sbjct: 710  LVRVTVQKNPGILALSSRSWLNYTHQSLMHFTPLIFENLDYAWSFSAELSPEGLIGITGS 769

Query: 640  ELTIGTIDDI-QKLHIRSIPLGEHPRRIC--------------HQEQSRTFAICSL---- 680
             L I  I  +  KL   ++PL   PR+                H+  S + A   L    
Sbjct: 770  VLRIFQIPKLGMKLKQDAVPLSYTPRKFVSHPANQYLYLIQGDHRVMSESAAEKKLQEMR 829

Query: 681  -KNQSCAEE-----------------SEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSC 722
             K Q   EE                 +    + ++D      I T  LD  E   S+   
Sbjct: 830  TKGQKVDEEILQLPVEVFGRPKAPAGTWASAICIIDPIEARTIHTVELDNNESAFSVAVV 889

Query: 723  SF-SDDSNVYYCVGTA---YVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNA 778
             F + D+ ++  VGTA    + P        R   F  E   L+L+ + ET     ++ A
Sbjct: 890  PFAARDNELHLVVGTAADTLLTPRSCRSGYLRTYRFTDEGRSLELLHKTETDDVPLAVMA 949

Query: 779  FNGKLLAAINQKIQLY----KWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLM 834
            F G+L+A + + ++LY    K +LR    +   S          + + T+G  I+VGD+ 
Sbjct: 950  FQGRLIAGVGKSLRLYEIGKKKLLRKAENKSFASAI--------VTLNTQGSRIIVGDMQ 1001

Query: 835  KSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGA 891
            +S+    YK  E  +   A D    W++A+ ++D       +   N+F  R   + S+  
Sbjct: 1002 ESVHFAAYKAPENRLLIFADDMQPRWVTALTMVDYTTIAVGDRFGNVFINRLDMRVSDQV 1061

Query: 892  TDE--------ERGRL-------EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIF 936
             D+        E+G+L       +++  +H+G+ +      SLV        G    +++
Sbjct: 1062 DDDPTGAGILHEKGQLSGAPHKTKLLCHFHVGDLITSIHKVSLV-------AGGREVLLY 1114

Query: 937  GTVNGVIGVIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFL 995
              ++G IG++   +  E   F+  L+ ++R     + G +   WR +    K V     +
Sbjct: 1115 TGIHGTIGILVPFVSKEDVDFISTLEQHMRSEQSSLVGRDQLSWRGYYTPVKAV-----V 1169

Query: 996  DGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            DGDL E+F  L+ ++   I+  ++ +V E+ K++E+L
Sbjct: 1170 DGDLCEAFARLTGSKQSAIAGELDRTVGEVLKKLEQL 1206


>gi|345563727|gb|EGX46712.1| hypothetical protein AOL_s00097g460 [Arthrobotrys oligospora ATCC
            24927]
          Length = 1308

 Score =  220 bits (561), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 265/1145 (23%), Positives = 481/1145 (42%), Gaps = 138/1145 (12%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
            ++G I  L  FR  G  +D++ I ++  +  +L++    ++     +        R    
Sbjct: 178  VFGIIRCLAAFRLAGSGKDYIIIGSDSGRITILEYIPTQNKFKRLHLETFGKSGVRRVVP 237

Query: 66   GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YGC 120
            GQ    DP  R  LI     + L  V+  + + +L  +  +   +   L         G 
Sbjct: 238  GQYLACDPRGRACLIASVEKNKLVYVLNRNAQAELTISSPLEAHKPHCLVFALCALDVGY 297

Query: 121  AKPTIVVLYQDNKDARHVKT---YEVALKDKDFVE---------GPWSQNNLDNGADLLI 168
              P   VL  D  +     T   Y  A K   + E           WS+  ++  ++ L+
Sbjct: 298  ENPIFAVLEMDYSETDQDPTGEAYREAEKSLTYYELDLGLNHVVRKWSEP-VERSSNFLL 356

Query: 169  PVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRV-DADGSRY------ 216
             VP     P   GVL+  E+ + Y   N  +   +R  I +  G   D +  R+      
Sbjct: 357  HVPGGSDGP--SGVLVCSEDAVTYRHMNQPE---LRVPIPRRRGACEDPNRKRFITCGVM 411

Query: 217  ---------LLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSS 267
                     LL    G L  + +  ++  V  L+I+      IA+++  L +  +++ S 
Sbjct: 412  HKMRGAFFFLLQSEDGDLFKVGLDFDENSVVRLRIKYFDTVPIATSLCILKSGFLFVASE 471

Query: 268  YGDSQLIKLNLQPD------------AKGSY--------VEVLERYVNLGPIVDFCVVDL 307
             G+    +     D               +Y        + ++E   ++  ++D  + +L
Sbjct: 472  NGNHHFYQFEKLGDDDNETEFTSDDEINAAYFSPRLAENLALMESVDSMNSLLDCQIANL 531

Query: 308  ERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIK-GMWSLRSSTDDPFDTFLV 366
                  Q+ T  GA    S R +++G+ ++E  S EL G    +W+ + ++ D +D ++V
Sbjct: 532  TNDDAPQIYTICGANARSSFRTLKHGLEVSEIVSSELPGGPVAVWTTKLASQDEYDAYIV 591

Query: 367  VSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRE 426
            +SF + T +L++   + +EE    GF S   TL       + L+QV    +R + +  R 
Sbjct: 592  LSFTNGTLVLSIG--ETVEEVTDTGFLSSAPTLAVQQLGEDGLLQVHPKGIRHIRADRRV 649

Query: 427  LRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTEVK-HAQLEYEISCLD 484
              NEW +P   S+  A+ N+ QV +A   G +VY E+  DG L E +   ++   ++CL 
Sbjct: 650  --NEWPAPQHRSIVAASTNSRQVAIALSSGEIVYFELDSDGQLAEYEDKKEMSGTVTCLS 707

Query: 485  INPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPRSVLLCAFEGIS- 542
            +  + E    S   AVG   D +VRI SL PD  L +K        P ++ + +    + 
Sbjct: 708  LGDVPEGKMRSSFLAVGC-DDSTVRILSLDPDSTLESKSVQALTSAPTALRIMSMPDSAG 766

Query: 543  --------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAA 594
                    +L   L  G  L  +L+  TGELTD +   LG QP+ L + S+  TT + A 
Sbjct: 767  GDSQRSTLFLHIGLYSGVYLRTVLDTITGELTDTRTRFLGPQPVKLFSVSAAGTTAILAL 826

Query: 595  SDRPTVIYSSNK-KLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDI-QKL 652
            S +P + Y++   + L + ++ + +++ C F+S   P+ +   +   L I T++ I + L
Sbjct: 827  SSKPWLGYTNGSGQYLLTPLSYQSLTYGCTFSSEQCPEGMVGIQGNNLRIFTVEKISENL 886

Query: 653  HIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEES----------------------- 689
               SIPL   PR++     +  F + S    + + ES                       
Sbjct: 887  QQESIPLSYTPRKMAKHPTAPYFYVVSSDADTLSPESRNNLIAKAQNGDSTELPPSEFGY 946

Query: 690  ---EMHF---VRLLDDQTFEFISTYPLDTFEYGCSILSCSFSD-DSNVYYCVGTAY-VLP 741
               E H+   V ++D    E      L   E   SI    FS  D   +  VGTA  ++ 
Sbjct: 947  PRCEGHWASCVEVVDPINKEVTHKIDLGENEAALSIAVVPFSSQDDEHFLVVGTAKDLVL 1006

Query: 742  EENEPTKGRILVFIVEDG--KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLR 799
                 + G   V+ + DG   +Q I + + + A  +  AF+G+LLA I Q +++Y   ++
Sbjct: 1007 NPRSSSCGYAHVYRLSDGGKTIQFIHKTKLEYAPLAFAAFHGRLLAGIGQDLRIYDLGMK 1066

Query: 800  DDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNAN 859
                R+ Q+    +   L   + T+ + I+V D+ +S++ ++YK +E  +   A D    
Sbjct: 1067 Q-LLRKTQATVAPN---LITGLTTQNNRIIVSDIQESVTYVVYKQQENKLIPFADDMVPR 1122

Query: 860  WMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDEERGRLEVVGEYHLGEFVNRFRH 916
            W +   +LD +   G +   NL+ VR   K SE A ++      +  + +L    NR   
Sbjct: 1123 WTTCTALLDYESVAGGDKFGNLWVVRAPPKASEEADEDPSSGYLLHEKSYLQGAPNRVNL 1182

Query: 917  GS--LVMRLPDS------DVGQIPTVIFGTVNGVIGVIAS-LPHEQYLFLEKLQTNLRKV 967
             S      +P S        G    V++  + G IG++   +  E   F + L+ ++R  
Sbjct: 1183 MSHFFTQDVPTSIQKVNLVAGGRDCVVWSGLQGTIGIMVPFISREDVDFFQTLEQHMRTE 1242

Query: 968  IKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCK 1027
               + G +H  +RS+      V  K  +DGDL E F  L   +   I+  ++ S+ E+ +
Sbjct: 1243 DAPLAGRDHLIYRSY-----YVPVKGVIDGDLCERFALLPYDKKQMIAAELDRSIREVER 1297

Query: 1028 RVEEL 1032
            +V ++
Sbjct: 1298 KVADM 1302


>gi|255081708|ref|XP_002508076.1| predicted protein [Micromonas sp. RCC299]
 gi|226523352|gb|ACO69334.1| predicted protein [Micromonas sp. RCC299]
          Length = 1199

 Score =  220 bits (560), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 267/1165 (22%), Positives = 471/1165 (40%), Gaps = 165/1165 (14%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
            ++G I +L  FR  G  +D+L   ++  +  VLQ+D E +               R    
Sbjct: 57   VFGVIRSLAAFRLTGANRDYLVCGSDSGRIVVLQYDKERNRFDKVHQETFGKSGCRRIVP 116

Query: 66   GQIGIIDPDCRLIGLHLYDG--LFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLY---GC 120
            GQ   +DP  R I +   +   L  V+  D +  L  +  +   +  V+         G 
Sbjct: 117  GQFLCVDPKGRAIMIAAIEKAKLVYVLNRDTEANLTISSPLEANKSHVVAFHVAALDCGL 176

Query: 121  AKPTIVVLYQDNKDARHVKTYEVAL---KDKDFVE--------GPWSQNNLDNGADLLIP 169
              P    +  D  DA    T E A    K   F E           S   +DNGA+ LI 
Sbjct: 177  DNPVFAAIELDYADADQDSTGEAAAEAQKHLTFYELDLGLNHVARKSSEPIDNGANHLIA 236

Query: 170  VPPPLCG---VLIIGEETIVYCS---ANAFKAIPIRPSITKAYGRVDADGSR-------- 215
            VP    G   VL+  E  I+Y +    +    IP R +++   G +    +         
Sbjct: 237  VPGGGDGPGGVLVCAENFIIYKNQGHPDVRAVIPRRNALSGDRGVLIVSSATHRTKQQFF 296

Query: 216  YLLGDHAGLLHLLVITHEK--EKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQL 273
            +L     G ++ + + ++K  E V+ +KI+         +I  L    ++  S +G+  L
Sbjct: 297  FLAQSEYGDIYKVTVEYQKGTEFVSEVKIKYFDTIPPCVSICVLKTGFLFAASEFGNHAL 356

Query: 274  IKLNLQPD-------AKGSYVEVLERYV-------------------NLGPIVDFCVVDL 307
             +     D       +  + VE  E Y                    +L P++D    +L
Sbjct: 357  YQFQGIGDDEDDVESSSATLVETDEGYQPVFFEPRALRNLHPIDDVESLCPVLDMQCHNL 416

Query: 308  ERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTDDPFDTFLV 366
              +   Q+    G     +LR++R G+ ++E A   L G    ++++R S  D FD ++V
Sbjct: 417  VAEETPQLYALCGTGPRSTLRVLRQGVALSEMAVSPLPGNPNAVFTVRKSASDEFDAYIV 476

Query: 367  VSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRE 426
            VSF + T +L++   + +EE    GF     TL       + L+QV  G +R + +  R 
Sbjct: 477  VSFTNATLVLSIG--ETVEEVSDSGFLGTVPTLSASLLGDDSLLQVHPGGLRHIRADKR- 533

Query: 427  LRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD-GILTEVKHAQLEYEISCLDI 485
              NEW++P   +V   T NA QV++A  GG L+Y E+   G L EV+  +   +++CLDI
Sbjct: 534  -INEWRTPGRKTVTRVTTNARQVIIALSGGELIYFELDQTGQLMEVEKLETSGDVACLDI 592

Query: 486  NPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEII---PRSVLLCAFEGIS 542
             P+ E    ++  AVG + D +VRI SL   + +  + +G + +   P S+L+   +G  
Sbjct: 593  GPVPEGHLRNRFLAVGSF-DSTVRILSLGAEDCL--QTMGVQALAAAPNSLLMLRDDGSG 649

Query: 543  --YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV 600
              YL   L +G LL   ++  TG+L+D +   LG +P  L   S +    + A S RP +
Sbjct: 650  SIYLNVGLTNGVLLRADVDSVTGQLSDTRARFLGARPPKLSAVSVQGKAAMMALSSRPWL 709

Query: 601  -IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDI-QKLHIRSIP 658
              +    +   S ++ + + H  PF+S A P+ +       L I  ++ + ++ + R+  
Sbjct: 710  GHFDLANRFALSPLSYEPLEHAAPFSSDACPEGVVAVAGNTLRIVAVERLGEQFNQRTCK 769

Query: 659  LGEHPRRICHQEQSRTFAICSLKNQSC--------------------------------- 685
            L   PR++       T A+      S                                  
Sbjct: 770  LRYTPRQMSVNVDRNTLAVVECDQSSVPYDERTGLEGAAGEKDTKMDVEGEEDDEEEDEV 829

Query: 686  ----AEE---------SEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYY 732
                AE+         S    VR++D  +        +   E         F     ++ 
Sbjct: 830  TMTPAEQFGAPKAPPGSWASCVRIVDPASASTKQIVEMTGNEAALCCCHVYFPQADELFL 889

Query: 733  CVGTAYVLPEENEPTKGRI--LVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQK 790
             VG+A  L      ++G    L    +DG ++L  +    G   ++  F G+LL  +   
Sbjct: 890  AVGSAVSLTFSPRDSEGGFIHLYRYTQDGGIELFHKTPLDGVPGAMCGFKGRLLVGVGNT 949

Query: 791  IQLY----KWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEE 846
            ++LY    K +LR    R   +            +  +G+ I VGD+ +S   + YK E+
Sbjct: 950  LRLYDFGKKKLLRKVENRNFPN--------FIKTIHAQGERIYVGDVQESFHYVRYKRED 1001

Query: 847  GAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR------------------KNS 888
            G++   A D     ++A   LD D   G +   N+F  R                     
Sbjct: 1002 GSMYIVADDVQPRHVTAACPLDYDTIAGGDRFGNVFVSRLAQDVSDEIEEDPTGGKTAYG 1061

Query: 889  EGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIAS 948
            +GA +    ++  V ++H+GE V     G+L         G + ++I+ T+ G +G +  
Sbjct: 1062 QGALNGASHKINQVTQFHVGETVCALTKGTL-------QAGGLESMIYATLMGTLGALMP 1114

Query: 949  LPH-EQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLS 1007
              + E   F   L+ ++R+ +  + G +H  +RS          K+ +DGDL E +  L 
Sbjct: 1115 FGNREDVDFCTHLEMHMRQELPPLLGRDHLAFRS-----SYFPVKDVIDGDLCEMYTVLP 1169

Query: 1008 RTRMDEISKTMNVSVEELCKRVEEL 1032
                  +++ M+ +V E+ K++E+L
Sbjct: 1170 HEAQRRVAEDMDRTVSEVLKKLEDL 1194


>gi|348503892|ref|XP_003439496.1| PREDICTED: splicing factor 3B subunit 3-like [Oreochromis niloticus]
          Length = 1217

 Score =  220 bits (560), Expect = 4e-54,   Method: Compositional matrix adjust.
 Identities = 267/1190 (22%), Positives = 492/1190 (41%), Gaps = 189/1190 (15%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            +L V ++G I +L  FR  G  +D++ + ++  +  +L++   S  +  +   +   + G
Sbjct: 53   LLTVEVFGIIRSLMAFRLTGGTKDYIVVGSDSGRIVILEYHP-SKNMFEKIHQETFGKSG 111

Query: 61   -RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
             R    GQ   +DP  R  +IG      L  ++  D   +L  +  +   +   L    +
Sbjct: 112  CRRIVPGQFLAVDPKGRAVMIGAIEKQKLVYILNRDAAARLTISSPLEAHKANTLVYHVV 171

Query: 118  ---YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNG 163
                G   P    L  D ++A +  T E A   +    F E     N+        L+  
Sbjct: 172  GVDVGFENPMFACLEMDYEEADNDPTGEAAANTQQTLTFYELDLGLNHVVRKYSEALEEH 231

Query: 164  ADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR--- 215
             + LI VP     P   GVLI  E  I Y   N      IR  I +    +D D  R   
Sbjct: 232  GNFLITVPGGSDGP--SGVLICSENYITY--KNFGDQPDIRCPIPRRRNDLD-DPERGMI 286

Query: 216  --------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAV 261
                          +L     G +  + +  ++E VT ++++      +A+ +  L    
Sbjct: 287  FVCSATHKTKSMFFFLAQTEQGDIFKVTLETDEEMVTEIRLKYFDTIPVATAMCVLKTGF 346

Query: 262  VYIGSSYGDSQLIKLN------------------------LQPDAKGSYVEVLERYVNLG 297
            +++ S +G+  L ++                          QP    + V V E+  NL 
Sbjct: 347  LFVASEFGNHYLYQIAHLGDDDDEPEFSSAMPLEEGDTFFFQPRPLKNLVLVDEQE-NLS 405

Query: 298  PIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSS 356
            PI+   + DL  +   Q+    G     +LR++R+G+ ++E A  EL G    +W++R  
Sbjct: 406  PIMSCQIADLANEDTPQLYVACGRGPRSTLRVLRHGLEVSEMAVSELPGNPNAVWTVRRH 465

Query: 357  TDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGS 416
             +D FD +++VSF++ T +L++   + +EE    GF   T TL C     + LVQV    
Sbjct: 466  VEDEFDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSLLGEDALVQVYPDG 523

Query: 417  VRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKHA 474
            +R + +  R   NEWK+P   ++     N  QV++A  GG LVY E+   G L E  +  
Sbjct: 524  IRHIRADKR--VNEWKTPGKKTIVRCAVNQRQVVIALTGGELVYFEMDPSGQLNEYTERK 581

Query: 475  QLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPD---LNLITKEHLGGEIIPR 531
            ++  ++ C+ +  +      S+  AVG+  D +VRI SL     L  ++ + L  +  P 
Sbjct: 582  EMSADVVCMSLANVPPGEQRSRFLAVGL-ADNTVRIISLDPSDCLQPLSMQALPAQ--PE 638

Query: 532  SVLLCAFEGIS--------------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQP 577
            S+ +    G+               YL   L +G LL  +L+  TG+L+D +   LG++P
Sbjct: 639  SLCIVEMGGVEKQDELGDKGTIGFLYLNIGLQNGVLLRTVLDPVTGDLSDTRTRYLGSRP 698

Query: 578  ITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAK 637
            + L     +    V A S R  + YS   +   + ++ + + +   F S   P+ +    
Sbjct: 699  VKLFRVRMQGQEAVLAMSSRSWLSYSYQSRFHLTPLSYETLEYASGFASEQCPEGIVAIS 758

Query: 638  EGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQE-----------QSRTFAICSLKNQS 684
               L I  ++ +  +  + + PL   PR+ + H E            + T A  + + Q 
Sbjct: 759  TNTLRILALEKLGAVFNQVAFPLQYTPRKFVIHPETNNLILIETDHNAYTEATKAQRKQQ 818

Query: 685  CAEE-------------SEM-------------------------HFVRLLDDQTFEFIS 706
             AEE             +EM                           VRL++      + 
Sbjct: 819  MAEEMVEAAGEDERELAAEMAAAFLNENLPEAIFGSPKAGAGQWASLVRLVNPIQGSTLD 878

Query: 707  TYPLDTFEYGCSILSCSFSDDSNVYYC-VGTAY-VLPEENEPTKGRILVFIVEDG--KLQ 762
               L+  E   S+ +C F++  + +Y  VG A  ++        G I  + +  G  KL+
Sbjct: 879  QVQLEQNEAAFSVAACRFTNTGDDWYVLVGVARDMILNPRSVGGGFIYTYRLVSGGEKLE 938

Query: 763  LIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSEC-GHHGHILALYV 821
             + +   +    ++  F G+ L  + + +++Y     D G ++L  +C   H   L   +
Sbjct: 939  FVHKTPVEDVPLAIAPFQGRALVGVGKLLRIY-----DLGKKKLLRKCENKHVPNLVTGI 993

Query: 822  QTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNL 881
             T G  ++V D+ +S+  + Y+  E  +   A D    W++   +LD D    A+   N+
Sbjct: 994  HTIGQRVIVSDVQESLFWVRYRRNENQLIIFADDTYPRWVTTACLLDYDTMAAADKFGNI 1053

Query: 882  FTVR--KNSEGATDE---------ERGRL-------EVVGEYHLGEFVNRFRHGSLVMRL 923
              VR   N+    DE         +RG L       E++  YH+GE V   +  +L+   
Sbjct: 1054 SIVRLPPNTSDDVDEDPTGNKALWDRGLLNGASQKAEIIMNYHVGETVLSLQKTTLI--- 1110

Query: 924  PDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSF 982
                 G   ++++ T++G IG++     HE + F + L+ ++R     + G +H  +RS+
Sbjct: 1111 ----PGGSESLVYTTLSGGIGILVPFTSHEDHDFFQHLEMHMRSEFPPLCGRDHLSFRSY 1166

Query: 983  NNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
                     KN +DGDL E F  +   +   +++ ++ +  E+ K++E++
Sbjct: 1167 -----YFPVKNVIDGDLCEQFNSMDPHKQKSVAEELDRTPPEVSKKLEDI 1211


>gi|46125735|ref|XP_387421.1| hypothetical protein FG07245.1 [Gibberella zeae PH-1]
          Length = 1208

 Score =  220 bits (560), Expect = 4e-54,   Method: Compositional matrix adjust.
 Identities = 274/1169 (23%), Positives = 479/1169 (40%), Gaps = 168/1169 (14%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
            ++G I +L  FR  G  +D+L IA++  +  ++++    +      +        R    
Sbjct: 64   VFGIIRSLAAFRLAGSNKDYLIIASDSGRITIIEYLPAQNRFQRLHLETFGKSGVRRVIP 123

Query: 66   GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YGC 120
            G+    DP  R  LI     + L  V+  +++ +L  +  +   +  VL I  +    G 
Sbjct: 124  GEYLACDPKGRACLIASTEKNKLVYVLNRNSQAELTISSPLEAHKPGVLVISMVALDVGY 183

Query: 121  AKPTIVVLYQDNKDARHVKT-------------YEVALKDKDFVEGPWSQNNLDNGADLL 167
            + P    L  D  +     T             YE+ L     V   WS + +D  A +L
Sbjct: 184  SNPVFAALEIDYSEVDQDSTGQAMEELDTQLVYYELDLGLNHVVRK-WS-DPVDPTASIL 241

Query: 168  IPVP-----PPLCGVLIIGEETIVYCSAN--AFK-AIPIRPSITK--AYGRVDADGSRYL 217
              VP     P   GVL+ GEE I Y  +N  AF+ AIP R   T+     R    G  + 
Sbjct: 242  FQVPGGNDGP--SGVLVCGEENITYRHSNQEAFRVAIPRRRGATEDPNRKRTIVSGIMHK 299

Query: 218  LGDHAGLLHLLVITHEKE------------------KVTGLKIELLGETSIASTISYLDN 259
            L  + G    L+ T + +                  +V  LK++      +AS++  L +
Sbjct: 300  LKGNTGAFFFLLQTDDGDLFKLSIDMIEDEEGNPTGEVKRLKVKYFDTVPVASSLCILKS 359

Query: 260  AVVYIGSSYGDSQLIKL---------------NLQPDAKGSY------------VEVLER 292
              +Y+ + +G+    +                +   D K SY            + ++E 
Sbjct: 360  GFLYVATQFGNYSFYQFEKLGDDDEELEFYSDDFPADPKASYEPVYFHPRPTENLALVES 419

Query: 293  YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMW 351
               + P++D  V +L  +   Q+ T  G     S R++++G+ +NE  + EL GI   +W
Sbjct: 420  IPAMNPLLDCKVANLTGEDAPQIYTICGNGPRSSFRMLKHGLEVNEIVASELPGIPSAVW 479

Query: 352  SLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQ 411
            +L+ +  + +D ++V+SF + T +L++   + +EE    GF +   TL       + L+Q
Sbjct: 480  TLKLNRSEQYDAYIVLSFTNGTLVLSIG--ETVEEVSDSGFLTSVPTLAAQLLGDDGLIQ 537

Query: 412  VTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTE 470
            V    +R + + +    NEW +P   S+  ATANA QV +A   G +VY E+  DG L E
Sbjct: 538  VHPKGIRHIRNGNV---NEWAAPQHRSIVAATANAHQVAVALSSGEIVYFEMDADGSLAE 594

Query: 471  VKHAQLEY-EISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEI 528
                +  +  ++CL +  + E    S   AVG   D +VRI SL P+  L  K       
Sbjct: 595  YDEKKEMFGTVTCLSLGDVPEGRLRSSFLAVGC-DDCTVRILSLDPESTLENKSVQALTA 653

Query: 529  IPRSVLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRT 582
             P S+ + A E  S      YL   L  G  L  +L+  TGELTD ++  LG + + L  
Sbjct: 654  APTSLAIIAMEDSSSGGSTLYLHIGLHSGVYLRTVLDEITGELTDTRQKFLGPKEVRLFQ 713

Query: 583  FSSKNTTHVFAASDRPTVIYSS--NKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGE 640
             + +  T V   S RP + Y+    K  + + +N  ++     F+S    + +   +   
Sbjct: 714  VTVQGKTCVLGLSSRPWLGYADPITKGFVVTPLNYVDLEWGWNFSSEQCEEGIVGIQGQS 773

Query: 641  LTIGTIDDIQKLHI-RSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFV----- 694
            L I  ID + +  I +SIPL   P+++        F      N +   E     +     
Sbjct: 774  LRIFNIDRLGETLIQKSIPLTYTPKKLVKHPDQPLFYTIEADNNTLPPELRAQLLADPGV 833

Query: 695  -----RLLDDQTF-----------------------EFISTYPLDTFEYGCSILSCSFSD 726
                 R+L  + F                       + + T  L+  E   S     FS 
Sbjct: 834  VNGDSRVLPPEDFGYPKGTRRWASCINVIDPLSEEGQVLQTIDLENNEAAVSAAIVPFSS 893

Query: 727  -DSNVYYCVGTA---YVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGK 782
             D+  +  +GT     V P         I  F+    +L+ I + + +    +L AF G+
Sbjct: 894  QDNESFLVIGTGKDMVVNPRSFSEGYLHIYRFLEGGRELEFIHKTKVEEPPLALLAFQGR 953

Query: 783  LLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIY 842
            +L A+   +++Y   +R    R+ Q+E      I++L   T+G  I+VGD+ + ++ ++Y
Sbjct: 954  VLVAVGTSLRIYDLGMR-QMLRKSQAEVATQ-QIVSL--NTQGSRIIVGDVQQGVTYVVY 1009

Query: 843  KHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDEERG-- 897
            K     +     D  A W +   ++D +   G +   N+F VR   K SE A +E+ G  
Sbjct: 1010 KPASNKLIPFVDDTIARWTTCTTMVDYESVAGGDKFGNMFIVRCPEKASEEADEEQSGLH 1069

Query: 898  -------------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIG 944
                         R+ ++  ++  +        SLV       VG    +++  + G IG
Sbjct: 1070 LINARDYLHGTPHRVSLMCHFYTQDIPTSITKASLV-------VGGQEVLLWSGIMGTIG 1122

Query: 945  V-IASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESF 1003
            V I  +  E   F + L+ +LR     + G +H  +R +         K  +DGDL E +
Sbjct: 1123 VFIPFVSREDADFFQNLEQHLRTEDPPLAGRDHLMYRGY-----YAPVKGVIDGDLCERY 1177

Query: 1004 LDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
              L   +   I+  ++ SV E+ +++  L
Sbjct: 1178 NLLPNDKKQMIAGELDRSVREIERKISYL 1206


>gi|390601867|gb|EIN11260.1| hypothetical protein PUNSTDRAFT_118747 [Punctularia strigosozonata
            HHB-11173 SS5]
          Length = 1214

 Score =  219 bits (559), Expect = 5e-54,   Method: Compositional matrix adjust.
 Identities = 265/1175 (22%), Positives = 483/1175 (41%), Gaps = 173/1175 (14%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
            ++G I +L  FR  G  +D+  + ++  +  +L +D ++S  +            R    
Sbjct: 58   VFGSIRSLAAFRLTGGTKDYAIVGSDSGRIVILDYDPKTSSFVKLHQETYGKSGARRIVP 117

Query: 66   GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIR-------LEELQVLDIKF 116
            GQ    DP  R  +IG      L  ++  D    L  +  +        +  +  LD+  
Sbjct: 118  GQYLATDPKGRAVMIGAMEKAKLVYILNRDAAANLTISSPLEAHKNAAIIHHIVGLDV-- 175

Query: 117  LYGCAKPTIVVLYQD-------------NKDARHVKTYEVALKDKDFVEGPWSQNNLDNG 163
              G   P    L  D             N   + +  YE+ L     V    S   +D  
Sbjct: 176  --GYENPMFAALEVDYTESDQDPSGEAFNNAEKMLTYYELDLGLNHVVRK--SSEPVDPR 231

Query: 164  ADLLIPVP----------PPLCGVLIIGEETIVYCSANAFK---AIPIR------PS--- 201
            A+LL+ VP          P   GVLI  E+ I+Y   +  +    IP R      P    
Sbjct: 232  ANLLVQVPGGQTGDRFDGP--SGVLICCEDHIIYRHPSGSQHRVPIPRRRHPLDDPERGV 289

Query: 202  -ITKA-YGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDN 259
             IT A   ++  D    L  +   L  + +  +  E V  LKI+      +A+ +  L +
Sbjct: 290  IITFALMHKIKGDFFFLLQSEEGDLFKVTIDYNGDEDVNALKIKYFDTVPVATGLCILKS 349

Query: 260  AVVYIGSSYGDSQLIKLNLQPD-------AKGSYVE------------------------ 288
              +++ S +GD  L +     D       +   Y +                        
Sbjct: 350  GFLFVASEFGDHHLYQFQKLGDDDDEPEFSSADYPQRGMAYPSQALPRAWFRPRPLDNLA 409

Query: 289  VLERYVNLGPIVDFCVVDL-ERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI 347
            +L+   +L PI+D  V++L       Q+    G     + R +R+G+ + E  S +L GI
Sbjct: 410  LLDTVESLSPIIDARVMNLLPNSDTPQIFAACGRGARSTFRTLRHGLEVEETVSSDLPGI 469

Query: 348  -KGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIY 406
               +W+ +   DD  D+++++SF++ T +L++   + +EE +  GF S   T+       
Sbjct: 470  PNAVWTTKLKEDDAHDSYIILSFVNGTLVLSIG--ETIEEVQDTGFLSSAPTIAVQQIGA 527

Query: 407  NQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-D 465
            + L+QV    +R V +  R   NEW+ P G ++  AT N  QV++A     LVY E+  +
Sbjct: 528  DALLQVHPQGIRHVLADRRV--NEWRVPAGKTIVTATTNKRQVVVALSSAELVYFELDLE 585

Query: 466  GILTEVKHAQ-LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHL 524
            G L E +  + +   +  L I  + E    +   AVG   D +VRI SL   + +    L
Sbjct: 586  GQLNEYQDRKAMGSTVLALSIGEVPEGRQRTPFLAVGC-EDQTVRIISLDPESTLETISL 644

Query: 525  GGEIIPRSVLLCA---------FEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGT 575
                 P + +  A              ++   L +G LL  +L+   G+LTD +   LGT
Sbjct: 645  QALTAPPTAICIADMLDASINKVHPTMFVNIGLQNGVLLRTVLDPINGQLTDTRTRFLGT 704

Query: 576  QPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAI 635
            +P+ L   + +    + A S R  + Y+ +  + ++ +  + +     F++   PD L  
Sbjct: 705  RPVRLIRVNVQRNPAILALSSRSWLNYTHHNLMHFTPLIFENLDFAWSFSAELCPDGLIG 764

Query: 636  AKEGELTIGTIDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEES----- 689
                 L I  I  +  KL   ++PL   PR+      +R F +    +++  EE+     
Sbjct: 765  IAGSVLRIFQIPRLGTKLKQDAMPLTYTPRKFIPHPANRYFYLIEGDHRTWGEEAVQKKV 824

Query: 690  ------------EM-------------------HFVRLLDDQTFEFISTYPLDTFEYGCS 718
                        EM                     +R++D    + ++T P+D  E   S
Sbjct: 825  AELRQAGKQVDEEMLSLPADTFGRPKAPAGTWASLIRIIDPVDAKTVATIPIDNNECAFS 884

Query: 719  ILSCSF-SDDSNVYYCVGTA---YVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVY 774
            I    + +    ++  VGTA   ++ P        R   F  E   L+L+ + ET     
Sbjct: 885  IAVVPWAARGGELHLVVGTAQDTFLAPRSCTSGFLRTYAFTNEGAGLELLHKTETDDVPM 944

Query: 775  SLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY-VQTRGDFIVVGDL 833
            ++ AF G+L+A + + +++Y     D G ++L  +  +     A+  + T+G  I+VGD+
Sbjct: 945  AVMAFQGRLVAGVGKSLRIY-----DIGKKKLLRKVENKSFATAIVTLATQGSRILVGDM 999

Query: 834  MKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEG 890
             +S++  +YK  E  +   A D    W+S+  ++D +  +  +   N+F  R   K SE 
Sbjct: 1000 QESMAYAVYKPPENRLLVFADDVQPRWISSSTMVDYNTVIAGDRFGNVFVNRLDAKVSEQ 1059

Query: 891  ATDEERG------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIP----TVIFGTVN 940
              D+  G      +  ++G  H  + +  +  G +V  L  S V  +P     +++  ++
Sbjct: 1060 VDDDPTGAGILHEKGLLMGAPHKTKMLCHYHVGDIVTSL--SKVALVPGGREVILYTGLH 1117

Query: 941  GVIGVIAS-LPHEQYLFLEKLQTNLRKVIK--GVGGLNHEQWRSFNNEKKTVDAKNFLDG 997
            G IGV+   +  E   F+  L+ ++R  +   G+ G +H  WR +      V  K  +DG
Sbjct: 1118 GTIGVLVPFVSKEDVDFISTLEQHMRTELTQFGLVGRDHLSWRGY-----YVPVKAVVDG 1172

Query: 998  DLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            DL E+F  L   +   I+  ++ +V E+ K++++L
Sbjct: 1173 DLCEAFATLPAPKQSSIAGELDRTVGEVLKKLDQL 1207


>gi|408400551|gb|EKJ79630.1| hypothetical protein FPSE_00190 [Fusarium pseudograminearum CS3096]
          Length = 1212

 Score =  219 bits (557), Expect = 8e-54,   Method: Compositional matrix adjust.
 Identities = 274/1174 (23%), Positives = 483/1174 (41%), Gaps = 168/1174 (14%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            +L   ++G I +L  FR  G  +D+L IA++  +  ++++    +      +        
Sbjct: 59   LLSHDVFGIIRSLAAFRLAGSNKDYLIIASDSGRITIIEYLPAQNRFQRLHLETFGKSGV 118

Query: 61   RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL- 117
            R    G+    DP  R  LI     + L  V+  +++ +L  +  +   +  VL I  + 
Sbjct: 119  RRVIPGEYLACDPKGRACLIASTEKNKLVYVLNRNSQAELTISSPLEAHKPGVLVISMVA 178

Query: 118  --YGCAKPTIVVLYQDNKDARHVKT-------------YEVALKDKDFVEGPWSQNNLDN 162
               G + P    L  D  +     T             YE+ L     V   WS + +D 
Sbjct: 179  LDVGYSNPVFAALEIDYSEVDQDSTGQAMEELDTQLVYYELDLGLNHVVRK-WS-DPVDP 236

Query: 163  GADLLIPVP-----PPLCGVLIIGEETIVYCSAN--AFK-AIPIRPSITK--AYGRVDAD 212
             A +L  VP     P   GVL+ GEE I Y  +N   F+ AIP R   T+     R    
Sbjct: 237  TASILFQVPGGNDGP--SGVLVCGEENITYRHSNQETFRVAIPRRRGATEDPNRKRTIVS 294

Query: 213  GSRYLLGDHAGLLHLLVITHEKE------------------KVTGLKIELLGETSIASTI 254
            G  + L  + G    L+ T + +                  +V  LK++      +AS++
Sbjct: 295  GIMHKLKGNTGAFFFLLQTDDGDLFKLSIDMIEDEEGNPTGEVKRLKVKYFDTVPVASSL 354

Query: 255  SYLDNAVVYIGSSYGDSQLIKL---------------NLQPDAKGSY------------V 287
              L +  +Y+ + +G+    +                +   D K SY            +
Sbjct: 355  CILKSGFLYVATQFGNYSFYQFEKLGDDDEELEFYSDDFPADPKASYEPVYFHPRPTENL 414

Query: 288  EVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI 347
             ++E    + P++D  V +L  +   Q+ T  G     S R++++G+ +NE  + EL GI
Sbjct: 415  ALVESIPAMNPLLDCKVANLTGEDAPQIYTICGNGPRSSFRMLKHGLEVNEIVASELPGI 474

Query: 348  -KGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIY 406
               +W+L+ +  + +D ++V+SF + T +L++   + +EE    GF +   TL       
Sbjct: 475  PSAVWTLKLNRSEQYDAYIVLSFTNGTLVLSIG--ETVEEVSDSGFLTSVPTLAAQLLGD 532

Query: 407  NQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GD 465
            + L+QV    +R + + +    NEW +P   S+  ATANA QV +A   G +VY E+  D
Sbjct: 533  DGLIQVHPKGIRHIRNGNV---NEWAAPQHRSIVAATANAHQVAVALSSGEIVYFEMDAD 589

Query: 466  GILTEVKHAQLEY-EISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEH 523
            G L E    +  +  ++CL +  + E    S   AVG   D +VRI SL P+  L  K  
Sbjct: 590  GSLAEYDEKKEMFGTVTCLSLGDVPEGRLRSSFLAVGC-DDCTVRILSLDPESTLENKSV 648

Query: 524  LGGEIIPRSVLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQP 577
                  P S+ + A E  S      YL   L  G  L  +L+  TGELTD ++  LG + 
Sbjct: 649  QALTAAPTSLAIIAMEDSSSGGSTLYLHIGLHSGVYLRTVLDEITGELTDTRQKFLGPKE 708

Query: 578  ITLRTFSSKNTTHVFAASDRPTVIYSS--NKKLLYSNVNLKEVSHMCPFNSAAFPDSLAI 635
            + L   + +  T V   S RP + Y+    K  + + +N  ++     F+S    + +  
Sbjct: 709  VRLFQVTVQGKTCVLGLSSRPWLGYADPITKGFVVTPLNYVDLEWGWNFSSEQCEEGIVG 768

Query: 636  AKEGELTIGTIDDIQKLHI-RSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFV 694
             +   L I  ID + +  I +SIPL   P+++        F      N +   E     +
Sbjct: 769  IQGQSLRIFNIDRLGETLIQKSIPLTYTPKKLVKHPDQPLFYTIEADNNTLPPELRAQLL 828

Query: 695  ----------RLLDDQTF-----------------------EFISTYPLDTFEYGCSILS 721
                      ++L  + F                       + + T  L+  E   S   
Sbjct: 829  ADPGVVNGDSKVLPPEDFGYPKGTRRWASCINVIDPLSEEGQVLQTIDLENNEAAVSAAI 888

Query: 722  CSFSD-DSNVYYCVGTA---YVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLN 777
             SFS  D+  +  +GT     V P         I  F+    +L+ I + + +    +L 
Sbjct: 889  VSFSSQDNESFLVIGTGKDMVVNPRSFSEGYLHIYRFLEGGRELEFIHKTKVEEPPLALL 948

Query: 778  AFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSI 837
            AF G++L A+   +++Y   +R    R+ Q+E      I++L   T+G  I+VGD+ + +
Sbjct: 949  AFQGRVLVAVGTSLRIYDLGMR-QMLRKSQAEVATQ-QIVSL--NTQGSRIIVGDVQQGV 1004

Query: 838  SLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDE 894
            + ++YK     +     D  A W +   ++D +   G +   N+F VR   K SE A +E
Sbjct: 1005 TYVVYKPASNKLIPFVDDTIARWTTCTTMVDYESVAGGDKFGNMFIVRCPEKASEEADEE 1064

Query: 895  ERG---------------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTV 939
            + G               R+ ++  ++  +        SLV       VG    +++  +
Sbjct: 1065 QSGLHLINARDYLHGTPHRVSLMCHFYTQDIPTSITKASLV-------VGGQEVLLWSGI 1117

Query: 940  NGVIGV-IASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGD 998
             G IGV I  +  E   F + L+ +LR     + G +H  +R +    K V     +DGD
Sbjct: 1118 MGTIGVFIPFVSREDADFFQNLEQHLRTEDPPLAGRDHLMYRGYYAPVKGV-----IDGD 1172

Query: 999  LIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            L E +  L   +   I+  ++ SV E+ +++ ++
Sbjct: 1173 LCERYNLLPNDKKQMIAGELDRSVREIERKISDI 1206


>gi|367027320|ref|XP_003662944.1| hypothetical protein MYCTH_2304190 [Myceliophthora thermophila ATCC
            42464]
 gi|347010213|gb|AEO57699.1| hypothetical protein MYCTH_2304190 [Myceliophthora thermophila ATCC
            42464]
          Length = 1211

 Score =  218 bits (556), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 281/1168 (24%), Positives = 490/1168 (41%), Gaps = 167/1168 (14%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
            ++G I  +  FR  G  +D++ +AT+  +  ++++  +++      +        R    
Sbjct: 64   VFGIIRAIASFRLAGSHKDYIILATDSGRIAIIEYQPKTNRFSRIHLETFGKSGVRRVVP 123

Query: 66   GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YGC 120
            GQ    DP  R  LI     + L  V+  + + +L  +  +   +  VL +  +    G 
Sbjct: 124  GQYLAADPKGRACLIASIEKNKLVYVLNRNAQAELTISSPLEAHKPGVLVLSLVALDVGY 183

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDK------------DFVEGPWSQNNLDNGADLLI 168
            A P    L  D  +A    T E A + +            + V   WS + +D  + LL 
Sbjct: 184  ANPVFAALEIDYTEADQDPTGEAAKETEAQLVYYELDLGLNHVVRKWS-DVVDPTSSLLF 242

Query: 169  PVP-----PPLCGVLIIGEETIVYCSAN--AFKA-IPIRPSITK--AYGRVDADGSRYLL 218
             VP     P   GVL+ GEE I Y  +N  AF+  IP R   T+     R    G  + L
Sbjct: 243  QVPGGNDGP--SGVLVCGEENITYRHSNQEAFRVPIPRRRGATEDPQRKRTIVSGVMHKL 300

Query: 219  GDHAGLLHLLVITHEKE------------------KVTGLKIELLGETSIASTISYLDNA 260
               AG    LV T + +                  +V  LKI+      IA ++  L + 
Sbjct: 301  KGSAGAFFFLVQTEDGDLMKVTLDMVEDNDGNPTGEVRRLKIKYFDTIPIAKSLCILKSG 360

Query: 261  VVYIGSSYGDSQLIKL---------------NLQPDAKGSYVEV------LERYV----- 294
             ++  S +G+    +                +   D + SY  V      LE  V     
Sbjct: 361  FLFAASEFGNHHFYQFEKLGDDDDEQEFTSDDFPTDPRASYNPVYFRPRPLENLVLVESI 420

Query: 295  -NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKG-MWS 352
             ++ P+VD  V +L  +   Q+    G     + R++++G+ ++E  + EL G    +W+
Sbjct: 421  DSMNPLVDCKVANLTGEDAPQIYAVCGNRARSTFRMLKHGLEVSEIVASELPGTPAAVWT 480

Query: 353  LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
             + +  D +D ++V+SF + T +L++   + +EE    GF +   TL         L+QV
Sbjct: 481  TKLTKYDEYDGYIVLSFTNATLVLSIG--ETVEEVTESGFLTSVPTLAVQQMGEEGLIQV 538

Query: 413  TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTEV 471
                +R +        NEW +P   S+  A  N +QV++A   G +VY E+  DG L E 
Sbjct: 539  HPKGIRHIVQGRV---NEWPAPQHRSIVAAATNENQVVIALSSGEIVYFEMDADGSLAEY 595

Query: 472  -KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEII 529
             +  ++   ++ L I  + E    S   AVG   D +VRI SL P+  L  K        
Sbjct: 596  DEKKEMSGTVTSLSIGKVPEGLRRSSFLAVGC-DDCTVRILSLDPESTLEMKSIQALTSA 654

Query: 530  PRSVLLCAFE----GIS-YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFS 584
            P S+ + + E    G++ YL   L  G  L  +L+  TGELTD ++  LG +P  L   S
Sbjct: 655  PSSLSIMSMEDSTGGMTLYLHIGLHSGVYLRTVLDEVTGELTDTRQKFLGPKPTKLFQVS 714

Query: 585  SKNTTHVFAASDRPTVIYSS--NKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELT 642
             +N   V A S RP + Y+    K  + + +N  E+ +   FNS    + +       L 
Sbjct: 715  VQNQACVLALSSRPWLGYTDPITKNFVMTPLNYSELEYGWNFNSEQCLEGMVGIHANFLR 774

Query: 643  IGTIDDIQKLHI-RSIPLGEHPRRIC-HQEQSRTFAICS--------LKNQSCAEE---- 688
            I TID + +  I +SIPL   P+R+  H EQ   + I S        L+ Q  A+     
Sbjct: 775  IFTIDKLGETMIQKSIPLTYTPKRLVKHPEQPYFYTIESDNNTIPPELRAQLLAQSGAVN 834

Query: 689  ---------------------SEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSF-SD 726
                                 S +  V  L D+  + +    L+  E   S     F S 
Sbjct: 835  GDATVLPPEDFGYPRATGRWASCISIVDPLGDEP-KVLQRIDLEGNEAAVSAAVVPFASQ 893

Query: 727  DSNVYYCVGTAY-VLPEENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVYSLNAFNGKL 783
            +   +  VGT   ++    + T+G I V+   EDG+ L+ I + + +    +L  F G+L
Sbjct: 894  EGESFLVVGTGKDMVLNPRKFTEGYIHVYRFHEDGRELEFIHKTKVEEPPLALIPFQGRL 953

Query: 784  LAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYK 843
            LA I + +++Y   LR    + L+   G     L + +QT+G  I+VGD+ + ++ ++YK
Sbjct: 954  LAGIGKMLRVYDLGLR----QLLRKAQGEVAPQLIVTLQTQGSRIIVGDVQQGVTYVVYK 1009

Query: 844  HEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR--KNSEGATDEERGRLEV 901
             E   +   A D    W +   ++D +   G +   N++ +R  + +   +DE    +++
Sbjct: 1010 PESNKLLVFADDTINRWTTCTTMVDYESVAGGDKFGNVWILRCPERASQESDEPGSEIQL 1069

Query: 902  VGEYHLGEFVNRFRHGS----------LVMRLPDS------DVGQIPTVIFGTVNGVIGV 945
            +          ++ HG+              LP S       VG    +++  + G +GV
Sbjct: 1070 L-------HARKYLHGAPNRLDLMVHFYTQDLPTSIVKTNLVVGGQDVLVWSGIQGTVGV 1122

Query: 946  -IASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFL 1004
             I  +  E   F + L++++R     + G +H  +R +      V  K  +DGDL E F 
Sbjct: 1123 LIPFVSREDVDFFQSLESHMRAEDPPLAGRDHLIYRGY-----YVPVKGVIDGDLCERFS 1177

Query: 1005 DLSRTRMDEISKTMNVSVEELCKRVEEL 1032
             L   +   I+  ++ SV E+ +++ ++
Sbjct: 1178 LLPNDKKQMIAGELDRSVREIERKISDI 1205


>gi|402592185|gb|EJW86114.1| CPSF A subunit region family protein [Wuchereria bancrofti]
          Length = 278

 Score =  218 bits (556), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 110/265 (41%), Positives = 170/265 (64%), Gaps = 6/265 (2%)

Query: 752  LVFIVEDG--KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSE 809
            ++F   +G  +++L+ EKE KGA YS+ + +GKL+ A+N  ++L++W       +EL+ E
Sbjct: 1    MMFQASEGPERMRLVYEKEIKGAAYSIQSMDGKLVVAVNSCVRLFEWT----ADKELRLE 56

Query: 810  CGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDD 869
            C    ++ ALY++T+ D I+VGDLM+S+SLL YK  E   E+ ARD+  NWMSA EI+D 
Sbjct: 57   CSDFDNVTALYLKTKNDLILVGDLMRSLSLLSYKSMESTFEKVARDFMTNWMSACEIIDS 116

Query: 870  DIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVG 929
            D +LGAEN++NLFTV K+S     EE  RL+ +G ++LGE VN F HGSL     D    
Sbjct: 117  DNFLGAENSYNLFTVMKDSFTVFKEEGTRLQELGLFYLGEMVNVFCHGSLTATQVDVAPL 176

Query: 930  QIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTV 989
               ++++GT +G IGVI  +P   Y FL+ +Q  L +  +    ++H Q+R+F  EK++ 
Sbjct: 177  YHSSILYGTSDGGIGVIVQMPPVLYTFLQDVQKRLAEYAENCMRISHTQYRTFETEKRSE 236

Query: 990  DAKNFLDGDLIESFLDLSRTRMDEI 1014
                F+DGDLIES LD+ +  ++++
Sbjct: 237  APNGFIDGDLIESLLDMGKDSVEQV 261


>gi|428180158|gb|EKX49026.1| hypothetical protein GUITHDRAFT_68305 [Guillardia theta CCMP2712]
          Length = 1202

 Score =  218 bits (555), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 274/1169 (23%), Positives = 477/1169 (40%), Gaps = 172/1169 (14%)

Query: 7    YGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDNG 66
            +G I ++  FR  G  +D+L +  +  +  VL++  E ++     +        R    G
Sbjct: 58   FGLIRSMASFRLPGSNKDYLVLGADSGRISVLEFSKERNQFERVHLETYGKSGCRRIVPG 117

Query: 67   QIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL-----YGCA 121
            Q    DP  R + +   +    V  F+     K   +  LE  +   I F       G  
Sbjct: 118  QFLASDPKGRAVMISAIEKQKLVYVFNRDASSKLTISSPLEAHKASTIHFSIVGVDVGFD 177

Query: 122  KPTIVVLYQDNKDARHVKT-------------YEVALKDKDFVEGPWSQNNLDNGADLLI 168
             P    L  D  DA   +T             YE+ L     V    +   +D  +++LI
Sbjct: 178  NPIFAALEMDYSDADADETGQSAEEFNKVLTFYELDLGLNHVVRK--ASEPIDAASNMLI 235

Query: 169  PVPPPL---CGVLIIGEETIVYCSAN---AFKAIPIR-------PSITKAYGRV-DADGS 214
            PVP       GVL+  E  I Y   +       IP R       P +   Y  +   DG 
Sbjct: 236  PVPGDTDGPSGVLVCAENKIAYKKPDHEDVVALIPRRQGMPLDQPLLITGYSHLKQKDGF 295

Query: 215  RYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLI 274
             +LL    G L+ L +T+  E+V+ + I       +A +I+ L    +++ S +G+  L 
Sbjct: 296  FFLLQSEIGDLYRLTLTYNDEEVSEINITYFDTVPVAQSITILKTGFLFVASEFGNHALY 355

Query: 275  KL---------NLQP---DAKGSYVEV-------------LERYVNLGPIVDFCVVDLER 309
            +          ++ P   + +G  +E+             ++   +L PI+D  V+DL  
Sbjct: 356  QFLSIKGSDESDMMPVEVEIEGETIEIPHFAPRPLKNLLLVDEMESLSPILDMRVLDLAG 415

Query: 310  QGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIK-GMWSLRSSTDDPFDTFLVVS 368
            +   Q+    G     +LR +R+G+ + E A  EL      +W+++ S+ D  D ++VV+
Sbjct: 416  EETPQIYALCGKGPRSTLRTLRHGLAVAEMAVSELPSNPLAVWTVKGSSKDAADKYIVVT 475

Query: 369  FISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELR 428
            F + T +L++   D +EE    GF +  +TL       + L+QV    +R V S  R   
Sbjct: 476  FANATIVLSIG--DTVEEVTDSGFLATNKTLSVSLLGDDSLLQVHPNGLRTVRSDKR--I 531

Query: 429  NEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEY---EISCLDI 485
            +++  P    +N+   N  QV++A     ++Y E+ D +    + A+ E    +I+ LDI
Sbjct: 532  SQFTPPNKGVINLVAVNQQQVVVALADHTMIYFEL-DAVGQLQEKAKPEIGGGQIAALDI 590

Query: 486  NPIGENPSYSQIAAVGMWTDIS--VRIFSLPD---LNLITKEHLGGEIIPRSVLLCAFEG 540
            +P+G   S  +  AVG   D S  VRI SL     +++++++ L  +  P S  LC  E 
Sbjct: 591  SPLGAGRSRGRFLAVGASVDGSWFVRILSLDPGSFMHIVSRQALPAK--PES--LCLIE- 645

Query: 541  IS-------------YLLCALGDGHLLNFLLNMKTGELTDRKKVS-LGTQPITLRTFSSK 586
            IS             +L   L +G L+   ++  TG+L    +   LGT+P+ L     +
Sbjct: 646  ISIGAAQSDGGTPTLFLFAGLENGVLMRITVDPITGQLAPEFRTRFLGTKPVKLFKVLVQ 705

Query: 587  NTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTI 646
                V A S R  + Y+   +   + ++ + + +   F S   P+         L I T+
Sbjct: 706  EQPAVLALSSRSWLAYNFQGRYQITPLSYETLEYASGFASDQCPEGFVCVAANTLRILTV 765

Query: 647  DDI-QKLHIRSIPLGEHPRRICHQEQSRTFAI----------------CSLK-------- 681
            + + +  +  S+ L   PR+    + S  F +                C +         
Sbjct: 766  ERLGEVFNQHSMKLSFTPRKSALLKDSGCFVVLETDHNSDMEKAKEARCDVAEDEEEENE 825

Query: 682  ---------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSD-DSNVY 731
                      Q   E      VR++D    E      LD  E   S+   +F D   + +
Sbjct: 826  NKLPHSIYGEQRAGEAKWASRVRVIDPNERETKQIIELDPNEAALSVCVATFYDRKGHTF 885

Query: 732  YCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKI 791
             C GTA      +    G +  + V   +L  + +    G   +L +F G+LL  +   +
Sbjct: 886  LCFGTAVGHKVGSRTGSGFLHTYSVVGSQLTFVHKTPIDGVPRALCSFQGRLLVGVGSAL 945

Query: 792  QLYKWMLRDDGTRELQSECGHHG-HILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIE 850
            +LY+      G R+L  +C +     L + + T GD I VGD+ +SIS L Y      + 
Sbjct: 946  RLYEM-----GKRKLLRKCENRNIPNLVVTISTMGDRIYVGDVAESISFLKYNRILNELV 1000

Query: 851  ERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR--KNSEGATDEERG----------- 897
              A D +  WM+A   +D D   GA+   N+F  R   N      EE G           
Sbjct: 1001 IFADDTHPRWMTAACPVDYDTVAGADKFGNIFLTRLPDNVSDEISEEPGAVGMFEGNDLQ 1060

Query: 898  ----RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIAS-LPHE 952
                + E + +YH+GE V   +  +L         G    +I+GT+ G IG +   +  E
Sbjct: 1061 GAHYKAEEIVQYHVGETVCSLQKATL-------SPGGSDAIIYGTMYGGIGALQPFVSRE 1113

Query: 953  QYLFLEKLQTNLRKVI---------KGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESF 1003
               F   L+ +LR            +G+ G +   +RS+         K+ +DGDL E+F
Sbjct: 1114 DVDFFLHLEMHLRGAAGAREHKPAGEGICGRDQLSFRSY-----YFPVKDVVDGDLCETF 1168

Query: 1004 LDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
              LS +R  +I++ ++ +  E+ K++E++
Sbjct: 1169 NYLSPSRQKQIAEDLDRTPGEVAKKLEDM 1197


>gi|342885857|gb|EGU85809.1| hypothetical protein FOXB_03657 [Fusarium oxysporum Fo5176]
          Length = 1189

 Score =  218 bits (555), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 277/1163 (23%), Positives = 481/1163 (41%), Gaps = 169/1163 (14%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            +L   ++G I +L  FR  G  +D+L IA++  +  ++++    +      +        
Sbjct: 59   LLSHDVFGIIRSLAAFRLAGSNKDYLIIASDSGRITIIEYLPAQNRFHRLHLETFGKSGV 118

Query: 61   RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL- 117
            R    G+    DP  R  LI     + L  V+  +++ +L  +  +   +  VL I  + 
Sbjct: 119  RRVIPGEYLACDPKGRACLIASTEKNKLVYVLNRNSQAELTISSPLEAHKPGVLVISMVA 178

Query: 118  --YGCAKPTIVVLYQDNKDARHVKT-------------YEVALKDKDFVEGPWSQNNLDN 162
               G + P    L  D  +     T             YE+ L     V   WS + +D 
Sbjct: 179  LDVGYSNPVFAALEIDYSEIDQDSTGQAMEELDTQLVYYELDLGLNHVVRK-WS-DPVDP 236

Query: 163  GADLLIPVP-----PPLCGVLIIGEETIVYCSAN--AFK-AIPIRPSITK--AYGRVDAD 212
             A +L  VP     P   GVL+ GEE I Y  +N  AF+ AIP R   T+     R    
Sbjct: 237  TASILFQVPGGNDGP--SGVLVCGEENITYRHSNQEAFRVAIPRRRGATEDPNRKRTIVS 294

Query: 213  GSRYLLGDHAGLLHLLVITHEKE------------------KVTGLKIELLGETSIASTI 254
            G  + L   AG    L+ T + +                  +V  LKI+      +AS++
Sbjct: 295  GIMHKLKGSAGAFFFLLQTDDGDLFKLSIDMIEDEEGNPTGEVKRLKIKYFDTVPVASSL 354

Query: 255  SYLDNAVVYIGSSYGDSQLIKL---------------NLQPDAKGSY------------V 287
              L +  +Y+ S +G+    +                +   D + SY            +
Sbjct: 355  CILKSGFLYVASQFGNYSFYQFEKLGDDDEELEFYSDDFPADPRASYEPVYFHPRPTENL 414

Query: 288  EVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI 347
             ++E    + P++D  V +L  +   Q+ T  G     S R++++G+ +NE  + EL GI
Sbjct: 415  ALVESIPAMNPLLDCKVANLTGEDAPQIYTICGNGPRSSFRMLKHGLEVNEIVASELPGI 474

Query: 348  -KGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIY 406
               +W+L+ +  + +D ++V+SF + T +L++   + +EE    GF +   TL       
Sbjct: 475  PSAVWTLKLNRSEQYDAYIVLSFTNGTLVLSIG--ETVEEVSDSGFLTSVPTLAAQLLGD 532

Query: 407  NQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GD 465
            + L+QV    +R V +      NEW +P   S+  ATANA QV +A   G +VY E+  D
Sbjct: 533  DGLIQVHPKGIRHVRNGHV---NEWAAPQHRSIVAATANAHQVAVALSSGEIVYFEMDAD 589

Query: 466  GILTEVKHAQLEY-EISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEH 523
            G L E    +  +  ++CL +  + E    S   AVG   D +VRI SL P+  L  K  
Sbjct: 590  GSLAEYDEKKEMFGTVTCLSLGDVPEGRLRSSFLAVGC-DDCTVRILSLDPESTLENKSV 648

Query: 524  LGGEIIPRSVLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQP 577
                  P S+ + A +  S      YL   L  G  L  +L+  TGELTD ++  LG + 
Sbjct: 649  QALTAAPTSLAIIAMDDSSSGGSTLYLHIGLHSGVYLRTVLDEVTGELTDTRQKFLGPKE 708

Query: 578  ITLRTFSSKNTTHVFAASDRPTVIYSS--NKKLLYSNVNLKEVSHMCPFNSAAFPDSLAI 635
            + L   + +  T V   S RP + Y+    K  + + +N  ++     F+S    + +  
Sbjct: 709  VRLFQVTVQGKTCVLGLSSRPWLGYADPITKGFVVTPLNYVDLEWGWNFSSEQCEEGIVG 768

Query: 636  AKEGELTIGTIDDIQKLHI-RSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFV 694
             +   L I  ID +    I +SIPL   P+++        F      N +   E     +
Sbjct: 769  IQGQSLRIFNIDRLGDTLIQKSIPLTYTPKKLVKHPDQPLFYTIEADNNTLPPELRAQLL 828

Query: 695  ----------RLLDDQTF-----------------------EFISTYPLDTFEYGCSILS 721
                      R+L  + F                       + + T  L+  E   S   
Sbjct: 829  ADPKIVNGDSRVLPPEDFGYPKGTRRWASCINVIDPLSEEGQVVQTIDLENNEAAVSAAI 888

Query: 722  CSFSD-DSNVYYCVGTAY-VLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAF 779
             SFS  D+  +  VGT   ++      ++G + ++  +DG           G   +L AF
Sbjct: 889  VSFSSQDNESFLVVGTGKDMVVNPRSYSEGYLHIYRFQDG-----------GENLTLLAF 937

Query: 780  NGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISL 839
             G++  A+  ++++Y   +R    R+ Q+E      I++L   T+G  I+VGD+ + ++ 
Sbjct: 938  QGRVAVAVGTQLRIYDLGMR-QMLRKSQAEVAAQ-QIVSL--NTQGSRIIVGDVQQGVTY 993

Query: 840  LIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDEER 896
            ++YK     +     D  A W +   ++D +   G +   N+F VR   K SE A DEE+
Sbjct: 994  VVYKPASNKLIPFVDDTIARWTTCTTMVDYESVAGGDKFGNMFIVRCPEKASEEA-DEEQ 1052

Query: 897  GRLEVVGEYHLGEFVNRFRHGSLVMRLPDS------DVGQIPTVIFGTVNGVIGV-IASL 949
              L ++           + HG+    +P S       VG    +++  + G IGV I  +
Sbjct: 1053 TGLHLIN-------AREYLHGTPHRDIPTSITKTSLVVGGQEILLWSGIMGTIGVFIPFI 1105

Query: 950  PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRT 1009
              E   F + L+ +LR     + G +H  +R +         K  +DGDL E +  L   
Sbjct: 1106 SREDADFFQNLEQHLRTEDPPLAGRDHLMYRGY-----YAPVKGVIDGDLCERYNLLPND 1160

Query: 1010 RMDEISKTMNVSVEELCKRVEEL 1032
            +   I+  ++ SV E+ +++ ++
Sbjct: 1161 KKLMIAGELDRSVREIERKISDI 1183


>gi|400597418|gb|EJP65151.1| CPSF A subunit region [Beauveria bassiana ARSEF 2860]
          Length = 1212

 Score =  218 bits (554), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 279/1170 (23%), Positives = 496/1170 (42%), Gaps = 170/1170 (14%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
            I+G + ++ +FR  G ++D++ +AT+  +  VL++   S    +R   +   + G R   
Sbjct: 64   IFGVLRSIAVFRLAGSSKDYIILATDSGRITVLEY-LPSPNRFSRLHMETFGKTGIRRVV 122

Query: 65   NGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YG 119
             G+    DP  R  LI     + L  V+  +++ +L  +  +   +  VL I       G
Sbjct: 123  PGEYLACDPKGRACLISAVEKNKLVYVLNRNSQAELTISSPLEAHKPGVLVIALTALDVG 182

Query: 120  CAKPTIVVLY-------QDNKDA------RHVKTYEVALKDKDFVEGPWSQNNLDNGADL 166
             A P    L        QDN          H+  YE+ L     V   WS + +D  A L
Sbjct: 183  YANPVFAALEIDYTEVDQDNTGEALSEVETHLVYYELDLGLNHVVRK-WS-DPVDPTASL 240

Query: 167  LIPVP-----PPLCGVLIIGEETIVYCSAN--AFKA-IPIRPSITK--AYGRVDADGSRY 216
            L  VP     P   GVL+ GEE + Y  +N  A +  IP R   T+  +  R    G  +
Sbjct: 241  LFQVPGGNDGP--SGVLVCGEENVTYRHSNQDALRVPIPRRRGATEDPSRKRNIVAGVMH 298

Query: 217  LLGDHAGLLHLLVITHEKE------------------KVTGLKIELLGETSIASTISYLD 258
             L   AG    L+ T + +                  +V  +KI+      +A+++  L 
Sbjct: 299  KLKGSAGAFFFLLQTDDGDLFKITIDMVEDEEGAPTGEVQRMKIKYFDTVPVATSLCILK 358

Query: 259  NAVVYIGSSYGDS---QLIKL------------NLQPDAKGSY------------VEVLE 291
            +  +Y+ S +G+    Q  KL            +   D   +Y            + +++
Sbjct: 359  SGFLYVASQFGNYAFYQFEKLGDDDDELEFSSDDFPVDPLAAYEPVYFYPRPAENLALVD 418

Query: 292  RYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGM 350
                + P++D  V +L  +   Q+ +  G     + R +++G+ +NE  + EL G+   +
Sbjct: 419  SIPAMNPLLDCKVANLTGEDAPQIYSICGNGARSTFRTIKHGLEVNEIVASELPGVPSAV 478

Query: 351  WSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLV 410
            W+L+ ++D+ +DT++V+SF + T +L++   + +EE    GF +   T+       + L+
Sbjct: 479  WTLKLNSDEQYDTYIVLSFTNGTLVLSIG--ETVEEVSDSGFLTSVPTIAAQLLGTDGLI 536

Query: 411  QVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILT 469
            QV    +R + + +    NEW +P   S+  A+ NA QV +A   G +VY E+  DG L 
Sbjct: 537  QVHPRGIRHIRNGNV---NEWSAPQHRSIVAASTNAHQVAIALSSGEIVYFEMDSDGSLA 593

Query: 470  EVKHAQLEY-EISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGE 527
            E    +  +  ++ L +  + E    S   AVG   D +VRI SL P+  L  K      
Sbjct: 594  EYDEKKEMFGTVTALSLGEVPEGRVRSSFLAVGC-DDATVRILSLDPESTLENKSVQALT 652

Query: 528  IIPRSVLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLR 581
              P S+ +   +  S      YL   L  G  L  +L+  TG+L+D ++  LG +P+ L 
Sbjct: 653  AAPTSLAIIPMDDSSSGGSTLYLHIGLHSGVYLRTVLDEITGDLSDTRQKFLGPKPVRLF 712

Query: 582  TFSSKNTTHVFAASDRPTVIYSS--NKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEG 639
              S    T V   S RP + Y+         + +N  ++     F+S    D +   +  
Sbjct: 713  QVSVGGRTCVLGLSSRPWLGYADPITNSFEITPLNYVDLEWGWNFSSEQCEDGVVGIQGQ 772

Query: 640  ELTIGTIDDIQKLHIR-SIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFV---- 694
             L I +ID + +  I+ SIPL   P++         F      N +   + +   +    
Sbjct: 773  SLRIFSIDRLGETLIQSSIPLTYTPKKFVKHPSEPLFYTIEADNHTLPPDLQAKLLADPA 832

Query: 695  ------RLLDDQTF----------------EFISTYP-------LDTFEYGCSILSCSF- 724
                  ++L  + F                + +S  P        +  E   S    SF 
Sbjct: 833  AVNGDAKVLPPEEFGHPRGNRRWASCISVVDPVSEEPSVLQKVDFENNEAAVSAAVVSFA 892

Query: 725  SDDSNVYYCVGTAY-VLPEENEPTKGRILVFIVEDG--KLQLIAEKETKGAVYSLNAFNG 781
            S D+  +  VGT   ++      ++  I ++  ++G  +L+ I + + +    +L AF G
Sbjct: 893  SQDNESFLVVGTGKDMILNPRSSSEAYIYIYRFQEGGRELEFIHKTKIEEPAMALLAFQG 952

Query: 782  KLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLI 841
            KLLA I + +++Y   +R    R+ Q+E      I++L   T+G  IVVGD+ + ++L++
Sbjct: 953  KLLAGIGKTLRMYDLGMR-QLLRKAQAEVVPQ-QIVSL--NTQGSRIVVGDVQQGVTLVV 1008

Query: 842  YKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDEERG- 897
            YK     +   A D  A W +   ++D +   G +   N+F VR   K SE A +E+ G 
Sbjct: 1009 YKPASNKLIPFADDTIARWTTCTTMVDYESVAGGDKFGNMFIVRSPAKASEEADEEQAGL 1068

Query: 898  --------------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVI 943
                          RLE++  +   +        SLV       VG    +++  + G I
Sbjct: 1069 HLVNARDYLHGAQHRLELMCHFFTQDVPTSINKTSLV-------VGGQDVLLWSGIMGTI 1121

Query: 944  GV-IASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIES 1002
            GV I  +  E   F + L+ +LR     + G +H  +RS+    K V     +DGDL E 
Sbjct: 1122 GVFIPFVSREDADFFQSLEQHLRTEDAPLAGRDHLMYRSYYAPVKGV-----IDGDLCER 1176

Query: 1003 FLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            F  L   +   ++  ++ SV E+ +++ ++
Sbjct: 1177 FAALPNDKKQMMAGELDRSVREIERKISDI 1206


>gi|223647430|gb|ACN10473.1| Splicing factor 3B subunit 3 [Salmo salar]
          Length = 1217

 Score =  217 bits (553), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 268/1190 (22%), Positives = 492/1190 (41%), Gaps = 189/1190 (15%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            +L + ++G I +L  FR  G  +D++ + ++  +  +L++   S  +  +   +   + G
Sbjct: 53   LLTMEVFGIIRSLMAFRLTGGTKDYIVVGSDSGRIVILEYHP-SKNMFEKIHQETFGKSG 111

Query: 61   -RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
             R    GQ   +DP  R  +IG      L  ++  D   +L  +  +   +   L    +
Sbjct: 112  CRRIVPGQFLAVDPKGRAVMIGAIEKQKLVYILNRDAAARLTISSPLEAHKANTLVYHVV 171

Query: 118  ---YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNG 163
                G   P    L  D ++A +  T E A   +    F E     N+        L+  
Sbjct: 172  GVDVGFENPMFACLEMDYEEADNDPTGEAAANTQQTLTFYELDLGLNHVVRKYSEALEEH 231

Query: 164  ADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR--- 215
             + LI VP     P   GVLI  E  I Y   N      IR  I +    +D D  R   
Sbjct: 232  GNFLITVPGGSDGP--SGVLICSENYITY--KNFGDQPDIRCPIPRRRNDLD-DPERGMI 286

Query: 216  --------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAV 261
                          +L     G +  + +  ++E VT ++++      +A+++  L    
Sbjct: 287  FVCSATHKTKSMFFFLAQTEQGDIFKVTLETDEEMVTEIRMKYFDTIPVATSMCVLKTGF 346

Query: 262  VYIGSSYGDSQLIKLN------------------------LQPDAKGSYVEVLERYVNLG 297
            +++ S +G+  L ++                          QP    + V V E+  NL 
Sbjct: 347  LFVSSEFGNHYLYQIAHLGDDDEEPEFSSAMPLEEGDTFFFQPRPLKNLVLVDEQE-NLS 405

Query: 298  PIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSS 356
            PI+   + DL  +   Q+    G     +LR++R+G+ ++E A  EL G    +W++R  
Sbjct: 406  PIMSCQIADLANEDTPQLYVACGRGPKSTLRVLRHGLEVSEMAVSELPGNPNAVWTVRRH 465

Query: 357  TDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGS 416
             +D FD +++VSF++ T +L++   + +EE    GF   T TL C     + LVQV    
Sbjct: 466  IEDEFDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSLLGEDALVQVYPDG 523

Query: 417  VRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKHA 474
            +R + +  R   NEWK+P   ++     N  QV++A  GG LVY E+   G L E  +  
Sbjct: 524  IRHIRADKR--VNEWKTPGKKTIVRCAVNQRQVVIALTGGELVYFEMDPSGQLNEYTERK 581

Query: 475  QLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPD---LNLITKEHLGGEIIPR 531
            ++  ++ C+ +  +      S+  AVG+  D +VRI SL     L  ++ + L  +  P 
Sbjct: 582  EMSADVVCMSLANVPPGEQRSRFLAVGL-VDNTVRIISLDPSDCLQPLSMQALPAQ--PE 638

Query: 532  SVLLCAFEGIS--------------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQP 577
            S+ +    G+               YL   L +G LL  +L+  TG+L+D +   LG++P
Sbjct: 639  SLCIVEMGGVEKEDELGVKGTVGFLYLNIGLQNGVLLRTVLDPVTGDLSDTRTRYLGSRP 698

Query: 578  ITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAK 637
            + L     +    V A S R  + YS   +   + ++ + + +   F S   P+ +    
Sbjct: 699  VKLFRVRMQGQEAVLAMSSRSWLSYSYQSRFHLTPLSYETLEYASGFASEQCPEGIVAIS 758

Query: 638  EGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQE-----------QSRTFAICSLKNQS 684
               L I  ++ +  +  + + PL   PR+ + H E            + T A  + + Q 
Sbjct: 759  TNTLRILALEKLGAVFNQVAFPLQYTPRKFVIHPETNNLILIETDHNAYTEATKAQRKQQ 818

Query: 685  CAEE-------------SEM-------------------------HFVRLLDDQTFEFIS 706
             AEE             +EM                           VRL++      + 
Sbjct: 819  MAEEMVEAAGEDERELAAEMAAAFLNENLPEAIFGAPKAGSGQWASLVRLVNPIQGTTLD 878

Query: 707  TYPLDTFEYGCSILSCSFSDDSNVYYC-VGTAY-VLPEENEPTKGRILVFIVEDG--KLQ 762
               L+  E   S+  C F+     +Y  VG A  ++        G I  + +  G  KL+
Sbjct: 879  LVQLEQNEAAFSVTVCRFASGGEDWYVLVGVARDMILNPRSVGGGFIYTYRLGGGGDKLE 938

Query: 763  LIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSEC-GHHGHILALYV 821
             + +   +    ++  F G++L  + + +++Y     D G ++L  +C   H   L   +
Sbjct: 939  FMHKTPVEDVPLAIAPFQGRVLVGVGKLLRIY-----DLGKKKLLRKCENKHVPNLVTGI 993

Query: 822  QTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNL 881
             T G  ++V D+ +S+  + YK  E  +   A D +  W++   +LD D    A+   N+
Sbjct: 994  HTIGQRVIVTDVQESLFWVRYKRNENQLIIFADDTHPRWVTTACLLDYDTMASADKFGNI 1053

Query: 882  FTVR--KNSEGATDE---------ERGRL-------EVVGEYHLGEFVNRFRHGSLVMRL 923
              VR   N+    DE         +RG L       EV+  YH+GE V   +  +L+   
Sbjct: 1054 SIVRLPPNTSDDVDEDPTGNKALWDRGLLNGASQKAEVIMNYHIGETVLSLQKTTLI--- 1110

Query: 924  PDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSF 982
                 G   ++++ T++G IG++     HE + F + L+ ++R     + G +H  +RS+
Sbjct: 1111 ----PGGSESLVYTTLSGGIGILVPFTSHEDHDFFQHLEMHMRSEFPPLCGRDHLSFRSY 1166

Query: 983  NNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
                     KN +DGDL E F  +   +   +++ ++ +  E+ K++E++
Sbjct: 1167 -----YFPVKNVIDGDLCEQFNSMDPHKQKSVAEELDRTPPEVSKKLEDI 1211


>gi|443694993|gb|ELT96001.1| hypothetical protein CAPTEDRAFT_155561 [Capitella teleta]
          Length = 1215

 Score =  217 bits (553), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 267/1187 (22%), Positives = 502/1187 (42%), Gaps = 187/1187 (15%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            +L   ++G I +L  FR  G ++D++ + ++  +  +L++ A S  +  +   +   + G
Sbjct: 53   LLTSEVFGVIRSLMPFRLTGGSKDYIILGSDSGRIVILEYVA-SKNIFEKIQQETYGKSG 111

Query: 61   -RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
             R    GQ   IDP  R  ++G      L  ++  D + +L  +  +   +   L    +
Sbjct: 112  CRRIVPGQQLAIDPRGRAVMVGAVEKQKLVYILNRDAQARLTISSPLEAHKANTLGYYMV 171

Query: 118  ---YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNG 163
                G   PT   L  D ++A    T E A K +    + E     N+        L+  
Sbjct: 172  GVDVGFENPTFACLEVDYEEADSDPTGESASKLQQMLTYYELDLGLNHVVRKYSEPLEEH 231

Query: 164  ADLLIPVP---PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR----- 215
            A++LI VP       GVLI  E  I Y   N      IR  I +    +D D  R     
Sbjct: 232  ANMLISVPGGSEGPSGVLICSENYITY--KNFGDQPDIRCPIPRRRNDLD-DPERGMIFV 288

Query: 216  ------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVY 263
                        +L     G +  + +  +++ VT ++++      +AS++  L    ++
Sbjct: 289  CNATHKTKSMFFFLTQTEQGDVFKITLETDEDMVTEVRLKYFDTVPVASSMCVLKTGFLF 348

Query: 264  IGSSYGDSQLIKL-------------NLQPDAKGSYVEVLERYV----------NLGPIV 300
            I S +G+  L ++             +  P  +G       R +          +L PI+
Sbjct: 349  IASEFGNHYLYQIAHLGDDDDEPEFSSAMPLEEGDTFFFAPRPLKNLVMVDEMDSLSPIM 408

Query: 301  DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTDD 359
               + DL  +   Q+    G     +LR++R+G+ ++E A  EL G    +W+++ + +D
Sbjct: 409  HCQIADLANEDTPQLFAMCGRGPRSTLRVLRHGLEVSEMAVSELPGNPNAVWTVKRNIED 468

Query: 360  PFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRL 419
             FD +++VSF++ T +L++   + +EE    GF   T TL C     + LVQ+    +R 
Sbjct: 469  EFDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSQLGDDALVQIYPDGIRH 526

Query: 420  VSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD-GILTE-VKHAQLE 477
            + S  R   NEWK+P   ++     N  QV++A  GG LVY E+   G L E  +  +L 
Sbjct: 527  IRSDKR--VNEWKTPGKKTIVRCAVNQRQVVIALSGGELVYFEMDPTGQLNEYTERKELS 584

Query: 478  YEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSV-LLC 536
             ++ C+ +  +       +  AVG+  D +VRI SL   + ++   L  + +P S   LC
Sbjct: 585  SDVVCMGLGRVPSGEQRCRFLAVGL-NDNTVRIISLDPTDCLSP--LSMQALPASPESLC 641

Query: 537  ------AFEGIS--------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRT 582
                  A EG          +L   L +G LL  +L+  TG+L+D +   LG++P+ L  
Sbjct: 642  IVEMGGAVEGQEEAHINHGLFLNIGLQNGVLLRAVLDNVTGDLSDTRTRYLGSRPVKLFR 701

Query: 583  FSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELT 642
             S + +  V A S R  + Y+   +   + ++   + +   F S   P+ +       L 
Sbjct: 702  ISMQGSDSVLAMSSRSWLSYTYQNRFHLTPLSYDTLEYASGFASEQCPEGIVAISTNTLR 761

Query: 643  IGTIDDIQKLHIR-SIPLGEHPRR-ICHQEQSRTFAICSLKN-----------QSCAEE- 688
            I  ++ +  +  + SIPL   PR+ + H E      I S  N           Q  AEE 
Sbjct: 762  ILALEKLGAIFNQVSIPLQYTPRKFVIHAETGNVIVIESDHNAYTEDTKQQRKQQMAEEM 821

Query: 689  ------------SEMHF-------------------------VRLLDDQTFEFISTYPLD 711
                        +EM                           +++++    + +   PL+
Sbjct: 822  VEAAGEEEQQLAAEMAAAFLQENLPEDVFGAPKAGNGMWASQIQIMNPINGKTLCKLPLE 881

Query: 712  TFEYGCSILSCSFSDDSN-VYYCVGTAY-VLPEENEPTKGRILVFIVEDG--KLQLIAEK 767
              E   S+    F++  +  +  VGT   ++      + G IL F   +    L+L+   
Sbjct: 882  QNEAAFSVSIVRFANHGDEAFVLVGTVKDLVLSPRSCSCGYILCFRFTNNYEGLELLHRT 941

Query: 768  ETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHH---GHILALYVQTR 824
                   ++ AF G++L  + + +++Y     D G +++  +C +      + A++V   
Sbjct: 942  PVDDVPAAIAAFQGRVLIGVGRHLRIY-----DLGKKKMLRKCENKFLPNFVTAIHVN-- 994

Query: 825  GDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTV 884
            G+ I+V D+  S   + YK  E  +   A D    W++   +LD +   GA+   N+  V
Sbjct: 995  GNRIMVSDIQDSFHFVRYKRHENQLIVFADDTCPRWLTCSCMLDYNTMAGADKFGNICVV 1054

Query: 885  RKNSEGATDE------------ERGRL-------EVVGEYHLGEFVNRFRHGSLVMRLPD 925
            R  S  ATD+            +RG L       + +  +H+GE +   +  +L+     
Sbjct: 1055 RLPS-NATDDVDEDPTGNKALWDRGSLNGASQKADSIANFHVGEMITSLQKATLI----- 1108

Query: 926  SDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNN 984
               G   ++++ T++G +G++     HE + F + L+ ++R  ++ + G +H  +RS+  
Sbjct: 1109 --PGGSESLVYTTLSGSLGMLVPFSSHEDHDFFQHLEMHMRSEVQSLCGRDHLAYRSY-- 1164

Query: 985  EKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEE 1031
                   +N +DGDL E F  +  +R  +IS+ ++ +  E+ K++E+
Sbjct: 1165 ---YFPVRNVIDGDLCEMFTSMDVSRQKQISEELDRTPAEVSKKLED 1208


>gi|402222132|gb|EJU02199.1| hypothetical protein DACRYDRAFT_21931 [Dacryopinax sp. DJM-731 SS1]
          Length = 1209

 Score =  217 bits (552), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 271/1172 (23%), Positives = 482/1172 (41%), Gaps = 172/1172 (14%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
            ++G I +L  FR  G A+D+  + ++  +  +L++D +++   T+   +   + G R   
Sbjct: 58   VFGTIRSLVAFRLTGGAKDYAIVGSDSGRLVILEYDPQANAF-TKLHQETYGKSGVRRIV 116

Query: 65   NGQIGIIDPDCRLIGLHLYDG--LFKVIPFDNKGQLKEAFNIR-------LEELQVLDIK 115
             GQ   +DP  R + +   +   L  ++  D    L  +  +        +  +  +D+ 
Sbjct: 117  PGQYLAVDPKGRSVMIAALEKAKLVYILNRDAAANLTISSPLEAHKNHGIIHSIVGVDVG 176

Query: 116  F---LYGCAKPTIVVLYQDNKDARHVKT------YEVALKDKDFVEGPWSQNNLDNGADL 166
            F   L+   +       QD+    + KT      YE+ L     V   WS+   D  A+L
Sbjct: 177  FENPLFAALEVDYTESDQDSTGEAYEKTEKMLTYYELDLGLNHVVRK-WSEPT-DKRANL 234

Query: 167  LIPVP------------PPLCGVLIIGEETIVYCSANAFK-AIPI-----------RPSI 202
            L+ VP            P   GVLI  E  I++   +  +  +PI           R SI
Sbjct: 235  LLQVPGGQTSSEARHDGP--SGVLICCEGYIIWRHMDTVQHRVPIPKRRHPLDERTRDSI 292

Query: 203  TKAYGRVDADGSRY-LLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAV 261
              A       G+ + LL    G L  + I HE E+V  +KI+      +AS++  L +  
Sbjct: 293  IVAAVMHKMKGAFFFLLQSEDGDLFKVTIDHEDEEVKTMKIKYFDTVPVASSLCILKSGF 352

Query: 262  VYIGSSYGDSQLIKLNLQPD-------AKGSYVE---------------VLERYV----- 294
            +++ S +G+  L +     D       +  SY +                LE  V     
Sbjct: 353  LFVASEFGNHYLYQFQKLGDDDDEIEYSSVSYPDNGMADPIPQAYFRPRPLENLVLADEL 412

Query: 295  -NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWS 352
             +  PIVD  V +L      Q+    G     S R++R+G+ + E  S EL GI   +W+
Sbjct: 413  NSFDPIVDAKVTNLLNTDTPQIFAACGRGARSSFRMLRHGLDVEETVSSELPGIPNAVWT 472

Query: 353  LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
            ++   DD +D ++++SF++ T +L++   + +EE    GF S + T+       + L+QV
Sbjct: 473  VKLKADDQYDAYIILSFVNGTLVLSIG--ETIEEVSDTGFLSSSPTIAVQQIGEDSLLQV 530

Query: 413  TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTEV 471
                +R V S  R   NEW+ P   ++  AT N+ QV +A     LVY E+  +G L E 
Sbjct: 531  YPHGIRHVLSDRRV--NEWRCPQHTTIVAATTNSRQVAIALSSAQLVYFELDLEGQLNEY 588

Query: 472  K-HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEII 529
            +    L   +  + I  + E    +   AVG   D +VRI SL PD  L           
Sbjct: 589  QDRKSLGSGVLAMSIAEVPEGRQRTPYLAVGC-EDQTVRIISLDPDTTLENISLQALTAP 647

Query: 530  PRSVLLCAFEGIS--------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLR 581
            P S+ +      S        ++   L +G LL  +L+   G+LTD +   LG++P+ L 
Sbjct: 648  PSSICVAEIMDASIDKNQPTMFVNIGLQNGVLLRTVLDPVNGQLTDTRTRFLGSRPVRLI 707

Query: 582  TFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGEL 641
              +      + A S R  + Y+    L ++ +    + +   F++   PD L       L
Sbjct: 708  RVNVHGLPSILALSSRSWLNYTYQNLLHFTPLIFDPLEYAWSFSAELCPDGLIGISGNVL 767

Query: 642  TIGTIDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESE---------- 690
             I  + ++ QKL    IPL   PR++      R F +    ++  + E+           
Sbjct: 768  RIFQVPNLGQKLKQDVIPLSYTPRKMLQHPTERLFYVIESDHRVLSPEAADKKLQKLKST 827

Query: 691  --------------------------MHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSF 724
                                         ++++D      +   PLD  E   S+   +F
Sbjct: 828  GQRLDQEVIDLPADIFGRPRADAGTWASCIQIIDPANVRSVLEVPLDNNEAAFSLAITTF 887

Query: 725  -SDDSNVYYCVGTAY-VLPEENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVYSLNAFN 780
             +    ++  VGTA  V+        G +  + I EDG+ L+ + + E      +L +F 
Sbjct: 888  IARPGELFLVVGTAQDVIVSPKSCKSGFLRTYKISEDGRSLEFLHKTEVDDVPLALLSFQ 947

Query: 781  GKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY-VQTRGDFIVVGDLMKSISL 839
            G+L+A I + ++++     D G + L  +C +     A+  + T+G  I+VGD+ +SI  
Sbjct: 948  GRLVAGIGKALRIF-----DMGKKRLLRKCENKSFATAIVTLSTQGSRIIVGDMAESIYF 1002

Query: 840  LIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDEER 896
              YK  E  +   A D    W++A  ++D D     +   N+F  R   K  E   ++  
Sbjct: 1003 ATYKPPENRLLIFADDSQPRWITASAMVDYDTVCAGDKFGNVFVNRLPPKVGEQVDEDPT 1062

Query: 897  G---------------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNG 941
            G               +  ++  Y++G+ +      +LV        G    V++  ++G
Sbjct: 1063 GAGVLHEKGLFMGAPHKTNMLAHYYVGDIITSMHKVALV-------TGGRDIVLYTGLHG 1115

Query: 942  VIGV-IASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLI 1000
             IGV I  +  E   F+  L+ ++R     + G +H  +R +      V  K  +DGDL 
Sbjct: 1116 TIGVLIPFISKEDVDFIRTLEQHMRTEAPSLVGRDHLTYRGY-----YVPVKGVVDGDLC 1170

Query: 1001 ESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            E F  L   +   I+  ++ +  E+ K++E+L
Sbjct: 1171 ELFSLLPTQKQQSIAGELDRTYSEVLKKLEQL 1202


>gi|358378986|gb|EHK16667.1| hypothetical protein TRIVIDRAFT_40938 [Trichoderma virens Gv29-8]
          Length = 1212

 Score =  215 bits (548), Expect = 9e-53,   Method: Compositional matrix adjust.
 Identities = 280/1174 (23%), Positives = 484/1174 (41%), Gaps = 168/1174 (14%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            +L   I+G I +L  FR  G  +D+L +AT+  +  ++++  + +      +        
Sbjct: 59   LLSHDIFGIIRSLAAFRLAGSNKDYLILATDSGRITIIEYLPKENRFTKLHLETFGKSGV 118

Query: 61   RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL- 117
            R    G+    DP  R  LI     + L  V+  +++ +L  +  +   +  VL I  + 
Sbjct: 119  RRVIPGEYLACDPKGRACLIASIEKNKLVYVLNRNSQAELTISSPLEAHKPGVLVISMVA 178

Query: 118  --YGCAKPTIVVLYQDNKDA---------RHVKT----YEVALKDKDFVEGPWSQNNLDN 162
               G A P    L  D  +A         R ++T    YE+ L     V   WS+  +D+
Sbjct: 179  LDVGYANPVFAALEMDYTEADQDATGEAMRELETQLVYYELDLGLNHVVRK-WSEP-VDS 236

Query: 163  GADLLIPVP-----PPLCGVLIIGEETIVYCSAN--AFKA-IPIRPSITK--AYGRVDAD 212
             A LL  VP     P   GVL+ GEE I Y  +N  AF+  IP R   T+  +  R    
Sbjct: 237  TASLLFQVPGGNDGP--SGVLVCGEENITYRHSNQEAFRVPIPRRRGATEDPSRKRTIIS 294

Query: 213  GSRYLLGDHAGLLHLLVITHEKE------------------KVTGLKIELLGETSIASTI 254
            G  + L   AG    L+ T + +                  +V  LKI+       A+++
Sbjct: 295  GVMHKLKGSAGAFFFLLQTEDGDMFKVTIDMVEDDEGNTTGEVRRLKIKYFDTVPAAASL 354

Query: 255  SYLDNAVVYIGSSYGD---SQLIKL------------NLQPDAKGSYVEV---------- 289
              L +  +Y+ S +G+    Q  KL            +   DA+ SY  V          
Sbjct: 355  CILKSGFLYVASQFGNFSFYQFEKLGDDDEELEFTSDDFPVDAQASYNPVYFYPRPLENL 414

Query: 290  --LERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI 347
              +E   ++ P++D  V +L  +   Q+ T  G     + R +++G+ INE  S EL GI
Sbjct: 415  VLVESIPSMNPLLDCKVANLTNEDAPQIYTICGNGARSTFRTLKHGLEINEIVSSELPGI 474

Query: 348  -KGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIY 406
               +W+L+ +  + +D ++V+SF + T +L++   + +EE    GF +   TL       
Sbjct: 475  PSAVWTLKLNRTEQYDAYIVLSFTNGTLVLSIG--ETVEEVSDSGFLTSVPTLAAQLLGD 532

Query: 407  NQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GD 465
            + L+QV    +R + +      N+W +P   S+  A+ NA QV +A   G +VY E+  D
Sbjct: 533  DGLIQVHPKGIRHIRNGQV---NQWDAPQHRSIVAASTNAHQVAIALSSGEIVYFEMDDD 589

Query: 466  GILTEVKHAQLEY-EISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEH 523
            G L E    +  +  ++CL +  + E    S   AVG   D +VRI SL P+  L  K  
Sbjct: 590  GSLAEYDEKKEMFGTVTCLSLGEVPEGRLRSSFLAVGC-DDCTVRILSLDPESTLENKSV 648

Query: 524  LGGEIIPRSVLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQP 577
                  P S+ + + E  S      YL   L  G  L  +L+  TGELTD ++  LG + 
Sbjct: 649  QALTAAPTSLAIISMEDSSSGGSTLYLHIGLYSGVYLRTVLDEVTGELTDTRQKFLGPKQ 708

Query: 578  ITLRTFSSKNTTHVFAASDRPTVIYSS--NKKLLYSNVNLKEVSHMCPFNSAAFPDSLAI 635
            + L   + +  T V   S RP + Y+    K  + + +N  ++     F S    + +  
Sbjct: 709  VRLFQVTVQGKTCVLGLSSRPWLGYADPITKTFVVTPLNYVDLEWGWNFTSEQCEEGIVG 768

Query: 636  AKEGELTIGTIDDIQKLHIR-SIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFV 694
             +   L I ++D +    I+ SIPL   P+++        F +    N + A E     +
Sbjct: 769  IQGQTLRIFSVDRLGDTLIQSSIPLTYTPKKMVKHPDHPLFYVIEADNHTLAPELCAKLL 828

Query: 695  ----------RLLDDQTF-----------------------EFISTYPLDTFEYGCSILS 721
                      ++L  + F                       + +    L+  E   S+  
Sbjct: 829  ADPARVNGDTKILPAEEFGHPRGNRRWASCISVVDPLAEDGQVLQRIDLEENEAAVSVAI 888

Query: 722  CSFSDDSN-VYYCVGTA---YVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLN 777
             +F+   N  +  VGT     V P         I  F  +   L  I + + +    ++ 
Sbjct: 889  VTFASQENETFLVVGTGKDMVVNPRSFSDAFVHIYRFERDGRGLVFIHKTKVEEPPMAMI 948

Query: 778  AFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSI 837
             F G++L  I + +++Y   +R    R+ Q+E      I++L   T+G  IVVGD+ + I
Sbjct: 949  PFQGRVLVGIGKTLRIYDLGMR-QLLRKAQAEVAPQ-QIISL--STQGSRIVVGDVQQGI 1004

Query: 838  SLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDE 894
            +  +YK     +     D  A W +   ++D +   G +   N+F VR   K SE A +E
Sbjct: 1005 TYAVYKQSTNKLIPFVDDTVARWTTCTTMVDYETVAGGDKFGNIFIVRSPQKASEEADEE 1064

Query: 895  ERG---------------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTV 939
            + G               RL+++      +        SLV       VG    +++  +
Sbjct: 1065 QAGLHLLNARDYLHGTSHRLDLMCHLFTQDIPTSIAKTSLV-------VGGQDVLLWSGL 1117

Query: 940  NGVIGV-IASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGD 998
             G IGV I  +  E   F + L+ ++R     + G +H  +RS+         K  +DGD
Sbjct: 1118 MGTIGVLIPFITREDTDFFQSLEQHMRAEDPPLAGRDHLMYRSY-----YAPMKGIIDGD 1172

Query: 999  LIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            L E +  L   +   I+  ++ SV E+ +++ ++
Sbjct: 1173 LCERYALLPNDKKQMIAGELDRSVREIERKISDI 1206


>gi|348572800|ref|XP_003472180.1| PREDICTED: LOW QUALITY PROTEIN: splicing factor 3B subunit 3-like
            [Cavia porcellus]
          Length = 1215

 Score =  215 bits (548), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 272/1194 (22%), Positives = 493/1194 (41%), Gaps = 199/1194 (16%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            +L V ++G I +L  FR  G  +D++ + ++  +  +L++   S  +  +   +   + G
Sbjct: 53   LLTVEVFGVIRSLMAFRLTGGTKDYIVVGSDSGRIVILEYQP-SKNMFEKIHQETFGKSG 111

Query: 61   -RPTDNGQIGIIDPDCRLIGLHLYDG--LFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
             R    GQ   +DP  R + +   +   L  ++  D   +L  +  +   +   L    +
Sbjct: 112  CRRIVPGQFLAVDPKGRAVMISAIEKQKLVYILNRDAAARLTISSPLEAHKANTLVYHVV 171

Query: 118  ---YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNG 163
                G   P    L  D ++A +  T E A   +    F E     N+        L+  
Sbjct: 172  GVDVGFENPMFACLEMDYEEADNDPTGEAAANTQQTLTFYELDLGLNHVVRKYSEPLEEH 231

Query: 164  ADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR--- 215
             + LI VP     P   GVLI  E  I Y   N      IR  I +    +D D  R   
Sbjct: 232  GNFLITVPGGSDGP--SGVLICSENYITY--KNFGDQPDIRCPIPRRRNDLD-DPERGMI 286

Query: 216  --------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAV 261
                          +L     G +  + +  +++ VT ++++      +A+ +  L    
Sbjct: 287  FVCSATHKTKSMFFFLAQTEQGDIFKITLETDEDMVTEIRLKYFDTVPVAAAMCVLKTGF 346

Query: 262  VYIGSSYGDSQLIKLN------------------------LQPDAKGSYVEVLERYVNLG 297
            +++ S +G+  L ++                          QP    + V +++   +L 
Sbjct: 347  LFVASEFGNHYLYQIAHLGDDDEEPEFSSAMPLEEGDTFFFQPRPLKNLV-LVDELDSLS 405

Query: 298  PIVDFC-VVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRS 355
            PI+ FC + DL  +   Q+    G     SLR++R+G+ ++E A  EL G    +W++R 
Sbjct: 406  PIL-FCQIADLANEDTPQLYVACGRGPRSSLRVLRHGLEVSEMAVSELPGNPNAVWTVRR 464

Query: 356  STDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSG 415
              +D FD +++VSF++ T +L++   + +EE    GF   T TL C     + LVQV   
Sbjct: 465  HIEDEFDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSLLGDDALVQVYPD 522

Query: 416  SVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKH 473
             +R + +  R   NEWK+P   ++     N  QV++A  GG LVY E+   G L E  + 
Sbjct: 523  GIRHIRADKR--VNEWKTPGKKTIVKCAVNQRQVVIALTGGELVYFEMDPSGQLNEYTER 580

Query: 474  AQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPD--LNLITKEHLGGEIIPR 531
             ++  ++ C+ +  +      S+    G+WT    RI S P   L  ++ + L  +  P 
Sbjct: 581  KEMSADVVCMSLANVSPGEQRSRFPGCGLWT--YCRIISDPSDCLQPLSMQALPAQ--PE 636

Query: 532  SVLLCAFE----------------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGT 575
            S  LC  E                G  YL   L +G LL  +L+  TG+L+D +   LG+
Sbjct: 637  S--LCIVEMGGTEKQDELGERGSIGFLYLNIGLQNGVLLRTVLDPVTGDLSDTRTRYLGS 694

Query: 576  QPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAI 635
            +P+ L     +    V A S R  + YS   +   + ++ + +     F S   P+ +  
Sbjct: 695  RPVKLFRVRMQGQEAVLAMSSRSWLSYSYQSRFHLTPLSYETLEFASGFASEQCPEGIVA 754

Query: 636  AKEGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQEQSR-----------TFAICSLKN 682
                 L I  ++ +  +  + + PL   PR+ + H E +            T A  + + 
Sbjct: 755  ISTNTLRILALEKLGAVFNQVAFPLQYTPRKFVIHPESNNLIIIETDHNAYTEATKAQRK 814

Query: 683  QSCAEE-------------SEM-------------------------HFVRLLDDQTFEF 704
            Q  AEE             +EM                           +R+++      
Sbjct: 815  QQMAEEMVEAAGEDERELAAEMAAAFLNENLPESIFGAPKAGNGQWASVIRVMNPIQGNT 874

Query: 705  ISTYPLDTFEYGCSILSCSFSDDSNVYYC-VGTAYVLPEENEPTKGRILVF--IVEDG-K 760
            +    L+  E   S+  C FS+    +Y  VG A  L        G  +    +V +G K
Sbjct: 875  LDLVQLEQNEAAFSVAVCRFSNTGEDWYVLVGVAKDLILNPRSVAGGFVYTYKLVNNGEK 934

Query: 761  LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY 820
            L+ + +   +    ++  F G++L  + + +++Y     D G ++L  +C  + HI A Y
Sbjct: 935  LEFLHKTPVEEVPAAIAPFQGRVLIGVGKLLRVY-----DLGKKKLLRKC-ENKHI-ANY 987

Query: 821  ---VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAEN 877
               +QT G  ++V D+ +S   + YK  E  +   A D    W++   +LD D   GA+ 
Sbjct: 988  ISGIQTIGHRVIVSDVQESFIWVRYKRNENQLIIFADDTYPRWVTTASLLDYDTVAGADK 1047

Query: 878  NFNLFTVR--KNSEGATDE---------ERGRL-------EVVGEYHLGEFVNRFRHGSL 919
              N+  VR   N+    DE         +RG L       EV+  YH+GE V   +  +L
Sbjct: 1048 FGNICVVRLPPNTNDEVDEDPTGNKALWDRGLLNGASQKAEVIMNYHVGETVLSLQKTTL 1107

Query: 920  VMRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQ 978
            +        G   ++++ T++G IG++     HE + F + ++ +LR     + G +H  
Sbjct: 1108 I-------PGGSESLVYTTLSGGIGILVPFTSHEDHDFFQHVEMHLRSEHPPLCGRDHLS 1160

Query: 979  WRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            +RS+         KN +DGDL E F  +   +   +S+ ++ +  E+ K++E++
Sbjct: 1161 FRSY-----YFPVKNVIDGDLCEQFNSMEPNKQKNVSEELDRTPPEVSKKLEDI 1209


>gi|149640712|ref|XP_001506454.1| PREDICTED: splicing factor 3B subunit 3 [Ornithorhynchus anatinus]
 gi|395508661|ref|XP_003758628.1| PREDICTED: splicing factor 3B subunit 3 [Sarcophilus harrisii]
          Length = 1217

 Score =  215 bits (547), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 273/1193 (22%), Positives = 496/1193 (41%), Gaps = 195/1193 (16%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            +L V ++G I +L  FR  G  +D++ + ++  +  +L++   S  +  +   +   + G
Sbjct: 53   LLTVEVFGVIRSLMAFRLTGGTKDYIVVGSDSGRIVILEYQP-SKNMFEKIHQETFGKSG 111

Query: 61   -RPTDNGQIGIIDPDCRLIGLHLYDG--LFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
             R    GQ   +DP  R + +   +   L  ++  D   +L  +  +   +   L    +
Sbjct: 112  CRRIVPGQFLAVDPKGRAVMISAIEKQKLVYILNRDAAARLTISSPLEAHKANTLVYHVV 171

Query: 118  ---YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNG 163
                G   P    L  D ++A +  T E A   +    F E     N+        L+  
Sbjct: 172  GVDVGFENPMFACLEMDYEEADNDPTGEAAANTQQTLTFYELDLGLNHVVRKYSEPLEEH 231

Query: 164  ADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR--- 215
             + LI VP     P   GVLI  E  I Y   N      IR  I +    +D D  R   
Sbjct: 232  GNFLITVPGGSDGP--SGVLICSENYITY--KNFGDQPDIRCPIPRRRNDLD-DPERGMI 286

Query: 216  --------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAV 261
                          +L     G +  + +  +++ VT ++++      +A+ +  L    
Sbjct: 287  FVCSATHKTKSMFFFLAQTEQGDIFKITLETDEDMVTEIRLKYFDTVPVAAAMCVLKTGF 346

Query: 262  VYIGSSYGDSQLIKLN------------------------LQPDAKGSYVEVLERYVNLG 297
            +++ S +G+  L ++                          QP    + V +++   +L 
Sbjct: 347  LFVASEFGNHYLYQIAHLGDDDEEPEFSSAMPLEEGDTFFFQPRPLKNLV-LVDELDSLS 405

Query: 298  PIVDFC-VVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRS 355
            PI+ FC + DL  +   Q+    G     SLR++R+G+ ++E A  EL G    +W++R 
Sbjct: 406  PIL-FCQIADLANEDTPQLYVACGRGPRSSLRVLRHGLEVSEMAVSELPGNPNAVWTVRR 464

Query: 356  STDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSG 415
              +D FD +++VSF++ T +L++   + +EE    GF   T TL C     + LVQV   
Sbjct: 465  HIEDEFDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSLLGDDALVQVYPD 522

Query: 416  SVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKH 473
             +R + +  R   NEWK+P   ++     N  QV++A  GG LVY E+   G L E  + 
Sbjct: 523  GIRHIRADKR--VNEWKTPGKKTIVKCAVNQRQVVIALTGGELVYFEMDPSGQLNEYTER 580

Query: 474  AQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIP-RS 532
             ++  ++ C+ +  +      S+  AVG+  D +VRI SL   + +  + L  + +P + 
Sbjct: 581  KEMSADVVCMSLANVPPGEQRSRFLAVGL-VDNTVRIISLDPSDCL--QPLSMQALPAQP 637

Query: 533  VLLCAFE----------------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQ 576
              LC  E                G  YL   L +G LL  +L+  TG+L+D +   LG++
Sbjct: 638  ESLCIVEMGGAEKQDELGERGSIGFLYLNIGLQNGVLLRTVLDPVTGDLSDTRTRYLGSR 697

Query: 577  PITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIA 636
            P+ L     +    V A S R  + YS   +   + ++ + +     F S   P+ +   
Sbjct: 698  PVKLFRVRMQGQEAVLAMSSRSWLSYSYQSRFHLTPLSYETLEFASGFASEQCPEGIVAI 757

Query: 637  KEGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQEQSR-----------TFAICSLKNQ 683
                L I  ++ +  +  + + PL   PR+ + H E +            T A  + + Q
Sbjct: 758  STNTLRILALEKLGAVFNQVAFPLQYTPRKFVIHPESNNLIIIETDHNAYTEATKAQRKQ 817

Query: 684  SCAEE-------------SEM-------------------------HFVRLLDDQTFEFI 705
              AEE             +EM                           VR+++      +
Sbjct: 818  QMAEEMVEAAGEDERELAAEMAAAFLNENLPESIFGAPKAGNGQWASVVRVMNPIQGNTL 877

Query: 706  STYPLDTFEYGCSILSCSFSDDSNVYYC-VGTAYVLPEENEPTKGRILVF--IVEDG-KL 761
                L+  E   S+  C FS+  + +Y  VG A  L        G  +    +V  G KL
Sbjct: 878  DLVQLEQNEAAFSVAVCRFSNTGDDWYVLVGVAKDLILNPRSVAGGFVYTYKLVNSGEKL 937

Query: 762  QLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY- 820
            + + +   +    ++  F G++L  + + +++Y     D G ++L  +C  + HI A Y 
Sbjct: 938  EFLHKTPVEEVPAAIAPFQGRVLIGVGKLLRVY-----DLGKKKLLRKC-ENKHI-ANYI 990

Query: 821  --VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENN 878
              +QT G  ++V D+ +S   + YK  E  +   A D    W++   +LD D   GA+  
Sbjct: 991  SGIQTIGHRVIVSDVQESFIWVRYKRNENQLIIFADDTYPRWVTTASLLDYDTVAGADKF 1050

Query: 879  FNLFTVR--KNSEGATDE---------ERGRL-------EVVGEYHLGEFVNRFRHGSLV 920
             N+  VR   N+    DE         +RG L       EV+  YH+GE V   +  +L+
Sbjct: 1051 GNICVVRLPPNTNDEVDEDPTGNKALWDRGLLNGASQKAEVIMNYHVGETVLSLQKTTLI 1110

Query: 921  MRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQW 979
                    G   ++++ T++G IG++     HE + F + ++ +LR     + G +H  +
Sbjct: 1111 -------PGGSESLVYTTLSGGIGILVPFTSHEDHDFFQHVEMHLRSEHPPLCGRDHLSF 1163

Query: 980  RSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            RS+         KN +DGDL E F  +   +   +S+ ++ +  E+ K++E++
Sbjct: 1164 RSY-----YFPVKNVIDGDLCEQFNSMEPNKQKNVSEELDRTPPEVSKKLEDI 1211


>gi|357478269|ref|XP_003609420.1| Splicing factor 3B subunit [Medicago truncatula]
 gi|355510475|gb|AES91617.1| Splicing factor 3B subunit [Medicago truncatula]
          Length = 1225

 Score =  215 bits (547), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 268/1199 (22%), Positives = 494/1199 (41%), Gaps = 200/1199 (16%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESS---ELITRAMGDVSD 57
            +L V ++G I +L  FR  G  +DF+ + ++  +  +L ++ + +   ++     G    
Sbjct: 55   ILSVQVFGTIRSLSQFRLTGAQKDFIVVGSDSGRIVILDYNKQKNVFDKIHQETFGKSGC 114

Query: 58   RIGRPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
            R   P   GQ   IDP  R + +   +    V   +     +   +  LE  +   I F 
Sbjct: 115  RRIVP---GQYLAIDPKGRAVMIAACEKKKLVYVLNRDSLARLTISSPLEANKSHTIVFS 171

Query: 118  Y-----GCAKPTIVVLYQDNKDA-------------RHVKTYEVALKDKDFVEGPWSQNN 159
                  G   P    +  D  DA             +H+  YE+ L   + V   WS + 
Sbjct: 172  ICAVDCGFENPIFAAIELDCSDADQDATGVAASQAQKHLIFYELDLG-LNHVSRKWS-DQ 229

Query: 160  LDNGADLLIPVPPPL---CGVLIIGEETIVYCS---ANAFKAIPIRPSITKAYGRVDADG 213
            +DNGA++L+ VP       GVL+  E  ++Y +    +    IP R  +    G +    
Sbjct: 230  VDNGANMLVTVPGGADGPSGVLVCAENFVIYKNQGHPDVRAVIPRRADLPAERGVLIVSA 289

Query: 214  SRY---------------LLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLD 258
            + +               L  ++  +  + +     ++V+ L I+     ++A +I  L 
Sbjct: 290  AMHKTKNLKPEEFKLFFLLQTEYGDIFKVTLTDGGGDRVSELNIKYFDTIAVAVSICVLK 349

Query: 259  NAVVYIGSSYGDSQLIKLN---------------------------LQPDAKGSYVEVLE 291
            +  ++  S +G+  L +                              QP    + V + +
Sbjct: 350  SGFLFAASEFGNHALYQFKGIGDDDNDVGASSASLMETEEGFQPVFFQPRRLKNLVRI-D 408

Query: 292  RYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGM 350
            +  +L P++D  V +L  +   Q+ T  G     SLRI+R G+ ++E A  +L GI   +
Sbjct: 409  QVESLMPVMDMKVSNLFEEETPQIFTLCGRGPRSSLRIMRTGLAVSEMAVSKLPGIPSAV 468

Query: 351  WSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLV 410
            W+++ +  D FD ++VVSF + T +L++   +  +E    GF     +L       + L+
Sbjct: 469  WTVKKNVMDEFDAYIVVSFTNATLVLSIG--ETADEVSDSGFLDTAPSLAVSLIGDDSLM 526

Query: 411  QVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILT 469
            QV    +R +    R   NEW++    ++    +N  QV++A  GG L+Y E+   G L 
Sbjct: 527  QVHPNGIRHIREDGRT--NEWQTSGKRTIAKVGSNRLQVVIALNGGELIYFEVDVTGQLM 584

Query: 470  EVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEI 528
            EV+  ++  +++CLDI P+ +    S+  AVG + D ++RI SL PD  + T        
Sbjct: 585  EVERHEMSGDVACLDIAPVPKGRLRSRFLAVGSY-DKTIRILSLDPDDCMQTLGIQSLSS 643

Query: 529  IPRSVLLCAFEGIS------------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQ 576
             P S+L    +               +L   L +G L   +++M TG L+D +   LG +
Sbjct: 644  APESLLFLEVQASVGGEDGADHPASLFLNAGLQNGVLSRTVVDMVTGLLSDTRSRFLGLK 703

Query: 577  PITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNS-AAFPDSLAI 635
               L     +    +   S RP + Y      L + ++ + + +   F+S   F   +++
Sbjct: 704  APKLFPIIVRGKRAMLCLSSRPWLGYIHQGHFLLTPLSYETLEYAASFSSDQCFEGVVSV 763

Query: 636  AKEGELTIGTIDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAIC---------------- 678
            A E  L I T++ + +  +   IPL   PR+   Q + +   +                 
Sbjct: 764  ASEA-LRIFTVERLGETFNQNVIPLRYTPRKFVLQPKRKLLVVIESDQGALTAEEREAAR 822

Query: 679  --------SLKNQSCAEE---------------SEMHF-------------VRLLDDQTF 702
                    + +N++ +E+               S+ H+             +R+LD +T 
Sbjct: 823  KECFEAAHAGENKTGSEDQMENGGEDEDNDDSLSDEHYGYPKSESDKWVSCIRVLDPRTG 882

Query: 703  EFISTYPLDTFEYGCSILSCSFSD-DSNVYYCVGTAYVL--PEENEPTKGRILVF-IVED 758
                   L   E   SI + +F D +      VGTA  L    +   T G I ++  ++D
Sbjct: 883  NTTCLLELQENEAAFSICTVNFHDKEYGTLLAVGTAKGLQFTPKRSLTAGFIHIYRFLDD 942

Query: 759  GK-LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHIL 817
            G+ L+L+ + + +G   +L  F G+LLA I   ++LY     D G R L  +C +     
Sbjct: 943  GRSLELLHKTQVEGVPLALCQFQGRLLAGIGPVLRLY-----DLGKRRLLRKCENKSFPS 997

Query: 818  ALY-VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAE 876
            ++  +    D I VG + +S     Y+ +E  +   A D    W+++   +D D   GA+
Sbjct: 998  SIVSIHAYRDRIYVGGIQESFHYCKYRRDENQLYIFADDSVPRWLTSSYHIDFDTMAGAD 1057

Query: 877  NNFNLFTVR------------------KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGS 918
               N+F  R                  K  +G  +    ++E + ++H+G+ +   +  S
Sbjct: 1058 KFGNIFFARLPQDVSDEIEEDPTGGKIKWEQGKLNGAPNKVEEIVQFHVGDVITSLQKAS 1117

Query: 919  LVMRLPDSDVGQIPTVIFGTVNGVIGVI-ASLPHEQYLFLEKLQTNLRKVIKGVGGLNHE 977
            LV        G    +++GTV G +G + A    +   F   L+ ++R+    + G +H 
Sbjct: 1118 LV-------PGGGECIVYGTVMGSVGALHAFTSRDDVDFFSHLEMHMRQDNPPLCGRDHM 1170

Query: 978  QWRSFNNEKKTVDAKNFLDGDLIESF----LDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
             +RS          K+ +DGDL E F    +DL R   DE+ +T      E+ K++EE+
Sbjct: 1171 AYRS-----AYFPVKDVIDGDLCEQFPTLPMDLQRKIADELDRTPG----EILKKLEEV 1220


>gi|357478323|ref|XP_003609447.1| Splicing factor 3B subunit [Medicago truncatula]
 gi|355510502|gb|AES91644.1| Splicing factor 3B subunit [Medicago truncatula]
          Length = 1225

 Score =  215 bits (547), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 272/1199 (22%), Positives = 492/1199 (41%), Gaps = 200/1199 (16%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESS---ELITRAMGDVSD 57
            +L V ++G I +L  FR  G  +DF+ + ++  +  +L ++ + +   ++     G    
Sbjct: 55   ILSVQVFGTIRSLSQFRLTGAQKDFIVVGSDSGRIVILDYNKQKNVFDKIHQETFGKSGC 114

Query: 58   RIGRPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
            R   P   GQ   IDP  R + +   +    V   +     +   +  LE  +   I F 
Sbjct: 115  RRIVP---GQYLAIDPKGRAVMIAACEKKKLVYVLNRDSLARLTISSPLEANKSHTIVFS 171

Query: 118  Y-----GCAKPTIVVLYQDNKDA-------------RHVKTYEVALKDKDFVEGPWSQNN 159
                  G   P    +  D  DA             +H+  YE+ L   + V   WS + 
Sbjct: 172  ICAVDCGFENPIFAAIELDCSDADQDATGVAASQAQKHLIFYELDLG-LNHVSRKWS-DQ 229

Query: 160  LDNGADLLIPVPPPL---CGVLIIGEETIVYCS---ANAFKAIPIRPSITKAYGRVDADG 213
            +DNGA++L+ VP       GVL+  E  ++Y +    +    IP R  +    G +    
Sbjct: 230  VDNGANMLVTVPGGADGPSGVLVCAENFVIYKNQGHPDVRAVIPRRADLPAERGVLIVSA 289

Query: 214  SRY---------------LLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLD 258
            + +               L  ++  +  + +     ++V+ L I+     ++A +I  L 
Sbjct: 290  AMHKTKNLKPEEFKLFFLLQTEYGDIFKVTLTDGGGDRVSELNIKYFDTIAVAVSICVLK 349

Query: 259  NAVVYIGSSYGDSQLIKLNLQPD-------AKGSYVEV-------------LERYVNLGP 298
            +  ++  S +G+  L +     D       +  S +E              L+  V + P
Sbjct: 350  SGFLFAASEFGNHALYQFKGIGDDDNDVVASSASLMETEEGFQPVFFLPRRLKNLVRIDP 409

Query: 299  ------IVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMW 351
                  I+D  V +L  +   Q+ T  G     SLRI+R G+ ++E A  +L GI   +W
Sbjct: 410  VESLMSIMDMKVSNLFEEETPQIFTLCGRGPRSSLRIMRTGLAVSEMAVSKLPGIPSAVW 469

Query: 352  SLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQ 411
            +++ +  D FD+++VVSF + T +L++   + +EE    GF     +L       + ++Q
Sbjct: 470  TVKKNVMDEFDSYIVVSFTNATLVLSIG--ETVEEVSNSGFLDTAPSLAVSLIGDDSIMQ 527

Query: 412  VTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE 470
            V    +R +    R   NEW++    ++    +N  QV++A  GG LVY E+   G L E
Sbjct: 528  VHPNGIRHIREDGRT--NEWQTSGKRTIAKVGSNRLQVVIALNGGELVYFEVDVTGQLME 585

Query: 471  VKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEII 529
            V+  ++  +++CLDI P+ +    S+  AVG + D ++RI SL PD  + T         
Sbjct: 586  VEKHEMSGDVACLDIAPVPKGRLRSRFLAVGSY-DNTIRILSLDPDDCMQTLSIQSLSSA 644

Query: 530  PRSVLLCAFEGIS------------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQP 577
            P S+L    +               +L   L  G L   +++  TG L+D +   LG + 
Sbjct: 645  PESLLFLEVQASVGGEDGADHPASLFLSAGLQSGVLSRTVVDTVTGLLSDARSRFLGLKA 704

Query: 578  ITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAK 637
              L          +   S RP + Y      L + ++ + + +   F+S    + +    
Sbjct: 705  PKLFPIILGRKCAMLCLSSRPWLGYIHQGHFLLTPLSYETLEYAASFSSDQCLEGVVSVS 764

Query: 638  EGELTIGTIDDI-QKLHIRSIPLGEHPRRICHQEQSRTF--------AICSLKNQSCAEE 688
               L I T++ + +  +   IPL   PR++  Q + +          A+ + + ++  +E
Sbjct: 765  GEALRIFTVERLGETFNQTVIPLRYTPRKLVLQPKRKLLVVIESDQGALTAKEREAARKE 824

Query: 689  -------------------------------SEMHF-------------VRLLDDQTFEF 704
                                           S+ H+             +R+LD +T   
Sbjct: 825  CFEAAQTGENATGSEDQMENDGEDEDQGDSLSDEHYGYPKSESDKWVSCIRVLDPRTGNT 884

Query: 705  ISTYPLDTFEYGCSILSCSFSD-DSNVYYCVGTAYVLPEENEPTKGRILVFI-----VED 758
                 L   E   SI + +F D +      VGTA  L  +  P +   + FI     ++D
Sbjct: 885  TCLLELQENEAAFSICTVNFHDKEYGTLLAVGTAKGL--QFTPKRSLTVGFIHIYRFLDD 942

Query: 759  GK-LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHIL 817
            G+ L+L+ + + +G   +L  F G+LLA I   ++LY     D G R L  +C +    +
Sbjct: 943  GRSLELLHKTQVEGVPLALCQFQGRLLAGIGPVLRLY-----DLGKRRLLRKCENKSFPI 997

Query: 818  ALY-VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAE 876
            ++  +    D I VGD+ +S     Y+ +E  +   A D    W++A   +D D   GA+
Sbjct: 998  SIVSIHAYRDRIYVGDIQESFHYCKYRRDENQLYIFADDSVPRWLTASYHIDFDTMAGAD 1057

Query: 877  NNFNLFTVR-----------KNSEGATDEERGRL-------EVVGEYHLGEFVNRFRHGS 918
               N+F  R             + G    E+G+L       E + ++H+G+ +   +  +
Sbjct: 1058 KFGNIFFARLPQDVSDEVEEDPTSGKIKWEQGKLNGALNKVEEIVQFHVGDVITSLQKAA 1117

Query: 919  LVMRLPDSDVGQIPTVIFGTVNGVIGVI-ASLPHEQYLFLEKLQTNLRKVIKGVGGLNHE 977
            LV        G    +++GTV G +G + A    +   F   L+ ++R+    + G +H 
Sbjct: 1118 LV-------PGGGECIVYGTVMGCVGALHAFTSRDDVDFFSHLEMHMRQDNPPLCGRDHM 1170

Query: 978  QWRSFNNEKKTVDAKNFLDGDLIESF----LDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
             +RS          K+ LDGDL E F    +DL R   DE+ +T      E+ K++EEL
Sbjct: 1171 AYRS-----AYFPVKDVLDGDLCEQFPTLPMDLQRKIADELDRTPG----EILKKLEEL 1220


>gi|73957045|ref|XP_536791.2| PREDICTED: splicing factor 3B subunit 3 isoform 1 [Canis lupus
            familiaris]
 gi|149699332|ref|XP_001500880.1| PREDICTED: splicing factor 3B subunit 3 [Equus caballus]
 gi|301771131|ref|XP_002920989.1| PREDICTED: splicing factor 3B subunit 3-like [Ailuropoda melanoleuca]
 gi|344290794|ref|XP_003417122.1| PREDICTED: splicing factor 3B subunit 3-like [Loxodonta africana]
 gi|410983914|ref|XP_003998280.1| PREDICTED: splicing factor 3B subunit 3 [Felis catus]
 gi|426242169|ref|XP_004014947.1| PREDICTED: splicing factor 3B subunit 3 [Ovis aries]
 gi|417406150|gb|JAA49749.1| Putative damage-specific dna binding complex subunit ddb1 [Desmodus
            rotundus]
 gi|431912451|gb|ELK14585.1| Splicing factor 3B subunit 3 [Pteropus alecto]
          Length = 1217

 Score =  214 bits (546), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 272/1193 (22%), Positives = 497/1193 (41%), Gaps = 195/1193 (16%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            +L V ++G I +L  FR  G  +D++ + ++  +  +L++   S  +  +   +   + G
Sbjct: 53   LLTVEVFGVIRSLMAFRLTGGTKDYIVVGSDSGRIVILEYQP-SKNMFEKIHQETFGKSG 111

Query: 61   -RPTDNGQIGIIDPDCRLIGLHLYDG--LFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
             R    GQ   +DP  R + +   +   L  ++  D   +L  +  +   +   L    +
Sbjct: 112  CRRIVPGQFLAVDPKGRAVMISAIEKQKLVYILNRDAAARLTISSPLEAHKANTLVYHVV 171

Query: 118  ---YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNG 163
                G   P    L  D ++A +  T E A   +    F E     N+        L+  
Sbjct: 172  GVDVGFENPMFACLEMDYEEADNDPTGEAAANTQQTLTFYELDLGLNHVVRKYSEPLEEH 231

Query: 164  ADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR--- 215
             + LI VP     P   GVLI  E  I Y   N      IR  I +    +D D  R   
Sbjct: 232  GNFLITVPGGSDGP--SGVLICSENYITY--KNFGDQPDIRCPIPRRRNDLD-DPERGMI 286

Query: 216  --------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAV 261
                          +L     G +  + +  +++ VT ++++      +A+ +  L    
Sbjct: 287  FVCSATHKTKSMFFFLAQTEQGDIFKITLETDEDMVTEIRLKYFDTVPVAAAMCVLKTGF 346

Query: 262  VYIGSSYGDSQLIKLN------------------------LQPDAKGSYVEVLERYVNLG 297
            +++ S +G+  L ++                          QP    + V +++   +L 
Sbjct: 347  LFVASEFGNHYLYQIAHLGDDDEEPEFSSAMPLEEGDTFFFQPRPLKNLV-LVDELDSLS 405

Query: 298  PIVDFC-VVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRS 355
            PI+ FC + DL  +   Q+    G     SLR++R+G+ ++E A  EL G    +W++R 
Sbjct: 406  PIL-FCQIADLANEDTPQLYVACGRGPRSSLRVLRHGLEVSEMAVSELPGNPNAVWTVRR 464

Query: 356  STDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSG 415
              +D FD +++VSF++ T +L++   + +EE    GF   T TL C     + LVQV   
Sbjct: 465  HIEDEFDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSLLGDDALVQVYPD 522

Query: 416  SVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKH 473
             +R + +  R   NEWK+P   ++     N  QV++A  GG LVY E+   G L E  + 
Sbjct: 523  GIRHIRADKR--VNEWKTPGKKTIVKCAVNQRQVVIALTGGELVYFEMDPSGQLNEYTER 580

Query: 474  AQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIP-RS 532
             ++  ++ C+ +  +      S+  AVG+  D +VRI SL   + +  + L  + +P + 
Sbjct: 581  KEMSADVVCMSLANVPPGEQRSRFLAVGL-VDNTVRIISLDPSDCL--QPLSMQALPAQP 637

Query: 533  VLLCAFE----------------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQ 576
              LC  E                G  YL   L +G LL  +L+  TG+L+D +   LG++
Sbjct: 638  ESLCIVEMGGTEKQDELGERGSIGFLYLNIGLQNGVLLRTVLDPVTGDLSDTRTRYLGSR 697

Query: 577  PITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIA 636
            P+ L     +    V A S R  + YS   +   + ++ + +     F S   P+ +   
Sbjct: 698  PVKLFRVRMQGQEAVLAMSSRSWLSYSYQSRFHLTPLSYETLEFASGFASEQCPEGIVAI 757

Query: 637  KEGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQEQSR-----------TFAICSLKNQ 683
                L I  ++ +  +  + + PL   PR+ + H E +            T A  + + Q
Sbjct: 758  STNTLRILALEKLGAVFNQVAFPLQYTPRKFVIHPESNNLIIIETDHNAYTEATKAQRKQ 817

Query: 684  SCAEE-------------SEM-------------------------HFVRLLDDQTFEFI 705
              AEE             +EM                           +R+++      +
Sbjct: 818  QMAEEMVEAAGEDERELAAEMAAAFLNENLPESIFGAPKAGNGQWASVIRVMNPIQGNTL 877

Query: 706  STYPLDTFEYGCSILSCSFSDDSNVYYC-VGTAYVLPEENEPTKGRILVF--IVEDG-KL 761
                L+  E   S+  C FS+  + +Y  VG A  L        G  +    +V +G KL
Sbjct: 878  DLVQLEQNEAAFSVAVCRFSNTGDDWYVLVGVAKDLILNPRSVAGGFVYTYKLVNNGEKL 937

Query: 762  QLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY- 820
            + + +   +    ++  F G++L  + + +++Y     D G ++L  +C  + HI A Y 
Sbjct: 938  EFLHKTPVEEVPAAIAPFQGRVLIGVGKLLRVY-----DLGKKKLLRKC-ENKHI-ANYI 990

Query: 821  --VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENN 878
              +QT G  ++V D+ +S   + YK  E  +   A D    W++   +LD D   GA+  
Sbjct: 991  SGIQTIGHRVIVSDVQESFIWVRYKRNENQLIIFADDTYPRWVTTASLLDYDTVAGADKF 1050

Query: 879  FNLFTVR--KNSEGATDE---------ERGRL-------EVVGEYHLGEFVNRFRHGSLV 920
             N+  VR   N+    DE         +RG L       EV+  YH+GE V   +  +L+
Sbjct: 1051 GNICVVRLPPNTNDEVDEDPTGNKALWDRGLLNGASQKAEVIMNYHVGETVLSLQKTTLI 1110

Query: 921  MRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQW 979
                    G   ++++ T++G IG++     HE + F + ++ +LR     + G +H  +
Sbjct: 1111 -------PGGSESLVYTTLSGGIGILVPFTSHEDHDFFQHVEMHLRSEHPPLCGRDHLSF 1163

Query: 980  RSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            RS+         KN +DGDL E F  +   +   +S+ ++ +  E+ K++E++
Sbjct: 1164 RSY-----YFPVKNVIDGDLCEQFNSMEPNKQKNVSEELDRTPPEVSKKLEDI 1211


>gi|40788938|dbj|BAA32662.2| KIAA0017 protein [Homo sapiens]
          Length = 1253

 Score =  214 bits (546), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 272/1193 (22%), Positives = 496/1193 (41%), Gaps = 195/1193 (16%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            +L V ++G I +L  FR  G  +D++ + ++  +  +L++   S  +  +   +   + G
Sbjct: 89   LLTVEVFGVIRSLMAFRLTGGTKDYIVVGSDSGRIVILEYQP-SKNMFEKIHQETFGKSG 147

Query: 61   -RPTDNGQIGIIDPDCRLIGLHLYDG--LFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
             R    GQ   +DP  R + +   +   L  ++  D   +L  +  +   +   L    +
Sbjct: 148  CRRIVPGQFLAVDPKGRAVMISAIEKQKLVYILNRDAAARLTISSPLEAHKANTLVYHVV 207

Query: 118  ---YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNG 163
                G   P    L  D ++A +  T E A   +    F E     N+        L+  
Sbjct: 208  GVDVGFENPMFACLEMDYEEADNDPTGEAAANTQQTLTFYELDLGLNHVVRKYSEPLEEH 267

Query: 164  ADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR--- 215
             + LI VP     P   GVLI  E  I Y   N      IR  I +    +D D  R   
Sbjct: 268  GNFLITVPGGSDGP--SGVLICSENYITY--KNFGDQPDIRCPIPRRRNDLD-DPERGMI 322

Query: 216  --------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAV 261
                          +L     G +  + +  +++ VT ++++      +A+ +  L    
Sbjct: 323  FVCSATHKTKSMFFFLAQTEQGDIFKITLETDEDMVTEIRLKYFDTVPVAAAMCVLKTGF 382

Query: 262  VYIGSSYGDSQLIKLN------------------------LQPDAKGSYVEVLERYVNLG 297
            +++ S +G+  L ++                          QP    + V +++   +L 
Sbjct: 383  LFVASEFGNHYLYQIAHLGDDDEEPEFSSAMPLEEGDTFFFQPRPLKNLV-LVDELDSLS 441

Query: 298  PIVDFC-VVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRS 355
            PI+ FC + DL  +   Q+    G     SLR++R+G+ ++E A  EL G    +W++R 
Sbjct: 442  PIL-FCQIADLANEDTPQLYVACGRGPRSSLRVLRHGLEVSEMAVSELPGNPNAVWTVRR 500

Query: 356  STDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSG 415
              +D FD +++VSF++ T +L++   + +EE    GF   T TL C     + LVQV   
Sbjct: 501  HIEDEFDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSLLGDDALVQVYPD 558

Query: 416  SVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKH 473
             +R + +  R   NEWK+P   ++     N  QV++A  GG LVY E+   G L E  + 
Sbjct: 559  GIRHIRADKR--VNEWKTPGKKTIVKCAVNQRQVVIALTGGELVYFEMDPSGQLNEYTER 616

Query: 474  AQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIP-RS 532
             ++  ++ C+ +  +      S+  AVG+  D +VRI SL   + +  + L  + +P + 
Sbjct: 617  KEMSADVVCMSLANVPPGEQRSRFLAVGL-VDNTVRIISLDPSDCL--QPLSMQALPAQP 673

Query: 533  VLLCAFE----------------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQ 576
              LC  E                G  YL   L +G LL  +L+  TG+L+D +   LG++
Sbjct: 674  ESLCIVEMGGTEKQDELGERGSIGFLYLNIGLQNGVLLRTVLDPVTGDLSDTRTRYLGSR 733

Query: 577  PITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIA 636
            P+ L     +    V A S R  + YS   +   + ++ + +     F S   P+ +   
Sbjct: 734  PVKLFRVRMQGQEAVLAMSSRSWLSYSYQSRFHLTPLSYETLEFASGFASEQCPEGIVAI 793

Query: 637  KEGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQEQSR-----------TFAICSLKNQ 683
                L I  ++ +  +  + + PL   PR+ + H E +            T A  + + Q
Sbjct: 794  STNTLRILALEKLGAVFNQVAFPLQYTPRKFVIHPESNNLIIIETDHNAYTEATKAQRKQ 853

Query: 684  SCAEE-------------SEM-------------------------HFVRLLDDQTFEFI 705
              AEE             +EM                           +R+++      +
Sbjct: 854  QMAEEMVEAAGEDERELAAEMAAAFLNENLPESIFGAPKAGNGQWASVIRVMNPIQGNTL 913

Query: 706  STYPLDTFEYGCSILSCSFSDDSNVYYC-VGTAYVLPEENEPTKGRILVF--IVEDG-KL 761
                L+  E   S+  C FS+    +Y  VG A  L        G  +    +V +G KL
Sbjct: 914  DLVQLEQNEAAFSVAVCRFSNTGEDWYVLVGVAKDLILNPRSVAGGFVYTYKLVNNGEKL 973

Query: 762  QLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY- 820
            + + +   +    ++  F G++L  + + +++Y     D G ++L  +C  + HI A Y 
Sbjct: 974  EFLHKTPVEEVPAAIAPFQGRVLIGVGKLLRVY-----DLGKKKLLRKC-ENKHI-ANYI 1026

Query: 821  --VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENN 878
              +QT G  ++V D+ +S   + YK  E  +   A D    W++   +LD D   GA+  
Sbjct: 1027 SGIQTIGHRVIVSDVQESFIWVRYKRNENQLIIFADDTYPRWVTTASLLDYDTVAGADKF 1086

Query: 879  FNLFTVR--KNSEGATDE---------ERGRL-------EVVGEYHLGEFVNRFRHGSLV 920
             N+  VR   N+    DE         +RG L       EV+  YH+GE V   +  +L+
Sbjct: 1087 GNICVVRLPPNTNDEVDEDPTGNKALWDRGLLNGASQKAEVIMNYHVGETVLSLQKTTLI 1146

Query: 921  MRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQW 979
                    G   ++++ T++G IG++     HE + F + ++ +LR     + G +H  +
Sbjct: 1147 -------PGGSESLVYTTLSGGIGILVPFTSHEDHDFFQHVEMHLRSEHPPLCGRDHLSF 1199

Query: 980  RSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            RS+         KN +DGDL E F  +   +   +S+ ++ +  E+ K++E++
Sbjct: 1200 RSY-----YFPVKNVIDGDLCEQFNSMEPNKQKNVSEELDRTPPEVSKKLEDI 1247


>gi|395836972|ref|XP_003791420.1| PREDICTED: splicing factor 3B subunit 3 [Otolemur garnettii]
          Length = 1217

 Score =  214 bits (545), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 272/1193 (22%), Positives = 496/1193 (41%), Gaps = 195/1193 (16%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            +L V ++G I +L  FR  G  +D++ + ++  +  +L++   S  +  +   +   + G
Sbjct: 53   LLTVEVFGVIRSLMAFRLTGGTKDYIVVGSDSGRIVILEYQP-SKNMFEKIHQETFGKSG 111

Query: 61   -RPTDNGQIGIIDPDCRLIGLHLYDG--LFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
             R    GQ   +DP  R + +   +   L  ++  D   +L  +  +   +   L    +
Sbjct: 112  CRRIVPGQFLAVDPKGRAVMISAIEKQKLVYILNRDAAARLTISSPLEAHKANTLVYHVV 171

Query: 118  ---YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNG 163
                G   P    L  D ++A +  T E A   +    F E     N+        L+  
Sbjct: 172  GVDVGFENPMFACLEMDYEEADNDPTGEAAANTQQTLTFYELDLGLNHVVRKYSEPLEEH 231

Query: 164  ADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR--- 215
             + LI VP     P   GVLI  E  I Y   N      IR  I +    +D D  R   
Sbjct: 232  GNFLITVPGGSDGP--SGVLICSENYITY--KNFGDQPDIRCPIPRRRNDLD-DPERGMI 286

Query: 216  --------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAV 261
                          +L     G +  + +  +++ VT ++++      +A+ +  L    
Sbjct: 287  FVCSATHKTKSMFFFLAQTEQGDIFKITLETDEDMVTEIRLKYFDTVPVAAAMCVLKTGF 346

Query: 262  VYIGSSYGDSQLIKLN------------------------LQPDAKGSYVEVLERYVNLG 297
            +++ S +G+  L ++                          QP    + V +++   +L 
Sbjct: 347  LFVASEFGNHYLYQIAHLGDDDEEPEFSSAMPLEEGDTFFFQPRPLKNLV-LVDELDSLS 405

Query: 298  PIVDFC-VVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRS 355
            PI+ FC + DL  +   Q+    G     SLR++R+G+ ++E A  EL G    +W++R 
Sbjct: 406  PIL-FCQIADLANEDTPQLYVACGRGPRSSLRVLRHGLEVSEMAVSELPGNPNAVWTVRR 464

Query: 356  STDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSG 415
              +D FD +++VSF++ T +L++   + +EE    GF   T TL C     + LVQV   
Sbjct: 465  HIEDEFDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSLLGDDALVQVYPD 522

Query: 416  SVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKH 473
             +R + +  R   NEWK+P   ++     N  QV++A  GG LVY E+   G L E  + 
Sbjct: 523  GIRHIRADKR--VNEWKTPGKKTIVKCAVNQRQVVIALTGGELVYFEMDPSGQLNEYTER 580

Query: 474  AQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIP-RS 532
             ++  ++ C+ +  +      S+  AVG+  D +VRI SL   + +  + L  + +P + 
Sbjct: 581  KEMSADVVCMSLANVPPGEQRSRFLAVGL-VDNTVRIISLDPSDCL--QPLSMQALPAQP 637

Query: 533  VLLCAFE----------------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQ 576
              LC  E                G  YL   L +G LL  +L+  TG+L+D +   LG++
Sbjct: 638  ESLCIVEMGGTEKQDELGERGSIGFLYLNIGLQNGVLLRTVLDPVTGDLSDTRTRYLGSR 697

Query: 577  PITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIA 636
            P+ L     +    V A S R  + YS   +   + ++ + +     F S   P+ +   
Sbjct: 698  PVKLFRVRMQGQEAVLAMSSRSWLSYSYQSRFHLTPLSYETLEFASGFASEQCPEGIVAI 757

Query: 637  KEGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQEQSR-----------TFAICSLKNQ 683
                L I  ++ +  +  + + PL   PR+ + H E +            T A  + + Q
Sbjct: 758  STNTLRILALEKLGAVFNQVAFPLQYTPRKFVIHPESNNLIIIETDHNAYTEATKAQRKQ 817

Query: 684  SCAEE-------------SEM-------------------------HFVRLLDDQTFEFI 705
              AEE             +EM                           +R+++      +
Sbjct: 818  QMAEEMVEAAGEDERELAAEMAAAFLNENLPESIFGAPKAGSGQWASVIRVMNPIQGNTL 877

Query: 706  STYPLDTFEYGCSILSCSFSDDSNVYYC-VGTAYVLPEENEPTKGRILVF--IVEDG-KL 761
                L+  E   S+  C FS+    +Y  VG A  L        G  +    +V +G KL
Sbjct: 878  DLVQLEQNEAAFSVAVCRFSNTGEEWYVLVGVAKDLILNPRSVAGGFVYTYKLVNNGEKL 937

Query: 762  QLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY- 820
            + + +   +    ++  F G++L  + + +++Y     D G ++L  +C  + HI A Y 
Sbjct: 938  EFLHKTPVEEVPAAIAPFQGRVLIGVGKLLRVY-----DLGKKKLLRKC-ENKHI-ANYI 990

Query: 821  --VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENN 878
              +QT G  ++V D+ +S   + YK  E  +   A D    W++   +LD D   GA+  
Sbjct: 991  SGIQTIGHRVIVSDVQESFIWVRYKRNENQLIIFADDTYPRWVTTASLLDYDTVAGADKF 1050

Query: 879  FNLFTVR--KNSEGATDE---------ERGRL-------EVVGEYHLGEFVNRFRHGSLV 920
             N+  VR   N+    DE         +RG L       EV+  YH+GE V   +  +L+
Sbjct: 1051 GNICVVRLPPNTNDEVDEDPTGNKALWDRGLLNGASQKAEVIMNYHVGETVLSLQKTTLI 1110

Query: 921  MRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQW 979
                    G   ++++ T++G IG++     HE + F + ++ +LR     + G +H  +
Sbjct: 1111 -------PGGSESLVYTTLSGGIGILVPFTSHEDHDFFQHVEMHLRSEHPPLCGRDHLSF 1163

Query: 980  RSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            RS+         KN +DGDL E F  +   +   +S+ ++ +  E+ K++E++
Sbjct: 1164 RSY-----YFPVKNVIDGDLCEQFNSMEPNKQKNVSEELDRTPPEVSKKLEDI 1211


>gi|54112121|ref|NP_036558.3| splicing factor 3B subunit 3 [Homo sapiens]
 gi|118150814|ref|NP_001071319.1| splicing factor 3B subunit 3 [Bos taurus]
 gi|296231541|ref|XP_002761182.1| PREDICTED: splicing factor 3B subunit 3 isoform 2 [Callithrix
            jacchus]
 gi|332846357|ref|XP_511081.3| PREDICTED: splicing factor 3B subunit 3 [Pan troglodytes]
 gi|397518709|ref|XP_003829523.1| PREDICTED: splicing factor 3B subunit 3 [Pan paniscus]
 gi|402908991|ref|XP_003917214.1| PREDICTED: splicing factor 3B subunit 3 [Papio anubis]
 gi|403298333|ref|XP_003939977.1| PREDICTED: splicing factor 3B subunit 3 [Saimiri boliviensis
            boliviensis]
 gi|426382761|ref|XP_004057969.1| PREDICTED: splicing factor 3B subunit 3 [Gorilla gorilla gorilla]
 gi|116242787|sp|Q15393.4|SF3B3_HUMAN RecName: Full=Splicing factor 3B subunit 3; AltName:
            Full=Pre-mRNA-splicing factor SF3b 130 kDa subunit;
            Short=SF3b130; AltName: Full=STAF130; AltName:
            Full=Spliceosome-associated protein 130; Short=SAP 130
 gi|125987788|sp|A0JN52.1|SF3B3_BOVIN RecName: Full=Splicing factor 3B subunit 3; AltName:
            Full=Pre-mRNA-splicing factor SF3b 130 kDa subunit;
            Short=SF3b130; AltName: Full=Spliceosome-associated
            protein 130; Short=SAP 130
 gi|117306205|gb|AAI26519.1| Splicing factor 3b, subunit 3, 130kDa [Bos taurus]
 gi|119572190|gb|EAW51805.1| splicing factor 3b, subunit 3, 130kDa, isoform CRA_c [Homo sapiens]
 gi|168274284|dbj|BAG09562.1| splicing factor 3B subunit 3 [synthetic construct]
 gi|296478233|tpg|DAA20348.1| TPA: splicing factor 3B subunit 3 [Bos taurus]
 gi|355710371|gb|EHH31835.1| Spliceosome-associated protein 130 [Macaca mulatta]
 gi|355756944|gb|EHH60552.1| Spliceosome-associated protein 130 [Macaca fascicularis]
 gi|380811142|gb|AFE77446.1| splicing factor 3B subunit 3 [Macaca mulatta]
 gi|383417057|gb|AFH31742.1| splicing factor 3B subunit 3 [Macaca mulatta]
 gi|384946118|gb|AFI36664.1| splicing factor 3B subunit 3 [Macaca mulatta]
 gi|410209986|gb|JAA02212.1| splicing factor 3b, subunit 3, 130kDa [Pan troglodytes]
 gi|410260956|gb|JAA18444.1| splicing factor 3b, subunit 3, 130kDa [Pan troglodytes]
 gi|410355121|gb|JAA44164.1| splicing factor 3b, subunit 3, 130kDa [Pan troglodytes]
          Length = 1217

 Score =  214 bits (545), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 272/1193 (22%), Positives = 496/1193 (41%), Gaps = 195/1193 (16%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            +L V ++G I +L  FR  G  +D++ + ++  +  +L++   S  +  +   +   + G
Sbjct: 53   LLTVEVFGVIRSLMAFRLTGGTKDYIVVGSDSGRIVILEYQP-SKNMFEKIHQETFGKSG 111

Query: 61   -RPTDNGQIGIIDPDCRLIGLHLYDG--LFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
             R    GQ   +DP  R + +   +   L  ++  D   +L  +  +   +   L    +
Sbjct: 112  CRRIVPGQFLAVDPKGRAVMISAIEKQKLVYILNRDAAARLTISSPLEAHKANTLVYHVV 171

Query: 118  ---YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNG 163
                G   P    L  D ++A +  T E A   +    F E     N+        L+  
Sbjct: 172  GVDVGFENPMFACLEMDYEEADNDPTGEAAANTQQTLTFYELDLGLNHVVRKYSEPLEEH 231

Query: 164  ADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR--- 215
             + LI VP     P   GVLI  E  I Y   N      IR  I +    +D D  R   
Sbjct: 232  GNFLITVPGGSDGP--SGVLICSENYITY--KNFGDQPDIRCPIPRRRNDLD-DPERGMI 286

Query: 216  --------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAV 261
                          +L     G +  + +  +++ VT ++++      +A+ +  L    
Sbjct: 287  FVCSATHKTKSMFFFLAQTEQGDIFKITLETDEDMVTEIRLKYFDTVPVAAAMCVLKTGF 346

Query: 262  VYIGSSYGDSQLIKLN------------------------LQPDAKGSYVEVLERYVNLG 297
            +++ S +G+  L ++                          QP    + V +++   +L 
Sbjct: 347  LFVASEFGNHYLYQIAHLGDDDEEPEFSSAMPLEEGDTFFFQPRPLKNLV-LVDELDSLS 405

Query: 298  PIVDFC-VVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRS 355
            PI+ FC + DL  +   Q+    G     SLR++R+G+ ++E A  EL G    +W++R 
Sbjct: 406  PIL-FCQIADLANEDTPQLYVACGRGPRSSLRVLRHGLEVSEMAVSELPGNPNAVWTVRR 464

Query: 356  STDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSG 415
              +D FD +++VSF++ T +L++   + +EE    GF   T TL C     + LVQV   
Sbjct: 465  HIEDEFDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSLLGDDALVQVYPD 522

Query: 416  SVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKH 473
             +R + +  R   NEWK+P   ++     N  QV++A  GG LVY E+   G L E  + 
Sbjct: 523  GIRHIRADKR--VNEWKTPGKKTIVKCAVNQRQVVIALTGGELVYFEMDPSGQLNEYTER 580

Query: 474  AQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIP-RS 532
             ++  ++ C+ +  +      S+  AVG+  D +VRI SL   + +  + L  + +P + 
Sbjct: 581  KEMSADVVCMSLANVPPGEQRSRFLAVGL-VDNTVRIISLDPSDCL--QPLSMQALPAQP 637

Query: 533  VLLCAFE----------------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQ 576
              LC  E                G  YL   L +G LL  +L+  TG+L+D +   LG++
Sbjct: 638  ESLCIVEMGGTEKQDELGERGSIGFLYLNIGLQNGVLLRTVLDPVTGDLSDTRTRYLGSR 697

Query: 577  PITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIA 636
            P+ L     +    V A S R  + YS   +   + ++ + +     F S   P+ +   
Sbjct: 698  PVKLFRVRMQGQEAVLAMSSRSWLSYSYQSRFHLTPLSYETLEFASGFASEQCPEGIVAI 757

Query: 637  KEGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQEQSR-----------TFAICSLKNQ 683
                L I  ++ +  +  + + PL   PR+ + H E +            T A  + + Q
Sbjct: 758  STNTLRILALEKLGAVFNQVAFPLQYTPRKFVIHPESNNLIIIETDHNAYTEATKAQRKQ 817

Query: 684  SCAEE-------------SEM-------------------------HFVRLLDDQTFEFI 705
              AEE             +EM                           +R+++      +
Sbjct: 818  QMAEEMVEAAGEDERELAAEMAAAFLNENLPESIFGAPKAGNGQWASVIRVMNPIQGNTL 877

Query: 706  STYPLDTFEYGCSILSCSFSDDSNVYYC-VGTAYVLPEENEPTKGRILVF--IVEDG-KL 761
                L+  E   S+  C FS+    +Y  VG A  L        G  +    +V +G KL
Sbjct: 878  DLVQLEQNEAAFSVAVCRFSNTGEDWYVLVGVAKDLILNPRSVAGGFVYTYKLVNNGEKL 937

Query: 762  QLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY- 820
            + + +   +    ++  F G++L  + + +++Y     D G ++L  +C  + HI A Y 
Sbjct: 938  EFLHKTPVEEVPAAIAPFQGRVLIGVGKLLRVY-----DLGKKKLLRKC-ENKHI-ANYI 990

Query: 821  --VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENN 878
              +QT G  ++V D+ +S   + YK  E  +   A D    W++   +LD D   GA+  
Sbjct: 991  SGIQTIGHRVIVSDVQESFIWVRYKRNENQLIIFADDTYPRWVTTASLLDYDTVAGADKF 1050

Query: 879  FNLFTVR--KNSEGATDE---------ERGRL-------EVVGEYHLGEFVNRFRHGSLV 920
             N+  VR   N+    DE         +RG L       EV+  YH+GE V   +  +L+
Sbjct: 1051 GNICVVRLPPNTNDEVDEDPTGNKALWDRGLLNGASQKAEVIMNYHVGETVLSLQKTTLI 1110

Query: 921  MRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQW 979
                    G   ++++ T++G IG++     HE + F + ++ +LR     + G +H  +
Sbjct: 1111 -------PGGSESLVYTTLSGGIGILVPFTSHEDHDFFQHVEMHLRSEHPPLCGRDHLSF 1163

Query: 980  RSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            RS+         KN +DGDL E F  +   +   +S+ ++ +  E+ K++E++
Sbjct: 1164 RSY-----YFPVKNVIDGDLCEQFNSMEPNKQKNVSEELDRTPPEVSKKLEDI 1211


>gi|109129162|ref|XP_001107025.1| PREDICTED: splicing factor 3B subunit 3-like isoform 1 [Macaca
            mulatta]
 gi|297284421|ref|XP_002802591.1| PREDICTED: splicing factor 3B subunit 3-like isoform 2 [Macaca
            mulatta]
          Length = 1199

 Score =  214 bits (545), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 272/1193 (22%), Positives = 496/1193 (41%), Gaps = 195/1193 (16%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            +L V ++G I +L  FR  G  +D++ + ++  +  +L++   S  +  +   +   + G
Sbjct: 35   LLTVEVFGVIRSLMAFRLTGGTKDYIVVGSDSGRIVILEYQP-SKNMFEKIHQETFGKSG 93

Query: 61   -RPTDNGQIGIIDPDCRLIGLHLYDG--LFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
             R    GQ   +DP  R + +   +   L  ++  D   +L  +  +   +   L    +
Sbjct: 94   CRRIVPGQFLAVDPKGRAVMISAIEKQKLVYILNRDAAARLTISSPLEAHKANTLVYHVV 153

Query: 118  ---YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNG 163
                G   P    L  D ++A +  T E A   +    F E     N+        L+  
Sbjct: 154  GVDVGFENPMFACLEMDYEEADNDPTGEAAANTQQTLTFYELDLGLNHVVRKYSEPLEEH 213

Query: 164  ADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR--- 215
             + LI VP     P   GVLI  E  I Y   N      IR  I +    +D D  R   
Sbjct: 214  GNFLITVPGGSDGP--SGVLICSENYITY--KNFGDQPDIRCPIPRRRNDLD-DPERGMI 268

Query: 216  --------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAV 261
                          +L     G +  + +  +++ VT ++++      +A+ +  L    
Sbjct: 269  FVCSATHKTKSMFFFLAQTEQGDIFKITLETDEDMVTEIRLKYFDTVPVAAAMCVLKTGF 328

Query: 262  VYIGSSYGDSQLIKLN------------------------LQPDAKGSYVEVLERYVNLG 297
            +++ S +G+  L ++                          QP    + V +++   +L 
Sbjct: 329  LFVASEFGNHYLYQIAHLGDDDEEPEFSSAMPLEEGDTFFFQPRPLKNLV-LVDELDSLS 387

Query: 298  PIVDFC-VVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRS 355
            PI+ FC + DL  +   Q+    G     SLR++R+G+ ++E A  EL G    +W++R 
Sbjct: 388  PIL-FCQIADLANEDTPQLYVACGRGPRSSLRVLRHGLEVSEMAVSELPGNPNAVWTVRR 446

Query: 356  STDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSG 415
              +D FD +++VSF++ T +L++   + +EE    GF   T TL C     + LVQV   
Sbjct: 447  HIEDEFDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSLLGDDALVQVYPD 504

Query: 416  SVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKH 473
             +R + +  R   NEWK+P   ++     N  QV++A  GG LVY E+   G L E  + 
Sbjct: 505  GIRHIRADKR--VNEWKTPGKKTIVKCAVNQRQVVIALTGGELVYFEMDPSGQLNEYTER 562

Query: 474  AQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIP-RS 532
             ++  ++ C+ +  +      S+  AVG+  D +VRI SL   + +  + L  + +P + 
Sbjct: 563  KEMSADVVCMSLANVPPGEQRSRFLAVGL-VDNTVRIISLDPSDCL--QPLSMQALPAQP 619

Query: 533  VLLCAFE----------------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQ 576
              LC  E                G  YL   L +G LL  +L+  TG+L+D +   LG++
Sbjct: 620  ESLCIVEMGGTEKQDELGERGSIGFLYLNIGLQNGVLLRTVLDPVTGDLSDTRTRYLGSR 679

Query: 577  PITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIA 636
            P+ L     +    V A S R  + YS   +   + ++ + +     F S   P+ +   
Sbjct: 680  PVKLFRVRMQGQEAVLAMSSRSWLSYSYQSRFHLTPLSYETLEFASGFASEQCPEGIVAI 739

Query: 637  KEGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQEQSR-----------TFAICSLKNQ 683
                L I  ++ +  +  + + PL   PR+ + H E +            T A  + + Q
Sbjct: 740  STNTLRILALEKLGAVFNQVAFPLQYTPRKFVIHPESNNLIIIETDHNAYTEATKAQRKQ 799

Query: 684  SCAEE-------------SEM-------------------------HFVRLLDDQTFEFI 705
              AEE             +EM                           +R+++      +
Sbjct: 800  QMAEEMVEAAGEDERELAAEMAAAFLNENLPESIFGAPKAGNGQWASVIRVMNPIQGNTL 859

Query: 706  STYPLDTFEYGCSILSCSFSDDSNVYYC-VGTAYVLPEENEPTKGRILVF--IVEDG-KL 761
                L+  E   S+  C FS+    +Y  VG A  L        G  +    +V +G KL
Sbjct: 860  DLVQLEQNEAAFSVAVCRFSNTGEDWYVLVGVAKDLILNPRSVAGGFVYTYKLVNNGEKL 919

Query: 762  QLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY- 820
            + + +   +    ++  F G++L  + + +++Y     D G ++L  +C  + HI A Y 
Sbjct: 920  EFLHKTPVEEVPAAIAPFQGRVLIGVGKLLRVY-----DLGKKKLLRKC-ENKHI-ANYI 972

Query: 821  --VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENN 878
              +QT G  ++V D+ +S   + YK  E  +   A D    W++   +LD D   GA+  
Sbjct: 973  SGIQTIGHRVIVSDVQESFIWVRYKRNENQLIIFADDTYPRWVTTASLLDYDTVAGADKF 1032

Query: 879  FNLFTVR--KNSEGATDE---------ERGRL-------EVVGEYHLGEFVNRFRHGSLV 920
             N+  VR   N+    DE         +RG L       EV+  YH+GE V   +  +L+
Sbjct: 1033 GNICVVRLPPNTNDEVDEDPTGNKALWDRGLLNGASQKAEVIMNYHVGETVLSLQKTTLI 1092

Query: 921  MRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQW 979
                    G   ++++ T++G IG++     HE + F + ++ +LR     + G +H  +
Sbjct: 1093 -------PGGSESLVYTTLSGGIGILVPFTSHEDHDFFQHVEMHLRSEHPPLCGRDHLSF 1145

Query: 980  RSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            RS+         KN +DGDL E F  +   +   +S+ ++ +  E+ K++E++
Sbjct: 1146 RSY-----YFPVKNVIDGDLCEQFNSMEPNKQKNVSEELDRTPPEVSKKLEDI 1193


>gi|197101659|ref|NP_001125500.1| splicing factor 3B subunit 3 [Pongo abelii]
 gi|75042033|sp|Q5RBI5.1|SF3B3_PONAB RecName: Full=Splicing factor 3B subunit 3; AltName:
            Full=Pre-mRNA-splicing factor SF3b 130 kDa subunit;
            Short=SF3b130; AltName: Full=Spliceosome-associated
            protein 130; Short=SAP 130
 gi|55728259|emb|CAH90875.1| hypothetical protein [Pongo abelii]
          Length = 1217

 Score =  214 bits (545), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 272/1193 (22%), Positives = 496/1193 (41%), Gaps = 195/1193 (16%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            +L V ++G I +L  FR  G  +D++ + ++  +  +L++   S  +  +   +   + G
Sbjct: 53   LLTVEVFGVIRSLMAFRLTGGTKDYIVVGSDSGRIVILEYQP-SKNMFEKIHQETFGKSG 111

Query: 61   -RPTDNGQIGIIDPDCRLIGLHLYDG--LFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
             R    GQ   +DP  R + +   +   L  ++  D   +L  +  +   +   L    +
Sbjct: 112  CRRIVPGQFLAVDPKGRAVMISAIEKQKLVYILNRDAAARLTISSPLEAHKANTLVYHVV 171

Query: 118  ---YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNG 163
                G   P    L  D ++A +  T E A   +    F E     N+        L+  
Sbjct: 172  GVDVGFENPMFACLEMDYEEADNDPTGEAAANTQQTLTFYELDLGLNHVVRKYSEPLEEH 231

Query: 164  ADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR--- 215
             + LI VP     P   GVLI  E  I Y   N      IR  I +    +D D  R   
Sbjct: 232  GNFLITVPGGSDGP--SGVLICSENYITY--KNFGDQPDIRCPIPRRRNDLD-DPERGMI 286

Query: 216  --------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAV 261
                          +L     G +  + +  +++ VT ++++      +A+ +  L    
Sbjct: 287  FVCSATHKTKSMFFFLAQTEQGDIFKITLETDEDMVTEIRLKYFDTVPVAAAMCVLKTGF 346

Query: 262  VYIGSSYGDSQLIKLN------------------------LQPDAKGSYVEVLERYVNLG 297
            +++ S +G+  L ++                          QP    + V +++   +L 
Sbjct: 347  LFVASEFGNHYLYQIAHLGDDDEEPEFSSAMPLEEGDTFFFQPRPLKNLV-LVDELDSLS 405

Query: 298  PIVDFC-VVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRS 355
            PI+ FC + DL  +   Q+    G     SLR++R+G+ ++E A  EL G    +W++R 
Sbjct: 406  PIL-FCQIADLANEDTPQLYVACGRGPRSSLRVLRHGLEVSETAVSELPGNPNAVWTVRR 464

Query: 356  STDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSG 415
              +D FD +++VSF++ T +L++   + +EE    GF   T TL C     + LVQV   
Sbjct: 465  HIEDEFDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSLLGDDALVQVYPD 522

Query: 416  SVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKH 473
             +R + +  R   NEWK+P   ++     N  QV++A  GG LVY E+   G L E  + 
Sbjct: 523  GIRHIRADKR--VNEWKTPGKKTIVKCAVNQRQVVIALTGGELVYFEMDPSGQLNEYTER 580

Query: 474  AQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIP-RS 532
             ++  ++ C+ +  +      S+  AVG+  D +VRI SL   + +  + L  + +P + 
Sbjct: 581  KEMSADVVCMSLANVPPGEQRSRFLAVGL-VDNTVRIISLDPSDCL--QPLSMQALPAQP 637

Query: 533  VLLCAFE----------------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQ 576
              LC  E                G  YL   L +G LL  +L+  TG+L+D +   LG++
Sbjct: 638  ESLCIVEMGGTEKQDELGERGSIGFLYLNIGLQNGVLLRTVLDPVTGDLSDTRTRYLGSR 697

Query: 577  PITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIA 636
            P+ L     +    V A S R  + YS   +   + ++ + +     F S   P+ +   
Sbjct: 698  PVKLFRVRMQGQEAVLAMSSRSWLSYSYQSRFHLTPLSYETLEFASGFASEQCPEGIVAI 757

Query: 637  KEGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQEQSR-----------TFAICSLKNQ 683
                L I  ++ +  +  + + PL   PR+ + H E +            T A  + + Q
Sbjct: 758  STNTLRILALEKLGAVFNQVAFPLQYTPRKFVIHPESNNLIIIETDHNAYTEATKAQRKQ 817

Query: 684  SCAEE-------------SEM-------------------------HFVRLLDDQTFEFI 705
              AEE             +EM                           +R+++      +
Sbjct: 818  QMAEEMVEAAGEDERELAAEMAAAFLNENLPESIFGAPKAGSGQWASVIRVMNPIQGNTL 877

Query: 706  STYPLDTFEYGCSILSCSFSDDSNVYYC-VGTAYVLPEENEPTKGRILVF--IVEDG-KL 761
                L+  E   S+  C FS+    +Y  VG A  L        G  +    +V +G KL
Sbjct: 878  DLVQLEQNEAAFSVAVCRFSNTGEDWYVLVGVAKDLILNPRSVAGGFVYTYKLVNNGEKL 937

Query: 762  QLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY- 820
            + + +   +    ++  F G++L  + + +++Y     D G ++L  +C  + HI A Y 
Sbjct: 938  EFLHKTPVEEVPAAIAPFQGRVLIGVGKLLRVY-----DLGKKKLLRKC-ENKHI-ANYI 990

Query: 821  --VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENN 878
              +QT G  ++V D+ +S   + YK  E  +   A D    W++   +LD D   GA+  
Sbjct: 991  SGIQTIGHRVIVSDVQESFIWVRYKRNENQLIIFADDTYPRWVTTASLLDYDTVAGADKF 1050

Query: 879  FNLFTVR--KNSEGATDE---------ERGRL-------EVVGEYHLGEFVNRFRHGSLV 920
             N+  VR   N+    DE         +RG L       EV+  YH+GE V   +  +L+
Sbjct: 1051 GNICVVRLPPNTNDEVDEDPTGNKALRDRGLLNGASQKAEVIMNYHVGETVLSLQKTTLI 1110

Query: 921  MRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQW 979
                    G   ++++ T++G IG++     HE + F + ++ +LR     + G +H  +
Sbjct: 1111 -------PGGSESLVYTTLSGGIGILVPFTSHEDHDFFQHVEMHLRSEHPPLCGRDHLSF 1163

Query: 980  RSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            RS+         KN +DGDL E F  +   +   +S+ ++ +  E+ K++E++
Sbjct: 1164 RSY-----YFPVKNVIDGDLCEQFNSMEPNKQKNVSEELDRTPPEVTKKLEDI 1211


>gi|425768510|gb|EKV07031.1| Pre-mRNA-splicing factor rse1 [Penicillium digitatum PHI26]
 gi|425775700|gb|EKV13954.1| Pre-mRNA-splicing factor rse1 [Penicillium digitatum Pd1]
          Length = 1209

 Score =  214 bits (545), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 289/1169 (24%), Positives = 490/1169 (41%), Gaps = 171/1169 (14%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
            ++G I +L  FR  G  +D++ I ++  +  ++++   S     R   +   + G R   
Sbjct: 64   VFGIIRSLAAFRVAGSNKDYIIIGSDSGRITIVEY-VPSQNRFNRIHMETFGKSGVRRVV 122

Query: 65   NGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YG 119
             GQ   +DP  R  LI     + L  V+  +++ +L  +  +   + Q L    +    G
Sbjct: 123  PGQYLAVDPKGRACLIASVEKNKLVYVLNRNSQAELTISSPLEAHKPQTLVFAMICLDVG 182

Query: 120  CAKPTIVVLYQDNKDARHVKT-------------YEVALKDKDFVEGPWSQNNLDNGADL 166
               P    L  D  ++    T             YE+ L     V   WS + +D  A +
Sbjct: 183  YDNPVFAALEVDYSESDQDPTGQAYEEIEKVLVYYELDLGLNHVVRK-WS-DTVDRTASM 240

Query: 167  LIPVP-----PPLCGVLIIGEETIVYCSAN--AFKA-IPIRPSITKAYGRVDADGSR--- 215
            L  VP     P   GVL+ GE+ I Y  +N  AF+  IP R   T+   R  +  S    
Sbjct: 241  LFQVPGGDDGP--SGVLVCGEDNITYRHSNQGAFRVPIPRRSGATENPERKRSIVSGVMH 298

Query: 216  -------YLLGDHAG---LLHLLVITHEKEKVTG----LKIELLGETSIASTISYLDNAV 261
                   +LL    G    L+L ++  ++ + TG    L I+      +++ +  L +  
Sbjct: 299  KMRGAFFFLLQTEDGDLFKLNLEMVEDKQGQPTGEVKRLTIKYFDTIPVSTNLLILKSGF 358

Query: 262  VYIGSSYGDS---QLIKLNLQPD-----------------------AKGS-YVEVLERYV 294
            +Y  S  G+    Q  KL   PD                        +G+  V ++E   
Sbjct: 359  LYAASESGNHNFYQFEKLGDDPDELFFSSDSFSADHSVPSAPVFFHPRGAENVNLMESMN 418

Query: 295  NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSL 353
            +L P++D  +++L      Q+ T  G+    S R +++G+ ++E    +LQ +   +W+ 
Sbjct: 419  SLSPLIDSKILNLSEDDAPQIYTICGSGARSSFRTLKHGLEVSEIVESDLQQVPSAVWTT 478

Query: 354  RSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVT 413
            + +  D F T++++SF + T +L++   + +EE    GF S   TL       + LVQV 
Sbjct: 479  KLTRADEFHTYIILSFANGTLVLSIG--EIVEEVSDTGFLSSAPTLAVQQLGEDSLVQVH 536

Query: 414  SGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTEV- 471
               +R + +  R   NEW +P   S+  A  N  QV +A   G +VY E+  DG L E  
Sbjct: 537  PRGIRHILADQRV--NEWPAPQHRSIVAAATNERQVAVALSSGEIVYFEMDADGTLAEYD 594

Query: 472  KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIP 530
            +  Q+   ++ L +  + E    S   AVG   D +VRI SL PD  L  K        P
Sbjct: 595  ERRQMSGTVTSLSMGEVPEGRMRSSFLAVGC-DDSTVRILSLDPDSTLENKSVQALTSAP 653

Query: 531  RSVLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFS 584
             ++ + A    S      YL   L  G  L  +L+  TGEL+D +   +G +P+ L   S
Sbjct: 654  SALQIMAMADSSSGGTTLYLHIGLYSGVYLRTVLDEVTGELSDTRTRFIGAKPVKLSQVS 713

Query: 585  SKNTTHVFAASDRPTVIYS--SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELT 642
             K  T V A S RP + YS    K  + + ++   +     F+S    + +   +   L 
Sbjct: 714  VKGQTAVLALSTRPWLGYSDIQTKSFMLTPLDYVGLEWGWNFSSEQCVEGMIGIQGRSLR 773

Query: 643  IGTIDDIQKLHIR-SIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQ- 700
            I T++ +    ++ SIPL   PRR         F +    N   +  +     RL+DD  
Sbjct: 774  IFTVEKLDNNMLQESIPLSYTPRRFVKHPDQHLFYVIESDNNVLSPATRQ---RLIDDSQ 830

Query: 701  ----------------------------------TFEFISTYPLDTFEYGCSILSCSFSD 726
                                              T   IST  L+  E   S+ + SFS 
Sbjct: 831  AQNGEVADLPPADFGYPRATGHWASCVQIVDPITTKSVISTLDLEDNEAAVSLAAVSFSS 890

Query: 727  -DSNVYYCVGTA---YVLPEENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVYSLNAFN 780
             D   +  VGTA    V P  +  + G I ++   EDG+ L+ I + +      +L  F 
Sbjct: 891  QDDETFLVVGTAKDMTVSPPSS--SCGFIHIYRFQEDGRELEFIHKTQVDEPPLALLGFQ 948

Query: 781  GKLLAAINQKIQLYKWMLRDDGTRELQSECGH---HGHILALYVQTRGDFIVVGDLMKSI 837
            G+LLA I   +++Y     D G ++L  +C        I+ L  QT+G  I+V D+ +S+
Sbjct: 949  GRLLAGIGPVLRVY-----DLGMKQLLRKCQAPVVPKTIVGL--QTQGSRIIVSDIRESV 1001

Query: 838  SLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDE 894
            + ++YK+++  +   A D  A W S+  ++D +   G +   NL+ VR   K SE A ++
Sbjct: 1002 TYVVYKYQDNVLIPFADDSIARWTSSTTMVDYETTAGGDKFGNLWLVRCPSKISEQADED 1061

Query: 895  --------ERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV--GQIPTVIFGTVNGVIG 944
                    E+G L   G  H  E +  F    +   L  + +  G    V++  + G IG
Sbjct: 1062 GSGAHLIHEKGYLH--GTPHRLELMVHFFAQDIPTSLHKTQLVAGGRDIVVWTGLQGTIG 1119

Query: 945  V-IASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESF 1003
            + +  +  E   F + L+T L      + G +H  +R +    K V     +DGDL E +
Sbjct: 1120 MFVPFVSREDVDFFQLLETQLASQQPPLAGRDHLMYRGYYAPVKGV-----IDGDLCEMY 1174

Query: 1004 LDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            L L       I+  ++ SV E+ +++ ++
Sbjct: 1175 LLLPNDTKLMIAGELDRSVREIERKISDM 1203


>gi|326676028|ref|XP_002667683.2| PREDICTED: splicing factor 3B subunit 3-like [Danio rerio]
 gi|123888555|sp|Q1LVE8.1|SF3B3_DANRE RecName: Full=Splicing factor 3B subunit 3
          Length = 1217

 Score =  214 bits (545), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 265/1190 (22%), Positives = 491/1190 (41%), Gaps = 189/1190 (15%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            +L + ++G + +L  FR  G  +D++ + ++  +  +L++   S  +  +   +   + G
Sbjct: 53   LLTMEVFGVVRSLMAFRLTGGTKDYVVVGSDSGRIVILEYHP-SKNMFEKIHQETFGKSG 111

Query: 61   -RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
             R    GQ   +DP  R  +IG      L  ++  D   +L  +  +   +   L    +
Sbjct: 112  CRRIVPGQFLAVDPKGRAVMIGATEKQKLVYILNRDAAARLTISSPLEAHKANTLVYHVV 171

Query: 118  ---YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNG 163
                G   P    L  D ++A +  T E A   +    F E     N+        L+  
Sbjct: 172  GVDVGFENPMFACLEMDYEEADNDPTGEAAANTQQTLTFYELDLGLNHVVRKYSEALEEH 231

Query: 164  ADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR--- 215
             + LI VP     P   GVLI  E  I Y   N      IR  I +    +D D  R   
Sbjct: 232  GNFLITVPGGSDGP--SGVLICSENYITY--KNFGDQPDIRCPIPRRRNDLD-DPERGMI 286

Query: 216  --------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAV 261
                          +L     G +  + +  ++E VT ++++      +A+ +  L    
Sbjct: 287  FVCSATHKTKSMFFFLAQTEQGDIFKVTLETDEEMVTEIRMKYFDTIPVATAMCVLKTGF 346

Query: 262  VYIGSSYGDSQLIKLN------------------------LQPDAKGSYVEVLERYVNLG 297
            +++ S +G+  L ++                          QP    + V V E+  +L 
Sbjct: 347  LFVSSEFGNHYLYQIAHLGDDDEEPEFSSAMPLEEGDTFFFQPRPLKNLVLVDEQE-SLS 405

Query: 298  PIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSS 356
            PI+   + DL  +   Q+    G     +LR++R+G+ ++E A  EL G    +W++R  
Sbjct: 406  PIMSCQIADLANEDTPQLYVACGRGPRSTLRVLRHGLEVSEMAVSELPGNPNAVWTVRRH 465

Query: 357  TDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGS 416
             +D FD +++VSF++ T +L++   + +EE    GF   T TL C     + LVQV    
Sbjct: 466  VEDEFDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSLLGEDALVQVYPDG 523

Query: 417  VRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKHA 474
            +R + +  R   NEWK+P   ++     N  QV++A  GG LVY E+   G L E  +  
Sbjct: 524  IRHIRADKR--VNEWKTPGKKTIIRCAVNQRQVVIALTGGELVYFEMDPSGQLNEYTERK 581

Query: 475  QLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPD---LNLITKEHLGGEIIPR 531
            ++  ++ C+ +  +      S+  AVG+  D +VRI SL     L  ++ + L  +  P 
Sbjct: 582  EMSADVVCMSLANVPPGEQRSRFLAVGL-VDNTVRIISLDPSDCLQPLSMQALPAQ--PE 638

Query: 532  SVLLCAFEGIS--------------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQP 577
            S+ +    G+               YL   L +G LL  +L+  TG+L+D +   LG++P
Sbjct: 639  SLCIVEMGGVEKQDELGEKGTIGFLYLNIGLQNGVLLRTVLDPVTGDLSDTRTRYLGSRP 698

Query: 578  ITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAK 637
            + L     +    V A S R  + YS   +   + ++ + + +   F S   P+ +    
Sbjct: 699  VKLFRVRMQGQEAVLAMSSRSWLSYSYQSRFHLTPLSYETLEYASGFASEQCPEGIVAIS 758

Query: 638  EGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQE-----------QSRTFAICSLKNQS 684
               L I  ++ +  +  + + PL   PR+ + H E            + T A  + + Q 
Sbjct: 759  TNTLRILALEKLGAVFNQVAFPLQYTPRKFVIHPETNNLILIETDHNAYTEATKAQRKQQ 818

Query: 685  CAEE-------------SEM-------------------------HFVRLLDDQTFEFIS 706
             AEE             +EM                           VRL++      + 
Sbjct: 819  MAEEMVEAAGEDERELAAEMAAAFLNENLPEAIFGAPKAGSGQWASLVRLINPIQGNTLD 878

Query: 707  TYPLDTFEYGCSILSCSFSDDSNVYYC-VGTAY-VLPEENEPTKGRILVFIVEDG--KLQ 762
               L+  E   S+  C F +  + +Y  VG A  ++        G I  + +  G  KL+
Sbjct: 879  LVQLEQNEAAFSVAICRFLNGGDDWYVLVGVARDMILNPRSVGGGYIYTYRIVGGGDKLE 938

Query: 763  LIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSEC-GHHGHILALYV 821
             + +   +    ++  F G++L  + + +++Y     D G ++L  +C   H   L   +
Sbjct: 939  FLHKTPVEDVPLAIAPFQGRVLVGVGKLLRIY-----DLGKKKLLRKCENKHVPNLVTGI 993

Query: 822  QTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNL 881
             T G  ++V D+ +S+  + Y+  E  +   A D    W++   +LD D    A+   N+
Sbjct: 994  HTIGQRVIVSDVQESLFWVRYRRNENQLIIFADDTYPRWITTACLLDYDTMASADKFGNI 1053

Query: 882  FTVR--KNSEGATDE---------ERGRL-------EVVGEYHLGEFVNRFRHGSLVMRL 923
              VR   N+    DE         +RG L       E++  YH+GE V   +  +L+   
Sbjct: 1054 CVVRLPPNTSDDVDEDPTGNKALWDRGLLNGASQKAEIIINYHIGETVLSLQKTTLI--- 1110

Query: 924  PDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSF 982
                 G   ++++ T++G IG++     HE + F + L+ ++R     + G +H  +RS+
Sbjct: 1111 ----PGGSESLVYTTLSGGIGILVPFTSHEDHDFFQHLEMHMRSEFPPLCGRDHLSFRSY 1166

Query: 983  NNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
                     KN +DGDL E F  +   +   +S+ ++ +  E+ K++E++
Sbjct: 1167 -----YFPVKNVIDGDLCEQFNSMDPHKQKSVSEELDRTPPEVSKKLEDI 1211


>gi|19527174|ref|NP_598714.1| splicing factor 3B subunit 3 [Mus musculus]
 gi|297207121|ref|NP_001099657.2| splicing factor 3B subunit 3 [Rattus norvegicus]
 gi|354477789|ref|XP_003501101.1| PREDICTED: splicing factor 3B subunit 3 [Cricetulus griseus]
 gi|81879817|sp|Q921M3.1|SF3B3_MOUSE RecName: Full=Splicing factor 3B subunit 3; AltName:
            Full=Pre-mRNA-splicing factor SF3b 130 kDa subunit;
            Short=SF3b130; AltName: Full=Spliceosome-associated
            protein 130; Short=SAP 130
 gi|15030278|gb|AAH11412.1| Splicing factor 3b, subunit 3 [Mus musculus]
 gi|26353236|dbj|BAC40248.1| unnamed protein product [Mus musculus]
 gi|27503728|gb|AAH42580.1| Splicing factor 3b, subunit 3 [Mus musculus]
 gi|148679525|gb|EDL11472.1| splicing factor 3b, subunit 3 [Mus musculus]
 gi|187951307|gb|AAI39016.1| Splicing factor 3b, subunit 3 [Mus musculus]
 gi|187954163|gb|AAI39017.1| Splicing factor 3b, subunit 3 [Mus musculus]
 gi|344248014|gb|EGW04118.1| Splicing factor 3B subunit 3 [Cricetulus griseus]
          Length = 1217

 Score =  214 bits (545), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 272/1193 (22%), Positives = 496/1193 (41%), Gaps = 195/1193 (16%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            +L V ++G I +L  FR  G  +D++ + ++  +  +L++   S  +  +   +   + G
Sbjct: 53   LLTVEVFGVIRSLMAFRLTGGTKDYIVVGSDSGRIVILEYQP-SKNMFEKIHQETFGKSG 111

Query: 61   -RPTDNGQIGIIDPDCRLIGLHLYDG--LFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
             R    GQ   +DP  R + +   +   L  ++  D   +L  +  +   +   L    +
Sbjct: 112  CRRIVPGQFLAVDPKGRAVMISAIEKQKLVYILNRDAAARLTISSPLEAHKANTLVYHVV 171

Query: 118  ---YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNG 163
                G   P    L  D ++A +  T E A   +    F E     N+        L+  
Sbjct: 172  GVDVGFENPMFACLEMDYEEADNDPTGEAAANTQQTLTFYELDLGLNHVVRKYSEPLEEH 231

Query: 164  ADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR--- 215
             + LI VP     P   GVLI  E  I Y   N      IR  I +    +D D  R   
Sbjct: 232  GNFLITVPGGSDGP--SGVLICSENYITY--KNFGDQPDIRCPIPRRRNDLD-DPERGMI 286

Query: 216  --------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAV 261
                          +L     G +  + +  +++ VT ++++      +A+ +  L    
Sbjct: 287  FVCSATHKTKSMFFFLAQTEQGDIFKITLETDEDMVTEIRLKYFDTVPVAAAMCVLKTGF 346

Query: 262  VYIGSSYGDSQLIKLN------------------------LQPDAKGSYVEVLERYVNLG 297
            +++ S +G+  L ++                          QP    + V +++   +L 
Sbjct: 347  LFVASEFGNHYLYQIAHLGDDDEEPEFSSAMPLEEGDTFFFQPRPLKNLV-LVDELDSLS 405

Query: 298  PIVDFC-VVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRS 355
            PI+ FC + DL  +   Q+    G     SLR++R+G+ ++E A  EL G    +W++R 
Sbjct: 406  PIL-FCQIADLANEDTPQLYVACGRGPRSSLRVLRHGLEVSEMAVSELPGNPNAVWTVRR 464

Query: 356  STDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSG 415
              +D FD +++VSF++ T +L++   + +EE    GF   T TL C     + LVQV   
Sbjct: 465  HIEDEFDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSLLGDDALVQVYPD 522

Query: 416  SVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKH 473
             +R + +  R   NEWK+P   ++     N  QV++A  GG LVY E+   G L E  + 
Sbjct: 523  GIRHIRADKR--VNEWKTPGKKTIVKCAVNQRQVVIALTGGELVYFEMDPSGQLNEYTER 580

Query: 474  AQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIP-RS 532
             ++  ++ C+ +  +      S+  AVG+  D +VRI SL   + +  + L  + +P + 
Sbjct: 581  KEMSADVVCMSLANVPPGEQRSRFLAVGL-VDNTVRIISLDPSDCL--QPLSMQALPAQP 637

Query: 533  VLLCAFE----------------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQ 576
              LC  E                G  YL   L +G LL  +L+  TG+L+D +   LG++
Sbjct: 638  ESLCIVEMGGTEKQDELGERGSIGFLYLNIGLQNGVLLRTVLDPVTGDLSDTRTRYLGSR 697

Query: 577  PITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIA 636
            P+ L     +    V A S R  + YS   +   + ++ + +     F S   P+ +   
Sbjct: 698  PVKLFRVRMQGQEAVLAMSSRSWLSYSYQSRFHLTPLSYETLEFASGFASEQCPEGIVAI 757

Query: 637  KEGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQEQSR-----------TFAICSLKNQ 683
                L I  ++ +  +  + + PL   PR+ + H E +            T A  + + Q
Sbjct: 758  STNTLRILALEKLGAVFNQVAFPLQYTPRKFVIHPESNNLIIIETDHNAYTEATKAQRKQ 817

Query: 684  SCAEE-------------SEM-------------------------HFVRLLDDQTFEFI 705
              AEE             +EM                           +R+++      +
Sbjct: 818  QMAEEMVEAAGEDERELAAEMAAAFLNENLPESIFGAPKAGNGQWASVIRVMNPIQGNTL 877

Query: 706  STYPLDTFEYGCSILSCSFSDDSNVYYC-VGTAYVLPEENEPTKGRILVF--IVEDG-KL 761
                L+  E   S+  C FS+    +Y  VG A  L        G  +    +V +G KL
Sbjct: 878  DLVQLEQNEAAFSVAVCRFSNTGEDWYVLVGVAKDLILSPRSVAGGFVYTYKLVNNGEKL 937

Query: 762  QLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY- 820
            + + +   +    ++  F G++L  + + +++Y     D G ++L  +C  + HI A Y 
Sbjct: 938  EFLHKTPVEEVPAAIAPFQGRVLIGVGKLLRVY-----DLGKKKLLRKC-ENKHI-ANYI 990

Query: 821  --VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENN 878
              +QT G  ++V D+ +S   + YK  E  +   A D    W++   +LD D   GA+  
Sbjct: 991  SGIQTIGHRVIVSDVQESFIWVRYKRNENQLIIFADDTYPRWVTTASLLDYDTVAGADKF 1050

Query: 879  FNLFTVR--KNSEGATDE---------ERGRL-------EVVGEYHLGEFVNRFRHGSLV 920
             N+  VR   N+    DE         +RG L       EV+  YH+GE V   +  +L+
Sbjct: 1051 GNICVVRLPPNTNDEVDEDPTGNKALWDRGLLNGASQKAEVIMNYHVGETVLSLQKTTLI 1110

Query: 921  MRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQW 979
                    G   ++++ T++G IG++     HE + F + ++ +LR     + G +H  +
Sbjct: 1111 -------PGGSESLVYTTLSGGIGILVPFTSHEDHDFFQHVEMHLRSEHPPLCGRDHLSF 1163

Query: 980  RSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            RS+         KN +DGDL E F  +   +   +S+ ++ +  E+ K++E++
Sbjct: 1164 RSY-----YFPVKNVIDGDLCEQFNSMEPNKQKNVSEELDRTPPEVSKKLEDI 1211


>gi|158256968|dbj|BAF84457.1| unnamed protein product [Homo sapiens]
          Length = 1217

 Score =  214 bits (545), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 272/1193 (22%), Positives = 496/1193 (41%), Gaps = 195/1193 (16%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            +L V ++G I +L  FR  G  +D++ + ++  +  +L++   S  +  +   +   + G
Sbjct: 53   LLTVEVFGVIRSLMAFRLTGGTKDYIVVGSDSGRIVILEYQP-SKNMFEKIHQETFGKSG 111

Query: 61   -RPTDNGQIGIIDPDCRLIGLHLYD--GLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
             R    GQ   +DP  R + +   +   L  ++  D   +L  +  +   +   L    +
Sbjct: 112  CRRIVPGQFLAVDPKGRAVMISAIEEQKLVYILNRDAAARLTISSPLEAHKANTLVYHVV 171

Query: 118  ---YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNG 163
                G   P    L  D ++A +  T E A   +    F E     N+        L+  
Sbjct: 172  GVDVGFENPMFACLEMDYEEADNDPTGEAAANTQQTLTFYELDLGLNHVVRKYSEPLEEH 231

Query: 164  ADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR--- 215
             + LI VP     P   GVLI  E  I Y   N      IR  I +    +D D  R   
Sbjct: 232  GNFLITVPGGSDGP--SGVLICSENYITY--KNFGDQPDIRCPIPRRRNDLD-DPERGMI 286

Query: 216  --------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAV 261
                          +L     G +  + +  +++ VT ++++      +A+ +  L    
Sbjct: 287  FVCSATHKTKSMFFFLAQTEQGDIFKITLETDEDMVTEIRLKYFDTVPVAAAMCVLKTGF 346

Query: 262  VYIGSSYGDSQLIKLN------------------------LQPDAKGSYVEVLERYVNLG 297
            +++ S +G+  L ++                          QP    + V +++   +L 
Sbjct: 347  LFVASEFGNHYLYQIAHLGDDDEEPEFSSAMPLEEGDTFFFQPRPLKNLV-LVDELDSLS 405

Query: 298  PIVDFC-VVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRS 355
            PI+ FC + DL  +   Q+    G     SLR++R+G+ ++E A  EL G    +W++R 
Sbjct: 406  PIL-FCQIADLANEDTPQLYVACGRGPRSSLRVLRHGLEVSEMAVSELPGNPNAVWTVRR 464

Query: 356  STDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSG 415
              +D FD +++VSF++ T +L++   + +EE    GF   T TL C     + LVQV   
Sbjct: 465  HIEDEFDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSLLGDDALVQVYPD 522

Query: 416  SVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKH 473
             +R + +  R   NEWK+P   ++     N  QV++A  GG LVY E+   G L E  + 
Sbjct: 523  GIRHIRADKR--VNEWKTPGKKTIVKCAVNQRQVVIALTGGELVYFEMDPSGQLNEYTER 580

Query: 474  AQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIP-RS 532
             ++  ++ C+ +  +      S+  AVG+  D +VRI SL   + +  + L  + +P + 
Sbjct: 581  KEMSADVVCMSLANVPPGEQRSRFLAVGL-VDNTVRIISLDPSDCL--QPLSMQALPAQP 637

Query: 533  VLLCAFE----------------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQ 576
              LC  E                G  YL   L +G LL  +L+  TG+L+D +   LG++
Sbjct: 638  ESLCIVEMGGTEKQDELGERGSIGFLYLNIGLQNGVLLRTVLDPVTGDLSDTRTRYLGSR 697

Query: 577  PITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIA 636
            P+ L     +    V A S R  + YS   +   + ++ + +     F S   P+ +   
Sbjct: 698  PVKLFRVRMQGQEAVLAMSSRSWLSYSYQSRFHLTPLSYETLEFASGFASEQCPEGIVAI 757

Query: 637  KEGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQEQSR-----------TFAICSLKNQ 683
                L I  ++ +  +  + + PL   PR+ + H E +            T A  + + Q
Sbjct: 758  STNTLRILALEKLGAVFNQVAFPLQYTPRKFVIHPESNNLIIIETDHNAYTEATKAQRKQ 817

Query: 684  SCAEE-------------SEM-------------------------HFVRLLDDQTFEFI 705
              AEE             +EM                           +R+++      +
Sbjct: 818  QMAEEMVEAAGEDERELAAEMAAAFLNENLPESIFGAPKAGNGQWASVIRVMNPIQGNTL 877

Query: 706  STYPLDTFEYGCSILSCSFSDDSNVYYC-VGTAYVLPEENEPTKGRILVF--IVEDG-KL 761
                L+  E   S+  C FS+    +Y  VG A  L        G  +    +V +G KL
Sbjct: 878  DLVQLEQNEAAFSVAVCRFSNTGEDWYVLVGVAKDLILNPRSVAGGFVYTYKLVNNGEKL 937

Query: 762  QLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY- 820
            + + +   +    ++  F G++L  + + +++Y     D G ++L  +C  + HI A Y 
Sbjct: 938  EFLHKTPVEEVPAAIAPFQGRVLIGVGKLLRVY-----DLGKKKLLRKC-ENKHI-ANYI 990

Query: 821  --VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENN 878
              +QT G  ++V D+ +S   + YK  E  +   A D    W++   +LD D   GA+  
Sbjct: 991  SGIQTIGHRVIVSDVQESFIWVRYKRNENQLIIFADDTYPRWVTTASLLDYDTVAGADKF 1050

Query: 879  FNLFTVR--KNSEGATDE---------ERGRL-------EVVGEYHLGEFVNRFRHGSLV 920
             N+  VR   N+    DE         +RG L       EV+  YH+GE V   +  +L+
Sbjct: 1051 GNICVVRLPPNTNDEVDEDPTGNKALWDRGLLNGASQKAEVIMNYHVGETVLSLQKTTLI 1110

Query: 921  MRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQW 979
                    G   ++++ T++G IG++     HE + F + ++ +LR     + G +H  +
Sbjct: 1111 -------PGGSESLVYTTLSGGIGILVPFTSHEDHDFFQHVEMHLRSEHPPLCGRDHLSF 1163

Query: 980  RSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            RS+         KN +DGDL E F  +   +   +S+ ++ +  E+ K++E++
Sbjct: 1164 RSY-----YFPVKNVIDGDLCEQFNSMEPNKQKNVSEELDRTPPEVSKKLEDI 1211


>gi|321478392|gb|EFX89349.1| hypothetical protein DAPPUDRAFT_303178 [Daphnia pulex]
          Length = 1215

 Score =  214 bits (544), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 265/1187 (22%), Positives = 489/1187 (41%), Gaps = 185/1187 (15%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            +L   ++G I ++  FR  G  +D+L + ++  +  +L+++A +  +  +   +   + G
Sbjct: 53   LLSQEVFGIIRSIISFRLTGGNKDYLVVGSDSGRIVILEYNA-AKNVFDKVHQETYGKSG 111

Query: 61   -RPTDNGQIGIIDPDCRLIGLHLYDG--LFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
             R    GQ   IDP  R + +   +   L  ++  D +  L  +  +   +   L    +
Sbjct: 112  CRRIVPGQYMAIDPKGRAVMISAIEKQKLVYILNRDAQAHLTISSPLEAHKSNTLVYNIV 171

Query: 118  ---YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNG 163
                G   P    L  D ++A    T E A K +    F E     N+        LD  
Sbjct: 172  GVDVGFENPVFACLEMDYEEADGDPTGEAAQKTQQTLTFYELDLGLNHVVRKYSEPLDEH 231

Query: 164  ADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR--- 215
            A+ LI VP     P   GVL+  E  I Y   N  +   IR  I +    +D D  R   
Sbjct: 232  ANFLITVPGGNDGP--SGVLVCSENYITY--KNLGEQHDIRCPIPRRRNDLD-DPERGMI 286

Query: 216  --------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAV 261
                          +L+    G +  + +  ++E V  LKI+      ++S++  L    
Sbjct: 287  FVCSASHRTKAMFFFLVQTEQGDIFKITMDMDEEVVRELKIKYFDTVPVSSSMCVLKTGF 346

Query: 262  VYIGSSYGDSQLIKL------NLQPDAKGSY-----------------VEVLERYVNLGP 298
            +++ S +G+  L ++      + +P+   +                  + +++   +L P
Sbjct: 347  LFVASEFGNHYLYQIAHLGDDDDEPEFSSTIPLEEGDTFFFAPRPLKNLVLVDEMESLSP 406

Query: 299  IVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSST 357
            ++   + DL  +   Q+    G     SLR++R+G+ ++E A  EL G    +W++R   
Sbjct: 407  VLACHIADLANEDTPQLYALCGRSSRSSLRVLRHGLEVSEMAVSELPGNPNAVWTVRKRA 466

Query: 358  DDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSV 417
            DD FD ++VVSF++ T +L++   + +EE    GF   T TL C     N L+QV    +
Sbjct: 467  DDEFDAYIVVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLGCSSLGDNALLQVYPEGI 524

Query: 418  RLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD-GILTE-VKHAQ 475
            R + +  R   NEWKSP   +++    N  QV+++  GG LVY E+   G L E  +  +
Sbjct: 525  RHIRADRR--INEWKSPGKRAISRCAVNQRQVVISLAGGELVYFEMDPTGQLNEYTERKE 582

Query: 476  LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSV-L 534
            +  E+ C+ +  +      S+  AVG+  D +VR+ SL   + +T   L  + +P +   
Sbjct: 583  MTAEVVCMALANVPSGEQRSRFLAVGL-ADNTVRVISLDPNDCLTP--LSMQALPATPES 639

Query: 535  LCAFE--------------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITL 580
            LC  E              G  +L   L +G LL  +L+  TG+L D +   LG++P+ L
Sbjct: 640  LCIVEMGGHDKDSETTGTSGQLFLNIGLQNGVLLRTVLDPVTGDLADTRTRYLGSRPVKL 699

Query: 581  RTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGE 640
                ++N   V A S R  + Y    +   + ++   +     F S   P+ +       
Sbjct: 700  FRVMTQNHEAVLAISSRTWLSYYFQNRFHLAPLSYDSLEFASGFASEQCPEGVVAIASNT 759

Query: 641  LTIGTIDDIQKLHIR-SIPLGEHPRRICHQEQSRTFAICS-------------------- 679
            L I  ++ +  +  + S PL   PR+    +  R   I +                    
Sbjct: 760  LRILALEKLGAVFNQVSYPLEYTPRKFVVHDSGRMIIIETDHNAYTLETKLERRQQIAQE 819

Query: 680  LKNQSCAEESEM-----------------------------HFVRLLDDQTFEFISTYPL 710
            +++ +  EE E+                               VR++D  +   + T  L
Sbjct: 820  MRDAATEEEQELAHQMADAFLNEDLPESDFGAPKAGAGMWASIVRIMDPTSGNSLHTIRL 879

Query: 711  DTFEYGCSILSCSF--SDDSNVYYCVGTAYVLPEENEPTKGRILV---FIVEDGKLQLIA 765
               E   SI    F   D  + +  VG A  L    +   G  +     I E   L+ + 
Sbjct: 880  PQNEAALSIGLARFMNQDPEDYFVLVGVAKDLKLNPKQCDGGFIYTYKLINEWSALEFVH 939

Query: 766  EKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSEC-GHHGHILALYVQTR 824
            +   +   Y++ A+ G++L  + + ++LY     D G ++L  +C   H     + +Q  
Sbjct: 940  KTPVEDTPYAICAYQGRVLIGVGRLLRLY-----DMGKKKLLRKCENKHLPSCVVNIQAL 994

Query: 825  GDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTV 884
            G  + V D+ +SI  + YK  E  +   A D +  +++ + +LD D    A+   N+  +
Sbjct: 995  GQRVYVADVQESIHFVRYKRMENQLIIFADDTHPRYVTTMCLLDYDTVAVADKFGNISVL 1054

Query: 885  RKNSEGATDE-----------ERGRL-------EVVGEYHLGEFVNRFRHGSLVMRLPDS 926
            R  S  + D            +RG L       EV+   H+GE     +  +L+      
Sbjct: 1055 RLPSRTSDDVDEDPTGNKSFWDRGVLNGASNKAEVLCNIHIGETALSLQRATLI------ 1108

Query: 927  DVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNE 985
              G   ++++ T++G IGV+      E + F + L+ +LR     + G +H  +RSF   
Sbjct: 1109 -PGGSESLVYTTLSGSIGVLVPFTSREDHDFFQALEMHLRSEHPPLCGRDHLAFRSF--- 1164

Query: 986  KKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
                  KN +DGDL E F  +  ++   ++  ++ +  E+ K++E++
Sbjct: 1165 --YFPVKNVIDGDLCEQFNSIDSSKQKAVADDLDRAPNEVSKKLEDI 1209


>gi|407919154|gb|EKG12409.1| Cleavage/polyadenylation specificity factor A subunit [Macrophomina
            phaseolina MS6]
          Length = 1210

 Score =  214 bits (544), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 269/1178 (22%), Positives = 478/1178 (40%), Gaps = 181/1178 (15%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
            ++G I  L  FR  G ++D++ +A++  +  +L+++ + +      +        R    
Sbjct: 64   VFGVIRALAAFRLAGSSKDYIIVASDSGRIAILEYNPQKNIFTQLKLETFGKSGVRRVIP 123

Query: 66   GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YGC 120
            GQ    DP  R  +I     + L  V+  D+   L  +  +     Q      +    G 
Sbjct: 124  GQYLATDPKGRACMIASIEKNKLVYVLNRDSNANLTISSPLEAHRPQTFVFSLIGLDVGY 183

Query: 121  AKPTIVVLYQDNKDARHVKTYEVAL-KDKDFVE-----------GPWSQNNLDNGADLLI 168
              P    L  D  D     T EVA  K K+ V              WS + +D  A++L 
Sbjct: 184  ENPIFAALEVDYDDVDQDHTGEVAASKQKELVYYELDLGLNHVVRKWS-DAVDRTANMLF 242

Query: 169  PVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVD-ADGSRYLLGDHA 222
             VP     P   GVL+ GEE+I Y   N     P R  I +  G  +     RY+    A
Sbjct: 243  QVPGGQDGP--SGVLVCGEESITYRHINQ---DPFRVPIPRRKGATENPQRKRYIT---A 294

Query: 223  GLLHLL---------------------VITHEKEKVTG----LKIELLGETSIASTISYL 257
            G++H +                     ++  E+ + TG    LKI+      +AS++  L
Sbjct: 295  GVMHKMRGAFFFLLQTEDGDLFKVTIDMVEDEQGQPTGDVSRLKIKYFDTVPVASSLCIL 354

Query: 258  DNAVVYIGSSYGDS---------------QLIKLNLQPDAKGSY------------VEVL 290
             +  +++ +  G+                + I  N  P+    Y            + + 
Sbjct: 355  KSGYLFVTAESGNHHFYQFEKLGDDDEELEFISDNFSPEPDAPYEPVFFHPRGAENISLA 414

Query: 291  ERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KG 349
            +   ++ P +   V +L      Q+ T  G     + + + +G+ ++E    EL  + + 
Sbjct: 415  QSVPSMNPTLGCKVANLTDDDAPQIYTVCGTGARSTFKTLTHGLEVSEIVESELPSVPEA 474

Query: 350  MWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQL 409
            +W+ +  T D +D ++++SF + T +L++   + +EE    GF S T+TL       + L
Sbjct: 475  VWTTKIRTGDEYDAYIILSFSNGTLVLSIG--ETVEEVTDTGFLSSTRTLAVQLLGEDAL 532

Query: 410  VQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGIL 468
            +QV    +R + +  R   NEW +P   ++  AT N  QV +A   G +VY E+  DG L
Sbjct: 533  IQVHPKGIRHIQAGGR--INEWPAPQHRTIVAATTNERQVAVALSSGEIVYFEMDVDGSL 590

Query: 469  TEV-KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGG 526
             E  +  ++   ++CL +  + E    S   AVG   D +VRI SL PD  L  K     
Sbjct: 591  AEYDEKKEMSGTVTCLSLGEVPEGRVRSSFLAVGC-DDSTVRILSLDPDSTLENKSVQAL 649

Query: 527  EIIPRSVLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITL 580
               P ++ + A    S      YL   L  G  L  +++  TGELTD +   LG + + L
Sbjct: 650  TSAPSALSIMAMADSSSGGSTLYLHIGLYSGVYLRTVIDEITGELTDTRTRFLGAKGVKL 709

Query: 581  RTFSSKNTTHVFAASDRPTVIYS--SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKE 638
               + ++ + + A S R  + YS    K    + +    +     FNS      +   + 
Sbjct: 710  FRVTVQDQSAILATSSRSWLGYSDQQTKAFTLTPLAYAPLEWGWSFNSEQCTQGMVGIQG 769

Query: 639  GELTIGTIDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFV--- 694
              L I TI+++   L   +IPL   PR +        F +    N   A  +    +   
Sbjct: 770  QHLRIFTIENLSNNLLFENIPLMYTPRNLVRHPTEPLFYVIESDNGVLASSTRQQLLNDP 829

Query: 695  -------RLLDDQTF----------------------EFISTYPLDTFEYGCSILSCSF- 724
                   ++L  + F                      E + T  L+  E   S+    F 
Sbjct: 830  AAVNGDAKILPPEEFGYPRAEGHWASCIQVVDPISSKEVVHTLELEENESAVSVCLAPFT 889

Query: 725  SDDSNVYYCVGTA---YVLPEENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVYSLNAF 779
            S +   +  VGTA    V P       G + ++ + E+G+ L+ I + + +    +L  F
Sbjct: 890  SQNDETFLVVGTAKDLVVAPRSY--NCGYVHIYRLQENGRELEFIHKTKMEAPPMALLPF 947

Query: 780  NGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHI--LALYVQTRGDFIVVGDLMKSI 837
             GKLL  +   ++LY     D G R+L  +      +  + + +QT+G  IV  D+ +S+
Sbjct: 948  QGKLLVGVEADLRLY-----DLGLRQLLRKAQALNVVPNILIGLQTQGSRIVCSDVQESV 1002

Query: 838  SLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDE 894
            + ++YKH E  + +   D    W S   ++D +   G +   N++ VR   K SE A +E
Sbjct: 1003 TYVVYKHLENRLIQFCDDSIHRWTSCTAMVDYETTAGGDKFGNIWLVRCPPKASEEADEE 1062

Query: 895  ERG---------------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTV 939
              G               RL+++  ++  +     +  +LV        G    +++  +
Sbjct: 1063 GSGLHLINERPYLQGTPNRLDLLAHFYTQDIPTSIQKTALV-------AGGRELLLWSGL 1115

Query: 940  NGVIGV-IASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGD 998
             G +G+ I  +  E   F + L+  LR     + G +H  +RS+      V  K  +DGD
Sbjct: 1116 QGTLGIFIPFVSREDVDFFQSLEQQLRTEDPPIAGRDHLAYRSY-----YVPVKGVIDGD 1170

Query: 999  LIESFLDLSRTRMDEISKTMNVSVEELCKRV-EELTRL 1035
            L E FL L R + + I+  ++ SV E+ +++ ++ TR 
Sbjct: 1171 LCERFLRLPRDKKETIAAELDRSVREVERKIGDQRTRF 1208


>gi|323447810|gb|EGB03719.1| hypothetical protein AURANDRAFT_72671 [Aureococcus anophagefferens]
          Length = 760

 Score =  214 bits (544), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 204/761 (26%), Positives = 349/761 (45%), Gaps = 79/761 (10%)

Query: 302 FCVVD-LERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP 360
            CVV+  + +GQG++V C GA   G+L IV +G+ +   AS     I  +  + SS  + 
Sbjct: 1   MCVVEPADAEGQGRIVACCGAGDAGTLHIVHDGLDVTGTASRSPTRITNLSLILSSRGE- 59

Query: 361 FDTFLVVSFISETR--ILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSV- 417
               ++VS  + T   ++A +       +  +GF     T++C + + +  VQVT  ++ 
Sbjct: 60  ---MVIVSTEAATHAFLVANDGSTTAFPSNPQGFDETISTIYCCN-LGDAAVQVTPRAIS 115

Query: 418 RLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLE 477
           +  +  +R     W+   G        N + +L++   G LV L   D  +  +  A L+
Sbjct: 116 KFYTGENRRKLKTWRPKDGRITAATGNNGAVILISLSNGKLVLL---DCDVEPIASAALD 172

Query: 478 YEISCLDINPIG----ENPSYSQIAAVGMWTDISVRIFSLPD--LNLITKEHLGGE---- 527
            E SC+ ++P G    ++   +   AV +W+D  V   S+ D  LN I    +  +    
Sbjct: 173 DEASCVALSPSGSASFKDLGMAHATAVSVWSD-PVSCVSVYDSILNKIHDPTITWQDKSS 231

Query: 528 -----IIPRSVLLCAFEGISYLLCALGDG--HLLNFLLNMKTGELTDRKKVSLGTQPITL 580
                   R+++L       YL CA GDG   +  F         T  + VSLG  P  +
Sbjct: 232 NEQTLQFVRALVLSEINEKPYLFCASGDGLVTVYQFACGNDVVTTTRLRAVSLGVLPARV 291

Query: 581 RTFSS--KNTTHVFAASDRPTVI----YSSNKKLLYSNVNLKEVSH-----MCPFNSAAF 629
               +  +    V   SDR  V+     S  + ++ S     E +      + P +  A 
Sbjct: 292 LILGAFDRMARSVLCFSDRACVVRRGCASHLQWVVASAKAFGETTEWRYYIVVPLHKKAT 351

Query: 630 PDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEES 689
              LA   +    I  ID  Q + + S+PL E P  ICH  QS  FA+C++ ++    + 
Sbjct: 352 TAILACGSQ--FAICKIDANQGIKVVSVPLAEQPLCICHDLQSHLFAVCTIDHREGDNQG 409

Query: 690 EMHFVRLLDDQTFEFISTYPLDTFE--YGCSILS---CSFSDDSNVYYCVGTAYVLPEEN 744
            + F+R  D+  +  +    L+  E    CSI+S    S   D   ++ VGTA+   E +
Sbjct: 410 VIRFLR--DEAPYNDVHREALEPLEIPLCCSIISLDSISTYKDQRAHFVVGTAFAAQEND 467

Query: 745 -EPTKGRILVFIVEDGKL--QLIAEKETKGAVYSLNAFNGKLL-AAINQKIQLYKWMLRD 800
            EP  GR+++F      +   ++   E  GAVY + A    LL  A+N  I +Y  ++RD
Sbjct: 468 FEPCSGRMIIFRSGQANVAPSVLFFVEANGAVYDVAAMRASLLVCAVNHAIHIYDPVVRD 527

Query: 801 DGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANW 860
           +    L+    + G ++AL VQ  G+ IVVGD+M+S++LL    ++  I E A DYN NW
Sbjct: 528 NRRGHLKPRASYDGLVVALKVQCYGNLIVVGDMMRSVTLLNLIRQKMIIVEVACDYNTNW 587

Query: 861 MSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLV 920
           + A+E++ D  ++ A+ + +L  + ++  G +D+    LE   + HLG+ V  F  GS++
Sbjct: 588 VCALEVIGDGSFIIADASGSLVAL-ESLYGNSDKGY-FLESRAKMHLGDVVTCFARGSIM 645

Query: 921 MR--LPDSDVGQIPT-VIFGT-----------------VNGVIGVIASLPHEQYLFLEKL 960
            +    +  V ++ T +IFG                  V+G IG I S+    +  LE+L
Sbjct: 646 TQDDWRNPKVSKVATPLIFGCVTSSRVLVLTPLIARYQVSGAIGCIVSIDDVTHSLLERL 705

Query: 961 QTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAK---NFLDGD 998
              L +   GVG  +HE +++ +N   T +A    +F+D +
Sbjct: 706 SKVLLEFHSGVGDFDHETFQALHNNVATCNAAPMDDFIDAN 746


>gi|350630003|gb|EHA18376.1| hypothetical protein ASPNIDRAFT_38018 [Aspergillus niger ATCC 1015]
          Length = 1219

 Score =  213 bits (543), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 281/1172 (23%), Positives = 486/1172 (41%), Gaps = 169/1172 (14%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
            ++G I +L  FR  G ++D++ I ++  +  ++++   S     R   +   + G R   
Sbjct: 64   VFGIIRSLAAFRLAGSSKDYIIIGSDSGRITIIEY-VPSQNRFNRIHLETFGKSGVRRVI 122

Query: 65   NGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YG 119
             GQ   +DP  R  +I     + L  V+  +++ +L  +  +   + Q L         G
Sbjct: 123  PGQYLAVDPKGRACIIASVEKNKLVYVLNRNSQAELTISSPLEAHKPQTLVFAMTALDVG 182

Query: 120  CAKPTIVVLYQDNKDA-------------RHVKTYEVALKDKDFVEGPWSQNNLDNGADL 166
               P    L  D  ++             +H+  YE+ L     V   WS + +D  A L
Sbjct: 183  YENPVFAALEVDYSESDQDPTGQAFDELEKHLVYYELDLGLNHVVRK-WS-DPVDRTASL 240

Query: 167  LIPVPPPL---CGVLIIGEETIVYCSAN--AFKA-IPIRP----------SITKAYGRVD 210
            L  VP       GVL+  E+++ Y  +N  AF+  IP R           SIT       
Sbjct: 241  LFQVPGGADGPSGVLVCAEDSVTYRHSNQDAFRVPIPRRSGATENPERKRSITAGVMHKM 300

Query: 211  ADGSRYLLGDHAGLLHLLVITHEKE---KVTG----LKIELLGETSIASTISYLDNAVVY 263
                 +LL    G L  L I   ++   ++TG    LKI+      +AS++  L +  +Y
Sbjct: 301  RGAFFFLLQTEDGDLFKLTIDMVEDDNGQLTGEVKRLKIKYFDTVPLASSLLILKSGFLY 360

Query: 264  IGSSYGDSQLIKLNLQPD--------------------------AKGS-YVEVLERYVNL 296
            + S  G+    +     D                           +G+  + ++E   +L
Sbjct: 361  VASEAGNHHFYQFEKLGDDDEEIEFTSESFSADPSVPCDPIYFRPRGAENLNLVETINSL 420

Query: 297  GPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSLRS 355
             P++D  VV++      Q+ T SGA    S R +++G+ ++E    EL  +   +W+ + 
Sbjct: 421  NPLIDSKVVNITEDDAPQIYTISGAGARSSFRTLKHGLEVSEIVESELPSVPSAVWTTKL 480

Query: 356  STDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSG 415
            +  D FD ++++SF + T +L++   + +EE    GF S   TL       + L+Q+   
Sbjct: 481  TRSDEFDAYIILSFANGTLVLSIG--ETVEEVTDTGFLSTAPTLAVQQLGEDSLIQIHPR 538

Query: 416  SVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTEV-KH 473
             +R + +  R   NEW +P   S+  A  N  QV +A   G +VY E+  DG L E  + 
Sbjct: 539  GIRHILADRR--VNEWPAPQHRSIVAAATNERQVAVALSSGEIVYFEMDSDGSLAEYDER 596

Query: 474  AQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPRS 532
             Q+   ++CL +  + E    S   AVG   D +VRI SL PD  L  K        P +
Sbjct: 597  RQMSGTVTCLSLGEVPEGRVRSSFLAVGC-DDSTVRILSLDPDSTLENKSVQALTAAPSA 655

Query: 533  VLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSK 586
            + + +    S      YL   L  G  L  +L+  TGEL+D +   LG++P+ L   S K
Sbjct: 656  LNIMSMADSSSGGTTLYLHIGLYSGVYLRTVLDEVTGELSDTRTRFLGSKPVKLFQVSVK 715

Query: 587  NTTHVFAASDRPTVIYS--SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIG 644
              T V   S RP + YS    K  + + ++   +     F+S    + +   +   L I 
Sbjct: 716  GQTAVLGLSSRPWLGYSDVQTKGFMLTPLDYVGLEWGWNFSSEQCVEGMVGIQGQNLRIF 775

Query: 645  TIDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVR-------- 695
            +I+ +   +  +SIPL   PRR     +   F +    N   A  +    +         
Sbjct: 776  SIEKLDNNMLQQSIPLSYTPRRFLKHPEQPLFYVIESDNNVLAPSTRAKLLEDSKSRGGD 835

Query: 696  --LLDDQTFEF----------------------ISTYPLDTFEYGCSILSCSF-SDDSNV 730
              +L  + F +                      + T  L+  E   SI +  F S D   
Sbjct: 836  ETVLPPEDFGYPRATGHWASCIQVVDPLDAKAVVHTIELEENEAAISIAAVPFTSQDDET 895

Query: 731  YYCVGTAYVLPEENEP--TKGRILVF-IVEDGK-LQLIAEKETKGAVYSLNAFNGKLLAA 786
            +  VGTA  +   N P    G I ++   EDG+ L+ I + + +    +L  F G+L+A 
Sbjct: 896  FLVVGTAKDM-TVNPPGSAGGYIHIYRFQEDGRELEFIHKTKVEEPPLALLGFQGRLVAG 954

Query: 787  INQKIQLYKWMLRDDGTRELQSECGH---HGHILALYVQTRGDFIVVGDLMKSISLLIYK 843
            I   +++Y     D G ++L  +C        I+ L  QT+G  IVV D+ +S++ ++YK
Sbjct: 955  IGSLLRIY-----DLGMKQLLRKCQAPVVPKTIVGL--QTQGSRIVVSDVRESVTYVVYK 1007

Query: 844  HEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDE------ 894
            ++E  +     D  + W +A  ++D +   G +   NL+ +R   K SE A ++      
Sbjct: 1008 YQENVLIPFVDDSVSRWTTATTMVDYETTAGGDKFGNLWLLRCPKKTSEEADEDGSGAHL 1067

Query: 895  --ERG-------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGV 945
              ERG       RLE++   +  +         LV        G    +++    G IG+
Sbjct: 1068 IHERGYLQGTPNRLELMIHVYTQDIPTSLHKTQLV-------AGGRDILVWTGFQGTIGM 1120

Query: 946  IAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFL 1004
            +   +  E   F + L+  L      + G +H  +RS+    K V     +DGDL E + 
Sbjct: 1121 LVPFIGREDVDFFQNLEMQLAAQHPPLAGRDHLIYRSYYAPVKGV-----IDGDLCEMYF 1175

Query: 1005 DLSRTRMDEISKTMNVSVEELCKRVEELTRLH 1036
             L       I+  ++ SV E+ +++  + RLH
Sbjct: 1176 LLPNDTKMMIAAELDRSVREIERKISCIYRLH 1207


>gi|358391805|gb|EHK41209.1| hypothetical protein TRIATDRAFT_135379 [Trichoderma atroviride IMI
            206040]
          Length = 1212

 Score =  213 bits (543), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 285/1177 (24%), Positives = 491/1177 (41%), Gaps = 174/1177 (14%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW---DAESSELITRAMGDVSD 57
            +L   I+G I +L  FR  G  +D+L +AT+  +  ++++   +   S L     G    
Sbjct: 59   LLSHDIFGIIRSLAAFRLAGSNKDYLILATDSGRITIIEYLPRENRFSRLHLETFGKSGV 118

Query: 58   RIGRPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIK 115
            R   P   G+    DP  R  L+     + L  V+  +++ +L  +  +   +  VL I 
Sbjct: 119  RRVIP---GEYLACDPKGRACLVASIEKNKLVYVLNRNSQAELTISSPLEAHKPGVLVIS 175

Query: 116  FL---YGCAKPTIVVLYQD----NKDA-----RHVKT----YEVALKDKDFVEGPWSQNN 159
             +    G A P    L  D    ++D+     R ++T    YE+ L     V   WS+  
Sbjct: 176  MVALDVGYANPVFAALEIDYTEVDQDSTGEALRELETQLVYYELDLGLNHVVRK-WSEP- 233

Query: 160  LDNGADLLIPVP-----PPLCGVLIIGEETIVYCSAN--AFKA-IPIRPSITK--AYGRV 209
            +D+ A LL  VP     P   GVL+ GEE I Y  +N  AF+  IP R   T+  +  R 
Sbjct: 234  VDSTASLLFQVPGGNDGP--SGVLVCGEENITYRHSNQEAFRVPIPRRRGATEDPSRKRT 291

Query: 210  DADGSRYLLGDHAGLLHLLVITH--------------EKEKVTG----LKIELLGETSIA 251
               G  + L   AG    L+ T               E+   TG    LKI+       A
Sbjct: 292  IVSGVMHKLKGSAGAFFFLLQTEDGDMFKVTIDMVEDEEGNTTGEVRRLKIKYFDTVPAA 351

Query: 252  STISYLDNAVVYIGSSYGD---SQLIKL------------NLQPDAKGSYVEV------- 289
            S +  L +  +Y+ S +G+    Q  KL            +   DA+ SY  V       
Sbjct: 352  SNLCILKSGFLYVASQFGNFSFYQFEKLGDDDEELEFTSDDFPVDAQASYTPVYFYPRPL 411

Query: 290  -----LERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVEL 344
                 +E   ++ P++D  + +L  +   Q+ T  G     + R +++G+ INE  S EL
Sbjct: 412  ENLVLVESIPSMNPLLDCKIANLTGEDAPQIYTVCGNGARSTFRTLKHGLEINEIVSSEL 471

Query: 345  QGI-KGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHD 403
             GI   +W+L+ +  + +D ++V+SF + T +L++   + +EE    GF +   TL    
Sbjct: 472  PGIPSAVWTLKLNRSEQYDAYIVLSFTNGTLVLSIG--ETVEEVSDSGFLTSVPTLAAQL 529

Query: 404  AIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI 463
               + L+QV    +R + +      N+W +P   S+  A+ NA QV +A   G +VY E+
Sbjct: 530  LGDDGLIQVHPKGIRHIRNGQV---NQWDAPQHRSIVAASTNAHQVAIALSSGEIVYFEM 586

Query: 464  -GDGILTEVKHAQLEY-EISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLIT 520
              DG L E    +  +  ++CL +  + E    S   AVG   D +VRI SL P+  L  
Sbjct: 587  DDDGSLAEYDDKKEMFGTVTCLSLGEVPEGRLRSSFLAVGC-DDCTVRILSLDPESTLEN 645

Query: 521  KEHLGGEIIPRSVLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLG 574
            K        P S+ + + E  S      YL   L  G  L  +L+  TGELTD ++  LG
Sbjct: 646  KSVQALTAAPTSLAIISMEDSSSGGSTLYLHIGLYSGVYLRTVLDEVTGELTDTRQKFLG 705

Query: 575  TQPITLRTFSSKNTTHVFAASDRPTVIYSS--NKKLLYSNVNLKEVSHMCPFNSAAFPDS 632
             + + L   + +  T V   S RP + Y+    K  + + ++  ++     F S    + 
Sbjct: 706  PKQVRLFQVTVQGRTCVLGLSSRPWLGYADPITKSFVVTPLSYVDLEWGWNFTSEQCEEG 765

Query: 633  LAIAKEGELTIGTIDDIQKLHIR-SIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEM 691
            +   +   L I ++D +    I+ SIPL   P+++    +   F +    N + + +   
Sbjct: 766  IVGIQGQTLRIFSVDRLGDTLIQNSIPLTYTPKKMVKHPEHPLFYVIEADNHTLSPDLCA 825

Query: 692  HFV----------RLLDDQTF-----------------------EFISTYPLDTFEYGCS 718
              +          ++L  + F                       + +    LD  E   S
Sbjct: 826  KLLADPARVNGDAKVLSPEEFGHPRGNRRWASCISVVDPLAEDGQALQKIDLDENEAAVS 885

Query: 719  ILSCSF-SDDSNVYYCVGTA---YVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVY 774
            +   +F S D+  +  VGT     V P         I  F  E   L  I + + +    
Sbjct: 886  LAIVTFASQDNETFLVVGTGKDMVVNPRSFSDAYVHIYRFEQEGRGLVFIHKTKVEEPPM 945

Query: 775  SLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLM 834
            ++  F G++L  I + +++Y   +R    R+ Q+E       L   + T+G+ I+VGD+ 
Sbjct: 946  AIIPFQGRVLVGIGKILRIYDLGMR-QLLRKTQAEVAPQ---LINSLSTQGNRIIVGDVQ 1001

Query: 835  KSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGA 891
            + I+ ++YK     +     D  A W +   ++D +   G +   N+F VR   K SE A
Sbjct: 1002 QGITYVVYKQTTNKLIPFVDDTVARWTTCSTMVDYETVAGGDKFGNIFVVRSPQKASEEA 1061

Query: 892  TDEERG---------------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIF 936
             +E+ G               RL+++   +  +        SLV       VG    +++
Sbjct: 1062 DEEQAGLHLLNARDYLHGRSHRLDLMCHLYTQDIPTSITKTSLV-------VGGQDVLLW 1114

Query: 937  GTVNGVIGV-IASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFL 995
              + G IGV I  +  E   F + L+ +LR     + G +H  +RS+    K V     +
Sbjct: 1115 SGLMGTIGVLIPFVTREDTDFFQSLELHLRAEDPPLAGRDHLMYRSYYAPVKGV-----I 1169

Query: 996  DGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            DGDL E +  L   +   I+  ++ SV E+ +++ ++
Sbjct: 1170 DGDLCERYTLLPNDKKQMIAAELDRSVREIERKISDI 1206


>gi|47087273|ref|NP_998668.1| splicing factor 3B subunit 3 [Danio rerio]
 gi|28502779|gb|AAH47171.1| Splicing factor 3b, subunit 3 [Danio rerio]
          Length = 1217

 Score =  213 bits (543), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 264/1190 (22%), Positives = 491/1190 (41%), Gaps = 189/1190 (15%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            +L + ++G + +L  FR  G  +D++ + ++  +  +L++   S  +  +   +   + G
Sbjct: 53   LLTMEVFGVVRSLMAFRLTGGTKDYVVVGSDSGRIVILEYHP-SKNMFEKIHQETFGKSG 111

Query: 61   -RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
             R    GQ   +DP  R  ++G      L  ++  D   +L  +  +   +   L    +
Sbjct: 112  CRRIVPGQFLAVDPKGRAVMMGATEKQKLVYILNRDAAARLTISSPLEAHKANTLVYHVV 171

Query: 118  ---YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNG 163
                G   P    L  D ++A +  T E A   +    F E     N+        L+  
Sbjct: 172  GVDVGFENPMFACLEMDYEEADNDPTGEAAANTQQTLTFYELDLGLNHVVRKYSEALEEH 231

Query: 164  ADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR--- 215
             + LI VP     P   GVLI  E  I Y   N      IR  I +    +D D  R   
Sbjct: 232  GNFLITVPGGSDGP--SGVLICSENYITY--KNFGDQPDIRCPIPRRRNDLD-DPERGMI 286

Query: 216  --------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAV 261
                          +L     G +  + +  ++E VT ++++      +A+ +  L    
Sbjct: 287  FVCSATHKTKSMFFFLAQTEQGDIFKVTLETDEEMVTEIRMKYFDTIPVATAMCVLKTGF 346

Query: 262  VYIGSSYGDSQLIKLN------------------------LQPDAKGSYVEVLERYVNLG 297
            +++ S +G+  L ++                          QP    + V V E+  +L 
Sbjct: 347  LFVSSEFGNHYLYQIAHLGDDDEEPEFSSAMPLEEGDTFFFQPRPLKNLVLVDEQE-SLS 405

Query: 298  PIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSS 356
            PI+   + DL  +   Q+    G     +LR++R+G+ ++E A  EL G    +W++R  
Sbjct: 406  PIMSCQIADLANEDTPQLYVACGRGPGSTLRVLRHGLEVSEMAVSELPGNPNAVWTVRRH 465

Query: 357  TDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGS 416
             +D FD +++VSF++ T +L++   + +EE    GF   T TL C     + LVQV    
Sbjct: 466  VEDEFDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSLLGEDALVQVYPDG 523

Query: 417  VRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKHA 474
            +R + +  R   NEWK+P   ++     N  QV++A  GG LVY E+   G L E  +  
Sbjct: 524  IRHIRADKR--VNEWKTPGKKTIIRCAVNQRQVVIALTGGELVYFEMDPSGQLNEYTERK 581

Query: 475  QLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPD---LNLITKEHLGGEIIPR 531
            ++  ++ C+ +  +      S+  AVG+  D +VRI SL     L  ++ + L  +  P 
Sbjct: 582  EMSADVVCMSLANVPPGEQRSRFLAVGL-VDNTVRIISLDPSDCLQPLSMQALPAQ--PE 638

Query: 532  SVLLCAFEGIS--------------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQP 577
            S+ +    G+               YL   L +G LL  +L+  TG+L+D +   LG++P
Sbjct: 639  SLCIVEMGGVEKQDELGEKGTICFLYLNIGLQNGVLLRTVLDPVTGDLSDTRTRYLGSRP 698

Query: 578  ITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAK 637
            + L     +    V A S R  + YS   +   + ++ + + +   F S   P+ +    
Sbjct: 699  VKLFRVRMQGQEAVLAMSSRSWLSYSYQSRFHLTPLSYETLEYASGFASEQCPEGIVAIS 758

Query: 638  EGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQE-----------QSRTFAICSLKNQS 684
               L I  ++ +  +  + + PL   PR+ + H E            + T A  + + Q 
Sbjct: 759  TNTLRILALEKLGAVFNQVAFPLQYTPRKFVIHPETNNLILIETDHNAYTEATKAQRKQQ 818

Query: 685  CAEE-------------SEM-------------------------HFVRLLDDQTFEFIS 706
             AEE             +EM                           VRL++      + 
Sbjct: 819  MAEEMVEAAGEDERELAAEMAAAFLNENLPEAIFGAPKAGSGQWASLVRLINPIQGNTLD 878

Query: 707  TYPLDTFEYGCSILSCSFSDDSNVYYC-VGTAY-VLPEENEPTKGRILVFIVEDG--KLQ 762
               L+  E   S+  C F +  + +Y  VG A  ++        G I  + +  G  KL+
Sbjct: 879  LVQLEQNEAAFSVAICRFLNGGDDWYVLVGVARDMILNPRSVGGGYIYTYRIVGGGDKLE 938

Query: 763  LIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSEC-GHHGHILALYV 821
             + +   +    ++  F G++L  + + +++Y     D G ++L  +C   H   L   +
Sbjct: 939  FLHKTPVEDVPLAIAPFQGRVLVGVGKLLRIY-----DLGKKKLLRKCENKHVPNLVTGI 993

Query: 822  QTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNL 881
             T G  ++V D+ +S+  + Y+  E  +   A D    W++   +LD D    A+   N+
Sbjct: 994  HTIGQRVIVSDVQESLFWVRYRRNENQLIIFADDTYPRWITTACLLDYDTMASADKFGNI 1053

Query: 882  FTVR--KNSEGATDE---------ERGRL-------EVVGEYHLGEFVNRFRHGSLVMRL 923
              VR   N+    DE         +RG L       E++  YH+GE V   +  +L+   
Sbjct: 1054 CVVRLPPNTSDDVDEDPTGNKALWDRGLLNGASQKAEIIINYHIGETVLSLQKTTLI--- 1110

Query: 924  PDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSF 982
                 G   ++++ T++G IG++     HE + F + L+ ++R     + G +H  +RS+
Sbjct: 1111 ----PGGSESLVYTTLSGGIGILVPFTSHEDHDFFQHLEMHMRSEFPPLCGRDHLSFRSY 1166

Query: 983  NNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
                     KN +DGDL E F  +   +   +S+ ++ +  E+ K++E++
Sbjct: 1167 -----YFPVKNVIDGDLCEQFNSMDPHKQKSVSEELDRTPPEVSKKLEDI 1211


>gi|342320507|gb|EGU12447.1| Pre-mRNA-splicing factor RSE1 [Rhodotorula glutinis ATCC 204091]
          Length = 1212

 Score =  213 bits (543), Expect = 4e-52,   Method: Compositional matrix adjust.
 Identities = 282/1186 (23%), Positives = 480/1186 (40%), Gaps = 187/1186 (15%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            +++  ++G+I +L  F+  G ++D++ I ++  +  VL++D  ++ +  +   +   R G
Sbjct: 55   IVEADVFGQIRSLTAFKLTGGSKDYIIIGSDSGRIVVLEYDPNNN-VFNKLHQETYGRSG 113

Query: 61   -RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLK-----EAFNIR--LEELQ 110
             R    GQ    DP  R  +IG      L  ++  D    L      EA   R  +  + 
Sbjct: 114  SRRIVPGQYLATDPKGRAVMIGAMEKSKLVYILNRDAAANLTISSPLEAHKPRAIIHSIV 173

Query: 111  VLDIKFLYGCAKPTIVVLYQDNKDARHVKT-------------YEVALKDKDFVEGPWSQ 157
             +D+ F      P    L  D  +A    T             YE+ L     V   WS+
Sbjct: 174  GVDVGF----ENPMFAALEVDYTEADQDPTGAAFDAAEKMLTYYELDLGLNHVVRK-WSE 228

Query: 158  NNLDNGADLLIPVP------------PPLCGVLIIGEETIVYCSANAFK---AIPIRPS- 201
               D  A+LL+ VP            P   GVL+  E+ I+Y    A +    IP R   
Sbjct: 229  AT-DPRANLLVQVPGGQNATMDRFDGP--SGVLVCCEDYIIYKHQGAKEHRVPIPKRAHP 285

Query: 202  ---------ITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIAS 252
                     IT A          +LL    G L  + I HE+E+VT LKI+       AS
Sbjct: 286  LADPERGVIITAAVMHKMKGAFFFLLQSEEGDLFKVTIEHEEEEVTALKIKYFDTVPSAS 345

Query: 253  TISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLG--------------- 297
            ++  L +  +++ + +G+ +L +     D           Y NLG               
Sbjct: 346  SLCILKSGFLFVATEFGNPRLYQFEKLGDDDDETEFSSTDYDNLGAGTDPLPPALFRPRE 405

Query: 298  --------------PIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVE 343
                          PI+D  V +   +   Q+    G     SLRI+R G+ + E  S E
Sbjct: 406  LQNLAIADEVESLAPILDAKVANYLGEDTPQIYAACGRGARSSLRILRQGLEVMEAVSSE 465

Query: 344  LQGIK-GMWS--LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLF 400
            L G    +W+  LR+     +D ++++SF++ T +LA+   D +EE    GF S   TL 
Sbjct: 466  LPGAPIAVWTTKLRAEGASSYDAYIILSFVNGTLVLAIG--DTIEEVSDTGFISSAPTLG 523

Query: 401  CHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVY 460
                  + L+Q+    +R + +  R   NEWK     ++  AT N+ QV++    G LVY
Sbjct: 524  VQQLGDDALLQIYPRGIRHILADKRV--NEWKVGARETIVCATTNSRQVVIGLSTGELVY 581

Query: 461  LEIG-DGILTEVKHAQ-LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNL 518
             E+  DG L E +  + +  EI  L I  + E    +   AVG   D +VRI SL   N 
Sbjct: 582  FELDMDGQLNEFQERKPMGAEILALSIAEVPEGRQRTPYLAVGC-ADSTVRIISLDPENT 640

Query: 519  ITKEHLGGEIIPRSVLLCA---------FEGISYLLCALGDGHLLNFLLNMKTGELTDRK 569
            +    L     P S ++ A         +    ++   L +G LL  +L+  TG L D +
Sbjct: 641  LDSLSLQALTAPPSSIVMAEITDASIDKYHATMFVNIGLNNGVLLRTVLDPLTGSLGDTR 700

Query: 570  KVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAF 629
               LG++P+ L     + +  + A S RP + Y+    L ++ +    + +   F++   
Sbjct: 701  TRFLGSRPVKLARVPVQGSPAILALSSRPWLNYAYRGILQFTPLIFDALDYAWSFSAELC 760

Query: 630  PDSLAIAKEGELTIGTIDDI-QKLHIRSIPLGEHPRRICHQEQS----------RTFAIC 678
            P+ L       L I T   + QK+    I L   PR++     S          RTF+  
Sbjct: 761  PEGLIGIVGNSLRIFTFPRLGQKVQQTVIDLSYTPRQLLTSPHSRLLYTVEADHRTFSPS 820

Query: 679  SL-KNQSCAEESEMHF-------------------------VRLLDDQTFEFISTYPLDT 712
            ++ K  S    +EM                           VR++D  T E +    L+ 
Sbjct: 821  AIQKTISDMRMAEMEVDEEVLNLDPKEFGLPRGPAGQWASCVRVIDPVTAETVFKVDLEQ 880

Query: 713  FEYGCSILSCSFSDDSN-VYYCVGTAY---VLPEENEPTKGRILVFIVEDGKLQLIAEKE 768
             E   S    +F    N V+  VGT     + P   +         + E  +L+L+ + E
Sbjct: 881  NEAAFSAAIVTFHSHPNEVFLVVGTGQDTSLAPRACKQAYLHTYKLLEEGRQLELLHKTE 940

Query: 769  TKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGH-ILALYVQTRGDF 827
                  +L AF G+L+A + + ++LY     D G ++L  +  + G   + + + T+G  
Sbjct: 941  VDDIPKALIAFQGRLVAGVGKALRLY-----DLGKKKLLRKAENKGFATMIMTLNTQGTR 995

Query: 828  IVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKN 887
            I+VGD  +S+   +YK  E  +   A D +  W +A  I+ D   + A + F  F V + 
Sbjct: 996  IIVGDAQESVYYALYKAPENRLLIFADDISPRWTTA-SIMVDYETVAAGDKFGNFFVNRL 1054

Query: 888  SEGAT----DEERG---------------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV 928
             +G +    D+  G               R  ++  YH+G+ +      +LV        
Sbjct: 1055 PKGVSSDVDDDPTGAGIMHEKPYLMGAPHRTHLLAHYHIGDIITSLHKVALV-------A 1107

Query: 929  GQIPTVIFGTVNGVIGVIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRS-FNNEK 986
            G    +++  + G +GV+   + +E   F   L+ +LR     + G  H  +RS +   K
Sbjct: 1108 GGRDLLVYTGLMGTVGVLVPFVSNEDVDFFTTLEMHLRSEAPSLCGREHLAYRSAYTPVK 1167

Query: 987  KTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
             TV      DGDL E +  L   +  +I+  +  +V E+ K+++ +
Sbjct: 1168 ATV------DGDLCEVYRSLPMAKQGQIAGELERTVSEVIKKLDNV 1207


>gi|388582014|gb|EIM22320.1| hypothetical protein WALSEDRAFT_60013 [Wallemia sebi CBS 633.66]
          Length = 1208

 Score =  213 bits (543), Expect = 4e-52,   Method: Compositional matrix adjust.
 Identities = 201/847 (23%), Positives = 378/847 (44%), Gaps = 91/847 (10%)

Query: 258  DNAVVYIGSSYGDSQLIKLN---------LQPDAKGSYVEVLERYVNLGPIVDFCVVDLE 308
            DN + Y  SSY ++ +  L            P    + V V +   +L PI+D  V  + 
Sbjct: 375  DNELEYSSSSYENNGMPSLENPLPIESAYFTPRGLDNLVPV-DEIQSLAPILDAKVQSIY 433

Query: 309  RQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSLRSSTDDPFDTFLVV 367
                 Q+ T SG     SLR++R+G+ + E  S EL     G+W+L+ +  D +D+ +V+
Sbjct: 434  AGDTPQIYTASGVGSRSSLRVMRHGLDVIEAVSSELPAPPNGIWTLKQNAQDMYDSLIVL 493

Query: 368  SFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSREL 427
            SF++ T +L +   + +EE    G  +   TL       + ++QV    +R + +  R  
Sbjct: 494  SFVNGTLVLGIG--ESIEEVSDTGLATSVSTLSVDQLGEDSMIQVFPQGIRHILNDKR-- 549

Query: 428  RNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTEVKHAQ-LEYEISCLDI 485
             NEWKSP    +  +T N+ QV +A   G LVY E+  +G L E +  + +E  ++ + I
Sbjct: 550  VNEWKSPSDTYITASTTNSRQVCVALSNGELVYFEMDNEGQLNEFQERKSMESTVTTMSI 609

Query: 486  NPIGENPSYSQIAAVGMWTDISVRIFSL-PD--LNLITKEHLGG--------EIIPRSVL 534
              + +        A+G   D ++RI SL P+  L +++ + +          EI+ +S  
Sbjct: 610  GEVPQGRQRCPYLALGC-DDQTIRIVSLDPENTLEVVSVQAVTAQPSSICVAEILDKS-- 666

Query: 535  LCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAA 594
            L  +    ++   L +G LL  +L+   G L D +   LG +P+ LR  +      V A 
Sbjct: 667  LDKYNPTLFVNIGLANGVLLRTVLDTVNGSLADTRTRFLGAKPVVLRRVTVDKQQAVMAL 726

Query: 595  SDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDI-QKLH 653
            S R  + Y+   ++ ++ +  + +  +  FN+   PD L    +  L I T+ ++ Q++ 
Sbjct: 727  STRTFLNYAHGDQMYFTPLLYEPLDQVSSFNAELCPDGLIGISDTVLRIFTLPNVGQRMK 786

Query: 654  IRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHF-------------------- 693
              S+ L   PR+I     +  F      +++ + E +                       
Sbjct: 787  QDSVALSYTPRKILLHPTAPLFLTIESDHRTISRERQAELLTSKGYNPEEHNFDAVQFGN 846

Query: 694  -----------VRLLDDQTFEFISTYPLDTFEYGCSILSCSFS-DDSNVYYCVGTA---Y 738
                       VR++D  T   ++   LD  E   S     ++  +   +  VGTA    
Sbjct: 847  VRMEAGNWASCVRMIDPVTLTTVNKVELDNNEAAFSAAFVQWAGHEDETHLVVGTAKDRM 906

Query: 739  VLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWML 798
            ++P+ ++    R+   + +D +L+L+ + +     Y+++AF G+LLA + + ++LY    
Sbjct: 907  MMPQSHKEAYLRVYK-VTQDSQLELLHKTDIDDVPYAIHAFKGRLLAGVGKALRLY---- 961

Query: 799  RDDGTRELQSECGHHGHILALY-VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYN 857
             D G + L  +C +      +  +   G  I VGD+ +S+S  +YK  E  +   A D  
Sbjct: 962  -DLGKKRLLRKCENKSFAAGIVNLNVVGSRIYVGDMQESVSFAVYKAPENRLLVFADDIM 1020

Query: 858  ANWMSAVEILDDDIYLGAENNFNLFTVR--KNSEGATDEER---GRLEVVGEYH----LG 908
            + W +    +D D   G +   N+F  R  K++    DE+    G L   G YH      
Sbjct: 1021 SRWTTTATPVDYDTVAGGDKFGNIFITRVDKSTSEWVDEDESGGGLLHARGLYHGAPNRS 1080

Query: 909  EFVNRFRHGSLVMRLPDSDV--GQIPTVIFGTVNGVIGVIASLPHEQYL-FLEKLQTNLR 965
            + +  F  G +V  +  S +  G    +++  ++G +G+I     +  + F+  L+ ++R
Sbjct: 1081 KLLAHFYVGDIVTSITKSQLSAGGRDVLVYTCLHGTVGMIIPFASKDDIEFMSTLELHMR 1140

Query: 966  KVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEEL 1025
            +    + G +H  +RS+      +  K F+DGDL E +  L  T+   I+  ++ +  E+
Sbjct: 1141 QESPSLVGRDHLGFRSY-----YIPCKAFVDGDLCELYASLPVTKQQAIANELDRTSGEV 1195

Query: 1026 CKRVEEL 1032
             K++E L
Sbjct: 1196 LKKIESL 1202


>gi|405970223|gb|EKC35151.1| Splicing factor 3B subunit 3 [Crassostrea gigas]
          Length = 1217

 Score =  213 bits (542), Expect = 4e-52,   Method: Compositional matrix adjust.
 Identities = 274/1196 (22%), Positives = 504/1196 (42%), Gaps = 201/1196 (16%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            +L V ++G I +L  FR  G ++D++ + ++  +  +L++ A+ + +  +   +   + G
Sbjct: 53   VLSVEVFGMIRSLMPFRLTGGSKDYVVVGSDSGRIVILEYIAQKN-IFEKVHQETFGKSG 111

Query: 61   -RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
             R    GQ   +DP  R  +IG      L  ++  D   +L  +  +   +   L    +
Sbjct: 112  CRRIVPGQYLAVDPKGRAVMIGAVEKQKLVYILNRDASARLTISSPLEAHKANTLVYHMV 171

Query: 118  ---YGCAKPTIVVLYQDNKDARHVKTYEVALKDK------------DFVEGPWSQNNLDN 162
                G   PT   L  D ++A    T E A   +            + V   +S++ L+ 
Sbjct: 172  GVDVGFENPTFACLEIDYEEADMDHTGEGAKNTQQLLTYYELDLGLNHVVRKYSES-LEE 230

Query: 163  GADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR-- 215
             A+ LI VP     P   GVLI  E  I Y   N      IR  I +    +D D  R  
Sbjct: 231  HANFLISVPGGSDGP--SGVLICSENYITY--KNLGDQPDIRCPIPRRRNDLD-DPERGM 285

Query: 216  ---------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNA 260
                           +L     G +  + +  +++ VT + ++      +A+ +  L + 
Sbjct: 286  IFVCSATHKTKSMFFFLAQTEQGDIFKITLETDEDMVTEIILKYFDTVPVAANMCVLKSG 345

Query: 261  VVYIGSSYGDSQLIKLN------------------------LQPDAKGSYVEVLERYVNL 296
             ++I S +G+  L ++                           P    + V+V E   +L
Sbjct: 346  FLFIASEFGNHYLYQIAQLGDDDEEPRFSSASQLEEGEPHFFSPRPLKNLVQVDE-MDSL 404

Query: 297  GPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRS 355
             PI    + DL  +   Q+ T  G     +LR++R+G+ ++E A  EL G    +W+++ 
Sbjct: 405  SPITHCQIADLANEDTPQLYTLCGRGPRSTLRVLRHGLEVSEMAVSELPGNPNAVWTVKK 464

Query: 356  STDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSG 415
            + DD FD +++VSF++ T +L++   + +EE    GF   T TL C     + LVQ+   
Sbjct: 465  NIDDEFDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSQLGDDALVQIYPD 522

Query: 416  SVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD-GILTE-VKH 473
             +R + S  R   NEWK+P   S+     N  QV++A  GG LVY E+   G L E  + 
Sbjct: 523  GIRHIRSDKR--VNEWKTPGKKSIVKCAVNQRQVVIALTGGELVYFEMDPTGQLNEYTER 580

Query: 474  AQLEYEISCLDIN--PIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPR 531
             ++  E+ C+ +   P+GE     +  AVG+ +D +VRI SL   + ++   L  + +P 
Sbjct: 581  KEMSSEVVCMALGRVPVGEQ--RCRFLAVGL-SDNTVRIISLDPSDCLSP--LSMQALPE 635

Query: 532  -SVLLCAFE----------------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLG 574
             S  LC  E                G  +L   L +G LL  +L+  TG+L+D +   LG
Sbjct: 636  PSESLCIVEMGGTEAKEETGEPGTVGGLFLNIGLQNGVLLRTVLDNVTGDLSDTRTRYLG 695

Query: 575  TQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLA 634
             +P+ L   S + +  V A S R  + Y+   +   + ++ + + +   F S   P+ + 
Sbjct: 696  VRPVKLFRISMQGSEAVLAMSSRSWLSYTYQNRFHLTPLSYETLEYASGFASEQCPEGIV 755

Query: 635  IAKEGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQEQSRTFAICSLKN---------- 682
                  L I  ++ +  +  + S PL   PR+ + H E +    I +  N          
Sbjct: 756  AISTNTLRILALEKLGAVFNQVSHPLQHTPRKFVIHPETNNLIVIETEYNAYTEATKMER 815

Query: 683  ----------QSCAEESEM-----------------------------HFVRLLDDQTFE 703
                         AEE EM                               +R+++  +  
Sbjct: 816  KQQMAEEMVEMVSAEEREMAAEMAANFLNEERSESIFGAPKAGPGMWASIIRIINPISGN 875

Query: 704  FISTYPLDTFEYGCSILSCSFS---DDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDG- 759
             +    L+  E   SI    F+   DD  V   V    VL      T G + ++ +EDG 
Sbjct: 876  TLEKIQLEQNESVHSIALLKFASRGDDQFVLVGVARDLVL-NPRSLTGGFLYLYQLEDGG 934

Query: 760  -KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHG---H 815
              L+L+ +   +    ++ +F G++L  + + +++Y     D G +++  +C +     +
Sbjct: 935  ESLKLLHKTPVEDVPGAIASFQGRVLIGVGRYLRIY-----DIGKKKMLRKCENKSIPNY 989

Query: 816  ILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGA 875
            + A++  T G+ I+  D+  S   + YK +E  +   A D N  W++A   LD +   GA
Sbjct: 990  VSAIH--TVGNRIIASDVQDSFHFVRYKRQENQLIVFADDTNPRWITASCSLDYNTVAGA 1047

Query: 876  ENNFNLFTVRKNSEGATDE-----------ERGRL-------EVVGEYHLGEFVNRFRHG 917
            +   N+  +R   + + D            +RG L       +V+  +HLGE V   +  
Sbjct: 1048 DKFGNITIIRLPGDVSDDVDEDPTGNKALWDRGLLNGASQKADVMMNFHLGEMVTSLQKA 1107

Query: 918  SLVMRLPDSDVGQIPTVIFGTVNGVIGV-IASLPHEQYLFLEKLQTNLRKVIKGVGGLNH 976
            +L+        G   ++++ T++G IG+ +A   HE + F + L+  +R     + G +H
Sbjct: 1108 TLI-------PGGSESLVYTTLSGGIGMLVAFTSHEDHDFFQHLEMYMRTEHPPLCGRDH 1160

Query: 977  EQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
              +RS+         KN +DGDL E F  +   +   +S+ ++ +  E+ K++E++
Sbjct: 1161 LSFRSY-----YYPVKNVIDGDLCEMFNSMDPVKQKSVSEELDRTPSEVSKKLEDI 1211


>gi|432114152|gb|ELK36185.1| Splicing factor 3B subunit 3 [Myotis davidii]
          Length = 1217

 Score =  213 bits (542), Expect = 4e-52,   Method: Compositional matrix adjust.
 Identities = 271/1193 (22%), Positives = 496/1193 (41%), Gaps = 195/1193 (16%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            +L V ++G I +L  FR  G  +D++ + ++  +  +L++   S  +  +   +   + G
Sbjct: 53   LLTVEVFGVIRSLMAFRLTGGTKDYIVVGSDSGRIVILEYQP-SKNMFEKIHQETFGKSG 111

Query: 61   -RPTDNGQIGIIDPDCRLIGLHLYDG--LFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
             R    GQ   +DP  R + +   +   L  ++  D   +L  +  +   +   L    +
Sbjct: 112  CRRIVPGQFLAVDPKGRAVMISAIEKQKLVYILNRDAAARLTISSPLEAHKANTLVYHVV 171

Query: 118  ---YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNG 163
                G   P    L  D ++A +  T E A   +    F E     N+        L+  
Sbjct: 172  GVDVGFENPMFACLEMDYEEADNDPTGEAAANTQQTLTFYELDLGLNHVVRKYSEPLEEH 231

Query: 164  ADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR--- 215
             + LI VP     P   GVLI  E  I Y   N      IR  I +    +D D  R   
Sbjct: 232  GNFLITVPGGSDGP--SGVLICSENYITY--KNFGDQPDIRCPIPRRRNDLD-DPERGMI 286

Query: 216  --------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAV 261
                          +L     G +  + +  +++ VT ++++      +A+ +  L    
Sbjct: 287  FVCSATHKTKSMFFFLAQTEQGDIFKITLETDEDMVTEIRLKYFDTVPVAAAMCVLKTGF 346

Query: 262  VYIGSSYGDSQLIKLN------------------------LQPDAKGSYVEVLERYVNLG 297
            +++ S +G+  L ++                          QP    + V +++   +L 
Sbjct: 347  LFVASEFGNHYLYQIAHLGDDDEEPEFSSAMPLEEGDTFFFQPRPLKNLV-LVDELDSLS 405

Query: 298  PIVDFC-VVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRS 355
            PI+ FC + DL  +   Q+    G     SLR++R+G+ ++E A  EL G    +W++R 
Sbjct: 406  PIL-FCQIADLANEDTPQLYVACGRGPRSSLRVLRHGLEVSEMAVSELPGNPNAVWTVRR 464

Query: 356  STDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSG 415
              +D FD +++VSF++ T +L++   + +EE    GF   T TL C     + LVQV   
Sbjct: 465  HIEDEFDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSLLGDDALVQVYPD 522

Query: 416  SVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKH 473
             +R + +  R   NEWK+P   ++     N  QV++A  GG LVY E+   G L E  + 
Sbjct: 523  GIRHIRADKR--VNEWKTPGKKTIVKCAVNQRQVVIALTGGELVYFEMDPSGQLNEYTER 580

Query: 474  AQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIP-RS 532
             ++  ++ C+ +  +      S+  AVG+  D +VRI SL   + +  + L  + +P + 
Sbjct: 581  KEMSADVVCMSLANVPPGEQRSRFLAVGL-VDNTVRIISLDPSDCL--QPLSMQALPAQP 637

Query: 533  VLLCAFE----------------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQ 576
              LC  E                G  YL   L +G LL  +L+  TG+L+D +   LG++
Sbjct: 638  ESLCIVEMGGTEKQDELGERGSIGFLYLNIGLQNGVLLRTVLDPVTGDLSDTRTRYLGSR 697

Query: 577  PITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIA 636
            P+ L     +    V A S R  + YS   +   + ++ + +     F S   P+ +   
Sbjct: 698  PVKLFRVRMQGQEAVLAMSSRSWLSYSYQSRFHLTPLSYETLEFASGFASEQCPEGIVAI 757

Query: 637  KEGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQEQSR-----------TFAICSLKNQ 683
                L I  ++ +  +  + + PL   PR+ + H E +            T A  + + Q
Sbjct: 758  STNTLRILALEKLGAVFNQVAFPLQYTPRKFVIHPESNNLIIIETDHNAYTEATKAQRKQ 817

Query: 684  SCAEE-------------SEM-------------------------HFVRLLDDQTFEFI 705
              AEE             +EM                           +R+++      +
Sbjct: 818  QMAEEMVEAAGEDERELAAEMAAAFLNENLPESIFGAPKAGNGQWASVIRVMNPIQGNTL 877

Query: 706  STYPLDTFEYGCSILSCSFSDDSNVYYC-VGTAYVLPEENEPTKGRILVF--IVEDG-KL 761
                L+  E   S+  C F++    +Y  VG A  L        G  +    +V +G KL
Sbjct: 878  DLVQLEQNEAAFSVAVCRFANTGEDWYVLVGVAKDLILNPRSVAGGFVYTYKLVNNGEKL 937

Query: 762  QLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY- 820
            + + +   +    ++  F G++L  + + +++Y     D G ++L  +C  + HI A Y 
Sbjct: 938  EFLHKTPVEEVPAAIAPFQGRVLIGVGKLLRVY-----DLGKKKLLRKC-ENKHI-ANYI 990

Query: 821  --VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENN 878
              +QT G  ++V D+ +S   + YK  E  +   A D    W++   +LD D   GA+  
Sbjct: 991  SGIQTIGHRVIVSDVQESFIWVRYKRNENQLIIFADDTYPRWVTTASLLDYDTVAGADKF 1050

Query: 879  FNLFTVR--KNSEGATDE---------ERGRL-------EVVGEYHLGEFVNRFRHGSLV 920
             N+  VR   N+    DE         +RG L       EV+  YH+GE V   +  +L+
Sbjct: 1051 GNICVVRLPPNTNDEVDEDPTGNKALWDRGLLNGASQKAEVIMNYHVGETVLSLQKTTLI 1110

Query: 921  MRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQW 979
                    G   ++++ T++G IG++     HE + F + ++ +LR     + G +H  +
Sbjct: 1111 -------PGGSESLVYTTLSGGIGILVPFTSHEDHDFFQHVEMHLRSEHPPLCGRDHLSF 1163

Query: 980  RSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            RS+         KN +DGDL E F  +   +   +S+ ++ +  E+ K++E++
Sbjct: 1164 RSY-----YFPVKNVIDGDLCEQFNSMEPNKQKNVSEELDRTPPEVSKKLEDI 1211


>gi|367050506|ref|XP_003655632.1| hypothetical protein THITE_2119532 [Thielavia terrestris NRRL 8126]
 gi|347002896|gb|AEO69296.1| hypothetical protein THITE_2119532 [Thielavia terrestris NRRL 8126]
          Length = 1211

 Score =  213 bits (541), Expect = 7e-52,   Method: Compositional matrix adjust.
 Identities = 270/1167 (23%), Positives = 483/1167 (41%), Gaps = 165/1167 (14%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
            I+G I +L  FR  G  +D++ +AT+  +  ++++  + +      +        R    
Sbjct: 64   IFGIIRSLASFRLAGSHKDYIILATDSGRIAIVEYLPKENRFSRIHLETFGKSGVRRVIP 123

Query: 66   GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YGC 120
            GQ    DP  R  LI     + L  V+  + + +L  +  +   +  VL +  +    G 
Sbjct: 124  GQYLAADPKGRACLIASIEKNKLVYVLNRNAQAELTISSPLEAHKPGVLVLSLVALDVGY 183

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDK------------DFVEGPWSQNNLDNGADLLI 168
            A P    L  D  +A    T E   + +            + V   WS + +D  + LL 
Sbjct: 184  ANPVFAALEIDYSEADQDPTGEAGREAEAQLVYYELDLGLNHVVRKWS-DTVDPTSSLLF 242

Query: 169  PVP-----PPLCGVLIIGEETIVYCSAN--AFK-AIPIRPSITK--AYGRVDADGSRYLL 218
             VP     P   GVL+ GEE + Y  +N  AF+  IP R   T+     R    G  + L
Sbjct: 243  QVPGGNDGP--SGVLVCGEENVTYRHSNQEAFRVPIPRRRGATEDPQRKRTIVSGVMHKL 300

Query: 219  GDHAGLLHLLVITHEKE------------------KVTGLKIELLGETSIASTISYLDNA 260
               AG    L+ T + +                  +V  LKI+      IA ++  L + 
Sbjct: 301  KGSAGAFFFLLQTDDGDLFKVTIDMVEDSEGNPTGEVRRLKIKYFDTIPIAHSLCILKSG 360

Query: 261  VVYIGSSYGDSQLIKL---------------NLQPDAKGSY------------VEVLERY 293
             ++  S +G+    +                +   D + SY            + ++E  
Sbjct: 361  FLFAASEFGNHHFYQFEKLGDDDEEPEFTSDDFPTDPRASYNPVYFHPRPLENLALVESL 420

Query: 294  VNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWS 352
             ++ P+VD  V +L  +   Q+    G     + R++++G+ ++E  + EL G    +W+
Sbjct: 421  NSMNPLVDCKVANLTGEDAPQIYAVCGNRARSTFRMLKHGLEVSEIVASELPGTPSAVWT 480

Query: 353  LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
             + +  D +D ++V+SF + T +L++   + +EE    GF +   TL         L+QV
Sbjct: 481  TKLTKYDEYDGYIVLSFTNATLVLSIG--ETVEEVTESGFLTSVPTLAVQQLGEEGLIQV 538

Query: 413  TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTEV 471
                +R +        NEW +P   S+  A  N SQV++A   G +VY E+  DG L E 
Sbjct: 539  HPKGIRHIVQGRV---NEWPAPQHRSIVAAATNESQVVIALSSGEIVYFEMDADGSLAEY 595

Query: 472  -KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEII 529
             +  ++   ++ L +  + E  + S   AVG   D +VRI  L P+  L  K        
Sbjct: 596  DEKKEMSGTVTSLSLGKVPEGLTRSSFLAVGC-DDCTVRILGLDPESTLEMKSIQALTSA 654

Query: 530  PRSVLLCAFEGIS-----YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFS 584
            P S+ + + E  +     YL   L  G  L  +L+  TGELTD +   LG +P  L   S
Sbjct: 655  PSSLSIMSMEDSTGGNTLYLHIGLHSGVYLRTVLDEVTGELTDTRSKFLGPKPTKLFQVS 714

Query: 585  SKNTTHVFAASDRPTVIYSS--NKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELT 642
             +N   V A S RP + Y    +K  + + ++  E+ +   F+S    + +       L 
Sbjct: 715  VQNQPCVLALSSRPWLGYMDPLSKNFVMTPLSYAELEYGWNFSSEQCLEGMVGIHANFLR 774

Query: 643  IGTIDDIQKLHI-RSIPLGEHPRRIC-HQEQSRTFAI--------CSLKNQSCAEESEMH 692
            I TI+ + +  I +SIPL   P+R+  H EQ   + I          L+ Q  A+   ++
Sbjct: 775  IFTIEKLGETMIQKSIPLTYTPKRLVKHPEQPYFYTIEADNNTLPPELRAQLLAQSGAVN 834

Query: 693  F-VRLLDDQTFEF----------------ISTYP-------LDTFEYGCSILSCSF-SDD 727
                +L  + F +                +S  P       L+  E   S     F S +
Sbjct: 835  GDATVLPPEDFGYPRAKGRWASCISVVDPVSEEPRVLQRVDLEGNEAAVSAAVVPFASQE 894

Query: 728  SNVYYCVGTAY-VLPEENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVYSLNAFNGKLL 784
               +  VGT   ++    + T+G I V+   EDG+ L+ I +   +    +L  F G+LL
Sbjct: 895  GESFLIVGTGKDMVLNPRQFTEGYIHVYRFHEDGRSLEFIHKTRVEEPPMALIPFQGRLL 954

Query: 785  AAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKH 844
            A I + +++Y   LR    + L+   G     L + +QT+G+ IVVGD+ + ++ ++YK 
Sbjct: 955  AGIGKTLRVYDLGLR----QLLRKAQGEVAQQLIVSLQTQGNRIVVGDVQQGVTYVVYKP 1010

Query: 845  EEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEER-------- 896
            E   +     D    W + + ++D +   G +   NL+ VR     + + +         
Sbjct: 1011 ESNKLIPFVDDTINRWTTCITMVDYESVAGGDRFGNLWIVRAPERASQESDEPGSEVQLL 1070

Query: 897  ----------GRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGV- 945
                       RL ++  ++  +        +LV       VG    +++  + G +GV 
Sbjct: 1071 HARSYLHGAPNRLNLMAHFYPQDLPTSITKTNLV-------VGGQDVLVWSGIQGTVGVL 1123

Query: 946  IASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLD 1005
            I  +  E   F + L++++R     + G +H  +R +      V  K  +DGDL E F  
Sbjct: 1124 IPFVSREDVDFFQNLESHMRSEDPPLAGRDHLIYRGY-----YVPVKGVIDGDLCERFSL 1178

Query: 1006 LSRTRMDEISKTMNVSVEELCKRVEEL 1032
            L   +   I+  ++ SV E+ +++ ++
Sbjct: 1179 LPNDKKQMIAGELDRSVREVERKISDI 1205


>gi|46362557|gb|AAH68974.1| Splicing factor 3b, subunit 3, 130kDa [Homo sapiens]
          Length = 1217

 Score =  212 bits (540), Expect = 7e-52,   Method: Compositional matrix adjust.
 Identities = 272/1193 (22%), Positives = 495/1193 (41%), Gaps = 195/1193 (16%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            +L V ++G I +L  FR  G  +D++ + ++  +  +L++   S  +  +   +   + G
Sbjct: 53   LLTVEVFGVIRSLMAFRLTGGTKDYIVVGSDSGRIVILEYQP-SKNMFEKIHQETFGKSG 111

Query: 61   -RPTDNGQIGIIDPDCRLIGLHLYDG--LFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
             R    GQ   +DP  R + +   +   L  ++  D   +L  +  +   +   L    +
Sbjct: 112  CRRIVPGQFLAVDPKGRAVMISAIEKQKLVYILNRDAAARLTISSPLEAHKANTLVYHVV 171

Query: 118  ---YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNG 163
                G   P    L  D ++A +  T E A   +    F E     N+        L+  
Sbjct: 172  GVDVGFENPMFACLEMDYEEADNDPTGEAAANTQQTLTFYELDLGLNHVVRKYSEPLEEH 231

Query: 164  ADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR--- 215
             + LI VP     P   GVLI  E  I Y   N      IR  I +    +D D  R   
Sbjct: 232  GNFLITVPGGSDGP--SGVLICSENYITY--KNFGDQPDIRCPIPRRRNDLD-DPERGMI 286

Query: 216  --------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAV 261
                           L     G +  + +  +++ VT ++++      +A+ +  L    
Sbjct: 287  FVCSATHKTKSMFFSLAQTEQGDIFKITLETDEDMVTEIRLKYFDTVPVAAAMCVLKTGF 346

Query: 262  VYIGSSYGDSQLIKLN------------------------LQPDAKGSYVEVLERYVNLG 297
            +++ S +G+  L ++                          QP    + V +++   +L 
Sbjct: 347  LFVASEFGNHYLYQIAHLGDDDEEPEFSSAMPLEEGDTFFFQPRPLKNLV-LVDELDSLS 405

Query: 298  PIVDFC-VVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRS 355
            PI+ FC + DL  +   Q+    G     SLR++R+G+ ++E A  EL G    +W++R 
Sbjct: 406  PIL-FCQIADLANEDTPQLYVACGRGPRSSLRVLRHGLEVSEMAVSELPGNPNAVWTVRR 464

Query: 356  STDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSG 415
              +D FD +++VSF++ T +L++   + +EE    GF   T TL C     + LVQV   
Sbjct: 465  HIEDEFDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSLLGDDALVQVYPD 522

Query: 416  SVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKH 473
             +R + +  R   NEWK+P   ++     N  QV++A  GG LVY E+   G L E  + 
Sbjct: 523  GIRHIRADKR--VNEWKTPGKKTIVKCAVNQRQVVIALTGGELVYFEMDPSGQLNEYTER 580

Query: 474  AQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIP-RS 532
             ++  ++ C+ +  +      S+  AVG+  D +VRI SL   + +  + L  + +P + 
Sbjct: 581  KEMSADVVCMSLANVPPGEQRSRFLAVGL-VDNTVRIISLDPSDCL--QPLSMQALPAQP 637

Query: 533  VLLCAFE----------------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQ 576
              LC  E                G  YL   L +G LL  +L+  TG+L+D +   LG++
Sbjct: 638  ESLCIVEMGGTEKQDELGERGSIGFLYLNIGLQNGVLLRTVLDPVTGDLSDTRTRYLGSR 697

Query: 577  PITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIA 636
            P+ L     +    V A S R  + YS   +   + ++ + +     F S   P+ +   
Sbjct: 698  PVKLFRVRMQGQEAVLAMSSRSWLSYSYQSRFHLTPLSYETLEFASGFASEQCPEGIVAI 757

Query: 637  KEGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQEQSR-----------TFAICSLKNQ 683
                L I  ++ +  +  + + PL   PR+ + H E +            T A  + + Q
Sbjct: 758  STNTLRILALEKLGAVFNQVAFPLQYTPRKFVIHPESNNLIIIETDHNAYTEATKAQRKQ 817

Query: 684  SCAEE-------------SEM-------------------------HFVRLLDDQTFEFI 705
              AEE             +EM                           +R+++      +
Sbjct: 818  QMAEEMVEAAGEDERELAAEMAAAFLNENLPESIFGAPKAGNGQWASVIRVMNPIQGNTL 877

Query: 706  STYPLDTFEYGCSILSCSFSDDSNVYYC-VGTAYVLPEENEPTKGRILVF--IVEDG-KL 761
                L+  E   S+  C FS+    +Y  VG A  L        G  +    +V +G KL
Sbjct: 878  DLVQLEQNEAAFSVAVCRFSNTGEDWYVLVGVAKDLILNPRSVAGGFVYTYKLVNNGEKL 937

Query: 762  QLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY- 820
            + + +   +    ++  F G++L  + + +++Y     D G ++L  +C  + HI A Y 
Sbjct: 938  EFLHKTPVEEVPAAIAPFQGRVLIGVGKLLRVY-----DLGKKKLLRKC-ENKHI-ANYI 990

Query: 821  --VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENN 878
              +QT G  ++V D+ +S   + YK  E  +   A D    W++   +LD D   GA+  
Sbjct: 991  SGIQTIGHRVIVSDVQESFIWVRYKRNENQLIIFADDTYPRWVTTASLLDYDTVAGADKF 1050

Query: 879  FNLFTVR--KNSEGATDE---------ERGRL-------EVVGEYHLGEFVNRFRHGSLV 920
             N+  VR   N+    DE         +RG L       EV+  YH+GE V   +  +L+
Sbjct: 1051 GNICVVRLPPNTNDEVDEDPTGNKALWDRGLLNGASQKAEVIMNYHVGETVLSLQKTTLI 1110

Query: 921  MRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQW 979
                    G   ++++ T++G IG++     HE + F + ++ +LR     + G +H  +
Sbjct: 1111 -------PGGSESLVYTTLSGGIGILVPFTSHEDHDFFQHVEMHLRSEHPPLCGRDHLSF 1163

Query: 980  RSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            RS+         KN +DGDL E F  +   +   +S+ ++ +  E+ K++E++
Sbjct: 1164 RSY-----YFPVKNVIDGDLCEQFNSMEPNKQKNVSEELDRTPPEVSKKLEDI 1211


>gi|224064143|ref|XP_002188384.1| PREDICTED: splicing factor 3B subunit 3 [Taeniopygia guttata]
          Length = 1217

 Score =  212 bits (540), Expect = 8e-52,   Method: Compositional matrix adjust.
 Identities = 269/1192 (22%), Positives = 494/1192 (41%), Gaps = 193/1192 (16%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            +L V ++G I +L  FR  G  +D++ + ++  +  +L++   S  +  +   +   + G
Sbjct: 53   LLTVEVFGVIRSLMAFRLTGGTKDYIVVGSDSGRIVILEYQP-SKNVFEKIHQETFGKSG 111

Query: 61   -RPTDNGQIGIIDPDCRLIGLHLYDG--LFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
             R    GQ   +DP  R + +   +   L  ++  D   +L  +  +   +   L    +
Sbjct: 112  CRRIVPGQYLAVDPKGRAVMISAIEKQKLVYILNRDAAARLTISSPLEAHKANTLVYHVV 171

Query: 118  ---YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNG 163
                G   P    L  D ++A +  T E A   +    F E     N+        L+  
Sbjct: 172  GVDVGFENPMFACLEMDYEEADNDPTGEAAANTQQTLTFYELDLGLNHVVRKYSEPLEEH 231

Query: 164  ADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR--- 215
             + LI VP     P   GVLI  E  I Y   N      IR  I +    +D D  R   
Sbjct: 232  GNFLITVPGGSDGP--SGVLICSENYITY--KNFGDQPDIRCPIPRRRNDLD-DPERGMI 286

Query: 216  --------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAV 261
                          +L     G +  + +  +++ VT ++++      +A+ +  L    
Sbjct: 287  FVCSATHKTKSMFFFLAQTEQGDIFKITLETDEDMVTEIRLKYFDTVPVAAAMCVLKTGF 346

Query: 262  VYIGSSYGDSQLIKLN------------------------LQPDAKGSYVEVLERYVNLG 297
            +++ S +G+  L ++                          QP    + V +++   +L 
Sbjct: 347  LFVASEFGNHYLYQIAHLGDDDEEPEFSSAMPLEEGDTFFFQPRPLKNLV-LVDELDSLS 405

Query: 298  PIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSS 356
            PI+   + DL  +   Q+    G     SLR++R+G+ ++E A  EL G    +W++R  
Sbjct: 406  PILCCQIADLANEDTPQLYVACGRGPRSSLRVLRHGLEVSEMAVSELPGNPNAVWTVRRH 465

Query: 357  TDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGS 416
             +D FD +++VSF++ T +L++   + +EE    GF   T TL C     + LVQV    
Sbjct: 466  VEDEFDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSLLGDDALVQVYPDG 523

Query: 417  VRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKHA 474
            +R + +  R   NEWK+P   ++     N  QV++A  GG LVY E+   G L E  +  
Sbjct: 524  IRHIRADKR--VNEWKTPGKKTIVKCAVNQRQVVIALTGGELVYFEMDPSGQLNEYTERK 581

Query: 475  QLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIP-RSV 533
            ++  ++ C+ +  +      S+  AVG+  D +VRI SL   + +  + L  + +P +  
Sbjct: 582  EMSADVVCMSLANVPPGEQRSRFLAVGL-VDNTVRIISLDPSDCL--QPLSMQALPAQPE 638

Query: 534  LLCAFE----------------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQP 577
             LC  E                G  YL   L +G LL  +L+  TG+L+D +   LG++P
Sbjct: 639  SLCIVEMGGTEKQDELGERGSIGFLYLNIGLQNGVLLRTVLDPVTGDLSDTRTRYLGSRP 698

Query: 578  ITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAK 637
            + L     +    V A S R  + YS   +   + ++ + +     F S   P+ +    
Sbjct: 699  VKLFRVRMQGQEAVLAMSSRSWLSYSYQSRFHLTPLSYETLEFASGFASEQCPEGIVAIS 758

Query: 638  EGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQEQSR-----------TFAICSLKNQS 684
               L I  ++ +  +  + + PL   PR+ + H E +            T A  + + Q 
Sbjct: 759  TNTLRILALEKLGAVFNQVAFPLQYTPRKFVIHPESNNLIIIETDHNAYTEATKAQRKQQ 818

Query: 685  CAEE-------------SEM-------------------------HFVRLLDDQTFEFIS 706
             AEE             +EM                           +R+++      + 
Sbjct: 819  MAEEMVEAAGEDERELAAEMAAAFLNENLPESIFGAPKAGNGQWASVIRVMNPIQGNTLD 878

Query: 707  TYPLDTFEYGCSILSCSFSDDSNVYYC-VGTAYVLPEENEPTKGRILVF--IVEDG-KLQ 762
               L+  E   S+  C FS+  + +Y  VG A  L        G  +    +V  G KL+
Sbjct: 879  LVQLEQNEAAFSVAVCRFSNTGDEWYVLVGVAKDLILNPRSVAGGFVYTYKLVNSGEKLE 938

Query: 763  LIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY-- 820
             + +   +    ++  F G++L  + + +++Y     D G ++L  +C  + HI A Y  
Sbjct: 939  FLHKTPVEEVPAAIAPFQGRVLIGVGKLLRVY-----DLGKKKLLRKC-ENKHI-ANYIC 991

Query: 821  -VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNF 879
             +QT G  ++V D+ +S   + YK  E  +   A D    W++   +LD D   GA+   
Sbjct: 992  GIQTIGHRVIVSDVQESFIWVRYKRNENQLIIFADDTYPRWVTTATLLDYDTVAGADKFG 1051

Query: 880  NLFTVR--KNSEGATDE---------ERGRL-------EVVGEYHLGEFVNRFRHGSLVM 921
            N+  VR   N+    DE         +RG L       EV+  YH+GE V   +  +L+ 
Sbjct: 1052 NICVVRLPPNTNDEVDEDPTGNKALWDRGLLNGASQKAEVIMNYHVGETVLSLQKTTLI- 1110

Query: 922  RLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWR 980
                   G   ++++ T++G IG++     HE + F + ++ +LR     + G +H  +R
Sbjct: 1111 ------PGGSESLVYTTLSGGIGILVPFTSHEDHDFFQHVEMHLRSEHPPLCGRDHLSFR 1164

Query: 981  SFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            S+         KN +DGDL E F  +   +   +++ ++ +  E+ K++E++
Sbjct: 1165 SY-----YFPVKNVIDGDLCEQFNSMEPNKQKNVAEELDRTPPEVSKKLEDI 1211


>gi|296417673|ref|XP_002838477.1| hypothetical protein [Tuber melanosporum Mel28]
 gi|295634416|emb|CAZ82668.1| unnamed protein product [Tuber melanosporum]
          Length = 1202

 Score =  212 bits (539), Expect = 9e-52,   Method: Compositional matrix adjust.
 Identities = 260/1162 (22%), Positives = 481/1162 (41%), Gaps = 154/1162 (13%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            +L + ++G I  L  FR  G ++D++ + ++  +  +L++  + ++     +        
Sbjct: 59   VLTIDVFGIIRCLAAFRLAGSSKDYVIVGSDSGRITILEYQPQVNKFHRLHLETFGKSGV 118

Query: 61   RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL- 117
            R    GQ    DP  R  LI     + L  V+  + + +L  +  +   +   L    + 
Sbjct: 119  RRVIPGQFLACDPKGRACLISSVEKNKLVYVLNRNAQAELTISSPLEAHKPHCLVFALVA 178

Query: 118  --YGCAKPTIVVLYQD----NKDARHVKT---------YEVALKDKDFVEGPWSQNNLDN 162
               G   P    L  D    ++DA  V           YE+ L     V   WS + +D 
Sbjct: 179  LDVGYDNPVFAALEVDYGESDQDATGVAYENIEKMLVYYELDLGLNHVVRK-WS-DTVDR 236

Query: 163  GADLLIPVPPPL---CGVLIIGEETIVYCS-ANAFKAIPI------------RPSITKAY 206
             A++L  VP       GVL+  E++I Y     +   +PI            R +I    
Sbjct: 237  SANILFQVPGGTDGPSGVLVCAEDSIYYRHMGRSTHRVPIPRRRGACEDPNRRRTIISGV 296

Query: 207  GRVDADGSRYLLGDHAGLLHLLVITHEKEK-VTGLKIELLGETSIASTISYLDNAVVYIG 265
                     +LL    G L  + + ++ E  V+ +K++      IAS++  L +  +++ 
Sbjct: 297  MHKMRGAFFFLLQTEDGDLLKVTLDYDDESGVSRIKVKYFDTVPIASSLCILKSGFLFVA 356

Query: 266  SSYGDSQLIKL---------------NLQPDAKGSYVEV------------LERYVNLGP 298
            S  G+    +                N  PD     V +            +E   ++ P
Sbjct: 357  SEGGNHHFYQFEKLGDDDEEIEFSSDNFSPDPLDRTVPIYFHPRAPENLSLVEAIDSMNP 416

Query: 299  IVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSLRSST 357
            +VD  + +L      Q+ T  G+    + R +++G+   E  S EL G+   +W+ + + 
Sbjct: 417  LVDCKIANLTDDDAPQIYTVCGSGPRSTFRTLKHGLETTEIVSSELPGVPSAVWTTKLTN 476

Query: 358  DDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSV 417
             D FD ++V+SF + T +L++   + +EE    GF S   TL       + L+QV    +
Sbjct: 477  GDEFDAYIVLSFTNGTLVLSIG--ETVEEVTDTGFLSSAPTLAVQQLGEDALLQVHPKGI 534

Query: 418  RLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTEVKHA-Q 475
            R + +  R   NEW +P   S+  A+ N+ QV +A   G +VY E+  DG L E +   +
Sbjct: 535  RHIRADRRV--NEWPAPQHRSIVAASTNSRQVAVALSSGEIVYFEMDSDGQLAEYEEKKE 592

Query: 476  LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPRSVL 534
            +   ++CL +  + E    S   AVG   D +VRI SL PD  L +K        P S+ 
Sbjct: 593  MSGTVTCLSLGEVPEGRVRSSFLAVGC-DDATVRILSLDPDSTLESKSVQALTSAPSSLC 651

Query: 535  LCAFEGIS---------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSS 585
            + +    +         YL   L  G  L  +L+  TGELTD +   LG +P+ +    +
Sbjct: 652  IMSMPDSASETSRSLTLYLHIGLYSGVYLRTVLDTVTGELTDTRTRFLGPKPVKIFKVIA 711

Query: 586  KNTTHVFAASDRPTVIYSSNK-KLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIG 644
            +    V A S RP + YS  + + + + +    +     F+S   P+ +   +   L I 
Sbjct: 712  QGLPSVLALSSRPWLGYSDARGQFMLTPLTYPMLEWGWTFSSEQCPEGMVGIQGQNLRIF 771

Query: 645  TIDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAIC-----------------SLKNQSCA 686
            TI+ +   L   SIPL   PR++        F +                  S +N    
Sbjct: 772  TIEKLTDNLQQESIPLSYTPRKMLKHPDQPVFYVAEADANTLSVATKQKLMNSAQNGDTN 831

Query: 687  EESEMHF------------VRLLDDQTFEFISTYPLDTFEYGCSILSCSFSD-DSNVYYC 733
            E +   F            + ++D  + E      L+  E   SI + SFS  D   +  
Sbjct: 832  ELAPAEFGNPRGEGHWASCISVVDPISKEVTQRIELEENEAAFSIAAVSFSSQDDEWFLV 891

Query: 734  VGTAY---VLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQK 790
            VGT     + P ++      I  F+ +  +L+ I + +      +L  F G+LLA I + 
Sbjct: 892  VGTGKDTCLSPRKSANGYIHIYRFLNDGKELEFIHKTKVDEPPLALLGFQGRLLAGIGKD 951

Query: 791  IQLYKWMLRDDGTRELQSEC-GHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAI 849
            +++Y     D G ++L  +  G     +   +QT+G  I+V D+ +S++ ++YK+++  +
Sbjct: 952  LKIY-----DLGMKQLLRKAQGQVAPNVINGLQTQGSRIIVSDVQESVTYVVYKYQDNRL 1006

Query: 850  EERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDEERG--------- 897
               A D    + +   ++D +   G +   N + VR   K S+ + ++  G         
Sbjct: 1007 IPFADDMIPRFTTCTTMVDYETVAGGDRFGNFWIVRCPQKASDESDEDPAGGHLIHERSY 1066

Query: 898  ------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIAS-LP 950
                  RL ++  ++  +     +   LV        G   T+++  + G IG++   + 
Sbjct: 1067 LQGAPNRLNLMCHFYPQDIPTSVQKAQLV-------TGGRDTLVWTGLQGTIGLMVPFVS 1119

Query: 951  HEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTR 1010
             E   F + L+ ++R     + G +H  +RS+      V  K  +DGDL E++  LSR +
Sbjct: 1120 REDVDFFQTLEQHMRTEDAPIAGRDHLIYRSY-----YVPVKGVIDGDLCENYSLLSRDK 1174

Query: 1011 MDEISKTMNVSVEELCKRVEEL 1032
               I+  ++ SV E+ +++ ++
Sbjct: 1175 KAMIAAELDRSVREVERKIADM 1196


>gi|327286386|ref|XP_003227911.1| PREDICTED: splicing factor 3B subunit 3-like [Anolis carolinensis]
          Length = 1217

 Score =  212 bits (539), Expect = 9e-52,   Method: Compositional matrix adjust.
 Identities = 270/1193 (22%), Positives = 496/1193 (41%), Gaps = 195/1193 (16%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            +L V ++G I +L  FR  G  +D++ + ++  +  +L++   S  +  +   +   + G
Sbjct: 53   LLTVEVFGIIRSLMAFRLTGGTKDYIVVGSDSGRIVILEYQP-SKNVFEKIHQETFGKSG 111

Query: 61   -RPTDNGQIGIIDPDCRLIGLHLYDG--LFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
             R    GQ   +DP  R + +   +   L  ++  D   +L  +  +   +   L    +
Sbjct: 112  CRRIVPGQYLAVDPKGRAVMISAIEKQKLVYILNRDAAARLTISSPLEAHKANTLVYHVV 171

Query: 118  ---YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNG 163
                G   P    L  D ++A +  T E A   +    F E     N+        L+  
Sbjct: 172  GVDVGFENPMFACLEMDYEEADNDPTGEAAANTQQTLTFYELDLGLNHVVRKYSEPLEEH 231

Query: 164  ADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR--- 215
             + LI VP     P   GVLI  E  I Y   N      IR  I +    +D D  R   
Sbjct: 232  GNFLITVPGGSDGP--SGVLICSENYITY--KNFGDQPDIRCPIPRRRNDLD-DPERGMI 286

Query: 216  --------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAV 261
                          +L     G +  + +  +++ VT ++++      +A+ +  L    
Sbjct: 287  FVCSATHKTKSMFFFLAQTEQGDIFKITLETDEDMVTEIRLKYFDTVPVAAAMCVLKTGF 346

Query: 262  VYIGSSYGDSQLIKLN------------------------LQPDAKGSYVEVLERYVNLG 297
            +++ S +G+  L ++                          QP    + V +++   +L 
Sbjct: 347  LFVASEFGNHYLYQIAHLGDDDEEPEFSSAMPLEEGDTFFFQPRPLKNLV-LVDELDSLS 405

Query: 298  PIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSS 356
            PI+   + DL  +   Q+    G     SLR++R+G+ ++E A  EL G    +W++R  
Sbjct: 406  PILCCQIADLANEDTPQLYVACGRGPRSSLRVLRHGLEVSEMAVSELPGNPNAVWTVRRH 465

Query: 357  TDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGS 416
             +D FD +++VSF++ T +L++   + +EE    GF   T TL C     + LVQV    
Sbjct: 466  VEDEFDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSLLGEDALVQVYPDG 523

Query: 417  VRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKHA 474
            +R + +  R   NEWK+P   ++     N  QV++A  GG LVY E+   G L E  +  
Sbjct: 524  IRHIRADKR--VNEWKTPGKKTIVKCAVNQRQVVIALTGGELVYFEMDPSGQLNEYTERK 581

Query: 475  QLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIP-RSV 533
            ++  ++ C+ +  +      S+  AVG+  D +VRI SL   + +  + L  + +P +  
Sbjct: 582  EMSADVVCMSLANVPPGEQRSRFLAVGL-VDNTVRIISLDPSDCL--QPLSMQALPAQPE 638

Query: 534  LLCAFE----------------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQP 577
             LC  E                G  YL   L +G LL  +L+  TG+L+D +   LG++P
Sbjct: 639  SLCIVEMGGAEKQDELGERGSIGFLYLNIGLQNGVLLRTVLDPVTGDLSDTRTRYLGSRP 698

Query: 578  ITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAK 637
            + L     +    V A S R  + YS   +   + ++ + +     F S   P+ +    
Sbjct: 699  VKLFRVRMQGQEAVLAMSSRSWLSYSYQSRFHLTPLSYETLEFASGFASEQCPEGIVAIS 758

Query: 638  EGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQEQSR-----------TFAICSLKNQS 684
               L I  ++ +  +  + + PL   PR+ + H E +            T A  + + Q 
Sbjct: 759  TNTLRILALEKLGAVFNQVAFPLQYTPRKFVIHPESNNLIIIETDHNAYTEATKAQRKQQ 818

Query: 685  CAEE-------------SEM-------------------------HFVRLLDDQTFEFIS 706
             AEE             +EM                           VR+++      + 
Sbjct: 819  MAEEMVEAAGEDERELAAEMAAAFLNENLPESIFGAPKAGNGQWASVVRVMNPIQGNTLD 878

Query: 707  TYPLDTFEYGCSILSCSFS---DDSNVYYCVGTAYVLPEENEPTKGRILVF-IVEDG-KL 761
               L+  E   S+  C F+   DD +V   V    +L        G +  + +V +G KL
Sbjct: 879  LVQLEQNEAAFSVAVCRFANTGDDWHVLVGVAKDLIL-NPRSVAGGFVYTYKLVNNGEKL 937

Query: 762  QLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY- 820
            + + +   +    ++  F G++L  + + +++Y     D G ++L  +C  + HI A Y 
Sbjct: 938  EFMHKTPVEEVPAAIAPFQGRILIGVGKLLRVY-----DLGKKKLLRKC-ENKHI-ANYI 990

Query: 821  --VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENN 878
              +QT G  ++V D+ +S   + YK  E  +   A D +  W++   +LD D   GA+  
Sbjct: 991  CGIQTIGHRVIVSDVQESFIWVRYKRNENQLIIFADDTHPRWVTTACLLDYDTVAGADKF 1050

Query: 879  FNLFTVR--KNSEGATDE---------ERGRL-------EVVGEYHLGEFVNRFRHGSLV 920
             N+  VR   N+    DE         +RG L       EV+  YH+GE V   +  +L+
Sbjct: 1051 GNICVVRLPPNTNDEVDEDPTGNKALWDRGLLNGASQKAEVIMNYHVGETVLSLQKTTLI 1110

Query: 921  MRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQW 979
                    G   ++++ T++G IG++     HE + F + ++ +LR     + G +H  +
Sbjct: 1111 -------PGGSESLVYTTLSGGIGILVPFTSHEDHDFFQHVEMHLRSEHPPLCGRDHLSF 1163

Query: 980  RSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            RS+         KN +DGDL E F  +   +   +S+ ++ +  E+ K++E++
Sbjct: 1164 RSY-----YFPVKNVIDGDLCEQFNSMEPNKQKNVSEELDRTPPEVSKKLEDI 1211


>gi|326927039|ref|XP_003209702.1| PREDICTED: splicing factor 3B subunit 3-like [Meleagris gallopavo]
 gi|363738006|ref|XP_001232348.2| PREDICTED: splicing factor 3B subunit 3 [Gallus gallus]
          Length = 1217

 Score =  212 bits (539), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 267/1192 (22%), Positives = 495/1192 (41%), Gaps = 193/1192 (16%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            +L V ++G I +L  FR  G  +D++ + ++  +  +L++   S  +  +   +   + G
Sbjct: 53   LLTVEVFGVIRSLMAFRLTGGTKDYIVVGSDSGRIVILEYQP-SKNVFEKIHQETFGKSG 111

Query: 61   -RPTDNGQIGIIDPDCRLIGLHLYDG--LFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
             R    GQ   +DP  R + +   +   L  ++  D   +L  +  +   +   L    +
Sbjct: 112  CRRIVPGQYLAVDPKGRAVMISAIEKQKLVYILNRDAAARLTISSPLEAHKANTLVYHVV 171

Query: 118  ---YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNG 163
                G   P    L  D ++A +  T E A   +    F E     N+        L+  
Sbjct: 172  GVDVGFENPMFACLEMDYEEADNDPTGEAAANTQQTLTFYELDLGLNHVVRKYSEPLEEH 231

Query: 164  ADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR--- 215
             + LI VP     P   GVLI  E  I Y   N      IR  I +    +D D  R   
Sbjct: 232  GNFLITVPGGSDGP--SGVLICSENYITY--KNFGDQPDIRCPIPRRRNDLD-DPERGMI 286

Query: 216  --------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAV 261
                          +L     G +  + +  +++ VT ++++      +A+ +  L    
Sbjct: 287  FVCSATHKTKSMFFFLAQTEQGDIFKITLETDEDMVTEIRLKYFDTVPVAAAMCVLKTGF 346

Query: 262  VYIGSSYGDSQLIKLN------------------------LQPDAKGSYVEVLERYVNLG 297
            +++ S +G+  L ++                          QP    + V +++   +L 
Sbjct: 347  LFVASEFGNHYLYQIAHLGDDDEEPEFSSAMPLEEGDTFFFQPRPLKNLV-LVDELDSLS 405

Query: 298  PIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSS 356
            PI+   + DL  +   Q+    G     SLR++R+G+ ++E A  EL G    +W++R  
Sbjct: 406  PILCCQIADLANEDTPQLYVACGRGPRSSLRVLRHGLEVSEMAVSELPGNPNAVWTVRRH 465

Query: 357  TDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGS 416
             +D FD +++VSF++ T +L++   + +EE    GF   T TL C     + LVQV    
Sbjct: 466  VEDEFDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSLLGDDALVQVYPDG 523

Query: 417  VRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKHA 474
            +R + +  R   NEWK+P   ++     N  QV++A  GG LVY E+   G L E  +  
Sbjct: 524  IRHIRADKR--VNEWKTPGKKTIVKCAVNQRQVVIALTGGELVYFEMDPSGQLNEYTERK 581

Query: 475  QLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIP-RSV 533
            ++  ++ C+ +  +      S+  AVG+  D +VRI SL   + +  + L  + +P +  
Sbjct: 582  EMSADVVCMSLANVPPGEQRSRFLAVGL-VDNTVRIISLDPSDCL--QPLSMQALPAQPE 638

Query: 534  LLCAFE----------------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQP 577
             LC  E                G  YL   L +G LL  +L+  TG+L+D +   LG++P
Sbjct: 639  SLCIVEMGGTEKQDELGERGSIGFLYLNIGLQNGVLLRTVLDPVTGDLSDTRTRYLGSRP 698

Query: 578  ITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAK 637
            + L     +    V A S R  + YS   +   + ++ + +     F S   P+ +    
Sbjct: 699  VKLFRVRMQGQEAVLAMSSRSWLSYSYQSRFHLTPLSYETLEFASGFASEQCPEGIVAIS 758

Query: 638  EGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQEQSR-----------TFAICSLKNQS 684
               L I  ++ +  +  + + PL   PR+ + H E +            T A  + + Q 
Sbjct: 759  TNTLRILALEKLGAVFNQVAFPLQYTPRKFVIHPESNNLIIIETDHNAYTEATKAQRKQQ 818

Query: 685  CAEE-------------SEM-------------------------HFVRLLDDQTFEFIS 706
             AEE             +EM                           +R+++      + 
Sbjct: 819  MAEEMVEAAGEDERELAAEMAAAFLNENLPESIFGAPKAGNGQWASVIRVMNPIQGNTLD 878

Query: 707  TYPLDTFEYGCSILSCSFSDDSNVYYC-VGTAY-VLPEENEPTKGRILVFIVEDG--KLQ 762
               L+  E   S+  C FS+    +Y  VG A  ++        G +  + + +G  KL+
Sbjct: 879  LVQLEQNEAAFSVAVCRFSNTGEEWYVLVGVAKDLILNPRSVAGGFVYTYKLVNGGEKLE 938

Query: 763  LIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY-- 820
             + +   +    ++  F G++L  + + +++Y     D G ++L  +C  + HI A Y  
Sbjct: 939  FLHKTPVEEVPAAIAPFQGRVLIGVGKLLRVY-----DLGKKKLLRKC-ENKHI-ANYIC 991

Query: 821  -VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNF 879
             +QT G  ++V D+ +S   + YK  E  +   A D    W++   +LD D   GA+   
Sbjct: 992  GIQTIGHRVIVSDVQESFIWVRYKRNENQLIIFADDTYPRWVTTATLLDYDTVAGADKFG 1051

Query: 880  NLFTVR--KNSEGATDE---------ERGRL-------EVVGEYHLGEFVNRFRHGSLVM 921
            N+  VR   N+    DE         +RG L       EV+  YH+GE V   +  +L+ 
Sbjct: 1052 NICVVRLPPNTNDEVDEDPTGNKALWDRGLLNGASQKAEVIMNYHVGETVLSLQKTTLI- 1110

Query: 922  RLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWR 980
                   G   ++++ T++G IG++     HE + F + ++ +LR     + G +H  +R
Sbjct: 1111 ------PGGSESLVYTTLSGGIGILVPFTSHEDHDFFQHVEMHLRSEHPPLCGRDHLSFR 1164

Query: 981  SFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            S+         KN +DGDL E F  +   +   +++ ++ +  E+ K++E++
Sbjct: 1165 SY-----YFPVKNVIDGDLCEQFNSMEPNKQKNVAEELDRTPPEVSKKLEDI 1211


>gi|301604148|ref|XP_002931705.1| PREDICTED: splicing factor 3B subunit 3 [Xenopus (Silurana)
            tropicalis]
          Length = 1217

 Score =  211 bits (538), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 264/1190 (22%), Positives = 494/1190 (41%), Gaps = 189/1190 (15%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            +L V ++G I +L  FR  G  +D++ + ++  +  +L++   S  +  +   +   + G
Sbjct: 53   LLTVEVFGVIRSLMAFRLTGGTKDYIVVGSDSGRIIILEYHP-SKNMFEKIHQETFGKSG 111

Query: 61   -RPTDNGQIGIIDPDCRLIGLHLYD--GLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
             R    GQ   +DP  R + +   +   L  ++  D   +L  +  +   +   L    +
Sbjct: 112  CRRIVPGQFIAVDPKGRAVMISAIEKQKLVYILNRDAAARLTISSPLEAHKANTLVYHVV 171

Query: 118  ---YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNG 163
                G   P    L  D ++A +  T E A   +    F E     N+        L+  
Sbjct: 172  GVDVGFENPMFACLEMDYEEADNDPTGEAAANTQQTLTFYELDLGLNHVVRKYSEPLEEH 231

Query: 164  ADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR--- 215
             + LI VP     P   GVLI  E  I Y   N      IR  I +    +D D  R   
Sbjct: 232  GNFLITVPGGSDGP--SGVLICSENYITY--KNFGDQPDIRCPIPRRRNDLD-DPERGMI 286

Query: 216  --------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAV 261
                          +L     G +  + +  +++ VT ++++      +A+ +  L    
Sbjct: 287  FVCSATHKTKSMFFFLAQTEQGDIFKITLETDEDMVTEIRLKYFDTVPVAAAMCVLKTGF 346

Query: 262  VYIGSSYGDSQLIKLN------------------------LQPDAKGSYVEVLERYVNLG 297
            +++ S +G+  L ++                          QP    + V V E+  +L 
Sbjct: 347  LFVASEFGNHYLYQIAHLGDDDDEPEFSSAMPLEEGDTFFFQPRPLKNLVLVDEQD-SLS 405

Query: 298  PIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSS 356
            PI+   + DL  +   Q+    G     SLR++R+G+ ++E A  EL G    +W++R  
Sbjct: 406  PIMSCQIADLANEDTPQLYVACGRGPRSSLRVLRHGLEVSEMAVSELPGNPNAVWTVRRH 465

Query: 357  TDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGS 416
             +D +D +++VSF++ T +L++   + +EE    GF   T TL C     + LVQV    
Sbjct: 466  IEDEYDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSLLGEDALVQVYPDG 523

Query: 417  VRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKHA 474
            +R + +  R   NEWK+P    +     N  QV++A  GG LVY E+   G L E  +  
Sbjct: 524  IRHIRADKR--VNEWKTPGKKIIVKCAVNQRQVVIALTGGELVYFEMDPSGQLNEYTERK 581

Query: 475  QLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIP-RSV 533
            ++  ++ C+ +  +      S+  AVG+  D +VRI SL   + +  + L  + +P +  
Sbjct: 582  EMSADVVCMSLANVPPGEQRSRFLAVGL-VDNTVRIISLDPSDCL--QPLSMQALPAQPE 638

Query: 534  LLCAFE----------------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQP 577
             LC  E                G  YL   L +G LL  +L+  TG+L+D +   LG++P
Sbjct: 639  ALCIVEMGGAERQDELGERGSIGFLYLNIGLQNGVLLRTVLDPVTGDLSDTRTRYLGSRP 698

Query: 578  ITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAK 637
            + L     +    V A S R  + YS   +   + ++ + + +   F S   P+ +    
Sbjct: 699  VKLFRVRMQGQEAVLAMSSRSWLSYSYQSRFHLTPLSYETLEYASGFASEQCPEGIVAIS 758

Query: 638  EGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQEQSR-----------TFAICSLKNQS 684
               L I  ++ +  +  + + PL   PR+ + H E +            T A  + + Q 
Sbjct: 759  TNTLRILALEKLGAVFNQVAFPLQYTPRKFVIHPESNNLIIIETDHNAYTEATKAQRKQQ 818

Query: 685  CAEE-------------SEM-------------------------HFVRLLDDQTFEFIS 706
             AEE             +EM                           +R+++      + 
Sbjct: 819  MAEEMVEAAGEDERELAAEMAAAFLNENLPESIFGAPKAGNGQWASVIRVMNPIQGNTLD 878

Query: 707  TYPLDTFEYGCSILSCSFSDDSNVYYC-VGTAY-VLPEENEPTKGRILVF-IVEDG-KLQ 762
               L+  E   S+  C F++  + +Y  VG A  ++        G +  + +V +G KL+
Sbjct: 879  LVQLEQNEAAFSVAVCRFTNTGDDWYVLVGVAKDLILNPRSVAAGLVYTYKLVNNGEKLE 938

Query: 763  LIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSEC-GHHGHILALYV 821
             + +   +    ++  F G++L  + + +++Y     D G ++L  +C   H     + +
Sbjct: 939  FVHKTPVEEVPAAIAPFQGRVLIGVGKLLRIY-----DLGKKKLLRKCENKHISNFIVGI 993

Query: 822  QTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNL 881
            QT G  +++ D+ +S   + YK  E  +   A D    W++   +LD D   GA+   N+
Sbjct: 994  QTIGQRVILSDVQESFIWVRYKRNENQLIIFADDTYPRWVTTSCLLDYDTVAGADKFGNI 1053

Query: 882  FTVR--KNSEGATDE---------ERGRL-------EVVGEYHLGEFVNRFRHGSLVMRL 923
              VR   N+    DE         +RG L       EV+  YH+GE V   +  +L+   
Sbjct: 1054 CIVRLPSNTNDDVDEDPTGNKALWDRGLLNGASQKAEVIVNYHVGETVLSVQKTTLI--- 1110

Query: 924  PDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSF 982
                 G   ++++ T++G IG++     HE + F + ++ +LR     + G +H  +RS+
Sbjct: 1111 ----PGGSESLVYTTLSGGIGILVPFTSHEDHDFFQHVEMHLRSEHPPICGRDHLSFRSY 1166

Query: 983  NNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
                     KN +DGDL E F  +   +   +++ ++ +  E+ K++E++
Sbjct: 1167 -----YFPVKNVIDGDLCEQFNSMEPVKQKSVAEELDRTPPEVSKKLEDI 1211


>gi|6006515|emb|CAB56791.1| spliceosomal protein SAP 130 [Homo sapiens]
          Length = 1217

 Score =  211 bits (538), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 271/1193 (22%), Positives = 495/1193 (41%), Gaps = 195/1193 (16%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            +L V ++G I +L  FR  G  +D++ + ++  +  +L++   S  +  +   +   + G
Sbjct: 53   LLTVEVFGVIRSLMAFRLTGGTKDYIVVGSDSGRIVILEYQP-SKNMFEKIHQETFGKSG 111

Query: 61   -RPTDNGQIGIIDPDCRLIGLHLYDG--LFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
             R    GQ   +DP  R + +   +   L  ++  D   +L  +  +   +   L    +
Sbjct: 112  CRRIVPGQFLAVDPKGRAVMISAIEKQKLVYILNRDAAARLTISSPLEAHKANTLVYHVV 171

Query: 118  ---YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNG 163
                G   P    L  D ++A +  T E A   +    F E     N+        L+  
Sbjct: 172  GVDVGFENPMFACLEMDYEEAGNDPTGEAAANTQQTLTFYELDLGLNHVVRKYSEPLEEH 231

Query: 164  ADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR--- 215
             + LI VP     P   GVLI  E  I Y   N      IR  I +    +D D  R   
Sbjct: 232  GNFLITVPGGSDGP--SGVLICSENYITY--KNFGDQPDIRCPIPRRRNDLD-DPERGMI 286

Query: 216  --------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAV 261
                          +      G +  + +  +++ VT ++++      +A+ +  L    
Sbjct: 287  FVCSATHKTKSMFFFWAQTEQGDIFKITLETDEDMVTEIRLKYFDTVPVAAAMCVLKTGF 346

Query: 262  VYIGSSYGDSQLIKLN------------------------LQPDAKGSYVEVLERYVNLG 297
            +++ S +G+  L ++                          QP    + V +++   +L 
Sbjct: 347  LFVASEFGNHYLYQIAHLGDDDEEPEFSSAMPLEEGDTFFFQPRPLKNLV-LVDELDSLS 405

Query: 298  PIVDFC-VVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRS 355
            PI+ FC + DL  +   Q+    G     SLR++R+G+ ++E A  EL G    +W++R 
Sbjct: 406  PIL-FCQIADLANEDTPQLYVACGRGPRSSLRVLRHGLEVSEMAVSELPGNPNAVWTVRR 464

Query: 356  STDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSG 415
              +D FD +++VSF++ T +L++   + +EE    GF   T TL C     + LVQV   
Sbjct: 465  HIEDEFDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSLLGDDALVQVYPD 522

Query: 416  SVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKH 473
             +R + +  R   NEWK+P   ++     N  QV++A  GG LVY E+   G L E  + 
Sbjct: 523  GIRHIRADKR--VNEWKTPGKKTIVKCAVNQRQVVIALTGGELVYFEMDPSGQLNEYTER 580

Query: 474  AQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIP-RS 532
             ++  ++ C+ +  +      S+  AVG+  D +VRI SL   + +  + L  + +P + 
Sbjct: 581  KEMSADVVCMSLANVPPGEQRSRFLAVGL-VDNTVRIISLDPSDCL--QPLSMQALPAQP 637

Query: 533  VLLCAFE----------------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQ 576
              LC  E                G  YL   L +G LL  +L+  TG+L+D +   LG++
Sbjct: 638  ESLCIVEMGGTEKQDELGERGSIGFLYLNIGLQNGVLLRTVLDPVTGDLSDTRTRYLGSR 697

Query: 577  PITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIA 636
            P+ L     +    V A S R  + YS   +   + ++ + +     F S   P+ +   
Sbjct: 698  PVKLFRVRMQGQEAVLAMSSRSWLSYSYQSRFHLTPLSYETLEFASGFASEQCPEGIVAI 757

Query: 637  KEGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQEQSR-----------TFAICSLKNQ 683
                L I  ++ +  +  + + PL   PR+ + H E +            T A  + + Q
Sbjct: 758  STNTLRILALEKLGAVFNQVAFPLQYTPRKFVIHPESNNLIIIETDHNAYTEATKAQRKQ 817

Query: 684  SCAEE-------------SEM-------------------------HFVRLLDDQTFEFI 705
              AEE             +EM                           +R+++      +
Sbjct: 818  QMAEEMVEAAGEDERELAAEMAAAFLNENLPESIFGAPKAGNGQWASVIRVMNPIQGNTL 877

Query: 706  STYPLDTFEYGCSILSCSFSDDSNVYYC-VGTAYVLPEENEPTKGRILVF--IVEDG-KL 761
                L+  E   S+  C FS+    +Y  VG A  L        G  +    +V +G KL
Sbjct: 878  DLVQLEQNEAAFSVAVCRFSNTGEDWYVLVGVAKDLILNPRSVAGGFVYTYKLVNNGEKL 937

Query: 762  QLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY- 820
            + + +   +    ++  F G++L  + + +++Y     D G ++L  +C  + HI A Y 
Sbjct: 938  EFLHKTPVEEVPAAIAPFQGRVLIGVGKLLRVY-----DLGKKKLLRKC-ENKHI-ANYI 990

Query: 821  --VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENN 878
              +QT G  ++V D+ +S   + YK  E  +   A D    W++   +LD D   GA+  
Sbjct: 991  SGIQTIGHRVIVSDVQESFIWVRYKRNENQLIIFADDTYPRWVTTASLLDYDTVAGADKF 1050

Query: 879  FNLFTVR--KNSEGATDE---------ERGRL-------EVVGEYHLGEFVNRFRHGSLV 920
             N+  VR   N+    DE         +RG L       EV+  YH+GE V   +  +L+
Sbjct: 1051 GNICVVRLPPNTNDEVDEDPTGNKALWDRGLLNGASQKAEVIMNYHVGETVLSLQKTTLI 1110

Query: 921  MRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQW 979
                    G   ++++ T++G IG++     HE + F + ++ +LR     + G +H  +
Sbjct: 1111 -------PGGSESLVYTTLSGGIGILVPFTSHEDHDFFQHVEMHLRSEHPPLCGRDHLSF 1163

Query: 980  RSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            RS+         KN +DGDL E F  +   +   +S+ ++ +  E+ K++E++
Sbjct: 1164 RSY-----YFPVKNVIDGDLCEQFNSMEPNKQKNVSEELDRTPPEVSKKLEDI 1211


>gi|410297890|gb|JAA27545.1| splicing factor 3b, subunit 3, 130kDa [Pan troglodytes]
          Length = 1217

 Score =  211 bits (538), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 272/1193 (22%), Positives = 495/1193 (41%), Gaps = 195/1193 (16%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            +L V ++G I +L  FR  G  +D++ + ++  +  +L++   S  +  +   +   + G
Sbjct: 53   LLTVEVFGVIRSLMAFRLTGGTKDYIVVGSDSGRIVILEYQP-SKNMFEKIHQETFGKSG 111

Query: 61   -RPTDNGQIGIIDPDCRLIGLHLYDG--LFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
             R    GQ    DP  R + +   +   L  ++  D   +L  +  +   +   L    +
Sbjct: 112  CRRIVPGQFLAGDPKGRAVMISAIEKQKLVYILNRDAAARLTISSPLEAHKANTLVYHVV 171

Query: 118  ---YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNG 163
                G   P    L  D ++A +  T E A   +    F E     N+        L+  
Sbjct: 172  GVDVGFENPMFACLEMDYEEADNDPTGEAAANTQQTLTFYELDLGLNHVVRKYSEPLEEH 231

Query: 164  ADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR--- 215
             + LI VP     P   GVLI  E  I Y   N      IR  I +    +D D  R   
Sbjct: 232  GNFLITVPGGSDGP--SGVLICSENYITY--KNFGDQPDIRCPIPRRRNDLD-DPERGMI 286

Query: 216  --------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAV 261
                          +L     G +  + +  +++ VT ++++      +A+ +  L    
Sbjct: 287  FVCSATHKTKSMFFFLAQTEQGDIFKITLETDEDMVTEIRLKYFDTVPVAAAMCVLKTGF 346

Query: 262  VYIGSSYGDSQLIKLN------------------------LQPDAKGSYVEVLERYVNLG 297
            +++ S +G+  L ++                          QP    + V +++   +L 
Sbjct: 347  LFVASEFGNHYLYQIAHLGDDDEEPEFSSAMPLEEGDTFFFQPRPLKNLV-LVDELDSLS 405

Query: 298  PIVDFC-VVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRS 355
            PI+ FC + DL  +   Q+    G     SLR++R+G+ ++E A  EL G    +W++R 
Sbjct: 406  PIL-FCQIADLANEDTPQLYVACGRGPRSSLRVLRHGLEVSEMAVSELPGNPNAVWTVRR 464

Query: 356  STDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSG 415
              +D FD +++VSF++ T +L++   + +EE    GF   T TL C     + LVQV   
Sbjct: 465  HIEDEFDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSLLGDDALVQVYPD 522

Query: 416  SVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKH 473
             +R + +  R   NEWK+P   ++     N  QV++A  GG LVY E+   G L E  + 
Sbjct: 523  GIRHIRADKR--VNEWKTPGKKTIVKCAVNQRQVVIALTGGELVYFEMDPSGQLNEYTER 580

Query: 474  AQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIP-RS 532
             ++  ++ C+ +  +      S+  AVG+  D +VRI SL   + +  + L  + +P + 
Sbjct: 581  KEMSADVVCMSLANVPPGEQRSRFLAVGL-VDNTVRIISLDPSDCL--QPLSMQALPAQP 637

Query: 533  VLLCAFE----------------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQ 576
              LC  E                G  YL   L +G LL  +L+  TG+L+D +   LG++
Sbjct: 638  ESLCIVEMGGTEKQDELGERGSIGFLYLNIGLQNGVLLRTVLDPVTGDLSDTRTRYLGSR 697

Query: 577  PITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIA 636
            P+ L     +    V A S R  + YS   +   + ++ + +     F S   P+ +   
Sbjct: 698  PVKLFRVRMQGQEAVLAMSSRSWLSYSYQSRFHLTPLSYETLEFASGFASEQCPEGIVAI 757

Query: 637  KEGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQEQSR-----------TFAICSLKNQ 683
                L I  ++ +  +  + + PL   PR+ + H E +            T A  + + Q
Sbjct: 758  STNTLRILALEKLGAVFNQVAFPLQYTPRKFVIHPESNNLIIIETDHNAYTEATKAQRKQ 817

Query: 684  SCAEE-------------SEM-------------------------HFVRLLDDQTFEFI 705
              AEE             +EM                           +R+++      +
Sbjct: 818  QMAEEMVEAAGEDERELAAEMAAAFLNENLPESIFGAPKAGNGQWASVIRVMNPIQGNTL 877

Query: 706  STYPLDTFEYGCSILSCSFSDDSNVYYC-VGTAYVLPEENEPTKGRILVF--IVEDG-KL 761
                L+  E   S+  C FS+    +Y  VG A  L        G  +    +V +G KL
Sbjct: 878  DLVQLEQNEAAFSVAVCRFSNTGEDWYVLVGVAKDLILNPRSVAGGFVYTYKLVNNGEKL 937

Query: 762  QLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY- 820
            + + +   +    ++  F G++L  + + +++Y     D G ++L  +C  + HI A Y 
Sbjct: 938  EFLHKTPVEEVPAAIAPFQGRVLIGVGKLLRVY-----DLGKKKLLRKC-ENKHI-ANYI 990

Query: 821  --VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENN 878
              +QT G  ++V D+ +S   + YK  E  +   A D    W++   +LD D   GA+  
Sbjct: 991  SGIQTIGHRVIVSDVQESFIWVRYKRNENQLIIFADDTYPRWVTTASLLDYDTVAGADKF 1050

Query: 879  FNLFTVR--KNSEGATDE---------ERGRL-------EVVGEYHLGEFVNRFRHGSLV 920
             N+  VR   N+    DE         +RG L       EV+  YH+GE V   +  +L+
Sbjct: 1051 GNICVVRLPPNTNDEVDEDPTGNKALWDRGLLNGASQKAEVIMNYHVGETVLSLQKTTLI 1110

Query: 921  MRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQW 979
                    G   ++++ T++G IG++     HE + F + ++ +LR     + G +H  +
Sbjct: 1111 -------PGGSESLVYTTLSGGIGILVPFTSHEDHDFFQHVEMHLRSEHPPLCGRDHLSF 1163

Query: 980  RSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            RS+         KN +DGDL E F  +   +   +S+ ++ +  E+ K++E++
Sbjct: 1164 RSY-----YFPVKNVIDGDLCEQFNSMEPNKQKNVSEELDRTPPEVSKKLEDI 1211


>gi|302831461|ref|XP_002947296.1| hypothetical protein VOLCADRAFT_73165 [Volvox carteri f. nagariensis]
 gi|300267703|gb|EFJ51886.1| hypothetical protein VOLCADRAFT_73165 [Volvox carteri f. nagariensis]
          Length = 1221

 Score =  211 bits (536), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 248/1050 (23%), Positives = 432/1050 (41%), Gaps = 191/1050 (18%)

Query: 136  RHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPL---CGVLIIGEETIVYCSANA 192
            +H+  YE+ L   + +   W++  +DNGA+LL+ VP       GVL+  E  I+Y + + 
Sbjct: 205  KHLTFYEMDLGLNNVLR-KWTEP-IDNGANLLVAVPGGADGPGGVLVCAENFIIYKNQDH 262

Query: 193  FKAIPIRPSITKAYGRVDADGSR-----------------YLLGDHAGLLHLLVITHEKE 235
             +   + P       R D  G R                 +L+    G ++ + + +E E
Sbjct: 263  EEVRAVIPR------RSDLPGDRGVLIVSYATHKKKAYSFFLVQSEYGDIYKVTLAYEGE 316

Query: 236  KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKL------------------- 276
             VT LKI+        ++I+ L    ++  S YG+  L +                    
Sbjct: 317  AVTELKIKYFDTIPPCTSIAVLKTGFLFAASEYGNHALYQFVGTGEDDEDVESSSAALVQ 376

Query: 277  ---NLQP----DAKGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRI 329
                 QP          + +++   +L PI D  V +L  +   Q+    G     SL +
Sbjct: 377  TEEGFQPVFFEPRPLKNLLLIDEMASLMPITDMKVANLLNEEIPQIYALCGHGPRASLSV 436

Query: 330  VRNGIGINEQASVELQGIK-GMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETE 388
            +R G+ + E A   L G    +W++R +  D FD F+VVSF + T  L  ++ +E++ET 
Sbjct: 437  LRPGLAVTELAVSPLPGAPTAVWTVRRNATDEFDAFIVVSFANAT--LVFSIGEEVKETN 494

Query: 389  IEGFCSQTQTLFCHDAIYNQLVQVT----------SGSVRLVSSTSRELRNEWKSPPGYS 438
              GF     TL       N ++Q T           G +R +    R   NEWK P   +
Sbjct: 495  ESGFLGTVPTLHTQLLADNSMLQATFFDLSEPQVYPGGLRHIKPDRR--INEWKVPGRRN 552

Query: 439  VNVATANASQVLLATGGGHLVY--LEIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQ 496
            +  A +N  QV +A  GG +    LE+G G+L E +   L  ++SCL+I P+ E    S 
Sbjct: 553  IKAAASNEKQVAIALQGGEVTVFELEVGTGLLVEAEKRDLGEDVSCLEIPPVPEGLVRSS 612

Query: 497  IAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPRSVLLC---------AFEGISYLLC 546
              AVG + D +VR+ SL P   L          +P S+L+          A EG  +L  
Sbjct: 613  FLAVGTY-DQTVRVLSLDPGQGLKNLSMQALNSVPESILMLYNTGPGTERATEGGLFLHV 671

Query: 547  ALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNK 606
             L +G L+   ++  +G+LTD +   LGT+P  L   + +    + A S RP + Y+   
Sbjct: 672  GLQNGILIRSEVDRISGQLTDSRTRFLGTKPPKLFAAAVRGNRCMLALSSRPWLGYNDQG 731

Query: 607  KLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDI-QKLHIRSIPLGEHPRR 665
            +   S ++ + + +   F S   P+      +  L I  ++++    + ++  L   PRR
Sbjct: 732  RFNLSPLSYEALDYASGFASDQCPEGFVAVVKSTLRILAVENVGDAFNTQACRLRYTPRR 791

Query: 666  ICHQEQSRTFAICSLKNQSC---------------AEES----------------EMHF- 693
            +    +++   I    N +                AEE                 E  F 
Sbjct: 792  LLIHPETKLLMIAESDNAAIPLAEREDLQAKLAALAEEEGAPVQGVEFNDELAALEEQFG 851

Query: 694  ------------VRLLDDQTFEFISTYPLDTFEYGCSILSCSFSD-------------DS 728
                        +RL+D  T   +    +D  E   +I+S +  D               
Sbjct: 852  APKGQPGQWAGCLRLVDPATLSTVFVTEVDNNE---AIVSMALVDLVLPAGPGFGGTEKL 908

Query: 729  NVYYCVGTAYVLPEENEPTKGRILVFIVEDG--KLQLIAEKETKGAVY-SLNAFNGKLLA 785
             V  C      +P + E     I V+ + DG  +L+L+ +    G V  +L  F G+LLA
Sbjct: 909  LVVGCAKGLRYMPTDCE--AAYIRVYRLADGGKRLELVHKTIVDGGVPGALCGFKGRLLA 966

Query: 786  AINQKIQLYKWMLRDDGTRELQSECGHH---GHILALYVQTRGDFIVVGDLMKSISLLIY 842
             +   ++LY     D G ++L  +C ++     I+ + VQ  G  I VGD  +S+ ++ Y
Sbjct: 967  GVGPTLRLY-----DMGKKKLLRKCEYNRLPHQIMNITVQ--GPRIYVGDAQESVHMMRY 1019

Query: 843  KHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTV----RKNSEGATDEERG- 897
            K  + A    A D    +++ +  LD D  L A + F  F V    R+ S+   D+  G 
Sbjct: 1020 KKADNAFYIFADDIAPRYLTTILPLDYDT-LAAGDKFGNFVVLRLPREASQQVEDDPTGG 1078

Query: 898  --------------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVI 943
                          +LE V ++H+G+ +   +   +         G    +++ TV G I
Sbjct: 1079 KMAAASGRLNGAPHKLEEVVKFHVGDTITSLQRAEM-------QAGGQEVLLYSTVMGAI 1131

Query: 944  GVIASLPH-EQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIES 1002
            GV+    + E   F   L+ +LR+    + G +H  +RS          ++ +DGDL   
Sbjct: 1132 GVLYPFTNREDVDFFGHLEMHLRQEHPPLCGRDHLSFRS-----AYFPVRSCVDGDLCGQ 1186

Query: 1003 FLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            +  +   +   I++ M+ +  E+ K++E++
Sbjct: 1187 YASIPAKKQQMIAEAMDRTPGEMLKKLEDI 1216


>gi|261196131|ref|XP_002624469.1| splicing factor 3B subunit 3 [Ajellomyces dermatitidis SLH14081]
 gi|239587602|gb|EEQ70245.1| splicing factor 3B subunit 3 [Ajellomyces dermatitidis SLH14081]
 gi|239614558|gb|EEQ91545.1| splicing factor 3B subunit 3 [Ajellomyces dermatitidis ER-3]
 gi|327356791|gb|EGE85648.1| pre-mRNA-splicing factor rse1 [Ajellomyces dermatitidis ATCC 18188]
          Length = 1209

 Score =  210 bits (535), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 264/1172 (22%), Positives = 486/1172 (41%), Gaps = 177/1172 (15%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
            ++G I TL  FR  G ++D++ I ++  +  ++++   S     R   +   + G R   
Sbjct: 64   VFGIIRTLAAFRLAGSSKDYIIIGSDSGRITIIEY-VPSQNRFNRIHLETFGKSGIRRVI 122

Query: 65   NGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YG 119
             GQ   +DP  R  LI     + L  V+  + + +L  +  +   + Q L         G
Sbjct: 123  PGQYLAVDPKGRACLIASVEKNKLVYVLNRNTQAELTISSPLEAHKPQTLVFALAALDVG 182

Query: 120  CAKPTIVVLYQDNKDA-------------RHVKTYEVALKDKDFVEGPWSQNNLDNGADL 166
               P    L  +  +A             +H+  YE+ L     V   WS + +D  A +
Sbjct: 183  YDNPIFAALEVEYAEADQDPTGQAYEELEKHLVYYELDLGLNHVVRK-WS-DPVDRSATM 240

Query: 167  LIPVPPPL---CGVLIIGEETIVYCSAN--AFKA-IPIRPSITKAYGRVDADGSRYLLGD 220
            L  VP       G L+  E+ I Y  +N  AF+  IP R   T+     + +  RY+   
Sbjct: 241  LFQVPGGADGPSGCLVCAEDNITYRHSNQDAFRVPIPRRSGPTE-----NPERKRYIT-- 293

Query: 221  HAGLLHLL---------------------VITHEKEKVTG----LKIELLGETSIASTIS 255
             AG++H +                     ++  +  ++TG    LK++      IAS++ 
Sbjct: 294  -AGVMHKMRGAFFFLLQTEDGDLFKVTIDMVEDDNGQITGEVRRLKLKYFDTVPIASSLC 352

Query: 256  YLDNAVVYIGSSYGDSQLIKLNL--------------------QPDAKGSY-------VE 288
             L +  +++ S  G+    +                       +P A   +       V 
Sbjct: 353  ILKSGFLFVASETGNHHFYQFEKLGDDDDEIEFTSDSYSADPSEPLAPAYFRPRPYENVN 412

Query: 289  VLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI- 347
            ++E   +L P++D  V +L      Q+ T  G     S R +++G+ ++E    EL  + 
Sbjct: 413  LVESINSLNPLMDCKVANLTDDDAPQIYTICGTGARSSFRTLKHGLEVSEIVESELPSVP 472

Query: 348  KGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYN 407
              +W+ + + +D FD ++++SF + T +L++   + +EE    GF S   TL       +
Sbjct: 473  SAVWTTKLTREDEFDAYIILSFSNGTLVLSIG--ETVEEVTDTGFLSSAPTLAVQQLGED 530

Query: 408  QLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DG 466
             L+QV    +R + +  R   NEW +P   S+  AT N  QV +A   G +VY E+  DG
Sbjct: 531  SLIQVHPKGIRHIHADRR--VNEWPAPQHRSIVAATTNERQVAVALSSGEIVYFEMDTDG 588

Query: 467  ILTEV-KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHL 524
             L E  +  ++   ++CL +  + +  + S   AVG   D +VRI SL PD  L  K   
Sbjct: 589  SLAEYDEKREMSGTVTCLSLGEVPKGRARSSFLAVGC-DDSTVRILSLDPDSTLENKSVQ 647

Query: 525  GGEIIPRSVLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPI 578
                 P ++ + A    +      YL   L  G  L  +L+  TGEL+D +   LG +P+
Sbjct: 648  ALTSAPSALSIMAMSDSTSGGSTLYLHIGLYSGIYLRTVLDEVTGELSDTRTRFLGLKPV 707

Query: 579  TLRTFSSKNTTHVFAASDRPTVIYS--SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIA 636
             L   S K    + A S RP + YS    K  + + ++   +     F+S    + +   
Sbjct: 708  KLFQVSVKEQKAILALSSRPWLGYSDLQTKAFMLTPLDYVGLEWGWNFSSEQCIEGMVGI 767

Query: 637  KEGELTIGTIDDIQ-KLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVR 695
            +   L I +++ ++  L   +IPL   PR      +   F +    N   +  +    + 
Sbjct: 768  QGQNLRIFSLEKLENNLLQETIPLQYTPRHFIKHPEHPLFYVIEADNNILSPGTRTKLLN 827

Query: 696  ----------LLDDQTFEF----------------------ISTYPLDTFEYGCSILSCS 723
                      +L  + F +                      IS   L+  E   S+ + +
Sbjct: 828  DSGAVNGDAMVLPPEEFGYPRGTGHWASCIQIVDPVNSQSVISQIELEENEAAVSVAAVT 887

Query: 724  FSD-DSNVYYCVGTAY-VLPEENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVYSLNAF 779
            FS  D   +  VGT   ++      + G I V+   E+GK L+ I + + +    +L  F
Sbjct: 888  FSSQDDETFLVVGTGKDMVVNPRSSSAGFIYVYRFQEEGKELEFIHKTKVEQPPMALLGF 947

Query: 780  NGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISL 839
             G+LLA I   +++Y   +R    R+ Q+    H   L + +QT+G  I+V D+ +S++ 
Sbjct: 948  QGRLLAGIGTDVRIYDLGMR-QMLRKCQASVVPH---LVVGLQTQGSRIIVSDVQESVTY 1003

Query: 840  LIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDEER 896
            ++YK++E  +     D  + W +   ++D +   G +   NL+ +R   K SE A ++  
Sbjct: 1004 VVYKYQENRLIPFVDDVISRWTTCTTMVDYETVAGGDKFGNLWLLRCPEKASEEADEDGS 1063

Query: 897  G---------------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNG 941
            G               RL +V  ++  +     +   LV        G    +++  + G
Sbjct: 1064 GAHLIHERQYLQGAPNRLNLVAHFYPQDLPTSIQKAQLV-------TGGRDVLVWTGLQG 1116

Query: 942  VIGV-IASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLI 1000
             + + I  +  E+  F + L+  L      + G +H  +RS+        AK  +DGDL 
Sbjct: 1117 TVAMLIPFVSREEVDFFQSLEMQLSSQNPPLAGRDHLIYRSY-----YAPAKGTIDGDLC 1171

Query: 1001 ESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            E++L L   +  +I+  ++ SV E+ +++ ++
Sbjct: 1172 ETYLLLPNDKKQQIAGELDRSVREIERKIADM 1203


>gi|70992737|ref|XP_751217.1| nuclear mRNA splicing factor [Aspergillus fumigatus Af293]
 gi|74670386|sp|Q4WLI5.1|RSE1_ASPFU RecName: Full=Pre-mRNA-splicing factor rse1
 gi|66848850|gb|EAL89179.1| nuclear mRNA splicing factor, putative [Aspergillus fumigatus Af293]
          Length = 1225

 Score =  210 bits (535), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 273/1166 (23%), Positives = 477/1166 (40%), Gaps = 171/1166 (14%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
            ++G I TL  FR  G ++D++ I ++  +  ++++   S     R   +   + G R   
Sbjct: 64   VFGIIRTLAAFRLAGSSKDYIIIGSDSGRITIIEY-VPSQNRFNRIHLETFGKSGVRRVV 122

Query: 65   NGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YG 119
             GQ   +DP  R  LI     + L  V+  +++ +L  +  +   + Q +         G
Sbjct: 123  PGQYLAVDPKGRACLIASVEKNKLVYVLNRNSQAELTISSPLEAHKPQTVVFAMTALDVG 182

Query: 120  CAKPTIVVLYQDNKDARHVKT-------------YEVALKDKDFVEGPWSQNNLDNGADL 166
               P    L  D  +A    T             YE+ L     V   W+ + +D  A +
Sbjct: 183  YENPIFAALEVDYSEADQDPTGRAYEESEKLLVYYELDLGLNHVVRK-WA-DPVDRTASM 240

Query: 167  LIPVPPPL---CGVLIIGEETIVYCSAN--AFKA-IPIRPSITKAYGRVDADGSRYLLGD 220
            L  VP       GVL+  E+ I Y  +N  AF+  IP R   T+   R  +  +  +   
Sbjct: 241  LFQVPGGADGPSGVLVCAEDNITYRHSNQDAFRVPIPRRIGATENPERKRSIVAGVMHKM 300

Query: 221  HAGLLHLL-------------VITHEKEKVTG----LKIELLGETSIASTISYLDNAVVY 263
                  LL             ++  +  ++TG    LKI+      IAS++  L +  +Y
Sbjct: 301  RGAFFFLLQTEDGDLFKVTIDMVEDDNGQLTGEVKRLKIKYFDTVPIASSLLILKSGFLY 360

Query: 264  IGSSYGDSQLIKLN----------------------------LQPDAKGSYVEVLERYVN 295
            + S  G+    +                               QP      + ++E   +
Sbjct: 361  VASEAGNHHFYQFEKLGDDDEETEFNSENFPADLSVPCEPVFFQPRG-AENLNLVETLNS 419

Query: 296  LGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSLR 354
            L P++D  +V+L      Q+ T SG+    S R +++G+ ++E    EL  +   +W+ +
Sbjct: 420  LNPLIDSKIVNLNEDDAPQIYTVSGSGARSSFRTLKHGLEVSEIVESELPSVPSAVWTTK 479

Query: 355  SSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTS 414
             +  D FD ++++SF + T +L++   + +EE    GF S   TL       + L+QV  
Sbjct: 480  LTRADEFDAYIILSFANGTLVLSIG--ETVEEVTDTGFLSTAPTLAVQQLGEDSLIQVHP 537

Query: 415  GSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTEV-K 472
              +R + +  R   NEW +P   S+  A  N  QV +A   G +VY E+  DG L E  +
Sbjct: 538  RGIRHILADRR--VNEWPAPQHRSIVAAATNERQVAVALSSGEIVYFEMDADGTLAEYDE 595

Query: 473  HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPR 531
              Q+   ++CL +  + E    S   AVG   D +VRI SL PD  L  K        P 
Sbjct: 596  RRQMSGTVTCLSLGEVPEGRVRSSFLAVGC-DDSTVRILSLDPDSTLENKSVQALTSAPS 654

Query: 532  SVLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSS 585
            ++ + +    S      YL   L  G  L  +L+  TGEL+D +   LG +P+ L   S 
Sbjct: 655  ALNIMSMADSSSGGTTLYLHIGLYSGVYLRTVLDEVTGELSDTRTRFLGAKPVKLFRVSV 714

Query: 586  KNTTHVFAASDRPTVIYS--SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTI 643
            K  T V A S RP + YS    K  + + ++   +     F+S    + +   +   L I
Sbjct: 715  KGQTAVLALSSRPWLGYSDIQTKGFMLTPLDYVGLEWGWNFSSEQCVEGMVGIQAQNLRI 774

Query: 644  GTIDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVR------- 695
             +I+ +   +   SIPL   PRR+    +   F +    N   +  +    +        
Sbjct: 775  FSIEKLDNNILQESIPLSNTPRRMLKHPEQPLFYVIESDNNVLSPATRARLIEDSKARNG 834

Query: 696  ---LLDDQTFEF----------------------ISTYPLDTFEYGCSILSCSFSD-DSN 729
               +L  + F +                      IST  L+  E   S+ +  FS  D  
Sbjct: 835  ETNVLPPEDFGYPRATGHWASCIQIVDPLDAKAVISTIELEENEAAVSMAAVPFSSQDDE 894

Query: 730  VYYCVGTAYVLPEENEPTKGRILVFIV---EDGK-LQLIAEKETKGAVYSLNAFNGKLLA 785
             +  VGTA  +   N P+     + I    EDGK L+ I + + +    +L  F G+LLA
Sbjct: 895  TFLVVGTAKDM-IVNPPSSAGGFIHIYRFQEDGKELEFIHKTKVEEPPLALLGFQGRLLA 953

Query: 786  AINQKIQLYKWMLRDDGTRELQSECGHHGHILALYV---QTRGDFIVVGDLMKSISLLIY 842
             I   +++Y     D G ++L  +C     +++  +   QT+G  IVV D+ +S++ ++Y
Sbjct: 954  GIGSTLRIY-----DLGMKQLLRKC--QAQVVSKTIVGLQTQGSRIVVSDVRESVTYVVY 1006

Query: 843  KHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDE----- 894
            K+++  +     D  + W ++  ++D +   G +   NL+ VR   K SE A ++     
Sbjct: 1007 KYQDNILIPFVDDSVSRWTTSTTMVDYETVAGGDKFGNLWLVRCPKKASEEADEDGSGAH 1066

Query: 895  ---ERG-------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIG 944
               ERG       RL+++   +  +         LV        G    +++    G IG
Sbjct: 1067 LIHERGYLHGAPNRLDLMIHTYTQDIPTSLHKTQLV-------AGGRDILVWTGFQGTIG 1119

Query: 945  VIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESF 1003
            ++   +  E   F + L+  L      + G +H  +RS+    K V     +DGDL E +
Sbjct: 1120 MLVPFVSREDVDFFQNLEMQLASQCPPLAGRDHLIYRSYYAPVKGV-----IDGDLCEMY 1174

Query: 1004 LDLSRTRMDEISKTMNVSVEELCKRV 1029
              L       I+  ++ SV E+ +++
Sbjct: 1175 FLLPNDTKMMIAAELDRSVREIERKI 1200


>gi|312072035|ref|XP_003138882.1| hypothetical protein LOAG_03297 [Loa loa]
 gi|307765956|gb|EFO25190.1| hypothetical protein LOAG_03297 [Loa loa]
          Length = 1197

 Score =  210 bits (535), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 256/1164 (21%), Positives = 495/1164 (42%), Gaps = 170/1164 (14%)

Query: 7    YGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTDN 65
            +G I +L  FR  G  +D++ + ++  +  +L+++ +      R   +   + G R    
Sbjct: 60   FGIIRSLLAFRLTGGTKDYIVVGSDAGRIVILEYNTQKV-CFERVHQETFGKTGCRRIVP 118

Query: 66   GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YGC 120
            GQ   IDP  R  LIG      L  ++  D    L  +  +   +   +    +    G 
Sbjct: 119  GQFLAIDPKGRAILIGAVERQKLVYIMNRDASANLTISSPLEAHKSHCICYSVVGVDVGF 178

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALK------------DKDFVEGPWSQNNLDNGADLLI 168
              PT   L  D ++  H  T  +A K              + V   +++  +D G ++LI
Sbjct: 179  ENPTFACLEVDYEEVDHDSTGHLATKIPQTLTFYELDLGLNHVVRKYAEPLVDKG-NILI 237

Query: 169  PVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR-------- 215
             VP     P   GV++  E  +VY   N      I+  + +    +D D  R        
Sbjct: 238  SVPGGQDGPS--GVIVCCENYLVY--KNLGDQPDIKCPVPRRRNELD-DCDRTVIIVCAA 292

Query: 216  ---------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGS 266
                     +L+    G +  + +  E + VT LKI+      +++ +  L    ++  S
Sbjct: 293  THKTKLMYFFLVQTDQGDIFKVTLESEHDIVTELKIKYFDTIPVSNAMCILKTGFLFTAS 352

Query: 267  SYGDSQLIKL--------------NLQPDAKGSY---------VEVLERYVNLGPIVDFC 303
             +G+  L ++               +Q +   ++         + V+++  +L P++   
Sbjct: 353  EFGNHHLYQIAHLGDEDDEPEFSSRMQLEEGETFFFAPRGLTNLAVVDQMDSLSPLISSY 412

Query: 304  VVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTDDPFD 362
            + DL  +   Q+ T  G     +++++RNG+ + E A  EL G    +W+++ + DD FD
Sbjct: 413  IDDLANEDSPQIYTLIGRGALSAVKVLRNGLEVTEMAVSELPGNPNAVWTVKRNIDDKFD 472

Query: 363  TFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSS 422
            + +VVSF++ T +L++   + +EE    GF   T TL C     + L+QV    +R + +
Sbjct: 473  SHIVVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLGCALIGDDALLQVYPDGIRHIRA 530

Query: 423  TSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKHAQLEYEI 480
              R   NEWK+P   ++     N  QV +A  GG LVY E+   G L E  +  +L  ++
Sbjct: 531  DRRV--NEWKAPGKRTIMKCALNRRQVAIALAGGELVYFELDVTGQLNEYTERRELPADV 588

Query: 481  SCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPD---LNLITKEHLGGEIIPRSVLLCA 537
             C+ ++ I E    S+   VG+  D +VRI SL     L+ ++ + L  E  P S+++  
Sbjct: 589  LCMSLSEIPEGELRSRFLTVGL-ADKTVRIISLDPQDCLSPLSMQALPSE--PESIIVLE 645

Query: 538  FEGIS-------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTH 590
              G         +L   L +G LL   ++  TGELTD +   LGT+ + L     ++   
Sbjct: 646  MFGTETQSASTVHLNIGLQNGCLLRTTVDQVTGELTDNRTRYLGTKSVKLFHVRIQSKDA 705

Query: 591  VFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQ 650
            + AAS R  ++Y    +   + ++   +     F+S   P+ +    E  L I +++ + 
Sbjct: 706  IMAASSRAWLLYDYQSRFHLTPLSYAALEFAAGFSSEQCPEGIVAIAENTLRILSLEKLG 765

Query: 651  KLHIRSI-PLGEHPRR-ICHQ---------------------EQSRTFAICSLKNQSCAE 687
             +    + PL   PRR + H+                     E+ +  A   ++    AE
Sbjct: 766  AVFNHVVHPLDYTPRRMVVHKASGNLIIIENDHAAFTVKGKMERRKQLADELMEVAKEAE 825

Query: 688  ESEMHFVRLLDDQTFEFIST--------------YPLDTFEYGCSILSCSFSDDSNVYYC 733
            E++   V+ + D   ++ ST              +P    E   +I    F + S+  + 
Sbjct: 826  EADQQAVKEMADAIRKWASTVRVMRSNDGETLSHFPFAEDEAAFAIAMVQFQNQSDTQFV 885

Query: 734  -VGTAYVLP-EENEPTKGRILVFIVEDG--KLQLIAEKETKGAVYSLNAFNGKLLAAINQ 789
             VG    L  +  +   G I  F++      LQL+    T   V +++ F G  LA + +
Sbjct: 886  LVGCGCELQLKPRKANGGCIYTFLLAANGTTLQLLHRTATDEVVNAIHDFRGMALAGVGK 945

Query: 790  KIQLYKWMLRDDGTRELQSECGHHG-HILALYVQTRGDFIVVGDLMKSISLLIYKHEEGA 848
            K++LY     D G R+L ++C +       + +++ G  IVV D  +S+  + YK ++G 
Sbjct: 946  KVRLY-----DLGKRKLLAKCENRQIPTQVVDIRSMGQRIVVSDSQESVHFMRYKKQDGQ 1000

Query: 849  IEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDE------------ER 896
            +     + +  +++ V +LD D  +   + F    V +  +G T+E            +R
Sbjct: 1001 LSIFCDETSPRYVTCVCLLDYDT-VAVGDRFGNIAVLRLPKGVTEEVQEDPTGVRALWDR 1059

Query: 897  G-------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIAS- 948
            G       +LE +   ++G+ +   +  SLV        G    + + T++G+IG++   
Sbjct: 1060 GNLNGASQKLEAIAHLYIGDAITSMQKTSLV-------PGANDCLCYTTISGIIGILVPF 1112

Query: 949  LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSR 1008
            +  +++ F + L+ ++R     + G +H  +RS+         K+ +DGDL E +  +  
Sbjct: 1113 MSRDEFEFFQNLEMHMRVEYPPLCGRDHLAYRSY-----YFPVKSIIDGDLCEQYSLMPL 1167

Query: 1009 TRMDEISKTMNVSVEELCKRVEEL 1032
             +   + + +     E+ K++E++
Sbjct: 1168 DKQKSVGEELGRKSTEIHKKLEDI 1191


>gi|340959394|gb|EGS20575.1| putative pre-mRNA splicing protein [Chaetomium thermophilum var.
            thermophilum DSM 1495]
          Length = 1213

 Score =  210 bits (535), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 267/1169 (22%), Positives = 476/1169 (40%), Gaps = 168/1169 (14%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
            I+G I  +  FR  G  +D++ +AT+  +  ++++  + +      +        R    
Sbjct: 65   IFGIIRAMAAFRLAGSHKDYIILATDSGRIAIIEYLPKENRFQRIHLETFGKSGVRRVIP 124

Query: 66   GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YGC 120
            GQ    DP  R  LI     + L  V+  + + +L  +  +   +  V+ +  +    G 
Sbjct: 125  GQYLAADPKGRACLIASVEKNKLVYVLNRNAQAELTISSPLEAHKPGVIVLSLVALDVGY 184

Query: 121  AKPTIVVLYQDNKDARHVKT-------------YEVALKDKDFVEGPWSQNNLDNGADLL 167
            + P    L  +  +A    T             YE+ L     V   WS + +D  + LL
Sbjct: 185  SNPVFAALEYEYSEADQDPTGQAAKQLEMQLVYYELDLGLNHVVRK-WS-DTVDPTSSLL 242

Query: 168  IPVP-----PPLCGVLIIGEETIVYCSAN--AFK-AIPIRPSITK--AYGRVDADGSRYL 217
              VP     P   GVL+ GEE I Y  +N  AF+  IP R   T+     R    G  + 
Sbjct: 243  FQVPGGNDGP--SGVLVCGEENITYRHSNQEAFRVPIPRRRGATEDPNRKRTIVAGVMHK 300

Query: 218  LGDHAGLLHLLVITH--------------EKEKVTG----LKIELLGETSIASTISYLDN 259
            L   AG    L+ T               EK   TG    +KI+      IA ++  L +
Sbjct: 301  LKGSAGAFFFLLQTEDGDLFKVTIDMVEDEKGNPTGEVKRVKIKYFDTVPIAHSLCILKS 360

Query: 260  AVVYIGSSYGDSQLIKL---------------NLQPDAKGSYVEV------LERYV---- 294
              +++ S +G+    +                +   D    Y  V      LE  V    
Sbjct: 361  GFLFVASEFGNHHFYQFEKLGDDDDEPEFTSDDFPADWNAPYNPVYFKPRPLENLVLVES 420

Query: 295  --NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMW 351
              ++ P+V   V +L  +   Q+    G     S R++++G+ ++E  + EL G    +W
Sbjct: 421  IDSMNPLVGCKVANLTGEDAPQIYAICGNGARSSFRMLKHGLEVSEIVASELPGTPSAVW 480

Query: 352  SLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQ 411
            + + +  D +D ++V+SF + T +L++   + +EE    GF +   TL         L+Q
Sbjct: 481  TTKLTKYDEYDAYIVLSFTNATLVLSIG--ETVEEVSDSGFLTTVPTLAVQQMGEEGLIQ 538

Query: 412  VTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTE 470
            +    +R +        NEW +P   S+  AT N +QV++A   G +VY E+  DG L E
Sbjct: 539  IHPKGIRHIVQGRV---NEWPAPQHRSIVAATTNENQVVIALSSGEIVYFEMDADGSLAE 595

Query: 471  V-KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEI 528
              +  Q+   ++ L +  + E    S   AVG   D +VRI SL P+  L  K       
Sbjct: 596  YDEKKQMSGTVTSLSLGKVPEGLRRSSFLAVGC-DDCTVRILSLDPESTLEMKSIQALTA 654

Query: 529  IPRSVLLCAFEGIS-----YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTF 583
             P S+L+ + E  +     YL   L  G  L  +L+  TGELTD ++  LG +P  L   
Sbjct: 655  APSSLLIMSMEDSTGGTTLYLHIGLHSGVYLRTVLDEITGELTDTRQKFLGPKPTKLFQV 714

Query: 584  SSKNTTHVFAASDRPTVIYSS--NKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGEL 641
            + +N T V A S RP + Y++   +  + + ++  E+ +   FNS    + +       L
Sbjct: 715  TVQNQTCVLALSSRPWLGYTAPITRNFVMTPLSYTELGYTWSFNSEQCQEGMVGIHANYL 774

Query: 642  TIGTIDDIQKLHI-RSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEE------------ 688
             I TI+ + +  I +S PL   P+R+    +   F +    N +   E            
Sbjct: 775  RIFTIEKLGQTMIQKSCPLTYTPKRLVKHPEQPYFYVIEADNNTLPPELRAQLLEQSGAV 834

Query: 689  ----------------------SEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSF-S 725
                                  S +  V  + ++    +    L+  E   S     F S
Sbjct: 835  NGDATVLPPEDFGYPKARGRWASCIEIVDPVSEEQPRVLKRIELEGNEAAVSAAVVPFAS 894

Query: 726  DDSNVYYCVGTAY-VLPEENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVYSLNAFNGK 782
             D   +  VGT   ++      T+G I V+  ++DG+ L+ I +   +    +   F G+
Sbjct: 895  QDGESFLIVGTGKDMVLNPRASTEGAIHVYRFIDDGRDLEFIHKTIIEEPPLAFCPFQGR 954

Query: 783  LLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIY 842
            LLA I + +++Y   L+    R+ Q+E       L + + TR + IVVGD+   ++ ++Y
Sbjct: 955  LLAGIGKMLRIYDLGLK-QLLRKAQAEVSPQ---LIVSLDTRHNRIVVGDVQHGMTYVVY 1010

Query: 843  KHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEER------ 896
            K +   +   A D  A W +   ++D +   G +   NL+ VR     + + +       
Sbjct: 1011 KPDSNKLIPFADDTIARWTTCTTMVDYESVAGGDKFGNLWIVRCPERASLESDEPGSEVQ 1070

Query: 897  ------------GRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIG 944
                         RL+++  ++  +        +LV       VG    +++  + G +G
Sbjct: 1071 LLHARPYLHGAPNRLDLMAHFYPQDLPTSICKTNLV-------VGGQDVLVWSGIQGTVG 1123

Query: 945  V-IASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESF 1003
            V I  +  E   F + L++++R     + G +H  +R +      V  K  +DGDL E F
Sbjct: 1124 VLIPFVTREDADFFQNLESHMRAEDPPLAGRDHLIYRGY-----YVPVKGVIDGDLCERF 1178

Query: 1004 LDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
              L   +   I+  ++ SV E+ +++ ++
Sbjct: 1179 TLLPNDKKQMIAGELDRSVREIERKISDI 1207


>gi|159130328|gb|EDP55441.1| nuclear mRNA splicing factor, putative [Aspergillus fumigatus A1163]
          Length = 1225

 Score =  210 bits (534), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 274/1165 (23%), Positives = 480/1165 (41%), Gaps = 169/1165 (14%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
            ++G I TL  FR  G ++D++ I ++  +  ++++   S     R   +   + G R   
Sbjct: 64   VFGIIRTLAAFRLAGSSKDYIIIGSDSGRITIIEY-VPSQNRFNRIHLETFGKSGVRRVV 122

Query: 65   NGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YG 119
             GQ   +DP  R  LI     + L  V+  +++ +L  +  +   + Q +         G
Sbjct: 123  PGQYLAVDPKGRACLIASVEKNKLVYVLNRNSQAELTISSPLEAHKPQTVVFAMTALDVG 182

Query: 120  CAKPTIVVLYQDNKDARHVKT-------------YEVALKDKDFVEGPWSQNNLDNGADL 166
               P    L  D  +A    T             YE+ L     V   W+ + +D  A +
Sbjct: 183  YENPIFAALEVDYSEADQDPTGRAYEESEKLLVYYELDLGLNHVVRK-WA-DPVDRTASM 240

Query: 167  LIPVPPPL---CGVLIIGEETIVYCSAN--AFKA-IPIRPSITKAYGRVDADGSRYLLGD 220
            L  VP       GVL+  E+ I Y  +N  AF+  IP R   T+   R  +  +  +   
Sbjct: 241  LFQVPGGADGPSGVLVCAEDNITYRHSNQDAFRVPIPRRIGATENPERKRSIVAGVMHKM 300

Query: 221  HAGLLHLL-------------VITHEKEKVTG----LKIELLGETSIASTISYLDNAVVY 263
                  LL             ++  +  ++TG    LKI+      IAS++  L +  +Y
Sbjct: 301  RGAFFFLLQTEDGDLFKVTIDMVEDDNGQLTGEVKRLKIKYFDTVPIASSLLILKSGFLY 360

Query: 264  IGSSYGDSQLIKL---------------NLQPD-----------AKGS-YVEVLERYVNL 296
            + S  G+    +                N   D            +G+  + ++E   +L
Sbjct: 361  VASEAGNHHFYQFEKLGDDDEETEFNSENFPADLSVPCEPVFFRPRGAENLNLVETLNSL 420

Query: 297  GPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSLRS 355
             P++D  +V+L      Q+ T SG+    S R +++G+ ++E    EL  +   +W+ + 
Sbjct: 421  NPLIDSKIVNLNEDDAPQIYTVSGSGARSSFRTLKHGLEVSEIVESELPSVPSAVWTTKL 480

Query: 356  STDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSG 415
            +  D FD ++++SF + T +L++   + +EE    GF S   TL       + L+QV   
Sbjct: 481  TRADEFDAYIILSFANGTLVLSIG--ETVEEVTDTGFLSTAPTLAVQQLGEDSLIQVHPR 538

Query: 416  SVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTEV-KH 473
             +R + +  R   NEW +P   S+  A  N  QV +A   G +VY E+  DG L E  + 
Sbjct: 539  GIRHILADRR--VNEWPAPQHRSIVAAATNERQVAVALSSGEIVYFEMDADGTLAEYDER 596

Query: 474  AQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPRS 532
             Q+   ++CL +  + E    S   AVG   D +VRI SL PD  L  K        P +
Sbjct: 597  RQMSGTVTCLSLGEVPEGRVRSSFLAVGC-DDSTVRILSLDPDSTLENKSVQALTSAPSA 655

Query: 533  VLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSK 586
            + + +    S      YL   L  G  L  +L+  TGEL+D +   LG +P+ L   S K
Sbjct: 656  LNIMSMADSSSGGTTLYLHIGLYSGVYLRTVLDEVTGELSDTRTRFLGAKPVKLFRVSVK 715

Query: 587  NTTHVFAASDRPTVIYS--SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIG 644
              T V A S RP + YS    K  + + ++   +     F+S    + +   +   L I 
Sbjct: 716  GQTAVLALSSRPWLGYSDIQTKGFMLTPLDYVGLEWGWNFSSEQCVEGMVGIQAQNLRIF 775

Query: 645  TIDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVR-------- 695
            +I+ +   +   SIPL   PRR+    +   F +    N   +  +    +         
Sbjct: 776  SIEKLDNNILQESIPLSNTPRRMLKHPEQPLFYVIESDNNVLSPATRARLIEDSKARNGE 835

Query: 696  --LLDDQTFEF----------------------ISTYPLDTFEYGCSILSCSFSD-DSNV 730
              +L  + F +                      IST  L+  E   S+ +  FS  D   
Sbjct: 836  TNVLPPEDFGYPRATGHWASCIQIVDPLDAKAVISTIELEENEAAVSMAAVPFSSQDDET 895

Query: 731  YYCVGTAYVLPEENEPTKGRILVFIV---EDGK-LQLIAEKETKGAVYSLNAFNGKLLAA 786
            +  VGTA  +   N P+     + I    EDGK L+ I + + +    +L  F G+LLA 
Sbjct: 896  FLVVGTAKDM-IVNPPSSAGGFIHIYRFQEDGKELEFIHKTKVEEPPLALLGFQGRLLAG 954

Query: 787  INQKIQLYKWMLRDDGTRELQSECGHHGHILALYV---QTRGDFIVVGDLMKSISLLIYK 843
            I   +++Y     D G ++L  +C     +++  +   QT+G  IVV D+ +S++ ++YK
Sbjct: 955  IGSTLRIY-----DLGMKQLLRKC--QAQVVSKTIVGLQTQGSRIVVSDVRESVTYVVYK 1007

Query: 844  HEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDE------ 894
            +++  +     D  + W ++  ++D +   G +   NL+ VR   K SE A ++      
Sbjct: 1008 YQDNILIPFVDDSVSRWTTSTTMVDYETVAGGDKFGNLWLVRCPKKASEEADEDGSGAHL 1067

Query: 895  --ERG-------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGV 945
              ERG       RL+++   +  +         LV        G    +++    G IG+
Sbjct: 1068 IHERGYLHGAPNRLDLMIHTYTQDIPTSLHKTQLV-------AGGRDILVWTGFQGTIGM 1120

Query: 946  IAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFL 1004
            +   +  E   F + L+  L      + G +H  +RS+    K V     +DGDL E + 
Sbjct: 1121 LVPFVSREDVDFFQNLEMQLASQCPPLAGRDHLIYRSYYAPVKGV-----IDGDLCEMYF 1175

Query: 1005 DLSRTRMDEISKTMNVSVEELCKRV 1029
             L       I+  ++ SV E+ +++
Sbjct: 1176 LLPNDTKMMIAAELDRSVREIERKI 1200


>gi|145240731|ref|XP_001393012.1| pre-mRNA-splicing factor rse1 [Aspergillus niger CBS 513.88]
 gi|134077536|emb|CAK96680.1| unnamed protein product [Aspergillus niger]
          Length = 1209

 Score =  209 bits (532), Expect = 6e-51,   Method: Compositional matrix adjust.
 Identities = 278/1168 (23%), Positives = 484/1168 (41%), Gaps = 169/1168 (14%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
            ++G I +L  FR  G ++D++ I ++  +  ++++   S     R   +   + G R   
Sbjct: 64   VFGIIRSLAAFRLAGSSKDYIIIGSDSGRITIIEY-VPSQNRFNRIHLETFGKSGVRRVI 122

Query: 65   NGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YG 119
             GQ   +DP  R  +I     + L  V+  +++ +L  +  +   + Q L         G
Sbjct: 123  PGQYLAVDPKGRACIIASVEKNKLVYVLNRNSQAELTISSPLEAHKPQTLVFAMTALDVG 182

Query: 120  CAKPTIVVLYQDNKDA-------------RHVKTYEVALKDKDFVEGPWSQNNLDNGADL 166
               P    L  D  ++             +H+  YE+ L     V   WS + +D  A L
Sbjct: 183  YENPVFAALEVDYSESDQDPTGQAFDELEKHLVYYELDLGLNHVVRK-WS-DPVDRTASL 240

Query: 167  LIPVPPPL---CGVLIIGEETIVYCSAN--AFKA-IPIRP----------SITKAYGRVD 210
            L  VP       GVL+  E+++ Y  +N  AF+  IP R           SIT       
Sbjct: 241  LFQVPGGADGPSGVLVCAEDSVTYRHSNQDAFRVPIPRRSGATENPERKRSITAGVMHKM 300

Query: 211  ADGSRYLLGDHAGLLHLLVITHEKE---KVTG----LKIELLGETSIASTISYLDNAVVY 263
                 +LL    G L  L I   ++   ++TG    LKI+      +AS++  L +  +Y
Sbjct: 301  RGAFFFLLQTEDGDLFKLTIDMVEDDNGQLTGEVKRLKIKYFDTVPLASSLLILKSGFLY 360

Query: 264  IGSSYGDSQLIKLNLQPD--------------------------AKGS-YVEVLERYVNL 296
            + S  G+    +     D                           +G+  + ++E   +L
Sbjct: 361  VASEAGNHHFYQFEKLGDDDEEIEFTSESFSADPSVPCDPIYFRPRGAENLNLVETINSL 420

Query: 297  GPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSLRS 355
             P++D  VV++      Q+ T SGA    S R +++G+ ++E    EL  +   +W+ + 
Sbjct: 421  NPLIDSKVVNITEDDAPQIYTISGAGARSSFRTLKHGLEVSEIVESELPSVPSAVWTTKL 480

Query: 356  STDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSG 415
            +  D FD ++++SF + T +L++   + +EE    GF S   TL       + L+Q+   
Sbjct: 481  TRSDEFDAYIILSFANGTLVLSIG--ETVEEVTDTGFLSTAPTLAVQQLGEDSLIQIHPR 538

Query: 416  SVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTEV-KH 473
             +R + +  R   NEW +P   S+  A  N  QV +A   G +VY E+  DG L E  + 
Sbjct: 539  GIRHILADRR--VNEWPAPQHRSIVAAATNERQVAVALSSGEIVYFEMDSDGSLAEYDER 596

Query: 474  AQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPRS 532
             Q+   ++CL +  + E    S   AVG   D +VRI SL PD  L  K        P +
Sbjct: 597  RQMSGTVTCLSLGEVPEGRVRSSFLAVGC-DDSTVRILSLDPDSTLENKSVQALTAAPSA 655

Query: 533  VLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSK 586
            + + +    S      YL   L  G  L  +L+  TGEL+D +   LG++P+ L   S K
Sbjct: 656  LNIMSMADSSSGGTTLYLHIGLYSGVYLRTVLDEVTGELSDTRTRFLGSKPVKLFQVSVK 715

Query: 587  NTTHVFAASDRPTVIYS--SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIG 644
              T V   S RP + YS    K  + + ++   +     F+S    + +   +   L I 
Sbjct: 716  GQTAVLGLSSRPWLGYSDVQTKGFMLTPLDYVGLEWGWNFSSEQCVEGMVGIQGQNLRIF 775

Query: 645  TIDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVR-------- 695
            +I+ +   +  +SIPL   PRR     +   F +    N   A  +    +         
Sbjct: 776  SIEKLDNNMLQQSIPLSYTPRRFLKHPEQPLFYVIESDNNVLAPSTRAKLLEDSKSRGGD 835

Query: 696  --LLDDQTFEF----------------------ISTYPLDTFEYGCSILSCSF-SDDSNV 730
              +L  + F +                      + T  L+  E   SI +  F S D   
Sbjct: 836  ETVLPPEDFGYPRATGHWASCIQVVDPLDAKAVVHTIELEENEAAISIAAVPFTSQDDET 895

Query: 731  YYCVGTAYVLPEENEP--TKGRILVF-IVEDGK-LQLIAEKETKGAVYSLNAFNGKLLAA 786
            +  VGTA  +   N P    G I ++   EDG+ L+ I + + +    +L  F G+L+A 
Sbjct: 896  FLVVGTAKDM-TVNPPGSAGGYIHIYRFQEDGRELEFIHKTKVEEPPLALLGFQGRLVAG 954

Query: 787  INQKIQLYKWMLRDDGTRELQSECGH---HGHILALYVQTRGDFIVVGDLMKSISLLIYK 843
            I   +++Y     D G ++L  +C        I+ L  QT+G  IVV D+ +S++ ++YK
Sbjct: 955  IGSLLRIY-----DLGMKQLLRKCQAPVVPKTIVGL--QTQGSRIVVSDVRESVTYVVYK 1007

Query: 844  HEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDE------ 894
            ++E  +     D  + W +A  ++D +   G +   NL+ +R   K SE A ++      
Sbjct: 1008 YQENVLIPFVDDSVSRWTTATTMVDYETTAGGDKFGNLWLLRCPKKTSEEADEDGSGAHL 1067

Query: 895  --ERG-------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGV 945
              ERG       RLE++   +  +         LV        G    +++    G IG+
Sbjct: 1068 IHERGYLQGTPNRLELMIHVYTQDIPTSLHKTQLV-------AGGRDILVWTGFQGTIGM 1120

Query: 946  IAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFL 1004
            +   +  E   F + L+  L      + G +H  +RS+    K V     +DGDL E + 
Sbjct: 1121 LVPFIGREDVDFFQNLEMQLAAQHPPLAGRDHLIYRSYYAPVKGV-----IDGDLCEMYF 1175

Query: 1005 DLSRTRMDEISKTMNVSVEELCKRVEEL 1032
             L       I+  ++ SV E+ +++ ++
Sbjct: 1176 LLPNDTKMMIAAELDRSVREIERKISDM 1203


>gi|307109500|gb|EFN57738.1| hypothetical protein CHLNCDRAFT_56079 [Chlorella variabilis]
          Length = 1144

 Score =  209 bits (532), Expect = 7e-51,   Method: Compositional matrix adjust.
 Identities = 269/1152 (23%), Positives = 480/1152 (41%), Gaps = 194/1152 (16%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
            ++G I +L  FR  G  QD++   ++  +  +LQ++ E +   TR   +   + G R   
Sbjct: 57   VFGIIRSLAPFRFPGAQQDYVICGSDSGRIVILQYNKEKN-CFTRIHTETFGKSGCRRIV 115

Query: 65   NGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEE-------LQVLDIK 115
             G+   +DP  R  +I          V+  DN  +L  +  +   +       L  LD+ 
Sbjct: 116  PGEYLAVDPKGRACMIAAVEKSKFVYVLNRDNDARLTISSPLEAHKGSNLCYALTGLDMG 175

Query: 116  FLYGCAKPTIVVLYQDNKDA-------------RHVKTYEVALKDKDFVEGPWSQNNLDN 162
            F      P    +  D  +A             +H+  YE+ L     V   +S+  +DN
Sbjct: 176  F----DNPVFAAIELDYAEADTDPTGEAASEAQKHLTMYELDLGLNHVVR-KYSEP-VDN 229

Query: 163  GADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHA 222
            GA++LIPVP    G    G+  +        + +P        +G +             
Sbjct: 230  GANMLIPVPG--AGDGPGGQRAVWGGPCARAEKLPACMHACSEFGDI------------- 274

Query: 223  GLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLN-LQPD 281
               + + + +E E+V  L+I+       A++I  +    ++  S +GD  L + + L  D
Sbjct: 275  ---YRVTLDYEGEQVKELRIKYFDSIPPATSICLMRKGFLFAASEFGDHALYQFSSLGDD 331

Query: 282  AKG--------------------------SYVEVLERYVNLGPIVDFCVVDLERQGQGQV 315
             +G                          S +E L+R  +L PI+D  V +L  +   Q+
Sbjct: 332  DEGGVESSSATLMETEEGYQPVFFDPRPLSNLEALDRMDSLAPILDMKVANLAGEEIPQI 391

Query: 316  VTCSGAYKDGSLRIVRNGIGI------NEQASVELQG-IKGMWSLRSSTDDPFDTFLVVS 368
                G     +LR++R G+ +      +E A   L G    +W+++ S  D FD ++VVS
Sbjct: 392  YAACGRGSRSTLRVLRPGLAVTGKRAAHEMAVSPLPGNPTAVWTIKRSVGDEFDAYIVVS 451

Query: 369  FISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELR 428
            F + T +L++   + +EE    GF     TL       + ++QV    +R +    R   
Sbjct: 452  FSNATLVLSIG--ETVEEVNDSGFNGNVPTLQTQLLADDSMLQVYPNGLRHIRPDRR--I 507

Query: 429  NEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTEVKHAQLEYEISCLDINP 487
            NEW++P   ++  AT N  QV +A GGG ++Y E+   G+L E +  ++  +++CLD+ P
Sbjct: 508  NEWRAPGRKTIVKATTNERQVAIALGGGEVIYFELNPQGMLVESEKREMGGDVACLDVAP 567

Query: 488  IGENPSYSQIAAVGMWTDISVRIFSL-PD--LNLITKEHLGGEIIPRSVLLC-------- 536
            + E  +     AVGM+ D + R+ SL PD  L +++ + +G    P SVLL         
Sbjct: 568  VPEGRTRCAFLAVGMY-DGAARVLSLQPDSTLKVLSTQAVGA--TPESVLLLDSPLMGKD 624

Query: 537  -----AFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHV 591
                 A  G  +L   L +G LL   ++  TG+L+D +   LGT+P  L  F++      
Sbjct: 625  GTEEGAGSGALFLQVGLVNGVLLRTEVDRVTGQLSDTRTRFLGTKPPKL--FAAA-VPEG 681

Query: 592  FAASDRPTVIYSSNKKL--LYSNVNLKEVSHMCPFNSAAFPD----SLAIAKEGELTIGT 645
            F A  R T+   + ++L   ++  +L+      P      PD    ++A A    + +  
Sbjct: 682  FVAVARNTLRVITLERLGEFFNQQSLR--LRYTPRKFVIHPDLKVLAIAEADHAAIPLAQ 739

Query: 646  IDDIQKLHIRSIPLGEHPRRICHQEQSRTFAIC------SLKNQSCAEESE--------- 690
             +D+Q+            RR   + + +  A         +  ++ A E +         
Sbjct: 740  REDLQQ------------RRDGMEAEQQGGATAAAAAGPEMDEEAAAREDQWGAPKGEPG 787

Query: 691  --MHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSD--DSNVYYCVGTAYVL---PEE 743
                 VR++D  + +      ++  E   S+    F    +      VGTA  L   P+E
Sbjct: 788  QWASCVRIVDPVSLQTTHCIEMEDNEAALSVCLVEFDSHPEHGTLLAVGTAQGLKFYPKE 847

Query: 744  NEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLY----KWMLR 799
             +     +  F+ +  +++L+ +   +G   ++ AF G+LL  ++  ++LY    K MLR
Sbjct: 848  CQNGFVHLYRFLDDGKRIELLHKTAVEGVPGAMAAFKGRLLVGVDAVLRLYDMGKKRMLR 907

Query: 800  DDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNAN 859
                R L +       I  L+V   G  I VGD  +S   + YK  +      A D    
Sbjct: 908  KCEYRRLPT------RIATLHVS--GSRIYVGDGQESTFFMRYKKGDNQFYIFADDIVPR 959

Query: 860  WMSAVEILDDDIYLGAENNFNLFTVRKNSE-----------GATDEERG-------RLEV 901
             ++A   LD D   GA+   N+F  R   E           G    E G       +L  
Sbjct: 960  HVTAALHLDYDTLAGADRFGNVFVSRLPQEVSAQVEDDPTGGKYATETGLLGGAPNKLRT 1019

Query: 902  VGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKL 960
            +  +H+GE V   +   L         G    +++GT+NG IGV+      E   F + L
Sbjct: 1020 INSFHVGETVTALQRAVL-------QPGGRELIVYGTINGAIGVLYPFTSKEDCDFFQHL 1072

Query: 961  QTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV 1020
            + ++R+    + G +H  +RSF         K+ +DGDL E +  L+  +   +++ ++ 
Sbjct: 1073 EMHMRQEHPPLLGRDHLAYRSF-----YFPVKDVVDGDLCEQYPQLAADKARGVAEELDR 1127

Query: 1021 SVEELCKRVEEL 1032
            S  E+ K++E++
Sbjct: 1128 SPGEVLKKLEDI 1139


>gi|255946770|ref|XP_002564152.1| Pc22g01070 [Penicillium chrysogenum Wisconsin 54-1255]
 gi|211591169|emb|CAP97395.1| Pc22g01070 [Penicillium chrysogenum Wisconsin 54-1255]
          Length = 1209

 Score =  209 bits (532), Expect = 7e-51,   Method: Compositional matrix adjust.
 Identities = 288/1169 (24%), Positives = 489/1169 (41%), Gaps = 171/1169 (14%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
            ++G I +L  FR  G ++D++ I ++  +  ++++   S     R   +   + G R   
Sbjct: 64   VFGIIRSLAAFRVAGSSKDYIIIGSDSGRITIVEY-VPSQNRFNRIHMETFGKSGVRRVV 122

Query: 65   NGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YG 119
             GQ   +DP  R  LI     + L  V+  +++ +L  +  +   + Q L    +    G
Sbjct: 123  PGQYLAVDPKGRACLIASVEKNKLVYVLNRNSQAELTISSPLEAHKPQNLVFAMICLDVG 182

Query: 120  CAKPTIVVLYQDNKDARHVKT-------------YEVALKDKDFVEGPWSQNNLDNGADL 166
               P    L  D  +A    T             YE+ L     V   WS + +D  A +
Sbjct: 183  YDNPVFAALEVDYSEADQDPTGQAYEEIEKVLVYYELDLGLNHVVRK-WS-DPVDRTASM 240

Query: 167  LIPVP-----PPLCGVLIIGEETIVYCSAN--AFKA-IPIRPSITKAYGRVDADGSR--- 215
            L  VP     P   GVL+  E+ + Y  +N  AF+  IP R   T+   R  +  S    
Sbjct: 241  LFQVPGGDDGP--SGVLVCAEDNVTYRHSNQGAFRVPIPRRSGATENPERKRSIVSGVMH 298

Query: 216  -------YLLGDHAG---LLHLLVITHEKEKVTG----LKIELLGETSIASTISYLDNAV 261
                   +LL    G    L+L ++  ++ K TG    L I+      +++ +  L +  
Sbjct: 299  KMRGAFFFLLQTEDGDLFKLNLEMVEDKQGKPTGEVKRLTIKYFDTIPVSTNLLILKSGF 358

Query: 262  VYIGSSYGDS---QLIKLNLQPD-----------------------AKGS-YVEVLERYV 294
            +Y  S  G+    Q  KL   P                         +G+  V ++E   
Sbjct: 359  LYAASESGNHNFYQFEKLGDDPSEAFFSSDSFSADPSVPLTPIYFHPRGAENVNLMESMN 418

Query: 295  NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSL 353
            +L P++D  +++L      Q+ T  G+    + R +++G+ ++E    +LQ +   +W+ 
Sbjct: 419  SLSPLIDSKILNLSEDDAPQIYTICGSGARSTFRTLKHGLEVSEIVESDLQQVPSAVWTT 478

Query: 354  RSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVT 413
            + +  D F T++++SF + T +L++   + +EE    GF S   TL       + LVQV 
Sbjct: 479  KLTRGDEFHTYIILSFANGTLVLSIG--EIVEEVSDTGFLSSAPTLAVQQLGEDSLVQVH 536

Query: 414  SGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTEV- 471
               +R + +  R   NEW +P   S+  A AN  QV +A   G +VY E+  DG L E  
Sbjct: 537  PRGIRHILADQRV--NEWPAPQHRSIVAAAANERQVAVALSSGEIVYFEMDADGTLAEYD 594

Query: 472  KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIP 530
            +  Q+   ++ L +  + E    S   AVG   D +VRI SL PD  L  K        P
Sbjct: 595  ERRQMSGTVTSLSMGEVPEGRMRSSFLAVGC-DDSTVRILSLNPDSTLENKSVQALTSAP 653

Query: 531  RSVLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFS 584
             ++ + A    S      YL   L  G  L  +L+  TGEL+D +   +G +P+ L   S
Sbjct: 654  SALQIMAMADSSSGGTTLYLHIGLYSGVYLRTVLDEVTGELSDTRTRFIGAKPVKLSQVS 713

Query: 585  SKNTTHVFAASDRPTVIYS--SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELT 642
             K  T V A S RP + YS    K  + + ++   +     F+S    + +   +   L 
Sbjct: 714  VKGQTAVLALSTRPWLGYSDVQTKSFMLTPLDYVGLEWGWNFSSEQCVEGMIGIQGRSLR 773

Query: 643  IGTIDDIQKLHIR-SIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDD-- 699
            I T++ +    ++ SIPL   PRR     +   F +    N   +  +     RL+DD  
Sbjct: 774  IFTVEKLDNNMLQESIPLSYTPRRFVKHPEQPLFYVIESDNNVLSPATRQ---RLIDDSQ 830

Query: 700  ---------------------------QTFEFISTYP------LDTFEYGCSILSCSFSD 726
                                       Q  E I+T        L+  E   S+ + SFS 
Sbjct: 831  AQNGEATDLPPADFGYPRATGHWASCIQVVEPITTKSVIFNLDLEDNEAAVSLAAVSFSS 890

Query: 727  -DSNVYYCVGTA---YVLPEENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVYSLNAFN 780
             D   +  VGTA    V P  +  + G I ++   EDG+ L+ I + +      +L  F 
Sbjct: 891  QDDETFLVVGTAKDMTVSPPSS--SCGFIHIYRFQEDGRELEFIHKTQVDEPPLALLGFQ 948

Query: 781  GKLLAAINQKIQLYKWMLRDDGTRELQSECGH---HGHILALYVQTRGDFIVVGDLMKSI 837
            G+LLA I   +++Y     D G ++L  +C        I+ L  QT+G  I+V D+ +S+
Sbjct: 949  GRLLAGIGPILRVY-----DLGMKQLLRKCQAPVVPKTIVGL--QTQGSRIIVSDVRESV 1001

Query: 838  SLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDE 894
            + ++YK+++  +   A D  A W S+  ++D +   G +   NL+ VR   K SE A ++
Sbjct: 1002 TYVVYKYQDNVLIPFADDSIARWTSSTTMVDYETTAGGDKFGNLWLVRCPSKVSEQADED 1061

Query: 895  --------ERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV--GQIPTVIFGTVNGVIG 944
                    E+G L   G  H  E +  F    +   L  + +  G    V++    G IG
Sbjct: 1062 GSGAHLIHEKGYLH--GTPHRLELMVHFYAQDIPTSLHKTQLVAGGRDIVVWTGFQGTIG 1119

Query: 945  VIASLP-HEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESF 1003
            +       E   F + L+T L      + G +H  +R +    K V     +DGDL E +
Sbjct: 1120 MFVPFASREDVDFFQLLETQLASQQPPLAGRDHLMYRGYYAPVKGV-----IDGDLCEMY 1174

Query: 1004 LDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            L L       I+  ++ SV E+ +++ ++
Sbjct: 1175 LLLPNDTKLMIAGELDRSVREIERKISDM 1203


>gi|449282569|gb|EMC89402.1| Splicing factor 3B subunit 3 [Columba livia]
          Length = 1225

 Score =  209 bits (532), Expect = 7e-51,   Method: Compositional matrix adjust.
 Identities = 266/1193 (22%), Positives = 489/1193 (40%), Gaps = 187/1193 (15%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            +L V ++G I +L  FR  G  +D++ + ++  +  +L++   S  +  +   +   + G
Sbjct: 53   LLTVEVFGVIRSLMAFRLTGGTKDYIVVGSDSGRIVILEYQP-SKNVFEKIHQETFGKSG 111

Query: 61   -RPTDNGQIGIIDPDCRLIGLHLYDG--LFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
             R    GQ   +DP  R + +   +   L  ++  D   +L  +  +   +   L    +
Sbjct: 112  CRRIVPGQYLAVDPKGRAVMISAIEKQKLVYILNRDAAARLTISSPLEAHKANTLVYHVV 171

Query: 118  ---YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNG 163
                G   P    L  D ++A +  T E A   +    F E     N+        L+  
Sbjct: 172  GVDVGFENPMFACLEMDYEEADNDPTGEAAANTQQTLTFYELDLGLNHVVRKYSEPLEEH 231

Query: 164  ADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR--- 215
             + LI VP     P   GVLI  E  I Y   N      IR  I +    +D D  R   
Sbjct: 232  GNFLITVPGGSDGP--SGVLICSENYITY--KNFGDQPDIRCPIPRRRNDLD-DPERGMI 286

Query: 216  --------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAV 261
                          +L     G +  + +  +++ VT ++++      +A+ +  L    
Sbjct: 287  FVCSATHKTKSMFFFLAQTEQGDIFKITLETDEDMVTEIRLKYFDTVPVAAAMCVLKTGF 346

Query: 262  VYIGSSYGDSQLIKLN------------------------LQPDAKGSYVEVLERYVNLG 297
            +++ S +G+  L ++                          QP    + V +++   +L 
Sbjct: 347  LFVASEFGNHYLYQIAHLGDDDEEPEFSSAMPLEEGDTFFFQPRPLKNLV-LVDELDSLS 405

Query: 298  PIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSS 356
            PI+   + DL  +   Q+    G     SLR++R+G+ ++E A  EL G    +W++R  
Sbjct: 406  PILCCQIADLANEDTPQLYVACGRGPRSSLRVLRHGLEVSEMAVSELPGNPNAVWTVRRH 465

Query: 357  TDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGS 416
             +D FD +++VSF++ T +L++   + +EE    GF   T TL C     + LVQV    
Sbjct: 466  VEDEFDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSLLGDDALVQVYPDG 523

Query: 417  VRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD-GILTE-VKHA 474
            +R + +  R   NEWK+P   ++     N  QV++A  GG LVY E+   G L E  +  
Sbjct: 524  IRHIRADKR--VNEWKTPGKKTIVKCAVNQRQVVIALTGGELVYFEMDPVGQLNEYTERK 581

Query: 475  QLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIP-RSV 533
            ++  ++ C+ +  +      S+  AVG+  D +VRI SL   + +  + L  + +P +  
Sbjct: 582  EMSADVVCMSLANVPPGEQRSRFLAVGL-VDNTVRIISLDPSDCL--QPLSMQALPAQPE 638

Query: 534  LLCAFE----------------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQP 577
             LC  E                G  YL   L +G LL  +L+  TG+L+D +   LG++P
Sbjct: 639  SLCIVEMGGTEKQDELGERGSIGFLYLNIGLQNGVLLRTVLDPVTGDLSDTRTRYLGSRP 698

Query: 578  ITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAK 637
            + L     +    V A S R  + YS   +   + ++ + +     F S   P+ +    
Sbjct: 699  VKLFRVRMQGQEAVLAMSSRSWLSYSYQSRFHLTPLSYETLEFASGFASEQCPEGIVAIS 758

Query: 638  EGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQEQSR-----------TFAICSLKNQS 684
               L I  ++ +  +  + + PL   PR+ + H E +            T A  + + Q 
Sbjct: 759  TNTLRILALEKLGAVFNQVAFPLQYTPRKFVIHPESNNLIIIETDHNAYTEATKAQRKQQ 818

Query: 685  CAEE-------------SEM-------------------------HFVRLLDDQTFEFIS 706
             AEE             +EM                           +R+++      + 
Sbjct: 819  MAEEMVEAAGEDERELAAEMAAAFLNENLPESIFGAPKAGNGQWASVIRVMNPIQGNTLD 878

Query: 707  TYPLDTFEYGCSILSCSFSDDSNVYYC-VGTAYVLPEENEPTKGRILVF--IVEDG-KLQ 762
               L+  E   S+  C FS+  + +Y  VG A  L        G  +    +V  G KL+
Sbjct: 879  LVQLEQNEAAFSVAVCRFSNTGDEWYVLVGVAKDLILNPRSVAGGFVYTYKLVNSGEKLE 938

Query: 763  LIAEKETKGAVYSLNAFNGKLLAAINQKIQLY----KWMLRDDGTRELQSECGHHGHILA 818
             + +   +    ++  F G++L  + + +++Y    K +LR    + L      H     
Sbjct: 939  FLHKTPVEEVPAAIAPFQGRVLIGVGKLLRVYDLGKKKLLRKCENKILSFLSQKHIANYI 998

Query: 819  LYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENN 878
              +QT G  ++V D+ +S   + YK  E  +   A D    W++   +LD D   GA+  
Sbjct: 999  CGIQTIGHRVIVSDVQESFIWVRYKRNENQLIIFADDTYPRWVTTATLLDYDTVAGADKF 1058

Query: 879  FNLFTVR--KNSEGATDE---------ERGRL-------EVVGEYHLGEFVNRFRHGSLV 920
             N+  VR   N+    DE         +RG L       EV+  YH+GE V   +  +L+
Sbjct: 1059 GNICVVRLPPNTNDEVDEDPTGNKALWDRGLLNGASQKAEVIMNYHVGETVLSLQKTTLI 1118

Query: 921  MRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQW 979
                    G   ++++ T++G IG++     HE + F + ++ +LR     + G +H  +
Sbjct: 1119 -------PGGSESLVYTTLSGGIGILVPFTSHEDHDFFQHVEMHLRSEHPPLCGRDHLSF 1171

Query: 980  RSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            RS+         KN +DGDL E F  +   +   +++ ++ +  E+ K++E++
Sbjct: 1172 RSY-----YFPVKNVIDGDLCEQFNSMEPNKQKNVAEELDRTPPEVSKKLEDI 1219


>gi|189044515|sp|Q7RYR4.2|RSE1_NEUCR RecName: Full=Pre-mRNA-splicing factor rse-1
          Length = 1209

 Score =  209 bits (531), Expect = 8e-51,   Method: Compositional matrix adjust.
 Identities = 279/1164 (23%), Positives = 487/1164 (41%), Gaps = 161/1164 (13%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
            I+G +  +  FR  G  +D++ +AT+  +  ++++  ++++     +        R    
Sbjct: 64   IFGIVRAIASFRLAGSHKDYIILATDSGRITIIEYLPKTNKFQRIHLETFGKSGVRRVIP 123

Query: 66   GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YGC 120
            GQ    DP  R  LI     + L  V+  +++ +L  +  +   +  VL +  +    G 
Sbjct: 124  GQYLAADPKGRACLISALEKNKLVYVLNRNSQAELTISSPLEAHKPGVLVLSLVALDVGY 183

Query: 121  AKPTIVVLYQDNKDARHVKT-------------YEVALKDKDFVEGPWSQNNLDNGADLL 167
            A P    L  D  DA    T             YE+ L     V   WS + +D  + LL
Sbjct: 184  ANPVFAALELDYTDADQDPTGQAREEVETQLVYYELDLGLNHVVRK-WS-DTVDRTSSLL 241

Query: 168  IPVP-----PPLCGVLIIGEETIVYCSAN--AFKA-IPIRPSITK--AYGRVDADGSRYL 217
              VP     P   GVL+ GEE + Y  +N  AF+  IP R   T+     RV   G  + 
Sbjct: 242  FQVPGGNDGP--SGVLVCGEENVTYRHSNQEAFRVPIPRRSGATEDPQRKRVIVSGVMHK 299

Query: 218  LGDHAGLLHLLVITHEKE------------------KVTGLKIELLGETSIASTISYLDN 259
            L   AG    L+ T + +                  +V  LKI+      +A+++  L +
Sbjct: 300  LKGSAGAFFFLLQTDDGDLFKVTIDMIEDSDGNPTGEVKRLKIKYFDTIPVATSLCILKS 359

Query: 260  AVVYIGSSYGDSQLIKL---------------NLQPDAKGSYVEV------LERYV---- 294
              ++  S +G+    +                +   D   SY  V      LE  V    
Sbjct: 360  GFLFAASEFGNHHFYQFEKLGDDDEELEFSSDDFPTDPTASYNPVYFHPRPLENLVLVES 419

Query: 295  --NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMW 351
              ++ P VD  V +L  +   Q+ +  G     + R++++G+ ++E  + EL G    +W
Sbjct: 420  IDSMNPQVDCKVANLTGEDAPQIYSVCGNGARSTFRMLKHGLEVSEIVASELPGTPSAVW 479

Query: 352  SLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQ 411
            + + +  D +D ++V+SF + T +L++   + +EE    GF +   TL       + L+Q
Sbjct: 480  TTKLTKYDQYDAYIVLSFTNGTLVLSIG--ETVEEVSDSGFLTTAPTLAVQQMGEDGLIQ 537

Query: 412  VTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTE 470
            V    +R +        NEW +P   S+  ATAN +QV++A   G +VY E+  DG L E
Sbjct: 538  VHPKGIRHIVQGRV---NEWPAPQHRSIVAATANENQVVIALSSGEIVYFEMDSDGSLAE 594

Query: 471  V-KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEI 528
              +  ++   ++ L +  + E    S   AVG   D +VRI SL PD  L  K       
Sbjct: 595  YDEKKEMSGTVTSLSVGQVPEGLKRSSFLAVGC-DDCTVRILSLDPDSTLEMKSIQALTA 653

Query: 529  IPRSVLLCAFE-----GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTF 583
             P ++ + + E        YL   L  G  L  +L+  TGELTD ++  LG +P  L   
Sbjct: 654  APSALSIMSMEDSFGGSTLYLHIGLHSGVYLRTVLDEVTGELTDTRQKFLGPKPTRLFQV 713

Query: 584  SSKNTTHVFAASDRPTVIYSS--NKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGEL 641
            S ++   V A S RP + Y+    K  + + ++  E+ +   F+S    + +       L
Sbjct: 714  SVQDQPCVLALSSRPWLGYTDPLTKGFMMTPLSYTELEYGWNFSSEQCLEGMVGIHANYL 773

Query: 642  TIGTIDDIQKLHI-RSIPLGEHPRRIC-HQEQSRTFAICSLKN-------QSCAEESEMH 692
             I +I+ +    I +SIPL   P+ +  H EQ   + I S  N           E+    
Sbjct: 774  RIFSIEKLGDNMIQKSIPLTYTPKHLVKHPEQPYFYTIESDNNTLPPELRAKLLEQQSNG 833

Query: 693  FVRLLDDQTFEF----------------ISTYP-------LDTFEYGCSILSCSF-SDDS 728
               +L  + F +                IS  P       LD  E   S     F S + 
Sbjct: 834  DATVLPPEDFGYPRAKGRWASCISIIDPISEEPRVLQRIDLDNNEAAVSAAIVPFASQEG 893

Query: 729  NVYYCVGTAY-VLPEENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVYSLNAFNGKLLA 785
              +  VGT   ++ +  + T+G I V+   EDG+ L+ I +   +    +L  F G+LLA
Sbjct: 894  ESFLVVGTGKDMVLDPRQFTEGYIHVYRFHEDGRDLEFIHKTRVEEPPLALIPFQGRLLA 953

Query: 786  AINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHE 845
             + + +++Y   L+    R+ Q++       L + +Q++G+ I+VGDL + I+ ++YK E
Sbjct: 954  GVGKTLRIYDLGLK-QLLRKAQADV---TPTLIVSLQSQGNRIIVGDLQQGITYVVYKAE 1009

Query: 846  EGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEY 905
               +   A D    W +   ++D +   G +   N++ VR     + + +    E   E 
Sbjct: 1010 GNRLIPFADDTLNRWTTCTTMVDYESVAGGDKFGNIYIVRCPERVSQETD----EPGSEI 1065

Query: 906  HLGEFVNRFRHGS----------LVMRLPDS------DVGQIPTVIFGTVNGVIGV-IAS 948
            HL    N + HG+              LP S       VG    +++  + G +GV I  
Sbjct: 1066 HLMHARN-YLHGTPNRLSLQVHFYTQDLPTSICKTSLVVGGQDVLLWSGLQGTVGVFIPF 1124

Query: 949  LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSR 1008
            +  E   F + L+ ++R     + G +H  +R +    K V     +DGDL E F  L  
Sbjct: 1125 VSREDVDFFQNLENHMRAEDPPLAGRDHLIYRGYYTPVKGV-----IDGDLCERFSLLPN 1179

Query: 1009 TRMDEISKTMNVSVEELCKRVEEL 1032
             +   I+  ++ SV E+ +++ ++
Sbjct: 1180 DKKQMIAGELDRSVREIERKISDI 1203


>gi|350290373|gb|EGZ71587.1| Pre-mRNA-splicing factor rse-1 [Neurospora tetrasperma FGSC 2509]
          Length = 1209

 Score =  209 bits (531), Expect = 9e-51,   Method: Compositional matrix adjust.
 Identities = 279/1164 (23%), Positives = 487/1164 (41%), Gaps = 161/1164 (13%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
            I+G +  +  FR  G  +D++ +AT+  +  ++++  ++++     +        R    
Sbjct: 64   IFGIVRAIASFRLAGSHKDYIILATDSGRITIIEYLPKTNKFQRIHLETFGKSGVRRVIP 123

Query: 66   GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YGC 120
            GQ    DP  R  LI     + L  V+  +++ +L  +  +   +  VL +  +    G 
Sbjct: 124  GQYLAADPKGRACLISALEKNKLVYVLNRNSQAELTISSPLEAHKPGVLVLSLVALDVGY 183

Query: 121  AKPTIVVLYQDNKDARHVKT-------------YEVALKDKDFVEGPWSQNNLDNGADLL 167
            A P    L  D  DA    T             YE+ L     V   WS + +D  + LL
Sbjct: 184  ANPVFAALELDYTDADQDPTGQAREEVETQLVYYELDLGLNHVVRK-WS-DTVDRTSSLL 241

Query: 168  IPVP-----PPLCGVLIIGEETIVYCSAN--AFK-AIPIRPSITK--AYGRVDADGSRYL 217
              VP     P   GVL+ GEE + Y  +N  AF+  IP R   T+     RV   G  + 
Sbjct: 242  FQVPGGNDGP--SGVLVCGEENVTYRHSNQEAFRVPIPRRSGATEDPQRKRVIVAGVMHK 299

Query: 218  LGDHAGLLHLLVITHEKE------------------KVTGLKIELLGETSIASTISYLDN 259
            L   AG    L+ T + +                  +V  LKI+      +A+++  L +
Sbjct: 300  LKGSAGAFFFLLQTDDGDLFKVTIDMIEDSDGNPTGEVKRLKIKYFDTIPVATSLCILKS 359

Query: 260  AVVYIGSSYGDSQLIKL---------------NLQPDAKGSYVEV------LERYV---- 294
              ++  S +G+    +                +   D   SY  V      LE  V    
Sbjct: 360  GFLFAASEFGNHHFYQFEKLGDDDEELEFSSDDFPTDPTASYNPVYFHPRPLENLVLVES 419

Query: 295  --NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMW 351
              ++ P VD  V +L  +   Q+ +  G     + R++++G+ ++E  + EL G    +W
Sbjct: 420  IDSMNPQVDCKVANLTGEDAPQIYSVCGNGARSTFRMLKHGLEVSEIVASELPGTPSAVW 479

Query: 352  SLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQ 411
            + + +  D +D ++V+SF + T +L++   + +EE    GF +   TL       + L+Q
Sbjct: 480  TTKLTKYDQYDAYIVLSFTNGTLVLSIG--ETVEEVSDSGFLTTAPTLAVQQMGEDGLIQ 537

Query: 412  VTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTE 470
            V    +R +        NEW +P   S+  ATAN +QV++A   G +VY E+  DG L E
Sbjct: 538  VHPKGIRHIVQGRV---NEWPAPQHRSIVAATANENQVVIALSSGEIVYFEMDSDGSLAE 594

Query: 471  V-KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEI 528
              +  ++   ++ L +  + E    S   AVG   D +VRI SL PD  L  K       
Sbjct: 595  YDEKKEMSGTVTSLSVGQVPEGLKRSSFLAVGC-DDCTVRILSLDPDSTLEMKSIQALTA 653

Query: 529  IPRSVLLCAFE-----GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTF 583
             P ++ + + E        YL   L  G  L  +L+  TGELTD ++  LG +P  L   
Sbjct: 654  APSALSIMSMEDSFGGSTLYLHIGLHSGVYLRTVLDEVTGELTDTRQKFLGPKPTKLFQV 713

Query: 584  SSKNTTHVFAASDRPTVIYSS--NKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGEL 641
            S ++   V A S RP + Y+    K  + + ++  E+ +   F+S    + +       L
Sbjct: 714  SVQDQPCVLALSSRPWLGYTDPLTKGFMMTPLSYTELEYGWNFSSEQCLEGMVGIHANYL 773

Query: 642  TIGTIDDIQKLHI-RSIPLGEHPRRIC-HQEQSRTFAICSLKN-------QSCAEESEMH 692
             I +I+ +    I +SIPL   P+ +  H EQ   + I S  N           E+    
Sbjct: 774  RIFSIEKLGDNMIQKSIPLTYTPKHLVKHPEQPYFYTIESDNNTLPPELRAKLLEQQSNG 833

Query: 693  FVRLLDDQTFEF----------------ISTYP-------LDTFEYGCSILSCSF-SDDS 728
               +L  + F +                IS  P       LD  E   S     F S + 
Sbjct: 834  DATVLPPEDFGYPRAKGRWASCISIIDPISEEPRVLQRIDLDNNEAAVSAAIVPFASQEG 893

Query: 729  NVYYCVGTAY-VLPEENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVYSLNAFNGKLLA 785
              +  VGT   ++ +  + T+G I V+   EDG+ L+ I +   +    +L  F G+LLA
Sbjct: 894  ESFLVVGTGKDMVLDPRQFTEGYIHVYRFHEDGRDLEFIHKTRVEEPPLALIPFQGRLLA 953

Query: 786  AINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHE 845
             + + +++Y   L+    R+ Q++       L + +Q++G+ I+VGDL + I+ ++YK E
Sbjct: 954  GVGKTLRIYDLGLK-QLLRKAQADV---TPTLIVSLQSQGNRIIVGDLQQGITYVVYKAE 1009

Query: 846  EGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEY 905
               +   A D    W +   ++D +   G +   N++ VR     + + +    E   E 
Sbjct: 1010 GNRLIPFADDTLNRWTTCTTMVDYESVAGGDKFGNIYIVRCPERVSQETD----EPGSEI 1065

Query: 906  HLGEFVNRFRHGS----------LVMRLPDS------DVGQIPTVIFGTVNGVIGV-IAS 948
            HL    N + HG+              LP S       VG    +++  + G +GV I  
Sbjct: 1066 HLMHARN-YLHGTPNRLSLQVHFYTQDLPTSICKTSLVVGGQDVLLWSGLQGTVGVFIPF 1124

Query: 949  LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSR 1008
            +  E   F + L+ ++R     + G +H  +R +    K V     +DGDL E F  L  
Sbjct: 1125 VSREDVDFFQNLENHMRAEDPPLAGRDHLIYRGYYTPVKGV-----IDGDLCERFSLLPN 1179

Query: 1009 TRMDEISKTMNVSVEELCKRVEEL 1032
             +   I+  ++ SV E+ +++ ++
Sbjct: 1180 DKKQMIAGELDRSVREIERKISDI 1203


>gi|340520436|gb|EGR50672.1| predicted protein [Trichoderma reesei QM6a]
          Length = 1212

 Score =  208 bits (530), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 286/1175 (24%), Positives = 487/1175 (41%), Gaps = 180/1175 (15%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW---DAESSELITRAMGDVSDRIGRP 62
            I+G I +L  FR  G  +D+L +AT+  +  ++++   +   S+L     G    R   P
Sbjct: 64   IFGIIRSLAAFRLAGSNKDYLILATDSGRITIIEYLPKENRFSKLHLETFGKSGVRRVIP 123

Query: 63   TDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL--- 117
               G+    DP  R  LI     + L  V+  +++ +L  +  +   +  VL I  +   
Sbjct: 124  ---GEYLACDPKGRACLIASIEKNKLVYVLNRNSQAELTISSPLEAHKPGVLVISMVALD 180

Query: 118  YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD-------------FVEGPWSQNNLDNGA 164
             G A P    L  D  +     T E AL+D +              V   WS+  +D+ A
Sbjct: 181  VGYANPVFAALEIDYTEVDQDSTGE-ALRDLETQLVYYELDLGLNHVVRKWSEP-VDSTA 238

Query: 165  DLLIPVP-----PPLCGVLIIGEETIVYCSAN--AFKA-IPIRPSITK--AYGRVDADGS 214
             LL  VP     P   GVL+ GEE I Y  +N  AF+  IP R   T+  +  R    G 
Sbjct: 239  SLLFQVPGGNDGP--SGVLVCGEENITYRHSNQEAFRVPIPRRRGATEDPSRKRTIISGV 296

Query: 215  RYLLGDHAGLLHLLVITHEKE------------------KVTGLKIELLGETSIASTISY 256
             + L   AG    L+ T + +                  +V  LKI+       A+++  
Sbjct: 297  MHKLKGSAGAFFFLLQTEDGDMFKVTIDMVEDDEGNTTGEVRRLKIKYFDTVPPAASLCI 356

Query: 257  LDNAVVYIGSSYGD---SQLIKL------------NLQPDAKGSYVEV------------ 289
            L +  +Y+ S +G+    Q  KL            +   DA+ SY  V            
Sbjct: 357  LKSGFLYVASQFGNFSFYQFEKLGDDDEELEFTSDDFPVDAQASYDPVYFYPRPLENLVL 416

Query: 290  LERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-K 348
            +E   ++ P++D  + +L  +   Q+ T  G     + R +++G+ INE  S EL GI  
Sbjct: 417  VESIPSMNPLLDCKIANLTGEDAPQIYTICGNGARSTFRTLKHGLEINEIVSSELPGIPS 476

Query: 349  GMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQ 408
             +W+L+ + ++ +D ++V+SF + T +L++   + +EE    GF +   TL       + 
Sbjct: 477  AVWTLKLNRNEQYDAYIVLSFTNGTLVLSIG--ETVEEVSDSGFLTSVPTLAAQLLGEDG 534

Query: 409  LVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGI 467
            L+QV    +R + +      N+W +P   S+  AT NA QV +A   G +VY E+  DG 
Sbjct: 535  LIQVHPKGIRHIRNGQV---NQWDAPQHRSIVAATTNAHQVAIALSSGEIVYFEMDDDGS 591

Query: 468  LTEVKHAQLEY-EISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLG 525
            L E    +  +  ++CL +  + E    S   AVG   D +VRI SL P+  L  K    
Sbjct: 592  LAEYDEKKEMFGTVTCLSLGEVPEGRLRSSFLAVGC-DDCTVRILSLDPESTLENKSVQA 650

Query: 526  GEIIPRSVLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPIT 579
                P S+ + + E  S      YL   L  G  L  +L+  TGELTD ++  LG + + 
Sbjct: 651  LTAAPTSLAIISMEDSSSGGSTLYLHIGLYSGVYLRTVLDEVTGELTDTRQKFLGPKQVR 710

Query: 580  LRTFSSKNTTHVFAASDRPTVIYSS--NKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAK 637
            L   + +  T V   S RP + Y+    K  + + +N  ++     F S    + +   +
Sbjct: 711  LFQVTVQGRTCVLGLSSRPWLGYADPITKTFVVTPLNYVDLEWGWNFTSEQCEEGIVGIQ 770

Query: 638  EGELTIGTIDDIQKLHIR-SIPLGEHPRRICHQEQSRTFAICSLKNQS-----CAE---- 687
               L I ++D +    I+ SIPL   P+++        F +    N +     CA+    
Sbjct: 771  GQTLRIFSVDRLGDTLIQSSIPLTYTPKKMVKHPDQPLFYVIEADNHTLSPALCAQLLAD 830

Query: 688  -------------------------ESEMHFVRLL--DDQTFEFISTYPLDTFEYGCSIL 720
                                      S +  V  L  D Q  + I    L+  E   S+ 
Sbjct: 831  PARVNGDSKVLPPEEFGHPRGNRRWASCISVVDPLAEDGQVLQRID---LEENEAAVSLA 887

Query: 721  SCSFSDDSN-VYYCVGTAYVL---PEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSL 776
              +F+   N  +  VGT   +   P         I  F  +   L  I + + +    ++
Sbjct: 888  IVTFASQENETFLVVGTGKDMVLNPRSFSDAFVHIYRFERDGRGLVFIHKTKVEEPPMAM 947

Query: 777  NAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKS 836
              F G++L  I + +++Y   +R    R+ Q+E      I++L  Q  G  IVVGD+ + 
Sbjct: 948  IPFQGRVLVGIGKMLRIYDLGMR-QLLRKSQAEVAPQ-QIVSLNAQ--GSRIVVGDVQQG 1003

Query: 837  ISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATD 893
            I+ +++K +   +     D  A W +   ++D +   G +   N+F VR   K SE A +
Sbjct: 1004 ITYVVFKQQTNKLIPFVDDTVARWTTCSTMVDYETTAGGDKFGNIFIVRAPQKASEEADE 1063

Query: 894  EERG---------------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGT 938
            E  G               RL+++   +  +        SLV       VG    +++  
Sbjct: 1064 EPAGLHLLNARSYLHGTSHRLDLMCHLYTQDIPTSITKTSLV-------VGGQEVLLWSG 1116

Query: 939  VNGVIGV-IASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDG 997
            + G IGV I  +  E   F + L+ +LR     + G +H  +RS+    K V     +DG
Sbjct: 1117 LMGTIGVLIPFVTREDADFFQSLEQHLRAEDPPLAGRDHLMYRSYYAPMKGV-----IDG 1171

Query: 998  DLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            DL E +  L   +   I+  ++ SV E+ +++ ++
Sbjct: 1172 DLCERYALLPNDKKQMIAGELDRSVREIERKISDI 1206


>gi|334313376|ref|XP_003339894.1| PREDICTED: splicing factor 3B subunit 3-like [Monodelphis domestica]
          Length = 1202

 Score =  208 bits (530), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 268/1183 (22%), Positives = 487/1183 (41%), Gaps = 190/1183 (16%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            +L V ++G I +L  FR  G  +D++ + ++  +  +L++   S  +  +   +   + G
Sbjct: 53   LLTVEVFGVIRSLMAFRLTGGTKDYIVVGSDSGRIVILEYQP-SKNMFEKIHQETFGKSG 111

Query: 61   -RPTDNGQIGIIDPDCRLIGLHLYDG--LFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
             R    GQ   +DP  R + +   +   L  ++  D   +L  +  +   +   L    +
Sbjct: 112  CRRIVPGQFLAVDPKGRAVMISAIEKQKLVYILNRDAAARLTISSPLEAHKANTLVYHVV 171

Query: 118  ---YGCAKPTIVVLYQDN--KDARHVKTYEVALKDKDFVEGPWSQNN---LDNGADLLIP 169
                G   P    L  D   K+   ++  +      D     +   N   L  G+D    
Sbjct: 172  GVDVGFENPMFACLEMDYEVKEGTGLENKDKKFGGVDVTHPSYMIQNVTILPGGSD---- 227

Query: 170  VPPPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR-------------- 215
                  GVLI  E  I Y   N      IR  I +    +D D  R              
Sbjct: 228  ---GPSGVLICSENYITY--KNFGDQPDIRCPIPRRRKELD-DPERGKIFVCSATPKNKS 281

Query: 216  --YLLGD--HAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDS 271
              + LG      +  + + T E   VT ++++      +A+ +  L    +++ S +G+ 
Sbjct: 282  MFFFLGQTGQGDIFQITLETDENMGVTEIRLKYFDTVPVAAAMCVLKTGFLFVASEFGNH 341

Query: 272  QLIKLN------------------------LQPDAKGSYVEVLERYVNLGPIVDFC-VVD 306
             L ++                          QP    + V +++   +L PI+ FC + D
Sbjct: 342  YLYQIAHLGDDDEEPEFSSAMPLEEGDTFFFQPRPLKNLV-LVDELDSLSPIL-FCQIAD 399

Query: 307  LERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTDDPFDTFL 365
            L  +   Q+    G     SLR++R+G+ ++E A  EL G    +W++R   +D FD ++
Sbjct: 400  LANEDTPQLYVACGRGPRSSLRVLRHGLEVSEMAVSELPGNPNAVWTVRRHIEDEFDAYI 459

Query: 366  VVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSR 425
            +VSF++ T +L++   + +EE    GF   T TL C     + LVQV    +R + +  R
Sbjct: 460  IVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSLLGDDALVQVYPDGIRHIRADKR 517

Query: 426  ELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKHAQLEYEISCL 483
               NEWK+P   ++     N  QV++A  GG LVY E+   G L E  +  ++  ++ C+
Sbjct: 518  --VNEWKTPGKKTIVKCAVNQRQVVIALTGGELVYFEMDPSGQLNEYTERKEMSADVVCM 575

Query: 484  DINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIP-RSVLLCAFE--- 539
             +  +      S+  AVG+  D +VRI SL   + +  + L  + +P +   LC  E   
Sbjct: 576  SLANVPPGEQRSRFLAVGL-VDNTVRIISLDPSDCL--QPLSMQALPAQPESLCIVEMGG 632

Query: 540  -------------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSK 586
                         G  YL   L +G LL  +L+  TG+L+D +   LG++P+ L     +
Sbjct: 633  AEKQDELGERGSIGFLYLNIGLQNGVLLRTVLDPVTGDLSDTRTRYLGSRPVKLFRVRMQ 692

Query: 587  NTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTI 646
                V A S R  + YS   +   + ++ + +     F S   P+ +       L I  +
Sbjct: 693  GQEAVLAMSSRSWLSYSYQSRFHLTPLSYETLEFASGFASEQCPEGIVAISTNTLRILAL 752

Query: 647  DDIQKLHIR-SIPLGEHPRR-ICHQEQSR-----------TFAICSLKNQSCAEE----- 688
            + +  +  + + PL   PR+ + H E +            T A  + + Q  AEE     
Sbjct: 753  EKLGAVFNQVAFPLQYTPRKFVIHPESNNLIIIETDHNAYTEATKAQRKQQMAEEMVEAA 812

Query: 689  --------SEM-------------------------HFVRLLDDQTFEFISTYPLDTFEY 715
                    +EM                           VR+++      +    L+  E 
Sbjct: 813  GEDERELAAEMAAAFLNENLPESIFGAPKAGNGQWASVVRVMNPIQGNTLDLVQLEQNEA 872

Query: 716  GCSILSCSFSDDSNVYYC-VGTAYVLPEENEPTKGRILVF--IVEDG-KLQLIAEKETKG 771
              S+  C FS+  + +Y  VG A  L        G  +    +V  G KL+ + +   + 
Sbjct: 873  AFSVAVCRFSNTGDDWYVLVGVAKDLILNPRSVAGGFVYTYKLVNSGEKLEFLHKTPVEE 932

Query: 772  AVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY---VQTRGDFI 828
               ++  F G++L  + + +++Y     D G ++L  +C  + HI A Y   +QT G  +
Sbjct: 933  VPAAIAPFQGRVLIGVGKLLRVY-----DLGKKKLLRKC-ENKHI-ANYISGIQTIGHRV 985

Query: 829  VVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR--K 886
            +V D+ +S   + YK  E  +   A D    W++   +LD D   GA+   N+  VR   
Sbjct: 986  IVSDVQESFIWVRYKRNENQLIIFADDTYPRWVTTASLLDYDTVAGADKFGNICVVRLPP 1045

Query: 887  NSEGATDE---------ERGRL-------EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQ 930
            N+    DE         +RG L       EV+  YH+GE V   +  +L+        G 
Sbjct: 1046 NTNDEVDEDPTGNKALWDRGLLNGASQKAEVIMNYHVGETVLSLQKTTLI-------PGG 1098

Query: 931  IPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTV 989
              ++++ T++G IG++     HE + F + ++ +LR     + G +H  +RS+       
Sbjct: 1099 SESLVYTTLSGGIGILVPFTSHEDHDFFQHVEMHLRSEHPPLCGRDHLSFRSY-----YF 1153

Query: 990  DAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
              KN +DGDL E F  +   +   +S+ ++ +  E+ K++E++
Sbjct: 1154 PVKNVIDGDLCEQFNSMEPNKQKNVSEELDRTPPEVSKKLEDI 1196


>gi|299751161|ref|XP_001830098.2| pre-mRNA-splicing factor rse1 [Coprinopsis cinerea okayama7#130]
 gi|298409248|gb|EAU91763.2| pre-mRNA-splicing factor rse1 [Coprinopsis cinerea okayama7#130]
          Length = 1205

 Score =  208 bits (530), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 262/1172 (22%), Positives = 475/1172 (40%), Gaps = 176/1172 (15%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESS---ELITRAMGDVSDRIGRP 62
            ++G I +L  FR  G ++D+  + ++  +  +L++D +++   +L     G    R   P
Sbjct: 58   VFGSIRSLAAFRLTGGSKDYAIVGSDSGRIVILEYDPKTNVFNKLHQETYGKSGTRRIVP 117

Query: 63   TDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNI---RLEELQVLDIKFL 117
               GQ    DP  R  +I       L  V+  D    L  +  +   R   +    +   
Sbjct: 118  ---GQYLATDPRGRSVMIAAMENSKLVYVLNRDVATNLTISSPLGAHRASTIVHDVVGVD 174

Query: 118  YGCAKPTIVVLYQDNKDARHVKT-------------YEVALKDKDFVEGPWSQNNLDNGA 164
             G   P    L  D   +    T             YE+ L     V   WS+   D  A
Sbjct: 175  VGFDNPVYAALEVDYSQSDQDPTGNALRRAEKLLTYYELDLGLNHVVRK-WSEPT-DPRA 232

Query: 165  DLLIPVP------------PPLCGVLIIGEETIVYCSANAFK---AIPIR--PSITKAYG 207
            +LL+ VP            P   GVL+  E  I+Y   +A      IP R  P       
Sbjct: 233  NLLVQVPGGQSTNGATHDGP--SGVLVCCENHIIYQHGDAPSHRVPIPRRQDPVADPNQD 290

Query: 208  RVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSS 267
             +      + +    G +    + HE   V+ LKI+      +AS +  L   ++++ S 
Sbjct: 291  VIIVAAVMHKMKTEDGDVFKATVDHEDGVVSSLKIKYFETVPVASNLCILKQGLLFVASE 350

Query: 268  YGDSQLIKL------------------------NLQPDAKGSY-------VEVLERYVNL 296
            +G+  L +                           QP  + S+       + +++   +L
Sbjct: 351  FGNHHLYRFCKLGDDDDQPEFSSSSYPSYGMAEPEQPLPRASFDPRPMENLMLIDELSSL 410

Query: 297  GPIVDFCVVDLERQGQG-QVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI---KGMWS 352
             P++D  ++      +  ++    G     S +I+ +G+ ++E AS EL G      +WS
Sbjct: 411  NPVLDAKILKPSLDSEAPKIFAACGRGPASSFKILSHGLEVDEMASSELPGFLAPHSLWS 470

Query: 353  LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
             +    D +D  LV+SF + T +L++   + +EE +  GF +   TL       N L+QV
Sbjct: 471  TKRMQTDHYDNLLVMSFQNATIVLSIG--ESMEEVKDSGFLTSMPTLAVQQIGENGLIQV 528

Query: 413  TSGSVR-LVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE 470
             +  +R LV S      NEWK P G ++  A  N  QV++A     +VY E+  +G L E
Sbjct: 529  HTHGIRHLVDSQV----NEWKVPQGQTIVAANTNRRQVVVALSSAEIVYFELDQEGQLNE 584

Query: 471  VKHAQ-LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEI 528
             +  + +   +  L +  + E        AVG   D +VR+ SL PD  L T        
Sbjct: 585  YQDMKAMGSTVLALGLGEVPEGRLRFPYLAVGC-EDQTVRVMSLDPDSTLETISLQALTA 643

Query: 529  IPRSVLLCAFEGIS--------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITL 580
             P S+ +      S        ++   L +G LL  +L+   G+LTD +   LGT+P+ L
Sbjct: 644  PPSSICIAYMLDASINKVQRSMFVNIGLANGVLLRTVLDGTNGQLTDTRTRFLGTKPVRL 703

Query: 581  RTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGE 640
                 K    + A S R  + YS  +K+ ++ +  +       F   + P+ +    +  
Sbjct: 704  LRVKLKGEDAILAISSRVWLNYSHQQKMEFTPLACETPECASSFTGESCPEGIISIAKNT 763

Query: 641  LTIGTIDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEM-------- 691
            L+I T+  +  KL   S+PL   PR+      +R F +     ++ +EE E         
Sbjct: 764  LSISTVSKLGMKLKQESVPLAFTPRKFVTHPGNRFFYLIESDQRTLSEEEEKKKLADLNI 823

Query: 692  -----------------------HF---VRLLDDQTFEFISTYPLDTFEYGCSILSCSFS 725
                                   H+   + + D    + ++T PL   E   SI    F+
Sbjct: 824  SRDDHPILQLPAKIFGRTRASAGHWASRIHIFDPMEAKTVATLPLKANEAAFSIAVVPFA 883

Query: 726  DDSNVYY-CVGTAY---VLPEENEPTKGRILVFIVEDGKLQLIAEKETKGA--VYSLNAF 779
                 Y+  VGTA    V P +   +  ++   + E   L+L+ E   + +    +L AF
Sbjct: 884  STGGEYHLVVGTAMHHLVTPPQASASYLKVYKIVNEGTGLELLHETPIQDSELPRALLAF 943

Query: 780  NGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISL 839
             G+LLA + + +++Y     D G ++L  +         + + T+G  IV+GD+ +S   
Sbjct: 944  QGRLLAGVGKALRIY-----DLGKKKLLRKAETKSPTAIVSLATQGSRIVIGDMQESTLF 998

Query: 840  LIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNS----------- 888
             +YK  E  +     D    W+SA+ ++D +     +   N+F  R +S           
Sbjct: 999  AVYKEAENRLLIFGDDTQPRWVSAMTMVDYNTVAVGDKFGNIFVNRLDSTISDQVDEDPT 1058

Query: 889  ------EGAT-DEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNG 941
                  E AT +    + +++  +H+G+ +      SLV       VG    +++  + G
Sbjct: 1059 GAGILHEKATLNGAPHKTKMLAHFHVGDIITSIHKVSLV-------VGGREVLLYTGLQG 1111

Query: 942  VIGVIASLPHEQYL-FLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLI 1000
             IG++  L  ++ + FL  L+ ++R     + G +H  WR +      V  K  +DGDL 
Sbjct: 1112 TIGILVPLTSKEDIEFLTMLEQHIRNEQGSLVGRDHLSWRGY-----YVPVKAVIDGDLC 1166

Query: 1001 ESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            E++  LS ++   I+  ++ +V ++ K+++++
Sbjct: 1167 ETYGGLSSSKQSAIASELDRTVGDVLKKLDQM 1198


>gi|325096432|gb|EGC49742.1| pre-mRNA-splicing factor Rse1 [Ajellomyces capsulatus H88]
          Length = 1209

 Score =  208 bits (529), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 271/1176 (23%), Positives = 486/1176 (41%), Gaps = 185/1176 (15%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
            ++G I  L  FR  G ++D++ I ++  +  ++++   S     R   +   + G R   
Sbjct: 64   VFGIIRALAAFRLAGSSKDYIIIGSDSGRITIIEY-VPSQNRFNRIHLETFGKSGIRRVI 122

Query: 65   NGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YG 119
             GQ   +DP  R  LI     + L  V+  + + +L  +  +   + Q L         G
Sbjct: 123  PGQYLAVDPKGRACLIASVEKNKLVYVLNRNTQAELTISSPLEAHKPQTLVFALTALDVG 182

Query: 120  CAKPTIVVLYQDNKDA-------------RHVKTYEVALKDKDFVEGPWSQNNLDNGADL 166
               P    L  +  +A             +H+  YE+ L     V   WS + +D  A +
Sbjct: 183  YDNPIFAALEVEYTEADQDPTGQAYEELEKHLVYYELDLGLNHVVRK-WS-DPVDRSASM 240

Query: 167  LIPVP-----PPLCGVLIIGEETIVYCSAN--AFKA-IPIRPSITKAYGRVDADGSRYLL 218
            L  VP     P   G L+  E+ I Y  +N  AF+  IP R   T+     + +  RY+ 
Sbjct: 241  LFQVPGGADGP--SGCLVCAEDNITYRHSNQDAFRVPIPRRSGPTE-----NPERKRYIT 293

Query: 219  GDHAGLLH---------------------LLVITHEKEKVTG----LKIELLGETSIAST 253
               AG++H                     L ++  +  ++TG    LK++      IAS+
Sbjct: 294  ---AGVMHKMRGAFFFLLQTEDGDLFKVTLDMVEDDNGQITGEVSRLKLKYFDTVPIASS 350

Query: 254  ISYLDNAVVYIGSSYGDSQLIKL-------------------NLQPDAKGSY-------- 286
            +  L +  +++ S  G+    +                    +L      +Y        
Sbjct: 351  LCILKSGFLFVASETGNHHFYQFEKLGDDDDEIEFTSDAYSADLSEPLPPAYFRPRPYEN 410

Query: 287  VEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG 346
            V ++E   +L P++D  + +L      Q+ T  G     S R +++G+ + E    EL  
Sbjct: 411  VNLVESIDSLNPLMDCKIANLTDDDAPQIYTICGTGARSSFRTLKHGLEVAEIVESELPS 470

Query: 347  I-KGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAI 405
            +   +W+ + + +D FD ++++SF + T +L++   + +EE    GF S   TL      
Sbjct: 471  VPSAVWTTKLTREDEFDAYIILSFTNGTLVLSIG--ETVEEVTDTGFLSSAPTLAVQQLG 528

Query: 406  YNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG- 464
             + L+QV    +R + +  R   NEW +P   S+  AT N  QV +A   G +VY E+  
Sbjct: 529  EDSLIQVHPKGIRHIHADRR--VNEWPAPQHRSIVAATTNERQVAIALSSGEIVYFEMDT 586

Query: 465  DGILTEVKHAQ-LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKE 522
            DG L E    + +   ++CL +  +    + S   AVG   D +VRI SL P+  L  K 
Sbjct: 587  DGSLAEYDEKRAMSGTVTCLSLGEVPRGRARSSFLAVGC-DDSTVRILSLDPESTLENKS 645

Query: 523  HLGGEIIPRSVLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQ 576
                   P ++ + A    +      YL   L  G  L  +L+  TGEL+D +   LG +
Sbjct: 646  VQALTSAPSALSIMAMSDSTSGGSTLYLHIGLYSGIYLRTVLDEVTGELSDTRTRFLGLK 705

Query: 577  PITLRTFSSKNTTHVFAASDRPTVIYS--SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLA 634
            P+ L   S K  T V A S RP + YS    K  + + ++   +     F+S    + + 
Sbjct: 706  PVKLFQVSVKEQTAVLALSSRPWLGYSHLQTKAFMLTPLDYVSLEWGWNFSSEQCVEGMV 765

Query: 635  IAKEGELTIGTIDDIQ-KLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHF 693
              +   L I +++ ++  L   +IPL   PR      +   F +   +N   +  +    
Sbjct: 766  GIQGQNLRIFSLEKLENNLLQETIPLQYTPRHFIKHPEHPLFYVIEAENNILSPGTR--- 822

Query: 694  VRLLDDQTFEFISTYPLDTFEYG-----------------------------------CS 718
             +LL+D       T PL   E+G                                    S
Sbjct: 823  TKLLNDSDAVNGDTTPLPPEEFGYPRGTGHWASCIQIVDPINSKRVISQIELEENEAAVS 882

Query: 719  ILSCSFSD-DSNVYYCVGTAY-VLPEENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVY 774
            + +  FS  D   +  VGT   ++      T G I ++   E+GK L+ I + + +    
Sbjct: 883  VAAVPFSSQDDETFLVVGTGKDMVVNPRSCTAGFIHIYRFQEEGKELEFIHKTKVEQPPM 942

Query: 775  SLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLM 834
            +L  F G+LLA I   +++Y   ++    R+ Q+    H   L + +QT+G  I+V D+ 
Sbjct: 943  ALLGFQGRLLAGIGTDLRIYDLGMK-QMLRKCQASVVPH---LVVGLQTQGSRIIVSDVQ 998

Query: 835  KSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGA 891
            +S++ ++YK++E  +     D  + W +   ++D +   G +   NL+ +R   K SE A
Sbjct: 999  ESLTYVVYKYQENRLIPFVDDVISRWTTCTTMVDYETVAGGDKFGNLWLLRCPAKASEEA 1058

Query: 892  TDEERG---------------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIF 936
             ++  G               RL +V  ++  +     +   LV    D  V    T + 
Sbjct: 1059 DEDGSGAHLIHERQYLQGAPNRLNLVAHFYPQDLPTSIQKAQLVTGGRDILVW---TGLQ 1115

Query: 937  GTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLD 996
            GTV+ +I  I+    E+  F + L+  L      + G +H  +RS+        AK  +D
Sbjct: 1116 GTVSMLIPFIS---REEVDFFQSLEMQLAAQNPPLAGRDHLIYRSY-----YAPAKGTID 1167

Query: 997  GDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            GDL E++L L   +  +I+  ++ SV E+ +++ ++
Sbjct: 1168 GDLCETYLLLPNDKKQQIAGELDRSVREIERKIADM 1203


>gi|119473054|ref|XP_001258481.1| nuclear mRNA splicing factor, putative [Neosartorya fischeri NRRL
            181]
 gi|119406633|gb|EAW16584.1| nuclear mRNA splicing factor, putative [Neosartorya fischeri NRRL
            181]
          Length = 1209

 Score =  208 bits (529), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 271/1168 (23%), Positives = 477/1168 (40%), Gaps = 169/1168 (14%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
            ++G I +L  FR  G ++D++ I ++  +  ++++   S     R   +   + G R   
Sbjct: 64   VFGIIRSLAAFRLAGSSKDYIIIGSDSGRITIIEY-VPSQNRFNRIHLETFGKSGVRRVV 122

Query: 65   NGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YG 119
             GQ   +DP  R  LI     + L  V+  +++ +L  +  +   + Q +         G
Sbjct: 123  PGQYLAVDPKGRACLIASVEKNKLVYVLNRNSQAELTISSPLEAHKPQTVVFAMTALDVG 182

Query: 120  CAKPTIVVLYQDNKDARHVKT-------------YEVALKDKDFVEGPWSQNNLDNGADL 166
               P    L  D  +A    T             YE+ L     V   W+ + +D  A +
Sbjct: 183  YENPIFAALEVDYSEADQDPTGRAYEESEKLLVYYELDLGLNHVVRK-WA-DPVDRTASM 240

Query: 167  LIPVPPPL---CGVLIIGEETIVYCSAN--AFKA-IPIRPSITKAYGRVDADGSRYLLGD 220
            L  VP       GVL+  E++I Y  +N  AF+  IP R   T+   R  +  +  +   
Sbjct: 241  LFQVPGGADGPSGVLVCAEDSITYRHSNQDAFRVPIPRRIGATENPERKRSIVAGVMHKM 300

Query: 221  HAGLLHLL-------------VITHEKEKVTG----LKIELLGETSIASTISYLDNAVVY 263
                  LL             ++  +  ++TG    LKI+      IAS +  L +  +Y
Sbjct: 301  RGAFFFLLQTEDGDLFKVTIDMVEDDSGQLTGEVKRLKIKYFDTVPIASNLLILKSGFLY 360

Query: 264  IGSSYGDSQLIKL---------------NLQPDAKGSY------------VEVLERYVNL 296
            + S  G+    +                N   D    Y            + ++E   +L
Sbjct: 361  VASEAGNHHFYQFEKLGDDDEETEFSSENFPADPSVPYEPVFFRPRGAENLNLVETLNSL 420

Query: 297  GPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSLRS 355
             P++D  +V+L      Q+ T SG     S R +++G+ ++E    EL  +   +W+ + 
Sbjct: 421  NPLIDSKIVNLNEDDAPQIYTVSGTGARSSFRTLKHGLEVSEIVESELPSVPSAVWTTKL 480

Query: 356  STDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSG 415
            +  D FD ++++SF + T +L++   + +EE    GF S   TL       + L+QV   
Sbjct: 481  TRADEFDAYIILSFANGTLVLSIG--ETVEEVTDTGFLSTAPTLAVQQLGEDSLIQVHPR 538

Query: 416  SVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTEV-KH 473
             +R + +  R   NEW +P   S+  A  N  QV +A   G +VY E+  DG L E  + 
Sbjct: 539  GIRHILADRR--VNEWPAPQHRSIVAAATNERQVAVALSSGEIVYFEMDADGTLAEYDER 596

Query: 474  AQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPRS 532
             Q+   ++CL +  + E    S   AVG   D +VRI SL PD  L  K        P +
Sbjct: 597  RQMSGTVTCLSLGEVPEGRVRSSFLAVGC-DDSTVRILSLDPDSTLENKSVQALTSAPSA 655

Query: 533  VLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSK 586
            + + +    S      YL   L  G  L  +L+  TGEL+D +   LG + + L   S K
Sbjct: 656  LNIMSMADSSSGGTTLYLHIGLYSGVYLRTVLDEVTGELSDTRTRFLGAKQVKLFRVSVK 715

Query: 587  NTTHVFAASDRPTVIYS--SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIG 644
              T V A S RP + YS    K  + + ++   +     F+S    + +   +   L I 
Sbjct: 716  GQTAVLALSSRPWLGYSDIQTKGFMLTPLDYVGLEWGWNFSSEQCVEGMVGIQAQNLRIF 775

Query: 645  TIDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVR-------- 695
            +I+ +   +   SIPL   PRR+    +   F +    N   +  +    +         
Sbjct: 776  SIEKLDNNILQESIPLSNTPRRMLKHPEQPLFYVIESDNNVLSPATRARLIEDSKARNGE 835

Query: 696  --LLDDQTFEF----------------------ISTYPLDTFEYGCSILSCSFSD-DSNV 730
              +L  + F +                      IST  L+  E   S+ +  FS  D   
Sbjct: 836  TNVLPPEDFGYPRATGHWASCIQIVDPLDAKAVISTIELEENEAAVSMAAVPFSSQDDET 895

Query: 731  YYCVGTAYVLPEENEPTKGRILVFIV---EDGK-LQLIAEKETKGAVYSLNAFNGKLLAA 786
            +  VGTA  +   N P+     + I    EDGK L+ I + + +    +L  F G+LLA 
Sbjct: 896  FLVVGTAKDM-IVNPPSSAGGFIHIYRFQEDGKELEFIHKTKVEEPPLALLGFQGRLLAG 954

Query: 787  INQKIQLYKWMLRDDGTRELQSECGHHGHILALYV---QTRGDFIVVGDLMKSISLLIYK 843
            I   +++Y     D G ++L  +C     +++  +   QT+G  I+V D+ +S++ ++YK
Sbjct: 955  IGSTLRVY-----DLGMKQLLRKC--QAPVVSKTIVGLQTQGSRIIVSDVRESVTYVVYK 1007

Query: 844  HEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDE------ 894
            ++E  +     D  + W ++  ++D +   G +   NL+ VR   K SE A ++      
Sbjct: 1008 YQENVLIPFVDDSVSRWTTSTTMVDYETVAGGDKFGNLWLVRCPKKVSEEADEDGSGAHL 1067

Query: 895  --ERG-------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGV 945
              ERG       RL+++   +  +         LV        G    +++    G IG+
Sbjct: 1068 IHERGYLHGTPNRLDLMIHTYTQDIPTSLHKTQLV-------AGGRDILVWTGFQGTIGM 1120

Query: 946  IAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFL 1004
            +   +  E   F + L+  L      + G +H  +RS+         K  +DGDL E + 
Sbjct: 1121 LVPFVSREDVDFFQNLEMQLASQCPPLAGRDHLIYRSY-----YAPVKGVIDGDLCEMYF 1175

Query: 1005 DLSRTRMDEISKTMNVSVEELCKRVEEL 1032
             L       I+  ++ SV E+ +++ ++
Sbjct: 1176 LLPNDTKMMIAAELDRSVREIERKISDM 1203


>gi|37359734|dbj|BAC97845.1| mKIAA0017 protein [Mus musculus]
          Length = 1122

 Score =  207 bits (527), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 257/1139 (22%), Positives = 471/1139 (41%), Gaps = 182/1139 (15%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            +L V ++G I +L  FR  G  +D++ + ++  +  +L++   S  +  +   +   + G
Sbjct: 53   LLTVEVFGVIRSLMAFRLTGGTKDYIVVGSDSGRIVILEYQP-SKNMFEKIHQETFGKSG 111

Query: 61   -RPTDNGQIGIIDPDCRLIGLHLYDG--LFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
             R    GQ   +DP  R + +   +   L  ++  D   +L  +  +   +   L    +
Sbjct: 112  CRRIVPGQFLAVDPKGRAVMISAIEKQKLVYILNRDAAARLTISSPLEAHKANTLVYHVV 171

Query: 118  ---YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNG 163
                G   P    L  D ++A +  T E A   +    F E     N+        L+  
Sbjct: 172  GVDVGFENPMFACLEMDYEEADNDPTGEAAANTQQTLTFYELDLGLNHVVRKYSEPLEEH 231

Query: 164  ADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR--- 215
             + LI VP     P   GVLI  E  I Y   N      IR  I +    +D D  R   
Sbjct: 232  GNFLITVPGGSDGP--SGVLICSENYITY--KNFGDQPDIRCPIPRRRNDLD-DPERGMI 286

Query: 216  --------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAV 261
                          +L     G +  + +  +++ VT ++++      +A+ +  L    
Sbjct: 287  FVCSATHKTKSMFFFLAQTEQGDIFKITLETDEDMVTEIRLKYFDTVPVAAAMCVLKTGF 346

Query: 262  VYIGSSYGDSQLIKLN------------------------LQPDAKGSYVEVLERYVNLG 297
            +++ S +G+  L ++                          QP    + V +++   +L 
Sbjct: 347  LFVASEFGNHYLYQIAHLGDDDEEPEFSSAMPLEEGDTFFFQPRPLKNLV-LVDELDSLS 405

Query: 298  PIVDFC-VVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRS 355
            PI+ FC + DL  +   Q+    G     SLR++R+G+ ++E A  EL G    +W++R 
Sbjct: 406  PIL-FCQIADLANEDTPQLYVACGRGPRSSLRVLRHGLEVSEMAVSELPGNPNAVWTVRR 464

Query: 356  STDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSG 415
              +D FD +++VSF++ T +L++   + +EE    GF   T TL C     + LVQV   
Sbjct: 465  HIEDEFDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSLLGDDALVQVYPD 522

Query: 416  SVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKH 473
             +R + +  R   NEWK+P   ++     N  QV++A  GG LVY E+   G L E  + 
Sbjct: 523  GIRHIRADKR--VNEWKTPGKKTIVKCAVNQRQVVIALTGGELVYFEMDPSGQLNEYTER 580

Query: 474  AQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIP-RS 532
             ++  ++ C+ +  +      S+  AVG+  D +VRI SL   + +  + L  + +P + 
Sbjct: 581  KEMSADVVCMSLANVPPGEQRSRFLAVGL-VDNTVRIISLDPSDCL--QPLSMQALPAQP 637

Query: 533  VLLCAFE----------------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQ 576
              LC  E                G  YL   L +G LL  +L+  TG+L+D +   LG++
Sbjct: 638  ESLCIVEMGGTEKQDELGERGSIGFLYLNIGLQNGVLLRTVLDPVTGDLSDTRTRYLGSR 697

Query: 577  PITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIA 636
            P+ L     +    V A S R  + YS   +   + ++ + +     F S   P+ +   
Sbjct: 698  PVKLFRVRMQGQEAVLAMSSRSWLSYSYQSRFHLTPLSYETLEFASGFASEQCPEGIVAI 757

Query: 637  KEGELTIGTIDDIQKLHIR-SIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVR 695
                L I  ++ +  +  + + PL   PR+     +S    I    + +  E ++    +
Sbjct: 758  STNTLRILALEKLGAVFNQVAFPLQYTPRKFVIHPESNNLIIIETDHNAYTEATKAQRKQ 817

Query: 696  LLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFI 755
                                   ILS        VY      Y L    E          
Sbjct: 818  ----------------------QILSPRSVAGGFVY-----TYKLVNNGE---------- 840

Query: 756  VEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGH 815
                KL+ + +   +    ++  F G++L  + + +++Y     D G ++L  +C  + H
Sbjct: 841  ----KLEFLHKTPVEEVPAAIAPFQGRVLIGVGKLLRVY-----DLGKKKLLRKC-ENKH 890

Query: 816  ILALY---VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIY 872
            I A Y   +QT G  ++V D+ +S   + YK  E  +   A D    W++   +LD D  
Sbjct: 891  I-ANYISGIQTIGHRVIVSDVQESFIWVRYKRNENQLIIFADDTYPRWVTTASLLDYDTV 949

Query: 873  LGAENNFNLFTVR--KNSEGATDE---------ERGRL-------EVVGEYHLGEFVNRF 914
             GA+   N+  VR   N+    DE         +RG L       EV+  YH+GE V   
Sbjct: 950  AGADKFGNICVVRLPPNTNDEVDEDPTGNKALWDRGLLNGASQKAEVIMNYHVGETVLSL 1009

Query: 915  RHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGG 973
            +  +L+        G   ++++ T++G IG++     +E + F + ++ +LR     + G
Sbjct: 1010 QKTTLI-------PGGSESLVYTTLSGGIGILVPFTSYEDHDFFQHVEMHLRSEHPPLCG 1062

Query: 974  LNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
             +H  +RS+         KN +DGDL E F  +   +   +S+ ++ +  E+ K++E++
Sbjct: 1063 RDHLSFRSY-----YFPVKNVIDGDLCEQFNSMEPNKQKNVSEELDRTPPEVSKKLEDI 1116


>gi|357528934|sp|Q5B1X8.2|RSE1_EMENI RecName: Full=Pre-mRNA-splicing factor rse1
 gi|259485100|tpe|CBF81885.1| TPA: Pre-mRNA-splicing factor rse1
            [Source:UniProtKB/Swiss-Prot;Acc:Q5B1X8] [Aspergillus
            nidulans FGSC A4]
          Length = 1209

 Score =  207 bits (527), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 281/1165 (24%), Positives = 477/1165 (40%), Gaps = 163/1165 (13%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
            ++G I TL  FR  G  +D++ I ++  +  ++++   S     R   +   + G R   
Sbjct: 64   VFGIIRTLAAFRLAGSNKDYIIIGSDSGRITIIEY-VPSQNRFNRIHLETFGKSGVRRVV 122

Query: 65   NGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLD---IKFLYG 119
             GQ   +DP  R  LI     + L  V+  +++ +L  +  +   + Q L    +    G
Sbjct: 123  PGQYLAVDPKGRACLIASVEKNKLVYVLNRNSQAELTISSPLEAHKPQTLVYSVVALDAG 182

Query: 120  CAKPTIVVLYQDNKDARHVKT-------------YEVALKDKDFVEGPWSQNNLDNGADL 166
               P    L  D  ++    T             YE+ L     V   W+ + +D  + +
Sbjct: 183  YENPVFAALEVDYSESDQDPTGRAYEEVEKLLVYYELDLGLNHVVRK-WT-DPVDRTSSM 240

Query: 167  LIPVPPPL---CGVLIIGEETIVYCSAN--AFKA-IPIRPS----------ITKAYGRVD 210
            L  VP       GVL+  E+ I Y  +N  AF+  IP R            IT       
Sbjct: 241  LFQVPGGADGPSGVLVCAEDNITYRHSNQDAFRVPIPRRKGAMENPERKRCITAGVMHKM 300

Query: 211  ADGSRYLLGDHAG-LLHLLVITHEKEK------VTGLKIELLGETSIASTISYLDNAVVY 263
                 +LL    G L  L +   E +K      V GLKI+      +AS++  L +  +Y
Sbjct: 301  RGAFFFLLQTEDGDLFKLTLDMVEDDKGQLTGEVKGLKIKYFDTVPLASSLLILKSGFLY 360

Query: 264  IGSSYGDSQLIKL-----------------NLQPDAKGSYV----------EVLERYVNL 296
            + +  G+    +                  +  P A  + V           ++E   +L
Sbjct: 361  VAAEGGNHHFYQFEKLGDDDEETEFNSDDFSADPAAPCTPVYFQPRGAENLNLVEAINSL 420

Query: 297  GPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSLRS 355
             P+VD  VV++      Q+ T SG     + R +++G+ ++E    EL  +   +W+ + 
Sbjct: 421  NPLVDSKVVNISEDDAPQIFTVSGTGARSTFRTLKHGLEVSEIVESELPSVPSAVWTTKL 480

Query: 356  STDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSG 415
            +  D FD ++V+SF + T +L++   + +EE    GF S   TL       + L+Q+   
Sbjct: 481  TRADEFDAYIVLSFANGTLVLSIG--ETVEEVTDTGFLSSAPTLAVQQLGEDSLIQIHPR 538

Query: 416  SVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTEV-KH 473
             +R + +  R   NEW +P   S+  A  N  QV +A   G +VY E+  DG L E  + 
Sbjct: 539  GIRHILADRR--VNEWPAPQHRSIVAAATNERQVAVALSSGEIVYFELDADGSLAEYDER 596

Query: 474  AQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPRS 532
             Q+   ++CL +  + E    S   AVG   D +VRI SL PD  L  K        P +
Sbjct: 597  RQMSGTVTCLSLGEVPEGRVRSSFLAVGC-DDSTVRILSLDPDTTLENKSVQALTAAPSA 655

Query: 533  VLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSK 586
            + + A    S      YL   L  G  L   L+  TGEL+D +   LG++ + L   S  
Sbjct: 656  LNIIAMADSSSGGTTLYLHIGLHSGVYLRTALDEVTGELSDTRTRFLGSKAVKLFQVSVT 715

Query: 587  NTTHVFAASDRPTVIYS--SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIG 644
              T V A S RP + YS    K  + + ++   +     F+S    + +   +   L I 
Sbjct: 716  GQTAVLALSSRPWLGYSDTQTKGFMLTPLDYVGLEWGWNFSSEQCVEGMVGIQGQNLRIF 775

Query: 645  TIDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFE 703
            +I+ +   +  +SIPL   PR      +   F +    N   +  +     RLL+D    
Sbjct: 776  SIEKLDNNMLQQSIPLAYTPRHFIKHPEEPLFYVIEADNNVLSPATR---ARLLEDSKAR 832

Query: 704  FIST---------YP--------------------------LDTFEYGCSILSCSF-SDD 727
               T         YP                          L+  E   SI +  F S D
Sbjct: 833  GGDTTVLPPEDFGYPRGTGHWASCIQIIDPLDAKAVVGAVELEENEAAVSIAAVPFTSQD 892

Query: 728  SNVYYCVGTAYVLPEENEPTK--GRILVF-IVEDGK-LQLIAEKETKGAVYSLNAFNGKL 783
               +  VGTA  +   N P+   G I ++   EDGK L+ I + + +    +L  F G+L
Sbjct: 893  DETFLVVGTAKDM-TVNPPSSAGGYIHIYRFQEDGKELEFIHKTKVEEPPLALLGFQGRL 951

Query: 784  LAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY-VQTRGDFIVVGDLMKSISLLIY 842
            LA +   +++Y     D G ++L  +C       A+  +QT+G  IVV D+ +S++ ++Y
Sbjct: 952  LAGVGSVLRIY-----DLGMKQLLRKCQAAVAPKAIVGLQTQGSRIVVSDVRESVTYVVY 1006

Query: 843  KHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDEERGRL 899
            K+++  +     D  A W +A  ++D +   G +   NL+ VR   K SE A +E  G  
Sbjct: 1007 KYQDNVLIPFVDDSIARWTTAATMVDYETTAGGDKFGNLWLVRCPKKASEEADEEGSGAH 1066

Query: 900  EVVGEYHLGEFVNRFRHGSLVMRLPDSDV-----------GQIPTVIFGTVNGVIGVIAS 948
             +    +L    NR     L++ +   D+           G    +++    G IG++  
Sbjct: 1067 LIHDRGYLQGTPNRLE---LMIHVFTQDIPTSLHKTQLVAGGRDILVWTGFQGTIGILVP 1123

Query: 949  -LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLS 1007
             +  E   F + L+  L      + G +H  +RS+    K V     +DGDL E +  LS
Sbjct: 1124 FVSREDVDFFQSLEMQLASQCPPLAGRDHLIYRSYYAPVKGV-----IDGDLCEQYFLLS 1178

Query: 1008 RTRMDEISKTMNVSVEELCKRVEEL 1032
                  I+  ++ SV E+ +++ ++
Sbjct: 1179 NDTKMMIAAELDRSVREIERKISDM 1203


>gi|225560964|gb|EEH09245.1| pre-mRNA-splicing factor rse1 [Ajellomyces capsulatus G186AR]
          Length = 1209

 Score =  207 bits (527), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 270/1174 (22%), Positives = 485/1174 (41%), Gaps = 181/1174 (15%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
            ++G I  L  FR  G ++D++ I ++  +  ++++   S     R   +   + G R   
Sbjct: 64   VFGIIRALAAFRLAGSSKDYIIIGSDSGRITIIEY-VPSQNRFNRIHLETFGKSGIRRVI 122

Query: 65   NGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YG 119
             GQ   +DP  R  LI     + L  V+  + + +L  +  +   + Q L         G
Sbjct: 123  PGQYLAVDPKGRACLIASVEKNKLVYVLNRNTQAELTISSPLEAHKPQTLVFALTALDVG 182

Query: 120  CAKPTIVVLYQDNKDA-------------RHVKTYEVALKDKDFVEGPWSQNNLDNGADL 166
               P    L  +  +A             +H+  YE+ L     V   WS + +D  A +
Sbjct: 183  YDNPIFAALEVEYTEADQDPTGQAYEELEKHLVYYELDLGLNHVVRK-WS-DPVDRSASM 240

Query: 167  LIPVPPPL---CGVLIIGEETIVYCSAN--AFKA-IPIRPSITKAYGRVDADGSRYLLGD 220
            L  VP       G L+  E+ I Y  +N  AF+  IP R   T+     + +  RY+   
Sbjct: 241  LFQVPGGADGPSGCLVCAEDNITYRHSNQDAFRVPIPRRSGPTE-----NPERKRYIT-- 293

Query: 221  HAGLLH---------------------LLVITHEKEKVTG----LKIELLGETSIASTIS 255
             AG++H                     L ++  +  ++TG    LK++      IAS++ 
Sbjct: 294  -AGVMHKMRGAFFFLLQTEDGDLFKVTLDMVEDDNGQITGEVSRLKLKYFDTVPIASSLC 352

Query: 256  YLDNAVVYIGSSYGDSQLIKL-------------------NLQPDAKGSY--------VE 288
             L +  +++ S  G+    +                    +L      +Y        V 
Sbjct: 353  ILKSGFLFVASETGNHHFYQFEKLGDDDDEIEFTSDAYSADLSEPLPPAYFRPRPYENVN 412

Query: 289  VLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI- 347
            ++E   +L P++D  + +L      Q+ T  G     S R +++G+ + E    EL  + 
Sbjct: 413  LVESIDSLNPLMDCKIANLTDDDAPQIYTICGTGARSSFRTLKHGLEVAEIVESELPSVP 472

Query: 348  KGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYN 407
              +W+ + + +D FD ++++SF + T +L++   + +EE    GF S   TL       +
Sbjct: 473  SAVWTTKLTREDEFDAYIILSFTNGTLVLSIG--ETVEEVTDTGFLSSAPTLAVQQLGED 530

Query: 408  QLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DG 466
             L+QV    +R + +  R   NEW +P   S+  AT N  QV +A   G +VY E+  DG
Sbjct: 531  SLIQVHPKGIRHIHADRR--VNEWPAPQHRSIVAATTNERQVAIALSSGEIVYFEMDTDG 588

Query: 467  ILTEVKHAQ-LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHL 524
             L E    + +   ++CL +  +    + S   AVG   D +VRI SL P+  L  K   
Sbjct: 589  SLAEYDEKRAMSGTVTCLSLGEVPRGRARSSFLAVGC-DDSTVRILSLDPESTLENKSVQ 647

Query: 525  GGEIIPRSVLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPI 578
                 P ++ + A    +      YL   L  G  L  +L+  TGEL+D +   LG +P+
Sbjct: 648  ALTSAPSALSIMAMSDSTSGGSTLYLHIGLYSGIYLRTVLDEVTGELSDTRTRFLGLKPV 707

Query: 579  TLRTFSSKNTTHVFAASDRPTVIYS--SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIA 636
             L   S K  T V A S RP + YS    K  + + ++   +     F+S    + +   
Sbjct: 708  KLFQVSVKEQTAVLALSSRPWLGYSHLQTKAFMLTPLDYVGLEWGWNFSSEQCVEGMVGI 767

Query: 637  KEGELTIGTIDDIQ-KLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVR 695
            +   L I +++ ++  L   +IPL   PR      +   F +   +N   +  +     +
Sbjct: 768  QGQNLRIFSLEKLENNLLQETIPLQYTPRHFIKHPEYPLFYVIEAENNILSPGTR---TK 824

Query: 696  LLDDQTFEFISTYPLDTFEYG-----------------------------------CSIL 720
            LL+D       T PL   E+G                                    S+ 
Sbjct: 825  LLNDSDAVNGDTTPLPPEEFGYPRGTGHWASCIQIVDPINSKRVISQIELEENEAAVSVA 884

Query: 721  SCSFSD-DSNVYYCVGTAY-VLPEENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVYSL 776
            +  FS  D   +  VGT   ++      T G I ++   E+GK L+ I + + +    +L
Sbjct: 885  AVPFSSQDDETFLVVGTGKDMVVNPRSCTAGFIHIYRFQEEGKELEFIHKTKVEQPPMAL 944

Query: 777  NAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKS 836
              F G+LLA I   +++Y   ++    R+ Q+    H   L + +QT+G  I+V D+ +S
Sbjct: 945  LGFQGRLLAGIGTDLRIYDLGMK-QMLRKCQASVVPH---LVVGLQTQGSRIIVSDVQES 1000

Query: 837  ISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATD 893
            ++ ++YK++E  +     D  + W +   ++D +   G +   NL+ +R   K SE A +
Sbjct: 1001 LTYVVYKYQENRLIPFVDDVISRWTTCTTMVDYETVAGGDKFGNLWLLRCPAKASEEADE 1060

Query: 894  EERG---------------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGT 938
            +  G               RL +V  ++  +     +   LV    D  V    T + GT
Sbjct: 1061 DGSGAHLIHERQYLQGAPNRLNLVAHFYPQDLPTSIQKAQLVTGGRDILVW---TGLQGT 1117

Query: 939  VNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGD 998
            V+ +I  I+    E+  F + L+  L      + G +H  +RS+        AK  +DGD
Sbjct: 1118 VSMLIPFIS---REEVDFFQSLEMQLAAQNPPLAGRDHLIYRSY-----YAPAKGTIDGD 1169

Query: 999  LIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            L E++L L   +  +I+  ++ SV E+ +++ ++
Sbjct: 1170 LCETYLLLPNDKKQQIAGELDRSVREIERKIADM 1203


>gi|302406266|ref|XP_003000969.1| pre-mRNA-splicing factor rse-1 [Verticillium albo-atrum VaMs.102]
 gi|261360227|gb|EEY22655.1| pre-mRNA-splicing factor rse-1 [Verticillium albo-atrum VaMs.102]
          Length = 1059

 Score =  207 bits (526), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 248/989 (25%), Positives = 424/989 (42%), Gaps = 143/989 (14%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
            ++G I ++  FR  G  +D+L +AT+  +  ++++    +      +        R    
Sbjct: 64   VFGIIRSMAAFRIAGSNKDYLILATDSGRIAIIEYLPAQNRFQRLHLETFGKSGIRRVVP 123

Query: 66   GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEE--LQVLDIKFL-YGC 120
            G+    DP  R  LI     + L  V+  +++ +L  +  +   +  + VL +  L  G 
Sbjct: 124  GEFLACDPKGRACLIASLEKNKLVYVLNRNSQAELTISSPLEAHKPGVHVLSMVALDVGY 183

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDK------------DFVEGPWSQNNLDNGADLLI 168
            A P    L  D  +A    T + AL  +            + V   WS+  +DN A LL 
Sbjct: 184  ANPVFAALETDYTEADQDPTGQAALDVETQLVYYELDLGLNHVVRKWSEP-VDNTASLLF 242

Query: 169  PVP-----PPLCGVLIIGEETIVYCSAN--AFKA-IPIRPSITK--AYGRVDADGSRYLL 218
             VP     P   GVL+ GEE I Y  +N  AF+  +P R   T+  +  R    G  + L
Sbjct: 243  QVPGGNDGP--SGVLVCGEENITYRHSNQEAFRVPVPRRRGATEDPSRKRSIVAGVMHKL 300

Query: 219  GDHAGLLHLLVITHEKE------------------KVTGLKIELLGETSIASTISYLDNA 260
               AG    L+ T + +                  +V  LKI+      +AS++  L + 
Sbjct: 301  KGSAGAFFFLLQTEDGDLFKITIDMIEDRDGNPTGEVKRLKIKYFDTIPVASSLCILKSG 360

Query: 261  VVYIGSSYGDSQLIKL---------------NLQPDAKGSY------------VEVLERY 293
             +Y+ S +G+ Q  +                +   D K SY            + ++E  
Sbjct: 361  FLYVASQFGNYQFYQFEKLGDDDEELEFSSDDFPTDPKQSYEAVFFHPRELENLALVESI 420

Query: 294  VNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWS 352
             ++ P++D  V +L  +   Q+ T  G     + RI+++G+ +NE  + EL GI   +W+
Sbjct: 421  DSMNPLIDCKVANLTGEDAPQIYTACGNGARSTFRILKHGLEVNEIVASELPGIPSAVWT 480

Query: 353  LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
            L+ S  D +D ++V+SF + T +L++   + +EE    GF +   TL         L+QV
Sbjct: 481  LKLSRGDQYDAYIVLSFTNATLVLSIG--ETVEEVNDSGFLTSVPTLAAQLLGGEGLIQV 538

Query: 413  TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTEV 471
                +R + +      NEW +P   S+  AT N  QV +A   G +VY E+  DG L E 
Sbjct: 539  HPKGIRHIRNGQV---NEWAAPQHRSIVAATTNEHQVAVALSSGEIVYFEMDSDGSLAEY 595

Query: 472  KHAQLEY-EISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEII 529
               +  +  ++CL +  + E    S   AVG   D +VR+ SL P+  L +K        
Sbjct: 596  DEKKEMFGTVTCLSLGEVPEGRLRSSFLAVGC-DDSTVRVLSLDPETTLESKSVQALTAP 654

Query: 530  PRSVLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTF 583
            P S+ + A E  S      YL   L  G  L  +L+  TGELTD ++  LG +P+ L   
Sbjct: 655  PTSLAVIAMEDSSSGGSTLYLHIGLHSGVYLRTVLDEVTGELTDTRQKFLGPKPVKLFQV 714

Query: 584  SSKNTTHVFAASDRPTVIYSS--NKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGEL 641
            + +  T V   S RP + Y+    K  + + +N  ++     F+S    + +   +   L
Sbjct: 715  TVQGRTCVLGLSSRPWLGYADPITKGFVVTPLNYVDLEWGWNFSSEQCEEGVVGIQGQSL 774

Query: 642  TIGTIDDIQ-KLHIRSIPLGEHPRRIC-HQEQSRTFAICS--------LKNQSCAEES-- 689
             I  I+ +   L  +SIPL   PRR+  H E    + I S        L+ Q  A+ S  
Sbjct: 775  RIFAIEKLSDTLTQKSIPLTYTPRRMVKHPEHPMFYTIESDNNTLPPELRAQLLADPSVV 834

Query: 690  ----------EMHFVR----------LLD--DQTFEFISTYPLDTFEYGCSILSCSF-SD 726
                      E  + R          ++D   +  + + T  LD  E   S     F S 
Sbjct: 835  NGDARTLPPVEFGYPRAKGRWASCISVIDPLSEELQTLQTVDLDNNEAAVSAAIVPFTSQ 894

Query: 727  DSNVYYCVGTAY-VLPEENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVYSLNAFNGKL 783
            D+  +  VGT   ++    + T+G I ++   EDG+ L+ I + + +    +L AF G+L
Sbjct: 895  DNESFLVVGTGKDMIVNPRQFTEGYIHIYRFSEDGRELEFIHKTKVEEPPTALLAFQGRL 954

Query: 784  LAAINQKIQLY----KWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISL 839
            +A + + +++Y    K ML     R+ Q++       L + + T+G  IVVGD+ + ++ 
Sbjct: 955  VAGVGKTLRIYDLGQKQML-----RKAQADVAPQ---LIVSLSTQGSRIVVGDVQQGVTY 1006

Query: 840  LIYKHEEGAIEERARDYNANWMSAVEILD 868
            ++YK     +     D  A WM+   ++D
Sbjct: 1007 VVYKALSNKLIPFVDDTVARWMTCTTMVD 1035


>gi|332263858|ref|XP_003280968.1| PREDICTED: splicing factor 3B subunit 3 [Nomascus leucogenys]
          Length = 1271

 Score =  207 bits (526), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 270/1199 (22%), Positives = 495/1199 (41%), Gaps = 203/1199 (16%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            +L V ++G I +L  FR  G  ++++ + ++  +  +L++   S  +  +   +   + G
Sbjct: 103  LLIVEVFGVIRSLMAFRVTGGTKEYIVVGSDSGRIVILEYQP-SKNMFEKIHQETFGKSG 161

Query: 61   -RPTDNGQIGIIDPDCRLIGL---HLYDGLFKVIPFDNKGQLKEAFNIRLEELQVL---- 112
             R    GQ   +DP  R + +   H    L  ++  D   +L  +  +   +   L    
Sbjct: 162  CRRIVPGQFLAVDPKGRAVMISKCHEKQKLVYILNRDAAARLTISSPLEAHKANTLVYHV 221

Query: 113  ---DIKF---LYGCAK---------PTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQ 157
               D+ F   ++ C +         P I          + +  YE+ L     V      
Sbjct: 222  VGVDVGFENPMFACLEMDYEIFDCMPLISFYATVLCSQQTLTFYELDLGLNHVVRK--YS 279

Query: 158  NNLDNGADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDAD 212
              L+   + LI VP     P   GVLI  E  I Y   N      IR  I +    +D D
Sbjct: 280  EPLEEHGNFLITVPGGSDGP--SGVLICSENYITY--KNFGDQPDIRCPIPRRRNDLD-D 334

Query: 213  GSR-----------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTIS 255
              R                 +L     G +  + +  +++ VT ++++      +A+ + 
Sbjct: 335  PERGMIFVCSATHKTKSMFFFLAQTEQGDIFKITLETDEDMVTEIRLKYFDTVPVAAAMC 394

Query: 256  YLDNAVVYIGSSYGDSQLIKLN------------------------LQPDAKGSYVEVLE 291
             L    +++ S +G+  L ++                          QP    + V +++
Sbjct: 395  VLKTGFLFVASEFGNHYLYQIAHLGDDDEEPEFSSAMPLEEGDTFFFQPRPLKNLV-LVD 453

Query: 292  RYVNLGPIVDFC-VVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKG 349
               +L PI+ FC + DL  +   Q+    G     SLR++R+G+ ++E A  EL G    
Sbjct: 454  ELDSLSPIL-FCQIADLANEDTPQLYVACGRGPRSSLRVLRHGLEVSEMAVSELPGNPNA 512

Query: 350  MWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQL 409
            +W++R   +D FD +++VSF++ T +L++   + +EE    GF   T TL C     + L
Sbjct: 513  VWTVRRHIEDEFDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSLLGDDAL 570

Query: 410  VQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGIL 468
            VQV    +R + +  R   NEWK+P   ++     N  QV++A  GG LVY E+   G L
Sbjct: 571  VQVYPDGIRHIRADKR--VNEWKTPGKKTIVKCAVNQRQVVIALTGGELVYFEMDPSGQL 628

Query: 469  TE-VKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGE 527
             E  +  ++  ++ C+ +  +      S+  AVG+  D +VRI SL   + +  + L  +
Sbjct: 629  NEYTERKEMSADVVCMSLANVPPGEQRSRFLAVGL-VDNTVRIISLDPSDCL--QPLSMQ 685

Query: 528  IIP-RSVLLCAFE----------------GISYLLCALGDGHLLNFLLNMKTGELTDRKK 570
             +P +   LC  E                G  YL   L +G LL  +L+  TG+L+D + 
Sbjct: 686  ALPAQPESLCIVEMGGTEKQDELGERGSIGFLYLNIGLQNGVLLRTVLDPVTGDLSDTRT 745

Query: 571  VSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFP 630
              LG++P+ L     +    V A S R  + YS   +   + ++ + +     F S   P
Sbjct: 746  RYLGSRPVKLFRVRMQGQEAVLAMSSRSWLSYSYQSRFHLTPLSYETLEFASGFASEQCP 805

Query: 631  DSLAIAKEGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQEQSR-----------TFAI 677
            + +       L I  ++ +  +  + + PL   PR+ + H E +            T A 
Sbjct: 806  EGIVAISTNTLRILALEKLGAVFNQVAFPLQYTPRKFVIHPESNNLIIIETDHNAYTEAT 865

Query: 678  CSLKNQSCAEE-------------SEM-------------------------HFVRLLDD 699
             + + Q  AEE             +EM                           +R+++ 
Sbjct: 866  KAQRKQQMAEEMVEAAGEDERELAAEMAAAFLNENLPESIFGAPKAGNGQWASVIRVMNP 925

Query: 700  QTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC-VGTAYVLPEENEPTKGRILVF--IV 756
                 +    L+  E   S+  C FS+    +Y  VG A  L        G  +    +V
Sbjct: 926  IQGNTLDLVQLEQNEAAFSVAVCRFSNTGEDWYVLVGVAKDLILNPRSVAGGFVYTYKLV 985

Query: 757  EDG-KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGH 815
             +G KL+ + +   +    ++  F G++L  + + +++Y     D G ++L  +C  + H
Sbjct: 986  NNGEKLEFLHKTPVEEVPAAIAPFQGRVLIGVGKLLRVY-----DLGKKKLLRKC-ENKH 1039

Query: 816  ILALY---VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIY 872
            I A Y   +QT G  ++V D+ +S   + YK  E  +   A D    W++   +LD D  
Sbjct: 1040 I-ANYISGIQTIGHRVIVSDVQESFIWVRYKRNENQLIIFADDTYPRWVTTASLLDYDTV 1098

Query: 873  LGAENNFNLFTVR--KNSEGATDE---------ERGRL-------EVVGEYHLGEFVNRF 914
             GA+   N+  VR   N+    DE         +RG L       EV+  YH+GE V   
Sbjct: 1099 AGADKFGNICVVRLPPNTNDEVDEDPTGNKALWDRGLLNGASQKAEVIMNYHVGETVLSL 1158

Query: 915  RHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGG 973
            +  +L+        G   ++++ T++G IG++     HE + F + ++ +LR     + G
Sbjct: 1159 QKTTLI-------PGGSESLVYTTLSGGIGILVPFTSHEDHDFFQHVEMHLRSEHPPLCG 1211

Query: 974  LNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
             +H  +RS+         KN +DGDL E F  +   +   +S+ ++ +  E+ K++E++
Sbjct: 1212 RDHLSFRSY-----YFPVKNVIDGDLCEQFNSMEPNKQKNVSEELDRTPPEVSKKLEDI 1265


>gi|384490247|gb|EIE81469.1| hypothetical protein RO3G_06174 [Rhizopus delemar RA 99-880]
          Length = 1197

 Score =  207 bits (526), Expect = 4e-50,   Method: Compositional matrix adjust.
 Identities = 266/1177 (22%), Positives = 485/1177 (41%), Gaps = 195/1177 (16%)

Query: 7    YGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTDN 65
            +G I ++  FR  G ++DF+ + ++  +  +L+++   + +  +   +   + G R    
Sbjct: 59   FGLIRSIAPFRLTGGSKDFIVVGSDSGRIVILEYNPVKN-IFEKVHQETYGKTGCRRIVP 117

Query: 66   GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIR-------LEELQVLDIKF 116
            GQ    DP  R  +I       L  ++  D+   L  +  +        +  +  +D+ F
Sbjct: 118  GQYLTADPKGRAVMISATEKQKLVYILNRDSAANLTISSPLEAHRSHAIIHHIVAVDVGF 177

Query: 117  LYGCAKPTIVVLYQDNKDARHVKTYEVALKDK------------DFVEGPWSQNNLDNGA 164
                  PT   L  D  DA    T E A   +            + V   WS + +D  A
Sbjct: 178  ----ENPTFACLEVDYADADTDSTGEAAQNAEKILTYYELDLGLNHVVRKWS-DVVDRKA 232

Query: 165  DLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLL- 218
            +LL+PVP     P   GVL+  E  I Y   +A +   +R  I +    ++ D  R ++ 
Sbjct: 233  NLLVPVPGGNDGP--SGVLVCTENFIAYKHPDAEE---LRIPIPRRAQPLE-DPKRGIII 286

Query: 219  ---GDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQ--- 272
                   G L+ + + H +  V GL I+       A ++S L +  ++  S +G+ +   
Sbjct: 287  IASASEEGDLYKVTMDHVQGTVQGLTIKYFDTIPPAISLSILKSGFLFAASEFGNHRHYF 346

Query: 273  -----------------------------LIKLNLQPDAKGSYVEVLERYVNLGPIVDFC 303
                                         L+    +P    S V+ LE   ++ P+++  
Sbjct: 347  FEGLGDDDDDAEISSAVYMEQETYEGGIPLVYFKPRPLKNLSLVDELE---SMAPLMESK 403

Query: 304  VVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTDDPFD 362
            V++L  +   ++    G     + RI+  G+   E A  EL G    +W+ +   DD + 
Sbjct: 404  VLNLADEETPRIYAICGRGAQSTFRILNQGVEAAELAVSELPGNPSAVWTTKLRADDQYH 463

Query: 363  TFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSS 422
             ++VVSF + T +L++   + +EE    GF +   TL       + LVQV    +R + +
Sbjct: 464  AYIVVSFANATLVLSIG--ETVEEVTDTGFLTNAPTLAVQQIGEDALVQVHPHGIRHIRA 521

Query: 423  TSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD-GILTE-VKHAQLEYEI 480
              R   NEW++P G ++  A  N+ Q+ +A   G +VY E+ + G L E  +H Q+   I
Sbjct: 522  DRR--VNEWRAPQGQTIVEAATNSRQIAIALSNGEIVYFEMDNMGQLNEHQEHRQMSAYI 579

Query: 481  SCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PD--LNLITKEHLGGEIIPRSVLLCA 537
            + L +  + E    ++  AVG   D +VRI SL PD  L  I+ + L G  +P S  LC 
Sbjct: 580  TTLALGEVPEGRVRARYIAVGC-EDQTVRILSLDPDSCLEPISMQALQG--VPSS--LCI 634

Query: 538  FE----------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKN 587
             E          G  YL   L +G  L  +L+  TG+L+D +   +G + + L   S + 
Sbjct: 635  IEMMDTGIEQGHGTQYLNIGLSNGIFLRTILDTITGQLSDTRARFIGAKSVKLFRISIQG 694

Query: 588  TTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTID 647
               V A S +P V Y+   +L  + ++ + + H   F +   P+ +       L I T++
Sbjct: 695  HPAVLALSTKPWVSYTFQNRLYLTPLSYEMLEHGSAFVTEQCPEGVVAVAGNTLRIFTVE 754

Query: 648  DIQKLHIR-SIPLGEHPRRICHQEQSRTFAIC---------SLKNQSCAEESEMHF---- 693
             +  +  + SIPL   PR+      +RTF +          S K +   E+    F    
Sbjct: 755  KLGNIFNQVSIPLKYTPRKFALHAPTRTFVVIESDHATFSPSEKAKGIIEKENEGFEIDD 814

Query: 694  -----------------------VRLLDDQTFEFISTYPLDTFEYGCSILSCSF------ 724
                                   +RL++      + T  L+  E   S+    F      
Sbjct: 815  DITNLDPLQFGHVRNAAGRWASCIRLIEPFEGRTLETIELEDNEAAFSVAMVQFRQNPHA 874

Query: 725  SDDSNVYYCVGTAY-VLPEENEPTKGRILVFIVEDG-----KLQLIAEKETKGAVYSLNA 778
            ++ S  +  VGT   V       T G + V+ V  G     +L  I +       Y++ A
Sbjct: 875  TNSSEQFVIVGTGQNVNLSPRSCTSGYLHVYRVVQGEQGQLRLHFIHKTPIDDVPYAMLA 934

Query: 779  FNGKLLAAINQKIQLY----KWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLM 834
            F G+LL    + +++Y    K MLR   T+ + + C     I++L+  T+G  ++  D+ 
Sbjct: 935  FQGRLLVGAGKSLRIYDIGKKKMLRKCETKSIPN-C-----IVSLH--TQGHRVIATDVQ 986

Query: 835  KSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR--------- 885
            +S+  +IYKH +  +   A D    WM+   ++D +   G +   N F  R         
Sbjct: 987  ESVHYVIYKHADNRLVVFADDTIPRWMTGSTMVDYETVAGGDKFGNFFVSRLPGSISREV 1046

Query: 886  ---------KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIF 936
                      + +G       +++ + EY  G+ +      +L+        G    V+ 
Sbjct: 1047 DEDTTGNRIYHEKGYLQGAPNKIDSLCEYFTGDIITSLHKTTLLS-------GGREVVLT 1099

Query: 937  GTVNGVIGV-IASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFL 995
             +  G I + +  L  E   F + L+ ++R     + G +H  +RS+      +  K+ +
Sbjct: 1100 TSFLGAISIYVPFLSKEDVEFFQMLEMHMRAEAPPLAGRDHLLYRSY-----YIPVKSVI 1154

Query: 996  DGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            DGDL E F  L+  +   I++ ++ SV ++ K++E++
Sbjct: 1155 DGDLCEQFNTLAAEKKRMIAEELDRSVADVQKKIEDM 1191


>gi|31213215|ref|XP_315551.1| AGAP005549-PA [Anopheles gambiae str. PEST]
 gi|21299714|gb|EAA11859.1| AGAP005549-PA [Anopheles gambiae str. PEST]
          Length = 1217

 Score =  206 bits (524), Expect = 5e-50,   Method: Compositional matrix adjust.
 Identities = 262/1189 (22%), Positives = 477/1189 (40%), Gaps = 187/1189 (15%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            +L   ++G + +L  FR  G ++D+  I ++  +  +L+++   ++L             
Sbjct: 53   LLQTEVFGVVRSLMSFRLTGGSKDYAVIGSDSGRIVILEYNPAKNQLEKVHQETFGKSGC 112

Query: 61   RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL- 117
            R    GQ   IDP  R  +IG      L  ++  D++ +L  +  +   +   L    + 
Sbjct: 113  RRIVPGQYLAIDPKGRAVMIGAVEKQKLVYILNRDSEARLTISSPLEAHKSNTLTYHMVG 172

Query: 118  --YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNGA 164
               G   P    L  D ++A    T E A K +    F E     N+        L+  A
Sbjct: 173  VDVGFENPMFACLEIDYEEADTDPTGEAATKTQQTLTFYELDLGLNHVVRKYSEPLEEHA 232

Query: 165  DLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR---- 215
            + LI VP     P   GVLI  E  + Y   N      IR  I +    +D D  R    
Sbjct: 233  NFLISVPGGNDGP--SGVLICSENYLTY--KNLGDQHDIRCPIPRRRNDLD-DPERGMIF 287

Query: 216  -------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVV 262
                         +L+    G +  + +  + + V+ +K++       A+ +  L    +
Sbjct: 288  ICSATHRTKSMYFFLVQTEQGDIFKVTLETDDDVVSEIKLKYFDTVPPATAMCVLKTGFL 347

Query: 263  YIGSSYGDSQLIKL-------------NLQPDAKGSYVEVLERYV----------NLGPI 299
            ++   +G+  L ++             +  P  +G       R +          +  PI
Sbjct: 348  FVACEFGNHYLYQIAHLGDDDDEPEFSSAMPLEEGDTFFFAPRQLKNLVMVDDIPSYAPI 407

Query: 300  VDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTD 358
            +   V DL  +   Q+    G     S+R++R+G+ ++E A  EL G    +W+++   D
Sbjct: 408  LGCQVADLANEDTPQLYLACGRGPRSSIRVLRHGLEVSEMAVSELPGNPNAVWTVKKRID 467

Query: 359  DPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVR 418
            D FD +++VSF++ T +L++   D +EE    GF   T TL C     + LVQV    +R
Sbjct: 468  DEFDAYIIVSFVNATLVLSIG--DTVEEVTDSGFLGTTPTLCCSALGDDALVQVYPDGIR 525

Query: 419  LVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD-GILTE-VKHAQL 476
             + +  R   NEWK+P   ++     N  QV++A  GG LVY E+   G L E  +  ++
Sbjct: 526  HIRADKR--VNEWKAPGKKTIMKCAVNQRQVVIALSGGELVYFEMDPTGQLNEYTERKKM 583

Query: 477  EYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSV-LL 535
              E+ C+ +  +      S   AVG+  D +VRI SL   + ++   +  + +P +   L
Sbjct: 584  PSEVMCMALGSVPSGEQRSWFLAVGL-ADNTVRIISLDPTDCLSPRSM--QALPSAAESL 640

Query: 536  CAFE-----------------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPI 578
            C  E                 G  YL   L +G LL  +L+  +G+L D +   LG++P+
Sbjct: 641  CIVEMGTVETSSEDDGVTITTGCIYLNIGLTNGVLLRTVLDPVSGDLADTRTRYLGSRPV 700

Query: 579  TLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKE 638
             L     + +  V A S R  + Y    +   + ++ + + +   F+S    + +     
Sbjct: 701  KLFRIQMQGSEAVLAMSSRSWLSYYYQNRFHLTPLSYETLEYASGFSSEQCSEGIVAIST 760

Query: 639  GELTIGTIDDIQKLHIR-SIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLL 697
              L I  ++ +  +  + + PL   P+R    +++    I    + +  EE++    + +
Sbjct: 761  NTLRILALEKLGAVFNQITFPLEYTPKRFAIHQETGKLIISETDHNAYTEETKTVRKKQM 820

Query: 698  DDQTFE----------------FIS-TYPLDTF--------------------------- 713
             D+  E                FI+   P D F                           
Sbjct: 821  ADEMREAAGEDEQELANEMADAFINEVLPEDVFSSPKAGTGMWASQIRVMDPINGHTYSK 880

Query: 714  ------EYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRIL-VFIVEDGKLQL--I 764
                  E   S+    F+ D   Y   G A  L    + + G  + V+ V+    QL  +
Sbjct: 881  VQLAQNEAVLSLALVRFAVDQKWYVVAGVAKDLQINPKISGGGFIDVYKVDSQTHQLEHM 940

Query: 765  AEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHI--LALYVQ 822
               E   A  +L  F G+LLA I + +++Y     D G ++L  +C  + HI    + +Q
Sbjct: 941  HRTEIDDAPGALCPFQGRLLAGIGKVLRIY-----DLGKKKLLRKC-ENKHIPNQIVNIQ 994

Query: 823  TRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLF 882
              G  + V D+ +S+  + YK  E  +   A D +  W+++  +LD D     +   N+ 
Sbjct: 995  GMGQRVYVSDVQESVYCIKYKRAENQLIIFADDTHPRWITSASLLDYDTVATGDKFGNIA 1054

Query: 883  TVRKNSEGATD-----------EERGRL-------EVVGEYHLGEFVNRFRHGSLVMRLP 924
             +R     + D            +RG L       E +  +HLGE V   +  +L+    
Sbjct: 1055 ILRLPHSVSDDVDEDPTGNKALWDRGLLNGASQKAENICTFHLGEIVMSLQKATLI---- 1110

Query: 925  DSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFN 983
                G   ++I+ T++G +G +      E Y F + L+ ++R     + G +H  +RS+ 
Sbjct: 1111 ---PGGSESLIYATMSGTVGALVPFTSREDYDFFQHLEMHMRNENPPLCGRDHLSYRSY- 1166

Query: 984  NEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
                    KN +DGDL E F  L   +   I+  +  +  E+ K++E++
Sbjct: 1167 ----YYPVKNVMDGDLCEQFTSLDPAKQKSIASDLGRTPSEVAKKLEDI 1211


>gi|324501533|gb|ADY40680.1| Splicing factor 3B subunit 3 [Ascaris suum]
          Length = 1214

 Score =  206 bits (524), Expect = 6e-50,   Method: Compositional matrix adjust.
 Identities = 264/1183 (22%), Positives = 504/1183 (42%), Gaps = 189/1183 (15%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
            ++G I +L  FR  G ++D++ + ++  +  +L+++ +      R   +   + G R   
Sbjct: 59   VFGIIRSLLAFRLTGGSKDYIVVGSDAGRIVILEYNPQKV-CFERVHQETFGKTGCRRIV 117

Query: 65   NGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YG 119
             GQ   +DP  R  LIG      L  V+  D    L  +  +   +   +    +    G
Sbjct: 118  PGQHLAVDPKGRAILIGAVERQKLVYVMNRDAAANLTISSPLEAHKSHCICYSIVGVDVG 177

Query: 120  CAKPTIVVLYQD-------------NKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADL 166
               PT   L  D             NK  + +  YE+ L     V   +++  +D G ++
Sbjct: 178  FENPTFACLEVDYEEVDHDPSGHLANKIPQTLTFYELDLGLNHVVR-KYAEPLVDKG-NI 235

Query: 167  LIPVP---PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR-------- 215
            LI VP       GV++  E  +V+   N      IR  I +    +D D  R        
Sbjct: 236  LISVPGGQEGPSGVIVCCENYLVF--KNLGDQPDIRCPIPRRRNELD-DCDRTVMIVCTA 292

Query: 216  ---------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGS 266
                     +L+    G +  + +  E + VT LKI+      +A+ +  L    ++  S
Sbjct: 293  THKTKLMYFFLVQTDQGDIFKVTLESEHDIVTELKIKYFDTIPLANAMCILKTGFLFTAS 352

Query: 267  SYGDSQLIKL--------------NLQPDAKGSY---------VEVLERYVNLGPIVDFC 303
             +GD  L ++               +Q D   ++         + V+++  +L P++   
Sbjct: 353  EFGDHHLYQIAHLGDEDDEPEYSSRMQLDEGETFFFAPRGLLNLAVVDQIDSLCPLISCH 412

Query: 304  VVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTDDPFD 362
            + DL  +   Q+    G     +L+++RNG+ + E A  EL G    +W+++ + DD +D
Sbjct: 413  IDDLANEDAPQLYALCGRSARSALKVLRNGLEVTEMAVSELPGNPNAVWTVKRNIDDKYD 472

Query: 363  TFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSS 422
            + +VVSF++ T +L++   + +EE    GF   T TL C     + L+QV    +R + +
Sbjct: 473  SHIVVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLGCGLIGDDALLQVYPDGIRHIRA 530

Query: 423  TSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKHAQLEYEI 480
              R   NEWK+P   ++     N  QV +A  GG +VY E+   G L E  +  +L  ++
Sbjct: 531  DRRV--NEWKAPGKRTIIKCALNRRQVAIALAGGEIVYFELDVTGQLNEYTERRELPADV 588

Query: 481  SCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPD---LNLITKEHLGGEIIPRSVLLCA 537
             C+ ++ I E    S+   VG+  D +VRI SL     L  ++ + L  E  P S+++  
Sbjct: 589  LCMSLSEIPEGELRSRFLTVGL-ADKTVRIISLDPQDCLAPLSMQALPSE--PESIIVLE 645

Query: 538  F---EGIS----YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTH 590
                EG S    +L   L +G LL   ++  TGELTD +   LGT+ + L     ++   
Sbjct: 646  MFGGEGQSASTVHLNIGLQNGCLLRTTVDQVTGELTDNRTRYLGTRSVNLFRVRIQSKDA 705

Query: 591  VFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQ 650
            + AAS R  ++Y+   +   + ++   + +   F+S   P+ +    E  L I +++ + 
Sbjct: 706  IMAASSRAWLLYNYQSRFHLTPLSYTALEYAAGFSSEQCPEGIVAIAENTLRILSLEKLG 765

Query: 651  KLHIRSI-PLGEHPRR-ICH---------------------QEQSRTFAICSLKNQSCAE 687
             +  +++ PL   PRR I H                     QE+ +  A   ++    A+
Sbjct: 766  AVFNQAVYPLDYTPRRMIIHKASGNLIVIETDHAAFTAKGKQERRKQLANELMEVAKEAD 825

Query: 688  ESEMHFV-------------------------------RLLDDQTFEFISTYPLDTFEYG 716
            E++   V                               R++     E +S +P    E  
Sbjct: 826  EADEQAVKEMAEAILTEKVNEREMGAPKNQKGKWASTVRIMRSSDGETLSLFPFAEDEAA 885

Query: 717  CSILSCSFSDDSNVYYC-VGTAYVLPEENEPTK---GRILVFIVED--GKLQLIAEKETK 770
             S+    F + ++  +  VG    L  + +P K   G I  F++ +   KL+ +    T 
Sbjct: 886  FSLAMVQFQNQNDAQFILVGCGLGL--QLKPRKSHGGSIYTFLLANNGSKLEFLHRTPTD 943

Query: 771  GAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHG-HILALYVQTRGDFIV 829
              V +++ F G  L  + +KI+LY     D G ++L ++C +    +  + +++ G  IV
Sbjct: 944  EVVNAVHDFRGMALVGVGKKIRLY-----DLGKKKLLAKCENKQLPVQVVDIRSMGQRIV 998

Query: 830  VGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSE 889
            V D  +S+  + YK ++  +     D +  +++ + ILD D  +   + F    V +  +
Sbjct: 999  VSDSQESLHFMRYKKQDNQLSIFCDDTSPRFVTCICILDYDT-VAVGDRFGSIAVLRLPK 1057

Query: 890  GATDE------------ERGRL-------EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQ 930
            G T+E            +RG L       E +G++++G+ V   +  SLV        G 
Sbjct: 1058 GVTEEVQEDPTGVRALWDRGNLNGASQKVEHIGQFYVGDTVTSMQKTSLV-------PGA 1110

Query: 931  IPTVIFGTVNGVIGVIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTV 989
               +++ T++G IG++   +  +++ F + L+ +LR     + G +H  +RSF    K +
Sbjct: 1111 NDCLVYTTISGTIGMLVPFVSRDEFDFFQNLEMHLRVEFPPLCGRDHLAFRSFYFPVKCI 1170

Query: 990  DAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
                 +DGDL E +  +   +   +++ +     E+ K++E++
Sbjct: 1171 -----IDGDLCEQYALMPLDKQKAVAEELGRKPAEIHKKLEDI 1208


>gi|116207186|ref|XP_001229402.1| conserved hypothetical protein [Chaetomium globosum CBS 148.51]
 gi|88183483|gb|EAQ90951.1| conserved hypothetical protein [Chaetomium globosum CBS 148.51]
          Length = 1211

 Score =  206 bits (523), Expect = 6e-50,   Method: Compositional matrix adjust.
 Identities = 273/1168 (23%), Positives = 484/1168 (41%), Gaps = 167/1168 (14%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
            I+G I ++  FR  G  +D++ +AT+  +  ++++  +++      +        R    
Sbjct: 64   IFGIIRSIASFRLAGSHKDYIILATDSGRIAIIEYQPKTNRFSRIHLETFGKSGVRRVVP 123

Query: 66   GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YGC 120
            GQ    DP  R  LI     + L  V+  + + +L  +  +   +  VL +  +    G 
Sbjct: 124  GQYLAADPKGRACLIAGVEKNKLVYVLNRNAQAELTISSPLEAHKAGVLVLSLVALDVGY 183

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDK------------DFVEGPWSQNNLDNGADLLI 168
            A P    L  D  +A    + E   + +            + V   WS + +D  + LL 
Sbjct: 184  ANPVFAALEIDYSEADQDPSGEAGKEPEAQLVYYELDLGLNHVVRKWS-DAVDPTSSLLF 242

Query: 169  PVP-----PPLCGVLIIGEETIVYCSAN--AFKA-IPIRPSITK--AYGRVDADGSRYLL 218
             VP     P   GVL+ GEE + Y  +N  AF+  IP R   T+     R    G  + L
Sbjct: 243  QVPGGNDGP--SGVLVCGEENVTYRHSNQEAFRVPIPRRRGATEDPQRKRTIVAGVMHKL 300

Query: 219  GDHAGLLHLLVITHEKE------------------KVTGLKIELLGETSIASTISYLDNA 260
               AG    L+ T + +                  +V  LK++      IA ++  L + 
Sbjct: 301  KGSAGAFFFLLQTDDGDLMKVTLDMVEDNDGNPTGEVRRLKVKYFDTIPIAQSLCILKSG 360

Query: 261  VVYIGSSYGDSQLIKL---------------NLQPDAKGSYVEV------------LERY 293
             ++    +G+  L +                +   D++ SY  V            +E  
Sbjct: 361  FLFSAGEFGNHHLYQFEKLGDDDEELEFSSDDFPTDSRASYNPVYFHPRPLENLVLVESM 420

Query: 294  VNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKG-MWS 352
             ++ P++D  + +L  +   Q+ T  G     + R++++G+ ++E  + EL G    +W+
Sbjct: 421  DSMNPLIDCKIANLTGEDAPQIYTVCGNRARSTFRMLKHGLEVSEIVASELPGTPAAVWT 480

Query: 353  LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
             + +  D +D ++V+SF + T +L++   + +EE    GF +   TL         L+QV
Sbjct: 481  TKLTKYDEYDGYIVLSFTNATLVLSIG--ETVEEVTESGFLTSVPTLAVQQLGEEGLIQV 538

Query: 413  TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTEV 471
                +R +        NEW +P   S+  A  N +QV++A   G +VY E+  DG L E 
Sbjct: 539  HPKGIRHIVQGRV---NEWPAPQHRSIVAAATNENQVVIALSSGEIVYFEMDADGSLAEY 595

Query: 472  -KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEII 529
             +  ++   ++ L +  + E    S   AVG   D +VRI SL P+  L  K        
Sbjct: 596  DEKKEMSGTVTSLSLGRVPEGLRRSSFLAVGC-DDCTVRILSLDPESTLEMKSIQALTSA 654

Query: 530  PRSVLLCAFE----GIS-YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFS 584
            P S+L+ A E    G++ YL   L  G  L  +L+  TGELTD ++  LG +P  L   S
Sbjct: 655  PSSLLVMAMEDSTGGMTLYLHIGLHSGVYLRTVLDEVTGELTDTRQKFLGPKPTKLFQVS 714

Query: 585  SKNTTHVFAASDRPTVIYSS--NKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELT 642
             +N   V A S RP + Y+    K    + +   E+     F+S    + +       L 
Sbjct: 715  VQNQPCVLALSSRPWLGYTDPITKNFSMTPLIYSELEFGWNFSSEQCLEGMVGIHANFLR 774

Query: 643  IGTIDDIQKLHI-RSIPLGEHPRRIC-HQEQSRTFAICSLKN------------QSCAEE 688
            I TI+ +    I +SIPL   P+R+  H EQ   + I S  N            QS A  
Sbjct: 775  IFTIERLGDTMIQKSIPLTYTPKRLVKHPEQPYFYTIESDNNTLPPELRAQLLEQSGAVN 834

Query: 689  SEMHFVRLLD-----------------DQTFEFISTYPLDTFEYGCSILSCSF----SDD 727
             +   +   D                 D   E  S      FE   + +S +     S +
Sbjct: 835  GDAAILPPEDFGYPRATGRWASCISVVDPLGEEPSVLQRIDFEGNEAAVSAAVVPFSSQE 894

Query: 728  SNVYYCVGTAY-VLPEENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVYSLNAFNGKLL 784
               +  VGT   ++    + ++G I V+   EDG+ L+ I + + +    +L  F G+LL
Sbjct: 895  GESFLIVGTGKDMVLNPRKFSEGYIHVYRFHEDGRDLEFIHKTKVEEPPMALIPFQGRLL 954

Query: 785  AAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKH 844
            A I + +++Y   LR    + L+   G     L + +QT+G  I+VGD+ + I+ ++YK 
Sbjct: 955  AGIGKTLRVYDLGLR----QLLRKAQGEVAPQLIVSLQTQGSRIIVGDVQQGITYVVYKP 1010

Query: 845  EEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEE--------- 895
            E   +   A D    W +   ++D +   G +   N++ +R  SE A+ E          
Sbjct: 1011 ESNKLLPFADDTINRWTTCTTMVDYESVAGGDKFGNIWILR-CSERASQESDEPGSEIQL 1069

Query: 896  ----------RGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGV 945
                      + RL  +  ++  +        +LV       VG    +++  + G +GV
Sbjct: 1070 LHARNYLHGAQSRLSAMAHFYTQDLPTSIVKTNLV-------VGGQDVLVWSGIQGTVGV 1122

Query: 946  -IASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFL 1004
             I  +  E   F + L++++R     + G +H  +R +      V  K  +DGDL E F 
Sbjct: 1123 LIPFVSREDVDFFQNLESHMRAEDPPLAGRDHLIYRGY-----YVPVKGVIDGDLCERFS 1177

Query: 1005 DLSRTRMDEISKTMNVSVEELCKRVEEL 1032
             L   +   I+  ++ SV E+ +++ ++
Sbjct: 1178 LLPNDKKQMIAGELDRSVREIERKISDI 1205


>gi|389638952|ref|XP_003717109.1| pre-mRNA-splicing factor RSE1 [Magnaporthe oryzae 70-15]
 gi|148887431|sp|Q52E49.2|RSE1_MAGO7 RecName: Full=Pre-mRNA-splicing factor RSE1
 gi|351642928|gb|EHA50790.1| pre-mRNA-splicing factor RSE1 [Magnaporthe oryzae 70-15]
          Length = 1216

 Score =  206 bits (523), Expect = 7e-50,   Method: Compositional matrix adjust.
 Identities = 287/1167 (24%), Positives = 487/1167 (41%), Gaps = 160/1167 (13%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
            ++G I  L  FR  G ++D+L IA++  +  ++++   +    +R   +   + G R   
Sbjct: 64   VFGIIRDLASFRLAGSSKDYLIIASDSGRITIVEY-LPAQNRFSRIHLETFGKSGVRRVV 122

Query: 65   NGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YG 119
             GQ    DP  R  LI       L  V+  +++ +L  +  +   +   L +  +    G
Sbjct: 123  PGQYLAADPKGRACLIASVERCKLVYVLNRNSQAELTISSPLEAHKNGTLTLSLVALDVG 182

Query: 120  CAKPTIVVLY-------QDNKD--ARHVKT----YEVALKDKDFVEGPWSQNNLDNGADL 166
             + P    L        QD K   A++V+T    YE+ L     V   W  + +D+ A +
Sbjct: 183  YSNPVFAALEVDYSEVDQDPKGDAAQNVETVLNYYELDLGLNHVVRK-WF-DVVDSTASM 240

Query: 167  LIPVP-----PPLCGVLIIGEETIVYCSAN--AFKA-IPIRPSITK--AYGRVDADGSRY 216
            L  VP     P   GVL+ GEE I Y  +N  AF+  IP R   T+  +  R    G  +
Sbjct: 241  LFQVPGGSDGP--SGVLVCGEENITYRHSNQDAFRVPIPRRKGATEDPSRKRNIVSGVMH 298

Query: 217  LLGDHAGLLHLLVITHEKE------------------KVTGLKIELLGETSIASTISYLD 258
             L   AG    L+ T + +                  +V  LKI+      +++ +  L 
Sbjct: 299  KLKGSAGAFFFLLQTEDGDLFKVTIDMVEDAEGNPTGEVRRLKIKYFDTIPVSNNLCILK 358

Query: 259  NAVVYIGSSYGDS---QLIKL------------NLQPDAKGSY------------VEVLE 291
            +  +++ S +G+    Q  KL            +   D K  Y            + ++E
Sbjct: 359  SGFLFVASEFGNHLFYQFEKLGDDDEELEFFSSDFPVDPKEPYEPVYFYPRPTENLALVE 418

Query: 292  RYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGM 350
               ++ P++D  V +L  +   Q+ T SG     + R++++G+ +NE  + +L G    +
Sbjct: 419  SIDSMNPLMDLKVANLTEEDAPQIYTVSGKGARSTFRMLKHGLEVNEIVASQLPGTPSAV 478

Query: 351  WSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLV 410
            W+ +   DD +D ++V+SF + T +L++   + +EE    GF S   TL       + LV
Sbjct: 479  WTTKLRRDDEYDAYIVLSFTNGTLVLSIG--ETVEEVSDTGFLSSVPTLAVQQLGDDGLV 536

Query: 411  QVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILT 469
            QV    +R + +    + NEW SP   S+  A  N  QV +A   G +VY E+  DG L 
Sbjct: 537  QVHPKGIRHIRNG---VVNEWSSPQHRSIVAAATNERQVAVALSSGEIVYFEMDTDGSLA 593

Query: 470  EVKHAQLEY-EISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGE 527
            E    +  +  ++ L +  + E    S   AVG   D +VRI SL P+  L +K      
Sbjct: 594  EYDEKKEMFGTVTSLSLGEVPEGRLRSSYLAVGC-DDCTVRILSLDPESTLESKSVQALT 652

Query: 528  IIPRSVLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLR 581
              P ++ + + E  S      YL   L  G  L  +L+  TGELTD ++  LG + + L 
Sbjct: 653  AAPSALSIMSMEDSSSGGTTLYLHIGLNSGVYLRTVLDEVTGELTDTRQKFLGPKAVRLF 712

Query: 582  TFSSKNTTHVFAASDRPTVIYSS--NKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEG 639
              S +  T V A S R  + +S    K    + +N +E+     F S    + +      
Sbjct: 713  QVSVQKRTCVLALSSRSWLGFSDPVTKGFTMTPLNYEELEWGWNFVSEQCEEGMVGVNGQ 772

Query: 640  ELTIGTIDDIQKLHI-RSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFV---- 694
             L I  I+ +    I +SIPL   PR++      R F      N + A E     +    
Sbjct: 773  FLRIFAIEKLGDNVIQKSIPLTYTPRKLAKHPTQRIFYTIEADNNTLAPELREQLMAAPT 832

Query: 695  ------RLLDDQTFEF---------------------------ISTYPLDTFEYGCSILS 721
                  R+L    F +                           +    LD  E   S+  
Sbjct: 833  AVNGDARVLPPDEFGYPRGNGRWASCISVVDPLGDGEELEPGVVQRIDLDNNEAALSMAV 892

Query: 722  CSF-SDDSNVYYCVGTAY-VLPEENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVYSLN 777
             SF S D   +  VGT   ++      T+G I V+   EDG+ L+ I + + +    +L 
Sbjct: 893  VSFASQDGESFLVVGTGKDMVVNPRRFTEGYIHVYRFSEDGRELEFIHKTKVEEPPTALL 952

Query: 778  AFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSI 837
             F G+L+A I + +++Y   LR    R+ Q+E       L + + T+G  I+VGD+   +
Sbjct: 953  PFQGRLVAGIGRMLRIYDLGLR-QLLRKAQAEVAPQ---LIVSLNTQGSRIIVGDVQHGL 1008

Query: 838  SLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR--KNSEGATDEE 895
              + YK E   +   A D  A W +   ++D D   GA+   NL+ +R  + +   +DE 
Sbjct: 1009 IYVAYKSETNRLIPFADDTIARWTTCTTMVDYDSTAGADKFGNLWILRCPEKASQESDEP 1068

Query: 896  RGRLEVV-GEYHLGEFVNRFRHGSLV--MRLPDS------DVGQIPTVIFGTVNGVIGV- 945
               + +V    +L    NR    + V    +P S       VG    +++G   G IGV 
Sbjct: 1069 GSEVHLVHSRDYLHGTSNRLALMAHVYTQDIPTSICKTNLVVGGQEVLLWGGFQGTIGVL 1128

Query: 946  IASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLD 1005
            I  +  E   F + L+ +LR     + G +H  +R        V  K  +DGDL E +  
Sbjct: 1129 IPFVSREDADFFQSLEQHLRSEDPPLAGRDHLMYRGC-----YVPVKGVIDGDLCERYTM 1183

Query: 1006 LSRTRMDEISKTMNVSVEELCKRVEEL 1032
            L   +   I+  ++ SV E+ +++ ++
Sbjct: 1184 LPNDKKQMIAGELDRSVREIERKISDI 1210


>gi|67538564|ref|XP_663056.1| hypothetical protein AN5452.2 [Aspergillus nidulans FGSC A4]
 gi|40743422|gb|EAA62612.1| hypothetical protein AN5452.2 [Aspergillus nidulans FGSC A4]
          Length = 1226

 Score =  206 bits (523), Expect = 7e-50,   Method: Compositional matrix adjust.
 Identities = 281/1162 (24%), Positives = 475/1162 (40%), Gaps = 163/1162 (14%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
            ++G I TL  FR  G  +D++ I ++  +  ++++   S     R   +   + G R   
Sbjct: 64   VFGIIRTLAAFRLAGSNKDYIIIGSDSGRITIIEY-VPSQNRFNRIHLETFGKSGVRRVV 122

Query: 65   NGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLD---IKFLYG 119
             GQ   +DP  R  LI     + L  V+  +++ +L  +  +   + Q L    +    G
Sbjct: 123  PGQYLAVDPKGRACLIASVEKNKLVYVLNRNSQAELTISSPLEAHKPQTLVYSVVALDAG 182

Query: 120  CAKPTIVVLYQDNKDARHVKT-------------YEVALKDKDFVEGPWSQNNLDNGADL 166
               P    L  D  ++    T             YE+ L     V   W+ + +D  + +
Sbjct: 183  YENPVFAALEVDYSESDQDPTGRAYEEVEKLLVYYELDLGLNHVVRK-WT-DPVDRTSSM 240

Query: 167  LIPVPPPL---CGVLIIGEETIVYCSAN--AFKA-IPIRPS----------ITKAYGRVD 210
            L  VP       GVL+  E+ I Y  +N  AF+  IP R            IT       
Sbjct: 241  LFQVPGGADGPSGVLVCAEDNITYRHSNQDAFRVPIPRRKGAMENPERKRCITAGVMHKM 300

Query: 211  ADGSRYLLGDHAG-LLHLLVITHEKEK------VTGLKIELLGETSIASTISYLDNAVVY 263
                 +LL    G L  L +   E +K      V GLKI+      +AS++  L +  +Y
Sbjct: 301  RGAFFFLLQTEDGDLFKLTLDMVEDDKGQLTGEVKGLKIKYFDTVPLASSLLILKSGFLY 360

Query: 264  IGSSYGDSQLIKL-----------------NLQPDAKGSYV----------EVLERYVNL 296
            + +  G+    +                  +  P A  + V           ++E   +L
Sbjct: 361  VAAEGGNHHFYQFEKLGDDDEETEFNSDDFSADPAAPCTPVYFQPRGAENLNLVEAINSL 420

Query: 297  GPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSLRS 355
             P+VD  VV++      Q+ T SG     + R +++G+ ++E    EL  +   +W+ + 
Sbjct: 421  NPLVDSKVVNISEDDAPQIFTVSGTGARSTFRTLKHGLEVSEIVESELPSVPSAVWTTKL 480

Query: 356  STDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSG 415
            +  D FD ++V+SF + T +L++   + +EE    GF S   TL       + L+Q+   
Sbjct: 481  TRADEFDAYIVLSFANGTLVLSIG--ETVEEVTDTGFLSSAPTLAVQQLGEDSLIQIHPR 538

Query: 416  SVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTEV-KH 473
             +R + +  R   NEW +P   S+  A  N  QV +A   G +VY E+  DG L E  + 
Sbjct: 539  GIRHILADRR--VNEWPAPQHRSIVAAATNERQVAVALSSGEIVYFELDADGSLAEYDER 596

Query: 474  AQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPRS 532
             Q+   ++CL +  + E    S   AVG   D +VRI SL PD  L  K        P +
Sbjct: 597  RQMSGTVTCLSLGEVPEGRVRSSFLAVGC-DDSTVRILSLDPDTTLENKSVQALTAAPSA 655

Query: 533  VLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSK 586
            + + A    S      YL   L  G  L   L+  TGEL+D +   LG++ + L   S  
Sbjct: 656  LNIIAMADSSSGGTTLYLHIGLHSGVYLRTALDEVTGELSDTRTRFLGSKAVKLFQVSVT 715

Query: 587  NTTHVFAASDRPTVIYS--SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIG 644
              T V A S RP + YS    K  + + ++   +     F+S    + +   +   L I 
Sbjct: 716  GQTAVLALSSRPWLGYSDTQTKGFMLTPLDYVGLEWGWNFSSEQCVEGMVGIQGQNLRIF 775

Query: 645  TIDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFE 703
            +I+ +   +  +SIPL   PR      +   F +    N   +  +     RLL+D    
Sbjct: 776  SIEKLDNNMLQQSIPLAYTPRHFIKHPEEPLFYVIEADNNVLSPATR---ARLLEDSKAR 832

Query: 704  FIST---------YP--------------------------LDTFEYGCSILSCSF-SDD 727
               T         YP                          L+  E   SI +  F S D
Sbjct: 833  GGDTTVLPPEDFGYPRGTGHWASCIQIIDPLDAKAVVGAVELEENEAAVSIAAVPFTSQD 892

Query: 728  SNVYYCVGTAYVLPEENEPTK--GRILVF-IVEDGK-LQLIAEKETKGAVYSLNAFNGKL 783
               +  VGTA  +   N P+   G I ++   EDGK L+ I + + +    +L  F G+L
Sbjct: 893  DETFLVVGTAKDM-TVNPPSSAGGYIHIYRFQEDGKELEFIHKTKVEEPPLALLGFQGRL 951

Query: 784  LAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY-VQTRGDFIVVGDLMKSISLLIY 842
            LA +   +++Y     D G ++L  +C       A+  +QT+G  IVV D+ +S++ ++Y
Sbjct: 952  LAGVGSVLRIY-----DLGMKQLLRKCQAAVAPKAIVGLQTQGSRIVVSDVRESVTYVVY 1006

Query: 843  KHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDEERGRL 899
            K+++  +     D  A W +A  ++D +   G +   NL+ VR   K SE A +E  G  
Sbjct: 1007 KYQDNVLIPFVDDSIARWTTAATMVDYETTAGGDKFGNLWLVRCPKKASEEADEEGSGAH 1066

Query: 900  EVVGEYHLGEFVNRFRHGSLVMRLPDSDV-----------GQIPTVIFGTVNGVIGVIAS 948
             +    +L    NR     L++ +   D+           G    +++    G IG++  
Sbjct: 1067 LIHDRGYLQGTPNRLE---LMIHVFTQDIPTSLHKTQLVAGGRDILVWTGFQGTIGILVP 1123

Query: 949  -LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLS 1007
             +  E   F + L+  L      + G +H  +RS+    K V     +DGDL E +  LS
Sbjct: 1124 FVSREDVDFFQSLEMQLASQCPPLAGRDHLIYRSYYAPVKGV-----IDGDLCEQYFLLS 1178

Query: 1008 RTRMDEISKTMNVSVEELCKRV 1029
                  I+  ++ SV E+ +++
Sbjct: 1179 NDTKMMIAAELDRSVREIERKI 1200


>gi|121700262|ref|XP_001268396.1| nuclear mRNA splicing factor, putative [Aspergillus clavatus NRRL 1]
 gi|119396538|gb|EAW06970.1| nuclear mRNA splicing factor, putative [Aspergillus clavatus NRRL 1]
          Length = 1209

 Score =  205 bits (522), Expect = 9e-50,   Method: Compositional matrix adjust.
 Identities = 274/1170 (23%), Positives = 482/1170 (41%), Gaps = 173/1170 (14%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
            ++G I TL  FR  G  +D++ I ++  +  ++++   S     R   +   + G R   
Sbjct: 64   VFGIIRTLAAFRLAGSNKDYIIIGSDSGRITIIEY-VPSQNRFNRIHLETFGKSGVRRVV 122

Query: 65   NGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YG 119
             GQ   +DP  R  LI     + L  V+  +++ +L  +  +   + Q +    +    G
Sbjct: 123  PGQYLAVDPKGRACLIASVEKNKLVYVLNRNSQAELTISSPLEAHKPQTVVFAMIALDVG 182

Query: 120  CAKPTIVVLYQDNKDARHVKT-------------YEVALKDKDFVEGPWSQNNLDNGADL 166
               P    L  D  ++    T             YE+ L     V   W+ + +D  +++
Sbjct: 183  YENPIFAALEVDYSESDQDPTGRAYEESEKVLVYYELDLGLNHVVRK-WA-DPVDRTSNM 240

Query: 167  LIPVP-----PPLCGVLIIGEETIVYCSAN--AFKA-IPIRPSITKAYGRV--------- 209
            L  VP     P   GVL+  E+++ Y  +N  AFK  IP R   T+   R          
Sbjct: 241  LFQVPGGGDGP--SGVLVCAEDSVTYRHSNQDAFKVPIPRRSGATENPERKRTIVAGVMH 298

Query: 210  DADGSRYLL--GDHAGLLHLLV--ITHEKEKVTG----LKIELLGETSIASTISYLDNAV 261
               G+ + L   D   L  + +  +  +  ++TG    LKI+      +AS +  L +  
Sbjct: 299  KMRGAFFFLLQTDDGDLFKVTIDMVEDDNGQLTGEVKRLKIKYFDTVPVASNLLILKSGF 358

Query: 262  VYIGSSYGDSQLIKLNLQPD-------AKGSY--------------------VEVLERYV 294
            +Y+ S  G+    +     D       +  SY                    + ++E   
Sbjct: 359  LYVASEGGNHHFYQFEKLGDDDEEIEFSSESYSADPSVPCDPVFFRPRGAENLNLVESLN 418

Query: 295  NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSL 353
            +L P++D  + +L      Q+   SG     + R +++G+ ++E    EL  +   +W+ 
Sbjct: 419  SLNPLIDSKIANLNEDDAPQIYAVSGTGARSTFRTLKHGLEVSEIVESELPSVPSAVWTT 478

Query: 354  RSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVT 413
            + +  D FD ++++SF + T +L++   + +EE    GF S   TL       + L+QV 
Sbjct: 479  KLTRGDEFDAYIILSFANGTLVLSIG--ETVEEVTDTGFLSTAPTLAVQQLGEDSLIQVH 536

Query: 414  SGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTEV- 471
               +R + +  R   NEW +P   S+  A  N  QV +A   G +VY E+  DG L E  
Sbjct: 537  PRGIRHILADRR--VNEWPAPQHRSIVAAATNERQVAVALSSGEIVYFEMDADGSLAEYD 594

Query: 472  KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIP 530
            +  Q+   ++CL +  + E    S   AVG   D +VRI SL PD  L  K        P
Sbjct: 595  ERRQMSGTVTCLSLGEVPEGRVRSSFLAVGC-DDSTVRILSLDPDSTLENKSVQALTSAP 653

Query: 531  RSVLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFS 584
             ++ + +    S      YL   L  G  L  +L+  TGEL+D +   LG +P+ L   S
Sbjct: 654  SALNIMSMADSSSGGTTLYLHIGLYSGVYLRTVLDEVTGELSDTRTRFLGAKPVKLFRVS 713

Query: 585  SKNTTHVFAASDRPTVIYS--SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELT 642
             +  T V A S RP + Y     K  + + ++   +     F+S    + +   +   L 
Sbjct: 714  VRGQTAVLALSSRPWLGYCDIQTKGFMLTPLDYVGLEWGWNFSSEQCVEGMVGIQGQNLR 773

Query: 643  IGTIDDIQKLHIR-SIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVR------ 695
            I +I+ +    ++ SIPL   PRR+    +   F +    N   +  +    +       
Sbjct: 774  IFSIEKLDNNMLQESIPLSYTPRRLLKHPEQPLFYVIGSDNNVLSPATRARLIEDSKARN 833

Query: 696  ----LLDDQTFEF----------------------ISTYPLDTFEYGCSILSCSFSD-DS 728
                 L  + F +                      IST  L+  E   S+ +  FS  D 
Sbjct: 834  GEADTLPPEEFGYPRATGHWASCIQVVDPVNAKAVISTIELEENEAAVSMAAVPFSSQDD 893

Query: 729  NVYYCVGTAYVLPEENEPTKGRILVFIV---EDGK-LQLIAEKETKGAVYSLNAFNGKLL 784
              +  VGTA  L   N P+     + I    EDGK L+ I + + +    +L AF G+L+
Sbjct: 894  ETFLVVGTAKDL-TVNPPSSAGGFIHIYRFQEDGKELEFIHKTKVEEPPLALLAFQGRLV 952

Query: 785  AAINQKIQLYKWMLRDDGTRELQSECGH---HGHILALYVQTRGDFIVVGDLMKSISLLI 841
            A I   +++Y     D G ++L  +C        I+ L  QT+G  I+V D+ +S++ ++
Sbjct: 953  AGIGSILRIY-----DLGMKQLLRKCQAPVVPKTIVGL--QTQGSRIIVSDVRESVTYVV 1005

Query: 842  YKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDE---- 894
            YK++E  +     D  + WM++  ++D +   G +   NL+ VR   K SE A ++    
Sbjct: 1006 YKYQENVLIPFVDDTVSRWMTSTTMVDYETVAGGDKFGNLWLVRCPKKISEEADEDGSGA 1065

Query: 895  ----ERG-------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVI 943
                ERG       RLE++   +  +         LV        G    +++    G I
Sbjct: 1066 HLIHERGYLHGTPNRLELMIHTYTQDIPTSVHKTQLV-------AGGRDILVWTGFQGTI 1118

Query: 944  GVIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIES 1002
            G++   +  E   F + L+  L      + G +H  +RS+    K V     +DGDL E 
Sbjct: 1119 GMLVPFMSREDVDFFQNLEMQLASQCPPLAGRDHLIYRSYYAPVKGV-----IDGDLCEM 1173

Query: 1003 FLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            +  L       I+  ++ SV E+ +++ ++
Sbjct: 1174 YFLLPNDTKMMIAAELDRSVREIERKISDM 1203


>gi|440478305|gb|ELQ59147.1| pre-mRNA-splicing factor rse-1 [Magnaporthe oryzae P131]
          Length = 1223

 Score =  205 bits (522), Expect = 9e-50,   Method: Compositional matrix adjust.
 Identities = 287/1164 (24%), Positives = 485/1164 (41%), Gaps = 160/1164 (13%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
            ++G I  L  FR  G ++D+L IA++  +  ++++   +    +R   +   + G R   
Sbjct: 64   VFGIIRDLASFRLAGSSKDYLIIASDSGRITIVEY-LPAQNRFSRIHLETFGKSGVRRVV 122

Query: 65   NGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YG 119
             GQ    DP  R  LI       L  V+  +++ +L  +  +   +   L +  +    G
Sbjct: 123  PGQYLAADPKGRACLIASVERCKLVYVLNRNSQAELTISSPLEAHKNGTLTLSLVALDVG 182

Query: 120  CAKPTIVVLY-------QDNKD--ARHVKT----YEVALKDKDFVEGPWSQNNLDNGADL 166
             + P    L        QD K   A++V+T    YE+ L     V   W  + +D+ A +
Sbjct: 183  YSNPVFAALEVDYSEVDQDPKGDAAQNVETVLNYYELDLGLNHVVRK-WF-DVVDSTASM 240

Query: 167  LIPVP-----PPLCGVLIIGEETIVYCSAN--AFKA-IPIRPSITK--AYGRVDADGSRY 216
            L  VP     P   GVL+ GEE I Y  +N  AF+  IP R   T+  +  R    G  +
Sbjct: 241  LFQVPGGSDGP--SGVLVCGEENITYRHSNQDAFRVPIPRRKGATEDPSRKRNIVSGVMH 298

Query: 217  LLGDHAGLLHLLVITHEKE------------------KVTGLKIELLGETSIASTISYLD 258
             L   AG    L+ T + +                  +V  LKI+      +++ +  L 
Sbjct: 299  KLKGSAGAFFFLLQTEDGDLFKVTIDMVEDAEGNPTGEVRRLKIKYFDTIPVSNNLCILK 358

Query: 259  NAVVYIGSSYGDS---QLIKL------------NLQPDAKGSY------------VEVLE 291
            +  +++ S +G+    Q  KL            +   D K  Y            + ++E
Sbjct: 359  SGFLFVASEFGNHLFYQFEKLGDDDEELEFFSSDFPVDPKEPYEPVYFYPRPTENLALVE 418

Query: 292  RYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGM 350
               ++ P++D  V +L  +   Q+ T SG     + R++++G+ +NE  + +L G    +
Sbjct: 419  SIDSMNPLMDLKVANLTEEDAPQIYTVSGKGARSTFRMLKHGLEVNEIVASQLPGTPSAV 478

Query: 351  WSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLV 410
            W+ +   DD +D ++V+SF + T +L++   + +EE    GF S   TL       + LV
Sbjct: 479  WTTKLRRDDEYDAYIVLSFTNGTLVLSIG--ETVEEVSDTGFLSSVPTLAVQQLGDDGLV 536

Query: 411  QVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILT 469
            QV    +R + +    + NEW SP   S+  A  N  QV +A   G +VY E+  DG L 
Sbjct: 537  QVHPKGIRHIRNG---VVNEWSSPQHRSIVAAATNERQVAVALSSGEIVYFEMDTDGSLA 593

Query: 470  EVKHAQLEY-EISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGE 527
            E    +  +  ++ L +  + E    S   AVG   D +VRI SL P+  L +K      
Sbjct: 594  EYDEKKEMFGTVTSLSLGEVPEGRLRSSYLAVGC-DDCTVRILSLDPESTLESKSVQALT 652

Query: 528  IIPRSVLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLR 581
              P ++ + + E  S      YL   L  G  L  +L+  TGELTD ++  LG + + L 
Sbjct: 653  AAPSALSIMSMEDSSSGGTTLYLHIGLNSGVYLRTVLDEVTGELTDTRQKFLGPKAVRLF 712

Query: 582  TFSSKNTTHVFAASDRPTVIYSS--NKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEG 639
              S +  T V A S R  + +S    K    + +N +E+     F S    + +      
Sbjct: 713  QVSVQKRTCVLALSSRSWLGFSDPVTKGFTMTPLNYEELEWGWNFVSEQCEEGMVGVNGQ 772

Query: 640  ELTIGTIDDIQKLHI-RSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFV---- 694
             L I  I+ +    I +SIPL   PR++      R F      N + A E     +    
Sbjct: 773  FLRIFAIEKLGDNVIQKSIPLTYTPRKLAKHPTQRIFYTIEADNNTLAPELREQLMAAPT 832

Query: 695  ------RLLDDQTFEF---------------------------ISTYPLDTFEYGCSILS 721
                  R+L    F +                           +    LD  E   S+  
Sbjct: 833  AVNGDARVLPPDEFGYPRGNGRWASCISVVDPLGDGEELEPGVVQRIDLDNNEAALSMAV 892

Query: 722  CSF-SDDSNVYYCVGTAY-VLPEENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVYSLN 777
             SF S D   +  VGT   ++      T+G I V+   EDG+ L+ I + + +    +L 
Sbjct: 893  VSFASQDGESFLVVGTGKDMVVNPRRFTEGYIHVYRFSEDGRELEFIHKTKVEEPPTALL 952

Query: 778  AFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSI 837
             F G+L+A I + +++Y   LR    R+ Q+E       L + + T+G  I+VGD+   +
Sbjct: 953  PFQGRLVAGIGRMLRIYDLGLR-QLLRKAQAEVAPQ---LIVSLNTQGSRIIVGDVQHGL 1008

Query: 838  SLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR--KNSEGATDEE 895
              + YK E   +   A D  A W +   ++D D   GA+   NL+ +R  + +   +DE 
Sbjct: 1009 IYVAYKSETNRLIPFADDTIARWTTCTTMVDYDSTAGADKFGNLWILRCPEKASQESDEP 1068

Query: 896  RGRLEVV-GEYHLGEFVNRFRHGSLV--MRLPDS------DVGQIPTVIFGTVNGVIGV- 945
               + +V    +L    NR    + V    +P S       VG    +++G   G IGV 
Sbjct: 1069 GSEVHLVHSRDYLHGTSNRLALMAHVYTQDIPTSICKTNLVVGGQEVLLWGGFQGTIGVL 1128

Query: 946  IASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLD 1005
            I  +  E   F + L+ +LR     + G +H  +R        V  K  +DGDL E +  
Sbjct: 1129 IPFVSREDADFFQSLEQHLRSEDPPLAGRDHLMYRGC-----YVPVKGVIDGDLCERYTM 1183

Query: 1006 LSRTRMDEISKTMNVSVEELCKRV 1029
            L   +   I+  ++ SV E+ +++
Sbjct: 1184 LPNDKKQMIAGELDRSVREIERKI 1207


>gi|195996829|ref|XP_002108283.1| hypothetical protein TRIADDRAFT_49802 [Trichoplax adhaerens]
 gi|190589059|gb|EDV29081.1| hypothetical protein TRIADDRAFT_49802 [Trichoplax adhaerens]
          Length = 1208

 Score =  205 bits (522), Expect = 9e-50,   Method: Compositional matrix adjust.
 Identities = 278/1182 (23%), Positives = 495/1182 (41%), Gaps = 192/1182 (16%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
            ++G I  L  FR  G ++D++ I ++  +  +L++   S     +   +   + G R   
Sbjct: 58   VFGEIRCLAPFRLTGGSKDYVVIGSDSGRITILEY-IPSKNTFEKIHQETFGKSGCRRIV 116

Query: 65   NGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLK-----EAFNIRLEELQVLDIKFL 117
             GQ   +DP  R  +IG      L  ++  D   +L      EA         V+ I   
Sbjct: 117  PGQYLAVDPKGRAVMIGAMEKQKLVYILNRDTSARLTISSPLEAHKSHTVVFSVVGIDV- 175

Query: 118  YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNGADL 166
             G   P    L  D ++A +  T E AL  +    + E     N+        L + +++
Sbjct: 176  -GFENPVFACLEVDYEEADNDPTGEAALTTRQMLTYYELDLGLNHVVRKFTETLQDFSNM 234

Query: 167  LIPVP---PPLCGVLIIGEETIVYCSANAFKAIPIR-PSITKAYGRVDAD---------G 213
            LIPVP       G+L+  E  I Y   N      IR P   + Y   D D         G
Sbjct: 235  LIPVPGGNDGPSGILVCSENFITY--KNFGDQPDIRMPIPRRRYDLSDPDRGILFVCYAG 292

Query: 214  SR------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSS 267
             +      + + +  G +  L +T E + V+ +K++      +AS++  L    ++  S 
Sbjct: 293  HKTKSLFFFFIQNEQGDIFKLTLTVEDDMVSSIKLKYFDTIPVASSMIVLKTGFLFASSE 352

Query: 268  YGDSQLIKL--------------------------NLQPDAKGSYVEVLERYVNLGPIVD 301
            +G+  L ++                          +L+P    + VEV E   +L PI  
Sbjct: 353  FGNHHLYQIAHLGDNDEETEFSSTMLLDEGETFYYSLRP--LKNLVEVDE-VDSLCPITG 409

Query: 302  FCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTDDP 360
              V DL  +   Q+    G   + +LRI+R+G+ + E A  EL G   G+W++++S    
Sbjct: 410  CQVADLANEDTPQLYVSCGRGPNSTLRILRHGLEVTEMAVSELPGNPNGVWTVKTSASAE 469

Query: 361  FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLV 420
            +D ++VVSF++ T +L++   + +EE    GF   T TL C     + L+QV +  +R V
Sbjct: 470  YDAYIVVSFVNATLVLSIG--ESVEEVSDSGFLGTTPTLHCCQIGDDALLQVYANGIRHV 527

Query: 421  SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKHAQLEY 478
             +  R   NEWK+P    ++    N  QV +A  GG LVY E+   G L E  +  +   
Sbjct: 528  RADKR--VNEWKAPGKKIISKCAVNNRQVAIALTGGELVYFEMDLSGQLNEYTERREFSS 585

Query: 479  EISCLDIN--PIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPRSVLL 535
            E+ C+ I   P+GE     +  AVG+  D +VR+ SL P   L          +P S+ +
Sbjct: 586  EVICMSIGSVPVGEK--RCRFLAVGL-ADHTVRMISLDPSDCLQPMSMQALPTVPESLCI 642

Query: 536  CAF---------EGI---SYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTF 583
             A          +G+    YL   L +G LL  +L+  TG+++D +   LG++P+ L   
Sbjct: 643  VAMGSGDSSESEQGVLSTYYLNIGLQNGVLLRSVLDSVTGDMSDTRTRYLGSRPVRLFKV 702

Query: 584  SSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTI 643
              +N+  V A S R  + Y        + ++   + +   F+S   P+ +       L I
Sbjct: 703  KIQNSEAVLAISSRSWLGYMFQSVSRLTPLSYDALDYASGFSSDQCPEGVVSIAGDTLRI 762

Query: 644  GTIDDIQKLHIR-SIPLGEHPRRICHQEQSRTFAIC------------SLKNQSCA---- 686
              ++ +  +  + +I L   PRR    +Q+    +             S + Q  A    
Sbjct: 763  LALEKLGAVFNQMTINLKLTPRRFAIDQQNSNLVVVGSDHLCFTDSTKSERKQQMAKEII 822

Query: 687  -----EESEM------HFVR------------------------LLDDQTFEFISTYPLD 711
                 EE+E+       F+                         L  DQ+ E      LD
Sbjct: 823  ESAGDEEAELAKEVAESFMNEVLPATEFGEPKAGNGQWASCIQLLAPDQSLE------LD 876

Query: 712  TFEYGCSILSCSFS-DDSNVYYCVGTAYVLPEENEPTKGRIL-VFIVEDGKLQLIAEKET 769
              E   S+  C F+      +  VG A  L      + G ++  + + +G+L+L+ +   
Sbjct: 877  QDEAALSVAICRFAYKPDETFVVVGVAKELNLNPSSSSGGLMNTYRMANGQLELVHKTVV 936

Query: 770  KGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY-VQTRGDFI 828
            +    ++ AF G+LL    + +++Y     D G ++L  +C +      +  + + G  +
Sbjct: 937  EEVPRAMAAFQGRLLVGTGRILRVY-----DLGRKKLLRKCENKNFPYRIVTISSMGSRV 991

Query: 829  VVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILD-DDIYLGAENNFNLFTVRKN 887
            +VGD+ +S+  + Y+ +E  +   A D +  +++A   LD D I +G  + F    + + 
Sbjct: 992  IVGDVQESVHFVKYRAKENRLVVFADDVSPRYVTATCFLDYDTIAVG--DKFGSIAILRL 1049

Query: 888  SEGATDE------------ERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV--GQIPT 933
            S+   DE            +RG L   G          F  G  VM L  + +  G   +
Sbjct: 1050 SDDINDEIEEDPTGAKAFWDRGLLN--GASQKANLEASFYIGETVMSLQKTTIIPGGSES 1107

Query: 934  VIFGTVNGVIGVIASLP---HEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVD 990
            +I+ T++G IGV+  LP    E+  F + L+ +LR     + G +H  +RS+        
Sbjct: 1108 LIYTTLSGSIGVL--LPFTSREEVDFFQHLEMHLRSENAPICGRDHLAYRSY-----YFP 1160

Query: 991  AKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            AKN +DGD+ E F  L  ++   ++  ++ +  E+ K++E++
Sbjct: 1161 AKNVIDGDMCEQFNALDGSKRRTLAMELDRTPPEISKKLEDM 1202


>gi|440473070|gb|ELQ41892.1| pre-mRNA-splicing factor rse-1 [Magnaporthe oryzae Y34]
          Length = 1229

 Score =  205 bits (521), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 287/1164 (24%), Positives = 485/1164 (41%), Gaps = 160/1164 (13%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
            ++G I  L  FR  G ++D+L IA++  +  ++++   +    +R   +   + G R   
Sbjct: 64   VFGIIRDLASFRLAGSSKDYLIIASDSGRITIVEY-LPAQNRFSRIHLETFGKSGVRRVV 122

Query: 65   NGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YG 119
             GQ    DP  R  LI       L  V+  +++ +L  +  +   +   L +  +    G
Sbjct: 123  PGQYLAADPKGRACLIASVERCKLVYVLNRNSQAELTISSPLEAHKNGTLTLSLVALDVG 182

Query: 120  CAKPTIVVLY-------QDNKD--ARHVKT----YEVALKDKDFVEGPWSQNNLDNGADL 166
             + P    L        QD K   A++V+T    YE+ L     V   W  + +D+ A +
Sbjct: 183  YSNPVFAALEVDYSEVDQDPKGDAAQNVETVLNYYELDLGLNHVVRK-WF-DVVDSTASM 240

Query: 167  LIPVP-----PPLCGVLIIGEETIVYCSAN--AFKA-IPIRPSITK--AYGRVDADGSRY 216
            L  VP     P   GVL+ GEE I Y  +N  AF+  IP R   T+  +  R    G  +
Sbjct: 241  LFQVPGGSDGP--SGVLVCGEENITYRHSNQDAFRVPIPRRKGATEDPSRKRNIVSGVMH 298

Query: 217  LLGDHAGLLHLLVITHEKE------------------KVTGLKIELLGETSIASTISYLD 258
             L   AG    L+ T + +                  +V  LKI+      +++ +  L 
Sbjct: 299  KLKGSAGAFFFLLQTEDGDLFKVTIDMVEDAEGNPTGEVRRLKIKYFDTIPVSNNLCILK 358

Query: 259  NAVVYIGSSYGDS---QLIKL------------NLQPDAKGSY------------VEVLE 291
            +  +++ S +G+    Q  KL            +   D K  Y            + ++E
Sbjct: 359  SGFLFVASEFGNHLFYQFEKLGDDDEELEFFSSDFPVDPKEPYEPVYFYPRPTENLALVE 418

Query: 292  RYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGM 350
               ++ P++D  V +L  +   Q+ T SG     + R++++G+ +NE  + +L G    +
Sbjct: 419  SIDSMNPLMDLKVANLTEEDAPQIYTVSGKGARSTFRMLKHGLEVNEIVASQLPGTPSAV 478

Query: 351  WSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLV 410
            W+ +   DD +D ++V+SF + T +L++   + +EE    GF S   TL       + LV
Sbjct: 479  WTTKLRRDDEYDAYIVLSFTNGTLVLSIG--ETVEEVSDTGFLSSVPTLAVQQLGDDGLV 536

Query: 411  QVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILT 469
            QV    +R + +    + NEW SP   S+  A  N  QV +A   G +VY E+  DG L 
Sbjct: 537  QVHPKGIRHIRNG---VVNEWSSPQHRSIVAAATNERQVAVALSSGEIVYFEMDTDGSLA 593

Query: 470  EVKHAQLEY-EISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGE 527
            E    +  +  ++ L +  + E    S   AVG   D +VRI SL P+  L +K      
Sbjct: 594  EYDEKKEMFGTVTSLSLGEVPEGRLRSSYLAVGC-DDCTVRILSLDPESTLESKSVQALT 652

Query: 528  IIPRSVLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLR 581
              P ++ + + E  S      YL   L  G  L  +L+  TGELTD ++  LG + + L 
Sbjct: 653  AAPSALSIMSMEDSSSGGTTLYLHIGLNSGVYLRTVLDEVTGELTDTRQKFLGPKAVRLF 712

Query: 582  TFSSKNTTHVFAASDRPTVIYSS--NKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEG 639
              S +  T V A S R  + +S    K    + +N +E+     F S    + +      
Sbjct: 713  QVSVQKRTCVLALSSRSWLGFSDPVTKGFTMTPLNYEELEWGWNFVSEQCEEGMVGVNGQ 772

Query: 640  ELTIGTIDDIQKLHI-RSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFV---- 694
             L I  I+ +    I +SIPL   PR++      R F      N + A E     +    
Sbjct: 773  FLRIFAIEKLGDNVIQKSIPLTYTPRKLAKHPTQRIFYTIEADNNTLAPELREQLMAAPT 832

Query: 695  ------RLLDDQTFEF---------------------------ISTYPLDTFEYGCSILS 721
                  R+L    F +                           +    LD  E   S+  
Sbjct: 833  AVNGDARVLPPDEFGYPRGNGRWASCISVVDPLGDGEELEPGVVQRIDLDNNEAALSMAV 892

Query: 722  CSF-SDDSNVYYCVGTAY-VLPEENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVYSLN 777
             SF S D   +  VGT   ++      T+G I V+   EDG+ L+ I + + +    +L 
Sbjct: 893  VSFASQDGESFLVVGTGKDMVVNPRRFTEGYIHVYRFSEDGRELEFIHKTKVEEPPTALL 952

Query: 778  AFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSI 837
             F G+L+A I + +++Y   LR    R+ Q+E       L + + T+G  I+VGD+   +
Sbjct: 953  PFQGRLVAGIGRMLRIYDLGLR-QLLRKAQAEVAPQ---LIVSLNTQGSRIIVGDVQHGL 1008

Query: 838  SLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR--KNSEGATDEE 895
              + YK E   +   A D  A W +   ++D D   GA+   NL+ +R  + +   +DE 
Sbjct: 1009 IYVAYKSETNRLIPFADDTIARWTTCTTMVDYDSTAGADKFGNLWILRCPEKASQESDEP 1068

Query: 896  RGRLEVV-GEYHLGEFVNRFRHGSLV--MRLPDS------DVGQIPTVIFGTVNGVIGV- 945
               + +V    +L    NR    + V    +P S       VG    +++G   G IGV 
Sbjct: 1069 GSEVHLVHSRDYLHGTSNRLALMAHVYTQDIPTSICKTNLVVGGQEVLLWGGFQGTIGVL 1128

Query: 946  IASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLD 1005
            I  +  E   F + L+ +LR     + G +H  +R        V  K  +DGDL E +  
Sbjct: 1129 IPFVSREDADFFQSLEQHLRSEDPPLAGRDHLMYRGC-----YVPVKGVIDGDLCERYTM 1183

Query: 1006 LSRTRMDEISKTMNVSVEELCKRV 1029
            L   +   I+  ++ SV E+ +++
Sbjct: 1184 LPNDKKQMIAGELDRSVREIERKI 1207


>gi|327309050|ref|XP_003239216.1| pre-mRNA splicing factor rse1 [Trichophyton rubrum CBS 118892]
 gi|326459472|gb|EGD84925.1| pre-mRNA splicing factor rse1 [Trichophyton rubrum CBS 118892]
          Length = 1209

 Score =  204 bits (520), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 278/1164 (23%), Positives = 491/1164 (42%), Gaps = 161/1164 (13%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
            ++G I +L  FR  G ++D++ I ++  +  ++++   +     R   +   + G R   
Sbjct: 64   VFGIIRSLAAFRLAGSSKDYIIIGSDSGRITIVEY-VPAQNRFNRIHLETFGKSGVRRVV 122

Query: 65   NGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEE-------LQVLDIK 115
             GQ   +DP  R  LI     + L  V+  + + +L  +  +           L  LD+ 
Sbjct: 123  PGQYLAVDPKGRACLIASVEKNKLVYVLNRNAQAELTISSPLEAHRPQTVVFALTALDVG 182

Query: 116  F---LYGCAKPTIVVLYQDNKDARHVKT------YEVALKDKDFVEGPWSQNNLDNGADL 166
            +   ++   +     + QD     + +T      YE+ L     V   W+ + +D  A +
Sbjct: 183  YENPIFAALEVEYTEVDQDPTGQAYEETEKMLVYYELDLGLNHVVRR-WA-DPVDRTASM 240

Query: 167  LIPVPPPL---CGVLIIGEETIVYCSAN--AFKA-IPIR--PSITKAYGRVDADGSR--- 215
            L  VP       GVL+  E+ IVY  +N  AFK  IP R  P+      R    G     
Sbjct: 241  LFQVPGGADGPSGVLVCAEDNIVYRHSNQDAFKVPIPRRRGPTENPERKRCITAGVMHKM 300

Query: 216  -----YLLGDHAGLLH---LLVITHEKEKVTG----LKIELLGETSIASTISYLDNAVVY 263
                 +LL    G L    + ++  E EK TG    LK++      +AS++  L +  ++
Sbjct: 301  RGAFFFLLQSEDGDLFKVTMEMVEDENEKATGEVKRLKLKYFDTVPLASSLCILKSGFLF 360

Query: 264  IGSSYGD-----------------------SQLIKLNLQP----DAKGSYVEVLERYVNL 296
            + S  G+                       S +I   L P          + ++E   +L
Sbjct: 361  VASETGNQHFYQFEKLGDDDDEIEFISDDYSAIISEPLPPVYFRPRPAENLNLVESIASL 420

Query: 297  GPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSLRS 355
             P++   + ++  +   Q+ T  G     S R +++G+ ++E    EL  +   +W+ + 
Sbjct: 421  NPLMAASIANITEEDAPQIYTLCGTGARSSFRTLKHGLEVSEIVESELPSVPSAVWTTKL 480

Query: 356  STDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSG 415
            S +D FD ++V+SF + T +L++   + +EE    GF S   TL       + L+QV   
Sbjct: 481  SRNDQFDAYIVLSFSNGTLVLSIG--ETVEEVTDTGFLSSAPTLAVQQLGEDSLIQVHPK 538

Query: 416  SVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTEV-KH 473
             +R + +  R   NEW +P   S+  AT+N  QV +A   G +VY E+  DG L E  + 
Sbjct: 539  GIRHIHADQR--VNEWPAPQHRSIVAATSNERQVAIALSSGEIVYFEMDTDGSLAEYDEK 596

Query: 474  AQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPRS 532
             Q+   ++CL +  + E    S   AVG   D +VRI SL PD  L  K        P +
Sbjct: 597  RQMSGTVTCLSLGEVPEGRVRSSFLAVGC-DDSTVRILSLDPDSTLENKSVQALTSAPSA 655

Query: 533  V-LLCAFEGIS-----YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSK 586
            + ++   +  S     YL   L  G  L  +L+  TGEL+D +   LG +P+ L + S K
Sbjct: 656  LSIMSMIDSTSGGSTLYLHIGLYSGIYLRTVLDEVTGELSDTRTRFLGVKPVKLFSVSVK 715

Query: 587  NTTHVFAASDRPTVIYS--SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIG 644
                V A S R  + YS    K    + +N   +     F+S    + +   +   L I 
Sbjct: 716  EQRAVLALSSRSWLGYSDVQTKSFTLTPLNYVGLEWSWNFSSEQCVEGMVGIQGQNLRIF 775

Query: 645  TIDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQ--SCAEESEM---------- 691
            +I+ +   L    IPL   PR      +   F +    N   S A ++++          
Sbjct: 776  SIEKLDNNLLQEPIPLAYTPRNFVRHPEYPLFYVIGSDNNILSPATKAKLLSESTTVNGD 835

Query: 692  ----------------HF---VRLLDD-QTFEFISTYPLDTFEYGCSILSCSF-SDDSNV 730
                            H+   ++++D   T   +S+  L+  E   SI + SF S +   
Sbjct: 836  SAELPPEGFGYPRGTNHWASSIQVVDPIHTKSVLSSLELEDNEAAVSIAAVSFTSQEDET 895

Query: 731  YYCVGTAY-VLPEENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVYSLNAFNGKLLAAI 787
            +  VGT   ++      T G I ++   E+GK L+ I + + +    +L  F G+LLA I
Sbjct: 896  FLVVGTGKDMVVSPRTFTCGFIHIYRFQEEGKELEFIHKTKVEQPPLALLGFQGRLLAGI 955

Query: 788  NQKIQLYKWMLRDDGTRELQSEC-GHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEE 846
               +++Y     D G R+L  +C       + + +QT+G  I+V D+ +S++ ++YK++E
Sbjct: 956  GPDLRIY-----DLGMRQLLRKCQAQITPRVIVGLQTQGSRIIVSDVQESVTYVVYKYQE 1010

Query: 847  GAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDEERGRLEVVG 903
             A+   A D    W +   ++D +   G +   N++ +R   K SE A ++  G   +  
Sbjct: 1011 NALISFADDIIPRWTTCTTMVDYETVAGGDKFGNIWLLRCPTKASEEADEDGSGAHLIHE 1070

Query: 904  EYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGT--------------VNGVIGV-IAS 948
              +L    NR    SLV+     D   IPT I  T              + G +G+ +  
Sbjct: 1071 RQYLQGAPNRL---SLVIHFYSQD---IPTSIQKTQLVAGGRDILVWTGLQGTVGMFVPF 1124

Query: 949  LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSR 1008
            +  +   F + L+  L      + G +H  +R +         K  +DGDL E+FL L  
Sbjct: 1125 ITRDDVDFFQTLEMQLASQNPPLAGRDHLIYRGY-----YAPCKGVIDGDLCETFLLLPN 1179

Query: 1009 TRMDEISKTMNVSVEELCKRVEEL 1032
             +   I+  ++ SV E+ +++ ++
Sbjct: 1180 DKKQAIAGELDRSVREIERKISDM 1203


>gi|336257679|ref|XP_003343663.1| hypothetical protein SMAC_08834 [Sordaria macrospora k-hell]
 gi|380091896|emb|CCC10625.1| unnamed protein product [Sordaria macrospora k-hell]
          Length = 1209

 Score =  204 bits (520), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 279/1168 (23%), Positives = 487/1168 (41%), Gaps = 169/1168 (14%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
            I+G +  +  FR  G  +D++ +AT+  +  ++++  ++++     +        R    
Sbjct: 64   IFGIVRAIASFRLAGSHKDYIILATDSGRITIIEYLPKTNKFQRIHLETFGKSGVRRVIP 123

Query: 66   GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YGC 120
            GQ    DP  R  LI     + L  V+  +++ +L  +  +   +  VL +  +    G 
Sbjct: 124  GQYLAADPKGRACLISSLEKNKLVYVLNRNSQAELTISSPLEAHKPGVLVLSLVALDVGY 183

Query: 121  AKPTIVVLYQDNKDARHVKT-------------YEVALKDKDFVEGPWSQNNLDNGADLL 167
            A P    L  D  DA    T             YE+ L     V   WS + +D  + LL
Sbjct: 184  ANPVFAALEIDYSDADQDPTGQAREEVETQLVYYELDLGLNHVVRK-WS-DTVDRTSSLL 241

Query: 168  IPVP-----PPLCGVLIIGEETIVYCSAN--AFKA-IPIRPSITK--AYGRVDADGSRYL 217
              VP     P   GVL+ GEE + Y  +N  AF+  IP R   T+     RV   G  + 
Sbjct: 242  FQVPGGNDGP--SGVLVCGEENVTYRHSNQEAFRVPIPRRKGATEDPQRKRVIVSGVMHK 299

Query: 218  LGDHAGLLHLLVITHEKE------------------KVTGLKIELLGETSIASTISYLDN 259
            L   AG    L+ T + +                  +V  LKI+      +A+++  L +
Sbjct: 300  LKGSAGAFFFLLQTDDGDLFKITIDMIEDADGNPTGEVKRLKIKYFDTIPVATSLCILKS 359

Query: 260  AVVYIGSSYGDSQLIKL---------------NLQPDAKGSYVEV------LERYV---- 294
              ++  S +G+    +                +   D   SY  V      LE  V    
Sbjct: 360  GFLFAASEFGNHHFYQFEKLGDDDEELEFSSDDFPTDPTASYNPVYFHPRPLENLVLVES 419

Query: 295  --NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMW 351
              ++ P VD  V +L      Q+ +  G     + R +++G+ ++E  + EL G    +W
Sbjct: 420  IDSMNPQVDCKVANLTGDDAPQIYSVCGNGARSTFRSLKHGLEVSEIVASELPGTPSAVW 479

Query: 352  SLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQ 411
            + + +  D +D ++V+SF + T +L++   + +EE    GF +   TL       + L+Q
Sbjct: 480  TTKLTKYDQYDAYIVLSFTNGTLVLSIG--ETVEEVSDSGFLTTAPTLAVRQMGEDGLIQ 537

Query: 412  VTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTE 470
            V    +R +        NEW +P   S+  ATAN +QV++A   G +VY E+  DG L E
Sbjct: 538  VHPKGIRHIVQGRV---NEWPAPQHRSIVAATANENQVVIALSSGEIVYFEMDSDGSLAE 594

Query: 471  V-KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEI 528
              +  ++   ++ L +  + E    S   AVG   D +VRI SL PD  L  K       
Sbjct: 595  YDEKKEMSGTVTSLSVGQVPEGLKRSSFLAVGC-DDCTVRILSLDPDSTLEMKSIQALTA 653

Query: 529  IPRSVLLCAFE----GIS-YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTF 583
             P ++ + + E    G + YL   L  G  L  +L+  TGELTD ++  LG +P  L   
Sbjct: 654  APSALSIMSMEDSFGGFTLYLHIGLHSGVYLRTVLDEVTGELTDTRQKFLGPKPTKLFQV 713

Query: 584  SSKNTTHVFAASDRPTVIYSS--NKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGEL 641
            S ++   V A S RP + Y+    K  + + ++  E+ +   F+S    + +       L
Sbjct: 714  SVQDQPCVLALSSRPWLGYTDPLTKGFMMTPLSYTELEYGWNFSSEQCLEGMVGIHANYL 773

Query: 642  TIGTIDDIQKLHI-RSIPLGEHPRRIC-HQEQSRTFAICSLKNQSCAEESEMHFVRLLDD 699
             I +I+ +    I +SIPL   P+R+  H EQ   + I S  N    E       +LL+ 
Sbjct: 774  RIFSIEKLGDNMIQKSIPLTYTPKRLVKHPEQPYFYTIESDNNTLPPELR----AKLLEQ 829

Query: 700  QTFEFISTYPLDTFEY--------GCSILSCSFSDDSNVYYCVG--------TAYVLP-- 741
            Q+    +  P + F Y         C  +    SD+  V   +         +A ++P  
Sbjct: 830  QSNGDATILPPEDFGYPKAKGRWASCISIIDPISDEPRVLQRIDLDNNEAAVSAAIVPFA 889

Query: 742  -EENEP-----------------TKGRILVF-IVEDGK-LQLIAEKETKGAVYSLNAFNG 781
             +E E                  T+G I V+   EDG+ L+ I +   +    +L  F G
Sbjct: 890  SQEGESFLVVGTGKDMVLNPRQFTEGYIHVYRFHEDGRDLEFIHKTRVEEPPMALIPFQG 949

Query: 782  KLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLI 841
            +LLA + + +++Y   L+    R+ Q++       L + +Q++G+ I+VGDL + ++ ++
Sbjct: 950  RLLAGVGKTLRIYDLGLK-QLLRKAQADV---TPTLIVSLQSQGNRIIVGDLQQGVTYVV 1005

Query: 842  YKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEV 901
            YK E   +     D    W +   ++D +     +   N+  VR     + D +    E 
Sbjct: 1006 YKAEGNRLIPFVDDTLNRWTTCTTMVDYESVASGDKFGNISIVRCPERVSQDTD----EP 1061

Query: 902  VGEYHLGEFVNRFRHGS----------LVMRLPDS------DVGQIPTVIFGTVNGVIGV 945
              E HL    N + HG+              LP S       VG    +++  + G +GV
Sbjct: 1062 GSEIHLMHARN-YLHGTPNRLSLQVHFFTQDLPTSICKTSLVVGGQDVLLWSGLQGTVGV 1120

Query: 946  -IASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFL 1004
             I  +  E   F + L+ ++R     + G +H  +R +    K V     +DGDL E F 
Sbjct: 1121 FIPFVSREDVDFFQNLENHMRAEDPPLAGRDHLIYRGYYTPVKGV-----IDGDLCERFS 1175

Query: 1005 DLSRTRMDEISKTMNVSVEELCKRVEEL 1032
             L   +   I+  ++ SV E+ +++ ++
Sbjct: 1176 LLPNDKKQMIAGELDRSVREIERKISDI 1203


>gi|358366518|dbj|GAA83139.1| nuclear mRNA splicing factor [Aspergillus kawachii IFO 4308]
          Length = 1209

 Score =  204 bits (520), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 273/1168 (23%), Positives = 485/1168 (41%), Gaps = 169/1168 (14%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
            ++G I +L  FR  G ++D++ I ++  +  ++++   S     R   +   + G R   
Sbjct: 64   VFGIIRSLAAFRLAGSSKDYIIIGSDSGRITIIEY-VPSQNRFNRIHLETFGKSGVRRVI 122

Query: 65   NGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YG 119
             GQ   +DP  R  +I     + L  V+  +++ +L  +  +   + Q L         G
Sbjct: 123  PGQYLAVDPKGRACIIASVEKNKLVYVLNRNSQAELTISSPLEAHKPQTLVFAMTALDVG 182

Query: 120  CAKPTIVVLYQDNKDA-------------RHVKTYEVALKDKDFVEGPWSQNNLDNGADL 166
               P    L  D  ++             +H+  YE+ L     V   WS + +D  A L
Sbjct: 183  YENPVFAALEVDYSESDQDPTGQAFEELEKHLVYYELDLGLNHVVRK-WS-DPVDRTASL 240

Query: 167  LIPVPPPL---CGVLIIGEETIVYCSAN--AFKA-IPIRPSITKAYGRVDA--------- 211
            L  VP       GVL+  E+++ Y  +N  AF+  IP R   T+   R  +         
Sbjct: 241  LFQVPGGADGPSGVLVCAEDSVTYRHSNQDAFRVPIPRRSGATENPERKRSITAGVMHKM 300

Query: 212  DGSRYLL--GDHAGLLHLLV--ITHEKEKVTG----LKIELLGETSIASTISYLDNAVVY 263
             G+ + L   +   L  L +  +  +  ++TG    LKI+      +AS++  L +  +Y
Sbjct: 301  RGAFFFLIQTEDGDLFKLTIDMVEDDNGQLTGEVKRLKIKYFDTVPLASSLLILKSGFLY 360

Query: 264  IGSSYGDSQLIKLNLQPD--------------------------AKGS-YVEVLERYVNL 296
            + S  G+    +     D                           +G+  + ++E   +L
Sbjct: 361  VASEAGNHHFYQFEKLGDDDEEIEFTSEAFSADPSVPCDPIYFRPRGAENLNLVETINSL 420

Query: 297  GPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSLRS 355
             P++D  VV++      Q+ T SGA    S R +++G+ ++E    EL  +   +W+ + 
Sbjct: 421  NPLIDSKVVNITEDDAPQIYTISGAGARSSFRTLKHGLEVSEIVESELPSVPSAVWTTKL 480

Query: 356  STDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSG 415
            +  D FD ++++SF + T +L++   + +EE    GF S   TL       + L+Q+   
Sbjct: 481  TRSDEFDAYIILSFANGTLVLSIG--ETVEEVTDTGFLSTAPTLAVQQLGEDSLIQIHPR 538

Query: 416  SVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTEV-KH 473
             +R + +  R   NEW +P   S+  A  N  QV +A   G +VY E+  DG L E  + 
Sbjct: 539  GIRHILADRR--VNEWPAPQHRSIVAAATNERQVAVALSSGEIVYFEMDSDGSLAEYDER 596

Query: 474  AQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPRS 532
             Q+   ++CL +  + E    S   AVG   D +VRI SL PD  L  K        P +
Sbjct: 597  RQMSGTVTCLSLGEVPEGRVRSSFLAVGC-DDSTVRILSLDPDSTLENKSVQALTAAPSA 655

Query: 533  VLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSK 586
            + + +    S      YL   L  G  L  +L+  TGEL+D +   LG++ + L   S K
Sbjct: 656  LNIMSMADSSSGGTTLYLHIGLYSGVYLRTVLDEVTGELSDTRTRFLGSKAVKLFQVSVK 715

Query: 587  NTTHVFAASDRPTVIYS--SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIG 644
              T V   S RP + YS    K  + + ++   +     F+S    + +   +   L I 
Sbjct: 716  GQTAVLGLSSRPWLGYSDVQTKGFMLTPLDYVGLEWGWNFSSEQCVEGMVGIQGQNLRIF 775

Query: 645  TIDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVR-------- 695
            +I+ +   +  +SIPL   PRR     +   F +    N   +  +    +         
Sbjct: 776  SIEKLDNNMLQQSIPLSYTPRRFLKHPEQPLFYVIESDNNVLSPSTRAKLLEDSKSRGGD 835

Query: 696  --LLDDQTFEF----------------------ISTYPLDTFEYGCSILSCSF-SDDSNV 730
              +L  + F +                      + T  L+  E   SI +  F S D   
Sbjct: 836  ETVLPPEDFGYPRATGHWASCIQVVDPLDAKAVVHTIELEENEAAISIAAVPFTSQDDET 895

Query: 731  YYCVGTAYVLPEENEPTK--GRILVF-IVEDGK-LQLIAEKETKGAVYSLNAFNGKLLAA 786
            +  VGTA  +   N P    G I ++   EDG+ L+ I + + +    +L  F G+L+A 
Sbjct: 896  FLVVGTAKDM-SVNPPKSAGGYIHIYRFQEDGRELEFIHKTKVEEPPLALLGFQGRLVAG 954

Query: 787  INQKIQLYKWMLRDDGTRELQSECGH---HGHILALYVQTRGDFIVVGDLMKSISLLIYK 843
            I   +++Y     D G ++L  +C        I+ L  QT+G  IVV D+ +S++ ++YK
Sbjct: 955  IGSLLRIY-----DLGMKQLLRKCQAPVVPKTIVGL--QTQGSRIVVSDVRESVTYVVYK 1007

Query: 844  HEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDE------ 894
            ++E  +     D  + W +A  ++D +   G +   NL+ +R   K SE A ++      
Sbjct: 1008 YQENVLIPFVDDSVSRWTTATTMVDYETTAGGDKFGNLWLLRCPKKTSEEADEDGSGAHL 1067

Query: 895  --ERG-------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGV 945
              ERG       RLE++   +  +         LV        G    +++    G IG+
Sbjct: 1068 IHERGYLQGTPNRLELMIHVYTQDIPTSLHKTQLV-------AGGRDILVWTGFQGTIGM 1120

Query: 946  IAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFL 1004
            +   +  E   F + L+  L      + G +H  +RS+    K V     +DGDL E + 
Sbjct: 1121 LVPFIGREDVDFFQNLEMQLAAQHPPLAGRDHLIYRSYYAPVKGV-----IDGDLCEMYF 1175

Query: 1005 DLSRTRMDEISKTMNVSVEELCKRVEEL 1032
             L       I+  ++ SV E+ +++ ++
Sbjct: 1176 LLPNDTKMMIAAELDRSVREIERKISDM 1203


>gi|242772631|ref|XP_002478075.1| nuclear mRNA splicing factor, putative [Talaromyces stipitatus ATCC
            10500]
 gi|218721694|gb|EED21112.1| nuclear mRNA splicing factor, putative [Talaromyces stipitatus ATCC
            10500]
          Length = 1209

 Score =  204 bits (519), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 273/1169 (23%), Positives = 487/1169 (41%), Gaps = 171/1169 (14%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
            ++G I +L  FR  G  +D++ I ++  +  ++++   S     R   +   + G R   
Sbjct: 64   VFGIIRSLAAFRLAGSNKDYIIIGSDSGRITIIEY-VPSQNRFNRIHLETFGKSGVRRVI 122

Query: 65   NGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YG 119
             GQ   +DP  R  LI     + L  V+  +++ +L  +  +   + Q L         G
Sbjct: 123  PGQYLAVDPKGRACLIASVEKNKLVYVLNRNSQAELTISSPLEAHKAQTLVFALTALDVG 182

Query: 120  CAKPTIVVLYQDNKDARHVKT-------------YEVALKDKDFVEGPWSQNNLDNGADL 166
               P    +  D  +     T             YE+ L     V   WS + +D  +++
Sbjct: 183  YDNPIFAAIEVDYSECDQDPTGRAYEEVEKQLVYYELDLGLNHVVRR-WS-DVIDRTSNI 240

Query: 167  LIPVPPPL---CGVLIIGEETIVYCSAN--AFKAIPI------------RPSITKAYGRV 209
            L  VP       GVL+  E++I Y  +N  AF+ +PI            + SIT      
Sbjct: 241  LFQVPGGADGPSGVLVCAEDSITYRHSNQDAFR-VPIPRRRGPTENPERKRSITAGVMHK 299

Query: 210  DADGSRYLLGDHAGLLHLLVITHEKE---KVTG----LKIELLGETSIASTISYLDNAVV 262
                  +LL    G L  L I   ++   ++TG    LKI+      IA+ +  L +  +
Sbjct: 300  MRGAFFFLLQTEDGDLFKLTIDMVEDDNGQLTGEVKRLKIKYFDTVPIATNLLILKSGFL 359

Query: 263  YIGSSYGDSQLIKL---------------NLQPDAKGSYVEV------------LERYVN 295
            ++ S  G+    +                ++  D     + V            +E   +
Sbjct: 360  FVASETGNHHFYQFEKLGDDDEETEFPSDDVSADLADPIIPVYFNVRDAENLNLVESVNS 419

Query: 296  LGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSLR 354
            L PI+D  + DL  +   Q+ T  G     + R +++G+ ++E    EL  +   +W+ +
Sbjct: 420  LNPIMDCKITDLLAEDAPQIYTICGTGARSTFRTLKHGLDVSEIVESELPSVPSAVWTTK 479

Query: 355  SSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTS 414
             +  D FD ++V+SF + T +L++   + +EE    GF S   TL       + L+QV  
Sbjct: 480  LTRKDEFDAYIVLSFTNGTLVLSIG--ETVEEVTDTGFLSTAPTLAVQQLGEDSLIQVHP 537

Query: 415  GSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTEV-K 472
              +R + +  R   NEW +P   ++  A  N  QV +A   G +VY E+  DG L E  +
Sbjct: 538  KGIRHILADHR--VNEWPAPQHRTIVAAATNERQVAVALSSGEIVYFEMDTDGSLAEYDE 595

Query: 473  HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPR 531
              Q+   ++ L +  + E    S   AVG   D +VRI SL PD  L  K        P 
Sbjct: 596  KRQMSGTVTSLSLGEVPEGRVRSSFLAVGC-DDSTVRILSLDPDSTLENKSVQALTSAPS 654

Query: 532  SVLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSS 585
            ++ + +    +      YL   L  G  L  +L+  TGEL+D +   LG +P+ L   S 
Sbjct: 655  ALNIMSMADSTSGGSTLYLHIGLYSGVYLRTVLDEVTGELSDTRTRFLGAKPVKLFGVSV 714

Query: 586  KNTTHVFAASDRPTVIYS--SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTI 643
            K  + V A S RP + YS    K  + + ++   +     F+S    + +   +   L I
Sbjct: 715  KGQSAVLALSSRPWLGYSDIQTKSFMLTPLDYVGLEWGWNFSSEQCLEGMVGIQGQNLRI 774

Query: 644  GTIDDIQKLHIR-SIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDD--- 699
             +I+ +    ++ SIPL   PR      +   F +   +N   A  ++    RLL+D   
Sbjct: 775  FSIEKLDNNVLQESIPLAYTPRHFVKHPEQPLFYVIESENNVLAPATQ---TRLLEDSKL 831

Query: 700  ----------QTFEF----------------------ISTYPLDTFEYGCSILSCSF-SD 726
                      +TF F                      +S   L+  E   S+ + SF S 
Sbjct: 832  QNGEAVIPPAETFGFPRATGHWASCIEVVDPINSKSVLSRLELEENESAVSVAAVSFASQ 891

Query: 727  DSNVYYCVGTAY-VLPEENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVYSLNAFNGKL 783
            D+  +  VGT   V+      + G I ++   EDG+ L+ I + + +    +L AF G+L
Sbjct: 892  DNETFLVVGTGKDVVTYPRSFSAGFIHIYRFQEDGRELEFIHKTKIEEPPLALLAFQGRL 951

Query: 784  LAAINQKIQLYKWMLRDDGTRELQSECGHHGHI-LALYVQTRGDFIVVGDLMKSISLLIY 842
            +A I + +++Y     D G +++  +C       L + +QT+G  I+V D+ +S++ ++Y
Sbjct: 952  VAGIGKNLRVY-----DLGMKQMLRKCQVEASPNLIVGLQTQGSRIIVSDVQESVTYVVY 1006

Query: 843  KHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDEERG-- 897
            K++E  +     D  A W +A  ++D +   G +   NL+ VR   K SE + ++  G  
Sbjct: 1007 KYQENQLIPFVDDVIARWTTATTMVDYETTAGGDKFGNLWLVRCPKKVSEESDEDGSGAH 1066

Query: 898  -------------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIG 944
                         RL+++  ++  +        +LV       VG    +++  + G IG
Sbjct: 1067 LIHERSYLQGTPNRLDLMVHFYTQDIPTSLHKTNLV-------VGGRDILVWTGLQGTIG 1119

Query: 945  -VIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESF 1003
             +I  +  E   F + L+  L      + G  H  +RS+      V  K  +DGDL ES+
Sbjct: 1120 MMIPFISREDVDFFQNLEMQLASQNPPLAGREHLIYRSY-----YVPVKGVIDGDLCESY 1174

Query: 1004 LDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
              L   +   I+  ++ SV E+ +++ ++
Sbjct: 1175 FLLPNDKKLMIAGELDRSVREIERKISDM 1203


>gi|195490209|ref|XP_002093045.1| GE20993 [Drosophila yakuba]
 gi|194179146|gb|EDW92757.1| GE20993 [Drosophila yakuba]
          Length = 1227

 Score =  204 bits (519), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 263/1199 (21%), Positives = 470/1199 (39%), Gaps = 197/1199 (16%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            +L   I+G I  L  FR  G  +D++ + ++  +  +L+++A  + L             
Sbjct: 53   LLSTEIFGCIRALMAFRLTGGTKDYIVVGSDSGRIVILEYNAAKNALEKVHQETFGKSGC 112

Query: 61   RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL- 117
            R    GQ   IDP  R  +IG      L  ++  D + +L  +  +   +   L    + 
Sbjct: 113  RRIVPGQYFAIDPKGRAVMIGAVEKQKLAYIMNRDTQARLTISSPLEAHKSNTLTYHMVG 172

Query: 118  --YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNGA 164
               G   P    L  D ++A    + + A + +    F E     N+        L+  A
Sbjct: 173  VDVGFDNPMFACLEIDYEEADMDPSGDAAQRTQQTLTFYELDLGLNHVVRKYSEPLEEHA 232

Query: 165  DLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR---- 215
            + L+ VP     P   GVLI  E  + Y   N      IR  I +    +D D  R    
Sbjct: 233  NFLVSVPGGNDGP--SGVLICSENYLTY--KNLGDQHDIRCPIPRRRNDLD-DPERGMIF 287

Query: 216  -------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVV 262
                         +LL    G +  + +  + + V+ +K++       A+ +  L    +
Sbjct: 288  ICSATHRTKSMYFFLLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPATAMCVLKTGFL 347

Query: 263  YIGSSYGDSQLIKLN------------------------LQPDAKGSYVEVLERYVNLGP 298
            ++ S +G+  L ++                           P A  + V +++   +  P
Sbjct: 348  FVASEFGNHYLYQIAHLGDDDDEPEFSSAMPLEEGETFFFAPRALKNLV-LVDELPSFAP 406

Query: 299  IVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSST 357
            I+   V DL  +   Q+    G     +LR++R+G+ ++E A  EL G    +W+++   
Sbjct: 407  IITSQVADLANEDTPQLYVLCGRGPRSTLRVLRHGLEVSEMAVSELPGNPNAVWTVKKRA 466

Query: 358  DDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSV 417
            DD FD +++VSF++ T +L++   + +EE    GF   T TL C     + LVQV    +
Sbjct: 467  DDEFDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLCCAALGDDALVQVYPDGI 524

Query: 418  RLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKHAQ 475
            R + S  R   NEWK+P   S+     N  QV++   G  LVY E+   G L E  + ++
Sbjct: 525  RHIRSDKR--VNEWKAPGKKSITKCAVNQRQVVITLSGRELVYFEMDPTGELNEYTERSE 582

Query: 476  LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLL 535
            +  EI C+ +  + E    S   AVG+  D +VRI SL   N +T   +     P    L
Sbjct: 583  MPAEIMCMALGTVPEGEQRSWFLAVGL-ADNTVRILSLDPNNCLTPCSMQALPSPAES-L 640

Query: 536  CAFE-------------------------GISYLLCALGDGHLLNFLLNMKTGELTDRKK 570
            C  E                         G  YL   L +G LL  +L+  +G+L D + 
Sbjct: 641  CLVEMGHTESTTQGALDDDAPAPRSGNNKGTIYLNIGLSNGVLLRTVLDPVSGDLADTRT 700

Query: 571  VSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFP 630
              LG++P+ L     +    V A S R  + Y    +   + ++ + + +   F+S    
Sbjct: 701  RYLGSRPVKLFRIKMQGAEAVLAMSSRTWLSYYHQNRFHLTPLSYETLEYASGFSSEQCS 760

Query: 631  DSLAIAKEGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQEQSR-----------TFAI 677
            + +       L I  ++ +  +  + + PL   PR  + H +  R           T   
Sbjct: 761  EGIVAISTNTLRILALEKLGAVFNQVAFPLQYTPRTFVIHPDTGRMLIAETDHNAYTEDT 820

Query: 678  CSLKNQSCAEE-------------SEM-------------------------HFVRLLDD 699
             S + +  AEE              EM                           +R LD 
Sbjct: 821  KSARKEQMAEEMRSAAGDEERELAREMANAFINEVLPEDVFSSPKAGLGLWASQIRCLDA 880

Query: 700  QTFEFISTYPLDTFE--YGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVE 757
               + + + PL   E     ++L  S + D   Y  VG A  L      ++G  +     
Sbjct: 881  MHGQTMFSVPLTQNEAIMSMAMLKFSIAADGRYYLAVGIAKDLQLNPRISQGGCIDIYKI 940

Query: 758  D---GKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSEC-GHH 813
            D     L+ +   E      +L  F G+LLA   + +++Y     D G +++  +C   H
Sbjct: 941  DPTCSSLEFMHRTEIDEIPGALCGFQGRLLAGCGRMLRIY-----DFGKKKMLRKCENKH 995

Query: 814  GHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYL 873
                 + +Q  G  + V D+ +S+  + Y+  E  +   A D +  W++A  +LD D  +
Sbjct: 996  IPYQIVNIQAMGHRVYVSDVQESVFFIRYRRAENQLIIFADDTHPRWVTATTLLDYDT-I 1054

Query: 874  GAENNFNLFTVRKNSEGATDE------------ERGRL-------EVVGEYHLGEFVNRF 914
               + F   ++++     TD+            +RG L       E +  +H+GE +   
Sbjct: 1055 AIADKFGNLSIQRLPHSVTDDVDEDPTGTKSLWDRGLLSGASQKSENICSFHVGEIIMSL 1114

Query: 915  RHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGG 973
            +  +L+        G    +I+ T++G +G        E Y F + L+ ++R     + G
Sbjct: 1115 QKATLI-------PGGSEALIYATLSGTVGAFVPFTSREDYDFFQHLEMHMRNENPPLCG 1167

Query: 974  LNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
             +H  +RS          KN LDGDL E +L +   +   I+  M  +  ++CK++E++
Sbjct: 1168 RDHLSYRS-----SYYPVKNVLDGDLCEQYLSIEAAKQKSIAGDMFRTPNQICKKLEDI 1221


>gi|357147370|ref|XP_003574320.1| PREDICTED: splicing factor 3B subunit 3-like [Brachypodium
            distachyon]
          Length = 1228

 Score =  204 bits (518), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 270/1190 (22%), Positives = 475/1190 (39%), Gaps = 187/1190 (15%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            +L V ++G I +L  FR  G  +D+L + ++  +  +L++  + +               
Sbjct: 63   LLSVDVFGAIRSLAQFRLTGATKDYLVVGSDSGRLVILEYSPDRNRFDKVHQETFGKSGC 122

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEE---------LQV 111
            R    GQ+  +DP  R + +   +    V   +     +   +  LE          L  
Sbjct: 123  RRIVPGQLLTVDPKGRALCIAALEKQKLVYVLNRDASARLTISSPLEAHKSHTLTFALTA 182

Query: 112  LDIKF---LYGCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNN--------L 160
            LD  F   ++G     I + Y D+      +    A K   F E     N+        +
Sbjct: 183  LDCGFDNPVFG----AIELEYGDSDRDPTGQAASHAQKLLTFYELDLGLNHVSRKVSEPI 238

Query: 161  DNGADLLIPVP-----PPLCGVLIIGEETIVYCS---ANAFKAIPIRPSITKAYGR--VD 210
            DNGA+LL+ VP     P   G+L+  +  ++Y +         IP R  +    G   V 
Sbjct: 239  DNGANLLVTVPGGGDGP--SGLLVCCDNFVLYRNQGHPEVRAVIPRRVDLPAERGVLIVA 296

Query: 211  ADGSR------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYI 264
            A   R      +LL    G +  + + H  + V+ L+I+      +AS I  L +  ++ 
Sbjct: 297  AATHRQKSLFFFLLQTEYGDIFKVDLEHNGDSVSELRIKYFDTIPVASAICVLRSGFLFA 356

Query: 265  GSSYGDSQLIKLN--------------------------LQPDAKGSYVEVLERYVNLGP 298
             S +G+  L +                             QP A  + V + E   +L P
Sbjct: 357  ASEFGNHALYQFRDIGRDVDVESSSATLMETEEGFQPVFFQPRALKNLVRIDE-IESLMP 415

Query: 299  IVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIK-GMWSLRSST 357
            I+D  + +L  +   QV T  G     ++RI+R G+ I+E A   L      +W+++ + 
Sbjct: 416  IMDMRIANLFDEETPQVYTACGRGSRSTMRILRPGLAISEMARSMLPAEPIAVWTVKKNI 475

Query: 358  DDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSV 417
            +D FD ++VVSF + T +L++   + +EE     F   T +L       + L+QV    +
Sbjct: 476  NDMFDAYIVVSFANVTLVLSIG--ETIEEVSDSQFLDTTHSLAVSLLGEDSLMQVHPNGI 533

Query: 418  RLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTEVKHAQL 476
            R +    R   NEW++P   ++    +N  QV++A  GG L+Y E+   G L EV+   +
Sbjct: 534  RHIREDGRV--NEWRTPGKKTITKVGSNRLQVVIALSGGELIYFEMDMTGQLMEVEKQDM 591

Query: 477  EYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPRSVLL 535
              +++CL I P+ E    S+  AVG + D ++RI SL PD  L           P S+L 
Sbjct: 592  SGDVACLAIAPVPEGRQRSRFLAVGSY-DNTIRILSLDPDDCLQPLSVQSVSSAPESLLF 650

Query: 536  CAFE-------GISY-----LLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTF 583
               +       G  Y     L   L +G L    ++M TG+L+D +   LG +P  L   
Sbjct: 651  LEVQASVGGEDGADYPANLFLNAGLQNGVLFRTNVDMVTGQLSDTRSRFLGLRPPKLFPC 710

Query: 584  SSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTI 643
               +   +   S RP + Y      L + ++   +     F+S    + +       L I
Sbjct: 711  IVSHRQAMLCLSSRPWLGYIHQGHFLLTPLSCDTLESAASFSSDQCSEGVVAVAGDALRI 770

Query: 644  GTIDDI-QKLHIRSIPLGEHP------------------------------RRICHQ--- 669
             TI+ + +  +  +IPL   P                              +R C +   
Sbjct: 771  FTIERLGETFNETAIPLRYTPRKFVILPKKKYLAVIESDKGAFSAEQREAAKRECLEASG 830

Query: 670  ----------------------EQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFIST 707
                                  E+S T             E  +  +R+LD ++ +    
Sbjct: 831  AVENGNGNGDQMENGDGQEDGAEESNTLPDEQYGYPKAESEKWVSCIRILDPRSRDTTCL 890

Query: 708  YPLDTFEYGCSILSCSFSD-DSNVYYCVGTAYVLP--EENEPTKGRILVF-IVEDGK-LQ 762
              L   E   SI + +F D +      VGT   L    +     G I ++  V++GK L+
Sbjct: 891  LELQDNEAAVSICTVNFHDKEHGTLLAVGTTKGLQFWPKRSLASGFIHIYKFVDEGKSLE 950

Query: 763  LIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY-V 821
            L+ + + +    SL  F G+LLA +   ++LY     D G R+L  +C +      +  +
Sbjct: 951  LLHKTQVEEVPLSLCQFQGRLLAGVGSVLRLY-----DLGKRKLLRKCENKLFPRTIVSI 1005

Query: 822  QTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNL 881
             T  D I VGD+ +S     Y+ +E  +   A D    W++A   +D D   GA+   N+
Sbjct: 1006 HTYRDRIYVGDMQESFHYCKYRRDENQLYIFADDSVPRWLTAANHIDFDTMAGADKFGNI 1065

Query: 882  FTVR------------------KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRL 923
            +  R                  K  +G  +    ++E + ++H+G+ V   +  SL+   
Sbjct: 1066 YFARLPQDLSDEIEEDPTGGKIKWEQGKLNGAPNKVEEIVQFHVGDVVTCLQKASLI--- 1122

Query: 924  PDSDVGQIPTVIFGTVNGVIG-VIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSF 982
                 G    +I+GTV G +G ++A    E   F   L+ +LR+    + G +H  +RS 
Sbjct: 1123 ----PGGGECLIYGTVMGSVGALLAFTSREDVDFFSHLEMHLRQEHPPLCGRDHMAYRS- 1177

Query: 983  NNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
                     K+ +DGDL E +  L      +I+  ++ +  E+ K++E++
Sbjct: 1178 ----AYFPVKDVIDGDLCEQYPSLPADMQRKIADELDRTPGEILKKLEDI 1223


>gi|91092128|ref|XP_972649.1| PREDICTED: similar to AGAP005549-PA [Tribolium castaneum]
 gi|270004662|gb|EFA01110.1| hypothetical protein TcasGA2_TC010322 [Tribolium castaneum]
          Length = 1219

 Score =  203 bits (517), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 262/1190 (22%), Positives = 480/1190 (40%), Gaps = 187/1190 (15%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            +L V I+G + +L  FR  G  +DF+ + ++  +  +L++   S     +   +   + G
Sbjct: 53   LLTVEIFGVVRSLMSFRLTGGTKDFVIVGSDSGRIVILEY-IPSKNNFEKVHQETFGKSG 111

Query: 61   -RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
             R    GQ    DP  R  +IG      L  ++  D + +L  +  +   +   L    +
Sbjct: 112  VRRIVPGQYLATDPRGRAVMIGAVEKQKLAYILNRDVQARLTISSPLEAHKSNTLVYHMV 171

Query: 118  ---YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNG 163
                G   P    L  D ++A    + E A K +    F E     N+        L+  
Sbjct: 172  GVDVGYDNPMFACLEIDYEEADSDPSGEAAQKTQQTLTFYELDLGVNHVVRKYSEPLEEH 231

Query: 164  ADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR--- 215
            A+ L+ VP     P   GVLI  E  + Y   N      IR  I +    +D D  R   
Sbjct: 232  ANFLVTVPGGDDGP--SGVLICSENYLTY--KNLGDQHDIRCPIPRRRNDLD-DPERGMI 286

Query: 216  --------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAV 261
                          +L     G +  + +  +++ VT +K++      +AS +  L    
Sbjct: 287  FVCSATHKTKSMFFFLAQTEQGDIFKITLETDEDMVTEIKLKYFDTVPVASAMCVLKTGF 346

Query: 262  VYIGSSYGDSQLIKL-------------NLQPDAKGSYVEVLERYV----------NLGP 298
            +++ S +G+  L ++             +  P  +G       R +          +L P
Sbjct: 347  LFVTSEFGNHYLYQIAHLGDDDDELEFSSAMPLEEGDTFFFAPRSLRNLVLVDEMESLSP 406

Query: 299  IVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSST 357
            I+   V DL  +   Q+    G     SLR++R+G+ ++E A  EL G    +W+++  +
Sbjct: 407  ILSCRVADLAGEDTPQLYMLCGRGPRSSLRVLRHGLEVSEMAVSELPGNPNAVWTVKRRS 466

Query: 358  DDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSV 417
            DD +D +++VSF++ T +L++   + +EE    GF   T TL C     + LVQV  G +
Sbjct: 467  DDEYDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSALSDDALVQVYPGGI 524

Query: 418  RLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD-GILTEVK-HAQ 475
            R + S  R   NEWK+P   ++     N  QV++A  GG L Y E+   G L E K   +
Sbjct: 525  RHICSDKR--VNEWKAPGKKTIVKCAINQRQVVIALSGGELAYFEMDPTGQLHEYKERKR 582

Query: 476  LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEII------ 529
            +  ++ C+ +  +      S   AVG+  D +VRI SL   + +    + G  +      
Sbjct: 583  MNADVVCMALANVAPGEQLSLFLAVGL-ADSTVRIISLDPSDCLAPRSIQGLPVCAESLC 641

Query: 530  ----------PRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPIT 579
                      P +    +     YL   L +G LL  +L+  +GEL+D +   LG++P+ 
Sbjct: 642  IVEMGCTDREPDNAAAASTTSTLYLNIGLTNGALLRNVLDPVSGELSDTRTRYLGSRPVK 701

Query: 580  LRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEG 639
            L     + +  V A S R  + Y    +   + ++ + + +   F+S   P+ +      
Sbjct: 702  LFRIRMQQSEAVLAMSSRSWLSYYYQSRFYLTPLSYESLEYASGFSSEQCPEGIVAISTN 761

Query: 640  ELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQEQSRTFAICSLKN--------------- 682
             L I  ++ +  +  + S PL   PR+ I H E +    I +  N               
Sbjct: 762  TLRILALEKLGAVFNQVSFPLEYTPRKFIIHPESNNLLIIETEHNAYTEETKKQRRLQMA 821

Query: 683  ---QSCAEESEMHFVR-----LLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCV 734
               +  A E E    +      L++   E I + P        S L         V+ C+
Sbjct: 822  EEMKEAAGEEEQELAKEMADAFLNEDLPESIFSAPKAGHGMWASTLKIMDPVQGIVHKCI 881

Query: 735  G--------TAYVLPEENEPTKGRILVFIVEDG-----------------------KLQL 763
                     ++ ++  +N+P +   L+  +                          +L+ 
Sbjct: 882  RLEQNEAAMSSVLVKFQNQPQQTLFLIVGISKDFQLNPRHCNTGFLDTYKMDPMGRELEF 941

Query: 764  IAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHI--LALYV 821
            +          +L A+NG LLA + + ++LY     D G ++L  +C  + HI    + +
Sbjct: 942  VHRTPVDEVPMALCAYNGLLLAGVGRMLRLY-----DMGKKKLLRKC-ENKHIPNAIVNI 995

Query: 822  QTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNL 881
            Q  G  I V D+ +S+ ++ YK  E  +   A D +  W++   +LD D    A+   N+
Sbjct: 996  QAMGKRIFVSDVQESVFMVRYKRAENQLIIFADDTHPRWVTCNCVLDYDTVAVADKFGNI 1055

Query: 882  FTVRKNSEGATDEE-----------RGRL-------EVVGEYHLGEFVNRFRHGSLVMRL 923
              +R     + D E           RG L       + +  +H+GE V   +  +L+   
Sbjct: 1056 AILRLPPNVSDDVEEDPTGHKSLWDRGLLNGASQKADTIATFHVGETVTWLQKATLI--- 1112

Query: 924  PDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSF 982
                 G   ++I+ T++G +GV+      E + F + L+ ++R     + G +H  +RS+
Sbjct: 1113 ----PGGWESLIYTTLSGSVGVLVPFTSREDHDFFQHLEMHMRSENSPLCGRDHLSFRSY 1168

Query: 983  NNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
                     KN +DGDL E +  L   +   I+  ++ +  E+ K++E++
Sbjct: 1169 -----YFPVKNVIDGDLCEQYNSLEPVKQKSIASDLDRTPAEVSKKLEDI 1213


>gi|392869416|gb|EJB11761.1| pre-mRNA-splicing factor rse1 [Coccidioides immitis RS]
          Length = 1209

 Score =  203 bits (517), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 273/1176 (23%), Positives = 483/1176 (41%), Gaps = 185/1176 (15%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
            ++G I +L  FR  G ++D++ I ++  +  ++++   S     R   +   + G R   
Sbjct: 64   VFGIIRSLAAFRLAGSSKDYIIIGSDSGRIAIVEY-VPSQNRFNRIHLETFGKSGIRRVV 122

Query: 65   NGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YG 119
             GQ   +DP  R  LI     + L  V+  + + +L  +  +     Q L         G
Sbjct: 123  PGQYLAVDPKGRACLIASVEKNKLVYVLNRNAQAELTISSPLEAHRPQTLVFALTALDVG 182

Query: 120  CAKPTIVVLYQDNKDARHVKT-------------YEVALKDKDFVEGPWSQNNLDNGADL 166
               P    L  D  ++    T             YE+ L     V   W+ + +D  A +
Sbjct: 183  YENPIFAALEVDYTESDQDPTGAAYQEAEKLLVYYELDLGLNHVVRR-WA-DPVDRSAAM 240

Query: 167  LIPVP-----PPLCGVLIIGEETIVYCSAN--AFKA-IPIRPSITKAYGRVDADGSRYLL 218
            L  VP     P   GVL+ GE  I Y  +N  AF+  IP R   T+     + +  RY+ 
Sbjct: 241  LFQVPGGADGP--SGVLVCGEGNITYRHSNQDAFRVPIPRRSGPTE-----NPERKRYIT 293

Query: 219  GDHAGLLHLL---------------------VITHEKEKVTG----LKIELLGETSIAST 253
               AG++H +                     ++  E  ++TG    LK++      +AS+
Sbjct: 294  ---AGVVHKMRRAFFCLLQTEDGDLFKVTMDMVEDENGQLTGEVQRLKLKYFDTVPVASS 350

Query: 254  ISYLDNAVVYIGSSYGDSQLIKLN-----------------------LQP----DAKGSY 286
            +  L N  +++ S  G+    +                         L P          
Sbjct: 351  LCILKNGFLFVASETGNHHFYQFEKLGDDDEETEFSSDDFSADPSEPLAPVYFRPRPAEN 410

Query: 287  VEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG 346
            + ++E   +L P++   + +L      Q  T SG     + R +++G+ ++E    EL  
Sbjct: 411  LNLVESINSLNPLMSCKITNLTEDDAPQFYTLSGTGARSTFRTLKHGLEVSEIVESELPS 470

Query: 347  I-KGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAI 405
            +   +W+ + + +D +D ++++SF + T +L++   + +EE    GF S   TL      
Sbjct: 471  VPSAVWTTKLTRNDQYDAYIILSFSNGTLVLSIG--ETVEEVTDTGFLSSAPTLAVQQLG 528

Query: 406  YNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG- 464
             + L+QV    +R + +  R   NEW +P   S+  A  N  QV +A   G +VY E+  
Sbjct: 529  EDSLIQVHPKGIRHIHADRR--VNEWPAPQHRSIVAAATNERQVAVALSSGEIVYFEMDT 586

Query: 465  DGILTEV-KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKE 522
            DG L E  +  ++   ++CL +  +      S   AVG   D +VRI SL PD  L  K 
Sbjct: 587  DGSLAEYDEKREMSGTVTCLSLGEVPPGRVRSSFLAVGC-DDSTVRILSLDPDSTLENKS 645

Query: 523  HLGGEIIPRSV-LLCAFEGIS-----YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQ 576
                   P ++ ++   +  S     YL   L  G  L  +L+  TGEL+D +   LG +
Sbjct: 646  VQALTSAPSALSIMSMVDSTSGGSTLYLHIGLYSGIYLRTVLDEVTGELSDTRTRFLGLK 705

Query: 577  PITLRTFSSKNTTHVFAASDRPTVIYS--SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLA 634
            P+ L + S K    V A S RP + YS    K  + + ++   +     F+S    + + 
Sbjct: 706  PVKLFSVSVKEQRAVLALSSRPWLGYSDLQTKNFMLTPLDYVPLEWSWNFSSEQCVEGMV 765

Query: 635  IAKEGELTIGTIDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHF 693
              +   L I +I+ +   L   +IPL   PR      +   F +    N   +  ++   
Sbjct: 766  GIQGQNLRIFSIEKLDNNLLQETIPLAYTPRHFVRHPEQPLFYVIEADNNILSPSTKAKL 825

Query: 694  ----------VRLLDDQTFEF----------------------ISTYPLDTFEYGCSILS 721
                      V  L  + F +                      IS   L+  E   S+ +
Sbjct: 826  LQDSKAANGEVAELPPEDFGYPRGTGHWASCIQVVDPINSKAVISRIELEENEAAVSVAA 885

Query: 722  CSFSD-DSNVYYCVGTA---YVLPEENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVYS 775
              FS  D   +  VGT     V P  +  + G I ++   EDGK L+ I + + +   ++
Sbjct: 886  VPFSSQDDETFLVVGTGKDMVVYPPSS--SCGFIHIYRFQEDGKELEFIHKTKVESPPHA 943

Query: 776  LNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSEC-GHHGHILALYVQTRGDFIVVGDLM 834
            L AF G+LLA I + +++Y     D G ++L  +C       L + +QT+G  I+V D+ 
Sbjct: 944  LLAFQGRLLAGIGRNLRIY-----DLGMKQLLRKCQAEVVPRLIVGLQTQGSRIIVSDVQ 998

Query: 835  KSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGA 891
            +S++ ++YK++E  +   A D  A W +   ++D +   G +   NL+ +R   K SE A
Sbjct: 999  ESVTYVVYKYQENRLIPFADDVIARWTTCTAMVDYETVAGGDKFGNLWLLRCPQKASEEA 1058

Query: 892  TDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGT------------- 938
             ++  G   +    +L    NR    SL++     D   IPT I  T             
Sbjct: 1059 DEDGSGAHLIHERQYLQGAPNRL---SLMVHFYPQD---IPTSIQKTQLVAGGRDILVWT 1112

Query: 939  -VNGVIGVIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLD 996
             + G +G++   +  E   F + L+  L      + G +H  +RS+        AK  +D
Sbjct: 1113 GLQGTVGMLVPFVSREDVDFFQSLEMQLTSQTPPLAGRDHLIYRSY-----YAPAKGTID 1167

Query: 997  GDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            GDL E++  L   +   I+  ++ SV E+ +++ ++
Sbjct: 1168 GDLCETYFTLPNDKKLMIAGELDRSVREIERKISDM 1203


>gi|154277742|ref|XP_001539706.1| hypothetical protein HCAG_05173 [Ajellomyces capsulatus NAm1]
 gi|150413291|gb|EDN08674.1| hypothetical protein HCAG_05173 [Ajellomyces capsulatus NAm1]
          Length = 1233

 Score =  203 bits (517), Expect = 4e-49,   Method: Compositional matrix adjust.
 Identities = 271/1173 (23%), Positives = 483/1173 (41%), Gaps = 185/1173 (15%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
            ++G I  L  FR  G ++D++ I ++  +  ++++   S     R   +   + G R   
Sbjct: 64   VFGIIRALAAFRLAGSSKDYIIIGSDSGRITIIEY-VPSQNRFNRIHLETFGKSGIRRVI 122

Query: 65   NGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YG 119
             GQ   +DP  R  LI     + L  V+  + + +L  +  +   + Q L         G
Sbjct: 123  PGQYLAVDPKGRACLIASVEKNKLVYVLNRNTQAELTISSPLEAHKPQTLVFALTALDVG 182

Query: 120  CAKPTIVVLYQDNKDA-------------RHVKTYEVALKDKDFVEGPWSQNNLDNGADL 166
               P    L  +  +A             +H+  YE+ L     V   WS + +D  A +
Sbjct: 183  YDNPIFAALEVEYTEADQDPTGQAYEELEKHLVYYELDLGLNHVVRK-WS-DPVDRSASM 240

Query: 167  LIPVP-----PPLCGVLIIGEETIVYCSAN--AFKA-IPIRPSITKAYGRVDADGSRYLL 218
            L  VP     P   G L+  E+ I Y  +N  AF+  IP R   T+     + +  RY+ 
Sbjct: 241  LFQVPGGADGP--SGCLVCAEDNITYRHSNQDAFRVPIPRRSGPTE-----NPERKRYIT 293

Query: 219  GDHAGLLH---------------------LLVITHEKEKVTG----LKIELLGETSIAST 253
               AG++H                     L ++  +  ++TG    LK++      IAS+
Sbjct: 294  ---AGVMHKMRGAFFFLLQTEDGDLFKVTLDMVEDDNGQITGEVSRLKLKYFDTVPIASS 350

Query: 254  ISYLDNAVVYIGSSYGDSQLIKL-------------------NLQPDAKGSY-------- 286
            +  L +  +++ S  G+    +                    +L      +Y        
Sbjct: 351  LCILKSGFLFVASETGNHHFYQFEKLGDDDDEIEFTSDAYSADLSEPLPPAYFRPRPYEN 410

Query: 287  VEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG 346
            V ++E   +L P++D  + +L      Q+ T  G     S R +++G+ + E    EL  
Sbjct: 411  VNLVESIDSLNPLMDCKIANLTDDDAPQIYTICGTGARSSFRTLKHGLEVAEIVESELPS 470

Query: 347  I-KGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAI 405
            +   +W+ + + +D FD ++++SF + T +L++   + +EE    GF S   TL      
Sbjct: 471  VPSAVWTTKLTREDEFDAYIILSFSNGTLVLSIG--ETVEEVTDTGFLSSAPTLAVQQLG 528

Query: 406  YNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG- 464
             + L+QV    +R + +  R   NEW +P   S+  AT N  QV +A   G +VY E+  
Sbjct: 529  EDSLIQVHPKGIRHIHADRR--VNEWPAPQHRSIVAATTNERQVAIALSSGEIVYFEMDT 586

Query: 465  DGILTEVKHAQ-LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKE 522
            DG L E    + +   ++CL +  +    + S   AVG   D +VRI SL P+  L  K 
Sbjct: 587  DGSLAEYDEKRAMSGTVTCLSLGEVPRGRARSSFLAVGC-DDSTVRILSLDPESTLENKS 645

Query: 523  HLGGEIIPRSVLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQ 576
                   P ++ + A    +      YL   L  G  L  +L+  TGEL+D +   LG +
Sbjct: 646  VQALTSAPSALSIMAMSDSTSGGSTLYLHIGLYSGIYLRTVLDEVTGELSDTRTRFLGLK 705

Query: 577  PITLRTFSSKNTTHVFAASDRPTVIYS--SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLA 634
            P+ L   S K  T V A S RP + YS    K  + + ++   +     F+S    + + 
Sbjct: 706  PVKLFQVSVKEQTVVLALSSRPWLGYSHLQTKAFMLTPLDYVSLEWGWNFSSEQCVEGMV 765

Query: 635  IAKEGELTIGTIDDIQ-KLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHF 693
              +   L I +++ ++  L   +IPL   PR      +   F +   +N   +  +    
Sbjct: 766  GIQGQNLRIFSLEKLENNLLQETIPLQYTPRHFIKHPEYPLFYVIEAENNILSPGTR--- 822

Query: 694  VRLLDDQTFEFISTYPLDTFEYG-----------------------------------CS 718
             +LL+D       T PL   E+G                                    S
Sbjct: 823  TKLLNDSDTVNGDTTPLPPEEFGYPRGTGHWASCIQIVDPINSKRVISQIELEENEAAVS 882

Query: 719  ILSCSFSD-DSNVYYCVGTAY-VLPEENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVY 774
            + +  FS  D   +  VGT   ++      T G I ++   E+GK L+ I +   +    
Sbjct: 883  VAAVPFSSQDDETFLVVGTGKDMVVNPRSCTAGFIHIYRFQEEGKELEFIHKTMVEQPPM 942

Query: 775  SLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLM 834
            +L  F G+LLA I   +++Y   ++    R+ Q+    H   L + +QT+G  I+V D+ 
Sbjct: 943  ALLGFQGRLLAGIGTDLRIYDLGMK-QMLRKCQASVVPH---LVVGLQTQGSRIIVSDVQ 998

Query: 835  KSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGA 891
            +S++ ++YK++E  +     D  + W +   ++D +   G +   NL+ +R   K SE A
Sbjct: 999  ESLTYVVYKYQENRLIPFVDDVISRWTTCTTMVDYETVAGGDKFGNLWLLRCPAKASEEA 1058

Query: 892  TDEERG---------------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIF 936
             ++  G               RL +V  ++  +     +   LV    D  V    T + 
Sbjct: 1059 DEDGSGAHLIHERQYLQGAPNRLNLVAHFYPQDLPTSIQKAQLVTGGRDILVW---TGLQ 1115

Query: 937  GTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLD 996
            GTV+ +I  I+    E+  F + L+  L      + G +H  +RS+        AK  +D
Sbjct: 1116 GTVSMLIPFIS---REEVDFFQSLEMQLAAQNPPLAGRDHLIYRSY-----YAPAKGTID 1167

Query: 997  GDLIESFLDLSRTRMDEISKTMNVSVEELCKRV 1029
            GDL E++L L   +  +I+  ++ SV E+ +++
Sbjct: 1168 GDLCETYLLLPNDKKQQIAGELDRSVREIERKI 1200


>gi|295666353|ref|XP_002793727.1| pre-mRNA-splicing factor rse1 [Paracoccidioides sp. 'lutzii' Pb01]
 gi|226278021|gb|EEH33587.1| pre-mRNA-splicing factor rse1 [Paracoccidioides sp. 'lutzii' Pb01]
          Length = 1209

 Score =  203 bits (517), Expect = 4e-49,   Method: Compositional matrix adjust.
 Identities = 277/1164 (23%), Positives = 488/1164 (41%), Gaps = 161/1164 (13%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
            ++G I TL  FR  G ++D++ I ++  +  ++++   S     R   +   + G R   
Sbjct: 64   VFGIIRTLAAFRLAGGSKDYIIIGSDSGRITIIEY-VPSQNRFNRIHLETFGKSGIRRVI 122

Query: 65   NGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YG 119
             GQ   +DP  R  LI     + L  V+  + + +L  +  +   + Q L         G
Sbjct: 123  PGQYLAVDPKGRACLIASVEKNKLVYVLNRNTQAELTISSPLEAHKPQTLVFALTALDVG 182

Query: 120  CAKPTIVVLYQDNKDARHVKT-------------YEVALKDKDFVEGPWSQNNLDNGADL 166
               P    L  +  +A    T             YE+ L     V   WS + +D  A +
Sbjct: 183  YDNPIFAALEVEYTEADQDPTGQAYEELEKQLVYYELDLGLNHVVRK-WS-DPVDRSATM 240

Query: 167  LIPVPPPL---CGVLIIGEETIVYCSAN--AFKA-IPIR--PSITKAYGRVDADGSR--- 215
            L  VP       G L+  E+ I Y  +N  AF+  IP R  P+      R    G     
Sbjct: 241  LFQVPGGADGPSGCLVCAEDNITYRHSNQDAFRVPIPRRSGPTENPERKRCIVAGVMHKM 300

Query: 216  -----YLLGDHAGLLHLLVITHEKE---KVTG----LKIELLGETSIASTISYLDNAVVY 263
                 +LL    G L  + I   ++   ++TG    LK++      IAS++  L +  ++
Sbjct: 301  RGAFFFLLQTEDGDLFKVTIDMVEDDNGQITGEVRRLKLKYFDTVPIASSLCILKSGFLF 360

Query: 264  IGSSYGDS---QLIKLNLQPD----AKGSY--------------------VEVLERYVNL 296
            + S  G+    Q  KL    D       S+                    V ++E   +L
Sbjct: 361  VASETGNHHFYQFEKLGDDDDEIEFTSDSFSANPSEPLPPVYFHPRLSENVHLVESVNSL 420

Query: 297  GPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSLRS 355
             P++D  + +L      Q+ T  G     S R +++G+ + E    EL  +   +W+ + 
Sbjct: 421  NPLMDCKITNLTEDDAPQIYTICGTGARSSFRTLKHGLEVTEIVESELPSVPSAVWTSKL 480

Query: 356  STDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSG 415
            + +D FD ++++SF + T +L++   + +EE    GF S   TL       + L+QV   
Sbjct: 481  TREDEFDAYIILSFSNGTLVLSIG--ETVEEVTDTGFLSSAPTLAVQQLGEDSLIQVHPK 538

Query: 416  SVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTEV-KH 473
             +R + +  R   NEW +P   S+  AT N  QV +A   G +VY E+  DG L E  + 
Sbjct: 539  GIRHIHADRR--VNEWPAPQHRSIVAATTNERQVAVALSSGEIVYFEMDTDGSLAEYDEK 596

Query: 474  AQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPRS 532
             ++   ++CL +  + +  + S   AVG   D +VRI SL PD  L  K        P +
Sbjct: 597  REMSGTVTCLSLGEVPKGRARSSFLAVGC-DDSTVRILSLDPDSTLGNKSVQALTSAPSA 655

Query: 533  VLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSK 586
            + + A    +      YL   L  G  L  +L+  TGEL+D +   LG +P+ L   S K
Sbjct: 656  LSIMAMADSTSGGSTLYLHIGLYSGIYLRTVLDEVTGELSDTRTRFLGLKPVKLFQVSVK 715

Query: 587  NTTHVFAASDRPTVIYS--SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIG 644
                V A S RP + YS    K  + + ++   +     F+S    + +   +   L I 
Sbjct: 716  EQKAVLALSSRPWLGYSDLQTKAFMLTPLDYVGLEWGWNFSSEQCVEGMVGIQGQNLRIF 775

Query: 645  TIDDIQ-KLHIRSIPLGEHPRR-ICHQEQSRTFAI------------CSLKNQSCAEESE 690
            +I+ ++  L   +I L   PR+ I H EQ   + I              L N S A   +
Sbjct: 776  SIEKLENNLLQETISLTYTPRQFIKHPEQPLFYVIEADNNILSPATRAKLLNDSDAVNGD 835

Query: 691  M---------------HF---VRLLDD-QTFEFISTYPLDTFEYGCSILSCSFSD-DSNV 730
                            H+   ++++D   +   +S   L+  E   S+ + SFS  D   
Sbjct: 836  ATVLPSEDFGYPRGTGHWASCIQIVDPVNSKSVVSQIELEENEAAVSVAAVSFSSQDDET 895

Query: 731  YYCVGTAY-VLPEENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVYSLNAFNGKLLAAI 787
            +  VGT   ++      + G I ++   EDGK L+ I + + +    +L  F G+LLA I
Sbjct: 896  FLVVGTGKDMVVNPRSCSAGFIHIYRFQEDGKELEFIHKTKVEQPPVALLGFQGRLLAGI 955

Query: 788  NQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEG 847
               +++Y   +R    R+ Q+    H   L + +QT+G  I+V D+ +S++ ++YK +E 
Sbjct: 956  GTDVRIYDLGMR-QMLRKCQASVVPH---LVVGLQTQGSRIIVSDVQESVTYVVYKSQEN 1011

Query: 848  AIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDEERG------- 897
             +     D  + W +   ++D +   G +   NL+ +R   K SE A ++  G       
Sbjct: 1012 RLIPFVDDVISRWTTCTTMVDYETVAGGDKFGNLWLLRCPAKASEEADEDGSGAHLIHER 1071

Query: 898  --------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGV-IAS 948
                    RL +V  ++  +     +   LV        G    +++  + G +G+ I  
Sbjct: 1072 QYLQGAPNRLNLVAHFYPQDLPTSIQKAQLV-------TGGRDILVWTGLQGTVGMLIPF 1124

Query: 949  LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSR 1008
            +  E+  F + L+  L      + G +H  +RS+        AK  +DGDL E++L L  
Sbjct: 1125 ISREEVDFFQSLEMQLAAQNPPLAGRDHLIYRSY-----YAPAKGTIDGDLCETYLLLPN 1179

Query: 1009 TRMDEISKTMNVSVEELCKRVEEL 1032
             +  +I+  ++ SV E+ +++ ++
Sbjct: 1180 DKKQQIAGELDRSVREIERKIADM 1203


>gi|197246825|gb|AAI68883.1| Sf3b3 protein [Rattus norvegicus]
          Length = 908

 Score =  203 bits (516), Expect = 4e-49,   Method: Compositional matrix adjust.
 Identities = 217/927 (23%), Positives = 401/927 (43%), Gaps = 150/927 (16%)

Query: 228  LVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLN---------- 277
            + +  +++ VT ++++      +A+ +  L    +++ S +G+  L ++           
Sbjct: 4    ITLETDEDMVTEIRLKYFDTVPVAAAMCVLKTGFLFVASEFGNHYLYQIAHLGDDDEEPE 63

Query: 278  --------------LQPDAKGSYVEVLERYVNLGPIVDFC-VVDLERQGQGQVVTCSGAY 322
                           QP    + V +++   +L PI+ FC + DL  +   Q+    G  
Sbjct: 64   FSSAMPLEEGDTFFFQPRPLKNLV-LVDELDSLSPIL-FCQIADLANEDTPQLYVACGRG 121

Query: 323  KDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLE 381
               SLR++R+G+ ++E A  EL G    +W++R   +D FD +++VSF++ T +L++   
Sbjct: 122  PRSSLRVLRHGLEVSEMAVSELPGNPNAVWTVRRHIEDEFDAYIIVSFVNATLVLSIG-- 179

Query: 382  DELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNV 441
            + +EE    GF   T TL C     + LVQV    +R + +  R   NEWK+P   ++  
Sbjct: 180  ETVEEVTDSGFLGTTPTLSCSLLGDDALVQVYPDGIRHIRADKR--VNEWKTPGKKTIVK 237

Query: 442  ATANASQVLLATGGGHLVYLEIG-DGILTE-VKHAQLEYEISCLDINPIGENPSYSQIAA 499
               N  QV++A  GG LVY E+   G L E  +  ++  ++ C+ +  +      S+  A
Sbjct: 238  CAVNQRQVVIALTGGELVYFEMDPSGQLNEYTERKEMSADVVCMSLANVPPGEQRSRFLA 297

Query: 500  VGMWTDISVRIFSLPDLNLITKEHLGGEIIP-RSVLLCAFE----------------GIS 542
            VG+  D +VRI SL   + +  + L  + +P +   LC  E                G  
Sbjct: 298  VGL-VDNTVRIISLDPSDCL--QPLSMQALPAQPESLCIVEMGGTEKQDELGERGSIGFL 354

Query: 543  YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIY 602
            YL   L +G LL  +L+  TG+L+D +   LG++P+ L     +    V A S R  + Y
Sbjct: 355  YLNIGLQNGVLLRTVLDPVTGDLSDTRTRYLGSRPVKLFRVRMQGQEAVLAMSSRSWLSY 414

Query: 603  SSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIR-SIPLGE 661
            S   +   + ++ + +     F S   P+ +       L I  ++ +  +  + + PL  
Sbjct: 415  SYQSRFHLTPLSYETLEFASGFASEQCPEGIVAISTNTLRILALEKLGAVFNQVAFPLQY 474

Query: 662  HPRR-ICHQEQSR-----------TFAICSLKNQSCAEE-------------SEM----- 691
             PR+ + H E +            T A  + + Q  AEE             +EM     
Sbjct: 475  TPRKFVIHPESNNLIIIETDHNAYTEATKAQRKQQMAEEMVEAAGEDERELAAEMAAAFL 534

Query: 692  --------------------HFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVY 731
                                  +R+++      +    L+  E   S+  C FS+    +
Sbjct: 535  NENLPESIFGAPKAGNGQWASVIRVMNPIQGNTLDLVQLEQNEAAFSVAVCRFSNTGEDW 594

Query: 732  YC-VGTAYVLPEENEPTKGRILVF--IVEDG-KLQLIAEKETKGAVYSLNAFNGKLLAAI 787
            Y  VG A  L        G  +    +V +G KL+ + +   +    ++  F G++L  +
Sbjct: 595  YVLVGVAKDLILSPRSVAGGFVYTYKLVNNGEKLEFLHKTPVEEVPAAIAPFQGRVLIGV 654

Query: 788  NQKIQLYKWMLRDDGTRELQSECGHHGHILALY---VQTRGDFIVVGDLMKSISLLIYKH 844
             + +++Y     D G ++L  +C  + HI A Y   +QT G  ++V D+ +S   + YK 
Sbjct: 655  GKLLRVY-----DLGKKKLLRKC-ENKHI-ANYISGIQTIGHRVIVSDVQESFIWVRYKR 707

Query: 845  EEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR--KNSEGATDE-------- 894
             E  +   A D    W++   +LD D   GA+   N+  VR   N+    DE        
Sbjct: 708  NENQLIIFADDTYPRWVTTASLLDYDTVAGADKFGNICVVRLPPNTNDEVDEDPTGNKAL 767

Query: 895  -ERGRL-------EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVI 946
             +RG L       EV+  YH+GE V   +  +L+        G   ++++ T++G IG++
Sbjct: 768  WDRGLLNGASQKAEVIMNYHVGETVLSLQKTTLI-------PGGSESLVYTTLSGGIGIL 820

Query: 947  ASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLD 1005
                 HE + F + ++ +LR     + G +H  +RS+         KN +DGDL E F  
Sbjct: 821  VPFTSHEDHDFFQHVEMHLRSEHPPLCGRDHLSFRSY-----YFPVKNVIDGDLCEQFNS 875

Query: 1006 LSRTRMDEISKTMNVSVEELCKRVEEL 1032
            +   +   +S+ ++ +  E+ K++E++
Sbjct: 876  MEPNKQKNVSEELDRTPPEVSKKLEDI 902


>gi|119173562|ref|XP_001239205.1| hypothetical protein CIMG_10227 [Coccidioides immitis RS]
          Length = 1208

 Score =  203 bits (516), Expect = 4e-49,   Method: Compositional matrix adjust.
 Identities = 275/1180 (23%), Positives = 483/1180 (40%), Gaps = 185/1180 (15%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
            ++G I +L  FR  G ++D++ I ++  +  ++++   S     R   +   + G R   
Sbjct: 64   VFGIIRSLAAFRLAGSSKDYIIIGSDSGRIAIVEY-VPSQNRFNRIHLETFGKSGIRRVV 122

Query: 65   NGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YG 119
             GQ   +DP  R  LI     + L  V+  + + +L  +  +     Q L         G
Sbjct: 123  PGQYLAVDPKGRACLIASVEKNKLVYVLNRNAQAELTISSPLEAHRPQTLVFALTALDVG 182

Query: 120  CAKPTIVVLYQDNKDARHVKT-------------YEVALKDKDFVEGPWSQNNLDNGADL 166
               P    L  D  ++    T             YE+ L     V   W+ + +D  A +
Sbjct: 183  YENPIFAALEVDYTESDQDPTGAAYQEAEKLLVYYELDLGLNHVVRR-WA-DPVDRSAAM 240

Query: 167  LIPVP-----PPLCGVLIIGEETIVYCSAN--AFKA-IPIRPSITKAYGRVDADGSRYLL 218
            L  VP     P   GVL+ GE  I Y  +N  AF+  IP R   T+     + +  RY+ 
Sbjct: 241  LFQVPGGADGP--SGVLVCGEGNITYRHSNQDAFRVPIPRRSGPTE-----NPERKRYIT 293

Query: 219  GDHAGLLHLL---------------------VITHEKEKVTG----LKIELLGETSIAST 253
               AG++H +                     ++  E  ++TG    LK++      +AS+
Sbjct: 294  ---AGVVHKMRRAFFCLLQTEDGDLFKVTMDMVEDENGQLTGEVQRLKLKYFDTVPVASS 350

Query: 254  ISYLDNAVVYIGSSYGDSQLIKLN-----------------------LQP----DAKGSY 286
            +  L N  +++ S  G+    +                         L P          
Sbjct: 351  LCILKNGFLFVASETGNHHFYQFEKLGDDDEETEFSSDDFSADPSEPLAPVYFRPRPAEN 410

Query: 287  VEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG 346
            + ++E   +L P++   + +L      Q  T SG     + R +++G+ ++E    EL  
Sbjct: 411  LNLVESINSLNPLMSCKITNLTEDDAPQFYTLSGTGARSTFRTLKHGLEVSEIVESELPS 470

Query: 347  I-KGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAI 405
            +   +W+ + + +D +D ++++SF + T +L++   + +EE    GF S   TL      
Sbjct: 471  VPSAVWTTKLTRNDQYDAYIILSFSNGTLVLSIG--ETVEEVTDTGFLSSAPTLAVQQLG 528

Query: 406  YNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG- 464
             + L+QV    +R + +  R   NEW +P   S+  A  N  QV +A   G +VY E+  
Sbjct: 529  EDSLIQVHPKGIRHIHADRR--VNEWPAPQHRSIVAAATNERQVAVALSSGEIVYFEMDT 586

Query: 465  DGILTEV-KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKE 522
            DG L E  +  ++   ++CL +  +      S   AVG   D +VRI SL PD  L  K 
Sbjct: 587  DGSLAEYDEKREMSGTVTCLSLGEVPPGRVRSSFLAVGC-DDSTVRILSLDPDSTLENKS 645

Query: 523  HLGGEIIPRSV-LLCAFEGIS-----YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQ 576
                   P ++ ++   +  S     YL   L  G  L  +L+  TGEL+D +   LG +
Sbjct: 646  VQALTSAPSALSIMSMVDSTSGGSTLYLHIGLYSGIYLRTVLDEVTGELSDTRTRFLGLK 705

Query: 577  PITLRTFSSKNTTHVFAASDRPTVIYS--SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLA 634
            P+ L + S K    V A S RP + YS    K  + + ++   +     F+S    + + 
Sbjct: 706  PVKLFSVSVKEQRAVLALSSRPWLGYSDLQTKNFMLTPLDYVPLEWSWNFSSEQCVEGMV 765

Query: 635  IAKEGELTIGTIDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHF 693
              +   L I +I+ +   L   +IPL   PR      +   F +    N   +  ++   
Sbjct: 766  GIQGQNLRIFSIEKLDNNLLQETIPLAYTPRHFVRHPEQPLFYVIEADNNILSPSTKAKL 825

Query: 694  ----------VRLLDDQTFEF----------------------ISTYPLDTFEYGCSILS 721
                      V  L  + F +                      IS   L+  E   S+ +
Sbjct: 826  LQDSKAANGEVAELPPEDFGYPRGTGHWASCIQVVDPINSKAVISRIELEENEAAVSVAA 885

Query: 722  CSFSD-DSNVYYCVGTA---YVLPEENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVYS 775
              FS  D   +  VGT     V P  +  + G I ++   EDGK L+ I + + +   ++
Sbjct: 886  VPFSSQDDETFLVVGTGKDMVVYPPSS--SCGFIHIYRFQEDGKELEFIHKTKVESPPHA 943

Query: 776  LNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSEC-GHHGHILALYVQTRGDFIVVGDLM 834
            L AF G+LLA I + +++Y     D G ++L  +C       L + +QT+G  I+V D+ 
Sbjct: 944  LLAFQGRLLAGIGRNLRIY-----DLGMKQLLRKCQAEVVPRLIVGLQTQGSRIIVSDVQ 998

Query: 835  KSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGA 891
            +S++ ++YK++E  +   A D  A W +   ++D +   G +   NL+ +R   K SE A
Sbjct: 999  ESVTYVVYKYQENRLIPFADDVIARWTTCTAMVDYETVAGGDKFGNLWLLRCPQKASEEA 1058

Query: 892  TDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGT------------- 938
             ++  G   +    +L    NR    SL++     D   IPT I  T             
Sbjct: 1059 DEDGSGAHLIHERQYLQGAPNRL---SLMVHFYPQD---IPTSIQKTQLVAGGRDILVWT 1112

Query: 939  -VNGVIGVIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLD 996
             + G +G++   +  E   F + L+  L      + G +H  +RS+        AK  +D
Sbjct: 1113 GLQGTVGMLVPFVSREDVDFFQSLEMQLTSQTPPLAGRDHLIYRSY-----YAPAKGTID 1167

Query: 997  GDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRLH 1036
            GDL E++  L   +   I+  ++ SV E+ +++     LH
Sbjct: 1168 GDLCETYFTLPNDKKLMIAGELDRSVREIERKISVSPLLH 1207


>gi|361131929|gb|EHL03544.1| putative Pre-mRNA-splicing factor rse1 [Glarea lozoyensis 74030]
          Length = 967

 Score =  203 bits (516), Expect = 5e-49,   Method: Compositional matrix adjust.
 Identities = 234/957 (24%), Positives = 412/957 (43%), Gaps = 115/957 (12%)

Query: 155  WSQNNLDNGADLLIPVPPPL---CGVLIIGEETIVYCSAN--AFK-AIPIRPSITKAYGR 208
            W+ + +D  A +L  VP       GVL+ G++ I Y  +N  AF+ AIP R   T+   R
Sbjct: 41   WA-DAVDRTASVLFQVPGGTDGPSGVLVCGDDNITYRHSNQEAFRVAIPRRRGATEDPQR 99

Query: 209  VDA--DGSRYLLGDHAGLLHLLVITHEKE------------------KVTGLKIELLGET 248
              +   G  + L   AG    L+ T + +                  +V  LKI+     
Sbjct: 100  KRSIVAGVMHKLKGAAGAFFFLLQTEDGDLFKITIEMVEDDQGQPTGEVKRLKIKYFDTV 159

Query: 249  SIASTISYLDNAVVYIGSSYGDSQLIKL---------------NLQPDAKGSYVEV---- 289
             +AS++  L +  +++ S +GD Q  +                +   D    Y  V    
Sbjct: 160  PVASSLCILKSGFLFVASEFGDHQFYQFEKLGDDDEETEFVSDDFPTDPTEPYTPVYFHP 219

Query: 290  --------LERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQAS 341
                    +E   ++ P++D  V +L      Q+ +  G     + R +++G+ ++E   
Sbjct: 220  RQAENLSLVESIDSMNPLMDCKVANLTNADAPQIYSICGTGARSTFRSLKHGLEVSEIVE 279

Query: 342  VELQGI-KGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLF 400
             EL G+   +W+ + +  D +D ++++SF + T +L++   + +EE    GF S   TL 
Sbjct: 280  SELPGVPSAVWTTKLTQGDTYDAYIILSFSNGTLVLSIG--ETVEEVTDTGFLSTASTLA 337

Query: 401  CHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVY 460
                  + L+QV    +R + +  R   NEW +P   S+  AT N  QV +A   G +VY
Sbjct: 338  VQQLGEDGLIQVHPKGIRHIRADRR--VNEWAAPQHRSIVAATTNERQVAVALSSGEIVY 395

Query: 461  LEI-GDGILTEV-KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLN 517
             E+  DG L E  +  ++   ++CL +  + E    S   AVG   D +VRI SL PD  
Sbjct: 396  FEMDSDGSLAEYDEKKEMSGTVTCLSLGDVPEGRVRSPYLAVGC-DDSTVRILSLDPDST 454

Query: 518  LITKEHLGGEIIPRSVLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKV 571
            L  K        P ++ + A    S      YL   L  G  L  +L+  TGELTD +  
Sbjct: 455  LENKSVQALTSAPSALSIMAMADSSSGGSTLYLHIGLYSGVYLRTVLDEVTGELTDTRLR 514

Query: 572  SLGTQPITLRTFSSKNTTHVFAASDRPTVIYSS--NKKLLYSNVNLKEVSHMCPFNSAAF 629
             LG +P  L   S +  T V A S RP + Y+    K  + + +N   +     F+S   
Sbjct: 515  FLGPKPAKLFKVSVQGQTAVLALSSRPWLGYTDPVTKAFMLTPLNYPALEWGWNFSSEQC 574

Query: 630  PDSLAIAKEGELTIGTIDDI-QKLHIRSIPLGEHPRR-ICHQEQSRTFAICSLKNQSCAE 687
             + +   +   L I +I+ +   L   SIPL   PRR + H EQ   + I S  N     
Sbjct: 575  TEGMVGIQGQNLRIFSIEKLTDNLLQESIPLTYTPRRFVRHPEQPLFYTIESDNNVLSPA 634

Query: 688  ESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPT 747
              +    +LL+D +        L   E+G       ++       C+         +  T
Sbjct: 635  TKQ----KLLEDPSVVNGDAAVLPPEEFGYPRGRGHWAS------CISVV------DPVT 678

Query: 748  KGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQ 807
            + ++L  I  +G    + E        +L  F G+LLA + + +++Y   ++    R+ Q
Sbjct: 679  EKKVLHTIHFEGNEAAVEEPPM-----ALLGFQGRLLAGVGKDLRIYDLGMK-QLLRKSQ 732

Query: 808  SECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEIL 867
            +E   +   L + ++T+G  I+V D+ +S+ +++YK +E  +     D  A W S   ++
Sbjct: 733  AEVVPN---LIVGLRTQGSRIIVSDVQESVVMVVYKFQENRLIPFVDDTIARWTSCSTMV 789

Query: 868  DDDIYLGAENNFNLFTVR---KNSEGATDEERG------RLEVVGEYHLGEFVNRFRHGS 918
            D +   G +   NL+ +R   K SE A +E  G      R  + G  H    +  F    
Sbjct: 790  DYETVAGGDKFGNLWLLRCPAKASEEADEEGSGAHLLHERQYLQGAPHRLTLMAHFYSQD 849

Query: 919  LVMRLPDSD--VGQIPTVIFGTVNGVIGV-IASLPHEQYLFLEKLQTNLRKVIKGVGGLN 975
            + M +  ++  VG    +++  + G +G+ I  +  E   F + L+ ++R     + G +
Sbjct: 850  IPMSIQKTNLVVGGPDCLLWAGLQGTLGILIPFVGREDVDFFQTLEQHMRNEDAPLAGRD 909

Query: 976  HEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            H  +RS+      V  K  +DGDL E +  L   +   I+  ++ SV E+ +++ ++
Sbjct: 910  HLIYRSY-----YVPVKGMIDGDLCERYTLLPTDKKQMIAGELDRSVREIERKISDI 961


>gi|402077250|gb|EJT72599.1| pre-mRNA-splicing factor RSE1 [Gaeumannomyces graminis var. tritici
            R3-111a-1]
          Length = 1216

 Score =  203 bits (516), Expect = 5e-49,   Method: Compositional matrix adjust.
 Identities = 277/1176 (23%), Positives = 487/1176 (41%), Gaps = 178/1176 (15%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
            ++G I  L  FR  G  +D++ IA++  +  ++++    +      +        R    
Sbjct: 64   VFGIIRDLASFRLAGSNKDYIIIASDSGRITIIEYLPAKNRFHRIHLETFGKSGVRRVIP 123

Query: 66   GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YGC 120
            GQ    DP  R  LI       L  V+  +++ +L  +  +   +   L +  +    G 
Sbjct: 124  GQYLAADPKGRACLIASVERCKLVYVLNRNSQAELTISSPLEAHKNGTLTLSLVALDVGY 183

Query: 121  AKPTIVVLYQDNKDA---------RHVKT----YEVALKDKDFVEGPWSQNNLDNGADLL 167
            + P    L  D  DA         ++V+T    YE+ L     V   W  + +D  A++L
Sbjct: 184  SNPVFAALEIDYTDADQDSKSDATQNVETVLNYYELDLGLNHVVRK-WF-DVVDPTANML 241

Query: 168  IPVP-----PPLCGVLIIGEETIVYCSAN--AFKA-IPIRPSITKAYGRVD--ADGSRYL 217
              VP     P   GVL+ G+E I Y  +N  AF+  IP R   T+   R      G  + 
Sbjct: 242  FQVPGGSDGP--SGVLVCGQENITYRHSNQDAFRVPIPRRKGATEDPNRKRNIVSGVMHK 299

Query: 218  LGDHAGLLHLLVITHEKE------------------KVTGLKIELLGETSIASTISYLDN 259
            L   AG    L+ T + +                  +V  LKI+      ++S +  L +
Sbjct: 300  LKGSAGAFFFLLQTEDGDLFKVTIDMLEDAEGNTTGEVQRLKIKYFDTIPVSSNLCILKS 359

Query: 260  AVVYIGSSYGDSQLIKL---------------NLQPDAKGSY------------VEVLER 292
              +++ S +G+    +                N   D    Y            + ++E 
Sbjct: 360  GFLFVASEFGNHHFYQFEKLGDDDEELEFSSENFPSDPAEPYEPAYFYPRPTENLALVES 419

Query: 293  YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMW 351
              ++ P++D  V +L  +   Q+ T SG     + R++++G+ +NE  + +L G    +W
Sbjct: 420  VESMNPLMDLKVANLTDEDAPQIYTVSGNGARSTFRMLKHGLEVNEIVASQLPGTPSAVW 479

Query: 352  SLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQ 411
            + + + DD +D+++V+SF + T +L++   + +EE    GF S   TL       + LVQ
Sbjct: 480  TTKIARDDQYDSYIVLSFTNGTLVLSIG--ETVEEVSDTGFLSSVSTLAVQQLGEDGLVQ 537

Query: 412  VTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE 470
            V    +R + S    + NEW +P   S+  A  N  QV +A   G +VY E+  DG L E
Sbjct: 538  VHPRGIRHIRSG---VVNEWPTPQHRSIVAAATNERQVAVALSSGEIVYFEMDTDGSLAE 594

Query: 471  V-KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEI 528
              +  ++   ++ L +  + E    S   AVG   D +VRI SL P+  L +K       
Sbjct: 595  YDERKEMSGTVTSLSLGEVPEGRLRSSFLAVGC-DDCTVRILSLDPETTLESKSVQALTA 653

Query: 529  IPRSVLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRT 582
             P ++ + A E  S      YL   L  G  L  +L+  TGELTD ++  LG + + L  
Sbjct: 654  APSALSIMAMEDSSSGGTTLYLHIGLNSGVYLRTVLDEVTGELTDTRQKFLGPKAVRLFQ 713

Query: 583  FSSKNTTHVFAASDRPTVIYSS--NKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGE 640
             S +  T V A S R  + YS   +K    + ++ +E+     F S    + + +  +G+
Sbjct: 714  VSVQGMTCVLALSSRSWLGYSDPISKGFTMTPLSYEELEWAWNFRSEQCEEGM-VGVQGQ 772

Query: 641  L----TIGTIDD--IQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFV 694
                  IG + D  IQK    SI L   P+++        F     +N +   E     +
Sbjct: 773  FLRICAIGKLGDSMIQK----SISLAYTPKKLIKNPTHPIFYTIEAENNTLPPELREQLL 828

Query: 695  ----------RLLDDQTFEF---------------------------ISTYPLDTFEYGC 717
                      ++L  + F +                           +    L+  E   
Sbjct: 829  AAPTAVNGDTKVLPPEEFGYPRGNGRWASCISVVDPLGDGEEREPSVLQQIHLENNEATV 888

Query: 718  SILSCSF-SDDSNVYYCVGTAY-VLPEENEPTKGRILVF-IVEDGK-LQLIAEKETKGAV 773
            S+    F S DS  +  VGT   ++      T+G I V+  +EDG+ L+ I + + +   
Sbjct: 889  SVAVVPFASQDSESFLVVGTGKDMVLNPRCFTEGYIHVYRFLEDGRELEFIHKTKVEEPP 948

Query: 774  YSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDL 833
             +L AF GKL+A + + +++Y   LR    R+ QSE       + + +QT+G  IVVGD 
Sbjct: 949  MALLAFQGKLVAGVGRSLRIYDLGLRQL-LRKAQSEVAPR---VIVSLQTQGSRIVVGDS 1004

Query: 834  MKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATD 893
               +  + YK E   +   A D    W +   ++D +   G +   N++ +R   + + +
Sbjct: 1005 QHGLIYVAYKQEANKLIAFADDSIQRWTTCSTMVDYESTAGGDKFGNIWILRCPEKASQE 1064

Query: 894  EERGRLEVVGEYHLGEFVNRFRHGS-----LVMRLPDSD-----------VGQIPTVIFG 937
             ++   EV    HL      + HG+     L+  +   D           VG    +++G
Sbjct: 1065 ADQPGSEV----HLMH-ARDYLHGTSNRLALMAHVYTQDIATSICKTNLVVGGQEVLLWG 1119

Query: 938  TVNGVIGV-IASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLD 996
             + G IGV I  +  E   F + L+ ++R     + G +H  +RS+      V  K  +D
Sbjct: 1120 GIQGTIGVLIPFVSREDADFFQTLEQHMRSEDPPLAGRDHLMYRSY-----YVPVKGVID 1174

Query: 997  GDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            GDL E F  L   +   I+  ++ SV E+ +++ ++
Sbjct: 1175 GDLCERFTMLPNDKKQMIAGELDRSVREIERKISDI 1210


>gi|195586770|ref|XP_002083143.1| GD13507 [Drosophila simulans]
 gi|194195152|gb|EDX08728.1| GD13507 [Drosophila simulans]
          Length = 1227

 Score =  203 bits (516), Expect = 5e-49,   Method: Compositional matrix adjust.
 Identities = 261/1199 (21%), Positives = 471/1199 (39%), Gaps = 197/1199 (16%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            +L   I+G +  L  FR  G  +D++ + ++  +  +L+++   + L             
Sbjct: 53   LLSTEIFGCVRALMAFRLTGGTKDYIVVGSDSGRIVILEYNPSKNALEKVHQETFGKSGC 112

Query: 61   RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL- 117
            R    GQ   IDP  R  +IG      L  ++  D + +L  +  +   +   L    + 
Sbjct: 113  RRIVPGQYFAIDPKGRAVMIGAVEKQKLAYIMNRDTQARLTISSPLEAHKSNTLTYHMVG 172

Query: 118  --YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNGA 164
               G   P    L  D ++A    + + A + +    F E     N+        L+  A
Sbjct: 173  VDVGFDNPMFACLEIDYEEADMDPSGDAAQRTQQTLTFYELDLGLNHVVRKYSEPLEEHA 232

Query: 165  DLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR---- 215
            + L+ VP     P   GVLI  E  + Y   N      IR  I +    +D D  R    
Sbjct: 233  NFLVSVPGGNDGP--SGVLICSENYLTY--KNLGDQHDIRCPIPRRRNDLD-DPERGMIF 287

Query: 216  -------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVV 262
                         +LL    G +  + +  + + V+ +K++       A+ +  L    +
Sbjct: 288  ICSATHRTKSMYFFLLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPATAMCVLKTGFL 347

Query: 263  YIGSSYGDSQLIKLN------------------------LQPDAKGSYVEVLERYVNLGP 298
            ++ S +G+  L ++                           P A  + V +++   +  P
Sbjct: 348  FVASEFGNHYLYQIAHLGDDDDEPEFSSAMPLEEGETFFFAPRALKNLV-LVDELPSFAP 406

Query: 299  IVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSST 357
            I+   V DL  +   Q+    G     +LR++R+G+ ++E A  EL G    +W+++   
Sbjct: 407  IITSQVADLANEDTPQLYVLCGRGPRSTLRVLRHGLEVSEMAVSELPGNPNAVWTVKKRA 466

Query: 358  DDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSV 417
            DD FD +++VSF++ T +L++   + +EE    GF   T TL C     + LVQV    +
Sbjct: 467  DDEFDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLCCAALGDDALVQVYPDGI 524

Query: 418  RLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKHAQ 475
            R + S  R   NEWK+P   S+     N  QV++   G  LVY E+   G L E  + ++
Sbjct: 525  RHIRSDKR--VNEWKAPGKKSITKCAVNQRQVVITLSGRELVYFEMDPTGELNEYTERSE 582

Query: 476  LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLL 535
            +  EI C+ +  + E    S   AVG+ +D +VRI SL   N +T   +     P    L
Sbjct: 583  MPAEIMCMALGTVPEGEQRSWFLAVGL-SDNTVRILSLDPNNCLTPCSMQALPSPAES-L 640

Query: 536  CAFE-------------------------GISYLLCALGDGHLLNFLLNMKTGELTDRKK 570
            C  E                         G  YL   L +G LL  +L+  +G+L D + 
Sbjct: 641  CLVEMGHTESTTQGGLDDDAPAQRSGNNKGTIYLNIGLSNGVLLRTVLDPVSGDLADTRT 700

Query: 571  VSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFP 630
              LG++P+ L     + +  V A S R  + Y    +   + ++ + + +   F+S    
Sbjct: 701  RYLGSRPVKLFRIKMQGSEAVLAMSSRTWLSYYHQNRFHLTPLSYETLEYASGFSSEQCS 760

Query: 631  DSLAIAKEGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQEQSR-----------TFAI 677
            + +       L I  ++ +  +  + + PL   PR  + H +  R           T   
Sbjct: 761  EGIVAISTNTLRILALEKLGAVFNQVAFPLQYTPRTFVIHPDTGRMLIAETDHNAYTEDT 820

Query: 678  CSLKNQSCAEE-------------SEM-------------------------HFVRLLDD 699
             S + +  AEE              EM                           +R LD 
Sbjct: 821  KSARKEQMAEEMRSAAGDEERELAREMANAFINEVLPEDVFSSPKAGLGLWASQIRCLDA 880

Query: 700  QTFEFISTYPLDTFE--YGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVE 757
               + + + PL   E     ++L  S + D   Y  VG A  L      ++G  +     
Sbjct: 881  MHGQTMFSVPLTQNEAIMSMAMLKFSIAADGRYYLAVGIAKDLQLNPRISQGGCIDIYKI 940

Query: 758  D---GKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSEC-GHH 813
            D     L+ +   E      +L  F G+LLA   + +++Y     D G +++  +C   H
Sbjct: 941  DPTCSSLEFMHRTEIDEIPGALCGFQGRLLAGCGRMLRIY-----DFGKKKMLRKCENKH 995

Query: 814  GHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYL 873
                 + +Q  G  + V D+ +S+  + Y+  E  +   A D +  W++A  +LD D  +
Sbjct: 996  IPYQIVNIQAMGHRVYVSDVQESVFFIRYRRAENQLIIFADDTHPRWVTATTLLDYDT-I 1054

Query: 874  GAENNFNLFTVRKNSEGATDE------------ERGRL-------EVVGEYHLGEFVNRF 914
               + F   ++++     TD+            +RG L       E +  +H+GE +   
Sbjct: 1055 AIADKFGNLSIQRLPHSVTDDVDEDPTGTKSLWDRGLLSGASQKSENICSFHVGEIIMSL 1114

Query: 915  RHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGG 973
            +  +L+        G    +I+ T++G +G        E Y F + L+ ++R     + G
Sbjct: 1115 QKATLI-------PGGSEALIYATLSGTVGAFVPFTSREDYDFFQHLEMHMRNENPPLCG 1167

Query: 974  LNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
             +H  +RS          KN LDGDL E +L +   +   I+  M  +  ++CK++E++
Sbjct: 1168 RDHLSYRS-----SYYPVKNVLDGDLCEQYLSIEAAKQKSIAGDMFRTPNQICKKLEDI 1221


>gi|194864680|ref|XP_001971056.1| GG14635 [Drosophila erecta]
 gi|190652839|gb|EDV50082.1| GG14635 [Drosophila erecta]
          Length = 1227

 Score =  202 bits (515), Expect = 5e-49,   Method: Compositional matrix adjust.
 Identities = 262/1199 (21%), Positives = 469/1199 (39%), Gaps = 197/1199 (16%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            +L   I+G I  L  FR  G  +D++ + ++  +  +L+++A  + L             
Sbjct: 53   LLSTEIFGCIRALMAFRLTGGTKDYIVVGSDSGRIVILEYNASKNALEKVHQETFGKSGC 112

Query: 61   RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL- 117
            R    GQ   IDP  R  +IG      L  ++  D + +L  +  +   +   L    + 
Sbjct: 113  RRIVPGQYFAIDPKGRAVMIGAVEKQKLAYIMNRDTQARLTISSPLEAHKSNTLTYHMVG 172

Query: 118  --YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNGA 164
               G   P    L  D ++A    + + A + +    F E     N+        L+  A
Sbjct: 173  VDVGFDNPMFACLEIDYEEADMDPSGDAAQRTQQTLTFYELDLGLNHVVRKYSEPLEEHA 232

Query: 165  DLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR---- 215
            + L+ VP     P   GVLI  E  + Y   N      IR  I +    +D D  R    
Sbjct: 233  NFLVSVPGGNDGP--SGVLICSENYLTY--KNLGDQHDIRCPIPRRRNDLD-DPERGMIF 287

Query: 216  -------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVV 262
                         +LL    G +  + +  + + V+ +K++       A+ +  L    +
Sbjct: 288  ICSATHRTKSMYFFLLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPATAMCVLKTGFL 347

Query: 263  YIGSSYGDSQLIKLN------------------------LQPDAKGSYVEVLERYVNLGP 298
            ++ S +G+  L ++                           P A  + V +++   +  P
Sbjct: 348  FVASEFGNHYLYQIAHLGDDDDEPEFSSAMPLEEGETFFFAPRALKNLV-LVDELPSFAP 406

Query: 299  IVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSST 357
            I+   V DL  +   Q+    G     +LR++R+G+ ++E A  EL G    +W+++   
Sbjct: 407  IITSQVADLANEDTPQLYVLCGRGPRSTLRVLRHGLEVSEMAVSELPGNPNAVWTVKKRA 466

Query: 358  DDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSV 417
            DD FD +++VSF++ T +L++   + +EE    GF   T TL C     + LVQV    +
Sbjct: 467  DDEFDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLCCAALGDDALVQVYPDGI 524

Query: 418  RLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKHAQ 475
            R + S  R   NEWK+P   S+     N  QV++   G  LVY E+   G L E  + ++
Sbjct: 525  RHIRSDKR--VNEWKAPGKKSITKCAVNQRQVVITLSGRELVYFEMDPSGELNEYTERSE 582

Query: 476  LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLL 535
            +  EI C+ +  + +    S   AVG+  D +VRI SL   N +T   +     P    L
Sbjct: 583  MPAEIMCMALGTVPDGEQRSWFLAVGL-ADNTVRILSLDPNNCLTPCSMQALPSPAES-L 640

Query: 536  CAFE-------------------------GISYLLCALGDGHLLNFLLNMKTGELTDRKK 570
            C  E                         G  YL   L +G LL  +L+  +G+L D + 
Sbjct: 641  CLVEMGHTESTTQGALDDDAPAQRSGSNKGTIYLNIGLSNGVLLRTVLDPVSGDLADTRT 700

Query: 571  VSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFP 630
              LG++P+ L     +    V A S R  + Y    +   + ++ + + +   F+S    
Sbjct: 701  RYLGSRPVKLFRIKMQGAEAVLAMSSRTWLSYYHQNRFHLTPLSYETLEYASGFSSEQCS 760

Query: 631  DSLAIAKEGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQEQSR-----------TFAI 677
            + +       L I  ++ +  +  + + PL   PR  + H +  R           T   
Sbjct: 761  EGIVAISTNTLRILALEKLGAVFNQVAFPLQYTPRTFVIHPDTGRMLIAETDHNAYTEDT 820

Query: 678  CSLKNQSCAEE-------------SEM-------------------------HFVRLLDD 699
             S + +  AEE              EM                           +R LD 
Sbjct: 821  KSARKEQMAEEMRSAAGDEERELAREMANAFINEVLPEDVFSSPKAGLGLWASQIRCLDA 880

Query: 700  QTFEFISTYPLDTFE--YGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVE 757
               + +   PL   E     ++L  S + D   Y  VG A  L      ++G  +     
Sbjct: 881  MHGQTMFNVPLTQNEAIMSMAMLKFSIAADGRYYLAVGIAKDLQLNPRISQGGCIDIYKI 940

Query: 758  D---GKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSEC-GHH 813
            D     L+ +   E      +L  F G+LLA   + +++Y     D G +++  +C   H
Sbjct: 941  DPTCSSLEFMHRTEIDEIPGALCGFQGRLLAGCGRMLRIY-----DFGKKKMLRKCENKH 995

Query: 814  GHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYL 873
                 + +Q  G  + V D+ +S+  + Y+  E  +   A D +  W++A  +LD D  +
Sbjct: 996  IPYQIVNIQAMGHRVYVSDVQESVFFIRYRRAENQLIIFADDTHPRWVTATTLLDYDT-I 1054

Query: 874  GAENNFNLFTVRKNSEGATDE------------ERGRL-------EVVGEYHLGEFVNRF 914
               + F   ++++     TD+            +RG L       E +  +H+GE +   
Sbjct: 1055 AIADKFGNLSIQRLPHSVTDDVDEDPTGTKSLWDRGLLSGASQKSENICSFHVGEIIMSL 1114

Query: 915  RHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGG 973
            +  +L+        G    +I+ T++G +G        E Y F + L+ ++R     + G
Sbjct: 1115 QKATLI-------PGGSEALIYATLSGTVGAFVPFTSREDYDFFQHLEMHMRNENPPLCG 1167

Query: 974  LNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
             +H  +RS          KN LDGDL E +L +   +   I+  M  +  ++CK++E++
Sbjct: 1168 RDHLSYRS-----SYYPVKNVLDGDLCEQYLSIEAAKQKSIAGDMFRTPNQICKKLEDI 1221


>gi|195336406|ref|XP_002034829.1| GM14250 [Drosophila sechellia]
 gi|194127922|gb|EDW49965.1| GM14250 [Drosophila sechellia]
          Length = 1227

 Score =  202 bits (514), Expect = 7e-49,   Method: Compositional matrix adjust.
 Identities = 261/1199 (21%), Positives = 470/1199 (39%), Gaps = 197/1199 (16%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            +L   I+G +  L  FR  G  +D++ + ++  +  +L+++   + L             
Sbjct: 53   LLSTEIFGCVRALMAFRLTGGTKDYIVVGSDSGRIVILEYNPSKNALEKVHQETFGKSGC 112

Query: 61   RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL- 117
            R    GQ   IDP  R  +IG      L  ++  D + +L  +  +   +   L    + 
Sbjct: 113  RRIVPGQYFAIDPKGRAVMIGAVEKQKLAYIMNRDTQARLTISSPLEAHKSNTLTYHMVG 172

Query: 118  --YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNGA 164
               G   P    L  D ++A    + + A + +    F E     N+        L+  A
Sbjct: 173  VDVGFDNPMFACLEIDYEEADMDPSGDAAQRTQQTLTFYELDLGLNHVVRKYSEPLEEHA 232

Query: 165  DLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR---- 215
            + L+ VP     P   GVLI  E  + Y   N      IR  I +    +D D  R    
Sbjct: 233  NFLVSVPGGNDGP--SGVLICSENYLTY--KNLGDQHDIRCPIPRRRNDLD-DPERGMIF 287

Query: 216  -------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVV 262
                         +LL    G +  + +  + + V+ +K++       A+ +  L    +
Sbjct: 288  ICSATHRTKSMYFFLLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPATAMCVLKTGFL 347

Query: 263  YIGSSYGDSQLIKLN------------------------LQPDAKGSYVEVLERYVNLGP 298
            ++ S +G+  L ++                           P A  + V +++   +  P
Sbjct: 348  FVASEFGNHYLYQIAHLGDDDDEPEFSSAMPLEEGETFFFAPRALKNLV-LVDELPSFAP 406

Query: 299  IVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSST 357
            I+   V DL  +   Q+    G     +LR++R+G+ ++E A  EL G    +W+++   
Sbjct: 407  IITSQVADLANEDTPQLYVLCGRGPRSTLRVLRHGLEVSEMAVSELPGNPNAVWTVKKRA 466

Query: 358  DDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSV 417
            DD FD +++VSF++ T +L++   + +EE    GF   T TL C     + LVQV    +
Sbjct: 467  DDEFDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLCCAALGDDALVQVYPDGI 524

Query: 418  RLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKHAQ 475
            R + S  R   NEWK+P   S+     N  QV++   G  LVY E+   G L E  + ++
Sbjct: 525  RHIRSDKR--VNEWKAPGKKSITKCAVNQRQVVITLSGRELVYFEMDPTGELNEYTERSE 582

Query: 476  LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLL 535
            +  EI C+ +  + E    S   AVG+  D +VRI SL   N +T   +     P    L
Sbjct: 583  MPAEIMCMALGTVPEGEQRSWFLAVGL-ADNTVRILSLDPNNCLTPCSMQALPSPAES-L 640

Query: 536  CAFE-------------------------GISYLLCALGDGHLLNFLLNMKTGELTDRKK 570
            C  E                         G  YL   L +G LL  +L+  +G+L D + 
Sbjct: 641  CLVEMGHTESTTQGGLDDDAPAQRSGNNKGTIYLNIGLSNGVLLRTVLDPVSGDLADTRT 700

Query: 571  VSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFP 630
              LG++P+ L     + +  V A S R  + Y    +   + ++ + + +   F+S    
Sbjct: 701  RYLGSRPVKLFRIKMQGSEAVLAMSSRTWLSYYHQNRFHLTPLSYETLEYASGFSSEQCS 760

Query: 631  DSLAIAKEGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQEQSR-----------TFAI 677
            + +       L I  ++ +  +  + + PL   PR  + H +  R           T   
Sbjct: 761  EGIVAISTNTLRILALEKLGAVFNQVAFPLQYTPRTFVIHPDTGRMLIAETDHNAYTEDT 820

Query: 678  CSLKNQSCAEE-------------SEM-------------------------HFVRLLDD 699
             S + +  AEE              EM                           +R LD 
Sbjct: 821  KSARKEQMAEEMRSAAGDEERELAREMANAFINEVLPEDVFSSPKAGLGLWASQIRCLDA 880

Query: 700  QTFEFISTYPLDTFE--YGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVE 757
               + + + PL   E     ++L  S + D   Y  VG A  L      ++G  +     
Sbjct: 881  MHGQTMFSVPLTQNEAIMSMAMLKFSIAADGRYYLAVGIAKDLQLNPRISQGGCIDIYKI 940

Query: 758  D---GKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSEC-GHH 813
            D     L+ +   E      +L  F G+LLA   + +++Y     D G +++  +C   H
Sbjct: 941  DPTCSSLEFMHRTEIDEIPGALCGFQGRLLAGCGRMLRIY-----DFGKKKMLRKCENKH 995

Query: 814  GHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYL 873
                 + +Q  G  + V D+ +S+  + Y+  E  +   A D +  W++A  +LD D  +
Sbjct: 996  IPYQIVNIQAMGHRVYVSDVQESVFFIRYRRAENQLIIFADDTHPRWVTATTLLDYDT-I 1054

Query: 874  GAENNFNLFTVRKNSEGATDE------------ERGRL-------EVVGEYHLGEFVNRF 914
               + F   ++++     TD+            +RG L       E +  +H+GE +   
Sbjct: 1055 AIADKFGNLSIQRLPHSVTDDVDEDPTGTKSLWDRGLLSGASQKSENICSFHVGEIIMSL 1114

Query: 915  RHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGG 973
            +  +L+        G    +I+ T++G +G        E Y F + L+ ++R     + G
Sbjct: 1115 QKATLI-------PGGSEALIYATLSGTVGAFVPFTSREDYDFFQHLEMHMRNENPPLCG 1167

Query: 974  LNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
             +H  +RS          KN LDGDL E +L +   +   I+  M  +  ++CK++E++
Sbjct: 1168 RDHLSYRS-----SYYPVKNVLDGDLCEQYLSIEAAKQKSIAGDMFRTPNQICKKLEDI 1221


>gi|405121632|gb|AFR96400.1| hypothetical protein CNAG_03173 [Cryptococcus neoformans var. grubii
            H99]
          Length = 1276

 Score =  202 bits (514), Expect = 8e-49,   Method: Compositional matrix adjust.
 Identities = 225/912 (24%), Positives = 407/912 (44%), Gaps = 125/912 (13%)

Query: 186  VYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIEL- 244
            V+ S   F  + I  ++ + +G     G+  ++GD  G    L    EK    G+   + 
Sbjct: 353  VWRSRQGFGTV-IAATVIEDHG----SGASVVIGDEYGAFTALGWEFEKGLGAGMDGRVR 407

Query: 245  -----LGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQ------PDAKGSY------- 286
                 LG +S  S+I+YLD++ +++ S+  DS L++L         P  KG         
Sbjct: 408  VLRTYLGASSPPSSITYLDSSHLFVSSAVADSVLLRLPTVESSTNVPSGKGKGREVISPI 467

Query: 287  -----------------------VEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYK 323
                                   +E+LER++N+ P+ D C V  E      +V  SGA +
Sbjct: 468  GDQLDKWEVLYEIGKDRNDTDEGLEILERWMNIAPVKDLCAVKDEGGNLSHLVLASGASE 527

Query: 324  DGSLRIVRNGIGINEQASVE-LQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLED 382
              SLR+VR+G+G+ E  +++ L  ++ MWSL  ST  P    L++S  + T  + + L+ 
Sbjct: 528  SNSLRVVRSGVGLEELVTIQGLHDVQKMWSLTDSTAVPR---LLLS--TSTSTVLLQLQP 582

Query: 383  ELEETEIEGFCSQTQTLFCHDAIYNQLV-QVTSGSVRLVSSTS-RELRNEWKSPPGYSVN 440
            E+    I     +++TL        +L+ QVT   + L S  S  +L  + K   G  + 
Sbjct: 583  EISVIPIVDVIFKSETLAAGILPGAELLAQVTPRGLSLWSDLSVGQLEAQMKVDKGTEIV 642

Query: 441  VATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAV 500
             A   A   ++A  GG+LV   + D   +      ++ E+S + I+   ++ S   + A 
Sbjct: 643  CAQVTADWAVVAKKGGNLVVFHVSDTGFSPEGTIDVKEEVSAVAISNSSDSSSPIIVIAT 702

Query: 501  GMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEGISY--------LLCALGDG- 551
              WT  +  +++L  ++      + G  IP      + +  S+        LL  L +G 
Sbjct: 703  --WTAKTF-VYTLSQIS----NGVDGLSIPTKSSATSLQLRSHPFYPAGIQLLSGLDNGL 755

Query: 552  ---HLLNFLLNMKTGELT--DRKKVSLGTQPITLRTFSSKNTTHVFAA---SDRPTVIYS 603
               H L+   +   GEL     K  SLG +P+ L    S +      +   ++R +VI+ 
Sbjct: 756  LHIHDLDTSDSGDAGELMVKSSKSASLGLRPLVLHPCESTHGDEKVISVGLTERMSVIFE 815

Query: 604  SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG-EH 662
            S  ++ +S+VN+K +      N+++ P     ++   L++  I+ ++KLH+++   G E 
Sbjct: 816  SKGRVEFSSVNIKNIMAATAVNTSSGPVFALFSRTSGLSLVKINSLKKLHVQTCDTGNES 875

Query: 663  PRRICHQEQSRTFAICSLKNQSCAEES---EMHFVRLLDDQTFEFISTYPLDTFEYGCSI 719
              ++ + ++ +  A C L  ++   +    E +FV++ D  + E +S++ L   E   S+
Sbjct: 876  VSKLTYMDEYKAVA-CGLTRRTQLRDGDVEEENFVQIRDGTSLEPLSSFSLRGRELVTSL 934

Query: 720  LSCSFSDDSNVYYCVGTAYVLPEENEPT-----------KGRILVFIVEDGK------LQ 762
             S   +   ++Y  VGT  + P++ E +           +GR+L+   ++G       ++
Sbjct: 935  RSVFLT--GSIYLAVGTGILPPDDGEDSSWDEGNLAVVREGRVLLLEFKEGDAGSGWDIK 992

Query: 763  LIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHG-------H 815
            + AE  T GAVY+L   +G L  A   K+ +++    D    EL+               
Sbjct: 993  VKAELATVGAVYALEEIHGFLAVAAGSKLTIHR---LDHNPVELEETSSWASAYVISSLS 1049

Query: 816  ILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDD--DIYL 873
            +L   +      ++VGD M+S+ +L     +G I +  R+   + ++A+ +L D  D  +
Sbjct: 1050 VLPPSLMRPEGALIVGDGMRSVIVLNVDEGDGMIYDDERNMATHGVTALGLLKDKGDGVV 1109

Query: 874  GAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPT 933
             ++ + NL T R N          +LE    + L E V RF+ GSLV      ++  IP 
Sbjct: 1110 ISDAHSNLLTYRLNQ---------KLERAATFGLHEEVTRFQSGSLVPTTTAPEI-IIPD 1159

Query: 934  VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKN 993
            V+F T  G +G+I  L       L+ LQ N+ K+ KG G +    WR   +     D   
Sbjct: 1160 VLFATREGRLGIIGELGTRSSRTLDDLQRNMSKIWKGPGEVGWSNWRRAGSNLVGKDTAG 1219

Query: 994  FLDGDLIESFLD 1005
            F+DGD ++ FLD
Sbjct: 1220 FVDGDFVQKFLD 1231


>gi|226293297|gb|EEH48717.1| pre-mRNA-splicing factor rse1 [Paracoccidioides brasiliensis Pb18]
          Length = 1208

 Score =  202 bits (514), Expect = 8e-49,   Method: Compositional matrix adjust.
 Identities = 277/1161 (23%), Positives = 486/1161 (41%), Gaps = 161/1161 (13%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
            ++G I TL  FR  G ++D++ I ++  +  ++++   S     R   +   + G R   
Sbjct: 64   VFGIIRTLAAFRLAGGSKDYIIIGSDSGRITIIEY-VPSQNRFNRIHLETFGKSGIRRVI 122

Query: 65   NGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YG 119
             GQ   +DP  R  LI     + L  V+  + + +L  +  +   + Q L         G
Sbjct: 123  PGQYLAVDPKGRACLIASVEKNKLVYVLNRNTQAELTISSPLEAHKPQTLVFALTALDVG 182

Query: 120  CAKPTIVVLYQDNKDARHVKT-------------YEVALKDKDFVEGPWSQNNLDNGADL 166
               P    L  +  +A    T             YE+ L     V   WS + +D  A +
Sbjct: 183  YDNPIFAALEVEYTEADQDPTGQAYEELEKQLVYYELDLGLNHVVRK-WS-DPVDRSATM 240

Query: 167  LIPVPPPL---CGVLIIGEETIVYCSAN--AFKA-IPIR--PSITKAYGRVDADGSR--- 215
            L  VP       G L+  E+ I Y  +N  AF+  IP R  P+      R    G     
Sbjct: 241  LFQVPGGADGPSGCLVCAEDNITYRHSNQDAFRVPIPRRSGPTENPERKRCIVAGVMHKM 300

Query: 216  -----YLLGDHAGLLHLLVITHEKE---KVTG----LKIELLGETSIASTISYLDNAVVY 263
                 +LL    G L  + I   ++   ++TG    LK++      IAS++  L +  ++
Sbjct: 301  RGAFFFLLQTEDGDLFKVTIDMVEDDNGQITGEVRRLKLKYFDTVPIASSLCILKSGFLF 360

Query: 264  IGSSYGDS---QLIKLNLQPD----AKGSY--------------------VEVLERYVNL 296
            + S  G+    Q  KL    D       S+                    V ++E   +L
Sbjct: 361  VASETGNHHFYQFEKLGDDDDEIEFTSDSFSANPSEPLPPVYFHPRLSENVHLVENVNSL 420

Query: 297  GPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSLRS 355
             P++D  + +L      Q+ T  G     S R +++G+ + E    EL  +   +W+ + 
Sbjct: 421  NPLMDCKITNLTEDDAPQIYTICGTGARSSFRTLKHGLEVTEIVESELPSVPSAVWTSKL 480

Query: 356  STDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSG 415
            + +D FD ++++SF + T +L++   + +EE    GF S   TL       + L+QV   
Sbjct: 481  TREDEFDAYIILSFSNGTLVLSIG--ETVEEVTDTGFLSSAPTLAVQQLGEDSLIQVHPK 538

Query: 416  SVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTEV-KH 473
             +R + +  R   NEW +P   S+  AT N  QV +A   G +VY E+  DG L E  + 
Sbjct: 539  GIRHIHADRR--VNEWPAPQHRSIVAATTNERQVAVALSSGEIVYFEMDTDGSLAEYDEK 596

Query: 474  AQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPRS 532
             ++   ++CL +  + +  + S   AVG   D +VRI SL PD  L  K        P +
Sbjct: 597  REMSGTVTCLSLGEVPKGRARSSFLAVGC-DDSTVRILSLDPDSTLGNKSVQALTSAPSA 655

Query: 533  VLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSK 586
            + + A    +      YL   L  G  L  +L+  TGEL+D +   LG +P+ L   S K
Sbjct: 656  LSIMAMADSTSGGSTLYLHIGLYSGIYLRTVLDEVTGELSDTRTRFLGLKPVKLFQVSVK 715

Query: 587  NTTHVFAASDRPTVIYS--SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIG 644
                V A S RP + YS    K  + + ++   +     F+S    + +   +   L I 
Sbjct: 716  EQKAVLALSSRPWLGYSDLQTKAFMLTPLDYVGLEWGWNFSSEQCVEGMVGIQGQNLRIF 775

Query: 645  TIDDIQ-KLHIRSIPLGEHPRR-ICHQEQSRTFAI------------CSLKNQSCAEESE 690
            +I+ ++  L   +I L   PR+ I H EQ   + I              L N S A   +
Sbjct: 776  SIEKLENNLLQETISLTYTPRQFIKHPEQPLFYVIEADNNILSPATRTKLLNDSDAVNGD 835

Query: 691  M---------------HF---VRLLDD-QTFEFISTYPLDTFEYGCSILSCSF-SDDSNV 730
                            H+   ++++D   +   +S   L+  E   S+ + SF S D   
Sbjct: 836  ATVLPPEDFGYPRGTGHWASCIQIVDPVNSKSVVSQIELEENEAAVSVAAVSFSSQDDET 895

Query: 731  YYCVGTAY-VLPEENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVYSLNAFNGKLLAAI 787
            +  VGT   ++      + G I ++   EDGK L+ I + + +    +L  F G+LLA I
Sbjct: 896  FLVVGTGKDMVVNPRSCSAGFIHIYRFQEDGKELEFIHKTKVEQPPVALLGFQGRLLAGI 955

Query: 788  NQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEG 847
               +++Y   +R    R+ Q+    H   L + +QT+G  I+V D+ +S++ ++YK +E 
Sbjct: 956  GTDVRIYDLGMR-QMLRKCQASVVPH---LVVGLQTQGSRIIVSDVQESVTYVVYKSQEN 1011

Query: 848  AIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDEERG------- 897
             +     D  + W +   ++D +   G +   NL+ +R   K SE A ++  G       
Sbjct: 1012 RLIPFVDDVISRWTTCTTMVDYETVAGGDKFGNLWLLRCPAKASEEADEDGSGAHLIHER 1071

Query: 898  --------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGV-IAS 948
                    RL +V  ++  +     +   LV        G    +++  + G +G+ I  
Sbjct: 1072 QYLQGAPNRLNLVAHFYPQDLPTSIQKAQLV-------TGGRDILVWTGLQGTVGMLIPF 1124

Query: 949  LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSR 1008
            +  E+  F + L+  L      + G +H  +RS+        AK  +DGDL E++L L  
Sbjct: 1125 ISREEVDFFQSLEMQLAAQNPPLAGRDHLIYRSY-----YAPAKGTIDGDLCETYLLLPN 1179

Query: 1009 TRMDEISKTMNVSVEELCKRV 1029
             +  +I+  ++ SV E+ +++
Sbjct: 1180 DKKQQIAGELDRSVREIERKI 1200


>gi|60677959|gb|AAX33486.1| RE01065p [Drosophila melanogaster]
          Length = 1227

 Score =  202 bits (514), Expect = 9e-49,   Method: Compositional matrix adjust.
 Identities = 260/1199 (21%), Positives = 471/1199 (39%), Gaps = 197/1199 (16%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            +L   I+G +  L  FR  G  +D++ + ++  +  +L+++   + L             
Sbjct: 53   LLSTEIFGCVRALMAFRLTGGTKDYIVVGSDSGRIVILEYNPSKNALEKVHQETFGKSGC 112

Query: 61   RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL- 117
            R    GQ   IDP  R  +IG      L  ++  D + +L  +  +   +   L    + 
Sbjct: 113  RRIVPGQYFAIDPKGRAVMIGAVEKQKLAYIMNRDTQARLTISSPLEAHKSNTLTYHMVG 172

Query: 118  --YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNGA 164
               G   P +  L  D ++A    + + A + +    F E     N+        L+  A
Sbjct: 173  VDVGFDNPMLACLEIDYEEADMDPSGDAAQRTQQTLTFYELDLGLNHVVRKYSEPLEEHA 232

Query: 165  DLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR---- 215
            + L+ VP     P   GVLI  E  + Y   N      IR  I +    +D D  R    
Sbjct: 233  NFLVSVPGGNDGP--SGVLICSENYLTY--KNLGDQHDIRCPIPRRRNDLD-DPERGMIF 287

Query: 216  -------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVV 262
                         +LL    G +  + +  + + V+ +K++       A+ +  L    +
Sbjct: 288  ICSATHRTKSMYFFLLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPATAMCVLKTGFL 347

Query: 263  YIGSSYGDSQLIKLN------------------------LQPDAKGSYVEVLERYVNLGP 298
            ++ S +G+  L ++                           P A  + V +++   +  P
Sbjct: 348  FVASEFGNHYLYQIAHLGDDDDEPEFSSAMPLEEGETFFFAPRALKNLV-LVDELPSFAP 406

Query: 299  IVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSST 357
            I+   V DL  +   Q+    G     +LR++R+G+ ++E A  EL G    +W+++   
Sbjct: 407  IITSQVADLANEDTPQLYVLCGRGPRSTLRVLRHGLEVSEMAVSELPGNPNAVWTVKKRA 466

Query: 358  DDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSV 417
            DD FD +++VSF++ T +L++   + +EE    GF   T TL C     + LVQV    +
Sbjct: 467  DDEFDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLCCAALGDDALVQVYPDGI 524

Query: 418  RLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD-GILTE-VKHAQ 475
            R + S  R   NEWK+P   S+     N  QV++   G  LVY E+   G L E  + ++
Sbjct: 525  RHIRSDKR--VNEWKAPGKKSITKCAVNQRQVVITLSGRELVYFEMDPTGELNEYTERSE 582

Query: 476  LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLL 535
            +  EI C+ +  + E    S   AVG+  D +VRI SL   N +T   +     P    L
Sbjct: 583  MPAEIMCMALGTVPEGEQRSWFLAVGL-ADNTVRILSLDPNNCLTPCSMQALPSPAES-L 640

Query: 536  CAFE-------------------------GISYLLCALGDGHLLNFLLNMKTGELTDRKK 570
            C  E                         G  YL   L +G LL  +L+  +G+L D + 
Sbjct: 641  CLVEMGHTESTTQGGLDDDAPAQRSGNNKGTIYLNIGLSNGVLLRTVLDPVSGDLADTRT 700

Query: 571  VSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFP 630
              LG++P+ L     + +  V A S R  + Y    +   + ++ + + +   F+S    
Sbjct: 701  RYLGSRPVKLFRIKMQGSEAVLAMSSRTWLSYYHQNRFHLTPLSYETLEYASGFSSEQCS 760

Query: 631  DSLAIAKEGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQEQSR-----------TFAI 677
            + +       L I  ++ +  +  + + PL   PR  + H +  R           T   
Sbjct: 761  EGIVAISTNTLRILALEKLGAVFNQVAFPLQYTPRTFVIHPDTGRMLIAETDHNAYTEDT 820

Query: 678  CSLKNQSCAEE-------------SEM-------------------------HFVRLLDD 699
             S + +  AEE              EM                           +R LD 
Sbjct: 821  KSARKEQMAEEMRSAAGDEERELAREMANAFINEVLPEDVFSSPKAGLGLWASQIRCLDA 880

Query: 700  QTFEFISTYPLDTFE--YGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVE 757
               + + + PL   E     ++L  S + D   Y  VG A  L      ++G  +     
Sbjct: 881  MHGQTMFSVPLTQNEAIMSMAMLKFSIAADGRYYLAVGIAKDLQLNPRISQGGCIDIYKI 940

Query: 758  D---GKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSEC-GHH 813
            D     L+ +   +      +L  F G+LLA   + +++Y     D G +++  +C   H
Sbjct: 941  DPTCSSLEFMHRTDIDEIPGALCGFQGRLLAGCGRMLRIY-----DFGKKKMLRKCENKH 995

Query: 814  GHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYL 873
                 + +Q  G  + V D+ +S+  + Y+  E  +   A D +  W++A  +LD D  +
Sbjct: 996  IPYQIVNIQAMGHRVYVSDVQESVFFIRYRRAENQLIIFADDTHPRWVTATTLLDYDT-I 1054

Query: 874  GAENNFNLFTVRKNSEGATDE------------ERGRL-------EVVGEYHLGEFVNRF 914
               + F   ++++     TD+            +RG L       E +  +H+GE +   
Sbjct: 1055 AIADKFGNLSIQRLPHSVTDDVDEDPTGTKSLWDRGLLSGASQKSENICSFHVGEIIMSL 1114

Query: 915  RHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGG 973
            +  +L+        G    +I+ T++G +G        E Y F + L+ ++R     + G
Sbjct: 1115 QKATLI-------PGGSEALIYATLSGTVGAFVPFTSREDYDFFQHLEMHMRNENPPLCG 1167

Query: 974  LNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
             +H  +RS          KN LDGDL E +L +   +   I+  M  +  ++CK++E++
Sbjct: 1168 RDHLSYRS-----SYYPVKNVLDGDLCEQYLSIEAAKQKSIAGDMFRTPNQICKKLEDI 1221


>gi|326483043|gb|EGE07053.1| pre-mRNA-splicing factor rse1 [Trichophyton equinum CBS 127.97]
          Length = 1209

 Score =  202 bits (513), Expect = 9e-49,   Method: Compositional matrix adjust.
 Identities = 277/1164 (23%), Positives = 489/1164 (42%), Gaps = 161/1164 (13%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
            ++G I +L  FR  G ++D++ I ++  +  ++++   +     R   +   + G R   
Sbjct: 64   VFGIIRSLAAFRLAGSSKDYIIIGSDSGRITIVEY-VPAQNRFNRIHLETFGKSGVRRVV 122

Query: 65   NGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEE-------LQVLDIK 115
             GQ   +DP  R  LI     + L  V+  + + +L  +  +           L  LD+ 
Sbjct: 123  PGQYLAVDPKGRACLIASVEKNKLVYVLNRNAQAELTISSPLEAHRPQTVVFALTALDVG 182

Query: 116  F---LYGCAKPTIVVLYQDNKDARHVKT------YEVALKDKDFVEGPWSQNNLDNGADL 166
            +   ++   +     + QD     + +T      YE+ L     V   W+ + +D  A +
Sbjct: 183  YENPIFAALEVEYTEVDQDPTGQAYEETEKMLVYYELDLGLNHVVRR-WA-DPVDRTASM 240

Query: 167  LIPVPPPL---CGVLIIGEETIVYCSAN--AFKA-IPIR--PSITKAYGRVDADGSR--- 215
            L  VP       GVL+  E+ IVY  +N  AF+  IP R  P+      R    G     
Sbjct: 241  LFQVPGGADGPSGVLVCAEDNIVYRHSNQDAFRVPIPRRRGPTENPERKRCITAGVMHKM 300

Query: 216  -----YLLGDHAGLLH---LLVITHEKEKVTG----LKIELLGETSIASTISYLDNAVVY 263
                 +LL    G L    + ++  E EK TG    LK++      +AS++  L +  ++
Sbjct: 301  RGAFFFLLQSEDGDLFKVTMEMVEDENEKATGEVKRLKLKYFDTVPLASSLCILKSGFLF 360

Query: 264  IGSSYGD-----------------------SQLIKLNLQP----DAKGSYVEVLERYVNL 296
            + S  G+                       S +I   L P          + ++E   +L
Sbjct: 361  VASETGNQHFYQFEKLGDDDDEIEFISDDYSAVISEPLPPVYFRPRPAENLNLVESIASL 420

Query: 297  GPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSLRS 355
             P++   + ++  +   Q+ T  G     S R +++G+ ++E    EL  +   +W+ + 
Sbjct: 421  NPLMAASITNITEEDAPQIYTLCGTGARSSFRTLKHGLEVSEIVESELPSVPSAVWTTKL 480

Query: 356  STDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSG 415
            S +D FD ++V+SF + T +L++   + +EE    GF S   TL       + L+QV   
Sbjct: 481  SRNDQFDAYIVLSFSNGTLVLSIG--ETVEEVTDTGFLSSAPTLAVQQLGEDSLIQVHPK 538

Query: 416  SVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTEV-KH 473
             +R + +  R   NEW +P   S+  AT N  QV +A   G +VY E+  DG L E  + 
Sbjct: 539  GIRHIHADQR--VNEWPAPQHRSIVAATTNERQVAIALSSGEIVYFEMDTDGSLAEYDEK 596

Query: 474  AQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPRS 532
             Q+   ++CL +  + E    S   AVG   D +VRI SL PD  L  K        P +
Sbjct: 597  RQMSGTVTCLSLGEVPEGRVRSSFLAVGC-DDSTVRILSLDPDSTLENKSVQALTSAPSA 655

Query: 533  V-LLCAFEGIS-----YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSK 586
            + ++   +  S     YL   L  G  L  +L+  TGEL+D +   LG +P+ L + S K
Sbjct: 656  LSIMSMIDSTSGGSTLYLHIGLYSGIYLRTVLDEVTGELSDTRTRFLGVKPVKLFSVSVK 715

Query: 587  NTTHVFAASDRPTVIYS--SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIG 644
                V A S R  + YS    K    + +N   +     F+S    + +   +   L I 
Sbjct: 716  EQRAVLALSSRSWLGYSDVQTKSFTLTPLNYVGLEWSWNFSSEQCVEGMVGIQGQNLRIF 775

Query: 645  TIDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQ--SCAEESEM---------- 691
            +I+ +   L    IPL   PR      +   F +    N   S A ++++          
Sbjct: 776  SIEKLDNNLLQEPIPLAYTPRNFVRHPEYPLFYVIGSDNNILSPATKAKLLSESTAVNGD 835

Query: 692  ----------------HF---VRLLDD-QTFEFISTYPLDTFEYGCSILSCSF-SDDSNV 730
                            H+   ++++D   T   +S   L+  E   SI + SF S +   
Sbjct: 836  SAELPPEDFGYPRGTNHWASSIQVVDPIHTKSVLSNLELEDNEAAVSIAAVSFTSQEDET 895

Query: 731  YYCVGTAY-VLPEENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVYSLNAFNGKLLAAI 787
            +  VGT   ++      T G I ++   E+GK L+ I + + +    +L  F G+LLA I
Sbjct: 896  FLVVGTGKDMVVSPRTFTCGFIHIYRFQEEGKELEFIHKTKVEQPPLALLGFQGRLLAGI 955

Query: 788  NQKIQLYKWMLRDDGTRELQSEC-GHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEE 846
               +++Y     D G R+L  +C       + + +QT+G  I+V D+ +S++ ++YK++E
Sbjct: 956  GPDLRIY-----DLGMRQLLRKCQAQITPRVIVGLQTQGSRIIVSDVQESVTYVVYKYQE 1010

Query: 847  GAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDEERGRLEVVG 903
             A+   A D    W +   ++D +   G +   N++ +R   K SE A ++  G   +  
Sbjct: 1011 NALIPFADDIIPRWTTCTTMVDYETVAGGDKFGNIWLLRCPTKASEEADEDGSGAHLIHE 1070

Query: 904  EYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGT--------------VNGVIGV-IAS 948
              +L    NR    SLV+     D   IPT I  T              + G +G+ +  
Sbjct: 1071 RQYLQGAPNRL---SLVIHFYSQD---IPTSIQKTQLVAGGRDILVWTGLQGTVGMFVPF 1124

Query: 949  LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSR 1008
            +  +   F + L+  L      + G +H  +R +         K  +DGDL E+FL L  
Sbjct: 1125 ITRDDVDFFQTLEMQLASQNPPLAGRDHLIYRGY-----YAPCKGVIDGDLCETFLLLPN 1179

Query: 1009 TRMDEISKTMNVSVEELCKRVEEL 1032
             +   I+  ++ SV E+ +++ ++
Sbjct: 1180 DKKQAIAGELDRSVREIERKISDM 1203


>gi|413935524|gb|AFW70075.1| hypothetical protein ZEAMMB73_605375 [Zea mays]
          Length = 1229

 Score =  202 bits (513), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 261/1186 (22%), Positives = 472/1186 (39%), Gaps = 179/1186 (15%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            +L V ++G I +L  FR  G  +D+L + ++  +  +L++  + + L             
Sbjct: 64   LLSVDVFGAIRSLAQFRLTGANKDYLVVGSDSGRLVILEYSPDRNRLDKVHQETFGKSGC 123

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLY-- 118
            R    GQ+  +DP  R + +   +    V   +     +   +  LE  +   + F    
Sbjct: 124  RRIVPGQLLAVDPKGRALCIAALEKQKLVYVLNRDAAARLTISSPLEAHKSNTLTFALAA 183

Query: 119  ---GCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNGA 164
               G   P    +  +  ++    T + A + +    F E     N+        +DNGA
Sbjct: 184  LDCGFDNPVFAAIELEYAESDRDPTGQAANQAQKLLTFYELDLGLNHVSRKVSEPIDNGA 243

Query: 165  DLLIPVP-----PPLCGVLIIGEETIVYCSANAFK---AIPIRPSITKAYGRVDADGSR- 215
            +LL+ VP     P   GVL+  +  ++Y +    +    IP R  +    G +    +  
Sbjct: 244  NLLVTVPGGGDGP--SGVLVCCDNFVLYRNQGHPEIRAVIPRRADLPAERGVLIVAAATH 301

Query: 216  -------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSY 268
                   +LL    G +  + + H  + V+ L+I+      + S +  L +  ++  S +
Sbjct: 302  KQKSMFFFLLQTEYGDIFKVDLDHNGDAVSELRIKYFDTIPVTSAVCVLRSGFLFAASEF 361

Query: 269  GDSQLIKLN--------------------------LQPDAKGSYVEVLERYVNLGPIVDF 302
            G+  L +                             QP A  + + + E   +L P++D 
Sbjct: 362  GNHALYQFRDIGREADVESSSATLMETEEGFQPVFFQPRALKNLMRIDE-IESLMPVMDM 420

Query: 303  CVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIK-GMWSLRSSTDDPF 361
             V +L  +   Q+ T  G     +LRI+R G+ I+E A   L      +W+++ + +D F
Sbjct: 421  RVANLFDEETPQLFTACGRGPRSTLRILRPGLAISEMARSMLPAEPIAVWTVKKNINDMF 480

Query: 362  DTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVS 421
            D ++VVSF + T +L++   + +EE     F   T +L       + L+QV    +R + 
Sbjct: 481  DAYIVVSFTNVTLVLSIG--ETIEEVSDSQFLDTTHSLAVTLLGEDSLMQVHPNGIRHIR 538

Query: 422  STSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTEVKHAQLEYEI 480
               R   NEW++P   ++    +N  QV++A  GG L+Y E+   G L EV+   +  ++
Sbjct: 539  EDGRV--NEWRTPGKKTITKVGSNRLQVVIALSGGELIYFEMDMTGQLMEVEKQDMSGDV 596

Query: 481  SCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPRSVLLCAFE 539
            +CL I P+ E    S+  AVG + D ++RI SL PD  L           P S+L    +
Sbjct: 597  ACLAIAPVPEGRQRSRFLAVGSY-DNTIRILSLDPDDCLQPLSVQSVSSAPESLLFLEVQ 655

Query: 540  -------GISY-----LLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKN 587
                   G  Y     L   L +G L    ++M TG+L+D +   LG +P  L      +
Sbjct: 656  ASVGGEDGADYPANLFLNAGLQNGVLFRTNVDMVTGQLSDTRSRFLGLRPPKLFPCIVSH 715

Query: 588  TTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTID 647
               +   S RP + Y      L + ++   +     F+S    + +       L I TI+
Sbjct: 716  RQAMLCLSSRPWLGYIHQGHFLLTPLSCDTLESAASFSSDQCSEGVVAVAGDALRIFTIE 775

Query: 648  DI-QKLHIRSIPLGEHPRRICHQEQSRTFAIC-SLKNQSCAEESEMHFVRLLD------- 698
             + +  +  +IPL   PR+     + +  A+  S K    AEE E      LD       
Sbjct: 776  RLGETFNETAIPLRYTPRKFVILPKKKYIAVIESDKGAFSAEEREAAKKECLDASGAAEN 835

Query: 699  --------------DQTFEFISTYPLDTFEY----------------------------- 715
                              E  +T+P + + Y                             
Sbjct: 836  GNANNGDPMENGDGQDGAEEGNTFPDEQYGYPKAESERWVSCIRILDPRSRDTTCLLELQ 895

Query: 716  ----GCSILSCSFSD-DSNVYYCVGTAYVLPEENEPTKG----RILVFIVEDGKLQLIAE 766
                  SI + +F D +      +GTA  L      T       +  F+ E   L+L+ +
Sbjct: 896  DNEAAVSICTVNFHDKEHGTLLAIGTAKGLQFWPRRTLAGGFIHVYKFVDEGRSLELLHK 955

Query: 767  KETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY-VQTRG 825
             + +    +L  F G+LLA +   ++LY     D G R+L  +C +      L  + T  
Sbjct: 956  TQVEEVPLALCQFQGRLLAGVGSVLRLY-----DLGKRKLLRKCENKLFPRTLVSIHTYR 1010

Query: 826  DFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR 885
            D I VGD+ +S     Y+ +E  +   A D    W++  + +D D   GA+   N++  R
Sbjct: 1011 DRIYVGDMQESFHYCKYRRDENQLYIFADDSVPRWLTTAQHIDFDTMAGADKFGNIYFAR 1070

Query: 886  ------------------KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSD 927
                              K  +G  +    ++E + ++H+G+ V   +  SL+       
Sbjct: 1071 LPQDISDEIEEDPTGGKIKWEQGKLNGAPNKVEEIVQFHVGDVVTCLQKASLI------- 1123

Query: 928  VGQIPTVIFGTVNGVIG-VIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEK 986
             G    +I+GTV G +G ++A    E   F   L+ +LR+    + G +H  +RS     
Sbjct: 1124 PGGGECLIYGTVMGSVGALLAFTSREDVDFFSHLEMHLRQEHPPLCGRDHMAYRS----- 1178

Query: 987  KTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
                 K+ +DGDL E +  L      +I+  ++ +  E+ K++E++
Sbjct: 1179 AYFPVKDVIDGDLCEQYPSLPADMQRKIADELDRTPGEILKKLEDI 1224


>gi|317143715|ref|XP_001819645.2| pre-mRNA-splicing factor rse1 [Aspergillus oryzae RIB40]
          Length = 1209

 Score =  201 bits (512), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 277/1177 (23%), Positives = 492/1177 (41%), Gaps = 187/1177 (15%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
            ++G I +L  FR  G  +D++ I ++  +  ++++   S     R   +   + G R   
Sbjct: 64   VFGIIRSLAAFRLAGSNKDYIIIGSDSGRITIIEY-VPSQNRFNRIHLETFGKSGVRRVI 122

Query: 65   NGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YG 119
             GQ   +DP  R  LI     + L  V+  +++ +L  +  +   + Q L         G
Sbjct: 123  PGQYLAVDPKGRACLIASVEKNKLVYVLNRNSQAELTISSPLEAHKPQTLVFAMSALDVG 182

Query: 120  CAKPTIVVLYQDNKDARHVKT---YEVALK---------DKDFVEGPWSQNNLDNGADLL 167
               P    L  D  ++    T   YE A K           + V   W+ + +D  + +L
Sbjct: 183  YENPIFAALEVDYSESDQDPTGQAYEEAEKLLVYYELDLGLNHVVRKWA-DPVDRTSSML 241

Query: 168  IPVP-----PPLCGVLIIGEETIVYCSAN--AFKA-IPIRPSITKAYGRVDADGSRYLLG 219
              VP     P   GVL+  E+++ Y  +N  AF+  IP R   T+     + +  R++  
Sbjct: 242  FQVPGGADGP--SGVLVCAEDSVTYRHSNQDAFRVPIPRRSGPTE-----NPERKRFIT- 293

Query: 220  DHAGLLHLL---------------------VITHEKEKVTG----LKIELLGETSIASTI 254
              AG++H +                     ++  +  ++TG    LKI+      +AS++
Sbjct: 294  --AGVMHKMRGAFFFLLQTEDGDLFKLNIDMVEDDNGQLTGEVKRLKIKYFDTVPVASSL 351

Query: 255  SYLDNAVVYIGSSYGDSQLIKL---------------NLQPDAK------------GSYV 287
              L +  +Y+ S  G+    +                N   D                 +
Sbjct: 352  LILKSGFLYVASEAGNHHFYQFEKLGDDDEEIEFSSENFSADPSVPLEPIYFRPRSAENL 411

Query: 288  EVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI 347
             ++E   +L P++D  V +L  +   Q+ T SG     + R +++G+ ++E    EL  +
Sbjct: 412  NLVETINSLNPLIDSKVANLSEEDAPQIYTISGTGARSTFRTLKHGLEVSEIVDSELPSV 471

Query: 348  -KGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIY 406
               +W+ + +  D FD ++++SF + T +L++   + +EE    GF S   TL       
Sbjct: 472  PSAVWTTKLTRADEFDAYIILSFANGTLVLSIG--ETVEEVTDTGFLSSAPTLAVQQLGE 529

Query: 407  NQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GD 465
            + L+Q+    +R + +  R   NEW +P   S+  A  N  QV +A   G +VY E+  D
Sbjct: 530  DSLIQIHPRGIRHIMADRR--VNEWPAPQHRSIVAAATNERQVAVALSSGEIVYFEMDAD 587

Query: 466  GILTEV-KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEH 523
            G L E  +  Q+   ++CL +  + E    S   AVG   D +VRI SL PD  L  K  
Sbjct: 588  GTLAEYDERRQMSGTVTCLSLGEVPEGRVRSSFLAVGC-DDSTVRILSLDPDSTLENKSV 646

Query: 524  LGGEIIPRSVLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQP 577
                  P ++ + +    S      YL   L  G  L  +L+  TGEL+D +   LG++P
Sbjct: 647  QALTAAPSALNIMSMSDSSSGGTALYLHIGLYSGVYLRTVLDEVTGELSDTRTRFLGSKP 706

Query: 578  ITLRTFSSKNTTHVFAASDRPTVIYS--SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAI 635
            + L   S K  T V A S RP + YS    K  + + ++   +     F+S    + +  
Sbjct: 707  VKLFQVSVKGQTAVLALSSRPWLGYSDIQTKGFMLTPLDYVPLEWGWNFSSEQCLEGMVG 766

Query: 636  AKEGELTIGTIDDI-QKLHIRSIPLGEHPRRIC-HQEQSRTFAICS----LKNQSCAEES 689
             +   L I +I+ +   +  +SI L   PRR   H EQ   + I S    L   + A+  
Sbjct: 767  IQGQNLRIFSIEKLDNNMLQQSISLAYTPRRFLKHPEQPLFYVIESDNNVLSPSTRAKLL 826

Query: 690  EMHFVRLLDDQTF---EF------------------------ISTYPLDTFEYGCSILSC 722
            E    R  D+      EF                        +ST  L+  E   S+ + 
Sbjct: 827  EDSKARTGDETVLPPEEFGYPRGSGHWASCIQVVDPVHAKAVVSTIELEENEAAVSVAAV 886

Query: 723  SF-SDDSNVYYCVGTAYVLPEENEPTK--GRILVF-IVEDGK-LQLIAEKETKGAVYSLN 777
             F S D   +  VGTA  +   N P+   G I ++   EDG+ L+ I + + +    +L 
Sbjct: 887  PFTSQDDETFLVVGTAKDM-NVNPPSSAGGYIHIYRFQEDGRELEFIHKTKVEEPPLALL 945

Query: 778  AFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHH---GHILALYVQTRGDFIVVGDLM 834
             F G+L+A I   +++Y     D G ++L  +C        I+ L  QT+G  IVV D+ 
Sbjct: 946  GFQGRLVAGIGPMLRIY-----DLGMKQLLRKCNAQVVPKTIVGL--QTQGSRIVVSDVR 998

Query: 835  KSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGA 891
            +S++ ++YK++E  +     D  + W ++  ++D +   G +   N++ +R   K SE A
Sbjct: 999  ESVTYVVYKYQENVLIPFVDDSVSRWTTSTTMVDYETTAGGDKFGNIWMLRCPKKISEQA 1058

Query: 892  TDE--------ERG-------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIF 936
             ++        ERG       RLE++   +  +         LV        G    +++
Sbjct: 1059 DEDGSGAHLIHERGYLHGTPNRLELMIHVYTQDIPTTLHKTQLV-------AGGRDILVW 1111

Query: 937  GTVNGVIGVIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFL 995
               +G IG++   +  E   F + L+  L      + G +H  +RS+    K V     +
Sbjct: 1112 SGFHGTIGMLVPFVSREDVDFFQNLEMQLAAQNPPLAGRDHLIYRSYYAPVKGV-----I 1166

Query: 996  DGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            DGDL E++  L       I+  ++ SV E+ +++ ++
Sbjct: 1167 DGDLCETYFLLPNDTKMMIAAELDRSVREIERKISDM 1203


>gi|225683909|gb|EEH22193.1| pre-mRNA-splicing factor rse1 [Paracoccidioides brasiliensis Pb03]
          Length = 1209

 Score =  201 bits (512), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 276/1164 (23%), Positives = 488/1164 (41%), Gaps = 161/1164 (13%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
            ++G I TL  FR  G ++D++ I ++  +  ++++   S     R   +   + G R   
Sbjct: 64   VFGIIRTLAAFRLAGGSKDYIIIGSDSGRITIIEY-VPSQNRFNRIHLETFGKSGIRRVI 122

Query: 65   NGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YG 119
             GQ   +DP  R  LI     + L  V+  + + +L  +  +   + Q L         G
Sbjct: 123  PGQYLAVDPKGRACLIASVEKNKLVYVLNRNTQAELTISSPLEAHKPQTLVFALTALDVG 182

Query: 120  CAKPTIVVLYQDNKDARHVKT-------------YEVALKDKDFVEGPWSQNNLDNGADL 166
               P    L  +  +A    T             YE+ L     V   WS + +D  A +
Sbjct: 183  YDNPIFAALEVEYTEADQDPTGQAYEELEKQLVYYELDLGLNHVVRK-WS-DPVDRSATM 240

Query: 167  LIPVPPPL---CGVLIIGEETIVYCSAN--AFKA-IPIR--PSITKAYGRVDADGSR--- 215
            L  VP       G L+  E+ I Y  +N  AF+  IP R  P+      R    G     
Sbjct: 241  LFQVPGGADGPSGCLVCAEDNITYRHSNQDAFRVPIPRRSGPTENPERKRCIVAGVMHKM 300

Query: 216  -----YLLGDHAGLLHLLVITHEKE---KVTG----LKIELLGETSIASTISYLDNAVVY 263
                 +LL    G L  + I   ++   ++TG    LK++      IAS++  L +  ++
Sbjct: 301  RGAFFFLLQTEDGDLFKVTIDMVEDDNGQITGEVRRLKLKYFDTVPIASSLCILKSGFLF 360

Query: 264  IGSSYGDS---QLIKLNLQPD----AKGSY--------------------VEVLERYVNL 296
            + S  G+    Q  KL    D       S+                    V ++E   +L
Sbjct: 361  VASETGNHHFYQFEKLGDDDDEIEFTSDSFSANPSEPLPPVYFHPRLSENVHLVENVNSL 420

Query: 297  GPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSLRS 355
             P++D  + +L      Q+ T  G     S R +++G+ + E    EL  +   +W+ + 
Sbjct: 421  NPLMDCKITNLTEDDAPQIYTICGTGARSSFRTLKHGLEVTEIVESELPSVPSAVWTSKL 480

Query: 356  STDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSG 415
            + +D FD ++++SF + T +L++   + +EE    GF S   TL       + L+QV   
Sbjct: 481  TREDEFDAYIILSFSNGTLVLSIG--ETVEEVTDTGFLSSAPTLAVQQLGEDSLIQVHPK 538

Query: 416  SVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTEV-KH 473
             +R + +  R   NEW +P   S+  AT N  QV +A   G +VY E+  DG L E  + 
Sbjct: 539  GIRHIHADRR--VNEWPAPQHRSIVAATTNERQVAVALSSGEIVYFEMDTDGSLAEYDEK 596

Query: 474  AQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPRS 532
             ++   ++CL +  + +  + S   AVG   D +VRI SL PD  L  K        P +
Sbjct: 597  REMSGTVTCLSLGEVPKGRARSSFLAVGC-DDSTVRILSLDPDSTLGNKSVQALTSAPSA 655

Query: 533  VLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSK 586
            + + A    +      YL   L  G  L  +L+  TGEL+D +   LG +P+ L   S K
Sbjct: 656  LSIMAMADSTSGGSTLYLHIGLYSGIYLRTVLDEVTGELSDTRTRFLGLKPVKLFQVSVK 715

Query: 587  NTTHVFAASDRPTVIYS--SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIG 644
                V A S RP + YS    K  + + ++   +     F+S    + +   +   L I 
Sbjct: 716  EQKAVLALSSRPWLGYSDLQTKAFMLTPLDYVGLEWGWNFSSEQCVEGMVGIQGQNLRIF 775

Query: 645  TIDDIQ-KLHIRSIPLGEHPRR-ICHQEQSRTFAI------------CSLKNQSCAEESE 690
            +I+ ++  L   +I L   PR+ I H EQ   + I              L N S A   +
Sbjct: 776  SIEKLENNLLQETISLTYTPRQFIKHPEQPLFYVIEADNNILSPATRTKLLNDSDAVNGD 835

Query: 691  M---------------HF---VRLLDD-QTFEFISTYPLDTFEYGCSILSCSFSD-DSNV 730
                            H+   ++++D   +   +S   L+  E   S+ + SFS  D   
Sbjct: 836  ATVLPPEDFGYPRGTGHWASCIQIVDPVNSKSVVSQIELEENEAAVSVAAVSFSSQDDET 895

Query: 731  YYCVGTAY-VLPEENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVYSLNAFNGKLLAAI 787
            +  VGT   ++      + G I ++   EDGK L+ I + + +    +L  F G+LLA I
Sbjct: 896  FLVVGTGKDMVVNPRSCSAGFIHIYRFQEDGKELEFIHKTKVEQPPVALLGFQGRLLAGI 955

Query: 788  NQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEG 847
               +++Y   +R    R+ Q+    H   L + +QT+G  I+V D+ +S++ +++K +E 
Sbjct: 956  GTDVRIYDLGMR-QMLRKCQASVVPH---LVVGLQTQGSRIIVSDVQESVTYVVFKSQEN 1011

Query: 848  AIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDEERG------- 897
             +     D  + W +   ++D +   G +   NL+ +R   K SE A ++  G       
Sbjct: 1012 RLIPFVDDVISRWTTCTTMVDYETVAGGDKFGNLWLLRCPAKASEEADEDGSGAHLIHER 1071

Query: 898  --------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGV-IAS 948
                    RL +V  ++  +     +   LV        G    +++  + G +G+ I  
Sbjct: 1072 QYLQGAPNRLNLVAHFYPQDLPTSIQKAQLV-------TGGRDILVWTGLQGTVGMLIPF 1124

Query: 949  LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSR 1008
            +  E+  F + L+  L      + G +H  +RS+        AK  +DGDL E++L L  
Sbjct: 1125 ISREEVDFFQSLEMQLAAQNPPLAGRDHLIYRSY-----YAPAKGTIDGDLCETYLLLPN 1179

Query: 1009 TRMDEISKTMNVSVEELCKRVEEL 1032
             +  +I+  ++ SV E+ +++ ++
Sbjct: 1180 DKKQQIAGELDRSVREIERKIADM 1203


>gi|19115326|ref|NP_594414.1| U2 snRNP-associated protein Sap130 (predicted) [Schizosaccharomyces
            pombe 972h-]
 gi|73919127|sp|Q9UTT2.1|RSE1_SCHPO RecName: Full=Pre-mRNA-splicing factor prp12; AltName:
            Full=Pre-mRNA-processing protein 12; AltName:
            Full=Spliceosome-associated protein 130
 gi|6451681|dbj|BAA86918.1| Prp12p/SAP130 [Schizosaccharomyces pombe]
 gi|7981404|emb|CAB92100.1| U2 snRNP-associated protein Sap130 (predicted) [Schizosaccharomyces
            pombe]
          Length = 1206

 Score =  201 bits (512), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 264/1176 (22%), Positives = 476/1176 (40%), Gaps = 177/1176 (15%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            +L+   +G I  +   R  G  +D+L + ++  +  +L+++ E ++L+            
Sbjct: 59   ILNQNCFGIIRNVAPLRLTGFKRDYLVVTSDSGRITILEYNVEKNKLVPIYQETFGKSGI 118

Query: 61   RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLY 118
            R    G+   ID   R  +I     + L  V+  D++  L    +  LE  +  +I F  
Sbjct: 119  RRVVPGEYLAIDAKGRAAMIASVEKNKLVYVLNRDSEANL--TISSPLEAHKANNICFHL 176

Query: 119  -----GCAKPTIVVLYQDNKDARHVKTYEVALKDK------------DFVEGPWSQNNLD 161
                 G A P    L  D  +  H  T E     +            + V   WS+  +D
Sbjct: 177  IGLDTGYANPIFAALEVDYSEIDHDSTREAFTSSEKVLSYYELDLGLNHVVKRWSKV-VD 235

Query: 162  NGADLLIPVP-----PPLCGVLIIGEETIVYCS-ANAFKAIPI-------RPSITKAYGR 208
              + +LIPVP     P   G L+I    I Y     AF  IPI         +I+  + +
Sbjct: 236  RNSYMLIPVPGGNDGP--SGTLVISNGWISYRHLQKAFHQIPILRRQAASANAISTPWNQ 293

Query: 209  VDADGSR------------------YLLGDHAGLLHLLVITHEKE-KVTGLKIELLGETS 249
            V+++ +                   YLL    G L  L I H+ +  V  L+++      
Sbjct: 294  VNSNSANDGPLIVSAVLHKMKGSFFYLLQTGDGDLLKLTIEHDGQGNVVELRLKYFDTVP 353

Query: 250  IASTISYLDNAVVYIGSSYGDSQLIKL---------------------------NLQPDA 282
            +A  ++ L    +++ + +G+ QL +                            N+    
Sbjct: 354  LAVQLNILKTGFLFVATEFGNHQLYQFENLGIDDDELEITSLDFQAQDNEVGTKNVHFGV 413

Query: 283  KG-SYVEVLERYVNLGPIVDFCVVDLERQGQG-QVVTCSGAYKDGSLRIVRNGIGINEQA 340
            +G   + ++E   +L  + D  ++     G+  Q+ T  G   + SLR +R G+   E  
Sbjct: 414  RGLQNLSLVEEIPSLYSLTDTLLMKAPSSGEANQLYTVCGRGSNSSLRQLRRGLETTEIV 473

Query: 341  SVELQGIK-GMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTL 399
            + EL G    +W+L+ +  D +D+++++SF + T +L++   + +EE    GF S   TL
Sbjct: 474  ASELPGAPIAIWTLKLNQTDVYDSYIILSFTNGTLVLSIG--ETVEEISDSGFLSSVSTL 531

Query: 400  FCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLV 459
                   + LVQ+    +R + +  +   +EWK P    V  +  N  Q+++A   G LV
Sbjct: 532  NARQMGRDSLVQIHPKGIRYIRANKQT--SEWKLPQDVYVVQSAINDMQIVVALSNGELV 589

Query: 460  YLEIGD----GILTEVKHAQ-LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLP 514
            Y E+ D    G L E +  + L   ++ L + P+ E    S    +    D +VR+ SL 
Sbjct: 590  YFEMSDDVEGGQLNEYQERKTLTANVTSLALGPVQEGSRRSNFMCLAC-DDATVRVLSL- 647

Query: 515  DLNLITKEHLGGE----------IIPRSVLLCAFEGIS--YLLCALGDGHLLNFLLNMKT 562
            DL   T E+L  +          IIP +V      G+S  YL   L +G  L  ++++ +
Sbjct: 648  DL-YTTLENLSVQALSSPANSLCIIPMNV-----NGVSTLYLHIGLMNGVYLRTVIDVTS 701

Query: 563  GELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMC 622
            G+L D +   LG + + +   + KN   V A S R  + YS  + L  S +    + H  
Sbjct: 702  GQLLDTRTRFLGPRAVKIYPITMKNQNTVLAVSSRTFLAYSYQQNLQLSPIAYSAIDHAS 761

Query: 623  PFNSAAFPDSLAIAKEGELTIGTIDDIQ-KLHIRSIPLGEHPRRICHQ---------EQS 672
             F S   P+ +   ++  L I T+D +Q  L     PL   PR+I            +  
Sbjct: 762  SFASEQCPEGIVAIQKNTLKIFTVDSLQDDLKSDIYPLICTPRKIVKHPNFPVLYILQSE 821

Query: 673  RTF--------------AICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCS 718
            R F              +    K      +S + F+ + D  + + I   PL   E   S
Sbjct: 822  RNFDSFKYAQENGDVGSSYTKEKQNEHTSKSWVSFISVFDMISKKIIHESPLGDNEAAFS 881

Query: 719  ILSCSFSDDSNVYYCVGTAYVLPEENEPTKG---RILVFIVEDGKLQLIAEKETKGAVYS 775
            + +  F +    +   G+A  +  E         R+  F  E  KL+LI+  E  G   +
Sbjct: 882  MTAAFFKNRDEFFLVAGSATNMDLECRTCSHGNFRVYRFHDEGKKLELISHTEIDGIPMA 941

Query: 776  LNAFNGKLLAAINQKIQLY----KWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVG 831
            L  F G++LA + + +++Y    K MLR      +         +   ++  +   IVV 
Sbjct: 942  LTPFQGRMLAGVGRFLRIYDLGNKKMLRKGELSAVP--------LFITHITVQASRIVVA 993

Query: 832  DLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR--KNSE 889
            D   S+  ++YK E+  +   A D    W +   ++D D   G +   N++ +R  ++  
Sbjct: 994  DSQYSVRFVVYKPEDNHLLTFADDTIHRWTTTNVLVDYDTLAGGDKFGNIWLLRCPEHVS 1053

Query: 890  GATDEERGRLEVVGEYHLGEFVNRFRHG-SLVMRLPDSDV-----------GQIPTVIFG 937
               DEE    +++   H   F+N   H   L+     +D+           G    +++ 
Sbjct: 1054 KLADEENSESKLI---HEKPFLNSTPHKLDLMAHFFTNDIPTSLQKVQLVEGAREVLLWT 1110

Query: 938  TVNGVIGVIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLD 996
             + G +GV    +  E   F ++L+  LRK    + G +H  +RS+    K V     +D
Sbjct: 1111 GLLGTVGVFTPFINQEDVRFFQQLEFLLRKECPPLAGRDHLAYRSYYAPVKCV-----ID 1165

Query: 997  GDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            GDL E +  L     + I+  ++ ++ E+ K++E+ 
Sbjct: 1166 GDLCEMYYSLPHPVQEMIANELDRTIAEVSKKIEDF 1201


>gi|193786710|dbj|BAG52033.1| unnamed protein product [Homo sapiens]
          Length = 897

 Score =  201 bits (512), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 217/918 (23%), Positives = 396/918 (43%), Gaps = 150/918 (16%)

Query: 237  VTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLN------------------- 277
            VT ++++      +A+ +  L    +++ S +G+  L ++                    
Sbjct: 2    VTEIRLKYFDTVPVAAAMCVLKTGFLFVASEFGNHYLYQIAHLGDDDEEPEFSSAMPLEE 61

Query: 278  -----LQPDAKGSYVEVLERYVNLGPIVDFC-VVDLERQGQGQVVTCSGAYKDGSLRIVR 331
                  QP    + V +++   +L PI+ FC + DL  +   Q+    G     SLR++R
Sbjct: 62   GDTFFFQPRPLKNLV-LVDELDSLSPIL-FCQMADLANEDTPQLYVACGRGPRSSLRVLR 119

Query: 332  NGIGINEQASVELQG-IKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIE 390
            +G+ ++E A  EL G    +W++R   +D FD +++VSF++ T +L++   + +EE    
Sbjct: 120  HGLEVSEMAVSELPGNPNAVWTVRRHIEDEFDAYIIVSFVNATLVLSIG--ETVEEVTDS 177

Query: 391  GFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVL 450
            GF   T TL C     + LVQV    +R + +  R   NEWK+P   ++     N  QV+
Sbjct: 178  GFLGTTPTLSCSLLGDDALVQVYPDGIRHIRADKR--VNEWKTPGKKTIVKCAVNQRQVV 235

Query: 451  LATGGGHLVYLEIG-DGILTE-VKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISV 508
            +A  GG LVY E+   G L E  +  ++  ++ C+ +  +      S+  AVG+  D +V
Sbjct: 236  IALTGGELVYFEMDPSGQLNEYTERKEMSADVVCMSLANVPPGEQRSRFLAVGL-VDNTV 294

Query: 509  RIFSLPDLNLITKEHLGGEIIP-RSVLLCAFE----------------GISYLLCALGDG 551
            RI SL   + +  + L  + +P +   LC  E                G  YL   L +G
Sbjct: 295  RIISLDPSDCL--QPLSMQALPAQPESLCIVEMGGTEKQDELGERGSIGFLYLNIGLQNG 352

Query: 552  HLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYS 611
             LL  +L+  TG+L+D +   LG++P+ L     +    V A S R  + YS   +   +
Sbjct: 353  VLLRTVLDPVTGDLSDTRTRYLGSRPVKLFRVRMQGQEAVLAMSSRSWLSYSYQSRFHLT 412

Query: 612  NVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQ 669
             ++ + +     F S   P+ +       L I  ++ +  +  + + PL   PR+ + H 
Sbjct: 413  PLSYETLEFASGFASEQCPEGIVAISTNTLRILALEKLGAVFNQVAFPLQYTPRKFVIHP 472

Query: 670  EQSR-----------TFAICSLKNQSCAEE-------------SEM-------------- 691
            E +            T A  + + Q  AEE             +EM              
Sbjct: 473  ESNNLIIIETDHNAYTEATKAQRKQQMAEEMVEAAGEDERELAAEMAAAFLNENLPESIF 532

Query: 692  -----------HFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC-VGTAYV 739
                         +R+++      +    L+  E   S+  C FS+    +Y  VG A  
Sbjct: 533  GAPKAGNGQWASVIRVMNPIQGNTLDLVQLEQNEAAFSVAVCRFSNTGEDWYVLVGVAKD 592

Query: 740  LPEENEPTKGRILVF--IVEDG-KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKW 796
            L        G  +    +V +G KL+ + +   +    ++  F G++L  + + +++Y  
Sbjct: 593  LILNPRSVAGGFVYTYKLVNNGEKLEFLHKTPVEEVPAAIAPFQGRVLIGVGKLLRVY-- 650

Query: 797  MLRDDGTRELQSECGHHGHILALY---VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
               D G ++L  +C  + HI A Y   +QT G  ++V D+ +S   + YK  E  +   A
Sbjct: 651  ---DLGKKKLLRKC-ENKHI-ANYISGIQTIGHRVIVSDVQESFIWVRYKRNENQLIIFA 705

Query: 854  RDYNANWMSAVEILDDDIYLGAENNFNLFTVR--KNSEGATDE---------ERGRL--- 899
             D    W++   +LD D   GA+   N+  VR   N+    DE         +RG L   
Sbjct: 706  DDTYPRWVTTASLLDYDTVAGADKFGNICVVRLPPNTNDEVDEDPTGNKALWDRGLLNGA 765

Query: 900  ----EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQY 954
                EV+  YH+GE V   +  +L+        G   ++++ T++G IG++     HE +
Sbjct: 766  SQKAEVIMNYHVGETVLSLQKTTLI-------PGGSESLVYTTLSGGIGILVPFTSHEDH 818

Query: 955  LFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEI 1014
             F + ++ +LR     + G +H  +RS+         KN +DGDL E F  +   +   +
Sbjct: 819  DFFQHVEMHLRSEHPPLCGRDHLSFRSY-----YFPVKNVIDGDLCEQFNSMEPNKQKNV 873

Query: 1015 SKTMNVSVEELCKRVEEL 1032
            S+ ++ +  E+ K++E++
Sbjct: 874  SEELDRTPPEVSKKLEDI 891


>gi|238487250|ref|XP_002374863.1| nuclear mRNA splicing factor, putative [Aspergillus flavus NRRL3357]
 gi|220699742|gb|EED56081.1| nuclear mRNA splicing factor, putative [Aspergillus flavus NRRL3357]
          Length = 1210

 Score =  201 bits (512), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 280/1182 (23%), Positives = 493/1182 (41%), Gaps = 188/1182 (15%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
            ++G I +L  FR  G  +D++ I ++  +  ++++   S     R   +   + G R   
Sbjct: 64   VFGIIRSLAAFRLAGSNKDYIIIGSDSGRITIIEY-VPSQNRFNRIHLETFGKSGVRRVI 122

Query: 65   NGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YG 119
             GQ   +DP  R  LI     + L  V+  +++ +L  +  +   + Q L         G
Sbjct: 123  PGQYLAVDPKGRACLIASVEKNKLVYVLNRNSQAELTISSPLEAHKPQTLVFAMSALDVG 182

Query: 120  CAKPTIVVLYQDNKDARHVKT---YEVALK---------DKDFVEGPWSQNNLDNGADLL 167
               P    L  D  ++    T   YE A K           + V   W+ + +D  + +L
Sbjct: 183  YENPIFAALEVDYSESDQDPTGQAYEEAEKLLVYYELDLGLNHVVRKWA-DPVDRTSSML 241

Query: 168  IPVP-----PPLCGVLIIGEETIVYCSAN--AFKA-IPIRPSITKAYGRVDADGSRYLLG 219
              VP     P   GVL+  E+++ Y  +N  AF+  IP R   T+     + +  R++  
Sbjct: 242  FQVPGGADGP--SGVLVCAEDSVTYRHSNQDAFRVPIPRRSGPTE-----NPERKRFIT- 293

Query: 220  DHAGLLHLL---------------------VITHEKEKVTG----LKIELLGETSIASTI 254
              AG++H +                     ++  +  ++TG    LKI+      +AS++
Sbjct: 294  --AGVMHKMRGAFFFLLQTEDGDLFKLNIDMVEDDNGQLTGEVKRLKIKYFDTVPVASSL 351

Query: 255  SYLDNAVVYIGSSYGDSQLIKL---------------NLQPDAK------------GSYV 287
              L +  +Y+ S  G+    +                N   D                 +
Sbjct: 352  LILKSGFLYVASEAGNHHFYQFEKLGDDDEEIEFSSENFSADPSVPLEPIYFRPRSAENL 411

Query: 288  EVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI 347
             ++E   +L P++D  V +L  +   Q+ T SG     + R +++G+ ++E    EL  +
Sbjct: 412  NLVETINSLNPLIDSKVANLSEEDAPQIYTISGTGARSTFRTLKHGLEVSEIVDSELPSV 471

Query: 348  -KGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIY 406
               +W+ + +  D FD ++++SF + T +L++   + +EE    GF S   TL       
Sbjct: 472  PSAVWTTKLTRADEFDAYIILSFANGTLVLSIG--ETVEEVTDTGFLSSAPTLAVQQLGE 529

Query: 407  NQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GD 465
            + L+Q+    +R + +  R   NEW +P   S+  A  N  QV +A   G +VY E+  D
Sbjct: 530  DSLIQIHPRGIRHIMADRR--VNEWPAPQHRSIVAAATNERQVAVALSSGEIVYFEMDAD 587

Query: 466  GILTEV-KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEH 523
            G L E  +  Q+   ++CL +  + E    S   AVG   D +VRI SL PD  L  K  
Sbjct: 588  GTLAEYDERRQMSGTVTCLSLGEVPEGRVRSSFLAVGC-DDSTVRILSLDPDSTLENKSV 646

Query: 524  LGGEIIPRSVLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQP 577
                  P ++ + +    S      YL   L  G  L  +L+  TGEL+D +   LG++P
Sbjct: 647  QALTAAPSALNIMSMSDSSSGGTTLYLHIGLYSGVYLRTVLDEVTGELSDTRTRFLGSKP 706

Query: 578  ITLRTFSSKNTTHVFAASDRPTVIYS--SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAI 635
            + L   S K  T V A S RP + YS    K  + + ++   +     F+S    + +  
Sbjct: 707  VKLFQVSVKGQTAVLALSSRPWLGYSDIQTKGFMLTPLDYVPLEWGWNFSSEQCLEGMVG 766

Query: 636  AKEGELTIGTIDDI-QKLHIRSIPLGEHPRRIC-HQEQSRTFAICS----LKNQSCAEES 689
             +   L I +I+ +   +  +SI L   PRR   H EQ   + I S    L   + A+  
Sbjct: 767  IQGQNLRIFSIEKLDNNMLQQSISLAYTPRRFLKHPEQPLFYVIESDNNVLSPSTRAKLL 826

Query: 690  EMHFVRLLDDQTF---EF------------------------ISTYPLDTFEYGCSILSC 722
            E    R  D+      EF                        +ST  L+  E   S+ + 
Sbjct: 827  EDSKARTGDETVLPPEEFGYPRGSGHWASCIQVVDPVHAKAVVSTIELEENEAAVSVAAV 886

Query: 723  SF-SDDSNVYYCVGTAYVLPEENEPTK--GRILVF-IVEDGK-LQLIAEKETKGAVYSLN 777
             F S D   +  VGTA  +   N P+   G I ++   EDG+ L+ I + + +    +L 
Sbjct: 887  PFTSQDDETFLVVGTAKDM-NVNPPSSAGGYIHIYRFQEDGRELEFIHKTKVEEPPLALL 945

Query: 778  AFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHH---GHILALYVQTRGDFIVVGDLM 834
             F G+L+A I   +++Y     D G ++L  +C        I+ L  QT+G  IVV D+ 
Sbjct: 946  GFQGRLVAGIGPMLRIY-----DLGMKQLLRKCNAQVVPKTIVGL--QTQGSRIVVSDVR 998

Query: 835  KSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGA 891
            +S++ ++YK++E  +     D  + W ++  ++D +   G +   N++ +R   K SE A
Sbjct: 999  ESVTYVVYKYQENVLIPFVDDSVSRWTTSTTMVDYETTAGGDKFGNIWMLRCPKKISEQA 1058

Query: 892  TDE--------ERG-------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIF 936
             ++        ERG       RLE++   +  +         LV        G    +++
Sbjct: 1059 DEDGSGAHLIHERGYLHGTPNRLELMIHVYTQDIPTTLHKTQLV-------AGGRDILVW 1111

Query: 937  GTVNGVIGVIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFL 995
               +G IG++   +  E   F + L+  L      + G +H  +RS+    K V     +
Sbjct: 1112 SGFHGTIGMLVPFVSREDVDFFQNLEMQLAAQNPPLAGRDHLIYRSYYAPVKGV-----I 1166

Query: 996  DGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEE-LTRLH 1036
            DGDL E++  L       I+  ++ SV E+ +++    TR H
Sbjct: 1167 DGDLCETYFLLPNDTKMMIAAELDRSVREIERKISVCFTRYH 1208


>gi|212531303|ref|XP_002145808.1| nuclear mRNA splicing factor, putative [Talaromyces marneffei ATCC
            18224]
 gi|210071172|gb|EEA25261.1| nuclear mRNA splicing factor, putative [Talaromyces marneffei ATCC
            18224]
          Length = 1209

 Score =  201 bits (511), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 270/1167 (23%), Positives = 491/1167 (42%), Gaps = 167/1167 (14%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
            ++G I +L  FR  G ++D++ I ++  +  ++++   S     R   +   + G R   
Sbjct: 64   VFGIIRSLAAFRLAGSSKDYIIIGSDSGRITIIEY-VPSQNRFNRIHLETFGKSGVRRVI 122

Query: 65   NGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YG 119
             GQ   +DP  R  LI     + L  V+  +++ +L  +  +   + Q +         G
Sbjct: 123  PGQYLAVDPKGRACLIASVEKNKLVYVLNRNSQAELTISSPLEAHKAQTVVFALTALDVG 182

Query: 120  CAKP---TIVVLYQDNKDARHVKTYEVALKDKDFVE---------GPWSQNNLDNGADLL 167
               P    I V Y ++      + YE   K   + E           WS + +   +++L
Sbjct: 183  YDNPIFAAIEVDYSESDQDPTGRAYEEVEKQLVYYELDLGLNHVVRRWS-DVIHRTSNIL 241

Query: 168  IPVPPPL---CGVLIIGEETIVYCSAN--AFKA-IPIR--PSITKAYGRVDADGSR---- 215
              VP       GVL+  E+++ Y  +N  AF+  IP R  P+      R    G      
Sbjct: 242  FQVPGGADGPSGVLVCAEDSVTYRHSNQDAFRVPIPRRNGPTENPERKRSIVAGVMHKMR 301

Query: 216  ----YLLGDHAGLLHLLVI---THEKEKVTG----LKIELLGETSIASTISYLDNAVVYI 264
                +LL    G L  L I     E  ++TG    LKI+      IA+ +  L +  +++
Sbjct: 302  GAFFFLLQTEDGDLFKLTIDMVEDENGQLTGEVKRLKIKYFDTVPIAANLLILKSGFLFV 361

Query: 265  GSSYGDSQLIKLN-----------------------LQP---DAKGS-YVEVLERYVNLG 297
             S  G+    +                         + P   + +G+  + ++E   +L 
Sbjct: 362  ASETGNHHFYQFEKLGDDDEETEFSSDDVSADPADPINPVYFNLRGAENLNLVESINSLN 421

Query: 298  PIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSLRSS 356
            PI+D  + D   +   Q+ T  G     + R +++G+ ++E    EL  +   +W+ + +
Sbjct: 422  PIMDCKITDAIGEDAPQIYTICGTGARSTFRTLKHGLEVSEIVESELPSVPSAVWTTKLT 481

Query: 357  TDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGS 416
             +D FD ++V+SF + T +L++   + +EE    GF S   TL       + L+QV    
Sbjct: 482  RNDEFDAYIVLSFTNGTLVLSIG--ETVEEVTDTGFLSTAPTLAVQQLGEDSLIQVHPKG 539

Query: 417  VRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTEV-KHA 474
            +R + +  R   NEW +P   ++  A  N  QV +A   G +VY E+  DG L E  +  
Sbjct: 540  IRHILADHR--VNEWPAPQHRTIVAAATNERQVAVALSSGEIVYFEMDTDGSLAEYDEKR 597

Query: 475  QLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPRSV 533
            Q+   ++ L +  + E    S   AVG   D +VRI SL PD  L  K        P ++
Sbjct: 598  QMSGTVTSLSLGEVPEGRVRSSFLAVGC-DDSTVRILSLDPDSTLENKSVQALTSAPSAL 656

Query: 534  -LLCAFEGIS-----YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKN 587
             ++   +  S     YL   L  G  L  +L+  TGEL+D +   LG +P+ L   S K 
Sbjct: 657  NIMSMIDSTSGGSTLYLHIGLYSGVYLRTVLDEVTGELSDTRTRFLGAKPVKLFGVSVKG 716

Query: 588  TTHVFAASDRPTVIYS--SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGT 645
               V A S RP + YS    K  + ++++   +     F+S    + +   +   L I +
Sbjct: 717  QPAVLALSSRPWLGYSDVQTKSFMLTSLDYVGLEWGWNFSSEQCLEGMVGIQGQNLRIFS 776

Query: 646  IDDIQKLHIR-SIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDD----- 699
            I+ +    ++ SIPL   PR      +   F +   +N   A  ++    RLL++     
Sbjct: 777  IEKLDNNVLQESIPLAYTPRHFVKHPEQPLFYVIESENNVLAPATQ---TRLLEESKLQN 833

Query: 700  --------QTFEF----------------------ISTYPLDTFEYGCSILSCSF-SDDS 728
                    +TF +                      +S   L+  E   SI + SF S D+
Sbjct: 834  GEAVIPPAETFGYPRATGHWASCIEVVDPVNAKSVLSRLELEENESAVSIAAVSFASQDN 893

Query: 729  NVYYCVGTAY-VLPEENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVYSLNAFNGKLLA 785
              +  VGT   V+      + G I ++   EDG+ L+ I + + +    +L AF G+L+A
Sbjct: 894  ETFLVVGTGKDVVTYPRSFSAGFIHIYRFQEDGRELEFIHKTKIEEPPLALLAFQGRLVA 953

Query: 786  AINQKIQLYKWMLRDDGTRELQSECGHHG-HILALYVQTRGDFIVVGDLMKSISLLIYKH 844
             I + +++Y     D G +++  +C       L + +QT+G  I+V D+ +S++ ++YK+
Sbjct: 954  GIGKNLRIY-----DLGMKQMLRKCQVEAVPNLIVGLQTQGSRIIVSDVQESVTYVVYKY 1008

Query: 845  EEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEER-------- 896
            +E  +     D  A W +A  ++D +   G +   NL+ VR   + + D +         
Sbjct: 1009 QENQLIPFVDDVIARWTTATTMVDYETTAGGDKFGNLWLVRCPQKASDDSDEDGSGAHLI 1068

Query: 897  ----------GRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVI 946
                       RL ++  Y+  +        +LV       VG    +++  + G IG++
Sbjct: 1069 HERSYLQGTANRLNLMIHYYTQDIPTSLHKTNLV-------VGGRDILVWTGLQGTIGIM 1121

Query: 947  AS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLD 1005
               +  E   F + L+T L      + G +H  +RS+      V +K  +DGDL ES+  
Sbjct: 1122 VPFISREDVDFFQNLETQLASQNPPLAGRDHLIYRSY-----YVPSKGVIDGDLCESYFL 1176

Query: 1006 LSRTRMDEISKTMNVSVEELCKRVEEL 1032
            L   +   I+  ++ SV E+ +++ ++
Sbjct: 1177 LPNDKKLMIAGELDRSVREIERKISDM 1203


>gi|24654874|ref|NP_728546.1| CG13900, isoform A [Drosophila melanogaster]
 gi|23092721|gb|AAF47416.2| CG13900, isoform A [Drosophila melanogaster]
 gi|60678131|gb|AAX33572.1| LD01809p [Drosophila melanogaster]
 gi|220950356|gb|ACL87721.1| CG13900-PA [synthetic construct]
 gi|289803030|gb|ADD20765.1| FI04459p [Drosophila melanogaster]
          Length = 1227

 Score =  201 bits (511), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 260/1199 (21%), Positives = 470/1199 (39%), Gaps = 197/1199 (16%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            +L   I+G +  L  FR  G  +D++ + ++  +  +L+++   + L             
Sbjct: 53   LLSTEIFGCVRALMAFRLTGGTKDYIVVGSDSGRIVILEYNPSKNALEKVHQETFGKSGC 112

Query: 61   RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL- 117
            R    GQ   IDP  R  +IG      L  ++  D + +L  +  +   +   L    + 
Sbjct: 113  RRIVPGQYFAIDPKGRAVMIGAVEKQKLAYIMNRDTQARLTISSPLEAHKSNTLTYHMVG 172

Query: 118  --YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNGA 164
               G   P    L  D ++A    + + A + +    F E     N+        L+  A
Sbjct: 173  VDVGFDNPMFACLEIDYEEADMDPSGDAAQRTQQTLTFYELDLGLNHVVRKYSEPLEEHA 232

Query: 165  DLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR---- 215
            + L+ VP     P   GVLI  E  + Y   N      IR  I +    +D D  R    
Sbjct: 233  NFLVSVPGGNDGP--SGVLICSENYLTY--KNLGDQHDIRCPIPRRRNDLD-DPERGMIF 287

Query: 216  -------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVV 262
                         +LL    G +  + +  + + V+ +K++       A+ +  L    +
Sbjct: 288  ICSATHRTKSMYFFLLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPATAMCVLKTGFL 347

Query: 263  YIGSSYGDSQLIKLN------------------------LQPDAKGSYVEVLERYVNLGP 298
            ++ S +G+  L ++                           P A  + V +++   +  P
Sbjct: 348  FVASEFGNHYLYQIAHLGDDDDEPEFSSAMPLEEGETFFFAPRALKNLV-LVDELPSFAP 406

Query: 299  IVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSST 357
            I+   V DL  +   Q+    G     +LR++R+G+ ++E A  EL G    +W+++   
Sbjct: 407  IITSQVADLANEDTPQLYVLCGRGPRSTLRVLRHGLEVSEMAVSELPGNPNAVWTVKKRA 466

Query: 358  DDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSV 417
            DD FD +++VSF++ T +L++   + +EE    GF   T TL C     + LVQV    +
Sbjct: 467  DDEFDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLCCAALGDDALVQVYPDGI 524

Query: 418  RLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD-GILTE-VKHAQ 475
            R + S  R   NEWK+P   S+     N  QV++   G  LVY E+   G L E  + ++
Sbjct: 525  RHIRSDKR--VNEWKAPGKKSITKCAVNQRQVVITLSGRELVYFEMDPTGELNEYTERSE 582

Query: 476  LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLL 535
            +  EI C+ +  + E    S   AVG+  D +VRI SL   N +T   +     P    L
Sbjct: 583  MPAEIMCMALGTVPEGEQRSWFLAVGL-ADNTVRILSLDPNNCLTPCSMQALPSPAES-L 640

Query: 536  CAFE-------------------------GISYLLCALGDGHLLNFLLNMKTGELTDRKK 570
            C  E                         G  YL   L +G LL  +L+  +G+L D + 
Sbjct: 641  CLVEMGHTESTTQGGLDDDAPAQRSGNNKGTIYLNIGLSNGVLLRTVLDPVSGDLADTRT 700

Query: 571  VSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFP 630
              LG++P+ L     + +  V A S R  + Y    +   + ++ + + +   F+S    
Sbjct: 701  RYLGSRPVKLFRIKMQGSEAVLAMSSRTWLSYYHQNRFHLTPLSYETLEYASGFSSEQCS 760

Query: 631  DSLAIAKEGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQEQSR-----------TFAI 677
            + +       L I  ++ +  +  + + PL   PR  + H +  R           T   
Sbjct: 761  EGIVAISTNTLRILALEKLGAVFNQVAFPLQYTPRTFVIHPDTGRMLIAETDHNAYTEDT 820

Query: 678  CSLKNQSCAEE-------------SEM-------------------------HFVRLLDD 699
             S + +  AEE              EM                           +R LD 
Sbjct: 821  KSARKEQMAEEMRSAAGDEERELAREMANAFINEVLPEDVFSSPKAGLGLWASQIRCLDA 880

Query: 700  QTFEFISTYPLDTFE--YGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVE 757
               + + + PL   E     ++L  S + D   Y  VG A  L      ++G  +     
Sbjct: 881  MHGQTMFSVPLTQNEAIMSMAMLKFSIAADGRYYLAVGIAKDLQLNPRISQGGCIDIYKI 940

Query: 758  D---GKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSEC-GHH 813
            D     L+ +   +      +L  F G+LLA   + +++Y     D G +++  +C   H
Sbjct: 941  DPTCSSLEFMHRTDIDEIPGALCGFQGRLLAGCGRMLRIY-----DFGKKKMLRKCENKH 995

Query: 814  GHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYL 873
                 + +Q  G  + V D+ +S+  + Y+  E  +   A D +  W++A  +LD D  +
Sbjct: 996  IPYQIVNIQAMGHRVYVSDVQESVFFIRYRRAENQLIIFADDTHPRWVTATTLLDYDT-I 1054

Query: 874  GAENNFNLFTVRKNSEGATDE------------ERGRL-------EVVGEYHLGEFVNRF 914
               + F   ++++     TD+            +RG L       E +  +H+GE +   
Sbjct: 1055 AIADKFGNLSIQRLPHSVTDDVDEDPTGTKSLWDRGLLSGASQKSENICSFHVGEIIMSL 1114

Query: 915  RHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGG 973
            +  +L+        G    +I+ T++G +G        E Y F + L+ ++R     + G
Sbjct: 1115 QKATLI-------PGGSEALIYATLSGTVGAFVPFTSREDYDFFQHLEMHMRNENPPLCG 1167

Query: 974  LNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
             +H  +RS          KN LDGDL E +L +   +   I+  M  +  ++CK++E++
Sbjct: 1168 RDHLSYRS-----SYYPVKNVLDGDLCEQYLSIEAAKQKSIAGDMFRTPNQICKKLEDI 1221


>gi|17508021|ref|NP_491953.1| Protein TEG-4 [Caenorhabditis elegans]
 gi|351060889|emb|CCD68627.1| Protein TEG-4 [Caenorhabditis elegans]
          Length = 1220

 Score =  201 bits (511), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 259/1185 (21%), Positives = 491/1185 (41%), Gaps = 187/1185 (15%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESS---ELITRAMGDVSDRIGRP 62
            I+G + +L  FR     +DF+ + ++  +  +LQ++AE +    L     G    R   P
Sbjct: 59   IFGIVRSLLAFRLTAGTRDFIAVGSDSGRIVILQYNAEKTCFERLHQETFGKTGCRRIVP 118

Query: 63   TDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL--- 117
               G   + DP  R  +IG      L  ++  D++  L  +  +   +   L    +   
Sbjct: 119  ---GHFLVGDPRGRALMIGAVERQKLVYIMNRDSEAHLTISSPLEAHKHHTLCYAMVGID 175

Query: 118  YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNGADL 166
             G   PT   L  D +DA +  T E A + +    F E     N+        L++  +L
Sbjct: 176  VGFENPTFACLEFDYEDADNDPTGEAAKRTQQTLTFYELDLGLNHVVRKYAEPLNDPGNL 235

Query: 167  LIPVP---PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR-------- 215
            LI VP       GV++  E  +VY   N      IR  I +    +D D  R        
Sbjct: 236  LIAVPGGNDGPSGVIVCCENYLVY--KNLGDQPDIRCPIPRRRNELD-DADRTMLIIATA 292

Query: 216  ---------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGS 266
                     +L+    G +  + +  +++ V+ +K++       A+ +  L +  +++ +
Sbjct: 293  THKTKNMYFFLVQAENGDIFKVTLETDEDLVSEMKLKYFDTVPPANALCILKSGFLFVAA 352

Query: 267  SYGDSQLIKLN-----------------------LQPDAKGSYVEVLERYVNLGPIVDFC 303
             +G+ +L ++                         +P    S + + +   +L P+ D  
Sbjct: 353  EFGNHELYQIASLGEGDDDEFSSAMGFGENDAAFFEPHELKSLIPI-DSMDSLSPLTDAV 411

Query: 304  VVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTDDPFD 362
            + D+ R+   Q+ +  G     SL+++RNG+ I+E A  +L G    +W+++ + +D +D
Sbjct: 412  IGDIAREDAAQIYSLVGRGARSSLKVLRNGLEISEMAVSDLPGNPNAVWTVKKNIEDQYD 471

Query: 363  TFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSS 422
            +++VVSF++ T  LA+ + D +EE    GF   T T+ C     + LVQ+ S  +R + +
Sbjct: 472  SYIVVSFVNAT--LALTIGDTVEEASDSGFLPTTPTIGCAMIGDDSLVQIYSEGIRHIRA 529

Query: 423  TSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTEVKHAQL-EYEI 480
              R   NEWK+PP   +     N  QV +A  GG LVY E+  +G L E    +L   +I
Sbjct: 530  DKR--INEWKAPPRRQIVKCAVNRRQVAVALTGGELVYFELDLNGTLNEFTERKLFNADI 587

Query: 481  SCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPRSVLLCAF- 538
            +C+  + I E    S+  A+G   D +VRI SL P+  L+          P S+LL    
Sbjct: 588  ACMTFSEISEGELNSRFLALGT-VDNAVRIISLDPNDMLMPLSTQSLPCPPESILLIDTP 646

Query: 539  ----EGIS--YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
                +G++  +L   L +G L    ++  TG + D +   LGT+P+ L     +  + + 
Sbjct: 647  NEDGKGVAAVHLNIGLQNGCLFRNTVDNVTGAIMDTRTRYLGTRPVKLFKVQCQGRSAIL 706

Query: 593  AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTI------GTI 646
              S R  ++Y   ++   + ++   + +   F S    + +       L I      G  
Sbjct: 707  CTSSRSWLLYHFQRRFHLTPLSYANLEYAASFCSNQCSEGIVAISASTLRIIAAEKLGVA 766

Query: 647  DDIQKLHIRSIP--LGEHPRRIC-----HQEQSRTFAICSLKNQSCAEESEMHFVRLLDD 699
             ++Q    +  P  +  HP   C         S T    ++K    A + E       + 
Sbjct: 767  FNVQSFEHKMTPRRVAVHPSMPCLIVIETDHASYTEVTKNIKRNQMAADVEAMASDETEA 826

Query: 700  QTFEFIST----YPLDTFEYGC------------SILSCSFSDDSNVY--------YCVG 735
            Q  + I+T      LD   YG             S++S +  D  + +         CV 
Sbjct: 827  QLAQEIATNLRERRLDERVYGAPRAARGKWASAISLISATSGDKLSYFELPQDENAKCVA 886

Query: 736  TAYVLPEENE--------------------------PTKGRILVF-IVEDG-KLQLIAEK 767
                    NE                          PT+G +  F +  +G +   +   
Sbjct: 887  LVQFSKHPNEAMVLVGCGVNEVLNVHDIDPNDTSIRPTRGCVYTFHLSANGDRFDFLHRT 946

Query: 768  ETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY-VQTRGD 826
            ET   V +++ F G  L    + +++Y     D G ++L ++C +    +++  +Q+ G 
Sbjct: 947  ETPLPVGAIHDFRGMALVGFGRFLRMY-----DIGQKKLLAKCENKNFPVSIVNIQSTGQ 1001

Query: 827  FIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR- 885
             I+V D  +S+  L Y+  +  +   A D    +++ V +LD      A+   NL  VR 
Sbjct: 1002 RIIVSDSQESVHFLRYRKGDNQLVVFADDTTPRYVTCVCVLDYHTVAVADKFGNLAVVRL 1061

Query: 886  --KNSEGATDE--------ERG-------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV 928
              + +E   D+        +RG       ++E+V  + +G+ +   +  SL   +P    
Sbjct: 1062 PERVNEDVQDDPTVSKSVWDRGWLNGASQKVELVSNFFIGDTITSLQKTSL---MP---- 1114

Query: 929  GQIPTVIFGTVNGVIGVIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKK 987
            G    +++ T+ G IG + S +  ++  F   L+ ++R     + G +H  +RS+    K
Sbjct: 1115 GANEALVYTTIGGAIGCLVSFMSKDEVDFFTNLEMHVRSEYPPLCGRDHLAYRSYYAPCK 1174

Query: 988  TVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            +V     +DGD+ E F  +   +  ++++ +  +V E+ K++E++
Sbjct: 1175 SV-----IDGDICEQFSLMDTQKQKDVAEELGKTVSEISKKLEDI 1214


>gi|83767504|dbj|BAE57643.1| unnamed protein product [Aspergillus oryzae RIB40]
          Length = 1270

 Score =  201 bits (511), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 279/1181 (23%), Positives = 494/1181 (41%), Gaps = 188/1181 (15%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
            ++G I +L  FR  G  +D++ I ++  +  ++++   S     R   +   + G R   
Sbjct: 64   VFGIIRSLAAFRLAGSNKDYIIIGSDSGRITIIEY-VPSQNRFNRIHLETFGKSGVRRVI 122

Query: 65   NGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YG 119
             GQ   +DP  R  LI     + L  V+  +++ +L  +  +   + Q L         G
Sbjct: 123  PGQYLAVDPKGRACLIASVEKNKLVYVLNRNSQAELTISSPLEAHKPQTLVFAMSALDVG 182

Query: 120  CAKPTIVVLYQDNKDARHVKT---YEVALK---------DKDFVEGPWSQNNLDNGADLL 167
               P    L  D  ++    T   YE A K           + V   W+ + +D  + +L
Sbjct: 183  YENPIFAALEVDYSESDQDPTGQAYEEAEKLLVYYELDLGLNHVVRKWA-DPVDRTSSML 241

Query: 168  IPVP-----PPLCGVLIIGEETIVYCSAN--AFKA-IPIRPSITKAYGRVDADGSRYLLG 219
              VP     P   GVL+  E+++ Y  +N  AF+  IP R   T+     + +  R++  
Sbjct: 242  FQVPGGADGP--SGVLVCAEDSVTYRHSNQDAFRVPIPRRSGPTE-----NPERKRFIT- 293

Query: 220  DHAGLLHLL---------------------VITHEKEKVTG----LKIELLGETSIASTI 254
              AG++H +                     ++  +  ++TG    LKI+      +AS++
Sbjct: 294  --AGVMHKMRGAFFFLLQTEDGDLFKLNIDMVEDDNGQLTGEVKRLKIKYFDTVPVASSL 351

Query: 255  SYLDNAVVYIGSSYGDSQLIKL---------------NLQPDAK------------GSYV 287
              L +  +Y+ S  G+    +                N   D                 +
Sbjct: 352  LILKSGFLYVASEAGNHHFYQFEKLGDDDEEIEFSSENFSADPSVPLEPIYFRPRSAENL 411

Query: 288  EVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI 347
             ++E   +L P++D  V +L  +   Q+ T SG     + R +++G+ ++E    EL  +
Sbjct: 412  NLVETINSLNPLIDSKVANLSEEDAPQIYTISGTGARSTFRTLKHGLEVSEIVDSELPSV 471

Query: 348  -KGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIY 406
               +W+ + +  D FD ++++SF + T +L++   + +EE    GF S   TL       
Sbjct: 472  PSAVWTTKLTRADEFDAYIILSFANGTLVLSIG--ETVEEVTDTGFLSSAPTLAVQQLGE 529

Query: 407  NQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GD 465
            + L+Q+    +R + +  R   NEW +P   S+  A  N  QV +A   G +VY E+  D
Sbjct: 530  DSLIQIHPRGIRHIMADRRV--NEWPAPQHRSIVAAATNERQVAVALSSGEIVYFEMDAD 587

Query: 466  GILTEV-KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEH 523
            G L E  +  Q+   ++CL +  + E    S   AVG   D +VRI SL PD  L  K  
Sbjct: 588  GTLAEYDERRQMSGTVTCLSLGEVPEGRVRSSFLAVGC-DDSTVRILSLDPDSTLENKSV 646

Query: 524  LGGEIIPRSVLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQP 577
                  P ++ + +    S      YL   L  G  L  +L+  TGEL+D +   LG++P
Sbjct: 647  QALTAAPSALNIMSMSDSSSGGTALYLHIGLYSGVYLRTVLDEVTGELSDTRTRFLGSKP 706

Query: 578  ITLRTFSSKNTTHVFAASDRPTVIYS--SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAI 635
            + L   S K  T V A S RP + YS    K  + + ++   +     F+S    + +  
Sbjct: 707  VKLFQVSVKGQTAVLALSSRPWLGYSDIQTKGFMLTPLDYVPLEWGWNFSSEQCLEGMVG 766

Query: 636  AKEGELTIGTIDDI-QKLHIRSIPLGEHPRRIC-HQEQSRTFAICS----LKNQSCAEES 689
             +   L I +I+ +   +  +SI L   PRR   H EQ   + I S    L   + A+  
Sbjct: 767  IQGQNLRIFSIEKLDNNMLQQSISLAYTPRRFLKHPEQPLFYVIESDNNVLSPSTRAKLL 826

Query: 690  EMHFVRLLDDQTF---EF------------------------ISTYPLDTFEYGCSILSC 722
            E    R  D+      EF                        +ST  L+  E   S+ + 
Sbjct: 827  EDSKARTGDETVLPPEEFGYPRGSGHWASCIQVVDPVHAKAVVSTIELEENEAAVSVAAV 886

Query: 723  SF-SDDSNVYYCVGTAYVLPEENEPTK--GRILVF-IVEDGK-LQLIAEKETKGAVYSLN 777
             F S D   +  VGTA  +   N P+   G I ++   EDG+ L+ I + + +    +L 
Sbjct: 887  PFTSQDDETFLVVGTAKDM-NVNPPSSAGGYIHIYRFQEDGRELEFIHKTKVEEPPLALL 945

Query: 778  AFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHH---GHILALYVQTRGDFIVVGDLM 834
             F G+L+A I   +++Y     D G ++L  +C        I+ L  QT+G  IVV D+ 
Sbjct: 946  GFQGRLVAGIGPMLRIY-----DLGMKQLLRKCNAQVVPKTIVGL--QTQGSRIVVSDVR 998

Query: 835  KSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGA 891
            +S++ ++YK++E  +     D  + W ++  ++D +   G +   N++ +R   K SE A
Sbjct: 999  ESVTYVVYKYQENVLIPFVDDSVSRWTTSTTMVDYETTAGGDKFGNIWMLRCPKKISEQA 1058

Query: 892  TDE--------ERG-------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIF 936
             ++        ERG       RLE++   +  +         LV        G    +++
Sbjct: 1059 DEDGSGAHLIHERGYLHGTPNRLELMIHVYTQDIPTTLHKTQLV-------AGGRDILVW 1111

Query: 937  GTVNGVIGVIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFL 995
               +G IG++   +  E   F + L+  L      + G +H  +RS+    K V     +
Sbjct: 1112 SGFHGTIGMLVPFVSREDVDFFQNLEMQLAAQNPPLAGRDHLIYRSYYAPVKGV-----I 1166

Query: 996  DGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVE-ELTRL 1035
            DGDL E++  L       I+  ++ SV E+ +++   +TR+
Sbjct: 1167 DGDLCETYFLLPNDTKMMIAAELDRSVREIERKISLRVTRM 1207


>gi|195169735|ref|XP_002025674.1| GL20829 [Drosophila persimilis]
 gi|194109167|gb|EDW31210.1| GL20829 [Drosophila persimilis]
          Length = 1225

 Score =  201 bits (510), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 264/1199 (22%), Positives = 465/1199 (38%), Gaps = 199/1199 (16%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            +L   I+G I  L  FR  G  +D++ + ++  +  +L++    + L             
Sbjct: 53   LLSTEIFGCIRALMAFRLTGGTKDYIVVGSDSGRIVILEYMPAKNALEKVHQETFGKSGC 112

Query: 61   RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL- 117
            R    GQ   IDP  R  +IG      L  ++  D + +L  +  +   +   L    + 
Sbjct: 113  RRIVPGQYFAIDPKGRAVMIGAVEKQKLAYIMNRDTQARLTISSPLEAHKSNTLTYHMVG 172

Query: 118  --YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNGA 164
               G   P    L  D +++    T + A + +    F E     N+        L+  A
Sbjct: 173  VDVGFDNPMFACLEIDYEESDLDPTGDAAQRTQQTLTFYELDLGLNHVVRKYSEPLEEHA 232

Query: 165  DLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR---- 215
            + L+ VP     P   GVLI  E  + Y   N      IR  I +    +D D  R    
Sbjct: 233  NFLVSVPGGNDGP--SGVLICSENYLTY--KNLGDQHDIRCPIPRRRNDLD-DPERGMIF 287

Query: 216  -------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVV 262
                         +LL    G +  + +  + + V+ +K++       AS +  L    +
Sbjct: 288  ICSATHRTKSMYFFLLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPASAMCVLKTGFL 347

Query: 263  YIGSSYGDSQLIKL-------------NLQPDAKGSYVEVLERYV----------NLGPI 299
            ++ S +G+  L ++             +  P  +G       R +          +  PI
Sbjct: 348  FVASEFGNHYLYQIAHLGDDDDEPEFSSAMPLEEGETFFFAPRTLKNLVLVDELPSFAPI 407

Query: 300  VDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTD 358
            +   V DL  +   Q+    G     +LR++R+G+ ++E A  EL G    +W+++   D
Sbjct: 408  ITSQVADLANEDTPQLYVLCGRGPRSTLRVLRHGLEVSEMAVSELPGNPNAVWTVKKRAD 467

Query: 359  DPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVR 418
            D FD +++VSF++ T +L++   + +EE    GF   T TL C     + LVQV    +R
Sbjct: 468  DEFDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLCCAALGDDALVQVYPDGIR 525

Query: 419  LVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD-GILTE-VKHAQL 476
             + S  R   NEWK+P   S+     N  QV++   G  LVY E+   G L E  + +++
Sbjct: 526  HIRSDKRV--NEWKAPGKKSITKCAVNQRQVVITLSGRELVYFEMDPTGELNEYTERSEM 583

Query: 477  EYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLC 536
              EI C+ +  + +    S   AVG+  D +VRI SL   N +T   +     P    LC
Sbjct: 584  PAEIMCMALGTVPDGEQRSWFLAVGL-ADNTVRILSLDPNNCLTPCSMQALPSPAES-LC 641

Query: 537  AFE--------------------------GISYLLCALGDGHLLNFLLNMKTGELTDRKK 570
              E                          G  YL   L +G LL  +L+  +G+L D + 
Sbjct: 642  LVEMGHTESTTSAGALDDDAPPQRSGSNKGTIYLNIGLSNGVLLRTVLDPVSGDLADTRT 701

Query: 571  VSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFP 630
              LG++P+ L     +    V A S R  + Y    +   + ++ + + +   F+S    
Sbjct: 702  RYLGSRPVKLFRIKMQGAEAVLAMSSRSWLSYYHQNRFHLTPLSYETLEYASGFSSEQCS 761

Query: 631  DSLAIAKEGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQEQSRTFAICSLKN------ 682
            + +       L I  ++ +  +  + + PL   PR  + H +  R     +  N      
Sbjct: 762  EGIVAISTNTLRILALEKLGAVFNQVAFPLQFTPRTFVIHPDTGRMLIAETDHNAYTEDT 821

Query: 683  ------------QSCAEESEMHFVRLLDDQTFEFIS-TYPLDTFEY--------GCSILS 721
                        +S A + E    R + +    FI+   P D F          G S + 
Sbjct: 822  KSARKEQMAEEMRSAAGDEERELAREMANA---FINEVLPEDVFSAPKAGAGVCGASQIR 878

Query: 722  C------------------------SFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVE 757
            C                        S + D   Y  VG A  L      ++G  L     
Sbjct: 879  CPGTPCMGQTMFKTRAIMSMAMVKFSVAADGRYYLAVGIARDLQLNPRISQGGCLDIYKI 938

Query: 758  D---GKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSEC-GHH 813
            D     L+ +   E      +L  F G+LLA   + +++Y     D G +++  +C   H
Sbjct: 939  DPTCSSLEFMHRTEIDEIPGALCGFQGRLLAGCGRMLRIY-----DLGKKKMLRKCENKH 993

Query: 814  GHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYL 873
                 + +Q  G  + V D+ +S+  L Y+  E  +   A D +  W++A  +LD D  +
Sbjct: 994  IPYQIVNIQAMGHRVYVSDVQESVFFLRYRRAENQLIIFADDTHPRWVTATTLLDYDT-I 1052

Query: 874  GAENNFNLFTVRKNSEGATDE------------ERGRL-------EVVGEYHLGEFVNRF 914
               + F   ++++     TD+            +RG L       E +  +H+GE +   
Sbjct: 1053 AIADKFGNLSIQRLPHSVTDDVDEDPTGTKSLWDRGLLSGASQKSENICSFHVGEIIMSL 1112

Query: 915  RHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGG 973
            +  +L+        G    +I+ T+NG +G        E Y F + L+ ++R     + G
Sbjct: 1113 QKATLI-------PGGSEALIYSTLNGTVGAFVPFTSREDYDFFQHLEMHMRNENPPLCG 1165

Query: 974  LNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
             +H  +RS          KN LDGDL E +L +   +   I+  M  +  ++CK++E++
Sbjct: 1166 RDHLSYRS-----SYYPVKNVLDGDLCEQYLSIEAAKQKSIAGDMFRTPNQICKKLEDI 1219


>gi|242060436|ref|XP_002451507.1| hypothetical protein SORBIDRAFT_04g003000 [Sorghum bicolor]
 gi|241931338|gb|EES04483.1| hypothetical protein SORBIDRAFT_04g003000 [Sorghum bicolor]
          Length = 1232

 Score =  201 bits (510), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 262/1187 (22%), Positives = 477/1187 (40%), Gaps = 180/1187 (15%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            +L V ++G I +L  FR  G  +D+L + ++  +  +L++  + + L             
Sbjct: 66   LLSVDVFGAIRSLAQFRLTGANKDYLVVGSDSGRLVILEYSPDRNRLDKVHQETFGKSGC 125

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLY-- 118
            R    GQ+  +DP  R + +   +    V   +     +   +  LE  +   + F    
Sbjct: 126  RRIVPGQLLAVDPKGRALCIAALEKQKLVYVLNRDAAARLTISSPLEAHKSNTLTFALAA 185

Query: 119  ---GCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNGA 164
               G   P    +  +  ++    T + A + +    F E     N+        +DNGA
Sbjct: 186  LDCGFDNPVFAAIELEYAESDRDPTGQAANQAQKLLTFYELDLGLNHVSRKVSEPIDNGA 245

Query: 165  DLLIPVP-----PPLCGVLIIGEETIVYCSANAFK---AIPIRPSITKAYGRVDADGSR- 215
            +LL+ VP     P   GVL+  +  ++Y +    +    IP R  +    G +    +  
Sbjct: 246  NLLVTVPGGGDGP--SGVLVCCDNFVLYRNQGHPEIRAVIPRRADLPAERGVLIVAAATH 303

Query: 216  -------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSY 268
                   +LL    G +  + + H  + V+ L+I+      + S +  L +  ++  S +
Sbjct: 304  KQKSMFFFLLQTEYGDIFKVDLDHNGDTVSELRIKYFDTIPVTSAVCVLRSGFLFAASEF 363

Query: 269  GDSQLIKLN--------------------------LQPDAKGSYVEVLERYVNLGPIVDF 302
            G+  L +                             QP A  + + + E   +L P++D 
Sbjct: 364  GNHALYQFRDIGRDADVESSSATLMETEEGFQPVFFQPRALKNLMRIDE-IESLMPVMDM 422

Query: 303  CVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIK-GMWSLRSSTDDPF 361
             V +L  +   Q+ T  G     +LRI+R G+ I+E A   L      +W+++ + +D F
Sbjct: 423  RVANLFDEETPQLFTACGRGPRSTLRILRPGLAISEMARSMLPAEPIAVWTVKKNINDMF 482

Query: 362  DTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVS 421
            D ++VVSF + T +L++   + +EE     F   T +L       + L+QV    +R + 
Sbjct: 483  DAYIVVSFTNVTLVLSIG--ETIEEVSDSQFLDTTHSLAVTLLGEDSLMQVHPNGIRHIR 540

Query: 422  STSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTEVKHAQLEYEI 480
               R   NEW++P   ++    +N  QV++A  GG L+Y E+   G L EV+   +  ++
Sbjct: 541  EDGRV--NEWRTPGKKTITKVGSNRLQVVIALSGGELIYFEMDMTGQLMEVEKQDMSGDV 598

Query: 481  SCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPRSVLLCAFE 539
            +CL I P+ E    S+  AVG + D ++RI SL PD  L           P S+L    +
Sbjct: 599  ACLAIAPVPEGRQRSRFLAVGSY-DNTIRILSLDPDDCLQPLSVQSVSSAPESLLFLEVQ 657

Query: 540  -------GISY-----LLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKN 587
                   G  Y     L   L +G L    ++M TG+L+D +   LG +P  L      +
Sbjct: 658  ASIGGEDGADYPANLFLNAGLQNGVLFRTNVDMVTGQLSDTRSRFLGLRPPKLFPCVVSH 717

Query: 588  TTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTID 647
               +   S RP + Y      L + ++   +     F+S    + +       L I TI+
Sbjct: 718  RQAMLCLSSRPWLGYIHQGHFLLTPLSCDTLESASSFSSDQCSEGVVAVAGDALRIFTIE 777

Query: 648  DI-QKLHIRSIPLGEHPRRICHQEQSRTFAIC-SLKNQSCAEESEMHFVRLLD------- 698
             + +  +  +IPL   PR+     + +  A+  S K    AEE E      L+       
Sbjct: 778  RLGETFNETAIPLRYTPRKFVILPKKKYIAVIESDKGAFSAEEREAAKKECLEASGAAEN 837

Query: 699  ---------------DQTFEFISTYPLDTFEY---------------------------- 715
                           +   E  +T+P + + Y                            
Sbjct: 838  GNANNGDPMENGDGQEDGAEEGNTFPDEQYGYPKAESERWVSCIRILDPRSRDTTCLLEL 897

Query: 716  -----GCSILSCSFSD-DSNVYYCVGTAYVLPEENEPTKGRILVFI---VEDGK-LQLIA 765
                   SI + +F D +      VGTA  L      T     + I   V++G+ L+L+ 
Sbjct: 898  QENEAAVSICTVNFHDKEHGTLLAVGTAKGLQFWPRRTLAGGFIHIYKFVDEGRSLELLH 957

Query: 766  EKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY-VQTR 824
            + + +    +L  F G+LLA +   ++LY     D G R+L  +C +      +  + T 
Sbjct: 958  KTQVEEVPLALCQFQGRLLAGVGSVLRLY-----DLGKRKLLRKCENKLFPRTIVSIHTY 1012

Query: 825  GDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTV 884
             D I VGD+ +S     Y+ +E  +   A D    W++A + +D D   GA+   N++  
Sbjct: 1013 RDRIYVGDMQESFHYCKYRRDENQLYIFADDSVPRWLTAAQHIDFDTMAGADKFGNIYFA 1072

Query: 885  R------------------KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDS 926
            R                  K  +G  +    ++E + ++H+G+ V   +  SL+      
Sbjct: 1073 RLPQDISDEIEEDPTGGKIKWEQGKLNGAPNKVEEIVQFHVGDVVTCLQKASLI------ 1126

Query: 927  DVGQIPTVIFGTVNGVIG-VIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNE 985
              G    +I+GTV G +G ++A    E   F   L+ +LR+    + G +H  +RS    
Sbjct: 1127 -PGGGECLIYGTVMGSVGALLAFTSREDVDFFSHLEMHLRQEHPPLCGRDHMAYRS---- 1181

Query: 986  KKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
                  K+ +DGDL E +  L      +I+  ++ +  E+ K++E++
Sbjct: 1182 -AYFPVKDVIDGDLCEQYPSLPADMQRKIADELDRTPGEILKKLEDI 1227


>gi|195126264|ref|XP_002007593.1| GI12293 [Drosophila mojavensis]
 gi|193919202|gb|EDW18069.1| GI12293 [Drosophila mojavensis]
          Length = 1227

 Score =  200 bits (509), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 260/1200 (21%), Positives = 471/1200 (39%), Gaps = 199/1200 (16%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            ++   I+G I  L  FR  G  +D++ + ++  +  +L++   S   + +   +   + G
Sbjct: 53   LMSTEIFGCIRALMAFRLTGGTKDYIVVGSDSGRIVILEY-MPSKNALEKVHQETFGKSG 111

Query: 61   -RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
             R    GQ   IDP  R  +IG      L  ++  D + +L  +  +   +   L    +
Sbjct: 112  CRRIVPGQYFAIDPKGRAVMIGAVEKQKLAYIMNRDTQARLTISSPLEAHKSNTLTYHMV 171

Query: 118  ---YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNG 163
                G   P    L  D ++A    + + A + +    F E     N+        L+  
Sbjct: 172  GVDVGFDNPMFACLEIDYEEADLDPSGDAAQRTQQTLTFYELDLGLNHVVRKYSEPLEEH 231

Query: 164  ADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR--- 215
            A+ L+ VP     P   GVLI  E  + Y   N      IR  I +    +D D  R   
Sbjct: 232  ANFLVSVPGGNDGP--SGVLICSENYLTY--KNLGDQHDIRCPIPRRRNDLD-DPERGMI 286

Query: 216  --------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAV 261
                          +LL    G +  + +  + + V+ +K++       A+ +  L    
Sbjct: 287  FICSATHRTKSMYFFLLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPATAMCVLKTGF 346

Query: 262  VYIGSSYGDSQLIKLN------------------------LQPDAKGSYVEVLERYVNLG 297
            +++ S +G+  L ++                           P A  + V +++   +  
Sbjct: 347  LFVASEFGNHYLYQIAHLGDDDDEPEFSSAMPLEEGETFFFAPRALKNLV-LVDELPSFA 405

Query: 298  PIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSS 356
            PI+   V DL  +   Q+    G     +LR++R+G+ ++E A  EL G    +W+++  
Sbjct: 406  PIITSQVADLANEDTPQLYVLCGRGPRSTLRVLRHGLEVSEMAVSELPGNPNAVWTVKKR 465

Query: 357  TDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGS 416
             DD FD +++VSF++ T +L++   + +EE    GF   T TL C     + LVQV    
Sbjct: 466  IDDEFDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLCCAALGDDALVQVYPDG 523

Query: 417  VRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD-GILTE-VKHA 474
            +R + S  R   NEWK+P   S+     N  QV++   G  LVY E+   G L E  + +
Sbjct: 524  IRHIRSDKR--VNEWKAPGKKSITKCAVNQRQVVITLSGRELVYFEMDPTGELNEYTERS 581

Query: 475  QLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVL 534
            ++  EI C+ +  + E    S   AVG+  D +VRI SL   N ++   +     P    
Sbjct: 582  EMPAEIMCMALGTVPEGEQRSWFLAVGL-ADNTVRILSLDPNNCLSPCSMQALPSPAES- 639

Query: 535  LCAFE-------------------------GISYLLCALGDGHLLNFLLNMKTGELTDRK 569
            LC  E                         G  YL   L +G LL  +L+  +G+L D +
Sbjct: 640  LCLVEMGHTESTTNAGADDDVPAQRSGSNKGTIYLNIGLSNGVLLRTVLDPVSGDLADTR 699

Query: 570  KVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAF 629
               LG++P+ L     +    V A S R  + Y    +   + ++ + + +   F+S   
Sbjct: 700  TRYLGSRPVKLFRIKMQGAEAVLAMSSRTWLSYYHQNRFHLTPLSYETLEYASGFSSEQC 759

Query: 630  PDSLAIAKEGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQEQSRTFAICSLKN----- 682
             + +       L I  ++ +  +  + + PL   PR  + H +  R     +  N     
Sbjct: 760  SEGIVAISTNTLRILALEKLGAVFNQVAFPLQFTPRAFVIHPDTGRMLIAETDHNAYTEE 819

Query: 683  -------------QSCAEESEMHF-------------------------------VRLLD 698
                         +S A E E                                  +R LD
Sbjct: 820  TKNARKEQMAEEMRSAAGEEERELAREMANAFINEVLPEDVFSAPKAGLGLWASQIRCLD 879

Query: 699  DQTFEFISTYPLDTFE--YGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIV 756
                + + T PL   E     ++L  S + D   Y  VG A  L      ++G  +    
Sbjct: 880  AMHGQTMFTVPLTQNEAIMSMAMLKFSVAADGRYYLAVGIAKDLQLNPRISQGGCIDIYK 939

Query: 757  ED---GKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSEC-GH 812
             D     L+ +   E +    +L  F G+LLA   + +++Y     D G +++  +C   
Sbjct: 940  IDPTCSALEFLHRTEIEEIPGALCGFQGRLLAGCGRMLRIY-----DLGKKKMLRKCENK 994

Query: 813  HGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIY 872
            H     + +Q  G  + V D+ +S+  + Y+  E  +   A D +  W++A  +LD D  
Sbjct: 995  HIPYQIVNIQAMGHRVYVSDVQESVFFIRYRRAENQLIIFADDTHPRWVTATTLLDYDT- 1053

Query: 873  LGAENNFNLFTVRKNSEGATDE------------ERGRL-------EVVGEYHLGEFVNR 913
            +   + F   ++++     TD+            +RG L       E +  +H+GE +  
Sbjct: 1054 IAIADKFGNLSIQRLPHSVTDDVDEDPTGTKSLWDRGLLSGASQKSENICSFHVGEIIMS 1113

Query: 914  FRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVG 972
             +  +L+        G    +I+ T++G +G        E Y F + L+ ++R     + 
Sbjct: 1114 LQKATLI-------PGGSEALIYATLSGTVGAFVPFTSREDYDFFQHLEMHMRNENPPLC 1166

Query: 973  GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            G +H  +RS          KN LDGDL E +L +   +   I+  M  +  ++CK++E++
Sbjct: 1167 GRDHLSYRS-----SYYPVKNVLDGDLCEQYLSIDAVKQKSIAGDMFRTPNQICKKLEDI 1221


>gi|195428692|ref|XP_002062402.1| GK16677 [Drosophila willistoni]
 gi|194158487|gb|EDW73388.1| GK16677 [Drosophila willistoni]
          Length = 1273

 Score =  200 bits (509), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 259/1202 (21%), Positives = 477/1202 (39%), Gaps = 202/1202 (16%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            ++   I+G I  L  FR  G  +D++ + ++  +  +L++    + L  +   +   + G
Sbjct: 98   LMSTEIFGCIRALMSFRLTGGTKDYIVVGSDSGRIVILEYIPAKNAL-EKVHQETFGKTG 156

Query: 61   -RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
             R    GQ   IDP  R  +IG      L  ++  D + +L  +  +   +   L    +
Sbjct: 157  CRRIVPGQYFAIDPKGRAVMIGAVEKQKLAYIMNRDTQARLTISSPLEAHKSNTLTYHMV 216

Query: 118  ---YGCAKPTIVVLYQDNKDA---------RHVKT----YEVALKDKDFVEGPWSQNNLD 161
                G   P    L  D ++A         +H +     YE+ L     V   +S+  L+
Sbjct: 217  GVDVGFDNPMFACLEIDYEEADLDPSGDAAQHTQQTLTFYELDLGLNHVVRK-YSEP-LE 274

Query: 162  NGADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR- 215
              A+ L+ VP     P   GVLI  E  + Y   N      IR  I +    +D D  R 
Sbjct: 275  EHANFLVSVPGGNDGP--SGVLICSENYLTY--KNLGDQHDIRCPIPRRRNDLD-DPERG 329

Query: 216  ----------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDN 259
                            +LL    G +  + +  + + V+ +K++       A+ +  L  
Sbjct: 330  MIFICSATHRTKSMYFFLLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPATAMCVLKT 389

Query: 260  AVVYIGSSYGDSQLIKLN------------------------LQPDAKGSYVEVLERYVN 295
              +++ S +G+  L ++                           P A  + V +++   +
Sbjct: 390  GFLFVASEFGNHYLYQIAHLGDDDDEPEFSSAMPLEEGETFFFAPRALKNLV-LVDELPS 448

Query: 296  LGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLR 354
              PIV   V DL  +   Q+    G     +LR++R+G+ ++E A  EL G    +W+++
Sbjct: 449  FAPIVTSQVADLANEDTPQLYVLCGRGPRSTLRVLRHGLEVSEMAVSELPGNPNAVWTVK 508

Query: 355  SSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTS 414
               DD FD +++VSF++ T +L++   + +EE    GF   T TL C     + LVQV  
Sbjct: 509  KRVDDEFDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLCCAALGDDALVQVYP 566

Query: 415  GSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD-GILTE-VK 472
              +R + S  R   NEWK+P   S+     N  QV++   G  LVY E+   G L E  +
Sbjct: 567  DGIRHIRSDKR--VNEWKAPGKKSITKCAVNQRQVVITLSGRELVYFEMDPTGELNEYTE 624

Query: 473  HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLI------------- 519
             +++  EI C+ +  + +    S   +VG+  D +VRI SL   N +             
Sbjct: 625  RSEMPAEIMCMALGTVPDGEQRSWFLSVGL-ADNTVRILSLDPNNCLSPCSMQALPSPAE 683

Query: 520  ------------TKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTD 567
                        T   +G E +P      + +G  YL   L +G LL  +L+  +G+L D
Sbjct: 684  SLCLVEMGHTESTTTAVGDEDVPAQRSGGSNKGTIYLNIGLSNGVLLRTVLDPVSGDLAD 743

Query: 568  RKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSA 627
             +   LG++P+ L     + +  V A S R  + Y    +   + ++ + + +   F+S 
Sbjct: 744  TRTRYLGSRPVKLFRIKMQGSEAVLAMSSRTWLSYYHQNRFHLTPLSYETLEYASGFSSE 803

Query: 628  AFPDSLAIAKEGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQEQSRTFAICS------ 679
               + +       L I  ++ +  +  + + PL   PR  + H +  R     +      
Sbjct: 804  QCSEGIVAISTNTLRILALEKLGAVFNQVAFPLQYTPRTFVIHPDTGRMLIAETDHNAYT 863

Query: 680  --------------LKNQSCAEESEM-----------------------------HFVRL 696
                          ++N +  EE E+                               +R 
Sbjct: 864  EDTKSARKEQMAEEMRNAAGDEERELAREMANAFISEALPEDVFSAPKAGLGLWASQIRC 923

Query: 697  LDDQTFEFISTYPLDTFE--YGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVF 754
            LD    + +   PL   E     ++L  S + D   Y  VG A  L      ++G  +  
Sbjct: 924  LDAMHGQTMFNVPLTQNEAIMSMAMLKFSIAADGRYYLAVGIAKDLQLNPRISQGGCIDI 983

Query: 755  IVED---GKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSEC- 810
               D     L+ +   E +    +L  F G+LLA   + +++Y     D G +++  +C 
Sbjct: 984  YKIDPTCSSLEFLHRTEIEEIPGALCGFQGRLLAGCGRMLRIY-----DLGKKKMLRKCE 1038

Query: 811  GHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDD 870
              H     + +Q  G  + V D+ +S+  + Y+  E  +   A D +  W++A  +LD D
Sbjct: 1039 NKHIPYQIVNIQAMGHRVYVSDVQESVFFIRYRRAENQLIIFADDTHPRWVTATTLLDYD 1098

Query: 871  IYLGAENNFNLFTVRKNSEGATDE------------ERGRL-------EVVGEYHLGEFV 911
              +   + F   ++++     TD+            +RG L       E +  +H+GE +
Sbjct: 1099 T-IAIADKFGNLSIQRLPHSVTDDVDEDPTGTKSLWDRGLLSGASQKSENICSFHVGEII 1157

Query: 912  NRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKG 970
               +  +L+        G    +I+ T++G +G        E Y F + L+ ++R     
Sbjct: 1158 MSLQKATLI-------PGGSEALIYATLSGTVGAFVPFTSREDYDFFQHLEMHMRNENPP 1210

Query: 971  VGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVE 1030
            + G +H  +RS          KN LDGDL E +L +   +   I+  M  +  ++CK++E
Sbjct: 1211 LCGRDHLSYRS-----SYYPVKNVLDGDLCEQYLSIDAVKQKSIAGDMFRTPNQICKKLE 1265

Query: 1031 EL 1032
            ++
Sbjct: 1266 DI 1267


>gi|346327528|gb|EGX97124.1| pre-mRNA splicing factor RSE1 [Cordyceps militaris CM01]
          Length = 1206

 Score =  200 bits (508), Expect = 4e-48,   Method: Compositional matrix adjust.
 Identities = 273/1165 (23%), Positives = 491/1165 (42%), Gaps = 160/1165 (13%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
            I+G + ++ +FR  G  +D++ +AT+  +  +L++    +      M        R    
Sbjct: 58   IFGILRSIAVFRLAGSNKDYIILATDSGRITILEYLPGPNRFNRLHMETFGKSGIRRVVP 117

Query: 66   GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YGC 120
            G+    DP  R  LI     + L  V+  +++ +L  +  +   +  VL I  +    G 
Sbjct: 118  GEYLACDPKGRACLISAVEKNKLVYVLNRNSQAELTISSPLEAHKPGVLVIAMVALDVGY 177

Query: 121  AKPTIVVLY-------QD--NKDARHVKT----YEVALKDKDFVEGPWSQNNLDNGADLL 167
            A P    L        QD   +    V+T    YE+ L     V   WS + +D  A LL
Sbjct: 178  ANPVFAALEIEYTEVDQDITGEALSEVETQLVYYELDLGLNHVVRK-WS-DPVDPTASLL 235

Query: 168  IPVP-----PPLCGVLIIGEETIVYCSAN--AFKA-IPIRPSITKAYGR---VDADGSRY 216
              VP     P   GVL+ GEE I Y  +N  A +  IP R   T+   R   + A     
Sbjct: 236  FQVPGGNDGP--SGVLVCGEENITYRHSNQDALRVPIPRRRGATEDPSRKRNIVAGVMHK 293

Query: 217  LLGDHAGLLHLL-------------VITHEKEKVTG----LKIELLGETSIASTISYLDN 259
            L G       LL             ++  E+   TG    +KI+      +A+++  L +
Sbjct: 294  LKGSAGAFFFLLQSDDGDLFKITIDMVEDEEGAPTGEVQRMKIKYFDTVPVATSLCILKS 353

Query: 260  AVVYIGSSYGDS---QLIKL------------NLQPDAKGSY------------VEVLER 292
              +Y+ S +G+    Q  KL            +   D   +Y            + +++ 
Sbjct: 354  GFLYVASQFGNYAFYQFEKLGDDDDEVEFSSEDFPVDPLAAYEPVYFYPRLAENLALVDS 413

Query: 293  YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMW 351
               + P++D  V +L  +   Q+ T  G     + R +++G+ +NE  + EL G+   +W
Sbjct: 414  IPAMNPLLDCKVANLTGEDAPQIFTICGNGARSTFRTLKHGLEVNEIVASELPGVPSAVW 473

Query: 352  SLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQ 411
            +L+ ++D+ +D ++V+SF + T +L++   + +EE    GF +   T+       + L+Q
Sbjct: 474  TLKLNSDEQYDAYIVLSFTNGTLVLSIG--ETVEEVSDSGFLTSVPTIAAQLLGTDGLIQ 531

Query: 412  VTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTE 470
            V    +R + + +    NEW +P   S+  A+ N+ QV +A   G +VY E+  DG L E
Sbjct: 532  VHPRGIRHIRNGNV---NEWSAPQHRSIVAASTNSHQVAIALSSGEIVYFEMDSDGSLAE 588

Query: 471  VKHAQLEY-EISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEI 528
                +  +  ++ L +  + E    S   AVG   D +VRI SL P+  L  K       
Sbjct: 589  YDEKKEMFGTVTALSLGEVPEGRVRSSFLAVGC-DDSTVRILSLDPESTLENKSVQALTA 647

Query: 529  IPRSVLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRT 582
             P S+ +   +  S      YL   L  G  L  +L+  TGELTD ++  LG +P+ L  
Sbjct: 648  APTSLAIIPMDDSSSGGSTLYLHIGLHSGVYLRTVLDEITGELTDTRQKFLGPKPVRLFQ 707

Query: 583  FSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHM-----CPFNSAAFPDSLAIAK 637
             +    T V   S RP + Y++    + S+  +  +S++       F+S    D +   +
Sbjct: 708  VTVGGRTCVLGLSSRPWLGYANP---ITSSFEITPLSYVDLEWGWNFSSEQCEDGVVGIQ 764

Query: 638  EGELTIGTIDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFV-- 694
               L I +ID + + L   SIPL   P++         F      N +   + +   +  
Sbjct: 765  GQSLRIFSIDRLGETLLQSSIPLTYTPKKFVKHPDEPLFYTIEADNHTLPPDLQAKLLAD 824

Query: 695  --------RLLDDQTF----------------EFISTYP-------LDTFEYGCSILSCS 723
                    ++L  + F                + +S  P        +  E   S+   S
Sbjct: 825  PAAVNGDAKVLPPEDFGHPRGNRRWASCISVVDPVSEEPSVLQRVDFENNEAAVSVAVVS 884

Query: 724  F-SDDSNVYYCVGTAY-VLPEENEPTKGRILVFIVEDG--KLQLIAEKETKGAVYSLNAF 779
            F S D+  +  VGT   ++      ++  I ++  + G  +L+ I + + +    +L  F
Sbjct: 885  FASQDNESFLVVGTGKDIVLNPRSSSEAYIYIYRFQQGGRELEFIHKTKIEEPATALLPF 944

Query: 780  NGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISL 839
             GKLLA I + +++Y   +R    R+ Q+E      I++L   T+G  IVV D+ + ++L
Sbjct: 945  QGKLLAGIGKTLRMYDLGMR-QLLRKAQAEVVPQ-QIVSL--NTQGSRIVVSDVQQGVTL 1000

Query: 840  LIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGA--TDEERG 897
            ++YK     +     D  A W +   ++D +   G +   N+F VR  ++ +   DE+  
Sbjct: 1001 VVYKSASNKLIPFVDDSIARWSTCTTMVDYESVAGGDKFGNMFIVRSPAKASEEADEDAA 1060

Query: 898  RLEVV-------GEYHLGEFVNRFRHGSLVMRLPDSD--VGQIPTVIFGTVNGVIGV-IA 947
             L +V       G  H  E +  F    ++  +  +   VG    +++  + G IGV I 
Sbjct: 1061 GLHLVNARDYLHGTQHRLELMCHFFTQDILTSINKTGLVVGGQDVLLWSGIMGTIGVFIP 1120

Query: 948  SLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLS 1007
             +  E   F + L+ +LR     + G +H  +RS+    K V     +DGDL E F  L 
Sbjct: 1121 FVSREDTDFFQSLEQHLRTEDGPLAGRDHLMYRSYYAPVKGV-----IDGDLCERFSILP 1175

Query: 1008 RTRMDEISKTMNVSVEELCKRVEEL 1032
              +   I+  ++ SV E+ +++ ++
Sbjct: 1176 NDKKQMIAGELDRSVREIERKISDI 1200


>gi|115390120|ref|XP_001212565.1| splicing factor 3B subunit 3 [Aspergillus terreus NIH2624]
 gi|114194961|gb|EAU36661.1| splicing factor 3B subunit 3 [Aspergillus terreus NIH2624]
          Length = 1217

 Score =  200 bits (508), Expect = 4e-48,   Method: Compositional matrix adjust.
 Identities = 275/1169 (23%), Positives = 476/1169 (40%), Gaps = 177/1169 (15%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
            ++G I +L  FR  G  +D++ I ++  +  ++++   S     R   +   + G R   
Sbjct: 64   VFGIIRSLAAFRLAGSNKDYIIIGSDSGRITIIEY-VPSQNRFNRIHLETFGKSGVRRVI 122

Query: 65   NGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YG 119
             GQ   +DP  R  +I     + L  V+  +++ +L  +  +   + Q L         G
Sbjct: 123  PGQYLAVDPKGRACIIASVEKNKLVYVLNRNSQAELTISSPLEAHKPQTLVFAMCALDVG 182

Query: 120  CAKPTIVVLYQDNKDARHVKT-------------YEVALKDKDFVEGPWSQNNLDNGADL 166
               P    L  D  ++    T             YE+ L     V   W+ + +D  A +
Sbjct: 183  YENPVFAALEVDYSESDQDPTGRAFEETEKLLVYYELDLGLNHVVRK-WA-DPVDRTATM 240

Query: 167  LIPVPPPL---CGVLIIGEETIVYCSAN--AFKA-IPIRPSITKAYGRVDADGSR----- 215
            L  VP       GVL+  E+ + Y   N  AF+  IP R   T+   R     +      
Sbjct: 241  LFQVPGGADGPSGVLVCAEDNVTYRHLNQDAFRVPIPRRRGATENPERKRCITAGVMHKM 300

Query: 216  -----YLLGDHAGLLHLLVITHEKE---KVTG----LKIELLGETSIASTISYLDNAVVY 263
                 +LL    G L  L I   ++   ++TG    LKI+      IAS +  L +  +Y
Sbjct: 301  RGAFFFLLQTEDGDLFKLTIDMMEDDNGQLTGEVKRLKIKYFDTVPIASNLLILKSGFLY 360

Query: 264  IGSSYGDSQLIKLN----------------------------LQPDAKGSYVEVLERYVN 295
            + S  G+    +                               QP      + +++   +
Sbjct: 361  VASEAGNHHFYQFEKLGDDDEEIEFSSEDFSADPSVPCEPVYFQPRG-AENLNLVDTINS 419

Query: 296  LGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSLR 354
            L P++D  V +L  +   Q+ T SGA    + R +++G+ ++E    EL  +   +W+ +
Sbjct: 420  LNPLIDSKVANLSEEDAPQIYTISGAGARSTFRTLKHGLEVSEIVESELPSVPSAVWTTK 479

Query: 355  SSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTS 414
             + +D FD ++++SF + T +L++   + +EE    GF S   TL       + LVQ+  
Sbjct: 480  LTREDEFDAYIILSFANGTLVLSIG--ETVEEVTDTGFLSSAPTLAVQQLGEDSLVQIHP 537

Query: 415  GSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTEV-K 472
              +R +    R   NEW +P   S+  A  N  QV +A   G +VY E+  DG L E  +
Sbjct: 538  RGIRHIMVDRR--VNEWPAPQHRSIVAAATNERQVAVALSSGEIVYFEMDADGTLAEYDE 595

Query: 473  HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPR 531
              Q+   ++ L +  + E    S   AVG   D +VRI SL PD  L  K        P 
Sbjct: 596  RRQMSGTVTSLSLGEVPEGRIRSSFLAVGC-DDSTVRILSLDPDSTLENKSVQALTAAPS 654

Query: 532  SVLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSS 585
            ++ + +    S      YL   L  G  L  +L+  TGEL+D +   LG++P+ L   S 
Sbjct: 655  ALNIMSMADSSSGGTTLYLHIGLYSGVYLRTVLDEVTGELSDTRTRFLGSKPVKLFQVSV 714

Query: 586  KNTTHVFAASDRPTVIYS--SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTI 643
            K  T V A S RP + YS    K  + + ++   +     F+S    + +   +   L I
Sbjct: 715  KGQTAVLALSSRPWLGYSDIQTKGFMLTPLDYVPLEWGWNFSSEQCLEGMVGIQGQNLRI 774

Query: 644  GTIDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTF 702
             +I+ +   +  +SIPL   PRR     +   F +    N   +  +     RLL+D   
Sbjct: 775  FSIEKLDNNMLQQSIPLSYTPRRFVKHPEQPLFYVIESDNNVLSPSTR---ARLLEDSKS 831

Query: 703  -----------EF------------------------ISTYPLDTFEYGCSILSCSF-SD 726
                       EF                        + T  L+  E   S+ +  F S 
Sbjct: 832  RNGDTTVLPPEEFGYPRATGHWASCIQVVDPLDAKAVVHTVELEDNEAAVSVAAVPFTSQ 891

Query: 727  DSNVYYCVGTAYVLPEENEPTKGRILVFIV---EDGK-LQLIAEKETKGAVYSLNAFNGK 782
            D   +  VGT   +   N P+     + I    EDG+ L+ I + + +    +L AF G+
Sbjct: 892  DDETFLVVGTVKDM-TVNPPSSAGGFIHIYRFQEDGRELEFIHKTKVEEPPLALLAFQGR 950

Query: 783  LLAAINQKIQLYKWMLRDDGTRELQSECGHH---GHILALYVQTRGDFIVVGDLMKSISL 839
            L   +   +++Y     D G ++L  +C  H     I+ L  QT+G  IVV D+ +S++ 
Sbjct: 951  LAVGLGSLLRIY-----DLGMKQLLRKCQAHVVPKTIVGL--QTQGSRIVVSDVRESVTY 1003

Query: 840  LIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDE-- 894
            ++YK++E  +     D  + W ++  ++D +   G +   NL+ VR   K SE A ++  
Sbjct: 1004 VVYKYQENVLIPFVDDSISRWTTSTTMVDYETVAGGDKFGNLWLVRCPKKVSEQADEDGS 1063

Query: 895  ------ERG-------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNG 941
                  ERG       RLE++   +  +         LV        G    +++   +G
Sbjct: 1064 GAHLIHERGYLHGTPNRLELMIHVYTQDIPTSLHKTQLV-------AGGRDILVWTGFHG 1116

Query: 942  VIGVIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLI 1000
             IG++   +  E   F + L+  L      + G +H  +RS+    K V     +DGDL 
Sbjct: 1117 TIGMLVPFVSREDVDFFQNLEMQLASQHPPLAGRDHLIYRSYYAPVKGV-----IDGDLC 1171

Query: 1001 ESFLDLSRTRMDEISKTMNVSVEELCKRV 1029
            E++  L       I+  ++ SV E+ +++
Sbjct: 1172 ETYFLLPNDTKMMIAADLDRSVREIERKI 1200


>gi|125977518|ref|XP_001352792.1| GA12611 [Drosophila pseudoobscura pseudoobscura]
 gi|54641542|gb|EAL30292.1| GA12611 [Drosophila pseudoobscura pseudoobscura]
          Length = 1228

 Score =  200 bits (508), Expect = 5e-48,   Method: Compositional matrix adjust.
 Identities = 263/1202 (21%), Positives = 466/1202 (38%), Gaps = 202/1202 (16%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            +L   I+G I  L  FR  G  +D++ + ++  +  +L++    + L             
Sbjct: 53   LLSTEIFGCIRALMAFRLTGGTKDYIVVGSDSGRIVILEYMPAKNALEKVHQETFGKSGC 112

Query: 61   RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL- 117
            R    GQ   IDP  R  +IG      L  ++  D + +L  +  +   +   L    + 
Sbjct: 113  RRIVPGQYFAIDPKGRAVMIGAVEKQKLAYIMNRDTQARLTISSPLEAHKSNTLTYHMVG 172

Query: 118  --YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNGA 164
               G   P    L  D +++    T + A + +    F E     N+        L+  A
Sbjct: 173  VDVGFDNPMFACLEIDYEESDLDPTGDAAQRTQQTLTFYELDLGLNHVVRKYSEPLEEHA 232

Query: 165  DLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR---- 215
            + L+ VP     P   GVLI  E  + Y   N      IR  I +    +D D  R    
Sbjct: 233  NFLVSVPGGNDGP--SGVLICSENYLTY--KNLGDQHDIRCPIPRRRNDLD-DPERGMIF 287

Query: 216  -------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVV 262
                         +LL    G +  + +  + + V+ +K++       AS +  L    +
Sbjct: 288  ICSATHRTKSMYFFLLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPASAMCVLKTGFL 347

Query: 263  YIGSSYGDSQLIKL-------------NLQPDAKGSYVEVLERYV----------NLGPI 299
            ++ S +G+  L ++             +  P  +G       R +          +  PI
Sbjct: 348  FVASEFGNHYLYQIAHLGDDDDEPEFSSAMPLEEGETFFFAPRTLKNLVLVDELPSFAPI 407

Query: 300  VDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTD 358
            +   V DL  +   Q+    G     +LR++R+G+ ++E A  EL G    +W+++   D
Sbjct: 408  ITSQVADLANEDTPQLYVLCGRGPRSTLRVLRHGLEVSEMAVSELPGNPNAVWTVKKRAD 467

Query: 359  DPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVR 418
            D FD +++VSF++ T +L++   + +EE    GF   T TL C     + LVQV    +R
Sbjct: 468  DEFDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLCCAALGDDALVQVYPDGIR 525

Query: 419  LVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKHAQL 476
             + S  R   NEWK+P   S+     N  QV++   G  LVY E+   G L E  + +++
Sbjct: 526  HIRSDKRV--NEWKAPGKKSITKCAVNQRQVVITLSGRELVYFEMDPTGELNEYTERSEM 583

Query: 477  EYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLC 536
              EI C+ +  + +    S   AVG+  D +VRI SL   N +T   +     P    LC
Sbjct: 584  PAEIMCMALGTVPDGEQRSWFLAVGL-ADNTVRILSLDPNNCLTPCSMQALPSPAES-LC 641

Query: 537  AFE--------------------------GISYLLCALGDGHLLNFLLNMKTGELTDRKK 570
              E                          G  YL   L +G LL  +L+  +G+L D + 
Sbjct: 642  LVEMGHTESTTSAGALDDDAPPQRSGSNKGTIYLNIGLSNGVLLRTVLDPVSGDLADTRT 701

Query: 571  VSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFP 630
              LG++P+ L     +    V A S R  + Y    +   + ++ + + +   F+S    
Sbjct: 702  RYLGSRPVKLFRIKMQGAEAVLAMSSRSWLSYYHQNRFHLTPLSYETLEYASGFSSEQCS 761

Query: 631  DSLAIAKEGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQEQSRTFAICSLKN------ 682
            + +       L I  ++ +  +  + + PL   PR  + H +  R     +  N      
Sbjct: 762  EGIVAISTNTLRILALEKLGAVFNQVAFPLQFTPRTFVIHPDTGRMLIAETDHNAYTEDT 821

Query: 683  ------------QSCAEESEMHFVRLLDDQTFEFIS-TYPLDTFE--------------- 714
                        +S A + E    R + +    FI+   P D F                
Sbjct: 822  KSARKEQMAEEMRSAAGDEERELAREMANA---FINEVLPEDVFSAPKAGLGLWASQIRC 878

Query: 715  ----YGCSILSCSFSD----------------DSNVYYCVGTAYVLPEENEPTKGRILVF 754
                +G ++ S S +                 D   Y  VG A  L      ++G  L  
Sbjct: 879  LDAMHGQTMFSVSLTQNEAIMSMAMVKFSVAADGRYYLAVGIARDLQLNPRISQGGCLDI 938

Query: 755  IVED---GKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSEC- 810
               D     L+ +   E      +L  F G+LLA   + +++Y     D G +++  +C 
Sbjct: 939  YKIDPTCSSLEFMHRTEIDEIPGALCGFQGRLLAGCGRMLRIY-----DLGKKKMLRKCE 993

Query: 811  GHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDD 870
              H     + +Q  G  + V D+ +S+  L Y+  E  +   A D +  W++A  +LD D
Sbjct: 994  NKHIPYQIVNIQAMGHRVYVSDVQESVFFLRYRRAENQLIIFADDTHPRWVTATTLLDYD 1053

Query: 871  IYLGAENNFNLFTVRKNSEGATDE------------ERGRL-------EVVGEYHLGEFV 911
              +   + F   ++++     TD+            +RG L       E +  +H+GE +
Sbjct: 1054 T-IAIADKFGNLSIQRLPHSVTDDVDEDPTGTKSLWDRGLLSGASQKSENICSFHVGEII 1112

Query: 912  NRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKG 970
               +  +L+        G    +I+ T+NG +G        E Y F + L+ ++R     
Sbjct: 1113 MSLQKATLI-------PGGSEALIYSTLNGTVGAFVPFTSREDYDFFQHLEMHMRNENPP 1165

Query: 971  VGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVE 1030
            + G +H  +RS          KN LDGDL E +L +   +   I+  M  +  ++CK++E
Sbjct: 1166 LCGRDHLSYRS-----SYYPVKNVLDGDLCEQYLSIEAAKQKSIAGDMFRTPNQICKKLE 1220

Query: 1031 EL 1032
            ++
Sbjct: 1221 DI 1222


>gi|321260749|ref|XP_003195094.1| hypothetical protein CGB_G1120W [Cryptococcus gattii WM276]
 gi|317461567|gb|ADV23307.1| Conserved hypothetical protein [Cryptococcus gattii WM276]
          Length = 1276

 Score =  199 bits (507), Expect = 5e-48,   Method: Compositional matrix adjust.
 Identities = 221/885 (24%), Positives = 395/885 (44%), Gaps = 120/885 (13%)

Query: 213  GSRYLLGDHAGLLHLLVITHEKEKVTGLKIEL------LGETSIASTISYLDNAVVYIGS 266
            G+  ++GD  G         EK    G    +      LG +S  S+I+YLD++ +++ S
Sbjct: 375  GATVVIGDEYGAFTAFGWEFEKGSGAGTDGRVRVLRTYLGASSPPSSITYLDSSHLFVSS 434

Query: 267  SYGDSQLIKLNL--------QPDAKGSYV----------------------------EVL 290
            +  DS L++               KG                               E+L
Sbjct: 435  AVADSVLLRFTTVESSSSVSSSKGKGRAATGPIGDQLDKWEVMYENGKERNDTDGGPEIL 494

Query: 291  ERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVE-LQGIKG 349
            ER++N+ P+ D CVV  E      +V  SGA +  SLR+VR+G+G+ E  +++ L  ++ 
Sbjct: 495  ERWMNIAPVKDLCVVKDEGGNLSHLVLASGASESNSLRVVRSGVGLEELVTIQGLHDVQK 554

Query: 350  MWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQL 409
            MWSL   TD    + L++S  + T +L +  +  +  T    F S+T            L
Sbjct: 555  MWSL---TDSTATSRLLLSTSNSTVLLQLQPDISVIPTTDIIFNSETLAAGILPG-AELL 610

Query: 410  VQVTSGSVRLVSS-TSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGIL 468
             Q+T   + L S   S +L  + +      +  A   A   ++A  GG+LV   + +   
Sbjct: 611  AQITPRGLFLWSDLNSGQLEAQVEVDKETEIVCAQVTAEWAVVAKNGGNLVVFHVSNTGF 670

Query: 469  TEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEI 528
                   ++ E+S + I+    + S S I A+  WT  +V +++L  ++      + G  
Sbjct: 671  NPKGTIDVKEEVSAVAISR--NSDSSSPIIAISTWTAKTV-VYTLSQIS----SGVDGLS 723

Query: 529  IPRSVLLCAFEGISY--------LLCALGDGHLLNFLLNMKTGELTD------RKKVSLG 574
            IP      + +  S+        LL  L +G L  + L+       D       K  SLG
Sbjct: 724  IPSKSSATSLQLRSHPSYPAGIQLLSGLDNGLLQIYDLDKDDMGGVDGLVVKSSKTTSLG 783

Query: 575  TQPITLR----TFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFP 630
             +P+ L     T   +    V A ++R +VI+ S  ++ +S+VN+K +      N+++ P
Sbjct: 784  LRPLALHPCEITGGDEKVISV-ALTERMSVIFESKDRIEFSSVNIKNIMAAAAVNTSSGP 842

Query: 631  DSLAIAKEGELTIGTIDDIQKLHIRSIPLG-EHPRRICHQEQSRTFAICSLKNQSCAEES 689
                 ++   L++  I+ ++KLH+++   G E   ++ + ++ +  A C L  ++   + 
Sbjct: 843  VLALFSRTSGLSLLKINSLKKLHVQTCDTGNESISKLNYMDEYKAIA-CGLTRRTQLRDG 901

Query: 690  ---EMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEP 746
               E +FV++ D  + E +S++ L   E   S+ S       ++Y  VGTA++ P++ E 
Sbjct: 902  DVEEENFVQIRDGTSLEPLSSFTLRERELVTSLRSVFLV--GSMYLAVGTAFLPPDDGED 959

Query: 747  T-----------KGRILVFIVEDGK------LQLIAEKETKGAVYSLNAFNGKLLAAINQ 789
            +           +GR+L+  +++G       +++ AE  T GAVY+L   +G L  A   
Sbjct: 960  SSWDEGNLAVVKEGRVLLLEIKEGDAGGGWDVKIKAELTTVGAVYALEEIHGFLAVAAGS 1019

Query: 790  KIQLYKWMLRDDGTRELQSECGHHG-----HILALY-VQTRGD-FIVVGDLMKSISLLIY 842
            K+ +++    D  + EL+             + AL+   TR +  ++VGD M+S+ +L  
Sbjct: 1020 KLTMHR---LDHNSVELEETSSWASAYVISSLSALHPSHTRPEGALIVGDGMRSVIVLNV 1076

Query: 843  KHEEGAIEERARDYNANWMSAVEILDD--DIYLGAENNFNLFTVRKNSEGATDEERGRLE 900
               +G I +  R+   + ++A+ +L D  D  + ++ + NL T R N          +LE
Sbjct: 1077 DEGDGMIYDDERNMATHGVTALGLLKDKGDAVVVSDAHSNLLTYRLNQ---------KLE 1127

Query: 901  VVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKL 960
                + L E V RF++GSLV      ++  IP V+F T  G +GVI  L       L+ L
Sbjct: 1128 RAATFGLHEEVTRFQNGSLVPTTTAPEI-IIPDVLFATREGRLGVIGELGTMSSRTLDDL 1186

Query: 961  QTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLD 1005
            Q N+ K+ KG G +    WR   +     D   F+DGD ++ FLD
Sbjct: 1187 QRNMGKIWKGPGDVGWSNWRRAGSNLVGKDTAGFVDGDFVQKFLD 1231


>gi|194749950|ref|XP_001957397.1| GF24063 [Drosophila ananassae]
 gi|190624679|gb|EDV40203.1| GF24063 [Drosophila ananassae]
          Length = 1228

 Score =  199 bits (505), Expect = 8e-48,   Method: Compositional matrix adjust.
 Identities = 258/1201 (21%), Positives = 473/1201 (39%), Gaps = 200/1201 (16%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            ++   I+G I  L  FR  G  +D++ + ++  +  +L++   S   + +   +   + G
Sbjct: 53   LMSTEIFGCIRALMAFRLTGGTKDYIVVGSDSGRIVILEY-IPSKNALEKVHQETFGKSG 111

Query: 61   -RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
             R    GQ   IDP  R  +IG      L  ++  D + +L  +  +   +   L    +
Sbjct: 112  CRRIVPGQYFAIDPKGRAVMIGAVEKQKLAYIMNRDTQARLTISSPLEAHKSNTLTYHMV 171

Query: 118  ---YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNG 163
                G   P    L  D ++A    + + A + +    F E     N+        L+  
Sbjct: 172  GVDVGFDNPMFACLEIDYEEADLDPSGDAAQRTQQTLTFYELDLGLNHVVRKYSEPLEEH 231

Query: 164  ADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR--- 215
            A+ L+ VP     P   GVLI  E  + Y   N      IR  I +    +D D  R   
Sbjct: 232  ANFLVSVPGGNDGP--SGVLICSENYLTY--KNLGDQHDIRCPIPRRRNDLD-DPERGMI 286

Query: 216  --------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAV 261
                          +LL    G +  + +  + + V+ +K++       A+ +  L    
Sbjct: 287  FICSATHRTKSMYFFLLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPATAMCVLKTGF 346

Query: 262  VYIGSSYGDSQLIKLN------------------------LQPDAKGSYVEVLERYVNLG 297
            +++ S +G+  L ++                           P A  + V +++   +  
Sbjct: 347  LFVASEFGNHYLYQIAHLGDDDDEPEFSSAMPLEEGETFFFAPRALKNLV-LVDELPSFA 405

Query: 298  PIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSS 356
            PI+   V DL  +   Q+    G     +LR++R+G+ ++E A  EL G    +W+++  
Sbjct: 406  PIITSQVADLANEDTPQLYVLCGRGPRSTLRVLRHGLEVSEMAVSELPGNPNAVWTVKKR 465

Query: 357  TDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGS 416
             DD FD +++VSF++ T +L++   + +EE    GF   T TL C     + LVQV    
Sbjct: 466  ADDEFDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLCCAALGDDALVQVYPDG 523

Query: 417  VRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD-GILTE-VKHA 474
            +R + S  R   NEWK+P   S+     N  QV++   G  LVY E+   G L E  + +
Sbjct: 524  IRHIRSDKR--VNEWKAPGKKSITKCAVNQRQVVITLSGRELVYFEMDPTGELNEYTERS 581

Query: 475  QLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVL 534
            ++  EI C+ +  + E    S   AVG+  D +VRI SL   N +T   +     P    
Sbjct: 582  EMPAEIMCMALGTVPEGEQRSWFLAVGL-ADNTVRILSLDPNNCLTPCSMQALPSPAES- 639

Query: 535  LCAFE--------------------------GISYLLCALGDGHLLNFLLNMKTGELTDR 568
            LC  E                          G  YL   L +G LL  +L+  +G+L D 
Sbjct: 640  LCLVEMGHTESTTSAGGVNEDAPAQRSGSNKGTIYLNIGLSNGVLLRTVLDPVSGDLADT 699

Query: 569  KKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAA 628
            +   LG++P+ L     + +  V A S R  + Y    +   + ++ + + +   F+S  
Sbjct: 700  RTRYLGSRPVKLFRIKMQGSEAVLAMSSRTWLSYYHQNRFHLTPLSYETLEYASGFSSEQ 759

Query: 629  FPDSLAIAKEGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQEQSRTFAICS------- 679
              + +       L I  ++ +  +  + + PL   PR  + H +  R     +       
Sbjct: 760  CSEGIVAISTNTLRILALEKLGAVFNQVAFPLQYTPRTFVIHPDTGRMLIAETDHNAYTE 819

Query: 680  -------------LKNQSCAEESEM-----------------------------HFVRLL 697
                         +++ +  EE E+                               +R L
Sbjct: 820  DTKNARKEQMAEEMRSAAGDEERELAREMANAFINEVLPEDVFSAPKAGLGLWASQIRCL 879

Query: 698  DDQTFEFISTYPLDTFE--YGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFI 755
            D    + +   PL   E     ++L  S + D   Y  VG A  L      ++G  +   
Sbjct: 880  DAMHGQTMFNVPLTQNEAIMSMAMLKFSIAADGRYYLAVGIAKDLQLNPRISQGGCIDIY 939

Query: 756  VED---GKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSEC-G 811
              D     L+ +   E      +L  F G+LLA   + +++Y     D G +++  +C  
Sbjct: 940  KIDPTCSSLEFMHRTEIDEIPGALCGFQGRLLAGCGRMLRIY-----DLGKKKMLRKCEN 994

Query: 812  HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI 871
             H     + +Q  G  + V D+ +S+  + Y+  E  +   A D +  W++A  +LD D 
Sbjct: 995  KHIPYQIVNIQAMGHRVYVSDVQESVFFIRYRRAENQLIIFADDTHPRWVTATTLLDYDT 1054

Query: 872  YLGAENNFNLFTVRKNSEGATDE------------ERGRL-------EVVGEYHLGEFVN 912
             +   + F   ++++     TD+            +RG L       E +  +H+GE + 
Sbjct: 1055 -IAIADKFGNLSIQRLPHSVTDDVDEDPTGTKSLWDRGLLSGASQKSENICSFHVGEIIM 1113

Query: 913  RFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGV 971
              +  +L+        G    +I+ T++G +G        E Y F + L+ ++R     +
Sbjct: 1114 SLQKATLI-------PGGSEALIYATLSGTVGAFVPFTSREDYDFFQHLEMHMRNENPPL 1166

Query: 972  GGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEE 1031
             G +H  +RS          KN LDGDL E +L +  ++   I+  M  +  ++CK++E+
Sbjct: 1167 CGRDHLSYRS-----SYYPVKNVLDGDLCEQYLSIDASKQKSIAGDMFRTPNQICKKLED 1221

Query: 1032 L 1032
            +
Sbjct: 1222 I 1222


>gi|195012560|ref|XP_001983703.1| GH16029 [Drosophila grimshawi]
 gi|193897185|gb|EDV96051.1| GH16029 [Drosophila grimshawi]
          Length = 1228

 Score =  199 bits (505), Expect = 9e-48,   Method: Compositional matrix adjust.
 Identities = 257/1200 (21%), Positives = 470/1200 (39%), Gaps = 198/1200 (16%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            ++   I+G I  L  FR  G  +D++ I ++  +  +L++    + L             
Sbjct: 53   LMSTEIFGCIRALMAFRLTGGTKDYIVIGSDSGRIVILEYIPSKNSLEKVHQETFGKSGC 112

Query: 61   RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL- 117
            R    GQ   IDP  R  +IG      L  ++  D + +L  +  +   +   L    + 
Sbjct: 113  RRIVPGQYFAIDPKGRAVMIGAVEKQKLAYIMNRDTQARLTISSPLEAHKSNTLTYHMVG 172

Query: 118  --YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNGA 164
               G   P    L  D ++A    + + A + +    F E     N+        L+  A
Sbjct: 173  VDVGFDNPMFACLEIDYEEADLDPSGDAAQRTQQTLTFYELDLGLNHVVRKYSEPLEEHA 232

Query: 165  DLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR---- 215
            + L+ VP     P   GVLI  E  + Y   N      IR  I +    +D D  R    
Sbjct: 233  NFLVSVPGGNDGP--SGVLICSENYLTY--KNLGDQHDIRCPIPRRRNDLD-DPERGMIF 287

Query: 216  -------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVV 262
                         +LL    G +  + +  + + V+ +K++       A+ +  L    +
Sbjct: 288  ICSATHRTKSMYFFLLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPATAMCVLKTGFL 347

Query: 263  YIGSSYGDSQLIKLN------------------------LQPDAKGSYVEVLERYVNLGP 298
            ++ S +G+  L ++                           P A  + V +++   +  P
Sbjct: 348  FVASEFGNHYLYQIAHLGDDDDEPEFSSAMPLEEGETFFFAPRALKNLV-LVDELPSFAP 406

Query: 299  IVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSST 357
            I+   V DL  +   Q+    G     +LR++R+G+ ++E A  EL G    +W+++   
Sbjct: 407  IITSQVADLANEDTPQLYVLCGRGPRSTLRVLRHGLEVSEMAVSELPGNPNAVWTVKKRI 466

Query: 358  DDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSV 417
            DD FD +++VSF++ T +L++   + +EE    GF   T TL C     + LVQV    +
Sbjct: 467  DDEFDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLCCAALGDDALVQVYPDGI 524

Query: 418  RLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD-GILTE-VKHAQ 475
            R + S  R   NEWK+P   S+     N  QV++   G  LVY E+   G L E  + ++
Sbjct: 525  RHIRSDKR--VNEWKAPGKKSITKCAVNQRQVVITLSGRELVYFEMDPTGELNEYTERSE 582

Query: 476  LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLL 535
            +  EI C+ +  + E    S   AVG+  D +VRI SL   N ++   +     P    L
Sbjct: 583  MPAEIMCMALGTVPEGEQRSWFLAVGL-ADNTVRILSLDPNNCLSPCSMQALPSPAES-L 640

Query: 536  CAFE--------------------------GISYLLCALGDGHLLNFLLNMKTGELTDRK 569
            C  E                          G  YL   L +G LL  +L+  +G+L D +
Sbjct: 641  CLVEMGHTESTTNASGADEDVPAQRSGSNKGTIYLNIGLSNGVLLRTVLDPVSGDLADTR 700

Query: 570  KVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAF 629
               LG++P+ L     + +  V A S R  + Y    +   + ++ + + +   F+S   
Sbjct: 701  TRYLGSRPVKLFRIKMQGSEAVLAMSSRTWLSYYHQNRFHLTPLSYETLEYASGFSSEQC 760

Query: 630  PDSLAIAKEGELTIGTIDDIQKLHIR-SIPLGEHPRRICHQEQSRTFAICSLKNQSCAEE 688
             + +       L I  ++ +  +  + + PL   PR       +    I    + +  EE
Sbjct: 761  SEGIVAISTNTLRILALEKLGAVFNQVAFPLQFTPRAFVIHPDTGRMLIAETDHNAYTEE 820

Query: 689  S----------EMHF----------------------------------------VRLLD 698
            +          EM                                          +R LD
Sbjct: 821  TKNERKEQMAEEMRSAAGDEERDLAREMANAFINEVLPEDMFSAPKAGLGLWASQIRCLD 880

Query: 699  DQTFEFISTYPLDTFE--YGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRIL-VFI 755
                + + + PL   E     ++L  S + D   Y  VG A  L      ++G  + V+ 
Sbjct: 881  AMHGQTMFSVPLAQNEAIMAMAMLKFSIAADGRYYLAVGIAKDLQLNPRISQGGCIDVYK 940

Query: 756  VED--GKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSEC-GH 812
            ++     L+ +   E +    +L  F G+LLA   + +++Y     D G +++  +C   
Sbjct: 941  IDPTCSALEFLHRTEIEEIPGALCGFQGRLLAGCGRMLRIY-----DLGKKKMLRKCENK 995

Query: 813  HGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIY 872
            H     + +Q  G  + V D+ +S+  + Y+  E  +   A D +  W++A  +LD D  
Sbjct: 996  HIPYQIVNIQAMGHRVYVSDVQESVFFIRYRRAENQLIIFADDTHPRWVTATTLLDYDT- 1054

Query: 873  LGAENNFNLFTVRKNSEGATDE------------ERGRL-------EVVGEYHLGEFVNR 913
            +   + F   ++++     TD+            +RG L       E +  +H+GE +  
Sbjct: 1055 IAIADKFGNLSIQRLPHSVTDDVDEDPTGTKSLWDRGLLSGASQKSENICSFHVGEIIMS 1114

Query: 914  FRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVG 972
             +  +L+        G    +I+ T++G +G        E Y F + L+ ++R     + 
Sbjct: 1115 LQKATLI-------PGGSEALIYSTLSGTVGAFVPFTSREDYDFFQHLEMHMRNENPPLC 1167

Query: 973  GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            G +H  +RS          KN LDGDL E +L +  ++   I+  M  +  ++CK++E++
Sbjct: 1168 GRDHLSYRS-----SYYPVKNVLDGDLCEQYLSVDASKQKSIAGDMFRTPNQICKKLEDI 1222


>gi|353236335|emb|CCA68332.1| probable splicing factor 3B subunit 3 [Piriformospora indica DSM
            11827]
          Length = 1243

 Score =  198 bits (504), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 258/1188 (21%), Positives = 481/1188 (40%), Gaps = 198/1188 (16%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESS---ELITRAMGDVSDRIGRP 62
            ++G I +L  FR  G  +D+  + ++  +  +L++D + +   +L     G    R   P
Sbjct: 86   VFGSIRSLAAFRLTGGTKDYAIVGSDSGRIVILEYDPKQNAFNKLYQETYGKTGARRIVP 145

Query: 63   TDNGQIGIIDPDCRLIGLHLYDG--LFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL--- 117
               GQ+   DP  R + +   +   L  V+  D    L  +  +   +   +    +   
Sbjct: 146  ---GQMLATDPKGRSLMVSAMEKAKLVYVLNRDAAANLTISSPLEAHKPSAIIHHIVGVD 202

Query: 118  YGCAKPTIVVLYQDNKDA-------------RHVKTYEVALKDKDFVEGPWSQNNLDNGA 164
             G   P    L  D  +A             + +  YE+ L     V   WS+   D  A
Sbjct: 203  VGFENPLYAALEVDYSEADNDPSGQAIRDTEKMLTFYELDLGLNHVVRK-WSEPT-DRRA 260

Query: 165  DLLIPVP--------PPLC------GVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVD 210
            +LL+ VP        PP+       GVL+  E+ ++Y   +  +   +R  I +    ++
Sbjct: 261  NLLVQVPGGQTTTGDPPITRIDGPSGVLVCCEDHVIYRHMDQPQ---LRVPIPRREHPLE 317

Query: 211  ADGSR-----------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIAST 253
               SR                 +LL    G L+ + + H+ E V  LKI+     S A  
Sbjct: 318  DASSRGLLIISAVMHRMKGAFFFLLQSELGDLYKVTLEHQDEDVVALKIKYFDTVSPAVN 377

Query: 254  ISYLDNAVVYIGSSYGDSQL------------------------IKLNLQPDAKGSY--- 286
            +  L +  +++ + +G+                           ++   QP  +  +   
Sbjct: 378  LCILKSGHLFVPAEFGNHAFYQFQKLGDDDKEPEYSSADYPGNGMRTPSQPLPRAYFRPK 437

Query: 287  ----VEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASV 342
                + +L+   +L PI+   V +L         T  G     + R +++G+ + E  + 
Sbjct: 438  PLENLVLLDELESLDPILAARVQNLP--DTSLFYTACGKGAKSTFRTLKHGLEVEENGNS 495

Query: 343  ELQGI-KGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFC 401
            +L  I   +W+L+ +  D +D+++V+SFI+ T +L+    +E+EE    GF S   TL  
Sbjct: 496  DLPSIPNAVWTLKLAETDQYDSYIVLSFINGTLVLSFG--EEIEEIPNSGFLSSEPTLAA 553

Query: 402  HDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYL 461
                 + L+QV    +R V S  R   NEW++P G ++  AT N  QV++A     LVY 
Sbjct: 554  QQLGSDALLQVHPRGIRHVLSDKRV--NEWRAPIGMAIVAATTNKRQVVVALSSAELVYF 611

Query: 462  EIG-DGILTEVKHAQ-LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PD--L 516
            E+  +G L E +  + +   +  L +  + E     +  AVG   D +VRI SL PD  L
Sbjct: 612  ELDYEGQLNEFQERKAMGSTVLALSVGEVPEGLQRFKYLAVGC-EDQTVRIISLDPDSTL 670

Query: 517  NLITKEHLGGEIIPRSVLLCAF--------EGISYLLCALGDGHLLNFLLNMKTGELTDR 568
             +I+ + +     P S+ +              +++   L +G LL  +L+   G+L D 
Sbjct: 671  EMISLQAVTAP--PSSISIADMFDSSIDKHRPTTFVNIGLQNGVLLRTVLDPTNGKLADT 728

Query: 569  KKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAA 628
            +   LG +P+ L       +  V A S R  + Y+    + ++ +    +   C  N+  
Sbjct: 729  RTRFLGNRPVRLVRTQVHGSPGVLALSSRSWLNYTYQGLVHFTPLAYDRLDGACSVNAEL 788

Query: 629  FPDSLAIAKEGELTIGTIDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAE 687
             PD         L I  +  +  KL    +PL   PR+     Q+  F +    +++ +E
Sbjct: 789  CPDGFIGIANNTLRIFQVPKLGSKLKQEILPLSYTPRKFVSHPQNSYFYLIESDHRAMSE 848

Query: 688  -----------------ESEM---------HF----------VRLLDDQTFEFISTYPLD 711
                             + EM         HF          +R++D      ++ + LD
Sbjct: 849  TMIEERVKAIEMSGEKVDREMLELDPRIYGHFKAPEGVWASCIRIIDPVNLRSVAAFSLD 908

Query: 712  TFEYGCSILSCSF-SDDSNVYYCVGTA---YVLPEENEPTKGRILVFIVEDGKLQLIAEK 767
              E   SI    F + +  +   VGTA   ++ P        R+  F      L+L+ + 
Sbjct: 909  NNEAAFSIAVVPFAARNGELLLVVGTAVDTHLAPRSCSTGYLRVYSFTEGGSGLELLHKT 968

Query: 768  ETKGAVYSLNAFNGKLLAAINQKIQLY----KWMLRDDGTRELQSECGHHGHILALYVQT 823
            +      +L AF G+L+A + + ++LY    K +LR    R+  +          + + T
Sbjct: 969  DIDEVPTALMAFQGRLIAGVGKALRLYDIGKKKLLRKAENRQFATAI--------VTLST 1020

Query: 824  RGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFT 883
            +G  I+ GD+ +SI  + YK  E  +   A D +A W++A  +LD +  + A+   N+F 
Sbjct: 1021 QGSRILAGDINQSIYYVAYKAAENRLLIFADDTSARWITASTMLDYNTVVAADKFGNVFV 1080

Query: 884  VRKN---SEGATDEERG---------------RLEVVGEYHLGEFVNRFRHGSLVMRLPD 925
             R +   S+   D+  G               + +++  +H+G+ +   +  +LV     
Sbjct: 1081 NRLDEAVSKQVDDDPTGAGILHEKSVLMGAPHKTKMLTHFHVGDVITSIQRVALV----- 1135

Query: 926  SDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNN 984
               G    V++  ++G IG++  L   +   F+  L+ ++R     + G +H  WR +  
Sbjct: 1136 --AGAREVVVYFGLHGTIGILVPLVTKDDVDFISTLEQHMRTENLSLVGRDHLSWRGYYT 1193

Query: 985  EKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
              K       +DGDL E F  L   +   I+  ++ +V ++ K++E L
Sbjct: 1194 PVKAT-----VDGDLCEYFAKLPTQKQLAIAGELDRTVGDVLKKLESL 1236


>gi|303324325|ref|XP_003072150.1| Splicing factor 3B subunit 3, putative [Coccidioides posadasii C735
            delta SOWgp]
 gi|240111860|gb|EER30005.1| Splicing factor 3B subunit 3, putative [Coccidioides posadasii C735
            delta SOWgp]
          Length = 1209

 Score =  198 bits (503), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 271/1176 (23%), Positives = 481/1176 (40%), Gaps = 185/1176 (15%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
            ++G I +L  FR  G ++D++ I ++  +  ++++   S     R   +   + G R   
Sbjct: 64   VFGIIRSLAAFRLAGSSKDYIIIGSDSGRIAIVEY-VPSQNRFNRIHLETFGKSGIRRVV 122

Query: 65   NGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YG 119
             GQ   +DP  R  LI     + L  V+  + + +L  +  +     Q L         G
Sbjct: 123  PGQYLAVDPKGRACLIASVEKNKLVYVLNRNAQAELTISSPLEAHRPQTLVFALTALDVG 182

Query: 120  CAKPTIVVLYQDNKDARHVKT-------------YEVALKDKDFVEGPWSQNNLDNGADL 166
               P    L  D  ++    T             YE+ L     V   W+ + +D  A +
Sbjct: 183  YENPIFAALEVDYTESDQDPTGAAYQEAEKLLVYYELDLGLNHVVRR-WA-DPVDRSAAM 240

Query: 167  LIPVP-----PPLCGVLIIGEETIVYCSAN--AFKA-IPIRPSITKAYGRVDADGSRYLL 218
            L  VP     P   GVL+  E  I Y  +N  AF+  IP R   T+     + +  RY+ 
Sbjct: 241  LFQVPGGADGP--SGVLVCSEGNITYRHSNQDAFRVPIPRRSGPTE-----NPERKRYIT 293

Query: 219  GDHAGLLHLL---------------------VITHEKEKVTG----LKIELLGETSIAST 253
               AG++H +                     ++  E  ++TG    LK++      +AS+
Sbjct: 294  ---AGVVHKMRRAFFCLLQTEDGDLFKVTMDMVEDENGQLTGEVQRLKLKYFDTVPVASS 350

Query: 254  ISYLDNAVVYIGSSYGDSQLIKLN-----------------------LQP----DAKGSY 286
            +  L N  +++ S  G+    +                         L P          
Sbjct: 351  LCILKNGFLFVASETGNHHFYQFEKLGDDDEETEFSSDDFSADPSEPLAPVYFRPRPAEN 410

Query: 287  VEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG 346
            + ++E   +L P++   + +L      Q  T SG     + R +++G+ ++E    EL  
Sbjct: 411  LNLVESINSLNPLMSCKITNLTEDDAPQFYTLSGTGARSTFRTLKHGLEVSEIVESELPS 470

Query: 347  I-KGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAI 405
            +   +W+ + + +D +D ++++SF + T +L++   + +EE    GF S   TL      
Sbjct: 471  VPSAVWTTKLTRNDQYDAYIILSFSNGTLVLSIG--ETVEEVTDTGFLSSAPTLAVQQLG 528

Query: 406  YNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG- 464
             + L+QV    +R + +  R   NEW +P   S+  A  N  QV +A   G +VY E+  
Sbjct: 529  EDSLIQVHPKGIRHIHADRR--VNEWPAPQHRSIVAAATNERQVAVALSSGEIVYFEMDT 586

Query: 465  DGILTEV-KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKE 522
            DG L E  +  ++   ++CL +  +      S   AVG   D +VRI SL PD  L  K 
Sbjct: 587  DGSLAEYDEKREMSGTVTCLSLGEVPPGRVRSSFLAVGC-DDSTVRILSLDPDSTLENKS 645

Query: 523  HLGGEIIPRSV-LLCAFEGIS-----YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQ 576
                   P ++ ++   +  S     YL   L  G  L  +L+  TGEL+D +   LG +
Sbjct: 646  VQALTSAPSALSIMSMVDSTSGGSTLYLHIGLYSGIYLRTVLDEVTGELSDTRTRFLGLK 705

Query: 577  PITLRTFSSKNTTHVFAASDRPTVIYS--SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLA 634
             + L + S K    V A S RP + YS    K  + + ++   +     F+S    + + 
Sbjct: 706  SVKLFSVSVKEQRAVLALSSRPWLGYSDLQTKNFMLTPLDYVPLEWSWNFSSEQCVEGMV 765

Query: 635  IAKEGELTIGTIDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHF 693
              +   L I +I+ +   L   +IPL   PR      +   F +    N   +  ++   
Sbjct: 766  GIQGQNLRIFSIEKLDNNLLQETIPLAYTPRHFVRHPEQPLFYVIEADNNILSPSTKAKL 825

Query: 694  ----------VRLLDDQTFEF----------------------ISTYPLDTFEYGCSILS 721
                      V  L  + F +                      IS   L+  E   S+ +
Sbjct: 826  LQDSKAANGEVAELPPEDFGYPRGTGHWASCIQVVDPINSKAVISRIELEENEAAVSVAA 885

Query: 722  CSFSD-DSNVYYCVGTA---YVLPEENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVYS 775
              FS  D   +  VGT     V P  +  + G I ++   EDGK L+ I + + +   ++
Sbjct: 886  VPFSSQDDETFLVVGTGKDMVVYPPSS--SCGFIHIYRFQEDGKELEFIHKTKVESPPHA 943

Query: 776  LNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSEC-GHHGHILALYVQTRGDFIVVGDLM 834
            L AF G+LLA I + +++Y     D G ++L  +C       L + +QT+G  I+V D+ 
Sbjct: 944  LLAFQGRLLAGIGRNLRIY-----DLGMKQLLRKCQAEVVPRLIVGLQTQGSRIIVSDVQ 998

Query: 835  KSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGA 891
            +S++ ++YK++E  +   A D  A W +   ++D +   G +   NL+ +R   K SE A
Sbjct: 999  ESVTYVVYKYQENRLIPFADDVIARWTTCTAMVDYETVAGGDKFGNLWLLRCPQKASEEA 1058

Query: 892  TDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGT------------- 938
             ++  G   +    +L    NR    SL++     D   IPT I  T             
Sbjct: 1059 DEDGSGAHLIHERQYLQGAPNRL---SLMVHFYPQD---IPTSIQKTQLVAGGRDILVWT 1112

Query: 939  -VNGVIGVIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLD 996
             + G +G++   +  E   F + L+  L      + G +H  +RS+        AK  +D
Sbjct: 1113 GLQGTVGMLVPFVSREDVDFFQSLEMQLTSQTPPLAGRDHLIYRSY-----YAPAKGTID 1167

Query: 997  GDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            GDL E++  L   +   I+  ++ SV E+ +++ ++
Sbjct: 1168 GDLCETYFTLPNDKKLMIAGELDRSVREIERKISDM 1203


>gi|213405251|ref|XP_002173397.1| U2 snRNP-associated protein Sap130 [Schizosaccharomyces japonicus
            yFS275]
 gi|212001444|gb|EEB07104.1| U2 snRNP-associated protein Sap130 [Schizosaccharomyces japonicus
            yFS275]
          Length = 1166

 Score =  197 bits (502), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 259/1140 (22%), Positives = 483/1140 (42%), Gaps = 145/1140 (12%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT---RAMGDVSD 57
            M+    YG +  +   R  G  +D+L + ++  +F +L++D   ++L++    A G    
Sbjct: 59   MISQNCYGIVRNIAPLRLTGFKKDYLVLTSDSGRFTILEYDIGKNKLVSVYQEAFGKSGI 118

Query: 58   RIGRPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIK 115
            R   P   G+   +D   R  ++     + L  V+  D++  L  +  +   +   +   
Sbjct: 119  RRIVP---GEYLALDAKGRAAMVASTEKNKLVYVLNRDSEANLTISSPLEAHKAGTICFD 175

Query: 116  FL---YGCAKPTIVVLYQDNKDARH-------------VKTYEVALKDKDFVEGPWSQNN 159
             +    G   P    L  +  D  H             +  YE+ L     V+  WS+  
Sbjct: 176  LVGLDTGYENPIFAALEVEYSDLDHDPLGELYKHSEKVLTYYELDLGLNHVVKR-WSKV- 233

Query: 160  LDNGADLLIPVP-----PPLCGVLIIGEETIVY---CSANAFKAIPIR----------PS 201
            +D  A  LI VP     P   GV++I    I Y      + F  IP R          P 
Sbjct: 234  VDRSAYKLIRVPGGNDGP--SGVIVISTGWISYRHLQRQSHFVPIPTRETKATTNTALPI 291

Query: 202  ITKAYGRVDADGSRYLLGDHAG-LLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNA 260
            I  A      D   YLL +  G LL L +   +  +V  L+I+       A+ ++ L + 
Sbjct: 292  IVSAVMHKMRDSFFYLLQNSDGDLLKLTMELDDHSQVKELRIKYFDTIPFAAILNILKSG 351

Query: 261  VVYIGSSYGDSQLIKL------NLQPD---------------AKGSY-------VEVLER 292
            +++ G   G+  L +       + +P+                K +Y       + +L+ 
Sbjct: 352  LLFAGCEGGNHHLYQFESLAIDDDEPEFSSANFSEEQSKHSPKKLTYKLHPLQNISLLDE 411

Query: 293  YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIK-GMW 351
              +L P+ D  V         Q+ T  G +K+ SLR+++ G+   E    EL G    +W
Sbjct: 412  IPSLFPLTDAIVTRTSTDANSQLYTLCGRHKEASLRLLKRGVSATEVVLSELPGAPIAIW 471

Query: 352  SLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQ 411
            +++   +DP+D ++V+SF + T +L++   + +EE    G  S   TL       + +VQ
Sbjct: 472  TVKQKLNDPYDKYMVLSFTNGTLVLSIG--ETVEEVLDSGLLSSVSTLNVRQLGRSSVVQ 529

Query: 412  VTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD--GILT 469
            + S  +R +S+ ++E+  EWK+P    +  +  N  Q++++     L Y E+ D  G L 
Sbjct: 530  IHSKGIRCISA-NKEV-TEWKTPADTVITNSAINEQQIVVSLSNDELAYFEMDDEYGQLN 587

Query: 470  EVKHAQL-EYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEI 528
            E +  +L    ++ L + P+ +    S    +    D +VRI SL     +    +    
Sbjct: 588  EYQERKLLTSPVTALALGPVPQGSKRSNFLCLAS-EDSTVRIVSLDPYTTLENLSVQALS 646

Query: 529  IPRSVL------LCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRT 582
             P S L      +  +E + YL   L +G  L  ++++ +G+L D +   LG +PI L  
Sbjct: 647  APASSLCMVNMEVTGYETL-YLHIGLSNGVYLRTVVDVTSGQLIDTRTRFLGPRPIRLSP 705

Query: 583  FSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELT 642
               +    V A ++R  + YS+ + L  S +    + +   F S   P+ +    +  L 
Sbjct: 706  IIVRGKQSVLAIANRSYLSYSNEQTLQVSPLLYSPLEYADSFASHQCPEGIVGIHQNILK 765

Query: 643  IGTI----DDIQKLHIRSIPLGEHPRRIC-HQEQSRTFAICSLKNQSCAEESEMHFVRLL 697
            I T+    DD+++      PL   P+R+  H E    + + S ++   +    +  + + 
Sbjct: 766  IFTVEATHDDLKQ---DVFPLSCTPKRVIKHPELDILYILQSERHCPNSTNEWITVLSIF 822

Query: 698  DDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPE--ENEPTKGRILVFI 755
            D    + I      TF      L  +F    N  + V ++ +  +  +   + G + V+ 
Sbjct: 823  DMNEKKMIEGL---TFNEASFDLCYAFFRSRNEGFLVCSSAINYDISKQSCSNGVLRVYS 879

Query: 756  VEDGKLQL--IAEKETKGAVYSLNAFNGKLLAAINQKIQLY----KWMLRDDGTRELQSE 809
            + DG L L  I+E ET      L  F+G+L+A +   +++Y    K +LR    R + + 
Sbjct: 880  LNDGGLSLKCISETETDSFARVLKPFHGRLIAGVGPFLRVYDLGNKKLLRKSEVRAVPN- 938

Query: 810  CGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDD 869
                       +QT+G  I+V D   S   +++K E+      A D  A W +A  ++D 
Sbjct: 939  -------FITTIQTQGYRIIVTDAQHSAFFVVFKPEDNRYIVFADDCVARWATATAMVDY 991

Query: 870  DIYLGAENNFNLFTVR--KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSD 927
            D  +G +   NL+ +R  ++     DEE    +++   H   F++   H   +M    + 
Sbjct: 992  DTVVGGDKFSNLWLLRCPESVSQLADEENSGSKLL---HEKPFLHSSPHKLDLM--AHTF 1046

Query: 928  VGQIPT--------------VIFGTVNGVIGVIASLPH-EQYLFLEKLQTNLRKVIKGVG 972
            V  IPT              +++  + G IGV++   + E   F ++L++ LR     + 
Sbjct: 1047 VNDIPTSIQKVQLVEGARDVIMWTGLLGTIGVLSPFVNREDVRFFQQLESLLRAEDLSIV 1106

Query: 973  GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            G +H  +RS+      V  K  +DGDL E + +L R + + I+  ++ +V E+CK++E+L
Sbjct: 1107 GRDHLAYRSY-----YVPVKCVVDGDLCEQYYNLPRDKQESIANELDRTVIEVCKKIEDL 1161


>gi|170041368|ref|XP_001848437.1| splicing factor 3B subunit 3 [Culex quinquefasciatus]
 gi|167864946|gb|EDS28329.1| splicing factor 3B subunit 3 [Culex quinquefasciatus]
          Length = 1215

 Score =  197 bits (502), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 256/1187 (21%), Positives = 473/1187 (39%), Gaps = 185/1187 (15%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            +L   ++G I +L  FR  G  +D+  + ++  +  +L+++A  ++L             
Sbjct: 53   LLQTEVFGVIRSLMSFRLTGGTKDYAVVGSDSGRIVILEYNAAKNQLEKVHQETFGKSGC 112

Query: 61   RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL- 117
            R    GQ   IDP  R  +IG      L  ++  D++ +L  +  +   +   L    + 
Sbjct: 113  RRIVPGQFLAIDPKGRAVMIGAIEKQKLVYILNRDSEARLTISSPLEAHKSSTLTYHMVG 172

Query: 118  --YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNGA 164
               G   P    L  D ++A    T E A K +    F E     N+        L+  A
Sbjct: 173  VDVGFENPMFACLEIDYEEADLDPTGEAAAKTQQTLTFYELDLGLNHVVRKYSEPLEEHA 232

Query: 165  DLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR---- 215
            + LI VP     P   GVLI  E  + Y   N      IR  I +    +D D  R    
Sbjct: 233  NFLISVPGGNDGP--SGVLICSENYLTY--KNLGDQHDIRCPIPRRRNDLD-DPERGMIF 287

Query: 216  -------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVV 262
                         +L+    G +  + +  + + V  +K++       A+ +  L    +
Sbjct: 288  ICSATHRTKSMFFFLVQTEQGDIFKVTLETDDDVVAEIKLKYFDTVPPATAMCVLKTGFL 347

Query: 263  YIGSSYGDSQLIKL-------------NLQPDAKGSYVEVLERYV----------NLGPI 299
            ++   +G+  L ++             +  P  +G       R +          +  PI
Sbjct: 348  FVACDFGNHYLYQIAHLGDDDDEPEFSSAMPLEEGDTFFFAPRPLKNLVMVDEIHSFAPI 407

Query: 300  VDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTD 358
            +   V DL  +   Q+    G     S+R++R+G+ ++E A  EL G    +W+++   D
Sbjct: 408  LGCQVADLANEDTPQLYLACGRGPRSSIRVLRHGLEVSEMAVSELPGNPNAVWTVKKRAD 467

Query: 359  DPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVR 418
            D FD +++VSF++ T +L++   D +EE    GF   T TL C     + LVQV    +R
Sbjct: 468  DEFDAYIIVSFVNATLVLSIG--DTVEEVTDSGFLGTTPTLCCSALGDDALVQVYPDGIR 525

Query: 419  LVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD-GILTE-VKHAQL 476
             + +  R   NEWK+P   ++     N+ QV++A  GG LVY E+   G L E  +  ++
Sbjct: 526  HIRADKR--VNEWKAPGKKTIIKCAVNSRQVVIALSGGELVYFEMDPTGQLNEYTERKKM 583

Query: 477  EYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSV-LL 535
              ++ C+ +  +      +   AVG+  D +VRI SL   + ++   +  + +P +   L
Sbjct: 584  PSDVMCMALGSVPAGEQRAWFLAVGL-ADNTVRIISLDPADCLSPRSM--QALPSAAESL 640

Query: 536  CAFE---------------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITL 580
            C  E               G  YL   L +G LL  +L+  +G+L D +   LG++P+ L
Sbjct: 641  CIVEMGTGESNEEGTVSTAGCFYLNIGLTNGVLLRTVLDPVSGDLADTRTRYLGSRPVKL 700

Query: 581  RTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGE 640
                 + +  V A S R  + Y    +   + ++ + + +   F+S    + +       
Sbjct: 701  FRIKMQGSEAVLAMSSRTWLSYYYQNRFHLTPLSYETLEYASGFSSEQCAEGIVAISTNT 760

Query: 641  LTIGTIDDIQKLHIR-SIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDD 699
            L I  ++ +  +  + + PL   P+R     ++    I    + +  EE++    + + D
Sbjct: 761  LRILALEKLGAVFNQITFPLEYTPKRFLIHNETGKLIISETDHNAYTEETKNIRKKQMAD 820

Query: 700  QTFE----------------FIS-TYPLDTF----------------------------- 713
            +  E                FI+   P D F                             
Sbjct: 821  EMREAAGEDEQELANEMADAFINEVLPEDQFSSPKAGAGMWASQIRVMDPINGHTYSKVQ 880

Query: 714  ----EYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKG---RILVFIVEDGKLQLIAE 766
                E   S+    F+ D   Y   G A  L    + T G    +  +     +L+    
Sbjct: 881  LAQNEAVMSMALVRFTVDQKWYVVAGVAKDLALNPKITNGGFIDVYKYDFHTHQLEHYHR 940

Query: 767  KETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHI--LALYVQTR 824
             E   A  ++ +F G++L  I + +++Y     D G ++L  +C  + HI    + +Q  
Sbjct: 941  TEIDDAPGAICSFQGRVLVGIGKVLRIY-----DLGKKKLLRKC-ENKHIPNQIVNIQAM 994

Query: 825  GDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTV 884
            G  + V D+ +SI  + YK  E  +   A D +  W++   +LD D    A+   N+  +
Sbjct: 995  GSRVFVSDVQESIYCVKYKRAENQLIIFADDTHPRWITTSTLLDYDTVATADKFGNVSVL 1054

Query: 885  RKNSEGATD-----------EERGRL-------EVVGEYHLGEFVNRFRHGSLVMRLPDS 926
            R     + D            +RG L       E V  +HLGE +   +  +L+      
Sbjct: 1055 RLPHSVSDDVDEDPTGNKALWDRGLLNGASQKAENVCTFHLGETIMSLQKATLI------ 1108

Query: 927  DVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNE 985
              G   ++I+ T++G +G +      E + F + L+ ++R     + G +H  +RS+   
Sbjct: 1109 -PGGSESLIYATMSGTVGALVPFTSREDFDFFQHLEMHMRNENPPLCGRDHLSFRSY--- 1164

Query: 986  KKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
                  KN +DGDL E F  +   +   I+  +  +  E+ K++E++
Sbjct: 1165 --YYPVKNVMDGDLCEQFTSMDPAKQKSIATDLGRTPNEVAKKLEDI 1209


>gi|156051126|ref|XP_001591524.1| hypothetical protein SS1G_06970 [Sclerotinia sclerotiorum 1980]
 gi|154704748|gb|EDO04487.1| hypothetical protein SS1G_06970 [Sclerotinia sclerotiorum 1980 UF-70]
          Length = 1147

 Score =  197 bits (502), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 244/1020 (23%), Positives = 426/1020 (41%), Gaps = 144/1020 (14%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
            ++G I  +  FR  G  +D++ I ++  +  ++++    ++     +        R    
Sbjct: 64   VFGIIRAIAAFRLAGSNKDYIIITSDSGRITIVEFIPAQNKFNRLHLETFGKSGVRRVVP 123

Query: 66   GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YGC 120
            GQ   +DP  R  L      + L  V+  +++ +L  +  +   + Q L    +    G 
Sbjct: 124  GQYLAVDPKGRACLTASVEKNKLVYVLNRNSQAELTISSPLEAHKAQTLVFALVALDVGY 183

Query: 121  AKPTIVVLYQDNKDARHVKT-------------YEVALKDKDFVEGPWSQNNLDNGADLL 167
            A P    L  D  ++    T             YE+ L     V   WS + +D  A++L
Sbjct: 184  ANPVFAALEVDYGESDQDPTGQAYEDIEKQLVYYELDLGLNHVVRK-WS-DPVDRTANIL 241

Query: 168  IPVPPPL---CGVLIIGEETIVYCSAN--AFK-AIPIRPSITKAYGRVD--ADGSRYLLG 219
              VP       GVL+ GE+ I Y  +N  AF+ AIP R   T+   R      G  + L 
Sbjct: 242  FQVPGGTDGPSGVLVCGEDNITYRHSNQEAFRVAIPRRRGATEDPQRKRNIVAGVMHKLK 301

Query: 220  DHAGLLHLLVITHEKE------------------KVTGLKIELLGETSIASTISYLDNAV 261
              AG    L+ T + +                  +V  LKI+      +A+++  L +  
Sbjct: 302  GAAGAFFFLLQTDDGDLFKITIEMVEDDNGQPTGEVKRLKIKYFDTVPVAASLCILKSGF 361

Query: 262  VYIGSSYGDSQLIKL---------------NLQPDAKGSYVEV------------LERYV 294
            +++ S +G+ Q  +                +       SY  V            +E   
Sbjct: 362  LFVASEFGNHQFYQFEKLGDDDEEIEFVSDDFPTGTNESYTPVYFHPRPAENLSLVESID 421

Query: 295  NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSL 353
            ++ P++D  V +L  +   Q+ +  G     + R +++G+ ++E    EL G+   +W+ 
Sbjct: 422  SMNPLMDCKVANLTDEDAPQIYSICGTGARSTFRTLKHGLEVSEIVESELPGVPSAVWTT 481

Query: 354  RSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVT 413
            + + +D +D ++++SF + T +L++   + +EE    GF S   TL       + L+QV 
Sbjct: 482  KLTRNDVYDAYIILSFSNGTLVLSIG--ETVEEVTDTGFLSSAPTLAVQQLGEDSLIQVH 539

Query: 414  SGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTEV- 471
               +R + +  R   NEW +P   S+  AT N  QV +A   G +VY E+  DG L E  
Sbjct: 540  PKGIRHIRADRR--VNEWAAPQHRSIVAATTNERQVAVALSSGEIVYFEMDSDGSLAEYD 597

Query: 472  KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIP 530
            +  ++   ++CL +  + E    SQ  AVG   D +VRI SL PD  L  K        P
Sbjct: 598  EKKEMSGTVTCLSLGEVPEGRQRSQFLAVGC-DDSTVRILSLDPDSTLENKSVQALTSPP 656

Query: 531  RSVLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFS 584
             ++ + A    S      YL   L  G  L  +L+  TGEL+D +   LG +P+ L   S
Sbjct: 657  NALSIMAMSDSSSGGSTLYLHIGLYSGVYLRTVLDEVTGELSDTRTRFLGPKPVKLFRVS 716

Query: 585  SKNTTHVFAASDRPTVIYSS--NKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELT 642
             +  T V A S RP + YS    K  + + ++   +     F+S    + +   +   L 
Sbjct: 717  VQGQTAVLALSSRPWLGYSDPVTKGFMLTPLDYPALEWGWNFSSEQCTEGMVGIQGQNLR 776

Query: 643  IGTIDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQT 701
            I +I+ +   L   SIPL   PRR     +   F +    N   +  ++    +LL+D +
Sbjct: 777  IFSIEKLTNNLLQESIPLTYTPRRFVRHPEHPLFYVVEADNNILSPATKQ---KLLEDPS 833

Query: 702  F-----------EFISTYPLDTFEYGCSI-----------LSCSFSDDS----------- 728
                        EF   YP  T  +   I           L+    +D+           
Sbjct: 834  VVNGDATVLPAEEF--GYPRGTNHWASCISVVDPVTEKKVLATIHLEDNESAVSVAVVAF 891

Query: 729  -----NVYYCVGTAY-VLPEENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVYSLNAFN 780
                   +  VGT   ++      + G I V+   EDGK L+ I + + +    +L AF 
Sbjct: 892  ASQEDETFLVVGTGKDLVVSPRSSSAGFIHVYRFHEDGKELEFIHKTKVEEPPMALLAFQ 951

Query: 781  GKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLL 840
            G+LLA + + +++Y   +R    R+ QSE   +   + + +QT+G  I+V D+ +SI+++
Sbjct: 952  GRLLAGVGKDLRIYDLGMR-QLLRKAQSEIAPN---MIVGLQTQGSRIIVSDVQESITMV 1007

Query: 841  IYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDEERG 897
            +YK +E  +     D  A W S   ++D +   G +   NL+ +R   K SE A +E  G
Sbjct: 1008 VYKFQENRLIPFVDDTIARWTSCTTMVDYETVAGGDKFGNLWLLRCPPKASEEADEEGSG 1067


>gi|171685748|ref|XP_001907815.1| hypothetical protein [Podospora anserina S mat+]
 gi|170942835|emb|CAP68488.1| unnamed protein product [Podospora anserina S mat+]
          Length = 1235

 Score =  197 bits (502), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 280/1193 (23%), Positives = 496/1193 (41%), Gaps = 193/1193 (16%)

Query: 6    IYGRIATLELFRPHGEAQ---DFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRP 62
            ++G I  +  FR  G  +   D++ +AT+  +  ++++  +++      +        R 
Sbjct: 64   VFGIIRAIASFRLAGSHKGNLDYIILATDSGRIAIIEYLPKTNRFNRIHLETFGKSGVRR 123

Query: 63   TDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL--- 117
               GQ    DP  R  LI     + L  V+  + + +L  +  +   +  VL +  +   
Sbjct: 124  VVPGQYLAADPKGRACLIASLEKNKLVYVLNRNAQAELTISSPLEAHKPGVLVLSLVALD 183

Query: 118  YGCAKPTIVVLYQDNKDARHVKT-------------YEVALKDKDFVEGPWSQNNLDNGA 164
             G A P    L  D  +A    T             YE+ L     V   WS + +D  +
Sbjct: 184  VGYANPVFAALELDYTEADQDPTGQAGQEPEAQLVYYELDLGLNHVVRK-WS-DAVDPTS 241

Query: 165  DLLIPVP-----PPLCGVLIIGEETIVYCSAN--AFKA-IPIR--PSITKAYGRVDADGS 214
             LL  VP     P   GVL+ GEE + Y  +N  AF+  IP R  P+      R    G 
Sbjct: 242  SLLFQVPGGSDGP--SGVLVCGEENVTYRHSNQEAFRVPIPRRRGPTEDPQRKRTIVSGV 299

Query: 215  RYLLGDHAGLLHLLVITHEKE------------------KVTGLKIELLGETSIASTISY 256
             + L   AG    L+ T + +                  +V  LKI+      IA+++  
Sbjct: 300  MHKLKGSAGAFFFLLQTEDGDLFKVTLDMVEDDNGNPTGEVKRLKIKYFDTIPIATSLCI 359

Query: 257  LDNAVVYIGSSYGDSQLIKLN---------------LQPDAKGSY------------VEV 289
            L +  +++ S +G+  L +                    D++ +Y            + +
Sbjct: 360  LKSGFLFVASEFGNHSLYQFEKLGDDDEELEFSSEEFPTDSRAAYNPVYFQPRPLENLTL 419

Query: 290  LERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKG 349
            +E   ++ P +D  V +L  +   Q+ +  G     S R++++G+ ++E  + EL G   
Sbjct: 420  VESIDSMNPQIDCKVANLTGEDAPQIYSVCGNGARSSFRMLKHGLEVSEIVASELPGTPA 479

Query: 350  -MWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQ 408
             +W+ + +  D +D ++V+SF + T +L++   + +EE    GF +   TL         
Sbjct: 480  AVWTTKLTKYDEYDAYIVLSFTNATLVLSIG--ETVEEVSDSGFLTTVPTLAVQQLGEEG 537

Query: 409  LVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGI 467
            L+QV    +R +        NEW +P   S+  A  N +QV++A   G +VY E+  DG 
Sbjct: 538  LIQVHPKGIRHIVQGRV---NEWPAPQHRSIVAAATNENQVVIALSSGEIVYFEMDADGS 594

Query: 468  LTEV-KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLG 525
            L E  +  ++   ++ L +  + E    S   AVG   D +VRI SL P+  L  K    
Sbjct: 595  LAEYDEKKEMSGTVTSLSLGRVPEGLRRSSFLAVGC-DDCTVRILSLDPESTLEMKSIQA 653

Query: 526  GEIIPRSVLLCAFE----GIS-YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITL 580
                P S+ + + E    G + YL   L  G  L  +L+  TGELTD ++  LG +P  L
Sbjct: 654  LTAAPASLSIMSMEDSFGGTTLYLHIGLHSGVYLRTVLDEVTGELTDTRQKFLGPKPTKL 713

Query: 581  RTFSSKNTTHVFAASDRPTVIYS-----------------------SNKKLLYSNVNLKE 617
               S +    V A + RP + Y+                       S+++ L   V +  
Sbjct: 714  FQVSVQGRPCVLALNSRPWLGYTDPITKGFVMTPLSYVDLEYGWNFSSEQCLEGMVGIHA 773

Query: 618  VSHMCPFNSAAFP-------DSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRIC-HQ 669
                  FN    P       DS AIA   +  +G  + IQK    SIPL   P+R+  H 
Sbjct: 774  NFLRYVFNLFLLPLLYDDKRDSYAIAMRVDYDLGD-NMIQK----SIPLTYTPKRLVKHP 828

Query: 670  EQSRTFAICS--------LKNQSCAEESEMHFVRLLDDQTFEF----------------I 705
            EQ   + I +        L+ Q  A+ +      +L  + F +                +
Sbjct: 829  EQPYFYTIEADNNTLPPELRAQLLAQSNTNGDAAILPPEEFGYPKARNRWASCISIVDPV 888

Query: 706  STYP-------LDTFEYGCSILSCSF-SDDSNVYYCVGTAY-VLPEENEPTKGRILVF-I 755
            S  P       LD  E   S    SF S D   +  VGT   ++    + ++G I V+  
Sbjct: 889  SDEPSVLNRVDLDNNEAAISAAVVSFASQDGESFLIVGTGKDMILSPRQFSEGYIHVYRF 948

Query: 756  VEDGK-LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHG 814
             +DG+ L+ I + + +    +L  F G+LLA I + +++Y   L+    R+ Q+E     
Sbjct: 949  HDDGRDLEFIHKTKIEEPPMALIPFQGRLLAGIGKTLRIYDLGLK-QLLRKAQAEIAPQ- 1006

Query: 815  HILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG 874
              L + +QT+G+ IVVGD+ + I+  +YK E   +   A D    W +   ++D +   G
Sbjct: 1007 --LIVSLQTQGNRIVVGDVQQGITYAVYKPESNKLLAWADDTINRWTTCTAMVDYESVAG 1064

Query: 875  AENNFNLFTVRKNSEGA--TDEERGRLEVVGEYHLGEFVNRFRHGSLVMR------LPDS 926
             +   N++ +R     +  +DE    +++V   H   +++   + + +M       LP S
Sbjct: 1065 GDKFGNVWILRAPERASQESDEPGSEIQLV---HAKSYLHGAPNRTALMAHFYTQDLPTS 1121

Query: 927  ------DVGQIPTVIFGTVNGVIGV-IASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQW 979
                   VG    +++  + G +GV I  +  E   F + L++++R     + G +H  +
Sbjct: 1122 ITKTNLVVGGQDVLLWSGIQGTVGVLIPFVSREDVDFFQSLESHMRAEDPPLAGRDHLIY 1181

Query: 980  RSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            R +      V  K  +DGDL E F  L+  +   I+  ++ SV E+ +++ ++
Sbjct: 1182 RGY-----YVPVKGVIDGDLCERFALLANDKKQMIAGELDRSVREIERKISDI 1229


>gi|164656549|ref|XP_001729402.1| hypothetical protein MGL_3437 [Malassezia globosa CBS 7966]
 gi|159103293|gb|EDP42188.1| hypothetical protein MGL_3437 [Malassezia globosa CBS 7966]
          Length = 1207

 Score =  197 bits (501), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 266/1171 (22%), Positives = 479/1171 (40%), Gaps = 180/1171 (15%)

Query: 7    YGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTDN 65
            +  I +   FR  G ++D+L + ++  +  VL+++   ++   +   +   R G R    
Sbjct: 59   FSNIRSAASFRLMGGSKDYLILGSDAGRIVVLEYNGSLNQF-EKVHEETFGRSGNRRIVP 117

Query: 66   GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNI---RLEELQVLDIKFLYGC 120
            GQ    DP  R  LIG      L  ++  D    L  +  +   R   +    +    G 
Sbjct: 118  GQYLATDPKGRAVLIGAVEKSKLIYILNRDTDAHLTISSPLEAHRHSAIVQCIVGVDVGY 177

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDK------------DFVEGPWSQNNLDNGADLLI 168
              P    L  D  DA    T E   + +            + V   W++  + + A  LI
Sbjct: 178  ENPLFASLETDYADADADPTGEAYARTEQMLTYYELDLGLNHVVRRWTEP-VASSAHHLI 236

Query: 169  PVPPPL----------CGVLIIGEETIVYCSANAFK---AIPIRPSITKAYGRVDADGSR 215
            PVP              GVL+  ++ IVY      +    IP+R    + +G   + GS 
Sbjct: 237  PVPGGYNHATERWDGPSGVLVCMDDYIVYKHPGQPEHRVPIPVR----EQHGFGQSRGSM 292

Query: 216  --------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAV 261
                           L+ +  G L  + + HE E+V  L+I       +A+T+  L +  
Sbjct: 293  IVASVLHKMKTSFFLLVQNEDGDLFKVSVDHEDEQVEALRIRYFDTVPVAATLCILRSGF 352

Query: 262  VYIGSSYGDSQLIKLNL------------------QPDAKGSYVEVLERYV--------- 294
            + + S  G  QL                         DA  S +  L  +          
Sbjct: 353  LLVASETGAQQLYAFQKLGDDDDERFPEYISTDYGSSDAGPSPLPSLPTFCPRPLDNLAL 412

Query: 295  -----NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-K 348
                  L P++D  V +       Q+    G     S + +R+G+ ++E  S +L G+ +
Sbjct: 413  AYELDALDPLLDAKVSNPLHSDVPQIYAACGRGARSSFKRLRHGLEVSEVVSSDLPGVPE 472

Query: 349  GMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQ 408
             +WS +  + D +D ++V+SF++ T +L++   + +EE E  GF +  +TL       + 
Sbjct: 473  AVWSTKLRSSDQYDGYIVLSFVNGTLVLSIG--ETIEEVEDSGFLTTERTLAVQQLGTDA 530

Query: 409  LVQVTSGSVRLVSSTSRELRNEWKSPP-----GYSVNVATANASQVLLATGGGHLVYLEI 463
            L+QV    +R + S  +   NEW +P         +  AT N  QV+LA     +VY E+
Sbjct: 531  LLQVHPRGIRHILSNKQV--NEWTTPSCQDGSPTQIVAATTNERQVVLALDSQEIVYFEL 588

Query: 464  G-DGILTEVKHAQ-LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLIT 520
              DG L E +  + +E +I  L I+   E    +   A+    D +VRI SL PD  L  
Sbjct: 589  DMDGQLNEFQERRDIEADIVALSISACPEGSQRTPYVAIAC-ADQTVRIVSLDPDSTLAP 647

Query: 521  KEHLGGEIIPRSVLLCAF-------EGISYLLC-ALGDGHLLNFLLNMKTGELTDRKKVS 572
                     P S+ +C           ++  +C  L +G  +  +L+  TG+LTD +   
Sbjct: 648  MSLQALTAPPSSISVCEMLDASLDRHHLTMFVCIGLANGVYIRTVLDPSTGQLTDTRTRF 707

Query: 573  LGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDS 632
            LG +P+ L        T + A S R  + Y+ +  L ++ + L+ ++H+  F++   PD 
Sbjct: 708  LGGRPVRLVRTQVHGDTAMMALSTRSWLAYTLHSHLHFTPLMLEALTHVSTFHTELCPDG 767

Query: 633  LAIAKEGELTIGTIDDI-QKLHIRSIPLGEHPRRIC-HQEQSRTFAI----------CSL 680
            L   +   L I T+  +  +L + S+ L   PR++  H +Q+  F +            +
Sbjct: 768  LLGIEGNALRILTVPRLGSELKMDSLRLTYTPRKLAVHPQQASLFYVIESDHRVVPPTDV 827

Query: 681  KNQSCAEESEMHF-------------VRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDD 727
             +Q+      + F             +R++D  T +  + Y L+  E   S+    F+  
Sbjct: 828  VSQARTAWDPVQFGHVRASAGHWASCLRVVDGTTMQTCAEYALEKDEAALSMALVPFAAC 887

Query: 728  SN-VYYCVGTAYVLPEENEPTKGRILVF----IVEDG-KLQLIAEKETKGAVYSLNAFNG 781
             N ++  VG+A  L   + P   R        + ++G  L L+ + E      +L AF+G
Sbjct: 888  GNELFLVVGSA--LGVTHAPLTWRAAFLSTYRLTDNGCGLALVHKTEVDHVPLALRAFHG 945

Query: 782  KLLAAINQKIQLYKWMLRDDGTRELQSECGHH---GHILALYVQTRGDFIVVGDLMKSIS 838
            +LLA     ++++     D GT++L  +C        +++L VQ  G  ++VGD+ +S+ 
Sbjct: 946  RLLAGTGPYVRIF-----DMGTKKLLRKCQSRPFPSKVVSLQVQ--GYRVIVGDMQESVH 998

Query: 839  LLIYKHEEGAIEERARDYNANW-MSAVEILDDDIYLGAENNFNLFTVRKNSEG--ATDE- 894
              +YK     +   A D    W  SA+ +LD D  +  +   N+F +R +S    + DE 
Sbjct: 999  YSVYKPATNTLVAFADDIMPRWTTSALLMLDYDTVMAGDKFGNVFVLRIDSSASLSADED 1058

Query: 895  --------ERG-------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTV 939
                    ER        R +++  YH+G+ +      SLV        G  P V++  V
Sbjct: 1059 PTGLMLQNERSYLMGAAHRAQLLAHYHVGDIITSLSMESLV-------PGGRPVVLYTCV 1111

Query: 940  NGVIGVIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGD 998
            NG IG +   +  E       L+ ++R+    + G +H  +R      K V     +D D
Sbjct: 1112 NGTIGALVPFISREDVRLFTTLEMHMRQENLSLTGRDHLAYRGHYTPVKAV-----VDAD 1166

Query: 999  LIESFLDLSRTRMDEISKTMNVSVEELCKRV 1029
            L E +  L   + + I+  ++ +  ++ K++
Sbjct: 1167 LCELYTALPHEKQESIADELDRTPADIAKKL 1197


>gi|331221690|ref|XP_003323519.1| pre-mRNA-splicing factor RSE1 [Puccinia graminis f. sp. tritici CRL
            75-36-700-3]
 gi|309302509|gb|EFP79100.1| pre-mRNA-splicing factor RSE1 [Puccinia graminis f. sp. tritici CRL
            75-36-700-3]
          Length = 1213

 Score =  197 bits (501), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 270/1181 (22%), Positives = 479/1181 (40%), Gaps = 186/1181 (15%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
            ++G I  L  FR  G  +D++ +A++  +  VL++D  S+  I            R    
Sbjct: 59   VFGTIRALSSFRLTGGTKDYIILASDSGRIVVLEFDPSSNTFIKLHQETYGKSGARRVVP 118

Query: 66   GQIGIIDPDCRLIGLHLYDG--LFKVIPFDNKGQLK-----EAF--NIRLEELQVLDIKF 116
            GQ    DP  R + +   +   L  ++  D+   L      EA   N  +  +  +D+ F
Sbjct: 119  GQYLATDPKGRAVMIAAIEKSKLVYILNRDSAANLTISSPLEAHKSNAIIHHIVGVDVGF 178

Query: 117  LYGCAKPTIVVLYQDNKDARHVKTYEV-ALKDK-----------DFVEGPWSQNNLDNGA 164
                  P    L  D  +A    T E   L +K           + V   WS+   +  A
Sbjct: 179  ----ENPLFAALEVDYGEADQDPTGEAFNLAEKMLTYYELDLGLNHVVRKWSEPT-EPRA 233

Query: 165  DLLIPVP-------------PPLCGVLIIGEETIVYCSANAFK---AIPIRPS------- 201
            ++L+ VP             P   GV++  E+ ++Y   NA +    IP R S       
Sbjct: 234  NMLLQVPGGQSTSHPDKFDGP--SGVIVCCEDLLIYKHQNAKEHRVPIPKRDSPFSDPAQ 291

Query: 202  ---ITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLD 258
               I  A          +L+    G L  + I HE E+V  LKI+      +AS +S L 
Sbjct: 292  GVIIVAAVMHKMRGAFFFLIQSEEGDLFKVTIDHEDEEVQALKIKYFDTVPVASNLSILK 351

Query: 259  NAVVYIGSSYGDSQLIKLNLQPD-------AKGSYVEV----------------LERYV- 294
            +  +++ +  G+  L +     D       +   Y E                 LE  V 
Sbjct: 352  SGFLFVAAETGNHALYQFEKLGDDDEETEFSSADYPEFGASGAALPGAFFKPRGLENLVL 411

Query: 295  -----NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIK- 348
                 +L PI       L      Q V   G     SLR++R+G+ ++E  + EL G   
Sbjct: 412  SDELESLAPITGAKTGHLLNSDSAQTVASCGTGSRSSLRMLRHGLEVSEIVTSELPGPPT 471

Query: 349  GMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQ 408
             +W+ R +  D FD ++++ F++ T +LA+   + + E    G  + + T+       N 
Sbjct: 472  NVWTTRLAETDEFDRYIILGFLNATLVLAIG--ETIVEVADTGLLTNSPTIAIQQLDSNG 529

Query: 409  LVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGI 467
            L+Q+    +R +         EWK PPG  + V+T+N  QV++   GG L+Y E+  DG 
Sbjct: 530  LLQIHPTGIRHIHLDGA--ITEWKVPPGRKIVVSTSNRRQVVIGLSGGELIYFELDLDGQ 587

Query: 468  LTEVK-HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLG 525
            L E +   ++   I+ L ++ + +    +   A+G+  +++V+I SL P+  L T     
Sbjct: 588  LNEYQEQKEMGATITSLSLSEVSKGRQRTPFLAIGL-ENLTVQIISLDPNSVLETISLQA 646

Query: 526  GEIIPRSVLLCAFEGIS--------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQP 577
               +P S+ +      S        ++   L +G LL  +L+   G+LTD +   LG++P
Sbjct: 647  LTAVPTSICIAELLDSSIDKNSETLFVNIGLANGVLLRTVLDSVNGQLTDTRTRFLGSRP 706

Query: 578  ITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAK 637
            + L      N T V   S R  + Y+    L +  +    + ++  F++   P+ L    
Sbjct: 707  VKLLRVKVDNKTSVIGLSSRTWLNYTFQNLLHFDPLIYDAIDNVHSFSAELCPEGLIGIV 766

Query: 638  EGELTIGTIDDIQ-KLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSC-----AEESEM 691
               L I TI  +  K+   ++PL   PR++     S+       ++++      AE   +
Sbjct: 767  GSSLRIFTIPKLGVKVKQDAMPLAYTPRKMLLDPSSKHVITIESEHRTMAPGVKAERLAL 826

Query: 692  HFVR-------LLDDQTF-----------------EFISTYPLDTFEYG-------CSIL 720
            H  +       LLD   F                 + I    L   + G        +I+
Sbjct: 827  HKAQGLQVDESLLDQDAFGVPRAEAGKWASCIRISDPIEKTTLVREDLGDNEAATSLAIV 886

Query: 721  SCSFS---DDSNVYYCVGT---AYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVY 774
            S +F+     +     VG+   A+V P   +    R+  F+ +   ++L+ + E      
Sbjct: 887  SFAFAAHHPATEPLLVVGSAKDAFVQPRTCKNGFLRVYRFVNDGKAIELVHKTEVDEMPS 946

Query: 775  SLNAFNGKLLAAINQKIQLY----KWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVV 830
            +L  F G+L A + + +++Y    K +LR    +   S       I++L VQ  G  I+V
Sbjct: 947  ALVGFQGRLAAGVGKALRIYDLGKKKLLRKVENKSFGSA------IISLSVQ--GSRILV 998

Query: 831  GDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR--KNS 888
            GD   S+S  +YK  E  +   A D    W +   ++D D   G +   NL+  R  KN 
Sbjct: 999  GDSQDSVSYAVYKPAENRLIVFADDVVPRWTTCATMVDYDTVAGGDRFGNLWVSRLPKNV 1058

Query: 889  EGATDE---------ERG-------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIP 932
                DE         E+G       +L+ +  +HL +     +  SLV        G   
Sbjct: 1059 SDEVDEDPTGAGIMHEKGYLMGAPHKLKNLVHFHLNDIPTSIQKTSLVP-------GGRE 1111

Query: 933  TVIFGTVNGVIGVIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDA 991
             +++  V G IG++   +  E   F + L+ ++R  +  + G +H  +R +         
Sbjct: 1112 VLLYTGVQGSIGILVPFISKEDVDFFQTLEMHMRNEMPSLVGRDHLAYRGY-----YFPV 1166

Query: 992  KNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            KN +DGDL ESF  L   +  +++  ++ SV ++ K++E +
Sbjct: 1167 KNCVDGDLCESFALLPSAKQLQVASELDRSVSDVLKKIEAV 1207


>gi|325189950|emb|CCA24429.1| splicing factor putative [Albugo laibachii Nc14]
          Length = 1644

 Score =  197 bits (501), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 263/1178 (22%), Positives = 462/1178 (39%), Gaps = 175/1178 (14%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            ++   ++G I ++  FR  G   D+L + ++  K  V+Q D  S+   T    +   + G
Sbjct: 489  IITTSVFGVIRSIIPFRLTGGTHDYLVVGSDSGKIVVIQVDP-STNTFTACQNETYGKTG 547

Query: 61   -RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLK-----EAFNIRLEELQVL 112
             R    GQ    DP  R  +IG      L  V+  D   +L      EA         V+
Sbjct: 548  CRRITPGQYLASDPKGRALMIGAVEKQKLVYVMNRDASNRLTISSPLEAHRSHTVHFDVV 607

Query: 113  DIKFLYGCAKPTIVVLYQDNKDARHVKTYEVALKDK------------DFVEGPWSQNNL 160
             +    G   P    L  ++ +A    T + A   +            + V   WSQ  +
Sbjct: 608  GVDV--GFENPIFASLELNHAEADIDPTGQAAHTSEKVLVYYELDLGLNHVTRRWSQV-V 664

Query: 161  DNGADLLIPVP-----PPLCGVLIIGEETIVYCSANAFK---AIPIRPS----------- 201
               A+ L+ VP     P   GVL+I +  I Y + N      + P+R +           
Sbjct: 665  PRSANKLVAVPGGNDGP--GGVLVIAQGLIQYQNENHPPLSCSFPLRSTGGPNPVQDERK 722

Query: 202  ------ITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTIS 255
                  I         D    L+    G L  + + +    V  L+I+      +A  + 
Sbjct: 723  QGYPMMIVSTATHKQRDLFFVLMQSEWGDLFKISLEYAGSSVQKLRIQYFDTIPVALALC 782

Query: 256  YLDNAVVYIGSSYGDSQLIKL------------------NLQPDAKGSYVEVLERYV--- 294
                 +++  S + +  L +                   + +P+     V  L+      
Sbjct: 783  ITKTGLLFAASEFSNHYLFQFLSIGEDDDAAQCVSAAENDQEPEIPSFSVRKLKNLAMIS 842

Query: 295  ---NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGM 350
               ++ PI    V D   +   Q+    G     SLRI+R+G+ + E A+  L G+ K +
Sbjct: 843  NIPSISPITQLLVDDFANEQTPQLYALCGQGNRSSLRILRHGLPVMEMAASALPGVAKAV 902

Query: 351  WSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLV 410
            W L+ S  D  D ++VVSF  E   L + + D +EE    G      +L       N ++
Sbjct: 903  WCLKESFTDTCDKYIVVSF--EDATLVLEIGDTVEEITDSGLLRDHSSLLVALLEDNSML 960

Query: 411  QVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTE 470
            QV     R +  +  +   EWK+P    +    ANA Q++++  GG ++Y E+G   L E
Sbjct: 961  QVHQNGFRHIRKS--QPVTEWKAPGKKIIERCAANARQLVVSLAGGEVIYFELGVNELME 1018

Query: 471  VKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEII 529
                 L +EI+ LD+  + +        AVG W D SVR+ SL P+     K  L     
Sbjct: 1019 KGKLDLGFEITSLDVGQVPQGRQRFPFLAVGSW-DNSVRVLSLDPNDLFRQKSTLALTSH 1077

Query: 530  PRSVLLCAFEGIS-----------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPI 578
            P ++     +  S           +L   L +G      ++  T  L D     LGT+P+
Sbjct: 1078 PHTLCFSNLQTESAAGSDHTTQCLFLNIGLHNGVFQQTRMDSITAALADSSARFLGTKPV 1137

Query: 579  TLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKE 638
             L   + +    + A S R  + Y    +     ++ + + +   F S   P+ LA    
Sbjct: 1138 KLFCVTIQGKRAILALSSRAWLSYFYQTRRQLMPLSTEMLHYASSFQSEQCPEGLAALTS 1197

Query: 639  GELTIGTIDDI------QKLHIRSIP--LGEHP--RRICHQEQ----------------- 671
              + I T+D +      QK  +R  P  +  HP  RR+   E                  
Sbjct: 1198 EGMKILTLDHLGDTFNQQKCFLRYTPRRMVLHPPSRRLIIVESDYGEANAASKQLFGESL 1257

Query: 672  SRTFAICSLKNQSCAEESEMHF---------------VRLLDDQTFEFISTYPLDTFEYG 716
            S T  I S + +   +   + +               VR+ D  T E I+ + L+  E+ 
Sbjct: 1258 SDTPVIASEEPEEDEDRQALLYPRVERSHEPGKWASCVRIFDPITCETIACHELEDNEHT 1317

Query: 717  CSILSCSFSD-DSNVYYCVGTA--YVLPEENEPTKGRILVF-IVEDGKLQLIAEKETKGA 772
             SI +C F D    V+  +G+     L   + P  G + V+ +VE  +L L+      G 
Sbjct: 1318 RSITTCVFHDRGGEVFVIIGSVKNLRLHPISSPEGGLLRVYRVVEGSQLVLVHTTPVDGI 1377

Query: 773  VYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY-VQTRGDFIVVG 831
             Y++  F G+LL ++ + +++Y     D G R+L  +C +      +  +++ GD I   
Sbjct: 1378 PYAMIEFQGRLLVSVGKVLRIY-----DLGKRKLLRKCENRYFTSPMIDLKSAGDRIYAS 1432

Query: 832  DLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSE-- 889
            D+ +SI  + YK E+  +   A D   ++M++  +LD D   G +   N+F  R  +E  
Sbjct: 1433 DVHESIHFVKYKAEDNQLITFADDCVPHFMTSSTLLDYDTIAGGDKFGNVFVTRLPAEVS 1492

Query: 890  ---------------GATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTV 934
                           G  +    +LE + ++H+GE +      SLV        G +  +
Sbjct: 1493 DEIDNPTGNRMLWDTGLLNGAPHKLEQIAQFHVGEVITSVLRTSLV-------PGGMEVI 1545

Query: 935  IFGTVNGVIGVIASLPHEQYL-FLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKN 993
            ++ T+ G IG +        + F   L+  +R+    + G +H  +RS+      +  KN
Sbjct: 1546 LYTTILGRIGALVPFTSRDDVDFYTHLEMYMRQEKAPLCGRDHLSYRSY-----FIPVKN 1600

Query: 994  FLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEE 1031
              DGDL E F  L   +   I++ ++ +  E+ K++++
Sbjct: 1601 VTDGDLCEQFSSLGPDKQKNIAEDLDRTPTEVVKKLQD 1638


>gi|331238007|ref|XP_003331659.1| pre-mRNA-splicing factor RSE1 [Puccinia graminis f. sp. tritici CRL
            75-36-700-3]
 gi|309310649|gb|EFP87240.1| pre-mRNA-splicing factor RSE1 [Puccinia graminis f. sp. tritici CRL
            75-36-700-3]
          Length = 1213

 Score =  197 bits (500), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 274/1183 (23%), Positives = 480/1183 (40%), Gaps = 190/1183 (16%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
            ++G I  L  FR  G  +D++ +A++  +  VL++D  S+  I            R    
Sbjct: 59   VFGTIRALSSFRLTGGTKDYIILASDSGRIVVLEFDPSSNTFIKLHQETYGKSGARRVVP 118

Query: 66   GQIGIIDPDCRLIGLHLYDG--LFKVIPFDNKGQLK-----EAF--NIRLEELQVLDIKF 116
            GQ    DP  R + +   +   L  ++  D+   L      EA   N  +  +  +D+ F
Sbjct: 119  GQYLATDPKGRAVMIAAIEKSKLVYILNRDSAANLTISSPLEAHKSNAIIHHIVGVDVGF 178

Query: 117  LYGCAKPTIVVLYQDNKDARHVKT-------------YEVALKDKDFVEGPWSQNNLDNG 163
                  P    L  D  +A    T             YE+ L     V   WS+   +  
Sbjct: 179  ----ENPLFAALEVDYGEADQDPTGEAFNLAEKMLTYYELDLGLNHVVR-KWSEPT-EPR 232

Query: 164  ADLLIPVP-------------PPLCGVLIIGEETIVYCSANAFK---AIPIRPS------ 201
            A++L+ VP             P   GV++  E+ ++Y   NA +    IP R S      
Sbjct: 233  ANMLLQVPGGQSTSHPDKFDGP--SGVIVCCEDLLIYKHQNAKEHRVPIPKRDSPFSDPA 290

Query: 202  ----ITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYL 257
                I  A          +L+    G L  + I HE E+V  LKI+      +AS +S L
Sbjct: 291  QGVIIVAAVMHKMRGAFFFLIQSEEGDLFKVTIDHEDEEVQALKIKYFDTVPVASNLSIL 350

Query: 258  DNAVVYIGSSYGDSQLIKLNLQPD-------AKGSYVEV----------------LERYV 294
             +  +++ +  G+  L +     D       +   Y E                 LE  V
Sbjct: 351  KSGFLFVAAETGNHALYQFEKLGDDDEETEFSSADYPEFGASGAALPGAFFKPRGLENLV 410

Query: 295  ------NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIK 348
                  +L PI       L      Q V   G     SLR++R+G+ ++E  + EL G  
Sbjct: 411  LSDELESLAPITGAKTGHLLNSDSAQTVASCGTGSRSSLRMLRHGLEVSEIVTSELPGPP 470

Query: 349  -GMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYN 407
              +W+ R +  D FD ++++ F++ T +LA+   + + E    G  + + T+       N
Sbjct: 471  TNVWTTRLAETDEFDRYIILGFLNATLVLAIG--ETIVEVADTGLLTNSPTIAIQQLDSN 528

Query: 408  QLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DG 466
             L+Q+    +R +         EWK PPG  + V+T+N  QV++   GG L+Y E+  DG
Sbjct: 529  GLLQIHPTGIRHIHLDGA--ITEWKVPPGRKIVVSTSNRRQVVIGLSGGELIYFELDLDG 586

Query: 467  ILTEVK-HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHL 524
             L E +   ++   I+ L ++ + +    +   A+G+  +++V+I SL P+  L T    
Sbjct: 587  QLNEYQEQKEMGATITSLSLSEVSKGRQRTPFLAIGL-ENLTVQIISLDPNSVLETISLQ 645

Query: 525  GGEIIPRSVLLCAFEGIS--------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQ 576
                +P S+ +      S        ++   L +G LL  +L+   G+LTD +   LG++
Sbjct: 646  ALTAVPTSICIAELLDSSIDKNSETLFVNIGLANGVLLRTVLDSVNGQLTDTRTRFLGSR 705

Query: 577  PITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIA 636
            P+ L      N T V   S R  + Y+    L +  +    + ++  F++   P+ L   
Sbjct: 706  PVKLLRVKVDNKTSVIGLSSRTWLNYTFQNLLHFDPLIYDAIDNVHSFSAELCPEGLIGI 765

Query: 637  KEGELTIGTIDDIQ-KLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSC-----AEESE 690
                L I TI  +  K+   ++PL   PR++     S+       ++++      AE   
Sbjct: 766  VGSSLRIFTIPKLGVKVKQDAMPLAYTPRKMLLDPSSKHVITIESEHRTMAPGVKAERLA 825

Query: 691  MHFVR-------LLDDQTF-----------------EFISTYPLDTFEYG-------CSI 719
            +H  +       LLD   F                 + I    L   + G        +I
Sbjct: 826  LHKAQGLQVDESLLDQDAFGVPRAEAGKWASCIRISDPIEKTTLVREDLGDNEAATSLAI 885

Query: 720  LSCSFS---DDSNVYYCVGT---AYVLPEENEPTKGRILVFIVEDGK-LQLIAEKETKGA 772
            +S +F+     +     VG+   A+V P   +    R+  F V DGK ++L+ + E    
Sbjct: 886  VSFAFAAHHPATEPLLVVGSAKDAFVQPRTCKNGFLRVYRF-VNDGKVIELVHKTEVDEM 944

Query: 773  VYSLNAFNGKLLAAINQKIQLY----KWMLRDDGTRELQSECGHHGHILALYVQTRGDFI 828
              +L  F G+L A + + +++Y    K +LR    +   S       I++L VQ  G  I
Sbjct: 945  PSALVGFQGRLAAGVGKALRIYDLGKKKLLRKVENKSFGSA------IISLSVQ--GSRI 996

Query: 829  VVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR--K 886
            +VGD   S+S  +YK  E  +   A D    W +   ++D D   G +   NL+  R  K
Sbjct: 997  LVGDSQDSVSYAVYKPAENRLIVFADDVVPRWTTCATMVDYDTVAGGDRFGNLWVSRLPK 1056

Query: 887  NSEGATDE---------ERG-------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQ 930
            N     DE         E+G       +L+ +  +HL +     +  SLV        G 
Sbjct: 1057 NVSDEVDEDPTGAGIMHEKGYLMGAPHKLKNLVHFHLNDIPTSIQKTSLVP-------GG 1109

Query: 931  IPTVIFGTVNGVIGVIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTV 989
               +++  V G IG++   +  E   F + L+ ++R  +  + G +H  +R +       
Sbjct: 1110 REVLLYTGVQGSIGILVPFISKEDVDFFQTLEMHMRNEMPSLVGRDHLAYRGY-----YF 1164

Query: 990  DAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
              KN +DGDL ESF  L   +  +++  ++ SV ++ K++E +
Sbjct: 1165 PVKNCVDGDLCESFALLPSAKQLQVASELDRSVSDVLKKIEAV 1207


>gi|393245024|gb|EJD52535.1| hypothetical protein AURDEDRAFT_111199 [Auricularia delicata
            TFB-10046 SS5]
          Length = 1214

 Score =  197 bits (500), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 273/1171 (23%), Positives = 476/1171 (40%), Gaps = 169/1171 (14%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSE---LITRAMGDVSDRIGRP 62
            ++G + +L  FR  G  +D+  + ++  +  +L +D +++E   L T   G       R 
Sbjct: 58   VFGAVRSLVAFRLTGGTKDYAIVGSDSGRIVILDYDPKANEFKRLYTETYGKSG---ARR 114

Query: 63   TDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL----- 117
               GQ   +DP  R + +   +    V   +         +  LE  +   I F      
Sbjct: 115  IVAGQYLAVDPKGRAVMIAAMEKSKLVYILNRDAATNLTISSPLEAHRNQSILFHITGVD 174

Query: 118  YGCAKPTIVVLYQDNKDARHVKT-------------YEVALKDKDFVEGPWSQNNLDNGA 164
             G   P    L  D  ++    T             YE+ L     V   WS+   D  A
Sbjct: 175  VGFENPLFAALEIDYGESDQDPTGEAFHNAEKMLTFYELDLGLNHVVRK-WSEPT-DPRA 232

Query: 165  DLLIPVPPPLC-----------GVLIIGEETIVYCSANAFK---AIPIRPS--------- 201
            + L+ VP               GVL+  E  I+Y   +  +    IP R +         
Sbjct: 233  NFLVQVPGGTSASSPDKFDGPSGVLVCCENHIIYRHMDHPQHRVPIPRRRAPLDSEERGL 292

Query: 202  -ITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNA 260
             I  A          +LL    G L+ + I H+ ++VT LKI+      ++S++  L + 
Sbjct: 293  IIVAAVMHKMKGAFFFLLQSEEGDLYKVTIDHQDDEVTSLKIKYFDTVPVSSSLCILKSG 352

Query: 261  VVYIGSSYGDSQLIKLNLQPD-------AKGSYVE------------------VLERYV- 294
             +++ S +G   L +     D       +  SY +                  +LE  + 
Sbjct: 353  FLFVASDFGPHHLYQFQKLGDDDAETEFSSSSYPKNGMADPDEALPSAYFQPRLLENLLL 412

Query: 295  -----NLGPIVDFCVVDLER--QGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVEL-QG 346
                 ++ PI+D  V++L        Q     G     S R++R+G+ + E  S +L  G
Sbjct: 413  CDSLDSINPIIDAKVLNLMPTISDTPQFYVACGRGARSSFRLLRHGLEVEENISSDLPSG 472

Query: 347  I-KGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAI 405
            I   +W+ +   DDPFDT++V+SF++ T  L + + + +EE +  GF S   TL      
Sbjct: 473  IPNAVWTTKIRADDPFDTYIVLSFVNGT--LVLTIGETIEECQETGFLSAEPTLGVQQIG 530

Query: 406  YNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG- 464
             + L+QV    +R + +  R   NEWK P G ++  A  N+ QV++A     LVY E+  
Sbjct: 531  DDALLQVYPHGIRHILADKRV--NEWKVPSGKTIVQAATNSRQVVVALNSAELVYFELDL 588

Query: 465  DGILTEVKHAQ-LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEH 523
            DG L E +  + +   +  L +  + E    +   AVG   D +VRI SL   N +    
Sbjct: 589  DGQLNEYQDRKAMGSVVLALSMAEVPEGCQRTPYLAVGC-EDQTVRIISLDPDNTLETIS 647

Query: 524  LGGEIIPRSVLL------CAFEGIS---YLLCALGDGHLLNFLLNMKTGELTDRKKVSLG 574
            L     P S +        A   IS   ++   L +G  L  +L+  TGELTD +   LG
Sbjct: 648  LQALTAPPSAICIVAMIDAAVNKISETLFVNIGLTNGLFLRTVLDPVTGELTDTRTRFLG 707

Query: 575  TQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLA 634
            T+PI L   +      + A S R  + Y+  + L  + +    + ++  F +   PD   
Sbjct: 708  TRPIRLNRVTIAGKPALLALSSRMWLNYAHQEALHLTPLIFDPLDYVSGFTADLCPDGFI 767

Query: 635  IAKEGELTIGTIDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMH- 692
              K   + I  I  I Q+L    +PL   PR+I     +  F I    +++ ++E+ +H 
Sbjct: 768  GLKGSTIKIFQITKIGQRLKHEVMPLSYTPRKIAVHPMNGLFYIAESDHRTHSQEA-VHG 826

Query: 693  ----------------------------------FVRLLDDQTFEFISTYPLDTFEYGCS 718
                                               VR+++      I    LD  E   S
Sbjct: 827  ALQQPDLKKYDRDVLELPPEVFGHPRAPAGIWASCVRIVNPVELTTIYRLDLDNNEAAFS 886

Query: 719  ILSCSFS-DDSNVYYCVGT---AYVLPEENEPTKGRILVF-IVEDGK-LQLIAEKETKGA 772
            I    FS  D+ +   VGT   A++ P     T G +  + +++DG+ L+L+ + E    
Sbjct: 887  IALVPFSARDNELTLVVGTAKDAFLAPRSC--TSGYLRTYRVLDDGRGLELLHKTECDDI 944

Query: 773  VYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY-VQTRGDFIVVG 831
              SL AF GKL+A I + +++Y     D G ++L  +        A+  + T+G  I+ G
Sbjct: 945  PLSLLAFQGKLVAGIGKCLRIY-----DMGKKKLLRKAESKLFTTAITSLSTQGARIIAG 999

Query: 832  DLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKN---S 888
            D  +SI   +YK  E  +   A D    W ++  +LD +  +  +   N+F  R +   S
Sbjct: 1000 DAQQSIFFCVYKAAENRLLVFADDSQQRWTTSQLMLDYNTVVAGDKFGNVFVNRLSDHVS 1059

Query: 889  EGATDEERG------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV--GQIPTVIFGTVN 940
                D+  G      +   +G  H    +  F  G ++M L  + +  G    +++  ++
Sbjct: 1060 NLVDDDPTGAGLLHEKGMFMGAPHKTSMLCHFHVGDIIMSLQRTSLVPGGREVIVYFGLH 1119

Query: 941  GVIGVIASLPHEQYL-FLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDL 999
            G IG++     ++ + F   L+ ++R     +   +   WR +      V  K  +DGDL
Sbjct: 1120 GTIGMLVPFASKEDVDFFTTLEQHMRSENLSLVSRDILAWRGY-----YVPVKAVVDGDL 1174

Query: 1000 IESFLDLSRTRMDEISKTMNVSVEELCKRVE 1030
             E F  L + +   ++  ++ +V E+ K++E
Sbjct: 1175 CEYFAKLPQGKQGGLAGELDRTVGEVLKKLE 1205


>gi|429329284|gb|AFZ81043.1| CPSF A subunit region domain-containing protein [Babesia equi]
          Length = 1175

 Score =  197 bits (500), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 207/902 (22%), Positives = 375/902 (41%), Gaps = 119/902 (13%)

Query: 217  LLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKL 276
            L+    G ++ + +T+E+  V  +         ++ ++  L +  +++ S +G+  L + 
Sbjct: 302  LIQSEFGDVYKVELTYEEGVVKEIVCRYFDTVPVSVSLCILRSGYLFVASEFGNHHLYQF 361

Query: 277  -------------NLQPDAKGSYV----------EVLERYVNLGPIVDFCVVDLERQGQG 313
                         +L P  + + +          ++++   +L  I D  V D++  GQ 
Sbjct: 362  TGLGTDERDPLCTSLHPHGRSAIIAFKPRALQNLQLVDELSSLSAITDMKVADIQGLGQH 421

Query: 314  QVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTDDPFDTFLVVSFISE 372
            Q+    G     SLR++R GI I   AS EL G  K +W++RSS +  +D F++V F  E
Sbjct: 422  QIFLGCGRGSRSSLRVLRYGIAIEGLASSELPGRPKSVWTVRSSFESAYDGFIIVGF--E 479

Query: 373  TRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWK 432
               L +++ + +EE     F +   TL          +QV    +R V     +   EW+
Sbjct: 480  GNTLVLSVGEAVEEVTDSCFLTSITTLHVALMGDGSFIQVHDAGIRHVYD---QRVKEWR 536

Query: 433  SPPGYSVNVATANASQVLLATGGGHLVYLEIGD-GILTEVKHAQLEYEISCLDINPIGEN 491
            +P    V VA +N  QV+L   GG ++Y EI D G L E     L  EISCLD+ P  + 
Sbjct: 537  APSSKRVKVAASNDRQVILGLSGGDVIYFEIDDSGNLVEYAKKSLSVEISCLDLQPTPKG 596

Query: 492  PSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPRSVLLCAF----EGISYLLC 546
               +   A+G   D SVR+ +L   L +++ + L     P SV +  F      + YL  
Sbjct: 597  RILANFMAIGTL-DNSVRVLTLDKSLKVVSTQILSNNSTPESVCISEFAVGDSSLVYLHV 655

Query: 547  ALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNK 606
             L  G +L   ++  +G L+D++   LG + +  R  S  ++  + A SD+P +IY+   
Sbjct: 656  GLNTGVMLRSTVDPISGALSDQESRFLGGRAVKFRRVSLGSSFAIVALSDKPWLIYTHRG 715

Query: 607  KLLYSNVNLKE-------VSHMCPFNSAAF---------------------------PDS 632
             LL S +N+         +S +CP    A                            P  
Sbjct: 716  ILLVSPINVGTLESADSLISPICPDGFVAVSGNTLRIFRCTSLGETFAESQLPLTYTPRK 775

Query: 633  LAIAKEGELTIGTIDDI--------------QKLHIRSIPLG-EHPRRICHQEQSRTFAI 677
            L +      ++G+++ +              Q L I+++  G E P   C       F  
Sbjct: 776  LVLMPSEAPSVGSLNYMLAIVESDHARYNEDQSLEIKNVHAGIELPSDYCESLDYTDFKA 835

Query: 678  CSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTA 737
               K  SC        +R+++  T E ++    +T E   ++ +     D      VGTA
Sbjct: 836  EPGKWGSC--------LRIVNPLTLETVAKLLFETDE--AAMCAQVVVLDGIQCLVVGTA 885

Query: 738  YVLPEENEP--TKGRILVFIV-EDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLY 794
              +  + +P    G + V+    + +++L+      G   +L  + GKL+ A+  +++LY
Sbjct: 886  IGMNLKGDPDSVSGYLRVYAYGANYEIRLLHATPITGVPRALAGYEGKLICALGSRLRLY 945

Query: 795  KWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLL---IYKHEEGAIEE 851
                R    +     C  HG I   ++   G  I  GD+ +   LL    Y  +    E 
Sbjct: 946  ALGKRQLLLKAEHRTCTDHGFI---WISVCGSRIFAGDIREGFQLLRLRFYAEDAAEFEW 1002

Query: 852  RARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFV 911
                    W+S  E LD    +G +   +LF  R   E  T  +  + E   ++HLG+  
Sbjct: 1003 IGHSTGPRWLSCCEQLDYHTVIGGDKFDSLFIARVPQEEFT--KATQFENHAQFHLGDL- 1059

Query: 912  NRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYL-FLEKLQTNLRKVIKG 970
                  + + ++  +++ Q P VI+ T+ G IG      ++  L  ++ L+  +      
Sbjct: 1060 -----PTAISKVSFNNMSQ-PIVIYSTILGSIGAFIPYANKDELDLMQHLEMIMANEHPP 1113

Query: 971  VGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVE 1030
            + G  H  +RS+         +N +DGDL E    L      +I+  ++ +V  L ++V+
Sbjct: 1114 LCGREHAFFRSY-----YYPVQNIVDGDLCEQVKTLPEAVQRKIATQLDTNVHTLLRKVD 1168

Query: 1031 EL 1032
            ++
Sbjct: 1169 DV 1170


>gi|378730762|gb|EHY57221.1| pre-mRNA-splicing factor rse1 [Exophiala dermatitidis NIH/UT8656]
          Length = 1210

 Score =  196 bits (499), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 267/1175 (22%), Positives = 484/1175 (41%), Gaps = 184/1175 (15%)

Query: 7    YGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTDN 65
            +G I +L  FR  G ++D+L + ++  +  +L++  + +    R   +   + G R    
Sbjct: 65   FGIIRSLAAFRLAGSSKDYLIVGSDSGRITILEYVKDQNRF-NRVHLETFGKSGIRRVIP 123

Query: 66   GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLD---IKFLYGC 120
            GQ   +DP  R  +I     + L  V+  + + +L  +  +   + Q L    +    G 
Sbjct: 124  GQYLAVDPKGRACIIASVEKNKLVYVLNRNAQAELTISSPLEAHKPQTLVFDCVALDVGY 183

Query: 121  AKPTIVVLYQDNKDARHVKT-------------YEVALKDKDFVEGPWSQNNLDNGADLL 167
              P    L  D  +     T             YE+ L     V   W++  +   A  L
Sbjct: 184  ENPIFAALEVDYTEVDQDPTGQALQEVEKLLVYYELDLGLNHVVRQ-WAEP-VARDATKL 241

Query: 168  IPVP-----PPLCGVLIIGEETIVYCSAN--AFKA-IPIRPSITKAYGRVDADGSRYLLG 219
              +P     P   GVL+  E+ I Y  +N  AF+  IP R  + +     + +  R+++ 
Sbjct: 242  FQIPGGSDGP--SGVLVCAEDNITYRHSNQEAFRVPIPRRRGVLE-----NPERKRHIV- 293

Query: 220  DHAGLLH-------LLVITHEKE------------------KVTGLKIELLGETSIASTI 254
              AG++H       LL+ T + +                  +V  LKI+      +A+++
Sbjct: 294  --AGVMHKMKGQFFLLLQTEDGDLFKVTIDMVEDDNGQLTGEVRALKIKYFDTVPVANSL 351

Query: 255  SYLDNAVVYIGSSYGDSQLIKL---------------NLQPDAKGSY------------V 287
              L +  +++ S  G+    +                +   D   +Y            +
Sbjct: 352  HILKSGFLFVASESGNHHFYQFEKLGDDDEETEFTSDDYPADPTAAYTPAFFHVRPLSNL 411

Query: 288  EVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVEL-QG 346
             +++      P++D  V +L  +   Q+ +  GA    S + +++G+ ++E    EL   
Sbjct: 412  NLVQSVDAANPLLDCKVANLLDEDAPQIYSICGAGARSSFKTLKHGLSVSEIVESELPDK 471

Query: 347  IKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIY 406
             + +W+ + + DD +D ++++SF   T  L +++ + +EE    GF S   TL       
Sbjct: 472  PEAVWTTKLTRDDQYDAYIILSF--RTGTLVLSIGETVEEVTDTGFLSTAPTLAVQQLGE 529

Query: 407  NQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-D 465
            + LVQV    +R + +  R   NEW +P   S+  AT N  QV +A   G +VY E+  D
Sbjct: 530  DALVQVHPKGIRHIRADKR--VNEWPAPQHRSIVAATTNERQVAVALSSGEIVYFEMDTD 587

Query: 466  GILTEV-KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEH 523
            G L E  +  ++   ++CL +  + E    S   AVG   D +VRI SL PD  L  K  
Sbjct: 588  GSLAEYDERREMTGTVTCLSLGDVPEGRVRSSFLAVGC-DDSTVRILSLDPDSTLENKSV 646

Query: 524  LGGEIIPRSVLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQP 577
                  P ++ + A    +      YL   L  G  L  +L+  TGEL+D +   LG +P
Sbjct: 647  QALTSAPSALSIMAMADSTSGGTTMYLHIGLYSGVYLRTVLDEITGELSDTRTRFLGLRP 706

Query: 578  ITLRTFSSKNTTHVFAASDRPTVIYSSNKK--LLYSNVNLKEVSHMCPFNSAAFPDSLAI 635
              L   S K    V A S RP + Y+  +    + + ++   + ++  F S   P+ +  
Sbjct: 707  AKLFRVSVKGQNAVMALSSRPWLGYTDTQTNGFMLTPLDYVPLQYVWNFTSEQCPEGMVG 766

Query: 636  AKEGELTIGTIDDIQK-LHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFV 694
             +   L I +I+D+ + L   +IPL   PR+         F +    N   A  +    +
Sbjct: 767  IQGQNLRIFSIEDLSRNLLQENIPLPYTPRKFVKHPDQPLFYVIEADNNVLAPATRQKLL 826

Query: 695  R----------LLDDQTFEF----------------------ISTYPLDTFEYGCSILSC 722
                       +L  + F +                      + T  L+  E   SI + 
Sbjct: 827  NESTAVNGDAVILPPEEFGYPKATGHWASCIQVVDPVNTKSVVFTLELEDNECATSITTA 886

Query: 723  SF-SDDSNVYYCVGTAY-VLPEENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVYSLNA 778
             F S D  V+  VGTA  ++      + G + V+   EDGK L+ I + + +    +L A
Sbjct: 887  PFASQDDEVFLIVGTAKDLVVSPRSFSAGFLHVYRFHEDGKELEFIHKTKVEQPPTALLA 946

Query: 779  FNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSIS 838
            F G+LLA +   +++Y     D G +++  +C      L + +QT+G  I+V D+ +S++
Sbjct: 947  FQGRLLAGVGPDLRIY-----DLGMKQMLRKCQVTTPNLIVGLQTQGSRIIVSDIQESVT 1001

Query: 839  LLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDE- 894
              +YK +E  +     D  A W +   ++D +   G +   NL+ +R   K SE A ++ 
Sbjct: 1002 YCVYKFQENKLIPFCDDVIARWTTCCTMVDYETVAGGDKFGNLWMLRCPQKVSEEADEDN 1061

Query: 895  -------ERG-------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVN 940
                   ERG       RL ++  Y  G+     +  +LV        G    V +    
Sbjct: 1062 SGVHLLHERGYLNGTPNRLSLMIHYFPGDIPTSIQKTNLV-------AGGRDVVFWTGFQ 1114

Query: 941  GVIGVIAS-LPHEQYLFLEKLQTNLRKV--IKGVGGLNHEQWRSFNNEKKTVDAKNFLDG 997
            G +G++   +  E   F + L+  L        + G +H  +RS+    K V     +DG
Sbjct: 1115 GTLGILVPFVSREDVDFFQSLEMQLASSNGNPPLLGRDHLIYRSYYAPSKGV-----IDG 1169

Query: 998  DLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            DL E+F  L   +   I+  ++ SV E+ +++ ++
Sbjct: 1170 DLCETFFLLPNDKKLMIAGELDRSVREIERKISDM 1204


>gi|195376606|ref|XP_002047087.1| GJ13230 [Drosophila virilis]
 gi|194154245|gb|EDW69429.1| GJ13230 [Drosophila virilis]
          Length = 1229

 Score =  196 bits (499), Expect = 5e-47,   Method: Compositional matrix adjust.
 Identities = 256/1201 (21%), Positives = 468/1201 (38%), Gaps = 199/1201 (16%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            ++   I+G I  L  FR  G  +D++ + ++  +  +L++    + L             
Sbjct: 53   LMSTEIFGCIRALMAFRLTGGTKDYIVVGSDSGRIVILEYIPSKNSLEKVHQETFGKSGC 112

Query: 61   RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL- 117
            R    GQ   IDP  R  +IG      L  ++  D + +L  +  +   +   L    + 
Sbjct: 113  RRIVPGQYFAIDPKGRAVMIGAVEKQKLAYIMNRDTQARLTISSPLEAHKSNTLTYHMVG 172

Query: 118  --YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNGA 164
               G   P    L  D ++A    + + A + +    F E     N+        L+  A
Sbjct: 173  VDVGFDNPMFACLEIDYEEADLDPSGDAAQRTQQTLTFYELDLGLNHVVRKYSEPLEEHA 232

Query: 165  DLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR---- 215
            + L+ VP     P   GVLI  E  + Y   N      IR  I +    +D D  R    
Sbjct: 233  NFLVSVPGGNDGP--SGVLICSENYLTY--KNLGDQHDIRCPIPRRRNDLD-DPERGMIF 287

Query: 216  -------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVV 262
                         +LL    G +  + +  + + V+ +K++       A+ +  L    +
Sbjct: 288  ICSATHRTKSMYFFLLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPATAMCVLKTGFL 347

Query: 263  YIGSSYGDSQLIKLN------------------------LQPDAKGSYVEVLERYVNLGP 298
            ++ S +G+  L ++                           P A  + V +++   +  P
Sbjct: 348  FVASEFGNHYLYQIAHLGDDDDEPEFSSAMPLEEGETFFFAPRALKNLV-LVDELPSFAP 406

Query: 299  IVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSST 357
            I+   V DL  +   Q+    G     +LR++R+G+ ++E A  EL G    +W+++   
Sbjct: 407  IITSQVADLANEDTPQLYVLCGRGPRSTLRVLRHGLEVSEMAVSELPGNPNAVWTVKKRI 466

Query: 358  DDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSV 417
            DD FD +++VSF++ T +L++   + +EE    GF   T TL C     + LVQV    +
Sbjct: 467  DDEFDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLCCAALGDDALVQVYPDGI 524

Query: 418  RLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD-GILTE-VKHAQ 475
            R + S  R   NEWK+P   S+     N  QV++   G  LVY E+   G L E  + ++
Sbjct: 525  RHIRSDKRV--NEWKAPGKKSITKCAVNQRQVVITLSGRELVYFEMDPTGELNEYTERSE 582

Query: 476  LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLL 535
            +  EI C+ +  + E    S   AVG+  D +VRI SL   N ++   +     P    L
Sbjct: 583  MPAEIMCMALGTVPEGEQRSWFLAVGL-ADNTVRILSLDPNNCLSPCSMQALPSPAES-L 640

Query: 536  CAFE---------------------------GISYLLCALGDGHLLNFLLNMKTGELTDR 568
            C  E                           G  YL   L +G LL  +L+  +G+L D 
Sbjct: 641  CLVEMGHTESTTNAGGADDDVPAQRSGGSNKGTIYLNIGLSNGVLLRTVLDPVSGDLADT 700

Query: 569  KKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAA 628
            +   LG++P+ L     +    V A S R  + Y    +   + ++ + + +   F+S  
Sbjct: 701  RTRYLGSRPVKLFRIKMQGAEAVLAMSSRTWLSYYHQNRFHLTPLSYETLEYASGFSSEQ 760

Query: 629  FPDSLAIAKEGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQEQSRTFAICS------- 679
              + +       L I  ++ +  +  + + PL   PR  + H +  R     +       
Sbjct: 761  CSEGIVAISTNTLRILALEKLGAVFNQVAFPLQFTPRAFVIHPDTGRMLIAETDHNAYTE 820

Query: 680  -------------LKNQSCAEESEM-----------------------------HFVRLL 697
                         +++ +  EE E+                               +R L
Sbjct: 821  DTKNTRKEQMAEEMRSAAGDEERELAREMANAFINEVLPEDVFSAPKAGLGLWASQIRCL 880

Query: 698  DDQTFEFISTYPLDTFE--YGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFI 755
            D    + + + PL   E     ++L  S + D   Y  VG A  L      ++G  +   
Sbjct: 881  DAMHGQTMFSVPLTQNEAIMSMTLLKFSVAADGRYYLAVGIAKDLQLNPRISQGGCIDIY 940

Query: 756  VED---GKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSEC-G 811
              D     L+ +   E      +L  F G+LLA   + +++Y     D G +++  +C  
Sbjct: 941  KIDPTCSALEFLHRTEIDEIPGALCGFQGRLLAGCGRMLRIY-----DLGKKKMLRKCEN 995

Query: 812  HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI 871
             H     + +Q  G  + V D+ +S+  + Y+  E  +   A D +  W++A  +LD D 
Sbjct: 996  KHIPYQIVNIQAMGHRVYVSDVQESVFFIRYRRAENQLIIFADDTHPRWVTATTLLDYDT 1055

Query: 872  YLGAENNFNLFTVRKNSEGATDE------------ERGRL-------EVVGEYHLGEFVN 912
             +   + F   ++++     TD+            +RG L       E +  +H+GE + 
Sbjct: 1056 -IAIADKFGNLSIQRLPHSVTDDVDEDPTGTKSLWDRGLLSGASQKSENICSFHVGEIIM 1114

Query: 913  RFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGV 971
              +  +L+        G    +I+ T++G +G        E Y F + L+ ++R     +
Sbjct: 1115 SLQKATLI-------PGGSEALIYATLSGTVGAFVPFTSREDYDFFQHLEMHMRNENPPL 1167

Query: 972  GGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEE 1031
             G +H  +RS          KN LDGDL E +L +   +   I+  M  +  ++CK++E+
Sbjct: 1168 CGRDHLSYRS-----SYYPVKNVLDGDLCEQYLSIDAVKQKSIAGDMFRTPNQICKKLED 1222

Query: 1032 L 1032
            +
Sbjct: 1223 I 1223


>gi|351712542|gb|EHB15461.1| Splicing factor 3B subunit 3 [Heterocephalus glaber]
          Length = 1240

 Score =  196 bits (498), Expect = 6e-47,   Method: Compositional matrix adjust.
 Identities = 280/1218 (22%), Positives = 497/1218 (40%), Gaps = 222/1218 (18%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            +L V ++G I +L  FR  G  +D++ + ++  +  +L++   S  +  +   +   + G
Sbjct: 53   LLTVEVFGVIRSLMAFRLTGGTKDYIVVGSDSGRIVILEYQP-SKNMFEKIHQETFGKSG 111

Query: 61   -RPTDNGQIGIIDPDCRLIGLHLYDG--LFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
             R    GQ   +DP  R + +   +   L  ++  D   +L  +  +   +   L    +
Sbjct: 112  CRRIVPGQFLAVDPKGRAVMISAIEKQKLVYILNRDAAARLTISSPLEAHKANTLVYHVV 171

Query: 118  ---YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNG 163
                G   P    L  D ++A +  T E A   +    F E     N+        L+  
Sbjct: 172  GVDVGFENPMFACLEMDYEEADNDPTGEAAANTQQTLTFYELDLGLNHVVRKYSEPLEEH 231

Query: 164  ADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLL 218
             + LI VP     P   GVLI  E  I Y   N      IR  I +    +D D  R   
Sbjct: 232  GNFLITVPGGSDGP--SGVLICSENYITY--KNFGDQPDIRCPIPRRRNDLD-DPER--- 283

Query: 219  GDHAGLLHLLVITH----------EKEKVTGLKIELLGETSIASTISY------LDNAVV 262
                G++ +   TH          + E+    KI L  +  + +T+         +N  +
Sbjct: 284  ----GMIFVCSATHKTKSMFFFLAQTEQGDIFKITLETDEDMLATVGMNKSFVSAENCYL 339

Query: 263  YIGSSYGD-------SQLIKLN------LQPDAKGSYVEVLERYVNLGPIVDFC-VVDLE 308
            Y  +  GD       S  + L        QP    + V +++   +L PI+ FC + DL 
Sbjct: 340  YQIAHLGDDDEEPEFSSAMPLEEGDTFFFQPRPLKNLV-LVDELDSLSPIL-FCQIADLA 397

Query: 309  RQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTDDPFDTFLVV 367
             +   Q+    G     SLR++R+G+ ++E A  EL G    +W++R   +D FD +++V
Sbjct: 398  NEDTPQLYVACGRGPRSSLRVLRHGLEVSEMAVSELPGNPNAVWTVRRHIEDEFDAYIIV 457

Query: 368  SFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSREL 427
            SF++ T +L++   + +EE    GF   T TL C     + LVQV    +R + +  R  
Sbjct: 458  SFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSLLGDDALVQVYPDGIRHIRADKR-- 513

Query: 428  RNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKHAQLEYEISCLDI 485
             NEWK+P   ++     N  QV++A  GG LVY E+   G L E  +  ++  ++ C+ +
Sbjct: 514  VNEWKTPGKKTIVKCAVNQRQVVIALTGGELVYFEMDPSGQLNEYTERKEMSADVVCMSL 573

Query: 486  NPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIP-RSVLLCAFE----- 539
              +      S+  AVG+  D +VRI SL   + +  + L  + +P +   LC  E     
Sbjct: 574  ANVPPGEQRSRFLAVGL-VDNTVRIISLDPSDCL--QPLSMQALPAQPESLCIVEMGGTE 630

Query: 540  -----------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNT 588
                       G  YL   L +G LL  +L+  TG+L+D +   LG++P+ L     +  
Sbjct: 631  KQDELGERGSIGFLYLNIGLQNGVLLRTVLDPVTGDLSDTRTRYLGSRPVKLFRVRMQGQ 690

Query: 589  THVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDD 648
              V A S R  + YS   +   + ++ + +     F S   P+ +       L I  ++ 
Sbjct: 691  EAVLAMSSRSWLSYSYQSRFHLTPLSYETLEFASGFASEQCPEGIVAISTNTLRILALEK 750

Query: 649  IQKLHIR-SIPLGEHPRR-ICHQEQSR-----------TFAICSLKNQSCAEE------- 688
            +  +  + + PL   PR+ + H E +            T A  + + Q  AEE       
Sbjct: 751  LGAVFNQVAFPLQYTPRKFVIHPESNNLIIIETDHNAYTEATKAQRKQQMAEEMVEAAGE 810

Query: 689  ------SEM-------------------------HFVRLLDDQTFEFISTYPLDTFEYGC 717
                  +EM                           +R+++      +    L+  E   
Sbjct: 811  DERELAAEMAAAFLNENLPESIFGAPKAGNGQWASVIRVMNPIQGNTLDLVQLEQNEAAF 870

Query: 718  SILSCSFSDDSNVYYC-VGTAYVLPEENEPTKGRILVF--IVEDG-KLQLIAEKETKGAV 773
            S+  C FS+    +Y  VG A  L        G  +    +V +G KL+ + +   +   
Sbjct: 871  SVAVCRFSNTGEDWYVLVGVAKDLILNPRSVAGGFVYTYKLVNNGEKLEFLHKTPVEEVP 930

Query: 774  YSLNAFNGKLLAAINQKIQLY----KWMLR------------------------------ 799
             ++  F G++L  + + +++Y    K +LR                              
Sbjct: 931  AAIAPFQGRVLIGVGKLLRVYDLGKKKLLRKCENKTPVEEVPAAIAPFQGRVLIGVGKLL 990

Query: 800  ---DDGTRELQSECGHHGHILALY---VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
               D G ++L  +C  + HI A Y   +QT G  ++V D+ +S   + YK  E  +   A
Sbjct: 991  RVYDLGKKKLLRKC-ENKHI-ANYISGIQTIGHRVIVSDVQESFIWVRYKRNENQLIIFA 1048

Query: 854  RDYNANWMSAVEILDDDIYLGAENNFNLFTVR--KNSEGATDE---------ERGRL--- 899
             D    W++   +LD D   GA+   N+  VR   N+    DE         +RG L   
Sbjct: 1049 DDTYPRWVTTASLLDYDTVAGADKFGNICVVRLPPNTNDEVDEDPTGNKALWDRGLLNGA 1108

Query: 900  ----EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQY 954
                EV+  YH+GE V   +  +L+        G   ++++ T++G IG++     HE +
Sbjct: 1109 SQKAEVIMNYHVGETVLSLQKTTLI-------PGGSESLVYTTLSGGIGILVPFTSHEDH 1161

Query: 955  LFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEI 1014
             F + ++ +LR     + G +H  +RS+         KN +DGDL E F  +   +   +
Sbjct: 1162 DFFQHVEMHLRSEHPPLCGRDHLSFRSY-----YFPVKNVIDGDLCEQFNSMEPNKQKNV 1216

Query: 1015 SKTMNVSVEELCKRVEEL 1032
            S+ ++ +  E+ K++E++
Sbjct: 1217 SEELDRTPPEVSKKLEDI 1234


>gi|321249291|ref|XP_003191408.1| U2 snRNA binding protein [Cryptococcus gattii WM276]
 gi|317457875|gb|ADV19621.1| U2 snRNA binding protein, putative [Cryptococcus gattii WM276]
          Length = 1217

 Score =  196 bits (497), Expect = 8e-47,   Method: Compositional matrix adjust.
 Identities = 278/1181 (23%), Positives = 478/1181 (40%), Gaps = 184/1181 (15%)

Query: 7    YGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTDN 65
            +G I  +  FR  G  +D++   ++  +  +L++    +        +V  + G R    
Sbjct: 60   FGTIRNIAGFRLAGMTKDYILATSDSGRLSILEFVISPTPHFESLYQEVFGKSGSRRIVP 119

Query: 66   GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YGC 120
            GQ   +DP  R  L+G      L  V+  + +G+L  +  +   +   L    +    G 
Sbjct: 120  GQFLAVDPKGRSCLVGSLEKTKLVYVLNRNTEGKLYPSSPLEAHKNHTLVTHVIGVDQGY 179

Query: 121  AKPTIVVLYQDNKDARHVKT---YEVALKDKDFVE---------GPWSQNNLDNGADLLI 168
              P    L  D  ++    T   YE A K   F E           WS+   D  A+LL+
Sbjct: 180  DNPLYAALEIDYSESDQDPTGEAYENAQKHLTFYELDLGLNHVVRKWSEPT-DRRANLLV 238

Query: 169  PVP------------PPLCGVLIIGEETIVYCSANA-FKAIPIRPSITKAYGRVDADGSR 215
             VP            P   GVL+  E+ I++   +     IPI         R   D SR
Sbjct: 239  QVPGGQNANSDRFEGP--SGVLVCTEDHIIWKHMDVDAHRIPIPRRRNPLVQR--GDKSR 294

Query: 216  -----------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLD 258
                             +LL    G L+ + I H  E V  LKI+      +A+++  L 
Sbjct: 295  GLIIVSAVMHKIKGAFFFLLQSEDGDLYKVWIEHNGEDVVALKIKYFDTVPVANSLCILK 354

Query: 259  NAVVYIGSSYGDSQLIKLNLQPDAKG-------SYVE----------------------V 289
               +Y+ S + D  L +     +  G        Y E                      +
Sbjct: 355  RGYIYVASEFSDQNLYQFQSLAEDDGEQEWSSTDYPENGNIDGPLPFAFFDPQPLRNLLL 414

Query: 290  LERYVNLGPIVDFCVVDL--ERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI 347
            ++   +L PI D  VV+L        Q+    G     + R +++G+ + E  S  L G+
Sbjct: 415  VDSVPSLDPITDAHVVNLLGASSDTPQIYAACGRGARSTFRTLKHGLDVTEMVSSPLPGV 474

Query: 348  K-GMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIY 406
               +W+L+ + DD +D+++V+SF + T +L++   + +EE    GF S   TL       
Sbjct: 475  PTNVWTLKLTEDDEYDSYIVLSFPNGTLVLSIG--ETIEEVNDTGFLSSGPTLAVQQLGN 532

Query: 407  NQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-D 465
              L+QV    +R + +  R   +EW +PPG ++  AT N  QV++A     LVY E+  +
Sbjct: 533  AGLLQVHPYGLRHIRAADR--VDEWPAPPGQTIVAATTNQRQVVIALSTAELVYFELDPE 590

Query: 466  GILTEVKHAQ-LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEH 523
            G L+E +  + L    +C+ I  + E    +   AVG   + +V I SL PD  L T   
Sbjct: 591  GSLSEYQEKKALPGNATCVTIAEVPEGRRRTPFLAVGC-DNQTVSIISLEPDSTLDTLSL 649

Query: 524  LGGEIIPRSVLLCAFEGIS--------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGT 575
                  P S+ L      S        +L   L +G LL  +++   G L+D +   LG 
Sbjct: 650  QALTAPPTSICLAEIFDTSIDKNRATMFLNIGLMNGVLLRTVVDPVDGSLSDTRLRFLGA 709

Query: 576  QPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAI 635
            +P  L   + +    V A S R  ++Y+    L    +    + +    ++A  PD L  
Sbjct: 710  KPPKLVRANVQGQPSVMAFSSRTWLLYTYQDMLQTQPLIYDTLEYAWSLSAAMCPDGLIG 769

Query: 636  AKEGELTIGTIDDI-QKLHIRSIPLGEHPRR-ICH---------QEQSRTFAICSLKNQS 684
                 L I +I  + +KL   S PL   PR+ I H         +   RT++  +++   
Sbjct: 770  ISGNTLRIFSIPKLGEKLKQDSTPLTYTPRKFISHPFNPVFYMIEADHRTYSKGAIERIV 829

Query: 685  CAEESE--------------------------MHFVRLLDDQTFEFISTYPLDTFEYGCS 718
              +ESE                             VR+LD    E I T+ LD  E   S
Sbjct: 830  KQKESEGRRVDTLLLDLPANEFGRPRAPAGHWASCVRVLDPLANETIMTFDLDEDEAAFS 889

Query: 719  ILSCSFS-DDSNVYYCVGTAYVLPEENEPTK-GRILVF-IVEDGK-LQLIAEKETKGAVY 774
            I    F       +  VGT      + +  K G + V+ I E G+ L+ + + +T     
Sbjct: 890  IAIAYFERGGGEPFLVVGTGVKTTLQPKGCKEGYLRVYAIKEQGRVLEFLHKTKTDDIPL 949

Query: 775  SLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY-VQTRGDFIVVGDL 833
             L  F G LLA + + ++LY+      G + L  +C ++G   A+  +  +G  I+VGD+
Sbjct: 950  CLAGFQGFLLAGVGKSLRLYEM-----GKKALLRKCENNGFPTAVVTINVQGARIIVGDM 1004

Query: 834  MKSISLLIYKHEEGAIEER-----ARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKN- 887
             +S    +Y+    +I  R     A D    W++ V  +D +     +   N+F  R + 
Sbjct: 1005 QESTFYCVYR----SIPTRQLLIFADDSQPRWITCVTSVDYETVACGDKFGNIFINRLDP 1060

Query: 888  --SEGATDEERG------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTV----- 934
              SE   D+  G      +  ++G  H  E +  +  GS+V     + + +IP V     
Sbjct: 1061 SISEKVDDDPTGATILHEKSFLMGAAHKTEMIAHYNIGSVV-----TSITKIPLVAGGRD 1115

Query: 935  --IFGTVNGVIGVIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDA 991
              ++ T++G +G +   +  +   F+  L+ ++R     + G +H  +R +      V  
Sbjct: 1116 VLVYTTISGAVGALVPFVSPDDIEFMSTLEMHMRTQDISLVGRDHIAYRGY-----YVPI 1170

Query: 992  KNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            K  +DGDL ESF  L   +   I+  ++ SV ++ K++E++
Sbjct: 1171 KGVVDGDLCESFSLLPYPKQQAIASDLDRSVGDVLKKLEQM 1211


>gi|391867503|gb|EIT76749.1| splicing factor 3b, subunit 3 [Aspergillus oryzae 3.042]
          Length = 1034

 Score =  195 bits (495), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 243/1010 (24%), Positives = 430/1010 (42%), Gaps = 168/1010 (16%)

Query: 155  WSQNNLDNGADLLIPVP-----PPLCGVLIIGEETIVYCSAN--AFKA-IPIRPSITKAY 206
            W+ + +D  + +L  VP     P   GVL+  E+++ Y  +N  AF+  IP R   T+  
Sbjct: 55   WA-DPVDRTSSMLFQVPGGADGP--SGVLVCAEDSVTYRHSNQDAFRVPIPRRSGPTE-- 109

Query: 207  GRVDADGSRYLLGDHAGLLHLL---------------------VITHEKEKVTG----LK 241
               + +  R++    AG++H +                     ++  +  ++TG    LK
Sbjct: 110  ---NPERKRFIT---AGVMHKMRGAFFFLLQTEDGDLFKLNIDMVEDDNGQLTGEVKRLK 163

Query: 242  IELLGETSIASTISYLDNAVVYIGSSYGDSQLIKL---------------NLQPDAK--- 283
            I+      +AS++  L +  +Y+ S  G+    +                N   D     
Sbjct: 164  IKYFDTVPVASSLLILKSGFLYVASEAGNHHFYQFEKLGDDDEEIEFSSENFSADPSVPL 223

Query: 284  ---------GSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGI 334
                        + ++E   +L P++D  V +L  +   Q+ T SG     + R +++G+
Sbjct: 224  EPIYFRPRSAENLNLVETINSLNPLIDSKVANLSEEDAPQIYTISGTGARSTFRTLKHGL 283

Query: 335  GINEQASVELQGI-KGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFC 393
             ++E    EL  +   +W+ + +  D FD ++++SF + T +L++   + +EE    GF 
Sbjct: 284  EVSEIVDSELPSVPSAVWTTKLTRADEFDAYIILSFANGTLVLSIG--ETVEEVTDTGFL 341

Query: 394  SQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLAT 453
            S   TL       + L+Q+    +R + +  R   NEW +P   S+  A  N  QV +A 
Sbjct: 342  SSAPTLAVQQLGEDSLIQIHPRGIRHIMADRR--VNEWPAPQHRSIVAAATNERQVAVAL 399

Query: 454  GGGHLVYLEI-GDGILTEV-KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIF 511
              G +VY E+  DG L E  +  Q+   ++CL +  + E    S   AVG   D +VRI 
Sbjct: 400  SSGEIVYFEMDADGTLAEYDERRQMSGTVTCLSLGEVPEGRVRSSFLAVGC-DDSTVRIL 458

Query: 512  SL-PDLNLITKEHLGGEIIPRSVLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGE 564
            SL PD  L  K        P ++ + +    S      YL   L  G  L  +L+  TGE
Sbjct: 459  SLDPDSTLENKSVQALTAAPSALNIMSMSDSSSGGTTLYLHIGLYSGVYLRTVLDEVTGE 518

Query: 565  LTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYS--SNKKLLYSNVNLKEVSHMC 622
            L+D +   LG++P+ L   S K  T V A S RP + YS    K  + + ++   +    
Sbjct: 519  LSDTRTRFLGSKPVKLFQVSVKGQTAVLALSSRPWLGYSDIQTKGFMLTPLDYVPLEWGW 578

Query: 623  PFNSAAFPDSLAIAKEGELTIGTIDDI-QKLHIRSIPLGEHPRRIC-HQEQSRTFAICS- 679
             F+S    + +   +   L I +I+ +   +  +SI L   PRR   H EQ   + I S 
Sbjct: 579  NFSSEQCLEGMVGIQGQNLRIFSIEKLDNNMLQQSISLAYTPRRFLKHPEQPLFYVIESD 638

Query: 680  ---LKNQSCAEESEMHFVRLLDDQTF---EF------------------------ISTYP 709
               L   + A+  E    R  D+      EF                        +ST  
Sbjct: 639  NNVLSPSTRAKLLEDSKARTGDETVLPPEEFGYPRGSGHWASCIQVVDPVHAKAVVSTIE 698

Query: 710  LDTFEYGCSILSCSF-SDDSNVYYCVGTAYVLPEENEPTK--GRILVF-IVEDGK-LQLI 764
            L+  E   S+ +  F S D   +  VGTA  +   N P+   G I ++   EDG+ L+ I
Sbjct: 699  LEENEAAVSVAAVPFTSQDDETFLVVGTAKDM-NVNPPSSAGGYIHIYRFQEDGRELEFI 757

Query: 765  AEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHH---GHILALYV 821
             + + +    +L  F G+L+A I   +++Y     D G ++L  +C        I+ L  
Sbjct: 758  HKTKVEEPPLALLGFQGRLVAGIGPMLRIY-----DLGMKQLLRKCNAQVVPKTIVGL-- 810

Query: 822  QTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNL 881
            QT+G  IVV D+ +S++ ++YK++E  +     D  + W ++  ++D +   G +   N+
Sbjct: 811  QTQGSRIVVSDVRESVTYVVYKYQENVLIPFVDDSVSRWTTSTTMVDYETTAGGDKFGNI 870

Query: 882  FTVR---KNSEGATDE--------ERG-------RLEVVGEYHLGEFVNRFRHGSLVMRL 923
            + +R   K SE A ++        ERG       RLE++   +  +         LV   
Sbjct: 871  WMLRCPKKISEQADEDGSGAHLIHERGYLHGTPNRLELMIHVYTQDIPTTLHKTQLV--- 927

Query: 924  PDSDVGQIPTVIFGTVNGVIGVIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSF 982
                 G    +++   +G IG++   +  E   F + L+  L      + G +H  +RS+
Sbjct: 928  ----AGGRDILVWSGFHGTIGMLVPFVSREDVDFFQNLEMQLAAQNPPLAGRDHLIYRSY 983

Query: 983  NNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
                K V     +DGDL E++  L       I+  ++ SV E+ +++ ++
Sbjct: 984  YAPVKGV-----IDGDLCETYFLLPNDTKMMIAAELDRSVREIERKISDM 1028


>gi|167522323|ref|XP_001745499.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163775848|gb|EDQ89470.1| predicted protein [Monosiga brevicollis MX1]
          Length = 1172

 Score =  195 bits (495), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 261/1177 (22%), Positives = 476/1177 (40%), Gaps = 184/1177 (15%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
            I+G I  L  FR  G  +D+L ++T+  +  +LQ+D  +++L    M        R    
Sbjct: 24   IFGLIRDLRAFRLTGGTKDYLIVSTDSGRITILQYDVTANKLERVHMETFGKSGCRRIVP 83

Query: 66   GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVL-------DIKF 116
            GQ    DP  R  ++G      L  ++  D   +L  +  +   +   L       D+ F
Sbjct: 84   GQYLATDPKGRAVMVGAVEKQKLVYILNRDAAARLTISSPLEAHKGHSLVYDMVGVDVGF 143

Query: 117  ---LYGCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNN--------LDNGAD 165
               L+ C    + + Y+D  +    +  E+  +   F E     N+        LD+ A+
Sbjct: 144  ENPLFAC----LELDYEDVDEDPSGEALELLAQTLVFYELDLGLNHVVRKESIELDSFAN 199

Query: 166  LLIPVPPPL---CGVLIIGEETIVYCSANAFKAIPI------------RPSITKAYGRVD 210
             LIPVP       GVL+ G+  I + +      I +            R +I  A+    
Sbjct: 200  KLIPVPGGADGPSGVLVCGDGQITWRTVGEHTPISVSIPRRLDPFRQPRSTIINAFCMHK 259

Query: 211  ADGSRY-LLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYG 269
               + + LL    G L  L + H++++V G+ ++      +A ++  L   +++I S +G
Sbjct: 260  TKKTFFFLLQTEEGDLFKLTMVHDEDEVQGMILKYFDTVPVAKSMVLLKIGLLFIASEHG 319

Query: 270  DSQLIK-------------LNLQPDAKGSY--------VEVLERYVNLGPIVDFCVVDLE 308
            D QL +             L+ +P+ K  Y        +  L+   +L P++D  + D  
Sbjct: 320  DHQLYQIAQLGDNKDEPSFLSTEPEDKVLYFRPRPLLNLAPLDVIESLAPVMDCQLADAG 379

Query: 309  RQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTDDPFDTFLVV 367
            +Q   Q+    G     SLR++R+G+ ++  A   L G    +WS++   +D  DT++V+
Sbjct: 380  QQDGPQLYALCGRGSGSSLRVLRHGLEVSAMAESPLPGNPSAVWSVKRHVEDEADTYIVM 439

Query: 368  SFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSREL 427
            SF+  T +L +   + +EE    GF     TL       + L+Q+ +  +R +    R  
Sbjct: 440  SFVDATLVLGIG--ETVEEVTDSGFLGSVATLSASRIGDDALLQIHAMGIRHILHDGR-- 495

Query: 428  RNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKHAQLEYEISCLDI 485
             NEWK+P    ++    N  Q ++A   G LVY E+   G L E  +  ++  +++ L +
Sbjct: 496  INEWKAPDRTKISHCAVNERQAVVALSNGELVYFELDRSGQLIEHSERVEMSSQVTALAL 555

Query: 486  NPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVL-LCAFE----- 539
             PI E    ++  A+G+  D +VR+ SL   + +    L  + +P +   LC  E     
Sbjct: 556  APIPEGAQTARFLALGL-DDQTVRLMSLDKSDCLAP--LSMQALPGAAASLCMVEVRGTH 612

Query: 540  ----GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAAS 595
                G+S L   LG+G L+   ++  T +L+D +   LG + + L          V A S
Sbjct: 613  GEPSGLS-LAIGLGNGVLMRSRVDTLTADLSDTRTRYLGARAVKLFPVKVAEEPAVLALS 671

Query: 596  DRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDI-QKLHI 654
             +P + Y        + ++   + +   F+S   P+ +A      L I   + + Q  H 
Sbjct: 672  TKPWLSYRYQGHSRITPLSYDALEYASAFSSDQCPEGVAAVAGNTLRIFAFEKLGQVFHQ 731

Query: 655  RSIPLGEHPRRICHQEQSRTFAICSLKNQSC----------------------AEESEMH 692
             SIPL    RR+    ++ + A  +  +Q C                      AE+++  
Sbjct: 732  NSIPLEYTGRRLLLDPEA-SLAFIAEGDQGCLSVTAKAERLSRIASHQAGTAMAEQTDPQ 790

Query: 693  --------FVRLLDDQTF----------------------EFISTYPLDTFEYGCSILSC 722
                        LD + F                      E     PL   E   S+   
Sbjct: 791  EDLTSPAALAEPLDVKQFGQSYAGDNLWASNIRVVNLAQGETTCLVPLAQDEMVMSLARV 850

Query: 723  SFSDDSNVYYCVGTAYV--LPEENEPTKGRILVFIVEDGKLQLIAEKETKGAV-YSLNAF 779
             F+   N  + V        P++       +L F V+  ++ L+      G +  +L  F
Sbjct: 851  RFASSPNDKHIVAGVVKGWKPKQQSMDGAFLLTFQVQGDQVVLLHRTAVDGGLPCALAEF 910

Query: 780  NGKLLAAINQKIQLYKWMLRDDG----TRELQSECGHHGHILALYVQTRGDFIVVGDLMK 835
             GK+LA +   ++++    +        R+L S+  H        + T G  I   D   
Sbjct: 911  AGKVLAGVGNTLRIFDLGKKKLLLKTENRQLPSQVVH--------ITTMGTRICAADQKH 962

Query: 836  SISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDE- 894
            S   L YK  E A+   A D N  W +   +LD     GA + F  F V +     TD+ 
Sbjct: 963  SFVWLKYKPAENALTIFADDTNPRWCTRGVLLDYQTVAGA-DKFGNFVVARLGTDLTDQI 1021

Query: 895  -----------ERG-------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIF 936
                        RG       +++++  YH+GE +   +  +LV        G    +++
Sbjct: 1022 DDDPTGSKAFWSRGILNGASQKMDILCNYHVGETILSLQKVTLV-------PGGAECILY 1074

Query: 937  GTVNGVIGVIASLPH-EQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFL 995
             T++G IG+     + + + F   L+ +LR+    + G +H  +RS     K+V     +
Sbjct: 1075 TTMSGGIGLFLPFSNRDDFEFFTSLELHLRQEHAPLCGRDHLHYRSAYFPVKSV-----I 1129

Query: 996  DGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            DGDL E +  LS +  +E+++ +     ++ K++E++
Sbjct: 1130 DGDLCEQYPLLSASVKNEVAEGLERPTTDVIKKIEDI 1166


>gi|427798971|gb|JAA64937.1| Putative damage-specific dna binding complex subunit ddb1, partial
            [Rhipicephalus pulchellus]
          Length = 1259

 Score =  195 bits (495), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 260/1173 (22%), Positives = 479/1173 (40%), Gaps = 194/1173 (16%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
            ++G I ++  FR  G ++D+L + ++  +  +L+++ + S    +   +   + G R   
Sbjct: 58   VFGIIRSMMSFRLTGGSKDYLVVGSDSGRIVILEYNPQKS-CFEKVHQETFGKSGCRRIV 116

Query: 65   NGQIGIIDPDCRLIGLHLYDG--LFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGCA- 121
             GQ   +DP  R + +   +   L  ++  D   +L  +  +   +   L    +Y C  
Sbjct: 117  PGQHLAMDPKGRALMVAAVEKQKLVYILNRDAAARLTISSPLEAHKSSTL----VYHCVG 172

Query: 122  ------KPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNGA 164
                   P    L  D ++A    + E A        F E     N+        L+   
Sbjct: 173  VDVGFENPVFACLEMDYEEADSDPSGEAANNTHQTLTFYELDLGLNHVVRKYSEPLEEHG 232

Query: 165  DLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR---- 215
            + LI VP     P   GVLI  E  I Y   N      IR  I +    +D D  R    
Sbjct: 233  NFLIAVPGGSDGP--SGVLICSENYITY--KNFGDQPDIRCPIPRRRNDLD-DPERGMIF 287

Query: 216  -------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVV 262
                         +L     G +  + +  +++ VT +K++      +A+++  L    +
Sbjct: 288  VCSANHKTKSMFFFLAQTEQGDIFKITLETDEDMVTEIKLKYFDTIPVAASMCVLKTGFL 347

Query: 263  YIGSSYGDSQLIKLN------------------------LQPDAKGSYVEVLERYVNLGP 298
            ++ + +G+  L ++                           P A  + + V E   +L P
Sbjct: 348  FVAAEFGNHCLYQIARLGEEDEEPEFSSAIPLEEGDTFFFAPRALRNLLPV-EELDSLSP 406

Query: 299  IVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSST 357
             +   + DL  +   Q+    G      +R++R+G+ ++E A  EL G    +W+++   
Sbjct: 407  AMGCTIADLANEDTPQLYVACGRGPRSCIRVLRHGLEVSEMAVSELPGNPNAVWTVKRKA 466

Query: 358  DDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSV 417
            D+ +D +++VSF++ T +L++   + +EE    GF   T TL C     + LVQV    +
Sbjct: 467  DEDYDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCAQIGDDALVQVYPEGI 524

Query: 418  RLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD-GILTE-VKHAQ 475
            R + +  R   NEW++P   ++     N  QV++A  GG LVY E+   G L E     +
Sbjct: 525  RHIRADKR--VNEWRTPGKKAIVKCAVNQRQVVIALTGGELVYFEMDPTGQLNEYTDRRE 582

Query: 476  LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGG-EIIPRSVL 534
            +  +I C+ +  +      S+  AVG+  D +VRI SL   + ++   +      P S+ 
Sbjct: 583  MSADIICMSLASVPTGEQRSRFLAVGL-ADNTVRIISLDPSDCLSPLSMQALPATPESLS 641

Query: 535  LCAF---------EGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSS 585
            +            +GI YL   L +G LL  +L+  TG+L+D +   LG++P+ L     
Sbjct: 642  IVEMGMTEVETTGQGILYLNIGLQNGVLLRTVLDQVTGDLSDTRTRYLGSRPVKLFKVRM 701

Query: 586  KNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGT 645
            + +    A S R  + Y    +   + ++ + +     F+S   P+ +       L I  
Sbjct: 702  QGSDAALAMSSRSWLSYYYQNRFHLTPLSYETLEFASGFSSEQCPEGIVAISTNTLRILA 761

Query: 646  IDDIQKLHIR-SIPLGEHPRRICHQEQSRTFAICSL------------KNQSCAEE---- 688
            ++ +  +  + S  L   PR+      S    I               + Q  AEE    
Sbjct: 762  LEKLGAVFNQVSTTLEYTPRKFVVNSDSGHLIIVETDHNAYTEKMKQQRKQQMAEEMVEA 821

Query: 689  ---------SEM-------------------------HFVRLLDDQTFEFISTYPLDTFE 714
                     +EM                           +R+L+    + +    L+  E
Sbjct: 822  AGEGEQELAAEMAAAFLSEVLPETVFGSPKAGSGMWASVIRVLNPADSQTLCKVALEQNE 881

Query: 715  YGCSILSCSF-SDDSNVYYCVGTAYVLPEE--NEPTKGRILVFIVE---DGKLQLIAEKE 768
               S+    F S     Y  VG A  L  +  +  + G +L + +    + +L+L+    
Sbjct: 882  AALSVALVKFASQPDEQYVVVGAARELSLQPWHARSGGLLLTYRLSHAGETRLELVHATS 941

Query: 769  TKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY-VQTRGDF 827
             + A  +L  F G+LLA + + ++LY     D G ++L  +C +     A+  +Q+ G+ 
Sbjct: 942  VEEAPTALCPFQGRLLAGVGKCLRLY-----DLGRKKLLRKCENKYIPSAIVSIQSMGNR 996

Query: 828  IVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR-K 886
            +VVGD+ +S   L YK +E  +   A D    W++A  +LD D   GA+   N+  +R  
Sbjct: 997  VVVGDVQESFFFLRYKRQENQLVIFADDAVPRWITASCMLDYDTVAGADKFGNVSIIRLP 1056

Query: 887  NSEGATDE------------ERGRL-------EVVGEYHLGEFVNRFRHGSLVMRLPDSD 927
            NS   +DE            +RG L       EV+  +H+GE V   +  +L+       
Sbjct: 1057 NS--VSDEVDEDPTGIKSLWDRGWLGGSSQKAEVISNFHIGETVLSLQKATLI------- 1107

Query: 928  VGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEK 986
             G   ++++ T++G IGV+     HE + F + L+ ++R     + G +H  +RS     
Sbjct: 1108 PGGSESLVYVTLSGTIGVLVPFTAHEDHDFFQHLEMHMRSENPPLCGRDHLSFRS----- 1162

Query: 987  KTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN 1019
                 KN +DGDL E F  L  ++   I++ ++
Sbjct: 1163 SYFPVKNVIDGDLCEQFNSLDPSKQKSIAEELD 1195


>gi|391341057|ref|XP_003744848.1| PREDICTED: splicing factor 3B subunit 3-like isoform 1 [Metaseiulus
            occidentalis]
          Length = 1211

 Score =  194 bits (492), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 247/1174 (21%), Positives = 479/1174 (40%), Gaps = 173/1174 (14%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
            ++G I +L  FR  G ++D+L + ++  + C+L+++   + L    M        R    
Sbjct: 58   VFGVIRSLMSFRLTGGSKDYLIVGSDSGRICILEFNPAKNVLEKIHMETFGKSGCRRIVP 117

Query: 66   GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YGC 120
            GQ   +DP  R  +IG      L  ++  D+  +L  +  +   +   L    +    G 
Sbjct: 118  GQYLAVDPKGRAVMIGAVEKQKLVYILNRDSNARLTISSPLEAHKSNTLVYHMVGVDVGF 177

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNGADLLIP 169
              P    +  D ++A +  T E A K +    F E     N+        L+  ++ LI 
Sbjct: 178  ENPVFACIELDYEEADNDHTGEAAKKAQQSLTFYELDLGLNHVVRKYSEPLEEPSNFLIA 237

Query: 170  VPPPL---CGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR----------- 215
            VP  +    GVL+  E  I Y   N      IR  I +    +D D  R           
Sbjct: 238  VPGGVDGPSGVLVCSENFITY--KNLGDQADIRCPIPRRRNDLD-DPDRGMLFVCSTQHK 294

Query: 216  ------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYG 269
                  +L     G +  + +  + + VT +K++      +A T+  L +  +++ S +G
Sbjct: 295  TKSMFFFLAQTEQGDIFKITLEFDDDAVTEIKLKYFDSLPVAQTMHVLKSGFLFVASEFG 354

Query: 270  DSQLIKL-------------NLQPDAKGSYVEVLERYV----------NLGPIVDFCVVD 306
            +  L ++             ++ P  +G     L R +          +L PI+   V D
Sbjct: 355  NHSLYQIAHLGDNTDEPEFSSIFPLEEGDTFFFLPRELKNLVLVDEMDSLSPIMTARVAD 414

Query: 307  LERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTDDPFDTFL 365
            L  +   Q+    G     ++R++R+G+ ++E A  EL G    +W+++   DD +D ++
Sbjct: 415  LTNEDTPQLYAACGRGPRSTMRVLRHGLEVSEMAVSELPGNPSAVWTVKKRADDEYDAYI 474

Query: 366  VVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSR 425
            VVSFI+ T +L++   + +EE    GF   T TL CH   ++ LVQ+    +R + +  R
Sbjct: 475  VVSFINATLVLSIG--ETVEEVTDSGFLGTTPTLACHQIGHDALVQIYPEGIRHIRADRR 532

Query: 426  ELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTE-VKHAQLEYEISCL 483
               NEW++     +     N  QV++A  GG L+Y E+   G L E  +  ++  ++ C+
Sbjct: 533  --VNEWRTSGKKLIVKCAVNQRQVVIALTGGELIYFEMDSSGQLNEYAERKEMNSDVLCM 590

Query: 484  DINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGG--EIIPRSVLLCAFEGI 541
             +  +      ++  AVG  +D +V + SL   + ++   + G  E  P S+ +    G 
Sbjct: 591  ALGSVPAGEQRTKFLAVGS-SDGTVHVISLDPKSCLSILSVQGMTESNPESLAIVDMSGH 649

Query: 542  S--------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFA 593
                     YL   L +G L   +L+  TG+++D +   LG++ + L     + T  V A
Sbjct: 650  EEGSGSSALYLNIGLQNGILKRMVLDAVTGDMSDTRTRFLGSRSVKLFKIKMQGTDAVLA 709

Query: 594  ASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLH 653
             S R  + Y    +   + ++   + +   F S   P+ +       L I  ++ +  + 
Sbjct: 710  MSSRCWLSYLFQNRFHLTPLSYDSLEYASGFCSEQCPEGIVAIAGNTLRILALEKLGAVF 769

Query: 654  IR-SIPLGEHPRRICHQEQSR------------TFAICSLKNQSCAEE------------ 688
             + S PL   PR     ++S+            T ++   + Q  AEE            
Sbjct: 770  NQLSAPLSFTPRSFVIDKKSQKLIVIETDHNAYTDSVKEKRKQHVAEEMIEAAGEAEQEM 829

Query: 689  -------------SEMHF-------------VRLLDDQTFEFISTYPLDTFEYGCSILSC 722
                          E  F             +R+L+      +    L+  E   S+   
Sbjct: 830  AAELAAAFLSEDLPESQFGAPRAGKKHWASVIRILNPSDLSTVYKIQLEQNEAAVSVALV 889

Query: 723  SFSDDSNVYYCVGTAYVL---PEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAF 779
            +F   ++    VG +  L   P E   +       + +  +L+L+          ++  F
Sbjct: 890  NFDKSTDPILLVGISKDLQLSPRECRNSFINAYRVVKDCTELELVHTTVMDDVPQAMCNF 949

Query: 780  NGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHI--LALYVQTRGDFIVVGDLMKSI 837
              ++L  + + +++Y     D G +++  +C  + HI  L + +   G+ IVVGD+ +S 
Sbjct: 950  GNRVLIGVGRCLRIY-----DFGKKKMLRKC-ENKHIPNLIVTINAVGNRIVVGDVQESF 1003

Query: 838  SLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR--KNSEGATDE- 894
              + Y+  E  +   A D+   W +A  ++D    +G +   N++ +R   N+    DE 
Sbjct: 1004 FFIRYRMLENQLIIFADDFTPRWTTAACMVDYRTVVGGDKFGNVYILRLPGNTSDDVDED 1063

Query: 895  --------ERGRL-------EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTV 939
                    +RG L       EV+   H+GE +   +  +L+   P++       +++ T+
Sbjct: 1064 PTGVRSLWDRGWLGGAGQKAEVLSMTHVGELIVSLQKTALIPGGPEA-------IVYTTI 1116

Query: 940  NGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGD 998
             G +G +      + + F + L+  +R     + G +H  +RS+    K V     +DGD
Sbjct: 1117 AGGVGALIPFSSKDDHEFFQHLEMYMRTEHPPICGRDHLSFRSYYFPVKAV-----IDGD 1171

Query: 999  LIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            L E +  L   +  +I+  +     E+ K++E++
Sbjct: 1172 LCEQYNSLDANKQKQIADELERLPHEVAKKLEDI 1205


>gi|258570355|ref|XP_002543981.1| pre-mRNA splicing factor rse1 [Uncinocarpus reesii 1704]
 gi|237904251|gb|EEP78652.1| pre-mRNA splicing factor rse1 [Uncinocarpus reesii 1704]
          Length = 1209

 Score =  193 bits (491), Expect = 4e-46,   Method: Compositional matrix adjust.
 Identities = 274/1177 (23%), Positives = 490/1177 (41%), Gaps = 187/1177 (15%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
            ++G I +L  FR  G  +D++ I ++  +  ++++   S     R   +   + G R   
Sbjct: 64   VFGIIRSLAAFRLAGSNKDYIIIGSDSGRITIVEY-VPSQNRFNRIHLETFGKSGIRRVV 122

Query: 65   NGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEE-------LQVLDIK 115
             GQ   +DP  R  LI     + L  V+  + + +L  +  +           L  LD+ 
Sbjct: 123  PGQYLAVDPKGRACLIASVEKNKLVYVLNRNAQAELTISSPLEAHRPQTIVFALTALDVG 182

Query: 116  F---LYGCAKPTIVVLYQDNKDARHVKT------YEVALKDKDFVEGPWSQNNLDNGADL 166
            +   ++G  +       QD   + + ++      YE+ L     V   W +  +D  A +
Sbjct: 183  YENPIFGALEVEYTESDQDPTGSAYEESEKLLVYYELDLGLNHVVRK-WVEP-VDRTAMM 240

Query: 167  LIPVP-----PPLCGVLIIGEETIVYCSAN--AFKA-IPIRPSITKAYGRVDADGSRYLL 218
            L  VP     P   GVL+  E  I Y  +N   F+  IP R   T+     + D  RY+ 
Sbjct: 241  LFQVPGGADGP--SGVLVCAEGNITYRHSNQDVFRVPIPRRSGPTE-----NPDRKRYIT 293

Query: 219  GDHAGLLHLL---------------------VITHEKEKVTG----LKIELLGETSIAST 253
               AG++H +                     ++  +  + TG    LK++      IAS+
Sbjct: 294  ---AGVVHKMRRAFFCLLQTEDGDLFKVTIDMVEDDNGQPTGEVRRLKLKYFDTVPIASS 350

Query: 254  ISYLDNAVVYIGSSYGDSQLIKLN--------------------LQPDAKGSY------- 286
            +  L N  +++ S  G+    +                      L+P A   +       
Sbjct: 351  LCILKNGFLFVASENGNHHFYQFEKLGDDDEETEFTSDDFSSDPLEPLAPVYFRPRPAEN 410

Query: 287  VEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG 346
            + ++E   ++ P++   V +L      Q+ T  G     + R +++G+ ++E    EL  
Sbjct: 411  LNLVESINSVNPLMSCKVANLTEDDAPQLYTLCGTGARSTFRTLKHGLEVSEIVESELPS 470

Query: 347  I-KGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAI 405
            +   +W+ + + +D +D ++++SF + T +L++   + +EE    GF S   TL      
Sbjct: 471  VPSAVWTTKLTRNDQYDAYIILSFTNGTLVLSIG--ETVEEVTDTGFLSSAPTLAVQQLG 528

Query: 406  YNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG- 464
             + L+QV    +R + +  R   NEW +P   S+  A  N  QV +A   G +VY E+  
Sbjct: 529  EDSLIQVHPKGIRHIHADRR--VNEWPAPQHRSIVAAATNERQVAVALSSGEIVYFEMDT 586

Query: 465  DGILTEV-KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKE 522
            DG L E  +  ++   ++CL +  I      S   AVG   D +VRI SL PD  L  K 
Sbjct: 587  DGSLAEYDEKREMSGTVTCLSLGEILPGRVRSSFLAVGC-DDSTVRILSLDPDSTLENKS 645

Query: 523  HLGGEIIPRSV-LLCAFEGIS-----YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQ 576
                   P ++ ++   +  S     YL   L  G  L  +L+  TGEL+D +   LG +
Sbjct: 646  VQALTSAPSALSIMSMIDSTSGGSTLYLHIGLYSGIYLRTVLDEVTGELSDTRTRFLGLK 705

Query: 577  PITLRTFSSKNTTHVFAASDRPTVIYS--SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLA 634
            P+ L + S K    V A S RP + YS    K  + + ++   +     F+S    + + 
Sbjct: 706  PVKLFSVSVKEQRAVLALSSRPWLGYSDLQTKNFMLTPLDYVPLEWSWNFSSEQCVEGMV 765

Query: 635  IAKEGELTIGTIDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCA------- 686
              +   L I +I+ +   L   +IPL   PR      +   F +    N   +       
Sbjct: 766  GIQGQNLRIFSIEKLDNNLLQETIPLAYTPRHFVRHPEHPLFYVIGADNNILSPATKAKL 825

Query: 687  ---------EESEM------------HF---VRLLDD-QTFEFISTYPLDTFEYGCSILS 721
                     E SE+            H+   ++++D   +   +S   L+  E   S+ +
Sbjct: 826  LEDSKAVNGEASELPPEDFGYSRGTGHWASCIQVVDPINSKTVLSRIELEENEAAVSVAA 885

Query: 722  CSFSD-DSNVYYCVGTAYVLPEENEPTK--GRILVF-IVEDGK-LQLIAEKETKGAVYSL 776
              FS  D   +  VGT   +   N P+   G I ++   EDGK L+ I + + +    +L
Sbjct: 886  VPFSSQDDETFLVVGTGKDM-VVNPPSSSCGYIHIYRFQEDGKELEFIHKTKVESPPQAL 944

Query: 777  NAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYV---QTRGDFIVVGDL 833
             AF G+LLA I   +++Y     D G ++L  +C     ++   +   QT+G  I+V D+
Sbjct: 945  LAFQGRLLAGIGTNLRIY-----DLGMKQLLRKC--QAEVVPRMIVGLQTQGSRIIVSDV 997

Query: 834  MKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEG 890
             +S++ ++YK++E  +   A D  A W +   ++D +   G +   NL+ +R   K SE 
Sbjct: 998  QESVTYVVYKYQENRLIPFADDIIARWTTCTTMVDYETVAGGDKFGNLWLLRCPQKASEE 1057

Query: 891  ATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGT------------ 938
            A ++  G   +    +L    NR    SL++     D   IPT I  T            
Sbjct: 1058 ADEDGSGAHLIHERQYLQGAPNRL---SLMIHFYPQD---IPTSIQKTQLVAGGRDILVW 1111

Query: 939  --VNGVIGV-IASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFL 995
              + G IG+ I  +  E   F + L+  L      + G +H  +RS+        AK  +
Sbjct: 1112 TGLQGTIGMLIPFVSREDVDFFQSLEMQLTSQTPPIAGRDHLIYRSY-----YAPAKGTI 1166

Query: 996  DGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            DGDL E++  L   +   I+  ++ SV E+ +++ ++
Sbjct: 1167 DGDLCETYFTLPNDKKLMIAGELDRSVREIERKISDM 1203


>gi|390357128|ref|XP_001198237.2| PREDICTED: splicing factor 3B subunit 3-like [Strongylocentrotus
            purpuratus]
          Length = 949

 Score =  193 bits (490), Expect = 4e-46,   Method: Compositional matrix adjust.
 Identities = 207/936 (22%), Positives = 406/936 (43%), Gaps = 144/936 (15%)

Query: 216  YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIK 275
            +L     G +  + +  + + VT ++++      +A++++ L    ++I S YG+  L +
Sbjct: 33   FLAQTEQGDIFKITLETDDDMVTEIRMKYFDTVPVATSMNVLKTGFLFIASEYGNHYLYQ 92

Query: 276  L-------------NLQPDAKGSY----------VEVLERYVNLGPIVDFCVVDLERQGQ 312
            +             +  P  +G            +E +++  +L PI+   + DL  +  
Sbjct: 93   IAHLGDDDDEPEFSSATPLEEGDTFFFAPRTLRNLEEVDQLESLSPILSCQIADLASEDT 152

Query: 313  GQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTDDPFDTFLVVSFIS 371
             Q+    G     S+R++R+G+ ++E A  EL G    +W+++  +DD +D +++VSF++
Sbjct: 153  PQLYVACGRGPRSSMRVLRHGLEVSEMAVSELPGNPNAVWTVKKKSDDEYDAYIIVSFVN 212

Query: 372  ETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEW 431
             T +L++   + +EE    GF   T TL       + L+Q+    +R + +  R   NEW
Sbjct: 213  ATLVLSIG--ETVEEVTDSGFLGTTPTLSSSLIGDDALLQIYPDGIRHIRADKR--VNEW 268

Query: 432  KSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKHAQLEYEISCLDINPIG 489
            K+P   S+     N  QV++A  GG +VY E+   G L E  +  ++  ++ C+ +  + 
Sbjct: 269  KTPGKKSIVKCAVNQRQVVIALTGGEMVYFEMDPTGQLNEYTERKEMNADVKCMSLATVP 328

Query: 490  ENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSV-LLCAFE--------- 539
                 ++  AVG+  D +VR+ SL   + +  + L  + +P     LC  E         
Sbjct: 329  SGEQRARFLAVGL-DDNTVRVISLDQSDCL--QPLSMQALPAPAEALCIIEMGGTEAREE 385

Query: 540  -------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
                   G  +L   L +G LL  +L+  TG+L+D +   LG++P+ L   + + +  V 
Sbjct: 386  TGERGSSGGLFLNIGLQNGVLLRMVLDSVTGDLSDTRTRYLGSRPVKLFRINMQGSEAVL 445

Query: 593  AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
            A S R  + Y    +   + ++ + +     F S   P+ +       L I  ++ +  +
Sbjct: 446  AMSSRSWLSYWYQSRFHLTPLSYETLEFASGFASEQCPEGIVAISTNTLRILALEKLGAV 505

Query: 653  HIR-SIPLGEHPRRIC-HQEQSRTFAICSLKN-----------QSCAEE----------- 688
              + S  L   PR+   H E +    I +  N           Q  AEE           
Sbjct: 506  FNQVSTQLKYTPRKFAVHLENNNIVVIETDHNTYTDSTKAQRKQQMAEEMVEAAGEEEKE 565

Query: 689  --SEM-------------------------HFVRLLDDQTFEFISTYPLDTFEYGCSILS 721
              +EM                           +RLL+    + +    L+  E   S+  
Sbjct: 566  LAAEMAEAFLQEELPESTFSSPKAGSGMWASTIRLLNPVQGQTLDMVELEQNEAAFSLAL 625

Query: 722  CSFSD-DSNVYYCVGTAY-VLPEENEPTKGRILVFIV--EDGKLQLIAEKETKGAVYSLN 777
              F++ +   +  VGTA  V       + G I  +++  E  KL+ + +        S+ 
Sbjct: 626  VKFANREDETHVLVGTAKDVTLSPRTCSGGAIHTYLLTEEGKKLEFLHKTLVDDVPSSIA 685

Query: 778  AFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHI--LALYVQTRGDFIVVGDLMK 835
            AF G++L  I + +++Y     D G ++L  +C  + HI      + T G+ +++ D+ +
Sbjct: 686  AFQGRVLIGIGRLLRIY-----DLGKKKLLRKC-ENKHIPNFITNILTTGNRVIISDVQE 739

Query: 836  SISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR--KNSEGATD 893
            S+  L YK +E  +   A D N  W++A  +LD D  + A+   N+  +R   +     D
Sbjct: 740  SLHFLKYKRQENQMVIFADDSNPRWITATCLLDHDTVMCADKFGNITVLRLPTSVNDNLD 799

Query: 894  E---------ERGRL-------EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFG 937
            E         +RG L       +VV  + +GE V   +  +L+        G   +V++ 
Sbjct: 800  EDPTGVKALWDRGLLNGASQKTDVVSSFFIGETVLSLQKATLI-------PGGSESVVYT 852

Query: 938  TVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLD 996
            T++G IGV+     HE + F + L+ ++R     + G +H  +RS+         KN +D
Sbjct: 853  TLSGAIGVLVPFTAHEDHDFFQHLEMHMRSEYAPLCGRDHLSYRSY-----YYPVKNVID 907

Query: 997  GDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            GDL E +  +  ++   +++ ++ +  E+ K++E++
Sbjct: 908  GDLCEQYNSMELSKQRTVAEELDRNPSEVSKKLEDI 943


>gi|312370905|gb|EFR19207.1| hypothetical protein AND_22901 [Anopheles darlingi]
          Length = 1287

 Score =  192 bits (489), Expect = 6e-46,   Method: Compositional matrix adjust.
 Identities = 264/1197 (22%), Positives = 476/1197 (39%), Gaps = 206/1197 (17%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            +L   ++G + +L  FR  G ++D+  I ++  +  +L+++   ++L             
Sbjct: 126  LLQTEVFGVVRSLMSFRLTGGSKDYAVIGSDSGRIVILEYNPVKNQLEKVHQETFGKSGC 185

Query: 61   RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL- 117
            R    GQ   IDP  R  +IG      L  ++  D++ +L  +  +   +   L    + 
Sbjct: 186  RRIVPGQYLSIDPKGRAVMIGAIEKQKLVYILNRDSEARLTISSPLEAHKSNTLTYHMVG 245

Query: 118  --YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNGA 164
               G   P    L  D ++A    T E A K +    F E     N+        L+   
Sbjct: 246  VDVGFENPMFACLEIDYEEADGDPTGEAAAKTQQTLTFYELDLGLNHVVRKYSEPLEEHV 305

Query: 165  DLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR---- 215
            + LI VP     P   GVLI  E  + Y   N      IR  I +    +D D  R    
Sbjct: 306  NFLISVPGGNDGP--SGVLICSENYLTY--KNLGDQHDIRCPIPRRRNDLD-DPERGMIF 360

Query: 216  -------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVV 262
                         +L+    G +  + +  + + V+ +K++       A+ +  L    +
Sbjct: 361  ICSATHRTKSMYFFLVQTEQGDIFKVTLETDDDVVSEIKLKYFDTVPPATAMCVLKTGFL 420

Query: 263  YIGSSYGDSQLIKL-------------NLQPDAKGSYVEVLERYV----------NLGPI 299
            ++   +G+  L ++             +  P  +G       R +          +  PI
Sbjct: 421  FVACEFGNHYLYQIAHLGDDDDEPEFSSAMPLEEGDTFFFAPRQLKNLVMVDDIPSYAPI 480

Query: 300  VDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTD 358
            +   V DL  +   Q+    G     S+R++R+G+ ++E A  EL G    +W+++   D
Sbjct: 481  LGCQVADLANEDTPQLYLACGRGPRSSIRVLRHGLEVSEMAVSELPGNPNAVWTVKKRID 540

Query: 359  DPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVR 418
            D FD +++VSF++ T +L++   D +EE    GF          DA    LVQV    +R
Sbjct: 541  DEFDAYIIVSFVNATLVLSIG--DTVEEVTDSGFL----VPLGDDA----LVQVYPDGIR 590

Query: 419  LVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD-GILTE-VKHAQL 476
             + +  R   NEWK+P   ++     N  QV++A  GG LVY E+   G L E  +  ++
Sbjct: 591  HIRADKR--VNEWKAPGKKTIMKCAVNQRQVVIALSGGELVYFEMDPTGQLNEYTERKKM 648

Query: 477  EYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSV-LL 535
              E+ C+ +  +      S   AVG+  D +VRI SL   + ++   +  + +P +   L
Sbjct: 649  PSEVMCMALGSVPSGEQRSWFLAVGL-ADNTVRIISLDPSDCLSPRSM--QALPSAAESL 705

Query: 536  CAFE----------------------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSL 573
            C  E                      G  YL   L +G LL  +L+  +G+L D +   L
Sbjct: 706  CIVEMGTVETAANDDDDDDDAPLKTSGCIYLNIGLTNGVLLRTVLDPVSGDLADTRTRYL 765

Query: 574  GTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSL 633
            G++P+ L     + +  V A S R  + Y    +   + ++ + + +   F+S    + +
Sbjct: 766  GSRPVKLFRIQMQGSEAVLAMSSRSWLSYYYQNRFHLTPLSYETLEYASGFSSEQCSEGI 825

Query: 634  AIAKEGELTIGTIDDIQKLHIR-SIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMH 692
                   L I  ++ +  +  + S PL   P+R     ++    I    + +  EE++  
Sbjct: 826  VAISTNTLRILALEKLGAVFNQISFPLEYTPKRFAIHAETGKLIISETDHNAYTEETKTV 885

Query: 693  FVRLLDDQTFE----------------FIS-TYPLDTF---------------------- 713
              R + D+  E                FI+   P D F                      
Sbjct: 886  RKRQMADEMREAAGDDEQELANEMADAFINEVLPEDVFSAPKAGTGMWASQIRVMDPING 945

Query: 714  -----------EYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGR---ILVFIVED- 758
                       E   S+    F+ D   Y   G A  L  +  P  G    I V+ V++ 
Sbjct: 946  HTYSKVQLAQNEAVLSLALVRFAVDQKWYVVAGVAKDL--QINPKIGNGGFIDVYRVDNQ 1003

Query: 759  -GKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHI- 816
              +L+ +   E   A  +L AF G+LLA I + +++Y     D G ++L  +C  + HI 
Sbjct: 1004 TNELEHVHRTEIDDAPGALCAFQGRLLAGIGKVLRMY-----DLGKKKLLRKC-ENKHIP 1057

Query: 817  -LALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGA 875
               + +Q  G  + V D+ +S+  L YK  E  +   A D +  W+++  +LD D  +  
Sbjct: 1058 NQIVNIQGMGQRVYVSDVQESVYCLKYKRPENQLIIFADDTHPRWVTSATLLDYDT-VAT 1116

Query: 876  ENNFNLFTVRKNSEGATDE------------ERGRL-------EVVGEYHLGEFVNRFRH 916
             + F    V +     +D+            +RG L       E +  +HLGE V   + 
Sbjct: 1117 GDKFGNIAVLRLPHSVSDDVDEDPTGNKALWDRGLLNGASQKAENICTFHLGEIVMSLQK 1176

Query: 917  GSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLN 975
             +L+        G   ++I+ T++G +G +      E Y F + L+ ++R     + G +
Sbjct: 1177 ATLI-------PGGSESLIYATMSGTVGALVPFTSREDYDFFQHLEMHMRNENPPLCGRD 1229

Query: 976  HEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            H  +RS+         KN +DGDL E F  L   +   I+  +  +  E+ K++E++
Sbjct: 1230 HLSYRSY-----YYPVKNVMDGDLCEQFTSLDPAKQKSIASDLGRTPSEVAKKLEDI 1281


>gi|378730761|gb|EHY57220.1| pre-mRNA-splicing factor rse1, variant [Exophiala dermatitidis
            NIH/UT8656]
          Length = 914

 Score =  192 bits (489), Expect = 7e-46,   Method: Compositional matrix adjust.
 Identities = 211/894 (23%), Positives = 379/894 (42%), Gaps = 119/894 (13%)

Query: 236  KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKL---------------NLQP 280
            +V  LKI+      +A+++  L +  +++ S  G+    +                +   
Sbjct: 37   EVRALKIKYFDTVPVANSLHILKSGFLFVASESGNHHFYQFEKLGDDDEETEFTSDDYPA 96

Query: 281  DAKGSY------------VEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLR 328
            D   +Y            + +++      P++D  V +L  +   Q+ +  GA    S +
Sbjct: 97   DPTAAYTPAFFHVRPLSNLNLVQSVDAANPLLDCKVANLLDEDAPQIYSICGAGARSSFK 156

Query: 329  IVRNGIGINEQASVEL-QGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEET 387
             +++G+ ++E    EL    + +W+ + + DD +D ++++SF   T  L +++ + +EE 
Sbjct: 157  TLKHGLSVSEIVESELPDKPEAVWTTKLTRDDQYDAYIILSF--RTGTLVLSIGETVEEV 214

Query: 388  EIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANAS 447
               GF S   TL       + LVQV    +R + +  R   NEW +P   S+  AT N  
Sbjct: 215  TDTGFLSTAPTLAVQQLGEDALVQVHPKGIRHIRADKR--VNEWPAPQHRSIVAATTNER 272

Query: 448  QVLLATGGGHLVYLEIG-DGILTEV-KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTD 505
            QV +A   G +VY E+  DG L E  +  ++   ++CL +  + E    S   AVG   D
Sbjct: 273  QVAVALSSGEIVYFEMDTDGSLAEYDERREMTGTVTCLSLGDVPEGRVRSSFLAVGC-DD 331

Query: 506  ISVRIFSL-PDLNLITKEHLGGEIIPRSVLLCAFEGIS------YLLCALGDGHLLNFLL 558
             +VRI SL PD  L  K        P ++ + A    +      YL   L  G  L  +L
Sbjct: 332  STVRILSLDPDSTLENKSVQALTSAPSALSIMAMADSTSGGTTMYLHIGLYSGVYLRTVL 391

Query: 559  NMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKK--LLYSNVNLK 616
            +  TGEL+D +   LG +P  L   S K    V A S RP + Y+  +    + + ++  
Sbjct: 392  DEITGELSDTRTRFLGLRPAKLFRVSVKGQNAVMALSSRPWLGYTDTQTNGFMLTPLDYV 451

Query: 617  EVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQK-LHIRSIPLGEHPRRICHQEQSRTF 675
             + ++  F S   P+ +   +   L I +I+D+ + L   +IPL   PR+         F
Sbjct: 452  PLQYVWNFTSEQCPEGMVGIQGQNLRIFSIEDLSRNLLQENIPLPYTPRKFVKHPDQPLF 511

Query: 676  AICSLKNQSCAEESEMHFVR----------LLDDQTFEF--------------------- 704
             +    N   A  +    +           +L  + F +                     
Sbjct: 512  YVIEADNNVLAPATRQKLLNESTAVNGDAVILPPEEFGYPKATGHWASCIQVVDPVNTKS 571

Query: 705  -ISTYPLDTFEYGCSILSCSF-SDDSNVYYCVGTAY-VLPEENEPTKGRILVF-IVEDGK 760
             + T  L+  E   SI +  F S D  V+  VGTA  ++      + G + V+   EDGK
Sbjct: 572  VVFTLELEDNECATSITTAPFASQDDEVFLIVGTAKDLVVSPRSFSAGFLHVYRFHEDGK 631

Query: 761  -LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILAL 819
             L+ I + + +    +L AF G+LLA +   +++Y     D G +++  +C      L +
Sbjct: 632  ELEFIHKTKVEQPPTALLAFQGRLLAGVGPDLRIY-----DLGMKQMLRKCQVTTPNLIV 686

Query: 820  YVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNF 879
             +QT+G  I+V D+ +S++  +YK +E  +     D  A W +   ++D +   G +   
Sbjct: 687  GLQTQGSRIIVSDIQESVTYCVYKFQENKLIPFCDDVIARWTTCCTMVDYETVAGGDKFG 746

Query: 880  NLFTVR---KNSEGATDE--------ERG-------RLEVVGEYHLGEFVNRFRHGSLVM 921
            NL+ +R   K SE A ++        ERG       RL ++  Y  G+     +  +LV 
Sbjct: 747  NLWMLRCPQKVSEEADEDNSGVHLLHERGYLNGTPNRLSLMIHYFPGDIPTSIQKTNLV- 805

Query: 922  RLPDSDVGQIPTVIFGTVNGVIGVIAS-LPHEQYLFLEKLQTNLRKVIKG--VGGLNHEQ 978
                   G    V +    G +G++   +  E   F + L+  L        + G +H  
Sbjct: 806  ------AGGRDVVFWTGFQGTLGILVPFVSREDVDFFQSLEMQLASSNGNPPLLGRDHLI 859

Query: 979  WRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            +RS+    K V     +DGDL E+F  L   +   I+  ++ SV E+ +++ ++
Sbjct: 860  YRSYYAPSKGV-----IDGDLCETFFLLPNDKKLMIAGELDRSVREIERKISDM 908


>gi|157131296|ref|XP_001662180.1| spliceosomal protein sap [Aedes aegypti]
 gi|108881836|gb|EAT46061.1| AAEL002709-PA [Aedes aegypti]
          Length = 1215

 Score =  192 bits (488), Expect = 9e-46,   Method: Compositional matrix adjust.
 Identities = 251/1187 (21%), Positives = 471/1187 (39%), Gaps = 185/1187 (15%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            +L   ++G I +L  FR  G  +D+  + ++  +  +L+++   ++L             
Sbjct: 53   LLQTEVFGVIRSLMSFRLTGGTKDYAVVGSDSGRIVILEYNPAKNQLEKVHQETFGKSGC 112

Query: 61   RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL- 117
            R    GQ   IDP  R  +IG      L  ++  D++ +L  +  +   +   L    + 
Sbjct: 113  RRIVPGQFLAIDPKGRAVMIGAIEKQKLVYILNRDSEARLTISSPLEAHKSSTLTYHMVG 172

Query: 118  --YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNGA 164
               G   P    L  D ++A    + E A K +    F E     N+        L+  A
Sbjct: 173  VDVGFENPMFACLEIDYEEADLDPSGEAAAKTQQTLTFYELDLGLNHVVRKYSEPLEEHA 232

Query: 165  DLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR---- 215
            + LI VP     P   GVLI  E  + Y   N      IR  I +    +D D  R    
Sbjct: 233  NFLISVPGGNDGP--SGVLICSENYLTY--KNLGDQHDIRCPIPRRRNDLD-DPERGMIF 287

Query: 216  -------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVV 262
                         +L+    G +  + +  + + V+ +K++       A+ +  L    +
Sbjct: 288  ICSATHRTKSMYFFLVQTEQGDIFKVTLETDDDVVSEIKLKYFDTVPPATAMCVLKTGFL 347

Query: 263  YIGSSYGDSQLIKL-------------NLQPDAKGSYVEVLERYV----------NLGPI 299
            ++   +G+  L ++             +  P  +G       R +          +  PI
Sbjct: 348  FVACEFGNHYLYQIAHLGDDDDEPEFSSAMPLEEGDTFFFAPRQLKNLVMVDEIHSYAPI 407

Query: 300  VDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTD 358
            +   V DL  +   Q+    G     S+R++R+G+ ++E A  EL G    +W+++   D
Sbjct: 408  LGCQVADLANEDTPQLYLACGRGPRSSIRVLRHGLEVSEMAVSELPGNPNAVWTVKKRID 467

Query: 359  DPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVR 418
            + FD +++VSF++ T +L++   D +EE    GF   T TL C     + LVQV    +R
Sbjct: 468  EEFDAYIIVSFVNATLVLSIG--DTVEEVTDSGFLGTTPTLSCSALGDDALVQVYPDGIR 525

Query: 419  LVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKHAQL 476
             + +  R   NEWK+P   ++     N  QV++A  GG LVY E+   G L E  +  ++
Sbjct: 526  HIRADKR--VNEWKAPGKKTITKCAVNQRQVVIALSGGELVYFEMDPTGQLNEYTERKKM 583

Query: 477  EYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSV-LL 535
              ++ C+ +  +      S   AVG+  D +VR+ SL   + ++   +  + +P +   L
Sbjct: 584  PSDVMCMALGSVPAGEQRSWFLAVGL-ADNTVRVISLDPSDCLSPRSM--QALPSAAESL 640

Query: 536  CAFE---------------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITL 580
            C  E               G  YL   L +G LL  +L+  +G+L D +   LG++P+ L
Sbjct: 641  CIVEMGTGDTNEEGVASSAGCIYLNIGLTNGVLLRTVLDPVSGDLADTRTRYLGSRPVKL 700

Query: 581  RTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGE 640
                 + +  V A S R  + Y    +   + ++ + + +   F+S    + +       
Sbjct: 701  FRIKMQGSEAVLAMSSRTWLSYYFQNRFHLTPLSYETLEYASGFSSEQCSEGIVAISTNT 760

Query: 641  LTIGTIDDIQKLHIR-SIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDD 699
            L I  ++ +  +  + + PL   P+R     ++    I    + +  EE++    + + D
Sbjct: 761  LRILALEKLGAVFNQITFPLEYTPKRFLIHNETGKLVISETDHNAYTEETKNIRKKQMAD 820

Query: 700  QTFE----------------FIS-TYPLDTF----------------------------- 713
            +  E                FI+   P D F                             
Sbjct: 821  EMREAAGEDEQELANEMADAFINEVLPEDVFSAPKAGTGMWASQIRVMDPINGHTYSKVQ 880

Query: 714  ----EYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKG---RILVFIVEDGKLQLIAE 766
                E   S+    F+ D   Y   G A  L    +   G    +  + V   +L+    
Sbjct: 881  LAQNEAVLSMALVRFAVDQKWYVVAGVAKDLQMNPKIANGGFIDVYKYDVHTHQLEHYHR 940

Query: 767  KETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHI--LALYVQTR 824
             E   A  ++  F G++L  + + +++Y     D G ++L  +C  + HI    + +Q  
Sbjct: 941  TEIDDAPGAIAGFQGRVLVGVGRVLRIY-----DLGKKKLLRKC-ENKHIPNQIVNIQAM 994

Query: 825  GDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTV 884
            G  + V D+ +SI  + YK  E  +   A D +  W++   +LD D    A+   N+  +
Sbjct: 995  GSRVFVSDVQESIYCIKYKRAENQLIIFADDTHPRWITTSTLLDYDTVATADKFGNIAIL 1054

Query: 885  RKNSEGATD-----------EERGRL-------EVVGEYHLGEFVNRFRHGSLVMRLPDS 926
            R     + D            +RG L       E +  +HLGE +   +  +L+      
Sbjct: 1055 RLPHSVSDDVDEDPTGNKALWDRGLLNGASQKAENICTFHLGETIMSLQKATLI------ 1108

Query: 927  DVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNE 985
              G   ++I+ T++G +G +      E Y F + L+ ++R     + G +H  +RS+   
Sbjct: 1109 -PGGSESLIYATMSGTVGALVPFTSREDYDFFQHLEMHMRNENTPLCGRDHLSYRSY--- 1164

Query: 986  KKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
                  K+ +DGDL E F  +   +   I+  +  +  E+ K++E++
Sbjct: 1165 --YYPVKHVMDGDLCEQFTSMDPAKQKSIASDLGRTPNEVAKKLEDI 1209


>gi|443918546|gb|ELU38987.1| CPSF A subunit region domain-containing protein [Rhizoctonia solani
            AG-1 IA]
          Length = 1037

 Score =  191 bits (485), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 217/845 (25%), Positives = 356/845 (42%), Gaps = 97/845 (11%)

Query: 212  DGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDS 271
            DG   ++GD  G LHL  ++ E +    +   LLG+ S    +   + +V     S G  
Sbjct: 219  DGKHIIVGDSFGHLHLATMSMEPQ--FAMSCMLLGDVSTCMILLDPEGSV-----SRGKK 271

Query: 272  QLIKLNLQPDAKGSYVEVL-ERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIV 330
            +      QPD + +++EV+ E   NL PI+D  +V+++  GQ ++   SG    G L +V
Sbjct: 272  K------QPDDETTHLEVIVEHNKNLAPILDATLVEIDGSGQPRIALISGDESGGWLSVV 325

Query: 331  RNGIGINEQASVE-LQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDE----LE 385
              G    E A ++ L  ++ ++ L+   D+   ++LV S  + T IL+  L D     L 
Sbjct: 326  HKGASFRELAILDGLGHLENIFPLKKYFDERTHSYLVASTTTSTYILS--LADSSISLLS 383

Query: 386  ETEIEGFCSQTQTLFCHDAIY---NQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVA 442
              E+ G      T+   +      +  + VT   + L+   +    + WK P G     A
Sbjct: 384  REELSGISRSETTILASNVPLGGVDAAIHVTPQKIVLIDIITGRAISSWKPPKGDITAAA 443

Query: 443  -TANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVG 501
               ++S + +AT  GHL  L I    L +    +  ++IS L IN        + I A  
Sbjct: 444  LDTSSSTICVATSEGHLFSLNIQSAGLLQT--GKPHWQISSLAIN--------AGIVAAA 493

Query: 502  MWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEGISYL--LCA--LGDGHLLNFL 557
             W    V+IF LP L  +  E +  E    SV+  +  GI  +   CA    + +  +++
Sbjct: 494  FWGSNEVQIFLLPGLKRV-GESVIQEPSAASVVYLSNFGIPNVPDRCAPRWSNCYTSDYI 552

Query: 558  LNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDR---PTVIYSSNKKLLYSNVN 614
                  +            P   R+ S  + +  F  S      T   S   K    +V 
Sbjct: 553  QGRTRAQYA-------ACSPTGCRSCSYHDGSRAFRGSGHRCWETFNGSFGCKQAPFSVL 605

Query: 615  LKEVSHMCPFNSAAFPDSLAIAKEGELTIGTID---DIQKLHIRSIPLGEHPRRICHQEQ 671
            +  +       S  F D     K G + IG      D Q L  +     + P  + H E+
Sbjct: 606  VAPIGEQNGHQSKCFSD----IKSGCMRIGGSQRTWDSQYLFYQYNLGSDTPLCLAHHEE 661

Query: 672  SRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVY 731
               +AI  ++ ++   E E HFVR  DD TF+ +  Y L   E   SI    ++   N Y
Sbjct: 662  LSKYAIGCVRPRTDVLE-ERHFVRFHDDSTFKDLGQYKLKYSEMVTSI--GVYTHGGNSY 718

Query: 732  YCVGTAYVLPEENEPTKGRILVFIV-EDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQK 790
               GTA + P ENEP  GRI++F   E+  ++  A K+ +G V S+     +++AAI   
Sbjct: 719  ILAGTAIINPGENEPLAGRIILFGQDEENMIKFKASKDVEGGVSSIKQLGARIIAAIGHG 778

Query: 791  IQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGA-- 848
            I LY  + R + T          G+I+   +  R + IVV D ++S+S+L +        
Sbjct: 779  IYLYN-LGRGEVTISDPVARWERGYIVHDII-VRPNMIVVSDRLRSVSVLRFIERTSTPE 836

Query: 849  ---------------IEERARDYNANWMSAVEIL-DDDIYLGAENNFNLFTVRKNSEGAT 892
                            E  A D +A W ++VE+L D+   + ++ + N+ T         
Sbjct: 837  SHEEIETEEDSTILQFETVAMDMHAVWPTSVEVLPDNKTIIASQTDGNILT--------W 888

Query: 893  DEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHE 952
            + E G LE    +H GE +++F     +     S  G     IF T  G IG ++++   
Sbjct: 889  ELEDGNLEPRAAFHTGEIIHKF-----IASTAKSSAGPRTVAIFVTNTGRIGTLSTVDDA 943

Query: 953  QYLFLEKLQTNLRKVIKGVGGLNHEQWRS---FNNEKKTVDAKNFLDGDLIESFLDLSRT 1009
              L L +L+  L   IKG+G + H +WR+    +   K    +   DGD I+ FL+LS  
Sbjct: 944  DALQLTRLEMKLGDAIKGLGNIKHPEWRAPKLLHTGTKPPPRRGVTDGDFIKKFLELSSE 1003

Query: 1010 RMDEI 1014
                I
Sbjct: 1004 EAKRI 1008


>gi|401883281|gb|EJT47496.1| U2 snRNA binding protein [Trichosporon asahii var. asahii CBS 2479]
          Length = 1216

 Score =  191 bits (484), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 267/1174 (22%), Positives = 471/1174 (40%), Gaps = 170/1174 (14%)

Query: 7    YGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTDN 65
            +G +  +  FR  G  +D++ ++++  +  +++     +        +V  + G R T  
Sbjct: 59   FGTVRNVAAFRLAGMGKDYIVLSSDSGRLSIIELVISPTPHFESLYQEVYGKSGSRRTIP 118

Query: 66   GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLY---GC 120
            GQ   +DP  R  + G      L  ++  + +G++  +  +   +   L +  +    G 
Sbjct: 119  GQFLAVDPKGRSAMFGAVEKQKLCYILNRNTEGKVYPSSPLEAHKNHTLVVNMIACDTGY 178

Query: 121  AKPTIVVLYQDNKDARHVKT---YEVALKDKDFVE---------GPWSQNNLDNGADLLI 168
              P    L  D  D+ H  T   Y  A K   F E           WS+   D  A+LL+
Sbjct: 179  DNPMFAALELDYGDSDHDATGEAYRAAEKHLTFYELDLGLNHVVRKWSEPT-DRRANLLV 237

Query: 169  PVP------------PPLCGVLIIGEETIVY--CSANAFKA-IPIRPSITKAYGRVDA-- 211
             VP            P   GVL+  E+ +++    A A +  IP R +     G+     
Sbjct: 238  QVPGGQNANTDRFDGP--GGVLVCTEDYVIWKHMDAEAHRVPIPRRRNPMAKPGQSSRGI 295

Query: 212  ----------DGSRY-LLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNA 260
                       GS + LL    G L    I HE E V  L+I+      +A+++  L + 
Sbjct: 296  IIVAAVTHKIKGSFFFLLQSEDGDLFKATIEHEGEDVRALRIKYFDTVPVATSLCILKSG 355

Query: 261  VVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLG----------------------- 297
             +++ S +GD  L +     D  G        Y   G                       
Sbjct: 356  YLFVASEFGDQGLYQFQSLADDDGEREWSSTDYPGFGLGEEHLPYAFFQPRPLQNLLLAD 415

Query: 298  ------PIVDFCVVDL--ERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-K 348
                  PI+D  VV+L        Q+    G     + R +++G+ IN      L G+  
Sbjct: 416  TLSSLDPILDAQVVNLLGNASDTPQIYAACGRGPRSTFRSLKHGLDINVLVESPLPGVPN 475

Query: 349  GMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQ 408
             +W+L+ S DD +D+++V+SF + T +L++   + +EE    GF S   TL         
Sbjct: 476  AVWTLKLSEDDEYDSYIVLSFPNGTLVLSIG--ETIEEVNDTGFLSSGPTLAVQQLGSAG 533

Query: 409  LVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGI 467
            L+QV    +R + +  R   +EW +P G ++  AT N +QV++A     LVY E+  +G 
Sbjct: 534  LLQVHPAGLRHIRAADRV--DEWNAPAGTTIVSATTNKNQVVIALSTQELVYFELDEEGS 591

Query: 468  LTEVKHAQ-LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLG 525
            L E +  + L    +C+ I  + E    +   AVG   + +V I SL PD  L T     
Sbjct: 592  LNEYQERKSLPGNATCVSIADVPEGRRRTPYLAVGC-DNQTVHIISLEPDNTLETLSLQA 650

Query: 526  GEIIPRSVLLCAFEGIS--------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQP 577
                P  + L      S        +L   L +G LL  +++   G L+D +   LG++P
Sbjct: 651  LTAPPSDICLAEIFDTSIDKNRATMFLTIGLLNGVLLRTVVDPVDGSLSDTRLRFLGSKP 710

Query: 578  ITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAK 637
              +   +   +  V A S R  ++Y+    L    +    + + C  ++A  PD L    
Sbjct: 711  PRIVRSAVHGSPAVMAFSSRTWLLYTYQDLLQTQPLIYDMLEYACNLSAAMCPDGLIGIS 770

Query: 638  EGELTIGTIDDI-QKLHIRSIPLGEHPRRICHQEQSRTF-------AICSLKNQSCAEES 689
               L I TI  +  KL   S+PL   PR+      +  F        + S +    A E 
Sbjct: 771  GNTLRIFTIPRLGTKLKQDSMPLSYTPRKFVSYPYNTVFYGIEADHRVWSAQAMEKATEE 830

Query: 690  EMH-----------------------------FVRLLDDQTFEFISTYPLDTFEYGCSIL 720
            +                                +R++D      + T  LD  E   S+ 
Sbjct: 831  KKREGDEKALAVLELPPAEFGRVRAPAGHWGSLIRVIDPLQNTTLQTLELDEDEAAFSLA 890

Query: 721  SCSFSD-DSNVYYCVGTAY---VLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSL 776
             C F          VGT     ++P+  +    R+     E  +L+ + + +T     ++
Sbjct: 891  VCYFERMGGEPSLVVGTGVKTTLMPKGCKEGWLRVYAIRDEGKRLEFMHKTKTDDVPLAV 950

Query: 777  NAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY-VQTRGDFIVVGDLMK 835
             AF G LLA + + ++LY+      G + L  +C ++G    +  +   G  I+VGDL +
Sbjct: 951  AAFQGYLLAGVGKSLRLYEM-----GKKALLRKCENNGFPTGVATINVVGARIIVGDLQE 1005

Query: 836  SISLLIYKHEEGAIEER-----ARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KN 887
            S    +Y+    +I  R     A D    +++AV  +D D    A+   N+F  R   + 
Sbjct: 1006 STFYCVYR----SIPSRQLLIFADDSQPRFLTAVCNVDYDTVCCADKFGNIFVNRLEERV 1061

Query: 888  SEGATDEERGRL------EVVGEYHLGEFVNRFRHGSLVMRLPDSDV--GQIPTVIFGTV 939
            SE   D+  G +       ++G  +  + +  +  GS+V  L    V  G    V++ T+
Sbjct: 1062 SEKVDDDPTGAVILHEKGFLMGSANKTDLIAHYNVGSVVTSLTKVSVAPGGRDVVVYTTI 1121

Query: 940  NGVIGVIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGD 998
            +G +G +   + ++   F+  L+ ++R +   + G +H  +R +    K V     +DGD
Sbjct: 1122 SGAVGALVPFISNDDVEFMTTLEMHIRSLNTSLVGRDHLAYRGYYAPVKAV-----VDGD 1176

Query: 999  LIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            L ESF  L   +   I+  ++ +V E+ K++E+L
Sbjct: 1177 LCESFNMLPYPQQQAIAADLDRNVGEVLKKLEQL 1210


>gi|430813298|emb|CCJ29330.1| unnamed protein product [Pneumocystis jirovecii]
          Length = 1197

 Score =  190 bits (482), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 256/1169 (21%), Positives = 482/1169 (41%), Gaps = 174/1169 (14%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            +L   ++G I TL  FR  G  +D L + ++  +  +L++  +S+   ++   +   + G
Sbjct: 60   LLSHDVHGIIRTLVGFRLAGTNKDHLIVGSDSGRITILEYKPDSNAF-SKVHQETYGKSG 118

Query: 61   -RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
             R    GQ   +DP  R  +I     + L  V+  D+   L  +  +   +   L    +
Sbjct: 119  VRRVVPGQYLAVDPKGRATMIASIEKNKLVYVLNRDSATNLTISSPLEAHKSCSLVFHLI 178

Query: 118  ---YGCAKPTIVVLYQDNKDA---------RHVKT----YEVALKDKDFVEGPWSQNNLD 161
                G   P    L  D  +A         R ++     YE+ L     V   WS + +D
Sbjct: 179  GMDVGYENPVFAALEVDYTEAESDPSGKAYREIQKVLTYYELDLGLNHVVR-KWS-DPVD 236

Query: 162  NGADLLIPVP-----PPLCGVLIIGEETIVYC-SANAFKAIPIRPSITKAYGRVDADGSR 215
              A+LL+ VP     P   G L+  E +I Y         IPI   I    G ++    +
Sbjct: 237  RKANLLVTVPGGSDGP--SGALVCTEGSIFYKHKGKKTHRIPIPTRI----GSLENSQKK 290

Query: 216  ----------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDN 259
                            +LL +  G L  + I     +V  LKI+      +++ +S L +
Sbjct: 291  QIIVSSVVHKMRGAFFFLLQNEDGDLFKVTIDSNDGEVESLKIKYFDTVPVSTGLSILKS 350

Query: 260  AVVYIGSSYGDSQLIK--------------------LNLQPDAKGSYVE--------VLE 291
              +++ S YG+  L +                    L+L    + SY          +++
Sbjct: 351  GFLFVASEYGNHHLYQFEKLGDDNNEIEFSSVDFPVLDLNEGYEPSYFRPRSLENLLLVD 410

Query: 292  RYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIK-GM 350
               ++ P++D  +++L  +   Q+    G     + R +R G+ +NE  +  L G    +
Sbjct: 411  DLNSMNPLMDSKILNLTDEDAPQIYALCGRGPRSTFRTLRYGLEVNEIVASGLPGSPTAV 470

Query: 351  WSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLV 410
            W+ + ++ D +D ++V+SF++ T +L++   + +EE    GF S + TL       + L+
Sbjct: 471  WTTKLTSSDQYDAYIVLSFVNGTLVLSIG--ETVEEVSDTGFLSSSPTLAVQQLGDDALI 528

Query: 411  QVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD-GILT 469
            QV    +R + +  R   NEWK+P   S+   T N  QV++A   G +VY E+ D G L 
Sbjct: 529  QVHPKGIRHIQADKRV--NEWKTPQHRSIVQVTTNQRQVVVALSNGEIVYFELDDEGNLN 586

Query: 470  EVK-HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEI 528
            E +   Q+   ++ L I  + E        AVG   D +VRI SL   N  T E L  + 
Sbjct: 587  EYQDRKQMTGSVTSLSIGHVPEGRQRHPFLAVGC-DDSTVRIISLEPEN--TLESLSVQA 643

Query: 529  I---PRSVLLCAFEG----ISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLR 581
            +   P S+ + + +       YL   L +G  L  +L + TG+L D +   LG++P+ L 
Sbjct: 644  LTAPPNSLCIMSMKDGNIETFYLHIGLFNGVYLRSVLELSTGQLKDTRTRFLGSRPVKLF 703

Query: 582  TFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGEL 641
            + + +N   V A S RP + Y  N  L    +    + +   F+S    + +   +  +L
Sbjct: 704  SVTIQNQLSVIALSSRPWLSYIINASLHLVPLIYDSLEYCWGFSSEQCSEGIVGIQGQDL 763

Query: 642  TIGTIDDIQK-LHIRSIPLGEHPRR-ICHQEQSRTFAICSLKNQSCAEESEMHF------ 693
             I  ++ +   L   SI L   PRR I H ++   + I S  N     E +         
Sbjct: 764  KIFMVERLDNVLKQDSISLMYTPRRFIKHPDEHIFYIIESDHNVLPLSERQKRVEGLQNG 823

Query: 694  ----------------------VRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSN-V 730
                                  + +LD  + + ++   LD  E   SI   +F + ++ +
Sbjct: 824  DHILLPEDIGLPRGLSGNWASCITILDPLSKKILTRIELDDNEAAFSIAMVTFKNQNDEI 883

Query: 731  YYCVGTA---YVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAI 787
            +  +G+     + P+        I  FI +   ++L+ + E      +L  F G+LLA +
Sbjct: 884  FLAIGSGKNVILAPKSFSAAYISIYRFIDQGKSIELVHKTEVDDIPLALLGFQGRLLAGL 943

Query: 788  NQKIQLYKWMLRDDGTRELQSECGHHGHILA-----LYVQTRGDFIVVGDLMKSISLLIY 842
             + +++Y+  ++         +C     + A     + + T+G  I++ D+ +SI   +Y
Sbjct: 944  GKMLRIYEMGMK---------KCLRKCEVRAVPNCIVQLHTQGSRIIIADIQESIHFAVY 994

Query: 843  KHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDEERG-- 897
            K+ E  +   A D    W +   +LD +     +   N +  R   + SE A ++  G  
Sbjct: 995  KYLENRLIVFADDVIPRWTTTSTMLDYETVAAGDKFGNFWINRCPLEVSESADEDPSGAQ 1054

Query: 898  -------------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIG 944
                         RL+++  +++G+         L+        G    +++  + G IG
Sbjct: 1055 LIHEKSYLFGAAKRLKMLAHFYIGDTFTSMHKVQLI-------AGGRDIIVYTGMMGSIG 1107

Query: 945  V-IASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESF 1003
            + +  +  E   F ++L+  +R     + G +H  +R +      V  K+ +DGDL E F
Sbjct: 1108 IFLPFVGREDVDFFQQLEALMRTEDLSLIGRDHLMYRGY-----YVPVKSVVDGDLCERF 1162

Query: 1004 LDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            L L   +   I+  ++  + E+ K++E++
Sbjct: 1163 LMLPYNKKQVIANELDREISEIAKKIEDM 1191


>gi|58258783|ref|XP_566804.1| U2 snRNA binding protein [Cryptococcus neoformans var. neoformans
            JEC21]
 gi|338819361|sp|P0CR23.1|RSE1_CRYNB RecName: Full=Pre-mRNA-splicing factor RSE1
 gi|338819362|sp|P0CR22.1|RSE1_CRYNJ RecName: Full=Pre-mRNA-splicing factor RSE1
 gi|57222941|gb|AAW40985.1| U2 snRNA binding protein, putative [Cryptococcus neoformans var.
            neoformans JEC21]
          Length = 1217

 Score =  190 bits (482), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 277/1182 (23%), Positives = 476/1182 (40%), Gaps = 186/1182 (15%)

Query: 7    YGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTDN 65
            +G I  +  FR  G  +D++   ++  +  +L++    +        +V  + G R    
Sbjct: 60   FGTIRNIAGFRLAGMTKDYILATSDSGRLSILEFVISPTPHFESLYQEVFGKSGSRRIVP 119

Query: 66   GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YGC 120
            GQ   +DP  R  L+G      L  V+  + +G+L  +  +   +   L    +    G 
Sbjct: 120  GQFLAVDPKGRSCLVGSLEKTKLVYVLNRNTEGKLYPSSPLEAHKNHTLVTHIVGVDQGY 179

Query: 121  AKPTIVVLYQDNKDA-------------RHVKTYEVALKDKDFVEGPWSQNNLDNGADLL 167
              P    L  D  ++             +H+  YE+ L     V   WS+   D  A+LL
Sbjct: 180  DNPLYAALETDYSESDQDSTGEAYENTQKHLTFYELDLGLNHVVRK-WSEPT-DRRANLL 237

Query: 168  IPVP------------PPLCGVLIIGEETIVYCSANA-FKAIPIRPSITKAYGRVDADGS 214
            + VP            P   GVL+  E+ I++   +     IPI         R   D S
Sbjct: 238  VQVPGGQNANSDRFEGP--SGVLVCTEDHIIWKHMDVEAHRIPIPRRRNPLVQR--GDKS 293

Query: 215  R-----------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYL 257
            R                 +LL    G L+ + I H  E V  LKI+      +A+++  L
Sbjct: 294  RGLIIVSAVMHKIKGAFFFLLQSEDGDLYKVWIEHNGEDVVALKIKYFDTVPVANSLCIL 353

Query: 258  DNAVVYIGSSYGDSQLIKLNLQPDAKG-------SYVE---------------------- 288
                +Y+ S + D  L +     +  G        Y E                      
Sbjct: 354  KRGYIYVASEFSDQNLYQFQSLAEDDGEQEWSSTDYPENGNIDGPLPFAFFDPQPLRNLL 413

Query: 289  VLERYVNLGPIVDFCVVDL--ERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG 346
            +++   +L PI D  VV+L        Q+    G     + R +++G+ + E  S  L G
Sbjct: 414  LVDTVPSLDPITDAHVVNLLGASSDTPQIYAACGRGARSTFRTLKHGLDVAEMVSSPLPG 473

Query: 347  IK-GMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAI 405
            +   +W+L+ + DD +D+++V+SF + T +L++   + +EE    GF S   TL      
Sbjct: 474  VPTNVWTLKLTEDDEYDSYIVLSFPNGTLVLSIG--ETIEEVNDTGFLSSGPTLAVQQLG 531

Query: 406  YNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG- 464
               L+QV    +R + +  R   +EW +PPG ++  AT N  QV++A     LVY E+  
Sbjct: 532  NAGLLQVHPYGLRHIRAADRV--DEWPAPPGQTIVAATTNRRQVVIALSTAELVYFELDP 589

Query: 465  DGILTEVKHAQ-LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKE 522
            +G L+E +  + L    +C+ I  + E    +   AVG   + +V I SL PD  L T  
Sbjct: 590  EGSLSEYQEKKALPGNATCVTIAEVPEGRRRTSFLAVGC-DNQTVSIISLEPDSTLDTLS 648

Query: 523  HLGGEIIPRSVLLCAFEGIS--------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLG 574
                   P S+ L      S        +L   L +G LL  +++   G L+D +   LG
Sbjct: 649  LQALTAPPTSICLAEIFDTSIDKNRATMFLNIGLMNGVLLRTVVDPVDGSLSDTRLRFLG 708

Query: 575  TQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLA 634
             +P  L   + +    V A S R  ++Y+    L    +    + +    ++A  PD L 
Sbjct: 709  AKPPKLVRANVQGQPSVMAFSSRTWLLYTYQDMLQTQPLIYDTLEYAWSLSAAMCPDGLI 768

Query: 635  IAKEGELTIGTIDDI-QKLHIRSIPLGEHPRR-ICHQEQS---------RTFAICSLKNQ 683
                  L I  I  + +KL   S  L   PR+ I H   S         RT++  +++  
Sbjct: 769  GISGNTLRIFNIPKLGEKLKQDSTALTYTPRKFISHPFNSVFYMIEADHRTYSKSAIERI 828

Query: 684  SCAEESE--------------------------MHFVRLLDDQTFEFISTYPLDTFEYGC 717
               +ESE                             VR+LD    E I T  LD  E   
Sbjct: 829  VKQKESEGRRVDTLLLDLPANEFGRPRAPAGHWASCVRVLDPLANETIMTLDLDEDEAAF 888

Query: 718  SILSCSFS-DDSNVYYCVGTAYVLPEENEPTK-GRILVF-IVEDGK-LQLIAEKETKGAV 773
            SI    F       +  VGT      + +  K G + V+ I E G+ L+ + + +T    
Sbjct: 889  SIAIAYFERGGGEPFLVVGTGVKTTLQPKGCKEGYLRVYAIKEQGRILEFLHKTKTDDIP 948

Query: 774  YSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY-VQTRGDFIVVGD 832
              L  F G LLA I + ++LY+      G + L  +C ++G   A+  +  +G  I+VGD
Sbjct: 949  LCLAGFQGFLLAGIGKSLRLYEM-----GKKALLRKCENNGFPTAVVTINVQGARIIVGD 1003

Query: 833  LMKSISLLIYKHEEGAIEER-----ARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKN 887
            + +S    +Y+    +I  R     A D    W++ V  +D +     +   N+F  R +
Sbjct: 1004 MQESTFYCVYR----SIPTRQLLIFADDSQPRWITCVTSVDYETVACGDKFGNIFINRLD 1059

Query: 888  ---SEGATDEERG------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTV---- 934
               SE   D+  G      +  ++G  H  E +  +  GS+V     + + +IP V    
Sbjct: 1060 PSISEKVDDDPTGATILHEKSFLMGAAHKTEMIGHYNIGSVV-----TSITKIPLVAGGR 1114

Query: 935  ---IFGTVNGVIGVIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVD 990
               ++ T++G +G +   +  +   F+  L+ ++R     + G +H  +R +      V 
Sbjct: 1115 DVLVYTTISGAVGALVPFVSSDDIEFMSTLEMHMRTQDISLVGRDHIAYRGY-----YVP 1169

Query: 991  AKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
             K  +DGDL ESF  L   +   I+  ++ SV ++ K++E++
Sbjct: 1170 IKGVVDGDLCESFSLLPYPKQQAIALDLDRSVGDVLKKLEQM 1211


>gi|391341059|ref|XP_003744849.1| PREDICTED: splicing factor 3B subunit 3-like isoform 2 [Metaseiulus
            occidentalis]
          Length = 1223

 Score =  190 bits (482), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 249/1186 (20%), Positives = 481/1186 (40%), Gaps = 185/1186 (15%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
            ++G I +L  FR  G ++D+L + ++  + C+L+++   + L    M        R    
Sbjct: 58   VFGVIRSLMSFRLTGGSKDYLIVGSDSGRICILEFNPAKNVLEKIHMETFGKSGCRRIVP 117

Query: 66   GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YGC 120
            GQ   +DP  R  +IG      L  ++  D+  +L  +  +   +   L    +    G 
Sbjct: 118  GQYLAVDPKGRAVMIGAVEKQKLVYILNRDSNARLTISSPLEAHKSNTLVYHMVGVDVGF 177

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNGADLLIP 169
              P    +  D ++A +  T E A K +    F E     N+        L+  ++ LI 
Sbjct: 178  ENPVFACIELDYEEADNDHTGEAAKKAQQSLTFYELDLGLNHVVRKYSEPLEEPSNFLIA 237

Query: 170  VPPPL---CGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR----------- 215
            VP  +    GVL+  E  I Y   N      IR  I +    +D D  R           
Sbjct: 238  VPGGVDGPSGVLVCSENFITY--KNLGDQADIRCPIPRRRNDLD-DPDRGMLFVCSTQHK 294

Query: 216  ------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYG 269
                  +L     G +  + +  + + VT +K++      +A T+  L +  +++ S +G
Sbjct: 295  TKSMFFFLAQTEQGDIFKITLEFDDDAVTEIKLKYFDSLPVAQTMHVLKSGFLFVASEFG 354

Query: 270  DSQLIKL-------------NLQPDAKGSYVEVLERYV----------NLGPIVDFCVVD 306
            +  L ++             ++ P  +G     L R +          +L PI+   V D
Sbjct: 355  NHSLYQIAHLGDNTDEPEFSSIFPLEEGDTFFFLPRELKNLVLVDEMDSLSPIMTARVAD 414

Query: 307  LERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTDDPFDTFL 365
            L  +   Q+    G     ++R++R+G+ ++E A  EL G    +W+++   DD +D ++
Sbjct: 415  LTNEDTPQLYAACGRGPRSTMRVLRHGLEVSEMAVSELPGNPSAVWTVKKRADDEYDAYI 474

Query: 366  VVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSR 425
            VVSFI+ T +L++   + +EE    GF   T TL CH   ++ LVQ+    +R + +  R
Sbjct: 475  VVSFINATLVLSIG--ETVEEVTDSGFLGTTPTLACHQIGHDALVQIYPEGIRHIRADRR 532

Query: 426  ELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTE-VKHAQLEYEISCL 483
               NEW++     +     N  QV++A  GG L+Y E+   G L E  +  ++  ++ C+
Sbjct: 533  --VNEWRTSGKKLIVKCAVNQRQVVIALTGGELIYFEMDSSGQLNEYAERKEMNSDVLCM 590

Query: 484  DINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGG--EIIPRSVLL-----C 536
             +  +      ++  AVG  +D +V + SL   + ++   + G  E  P S+ +     C
Sbjct: 591  ALGSVPAGEQRTKFLAVGS-SDGTVHVISLDPKSCLSILSVQGMTESNPESLAIVELKAC 649

Query: 537  AFEGIS---------------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLR 581
                IS               YL   L +G L   +L+  TG+++D +   LG++ + L 
Sbjct: 650  FLPSISVDMSGHEEGSGSSALYLNIGLQNGILKRMVLDAVTGDMSDTRTRFLGSRSVKLF 709

Query: 582  TFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGEL 641
                + T  V A S R  + Y    +   + ++   + +   F S   P+ +       L
Sbjct: 710  KIKMQGTDAVLAMSSRCWLSYLFQNRFHLTPLSYDSLEYASGFCSEQCPEGIVAIAGNTL 769

Query: 642  TIGTIDDIQKLHIR-SIPLGEHPRRICHQEQSR------------TFAICSLKNQSCAEE 688
             I  ++ +  +  + S PL   PR     ++S+            T ++   + Q  AEE
Sbjct: 770  RILALEKLGAVFNQLSAPLSFTPRSFVIDKKSQKLIVIETDHNAYTDSVKEKRKQHVAEE 829

Query: 689  -------------------------SEMHF-------------VRLLDDQTFEFISTYPL 710
                                      E  F             +R+L+      +    L
Sbjct: 830  MIEAAGEAEQEMAAELAAAFLSEDLPESQFGAPRAGKKHWASVIRILNPSDLSTVYKIQL 889

Query: 711  DTFEYGCSILSCSFSDDSNVYYCVGTAYVL---PEENEPTKGRILVFIVEDGKLQLIAEK 767
            +  E   S+   +F   ++    VG +  L   P E   +       + +  +L+L+   
Sbjct: 890  EQNEAAVSVALVNFDKSTDPILLVGISKDLQLSPRECRNSFINAYRVVKDCTELELVHTT 949

Query: 768  ETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHI--LALYVQTRG 825
                   ++  F  ++L  + + +++Y     D G +++  +C  + HI  L + +   G
Sbjct: 950  VMDDVPQAMCNFGNRVLIGVGRCLRIY-----DFGKKKMLRKC-ENKHIPNLIVTINAVG 1003

Query: 826  DFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR 885
            + IVVGD+ +S   + Y+  E  +   A D+   W +A  ++D    +G +   N++ +R
Sbjct: 1004 NRIVVGDVQESFFFIRYRMLENQLIIFADDFTPRWTTAACMVDYRTVVGGDKFGNVYILR 1063

Query: 886  --KNSEGATDE---------ERGRL-------EVVGEYHLGEFVNRFRHGSLVMRLPDSD 927
               N+    DE         +RG L       EV+   H+GE +   +  +L+   P++ 
Sbjct: 1064 LPGNTSDDVDEDPTGVRSLWDRGWLGGAGQKAEVLSMTHVGELIVSLQKTALIPGGPEA- 1122

Query: 928  VGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEK 986
                  +++ T+ G +G +      + + F + L+  +R     + G +H  +RS+    
Sbjct: 1123 ------IVYTTIAGGVGALIPFSSKDDHEFFQHLEMYMRTEHPPICGRDHLSFRSYYFPV 1176

Query: 987  KTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            K V     +DGDL E +  L   +  +I+  +     E+ K++E++
Sbjct: 1177 KAV-----IDGDLCEQYNSLDANKQKQIADELERLPHEVAKKLEDI 1217


>gi|406698009|gb|EKD01256.1| U2 snRNA binding protein [Trichosporon asahii var. asahii CBS 8904]
          Length = 1216

 Score =  190 bits (482), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 266/1174 (22%), Positives = 471/1174 (40%), Gaps = 170/1174 (14%)

Query: 7    YGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTDN 65
            +G +  +  FR  G  +D++ ++++  +  +++     +        +V  + G R T  
Sbjct: 59   FGTVRNVAAFRLAGMGKDYIVLSSDSGRLSIIELVISPTPHFESLYQEVYGKSGSRRTIP 118

Query: 66   GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLY---GC 120
            GQ   +DP  R  + G      L  ++  + +G++  +  +   +   L +  +    G 
Sbjct: 119  GQFLAVDPKGRSAMFGAVEKQKLCYILNRNTEGKVYPSSPLEAHKNHTLVVNMIACDTGY 178

Query: 121  AKPTIVVLYQDNKDARHVKT---YEVALKDKDFVE---------GPWSQNNLDNGADLLI 168
              P    L  D  D+ H  T   Y  A K   F E           WS+   D  A+LL+
Sbjct: 179  DNPMFAALELDYGDSDHDATGEAYRAAEKHLTFYELDLGLNHVVRKWSEPT-DRRANLLV 237

Query: 169  PVP------------PPLCGVLIIGEETIVY--CSANAFKA-IPIRPSITKAYGRVDA-- 211
             VP            P   GVL+  E+ +++    A A +  IP R +     G+     
Sbjct: 238  QVPGGQNANTDRFDGP--GGVLVCTEDYVIWKHMDAEAHRVPIPRRRNPMAKPGQSSRGI 295

Query: 212  ----------DGSRY-LLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNA 260
                       GS + LL    G L    I HE E V  L+I+      +A+++  L + 
Sbjct: 296  IIVAAVTHKIKGSFFFLLQSEDGDLFKATIEHEGEDVRALRIKYFDTVPVATSLCILKSG 355

Query: 261  VVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLG----------------------- 297
             +++ S +GD  L +     D  G        Y   G                       
Sbjct: 356  YLFVASEFGDQGLYQFQSLADDDGEREWSSTDYPGFGLGEEHLPYAFFQPRPLQNLLLAD 415

Query: 298  ------PIVDFCVVDL--ERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-K 348
                  PI+D  VV+L        Q+    G     + R +++G+ +N      L G+  
Sbjct: 416  TLSSLDPILDAQVVNLLGNASDTPQIYAACGRGPRSTFRSLKHGLDVNVLVESPLPGVPN 475

Query: 349  GMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQ 408
             +W+L+ S DD +D+++V+SF + T +L++   + +EE    GF S   TL         
Sbjct: 476  AVWTLKLSEDDEYDSYIVLSFPNGTLVLSIG--ETIEEVNDTGFLSSGPTLAVQQLGSAG 533

Query: 409  LVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGI 467
            L+QV    +R + +  R   +EW +P G ++  AT N +QV++A     LVY E+  +G 
Sbjct: 534  LLQVHPAGLRHIRAADRV--DEWNAPAGTTIVSATTNKNQVVIALSTQELVYFELDEEGS 591

Query: 468  LTEVKHAQ-LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLG 525
            L E +  + L    +C+ I  + E    +   AVG   + +V I SL PD  L T     
Sbjct: 592  LNEYQERKSLPGNATCVSIADVPEGRRRTPYLAVGC-DNQTVHIISLEPDNTLETLSLQA 650

Query: 526  GEIIPRSVLLCAFEGIS--------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQP 577
                P  + L      S        +L   L +G LL  +++   G L+D +   LG++P
Sbjct: 651  LTAPPSDICLAEIFDTSIDKNRATMFLTIGLLNGVLLRTVVDPVDGSLSDTRLRFLGSKP 710

Query: 578  ITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAK 637
              +   +   +  V A S R  ++Y+    L    +    + + C  ++A  PD L    
Sbjct: 711  PRIVRSAVHGSPAVMAFSSRTWLLYTYQDLLQTQPLIYDMLEYACNLSAAMCPDGLIGIS 770

Query: 638  EGELTIGTIDDI-QKLHIRSIPLGEHPRRICHQEQSRTF-------AICSLKNQSCAEES 689
               L I TI  +  KL   S+PL   PR+      +  F        + S +    A E 
Sbjct: 771  GNTLRIFTIPRLGTKLKQDSMPLSYTPRKFVSYPYNTVFYGIEADHRVWSAQAMEKATEE 830

Query: 690  EMH-----------------------------FVRLLDDQTFEFISTYPLDTFEYGCSIL 720
            +                                +R++D      + T  LD  E   S+ 
Sbjct: 831  KKREGDEKALAVLELPPAEFGRVRAPAGHWGSLIRVIDPLQNTTLQTLELDEDEAAFSLA 890

Query: 721  SCSFSD-DSNVYYCVGTAY---VLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSL 776
             C F          VGT     ++P+  +    R+     E  +L+ + + +T     ++
Sbjct: 891  VCYFERMGGEPSLVVGTGVKTTLMPKGCKEGWLRVYAIRDEGKRLEFMHKTKTDDVPLAV 950

Query: 777  NAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY-VQTRGDFIVVGDLMK 835
             AF G LLA + + ++LY+      G + L  +C ++G    +  +   G  I+VGDL +
Sbjct: 951  AAFQGYLLAGVGKSLRLYEM-----GKKALLRKCENNGFPTGVATINVVGARIIVGDLQE 1005

Query: 836  SISLLIYKHEEGAIEER-----ARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KN 887
            S    +Y+    +I  R     A D    +++AV  +D D    A+   N+F  R   + 
Sbjct: 1006 STFYCVYR----SIPSRQLLIFADDSQPRFLTAVCNVDYDTVCCADKFGNIFVNRLEERV 1061

Query: 888  SEGATDEERGRL------EVVGEYHLGEFVNRFRHGSLVMRLPDSDV--GQIPTVIFGTV 939
            SE   D+  G +       ++G  +  + +  +  GS+V  L    V  G    V++ T+
Sbjct: 1062 SEKVDDDPTGAVILHEKGFLMGSANKTDLIAHYNVGSVVTSLTKVSVAPGGRDVVVYTTI 1121

Query: 940  NGVIGVIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGD 998
            +G +G +   + ++   F+  L+ ++R +   + G +H  +R +    K V     +DGD
Sbjct: 1122 SGAVGALVPFISNDDVEFMTTLEMHIRSLNTSLVGRDHLAYRGYYAPVKAV-----VDGD 1176

Query: 999  LIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            L ESF  L   +   I+  ++ +V E+ K++E+L
Sbjct: 1177 LCESFNMLPYPQQQAIAADLDRNVGEVLKKLEQL 1210


>gi|313235544|emb|CBY10999.1| unnamed protein product [Oikopleura dioica]
          Length = 1185

 Score =  190 bits (482), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 251/1147 (21%), Positives = 478/1147 (41%), Gaps = 147/1147 (12%)

Query: 7    YGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDNG 66
            +  I +L  FR  G  +D++ + ++  +  +L+++ +   L           + R    G
Sbjct: 59   FSLIRSLISFRLTGGTKDYIVLGSDSGRISILEYNPDKGSLERVHCETFGKSMARRIVPG 118

Query: 67   QIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL-----YGCA 121
            Q    DP  R I +   +    V   +   Q +   +  LE  +   I F       G  
Sbjct: 119  QFLAHDPKGRAIMVAACEKQKLVYILNRDAQARLTISSPLEAHKSSTIHFYPVGLDVGFE 178

Query: 122  KPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNGADLLIPV 170
             P   V+  D +DA    T E A + K    F E     N+        L+  A++LIPV
Sbjct: 179  NPVFAVIELDYEDADADPTGEAAQEAKQSLTFYELDLGLNHVVRKYTDPLECAANMLIPV 238

Query: 171  P-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR---------- 215
            P     P   GV++  E  ++Y   N      IR  I +    +D D  R          
Sbjct: 239  PGGSDGP--GGVIVCAENYLIY--KNFGDQPDIRFPIPRRRNDLD-DPERGMIIVAHATH 293

Query: 216  -------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSY 268
                   +LL    G L  + +  E++ VT ++++      ++S++  L    +++   +
Sbjct: 294  KTRSMFFFLLQTEQGDLFKVTLETEEDIVTEIRLKYFDTVPVSSSLCVLRTGFLFVAGEF 353

Query: 269  GDSQLIKL-------------NLQPDAKGSYVEVLERYV----------NLGPIVDFCVV 305
            G+  L ++             + +P  +G       R +          +L P+++  V 
Sbjct: 354  GNHNLYQITRLGEDDDEPEFSSAEPLEEGETFFFTPRGLRNLALTDEMDSLSPVLNCEVA 413

Query: 306  DLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTDDPFDTF 364
            DL  +   Q+    G     +LR++R+G+ ++E A  EL G    +W++++S D   D++
Sbjct: 414  DLANEDTPQLYVTCGRGPRSTLRVLRHGLEVSEMAVSELPGNPNAVWTVKTSADADHDSY 473

Query: 365  LVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTS 424
            ++VSF++ T +L++   + +EE    GF   T TL       + LVQ+    +R + S  
Sbjct: 474  IIVSFVNATLVLSIG--ETVEEITDSGFLGTTPTLSSGLMGEDALVQIYPEGIRHIRSDR 531

Query: 425  RELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD-GILTE-VKHAQLEYEISC 482
            R   NEW++P    +     N  QV++A  GG +VY E+   G L E  +  +   EI  
Sbjct: 532  RV--NEWRAPDRKQIVRCACNRQQVVIALTGGEIVYFEMDPTGQLNEYTERREFGSEIIA 589

Query: 483  LDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPRSVLLCAFEGI 541
            LD+  +       +  AVG+ +D +VRI SL P+  L  +       +P+ V +   +G 
Sbjct: 590  LDVGDVPAGEQRCRFLAVGL-SDGTVRIISLDPNDCLQPRTMQALPTVPQDVAITEHKGQ 648

Query: 542  SYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVI 601
              L   L +G LL   ++  TGE++D +   LGT+ + L    ++    V A S R  + 
Sbjct: 649  YVLQIGLQNGVLLRTTIDSVTGEISDTRTRYLGTKAVKLYKVVTEGENSVLAVSSRSWLS 708

Query: 602  YSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL--HIRSIPL 659
            Y   ++   + ++ + +     F+S   P+ +       L I  ++ +  +   +R+ PL
Sbjct: 709  YRHQQRFHLTPLSYEALDSATGFSSEQCPEGIVAIAGNTLRILALEKLGAVFNQVRN-PL 767

Query: 660  GEHPRRICHQEQSRTFAIC---------SLKNQSCAEESEMHFVR-LLDDQTF------- 702
               PR+      +    I            K Q  A E    F++  L +  F       
Sbjct: 768  QLTPRKFVIHPDTANLVIAESDHNAFTEETKKQPLAAEMAAAFLKEELPESDFGSPRAGQ 827

Query: 703  ----EFISTYPLDT--------FEYGCSILSCSFSDDSNV-----YYCVGTAYVL---PE 742
                  IS    +T         E G  I S   +   +V     Y  VGT+  +   P 
Sbjct: 828  GMWASKISIISPETGGTLHHVWLEQGQGIHSMCLTKFQSVGMIDWYLLVGTSNSMILNPR 887

Query: 743  ENEPTKGRILVFIVEDG-KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLY----KWM 797
            +++   G + V+ +  G + + +    T+ A  ++  F G++L  +   +++Y    K M
Sbjct: 888  QSQ--GGSLHVYRIHAGERFEYLHTTYTEDAPTAITPFQGRVLIGVGNLLRIYDLGKKKM 945

Query: 798  LRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYN 857
            LR    R + +            + T G  I+  D  +S   + Y+ +E  +   A D  
Sbjct: 946  LRKCELRSIPNRVTK--------IITAGQRIIAFDQQESAFFIKYRVKENQLTIFADDTF 997

Query: 858  ANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDEERG------RLEVVGEYHLG 908
              W++A  +LD    + A+   ++  +R   + +E   D+  G      R  + G     
Sbjct: 998  PRWVTAACMLDYWTVVIADKFGSVSVIRLPAETNENTQDDPSGAKALWSRGNLNGASQKA 1057

Query: 909  EFVNRFRHGSLVMRLPDSDV--GQIPTVIFGTVNGVIGVIASLP-HEQYLFLEKLQTNLR 965
            E +N F  G ++  +  + +  G    +++ T++G IG++     +E   F + L+ ++R
Sbjct: 1058 ETLNNFFVGDMITSIQKTILIPGGGECIVYATMSGRIGILVPFASNEDADFFQSLEMHMR 1117

Query: 966  KVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEEL 1025
            +    + G +H  +R +   +K++     +DGDL E+F  L + +   I++ M+  + E+
Sbjct: 1118 QEHTFLTGRDHLAFRGYYYPQKSI-----VDGDLCEAFSLLGQDQQKGIAEEMDRQLAEV 1172

Query: 1026 CKRVEEL 1032
             K++E++
Sbjct: 1173 SKKLEDI 1179


>gi|320037168|gb|EFW19106.1| pre-mRNA-splicing factor rse1 [Coccidioides posadasii str. Silveira]
          Length = 970

 Score =  189 bits (481), Expect = 6e-45,   Method: Compositional matrix adjust.
 Identities = 232/982 (23%), Positives = 410/982 (41%), Gaps = 156/982 (15%)

Query: 176  GVLIIGEETIVYCSAN--AFKA-IPIRPSITKAYGRVDADGSRYLLGDHAGLLHLL---- 228
            GVL+  E  I Y  +N  AF+  IP R   T+     + +  RY+    AG++H +    
Sbjct: 14   GVLVCSEGNITYRHSNQDAFRVPIPRRSGPTE-----NPERKRYIT---AGVVHKMRRAF 65

Query: 229  -----------------VITHEKEKVTG----LKIELLGETSIASTISYLDNAVVYIGSS 267
                             ++  E  ++TG    LK++      +AS++  L N  +++ S 
Sbjct: 66   FCLLQTEDGDLFKVTMDMVEDENGQLTGEVQRLKLKYFDTVPVASSLCILKNGFLFVASE 125

Query: 268  YGDSQLIKLN-----------------------LQP----DAKGSYVEVLERYVNLGPIV 300
             G+    +                         L P          + ++E   +L P++
Sbjct: 126  TGNHHFYQFEKLGDDDEETEFSSDDFSADPSEPLAPVYFRPRPAENLNLVESINSLNPLM 185

Query: 301  DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSLRSSTDD 359
               + +L      Q  T SG     + R +++G+ ++E    EL  +   +W+ + + +D
Sbjct: 186  SCKITNLTEDDAPQFYTLSGTGARSTFRTLKHGLEVSEIVESELPSVPSAVWTTKLTRND 245

Query: 360  PFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRL 419
             +D ++++SF + T +L++   + +EE    GF S   TL       + L+QV    +R 
Sbjct: 246  QYDAYIILSFSNGTLVLSIG--ETVEEVTDTGFLSSAPTLAVQQLGEDSLIQVHPKGIRH 303

Query: 420  VSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTEV-KHAQLE 477
            + +  R   NEW +P   S+  A  N  QV +A   G +VY E+  DG L E  +  ++ 
Sbjct: 304  IHADRR--VNEWPAPQHRSIVAAATNERQVAVALSSGEIVYFEMDTDGSLAEYDEKREMS 361

Query: 478  YEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPRSV-LL 535
              ++CL +  I      S   AVG   D +VRI SL PD  L  K        P ++ ++
Sbjct: 362  GTVTCLSLGEIPPGRVRSSFLAVGC-DDSTVRILSLDPDSTLENKSVQALTSAPSALSIM 420

Query: 536  CAFEGIS-----YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTH 590
               +  S     YL   L  G  L  +L+  TGEL+D +   LG + + L + S K    
Sbjct: 421  SMVDSTSGGSTLYLHIGLYSGIYLRTVLDEVTGELSDTRTRFLGLKSVKLFSVSVKEQRA 480

Query: 591  VFAASDRPTVIYS--SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDD 648
            V A S RP + YS    K  + + ++   +     F+S    + +   +   L I +I+ 
Sbjct: 481  VLALSSRPWLGYSDLQTKNFMLTPLDYVPLEWSWNFSSEQCVEGMVGIQGQNLRIFSIEK 540

Query: 649  I-QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHF----------VRLL 697
            +   L   +IPL   PR      +   F +    N   +  ++             V  L
Sbjct: 541  LDNNLLQETIPLAYTPRHFVRHPEQPLFYVIEADNNILSPSTKAKLLQDSKAANGEVAEL 600

Query: 698  DDQTFEF----------------------ISTYPLDTFEYGCSILSCSF-SDDSNVYYCV 734
              + F +                      IS   L+  E   S+ +  F S D   +  V
Sbjct: 601  PPEDFGYPRGTGHWASCIQVVDPINSKAVISRIELEENEAAVSVAAVPFSSQDDETFLVV 660

Query: 735  GTA---YVLPEENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVYSLNAFNGKLLAAINQ 789
            GT     V P  +  + G I ++   EDGK L+ I + + +   ++L AF G+LLA I +
Sbjct: 661  GTGKDMVVYPPSS--SCGFIHIYRFQEDGKELEFIHKTKVESPPHALLAFQGRLLAGIGR 718

Query: 790  KIQLYKWMLRDDGTRELQSEC-GHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGA 848
             +++Y     D G ++L  +C       L + +QT+G  I+V D+ +S++ ++YK++E  
Sbjct: 719  NLRIY-----DLGMKQLLRKCQAEVVPRLIVGLQTQGSRIIVSDVQESVTYVVYKYQENR 773

Query: 849  IEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDEERGRLEVVGEY 905
            +   A D  A W +   ++D +   G +   NL+ +R   K SE A ++  G   +    
Sbjct: 774  LIPFADDVIARWTTCTAMVDYETVAGGDKFGNLWLLRCPQKASEEADEDGSGAHLIHERQ 833

Query: 906  HLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGT--------------VNGVIGVIAS-LP 950
            +L    NR    SL++     D   IPT I  T              + G +G++   + 
Sbjct: 834  YLQGAPNRL---SLMVHFYPQD---IPTSIQKTQLVAGGRDILVWTGLQGTVGMLVPFVS 887

Query: 951  HEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTR 1010
             E   F + L+  L      + G +H  +RS+        AK  +DGDL E++  L   +
Sbjct: 888  REDVDFFQSLEMQLTSQTPPLAGRDHLIYRSY-----YAPAKGTIDGDLCETYFTLPNDK 942

Query: 1011 MDEISKTMNVSVEELCKRVEEL 1032
               I+  ++ SV E+ +++ ++
Sbjct: 943  KLMIAGELDRSVREIERKISDM 964


>gi|198420618|ref|XP_002125906.1| PREDICTED: similar to Splicing factor 3B subunit 3
            (Spliceosome-associated protein 130) (SAP 130)
            (Pre-mRNA-splicing factor SF3b 130 kDa subunit) (SF3b130)
            (STAF130) [Ciona intestinalis]
          Length = 1216

 Score =  189 bits (480), Expect = 8e-45,   Method: Compositional matrix adjust.
 Identities = 256/1190 (21%), Positives = 479/1190 (40%), Gaps = 190/1190 (15%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            ++ V ++G I +L  FR  G  +D++   ++  +  +L++    ++     +        
Sbjct: 53   LITVEVFGEIRSLMSFRLTGGNKDYVVCGSDSGRISILEYIPNKNKFEKIHLETFGKSGC 112

Query: 61   RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLY 118
            R    GQ   IDP  R  +IG      L  ++  D++ +L  +  +   +       F+Y
Sbjct: 113  RRIVPGQYLAIDPKGRAVMIGALEKQKLVYILNRDSQARLTISSPLEAHKANT----FVY 168

Query: 119  -------GCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------L 160
                   G   P    L  D ++A    T E     +    F E     N+        L
Sbjct: 169  HMVGIDVGFENPMFACLELDYEEADTDPTGEAVQNTQQTLTFYELDLGLNHVVRKYSEPL 228

Query: 161  DNGADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR 215
            +  A+ LI VP     P   GVL+  E  I Y   N      IR  I +    +D D  R
Sbjct: 229  EERANHLISVPGGNDGP--GGVLVCAENYITY--KNFGDQPDIRTPIPRRRNDLD-DPER 283

Query: 216  -----------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLD 258
                             +L+    G +  + +  +++ VT ++++      ++  +  L 
Sbjct: 284  GMIVVCSATHKTKSMFFFLIQTEQGDIFKVTLETDEDMVTEIRLKYFDTVPVSMAMCVLR 343

Query: 259  NAVVYIGSSYGDSQLIKL-------------NLQPDAKGSYVEVLERYV----------N 295
               +++ +  G+  L ++             +  P  +G       R +          +
Sbjct: 344  TGFLFVAAEMGNHCLYQIAHLGDDDDETEFSSAMPLEEGDTFFYAPRALRNLVLVDELDS 403

Query: 296  LGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLR 354
            L PI+   + DL  +   Q+    G     SLR++R+G+ ++E A  EL G    +W+++
Sbjct: 404  LSPIMTCLISDLANEDTPQLYVTCGRGPRSSLRVLRHGLEVSEMAVSELPGNPNAVWTVK 463

Query: 355  SSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTS 414
               ++ FD++++VSF++ T +L++   + +EE    GF   T TL C     N LVQV  
Sbjct: 464  IKEEEEFDSYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSLLGENALVQVYP 521

Query: 415  GSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VK 472
              +R + +  R   NEWK+P   ++     N  QV++A  GG LVY E+   G L E  +
Sbjct: 522  DGIRHIRADKR--VNEWKTPGKKTILRCAVNQRQVVIALTGGELVYFEMDQSGQLNEYTE 579

Query: 473  HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGG-EIIPR 531
              ++  E+ C+D++ +      ++  AVG+  D +VRI SL   + +    +      P 
Sbjct: 580  RKEMNSEVVCMDLSKVPPTEQRTRFLAVGL-ADNTVRIISLDPTDCLQPLSMQALPATPE 638

Query: 532  SVLLCAFEGIS------------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPIT 579
            SV +    G               L   L +G LL   L+  TG+L+D +   LGT+ + 
Sbjct: 639  SVCIIEIGGGEDETGAQKASGGLVLNIGLQNGVLLRTTLDNVTGDLSDTRTRYLGTRAVK 698

Query: 580  LRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEG 639
            L    ++ +  V A S R  + Y    +   + ++ + +     F S   P+ +      
Sbjct: 699  LFRVLTQGSDAVLAMSSRTWLSYQHQNRFHLTPLSYESLEFASGFASEQCPEGIVAISAN 758

Query: 640  ELTIGTIDDIQKL-HIRSIPLGEHPRRICHQEQSRTFAICSL------------KNQSCA 686
             L I  ++ +  + +  S  L   PRR      S    I               + Q  A
Sbjct: 759  TLRILALEKLGTVFNQASTGLQYTPRRFAIHYDSGNVIILETDHNAYTEETKENRKQQMA 818

Query: 687  EE-------------SEMHF-------------------------VRLLDDQTFEFISTY 708
            EE             +EM                           +RL+  ++ E I   
Sbjct: 819  EEMIEAAGEDERELAAEMTAAFLRESLPEDKFGASKAGPGMWSSQIRLMHPRSGETIQKI 878

Query: 709  PLDTFEYGCSILSCSF---SDDSNVYYCVGTAYVLPEENEPTKGRILVFIV--EDGKLQL 763
             L+  E   SI  C F   SDD+ V   V    ++        G I  + +  E  KL+L
Sbjct: 879  CLEQNEAALSIAICKFPNNSDDTFVLVGVACELLINPRQARGGGEIHTYKINEEGNKLEL 938

Query: 764  IAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHI--LALYV 821
            + +        ++  + G++L  + + +++Y     D G ++L  +C  + HI      +
Sbjct: 939  VHKTVVDEVPSAICPYQGRVLIGVGKLLRIY-----DLGKKKLLRKC-ENKHIPNYIASI 992

Query: 822  QTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNL 881
            Q  G  I+V D+ +S+  + Y+  E  +   A D    W+++  +LD +    ++   N+
Sbjct: 993  QAVGHRIIVCDVQESVHWVRYRRHENQLVVFADDTYPRWVTSATVLDWNTVAVSDKFGNI 1052

Query: 882  FTVRKNSEGATDEE-----------RG-------RLEVVGEYHLGEFVNRFRHGSLVMRL 923
              +R  S+   D +           RG       + EV+  YH+GE V   +  +L+   
Sbjct: 1053 SVLRLPSDVNDDVQDDPSGTKALWTRGILNGAMQKCEVLCMYHVGETVLSLQKTTLI--- 1109

Query: 924  PDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSF 982
                 G   ++++ T++G IG++     HE + F + L+ ++R     + G +H  +RS+
Sbjct: 1110 ----PGGSESLVYTTLSGSIGMLVPFTSHEDHDFFQHLEMHMRNECPPLLGRDHLAYRSY 1165

Query: 983  NNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
                     KN +DGDL E F  +  ++   +++ ++    E+ K++E++
Sbjct: 1166 -----YFPVKNVIDGDLCELFNSMEASKQKSVAEELDRVPSEVSKKLEDI 1210


>gi|405117821|gb|AFR92596.1| pre-mRNA-splicing factor RSE1 [Cryptococcus neoformans var. grubii
            H99]
          Length = 1217

 Score =  188 bits (478), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 276/1182 (23%), Positives = 476/1182 (40%), Gaps = 186/1182 (15%)

Query: 7    YGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTDN 65
            +G I  +  FR  G  +D++   ++  +  +L++    +        +V  + G R    
Sbjct: 60   FGTIRNIAGFRLAGMTKDYILATSDSGRLSILEFVISPTPHFESMYQEVFGKSGSRRIVP 119

Query: 66   GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YGC 120
            GQ   +DP  R  L+G      L  V+  + +G+L  +  +   +   L    +    G 
Sbjct: 120  GQFLAVDPKGRSCLVGSLEKTKLVYVLNRNTEGKLYPSSPLEAHKNHTLVTHVVGVDQGY 179

Query: 121  AKPTIVVLYQDNKDA-------------RHVKTYEVALKDKDFVEGPWSQNNLDNGADLL 167
              P    L  D  ++             +H+  YE+ L     V   WS+   D  A+LL
Sbjct: 180  DNPLYAALEIDYSESDQDPTGEAYENTQKHLTFYELDLGLNHVVRK-WSEPT-DRRANLL 237

Query: 168  IPVP------------PPLCGVLIIGEETIVYCSANA-FKAIPIRPSITKAYGRVDADGS 214
            + VP            P   GVL+  E+ I++   +     IPI         R   D S
Sbjct: 238  VQVPGGQSANSDRFEGP--SGVLVCTEDHIIWKHMDVEAHRIPIPRRRNPLVQR--GDKS 293

Query: 215  R-----------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYL 257
            R                 +LL    G L+ + I H  E V  LKI+      +A+++  L
Sbjct: 294  RGLIIVSAVMHKIKGAFFFLLQSEDGDLYKVWIEHNGEDVVALKIKYFDTVPVANSLCIL 353

Query: 258  DNAVVYIGSSYGDSQLIKLNLQPDAKG-------SYVE---------------------- 288
                +Y+ S + D  L +     +  G        Y E                      
Sbjct: 354  KRGYIYVASEFSDQNLYQFQSLAEDDGEQEWSSTDYPENGNIDGPLPFAFFDPQPLRNLL 413

Query: 289  VLERYVNLGPIVDFCVVDL--ERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG 346
            +++   +L PI D  VV+L        Q+    G     + R +++G+ + E  S  L G
Sbjct: 414  LVDTVPSLDPITDAHVVNLLGASSDTPQIYAACGRGARSTFRTLKHGLDVAEMVSSPLPG 473

Query: 347  IK-GMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAI 405
            +   +W+L+ + DD +D+++V+SF + T +L++   + +EE    GF S   TL      
Sbjct: 474  VPTNVWTLKLTEDDEYDSYIVLSFPNGTLVLSIG--ETIEEVNDTGFLSSGPTLAVQQLG 531

Query: 406  YNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG- 464
               L+QV    +R + +  R   +EW +PPG ++  AT N  QV++A     LVY E+  
Sbjct: 532  NAGLLQVHPYGLRHIRAADRV--DEWPAPPGQTIVAATTNQRQVVIALSTAELVYFELDP 589

Query: 465  DGILTEVKHAQ-LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKE 522
            +G L+E +  + L    +C+ I  + E    +   AVG   + +V I SL PD  L T  
Sbjct: 590  EGSLSEYQEKKALPGNATCVTIAEVPEGRRRTPFLAVGC-DNQTVSIISLEPDSTLDTLS 648

Query: 523  HLGGEIIPRSVLLCAFEGIS--------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLG 574
                   P S+ L      S        +L   L +G LL  +++   G L+D +   LG
Sbjct: 649  LQALTAPPTSICLAEIFDTSIDKNRATMFLNIGLMNGVLLRTVVDPVDGSLSDTRLRFLG 708

Query: 575  TQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLA 634
             +P  L   + +    V A S R  ++Y+    L    +    + +    ++A  PD L 
Sbjct: 709  AKPPKLVRANVQGQPSVMAFSSRTWLLYTYQDMLQTQPLIYDTLEYAWSLSAAMCPDGLI 768

Query: 635  IAKEGELTIGTIDDI-QKLHIRSIPLGEHPRR-ICHQEQS---------RTFAICSLKNQ 683
                  L I +I  + +KL   S  L   PR+ I H   S         RT++  +++  
Sbjct: 769  GISGNTLRIFSIPKLGEKLKQDSTALTYTPRKFISHPFNSVFYMIEADHRTYSKSAIERI 828

Query: 684  SCAEESE--------------------------MHFVRLLDDQTFEFISTYPLDTFEYGC 717
               +ESE                             VR+LD    E I T  LD  E   
Sbjct: 829  VKQKESEGRRVDTLLLDLPANEFGRPRAPAGHWASCVRVLDPLANETIMTLDLDEDEAAF 888

Query: 718  SILSCSFS-DDSNVYYCVGTAYVLPEENEPTK-GRILVF-IVEDGK-LQLIAEKETKGAV 773
            SI    F       +  VGT      + +  K G + V+ I E G+ L+ + + +T    
Sbjct: 889  SIAIAYFERGGGEPFLVVGTGVKTTLQPKGCKEGYLRVYAIKEQGRVLEFLHKTKTDDIP 948

Query: 774  YSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY-VQTRGDFIVVGD 832
              L  F G LLA + + ++LY+      G + L  +C ++G   A+  +  +G  I+VGD
Sbjct: 949  LCLAGFQGFLLAGVGKSLRLYEM-----GKKALLRKCENNGFPTAVVTINVQGARIIVGD 1003

Query: 833  LMKSISLLIYKHEEGAIEER-----ARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKN 887
            + +S    +Y+    +I  R     A D    W++ V  +D +     +   N+F  R +
Sbjct: 1004 MQESTFYCVYR----SIPTRQLLIFADDSQPRWITCVTSVDYETVACGDKFGNIFINRLD 1059

Query: 888  ---SEGATDEERG------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTV---- 934
               SE   D+  G      +  ++G  H  E +  +  GS+V     + + +IP V    
Sbjct: 1060 PSISEKVDDDPTGATILHEKSFLMGAAHKTEMIAHYNIGSVV-----TSITKIPLVAGGR 1114

Query: 935  ---IFGTVNGVIGVIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVD 990
               ++ T++G +G +   +  +   F+  L+ ++R       G +H  +R +      V 
Sbjct: 1115 DVLVYTTISGAVGALVPFVSSDDIEFMSTLEMHMRTQDISPVGRDHIAYRGY-----YVP 1169

Query: 991  AKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
             K  +DGDL ESF  L   +   I+  ++ SV ++ K++E++
Sbjct: 1170 IKGVVDGDLCESFSLLPYPKQQAIATDLDRSVGDVLKKLEQM 1211


>gi|164429062|ref|XP_957282.2| pre-mRNA splicing factor RSE1 [Neurospora crassa OR74A]
 gi|157072394|gb|EAA28046.2| pre-mRNA splicing factor RSE1 [Neurospora crassa OR74A]
          Length = 1192

 Score =  188 bits (477), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 276/1164 (23%), Positives = 475/1164 (40%), Gaps = 178/1164 (15%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
            I+G +  +  FR  G  +D++ +AT+  +  ++++  ++++     +        R    
Sbjct: 64   IFGIVRAIASFRLAGSHKDYIILATDSGRITIIEYLPKTNKFQRIHLETFGKSGVRRVIP 123

Query: 66   GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YGC 120
            GQ    DP  R  LI     + L  V+  +++ +L  +  +   +  VL +  +    G 
Sbjct: 124  GQYLAADPKGRACLISALEKNKLVYVLNRNSQAELTISSPLEAHKPGVLVLSLVALDVGY 183

Query: 121  AKPTIVVLYQDNKDARHVKT-------------YEVALKDKDFVEGPWSQNNLDNGADLL 167
            A P    L  D  DA    T             YE+ L     V   WS + +D  + LL
Sbjct: 184  ANPVFAALELDYTDADQDPTGQAREEVETQLVYYELDLGLNHVVRK-WS-DTVDRTSSLL 241

Query: 168  IPVP-----PPLCGVLIIGEETIVYCSAN--AFK-AIPIRPSITK--AYGRVDADGSRYL 217
              VP     P   GVL+ GEE + Y  +N  AF+  IP R   T+     RV   G  + 
Sbjct: 242  FQVPGGNDGP--SGVLVCGEENVTYRHSNQEAFRVPIPRRSGATEDPQRKRVIVSGVMHK 299

Query: 218  LGDHAGLLHLLVITHEKE------------------KVTGLKIELLGETSIASTISYLDN 259
            L   AG    L+ T + +                  +V  LKI+      +A+++  L +
Sbjct: 300  LKGSAGAFFFLLQTDDGDLFKVTIDMIEDSDGNPTGEVKRLKIKYFDTIPVATSLCILKS 359

Query: 260  AVVYIGSSYGDSQLIKL---------------NLQPDAKGSYVEV------LERYV---- 294
              ++  S +G+    +                +   D   SY  V      LE  V    
Sbjct: 360  GFLFAASEFGNHHFYQFEKLGDDDEELEFSSDDFPTDPTASYNPVYFHPRPLENLVLVES 419

Query: 295  --NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMW 351
              ++ P VD  V +L  +   Q+ +  G     + R++++G+ ++E  + EL G    +W
Sbjct: 420  IDSMNPQVDCKVANLTGEDAPQIYSVCGNGARSTFRMLKHGLEVSEIVASELPGTPSAVW 479

Query: 352  SLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQ 411
            + + +  D          I ET          +EE    GF +   TL       + L+Q
Sbjct: 480  TTKLTKYD---------HIGET----------VEEVSDSGFLTTAPTLAVQQMGEDGLIQ 520

Query: 412  VTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTE 470
            V    +R +        NEW +P   S+  ATAN +QV++A   G +VY E+  DG L E
Sbjct: 521  VHPKGIRHIVQGRV---NEWPAPQHRSIVAATANENQVVIALSSGEIVYFEMDSDGSLAE 577

Query: 471  V-KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEI 528
              +  ++   ++ L +  + E    S   AVG   D +VRI SL PD  L  K       
Sbjct: 578  YDEKKEMSGTVTSLSVGQVPEGLKRSSFLAVGC-DDCTVRILSLDPDSTLEMKSIQALTA 636

Query: 529  IPRSVLLCAFE-----GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTF 583
             P ++ + + E        YL   L  G  L  +L+  TGELTD ++  LG +P  L   
Sbjct: 637  APSALSIMSMEDSFGGSTLYLHIGLHSGVYLRTVLDEVTGELTDTRQKFLGPKPTRLFQV 696

Query: 584  SSKNTTHVFAASDRPTVIYSS--NKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGEL 641
            S ++   V A S RP + Y+    K  + + ++  E+ +   F+S    + +       L
Sbjct: 697  SVQDQPCVLALSSRPWLGYTDPLTKGFMMTPLSYTELEYGWNFSSEQCLEGMVGIHANYL 756

Query: 642  TIGTIDDIQKLHI-RSIPLGEHPRRIC-HQEQSRTFAICSLKN-------QSCAEESEMH 692
             I +I+ +    I +SIPL   P+ +  H EQ   + I S  N           E+    
Sbjct: 757  RIFSIEKLGDNMIQKSIPLTYTPKHLVKHPEQPYFYTIESDNNTLPPELRAKLLEQQSNG 816

Query: 693  FVRLLDDQTFEF----------------ISTYP-------LDTFEYGCSILSCSF-SDDS 728
               +L  + F +                IS  P       LD  E   S     F S + 
Sbjct: 817  DATVLPPEDFGYPRAKGRWASCISIIDPISEEPRVLQRIDLDNNEAAVSAAIVPFASQEG 876

Query: 729  NVYYCVGTAY-VLPEENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVYSLNAFNGKLLA 785
              +  VGT   ++ +  + T+G I V+   EDG+ L+ I +   +    +L  F G+LLA
Sbjct: 877  ESFLVVGTGKDMVLDPRQFTEGYIHVYRFHEDGRDLEFIHKTRVEEPPLALIPFQGRLLA 936

Query: 786  AINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHE 845
             + + +++Y   L+    R+ Q++       L + +Q++G+ I+VGDL + I+ ++YK E
Sbjct: 937  GVGKTLRIYDLGLK-QLLRKAQADV---TPTLIVSLQSQGNRIIVGDLQQGITYVVYKAE 992

Query: 846  EGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEY 905
               +   A D    W +   ++D +   G +   N++ VR     + + +    E   E 
Sbjct: 993  GNRLIPFADDTLNRWTTCTTMVDYESVAGGDKFGNIYIVRCPERVSQETD----EPGSEI 1048

Query: 906  HLGEFVNRFRHGS----------LVMRLPDS------DVGQIPTVIFGTVNGVIGV-IAS 948
            HL    N + HG+              LP S       VG    +++  + G +GV I  
Sbjct: 1049 HLMHARN-YLHGTPNRLSLQVHFYTQDLPTSICKTSLVVGGQDVLLWSGLQGTVGVFIPF 1107

Query: 949  LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSR 1008
            +  E   F + L+ ++R     + G +H  +R +    K V     +DGDL E F  L  
Sbjct: 1108 VSREDVDFFQNLENHMRAEDPPLAGRDHLIYRGYYTPVKGV-----IDGDLCERFSLLPN 1162

Query: 1009 TRMDEISKTMNVSVEELCKRVEEL 1032
             +   I+  ++ SV E+ +++ ++
Sbjct: 1163 DKKQMIAGELDRSVREIERKISDI 1186


>gi|336469942|gb|EGO58104.1| pre-mRNA splicing factor RSE1 [Neurospora tetrasperma FGSC 2508]
          Length = 1192

 Score =  188 bits (477), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 276/1164 (23%), Positives = 475/1164 (40%), Gaps = 178/1164 (15%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
            I+G +  +  FR  G  +D++ +AT+  +  ++++  ++++     +        R    
Sbjct: 64   IFGIVRAIASFRLAGSHKDYIILATDSGRITIIEYLPKTNKFQRIHLETFGKSGVRRVIP 123

Query: 66   GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YGC 120
            GQ    DP  R  LI     + L  V+  +++ +L  +  +   +  VL +  +    G 
Sbjct: 124  GQYLAADPKGRACLISALEKNKLVYVLNRNSQAELTISSPLEAHKPGVLVLSLVALDVGY 183

Query: 121  AKPTIVVLYQDNKDARHVKT-------------YEVALKDKDFVEGPWSQNNLDNGADLL 167
            A P    L  D  DA    T             YE+ L     V   WS + +D  + LL
Sbjct: 184  ANPVFAALELDYTDADQDPTGQAREEVETQLVYYELDLGLNHVVRK-WS-DTVDRTSSLL 241

Query: 168  IPVP-----PPLCGVLIIGEETIVYCSAN--AFK-AIPIRPSITK--AYGRVDADGSRYL 217
              VP     P   GVL+ GEE + Y  +N  AF+  IP R   T+     RV   G  + 
Sbjct: 242  FQVPGGNDGP--SGVLVCGEENVTYRHSNQEAFRVPIPRRSGATEDPQRKRVIVAGVMHK 299

Query: 218  LGDHAGLLHLLVITHEKE------------------KVTGLKIELLGETSIASTISYLDN 259
            L   AG    L+ T + +                  +V  LKI+      +A+++  L +
Sbjct: 300  LKGSAGAFFFLLQTDDGDLFKVTIDMIEDSDGNPTGEVKRLKIKYFDTIPVATSLCILKS 359

Query: 260  AVVYIGSSYGDSQLIKL---------------NLQPDAKGSYVEV------LERYV---- 294
              ++  S +G+    +                +   D   SY  V      LE  V    
Sbjct: 360  GFLFAASEFGNHHFYQFEKLGDDDEELEFSSDDFPTDPTASYNPVYFHPRPLENLVLVES 419

Query: 295  --NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMW 351
              ++ P VD  V +L  +   Q+ +  G     + R++++G+ ++E  + EL G    +W
Sbjct: 420  IDSMNPQVDCKVANLTGEDAPQIYSVCGNGARSTFRMLKHGLEVSEIVASELPGTPSAVW 479

Query: 352  SLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQ 411
            + + +  D          I ET          +EE    GF +   TL       + L+Q
Sbjct: 480  TTKLTKYD---------HIGET----------VEEVSDSGFLTTAPTLAVQQMGEDGLIQ 520

Query: 412  VTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTE 470
            V    +R +        NEW +P   S+  ATAN +QV++A   G +VY E+  DG L E
Sbjct: 521  VHPKGIRHIVQGRV---NEWPAPQHRSIVAATANENQVVIALSSGEIVYFEMDSDGSLAE 577

Query: 471  V-KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEI 528
              +  ++   ++ L +  + E    S   AVG   D +VRI SL PD  L  K       
Sbjct: 578  YDEKKEMSGTVTSLSVGQVPEGLKRSSFLAVGC-DDCTVRILSLDPDSTLEMKSIQALTA 636

Query: 529  IPRSVLLCAFE-----GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTF 583
             P ++ + + E        YL   L  G  L  +L+  TGELTD ++  LG +P  L   
Sbjct: 637  APSALSIMSMEDSFGGSTLYLHIGLHSGVYLRTVLDEVTGELTDTRQKFLGPKPTKLFQV 696

Query: 584  SSKNTTHVFAASDRPTVIYSS--NKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGEL 641
            S ++   V A S RP + Y+    K  + + ++  E+ +   F+S    + +       L
Sbjct: 697  SVQDQPCVLALSSRPWLGYTDPLTKGFMMTPLSYTELEYGWNFSSEQCLEGMVGIHANYL 756

Query: 642  TIGTIDDIQKLHI-RSIPLGEHPRRIC-HQEQSRTFAICSLKN-------QSCAEESEMH 692
             I +I+ +    I +SIPL   P+ +  H EQ   + I S  N           E+    
Sbjct: 757  RIFSIEKLGDNMIQKSIPLTYTPKHLVKHPEQPYFYTIESDNNTLPPELRAKLLEQQSNG 816

Query: 693  FVRLLDDQTFEF----------------ISTYP-------LDTFEYGCSILSCSF-SDDS 728
               +L  + F +                IS  P       LD  E   S     F S + 
Sbjct: 817  DATVLPPEDFGYPRAKGRWASCISIIDPISEEPRVLQRIDLDNNEAAVSAAIVPFASQEG 876

Query: 729  NVYYCVGTAY-VLPEENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVYSLNAFNGKLLA 785
              +  VGT   ++ +  + T+G I V+   EDG+ L+ I +   +    +L  F G+LLA
Sbjct: 877  ESFLVVGTGKDMVLDPRQFTEGYIHVYRFHEDGRDLEFIHKTRVEEPPLALIPFQGRLLA 936

Query: 786  AINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHE 845
             + + +++Y   L+    R+ Q++       L + +Q++G+ I+VGDL + I+ ++YK E
Sbjct: 937  GVGKTLRIYDLGLK-QLLRKAQADV---TPTLIVSLQSQGNRIIVGDLQQGITYVVYKAE 992

Query: 846  EGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEY 905
               +   A D    W +   ++D +   G +   N++ VR     + + +    E   E 
Sbjct: 993  GNRLIPFADDTLNRWTTCTTMVDYESVAGGDKFGNIYIVRCPERVSQETD----EPGSEI 1048

Query: 906  HLGEFVNRFRHGS----------LVMRLPDS------DVGQIPTVIFGTVNGVIGV-IAS 948
            HL    N + HG+              LP S       VG    +++  + G +GV I  
Sbjct: 1049 HLMHARN-YLHGTPNRLSLQVHFYTQDLPTSICKTSLVVGGQDVLLWSGLQGTVGVFIPF 1107

Query: 949  LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSR 1008
            +  E   F + L+ ++R     + G +H  +R +    K V     +DGDL E F  L  
Sbjct: 1108 VSREDVDFFQNLENHMRAEDPPLAGRDHLIYRGYYTPVKGV-----IDGDLCERFSLLPN 1162

Query: 1009 TRMDEISKTMNVSVEELCKRVEEL 1032
             +   I+  ++ SV E+ +++ ++
Sbjct: 1163 DKKQMIAGELDRSVREIERKISDI 1186


>gi|134113697|ref|XP_774433.1| hypothetical protein CNBG0790 [Cryptococcus neoformans var.
            neoformans B-3501A]
 gi|50257071|gb|EAL19786.1| hypothetical protein CNBG0790 [Cryptococcus neoformans var.
            neoformans B-3501A]
          Length = 1276

 Score =  188 bits (477), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 222/912 (24%), Positives = 402/912 (44%), Gaps = 125/912 (13%)

Query: 186  VYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIEL- 244
            V+ S   F  + I  ++ + +G     G+  ++GD  G         EK    G    + 
Sbjct: 353  VWRSRQGFGTV-IAATVIEDHG----SGASVVIGDEYGAFTAFGWEFEKGSGAGTDGRVR 407

Query: 245  -----LGETSIASTISYLDNAVVYIGSSYGDSQLIKLNL--------------------- 278
                 LG +S  S+I+YLD++ +++ S+  DS L++L                       
Sbjct: 408  VLRTYLGASSPPSSITYLDSSHLFVSSAVADSVLLRLPKVESSSSVSSGKGKGRAVTSPI 467

Query: 279  ----------------QPDAKGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAY 322
                            + D  G   E+LER++N+ P+ D CVV  E      +V  SGA 
Sbjct: 468  GDQADKWEVLYEIGKDRNDTDGGP-EILERWMNIAPVKDLCVVKDEGGNLSHLVLASGAS 526

Query: 323  KDGSLRIVRNGIGINEQASVE-LQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLE 381
            +  SLR+VR+G+G+ E  +++ L  ++ MW L  ST  P    L++S  + T  + + L+
Sbjct: 527  ESNSLRVVRSGVGLEELVTIQGLHDVQKMWPLTDSTAIPR---LLLS--TSTSTILLQLQ 581

Query: 382  DELEETEIEGFCSQTQTLFCHDAIYNQLV-QVTSGSVRLVSSTS-RELRNEWKSPPGYSV 439
             E+    I      ++TL        +L+ QVT   + L S  S  +L  + +      +
Sbjct: 582  PEISAIPITDVIFNSETLAAGILPGAELLAQVTPRGLSLWSDLSVGQLEAQVEVDKETEI 641

Query: 440  NVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDINPIGENP------- 492
              A   A   ++A  GG LV   + +   +      ++ E+S + I+    +        
Sbjct: 642  VCAQVTADWAVVAKKGGSLVVFHVSNTGFSPQGTIDVKEEVSAVAISSSSGSSSPIIVIS 701

Query: 493  SYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGH 552
            +++    V M + IS  +  L     I  E     +  RS   C   G   LL  L +G 
Sbjct: 702  TWTAKTFVYMLSQISNGVDGLS----IQSESSATSLQLRSHPFCP--GGIQLLSGLDNGL 755

Query: 553  LLNFLLNM-KTGE-----LTDRKKVSLGTQPITLRTFSSKNTTHVFAA---SDRPTVIYS 603
            L  + LN  ++GE     +   K  SLG +P+ L    S +      +   ++R +VI+ 
Sbjct: 756  LHIYDLNTSESGEAEGLTVKSSKTTSLGLRPLVLHPCESTHGDEKVISVGLTERMSVIFE 815

Query: 604  SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG-EH 662
            S  ++ +S+VN+K V      ++++ P     ++   L++  ++ ++KLH+++   G E 
Sbjct: 816  SKDRIEFSSVNIKNVMAATSVDTSSGPVFALFSRTSGLSLVKVNSLKKLHVQTCDTGNES 875

Query: 663  PRRICHQEQSRTFAICSLKNQSC--AEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSIL 720
              ++ + ++ +  A  S +       +  E +FV++ D  + E +S++ L   E   S+ 
Sbjct: 876  ISKLTYMDEYKAIACGSTRRTQLRDGDVKEENFVQIRDGTSLEPLSSFSLRGRELVTSLR 935

Query: 721  SCSFSDDSNVYYCVGTAYVLPEENEPT-----------KGRILVFIVEDGK------LQL 763
            S   +    +Y  VGTA++ P++ E +           +GR+L+   ++G       +++
Sbjct: 936  SVFLT--GRIYLAVGTAFLPPDDGEDSSWDEGNLAVVREGRVLLLEFKEGDAGGGWDIKI 993

Query: 764  IAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQ-SECGHHGHILAL--- 819
             AE  T GAVY+L   +G L  A   K+ +++    D    EL+ +      ++++    
Sbjct: 994  KAELATVGAVYALEEIHGFLAVAAGSKLTIHR---LDHNPVELEETSSWASAYVISSLSV 1050

Query: 820  ----YVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDD--DIYL 873
                +++  G  IV GD M+S+ +L     +G I +  R+   + ++A+ +L D  D  +
Sbjct: 1051 LPPSHIRPEGALIV-GDGMRSVIVLNVDEGDGMIYDDERNMATHGVTALGLLKDKGDGVV 1109

Query: 874  GAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPT 933
             ++   NL T R N          +LE    + L E V RF+ GSLV      ++  IP 
Sbjct: 1110 ISDAYSNLLTYRLNQ---------KLERAATFGLHEEVTRFQSGSLVPTTTAPEI-IIPD 1159

Query: 934  VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKN 993
            V+F T  G +G+I  L       L+ LQ N+ K+ KG G +    WR   +     D   
Sbjct: 1160 VLFATREGRLGIIGELGTRSSRTLDDLQRNMSKIWKGPGEVGWSNWRRAGSNLVGKDTAG 1219

Query: 994  FLDGDLIESFLD 1005
            F+DGD ++ FLD
Sbjct: 1220 FVDGDFVQKFLD 1231


>gi|401407861|ref|XP_003883379.1| putative Splicing factor 3B subunit 3 [Neospora caninum Liverpool]
 gi|325117796|emb|CBZ53347.1| putative Splicing factor 3B subunit 3 [Neospora caninum Liverpool]
          Length = 1233

 Score =  187 bits (475), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 257/1198 (21%), Positives = 456/1198 (38%), Gaps = 198/1198 (16%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
            ++G I ++  FR  G  +D+L I ++  +  ++Q+ AE +E   R   +   + G R   
Sbjct: 58   VFGIIRSIAAFRLTGANRDYLAIGSDSGRLVIVQFSAEKNEF-ERVHCETYGKTGIRRVV 116

Query: 65   NGQIGIIDPDCRLIGLHLYD--GLFKVIPFDNKGQLKEAFNIRLEELQVL--DIKFL-YG 119
             G+   +DP  R + +   +      ++  DNK QL  +  +   +   +  D+  +  G
Sbjct: 117  PGEYLAVDPKGRTLMVAAVERQKFVYIVNRDNKAQLTISSPLEAHKSHAICHDLCGIDMG 176

Query: 120  CAKPTIVVLYQ--DNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGV 177
               P    L Q  ++ D +         K     E     N++   A L  PVPP    +
Sbjct: 177  FDNPLFASLEQNVESTDRKPATPGVTVPKGVCLWEMDLGLNHVIKKATL--PVPPSAHCL 234

Query: 178  LIIGEET-------IVYCSANAFK-----------AIPIRPSITKAYGRVDADGSRYLLG 219
            + +           ++ C  N              AIP R       G V    + + + 
Sbjct: 235  IPVPGGGGADGPSGVLVCCGNFLLYKKPDHEEVSCAIPRRLETGSDRGLVVVAFAVHRMK 294

Query: 220  D--------HAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDS 271
            D          G ++ + I+H++  V  +         +A+ +  L +  +++ S +G+ 
Sbjct: 295  DFFFILIQTEYGDIYKIEISHDEGVVREIVCRYFDTVPVANALCVLKSGYLFVASEFGNH 354

Query: 272  QLIKLN------LQPDAKGSY-----------------VEVLERYVNLGPIVDFCVVDLE 308
               +          P    S+                 + +++   +L PI D  V+D +
Sbjct: 355  LFYQFTGIGSDASDPRCSSSHPLGREAIIAFKPRPLKNLALVDELQSLSPITDLKVLDAQ 414

Query: 309  RQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTDDPFDTFLVV 367
              G  QV    G     +LRI+++G+G+ E A  EL G  + +W+ + S    FD +++V
Sbjct: 415  GTGAPQVYVLCGKGPRSALRILQHGLGVEEMADNELPGRARAVWTTKLSHQSAFDGYIIV 474

Query: 368  SFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSREL 427
            +F  E   L + + D +EE     F +   +L       +  +QV    +R +  + R  
Sbjct: 475  AF--EGSSLVLQIGDTVEEVTDSLFLTNVSSLLVALMYDDSFIQVHETGIRHILKSKRV- 531

Query: 428  RNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDG-ILTEVKHAQLEYEISCLDIN 486
             NEW++P G  +  A AN  Q++++  GG L+  E+ +   L E     +  E +C+ + 
Sbjct: 532  -NEWRTPGGRRIKAADANERQLVISLAGGELILFEVDESHTLVETARRNINVETTCMSMQ 590

Query: 487  PIGENPSYSQIAAVGMWTDISVRIFSL-PDLNL--ITKEHLGGEIIPRSVLLCAFEGIS- 542
             I +    +   AVG   D  VRI SL  D NL  +  + L     P SV L    G+  
Sbjct: 591  AIPKGRLRASFLAVG-GLDNMVRILSLEKDRNLRQLATQLLPNNATPESVCLATLTGLGA 649

Query: 543  --------------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNT 588
                          YL   L  G ++  +++   G L D++   LG + +     + +  
Sbjct: 650  HGKDAGKSEDNGVLYLHVGLNTGVMIRSVVDPVLGTLLDQRSRFLGGRAVRFHAVTLQGQ 709

Query: 589  THVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDD 648
              + A S++  + Y+   KL  + +N   +  +  F S    D       G L I     
Sbjct: 710  PAILALSEKSWLCYTFQHKLHCTPLNYDPLECVASFCSEQCTDGFVAIAGGSLRIFRCQR 769

Query: 649  IQKLHIRS-IPLGEHPRRICH--------------------QEQSRTFAICSLKNQSCAE 687
            + +   ++ +PL   PR +                        ++   AI    + +  E
Sbjct: 770  LGETFSQTLLPLSFTPRAMAALPHISATESQGAVDAAGVGPSRRAAALAIVEADHNAYDE 829

Query: 688  ESEMHFVRLL----------------DDQTFEFISTYPLDTFEYGC-------------- 717
             ++    R L                DD   E    + L   +YG               
Sbjct: 830  STKAEIRRALRGIKVNQEEEEEKEETDDMQLEEKEQHDLPEDQYGTFKAGPGKWGSCIRI 889

Query: 718  -------SILSCSFSDDSNVYYC------------VGT--AYVLPEENEPTKGRILVFIV 756
                   +I   S   D     C            VGT  A  L     P +  I VF  
Sbjct: 890  VNPLMAMTIDKVSLETDEAALSCCYCEMEGLPLLVVGTVTAMTLKPRKVP-QASIKVFSY 948

Query: 757  EDG-KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGH 815
            +D   L L+     +    +L  F G LLA +  K++LY       G + L  +C +   
Sbjct: 949  DDKFSLNLVHSTPVEDYPMALAPFRGMLLAGVGHKLRLYAL-----GKKRLLKKCEYKNL 1003

Query: 816  ILAL-YVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG 874
               + +++  GD + VGDL +S+ ++ Y+  E      A D    W++  E+LD   ++ 
Sbjct: 1004 PCGVAFIRVAGDRLFVGDLRESVHVMRYRLSENLFYVLADDVVPRWLTKGEVLDYHTFVA 1063

Query: 875  AENNFNLFTVRKNSEGATDE--------ERG----------RLEVVGEYHLGEFVNRFRH 916
            A+   ++F  R  SE   DE         RG          +L+ +  +H+GE V     
Sbjct: 1064 ADKFDSVFICRVPSEAKQDELGDTTGLRLRGDTTYLTDKCFKLQSLLHFHIGEVVTALER 1123

Query: 917  GSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASL--PHEQYLFLEKLQTNLRKVIKGVGGL 974
             +L         G   ++I+GT+ G IG  +     HE  LF   L+  LR     +GG 
Sbjct: 1124 ATLT-------AGASESIIYGTIMGSIGAFSPFLTKHELDLFTH-LEMVLRSEKPPLGGR 1175

Query: 975  NHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
             H  +RS+ +      AKN +DGDL ES+  L       I++    +  ++ K +E++
Sbjct: 1176 EHIMFRSYYHP-----AKNTVDGDLCESYALLPYDVQKRIAQDFEKTPADILKHLEDI 1228


>gi|328853180|gb|EGG02320.1| hypothetical protein MELLADRAFT_44871 [Melampsora larici-populina
            98AG31]
          Length = 1210

 Score =  187 bits (475), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 261/1180 (22%), Positives = 486/1180 (41%), Gaps = 189/1180 (16%)

Query: 7    YGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDNG 66
            +G I +L  FR  G  +D++ + ++  +  VL++D   +  +            R    G
Sbjct: 60   FGTIRSLTSFRLTGGTKDYIILGSDSGRIVVLEFDPVLNAFVKVHQETYGKSGARRVVPG 119

Query: 67   QIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLK-----EAF--NIRLEELQVLDIKFL 117
            Q    DP  R  ++       L  ++  D+   L      EA   N  +  +  +D+ F 
Sbjct: 120  QYLATDPKGRAVMVAAMEKSKLVYILNRDSAANLTISSPLEAHKSNAIIHHIVGVDVGF- 178

Query: 118  YGCAKPTIVVLYQD-------------NKDARHVKTYEVALKDKDFVEGPWSQNNLDNGA 164
                 P    L  D             N   + +  YE+ L     V   WS+   +  A
Sbjct: 179  ---ENPLFAALEVDYGESDQDHTGEAFNSAEKMLTYYELDLGLNHVVRK-WSEPT-EPRA 233

Query: 165  DLLIPVP-------------PPLCGVLIIGEETIVYCSANAFK---AIPIRPS------- 201
            ++L+ VP             P   GVL+  E+ I+Y   N  +    IP R +       
Sbjct: 234  NMLLQVPGGQSATHADKFDGP--SGVLVCCEDMIIYKHQNVKEHRVPIPKRNTPFNDVER 291

Query: 202  ---ITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLD 258
               I  A          +L+    G L  + I HE E+V  LKI+      ++S+++ L 
Sbjct: 292  GVIIVAAVMHKMRGAFFFLVQTEEGDLFKVTIDHEDEEVQALKIKYFDTVPVSSSLTILK 351

Query: 259  NAVVYIGSSYG------------------------------DSQLIKLNLQPDAKGSYVE 288
            +  +++ +  G                              D  L K   +P    + V 
Sbjct: 352  SGFLFVAAEMGNHYLYQFEKLGDDDDETEFSSEQYPDNGCSDEPLPKAFFKPRPLQNLV- 410

Query: 289  VLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIK 348
            + +   +L PI+D    +L      Q++T  G     SLRI+R+G+ ++E  + +L G  
Sbjct: 411  LSDELESLAPIIDAKAANLLNTDSPQILTSCGRGSRSSLRILRHGLDVSEIVTSDLPGPP 470

Query: 349  -GMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYN 407
              +W+ + + DD FD ++++ F++ T +L++   + + E    GF + + T+       N
Sbjct: 471  TNVWTTKLNDDDLFDRYIILGFLNATLVLSIG--ETIVEVSDTGFLTNSPTISIQQLDKN 528

Query: 408  QLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DG 466
             L+Q+    +R +         EWK P G  + V+T+N  QV++   GG L+Y E+  +G
Sbjct: 529  GLLQIHPTGIRHIHLNGG--VTEWKVPAGRKIVVSTSNQRQVVVGLSGGELIYFELDLEG 586

Query: 467  ILTEVK-HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHL 524
             L E +   ++   I+ L ++ + +    +   A+G+  +++V+I SL P+  L T    
Sbjct: 587  QLNEYQEQKEMGSTITTLSLSEVPKGRQRTPFLAIGL-ENLTVQIISLDPNSVLETISLQ 645

Query: 525  GGEIIPRSVLLCAFEGIS--------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQ 576
                +P S+ +      S        ++   L +G LL  +L+   G+LTD +   LG++
Sbjct: 646  ALTSVPSSICIAELLDSSIDKNNETLFVNIGLSNGVLLRTVLDSVNGQLTDTRTRFLGSR 705

Query: 577  PITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIA 636
            P+ L      + T V A S R  + Y+    L ++ +    + ++  F++   P+ L   
Sbjct: 706  PVKLLRVKVDSKTSVIALSSRTWLNYTYQNMLQFNPLIYDSIDNVHSFSAELCPEGLIGI 765

Query: 637  KEGELTIGTIDDIQ-KLHIRSIPLGEHPRR----------ICHQEQSRTFAICSLKNQSC 685
                L I TI  +  K+   S+PL   PR+          I  + + RT +  S+K +  
Sbjct: 766  VGSSLRIFTIPKLGIKVKQDSMPLSYTPRKMILDPVTKHVITIESEHRTMS-PSVKAERL 824

Query: 686  AE------ESEMHF-------------------VRLLDDQTFEFISTYPLDTFEYGCSIL 720
            AE      E + H                    +R+ D      ++   L+  E   S  
Sbjct: 825  AEHKAQGLEFDEHIFNDEVLGLPRSEPGQWASCIRISDPLEKTTLTKIELENNEAATSAA 884

Query: 721  SCSFS--DDSNVYYCVGT---AYVLPEENEPTKGRILVF-IVEDGK-LQLIAEKETKGAV 773
              SF+  D       VGT   A+V P       G + VF   ++GK L+L+ + E     
Sbjct: 885  IVSFANQDLQGPLLVVGTAKDAFVQPRTCR--NGYLSVFKFTDNGKGLELLHKTEVDDIP 942

Query: 774  YSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILAL-YVQTRGDFIVVGD 832
             ++  F G+L A I + ++++     D G ++L  +  +     A+  + T+G  ++VGD
Sbjct: 943  TAIIGFQGRLAAGIGKALRIF-----DLGKKKLLRKVENKTFSAAISSLSTQGPRLLVGD 997

Query: 833  LMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGAT 892
               SIS  +YK  E  +   A D +  W +   ++D D   G +   N++ V +  +  +
Sbjct: 998  AQDSISYAVYKPAENRLLVFADDISPRWTTCATMVDYDTCAGGDRFGNVW-VNRIPKTVS 1056

Query: 893  DE------------ERG-------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPT 933
            DE            E+G       +++ +  +HL +        SLV        G    
Sbjct: 1057 DEVDDDPTGAGIMHEKGYLMGAPHKVKNLVHFHLNDIPTSMEKTSLV-------PGGREV 1109

Query: 934  VIFGTVNGVIGV-IASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAK 992
            +++  + G +G+ I  L  E   F + L+ ++R  +  + G +H  +R +         K
Sbjct: 1110 LLYTGLQGTVGILIPFLSKEDVDFFQTLEMHMRSELPSLVGRDHLAYRGY-----YFPVK 1164

Query: 993  NFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            N +DGD+ ESF  L  ++  +++  ++ SV ++ K++E +
Sbjct: 1165 NCVDGDMCESFALLPMSKQQQVAAELDRSVSDVLKKIEAV 1204


>gi|403370717|gb|EJY85226.1| Spliceosomal protein sap, putative [Oxytricha trifallax]
          Length = 1203

 Score =  187 bits (475), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 262/1164 (22%), Positives = 475/1164 (40%), Gaps = 160/1164 (13%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
            ++G I  +  FR  G  +DFL + ++  K  +L++D E ++ +            R    
Sbjct: 58   VFGLIRKMIPFRLLGMQKDFLVVGSDSGKIVILEYDGEHNKFVKIHQETFGKTGCRRIVP 117

Query: 66   GQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLY-----GC 120
            G+    DP  R I +   +    V   +   Q K   +  LE  +   I F       G 
Sbjct: 118  GEYIAADPKGRAIMIGAVEKQKFVFILNRDSQNKLTISSPLEAHKPHTITFAMVGVDVGI 177

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNGADLLIP 169
              P    L  D  +A    +  V  + +    F E     N+        +DN A +LI 
Sbjct: 178  ENPQFACLEVDYGEADSTYSAVVTGQHQKMLVFYEMDLGLNHVVRKYSTPVDNSAHMLIA 237

Query: 170  VP-PPL--CGVLIIGEETIVYCSANAFK---AIPIRPSITKAYGRVDADGSRY------- 216
            VP  P    G+L++ E  I Y   +      +IPIR       G      S Y       
Sbjct: 238  VPGEPYGPSGILVVCENMISYKKVDHDDRECSIPIRNEQADKNGTFFTAHSTYTSKDMFF 297

Query: 217  -LLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIK 275
             LL +  G +  + +     +V G+ I+    T+    ++ L    ++  S + +  L  
Sbjct: 298  FLLSNEFGDIFKINLNFTNAQVHGMTIQYFDTTTPTVCMNILRPGQLFCASEFSNHTLYT 357

Query: 276  L----------------NLQPDAKGSY----------VEVLERYVNLGPIVDFCVVDLER 309
                             + +P  + +           ++ ++ + NL  I D  V DL  
Sbjct: 358  FLDIGENDPNPIITSSADRKPHGQQTLTTYNPREFLNLQAVDEFPNLASINDMKVEDLTG 417

Query: 310  QGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVEL-QGIKGMWSLRSSTDDPFDTFLVVS 368
            +G  Q+   SG    G LR++R+G+ I E+A   + Q    + +++  T D FD +++VS
Sbjct: 418  EGNPQIYLASGRGAQGCLRVLRHGLTIIEKAVTAMPQKPLNVITVKGKTTDVFDKYMIVS 477

Query: 369  FISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSREL- 427
            F  +T +L++  ++++ E +  GF    +TL       N  +QVT  S+  V    +   
Sbjct: 478  FQQQTLVLSIG-QEKVSEVKDSGFVDNERTLHVGILEDNSYIQVTPKSIIHVKGDQQNRK 536

Query: 428  RNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD--GILTEVKHAQLEYEISCLDI 485
            R +W S  G  V  A +N  QV ++  GG +VY E+ +  G L EV+    + E++C+DI
Sbjct: 537  RAKWDSGQGKIVK-ACSNQRQVAVSIEGGQIVYFELDEMSGTLNEVESRFYDSEVACIDI 595

Query: 486  NPIGENPSYSQIAAVGMWTDISVRIFSL-PD--LNLITKEHLGGEIIPRSVLLCAFEGIS 542
              + E     +  AVG + D +V+I SL P+  L  I+ + L     P SV L +F+   
Sbjct: 596  ADVPEGRQRCRFLAVG-YADKTVKIMSLDPESCLQRISMQALPAH--PESVALISFQRDE 652

Query: 543  --------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAA 594
                    +L   L +G LL  L++  TG L+D +   LGT  I+L      NT  + A 
Sbjct: 653  VAQQQQQLFLHVGLVNGVLLRTLVDNVTGVLSDSRTRFLGTNSISLAKVRQGNTNALVAL 712

Query: 595  SDRPTVIYS--SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGE-LTIGTIDDI-Q 650
             ++P + Y+  S  K+  + ++ + +     F S   PD   +A  G  L I T++ + +
Sbjct: 713  CNKPWLCYTHMSTNKVNITPLSYEMLEVASSFCSEKCPDGGIVAISGNTLRIITVERLGE 772

Query: 651  KLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCA-----------------------E 687
                + +PL   P +I    ++    I    + S +                       +
Sbjct: 773  NFTHKVMPLRYTPTKIQIHRETNYLVILEKDHNSYSYSERLRMKEEIAKNTEDENYLALD 832

Query: 688  ESEMHFVRLLDDQ---TFEFISTYPLDTFEY---------GCSILSCSFSDDSNVYYCVG 735
            ES + + R   ++       +  Y L T E              ++ +  +    +  +G
Sbjct: 833  ESRISYPRAGQNKFASCIRIVDPYLLQTLELIEFENNEVVFSHFIATTLGNPGETHLILG 892

Query: 736  TAY-VLPEENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQ 792
            TA  V  +    + G I  +  V +G+ LQL+     +    + N + G+L+A +   ++
Sbjct: 893  TALNVTFQPRSCSVGFIKTYKFVNNGQSLQLMHSTPCEDIPMAFNEYKGRLIAGVGPILR 952

Query: 793  LY----KWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGA 848
            +Y    K +LR    +  Q+          + +Q     I  GDL +S+ +L YK E+  
Sbjct: 953  IYELGQKKLLRKVENKNFQAPI--------IQIQVDEGRIYAGDLQESVHVLKYKPEDVQ 1004

Query: 849  IEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR------------------KNSEG 890
            +   + D    W+++  +LD D   G +   N+F  R                  K   G
Sbjct: 1005 LYIFSDDILNRWLTSFCLLDHDTIAGVDKFENVFINRLPVGCEDDAEDDPTATKFKWENG 1064

Query: 891  ATDEERGRLEVVGEYHLGEFVNRFRHGSL-VMRLPDSDVGQIPTVIFGTVNGVIGVIASL 949
              +    +++ + ++  GE     +  SL  +   +S++     ++FGT +G +G +   
Sbjct: 1065 YLNGAAFKMDPICQFFTGEVGTCIQKCSLNTLSGTNSEI-----ILFGTTSGSLGALLPF 1119

Query: 950  -PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSR 1008
               E+  F   L+  LR   + + G +H  +RS       V  K+ +DGDL E F  L  
Sbjct: 1120 ETREEIDFFVHLEMYLRIEAQPLCGRDHVTFRS-----SYVPVKDVVDGDLCEQFASLEF 1174

Query: 1009 TRMDEISKTMNVSVEELCKRVEEL 1032
             +   +++ M+ +  E+ K++E +
Sbjct: 1175 NKQRVLAEEMDRTPPEVMKKLENM 1198


>gi|296814646|ref|XP_002847660.1| pre-mRNA-splicing factor rse1 [Arthroderma otae CBS 113480]
 gi|238840685|gb|EEQ30347.1| pre-mRNA-splicing factor rse1 [Arthroderma otae CBS 113480]
          Length = 1235

 Score =  187 bits (475), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 272/1164 (23%), Positives = 478/1164 (41%), Gaps = 167/1164 (14%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
            ++G I +L  FR  G  +D++ I ++  +  ++++   +     R   +   + G R   
Sbjct: 64   VFGIIRSLAAFRLAGSNKDYIIIGSDSGRITIVEY-VPAQNRFNRIHLETFGKSGVRRVV 122

Query: 65   NGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEE-------LQVLDIK 115
             GQ   +DP  R  LI     + L  V+  + + +L  +  +           L  LD+ 
Sbjct: 123  PGQYLAVDPKGRACLIASVEKNKLVYVLNRNAQAELTISSPLEAHRPQTVVFALTALDVG 182

Query: 116  F---LYGCAKPTIVVLYQDNKDARHVKT------YEVALKDKDFVEGPWSQNNLDNGADL 166
            +   ++   +     + QD+    + +T      YE+ L     V   W+ + +D  A +
Sbjct: 183  YENPIFAALEVEYTEVDQDSTGQAYEETEKMLVYYELDLGLNHVVRK-WA-DPVDRTASM 240

Query: 167  LIPVPPPL---CGVLIIGEETIVYCSAN--AFKA-IPIR--PSITKAYGRVDADGSR--- 215
            L  VP       GVL+  E+ IVY  +N  AF+  IP R  P+      R    G     
Sbjct: 241  LFQVPGGADGPSGVLVCAEDNIVYRHSNQDAFRVPIPRRRGPTENPERKRCITAGVMHKM 300

Query: 216  -----YLLGDHAGLLHLL---VITHEKEKVTG----LKIELLGETSIASTISYLDNAVVY 263
                 +LL    G L  L   ++  E E+ TG    LK++      +AS++  L +  ++
Sbjct: 301  RGAFFFLLQSEDGDLFKLTMEMVEDENEQATGEVKRLKLKYFDTVPLASSLCILKSGFLF 360

Query: 264  IGSSYGDSQLIKLN-----------LQPDAKGSYVEVL----------------ERYVNL 296
            + S  G+    +             +  D      E L                E   +L
Sbjct: 361  VASETGNQHFYQFEKLGDDDDEIEFISDDYSADLTEPLPPVYFRPRPAENLNLVESIASL 420

Query: 297  GPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSLRS 355
             P++   + +L  +   Q+ T  G     S R +++G+ ++E    EL  +   +W+ + 
Sbjct: 421  NPLMSCSIANLTEEDAPQLYTLCGTGARSSFRTLKHGLEVSEIVESELPSVPSAVWTTKL 480

Query: 356  STDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSG 415
            + +D +D ++V+SF + T +L++   + +EE    GF S   TL       + L+QV   
Sbjct: 481  TRNDQYDAYIVLSFSNGTLVLSIG--ETVEEVTDTGFLSSAPTLAVQQLGEDSLIQVHPK 538

Query: 416  SVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTEV-KH 473
             +R + +  R   NEW +P   S+  AT N  QV +A   G +VY E+  DG L E  + 
Sbjct: 539  GIRHIHADQR--VNEWPAPQHRSIVAATTNERQVAIALSSGEIVYFEMDTDGSLAEYDEK 596

Query: 474  AQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPRS 532
             Q+   ++CL +  + E    S   AVG   D +VRI SL P+  L  K        P +
Sbjct: 597  RQMSGTVTCLSLGEVPEGRVRSSFLAVGC-DDSTVRILSLDPESTLENKSVQALTSAPSA 655

Query: 533  V-LLCAFEGIS-----YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSK 586
            + ++   +  S     YL   L  G  L  +L+  TGEL+D +   LG +P+ L   S K
Sbjct: 656  LSIMSMIDSTSGGPTLYLHIGLYSGIYLRTVLDEVTGELSDTRTRFLGVKPVKLFGVSVK 715

Query: 587  NTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVS--HMCPFNSAAFPDSLAIAKEGELTIG 644
                V A S R  + YS  + + ++   L  V       F+S    + +   +   L I 
Sbjct: 716  EQRAVLALSSRSWLGYSDVQTMSFTLTPLNYVGLEWSWNFSSEQCVEGMVGIQGQNLRIF 775

Query: 645  TIDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQT-- 701
            +I+ +   L   SIPL    R +    +   F +    N   +  ++    +LL + T  
Sbjct: 776  SIEKLDNNLLQESIPLAYTSRSLVRHPEYPIFYVIGSDNNVLSPSTK---AKLLSESTTV 832

Query: 702  -----------FEF----------------------ISTYPLDTFEYGCSILSCSF-SDD 727
                       F +                      +S   L+  E   SI + SF S +
Sbjct: 833  NGDNAELPPEDFGYPRGTNHWASCIQVVDPINAKAVMSRIELEDNEAAVSIAAVSFTSQE 892

Query: 728  SNVYYCVGTAY-VLPEENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVYSLNAFNGKLL 784
               +  VGT   ++      T G I ++   E+GK L+ I + + +    +L  F G+LL
Sbjct: 893  DETFLVVGTGKGMVVSPRSFTCGFIHIYRFQEEGKELEFIHKTKVEQPPLALLGFQGRLL 952

Query: 785  AAINQKIQLYKWMLRDDGTRELQSEC-GHHGHILALYVQTRGDFIVVGDLMKSISLLIYK 843
            A I   +++Y     D G R+L  +C       + + +QT+G  I+V D+ +S++ ++YK
Sbjct: 953  AGIGPDLRIY-----DLGMRQLLRKCQAQITPRVIVGLQTQGSRIIVSDVQESVTYVVYK 1007

Query: 844  HEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDEERGRLE 900
            ++E  +   A D    W +   ++D +   G +   N++ +R   K SE A ++  G   
Sbjct: 1008 YQENNLIPFADDIIPRWTTCTTMVDYETVAGGDKFGNIWLLRCPPKASEEADEDGSGAHL 1067

Query: 901  VVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGT--------------VNGVIGV- 945
            +    +L    NR    SLV      D   IPT I  T              + G +G+ 
Sbjct: 1068 IHERQYLQGAPNRL---SLVAHFYSQD---IPTSIQKTQLVAGGRDILVWTGLQGTVGML 1121

Query: 946  IASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLD 1005
            I  +  +   F + L+  L      + G +H  +R +         K  +DGDL E+F  
Sbjct: 1122 IPFVTRDDVDFFQTLEMQLTSQNPPLAGRDHLIYRGY-----YAPCKGVIDGDLCETFFL 1176

Query: 1006 LSRTRMDEISKTMNVSVEELCKRV 1029
            L   +   I+  ++ SV E+ +++
Sbjct: 1177 LPNDKKQAIAGELDRSVREIERKI 1200


>gi|392578232|gb|EIW71360.1| hypothetical protein TREMEDRAFT_71141 [Tremella mesenterica DSM 1558]
          Length = 1250

 Score =  187 bits (474), Expect = 4e-44,   Method: Compositional matrix adjust.
 Identities = 224/890 (25%), Positives = 375/890 (42%), Gaps = 140/890 (15%)

Query: 210  DADGSRYLLGDHAGLLHLLVITHEKEKVTG--------LKIELLGETSIASTISYLDNAV 261
            D D +  LLGD AG L  L I   +E ++         LK +L G  +  S+++ L ++ 
Sbjct: 356  DRDNAVVLLGDDAGHLTKLTIESAREDLSSKLSAVMKVLKTDL-GFAAAPSSLTLLGSSH 414

Query: 262  VYIGSSYGDSQLIKLN---------LQPDAKG---------------------SYVEVLE 291
            V++GS+ GD+ +I+L          L P  K                      + VE++E
Sbjct: 415  VFMGSACGDALIIRLPSVTSSHSTPLSPALKSKGKARADDSLDGFGVEHYHETATVEMVE 474

Query: 292  RYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVE-LQGIKGM 350
            R++NL P+ DFC V+ +  G   +V  +GA    SLR VR+G+ +     +E +QGI+ M
Sbjct: 475  RWMNLAPVKDFCAVEEDGGGLSHLVVAAGASNTNSLRAVRSGVSLETLMDIEGVQGIERM 534

Query: 351  WSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQL- 409
            W +   + D    FL  S    T  + + L  E++   +        T+    A + ++ 
Sbjct: 535  WPIMLPSRDQ-GIFLSTS----TSSMLLGLAPEVQALALPESVVNHPTIAA--ASWEEVA 587

Query: 410  VQVTSGSVRLVSSTSRELR-NEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG---- 464
            V VT+ +V + S  +  ++   W       +  A  +     +A  GG LV L++     
Sbjct: 588  VIVTANNVTVWSDLTGSVKVGSWSHGQSRQILAAQISGGLAAIAISGGELVILQVSAHGV 647

Query: 465  DGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHL 524
            D ILT     QL  EI+ + I         S I     WT+  + +F+L  L       +
Sbjct: 648  DAILTR----QLGGEIASVAI-----LDGLSPIVVTSTWTN-EIFLFTLDQLR---SPDM 694

Query: 525  GGEIIPR-----SVLLCAFEGISYLLCALGDGHLLNFLLNMKTG---ELTDRKKVSLGTQ 576
             G  I       S+ LC   G + LL  L DG ++ + +        EL++RK VSLGTQ
Sbjct: 695  QGSTIRENSFCASLQLCPLSGGARLLAGLSDGTMVTYHIESSANNALELSERKAVSLGTQ 754

Query: 577  PITLRTFS---SKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSL 633
            P+ L   S     +       S+R  VI  S +++  S+VN  +       N+ +    L
Sbjct: 755  PLRLSPTSLSCGDDRIISVGLSERMCVISESRERIESSSVNRNDTRAAATINTPSHGSCL 814

Query: 634  AIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCA------- 686
            A+A    +++     ++K+H+R++ LG       H+  SR   I  LK  +         
Sbjct: 815  ALATSSGISLVKPTSLKKVHVRTLDLG-------HRSVSRLTNISPLKAIAAGSTERPFD 867

Query: 687  -EESEMH---FVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPE 742
             E  E+H   +V L D  T E +   PL++ E   +I   +  D +     VG A    +
Sbjct: 868  RETGEIHQSSYVELRDSTTLELLVEKPLESREIVTTIAYVTLGDQN--LLAVGIATFSED 925

Query: 743  ENEPTKGRILVFI-VEDGKLQL---IAEKET-------------KGAVYSLNAFNGKLLA 785
            + +      +V I  + G+L L   + ++++             + AV  +      L  
Sbjct: 926  DEDLPDDLDMVTISAQSGRLVLYEPVVDQDSAEPNLIELTSVGLESAVNDIKVIKNLLAV 985

Query: 786  AINQKIQLYKWMLRDD---GTRELQSECGHHGHILA----LYVQTRGDFIVVGDLMKSIS 838
            A    + +YK          T    S       ++A    L+ + R   +VVGD M+SI 
Sbjct: 986  ATGSNVTIYKHEKASHLLIPTSRFASAFVAKSLVVAPPDKLHPEER---LVVGDGMRSIF 1042

Query: 839  LLIYKHEEGAIEERARDYNANWMSAVEILDDD----IYLGAENNFNLFTVRKNSEGATDE 894
            +L      G I    RD   + + A+E L D     I   A +N + F +R+        
Sbjct: 1043 VLDIDEGTGMIMGDERDMATHSVMAMEGLRDGGQAVIVADAHSNISTFRLREE------- 1095

Query: 895  ERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQY 954
                +E    + L E ++ FR GSL       DV   P +IF T++G +G++  L     
Sbjct: 1096 ----IETAATFGLHEDISVFRRGSLAPASSAEDVLS-PEIIFATIDGRLGIVGELTPSAA 1150

Query: 955  LFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFL 1004
              L+ LQ N+ + I+G G +    +R    E    D   F+DGD +++++
Sbjct: 1151 RTLDDLQRNMDRYIRGPGDIAWRSYRRAGTELVQRDTAGFIDGDFVQTYI 1200


>gi|320587625|gb|EFX00100.1| nuclear mRNA splicing factor protein [Grosmannia clavigera kw1407]
          Length = 1220

 Score =  186 bits (473), Expect = 5e-44,   Method: Compositional matrix adjust.
 Identities = 274/1180 (23%), Positives = 484/1180 (41%), Gaps = 184/1180 (15%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW---DAESSELITRAMGDVSDRIGRP 62
            ++G I ++  FR  G  +D++ +AT+  +  ++++   D     +     G    R   P
Sbjct: 65   VFGIIRSIAAFRLAGSNKDYVILATDSGRITIIEYIPADNRFQRIHLETFGKSGVRRVVP 124

Query: 63   TDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL--- 117
               GQ    DP  R  LI     + L  V+  + + +L  +  +   +  V+ I  +   
Sbjct: 125  ---GQYLAADPKGRACLIASVEKNKLVYVLNRNAQAELTISSPLEAHKPGVIVISLVALD 181

Query: 118  YGCAKPTIVVLYQDNKDA---------RHVKT----YEVALKDKDFVEGPWSQNNLDNGA 164
             G A P    L  D  ++         R  +T    YE+ L     V   WS + +D+ A
Sbjct: 182  VGYANPVFAALELDYAESDQDPTGEAYREAETLLVYYELDLGLNHVVRR-WS-DTVDSTA 239

Query: 165  DLLIPVP-----PPLCGVLIIGEETIVYCSAN--AFKA-IPIRPSITK--AYGRVDADGS 214
             +L  VP     P   GVL+ GEE I Y  +N  A++  IP R   T+     R    G 
Sbjct: 240  SMLFQVPGGSDGP--SGVLVCGEENITYRHSNQEAYRVPIPRRRGATEDPNRKRTIVSGV 297

Query: 215  RYLLGDHAGLLHLLVITHEKE------------------KVTGLKIELLGETSIASTISY 256
             + L   AG    L+ T + +                  +V  LKI+      +AS++  
Sbjct: 298  MHKLKGSAGAFFFLLQTEDGDLLKLTLDMVEDDDGNPTGEVLRLKIKYFDTIPVASSLCI 357

Query: 257  LDNAVVYIGSSYGDSQLIKL---------------NLQPDAKGSY------------VEV 289
            L +  ++  S +G+ Q  +                N   + K  Y            + +
Sbjct: 358  LKSGFLFSASEFGNHQFYQFEKLGDDDDELEFSSDNFPHEPKAPYEPVYFYPRPAENLAL 417

Query: 290  LERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIK- 348
            +E   ++ P++D  V +L      Q+ +  G     + R++++ + +NE  + +L G   
Sbjct: 418  VESIESMNPMLDCKVANLTDDDVPQIYSVCGNGARSTFRMLKHALEVNEIVASQLPGTPT 477

Query: 349  GMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQ 408
             +W+ +   D+ +D F+V+SF + T +L++   + +EE    GF S   TL       + 
Sbjct: 478  AVWTTKVRRDEEYDAFIVLSFNNGTLVLSIG--ETVEEVTDTGFLSSVPTLAVQQLGDDG 535

Query: 409  LVQVTSGSVRLVSSTSRELR-NEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DG 466
            L+QV    +R +    R+ R NEW +P   S+  A  N  QV++A   G +VY E+  DG
Sbjct: 536  LIQVHPKGIRHI----RDGRVNEWAAPQHRSIVAAATNERQVVVALSSGEIVYFEMDTDG 591

Query: 467  ILTEVKHAQLEY-EISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHL 524
             L E    +  +  ++CL +  + E    S   AVG   D +VRI SL P+  L      
Sbjct: 592  SLAEYDEKKEMFGTVTCLSLGAVPEGRLRSSYLAVGC-DDCTVRILSLDPETTLDNMSIQ 650

Query: 525  GGEIIPRSVLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPI 578
                 P S+L+ A +  S      YL   L  G  L  +L+  TGELTD ++  LG + +
Sbjct: 651  ALTAAPSSLLIMAMDDSSAGGTALYLHIGLHSGVYLRTVLDEVTGELTDTRQRFLGPKLV 710

Query: 579  TLRTFSSKNTTHVFAASDRPTVIY--SSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIA 636
             L   S +    V A S RP + Y  S  K    + ++  E+     F+S    + +   
Sbjct: 711  RLFQVSVQRRVCVLALSSRPWLGYDDSKAKNFAMTRLDYSELEWGWNFSSEQCEEGMVGI 770

Query: 637  KEGELTIGTIDDIQKLHI-RSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVR 695
                L I +I+ +    I +S PL   PRR+    +   F      N + + +     + 
Sbjct: 771  HGNFLRIFSIEKLGDALIQKSFPLTYTPRRLVKHPEFGIFYTIEADNNTLSPDLRRQLIE 830

Query: 696  ----------LLDDQTFEF-----------------------ISTYPLDTFEYGCSILSC 722
                      +L  + F +                         T  L+  E    +   
Sbjct: 831  APGVTNGDASVLPPEDFGYPRGNGRWASCISVVDPVGEDPGVTQTIELEGNEAAVCMAVA 890

Query: 723  SF-SDDSNVYYCVGTAY-VLPEENEPTKGRILVF-IVEDG-KLQLIAEKETKGAVYSLNA 778
            SF S     Y  VGT   ++    + ++G I  +  V+DG +L+L+ + + +    +L A
Sbjct: 891  SFVSRGGESYLIVGTGRNMILSPRQFSEGYIHAYRFVKDGTQLELVHKTKVEEPPTALLA 950

Query: 779  FNGKLLAAINQKIQLY----KWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLM 834
            F G++L  I   +++Y    K ML     R+ QSE       L + +QT+G  +VVGD+ 
Sbjct: 951  FQGRILVGIGNVLRIYDLGIKQML-----RKAQSEVSSK---LIVSLQTQGSRVVVGDVE 1002

Query: 835  KSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDE 894
            + ++ ++YK E   +     D    W +   ++D     G +   NL+ +R + + + D 
Sbjct: 1003 EGVTYVVYKPEINKLLPFVDDTIKRWTTCTTMVDYQSVAGGDKFGNLWILRVSDKASQDA 1062

Query: 895  E-----------RG-------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIF 936
            +           RG       RL ++   +  +        SLV       VG    +++
Sbjct: 1063 DEPGSELQLVHARGYLHGAPNRLALMAHVYTQDVPMSICKASLV-------VGGQEVLVW 1115

Query: 937  GTVNGVIGVIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFL 995
              + G +G +   +  +   F + L+T++R+    + G +H  +RS+      V  K  +
Sbjct: 1116 SGLQGTVGALVPFVSRDDADFFQNLETHMRQEDAPLAGRDHLMYRSY-----YVPVKGVI 1170

Query: 996  DGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRL 1035
            DGDL E F  L R +   I+  ++ SV E+ +++    R+
Sbjct: 1171 DGDLCERFNLLPREKKQMIAGELDRSVREIERKISVSARV 1210


>gi|242018509|ref|XP_002429717.1| Splicing factor 3B subunit, putative [Pediculus humanus corporis]
 gi|212514723|gb|EEB16979.1| Splicing factor 3B subunit, putative [Pediculus humanus corporis]
          Length = 1218

 Score =  186 bits (472), Expect = 6e-44,   Method: Compositional matrix adjust.
 Identities = 259/1194 (21%), Positives = 494/1194 (41%), Gaps = 196/1194 (16%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            +L V I+G I +L  FR  G  +D++ + ++  +  +L++   +  L  +   +   + G
Sbjct: 53   LLTVEIFGVIRSLMSFRLTGGTKDYIVVGSDSGRIVILEY-IPAKNLFEKVHQETFGKSG 111

Query: 61   -RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
             R    GQ   IDP  R  ++G      L  ++  D + +L  +  +   +   L    +
Sbjct: 112  CRRIVPGQYLAIDPKGRAVMVGAVEKQKLVYILNRDAEARLTISSPLEAHKSNTLVYHMV 171

Query: 118  ---YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNG 163
                G   P    L  D ++A    T E A K +    F E     N+        L+  
Sbjct: 172  GVDVGFENPMFACLEIDYEEADTDPTGEAASKTQQTLTFYELDLGLNHVVRKYSEPLEEH 231

Query: 164  ADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR--- 215
            A+ L+ VP     P   GVLI  E  + Y   N      IR  I +    +D D  R   
Sbjct: 232  ANFLVSVPGGNDGP--SGVLICSENYLTY--KNLGDQHDIRCPIPRRRNDLD-DPERGMI 286

Query: 216  --------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAV 261
                          +L     G +  + +  +++ VT +K++      +A+++  +    
Sbjct: 287  FVCSATHKTKSVFFFLAQTEQGDIFKITLETDEDMVTEIKLKYFDTVPVATSMCVMKTGF 346

Query: 262  VYIGSSYGDSQLIKLN------------------------LQPDAKGSYVEVLERYVNLG 297
            +++ S +G+  L ++                           P A  + V+V E   +L 
Sbjct: 347  LFVASEFGNHYLYQIAHLGDDDDEPEFSSAMPLEEGDTFFFAPRALRNLVQVDE-MDSLS 405

Query: 298  PIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSS 356
            PI+   V DL  +   Q+    G     +LR++R+G+ ++E A  EL G    +W+++  
Sbjct: 406  PIMACQVADLANEDTPQLYMLCGRGPRSTLRVLRHGLEVSEMAVSELPGNPNAVWTVKRR 465

Query: 357  TDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGS 416
             ++ +D +++VSF++ T +L++   + +EE    GF   T TL C     + LVQV    
Sbjct: 466  VEEEYDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSALGDDALVQVYPDG 523

Query: 417  VRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD-GILTE-VKHA 474
            +R + +  R   NEWK+P   ++     N  QV++A   G LVY E+   G L E  +  
Sbjct: 524  IRHIRADKR--VNEWKAPGKKTIMKCAVNQRQVVIALTAGELVYFEMDPTGQLNEYTERK 581

Query: 475  QLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSV- 533
             +  ++SC+ +  +      S+  AVG+  D +VRI SL   + ++   L  + +P +  
Sbjct: 582  AMPSDVSCMALGNVVPGELRSRFLAVGL-ADNTVRIISLDPSDCLSP--LSMQALPAAAE 638

Query: 534  LLCAFE-GIS----------------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQ 576
             LC  E G +                YL   L +G LL  +L+  TG+L D +   LG++
Sbjct: 639  SLCIVEMGAADKKPDSEESTVTQSNLYLNVGLQNGVLLRTVLDPVTGDLADTRTRYLGSR 698

Query: 577  PITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIA 636
            P+ L     + +  V A S R  + Y    +   + ++ + + +   F+S   P+ +   
Sbjct: 699  PVKLFRIKMQGSEAVLAMSSRSWLSYYYQNRFHLTPLSYESLEYASGFSSEQCPEGIVAI 758

Query: 637  KEGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQE--------------------QSRT 674
                L I  ++ +  +  + S PL   PR+ + H E                    Q R 
Sbjct: 759  STNTLRILALEKLGAVFNQISFPLEYTPRKFVVHPETGKMILLETEHNAYTEETKKQRRV 818

Query: 675  FAICSLKNQSCAEESEM-----------------------------HFVRLLDDQTFEFI 705
                 ++  +  EE E+                               +R++D       
Sbjct: 819  QMAEEMQEAAGDEEQELAKEMAEAFLSEDLPESIFGAPKAGSGMWASIIRIIDPVEGRTD 878

Query: 706  STYPLDTFEYGCSILSCSFSD-DSNVYYCVGTA--YVLPEENEPTKGRILVF-IVED-GK 760
                L+  E   SI    F++   +++  VG    Y L    + + G +  F I ED   
Sbjct: 879  KIVRLEQNEAALSIALVKFNNHPESLFLVVGVVKEYQL-SPRQVSFGYLYTFRINEDVTD 937

Query: 761  LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHH---GHIL 817
            L+L+ +     A  ++  ++G+LL  + + ++LY     D G ++L  +C +      I+
Sbjct: 938  LELVHKTTVDEAPAAVCPYHGRLLVGVGRMLRLY-----DLGKKKLLRKCENKYIPNQIV 992

Query: 818  ALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAEN 877
            ++     G  + V D+ +S+ ++ YK +E  +   A D +  W++   ILD D    A+ 
Sbjct: 993  SICAT--GQRVFVSDVQESVYMVRYKRQENQLIIFADDTHPRWITCTTILDYDTVATADK 1050

Query: 878  NFNLFTVRKNS--EGATDE---------ERGRL-------EVVGEYHLGEFVNRFRHGSL 919
              N+  +R +S      DE         +RG L       +V+  +H+GE     +  +L
Sbjct: 1051 FGNIAIIRLSSIITDDVDEDPTGNKALWDRGLLNGASQKADVLANFHVGETCMSLQKATL 1110

Query: 920  VMRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQ 978
            +        G   ++++ +++G +GV+      E + F + L+ ++R     + G +H  
Sbjct: 1111 I-------PGGSESLVYTSLSGTVGVLVPFTSREDHDFFQHLEMHMRSEHPPLCGRDHLS 1163

Query: 979  WRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            +RS+         KN +DGDL E +  +   +   I++ ++ +  ++ K++E++
Sbjct: 1164 FRSY-----YYPVKNVIDGDLCEQYNSIEPAKQKSIAEDLDRNPSDVSKKLEDI 1212


>gi|357606250|gb|EHJ64976.1| putative Splicing factor 3B subunit [Danaus plexippus]
          Length = 1216

 Score =  186 bits (471), Expect = 8e-44,   Method: Compositional matrix adjust.
 Identities = 272/1183 (22%), Positives = 483/1183 (40%), Gaps = 176/1183 (14%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            ++ V I+G I ++  FR  G  +D++ + ++  +  +L++   +  ++ +   +   + G
Sbjct: 53   LMKVEIFGVIRSMMSFRLTGGTKDYIVVGSDSGRIVILEY-IPAKNILEKVHQETFGKSG 111

Query: 61   -RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
             R    GQ   IDP  R  +IG      L  ++  D + +L  +  +   +   L    +
Sbjct: 112  CRRIVPGQYLAIDPKGRAVMIGAIEKQKLVYILNRDAEARLTISSPLEAHKSNTLVYHMV 171

Query: 118  ---YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNG 163
                G   P    L  D ++A    T E A K +    F E     N+        L+  
Sbjct: 172  GVDVGFENPMFACLEIDYEEADSDPTGEAAQKTQQTLTFYELDLGLNHVVRKYSEPLEEH 231

Query: 164  ADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR--- 215
            A+ LI VP     P   GVLI  E  + Y   N      IR  I +    +D D  R   
Sbjct: 232  ANFLITVPGGNDGP--SGVLICSENYLTY--KNLGDQHDIRCPIPRRRNDLD-DPERGMI 286

Query: 216  --------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAV 261
                          +L     G +  + I  +++ VT +K++      +A+ +  L    
Sbjct: 287  FVCSATHKTKSMFFFLAQTEQGDIFKITIETDEDMVTEIKLKYFDTVPVATAMCVLKTGF 346

Query: 262  VYIGSSYGDSQLIKL-------------NLQPDAKGSYVEVLERYV----------NLGP 298
            +++   +G+  L ++             +  P  +G       R +          +L P
Sbjct: 347  LFVACEFGNHYLYQIAHLGDEDDEPEFSSAMPLEEGDTFFFAPRPLRNLVLVDELDSLSP 406

Query: 299  IVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSST 357
            I+   V DL  +   QV    G     SLR +R+G+ + E A  EL G    +W++R   
Sbjct: 407  ILACHVADLTGEDTPQVYLACGRGPRSSLRALRHGLEVAEMAVSELPGSPNAVWTVRRHK 466

Query: 358  DDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSV 417
            DD +D++++VSF++ T +L++   + +EE    GF   T TL CH    + LVQV    +
Sbjct: 467  DDDYDSYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCHALGSDALVQVYPDGI 524

Query: 418  RLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKHAQ 475
            R + +  R   NEWK+P   S+     N  QV++A  GG LVY E+   G L E  +  +
Sbjct: 525  RHIRADKRV--NEWKAPGKKSIVKCAVNQRQVVIALTGGELVYFEMDPTGQLNEYTERKK 582

Query: 476  LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSV-L 534
            L  ++SC+ +  +      +   AVG+  D +VRI SL   + +    +  + +P S   
Sbjct: 583  LSSDVSCMALGSVATGEQRAWFLAVGL-VDNTVRIISLDPADCLAPRSM--QALPASPES 639

Query: 535  LC----AFEGIS----YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSK 586
            LC     FE  +    +L   L +G LL   L+  +G+L D +   LG++P+ L     +
Sbjct: 640  LCIVDQPFESGAKSALHLNIGLSNGVLLRTTLDSVSGDLADTRTRYLGSRPVKLFKVRVQ 699

Query: 587  NTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTI 646
            +   V A S R  + Y    +   + ++ + + +   F+S    + +       L I  +
Sbjct: 700  SAEAVLAVSSRTWLGYQYQNRFHLTPLSYECLEYAAGFSSEQCTEGIVAISSNTLRILAL 759

Query: 647  DDIQKLHIRSI-PLGEHPRRICHQEQSRTFAICSLKNQSCAEE----------SEMHFV- 694
            + +  +  ++   L   PR+      +    +    + +  EE           EM    
Sbjct: 760  EKLGAVFNQTFQQLDYTPRKFVINSDNNHIIVLETDHNAYTEEMKKQRRVQMAQEMREAA 819

Query: 695  --------RLLDDQTFEFIS-TYPLDTF---EYGCSILSCSFS--DDSNVYYCVGTAYVL 740
                    +L ++    F+S   P + F   + G  + +      D S       T  +L
Sbjct: 820  AGGTPEEQQLANEMADAFLSDVLPENIFSSPKAGAGMWASQIRILDMSGGVGGCSTVCLL 879

Query: 741  P-EENEPTKGRILV-----------FIVEDGKLQLIAEKE-TKGAVYSLNAFN-GKL--- 783
            P E+NE      +V            +V   K  L++ +  ++G+++    +N GKL   
Sbjct: 880  PLEQNEAAVSLCVVRWAALTDNTPHLVVGVAKDALLSPRSCSEGSLHVYKIYNTGKLELV 939

Query: 784  -----------LAAINQKIQL-YKWMLR--DDGTRELQSEC-GHHGHILALYVQTRGDFI 828
                       LAA N K+      MLR  D G R+L  +C   H   L   ++T    I
Sbjct: 940  HKTPIDEYPGALAAFNGKLLAGVGRMLRLYDIGRRKLLRKCENRHIPNLIADIKTIRQRI 999

Query: 829  VVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR--K 886
             V D+ +S+  + YK  E  +   A D N  W++   ILD D    A+   N+  +R  +
Sbjct: 1000 FVSDVQESVFCVKYKKRENQLIIFADDTNPRWITNTCILDYDTVAMADKFGNVAVLRLPQ 1059

Query: 887  NSEGATDE---------ERGRL-------EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQ 930
            +     DE         +RG L       ++   +H+GE V   +  +L+        G 
Sbjct: 1060 SVSDDVDEDPTGNKALWDRGLLNGASQKGDITVNFHVGETVTSLQRATLI-------PGG 1112

Query: 931  IPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTV 989
               +++ TV+G +GV       E + F + L+ ++R     + G +H  +RS+       
Sbjct: 1113 SEALLYATVSGALGVFLPFTSREDHDFFQHLEMHMRSENSPLCGRDHLSFRSY-----YY 1167

Query: 990  DAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
              KN +DGDL E F  L   +   I+  +  +  E+ K++E++
Sbjct: 1168 PVKNVIDGDLCEQFNSLEPAKQKAIAGDLERTPAEVSKKLEDI 1210


>gi|449664009|ref|XP_004205855.1| PREDICTED: splicing factor 3B subunit 3-like [Hydra magnipapillata]
          Length = 1213

 Score =  183 bits (465), Expect = 4e-43,   Method: Compositional matrix adjust.
 Identities = 209/941 (22%), Positives = 399/941 (42%), Gaps = 148/941 (15%)

Query: 216  YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIK 275
            +L+    G +  + +  E++ VT ++++      +AS +  L    ++  S +G+  L +
Sbjct: 291  FLVQSEQGDIFKVTLETEEDMVTEIRVKYFDTIPVASALCVLKTGFLFAASEFGNHALYQ 350

Query: 276  L------NLQPDAKG-------------------SYVEVLERYVNLGPIVDFCVVDLERQ 310
            +      + +P+                        + +++   +L PI+   V DL  +
Sbjct: 351  ITHLGDDDEEPEFSSLMATELDEGETFFFHARDLQNLVLVDEMESLAPIMHCQVADLANE 410

Query: 311  GQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTDDPFDTFLVVSF 369
               Q+    G     SLR++R+G+ ++E A  EL G    +W+++ ++ D  D ++VVSF
Sbjct: 411  DTPQLYAACGRGPRSSLRVLRHGLEVSEMAVSELPGNPNSVWTVKKNSTDEQDAYIVVSF 470

Query: 370  ISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRN 429
            ++ T +L++   + +EE    GF   T TL C     + L+Q+    +R + S  R   N
Sbjct: 471  VNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSLLGEDALLQIYPDGIRHIRSDRR--VN 526

Query: 430  EWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKHAQLEYEISCLDINP 487
            EWK+P   ++     N  Q+++A  G  LVY E+   G L E  +  ++  ++ C+ + P
Sbjct: 527  EWKTPGKKNIIQCAVNERQIVIALTGNELVYFELDQSGQLNEYTERKEMSADVVCMALGP 586

Query: 488  IGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS-VLLCAFE------- 539
            +      S+  AVG+  D +VRI SL   + +  E L  + +P +   LC  E       
Sbjct: 587  VHAGEQRSRFLAVGL-IDNTVRIISLDPNDCL--EPLSMQALPAAGESLCIIEMGGTEVG 643

Query: 540  --------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHV 591
                    G  +L   L +G LL  +L+  TG+L+D +   LG++P+ L    +   + V
Sbjct: 644  EKGTAGNAGGLFLNIGLANGVLLRTVLDSVTGDLSDTRIRYLGSKPVKLFNVKTYGCSSV 703

Query: 592  FAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQK 651
             A S R  + Y+   +   + ++ + + +   F S   P+ +       L I  ++ +  
Sbjct: 704  LAMSSRTWLSYTYQSRFHLTPLSYESLEYASSFASEQCPEGIVAISSNTLRILALEKLGA 763

Query: 652  LHIR-SIPLGEHPRRIC--------------HQEQSRTFAI-------------CSLKNQ 683
            +  + + PL   PR+                H   ++TF +             C   ++
Sbjct: 764  VFNQVTTPLDFTPRKFVVDHKSHTLIMIETDHNAMTKTFKLDRKQRIAEEMVESCGDNDE 823

Query: 684  SCAEES------------EMHF-------------VRLLDDQTFEFISTYPLDTFEYGCS 718
            + A ES            E+ F             VR++D    E      L+  E   S
Sbjct: 824  ARAAESQVAANFLNTDIPEIQFGAPKAGIGRWASIVRIMDPIRSETHHEIQLEQDEAAFS 883

Query: 719  ILSCSFSDD-SNVYYCVGTAYVL---PEENEPTKGRILVFIVED-GKLQ--LIAEKETKG 771
            +    F++   + +  VG A  +   P  ++            D G++Q  L+ +     
Sbjct: 884  VCLTEFANQPGDRFVIVGCATKMILNPHSSQSCSIHTYQLYQNDRGRMQLELLHKTPVDD 943

Query: 772  AVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHG-HILALYVQTRGDFIVV 830
               +L +F G+LL  + + +++Y     D G ++L  +C +         + T G  I+V
Sbjct: 944  IPGALCSFQGRLLVGVGRILRIY-----DLGKKKLLRKCENKKVSTFITGIDTIGYRIMV 998

Query: 831  GDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEG 890
             D+  S   L YK ++  +   A D N  +++   ++D D   GA+   N+F VR  ++ 
Sbjct: 999  HDIQDSFHFLKYKRKDNQLVLFADDVNKRFLTCGCLVDYDTMAGADKFGNIFFVRLPADV 1058

Query: 891  ATDEE-----------RGRL-------EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIP 932
              D E           RG L       +VV  YH+GE     +  +L+        G   
Sbjct: 1059 KDDIEEDPTGVKALWDRGLLNGASQKADVVSVYHIGETALSLQKATLI-------PGGSE 1111

Query: 933  TVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDA 991
            ++++ TV+G IG++      E   F + L+ ++R+    + G +H  +RS          
Sbjct: 1112 SLVYTTVSGGIGMLVPFTSREDIDFFQHLEMHMRQDSPPLCGRDHLWFRS-----SFSSV 1166

Query: 992  KNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            KN +DGDL E F  L   +   I++ ++ +  E+ K++E++
Sbjct: 1167 KNCVDGDLCEQFNSLEYAKRKAIAEELDRTPAEVSKKLEDI 1207


>gi|355718834|gb|AES06402.1| splicing factor 3b, subunit 3, 130kDa [Mustela putorius furo]
          Length = 1101

 Score =  182 bits (462), Expect = 9e-43,   Method: Compositional matrix adjust.
 Identities = 256/1124 (22%), Positives = 461/1124 (41%), Gaps = 197/1124 (17%)

Query: 10   IATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTDNGQI 68
            I +L  FR  G  +D++ + ++  +  +L++   S  +  +   +   + G R    GQ 
Sbjct: 2    IRSLMAFRLTGGTKDYIVVGSDSGRIVILEYQP-SKNMFEKIHQETFGKSGCRRIVPGQF 60

Query: 69   GIIDPDCRLIGLHLYDG--LFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YGCAKP 123
              +DP  R + +   +   L  ++  D   +L  +  +   +   L    +    G   P
Sbjct: 61   LAVDPKGRAVMISAIEKQKLVYILNRDAAARLTISSPLEAHKANTLVYHVVGVDVGFENP 120

Query: 124  TIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNGADLLIPVP- 171
                L  D ++A +  T E A   +    F E     N+        L+   + LI VP 
Sbjct: 121  MFACLEMDYEEADNDPTGEAAANTQQTLTFYELDLGLNHVVRKYSEPLEEHGNFLITVPG 180

Query: 172  ----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR------------ 215
                P   GVLI  E  I Y   N      IR  I +    +D D  R            
Sbjct: 181  GSDGP--SGVLICSENYITY--KNFGDQPDIRCPIPRRRNDLD-DPERGMIFVCSATHKT 235

Query: 216  ---------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGS 266
                     +L     G +  + +  +++ VT ++++      +A+ +  L    +++ S
Sbjct: 236  KSMFLSMFFFLAQTEQGDIFKITLETDEDMVTEIRLKYFDTVPVAAAMCVLKTGFLFVAS 295

Query: 267  SYGDSQLIKLN------------------------LQPDAKGSYVEVLERYVNLGPIVDF 302
             +G+  L ++                          QP    + V +++   +L PI+ F
Sbjct: 296  EFGNHYLYQIAHLGDDDEEPEFSSAMPLEEGDTFFFQPRPLKNLV-LVDELDSLSPIL-F 353

Query: 303  C-VVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTDDP 360
            C + DL  +   Q+    G     SLR++R+G+ ++E A  EL G    +W++R   +D 
Sbjct: 354  CQIADLANEDTPQLYVACGRGPRSSLRVLRHGLEVSEMAVSELPGNPNAVWTVRRHIEDE 413

Query: 361  FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLV 420
            FD +++VSF++ T +L++   + +EE    GF   T TL C     + LVQV    +R +
Sbjct: 414  FDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSLLGDDALVQVYPDGIRHI 471

Query: 421  SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKHAQLEY 478
             +  R   NEWK+P   ++     N  QV++A  GG LVY E+   G L E  +  ++  
Sbjct: 472  RADKR--VNEWKTPGKKTIVKCAVNQRQVVIALTGGELVYFEMDPSGQLNEYTERKEMSA 529

Query: 479  EISCLDIN--PIGENPS-YSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIP-RSVL 534
            ++ C+ +   P GE  S +  + AVG+  D +VRI SL   + +  + L  + +P +   
Sbjct: 530  DVVCMSLANVPPGEQRSRFLAVLAVGL-VDNTVRIISLDPSDCL--QPLSMQALPAQPES 586

Query: 535  LCAFE----------------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPI 578
            LC  E                G  YL   L +G LL  +L+  TG+L+D +   LG++P+
Sbjct: 587  LCIVEMGGTEKQDELGERGSIGFLYLNIGLQNGVLLRTVLDPVTGDLSDTRTRYLGSRPV 646

Query: 579  TLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKE 638
             L     +    V A S R  + YS   +   + ++ + +     F S   P+ +     
Sbjct: 647  KLFRVRMQGQEAVLAMSSRSWLSYSYQSRFHLTPLSYETLEFASGFASEQCPEGIVAIST 706

Query: 639  GELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQEQSR-----------TFAICSLKNQSC 685
              L I  ++ +  +  + + PL   PR+ + H E +            T A  + + Q  
Sbjct: 707  NTLRILALEKLGAVFNQVAFPLQYTPRKFVIHPESNNLIIIETDHNAYTEATKAQRKQQM 766

Query: 686  AEE-------------SEM-------------------------HFVRLLDDQTFEFIST 707
            AEE             +EM                           +R+++      +  
Sbjct: 767  AEEMVEAAGEDERELAAEMAAAFLNENLPESIFGAPKAGNGQWASVIRVMNPIQGNTLDL 826

Query: 708  YPLDTFEYGCSILSCSFSDDSNVYYC-VGTAYVLPEENEPTKGRILVF--IVEDG-KLQL 763
              L+  E   S+  C FS+  + +Y  VG A  L        G  +    +V +G KL+ 
Sbjct: 827  VQLEQNEAAFSVAVCRFSNTGDDWYVLVGVAKDLILNPRSVAGGFVYTYKLVNNGEKLEF 886

Query: 764  IAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY--- 820
            + +   +    ++  F G++L  + + +++Y     D G ++L  +C  + HI A Y   
Sbjct: 887  LHKTPVEEVPAAIAPFQGRVLIGVGKLLRVY-----DLGKKKLLRKC-ENKHI-ANYISG 939

Query: 821  VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFN 880
            +QT G  ++V D+ +S   + YK  E  +   A D    W++   +LD D   GA+   N
Sbjct: 940  IQTIGHRVIVSDVQESFIWVRYKRNENQLIIFADDTYPRWVTTASLLDYDTVAGADKFGN 999

Query: 881  LFTVR--KNSEGATDE---------ERGRL-------EVVGEYHLGEFVNRFRHGSLVMR 922
            +  VR   N+    DE         +RG L       EV+  YH+GE V   +  +L+  
Sbjct: 1000 ICVVRLPPNTNDEVDEDPTGNKALWDRGLLNGASQKAEVIMNYHVGETVLSLQKTTLI-- 1057

Query: 923  LPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLR 965
                  G   ++++ T++G IG++     HE + F + ++ +LR
Sbjct: 1058 -----PGGSESLVYTTLSGGIGILVPFTSHEDHDFFQHVEMHLR 1096


>gi|300120114|emb|CBK19668.2| unnamed protein product [Blastocystis hominis]
          Length = 1240

 Score =  180 bits (457), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 267/1171 (22%), Positives = 471/1171 (40%), Gaps = 173/1171 (14%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESS---ELITRAMGDVSDRIGRP 62
            +YG I  L+ FR  G   DF+ I ++  +  VL++D E +   ++     G V    G P
Sbjct: 57   VYGLIRALKPFRLSGNNTDFVLIGSDSGRIVVLKYDTEKNTFEKIHQETFGKVGVIRGLP 116

Query: 63   TDNGQIGIIDPDCR--LIGLHLY----------DGLFKVIPFDNKGQLKEAFNIRLEELQ 110
               GQ   +DP+ R  +I   L+            L  V+  D++G +  +  +   + Q
Sbjct: 117  ---GQYLAVDPNGRAFMISGFLFHVSSLGAIEKQNLVYVLNRDSRGNVTISSPLEAHKSQ 173

Query: 111  --VLDIKFL-YGCAKPTIVVLYQDNKDARHVKTYEVALKDK------------DFVEGPW 155
              VL  + L  G   P    L  +  +A    T E A + +            + V   W
Sbjct: 174  TIVLTTEGLDVGYDNPMFACLELNYGEADEDSTGEAAQQTEKVLTYYQLDLGLNHVIRKW 233

Query: 156  SQNNLDNGADLLIPVP--PPLCGVLIIGEETIVYCSANAFK---AIPIRPSITKAYGRVD 210
            S+  ++  A+LL  VP      GVL+  E  ++YC+ N  +    IP R  + +  G + 
Sbjct: 234  SK-TVERSANLLARVPGDSGPGGVLVFAENWVMYCNENHPEIRAPIPRREFLPENRGTMI 292

Query: 211  ADGSR--------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVV 262
               +         YL+    G L+ + ++ +   V  L ++       A+T+       +
Sbjct: 293  ITAALFQREGNLFYLVQSEYGDLYKITLSFDSSHVHDLVVKYFDTIPAATTLHITKKGFL 352

Query: 263  YIGSSYGDS--------------QLIKLNLQPDAKGSYVEVLE----------------- 291
               S  GD               Q I L     A+ S++  L+                 
Sbjct: 353  LATSEVGDQYALPLALLTGSCFYQFIALGDDDSAQASFLGSLQPDGSVQVPVFSPRALTN 412

Query: 292  -RYVN----LGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQ- 345
             R V+    L P+V     DL   G  Q+    G      L++++ G+ I+  +   L  
Sbjct: 413  LRPVDTLPSLSPLVKIHAEDLRGDGTPQLFALCGRSSRSQLKVLQQGLAISLLSQNPLPY 472

Query: 346  GIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAI 405
               G+W+LR +       F+V+SF + T +LA+     +E+    GF     TL      
Sbjct: 473  APSGLWTLRDARTGQ-HRFMVISFNNATIVLAVG--KSVEQVMDTGFKLDESTLATGVLE 529

Query: 406  YNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD 465
             N  +QV  G  R +          W+ P   S+  A  N  Q+++A   G ++Y E+ D
Sbjct: 530  GNSFLQVYPGGFRQIFEDGHT--KVWEPPSRRSITAAAMNLRQIVVALSNGEVLYFEL-D 586

Query: 466  GILTEVKHAQLEY--EISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEH 523
              L  V+   + +  E+ CLD+  +  N   +   AVG + D S R++SL   +L+ +  
Sbjct: 587  ERLEWVERESMNHREEVICLDLPALAPNSLRAPFLAVG-YGDRSCRVYSLAPNSLLEELS 645

Query: 524  LGG-EIIPRSVLLCAFEGIS------YLLCALG--DGHLLNFLLNMKTGELTD-RKKVSL 573
            +     +P ++ L      S       LL  +G  +G L+   ++  +G+L++ R    L
Sbjct: 646  MQALNAMPSNLTLDTMRMGSGSLARETLLLTVGMENGILMRVEVDPVSGKLSNARSTRFL 705

Query: 574  GTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSL 633
            G +P+ L    +     V A S +P + Y +N  L  +++    +    PF +    + +
Sbjct: 706  GPRPVRLFKILAGGNPCVLALSVKPWLCYCANNTLTLTSLVSDPLDLAAPFCNEDCSEGI 765

Query: 634  AIAKEGELTIGTIDDI-QKLHIRSIPLGEHPRR-ICHQEQSRTF-------AICSLKNQS 684
                   L I  IDD+ Q     SIPL   PR  + +  Q R         A   L+ QS
Sbjct: 766  VCVAGTNLNIIRIDDLTQPFTATSIPLSYTPRELVVYPGQPRLLLLETDHNAYSELEKQS 825

Query: 685  CAEESEMHFV--------------------RLLDDQTFEFISTYPLDTFEYGCSILSCSF 724
              ++  + +V                    R++D  + + +    L   E   S+  C F
Sbjct: 826  FYQQHNVSYVNEYDCGAPIPAEPDKWASCIRVVDAISLQTLERLELADNEAAFSMCVCRF 885

Query: 725  SDDSN-VYYCVGTAYVLP-EENEPTKGRILVF-IVEDGKLQLIAEKETKGAVYSLNAFNG 781
            +   +  +  +GTA  L       ++G I VF  VE   LQL+   E      +L  F+G
Sbjct: 886  ASKGDEPFVVIGTAKNLKIHPRSCSQGFISVFRFVEGHSLQLLHRTEVDEVPAALCEFDG 945

Query: 782  KLLAAINQKIQLYKWMLRDDGTRELQSECGHHG--HILALYVQTRGDFIVVGDLMKSISL 839
            KL A I + +++Y     D G ++L  +C +    H +   ++  G+ +  GDL  ++S 
Sbjct: 946  KLAAGIGRSVRVY-----DLGKKKLLRKCENKAMPHFVT-KLRAMGERLYAGDLTDNVSF 999

Query: 840  LIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSE---------- 889
            + ++     + E A       ++A+++LD +  +  +   NLF  R + +          
Sbjct: 1000 VKFRKGTNQLVEFADGGIPRSITALDVLDYNTVVCGDKGGNLFVERVDPKVDDDIANPTG 1059

Query: 890  -------GATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGV 942
                   G       + E     +LGE V   +   L+   P  D      V++GT+ G 
Sbjct: 1060 SRSLWNSGLLSAAPNKAEQAASIYLGEIVTSVQKTVLI---PGGD----EVVLYGTIFGT 1112

Query: 943  IGVIASLPHEQYL-FLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIE 1001
            IG +  +P    L  L  ++  +RK    + G +   WRS          K  +DG+L E
Sbjct: 1113 IGALLPMPSRDDLHHLMHIEMYIRKQEPSLVGRDILSWRS-----AYTPMKGIIDGNLCE 1167

Query: 1002 SFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            +F  L + + +EI+  + +SV  + K++E+L
Sbjct: 1168 TFSMLPQIKQEEIANALVLSVSSIVKKMEDL 1198


>gi|326469377|gb|EGD93386.1| splicing factor 3B subunit 3 [Trichophyton tonsurans CBS 112818]
          Length = 1188

 Score =  180 bits (457), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 276/1166 (23%), Positives = 478/1166 (40%), Gaps = 186/1166 (15%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
            ++G I +L  FR  G ++D++ I ++  +  ++++   +     R   +   + G R   
Sbjct: 64   VFGIIRSLAAFRLAGSSKDYIIIGSDSGRITIVEY-VPAQNRFNRIHLETFGKSGVRRVV 122

Query: 65   NGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEE-------LQVLDIK 115
             GQ   +DP  R  LI     + L  V+  + + +L  +  +           L  LD+ 
Sbjct: 123  PGQYLAVDPKGRACLIASVEKNKLVYVLNRNAQAELTISSPLEAHRPQTVVFALTALDVG 182

Query: 116  F---LYGCAKPTIVVLYQDNKDARHVKT------YEVALKDKDFVEGPWSQNNLDNGADL 166
            +   ++   +     + QD     + +T      YE+ L     V   W+ + +D  A +
Sbjct: 183  YENPIFAALEVEYTEVDQDPTGQAYEETEKMLVYYELDLGLNHVVRR-WA-DPVDRTASM 240

Query: 167  LIPVP-----PPLCGVLIIGEETIVYCSAN--AFKA-IPIR--PSITKAYGRVDADGSR- 215
            L  VP     P   GVL+  E+ IVY  +N  AF+  IP R  P+      R    G   
Sbjct: 241  LFQVPGGADGP--SGVLVCAEDNIVYRHSNQDAFRVPIPRRRGPTENPERKRCITAGVMH 298

Query: 216  -------YLLGDHAGLLH---LLVITHEKEKVTG----LKIELLGETSIASTISYLDNAV 261
                   +LL    G L    + ++  E EK TG    LK++      +AS++  L +  
Sbjct: 299  KMRGAFFFLLQSEDGDLFKVTMEMVEDENEKATGEVKRLKLKYFDTVPLASSLCILKSGF 358

Query: 262  VYIGSSYGD-----------------------SQLIKLNLQP----DAKGSYVEVLERYV 294
            +++ S  G+                       S +I   L P          + ++E   
Sbjct: 359  LFVASETGNQHFYQFEKLGDDDDEIEFISDDYSAVISEPLPPVYFRPRPAENLNLVESIA 418

Query: 295  NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSL 353
            +L P++   + ++  +   Q+ T  G            G  I+E    EL  +   +W+ 
Sbjct: 419  SLNPLMAASITNITEEDAPQIYTLCG-----------TGARISEIVESELPSVPSAVWTT 467

Query: 354  RSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVT 413
            + S +D FD ++V+  I ET          +EE    GF S   TL       + L+QV 
Sbjct: 468  KLSRNDQFDAYIVL--IGET----------VEEVTDTGFLSSAPTLAVQQLGEDSLIQVH 515

Query: 414  SGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTEV- 471
               +R + +  R   NEW +P   S+  AT N  QV +A   G +VY E+  DG L E  
Sbjct: 516  PKGIRHIHADQR--VNEWPAPQHRSIVAATTNERQVAIALSSGEIVYFEMDTDGSLAEYD 573

Query: 472  KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIP 530
            +  Q+   ++CL +  + E    S   AVG   D +VRI SL PD  L  K        P
Sbjct: 574  EKRQMSGTVTCLSLGEVPEGRVRSSFLAVGC-DDSTVRILSLDPDSTLENKSVQALTSAP 632

Query: 531  RSV-LLCAFEGIS-----YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFS 584
             ++ ++   +  S     YL   L  G  L  +L+  TGEL+D +   LG +P+ L + S
Sbjct: 633  SALSIMSMIDSTSGGSTLYLHIGLYSGIYLRTVLDEVTGELSDTRTRFLGVKPVKLFSVS 692

Query: 585  SKNTTHVFAASDRPTVIYS--SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELT 642
             K    V A S R  + YS    K    + +N   +     F+S    + +   +   L 
Sbjct: 693  VKEQRAVLALSSRSWLGYSDVQTKSFTLTPLNYVGLEWSWNFSSEQCVEGMVGIQGQNLR 752

Query: 643  IGTIDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQ--SCAEESEM-------- 691
            I +I+ +   L    IPL   PR      +   F +    N   S A ++++        
Sbjct: 753  IFSIEKLDNNLLQEPIPLAYTPRNFVRHPEYPLFYVIGSDNNILSPATKAKLLSESTAVN 812

Query: 692  ------------------HF---VRLLDD-QTFEFISTYPLDTFEYGCSILSCSF-SDDS 728
                              H+   ++++D   T   +S   L+  E   SI + SF S + 
Sbjct: 813  GDSAELPPEDFGYPRGTNHWASSIQVVDPIHTKSVLSNLELEDNEAAVSIAAVSFTSQED 872

Query: 729  NVYYCVGTAY-VLPEENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVYSLNAFNGKLLA 785
              +  VGT   ++      T G I ++   E+GK L+ I + + +    +L  F G+LLA
Sbjct: 873  ETFLVVGTGKDMVVSPRTFTCGFIHIYRFQEEGKELEFIHKTKVEQPPLALLGFQGRLLA 932

Query: 786  AINQKIQLYKWMLRDDGTRELQSEC-GHHGHILALYVQTRGDFIVVGDLMKSISLLIYKH 844
             I   +++Y     D G R+L  +C       + + +QT+G  I+V D+ +S++ ++YK+
Sbjct: 933  GIGPDLRIY-----DLGMRQLLRKCQAQITPRVIVGLQTQGSRIIVSDVQESVTYVVYKY 987

Query: 845  EEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDEERGRLEV 901
            +E A+   A D    W +   ++D +   G +   N++ +R   K SE A ++  G   +
Sbjct: 988  QENALIPFADDIIPRWTTCTTMVDYETVAGGDKFGNIWLLRCPTKASEEADEDGSGAHLI 1047

Query: 902  VGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGT--------------VNGVIGV-I 946
                +L    NR    SLV+     D   IPT I  T              + G +G+ +
Sbjct: 1048 HERQYLQGAPNRL---SLVIHFYSQD---IPTSIQKTQLVAGGRDILVWTGLQGTVGMFV 1101

Query: 947  ASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDL 1006
              +  +   F + L+  L      + G +H  +R +         K  +DGDL E+FL L
Sbjct: 1102 PFITRDDVDFFQTLEMQLASQNPPLAGRDHLIYRGY-----YAPCKGVIDGDLCETFLLL 1156

Query: 1007 SRTRMDEISKTMNVSVEELCKRVEEL 1032
               +   I+  ++ SV E+ +++ ++
Sbjct: 1157 PNDKKQAIAGELDRSVREIERKISDM 1182


>gi|406602265|emb|CCH46158.1| Pre-mRNA-splicing factor [Wickerhamomyces ciferrii]
          Length = 1123

 Score =  179 bits (454), Expect = 7e-42,   Method: Compositional matrix adjust.
 Identities = 256/1125 (22%), Positives = 465/1125 (41%), Gaps = 158/1125 (14%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            +L+V I+  I ++E F+  G  +D+L I ++   F +L+ D  + +  T           
Sbjct: 59   LLNVQIFAIIRSMENFKIAGSGKDYLAITSDSGNFSILELDLNNGKFNTLFNEPYHKSGI 118

Query: 61   RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEE------------ 108
            R    GQ   +DP  R I       L   I    K +L   FN  L+             
Sbjct: 119  RRLSPGQDLAVDPKGRAI-------LATAIE---KNKLCYVFNRDLDNNLTISSPLEANR 168

Query: 109  ---LQVLDIKFLYGCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNL----- 160
               L +  I    G   P    +  D  +  +++      +   F E     N++     
Sbjct: 169  SKILTLTSIGLDVGYENPVFATIEIDYSNYEYIENLTEVERFLTFYELDLGLNHIVRRKT 228

Query: 161  ---DNGADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDAD 212
                + ++LL+ VP     P   GVL+  +  I Y +    K     P +  +     ++
Sbjct: 229  EKIHDSSNLLLQVPGGNDGP--SGVLVCSKNLISYRNLIGDKVSINIPKLENS-----SN 281

Query: 213  GSRYLLGDHAGLLH-------LLVITHEKEKVTGLKIELLGETSI----------ASTIS 255
               +++   AG++H        +V T+  +     KIELL E SI          AS+I 
Sbjct: 282  QDSFIV---AGVVHKMKNQFFFIVQTNHGDL---FKIELLEEGSITISYFDTIPLASSII 335

Query: 256  YLDNAVVYIGSSYG------------DSQLIKLNLQPDAKGSYVEVLERYVNL------- 296
             L +  +Y  S YG            +S  I  N +P  + ++    E    L       
Sbjct: 336  ILRSGFLYCDSEYGSKFFYQFEKLGSESPFISSNDEPIEELTFERTEELDNLLLVDVLDS 395

Query: 297  -GPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKG-MWSLR 354
              P++D  +V+       ++ + SG     SL+I++ G+ INE    +L GI   +W+ +
Sbjct: 396  LNPLIDSKLVN--DDAFTKIYSLSGVKDSSSLKILQYGLSINEIVESDLPGIANKVWTTK 453

Query: 355  SSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTS 414
             + +D FD +LV+SF+  T +L++   + +EE    G     +T+       N LVQ+ S
Sbjct: 454  LNKNDEFDKYLVISFMDTTLVLSIG--ENVEEITDSGLALNEETIGIQQIGINSLVQIHS 511

Query: 415  GSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDG--ILTEVK 472
              +R + +   EL NEW+ P G  +   +    Q+ +      LVY E+ D   ++   +
Sbjct: 512  NGIRNIKNG--ELINEWQPPAGIKILTTSTTNRQIAIGLSNDELVYFEVDDRDRLIEYNE 569

Query: 473  HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL---PDLNLITKEHLGGEII 529
              +L   I  L +  I E    S    VG   D ++R+ S      L L++ + L    I
Sbjct: 570  RKELTSRIVSLSLGDIPEGRLRSPFLIVG-CQDSTIRVLSTDPGSTLELLSLQALSS--I 626

Query: 530  PRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTT 589
            P  +L  +     ++   L +G  +  L++ + G+L+D +   LG +P++L   S     
Sbjct: 627  PFDILTLSMNNQLFVNIGLENGVYVRTLIDEQNGQLSDTRIKYLGNKPVSLSKISISGVN 686

Query: 590  HVFAASDRPTVIY-SSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDD 648
             V A S++  +++ + N     +++ +  +     FNS    D +    +  L I TIDD
Sbjct: 687  VVLAFSNKTWIVHETKNSTFKINSLLINPLKFGFMFNSEDCIDGIVGVYKKNLIIFTIDD 746

Query: 649  IQK-LHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFIST 707
            +    +I SI L   P+ +    Q     I   +N S      + F+   D Q  E   +
Sbjct: 747  LDNDFNINSIQLKTTPKNLLKNNQD----IFITQNSSS-----LGFIEKFD-QELELNKS 796

Query: 708  YPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFI-VEDGKLQLIAE 766
              L   E   +     F   S  Y  V  +       +   G    FI   D +L LI  
Sbjct: 797  IQLSESETIITSTIVKFESKSTSYLLVSIS------KKHDNGSNEFFINTYDLELNLIHI 850

Query: 767  KETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILA--LYVQTR 824
             +     Y++  F GK+L  I+  ++L+     D G ++L S+       ++  + ++T+
Sbjct: 851  TKISDIAYAITEFQGKVLIGISNHLRLF-----DMGLKQLLSKANSKIDTISKIVKIETQ 905

Query: 825  GDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTV 884
            G  +VVGD+ +SI+ L+YK +         D     ++++++LD +  +G +   NLF +
Sbjct: 906  GYRVVVGDIRESITFLVYKPKSNEFLTFTDDILPRHITSIKMLDYNTVIGGDKFGNLFIL 965

Query: 885  RKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGS--LVMRLPDSDVGQIPT--------- 933
            R + E +   +     ++ +       ++F +G+   +M L +  +  IPT         
Sbjct: 966  RASEESSKISDTNSTFLITK-------DKFLNGAPFKLMNLCNFFIEDIPTSFAKGSFTI 1018

Query: 934  -----VIFGTVNGVIG-VIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKK 987
                 +I+  + G +G +I  L      F   L+  +R     + G  + ++R +     
Sbjct: 1019 GGKDLIIYTGLQGTVGALIPLLTKSDIKFFINLEKQMRLHKPDLLGRMNLKYRGYYQP-- 1076

Query: 988  TVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
                KN +DGDLIE F  LS +   EIS  ++ +  E+ K+++E+
Sbjct: 1077 ---VKNVIDGDLIELFNTLSESTKVEISNELDKTPREISKKIDEI 1118


>gi|328770812|gb|EGF80853.1| hypothetical protein BATDEDRAFT_29900 [Batrachochytrium dendrobatidis
            JAM81]
          Length = 1213

 Score =  178 bits (451), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 252/1182 (21%), Positives = 480/1182 (40%), Gaps = 177/1182 (14%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            +L   ++G I +L  FR  G ++D+L + ++  +  +L+++   +               
Sbjct: 53   LLSHQVFGLIRSLAAFRLAGASKDYLVVGSDSGRIVILEYNPAKNTFDKVHQETYGKTGA 112

Query: 61   RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVL---DIK 115
            R    GQ    DP  R  +IG      L  ++  D+  +L  +  +   +  ++    I 
Sbjct: 113  RRVVPGQYLAADPKGRAVMIGAIEKQKLAYILNRDSSLKLTISSPLEAHKTAIICHGIIG 172

Query: 116  FLYGCAKPTIVVLYQDNKDARHVKTYEVALKDK------------DFVEGPWSQNNLDNG 163
               G   P    +  D  DA    T E A   +            + +   W++  +D  
Sbjct: 173  VDVGYENPIFASIEVDYSDADQDPTGEAARDAEKMLTYYELDLGLNHIVRKWTEK-IDFS 231

Query: 164  ADLLIPVP-----PPLCGVLIIGEETIV--------YCSANAFKAIPIRPSITKAYGRVD 210
            A+ LI +P     P   GVL+  E  I         +C     +  P+   I     +V 
Sbjct: 232  ANHLIAIPGGDDGP--SGVLVCSEGRITWKHMQKPSFCVPIPQRPDPLTSVIPGTQNKVI 289

Query: 211  ADGS-------RYLLGDHAGLLHLLVITHE--------KEKVTGLKIELLGETSIASTIS 255
               S        + +     +  +  IT E          +V  +KI+      +A  +S
Sbjct: 290  VVSSVVHRLKRGFFVLAQTEVGDVFKITVECTADSSGNAGQVQAIKIKYFDTIPVAVNLS 349

Query: 256  YLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEV-------------------------- 289
             L +  ++  S +G+  L ++    D   S VE                           
Sbjct: 350  LLKSGFLFAASEFGNHYLYQIENLGDDDESQVEYSSVDFPQGDSLPDSDSMPVVAFIPRE 409

Query: 290  ------LERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVE 343
                  ++   +L P++D  V++L  +   Q+    G     + RI+R+G+ ++E A  E
Sbjct: 410  LRNLAPVDEMESLCPLIDAKVLNLTDENSPQIYALCGRGSRSTFRILRHGLDVSEMAVSE 469

Query: 344  LQG-IKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCH 402
            L G    +W++R S  D +D+++++SFI+ T +L++   + +EE    G  + T T+   
Sbjct: 470  LPGNPNAIWTVRRSVSDIYDSYIIISFINATLVLSIG--ETVEEVTDTGVLATTSTITVG 527

Query: 403  DAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLE 462
                N LVQV    +R + +  R   +EWK+P   S+  A  N  QV++A     +VY E
Sbjct: 528  QLGENALVQVYPQGIRYIRADKRV--SEWKAPTNQSIVSAACNQRQVVVALSNNEIVYFE 585

Query: 463  IG-DGILTEVK-HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PD--LN 517
            +   G L E +   ++   + CL ++PI      ++  A+G   D++VRI SL P+  L+
Sbjct: 586  LDVSGHLNEFQDRKEMSSRVLCLSLSPIPTGRLRARFLAIGC-ADLTVRILSLDPESCLH 644

Query: 518  LITKEHLGGEIIPRSVLLCAFE----GIS--YLLCALGDGHLLNFLLNMKTGELTDRKKV 571
             ++ + L     P S+ +        G+S  +L   L +G LL   ++  +G ++D +  
Sbjct: 645  PLSMQALSAP--PDSLAMIDMPDPTTGVSNLFLNIGLANGVLLKTCVDSGSGNMSDTRMR 702

Query: 572  SLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPD 631
             LG++ + L     +  T + A S RP + ++ + +     ++ + + +   F S    +
Sbjct: 703  FLGSRGVKLFLLKIQGETGLLALSSRPWISFTYHSRSKLMPLSYESLEYGSSFCSEQCTE 762

Query: 632  SLAIAKEGELTIGTIDDIQKLHIR-SIPLGEHPRRICHQEQSRTF--------------- 675
             +       L I   + +  +  + SIPL   PRR    + S+ F               
Sbjct: 763  GIVAITGNTLRILNTEKLGSVFKQASIPLKYTPRRFIFDQVSQNFVVIESDHGVFCPSDR 822

Query: 676  ---------------------AICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFE 714
                                 A+    +     E     +R++     E +    LD  E
Sbjct: 823  AKILETKATLDADEGTIPEELAVEQFGHSKAGPERWASCIRVISPIHGETLHLEDLDDNE 882

Query: 715  YGCSILSCSF-SDDSNVYYCVGTAY-VLPEENEPTKGRILVF-IVEDG-KLQLIAEKETK 770
                I  C F S  +  +  VGTA  V    +  T+G + V+ +  DG  L+ + +   K
Sbjct: 883  AAFCISFCIFQSSLTTTHIVVGTASNVNLSSSSFTEGYLRVYKLAPDGTSLEFLHKTPIK 942

Query: 771  GAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY-VQTRGDFIV 829
            G    + +F G+LL  +   +++Y     D G +++  +C   G    +  + T+G+ I+
Sbjct: 943  GIPKVMCSFQGRLLVGVGSLLRIY-----DLGKKKMLRKCECKGFPTTIVTLHTQGNRII 997

Query: 830  VGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSE 889
            +GD  +S+   +Y+  +  I   A D    W++A  ++D D  +G +   N+F  R ++E
Sbjct: 998  LGDAQESVHYAMYRAFDNRIVIFADDTIPRWVTATCMVDYDTVVGGDKMGNIFVNRLSAE 1057

Query: 890  --GATDE---------ERGRLEVV-------GEYHLGEFVNRFRHGSLVMRLPDSDVGQI 931
                 DE         +RG L+          ++ LGE +      SLV        G  
Sbjct: 1058 VSKGIDEDTTGNQAIFDRGYLQGAPHKVHHEADFFLGETLTSLTKTSLV-------PGGR 1110

Query: 932  PTVIFGTVNGVIG-VIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVD 990
              +++ T+ G IG +I  +  +   F + L+  +R     + G +H  +RSF      + 
Sbjct: 1111 EILLYTTLMGGIGLLIPFISKDDVDFFQTLEMTMRSECPPLCGRDHLAYRSFYTPVHAI- 1169

Query: 991  AKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
                +DGDL E F  +   +   I+++++ SV ++ K++E++
Sbjct: 1170 ----IDGDLCEMFNVMVGDKKRGIAESVDRSVADVGKKLEDM 1207


>gi|58269920|ref|XP_572116.1| hypothetical protein [Cryptococcus neoformans var. neoformans JEC21]
 gi|57228352|gb|AAW44809.1| conserved hypothetical protein [Cryptococcus neoformans var.
            neoformans JEC21]
          Length = 1276

 Score =  178 bits (451), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 205/844 (24%), Positives = 379/844 (44%), Gaps = 108/844 (12%)

Query: 245  LGETSIASTISYLDNAVVYIGSSYGDSQLIKLNL-------------------------- 278
            LG +S  S+I+YLD++ +++ S+  DS L++L                            
Sbjct: 413  LGASSPPSSITYLDSSHLFVSSAVADSVLLRLPKVESSSSVSSGKGKGRAVTSPIGDQAD 472

Query: 279  -----------QPDAKGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSL 327
                       + D  G   E+LER++N+ P+ D CVV  E      +V  SGA +  SL
Sbjct: 473  KWEVLYEIGKDRNDTDGGP-EILERWMNIAPVKDLCVVKDEGGNLSHLVLASGASESNSL 531

Query: 328  RIVRNGIGINEQASVE-LQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEE 386
            R+VR+G+G+ E  +++ L  ++ MW L  ST  P    L++S  + T  + + L+ E+  
Sbjct: 532  RVVRSGVGLEELVTIQGLHDVQKMWPLTDSTAIPR---LLLS--TSTSTILLQLQPEISA 586

Query: 387  TEIEGFCSQTQTLFCHDAIYNQLV-QVTSGSVRLVSSTS-RELRNEWKSPPGYSVNVATA 444
              I      ++TL        +L+ QVT   + L S  S  +L  + +      +  A  
Sbjct: 587  IPITDVIFNSETLAAGILPGAELLAQVTPRGLSLWSDLSVGQLEAQVEVDKETEIVCAQV 646

Query: 445  NASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWT 504
             A   ++A  GG LV   + +   +      ++ E+S + I+    + S   + +  M  
Sbjct: 647  TADWAVVAKKGGSLVVFHVSNTGFSPQGTIDVKEEVSAVAISSSSGSSSPIIVISTWM-- 704

Query: 505  DISVRIFSLPDL-NLITKEHLGGEIIPRSVLLCA---FEGISYLLCALGDGHLLNFLLNM 560
                 +++L  + N +    +  E    S+ L +   +     LL  L +G L  + LN 
Sbjct: 705  -AKTFVYTLSQISNGVDGLSIQSESSATSLQLRSHPFYPAGIQLLSGLDNGLLHIYDLNT 763

Query: 561  K-TGE-----LTDRKKVSLGTQPITLRTFSSKNTTHVFAA---SDRPTVIYSSNKKLLYS 611
              +GE     +   K  SLG +P+ L    S +      +   ++R +VI+ S  ++ +S
Sbjct: 764  SDSGEAEGLMVKSSKTTSLGLRPLVLHPCESTHGDEKVISVGLTERMSVIFESKDRIEFS 823

Query: 612  NVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG-EHPRRICHQE 670
            +VN+K V      ++++ P     ++   L++  ++ ++KLH+++   G E   ++ + +
Sbjct: 824  SVNIKNVMAATSVDTSSGPVFALFSRTSGLSLVKVNSLKKLHVQTCDTGNESISKLTYMD 883

Query: 671  QSRTFAICSLKNQSC--AEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDS 728
            + +  A  S +       +  E +FV++ D  + E +S++ L   E   S+ S   +   
Sbjct: 884  EYKAIACGSTRRTQLRDGDVEEENFVQIRDGTSLEPLSSFSLRGRELVTSLRSVFLT--G 941

Query: 729  NVYYCVGTAYVLPEENEPT-----------KGRILVFIVEDGK------LQLIAEKETKG 771
             +Y  VGTA++  ++ E +           +GR+L+   ++G       +++ AE  T G
Sbjct: 942  RIYLAVGTAFLPADDGEDSSWDEGNLAVVREGRVLLLEFKEGDAGGGWDIKIKAELATVG 1001

Query: 772  AVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQ-SECGHHGHILAL-------YVQT 823
            AVY+L   +G L  A   K+ +++    D    EL+ +      ++++        +++ 
Sbjct: 1002 AVYALEEIHGFLAVAAGSKLTIHR---LDHNPVELEETSSWASAYVISSLSVLPPSHIRP 1058

Query: 824  RGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDD--DIYLGAENNFNL 881
             G  IV GD M+S+ +L     +G I +  R+   + ++A+ +L D  D  + ++   NL
Sbjct: 1059 EGALIV-GDGMRSVIVLNVDEGDGMIYDDERNMATHGVTALGLLKDKGDGVVISDAYSNL 1117

Query: 882  FTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNG 941
             T R N          +LE    + L E V RF+ GSLV      ++  IP V+F T  G
Sbjct: 1118 LTYRLNQ---------KLERAATFGLHEEVTRFQSGSLVPTTTAPEI-IIPDVLFATREG 1167

Query: 942  VIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIE 1001
             +G+I  L       L+ LQ N+ K+ KG G +    WR   +     D   F+DGD ++
Sbjct: 1168 RLGIIGELGTRSSRTLDDLQRNMSKIWKGPGEVGWSNWRRAGSNLVGKDTAGFVDGDFVQ 1227

Query: 1002 SFLD 1005
             FLD
Sbjct: 1228 KFLD 1231


>gi|145510432|ref|XP_001441149.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124408388|emb|CAK73752.1| unnamed protein product [Paramecium tetraurelia]
          Length = 1174

 Score =  177 bits (449), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 207/992 (20%), Positives = 409/992 (41%), Gaps = 164/992 (16%)

Query: 159  NLDNGADLLIP-VPPPLCGVLIIGEETIVY-CSANAFKAIPI---------RPSITKAYG 207
            ++ + A L++P  P    GV+++ E+ + Y    N    +P          R ++  ++G
Sbjct: 224  HISSHALLMVPNAPDGPGGVIVLCEDFLHYRGPKNQEMKVPYPKNQGMPTDRGAMISSFG 283

Query: 208  RVDADGS-RYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGS 266
                  S  YLL    G L  L +   K++V  +++       +A+++  + +  ++   
Sbjct: 284  FYKQKSSFLYLLQSEYGDLFELSLQFTKDEVHSIRMIYFDTIPVANSLCLMRSKHLFAAC 343

Query: 267  SYGDSQLIKLN----------------------LQPDAKGSYVEVLERYVNLGPIVDFCV 304
              G+    K                         +P  K  ++ +L+   N   I D  V
Sbjct: 344  EKGNHCFYKYQREETNTNIICTDSSMALEEQIFFKPQ-KLKHLSLLQELNNFSCISDLKV 402

Query: 305  VDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIK-GMWSLRSSTDDPFDT 363
             DL ++G  Q+  C  A    +LR++R+G+ I + A+  LQ    G+W+L+   +DP   
Sbjct: 403  ADLAKEGNPQIYICCAAGNRSTLRVLRHGLEITQLANTNLQAKPLGIWTLKERYEDPVHK 462

Query: 364  FLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSST 423
            ++V+S+I++T  L + + +++E+    G     QT+     I +  +Q+ +   R +   
Sbjct: 463  YIVISYINKT--LVLKIGEKVEQVHDTGLEGTKQTIHVGTLIDDSQIQILTNGYRHIRKN 520

Query: 424  SRELRNEWKSPPGYSVN----VATANASQVLLATGGGHLVYLEIG----DGILTEVKHAQ 475
                    K P  Y ++       +N  QV  A  GG + Y E+      G L E+   Q
Sbjct: 521  --------KPPTDYIIDGKVIKGVSNEKQVAFALAGGDVYYFELDTTSFQGNLIEITKDQ 572

Query: 476  LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSV-L 534
            ++ EI  L++ PI E     +   V + +D ++R+ SL   +   +  +  + +P     
Sbjct: 573  MDNEIKALELGPIEEGRQRCKFLCVAL-SDQTIRLLSLEPESCFERGAM--QALPSEAES 629

Query: 535  LCAFEGIS------------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRT 582
            LC  E  S            +L   L +G L+   ++  +G L+D +   LGT+P+    
Sbjct: 630  LCMMEMASEQEGQQSFTKQLFLFIGLNNGLLMRTSVDQLSGGLSDTRTRYLGTKPVKCLR 689

Query: 583  FSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELT 642
             ++     + A S R  + Y+++ ++    ++   + +   F          +AKE +  
Sbjct: 690  INANQQQAMLALSSRSWLCYNNSGRIFMQPLSYDYLDYASAF----------LAKEFQGI 739

Query: 643  IGTIDDIQKLHI----------RSIPLGEHPRRICHQEQSRTFAICSLKNQSC------- 685
            +GT     ++ +          +S+ L   PR++   E S+   I    N+S        
Sbjct: 740  VGTNQSTLRIIMPERFGEIFNQQSLDLTYSPRKMIFHEPSKAIFIIESDNRSYNNVEQKV 799

Query: 686  -----------------AEESE-MHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDD 727
                             AE+ +    +R++D    E I+ +     ++ CSI    F+  
Sbjct: 800  QEVYQTQEIPEQWNQVQAEQYKWASLIRIVDATKLETINVHQFYENQHACSICYIQFAGY 859

Query: 728  SNVYYCVGTAYVLPEENEPTK----GRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKL 783
               Y CVGT   L   NEPT+    G I  FI ++  L+L    +     Y+L A+ G+L
Sbjct: 860  PEQYVCVGTVKDLV--NEPTRKFSQGFIHTFIYDNKTLKLKHSTQIDEIPYALAAWRGRL 917

Query: 784  LAAINQKIQLYKW----MLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISL 839
            L      +++Y+     +L+    + L S            +  + D I V ++  SI L
Sbjct: 918  LVGAGCNLRVYEMGNQRILKKAEIKNLNS--------FITSIMVKEDRIYVAEVSDSIHL 969

Query: 840  LIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR-------------- 885
            L Y   +    E A D    +++A  +LD    +  +   N+F  R              
Sbjct: 970  LRYNIRDQTFMELADDILPRYVTASTVLDYHTVIAGDKFENIFVSRVPLDIDEEQEEHPY 1029

Query: 886  ----KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNG 941
                K  +G  +    +++ +  +++GE +   +  +LV     S+V     V++GT  G
Sbjct: 1030 EYKMKMDQGCMNGAPFKMDQICNFYVGEVITSLQKIALVS--TSSEV-----VVYGTSMG 1082

Query: 942  VIGVIASLPHEQYL-FLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLI 1000
             I  +    +++ + F   L+  LR   + + G +H Q+RS     K++     +DGDL 
Sbjct: 1083 SIAALYPFDNKEDIDFFLHLEMYLRVEHQPLSGRDHMQFRSAYGPCKSI-----IDGDLC 1137

Query: 1001 ESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            E F ++   +   +++  + +  ++ K++E++
Sbjct: 1138 EQFGNMQYNKQRTVAEEFDRTPADIIKKLEDI 1169


>gi|260819270|ref|XP_002604960.1| hypothetical protein BRAFLDRAFT_126697 [Branchiostoma floridae]
 gi|229290289|gb|EEN60970.1| hypothetical protein BRAFLDRAFT_126697 [Branchiostoma floridae]
          Length = 1049

 Score =  177 bits (448), Expect = 4e-41,   Method: Compositional matrix adjust.
 Identities = 233/1010 (23%), Positives = 409/1010 (40%), Gaps = 162/1010 (16%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            +L V I+G I +L  FR  G  +D++   T+  +  +L++    ++              
Sbjct: 53   LLSVEIFGVIRSLMSFRLTGGTKDYIVCGTDSGRIVILEYLPAKNKFDKVHQETFGKSGC 112

Query: 61   RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL- 117
            R    GQ   +DP  R  +IG      L  ++  D   +L  +  +   +   L    + 
Sbjct: 113  RRIVPGQYLAVDPKGRALMIGAVEKQKLVYILNRDAAARLTISSPLEAHKSNTLCYHLVG 172

Query: 118  --YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNGA 164
               G   P    L  D +DA +  T E +   +    F E     N+        L+   
Sbjct: 173  VDIGFENPMFACLEIDYEDADNDPTGESSQSTQQCLTFYELDLGLNHVVRKYSEPLEEHG 232

Query: 165  DLLIPVP---PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR------ 215
            + LI VP       GVLI  E  I Y   N      IR  I +    +D D  R      
Sbjct: 233  NFLIAVPGGSEGPSGVLICSENYITY--KNFGDQPDIRSPIPRRRNDLD-DPERGMIFVC 289

Query: 216  -----------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYI 264
                       +L     G +  + +  +++ VT +K++      +A+++  L    +++
Sbjct: 290  SATHKTKSMFFFLAQTEQGDIFKITLETDEDMVTEIKLKYFDTVPVATSMCVLKTGFLFV 349

Query: 265  GSSYGDSQLIKL-------------NLQPDAKGSYVEVLERYV----------NLGPIVD 301
             S +G+  L ++             +  P  +G       R +          +L PI+ 
Sbjct: 350  ASEFGNHYLYQIAHLGDDDEEPEFSSAMPLEEGDTFLFTPRGLKNLVLVDEMDSLSPIMS 409

Query: 302  FCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTDDP 360
              + DL  +   Q+   +G     S R++R+G+ ++E A  EL G    +W+++ + DD 
Sbjct: 410  CQIADLANEDTPQLYVANGRGPRSSFRVLRHGLEVSEMAVSELPGNPNAVWTVKKNVDDE 469

Query: 361  FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLV 420
            FD ++VVSF++ T +L++   + +EE    GF   T TL C     + L+Q+    +R +
Sbjct: 470  FDAYIVVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSLIGEDALLQIYPEGIRHI 527

Query: 421  SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKHAQLEY 478
             S  R   NEW++P   ++     N  QV++A  GG LVY E+   G L E  +  ++  
Sbjct: 528  RSDKR--VNEWRTPGKKTIVKCAVNQRQVVIALTGGELVYFEMDPTGQLNEYTERKEMSA 585

Query: 479  EISCLDIN--PIGENPSYSQIAAVGMWTDISVRIFSLPDLNLI----------TKEHL-- 524
            E+ C+ +   P+GE     +  AVG+ TD +VRI SL   + +          T E L  
Sbjct: 586  EVICMALANVPVGEQ--RCRFLAVGL-TDDTVRIISLDPADCLQPLSMQALPATPESLCI 642

Query: 525  ----GGEII----PRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQ 576
                GGE I    PR+ +L       +L   L +G LL  +L+  TG+L+D +   LG++
Sbjct: 643  LQMGGGEGIDDTGPRNTML-------FLNIGLQNGVLLRTVLDQITGDLSDTRTRYLGSR 695

Query: 577  PITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIA 636
            P+ L     +    V A S R  + Y+   +   + ++ + + +   F S   P+ +   
Sbjct: 696  PVKLFQVKMQGNEAVLAMSSRSWLSYTYQNRFHLTPLSYETLEYASGFASEQCPEGIVAI 755

Query: 637  KEGELTIGTIDDIQKLHIR-SIPLGEHPRRICHQEQSR------------TFAICSLKNQ 683
                L I  ++ +  +  + S PL   PR+     QS             T A  + + Q
Sbjct: 756  SANTLRILALEKLGAVFNQISTPLMYTPRKFVIDTQSNNIIMIETDHNSYTEATKAQRKQ 815

Query: 684  SCAEE-------------SEM-------------------------HFVRLLDDQTFEFI 705
              AEE             +EM                           +R+L+      +
Sbjct: 816  QMAEEMVEAAGEEEKELAAEMAAAFINENLPEATFGSPKAGPGMWASVIRVLNPIQGNTL 875

Query: 706  STYPLDTFEYGCSILSCSFSD-DSNVYYCVGTAYVLPEENEPTKGRIL--VFIVEDG-KL 761
                L+  E   SI  C F++   + +  VGTA  +        G  L    +V +G KL
Sbjct: 876  DLIQLEQNEAAFSIAICKFANRGEDTFVVVGTAKDMTMNPRTCSGGFLHTYQLVNNGEKL 935

Query: 762  QLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSEC-GHHGHILALY 820
            +LI +   +    ++++F G++L  I + +++Y     D G ++L  +C   H     + 
Sbjct: 936  ELIHKTPVEDVPGAISSFQGRILVGIGRLLRIY-----DLGKKKLLRKCENKHVPNFIMS 990

Query: 821  VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDD 870
            + T G  I V D+ +S   + YK +E  +   + D N  W++    LD D
Sbjct: 991  IHTMGHRIFVSDVQESFHFIRYKRQENQLIIFSDDTNYRWITCSTQLDYD 1040


>gi|118400837|ref|XP_001032740.1| CPSF A subunit region family protein [Tetrahymena thermophila]
 gi|89287084|gb|EAR85077.1| CPSF A subunit region family protein [Tetrahymena thermophila SB210]
          Length = 1197

 Score =  177 bits (448), Expect = 4e-41,   Method: Compositional matrix adjust.
 Identities = 280/1177 (23%), Positives = 478/1177 (40%), Gaps = 194/1177 (16%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
            ++G I T+ +FR HG+ QD++ + ++  +  +L+++ E  E   +   +   + G R   
Sbjct: 60   VFGIIRTINIFRLHGQTQDYIIVGSDSGRIVILKFNKEKGEF-EKVHQETYGKTGCRRIV 118

Query: 65   NGQIGIIDPD---CRLIGLHLYDGLFKVIPFDNKGQLK-----EA------------FNI 104
             GQ   +DP    C    +     ++ ++  DN+ +L      EA             ++
Sbjct: 119  PGQYIAVDPKGRACMFAAIEKQKFVY-ILNRDNEDKLTISSPLEAHKSHTLCYDIIGIDV 177

Query: 105  RLEELQVLDIKFLYGCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGA 164
              E  Q   I+  YG  +     ++   +  + +  YE+       V+   S   +   A
Sbjct: 178  GYENSQFACIECDYGEVEQKDSAVH-TGQIQKQLTIYELDFGLNHVVKK--SSEPIPETA 234

Query: 165  DLLIPVP-----PPLCGVLIIGEETIVYCSANA-FKA-IPIRPSITK----AYGRV---- 209
             +L+PVP     P   GVLI  E+ ++Y   N   K   P R S +K    ++G +    
Sbjct: 235  HMLLPVPGLPDGPG--GVLIACEDMLIYKGNNQELKCRYPQRFSTSKIMINSFGFLKQKS 292

Query: 210  -DADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSY 268
             + + S +LL    G L  L I    +KV GL ++       A +I       ++     
Sbjct: 293  KNQNLSFFLLQTELGDLFKLQIKSTDDKVHGLTLQYFDSIPTAISICITKTGHLFAACEK 352

Query: 269  GDSQLIKLNL------QPDAKGSY---------------VEVLERYVNLGPIVDFCVVDL 307
            GD  L +         +P    S                +E ++   NL  I D  V DL
Sbjct: 353  GDHLLYRFKSLGEQEERPTKTTSADFEQQAIFIPRKLVNLEQVDSLDNLSAISDIKVNDL 412

Query: 308  ERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQ-GIKGMWSLRSSTDDPFDTFLV 366
              +G  Q+     A    +LRI+R+G+ ++E A+  L     G+W+++   D+    ++V
Sbjct: 413  TGEGNPQIYALCAAGSRSTLRILRHGLQVSEYATSRLPLRPNGIWTIKQRHDEGLTKYIV 472

Query: 367  VSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRE 426
            +S   +T +L+  ++D +         + +QTL       N ++QVT  S R +      
Sbjct: 473  LSSSKKTYVLS--IKDTISAVNDSSLDTNSQTLHAGILENNCIIQVTPESFRQIRDNRVT 530

Query: 427  LRNEWKSPPGYSVNVATANASQVLLA------TGGGHLVYLE--IGDGILTEVKHAQLEY 478
            L   +K+     V  A +N+ Q+ LA        GG ++Y E  +G  +    + A+L  
Sbjct: 531  L---YKTESNKFVK-ACSNSRQIALALQGGQGQPGGDIIYFEFDLGGQLKRIEEKAKLTS 586

Query: 479  EISCLDIN--PIGENPSYSQIAAVGMWTDISVRIFSLPD---LNLITKEHLGGEIIPRSV 533
            +I C+DI   PIG  P + +  AVG   D +VRI SL     L  ++ + L G  IP SV
Sbjct: 587  DIECIDIGEVPIGR-PKF-KFLAVG-CKDQTVRILSLEQDQYLQRLSLQALPG--IPESV 641

Query: 534  LLC-----------AFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRT 582
             L            A E   YL   L +G LL   ++  TG L+D +   L   PI    
Sbjct: 642  SLIEMKRGTGTEQEAEEYQLYLFVGLQNGILLRATVDQVTGSLSDVRTRVLSAAPIRTCK 701

Query: 583  FSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCP--FNSAAFPDSLAIAKEGE 640
            +  +    + A S RP + Y+   K     ++   + +  P  FN         ++  GE
Sbjct: 702  YVVQGQPALLALSTRPWICYTYLGKYQMVPISYDMLEYAAPAVFNMQGEQKQCIVSTSGE 761

Query: 641  LTIGTIDDIQK----LHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHF--- 693
                 I + QK    L+ + I L   PR++   E+S    +    N+   E  +      
Sbjct: 762  SV--RIIEPQKYGDLLNQQVIKLRYSPRKMAIHERSHNIVLIESDNKVYNESQKKQLIEQ 819

Query: 694  ---------------------------VRLLDDQTFEFISTYPLDTFEYGCSILSCSFSD 726
                                       VR++D  T + ++ Y L+  E+  S+   +F +
Sbjct: 820  YYSNQNKGDLATQVDLLTAPSGNWASCVRIVDPVTRKTLNIYELENNEHALSMCMVNFDN 879

Query: 727  DSNVYYCVGTAYVLPEENEPTKGRILVFI------VEDGKLQLIAEKETKGAVYSLNAFN 780
                Y CVGT  V   +  P +   + +I       +   L+L+ + E      +L+A  
Sbjct: 880  KDETYVCVGT--VKDYQVHPNRNFSVCYINTFQLNEKLNTLELLHKTEIFEIPGALHAHK 937

Query: 781  GKLLAAINQKIQLYKW----MLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKS 836
            GKLLA     ++ Y+     +L+    + LQS            +QT GD I V  +  S
Sbjct: 938  GKLLAGCGTFLRYYELGKKKLLKKAEIKGLQSPVNG--------IQTFGDRIYVSMVADS 989

Query: 837  ISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEER 896
            + +L Y+ ++    E   D    WMS+ ++LD   Y+G +   N F  R  S    + E 
Sbjct: 990  VHVLKYRSKDQTFYEVCDDILPRWMSSFQVLDYHTYIGGDKFENAFVCRVPSNADEEMEE 1049

Query: 897  G------------------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGT 938
                               + E +  ++ GE +   +  SLV     S   +I  V++GT
Sbjct: 1050 NPMAYKLRWETGYLNGAPFKTEQICHFYAGEVITTLQKASLV-----STGNEI--VLYGT 1102

Query: 939  VNGVIGVIASLP---HEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFL 995
              G IG  A LP    E   F   L+  LR  +  + G +H  +RSF    K+V     +
Sbjct: 1103 SMGSIG--AYLPFQTKEDIDFFIHLEMYLRLDVLPLAGRDHVMFRSFYGPVKSV-----I 1155

Query: 996  DGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            DGDL E F+ LS  +   ++  M+    E+ K+++E+
Sbjct: 1156 DGDLCEQFIKLSSGKQKVLADEMDRHPHEVHKKLDEI 1192


>gi|156406895|ref|XP_001641280.1| predicted protein [Nematostella vectensis]
 gi|156228418|gb|EDO49217.1| predicted protein [Nematostella vectensis]
          Length = 1171

 Score =  174 bits (440), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 260/1183 (21%), Positives = 475/1183 (40%), Gaps = 221/1183 (18%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            +L + ++G +  L  FR  G ++DF+ + ++  +  +L++   S  +  +   +   + G
Sbjct: 53   VLAIEMFGIVRDLIAFRLTGGSKDFIVVGSDSGRIVILEY-IPSKNVFEKVHQETFGKSG 111

Query: 61   -RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
             R    GQ    DP  R  +IG      L  ++  D+  +L  +  +   +   L    +
Sbjct: 112  CRRIVPGQYLAADPKGRAIMIGAIEKQKLVYIMNRDSAARLTISSPLEAHKSHTLVYHMV 171

Query: 118  ---YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNG 163
                G   P    L  D ++A    T E A   +    F E     N+        L+  
Sbjct: 172  GVDAGFENPMFACLEMDYEEADTDHTGEAAQTTQQTLTFYELDLGLNHVVRKYSEPLEEH 231

Query: 164  ADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR--- 215
             +LLI VP     P   GVL+  E  + Y   N      IR  I +    +D D  R   
Sbjct: 232  GNLLIAVPGGSDGP--SGVLVCSENYLTY--KNFGDQPDIRMPIPRRKYDLD-DPERGML 286

Query: 216  --------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAV 261
                          +L+    G +  + +  +++ VT ++++       A ++  L    
Sbjct: 287  FICSATHKTKSMFFFLVQTEQGDIFKITLETDEDMVTEIRMKYFDTVPPAVSMCVLKTGF 346

Query: 262  VYIGSSYGDSQLIKLN------------------------LQPDAKGSYVEVLERYVNLG 297
            ++  S +G+  L ++                          QP    + V +++   +L 
Sbjct: 347  LFTASEFGNHYLYQIAHLGDDDDELEFSSTMELEEGTTFFFQPRGLKNLV-LVDELESLA 405

Query: 298  PIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSS 356
            PI++  + DL  +   Q+    G     SLR++R+G+ ++E A  EL G    +W+++ +
Sbjct: 406  PIMNCQIADLANEDTPQLFAACGRGSRSSLRVLRHGLEVSEMAVSELPGNPNAVWTVKHT 465

Query: 357  TDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGS 416
            T D FD ++VVSFI+ T +L++   + +EE    GF   T TL C       LVQ+    
Sbjct: 466  TADEFDAYIVVSFINATLVLSIG--ETVEEVTDSGFLGTTPTLSCSQLGEEALVQIYPDG 523

Query: 417  VRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTE-VKHA 474
            +R + +  R   NEWK+P   ++  +  N  QV++A  GG +VY E+   G L E  +  
Sbjct: 524  IRHIRADKR--VNEWKAPGKKTIVKSAVNERQVVIALTGGEVVYFEMDSSGQLNEYTERK 581

Query: 475  QLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGG-EIIPRSV 533
            ++  ++ C+ +  I      S+  A+G+ +D +VRI SL   + +T   +     +P S 
Sbjct: 582  EMSADVQCMALGTILHGELRSRFLAIGL-SDNTVRIISLDPQDCLTPLSMQALPAMPES- 639

Query: 534  LLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFA 593
             LC  E           G +    LN+   E      + L  +P             V A
Sbjct: 640  -LCIVEMGGKEGAESEGGSMGGLFLNIGLSE------ICLTLEP-----------GEVLA 681

Query: 594  ASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLH 653
             S R  + Y+   ++  + ++ + + +   F+S   P+ +       L I  ++ +  + 
Sbjct: 682  MSSRSWLSYTHQSRVHLTPLSYETLEYASGFSSEQCPEGIVAIAGNTLRILALEKLGAVF 741

Query: 654  IR-SIPLGEHPRRICHQEQSRTFAICSL------------KNQSCAEE------------ 688
             + + PL   PR+      + T  +               + Q  AEE            
Sbjct: 742  NQVATPLKYTPRKFVIHSPTNTLVVIETDHNTYTDAAREERKQQMAEEMVEMASEEEKEL 801

Query: 689  -------------SEMHF-------------VRLLDDQTFEFISTYPLDTFEYGCSILSC 722
                         SE  F             +R+LD    E      LD  E   S+  C
Sbjct: 802  AAQAAAEFLSEELSEQQFGSPKAGTGMWASVIRVLDPIKGETHDVVQLDQNEAAFSVCVC 861

Query: 723  SFS---DDSNVYYCVGTAYVLP-EENEPTKGRILVF-IVEDG----KLQLIAEKETKGAV 773
            SFS   DD+  +  VGTA  +       + G +  + +V+      KL+   + E     
Sbjct: 862  SFSARPDDT--FVIVGTAKDMALSPRSCSTGYLYAYRLVQQPSGGIKLEFQHKTEVDDVP 919

Query: 774  YSLNAFNGKLLAAINQKIQLY----KWMLRDDGTRELQSECGHHGHILALYVQTRGDFIV 829
             +L  F G+LLA I + +++Y    K MLR    ++L +            + + G  IV
Sbjct: 920  AALEPFQGRLLAGIGRLLRVYDIGKKKMLRKCENKKLPN--------FICNINSMGTRIV 971

Query: 830  VGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSE 889
            V D+ +S   + YK  +  +   A D N  W++    LD D   GA+   N+FTVR  S 
Sbjct: 972  VSDIQESFHFVKYKPRDNQLVVFADDVNPRWLTCCCYLDYDTLAGADKFGNIFTVRLPS- 1030

Query: 890  GATDE------------ERG-------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQ 930
            G++D+            +RG       +LE +  Y +GE V   +  +L+        G 
Sbjct: 1031 GSSDDVDEDPTGTKAFWDRGLLNGAQQKLETMCNYFVGETVLSLQKATLI-------PGG 1083

Query: 931  IPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTV 989
              +++F T++G +G++      E   F + L+ +LR                  +E+ ++
Sbjct: 1084 SESLVFTTLSGGVGMLVPFTSREDIDFFQHLEMHLR------------------SEQPSL 1125

Query: 990  DAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
               + +DGDL E +  L  ++   I++ ++ +  E+ K++E++
Sbjct: 1126 ---SVVDGDLCEHYNSLDYSKRRTIAEELDRTPAEVSKKLEDI 1165


>gi|154295205|ref|XP_001548039.1| pre-mRNA splicing factor 3b [Botryotinia fuckeliana B05.10]
          Length = 1020

 Score =  173 bits (439), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 224/947 (23%), Positives = 392/947 (41%), Gaps = 137/947 (14%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
           ++G I  +  FR  G  +D++ I ++  +  ++++    ++     +        R    
Sbjct: 64  VFGIIRAIAAFRLAGSNKDYIIITSDSGRITIVEFVPAQNKFNRLHLETFGKSGVRRVVP 123

Query: 66  GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YGC 120
           GQ   +DP  R  L      + L  V+  +++ +L  +  +   + Q L    +    G 
Sbjct: 124 GQYLAVDPKGRACLTASVEKNKLVYVLNRNSQAELTISSPLEAHKAQTLVFALVALDVGY 183

Query: 121 AKPTIVVLYQDNKDARHVKT-------------YEVALKDKDFVEGPWSQNNLDNGADLL 167
           A P    L  D  ++    T             YE+ L     V   WS+  +D  A++L
Sbjct: 184 ANPVFAALEIDYGESDQDPTGQAYDEIEKQLVYYELDLGLNHVVRK-WSEP-VDRTANIL 241

Query: 168 IPVPPPL---CGVLIIGEETIVYCSAN--AFK-AIPIRPSITKAYGRVD--ADGSRYLLG 219
             VP       GVL+ GE+ I Y  +N  AF+ AIP R   T+   R      G  + L 
Sbjct: 242 FQVPGGTDGPSGVLVCGEDNITYRHSNQEAFRVAIPRRRGATEDPQRKRNIVAGVMHKLK 301

Query: 220 DHAGLLHLLVITHEKE------------------KVTGLKIELLGETSIASTISYLDNAV 261
             AG    L+ T + +                  +V  LKI+      +A+++  L +  
Sbjct: 302 GAAGAFFFLLQTDDGDLFKITIEMVEDDNGQPTGEVRRLKIKYFDTVPVATSLCILKSGF 361

Query: 262 VYIGSSYGDSQLIKL---------------NLQPDAKGSYVEV------------LERYV 294
           +++ S +G+ Q  +                +    A  SY  +            +E   
Sbjct: 362 LFVASEFGNHQFYQFEKLGDDDEETEFVSDDFPTGAHESYTPIYFHPRPAENLSLVESID 421

Query: 295 NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSL 353
           ++ P++D  V +L  +   Q+ +  G     + R +++G+ ++E    EL G+   +W+ 
Sbjct: 422 SMNPLMDCKVANLTDEDAPQIYSICGTGARSTFRTLKHGLEVSEIVESELPGVPSAVWTT 481

Query: 354 RSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVT 413
           + +  D +D ++++SF + T +L++   + +EE    GF S   TL       + L+QV 
Sbjct: 482 KLTRGDTYDAYIILSFSNGTLVLSIG--ETVEEVTDTGFLSSAPTLAVQQLGEDSLIQVH 539

Query: 414 SGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTEV- 471
              +R + +  R   NEW +P   S+  AT N  QV +A   G +VY E+  DG L E  
Sbjct: 540 PKGIRHIRADHR--VNEWAAPQHRSIVAATTNERQVAVALSSGEIVYFEMDSDGSLAEYD 597

Query: 472 KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIP 530
           +  ++   ++CL +  + E    SQ  AVG   D +VRI SL PD  L  K        P
Sbjct: 598 EKKEMSGTVTCLSLGEVPEGRQRSQFLAVGC-DDSTVRILSLDPDSTLENKSVQALTSPP 656

Query: 531 RSVLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFS 584
            ++ + A    S      YL   L  G  L  +L+  TGEL+D +   LG +P+ L   +
Sbjct: 657 NALSIMAMSDSSSGGSTLYLHIGLYSGVYLRTVLDEVTGELSDTRTRFLGPKPVKLFRVA 716

Query: 585 SKNTTHVFAASDRPTVIYSS--NKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELT 642
            +  T V A S RP + YS    K  + + ++   +     F+S    + +   +   L 
Sbjct: 717 VQGQTAVLALSSRPWLGYSDPVTKGFMLTPLDYPALEWGWNFSSEQCTEGMVGIQGQNLR 776

Query: 643 IGTIDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQT 701
           I +I+ +   L   SIPL   PRR     +   F +    N   +  ++    +LL+D +
Sbjct: 777 IFSIEKLTDNLLQESIPLTYTPRRFVRHPEHACFYVIEADNNILSPATKQ---KLLEDPS 833

Query: 702 F-----------EF------------------------ISTYPLDTFEYGCSILSCSF-S 725
                       EF                        +ST  L+  E   SI   +F S
Sbjct: 834 VTNGDATVLPPEEFGYPRGTNHWASCISVVDPVTEKKVLSTIHLEDNECAVSIAVVAFAS 893

Query: 726 DDSNVYYCVGTAY-VLPEENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVYSLNAFNGK 782
            +   + CVGT   ++      + G I V+   EDGK L+ I + + +    +L AF G+
Sbjct: 894 QEDETFLCVGTGKDMVVSPRSFSAGFIHVYRFHEDGKELEFIHKTKVEEPPMALLAFQGR 953

Query: 783 LLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIV 829
           LLA + + +++Y   +R    R+ QSE   +   + + +QT+G  I+
Sbjct: 954 LLAGVGKDLRIYDLGMR-QLLRKAQSEVAPN---MIVGLQTQGSRII 996


>gi|225448823|ref|XP_002282354.1| PREDICTED: splicing factor 3B subunit 3-like [Vitis vinifera]
          Length = 1214

 Score =  172 bits (437), Expect = 7e-40,   Method: Compositional matrix adjust.
 Identities = 176/754 (23%), Positives = 329/754 (43%), Gaps = 88/754 (11%)

Query: 1   MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
           +L V I+G I +L  FR  G  +D++ + ++  +  +L+++ E + +  +   +   + G
Sbjct: 52  ILSVEIFGAIRSLAQFRLTGSQKDYIVVGSDSGRIVILEYNKEKN-VFDKIHQETFGKSG 110

Query: 61  -RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
            R    GQ   IDP  R  +IG      L  V+  D   +L  +  +   +   +     
Sbjct: 111 CRRIVPGQYLAIDPKGRAVMIGACEKQKLVYVLNRDTVARLTISSPLEAHKSHTIVYSIT 170

Query: 118 ---YGCAKPTIVVLYQDNKDA-------------RHVKTYEVALKDKDFVEGPWSQNNLD 161
               G   P    +  D  +A             +H+  YE+ L   + V   WS+  +D
Sbjct: 171 GVDCGFDNPIFAAIELDYSEADQDSTGQAASEAQKHLTFYELDLG-LNHVSRKWSEQ-VD 228

Query: 162 NGADLLIPVP---PPLCGVLIIGEETIVYCS---ANAFKAIPIRPSITKAYGR--VDADG 213
           NGA++L+ VP       GVL+  E  ++Y +    +    IP R  +    G   V A  
Sbjct: 229 NGANMLVTVPGGGDGPSGVLVCAENFVIYKNQGHPDVRAVIPRRADLPAERGVLIVSAAT 288

Query: 214 SR------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSS 267
            R      +LL    G +  + + HE ++++ LKI+      + S++  L +  ++  S 
Sbjct: 289 HRQKSMFFFLLQTEYGDVFKVTLEHENDRISELKIKYFDTIPVTSSMCVLKSGFLFAASE 348

Query: 268 YGDSQLIKLN--------------------------LQPDAKGSYVEVLERYVNLGPIVD 301
           +G+  L +                             QP    + V + ++  +L PI+D
Sbjct: 349 FGNHGLYQFQAIGDDADVESSSASLMETEEGFQPVFFQPRGLKNLVRI-DQVESLMPIMD 407

Query: 302 FCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSLRSSTDDP 360
             V +L  +   Q+    G     S+RI+R G+ I+E A  +L G+   +W+++ + +D 
Sbjct: 408 MKVSNLFEEETPQIFALCGRGPRSSIRILRPGLAISEMAVSQLPGVPSAVWTVKKNVNDE 467

Query: 361 FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLV 420
           FD ++VVSF + T +L++   + +EE    GF   T +L       + L+QV    +R +
Sbjct: 468 FDAYIVVSFANATLVLSIG--ETVEEVSDSGFLDTTPSLAVSLIGDDSLMQVHPSGIRHI 525

Query: 421 SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTEVKHAQLEYE 479
               R   NEW++P   ++    +N  QV++A  GG L+Y E+   G L EV+  ++  +
Sbjct: 526 REDGR--INEWRTPGKRTIVKVGSNRLQVVIALSGGELIYFEVDMTGQLMEVEKHEMSGD 583

Query: 480 ISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PD--LNLITKEHLGGEIIPRSVLLC 536
           ++CLDI P+ E    S+  AVG + D ++RI SL PD  + +++ + +     P S+L  
Sbjct: 584 VACLDIAPVPEGRQRSRFLAVGSY-DNTIRILSLDPDDCMQILSVQSVSSP--PESLLFL 640

Query: 537 AFEGIS------------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFS 584
             +               +L   L +G L   +++M TG+L+D +   LG +   L +  
Sbjct: 641 EVQASVGGEDGADHPASLFLNAGLQNGVLFRTVVDMVTGQLSDARSRFLGLRAPKLFSVI 700

Query: 585 SKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIG 644
            +    +   S RP + Y      L + ++ + +     F+S    + +       L + 
Sbjct: 701 VRGRRAMLCLSSRPWLGYIHQGHFLLTPLSYETLEFAASFSSDQCAEGVVAVAGDALRVF 760

Query: 645 TIDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAI 677
           TI+ + +  +   IPL   PR+   Q + +   +
Sbjct: 761 TIERLGETFNETVIPLRYTPRKFVLQPKRKLLVV 794



 Score = 97.1 bits (240), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 100/370 (27%), Positives = 168/370 (45%), Gaps = 54/370 (14%)

Query: 694  VRLLDDQTFEFISTYPLDTFEYGCSILSCSFSD-DSNVYYCVGTAYVLP--EENEPTKGR 750
            +R+LD +T        L   E   SI + +F D +      VGTA  L    +     G 
Sbjct: 863  IRILDPRTATTTCLLELQDNEAAFSICTVNFHDKEYGTLLAVGTAKSLQFWPKRSFDAGY 922

Query: 751  ILVF-IVEDGK-LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQS 808
            I ++  +EDGK L+L+ + + +G   +L  F G+LLA I   ++LY     D G R L  
Sbjct: 923  IHIYRFLEDGKSLELLHKTQVEGVPLALCQFQGRLLAGIGSVLRLY-----DLGKRRLLR 977

Query: 809  ECGHH---GHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVE 865
            +C +      I++++  T  D I VGD+ +S     Y+ +E  +   A D    W++A  
Sbjct: 978  KCENKLFPNTIVSIH--TYRDRIYVGDIQESFHYCKYRRDENQLYIFADDSVPRWLTASY 1035

Query: 866  ILDDDIYLGAENNFNLFTVR------------------KNSEGATDEERGRLEVVGEYHL 907
             +D D   GA+   N++ VR                  K  +G  +    ++E + ++H+
Sbjct: 1036 HIDFDTMAGADKFGNIYFVRLPQDVSDEVEEDPTGGKIKWEQGKLNGAPNKVEEIVQFHV 1095

Query: 908  GEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIG-VIASLPHEQYLFLEKLQTNLRK 966
            G+ V   +  SL+        G    +I+GTV G +G ++A    +   F   L+ ++R+
Sbjct: 1096 GDVVTCLQKASLI-------PGGGECIIYGTVMGSLGALLAFTSRDDVDFFSHLEMHMRQ 1148

Query: 967  VIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESF----LDLSRTRMDEISKTMNVSV 1022
                + G +H  +RS          K+ +DGDL E F    LDL R   DE+ +T     
Sbjct: 1149 EHPPLCGRDHMAYRS-----AYFPVKDVIDGDLCEQFPTLPLDLQRKIADELDRTPG--- 1200

Query: 1023 EELCKRVEEL 1032
             E+ K++EE+
Sbjct: 1201 -EILKKLEEV 1209


>gi|145549784|ref|XP_001460571.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
 gi|124428401|emb|CAK93174.1| unnamed protein product [Paramecium tetraurelia]
          Length = 1178

 Score =  172 bits (436), Expect = 9e-40,   Method: Compositional matrix adjust.
 Identities = 201/984 (20%), Positives = 404/984 (41%), Gaps = 162/984 (16%)

Query: 166  LLIP-VPPPLCGVLIIGEETIVY-CSANAFKAIPI---------RPSITKAYGRVDADGS 214
            L++P  P    GV+++ E+ + Y    N    +P          R ++  ++G      S
Sbjct: 235  LMVPNAPDGPGGVIVLCEDFLHYRGPKNQEMKVPYPKNQGMPADRGAMISSFGFYKQKSS 294

Query: 215  -RYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQL 273
              YLL    G L  L +   K++V  +++       +A+++  + +  ++     G+   
Sbjct: 295  FLYLLQSEYGDLFELSLQFTKDEVHSIRMIYFDTIPVANSLCLMRSKHLFAACEKGNHCF 354

Query: 274  IKLNLQPDA---------------------KGSYVEVLERYVNLGPIVDFCVVDLERQGQ 312
             K   +  +                     K  ++ +L+   N   I D  V DL ++G 
Sbjct: 355  YKYQREETSTNIICTDSSMALEEQIFFKPQKLKHLSLLQELNNFSCISDLKVADLAKEGN 414

Query: 313  GQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIK-GMWSLRSSTDDPFDTFLVVSFIS 371
             Q+  C  A    +LR++R+G+ I + A+  LQ    G+W+L+   +DP   ++V+S+I+
Sbjct: 415  PQIYICCAAGNRSTLRVLRHGLEITQLANTNLQAKPLGIWTLKERYEDPVHKYIVISYIN 474

Query: 372  ETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEW 431
            +T  L + + +++E+    G     QT+     I +  +Q+ +   R +           
Sbjct: 475  KT--LVLKIGEKVEQVHDTGLEGTKQTIHVGTLIDDSQIQILTNGYRHIRKN-------- 524

Query: 432  KSPPGYSVN----VATANASQVLLATGGGHLVYLEIG----DGILTEVKHAQLEYEISCL 483
            K P  Y ++       +N  QV  A  GG + Y E+      G L E+   Q++ EI  L
Sbjct: 525  KPPTDYIIDGKVIKGVSNEKQVAFALAGGDVYYFELDTTSVSGNLIEITKDQMDNEIKAL 584

Query: 484  DINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSV-LLCAFEGIS 542
            ++ PI E     +   V + +D ++R+ SL   +   +  +  + +P     LC  E  S
Sbjct: 585  ELGPIEEGRQRCKFLCVAL-SDQTIRLLSLEPESCFERGAM--QALPSEAESLCMMEMAS 641

Query: 543  ------------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTH 590
                        +L   L +G L+   ++  +G L+D +   LGT+P+     ++     
Sbjct: 642  EQEGQQSFTKQLFLFIGLNNGLLMRTSVDQLSGGLSDTRTRYLGTKPVKCLRITANQQQA 701

Query: 591  VFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQ 650
            + A S R  + Y+++ ++    ++   + +   F          +AKE +  +GT     
Sbjct: 702  MLALSSRSWLCYNNSGRIFMQPLSYDYLDYASAF----------LAKEFQGIVGTNQSTL 751

Query: 651  KLHI----------RSIPLGEHPRRICHQEQSRTFAICSLKNQSC--------------- 685
            ++ +          +S+ L   PR++   E S+   I    N+S                
Sbjct: 752  RIIMPERFGEIFNQQSLDLTYSPRKMIFHEPSKAIFIIESDNRSYNNIQQKVQEVYQTEE 811

Query: 686  ---------AEESE-MHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVG 735
                     AE+ +    +R++D    E ++ +     ++ CSI    F+     Y CVG
Sbjct: 812  LPEQWNQIQAEQYKWASLIRIVDATKLETVNVHQFYENQHACSICYIQFAGYPEQYICVG 871

Query: 736  TAYVLPEENEPTK----GRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKI 791
            T   L   NEP++    G I  F+ ++  L+L          Y+L A+ G+LL      +
Sbjct: 872  TVKDLV--NEPSRKFSQGFIHTFVYDNKTLKLKHSTPIDEIPYALAAWRGRLLVGAGCNL 929

Query: 792  QLYKW----MLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEG 847
            ++Y+     +L+    + L S            +  + D I V ++  SI LL Y   + 
Sbjct: 930  RVYEMGNQRILKKAEIKNLNS--------FITSIMVKEDRIYVAEVADSIHLLRYNIRDQ 981

Query: 848  AIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR------------------KNSE 889
               E A D    +++A  +LD    +  +   N+F  R                  K  +
Sbjct: 982  TFMELADDILPRYVTASTVLDYHTVIAGDKFENIFVSRVPLDIDEEQEEHPYEYKMKMDQ 1041

Query: 890  GATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASL 949
            G  +    +++ +  +++GE +   +  +LV     S+V     V++GT  G I  +   
Sbjct: 1042 GCMNGAPFKMDQICNFYVGEVITSLQKIALVS--TSSEV-----VVYGTSMGSIAALYPF 1094

Query: 950  PHEQYL-FLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSR 1008
             +++ + F   L+  LR   + + G +H Q+RS     K++     +DGDL + F ++  
Sbjct: 1095 DNKEDIDFFLHLEMYLRVEHQPLSGRDHMQFRSAYGPCKSI-----IDGDLCDQFGNMQY 1149

Query: 1009 TRMDEISKTMNVSVEELCKRVEEL 1032
             +   +++  + +  ++ K++E++
Sbjct: 1150 NKQRAVAEEFDRTPADIIKKLEDI 1173


>gi|342888540|gb|EGU87812.1| hypothetical protein FOXB_01669 [Fusarium oxysporum Fo5176]
          Length = 1408

 Score =  172 bits (436), Expect = 9e-40,   Method: Compositional matrix adjust.
 Identities = 199/829 (24%), Positives = 348/829 (41%), Gaps = 124/829 (14%)

Query: 177  VLIIGEETIVYCSANAFKAIPIRPSITKAYGRVD-------------------ADGSRYL 217
            VL+  EE+I Y   N     P+R  I +  G  +                   AD   +L
Sbjct: 592  VLVFAEESITYHHLNQ---DPLRVPIPRRRGATEDLPRKRTIIGGVMHKLKNTADAFFFL 648

Query: 218  LGDHAG-LLHLLVITHEKE-----KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDS 271
            L    G LL L    H+ E     +V  +KI+      +AS++  L    +++ + +G+ 
Sbjct: 649  LQTEDGDLLKLTFDMHDSEGSLAGEVRRIKIKYFDTVPVASSLCILKTGFLFVAAEFGNH 708

Query: 272  ---QLIKLNLQPDA-------------KGSYVEV------------LERYVNLGPIVDFC 303
               Q+ KL    DA             +G+Y  V            LE   +L P++D  
Sbjct: 709  HFYQIEKLGEDDDAEPVITSDDFSTGPRGAYQPVYFDPRPLKNLALLESIDSLSPLLDCE 768

Query: 304  VVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTDDPFD 362
            V D   +G  Q+ + SG        ++++G+ IN+ AS +L G + G+W+ R +  D FD
Sbjct: 769  VADPTGEGPPQIYSESGNGARSHFWMLKHGLEINKVASSDLPGTVSGVWTTRMTRHDKFD 828

Query: 363  TFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSS 422
            ++++++  S    + M++ DE+E+    GF +   TL       + +VQ+ S  +R + +
Sbjct: 829  SYIILT--SSDGTVVMSVGDEVEQVSDSGFLTTVTTLAIQQIGDDGIVQIHSKGIRHLRA 886

Query: 423  TSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTEVKHAQ-LEYEI 480
                  NEW +P   S+   T N  Q+ +A   G +VY E+  DG L E    + +   +
Sbjct: 887  GQI---NEWPAPQHRSIVAVTTNKRQIAIALSSGEIVYFEVDSDGSLAEYDETKVISTTV 943

Query: 481  SCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPRSVLLCAFE 539
            +CL + P+ E    S +  VG   D +VRI  L PD  L +        +P ++++ A E
Sbjct: 944  TCLSLGPVPEGRLRSPLLVVGC-EDCTVRILGLNPDSMLESNSIKTLTAVPSALIIMAME 1002

Query: 540  ---GISYLLC---ALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFA 593
                 S  LC    L  G  L   +N  TGEL+D +   LG + I L   + K  T V A
Sbjct: 1003 HPLTSSSGLCLHIGLSSGVYLRTGMNEITGELSDTQTRFLGLKAIKLFQVTVKGQTCVLA 1062

Query: 594  ASDRPTVIYSSNKK-LLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
               +P + Y   K+    + +  +E+     F+S    + +       L I +++++   
Sbjct: 1063 LGSKPWMGYIDPKRGFTMTPLECEELHWASSFSSEQCQEGIIAIHANFLHIFSVENLHNN 1122

Query: 653  HIR-SIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHF------------------ 693
             +R SIPL   PR          F    + N + + E                       
Sbjct: 1123 VVRKSIPLTYTPRHFVKHPAEPYFYTIEVDNNTLSPELRAQLLAVDDHGDDKVLPPELFG 1182

Query: 694  -----------VRLLDDQTFEFISTYPLDTFEYGCSILSCSF-SDDSNVYYCVGTAY-VL 740
                       + ++D    + +    L+  E   SI    F S D  ++  VGT   ++
Sbjct: 1183 YPRGNGRWASCISIIDPIGEQVLQRIDLEGNEAAISIAVAPFTSQDGEIFLLVGTGKDMI 1242

Query: 741  PEENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWML 798
                + + G I V+   ++G+ LQ I + + +    +L AF  +L   + + + +Y    
Sbjct: 1243 LNPRQSSGGYIHVYRFHQNGRELQFIHKTKVEEPPMALVAFRDRLAVGLGKDLCIY---- 1298

Query: 799  RDDGTRELQSECGHHGHI-----LALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
             D G ++L  +     HI     L + + T+GD IVVGD+ + ++++++ HEE  +    
Sbjct: 1299 -DLGLKQLLRKA----HIEAAPQLIVSLDTQGDRIVVGDVQQGMTMVMFNHEEQRLIPFV 1353

Query: 854  RDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDEERGRL 899
             D  A W +   ++D +  +G +   N++ VR   K S    D  R  L
Sbjct: 1354 DDIIARWTTCTTMVDYESVVGGDKFGNIWIVRCPKKTSHHVGDYARNYL 1402


>gi|167539942|ref|XP_001741428.1| DNA repair protein xp-E [Entamoeba dispar SAW760]
 gi|165894130|gb|EDR22214.1| DNA repair protein xp-E, putative [Entamoeba dispar SAW760]
          Length = 1004

 Score =  172 bits (436), Expect = 9e-40,   Method: Compositional matrix adjust.
 Identities = 241/1058 (22%), Positives = 472/1058 (44%), Gaps = 153/1058 (14%)

Query: 10   IATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDNGQIG 69
            I+ L  ++P  +  D+L I T+  +  +L+    +  +I+   G + D  GR    G   
Sbjct: 65   ISLLLKYKPLKDITDYLIIVTQSNEIEILKLTTSALHIISS--GSLKDTFGRKAFFGVKA 122

Query: 70   IIDPDCRLIGLHLYDGLFKVI--PFDNKGQLKEAFNIRLEELQVLDIKFLYGCAKPTIVV 127
             I P+ +++ L+LY+ L KV+  P   K QL  A NI++    +LD+ F        + +
Sbjct: 123  AISPNNQILLLNLYEQLIKVVLLPQYPKDQLT-ASNIKVNHSHILDMTFCLLNDSYQLAL 181

Query: 128  LYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVY 187
            L+++ +D RH+ TY +     +  +G ++Q N+ +    ++     + G+ +I +ET  Y
Sbjct: 182  LHENKRDIRHIDTYTLN-SFNELEKGSFNQPNVGSNTSRILGFKDGIIGIFVISDETGCY 240

Query: 188  CSANA---FKAIPIRPSITKAYGRVD----ADGSRYLLGDHAGLLHLLVITHEKEKVTGL 240
             + N       +   P+    + + D     D  R L      L   + I++E  +VT  
Sbjct: 241  FNGNGKHILCNLNQYPNTACCFIKKDVLIMTDSKRNL----DTLEFKVNISNEIIEVTKK 296

Query: 241  KIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIV 300
            +   L + SI+STIS L N +++IGS  GD+ ++ LN +         +LE++ N G ++
Sbjct: 297  QAPKLLQ-SISSTISDLSNQILFIGSLNGDNLIMNLNGK---------ILEKWSNFGSLM 346

Query: 301  DFCVVDLERQ--GQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTD 358
            D       RQ   +   +        GS+ ++  G GI E  + E+ GIK + S+  ++ 
Sbjct: 347  D------ARQISNREDYLVAGNGGGKGSIGLMIKGSGIEELGACEIDGIKSIESIEYNSK 400

Query: 359  DPFDTFLVVSFISETRILAMNLEDE---LEETEIEGFCSQTQTLFCHDAIYNQLVQVTSG 415
                 +++V F  E+ I  ++ + +   +EE+ ++    + + + C   I  +L+ V   
Sbjct: 401  ----KYVIVGFEEESNIWEIHQKSKHIKIEESLLKKIIGKERVICC-GIIKERLIFVCKK 455

Query: 416  SVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQ 475
             +  +    +  +N       +  +    N  Q +    G  L   +I + +  + K   
Sbjct: 456  GIYSIDKHQK--KNTLMKFDSFITHAKFIN--QQIYFIYGTEL--FKINEELKIQ-KMKD 508

Query: 476  LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLL 535
            LE  +SC D N         +I  +G+W D S+ IF   D+N + K+    + I RSVL+
Sbjct: 509  LEQNVSCFDAN---------EIIVIGLW-DGSILIF---DINGVLKKTEVVDTIGRSVLI 555

Query: 536  CAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSL---GTQPITLRTFSSKNTTHVF 592
               +   Y+ C   D  ++ FL +    +   + K+      +Q + L+   S       
Sbjct: 556  DNNK--IYVGC---DNEIIIFLYHQMEEDCKYKNKLKYQVEASQSVKLKKLYS------- 603

Query: 593  AASDRPTVIYSSNKKLLYSNVNL-----KEVSHMCPFNSAAFPDSLAIAKEGELTIGTID 647
                 P V+  +NK   +    L     +++  +C     A+   +  A +  +  G + 
Sbjct: 604  ----IPCVV--ANKTFTFQQTGLVPLAVEDLIDICESPVGAY--GIICATKRGIVFGLMK 655

Query: 648  DIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFIST 707
            ++ ++  + I  GE+  +I   + SR   +             +  + L+D        +
Sbjct: 656  EMSRVTFKMIHSGENCCKIA-TDGSRGLIV----------GKTIKSINLVD-------GS 697

Query: 708  YPLDTFEYGCSILS-CSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVE-DGKLQLIA 765
            + + T E   + L+ C  S + N+Y  VGTA +   E EP+ GRIL+   + +G + ++ 
Sbjct: 698  FGVTTVELKSNELALCVDSLEDNIY-AVGTAIIRENEIEPSSGRILLIRQDSEGLIYIVG 756

Query: 766  EKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRG 825
             ++  GAVY L  +   ++A IN+ + + +   +D  T++          ++ + +    
Sbjct: 757  TEDYDGAVYCLKKYQKGIVAFINRNVHVIEKKGKDLSTKQ-----NMLLPLIGVSLDICK 811

Query: 826  DFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI---YLGAENNFNLF 882
            D+I+ GDL +S+S+  Y+++   ++   +D    W S V  ++ +    +L A+ + N+ 
Sbjct: 812  DYIIAGDLARSVSVYRYRNDIEHLDIVGKDNQIVWSSCVGSIESEYGTSFLVADVSGNIK 871

Query: 883  TVRKNSEG-ATDEERGRLEVVGEYHLGEFVN----RFRHGSLVMRLPDSDVGQIPTVIFG 937
                N E   TD+E+  + ++ + H+ + +N     F  G L+                G
Sbjct: 872  IFNSNEEEPKTDDEK--INLISQVHIADSINFIQKSFYKGCLM----------------G 913

Query: 938  TVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDG 997
             V+G I  I  +  EQYLFL+K+Q+ L K            WR   N ++T    N +DG
Sbjct: 914  GVHGGIYNICEISKEQYLFLDKIQSKLVK----------PNWRESVNTQQTNPMMNCIDG 963

Query: 998  DLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRL 1035
            D IES L+ S  +   +++ + V  +E+ ++++ L  L
Sbjct: 964  DKIESVLEWSEKKQILLAQKIGVEYQEMIEKIQSLFSL 1001


>gi|353442065|gb|AER00318.1| xeroderma pigmentosum group E, partial [Hydra vulgaris]
          Length = 156

 Score =  172 bits (435), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 83/154 (53%), Positives = 110/154 (71%), Gaps = 1/154 (0%)

Query: 859  NWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGS 918
            NWM+AVEI+DDD +LGAEN+FNLF  +K++    DEER  L+ +G+YHLG+FVN F+HGS
Sbjct: 2    NWMTAVEIIDDDTFLGAENSFNLFICQKDNSSVNDEERHHLQTIGKYHLGDFVNVFKHGS 61

Query: 919  LVMRLPDSDVGQI-PTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHE 977
            LVM      +  I  ++++GTV G IG++A LP   + FL ++Q  L K IK VG + HE
Sbjct: 62   LVMHHSTEQLTPISSSILYGTVRGAIGLVAGLPKNTFDFLSQVQEKLSKTIKSVGKIEHE 121

Query: 978  QWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRM 1011
             WRSF N+KKT  A   +DGDLIES LDL+RT+M
Sbjct: 122  FWRSFYNDKKTDLAVGCVDGDLIESCLDLTRTQM 155


>gi|71004436|ref|XP_756884.1| hypothetical protein UM00737.1 [Ustilago maydis 521]
 gi|74704394|sp|Q4PGM6.1|RSE1_USTMA RecName: Full=Pre-mRNA-splicing factor RSE1
 gi|46095609|gb|EAK80842.1| hypothetical protein UM00737.1 [Ustilago maydis 521]
          Length = 1221

 Score =  172 bits (435), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 259/1175 (22%), Positives = 483/1175 (41%), Gaps = 171/1175 (14%)

Query: 7    YGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTDN 65
            +G I +L  FR  G ++D++ + ++  +  +L++  +++ L  +   +   R G R    
Sbjct: 62   FGVIRSLAAFRLTGGSKDYVIVGSDSGRIVILEYQPKTNSL-EKVHQETFGRSGSRRIVP 120

Query: 66   GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIR-------LEELQVLDIKF 116
            GQ    DP  R  +IG      L  ++  D +  L  +  +        +  +  +D+ F
Sbjct: 121  GQYLATDPKGRATMIGAMEKAMLVYILNRDAQANLTISSPLEAHRPSAIIHHIVGVDVGF 180

Query: 117  ---LYGCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVE---------GPWSQNNLDNGA 164
               L+ C +    V Y D+      + +E A K   + E           WS+  +D  +
Sbjct: 181  ENPLFACLE----VDYSDSDHDPSGRAFEEAAKTLTYYELDLGLNHVVRKWSEP-VDPRS 235

Query: 165  DLLIPVPPPL----------CGVLIIGEETIVYCSANAFK-AIPIRPSITKAYGRVDADG 213
            +LL+ VP              GVL+  E+ I Y   +  +  +PI   +       +  G
Sbjct: 236  NLLVQVPGGYNQNLEKWDGPSGVLVCSEDYITYKHQDQPEHRVPIPKRLNPVEKLSERRG 295

Query: 214  SR--------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDN 259
            +               +L+    G L  + + H+ +++  LKI+      +AS +  L +
Sbjct: 296  TLIVASVLHKMKNAFFFLVQTEDGDLFKITMEHQDDEIRSLKIKYFDTVPVASGLVILRS 355

Query: 260  AVVYIGSSYGDSQLIKLN------------------------------LQPDAKGSYVEV 289
              +++ S YG   L                                    P    + V+V
Sbjct: 356  GFLFVASEYGAQLLYSFQKLGDDDDLPEYISTDYDENGAGRRRPQLPTFTPRPLDNLVQV 415

Query: 290  LERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-K 348
             E   +L PI+D   ++       Q+    G     S +++R+G+ + E  S +L G+  
Sbjct: 416  DE-MPSLDPILDAKPLNPLAADSPQIFAACGRGARSSFKMLRHGLEVQEAVSSDLPGVPS 474

Query: 349  GMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQ 408
             +W+ + +  D +D+++++SF++ T +L++   + +EE    GF + + TL       + 
Sbjct: 475  AVWTTKITQQDEYDSYIILSFVNGTLVLSIG--ETIEEVSDSGFLTSSSTLAVQQLGQDA 532

Query: 409  LVQVTSGSVRLVSSTSRELRNEWKSP--PG--YSVNVAT-ANASQVLLATGGGHLVYLEI 463
            L+QV    +R V    +++ NEW +P  P    +  VAT  N  QV++A     LVY E+
Sbjct: 533  LLQVHPHGIRHVL-VDKQI-NEWATPSLPNGRQTTIVATCTNERQVVVALSSNELVYFEL 590

Query: 464  G-DGILTEVKHAQ-LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITK 521
              DG L E +  + +   +  + +    E    +   AVG   D +VRI SL   + +  
Sbjct: 591  DMDGQLNEYQERKAMGAGVLTMSMPDCPEGRQRTPYLAVGC-DDSTVRIISLEPNSTLAS 649

Query: 522  EHLGGEIIPRSVLLCA---------FEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVS 572
              +     P S +  A             +++   L +G LL  +L+  TG+LTD +   
Sbjct: 650  ISIQALTAPASSICMAEMLDATIDRNHATTFVNIGLQNGVLLRTILDAVTGQLTDTRTRF 709

Query: 573  LGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDS 632
            LG++ + L          V A S R  + Y+   +L +  +    + H   F++   P+ 
Sbjct: 710  LGSKAVRLIRTKVHGQAAVMALSTRTWLSYTYQDRLQFVPLIFDVLDHAWSFSAELCPEG 769

Query: 633  LAIAKEGELTIGTIDDI-QKLHIRSIPLGEHPRRIC-HQEQSRTFAICSLKNQSCA---- 686
            L       L I TI  +  KL   S+ L   PR+I  H  +   F +   ++++ +    
Sbjct: 770  LIGIVGSTLRIFTIPSLASKLKQDSVALSYTPRKIANHPNEQGLFYVVEAEHRTLSPGAQ 829

Query: 687  --------EESEMHFVRLLDDQTFEF------------------------ISTYPLDTFE 714
                    +E + H   +LD    EF                             +D  E
Sbjct: 830  RRRTEMLGKELKPHQRGVLDLNPAEFGAIRAEAGNWASCIRAVDGVQAQTTHRLEMDDNE 889

Query: 715  YGCSILSCSF-SDDSNVYYCVGTAY-VLPEENEPTKGRILVF-IVEDGK-LQLIAEKETK 770
               SI    F S +  V   VG+A  V+       K  +  + ++++G+ L+L+ + E  
Sbjct: 890  AAFSIAVVPFASAEKEVMLVVGSAVDVVLSPRSCKKAYLTTYRLLDNGRELELLHKTEVD 949

Query: 771  GAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY-VQTRGDFIV 829
                 L AF G+LLA I + +++Y     D G ++L  +C +     A+  +  +G  IV
Sbjct: 950  DIPLVLRAFQGRLLAGIGKALRIY-----DLGKKKLLRKCENRSFPTAVVSLDAQGSRIV 1004

Query: 830  VGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR--KN 887
            VGD+ +SI    YK  E  +   A D    +++   +LD D    A+   N++ +R   N
Sbjct: 1005 VGDMQESIIFASYKPLENRLVTFADDVMPKFVTRCTMLDYDTVAAADKFGNIYVLRLDGN 1064

Query: 888  SEGATDEERGRLEVV-------GEYHLGEFVNRFRHGSLVMRLPDSDV--GQIPTVIFGT 938
            +  + DE+   + +V       G  H    V  F  G ++  L  + +  G    +++  
Sbjct: 1065 TSRSVDEDPTGMTIVHEKPVLMGAAHKASLVAHFFVGDIITSLHRTAMVAGGREVLLYTG 1124

Query: 939  VNGVIGVIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDG 997
            ++G IG +   +  E    L  L+++LR+    + G +H  +RS     K+V     +DG
Sbjct: 1125 LSGSIGALVPFVSKEDVDTLSTLESHLRQENNSIVGRDHLAYRSSYAPVKSV-----IDG 1179

Query: 998  DLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            DL E+F  LS  + + I+  ++    E+ K++ +L
Sbjct: 1180 DLCETFGLLSPAKQNAIAGELDRKPGEINKKLAQL 1214


>gi|315053737|ref|XP_003176243.1| pre-mRNA-splicing factor rse1 [Arthroderma gypseum CBS 118893]
 gi|311338089|gb|EFQ97291.1| pre-mRNA-splicing factor rse1 [Arthroderma gypseum CBS 118893]
          Length = 1181

 Score =  171 bits (433), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 264/1165 (22%), Positives = 472/1165 (40%), Gaps = 191/1165 (16%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
            ++G I +L  FR  G  +D++ I ++  +  ++++   +     R   +   + G R   
Sbjct: 64   VFGIIRSLAAFRLAGSNKDYIIIGSDSGRITIVEY-VPAQNRFNRIHLETFGKSGIRRVV 122

Query: 65   NGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEE-------LQVLDIK 115
             GQ   +DP  R  LI     + L  V+  + + +L  +  +           L  LD+ 
Sbjct: 123  PGQYLAVDPKGRACLIASVEKNKLVYVLNRNAQAELTISSPLEAHRPQTVVFALTALDVG 182

Query: 116  F---LYGCAKPTIVVLYQDNKDARHVKT------YEVALKDKDFVEGPWSQNNLDNGADL 166
            +   ++   +     + QD     +  T      YE+ L     V   W+ + +D  A +
Sbjct: 183  YENPIFAALEVEYTEVDQDPTGQAYDDTEKMLVYYELDLGLNHVVRR-WA-DPVDRTASM 240

Query: 167  LIPVPPPL---CGVLIIGEETIVYCSAN--AFKA-IPIR--PSITKAYGRVDADGSR--- 215
            L  VP       GVL+  E+ IVY  +N  AF+  IP R  P+      R    G     
Sbjct: 241  LFQVPGGADGPSGVLVCAEDNIVYRHSNQDAFRVPIPRRRGPTENPERKRCITAGVMHKM 300

Query: 216  -----YLLGDHAGLLH---LLVITHEKEKVTG----LKIELLGETSIASTISYLDNAVVY 263
                 +LL    G L    + ++  E E  TG    LK++      +AS++  L +  ++
Sbjct: 301  RGAFFFLLQSEDGDLFKVTMEMVEDENENATGEVKRLKLKYFDTVPLASSLCILKSGFLF 360

Query: 264  IGSSYGD-----------------------SQLIKLNLQP----DAKGSYVEVLERYVNL 296
            + S  G+                       S +I   L P          + ++E   +L
Sbjct: 361  VASETGNQHFYQFEKLGDDDDEIEFISDDYSAVISEPLPPVYFRPRPAENLNLVESIASL 420

Query: 297  GPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSLRS 355
             P++   V ++  +   Q+ T  G     S R +++G+ ++E    EL  +   +W+ + 
Sbjct: 421  NPLMAASVTNVTEEDAPQIYTLCGTGARSSFRTLKHGLEVSEIVESELPSVPSAVWTTKL 480

Query: 356  STDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSG 415
            S +D +D ++V+SF + T +L++   + +EE    GF S   TL       + L+QV   
Sbjct: 481  SRNDQYDAYIVLSFSNGTLVLSIG--ETVEEVTDTGFLSSAPTLAVQQLGEDSLIQVHPK 538

Query: 416  SVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTEV-KH 473
             +R + +  R   NEW +P   S+  AT N  QV +A   G +VY E+  DG L E  + 
Sbjct: 539  GIRHIHADQR--VNEWPAPQHRSIVAATTNERQVAIALSSGEIVYFEMDTDGSLAEYDEK 596

Query: 474  AQLEYEISCLDINPIGENPS----YSQIAAVGMWTD-ISVRIFSLPDLNLITKEHLGGEI 528
             Q+   ++CL    +GE P     +  +  +    D  ++R+    D+ L    H+G   
Sbjct: 597  RQMSGTVTCLS---LGEVPKDDVDFCPVCPIHHVHDRFTLRV----DVIL----HIG--- 642

Query: 529  IPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNT 588
                               L  G  L  +L+  TGEL+D +   LG +P+ L + S K  
Sbjct: 643  -------------------LYSGIYLRTVLDEVTGELSDTRTRFLGVKPVKLFSVSVKEQ 683

Query: 589  THVFAASDRPTVIYS--SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTI 646
              V A S R  + YS    K    + +N   +     F+S    + +   +   L I +I
Sbjct: 684  RAVLALSSRSWLGYSDLQTKSFTLTPLNYVGLEWSWNFSSEQCVEGMVGIQGQNLRIFSI 743

Query: 647  DDI-QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHF------------ 693
            + +   L    IPL   PR      +   F +    N   +  ++               
Sbjct: 744  EKLDNNLLQEPIPLAYTPRNFVRHPEYPLFYVIGSDNNVLSPSTKAKLLGESTAVNGDSA 803

Query: 694  -------------------VRLLDD-QTFEFISTYPLDTFEYGCSILSCSF-SDDSNVYY 732
                               + ++D   T   +S   L+  E   SI + SF S +   + 
Sbjct: 804  ELPPEDFGYPRGTNHWASCIEVVDPINTKSVLSKLELEDNEAAVSIAAVSFTSQEDETFL 863

Query: 733  CVGTAY-VLPEENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVYSLNAFNGKLLAAINQ 789
             VGT   ++      T G I ++   E+GK L+ I + + +    +L  F G+LLA +  
Sbjct: 864  VVGTGKDMVVSPRTYTCGFIHIYRFQEEGKELEFIHKTKVEQPPLALLGFQGRLLAGVGP 923

Query: 790  KIQLYKWMLRDDGTRELQSEC-GHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGA 848
             +++Y     D G R+L  +C       + + +QT+G  I+V D+ +S++ ++YK++E A
Sbjct: 924  DLRIY-----DLGMRQLLRKCQAQITPRVIVGLQTQGSRIIVSDVQESVTYVVYKYQENA 978

Query: 849  IEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDEERGRLEVVGEY 905
            +   A D  + W +   ++D +   G +   N++ +R   K SE A ++  G   +    
Sbjct: 979  LIPFADDIISRWTTCTTMVDYETVAGGDKFGNIWLLRCPTKASEEADEDGSGAHLIHERQ 1038

Query: 906  HLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGT--------------VNGVIGV-IASLP 950
            +L    NR    SLV+     D   IPT I  T              + G +G+ +  + 
Sbjct: 1039 YLQGAPNRL---SLVVHFYSQD---IPTSIQKTQLVAGGRDILVWTGLQGTVGMFVPFIT 1092

Query: 951  HEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTR 1010
             +   F + L+  L      + G +H  +R +         K  +DGDL E+FL L   +
Sbjct: 1093 RDDVDFFQTLEMQLASQNAPLAGRDHLIYRGY-----YAPCKGVIDGDLCETFLLLPNDK 1147

Query: 1011 MDEISKTMNVSVEELCKRVEELTRL 1035
               I+  ++ SV E+ +++   T L
Sbjct: 1148 KQAIAGELDRSVREIERKISVRTIL 1172


>gi|358060450|dbj|GAA93855.1| hypothetical protein E5Q_00501 [Mixia osmundae IAM 14324]
          Length = 1153

 Score =  171 bits (432), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 204/915 (22%), Positives = 392/915 (42%), Gaps = 124/915 (13%)

Query: 217  LLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKL 276
            L+    G L+ + +  ++E+V  LKI+      +AS++  L +  +++ S +G+  + + 
Sbjct: 257  LVQSEEGDLYKVTVERQEEEVLALKIKYFDTVPVASSLCILRSGFLFVASEFGNHNVYQF 316

Query: 277  N------------------LQPDAKGSYVEV--LERYV------NLGPIVDFCVVDLERQ 310
            +                  ++ +   ++ +   LE  V      +L PIVD  V ++   
Sbjct: 317  DKLGDDGDEVSSADYPSFGMEREEPATFFKPRPLENLVLVDELDSLCPIVDAKVANVLGA 376

Query: 311  GQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIK--GMWSLRSSTDDPFDTFLVVS 368
               Q++T  G     SL+++R+G+ + E  S  L G +  G+W+ +  +DD +D+++V+S
Sbjct: 377  DAPQIITACGRGHRSSLKMLRHGLEVEEMVSSGL-GFEPTGLWTTKLKSDDTYDSYIVLS 435

Query: 369  FISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV-TSGSVRLVSSTSREL 427
              + T +L +   + +E+    G     +TL       + +VQV  +G +R+ +  S+E+
Sbjct: 436  APAATIVLTIG--ESIEQASNSGLVENQRTLCVQQVGDDSIVQVHPAGFLRVRADGSKEV 493

Query: 428  RNEWK-SPPGYSVNVATANASQVLLATGGGHLVYL--EIGDGILTEVKHAQLEYEISCLD 484
               W  +P   ++ VA AN  Q++LAT  G ++YL  E+   I       QL   ++ + 
Sbjct: 494  ---WPLAPERVNLAVACANQRQIVLATTSGDIIYLRCELDGEITVFDDRKQLGVSVTSMS 550

Query: 485  INPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGG-EIIPRSVLLCAF----- 538
            I  + E+   +    VG   D +VRI SL + N +    +     +P S+ +        
Sbjct: 551  IGELSEDRLQTDYLCVGC-EDQTVRIISLQEQNCLETISIQALTALPSSICIAEILDSSV 609

Query: 539  ---EGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAAS 595
                  +++   L +G LL  +L+  +G LTD +   LGT+P+ LR  +  +++ V + S
Sbjct: 610  DKTRPTTFVNIGLQNGVLLRTVLDPMSGRLTDTRTRFLGTRPVKLRRLAIGDSSGVLSIS 669

Query: 596  DRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDI-QKLHI 654
             R  V Y+    L +  +    + H     +   P+ +       L I T+  I  K+ +
Sbjct: 670  SRTWVNYTHQGLLQFDPLISDPLDHATGLRAEVCPEGIIGVTGDTLRIFTLPKIGTKVKM 729

Query: 655  RSIPLGEHPRRICH----------QEQSRTFAICSL--KNQSCAEESEMHF--------- 693
             SIPL   PRR             Q   RT +  +   K +   E SE  +         
Sbjct: 730  DSIPLSLTPRRTAFHPAGTLLYMIQSDHRTLSPITQEEKAKDLMEPSEAMWTAEINGLMR 789

Query: 694  ---------VRLLDDQTFE--FISTYPLDTFEYGCSILSCSFSDDSNVYY-CVGTAY--- 738
                     + ++D    E   ++   LD  E   S+    F++    ++  VGTA    
Sbjct: 790  AEAGQWSSCISIIDPTEPENATVTQIYLDNNEAAFSVAVAQFAERPGKWFLLVGTAQDTT 849

Query: 739  VLPEENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKW 796
            V P     T G I  + I E G+ L+L+ + E      S+ AF G+ +  + + ++LY  
Sbjct: 850  VSPRTC--THGFIRTYEITEAGRSLELLHKTELDDVPLSIAAFQGRAVVGVGRALRLYTM 907

Query: 797  MLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDY 856
                  +R L+           + +Q +G  I   D   S+  + YK  +  +   A D 
Sbjct: 908  ----GKSRLLRKSENKSFPAAVVSLQVQGSRIYASDAQDSVYFVAYKAADNRLLIFADDT 963

Query: 857  NANWMSAVEILDDDIYLGAENNFNLFTVRKN---SEGATDEERG---------------R 898
               W++   ++D D     +   N+F  R +   SE   +++ G               R
Sbjct: 964  QQRWITCNTVVDYDTVASGDKFGNVFVSRVDKLVSEDVDEDQTGAGILHEKPLFMGAPHR 1023

Query: 899  LEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGV-IASLPHEQYLFL 957
            L+++  +++G+ +   +  SLV        G    +++  + G +G+ I  +  E   F 
Sbjct: 1024 LQLLTHFNVGDILTCIQKVSLV-------AGGREILLYTCLGGTVGMLIPFISKEDVEFS 1076

Query: 958  EKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKT 1017
              L+ ++R     + G +H  +R +      V  K  +DGDL E+F  L   +  +I+  
Sbjct: 1077 STLEMHMRAENPSIVGRDHLAYRGYY-----VPQKATVDGDLCETFALLPMQKQAQIAGE 1131

Query: 1018 MNVSVEELCKRVEEL 1032
            ++ SV E+ K+++ +
Sbjct: 1132 LDRSVSEVLKKIDSM 1146


>gi|393236715|gb|EJD44262.1| hypothetical protein AURDEDRAFT_137476 [Auricularia delicata
            TFB-10046 SS5]
          Length = 1244

 Score =  170 bits (431), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 252/1115 (22%), Positives = 450/1115 (40%), Gaps = 163/1115 (14%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSE---LITRAMGDVSDRIGRP 62
            ++G + +L  FR  G  +D+  + ++  +  +L +D +++E   L T   G       R 
Sbjct: 51   VFGAVRSLVAFRLTGGTKDYAIVGSDSGRIFILDYDPKANEFKRLYTETYGKSG---ARR 107

Query: 63   TDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL----- 117
               GQ   +DP  R + +   +    V   +         +  LE  +   I F      
Sbjct: 108  IVAGQYFAVDPKGRAVMIAAMEKSKLVYILNRDAATNLTISSPLEVHRNQSILFHITGVD 167

Query: 118  YGCAKPTIVVLYQD-------------NKDARHVKTYEVALKDKDFVEGPWSQNNLDNGA 164
             G   P    L  D             +   + +  YE+ L     V   WS+   D  A
Sbjct: 168  VGFENPLFAALEIDYGKSDQDPTGEAFHNAEKMLTFYELDLGLNHVVRK-WSEPT-DPRA 225

Query: 165  DLLIPVP-------------PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDA 211
            + L+ VP             P   GVL+  E  I+Y   +  +    R  + +    +D+
Sbjct: 226  NFLVQVPGGTSASSPDKFDGP--SGVLVCCENHIIYRHMDHPQH---RVPVPRRRAPLDS 280

Query: 212  DGSRYLLGDHAGLLHLLVIT--HEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYG 269
            +    ++   A ++H + +T  H+ ++VT LKI+      ++S++  L +  +++ S +G
Sbjct: 281  EERGLIIV--AAVMHKMKVTIDHQDDEVTSLKIKYFDTVPVSSSLCILKSGFLFVASDFG 338

Query: 270  DSQLIKLNLQPD-------AKGSYVE------------------------VLERYVNLGP 298
               L +     D       +  SY +                        + +   ++ P
Sbjct: 339  PHHLYQFQKLGDDDAETEFSSSSYPKNGMADPDEALPSAYFQPRPLENLLLCDSLDSINP 398

Query: 299  IVDFCVVDLER--QGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVEL-QGI-KGMWSLR 354
            I+D  V++L        Q     G     S R++R+G+ + E  S +L  GI   +W+ +
Sbjct: 399  IIDAKVLNLMPTISDTPQFYVACGRGARSSFRLLRHGLEVEENISSDLPSGIPNAVWTTK 458

Query: 355  SSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTS 414
               DDPFDT++V+SF++ T  L + + + +EE +  GF S   TL       + L+QV  
Sbjct: 459  IRADDPFDTYIVLSFVNGT--LVLTIGETIEECQETGFLSAEPTLGVQQIGDDALLQVYP 516

Query: 415  GSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTEVKH 473
              +R + +  R   NEWK P G ++  A  N+ QV++A     LVY E+  DG L E + 
Sbjct: 517  HGIRHILADKRV--NEWKVPSGKTIVQAATNSRQVVVALNSAELVYFELDLDGQLNEYQD 574

Query: 474  AQ-LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
             + +   +  L +  + E    +   AVG   D +VRI SL   N +    L     P S
Sbjct: 575  RKAMGSVVLALSMAEVPEGRQRTPYLAVGC-EDQTVRIISLDPDNTLETISLQALTAPPS 633

Query: 533  VLL------CAFEGIS---YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTF 583
             +        A   IS   ++   L  G  L  +L+  TGELTD +   LGT+PI L   
Sbjct: 634  AICIVAIIDAAVNKISETLFVNIGLTSGLFLRTVLDPVTGELTDTRTRFLGTRPIRLNRV 693

Query: 584  SSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTI 643
            +      + A S R  + Y+  + L  + +    + ++  F +   PD     K   + I
Sbjct: 694  TIAGKPALLALSSRMWLNYAHQEALHLTPLIFDPLDYVSGFTADLCPDGFIGLKGSTIKI 753

Query: 644  GTIDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMH---------- 692
              I  I Q+L    +PL   PR+I     +  F I    +++ ++E+ +H          
Sbjct: 754  FQITKIGQRLKHEVMPLSYTPRKIAVHPMNGLFYIAESDHRTHSQEA-VHGALQQPDLKK 812

Query: 693  -------------------------FVRLLDDQTFEFISTYPLDTFEYGCSILSCSFS-D 726
                                      VR+++      I    LD  E    I    FS  
Sbjct: 813  YDRDVLELLPEVFGHPRAPACVWASCVRIVNPVELTTIYRLDLDNNEAAFLIALVPFSAR 872

Query: 727  DSNVYYCVGT---AYVLPEENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVYSLNAFNG 781
            D+ +   VGT   A++ P     T G +  + +++DG+ L+L+ + E      SL AF G
Sbjct: 873  DNELTLVVGTAKDAFLAPRSC--TSGYLRTYRVLDDGRGLELLHKTECDDIPLSLLAFQG 930

Query: 782  KLL-AAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY-VQTRGDFIVVGDLMKSISL 839
             LL A I + +++Y     D G ++L  +        A+  + T+G  I+ GD  +SI  
Sbjct: 931  NLLVAGIGKCLRIY-----DMGKKKLLRKAESKLFTTAITSLSTQGARIIAGDAQQSIFF 985

Query: 840  LIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKN---SEGATDEER 896
             +YK  E  +   A D    W ++  +LD +  +  +   N+F  R +   S    D+  
Sbjct: 986  CVYKATENRLLVFADDSQQRWTTSQVMLDYNTVVAGDKFGNVFVNRLSDHVSNLVDDDPT 1045

Query: 897  G------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV--GQIPTVIFGTVNGVIGVIAS 948
            G      +   +G  H    +  F  G ++M L  + +  G    +++  ++G IG++  
Sbjct: 1046 GTGLLHEKGMFMGAPHKTSMLCHFHVGDIIMSLQRTSLVPGGREVIVYFGLHGTIGMLVP 1105

Query: 949  LPHEQYL-FLEKLQTNLRKVIKGVGGLNHEQWRSF 982
               ++ + F   L+ ++R     +   +   WR +
Sbjct: 1106 FASKEDVDFFTPLEQHMRSENLSLVSRDILAWRGY 1140


>gi|410926437|ref|XP_003976685.1| PREDICTED: splicing factor 3B subunit 3-like, partial [Takifugu
            rubripes]
          Length = 1020

 Score =  170 bits (430), Expect = 4e-39,   Method: Compositional matrix adjust.
 Identities = 223/992 (22%), Positives = 404/992 (40%), Gaps = 137/992 (13%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            +L + ++G I +L  FR  G  +D++ I ++  +  +L++   S  +  +   +   + G
Sbjct: 53   LLTMEVFGIIRSLMAFRLTGGTKDYIVIGSDSGRIVILEYHT-SKNMFEKIHQETFGKSG 111

Query: 61   -RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
             R    GQ   +DP  R  +IG      L  ++  D   +L  +  +   +   L    +
Sbjct: 112  CRRIVPGQFLAVDPKGRAVMIGAIEKQKLVYILNRDAAARLTISSPLEAHKANTLVYHVV 171

Query: 118  ---YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNG 163
                G   P    L  D ++A +  T E A   +    F E     N+        L+  
Sbjct: 172  GVDVGFENPMFACLEMDYEEADNDPTGEAAANTQQTLTFYELDLGLNHVVRKYSEALEEH 231

Query: 164  ADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR--- 215
             + LI VP     P   GVLI  E  I Y   N      IR  I +    +D D  R   
Sbjct: 232  GNFLITVPGGSDGP--SGVLICSENYITY--KNFGDQPDIRCPIPRRRNDLD-DPERGMI 286

Query: 216  --------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAV 261
                          +L     G +  + +  ++E VT ++++      +A+ +  L    
Sbjct: 287  FVCSATHKTKSMFFFLAQTEQGDIFKVTLETDEEMVTEIRLKYFDTIPVATAMCVLKTGF 346

Query: 262  VYIGSSYGDSQLIKLN------------------------LQPDAKGSYVEVLERYVNLG 297
            +++ S +G+  L ++                          QP    + V V E+  +L 
Sbjct: 347  LFVASEFGNHYLYQIAHLGDDDEEPEFSSAMPLEEGDTFFFQPRPLKNLVLVDEQE-SLS 405

Query: 298  PIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSS 356
            PI+   + DL  +   Q+    G     +LR++R+G+ ++E A  EL G    +W++R  
Sbjct: 406  PIMSCQIADLANEDTPQLYVACGRGPRSTLRVLRHGLEVSEMAVSELPGNPNAVWTVRRH 465

Query: 357  TDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGS 416
             +D FD +++VSF++ T +L++   + +EE    GF   T TL C       LVQV    
Sbjct: 466  VEDEFDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSLLGDEALVQVYPDG 523

Query: 417  VRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKHA 474
            +R + +  R   NEWK+P   ++     N  QV++A  GG LVY E+   G L E  +  
Sbjct: 524  IRHIRADKR--VNEWKTPGKKTIVRCAVNQRQVVIALTGGELVYFEMDPTGQLNEYTERK 581

Query: 475  QLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPD---LNLITKEHLGGEIIPR 531
            ++  ++ C+ +  +      S+  AVG+  D +VRI SL     L  ++ + L  +  P 
Sbjct: 582  EMSADVVCMSLANVPPGEQRSRFLAVGL-ADNTVRIISLDPSDCLQPLSMQALPAQ--PE 638

Query: 532  SVLLCAFEGIS--------------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQP 577
            S+ +    G+               YL   L +G LL  +L+  TG+L+D +   LG++P
Sbjct: 639  SLCIVEMGGVEKQDELGEKGSMGFLYLNIGLQNGVLLRTVLDPVTGDLSDTRTRYLGSRP 698

Query: 578  ITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAK 637
            + L     +    V A S R  + YS   +   + ++ + + +   F S   P+ +    
Sbjct: 699  VKLFRVRMQGQEAVLAMSSRSWLSYSYQSRFHLTPLSYETLEYASGFASEQCPEGIVAIS 758

Query: 638  EGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQE-----------QSRTFAICSLKNQS 684
               L I  ++ +  +  + + PL   PR+ + H E            + T A  + + Q 
Sbjct: 759  TNTLRILALEKLGAVFNQVAFPLQYTPRKFVIHPETNNLVLIESDHNAYTEATKAQRKQQ 818

Query: 685  CAEE-------------SEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVY 731
             AEE             +EM     L++   E I   P        S++        N  
Sbjct: 819  MAEEMVEAAGEDERELAAEMA-AAFLNENLPEAIFGAPKAGAGQWASLVRL-----INPI 872

Query: 732  YCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKI 791
              V    V  E+NE         +    KL+ + +   + A  ++  F G++L  + + +
Sbjct: 873  QGVTLDQVQLEQNEAAFXXXXRLVSGGEKLEFVHKTPVEDAPLAIAPFQGRVLVGVGKLL 932

Query: 792  QLYKWMLRDDGTRELQSECGHHGHI--LALYVQTRGDFIVVGDLMKSISLLIYKHEEGAI 849
            ++Y     D G ++L  +C  + HI  +   + T G  ++V D+ +S+  + Y+  E  +
Sbjct: 933  RIY-----DMGKKKLLRKC-ENKHIPNVVTSIHTSGQRVIVTDVQESLFWVRYRRNENQL 986

Query: 850  EERARDYNANWMSAVEILDDDIYLGAENNFNL 881
               A D    W++   +LD D    A+   N+
Sbjct: 987  IIFADDTYPRWVTTACLLDYDTMASADKFGNI 1018


>gi|134106833|ref|XP_777958.1| hypothetical protein CNBA4270 [Cryptococcus neoformans var.
            neoformans B-3501A]
 gi|50260658|gb|EAL23311.1| hypothetical protein CNBA4270 [Cryptococcus neoformans var.
            neoformans B-3501A]
          Length = 1218

 Score =  170 bits (430), Expect = 5e-39,   Method: Compositional matrix adjust.
 Identities = 223/924 (24%), Positives = 381/924 (41%), Gaps = 138/924 (14%)

Query: 216  YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIK 275
            +LL    G L+ + I H  E V  LKI+      +A+++  L    +Y+ S + D  L +
Sbjct: 320  FLLQSEDGDLYKVWIEHNGEDVVALKIKYFDTVPVANSLCILKRGYIYVASEFSDQNLYQ 379

Query: 276  LNLQPDAKG-------SYVE----------------------VLERYVNLGPIVDFCVVD 306
                 +  G        Y E                      +++   +L PI D  VV+
Sbjct: 380  FQSLAEDDGEQEWSSTDYPENGNIDGPLPFAFFDPQPLRNLLLVDTVPSLDPITDAHVVN 439

Query: 307  L--ERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIK-GMWSLRSSTDDPFDT 363
            L        Q+    G     + R +++G+ + E  S  L G+   +W+L+ + D     
Sbjct: 440  LLGASSDTPQIYAACGRGARSTFRTLKHGLDVAEMVSSPLPGVPTNVWTLKLTED----- 494

Query: 364  FLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSST 423
              V+SF + T +L++   + +EE    GF S   TL         L+QV    +R + + 
Sbjct: 495  --VLSFPNGTLVLSIG--ETIEEVNDTGFLSSGPTLAVQQLGNAGLLQVHPYGLRHIRAA 550

Query: 424  SRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTEVKHAQ-LEYEIS 481
             R   +EW +PPG ++  AT N  QV++A     LVY E+  +G L+E +  + L    +
Sbjct: 551  DRV--DEWPAPPGQTIVAATTNRRQVVIALSTAELVYFELDPEGSLSEYQEKKALPGNAT 608

Query: 482  CLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPRSVLLCAFEG 540
            C+ I  + E    +   AVG   + +V I SL PD  L T         P S+ L     
Sbjct: 609  CVTIAEVPEGRRRTSFLAVGC-DNQTVSIISLEPDSTLDTLSLQALTAPPTSICLAEIFD 667

Query: 541  IS--------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
             S        +L   L +G LL  +++   G L+D +   LG +P  L   + +    V 
Sbjct: 668  TSIDKNRATMFLNIGLMNGVLLRTVVDPVDGSLSDTRLRFLGAKPPKLVRANVQGQPSVM 727

Query: 593  AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDI-QK 651
            A S R  ++Y+    L    +    + +    ++A  PD L       L I  I  + +K
Sbjct: 728  AFSSRTWLLYTYQDMLQTQPLIYDTLEYAWSLSAAMCPDGLIGISGNTLRIFNIPKLGEK 787

Query: 652  LHIRSIPLGEHPRR-ICHQEQS---------RTFAICSLKNQSCAEESE----------- 690
            L   S  L   PR+ I H   S         RT++  +++     +ESE           
Sbjct: 788  LKQDSTALTYTPRKFISHPFNSVFYMIEADHRTYSKSAIERIVKQKESEGRRVDTLLLDL 847

Query: 691  ---------------MHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFS-DDSNVYYCV 734
                              VR+LD    E I T  LD  E   SI    F       +  V
Sbjct: 848  PANEFGRPRAPAGHWASCVRVLDPLANETIMTLDLDEDEAAFSIAIAYFERGGGEPFLVV 907

Query: 735  GTAYVLPEENEPTK-GRILVF-IVEDGK-LQLIAEKETKGAVYSLNAFNGKLLAAINQKI 791
            GT      + +  K G + V+ I E G+ L+ + + +T      L  F G LLA I + +
Sbjct: 908  GTGVKTTLQPKGCKEGYLRVYAIKEQGRILEFLHKTKTDDIPLCLAGFQGFLLAGIGKSL 967

Query: 792  QLYKWMLRDDGTRELQSECGHHGHILALY-VQTRGDFIVVGDLMKSISLLIYKHEEGAIE 850
            +LY+      G + L  +C ++G   A+  +  +G  I+VGD+ +S    +Y+    +I 
Sbjct: 968  RLYEM-----GKKALLRKCENNGFPTAVVTINVQGARIIVGDMQESTFYCVYR----SIP 1018

Query: 851  ER-----ARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKN---SEGATDEERG----- 897
             R     A D    W++ V  +D +     +   N+F  R +   SE   D+  G     
Sbjct: 1019 TRQLLIFADDSQPRWITCVTSVDYETVACGDKFGNIFINRLDPSISEKVDDDPTGATILH 1078

Query: 898  -RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTV-------IFGTVNGVIGVIAS- 948
             +  ++G  H  E +  +  GS+V     + + +IP V       ++ T++G +G +   
Sbjct: 1079 EKSFLMGAAHKTEMIGHYNIGSVV-----TSITKIPLVAGGRDVLVYTTISGAVGALVPF 1133

Query: 949  LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSR 1008
            +  +   F+  L+ ++R     + G +H  +R +      V  K  +DGDL ESF  L  
Sbjct: 1134 VSSDDIEFMSTLEMHMRTQDISLVGRDHIAYRGY-----YVPIKGVVDGDLCESFSLLPY 1188

Query: 1009 TRMDEISKTMNVSVEELCKRVEEL 1032
             +   I+  ++ SV ++ K++E++
Sbjct: 1189 PKQQAIALDLDRSVGDVLKKLEQM 1212


>gi|407034933|gb|EKE37449.1| DNA damage-binding protein, putative [Entamoeba nuttalli P19]
          Length = 995

 Score =  168 bits (425), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 243/1057 (22%), Positives = 459/1057 (43%), Gaps = 151/1057 (14%)

Query: 10   IATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDNGQIG 69
            I  L  ++P  +  D+L + T+  +  +L+    +  +I+   G + D  GR    G   
Sbjct: 56   ITLLLKYKPLKDITDYLIVVTQSNEIEILKLTTSALHIISS--GSLKDTFGRKAFFGVKA 113

Query: 70   IIDPDCRLIGLHLYDGLFKVI--PFDNKGQLKEAFNIRLEELQVLDIKFLYGCAKPTIVV 127
            +I P+ +++ L+LY+ L KV+  P   K QL  A NI++    +LD+ F        + +
Sbjct: 114  VISPNNQILLLNLYEQLIKVVLLPQYPKDQLT-ASNIKVNHSHILDMTFCLLNDSYQLAL 172

Query: 128  LYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVY 187
            L+++ +D RH+ TY +     +  +G +SQ N+      ++     + G+ +I +E   Y
Sbjct: 173  LHENKRDIRHIDTYTLN-SFNELEKGSFSQPNVGPTTSRIVGFKDGIIGIFVISDEIGCY 231

Query: 188  CSANA---FKAIPIRPSITKAYGRVD----ADGSRYLLGDHAGLLHLLVITHEKEKVTGL 240
             + N       +   P+    + + D     D  R L      L   + I +E  ++T  
Sbjct: 232  FNGNGKHILCNLNQYPNTACCFIKKDVLIMTDSKRNL----DTLEFKVNINNEIIEITKK 287

Query: 241  KIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIV 300
            +   L + SI+STISYL N +++IGS  GD+ ++ LN +         +LE++ N G ++
Sbjct: 288  QSPKLLQ-SISSTISYLSNKILFIGSLNGDNLIMNLNGK---------ILEKWSNFGSLM 337

Query: 301  DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP 360
            D     +  +    VV   G      L I   G GI E  S E+  IK + S+       
Sbjct: 338  D--ARQISNREDYLVVGNGGGKGSIGLMI--KGSGIEELGSCEINDIKSVESIEYDRKK- 392

Query: 361  FDTFLVVSFISETRILAMNLEDE---LEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSV 417
               +++V F  E+ I  ++ + +   +EE+ ++    + + + C   I  +L+ V    +
Sbjct: 393  ---YVIVGFEEESNIWEIHQKSKHIKIEESSLKKVIGKERVMCC-GIIKERLIFVCKKGI 448

Query: 418  RLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLE 477
              +    +  +N       +  +    N  Q +    G  L   +I + +  + K    E
Sbjct: 449  YSIDKHQK--KNTLMKFDSFITHAKFIN--QQIYFIYGTEL--FKITEELKIQ-KMKDFE 501

Query: 478  YEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCA 537
              +SC DIN         +I  +G+W D S+ IF   D+N I K+    + I RS+L+  
Sbjct: 502  QNVSCFDIN---------EIIVIGLW-DGSILIF---DINGILKKTEVVDTIGRSILI-- 546

Query: 538  FEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSL---GTQPITLRTFSSKNTTHVFAA 594
                 Y+ C   D  ++ FL N    + T + K+      +Q I L+   S         
Sbjct: 547  DHNKIYVGC---DNEIIVFLYNQMEEDCTYKNKLKYHIETSQSIKLKKIYS--------- 594

Query: 595  SDRPTVIYSSNKKLLYSNVNL-----KEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDI 649
               P VI  +NK   +    L     +++  +C   S      +  A +  +  G + ++
Sbjct: 595  --IPCVI--ANKTFTFQQTGLVPLAVEDLIDICE--SPVGDYGVICATKRGIVFGLMKEM 648

Query: 650  QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYP 709
             ++  + I  GE+  +I            ++K+           + L+D        ++ 
Sbjct: 649  SRVTFKMIHSGENCCKIATDGSYGLLVGKTIKS-----------INLID-------GSFG 690

Query: 710  LDTFEYGCSILS-CSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVE-DGKLQLIAEK 767
            + T E   + L+ C  S + N+Y  VGTA +   E EP+ GRIL+   + +G + ++  +
Sbjct: 691  VTTIELKSNELALCVDSLEDNIY-AVGTAIIRENEIEPSSGRILLIRQDTEGLIYIVGTE 749

Query: 768  ETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDF 827
            +  GAVY L      ++A IN+ + + +   +D  T++          ++ + +    D+
Sbjct: 750  DYDGAVYCLKKCQKGIVAFINRNVHVIEKKGKDLNTKQ-----NMLLPLIGVSLDICKDY 804

Query: 828  IVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI---YLGAE--NNFNLF 882
            I+ GDL +S+S+  Y+++   ++   +D    W S V  ++ +    +L A+   N  +F
Sbjct: 805  IIAGDLARSLSVYRYRNDIEHLDVVGKDNQIVWSSCVGSIESEYGTSFLVADVSGNIKIF 864

Query: 883  TVRKNSEGATDEERGRLEVVGEYHLGEFVN----RFRHGSLVMRLPDSDVGQIPTVIFGT 938
                  E  TD+E+  + ++ + H+ + +N     F  G L+                G 
Sbjct: 865  N-SDEEEPKTDDEK--ISLISQVHVADSINFIQKSFYKGCLM----------------GG 905

Query: 939  VNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGD 998
            V+G I  I  +  E YLFL K+Q+ L K            WR   N ++T    N +DGD
Sbjct: 906  VHGGIYNICEISKEHYLFLNKIQSKLVK----------SNWRESVNTQQTSPMMNCIDGD 955

Query: 999  LIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRL 1035
             IES L+ S  +   +++ + V  +E+ ++++ L  L
Sbjct: 956  KIESILEWSEKKQMLLAQKIGVEHQEMIEKIQSLFSL 992


>gi|392580116|gb|EIW73243.1| hypothetical protein TREMEDRAFT_37240 [Tremella mesenterica DSM 1558]
          Length = 1214

 Score =  166 bits (421), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 258/1175 (21%), Positives = 479/1175 (40%), Gaps = 175/1175 (14%)

Query: 7    YGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTDN 65
            +G + ++  FR  G  +D++ ++++  +  ++++    +        +V  + G R    
Sbjct: 60   FGTVRSVAGFRLAGMTKDYILLSSDSGRMSIIEFVVTPTPHFESLYQEVFGKSGSRRVVP 119

Query: 66   GQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YGCAK 122
            GQ   +DP  R   +     L  V+  +++G+L  +  +   +   L    +    G   
Sbjct: 120  GQFLAVDPKGRSC-MFAASKLVYVLNRNSEGKLFPSSPLEAHKNHALITHVIGVDQGYDN 178

Query: 123  PTIVVLYQDNKDA-------------RHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIP 169
            P    L  D  ++             +H+  YE+ L     V   WS+   D  A+LL+ 
Sbjct: 179  PLYAALEVDYSESDEDPTGAAFEAAEKHLVFYELDLGLNHVVRK-WSEPT-DRRANLLVQ 236

Query: 170  VP------------PPLCGVLIIGEETIVYCSANA-FKAIPI---------RPSITKAYG 207
            VP            P   GVL+  E+ +++   +A    +PI         R  I++   
Sbjct: 237  VPGGQNATSDRFEGP--SGVLVCCEDHVIWKHMDAEAHRVPIPRRRNPLAQRGEISRGMI 294

Query: 208  RVDADGSR------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAV 261
             V A   +      +LL +  G L+ L I H+ E V  +KI+      +++++  L +  
Sbjct: 295  IVSAVMHKIRGAFFFLLQNEDGDLYKLWIDHQGEDVVAVKIKYFDTVPVSNSLCILKSGY 354

Query: 262  VYIGSSYGDSQLIKL-NLQ-------------PDAKGSYVEVLERYVN------------ 295
            ++  S +GD  L +  NL              PD   +   +   + N            
Sbjct: 355  LFAASEFGDQNLYQFQNLAEDDDEQEWSSTDYPDNGNTTGALPYAFFNPRPLQNLLLVDT 414

Query: 296  ---LGPIVDFCVVDLERQGQG--QVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KG 349
               L PI+D  VV+L  Q     Q+    G     + R +++G+ I +  +  L G+   
Sbjct: 415  LSSLDPILDAQVVNLLGQSSDTPQIYAACGRGPRSTFRTLKHGLEIQQIVASPLPGVPNA 474

Query: 350  MWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQL 409
            +W+L+ + +D FD+++V+SF + T +L++   + +EE    GF S   TL         L
Sbjct: 475  VWTLKLTEEDEFDSYIVLSFPNGTLVLSIG--ETIEEVNDTGFLSSGPTLAVQQLGDAGL 532

Query: 410  VQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGIL 468
            +QV    +R + +  R   +EW  PPG ++  AT N  QV++A     LVY E+  +G L
Sbjct: 533  LQVHPYGLRHIRAADRV--DEWACPPGSAITAATTNKRQVVIALSTAELVYFELDPEGSL 590

Query: 469  TEVKHAQ-LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGG 526
            +E +  + L    +C+ I  + E    +   AVG   + +V + SL P+  L T      
Sbjct: 591  SEYQDKKSLPGNATCVSIAEVPEGRRRTPFLAVGC-DNQTVHVISLEPESTLTTLSLQAL 649

Query: 527  EIIPRSVLLCAFEGIS--------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPI 578
               P S+ L      S        +L   L +G LL  +++   G L+D +   LG +P 
Sbjct: 650  TAPPASICLAEIFDTSIDKNRATMFLNIGLMNGVLLRTVVDPVEGSLSDTRLRFLGAKPP 709

Query: 579  TLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKE 638
             L   S   +  V A S R  ++Y+    L    +    + +    ++A  P+ L     
Sbjct: 710  KLVRSSIHGSPSVMAFSSRAWLLYTYQDMLQTQPLIYDALEYASTLSAAMCPEGLIGISG 769

Query: 639  GELTIGTIDDI-QKLHIRSIPLGEHPRRICHQEQSRTF-------------AICSLKNQS 684
              L I TI  + +KL   S+ L   PR+      S  F             AI  + +Q 
Sbjct: 770  NTLRIFTIPRLGEKLKQDSMSLTYTPRKFISHPFSTIFYMIESDHRVLGPKAIQRIVSQK 829

Query: 685  CAEESEM-----------------------HFVRLLDDQTFEFISTYPLDTFEYGCSILS 721
             A    +                         +R+LD  T + +ST  LD  E   S+  
Sbjct: 830  KAAGDRVDGSILELPPSEFGRPRAGPGHWASLIRILDPLTNQTVSTIELDEDEAAFSLTI 889

Query: 722  CSFSDDSN-VYYCVGTAYVLPEENEPTK-GRILVF-IVEDGK-LQLIAEKETKGAVYSLN 777
              F + +      VGTA          K G + V+ I E+G+ L+ + + +       + 
Sbjct: 890  AYFENMAGEPSLVVGTAVKTTLTPRGCKEGWLRVYAIKENGRTLEFMHKTKLDEIPLCVA 949

Query: 778  AFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY-VQTRGDFIVVGDLMKS 836
             F G LL    + ++LY     + G + L  +C ++     +  +   G  I+VGD+ +S
Sbjct: 950  GFQGYLLVGAGKSLRLY-----EAGKKALLRKCENNSFPTVIATINVIGARIIVGDMQES 1004

Query: 837  ISLLIYKHEEGAIEER-----ARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKN---S 888
                +Y+    +I  R       D    +++ V  +D D     +   N+F  R +   S
Sbjct: 1005 TFFCVYR----SIPTRQLLVFGDDTQPRFLTCVTNVDYDTVACGDKFGNVFVNRMDQAVS 1060

Query: 889  EGATDEERG------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIP----TVIFGT 938
            E   D+  G      +  ++G  H    +  ++ GS+V  L  + V  +P     +++ T
Sbjct: 1061 EKVDDDPTGAGILHEKGFLMGAAHKTTLIAHYQVGSVVTSL--TKVSLVPGGRDVLVYTT 1118

Query: 939  VNGVIGVIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDG 997
            ++G +G +   +  +   F+  L+ ++R     + G +H  +R +      V     +DG
Sbjct: 1119 ISGAVGALVPFISMDDVEFMTTLEMHMRSQNISLVGRDHLAYRGYYAPVMGV-----VDG 1173

Query: 998  DLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            DL +++  L  T+   I+  ++ SV ++ K++E++
Sbjct: 1174 DLCDAYSSLPYTKQSSIANELDRSVGDVLKKLEQM 1208


>gi|440907208|gb|ELR57379.1| Splicing factor 3B subunit 3 [Bos grunniens mutus]
          Length = 1133

 Score =  166 bits (420), Expect = 6e-38,   Method: Compositional matrix adjust.
 Identities = 185/788 (23%), Positives = 332/788 (42%), Gaps = 136/788 (17%)

Query: 357  TDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGS 416
            T D FD +++VSF++ T +L++   + +EE    GF   T TL C     + LVQV    
Sbjct: 364  TTDEFDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSLLGDDALVQVYPDG 421

Query: 417  VRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKHA 474
            +R + +  R   NEWK+P   ++     N  QV++A  GG LVY E+   G L E  +  
Sbjct: 422  IRHIRADKR--VNEWKTPGKKTIVKCAVNQRQVVIALTGGELVYFEMDPSGQLNEYTERK 479

Query: 475  QLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIP-RSV 533
            ++  ++ C+ +  +      S+  AVG+  D +VRI SL   + +  + L  + +P +  
Sbjct: 480  EMSADVVCMSLANVPPGEQRSRFLAVGL-VDNTVRIISLDPSDCL--QPLSMQALPAQPE 536

Query: 534  LLCAFE----------------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQP 577
             LC  E                G  YL   L +G LL  +L+  TG+L+D +   LG++P
Sbjct: 537  SLCIVEMGGTEKQDELGERGSIGFLYLNIGLQNGVLLRTVLDPVTGDLSDTRTRYLGSRP 596

Query: 578  ITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAK 637
            + L     +    V A S R  + YS   +   + ++ + +     F S   P+ +    
Sbjct: 597  VKLFRVRMQGQEAVLAMSSRSWLSYSYQSRFHLTPLSYETLEFASGFASEQCPEGIVAIS 656

Query: 638  EGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQEQSR-----------TFAICSLKNQS 684
               L I  ++ +  +  + + PL   PR+ + H E +            T A  + + Q 
Sbjct: 657  TNTLRILALEKLGAVFNQVAFPLQYTPRKFVIHPESNNLIIIETDHNAYTEATKAQRKQQ 716

Query: 685  CAEE-------------SEM-------------------------HFVRLLDDQTFEFIS 706
             AEE             +EM                           +R+++      + 
Sbjct: 717  MAEEMVEAAGEDERELAAEMAAAFLNENLPESIFGAPKAGNGQWASVIRVMNPIQGNTLD 776

Query: 707  TYPLDTFEYGCSILSCSFSDDSNVYYC-VGTAYVLPEENEPTKGRILVF--IVEDG-KLQ 762
               L+  E   S+  C FS+    +Y  VG A  L        G  +    +V +G KL+
Sbjct: 777  LVQLEQNEAAFSVAVCRFSNTGEDWYVLVGVAKDLILNPRSVAGGFVYTYKLVNNGEKLE 836

Query: 763  LIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHG-------H 815
             + +   +    ++  F G++L  + + +++Y     D G ++L  +C +          
Sbjct: 837  FLHKTPVEEVPAAIAPFQGRVLIGVGKLLRVY-----DLGKKKLLRKCENKAFLADGSSS 891

Query: 816  ILALYV------------QTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSA 863
            +  +YV            QT G  ++V D+ +S   + YK  E  +   A D    W++ 
Sbjct: 892  VFPMYVAVRHIANYISGIQTIGHRVIVSDVQESFIWVRYKRNENQLIIFADDTYPRWVTT 951

Query: 864  VEILDDDIYLGAENNFNLFTVR--KNSEGATDE---------ERGRL-------EVVGEY 905
              +LD D   GA+   N+  VR   N+    DE         +RG L       EV+  Y
Sbjct: 952  ASLLDYDTVAGADKFGNICVVRLPPNTNDEVDEDPTGNKALWDRGLLNGASQKAEVIMNY 1011

Query: 906  HLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNL 964
            H+GE V   +  +L+        G   ++++ T++G IG++     HE + F + ++ +L
Sbjct: 1012 HVGETVLSLQKTTLI-------PGGSESLVYTTLSGGIGILVPFTSHEDHDFFQHVEMHL 1064

Query: 965  RKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEE 1024
            R     + G +H  +RS+         KN +DGDL E F  +   +   +S+ ++ +  E
Sbjct: 1065 RSEHPPLCGRDHLSFRSY-----YFPVKNVIDGDLCEQFNSMEPNKQKNVSEELDRTPPE 1119

Query: 1025 LCKRVEEL 1032
            + K++E++
Sbjct: 1120 VSKKLEDI 1127



 Score = 41.6 bits (96), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 76/378 (20%), Positives = 147/378 (38%), Gaps = 52/378 (13%)

Query: 1   MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
           +L V ++G I +L  FR  G  +D++ + ++  +  +L++   S  +  +   +   + G
Sbjct: 53  LLTVEVFGVIRSLMAFRLTGGTKDYIVVGSDSGRIVILEYQP-SKNMFEKIHQETFGKSG 111

Query: 61  -RPTDNGQIGIIDPDCRLIGLHLYDG--LFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
            R    GQ   +DP  R + +   +   L  ++  D   +L  +  +   +   L    +
Sbjct: 112 CRRIVPGQFLAVDPKGRAVMISAIEKQKLVYILNRDAAARLTISSPLEAHKANTLVYHVV 171

Query: 118 ---YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNG 163
               G   P    L  D ++A +  T E A   +    F E     N+        L+  
Sbjct: 172 GVDVGFENPMFACLEMDYEEADNDPTGEAAANTQQTLTFYELDLGLNHVVRKYSEPLEEH 231

Query: 164 ADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR--- 215
            + LI VP     P   GVLI  E  I Y   N      IR  I +    +D D  R   
Sbjct: 232 GNFLITVPGGSDGP--SGVLICSENYITY--KNFGDQPDIRCPIPRRRNDLD-DPERGMI 286

Query: 216 --------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAV 261
                         +L     G +  + +  +++ VT ++++      +A+ +  L    
Sbjct: 287 FVCSATHKTKSMFFFLAQTEQGDIFKITLETDEDMVTEIRLKYFDTVPVAAAMCVLKTGF 346

Query: 262 VYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFC-----VVDLERQGQGQVV 316
           +++ S +G+  L+K+ +  D   +Y+ V   +VN   ++        V D    G    +
Sbjct: 347 LFVASEFGNQWLVKVLITTDEFDAYIIV--SFVNATLVLSIGETVEEVTDSGFLGTTPTL 404

Query: 317 TCSGAYKDGSLRIVRNGI 334
           +CS    D  +++  +GI
Sbjct: 405 SCSLLGDDALVQVYPDGI 422


>gi|330790247|ref|XP_003283209.1| CPSF domain-containing protein [Dictyostelium purpureum]
 gi|325086890|gb|EGC40273.1| CPSF domain-containing protein [Dictyostelium purpureum]
          Length = 1233

 Score =  165 bits (418), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 175/758 (23%), Positives = 333/758 (43%), Gaps = 104/758 (13%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
           ++G I ++  FR    ++D++ + ++  +  +L++++  ++   +   +   R G R   
Sbjct: 57  VFGVIRSIIPFRLTSGSKDYIVVGSDSGRVVILEYNSTKNQF-DKVHQETFGRSGCRRIV 115

Query: 65  NGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLY---- 118
            GQ   +DP  R  +IG      L  ++  D+  +L    +  LE  +   I F      
Sbjct: 116 PGQYLAVDPKGRAFMIGAIEKQKLVYILNRDSSAKL--TISSPLEAHKSSTIIFSMCGVD 173

Query: 119 -GCAKP---TIVVLY------------QDNKDARHVKTYEVALKDKDFVEGPWSQNNLDN 162
            G   P   TI V Y            +DN   + +  YE+ L   + V   WS + +D+
Sbjct: 174 VGFENPIFATISVDYSDDSGGNIEEDLEDNGKKKVLTFYELDLGLNNVVR-KWS-DQIDD 231

Query: 163 GADLLIPVPPP---LCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR---- 215
            A++++ VP       GVL+  E+ IVY + +  +   IR  I + Y      G      
Sbjct: 232 TANIVMTVPGSNEGPGGVLVTSEDYIVYRNQDHPE---IRAKIPRRYNSNPDKGLLIVSY 288

Query: 216 ----------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIG 265
                     +L+    G L+ + +  + E VT + I       +A++++ L N  ++  
Sbjct: 289 SSHKQKNMFFFLVQSEFGDLYKITLDFKGEMVTAVNISYFDTIPLANSLTVLKNGFLFAA 348

Query: 266 SSYGDSQLIKLN--------------LQPDAKGSYVEVLER-----------YVNLGPIV 300
           S +GD  L                  +  + K  ++   +R             +L PIV
Sbjct: 349 SEFGDHSLYFFKSLGDDTEEGNNEGTITMEEKDGFIWFKQREELKNLVQTSTLSSLSPIV 408

Query: 301 DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSL----RS 355
           DF V+D+ R+   Q+ +  G  KD SL+++R+G+ ++   + +L G+  G+W++      
Sbjct: 409 DFKVLDIVREENPQLYSLCGTSKDSSLKVLRHGLSVSTITNAKLPGVPSGIWTVPKSNSP 468

Query: 356 STDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSG 415
           S  D  D ++VVSF+  T +L++   + ++E    G    T TL       + ++QV   
Sbjct: 469 SAQDQTDKYIVVSFVGSTSVLSVG--ETIQENHESGILESTTTLLIGSMGEDSILQVFPT 526

Query: 416 SVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTEVKHA 474
             R + S  R   NEW++P   ++  AT N +Q+++A  GG ++Y E+   G LTE+   
Sbjct: 527 GFRFIRSDMRT--NEWRAPGRKTIVRATCNQNQLVIALSGGEVIYFELDQSGGLTEIIKK 584

Query: 475 QLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHL--GGEIIPRS 532
               +I+C++I+PI +  S ++  A+  W +  VR+ SL   N + +  +    ++   S
Sbjct: 585 DFRRDIACIEISPIPKGRSMARFLAISDW-EGPVRLLSLDRDNCLGQISMLDTDKVYIES 643

Query: 533 VLLCAFE-----------------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGT 575
           + +   +                 G  +L   L +G +    L+  +GEL+D +   LG 
Sbjct: 644 LSIVEMQVNEAGVDNFKVSSQNMGGSLFLFVGLKNGVIKRATLDNYSGELSDIRTRLLGR 703

Query: 576 QPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAI 635
           +P+ L     +    + A S R  + Y++  KL    ++++ + +    +S   P+S+  
Sbjct: 704 RPVKLFKIKVRGGNGILALSSRVWLNYANQGKLEIVPLSVEPLENASNLSSDQSPESIVA 763

Query: 636 AKEGELTIGTIDDIQKL-HIRSIPLGEHPRRICHQEQS 672
             E  + I TID +  L +  +I LG  PRR     Q+
Sbjct: 764 TSEDHIIIFTIDKLGDLFNQETIKLGATPRRFIVHPQT 801



 Score =  116 bits (291), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 95/364 (26%), Positives = 167/364 (45%), Gaps = 42/364 (11%)

Query: 694  VRLLDDQTFEFISTYPLDTFEYGCSILSCSFSD--DSNVYYCVG---TAYVLPEENEPTK 748
            ++++D  + +   +Y LD  E G S+ + SFSD  D  ++  VG      + P+  E   
Sbjct: 881  IKIIDPISHQVYESYQLDENEAGFSLCTLSFSDRGDGEIFLVVGCGNNVILNPKSCESAS 940

Query: 749  GRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQS 808
              +  F  E  KLQL+ + E +  VY++  F G+L A + + I++Y     D G ++L  
Sbjct: 941  INLYKFTEEGKKLQLVYKTEAEEPVYAMAPFQGRLCAGVGKNIRIY-----DMGKKKLLR 995

Query: 809  ECGHHGHILALY-VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEIL 867
            +C        +  + + GD +VVGD+ +SI  + YK  E  +   A D    W+++  +L
Sbjct: 996  KCETKNLPNTIVNIHSLGDRLVVGDIQESIHFIKYKKLENMLYVFADDLAPRWITSSVML 1055

Query: 868  DDDIYLGAENNFNLFTVR------------------KNSEGATDEERGRLEVVGEYHLGE 909
            D D   GA+   N+F +R                  K   G  +    +LE +     G+
Sbjct: 1056 DYDTVAGADKFGNIFILRLPSNVSDEVEEDPTGSKLKFESGLLNGAPHKLEHIANIFAGD 1115

Query: 910  FVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIG-VIASLPHEQYLFLEKLQTNLRKVI 968
             +      SLV       VG    +++ T++G IG +I  +  E   F   L+  LR   
Sbjct: 1116 AITTLNKTSLV-------VGGSDVLLYTTISGAIGALIPFVSREDVDFFSSLELQLRNEH 1168

Query: 969  KGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKR 1028
              + G +H  +RS+         KN +DGDL E F+ L   +  +I++ ++ S  E+ K+
Sbjct: 1169 APLCGRDHLAYRSY-----YFPVKNIIDGDLCEQFITLDPQKQRQIAEELSRSPSEVLKK 1223

Query: 1029 VEEL 1032
            +E++
Sbjct: 1224 LEDI 1227


>gi|183233163|ref|XP_654084.2| damaged DNA binding protein [Entamoeba histolytica HM-1:IMSS]
 gi|169801703|gb|EAL48698.2| damaged DNA binding protein, putative [Entamoeba histolytica
            HM-1:IMSS]
 gi|449708240|gb|EMD47737.1| DNA-repair binding protein, putative [Entamoeba histolytica KU27]
          Length = 995

 Score =  165 bits (417), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 241/1057 (22%), Positives = 459/1057 (43%), Gaps = 151/1057 (14%)

Query: 10   IATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDNGQIG 69
            I  L  ++P  +  D+L + T+  +  +L+    +  +I+   G + D  GR    G   
Sbjct: 56   ITLLLKYKPLKDITDYLIVVTQSNEIEILKLTTSALHIISS--GSLKDTFGRKAFFGVKA 113

Query: 70   IIDPDCRLIGLHLYDGLFKVI--PFDNKGQLKEAFNIRLEELQVLDIKFLYGCAKPTIVV 127
            +I P+ +++ L+LY+ L KV+  P   K QL  A NI++    +LD+ F        + +
Sbjct: 114  VISPNNQILLLNLYEQLIKVVLLPQYPKDQLT-ASNIKVNHSHILDMTFCLLNDSYQLAL 172

Query: 128  LYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVY 187
            L+++ +D RH+ TY +     +  +G +SQ N+ +    ++     + G+ +I +E   Y
Sbjct: 173  LHENKRDIRHIDTYTLN-SFNELEKGSFSQPNVGSTTSRILGFKDGIIGIFVISDEIGCY 231

Query: 188  CSANA---FKAIPIRPSITKAYGRVDA----DGSRYLLGDHAGLLHLLVITHEKEKVTGL 240
             + N       +   P+    + + D     D  R L      L   + I +E  ++T  
Sbjct: 232  FNGNGKHILCNLNQYPNTACCFIKKDVLIMTDSKRNL----DTLEFKVNINNEILEITKK 287

Query: 241  KIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIV 300
            +   L + SI+STISYL N +++IGS  GD+ ++ LN +         +LE++ N G ++
Sbjct: 288  QSPKLLQ-SISSTISYLSNKILFIGSLNGDNLIMNLNGK---------ILEKWSNFGSLM 337

Query: 301  DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP 360
            D     +  +    VV   G      L I   G GI E  S E+  IK + S+       
Sbjct: 338  D--ARQISNREDYLVVGNGGGKGSIGLMI--KGSGIEELGSCEINDIKSVESIEYDRKK- 392

Query: 361  FDTFLVVSFISETRILAMNLEDE---LEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSV 417
               +++V F  E+ I  +  + +   +EE+ ++    + + + C   I  +L+ V    +
Sbjct: 393  ---YVIVGFEEESNIWEIYQQSKHIKIEESSLKKVIGKERVICC-GIIKERLIFVCKKGI 448

Query: 418  RLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLE 477
              +    +  +N       +  +    N  Q +    G  L   +I + +  + K   LE
Sbjct: 449  YSIDKHQK--KNTLMKFDSFITHAKFIN--QQIYFIYGTEL--FKITEELKIQ-KMKDLE 501

Query: 478  YEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCA 537
              +SC DIN         +I  +G+W D ++ IF   D+N + K+    + I RS+L+  
Sbjct: 502  QNVSCFDIN---------EIIVIGLW-DGNILIF---DINGVLKKTEVVDTIGRSILI-- 546

Query: 538  FEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSL---GTQPITLRTFSSKNTTHVFAA 594
                 Y+ C   D  ++ FL N    + T + K+      +Q I L+   S         
Sbjct: 547  DHNKIYVGC---DNEIIVFLYNQMEEDCTYKNKLKYHIETSQSIKLKKIYS--------- 594

Query: 595  SDRPTVIYSSNKKLLYSNVNL-----KEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDI 649
               P VI  +NK   +    L     +++  +C   S      +  A +  +  G + ++
Sbjct: 595  --IPCVI--ANKTFTFQQTGLVPLAVEDLIDICE--SPVGDYGVICATKRGIVFGLMKEM 648

Query: 650  QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYP 709
             ++  + I  GE+  +I            ++K+           + L+D        ++ 
Sbjct: 649  SRVTFKMIHSGENCCKIATDGSYGLLVGKTIKS-----------INLID-------GSFG 690

Query: 710  LDTFEYGCSILS-CSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVE-DGKLQLIAEK 767
            + T E   + L+ C  S + N+Y  VGTA +   E EP+ GRIL+   + +G + ++  +
Sbjct: 691  VTTIELKSNELALCVDSLEDNIY-AVGTAIIRENEIEPSSGRILLIRQDTEGLIYIVGTE 749

Query: 768  ETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDF 827
            +  GAVY L      ++A IN+ + + +   +D  T++          ++ + +    D+
Sbjct: 750  DYDGAVYCLKKCQKGIVAFINRNVHVIEKKGKDLNTKQ-----NMLLPLIGVSLDICKDY 804

Query: 828  IVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI---YLGAE--NNFNLF 882
            I+ GDL +S+S+  Y+++   ++   +D    W S V  ++ +    +L A+   N  +F
Sbjct: 805  IIAGDLARSLSVYRYRNDIEHLDVVGKDNQIVWSSCVGSIESEYGTSFLVADVSGNIKIF 864

Query: 883  TVRKNSEGATDEERGRLEVVGEYHLGEFVN----RFRHGSLVMRLPDSDVGQIPTVIFGT 938
                  E  TD+E+  + ++ + H+ + +N     F  G L+                G 
Sbjct: 865  N-SDEEEPKTDDEK--INLISQVHVADSINFIQKSFYKGCLM----------------GG 905

Query: 939  VNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGD 998
            V+G I  I  +  E YLFL K+Q+ L K            WR   N ++     N +DGD
Sbjct: 906  VHGGIYNICEISKEHYLFLNKIQSKLVK----------SNWRESVNTQQASPMMNCIDGD 955

Query: 999  LIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRL 1035
             IES L+ S  +   +++ + V  +E+ ++++ L  L
Sbjct: 956  KIESILEWSEKKQMLLAQKIGVEHQEMIEKIQSLFSL 992


>gi|440792421|gb|ELR13643.1| splicing factor 3b subunit 3, putative [Acanthamoeba castellanii
           str. Neff]
          Length = 1227

 Score =  165 bits (417), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 168/730 (23%), Positives = 315/730 (43%), Gaps = 73/730 (10%)

Query: 1   MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
           +L V ++G I ++  FR  G+ +D++ + ++  +  +LQ++ E +               
Sbjct: 53  VLSVEVFGTIRSIVPFRLTGDTRDYIIVGSDSGRIVILQYNNERNTFDKIHQETFGKSGC 112

Query: 61  RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL- 117
           R    GQ   +DP  R  +IG      L  ++  D+  +L  +  +   +   ++I  + 
Sbjct: 113 RRIVPGQYLAVDPKGRAVMIGAVEKQKLVYILNRDSAARLTISSPLEAHKSHTINIHMVG 172

Query: 118 --YGCAKPTIVVLYQDNKDARHVKT---YEVALKDKDFVE---------GPWSQNNLDNG 163
              G   P    L  D  D  H  T   +E   K+  F E           WS+  +D  
Sbjct: 173 VDVGFENPIFATLEVDYADVDHDPTGKEFEALHKNLTFYELDLGLNHVVRKWSEP-VDIT 231

Query: 164 ADLLIPVP-----PPLCGVLIIGEETIVYCSAN---AFKAIPIRPS--------ITKAYG 207
           A++L+ VP     P   GVL++GE  +V+ +        A+P R          I  A  
Sbjct: 232 ANMLVAVPGGSDGP--GGVLVMGENFVVWKNQGHQEVRAALPRRKILGEERSVLIVSATA 289

Query: 208 RVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSS 267
               D    LL    G ++   +  E++ V+ +KI        ++ +  L    +++ + 
Sbjct: 290 HKQKDLFFILLQSEYGDIYKATLAWEEDNVSAIKINYFDTVPTSNAMCILKTGFLFVAAE 349

Query: 268 YGDSQLIKLN--------------------LQPDAKGSYVEVLERYVNLGPIVDFCVVDL 307
           +G+  L +                        P    + + + E   +L PI+DF V D+
Sbjct: 350 FGNHFLFQFQSIGDEEEESTMDQDEDEIPTFDPQPLKNLIAIDE-IESLAPIMDFKVADM 408

Query: 308 ERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTDDPFDTFLV 366
            ++   Q +T  G     SLR++++G+ + E A   L G    ++++R +  D +D++++
Sbjct: 409 VKEETKQFLTLCGRGPRSSLRLLKHGLAVAEMADSPLPGNPNNIFTVRKNVADEYDSYII 468

Query: 367 VSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRE 426
           VSF++ T +L++   D +EE +  GF     TL       + LVQV    +R + S  R 
Sbjct: 469 VSFLNATLVLSIG--DNVEEVKDAGFNENASTLNVGLVGDDSLVQVFPTGIRFIRSDKR- 525

Query: 427 LRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTEVKHAQLEYEISCLDI 485
              EW +P   ++  +  N  QV++A  GG L+Y E+   G L EV    +  +I+C+DI
Sbjct: 526 -ITEWPTPARRTIVRSAINNKQVVIALTGGELLYFELDITGSLVEVGRKDMGRDIACIDI 584

Query: 486 NPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGG-EIIPRSVLLCAFEGIS-- 542
            PI E    ++  AVG + + +VR+ SL   ++ +   +      P S+ +   +G +  
Sbjct: 585 APIPEGRLRARFLAVGDYEN-TVRVLSLDPEDVFSSLAVQALPAPPESLCIVKMKGGTDS 643

Query: 543 -----YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDR 597
                +L   L +G L   +L+  TGEL+D +   LG++P+ L      +   + A + R
Sbjct: 644 SAGTLFLNIGLTNGVLQRTVLDKVTGELSDTRTRFLGSRPVRLLKLRVGDQPAMLALTSR 703

Query: 598 PTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDI-QKLHIRS 656
             + Y+    L  + ++   + +   F S   P+ +       L I + + + +  H   
Sbjct: 704 AWLCYNYQLHLHLTPLSYPALDYASNFCSERCPEGMVATIGNALRIISPERLGEVFHQEL 763

Query: 657 IPLGEHPRRI 666
           IPL   PR++
Sbjct: 764 IPLRYTPRKM 773



 Score =  112 bits (281), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 96/362 (26%), Positives = 168/362 (46%), Gaps = 44/362 (12%)

Query: 694  VRLLDDQTFEFISTYPLDTFEYGCSILSCSFSD-DSNVYYCVGTAYVL---PEENEPTKG 749
            VRLLD  T + I    LD  E   S  +C F D    ++  VGTA  L   P   +   G
Sbjct: 878  VRLLDVNTKQTIDVVELDNNEAAFSACTCVFHDRGGEIFLVVGTAKGLVLNPRSCD--AG 935

Query: 750  RILVFIVEDG--KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQ 807
             I V+ + DG  +  L+ + + +G   ++  F G+LL  I + +++Y     D G R+L 
Sbjct: 936  YIHVYRLLDGGKRFSLVHKTQVEGVPTAVCGFQGRLLVGIGKMLRIY-----DLGKRKLL 990

Query: 808  SECGHHGHILALY-VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEI 866
             +C + G    +  + T+G+ I+VGDL +S   + Y+  E  +   A D N  W++A ++
Sbjct: 991  RKCENKGFPHCIQSITTQGERIIVGDLAESFHFVKYRKAENQLNVYADDSNPRWLTASQM 1050

Query: 867  LDDDIYLGAENNFNLFTVRKNSEGATDEERG-----------------RLEVVGEYHLGE 909
            LD D   GA+   N+F VR  SE   + E                   +L+ +  +H+G+
Sbjct: 1051 LDYDTMAGADKFGNVFIVRLPSEVNEELEDNPMGNFLMSKQSLNGAAFKLQTLINFHVGD 1110

Query: 910  FVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIG-VIASLPHEQYLFLEKLQTNLRKVI 968
             +N     SL         G    +++ T+ G +G ++  +  E   F   L+ ++R  +
Sbjct: 1111 TINSMTKASLF-------TGGADVLVYTTLMGGMGALLPFVSREDVDFFSHLEMHMRSEL 1163

Query: 969  KGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKR 1028
              + G +H  +RS+         K+ +DGDL E F  L   +   I++ ++ +  E+ K+
Sbjct: 1164 PPLCGRDHLAYRSY-----YFPVKDVIDGDLCEQFSLLPPEKQRTIAEELDRTPGEVLKK 1218

Query: 1029 VE 1030
            +E
Sbjct: 1219 LE 1220


>gi|388855100|emb|CCF51231.1| probable splicing factor 3B subunit 3 [Ustilago hordei]
          Length = 1221

 Score =  164 bits (415), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 248/1175 (21%), Positives = 474/1175 (40%), Gaps = 171/1175 (14%)

Query: 7    YGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTDN 65
            +G + +L  FR  G ++D+L + ++  +  +L++  +++    +   +   R G R    
Sbjct: 62   FGVVRSLAAFRLTGGSKDYLIVGSDSGRIVILEFQPQTNSF-DKVHQETFGRSGSRRIVP 120

Query: 66   GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLK-----EAF--NIRLEELQVLDIKF 116
            GQ    DP  R  +IG      L  ++  D +  L      EA   N  +  +  +D+ F
Sbjct: 121  GQYLATDPKGRATMIGAMEKAKLVYILNRDAQANLTISSPLEAHRPNGIIHHIVGVDVGF 180

Query: 117  ---LYGCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVE---------GPWSQNNLDNGA 164
               L+ C    + + Y ++      + Y+ A K   + E           WS+  +D  +
Sbjct: 181  ENPLFAC----LEIDYSESDHDPSGRAYDEAEKTLTYYELDLGLNHVVRKWSEP-VDPRS 235

Query: 165  DLLIPVPPPL----------CGVLIIGEETIVYCSANAFK-AIPIRPSITKAYGRVDADG 213
            +LL+ VP              GVL+  E+ I Y   +  +  +PI   +       +  G
Sbjct: 236  NLLLQVPGGYNHNLEKWDGPSGVLVCSEDYITYKHQDQPEHRVPIPRRLNPIEKMSERRG 295

Query: 214  SR--------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDN 259
            +               +L+    G L  + + H+ +++  LKI+      +AS ++ L +
Sbjct: 296  TLVVASVLHKMKNAFFFLVQTEDGDLFKITMEHQDDEIRSLKIKYFDTVPVASGLAILRS 355

Query: 260  AVVYIGSSYGDSQLIKLN------------------------------LQPDAKGSYVEV 289
              +++ S +G   L                                    P +  + V+V
Sbjct: 356  GFLFVASEFGPQLLYSFQKLGDDDDLPEYISTDYDENGAGRKRPQLPTFTPRSLDNLVQV 415

Query: 290  LERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-K 348
             E   +L PI+D   ++       Q+    G     S +++R+G+   E  S +L G+  
Sbjct: 416  DE-MPSLDPILDAKPLNPLASDSPQIFVACGRGARSSFKMLRHGLEAQEAVSSDLPGVPS 474

Query: 349  GMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQ 408
             +W+ + +  D +D+++++SF++ T +L++   + +EE    GF + + TL       + 
Sbjct: 475  AVWTTKITRQDEYDSYIILSFLNGTLVLSIG--ETIEEVGDSGFLTSSPTLAVQQLGEDA 532

Query: 409  LVQVTSGSVRLVSSTSRELRNEWKSPP---GYSVNVAT--ANASQVLLATGGGHLVYLEI 463
            L+QV    +R +    +   NEW +P    G    +     N  QV +A     LVY E+
Sbjct: 533  LLQVHPYGIRHILVDKQV--NEWATPSLPNGVQTTIVAICTNERQVAVALSSNELVYFEL 590

Query: 464  G-DGILTEVKHAQLE-YEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITK 521
              DG L E +  +     +  + +    E    +   A G   D +VRI SL   + +  
Sbjct: 591  DMDGQLNEYQDRKATGATVLTMSMADCPEGRQRTPYLAAGC-DDSTVRIISLEPASTLAS 649

Query: 522  EHLGGEIIPRSVLLCA---------FEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVS 572
              +     P S +  A          +  +++   L +G LL  +L+  TG+LTD +   
Sbjct: 650  ISIQALTAPASSICVAEMKDATVDRNQATTFVNIGLSNGVLLRTVLDAMTGQLTDTRTRF 709

Query: 573  LGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDS 632
            LG++ + L        + V A S R  + ++   +L ++ +    + H   F++   P+ 
Sbjct: 710  LGSKAVRLIRTKVHGQSAVMALSTRTWLSFTYQSRLQFTPLIFDALDHAWSFSAELCPEG 769

Query: 633  LAIAKEGELTIGTIDDI-QKLHIRSIPLGEHPRRICH-----------QEQSRTFAICSL 680
            L       L I TI  +  KL   S+ L   PR+I H           +   RT +  + 
Sbjct: 770  LIGIVGSTLRIFTIPSLASKLKQDSVALSYTPRKIAHHPDEQGLFYVVEADRRTLSPGAQ 829

Query: 681  KNQSCAEESEMH--------------------------FVRLLDDQTFEFISTYPLDTFE 714
            + +  A E E+                            VR++D    +      LD  E
Sbjct: 830  RRRVEALEKELKPHQRGVLDLKPAEFGLIRGEAGNWASCVRVVDGPQSQTTHKIELDDNE 889

Query: 715  YGCSILSCSF-SDDSNVYYCVGTAY-VLPEENEPTKGRILVFIVEDG--KLQLIAEKETK 770
               S+    F S +   +  VG+A  V+       K  +  + + +G  +L++  + E  
Sbjct: 890  AAFSVAIVPFASAEKQSFLVVGSAVDVVLSPRSFKKAYLTTYRLINGGRELEVHHKTEID 949

Query: 771  GAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY-VQTRGDFIV 829
                 L  F G+LLA + + +++Y     D G ++L  +C +     A+  +  +G  IV
Sbjct: 950  DIPLVLRPFQGRLLAGVGKALRIY-----DLGKKKLLRKCENKSFPTAIVSLDAQGSRIV 1004

Query: 830  VGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR--KN 887
            VGD+ +SI    YK  E  +   A D    +++   +LD D    A+   NL+ +R   +
Sbjct: 1005 VGDMQESIVFTSYKPLENRLVTFADDVMPKFVTRCTMLDYDTVAAADKFGNLYVLRIDAD 1064

Query: 888  SEGATDEERGRLEVV-------GEYHLGEFVNRFRHGSLVMRLPDSDV--GQIPTVIFGT 938
            +  + DE+   + +V       G  H    +  +  G ++  L  + +  G    +++  
Sbjct: 1065 TSRSVDEDPTGMTIVHEKPVLMGAAHKATLLAHYFVGDIITSLNRTVMVPGGREVLMYTG 1124

Query: 939  VNGVIGVIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDG 997
            ++G IG +   +  E    L  LQT LR+    + G +H  +RS     K+V     +DG
Sbjct: 1125 ISGTIGALVPFVSKEDVDTLSTLQTQLRQENNSLVGRDHLAYRSSYAPVKSV-----IDG 1179

Query: 998  DLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            DL E+F  L   + + I++ ++    E+ K++ +L
Sbjct: 1180 DLCETFGLLQPAKQNAIAQELDRKPSEINKKLAQL 1214


>gi|452820919|gb|EME27955.1| splicing factor 3B subunit 3 [Galdieria sulphuraria]
          Length = 1294

 Score =  164 bits (415), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 203/870 (23%), Positives = 356/870 (40%), Gaps = 148/870 (17%)

Query: 295  NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIK-GMWSL 353
            +L P+  + + D   +G  Q+V  SG    GS + +R G    E A  EL G   G++++
Sbjct: 404  SLAPLTCWQIGDFYGEGSPQIVVGSGRRHRGSWKTLRLGYRWTEMAVSELPGYPVGVFTV 463

Query: 354  RSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVT 413
            R    D +D++++VSF++ + +L++   + +EE    GF +   T+F      N +VQV 
Sbjct: 464  REQQQDSYDSYIIVSFVNASLVLSVG--ETVEEVSDSGFSTFVSTIFVQLFSNNSIVQVH 521

Query: 414  SGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD-GILTEVK 472
            +  +R +    + + +EWK P G  + VA  NASQ+++      L+Y E+ + G+LTE++
Sbjct: 522  AQGIRHIRPDLQTV-HEWKPPKGLFIQVADGNASQLVIILNSKQLLYFELDETGMLTEIQ 580

Query: 473  HAQL----EYEIS----CLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHL 524
              +L     YE S    C+ I P+    S +   AV        R++SL   + +    L
Sbjct: 581  EVELIQDQSYESSSFLPCVSIAPLVAGQSKASFVAVSDGISYYTRLYSLKKESCLKPISL 640

Query: 525  GG-EIIPRSVLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQP 577
               + +  SVLL      +       LL     G  +  L++  TG L++++   +GTQP
Sbjct: 641  QALDTVANSVLLIGLSAPASSRSELVLLIGSHSGIYIRALVDPITGALSEQQSRLVGTQP 700

Query: 578  ITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYS-NVNLKEV-SHMCPFNSAAFPDSLAI 635
            + L+    +        S R  ++Y  N  LL +  +    V  H+  F S   PD +  
Sbjct: 701  VRLQRIEIQGKPVALVVSSRSYLVYLDNHLLLQTLPIGYDWVFDHVVGFCSEQCPDGVIA 760

Query: 636  AKEGELTIGTIDDIQKL-------HIRSIPLGEHPRRICHQEQSRTFAICSL-KNQSCAE 687
                 L I + ++ + L       H  + P    PRR+  +E   + +I +  +N+    
Sbjct: 761  CCGSSLCILSFEEDRDLFQTTHLFHSDTQPSLYTPRRLVDREDILSSSITNYTRNRILTL 820

Query: 688  ESEMHFVRLLDD-------------------------QTFEFIST--------------- 707
            E +   V    D                         Q   F+                 
Sbjct: 821  ECDQQIVISPQDIVSHSSSLRQTGYPLGPIGSWSCAIQQMTFMDNQFPNHEDPVESTSCR 880

Query: 708  -------YPLDTFEYGCSILSCSFS-DDSNVYYCVGTA--YVL----PEENEPTKGRI-- 751
                   Y  +T EY  SI    FS D S++Y CVG    Y L    P E  PTK  I  
Sbjct: 881  WQQKCEIYNSETKEYISSICCIRFSHDSSSLYLCVGVVMDYALQVTNPLEKNPTKDNIDI 940

Query: 752  ----LVF---------------------IVEDGKLQLIAEKETKGA-------------- 772
                +VF                      ++D     +      GA              
Sbjct: 941  QRESIVFPRSEIRVYRWNETQWEYIHTTPIQDASSHRMEWNYDYGASHLSFHAASRYRNE 1000

Query: 773  -VYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGH---HGHILALYVQTRGDFI 828
             + ++ +F G LL A+   +++Y     D G ++L  +  H     H +   ++T  D I
Sbjct: 1001 IITTMASFQGHLLVAVGTSLRMY-----DLGKKQLLKKTQHPRATPHKITC-IETCYDRI 1054

Query: 829  VVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNS 888
             + D+ +S+ L  Y   +      A DY   W + + +LD D     +   N+  +R   
Sbjct: 1055 FLSDVQESVFLYRYSAADNLFLCIADDYLPKWCTTMCLLDYDTVAIGDKMGNISILRLPP 1114

Query: 889  EGAT----DEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSD--VGQIPTVIFGTVNGV 942
            E  T    D   G L     +H  +    +  GS++  L   +   G +P + +GT++G 
Sbjct: 1115 EAGTFIEQDPTGGLLSKEAPHHF-QLEACYYVGSVIQCLSKVEWTTGDVPLLFYGTLDGA 1173

Query: 943  IGVIASLPHEQYL-FLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIE 1001
            IGV+  L     +   + L+  LR+    + G +H  +RS+         ++ +DGDL E
Sbjct: 1174 IGVMIPLRSTLDMELFQALELQLREYRSPLCGRHHLAYRSY-----FFPVRHVIDGDLCE 1228

Query: 1002 SFLDLSRTRMDEISKTMNVSVEELCKRVEE 1031
             F  LS  + ++I K ++ S+ ++ +++E+
Sbjct: 1229 EFYRLSLEQQEKIVKELDRSIVDVHRKLED 1258


>gi|323508292|emb|CBQ68163.1| probable splicing factor 3B subunit 3 [Sporisorium reilianum SRZ2]
          Length = 1221

 Score =  163 bits (412), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 254/1178 (21%), Positives = 477/1178 (40%), Gaps = 177/1178 (15%)

Query: 7    YGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTDN 65
            +G + +L  FR  G ++D+L + ++  +  +L++ + ++    +   +   R G R    
Sbjct: 62   FGVVRSLAAFRLTGGSKDYLIVGSDSGRIVILEYQSRTNSF-DKVHQETFGRSGSRRIVP 120

Query: 66   GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLK-----EAF--NIRLEELQVLDIKF 116
            GQ    DP  R  +IG      L  ++  D +  L      EA   N  +  +  +D+ F
Sbjct: 121  GQYLATDPKGRAAMIGAMEKAKLVYILNRDAQANLTISSPLEAHRPNGIIHHIIGVDVGF 180

Query: 117  ---LYGCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVE---------GPWSQNNLDNGA 164
               L+ C    + V Y ++      + +E A K   + E           WS+  +D+ +
Sbjct: 181  ENPLFAC----LEVDYSESDHDPSGRAFEEAEKALTYYELDLGLNHVVRKWSEP-VDSRS 235

Query: 165  DLLIPVPPPL----------CGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGS 214
            +LLI VP              GVL+  E+ I Y   +  +    R  I K    V+    
Sbjct: 236  NLLIQVPGGYNQNLEKWDGPSGVLVCSEDYITYKHQDQPEH---RVPIPKRLNPVEKSSE 292

Query: 215  R------------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISY 256
            R                  +L+    G L  + + H+ +++  LKI+      +AS ++ 
Sbjct: 293  RRGTLIVASVLHKMKNAFFFLVQSEDGDLFKVTMEHQDDEIRSLKIKYFDTVPVASGLAI 352

Query: 257  LDNAVVYIGSSYGDSQLIKLN------------------------------LQPDAKGSY 286
            L +  +++ S +G   L                                    P    + 
Sbjct: 353  LRSGFLFVASEFGPQLLYSFQKLGDDDDLPEYSSTDYDENGAGRRRPQLPTFTPRPLDNL 412

Query: 287  VEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG 346
            V+V E   +L PI+D   ++       Q+    G     S +++R+G+   E  S +L G
Sbjct: 413  VQVDE-MPSLDPILDAKPLNPLASDSPQIFAACGRGARSSFKMLRHGLEAQEAVSSDLPG 471

Query: 347  I-KGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAI 405
            +   +W+ + +  D  D+++++SF++ T +L++   + +EE    GF + + TL      
Sbjct: 472  VPSAVWTTKITRQDEHDSYIILSFVNGTLVLSIG--ETIEEVSDSGFLTSSSTLAVQQLG 529

Query: 406  YNQLVQVTSGSVRLVSSTSRELRNEWKSP--PG--YSVNVAT-ANASQVLLATGGGHLVY 460
             + L+QV    +R +    +++ NEW +P  P    +  VAT  N  QV++A     LVY
Sbjct: 530  EDALLQVHPHGIRHIL-VDKQI-NEWATPSLPNGRQTTIVATCTNERQVVVAFSSNELVY 587

Query: 461  LEIG-DGILTEVKHAQ-LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNL 518
             E+  DG L E +  + +   +  + +    E    +   AVG   D +VRI SL   + 
Sbjct: 588  FELDMDGQLNEYQERKAMGAAVLTMSMADCPEGRQRTPYLAVGC-DDSTVRIISLEPNST 646

Query: 519  ITKEHLGGEIIPRSVLLCA---------FEGISYLLCALGDGHLLNFLLNMKTGELTDRK 569
            +    +     P S +  A             +++   L +G LL  +L+  TG+LTD +
Sbjct: 647  LASISIQALTAPASSICMAEMHDATIDRNHATTFVNIGLQNGVLLRTVLDAVTGQLTDTR 706

Query: 570  KVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAF 629
               LG++ + L        + V A S R  + ++   +L +  +    + H   F++   
Sbjct: 707  TRFLGSKAVRLIRTKVHGQSAVMALSTRTWLSFTYQDRLQFVPLIFDALDHAWSFSAELC 766

Query: 630  PDSLAIAKEGELTIGTIDDI-QKLHIRSIPLGEHPRRICH-----------QEQSRTFAI 677
            P+ L       L I T+  +  KL   S+ L   PR+  H           + + RT + 
Sbjct: 767  PEGLIGIVGSTLRIFTMPSLASKLKQDSVALSYTPRKFAHHPNEQGLFYVVEAEHRTLSP 826

Query: 678  CSLKNQSCAEESEMH--------------------------FVRLLDDQTFEFISTYPLD 711
             + + ++   E E+                            VR++D    +      LD
Sbjct: 827  GAQRRRTEMLEKELKPHQRGVLDLNPAEFGLIRAEAGNWASCVRVVDGVQSQTTHKLELD 886

Query: 712  TFEYGCSILSCSF-SDDSNVYYCVGTAY-VLPEENEPTKGRILVF-IVEDGK-LQLIAEK 767
              E   S+    F S +      VG+A  V+       K  +  + +  +G+ L+++ + 
Sbjct: 887  DNEAAFSVAVVPFASAEKEAMLVVGSAVDVVLSPRSFKKAYLTTYRLTNNGRELEVLHKT 946

Query: 768  ETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY-VQTRGD 826
            E       L  F G+LLA I + +++Y     D G ++L  +C +     A+  +  +G 
Sbjct: 947  EVDDIPLVLRPFQGRLLAGIGKALRIY-----DLGKKKLLRKCENKSFATAIVSLDAQGS 1001

Query: 827  FIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR- 885
             IVVGD+ +SI    YK  E  +   A D    +++   +LD D    A+   N++ +R 
Sbjct: 1002 RIVVGDMQESIIFTSYKPLENRLVTFADDVMPKFVTRCAMLDYDTVAAADKFGNVYVLRI 1061

Query: 886  -KNSEGATDEERGRLEVV-------GEYHLGEFVNRFRHGSLVMRLPDSDV--GQIPTVI 935
              ++  + DE+   + +V       G  H    V  F  G +V  L  + +  G    ++
Sbjct: 1062 DADTSRSVDEDPTGMTIVHEKPVLMGAAHKATLVAHFFVGDIVTSLNRTVMVPGGREVLL 1121

Query: 936  FGTVNGVIGVIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNF 994
            +  V+G IG +   +  E    L  L+++LR+    + G +H  +RS          K+ 
Sbjct: 1122 YTGVSGTIGALVPFVSKEDVDTLSTLESHLRQENSSLVGRDHLAYRS-----SYAPVKSV 1176

Query: 995  LDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            +DGDL E+F  L   + + I+  ++    E+ K++ +L
Sbjct: 1177 IDGDLCETFGLLPPAKQNAIATELDRKPSEINKKLAQL 1214


>gi|320164105|gb|EFW41004.1| splicing factor 3B subunit 3 [Capsaspora owczarzaki ATCC 30864]
          Length = 1218

 Score =  162 bits (410), Expect = 8e-37,   Method: Compositional matrix adjust.
 Identities = 259/1190 (21%), Positives = 467/1190 (39%), Gaps = 200/1190 (16%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
            I+G +  L  FR  G ++D+L + ++  +  +L+++    E     +        R    
Sbjct: 60   IFGVVRCLTAFRLTGASKDYLVVGSDSGRVAILEYNPTKYEFERVHLETYGKSGSRRIVP 119

Query: 66   GQIGIIDPDCRLIGLHLYDG--LFKVIPFDNKGQLKEAFNIRLEELQVL---DIKFLYGC 120
            GQ    D   R I +   +   L  V+  D+  +L     +   +   +    +    G 
Sbjct: 120  GQYLAADAKGRAIMISAVEKQKLVYVMNRDSTARLTITSPLDAHKAHAVVFATVGLDVGF 179

Query: 121  AKPTIVVLYQDNKDA-------------RHVKTYEVALKDKDFVEGPWSQNNLDNGADLL 167
              PT   +  D ++A             +H+  YE+ L     V    S   L   A++L
Sbjct: 180  EAPTFACVELDYEEADNDPSGEAVEATQQHLVFYELDLGLNHVVRK--SSEPLPGWANML 237

Query: 168  IPVPPP---LCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHA-- 222
            I VP       GV++  E  +VY S       P  P+I     R    G R+ + D +  
Sbjct: 238  IQVPGADDGPGGVVVCCEGFLVYKS-------PEDPAIDV---RCPFPGRRHDISDPSKR 287

Query: 223  GLLHLLVITHEKEK---------------------------VTGLKIELLGETSIASTIS 255
            G    +V+ H   K                           V  ++++L     +A+++ 
Sbjct: 288  GAGRGIVVAHASHKTKTKFFFVVQLDNGDMFKVTLDVADDNVVSIRMKLFDTLPVATSLH 347

Query: 256  YLDNAVVYIGSSYGD------SQLIKLNLQPD-----------------AKG-----SYV 287
             L +  +++ +  GD      +QL + + +P+                 A G     ++V
Sbjct: 348  ILRSGHLFVAAELGDHHLYQITQLAENDDEPEFTTRGLVMNGRVVPSSVAHGGVELSAFV 407

Query: 288  EVLERYVNL---------GPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINE 338
                  +NL          P +   V DL  +   Q     G   + +LRI+R+G+ +++
Sbjct: 408  YTPRPLLNLVLLDVMESTAPTMQCRVEDLLGEDAPQFYLLCGRGPNSTLRILRHGLEVSQ 467

Query: 339  QASVELQGIK-GMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQ 397
             A+ EL      +WS++ S  D  DT+ +VS+ S T +L++   D++      G      
Sbjct: 468  LAATELTASPVAIWSVKRSIHDIHDTYFIVSWASATVVLSVG--DQVAPVSDSGLLLTVG 525

Query: 398  TLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGH 457
            T+       + L+QV +  +R + + SR   NEW++P    +  A  N  QV++A  GG 
Sbjct: 526  TIAVSRIGEDDLLQVYTDGIRHIRADSRV--NEWRTPGRRQIVRAAINDRQVVIALAGGE 583

Query: 458  LVYLEIG-DGILTE-VKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPD 515
            LVY E+   G L E  +      E+  L I P+  +   ++  AV    D +VRI SL  
Sbjct: 584  LVYFELDITGQLNEFAERFTSSAEVCALAIAPVPADRRRARFLAVAA-EDNTVRIVSLEL 642

Query: 516  LNLITKEHLGGEIIP-RSVLLCAFEGISY--------LLCALGDGHLLNFLLNMKTGELT 566
             +  T + LG + +  R+  LC F   S         L   L +G LL   +   +G LT
Sbjct: 643  SD--TLQSLGVQTVADRASSLC-FADPSLDNSSADLVLGVGLQNGILLRTSVEPSSGNLT 699

Query: 567  DRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNS 626
            D +   LGT+P+ L            A S RP + Y        + ++   +     F++
Sbjct: 700  DTRTRYLGTRPVMLFPSKVHGAAGFLALSSRPWIFYVQQGTPTLTPLSYDALDCAATFSA 759

Query: 627  AAFPDSLAIAKEGELTIGTIDDIQKL-HIRSIPLGEHPRRIC----HQ------------ 669
            A  P+ L       L +  +  +  + +  S+PL   PRR      H+            
Sbjct: 760  ANCPNGLVAIAGNTLRVLNLPRLGSVFNQTSMPLKYTPRRFIVDSEHRLLVIAESDHNTF 819

Query: 670  -------EQSRTFAICSLKNQSCAEE-SEMHF-------------VRLLDDQTFEFISTY 708
                   EQ+R  A  +    + AE   +  F             +R++D Q  E +   
Sbjct: 820  AAGRKAVEQARVAAETNTTQAAIAESLPDAVFGAPAAGAARWGSCLRIVDAQLRETLELI 879

Query: 709  PLDTFEYGCSILSCSF-SDDSNVYYCVGTAYVLPEENEPTKGRILV---FIVEDGKLQLI 764
             LD  E   S+ SCSF   +   ++ VGT   L  +    +G  L    F  +  +L  +
Sbjct: 880  ELDEDEGVFSMTSCSFHGHEGQTFFIVGTTKALNLQTRAHQGGCLYTYRFNPQTRRLDFV 939

Query: 765  AEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHG-HILALYVQT 823
               E +    ++  F G+LL  +   +++Y     D G R+L  +C +         + T
Sbjct: 940  HRTEVEDVPGAMYPFQGRLLVGVGSLLRVY-----DLGKRKLLRKCENRSIPNFVSSITT 994

Query: 824  RGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFT 883
             G  IVV D+ +S   L Y+  +  +   A D N  W+++  ++D D     +   N+F 
Sbjct: 995  SGGRIVVTDVQESFHFLRYRPSDNMLAVFADDSNPRWVTSSTMVDYDTVAAGDKFGNVFV 1054

Query: 884  VR-------------------KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLP 924
            +R                    +S+   +    + + +  +H+G+ V   +  +L+   P
Sbjct: 1055 LRLPQDLQDDLEDDPTGGRLLGSSKNTLNGAAQKADTIINFHVGDTVTTMQKTALI---P 1111

Query: 925  DSDVGQIPTVIFGTVNGVIGVIASLPHEQYL-FLEKLQTNLRKVIKGVGGLNHEQWRSFN 983
                     +++ T  G IGV+     +  + F + L+ ++R+    + G +H  +RS  
Sbjct: 1112 SGS----ECLVYTTTLGAIGVLIPFTTQSDIDFFKHLEMHMRQENPPICGRDHLAFRS-- 1165

Query: 984  NEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTM-NVSVEELCKRVEEL 1032
                   +KN +DGDL E F  L   +   I+  + + +  E+ K++E+L
Sbjct: 1166 ---HYFPSKNVIDGDLCEQFNSLDPHKKRLIAGDLADRTPSEVSKKLEDL 1212


>gi|47212603|emb|CAF93045.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 1171

 Score =  161 bits (408), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 235/1076 (21%), Positives = 422/1076 (39%), Gaps = 207/1076 (19%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            +L + ++G I +L  FR  G  +D++ I ++  +  +L++ + S  +  +   +   + G
Sbjct: 53   LLTMEVFGIIRSLMAFRLTGGTKDYIVIGSDSGRIVILEY-SPSKNMFEKIHQETFGKSG 111

Query: 61   -RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
             R    GQ   +DP  R  +IG      L  ++  D   +L  +  +   +   L    +
Sbjct: 112  CRRIVPGQFLAVDPKGRAVMIGAIEKQKLVYILNRDAAARLTISSPLEAHKANTLVYHVV 171

Query: 118  ---YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNG 163
                G   P    L  D ++A +  T E A   +    F E     N+        L+  
Sbjct: 172  GVDVGFENPMFACLEMDYEEADNDPTGEAAANTQQTLTFYELDLGLNHVVRKYSEALEEH 231

Query: 164  ADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR--- 215
             + LI VP     P   GVLI  E  I Y   N      IR  I +    +D D  R   
Sbjct: 232  GNFLITVPGGSDGP--SGVLICSENYITY--KNFGDQPDIRCPIPRRRNDLD-DPERGMI 286

Query: 216  --------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAV 261
                          +L     G +  + +  ++E VT ++++      +A+ +  L    
Sbjct: 287  FVCSATHKTKSMFFFLAQTEQGDIFKVTLETDEEMVTEIRLKYFDTIPVATAMCVLKTGF 346

Query: 262  VYIGSSYGDSQLIKLN------------------------LQPDAKGSYVEVLERYVNLG 297
            +++ S +G+  L ++                          QP    + V V E+  +L 
Sbjct: 347  LFVASEFGNHYLYQIAHLGDDDDEPEFSSAMPLEEGDTFFFQPRPLKNLVLVDEQE-SLS 405

Query: 298  PIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSS 356
            PI+   + DL  +   Q+    G     +LR++R+G+ ++E A  EL G    +W++R  
Sbjct: 406  PIMSCQIADLANEDTPQLYVACGRGPRSTLRVLRHGLEVSEMAVSELPGNPNAVWTVRRH 465

Query: 357  TDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGS 416
             +D FD +++VSF++ T +L++   + +EE    GF   T TL C     + LVQV    
Sbjct: 466  VEDEFDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSLLGDDALVQVYPDG 523

Query: 417  VRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKHA 474
            +R + +  R   NEWK+P   ++     N  QV++A  GG LVY E+   G L E  +  
Sbjct: 524  IRHIRADKR--VNEWKTPGKKTIVRCAVNQRQVVIALTGGELVYFEMDPSGQLNEYTERK 581

Query: 475  QLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPD---LNLITKEHLGGEIIPR 531
            ++  ++ C+ +  +      S+  AVG+  D +VRI SL     L  ++ + L  +  P 
Sbjct: 582  EMSADVVCMSLANVPPGEQRSRFLAVGL-ADNTVRIISLDPSDCLQPLSMQALPAQ--PE 638

Query: 532  SVLLCAFEGIS--------------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQP 577
            S+ +    G+               YL   L +G LL  +L+  TG+L+D +   LG++P
Sbjct: 639  SLCIVEMGGVEKQDELGEKASIGFLYLNIGLQNGVLLRTVLDPVTGDLSDTRTRYLGSRP 698

Query: 578  ITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAK 637
            + L     +    V A S R  + YS   +   + ++ + + +   F S   P+ +    
Sbjct: 699  VKLFRVRMQGQEAVLAMSSRSWLSYSYQSRFHLTPLSYETLEYASGFASEQCPEGIVAIS 758

Query: 638  EGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQE-----------QSRTFAICSLKNQS 684
               L I  ++ +  +  + + PL   PR+ + H E            + T A  + + Q 
Sbjct: 759  TNTLRILALEKLGAVFNQVAFPLQYTPRKFVIHPETNNLVLIESDHNAYTEATKAQRKQQ 818

Query: 685  CAEE-------------SEM-------------------------HFVRLLDDQTFEFIS 706
             AEE             +EM                           VRL++      + 
Sbjct: 819  MAEEMVEAAGEDERELAAEMAAAFLNENLPEAIFGAPKAGAGQWASLVRLVNPIQGSTLD 878

Query: 707  TYPLDTFEYGCSILSCSFSDDSNVYYC-VGTAY-VLPEENEPTKGRILVFIVEDG--KLQ 762
               L+  E   S+  C F +  + +Y  VG A  ++      T G I  + +  G  KL+
Sbjct: 879  QVQLEQNEAAFSVAVCRFPNTGDDWYVLVGVARDMILNPKSVTGGFIYTYRLVAGGEKLE 938

Query: 763  LIAEKETKGAVYSLNAFNGKLLAAINQKIQLY----KWMLR-------------DDGTRE 805
             + +   +    ++  F G++L  + + +++Y    K +LR             D+GT  
Sbjct: 939  FVHKTPVEDVPLAIAPFQGRVLVGVGKLLRIYDMGKKKLLRKCENKPARRRRFQDEGTTS 998

Query: 806  LQSECGHHG-----------------------------------HI--LALYVQTRGDFI 828
            L + C H G                                   HI  L   + T G  +
Sbjct: 999  L-AHCNHVGPLDGPSSDPGPQLGQLTASGGPMTTSSQPIRQSRQHIPNLVTGIHTTGQRV 1057

Query: 829  VVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTV 884
            +V D+ +S+  + Y+  E  +   A D    W++   +LD D    A+   N+  V
Sbjct: 1058 IVTDVQESLFWVRYRRNENQLIIFADDTYPRWVTTACLLDYDTMASADKFGNISIV 1113


>gi|168031491|ref|XP_001768254.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162680432|gb|EDQ66868.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 1391

 Score =  159 bits (401), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 220/990 (22%), Positives = 390/990 (39%), Gaps = 192/990 (19%)

Query: 200  PSITKAYGRVDADG-SRYLLGDHAGLLHLL-VITHEKEKVTGLKIELLG-ETSIASTISY 256
            P+IT       A+G SR  L    G +HL  +I    + V  ++++    + S  + + +
Sbjct: 378  PTITAWSWEPYAEGQSRLALAMDTGEIHLARLIFESPDDVPRIEVQQRQYKCSPCNVVLW 437

Query: 257  LDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVV 316
                ++ +    GD Q+++ +   + K  +  +++   N+ PI+DF + D   + Q Q+ 
Sbjct: 438  TKGGLLAVFVEMGDGQVLQCS--DNGKLIFKSLIQ---NVAPILDFSLADYHNEKQDQMF 492

Query: 317  TCSGAYKDGSLRIVRNGIGINE--QASVELQGIKGMWSLRSSTDDPFDTFLVVSFISETR 374
             CSGA  +GSLR++RNGI + +    S   QG+ G +++R    DP+  F V+SF+ ETR
Sbjct: 493  ACSGAGNEGSLRVIRNGISVEKLYTTSPIYQGVTGTYTMRMCCRDPYHAFFVMSFVQETR 552

Query: 375  ILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSR--------- 425
            +L++ L + ++ TE  GF     TL C       +VQV S  V +   T           
Sbjct: 553  VLSVGL-NFVDITEAVGFQPCASTLACGTIEDYHVVQVCSKEVIVCVPTKTAHPAGIDSP 611

Query: 426  -ELRNEWKSPPGYSVNV-ATANASQVLLATGGGHLVYLEIGDGI-----LTEVKHAQLEY 478
                + WK P G  V++ A A+ + VL  +  G +V L    G      L   +  +L+ 
Sbjct: 612  LPFCSSWKPPQGLVVSLGAVASKAIVLALSKPGLIVMLGSQRGANGALELCMTQQCELKA 671

Query: 479  EISCLDI---------------------NPIGENPS---YSQIAAVGMWTDISVRIFSL- 513
            E+SC+ I                      P   NPS     +I  VG   + SV + S+ 
Sbjct: 672  ELSCISIPDEEDWTSSPLPPSIVGLVEGTPKSRNPSGVEVGRICVVGT-HEPSVEVLSIV 730

Query: 514  --PDLNLITKEHLG---------GEIIPRSVLLCAFEGISYLLCALGDGHLLNF------ 556
                L  +   H+             +P SV L  F+ + Y+L  L +G LL +      
Sbjct: 731  PGEGLAPLAVGHISLVSCVGTTLSGCVPESVRLAQFDRL-YILAGLRNGMLLRYEWPASS 789

Query: 557  ---------LLNMKTGE---------------LTDRKKV--------SLGTQPITLRTFS 584
                     LL+    E               L D   V         +G  P++L +  
Sbjct: 790  TATLPDCTNLLSTSDWENIGITQPNLGGDKDVLEDSSPVLLHLVAVRRMGVSPVSLISLQ 849

Query: 585  SKNTTHVFAASDRPTVIYSS--NKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELT 642
            +  +  V A SDRP ++ ++  ++++ +++++    SH  P NS   P+ +    +  L 
Sbjct: 850  ASLSADVIALSDRPWLLQTARHSQRIAHTSISFPSSSHAAPVNSVDCPNGILFVADCSLH 909

Query: 643  IGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTF 702
            +  ++ +++L+++ +PLG  PRR+ +  +S+T  +  ++     +   +  V  +D  + 
Sbjct: 910  LVEMEHLKRLNVQKLPLGRTPRRVLYHTESKTLIV--MRTDYGPDGGLVSDVCCVDPLSG 967

Query: 703  EFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAY------VLPEENEPTKGRILVFIV 756
               S Y LD  E   SI    +         VGT+       +   E E  KGR+LVF +
Sbjct: 968  ANYSCYTLDAGEVARSIQ--LWKRRQEQLLLVGTSLIGGGGIMSSGEAESAKGRLLVFQL 1025

Query: 757  ------------------------------------------------EDGKLQLIAEKE 768
                                                            E  +L+L     
Sbjct: 1026 LSKHVGTHSQPVMSSTNTPTLSNQSTPGSSAADPMVLSESDESDLTDGEGWELRLKTHII 1085

Query: 769  TKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFI 828
              GAV S++++ G+ + A      L+    R D  + L+         +   +      I
Sbjct: 1086 LPGAVLSVSSYLGQYVLASAGNC-LFCLGFRPDSPQRLRRMAMVKTRFMITSLSVHLSRI 1144

Query: 829  VVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDD--IYLGAENNFNLFTVRK 886
             VGD    I    Y+   G +E          ++   ++D D  +      NF  F+   
Sbjct: 1145 AVGDCRDGILFYTYQEVSGQLELLYCGGIQQLVADCVLMDTDTAVVTDRRGNFCTFSSAS 1204

Query: 887  NSEG--------ATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLP---------DSDVG 929
              EG        +   ER  L +   YH+GE + R    S     P          +D  
Sbjct: 1205 TPEGDLNFFFAESVSPER-NLSLGCWYHIGETLMRIHKASFAYESPAEESMKNCGSNDAI 1263

Query: 930  QIPT---VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLR--KVIKGVGGLNHEQWRSFNN 984
              PT   V+  ++ G + +   +  E+Y  L+ +Q+ L    +   + G NHE +R    
Sbjct: 1264 AHPTHSSVVASSLLGSVFIFIKVTREEYDLLKAVQSRLAHYPITTPLLGNNHEDYR---G 1320

Query: 985  EKKTVDAKNFLDGDLIESFLDLSRTRMDEI 1014
            +         LDGD++  FL+L+  + + +
Sbjct: 1321 QGCPAGVCQVLDGDMLCQFLELTSAQQENV 1350


>gi|443922899|gb|ELU42250.1| splicing factor 3B subunit 3 [Rhizoctonia solani AG-1 IA]
          Length = 1212

 Score =  158 bits (400), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 262/1207 (21%), Positives = 467/1207 (38%), Gaps = 239/1207 (19%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
            ++G I ++  FR  G  +D+  + ++  +  VL++D +++               R    
Sbjct: 58   VFGTIRSIAAFRLTGGTKDYCIVGSDSGRIIVLEYDPKTNSFTKLQQETFGKSGSRRIVP 117

Query: 66   GQIGIIDPDCRLIGLHLYDG--LFKVIPFDNKGQLKEAFNIRLEE--------------- 108
            G++  +DP  R + +   +   L  V+  D    L  +  +                   
Sbjct: 118  GEMLAVDPKGRSVMISAVEKSKLVYVLNRDAAANLTISSPLEAHRNGAIIHAIVGVDVGF 177

Query: 109  ----LQVLDIKFLYGCAKPTIVVLYQDNKDARHVKT-YEVALKDKDFVEGPWSQNNLDNG 163
                   L++ +    A PT     Q  ++A  + T YE+ L     V    S    D  
Sbjct: 178  ENPLFAALEVDYTESDADPT----GQAFRNAEKMLTFYELDLGLNHVVRK--SSEPTDRR 231

Query: 164  ADLLIPVP------------PPLCGVLIIGEETIVYCSANA-FKAIPIRPSITKAYGRVD 210
            A+LL+ VP            P   GVL+  E+ I+Y   +    A+  R  I +      
Sbjct: 232  ANLLVQVPGGQSATTNAFDGP--SGVLVCCEDHIIYRPVDLDGGAVSHRVPIPRRRTPWG 289

Query: 211  ADGSRYLLGDHAGL-------LHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVY 263
             D  R L+   A +       L+ + + HE E V  + I+       A+++  L +  ++
Sbjct: 290  GDEERGLIITSAVMHKMKVCDLYKVTLEHEDEIVKSMTIKYFDTVPTATSLCILKSGFLF 349

Query: 264  IGSSYGDSQLIKLN-------------------------LQP--------------DAKG 284
            + +  G+  L +                           L P              D   
Sbjct: 350  VAAEGGNHHLYQFQKLGDDDAEPEFTSNSYSNNGISSSPLPPCYFRPHPLDNLVLADEMT 409

Query: 285  SYVEVLE-RYVNL-GP-----------------IVDFCVVDLERQGQG-QVVTCSGAYKD 324
            S+  +L+ R +NL GP                 IV + +V    Q    Q     G    
Sbjct: 410  SFCPILDARILNLFGPTSGGMSAFTQRSDEPSLIVSYRIVPATLQSDTPQFYLACGRGAR 469

Query: 325  GSLRIVRNGIGINEQASVELQGI-KGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDE 383
             S R++R+G+ + E  S EL G+  G+W+++ +  D +D ++++SF++ T +L++   + 
Sbjct: 470  SSFRMLRHGLEVEESVSSELPGVPNGVWTVKVNDGDQYDKYIILSFVNGTLVLSIG--ET 527

Query: 384  LEETEIEGFCSQTQTLFCHDAIYNQ-LVQVTSGSVRLVSSTSRELRN-----EWKSPPGY 437
            +EE    GF S   TL       N  L+QV  G VR V  +  +L +     EW++P G 
Sbjct: 528  IEEVADTGFLSSAPTLAVQQLADNGGLIQVHPGGVRHVRPSQGDLASTAGVTEWRAPQGR 587

Query: 438  SVNVATANASQVLLATGGGHLVYLEIG-DGILTEVKHAQ-LEYEISCLDIN--PIGENPS 493
             V  AT N  QV +A   G LVY E+  +G+L E    + +   + C+ I   P G  P+
Sbjct: 588  QVASATTNTRQVCVALNSGELVYFELDLNGVLQEYGEMRAVGSAVVCMSIAEVPEGRQPN 647

Query: 494  YSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSV--LLCAFEGISYLLCALGDG 551
             S               F  P ++L      G   + R V  L+      + ++ A G  
Sbjct: 648  GSN-------------HFLGPRVDLGNHLVTG---LSRHVWMLVVGLTNETSVVGAHGTS 691

Query: 552  HLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYS 611
            ++  F                LGT+P+ L   S +  T + A S R  + Y+    L +S
Sbjct: 692  NVYFF----------------LGTRPVRLTRVSVQRNTCILALSSRSWLNYTHQNLLRFS 735

Query: 612  NVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQ-KLHIRSIPLGEHPRRICHQE 670
             +  + + H   F++   P+ L       L I  I  +  KL   ++PL   PR+I    
Sbjct: 736  PLIYENLDHAWSFSAELCPEGLIGIAGSVLRIFQIPKLSDKLKQVTMPLSYTPRKIAVHP 795

Query: 671  QSRTFAICSLKNQSCAEESE----MHFVR--------LLD-------------------- 698
            + + F +    +++   E++       VR        L+D                    
Sbjct: 796  EHQLFYVIESDHRTWGSEAKNKRLAELVRAGRQIDQELVDLPAEDFGLPRAGAGQWASCI 855

Query: 699  ---DQTFEFIST----YPLDTFEYGCSILSCSFSDDSN-VYYCVGTAY---VLPEE--NE 745
               D T  F S       LD  E   S+    F+   N ++  VGTA    VLP +    
Sbjct: 856  RIIDPTEVFCSATLFKIELDNNESAFSVAVVPFAARENELFLVVGTAKDTNVLPRQCVGA 915

Query: 746  PTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRE 805
             T G ++        L    + E      +L    G+L A + + +++Y+      G ++
Sbjct: 916  VTSGSLVKLGWSTHILTRPIQTEVDDVPLALLGIKGRLCAGVGKALRIYEM-----GKKK 970

Query: 806  LQSECGHHGHILALYVQT-RGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAV 864
            L  +  + G   A+   T +G  I+VG++ +S+    YK E   +   A D +A W+++ 
Sbjct: 971  LLRKSENKGFATAIVTLTSQGSRIIVGEMQESVHYATYKPESNRLLVFADDTSARWVTSA 1030

Query: 865  EILDDDIYLGAENNFNLFTVR---KNSEGATDEERG---------------RLEVVGEYH 906
             ++D D     +   N+F  R     S+   D+  G               + +++  Y+
Sbjct: 1031 ALVDYDTVAVGDKFGNIFVNRLPANISQQVDDDPTGAGIMHEREFLHGAPHKTKLLAHYN 1090

Query: 907  LGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYL-FLEKLQTNLR 965
            +G+ V      +LV        G    V +  ++G IGV+  L  ++ + F+  L+ ++R
Sbjct: 1091 VGDIVTSVHRAALVP-------GGRDVVAYTGLHGTIGVLIPLASKEDVDFITTLEQHMR 1143

Query: 966  KVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEEL 1025
                 + G +H  +R +      V  K  +DGDL E F  L  T+   I+  ++ +V E+
Sbjct: 1144 SEHSSLVGRDHLAYRGY-----YVPVKAVVDGDLCERFAMLPSTKQKSIAGELDRTVGEV 1198

Query: 1026 CKRVEEL 1032
             K++E L
Sbjct: 1199 LKKLEGL 1205


>gi|302822731|ref|XP_002993022.1| hypothetical protein SELMODRAFT_136264 [Selaginella moellendorffii]
 gi|300139222|gb|EFJ05968.1| hypothetical protein SELMODRAFT_136264 [Selaginella moellendorffii]
          Length = 1277

 Score =  157 bits (396), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 196/853 (22%), Positives = 345/853 (40%), Gaps = 135/853 (15%)

Query: 269  GDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLR 328
            GD  ++K+    D + S+   ++   N+ PI+DF +VD   + Q Q+  C G  ++GS+R
Sbjct: 402  GDGHVLKVE---DGQLSFQSFVQ---NIAPILDFSLVDYYGEKQDQMFACCGGDEEGSVR 455

Query: 329  IVRNGIGINEQASVE--LQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEE 386
            I+RNG  + +        QG+ G+W++R    DP+  F ++SF+ ETR+L++ L + ++ 
Sbjct: 456  IIRNGNSVEKLICTPPVYQGVSGIWTMRYRFKDPYHAFFLISFVEETRVLSVGL-NFVDI 514

Query: 387  TEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSST----------SRELRNEWKSPPG 436
            T+  GF SQ  TL C       + QV    V+L S T          S  L   W+  PG
Sbjct: 515  TDAVGFESQVNTLACGLVEDGWVAQVWRYEVKLCSPTKAAHPAGVSGSSPLSTTWRK-PG 573

Query: 437  YSVNVATANASQVLLATGGGHLVYLEIGDGI-------LTEVKHAQLEYEISCLDINPIG 489
            Y ++V     S+V+LA     L+ L +G          L E++   +E EISC+ I P G
Sbjct: 574  YPISVGAVCRSRVILALARPGLL-LMLGTTQTSAESFELVELQLCMMEAEISCISI-PQG 631

Query: 490  E--NPSYSQIAAVGMWTDISVRI----------------------------FSLPDLNLI 519
            +   P    IA +     +   +                            F++  ++LI
Sbjct: 632  DISIPVPPTIAGLHAGNTVPAGVDLGNVCVVGTHKPSVELLSIVPGDKFAPFAVGQVSLI 691

Query: 520  TKEHLGGEI---IPRSVLLCAFEGISYLLCALGDGHLLNF--------LLNMKTGELTDR 568
            T   +G  +   IP+ V L  F+ + Y+L  L +G LL +        L+  K  EL   
Sbjct: 692  TS--VGTAVSGCIPQDVRLALFDRL-YILAGLRNGMLLRYEWPEDTPSLVLSKPAELHLI 748

Query: 569  KKVSLGTQPITLRTF-SSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSA 627
                LG  P+ L    +      + A SDRP ++  + K++ Y++++ +  +H  P  S 
Sbjct: 749  AARRLGVSPVCLVPLEACALRADIIALSDRPWLLQMA-KRISYTSISFQPSTHATPVCSK 807

Query: 628  AFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQ---- 683
              P  +    +  L +  ++  + L+++ + LG   RR+ +  +S    +  L ++    
Sbjct: 808  DCPKGIIFLADCSLHLVEMEQSRTLNVQKLRLGCTGRRVLYHPESGVLIVLRLLSEHRSD 867

Query: 684  -SCAEESE-----------------MHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFS 725
              C E                    M  ++L D+Q    + T   DT     ++++   +
Sbjct: 868  VCCIEPLSGAVLCVHPFGVGQIVKCMELMKLGDEQLL-LVGTAS-DTRR---AVMATGEA 922

Query: 726  DDSNVYYCVGTAYVLPEEN--------------------------EPTKGRILVFIVEDG 759
            +    Y+CV      P  +                          E + G  +VF  +D 
Sbjct: 923  ERQAFYFCVSNTGYFPSSSRGVLVVLYLDAPPPPSPHSPMSSPASESSGGASIVFQPDDY 982

Query: 760  KLQLIAEKETKGAVYSLNAFNGK-LLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILA 818
                 A     G V ++ ++ G+ +LA     +        D+  R  +         + 
Sbjct: 983  CFVPRANVGLPGPVNAVASYLGQYVLACAGNHLFCLGIASMDESPRRWKKLASIKTRFVI 1042

Query: 819  LYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGA--E 876
              +  R   I VGD    + L  Y+ +   +E    D     +S   ++D D  +     
Sbjct: 1043 TSISVRFTTIAVGDCRDGVLLFTYREDSKKLEPIRCDPMRRLVSDCTLVDVDTAVVVDRH 1102

Query: 877  NNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFR--HGSLVMRLPDSDVGQIPTV 934
             NF   +  + +EG    E+  LE    +H+GE     R    S  +    S    +  V
Sbjct: 1103 GNFCALSTNEETEGNGSPEK-NLEAHCWFHIGEVCTTVRKVRTSRFLCKDTSRECSVSCV 1161

Query: 935  IFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNF 994
            I  T+ G + +   +  E+Y  L+ LQ  L  +      L ++  R F  + +    K  
Sbjct: 1162 IATTLLGSVFIFVRITGEEYSLLQALQRRLSFLPATAPVLGNDHAR-FRGQGRPAGVKEV 1220

Query: 995  LDGDLIESFLDLS 1007
            LDGDL+E FL+L+
Sbjct: 1221 LDGDLLEQFLELT 1233


>gi|281202530|gb|EFA76732.1| CPSF domain-containing protein [Polysphondylium pallidum PN500]
          Length = 933

 Score =  156 bits (395), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 134/577 (23%), Positives = 254/577 (44%), Gaps = 88/577 (15%)

Query: 304 VVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSLRSSTDDPFD 362
           V+DL  + Q Q+   +G     ++R++R G+ I + A   L GI  G+W++  S +D  D
Sbjct: 246 VMDLVNEEQPQIYALTGVADRSAMRVLRYGLPIAQIAGTPLPGIPSGLWTIPRSQEDSID 305

Query: 363 TFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSS 422
            ++VVSFI  T +L++   + +EE    G  + T ++       + ++QV    +R +  
Sbjct: 306 KYIVVSFIGSTLVLSVG--ETVEEVVDSGILATTTSILVRPIGADSIIQVFPHGIRHIKG 363

Query: 423 TSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTEVKHAQLEYEIS 481
             R   NEW++P   ++++AT+NA+QV++A GGG ++Y E+   G L+EV   +   EIS
Sbjct: 364 DRR--INEWRAPGRKTISLATSNATQVVIALGGGEVIYFELDAAGNLSEVAKKEFRREIS 421

Query: 482 CLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITK------EHLGGEIIPRSVLL 535
            L+I P+ +    ++  A+G W +  VR+ SL   NL+ +      E +  E +  + + 
Sbjct: 422 ALEIAPVRKGRQMARFVAIGDW-EGPVRVLSLDKDNLLNQVSVLDTERMHIESLMMNEMT 480

Query: 536 CAFE-----------------------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVS 572
              E                        + +L   L +G +   +++  TGEL+D +   
Sbjct: 481 IGVEMPEGTASSSSAIAKQQQLQNQSNQVLFLNVGLKNGVMKRAVVDPITGELSDTRTRL 540

Query: 573 LGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDS 632
           LG +P+       K+   V A S R  + Y +  +     V+++ + +   F+S   PD 
Sbjct: 541 LGRKPVKFSRVKLKSGNAVLALSSRVWLCYVNQGRYDMVPVSVEPLDNASGFSSEHCPDG 600

Query: 633 LAIAKEGELTIGTIDDI-QKLHIRSIPLGEHPRR-ICHQEQSRTFAICSLKN-------- 682
           +    E  L I  I+ + +K +  ++P    PRR + H +     +I +  N        
Sbjct: 601 IVATSENNLKIFVIEKLSEKFNQVNVPYKCTPRRFVVHPQTHYIVSIETEHNYLNELPTP 660

Query: 683 -----QSCAEESE---------------------------------MHFVRLLDDQTFEF 704
                   AE+ +                                 + ++R+L+  ++  
Sbjct: 661 KENGSNKMAEDKDPEVKREGDDMDYEVDQFSLNQNDKPPKAGDGKWLSYIRVLEPVSYRT 720

Query: 705 ISTYPLDTFEYGCSILSCSFSD-DSNVYYCVGTAYVL---PEENEPTKGRILVFIVEDGK 760
           +    LD  E   S+ +C F D +  V+  VG    +   P + E     +  F      
Sbjct: 721 LDLVKLDQDEAAYSLATCIFHDREGEVFLAVGCGKGVQLNPRKVESASIHLYRFTNNGQT 780

Query: 761 LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWM 797
           LQL+ + E +   Y+++ F G+LL  I  ++++Y+ +
Sbjct: 781 LQLVYKTEVEEVPYAISHFQGRLLVGIANQLRIYEMV 817



 Score = 52.4 bits (124), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 39/140 (27%), Positives = 67/140 (47%), Gaps = 15/140 (10%)

Query: 896  RGRLEV--VGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIAS-LPHE 952
            +GRL V    +  + E VN     SL        VG    +++ T+NG IG +   +  E
Sbjct: 800  QGRLLVGIANQLRIYEMVNHISKTSL-------SVGGPEVLVYATLNGTIGALVPFVSRE 852

Query: 953  QYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMD 1012
               F   L+  +R+    + G +H  +RS+         KN +DGDL E ++ L  T+  
Sbjct: 853  DVDFYTSLELQMRQENPPLCGRDHLAYRSY-----YFPVKNVIDGDLCEQYISLDPTKQQ 907

Query: 1013 EISKTMNVSVEELCKRVEEL 1032
             I++ ++ S  E+ K++E+L
Sbjct: 908  SIAEELSRSPSEILKKLEDL 927


>gi|399216895|emb|CCF73582.1| unnamed protein product [Babesia microti strain RI]
          Length = 1232

 Score =  155 bits (391), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 249/1215 (20%), Positives = 462/1215 (38%), Gaps = 233/1215 (19%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
            +YG I  +  FR  G ++D+L + ++  +  +L+++   +    R   +   + G    N
Sbjct: 58   VYGIIRCIAPFRLTGASRDYLVVTSDSGRIVILEFNTAKNNF-DRVHCETYGKSGNNGYN 116

Query: 66   --------GQIGIIDPDCRLIGLHLYD--GLFKVIPFDNKGQLKEAFNIRLEE------- 108
                    G    +DP  R + L   +      ++  DNK  L  +  +   +       
Sbjct: 117  LGNRRIVPGHFLGVDPKGRALMLAALERQKFVYILNRDNKSNLTISSPLEAHKSHSICHY 176

Query: 109  LQVLDIKFLYGCAKPTIVVLYQD----NKDARHVKTYEVALKDKDFVEGPWSQNNL---- 160
            L  +D+ F      P  V + Q+    N D       + + K   F E     N++    
Sbjct: 177  LTGIDVGF----DNPIFVTIEQNYATVNGDLSSSSVGDESKKYLSFWEMDLGLNHVTKKI 232

Query: 161  ----DNGADLLIPVPPPLCG-------VLIIGEETIVYCSANAFKAIPIRPSITKAYGRV 209
                D  A +L+P+    CG       VL+  E  ++Y         P  P +  AY R 
Sbjct: 233  SMQTDATAHMLVPI----CGGTDGPGGVLVCCENYLIYTK-------PDHPELCCAYPRR 281

Query: 210  DADGSRYLLGDHAGLLHLLV----ITHEKEKVTGLKIELLGE--------------TSIA 251
                    +  +A ++H +V    I  + E    LK+E++                  IA
Sbjct: 282  LECSQDTGILINAWVVHKMVDFFFILLQTEYGDLLKVEIVSNDQTVKEIVCRYFDTVPIA 341

Query: 252  STISYLDNAVVYIGSSYGDS---QLIKL---------NLQPDAKGSYVE----------V 289
            +++  L    +++GS +G+    Q I L         +L P+   + V           +
Sbjct: 342  NSLCVLKTGYLFVGSEFGNHCFYQFISLGNDETTACTSLHPEGGNAIVAFNPRSPKNLLL 401

Query: 290  LERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-K 348
            ++   +L P+VD  + D       ++    G     + +I+R GI + E A+ EL G  +
Sbjct: 402  VDELTSLSPVVDMKIADARGLDTSEIYLACGRGSRSTFQILRPGISVEELANNELPGYPR 461

Query: 349  GMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYN- 407
             ++SL+    D    +++V F  E   L +++ D +EE     F ++T TL C   +Y+ 
Sbjct: 462  YVFSLKDKNADDCVGYIIVVF--EGNTLVLSVGDAVEEVSNSFFNTETTTL-CAMLMYDD 518

Query: 408  QLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDG- 466
              +Q+  G +R +        +EWK P    +   +AN  Q+L+A  GG ++Y EI +  
Sbjct: 519  SFIQIHEGGIRHIIDNHV---SEWKPPTSKRIKCCSANTRQILIALSGGEVIYFEIDESH 575

Query: 467  ILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPD---LNLITKEH 523
             L E+       +I+C+ I  +  +  YS   A+    ++ V++ SL     L  ++ + 
Sbjct: 576  ALVEIFKRNFGVDITCMAIQAVPTDRVYSSFGAISGLDNV-VKLISLEKEKGLKQLSTQL 634

Query: 524  LGGEIIPRSVLLCAFEGIS------------YLLCALGDGHLLNFLLNMKTGELTDRKKV 571
            L       SV +   + +             +L+  L  G ++   ++  TG L+D+   
Sbjct: 635  LPNNATAESVCIAQIDSLVRDAGHMRRNSSLFLVVGLNTGVMIRSNIDAITGALSDQHSR 694

Query: 572  SLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPD 631
             LG + +        +     A SD+  + Y+   KL  S +N   V H+  F S+   D
Sbjct: 695  FLGARAVRFSLVKVGDGMGFMAMSDKSWLCYAHQGKLFTSYINYDMVEHVASFCSSHCSD 754

Query: 632  SLAIAKEGELTIGTIDDIQKLHIRSI-PLGEHPRRI------------------------ 666
                     L I    ++ +   +S+ PL   PR+I                        
Sbjct: 755  GFVAISGNSLRIYRCLNLGQEFSQSVAPLTYTPRKISILPSLSPITDNGTETGVTKNRHM 814

Query: 667  ---------CHQEQSRTFAICSLKN----QSCAEESEMHF-----------VRLLDDQTF 702
                      + E +RT     L+N       AE+ E+             VR++D  T 
Sbjct: 815  LAIVECDHNTYDELTRTEIKKGLENIMPEGEQAEDVELGLYKAGEGKWGSCVRIIDPTTL 874

Query: 703  EFISTYPLDTFEYGCSILSCSFSDDSNVYYCVG----TAYVLPEENEPTKGRILVFIVED 758
               +   LDT E   S  +C    D   Y C+     T + L   N  +    +     +
Sbjct: 875  STAAKLLLDTDEAAISCCAC----DLEGYRCLAVGTVTGWNLANSNSNSCHIRMYAYGPN 930

Query: 759  GKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHIL- 817
             ++  +   +  G   +L A+ G+LLA +   + LY       G R+L  +  + G ++ 
Sbjct: 931  FEITFLHSTKVTGIPRALLAYEGRLLAGVGPDVILYAL-----GKRQLLKKAEYRGGVID 985

Query: 818  -------------------ALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNA 858
                                +++   G+ I VGD+ +SI++L +  +   +     D   
Sbjct: 986  IQGYGVATPRTIGNGGLFGVMWLGASGNRIFVGDIRESITVLKFDEQMAKLSLICDDIRP 1045

Query: 859  NWMSAVEILDDDIYLGAENNFNLFTV-RKNSEGATDE-----ERGRLEVV---------- 902
             W++   +LD    +   + F+ F V R  SE +          G LE V          
Sbjct: 1046 RWITGATVLDHHT-VALVDKFDTFAVCRVPSEASASNLSSALNSGSLEAVMPTILGVGNK 1104

Query: 903  ----GEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIG-VIASLPHEQYLFL 957
                 ++HLG+        +L         G    V++ T+ G IG +I  +  ++   L
Sbjct: 1105 FEQEAQFHLGDLSTCIDKVTLCS-------GCTEAVVYATILGSIGALIPFISSDELDTL 1157

Query: 958  EKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKT 1017
            + L+  +      + G  H  +RS+    + V     +DGDL E F  L       I+  
Sbjct: 1158 QHLELLMANENPPLSGREHSIYRSYYGPVQHV-----IDGDLCEEFESLDSITQSRIAAK 1212

Query: 1018 MNVSVEELCKRVEEL 1032
            ++ +V E+ K++ ++
Sbjct: 1213 IDKTVTEIIKKLRDI 1227


>gi|340367933|ref|XP_003382507.1| PREDICTED: splicing factor 3B subunit 3-like isoform 1 [Amphimedon
           queenslandica]
          Length = 1214

 Score =  154 bits (390), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 173/770 (22%), Positives = 333/770 (43%), Gaps = 90/770 (11%)

Query: 1   MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
           ++   ++G I +L  FR  G  +D++ + ++  +  +L++D   + +  +   +   + G
Sbjct: 53  LVSTEVFGLIRSLITFRLTGSTKDYIVLGSDSGRIVILEYDPVKN-VFEKVHQETYGKSG 111

Query: 61  -RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVL----- 112
            R    GQ   +DP  R  +IG      L  ++  D+  +L  +  +   +   L     
Sbjct: 112 CRRIVPGQYLAVDPKGRALMIGAVEKQKLVYILNRDSAARLTISSPLEAHKSHTLVYHVV 171

Query: 113 --DIKF---LYGCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNN-------- 159
             D+ F   L+ C +    + Y+++ +    +  +VA ++  + E     N+        
Sbjct: 172 GVDVGFENPLFACLE----LDYEESDNDHTGEAAQVAQQNLTYYELDLGLNHVVRKYSEP 227

Query: 160 LDNGADLLIPVPPPL---CGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVD-ADGSR 215
           LD+ A++LI VP       GVL+  E  I+Y   N      IR  I +    +D AD S 
Sbjct: 228 LDDMANMLITVPGGTDGPSGVLVCCENYIIY--KNFGDQPDIRCPIPRRQNDLDDADRSM 285

Query: 216 ---------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNA 260
                          +LL    G +  +++  E + V  +KI+      + S++  L + 
Sbjct: 286 LIITSAMHKTKHLFFFLLQSEQGDIFKVMLDVEDDIVVRIKIKYFDTVPVCSSLCVLKSG 345

Query: 261 VVYIGSSYGDSQLIKL-NLQPDAKGSY---------------------VEVLERYVNLGP 298
           +++  S +G+  L ++ +L  D++  +                     + +++   +L P
Sbjct: 346 LLFCSSEFGNHYLYQIAHLGDDSEREFHSDYPLEEGETFFFGPRPLKNLILVDEMESLSP 405

Query: 299 IVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSST 357
           I+   + DL  +   Q+    G     SLR++R+G+ + E A  EL G    +W+++  +
Sbjct: 406 IMSCQIADLANEDTPQLYAACGRGPRSSLRVLRHGLEVTEMAVSELPGNPHAVWTVKKDS 465

Query: 358 DDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSV 417
            + FD+++VVSF++ T IL++   + +EE    GF   T TL C     + L+Q+    +
Sbjct: 466 KEDFDSYIVVSFMNATLILSIG--ETVEEVTDSGFLGTTPTLACSQLGDDALIQIYPEGI 523

Query: 418 RLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD-GILTE-VKHAQ 475
           R + S  R   NEW+SP    +     N  QV++A  GG +VY E+   G L E  +  +
Sbjct: 524 RHIRSDKRV--NEWRSPGKRLIRQCAVNEHQVVIALSGGEIVYFEMDQSGQLNEYTERKE 581

Query: 476 LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPRSVL 534
           +  E+ C+ +  +       +  AVG+ +D +VRI SL P   L          +P S+ 
Sbjct: 582 MTAEVICISLGSVPPGQQRCRFLAVGL-SDQTVRIISLDPHDTLQPLSMQALPALPESLC 640

Query: 535 LCAFEG-------IS----YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTF 583
           +    G       +S    +L   L +G LL   L+  TG+L+D +   LGT+P+ L   
Sbjct: 641 IVNMSGNVSDDTTVSTGGLFLNIGLQNGVLLRTALDNVTGDLSDTRTRYLGTRPVKLFRV 700

Query: 584 SSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTI 643
             + T  V A S R  + Y+   +   + ++   + +   F S   P+ +       L I
Sbjct: 701 RIQGTEGVIAVSSRTWLNYTYQSRFHLTPLSYDLLEYASSFTSEQCPEGMVAISSNTLRI 760

Query: 644 GTIDDIQKLHIR-SIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMH 692
             ++ +  +  + S PL   PR++          +    + +  E +++H
Sbjct: 761 LALEKLGVVFNQVSTPLQYTPRKLLIHPPMSNLVLIETDHNAFTEATKLH 810



 Score = 80.1 bits (196), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 81/345 (23%), Positives = 157/345 (45%), Gaps = 44/345 (12%)

Query: 714  EYGCSILSCSFSDDSNV--YYCVGTAY-VLPEENEPTKGRILVF-IVEDG-KLQLIAEKE 768
            E   S+  C F    ++  +  VGTA  ++      + G ++VF +  DG KL+ +   +
Sbjct: 882  EAAFSLAVCQFVSKGDLEWFVVVGTAKDMIITPRAISSGSLIVFRLSPDGSKLEHVHTTQ 941

Query: 769  TKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSEC-GHHGHILALYVQTRGDF 827
                  ++  F G+LL  + + +++Y     D G +++  +C   H   L + ++  G  
Sbjct: 942  LDDVPIAMAPFQGRLLVGVGKLLRIY-----DIGKKKMLRKCENKHLPYLVVDIKVMGRR 996

Query: 828  IVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKN 887
            + VGD+ +++  L Y+  E  +   A +    + +   ILD +    A+   N+  +R  
Sbjct: 997  VYVGDVQEAVHFLYYRPHENQLVIFADEVVPRFCTTSCILDYNTVASADKFGNITILRLP 1056

Query: 888  SEGATDE------------ERGRLE-------VVGEYHLGEFVNRFRHGSLVMRLPDSDV 928
            S+  TD+            +RG L        V+  YH+GE +N     SL+        
Sbjct: 1057 SD-VTDQVDEDPSGSRSLWDRGFLNGATQKANVMTSYHVGEGINTLHKVSLI-------P 1108

Query: 929  GQIPTVIFGTVNGVIGVIASLP-HEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKK 987
            G    +++ T++G IG++      E   F + L+ ++R     + G +H  +RS+     
Sbjct: 1109 GGSEVLVYTTLSGSIGILVPFSSKEDSDFFQHLEMHMRSEWSNLVGRDHLSFRSY----- 1163

Query: 988  TVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
             V  K+ +DGDL E +  L  ++  EI+  ++ S  E+ K++E+L
Sbjct: 1164 YVPVKSVIDGDLCEVYNSLDPSKRREIALDLDRSPSEVAKKLEDL 1208


>gi|432863837|ref|XP_004070177.1| PREDICTED: splicing factor 3B subunit 3-like [Oryzias latipes]
          Length = 1146

 Score =  154 bits (388), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 207/948 (21%), Positives = 382/948 (40%), Gaps = 157/948 (16%)

Query: 1   MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
           +L + ++G + +L  FR  G  +D++ + ++  +  +L++   S  +  +   +   + G
Sbjct: 53  LLTMEVFGIVRSLMAFRLTGGTKDYIVVGSDSGRIVILEYHP-SKNMFEKIHQETFGKSG 111

Query: 61  -RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
            R    GQ   +DP  R  +IG      L  ++  D   +L  +  +   +   L    +
Sbjct: 112 CRRIVPGQYLAVDPKGRAVMIGAIEKQKLVYILNRDAAARLTISSPLEAHKANTLVYHVV 171

Query: 118 ---YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNG 163
               G   P    L  D ++A +  T E A   +    F E     N+        L+  
Sbjct: 172 GVDVGFENPMFACLEMDYEEADNDPTGEAAANTQQTLTFYELDLGLNHVVRKYSEALEEH 231

Query: 164 ADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR--- 215
            + LI VP     P   GVLI  E  I Y   N      IR  I +    +D D  R   
Sbjct: 232 GNFLITVPGGSDGP--SGVLICSENYITY--KNFGDQPDIRCPIPRRRNDLD-DPERGMI 286

Query: 216 --------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAV 261
                         +L     G +  + +  ++E VT ++++      +A+ +  L    
Sbjct: 287 FVCSATHKTKSMFFFLAQTEQGDIFKVTLETDEEMVTEIRLKYFDTIPVATAMCVLKTGF 346

Query: 262 VYIGSSYGDSQLIKLN------------------------LQPDAKGSYVEVLERYVNLG 297
           +++ S +G+  L ++                          QP    + V V E+  N+ 
Sbjct: 347 LFVASEFGNHYLYQIAHLGDDDEEPEFSSAMPLEEGDTFFFQPRPLKNLVLVDEQE-NMS 405

Query: 298 PIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSS 356
           PI+   + DL  +   Q+    G     +LR++R+G+ ++E A  EL G    +W++R  
Sbjct: 406 PIMSCQIADLANEDTPQLYVACGRGPKSTLRVLRHGLEVSEMAVSELPGNPNAVWTVRRH 465

Query: 357 TDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGS 416
            +D FD +++VSF++ T +L++   + +EE    GF   T TL C     + LVQV    
Sbjct: 466 VEDEFDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSLLGEDALVQVYPDG 523

Query: 417 VRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKHA 474
           +R + +  R   NEWK+P   ++     N  QV++A  GG LVY E+   G L E  +  
Sbjct: 524 IRHIRADKR--VNEWKTPGKKTIVRCAVNQRQVVIALTGGELVYFEMDPSGQLNEYTERK 581

Query: 475 QLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPD---LNLITKEHLGGEIIPR 531
           ++  ++ C+ +  +      S+  AVG+  D +VRI SL     L  ++ + L  +  P 
Sbjct: 582 EMSADVVCMSLANVPPGEQRSRFLAVGL-VDNTVRIISLDPSDCLQPLSMQALPAQ--PE 638

Query: 532 SVLLCAFEGIS--------------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQP 577
           S+ +    G+               YL   L +G LL  +L+  TG+L+D +   LG++P
Sbjct: 639 SLCIVEMGGVEKQDELGEKGTMGFLYLNIGLQNGVLLRTVLDPVTGDLSDTRTRYLGSRP 698

Query: 578 ITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAK 637
           + L     +    V A S R  + YS   +   + ++ + + +   F S   P+ +    
Sbjct: 699 VKLFRVRMQGQEAVLAMSSRSWLSYSYQSRFHLTPLSYETLEYASGFASEQCPEGIVAIS 758

Query: 638 EGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQE-----------QSRTFAICSLKNQS 684
              L I  ++ +  +  + + PL   PR+ + H E            + T A  + + Q 
Sbjct: 759 TNTLRILALEKLGAVFNQVAFPLQYTPRKFVIHPETNNLVLIETDHNAYTEATKAQRKQQ 818

Query: 685 CAEE-------------SEM-------------------------HFVRLLDDQTFEFIS 706
            AEE             +EM                           VRL++      + 
Sbjct: 819 MAEEMVEAAGEDERELAAEMAAAFLNENLPESIFGAPKAGAGQWASLVRLVNPIQGTTLD 878

Query: 707 TYPLDTFEYGCSILSCSFSDDSNVYYC-VGTAY-VLPEENEPTKGRILVFIVEDG--KLQ 762
              L+  E   S+  C F +  + +Y  VG A  ++      + G I  + +  G  KL+
Sbjct: 879 QVQLEQNEAAFSVAVCRFLNTGDDWYVLVGVARDMILNPRSVSGGFIYTYRLTGGGEKLE 938

Query: 763 LIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSEC 810
            + +   +    ++  F G++L  + + +++Y     D G ++L  +C
Sbjct: 939 FVHKTPVEDVPLAIAPFQGRVLIGVGKLLRIY-----DLGKKKLLRKC 981



 Score = 56.2 bits (134), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 34/136 (25%), Positives = 70/136 (51%), Gaps = 13/136 (9%)

Query: 898  RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLF 956
            + EVV  YH+GE V   +  +L+        G   ++++ T++G IG++     HE + F
Sbjct: 1017 KAEVVVNYHVGETVLSLQKTTLI-------PGGSESLVYTTLSGGIGILVPFTSHEDHDF 1069

Query: 957  LEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISK 1016
             + L+ ++R     + G +H  +RS+         KN +DGDL E F  +   +   +++
Sbjct: 1070 FQHLEMHMRSEFPPICGRDHLSFRSY-----YFPVKNVIDGDLCEQFNSMDTHKQKSVAE 1124

Query: 1017 TMNVSVEELCKRVEEL 1032
             ++ +  E+ K++E++
Sbjct: 1125 ELDRTPPEVSKKLEDI 1140


>gi|444722328|gb|ELW63026.1| Splicing factor 3B subunit 3 [Tupaia chinensis]
          Length = 840

 Score =  153 bits (387), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 196/905 (21%), Positives = 367/905 (40%), Gaps = 162/905 (17%)

Query: 216  YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIK 275
            +L     G +  + +  +++ VT ++++      +A+ +  L    +++ S +G+  L +
Sbjct: 4    FLAQTEQGDIFKITLETDEDMVTEIRLKYFDTVPVAAAMCVLKTGFLFVASEFGNHYLYQ 63

Query: 276  L----NLQPDAKGSYVEVLER----YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSL 327
            +    +   + + S    LE     +    P+ +  +VD E      ++ C         
Sbjct: 64   IAHLGDDDEEPEFSSAMPLEEGDTFFFQPRPLKNLVLVD-ELDSLSPILFCQ-------- 114

Query: 328  RIVRNGIGINEQASVELQG-IKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEE 386
                    ++E A  EL G    +W++R   +D FD +++VSF++ T +L++   + +EE
Sbjct: 115  --------VSEMAVSELPGNPNAVWTVRRHIEDEFDAYIIVSFVNATLVLSIG--ETVEE 164

Query: 387  TEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANA 446
                GF   T TL C     + LVQV    +R + +  R   NEWK+P   ++     N 
Sbjct: 165  VTDSGFLGTTPTLSCSLLGDDALVQVYPDGIRHIRADKR--VNEWKTPGKKTIVKCAVNQ 222

Query: 447  SQVLLATGGGHLVYLEIG-DGILTE-VKHAQLEYEISCLDINPIGENPSYSQIAAVGMWT 504
             QV++A  GG LVY E+   G L E  +  ++  ++ C+ +  +      S+  AVG+  
Sbjct: 223  RQVVIALTGGELVYFEMDPSGQLNEYTERKEMSADVVCMSLANVPPGEQRSRFLAVGL-V 281

Query: 505  DISVRIFSLPDLNLITKEHLGGEIIP-RSVLLCAFE----------------GISYLLCA 547
            D +VRI SL   + +  + L  + +P +   LC  E                G  YL   
Sbjct: 282  DNTVRIISLDPSDCL--QPLSMQALPAQPESLCIVEMGGTEKQDELGERGSIGFLYLNIG 339

Query: 548  LGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKK 607
            L +G LL  +L+  TG+L+D +   LG++P+ L     +    V A S R  + YS   +
Sbjct: 340  LQNGVLLRTVLDPVTGDLSDTRTRYLGSRPVKLFRVRMQGQEAVLAMSSRSWLSYSYQSR 399

Query: 608  LLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIR-SIPLGEHPRR- 665
               + ++ + +     F S   P+ +       L I  ++ +  +  + + PL   PR+ 
Sbjct: 400  FHLTPLSYETLEFASGFASEQCPEGIVAISTNTLRILALEKLGAVFNQVAFPLQYTPRKF 459

Query: 666  ICHQEQSR-----------TFAICSLKNQSCAEE-------------SEM---------- 691
            + H E +            T A  + + Q  AEE             +EM          
Sbjct: 460  VIHPESNNLIIIETDHNAYTEATKAQRKQQMAEEMVEAAGEDERELAAEMAAAFLNENLP 519

Query: 692  ---------------HFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC-VG 735
                             +R+++      +    L+  E   S+  C FS+    +Y  VG
Sbjct: 520  ESIFGAPKAGNGQWASVIRVMNPIQGNTLDLVQLEQNEAAFSVAVCRFSNTGEDWYVLVG 579

Query: 736  TAYVLPEENEPTKGRILVF--IVEDG-KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQ 792
             A  L        G  +    +V +G KL+ + +   +    ++  F G++L  + + ++
Sbjct: 580  VAKDLILNPRSVAGGFVYTYKLVNNGEKLEFLHKTPVEEVPAAIAPFQGRVLIGVGKLLR 639

Query: 793  LYKWMLRDDGTRELQSECGHHGHILALY---VQTRGDFIVVGDLMKSISLLIYKHEEGAI 849
            +Y     D G ++L  +C  + HI A Y   +QT G  ++V D+ +S   + YK  E  +
Sbjct: 640  VY-----DLGKKKLLRKC-ENKHI-ANYISGIQTIGHRVIVSDVQESFIWVRYKRNENQL 692

Query: 850  EERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR--KNSEGATDEERGRLEVVGEYHL 907
               A D    W++   +LD D   GA+   N+  VR   N+    DE+            
Sbjct: 693  IIFADDTYPRWVTTASLLDYDTVAGADKFGNICVVRLPPNTNDEVDED------------ 740

Query: 908  GEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKV 967
                                 G       G +NG          + + F + ++ +LR  
Sbjct: 741  -------------------PTGNKALWDRGLLNGA-------SQKDHDFFQHVEMHLRSE 774

Query: 968  IKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCK 1027
               + G +H  +RS+         KN +DGDL E F  +   +   +S+ ++ +  E+ K
Sbjct: 775  HPPLCGRDHLSFRSY-----YFPVKNVIDGDLCEQFNSMEPNKQKNVSEELDRTPPEVSK 829

Query: 1028 RVEEL 1032
            ++E++
Sbjct: 830  KLEDI 834


>gi|298713790|emb|CBJ27162.1| spliceosomal protein sap, putative [Ectocarpus siliculosus]
          Length = 1256

 Score =  152 bits (385), Expect = 8e-34,   Method: Compositional matrix adjust.
 Identities = 168/750 (22%), Positives = 306/750 (40%), Gaps = 92/750 (12%)

Query: 1   MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
           +L  P +  + +L  FR     +D+  + T+  K  +L++  E+SEL    M        
Sbjct: 54  ILQQPAFCVVRSLIAFRLTSTHKDYCVVGTDAGKVTILEYLPETSELKAVHMETFGKTGC 113

Query: 61  RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL--- 117
           R    GQ    DP  R I +   +    V   +     + A +  LE  +   I F    
Sbjct: 114 RRIVPGQYLAADPQGRAIMISAVEKQKLVYVLNRDTASRLAISSPLEAHKSHAIIFHTCG 173

Query: 118 --YGCAKPTIVVLYQDNKDARHVKTYEVALKDK------------DFVEGPWSQNNLDNG 163
              G   P   V+  D  +A    T E A   +            + V   WS + +  G
Sbjct: 174 VDVGFDNPIFAVIELDYSEADQDSTGEEAANTEKQLVYYQLDLGLNHVTRLWS-DPISRG 232

Query: 164 ADLLIPVP---PPLCGVLIIGEETIVYCSAN---AFKAIPIRPSITKAYGRVDADGSRYL 217
           A LL+PVP       GVL+ GE  + Y   +       +P R ++    G +   G+   
Sbjct: 233 ASLLLPVPGGEAGPSGVLVCGENWVAYKHQDHPEVRAPLPRRKTMPTGRGLLVTSGT--- 289

Query: 218 LGDHAGLL-HL------------LVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYI 264
           L    GL  HL            LV+  E+ +VT + IE+     +A+++       ++ 
Sbjct: 290 LHQQPGLFFHLIQSELGDLYKVSLVVNDEQTEVTDISIEVFDTIPVANSLQITKKGKLFC 349

Query: 265 GSSYGDSQLIKLNLQPDAKGSYVEV--------------------------------LER 292
            S + +  +   +     +G+ V                                  L+ 
Sbjct: 350 ASEFSNHMMFHFHKMGGEEGAVVATQCADPELDDSSESAAQVAPTFTPSATLKNLWGLDE 409

Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMW 351
             +L P+ D  V DL R+   Q+ T  G     +LRI+R+G+ + E A  +L G+   ++
Sbjct: 410 VESLAPLTDMLVGDLAREDTPQMYTLCGRGNRSTLRILRHGLAVTELAENDLPGVPSAVF 469

Query: 352 SLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQ 411
           + +    D +D ++V+SF + T +L   ++D +EE E  GF + + TL       N+++Q
Sbjct: 470 AFKERLTDDYDRYIVLSFTNATMVL--EVQDSVEEVENSGFLATSSTLDVKLMANNKILQ 527

Query: 412 VTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGI--LT 469
           V S  +R++     +   EW++P    +  A AN  QV+LA  GG +++ E+ + +  + 
Sbjct: 528 VYSHGLRVI--MKGQPPQEWRAPGKKQIEKACANERQVVLALAGGEIIFFELDEAMQNIQ 585

Query: 470 EVKHAQL-EYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEI 528
           EV   QL   EI+CL++  + ++   +   AVG W   +V+I  L   NL+ +  +    
Sbjct: 586 EVGTKQLGGVEIACLEMGEVPKDRVRAPFLAVGDWNG-NVKILGLSPENLLEQVAMINLP 644

Query: 529 IPRSVLLCAFEGIS----------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPI 578
            P   L  A               +L   L  G      ++   G L+D ++  LG++P+
Sbjct: 645 HPAESLCLAHMAAEQAAGGSNEQLFLYIGLASGVCQRVAVDATAGTLSDPRQRFLGSKPV 704

Query: 579 TLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKE 638
            L     ++   V A S R  + Y+   +   + ++   +     F++   P+ +     
Sbjct: 705 KLFRIQVQDKRGVLALSSRSWLSYNYQGRYQMTPLSYDPLDFAAEFSTEMCPEGVVAVSG 764

Query: 639 GELTIGTIDDI-QKLHIRSIPLGEHPRRIC 667
            +L I  ++ + +  +  ++PL   PR+I 
Sbjct: 765 EKLRIFGVERLGEVFNQAALPLKYTPRQIA 794



 Score = 92.8 bits (229), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 82/359 (22%), Positives = 158/359 (44%), Gaps = 38/359 (10%)

Query: 694  VRLLDDQTFEFISTYPLDTFEYGCSILSCSFSD-DSNVYYCVGTAYVLPEENEPTKGRIL 752
            +RLLD      +    LD  E   S+   +F + +   +  VGTA  L       +G  +
Sbjct: 909  IRLLDPVEGTTVECLDLDDNEAALSVAPVAFHNRNGEAFVAVGTAKSLTFHPRGHEGCFV 968

Query: 753  -VFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSEC- 810
             V+ + + +L L+ + E      ++  F G+LL  + Q +++Y     D G ++L  +C 
Sbjct: 969  HVYRILENRLVLLHKTEVPDVPLAMKEFQGRLLVGVGQSLRMY-----DLGRKKLLRKCE 1023

Query: 811  GHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDD 870
                  + + +   GD +  GD M+S     Y+  E  + E A D    +M+   +LD D
Sbjct: 1024 NKRMPSMVVSLTVTGDRVFAGDQMESCHCFKYRRAENRLVEFADDQVPRFMTKTCLLDYD 1083

Query: 871  IYLGAENNFNLFTVRKNSEGATDEER-----------------GRLEVVGEYHLGEFVNR 913
               GA+   N+F +R   + + D +                   +++   ++H+GE V+ 
Sbjct: 1084 SIAGADKFGNIFVLRVPLDVSDDVDNPTGNRLLWDSGHLSGAPNKVQQQLQFHVGEVVSS 1143

Query: 914  FRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYL-FLEKLQTNLRKVIKGVG 972
             R  +LV        G    +++ T+NG IG +        + F   ++  +R+    + 
Sbjct: 1144 LRRTTLV-------PGGAEVLLYSTINGSIGALLPFKSRDDVDFFTHMEMYMRQEKPTLC 1196

Query: 973  GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEE 1031
            G +H  +RS+      + AK+ +DGDL E F  L   +   ++  ++ +V E+ K++E+
Sbjct: 1197 GRDHISYRSY-----YLPAKDVIDGDLCEQFSSLPFEKQKLVANGLDRTVGEVVKKLED 1250


>gi|406700450|gb|EKD03620.1| hypothetical protein A1Q2_02097 [Trichosporon asahii var. asahii CBS
            8904]
          Length = 1119

 Score =  152 bits (384), Expect = 8e-34,   Method: Compositional matrix adjust.
 Identities = 211/874 (24%), Positives = 345/874 (39%), Gaps = 158/874 (18%)

Query: 217  LLGDHAGLLHLLVITHEKEKVTGLKIEL--LGETSIASTISYLDNAVVYIGSSYGDSQLI 274
            LLGD  G   L  +  E E +  +++ +  +G  S  S+++YLDN  ++  S+ GDS L+
Sbjct: 291  LLGDEYG--RLTAVGWEFEDMEKVQVGMIDMGVVSAPSSLTYLDNGYLFSASACGDSLLV 348

Query: 275  KLNL-------QPDAKG----------------------SYVEVLERYVNLGPIVDFCVV 305
             L L       QP  KG                        V+V ER++N+ P  DF +V
Sbjct: 349  YLVLPSPNSTRQPSGKGKEKAHDISDAGAYEIVTAEPQHGRVDVRERWMNIAPAKDFAIV 408

Query: 306  DLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVE-LQGIKGMWSLRSSTDDPFDTF 364
              +      VV  SG+    S R+VR+G+G     ++E + GI+ MW++ ++ D P    
Sbjct: 409  KEDDGRVSHVVVASGSASSNSFRVVRSGVGFENLMTIEEIPGIERMWTIPAA-DGPS--- 464

Query: 365  LVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTS 424
            L+VSF   T IL   LE E+   +     +   T        + L+QVT   +RL S  +
Sbjct: 465  LLVSFAYSTTIL--QLEPEVSVFKAADQVTAVPTFAAGLVDKSLLLQVTPEGIRLWSDLA 522

Query: 425  RELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLD 484
                         + NV     ++++ A   G++      DG +T  + +          
Sbjct: 523  AGT---------LAGNVDAPEDNRIVTANVRGNIAVAAFRDGTVTLFRASSQ-------- 565

Query: 485  INPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEGI--- 541
                G         A+  W           ++ L+  E      + +S+   A+      
Sbjct: 566  ----GLQEVVKGFLAIADWCG---------EIELVAVEGADQGTVLKSIREAAYATSLQF 612

Query: 542  -------SYLLCALGDGHLLNFLLNMKTGELTD-RKKVSLGTQPITLRTF--SSKNTTHV 591
                   + +L  L DG  +++ + +   E +D R   SLG +P+ L T   S     HV
Sbjct: 613  QESNNEPTRMLAGLSDGTFVSYSVKLNGAECSDNRHASSLGLRPLRLITLDISPNAEEHV 672

Query: 592  FAA--SDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDI 649
             AA  SDR ++++ S     +S+   K +                           +  +
Sbjct: 673  VAAGISDRLSLVFESRDHYEFSSSGKKGI-----------------------VFERLTSL 709

Query: 650  QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEES----EMHFVRLLDDQTFEFI 705
            +KL ++++ LG                +     +S   ++    +   V L +  T E +
Sbjct: 710  KKLQVQTLDLGNRSATRVAALPGYNLVVAGTVTRSMDHQTGDVLQSSSVELRNATTLELL 769

Query: 706  STYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENE----------------PTKG 749
            S + L   E   S+ + +       Y  VGTA     ENE                  +G
Sbjct: 770  SEFQLPEREAVASVNAVTL--HGRKYILVGTAIF---ENEDALEDATLEDVTSFIATNRG 824

Query: 750  RILVFIVEDG---KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTREL 806
            R+L+F + +     L L+      G VY     +G L  A + K+ + +   +     E 
Sbjct: 825  RLLLFQINESAGPSLDLVTSMTFNGPVYDTVVIHGFLAVATSTKVSILRLTTQPPSLEEA 884

Query: 807  QS-ECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANW---MS 862
             S       H LA+    +   +VVGD M+SI +L    E G I    RD NA+    +S
Sbjct: 885  ASFAFAFETHHLAVVEIDKEKRLVVGDAMRSIIVLSVDPESGDIVGDQRDMNAHLVRCLS 944

Query: 863  AVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMR 922
            AV  ++  + + A+N  NL T R   +G T              L E V R + G+L   
Sbjct: 945  AVHDVEPGVMI-ADNYANLLTFRLG-KGITP--------AASIGLSEDVTRLQPGTLA-- 992

Query: 923  LPDSDVGQI--PTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWR 980
             P S  G I    ++  TVNG +GVI  L       L+ LQ N+ K+ KG GGL   +W+
Sbjct: 993  -PVSAEGDILRADLLCTTVNGRLGVIGELGKGSIRTLDDLQRNMNKLYKGPGGL---EWK 1048

Query: 981  SFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEI 1014
               N     +   F+DGD ++ F  L  +  D+I
Sbjct: 1049 ESGNMLVPRETVGFIDGDFVQRFSSLDSSLQDKI 1082


>gi|42409127|dbj|BAD10377.1| putative splicing factor 3b, subunit 3, 130kDa [Oryza sativa
           Japonica Group]
 gi|42409258|dbj|BAD10521.1| putative splicing factor 3b, subunit 3, 130kDa [Oryza sativa
           Japonica Group]
 gi|125538000|gb|EAY84395.1| hypothetical protein OsI_05771 [Oryza sativa Indica Group]
          Length = 1234

 Score =  150 bits (379), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 179/765 (23%), Positives = 312/765 (40%), Gaps = 83/765 (10%)

Query: 1   MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
           +L V ++G I +L  FR  G  +D+L + ++  +  +L++  + + L             
Sbjct: 68  LLSVDVFGAIRSLAQFRLTGATKDYLVVGSDSGRLVILEYSPDRNRLDKVHQETFGKSGC 127

Query: 61  RPTDNGQIGIIDPDCRLIGLHLYDG--LFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLY 118
           R    GQ+  +DP  R + +   +   L  V+  D   +L  +  +   +   L      
Sbjct: 128 RRIVPGQLLAVDPKGRALCIAALEKQKLVYVLNRDAAARLTISSPLEAHKSNTLTFSLTA 187

Query: 119 ---GCAKPTIVVL---YQDNKDARHVKTYEVALKDKDFVEGPWSQNN--------LDNGA 164
              G   P    +   Y ++      +  E A K   F E     N+        +DNGA
Sbjct: 188 LDCGFDNPVFAAIELEYAESDRDPTGQAAEQAQKHLTFYELDLGLNHVSRKASEPIDNGA 247

Query: 165 DLLIPVP-----PPLCGVLIIGEETIVYCS---ANAFKAIPIRPSITKAYGR--VDADGS 214
           +LL+ VP     P   G+L+  +  ++Y +         IP R  +    G   V A   
Sbjct: 248 NLLVTVPGGGDGP--SGLLVCCDNFVLYRNQGHPEVRAVIPRRADLPAERGVLIVAAATH 305

Query: 215 R------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSY 268
           R      +LL    G +  + + H  + VT L+I+      + S I  L +  ++  S +
Sbjct: 306 RQKSLFFFLLQTEYGDIFKVDLEHSNDTVTELRIKYFDTIPVTSAICVLRSGFLFAASEF 365

Query: 269 GDSQLIKLN--------------------------LQPDAKGSYVEVLERYVNLGPIVDF 302
           G+  L +                             QP A  +   + E   +L PI+D 
Sbjct: 366 GNHALYQFRDIGRDVDVESSSATLMETDEGFQPVFFQPRALKNLYRIDE-IESLMPIMDM 424

Query: 303 CVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIK-GMWSLRSSTDDPF 361
            V +L  +   QV T  G     +LRI+R G+ I+E A   L      +W+++ + +D F
Sbjct: 425 RVANLFDEETPQVFTACGRGPRSTLRILRPGLAISEMARSMLPAEPIAVWTVKKNINDMF 484

Query: 362 DTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVS 421
           D ++VVSF + T +L++   + +EE     F   T +L       + L+QV    +R + 
Sbjct: 485 DAYIVVSFANVTLVLSIG--ETIEEVSDSQFLDTTHSLAVSLLGEDSLMQVHPNGIRHIR 542

Query: 422 STSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTEVKHAQLEYEI 480
              R   NEW++P   ++    +N  QV++A  GG L+Y E+   G L EV+   +  ++
Sbjct: 543 EDGRV--NEWRTPGKKTITKVGSNRLQVVIALSGGELIYFEMDMTGQLMEVEKQDMSGDV 600

Query: 481 SCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPRSVLLCAFE 539
           +CL I P+ E    S+  AVG + D ++RI S+ PD  L           P S++    +
Sbjct: 601 ACLAIAPVPEGRQRSRFLAVGSF-DNTIRILSVDPDDCLQPLSVQSVSSAPESLMFLEVQ 659

Query: 540 GIS------------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKN 587
                          +L   L +G L    ++M TG+L+D +   LG +P  L      +
Sbjct: 660 ASVGGEDGADHPANLFLNAGLQNGVLFRTNVDMVTGQLSDTRSRFLGLRPPKLFPCIVSH 719

Query: 588 TTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTID 647
              +   S RP + Y      L + ++   +     F+S    + +       L I TI+
Sbjct: 720 RQAMLCLSSRPWLGYIHQGHFLLTPLSCDTLESAASFSSDQCSEGVVAVAGDALRIFTIE 779

Query: 648 DI-QKLHIRSIPLGEHPRRICHQEQSRTFAIC-SLKNQSCAEESE 690
            + +  +  +IPL   PR+     + +  A+  S K    AEE E
Sbjct: 780 HLGETFNETAIPLRYTPRKFVILPKKKYLAVIESDKGALSAEERE 824



 Score = 93.2 bits (230), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 91/365 (24%), Positives = 166/365 (45%), Gaps = 44/365 (12%)

Query: 694  VRLLDDQTFEFISTYPLDTFEYGCSILSCSFSD-DSNVYYCVGTAYVL---PEENEPTKG 749
            +R+LD ++ +      L   E   SI + +F D +      VGTA  L   P+ N  + G
Sbjct: 883  IRILDPKSRDTTCLLELQDNEAAVSICTVNFHDKEHGTLLAVGTAKGLQFWPKRNL-SAG 941

Query: 750  RILVF-IVEDGK-LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQ 807
             I ++  V++G+ L+L+ + + +    +L  F G+LLA +   ++LY     D G R+L 
Sbjct: 942  FIHIYKFVDEGRSLELLHKTQVEEVPLALCQFQGRLLAGVGSVLRLY-----DLGKRKLL 996

Query: 808  SECGHHGHILALY-VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEI 866
             +C +      +  + T  D I VGD+ +S     Y+ +E  +   A D    W++A   
Sbjct: 997  RKCENKLFPRTIVSIHTYRDRIYVGDMQESFHYCKYRRDENQLYIFADDSVPRWLTAANH 1056

Query: 867  LDDDIYLGAENNFNLFTVR------------------KNSEGATDEERGRLEVVGEYHLG 908
            +D D   GA+   N++  R                  K  +G  +    ++E + ++H+G
Sbjct: 1057 IDFDTMAGADKFGNIYFARLPQDLSDEIEEDPTGGKIKWEQGKLNGAPNKVEEIVQFHVG 1116

Query: 909  EFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIG-VIASLPHEQYLFLEKLQTNLRKV 967
            + V   +  SL+        G    +I+GTV G +G ++A    E   F   L+ +LR+ 
Sbjct: 1117 DVVTCLQKASLI-------PGGGECLIYGTVMGSVGALLAFTSREDVDFFSHLEMHLRQE 1169

Query: 968  IKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCK 1027
               + G +H  +RS          K+ +DGDL E F  L      +I+  ++ +  E+ K
Sbjct: 1170 HPPLCGRDHMAYRS-----AYFPVKDVIDGDLCEQFPSLPADMQRKIADELDRTPGEILK 1224

Query: 1028 RVEEL 1032
            ++E++
Sbjct: 1225 KLEDI 1229


>gi|254572247|ref|XP_002493233.1| Protein involved in pre-mRNA splicing [Komagataella pastoris GS115]
 gi|238033031|emb|CAY71054.1| Protein involved in pre-mRNA splicing [Komagataella pastoris GS115]
 gi|328352752|emb|CCA39150.1| Pre-mRNA-splicing factor rse1 [Komagataella pastoris CBS 7435]
          Length = 1179

 Score =  150 bits (379), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 203/797 (25%), Positives = 352/797 (44%), Gaps = 86/797 (10%)

Query: 296  LGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIK-GMWSLR 354
            L PI D     L  Q + Q+V  SG     SL+++++ + INE AS EL G    +++ +
Sbjct: 409  LNPITD----GLLDQEKSQLVLTSGGSSRSSLKLLKHELAINELASSELPGSALNIFTTK 464

Query: 355  SSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTS 414
             S  D +D ++V+SFI  T +L++   + ++E    G   Q  T+       N L QV S
Sbjct: 465  VSHKDQYDKYMVISFIDGTLVLSIG--ESVDEVSDSGLELQVSTIAVQQVGRNSLAQVHS 522

Query: 415  -GSVRL--VSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG--DGILT 469
             G V L  ++  S E ++ W+ P G  + VA+   SQ+ L      LVY EI   D ++ 
Sbjct: 523  HGIVHLKGLNEPSFE-KSLWQPPVGAEILVASMTNSQIALGLSNRELVYFEIDEYDQLIE 581

Query: 470  EVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PD--LNLITKEHLGG 526
              +  ++   IS L +  I +    S   AVG   D++++I S  P+  L +++ + L  
Sbjct: 582  HKERKEMSGRISALSLGQIPDGRLRSPFLAVG-CNDMTIKILSTDPNSCLEVLSLQALSS 640

Query: 527  EIIPRSVLLCAFE-GISYLL-CALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFS 584
              IP  + +   E G  Y +   L  G  +  LLN  +G+L+D +   LG + ++L    
Sbjct: 641  --IPSDLQITNLEQGAGYFVHIGLDSGIYIRTLLNRISGQLSDTRVNYLGPKRVSLSPVK 698

Query: 585  SKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGE-LTI 643
                  V A SD   + Y+S      + +   + ++ C F S   P++  +    E LTI
Sbjct: 699  IFGKNLVLAFSDTSYLSYNSGFDAKITPLCDGKFTYGCGFISEECPENGVVGVSAENLTI 758

Query: 644  GTIDDIQK-LHIRSIPLGEHPRRIC-----------HQEQSRTFAICSLKNQSCAE---- 687
             TID +   ++++ IPL   PRRI             Q  ++   + S+  +   E    
Sbjct: 759  FTIDQLDTDMNVKDIPLTFTPRRITLVSEDVVVVTESQHYTKNPYLNSVATEESRELYQQ 818

Query: 688  ---ESEMHF---VRLLDDQTFEFISTYPLDT--FEYGCSILSCSFSDDSNVYYCVGTAY- 738
               E   H+   +  +D Q+ + I    LD     +G + +S +   D  VY+ V  +  
Sbjct: 819  IGYEKSSHWSSVIETVDIQSQQVIQAINLDKELAIFGSTKVSFASHPD-EVYFIVSCSVE 877

Query: 739  --VLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKW 796
              +LP  N+ T+ R    I E+G LQ +   ET   VYSL  F GKLLA +   + LY  
Sbjct: 878  QELLPNINKGTQLRTYK-ITEEG-LQFLHSTETDQLVYSLTEFQGKLLAGVGNLLVLYDL 935

Query: 797  MLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDY 856
             L+    + L     H        +  +G  IV+ D+ +S   L +  EE        D 
Sbjct: 936  GLKKLLKKSLTQTSFHQ----ITSIDYQGFRIVISDMKQSSMFLSFNAEENRFISLCDDI 991

Query: 857  NANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDEERGRLEVVGE--------- 904
                ++  ++LD    +  +    +  +R   K ++   D    + +++GE         
Sbjct: 992  TQRHITCSKMLDYSTVVTGDRFGTISVLRCPEKVNKDHLDYIEAKSKMIGECPFKLKQLA 1051

Query: 905  -YHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASL--PHEQYLFL---E 958
             Y++ + V     GSL        VG   ++I+  + G IG+++ +  P +   F+   +
Sbjct: 1052 SYYVQDVVTSLSRGSLT-------VGGKQSIIYTGLQGTIGILSPIQTPSDIDFFVSLED 1104

Query: 959  KLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTM 1018
             L+  L K    + G +H ++R +      V A+  +DGDLI+ F  L  +    IS  +
Sbjct: 1105 SLRQELLKSKHLLTGRSHLKYRGY-----YVPAQGVVDGDLIQYFYQLPDSTKQFISTKL 1159

Query: 1019 NVSVEELCKRVEELTRL 1035
            + S  E+ +++  +  L
Sbjct: 1160 DRSTLEIERKISIMRSL 1176


>gi|74190887|dbj|BAE28225.1| unnamed protein product [Mus musculus]
          Length = 937

 Score =  149 bits (377), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 178/788 (22%), Positives = 327/788 (41%), Gaps = 100/788 (12%)

Query: 1   MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
           +L V ++G I +L  FR  G  +D++ + ++  +  +L++   S  +  +   +   + G
Sbjct: 53  LLTVEVFGVIRSLMAFRLTGGTKDYIVVGSDSGRIVILEYQP-SKNMFEKIHQETFGKSG 111

Query: 61  -RPTDNGQIGIIDPDCRLIGLHLYDG--LFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
            R    GQ   +DP  R + +   +   L  ++  D   +L  +  +   +   L    +
Sbjct: 112 CRRIVPGQFLAVDPKGRAVMISAIEKQKLVYILNRDAAARLTISSPLEAHKANTLVYHVV 171

Query: 118 ---YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNG 163
               G   P    L  D ++A +  T E A   +    F E     N+        L+  
Sbjct: 172 GVDVGFENPMFACLEMDYEEADNDPTGEAAANTQQTLTFYELDLGLNHVVRKYSEPLEEH 231

Query: 164 ADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR--- 215
            + LI VP     P   GVLI  E  I Y   N      IR  I +    +D D  R   
Sbjct: 232 GNFLITVPGGSDGP--SGVLICSENYITY--KNFGDQPDIRCPIPRRRNDLD-DPERGMI 286

Query: 216 --------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAV 261
                         +L     G +  + +  +++ VT ++I+      +A+ +  L    
Sbjct: 287 FVCSATHKTKSMFFFLAQTEQGDIFKITLETDEDMVTEIRIKYFDTVPVAAAMCVLKTGF 346

Query: 262 VYIGSSYGDSQLIKLN------------------------LQPDAKGSYVEVLERYVNLG 297
           +++ S +G+  L ++                          QP    + V +++   +L 
Sbjct: 347 LFVASEFGNHYLYQIAHLGDDDEEPEFSSAMPLEEGDTFFFQPRPLKNLV-LVDELDSLS 405

Query: 298 PIVDFC-VVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRS 355
           PI+ FC + DL  +   Q+    G     SLR++R+G+ ++E A  EL G    +W++R 
Sbjct: 406 PIL-FCQIADLANEDTPQLYVACGRGPRSSLRVLRHGLEVSEMAVSELPGNPNAVWTVRR 464

Query: 356 STDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSG 415
             +D FD +++VSF++ T +L++   + +EE    GF   T TL C     + LVQV   
Sbjct: 465 HIEDEFDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSLLGDDALVQVYPD 522

Query: 416 SVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKH 473
            +R + +  R   NEWK+P   ++     N  QV++A  GG LVY E+   G L E  + 
Sbjct: 523 GIRHIRADKR--VNEWKTPGKKTIVKCAVNQRQVVIALTGGELVYFEMDPSGQLNEYTER 580

Query: 474 AQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIP-RS 532
            ++  ++ C+ +  +      S+  AVG+  D +VRI SL   + +  + L  + +P + 
Sbjct: 581 KEMSADVVCMSLANVPPGEQRSRFLAVGL-VDNTVRIISLDPSDCL--QPLSMQALPAQP 637

Query: 533 VLLCAFE----------------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQ 576
             LC  E                G  YL   L +G LL  +L+  TG+L+D +   LG++
Sbjct: 638 ESLCIVEMGGTEKQDELGERGSIGFLYLNIGLQNGVLLRTVLDPVTGDLSDTRTRYLGSR 697

Query: 577 PITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIA 636
           P+ L     +    V A S R  + YS   +   + ++ + +     F S   P+ +   
Sbjct: 698 PVKLFRVRMQGQEAVLAMSSRSWLSYSYQSRFHLTPLSYETLEFASGFASEQCPEGIVAI 757

Query: 637 KEGELTIGTIDDIQKLHIR-SIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVR 695
               L I  ++ +  +  + + PL   PR+     +S    I    + +  E ++    +
Sbjct: 758 STNTLRILALEKLGAVFNQVAFPLQYTPRKFVIHPESNNLIIIETDHNAYTEATKAQRKQ 817

Query: 696 LLDDQTFE 703
            + ++  E
Sbjct: 818 QMAEEMVE 825


>gi|341886300|gb|EGT42235.1| hypothetical protein CAEBREN_28831 [Caenorhabditis brenneri]
          Length = 1005

 Score =  148 bits (374), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 170/735 (23%), Positives = 312/735 (42%), Gaps = 86/735 (11%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESS---ELITRAMGDVSDRIGRP 62
           I+G + +L  FR     +DF+ + ++  +  +LQ++A+ +    L     G    R   P
Sbjct: 59  IFGIVRSLLAFRLTAGTRDFIAVGSDSGRIVILQYNADKTCFERLHQETFGKTGCRRIVP 118

Query: 63  TDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL--- 117
              G     DP  R  +IG      L  ++  D +  L  +  +   +   L    +   
Sbjct: 119 ---GHYLAGDPRGRALMIGAVERQKLVYIMNRDAEAHLTISSPLEAHKQHTLCYAMVGVD 175

Query: 118 YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNGADL 166
            G   PT   L  D +DA +  T E A + +    F E     N+        L +  +L
Sbjct: 176 VGFENPTFACLEFDYEDADNDPTGEAAKRTQQTLTFYELDLGLNHVVRKYAEPLTDPGNL 235

Query: 167 LIPVP---PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR-------- 215
           LI VP       GV++  E  +VY   N      IR  I +    +D D  R        
Sbjct: 236 LIAVPGGNDGPSGVIVCCENYLVY--KNLGDQPDIRCPIPRRRNELD-DADRTMLIIATA 292

Query: 216 ---------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGS 266
                    +L+    G +  + +  +++ VT +K++       A+ +  L +  +++ +
Sbjct: 293 THKTKNMFFFLVQAENGDVFKVTLETDEDLVTEMKLKYFDTVPPANAMCILKSGFLFVAA 352

Query: 267 SYGDSQLIKLN-----------------------LQPDAKGSYVEVLERYVNLGPIVDFC 303
            +G+ +L ++                         +P    S + + +   +L P+ D  
Sbjct: 353 EFGNHELYQIASLGEGGDDEFSSAMGFGENDAAFFEPHELRSLIPI-DSMDSLSPLTDAV 411

Query: 304 VVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTDDPFD 362
           + D+ R+   Q+ T  G     +L+++RNG+ I+E A  +L G    +W+++ + +D +D
Sbjct: 412 IGDVAREDAAQLYTLVGRGARSNLKVLRNGLEISEMAVSDLPGNPNAVWTVKKNIEDQYD 471

Query: 363 TFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSS 422
           +++VVSF++ T  LA+ + D +EE    GF   T T+ C     + LVQ+ +  +R + +
Sbjct: 472 SYIVVSFVNAT--LALTIGDTVEEASDSGFLPTTPTIGCSMIGDDSLVQIYAEGIRHIRA 529

Query: 423 TSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTEVKHAQL-EYEI 480
             R   NEWK+PP   +     N  QV +A  GG LVY E+  +G L E    +L   +I
Sbjct: 530 DKR--INEWKAPPRRQIVKCCVNRRQVAVALSGGELVYFELDLNGTLNEFTERKLFNADI 587

Query: 481 SCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPRSVLLCAF- 538
           +C+  + I E    S+  A+G   D +VRI SL P+  L+          P S+LL    
Sbjct: 588 ACMTFSEISEGELNSRFLALGT-VDNAVRIISLDPNDMLMPLSTQNLPCPPESILLIDTP 646

Query: 539 ----EGIS--YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
               +G++  +L   L +G L    ++  TG + D +   LGT+P+ L     +  + + 
Sbjct: 647 NDDGKGVAAVHLNIGLQNGCLFRNTVDNVTGAIMDTRTRYLGTRPVKLFKVQVQGRSAIL 706

Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQ-K 651
             S R  ++Y   ++   + ++   + +   F S    + +       L I   + +   
Sbjct: 707 CTSSRSWLLYHFQRRFHLTPLSYVNLEYAASFCSNQCAEGIVAISSSTLRIIAAEKLGVA 766

Query: 652 LHIRSIPLGEHPRRI 666
            +++S      PRRI
Sbjct: 767 FNVQSFEQKMTPRRI 781


>gi|341886293|gb|EGT42228.1| CBN-TEG-4 protein [Caenorhabditis brenneri]
          Length = 1006

 Score =  148 bits (373), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 167/695 (24%), Positives = 299/695 (43%), Gaps = 92/695 (13%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESS---ELITRAMGDVSDRIGRP 62
           I+G + +L  FR     +DF+ + ++  +  +LQ++A+ +    L     G    R   P
Sbjct: 59  IFGIVRSLLAFRLTAGTRDFIAVGSDSGRIVILQYNADKTCFERLHQETFGKTGCRRIVP 118

Query: 63  TDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL--- 117
              G     DP  R  +IG      L  ++  D +  L  +  +   +   L    +   
Sbjct: 119 ---GHYLAGDPRGRALMIGAVERQKLVYIMNRDAEAHLTISSPLEAHKQHTLCYAMVGVD 175

Query: 118 YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNGADL 166
            G   PT   L  D +DA +  T E A + +    F E     N+        L +  +L
Sbjct: 176 VGFENPTFACLEFDYEDADNDPTGEAAKRTQQTLTFYELDLGLNHVVRKYAEPLTDPGNL 235

Query: 167 LIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR------ 215
           LI VP     P   GV++  E  +VY   N      IR  I +    +D D  R      
Sbjct: 236 LIAVPGGNDGP--SGVIVCCENYLVY--KNLGDQPDIRCPIPRRRNELD-DADRTMLIIA 290

Query: 216 -----------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYI 264
                      +L+    G +  + +  +++ VT +K++       A+ +  L +  +++
Sbjct: 291 TATHKTKNMFFFLVQAENGDVFKVTLETDEDLVTEMKLKYFDTVPPANAMCILKSGFLFV 350

Query: 265 GSSYGDSQLIKLN-----------------------LQPDAKGSYVEVLERYVNLGPIVD 301
            + +G+ +L ++                         +P    S + + +   +L P+ D
Sbjct: 351 AAEFGNHELYQIASLGEGGDDEFSSAMGFGENDAAFFEPHELRSLIPI-DSMDSLSPLTD 409

Query: 302 FCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTDDP 360
             + D+ R+   Q+ T  G     +L+++RNG+ I+E A  +L G    +W+++ + +D 
Sbjct: 410 AVIGDVAREDAAQLYTLVGRGARSNLKVLRNGLEISEMAVSDLPGNPNAVWTVKKNIEDQ 469

Query: 361 FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLV 420
           +D+++VVSF++ T  LA+ + D +EE    GF   T T+ C     + LVQ+ +  +R +
Sbjct: 470 YDSYIVVSFVNAT--LALTIGDTVEEASDSGFLPTTPTIGCSMIGDDSLVQIYAEGIRHI 527

Query: 421 SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTEVKHAQL-EY 478
            +  R   NEWK+PP   +     N  QV +A  GG LVY E+  +G L E    +L   
Sbjct: 528 RADKR--INEWKAPPRRQIVKCCVNRRQVAVALSGGELVYFELDLNGTLNEFTERKLFNA 585

Query: 479 EISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPRSVLLCA 537
           +I+C+  + I E    S+  A+G   D +VRI SL P+  L+          P S+LL  
Sbjct: 586 DIACMTFSEISEGELNSRFLALGT-VDNAVRIISLDPNDMLMPLSTQNLPCPPESILLID 644

Query: 538 F-----EGIS--YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTH 590
                 +G++  +L   L +G L    ++  TG + D +   LGT+P+ L     +  + 
Sbjct: 645 TPNDDGKGVAAVHLNIGLQNGCLFRNTVDNVTGAIMDTRTRYLGTRPVKLFKVQVQGRSA 704

Query: 591 VFAASDRPTVIYSSNKKL---LYSNVNLKEVSHMC 622
           +   S R  ++Y   ++      S VNL+  +  C
Sbjct: 705 ILCTSSRSWLLYHFQRRFHLTPLSYVNLEYAASFC 739


>gi|170580631|ref|XP_001895346.1| splicing factor 3B subunit 3 [Brugia malayi]
 gi|158597745|gb|EDP35799.1| splicing factor 3B subunit 3, putative [Brugia malayi]
          Length = 1181

 Score =  147 bits (372), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 242/1178 (20%), Positives = 463/1178 (39%), Gaps = 214/1178 (18%)

Query: 7    YGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTDN 65
            +G I +L  FR  G ++D++ I ++  +  +L+++A+      R   +   + G R    
Sbjct: 60   FGIIRSLLAFRLTGGSKDYIVIGSDAGRIVILEYNAQKV-CFERVHQETFGKTGCRRIVP 118

Query: 66   GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YGC 120
            GQ   +DP  R  LIG      L  ++  D    L  +  +   +   +    +    G 
Sbjct: 119  GQFLAVDPKGRAILIGAVERQKLVYIMNRDASANLTISSPLEAHKSHCICYSVVGVDVGF 178

Query: 121  AKPTIVVLYQDNKDARHVKTYEVALK--------DKDFV----EGPWS----------QN 158
              PT   L  D ++  H  T  +A K        + D V    +GP              
Sbjct: 179  ENPTFACLEVDYEEVDHDPTGHLATKIPQTLTFYELDLVPGGQDGPSGVIVCCENYLVYK 238

Query: 159  NLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLL 218
            NL +  D+  PVP     +       I+ CSA     +                   +L+
Sbjct: 239  NLGDQPDIKCPVPRRRNELDDCDRTVIIVCSATHKTKLMYF----------------FLV 282

Query: 219  GDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKL-- 276
                G +  + +  E + VT LKI+      +++ +  L    ++  S +G+  L ++  
Sbjct: 283  QTDQGDIFKVTLESEHDIVTELKIKYFDTIPVSNAMCILKTGFLFTASEFGNHHLYQIAH 342

Query: 277  ------------NLQPDAKGSY---------VEVLERYVNLGPIVDFCV---VDLERQGQ 312
                         +Q +   ++         + ++++  +L P++ F          Q  
Sbjct: 343  LGDEDDEPEFSSRMQLEEGETFFFAPRGLTNLAIVDQMDSLSPLIFFPTRTRCTFSSQSI 402

Query: 313  GQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDPFDTFLVVSFISE 372
             Q       +   +  ++ N +                W+++ + DD FD+ +VVSF++ 
Sbjct: 403  AQWFGGKRKWLSLNCLVIPNAV----------------WTVKRNIDDKFDSHIVVSFVNA 446

Query: 373  TRILAMNLEDELEETEIEGFCSQTQTLFC----HDAIYNQLVQVTSGSVRLVSSTSRELR 428
            T +L++   + +EE    GF   T TL C     DA+  + V    G +  + S  R L 
Sbjct: 447  TLVLSIG--ETVEEVTDSGFLGTTPTLGCALIGDDALCIRTVFDILGLIGELMS-GRHLV 503

Query: 429  NEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKHAQLEYEISCLDIN 486
            N         +     N  QV +A  GG LVY E+   G L E  +  +L  ++ C+ ++
Sbjct: 504  N-------VQLLKCALNRRQVAIALAGGELVYFELDVTGQLNEYTERRELPADVLCMSLS 556

Query: 487  PIGENPSYSQIAAVGMWTDISVRIFSL------PDLNLITKEHLGGEIIPRSVLLCAFEG 540
             I E    S+   VG+  D +VRI SL        L LI+         P S+++    G
Sbjct: 557  EIPEGELRSRFLTVGL-ADKTVRIISLDPQDCLSPLQLISFSMQALPSEPESIIVLEMFG 615

Query: 541  IS-------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFA 593
                     +L   L +G LL   ++  TGELTD +   LGT+ + L     ++   + A
Sbjct: 616  TETQSASTVHLNIGLQNGCLLRTTVDQVTGELTDNRTRYLGTKSVKLFRVRIQSKDAIMA 675

Query: 594  ASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLH 653
            AS R  ++Y    +   + ++   +     F+S   P+ +    E  L I +++ +  + 
Sbjct: 676  ASSRAWLLYDYQSRFHLTPLSYAALEFAAGFSSEQCPEGIVAIAENTLRILSLEKLGAVF 735

Query: 654  IRSI-PLGEHPRR-ICHQ---------------------EQSRTFAICSLKNQSCAEESE 690
               + PL   PRR + H+                     E+ +  A   ++    AEE++
Sbjct: 736  NHVVHPLDYTPRRMVVHKASGNLIIIENDHAAFTVKGKMERRKQLADELMEVAKEAEEAD 795

Query: 691  MHFV-------------------------------RLLDDQTFEFISTYPLDTFEYGCSI 719
               V                               R++     E +S +P    E   +I
Sbjct: 796  QQAVKEMADAIRTEKVDERVYGSPKNQKGKWASTVRIMRSNDGETLSHFPFAEDEAAFAI 855

Query: 720  LSCSFSDDSNVYYC-VGTAYVLP-EENEPTKGRILVFIVEDG--KLQLIAEKETKGAVYS 775
                F + S+  +  VG    L  +  +   G I  F++      LQL+    T   V +
Sbjct: 856  AMVQFQNQSDTQFVLVGCGCDLQLKPRKANGGCIYTFLLAANGTTLQLLHRTPTDEVVNA 915

Query: 776  LNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHG-HILALYVQTRGDFIVVGDLM 834
            ++ F G  LA + +K++LY     D G R+L ++C +       + +++ G  IVV D  
Sbjct: 916  IHDFRGMALAGVGKKVRLY-----DLGKRKLLAKCENRQIPTQVVDIRSMGQRIVVSDSQ 970

Query: 835  KSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDE 894
            +S+  + YK ++G +     + +  +++ V +LD D  +   + F    V +  +G T+E
Sbjct: 971  ESVHFMRYKKQDGQLSIFCDETSPRYVTCVCLLDYDT-VAVGDRFGNVAVLRLPKGVTEE 1029

Query: 895  ------------ERG-------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVI 935
                        +RG       +LE +   ++G+ +   +  SLV        G    + 
Sbjct: 1030 VQEDPTGVRALWDRGNLNGASQKLEAIAHLYIGDAITSMQKTSLV-------PGANDCLS 1082

Query: 936  FGTVNGVIGVIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNF 994
            + T++G+IG++   +  +++ F + L+ ++R     + G +H  +RS+    K+V     
Sbjct: 1083 YTTISGIIGILVPFMSRDEFEFFQNLEMHMRVEYPPLCGRDHLAYRSYYFPVKSV----- 1137

Query: 995  LDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            +DGDL E +  +   +   + + +     E+ K++E++
Sbjct: 1138 IDGDLCEQYSLMPLDKQKSVGEELGRKPTEIHKKLEDI 1175


>gi|326497839|dbj|BAJ94782.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 1227

 Score =  147 bits (372), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 181/767 (23%), Positives = 313/767 (40%), Gaps = 87/767 (11%)

Query: 1   MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
           +L V ++G I +L  FR  G  +D+L + ++  +  +L++  + +               
Sbjct: 63  LLSVDVFGAIRSLAQFRLTGATKDYLVVGSDSGRLVILEYSPDRNRFDKVHQETFGKSGC 122

Query: 61  RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEE---------LQV 111
           R    GQ+  +DP  R + +   +    V   +     +   +  LE          L  
Sbjct: 123 RRIVPGQLLSVDPKGRALCIAALEKQKLVYVLNRDASARLTISSPLEAHKSHTLTFALTA 182

Query: 112 LDIKF---LYGCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNN--------L 160
           LD  F   ++G     I + Y D+      +    A K   F E     N+        +
Sbjct: 183 LDCGFDNPVFG----AIELEYGDSDRDPTGQAASHAQKLLTFYELDLGLNHVSRKVSEPI 238

Query: 161 DNGADLLIPVPPPL---CGVLIIGEETIVYCS---ANAFKAIPIRPSITKAYGR--VDAD 212
           DNGA++L+ VP       G+L+  +  ++Y +         IP R  +    G   V A 
Sbjct: 239 DNGANMLVTVPGGADGPSGLLVCCDNFVLYRNQGHPEVRAVIPRRVDLPAERGVLIVAAA 298

Query: 213 GSR------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGS 266
             R      +LL    G +  + + H  + V+ L+I+      +AS I  L +  ++  S
Sbjct: 299 THRQKSLFFFLLQTEYGDIFKVDLDHHGDTVSELRIKYFDTIPVASAICVLRSGFLFAAS 358

Query: 267 SYGDSQLIKLN--------------------------LQPDAKGSYVEVLERYVNLGPIV 300
            +G+  L +                             QP A  + V + E   +L PI+
Sbjct: 359 EFGNHALYQFRDIGRDVDVESSSATLMETEEGFQPVFFQPRALKNLVRIDE-IESLMPIM 417

Query: 301 DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIK-GMWSLRSSTDD 359
           D  + +L  +   QV T  G     +LRI+R G+ I+E A   L      +W+++ + +D
Sbjct: 418 DMRIANLFDEETPQVYTACGRGSRSTLRILRPGLAISEMARSMLPAEPIAVWTVKKNIND 477

Query: 360 PFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRL 419
            FD ++VVSF + T +L++   + +EE     F   T +L       + L+QV    +R 
Sbjct: 478 MFDAYIVVSFANVTLVLSIG--ETIEEVSDSHFLDTTHSLAVSLLGEDSLMQVHPNGIRH 535

Query: 420 VSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTEVKHAQLEY 478
           +    R   NEW++P   ++    +N  QV++A  GG L+Y E+   G L EV+   +  
Sbjct: 536 IREDGRV--NEWRTPGKKTITKVGSNRLQVVIALSGGELIYFEMDMTGQLMEVEKQDMSG 593

Query: 479 EISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPRSVLLCA 537
           +++CL I P+ E    S+  AVG + D ++RI S  PD  L           P S+L   
Sbjct: 594 DVACLAIAPVPEGRQRSRFLAVGSY-DNTIRILSFDPDDCLQPLSVQSVSSAPESLLFLE 652

Query: 538 FE-------GISY-----LLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSS 585
            +       G  Y     L   L +G L    ++M TG+L+D +   LG +P  L     
Sbjct: 653 VQASVGGEDGADYPANLFLNAGLQNGVLFRTNVDMVTGQLSDTRSRFLGLRPPKLFPCIV 712

Query: 586 KNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGT 645
            +   +   S RP + Y      L + ++   +     F+S    + +       L I T
Sbjct: 713 SHRQAMLCLSSRPWLGYIHQGHFLLTPLSCDTLESAASFSSDQCSEGVVAVAGDALRIFT 772

Query: 646 IDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAIC-SLKNQSCAEESE 690
           I+ + +  +  SIPL   PR+     + +  A+  S K    AE+ E
Sbjct: 773 IERLGETFNETSIPLRYTPRKFVILPKKKYLAVIESDKGAFSAEQRE 819



 Score = 89.7 bits (221), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 88/364 (24%), Positives = 160/364 (43%), Gaps = 42/364 (11%)

Query: 694  VRLLDDQTFEFISTYPLDTFEYGCSILSCSFSD-DSNVYYCVGTAYVL---PEENEPTKG 749
            +R+LD ++ +      L   E   SI + +F D +      VGT   L   P+ +  +  
Sbjct: 876  IRILDPRSRDTTCLLELQDNEAAVSICTVNFHDKEHGTLLAVGTTKGLQFWPKRSLASGF 935

Query: 750  -RILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQS 808
              I  F+ E   L+L+ + + +    SL  F G+LLA +   ++LY     D G R+L  
Sbjct: 936  IHIYKFVNEGKSLELLHKTQVEEVPLSLCQFQGRLLAGVGSVLRLY-----DLGKRKLLR 990

Query: 809  ECGHHGHILALY-VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEIL 867
            +C +      +  + T  D I VGD+ +S     Y+ +E  +   A D    W++A   +
Sbjct: 991  KCENKLFPRTIVSIHTYRDRIYVGDMQESFHYCKYRRDENQLYIFADDSVPRWLTAANHI 1050

Query: 868  DDDIYLGAENNFNLFTVR------------------KNSEGATDEERGRLEVVGEYHLGE 909
            D D   GA+   N++  R                  K  +G  +    ++E + ++H+G+
Sbjct: 1051 DFDTMAGADKFGNIYFARLPQDLSDEIEEDPTGGKIKWEQGKLNGAPNKVEEIVQFHVGD 1110

Query: 910  FVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIG-VIASLPHEQYLFLEKLQTNLRKVI 968
             V   +  SL+        G    +I+GTV G +G ++A    E   F   L+ +LR+  
Sbjct: 1111 VVTCLQKASLI-------PGGGECLIYGTVMGSVGALLAFTSREDVDFFSHLEMHLRQEH 1163

Query: 969  KGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKR 1028
              + G +H  +RS          K+ +DGDL E +  L      +I+  ++ +  E+ K+
Sbjct: 1164 PPLCGRDHMAYRS-----AYFPVKDVIDGDLCEQYPSLPADMQRKIADELDRTPGEILKK 1218

Query: 1029 VEEL 1032
            +E++
Sbjct: 1219 LEDI 1222


>gi|229577321|ref|NP_001153352.1| splicing factor 3b, subunit 3 [Nasonia vitripennis]
          Length = 1216

 Score =  147 bits (372), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 177/782 (22%), Positives = 327/782 (41%), Gaps = 96/782 (12%)

Query: 1   MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
           ML V ++G I +L  FR  G  +D++ + ++  +  +L++   +  +  +   +   + G
Sbjct: 53  MLTVEVFGIIRSLMAFRLTGGTKDYIVVGSDSGRIVILEY-VPAKNVFEKVHQETFGKSG 111

Query: 61  -RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
            R    GQ   IDP  R  +IG      L  ++  D + +L  +  +   +   L    +
Sbjct: 112 CRRIVPGQYLAIDPKGRAVMIGAIEKQKLVYILNRDPEARLTISSPLEAHKSNTLVYHMV 171

Query: 118 ---YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNG 163
               G   P    L  D ++A    T E A+K +    F E     N+        L+  
Sbjct: 172 GVDVGFENPMFACLEIDYEEADTDPTGEAAIKTQQTLTFYELDLGLNHVVRKYSEPLEEH 231

Query: 164 ADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR--- 215
           A+ L+ VP     P   GVLI  E  + Y   N      IR  I +    +D D  R   
Sbjct: 232 ANFLVSVPGGNDGP--SGVLICSENYLTY--KNLGDQPDIRCPIPRRRNDLD-DPERGMI 286

Query: 216 --------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAV 261
                         +L     G +  + +  +++ VT +K++      +A+++  L    
Sbjct: 287 FVCSATHKTKSMFFFLAQTEQGDIFKITLETDEDVVTEIKLKYFDTVPVANSMCVLKTGF 346

Query: 262 VYIGSSYGDSQLIKL-------------NLQPDAKGSYVEVLERYV----------NLGP 298
           +++ S +G+  L ++             +  P  +G       R +          +L P
Sbjct: 347 LFVASEFGNHYLYQIAHLGDDDDEPEFSSAMPLEEGDTFFFAPRPLRNLVLVDEMDSLSP 406

Query: 299 IVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSST 357
           I+   V DL  +   Q+    G     +LR++R+G+ ++E A  EL G    +W+++   
Sbjct: 407 IMACQVADLANEDTPQLYIACGRGPRSTLRVLRHGLEVSEMAVSELPGNPNAVWTVKRRI 466

Query: 358 DDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSV 417
           D+ FD +++VSF++ T +L++   + +EE    GF   T TL C     + LVQV    +
Sbjct: 467 DEEFDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSALGEDALVQVYPDGI 524

Query: 418 RLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD-GILTE-VKHAQ 475
           R + +  R   NEWK+P   ++     N  QV++A  GG LVY E+   G L E  +  +
Sbjct: 525 RHIRADKR--VNEWKAPGKKTIIKCAVNQRQVVIALTGGELVYFEMDPTGQLNEYTERKK 582

Query: 476 LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSV-L 534
           +  E+ C+ +  +      S   AVG+  D +VRI SL   + +    +  + +P +   
Sbjct: 583 MPSEVMCMALGNVAAGEQRSWFLAVGL-QDNTVRIISLDPSDCLAPRSM--QALPAAAES 639

Query: 535 LCAFE-GIS--------------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPIT 579
           LC  E GI               +L   L +G LL  +L+  +G+L+D +   LG++P+ 
Sbjct: 640 LCIVEMGIKDDTSEDSNQMQSTLHLNIGLQNGVLLRTVLDPISGDLSDTRTRYLGSRPVK 699

Query: 580 LRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEG 639
           L     +    V A S R  + Y    +   + ++ + +     F+S   P+ +      
Sbjct: 700 LFRIRMQGNQAVLAMSSRSWLSYYYQNRFHLTPLSYESLEFASGFSSEQCPEGIVAISTN 759

Query: 640 ELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQEQSRTFAICSLKNQSCAEESEMHFVRLL 697
            L I  ++ +  +  + S P    PR+ + H E +    I +  N    E  +   +++ 
Sbjct: 760 TLRILALEKLGAVFNQVSFPFEYTPRKFVIHNESAHVLLIETEHNAYTEETKQQRRLQMA 819

Query: 698 DD 699
           ++
Sbjct: 820 EE 821



 Score = 77.0 bits (188), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 66/275 (24%), Positives = 126/275 (45%), Gaps = 39/275 (14%)

Query: 779  FNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHI--LALYVQTRGDFIVVGDLMKS 836
            + G++L  + + ++LY     D G ++L  +C  + HI    + +   G  I V D+ +S
Sbjct: 954  YQGRVLVGVGKMLRLY-----DMGKKKLLRKC-ENKHIPNAVICINAIGQRIYVSDVQES 1007

Query: 837  ISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSE--GATDE 894
            +  + YK +E  +   A D +  W++   +LD D    A+   N+  +R  S      DE
Sbjct: 1008 VYAVRYKRQENQLIVFADDTHPRWITTTCVLDYDTVATADKFGNIAVIRLASSINDDVDE 1067

Query: 895  ---------ERGRL-------EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGT 938
                     +RG L       + V  +H+GE V   +  +L+        G   ++++ T
Sbjct: 1068 DPTGNKALWDRGLLNGASQKADTVACFHVGETVMSLQKATLI-------PGGSESLVYTT 1120

Query: 939  VNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDG 997
            ++G +GV+     HE + F + L+ ++R     + G +H  +RS+         KN +DG
Sbjct: 1121 LSGTVGVLVPFTSHEDHDFFQHLEMHMRAEHPPLCGRDHLSFRSY-----YYPVKNVIDG 1175

Query: 998  DLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            DL E F  +   +   IS  +  +  E+ K++E++
Sbjct: 1176 DLCEQFNSIEPGKQKSISGDLERTPSEVSKKLEDI 1210


>gi|443896643|dbj|GAC73987.1| predicted DNA methylase [Pseudozyma antarctica T-34]
          Length = 1285

 Score =  147 bits (371), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 247/1174 (21%), Positives = 476/1174 (40%), Gaps = 169/1174 (14%)

Query: 7    YGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTDN 65
            +G I +L  FR  G ++D+L + ++  +  +L++   SS    +   +   R G R    
Sbjct: 126  FGVIRSLAAFRLTGGSKDYLIVGSDSGRIVILEYQP-SSNSFDKVHQETFGRSGSRRIVP 184

Query: 66   GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLK-----EAF--NIRLEELQVLDIKF 116
            GQ    DP  R  +IG      L  ++  D +  L      EA   N  +  +  +D+ F
Sbjct: 185  GQYLATDPKGRATMIGAMEKAKLVYILNRDAEANLTISSPLEAHRPNGIIHHIVGVDVGF 244

Query: 117  ---LYGCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNL--------DNGAD 165
               L+ C +   V   + ++DA   + +E A K   + E     N++        D  ++
Sbjct: 245  ENPLFACLE---VDNSESDRDASG-RAFEEAEKTLTYYELDLGLNHVVRRWAEPVDPRSN 300

Query: 166  LLIPVPPPL----------CGVLIIGEETIVYCSANAFK-AIPI--------RPS----- 201
            LLI VP              GVL+  E+ I Y   +  +  +PI        RP+     
Sbjct: 301  LLIQVPGGYNQNQEKWDGPSGVLVCSEDYITYKHQDQPEHRVPIPKRLNPIERPNERRGT 360

Query: 202  -ITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNA 260
             I  +      +   +L+    G L  + + H+ +++  L+I+      +AS +  L + 
Sbjct: 361  LIVASVLHKMKNAFFFLVQTEDGDLFKVTMDHQDDEIRALRIKYFDTVPVASGLCILRSG 420

Query: 261  VVYIGSSYGDSQLIKLN------------------------------LQPDAKGSYVEVL 290
             +++ S +G   L                                    P    + ++V 
Sbjct: 421  FLFVASEFGPQLLYSFQKLGDDDELPEYISTDYDDYGAGRRRPQLPTFTPRPLDNLMQVD 480

Query: 291  ERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KG 349
            E   +L PI+D   ++       Q+    G     S +++R+G+   E  S +L G+   
Sbjct: 481  E-VPSLDPILDATPLNPLASDSPQIFAACGRGARSSFKMLRHGLEALEAVSSDLPGVPSA 539

Query: 350  MWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQL 409
            +W+ + +  D +D+++V+SF++ T +L++   + +EE    G  + + TL       + L
Sbjct: 540  VWTTKITRRDEYDSYIVLSFVNGTLLLSIG--ETIEEVSDSGLLTSSSTLAVQQLGEDAL 597

Query: 410  VQVTSGSVRLVSSTSRELRNEWKSP--PG--YSVNVAT-ANASQVLLATGGGHLVYLEIG 464
            +QV    +R +    +++ NEW +P  P    +  VAT  N  QV++A     LVY E+ 
Sbjct: 598  LQVHPHGIRHIL-VDKQI-NEWVTPSLPNGRQTTIVATCTNERQVVVALSSNELVYFELD 655

Query: 465  -DGILTEVKHAQ-LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKE 522
             DG L E +  + +   +  + +    E    +   AVG   D +VRI SL   + +   
Sbjct: 656  MDGQLNEYQERKAMGAPVLTMSMPECPEGRQRTAYLAVGCG-DSTVRIVSLEPSSTLASI 714

Query: 523  HLGGEIIPRSVLLCA---------FEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSL 573
             +     P S +  A             +++   L +G LL  +L+  TG+LTD +   L
Sbjct: 715  SIQALTAPASSICMAEMHDSTVDRHHATTFVNIGLQNGVLLRTVLDGVTGQLTDTRTRFL 774

Query: 574  GTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSL 633
            G++ + L        T V A S R  + ++   ++ ++ +    + H   F++   PD L
Sbjct: 775  GSKAVRLVRTRVHGQTAVMALSTRTWLSFTYQSRVQFTPLIFDALDHAWSFSAELCPDGL 834

Query: 634  AIAKEGELTIGTIDDI-QKLHIRSIPLGEHPRRIC-HQEQSRTFAICSLKNQSCAEESEM 691
                   L I TI  +  KL   S+ L   PRR+  H +    F +    +++ +  ++ 
Sbjct: 835  IGIVGSTLRIFTIASLASKLKQDSVALSYTPRRMASHPDGQGLFYVAEADHRTLSPGAQR 894

Query: 692  HFVRLLDDQTFEFI-STYPLDTFEYGC------SILSCSFS--------------DDSNV 730
              V  LD +          LD  E+G       +  SC                 DD+  
Sbjct: 895  RRVEALDKELKPHQRGVLDLDPAEFGSIRAEAGNWASCVRVVDGVNAQTTHRIELDDNEA 954

Query: 731  YYCVGTAYVLPEENE--------------P---TKGRILVFIVEDG--KLQLIAEKETKG 771
             + +        + +              P    K  +  + + +G  +L+++ + E   
Sbjct: 955  AFSIAVVPFASADKQLFVVVGSAVEVVMSPRSFKKAYLTTYRLGNGGRELEVVHKTEVDD 1014

Query: 772  AVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY-VQTRGDFIVV 830
                L AF G+LLA + + +++Y+      G ++L  +C +     A+  +  +G  IVV
Sbjct: 1015 VPLVLRAFQGRLLAGVGKVLRIYEL-----GKKKLLRKCENRSFPTAIVALDAQGSRIVV 1069

Query: 831  GDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR--KNS 888
            GD+ +S+    YK  E  +   A D    +++   +LD D    A+   N++ VR   ++
Sbjct: 1070 GDMQESVIFASYKPLENRLVTFADDIMPRYVTRCTMLDYDTVAAADKFGNVYVVRIDADT 1129

Query: 889  EGATDEERGRLE-------VVGEYHLGEFVNRFRHGSLVMRLPDSDV--GQIPTVIFGTV 939
              + DE+   +        ++G  H    +  +  G ++  L  + +  G    +++  +
Sbjct: 1130 SRSVDEDVTGMTTMHEKPLLMGAAHKATLLAHYFVGDIITSLSRAVMVPGGREVLLYTGI 1189

Query: 940  NGVIGVIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGD 998
            +G IG +   +  E    +  L+  LR+    + G +H  +RS          K+ +DGD
Sbjct: 1190 SGTIGALVPFVSKEDVDTMTTLEMQLRQQSDSLVGRDHLAYRS-----SYAPVKHVIDGD 1244

Query: 999  LIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            L ESF  L   +   +++ ++    E+ K++ +L
Sbjct: 1245 LCESFGLLPPAKQSAVAQELDRKPSEVNKKLAQL 1278


>gi|451852814|gb|EMD66108.1| hypothetical protein COCSADRAFT_34693 [Cochliobolus sativus ND90Pr]
          Length = 1235

 Score =  147 bits (371), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 227/1002 (22%), Positives = 394/1002 (39%), Gaps = 157/1002 (15%)

Query: 155  WSQNNLDNGADLLIPVP---PPLCGVLIIGEETIVY------CSANAFKAIPIRPSITKA 205
            WS+  +D  A+ L  VP       GVL  GE++I Y       S+    AIP R   T+ 
Sbjct: 224  WSEP-VDRTANTLFRVPGGPSAPSGVLCCGEDSITYRRIFNNKSSVRRLAIPRREGATED 282

Query: 206  YGR-----------VDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTI 254
              R           +      YLL    G +  L +      V  +KI+      + ++I
Sbjct: 283  PNRKRMIVAGTLYSLKGGDFFYLLQTEDGDVFKLTVDAPNGTVENIKIKYFDTIPVTTSI 342

Query: 255  SYLDNAVVYIGSSYGDSQLIKL---------------NLQPDAKGSY------------V 287
              L    VY     GD  L +L                   D + S+            +
Sbjct: 343  CILRAGFVYAACESGDRILYELESLGDETDDPVFESSQFPVDPEASFAPPFFRPRALVNL 402

Query: 288  EVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVEL-QG 346
              +E   +L PI+D  V +   +   Q+ T +G     S R  RN + + +     L Q 
Sbjct: 403  TAVEAMPSLNPIMDMEVANPALEDAPQIYTINGTGGRSSFRTTRNALEVLDLIESPLPQN 462

Query: 347  IKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIY 406
               +W+ + +++D  DT +V+     +R L + + D++EE    GF   T TL       
Sbjct: 463  ASDVWTTKLTSEDETDTLIVLCL--HSRTLVLKIGDDVEEASNTGFLPDTNTLGVQQFGE 520

Query: 407  NQLVQVTSGSVRLV---------SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGH 457
            + ++Q+    +R +         +S +     +W+ P   ++     N  QV +A   G 
Sbjct: 521  DCIIQIHPKGIRHIQGIQFPNDDASATHASLTDWQPPAHRTIVACATNNRQVAIALSSGQ 580

Query: 458  LVYLEI-GDGILTEVKHA-QLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-P 514
            ++Y E   DG L   +    LE  I+CL +  + E    +   AVG  +D +VRI++L P
Sbjct: 581  ILYFECDSDGSLAMAEEEIVLESTINCLAMPDVPEGSVRAFFLAVGC-SDQTVRIYNLSP 639

Query: 515  DL--NLITKEHLGGEIIPRSVLLC------AFEGIS-YLLCALGDGHLLNFLLNMKTGEL 565
            D+  N++    +     P S L        +  G S +L   L  G  +  +L+  TG++
Sbjct: 640  DMEGNILRSISVQALTSPPSDLTINLMTDKSSRGYSQFLHIGLRSGVYIRSVLDEMTGDI 699

Query: 566  TDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKK--LLYSNVNLKEVSHMCP 623
             D ++  LG +PI     +      + A + RP + Y+  +   L  + +N         
Sbjct: 700  GDTRRRFLGPEPIKFAKVTVAGEPAILAMTSRPWLGYTHPRTGVLQLTPLNYISFKSAWN 759

Query: 624  FNSAAFPDSLAIAKEGELTIGTIDDI-QKLHIRSIPLGEHPRRIC-HQEQSRTFAICSLK 681
            F+ + F   + ++   EL I T +D+       SIPL   PR++  + +Q   + I S  
Sbjct: 760  FDGSQFKGIICVSAN-ELRIFTFNDLTDNTTYESIPLKYTPRKMVGYHDQGVFYVIQSDN 818

Query: 682  N------------QSCAEESEMHFVRLLDDQTFEFIST--YPLDTFEY------------ 715
            N            QSC ++ +     +  +Q+     +  +P   F Y            
Sbjct: 819  NTISADRRQQLIAQSCGKKEDGTNGSMETEQSNGAADSDEFPAVDFGYPRAQGSWASCIQ 878

Query: 716  ------------------GCSILSCSF----SDDSNVYYCVGTAYVL---PEENEPTKGR 750
                                S++S +     S +   +  VGTA  L   P +      +
Sbjct: 879  VVDPVTEKTVTHTVELNGNISLVSAALVFFESRNDEAFLAVGTAKDLSFTPYKFSSASIQ 938

Query: 751  ILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSEC 810
            I        +L+   E        +L AF GKLLA + + + LY   ++    R+ Q+  
Sbjct: 939  IYKINPTGRELEFFHETTVSDPPLALLAFKGKLLAGVGRHLCLYDCGMK-SVLRKAQAPN 997

Query: 811  GHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHE--EGAIEERARDYNANWMSAVEILD 868
                 I  L  +T+G  +VV D  +S++ +++K +     +     D  A   +A E+LD
Sbjct: 998  CVPTRITGL--KTQGSRLVVSDQAQSVTYVVHKDQVHPNRLIPFVDDTIARHTTASEMLD 1055

Query: 869  DDIYLGAENNFNLFTVR---KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPD 925
             D  +G +   N++ VR   K SE + +   G   +V + +LG   NR     L+     
Sbjct: 1056 YDTTVGGDKFGNIWLVRCPQKVSESSDESPDGSDLLVDKSYLGGTPNRL---DLIAHYFT 1112

Query: 926  SDV-----------GQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGG 973
            +D+           G    V +  + G +G +        +   ++L+  LR   K + G
Sbjct: 1113 NDIPVSIQKTVLLSGGERVVFWAGLQGTLGALIPFNSRRSHKMFQQLELQLRSDDKPLSG 1172

Query: 974  LNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEIS 1015
             +H  +RS+    K+V     +DGDLIE FL L R + + I+
Sbjct: 1173 RDHLAFRSYFAPVKSV-----IDGDLIERFLVLPRDKRESIA 1209


>gi|428180132|gb|EKX49000.1| hypothetical protein GUITHDRAFT_105085 [Guillardia theta CCMP2712]
          Length = 1207

 Score =  146 bits (368), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 234/1054 (22%), Positives = 433/1054 (41%), Gaps = 173/1054 (16%)

Query: 124  TIVVLYQDNKD---------ARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPL 174
            +I V Y DN +         +R V  YEV + +   +E  ++ + +DN ++LLIPV   L
Sbjct: 184  SIEVDYVDNPNDEFQDPVVSSRRVCLYEVDMANNS-LEVRYN-DFIDNSSNLLIPVYNHL 241

Query: 175  CG---VLIIGEETIVYCSANAFKAIPIRPS----------ITKAY--GRVDADGSRYLLG 219
             G   VLI  E  I   S +A + + + P           I  +Y    +  D S ++  
Sbjct: 242  SGLFGVLICAENKIALKSPDAEEIVLLIPRRSMLPLEQSVIITSYVLMGIQDDLSIFIAQ 301

Query: 220  DHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLN-- 277
            +  G L+ + ++    +   L IE L    +A ++  + +  + + S +G+  L   +  
Sbjct: 302  NDIGDLYKITVSLRCTQDHVLGIEYLDTVPVAQSMIIVRDQYLLLCSEFGNHILFSFSAS 361

Query: 278  -------------LQPDAKG-----------SYVEVLERYVNLGPIVDFCVVDLERQGQG 313
                         +Q D +             Y  V+E   +L PIV   V+DL  +G  
Sbjct: 362  EVGQVTSTIGLTSVQFDDENIDIPNFLPHNLQYFRVVEEIESLSPIVGMKVMDLYSEGNF 421

Query: 314  QVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIK-GMWSLRSSTDDPFDTFLVVSFI-S 371
            QV T  G     S+R++R+G+ + E A  +L  +   +W+L++++ DP D ++VVS   S
Sbjct: 422  QVFTLCGKGPRSSIRVLRHGLSVVEMAISQLPTVPVAVWTLKANSTDPHDRYIVVSLAHS 481

Query: 372  ETRILAMNLEDELEETEIEGFCSQTQTLF------CHDAIYNQLVQVTSGSVRLVSSTSR 425
              + L +++ + +E     G     +++       C   ++   +QV  G   + S  + 
Sbjct: 482  SVQTLVLSIGETVEAVANHGLLPHAKSISIATMGDCMIQVHTNGIQVMKGGKNIPSPFAS 541

Query: 426  ELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTEVKHAQLE-YEISCL 483
                   +P   SVN    N  Q+L+    G   YLE+ G G L E     +   E+  +
Sbjct: 542  ------SNPVILSVN----NEQQMLVTQTDGVTSYLEMEGTGTLREKGKININAQEVCAV 591

Query: 484  DINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEII----PRSV-LLCAF 538
            ++ P+  + +  + A +G + D +     +  L+  +   + G  +    P S+ LL ++
Sbjct: 592  EVTPLSASQTLGKFAIMGAFMD-NAWFLCVVSLDASSFSSVVGRQVLQARPSSIALLQSY 650

Query: 539  --------EGISYLLCALGDGHLLNFLLNMKTGELTDRKKV-SLGTQPITLRTFSSKNTT 589
                      + +L   L +G L+    N +TGE++   +  SLG+ P+ L     +   
Sbjct: 651  SRSSREPGRSVFFLYVGLENGVLMRMSFNAETGEISQEFRTRSLGSNPVKLVRVKVQEKE 710

Query: 590  HVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDI 649
             + A S R  + Y    K     ++   +     F+S   P+      +G L I +++ +
Sbjct: 711  ALLALSSRSWLAYHHQGKQCLDPLSYDMLDFAWNFSSQQCPEGFVCISQGSLRILSLERV 770

Query: 650  QKLHIRSIP-LGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLD----DQTFEF 704
             ++  ++   L   PR+    + +    +    + + A ++       +D    D   + 
Sbjct: 771  GEIFSQATAKLAHTPRQCAPIKGTTMMMVIESDHNTDANKALQDGANKVDMEGQDGEADK 830

Query: 705  IS--TYPLDTFEYGC-----------SILS---CSFSDD-SNVYYC------------VG 735
            +S   + +     GC           S++S    ++SDD + +  C            VG
Sbjct: 831  LSYEMFGVQRAGNGCWASSLRVIDMDSLVSRQIINYSDDEAALSLCSSVGANGEHLVLVG 890

Query: 736  TAYVLPE-ENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLY 794
            T+  L   E   + G +  + V    L L  +    G   ++  + G++L  +   +++Y
Sbjct: 891  TSVGLRMGEKHASSGFVYTYSVSGSHLHLEHKTPMDGIPRAICNYQGRVLVGVGSALRMY 950

Query: 795  ----KWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIE 850
                K +LR   +R+  +        L   + T+GD I VGD  +S SLL +     + E
Sbjct: 951  EIGKKKLLRKCESRKFPN--------LICSIHTQGDRIFVGDSAESFSLLRFNSLSNSFE 1002

Query: 851  ERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR-----------KNSEGATDE--ERG 897
              A D  A W++A   LD D   GA+   NLF  R           +     TD    RG
Sbjct: 1003 LFAEDARARWLTASCPLDFDTVAGADKFGNLFICRIPEEAASELEEEGEAVQTDSVLHRG 1062

Query: 898  ---RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIAS-LPHEQ 953
               +L+ V   ++GE V   +  +LV        G    VI+GT+ G +G +   +  E 
Sbjct: 1063 AKHKLDEVAHQYVGEAVLGLQRSALVQ-------GGTEAVIYGTILGGLGALQPFVSKED 1115

Query: 954  YLFLEKLQTNLRKVIKGVGGLNHEQWRSFNN-----------EKKTVDAKNFLDGDLIES 1002
              F  +L+  LR  +     +N    +SF+N                  +  +DGDL E+
Sbjct: 1116 VDFFLRLEMLLRGNLGVRESVND---KSFDNPSLCGNDQLGFRSYFAPMRGVVDGDLCET 1172

Query: 1003 FLDLSRTRMDEISKTMNVSVEELCKRVEEL-TRL 1035
            F  L   R  +++  +  + EE+ K++E++ TRL
Sbjct: 1173 FHTLDSGRQHKVAAELGKTPEEVAKKIEDMRTRL 1206


>gi|396488712|ref|XP_003842924.1| similar to pre-mRNA splicing factor 3b [Leptosphaeria maculans JN3]
 gi|312219502|emb|CBX99445.1| similar to pre-mRNA splicing factor 3b [Leptosphaeria maculans JN3]
          Length = 1236

 Score =  145 bits (365), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 227/1000 (22%), Positives = 395/1000 (39%), Gaps = 152/1000 (15%)

Query: 155  WSQNNLDNGADLLIPVP---PPLCGVLIIGEETIVY-----CSANAFK-AIPIR--PSIT 203
            WS+  +D  A+ L  VP       GVL+ GE+ I Y        N  + AIP R  P+  
Sbjct: 230  WSEP-IDRTANTLFRVPGGANAPSGVLVCGEDNITYRRIFNQKENVHRLAIPRREGPTED 288

Query: 204  KAYGRVDADGSRYLL--GDHAGLLHL-------LVITHEKEKVTGLKIELLGETSIASTI 254
                R+   G+ Y L  GD   LL         L +     KV  +KI+      + ++I
Sbjct: 289  PNRKRMIVAGTLYSLKGGDFFYLLQTEDGDVFKLTVEASNGKVEKIKIKYFDTIPVCTSI 348

Query: 255  SYLDNAVVYIGSSYGDSQLIKLN-----------------LQPDAKG----------SYV 287
              L    VY     GD  L +L                  +  DAK           + +
Sbjct: 349  CILRAGFVYAACESGDRILYELESLGDETEDPLFSSDQFPVDTDAKFAPPYFKPRALTNL 408

Query: 288  EVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVEL-QG 346
              +E+  +L PI+   V +   +   Q+ T +G     S R  RN + + +     L Q 
Sbjct: 409  TAVEQMPSLNPIMGMEVANPALEDAPQIYTINGTGGRSSFRSTRNALEVLDLIESPLPQN 468

Query: 347  IKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIY 406
               +W+ + ++DD  DT +V+     +R L + + D++EE    GF   T TL       
Sbjct: 469  ASDVWTTKLTSDDETDTLIVLCL--HSRTLVLKIGDDVEEASNTGFLPDTNTLGVQQFGE 526

Query: 407  NQLVQVTSGSVRLV---------SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGH 457
            + ++Q+    +R +         ++ + +   +W+ P   ++     N  QV +A   G 
Sbjct: 527  DCIIQIHPKGIRHIQGIQFPDNDANATHQSLTDWQPPAHRTIVACATNNRQVAIALSSGQ 586

Query: 458  LVYLEI-GDGILTEVKHA-QLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-P 514
            ++Y E   DG L   +    L+  I+CL +  + E    +   AVG  +D +VRIF+L P
Sbjct: 587  ILYFECDSDGSLAMAEEEIALDSTINCLAMPDVPEGSVRAFFLAVGC-SDQTVRIFNLSP 645

Query: 515  DL--NLITKEHLGGEIIPRSVLLCAF------EGIS-YLLCALGDGHLLNFLLNMKTGEL 565
            D+  N++    +     P S L           G S +L   L  G  +  +L+  TG++
Sbjct: 646  DMEGNILRSISVQALTSPPSDLTINLMSDKSPRGYSQFLHIGLRSGVYIRSVLDEMTGDI 705

Query: 566  TDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKK--LLYSNVNLKEVSHMCP 623
             D ++  LG +PI     +      + A + RP + Y+  +   L  + +N         
Sbjct: 706  GDTRRRFLGPEPIKFAKVTVAGEPAILAMTSRPWLGYTHPRTGVLQLTPLNYIAFKSAWN 765

Query: 624  FNSAAFPDSLAIAKEGELTIGTIDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKN 682
            F+ + F   + ++   EL I T +D+       +IPL   PR++    +   F I   + 
Sbjct: 766  FDGSQFKGIICVSAN-ELRIFTFNDLTDNTTYENIPLKYTPRKMVGYHEQGIFYIIESEC 824

Query: 683  QSCAEESEMHFVRLLDDQT----------FEFISTYPLDTFEY--------GC------- 717
             +   ++  + ++    Q            E     P   F Y         C       
Sbjct: 825  NTLNADTRQNLIQKASKQDENGNGEATNGTEESDELPAVDFGYPRAQGRWASCIQAVDPV 884

Query: 718  ---------------SILSCSF----SDDSNVYYCVGTAYVL---PEENEPTKGRILVFI 755
                           S++S +     S   + +  VGTA  L   P +      +I    
Sbjct: 885  SEKAVTHTIELKTNQSLVSAALVFFESRGEDAFLAVGTAKDLSFTPYKYSGASIQIYKIS 944

Query: 756  VEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGH 815
                + +   E E      +L AF GKL+A + + + LY   ++    R+ Q+       
Sbjct: 945  PNGREFEFFHETEVGEPPLALLAFKGKLIAGVGRHLCLYDCGMK-SVLRKAQAPNCVATR 1003

Query: 816  ILALYVQTRGDFIVVGDLMKSISLLIYKHE--EGAIEERARDYNANWMSAVEILDDDIYL 873
            I    ++T+G  +VV D  +S++ +++K +     +   A D      SA ++LD +  +
Sbjct: 1004 ITD--IKTQGSRLVVSDQAQSVTYVVHKDQVHPNRLIPFADDTVPRHTSASDMLDYETTV 1061

Query: 874  GAENNFNLFTVR---KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV-- 928
            G +   N++ VR   K SE + +   G   +V +  LG   NR     LV     +D+  
Sbjct: 1062 GGDKFGNIWLVRCPPKVSEASDESPDGSDLLVDKSFLGGTPNRL---DLVAHYFANDIPV 1118

Query: 929  ---------GQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQ 978
                     G    + +  + G +G +       Q+   ++L+  LR+  K + G +H  
Sbjct: 1119 SIQKTVLLSGGERIIFWAGLQGTLGALIPFNSRRQHKMFQQLELQLRQDDKPLSGRDHLA 1178

Query: 979  WRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTM 1018
            +RS+    K V     +DGDLIE FL LSR + + I+  M
Sbjct: 1179 YRSYFAPVKCV-----IDGDLIERFLVLSRDKRESIAGQM 1213


>gi|147787360|emb|CAN64633.1| hypothetical protein VITISV_043788 [Vitis vinifera]
          Length = 1143

 Score =  144 bits (364), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 167/753 (22%), Positives = 315/753 (41%), Gaps = 109/753 (14%)

Query: 1   MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
           +L V I+G I +L  FR  G  +D++ + ++  +  +L+++ E + +  +   +   + G
Sbjct: 34  ILSVEIFGAIRSLAQFRLTGSQKDYIVVGSDSGRIVILEYNKEKN-VFDKIHQETFGKSG 92

Query: 61  -RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
            R    GQ   IDP  R  +IG      L  V+  D   +L  +  +   +   +     
Sbjct: 93  CRRIVPGQYLAIDPKGRAVMIGACEKQKLVYVLNRDTVARLTISSPLEAHKSHTIVYSIT 152

Query: 118 ---YGCAKPTIVVLYQDNKDA-------------RHVKTYEVALKDKDFVEGPWSQNNLD 161
               G   P    +  D  +A             +H+  YE+ L   + V   WS+  +D
Sbjct: 153 GVDCGFDNPIFAAIELDYSEADQDSTGQAASEAQKHLTFYELDL-GLNHVSRKWSEQ-VD 210

Query: 162 NGADLLIPVP---PPLCGVLIIGEETIVYCS---ANAFKAIPIRPSITKAYGR--VDADG 213
           NGA++L+ VP       GVL+  E  ++Y +    +    IP R  +    G   V A  
Sbjct: 211 NGANMLVTVPGGGDGPSGVLVCAENFVIYKNQGHPDVRAVIPRRADLPAERGVLIVSAAT 270

Query: 214 SR------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSS 267
            R      +LL    G +  + + HE ++++ LKI+      + S++  L +  ++  S 
Sbjct: 271 HRQKSMFFFLLQTEYGDVFKVTLEHENDRISELKIKYFDTIPVTSSMCVLKSGFLFAASE 330

Query: 268 YGDSQLIKLN--------------------------LQPDAKGSYVEVLERYVNLGPIVD 301
           +G+  L +                             QP    + V + ++  +L PI+D
Sbjct: 331 FGNHGLYQFQAIGDDADVESSSASLMETEEGFQPVFFQPRGLKNLVRI-DQVESLMPIMD 389

Query: 302 FCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSLRSSTDDP 360
             V +L  +   Q+    G     S+RI+R G+ I+E A  +L G+   +W+++ + +D 
Sbjct: 390 MKVSNLFEEETPQIFALCGRGPRSSIRILRPGLAISEMAVSQLPGVPSAVWTVKKNVNDE 449

Query: 361 FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLV 420
           FD ++VVSF + T +L++   + +EE    GF   T +L       + L+QV    +R +
Sbjct: 450 FDAYIVVSFANATLVLSIG--ETVEEVSDSGFLDTTPSLAVSLIGDDSLMQVHPSGIRHI 507

Query: 421 SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEI 480
               R   NEW++P   ++    +N  QV++A  GG L+Y E                  
Sbjct: 508 REDGR--INEWRTPGKRTIVKVGSNRLQVVIALSGGELIYFE------------------ 547

Query: 481 SCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PD--LNLITKEHLGGEIIPRSVLLCA 537
               ++  G+    S+  AVG + D ++RI SL PD  + +++ + +     P S+L   
Sbjct: 548 ----VDMTGQLMERSRFLAVGSY-DNTIRILSLDPDDCMQILSVQSVSSP--PESLLFLE 600

Query: 538 FEGIS------------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSS 585
            +               +L   L +G L   +++M TG+L+D +   LG +   L +   
Sbjct: 601 VQASVGGEDGADHPASLFLNAGLQNGVLFRTVVDMVTGQLSDARSRFLGLRAPKLFSVIV 660

Query: 586 KNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGT 645
           +    +   S RP + Y      L + ++ + +     F+S    + +       L + T
Sbjct: 661 RGRRAMLCLSSRPWLGYIHQGHFLLTPLSYETLEFAASFSSDQCAEGVVAVAGDALRVFT 720

Query: 646 IDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAI 677
           I+ + +  +   IPL   PR+   Q + +   +
Sbjct: 721 IERLGETFNETVIPLRYTPRKFVLQPKRKLLVV 753



 Score = 74.3 bits (181), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 93/351 (26%), Positives = 154/351 (43%), Gaps = 46/351 (13%)

Query: 694  VRLLDDQTFEFISTYPLDTFEYGCSILSCSFSD-DSNVYYCVGTAYVLP--EENEPTKGR 750
            +R+LD +T        L   E   SI + +F D +      VGTA  L    +     G 
Sbjct: 822  IRILDPRTATTTCLLELQDNEAAFSICTVNFHDKEYGTLLAVGTAKSLQFWPKRSFDAGY 881

Query: 751  ILVF-IVEDGK-LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQS 808
            I ++  +EDGK L+L+ + + +G   +L  F G+LLA I   ++LY     D G R L  
Sbjct: 882  IHIYRFLEDGKSLELLHKTQVEGVPLALCQFQGRLLAGIGSVLRLY-----DLGKRRLLR 936

Query: 809  ECGHH---GHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVE 865
            +C +      I++++  T  D I VGD+ +S     Y+ +E  +   A D    W++A  
Sbjct: 937  KCENKLFPNTIVSIH--TYRDRIYVGDIQESFHYCKYRRDENQLYIFADDSVPRWLTASY 994

Query: 866  ILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPD 925
             +D D   GA+   N++ VR   +  +DE       V E   G  + ++  G L      
Sbjct: 995  HIDFDTMAGADKFGNIYFVRL-PQDVSDE-------VEEDPTGGKI-KWEQGKL------ 1039

Query: 926  SDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNE 985
                  P  +   V   +G + +   +  L     + ++R+    + G +H  +RS    
Sbjct: 1040 ---NGAPNKVEEIVQFHVGDVVTCLQKASLIPGGWRMHMRQEHPPLCGRDHMAYRS---- 1092

Query: 986  KKTVDAKNFLDGDLIESF----LDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
                  K+ +DGDL E F    LDL R   DE+ +T      E+ K++EE+
Sbjct: 1093 -AYFPVKDVIDGDLCEQFPTLPLDLQRKIADELDRTPG----EILKKLEEV 1138


>gi|303285956|ref|XP_003062268.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226456679|gb|EEH53980.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 1213

 Score =  144 bits (364), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 172/733 (23%), Positives = 304/733 (41%), Gaps = 80/733 (10%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
           ++G + +L  FR  G  +D++ + ++  +  VL +D E S  +            R    
Sbjct: 57  VFGVVRSLSAFRLTGANRDYVVVGSDSGRIVVLSFDKEKSAFVKVHQETFGKSGCRRIVP 116

Query: 66  GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLY---GC 120
           GQ   +DP  R  +IG      +  V+  D    L  +  +   +  V+         G 
Sbjct: 117 GQFLAVDPKGRAVMIGAIEKSKIVYVLNRDTSANLTISSPLEANKSHVITFAVCALDCGL 176

Query: 121 AKPTIVVLYQDNKDA-------------RHVKTYEVALKDKDFVEGPWSQNNLDNGADLL 167
             P    +  D  DA             +H+  YE+ L     V   WS+  +DNGA+ L
Sbjct: 177 DNPQFAAVELDYGDADQDPTGEAAAEAQKHLVFYELDLGLNHVVR-KWSEA-IDNGANHL 234

Query: 168 IPVPPPLCG---VLIIGEETIVYCS---ANAFKAIPIRPSITKAYGRVDADGSR------ 215
           I VP    G    L+  E  I+Y +   A+    IP R S++   G +    +       
Sbjct: 235 IAVPGGGDGPGGCLVCAENFIIYKNEGHADVRAVIPRRSSLSGDRGVLIVSSATHRTKQQ 294

Query: 216 --YLLGDHAGLLHLLVI--THEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDS 271
             +L     G ++ + +  T   E V+ +KI+        +++  L    ++  S + + 
Sbjct: 295 FFFLAQSEYGDVYKVTVEWTPGSETVSEVKIKYFDTIPPCASLCVLKTGFLFAASEFSNH 354

Query: 272 QLIKLN-------LQPDAKGSYVEVLERYV-------------------NLGPIVDFCVV 305
            L +             +  + VE  E Y                    +L P++D    
Sbjct: 355 ALYQFQGIGDDDDDVESSSATLVETDEGYQPVFFEPRALRNLHPIDEIESLCPVLDAQCH 414

Query: 306 DLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTDDPFDTF 364
           +L  +   QV    GA    +LR++R G+ ++E A   L G    +++++ +  D FD +
Sbjct: 415 NLTSEETPQVYALCGAGSRSTLRVLRQGVALSEMAVSPLPGNPNAVFTVKKNIADEFDAY 474

Query: 365 LVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTS 424
           +VVSF++ T +L++   + +EE    GF   T TL       + L+Q   G +R V +  
Sbjct: 475 IVVSFVNATLVLSIG--ETVEEVSDSGFLGTTPTLSASLLGDDSLLQAHPGGLRHVRADK 532

Query: 425 RELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTEVKHAQLEYEISCL 483
           R   NEW+ P   ++     N  QV++A  GG ++Y E+   G L E++  +   +++CL
Sbjct: 533 R--INEWRCPGRKTITRVACNNRQVVIALSGGEIIYFELDSTGQLMEIEKLETNGDVACL 590

Query: 484 DINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPRSVL-LCAFEGI 541
            + P+ E    ++  AVG + D +VR+ SL  D  L T         P S+L L   EG 
Sbjct: 591 HVGPVPEGSLRNRFLAVGSF-DSTVRVLSLSADDCLQTMGVQALAAAPCSLLMLQTREGG 649

Query: 542 SYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVI 601
            YL   L +G LL   ++  TG+L+D +   LG +P  L   + +    + A S RP + 
Sbjct: 650 LYLNVGLANGVLLRAEVDRVTGQLSDTRARFLGARPPKLHGATVRGQPAMLALSSRPWLG 709

Query: 602 YSS-NKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDI------QKLHI 654
           +    ++   + ++ + + H   F S   P+ +       L I +++ +          +
Sbjct: 710 HVDLQRRFALAPLSYEALEHAADFTSDQCPEGVVAVAGSTLRIVSVERLGDAFNQTSCKL 769

Query: 655 RSIP--LGEHPRR 665
           R  P  L  HP R
Sbjct: 770 RYTPRQLTVHPDR 782



 Score = 83.6 bits (205), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 81/332 (24%), Positives = 138/332 (41%), Gaps = 43/332 (12%)

Query: 724  FSDDSNVYYCVGTAYVLPEENEPTKGRI--LVFIVEDGKLQLIAEKETKGAVYSLNAFNG 781
            F   + V+  VGTA  L        G    L   +EDG + L+ +    G   ++  F G
Sbjct: 897  FPVANEVFLAVGTAVGLTFAPRDCDGGFVHLYRYLEDGTVTLVHKTPLDGVPGAMCGFKG 956

Query: 782  KLLAAINQKIQLY----KWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSI 837
            +LL      ++LY    K +LR    R   +            V   GD I VGD+ +S 
Sbjct: 957  RLLLGCGNALRLYDFGKKKLLRKVENRNFPN--------FITTVHASGDRIYVGDVQESF 1008

Query: 838  SLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR------------ 885
              + YK E+ ++   A D     ++A   LD D   GA+   N+F  R            
Sbjct: 1009 HFVKYKREDLSLIIVADDVQPRHITAALPLDYDTMAGADKFGNVFVARLAQDVSADIEED 1068

Query: 886  ----KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNG 941
                K S G  +    ++E V ++H+GE V     G+L         G +  +++ T+ G
Sbjct: 1069 PTGGKASGGTLNGAPRKVEHVAQFHVGETVCALTKGTL-------QAGGLECMLYATLMG 1121

Query: 942  VIGVIASLPHEQYL-FLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLI 1000
             +G +        + F   L+ ++R+    + G +H  +RS          K+ +DGDL 
Sbjct: 1122 TVGALMPFTSRADVDFATHLEMHVRQENPPLLGRDHMAYRS-----AYFPVKDVVDGDLC 1176

Query: 1001 ESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            E ++ L       I++ M+ +  E+ K++E+L
Sbjct: 1177 EQYVTLPAETQRAIAEEMDRTPAEVMKKLEDL 1208


>gi|326432370|gb|EGD77940.1| splicing factor 3b subunit 3 [Salpingoeca sp. ATCC 50818]
          Length = 1232

 Score =  144 bits (363), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 172/740 (23%), Positives = 315/740 (42%), Gaps = 87/740 (11%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
           IYG + +L+ FR  G  +D+L + T+  +  VL++D +++ L  +   +   + G R   
Sbjct: 67  IYGVVRSLQPFRLTGGNKDYLVVGTDSGRITVLEYDTDTN-LFKKVHQETFGKSGCRRIV 125

Query: 65  NGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQV---------LDIK 115
            GQ    DP  R + +  ++    V  F+         +  LE  +          +D+ 
Sbjct: 126 PGQYLAADPKGRAVMIGAFEKQKLVYIFNRDAAAHLTISSPLEAHKSHTIVYDTVGVDVG 185

Query: 116 F---LYGCAKPTIVVLYQD-NKDARHVKTYEVALKDKDFVEGPWSQNN---LDNGADLLI 168
           F   ++ C +     + +D   +AR   T  +   + D       +     LD  A+ L+
Sbjct: 186 FDNPIFACLEVDYEEVDEDPTGEARQFLTQSLTFYELDLGLNHVVRKESIPLDEFANKLV 245

Query: 169 PVP-----PPLCGVLIIGEETIVYCSANAFKAI---------PIR----PSITKAYGRVD 210
            VP     P   GVL+     I + +    + +         P+R    P +T       
Sbjct: 246 SVPGGSDGP--GGVLVCSPGRITWRTYGEHEPVAINLPRRDDPLRTDRAPLVTAVTMHKT 303

Query: 211 ADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGD 270
                +L+    G L  L +  EK +V+GL I+      +A+ +  L N ++++ + YG+
Sbjct: 304 KRMFFFLVQTEEGDLFKLTMVAEKGEVSGLIIKYFDTVPVANAMCLLRNGLLFVAAEYGN 363

Query: 271 SQLIK-------------LNLQPDAKGSY--------VEVLERYVNLGPIVDFCVVDLER 309
             L +             L++ P  K  Y        + +++   +L P++   + DL  
Sbjct: 364 HHLYQIASLGDNEDEPSYLSIDPLDKIHYFRPRDLLNLALVDDQESLHPMIACQLADLHE 423

Query: 310 QGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTDDPFDTFLVVS 368
           +   Q+    G     S R++R+G+ ++E A  EL G    +WS++   DD  DT+LV+S
Sbjct: 424 EETPQLYALCGRGAKSSFRVLRHGLEVSEVAVSELPGNPSAVWSVKRHVDDESDTYLVLS 483

Query: 369 FISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELR 428
           F+ +T +L +   + +EE +  GF  +  TL       + L+QV  G +R +    R   
Sbjct: 484 FVDKTLVLGIG--ETVEEVKDSGFLEEVPTLSASRIGDDSLLQVYPGGIRHIRFDQR--V 539

Query: 429 NEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD-GILTE-VKHAQLEYEISCLDIN 486
            EWK+P   ++     N  QV++      LVY E+   G L E  +  ++  +++ + + 
Sbjct: 540 KEWKAPGSTAITNCAVNERQVVITLSSNELVYFELDRAGQLNEYTERIEMTSKVTAMALA 599

Query: 487 PIGENPSYSQIAAVGMWTDISVRIFSLPD---LNLITKEHLGGEIIPRSVLLCAFE---- 539
           P+ E+   SQ  A+G+  D +VR+ SL     L  +  + L G   P S  LC  E    
Sbjct: 600 PVAEDAFTSQFLALGL-EDNTVRVLSLDPSSCLQPLRMQALPGA--PSS--LCIIEIAGQ 654

Query: 540 ----GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAAS 595
               G   L   L +G +    ++  TG+L+D +   LG + + L    +     V A S
Sbjct: 655 AGEPGTLQLHIGLANGVVSRSTMDKVTGDLSDSRTRYLGVREVRLFPVRAHGHPAVLALS 714

Query: 596 DRPTV--IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLH 653
            RP +   Y    ++   +    E +HM  F S   P+ +    +  L I ++ ++  + 
Sbjct: 715 TRPWIAFTYQGQPRMAPLSYEALEYAHM--FCSEQLPEGIVAVAKNTLRILSLQNLGSIF 772

Query: 654 IRS-IPLGEHPRRICHQEQS 672
            +S IPL   PR+    E++
Sbjct: 773 NQSTIPLAYTPRKFFLDEKT 792



 Score = 72.0 bits (175), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 72/306 (23%), Positives = 129/306 (42%), Gaps = 44/306 (14%)

Query: 753  VFIVEDG-----KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQ 807
            +F   DG     KL  +   E +    +L  F G+L+A +   +++Y     D G ++L 
Sbjct: 939  LFSAPDGSARLTKLDFVHRTEVEAMPCALTPFAGRLIAGVGNIVRIY-----DMGRKKLL 993

Query: 808  SEC-GHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEI 866
             +C   H     + ++  G  +VV D  +S+  L YK  E  +     D    W +A+ +
Sbjct: 994  RKCENKHLPSRVVDIEVMGTRVVVADQRESVFFLKYKPTENVLSVFCDDTTPRWCTAMLM 1053

Query: 867  LDDDIYLGAENNFNLFTVRKNSEGATDE------------ERG-------RLEVVGEYHL 907
            +D      A + F   +V +  +  TD              RG       +L  V  +++
Sbjct: 1054 VDYSTVCVA-DKFGNVSVLRCPDDVTDTLQEDPSGAKAFWARGYLNGAPQKLVQVANFYI 1112

Query: 908  GEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPH-EQYLFLEKLQTNLRK 966
            GE V      +L    P         + + T++G IG +    H E   F + L+ +LR+
Sbjct: 1113 GEIVQSLHKTTLT---PSG----TECIAYTTLSGSIGALMPFSHKEDAEFFQTLELHLRQ 1165

Query: 967  VIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELC 1026
                + G +H  +RS       V  K+ +DGDL E +  LS +   +I+  +  + +E+ 
Sbjct: 1166 EHPPICGRDHLAFRS-----AYVPCKSVIDGDLCEEYNMLSASLKSDIADGLERTPQEVA 1220

Query: 1027 KRVEEL 1032
            K++EE 
Sbjct: 1221 KKLEEF 1226


>gi|308505958|ref|XP_003115162.1| CRE-TEG-4 protein [Caenorhabditis remanei]
 gi|308259344|gb|EFP03297.1| CRE-TEG-4 protein [Caenorhabditis remanei]
          Length = 1013

 Score =  143 bits (361), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 172/742 (23%), Positives = 312/742 (42%), Gaps = 95/742 (12%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESS---ELITRAMGDVSDRIGRP 62
           I+G + +L  FR     +DF+ + ++  +  +LQ++AE +    L     G    R   P
Sbjct: 59  IFGIVRSLLAFRLTAGTRDFIAVGSDSGRIVILQYNAEKTCFERLHQETFGKTGCRRIVP 118

Query: 63  TDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL--- 117
              G     DP  R  +IG      L  ++  D++  L  +  +   +   L    +   
Sbjct: 119 ---GHYLAGDPRGRALMIGAVERQKLVYIMNRDSEAHLTISSPLEAHKHHTLCYAMVGVD 175

Query: 118 YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNGADL 166
            G   PT   L  D +D+ +  T E A + +    F E     N+        L++  +L
Sbjct: 176 VGFENPTFACLEFDYEDSDNDPTGEAAKRTQQTLTFYELDLGLNHVVRKYAEPLNDPGNL 235

Query: 167 LIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR------ 215
           LI VP     P   GV++  E  +VY   N      IR  I +    +D D  R      
Sbjct: 236 LIAVPGGNDGP--SGVIVCCENYLVY--KNLGDQPDIRCPIPRRRNELD-DADRTMLIIA 290

Query: 216 -----------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYI 264
                      +L+    G +  + +  +++ VT +K++       A+ +  L    ++I
Sbjct: 291 TATHKTRNMFFFLVQAENGDIFKVTLETDEDLVTEMKLKYFDTVPPANAMCILKAGFLFI 350

Query: 265 GSSYGDSQLIKLN-----------------------LQPDAKGSYVEVLERYVNLGPIVD 301
            + +G+ +L ++                         +P    S + + +   +L P+ D
Sbjct: 351 AAEFGNHELYQIASLGEGGDDEFSSAMGFGENDAAFFEPHELRSLIPI-DSMDSLSPLTD 409

Query: 302 FCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTDDP 360
             + D+ R+   Q+ T  G      ++++RNG+ I+E A  +L G    +W+++ + +D 
Sbjct: 410 AVIGDIAREDAAQLYTLIGRGSRSHMKVLRNGLEISEMAVSDLPGNPNAVWTVKKNIEDQ 469

Query: 361 FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLV 420
           +D+++VVSF++ T  LA+ + D +EE    GF   T T+ C     + LVQV     ++ 
Sbjct: 470 YDSYIVVSFVNAT--LALTIGDTVEEASDSGFLPTTPTIGCSMIGDDSLVQVIRN--QIY 525

Query: 421 SSTSRELR-----NEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTEVKHA 474
           S   R +R     NEWK PP   +     N  QV +A  GG LVY E+  +G L E    
Sbjct: 526 SEGIRHIRADKRINEWKVPPRRQIVKCAVNRRQVAVALSGGELVYFELDLNGTLNEFTER 585

Query: 475 QL-EYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPRS 532
           +L   +I+C+  + I E    S+  A+G   D +VRI SL P+  L+          P S
Sbjct: 586 KLFNADIACMTFSEISEGELNSRFLALGT-VDNAVRIISLDPNDMLMPLSTQNLPCPPES 644

Query: 533 VLLCAF-----EGIS--YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSS 585
           +LL        +G++  +L   L +G L    ++  TG + D +   LGT+P+ L     
Sbjct: 645 ILLIDTPNEDGKGVAAVHLNIGLQNGCLFRNTVDNVTGAIMDTRTRYLGTRPVKLFKVQC 704

Query: 586 KNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGT 645
           +  + +   S R  ++Y   ++   + ++   + +   F S    + +       L I  
Sbjct: 705 QGRSAILCTSSRSWLLYHFQRRFHLTPLSYANLEYAASFCSNQCAEGIVAISASTLRIIA 764

Query: 646 IDDIQ-KLHIRSIPLGEHPRRI 666
            + +    +++S      PRR+
Sbjct: 765 AEKLGVAFNVQSFEHKLTPRRV 786


>gi|357496593|ref|XP_003618585.1| Splicing factor 3B subunit [Medicago truncatula]
 gi|355493600|gb|AES74803.1| Splicing factor 3B subunit [Medicago truncatula]
          Length = 702

 Score =  143 bits (360), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 173/699 (24%), Positives = 303/699 (43%), Gaps = 76/699 (10%)

Query: 379  NLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYS 438
            ++ + ++E    GF   T +L       + L+QV    +R +        NEW++P   +
Sbjct: 30   SVRETVKEVTDNGFLDTTPSLSVSLIGDDSLMQVHPNGIRHIREDGH--INEWRTPGKRT 87

Query: 439  VNVATANASQVLLATGGGHLVYLEIG-DGILTEVKHAQLEYEISCLDINPIGENPSYSQI 497
            +    +N  QV++A  GG L+Y E+   G L EV+  ++  +++CLDI P+ E    S+ 
Sbjct: 88   IAKVGSNRLQVVIALNGGELIYFEVDVTGQLMEVEKHEMSGDVACLDIAPVPEGRQRSRF 147

Query: 498  AAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPRSVLLCAFEGISYLLCAL-GDGHLLN 555
             AVG + D ++RI SL PD  + T   LG + +  +     F  +   +    G+G L  
Sbjct: 148  LAVGSY-DKTIRILSLDPDDCMQT---LGIQSLSSASESLLFLEVQASVGGEDGNGVLSR 203

Query: 556  FLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNL 615
             +++M TG L+D +   LG +P  L     +    +   S RP + Y      L + ++ 
Sbjct: 204  TVVDMVTGLLSDSRSPFLGLRPPKLFPIVVRGKRAMLCLSSRPWLGYIHQGHFLLTPLSY 263

Query: 616  KEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDI-QKLHIRSIPLGEHPRRICHQEQSRT 674
            + +     F+S    + +       L I TI+ + +  +   IPL   P +   Q + + 
Sbjct: 264  ETLEFAASFSSDQCVEGVVALAGEALRIFTIERLGETFNETVIPLRYTPMKFVLQPKRKL 323

Query: 675  FAIC-SLKNQSCAEESEM---------------HF-------------VRLLDDQTFEFI 705
              +  S +    AEE E                H+             +R+LD +T    
Sbjct: 324  LVVIESDQGAFTAEEREANGGEDEDKDDPLSDEHYGYPKAESDKWASCIRILDPKTGNTT 383

Query: 706  STYPLDTFEYGCSILSCSFSD-DSNVYYCVGTAYVL--PEENEPTKGRILVF-IVEDGK- 760
                L   E   S  + +F D +      VGTA  L        T G I ++  +EDG+ 
Sbjct: 384  CLLELQDNEAAFSGCTVNFHDKEYGTLLDVGTAKGLQFTPRRSLTAGFIHIYRFLEDGRS 443

Query: 761  LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILA-- 818
            L+L+ + + +G   +L+ F G+LLA I   ++ Y     D G R L  +  +   +    
Sbjct: 444  LELLHKTQVEGVPLALSQFQGRLLAGIGPVLRFY-----DLGKRRLLRK--YENKLFPNT 496

Query: 819  -LYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAEN 877
             + +QT  D I VGD  +S     Y+ +E  +   A D    W++A   +D D   G E 
Sbjct: 497  IVSIQTYRDRIYVGDTQESFHYCKYRWDENQLYIFADDCVPRWLTASYHIDFDTMAGIEE 556

Query: 878  NFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFG 937
            +     + K  +G  +    ++E + ++H+G+ ++  +  SL+        G    ++ G
Sbjct: 557  DPTGGRI-KWEQGKLNGAPNKVEEIVQFHVGDVISCLQKASLI-------PGGGECILNG 608

Query: 938  TVNGVIGVI-ASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLD 996
            TV G IG + A    +   F   L+ ++R+    + G +H  +RS          K+ +D
Sbjct: 609  TVMGSIGALHAFTSRDDVDFFSHLEMHMRQDNPPLCGRDHMAYRS-----AYFPVKDVID 663

Query: 997  GDLIESF----LDLSRTRMDEISKTMNVSVEELCKRVEE 1031
            GDL E F    +DL R   DE+ +T      E+ K++EE
Sbjct: 664  GDLCEQFPTLPMDLQRKIADELDRTRG----EILKKLEE 698


>gi|322797581|gb|EFZ19622.1| hypothetical protein SINV_00421 [Solenopsis invicta]
          Length = 1217

 Score =  143 bits (360), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 171/775 (22%), Positives = 318/775 (41%), Gaps = 96/775 (12%)

Query: 1   MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
           +L V ++G I +L  FR  G  +D++ + ++  +  +L++ A  +               
Sbjct: 53  LLTVEVFGIIRSLMAFRLTGGTKDYIVVGSDSGRIVILEYIAAKNTFEKVHQETFGKSGC 112

Query: 61  RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVL---DIK 115
           R    GQ   IDP  R  +IG      L  ++  D + +L  +  +   +   L    + 
Sbjct: 113 RRIVPGQYLAIDPKGRAVMIGAIEKQKLVYILNRDAEARLTISSPLEAHKSNTLVYHTVG 172

Query: 116 FLYGCAKPTIVVLYQDNKDARHVKTYEVALKDK------------DFVEGPWSQNNLDNG 163
              G   P    L  D ++A    T + A+K +            + V   +S+  L+  
Sbjct: 173 VDVGFENPMFACLEIDYEEADSDPTGDAAVKTQQTLTLYELDLGLNHVVRKYSEP-LEEH 231

Query: 164 ADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR--- 215
           A+ L+ VP     P   GVLI  E  + Y   N      IR  I +    +D D  R   
Sbjct: 232 ANFLVSVPGGNDGP--SGVLICSENYLTY--KNLGDQHDIRCPIPRRRNDLD-DPERGMI 286

Query: 216 --------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAV 261
                         +L     G +  + +  +++ VT +K++      +A+++  L    
Sbjct: 287 FVCSATHKTKSMFFFLAQTEQGDIFKITLETDEDMVTEIKLKYFDTVPVAASMCVLKTGF 346

Query: 262 VYIGSSYGDSQLIKL-------------NLQPDAKGSYVEVLERYV----------NLGP 298
           +++ S +G+  L ++             +  P  +G       R +          +L P
Sbjct: 347 LFVASEFGNHYLYQIAHLGDDDDEPEFSSAMPLEEGDTFFFAPRPLRNLVLVDEMDSLSP 406

Query: 299 IVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSST 357
           I+   V DL  +   Q+    G     +LR++R+G+ ++E A  EL G    +W+++   
Sbjct: 407 IMACQVADLANEDTPQLYIACGRGPRSTLRVLRHGLEVSEMAVSELPGNPNAVWTVKRRI 466

Query: 358 DDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSV 417
           D+ +D +++VSF++ T +L++   + +EE    GF   T TL C     + LVQV    +
Sbjct: 467 DEEYDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSALGEDALVQVYPDGI 524

Query: 418 RLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD-GILTE-VKHAQ 475
           R + +  R   NEWK+P   ++     N  QV++A  GG LVY E+   G L E  +  +
Sbjct: 525 RHIRADKR--VNEWKAPGKKTIVKCAVNQRQVVIALTGGELVYFEMDPTGQLNEYTERKK 582

Query: 476 LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSV-L 534
           +  E+ C+ +  +      S   AVG+  D +VRI SL   + +    +  + +P +   
Sbjct: 583 MPSEVMCMALGNVAVGEQRSWFLAVGL-QDNTVRIISLDPSDCLAPRSM--QALPAAAES 639

Query: 535 LCAFEGIS----------------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPI 578
           LC  E  +                YL   L +G LL  +L+  +G+L D +   LG++P+
Sbjct: 640 LCIVEMGAKEADNSEDAAPQQSSLYLNIGLQNGVLLRTVLDPISGDLADTRTRYLGSRPV 699

Query: 579 TLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKE 638
            L     +    V A S R  + Y    +   + ++ + +     F+S   P+ +     
Sbjct: 700 KLFRIRMQGNQAVLAMSSRSWLSYYHQNRFHLTPLSYESLEFASGFSSEQCPEGIVAIST 759

Query: 639 GELTIGTIDDIQKLHIR-SIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMH 692
             L I  ++ +  +  + S PL   PR+      S    +   ++ +  EE++  
Sbjct: 760 NTLRILALEKLGAVFNQVSFPLEYTPRKFAIHADSAHLVVIETEHNAYTEETKQQ 814



 Score = 79.3 bits (194), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 66/276 (23%), Positives = 128/276 (46%), Gaps = 41/276 (14%)

Query: 779  FNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHI--LALYVQTRGDFIVVGDLMKS 836
            + G++L  + + ++LY     D G ++L  +C  + HI    + +   G  I V D+ +S
Sbjct: 955  YQGRVLVGVGRMLRLY-----DMGKKKLLRKC-ENKHIPNAVVSINAIGQRIYVSDVQES 1008

Query: 837  ISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDE-- 894
            +  + YK +E  +   A D +  W++   +LD D    A+   N+  +R  + G  D+  
Sbjct: 1009 VYAVRYKRQENQLIVFADDTHPRWITTTCVLDYDTVATADKFGNIAVIRL-ATGINDDVD 1067

Query: 895  ----------ERGRL-------EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFG 937
                      +RG L       + V  +H+GE V   +  +L+        G   ++++ 
Sbjct: 1068 EDPTGNKALWDRGLLNGASQKADTVACFHVGETVMSLQKATLI-------PGGSESLVYT 1120

Query: 938  TVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLD 996
            T++G +GV+     HE + F + L+ ++R     + G +H  +RS+         KN +D
Sbjct: 1121 TLSGTVGVLVPFTSHEDHDFFQHLEMHMRSEHPPLCGRDHLSFRSY-----YYPVKNVID 1175

Query: 997  GDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            GDL E F  +  T+   IS  +  +  E+ K++E++
Sbjct: 1176 GDLCEQFNSIEPTKQKSISGDLERTPSEVSKKLEDI 1211


>gi|116191283|ref|XP_001221454.1| hypothetical protein CHGG_05359 [Chaetomium globosum CBS 148.51]
 gi|88181272|gb|EAQ88740.1| hypothetical protein CHGG_05359 [Chaetomium globosum CBS 148.51]
          Length = 979

 Score =  143 bits (360), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 190/850 (22%), Positives = 354/850 (41%), Gaps = 126/850 (14%)

Query: 236  KVTGLKIELLGETSIASTISYLDNAVVYIGSSY-----------GDSQL----------I 274
            +V  +K+       +AS++  L +  + + S +           GD Q+          +
Sbjct: 112  EVNCIKLRYFDTVPVASSLCILKSGFLLVASEFENHHLYQFEQLGDDQVEFNSNRDIAPV 171

Query: 275  KLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQGQG--QVVTCSGAYKDGSLRIVRN 332
              + QP    S VE +E   ++ PI+   VV+L R+G    Q+ +  G+      RI+R+
Sbjct: 172  FFDPQPSGNLSLVETIE---SMNPILASGVVNL-REGDDAPQIYSACGSGTRSHFRILRH 227

Query: 333  GIGINEQASVELQG-IKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEG 391
            G+ +NE  + EL G +  +W+ + ++ D  DT+++++  S T  L +++ +E+++    G
Sbjct: 228  GLEVNEIVASELPGTVSAVWTTKLTSQDKHDTYIILT--SSTDTLVLSIGEEVKQVSDSG 285

Query: 392  FCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLL 451
            F +   TL       + L QV    +R + S+ + +  EW +P   ++     N  QV +
Sbjct: 286  FLTSVSTLAIQQIGDDSLAQVHPKGIRHIRSSDQII--EWPAPQHRTIVATATNQRQVAV 343

Query: 452  ATGGGHLVYLEI--GDGILTEVK-HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISV 508
            A   G ++Y E+   DG L E     ++   ++CL +  + E    S   AVG   D ++
Sbjct: 344  ALSSGEIIYFELDDADGSLAEYDGREEMSGTVTCLSLGQVPEGRLRSSFLAVGC-DDRTI 402

Query: 509  RIFSL-PDLNLITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTD 567
            RI SL P+  L ++               + + ++    AL    + + LL       T+
Sbjct: 403  RILSLDPESTLESR---------------SVQALTAAPSALAIIPMDDSLLGGSNSSDTN 447

Query: 568  RKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSS-NKKLLYSNVNLKEVSHMCPFNS 626
             K   LG +P+ L   + K    V A S +  + +    +  L+  +   E + + P   
Sbjct: 448  YK--FLGVRPVRLFPVTVKQKPRVLALSSKAWLGHVDLARGFLFYTIE-SENNTLAPELR 504

Query: 627  AAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQ--S 684
            A             +  GT          + P GE P  +        F     + +  S
Sbjct: 505  AQL-----------VAAGT----------TAPGGEKPDHVAEILPPEQFGYPRGQGRWAS 543

Query: 685  CAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFS---DDSNVYYCVGTAYVLP 741
            C        + ++D      I T  L   E   S+   SF+   D+S +    G   VL 
Sbjct: 544  C--------ISIVDPIGETVIRTIHLQENEAAASLAIASFTSQDDESFLVVSTGRDMVL- 594

Query: 742  EENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLR 799
               + + G   V+   +DG+ L+LI +  T     +L AF G+L+A I + + +Y   L+
Sbjct: 595  NPRQLSGGYSYVYRFHDDGRDLELIHKTGTTEPPTALEAFRGRLVAGIGKTLVVYDLGLK 654

Query: 800  DDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNAN 859
                R+ Q+     G I++L  QT+G+ IVVGD+   ++++ Y+ E   +     D  A 
Sbjct: 655  -QMLRKTQANDVVPGLIVSL--QTQGNRIVVGDVQHGVAMVAYRTESNQLIPFVDDTIAR 711

Query: 860  WMSAVEILDDDIYLGAENNFNLFTVRKNSEGATD-EERG----------------RLEVV 902
            W +   ++D D   G +   N + VR   + + + +E G                RL++ 
Sbjct: 712  WTTCTTMVDYDSVAGGDKFGNFWIVRTPQQASLEADEPGAHRLLHAREHLHGAPHRLQLT 771

Query: 903  GEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGV-IASLPHEQYLFLEKLQ 961
              +H  +         LV       VG    +++    G +GV +  +  E   F   L+
Sbjct: 772  AHFHTQDIPTGITKTHLV-------VGGQEVLVWSGFQGTVGVFVPFVTREDADFFLALE 824

Query: 962  TNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
             ++R     + G +H  +R +        AK  +DGDL E +  L   +   I+  ++ S
Sbjct: 825  QHMRGEEPSLIGRDHLAYRGYYEP-----AKGVVDGDLCERYQLLPGDKKQRIAAELDRS 879

Query: 1022 VEELCKRVEE 1031
            V E+ +++ +
Sbjct: 880  VREVERKISK 889


>gi|383847297|ref|XP_003699291.1| PREDICTED: splicing factor 3B subunit 3-like [Megachile rotundata]
          Length = 1217

 Score =  143 bits (360), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 172/776 (22%), Positives = 321/776 (41%), Gaps = 98/776 (12%)

Query: 1   MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
           +L V ++G I +L  FR  G  +D++ + ++  +  +L++   +  +  +   +   + G
Sbjct: 53  LLTVEVFGIIRSLMAFRLTGGTKDYIVVGSDSGRIVILEY-IPAKNIFEKVHQETFGKSG 111

Query: 61  -RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVL---DI 114
            R    GQ   IDP  R  +IG      L  ++  D + +L  +  +   +   L    +
Sbjct: 112 CRRIVPGQYLAIDPKGRAVMIGAIEKQKLVYILNRDPEARLTISSPLEAHKSNTLVYHTV 171

Query: 115 KFLYGCAKPTIVVLYQDNKDARHVKTYEVALKDK------------DFVEGPWSQNNLDN 162
               G   P    L  D ++A    T + A+K +            + V   +S+  L+ 
Sbjct: 172 GVDVGFENPMFACLEIDYEEADSDPTGDAAVKTQQTLTLYELDLGLNHVVRKYSEP-LEE 230

Query: 163 GADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR-- 215
            A+ L+ VP     P   GVLI  E  + Y   N      IR  I +    +D D  R  
Sbjct: 231 HANFLVSVPGGNDGP--SGVLICSENYLTY--KNLGDQHDIRCPIPRRRNDLD-DPERGM 285

Query: 216 ---------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNA 260
                          +L     G +  + +  +++ VT +K++      +A+++  L   
Sbjct: 286 IFVCSATHKTKSMFFFLAQTEQGDIFKITLETDEDMVTEIKLKYFDTVPVAASMCVLKTG 345

Query: 261 VVYIGSSYGDSQLIKL-------------NLQPDAKGSYVEVLERYV----------NLG 297
            +++ S +G+  L ++             +  P  +G       R +          +L 
Sbjct: 346 FLFVASEFGNHYLYQIAHLGDDDDEPEFSSAMPLEEGDTFFFAPRPLRNLVLVDEMDSLS 405

Query: 298 PIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSS 356
           PI+   V DL  +   Q+    G     +LR++R+G+ ++E A  EL G    +W+++  
Sbjct: 406 PIMACQVADLANEDTPQLYITCGRGPRSTLRVLRHGLEVSEMAVSELPGNPNAVWTVKRR 465

Query: 357 TDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGS 416
            D+ +D +++VSF++ T +L++   + +EE    GF   T TL C     + LVQV    
Sbjct: 466 VDEEYDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSALGEDALVQVYPDG 523

Query: 417 VRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD-GILTE-VKHA 474
           +R + +  R   NEWK+P   ++     N  QV++A  GG LVY E+   G L E  +  
Sbjct: 524 IRHIRADKR--VNEWKAPGKKTIVKCAVNQRQVVIALTGGELVYFEMDPTGQLNEYTERK 581

Query: 475 QLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSV- 533
           ++  E+ C+ +  +      S   AVG+  D +VRI SL   + +    +  + +P +  
Sbjct: 582 KMPSEVMCMALGNVAIGEQRSWFLAVGL-QDNTVRIISLDPSDCLAPRSM--QALPAAAE 638

Query: 534 LLCAFE----------------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQP 577
            LC  E                   YL   L +G LL  +L+  +G+L D +   LG++P
Sbjct: 639 SLCIVEMGAKDANNSEELSSQQSSLYLNIGLQNGVLLRTVLDPISGDLADTRTRYLGSRP 698

Query: 578 ITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAK 637
           + L     +    V A S R  + Y    +   + ++ + +     F+S   P+ +    
Sbjct: 699 VKLFRIKMQGNQAVLAMSSRSWLSYYYQNRFHLTPLSYESLEFASGFSSEQCPEGIVAIS 758

Query: 638 EGELTIGTIDDIQKLHIR-SIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMH 692
              L I  ++ +  +  + S PL   PR+      S    I   ++ +  EE++  
Sbjct: 759 TNTLRILALEKLGAVFNQISFPLEYTPRKFAIHSDSAHLIIIETEHNAYTEETKQQ 814



 Score = 77.8 bits (190), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 67/294 (22%), Positives = 134/294 (45%), Gaps = 41/294 (13%)

Query: 761  LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHI--LA 818
            L+L+ +        ++  + G++L  + + ++LY     D G ++L  +C  + HI    
Sbjct: 937  LELVHKTTLDEVPLAICPYQGRVLVGVGRMLRLY-----DMGKKKLLRKC-ENKHIPNAI 990

Query: 819  LYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENN 878
            + +   G  I V D+ +S+  + YK +E  +   A D +  W++   +LD D    A+  
Sbjct: 991  VSINAIGQRIYVSDVQESVYAVRYKRQENQLIVFADDTHPRWITTTCVLDYDTVATADKF 1050

Query: 879  FNLFTVRKNSEGATDE------------ERGRL-------EVVGEYHLGEFVNRFRHGSL 919
             N+  +R  S G  D+            +RG L       + V  +H+GE V   +  +L
Sbjct: 1051 GNIAVIRLAS-GINDDVDEDPTGNKALWDRGLLNGASQKADTVACFHVGETVMSLQKATL 1109

Query: 920  VMRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQ 978
            +        G   ++++ T++G +GV+     HE + F + L+ ++R     + G +H  
Sbjct: 1110 I-------PGGSESLVYTTLSGTVGVLVPFTSHEDHDFFQHLEMHMRSEHPPLCGRDHLS 1162

Query: 979  WRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            +RS+         KN +DGDL E F  +   +   I+  +  +  E+ K++E++
Sbjct: 1163 FRSY-----YYPVKNVIDGDLCEQFNSIEPAKQKSIAGDLERTPSEVSKKLEDI 1211


>gi|414883930|tpg|DAA59944.1| TPA: hypothetical protein ZEAMMB73_987949 [Zea mays]
          Length = 1355

 Score =  143 bits (360), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 149/618 (24%), Positives = 253/618 (40%), Gaps = 91/618 (14%)

Query: 214  SRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQL 273
            +R L     G  H+L  T + E       E +  T     + ++ N ++      GD  +
Sbjct: 403  ARLLFCLDDGEFHILEFTLDVEGAKLYTFEYIDRTLPCRPLFWMKNRMIIGFVEMGDGMI 462

Query: 274  IKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNG 333
             KL  +     S ++      N+ PI+D  + D   + Q Q+  C G   +GSLR++RNG
Sbjct: 463  FKLGHRRLLHKSTIQ------NVAPILDLAIADYHGEKQDQMFACCGMSPEGSLRVLRNG 516

Query: 334  IGINEQASVE--LQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEG 391
            + ++     E   QG+ G+W+LR    D + +FLV+SF+ ETRIL++ L    + ++  G
Sbjct: 517  VNVDRLLKTEAIYQGVTGLWTLRMKATDAYHSFLVLSFVEETRILSVGLSFN-DISDAVG 575

Query: 392  FCSQTQTLFCHDAIYNQLVQVTSGSVRLVSST----------SRELRNEWKSPPGYSVNV 441
            F     TL C     N LVQ+ S  V++   T          +  +  +W   P  +++V
Sbjct: 576  FQPDVCTLACGLVADNLLVQIYSKGVKVCLPTVYAHPEGAPLTSPICTDWY--PAITISV 633

Query: 442  ATANASQVLLATGGGHLVYLEIGDGI---------LTEVKHAQLEYEISCLDI---NPIG 489
                 + V++AT     +Y+    G+         L    H QL+YE+SC+ I   + I 
Sbjct: 634  GAVGHNIVVVATSNPCCLYVL---GVRSSSSYQYELYATHHVQLQYEVSCISIPQEDCIH 690

Query: 490  ENPSYS------------------QIAAVGMWTDISVRIFSLPD---LNLIT------KE 522
            +N S+S                  + A +G     SV I SL     L ++T        
Sbjct: 691  DNVSFSCGEGDDICKNPPPKVNVCKFAVIGTHRP-SVEIISLEPGEALRVLTIGTVSVNN 749

Query: 523  HLGGEI---IPRSVLLCAFEGISYLLCALGDGHLLNFLLNMK---TGELTDRKKVS---- 572
             LG  +   IP +V   A E   Y+L  L +G LL F    +    G L     +     
Sbjct: 750  ALGAPMSGCIPENVRFVAAERF-YILAGLRNGMLLRFESETRDYLPGFLYKDSSIPSVNT 808

Query: 573  ---------LGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCP 623
                     +G  P+ L          +   SDRP +++++   L YS+++    SH+ P
Sbjct: 809  FLQLISIRRIGITPVLLVPIHDSANADIIVLSDRPWLLHAARHSLAYSSISFLSASHVTP 868

Query: 624  FNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQ 683
             +S   P+ L    E  L +  +   ++L+ +   +G  PR++ +  +SRT  +      
Sbjct: 869  VSSVDCPNGLLFVAESCLHLVELVHGKRLNAQKFSIGGTPRKVLYHNESRTLLVLRTGLN 928

Query: 684  SCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDS----NVYYCVGTAYV 739
              +  S+   V  +D Q    +S Y  +  E    +      +D           G   +
Sbjct: 929  GASSSSD---VVQVDPQNGVLLSRYKCEPGETAKCMQIAKIGNDQVLIVGTTKSAGRPMM 985

Query: 740  LPEENEPTKGRILVFIVE 757
               E E  KGR++V  +E
Sbjct: 986  SNGEAESIKGRLIVLSLE 1003


>gi|125580741|gb|EAZ21672.1| hypothetical protein OsJ_05303 [Oryza sativa Japonica Group]
          Length = 1224

 Score =  142 bits (359), Expect = 7e-31,   Method: Compositional matrix adjust.
 Identities = 179/763 (23%), Positives = 309/763 (40%), Gaps = 85/763 (11%)

Query: 5   PIYGRIATLEL--FRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRP 62
           P  GR+ TL    FR  G  +D+L + ++  +  +L++  + + L             R 
Sbjct: 60  PETGRLRTLLSVDFRLTGATKDYLVVGSDSGRLVILEYSPDRNRLDKVHQETFGKSGCRR 119

Query: 63  TDNGQIGIIDPDCRLIGLHLYDG--LFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLY-- 118
              GQ+  +DP  R + +   +   L  V+  D   +L  +  +   +   L        
Sbjct: 120 IVPGQLLAVDPKGRALCIAALEKQKLVYVLNRDAAARLTISSPLEAHKSNTLTFSLTALD 179

Query: 119 -GCAKPTIVVL---YQDNKDARHVKTYEVALKDKDFVEGPWSQNN--------LDNGADL 166
            G   P    +   Y ++      +  E A K   F E     N+        +DNGA+L
Sbjct: 180 CGFDNPVFAAIELEYAESDRDPTGQAAEQAQKHLTFYELDLGLNHVSRKASEPIDNGANL 239

Query: 167 LIPVP-----PPLCGVLIIGEETIVYCS---ANAFKAIPIRPSITKAYGR--VDADGSR- 215
           L+ VP     P   G+L+  +  ++Y +         IP R  +    G   V A   R 
Sbjct: 240 LVTVPGGGDGP--SGLLVCCDNFVLYRNQGHPEVRAVIPRRADLPAERGVLIVAAATHRQ 297

Query: 216 -----YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGD 270
                +LL    G +  + + H  + VT L+I+      + S I  L +  ++  S +G+
Sbjct: 298 KSLFFFLLQTEYGDIFKVDLEHSNDTVTELRIKYFDTIPVTSAICVLRSGFLFAASEFGN 357

Query: 271 SQLIKLN--------------------------LQPDAKGSYVEVLERYVNLGPIVDFCV 304
             L +                             QP A  +   + E   +L PI+D  V
Sbjct: 358 HALYQFRDIGRDVDVESSSATLMETDEGFQPVFFQPRALKNLYRIDE-IESLMPIMDMRV 416

Query: 305 VDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIK-GMWSLRSSTDDPFDT 363
            +L  +   QV T  G     +LRI+R G+ I+E A   L      +W+++ + +D FD 
Sbjct: 417 ANLFDEETPQVFTACGRGPRSTLRILRPGLAISEMARSMLPAEPIAVWTVKKNINDMFDA 476

Query: 364 FLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSST 423
           ++VVSF + T +L++   + +EE     F   T +L       + L+QV    +R +   
Sbjct: 477 YIVVSFANVTLVLSIG--ETIEEVSDSQFLDTTHSLAVSLLGEDSLMQVHPNGIRHIRED 534

Query: 424 SRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTEVKHAQLEYEISC 482
            R   NEW++P   ++    +N  QV++A  GG L+Y E+   G L EV+   +  +++C
Sbjct: 535 GRV--NEWRTPGKKTITKVGSNRLQVVIALSGGELIYFEMDMTGQLMEVEKQDMSGDVAC 592

Query: 483 LDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPRSVLLCAFEGI 541
           L I P+ E    S+  AVG + D ++RI S+ PD  L           P S++    +  
Sbjct: 593 LAIAPVPEGRQRSRFLAVGSF-DNTIRILSVDPDDCLQPLSVQSVSSAPESLMFLEVQAS 651

Query: 542 S------------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTT 589
                        +L   L +G L    ++M TG+L+D +   LG +P  L      +  
Sbjct: 652 VGGEDGADHPANLFLNAGLQNGVLFRTNVDMVTGQLSDTRSRFLGLRPPKLFPCIVSHRQ 711

Query: 590 HVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDI 649
            +   S RP + Y      L + ++   +     F+S    + +       L I TI+ +
Sbjct: 712 AMLCLSSRPWLGYIHQGHFLLTPLSCDTLESAASFSSDQCSEGVVAVAGDALRIFTIEHL 771

Query: 650 -QKLHIRSIPLGEHPRRICHQEQSRTFAIC-SLKNQSCAEESE 690
            +  +  +IPL   PR+     + +  A+  S K    AEE E
Sbjct: 772 GETFNETAIPLRYTPRKFVILPKKKYLAVIESDKGALSAEERE 814



 Score = 93.2 bits (230), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 91/365 (24%), Positives = 166/365 (45%), Gaps = 44/365 (12%)

Query: 694  VRLLDDQTFEFISTYPLDTFEYGCSILSCSFSD-DSNVYYCVGTAYVL---PEENEPTKG 749
            +R+LD ++ +      L   E   SI + +F D +      VGTA  L   P+ N  + G
Sbjct: 873  IRILDPKSRDTTCLLELQDNEAAVSICTVNFHDKEHGTLLAVGTAKGLQFWPKRNL-SAG 931

Query: 750  RILVF-IVEDGK-LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQ 807
             I ++  V++G+ L+L+ + + +    +L  F G+LLA +   ++LY     D G R+L 
Sbjct: 932  FIHIYKFVDEGRSLELLHKTQVEEVPLALCQFQGRLLAGVGSVLRLY-----DLGKRKLL 986

Query: 808  SECGHHGHILALY-VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEI 866
             +C +      +  + T  D I VGD+ +S     Y+ +E  +   A D    W++A   
Sbjct: 987  RKCENKLFPRTIVSIHTYRDRIYVGDMQESFHYCKYRRDENQLYIFADDSVPRWLTAANH 1046

Query: 867  LDDDIYLGAENNFNLFTVR------------------KNSEGATDEERGRLEVVGEYHLG 908
            +D D   GA+   N++  R                  K  +G  +    ++E + ++H+G
Sbjct: 1047 IDFDTMAGADKFGNIYFARLPQDLSDEIEEDPTGGKIKWEQGKLNGAPNKVEEIVQFHVG 1106

Query: 909  EFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIG-VIASLPHEQYLFLEKLQTNLRKV 967
            + V   +  SL+        G    +I+GTV G +G ++A    E   F   L+ +LR+ 
Sbjct: 1107 DVVTCLQKASLI-------PGGGECLIYGTVMGSVGALLAFTSREDVDFFSHLEMHLRQE 1159

Query: 968  IKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCK 1027
               + G +H  +RS          K+ +DGDL E F  L      +I+  ++ +  E+ K
Sbjct: 1160 HPPLCGRDHMAYRS-----AYFPVKDVIDGDLCEQFPSLPADMQRKIADELDRTPGEILK 1214

Query: 1028 RVEEL 1032
            ++E++
Sbjct: 1215 KLEDI 1219


>gi|332026090|gb|EGI66238.1| Splicing factor 3B subunit 3 [Acromyrmex echinatior]
          Length = 1217

 Score =  142 bits (359), Expect = 7e-31,   Method: Compositional matrix adjust.
 Identities = 172/783 (21%), Positives = 325/783 (41%), Gaps = 98/783 (12%)

Query: 1   MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
           +L V ++G I +L  FR  G  +D++ + ++  +  +L++   +  +  +   +   + G
Sbjct: 53  LLTVEVFGIIRSLMAFRLTGGTKDYIVVGSDSGRIVILEY-ISAKNIFEKVHQETFGKSG 111

Query: 61  -RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVL---DI 114
            R    GQ   IDP  R  +IG      L  ++  D + +L  +  +   +   L    +
Sbjct: 112 CRRIVPGQYLAIDPKGRAVMIGAIEKQKLVYILNRDAEARLTISSPLEAHKSNTLVYHTV 171

Query: 115 KFLYGCAKPTIVVLYQDNKDARHVKTYEVALKDK------------DFVEGPWSQNNLDN 162
               G   P    L  D ++A    T + A+K +            + V   +S+  L+ 
Sbjct: 172 GVDVGFENPMFACLEIDYEEADSDPTGDAAVKTQQTLTLYELDLGLNHVVRKYSEP-LEE 230

Query: 163 GADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR-- 215
            A+ L+ VP     P   GVLI  E  + Y   N      IR  I +    +D D  R  
Sbjct: 231 HANFLVSVPGGNDGP--SGVLICSENYLTY--KNLGDQHDIRCPIPRRRNDLD-DPERGM 285

Query: 216 ---------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNA 260
                          +L     G +  + +  +++ VT +K++      +A+++  L   
Sbjct: 286 IFVCSATHKTKSMFFFLAQTEQGDIFKITLETDEDMVTEIKLKYFDTVPVAASMCVLKTG 345

Query: 261 VVYIGSSYGDSQLIKL-------------NLQPDAKGSYVEVLERYV----------NLG 297
            +++ S +G+  L ++             +  P  +G       R +          +L 
Sbjct: 346 FLFVASEFGNHYLYQIAHLGDDDDEPEFSSAMPLEEGDTFFFAPRPLRNLVLVDEMDSLS 405

Query: 298 PIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSS 356
           PI+   V DL  +   Q+    G     +LR++R+G+ ++E A  EL G    +W+++  
Sbjct: 406 PIMACQVADLANEDTPQLYIACGRGPRSTLRVLRHGLEVSEMAVSELPGNPNAVWTVKRR 465

Query: 357 TDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGS 416
            D+ +D +++VSF++ T +L++   + +EE    GF   T TL C     + LVQV    
Sbjct: 466 IDEEYDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSALGEDALVQVYPDG 523

Query: 417 VRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD-GILTE-VKHA 474
           +R + +  R   NEWK+P   ++     N  QV++A  GG LVY E+   G L E  +  
Sbjct: 524 IRHIRADKR--VNEWKAPGKKTIVKCAVNQRQVVIALTGGELVYFEMDPTGQLNEYTERK 581

Query: 475 QLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSV- 533
           ++  E+ C+ +  +      S   AVG+  D +VRI SL   + +    +  + +P +  
Sbjct: 582 KMPSEVMCMALGNVAVGEQRSWFLAVGL-QDNTVRIISLDPSDCLAPRSM--QALPAAAE 638

Query: 534 LLCAFEGIS----------------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQP 577
            LC  E  +                YL   L +G LL  +L+  +G+L D +   LG++P
Sbjct: 639 SLCIVEMGAKDADNSEDAAPQQSSLYLNIGLQNGVLLRTVLDPISGDLADTRTRYLGSRP 698

Query: 578 ITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAK 637
           + L     +    V A S R  + Y    +   + ++ + +     F+S   P+ +    
Sbjct: 699 VKLFRIRMQGNQAVLAMSSRSWLSYYHQNRFHLTPLSYESLEFASGFSSEQCPEGIVAIS 758

Query: 638 EGELTIGTIDDIQKLHIR-SIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRL 696
              L I  ++ +  +  + S PL   PR+      S    +   ++ +  EE++    + 
Sbjct: 759 TNTLRILALEKLGAVFNQVSFPLEYTPRKFAIHTDSAHLVVIETEHNAYTEETKQQRRQQ 818

Query: 697 LDD 699
           + D
Sbjct: 819 MAD 821



 Score = 79.7 bits (195), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 66/276 (23%), Positives = 128/276 (46%), Gaps = 41/276 (14%)

Query: 779  FNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHI--LALYVQTRGDFIVVGDLMKS 836
            + G++L  + + ++LY     D G ++L  +C  + HI    + +   G  I V D+ +S
Sbjct: 955  YQGRVLVGVGRMLRLY-----DMGKKKLLRKC-ENKHIPNAVVSINAIGQRIYVSDVQES 1008

Query: 837  ISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDE-- 894
            +  + YK +E  +   A D +  W++   +LD D    A+   N+  +R  + G  D+  
Sbjct: 1009 VYAVRYKRQENQLIVFADDTHPRWITTTCVLDYDTVATADKFGNIAVIRL-ATGINDDVD 1067

Query: 895  ----------ERGRL-------EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFG 937
                      +RG L       + V  +H+GE V   +  +L+        G   ++++ 
Sbjct: 1068 EDPTGNKALWDRGLLNGASQKADTVACFHVGETVMSLQKATLI-------PGGSESLVYT 1120

Query: 938  TVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLD 996
            T++G +GV+     HE + F + L+ ++R     + G +H  +RS+         KN +D
Sbjct: 1121 TLSGTVGVLVPFTSHEDHDFFQHLEMHMRSEHPPLCGRDHLSFRSY-----YYPVKNVID 1175

Query: 997  GDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            GDL E F  +  T+   IS  +  +  E+ K++E++
Sbjct: 1176 GDLCEQFNSIEPTKQKSISGDLERTPSEVSKKLEDI 1211


>gi|328717412|ref|XP_003246201.1| PREDICTED: splicing factor 3B subunit 3-like [Acyrthosiphon pisum]
          Length = 1218

 Score =  142 bits (359), Expect = 7e-31,   Method: Compositional matrix adjust.
 Identities = 177/778 (22%), Positives = 320/778 (41%), Gaps = 105/778 (13%)

Query: 1   MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
           +L V ++G I  L  FR  G  +D++ + ++  +  +L++   S  ++ +   +   + G
Sbjct: 53  LLTVEVFGIIRALMSFRQSGGTKDYIVVGSDSGRIVILEY-LPSKNVLDKVHQETFGKSG 111

Query: 61  -RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQL-------KEAFNIRLEELQ 110
            R    GQ   IDP  R  +IG      L  +   D + +L           N  +  + 
Sbjct: 112 CRRIVPGQYLAIDPKGRAIMIGAVEKQKLVYIPNRDAEARLTISSPLEANKSNTLVYHMV 171

Query: 111 VLDIKFLYGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN-------- 159
            +D+ F      P    L  D ++     T E     +    F E  +  N+        
Sbjct: 172 GIDVAF----DNPIFACLEIDYEEVDADPTGEAVQTTRQTLTFYEVDFGLNHVVRKYSEP 227

Query: 160 LDNGADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGS 214
           L+  A+ LI VP     P   GVLI  E  I Y   N  +   IR  I +    +D D  
Sbjct: 228 LEEHANSLITVPGDKDGP--GGVLICSENYITY--KNQGEQSDIRCPIPRRRNDLD-DPE 282

Query: 215 R-----------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYL 257
           R                 +L+    G +  + +   +E VT ++++      +AS +  L
Sbjct: 283 RGMILVCSATHKTKSMFFFLVQTEQGDVFKVTLETNEEFVTEIRLKYFDTVPVASAMCVL 342

Query: 258 DNAVVYIGSSYGDSQLIKL-------------NLQPDAKGSYVEVLERYV---------- 294
               +++ S +G+  L ++             +  P  +G       R +          
Sbjct: 343 KTGFLFVASEFGNHYLYQIANLGDDDDEPEFSSAMPLEEGDTFFFAPRQLRNLVLVDEMD 402

Query: 295 NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSL 353
           +L PI+   V DL  +   Q+    G     +LR++R+G+ ++E A  EL G    +W++
Sbjct: 403 SLSPIMACQVADLAAEDTPQLYMACGRGSRSTLRVLRHGLEVSEMAVSELPGSPNAVWTV 462

Query: 354 RSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVT 413
           +   D+ FD +++VSF + T +L++   + +EE    GF   T TL C     + +VQ+ 
Sbjct: 463 KRRADENFDAYIIVSFSNATLVLSIG--ETVEEVSDSGFLGTTPTLSCSPLGDDAVVQIY 520

Query: 414 SGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD-GILTEVK 472
              VR + S  R   ++WK+P    +    AN  QV++A GGG LVY E+   G L E K
Sbjct: 521 PNGVRHIRSDKR--MHDWKAPEKKKIVKCAANQRQVVIALGGGELVYFEMDPTGHLNEHK 578

Query: 473 -HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGG-EIIP 530
              ++  ++ C+ +         S+  AVG+ TD +VRI SL   + + +  +     +P
Sbjct: 579 DRKEMNSDVLCMALANAPSGEQMSRFLAVGL-TDETVRIISLDTTDCLVQLKMQAIPAMP 637

Query: 531 RSVLLCAFE-----------------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSL 573
            S  LC  E                  + YL   L +G LL  +L+  TGE+ D +   L
Sbjct: 638 ES--LCIVEMGASDGGSSDEPAMNSLSMLYLNIGLQNGVLLRTVLDGVTGEMADTRARYL 695

Query: 574 GTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSL 633
           G +P+ L    ++    V A S R  + Y    +   + ++ + + +   F+S   P+ +
Sbjct: 696 GGKPVKLFKIRTRGNEAVLAMSSRSWLSYYYQNRFHLTPLSYESLEYASGFSSEQCPEGI 755

Query: 634 AIAKEGELTIGTIDDIQKLHIR-SIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESE 690
                  L I  ++ +  +  + S P+   PR+      S    +   ++ +  E+++
Sbjct: 756 VAISGNTLRILALEKLGAVFNQVSFPVEYTPRKFVIHPDSAHLIVVETEHNAYTEQTK 813



 Score = 81.6 bits (200), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 69/293 (23%), Positives = 135/293 (46%), Gaps = 39/293 (13%)

Query: 761  LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHG-HILAL 819
            L+L+ +        ++  F G+++  I + ++LY     D G ++L  +C +    +L +
Sbjct: 938  LELVHKTTVDNVPTAICGFQGRVIIGIGRILRLY-----DIGKKKLLRKCENKQIPLLIM 992

Query: 820  YVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNF 879
             ++  G  I V D+ +S+ ++ YK  E  +   A D    +++A+EILD +    A+   
Sbjct: 993  GIRVMGCRIYVSDVQESVYMVRYKRNENQLIIFADDTQPRYITAMEILDYNTVATADKCG 1052

Query: 880  NLFTVRKNSEGATDE------------ERGRL-------EVVGEYHLGEFVNRFRHGSLV 920
            N+  VR  S   +DE            +RG L       + +  +H+GE     +  +L+
Sbjct: 1053 NISVVRLAS-SISDEVDDDPTGNKSLWDRGLLNGASQKADFIVNFHIGEICTSIQKATLI 1111

Query: 921  MRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQW 979
                    G   ++++ TV G IGV+     HE+  F + L+ ++R     + G +H  +
Sbjct: 1112 -------PGGSESLVYATVTGTIGVLVPFTAHEEQDFFQHLEMHMRSENPPLCGRDHLSY 1164

Query: 980  RSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            RS+         KN  DGDL E +  +   +   I+  ++ +  E+ KR+E++
Sbjct: 1165 RSY-----YFPVKNVCDGDLCEQYNSIDIAKQKSIAADLDKTPAEVSKRLEDI 1212


>gi|340721347|ref|XP_003399083.1| PREDICTED: splicing factor 3B subunit 3-like [Bombus terrestris]
 gi|350406701|ref|XP_003487854.1| PREDICTED: splicing factor 3B subunit 3-like [Bombus impatiens]
          Length = 1217

 Score =  142 bits (357), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 170/774 (21%), Positives = 320/774 (41%), Gaps = 94/774 (12%)

Query: 1   MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
           +L V ++G I +L  FR  G  +D++ + ++  +  +L++   +  +  +   +   + G
Sbjct: 53  LLTVEVFGIIRSLMAFRLTGGTKDYIVVGSDSGRIVILEY-IPAKNVFEKVHQETFGKSG 111

Query: 61  -RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVL---DI 114
            R    GQ   IDP  R  +IG      L  ++  D + +L  +  +   +   L    +
Sbjct: 112 CRRIVPGQYLAIDPKGRAVMIGAIEKQKLVYILNRDPEARLTISSPLEAHKSNTLVYHTV 171

Query: 115 KFLYGCAKPTIVVLYQDNKDARHVKTYEVALKDK------------DFVEGPWSQNNLDN 162
               G   P    L  D ++A +  T + A+K +            + V   +S+  L+ 
Sbjct: 172 GVDVGFENPMFACLEIDYEEADNDPTGDAAVKTQQTLTLYELDLGLNHVVRKYSEP-LEE 230

Query: 163 GADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR-- 215
            A+ L+ VP     P   GVLI  E  + Y   N      IR  I +    +D D  R  
Sbjct: 231 HANFLVSVPGGNDGP--SGVLICSENYLTY--KNLGDQHDIRCPIPRRRNDLD-DPERGM 285

Query: 216 ---------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNA 260
                          +L     G +  + +  +++ VT +K++      +A+++  L   
Sbjct: 286 IFVCSATHKTKSMFFFLAQTEQGDIFKITLETDEDMVTEIKLKYFDTVPVAASMCVLKTG 345

Query: 261 VVYIGSSYGDSQLIKL-------------NLQPDAKGSYVEVLERYV----------NLG 297
            +++ S +G+  L ++             +  P  +G       R +          +L 
Sbjct: 346 FLFVASEFGNHYLYQIAHLGDDDDEPEFSSAMPLEEGDTFFFAPRPLRNLVLVDEMDSLS 405

Query: 298 PIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSS 356
           PI+   V DL  +   ++    G     +LR++R+G+ ++E A  EL G    +W+++  
Sbjct: 406 PIMACQVADLANEDTPELYITCGRGPRSTLRVLRHGLEVSEMAVSELPGNPNAVWTVKRR 465

Query: 357 TDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGS 416
            D+ +D +++VSF++ T +L++   + +EE    GF   T TL C     + LVQV    
Sbjct: 466 VDEEYDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSALGEDALVQVYPDG 523

Query: 417 VRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD-GILTE-VKHA 474
           +R + +  R   NEWK+P   ++     N  QV++A  GG LVY E+   G L E  +  
Sbjct: 524 IRHIRADKR--VNEWKAPGKKTIVKCAVNQRQVVIALTGGELVYFEMDPTGQLNEYTERK 581

Query: 475 QLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKE------------ 522
           ++  E+ C+ +  +      S   AVG+  D +VRI SL   + +               
Sbjct: 582 KMPSEVMCMALGNVAVGEQRSWFLAVGL-QDNTVRIISLDPSDCLAPRSMQALPAAAESL 640

Query: 523 ---HLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPIT 579
               +G +    S  L   +   YL   L +G LL  +L+  +G+L D +   LG++P+ 
Sbjct: 641 CIVEMGAKDANNSEDLSPQQSSLYLNIGLQNGVLLRTVLDPISGDLADTRTRYLGSRPVK 700

Query: 580 LRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEG 639
           L     +    V A S R  + Y    +   + ++ + +     F+S   P+ +      
Sbjct: 701 LFRIKMQGNQAVLAMSSRSWLSYYYQNRFHLTPLSYESLEFASGFSSEQCPEGIVAISTN 760

Query: 640 ELTIGTIDDIQKLHIR-SIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMH 692
            L I  ++ +  +  + S PL   PR+      S    I   ++ +  EE++  
Sbjct: 761 TLRILALEKLGAVFNQISFPLEYTPRKFAIHTDSAHLIIIETEHNAYTEETKQQ 814



 Score = 82.0 bits (201), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 69/294 (23%), Positives = 135/294 (45%), Gaps = 41/294 (13%)

Query: 761  LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHI--LA 818
            L+L+ +        ++  + G++L  + + ++LY     D G ++L  +C  + HI    
Sbjct: 937  LELVHKTTLDEVPLAICPYQGRVLVGVGRMLRLY-----DMGKKKLLRKC-ENKHIPNAV 990

Query: 819  LYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENN 878
            + +   G  I V D+ +S+  + YK +E  +   A D +  W++   +LD D    A+  
Sbjct: 991  VSINAIGQRIYVSDVQESVYAVRYKRQENQLIVFADDTHPRWITTTCVLDYDTVATADKF 1050

Query: 879  FNLFTVRKNSEGATDE------------ERGRL-------EVVGEYHLGEFVNRFRHGSL 919
             N+  +R  S G  D+            +RG L       + V  +H+GE V   +  +L
Sbjct: 1051 GNIAVIRLAS-GINDDVDEDPTGNKALWDRGLLNGASQKADTVACFHVGETVMSLQKATL 1109

Query: 920  VMRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQ 978
            +        G   ++++ T++G +GV+     HE + F + L+ ++R     + G +H  
Sbjct: 1110 I-------PGGSESLVYTTLSGTVGVLVPFTSHEDHDFFQHLEMHMRSEHPPLCGRDHLS 1162

Query: 979  WRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            +RS+         KN +DGDL E F  +  T+   IS  +  +  E+ K++E++
Sbjct: 1163 FRSY-----YYPVKNVIDGDLCEQFNSIEPTKQKSISGDLERTASEVSKKLEDI 1211


>gi|401882870|gb|EJT47110.1| hypothetical protein A1Q1_04103 [Trichosporon asahii var. asahii CBS
            2479]
          Length = 1107

 Score =  142 bits (357), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 204/874 (23%), Positives = 339/874 (38%), Gaps = 170/874 (19%)

Query: 217  LLGDHAGLLHLLVITHEKEKVTGLKIEL--LGETSIASTISYLDNAVVYIGSSYGDSQLI 274
            LLGD  G   L  +  E E +  +++ +  +G  S  S+++YLDN  ++  S+ GDS L+
Sbjct: 291  LLGDEYG--RLTAVGWEFEDMEKVQVGMIDMGVVSAPSSLTYLDNGYLFSASACGDSLLV 348

Query: 275  KLNL-------QPDAKGS----------------------YVEVLERYVNLGPIVDFCVV 305
             L L       QP  KG                        V+V ER++N+ P  DF +V
Sbjct: 349  YLVLPSPNSTRQPSGKGKEKAHDISDAGAYEIVTAEPQHGRVDVRERWMNIAPAKDFAIV 408

Query: 306  DLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVE-LQGIKGMWSLRSSTDDPFDTF 364
              +      VV  SG+    S R+VR+G+G     ++E + GI+ MW++ ++ D P    
Sbjct: 409  KEDDGRVSHVVVASGSASSNSFRVVRSGVGFENLMTIEEIPGIERMWTIPAA-DGPS--- 464

Query: 365  LVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTS 424
            L+VSF   T IL   +E E+   +     +   T        + L+QVT   +RL S  +
Sbjct: 465  LMVSFAYSTTIL--QIEPEVSVFKAADQVTAVPTFAAGLVDKSLLLQVTPEGIRLWSDLA 522

Query: 425  RELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLD 484
              +          + NV     +++  A   G++      DG +T  + +          
Sbjct: 523  AGM---------LAGNVDAPEDNRIATANVRGNIAVAAFRDGTVTLFRASSQ-------- 565

Query: 485  INPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEGI--- 541
                G         A+  W           +++L+  E      + +S+   A+      
Sbjct: 566  ----GLQEVVKGFLAIADWCG---------EIDLVAVEGADQGTVLKSIREAAYATSLQF 612

Query: 542  -------SYLLCALGDGHLLNFLLNMKTGELTD-RKKVSLGTQPITLRTF--SSKNTTHV 591
                   + LL  L DG  +++ + +   E +D R   SLG +P+ L     S     HV
Sbjct: 613  QESNNEPTRLLAGLSDGTFVSYSVKLNGAECSDSRHASSLGLRPLRLIALDISPNAEEHV 672

Query: 592  FAA--SDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDI 649
             AA  SDR ++++ S     +S+   K +                           +  +
Sbjct: 673  VAAGISDRLSLVFESRDHYEFSSSGKKGI-----------------------VFERLTSL 709

Query: 650  QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEES----EMHFVRLLDDQTFEFI 705
            +KL ++++ LG                +     +S   ++    +   V L +  T E +
Sbjct: 710  KKLQVQTLDLGNRSATRVAALPGYNLVVAETVTRSMDHQTGDVLQSSSVELRNATTLELL 769

Query: 706  STYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENE----------------PTKG 749
            S + L   E   S+ + +       Y  VGTA     ENE                  +G
Sbjct: 770  SEFQLPEREAVASVNAVTL--HGRKYILVGTAIF---ENEDALEDATLEDVTSFIATNRG 824

Query: 750  RILVFIVEDG---KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTREL 806
            R+L+F + +     L L+      G VY     +G L  A + K+ + +   +     E 
Sbjct: 825  RLLLFQINESAGPSLDLVTSMTFNGPVYDTVVIHGFLAVATSTKVSILRLTTQPPSLEEA 884

Query: 807  QS-ECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANW---MS 862
             S       H LA+    +   +VVGD M+SI +L    E G I    RD NA+    +S
Sbjct: 885  ASFAFAFETHHLAVVEIDKEKRLVVGDAMRSIIVLSVDPESGDIVGDQRDMNAHLVRCLS 944

Query: 863  AVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMR 922
            AV  ++  + +              ++G T              L E V R + G+L   
Sbjct: 945  AVHDVEPGVMI--------------ADGITP--------AASIGLSEDVTRLQPGTLA-- 980

Query: 923  LPDSDVGQI--PTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWR 980
             P S  G I    ++  TVNG +GVI  L       L+ LQ N+ K+ KG GGL   +W+
Sbjct: 981  -PVSAEGDILRADLLCTTVNGRLGVIGELGKGSIRTLDDLQRNMNKLYKGPGGL---EWK 1036

Query: 981  SFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEI 1014
               N     +   F+DGD ++ F  L  +  D+I
Sbjct: 1037 ESGNMLVPRETVGFIDGDFVQRFSSLDSSLQDKI 1070


>gi|307205956|gb|EFN84082.1| Splicing factor 3B subunit 3 [Harpegnathos saltator]
          Length = 1217

 Score =  141 bits (356), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 170/776 (21%), Positives = 322/776 (41%), Gaps = 98/776 (12%)

Query: 1   MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
           +L V ++G + +L  FR  G  +D++ + ++  +  +L++   +  +  +   +   + G
Sbjct: 53  LLTVEVFGIVRSLMAFRLTGGTKDYIVVGSDSGRIVILEY-IPAKNIFEKVHQETFGKSG 111

Query: 61  -RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVL---DI 114
            R    GQ   IDP  R  +IG      L  ++  D + +L  +  +   +   L    +
Sbjct: 112 CRRIVPGQYLAIDPKGRAVMIGAIEKQKLVYILNRDAEARLTISSPLEAHKSNTLVYHTV 171

Query: 115 KFLYGCAKPTIVVLYQDNKDARHVKTYEVALKDK------------DFVEGPWSQNNLDN 162
               G   P    L  D ++A    T + A+K +            + V   +S+  L+ 
Sbjct: 172 GVDVGFENPMFACLEIDYEEADSDPTGDAAVKTQQTLTLYELDLGLNHVVRKYSEP-LEE 230

Query: 163 GADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR-- 215
            A+ L+ VP     P   GVLI  E  + Y   N      IR  I +    +D D  R  
Sbjct: 231 HANFLVSVPGGNDGP--SGVLICSENYLTY--KNLGDQHDIRCPIPRRRNDLD-DPERGM 285

Query: 216 ---------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNA 260
                          +L     G +  + +  + + VT +K++      +A+++  L   
Sbjct: 286 IFVCSATHKTKSMFFFLAQTEQGDIFKITLETDDDMVTEIKLKYFDTVPVAASMCVLKTG 345

Query: 261 VVYIGSSYGDSQLIKL-------------NLQPDAKGSYVEVLERYV----------NLG 297
            +++ S +G+  L ++             +  P  +G       R +          +L 
Sbjct: 346 FLFVASEFGNHYLYQIAHLGDDDDEPEFSSAMPLEEGDTFFFAPRPLRNLVLVDEMDSLS 405

Query: 298 PIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSS 356
           PI+   V DL  +   Q+    G     +LR++R+G+ ++E A  EL G    +W+++  
Sbjct: 406 PIMACQVADLANEDTPQLYIACGRGPRSTLRVLRHGLEVSEMAVSELPGNPNAVWTVKRR 465

Query: 357 TDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGS 416
            D+ +D +++VSF++ T +L++   + +EE    GF   T TL C     + LVQV    
Sbjct: 466 VDEEYDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSALGEDALVQVYPDG 523

Query: 417 VRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD-GILTE-VKHA 474
           +R + +  R   NEWK+P   ++     N  QV++A  GG LVY E+   G L E  +  
Sbjct: 524 IRHIRADKR--VNEWKAPGKKTIVKCAVNQRQVVIALTGGELVYFEMDPTGQLNEYTERK 581

Query: 475 QLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSV- 533
           ++  E+ C+ +  +      S   AVG+  D +VRI SL   + +    +  + +P +  
Sbjct: 582 KMPSEVMCMALGNVAVGEQRSWFLAVGL-QDNTVRIISLDPSDCLAPRSM--QALPAAAE 638

Query: 534 LLCAFE-GIS---------------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQP 577
            LC  E G+                YL   L +G LL  +L+  +G+L D +   LG++P
Sbjct: 639 SLCIVEMGVKDADNSEDSAPQQSSLYLNIGLQNGVLLRTVLDPISGDLADTRTRYLGSRP 698

Query: 578 ITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAK 637
           + L     +    V A S R  + Y    +   + ++ + +     F+S   P+ +    
Sbjct: 699 VKLFRIRMQGNQAVLAMSSRSWLSYYYQNRFHLTPLSYESLEFASGFSSEQCPEGIVAIS 758

Query: 638 EGELTIGTIDDIQKLHIR-SIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMH 692
              L I  ++ +  +  + S PL   PR+      S    +   ++ +  +E++  
Sbjct: 759 TNTLRILALEKLGAVFNQVSFPLEYTPRKFAIHADSAHLVVIETEHNAYTDETKQQ 814



 Score = 78.6 bits (192), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 67/294 (22%), Positives = 134/294 (45%), Gaps = 41/294 (13%)

Query: 761  LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHI--LA 818
            L+L+ +        ++  + G++L  + + ++LY     D G ++L  +C  + HI    
Sbjct: 937  LELLHKSPLDEVPLAICPYQGRVLVGVGRMLRLY-----DMGKKKLLRKC-ENKHIPNAV 990

Query: 819  LYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENN 878
            + +   G  I V D+ +S+  + YK +E  +   A D +  W++   +LD D    A+  
Sbjct: 991  VSINAIGQRIYVSDVQESVYAVRYKRQENQLIVFADDTHPRWITTTCVLDYDTVATADKF 1050

Query: 879  FNLFTVRKNSEGATDE------------ERGRL-------EVVGEYHLGEFVNRFRHGSL 919
             N+  +R  + G  D+            +RG L       + V  +H+GE V   +  +L
Sbjct: 1051 GNIAVIRL-ATGINDDVDEDPTGNKALWDRGLLNGASQKADTVACFHVGETVMSLQKATL 1109

Query: 920  VMRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQ 978
            +        G   ++++ T++G +GV+     HE + F + L+ ++R     + G +H  
Sbjct: 1110 I-------PGGSESLVYTTLSGTVGVLVPFTSHEDHDFFQHLEMHMRSEHPPLCGRDHLS 1162

Query: 979  WRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            +RS+         KN +DGDL E F  +   +   IS  +  +  E+ K++E++
Sbjct: 1163 FRSY-----YYPVKNVIDGDLCEQFNSIEPAKQKSISGDLERTPSEVSKKLEDI 1211


>gi|268568396|ref|XP_002640241.1| C. briggsae CBR-TAG-203 protein [Caenorhabditis briggsae]
          Length = 1218

 Score =  139 bits (350), Expect = 9e-30,   Method: Compositional matrix adjust.
 Identities = 170/735 (23%), Positives = 307/735 (41%), Gaps = 88/735 (11%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESS---ELITRAMGDVSDRIGRP 62
           I+G + +L  FR     +DF+ + ++  +  +LQ++ E +    L     G    R   P
Sbjct: 59  IFG-VRSLLAFRLTAGTRDFIAVGSDSGRIVILQYNPEKTCFERLHQETFGKTGCRRIVP 117

Query: 63  TDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL--- 117
              G     DP  R  +IG      L  ++  D +  L  +  +   +   L    +   
Sbjct: 118 ---GHYLAGDPRGRALMIGAVERQKLVYIMNRDAEAHLTISSPLEAHKQHTLCYAMVGID 174

Query: 118 YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNGADL 166
            G   PT   L  D +DA +  T E A + +    F E     N+        L +  +L
Sbjct: 175 VGFENPTFACLEFDYEDADNDPTGEAAKRTQQTLTFYELDLGLNHVVRKYAEPLTDPGNL 234

Query: 167 LIPVP---PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR-------- 215
           LI VP       GV++  E  IVY   N      IR  I +    +D D  R        
Sbjct: 235 LIAVPGGNEGPSGVIVCCENYIVY--KNLGDQPDIRCPIPRRRNELD-DADRTMLIIATA 291

Query: 216 ---------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGS 266
                    +L+    G +  + +  +++ VT +K++       A+ +  L    +++ +
Sbjct: 292 THKTKNMFFFLIQAENGDIFKVTLETDEDLVTEMKLKYFDTVPPANALCILKAGFLFVAA 351

Query: 267 SYGDSQLIKLN-----------------------LQPDAKGSYVEVLERYVNLGPIVDFC 303
            +G+ +L ++                         +P    S + + +   +L P+ D  
Sbjct: 352 EFGNHELYQIASLGEGGDDEFSSAMGFGENDAAFFEPHELRSLIPI-DSMDSLSPLTDAV 410

Query: 304 VVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTDDPFD 362
           + D+ R+   Q+ T  G      ++++RNG+ I+E A  +L G    +W+++ + +D +D
Sbjct: 411 IGDIAREDAAQLFTLVGRGARSHMKVLRNGLEISEMAVSDLPGNPNAVWTVKKNIEDQYD 470

Query: 363 TFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSS 422
           +++VVSF++ T  L + + D +EE    GF   T T+ C     + LVQ+ S  +R + +
Sbjct: 471 SYIVVSFVNAT--LTLTIGDTVEEASDSGFLPTTPTIGCSMIGDDSLVQIYSEGIRHIRA 528

Query: 423 TSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTEVKHAQL-EYEI 480
             R   NEWK+PP   +     N  QV +A  GG LVY E+  +G L E    +L   +I
Sbjct: 529 DKR--INEWKAPPRRQIVKCAVNRRQVAVALSGGELVYFELDLNGTLNEFTERKLFNADI 586

Query: 481 SCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPRSVLLCAF- 538
           +C+  + I E    S+  A+G   D +VRI SL P+  L+          P S+LL    
Sbjct: 587 ACMTFSEISEGELNSRFLALGT-VDNAVRIISLDPNDMLMPLSTQNLPCPPESILLIDTP 645

Query: 539 ----EGIS--YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
               +G++  +L   L +G L    ++  TG + D +   LGT+P  L     +  + + 
Sbjct: 646 NEDGKGVASVHLNIGLQNGCLFRNTVDNVTGAIMDTRTRYLGTRP-KLFKVQVQGRSAIL 704

Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQ-K 651
             S R  ++Y   ++   + ++   + +   F S    + +       L I   + +   
Sbjct: 705 CTSSRSWLLYHFQRRFHLTPLSYANLEYAASFCSNQCAEGVVAISASTLRIIAAEKLGVA 764

Query: 652 LHIRSIPLGEHPRRI 666
            +++S      PRRI
Sbjct: 765 FNVQSFEHKMTPRRI 779



 Score = 74.7 bits (182), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 68/309 (22%), Positives = 143/309 (46%), Gaps = 39/309 (12%)

Query: 746  PTKGRILVFIVEDG--KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGT 803
            P +G +  F +     +   +   ET   V +++ F G  L    + +++Y     D G 
Sbjct: 921  PVRGCVYTFHLSPNGDRFDFLHRTETPLPVGAIHDFRGMALVGFGKFLRMY-----DIGQ 975

Query: 804  RELQSECGHHGHILALY-VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMS 862
            ++L ++C +    + +  +Q+ G  I+V D  +S+  L Y+  +  +   A D    ++S
Sbjct: 976  KKLLAKCENKNFPVNIVNIQSTGQRIIVSDSQESVHFLRYRKGDNQLVVFADDTTPRYVS 1035

Query: 863  AVEILDDDIYLGAENNFNLFTVR---KNSEGATDE--------ERG-------RLEVVGE 904
             V +LD      A+   NL  VR   + +E   D+        +RG       ++E+V  
Sbjct: 1036 CVCVLDYHTVAIADKFGNLSVVRLPERVNEDVQDDPTVSKSVWDRGWLNGASQKVELVAN 1095

Query: 905  YHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIAS-LPHEQYLFLEKLQTN 963
            + +G+ +   +  SL   +P    G    +++ T+ G IG + S +  ++  F   L+ +
Sbjct: 1096 FFIGDTITSLQKTSL---MP----GANEALVYTTIGGAIGCLVSFMSKDEVDFFTNLEMH 1148

Query: 964  LRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVE 1023
            +R     + G +H  +RS+    K+V     +DGD+ E F  +  ++  E+++ +  +V 
Sbjct: 1149 VRSEYPPLCGRDHLSYRSYYAPCKSV-----IDGDICEQFSLMELSKQKEVAEELGKTVS 1203

Query: 1024 ELCKRVEEL 1032
            E+ K++E++
Sbjct: 1204 EISKKLEDI 1212


>gi|66553024|ref|XP_623333.1| PREDICTED: splicing factor 3B subunit 3 isoform 1 [Apis mellifera]
 gi|380015815|ref|XP_003691890.1| PREDICTED: splicing factor 3B subunit 3-like [Apis florea]
          Length = 1217

 Score =  139 bits (349), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 170/774 (21%), Positives = 318/774 (41%), Gaps = 94/774 (12%)

Query: 1   MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
           +L V ++G I +L  FR  G  +D++ + ++  +  +L++   +  +  +   +   + G
Sbjct: 53  LLTVEVFGIIRSLMAFRLTGGTKDYIVVGSDSGRIVILEY-IPAKNVFEKVHQETFGKSG 111

Query: 61  -RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVL---DI 114
            R    GQ   IDP  R  +IG      L  ++  D + +L  +  +   +   L    +
Sbjct: 112 CRRIVPGQYLAIDPKGRAVMIGAIEKQKLVYILNRDPEARLTISSPLEAHKSNTLVYHTV 171

Query: 115 KFLYGCAKPTIVVLYQDNKDARHVKTYEVALKDK------------DFVEGPWSQNNLDN 162
               G   P    L  D ++A    T + A+K +            + V   +S+  L+ 
Sbjct: 172 GVDVGFENPMFACLEIDYEEADSDPTGDAAVKTQQTLTLYELDLGLNHVVRKYSEP-LEE 230

Query: 163 GADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR-- 215
            A+ L+ VP     P   GVLI  E  + Y   N      IR  I +    +D D  R  
Sbjct: 231 HANFLVSVPGGNDGP--SGVLICSENYLTY--KNLGDQHDIRCPIPRRRNDLD-DPERGM 285

Query: 216 ---------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNA 260
                          +L     G +  + +  +++ VT +K++      +A+++  L   
Sbjct: 286 IFVCSATHKTKSMFFFLAQTEQGDIFKITLETDEDMVTEIKLKYFDTVPVAASMCVLKTG 345

Query: 261 VVYIGSSYGDSQLIKL-------------NLQPDAKGSYVEVLERYV----------NLG 297
            +++ S +G+  L ++             +  P  +G       R +          +L 
Sbjct: 346 FLFVASEFGNHYLYQIAHLGDDDDEPEFSSAMPLEEGDTFFFAPRPLRNLVLVDEMDSLS 405

Query: 298 PIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSS 356
           PI+   V DL  +   Q+    G     +LR++R+G+ ++E A  EL G    +W+++  
Sbjct: 406 PIMACQVADLANEDTPQLYITCGRGPRSTLRVLRHGLEVSEMAVSELPGNPNAVWTVKRR 465

Query: 357 TDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGS 416
            D+ +D +++VSF++ T +L++   + +EE    GF   T TL C     + LVQV    
Sbjct: 466 VDEEYDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSALGEDALVQVYPDG 523

Query: 417 VRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKHA 474
           +R + +  R   NEWK+P   ++     N  QV++A  GG LVY E+   G L E  +  
Sbjct: 524 IRHIRADKR--VNEWKAPGKKTIVKCAVNQRQVVIALTGGELVYFEMDPTGQLNEYTERK 581

Query: 475 QLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKE------------ 522
           ++  E+ C+ +  +      S   AVG+  D +VRI SL   + +               
Sbjct: 582 KMPSEVMCMALGNVAIGEQRSWFLAVGL-QDNTVRIISLDPSDCLAPRSMQALPAAAESL 640

Query: 523 ---HLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPIT 579
               +G +    S  L   +   YL   L +G LL  +L+  +G+L D +   LG++ + 
Sbjct: 641 CIVEMGAKDANNSEELSLQQSSLYLNIGLQNGVLLRTVLDPISGDLADTRTRYLGSRAVK 700

Query: 580 LRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEG 639
           L     +    V A S R  + Y    +   + ++ + +     F+S   P+ +      
Sbjct: 701 LFRIKMQGNQAVLAMSSRSWLSYYYQNRFHLTPLSYESLEFASGFSSEQCPEGIVAISTN 760

Query: 640 ELTIGTIDDIQKLHIR-SIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMH 692
            L I  ++ +  +  + S PL   PR+      S    I   ++ +  EE++  
Sbjct: 761 TLRILALEKLGAVFNQISFPLEYTPRKFAIHSDSAHLIIIETEHNAYTEETKQQ 814



 Score = 80.1 bits (196), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 68/294 (23%), Positives = 134/294 (45%), Gaps = 41/294 (13%)

Query: 761  LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHI--LA 818
            L+L+ +        ++  + G++L  + + ++LY     D G ++L  +C  + HI    
Sbjct: 937  LELVHKTTLDEVPLAICPYQGRVLVGVGRMLRLY-----DMGKKKLLRKC-ENKHIPNAV 990

Query: 819  LYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENN 878
            + +   G  I V D+ +S+  + YK +E  +   A D +  W++   +LD D    A+  
Sbjct: 991  VSINAIGQRIYVSDVQESVYAVRYKRQENQLIVFADDTHPRWITTTCVLDYDTVATADKF 1050

Query: 879  FNLFTVRKNSEGATDE------------ERGRL-------EVVGEYHLGEFVNRFRHGSL 919
             N+  +R  S G  D+            +RG L       + V  +H+GE V   +  +L
Sbjct: 1051 GNIAVIRLAS-GINDDVDEDPTGNKALWDRGLLNGASQKADTVACFHVGETVMSLQKATL 1109

Query: 920  VMRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQ 978
            +        G   ++++ T++G +GV+     HE + F + L+ ++R     + G +H  
Sbjct: 1110 I-------PGGSESLVYTTLSGTVGVLVPFTSHEDHDFFQHLEMHMRSEHPPLCGRDHLS 1162

Query: 979  WRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            +RS+         KN +DGDL E F  +   +   IS  +  +  E+ K++E++
Sbjct: 1163 FRSY-----YYPIKNVIDGDLCEQFNSIEPAKQKSISSDLERTASEVSKKLEDI 1211


>gi|452002380|gb|EMD94838.1| hypothetical protein COCHEDRAFT_1128717 [Cochliobolus heterostrophus
            C5]
          Length = 1235

 Score =  138 bits (347), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 226/1004 (22%), Positives = 393/1004 (39%), Gaps = 161/1004 (16%)

Query: 155  WSQNNLDNGADLLIPVP---PPLCGVLIIGEETIVY------CSANAFKAIPIRPSITKA 205
            WS+  +D  A+ L  VP       GVL  GE++I Y       S+    AIP R   T+ 
Sbjct: 224  WSEP-VDRTANTLFRVPGGPSAPSGVLCCGEDSITYRRIFNNKSSVRRLAIPRREGATED 282

Query: 206  YGR-----------VDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTI 254
              R           +      YLL    G +  L +      V  +KI+      + ++I
Sbjct: 283  PNRKRMIVAGTLYSLKGGDFFYLLQTEDGDVFKLTVDAPNGTVENIKIKYFDTIPVTTSI 342

Query: 255  SYLDNAVVYIGSSYGDSQLIKL---------------NLQPDAKGSY------------V 287
              L    VY     GD  L +L                   D + S+            +
Sbjct: 343  CILRAGFVYAACESGDRILYELESLGDETDDPVFESSQFPVDPEASFAPPFFRPRALVNL 402

Query: 288  EVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVEL-QG 346
              +E   +L PI+D  V +   +   Q+ T +G     S R  RN + + +     L Q 
Sbjct: 403  TAVEAMPSLNPIMDMEVANPALEDAPQIYTINGTGGRSSFRTTRNALEVLDLIESPLPQN 462

Query: 347  IKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIY 406
               +W+ + +++D  DT +V+     +R L + + D++EE    GF   T TL       
Sbjct: 463  ASDVWTTKLTSEDETDTLIVLCL--HSRTLVLKIGDDVEEASNTGFLPDTNTLGVQQFGE 520

Query: 407  NQLVQVTSGSVRLV---------SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGH 457
            + ++Q+    +R +         +S +     +W+ P   ++     N  QV +A   G 
Sbjct: 521  DCIIQIHPKGIRHIQGIQFPNDDASATHASLTDWQPPAHRTIVACATNNRQVAIALSSGQ 580

Query: 458  LVYLEI-GDGILTEVKHA-QLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-P 514
            ++Y E   DG L   +    L+  I+CL +  + E    +   AVG  +D +VRI++L P
Sbjct: 581  ILYFECDSDGSLAMAEEEIVLDSTINCLAMPDVPEGSVRAFFLAVGC-SDQTVRIYNLSP 639

Query: 515  DL--NLITKEHLGGEIIPRSVLLC------AFEGIS-YLLCALGDGHLLNFLLNMKTGEL 565
            D+  N++    +     P S L        +  G S +L   L  G  +  +L+  TG++
Sbjct: 640  DMEGNILRSISVQALTSPPSDLTVNLMTDKSSRGYSQFLHIGLRSGVYIRSVLDEMTGDI 699

Query: 566  TDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKK--LLYSNVNLKEVSHMCP 623
             D ++  LG +PI     +      + A + RP + Y+  +   L  + +N         
Sbjct: 700  GDTRRRFLGPEPIKFAKVTVAGEPAILAMTSRPWLGYTHPRTGVLQLTPLNYISFKSAWN 759

Query: 624  FNSAAFPDSLAIAKEGELTIGTIDDI-QKLHIRSIPL--------GEHPRRICHQEQSRT 674
            F+ + F   + ++   EL I T +D+       SIPL        G H + + +  QS  
Sbjct: 760  FDGSQFKGIICVSAN-ELRIFTFNDLTDNTTYESIPLKYTPRKMVGYHDQGVFYVIQSDN 818

Query: 675  FAICSLKNQSC---------------------------------------AEESEMHFVR 695
              I + + Q                                         A+ S    ++
Sbjct: 819  NTISADRRQQLIAQSGGKKEDGTNGSMETEQSNGATDSDEFPAVDFGYPRAQGSWASCIQ 878

Query: 696  LLDDQTFEFISTYPLDTFEYGCSILSCSF----SDDSNVYYCVGTAYVL---PEENEPTK 748
            ++D  T E   T+ ++      S++S +     S +   +  VGTA  L   P +     
Sbjct: 879  VVDPVT-EKAVTHTIE-LNGNISLVSAALVFFESRNDEAFLAVGTAKDLSFTPYKFSSAS 936

Query: 749  GRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQS 808
             +I        +L+   E        +L AF GKLLA I + + LY   ++    R+ Q+
Sbjct: 937  IQIYKINPTGRELEFFHETTVSDPPLALLAFKGKLLAGIGRHLCLYDCGMKS-VLRKAQA 995

Query: 809  ECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHE--EGAIEERARDYNANWMSAVEI 866
                   I  L  +T+G  +VV D  +S++ +++K +     +     D  A   +A E+
Sbjct: 996  PNCVPTRITGL--KTQGSRLVVSDQAQSVTYVVHKDQVHPNRLIPFVDDTVARHTTASEM 1053

Query: 867  LDDDIYLGAENNFNLFTVR---KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRL 923
            LD D  +G +   N++ VR   K SE + +   G   +V + +LG   NR     L+   
Sbjct: 1054 LDYDTTVGGDKFGNIWLVRCPQKVSESSDESPDGSDLLVDKSYLGGTPNRL---DLIAHY 1110

Query: 924  PDSDV-----------GQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGV 971
              +D+           G    V +  + G +G +        +   ++L+  LR   K +
Sbjct: 1111 FTNDIPVSIQKTVLLSGGERVVFWAGLQGTLGALIPFNSRRSHKMFQQLELQLRSDDKPL 1170

Query: 972  GGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEIS 1015
             G +H  +RS+    K+V     +DGDLIE FL L R + + I+
Sbjct: 1171 SGRDHLAFRSYFAPVKSV-----IDGDLIERFLVLPRDKRESIA 1209


>gi|452845193|gb|EME47126.1| hypothetical protein DOTSEDRAFT_69180 [Dothistroma septosporum NZE10]
          Length = 1223

 Score =  138 bits (347), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 272/1197 (22%), Positives = 474/1197 (39%), Gaps = 211/1197 (17%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            +LD  ++G +  +  FR  G + D L ++++  +  V+ +DAE +      +        
Sbjct: 67   VLDQNVFGIVRGVTAFRIPGTSTDQLIVSSDSGRVAVVNYDAEKNRFNKVHLETYGKSGV 126

Query: 61   RPTDNGQIGIIDPDCRLIGLH-------LY------DGLFKVIPFDNKGQLKE------A 101
            R T  GQ    D   R I L        +Y      DG  ++       Q         A
Sbjct: 127  RRTIPGQYLASDTRGRCIMLASAEKNKVVYMMNRNADGTIQISSPHEANQWGSLCFAVCA 186

Query: 102  FNIRLEE--LQVLDIKFLYGCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNN 159
             +   E      L++ +    + PT  +     K  + +  Y V L     V+  WS + 
Sbjct: 187  LDTGWEPPVFAALEVDYTEAESDPTGTM---HEKREKQLVYYTVDLGLNHVVKS-WS-DT 241

Query: 160  LDNGADLLIPVP-----PPLCGVLIIGEETIVY-CSANAFKAIPI--RPSITKAYGRVDA 211
            +D  A+++  VP     P   GVL+  E+ I Y    NA  +IPI  R   T+     D 
Sbjct: 242  VDYTANMIFGVPGGQDGP--SGVLVCAEDRIYYRHDKNASLSIPIPRRKGSTE-----DP 294

Query: 212  DGSRYLLGDHAGLLHLLVITHE------KEKVTGLKIEL-LGETSIASTIS--------Y 256
            +  R ++   AG LHL    HE       E     K+ + +GE +   T S        Y
Sbjct: 295  NRKRTIV---AGCLHLAKTRHEFFFLLQTEDGDVFKLSMAMGEDAQGRTTSNPERIILKY 351

Query: 257  LDNAVV------------YIGSSYGDSQLIKLN-----LQPDAKGSYV------EVLERY 293
             D   V            Y+ +  G+SQL  ++     L+ +   ++       E  +RY
Sbjct: 352  YDTFPVARQMLLHKKGYLYVATENGNSQLFHIDDLADDLEFEPHNTFTSDDISPEPGDRY 411

Query: 294  -------------------VNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGI 334
                                 L P++   V +L  +   Q+    G       + +R+G+
Sbjct: 412  EPMYFQPRELTMTHLAVDIPGLHPLLKTKVDNLTNEDAPQIYAIQGTGNKSQFKTIRHGL 471

Query: 335  GI-----NEQASVELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEI 389
             +     N   +V    I   W+ +  + D    +L++S       +A ++ D +E+ E 
Sbjct: 472  DVEVLINNSMGNVPYDNI---WTFKHRSSDDHHKYLLLSSSYGDLTIACSIGDSVEQIES 528

Query: 390  EGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQV 449
              F     T+         LVQV +  +R +  +     NEW+SP   +V VA+AN  Q+
Sbjct: 529  SPFLENRATVHAQQMGDATLVQVHARGIRSILESG--AHNEWQSPAHRTVVVASANERQL 586

Query: 450  LLATGGGHLVYLEIG-DGILTEVKH-AQLEYEISCLDINPIGENPSYSQIAAVGMWTDIS 507
            LLA     L +  +G DGIL +++   ++  +I+ L +         ++ A VG   D +
Sbjct: 587  LLALSSAELAFFFMGDDGILNQLEEMPEMSGKITALSVGQTPRGRQQARYAVVGC-DDCT 645

Query: 508  VRIFSLP-DLNLITKEHLGGEIIPRSVLLCAFEG------ISYLLCALGDGHLLNFLLNM 560
            +R+ S+  D  L  +       IP S+ +           ++ +   L  G  L  +++ 
Sbjct: 646  IRVLSIELDSPLEPRSVQALSAIPTSLEVVEMLDPASNTIVNVVHIGLQSGLYLRAIIDE 705

Query: 561  KTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSH 620
             TGEL D +   LGTQP  L      +   + A S RP + ++  +  LY+   L     
Sbjct: 706  TTGELGDVRTKFLGTQPPRLCPVEVNDQDCILACSSRPWLGFNHPQNNLYTVTPLITE-- 763

Query: 621  MCPFNSA-AF--PD--SLAIAKEGELTIGTIDDIQ-KLHIRSIPLGEHPRRICHQEQSRT 674
              P N+A +F  PD   L   +   L I T+  I+ +L   +I L   PR +        
Sbjct: 764  --PINAARSFISPDLSGLCAIQGSSLLIFTVPSIEGRLSHSTIDLNYTPRSMTRNPWYPI 821

Query: 675  FAIC-------------SLKNQSCAEES-----EMHF------------VRLLDDQTFE- 703
            + I               L+ +S  E+      E H             ++ +D  T   
Sbjct: 822  WYIAESDGNALSKATRDQLRGKSIEEDEGATALEKHLGLPRGVQHWASCIQAVDPVTKNA 881

Query: 704  FISTYPLDTFEYGCSILSCSF-SDDSNVYYCVGTA-YVLPEENEPTKGRILVF-IVEDG- 759
             +ST  L   E        +F S D  V+  VGT  ++ P       G + ++ +VEDG 
Sbjct: 882  VVSTVELAENEAALCCTCVAFESRDWEVFLAVGTGQHMQPGTGVQAVGYVHIYKLVEDGT 941

Query: 760  KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLY----KWMLRD-DGTRELQSECGHHG 814
            KL+ + + +    VY++ AFNG+L   +  ++ +Y    K MLR   GT           
Sbjct: 942  KLEFVHKTKFDLPVYTVLAFNGRLALGVGNELFIYDMGIKAMLRKARGT-------ATPN 994

Query: 815  HILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG 874
             I++L  Q  G+ IV GD+ + I+ ++YK +   +     D    W +   ++D +   G
Sbjct: 995  QIVSLEAQ--GNRIVCGDVSEGITYVVYKPKFNRMIPFVDDVVQRWTTCTTMVDYETAAG 1052

Query: 875  AENNFNLFTVR---KNSEGATDEERG---------------RLEVVGEYHLGEFVNRFRH 916
             +   NL+ VR   + S+ A +E  G               RL++   Y   +     + 
Sbjct: 1053 GDKFGNLWVVRCPEQPSQEADEEGAGGFIMNERSYLNGAPYRLDLRAHYFCQDIPTSMQR 1112

Query: 917  GSLVMRLPDSDVGQIPTVIFGTVNGVIGVIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLN 975
             +LV        G    + +  + G +G++   +  E   F  +L+  +R     + G +
Sbjct: 1113 TALV-------AGGQELLFWSGLQGTLGMLVPFVTREDVEFFTQLEQQMRAEDPPLAGRD 1165

Query: 976  HEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            H  +RS+      V  K  +DGDL E F+ LS     +I+  ++  V+E+ K+++E+
Sbjct: 1166 HLMYRSY-----YVPVKGVIDGDLCERFMHLSYDGKQKIAAEVDRGVKEIEKKIQEM 1217


>gi|307166104|gb|EFN60356.1| Splicing factor 3B subunit 3 [Camponotus floridanus]
          Length = 1201

 Score =  137 bits (346), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 170/771 (22%), Positives = 317/771 (41%), Gaps = 104/771 (13%)

Query: 1   MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
           +L V ++G I +L  FR  G  +D++ + ++  +  +L++   +  +  +   +   + G
Sbjct: 53  LLTVEVFGIIRSLMAFRLTGATKDYIVVGSDSGRIVILEY-IPAKNMFDKVHQETFGKSG 111

Query: 61  -RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYG 119
            R    GQ   IDP  R + +     L    P +         N  +     +D+ F   
Sbjct: 112 CRRIVPGQYLAIDPKGRAVMIEAR--LTISSPLEAHKS-----NTLVYHTVGVDVGF--- 161

Query: 120 CAKPTIVVLYQDNKDARHVKTYEVALKDK------------DFVEGPWSQNNLDNGADLL 167
              P    L  D ++A    T + A+K +            + V   +S+  L+  A+ L
Sbjct: 162 -DNPMFACLEIDYEEADSDPTGDAAVKTQQTLTLYELDLGLNHVVRKYSEP-LEEHANFL 219

Query: 168 IPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR------- 215
           + VP     P   GVLI  E  + Y   N      IR  I +    +D D  R       
Sbjct: 220 VSVPGGNDGP--SGVLICSENYLTY--KNLGDQHDIRCPIPRRRNDLD-DPERGMIFVCS 274

Query: 216 ----------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIG 265
                     +L     G +  + +  +++ VT +K++      +A+++  L    +++ 
Sbjct: 275 ATHKTKSMFFFLAQTEQGDIFKITLETDEDMVTEIKLKYFDTVPVAASMCVLKTGFLFVA 334

Query: 266 SSYGDSQLIKL-------------NLQPDAKGSYVEVLERYV----------NLGPIVDF 302
           S +G+  L ++             +  P  +G       R +          +L PI+  
Sbjct: 335 SEFGNHYLYQIAHLGDDDDEPEFSSAMPLEEGDTFFFAPRPLRNLVLVDEMDSLSPIMAC 394

Query: 303 CVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTDDPF 361
            V DL  +   Q+    G     +LR++R+G+ ++E A  EL G    +W+++   D+ +
Sbjct: 395 QVADLANEDTPQLYIACGRGPRSTLRVLRHGLEVSEMAVSELPGNPNAVWTVKRRIDEEY 454

Query: 362 DTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVS 421
           D +++VSF++ T +L++   + +EE    GF   T TL C     + LVQV    +R + 
Sbjct: 455 DAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSALGEDALVQVYPDGIRHIR 512

Query: 422 STSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD-GILTE-VKHAQLEYE 479
           +  R   NEWK+P   ++     N  QV++A  GG LVY E+   G L E  +  ++  E
Sbjct: 513 ADKR--VNEWKAPGKKTIVKCAVNQRQVVIALTGGELVYFEMDPTGQLNEYTERKKMPSE 570

Query: 480 ISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSV-LLCAF 538
           + C+ +  +      S   AVG+  D +VRI SL   + +    +  + +P +   LC  
Sbjct: 571 VMCMALGNVAVGEQRSWFLAVGL-QDNTVRIISLDPSDCLAPRSM--QALPAAAESLCIV 627

Query: 539 EGIS----------------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRT 582
           E  +                YL   L +G LL  +L+  +G+L D +   LG++P+ L  
Sbjct: 628 EMGAKEADNSEDSAPQQSSLYLNIGLQNGVLLRTVLDPISGDLADTRTRYLGSRPVKLFR 687

Query: 583 FSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELT 642
              +    V A S R  + Y    +   + ++ + +     F+S   P+ +       L 
Sbjct: 688 IRMQGNQAVLAMSSRSWLSYYHQNRFHLTPLSYESLEFASGFSSEQCPEGIVAISTNTLR 747

Query: 643 IGTIDDIQKLHIR-SIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMH 692
           I  ++ +  +  + S PL   PR+      S    I   ++ +  EE++  
Sbjct: 748 ILALEKLGAVFNQVSFPLEYTPRKFAIHTDSAHLVIIETEHNAYTEETKQQ 798



 Score = 74.3 bits (181), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 64/276 (23%), Positives = 127/276 (46%), Gaps = 41/276 (14%)

Query: 779  FNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHI--LALYVQTRGDFIVVGDLMKS 836
            + G++L  + + ++LY     D G ++L  +C  + HI    + +   G  I V D+ +S
Sbjct: 939  YQGRVLVGVGRMLRLY-----DMGKKKLLRKC-ENKHIPNAVVSINAIGQRIYVSDVQES 992

Query: 837  ISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDE-- 894
            +  + YK +E  +   A D +  +++   +LD D    A+   N+  +R  + G  D+  
Sbjct: 993  VYAVRYKRQENQLIVFADDTHPRFITTTCVLDYDTVATADKYGNIAVIRL-ATGINDDVD 1051

Query: 895  ----------ERGRL-------EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFG 937
                      +RG L       + V  +H+GE V   +  +L+        G   ++++ 
Sbjct: 1052 EDPTGNKALWDRGLLNGASQKADTVACFHVGETVMSLQKATLI-------PGGSESLVYT 1104

Query: 938  TVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLD 996
            T++G +GV+     HE + F + L+ ++R     + G +H  +RS+         KN +D
Sbjct: 1105 TLSGTVGVLVPFTSHEDHDFFQHLEMHMRSEHPPLCGRDHLSFRSY-----YYPVKNVID 1159

Query: 997  GDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            GDL E F  +   +   IS  +  +  E+ K++E++
Sbjct: 1160 GDLCEQFNSIEPVKQKSISGDLERTPSEVSKKLEDI 1195


>gi|302654421|ref|XP_003019018.1| hypothetical protein TRV_07031 [Trichophyton verrucosum HKI 0517]
 gi|291182708|gb|EFE38373.1| hypothetical protein TRV_07031 [Trichophyton verrucosum HKI 0517]
          Length = 802

 Score =  137 bits (345), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 175/701 (24%), Positives = 296/701 (42%), Gaps = 100/701 (14%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
           ++G I +L  FR  G ++D++ I ++  +  ++++   +     R   +   + G R   
Sbjct: 64  VFGIIRSLAAFRLAGSSKDYIIIGSDSGRITIVEY-VPAQNRFNRIHLETFGKSGVRRVV 122

Query: 65  NGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEE-------LQVLDIK 115
            GQ   +DP  R  LI     + L  V+  + + +L  +  +           L  LD+ 
Sbjct: 123 PGQYLAVDPKGRACLIASVEKNKLVYVLNRNAQAELTISSPLEAHRPQTVVFALTALDVG 182

Query: 116 F---LYGCAKPTIVVLYQDNKDARHVKT------YEVALKDKDFVEGPWSQNNLDNGADL 166
           +   ++   +     + QD     + +T      YE+ L     V   W+ + +D  A +
Sbjct: 183 YENPIFAALEVEYTEVDQDPTGQAYEETEKMLVYYELDLGLNHVVRR-WA-DPVDRTASM 240

Query: 167 LIPVP-----PPLCGVLIIGEETIVYCSAN--AFKA-IPIR--PSITKAYGRVDADGSR- 215
           L  VP     P   GVL+  E+ IVY  +N  AF+  IP R  P+      R    G   
Sbjct: 241 LFQVPGGADGP--SGVLVCAEDNIVYRHSNQDAFRVPIPRRRGPTENPERKRCITAGVMH 298

Query: 216 -------YLLGDHAGLLH---LLVITHEKEKVTG----LKIELLGETSIASTISYLDNAV 261
                  +LL    G L    + ++  E EK TG    LK++      +AS++  L +  
Sbjct: 299 KMRGAFFFLLQSEDGDLFKVTMEMVEDENEKATGEVKRLKLKYFDTVPLASSLCILKSGF 358

Query: 262 VYIGSSYGD-----------------------SQLIKLNLQP----DAKGSYVEVLERYV 294
           +++ S  G+                       S +I   L P          + ++E   
Sbjct: 359 LFVASETGNQHFYQFEKLGDDDDEIEFISDDYSAVISEPLPPVYFRPRPAENLNLVESIA 418

Query: 295 NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSL 353
           +L P++   + ++  +   Q+ T  G     S R +++G+ ++E    EL  +   +W+ 
Sbjct: 419 SLNPLMAASIANITEEDAPQIYTLCGTSARSSFRTLKHGLEVSEIVESELPSVPSAVWTT 478

Query: 354 RSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVT 413
           + S +D FD ++V+SF + T +L++   + +EE    GF S   TL       + L+QV 
Sbjct: 479 KLSRNDQFDAYIVLSFSNGTLVLSIG--ETVEEVTDTGFLSSAPTLAVQQLGEDSLIQVH 536

Query: 414 SGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTEV- 471
              +R + +  R   NEW +P   S+  AT N  QV +A   G +VY E+  DG L E  
Sbjct: 537 PKGIRHIHADQR--VNEWPAPQHRSIVAATTNERQVAIALSSGEIVYFEMDTDGSLAEYD 594

Query: 472 KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIP 530
           +  Q+   ++CL +  + E    S   AVG   D +VRI SL PD  L  K        P
Sbjct: 595 EKRQMSGTVTCLSLGEVPEGRGRSSFLAVGC-DDSTVRILSLDPDSTLENKSVQALTSAP 653

Query: 531 RSV-LLCAFEGIS-----YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFS 584
            ++ ++   +  S     YL   L  G  L  +L+  TGEL+D +   LG +P+ L + S
Sbjct: 654 SALSIMSMIDSTSGGSTLYLHIGLYSGIYLRTVLDEVTGELSDTRTRFLGVKPVKLFSVS 713

Query: 585 SKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFN 625
            K    V A S R          L YS+V  K  + + P N
Sbjct: 714 VKEQRAVLALSSR--------SWLGYSDVQTKSFT-LTPLN 745


>gi|357111224|ref|XP_003557414.1| PREDICTED: DNA damage-binding protein 1-like [Brachypodium
           distachyon]
          Length = 1356

 Score =  137 bits (344), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 122/464 (26%), Positives = 206/464 (44%), Gaps = 74/464 (15%)

Query: 295 NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVE--LQGIKGMWS 352
           N+GPI+D  + D   + Q Q+ +CSG   +GSLR++RNGI + +    E   QG+ G+W+
Sbjct: 478 NVGPILDLAIADYHGEKQDQMFSCSGMCPEGSLRVIRNGINVEKLLRTEPIYQGVTGLWT 537

Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
           LR    D + +FLV++F+ ETRIL++ L    + ++  GF     TL C       LVQ+
Sbjct: 538 LRMKRTDMYHSFLVLAFVEETRILSVGLSFN-DISDAVGFQPDVCTLACGLIADGVLVQI 596

Query: 413 TSGSVRLVSSTSRELRN--EWKSP------PGYSVNVATANASQVLLATGGGHLVYLEIG 464
            S  V+L   T+          SP      P  +++V     + V +AT     +Y+   
Sbjct: 597 HSKGVKLCLPTAYAHPEGAPLTSPVCVDWYPDVTISVGAVGHNIVAVATSNPCCLYILSV 656

Query: 465 DGI------LTEVKHAQLEYEISCLDI-------NPIGENPSYSQIAAVGMWTDISVRIF 511
             +      L E++H QL YE+SC+ I       +P+    ++ +     +   + VR+F
Sbjct: 657 RPLSSFQYELYEIQHVQLRYEVSCISIPEEDSRRSPVAVRRAFGRGKRNNLPAKVDVRMF 716

Query: 512 SL-----PDLNLITKE-----------------HLGGEI---IPRSVLLCAFEGISYLLC 546
           ++     P + +I+ E                   G  +   IP SV   A E   Y+L 
Sbjct: 717 AVIGTHKPSVEVISLEPGEAFMLLSIGSISVNNSFGAPVSGRIPESVRFVASERF-YILA 775

Query: 547 ALGDGHLLNFLLNMKTGE---LTDR--KKVS---------------LGTQPITLRTFSSK 586
            L +G LL F    +T E   L D   K+ S               +G  P+ L   S  
Sbjct: 776 GLRNGMLLRF--ESETSEEHYLPDSFYKESSTHSVNTLLQLIAMRHIGITPVGLVPLSDS 833

Query: 587 NTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTI 646
             + +   SDR  ++++S   L YS+++    SH+ P +S   P  L    E  L +  +
Sbjct: 834 ANSDIILLSDRSWLLHASRHSLAYSSISFLPASHVTPVSSMDCPSGLLFVAENCLHLVEM 893

Query: 647 DDIQKLHIRSIPLGEHPRRICHQEQSRTFAI--CSLKNQSCAEE 688
              ++L+ + + +   PR++ +   SRT  +    L   SC+ +
Sbjct: 894 VHGKRLNAQKLSIEGTPRKVLYHSDSRTLLVMRTGLTGASCSSD 937


>gi|323454388|gb|EGB10258.1| hypothetical protein AURANDRAFT_23619 [Aureococcus anophagefferens]
          Length = 1212

 Score =  137 bits (344), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 106/374 (28%), Positives = 176/374 (47%), Gaps = 12/374 (3%)

Query: 299 IVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSST 357
           I D    DL  +   QV    G     SLR++R+G+ ++E A  EL G    +W++R   
Sbjct: 415 ITDVYCGDLAGEEAPQVYALCGKGHRSSLRVLRHGVAVSEMAVSELPGRPSAVWTVRGRH 474

Query: 358 DDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSV 417
           D+P+D ++VVSF + T +L++   + +EE    GF +   TL       N L+QV    +
Sbjct: 475 DEPYDKYIVVSFTNATLVLSIG--ETVEEVTDSGFLATAPTLDVALLADNALLQVHGEGI 532

Query: 418 RLVSSTSRELR-NEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTEVKHAQ 475
           R V     +LR +EWK+P   ++  A AN  QV  A  GG ++Y E+   G L E+   +
Sbjct: 533 RHVRG---DLRISEWKTPGRKAIEKAAANERQVAAALAGGEVIYFELDASGALAELGTKE 589

Query: 476 LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPRSVL 534
           L  E++CLD+  +    + +   A+G W D S+R+ SL PD  L+    +       SV 
Sbjct: 590 LGVEVACLDVGVVPAGRARAPFLALGGW-DGSLRLLSLAPDELLVQVATMQLGARAESVR 648

Query: 535 LC-AFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFA 593
            C   +G   +   L +G L    ++  TG+L D +   LG++ + L          + A
Sbjct: 649 FCETPDGRLGVAAGLANGVLQRAAVDASTGQLGDARARFLGSRAVRLFRVDVGGAPGLLA 708

Query: 594 ASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDI-QKL 652
            S R  + Y+   +L  + ++   + H   F S   P+ +       L I   D + +  
Sbjct: 709 LSSRAWLCYAHAGRLETAPLSYDALEHAAGFKSEQCPEGVVAIAGSTLRIFVPDKLGETF 768

Query: 653 HIRSIPLGEHPRRI 666
           +  ++PL   PRR+
Sbjct: 769 NQSALPLRYTPRRL 782


>gi|302504587|ref|XP_003014252.1| hypothetical protein ARB_07557 [Arthroderma benhamiae CBS 112371]
 gi|291177820|gb|EFE33612.1| hypothetical protein ARB_07557 [Arthroderma benhamiae CBS 112371]
          Length = 762

 Score =  136 bits (343), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 175/701 (24%), Positives = 296/701 (42%), Gaps = 100/701 (14%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
           ++G I +L  FR  G ++D++ I ++  +  ++++   +     R   +   + G R   
Sbjct: 64  VFGIIRSLAAFRLAGSSKDYIIIGSDSGRITIVEY-VPAQNRFNRIHLETFGKSGVRRVV 122

Query: 65  NGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEE-------LQVLDIK 115
            GQ   +DP  R  LI     + L  V+  + + +L  +  +           L  LD+ 
Sbjct: 123 PGQYLAVDPKGRACLIASVEKNKLVYVLNRNAQAELTISSPLEAHRPQTVVFALTALDVG 182

Query: 116 F---LYGCAKPTIVVLYQDNKDARHVKT------YEVALKDKDFVEGPWSQNNLDNGADL 166
           +   ++   +     + QD     + +T      YE+ L     V   W+ + +D  A +
Sbjct: 183 YENPIFAALEVEYTEVDQDPTGQAYEETEKMLVYYELDLGLNHVVRR-WA-DPVDRTASM 240

Query: 167 LIPVP-----PPLCGVLIIGEETIVYCSAN--AFKA-IPIR--PSITKAYGRVDADGSR- 215
           L  VP     P   GVL+  E+ IVY  +N  AF+  IP R  P+      R    G   
Sbjct: 241 LFQVPGGADGP--SGVLVCAEDNIVYRHSNQDAFRVPIPRRRGPTENPERKRCITAGVMH 298

Query: 216 -------YLLGDHAGLLH---LLVITHEKEKVTG----LKIELLGETSIASTISYLDNAV 261
                  +LL    G L    + ++  E EK TG    LK++      +AS++  L +  
Sbjct: 299 KMRGAFFFLLQSEDGDLFKVTMEMVEDENEKATGEVKRLKLKYFDTVPLASSLCILKSGF 358

Query: 262 VYIGSSYGD-----------------------SQLIKLNLQP----DAKGSYVEVLERYV 294
           +++ S  G+                       S +I   L P          + ++E   
Sbjct: 359 LFVASETGNQHFYQFEKLGDDDDEIEFISDDCSAVISEPLPPVYFRPRPAENLNLVESIA 418

Query: 295 NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSL 353
           +L P++   + ++  +   Q+ T  G     S R +++G+ ++E    EL  +   +W+ 
Sbjct: 419 SLNPLMAASIANITEEDAPQIYTLCGTGARSSFRTLKHGLEVSEIVESELPSVPSAVWTT 478

Query: 354 RSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVT 413
           + S +D FD ++V+SF + T +L++   + +EE    GF S   TL       + L+QV 
Sbjct: 479 KLSRNDQFDAYIVLSFSNGTLVLSIG--ETVEEVTDTGFLSSAPTLAVQQLGEDSLIQVH 536

Query: 414 SGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTEV- 471
              +R + +  R   NEW +P   S+  AT N  QV +A   G +VY E+  DG L E  
Sbjct: 537 PKGIRHIHADQR--VNEWPAPQHRSIVAATTNERQVAIALSSGEIVYFEMDTDGSLAEYD 594

Query: 472 KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIP 530
           +  Q+   ++CL +  + E    S   AVG   D +VRI SL PD  L  K        P
Sbjct: 595 EKRQMSGTVTCLSLGEVPEGRVRSSFLAVGC-DDSTVRILSLDPDSTLENKSVQALTSAP 653

Query: 531 RSV-LLCAFEGIS-----YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFS 584
            ++ ++   +  S     YL   L  G  L  +L+  TGEL+D +   LG +P+ L + S
Sbjct: 654 SALSIMSMIDSTSGGSTLYLHIGLYSGIYLRTVLDEVTGELSDTRTRFLGVKPVKLFSVS 713

Query: 585 SKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFN 625
            K    V A S R          L YS+V  K  + + P N
Sbjct: 714 VKEQRAVLALSSR--------SWLGYSDVQTKSFT-LTPLN 745


>gi|452986188|gb|EME85944.1| hypothetical protein MYCFIDRAFT_59215 [Pseudocercospora fijiensis
            CIRAD86]
          Length = 1223

 Score =  136 bits (343), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 228/1017 (22%), Positives = 412/1017 (40%), Gaps = 176/1017 (17%)

Query: 155  WSQNNLDNGADLLIPVP-----PPLCGVLIIGEETIVYCSANAFK-AIPI--RPSITKAY 206
            WS+  +D  A++LI VP     P   GVL+  E+ I Y    A   AIPI  R   T+  
Sbjct: 238  WSEP-VDYSANMLIGVPGGQNGP--SGVLVCCEDRIYYKHDKAASLAIPIPRRKGATE-- 292

Query: 207  GRVDADGSRYLLGDHAGLLHL--------LVITHEKEKVTGLKIELLGETSIAST----- 253
               D +  RY++   A  LHL         ++  E   V  L I +  +     T     
Sbjct: 293  ---DPNRKRYIV---ASCLHLAASRKEFFFLLQTEDGDVFKLNITMAVDAQGRQTADPEQ 346

Query: 254  --ISYLDN------------AVVYIGSSYGDSQLIKLN------------------LQPD 281
              + Y D               +YI +  G+SQL  ++                  + PD
Sbjct: 347  IILKYYDTFPIAKQMLLHKKGFLYIATENGNSQLYHVDDLADDLEFEPHNNFTSDGVSPD 406

Query: 282  AKGSY------------VEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRI 329
               +Y              +      L P++   V +L  +   Q+    G       + 
Sbjct: 407  PAEAYEPTYFQPRELTMTHLAVDVPGLHPLMRTKVDNLTGEDAPQIYGIQGTGNKSLFKT 466

Query: 330  VRNGIGI-----NEQASVELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDEL 384
            +R+G+ +     N   +V   GI   W+ +  + D    +L++S       +A ++ D +
Sbjct: 467  IRHGLDVEIMIDNNMGNVPYDGI---WTFKHRSSDEHHKYLIISSSYGDLTVACSIGDSV 523

Query: 385  EETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATA 444
            E+ E   F     T+       + LVQV +  +R +  T     NEW +PP  +V  A+A
Sbjct: 524  EQIENSPFLENRATVHAQQMGDSTLVQVHARGIRSILETG--AFNEWPTPPHRTVAAASA 581

Query: 445  NASQVLLATGGGHLVYLEIG-DGILTEVKH-AQLEYEISCLDINPIGENPSYSQIAAVGM 502
            N  Q+LLA     L +  +G DGIL +++   ++  +I+ + +    +    ++ A VG 
Sbjct: 582  NERQLLLALSSAELAFFFMGEDGILIQLEEMPEMSGKITAISVGQTPKGRQQAKYAVVGC 641

Query: 503  WTDISVRIFSLP-DLNLITKEHLGGEIIPRSV----LLCAFEGISYLLCALG--DGHLLN 555
              D ++R+ S+  D  L  +       +P S+    +L    G +  +  +G   G  L 
Sbjct: 642  -EDCTIRVLSIELDSPLEARSVQALSAVPTSLEVVEMLDPASGTTVNVVHIGLRSGLYLR 700

Query: 556  FLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNL 615
             +++  TGEL D +   LGT+P  L          V A S RP + Y+  +  LY+   L
Sbjct: 701  AIIDETTGELGDVRTKFLGTKPPRLCPVEVDGEDCVLACSSRPWLGYNHPQSKLYTVTPL 760

Query: 616  --KEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQ-KLHIRSIPLGEHPRRICHQEQS 672
              + +     F+S       AI +   L I +I  I+ +L   S+ L   PR +      
Sbjct: 761  IAEPMEAARSFSSPELKGLCAI-QGSSLLIFSIPSIEGRLSHSSVSLQYTPRSMTRNPWY 819

Query: 673  RTFAICSLKNQ--SCAEESEMHFVRLLDDQTFEFISTY------------------PLDT 712
              + +   +    S A   ++    + DD   + +  +                  P+  
Sbjct: 820  PLWYVAQSEGNTLSQATRDQLRGKSIGDDDEAKAMERHLGLPRGNGHWASCIQAVDPIGG 879

Query: 713  FEYGCSI--------LSCSF----SDDSNVYYCVGTA-YVLPEENEPTKGRILVF-IVED 758
             E  C+I        LSC+     S +  VY  VGT  ++ P     T G + ++ +++D
Sbjct: 880  REVTCTIELGENEAALSCACVAFESKNWEVYLAVGTGQHMQPGTGVQTAGYVHIYKLLKD 939

Query: 759  G-KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGH---HG 814
            G +L+ + + + +  VY+L  F G+L   +  ++ +Y     D G + L  +  +     
Sbjct: 940  GAELEFVHKTKFELPVYALMPFRGRLALGVGNELFIY-----DMGMKALLRKARNIAVPN 994

Query: 815  HILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG 874
             I++L  +++G+ I+ GD+ + ++ L+YK     +     D    W +   ++D +   G
Sbjct: 995  QIVSL--ESQGNRIICGDVSEGVTYLVYKPTFNRLIPFVDDTVQRWTTTTTMVDYETAAG 1052

Query: 875  AENNFNLFTVR---KNSEGATDEERG---------------RLEVVGEYHLGEFVNRFRH 916
             +   NL+ VR   + S+ A +E  G               RL++   Y   +     + 
Sbjct: 1053 GDKFGNLWIVRCPEQPSQEADEEGAGGYIMNERSYLNGAPYRLDLRAHYFCQDIPMSMQR 1112

Query: 917  GSLVMRLPDSDVGQIPTVIFGTVNGVIGV-IASLPHEQYLFLEKLQTNLRKVIKGVGGLN 975
             +LV        G    + +  + G +G+ I  +  E   F   L+  +R     + G +
Sbjct: 1113 TALV-------AGGQEVLFWSGLQGTLGILIPFVTREDVEFFTALEQQMRTEDPPLAGRD 1165

Query: 976  HEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            H  +RS+      V  K  +DGDL+E F+ LS     +I+  ++ SV+E+ K+V+E+
Sbjct: 1166 HLMYRSY-----YVPVKGVIDGDLLERFMGLSYDTKQKIAAEVDRSVKEIEKKVQEM 1217


>gi|169599248|ref|XP_001793047.1| hypothetical protein SNOG_02443 [Phaeosphaeria nodorum SN15]
 gi|160704565|gb|EAT90655.2| hypothetical protein SNOG_02443 [Phaeosphaeria nodorum SN15]
          Length = 1246

 Score =  135 bits (339), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 222/1003 (22%), Positives = 397/1003 (39%), Gaps = 156/1003 (15%)

Query: 155  WSQNNLDNGADLLIPVP---PPLCGVLIIGEETIVY-----CSANAFK-AIPIRPSITKA 205
            WS+  +D  A+ L  VP       GVL  G + I Y      ++N  + AIP R   T+ 
Sbjct: 230  WSEP-VDRTANALYRVPGGPNAPSGVLCCGLDNITYRRIFNTNSNVHRLAIPRREGATED 288

Query: 206  YGR-----------VDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTI 254
              R           +      YLL    G +  L +      V  +KI+      +A++I
Sbjct: 289  PNRKRSIVAGTLYSLKGGDFFYLLQTDDGDVFKLTVDAPSGTVERIKIKYFDTIPVATSI 348

Query: 255  SYLDNAVVYIGSSYGDSQLIKLN-----------------LQPDA----------KGSYV 287
              L    VY     GD  L +L                  + P+A            + +
Sbjct: 349  CILRAGFVYAACESGDRILYELESLGDETDDPMFESGQFPVDPEAVFAPPFFKPRALTNL 408

Query: 288  EVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVEL-QG 346
              +E   +L PI+   + +   +   Q+ T +GA    + R  RN + + +     L Q 
Sbjct: 409  TAVETMPSLNPIMGMEIANPMLEDAPQIYTINGAGGRSTFRTTRNALEVLDLIESPLPQN 468

Query: 347  IKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIY 406
               +W+ + + +D  DT +V+     +R L + + +++EE    GF + T TL       
Sbjct: 469  ASDVWTTKLTLEDETDTLIVLCL--HSRTLVLKIGEDVEEASNTGFMADTNTLGVQQFGE 526

Query: 407  NQLVQVTSGSVRLVSS---------TSRELRNEWKSPPGYSVNVATANASQVLLATGGGH 457
            + ++QV    +R +            +     +W+ P   ++     N  QV +A   G 
Sbjct: 527  DCIIQVHPKGIRHIQGIQFPEDGAVATHTSLTDWQPPAHRTIVACATNNRQVAIALSSGQ 586

Query: 458  LVYLEI-GDGILTEVKHA-QLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-P 514
            ++Y E   DG L   +    L+  I+CL I  + E    +   AVG  +D +VRIF+L P
Sbjct: 587  ILYFECDADGSLAMAEEEIALDSTINCLAIPDVPEGSVRAFFMAVGC-SDQTVRIFNLSP 645

Query: 515  DL--NLITKEHLGGEIIPRSVLLC------AFEGIS-YLLCALGDGHLLNFLLNMKTGEL 565
            D+  N++    +     P S L        +  G S +L   L  G  +  +L+  TG++
Sbjct: 646  DMEGNILRSISVQALTSPPSGLTINMMSDKSSRGYSQFLHIGLRSGVYIRSVLDEMTGDI 705

Query: 566  TDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKK--LLYSNVNLKEVSHMCP 623
             D ++  LG +PI     +      + A + RP + Y+  +   L  + +N         
Sbjct: 706  GDTRRRFLGPEPIKFAKVTVAGEPAILAMTSRPWLGYTHPRTGVLQLTPLNYIAFKSAWN 765

Query: 624  FNSAAFPDSLAIAKEGELTIGTIDDI------QKLHIRSIP---LGEHPRRICHQEQSRT 674
            F  + F   + ++   EL I T +D+      + + ++  P   +G H + + +  QS  
Sbjct: 766  FEGSQFKGIICVSAN-ELRIFTFNDLTDNTTYETIQLKYTPRKMVGYHEQGVFYVIQSEN 824

Query: 675  FAICSLKNQSCAEES-------------------EMHFVRLLDDQTFEFISTYPLDTFEY 715
              + +   Q    E+                   E    R   D+       YP     +
Sbjct: 825  NTLSADTRQKLIAEADTKQEDEDDAPMEIEGANGETTNGRATSDELPPTAFGYPRAQGNW 884

Query: 716  GC-----------------------SILSCSF----SDDSNVYYCVGTAYVLP-EENEPT 747
                                     S++S +     S  ++ +  VGTA  L  +    +
Sbjct: 885  ASCIQVVDPVTEKAVTHTVEINANQSLVSAALVYFESRGNDAFLAVGTAKDLSFQPYHYS 944

Query: 748  KGRILVF-IVEDGK-LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRE 805
               I V+ I  DG+ L+   E E      +L AF GK++A + + + LY   +R    R+
Sbjct: 945  SASIQVYKISPDGRELEFFHETEVTEPPTALLAFKGKMVAGVGRNLALYDCGMRS-VLRK 1003

Query: 806  LQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHE--EGAIEERARDYNANWMSA 863
             Q+       I+ L  +T+G  +VV D ++S++ +++K +     +     D      + 
Sbjct: 1004 AQATNCVPTRIVDL--KTQGSRLVVSDSVQSVTYVVHKDQIHPNRMIPFIDDTIQRHTTC 1061

Query: 864  VEILDDDIYLGAENNFNLFTVR---KNSEGATDEERGRLEVVGEYHLGEFVNR------F 914
             E+LD D  +G +   NL+ VR   K SE + +   G   +V + +LG   NR      +
Sbjct: 1062 TEMLDYDTTVGGDKFGNLWLVRCPQKVSESSDESPDGSDILVDKSYLGGTANRLDSVAHY 1121

Query: 915  RHGSLVMRLPDSDV--GQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGV 971
                + + +  + +  G    + +  + G +G +       Q+   ++L+  LR   K +
Sbjct: 1122 FANDIPISIQKTHLLSGAERIIFWAGLQGTLGALIPFNSRRQHKMFQQLELQLRSDDKPM 1181

Query: 972  GGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEI 1014
             G +H  +RS+ N  K+V     +DGDLIE FL LSR + + I
Sbjct: 1182 SGRDHLAFRSYFNPVKSV-----IDGDLIERFLVLSRDKRESI 1219


>gi|124359136|gb|ABD32504.2| CPSF A subunit, C-terminal; WD40-like [Medicago truncatula]
          Length = 632

 Score =  135 bits (339), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 161/647 (24%), Positives = 279/647 (43%), Gaps = 80/647 (12%)

Query: 429  NEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTEVKHAQLEYEISCLDINP 487
            NEW++P   ++    +N  QV++A  GG L+Y E+   G L EV+  ++  +++CLDI P
Sbjct: 18   NEWRTPGKRTIAKVGSNRLQVVIALNGGELIYFEVDVTGQLMEVEKHEMSGDVACLDIAP 77

Query: 488  IGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPRSVLLCAFEGISYLLC 546
            + E    S+  AVG + D ++RI SL PD  + T   LG + +  +     F  +     
Sbjct: 78   VPEGRQRSRFLAVGSY-DKTIRILSLDPDDCMQT---LGIQSLSSASESLLFLEVQ---- 129

Query: 547  ALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNK 606
               +G L   +++M TG L+D +   LG +P  L     +    +   S RP + Y    
Sbjct: 130  -ASNGVLSRTVVDMVTGLLSDSRSPFLGLRPPKLFPIVVRGKRAMLCLSSRPWLGYIHQG 188

Query: 607  KLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDI-QKLHIRSIPLGEHPRR 665
              L + ++ + +     F+S    + +       L I TI+ + +  +   IPL   P +
Sbjct: 189  HFLLTPLSYETLEFAASFSSDQCVEGVVALAGEALRIFTIERLGETFNETVIPLRYTPMK 248

Query: 666  ICHQEQSRTFAIC-SLKNQSCAEESEM--------------HF-------------VRLL 697
               Q + +   +  S +    AEE E               H+             +R+L
Sbjct: 249  FVLQPKRKLLVVIESDQGAFTAEEREAAKKDEDKDDPLSDEHYGYPKAESDKWASCIRIL 308

Query: 698  DDQTFEFISTYPLDTFEYGCSILSCSFSD-DSNVYYCVGTAYVL--PEENEPTKGRILVF 754
            D +T        L   E   S  + +F D +      VGTA  L        T G I ++
Sbjct: 309  DPKTGNTTCLLELQDNEAAFSGCTVNFHDKEYGTLLDVGTAKGLQFTPRRSLTAGFIHIY 368

Query: 755  -IVEDGK-LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGH 812
              +EDG+ L+L+ + + +G   +L+ F G+LLA I   ++ Y     D G R L  +  +
Sbjct: 369  RFLEDGRSLELLHKTQVEGVPLALSQFQGRLLAGIGPVLRFY-----DLGKRRLLRK--Y 421

Query: 813  HGHILA---LYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDD 869
               +     + +QT  D I VGD  +S     Y+ +E  +   A D    W++A   +D 
Sbjct: 422  ENKLFPNTIVSIQTYRDRIYVGDTQESFHYCKYRWDENQLYIFADDCVPRWLTASYHIDF 481

Query: 870  DIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVG 929
            D          +    K  +G  +    ++E + ++H+G+ ++  +  SL+        G
Sbjct: 482  DTMAEDPTGGRI----KWEQGKLNGAPNKVEEIVQFHVGDVISCLQKASLI-------PG 530

Query: 930  QIPTVIFGTVNGVIGVI-ASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKT 988
                ++ GTV G IG + A    +   F   L+ ++R+    + G +H  +RS       
Sbjct: 531  GGECILNGTVMGSIGALHAFTSRDDVDFFSHLEMHMRQDNPPLCGRDHMAYRS-----AY 585

Query: 989  VDAKNFLDGDLIESF----LDLSRTRMDEISKTMNVSVEELCKRVEE 1031
               K+ +DGDL E F    +DL R   DE+ +T      E+ K++EE
Sbjct: 586  FPVKDVIDGDLCEQFPTLPMDLQRKIADELDRTRG----EILKKLEE 628


>gi|218199276|gb|EEC81703.1| hypothetical protein OsI_25307 [Oryza sativa Indica Group]
          Length = 1429

 Score =  134 bits (336), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 146/610 (23%), Positives = 256/610 (41%), Gaps = 94/610 (15%)

Query: 223  GLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDA 282
            G  HLL  + + E V  L  E +        + ++D  +V      GD  +++L      
Sbjct: 414  GEFHLLEFSLDMEGVKVLP-ECVHRGLPCKPLLWMDKGMVVGFVEMGDGMILQLENNRLV 472

Query: 283  KGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASV 342
              S ++      N+ PI+D  + D   + Q Q+  C G   +GSLR++RNG+ + +    
Sbjct: 473  HKSAIQ------NVAPILDLAIADHHGEKQDQMFACCGMCPEGSLRVIRNGVNVEKLLRT 526

Query: 343  E--LQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLF 400
            +    G+ G+W+LR    D + +FLV+SF+ ETRIL++ L    +  +  GF +   TL 
Sbjct: 527  DPIYHGVTGLWTLRMKRTDAYHSFLVLSFVEETRILSVGLSFN-DICDAVGFQTDVCTLA 585

Query: 401  CHDAIYNQLVQVTSGSVRLVSSTS----------RELRNEWKSPPGYSVNVATANASQVL 450
            C       LVQ+ S  V+L   T+            +  +W   P  +++V     + V+
Sbjct: 586  CGLVADGLLVQIHSKCVKLCLPTACAHPEGTLLPSPVCADWY--PDVTISVGAVGHNVVV 643

Query: 451  LATGGGHLVYLEIGDGILT-------EVKHAQLEYEISCLDINPIG---ENPSYSQIAAV 500
            +AT     +Y+ +G   L+       E++H QL YE+SC+ I       +N S S   + 
Sbjct: 644  VATSNPCCLYI-LGVRSLSSFQYELYEIQHVQLHYEVSCISIPQEDWRLDNSSSSCATSG 702

Query: 501  GMWTDIS--VRIFSL-----PDLNLITKE-----------------HLGGEI---IPRSV 533
                D +  +R F++     P +++I+ E                  LG  I   IP +V
Sbjct: 703  DFRKDFAANIRKFAVIGTHEPSVHIISLEPGEAFQQLAVGHISVNNALGTPISGCIPENV 762

Query: 534  LLCAFEGISYLLCALGDGHLLNFLLNMKTGEL----------TDRKKVSL--------GT 575
               A     Y+L  L +G LL F      G            T     SL        G 
Sbjct: 763  RFVAAARF-YILAGLRNGMLLRFESQTSKGHCFPGSFYKESSTPCDDTSLMLIAVRRIGI 821

Query: 576  QPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAI 635
             P+ L     +    +    DRP +++S+   L YS+++    SH+ P +S   P+ L  
Sbjct: 822  TPVVLVPLHDRANADIIVLGDRPWLLHSARHSLAYSSISFLPASHVTPVSSTDCPNGLLF 881

Query: 636  AKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAI--CSLKNQSCAEESEMHF 693
              E  L +  +   ++L+ +   +G  PR++ +   SRT  +    L + SC+ +     
Sbjct: 882  VSENCLHLVEMVHGKRLNAQKFSIGGTPRKVLYHSDSRTLLVLRTGLTSVSCSSD----- 936

Query: 694  VRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAY-----VLPE-ENEPT 747
            +  +D      +S +  +  E    +      +D  +   VGT+      ++P  E E  
Sbjct: 937  IVQIDPSNGALLSRFKCEPGETAKCMQIAKIGNDQVLI--VGTSKSNGRPMMPNGEAESI 994

Query: 748  KGRILVFIVE 757
            KGR+++  +E
Sbjct: 995  KGRLILLSLE 1004


>gi|300176205|emb|CBK23516.2| unnamed protein product [Blastocystis hominis]
          Length = 702

 Score =  133 bits (335), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 117/441 (26%), Positives = 205/441 (46%), Gaps = 53/441 (12%)

Query: 621  MCPFNS------AAFPDS------LAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICH 668
            M P +S      A+ PD          A+   L++G IDD++ +H   IPL  +   +  
Sbjct: 227  MAPLDSPPLCSCASLPDEHEHRFFFVSARSSALSLGEIDDVRSVHAARIPLQHNLLALSV 286

Query: 669  QEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGC-----SILSCS 723
            Q  +++  +   +  +    +    ++L D +T+      PL   E        SI   S
Sbjct: 287  QPATQSLLLLLSRAAASQNAA---LLQLRDPKTYAIRDELPLKPSEIALCVASGSIFPLS 343

Query: 724  FSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKL 783
             + + N  + VGTA+VLPEENEP++GR+LV    + +L+L+AE    G   S+  F GK+
Sbjct: 344  NAPERNEVFVVGTAFVLPEENEPSQGRLLVLRAVEHRLELVAETMLSGGCLSICLFKGKV 403

Query: 784  LAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYK 843
            +  +N ++Q++    +     +L SE      + +L      + I +GD++ S+  ++YK
Sbjct: 404  VCGVNSELQVFDVDEKTSTISKLASEVACIS-VTSLSPNEADETIALGDILYSV--VVYK 460

Query: 844  -------------HEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEG 890
                          E  A E R RD  A  +  +     ++ +G +   NL  ++   E 
Sbjct: 461  LVLEVVRGRQLAQLECIASERRRRDVTA--LERLPEAQSEMVVG-DAYGNLMVMQVVEEA 517

Query: 891  ATDEERGRLEVVGE--YHLGEFVNR------FRHGS----LVMRLPDSDVGQIPTVIFGT 938
              D    +  VV +  +HL + +NR      FR G+       R  +S++     + F T
Sbjct: 518  DLDRSNPQKIVVTKESFHLDDQINRFVPVQLFRSGAEDKKKEKRAEESEIAF--NLAFAT 575

Query: 939  VNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGD 998
            V+G IG+I +L   ++  L  ++T +  VI  VGGL+H+QWR  N      +    +DGD
Sbjct: 576  VSGRIGMIGALNDREFRMLRAIETAMENVITPVGGLDHKQWRCSNTPFGIKNLAYCIDGD 635

Query: 999  LIESFLDLSRTRMDEISKTMN 1019
            L+E FL+L      +I+ +++
Sbjct: 636  LVEMFLELDDESQAKIADSVS 656


>gi|326510951|dbj|BAJ91823.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 1360

 Score =  133 bits (334), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 118/471 (25%), Positives = 207/471 (43%), Gaps = 86/471 (18%)

Query: 295 NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVE--LQGIKGMWS 352
           N+GP++D  + D   + Q Q+ +C G   +GSLR++RNG+ + +    E   QGI G+W+
Sbjct: 480 NVGPMLDLAIADYHGEKQDQLFSCCGMCPEGSLRVIRNGVNVEKLLRTEPIYQGITGLWT 539

Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
           LR    D + +FLV++F+ ETRIL++ L    + ++  GF  +  TL C       LVQ+
Sbjct: 540 LRMKRTDTYHSFLVLAFVEETRILSVGLSFN-DISDAVGFQPEVCTLACGLIADGLLVQI 598

Query: 413 TSGSVRLVSSTS----------RELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLE 462
            S  V+L   TS            +  +W   P  +++V       V +AT     +Y+ 
Sbjct: 599 HSKGVKLCLPTSYAHPEGATLTSPVCVDWY--PDVTISVGAVGHDVVAVATSNPCCLYIL 656

Query: 463 IGDGI------LTEVKHAQLEYEISCLDINPIGENPSYSQIAAVG----------MWTDI 506
               +      L E +H QL+YE+SC+ I P  ++   +  +A+G             ++
Sbjct: 657 RVRPLSSLQYELYETQHVQLQYEVSCISI-PEEDSRLRTPSSAIGGDFRERKGNNSVAEV 715

Query: 507 SVRIFSL-----PDLNLITKE-----------------HLGGEI---IPRSVLLCAFEGI 541
           +VR+F++     P + +I+ E                  LG  +   IP +V + A E  
Sbjct: 716 NVRMFAVIGTHKPSVEVISLEPDEAFRLLSTGSISVNNALGAPVSGCIPENVRVVASERF 775

Query: 542 SYLLCALGDGHLLNFLLNMKTGELTDRKKVS----------------------LGTQPIT 579
            Y+L  L +G LL F    ++G   D+                          +G  P+ 
Sbjct: 776 -YILAGLRNGMLLRF----ESGTSKDQYLPGSFYKESFAPSLNTLLQLVAIRHIGITPVG 830

Query: 580 LRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEG 639
           L        + +   SDR  ++++S   L YS+++    SH+ P +S   P  L    E 
Sbjct: 831 LVPLRDSANSDIIVLSDRSWLLHASRHSLAYSSISFLPASHVIPVSSVDCPSGLLFVAEN 890

Query: 640 ELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAI--CSLKNQSCAEE 688
            L +  +   ++L+ +   +G  PR++ +   SRT  +    L   SC+ +
Sbjct: 891 CLHLVEMVHGKRLNAQKFSIGGTPRKVLYHSDSRTLLVMRTGLTGASCSSD 941


>gi|398407593|ref|XP_003855262.1| hypothetical protein MYCGRDRAFT_69118 [Zymoseptoria tritici IPO323]
 gi|339475146|gb|EGP90238.1| hypothetical protein MYCGRDRAFT_69118 [Zymoseptoria tritici IPO323]
          Length = 1218

 Score =  133 bits (334), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 173/807 (21%), Positives = 340/807 (42%), Gaps = 92/807 (11%)

Query: 296  LGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIK--GMWSL 353
            L P++   V +L  +   Q+    G       + +R+G+ + E  +  +  +    +W+ 
Sbjct: 428  LHPLIKTKVDNLTSEDAPQIYAIQGTGNKSQFKTIRHGLNVEEIINNSMGNVPYDNIWTF 487

Query: 354  RSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVT 413
            +  + D F  +L++S       +A ++ D +E+ E   F     T+         LVQV 
Sbjct: 488  KHRSTDEFHRYLLLSSSYGDLTIACSIGDSVEQIENSSFLENRATVHAQQMGDATLVQVH 547

Query: 414  SGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTEVK 472
            +  +R +  +     NEW +P   +V VA+AN  Q+LLA     L +  +  DGIL +++
Sbjct: 548  ARGIRSILESG--ALNEWPTPAHRTVVVASANERQLLLALSSAELAFFFMQDDGILVQLE 605

Query: 473  H-AQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLP-DLNLITKEHLGGEIIP 530
               ++  +++ L +    +    ++ A VG   D ++R+ S+  D  L  +       +P
Sbjct: 606  EMPEMSGKVTALSVGRTPKGRQQAKYAVVGC-DDCTIRVLSIELDSPLEPRSVQALSAVP 664

Query: 531  RSVLLCAFEG------ISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFS 584
             S+ +           ++ +   L  G  L  +++  TGEL D +   LGT+P  L    
Sbjct: 665  TSLEVVEMHDPSSNSYVNVVHIGLQSGLYLRAVIDETTGELGDVRTKFLGTRPPRLCPVE 724

Query: 585  SKNTTHVFAASDRPTVIYSSNKKLLYSNVNL-------------KEVSHMCPFNSAAFPD 631
              +   V A S RP + Y+  +   Y+   L              ++  +C    ++   
Sbjct: 725  INDQECVLACSSRPWLGYNHPQSQQYTVTPLIAEPIEAARSFISPDLVGLCAIQGSSLLI 784

Query: 632  SLAIAKEGELTIGTID-DIQKLHIRSIPLGEHPRRICHQEQSRTFAIC---SLKNQSCAE 687
                + EG L+  TID +     +   PL  +P     Q    T +      L+ +S  +
Sbjct: 785  MQVPSIEGRLSSSTIDLNYTPRAMDRNPL--YPIWYTAQADGNTLSKAMRDQLRGKSIDD 842

Query: 688  ESEM--------------HF---VRLLDDQTFEFIS-TYPLDTFEYGCSILSCSF-SDDS 728
            + E               H+   ++ +D  T + ++ T  L   E    I    F S   
Sbjct: 843  DEEATALEKHLGVSRGTGHWASCIQAIDPITQKAVTHTIELGENEAALCIACVPFTSRQD 902

Query: 729  NVYYCVGTA-YVLPEENEPTKGRILVF-IVEDG-KLQLIAEKETKGAVYSLNAFNGKLLA 785
             V+  VGT  ++ P  ++P+ G + ++ ++E+G KL+ +   +    +Y+L  FNG+L  
Sbjct: 903  EVFLAVGTGQHMSPGTSQPSTGFVHIYRLLEEGTKLEFVHRTQFDSPIYALCKFNGRLAL 962

Query: 786  AINQKIQLYKWMLRDDGTRELQSEC-GHHGHILALYVQTRGDFIVVGDLMKSISLLIYKH 844
             +  ++ +Y     D G + L  +  G     L  ++ + G+ ++  D+ +S++ ++YK 
Sbjct: 963  GVGNELFIY-----DMGMKHLLRKARGTAVPNLITHITSVGNRLICADVSESVTYVVYKP 1017

Query: 845  EEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATD-EERG------ 897
                +     D    W +A  ++D +   GA+   NL+ VR     +T+ +E G      
Sbjct: 1018 AFNRLIPFVDDTIQRWTTATALVDYETVAGADKFGNLWVVRCPEATSTEADEDGAGGYIM 1077

Query: 898  -----------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVI 946
                       RLE+   +   +     +  SLV        G    + +  + G +G++
Sbjct: 1078 NERSYLGGAPYRLELRSHFFANDIPTSLQRTSLV-------AGGAEVIFWSGLQGTLGML 1130

Query: 947  AS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLD 1005
               +  E   F   L++ +R+    + G +H  +RS+      V  K  +DGDL E +L 
Sbjct: 1131 VPFVSREDVEFFTALESQMRQEDPPLAGRDHLMYRSY-----YVPVKGVIDGDLCERYLG 1185

Query: 1006 LSRTRMDEISKTMNVSVEELCKRVEEL 1032
            L      +I+  ++ SV+E+ K+V+E+
Sbjct: 1186 LREDVKGKIAAEVDRSVKEVEKKVQEM 1212


>gi|326519701|dbj|BAK00223.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 1360

 Score =  132 bits (333), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 118/471 (25%), Positives = 207/471 (43%), Gaps = 86/471 (18%)

Query: 295 NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVE--LQGIKGMWS 352
           N+GP++D  + D   + Q Q+ +C G   +GSLR++RNG+ + +    E   QGI G+W+
Sbjct: 480 NVGPMLDLAIADYHGEKQDQLFSCCGMCPEGSLRVIRNGVNVEKLLRTEPIYQGITGLWT 539

Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
           LR    D + +FLV++F+ ETRIL++ L    + ++  GF  +  TL C       LVQ+
Sbjct: 540 LRMKRTDTYHSFLVLAFVEETRILSVGLSFN-DISDAVGFQPEVCTLACGLIADGLLVQI 598

Query: 413 TSGSVRLVSSTS----------RELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLE 462
            S  V+L   TS            +  +W   P  +++V       V +AT     +Y+ 
Sbjct: 599 HSKGVKLCLPTSYAHPEGATLTSPVCVDWY--PDVTISVGAVGHDVVAVATSNPCCLYIL 656

Query: 463 IGDGI------LTEVKHAQLEYEISCLDINPIGENPSYSQIAAVG----------MWTDI 506
               +      L E +H QL+YE+SC+ I P  ++   +  +A+G             ++
Sbjct: 657 RVRPLSSLQYELYETQHVQLQYEVSCISI-PEEDSRLRTPSSAIGGDFRERKGNNSVAEV 715

Query: 507 SVRIFSL-----PDLNLITKE-----------------HLGGEI---IPRSVLLCAFEGI 541
           +VR+F++     P + +I+ E                  LG  +   IP +V + A E  
Sbjct: 716 NVRMFAVIGTHKPSVEVISLEPDEAFRLLSTGSISVNNALGAPVSGCIPENVRVVASERF 775

Query: 542 SYLLCALGDGHLLNFLLNMKTGELTDRKKVS----------------------LGTQPIT 579
            Y+L  L +G LL F    ++G   D+                          +G  P+ 
Sbjct: 776 -YILAGLRNGMLLRF----ESGTSKDQYLPGSFYKESFAPSLNTLLQLVAIRHIGITPVG 830

Query: 580 LRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEG 639
           L        + +   SDR  ++++S   L YS+++    SH+ P +S   P  L    E 
Sbjct: 831 LVPLRDSANSDIIVLSDRSWLLHASRHSLAYSSISFLPASHVIPVSSVDCPSGLLFVAEN 890

Query: 640 ELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAI--CSLKNQSCAEE 688
            L +  +   ++L+ +   +G  PR++ +   SRT  +    L   SC+ +
Sbjct: 891 CLHLVEMVHGKRLNAQKFSIGGTPRKVLYHSDSRTLLVMRTGLTGASCSSD 941


>gi|412993390|emb|CCO16923.1| predicted protein [Bathycoccus prasinos]
          Length = 1273

 Score =  132 bits (331), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 167/737 (22%), Positives = 294/737 (39%), Gaps = 109/737 (14%)

Query: 7   YGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDNG 66
           +G I ++  FR  G  +D + I ++  +  VL++DAE ++              R    G
Sbjct: 58  FGIIRSMAAFRLAGANKDHVVIGSDSGRIVVLRFDAEKNQFEQVHKETFGKSGVRRVVPG 117

Query: 67  QIGIIDPDCRLIGLHLYDGLFKVI---------------PFD-NKGQLKEAFNI-----R 105
           Q   +DP  R + L   +   KV+               P + NKG+   AF++      
Sbjct: 118 QFCCVDPKGRAVMLAALEKQ-KVVYVLNRDSQANLTISSPLEANKGKTI-AFDVCALDNG 175

Query: 106 LEELQVLDIKFLYGCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGAD 165
           LE    + I+  YG A   +    +   +A+ + TY       + V   WS+  +DNGA+
Sbjct: 176 LENPGFMAIELFYGEAD--VDDTGEAAAEAQKMLTYYELDLGLNHVARKWSEP-VDNGAN 232

Query: 166 LLIPVP---PPLCGVLIIGEETIVYCSANAFK---AIPIRPSITKAYGRVDA-------- 211
            LIPVP       GV++  E  +VY +    +    IP R S+    G + A        
Sbjct: 233 KLIPVPGGSDGPGGVIVCCENFLVYRAEKHEEIRCVIPRRTSLDAERGVLIASFASHRSK 292

Query: 212 DGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDS 271
           +G  ++     G  + + +     KV+ LK++      + S +  L    ++ GS +G  
Sbjct: 293 NGFFFIAQSEYGDCYKVTLDWTNRKVSELKMKYFDTVPVCSALCVLKTGFLFCGSEFGAH 352

Query: 272 QLIKL-----------------------------------------NLQP----DAKGSY 286
            L +                                          N QP      K   
Sbjct: 353 ALFQFIALGDDEESAESSSKTLKKIDNATKKKGRGKNDDEDDEEEDNFQPVFFNPRKLKN 412

Query: 287 VEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG 346
           + +++   +L P +        ++   +++   G     + R++R G+ ++E A   L G
Sbjct: 413 LALIDEIESLSPTIALASNISRKEETPRLLALCGQGPRSTFRVLRQGVPLSEMARSPLPG 472

Query: 347 -IKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCH--- 402
              G++++R S  D  D ++VVSF + T +L++   D +EE    G  + + TL      
Sbjct: 473 NPNGVFTIRKSKSDTTDAYIVVSFTNATLVLSIG--DTVEEVTDTGILATSSTLAVSALG 530

Query: 403 DAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLE 462
           D     L+Q+    VR V   ++ + NEW++P    +     N  Q ++A  GG LVY E
Sbjct: 531 DDDDGSLIQIHPSGVRHVRGNNKGV-NEWRAPGRKKITACACNRGQAIVALTGGELVYFE 589

Query: 463 IGD-GILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLIT 520
           + + G L E++  +   E+  +DI PI +    ++ AAV  + D +VR+ SL P   L T
Sbjct: 590 LDEAGQLLEIEKIETSSEVVSVDIPPIPDGSLRAKFAAVAGY-DSTVRVLSLNPGEALRT 648

Query: 521 KEHLGGEIIPRSVLLCAFEGIS--------------YLLCALGDGHLLNFLLNMKTGELT 566
                    P SVLL      +              +L   L +G L+   ++  +G L+
Sbjct: 649 VGVQATPSPPESVLLLEVNNKTTKNKKAGSSAPPSMFLNVGLSNGILVRCEVDRVSGALS 708

Query: 567 DRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNS 626
           D +   +G +P  L        +   A S RP + ++ N +        + +   C F S
Sbjct: 709 DARSRFVGQRPPKLNRIEMNEESGFVALSTRPWLGFNENGRFSIVPACHETIDRACGFAS 768

Query: 627 AAFPDSLAIAKEGELTI 643
              P+ +    +G L I
Sbjct: 769 EQVPEGIVAVVDGSLRI 785



 Score = 65.5 bits (158), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 67/301 (22%), Positives = 122/301 (40%), Gaps = 46/301 (15%)

Query: 761  LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTREL--QSECGHHGHILA 818
            L+L+ E   +GA ++L  F GKLL  ++  ++LY     D G ++L  ++EC        
Sbjct: 985  LELVHETPCEGAPFALRNFEGKLLVGVDDVLRLY-----DFGKKKLLRKAECAQKFPSFI 1039

Query: 819  LYVQTRGDFIVVGDLMKSISLLIYKHE---EGAIEERARDYNANWMSAVEILDDDIYLGA 875
              ++  GD   V D  +S   + Y  E   E ++   A D    +++++  LD D    +
Sbjct: 1040 NDIRCSGDRFFVTDACESAFFVKYVREDDQECSMHIFADDIAPRYVTSMLPLDRDTVAVS 1099

Query: 876  ENNFNLFTVRKNSE-----------------------GATDEERGRLEVVGEYHLGEFVN 912
            +   N   +R   +                       GA +    +L+   ++H+G+ + 
Sbjct: 1100 DKFGNFAALRLPKDVSDEIESDISGGKHAALTSSAALGALNGANNKLQACAQFHVGDVIC 1159

Query: 913  RFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLP-HEQYLFLEKLQTNLRKVIKGV 971
                 +L         G    +++ T+ G +G        ++  F   L+ +LR     V
Sbjct: 1160 SLTKCAL-------QTGGSEVIVYATLGGALGAFVPFASKDEADFCTHLEMHLRIEAPPV 1212

Query: 972  GGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEE 1031
             G  H  +RS          K  +DGDL E F  L       IS+ M+ +  E+ KR+E+
Sbjct: 1213 LGNEHGAFRS-----SYFPVKAVVDGDLCEQFGRLGADAQRRISEEMDRTPSEIVKRLEQ 1267

Query: 1032 L 1032
            +
Sbjct: 1268 I 1268


>gi|33146591|dbj|BAC79787.1| putative Splicing factor 3B subunit 3 [Oryza sativa Japonica Group]
 gi|222636635|gb|EEE66767.1| hypothetical protein OsJ_23488 [Oryza sativa Japonica Group]
 gi|429459546|gb|AFZ84679.1| spotted leaf 5 [Oryza sativa Japonica Group]
          Length = 1355

 Score =  132 bits (331), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 146/610 (23%), Positives = 254/610 (41%), Gaps = 94/610 (15%)

Query: 223  GLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDA 282
            G  HLL  + + E V  L  E +        + ++D  +V      GD  +++L      
Sbjct: 414  GEFHLLEFSLDMEGVKVLP-ECVHRGLPCKPLLWMDKGMVVGFVEMGDGMILQLENNRLV 472

Query: 283  KGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASV 342
              S ++      N+ PI+D  + D   + Q Q+  C G   +GSLR++RNG+ + +    
Sbjct: 473  HKSAIQ------NVAPILDLAIADHHGEKQDQMFACCGMCPEGSLRVIRNGVNVEKLLRT 526

Query: 343  E--LQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLF 400
            +    G+ G+W+LR    D + +FLV+SF+ ETRIL++ L    +  +  GF +   TL 
Sbjct: 527  DPIYHGVTGLWTLRMKRTDAYHSFLVLSFVEETRILSVGLSFN-DICDAVGFQTDVCTLA 585

Query: 401  CHDAIYNQLVQVTSGSVRLVSSTS----------RELRNEWKSPPGYSVNVATANASQVL 450
            C       LVQ+ S  V+L   T+            +  +W   P  +++V     + V+
Sbjct: 586  CGLVADGLLVQIHSKCVKLCLPTACAHPEGTLLPSPVCADWY--PDVTISVGAVGHNVVV 643

Query: 451  LATGGGHLVYLEIGDGILT-------EVKHAQLEYEISCLDINPIG---ENPSYSQIAAV 500
            +AT     +Y+ +G   L+       E++H QL YE+SC+ I       +N S S   + 
Sbjct: 644  VATSNPCCLYI-LGVRSLSSFQYELYEIQHVQLHYEVSCISIPQEDWRLDNSSSSCATSG 702

Query: 501  GMWTDIS--VRIFSL-----PDLNLITKE-----------------HLGGEI---IPRSV 533
                D +  +R F++     P +++I+ E                  LG  I   IP +V
Sbjct: 703  DFRKDFAANIRKFAVIGTHEPSVHIISLEPGEAFQQLAVGHISVNNALGTPISGCIPENV 762

Query: 534  LLCAFEGISYLLCALGDGHLLNFLLNMKTGEL----------TDRKKVSL--------GT 575
               A     Y+L  L +G LL F      G            T     SL        G 
Sbjct: 763  RFVAAARF-YILAGLRNGMLLRFESQTSKGHCFPGSFYKESSTPCDDTSLMLIAVRRIGI 821

Query: 576  QPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAI 635
             P+ L     +    +    DRP ++ S+   L YS+++    SH+ P +S   P  L  
Sbjct: 822  TPVVLVPLHDRANADIIVLGDRPWLLQSARHSLAYSSISFLPASHVTPVSSTDCPSGLLF 881

Query: 636  AKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAI--CSLKNQSCAEESEMHF 693
              E  L +  +   ++L+ +   +G  PR++ +   SRT  +    L + SC+ +     
Sbjct: 882  VSENCLHLVEMVHGKRLNAQKFSIGGTPRKVLYHSDSRTLLVLRTGLTSVSCSSD----- 936

Query: 694  VRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAY-----VLPE-ENEPT 747
            +  +D      +S +  +  E    +      +D  +   VGT+      ++P  E E  
Sbjct: 937  IVQIDPSNGALLSRFKCEPGETAKCMQIAKIGNDQVLI--VGTSKSNGRPMMPNGEAESI 994

Query: 748  KGRILVFIVE 757
            KGR+++  +E
Sbjct: 995  KGRLILLSLE 1004


>gi|19075704|ref|NP_588204.1| silencing protein Rik1 [Schizosaccharomyces pombe 972h-]
 gi|13124796|sp|Q10426.2|RIK1_SCHPO RecName: Full=Chromatin modification-related protein rik1; AltName:
            Full=Silencing protein rik1
 gi|6016996|emb|CAB57850.1| silencing protein Rik1 [Schizosaccharomyces pombe]
          Length = 1040

 Score =  131 bits (330), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 226/1060 (21%), Positives = 428/1060 (40%), Gaps = 127/1060 (11%)

Query: 4    VPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPT 63
            +P++  +  +  +RP G  +D+LF+      +  + WD +  ++I      V  R+  P 
Sbjct: 53   IPLFSTVKHIRPYRPPGLDRDYLFVVLNDDTYFSIYWDEDYQKVIVDH-PPVRYRVTFPW 111

Query: 64   DNGQIGIIDPDCRLIGLHLYDGLFKVI---------PFDNKGQLKEAFNIRLEELQVLDI 114
            +         D R+  + L      +I                +   F        + D+
Sbjct: 112  NRNAKSYCLVDLRMRAIFLSIDEISMICIRILSAEERLKTGRSIDSGFPFSFPVHLIYDM 171

Query: 115  KFLYGCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPL 174
              L   + PT+VVL+ D  D  +V  + + L  K          +L  G  L   V P +
Sbjct: 172  CILNDSSTPTLVVLHSDGLDC-YVTAFLLDLSSK----------SLGKGIRLFERVKPSM 220

Query: 175  CGV-----LIIGEETIVYCSANA-FKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLL 228
                    L++ E   ++C     F  I    +    +  +      Y++ D  G L  +
Sbjct: 221  IMPFGKRGLLVFESLFIHCMYRGNFVTINGPCTTYMHWTPLKGQKMHYIVCDTNGYLFGV 280

Query: 229  VITHEKEKVTGLKIELLG--ETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSY 286
              +   +    L +E L        ++++ +   +++IGS   +S+LI L+   D     
Sbjct: 281  YSSILGKNKWSLVMERLPIPPFDFITSLNSIHEGLLFIGSKNSESKLINLSTLKD----- 335

Query: 287  VEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG 346
               ++   NLGPI D  V  L+   +   + C+G  ++ SL   ++ + ++     ++ G
Sbjct: 336  ---VDSIPNLGPIHDLLV--LKNDIEKSFLVCAGTPRNASLIYFQHALKLDILGQTKISG 390

Query: 347  IKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIY 406
            I     L S  +      L + F SET  +A N++++ +  E++   S  +         
Sbjct: 391  ILRAMVLPSYPEHK----LFLGFPSET--VAFNIKEDFQ-LELDPSLSTKERTIALSGTN 443

Query: 407  NQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDG 466
             + VQVTS  + +  S  R      +       N A       ++  G    ++ +    
Sbjct: 444  GEFVQVTSTFLCIYDSAKRSRLVYIEK----ITNAACYQEYSAIVINGTALAIFKKD--- 496

Query: 467  ILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLIT-KEHLG 525
              TEV     E EISCLD +      +  QI  VG W+   V I +  D + I+      
Sbjct: 497  --TEVARKVFESEISCLDFS------AQFQIG-VGFWSK-QVMILTFSDNSSISCAFQTN 546

Query: 526  GEIIPRSVLLCAFEGI----SYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLR 581
               +PR+++L   EG+    + LL + G G   +++L       ++ K    GT P++ R
Sbjct: 547  VPSLPRNIIL---EGVGVDRNLLLVSSGSGEFKSYVLFKNNLVFSETKH--FGTTPVSFR 601

Query: 582  TFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGEL 641
             F+    T++   +D P ++Y  N  L Y  +++ +   +C F   +  D L     G L
Sbjct: 602  RFTMNIGTYIICNNDCPHMVYGFNGALCYMPLSMPQSYDVCQFRDNSGKDFLISVSLGGL 661

Query: 642  TIGTIDDIQKLHIRSIPLGEHP-RRICHQEQSRTFAICSLKN-----QSCAEESEMHFVR 695
                ++ + +L  R + L   P + I  Q +     + +L+N     +S  E   +  V 
Sbjct: 662  KFLQLNPLPELTPRKVLLEHVPLQAIIFQNK---LLLRTLENRYEDYESYKENYHLELVD 718

Query: 696  LLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILV-- 753
              DD +F   S    +  E    I   S          VGT+ +  ++  P  GR+++  
Sbjct: 719  SYDDNSFRVFSFTENERCEKVLKINESSL--------LVGTSIIEQDKLVPVNGRLILLE 770

Query: 754  FIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHH 813
            F  E   L++++      AV  L  +N + + A  Q++ + K     +    + S     
Sbjct: 771  FEKELQSLKVVSSMVLSAAVIDLGVYNDRYIVAFGQQVAIVKL---TEERLMIDSRISLG 827

Query: 814  GHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYL 873
              +L L V+  G+ I + D +   +++ +  ++  +  R   +  N + A  + +  +Y+
Sbjct: 828  SIVLQLIVE--GNEIAIADSIGRFTIMYFDGQKFIVVARYL-FGENIVKAA-LYEGTVYI 883

Query: 874  GAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQI-- 931
             A N+  L  +R N +     +R   E V  YHL + V++F++       P ++      
Sbjct: 884  IATNSGLLKLLRYNKDAKNFNDRFICESV--YHLHDKVSKFQN------FPITNTNSFLE 935

Query: 932  PTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKV-IKGVGGLNHEQWRSFNNEKKTVD 990
            P ++F T  G IG I SL  ++ L LE+L   +RK+    +  +++E       E   + 
Sbjct: 936  PKMLFATEIGAIGSIVSLKDKE-LELEELTRKIRKLKFSYLSSMDYESI-----EADLIS 989

Query: 991  AKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVE 1030
               F+DGDL+   +D+ R    E+ +        LC+ VE
Sbjct: 990  PVPFIDGDLV---IDVKRWASSELFR--------LCRSVE 1018


>gi|302820387|ref|XP_002991861.1| hypothetical protein SELMODRAFT_448595 [Selaginella moellendorffii]
 gi|300140399|gb|EFJ07123.1| hypothetical protein SELMODRAFT_448595 [Selaginella moellendorffii]
          Length = 1292

 Score =  131 bits (330), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 121/475 (25%), Positives = 215/475 (45%), Gaps = 73/475 (15%)

Query: 269 GDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLR 328
           GD  ++K+    D + S+   ++   N+ PI+DF +VD   + Q Q+  C G  ++GS+R
Sbjct: 413 GDGHVLKVE---DGQLSFQSFVQ---NIAPILDFSLVDYYGEKQDQMFACCGGDEEGSVR 466

Query: 329 IVRNGIGINEQASVE--LQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEE 386
           I+RNG  + +        QG+ G+W++R    DP+  F ++SF+ ETR+L++ L + ++ 
Sbjct: 467 IIRNGNSVEKLICTPPVYQGVSGIWTMRYRFKDPYHAFFLISFVEETRVLSVGL-NFVDI 525

Query: 387 TEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSST----------SRELRNEWKSPPG 436
           T+  GF SQ  TL C       + QV    V+L S T          S  L   W+  PG
Sbjct: 526 TDAVGFESQVNTLACGLVEDGWVAQVWRYEVKLCSPTKAAHPAGVSGSNPLSTTWRK-PG 584

Query: 437 YSVNVATANASQVLLATGGGHLVYL----EIGDGI--LTEVKHAQLEYEISCLDI----- 485
           Y ++V     S+V+LA     L+ +    +  DG   L E++   +E EISC+ I     
Sbjct: 585 YPISVGAVCRSRVILALARPGLLLMLGATQTSDGSFELVELQLCMMEAEISCISIPQGDI 644

Query: 486 --------------NPIGENPSYSQIAAVGMWTDISVRI-----------FSLPDLNLIT 520
                         N +        +  VG     SV +           F++  ++LIT
Sbjct: 645 SIPVPPTIAGLHAGNTVPAGLDLGNVCVVGTHKP-SVELLSIVPGDKFAPFAVGQVSLIT 703

Query: 521 KEHLGGEI---IPRSVLLCAFEGISYLLCALGDGHLLNF--------LLNMKTGELTDRK 569
              +G  +   IP+ V L  F+ + Y+L  L +G LL +        L+  K  +L    
Sbjct: 704 S--VGTAVSGCIPQDVRLALFDRL-YILAGLRNGMLLRYEWPEDTPSLVLSKPAQLHLVA 760

Query: 570 KVSLGTQPITLRTF-SSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAA 628
              LG  P+ L    +      + A SDRP ++  + K++ Y++++ +  +H  P  S  
Sbjct: 761 ARRLGVSPVCLVPLEACALRADIIALSDRPWLLQMA-KRISYTSISFQPSTHATPVCSKD 819

Query: 629 FPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQ 683
            P  +    +  L +  ++  + L+++ + LG   RR+ +  +S    +  L ++
Sbjct: 820 CPKGIIFLADCSLHLVEMEQSRTLNVQKLHLGCTGRRVLYHPESGVLIVLRLLSE 874



 Score = 55.8 bits (133), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 51/201 (25%), Positives = 82/201 (40%), Gaps = 15/201 (7%)

Query: 821  VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGA--ENN 878
            +  R   I VGD    + L  Y+ +   +E    D     +S   ++D D  +    + N
Sbjct: 1049 ISVRFTTIAVGDCRDGVLLFTYREDSKKLEPIRCDPMRRLVSDCTLVDVDTAVVVDRQGN 1108

Query: 879  FNLFTVRKNSEGATDEERG----RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQ---I 931
            F   +  + +EG  D   G     LE    +H+GE     R GS   +  +        I
Sbjct: 1109 FCALSANEETEGKCDSNSGSPEKNLEAHCWFHIGEVCTTVRKGSFAFKAVEDSCSVDRLI 1168

Query: 932  P-----TVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEK 986
            P      VI  T+ G + +   +  E+Y  L+ LQ  L  +      L ++  R F  + 
Sbjct: 1169 PNMGKSCVIATTLLGSVFIFVRMTGEEYSLLQALQRRLSVLPATAPVLGNDHAR-FRGQG 1227

Query: 987  KTVDAKNFLDGDLIESFLDLS 1007
            +    K  LDGDL+E FL+L+
Sbjct: 1228 RPAGVKEVLDGDLLEQFLELT 1248


>gi|340367935|ref|XP_003382508.1| PREDICTED: splicing factor 3B subunit 3-like isoform 2 [Amphimedon
           queenslandica]
          Length = 1160

 Score =  129 bits (324), Expect = 8e-27,   Method: Compositional matrix adjust.
 Identities = 163/758 (21%), Positives = 322/758 (42%), Gaps = 104/758 (13%)

Query: 1   MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
           ++   ++G I +L  FR  G  +D++ + ++  +  +L++D   + +  +   +   + G
Sbjct: 53  LVSTEVFGLIRSLITFRLTGSTKDYIVLGSDSGRIVILEYDPVKN-VFEKVHQETYGKSG 111

Query: 61  -RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVL----- 112
            R    GQ   +DP  R  +IG      L  ++  D+  +L  +  +   +   L     
Sbjct: 112 CRRIVPGQYLAVDPKGRALMIGAVEKQKLVYILNRDSAARLTISSPLEAHKSHTLVYHVV 171

Query: 113 --DIKF---LYGCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNN-------- 159
             D+ F   L+ C +    + Y+++ +    +  +VA ++  + E     N+        
Sbjct: 172 GVDVGFENPLFACLE----LDYEESDNDHTGEAAQVAQQNLTYYELDLGLNHVVRKYSEP 227

Query: 160 LDNGADLLIPVPPPL---CGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVD-ADGSR 215
           LD+ A++LI VP       GVL+  E  I+Y   N      IR  I +    +D AD S 
Sbjct: 228 LDDMANMLITVPGGTDGPSGVLVCCENYIIY--KNFGDQPDIRCPIPRRQNDLDDADRSM 285

Query: 216 ---------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNA 260
                          +LL    G +  +++  E + V  +KI+      + S++  L + 
Sbjct: 286 LIITSAMHKTKHLFFFLLQSEQGDIFKVMLDVEDDIVVRIKIKYFDTVPVCSSLCVLKSG 345

Query: 261 VVYIGSSYGDSQLIKL-NLQPDAKGSY---------------------VEVLERYVNLGP 298
           +++  S +G+  L ++ +L  D++  +                     + +++   +L P
Sbjct: 346 LLFCSSEFGNHYLYQIAHLGDDSEREFHSDYPLEEGETFFFGPRPLKNLILVDEMESLSP 405

Query: 299 IVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSST 357
           I+   + DL  +   Q+    G     SLR++R+G+ + E A  EL G    +W+++  +
Sbjct: 406 IMSCQIADLANEDTPQLYAACGRGPRSSLRVLRHGLEVTEMAVSELPGNPHAVWTVKKDS 465

Query: 358 DDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSV 417
            + FD+++VVSF++ T IL++   + +EE    GF   T TL C     + L+Q+    +
Sbjct: 466 KEDFDSYIVVSFMNATLILSIG--ETVEEVTDSGFLGTTPTLACSQLGDDALIQIYPEGI 523

Query: 418 RLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKHAQ 475
           R + S  R   NEW+SP    +     N  QV++A  GG +VY E+   G L E  +  +
Sbjct: 524 RHIRSDKR--VNEWRSPGKRLIRQCAVNEHQVVIALSGGEIVYFEMDQSGQLNEYTERKE 581

Query: 476 LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLL 535
           +  E+ C+ +  +       +  AVG+ +D +VRI SL   +  T + L  + +P     
Sbjct: 582 MTAEVICISLGSVPPGQQRCRFLAVGL-SDQTVRIISLDPHD--TLQPLSMQALP----- 633

Query: 536 CAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAAS 595
                +   LC          ++NM +G ++D   VS G   + +          V A S
Sbjct: 634 ----ALPESLC----------IVNM-SGNVSDDTTVSTGGLFLNI----GLQNGGVIAVS 674

Query: 596 DRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIR 655
            R  + Y+   +   + ++   + +   F S   P+ +       L I  ++ +  +  +
Sbjct: 675 SRTWLNYTYQSRFHLTPLSYDLLEYASSFTSEQCPEGMVAISSNTLRILALEKLGVVFNQ 734

Query: 656 -SIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMH 692
            S PL   PR++          +    + +  E +++H
Sbjct: 735 VSTPLQYTPRKLLIHPPMSNLVLIETDHNAFTEATKLH 772



 Score = 66.2 bits (160), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 76/345 (22%), Positives = 150/345 (43%), Gaps = 60/345 (17%)

Query: 714  EYGCSILSCSFSDDSNV--YYCVGTAY-VLPEENEPTKGRILVF-IVEDG-KLQLIAEKE 768
            E   S+  C F    ++  +  VGTA  ++      + G ++VF +  DG KL+ +   +
Sbjct: 844  EAAFSLAVCQFVSKGDLEWFVVVGTAKDMIITPRAISSGSLIVFRLSPDGSKLEHVHTTQ 903

Query: 769  TKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSEC-GHHGHILALYVQTRGDF 827
                  ++  F G+LL  + + +++Y     D G +++  +C   H   L + ++  G  
Sbjct: 904  LDDVPIAMAPFQGRLLVGVGKLLRIY-----DIGKKKMLRKCENKHLPYLVVDIKVMGRR 958

Query: 828  IVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKN 887
            + VGD+ +++  L Y+  E  +   A +    + +   ILD +    A+   N+  +R  
Sbjct: 959  VYVGDVQEAVHFLYYRPHENQLVIFADEVVPRFCTTSCILDYNTVASADKFGNITILRLP 1018

Query: 888  SEGATDE------------ERGRLE-------VVGEYHLGEFVNRFRHGSLVMRLPDSDV 928
            S+  TD+            +RG L        V+  YH+GE +N     SL+        
Sbjct: 1019 SD-VTDQVDEDPSGSRSLWDRGFLNGATQKANVMTSYHVGEGINTLHKVSLI-------P 1070

Query: 929  GQIPTVIFGTVNGVIGVIASLP-HEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKK 987
            G    +++ T++G IG++      E   F + L+ ++R            +W + +    
Sbjct: 1071 GGSEVLVYTTLSGSIGILVPFSSKEDSDFFQHLEMHMRS-----------EWSNLS---- 1115

Query: 988  TVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
                   +DGDL E +  L  ++  EI+  ++ S  E+ K++E+L
Sbjct: 1116 ------VIDGDLCEVYNSLDPSKRREIALDLDRSPSEVAKKLEDL 1154


>gi|308812436|ref|XP_003083525.1| UV-damaged DNA binding protein 1 (ISS) [Ostreococcus tauri]
 gi|116055406|emb|CAL58074.1| UV-damaged DNA binding protein 1 (ISS), partial [Ostreococcus
           tauri]
          Length = 265

 Score =  128 bits (321), Expect = 2e-26,   Method: Composition-based stats.
 Identities = 62/103 (60%), Positives = 73/103 (70%)

Query: 868 DDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSD 927
           DD+ YLGAEN+ NLFTV +N    TDEER RLE+ GEYHLGE VN F  GSLVM L D +
Sbjct: 2   DDETYLGAENSLNLFTVSRNVNAVTDEERSRLEITGEYHLGELVNAFAPGSLVMSLRDGE 61

Query: 928 VGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKG 970
              +PT++FGT NGVIGV+ASLP + Y F E+LQ     VI G
Sbjct: 62  SLSVPTLLFGTANGVIGVLASLPKDVYEFTERLQALGEFVICG 104


>gi|221061705|ref|XP_002262422.1| splicing factor 3b, subunit 3, 130kd [Plasmodium knowlesi strain H]
 gi|193811572|emb|CAQ42300.1| splicing factor 3b, subunit 3, 130kd, putative [Plasmodium knowlesi
           strain H]
          Length = 1276

 Score =  127 bits (320), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 150/698 (21%), Positives = 292/698 (41%), Gaps = 81/698 (11%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
           I+G I  L+ FR  G  +D++ I ++  +  +LQ+  E ++ + R   +   + G R   
Sbjct: 58  IFGIIRCLQTFRLTGSNKDYVVIGSDSGRLVILQFSNEKNDFV-RVHCETYGKSGLRRII 116

Query: 65  NGQIGIIDPDCRLIGLHLYD--GLFKVIPFDNKGQLKEAFNIRLEELQVL--DIKFL-YG 119
            G+   +DP  R + +   +      ++  DNK QL  +  +   +   +  D+  +  G
Sbjct: 117 PGEYIAVDPKGRALMICAIERQKFVYILNRDNKEQLTISSPLDAHKSHTICHDVVGMDVG 176

Query: 120 CAKPTIVVLYQDNK--DARHVKTYEVALKDKDFVEGPWSQN------------NLDNGAD 165
              P    + Q+ +  D +   T E+    +  +   W  +             +D  A 
Sbjct: 177 FENPMFASIEQNYEMYDKQVTNTTEIDACTRKTLLCLWEMDLGLNHVIRKHTLPIDMSAH 236

Query: 166 LLIPVP---PPLCGVLIIGEETIVYCSA---NAFKAIPIR------PSITKAYGRVDADG 213
           LLIP+P       GV++  +  +VY      + + A P R       +I+     V    
Sbjct: 237 LLIPIPGGQQGPSGVIVCCDNYLVYKKVEHVDVYCAYPRRLETGQEKNISIVCSTVHRIR 296

Query: 214 SRY--LLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYG-- 269
             +  L+    G L+ + + H+   V  +  +      +A+ I  + +  +++ + +G  
Sbjct: 297 KFFFILIQSEYGDLYKIEMDHQDGVVKEITCKYFDTVPVANAICVMKSGSLFVAAEFGNH 356

Query: 270 -----------DSQLIKLNLQPDAKGSYVE----------VLERYVNLGPIVDFCVVDLE 308
                      D++ +  +  P  + + +           ++++  +L PI+D  ++D +
Sbjct: 357 FFYQFSGIGDDDNEAMCTSKHPSGRNAIIAFRTKKLTNLFLIDQVYSLSPILDMKILDAK 416

Query: 309 RQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTDDPFDTFLVV 367
                Q+    G     SLRI+++G+ I E A  EL G  K +W+++      +D +++V
Sbjct: 417 NANSPQIYALCGRGPRSSLRILQHGLSIEELADNELPGRPKYIWTIKKDNASDYDGYIIV 476

Query: 368 SFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSREL 427
           SF   T IL +   + +EE       +   T+  +    N L+QV    +R ++    ++
Sbjct: 477 SFEGSTLILEIG--ETVEEVVDTLLLTNVTTIHVNILYDNSLIQVHDTGIRHING---KV 531

Query: 428 RNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDG-ILTEVKHAQLEYEISCLDIN 486
            NEW  P    V  AT+NA+Q++++  GG L+Y EI +   L E+    L  EI CL I 
Sbjct: 532 INEWVPPKNKQVKAATSNATQIVISLSGGELIYFEIDESHSLVEIFRKSLNVEILCLSIQ 591

Query: 487 PIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVL---LCAFE---- 539
            + EN   +   AVG   ++ VR+ S+       +  L   I+P +     +C  E    
Sbjct: 592 EVEENKVRANFLAVGCLDNV-VRLLSIEKEKYFNQ--LSTFILPNNSSAQDICISEMCEL 648

Query: 540 ------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFA 593
                  + +L   L +G LL  +++  TG LT+     LG + + +     K    +  
Sbjct: 649 GNDKERKLLFLNIGLNNGVLLRSVVDPITGTLTNHYSKYLGAKNVKICPVHVKKNPALLV 708

Query: 594 ASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPD 631
             ++  + Y    K +YS +N   + +   F+S    D
Sbjct: 709 LCEKTYLCYVHQGKYIYSPLNYDILEYASSFHSEQCSD 746



 Score = 71.6 bits (174), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 82/364 (22%), Positives = 158/364 (43%), Gaps = 47/364 (12%)

Query: 694  VRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC--VGTAYVLPEENE-PTKGR 750
            ++++   + + I    L+  E   S+ +C         +C  VGT   L  +N   T   
Sbjct: 930  IKIIHPVSLQTIDKISLEMEEAALSVCACELE----ALHCLIVGTTTNLSLKNRTATTAA 985

Query: 751  ILVFIVE-DGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSE 809
            + V+  + + KL L+     +   +  + FNG+LLA++  K+++Y       G ++L  +
Sbjct: 986  LRVYTYDINYKLNLLHITPVEDQPFCFSPFNGRLLASVGNKLRIYAL-----GKKKLLKK 1040

Query: 810  CGHHGHILALY-VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILD 868
            C +     A+  ++  GD I   D+ +S+ +  Y     A+   + D    W++  EILD
Sbjct: 1041 CEYKDIPEAIISIKVSGDRIFASDIRESVLVFFYDANMNALRLISDDIIPRWITCSEILD 1100

Query: 869  DDIYLGAENNFNLFTVRKNSEGATDEERG-------------------RLEVVGEYHLGE 909
                + A+   ++F +R   E A  EE G                   RLE +  +H+GE
Sbjct: 1101 HHTIMAADKFDSVFVLRVPEE-AKQEEYGISNKCWYGGEMMAGSNKNRRLEHIMNFHVGE 1159

Query: 910  FVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFL-EKLQTNLRKVI 968
             V   +   L    P S       +I+ T+ G IG      +++ L L + L+  LR   
Sbjct: 1160 IVTSLQKVKLS---PTSS----ECIIYSTIMGTIGAFIPYDNKEELELTQHLEIILRTEN 1212

Query: 969  KGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKR 1028
              + G  H  +RS+ +  + V     +DGDL E F  L      +++  +  + +++ ++
Sbjct: 1213 PPLCGREHIFFRSYYHPVQHV-----IDGDLCEQFSSLPYDIQRKVAADLERTPDDILRK 1267

Query: 1029 VEEL 1032
            +E++
Sbjct: 1268 LEDI 1271


>gi|429328598|gb|AFZ80358.1| hypothetical protein BEWA_032110 [Babesia equi]
          Length = 1387

 Score =  127 bits (320), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 214/1089 (19%), Positives = 417/1089 (38%), Gaps = 234/1089 (21%)

Query: 176  GVLIIGEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLL-VIT 231
             VL++G  ++ + + N  K+I       S+++  G    + S+++L D  G +HL+ ++ 
Sbjct: 303  SVLLLGAGSVGFFAFNDPKSIKDFVSDFSMSEITGYCQIEHSKFVLSDDNGNIHLMELVL 362

Query: 232  HEKEKVTGLK-----------------------------IELLGETSIASTISYLDNAVV 262
              K   TGLK                             I  L    I S+I  + N  +
Sbjct: 363  SSKPANTGLKRRIRPINSNNNCDMHAHRGRTIATISDVIITRLYNCDIPSSIINIGNNTI 422

Query: 263  YIGSSYGDSQLIKLNL--QPDAKGSYVEVLERYV-------NLGPIVDFCVVDLERQGQG 313
            +  S  G++   +++   +P       ++ E  V       NLGPI DF   +    G  
Sbjct: 423  FSTSKVGNTCTFRIDFGNKPSICSDSFKIEEFNVYKTWNQTNLGPIADFTYGEENSHGNK 482

Query: 314  QVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRS--STDDPFDTFLVVSFIS 371
             V+ C G    G    +  G+GI+   S  + GI  ++S+ S  S  D  ++ L  SF  
Sbjct: 483  SVLACCGMGSAGKFCSITLGVGIDTICSNTILGISNLFSVSSGGSKTDSNESILCCSFYR 542

Query: 372  ETRILAMNLED-------------ELEETEI----------EGFCSQTQTLFCHDAIYNQ 408
             T+  ++N                ++ E  +            F    +T+   +   ++
Sbjct: 543  NTQFYSVNFVSMNNTQGNINSSLLDINEPNVGYVNKMVMDGSTFREDERTILLSNIDKDK 602

Query: 409  LVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNV--ATANASQVLLATGGGHLVYLEIGD- 465
            +VQ+T+ S+ ++SS  +  R  +K+       +   +   ++++ A   G  +++ + + 
Sbjct: 603  IVQITTSSIIILSSNPKLNRIAYKTVEEIMTEIDEGSEERNRIVHACLCGSYIFIILSNH 662

Query: 466  --GILTEVKHAQLEYEISCLDINP---IGENPSYSQIAAVGMWTDISVRIFSLPDLNLIT 520
               IL   K   +  E   ++I+    I    + ++  A+  W   S+ I S P+L+++ 
Sbjct: 663  WAAILDGKKLEMVNVEPLSINISSVAYISHVKNSAEFLALSTWEKSSILILSFPNLDIVR 722

Query: 521  KEHLGGE--IIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMK----------------- 561
            +  +     I  +S+   A     +   +L DG L+ + + M                  
Sbjct: 723  RIKIECTFGIYIKSMKFGATNNNLFFFVSLSDGQLMVYKVQMSRNITNAVSKGKSDVFKN 782

Query: 562  --------------TGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKK 607
                          +   T  + ++L    I  +T  + +  ++ A   + T I+S   K
Sbjct: 783  DVEIDFILDSTMHLSNTCTKLEFLNLSNNYIHEQT-CNLDCNNIIAIGKQSTAIFSKRDK 841

Query: 608  LLYSNVNLKEVSHMCPF----NSAAFPDSL-AIAKEGELTIGTIDDIQKLHIRSIPLGEH 662
            + ++ VN+  VS +C       S+ F   L A      L IG +D I++ H+++I  G  
Sbjct: 842  VEFAKVNISHVSCLCTIPTQRTSSEFNSVLIAYYSYPSLVIGKLDPIEQFHVQNIISGRT 901

Query: 663  PRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFIS---------------- 706
              ++C+   S+   +      S      +  +  +DD    F+                 
Sbjct: 902  FDKVCYHVPSKVAVVGCPPELSATTSDSVPGLNDIDDNNPPFMCLDGSESEKVNSLKLPA 961

Query: 707  ------------TYPLD-TFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILV 753
                         Y L+   E+  S L     D +N    +GT+ V    + PT+G   +
Sbjct: 962  LALFINVSSKEVIYNLNLPREHLVSSLHTMSFDGTNEVILIGTSKVDESHDAPTEG--FL 1019

Query: 754  FIVE-----DGKLQLIAEKETK---GAVYSLNAFNGKLLAAINQKIQL------------ 793
            +IV+     + +L +  ++ TK   G V  + +     + A+N  +              
Sbjct: 1020 YIVDISSHSEDRLVMTIQRTTKPFAGGVVEITSCGQNPVIAVNNVVLALSLCSNIGVRNE 1079

Query: 794  -------YKWMLRD--DGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKH 844
                   Y +   D  D  + L  E  +  +   + + +  + I+VGDLM S  LL +K 
Sbjct: 1080 KDRYDLEYLYACDDIIDSRQCLVVEAKYTSNTYVVSLDSCENNIIVGDLMTSAKLLRFK- 1138

Query: 845  EEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGE 904
             +    E  RD+N+ W +AV  +D +  + ++++ N     KN     D +  + +V+G 
Sbjct: 1139 -DNCFYELCRDFNSIWCTAVASIDKNTSIVSDDSGNFSIFSKNETPTNDAQSAKFQVMGL 1197

Query: 905  YHLGEFVNRF-----------RHGSLVMR-------------LPDSDVGQI--------- 931
            +H GE +N+            R  SL+               L  +D   +         
Sbjct: 1198 FHHGEIINKIVEHVSKECKYERKKSLIKLGGVASSCKRNFCCLSTADTCPMDPTNGSKST 1257

Query: 932  --------------PTVIFGTVNGVIGVIASLPH-EQYLFLEKLQTNLRKVIKGVGGLNH 976
                           T +  T +G I  +++    + +L L  L+  +  +    G + +
Sbjct: 1258 NIHKDAAPTRYRFDKTYLCATSSGSILKMSTFKDMDAFLRLALLEEAIIGIQSDAGNIPN 1317

Query: 977  EQWRSFNNEKKTVDAKNFLDGDLIESFL----DLSRT---RMDEISKTMN---VSVEELC 1026
             +WR+F N    +  + F+DGD +ESFL    DL R    ++ E SK +     S+E L 
Sbjct: 1318 SKWRNFQNSHTNIQTRGFIDGDTVESFLHLPMDLKRRVLEKLQESSKDIQGDFDSIETLT 1377

Query: 1027 KRVEELTRL 1035
              +E++ R+
Sbjct: 1378 LEIEQIQRV 1386


>gi|68531971|ref|XP_723667.1| hypothetical protein [Plasmodium yoelii yoelii 17XNL]
 gi|23478038|gb|EAA15232.1| Drosophila melanogaster CG13900 gene product [Plasmodium yoelii
           yoelii]
          Length = 1235

 Score =  126 bits (316), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 156/700 (22%), Positives = 292/700 (41%), Gaps = 85/700 (12%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
           ++G I +LE+FR  G  +D++ I ++  +  +L++D E ++ I R   +   + G R   
Sbjct: 58  VFGIIRSLEIFRLLGSNKDYIAIGSDSGRLVILKYDDEKNDFI-RVHCETYGKSGIRRII 116

Query: 65  NGQIGIIDPDCRLIGLHLYDG--LFKVIPFDNKGQLKEAFNIRLEELQVLD---IKFLYG 119
            G+   IDP  R + L   +      ++  DNK  L  +  +   +   +    +    G
Sbjct: 117 PGEYIAIDPKGRALMLCAIEKQKFVYILNRDNKENLTISSPLEAHKSHSICHAVVALNVG 176

Query: 120 CAKPTIVVLYQ--DNKDARHVKTYEVAL----KDKDFVEGPWSQNN--------LDNGAD 165
              P  V + Q  +N D + +   E  +    K   F E     N+        +D  A 
Sbjct: 177 FENPMFVSIEQNYENLDKQVLNANEQIMEYPKKGLCFWEMDLGLNHVIKKHTIPIDITAH 236

Query: 166 LLIPVP-----PPLCGVLIIGEETIVYCSAN---AFKAIPIR------PSITKAYGRVDA 211
           LLIP+P     P   G+++  E  +VY   +    F + P R       +I+     +  
Sbjct: 237 LLIPLPGGQQGPS--GLIVCCENYLVYKKIDHDDIFCSYPRRLEVGEEKNISIVCWTIHR 294

Query: 212 DGSRY--LLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYG 269
             + +  L+    G L+ + + HE   V  +  +      IA++I  L +  +++ + +G
Sbjct: 295 IKTFFFILIQSEYGDLYKIEVNHEDGIVKEIICKYFDTVPIANSICVLKSGALFVAAEFG 354

Query: 270 DSQLIKLN-------------LQPDAKGSYVE----------VLERYVNLGPIVDFCVVD 306
           +    + +               P  K + +           ++++  +L PIVD  ++D
Sbjct: 355 NHFFYQFSGIGNDSNDAMCTSNHPSGKNAIIAFKTQKLKNLYLVDQIYSLSPIVDMKILD 414

Query: 307 LERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTDDPFDTFL 365
            +     Q+    G     SLRI+++G+ I E A+ EL G  + +W+++      +D ++
Sbjct: 415 AKNSNLPQIYALCGRGPRSSLRILQHGLSIEELANNELPGKPRYIWTVKKDNSSEYDGYI 474

Query: 366 VVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSR 425
           +VSF   T IL +   + +EE       +   T+  +    N  +QV    +R ++    
Sbjct: 475 IVSFEGNTLILEIG--ETVEEVYDSLLLTNVTTIHINLLYDNSFIQVYDTGIRHING--- 529

Query: 426 ELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDG-ILTEVKHAQLEYEISCLD 484
           ++  EW  P    +N AT+N SQ++++  GG L+Y EI +   LTE+    +  EI CL 
Sbjct: 530 KIVQEWIPPKNKQINAATSNGSQIVVSLSGGELIYFEIDESHTLTEIFRKNINVEILCLS 589

Query: 485 INPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVL---LCAFE-- 539
           I  I +N   +   AVG   ++ VR+ S+        + L   I+P +     +C  E  
Sbjct: 590 IQQIQQNKLRASFLAVGCLDNV-VRLLSIEKDQYF--KQLSTYILPNNSSPQDICISEMK 646

Query: 540 --------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHV 591
                    I YL   L  G LL  +++   G L++     LG + + +          +
Sbjct: 647 ELGNQKEHTILYLNIGLNTGVLLRSVIDPICGTLSNHYSKYLGAKSVKICHVQVNKNPAL 706

Query: 592 FAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPD 631
              S++  + Y    K +YS +N   + +   F S    D
Sbjct: 707 LVLSEKTYLCYVYQGKYIYSPLNYDVLEYASSFYSEQCSD 746



 Score = 65.9 bits (159), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 78/376 (20%), Positives = 154/376 (40%), Gaps = 57/376 (15%)

Query: 694  VRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKG-RIL 752
            +++++    + I    L+  E   S+ +C       +     T   L   N P+   R+ 
Sbjct: 875  IKIINPINLQIIDKISLELEEAALSVCACELEALHCLIVGTTTNMTLKNRNVPSASLRVY 934

Query: 753  VFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGH 812
             + + + KL L+     +   Y    FNG+++ ++  K+++Y       G ++L  +C +
Sbjct: 935  TYDI-NYKLNLLHITPIEDQPYCFCPFNGRVIVSVGNKLRIYAL-----GKKKLLKKCEY 988

Query: 813  HGHILALY-VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI 871
                 A+  ++  GD I   D+ +S+ +  Y   +  I   + D    W++  EILD   
Sbjct: 989  KDIPEAIVSIKVSGDRIFASDIRESVLIFFYDSNQNLIRLISDDIIPRWITCSEILDHHT 1048

Query: 872  YLGAENNFNLFTVRKN-------------SEGATDEERG-------------------RL 899
             + A+   ++F +R +              E A  EE G                   ++
Sbjct: 1049 IIAADKFDSVFILRVSLLTFFITPFCHLVPEEAKQEEYGIANKCWYGGEVINSSTKNRKM 1108

Query: 900  EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPH---EQYLF 956
            E +  +H+GE V   +   L    P S       +I+ T+ G IG  A +P+   E+   
Sbjct: 1109 EHIMSFHIGEIVTSLQKVKLS---PTSS----ECIIYSTIMGTIG--AFIPYDSKEELEL 1159

Query: 957  LEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISK 1016
             + L+  LR     + G  H  +RS+ +  + V     +DGDL E F  L      +I  
Sbjct: 1160 TQHLEIILRTEKHSLCGREHIFFRSYYHPVQHV-----IDGDLCEQFSSLPFEVQRKIGS 1214

Query: 1017 TMNVSVEELCKRVEEL 1032
             +  + +E+ +++E++
Sbjct: 1215 DLEKTPDEILRKLEDI 1230


>gi|146420838|ref|XP_001486372.1| hypothetical protein PGUG_02043 [Meyerozyma guilliermondii ATCC 6260]
          Length = 1206

 Score =  125 bits (315), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 237/1169 (20%), Positives = 447/1169 (38%), Gaps = 176/1169 (15%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
            + G I  ++  R  G   D L I ++  +  +L++D +  +         S      T  
Sbjct: 66   VIGVIQNIDRIRKGGSNLDLLVITSDSGRLSILEFDKDELKFFPVVQEPHSKNGMNRTTP 125

Query: 66   GQIGIIDPDCRLIGLHLYDG---LFKVIPFDNKGQLKEAF-NIRLEELQVLDIKFLYGCA 121
            G+   +DP  R I +   +    ++K    +NK +L     ++    L +  +    G  
Sbjct: 126  GEYLCVDPQDRTITIGAIERDKLMYKAQTNNNKLELLSPLESVSKNTLTIQMVSLDTGYE 185

Query: 122  KPTIVVL------------YQDNKDARHVKTYEV------ALKDKDFVEGPWSQNNLDNG 163
             P +  +            Y        ++ YE         + KD +E P S   L   
Sbjct: 186  NPMLAAIECNYAHYDASLKYDPQSSNLTLQYYEFEQGLNYVARRKDTLEIPSSSTTL--- 242

Query: 164  ADLLIPVPPPLCGVLIIGEETIVY----CSANAFKAIPIR------PSITKAYGRVDADG 213
                +P+P P+ GV++ G   I Y         +  IP+R      P +  A  ++  + 
Sbjct: 243  ----VPLPTPIGGVIVAGSSFIFYHNPTIDQQLYLPIPLRAGSSPVPIVCYAVHKLKKNN 298

Query: 214  SRYLLGDHAGLLHLLVITH--EKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDS 271
               LL +  G    ++I +  + EKVT L +      S +++I+      ++   +  D 
Sbjct: 299  FFILLHNELGDCFRVLIDYDDDSEKVTELSVGYFDTISPSTSINVFKKGYLFANVTNNDK 358

Query: 272  QLIKLN--------------------LQPDAKGSYVEVLERYVNLGPIVDF-------CV 304
             L ++                        + K  +     R + L  I+D         +
Sbjct: 359  MLYQIEDLGDNDSYISSSQFSSLEDVFDGNKKHEFKPRGLRNLALVQIIDSSNPCFGGAL 418

Query: 305  VDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIK-GMWSLRSSTDDPFDT 363
            V      + ++   +G      L++  +GI I+   S  L  I   +++ R S +   D 
Sbjct: 419  VKTSESKESRIAMITG---HSHLKLKTHGIPISTLVSSPLPMIATSVFTTRLSAESKNDE 475

Query: 364  FLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSST 423
            ++V+S  + ++ L + + + +EE +   F +   T+         L+Q+ S  +R +  T
Sbjct: 476  YMVISSSASSKTLVLAIGEVVEEVQDSSFVTDQPTIGVQQVGLKSLIQIYSNGIRHIRQT 535

Query: 424  SRELR-----NEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEY 478
              E +      +W  P G ++  A+ N  QVL+      L Y EI D    ++   Q   
Sbjct: 536  ETEGKITKKTFDWYPPAGITIISASTNQEQVLIGLSNRELCYFEI-DPTDDQLIEYQERI 594

Query: 479  EISCLDINPIGENPSYSQIA------AVGMWTDISVRIFSLPD---LNLITKEHLGGEII 529
            E+S   I  +    S+   +      A+   TD +V++ SL     L  +T + L     
Sbjct: 595  EMSGGQICALALASSFVNKSQRKSPFALVACTDETVQVISLQQHNCLETLTFQALSANCT 654

Query: 530  PRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTT 589
               ++      +++L   L +G  +   L   TG+ +D +   LG   + L   S +   
Sbjct: 655  SAVIISNETSLVAHL--GLENGLYVRSTLESITGKFSDTRVKYLGADSVKLSAISLRQLQ 712

Query: 590  H--VFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTID 647
               V A S  P + Y S  K   + +    +     F S    + +  A   ELTI TI 
Sbjct: 713  QVGVLAVSTYPWLCYRSKNKTRITPLMGANIKCGTSFYSEEIGEGIVGASASELTIFTIG 772

Query: 648  DIQ---------KLHIRSIPLGEHPRRICHQEQSRTFAICS-----LKNQSCAE------ 687
            + +         +L I  I L   P ++  ++Q+  F I S     L  QS  E      
Sbjct: 773  EDEDNSEMSIEDELAIEDIRLRYSPTKMILEDQT-AFVIESEYGVVLPYQSGEEVDSDYY 831

Query: 688  ---------ESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAY 738
                     ES    V++LD ++ +   +  L+  +   S+   +F +    Y  VGTA 
Sbjct: 832  SAFGYLRKDESWASCVQILDLESKKITFSTELEDNQKPLSLCRMNFGNQK--YLMVGTAK 889

Query: 739  VLP-EENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLY--- 794
             L  + N   K +I+ F++   +L+L+   E      ++  F GKLL  + + ++LY   
Sbjct: 890  DLTFQLNNNPKYKIITFLINGSELELLHYTEVDHPPAAMIPFEGKLLVGMGKYLRLYELG 949

Query: 795  -KWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
             K +LR   T        +   I+ +  Q +   IVVGD   S + L Y   +      A
Sbjct: 950  KKQLLRKSSTL-----VDYLTKIVQITHQGKQR-IVVGDGSNSTTFLKYDSLDNIFVSFA 1003

Query: 854  RDYNANWMSAVEILDDDIYLGAENNFNLFTVR----------------KNSEGATDEERG 897
             D     ++A+E LD D  +G +   N+F  R                K  +   +    
Sbjct: 1004 DDVMKRHITALECLDHDTVIGGDKFGNVFVNRIPFTLSKQADQEWSLVKYQDHYLNSAGN 1063

Query: 898  RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHE-QYLF 956
            RL+ + E+ L +    F  G+LV        G   ++ +  + G +G    L  + +  F
Sbjct: 1064 RLKGLCEFFLQDIPTLFFKGTLV-------TGGKESIFYTGLCGSLGFFEPLISKLEVSF 1116

Query: 957  LEKLQTNLRKVIK-------------GVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESF 1003
               L+ ++RKV+               + G +  ++R + N       KN +DGD +E +
Sbjct: 1117 FTALENSIRKVLDPNLEEHDKKRLYCQLLGKDQLKFRGYYNP-----VKNVIDGDFVEYY 1171

Query: 1004 LDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
             +L      +I+  ++ +  ++ +++ ++
Sbjct: 1172 FELDPRTKTKIATELDRTPRDIERKIADI 1200


>gi|297598550|ref|NP_001045829.2| Os02g0137400 [Oryza sativa Japonica Group]
 gi|255670583|dbj|BAF07743.2| Os02g0137400 [Oryza sativa Japonica Group]
          Length = 845

 Score =  124 bits (312), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 110/413 (26%), Positives = 187/413 (45%), Gaps = 22/413 (5%)

Query: 295 NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIK-GMWSL 353
           +L PI+D  V +L  +   QV T  G     +LRI+R G+ I+E A   L      +W++
Sbjct: 28  SLMPIMDMRVANLFDEETPQVFTACGRGPRSTLRILRPGLAISEMARSMLPAEPIAVWTV 87

Query: 354 RSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVT 413
           + + +D FD ++VVSF + T +L++   + +EE     F   T +L       + L+QV 
Sbjct: 88  KKNINDMFDAYIVVSFANVTLVLSIG--ETIEEVSDSQFLDTTHSLAVSLLGEDSLMQVH 145

Query: 414 SGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTEVK 472
              +R +    R   NEW++P   ++    +N  QV++A  GG L+Y E+   G L EV+
Sbjct: 146 PNGIRHIREDGR--VNEWRTPGKKTITKVGSNRLQVVIALSGGELIYFEMDMTGQLMEVE 203

Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPR 531
              +  +++CL I P+ E    S+  AVG + D ++RI S+ PD  L           P 
Sbjct: 204 KQDMSGDVACLAIAPVPEGRQRSRFLAVGSF-DNTIRILSVDPDDCLQPLSVQSVSSAPE 262

Query: 532 SVLLCAFEGIS------------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPIT 579
           S++    +               +L   L +G L    ++M TG+L+D +   LG +P  
Sbjct: 263 SLMFLEVQASVGGEDGADHPANLFLNAGLQNGVLFRTNVDMVTGQLSDTRSRFLGLRPPK 322

Query: 580 LRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEG 639
           L      +   +   S RP + Y      L + ++   +     F+S    + +      
Sbjct: 323 LFPCIVSHRQAMLCLSSRPWLGYIHQGHFLLTPLSCDTLESAASFSSDQCSEGVVAVAGD 382

Query: 640 ELTIGTIDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAIC-SLKNQSCAEESE 690
            L I TI+ + +  +  +IPL   PR+     + +  A+  S K    AEE E
Sbjct: 383 ALRIFTIEHLGETFNETAIPLRYTPRKFVILPKKKYLAVIESDKGALSAEERE 435



 Score = 93.6 bits (231), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 91/365 (24%), Positives = 166/365 (45%), Gaps = 44/365 (12%)

Query: 694  VRLLDDQTFEFISTYPLDTFEYGCSILSCSFSD-DSNVYYCVGTAYVL---PEENEPTKG 749
            +R+LD ++ +      L   E   SI + +F D +      VGTA  L   P+ N  + G
Sbjct: 494  IRILDPKSRDTTCLLELQDNEAAVSICTVNFHDKEHGTLLAVGTAKGLQFWPKRNL-SAG 552

Query: 750  RILVF-IVEDGK-LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQ 807
             I ++  V++G+ L+L+ + + +    +L  F G+LLA +   ++LY     D G R+L 
Sbjct: 553  FIHIYKFVDEGRSLELLHKTQVEEVPLALCQFQGRLLAGVGSVLRLY-----DLGKRKLL 607

Query: 808  SECGHHGHILALY-VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEI 866
             +C +      +  + T  D I VGD+ +S     Y+ +E  +   A D    W++A   
Sbjct: 608  RKCENKLFPRTIVSIHTYRDRIYVGDMQESFHYCKYRRDENQLYIFADDSVPRWLTAANH 667

Query: 867  LDDDIYLGAENNFNLFTVR------------------KNSEGATDEERGRLEVVGEYHLG 908
            +D D   GA+   N++  R                  K  +G  +    ++E + ++H+G
Sbjct: 668  IDFDTMAGADKFGNIYFARLPQDLSDEIEEDPTGGKIKWEQGKLNGAPNKVEEIVQFHVG 727

Query: 909  EFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIG-VIASLPHEQYLFLEKLQTNLRKV 967
            + V   +  SL+        G    +I+GTV G +G ++A    E   F   L+ +LR+ 
Sbjct: 728  DVVTCLQKASLI-------PGGGECLIYGTVMGSVGALLAFTSREDVDFFSHLEMHLRQE 780

Query: 968  IKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCK 1027
               + G +H  +RS          K+ +DGDL E F  L      +I+  ++ +  E+ K
Sbjct: 781  HPPLCGRDHMAYRS-----AYFPVKDVIDGDLCEQFPSLPADMQRKIADELDRTPGEILK 835

Query: 1028 RVEEL 1032
            ++E++
Sbjct: 836  KLEDI 840


>gi|294875343|ref|XP_002767276.1| spliceosome factor, putative [Perkinsus marinus ATCC 50983]
 gi|239868839|gb|EEQ99993.1| spliceosome factor, putative [Perkinsus marinus ATCC 50983]
          Length = 1258

 Score =  124 bits (311), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 174/744 (23%), Positives = 300/744 (40%), Gaps = 115/744 (15%)

Query: 1   MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
           ++ V  +G I +LE FR  G  +D+L + ++  +  +L+++  +  +  +   +   + G
Sbjct: 53  VISVNCFGLIRSLETFRLVGANRDYLLVGSDSGRIVILEYNT-TKNIFDKVHQETYGKTG 111

Query: 61  -RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL-- 117
            R T  GQ   +DP  R   +   +    V   +   Q +   +  LE  +   I F   
Sbjct: 112 CRRTVPGQYLAVDPKGRSCMVSAVERQKFVYILNRDLQSRLTISSPLEAHKSYTIVFATV 171

Query: 118 ---YGCAKPTIVVL---YQD-----NKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADL 166
               G   P    +   Y D         + +  YE+ L            + +   A  
Sbjct: 172 GLDVGFDNPQFAAIECQYDDFGKKGPNPQKLLTIYEMDLGVNHVTRK--HSDKIPFTAHD 229

Query: 167 LIPVPPPL---CGVLIIGEETIVYCSAN---AFKAIPIRPSITKAYGRVDADGSR----- 215
           LIPVP       GVL+  E  + Y          A+P R       G +    SR     
Sbjct: 230 LIPVPGGTDGPGGVLLCCENFLSYYKQGHPVLSCAMPRRLDTPAEKGLMVVCWSRHKLKN 289

Query: 216 ---YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQ 272
              +L+    G L+ + ++H++  V+ ++        +A +I  L    ++  S +G+  
Sbjct: 290 FFFFLIQSEYGDLYKVTLSHKEGVVSEIQCSYFDSIPVAISICVLKTGFLFAASEFGNHA 349

Query: 273 LIKL------------NLQPDAKGSYVEVLERYV----------NLGPIVDFCVVD---- 306
           L +             +  P+ + + V    R +          +L PI D  V+D    
Sbjct: 350 LYQFASLGDVTPALVTSSHPNRENAVVAFKPRTLKNLTPFDELSSLAPITDMKVMDCFST 409

Query: 307 ----LERQGQG-------------QVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IK 348
               L+  G G             Q+   SG     +LRI+R+G+ + E  + EL G   
Sbjct: 410 QTQVLQADGSGMQQTVTTGMSVGCQIYALSGRGPRSALRILRHGLTLGEAGASELPGQPN 469

Query: 349 GMWSLRSSTDDPF--------------DTFLVVSFISETRILAMNLEDELEETEIEGFCS 394
            +++++     PF              D ++VVSF+ +T  L +   D + E    GF  
Sbjct: 470 ALFTIK-----PFGASYAPVAEGEVESDRYIVVSFVDQTLTLLVT-SDNIHEVTDSGFAK 523

Query: 395 QTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATG 454
           +  TLF         +QV    +R V++  R    EW++PPG  V +A +N SQV++A  
Sbjct: 524 EQPTLFAMRMQDKSAIQVMPTGIRHVAAGRRT--TEWRAPPGRQVTMAASNGSQVVIALS 581

Query: 455 GGHLVYLEIG---DGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIF 511
           GG +   E+    +G L+EV    +  E++ L + P+    + SQ  AV    D SVR+ 
Sbjct: 582 GGEIQLFELDADTNGHLSEVAKRDIGCEVAALTVQPLSSGRTRSQFMAVA-GVDSSVRVL 640

Query: 512 SL-PDLNLITKEHLGGEIIPRSVLLCAF-EGIS---YLLCALGDGHLLNFLLNMKTGELT 566
           SL  D  L             SV +  F EG +   YL   L DG L+  +++  TG+L+
Sbjct: 641 SLDSDRPLRQLAAQALRTTASSVCMLQFGEGTTATIYLAIGLEDGVLVRSVVDGVTGQLS 700

Query: 567 DRKKVSLGTQPITLRTFS-SKNTTHVFAASDRPTVIYSSNKKLL-------YSNVNLKEV 618
           D+++  LG + +TL   +       + A S RP + + +N           Y  V L ++
Sbjct: 701 DQRQRVLGPRRVTLCPITVDGGRPAMVAMSSRPFLCFQANDYSAAGASGGQYQCVPLTKI 760

Query: 619 SHMCPFNSAAFPDSLAIAKEGELT 642
             +  F+ AA P S+   +EG  T
Sbjct: 761 PALPRFDHAA-PFSIENCQEGICT 783



 Score = 76.3 bits (186), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 82/366 (22%), Positives = 159/366 (43%), Gaps = 41/366 (11%)

Query: 694  VRLLDDQTFEFISTYPLDTFEYGCSILSCSFSD--DSNVYYCVGTAYVL----PEENEPT 747
            +R++D  T        LD  E   ++  C F    D+     VGTA  +    P  +   
Sbjct: 902  IRVVDPLTASTSFKLDLDVDEAATAMTVCYFYQLKDNRPCLVVGTATGVDPHNPSRSAHG 961

Query: 748  KGRILVFIVEDG-KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWM-LRDDGTRE 805
            K  I  ++ ++   LQLI     +G   ++  F G+LL A+     +   + + + G + 
Sbjct: 962  KCYIKTYLYDESYNLQLIHVTPLEGVPSAMYPFEGRLLVALRGSPTVAPVLRIYELGKKR 1021

Query: 806  LQSECGHH-----GHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANW 860
            L  +C +      G I+ L V    D I   D   SI +L +++ +  ++  + D     
Sbjct: 1022 LLKKCEYKFLPESGGIMWLDVNK--DRIFAADSRDSILVLRWRYSDNQMQVISDDTYPRC 1079

Query: 861  MSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGR-----------LEVVGEYHLGE 909
            ++A  +LD +  +  +   N+  +R   +       GR           ++++G +H+GE
Sbjct: 1080 ITAAAVLDYNTIVVGDKFDNIAVLRVPGDAKDAGAWGRDNDYASGNTFKMDLIGHFHVGE 1139

Query: 910  FVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHE-QYLFLEKLQTNLRKVI 968
             +   +  ++V        G    VI+ TV G IG +     + ++ FL+ L+ ++R   
Sbjct: 1140 TITSLQRVTMV-------AGGAEIVIYSTVLGTIGALYPFSSKREHGFLQALEMHMRNTA 1192

Query: 969  KG--VGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELC 1026
                + G  H  +RSF +       KNF+D DL E +  L   +  +I+  M+ + +E+ 
Sbjct: 1193 ASPSLTGREHVMYRSFYHP-----IKNFVDADLCEVYYQLPAEKQRQIAVDMDKTPQEVM 1247

Query: 1027 KRVEEL 1032
            K++E++
Sbjct: 1248 KKLEDI 1253


>gi|147839354|emb|CAN61260.1| hypothetical protein VITISV_043608 [Vitis vinifera]
          Length = 831

 Score =  124 bits (310), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 59/67 (88%), Positives = 59/67 (88%)

Query: 46  ELITRAMGDVSDRIGRPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIR 105
           E I  AMGDVSDRIGRPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIR
Sbjct: 763 EKIYGAMGDVSDRIGRPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIR 822

Query: 106 LEELQVL 112
              L  L
Sbjct: 823 YFFLSFL 829



 Score =  103 bits (258), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 47/53 (88%), Positives = 49/53 (92%)

Query: 1   MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMG 53
           MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAE+SE+I    G
Sbjct: 483 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAENSEVIASIYG 535


>gi|389586447|dbj|GAB69176.1| splicing factor 3B subunit 3 [Plasmodium cynomolgi strain B]
          Length = 1286

 Score =  124 bits (310), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 145/713 (20%), Positives = 288/713 (40%), Gaps = 111/713 (15%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
           ++G I  L+ FR  G  +D++ I ++  +  +LQ+  E ++ + R   +   + G R   
Sbjct: 58  VFGIIRCLQTFRLTGSNKDYVVIGSDSGRLVILQFSNEKNDFV-RVHCETYGKSGLRRII 116

Query: 65  NGQIGIIDPDCRLIGLHLYD--GLFKVIPFDNKGQLKEAF-------------------- 102
            G+   +DP  R + +   +      ++  D K QL  +                     
Sbjct: 117 PGEYIAVDPKGRALMICAIERQKFVYILNRDTKEQLTISSPLDAHKSHTICHDVVGMDVG 176

Query: 103 -----------NIRLEELQVLDIKFLYGCAKPTIVVLYQDNKDARHVKTYEVALKDKDFV 151
                      N  + + QV +   + GC + T++ L++ +    HV             
Sbjct: 177 FENPIFASIEQNYEMYDKQVTNTNEIDGCTRKTLLCLWEMDLGLNHVIRKHTL------- 229

Query: 152 EGPWSQNNLDNGADLLIPVP---PPLCGVLIIGEETIVYCSA---NAFKAIPIR------ 199
                   +D+ A LLIP+P       GV++  +  +VY      + + A P R      
Sbjct: 230 -------PIDSSAHLLIPIPGGQQGPSGVIVCCDNYLVYKKVEHVDVYCAYPRRLETGQE 282

Query: 200 ---PSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISY 256
                +  A  R+       L+    G L+ + + HE   V  +  +      +A+ I  
Sbjct: 283 KNISIVCSALHRI-RKFFFILIQSEFGDLYKIEMDHEDGIVKEITCKYFDTVPVANAICV 341

Query: 257 LDNAVVYIGSSYG-------------DSQLIKLNLQPDAKGSYVE----------VLERY 293
           + +  +++ + +G             D++ +  +  P  + + +           ++++ 
Sbjct: 342 MKSGSLFVAAEFGNHFFYQFSGIGDDDNEAMCTSKHPSGRNAIIAFRTKKLTNLFLIDQV 401

Query: 294 VNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWS 352
            +L PI+D  ++D +     Q+    G     SLRI+++G+ I E A  EL G  K +W+
Sbjct: 402 YSLSPILDMKILDAKNANSPQIYALCGRGPRSSLRILQHGLSIEELADNELPGRPKFIWT 461

Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
           ++      +D +++VSF   T IL +   + +EE       +   T+  +    N L+QV
Sbjct: 462 IKKDNASDYDGYIIVSFEGSTLILEIG--ETVEEVVDSLLLTNVTTIHVNILYDNTLIQV 519

Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDG-ILTEV 471
               +R ++    ++ +EW  P    +  AT+N++Q++++  GG L+Y EI +   L E+
Sbjct: 520 HDTGIRHING---KVVHEWVPPKNKQIKAATSNSTQIVISLSGGELIYFEIDESHTLVEI 576

Query: 472 KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPR 531
               L  E  CL I  + EN   +   AVG   ++ VR+ S+       +  L   I+P 
Sbjct: 577 FRKNLNVETLCLSIQQVEENKLRANFLAVGCLDNV-VRLLSIEKEKYFNQ--LSTFILPN 633

Query: 532 S--------VLLCAF-----EGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPI 578
           +          +C         + +L   L +G LL  +++  TG LT+     LG + +
Sbjct: 634 NSSAQDICISQMCELGNDKERKLIFLNIGLNNGVLLRSVVDPITGTLTNHYSKYLGAKNV 693

Query: 579 TLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPD 631
            +     K  + +    ++  + Y    K +YS +N   + +   F+S    D
Sbjct: 694 KICPVHLKKNSALLVLCEKTYLCYVHQGKYIYSPLNYDILEYASSFHSEQCSD 746



 Score = 70.5 bits (171), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 82/364 (22%), Positives = 152/364 (41%), Gaps = 47/364 (12%)

Query: 694  VRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC--VGTAYVLPEENEPTKGRI 751
            ++++   + + I    L+  E   S+ +C         +C  VGT   L  +N       
Sbjct: 940  IKIIHPISLQTIDKISLEMEEAALSVCACELE----ALHCLIVGTTTNLSLKNRTAPAAA 995

Query: 752  LVFIVED--GKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSE 809
            L     D   KL L+     +   +    FNG+LLA+I  K+++Y       G ++L  +
Sbjct: 996  LRVYTYDINYKLNLLHITPVEDQPFCFCPFNGRLLASIGNKLRIYAL-----GKKKLLKK 1050

Query: 810  CGHHGHILALY-VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILD 868
            C +     A+  ++  GD I   D+ +S+ +  Y      +   + D    W++  EILD
Sbjct: 1051 CEYKDIPEAIISIKVSGDRIFASDIRESVLIFFYDSNMNTLRLISDDIIPRWITCSEILD 1110

Query: 869  DDIYLGAENNFNLFTVRKNSEGATDEERG-------------------RLEVVGEYHLGE 909
                + A+   ++F +R   E A  EE G                   RLE +  +H+GE
Sbjct: 1111 HHTIMAADKFDSVFVLRVPEE-AKQEEYGISNKCWYGGEIMAGSNKNRRLEHIMSFHVGE 1169

Query: 910  FVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFL-EKLQTNLRKVI 968
             V   +   L    P S       +I+ T+ G IG      +++ L L + L+  LR   
Sbjct: 1170 IVTSLQKVKLS---PTSS----ECIIYSTIMGTIGAFIPYDNKEELELTQHLEIILRTEN 1222

Query: 969  KGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKR 1028
              + G  H  +RS+ +  + V     +DGDL E F  L      +++  +  + +++ ++
Sbjct: 1223 PPLCGREHIFFRSYYHPVQHV-----IDGDLCEQFSSLPYDVQRKVAADLERTPDDILRK 1277

Query: 1029 VEEL 1032
            +E++
Sbjct: 1278 LEDI 1281


>gi|190345965|gb|EDK37945.2| hypothetical protein PGUG_02043 [Meyerozyma guilliermondii ATCC 6260]
          Length = 1206

 Score =  123 bits (309), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 233/1169 (19%), Positives = 443/1169 (37%), Gaps = 176/1169 (15%)

Query: 6    IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
            + G I  ++  R  G   D L I ++  +  +L++D +  +         S      T  
Sbjct: 66   VIGVIQNIDRIRKGGSNLDLLVITSDSGRLSILEFDKDELKFFPVVQEPHSKNGMNRTTP 125

Query: 66   GQIGIIDPDCRLIGLHLYDG---LFKVIPFDNKGQLKEAF-NIRLEELQVLDIKFLYGCA 121
            G+   +DP  R I +   +    ++K    +NK +L     ++    L +  +    G  
Sbjct: 126  GEYLCVDPQDRTITIGAIERDKLMYKAQTNNNKLELSSPLESVSKNTLTIQMVSLDTGYE 185

Query: 122  KPTIVVL------------YQDNKDARHVKTYEV------ALKDKDFVEGPWSQNNLDNG 163
             P +  +            Y        ++ YE         + KD +E P S   L   
Sbjct: 186  NPMLAAIECNYAHYDASLKYDPQSSNLTLQYYEFEQGLNYVARRKDTLEIPSSSTTL--- 242

Query: 164  ADLLIPVPPPLCGVLIIGEETIVY----CSANAFKAIPIR------PSITKAYGRVDADG 213
                +P+P P+ GV++ G   I Y         +  IP R      P +  A  ++  + 
Sbjct: 243  ----VPLPTPIGGVIVAGSSFIFYHNPTIDQQLYLPIPSRAGSSPVPIVCYAVHKLKKNN 298

Query: 214  SRYLLGDHAGLLHLLVITH--EKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDS 271
               LL +  G    ++I +  + EKVT L +      S +++I+      ++   +  D 
Sbjct: 299  FFILLHNELGDCFRVLIDYDDDSEKVTELSVGYFDTISPSTSINVFKKGYLFANVTNNDK 358

Query: 272  QLIKLN--------------------LQPDAKGSYVEVLERYVNLGPIVDF-------CV 304
             L ++                        + K  +     R + L  I+D         +
Sbjct: 359  MLYQIEDLGDNDSYISSSQFSSLEDVFDGNKKHEFKPRGLRNLALVQIIDSSNPCFGGAL 418

Query: 305  VDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIK-GMWSLRSSTDDPFDT 363
            V      + ++   +G      L++  +GI I+   S  L  I   +++ R S +   D 
Sbjct: 419  VKTSESKESRIAMITG---HSHLKLKTHGIPISTLVSSPLPMIATSVFTTRLSAESKNDE 475

Query: 364  FLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSST 423
            ++V+S  + ++ L + + + +EE +   F +   T+         L+Q+ S  +R +  T
Sbjct: 476  YMVISSSASSKTLVLAIGEVVEEVQDSSFVTDQPTIGVQQVGLKSLIQIYSNGIRHIRQT 535

Query: 424  SRELR-----NEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEY 478
              E +      +W  P G ++  A+ N  QVL+      L Y EI D    ++   Q   
Sbjct: 536  ETEGKITKKTFDWYPPAGITIISASTNQEQVLIGLSNRELCYFEI-DPTDDQLIEYQERI 594

Query: 479  EISCLDINPIGENPSYSQIA------AVGMWTDISVRIFSLPD---LNLITKEHLGGEII 529
            E+S   I  +    S+   +      A+   TD +V++ SL     L  +T + L     
Sbjct: 595  EMSGGQICALALASSFVNKSQRKSPFALVACTDETVQVISLQQHNCLETLTFQALSANCT 654

Query: 530  PRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTT 589
               ++      +++L   L +G  +   L   TG+ +D +   LG   + L   S + + 
Sbjct: 655  SAVIISNETSLVAHL--GLENGLYVRSTLESITGKFSDTRVKYLGADSVKLSAISLRQSQ 712

Query: 590  H--VFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTID 647
               V A S  P + Y S  K   + +    +     F S    + +  A   ELTI TI 
Sbjct: 713  QVGVLAVSTYPWLCYRSKNKTRITPLMGANIKCGTSFYSEEIGEGIVGASASELTIFTIG 772

Query: 648  DIQ---------KLHIRSIPLGEHPRRICHQEQSRTFAI--------------------C 678
            + +         +L I  I L   P ++  ++Q+  F I                     
Sbjct: 773  EDEDNSEMSIEDELAIEDIRLRYSPTKMILEDQT-AFVIESEYGVVSPYQSGEEVDSDYY 831

Query: 679  SLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAY 738
            S    S  +ES    V++LD ++ +   +  L+  +   S+   +F +    Y  VGTA 
Sbjct: 832  SAFGYSRKDESWASCVQILDLESKKITFSTELEDNQKPLSLCRMNFGNQK--YLMVGTAK 889

Query: 739  VLP-EENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLY--- 794
             L  + N   K +I+ F++   +L+L+   E      ++  F GKLL  + + ++LY   
Sbjct: 890  DLTFQLNNNPKYKIITFLINGSELELLHYTEVDHPPAAMIPFEGKLLVGMGKYLRLYELG 949

Query: 795  -KWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
             K +LR   T        +   I+ +  Q +   IVVGD   S + L Y   +      A
Sbjct: 950  KKQLLRKSSTL-----VDYLTKIVQITHQGK-QRIVVGDGSNSTTFLKYDSSDNIFVSFA 1003

Query: 854  RDYNANWMSAVEILDDDIYLGAENNFNLFTVR----------------KNSEGATDEERG 897
             D     ++A+E LD D  +G +   N+F  R                K  +   +    
Sbjct: 1004 DDVMKRHITALECLDHDTVIGGDKFGNVFVNRIPFTLSKQADQEWSLVKYQDHYLNSAGN 1063

Query: 898  RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLF 956
            R + + E+ L +    F  G+LV        G   ++ +  + G +G    L    +  F
Sbjct: 1064 RSKGLCEFFLQDIPTSFFKGTLV-------TGGKESIFYTGLCGSLGFFEPLISKSEVSF 1116

Query: 957  LEKLQTNLRKVIK-------------GVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESF 1003
               L+ ++RKV+               + G +  ++R + N       KN +DGD +E +
Sbjct: 1117 FTALENSIRKVLDPNLEEHDKKRLYCQLLGKDQLKFRGYYNP-----VKNVIDGDFVEYY 1171

Query: 1004 LDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
             +L      +I+  ++ +  ++ +++ ++
Sbjct: 1172 FELDPRTKTKIATELDRTPRDIERKIADI 1200


>gi|380488197|emb|CCF37544.1| hypothetical protein CH063_08850 [Colletotrichum higginsianum]
          Length = 271

 Score =  123 bits (309), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 76/268 (28%), Positives = 134/268 (50%), Gaps = 16/268 (5%)

Query: 782  KLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLI 841
            KL+A +++ + +Y++      +  L+           + +   G+ I + DLM+S++L+ 
Sbjct: 7    KLVAGLSKTVVVYEYAEESSTSGALRKLATFRPSTFPVDIDVNGNMIGIADLMQSMTLVE 66

Query: 842  Y----KHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERG 897
            +       +  + ERAR +   W ++V  L+   +L A+   NL  +R+N +  T+ ++ 
Sbjct: 67   FIPAQDGNKAKLVERARHFQYIWATSVCHLEGHSWLEADAQGNLMVLRRNPDAPTEHDQK 126

Query: 898  RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFL 957
            ++EV  E+HLGE VN+ R   +    P+ +   +P     TV G + V A +  E    L
Sbjct: 127  QMEVTSEFHLGEQVNKIRPLDIT---PNENDPIVPKAFLATVEGSLYVFADIKSEYQSLL 183

Query: 958  EKLQTNLRKVIKGVG--------GLNHEQWRSFNNEKKTVDAK-NFLDGDLIESFLDLSR 1008
             + Q  L  V+K +G        GL+   WR F N K+  D    F+DG+LIE FLDL  
Sbjct: 184  LQFQERLADVVKTLGQAGGDSTSGLSFMAWRGFRNAKRAADGPFRFVDGELIERFLDLDE 243

Query: 1009 TRMDEISKTMNVSVEELCKRVEELTRLH 1036
             + + + + +  +VE +   VEEL R+H
Sbjct: 244  AKQEAVVQGLGPTVENMRNLVEELKRMH 271


>gi|156095699|ref|XP_001613884.1| Splicing factor 3B subunit 3 [Plasmodium vivax Sal-1]
 gi|148802758|gb|EDL44157.1| Splicing factor 3B subunit 3, putative [Plasmodium vivax]
          Length = 1230

 Score =  123 bits (308), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 150/700 (21%), Positives = 289/700 (41%), Gaps = 85/700 (12%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
           I+G I  L+ FR  G  +D++ I ++  +  +LQ+  E ++ + R   +   + G R   
Sbjct: 58  IFGIIRCLQTFRLTGSNKDYVVIGSDSGRLTILQFSNEKNDFV-RVHCETYGKSGLRRII 116

Query: 65  NGQIGIIDPDCRLIGLHLYD--GLFKVIPFDNKGQLKEAFNIRLEELQVL--DIKFL-YG 119
            G+   +DP  R + +   +      ++  D K QL  +  +   +   +  D+  +  G
Sbjct: 117 PGEYIAVDPKGRALMICAIERQKFVYILNRDTKEQLTISSPLDAHKSHTICHDVVGMDVG 176

Query: 120 CAKPTIVVLYQDNK--DARHVKTYEVALKDKDFVEGPWSQN------------NLDNGAD 165
              P    + Q+ +  D +   T E+    +  +   W  +             +D  A 
Sbjct: 177 FENPMFASIEQNYEALDKQVTNTSEIDSYTRKTLLSLWEMDLGLNHVIRKYTFPIDASAH 236

Query: 166 LLIPVP-----PPLCGVLIIGEETIVYCS---ANAFKAIPIRPSI--TKAYGRVDADGSR 215
           LLIP+P     P   GV++  +  +VY     A+ + A P R      K    V +   R
Sbjct: 237 LLIPIPGGQQGPS--GVIVCCDNFLVYKKVDHADVYCAYPRRLETGQEKNLSIVCSTLHR 294

Query: 216 Y------LLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYG 269
                  L+    G L+ + + HE   V  +  +      +A+ I  + +  +++ + +G
Sbjct: 295 IRKFFFILIQSELGDLYKIEMEHEDGVVKEITCKYFDTVPVANAICVMKSGSLFVAAEFG 354

Query: 270 -------------DSQLIKLNLQPDAKGSYVE----------VLERYVNLGPIVDFCVVD 306
                        D++ +  +  P  + + +           ++++  +L PI+D  V+D
Sbjct: 355 NHFFYQFSGIGDEDNEAMCTSKHPSGRNAIIAFRTKKLTNLFLIDQVYSLSPILDMKVID 414

Query: 307 LERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTDDPFDTFL 365
            +     Q+    G     SLRI+++G+ I E A  EL G  K +W+++      +D ++
Sbjct: 415 AKNASSPQIYALCGRGPRSSLRILQHGLSIEELADNELPGRPKFIWTIKKDNASDYDGYI 474

Query: 366 VVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSR 425
           +VSF   T IL +   + +EE       +   T+  +    N L+QV    +R ++    
Sbjct: 475 IVSFEGSTLILEIG--ETVEEVVDSLLLTNVTTIHVNILYDNSLIQVHDAGIRHING--- 529

Query: 426 ELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDG-ILTEVKHAQLEYEISCLD 484
           ++ +EW  P    +  AT+N +Q++++  GG L+Y EI +   L E     L  E  CL 
Sbjct: 530 KVIHEWVPPKNKQIKAATSNCAQIVISLSGGELLYFEIDESHTLVETFRKNLNVETLCLS 589

Query: 485 INPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVL---LCAFE-- 539
           I  + EN   +   AVG   ++ VR+ S+       +  L   I+P +     +C  E  
Sbjct: 590 IQQVQENKLRANFLAVGCLDNV-VRLLSIEKEKYFNQ--LSTFILPNNSSAQDICITEMS 646

Query: 540 --------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHV 591
                    + +L   L +G LL  +++  TG LT+     LG + + +     K    +
Sbjct: 647 ELGNDKERKLLFLNIGLNNGVLLRSVVDPITGTLTNHYSKYLGAKNVKICPVHVKKNAAL 706

Query: 592 FAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPD 631
               ++  + Y    K +YS +N   + +   F+S    D
Sbjct: 707 LVLCEKTYLCYVHQGKYIYSPLNYDILEYASSFHSEQCSD 746



 Score = 70.1 bits (170), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 82/364 (22%), Positives = 151/364 (41%), Gaps = 47/364 (12%)

Query: 694  VRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC--VGTAYVLPEENEPTKGRI 751
            ++++     + I    L+  E   S+ +C         +C  VGT   L  +N       
Sbjct: 884  IKIIHPVNLQTIDKISLEMEEAALSVCACELE----ALHCLIVGTTTSLSLKNRSAPAAA 939

Query: 752  LVFIVED--GKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSE 809
            L     D   KL L+     +   +    FNG+LLA+I  K+++Y       G ++L  +
Sbjct: 940  LRVYTYDINYKLNLLHITPVEDQPFCFCPFNGRLLASIGNKLRIYAL-----GKKKLLKK 994

Query: 810  CGHHGHILALY-VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILD 868
            C +     A+  ++  GD I   D+ +S+ +  Y      +   + D    W++  EILD
Sbjct: 995  CEYKDIPEAIISIKVSGDRIFASDIRESVLIFFYDANMNTLRLISDDIIPRWITCSEILD 1054

Query: 869  DDIYLGAENNFNLFTVRKNSEGATDEERG-------------------RLEVVGEYHLGE 909
                + A+   ++F +R   E A  EE G                   RLE +  +H+GE
Sbjct: 1055 HHTIMAADKFDSVFVLRVPEE-AKQEEYGISNKCWYGGEIMAGSNKNRRLEHIMSFHVGE 1113

Query: 910  FVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFL-EKLQTNLRKVI 968
             V   +   L    P S       +I+ T+ G IG      +++ L L + L+  LR   
Sbjct: 1114 IVTSLQKVKLS---PTSS----ECIIYSTIMGTIGAFIPYDNKEELELTQHLEIILRTEN 1166

Query: 969  KGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKR 1028
              + G  H  +RS+ +  + V     +DGDL E F  L      +++  +  + +++ ++
Sbjct: 1167 PPLCGREHIFFRSYYHPVQHV-----IDGDLCEQFSSLPYDVQRKVAADLERTPDDILRK 1221

Query: 1029 VEEL 1032
            +E++
Sbjct: 1222 LEDI 1225


>gi|164658491|ref|XP_001730371.1| hypothetical protein MGL_2753 [Malassezia globosa CBS 7966]
 gi|159104266|gb|EDP43157.1| hypothetical protein MGL_2753 [Malassezia globosa CBS 7966]
          Length = 919

 Score =  123 bits (308), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 190/827 (22%), Positives = 336/827 (40%), Gaps = 105/827 (12%)

Query: 215  RYLLGDHAGLLHLLVITHEKEKVTGLKIELL--GETSIAST---ISYLDNAVVYIGSSYG 269
            R LL    G +++L ++        + + LL    +S+A++   +S L    V++ S+ G
Sbjct: 130  RVLLSTTRGSMYILTVSSSTNAWETIHMHLLRLQMSSVATSPQGLSSLGEDFVFLASTTG 189

Query: 270  DSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRI 329
            DS L  ++  P     Y     R+ +  PIVD  V+D       +++T SGA    SLR 
Sbjct: 190  DSVLWHVDSSPSLHEVY-----RWPSTAPIVDM-VLD-NSSVMDRIITASGAGPTSSLR- 241

Query: 330  VRNGIGINEQASVELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEI 389
                  I  +A      IK           P D   V +  S   ++  N    ++  E 
Sbjct: 242  -----AIMYKALTSTLFIK-----------PMDLVSVYALESALVLVHRNSASIVQLNEH 285

Query: 390  EGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNE------WKSPPGYSVNVAT 443
            E     TQ    HDA   +++     +   + +TS+++R        W +P   S     
Sbjct: 286  E---DPTQISIVHDA---RVLHAGMLAGLYLVATSKDIRTYGRGKTVWTAPDEISA-CHV 338

Query: 444  ANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMW 503
             +   V++      L+ L       + +   +    ISCL     G     S I A G W
Sbjct: 339  LDDKHVVIGLVTPQLLVLHHDHHNWSILHSLRTHAPISCL----FG---CASAIVA-GFW 390

Query: 504  TDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAF--EGISYLLCALGDGHLLNFLLNMK 561
             D +V+ +  P +      H+    +P S+L  AF   G ++ +    DGH+  +     
Sbjct: 391  -DRTVQKYVWPSMEG-DPIHIACTSLPTSILQHAFTPHGENHTVVGYADGHVDVY----- 443

Query: 562  TGELTDRKKV----SLGTQPITLRT-----FSSKNTTHVFAASDRPTVIYSSNKKLLYSN 612
                 D+ K+     +GT+PI L T         NT  + A+      +Y  +     S 
Sbjct: 444  ----NDQDKIVHTCQVGTEPIHLTTAPLSMMELPNTHAIVASGSYTCFLYPQDSAFRASI 499

Query: 613  VNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPL-GEHPRRICHQEQ 671
              ++ V      ++++   ++  A +  L +  I  +  L I+S+ L G+ P  I     
Sbjct: 500  WLIEHVRAAACVDASSQALTMVCATQFGLEVHAIHTLHHLDIQSLSLKGQQPTSIA---S 556

Query: 672  SRTFAICSLKNQSCAEESE--MHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSN 729
               + + +       E  E  +  +R++     + ++++ L  +E   S  +C     + 
Sbjct: 557  FGPYIVLTTWPMHAWEHHERGLSTIRVMTKAGLKQVTSHSLHAYERPNS--ACVKKFYNT 614

Query: 730  VYYCVGTAYVLPEENEPTKGRILVFIVED--GKLQLIAEKETKGAVYSLNAFNGK-LLAA 786
             Y  VGT + +   +  + GR++ +       KLQLI   +  G VY +    G  L+AA
Sbjct: 615  PYIVVGTGFYVGGRDSTSSGRLIGYTWSTPLSKLQLIWSCDVPGHVYGVTDVPGSYLVAA 674

Query: 787  INQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEE 846
            ++ ++  Y      DG  +L    G     LA  + ++G  +VVGD M S+++L     +
Sbjct: 675  VDAQVHTYALC---DGKLKLCDRWG--CAFLATCLTSQGSTVVVGDAMHSLTVLQVD-PK 728

Query: 847  GAIEERARDYNANWMSAVEIL--DDDIYLGAENNFNLFTV-RKNSEGATDEERGRLEVVG 903
            G + E ARD +  W +AV +L  +   Y+G +   N+F   R +    TD          
Sbjct: 729  GQLHEIARDVDPYWTTAVGVLSTEQQQYIGTDIAMNMFVAERVHQTDGTDPWS------- 781

Query: 904  EYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTN 963
              H+      F HG +V  +     G+I   +FGT +G +G +  +P E    L  +Q  
Sbjct: 782  --HVMHRATAFHHGDMVNTIQSVADGRI---LFGTASGSVGTLMYVPDETASMLWCVQEA 836

Query: 964  LRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDL-SRT 1009
            L       G L+ E WR+   + +    ++ LD +L+ +F    SRT
Sbjct: 837  LSHQRHAHGDLSWESWRTLRTDMQICAPRHVLDAELLTTFFTCESRT 883


>gi|68075683|ref|XP_679761.1| splicing factor 3b, subunit 3, 130kD [Plasmodium berghei strain
           ANKA]
 gi|56500578|emb|CAH95367.1| splicing factor 3b, subunit 3, 130kD, putative [Plasmodium berghei]
          Length = 1216

 Score =  123 bits (308), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 157/699 (22%), Positives = 289/699 (41%), Gaps = 84/699 (12%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
           I+G I +LE+FR  G  +D++ I ++  +  +L++D E ++ I R   +   + G R   
Sbjct: 58  IFGIIRSLEIFRLMGSNKDYIAIGSDSGRLVILKYDDEKNDFI-RVHCETYGKSGIRRII 116

Query: 65  NGQIGIIDPDCRLIGLHLYDG--LFKVIPFDNKGQLKEAFNIRLEELQVLD---IKFLYG 119
            G+   IDP  R + L   +      ++  DNK  L  +  +   +   +    +    G
Sbjct: 117 PGEYIAIDPKGRALMLCAIEKQKFVYILNRDNKENLTISSPLEAHKSHSICHAVVGLNVG 176

Query: 120 CAKPTIVVLYQDNK--DARHVKTYEVAL----KDKDFVEGPWSQNN--------LDNGAD 165
              P  V + Q+ +  D + + T E  +    K   F E     N+        +D  A 
Sbjct: 177 FENPMFVSIEQNYEALDKQVLNTNEQIMEYPKKGLCFWEMDLGLNHVIKKHTIPIDITAH 236

Query: 166 LLIPVP-----PPLCGVLIIGEETIVY---------CSANAFKAIPIRPSITKAYGRVDA 211
           LLIP+P     P   G+++  E  +VY         CS      +    +I+     +  
Sbjct: 237 LLIPLPGGQQGPS--GLIVCCENYLVYKKIDHDDIYCSYPRRLEVGEEKNISIVCWTIHR 294

Query: 212 DGSRY--LLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYG 269
             + +  L+    G L+ + + HE   V  +  +      IA++I  L +  +++ + +G
Sbjct: 295 IKTFFFILIQSEYGDLYKIEVNHEDGIVKEIICKYFDTVPIANSICVLKSGALFVAAEFG 354

Query: 270 DSQLIKLN-------------LQPDAKGSYVEV-LERYVNLG--------PIVDFCVVDL 307
           +    + +               P  K + +    ++  NL         PIVD  ++D 
Sbjct: 355 NHFFYQFSGIGNDSNESMCTSNHPSGKNAIIAFKTQKLKNLYLVDQIYSLPIVDMKILDA 414

Query: 308 ERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTDDPFDTFLV 366
           +     Q+    G     SLRI+++G+ I E A+ EL G  + +W+++      +D +++
Sbjct: 415 KNSNIPQIYALCGRGPRSSLRILQHGLSIEELANNELPGKPRYIWTIKKDNSSEYDGYII 474

Query: 367 VSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRE 426
           VSF   T IL +   + +EE       +   T+  +    N  +QV    +R ++    +
Sbjct: 475 VSFEGNTLILEIG--ETVEEVYDSLLLTNVTTIHINLLYDNSFIQVYDTGIRHING---K 529

Query: 427 LRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDG-ILTEVKHAQLEYEISCLDI 485
           +  EW  P    +N AT+N SQ++++  GG L+Y EI +   LTE+    +  EI CL I
Sbjct: 530 IVQEWVPPKNKQINAATSNGSQIVISLSGGELIYFEIDESHTLTEIFRKNINVEILCLSI 589

Query: 486 NPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVL---LCAFE--- 539
             I +N   +   AVG   ++ VR+ S+        + L   I+P +     +C  E   
Sbjct: 590 QQIQQNKLRASFLAVGCLDNV-VRLLSIEKDQYF--KQLSTYILPNNSSPQDICISEMKE 646

Query: 540 -------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
                   I YL   L  G LL  +++   G L++     LG + + +          + 
Sbjct: 647 LGSQKEHTILYLNIGLNTGVLLRSVIDPICGTLSNHYSKYLGAKSVKICHVQVNKNPALL 706

Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPD 631
             S++  + Y    K +YS +N   + +   F S    D
Sbjct: 707 VLSEKTYLCYVYQGKYIYSPLNYDVLEYASSFYSEQCSD 745



 Score = 67.8 bits (164), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 75/360 (20%), Positives = 149/360 (41%), Gaps = 40/360 (11%)

Query: 694  VRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKG-RIL 752
            +++++    + I    L+  E   S+ +C       +     T   L   N P+   R+ 
Sbjct: 871  IKIINPINLQVIDKISLELEEAALSVCACELEALHCLIVGTTTNMTLKSRNVPSASLRVY 930

Query: 753  VFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGH 812
             + + + KL L+     +   Y    FNGK++ ++  K+++Y       G ++L  +C +
Sbjct: 931  TYDI-NYKLNLLHITPIEDQPYCFCPFNGKVIVSVGNKLRIYAL-----GKKKLLKKCEY 984

Query: 813  HGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIY 872
                 A+      + I   D+ +S+ +  Y   +  I   + D    W++  EILD    
Sbjct: 985  KDIPEAIVSIKVSNRIFASDIRESVLIFFYDSNQNVIRLISDDIIPRWITCSEILDHHTI 1044

Query: 873  LGAENNFNLFTVRKNSEGATDEERG-------------------RLEVVGEYHLGEFVNR 913
            + A+   ++F +R   E A  EE G                   ++E +  +H+GE V  
Sbjct: 1045 IAADKFDSVFILRVPEE-AKQEEYGIANKCWYGGEVINSSTKNRKMEHIMSFHIGEIVTS 1103

Query: 914  FRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFL-EKLQTNLRKVIKGVG 972
             +   L    P S       +I+ T+ G IG      +++ L L + L+  LR     + 
Sbjct: 1104 LQKVKLS---PVSS----ECIIYSTIMGTIGAFIPYDNKEELELTQHLEIILRTEKHALC 1156

Query: 973  GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            G  H  +RS+ +       ++ +DGDL E F  L      +I   +  + +E+ +++E++
Sbjct: 1157 GREHIFFRSYYHP-----VQHVIDGDLCEQFSSLPFEVQRKIGSDLEKTPDEILRKLEDI 1211


>gi|167380951|ref|XP_001733297.1| DNA repair protein xp-E [Entamoeba dispar SAW760]
 gi|165902459|gb|EDR28278.1| DNA repair protein xp-E, putative [Entamoeba dispar SAW760]
          Length = 349

 Score =  122 bits (306), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 74/235 (31%), Positives = 133/235 (56%), Gaps = 5/235 (2%)

Query: 805  ELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIY---KHEEGAIEERARDYNANWM 861
            +LQ +   +  ++ LYV+T G+ I+VGDLMKSIS+  +    + +  + E +RD+ A++ 
Sbjct: 114  KLQEKGSCNVKLIGLYVKTMGNKILVGDLMKSISVYSFDNNGNNKNCLNEVSRDFYASYT 173

Query: 862  SAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVM 921
            +A+E +D++ YL +++N NL     NS G  + ER RL      H+GE +N    GS+  
Sbjct: 174  TAIEFVDENCYLSSDSNSNLLVFNTNSTG-NESERFRLNNCAHIHVGECINVMCKGSIAP 232

Query: 922  RLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKG-VGGLNHEQWR 980
                 +  Q   ++FG V G IG I  +P+E Y  L K+Q  +   +KG V     ++W+
Sbjct: 233  THSTYETIQKKCILFGGVTGYIGGICEIPNEIYDILIKVQNQILLQMKGIVECTTPDEWK 292

Query: 981  SFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRL 1035
               ++ K + + N +DG+++ES+L++S+ +  EI+    V+ E++   +E +  L
Sbjct: 293  KVIDDWKRMPSSNIIDGNIVESYLEMSKEKQCEIAHLSGVNEEQINDIIENMISL 347


>gi|449296290|gb|EMC92310.1| hypothetical protein BAUCODRAFT_151722 [Baudoinia compniacensis UAMH
            10762]
          Length = 1224

 Score =  122 bits (305), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 164/779 (21%), Positives = 330/779 (42%), Gaps = 85/779 (10%)

Query: 296  LGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIK--GMWSL 353
            L P++   V +L  +   Q+    G      L+ +R+G+ INE  S  L  ++   +WSL
Sbjct: 433  LNPLMRTKVDNLTGEDAPQIYAIQGTGNSSVLKTMRHGLEINEIVSSPLGNLQYDNLWSL 492

Query: 354  RSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVT 413
            +    D + ++L++S     + + +++ DE+E  E   F +   T+  +      LVQV 
Sbjct: 493  KHRAVDDYHSYLLLSSGYGDKTIVLSIGDEVETMENSPFLTNRATITANQMGDATLVQVH 552

Query: 414  SGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTEVK 472
            +  +  +S       NEW +P   ++  A++N  Q+LL      L +  +G DG+L +++
Sbjct: 553  ARGI--LSVLESGALNEWPAPAHRTIVAASSNDRQLLLGLSSSELAFFFMGDDGVLNQLE 610

Query: 473  H-AQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLP-DLNLITKEHLGGEIIP 530
               ++  +I+ L +         ++ A VG   D ++R++S+  +  L  +       +P
Sbjct: 611  EMPEMSGKITALSVGQTPRGQQQAKYAVVGC-DDCTIRVYSIELETALEPRSVQALSAVP 669

Query: 531  RSVLLCAFEG------ISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFS 584
             S+ +           ++Y+   L  G  L  +++  TGEL + +   LG +P  +    
Sbjct: 670  TSIEVVEMLDPSSNVLVNYVHIGLQSGLYLRAVIDDVTGELGEVRTRFLGARPTRIFPVE 729

Query: 585  SKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGEL--T 642
                  + A+S RP + Y++      + +   +V    PF +       A+  +  L  T
Sbjct: 730  IAEEQGIVASSSRPWLGYNNKGTYTLTPLVTGQVEAARPFVTEHLRGFCAVQGQSLLIFT 789

Query: 643  IGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAE--------------- 687
            + T D   +L    + L   PR +     S  + +      + A+               
Sbjct: 790  LPTTDG--RLSSSEVGLKYTPRAMTRHPYSPVWYVAQSDGNTLAQGTKDQLLSADGAMNG 847

Query: 688  ------------ESEMHFVRLLD--DQTFEFISTYPLDTFEYGCSILSCSF----SDDSN 729
                         S  H+   +   D  F   +T+ ++ F    + L C+     S +  
Sbjct: 848  VDEDVSKQLGLSRSAGHWASCIQAVDPIFAKAATHTIE-FTDNEAALCCAAVPFESKNWE 906

Query: 730  VYYCVGTAYVL----PEENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVYSLNAFNGKL 783
            VY  VGT   L      + +  +G + ++ ++EDG+ ++L+ +      +Y+++AF G+L
Sbjct: 907  VYLAVGTGQHLRPGVSAKGQAPRGYVHIYKLLEDGRNMELVHKTPFDAPIYAVHAFQGRL 966

Query: 784  LAAINQKIQLYKWMLRDDGTRELQSECGHHGHILA---LYVQTRGDFIVVGDLMKSISLL 840
               +   + LY     D G + L  +    G ++A     + +RG+ ++  D+ +SI+ +
Sbjct: 967  ALGVGADLFLY-----DVGLKSLLRKS--RGTVVANTITSIDSRGNRLICADVSESITYV 1019

Query: 841  IYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR--KNSEGATDEERGR 898
            ++K +   +     D    W +   ++D +   G +   NL+ VR  + +   +DEE   
Sbjct: 1020 VFKPKHNRMIGFVDDVIQRWTTTAAMIDYETSAGGDKFGNLWVVRCPEQASKESDEEGLG 1079

Query: 899  LEVVGEY-HLG------EFVNRFRHGSLVMRLPDSDV--GQIPTVIFGTVNGVIGVIAS- 948
              +V E  +LG      E  + F    + M +  + +  G    + +  + G +G++   
Sbjct: 1080 GYIVNERSYLGGAPYRLELRSHFFTQDIPMSIQRTALVAGGQEVLFWSGLQGTLGMLVPF 1139

Query: 949  LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLS 1007
            +  E   F  +L+T +R     + G +H  +RS+      V  K  +DGDL E F+ LS
Sbjct: 1140 VSREDVEFFTQLETLIRAEEPPLAGRDHLMYRSY-----YVPVKGVIDGDLCERFMKLS 1193


>gi|237837399|ref|XP_002367997.1| splicing factor 3B subunit 3, putative [Toxoplasma gondii ME49]
 gi|211965661|gb|EEB00857.1| splicing factor 3B subunit 3, putative [Toxoplasma gondii ME49]
 gi|221488748|gb|EEE26962.1| splicing factor 3B subunit, putative [Toxoplasma gondii GT1]
 gi|221509241|gb|EEE34810.1| splicing factor 3B subunit, putative [Toxoplasma gondii VEG]
          Length = 1233

 Score =  122 bits (305), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 117/512 (22%), Positives = 211/512 (41%), Gaps = 66/512 (12%)

Query: 223 GLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGD------------ 270
           G ++ + I+HE+  V  +         +A+ +  L +  +++ S +G+            
Sbjct: 306 GDIYKVEISHEEGVVREVVCRYFDTVPVANALCVLKSGYLFVASEFGNHLFYQFTGIGSD 365

Query: 271 --------------SQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVV 316
                           +I    +P    + V+ L+   +L PI D  V+D +  G  QV 
Sbjct: 366 ASDPRCSSTHPLGREAIIAFKPRPLRNLALVDELQ---SLSPITDLKVLDAQGTGAPQVY 422

Query: 317 TCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTDDPFDTFLVVSFISETRI 375
              G     +LRI+++G+G+ E A  EL G  + +W+ + S  + FD ++ V+F   +  
Sbjct: 423 VLCGKGPRSTLRILQHGLGVEEMADNELPGRARAVWTTKLSHQNAFDGYIFVAFDGSS-- 480

Query: 376 LAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPP 435
           L + + D +EE     F +   +L       +  +QV    +R +  + R   NEW++P 
Sbjct: 481 LVLQIGDTVEEVTDSAFLTNVSSLLVALMYDDSFIQVHETGIRHILKSKRV--NEWRAPG 538

Query: 436 GYSVNVATANASQVLLATGGGHLVYLEIGDG-ILTEVKHAQLEYEISCLDINPIGENPSY 494
           G  +  A AN  Q++++  GG LV  E+ D   L E     +  E +C+ +  I +    
Sbjct: 539 GRRIKAAAANERQLVISLAGGELVLFEVDDAHTLVETARRNINVESTCMSMQAIPKGRLR 598

Query: 495 SQIAAVGMWTDISVRIFSL-PDLNL--ITKEHLGGEIIPRSVLLCAFEGIS--------- 542
           +   AVG   D  VRI SL  D NL  ++ + L  +  P SV L    G+          
Sbjct: 599 ASFLAVG-GLDNMVRILSLEKDRNLRQLSTQLLPNDATPESVCLATLTGLGANSTDAGKS 657

Query: 543 ------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASD 596
                 YL   L  G ++  +++   G L D++   LG + +     + +    + A S+
Sbjct: 658 QDNGVLYLHVGLNTGVMIRSVVDPVLGTLLDQRSRFLGGRAVRFHAVTLQGQPAILALSE 717

Query: 597 RPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRS 656
           +  + Y+   KL    +N   +  +  F S    D       G L I     + +   ++
Sbjct: 718 KSWLCYTFQHKLHCIPLNYDPLECVASFCSEQCTDGFVAIAGGSLRIFRCQRLGETFGQT 777

Query: 657 I-PLGEHPRRICHQEQSRTFAICSLKNQSCAE 687
           + PL   PR           A+ +L + S AE
Sbjct: 778 VLPLSFTPR-----------AMAALPHPSAAE 798



 Score = 90.9 bits (224), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 92/364 (25%), Positives = 158/364 (43%), Gaps = 47/364 (12%)

Query: 694  VRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNV-YYCVGT--AYVLPEENEPTKGR 750
            +R+++      I    L+T E   + LSC F +   +    VGT  A  L  +  P    
Sbjct: 887  IRIVNPLMAMTIDKVSLETDE---AALSCCFCEMEGLPLLVVGTVTAMTLKPKKVP-HAS 942

Query: 751  ILVFIVEDG-KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSE 809
            I VF  +D   L L+     +    +L AF G LLA +  K++LY       G + L  +
Sbjct: 943  IKVFSYDDKFSLSLVHSTPVEDYPMALTAFRGMLLAGVGHKLRLYAL-----GRKRLLKK 997

Query: 810  CGHHGHILAL-YVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILD 868
            C +      + +++  GD + VGD+ +S+ ++ Y+  E      A D    W++  E+LD
Sbjct: 998  CEYKNLPCGVAFIRVAGDRLFVGDVRESVHVMRYRLSENLFYVLADDVVPRWLTKGEVLD 1057

Query: 869  DDIYLGAENNFNLFTVRKNSEGATDE--------ERG----------RLEVVGEYHLGEF 910
               ++ A+   ++F  R  SE   DE         RG          +L+ +  +H+GE 
Sbjct: 1058 YHTFVAADKFDSVFICRVPSEAKEDELGDTTGLRLRGDTTYLTDKCFKLQSLLHFHIGEI 1117

Query: 911  VNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASL--PHEQYLFLEKLQTNLRKVI 968
            V      +L     +S       +++GT+ G IG  +     HE  LF   L+  +R   
Sbjct: 1118 VTALERATLTSAASES-------IVYGTIMGSIGSFSPFLTKHELDLFTH-LEMVMRSEK 1169

Query: 969  KGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKR 1028
              + G  H  +RS+ +      AKN +DGDL ES+  L       I++    +  ++ K 
Sbjct: 1170 PPLAGREHIMFRSYYHP-----AKNTVDGDLCESYALLPYEDQKRIAQDFEKTPADILKH 1224

Query: 1029 VEEL 1032
            +E++
Sbjct: 1225 LEDI 1228


>gi|297829750|ref|XP_002882757.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297328597|gb|EFH59016.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 1384

 Score =  121 bits (304), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 128/549 (23%), Positives = 241/549 (43%), Gaps = 96/549 (17%)

Query: 295  NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINE--QASVELQGIKGMWS 352
            N+ PI+DF V+D + + + Q+  C G  ++GSLRI+R+GI + +  + +   QGI G W+
Sbjct: 465  NIAPILDFSVMDDQNEKRDQIFACCGVTREGSLRIIRSGINVEKLLKTAPVYQGITGTWT 524

Query: 353  LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
            ++    D + +FLV+SF+ ETR+L++ L  + + T+  GF S   TL C       LVQ+
Sbjct: 525  VKMKLTDVYHSFLVLSFVEETRVLSVGLSFK-DVTDSVGFQSDVCTLACGLVADGLLVQI 583

Query: 413  TSGSVRLVSST----------SRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLE 462
               ++RL   T          S    + W  P   S+++     + ++++T     + + 
Sbjct: 584  HQDAIRLCMPTMDAHSDGIPVSSPFFSSW-FPDNVSISLGAVGQNLIVVSTSNPCFLSIL 642

Query: 463  IGDGI------LTEVKHAQLEYEISCLDI--------NPIGENPSYSQIAAV--GMWTDI 506
                +      + E++   L+YE+SC+ +             +P  S  AA+  GM    
Sbjct: 643  GVKSVSSQCCEIYEIQRVTLQYEVSCISVPQKHIGKKRSCASSPDNSCKAAIPSGMEQGY 702

Query: 507  ---------SVRIFSLPD---------LNLITKEHLGGEI----IPRSVLLCAFEGISYL 544
                     SV + S  +           L++  +  G +    IP+ V L   + + Y+
Sbjct: 703  SFLIGTHKPSVEVLSFSEDGVGVRVLASGLVSLTNTMGAVISGCIPQDVRLVLVDQL-YV 761

Query: 545  LCALGDGHLLNF---LLNMKTG--------------ELTDRKKVSL------------GT 575
            L  L +G LL F   L +  +G              ++   KK +L            G 
Sbjct: 762  LSGLRNGMLLRFEWPLFSNASGLNCPDYFSYCKEEMDIVVGKKDNLPINLLLIATRRIGI 821

Query: 576  QPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAI 635
             P+ L  FS    + + A SDRP ++ ++ + L Y++++ +  +H  P  S+  P  +  
Sbjct: 822  TPVFLVPFSDSLDSDIIALSDRPWLLQTARQSLSYTSISFQPSTHATPVCSSECPQGILF 881

Query: 636  AKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLK-NQSCAEESEMHFV 694
              E  L +  +   ++ + +   LG  PR++ +  +S+   +       +C  +     +
Sbjct: 882  VSENCLHLVEMVHSKRRNAQKFHLGGTPRKVIYHSESKLLIVMRTDLYDTCTSD-----I 936

Query: 695  RLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAY-----VLPE-ENEPTK 748
              +D  +   +S+Y L   E G S+      ++  +   VGT+      +LP  E E TK
Sbjct: 937  CCVDPLSGSVLSSYKLKPGETGKSMELVRVGNEHVLV--VGTSLSSGPAILPSGEAESTK 994

Query: 749  GRILVFIVE 757
            GR+++  +E
Sbjct: 995  GRLIILCLE 1003



 Score = 53.5 bits (127), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 40/146 (27%), Positives = 67/146 (45%), Gaps = 18/146 (12%)

Query: 882  FTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQI---------- 931
            F+ R + E ++ E    L     Y++GE     + G  + +LP  DV +           
Sbjct: 1204 FSPRDDPEYSSPESNLNLNCA--YYMGEIAMAIKKGCNIYKLPADDVLRSYGLSKSIDTA 1261

Query: 932  -PTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNL--RKVIKGVGGLNHEQWRSFNNEKKT 988
              T+I GT+ G I V A +  E+Y  LE +Q  L    +   V G +H ++R   N  + 
Sbjct: 1262 DDTIIAGTLLGSIFVFAPISSEEYELLEAVQAKLGIHPLTAPVLGNDHNEFRGRENPSQ- 1320

Query: 989  VDAKNFLDGDLIESFLDLSRTRMDEI 1014
              A   LDGD++  FL+L+  + + +
Sbjct: 1321 --ATKILDGDMLAQFLELTNRQQESV 1344


>gi|150863836|ref|XP_001382447.2| hypothetical protein PICST_54680 [Scheffersomyces stipitis CBS 6054]
 gi|149385092|gb|ABN64418.2| predicted protein [Scheffersomyces stipitis CBS 6054]
          Length = 1228

 Score =  120 bits (301), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 245/1184 (20%), Positives = 474/1184 (40%), Gaps = 188/1184 (15%)

Query: 8    GRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDR-IGRPTDNG 66
            G I  +E  R  G  +D   +  E  K   L++D +    +       +    GR  + G
Sbjct: 68   GVIHKIEKIRIVGTQKDLAVVVGESGKVVFLEFDVDLHRFVPVLQEPYAKTGFGR-VNPG 126

Query: 67   QIGIIDPDCRLIGLHLYDG---LFKVIPFDNKGQLK-----EAFNIRLEELQV--LDIKF 116
            +   +DP  R I L   +    +FKV   D++G+++     EA +     L V  LD +F
Sbjct: 127  EYLAVDPQSRCIFLGAIERNKLIFKV-ETDSQGKVELSSPLEAHSKHTLTLSVVALDTQF 185

Query: 117  ---LYGCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEG------PWSQNNLDNGADLL 167
               ++   +      + D K      +  + L   +  +G        S N + + A  L
Sbjct: 186  SNPVFAAIECDYSNYHSDGKVQFDADSSPLLLNYYELDQGLNHIVKKKSTNTIPSSATHL 245

Query: 168  IPVPPPLCGVLIIGEETIVYCSANA-----FKAIPIRPS------ITKAYGRVDADGSRY 216
            IP+P  + GV +  +  I+Y + +      +  +P+R        ++    ++  +    
Sbjct: 246  IPLPSHVGGVFVCCKNYIIYDNLHKNLERLYLPLPLRKDSEYTVVVSHVVHKLKKNNFFV 305

Query: 217  LLGDHAGLLHLLVITH--EKEKVTGLKIELLGETSIASTISYL-----------DNAVVY 263
            LL    G L  + + +  +KE +  ++I       ++S+++ L           ++ + Y
Sbjct: 306  LLQSSMGDLFKVTVEYNSDKELIEDIQIGYFDTIPVSSSLNILKSGFLFANVLNNDKLYY 365

Query: 264  IGSSYGDS-QLIKLNLQPDAKG----------------SYVEVLERYVNLGPIVDFCVVD 306
                 GD  + I+L   PD                     + ++E + +L PI D  +V+
Sbjct: 366  QFEKLGDDDENIQLKASPDISSIDEEDRSNRTFTVKALDNLALVEIFTSLSPITDAGIVE 425

Query: 307  LERQGQG----QVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSLRSSTDDPF 361
                G      Q++T S       L+ + +GI  +   S  L  I  G+ + +   D   
Sbjct: 426  SISSGTADSLQQMITASS---HSHLKSLVHGIQTSTLVSSPLPIIPTGVLTTKLFADSRS 482

Query: 362  DTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVS 421
            D +LV+S    +R L +++ + +EE E   F +   TL       + +VQ+ +  +R V 
Sbjct: 483  DEYLVISSTVASRTLVLSIGEVVEEVENSQFVNDQPTLAVQQVGTSSVVQIYTNGIRHVK 542

Query: 422  STSRELR--------NEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG--DGILTEV 471
             T  E +         +W  P G ++  A+ +  QV++A     + Y E+   D  L E 
Sbjct: 543  HTRTEDKEQSISRKITDWYPPAGITIVNASTHREQVIIALSNAEICYFEVDATDDQLIEY 602

Query: 472  KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPR 531
            +  ++E   S   I    E  +   + AV   +D ++++ SL   N +    L   +   
Sbjct: 603  QD-RVEMSNSITSIAICEETANKKNLFAVVGCSDETIQVLSLQPHNCLETLSLQA-LSAN 660

Query: 532  SVLLCAFEGISYLLCALG--DGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKN-T 588
            S  L   +  +  +  +G  +G  +   +   +G L+D +   LG++P+TL      N +
Sbjct: 661  STSLSMLQNDNTTMVHIGMDNGLYVRTSIEEISGNLSDTRIKYLGSKPVTLSVTKLPNGS 720

Query: 589  THVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNS--------AAFPDSLAIAKEGE 640
              + A S RP + Y +  +   + +   ++     F+S         A  D+  I     
Sbjct: 721  KAILAISSRPWICYYNRSEFKVTPLLGVKILKGASFSSEDIGGEGIVALSDNNLIV---- 776

Query: 641  LTIGTID---DI-QKLHIRSIPLGEHPRRICHQEQSRT------------------FAIC 678
             TIG  D   DI Q ++I  I L   PR++   +  ++                  F+  
Sbjct: 777  FTIGKEDVEFDINQDVNIEKIRLRYTPRKLIIDDDGKSSKVNYIYALQSEYGTKSPFSPS 836

Query: 679  SLKNQSCAE-------------ESEM----HFVRLLDDQTFEFISTYPLDTFEYGCSILS 721
            +L ++   E             E+E+      ++++D +    I T    + E   S+  
Sbjct: 837  NLNSEDDPESEIDQDYYDAFGYETEVDKWASCIQVVDFENSSIIQTVEFSSNESAISMAK 896

Query: 722  CSF--SDDSNVYYCVGTAYVLPE--ENEPTKGRILVFIVEDG-------KLQLIAEKETK 770
              F  S   N+ + +       +  +N   K  +  F ++         +L+ + + E  
Sbjct: 897  LHFVSSGKGNMEHLIIGVTTDRKFLKNSVGKSYLFTFKIQKNTRKSNKKRLEYLHKTEID 956

Query: 771  GAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTREL----QSECGHHGHILALYVQTRGD 826
             +   +  FNG+LL  + + ++LY     D G R+L     +   +   I+ L V T G+
Sbjct: 957  CSPTVMIPFNGRLLVGMGKYLRLY-----DIGHRQLLRKSSTNIDYISSIVDL-VHTGGE 1010

Query: 827  FIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR- 885
             I  GD   SI    +   E      A D     ++AV  LD D  +G +   N+F  R 
Sbjct: 1011 RIAFGDSHSSIVFAKFDSAENRFVPFADDIMKRQITAVAALDYDTVIGGDKFGNVFVSRV 1070

Query: 886  ---------------KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQ 930
                           K  E   +    R + + E+ L +    F  GS+        +G 
Sbjct: 1071 PDSVSKKSDEDWSLLKVQESYLNASPSRTKNLCEFFLSDTPTSFTKGSMT-------IGG 1123

Query: 931  IPTVIFGTVNGVIGVIASLPHEQYL-FLEKLQTNLRKVIK-GVGGLNHEQWRSFNNEKKT 988
               +I+  + G +G++  L  +  + F+  L+ +LR+ +   + G++H ++RS+ N    
Sbjct: 1124 HDGIIYTGIQGTVGLLLPLSTKSEVQFINSLEQSLRQQMGFNLLGMDHLKFRSYYNP--- 1180

Query: 989  VDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
               KN +DGDLIE + +LS++   +I++ +N + +E+ K++ +L
Sbjct: 1181 --VKNVIDGDLIEKYYELSQSLKIKIARELNRTPKEVEKKISDL 1222


>gi|407416557|gb|EKF37684.1| damage-specific DNA binding protein, putative [Trypanosoma cruzi
            marinkellei]
          Length = 1276

 Score =  120 bits (300), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 195/878 (22%), Positives = 339/878 (38%), Gaps = 155/878 (17%)

Query: 236  KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVN 295
            KV  + + +    +I  +++ L      IGS   D+  +K        G    +LE   N
Sbjct: 376  KVPVVNLAMSSLRTIPDSLAILSGGHCVIGSRMTDTLWLKW-----CTGESGVLLE---N 427

Query: 296  LGPIVDFCV-VDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLR 354
             GP+ D  + VD  R G   V+  +G    G + + R+ +     A+++   +  +  L 
Sbjct: 428  CGPVFDLTIAVDGPRMG---VIASTGVGMGGGVSLQRSAVNFRHDAAID--KLLNVTRLF 482

Query: 355  SSTDDPFDTFLVVSFISETRILA------MNLEDELEETEIEGFCSQTQTLFCHDAIYNQ 408
            ++ D       ++SF   +R+        M    EL ET  +   S+      + +  N 
Sbjct: 483  AAGD-----IFIISFPGYSRVYQFQTTPNMTTIKELSETSFDK--SKETLSLVYLSERNT 535

Query: 409  LVQVTSGSVRLVSSTSRE---LRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD 465
             VQVTS  V +V          +NE      + +  A+ANA   LL       V + +  
Sbjct: 536  FVQVTSVGVNVVKGGKGVYIICKNE------FGIQHASANAKLRLLVFSSSRFVSV-VDL 588

Query: 466  GILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLG 525
                     ++E E+SCL I       S SQ   +G W   +V ++ + D          
Sbjct: 589  NTRNTRASLEVENEVSCLLI-------SSSQGFVIGEWNSGAVCLYEVQD---------- 631

Query: 526  GEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTD-----------RKKVSLG 574
            GEI+ +  + C+    S  +C L   +    ++ +  G + D            ++  + 
Sbjct: 632  GEILLKGRIFCSATSCS--MCILSHPYTPRLVVGLLNGYIADISLEAILMGGGARETLIR 689

Query: 575  TQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLA 634
             QP+ L    S N   V    + P +I   +     + ++  +V+ +C     +   S  
Sbjct: 690  MQPVELFNLESHNA--VLCLGEVPLIIILCDTGFQLTGIDFNDVA-VCAIIECSHISSKY 746

Query: 635  I---AKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQ-------- 683
            I     E  L  G I D++KL+   + L     R+ +      F I   +N+        
Sbjct: 747  IFFSQSENSLAFGNIVDLKKLNSCFVGLKATVTRVKYIAWWNFFVISVRRNEKDQVFLLM 806

Query: 684  ------SCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTA 737
                  S   ++E+  + LL+++   F  T  L     G +        D +    +GT 
Sbjct: 807  GHELANSWVPQNELVSIELLENERCVFFETVVLG----GSNESPDGNEADKDKAILLGTT 862

Query: 738  YVLPEENEPTKGRILVFIVEDGKL-------QLIAEKETKGAVYS---LNAFNGKLLAAI 787
            +  P+E  P   R + + VE GKL       + I  K+ +GA+     +  + G++   I
Sbjct: 863  FAFPDEQLPRSSRFIWYSVEQGKLLSERPQLRQIGSKDVEGALQCCCIVPNYVGRIALGI 922

Query: 788  NQKIQLYKWMLRDDGTRELQSECGHH--GHILALYVQTRGDFIVVGDLMKSISLLIYKHE 845
            N  I LY W   D      ++ C       +L ++ Q    ++V  D   S   +     
Sbjct: 923  NGCIALYSWNAADSVFVAEETICIGTIVTRVLPIF-QGDASYMVAFDARHSCFFIQVDTI 981

Query: 846  EGAIEERARDYNANWM--SAVEILDD--DIYLGAENNFNLFTVRK---------NSEGAT 892
            +G++E  ARD     +   AV  L +  DI  G ++ +N F + +         N+  AT
Sbjct: 982  QGSLEIVARDSEPRGVMDGAVFQLGNIYDICFG-DDYYNFFCLSRGATTLPSASNAISAT 1040

Query: 893  DEERGRLEVVGEYHLGEFVNRFRHGSLV--------MRLPDSDVGQI--PTVIFGTVNGV 942
                G+L+   +YHLG+ V   + GS          + +P + +  I  P V+FGT +G 
Sbjct: 1041 VSS-GKLKTRAQYHLGDMVTAMQLGSFAPCSVTNSAVPIPTTLIPGICGPQVVFGTSHGA 1099

Query: 943  IGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRS--------------------- 981
             G I  + +E YLFL+ L+  +  V+  +GG  H  +R                      
Sbjct: 1100 FGTITPVSNETYLFLKALEVAVSSVVPALGGFEHVTYREVLRAGQERGYSRNASFENVNV 1159

Query: 982  -----FNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEI 1014
                 FN  +K   ++    GDLIESFL    +    I
Sbjct: 1160 VSSEVFNKRRKRYLSRCVCSGDLIESFLTFPHSIQQRI 1197


>gi|453087531|gb|EMF15572.1| splicing factor 3B subunit 3 [Mycosphaerella populorum SO2202]
          Length = 1223

 Score =  120 bits (300), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 176/809 (21%), Positives = 328/809 (40%), Gaps = 96/809 (11%)

Query: 296  LGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGI-----NEQASVELQGIKGM 350
            L P++   V +L  +   Q+    G       + +R+G+ +     N   +V    I   
Sbjct: 433  LHPLMKTRVDNLTGEDAPQIYGIQGKGNRSQFKTIRHGLDVEVLINNSMGNVPYDNI--- 489

Query: 351  WSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLV 410
            W+ +  T D    +L++S       +A ++ D +E+ E   F     T+         LV
Sbjct: 490  WTFKHRTTDEHHRYLLLSSNYGDLTIACSIGDSVEQIENSNFLENRATVHAEQMGDAVLV 549

Query: 411  QVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILT 469
            QV +  +R +    +   NEW +P   +  VA+AN  Q+LL      L +  +G D IL 
Sbjct: 550  QVHARGIRSIYQDGK--LNEWNTPAHRTCVVASANQRQLLLGLSSAELCFFWMGEDEILV 607

Query: 470  EVKH-AQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLP-DLNLITKEHLGGE 527
            +++   ++  +I+ + +    +    ++ A VG   D ++R+ S+  D  L  +      
Sbjct: 608  QLEEMPEMSGKITAISVGSTPKGRQQAKYAVVGC-DDCTIRVLSIELDSPLEARSVQALS 666

Query: 528  IIPRSV----LLCAFEGISYLLCALG--DGHLLNFLLNMKTGELTDRKKVSLGTQPITLR 581
             +P S+    +L    G +  +  +G   G  L  +++  TGEL D +   LGT+   L 
Sbjct: 667  AVPTSLEVVEMLDPASGTTVNVVHIGLQSGLYLRAIIDETTGELGDVRTKFLGTKAPRLC 726

Query: 582  TFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPD--SLAIAKEG 639
                ++   V A S RP V Y+  +   Y+   L     M    S   PD   L   +  
Sbjct: 727  PVQVEDEECVLACSTRPWVGYNHPQTGQYTITPLI-AEKMEAARSFISPDLSGLCAIQGS 785

Query: 640  ELTIGTIDDIQ-KLHIRSIPLGEHPRRIC-HQEQSRTFAICSLKNQ-SCAEESEMHFVRL 696
             L I  +  ++ +L   S+ L   PR +  H      + + S  N  S A   ++    +
Sbjct: 786  SLLIFQLPSVEGRLSHSSVQLNNTPRGMARHPYYQIWYTVQSDGNTLSKATRDQLRGSPI 845

Query: 697  LDDQTFEFISTY------------------PLD------TFEYG------CSILSCSFSD 726
             DD+    +  +                  P++      T E G      C       S 
Sbjct: 846  DDDEEAMALERHLGLPRGNTHWASCIQAIDPINRQAVVSTVEMGDNEAALCCACVAFESR 905

Query: 727  DSNVYYCVGTA-YVLPEENEPTKGRILVF-IVEDG-KLQLIAEKETKGAVYSLNAFNGKL 783
            +  V+  VGT  ++ P       G + +F + EDG KL  + + +    VY+L  F+G+L
Sbjct: 906  NWEVFLAVGTGQHMSPGTGLQAAGYVHIFKLEEDGTKLTFVHKTKFDQPVYALLPFHGRL 965

Query: 784  LAAINQKIQLYKWMLRDDGTRELQSEC-GHHGHILALYVQTRGDFIVVGDLMKSISLLIY 842
               +  ++ +Y     D G + L  +  G       + +++ G  I+ GD+ + ++ ++Y
Sbjct: 966  ALGVGNELFIY-----DIGQKALLRKARGQATPNQIVSLESHGQRIICGDVSEGVTYMVY 1020

Query: 843  KHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDEERG-- 897
            K     +     D    W +   ++D +   G +   NL+ VR   + S+ A +E  G  
Sbjct: 1021 KPGYNRLIPFVDDVVQRWTTGTTMIDYETTAGGDKFGNLWVVRCPEQPSQEADEEGAGGF 1080

Query: 898  -------------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIG 944
                         RL++   Y+  +     +  +LV        G    + +  + G +G
Sbjct: 1081 IMNERSYLGGAPYRLDLRAHYYCQDIPMSLQRTALV-------AGGQEVLFWSGLQGTLG 1133

Query: 945  VIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESF 1003
            ++   +  E   F   L+  LR     + G +H  +RS+      V  K  +DGDL E F
Sbjct: 1134 MLVPFVTREDVEFFTSLEQQLRIEDPPLAGRDHLMYRSY-----YVPVKGVIDGDLCERF 1188

Query: 1004 LDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            + LS     +++  ++ SV+E+ K+V+E+
Sbjct: 1189 MALSYDSKQKVAAEVDRSVKEIEKKVQEM 1217


>gi|11359310|pir||T51288 silencing protein rik1 - fission yeast (Schizosaccharomyces pombe)
 gi|4580723|gb|AAD24488.1|AF136156_1 silencing protein Rik1 [Schizosaccharomyces pombe]
          Length = 916

 Score =  119 bits (299), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 209/944 (22%), Positives = 391/944 (41%), Gaps = 117/944 (12%)

Query: 111  VLDIKFLYGCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPV 170
            + D+  L   + PT+VVL+ D  D  +V  + + L  K          +L  G  L   V
Sbjct: 44   IYDMCILNDSSTPTLVVLHSDGLDC-YVTAFLLDLSSK----------SLGKGIRLFERV 92

Query: 171  PPPLCGV-----LIIGEETIVYCSANA-FKAIPIRPSITKAYGRVDADGSRYLLGDHAGL 224
             P +        L++ E   ++C     F  I    +    +  + A    Y++ D  G 
Sbjct: 93   KPSMIMPFGKRGLLVFESLFIHCMYRGNFVTINGPCTTYMHWTPLKAQKMHYIVCDTNGY 152

Query: 225  LHLLVITHEKEKVTGLKIELLG--ETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDA 282
            L  +  +   +    L +E L        ++++ +   +++IGS   +S+LI L+   D 
Sbjct: 153  LFGVYSSILGKNKWSLVMERLPIPPFDFITSLNSIHEGLLFIGSKNSESKLINLSTLKD- 211

Query: 283  KGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASV 342
                   ++   NLGPI D  V  L+   +   + C+G  ++ SL   ++ + ++     
Sbjct: 212  -------VDSIPNLGPIHDLLV--LKNDIEKSFLVCAGTPRNASLIYFQHALKLDILGQT 262

Query: 343  ELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCH 402
            ++ GI     L S  +      L + F SET  +A N++++ +  E++   S  +     
Sbjct: 263  KISGILRAMVLPSYPEHK----LFLGFPSET--VAFNIKEDFQ-LELDPSLSTKERTIAL 315

Query: 403  DAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLE 462
                 + VQVTS  + +  S  R      +       N A       ++  G    ++ +
Sbjct: 316  SGTNGENVQVTSTFLCIYDSAKRSRLVYIEK----ITNAACYQEYSAIVINGTALAIFKK 371

Query: 463  IGDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLIT-K 521
                  TEV     E EISCLD +      +  QI  VG W+   V I +  D + I+  
Sbjct: 372  D-----TEVARKVFESEISCLDFS------AQFQIG-VGFWSK-QVMILTFSDNSSISCA 418

Query: 522  EHLGGEIIPRSVLLCAFEGI----SYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQP 577
                   +PR+++L   EG+    + LL + G G   +++L       ++ K    GT P
Sbjct: 419  FQTNVPSLPRNIIL---EGVGVDRNLLLVSSGSGEFKSYVLFKNNLVFSETKH--FGTTP 473

Query: 578  ITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAK 637
            ++ R F+    T++   +D P ++Y  N  L Y  +++ +   +C F   +  D L    
Sbjct: 474  VSFRRFTMNIGTYIICNNDCPHMVYGFNGALCYMPLSMPQSYDVCQFRDNSGKDFLISVS 533

Query: 638  EGELTIGTIDDIQKLHIRSIPLGEHP-RRICHQEQSRTFAICSLKN-----QSCAEESEM 691
             G L    ++ + +L  R + L   P + I  Q +     + +L+N     +S  E   +
Sbjct: 534  LGGLKFLQLNPLPELTPRKVLLEHVPLQAIIFQNK---LLLRTLENRYEDYESYKENYHL 590

Query: 692  HFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRI 751
              V   DD +F   S    +  E    I   S          VGT+ +  ++  P  GR+
Sbjct: 591  ELVDSYDDNSFRVFSFTENERCEKVLKINESSL--------LVGTSIIEQDKLVPVNGRL 642

Query: 752  LV--FIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSE 809
            ++  F  E   L++++      AV  L  +N + + A  Q++ + K     +    + S 
Sbjct: 643  ILLEFEKELQSLKVVSSMVLSAAVIDLGVYNDRYIVAFGQQVAIVKL---TEERLMIDSR 699

Query: 810  CGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDD 869
                  +L L V+  G+ I + D +   +++ +  ++  +  R   +  N + A  + + 
Sbjct: 700  ISLGSIVLQLIVE--GNEIAIADSIGRFTIMYFDGQKFIVVARYL-FGENIVKAA-LYEG 755

Query: 870  DIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVG 929
             +Y+ A N+  L  +R N +     +R   E V  YHL + V++F++       P ++  
Sbjct: 756  TVYIIATNSGLLKLLRYNKDAKNFNDRFICESV--YHLHDKVSKFQN------FPITNTN 807

Query: 930  QI--PTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKV-IKGVGGLNHEQWRSFNNEK 986
                P ++F T  G IG I SL  ++ L LE+L   +RK+    +  +++E       E 
Sbjct: 808  SFLEPKMLFATEIGAIGSIVSLKDKE-LELEELTRKIRKLKFSYLSSMDYESI-----EA 861

Query: 987  KTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVE 1030
              +    F+DGDL+   +D+ R    E+ +        LC+ VE
Sbjct: 862  DLISPVPFIDGDLV---IDVKRWASSELFR--------LCRSVE 894


>gi|297739782|emb|CBI29964.3| unnamed protein product [Vitis vinifera]
          Length = 1363

 Score =  119 bits (297), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 125/537 (23%), Positives = 226/537 (42%), Gaps = 80/537 (14%)

Query: 295 NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINE--QASVELQGIKGMWS 352
           N+ PI+D  VVD   +   Q+  C G   +GSLRI+R+GI + +  + +   QGI G W+
Sbjct: 462 NIAPILDMSVVDCHDEEHDQMFACCGVTPEGSLRIIRSGISVEKLLRTAPIYQGITGTWT 521

Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
           ++    D + +FLV+SF+ ETR+L++ L    + T+  GF     TL C       LVQ+
Sbjct: 522 VKMKVIDSYHSFLVLSFVEETRVLSVGLSFT-DVTDSVGFQPDVSTLACGVVDDGLLVQI 580

Query: 413 TSGSVRLVSSTS----------RELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLE 462
               V+L   T+            +   W  P   S+++     + +++AT     +++ 
Sbjct: 581 HKNGVKLCLPTTVAHPEGIPLASPICTSW-FPENISISLGAVGYNLIVVATSSPCFLFIL 639

Query: 463 IGDGI------LTEVKHAQLEYEISCLDI--NPIGENPSY--------SQIAAVGMWTDI 506
               +      + E++H +L+ E+SC+ I      + PS         S  AA+ +  +I
Sbjct: 640 GVRSVSAYQYEIYEMQHVRLQNEVSCISIPHKHFDKKPSTFLSNLVDNSSAAALLIGVNI 699

Query: 507 SVRIF-------SLPDLNLITKEHL----GGEIIPRSVLLCAFEGIS------------Y 543
             RIF       S+  L+ +  E L     G I   + L  A  G              Y
Sbjct: 700 G-RIFVIGTHKPSVEILSFLPDEGLRILASGAISLTNTLGTAVSGCVPQDARLVLVDRFY 758

Query: 544 LLCALGDGHLLNFLLNMKTGELTDRKKVS-------------------LGTQPITLRTFS 584
           +L  L +G LL F L   +   +                         +G  P+ L   S
Sbjct: 759 VLSGLRNGMLLRFELPAASMVFSSELSSHSPSTNINSPVNLQLIAIRRIGITPVFLVPLS 818

Query: 585 SKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIG 644
                 + A SDRP ++ S+   L Y++++ +  +H+ P  S   P  +    E  L + 
Sbjct: 819 DSLEADIIALSDRPWLLQSARHSLSYTSISFQPSTHVTPVCSMECPMGILFVAENSLHLV 878

Query: 645 TIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEF 704
            +   ++L+++   LG  PR++ +  +SR   +        ++++    +  +D  +   
Sbjct: 879 EMVHSKRLNVQKFYLGGTPRKVLYHSESRLLLV---MRTELSQDTYSSDICCVDPLSGSV 935

Query: 705 ISTYPLDTFEYGCSILSCSFSDDS----NVYYCVGTAYVLPEENEPTKGRILVFIVE 757
           +S++ L+  E G S+      ++           G A +   E E TKGR++V  +E
Sbjct: 936 LSSFKLELGETGKSMELVRVVNEQVLVIGTSLSSGPAMMPSGEAESTKGRLIVLCLE 992



 Score = 50.4 bits (119), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 44/203 (21%), Positives = 86/203 (42%), Gaps = 17/203 (8%)

Query: 828  IVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAEN--NFNLFTVR 885
            I VGD    +    Y  +   +E+   D     ++   ++D D  + ++   +  + +  
Sbjct: 1122 IAVGDCRDGVVFYSYHEDSRKLEQLYCDPEQRLVADCILMDVDTAVVSDRKGSIAVLSCS 1181

Query: 886  KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV-----GQIPTVIF---- 936
             + E     E   L +   Y++GE     + GS   +LP  DV     G    + F    
Sbjct: 1182 NHLEDNASPE-CNLTLNCSYYMGEIAMSIKKGSFSYKLPADDVLKGCDGSNTIIDFSENS 1240

Query: 937  ---GTVNGVIGVIASLPHEQYLFLEKLQTNL--RKVIKGVGGLNHEQWRSFNNEKKTVDA 991
               GT+ G I ++  +  E++  LE +Q  L   ++   + G +H ++RS  N  +    
Sbjct: 1241 IMAGTLLGSIIMLIPISREEHELLEAVQARLAVHQLTAPILGNDHNEFRSRENSVRKAGV 1300

Query: 992  KNFLDGDLIESFLDLSRTRMDEI 1014
               LDGD++  FL+L+  + + +
Sbjct: 1301 SKILDGDMLAQFLELTSMQQEAV 1323


>gi|226480826|emb|CAX73510.1| glyceraldehyde 3-phosphate dehydrogenase [Schistosoma japonicum]
          Length = 332

 Score =  118 bits (295), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 79/249 (31%), Positives = 125/249 (50%), Gaps = 21/249 (8%)

Query: 3   DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
           DVPI  +I    LFR      D LF+ T +    +++   + +S E +T A G V DR  
Sbjct: 54  DVPINAKIVAASLFRRKDRETDSLFLLTHKAGVAIIECVRNNDSVEFVTVASGSVEDRSA 113

Query: 61  RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIP------------FDNKGQLKEAFNIRLEE 108
           R  D G   +IDP    I + LY GL K+I              D        +++R+EE
Sbjct: 114 RIIDQGFDVLIDPGANYIVVRLYHGLLKIILLQCIGDKIGTDFLDTNQWTVNTYSVRIEE 173

Query: 109 LQVLDIKFLYGCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLI 168
             ++D+ F+YG + PT  ++Y+D     H+KTYE+  ++        + ++++  + LLI
Sbjct: 174 GNIVDMAFIYGYSLPTFAMIYEDEL-VLHMKTYEIYGREPALRNVQLTLDSIEPDSKLLI 232

Query: 169 PVPPPLCGVLIIGEETIVYCSANA---FKAIP-IRPSITKAYGRVDADGSRYLLGDHAGL 224
           PVP P  GV+++G+  I Y + +     + IP  + S    Y  VDA   RYLLGD AG 
Sbjct: 233 PVPKPYGGVILVGDNIICYHTKDGPHISQYIPQAKASQVLCYAAVDA--QRYLLGDMAGR 290

Query: 225 LHLLVITHE 233
           L+++ +  E
Sbjct: 291 LYMVHLLSE 299


>gi|124806507|ref|XP_001350742.1| splicing factor 3b, subunit 3, 130kD, putative [Plasmodium
           falciparum 3D7]
 gi|23496869|gb|AAN36422.1|AE014849_41 splicing factor 3b, subunit 3, 130kD, putative [Plasmodium
           falciparum 3D7]
          Length = 1329

 Score =  116 bits (291), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 123/535 (22%), Positives = 219/535 (40%), Gaps = 57/535 (10%)

Query: 160 LDNGADLLIPVP---PPLCGVLIIGEETIVYCSAN---AFKAIPIRPSI--TKAYGRVDA 211
           +D  A LLIP+P       GVLI  E  +VY   +    + A P R  I   K    +  
Sbjct: 263 IDITAHLLIPLPGGQQGPSGVLICCENFLVYKKVDHEDIYCAYPRRLEIGQDKNISIICW 322

Query: 212 DGSRY------LLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIG 265
              R       L+    G L+ + + HE   V  +  +      I ++IS L +  +++ 
Sbjct: 323 TMHRIKKFFFILIQSEYGDLYKIEVDHEDGIVKEIVCKYFDTVPIGNSISVLKSGSLFVA 382

Query: 266 SSYGD-------------SQLIKLNLQPDAKGSYVE----------VLERYVNLGPIVDF 302
           + +G+              Q +  +  P  K + +           ++++  +L PI+D 
Sbjct: 383 AEFGNHYFYQFSGIGDDNKQFMCTSNHPLGKNAIIAFKTNKLKNLYLVDQIYSLSPILDM 442

Query: 303 CVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTDDPF 361
            ++D +     Q+ T  G     SLRI+++G+ I E A  EL G  K +W+++      +
Sbjct: 443 KIIDAKNTHTPQIYTLCGRGPRSSLRILQHGLSIEELADNELPGKPKYIWTIKKDNLSEY 502

Query: 362 DTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVS 421
           D ++VVSF   T IL +   + +EE       +   TL  +    N  +QV    +R ++
Sbjct: 503 DGYIVVSFEGNTLILEIG--ESVEEVSDTLLLNNVTTLHINILYDNSFIQVYDTGIRHIN 560

Query: 422 STSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDG-ILTEVKHAQLEYEI 480
               ++  EW +P    +  A++N+SQ++++  GG L+Y EI +   L E+    L  E+
Sbjct: 561 G---KVVQEWVAPKNKQIKAASSNSSQIVISLSGGELIYFEIDESHTLVEIFRKNLNVEV 617

Query: 481 SCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPD---LNLITKEHLGGEIIPRSVLLCA 537
            CL I  I  N   +   AVG   ++ VR+ S+        ++   L     P+ + +  
Sbjct: 618 LCLSIQQIPPNRVRANFLAVGCLDNV-VRLLSIEKDKYFKQLSTHLLPNNSSPQDICISE 676

Query: 538 F---------EGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNT 588
                       I +L   L  G LL  +++   G L++     LG + I +   +    
Sbjct: 677 MNDNGNTMKERNIIFLNIGLNTGVLLRSIIDPVAGTLSNHYSKYLGAKSIKICPVNVNKN 736

Query: 589 THVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTI 643
             +    ++  + Y    K LYS +N   + +   F S    D         L I
Sbjct: 737 PALLVLCEKTYLCYMHQGKFLYSPLNYDMLEYASSFYSPQCSDGYVAISSNSLRI 791



 Score = 72.8 bits (177), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 80/363 (22%), Positives = 157/363 (43%), Gaps = 45/363 (12%)

Query: 694  VRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC--VGTAYVLPEENEP-TKGR 750
            +++++    + +    LD  E   S+ +C         +C  VGT   L  + +  T   
Sbjct: 983  IKIINPVNLQILDKISLDMEEAALSVCACELE----ALHCLIVGTTTNLSLKTKSLTSAS 1038

Query: 751  ILVFIVE-DGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSE 809
            + V+  +   KL L+     +   Y   ++NGKL+A+I  K+++Y       G ++L  +
Sbjct: 1039 LRVYTYDIQYKLNLLHITPIEEQPYCFCSYNGKLIASIGNKLRIYAL-----GKKKLLKK 1093

Query: 810  CGHHGHILALY-VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILD 868
            C +     A+  ++  G+ I   D+ +S+ +  Y   +  +   + D    W++  EILD
Sbjct: 1094 CEYKDIPEAIVSIKISGNRIFACDIRESVLIFFYDPNQNTLRLISDDIIPRWITCSEILD 1153

Query: 869  DDIYLGAENNFNLFTVRKNSEGATDE------------------ERGRLEVVGEYHLGEF 910
                + A+   ++F +R   E   DE                  +  +LE +  +H+GE 
Sbjct: 1154 HHTIMAADKFDSVFILRVPEEAKQDEYGITNKCWYGGEIMNSSTKNRKLEHMMSFHIGEI 1213

Query: 911  VNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFL-EKLQTNLRKVIK 969
            V   +   L    P S       +I+ T+ G IG      +++ L L + L+  LR    
Sbjct: 1214 VTSMQKVRLS---PTSS----ECIIYSTIMGTIGAFIPYDNKEELELTQHLEIILRTEKP 1266

Query: 970  GVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRV 1029
             + G  H  +RS+ +       +N +DGDL E F  LS     +I+  +  + E++ +++
Sbjct: 1267 PLCGREHIFFRSYYHP-----VQNVVDGDLCEQFSSLSYDAQKKIANDLERTPEDILRKL 1321

Query: 1030 EEL 1032
            E++
Sbjct: 1322 EDI 1324


>gi|340508225|gb|EGR33979.1| splicing factor subunit 130kda, putative [Ichthyophthirius
           multifiliis]
          Length = 983

 Score =  115 bits (289), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 159/692 (22%), Positives = 283/692 (40%), Gaps = 111/692 (16%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
           ++G I T+  FR  G  +D++ I ++  +  +L ++ +  E              R    
Sbjct: 59  VFGLIRTISTFRLPGFKKDYIIIGSDSGRIVILNYNKKKGEFEKLHQETYGKTGCRRIVP 118

Query: 66  GQIGIIDPD---CRLIGLHLYDGLFKVIPFDNKGQLKE---------------AFNIRLE 107
           GQ   +DP    C +  L     +F +   ++K  +                   ++  E
Sbjct: 119 GQYIAVDPKGRVCMIAALEKQKFVFILNRENDKLTISSPLEAHKSHTICYDICGIDVGYE 178

Query: 108 ELQVLDIKFLYGCAKPTIVVLYQDNKDA--------RHVKTYEVALKDKDFVEGPWSQNN 159
             Q   I+  YG          +D KD+        + +  YE+       V+   S  N
Sbjct: 179 NAQFACIECDYG---------EKDQKDSPMNNGIIQKQLTIYELDFGLNHVVKK--SSEN 227

Query: 160 LDNGADLLIPVP---PPLCGVLIIGEETIVYCSANAFK--AIPIRPSI-TKAYGRVDADG 213
           +   A LLI +P       GV+I+ E+ ++Y      +    P R S  T +   ++  G
Sbjct: 228 VPESAHLLISIPGGQEGPGGVVIVCEDFLIYKGTKGERICQFPKRFSADTNSKIMINTFG 287

Query: 214 SR-------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGS 266
                    +LL    G L  L I   K+ V  + ++       + +I  + N  ++   
Sbjct: 288 FHKQKEFFFFLLQTELGDLFKLEIQSTKDDVHSISLQYFDSIPPSISICVMRNGYLFAAC 347

Query: 267 SYGDSQLIKL---------NLQPDA-------------KGSYVEVLERYVNLGPIVDFCV 304
             G+  L +          +++ D+             K   ++ +++  NL  I D  V
Sbjct: 348 EKGNHLLYRFKSIGEKEANSVRTDSTQDQKMPVLFIPRKLKNLQQVDQLENLSAISDIKV 407

Query: 305 VDLERQGQGQVVT-CSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSLRSSTDDPFD 362
            DL  +G  Q+ T CS  Y+  SLRI+R+G+ +NE A+  L G+  G+W+++S  D+ F 
Sbjct: 408 SDLTGEGHPQIYTLCSAGYRS-SLRILRHGLQVNEVAASRLPGVPTGIWTIKSRYDENFS 466

Query: 363 TFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSS 422
            ++++SF  +T  L +++ D +++    G     QT+  +    N ++QV     R +  
Sbjct: 467 KYIILSFSKQT--LVLSISDRVQQVTDSGIDLNKQTIHANLLEDNAIIQVMPDGFRHIRV 524

Query: 423 TSR--ELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTEVKHAQLEYE 479
             R  +L+ E K      +  A +N  Q+ ++  GG ++Y E+   G L EV    L+ E
Sbjct: 525 DKRVQQLKTEGK------IVKAVSNQKQIAISLQGGDIIYFELDYAGQLIEVAKTNLQEE 578

Query: 480 ISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPD---LNLITKEHLGGEIIPRSVLLC 536
           I C+DI  +      S+  +VG  +D SVRI SL +   L  I+ + L G  I  +V L 
Sbjct: 579 IECMDIGEVPFGRQKSKFLSVG-CSDHSVRILSLENDTCLQKISIQALPG--IAENVSLI 635

Query: 537 AFEGIS-----------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSS 585
             +  S           YL   L +G LL   ++  TG L+D +   L   P+    +  
Sbjct: 636 EMKRGSGLEQEAEQYQLYLYVGLKNGILLRASVDQITGSLSDTRTRVLSGAPVRTCKYQV 695

Query: 586 KNTTHVFA--------ASDRPTVIYSSNKKLL 609
           +    + A          D P  +++   KLL
Sbjct: 696 QGQPALLALKLIHKTEVDDIPGSLHAHKGKLL 727



 Score = 89.0 bits (219), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 80/295 (27%), Positives = 135/295 (45%), Gaps = 43/295 (14%)

Query: 761  LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLY----KWMLRDDGTRELQSECGHHGHI 816
            L+LI + E      SL+A  GKLLA     ++ Y    K +L+    + LQS        
Sbjct: 704  LKLIHKTEVDDIPGSLHAHKGKLLAGCGTFLRYYDIGKKKLLKKSEVKGLQSPING---- 759

Query: 817  LALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAE 876
                +QT GD I V  +  ++ ++ +K +E    E   D    WMSA ++LD   Y+G +
Sbjct: 760  ----IQTFGDRIFVSMVGDAVHIMKHKQKEQTFYEVCDDVLPRWMSAFQVLDYSTYIGGD 815

Query: 877  NNFNLFTVR--KNSEGATDE---------ERGRL-------EVVGEYHLGEFVNRFRHGS 918
               N+F  R  +N+E   DE         E G L       E + ++++GE V  F+   
Sbjct: 816  KFENMFVCRIPQNAEEEMDENPMSYKLRWESGYLNGAPYKTEQICQFYVGEVVTTFQKAC 875

Query: 919  LVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYL-FLEKLQTNLRKVIKGVGGLNHE 977
            LV        G    +I+GT  G IG       ++ + F   L+  LR  +  + G +H 
Sbjct: 876  LV------STGN-ECIIYGTSMGSIGAFYPFQTKEDIDFFVHLEMYLRIDVLPLAGRDHV 928

Query: 978  QWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
             +R+F    K+V     +DGDL E F+ + + +   +++ M  +  E+ K+++E+
Sbjct: 929  MFRAFYGPVKSV-----IDGDLCEQFMRIGQGKQKVLAEEMERTPAEVHKKLDEI 978


>gi|335310432|ref|XP_003362030.1| PREDICTED: splicing factor 3B subunit 3-like [Sus scrofa]
          Length = 701

 Score =  115 bits (289), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 135/579 (23%), Positives = 244/579 (42%), Gaps = 79/579 (13%)

Query: 1   MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
           +L V ++G I +L  FR  G  +D++ + ++  +  +L++   S  +  +   +   + G
Sbjct: 53  LLTVEVFGVIRSLMAFRLTGGTKDYIVVGSDSGRIVILEYQP-SKNMFEKIHQETFGKSG 111

Query: 61  -RPTDNGQIGIIDPDCRLIGLHLYDG--LFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
            R    GQ   +DP  R + +   +   L  ++  D   +L  +  +   +   L    +
Sbjct: 112 CRRIVPGQFLAVDPKGRAVMISAIEKQKLVYILNRDAAARLTISSPLEAHKANTLVYHVV 171

Query: 118 ---YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNNLDNGAD------ 165
               G   P    L  D ++A +  T E A   +    F E     N++           
Sbjct: 172 GVDVGFENPMFACLEMDYEEADNDPTGEAAANTQQTLTFYELDLGLNHVVRKYSQIIAEF 231

Query: 166 -LLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR---- 215
             L+ VP     P   GVLI  E  I Y   N      IR  I +    +D D  R    
Sbjct: 232 HFLLTVPGGSDGP--SGVLICSENYITY--KNFGDQPDIRCPIPRRRNDLD-DPERGMIF 286

Query: 216 -------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVV 262
                        +L     G +  + +  +++ VT ++++      +A+ +  L    +
Sbjct: 287 VCSATHKTKSMFFFLAQTEQGDIFKITLETDEDMVTEIRLKYFDTVPVAAAMCVLKTGFL 346

Query: 263 YIGSSYGDSQLIKLN------------------------LQPDAKGSYVEVLERYVNLGP 298
           ++ S +G+  L ++                          QP    + V +++   +L P
Sbjct: 347 FVASEFGNHYLYQIAHLGDDDEEPEFSSAMPLEEGDTFFFQPRPLKNLV-LVDELDSLSP 405

Query: 299 IVDFC-VVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSS 356
           I+ FC + DL  +   Q+    G     SLR++R+G+ ++E A  EL G    +W++R  
Sbjct: 406 IL-FCQIADLANEDTPQLYVACGRGPRSSLRVLRHGLEVSEMAVSELPGNPNAVWTVRRH 464

Query: 357 TDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGS 416
            +D FD +++VSF++ T +L++   + +EE    GF   T TL C     + LVQV    
Sbjct: 465 IEDEFDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSLLGDDALVQVYPDG 522

Query: 417 VRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKHA 474
           +R + +  R   NEWK+P   ++     N  QV++A  GG LVY E+   G L E  +  
Sbjct: 523 IRHIRADKR--VNEWKTPGKKTIVKCAVNQRQVVIALTGGELVYFEMDPSGQLNEYTERK 580

Query: 475 QLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL 513
           ++  ++ C+ +  +      S+  AVG+  D +VRI SL
Sbjct: 581 EMSADVVCMSLANVPPGEQRSRFLAVGL-VDNTVRIISL 618


>gi|225441567|ref|XP_002276675.1| PREDICTED: pre-mRNA-splicing factor rse1-like [Vitis vinifera]
          Length = 1387

 Score =  115 bits (287), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 126/551 (22%), Positives = 229/551 (41%), Gaps = 94/551 (17%)

Query: 295  NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINE--QASVELQGIKGMWS 352
            N+ PI+D  VVD   +   Q+  C G   +GSLRI+R+GI + +  + +   QGI G W+
Sbjct: 462  NIAPILDMSVVDCHDEEHDQMFACCGVTPEGSLRIIRSGISVEKLLRTAPIYQGITGTWT 521

Query: 353  LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
            ++    D + +FLV+SF+ ETR+L++ L    + T+  GF     TL C       LVQ+
Sbjct: 522  VKMKVIDSYHSFLVLSFVEETRVLSVGLSFT-DVTDSVGFQPDVSTLACGVVDDGLLVQI 580

Query: 413  TSGSVRLVSSTS----------RELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLE 462
                V+L   T+            +   W  P   S+++     + +++AT     +++ 
Sbjct: 581  HKNGVKLCLPTTVAHPEGIPLASPICTSW-FPENISISLGAVGYNLIVVATSSPCFLFIL 639

Query: 463  IGDGI------LTEVKHAQLEYEISCLDI--NPIGENPSY--------SQIAAVGMWTDI 506
                +      + E++H +L+ E+SC+ I      + PS         S  AA+ +  +I
Sbjct: 640  GVRSVSAYQYEIYEMQHVRLQNEVSCISIPHKHFDKKPSTFLSNLVDNSSAAALLIGVNI 699

Query: 507  SVRIF-------SLPDLNLITKEHL----GGEIIPRSVLLCAFEGIS------------Y 543
              RIF       S+  L+ +  E L     G I   + L  A  G              Y
Sbjct: 700  G-RIFVIGTHKPSVEILSFLPDEGLRILASGAISLTNTLGTAVSGCVPQDARLVLVDRFY 758

Query: 544  LLCALGDGHLLNFLLNMKT---------------------GELTDRKKVS---------- 572
            +L  L +G LL F L   +                      +    K ++          
Sbjct: 759  VLSGLRNGMLLRFELPAASMVFSSELSSHSPSVSSCSVNDADTNLSKNINSPVNLQLIAI 818

Query: 573  --LGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFP 630
              +G  P+ L   S      + A SDRP ++ S+   L Y++++ +  +H+ P  S   P
Sbjct: 819  RRIGITPVFLVPLSDSLEADIIALSDRPWLLQSARHSLSYTSISFQPSTHVTPVCSMECP 878

Query: 631  DSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESE 690
              +    E  L +  +   ++L+++   LG  PR++ +  +SR   +        ++++ 
Sbjct: 879  MGILFVAENSLHLVEMVHSKRLNVQKFYLGGTPRKVLYHSESRLLLV---MRTELSQDTY 935

Query: 691  MHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDS----NVYYCVGTAYVLPEENEP 746
               +  +D  +   +S++ L+  E G S+      ++           G A +   E E 
Sbjct: 936  SSDICCVDPLSGSVLSSFKLELGETGKSMELVRVVNEQVLVIGTSLSSGPAMMPSGEAES 995

Query: 747  TKGRILVFIVE 757
            TKGR++V  +E
Sbjct: 996  TKGRLIVLCLE 1006



 Score = 44.7 bits (104), Expect = 0.31,   Method: Compositional matrix adjust.
 Identities = 32/124 (25%), Positives = 58/124 (46%), Gaps = 17/124 (13%)

Query: 905  YHLGEFVNRFRHGSLVMRLPDSDV-----GQIPTVIF-------GTVNGVIGVIASLPHE 952
            Y++GE     + GS   +LP  DV     G    + F       GT+ G I ++  +  E
Sbjct: 1227 YYMGEIAMSIKKGSFSYKLPADDVLKGCDGSNTIIDFSENSIMAGTLLGSIIMLIPISRE 1286

Query: 953  QYLFLEKLQTNL--RKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTR 1010
            ++  LE +Q  L   ++   + G +H ++RS  N   +      LDGD++  FL+L+  +
Sbjct: 1287 EHELLEAVQARLAVHQLTAPILGNDHNEFRSREN---SAGVSKILDGDMLAQFLELTSMQ 1343

Query: 1011 MDEI 1014
             + +
Sbjct: 1344 QEAV 1347


>gi|149038189|gb|EDL92549.1| splicing factor 3b, subunit 3 (predicted) [Rattus norvegicus]
          Length = 650

 Score =  114 bits (286), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 149/662 (22%), Positives = 275/662 (41%), Gaps = 117/662 (17%)

Query: 466  GILTE-VKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHL 524
            G L E  +  ++  ++ C+ +  +      S+  AVG+  D +VRI SL   + +  + L
Sbjct: 5    GQLNEYTERKEMSADVVCMSLANVPPGEQRSRFLAVGL-VDNTVRIISLDPSDCL--QPL 61

Query: 525  GGEIIP-RSVLLCAFE----------------GISYLLCALGDGHLLNFLLNMKTGELTD 567
              + +P +   LC  E                G  YL   L +G LL  +L+  TG+L+D
Sbjct: 62   SMQALPAQPESLCIVEMGGTEKQDELGERGSIGFLYLNIGLQNGVLLRTVLDPVTGDLSD 121

Query: 568  RKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSA 627
             +   LG++P+ L     +    V A S R  + YS   +   + ++ + +     F S 
Sbjct: 122  TRTRYLGSRPVKLFRVRMQGQEAVLAMSSRSWLSYSYQSRFHLTPLSYETLEFASGFASE 181

Query: 628  AFPDSLAIAKEGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQEQSR-----------T 674
              P+ +       L I  ++ +  +  + + PL   PR+ + H E +            T
Sbjct: 182  QCPEGIVAISTNTLRILALEKLGAVFNQVAFPLQYTPRKFVIHPESNNLIIIETDHNAYT 241

Query: 675  FAICSLKNQSCAEE-------------SEM-------------------------HFVRL 696
             A  + + Q  AEE             +EM                           +R+
Sbjct: 242  EATKAQRKQQMAEEMVEAAGEDERELAAEMAAAFLNENLPESIFGAPKAGNGQWASVIRV 301

Query: 697  LDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC-VGTAYVLPEENEPTKGRILVF- 754
            ++      +    L+  E   S+  C FS+    +Y  VG A  L        G  +   
Sbjct: 302  MNPIQGNTLDLVQLEQNEAAFSVAVCRFSNTGEDWYVLVGVAKDLILSPRSVAGGFVYTY 361

Query: 755  -IVEDG-KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGH 812
             +V +G KL+ + +   +    ++  F G++L  + + +++Y     D G ++L  +C  
Sbjct: 362  KLVNNGEKLEFLHKTPVEEVPAAIAPFQGRVLIGVGKLLRVY-----DLGKKKLLRKC-E 415

Query: 813  HGHILALYV---QTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDD 869
            + HI A Y+   QT G  ++V D+ +S   + YK  E  +   A D    W++   +LD 
Sbjct: 416  NKHI-ANYISGIQTIGHRVIVSDVQESFIWVRYKRNENQLIIFADDTYPRWVTTASLLDY 474

Query: 870  DIYLGAENNFNLFTVR--KNSEGATDE---------ERGRL-------EVVGEYHLGEFV 911
            D   GA+   N+  VR   N+    DE         +RG L       EV+  YH+GE V
Sbjct: 475  DTVAGADKFGNICVVRLPPNTNDEVDEDPTGNKALWDRGLLNGASQKAEVIMNYHVGETV 534

Query: 912  NRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKG 970
               +  +L+        G   ++++ T++G IG++     HE + F + ++ +LR     
Sbjct: 535  LSLQKTTLI-------PGGSESLVYTTLSGGIGILVPFTSHEDHDFFQHVEMHLRSEHPP 587

Query: 971  VGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVE 1030
            + G +H  +RS+         KN +DGDL E F  +   +   +S+ ++ +  E+ K++E
Sbjct: 588  LCGRDHLSFRSY-----YFPVKNVIDGDLCEQFNSMEPNKQKNVSEELDRTPPEVSKKLE 642

Query: 1031 EL 1032
            ++
Sbjct: 643  DI 644


>gi|414882040|tpg|DAA59171.1| TPA: hypothetical protein ZEAMMB73_269016 [Zea mays]
          Length = 690

 Score =  114 bits (285), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 115/447 (25%), Positives = 196/447 (43%), Gaps = 23/447 (5%)

Query: 278 LQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGIN 337
            QP A  + + + E   +L P++D  V +L  +   Q+ T  G     +LRI+R G+ I+
Sbjct: 248 FQPRALKNLMRIDE-IESLMPVMDMRVANLFDEETPQLFTACGRGPRSTLRILRPGLAIS 306

Query: 338 EQASVELQGIK-GMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQT 396
           E A   L      +W+++ + +D FD ++VVSF + T +L++   + +EE     F   T
Sbjct: 307 EMARSMLPAEPIAVWTVKKNINDMFDAYIVVSFTNVTLVLSIG--ETIEEVSDSQFLDTT 364

Query: 397 QTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGG 456
            +L       + L+QV    +R +    R   NEW++P   ++    +N  QV++A  GG
Sbjct: 365 HSLAVTLLGEDSLMQVHPNGIRHIREDGRV--NEWRTPGKKTITKVGSNRLQVVIALSGG 422

Query: 457 HLVYLEIG-DGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-P 514
            L+Y E+   G L EV+   +  +++CL I P+ E    S+   VG + D ++ I SL P
Sbjct: 423 ELIYFEMDMTGQLMEVEKQDMSGDVACLAIAPVPEGRQRSRFLVVGSY-DNTIGILSLDP 481

Query: 515 DLNLITKEHLGGEIIPRSVLLCAFE-------GISY-----LLCALGDGHLLNFLLNMKT 562
           D  L           P S+L    +       G  Y     L   L +G L    ++M T
Sbjct: 482 DDCLQPLSVQSVSSAPESLLFLEVQASVGGEDGADYPANLFLNAGLQNGVLFRTNVDMVT 541

Query: 563 GELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMC 622
           G+L+D +   LG +P  L      +       S RP + Y      L + ++   +    
Sbjct: 542 GQLSDTRSRFLGLRPPKLFPCIVSHRQAKLCLSSRPWLGYIHQGHFLLTPLSCDTLESAS 601

Query: 623 PFNSAAFPDSLAIAKEGELTIGTIDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAIC-SL 680
            F+S    + +       L I T++ + +  +  +IPL   PR+     + +   +  S 
Sbjct: 602 SFSSDQCSEGVVAVAGDALRIFTVERLGETFNETTIPLRYTPRKFVILPKKKYIPVIESD 661

Query: 681 KNQSCAEESEMHFVRLLDDQTFEFIST 707
           K    AEE E      LD     F+ +
Sbjct: 662 KGAFSAEEREAAKKECLDGCLRAFVPS 688


>gi|119572189|gb|EAW51804.1| splicing factor 3b, subunit 3, 130kDa, isoform CRA_b [Homo sapiens]
          Length = 635

 Score =  114 bits (285), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 146/651 (22%), Positives = 270/651 (41%), Gaps = 116/651 (17%)

Query: 476  LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIP-RSVL 534
            +  ++ C+ +  +      S+  AVG+  D +VRI SL   + +  + L  + +P +   
Sbjct: 1    MSADVVCMSLANVPPGEQRSRFLAVGL-VDNTVRIISLDPSDCL--QPLSMQALPAQPES 57

Query: 535  LCAFE----------------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPI 578
            LC  E                G  YL   L +G LL  +L+  TG+L+D +   LG++P+
Sbjct: 58   LCIVEMGGTEKQDELGERGSIGFLYLNIGLQNGVLLRTVLDPVTGDLSDTRTRYLGSRPV 117

Query: 579  TLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKE 638
             L     +    V A S R  + YS   +   + ++ + +     F S   P+ +     
Sbjct: 118  KLFRVRMQGQEAVLAMSSRSWLSYSYQSRFHLTPLSYETLEFASGFASEQCPEGIVAIST 177

Query: 639  GELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQEQSR-----------TFAICSLKNQSC 685
              L I  ++ +  +  + + PL   PR+ + H E +            T A  + + Q  
Sbjct: 178  NTLRILALEKLGAVFNQVAFPLQYTPRKFVIHPESNNLIIIETDHNAYTEATKAQRKQQM 237

Query: 686  AEE-------------SEM-------------------------HFVRLLDDQTFEFIST 707
            AEE             +EM                           +R+++      +  
Sbjct: 238  AEEMVEAAGEDERELAAEMAAAFLNENLPESIFGAPKAGNGQWASVIRVMNPIQGNTLDL 297

Query: 708  YPLDTFEYGCSILSCSFSDDSNVYYC-VGTAYVLPEENEPTKGRILVF--IVEDG-KLQL 763
              L+  E   S+  C FS+    +Y  VG A  L        G  +    +V +G KL+ 
Sbjct: 298  VQLEQNEAAFSVAVCRFSNTGEDWYVLVGVAKDLILNPRSVAGGFVYTYKLVNNGEKLEF 357

Query: 764  IAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYV-- 821
            + +   +    ++  F G++L  + + +++Y     D G ++L  +C  + HI A Y+  
Sbjct: 358  LHKTPVEEVPAAIAPFQGRVLIGVGKLLRVY-----DLGKKKLLRKC-ENKHI-ANYISG 410

Query: 822  -QTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFN 880
             QT G  ++V D+ +S   + YK  E  +   A D    W++   +LD D   GA+   N
Sbjct: 411  IQTIGHRVIVSDVQESFIWVRYKRNENQLIIFADDTYPRWVTTASLLDYDTVAGADKFGN 470

Query: 881  LFTVR--KNSEGATDE---------ERGRL-------EVVGEYHLGEFVNRFRHGSLVMR 922
            +  VR   N+    DE         +RG L       EV+  YH+GE V   +  +L+  
Sbjct: 471  ICVVRLPPNTNDEVDEDPTGNKALWDRGLLNGASQKAEVIMNYHVGETVLSLQKTTLI-- 528

Query: 923  LPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRS 981
                  G   ++++ T++G IG++     HE + F + ++ +LR     + G +H  +RS
Sbjct: 529  -----PGGSESLVYTTLSGGIGILVPFTSHEDHDFFQHVEMHLRSEHPPLCGRDHLSFRS 583

Query: 982  FNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            +         KN +DGDL E F  +   +   +S+ ++ +  E+ K++E++
Sbjct: 584  Y-----YFPVKNVIDGDLCEQFNSMEPNKQKNVSEELDRTPPEVSKKLEDI 629


>gi|339259094|ref|XP_003369733.1| splicing factor 3B subunit 3 [Trichinella spiralis]
 gi|316965959|gb|EFV50595.1| splicing factor 3B subunit 3 [Trichinella spiralis]
          Length = 1241

 Score =  114 bits (284), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 165/765 (21%), Positives = 321/765 (41%), Gaps = 92/765 (12%)

Query: 7   YGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTDN 65
           +G I +L  FR  G ++D++ + ++  +  +L++  + + + T+   +   + G R    
Sbjct: 114 FGIIRSLSSFRLTGSSKDYIVVGSDSGRIVILEYIPDKN-IFTKVHQETFGKSGCRRIVP 172

Query: 66  GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YGC 120
           GQ   +DP  R  +IG      L  ++  D+  +L  +  +   + Q +  + +    G 
Sbjct: 173 GQYLTVDPKGRAIMIGATEKQKLVYIMNRDSAARLTISSPLEAHKSQSICFQIVGVDVGF 232

Query: 121 AKPTIVVLYQDNKDARHVKTYEV--ALKDKDFVEGPWSQNN--------LDNGADLLIPV 170
             P    L  D ++   V T EV  A++   F E     N+        LD  A+ LI V
Sbjct: 233 ENPLFACLEVDYEEIDSVNT-EVGKAIQTLTFYELDLGLNHVVRKYAEPLDYFANHLIAV 291

Query: 171 P-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVD-------ADGSRYLL 218
           P     P   GV++  E  I+Y   N  + + +R  I +    ++        +   +LL
Sbjct: 292 PGGSEGP--SGVILCCENYIIY--KNLGEQLDVRCPIPRRRNDLEDPDRGILVNAFFFLL 347

Query: 219 GDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVV--YIGSSYGD------ 270
               G +  + +    + VT +KI+      +A++I  L +  +  Y  +  GD      
Sbjct: 348 QTEQGDIFKVTLEVFDDIVTEIKIKYFDTLPVANSICMLKSGFLNLYQIAKLGDNDDETE 407

Query: 271 -SQLIKLN------LQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYK 323
            S L+ L+        P    + ++V +  ++L PI+ F V D+  +   Q+    G   
Sbjct: 408 FSSLMPLDEGDTFFFAPRGLQNLIQV-DELLSLSPIMSFEVADIVYEDTPQLYAACGRGP 466

Query: 324 DGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTDDPFDTFLVVSFISETRILAM-NLE 381
             S++++R+G+ + E A  EL G    +W+++   D  +D ++VVSF++ T +L++    
Sbjct: 467 RSSIKVLRHGLEVTEMAISELPGNPNAVWTVKKHQDAEYDAYIVVSFVNATLVLSIGETV 526

Query: 382 DELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNV 441
           +E+ ++ + G                  VQ+    +R +    R   NEWK P    +  
Sbjct: 527 EEVTDSVVLGVVD---------------VQIYPEGIRHIRCDRR--VNEWKVPGRRVITN 569

Query: 442 ATANASQVLLATGGGHLVYLEIGD-GILTE-VKHAQLEYEISCLDINPIGENPSYSQIAA 499
              N  QVL+A  GG +VY E+ + G L E  +  ++  ++  + +  I       +  A
Sbjct: 570 CAVNCRQVLIALNGGEIVYFELDEGGQLNEYTERKEMPCDVVAMALADIPPGEIRCRFLA 629

Query: 500 VGMWTDISVRIFSLPDLNLITKEHLGG-EIIPRSVLLCAFEGI---SYLLCALG------ 549
           V + TD +VR+ SL   + +T  ++     + +S+ + A  G    S L   +G      
Sbjct: 630 VAL-TDRTVRVISLDPSDCLTPLNMQSLPAMAQSLCIAAVGGDTMESTLFLNIGLEVNKF 688

Query: 550 -DGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTT--------HVFAASDRPTV 600
            +G LL  +L+  TGEL D +   LG +P+ L      +          ++ A S R  +
Sbjct: 689 TNGVLLRTVLDSVTGELQDTRTRYLGMKPVKLFKVKMHDNIAVMNSVQLNIIAMSSRAWL 748

Query: 601 IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL-HIRSIPL 659
           +Y    +   + ++   + +   F S   P+ +       L I  I+++  + +  S  L
Sbjct: 749 LYKYQTRYHLTPLSYIPLEYASNFASDQCPEGVVAIAGNSLRILAIENLGTIFNYSSFQL 808

Query: 660 GEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEF 704
              PRR     ++         + S   +++M    +L  +  E 
Sbjct: 809 SHTPRRFVFHPETGNIITVETDHNSLTTQTKMKRREILAKEMIEL 853



 Score = 82.8 bits (203), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 71/283 (25%), Positives = 132/283 (46%), Gaps = 27/283 (9%)

Query: 760  KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSEC-GHHGHILA 818
            +L+++       AV +L +F G+LLA+  + +++Y     D G ++L  +C   H   L 
Sbjct: 970  RLEMMHRTTVDEAVTALASFRGRLLASAGKMLRIY-----DLGKKKLLRKCENKHMPNLI 1024

Query: 819  LYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENN 878
             ++ T G  I VGD+ +S+    YK  E  +   A D +  + SA+ ILD D       +
Sbjct: 1025 THILTMGHRIFVGDVQESVFFYRYKPIENQLVVFADDTHQRFCSAMCILDYDTVALRLPS 1084

Query: 879  FNLFTVRKNS--------EGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQ 930
                 V+++         +G  +    + E+V  +++GE V   +   L+        G 
Sbjct: 1085 DCTDDVQEDPTGIRALWDKGILNGASQKCEMVATFYVGECVTCLQKAMLI-------PGS 1137

Query: 931  IPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTV 989
              ++++ T++G+IG +      E Y F + L+ +LR     + G +H  +RSF       
Sbjct: 1138 SESLVYSTMSGMIGALVPFSSKEDYEFFQHLEMHLRTEYPPLCGRDHLAYRSF-----YA 1192

Query: 990  DAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
              K  +DGDL E +  L   +  EIS  ++    E+ K++E++
Sbjct: 1193 PVKGVIDGDLCEQYCLLEYGKQKEISNELDRVPSEIMKKLEDI 1235


>gi|296086939|emb|CBI33172.3| unnamed protein product [Vitis vinifera]
          Length = 934

 Score =  114 bits (284), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 90/353 (25%), Positives = 163/353 (46%), Gaps = 47/353 (13%)

Query: 152 EGPWSQNNLDNGADLLIPVP---PPLCGVLIIGEETIVYCS---ANAFKAIPIRPSITKA 205
           +G WS+  +DNGA++L+ VP       GVL+  E  ++Y +    +    IP R  +   
Sbjct: 81  KGKWSEQ-VDNGANMLVTVPGGGDGPSGVLVCAENFVIYKNQGHPDVRAVIPRRADLPAE 139

Query: 206 YGR--VDADGSR------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYL 257
            G   V A   R      +LL    G +  + + HE ++++ LKI+      + S++  L
Sbjct: 140 RGVLIVSAATHRQKSMFFFLLQTEYGDVFKVTLEHENDRISELKIKYFDTIPVTSSMCVL 199

Query: 258 DNAVVYIGSSYGDSQLIKLN--------------------------LQPDAKGSYVEVLE 291
            +  ++  S +G+  L +                             QP    + V + +
Sbjct: 200 KSGFLFAASEFGNHGLYQFQAIGDDADVESSSASLMETEEGFQPVFFQPRGLKNLVRI-D 258

Query: 292 RYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGM 350
           +  +L PI+D  V +L  +   Q+    G     S+RI+R G+ I+E A  +L G+   +
Sbjct: 259 QVESLMPIMDMKVSNLFEEETPQIFALCGRGPRSSIRILRPGLAISEMAVSQLPGVPSAV 318

Query: 351 WSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLV 410
           W+++ + +D FD ++VVSF + T +L++   + +EE    GF   T +L       + L+
Sbjct: 319 WTVKKNVNDEFDAYIVVSFANATLVLSIG--ETVEEVSDSGFLDTTPSLAVSLIGDDSLM 376

Query: 411 QVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI 463
           QV    +R +    R   NEW++P   ++    +N  QV++A  GG L+Y E+
Sbjct: 377 QVHPSGIRHIREDGR--INEWRTPGKRTIVKVGSNRLQVVIALSGGELIYFEV 427



 Score = 97.1 bits (240), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 100/370 (27%), Positives = 168/370 (45%), Gaps = 54/370 (14%)

Query: 694  VRLLDDQTFEFISTYPLDTFEYGCSILSCSFSD-DSNVYYCVGTAYVLP--EENEPTKGR 750
            +R+LD +T        L   E   SI + +F D +      VGTA  L    +     G 
Sbjct: 583  IRILDPRTATTTCLLELQDNEAAFSICTVNFHDKEYGTLLAVGTAKSLQFWPKRSFDAGY 642

Query: 751  ILVF-IVEDGK-LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQS 808
            I ++  +EDGK L+L+ + + +G   +L  F G+LLA I   ++LY     D G R L  
Sbjct: 643  IHIYRFLEDGKSLELLHKTQVEGVPLALCQFQGRLLAGIGSVLRLY-----DLGKRRLLR 697

Query: 809  ECGHH---GHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVE 865
            +C +      I++++  T  D I VGD+ +S     Y+ +E  +   A D    W++A  
Sbjct: 698  KCENKLFPNTIVSIH--TYRDRIYVGDIQESFHYCKYRRDENQLYIFADDSVPRWLTASY 755

Query: 866  ILDDDIYLGAENNFNLFTVR------------------KNSEGATDEERGRLEVVGEYHL 907
             +D D   GA+   N++ VR                  K  +G  +    ++E + ++H+
Sbjct: 756  HIDFDTMAGADKFGNIYFVRLPQDVSDEVEEDPTGGKIKWEQGKLNGAPNKVEEIVQFHV 815

Query: 908  GEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIG-VIASLPHEQYLFLEKLQTNLRK 966
            G+ V   +  SL+        G    +I+GTV G +G ++A    +   F   L+ ++R+
Sbjct: 816  GDVVTCLQKASLI-------PGGGECIIYGTVMGSLGALLAFTSRDDVDFFSHLEMHMRQ 868

Query: 967  VIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESF----LDLSRTRMDEISKTMNVSV 1022
                + G +H  +RS          K+ +DGDL E F    LDL R   DE+ +T     
Sbjct: 869  EHPPLCGRDHMAYRS-----AYFPVKDVIDGDLCEQFPTLPLDLQRKIADELDRTPG--- 920

Query: 1023 EELCKRVEEL 1032
             E+ K++EE+
Sbjct: 921  -EILKKLEEV 929


>gi|358253751|dbj|GAA53701.1| splicing factor 3B subunit 3, partial [Clonorchis sinensis]
          Length = 726

 Score =  112 bits (281), Expect = 9e-22,   Method: Compositional matrix adjust.
 Identities = 135/594 (22%), Positives = 248/594 (41%), Gaps = 75/594 (12%)

Query: 1   MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
           +L    +G I +   FR  G ++D+L + ++  +  VL++   S     +   +   + G
Sbjct: 56  LLTCEAFGIIRSFMPFRLTGSSKDYLIVGSDSGRVVVLEY-IPSKNAFEKLQQETFGKSG 114

Query: 61  -RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
            R    GQ   ++P  R  +IG      L  ++  D++ +L  +  +   +   L    +
Sbjct: 115 CRRIVPGQYLAVNPKGRAFMIGAVEKQKLVYIMNRDSQARLTISSPLEAHKSNTLVYHMV 174

Query: 118 ---YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNG 163
               G   P    +  D +DA    T E A        + E     N+        L+  
Sbjct: 175 GVDVGFENPLFACIEMDYEDADQDSTGEAARGANQLLTYYELDLGLNHVVRKYSEPLEEH 234

Query: 164 ADLLIPVP---PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLL-- 218
           A+ LI VP       G+LI  E  I Y   N      IR  I +    +D D  R +L  
Sbjct: 235 ANFLIAVPGGNEGPSGILICSENYITY--KNFGDQPDIRCPIPQRRNNLD-DPDRGILFV 291

Query: 219 --GDHA--GLLHLLVITHE-----------KEKVTGLKIELLGETSIASTISYLDNAVVY 263
               H    L   L  T +            + VT +K++      +AS +  L    ++
Sbjct: 292 CSASHKTKNLFFFLAQTEQGDIFKINLEVDDDMVTEIKLKYFDTLPVASAMCVLKTGFLF 351

Query: 264 IGSSYGDSQLIKLN------------------------LQPDAKGSYVEVLERYVNLGPI 299
           + S +G+  L ++                           P A  + +EV +   NL PI
Sbjct: 352 VASEFGNHSLYQIAHLGDDDDEPEFSSAMPLEEGDTFYFAPRALKNLIEV-DVLENLSPI 410

Query: 300 VDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTD 358
           + + + D   +   Q+    G     + R++R+G+ ++E A  +L G    +W+++ +++
Sbjct: 411 MHYHIADFANEDTPQLAVLCGRGPGSTFRLLRHGLEVSEMAKSDLPGNPNAVWTVKRNSE 470

Query: 359 DPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVR 418
           + +D +++VSF++ T +L++   + +EE    GF   T TL C     + LVQV    +R
Sbjct: 471 EEYDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLTCSQLGDDALVQVYPDGIR 528

Query: 419 LVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKHAQL 476
            + +  R   N W++P    +     N  QV++A  GG LVY E+   G L E  +  ++
Sbjct: 529 HIRADKRV--NVWRAPGKKMIVRCAVNRRQVVIALTGGELVYFEMDMTGQLNEYTERKEM 586

Query: 477 EYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEII 529
             ++ C+ +  I  +   S+  AVG+  D +VRI SL P ++ ++ E     ++
Sbjct: 587 PADVICMALGRIPTSEQRSRFLAVGL-ADNTVRILSLDPSVSRVSCERFSTPMV 639


>gi|341893349|gb|EGT49284.1| hypothetical protein CAEBREN_30765 [Caenorhabditis brenneri]
          Length = 213

 Score =  112 bits (279), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 68/213 (31%), Positives = 106/213 (49%), Gaps = 10/213 (4%)

Query: 834  MKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR-KNSEGAT 892
            M+S+SLL Y+  EG  EE A+D+N+ WM   E +  +  LG E + N+FTV    S   T
Sbjct: 1    MRSVSLLSYRTLEGNFEEVAKDWNSEWMVTCEFITAESILGGEAHLNMFTVEVDKSRPVT 60

Query: 893  DEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHE 952
            D+ R  LE  G ++LGE         LV +  D+ +     +++GT  G +G++  +   
Sbjct: 61   DDGRYVLEPTGYWYLGELTKVMIRAVLVPQPDDNSIRYTQPIMYGTNQGSLGLVVQIDDM 120

Query: 953  QYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMD 1012
               FL  ++  +    K    + H  +RSF   K+      F+DGDLIES LD+ R+R  
Sbjct: 121  YKKFLLSIEKAISDAEKNCMQIEHSTYRSFTYNKRIEPPSGFIDGDLIESILDMDRSRAI 180

Query: 1013 EISKTMNV---------SVEELCKRVEELTRLH 1036
            EI +  N             E+ K +++L+R H
Sbjct: 181  EILEKANTRGWDPSIPKDPVEILKIIDDLSRTH 213


>gi|260947152|ref|XP_002617873.1| hypothetical protein CLUG_01332 [Clavispora lusitaniae ATCC 42720]
 gi|238847745|gb|EEQ37209.1| hypothetical protein CLUG_01332 [Clavispora lusitaniae ATCC 42720]
          Length = 1242

 Score =  111 bits (278), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 214/1010 (21%), Positives = 400/1010 (39%), Gaps = 165/1010 (16%)

Query: 156  SQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSA----NAFKAIPIRPS-----ITKAY 206
            S+ ++D  A+LLIP+P P  GVLI  +  ++Y S       +   P+R       IT  Y
Sbjct: 259  SKPDIDASANLLIPLPAPAGGVLICCKSYLIYESTPGGTRKYLQFPVRSDTSEVVITNFY 318

Query: 207  -GRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIG 265
              ++       L+    G L  + ++ E E +T           + ++++  ++  ++  
Sbjct: 319  VHKLKKKEFFVLVQSSLGDLFKITVSIEGETLTAFYATYFDTIPVCTSLNIFNSGFLFAN 378

Query: 266  SSYGDS---QLIKLNLQPDAKGSYVE--------------------VLERYVNLGPIVDF 302
            S+  +    Q   L    D   S V                     ++E   +L P+VD 
Sbjct: 379  STDNNKFFYQFESLGEADDTTTSSVSEKSDFPTKTFFDSRGLKNLSLVEVLGSLSPLVDS 438

Query: 303  CVVDLERQ----GQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQ-GIKGMWSLRSST 357
             +++           Q+VT +    +  ++ +  G+ ++E  S  +      +++ + S 
Sbjct: 439  TMLETYSSEFPDPLKQLVTLAS---NSYMKTLTYGVPVSELVSSPVPINPTSIFTTKISK 495

Query: 358  DDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSV 417
                D +LV++     + + +++ + +EE    GF     T+       + LVQV +  +
Sbjct: 496  KSINDDYLVLTSSLTMKSMVLSIGEVVEEVSDSGFALDQHTIAVQQIGQHSLVQVHNNGI 555

Query: 418  RLVSS--------TSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI--GDGI 467
            R + +        TS+   N W  P G  +  A+ N  QVL+      + Y EI   D  
Sbjct: 556  RHIRNFFDDSDNVTSKRETN-WYPPAGIVILHASTNNEQVLVGLSNREVCYFEIDPADDQ 614

Query: 468  LTEVKHAQLEY---EISCLDINP--IGENPSYSQIAAVGMWTDISVRIFSLPD--LNLIT 520
            L E +  +LE     I+ L +    + E    S  A +G   +    I  LP     ++T
Sbjct: 615  LIEYQE-RLEVTGGSITALALASKFVTEGDRKSNFAVIGTSEETIQVISLLPKTCFEIVT 673

Query: 521  KEHLGGEIIPRSVLLCAFEGISYLL-CALGDGHLLNFLLNMKTGELTDRKKVSLGTQPIT 579
             + L       S+L+   +  +Y +   + +G      +++ +G+L D +   LGTQP++
Sbjct: 674  LQALSANCF--SLLMMPLDKDNYFVHIGMINGVYARVRIDVVSGKLGDSRLKYLGTQPVS 731

Query: 580  LRTFSSKN--TTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAF------PD 631
            LR+ +  N  ++ + A S RP + Y +N+    SN  L  +  +   + +AF       +
Sbjct: 732  LRSLALPNVASSGILAISSRPWIGYFNNE----SNFKLTPLIDISIKDGSAFYSEDIGIE 787

Query: 632  SLAIAKEGELTIGTI--------DDIQKLHIRSIPLGEHPRRICHQEQSRTFAICS---- 679
            S+   ++GE+TI T+        D    L I SI L   PR+      +  F +      
Sbjct: 788  SVVGIRDGEITIFTVGGEEGEGFDVTNDLTISSIKLRYAPRKQLKDSLTDYFFVIESEFG 847

Query: 680  -----LKNQSC------------AEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSC 722
                 L+++               E S    ++++D        T+     E   S+ S 
Sbjct: 848  VISPYLESKEVDAEYFEVFGYKKKENSWASCLQIVDVAGSGIEYTHEFTNNECTISLASI 907

Query: 723  SFSDDSNVYYCVGTAYVLPEENEPTKGR-ILVFIVEDG---KLQLIAEKET--KGAVYSL 776
               D+   Y  VGT   L        G  I  F ++     K +L+   +T  +    SL
Sbjct: 908  KIRDEE--YLIVGTTENLVYSPHYYSGNSIYTFRIKRNAKKKPELVYLHKTTIEFPPSSL 965

Query: 777  NAFNGKLLAAINQKIQLY----KWMLRDDGTR-ELQSECGHHGHILALYVQTRGDFIVVG 831
             AFN KLL     +++LY    K +LR   T+ +         HI        GD IVV 
Sbjct: 966  CAFNEKLLVGAGNQLRLYDVGRKQLLRKTSTKIDFLRRINKIQHI-------AGDVIVVC 1018

Query: 832  DLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR------ 885
            D  +S+S + +   +        D     ++A+E+LD    +  +   N+F  R      
Sbjct: 1019 DSSESVSFMRFDQTKNQFIAFCNDTAKRQITALEVLDSRTVIAGDRFGNIFVSRIPKNVA 1078

Query: 886  ---------KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIF 936
                     K  E        RL+ + +++  + V     GS V       VG   ++I+
Sbjct: 1079 EQLENNVLMKFEEETLGASSSRLDKLCDFYTQDIVTSLHKGSFV-------VGGSESIIY 1131

Query: 937  GTVNGVIGVIASLPHEQYL-FLEKLQTNLRKVIK-------------GVGGLNHEQWRSF 982
              + G +G++  L   Q +  L KL+ +LR                  + G  H ++R +
Sbjct: 1132 TGLQGTVGILLPLATTQEVDLLMKLENSLRDYFNDSFDDFDNTKQGFNLVGREHLKFRGY 1191

Query: 983  NNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
             N       +N +DGD IE F +L+ +   +++  ++ S  ++ +++ +L
Sbjct: 1192 YNP-----VENVIDGDFIERFFELNPSAQVKLAGRLDKSPRDIERKIYDL 1236


>gi|312380158|gb|EFR26239.1| hypothetical protein AND_07834 [Anopheles darlingi]
          Length = 1503

 Score =  111 bits (278), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 121/481 (25%), Positives = 212/481 (44%), Gaps = 67/481 (13%)

Query: 609  LYSNVNLKEVSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDIQKLH----IRSIPLGEHP 663
            LY+   +K  +   PFN+   P+  L   ++ +L I  +           +R IPL   P
Sbjct: 978  LYARTVMKAFA---PFNNVNCPNGFLYFDEQYQLKISILPTYLSYDSVWPVRKIPLRSSP 1034

Query: 664  RRICHQEQSRTFAICSLKNQSC-------AEESEM----------------HFVRLLDDQ 700
            ++I +  ++R + +     + C        E+ E+                  V L++  
Sbjct: 1035 KQIVYHRENRVYCVVMDAEEICNKYYRFNGEDKELTEENKGERFLYPMGHQFSVVLVNPA 1094

Query: 701  TFEFI--STYPLDTFEYGCSILSCSFSDDS-----NVYYCVGTAYVLPEENEPTKGRILV 753
             +E +  +   L+ +E+  S+ + S + +        Y  VGT +    E+  ++GR+L+
Sbjct: 1095 AWEIVPDTAIALEEWEHVVSLKNVSLAYEGARSGLKEYIAVGTNFNY-SEDITSRGRLLL 1153

Query: 754  F-----IVEDGK------LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDG 802
            +     + E GK       + +  K+ KG V +++   G L+ A+ QK+  Y W ++DD 
Sbjct: 1154 YDIIEVVPEPGKPLTKHKFKEVIVKDQKGPVSAISHVCGFLVGAVGQKV--YLWQMKDD- 1210

Query: 803  TRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMS 862
              +L        +I    + +    I+V D+ KS+SLL ++ E   +   +RDY+   + 
Sbjct: 1211 --DLVGVAFIDTNIFVHQMVSIKSLILVADVYKSVSLLRFQDEFRTLSLVSRDYHPLNVY 1268

Query: 863  AVEILDDDIYLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVN-RFR--- 915
             VE + D+  LG   A++  NL T     E        RL   G+YHLG+ VN  FR   
Sbjct: 1269 QVEYVVDNTNLGFLVADDQANLITYMYQPESRESFGGQRLLRKGDYHLGQRVNAMFRVQC 1328

Query: 916  --HGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGG 973
              H S VMR    +     T  F T++G  G +  LP + Y  L  LQ  L        G
Sbjct: 1329 DFHESDVMRR-TLNYDNKHTTFFATLDGGFGFVLPLPEKTYRRLFMLQNVLLTHSPHTCG 1387

Query: 974  LNHEQWRSFNNEKKTV--DAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEE 1031
            LN + +R+    +      ++  +DGDL+ SFL+L      E++K +   +EE+C  + E
Sbjct: 1388 LNPKAYRTIKQSRALPINPSRCVVDGDLVWSFLELPANEKQEVAKKIGTRIEEICADLME 1447

Query: 1032 L 1032
            +
Sbjct: 1448 I 1448



 Score = 67.0 bits (162), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 122/554 (22%), Positives = 210/554 (37%), Gaps = 140/554 (25%)

Query: 1   MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
           M    ++G I +L+     G  +D L I+    K  V+Q+D ++ +L T ++    D   
Sbjct: 81  MASYRLFGNIMSLQSVSLAGSQRDALLISFPDAKLSVVQFDPDNFDLKTLSLHYFEDEDI 140

Query: 61  RPTDNGQIGI----IDPDCRLIGLHLYDGLFKVIPFDNKGQLKE---------------- 100
           R    G   I    +DPD R   + +Y     V+PF     L E                
Sbjct: 141 RGGWTGHYHIPLVRVDPDNRCAVMLVYGRKLVVLPFRKDSSLDEIEMQDVKPIKKTPTLL 200

Query: 101 --------AFNIRLEEL-----QVLDIKFLYGCAKPTIVVLYQDNKD------ARHVKTY 141
                   ++ I L++L      V+D++FL+G  +PT+++LY+  +        R     
Sbjct: 201 IAKTPILASYIIELKDLDEKIDNVIDVQFLHGYYEPTLLILYEPVRTFPGRIAVRSDTCT 260

Query: 142 EVALK---DKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVY----------- 187
            VAL     +      W+ N+L       +P+  P+ G L++   +++Y           
Sbjct: 261 MVALSLNIQQRVHPVIWTVNSLPFDCLQAVPISKPIGGCLVMCVNSLIYLNQSVPPYGVS 320

Query: 188 --CSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELL 245
              SA+     P++P           DG R  L D A    +  I  EK  ++    EL 
Sbjct: 321 LNSSADHSTNFPLKPQ----------DGVRISL-DAA---QVCFIESEKLVLSLKGGELY 366

Query: 246 GETSIAST---ISYLDNAVVYIGSSYGDSQLIK-------LNLQPDAKG----------- 284
             T  A +   I   +   +++GS  G+S L++       L +  D  G           
Sbjct: 367 VLTLCADSMRSICVCETEYLFLGSRLGNSLLLRFREKDESLVITIDDSGTVEKEQKRQRL 426

Query: 285 -------------------SYV-EVLERYVNLGPIVDFCVVD---------------LER 309
                              SY+ EV +  +N+GPI    V +               +  
Sbjct: 427 EEEELEVYGSGYKTSVQLTSYIFEVCDSVLNIGPIAHMAVGERICEEEMEEGAEVQFVPN 486

Query: 310 QGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL-------------RSS 356
           +   +VVT SG  K+G+L ++++ I      S  L G   +W++             R  
Sbjct: 487 KLDVEVVTASGHGKNGALCVLQSSIKPQVITSFGLSGCLDVWTVFDEAAGPGGVTGTRKP 546

Query: 357 TDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQ-LVQVTSG 415
            D P      +    E   + +   +E+ E E  GF +   T+   +   N+ +VQVT+ 
Sbjct: 547 DDAPPPNHAFMILSQEGATMVLQTGEEINEIENTGFATDVPTIHVGNIGSNRFIVQVTTK 606

Query: 416 SVRLVSSTSRELRN 429
           S+RL+  T R L+N
Sbjct: 607 SIRLLQGT-RLLQN 619


>gi|356570929|ref|XP_003553635.1| PREDICTED: uncharacterized protein LOC100799711 [Glycine max]
          Length = 1258

 Score =  111 bits (277), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 120/506 (23%), Positives = 214/506 (42%), Gaps = 72/506 (14%)

Query: 269 GDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLR 328
           GD  ++KL    D +  Y   ++   N+ PI+D  VVD   +   Q+  C G   +GSLR
Sbjct: 443 GDGMVLKLE---DGRLCYTNPIQ---NIAPILDMEVVDYHDEKHDQMFACCGVAPEGSLR 496

Query: 329 IVRNGIGI---NEQASVELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELE 385
           I+RNGI +   +  AS+  QG+ G W++R    D   +FLV+SF+ ETRIL++ L    +
Sbjct: 497 IIRNGINVENLHRTASI-YQGVSGTWTVRMKVTDSHHSFLVLSFLDETRILSVGLSFT-D 554

Query: 386 ETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATAN 445
            T+  GF     TL C       LVQ+   +V+L   T                  A+ +
Sbjct: 555 VTDSVGFQPNVCTLACGLVTDGLLVQIHRSTVKLCLPTK-----------------ASHS 597

Query: 446 ASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTD 505
             ++     GG +  +  G   LT                N IG   S S    V + + 
Sbjct: 598 EVEIWFFAPGGGITVVACGTISLT----------------NTIGSVKSDSIPQDVRLVSA 641

Query: 506 ISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNM--KTG 563
               + +     ++ +     E  P S        I+ +  AL   +L+N + N   K  
Sbjct: 642 DKYYVLAGLRNGMLLRFEWPAEPCPSS-------PINMVDTALSSTNLVNSVTNAFDKRN 694

Query: 564 ELTDRKKV----SLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVS 619
           +L    ++     +G  PI L          +   +DRP +++S+ + L Y++++ +  +
Sbjct: 695 DLPSMLQLIAIRRIGITPIFLVPLGDTLDADIIVLADRPWLLHSARQGLSYTSISFQPAT 754

Query: 620 HMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAI-- 677
           H+ P +   FP  +    E  L +  +   ++L+++   L   PR++ + ++S+   +  
Sbjct: 755 HVTPVSCVEFPKGILFVAENSLHLVEMGHGKRLNVQKFHLEGTPRKVLYHDESKMLLVMR 814

Query: 678 CSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTA 737
             L    C  +     +  +D  +   +S++ L+  E G S+        S     VGT+
Sbjct: 815 TELNCGPCLSD-----ICCVDSLSGSVLSSFRLELGETGKSMELVRVG--SEQVLVVGTS 867

Query: 738 -----YVLPE-ENEPTKGRILVFIVE 757
                + +P  E E  KGR+LV  ++
Sbjct: 868 LSSGPHTMPTGEAESCKGRLLVLCLD 893



 Score = 52.4 bits (124), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 50/201 (24%), Positives = 82/201 (40%), Gaps = 17/201 (8%)

Query: 828  IVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKN 887
            I VGD    I L  Y  E   +E    D +   ++   ++D D  + ++   ++  +  +
Sbjct: 1023 IAVGDCRDGILLYSYHEEAKKLELLYNDPSQRIVADCILMDADTAVVSDRKGSIAVLCSD 1082

Query: 888  SEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV---GQIP---------TVI 935
                    +  + +   Y + E     + GS   RLP  DV   G  P         T+I
Sbjct: 1083 HLEDNAGAQCNMTLSCAYFMAEIAMSIKKGSYSYRLPADDVLEGGNGPKTNVDSLQNTII 1142

Query: 936  FGTVNGVIGVIASLPHEQYLFLEKLQTNL--RKVIKGVGGLNHEQWRSFNNEKKTVDAKN 993
              T+ G I +   L  E+Y  LE +Q  L    +   V G +H ++RS  N    V    
Sbjct: 1143 ASTLLGSIMIFIPLSREEYELLEVVQARLVVHHLTAPVLGNDHHEFRSRENR---VGVPK 1199

Query: 994  FLDGDLIESFLDLSRTRMDEI 1014
             LDGD++  FL+L+  +   I
Sbjct: 1200 ILDGDILTQFLELTSMQQKMI 1220


>gi|322700233|gb|EFY91989.1| Pre-mRNA-splicing factor rse-1 [Metarhizium acridum CQMa 102]
          Length = 1039

 Score =  111 bits (277), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 127/583 (21%), Positives = 239/583 (40%), Gaps = 79/583 (13%)

Query: 1   MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
           +L   ++G I ++  FR  G ++D++ +A++  +  ++++    +      +        
Sbjct: 59  LLSHDVFGIIRSIAAFRLAGSSKDYIILASDSGRIAIVEYAPTKNSFSRIHLATFGKSGV 118

Query: 61  RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL- 117
           R    GQ    DP  R  LI     + L  V+  + + +L  +  +   +  +L    + 
Sbjct: 119 RRVIPGQYLAADPKGRACLIASIEKNKLVYVLNRNAEAELTISSPLEAHKHGILVFSVVS 178

Query: 118 --YGCAKPTIVVLYQDNKDARHVKTYEVALK--------DKDFVEGPWSQNNLDNGADLL 167
              G A P    L  D  +A    + EV ++          + V   WS+  +D  + LL
Sbjct: 179 LDVGYATPVFAALEADYTEAEQDDSAEVEIQLVYYELDLGLNHVVRKWSEV-VDPTSSLL 237

Query: 168 IPVP-----PPLCGVLIIGEETIVYCSAN--AFKA-IPIRPSITK--AYGRVDADGSRYL 217
             VP     P   GVL+ GEE + Y  +N  AF+  IP R   T+     R+   G  + 
Sbjct: 238 FQVPGGSDGP--SGVLVCGEENVTYRHSNQEAFRVPIPRRRGATEDPQRRRIIVSGVMHK 295

Query: 218 LGDHAGLLHLLVITHEKE------------------KVTGLKIELLGETSIASTISYLDN 259
           L         L+ T + +                  +VT +KI+      ++++I  + +
Sbjct: 296 LKGSTSAFFFLLQTDDGDLFKVTLDMAEDADGNLTGEVTRIKIKYFDTIPVSNSICIMKS 355

Query: 260 AVVYIGSSYGDSQLIKL---------------NLQPDAKGSY------------VEVLER 292
             ++    +G+ Q  +                +   D + SY            + ++E 
Sbjct: 356 GFLFTAGEFGNYQFYQFEKLGDDDDETEFTSDDFPTDYQASYHPVYFHPRPLENLTLVES 415

Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMW 351
             ++ P++D  ++    +   Q+    G     +LRI+R+G+ +NE  + EL G    +W
Sbjct: 416 VSSMAPLMDCKILTQAGEDISQIYAACGNGARSTLRILRHGLEVNELVASELPGTPSAVW 475

Query: 352 SLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQ 411
           + + +  D +D +++++F+ +T +L++   + + +    GF +   TL       N L Q
Sbjct: 476 TTKLTQSDDYDAYIILTFLHDTMVLSVG--ETVTQVTDSGFITTVATLAVQQIGKNSLFQ 533

Query: 412 VTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE 470
           V S  +R + S       EW  P   ++  A  N  QV +A   G +VY E+  DG L E
Sbjct: 534 VYSKGIRHIQSGQ---FTEWPVPQHRTIVAAATNERQVAIALSSGEIVYFEVDEDGSLAE 590

Query: 471 VKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL 513
               +    ++ L + P+ E    S   AVG   D +VRI SL
Sbjct: 591 FDERKEIGSVTSLGLGPVPEGRLRSPFLAVGC-DDCTVRILSL 632



 Score = 84.3 bits (207), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 84/339 (24%), Positives = 163/339 (48%), Gaps = 25/339 (7%)

Query: 710  LDTFEYGCSILSCSF-SDDSNVYYCVGTAY-VLPEENEPTKGRILVF-IVEDGK-LQLIA 765
            L+  E G S     F S D   +  +GT   ++    + ++G I V+   +DG+ L+ I 
Sbjct: 704  LEGNEAGVSAAVVPFTSQDGESFLIIGTGKDMIVNPRQSSEGFIHVYRFHDDGRSLEFIH 763

Query: 766  EKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRG 825
            + + +    +L +F+G+LLA I + +++Y   +R    R+ Q++     HI++L  Q++G
Sbjct: 764  KTKVEEPPTALLSFHGRLLAGIGKTLRIYDLGMRQL-LRKAQADISPQ-HIVSL--QSQG 819

Query: 826  DFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR 885
              IVVGD+   +++++Y      +     D  A W + + + D +   G +   N++ VR
Sbjct: 820  FRIVVGDVQHGVTMVVYNPVSNKLLPFVDDTIARWTTCLAMADYESVAGGDKFGNIWIVR 879

Query: 886  --KNSEGATDEERGRLEVV-GEYHLGEFVNRFRHGS--LVMRLPDS------DVGQIPTV 934
                +    DE    +++  G+ +L    +R +  +   V  +P S       VG    +
Sbjct: 880  CPDKASAEADEPGSDVQLSNGQSYLHGAAHRLQLMAHMFVQDIPTSICKTSLVVGGQDVL 939

Query: 935  IFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKN 993
            ++  + G IGV+  L   E   F + L+ ++R     + G +H  +R ++     V  K 
Sbjct: 940  LWSGLQGTIGVLIPLVTRETADFFQTLEMHMRNEDPPLAGRDHLMYRGYH-----VPVKG 994

Query: 994  FLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
             +DGDL E F  LSR +   I+  ++ SV E+ +R+ ++
Sbjct: 995  VIDGDLCERFSLLSREKKQMIAGELDRSVREVERRISDV 1033


>gi|297816810|ref|XP_002876288.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297322126|gb|EFH52547.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 633

 Score =  110 bits (274), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 145/634 (22%), Positives = 257/634 (40%), Gaps = 125/634 (19%)

Query: 497  IAAVGMWTDISVRIFSL-PD--LNLITKEHLGGEIIPRSVLLCAFEGIS----------- 542
            + AVG + D +VRI SL PD  L +++ + +     P S+L    +              
Sbjct: 20   LLAVGSY-DNTVRILSLDPDDCLQILSVQSVSS--APESLLFLEVQASIGGEDGADHPAN 76

Query: 543  -YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVI 601
             +L   L +G L   +++M TG+L+D +   LG +P  L + S +  + +   S RP + 
Sbjct: 77   LFLNSGLQNGVLFRTVVDMVTGQLSDSRSRFLGLKPPKLFSISVRGRSAMLCLSSRPWLG 136

Query: 602  YSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDI-QKLHIRSIPLG 660
            Y        + ++ + +    PF+S    + +       L I  ID + +  +   +PL 
Sbjct: 137  YIHRGHFHLTPLSYETLEFAAPFSSDQCAEGVVSVAGDALRIFMIDRLGETFNETVVPLR 196

Query: 661  EHPRRICHQEQSRTFAIC-SLKNQSCAEESE----------------------------- 690
              PR+   Q + +   I  S +    AEE E                             
Sbjct: 197  YTPRKFVLQPKRKLLVIIESDQGAFTAEEREAARKECFEAGGVGENGNGNADQMENGADD 256

Query: 691  ----------------------MHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSD-D 727
                                  +  +R+LD +T        L   E   S+ + +F D +
Sbjct: 257  EDKEDPLSDEQYGYPKAESEKWVSCIRVLDPKTATTTCLLELQDNEAAYSVCTVNFHDKE 316

Query: 728  SNVYYCVGTAYVLPEENEPTKGRILVFI-----VEDGK-LQLIAEKETKGAVYSLNAFNG 781
                  VGT   +  +  P K  +  FI     VE+GK L+L+ + + +G   +L  F G
Sbjct: 317  YGTLLAVGTVKGM--QFWPKKNLVAGFIHIYRFVEEGKSLELLHKTQVEGVPLALCQFQG 374

Query: 782  KLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHI-LALYVQTRGDFIVVGDLMKSISLL 840
            +LLA I   ++LY     D G + L  +C +       + +QT  D I VGD+ +S    
Sbjct: 375  RLLAGIGPVLRLY-----DLGKKRLLRKCENKLFPNTIISIQTYRDRIYVGDIQESFHYC 429

Query: 841  IYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR--------------- 885
             Y+ +E  +   A D    W++A   +D D   GA+   N++ VR               
Sbjct: 430  KYRRDENQLYIFADDCVPRWLTASHHVDFDTMAGADKFGNVYFVRLPQDLSEEIEEDPTG 489

Query: 886  ---KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGV 942
               K  +G  +    +++ + ++H+G+ V   +  S++        G   ++++GTV G 
Sbjct: 490  GKIKWEQGKLNGAPNKVDEIVQFHVGDVVTCLQKASMI-------PGGSESIMYGTVMGS 542

Query: 943  IGVI-ASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIE 1001
            IG + A    +   F   L+ ++R+    + G +H  +RS          K+ +DGDL E
Sbjct: 543  IGALHAFTSRDDVDFFSHLEMHMRQEYPPLCGRDHMAYRS-----AYFPVKDVIDGDLCE 597

Query: 1002 SF----LDLSRTRMDEISKTMNVSVEELCKRVEE 1031
             F    +DL R   DE+ +T      E+ K++E+
Sbjct: 598  QFPTLPMDLQRKIADELDRT----PAEILKKLED 627


>gi|356505471|ref|XP_003521514.1| PREDICTED: uncharacterized protein LOC100806799 [Glycine max]
          Length = 1278

 Score =  110 bits (274), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 124/524 (23%), Positives = 223/524 (42%), Gaps = 77/524 (14%)

Query: 260 AVVYIGSSY-------GDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQGQ 312
           A++++ S Y       GD  ++KL    D +  Y+  ++   N+ PI+D  VVD   + Q
Sbjct: 427 ALLWVESGYLAALVEMGDGMVLKLE---DGRLCYINPIQ---NIAPILDMEVVDYHDEKQ 480

Query: 313 GQVVTCSGAYKDGSLRIVRNGIGI---NEQASVELQGIKGMWSLRSSTDDPFDTFLVVSF 369
            Q+  C G   +GSLRI+RNGI +   +  AS+  QG+ G W++R    D   +FLV+SF
Sbjct: 481 DQMFACCGVAPEGSLRIIRNGINVENLHRTASI-YQGVTGTWTVRMRVTDSHHSFLVLSF 539

Query: 370 ISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRN 429
           + ETRIL++ L    + T+  GF     TL C       LVQ+   +V+L   T      
Sbjct: 540 VEETRILSVGLSFT-DVTDSVGFQPNVCTLACGLVTDGLLVQIHKSTVKLCLPTK----- 593

Query: 430 EWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDINPIG 489
                  +S  +   +      A GGG +  +  G   LT      +            G
Sbjct: 594 -----AAHSEGIPLPSVEIWYFAPGGG-ITVVACGTISLTNTVGTAIS-----------G 636

Query: 490 ENPSYSQIAAVGMWTDISVRIFSLPDLN--LITKEHLGGEIIPRSVLLCAFEGISYLLCA 547
             P   ++  VG +       + L  L   ++ +     E  P S        I+ +  A
Sbjct: 637 CVPQDVRLVFVGKY-------YVLAGLRNGMLLRFEWPAEPCPSS-------PINIVDTA 682

Query: 548 LGDGHLLNFLLNM--KTGELTDRKKV----SLGTQPITLRTFSSKNTTHVFAASDRPTVI 601
           L   +L+N + N   K  +     ++     +G  P+ L          +   SDRP ++
Sbjct: 683 LSSINLVNSVTNAFDKRNDFPSMLQLIAIRRIGITPVFLVPLGDTLDADIITLSDRPWLL 742

Query: 602 YSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGE 661
           +S+   L YS+++ +  +H+ P  S   P  +    E  L +  +   ++L+++   L  
Sbjct: 743 HSARHSLSYSSISFQPSTHVTPVCSVECPKGILFVAENSLHLVEMVHSKRLNMQKFHLEG 802

Query: 662 HPRRICHQEQSRTFAI--CSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSI 719
            PR++ + ++S+   +    L   +C  +     + ++D  +   +S++ L+  E G S+
Sbjct: 803 TPRKVLYHDESKMLLVMRTELNCGTCLSD-----ICIMDPLSGSVLSSFRLELGETGKSM 857

Query: 720 LSCSFSDDSNVYYCVGTAY------VLPEENEPTKGRILVFIVE 757
                  +  +   VGT+       +   E E  KGR+LV  ++
Sbjct: 858 ELVRVGSEQVLV--VGTSLSSGPHTMATGEAESCKGRLLVLCLD 899



 Score = 53.1 bits (126), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 50/201 (24%), Positives = 82/201 (40%), Gaps = 17/201 (8%)

Query: 828  IVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKN 887
            I VGD    I L  Y  E   +E    D +   ++   ++D D  + ++   ++  +  +
Sbjct: 1029 IAVGDCRDGILLYSYHEEAKKLELLYNDPSLRLVADCILMDADTAVVSDRKGSIAVLCSD 1088

Query: 888  SEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV---GQIP---------TVI 935
                    +  + +   Y + E     + GS   RLP  DV   G  P         T+I
Sbjct: 1089 HLEDNAGAQCNMALSCAYFMAEIAMSIKKGSYSYRLPADDVLQGGNGPKTNVDSLQNTII 1148

Query: 936  FGTVNGVIGVIASLPHEQYLFLEKLQTNL--RKVIKGVGGLNHEQWRSFNNEKKTVDAKN 993
              T+ G I +   L  E+Y  LE +Q  L    +   V G +H ++RS  N    V    
Sbjct: 1149 ATTLLGSIMIFIPLSREEYELLEAVQARLVVHHLTAPVLGNDHNEFRSRENR---VGVPK 1205

Query: 994  FLDGDLIESFLDLSRTRMDEI 1014
             LDGD++  FL+L+  +   I
Sbjct: 1206 ILDGDMLTQFLELTSMQQKMI 1226


>gi|344231825|gb|EGV63707.1| hypothetical protein CANTEDRAFT_134986 [Candida tenuis ATCC 10573]
 gi|344231826|gb|EGV63708.1| hypothetical protein CANTEDRAFT_134986 [Candida tenuis ATCC 10573]
          Length = 991

 Score =  110 bits (274), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 160/701 (22%), Positives = 289/701 (41%), Gaps = 113/701 (16%)

Query: 350  MWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQL 409
            +++ + + DD  D +LV+S  SET +L++   +E+ E     F S+  T+         L
Sbjct: 378  IFTTKLTRDDVNDAYLVISS-SETSVLSIG--EEVAEVTDSKF-SKDPTILVQQVGKMAL 433

Query: 410  VQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG--DGI 467
            +QV S  ++ ++    E   +W  P G ++  A++N  Q+++      ++Y E+   D +
Sbjct: 434  IQVYSNGIKHING---EKVTDWFPPAGINIIKASSNNQQLIIGLTNNEVIYFEVDVDDQL 490

Query: 468  LTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGE 527
            +      +L   I+ L I         S+  AV    D +V++ SL   N +       E
Sbjct: 491  VEYQDKVELPTNITALAI---------SKDFAVAGCADETVQVISLKQQNCL-------E 534

Query: 528  IIPRSVLLCAFEGISY----LLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTF 583
            I+   +L      I +    +   + +G  +   ++ + G+L++ +   LGT+PI L   
Sbjct: 535  ILSMQMLSSNSSAIEFSEQEVHIGMENGLFVRTNIDAR-GKLSNTRVKYLGTKPIRL--- 590

Query: 584  SSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEG-ELT 642
             SK    + A S +P V + +N       +N  +++    F S        +   G +L 
Sbjct: 591  -SKINDSILAISSKPWVGFKTNGNFNIVPLNDIDITDGTSFYSEDIGGEGIVGFRGNDLI 649

Query: 643  IGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTF 702
            I TIDD +   I S       + +   E      + SL  +      E            
Sbjct: 650  IFTIDDFRNNFIISTEDIPATKLLVDNE------VYSLGKELVRNNKE------------ 691

Query: 703  EFISTYPLDTFEYGCSILS-CSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKL 761
                      F +  S LS C F D    Y  VG          P       F+     +
Sbjct: 692  ---------KFAFEESPLSICRFKD----YVVVGVT-------NPN------FLYTFKDM 725

Query: 762  QLIAEKETKGAVYSLNAFNGKLLAAINQKIQLY----KWMLRDDGTRELQSECGHHGHIL 817
            +L+ + E +    S+  FNG+LL  ++  ++ Y    K +LR   T        H   I+
Sbjct: 726  KLVHKTEVEMPPRSILEFNGRLLVGMDNLLRTYDLGKKQLLRKSST-----AIKHINKII 780

Query: 818  ALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAEN 877
             +  Q + D IVVGD   + S +  K  E +    + D     ++++  LD D  +G + 
Sbjct: 781  RIVYQGK-DRIVVGD--SNNSTIFCKFIENSFVPISDDTMNRQITSLSTLDYDTVIGGDK 837

Query: 878  NFNLFTVRKN-------SEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQ 930
              N+F  R          E   +   GR + + E+ L +    F  G+LV+  P+     
Sbjct: 838  FGNVFVNRIKYDNTYFVEESYLNGSSGRCQTLAEFFLNDIPMSFTKGTLVLGGPE----- 892

Query: 931  IPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVD 990
               +I+  + G IG++  +    + FL  L   L K +  + G +H ++R + N      
Sbjct: 893  --VIIYAGLQGTIGILLPISESDFKFLSNLSIELNKDL--LLGRDHMKFRGYYNS----- 943

Query: 991  AKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEE 1031
              N +DGD+IE FL+L+ +   +IS  +N SV E+  ++ +
Sbjct: 944  THNVIDGDIIEKFLELNASSRIKISNKLNKSVREIENKIND 984


>gi|261329035|emb|CBH12013.1| damage-specific DNA binding protein, putative [Trypanosoma brucei
            gambiense DAL972]
          Length = 1270

 Score =  109 bits (273), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 179/800 (22%), Positives = 313/800 (39%), Gaps = 127/800 (15%)

Query: 295  NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLR 354
            N GP+ D  V       +  V+  +G   +G L  VR+ + + +   + + G++ +  + 
Sbjct: 426  NCGPVFDVTVA--ADGSRTSVIAGTGVGLNGGLSFVRSAVSVRQD--IRVTGLQNVRQIS 481

Query: 355  SSTDDPFDTFLVVSFISETRILAMNLEDE--LEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
             S D      ++ S    +R+    + +   LEE     F ++   L  ++   N  +QV
Sbjct: 482  VSEDT-----IIFSLPGYSRVCRYCVGETMVLEEIHNTFFGTEETLLLEYNNERNVFLQV 536

Query: 413  TSGSVRLVSSTSRELRNEWKSPPGYSVNVAT-----ANASQVLLATGGGHLVYLEIGDGI 467
            T+            LRN W     Y + V+      A+AS+ LL       +Y+      
Sbjct: 537  TTAG----------LRNVWPDRGEYVIRVSNNDIGHAHASEGLLVFSNPANLYV-FCMKT 585

Query: 468  LTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGE 527
            LT V    L   ISCL +         S    VG W   +V ++ L D  + +K  L   
Sbjct: 586  LTPVATLCLAEAISCLLVFS-------SNSLLVGTWGSCAVHLYELHDGMIQSKVILQCS 638

Query: 528  IIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKT--GELTDRKKVSLGTQPITLRTFSS 585
              P S+ +    G   LL  L +G++ +  ++     GE+ +     L TQP+  R F+ 
Sbjct: 639  ATPCSMCVVFRSGGHRLLVGLHNGYVADVPISGANVYGEMVETM---LTTQPV--RLFNL 693

Query: 586  KNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGE--LTI 643
            ++   V    + P ++  +N     + ++ ++VS          P       + E  L +
Sbjct: 694  ESHAAVLCLGEIPLILIVTNTGFQLTGIDFRDVSACAIMPKMRSPSRYIFFSKSECALIL 753

Query: 644  GTIDDIQKLH------------IRSIP----LGEHPRRICHQEQSRTFAICSLKNQSCAE 687
            G+I  +QKL+            ++ +P    L    RRI  ++Q  +     + N S   
Sbjct: 754  GSITSVQKLNTDYVGLKATATCVKYMPWWNILVLSIRRI-EKDQVVSTMGHEVSNLSVLL 812

Query: 688  ESEMHFVRLLDDQTFEFISTYPL-DTFEYGCSILSCS-FSDDSNVYYCVGTAYVLPEENE 745
            +    F  LL+++   FI    L    E+G    SC   + D+ V   +GT +V P+E  
Sbjct: 813  DEPNSF-ELLENERCAFIDCVALGQANEWG----SCGEVASDAGVVL-IGTTFVFPDEQL 866

Query: 746  PTKGRILVFIVEDGKLQ-------LIAEKETKGAVYS---LNAFNGKLLAAINQKIQLYK 795
                R +   VE  KL+       L   K+ +GA+     +  + G++   I   + LY 
Sbjct: 867  SRSSRFMWCTVEVAKLRTEKTLLRLQGSKDVEGALQCCCIVPNYAGRVALGIGGCVVLYS 926

Query: 796  WMLRDDG-TRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERAR 854
            W   D     E   + G     L   +Q    +IV  D   S   +     +G++   AR
Sbjct: 927  WNAADATFVAEETIQIGTLIVRLIPVMQKEVSYIVASDARHSCFFVRIDTIQGSLNIVAR 986

Query: 855  DYNANWMSAVEILD----DDIYLGAENNFNLFTVR-----KNSEG--ATDEERGRLEVVG 903
            D     +    IL      D+ LG ++ FN F V       NS G  A      +L+   
Sbjct: 987  DPELRGVMDCAILQYESRHDVCLG-DDLFNFFCVSHVEPLANSSGVSAPAMPTKKLQTSA 1045

Query: 904  EYHLGEFVNRFRHGSL--------VMRLPDSDVGQI--PTVIFGTVNGVIGVIASLPHEQ 953
            +YH+G+ +     GS         V+ +P + V  +  P +++GT +G  G I  +  E 
Sbjct: 1046 QYHMGDLITVMHQGSFAPCSVLNDVVPIPATLVRGVCGPQIVYGTSHGAFGAITPISSET 1105

Query: 954  YLFLEKLQTNLRKVIKGVGGLNHEQWRS--------------------------FNNEKK 987
            ++ L+ L+ ++  V+  +GG  H  +R                           FN  ++
Sbjct: 1106 FILLKGLEVSVASVVPPLGGFTHASFREVLRVGQERGASRNASFQVTNPQATELFNRRRR 1165

Query: 988  TVDAKNFLDGDLIESFLDLS 1007
                +    GD++E FL LS
Sbjct: 1166 RCVPRGVCCGDVVEMFLTLS 1185


>gi|427780291|gb|JAA55597.1| Putative mrna cleavage and polyadenylation factor ii complex subunit
            cft1 cpsf subunit [Rhipicephalus pulchellus]
          Length = 1237

 Score =  107 bits (267), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 127/507 (25%), Positives = 211/507 (41%), Gaps = 76/507 (14%)

Query: 591  VFAASDRPTVIYSSNKKLLYSNVNLKEVSHMC--PFNSAAFPDS-LAIAKEGELTIGTID 647
            VF    RP  ++ S++  L  +    +    C  PF++   P   L   K+GEL I T+ 
Sbjct: 735  VFLCGYRPYWLFMSSRGELRCHPMFVDGPIHCFAPFHNVNCPKGFLHFNKQGELRISTLP 794

Query: 648  DI----QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSC-------AEESEMHFVR- 695
                      +R +PL   P  + +   S+T+ + + +   C        EE E   +  
Sbjct: 795  THLTYDAPWPVRKVPLRCTPHFVNYHVDSKTYCVVTSQPDPCNHLVRFTGEEKEYELLER 854

Query: 696  -------LLDDQTFEFIS-----TYP-----LDTFEYGCSILSCSFSDDSNV-----YYC 733
                    +D  + + +S     T P     LD +E+   + +   S +        Y  
Sbjct: 855  DSRYIFPTMDKFSLQLLSPVSWETIPNTRVDLDEWEHLTCLKNVMLSSEGTTTGMKGYLA 914

Query: 734  VGTAYVLPEE--------------NEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAF 779
            +GT Y   E+                P  G+ L       K++++  KE KG V +L+  
Sbjct: 915  LGTNYCYGEDVTSRGRIIILDIIDVVPEPGQPLT----KNKIKIVYSKEQKGPVTALSQV 970

Query: 780  NGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISL 839
             G LL+AI QKI  Y W L+D+   EL         I    V T  + I+VGD+ KS+SL
Sbjct: 971  VGFLLSAIGQKI--YIWQLKDN---ELVGVAFIDTQIYIHSVVTVKNLILVGDVFKSVSL 1025

Query: 840  LIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG-----AENNFNLFTVRKNSEGATDE 894
            L Y+     +   +RD     + AVE   D+  +      AE N  L+  +  S  +   
Sbjct: 1026 LRYQEASRTLSLVSRDVRPLEVYAVEFFIDNTQMSFLVTDAERNLLLYMYQPESRESCGG 1085

Query: 895  ERGRLEVVGEYHLGE-----FVNRFRHGSLVM--RLPDSDVGQIPTVIFGTVNGVIGVIA 947
            +R  L   G++H+G      F  + R G +    R   S V      +  T++G +  + 
Sbjct: 1086 QR--LLRRGDFHVGSPVVSMFRIKCRMGDIAKYDRRAASIVDGRHITMMATLDGSLAYVL 1143

Query: 948  SLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTV--DAKNFLDGDLIESFLD 1005
             +P + Y  L  LQ  L   I    GLN + +R + ++++ +    KN LDG+LI  F+ 
Sbjct: 1144 PVPEKTYRRLLMLQNVLVTNIPHYAGLNPKAYRMYYSQRRFLGNPHKNILDGELIWKFMH 1203

Query: 1006 LSRTRMDEISKTMNVSVEELCKRVEEL 1032
            LS     E+SK +  +V ++   + E+
Sbjct: 1204 LSFMERSELSKKIGTTVTQITDDLLEI 1230


>gi|297810341|ref|XP_002873054.1| hypothetical protein ARALYDRAFT_908112 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297318891|gb|EFH49313.1| hypothetical protein ARALYDRAFT_908112 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 114

 Score =  107 bits (266), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 54/103 (52%), Positives = 67/103 (65%), Gaps = 6/103 (5%)

Query: 643 IGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTF 702
           I  ID  QK ++ +IPLGE+   ICHQ Q+ T+ IC+         S  HFVRLL+DQTF
Sbjct: 13  IIAIDATQKHNVLTIPLGEYASCICHQAQTCTYGICT------GTHSGTHFVRLLEDQTF 66

Query: 703 EFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENE 745
             +S Y LD +E G SI+SC F+ DSN YYCVGT YV PEE +
Sbjct: 67  NVLSAYTLDPYERGLSIVSCLFTADSNEYYCVGTEYVFPEEED 109


>gi|340054355|emb|CCC48651.1| putative damage-specific DNA binding protein [Trypanosoma vivax Y486]
          Length = 1272

 Score =  107 bits (266), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 176/819 (21%), Positives = 322/819 (39%), Gaps = 124/819 (15%)

Query: 295  NLGPIVDFCV-VDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL 353
            N GP+ D  V VD  R     V+  +G    G L ++R+ + + + A+V   G+  +  +
Sbjct: 426  NCGPVFDMTVAVDGPRM---SVIASTGVGLSGGLNLLRSAVNVCQFAAV--HGVVNVKRV 480

Query: 354  RSSTDDPFDTFLVVSFISETRILAMNLEDELEETEI--EGFCSQTQTLFCHD-AIYNQLV 410
              + +      +++SF   +R+    +   +E  E+    F +  +TL   D +     +
Sbjct: 481  CMAGN-----VIILSFPGYSRVCHFAVGKTMEVVEVYETPFDTLVETLELVDISQGGAFL 535

Query: 411  QVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTE 470
            QVT+  V  V  T +E  +     PG+  +V  A++        GG LV+       +  
Sbjct: 536  QVTTSGVNAVQGTMQEYIHR----PGHGKSVEHAHSC-------GGMLVFSSYTSICVLA 584

Query: 471  VK--HAQL----EYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHL 524
            V   H+ +    EYE+S L +         +    VG W   ++ ++ +    +  +   
Sbjct: 585  VNEPHSVITFHSEYEVSSLAM-------VSTTSLLVGEWGSNAISLYDVTAEGVHPRGRF 637

Query: 525  GGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGT-QPITLRTF 583
                 P SV +  + G S LL  L +G++ + LL+   G+ + +   +L T  P+ L   
Sbjct: 638  TCSATPCSVSVIPYHGTSRLLVGLLNGYVADVLLDDIAGDSSVKVAETLLTLHPVRLFNL 697

Query: 584  SSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLK--EVSHMCPFNSAAFPDSLAIAKEGEL 641
             S N   V    + P VI  S+     + ++L   E+  + P + +         KE  L
Sbjct: 698  QSHNA--VLCLGEVPLVIIVSDNGFQITGIDLYGIELCGIIPSSCSPVRYIFHSKKEQSL 755

Query: 642  TIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQ--------------SCAE 687
              G I D+Q+L+    PL      + +      F +   +N+                  
Sbjct: 756  IFGNITDVQRLNTNFFPLRATVTLVKYMAWWNVFVMSLRRNERDQILTIMGYELWAPSPL 815

Query: 688  ESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPT 747
              E + + LL+++   F+ +  L     G +  S       N    +GT +  P+E    
Sbjct: 816  RDEQNSLELLENERCVFVESVILG----GGNEWSTGTDALENSAIIIGTTFAFPDEQLSR 871

Query: 748  KGRILVFIVEDGK-------LQLIAEKETKGAVYS---LNAFNGKLLAAINQKIQLYKWM 797
              R +   VE GK       L+    K+ +GA+     +  + G++   IN  + LY W 
Sbjct: 872  SSRFMWCSVEQGKTANEKPQLRQQGSKDVQGALGCCCVVPNYAGRIALGINGCVALYCWN 931

Query: 798  LRDDGTRELQSECGHHGHILALYV---QTRGDFIVVGDLMKSISLLIYKHEEGAIEERAR 854
              D  +  +  E    G IL   +   Q    ++V  D   S S++     +G++   AR
Sbjct: 932  AVD--STFIAEETIGVGMILTRLMPVLQHDMSYLVAFDSRHSSSVIAVDTIQGSLSVAAR 989

Query: 855  DYNANWMSAVEILD----DDIYLGAENNFNLFTVRKNSEGATDEERG-------RLEVVG 903
            D     +    +L     DD+  G ++ FNLF V      AT  +         +L    
Sbjct: 990  DPELRGVMDGTVLHSGARDDMCFG-DDFFNLFCVPHVMPSATSPDAPAAGMPTCKLPTSA 1048

Query: 904  EYHLGEFVNRFRHGSL--------VMRLPDSDVGQI--PTVIFGTVNGVIGVIASLPHEQ 953
            +YH+G+ +   + GS         V+ +P   +  +  P ++FGT +G  G I  +  E 
Sbjct: 1049 QYHVGDLITAMQQGSFAPCSLSNGVVPVPKVLIPGVCGPQIVFGTTHGSFGTITPVTSET 1108

Query: 954  YLFLEKLQTNLRKVIKGVGGLNHEQWRSF--NNEKKTVD--------------------- 990
            YL L+ ++  +  V+  +GG +H  +R     N+++ V                      
Sbjct: 1109 YLLLKGIEVAVAAVVPALGGFSHAAYREVLCANQERGVSRNASFEAVNPRAMEVLRERRL 1168

Query: 991  ---AKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELC 1026
               A+    GDL+E FL LS      + +   + ++  C
Sbjct: 1169 KYVARCVCSGDLVEMFLTLSPDTQLRVIEEAAMHIQRWC 1207


>gi|340371789|ref|XP_003384427.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Amphimedon queenslandica]
          Length = 1408

 Score =  106 bits (264), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 130/575 (22%), Positives = 235/575 (40%), Gaps = 82/575 (14%)

Query: 535  LCAFEGISYLLCALGDGHL-LNF--------LLNMKTGELTDRKKVSLGTQPITLRTFSS 585
            L  +E   Y   A+  GHL L F        L + + G+L    +    + P  LR FS+
Sbjct: 838  LVIYEAFQYT-SAIHPGHLKLRFSKVQHNVILQDKRVGKLAKHFQQQEFSFPPHLRKFSN 896

Query: 586  -KNTTHVFAASDRPTVIYSSNKKLLYSNVNLKE--VSHMCPFNSAAFPDS-LAIAKEGEL 641
                + VF     P  I+ + +  L  +    +  V    PF++   P   L   KE EL
Sbjct: 897  IAGYSGVFVCGPYPHWIFMAARGHLSIHPMYIDGPVQSFAPFDNVNCPSGFLYFNKESEL 956

Query: 642  TIGTIDDI----QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCA----------- 686
             I  +           +R +PL   P  + +  +S+   I +   Q              
Sbjct: 957  RISVLPTQLSYDSYWPVRKVPLKATPHFVGYHMESKVHVIIASTPQPVTVIPDPNGETED 1016

Query: 687  -------------EESEMHFVRLLDDQTFEFI--STYPLDTFEYGCSILSCSFSDDSNV- 730
                          + E ++++LL   ++E I  S Y ++   +   +          + 
Sbjct: 1017 ALETVERDGRFVYSQEETYYLQLLSPTSWETIPHSKYEMEAHYHVTDMKVMRLRSQETLS 1076

Query: 731  ----YYCVGTAYVLPEENEPTKGRILVF-----IVEDGK------LQLIAEKETKGAVYS 775
                Y  VGT     EE    KG++L+F     I E GK      L+ + ++E K  V  
Sbjct: 1077 GRKEYIVVGTMATFGEELS-AKGKVLIFDVSVVIPEPGKPFSQYRLKNLYDQEQKWPVTG 1135

Query: 776  LNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMK 835
            L   NG +L A+ QKI  + W  +D+  ++L +             Q+   FI+ GD+ +
Sbjct: 1136 LECVNGLILTAMGQKI--FMWQFKDN--KDLLAVAFIDAETYIHTAQSIKGFILTGDVTR 1191

Query: 836  SISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG---AENNFNLFTVRKNSEGAT 892
            SI LL Y  +  ++   ++D N   + +   + D   LG   ++++ N+   +   E   
Sbjct: 1192 SIQLLHYNEDRRSLSLISQDPNPMEVFSTTFMIDGKALGFLVSDSDRNITLFQYQPENPA 1251

Query: 893  DEERGRLEVVGEYHLGEFVNRFRH----------GSLVMRLPDSDVGQIPTVIFGTVNGV 942
                  L   G+ H+G  VN F +           S  M++  +D  Q     FGT++G 
Sbjct: 1252 SSGGANLVRCGDIHVGSLVNVFLNIRCKTSAGLGASREMKIALADKRQ--CTFFGTLDGG 1309

Query: 943  IGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVD--AKNFLDGDLI 1000
            IG +  +P + Y  L  LQ  + + ++ + GLN + +R+F    + +    +N LDG L+
Sbjct: 1310 IGCLLPIPEKVYRRLSMLQVKMTQGMRHMAGLNPKAFRTFQTRHQYLHNAQRNILDGTLL 1369

Query: 1001 ESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRL 1035
              +L L+     + SK +  +V ++ + ++E+ ++
Sbjct: 1370 YQYLSLTAKEKFDFSKQIGTTVAQIMEDLKEIDKV 1404



 Score = 50.8 bits (120), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 54/257 (21%), Positives = 103/257 (40%), Gaps = 42/257 (16%)

Query: 2   LDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGR 61
           L+   +G I +L+  R     +D L ++    K  +++++ E++ L T +M    D   R
Sbjct: 59  LEFSFHGNIQSLDKVRLRHSDRDCLLLSFNDAKLSIVEYNPETNGLKTVSMHQFEDEEIR 118

Query: 62  ----PTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFD---------------NKGQLKEAF 102
                 D+  +  +DP+ R   + L+     V PF                +   +   +
Sbjct: 119 GGILHNDSRPVVKVDPEGRCAVMLLFGSHLAVCPFQQDLSIDTPLSPSPSLDTHDILPTY 178

Query: 103 NIRLEELQ-----VLDIKFLYGCAKPTIVVLYQDN----------KDARHVKTYEVALKD 147
            I L +L      + D+ F+ G   PT++ L + +          +D+  +    +   D
Sbjct: 179 TISLRDLPEPLPVIKDMTFIEGYTSPTLLFLSEVSPTWAGRISLRQDSMMLLGLSLNTSD 238

Query: 148 KDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYG 207
           K      W+  NL   +  L PVP PL GVL+ G  T++Y + ++       P    +  
Sbjct: 239 KSHTVI-WTLKNLPFDSSYLHPVPKPLGGVLVFGANTLIYLNQSS-------PPYGLSLN 290

Query: 208 RVDADGSRYLLGDHAGL 224
            +    +R+LL +   L
Sbjct: 291 SITDYTTRFLLKNEGSL 307


>gi|17861814|gb|AAL39384.1| GM01240p [Drosophila melanogaster]
 gi|220943184|gb|ACL84135.1| CG13900-PB [synthetic construct]
          Length = 688

 Score =  105 bits (263), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 146/702 (20%), Positives = 271/702 (38%), Gaps = 125/702 (17%)

Query: 434  PPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTE-VKHAQLEYEISCLDINPIGENP 492
            P   S  +      ++L  T   +++ L+ G+  L E  + +++  EI C+ +  + E  
Sbjct: 3    PAVLSFKINQKPYPRLLNFTTNEYILCLQTGE--LNEYTERSEMPAEIMCMALGTVPEGE 60

Query: 493  SYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFE------------- 539
              S   AVG+  D +VRI SL   N +T   +     P   L C  E             
Sbjct: 61   QRSWFLAVGL-ADNTVRILSLDPNNCLTPCSMQALPSPAESL-CLVEMGHTESTTQGGLD 118

Query: 540  ------------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKN 587
                        G  YL   L +G LL  +L+  +G+L D +   LG++P+ L     + 
Sbjct: 119  DDAPAQRSGNNKGTIYLNIGLSNGVLLRTVLDPVSGDLADTRTRYLGSRPVKLFRIKMQG 178

Query: 588  TTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTID 647
            +  V A S R  + Y    +   + ++ + + +   F+S    + +       L I  ++
Sbjct: 179  SEAVLAMSSRTWLSYYHQNRFHLTPLSYETLEYASGFSSEQCSEGIVAISTNTLRILALE 238

Query: 648  DIQKLHIR-SIPLGEHPRR-ICHQEQSR-----------TFAICSLKNQSCAEE------ 688
             +  +  + + PL   PR  + H +  R           T    S + +  AEE      
Sbjct: 239  KLGAVFNQVAFPLQYTPRTFVIHPDTGRMLIAETDHNAYTEDTKSARKEQMAEEMRSAAG 298

Query: 689  -------SEM-------------------------HFVRLLDDQTFEFISTYPLDTFE-- 714
                    EM                           +R LD    + + + PL   E  
Sbjct: 299  DEERELAREMANAFINEVLPEDVFSSPKAGLGLWASQIRCLDAMHGQTMFSVPLTQNEAI 358

Query: 715  YGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVED---GKLQLIAEKETKG 771
               ++L  S + D   Y  VG A  L      ++G  +     D     L+ +   +   
Sbjct: 359  MSMAMLKFSIAADGRYYLAVGIAKDLQLNPRISQGGCIDIYKIDPTCSSLEFMHRTDIDE 418

Query: 772  AVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSEC-GHHGHILALYVQTRGDFIVV 830
               +L  F G+LLA   + +++Y     D G +++  +C   H     + +Q  G  + V
Sbjct: 419  IPGALCGFQGRLLAGCGRMLRIY-----DFGKKKMLRKCENKHIPYQIVNIQAMGHRVYV 473

Query: 831  GDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEG 890
             D+ +S+  + Y+  E  +   A D +  W++A  +LD D  +   + F   ++++    
Sbjct: 474  SDVQESVFFIRYRRAENQLIIFADDTHPRWVTATTLLDYDT-IAIADKFGNLSIQRLPHS 532

Query: 891  ATDE------------ERGRL-------EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQI 931
             TD+            +RG L       E +  +H+GE +   +  +L+        G  
Sbjct: 533  VTDDVDEDPTGTKSLWDRGLLSGASQKSENICSFHVGEIIMSLQKATLI-------PGGS 585

Query: 932  PTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVD 990
              +I+ T++G +G        E Y F + L+ ++R     + G +H  +RS         
Sbjct: 586  EALIYATLSGTVGAFVPFTSREDYDFFQHLEMHMRNENPPLCGRDHLSYRS-----SYYP 640

Query: 991  AKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
             KN LDGDL E +L +   +   I+  M  +  ++CK++E++
Sbjct: 641  VKNVLDGDLCEQYLSIEAAKQKSIAGDMFRTPNQICKKLEDI 682


>gi|354547430|emb|CCE44165.1| hypothetical protein CPAR2_503890 [Candida parapsilosis]
          Length = 1171

 Score =  105 bits (263), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 217/981 (22%), Positives = 385/981 (39%), Gaps = 156/981 (15%)

Query: 162  NGADLLIPVPPPLCGVLIIGEETIVY---CSANAFKAIPIRPS--------ITKAYGRVD 210
            N A+ L+PVP  + GVL+ GE  I+Y    S +    +P R S        +T    +  
Sbjct: 231  NDANYLVPVPGHIGGVLVCGENWIIYDKLGSESIALPLPRRKSQSSVIVTHVTHVLKK-K 289

Query: 211  ADGSRYLLGDHAGLLHLLVITHE--KEKVTGLKIELLGETSIASTISYLDNAVVYIGSSY 268
            + G   LL +  G L  LVI ++  +E V  ++I       +  ++S   N + +  +  
Sbjct: 290  SYGFFILLQNEFGDLFRLVIDYDYDRELVKDIEISYFDTIPVCYSLSIFKNGLCF-ANCV 348

Query: 269  GDSQL---------------------IKLNLQPDAKGSYVE----------VLERYVNLG 297
              SQL                     I+++  P  K   VE          +++   +L 
Sbjct: 349  NRSQLLYQFEKLGEEIGEEGVGINRRIQMDSIPLTKEKVVEFKLKGLDNLALIDVVESLS 408

Query: 298  PIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINE--QASVELQGIKGMWSLRS 355
            PI D  +++        +VT S   K   L+ + +G       ++S  ++    +++ ++
Sbjct: 409  PITDSTLIN------DTLVTLSTKSK---LKSIVHGTPTTTLVESSFPIKPT-NVFTAKT 458

Query: 356  STDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSG 415
            S D   D +LV++     + L +++ + +EE     F     T+       + +VQV S 
Sbjct: 459  SADAMDDEYLVITSTLAFKTLVLSIGEVIEEVNDSKFVMDQPTVAVQQVGKSSIVQVYSN 518

Query: 416  SVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQ 475
             +R ++   +     W  P G ++  AT N  QVLL      LVY E  D    +++  Q
Sbjct: 519  GLRHINGKKK--VTNWYPPAGITITHATTNNQQVLLGMSNLELVYFET-DPEDDQLREYQ 575

Query: 476  LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLL 535
               E+S   I  +      S  A VG  +D ++ + SL   N +  + L       + L+
Sbjct: 576  DRLEVSS-PIRSMCIPKEQSSFAVVGC-SDETISVVSLQSQNCLQVKSLQALSSSANSLV 633

Query: 536  CAFEGISYLLCALG--DGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFS-SKNTTHVF 592
                  S  L  +G  +G  +   ++   G+L+D +   LG +P+TL     S + T V 
Sbjct: 634  MLTHNPSTTLIHIGMDNGVYVRTKIDTFNGKLSDTRIKYLGPKPVTLNELKLSDDITGVL 693

Query: 593  AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNS---------AAFPDSLAIAKEGELTI 643
            A S +P + Y    K   + +   ++ +   F S           + D+L I   G+   
Sbjct: 694  AISTKPWIGYFYQGKYRCTPLLDIDIINGASFKSEDIGGEGIVGIYDDNLVIFSVGKED- 752

Query: 644  GTIDDIQKLHIRSIPLGEHPRRICH-QEQSRTFAICSLKNQSCAEESEM----HFVRLLD 698
               D  Q+L +  + L   PR+I    +  + F          A E EM     +V  L 
Sbjct: 753  SIFDPNQELTVTELNLRYTPRKIVKGGDDDKLF----------ASEVEMGINTPYVSNLT 802

Query: 699  DQTFEFIST--YPLDTFEYGCSILSC-------------SFSDDSNVYYCV-----GTAY 738
                  +    Y    FE G    SC              F  D  +   V        Y
Sbjct: 803  KDVENGVDPEYYEAFGFERGPGCASCVQLIENGEIKQSLEFVKDQRIVDMVKIQFNKNQY 862

Query: 739  VLPEENEPTKGRILVFIVEDGK-LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLY--- 794
            ++    +  +  +  F ++  + LQ I + + K     +  F  +LL A    I LY   
Sbjct: 863  LIVGVTKQEENLLYTFKIDKKRNLQYIHKTQLKYVPQVMEVFQNRLLVASGNAISLYELG 922

Query: 795  -KWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
             + +LR   TR           I+ +  Q R D I++ D   SI    + +EE      A
Sbjct: 923  QRQLLRKSLTR-----IDFVQSIVKVSPQPR-DRILLADSANSIVFAKFDNEENQFIPVA 976

Query: 854  RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERG---------------- 897
             D     ++A + LD D  +G +   N+F  R + E +   ++                 
Sbjct: 977  DDVVKRNITAWKQLDYDTIIGGDKFGNIFVTRLDREESKQVDQDWTVLKQAAQNSSNLNS 1036

Query: 898  ---RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQY 954
               +L+ + +Y++ + +  F+ GS        ++G    +I+  V G IGV+  L  +  
Sbjct: 1037 CVFKLQNLCQYYIPDIITSFQLGSF-------NLGGEECIIYTGVMGTIGVLIPLLSKSE 1089

Query: 955  LFLE---KLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRM 1011
            + L    +LQ  +      V G NH + RS+ N +     KN  DGD +E +  +     
Sbjct: 1090 IELVHDLQLQIGIWNDGVNVAGKNHGKLRSYYNPR-----KNVYDGDFLELYFAIPLDVK 1144

Query: 1012 DEISKTMNVSVEELCKRVEEL 1032
             +I+K +N SV E+ K++ ++
Sbjct: 1145 VKIAKKLNKSVGEIEKKLNDI 1165


>gi|427795803|gb|JAA63353.1| Putative mrna cleavage and polyadenylation factor ii complex subunit
            cft1 cpsf subunit, partial [Rhipicephalus pulchellus]
          Length = 726

 Score =  105 bits (263), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 127/507 (25%), Positives = 211/507 (41%), Gaps = 76/507 (14%)

Query: 591  VFAASDRPTVIYSSNKKLLYSNVNLKEVSHMC--PFNSAAFPDS-LAIAKEGELTIGTID 647
            VF    RP  ++ S++  L  +    +    C  PF++   P   L   K+GEL I T+ 
Sbjct: 224  VFLCGYRPYWLFMSSRGELRCHPMFVDGPIHCFAPFHNVNCPKGFLHFNKQGELRISTLP 283

Query: 648  DI----QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSC-------AEESEMHFVR- 695
                      +R +PL   P  + +   S+T+ + + +   C        EE E   +  
Sbjct: 284  THLTYDAPWPVRKVPLRCTPHFVNYHVDSKTYCVVTSQPDPCNHLVRFTGEEKEYELLER 343

Query: 696  -------LLDDQTFEFIS-----TYP-----LDTFEYGCSILSCSFSDDSNV-----YYC 733
                    +D  + + +S     T P     LD +E+   + +   S +        Y  
Sbjct: 344  DSRYIFPTMDKFSLQLLSPVSWETIPNTRVDLDEWEHLTCLKNVMLSSEGTTTGMKGYLA 403

Query: 734  VGTAYVLPEE--------------NEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAF 779
            +GT Y   E+                P  G+ L       K++++  KE KG V +L+  
Sbjct: 404  LGTNYCYGEDVTSRGRIIILDIIDVVPEPGQPLT----KNKIKIVYSKEQKGPVTALSQV 459

Query: 780  NGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISL 839
             G LL+AI QKI  Y W L+D+   EL         I    V T  + I+VGD+ KS+SL
Sbjct: 460  VGFLLSAIGQKI--YIWQLKDN---ELVGVAFIDTQIYIHSVVTVKNLILVGDVFKSVSL 514

Query: 840  LIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG-----AENNFNLFTVRKNSEGATDE 894
            L Y+     +   +RD     + AVE   D+  +      AE N  L+  +  S  +   
Sbjct: 515  LRYQEASRTLSLVSRDVRPLEVYAVEFFIDNTQMSFLVTDAERNLLLYMYQPESRESCGG 574

Query: 895  ERGRLEVVGEYHLGE-----FVNRFRHGSLVM--RLPDSDVGQIPTVIFGTVNGVIGVIA 947
            +R  L   G++H+G      F  + R G +    R   S V      +  T++G +  + 
Sbjct: 575  QR--LLRRGDFHVGSPVVSMFRIKCRMGDIAKYDRRAASIVDGRHITMMATLDGSLAYVL 632

Query: 948  SLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTV--DAKNFLDGDLIESFLD 1005
             +P + Y  L  LQ  L   I    GLN + +R + ++++ +    KN LDG+LI  F+ 
Sbjct: 633  PVPEKTYRRLLMLQNVLVTNIPHYAGLNPKAYRMYYSQRRFLGNPHKNILDGELIWKFMH 692

Query: 1006 LSRTRMDEISKTMNVSVEELCKRVEEL 1032
            LS     E+SK +  +V ++   + E+
Sbjct: 693  LSFMERSELSKKIGTTVTQITDDLLEI 719


>gi|72390667|ref|XP_845628.1| damage-specific DNA binding protein [Trypanosoma brucei TREU927]
 gi|62359843|gb|AAX80271.1| damage-specific DNA binding protein, putative [Trypanosoma brucei]
 gi|70802163|gb|AAZ12069.1| damage-specific DNA binding protein, putative [Trypanosoma brucei
            brucei strain 927/4 GUTat10.1]
          Length = 1270

 Score =  105 bits (263), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 178/800 (22%), Positives = 313/800 (39%), Gaps = 127/800 (15%)

Query: 295  NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLR 354
            N GP+ D  V       +  V+  +G   +G L  VR+ + + +   + + G++ +  + 
Sbjct: 426  NCGPVFDVTVA--ADGSRTSVIAGTGVGLNGGLSFVRSAVSVRQD--IRVTGLQNVRQIS 481

Query: 355  SSTDDPFDTFLVVSFISETRILAMNLEDE--LEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
             S D      ++ S    +R+    + +   LEE     F ++   L  ++   N  +QV
Sbjct: 482  VSEDT-----IIFSLPGYSRVCRYCVGETMVLEEIHNTFFGTEETLLLEYNNERNVFLQV 536

Query: 413  TSGSVRLVSSTSRELRNEWKSPPGYSV-----NVATANASQVLLATGGGHLVYLEIGDGI 467
            T+            LRN W     Y +     ++  A+AS+ LL       +Y+      
Sbjct: 537  TTAG----------LRNVWPDRGEYVIRVINNDIGHAHASEGLLVFSNPANLYV-FCMKT 585

Query: 468  LTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGE 527
            LT V    L   ISCL +         S    VG W   +V ++ L D  + +K  L   
Sbjct: 586  LTPVATLCLADAISCLLV-------FSSSSLLVGTWGSCAVHLYELHDGMIQSKVILQCS 638

Query: 528  IIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKT--GELTDRKKVSLGTQPITLRTFSS 585
              P S+ +    G   LL  L +G++ +  ++     GE+ +     L TQP+  R F+ 
Sbjct: 639  ATPCSMCVVFHSGGHRLLVGLHNGYVADVPISGANVCGEMVETM---LTTQPV--RLFNL 693

Query: 586  KNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGE--LTI 643
            ++   V    + P ++  +N     + ++ ++VS          P       + E  L +
Sbjct: 694  ESHAAVLCLGEIPLILIVTNTGFQLTGIDFRDVSACAIMPKMRSPSRYIFFSKSECALIL 753

Query: 644  GTIDDIQKLH------------IRSIP----LGEHPRRICHQEQSRTFAICSLKNQSCAE 687
            G+I  +QKL+            ++ +P    L    RRI  ++Q  +     + N S   
Sbjct: 754  GSITSVQKLNTDYVGLKATATCVKYMPWWNVLVLSIRRI-EKDQVVSTMGHEVSNLSVLL 812

Query: 688  ESEMHFVRLLDDQTFEFISTYPL-DTFEYGCSILSCS-FSDDSNVYYCVGTAYVLPEENE 745
            +    F  LL+++   FI    L    E+G    SC   + D+ V   +GT +V P+E  
Sbjct: 813  DEPNSF-ELLENERCAFIDCVALGQANEWG----SCGEVASDAGVVL-IGTTFVFPDEQL 866

Query: 746  PTKGRILVFIVEDGKLQ-------LIAEKETKGAVYS---LNAFNGKLLAAINQKIQLYK 795
                R +   VE  KL+       L   K+ +GA+     +  + G++   I   + LY 
Sbjct: 867  SRSSRFMWCTVEVAKLRTEKTLLRLQGSKDVEGALQCCCIVPNYAGRVALGIGGCVVLYS 926

Query: 796  WMLRDDG-TRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERAR 854
            W   D     E   + G     L   +Q    +IV  D   S   +     +G++   AR
Sbjct: 927  WNAADATFVAEETIQIGTLIVRLIPVMQKEVSYIVASDARHSCFFVRIDTIQGSLNIVAR 986

Query: 855  DYNANWMSAVEILD----DDIYLGAENNFNLFTVR-----KNSEG--ATDEERGRLEVVG 903
            D     +    IL      D+ LG ++ FN F V       NS G  A      +L+   
Sbjct: 987  DPELRGVMDCAILQYESRHDVCLG-DDLFNFFCVSHVEPLANSSGVSAPAMPTKKLQTTA 1045

Query: 904  EYHLGEFVNRFRHGSL--------VMRLPDSDVGQI--PTVIFGTVNGVIGVIASLPHEQ 953
            +YH+G+ +     GS         V+ +P + V  +  P +++GT +G  G I  +  E 
Sbjct: 1046 QYHMGDLITVMHQGSFAPCSVLNDVVPIPATLVRGVCGPQIVYGTSHGAFGAITPISSET 1105

Query: 954  YLFLEKLQTNLRKVIKGVGGLNHEQWRS--------------------------FNNEKK 987
            ++ L+ L+ ++  V+  +GG  H  +R                           FN  ++
Sbjct: 1106 FILLKGLEVSVASVVPPLGGFTHASFREVLRVGQERGASRNASFQVTNPQATELFNRRRR 1165

Query: 988  TVDAKNFLDGDLIESFLDLS 1007
                +    GD++E FL LS
Sbjct: 1166 RCVPRGVCCGDVVEMFLTLS 1185


>gi|388579257|gb|EIM19583.1| hypothetical protein WALSEDRAFT_33953 [Wallemia sebi CBS 633.66]
          Length = 986

 Score =  105 bits (263), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 88/367 (23%), Positives = 172/367 (46%), Gaps = 20/367 (5%)

Query: 544 LLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYS 603
           +L    DG L  F  +     + D + V++G+ P+ L   +  N   +FA  D P+++  
Sbjct: 549 VLAGSADGSLYAFRFDEHLKSV-DIQTVAIGSTPVCL---TRSNDGLIFALCDVPSIVTL 604

Query: 604 SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG-EH 662
            N +L YS++N+  ++ +  + +    + + +  +       +    ++HI SI +G + 
Sbjct: 605 DNTRLRYSSININYINGLTSYKTNDMVNYVFVQNDQLKFSRILSTENRVHIHSIEMGADV 664

Query: 663 PRRICHQEQSRTFAICSLKNQSCAEES---EMHFVRLLDDQTFEFISTYPLDTFEYGCSI 719
           PR++ ++E    +A+  ++N   ++ +       V+LLD+  +E ++   ++  E    +
Sbjct: 665 PRQVAYKEDR--YAVGCVRNAYRSDRTLYESSSCVKLLDNN-YEQLAQMEMEKDEIVSVV 721

Query: 720 LSCSFSDDSNVYYCVGTAYVLPEEN--EPTKGRILVFIVEDGKLQLIAEKETKGAVYSLN 777
            S S ++     + VGT Y    E   E TKGR ++ +V+D K  + +     G VY++ 
Sbjct: 722 ESLSIANME--VFVVGTYYNNETEGTEEATKGRFIILLVKDDKFIIASSFLVPGCVYAVC 779

Query: 778 AFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSI 837
             + KL  A+N ++++Y      D T +++    +    + + + + G  +VV D +KS 
Sbjct: 780 GIDQKLAVAVNYQVRVYDIESIRDDTYKMRFIASYGNAFVVVSLTSVGKILVVADFLKSA 839

Query: 838 SLLIYKHEEGAIEERARDYNANWMSAVEILDDD-----IYLGAENNFNLFTVRKNSEGAT 892
             L    E GA+ +   D    W S V  LDD        LGA+  F+LF + + S G T
Sbjct: 840 IYLQLDTERGALTQVGYDTAQRWSSLVVALDDGNEETFTTLGADIRFHLFALDRTSSGIT 899

Query: 893 DEERGRL 899
                +L
Sbjct: 900 SRTLAQL 906



 Score = 79.3 bits (194), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 101/408 (24%), Positives = 181/408 (44%), Gaps = 63/408 (15%)

Query: 2   LDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGR 61
           +D  I+    +L L   H +AQ  L I +  Y+  V++   E ++++    G+       
Sbjct: 20  IDKIIFQTHESLLLTLDHPQAQ--LVILSLSYENGVIKHIVECTKMLVETTGE------- 70

Query: 62  PTDNGQIGIIDPDCRLIGLHLYDG---LFKVIPFDN--KGQLKEAFNIRLEELQVLDIKF 116
           P+      I+D   ++   HL+ G    FK + +D+  K  + +  N +++   VL + F
Sbjct: 71  PSYEYCNSIVDKSTQIGVSHLWQGQLHAFK-LSYDDRRKTHIIDGRNSQIDHSVVLSMAF 129

Query: 117 LYGCA--KPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDN---GADLLIPVP 171
           L      KPT+  L Q       +  +E  +  +D+V+   +   ++     A  ++ V 
Sbjct: 130 LATDKDEKPTLCRLVQSADVDNPLLVFEHLICKEDYVDISQTVLRIETQCPSAQKIVAVE 189

Query: 172 PPLCGVLIIGEETIVY--CSANAFKAIPIRPSIT---------------------KAYGR 208
                VL+IG     Y        K +  + S T                     + Y  
Sbjct: 190 GKKRAVLVIGAFGCEYYEIPKKDLKGVRRKSSTTVTSPQVVDMEHLSVKSPMAAVRGYTA 249

Query: 209 VDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSY 268
           V+   + +L+GD  G ++ + I++       L+I L+G +S+AST+ +L +  +++GS  
Sbjct: 250 VNDSCTAWLIGDEKGDIYYISISN------FLEITLVGNSSVASTLQHLGSGFLFLGSQN 303

Query: 269 GDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLR 328
            DS+L+ +   P      +  LE Y NL P+ D  +   +   QGQ+V CSG+ K G LR
Sbjct: 304 EDSKLLVVQTNP----VRIVELENYTNLAPVSDMALTHPDGI-QGQLVVCSGSNKTGKLR 358

Query: 329 IVRNGIGINEQASVEL-QGIKGMWSLRSSTDDPFDTFLVVSFISETRI 375
           +V  GIG+ +   V L   I  ++ L S         L+VS++S T+I
Sbjct: 359 VVTTGIGLCDIYQVGLGDSISNVFILGS--------HLLVSYLSTTKI 398


>gi|448528339|ref|XP_003869702.1| hypothetical protein CORT_0D07360 [Candida orthopsilosis Co 90-125]
 gi|380354055|emb|CCG23569.1| hypothetical protein CORT_0D07360 [Candida orthopsilosis]
          Length = 1170

 Score =  105 bits (262), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 218/991 (21%), Positives = 387/991 (39%), Gaps = 177/991 (17%)

Query: 162  NGADLLIPVPPPLCGVLIIGEETIVYCS-ANAFKAIP----------IRPSITKAYGRVD 210
            N A+ L PVP  + GVL+ G   I+Y    N    +P          I   +T A  + +
Sbjct: 231  NDANYLAPVPGHIGGVLVCGANWIMYDKLGNESILLPLLRRKDQTSVIISHVTHALKKKN 290

Query: 211  ADGSRYLLGDHAGLLHLLVITHE--KEKVTGLKIELLGETSIASTISYLDNAVVYIGSSY 268
              G   LL +  G L  L+I ++  +E +  ++I       +   ++   N + +  +  
Sbjct: 291  Y-GFFILLQNDLGDLFRLIIDYDSNRELIKDIEITYFDTIPVCYNLNIFKNGLCF-ANCI 348

Query: 269  GDSQL---------------IKLN--LQPD----AKGSYVE----------VLERYVNLG 297
              SQL               I++N  +Q D     K  Y E          +++   +L 
Sbjct: 349  NRSQLLYQFEKLGEEISEEDIRINKTVQMDNIQLTKEKYFEFKLKGLDNLALIDVVESLS 408

Query: 298  PIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIK--GMWSLRS 355
            PI D  + D        +VT S   K   L+ + +G         +L  IK   +++ ++
Sbjct: 409  PITDSILND------DTLVTLSTKSK---LKTIVHGTPTTTLVESQLP-IKPTNIFTTKT 458

Query: 356  STDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSG 415
            S +   D +LV++     + L ++L + +EE     F     T+       + +VQ+ S 
Sbjct: 459  SANAVDDEYLVITSTLSFKTLVLSLGEVIEEVNDSEFVLDQPTVAVQQVGKSSIVQIYSN 518

Query: 416  SVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQ 475
             +R ++   +     W  P G +V  AT N  QV L      LVY E  D    +++  Q
Sbjct: 519  GLRHINGNKK--VTNWYPPAGITVTHATTNNQQVFLGLSNLELVYFET-DPDDDQLREYQ 575

Query: 476  LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGG--EIIPRSV 533
               E+S   I  +  +   S  A VG  +D ++ + SL   N +  + L          +
Sbjct: 576  DRLEVSS-PIRSMCISKEQSSFAVVGC-SDETISVISLQRQNCLQVKSLQALSSSANSLI 633

Query: 534  LLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFS-SKNTTHVF 592
            +L    G + +   + +G  +   ++   G+L+D +   LG +P+TL     S +   V 
Sbjct: 634  MLTHKPGTTLIHIGMDNGVYVRTKIDTFNGKLSDTRIKYLGPKPVTLNELKLSDDIIGVL 693

Query: 593  AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGE----LTIGTIDD 648
            A S +P + Y    K   + +   ++ +   F S        +   G+     ++G  D 
Sbjct: 694  AISSKPWIGYFHQGKYRCTPLLDIDIINGAFFKSEDIGGDGIVGIHGDNLVIFSVGKEDS 753

Query: 649  I----QKLHIRSIPLGEHPRRICHQEQSRTF----------------------------- 675
            +    Q+  +  + L   PR+I   +  + F                             
Sbjct: 754  LFDPNQEFTVTELNLRYTPRKIVKGDDGKLFVSEVELGIKTPYLSNLTKEVEDTVDPEYY 813

Query: 676  -AICSLKNQSCAEESEMHFVRLLDD----QTFEFISTYPLDTFEYGCSILSCSFSDDSNV 730
             A    ++  CA       V+L+ D    QT EF+    +        ++   F  + N+
Sbjct: 814  EAFGYERDSGCAS-----CVQLVQDGEIKQTLEFVKNQRI------VDMIKIQF--NKNL 860

Query: 731  YYCVGTAYVLPEENEPTKGRILVFIVEDGK-LQLIAEKETKGAVYSLNAFNGKLLAAINQ 789
            Y  VG      E+N      +  F ++  K LQ I + E K    ++  F  +LL A   
Sbjct: 861  YLVVGVTK--HEDN-----LLYTFKIDKKKNLQYIHKTELKYVPQTMEVFQDRLLVASGN 913

Query: 790  KIQLY----KWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHE 845
             I LY    + +LR   TR           I+ +  Q R D I++ D   SI    +  E
Sbjct: 914  SISLYELGQRQLLRKSLTR-----IDFIQTIVKVTPQPR-DRILLADSANSIVFAKFDQE 967

Query: 846  EGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERG-------- 897
            E      A D     ++A + LD D  +G +   N+F  R + E +   ++         
Sbjct: 968  ENQFVSMADDTVKRNITAWKQLDYDTVIGGDKFGNIFVSRLDREESKQIDQNWTVLKQAA 1027

Query: 898  -----------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVI 946
                       +L+ + EY++ + +  F+ GS        ++G    +I+  + G IG++
Sbjct: 1028 KNSPNLNSCVYKLQNLCEYYIPDIITSFQLGSF-------NLGGEECIIYTGLTGTIGIL 1080

Query: 947  ASLPHEQYL-FLEKLQTNLR----KVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIE 1001
              L  +  +  L  LQ  +     KV   V G NH + RS+ N      AKN  DGD +E
Sbjct: 1081 LPLISKSEIELLHDLQLEISAYNDKV--NVAGKNHAKLRSYYNP-----AKNIFDGDFLE 1133

Query: 1002 SFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
             +L+L      +I+K +N SV E+ K++ ++
Sbjct: 1134 LYLNLPLDEKLKIAKRLNKSVGEVEKKLNDI 1164


>gi|167395195|ref|XP_001741266.1| pre-mRNA-splicing factor rse-1 [Entamoeba dispar SAW760]
 gi|165894222|gb|EDR22283.1| pre-mRNA-splicing factor rse-1, putative [Entamoeba dispar SAW760]
          Length = 1145

 Score =  105 bits (262), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 230/1134 (20%), Positives = 438/1134 (38%), Gaps = 149/1134 (13%)

Query: 1    MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
            +L   ++  I ++   R   EA+D++ I ++     +L +  E ++LI     ++  + G
Sbjct: 53   LLQESVFCMIRSICKIRIGSEAKDYILIGSDSGSITLLGYSFEQNKLIP-ICNEIFGKSG 111

Query: 61   -RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVL---DI 114
             R    G+    DP  R  +IG      L  +   DN G++  +  +   +   +    I
Sbjct: 112  IRRVVPGEYICSDPMGRAAMIGAIEKQKLVYIFNRDNNGKITISSPLEAHKSNTICYNII 171

Query: 115  KFLYGCAKPTIVVL-------YQDNKDARHVKTYEVALKDKDF--------VEGPWSQNN 159
                G   P    +       Y  + + R +K Y V L + D         V  P     
Sbjct: 172  ALNVGYDNPIFATIEVDYNEQYLSHIEHRIIKKY-VVLYELDLGLNHVIRKVAEP----- 225

Query: 160  LDNGADLLIPVPP--PLC--GVLIIGEETIVYCSANAFKA-IPI---------RPSITKA 205
            +D  ++ LIP+      C  G+ I  E+ I + +    K  IPI           ++  +
Sbjct: 226  IDPTSNYLIPILSLNEECHYGIFICSEDKITWFNIGHDKVCIPIPKYHNQSNTHSTLITS 285

Query: 206  YGRVDADGSRYLL--GDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVY 263
            +      G  ++L   +   L  LL  T EK ++  + I       IA ++ +     + 
Sbjct: 286  HVMRTHKGKHFVLLQSEFGDLFQLLFDTKEKGQLDNIHIRYFDSIPIALSLQFSKRGHLL 345

Query: 264  IGSSYGDSQL--------IKLNLQPDAKGSY--------VEVLERYVNLGPIVDFCVVD- 306
                YGDS L        I + L+ + +  +        +E   R+ +L P++D  V   
Sbjct: 346  CIGEYGDSILYNIISMENINVPLEKNGRLEFERHEEILNLEEFYRFKSLAPLIDLKVAPA 405

Query: 307  LERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIK-GMWSLRSSTDDPFDTFL 365
               Q   ++    G     +++I++N +   +   +EL  I   +W L+  TD+ +  +L
Sbjct: 406  TSSQDTTKMYAFCGKGNQSTIKILKNQLNTLDAVEIELPAIPISVWPLKKETDE-YHQYL 464

Query: 366  VVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSR 425
             +S+ + T +L +  EDE+ E         T +L          +QV +  + + S   +
Sbjct: 465  AISYSNITTLLKIT-EDEMSECNTTPILLSTPSLLVSMLSDGTFLQVMTDRIIVYSEPIQ 523

Query: 426  ELRNEWKSPPGYSVNVATANASQVLLATGGGH---LVYLEIGDGILTEVKHAQLEYEISC 482
            +   E +         A+ N S+++++    +   L+Y +   G L  ++  +   +I+ 
Sbjct: 524  QFITENQK-----YVCASCNGSELIVSVEKNNQTALIYFQYQSGHLLTMERKENLAKITA 578

Query: 483  LDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPRSVLLCAFEGI 541
            L ++    +PS  Q  A+G   D SV + SL P+        +  +    SV    F  I
Sbjct: 579  LALDQF--HPS--QHCAIGC-VDGSVHLLSLIPNETTKALSRVSLQTYDCSVNSLTFNII 633

Query: 542  ---SYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRP 598
               SYL   L +G L     +  +GE+ +     +G++P+TL          V A S R 
Sbjct: 634  DNKSYLFAGLSNGLLGRSTYDPISGEINESSLNFVGSRPVTLSNVKDCGEDSVLAISGRS 693

Query: 599  TVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDI-QKLHIRSI 657
             + Y +  K+  + +N++  +  C        +++AI  E  + I TI+     L  ++I
Sbjct: 694  LLSYKNGTKIKTTPLNIQNTTLACGILVPFVDNAIAIICEKVMKIITIESTSSSLTGKNI 753

Query: 658  PLGEHPRRICHQEQSRTFAICSLKNQSCAEESEM---------HFVRLLDDQTFEFISTY 708
             L   PR+I          I    N SC   +E+           + +LD    E I   
Sbjct: 754  QLSYTPRKIITHPTIPLLYILEGDNNSCKVGNEIIQTNEGNWVGGIHILDASQDELIQFI 813

Query: 709  PLDTFEY---GCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGR---ILVFIVEDGKLQ 762
              D  ++   GC + S S    +  Y  VG   +   +  P + +   I V+I+ +  + 
Sbjct: 814  DFDNNKHPTAGCVVRSIS---KNQTYLIVGV--IESYKTRPIQWKSSEIQVYIINEHSIN 868

Query: 763  LIAEKETKGAVYSLNAFNGKLLAAINQKIQLY----KWMLRDDGTRELQSECGHHGHILA 818
                 + +  V +   F G +LA +   ++LY    K +L+    R+L S+         
Sbjct: 869  YCYSTKVEYPVRAFAEFKGMVLAGVGNVLRLYDIGLKSLLKKAEKRQLASDVAQ------ 922

Query: 819  LYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENN 878
              +   G+ I++  +    +L+ Y       +  A D    W+     L+    L ++  
Sbjct: 923  --LHVIGETILLTGISDGFNLIRYNQINHKFDIYA-DSLPRWVVTAAPLNQSTVLASDKF 979

Query: 879  FNLF-----------------TVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVM 921
              +F                 T+ +  +   +    ++    ++ +G+    F   SL+ 
Sbjct: 980  GEIFMYQLPKEIEEKALSPFSTLLQPHKTIYEGSNYKMVTATQFFIGDIATSFAQCSLIP 1039

Query: 922  RLPDSDVGQIPTVIFGTVNGVIGVIASLP---HEQYLFLEKLQTNLRKVIKGVGGLNHEQ 978
              P          IF   N + G+ A +P        F + L+ ++R     +   NH  
Sbjct: 1040 GAPS---------IFLYSNFMGGLSALIPLQNQNDIDFYQHLEMHMRVHWTNLTDRNHIS 1090

Query: 979  WRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            +RS       V  K+ +DGDL E +  L      EI++ M   V E+ K++ +L
Sbjct: 1091 FRS-----SMVPVKDTVDGDLCELYERLPYEIQQEIAEEMEKEVNEIIKKLHDL 1139


>gi|158287218|ref|XP_309311.4| AGAP011340-PA [Anopheles gambiae str. PEST]
 gi|157019545|gb|EAA05261.4| AGAP011340-PA [Anopheles gambiae str. PEST]
          Length = 1434

 Score =  105 bits (262), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 117/484 (24%), Positives = 213/484 (44%), Gaps = 73/484 (15%)

Query: 609  LYSNVNLKEVSHMCPFNSAAFPDS-LAIAKEGELTIG------TIDDIQKLHIRSIPLGE 661
            LY+   +K  +   PFN+   P+  L   ++ EL I       + D +    +R IPL  
Sbjct: 957  LYARTVMKAFA---PFNNVNCPNGFLYFDEQYELKISIFPTYLSYDSVWP--VRKIPLRS 1011

Query: 662  HPRRICHQEQSRTFAICSLKNQSC-------AEESEM----------------HFVRLLD 698
             P++I +  +++ + +     + C        E+ E+                  V L+ 
Sbjct: 1012 SPKQIVYHRENKVYCVVMDAEEICNKYYRFNGEDKELTEENKGERFLYPMGHRFSVVLVT 1071

Query: 699  DQTFEFI--STYPLDTFEYGCSILSCSFSDDS-----NVYYCVGTAYVLPEENEPTKGRI 751
               +E +  ++  L+ +E+  ++ + S + +        Y  VGT +    E+  ++GR+
Sbjct: 1072 PAAWEVVPETSINLEEWEHVIALKNVSLTYEGARSGLKEYIAVGTNFNY-SEDITSRGRL 1130

Query: 752  LVF-----IVEDGK------LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRD 800
            L++     + E GK       + +  K+ KG V +++   G L+ A+ QK+  Y W ++D
Sbjct: 1131 LLYDIIEVVPEPGKPLTKHKFKEVIVKDQKGPVSAISHVCGFLVGAVGQKV--YLWQMKD 1188

Query: 801  DGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANW 860
            D   +L        +I    + +    I+V D+ KS+SLL ++ E   +   +RDY+   
Sbjct: 1189 D---DLVGVAFIDTNIFVHQMVSIKSLILVADVYKSVSLLRFQEEYRTLSVVSRDYHPLN 1245

Query: 861  MSAVEILDDDIYLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVN----- 912
            +  VE + D+  LG   +++  NL T     E        RL    +YHLG+ VN     
Sbjct: 1246 VFQVEYVVDNANLGFLVSDDQCNLITYMYQPESRESFGGQRLLRKSDYHLGQQVNCMFRV 1305

Query: 913  --RFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKG 970
               F    ++ R  + D     T  F T++G IG +  LP + Y  L  LQ  L      
Sbjct: 1306 QCDFHETDVMKRTLNYDNKH--TTFFATLDGGIGFVLPLPEKTYRRLFMLQNVLLTHSPH 1363

Query: 971  VGGLNHEQWRSFNNEKKTV--DAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKR 1028
              GLN + +R+    +K     ++  +DGDL+ SFL+L      E++K +   +EE+C  
Sbjct: 1364 TCGLNPKAYRTIKQTRKLPINPSRCVVDGDLVWSFLELPANEKHEVAKKIGTRIEEICAD 1423

Query: 1029 VEEL 1032
            + E+
Sbjct: 1424 LMEI 1427



 Score = 77.8 bits (190), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 123/547 (22%), Positives = 217/547 (39%), Gaps = 131/547 (23%)

Query: 8   GRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDNGQ 67
           G I +++     G  +D L I+    K  V+Q+D ++ +L T ++    D   R    G 
Sbjct: 75  GNIKSMQSVSLAGSLRDALLISFPDAKLSVVQFDPDNFDLKTLSLHYFEDEDIRGGWTGH 134

Query: 68  IGI----IDPDCRLIGLHLYDGLFKVIPFDNKGQLKE----------------------- 100
             I    +DPD R   + +Y     V+PF     L E                       
Sbjct: 135 YHIPMVRVDPDNRCAVMLVYGRKLVVLPFRKDSSLDEIELQDVKPIKKAPMQLVAKTPIL 194

Query: 101 -AFNIRLEEL-----QVLDIKFLYGCAKPTIVVLYQDNKD------ARHVKTYEVALK-- 146
            ++ I L++L      V+DI+FL+G  +PT+++LY+  +        R      VAL   
Sbjct: 195 ASYIIELKDLDEKIDNVIDIQFLHGYYEPTLLILYEPVRTFPGRIAVRSDTCTMVALSLN 254

Query: 147 -DKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVY-------------CSANA 192
             +      W+ N+L       IP+  P+ G L++   +++Y              SA+ 
Sbjct: 255 IQQRVHPVIWTVNSLPFDCIQAIPINKPIGGCLVMCVNSLIYLNQSVPPYGVSLNSSADH 314

Query: 193 FKAIPIRPSITKAYGRVDADGS--------RYLLGDHAGLLHLLVITHEKEK-VTGLKIE 243
             + P++P   +   R+  D +        + +L    G L++L +  +  + V      
Sbjct: 315 STSFPLKP---QDGVRISLDAAQVCFIEPEKLVLSLKGGELYVLTLCADSMRSVRNFHFN 371

Query: 244 LLGETSIASTISYLDNAVVYIGSSYGDSQLIK-------LNLQPDAKG------------ 284
               + + S I   ++  +++GS  G+S L++       L +  D  G            
Sbjct: 372 KAAASVLTSCICVCEDEYLFLGSRLGNSLLLRFKEKDESLVITIDDSGAVEKEPKRPRLE 431

Query: 285 ------------------SYV-EVLERYVNLGPIVDFCV------VDLERQGQGQ----- 314
                             SY+ EV +  +N+GPI    V       D E Q   Q     
Sbjct: 432 EEELEVYGSGYKTSVQLTSYIFEVCDNVLNIGPIAHMAVGERVAEEDAENQPDVQIVQNK 491

Query: 315 ----VVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL------RSSTDDP-FDT 363
               VVT SG  K+G+L ++++ I      S  L G   +W++      R + D P    
Sbjct: 492 LDIEVVTSSGHGKNGALCVLQSSIKPQVITSFGLSGCVDVWTVFDEAVARRAEDGPSTHA 551

Query: 364 FLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQ-LVQVTSGSVRLVSS 422
           F+++S   E   + +   +E+ E E  GF +   T+   +   N+ +VQVT+ S+RL+  
Sbjct: 552 FMILS--QEGGTMVLQTGEEINEIENTGFATTVPTIHVGNIGTNRFIVQVTTKSIRLLQG 609

Query: 423 TSRELRN 429
           T R L+N
Sbjct: 610 T-RLLQN 615


>gi|350413821|ref|XP_003490124.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Bombus impatiens]
          Length = 1417

 Score =  104 bits (260), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 127/525 (24%), Positives = 233/525 (44%), Gaps = 76/525 (14%)

Query: 575  TQPITLRTFSS-KNTTHVFAASDRPTVIYSSNKKLLYSNVNLKE--VSHMCPFNSAAFPD 631
            T+   +R FS+      VF  SD P  I+ + +  L ++    +  V+   PFN+   P 
Sbjct: 899  TRHCMMRYFSNIAGYNGVFICSDYPHWIFLTGRGELRTHPMGIDGPVTSFAPFNNINCPQ 958

Query: 632  S-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQSRTFAICS-----LK 681
              L   ++ EL I  +           +R +PL   P  + +  +S+T+ + +     LK
Sbjct: 959  GFLYFNRKEELRICVLPTHLSYDAPWPVRKVPLRCTPHFVTYHLESKTYCVITSIAEPLK 1018

Query: 682  N---------QSCAEESEMHFVRLLDDQ----TFEFIS--TYP-----LDTFEYGCSILS 721
            +         +   EE    F+    +Q     F  +S  T P     LD +E+   + +
Sbjct: 1019 SYYRFNGEDKEFTEEERPERFIYPSQEQFSIVLFSPVSWETIPNTKIELDQWEHVTCLKN 1078

Query: 722  CSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-IVE----------DGKLQLIA 765
             S + +        Y  +GT Y   E+   ++GRIL+F I+E            + + I 
Sbjct: 1079 VSLAYEGTRSGLKGYIVLGTNYNYGEDIT-SRGRILIFDIIEVVPEPGQPLTKNRFKQIY 1137

Query: 766  EKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDD---GTRELQSECGHHGHILALYVQ 822
             KE KG + ++   +G L++A+ QKI  Y W L+D+   G   + ++   H  +L++   
Sbjct: 1138 AKEQKGPITAITQVSGFLVSAVGQKI--YIWQLKDNDLVGVAFIDTQIYIH-QMLSI--- 1191

Query: 823  TRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG-----AEN 877
                 I++ D+ KSISLL ++ E   +   +RD+    +  +E L D+  LG      E+
Sbjct: 1192 --KSLILIADVYKSISLLRFQEEYRTLSLVSRDFRPAEVYTIEYLIDNTNLGFLVADGES 1249

Query: 878  NFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIP----- 932
            N  LF  +  S  +   +  +L    ++HLG+ VN F      +  P +D          
Sbjct: 1250 NMALFMYQPESRESLGGQ--KLIRKADFHLGQKVNTFFRIKCRVSDPANDKKHFSGADKR 1307

Query: 933  -TVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTV-- 989
               ++ +++G +G I  +P + Y  L  LQ  L   I  + GLN + +R++ +  +T   
Sbjct: 1308 HVTMYASLDGSLGYILPVPEKTYRRLLMLQNVLVTHICHIAGLNPKAYRTYKSHIRTQGN 1367

Query: 990  DAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTR 1034
             A+  +DGDL+  +L L      +++K +   V+E+ + + E+ R
Sbjct: 1368 PARGIIDGDLVWRYLYLPNNEKIDVAKKIGTRVQEIIEDLTEIDR 1412



 Score = 53.9 bits (128), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 65/330 (19%), Positives = 139/330 (42%), Gaps = 61/330 (18%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRP--T 63
           ++G + +++     G  +D L ++    K  V+++D ++ +L T ++    +   R   T
Sbjct: 73  LHGNVMSMQAVTLVGSQRDSLLLSFRDAKLSVVEYDQDTHDLRTVSLHYFEEEEIRDGWT 132

Query: 64  DNGQIGII--DPDCRLIGLHLYDGLFKVIPF------------------DNKGQLKEAFN 103
           ++  I I+  DP+ R   + +Y     V+PF                   NK  +  ++ 
Sbjct: 133 NHHHIPIVRVDPEGRCAVMLIYGRKLVVLPFKKDPSLDDGDLLDNSKALSNKTPILSSYM 192

Query: 104 IRLEELQ-----VLDIKFLYGCAKPTIVVLYQDNKD------ARHVKTYEVALK---DKD 149
           I L+ L+     ++D++FL+G  +PT+++LY+  +        R      VA+     + 
Sbjct: 193 IVLKSLEEKMDNIIDLQFLHGYYEPTLLILYEPVRTFSGRIAVRQDTCAMVAISLNIQQR 252

Query: 150 FVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAFKAI 196
                WS +NL       +PV  PL G LI+   +++Y +             A      
Sbjct: 253 VHPIIWSVSNLPFDCYQAVPVKKPLGGTLIMAVNSLIYLNQSIPPYGVSLNSLAETSTNF 312

Query: 197 PIRPSITKAYGRVDADGS--------RYLLGDHAGLLHLL-VITHEKEKVTGLKIELLGE 247
           P++P   +   ++  +GS        R ++   +G L++L +       V G   +    
Sbjct: 313 PLKP---QEGVKISLEGSQVAFISSDRLVISLKSGELYVLSLFADSMRSVRGFHFDKAAA 369

Query: 248 TSIASTISYLDNAVVYIGSSYGDSQLIKLN 277
           + + S +   ++  +++GS  G+S L++  
Sbjct: 370 SVLTSCVCMCEDNYLFLGSRLGNSLLLRFT 399



 Score = 43.1 bits (100), Expect = 0.85,   Method: Compositional matrix adjust.
 Identities = 45/172 (26%), Positives = 76/172 (44%), Gaps = 14/172 (8%)

Query: 314 QVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP---------FDTF 364
           ++VT SG  K+G+L +++  I      + EL G + MW++  + ++             F
Sbjct: 507 ELVTTSGYGKNGALCVLQRSIRPQVVTTFELPGCEDMWTVIGTLNNDEQIRPEAEGSHAF 566

Query: 365 LVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQ-LVQVTSGSVRLVSST 423
           L++S    T IL      E+ E +  GF +Q  T+F  +   N+ +VQVT   VRL+   
Sbjct: 567 LILSQEDSTMILQTG--QEINEVDQSGFSTQGSTIFAGNLGANRYIVQVTQMGVRLLQGI 624

Query: 424 SRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQ 475
             E         G  +  A+     V L +  G ++ L + +G  T   H Q
Sbjct: 625 --EQIQHMPIDLGCPIVHASCADPYVTLLSEDGQVMLLTLREGRGTAKLHVQ 674


>gi|110750698|ref|XP_624382.2| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
            [Apis mellifera]
          Length = 1415

 Score =  104 bits (259), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 129/527 (24%), Positives = 236/527 (44%), Gaps = 80/527 (15%)

Query: 575  TQPITLRTFSS-KNTTHVFAASDRPTVIYSSNKKLLYSNVNLKE--VSHMCPFNSAAFPD 631
            T+   +R FS+      VF  SD P  I+ + +  L ++    +  V+   PFN+   P 
Sbjct: 897  TRHCMMRYFSNIAGYNGVFICSDYPHWIFLTGRGELRTHPMGIDGPVTSFAPFNNINCPQ 956

Query: 632  S-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQSRTFAICS-----LK 681
              L   ++ EL I  +           +R +PL   P  + +  +S+T+ + +     LK
Sbjct: 957  GFLYFNRKEELRICVLPTHLSYDAPWPVRKVPLRCTPHFVTYHLESKTYCVITSIAEPLK 1016

Query: 682  N---------QSCAEESEMHFVRLLDDQ----TFEFIS--TYP-----LDTFEYGCSILS 721
            +         +   EE    F+    +Q     F  +S  T P     LD +E+   + +
Sbjct: 1017 SYYRFNGEDKEFTEEERPDRFIFPSQEQFSIVLFSPVSWETIPNTKIELDQWEHVTCLKN 1076

Query: 722  CSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-IVE----------DGKLQLIA 765
             S + +        Y  +GT Y   E+   ++GRIL+F I+E            + + I 
Sbjct: 1077 VSLAYEGTRSGLKGYIVLGTNYNYGEDIT-SRGRILIFDIIEVVPEPGQPLTKNRFKQIY 1135

Query: 766  EKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDD---GTRELQSECGHHGHILALYVQ 822
             KE KG + ++   +G L++A+ QKI  Y W L+D+   G   + ++   H  +L++   
Sbjct: 1136 AKEQKGPITAITQVSGFLVSAVGQKI--YIWQLKDNDLVGVAFIDTQIYIH-QMLSI--- 1189

Query: 823  TRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG-----AEN 877
                 I++ D+ KSISLL ++ E   +   +RD+    +  +E L D+  LG      E+
Sbjct: 1190 --KSLILIADVYKSISLLRFQEEYRTLSLVSRDFRPAEVYTIEYLIDNTNLGFLVADGES 1247

Query: 878  NFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPD--------SDVG 929
            N  LF  +  S  +   +  +L    ++HLG+ VN F    +  R+ D        SD  
Sbjct: 1248 NIALFMYQPESRESLGGQ--KLIRKADFHLGQKVNTFFR--IRCRISDPANDKKHFSDAD 1303

Query: 930  QIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTV 989
            +    ++ +++G +G I  +P + Y  L  LQ  L   I  + GLN + +R++ +  +T 
Sbjct: 1304 KRHVTMYASLDGNLGYILPVPEKTYRRLLMLQNVLVTHICHIAGLNPKAYRTYKSHIRTQ 1363

Query: 990  --DAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTR 1034
               A+  +DGDL+  +L L      +++K +   V+E+ + + E+ R
Sbjct: 1364 GNPARGIIDGDLVWRYLYLPNNEKIDVAKKIGTRVQEIIEDLTEIDR 1410



 Score = 52.8 bits (125), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 65/329 (19%), Positives = 139/329 (42%), Gaps = 61/329 (18%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRP--T 63
           ++G + +++     G  +D L ++    K  V+++D ++ +L T ++    +   R   T
Sbjct: 73  LHGNVMSMQAVTLVGSQRDSLLLSFRDAKLSVVEYDQDTHDLRTVSLHYFEEEEIRDGWT 132

Query: 64  DNGQIGII--DPDCRLIGLHLYDGLFKVIPF------------------DNKGQLKEAFN 103
           ++  I I+  DP+ R   + +Y     V+PF                   NK  +  ++ 
Sbjct: 133 NHHHIPIVRVDPEGRCAVMLIYGRKLVVLPFKKDPSLDDGDLLDNSKASSNKTPILSSYM 192

Query: 104 IRLEELQ-----VLDIKFLYGCAKPTIVVLYQDNKD------ARHVKTYEVALK---DKD 149
           I L+ L+     ++D++FL+G  +PT+++LY+  +        R      VA+     + 
Sbjct: 193 IVLKCLEEKMDNIIDLQFLHGYYEPTLLILYEPVRTFSGRIAVRQDTCAMVAISLNIQQR 252

Query: 150 FVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAFKAI 196
                WS +NL       +PV  PL G LI+   +++Y +             A      
Sbjct: 253 VHPIIWSVSNLPFDCYQAVPVKKPLGGTLIMAVNSLIYLNQSIPPYGVSLNSLAETSTNF 312

Query: 197 PIRPSITKAYGRVDADGS--------RYLLGDHAGLLHLL-VITHEKEKVTGLKIELLGE 247
           P++P   +   ++  +GS        R ++   +G L++L +       V G   +    
Sbjct: 313 PLKP---QEGVKISLEGSQVAFISSDRLVISLKSGELYVLSLFADSMRSVRGFHFDKAAA 369

Query: 248 TSIASTISYLDNAVVYIGSSYGDSQLIKL 276
           + + S +   ++  +++GS  G+S L++ 
Sbjct: 370 SVLTSCVCMCEDNYLFLGSRLGNSLLLRF 398



 Score = 45.4 bits (106), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 46/172 (26%), Positives = 78/172 (45%), Gaps = 14/172 (8%)

Query: 314 QVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP---------FDTF 364
           ++VT SG  K+G+L ++++ I      + EL G + MW++  + ++             F
Sbjct: 505 ELVTTSGYGKNGALCVLQHSIRPQVVTTFELPGCEDMWTVIGTLNNDEQIRPEAEGSHAF 564

Query: 365 LVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQ-LVQVTSGSVRLVSST 423
           L++S    T IL      E+ E +  GF +Q  T+F  +   N+ +VQVT   VRL+   
Sbjct: 565 LILSQEDSTMILQTG--QEINEVDQSGFSTQGSTIFAGNLGANRYIVQVTQMGVRLLQGI 622

Query: 424 SRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQ 475
             E         G  +  A+     V L +  G ++ L + +G  T   HAQ
Sbjct: 623 --EQIQHMPIDLGCPIVHASCADPYVTLLSEDGQVMLLTLREGRGTARLHAQ 672


>gi|350645868|emb|CCD59413.1| Splicing factor 3B subunit 3, 5' fragment [Schistosoma mansoni]
          Length = 1073

 Score =  104 bits (259), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 92/343 (26%), Positives = 158/343 (46%), Gaps = 33/343 (9%)

Query: 361 FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLV 420
           +D +++VSF++ T +L++   + +EE    GF   T TL C     + LVQV    +R +
Sbjct: 433 YDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLTCSQLGDDALVQVYPDGIRHI 490

Query: 421 SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKHAQLEY 478
            S  R   N W++P   ++     N  QV++A  GG LVY E+   G L E  +  ++  
Sbjct: 491 RSDKRV--NVWRAPGKKTITKCAVNRRQVVIALTGGELVYFEMDMTGQLNEYTERKEMPA 548

Query: 479 EISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSV-LLCA 537
           ++ C+ +  I  N   S+  AVG+  D +VRI SL   + +T   + G  IP +   LC 
Sbjct: 549 DVICMALGRIPANEQRSRFLAVGL-ADNTVRILSLDPSDCLTPLTMQG--IPSTPESLCI 605

Query: 538 FE---------------------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQ 576
            E                     GI Y+   L +G LL  +L+  TGEL+D +   LGT+
Sbjct: 606 VEMGTNEPSPSTDDGESEATSSGGILYMNIGLINGVLLRVILDPVTGELSDTRTRYLGTR 665

Query: 577 PITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIA 636
           P+ L     +    V + S R  + Y+   +     ++ + + +   F+S   P+ +   
Sbjct: 666 PVKLFRIMMQGGEAVLSVSSRSWLSYAYQNRFHLIPLSYEALEYASGFSSEQCPEGIVAI 725

Query: 637 KEGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQEQSRTFAI 677
               L I  ++ +  +  + S PL   PR+ + H + + T+ I
Sbjct: 726 CNNSLRIMALEKLGAVFNQISYPLQYTPRKFVFHPDSNITYII 768


>gi|209875989|ref|XP_002139437.1| spliceosomal protein SAP 130 [Cryptosporidium muris RN66]
 gi|209555043|gb|EEA05088.1| spliceosomal protein SAP 130, putative [Cryptosporidium muris RN66]
          Length = 1300

 Score =  103 bits (258), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 102/412 (24%), Positives = 177/412 (42%), Gaps = 45/412 (10%)

Query: 295 NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSL 353
           +L PI+D  + D   +   Q+    G     +LRI   G  + E A   L G  + +W+L
Sbjct: 401 SLSPIIDMKIADSSGEFGPQIYVACGRGPRSTLRICTYGKAVEEMAETPLPGRPRFIWTL 460

Query: 354 RSS---------------TDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQT 398
           +                  +  + +F+++SFI  T +L++   +++EE     F     T
Sbjct: 461 KKGGTSYILNKETSETLEDEGNYHSFIIISFIDRTLVLSVG--EQVEEISDSPFTLSEST 518

Query: 399 LFCHDA-IYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGH 457
           ++     + N  +Q+    V+L++    E   EWK+P G  +  A +N  Q+ LA  GG+
Sbjct: 519 IYASSMELKNSYLQILETYVKLIT---EEKIYEWKAPDGRHIVAADSNGRQISLALSGGY 575

Query: 458 LVYLEIG--DGI---------LTEVKHAQLEYEISCLDINP-IGENPSYSQIAAVGMWTD 505
           +V LE+   D I         LTE+ H ++ Y+I C+ I   I       +  AVG  TD
Sbjct: 576 IVILEMNTIDLISGTTNSIMGLTELCHREVSYDIICISIQQLIYPTKLCREYVAVGTSTD 635

Query: 506 ISVRIF--SLPDLNL---ITKEHLGGEIIPRSVLLC--AFEGISYLLCALGDGHLLNFLL 558
            SVR++  +  D  L    T+       IP +++L         YLL  L +G LL  ++
Sbjct: 636 NSVRVYWINTADKKLKQTSTQVMPNASSIPENIVLYKPGLNDSLYLLIGLNNGVLLCCIV 695

Query: 559 NMKTGELTDRKKVSLGTQPITLRT--FSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLK 616
           +   G L+D +   LG + I +        N    F  ++RP ++  + + L Y  +  +
Sbjct: 696 DELNGTLSDNRSRFLGGKSIKMVRVGIGEYNKLSTFCMANRPWIVNHNGRSLNYIPIQYR 755

Query: 617 EVSHMCPFNSAAFPDSLAIAKEGELTIGTI--DDIQKLHIRSIPLGEHPRRI 666
            +  + PF++  F +         L I  +  DD        I L   PR++
Sbjct: 756 TLDTVAPFHTKQFKNGFVAVSGTTLVIFQVLYDDSGTFTHSIIKLNYTPRKL 807



 Score = 65.5 bits (158), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 81/349 (23%), Positives = 149/349 (42%), Gaps = 43/349 (12%)

Query: 710  LDTFEYGCSILSCSFSDDSNVYYCVGTAYVLP-----EENEPTKGRILVFIVEDGK--LQ 762
            LD  E   S   C F +       VGT Y +       EN  T G  +     D    L+
Sbjct: 964  LDVNEGCISACVCKFDEMDLPCLVVGTTYGMKLRKEYNENSSTLGATIKVYNYDTNFNLK 1023

Query: 763  LIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILAL-YV 821
            L+     +     +  + G+LL +IN+ +++Y       G ++L  +C +      L ++
Sbjct: 1024 LVHVTPIENVATCMIGWRGRLLVSINKTLRIYSL-----GKKKLLKKCEYRNIPEVLVWL 1078

Query: 822  QTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWM-SAVEILDDDIYLGAENNFN 880
            +   D I  GD+   + +  Y   +  +   A D    W+ SA EILD    + ++   N
Sbjct: 1079 KVINDRIFAGDIRHGVIIFKYHSVQNRLSIIANDIMPRWLTSACEILDYHTVITSDKFDN 1138

Query: 881  LFTVRKNSEGATDEERG----------------RLEVVGEYHLGEFVNRFRHGSLVMRLP 924
            L   R  +E +++ +                  ++  V  +H+G+ V       L     
Sbjct: 1139 LIVCRVPTEASSNYDFTSNFNSQTNTSSYMKPYQINPVAHFHVGDLVTCIHKNQL----- 1193

Query: 925  DSDVGQIPTVIFGTVNGVIGVIASLPHEQYL-FLEKLQTNLRKVIKGVGGLNHEQWRSFN 983
             S +G + T+IFGT+ G IG +  + ++  +  L KL+  LR     +   +H  +RS+ 
Sbjct: 1194 -SPLG-VETLIFGTILGSIGTLTPITNKDDVDLLCKLELLLRNESPSLMSRDHLMFRSY- 1250

Query: 984  NEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
                     N +DGDL E+F   S      I+ ++++S++E+ K++++L
Sbjct: 1251 ----YAPVLNVIDGDLCETFTSYSSDVQARIASSLDISIQEIFKKLDDL 1295


>gi|256075924|ref|XP_002574265.1| Splicing factor 3B subunit 3 5' fragment [Schistosoma mansoni]
          Length = 982

 Score =  103 bits (258), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 92/343 (26%), Positives = 158/343 (46%), Gaps = 33/343 (9%)

Query: 361 FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLV 420
           +D +++VSF++ T +L++   + +EE    GF   T TL C     + LVQV    +R +
Sbjct: 433 YDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLTCSQLGDDALVQVYPDGIRHI 490

Query: 421 SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKHAQLEY 478
            S  R   N W++P   ++     N  QV++A  GG LVY E+   G L E  +  ++  
Sbjct: 491 RSDKRV--NVWRAPGKKTITKCAVNRRQVVIALTGGELVYFEMDMTGQLNEYTERKEMPA 548

Query: 479 EISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSV-LLCA 537
           ++ C+ +  I  N   S+  AVG+  D +VRI SL   + +T   + G  IP +   LC 
Sbjct: 549 DVICMALGRIPANEQRSRFLAVGL-ADNTVRILSLDPSDCLTPLTMQG--IPSTPESLCI 605

Query: 538 FE---------------------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQ 576
            E                     GI Y+   L +G LL  +L+  TGEL+D +   LGT+
Sbjct: 606 VEMGTNEPSPSTDDGESEATSSGGILYMNIGLINGVLLRVILDPVTGELSDTRTRYLGTR 665

Query: 577 PITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIA 636
           P+ L     +    V + S R  + Y+   +     ++ + + +   F+S   P+ +   
Sbjct: 666 PVKLFRIMMQGGEAVLSVSSRSWLSYAYQNRFHLIPLSYEALEYASGFSSEQCPEGIVAI 725

Query: 637 KEGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQEQSRTFAI 677
               L I  ++ +  +  + S PL   PR+ + H + + T+ I
Sbjct: 726 CNNSLRIMALEKLGAVFNQISYPLQYTPRKFVFHPDSNITYII 768


>gi|71663721|ref|XP_818850.1| damage-specific DNA binding protein [Trypanosoma cruzi strain CL
            Brener]
 gi|70884123|gb|EAN96999.1| damage-specific DNA binding protein, putative [Trypanosoma cruzi]
          Length = 774

 Score =  103 bits (258), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 159/705 (22%), Positives = 267/705 (37%), Gaps = 134/705 (19%)

Query: 407  NQLVQVTSGSVRLVSSTSRE---LRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI 463
            N  VQVTS  V  V          +NE      + +  A+A+A+  LL       V +  
Sbjct: 28   NTFVQVTSVGVNFVKGGKGVYIICKNE------FGIQHASADANLRLLVFSSSRFVSVV- 80

Query: 464  GDGILTEVKHAQLEY--EISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITK 521
               + T    A LE   E+SCL I       S SQ   +G W   +V ++ + D      
Sbjct: 81   --DLNTHHTRASLELDNEVSCLVI-------SSSQGFVIGEWNSGAVCLYEVQD------ 125

Query: 522  EHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTD-----------RKK 570
                GEI+ +  + C+    S  +C L   H    ++ +  G + D            ++
Sbjct: 126  ----GEILLKGRIFCSATSCS--MCILSHLHTPRLVVGLLNGYIADISLESMLMGSAARE 179

Query: 571  VSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFP 630
              +  QP+ L    S N   V    + P +I   +     + ++  +V+  C     +  
Sbjct: 180  TFIRMQPVQLFNLESHNA--VLCLGEVPLIIILCDTGFQLTGIDFNDVA-ACAIIEGSHI 236

Query: 631  DSLAI---AKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQ---- 683
             S  I     E  L  G I D++KL+   + L     R+ +      F +   +N+    
Sbjct: 237  SSKYIFLSQSENSLAFGNIVDLKKLNSCFLGLKATVTRVKYIAWWNVFVMSVRRNEKDQI 296

Query: 684  ----------SCAEESEMHF----VRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSN 729
                      S   ++E+      + LL ++   F  T  L     G +        D +
Sbjct: 297  LLLMGHELANSWVPQNELLIELFSIELLGNERCVFFETVVLG----GSNESPNGNEADKD 352

Query: 730  VYYCVGTAYVLPEENEPTKGRILVFIVEDGKL-------QLIAEKETKGAVYS---LNAF 779
                +GT +  P+E  P   R + + VE GKL       + I  K+ +GA+     +  +
Sbjct: 353  KAILLGTTFAFPDEQLPRSSRFIWYSVEQGKLISERPQLRQIGSKDVEGALQCCCIVPNY 412

Query: 780  NGKLLAAINQKIQLYKWMLRDDGTRELQSECGHH--GHILALYVQTRGDFIVVGDLMKSI 837
             G++   IN  I LY W   D      ++ C       +L ++ Q    ++V  D   S 
Sbjct: 413  VGRIALGINGCIALYSWNAADSVFVAEETICIGTIVTRVLPIF-QGDASYMVAFDARHSC 471

Query: 838  SLLIYKHEEGAIEERARDYNANW-MSAVEILDDDIY--LGAENNFNLFTVRK-------- 886
              +     +G++E  ARD      M        +IY     ++ +N F + +        
Sbjct: 472  FFIQVDTFQGSLEIVARDSEPRGVMDGTVFQLGNIYNICFGDDYYNFFCLSRGASTLPSA 531

Query: 887  -NSEGATDEERGRLEVVGEYHLGEFVNRFRHGS--------LVMRLPDSDVGQI--PTVI 935
             N+  AT    G+LE   +YHLG+ V   + GS        + + +P + +  I  P V+
Sbjct: 532  SNATSATVSS-GKLETSAQYHLGDMVTAMQLGSFAPCSVTNIAVPIPTTLIPGICGPQVV 590

Query: 936  FGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRS-------------- 981
            FGT +G  G I  + +E YL L+ L+  +  V+  +GG  H  +R               
Sbjct: 591  FGTSHGAFGTITPVSNETYLLLKALEVAVSSVVPALGGFEHVTYREVLRAGQERGYSRNA 650

Query: 982  ------------FNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEI 1014
                        F+  +K   ++    GDLIESFL  S+T    I
Sbjct: 651  SFENANVVSSGVFDKRRKRYLSRCVCSGDLIESFLTFSQTIQQRI 695


>gi|413948668|gb|AFW81317.1| hypothetical protein ZEAMMB73_752618 [Zea mays]
          Length = 122

 Score =  103 bits (258), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 47/58 (81%), Positives = 49/58 (84%)

Query: 1   MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDR 58
           MLDVPIYGRIAT+ELFRPH E QDFLFIATERYKFCVLQWD E SELITR+   V  R
Sbjct: 53  MLDVPIYGRIATIELFRPHNETQDFLFIATERYKFCVLQWDTEKSELITRSHLHVVCR 110


>gi|270003792|gb|EFA00240.1| hypothetical protein TcasGA2_TC003068 [Tribolium castaneum]
          Length = 1392

 Score =  103 bits (258), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 128/526 (24%), Positives = 228/526 (43%), Gaps = 82/526 (15%)

Query: 578  ITLRTFSS-KNTTHVFAASDRPTVIYSSNKKLLYSNVNL--KEVSHMCPFNSAAFPDS-L 633
            I +R F++      VF     P  I+ S +  L ++      EV     FN+   P   L
Sbjct: 877  IKMRYFTNIAGYNGVFVCGANPHWIFMSARGELRTHPMTIDGEVLSFAAFNNVNCPQGFL 936

Query: 634  AIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEES 689
               ++ EL IG +           +R +PL   P  + +  +S+T+ + +    S AE S
Sbjct: 937  YFNRKSELRIGVLPTHLSYDAAWPVRKVPLRCTPHFVTYHLESKTYCLVT----SIAEPS 992

Query: 690  EMHF---------------------------VRLLDDQTFEFI--STYPLDTFEYGCSIL 720
              ++                           + L    +++ I  +   LD +E+   + 
Sbjct: 993  NKYYKFNGEDKELSVEDRGDRFPYPLQEKFSLMLFSPVSWDVIPNTKIDLDEWEHVNCLK 1052

Query: 721  SCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-IVE----------DGKLQLI 764
            + S + +        Y  VGT Y   E+   ++GRIL+F I+E            + + I
Sbjct: 1053 NVSLAYEGTRSGLKGYIAVGTNYNYGEDVT-SRGRILIFDIIEVVPEPGQPLTKNRFKEI 1111

Query: 765  AEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTR 824
              K+ KG V +L+   G L++A+ QKI  Y W L+D+   +L         I    + T 
Sbjct: 1112 YAKDQKGPVTALSQVKGFLVSAVGQKI--YIWQLKDN---DLVGVAFIDTQIYTHQILTI 1166

Query: 825  GDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG-----AENNF 879
               ++V D+ KSISLL ++ E   +   +RD+    + +VE + D+  +G     +E N 
Sbjct: 1167 KSLLLVADVYKSISLLRFQEEYRTLSLVSRDFRPCEVFSVEYMIDNTTMGFLVSDSEKNL 1226

Query: 880  NLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRH--------GSLVMRLPDSDVGQI 931
             L+  +  S  +   +R  L    ++HLG+ VN F          G     L  +D   I
Sbjct: 1227 VLYMYQPESRESLGGQR--LLRKADFHLGQAVNSFFRIKCKLGELGEDKKNLTGADKRHI 1284

Query: 932  PTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKK--TV 989
               ++ T++G +G I  +P + Y  L  LQ  L      + GLN + +R++ + KK  T 
Sbjct: 1285 --TMYATLDGGLGYIMPVPEKTYRRLLMLQNVLVSQGAHIAGLNPKAFRTYKSWKKLQTN 1342

Query: 990  DAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRL 1035
             A++ +DG+L+ ++L LS     E+SK +   +EEL   + ++ ++
Sbjct: 1343 PARSVIDGELVYNYLQLSIPEKLEVSKKIGTKLEELLDDLSDIQKI 1388



 Score = 54.7 bits (130), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 105/542 (19%), Positives = 204/542 (37%), Gaps = 147/542 (27%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMG--DVSDRIGRPT 63
           ++G I +++        +D L +A +  K  V+++D E+ +L T ++   +  D     T
Sbjct: 74  LFGNIMSMQSVNLANSPRDALLLAFKDAKLSVVEYDPETHDLKTLSLHYFEEDDMKDGWT 133

Query: 64  DNGQIGII--DPDCRLIGLHLYDGLFKVIPFD---------------------NKGQLKE 100
            +  + ++  DP+ R   + ++     V+PF                      +K  +  
Sbjct: 134 HHYHVPMVRADPENRCAVMTVFGRKLVVLPFRRENAIDDTDADIKPMIGGAYGSKAPILA 193

Query: 101 AFNIRLEEL-----QVLDIKFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDF 150
           ++ I L++       ++DI+FL+G  +PT+++L++  K      A    T  +A    + 
Sbjct: 194 SYMIVLKDFIDKVDNIIDIQFLHGYYEPTLLILFEPLKTFAGRVAVRTDTCAMAAISLNL 253

Query: 151 VEGP----WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAF 193
            +      WS  NL       +P+  PL G LI     ++Y +             A   
Sbjct: 254 QQKVHPIIWSVANLPFDCVKAVPIKKPLGGTLIFAVNALIYLNQSIPPYGVSLNSIAENS 313

Query: 194 KAIPIRP------SITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEK-VTGLKIELLG 246
              P++P      S+  A      D +  +L    G L++L +  +  + V     E   
Sbjct: 314 TNFPLKPQDDLCISLDCAQATFLEDDT-IVLSLKGGELYVLTLLADNMRYVRSFHFEKAA 372

Query: 247 ETSIASTISYLDNAVVYIGSSYGDSQLIKL------------NLQPDAK----------- 283
            + + + IS  +N  +++GS  G+S L++              ++P AK           
Sbjct: 373 ASVLTTCISVCENNFLFLGSRLGNSLLLRFTEKCNEVITLDETIEPSAKRLKASNSTSEN 432

Query: 284 ---------------------------------------GSYV-EVLERYVNLGPIVDFC 303
                                                   SYV EV +  +N+GP  +  
Sbjct: 433 EDDKVLDTLNDCMASDVLDIRDPEELEVYGNQKQASLQISSYVFEVCDSLLNIGPCGNIS 492

Query: 304 V-------------VDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGM 350
           +             +DL+     ++VT +G  K+G+L +++  +      +  L G   M
Sbjct: 493 LGEPAFLSEEFSENLDLDL----ELVTTAGYGKNGALCVLQKSVRPQIVTTFTLPGCSNM 548

Query: 351 WSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLV 410
           W++ +  D     FL++S    T IL     DE+ E +  GF +   T++   A  NQL 
Sbjct: 549 WTVHAGEDK--HAFLILSQEDGTMILQTG--DEINEIDNTGFATHIPTVY---AGINQLQ 601

Query: 411 QV 412
            +
Sbjct: 602 HI 603


>gi|91078626|ref|XP_968117.1| PREDICTED: similar to cleavage and polyadenylation specificity factor
            cpsf [Tribolium castaneum]
          Length = 1413

 Score =  103 bits (257), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 128/526 (24%), Positives = 228/526 (43%), Gaps = 82/526 (15%)

Query: 578  ITLRTFSS-KNTTHVFAASDRPTVIYSSNKKLLYSNVNL--KEVSHMCPFNSAAFPDS-L 633
            I +R F++      VF     P  I+ S +  L ++      EV     FN+   P   L
Sbjct: 898  IKMRYFTNIAGYNGVFVCGANPHWIFMSARGELRTHPMTIDGEVLSFAAFNNVNCPQGFL 957

Query: 634  AIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEES 689
               ++ EL IG +           +R +PL   P  + +  +S+T+ + +    S AE S
Sbjct: 958  YFNRKSELRIGVLPTHLSYDAAWPVRKVPLRCTPHFVTYHLESKTYCLVT----SIAEPS 1013

Query: 690  EMHF---------------------------VRLLDDQTFEFI--STYPLDTFEYGCSIL 720
              ++                           + L    +++ I  +   LD +E+   + 
Sbjct: 1014 NKYYKFNGEDKELSVEDRGDRFPYPLQEKFSLMLFSPVSWDVIPNTKIDLDEWEHVNCLK 1073

Query: 721  SCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-IVE----------DGKLQLI 764
            + S + +        Y  VGT Y    E+  ++GRIL+F I+E            + + I
Sbjct: 1074 NVSLAYEGTRSGLKGYIAVGTNYNY-GEDVTSRGRILIFDIIEVVPEPGQPLTKNRFKEI 1132

Query: 765  AEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTR 824
              K+ KG V +L+   G L++A+ QKI  Y W L+D+   +L         I    + T 
Sbjct: 1133 YAKDQKGPVTALSQVKGFLVSAVGQKI--YIWQLKDN---DLVGVAFIDTQIYTHQILTI 1187

Query: 825  GDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG-----AENNF 879
               ++V D+ KSISLL ++ E   +   +RD+    + +VE + D+  +G     +E N 
Sbjct: 1188 KSLLLVADVYKSISLLRFQEEYRTLSLVSRDFRPCEVFSVEYMIDNTTMGFLVSDSEKNL 1247

Query: 880  NLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRH--------GSLVMRLPDSDVGQI 931
             L+  +  S  +   +R  L    ++HLG+ VN F          G     L  +D   I
Sbjct: 1248 VLYMYQPESRESLGGQR--LLRKADFHLGQAVNSFFRIKCKLGELGEDKKNLTGADKRHI 1305

Query: 932  PTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKK--TV 989
               ++ T++G +G I  +P + Y  L  LQ  L      + GLN + +R++ + KK  T 
Sbjct: 1306 --TMYATLDGGLGYIMPVPEKTYRRLLMLQNVLVSQGAHIAGLNPKAFRTYKSWKKLQTN 1363

Query: 990  DAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRL 1035
             A++ +DG+L+ ++L LS     E+SK +   +EEL   + ++ ++
Sbjct: 1364 PARSVIDGELVYNYLQLSIPEKLEVSKKIGTKLEELLDDLSDIQKI 1409



 Score = 63.2 bits (152), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 118/588 (20%), Positives = 226/588 (38%), Gaps = 150/588 (25%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMG--DVSDRIGRPT 63
           ++G I +++        +D L +A +  K  V+++D E+ +L T ++   +  D     T
Sbjct: 74  LFGNIMSMQSVNLANSPRDALLLAFKDAKLSVVEYDPETHDLKTLSLHYFEEDDMKDGWT 133

Query: 64  DNGQIGII--DPDCRLIGLHLYDGLFKVIPFD---------------------NKGQLKE 100
            +  + ++  DP+ R   + ++     V+PF                      +K  +  
Sbjct: 134 HHYHVPMVRADPENRCAVMTVFGRKLVVLPFRRENAIDDTDADIKPMIGGAYGSKAPILA 193

Query: 101 AFNIRLEEL-----QVLDIKFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDF 150
           ++ I L++       ++DI+FL+G  +PT+++L++  K      A    T  +A    + 
Sbjct: 194 SYMIVLKDFIDKVDNIIDIQFLHGYYEPTLLILFEPLKTFAGRVAVRTDTCAMAAISLNL 253

Query: 151 VEGP----WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAF 193
            +      WS  NL       +P+  PL G LI     ++Y +             A   
Sbjct: 254 QQKVHPIIWSVANLPFDCVKAVPIKKPLGGTLIFAVNALIYLNQSIPPYGVSLNSIAENS 313

Query: 194 KAIPIRP------SITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEK-VTGLKIELLG 246
              P++P      S+  A      D +  +L    G L++L +  +  + V     E   
Sbjct: 314 TNFPLKPQDDLCISLDCAQATFLEDDT-IVLSLKGGELYVLTLLADNMRYVRSFHFEKAA 372

Query: 247 ETSIASTISYLDNAVVYIGSSYGDSQLIKL------------NLQPDAK----------- 283
            + + + IS  +N  +++GS  G+S L++              ++P AK           
Sbjct: 373 ASVLTTCISVCENNFLFLGSRLGNSLLLRFTEKCNEVITLDETIEPSAKRLKASNSTSEN 432

Query: 284 ---------------------------------------GSYV-EVLERYVNLGPIVDFC 303
                                                   SYV EV +  +N+GP  +  
Sbjct: 433 EDDKVLDTLNDCMASDVLDIRDPEELEVYGNQKQASLQISSYVFEVCDSLLNIGPCGNIS 492

Query: 304 V-------------VDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGM 350
           +             +DL+     ++VT +G  K+G+L +++  +      +  L G   M
Sbjct: 493 LGEPAFLSEEFSENLDLDL----ELVTTAGYGKNGALCVLQKSVRPQIVTTFTLPGCSNM 548

Query: 351 WSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHD-AIYNQL 409
           W++ +  D     FL++S    T IL     DE+ E +  GF +   T++  +      +
Sbjct: 549 WTVHAGEDK--HAFLILSQEDGTMILQTG--DEINEIDNTGFATHIPTVYAGNLGNLKYI 604

Query: 410 VQVTSGSVRLVSSTS--RELRNEWKSPPGYSVNVATANASQVLLATGG 455
           VQVTS +VRL+   +  + +  E  SP    V+V + +    LL T G
Sbjct: 605 VQVTSSAVRLLQGINQLQHIPLELGSP---IVHVTSVDPYISLLTTDG 649


>gi|340710064|ref|XP_003393618.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Bombus terrestris]
          Length = 1417

 Score =  103 bits (257), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 126/525 (24%), Positives = 232/525 (44%), Gaps = 76/525 (14%)

Query: 575  TQPITLRTFSS-KNTTHVFAASDRPTVIYSSNKKLLYSNVNLKE--VSHMCPFNSAAFPD 631
            T+   +R FS+      VF  SD P  I+ + +  L ++    +  V+   PFN+   P 
Sbjct: 899  TRHCMMRYFSNIAGYNGVFICSDYPHWIFLTGRGELRTHPMGIDGPVTSFAPFNNINCPQ 958

Query: 632  S-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQSRTFAICS-----LK 681
              L   ++ EL I  +           +R +PL   P  + +  +S+T+ + +     LK
Sbjct: 959  GFLYFNRKEELRICVLPTHLSYDAPWPVRKVPLRCTPHFVTYHLESKTYCVITSIAEPLK 1018

Query: 682  N---------QSCAEESEMHFVRLLDDQ----TFEFIS--TYP-----LDTFEYGCSILS 721
            +         +   EE    F+    +Q     F  +S  T P     LD +E+   + +
Sbjct: 1019 SYYRFNGEDKEFTEEERPERFIYPSQEQFSIVLFSPVSWETIPNTKIELDQWEHVTCLKN 1078

Query: 722  CSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-IVE----------DGKLQLIA 765
             S + +        Y  +GT Y   E+   ++GRIL+F I+E            + + I 
Sbjct: 1079 VSLAYEGTRSGLKGYIVLGTNYNYGEDIT-SRGRILIFDIIEVVPEPGQPLTKNRFKQIY 1137

Query: 766  EKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDD---GTRELQSECGHHGHILALYVQ 822
             KE KG + ++   +G L++A+ QKI  Y W L+D+   G   + ++   H  +L++   
Sbjct: 1138 AKEQKGPITAITQVSGFLVSAVGQKI--YIWQLKDNDLVGVAFIDTQIYIH-QMLSI--- 1191

Query: 823  TRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG-----AEN 877
                 I++ D+ KSISLL ++ E   +   +RD+    +  +E L D+  LG      E+
Sbjct: 1192 --KSLILIADVYKSISLLRFQEEYRTLSLVSRDFRPAEVYTIEYLIDNTNLGFLVADGES 1249

Query: 878  NFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIP----- 932
            N  LF  +  S  +   +  +L    ++HLG+ VN F      +  P +D          
Sbjct: 1250 NMALFMYQPESRESLGGQ--KLIRKADFHLGQKVNTFFRIRCRLSDPANDKKHFSGADKR 1307

Query: 933  -TVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTV-- 989
               ++ +++G +G I  +P + Y  L  LQ  L   I  + GLN + +R++ +  +T   
Sbjct: 1308 HVTMYASLDGSLGYILPVPEKTYRRLLMLQNVLVTHICHIAGLNPKAYRTYKSHIRTQGN 1367

Query: 990  DAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTR 1034
             A+  +DGDL+  +  L      +++K +   V+E+ + + E+ R
Sbjct: 1368 PARGIIDGDLVWRYFYLPNNEKIDVAKKIGTRVQEIIEDLTEIDR 1412



 Score = 53.9 bits (128), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 65/330 (19%), Positives = 139/330 (42%), Gaps = 61/330 (18%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRP--T 63
           ++G + +++     G  +D L ++    K  V+++D ++ +L T ++    +   R   T
Sbjct: 73  LHGNVMSMQAVTLVGSQRDSLLLSFRDAKLSVVEYDQDTHDLRTVSLHYFEEEEIRDGWT 132

Query: 64  DNGQIGII--DPDCRLIGLHLYDGLFKVIPF------------------DNKGQLKEAFN 103
           ++  I I+  DP+ R   + +Y     V+PF                   NK  +  ++ 
Sbjct: 133 NHHHIPIVRVDPEGRCAVMLIYGRKLVVLPFKKDPSLDDGDLLDNSKALSNKTPILSSYM 192

Query: 104 IRLEELQ-----VLDIKFLYGCAKPTIVVLYQDNKD------ARHVKTYEVALK---DKD 149
           I L+ L+     ++D++FL+G  +PT+++LY+  +        R      VA+     + 
Sbjct: 193 IVLKSLEEKMDNIIDLQFLHGYYEPTLLILYEPVRTFSGRIAVRQDTCAMVAISLNIQQR 252

Query: 150 FVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAFKAI 196
                WS +NL       +PV  PL G LI+   +++Y +             A      
Sbjct: 253 VHPIIWSVSNLPFDCYQAVPVKKPLGGTLIMAVNSLIYLNQSIPPYGVSLNSLAETSTNF 312

Query: 197 PIRPSITKAYGRVDADGS--------RYLLGDHAGLLHLL-VITHEKEKVTGLKIELLGE 247
           P++P   +   ++  +GS        R ++   +G L++L +       V G   +    
Sbjct: 313 PLKP---QEGVKISLEGSQVAFISSDRLVISLKSGELYVLSLFADSMRSVRGFHFDKAAA 369

Query: 248 TSIASTISYLDNAVVYIGSSYGDSQLIKLN 277
           + + S +   ++  +++GS  G+S L++  
Sbjct: 370 SVLTSCVCMCEDNYLFLGSRLGNSLLLRFT 399



 Score = 44.7 bits (104), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 46/172 (26%), Positives = 77/172 (44%), Gaps = 14/172 (8%)

Query: 314 QVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP---------FDTF 364
           ++VT SG  K+G+L +++  I      + EL G + MW++  + ++             F
Sbjct: 507 ELVTTSGYGKNGALCVLQRSIRPQVVTTFELPGCEDMWTVIGTLNNDEQIRPEAEGSHAF 566

Query: 365 LVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQ-LVQVTSGSVRLVSST 423
           L++S    T IL      E+ E +  GF +Q  T+F  +   N+ +VQVT   VRL+   
Sbjct: 567 LILSQEDSTMILQTG--QEINEVDQSGFSTQGSTIFAGNLGANRYIVQVTQMGVRLLQGI 624

Query: 424 SRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQ 475
             E         G  +  A+     V L +  G ++ L + +G  T   HAQ
Sbjct: 625 --EQIQHMPIDLGCPIVHASCADPYVTLLSEDGQVMLLTLREGRGTAKLHAQ 674


>gi|431908146|gb|ELK11749.1| Cleavage and polyadenylation specificity factor subunit 1 [Pteropus
            alecto]
          Length = 820

 Score =  103 bits (257), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 113/466 (24%), Positives = 194/466 (41%), Gaps = 74/466 (15%)

Query: 618  VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQS 672
            +    PF++   P   L   ++GEL I  +           +R IPL      + +  +S
Sbjct: 349  IDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVES 408

Query: 673  RTFAICSLKNQSCAE-----------------------ESEMHFVRLLDDQTFEFI--ST 707
            + +A+ +  N  C                         + E   ++L+   ++E I  + 
Sbjct: 409  KVYAVATSTNTPCTRIPRMTGEEKEFETIERDDRYIHPQQEAFSIQLISPVSWEAIPNAR 468

Query: 708  YPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-----IVE 757
              L+ +E+   + + S   +  V     Y   GT  +  EE    +GRIL+      + E
Sbjct: 469  IELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVT-CRGRILIMDVIEVVPE 527

Query: 758  DG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECG 811
             G      K +++ EKE KG V +L   NG L++AI QKI L  W LR     EL     
Sbjct: 528  PGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFL--WSLR---ASELTGMAF 582

Query: 812  HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI 871
                +    + +  +FI+  D+MKSISLL Y+ E   +   +RD     + +V+ + D+ 
Sbjct: 583  IDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNA 642

Query: 872  YLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV 928
             LG   ++ + NL       E        RL    ++H+G  VN F       R P    
Sbjct: 643  QLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF------WRTPCRGA 696

Query: 929  GQIPT-----------VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHE 977
             + P+             F T++G IG++  +  + Y  L  LQ  L  ++    GLN  
Sbjct: 697  AEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPR 756

Query: 978  QWRSFNNEKKTVD--AKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
             +R  + +++T+    +N LDG+L+  +L LS     E++K +  +
Sbjct: 757  AFRMLHVDRRTLQNAVRNVLDGELLNRYLYLSTMERGELAKKIGTT 802


>gi|53791734|dbj|BAD53405.1| UV-damaged DNA-binding protein-like [Oryza sativa Japonica Group]
          Length = 154

 Score =  103 bits (257), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 50/93 (53%), Positives = 64/93 (68%), Gaps = 7/93 (7%)

Query: 845 EEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGE 904
            +G  +E +RDYN  WM+AVE+LDD +Y+GA+N +NLFTV K          GRL V+G+
Sbjct: 61  NKGQTDELSRDYNITWMTAVEMLDDYVYIGADNCYNLFTVLK-------RRVGRLLVIGQ 113

Query: 905 YHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFG 937
           YHLG+ VNRF  G LVM+ P S+V QIP  IFG
Sbjct: 114 YHLGDLVNRFHQGPLVMQDPGSEVDQIPAFIFG 146


>gi|357611296|gb|EHJ67409.1| putative cleavage and polyadenylation specific factor 1 [Danaus
            plexippus]
          Length = 328

 Score =  103 bits (256), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 89/337 (26%), Positives = 160/337 (47%), Gaps = 41/337 (12%)

Query: 722  CSFSDDSNV--YYCVGTAYVLPEENEPTKGRILVFIVED-----------GKLQLIAEKE 768
            C F   S +  Y  +GT Y   E+   ++GRIL++ + D            + + I  KE
Sbjct: 4    CGFIQLSGLRGYIAIGTNYNYGEDI-TSRGRILIYDIIDVVPEPGQPLTKNRFKEIYAKE 62

Query: 769  TKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDD---GTRELQSECGHHGHILALYVQTRG 825
             KG V +L    G L++A+ QKI  Y W L+D+   G   + ++   H  +LA+      
Sbjct: 63   QKGPVTALTQVLGFLISAVGQKI--YLWQLKDNDLVGVAFIDTQIYVH-RMLAV-----K 114

Query: 826  DFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG-----AENNFN 880
            + I+V D+ KSISLL Y+H+   +   +RD     +  ++ + D+  LG     +E NF 
Sbjct: 115  NLILVADVYKSISLLRYQHQHRTLSLVSRDLRTAQIYDMQFMIDNTSLGFLVSESEGNFA 174

Query: 881  LFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVN 940
            ++  +  +  +   +R  L    +YHLG+ V+       + RL      Q    +F T++
Sbjct: 175  MYMHQPQARESYGGQR--LIRKCDYHLGQRVH------AMFRLAARGERQTHVTMFTTLD 226

Query: 941  GVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTV---DAKNFLDG 997
            G +G +  +  + Y  L  LQ  +      + GLN + +R++   ++ +    A+  LDG
Sbjct: 227  GGVGYVLPVSEKVYRRLLMLQNVINNYCCHLAGLNPKAYRTYKVSRRALCGGAARGVLDG 286

Query: 998  DLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTR 1034
            DL+  +  + RT   +I++ +   VEE+   + E+ R
Sbjct: 287  DLVSLYTSMPRTEQQDIARKIGTKVEEIMSDLYEIDR 323


>gi|255918233|ref|NP_001157645.1| cleavage and polyadenylation specificity factor subunit 1 isoform 1
            [Mus musculus]
          Length = 1450

 Score =  103 bits (256), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 114/480 (23%), Positives = 199/480 (41%), Gaps = 74/480 (15%)

Query: 618  VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQS 672
            +    PF++   P   L   ++GEL I  +           +R IPL      + +  +S
Sbjct: 970  IDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVES 1029

Query: 673  RTFAICSLKNQSCAE-----------------------ESEMHFVRLLDDQTFEFI--ST 707
            + +A+ +  N  C                         + E   ++L+   ++E I  + 
Sbjct: 1030 KVYAVATSTNTPCTRIPRMTGEEKEFEAIERDDRYIHPQQEAFSIQLISPVSWEAIPNAR 1089

Query: 708  YPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-----IVE 757
              L+ +E+   + + S   +  V     Y   GT  +  EE    +GRIL+      + E
Sbjct: 1090 IELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVT-CRGRILIMDVIEVVPE 1148

Query: 758  DG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECG 811
             G      K +++ EKE KG V +L   NG L++AI QKI L  W LR     EL     
Sbjct: 1149 PGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFL--WSLR---ASELTGMAF 1203

Query: 812  HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI 871
                +    + +  +FI+  D+MKSISLL Y+ E   +   +RD     + +V+ + D+ 
Sbjct: 1204 IDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNA 1263

Query: 872  YLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV 928
             LG   ++ + NL       E        RL    ++H+G  VN F       R P    
Sbjct: 1264 QLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF------WRTPCRGA 1317

Query: 929  GQIPT-----------VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHE 977
             + P+             F T++G IG++  +  + Y  L  LQ  L  ++    GLN  
Sbjct: 1318 AEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPR 1377

Query: 978  QWRSFNNEKKTVD--AKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRL 1035
             +R  + +++ +    +N LDG+L+  +L LS     E++K +  + +    R+   +R+
Sbjct: 1378 AFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTTPDITGSRLTSNSRM 1437



 Score = 58.9 bits (141), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 108/565 (19%), Positives = 220/565 (38%), Gaps = 151/565 (26%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAM-----GDVSDRIG 60
            +G + ++   +  G  +D L ++ +  K  V+++D  + +L T ++      ++ D   
Sbjct: 74  FFGNVMSMASVQLAGAKRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPELRDGFV 133

Query: 61  RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPF-------DNKGQLKEA----------FN 103
           +     ++ + DPD R   + +Y     V+PF       +++G + E            +
Sbjct: 134 QNVHTPRVRV-DPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLMGEGQRSSFLPSYIID 192

Query: 104 IR-LEE--LQVLDIKFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDFVEG-- 153
           +R L+E  L ++D++FL+G  +PT+++L++ N+      A    T  +     +  +   
Sbjct: 193 VRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKVH 252

Query: 154 P--WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAFKAIPI 198
           P  WS  +L       + VP P+ GV+I    +++Y +                  A P+
Sbjct: 253 PVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFPL 312

Query: 199 RPSITKAYGRVDADGS--------RYLLGDHAGLLHLL-VITHEKEKVTGLKIELLGETS 249
           R   T+   R+  D +        + ++    G +++L +IT     V     +    + 
Sbjct: 313 R---TQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASV 369

Query: 250 IASTISYLDNAVVYIGSSYGDSQLIKL--NLQ---------------PDAKGSYVE---- 288
           + +++  ++   +++GS  G+S L+K    LQ               P +K   VE    
Sbjct: 370 LTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASSVREAADKEEPPSKKKRVEPAVG 429

Query: 289 ----------------------------------VLERYVNLGPIVDFCVVD---LERQG 311
                                             V +  +N+GP  +  V +   L  + 
Sbjct: 430 WTGGKTVPQDEVDEIEVYGSEAQSGTQLATYSFEVCDSMLNIGPCANAAVGEPAFLSEEF 489

Query: 312 QG------QVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL------------ 353
           Q       ++V CSG  K+G+L +++  I      + EL G   MW++            
Sbjct: 490 QNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEEEETP 549

Query: 354 ------------RSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFC 401
                       ++  D     FL++S    T IL      E+ E +  GF +Q  T+F 
Sbjct: 550 KAESTEQEPSAPKAEEDGRRHGFLILSREDSTMILQTG--QEIMELDTSGFATQGPTVFA 607

Query: 402 HDAIYNQ-LVQVTSGSVRLVSSTSR 425
            +   N+ +VQV+   +RL+   ++
Sbjct: 608 GNIGDNRYIVQVSPLGIRLLEGVNQ 632


>gi|417406474|gb|JAA49895.1| Putative mrna cleavage and polyadenylation factor ii complex subunit
            cft1 cpsf subunit [Desmodus rotundus]
          Length = 1444

 Score =  102 bits (255), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 112/466 (24%), Positives = 194/466 (41%), Gaps = 74/466 (15%)

Query: 618  VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQS 672
            +    PF++   P   L   ++GEL I  +           +R IPL      + +  +S
Sbjct: 973  IDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVES 1032

Query: 673  RTFAICSLKNQSCAE-----------------------ESEMHFVRLLDDQTFEFI--ST 707
            + +A+ +  N  C                         + E   ++L+   ++E I  + 
Sbjct: 1033 KVYAVATSTNTPCTRIPRMTGEEKEFETIERDDRYIHPQQEAFSIQLISPVSWEAIPNAR 1092

Query: 708  YPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-----IVE 757
              L+ +E+   + + S   +  V     Y   GT  +  EE    +GRIL+      + E
Sbjct: 1093 IELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVT-CRGRILIMDVIEVVPE 1151

Query: 758  DG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECG 811
             G      K +++ EKE KG V +L   NG L++AI QKI L  W LR     EL     
Sbjct: 1152 PGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFL--WSLR---ASELTGMAF 1206

Query: 812  HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI 871
                +    + +  +FI+  D+MKSISLL Y+ +   +   +RD     + +V+ + D+ 
Sbjct: 1207 IDTQLYIHQMISVKNFILAADVMKSISLLRYQEDSKTLSLVSRDAKPLEVYSVDFMVDNA 1266

Query: 872  YLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV 928
             LG   ++ + NL       E        RL    ++H+G  VN F       R P    
Sbjct: 1267 QLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF------WRTPCRGA 1320

Query: 929  GQIPT-----------VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHE 977
             + P+             F T++G IG++  +  + Y  L  LQ  L  ++    GLN  
Sbjct: 1321 AEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPR 1380

Query: 978  QWRSFNNEKKTVD--AKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
             +R  + +++T+    +N LDG+L+  +L LS     E++K +  +
Sbjct: 1381 AFRMLHVDRRTLQNAVRNILDGELLNRYLYLSTMERSELAKKIGTT 1426



 Score = 47.4 bits (111), Expect = 0.040,   Method: Compositional matrix adjust.
 Identities = 63/326 (19%), Positives = 141/326 (43%), Gaps = 60/326 (18%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAM-----GDVSDRIG 60
            +G + ++   +  G  +D L ++ +  K  V+++D  + +L T ++      ++ D   
Sbjct: 77  FFGNVMSMASVQLAGAKRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPELRDGFV 136

Query: 61  RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPF-------DNKGQLKEA----------FN 103
           +     ++ + DPD R   + +Y     V+PF       +++G + E            +
Sbjct: 137 QNVHTPRVRV-DPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLMGEGQRSSFLPSYIID 195

Query: 104 IR-LEE--LQVLDIKFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDFVEG-- 153
           +R L+E  L ++D++FL+G  +PT+++L++ N+      A    T  +     +  +   
Sbjct: 196 VRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKVH 255

Query: 154 P--WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAFKAIPI 198
           P  WS  +L       + VP P+ GV+I    +++Y +              +   A P+
Sbjct: 256 PVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTSGTTAFPL 315

Query: 199 RPSITKAYGRVDADGS--------RYLLGDHAGLLHLL-VITHEKEKVTGLKIELLGETS 249
           R   T+   R   D +        + ++    G +++L +IT     V     +    + 
Sbjct: 316 R---TQEGVRTTLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRSFHFDKAAASV 372

Query: 250 IASTISYLDNAVVYIGSSYGDSQLIK 275
           + S++  ++   +++GS  G+S L+K
Sbjct: 373 LTSSMVTMEPGYLFLGSRLGNSLLLK 398


>gi|440301427|gb|ELP93813.1| DNA repair protein xp-E, putative [Entamoeba invadens IP1]
          Length = 1001

 Score =  102 bits (255), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 97/331 (29%), Positives = 147/331 (44%), Gaps = 41/331 (12%)

Query: 700  QTFEFI--STYPLDTFEYGCSILSCSFSDD-SNVYYCVGTAYVLPEENEPTKGRILVFIV 756
            Q F  I  ++YP +     C    C   +  S   Y +GTA +   E EPT GR+LV   
Sbjct: 684  QLFNLIDGASYPSEFLLLSCDEEICMCVETISEHLYVIGTAIIKENEVEPTVGRLLVAEE 743

Query: 757  EDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHI 816
             DGKL L    E +GAVY++  F   +LA IN+ + +          R +        ++
Sbjct: 744  VDGKLVLKCTHEFEGAVYAIKKFKQHVLALINRHLHVMDITNESVLPRNIIEL-----NM 798

Query: 817  LALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA-RDYNANWMSAVEILDDDIYLGA 875
            + + ++  GD+I+VGD MKS+++ + K ++ +  E+A  D   +W++AV  +++    G 
Sbjct: 799  IGVCMEVLGDYIIVGDFMKSVTVFVCKSDDLSQTEKAWTDIAVSWVNAVGCIEN----GD 854

Query: 876  ENNFNLFTVRKNSEGATD--EERGRLE----VVGEYHLGEFVNRFRHGSLVMRLPDSDVG 929
             N F    V  N +      EER  +E     VG+ H+ E +N F   SL          
Sbjct: 855  SNKFVSCDVDGNVKIFVKGGEERWYIEDLLKCVGKIHVCECIN-FVEKSLY--------- 904

Query: 930  QIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTV 989
                V  G V+G +  I+ L    Y  L+K Q  L K          + WR F   K   
Sbjct: 905  --KGVTLGGVSGALYNISCLSDNDYQILKKAQDMLVK----------DNWRQFVGTKTKE 952

Query: 990  DAKNFLDGDLIESFLDLSRTRMDEISKTMNV 1020
               NF+DGD IE+ LD    R     K + V
Sbjct: 953  VMTNFIDGDKIETILDWPLERQTSFCKKIGV 983



 Score = 98.6 bits (244), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 81/343 (23%), Positives = 166/343 (48%), Gaps = 21/343 (6%)

Query: 9   RIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDNGQI 68
           +I  L  ++P  +  +++ + T   +  V++    ++ L   + G +SD  GR +  G  
Sbjct: 63  KIEVLLKYKPVKDEDEYIIVVTASVQIDVIKL---TTNLHVVSSGFLSDPFGRLSFYGMK 119

Query: 69  GIIDPDCRLIGLHLYDGLFKVIPFDNKGQLK-EAFNIRLEELQVLDIKFLYGCAKPTIVV 127
            ++ P+ + + LH+Y+ L K +    +   K  A N ++   +++D  F     + T+V 
Sbjct: 120 AVVSPNGQYLLLHIYEQLIKFVKLPQRPSDKIFATNSKISINRIIDFSFCTIADQETVVF 179

Query: 128 LYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVY 187
           L+++++D RHV TY +    ++FV+  +SQ N+ +   L++ +P  + GVL+I +ET  Y
Sbjct: 180 LHENSRDTRHVNTY-IIDDHQNFVKSTFSQPNVGSSTRLVLGLPNGVNGVLVISDETACY 238

Query: 188 CSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-KVTGLKIELLG 246
            +      + +     +       +    +L D     HLL +    E  V+ +   +L 
Sbjct: 239 FTGTG-DHVLVNVGKNRITTTTFLNKDMVILTDSNTGCHLLKLKVCVENSVSEILYSVLP 297

Query: 247 ET-SIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVV 305
            T S+A T+S +   +++ GSS G S L++++          E+ E + N GPI D   +
Sbjct: 298 STESVAQTVSVIGGDIIFFGSSSGKSYLMRVS---------GEIFETWENCGPITDLKKI 348

Query: 306 DLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIK 348
           +     +  V+ C+G  ++  L ++  G G+ E  S  L+ ++
Sbjct: 349 E---NKENYVINCNGGGQE-CLGVIFKGSGVKELGSTSLEHVR 387


>gi|219110831|ref|XP_002177167.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217411702|gb|EEC51630.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 1303

 Score =  102 bits (254), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 124/503 (24%), Positives = 212/503 (42%), Gaps = 75/503 (14%)

Query: 233 EKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNL--QPDA-------- 282
           +K KV G++I LL      + ++     +++  +S+GD  L +      PDA        
Sbjct: 372 DKTKVVGMEIALLDTLPNVNALNVSKLGMLFAAASFGDHHLYQFERIDLPDAPKNTSDEA 431

Query: 283 ------KGSYV-------EVLERYV--------------NLGPIVDFCVVDLE-RQGQGQ 314
                  GS V       E+  ++V              N  P     V +L   +   Q
Sbjct: 432 TKALTASGSTVLTTALASEIASKFVPTVLQNLRKIHSLDNPSPTTGVLVGELAGNEVSPQ 491

Query: 315 VVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKG----MWSLRSSTDDPFDTFLVVSFI 370
           +   +G+    +LR+ R+G  + E A  EL G+ G    + +  ++     D F+VVSF 
Sbjct: 492 IYALTGSGPTSALRVTRHGASVTELAVSELPGVPGAIFTIGTGNTAGGKKLDQFIVVSFA 551

Query: 371 SETRILAMNLEDELEETEIE-GFCSQTQTLFCHD-AIYNQLVQVTSGSVRLVSSTSRELR 428
             T +L++   + +EE   E GF + + TL C        L QV    VR + +   +  
Sbjct: 552 DATLVLSVG--ETVEEVGKESGFLTNSPTLACSALGSDGALCQVHPVGVRHIQNGQAK-- 607

Query: 429 NEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG--DGILTEVKHAQLEYEISCLDIN 486
            +W  P    +  A+AN SQV++A  GG ++Y E+    G L E    ++  ++ CLD+ 
Sbjct: 608 -QWHCPGLKRIECASANESQVIVALAGGEIIYFELDPMSGNLMESATREMGADVCCLDVG 666

Query: 487 PIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEII--PRSVLLCAFE-GIS 542
            I +  S S  A VG   D +VR+ SL PD   I  +     +   P SV L     G++
Sbjct: 667 TIPKGRSRSLFAVVG-CRDQTVRVISLEPDKTKILSQRSSTALKARPHSVALQMMNAGVA 725

Query: 543 ---------YLLCALGDGHLLNFLLNMKTGEL-TDRKKVSLGTQPITLRTFSSKNTTHVF 592
                     L+  L DG  L   ++  TG + T   +  LG +P+ +   S ++     
Sbjct: 726 NSNANVDDLTLIIGLDDGSSLRASIDPITGSIGTSPTRRFLGARPVAVSRISLESNPATL 785

Query: 593 AASDRPTVIY--SSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTID--- 647
             S RP +    +++ K + + ++   + H C F+S A  + +       L I +I+   
Sbjct: 786 LLSSRPWITRRDAASGKHIMAPLSYAPLDHGCSFSSDAVSEGIVATAGKTLRILSIESSG 845

Query: 648 ----DIQKLHIRSIPLGEHPRRI 666
               D +  +   IPL   PR++
Sbjct: 846 MEGGDDEAFNTNKIPLRYTPRQM 868



 Score = 73.6 bits (179), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 77/303 (25%), Positives = 139/303 (45%), Gaps = 42/303 (13%)

Query: 751  ILVFIVEDGKLQLIAEKETK-GAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTREL--Q 807
            ++  IV   + QL+       G V SL  F G+LL  I   ++LY+      G R+L  +
Sbjct: 1015 VIYRIVNGDRFQLLHRTSVDDGPVLSLAHFQGRLLVGIGTTLRLYEM-----GKRQLLRK 1069

Query: 808  SECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEIL 867
            SE  +    +   VQT G+   +GD+M+SI ++ Y      +   A D +   +   E+L
Sbjct: 1070 SELRNFPTFVK-TVQTVGERAYIGDMMQSIQIVRYDVSANRLVLIANDASPRPIVCQELL 1128

Query: 868  D-DDIYLGAENNFNLFTVRKNSEGAT-------------DEER----GRLEVVGEYHLGE 909
            D + + +G  + F   +V +   GA              D  R     +LE++ +Y++GE
Sbjct: 1129 DWNTVAVG--DKFGNISVMRLPRGADTSAIDVTGQRALWDSSREDMIPKLELLCQYYVGE 1186

Query: 910  FVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYL-FLEKLQTNLRKVI 968
             V      SLV        G   ++I+ TV+G IG      +   + F  +L++ LR   
Sbjct: 1187 VVTSMTRSSLV-------AGGAESLIYVTVSGRIGAFVPFTNRNDVDFYSQLESELRGDA 1239

Query: 969  KGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKR 1028
                G + + +RS+      V     +DGDL ++F  L   + ++I++ ++ +V E+ K+
Sbjct: 1240 SRPTGRDPQSYRSYYAPMMHV-----VDGDLCDAFNSLGPEKQNKIAEKLDRTVGEIMKK 1294

Query: 1029 VEE 1031
            +E+
Sbjct: 1295 LED 1297


>gi|402879380|ref|XP_003903320.1| PREDICTED: LOW QUALITY PROTEIN: cleavage and polyadenylation
            specificity factor subunit 1 [Papio anubis]
          Length = 1389

 Score =  102 bits (254), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 114/460 (24%), Positives = 195/460 (42%), Gaps = 62/460 (13%)

Query: 618  VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQS 672
            V    PF++   P   L   ++GEL I  +           +R IPL      + +  +S
Sbjct: 918  VDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVES 977

Query: 673  RTFAICSLKNQSCAE-----------------------ESEMHFVRLLDDQTFEFI--ST 707
            + +A+ +  N  CA                        + E   ++L+   ++E I  + 
Sbjct: 978  KVYAVATSTNTPCARIPRMTGEEKEFETIERDERYIHPQQEAFSIQLISPVSWEAIPNAR 1037

Query: 708  YPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-----IVE 757
              L  +E+   + + S   +  V     Y   GT  +  EE    +GRIL+      + E
Sbjct: 1038 IELQEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVT-CRGRILIMDVIEVVPE 1096

Query: 758  DG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECG 811
             G      K +++ EKE KG V +L   NG L++AI QKI  + W LR     EL     
Sbjct: 1097 PGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKI--FLWSLR---ASELTGMAF 1151

Query: 812  HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI 871
                +    + +  +FI+  D+MKSISLL Y+ E   +   +RD     + +V+ + D+ 
Sbjct: 1152 IDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNA 1211

Query: 872  YLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFR----HGSLVMRLP 924
             LG   ++ + NL       E        RL    ++H+G  VN F      G+      
Sbjct: 1212 QLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRGATEGLSK 1271

Query: 925  DSDVGQIPTVI-FGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFN 983
             S V +   +  F T++G IG++  +  + Y  L  LQ  L  ++    GLN   +R  +
Sbjct: 1272 KSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLH 1331

Query: 984  NEKKTVD--AKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
             +++T+    +N LDG+L+  +L LS     E++K +  +
Sbjct: 1332 VDRRTLQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTT 1371



 Score = 40.0 bits (92), Expect = 6.7,   Method: Compositional matrix adjust.
 Identities = 41/172 (23%), Positives = 74/172 (43%), Gaps = 36/172 (20%)

Query: 288 EVLERYVNLGPIVDFCVVD---LERQGQG------QVVTCSGAYKDGSLRIVRNGIGINE 338
           EV +  +N+GP  +  + +   L  + Q       ++V CSG  K+G+L +++  I    
Sbjct: 411 EVCDSILNIGPCANAAMGEPAFLSEEFQNSPEPDLEIVVCSGHGKNGALSVLQKSIRPQV 470

Query: 339 QASVELQGIKGMWSL-----RSSTDDP-------------------FDTFLVVSFISETR 374
             + EL G   MW++     +   D+P                      FL++S    T 
Sbjct: 471 VTTFELPGCYDMWTVIAPVRKEEEDNPKGEGTEQEARSPEADDDGRRHGFLILSREDSTM 530

Query: 375 ILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQ-LVQVTSGSVRLVSSTSR 425
           IL      E+ E +  GF +Q  T+F  +   N+ +VQV+   +RL+   ++
Sbjct: 531 ILQTG--QEIMELDTSGFATQGPTVFAGNIGDNRYIVQVSPLGIRLLEGVNQ 580


>gi|426361048|ref|XP_004047737.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
            [Gorilla gorilla gorilla]
          Length = 1440

 Score =  102 bits (254), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 115/460 (25%), Positives = 195/460 (42%), Gaps = 62/460 (13%)

Query: 618  VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQS 672
            V    PF++   P   L   ++GEL I  +           +R IPL      + +  +S
Sbjct: 969  VDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVES 1028

Query: 673  RTFAICSLKNQSCAE-----------------------ESEMHFVRLLDDQTFEFI--ST 707
            + +A+ +  N  CA                        + E   ++L+   ++E I  + 
Sbjct: 1029 KVYAVATSTNTPCARIPRMTGEEKEFETIERDERYIHPQQEAFSIQLISPVSWEAIPNAR 1088

Query: 708  YPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-----IVE 757
              L  +E+   + + S   +  V     Y   GT  +  EE    +GRIL+      + E
Sbjct: 1089 IELQEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVT-CRGRILIMDVIEVVPE 1147

Query: 758  DG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECG 811
             G      K +++ EKE KG V +L   NG L++AI QKI L  W LR     EL     
Sbjct: 1148 PGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFL--WSLR---ASELTGMAF 1202

Query: 812  HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI 871
                +    + +  +FI+  D+MKSISLL Y+ E   +   +RD     + +V+ + D+ 
Sbjct: 1203 IDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNA 1262

Query: 872  YLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFR----HGSLVMRLP 924
             LG   ++ + NL       E        RL    ++H+G  VN F      G+      
Sbjct: 1263 QLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRGATEGLSK 1322

Query: 925  DSDVGQIPTVI-FGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFN 983
             S V +   +  F T++G IG++  +  + Y  L  LQ  L  ++    GLN   +R  +
Sbjct: 1323 KSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLH 1382

Query: 984  NEKKTVD--AKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
             +++T+    +N LDG+L+  +L LS     E++K +  +
Sbjct: 1383 VDRRTLQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTT 1422



 Score = 52.0 bits (123), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 104/564 (18%), Positives = 219/564 (38%), Gaps = 150/564 (26%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAM-----GDVSDRIG 60
            +G + ++   +  G  +D L ++ +  K  V+++D  + +L T ++      ++ D   
Sbjct: 74  FFGNVMSMASVQLAGAKRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPELRDGFV 133

Query: 61  RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPF-------DNKGQLKEA----------FN 103
           +     ++ + DPD     + +Y     V+PF       +++G + E            +
Sbjct: 134 QNVHTPRVRV-DPDGTCAAMLVYGTRLVVLPFRRESLAEEHEGLVGEGQRSSFLPSYIID 192

Query: 104 IR-LEE--LQVLDIKFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDFVEG-- 153
           +R L+E  L ++D++FL+G  +PT+++L++ N+      A    T  +     +  +   
Sbjct: 193 VRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKVH 252

Query: 154 P--WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAFKAIPI 198
           P  WS  +L       + VP P+ GV++    +++Y +                  A P+
Sbjct: 253 PVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTGTTAFPL 312

Query: 199 RPSITKAYGRVDADGS--------RYLLGDHAGLLHLL-VITHEKEKVTGLKIELLGETS 249
           R   T+   R+  D +        + ++    G +++L +IT     V     +    + 
Sbjct: 313 R---TQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASV 369

Query: 250 IASTISYLDNAVVYIGSSYGDSQLI----KLNLQP--------------------DAKGS 285
           + +++  ++   +++GS  G+S L+    KL   P                    DA   
Sbjct: 370 LTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASAVREAADKEEPPSKKKRVDATAG 429

Query: 286 YV------------------------------EVLERYVNLGPIVDFCVVD---LERQGQ 312
           +                               +V +  +N+GP  +  + +   L  + Q
Sbjct: 430 WSGEGRSRAGQERGQVTQGWSGAGAPLTVAVPQVCDSILNIGPCANAAMGEPAFLSEEFQ 489

Query: 313 G------QVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL-----RSSTDDP- 360
                  ++V CSG  K+G+L +++  I      + EL G   MW++     +   D+P 
Sbjct: 490 NSPEPDLEIVVCSGHGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEEEDNPK 549

Query: 361 ------------------FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCH 402
                                FL++S    T IL      E+ E +  GF +Q  T+F  
Sbjct: 550 GEGTEQEPSTPEADDDGRRHGFLILSREDSTMILQTG--QEIMELDTSGFATQGPTVFAG 607

Query: 403 DAIYNQ-LVQVTSGSVRLVSSTSR 425
           +   N+ +VQV+   +RL+   ++
Sbjct: 608 NIGDNRYIVQVSPLGIRLLEGVNQ 631


>gi|395512730|ref|XP_003760588.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
            [Sarcophilus harrisii]
          Length = 1449

 Score =  102 bits (254), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 117/480 (24%), Positives = 200/480 (41%), Gaps = 74/480 (15%)

Query: 618  VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQS 672
            +    PF++   P   L   ++GEL I  +           +R IPL      + +  +S
Sbjct: 978  IDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVES 1037

Query: 673  RTFAICSLKNQSC-------AEESEMHFV----------------RLLDDQTFEFI--ST 707
            + +A+ +  N  C        EE E   +                +L+   ++E I  + 
Sbjct: 1038 KVYAVATSTNALCTRIPRMTGEEKEFETIERDDRYIHPLQEAFSIQLISPVSWEAIPNAR 1097

Query: 708  YPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-----IVE 757
              L+ +E+   + + S   +  V     Y   GT  +  EE    +GRIL+      + E
Sbjct: 1098 IELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVT-CRGRILIMDVIEVVPE 1156

Query: 758  DG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECG 811
             G      K +++ EKE KG V +L   NG L++AI QKI L  W LR     EL     
Sbjct: 1157 PGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFL--WSLR---ASELTGMAF 1211

Query: 812  HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI 871
                +    + +  +FI+  D+MKSISLL Y+ E   +   +RD     + +V+ + D  
Sbjct: 1212 IDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDSA 1271

Query: 872  YLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV 928
             LG   ++ + NL       E        RL    ++H+G  VN F       R P    
Sbjct: 1272 QLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF------WRTPCRGA 1325

Query: 929  GQIPT-----------VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHE 977
             + PT             F T++G IG++  +  + Y  L  LQ  L  ++    GLN  
Sbjct: 1326 AEGPTKKSIVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPR 1385

Query: 978  QWRSFNNEKKTVD--AKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRL 1035
             +R  + +++ +    +N LDG+L+  +L LS     E++K +  + + +   + E+ R+
Sbjct: 1386 AFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTMERGELAKKIGTTPDIILDDLLEIDRV 1445



 Score = 44.7 bits (104), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 45/216 (20%), Positives = 99/216 (45%), Gaps = 35/216 (16%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAM-----GDVSDRIG 60
            +G + ++   +  G  +D L ++ +  K  V+++D  + +L T ++      ++ D   
Sbjct: 77  FFGNVMSMASVQLAGAKRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPELRDGFV 136

Query: 61  RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPF-------DNKGQLKEA----------FN 103
           +     ++ + DPD R   + +Y     V+PF       +++G + E            +
Sbjct: 137 QNVHTPRVRV-DPDGRCAVMLIYGTRLVVLPFRRESLAEEHEGLVGEGQKSSFLPSYIID 195

Query: 104 IR-LEE--LQVLDIKFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDFVEGP- 154
           +R L+E  L ++D++FL+G  +PT+++L++ N+      A    T  +     + ++   
Sbjct: 196 VRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNILQKVH 255

Query: 155 ---WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVY 187
              WS  NL       + VP P+ GV+I    +++Y
Sbjct: 256 PVIWSLTNLPFDCTQALAVPKPIGGVVIFAVNSLLY 291


>gi|355680843|gb|AER96659.1| cleavage and polyadenylation specific factor 1, 160kDa [Mustela
            putorius furo]
          Length = 1399

 Score =  102 bits (254), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 112/466 (24%), Positives = 193/466 (41%), Gaps = 74/466 (15%)

Query: 618  VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQS 672
            +    PF++   P   L   ++GEL I  +           +R IPL      + +  +S
Sbjct: 929  IDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVES 988

Query: 673  RTFAICSLKNQSCAE-----------------------ESEMHFVRLLDDQTFEFI--ST 707
            + +A+ +  N  C                         + E   ++L+   ++E I  + 
Sbjct: 989  KVYAVATSTNMPCTRIPRMTGEEKEFEAIERDDRYVHPQQEAFSIQLISPVSWEAIPNAR 1048

Query: 708  YPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-----IVE 757
              L+ +E+   + + S   +  V     Y   GT  +  EE    +GRIL+      + E
Sbjct: 1049 IELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVT-CRGRILIMDVIEVVPE 1107

Query: 758  DG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECG 811
             G      K +++ EKE KG V +L   NG L++AI QKI L  W LR     EL     
Sbjct: 1108 PGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFL--WSLR---ASELTGMAF 1162

Query: 812  HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI 871
                +    + +  +FI+  D+MKSISLL Y+ E   +   +RD     + +V+ + D+ 
Sbjct: 1163 IDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNA 1222

Query: 872  YLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV 928
             LG   ++ + NL       E        RL    ++H+G  VN F       R P    
Sbjct: 1223 QLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF------WRTPCRGA 1276

Query: 929  GQIPT-----------VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHE 977
             + P+             F T++G IG++  +  + Y  L  LQ  L  ++    GLN  
Sbjct: 1277 AEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPR 1336

Query: 978  QWRSFNNEKKTVD--AKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
             +R  + +++ +    +N LDG+L+  +L LS     E++K +  +
Sbjct: 1337 AFRLLHADRRALQNAVRNVLDGELLNRYLYLSTMERGELAKKIGTT 1382



 Score = 52.8 bits (125), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 110/570 (19%), Positives = 218/570 (38%), Gaps = 158/570 (27%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAM-----GDVSDRIG 60
            +G + ++   +  G  +D L ++ +  K  V+++D  + +L T ++      ++ D   
Sbjct: 30  FFGNVMSMASVQLAGAKRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPELRDGFV 89

Query: 61  RPTDNGQIGIIDPDCR----LIGLHLYDGLFKVIPF-------DNKGQLKEA-------- 101
           +     ++ + DPD R    L  + +Y     V+PF       +++G + E         
Sbjct: 90  QNVHAPRVRV-DPDGRCAAMLTAMLIYGSRLVVLPFRRESLAEEHEGLMGEGQRSSFLPS 148

Query: 102 --FNIR-LEE--LQVLDIKFLYGCAKPTIVVLYQDNKD------ARHVKTYEVALK---D 147
              ++R L+E  L ++D++FL+G  +PT+++L++ N+        R      VA+     
Sbjct: 149 YIIDVRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCCIVAISLNIT 208

Query: 148 KDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAFK 194
           +      WS  +L       + VP P+ GV++    +++Y +                  
Sbjct: 209 QKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNGLTTGTT 268

Query: 195 AIPIRPSITKAYGRVDADGS--------RYLLGDHAGLLHLL-VITHEKEKVTGLKIELL 245
           A P+R   T+   R+  D +        + ++    G +++L +IT     V     +  
Sbjct: 269 AFPLR---TQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKA 325

Query: 246 GETSIASTISYLDNAVVYIGSSYGDSQLIKL--NLQ--------------PDAKGSYV-- 287
             + + +++  ++   +++GS  G+S L+K    LQ              P +K   V  
Sbjct: 326 AASVLTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEAPAGAVRETDKDEPPSKKKRVES 385

Query: 288 ------------------------------------EVLERYVNLGPIVDFCVVD---LE 308
                                               EV +  +N+GP  +  + +   L 
Sbjct: 386 AVGWSGGKSAPQDEVDEIEVYGSEAQSGTQLATYSFEVCDSILNIGPCANAAMGEPAFLS 445

Query: 309 RQGQG------QVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS---------- 352
            + Q       ++V CSG  K+G+L +++  I      + EL G   MW+          
Sbjct: 446 EEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPARKEQE 505

Query: 353 --------------LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQT 398
                         L +  D     FL++S    T IL      E+ E +  GF +Q  T
Sbjct: 506 ETPKGDGAEQEPSALEADDDGRRHGFLILSREDSTMILQTG--QEIMELDTSGFATQGPT 563

Query: 399 LFC---HDAIYNQLVQVTSGSVRLVSSTSR 425
           +F     D  Y  +VQV+   +RL+   S+
Sbjct: 564 VFAGNIGDGRY--IVQVSPLGIRLLEGVSQ 591


>gi|355698297|gb|EHH28845.1| Cleavage and polyadenylation specificity factor 160 kDa subunit
            [Macaca mulatta]
          Length = 1436

 Score =  102 bits (254), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 115/460 (25%), Positives = 195/460 (42%), Gaps = 62/460 (13%)

Query: 618  VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQS 672
            V    PF++   P   L   ++GEL I  +           +R IPL      + +  +S
Sbjct: 965  VDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVES 1024

Query: 673  RTFAICSLKNQSCAE-----------------------ESEMHFVRLLDDQTFEFI--ST 707
            + +A+ +  N  CA                        + E   ++L+   ++E I  + 
Sbjct: 1025 KVYAVATSTNTPCARIPRMTGEEKEFETIERDERYIHPQQEAFSIQLISPVSWEAIPNAR 1084

Query: 708  YPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-----IVE 757
              L  +E+   + + S   +  V     Y   GT  +  EE    +GRIL+      + E
Sbjct: 1085 IELQEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVT-CRGRILIMDVIEVVPE 1143

Query: 758  DG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECG 811
             G      K +++ EKE KG V +L   NG L++AI QKI L  W LR     EL     
Sbjct: 1144 PGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFL--WSLR---ASELTGMAF 1198

Query: 812  HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI 871
                +    + +  +FI+  D+MKSISLL Y+ E   +   +RD     + +V+ + D+ 
Sbjct: 1199 IDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNA 1258

Query: 872  YLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFR----HGSLVMRLP 924
             LG   ++ + NL       E        RL    ++H+G  VN F      G+      
Sbjct: 1259 QLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRGATEGLSK 1318

Query: 925  DSDVGQIPTVI-FGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFN 983
             S V +   +  F T++G IG++  +  + Y  L  LQ  L  ++    GLN   +R  +
Sbjct: 1319 KSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLH 1378

Query: 984  NEKKTVD--AKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
             +++T+    +N LDG+L+  +L LS     E++K +  +
Sbjct: 1379 VDRRTLQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTT 1418



 Score = 46.2 bits (108), Expect = 0.098,   Method: Compositional matrix adjust.
 Identities = 61/326 (18%), Positives = 141/326 (43%), Gaps = 60/326 (18%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAM-----GDVSDRIG 60
            +G + ++   +  G  +D L ++ +  K  V+++D  + +L T ++      ++ D   
Sbjct: 74  FFGNVMSMASVQLAGAKRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPELRDGFV 133

Query: 61  RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPF-------DNKGQLKEA----------FN 103
           +     ++ + DPD R   + +Y     V+PF       +++G + E            +
Sbjct: 134 QNVHTPRVRV-DPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGEGQRSSFLPSYIID 192

Query: 104 IR-LEE--LQVLDIKFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDFVEG-- 153
           +R L+E  L ++D++FL+G  +PT+++L++ N+      A    T  +     +  +   
Sbjct: 193 VRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKVH 252

Query: 154 P--WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAFKAIPI 198
           P  WS  +L       + VP P+ GV++    +++Y +                  A P+
Sbjct: 253 PVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTGTTAFPL 312

Query: 199 RPSITKAYGRVDADGS--------RYLLGDHAGLLHLL-VITHEKEKVTGLKIELLGETS 249
           R   T+   R+  D +        + ++    G +++L +IT     V     +    + 
Sbjct: 313 R---TQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASV 369

Query: 250 IASTISYLDNAVVYIGSSYGDSQLIK 275
           + +++  ++   +++GS  G+S L+K
Sbjct: 370 LTTSMVTMEPGYLFLGSRLGNSLLLK 395


>gi|410042329|ref|XP_003954555.1| PREDICTED: LOW QUALITY PROTEIN: cleavage and polyadenylation
            specificity factor subunit 1 [Pan troglodytes]
          Length = 1296

 Score =  102 bits (253), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 115/460 (25%), Positives = 195/460 (42%), Gaps = 62/460 (13%)

Query: 618  VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQS 672
            V    PF++   P   L   ++GEL I  +           +R IPL      + +  +S
Sbjct: 825  VDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVES 884

Query: 673  RTFAICSLKNQSCAE-----------------------ESEMHFVRLLDDQTFEFI--ST 707
            + +A+ +  N  CA                        + E   ++L+   ++E I  + 
Sbjct: 885  KVYAVATSTNTPCARIPRMTGEEKEFETIERDERYIHPQQEAFSIQLISPVSWEAIPNAR 944

Query: 708  YPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-----IVE 757
              L  +E+   + + S   +  V     Y   GT  +  EE    +GRIL+      + E
Sbjct: 945  IELQEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVT-CRGRILIMDVIEVVPE 1003

Query: 758  DG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECG 811
             G      K +++ EKE KG V +L   NG L++AI QKI L  W LR     EL     
Sbjct: 1004 PGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFL--WSLR---ASELTGMAF 1058

Query: 812  HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI 871
                +    + +  +FI+  D+MKSISLL Y+ E   +   +RD     + +V+ + D+ 
Sbjct: 1059 IDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNA 1118

Query: 872  YLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFR----HGSLVMRLP 924
             LG   ++ + NL       E        RL    ++H+G  VN F      G+      
Sbjct: 1119 QLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRGATEGLSK 1178

Query: 925  DSDVGQIPTVI-FGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFN 983
             S V +   +  F T++G IG++  +  + Y  L  LQ  L  ++    GLN   +R  +
Sbjct: 1179 KSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLH 1238

Query: 984  NEKKTVD--AKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
             +++T+    +N LDG+L+  +L LS     E++K +  +
Sbjct: 1239 VDRRTLQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTT 1278



 Score = 46.2 bits (108), Expect = 0.087,   Method: Compositional matrix adjust.
 Identities = 61/326 (18%), Positives = 141/326 (43%), Gaps = 60/326 (18%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAM-----GDVSDRIG 60
            +G + ++   +  G  +D L ++ +  K  V+++D  + +L T ++      ++ D   
Sbjct: 74  FFGNVMSMASVQLAGAKRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPELRDGFV 133

Query: 61  RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPF-------DNKGQLKEA----------FN 103
           +     ++ + DPD R   + +Y     V+PF       +++G + E            +
Sbjct: 134 QNVHTPRVRV-DPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGEGQRSSFLPSYIID 192

Query: 104 IR-LEE--LQVLDIKFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDFVEG-- 153
           +R L+E  L ++D++FL+G  +PT+++L++ N+      A    T  +     +  +   
Sbjct: 193 VRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKVH 252

Query: 154 P--WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAFKAIPI 198
           P  WS  +L       + VP P+ GV++    +++Y +                  A P+
Sbjct: 253 PVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTGTTAFPL 312

Query: 199 RPSITKAYGRVDADGS--------RYLLGDHAGLLHLL-VITHEKEKVTGLKIELLGETS 249
           R   T+   R+  D +        + ++    G +++L +IT     V     +    + 
Sbjct: 313 R---TQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASV 369

Query: 250 IASTISYLDNAVVYIGSSYGDSQLIK 275
           + +++  ++   +++GS  G+S L+K
Sbjct: 370 LTTSMVTMEPGYLFLGSRLGNSLLLK 395


>gi|119602512|gb|EAW82106.1| cleavage and polyadenylation specific factor 1, 160kDa, isoform CRA_a
            [Homo sapiens]
 gi|119602513|gb|EAW82107.1| cleavage and polyadenylation specific factor 1, 160kDa, isoform CRA_a
            [Homo sapiens]
 gi|119602514|gb|EAW82108.1| cleavage and polyadenylation specific factor 1, 160kDa, isoform CRA_a
            [Homo sapiens]
          Length = 1365

 Score =  102 bits (253), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 114/460 (24%), Positives = 195/460 (42%), Gaps = 62/460 (13%)

Query: 618  VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQS 672
            V    PF++   P   L   ++GEL I  +           +R IPL      + +  +S
Sbjct: 894  VDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVES 953

Query: 673  RTFAICSLKNQSCAE-----------------------ESEMHFVRLLDDQTFEFI--ST 707
            + +A+ +  N  CA                        + E   ++L+   ++E I  + 
Sbjct: 954  KVYAVATSTNTPCARIPRMTGEEKEFETIERDERYIHPQQEAFSIQLISPVSWEAIPNAR 1013

Query: 708  YPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-----IVE 757
              L  +E+   + + S   +  V     Y   GT  +  EE    +GRIL+      + E
Sbjct: 1014 IELQEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVT-CRGRILIMDVIEVVPE 1072

Query: 758  DG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECG 811
             G      K +++ EKE KG V +L   NG L++AI QKI  + W LR     EL     
Sbjct: 1073 PGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKI--FLWSLR---ASELTGMAF 1127

Query: 812  HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI 871
                +    + +  +FI+  D+MKSISLL Y+ E   +   +RD     + +V+ + D+ 
Sbjct: 1128 IDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNA 1187

Query: 872  YLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFR----HGSLVMRLP 924
             LG   ++ + NL       E        RL    ++H+G  VN F      G+      
Sbjct: 1188 QLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRGATEGLSK 1247

Query: 925  DSDVGQIPTVI-FGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFN 983
             S V +   +  F T++G IG++  +  + Y  L  LQ  L  ++    GLN   +R  +
Sbjct: 1248 KSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLH 1307

Query: 984  NEKKTVD--AKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
             +++T+    +N LDG+L+  +L LS     E++K +  +
Sbjct: 1308 VDRRTLQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTT 1347



 Score = 53.5 bits (127), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 106/553 (19%), Positives = 214/553 (38%), Gaps = 153/553 (27%)

Query: 20  GEAQDFLFIATERYKFCVLQWDAESSELITRAM-----GDVSDRIGRPTDNGQIGIIDPD 74
           G  +D L ++ +  K  V+++D  + +L T ++      ++ D   +     ++ + DPD
Sbjct: 10  GAKRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPELRDGFVQNVHTPRVRV-DPD 68

Query: 75  CRLIGLHLYDGLFKVIPF-------DNKGQLKEA----------FNIR-LEE--LQVLDI 114
            R   + +Y     V+PF       +++G + E            ++R L+E  L ++D+
Sbjct: 69  GRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGEGQRSSFLPSYIIDVRALDEKLLNIIDL 128

Query: 115 KFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDFVEG--P--WSQNNLDNGAD 165
           +FL+G  +PT+++L++ N+      A    T  +     +  +   P  WS  +L     
Sbjct: 129 QFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKVHPVIWSLTSLPFDCT 188

Query: 166 LLIPVPPPLCGVLIIGEETIVYCS-------------ANAFKAIPIRPSITKAYGRVDAD 212
             + VP P+ GV++    +++Y +                  A P+R   T+   R+  D
Sbjct: 189 QALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTGTTAFPLR---TQEGVRITLD 245

Query: 213 GS--------RYLLGDHAGLLHLL-VITHEKEKVTGLKIELLGETSIASTISYLDNAVVY 263
            +        + ++    G +++L +IT     V     +    + + +++  ++   ++
Sbjct: 246 CAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLTTSMVTMEPGYLF 305

Query: 264 IGSSYGDSQLI----KLNLQP--------------------DAKGSY------------- 286
           +GS  G+S L+    KL   P                    DA   +             
Sbjct: 306 LGSRLGNSLLLKYTEKLQEPPASAVREAADKEEPPSKKKRVDATAGWSAAGKSVPQDEVD 365

Query: 287 -------------------VEVLERYVNLGPIVDFCVVD---LERQGQG------QVVTC 318
                               EV +  +N+GP  +  V +   L  + Q       ++V C
Sbjct: 366 EIEVYGSEAQSGTQLATYSFEVCDSILNIGPCANAAVGEPAFLSEEFQNSPEPDLEIVVC 425

Query: 319 SGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL-----RSSTDDP------------- 360
           SG  K+G+L +++  I      + EL G   MW++     +   D+P             
Sbjct: 426 SGHGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEEEDNPKGEGTEQEPSTTP 485

Query: 361 -------FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQ-LVQV 412
                     FL++S    T IL      E+ E +  GF +Q  T+F  +   N+ +VQV
Sbjct: 486 EADDDGRRHGFLILSREDSTMILQTG--QEIMELDTSGFATQGPTVFAGNIGDNRYIVQV 543

Query: 413 TSGSVRLVSSTSR 425
           +   +RL+   ++
Sbjct: 544 SPLGIRLLEGVNQ 556


>gi|16751835|ref|NP_444423.1| cleavage and polyadenylation specificity factor subunit 1 isoform 2
            [Mus musculus]
 gi|17374611|sp|Q9EPU4.1|CPSF1_MOUSE RecName: Full=Cleavage and polyadenylation specificity factor subunit
            1; AltName: Full=Cleavage and polyadenylation specificity
            factor 160 kDa subunit; Short=CPSF 160 kDa subunit
 gi|11762096|gb|AAG40326.1|AF322193_1 cleavage and polyadenylation specificity factor 1 [Mus musculus]
 gi|38614159|gb|AAH56388.1| Cleavage and polyadenylation specific factor 1 [Mus musculus]
          Length = 1441

 Score =  102 bits (253), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 112/466 (24%), Positives = 193/466 (41%), Gaps = 74/466 (15%)

Query: 618  VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQS 672
            +    PF++   P   L   ++GEL I  +           +R IPL      + +  +S
Sbjct: 970  IDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVES 1029

Query: 673  RTFAICSLKNQSCAE-----------------------ESEMHFVRLLDDQTFEFI--ST 707
            + +A+ +  N  C                         + E   ++L+   ++E I  + 
Sbjct: 1030 KVYAVATSTNTPCTRIPRMTGEEKEFEAIERDDRYIHPQQEAFSIQLISPVSWEAIPNAR 1089

Query: 708  YPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-----IVE 757
              L+ +E+   + + S   +  V     Y   GT  +  EE    +GRIL+      + E
Sbjct: 1090 IELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVT-CRGRILIMDVIEVVPE 1148

Query: 758  DG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECG 811
             G      K +++ EKE KG V +L   NG L++AI QKI L  W LR     EL     
Sbjct: 1149 PGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFL--WSLR---ASELTGMAF 1203

Query: 812  HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI 871
                +    + +  +FI+  D+MKSISLL Y+ E   +   +RD     + +V+ + D+ 
Sbjct: 1204 IDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNA 1263

Query: 872  YLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV 928
             LG   ++ + NL       E        RL    ++H+G  VN F       R P    
Sbjct: 1264 QLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF------WRTPCRGA 1317

Query: 929  GQIPT-----------VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHE 977
             + P+             F T++G IG++  +  + Y  L  LQ  L  ++    GLN  
Sbjct: 1318 AEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPR 1377

Query: 978  QWRSFNNEKKTVD--AKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
             +R  + +++ +    +N LDG+L+  +L LS     E++K +  +
Sbjct: 1378 AFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTT 1423



 Score = 59.3 bits (142), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 108/565 (19%), Positives = 220/565 (38%), Gaps = 151/565 (26%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAM-----GDVSDRIG 60
            +G + ++   +  G  +D L ++ +  K  V+++D  + +L T ++      ++ D   
Sbjct: 74  FFGNVMSMASVQLAGAKRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPELRDGFV 133

Query: 61  RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPF-------DNKGQLKEA----------FN 103
           +     ++ + DPD R   + +Y     V+PF       +++G + E            +
Sbjct: 134 QNVHTPRVRV-DPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLMGEGQRSSFLPSYIID 192

Query: 104 IR-LEE--LQVLDIKFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDFVEG-- 153
           +R L+E  L ++D++FL+G  +PT+++L++ N+      A    T  +     +  +   
Sbjct: 193 VRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKVH 252

Query: 154 P--WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAFKAIPI 198
           P  WS  +L       + VP P+ GV+I    +++Y +                  A P+
Sbjct: 253 PVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFPL 312

Query: 199 RPSITKAYGRVDADGS--------RYLLGDHAGLLHLL-VITHEKEKVTGLKIELLGETS 249
           R   T+   R+  D +        + ++    G +++L +IT     V     +    + 
Sbjct: 313 R---TQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASV 369

Query: 250 IASTISYLDNAVVYIGSSYGDSQLIKL--NLQ---------------PDAKGSYVE---- 288
           + +++  ++   +++GS  G+S L+K    LQ               P +K   VE    
Sbjct: 370 LTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASSVREAADKEEPPSKKKRVEPAVG 429

Query: 289 ----------------------------------VLERYVNLGPIVDFCVVD---LERQG 311
                                             V +  +N+GP  +  V +   L  + 
Sbjct: 430 WTGGKTVPQDEVDEIEVYGSEAQSGTQLATYSFEVCDSMLNIGPCANAAVGEPAFLSEEF 489

Query: 312 QG------QVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL------------ 353
           Q       ++V CSG  K+G+L +++  I      + EL G   MW++            
Sbjct: 490 QNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEEEETP 549

Query: 354 ------------RSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFC 401
                       ++  D     FL++S    T IL      E+ E +  GF +Q  T+F 
Sbjct: 550 KAESTEQEPSAPKAEEDGRRHGFLILSREDSTMILQTG--QEIMELDTSGFATQGPTVFA 607

Query: 402 HDAIYNQ-LVQVTSGSVRLVSSTSR 425
            +   N+ +VQV+   +RL+   ++
Sbjct: 608 GNIGDNRYIVQVSPLGIRLLEGVNQ 632


>gi|348512553|ref|XP_003443807.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Oreochromis niloticus]
          Length = 1456

 Score =  102 bits (253), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 115/483 (23%), Positives = 208/483 (43%), Gaps = 80/483 (16%)

Query: 618  VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQS 672
            +    PF++   P   L   K+GEL I  +           +R IPL      + +  +S
Sbjct: 985  IESFSPFHNINCPKGFLYFNKQGELRISVLPTYLSYDAPWPVRKIPLRCTVHYVSYHVES 1044

Query: 673  RTFAICSLKNQSCAE-----------------------ESEMHFVRLLDDQTFEFISTYP 709
            + +A+C+   + C                         + E   ++L+   ++E I    
Sbjct: 1045 KVYAVCTSVKEPCTRIPRMTGEEKEYEVIERDERYIHPQQEKFSIQLISPVSWEAIPNTR 1104

Query: 710  LD--TFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-----IVE 757
            +D   +E+   + + +      V     Y   GT  +  EE    +GRIL+      + E
Sbjct: 1105 IDLEEWEHVTCMKTVALRSQETVSGLKGYIAAGTCLMQGEEVT-CRGRILILDVIEVVPE 1163

Query: 758  DG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDD---GTRELQS 808
             G      K +++ EKE KG V +L   NG L++AI QKI L  W+L+D+   G   + +
Sbjct: 1164 PGQPLTKNKFKVLYEKEQKGPVTALCHCNGYLVSAIGQKIFL--WVLKDNDLTGMAFIDT 1221

Query: 809  ECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILD 868
            +   H         +  +FI+  DLMKSISLL Y+ E   +   +RD     + ++E + 
Sbjct: 1222 QLYIHQMF------SIKNFILAADLMKSISLLRYQEESKTLSLVSRDAKPLEVYSIEFMV 1275

Query: 869  DDIYLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLP- 924
            D+  LG   ++ + NL+      E        RL    +++ G  +N F       R+P 
Sbjct: 1276 DNNQLGFLVSDRDKNLYVYMYLPEAKESFGGMRLLRRADFNAGANINTF------WRMPC 1329

Query: 925  ----DSDVGQIPT------VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGL 974
                D+   +  T        F T++G IG++  +  + Y  L  LQ  L  ++    GL
Sbjct: 1330 RGALDASSKKALTWDNKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGL 1389

Query: 975  NHEQWRSFNNEKKTVD--AKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            N + +R  +++++++    KN LDG+L+  +L LS     E++K +  + + +   + E+
Sbjct: 1390 NPKAFRMLHSDRRSLQNPVKNILDGELLNKYLYLSMMERSELAKKIGTTQDIILDDLLEI 1449

Query: 1033 TRL 1035
             R+
Sbjct: 1450 DRV 1452



 Score = 49.7 bits (117), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 63/323 (19%), Positives = 136/323 (42%), Gaps = 54/323 (16%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRP--T 63
           ++G I ++   +  G ++D L ++ +  K  V+++D  + +L T ++    +   R    
Sbjct: 74  LFGNIMSMASVQLVGASRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPELRDGFV 133

Query: 64  DNGQIGII--DPDCRLIGLHLYDGLFKVIPFDN---------------KGQLKEAFNIRL 106
            N  I ++  DP+ R   + +Y     V+PF                 K     ++ I +
Sbjct: 134 QNVHIPVVRVDPENRCAVMLVYGTKLVVLPFRKDTLTDEQESGVGEGPKSSFLPSYIIDV 193

Query: 107 EEL-----QVLDIKFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDFVEGP-- 154
            EL      ++D+KFL+G  +PT+++L++ N+      A    T  +     + ++    
Sbjct: 194 RELDEKLLNIIDMKFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNIMQKVHP 253

Query: 155 --WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAFKAIPIR 199
             WS +NL      ++ VP P+ GV++    +++Y +              N   A P+R
Sbjct: 254 VIWSLSNLPFDCTQVMAVPKPIGGVVVFAVNSLLYLNQSVPPYGVSLNSQTNGTTAFPLR 313

Query: 200 PSITKAYGRVDADGSRYLLGD------HAGLLHLL-VITHEKEKVTGLKIELLGETSIAS 252
               +    +D   S ++  D        G +++L +IT     V     +    + + +
Sbjct: 314 VQ-DEVKLTLDCCQSDFIAYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLTT 372

Query: 253 TISYLDNAVVYIGSSYGDSQLIK 275
            +  ++   +++GS  G+S L+K
Sbjct: 373 CMVTMEPGYLFLGSRLGNSLLLK 395


>gi|56676371|ref|NP_037423.2| cleavage and polyadenylation specificity factor subunit 1 [Homo
            sapiens]
 gi|23503048|sp|Q10570.2|CPSF1_HUMAN RecName: Full=Cleavage and polyadenylation specificity factor subunit
            1; AltName: Full=Cleavage and polyadenylation specificity
            factor 160 kDa subunit; Short=CPSF 160 kDa subunit
 gi|16878041|gb|AAH17232.1| Cleavage and polyadenylation specific factor 1, 160kDa [Homo sapiens]
 gi|119602516|gb|EAW82110.1| cleavage and polyadenylation specific factor 1, 160kDa, isoform CRA_c
            [Homo sapiens]
 gi|123993607|gb|ABM84405.1| cleavage and polyadenylation specific factor 1, 160kDa [synthetic
            construct]
 gi|123999626|gb|ABM87355.1| cleavage and polyadenylation specific factor 1, 160kDa [synthetic
            construct]
 gi|307684758|dbj|BAJ20419.1| cleavage and polyadenylation specific factor 1, 160kDa [synthetic
            construct]
          Length = 1443

 Score =  102 bits (253), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 115/460 (25%), Positives = 195/460 (42%), Gaps = 62/460 (13%)

Query: 618  VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQS 672
            V    PF++   P   L   ++GEL I  +           +R IPL      + +  +S
Sbjct: 972  VDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVES 1031

Query: 673  RTFAICSLKNQSCAE-----------------------ESEMHFVRLLDDQTFEFI--ST 707
            + +A+ +  N  CA                        + E   ++L+   ++E I  + 
Sbjct: 1032 KVYAVATSTNTPCARIPRMTGEEKEFETIERDERYIHPQQEAFSIQLISPVSWEAIPNAR 1091

Query: 708  YPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-----IVE 757
              L  +E+   + + S   +  V     Y   GT  +  EE    +GRIL+      + E
Sbjct: 1092 IELQEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVT-CRGRILIMDVIEVVPE 1150

Query: 758  DG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECG 811
             G      K +++ EKE KG V +L   NG L++AI QKI L  W LR     EL     
Sbjct: 1151 PGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFL--WSLR---ASELTGMAF 1205

Query: 812  HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI 871
                +    + +  +FI+  D+MKSISLL Y+ E   +   +RD     + +V+ + D+ 
Sbjct: 1206 IDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNA 1265

Query: 872  YLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFR----HGSLVMRLP 924
             LG   ++ + NL       E        RL    ++H+G  VN F      G+      
Sbjct: 1266 QLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRGATEGLSK 1325

Query: 925  DSDVGQIPTVI-FGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFN 983
             S V +   +  F T++G IG++  +  + Y  L  LQ  L  ++    GLN   +R  +
Sbjct: 1326 KSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLH 1385

Query: 984  NEKKTVD--AKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
             +++T+    +N LDG+L+  +L LS     E++K +  +
Sbjct: 1386 VDRRTLQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTT 1425



 Score = 55.8 bits (133), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 107/567 (18%), Positives = 220/567 (38%), Gaps = 153/567 (26%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAM-----GDVSDRIG 60
            +G + ++   +  G  +D L ++ +  K  V+++D  + +L T ++      ++ D   
Sbjct: 74  FFGNVMSMASVQLAGAKRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPELRDGFV 133

Query: 61  RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPF-------DNKGQLKEA----------FN 103
           +     ++ + DPD R   + +Y     V+PF       +++G + E            +
Sbjct: 134 QNVHTPRVRV-DPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGEGQRSSFLPSYIID 192

Query: 104 IR-LEE--LQVLDIKFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDFVEG-- 153
           +R L+E  L ++D++FL+G  +PT+++L++ N+      A    T  +     +  +   
Sbjct: 193 VRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKVH 252

Query: 154 P--WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAFKAIPI 198
           P  WS  +L       + VP P+ GV++    +++Y +                  A P+
Sbjct: 253 PVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTGTTAFPL 312

Query: 199 RPSITKAYGRVDADGS--------RYLLGDHAGLLHLL-VITHEKEKVTGLKIELLGETS 249
           R   T+   R+  D +        + ++    G +++L +IT     V     +    + 
Sbjct: 313 R---TQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASV 369

Query: 250 IASTISYLDNAVVYIGSSYGDSQLI----KLNLQP--------------------DAKGS 285
           + +++  ++   +++GS  G+S L+    KL   P                    DA   
Sbjct: 370 LTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASAVREAADKEEPPSKKKRVDATAG 429

Query: 286 Y--------------------------------VEVLERYVNLGPIVDFCVVD---LERQ 310
           +                                 EV +  +N+GP  +  V +   L  +
Sbjct: 430 WSAAGKSVPQDEVDEIEVYGSEAQSGTQLATYSFEVCDSILNIGPCANAAVGEPAFLSEE 489

Query: 311 GQG------QVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL-----RSSTDD 359
            Q       ++V CSG  K+G+L +++  I      + EL G   MW++     +   D+
Sbjct: 490 FQNSPEPDLEIVVCSGHGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEEEDN 549

Query: 360 P--------------------FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTL 399
           P                       FL++S    T IL      E+ E +  GF +Q  T+
Sbjct: 550 PKGEGTEQEPSTTPEADDDGRRHGFLILSREDSTMILQTG--QEIMELDTSGFATQGPTV 607

Query: 400 FCHDAIYNQ-LVQVTSGSVRLVSSTSR 425
           F  +   N+ +VQV+   +RL+   ++
Sbjct: 608 FAGNIGDNRYIVQVSPLGIRLLEGVNQ 634


>gi|410987992|ref|XP_004000273.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
            [Felis catus]
          Length = 1432

 Score =  102 bits (253), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 113/480 (23%), Positives = 200/480 (41%), Gaps = 74/480 (15%)

Query: 618  VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQS 672
            +    PF++   P   L   ++GEL I  +           +R IPL      + +  +S
Sbjct: 961  IDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVES 1020

Query: 673  RTFAICSLKNQSCAE-----------------------ESEMHFVRLLDDQTFEFI--ST 707
            + +A+ +  N  C                         + E   ++L+   ++E I  + 
Sbjct: 1021 KVYAVATSTNMPCTRIPRMTGEEKEFETIERDDRYIHPQQEAFSIQLISPVSWEAIPNAR 1080

Query: 708  YPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-----IVE 757
              L+ +E+   + + S   +  V     Y   GT  +  EE    +GRIL+      + E
Sbjct: 1081 IELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVT-CRGRILIMDVIEVVPE 1139

Query: 758  DG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECG 811
             G      K +++ EKE KG V +L   NG L++AI QKI  + W LR     EL     
Sbjct: 1140 PGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKI--FLWSLR---ASELTGMAF 1194

Query: 812  HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI 871
                +    + +  +FI+  D+MKSISLL Y+ E   +   +RD     + +V+ + D+ 
Sbjct: 1195 IDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNA 1254

Query: 872  YLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV 928
             LG   ++ + NL       E        RL    ++H+G  VN F       R P    
Sbjct: 1255 QLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF------WRTPCRGA 1308

Query: 929  GQIPT-----------VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHE 977
             + P+             F T++G IG++  +  + Y  L  LQ  L  ++    GLN  
Sbjct: 1309 AEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPR 1368

Query: 978  QWRSFNNEKKTVD--AKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRL 1035
             +R  + +++ +    +N LDG+L+  +L LS     E++K +  + + + + + E  R+
Sbjct: 1369 AFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTMERGELAKKIGTTPDIILEDLLETDRV 1428



 Score = 46.6 bits (109), Expect = 0.068,   Method: Compositional matrix adjust.
 Identities = 61/326 (18%), Positives = 141/326 (43%), Gaps = 60/326 (18%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAM-----GDVSDRIG 60
            +G + ++   +  G  +D L ++ +  K  V+++D  + +L T ++      ++ D   
Sbjct: 65  FFGNVMSMASVQLAGAKRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPELRDGFV 124

Query: 61  RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPF-------DNKGQLKEA----------FN 103
           +     ++ + DPD R   + +Y     V+PF       +++G + E            +
Sbjct: 125 QNVHTPRVRV-DPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLMGEGQRSSFLPSYIID 183

Query: 104 IR-LEE--LQVLDIKFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDFVEG-- 153
           +R L+E  L ++D++FL+G  +PT+++L++ N+      A    T  +     +  +   
Sbjct: 184 VRGLDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKVH 243

Query: 154 P--WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAFKAIPI 198
           P  WS  +L       + VP P+ GV++    +++Y +                  A P+
Sbjct: 244 PVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNGLTTGTTAFPL 303

Query: 199 RPSITKAYGRVDADGS--------RYLLGDHAGLLHLL-VITHEKEKVTGLKIELLGETS 249
           R   T+   R+  D +        + ++    G +++L +IT     V     +    + 
Sbjct: 304 R---TQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASV 360

Query: 250 IASTISYLDNAVVYIGSSYGDSQLIK 275
           + +++  ++   +++GS  G+S L+K
Sbjct: 361 LTTSMVTMEPGYLFLGSRLGNSLLLK 386



 Score = 40.0 bits (92), Expect = 7.2,   Method: Compositional matrix adjust.
 Identities = 41/172 (23%), Positives = 72/172 (41%), Gaps = 36/172 (20%)

Query: 288 EVLERYVNLGPIVDFCVVD---LERQGQG------QVVTCSGAYKDGSLRIVRNGIGINE 338
           EV +  +N+GP  +  + +   L  + Q       ++V CSG  K+G+L +++  I    
Sbjct: 454 EVCDSILNIGPCANAAMGEPAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQV 513

Query: 339 QASVELQGIKGMW------------------------SLRSSTDDPFDTFLVVSFISETR 374
             + EL G   MW                        +L +  D     FL++S    T 
Sbjct: 514 VTTFELPGCYDMWTVIAPVRKEQEETSKGEGAEQEPSTLEAEDDGRRHGFLILSREDSTM 573

Query: 375 ILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQ-LVQVTSGSVRLVSSTSR 425
           IL      E+ E +  GF +Q  T+F  +   N+ +VQV+   +RL+   ++
Sbjct: 574 ILQTG--QEIMELDTSGFATQGPTVFAGNIGDNRYIVQVSPLGIRLLEGVNQ 623


>gi|392306997|ref|NP_001254722.1| cleavage and polyadenylation specificity factor subunit 1 [Macaca
            mulatta]
 gi|380812168|gb|AFE77959.1| cleavage and polyadenylation specificity factor subunit 1 [Macaca
            mulatta]
 gi|383417835|gb|AFH32131.1| cleavage and polyadenylation specificity factor subunit 1 [Macaca
            mulatta]
          Length = 1442

 Score =  102 bits (253), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 115/460 (25%), Positives = 195/460 (42%), Gaps = 62/460 (13%)

Query: 618  VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQS 672
            V    PF++   P   L   ++GEL I  +           +R IPL      + +  +S
Sbjct: 971  VDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVES 1030

Query: 673  RTFAICSLKNQSCAE-----------------------ESEMHFVRLLDDQTFEFI--ST 707
            + +A+ +  N  CA                        + E   ++L+   ++E I  + 
Sbjct: 1031 KVYAVATSTNTPCARIPRMTGEEKEFETIERDERYIHPQQEAFSIQLISPVSWEAIPNAR 1090

Query: 708  YPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-----IVE 757
              L  +E+   + + S   +  V     Y   GT  +  EE    +GRIL+      + E
Sbjct: 1091 IELQEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVT-CRGRILIMDVIEVVPE 1149

Query: 758  DG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECG 811
             G      K +++ EKE KG V +L   NG L++AI QKI L  W LR     EL     
Sbjct: 1150 PGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFL--WSLR---ASELTGMAF 1204

Query: 812  HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI 871
                +    + +  +FI+  D+MKSISLL Y+ E   +   +RD     + +V+ + D+ 
Sbjct: 1205 IDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNA 1264

Query: 872  YLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFR----HGSLVMRLP 924
             LG   ++ + NL       E        RL    ++H+G  VN F      G+      
Sbjct: 1265 QLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRGATEGLSK 1324

Query: 925  DSDVGQIPTVI-FGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFN 983
             S V +   +  F T++G IG++  +  + Y  L  LQ  L  ++    GLN   +R  +
Sbjct: 1325 KSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLH 1384

Query: 984  NEKKTVD--AKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
             +++T+    +N LDG+L+  +L LS     E++K +  +
Sbjct: 1385 VDRRTLQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTT 1424



 Score = 56.6 bits (135), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 107/566 (18%), Positives = 221/566 (39%), Gaps = 152/566 (26%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAM-----GDVSDRIG 60
            +G + ++   +  G  +D L ++ +  K  V+++D  + +L T ++      ++ D   
Sbjct: 74  FFGNVMSMASVQLAGAKRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPELRDGFV 133

Query: 61  RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPF-------DNKGQLKEA----------FN 103
           +     ++ + DPD R   + +Y     V+PF       +++G + E            +
Sbjct: 134 QNVHTPRVRV-DPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGEGQRSSFLPSYIID 192

Query: 104 IR-LEE--LQVLDIKFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDFVEG-- 153
           +R L+E  L ++D++FL+G  +PT+++L++ N+      A    T  +     +  +   
Sbjct: 193 VRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKVH 252

Query: 154 P--WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAFKAIPI 198
           P  WS  +L       + VP P+ GV++    +++Y +                  A P+
Sbjct: 253 PVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTGTTAFPL 312

Query: 199 RPSITKAYGRVDADGS--------RYLLGDHAGLLHLL-VITHEKEKVTGLKIELLGETS 249
           R   T+   R+  D +        + ++    G +++L +IT     V     +    + 
Sbjct: 313 R---TQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASV 369

Query: 250 IASTISYLDNAVVYIGSSYGDSQLI----KLNLQP--------------------DAKGS 285
           + +++  ++   +++GS  G+S L+    KL   P                    DA  S
Sbjct: 370 LTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASAVREAADKEEPPSKKKRVDATAS 429

Query: 286 Y--------------------------------VEVLERYVNLGPIVDFCVVD---LERQ 310
           +                                 EV +  +N+GP  +  + +   L  +
Sbjct: 430 WSAGGKSVPQDEVDEIEVYGSEAQSGTQLATYSFEVCDSILNIGPCANAAMGEPAFLSEE 489

Query: 311 GQG------QVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL-----RSSTDD 359
            Q       ++V CSG  K+G+L +++  I      + EL G   MW++     +   D+
Sbjct: 490 FQNSPEPDLEIVVCSGHGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEEEDN 549

Query: 360 P-------------------FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLF 400
           P                      FL++S    T IL      E+ E +  GF +Q  T+F
Sbjct: 550 PKGEGTEQEARSPEADDDGRRHGFLILSREDSTMILQTG--QEIMELDTSGFATQGPTVF 607

Query: 401 CHDAIYNQ-LVQVTSGSVRLVSSTSR 425
             +   N+ +VQV+   +RL+   ++
Sbjct: 608 AGNIGDNRYIVQVSPLGIRLLEGVNQ 633


>gi|354491126|ref|XP_003507707.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
            isoform 3 [Cricetulus griseus]
          Length = 1449

 Score =  102 bits (253), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 112/466 (24%), Positives = 193/466 (41%), Gaps = 74/466 (15%)

Query: 618  VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQS 672
            +    PF++   P   L   ++GEL I  +           +R IPL      + +  +S
Sbjct: 970  IDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVES 1029

Query: 673  RTFAICSLKNQSCAE-----------------------ESEMHFVRLLDDQTFEFI--ST 707
            + +A+ +  N  C                         + E   ++L+   ++E I  + 
Sbjct: 1030 KVYAVATSTNTPCTRIPRMTGEEKEFEAIERDDRYIHPQQEAFSIQLISPVSWEAIPNAR 1089

Query: 708  YPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-----IVE 757
              L+ +E+   + + S   +  V     Y   GT  +  EE    +GRIL+      + E
Sbjct: 1090 IELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVT-CRGRILIMDVIEVVPE 1148

Query: 758  DG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECG 811
             G      K +++ EKE KG V +L   NG L++AI QKI L  W LR     EL     
Sbjct: 1149 PGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFL--WSLR---ASELTGMAF 1203

Query: 812  HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI 871
                +    + +  +FI+  D+MKSISLL Y+ E   +   +RD     + +V+ + D+ 
Sbjct: 1204 IDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNA 1263

Query: 872  YLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV 928
             LG   ++ + NL       E        RL    ++H+G  VN F       R P    
Sbjct: 1264 QLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF------WRTPCRGA 1317

Query: 929  GQIPT-----------VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHE 977
             + P+             F T++G IG++  +  + Y  L  LQ  L  ++    GLN  
Sbjct: 1318 AEGPSKKSVMWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPR 1377

Query: 978  QWRSFNNEKKTVD--AKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
             +R  + +++ +    +N LDG+L+  +L LS     E++K +  +
Sbjct: 1378 AFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTT 1423



 Score = 58.2 bits (139), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 107/565 (18%), Positives = 220/565 (38%), Gaps = 151/565 (26%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAM-----GDVSDRIG 60
            +G + ++   +  G  +D L ++ +  K  V+++D  + +L T ++      ++ D   
Sbjct: 74  FFGNVMSMASVQLAGAKRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEESELRDGFV 133

Query: 61  RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPF-------DNKGQLKEA----------FN 103
           +     ++ + DPD R   + +Y     V+PF       +++G + E            +
Sbjct: 134 QNVHTPRVRV-DPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLMGEGQRSSFLPSYIID 192

Query: 104 IR-LEE--LQVLDIKFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDFVEG-- 153
           +R L+E  L ++D++FL+G  +PT+++L++ N+      A    T  +     +  +   
Sbjct: 193 VRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKVH 252

Query: 154 P--WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAFKAIPI 198
           P  WS  +L       + VP P+ GV+I    +++Y +                  A P+
Sbjct: 253 PVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFPL 312

Query: 199 RPSITKAYGRVDADGS--------RYLLGDHAGLLHLL-VITHEKEKVTGLKIELLGETS 249
           R   T+   R+  D +        + ++    G +++L +IT     V     +    + 
Sbjct: 313 R---TQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASV 369

Query: 250 IASTISYLDNAVVYIGSSYGDSQLIKLN--LQ---------------PDAKGSYVE---- 288
           + +++  ++   +++GS  G+S L+K    LQ               P +K   V+    
Sbjct: 370 LTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASSVREAADKEEPPSKKKRVDPTAG 429

Query: 289 ----------------------------------VLERYVNLGPIVDFCVVD---LERQG 311
                                             V +  +N+GP  +  V +   L  + 
Sbjct: 430 WTGGKTVPQDEVDEIEVYGSEAQSGTQLATYSFEVCDSMLNIGPCANAAVGEPAFLSEEF 489

Query: 312 QG------QVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL------------ 353
           Q       ++V CSG  K+G+L +++  I      + EL G   MW++            
Sbjct: 490 QNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEEEEAP 549

Query: 354 ------------RSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFC 401
                       ++  D     FL++S    T IL      E+ E +  GF +Q  T+F 
Sbjct: 550 RAESTEQESTTPKAEEDGRRHGFLILSREDSTMILQTG--QEIMELDTSGFATQGPTVFA 607

Query: 402 HDAIYNQ-LVQVTSGSVRLVSSTSR 425
            +   N+ +VQV+   +RL+   ++
Sbjct: 608 GNIGDNRYIVQVSPLGIRLLEGVNQ 632


>gi|397497327|ref|XP_003819464.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
            [Pan paniscus]
 gi|410336497|gb|JAA37195.1| cleavage and polyadenylation specific factor 1, 160kDa [Pan
            troglodytes]
          Length = 1442

 Score =  101 bits (252), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 115/460 (25%), Positives = 195/460 (42%), Gaps = 62/460 (13%)

Query: 618  VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQS 672
            V    PF++   P   L   ++GEL I  +           +R IPL      + +  +S
Sbjct: 971  VDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVES 1030

Query: 673  RTFAICSLKNQSCAE-----------------------ESEMHFVRLLDDQTFEFI--ST 707
            + +A+ +  N  CA                        + E   ++L+   ++E I  + 
Sbjct: 1031 KVYAVATSTNTPCARIPRMTGEEKEFETIERDERYIHPQQEAFSIQLISPVSWEAIPNAR 1090

Query: 708  YPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-----IVE 757
              L  +E+   + + S   +  V     Y   GT  +  EE    +GRIL+      + E
Sbjct: 1091 IELQEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVT-CRGRILIMDVIEVVPE 1149

Query: 758  DG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECG 811
             G      K +++ EKE KG V +L   NG L++AI QKI L  W LR     EL     
Sbjct: 1150 PGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFL--WSLR---ASELTGMAF 1204

Query: 812  HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI 871
                +    + +  +FI+  D+MKSISLL Y+ E   +   +RD     + +V+ + D+ 
Sbjct: 1205 IDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNA 1264

Query: 872  YLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFR----HGSLVMRLP 924
             LG   ++ + NL       E        RL    ++H+G  VN F      G+      
Sbjct: 1265 QLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRGATEGLSK 1324

Query: 925  DSDVGQIPTVI-FGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFN 983
             S V +   +  F T++G IG++  +  + Y  L  LQ  L  ++    GLN   +R  +
Sbjct: 1325 KSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLH 1384

Query: 984  NEKKTVD--AKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
             +++T+    +N LDG+L+  +L LS     E++K +  +
Sbjct: 1385 VDRRTLQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTT 1424



 Score = 54.7 bits (130), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 106/566 (18%), Positives = 220/566 (38%), Gaps = 152/566 (26%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAM-----GDVSDRIG 60
            +G + ++   +  G  +D L ++ +  K  V+++D  + +L T ++      ++ D   
Sbjct: 74  FFGNVMSMASVQLAGAKRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPELRDGFV 133

Query: 61  RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPF-------DNKGQLKEA----------FN 103
           +     ++ + DPD R   + +Y     V+PF       +++G + E            +
Sbjct: 134 QNVHTPRVRV-DPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGEGQRSSFLPSYIID 192

Query: 104 IR-LEE--LQVLDIKFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDFVEG-- 153
           +R L+E  L ++D++FL+G  +PT+++L++ N+      A    T  +     +  +   
Sbjct: 193 VRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKVH 252

Query: 154 P--WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAFKAIPI 198
           P  WS  +L       + VP P+ GV++    +++Y +                  A P+
Sbjct: 253 PVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTGTTAFPL 312

Query: 199 RPSITKAYGRVDADGS--------RYLLGDHAGLLHLL-VITHEKEKVTGLKIELLGETS 249
           R   T+   R+  D +        + ++    G +++L +IT     V     +    + 
Sbjct: 313 R---TQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASV 369

Query: 250 IASTISYLDNAVVYIGSSYGDSQLI----KLNLQP--------------------DAKGS 285
           + +++  ++   +++GS  G+S L+    KL   P                    DA   
Sbjct: 370 LTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASAVREAADKEEPPSKKKRVDATAG 429

Query: 286 Y--------------------------------VEVLERYVNLGPIVDFCVVD---LERQ 310
           +                                 EV +  +N+GP  +  + +   L  +
Sbjct: 430 WSAAGKSVPQDEVDEIEVYGSEAQSGTQLATYSFEVCDSILNIGPCANAAMGEPAFLSEE 489

Query: 311 GQG------QVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL-----RSSTDD 359
            Q       ++V CSG  K+G+L +++  I      + EL G   MW++     +   D+
Sbjct: 490 FQNSPEPDLEIVVCSGHGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEEEDN 549

Query: 360 P-------------------FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLF 400
           P                      FL++S    T IL      E+ E +  GF +Q  T+F
Sbjct: 550 PKGEGTEQEPSTPEADDDGRRHGFLILSREDSTMILQTG--QEIMELDTSGFATQGPTVF 607

Query: 401 CHDAIYNQ-LVQVTSGSVRLVSSTSR 425
             +   N+ +VQV+   +RL+   ++
Sbjct: 608 AGNIGDNRYIVQVSPLGIRLLEGVNQ 633


>gi|345779232|ref|XP_532356.3| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
            [Canis lupus familiaris]
          Length = 1460

 Score =  101 bits (252), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 111/466 (23%), Positives = 193/466 (41%), Gaps = 74/466 (15%)

Query: 618  VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQS 672
            +    PF++   P   L   ++GEL I  +           +R IPL      + +  +S
Sbjct: 989  IDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVES 1048

Query: 673  RTFAICSLKNQSCAE-----------------------ESEMHFVRLLDDQTFEFI--ST 707
            + +A+ +  N  C                         + E   ++L+   ++E I  + 
Sbjct: 1049 KVYAVATSTNMPCTRIPRMTGEEKEFETIERDDRYIHPQQEAFSIQLISPVSWEAIPNAR 1108

Query: 708  YPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-----IVE 757
              L+ +E+   + + S   +  V     Y   GT  +  EE    +GRIL+      + E
Sbjct: 1109 IELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVT-CRGRILIMDVIEVVPE 1167

Query: 758  DG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECG 811
             G      K +++ EKE KG V +L   NG L++AI QKI  + W LR     EL     
Sbjct: 1168 PGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKI--FLWSLR---ASELTGMAF 1222

Query: 812  HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI 871
                +    + +  +FI+  D+MKSISLL Y+ E   +   +RD     + +V+ + D+ 
Sbjct: 1223 IDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNA 1282

Query: 872  YLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV 928
             LG   ++ + NL       E        RL    ++H+G  VN F       R P    
Sbjct: 1283 QLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF------WRTPCRGA 1336

Query: 929  GQIPT-----------VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHE 977
             + P+             F T++G IG++  +  + Y  L  LQ  L  ++    GLN  
Sbjct: 1337 AEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPR 1396

Query: 978  QWRSFNNEKKTVD--AKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
             +R  + +++ +    +N LDG+L+  +L LS     E++K +  +
Sbjct: 1397 AFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTMERGELAKKIGTT 1442



 Score = 46.6 bits (109), Expect = 0.073,   Method: Compositional matrix adjust.
 Identities = 61/326 (18%), Positives = 141/326 (43%), Gaps = 60/326 (18%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAM-----GDVSDRIG 60
            +G + ++   +  G  +D L ++ +  K  V+++D  + +L T ++      ++ D   
Sbjct: 93  FFGNVMSMASVQLAGAKRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPELRDGFV 152

Query: 61  RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPF-------DNKGQLKEA----------FN 103
           +     ++ + DPD R   + +Y     V+PF       +++G + E            +
Sbjct: 153 QNVHTPRVRV-DPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLMGEGQRSSFLPSYIID 211

Query: 104 IR-LEE--LQVLDIKFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDFVEG-- 153
           +R L+E  L ++D++FL+G  +PT+++L++ N+      A    T  +     +  +   
Sbjct: 212 VRGLDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKVH 271

Query: 154 P--WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAFKAIPI 198
           P  WS  +L       + VP P+ GV++    +++Y +                  A P+
Sbjct: 272 PVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNGLTTGTTAFPL 331

Query: 199 RPSITKAYGRVDADGS--------RYLLGDHAGLLHLL-VITHEKEKVTGLKIELLGETS 249
           R   T+   R+  D +        + ++    G +++L +IT     V     +    + 
Sbjct: 332 R---TQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASV 388

Query: 250 IASTISYLDNAVVYIGSSYGDSQLIK 275
           + +++  ++   +++GS  G+S L+K
Sbjct: 389 LTTSMVTMEPGYLFLGSRLGNSLLLK 414



 Score = 40.0 bits (92), Expect = 6.5,   Method: Compositional matrix adjust.
 Identities = 41/172 (23%), Positives = 72/172 (41%), Gaps = 36/172 (20%)

Query: 288 EVLERYVNLGPIVDFCVVD---LERQGQG------QVVTCSGAYKDGSLRIVRNGIGINE 338
           EV +  +N+GP  +  + +   L  + Q       ++V CSG  K+G+L +++  I    
Sbjct: 482 EVCDSILNIGPCANAAMGEPAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQV 541

Query: 339 QASVELQGIKGMWS------------------------LRSSTDDPFDTFLVVSFISETR 374
             + EL G   MW+                        L +  D     FL++S    T 
Sbjct: 542 VTTFELPGCYDMWTVIAPVRKEQEETSKGEVAEQESSALEAEDDGRRHGFLILSREDSTM 601

Query: 375 ILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQ-LVQVTSGSVRLVSSTSR 425
           IL      E+ E +  GF +Q  T+F  +   N+ +VQV+   +RL+   ++
Sbjct: 602 ILQTG--QEIMELDTSGFATQGPTVFAGNIGDNRYIVQVSPLGIRLLEGVNQ 651


>gi|156364999|ref|XP_001626630.1| predicted protein [Nematostella vectensis]
 gi|156213514|gb|EDO34530.1| predicted protein [Nematostella vectensis]
          Length = 1420

 Score =  101 bits (252), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 125/531 (23%), Positives = 227/531 (42%), Gaps = 81/531 (15%)

Query: 570  KVSLGTQPITLRTFSSKNT-THVFAASDRPTVIYSSNKKLLYSNVNLKE--VSHMCPFNS 626
            K  L  +   LR F+  ++ + +F     P  I+ +N+   + +    +  V+    F++
Sbjct: 900  KSGLDPKVAMLRVFNDISSYSGIFVCGSYPFWIFVTNRGAFHWHPMSIDGPVTCFAAFHN 959

Query: 627  AAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQSRTFAICSLK 681
               P   L     GEL I  +           +R +PL   P  + +  +S+T+AI + +
Sbjct: 960  VNCPKGFLYFNTRGELRISVLPTHLSYDSPWPVRKVPLRYTPHMVSYNRESKTYAIVTSE 1019

Query: 682  NQSC-------AEESEMHFVRLLDDQTFEFIST--------------------YPLDTFE 714
             + C       AE+ E  FV  + D  F + ST                    + LD +E
Sbjct: 1020 QEPCKKIPRVTAEDKE--FVDTIRDARFIYPSTERFVLQLISPISWEVIPNTRHDLDEWE 1077

Query: 715  YGCSILSCSF-SDDSNV----YYCVGTAYVLPEENEPTKGRILVF-IVE----------D 758
            +  ++ +    S++++     + CVGT  +  EE    +GRIL+F I+E           
Sbjct: 1078 HVTTMKNLLLHSEETHTGRKGFICVGTTQLYGEEIA-VRGRILIFDIIEVVPEPGQPLTK 1136

Query: 759  GKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDD---GTRELQSECGHHGH 815
             K +L+ EKE KG V +LN  NG L++ I QKI  Y W   D+   G   + ++   H  
Sbjct: 1137 NKFKLLYEKEQKGPVTALNQVNGYLVSGIGQKI--YIWNFTDNDLVGMAFIDTQLYIHSL 1194

Query: 816  ILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG- 874
            +      T  +F++  D+ KSI+LL  + E   +   ++D     + A +   D   +G 
Sbjct: 1195 V------TIRNFVIAADVCKSITLLRLQEETKTLAFVSKDPKNLEVYAADFFIDGPQIGF 1248

Query: 875  ----AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDS---- 926
                 E N  LFT +   E    +   RL    + ++G  +  F   +    L  S    
Sbjct: 1249 LVSDVEKNLVLFTYQ--PEAIESQGGQRLLQRADINVGTHITSFFRIAAKAHLKASGEKS 1306

Query: 927  -DVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNE 985
             ++ Q+    FGT++G +G++  +  + +  L  LQT L   I  V GLN + +R     
Sbjct: 1307 KEMRQL--TCFGTLDGALGLMLPMTEKTFRRLHMLQTKLVDCIPHVAGLNPKAFRMLQWR 1364

Query: 986  KKTV--DAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTR 1034
            K+ +    +N LD  L+  ++ LS     E+++ +  +  ++   + ++ R
Sbjct: 1365 KRKLCNPHRNVLDWQLLFKYMHLSFMERQEVARKIGTTPAQIMDDMMDIER 1415



 Score = 63.9 bits (154), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 110/569 (19%), Positives = 217/569 (38%), Gaps = 163/569 (28%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGR---- 61
           ++G I +L   R  G  +D L ++ +  K  ++ +D    ++ TR++    D   +    
Sbjct: 72  LFGNIESLHAIRLAGNTRDSLLMSFKDAKLSIVDYDPGKHDIKTRSLHFFEDEKIKSHCL 131

Query: 62  PTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKE-----------------AFNI 104
             D   +  IDP+ R   +  Y     V+PF  +G + +                 ++ I
Sbjct: 132 AQDRAPVVRIDPERRCAVMLAYGTHLVVLPFRQEGGIDDTAQDSIISSSDRPPVLPSYII 191

Query: 105 RLEEL-----QVLDIKFLYGCAKPTIVVLYQDNKD------------ARHVKTYEVALKD 147
            ++E+      +LDI+FL+G  +PT+++LY+  K             A    +  ++ K 
Sbjct: 192 DVKEIDEKTCNILDIQFLHGYYEPTLLILYEPLKTWAGRLAMRNDTCALVAVSLNMSQKA 251

Query: 148 KDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVY---------CSANAF----K 194
              V   W  + L      ++PVP P+ GVL+     ++Y          S N+      
Sbjct: 252 HPVV---WQLSCLPFDCIYVMPVPKPIGGVLVCCMNALLYLNQSVPPYGVSVNSIGENST 308

Query: 195 AIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLL------VITHEKEKVTGLKIELLGET 248
             P++P   +    +  +GS  +   +  L+  L      V+T   + V  ++  +  +T
Sbjct: 309 VFPLKP---QKGVTITLEGSNAIFIANDKLVFSLKGGEIYVVTLIADGVRSVRNFVFDKT 365

Query: 249 S---IASTISYLDNAVVYIGSSYGDSQLIKLNLQP---------DAKGSYVEVLERY--- 293
           +   + S +    +  +++GS  G+S L+K   +P         +A+    + +ER+   
Sbjct: 366 AASVLTSCVCECGDGYLFLGSRLGNSLLVKYTEKPQDIVYGTENNAQSMQCDNIERWQIL 425

Query: 294 --------------------------------------VNLGP----------------- 298
                                                 +N+GP                 
Sbjct: 426 NGSLLLIVDDLDELEVYGAQQEAGVELTSYTFEVCDSLLNIGPCSCMDIGEPAFLSVSSY 485

Query: 299 IVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSST- 357
             D   +DLE      VV+CSG  K+G+L +++  I      + EL G   MW++ S   
Sbjct: 486 FADAQELDLE------VVSCSGYGKNGALTVLQRSIRPQVVTTFELPGCTDMWTVFSKDQ 539

Query: 358 --------------------DDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQ 397
                               ++ + +FL++S   E   + +  E E+ E +  GF +Q  
Sbjct: 540 KKGAQTNAIHRYPSQPCTQGNEKYHSFLILS--REDSSMILKTEQEIMEVDQSGFSTQCA 597

Query: 398 TLFCHD-AIYNQLVQVTSGSVRLVSSTSR 425
           T++  +    + ++QVT   VRL+   ++
Sbjct: 598 TIYAGNFGNGSYILQVTPLGVRLLEGVNQ 626


>gi|344236599|gb|EGV92702.1| Cleavage and polyadenylation specificity factor subunit 1 [Cricetulus
            griseus]
          Length = 1419

 Score =  101 bits (252), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 112/466 (24%), Positives = 193/466 (41%), Gaps = 74/466 (15%)

Query: 618  VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQS 672
            +    PF++   P   L   ++GEL I  +           +R IPL      + +  +S
Sbjct: 948  IDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVES 1007

Query: 673  RTFAICSLKNQSCAE-----------------------ESEMHFVRLLDDQTFEFI--ST 707
            + +A+ +  N  C                         + E   ++L+   ++E I  + 
Sbjct: 1008 KVYAVATSTNTPCTRIPRMTGEEKEFEAIERDDRYIHPQQEAFSIQLISPVSWEAIPNAR 1067

Query: 708  YPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-----IVE 757
              L+ +E+   + + S   +  V     Y   GT  +  EE    +GRIL+      + E
Sbjct: 1068 IELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVT-CRGRILIMDVIEVVPE 1126

Query: 758  DG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECG 811
             G      K +++ EKE KG V +L   NG L++AI QKI L  W LR     EL     
Sbjct: 1127 PGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFL--WSLR---ASELTGMAF 1181

Query: 812  HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI 871
                +    + +  +FI+  D+MKSISLL Y+ E   +   +RD     + +V+ + D+ 
Sbjct: 1182 IDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNA 1241

Query: 872  YLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV 928
             LG   ++ + NL       E        RL    ++H+G  VN F       R P    
Sbjct: 1242 QLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF------WRTPCRGA 1295

Query: 929  GQIPT-----------VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHE 977
             + P+             F T++G IG++  +  + Y  L  LQ  L  ++    GLN  
Sbjct: 1296 AEGPSKKSVMWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPR 1355

Query: 978  QWRSFNNEKKTVD--AKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
             +R  + +++ +    +N LDG+L+  +L LS     E++K +  +
Sbjct: 1356 AFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTT 1401



 Score = 59.7 bits (143), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 107/570 (18%), Positives = 220/570 (38%), Gaps = 156/570 (27%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAM-----GDVSDRIG 60
            +G + ++   +  G  +D L ++ +  K  V+++D  + +L T ++      ++ D   
Sbjct: 74  FFGNVMSMASVQLAGAKRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEESELRDGFV 133

Query: 61  RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPF-------DNKGQLKEA----------FN 103
           +     ++ + DPD R   + +Y     V+PF       +++G + E            +
Sbjct: 134 QNVHTPRVRV-DPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLMGEGQRSSFLPSYIID 192

Query: 104 IR-LEE--LQVLDIKFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDFVEG-- 153
           +R L+E  L ++D++FL+G  +PT+++L++ N+      A    T  +     +  +   
Sbjct: 193 VRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKVH 252

Query: 154 P--WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAFKAIPI 198
           P  WS  +L       + VP P+ GV+I    +++Y +                  A P+
Sbjct: 253 PVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFPL 312

Query: 199 RPSITKAYGRVDADGS--------RYLLGDHAGLLHLL-VITHEKEKVTGLKIELLGETS 249
           R   T+   R+  D +        + ++    G +++L +IT     V     +    + 
Sbjct: 313 R---TQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASV 369

Query: 250 IASTISYLDNAVVYIGSSYGDSQLIKL----------------------NLQPDAKGSYV 287
           + +++  ++   +++GS  G+S L+K                       N +P +K   V
Sbjct: 370 LTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASSVREAADKASAHNEEPPSKKKRV 429

Query: 288 --------------------------------------EVLERYVNLGPIVDFCVVD--- 306
                                                 EV +  +N+GP  +  V +   
Sbjct: 430 DPTAGWTGGKTVPQDEVDEIEVYGSEAQSGTQLATYSFEVCDSMLNIGPCANAAVGEPAF 489

Query: 307 LERQGQG------QVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL------- 353
           L  + Q       ++V CSG  K+G+L +++  I      + EL G   MW++       
Sbjct: 490 LSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKE 549

Query: 354 -----------------RSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQT 396
                            ++  D     FL++S    T IL      E+ E +  GF +Q 
Sbjct: 550 EEEAPRAESTEQESTTPKAEEDGRRHGFLILSREDSTMILQTG--QEIMELDTSGFATQG 607

Query: 397 QTLFCHDAIYNQ-LVQVTSGSVRLVSSTSR 425
            T+F  +   N+ +VQV+   +RL+   ++
Sbjct: 608 PTVFAGNIGDNRYIVQVSPLGIRLLEGVNQ 637


>gi|197245729|gb|AAI68713.1| Cpsf1 protein [Rattus norvegicus]
          Length = 1439

 Score =  101 bits (252), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 112/466 (24%), Positives = 193/466 (41%), Gaps = 74/466 (15%)

Query: 618  VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQS 672
            +    PF++   P   L   ++GEL I  +           +R IPL      + +  +S
Sbjct: 968  IDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVES 1027

Query: 673  RTFAICSLKNQSCAE-----------------------ESEMHFVRLLDDQTFEFI--ST 707
            + +A+ +  N  C                         + E   ++L+   ++E I  + 
Sbjct: 1028 KVYAVATSTNTPCTRIPRMTGEEKEFEAIERDDRYIHPQQEAFSIQLISPVSWEAIPNAR 1087

Query: 708  YPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-----IVE 757
              L+ +E+   + + S   +  V     Y   GT  +  EE    +GRIL+      + E
Sbjct: 1088 IELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVT-CRGRILIMDVIEVVPE 1146

Query: 758  DG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECG 811
             G      K +++ EKE KG V +L   NG L++AI QKI L  W LR     EL     
Sbjct: 1147 PGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFL--WSLR---ASELTGMAF 1201

Query: 812  HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI 871
                +    + +  +FI+  D+MKSISLL Y+ E   +   +RD     + +V+ + D+ 
Sbjct: 1202 IDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNA 1261

Query: 872  YLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV 928
             LG   ++ + NL       E        RL    ++H+G  VN F       R P    
Sbjct: 1262 QLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF------WRTPCRGA 1315

Query: 929  GQIPT-----------VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHE 977
             + P+             F T++G IG++  +  + Y  L  LQ  L  ++    GLN  
Sbjct: 1316 AEGPSKKSVMWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPR 1375

Query: 978  QWRSFNNEKKTVD--AKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
             +R  + +++ +    +N LDG+L+  +L LS     E++K +  +
Sbjct: 1376 AFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTT 1421



 Score = 60.1 bits (144), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 108/563 (19%), Positives = 220/563 (39%), Gaps = 149/563 (26%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAM-----GDVSDRIG 60
            +G + ++   +  G  +D L ++ +  K  V+++D  + +L T ++      ++ D   
Sbjct: 74  FFGNVMSMASVQLAGAKRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPELRDGFV 133

Query: 61  RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPF-------DNKGQLKEA----------FN 103
           +     ++ + DPD R   + +Y     V+PF       +++G + E            +
Sbjct: 134 QNVHTPRVRV-DPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLMGEGQRSSFLPSYIID 192

Query: 104 IR-LEE--LQVLDIKFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDFVEG-- 153
           +R L+E  L ++D++FL+G  +PT+++L++ N+      A    T  +     +  +   
Sbjct: 193 VRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKVH 252

Query: 154 P--WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAFKAIPI 198
           P  WS  +L       + VP P+ GV+I    +++Y +                  A P+
Sbjct: 253 PVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFPL 312

Query: 199 RPSITKAYGRVDADGS--------RYLLGDHAGLLHLL-VITHEKEKVTGLKIELLGETS 249
           R   T+   R+  D +        + ++    G +++L +IT     V     +    + 
Sbjct: 313 R---TQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASV 369

Query: 250 IASTISYLDNAVVYIGSSYGDSQLIKL--NLQ---------------PDAKGSYV----- 287
           + +++  ++   +++GS  G+S L+K    LQ               P +K   V     
Sbjct: 370 LTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASSVREAADKEEPPSKKKRVDPTVG 429

Query: 288 -------------------------------EVLERYVNLGPIVDFCVVD---LERQGQG 313
                                          EV +  +N+GP  +  V +   L  + Q 
Sbjct: 430 WTGGKTQDEVDEIEVYGSEAQSGTQLATYSFEVCDSMLNIGPCANAAVGEPAFLSEEFQN 489

Query: 314 ------QVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL-------------- 353
                 ++V CSG  K+G+L +++  I      + EL G   MW++              
Sbjct: 490 SPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEEEETPKA 549

Query: 354 ----------RSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHD 403
                     ++  D     FL++S    T IL      E+ E +  GF +Q  T+F  +
Sbjct: 550 ESTEQEPSAPKAEEDGRRHGFLILSREDSTMILQTG--QEIMELDTSGFATQGPTVFAGN 607

Query: 404 AIYNQ-LVQVTSGSVRLVSSTSR 425
              N+ +VQV+   +RL+   ++
Sbjct: 608 IGDNRYIVQVSPLGIRLLEGVNQ 630


>gi|354491122|ref|XP_003507705.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
            isoform 1 [Cricetulus griseus]
          Length = 1441

 Score =  101 bits (252), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 112/466 (24%), Positives = 193/466 (41%), Gaps = 74/466 (15%)

Query: 618  VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQS 672
            +    PF++   P   L   ++GEL I  +           +R IPL      + +  +S
Sbjct: 970  IDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVES 1029

Query: 673  RTFAICSLKNQSCAE-----------------------ESEMHFVRLLDDQTFEFI--ST 707
            + +A+ +  N  C                         + E   ++L+   ++E I  + 
Sbjct: 1030 KVYAVATSTNTPCTRIPRMTGEEKEFEAIERDDRYIHPQQEAFSIQLISPVSWEAIPNAR 1089

Query: 708  YPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-----IVE 757
              L+ +E+   + + S   +  V     Y   GT  +  EE    +GRIL+      + E
Sbjct: 1090 IELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVT-CRGRILIMDVIEVVPE 1148

Query: 758  DG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECG 811
             G      K +++ EKE KG V +L   NG L++AI QKI L  W LR     EL     
Sbjct: 1149 PGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFL--WSLR---ASELTGMAF 1203

Query: 812  HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI 871
                +    + +  +FI+  D+MKSISLL Y+ E   +   +RD     + +V+ + D+ 
Sbjct: 1204 IDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNA 1263

Query: 872  YLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV 928
             LG   ++ + NL       E        RL    ++H+G  VN F       R P    
Sbjct: 1264 QLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF------WRTPCRGA 1317

Query: 929  GQIPT-----------VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHE 977
             + P+             F T++G IG++  +  + Y  L  LQ  L  ++    GLN  
Sbjct: 1318 AEGPSKKSVMWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPR 1377

Query: 978  QWRSFNNEKKTVD--AKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
             +R  + +++ +    +N LDG+L+  +L LS     E++K +  +
Sbjct: 1378 AFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTT 1423



 Score = 58.2 bits (139), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 107/565 (18%), Positives = 220/565 (38%), Gaps = 151/565 (26%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAM-----GDVSDRIG 60
            +G + ++   +  G  +D L ++ +  K  V+++D  + +L T ++      ++ D   
Sbjct: 74  FFGNVMSMASVQLAGAKRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEESELRDGFV 133

Query: 61  RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPF-------DNKGQLKEA----------FN 103
           +     ++ + DPD R   + +Y     V+PF       +++G + E            +
Sbjct: 134 QNVHTPRVRV-DPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLMGEGQRSSFLPSYIID 192

Query: 104 IR-LEE--LQVLDIKFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDFVEG-- 153
           +R L+E  L ++D++FL+G  +PT+++L++ N+      A    T  +     +  +   
Sbjct: 193 VRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKVH 252

Query: 154 P--WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAFKAIPI 198
           P  WS  +L       + VP P+ GV+I    +++Y +                  A P+
Sbjct: 253 PVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFPL 312

Query: 199 RPSITKAYGRVDADGS--------RYLLGDHAGLLHLL-VITHEKEKVTGLKIELLGETS 249
           R   T+   R+  D +        + ++    G +++L +IT     V     +    + 
Sbjct: 313 R---TQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASV 369

Query: 250 IASTISYLDNAVVYIGSSYGDSQLIKLN--LQ---------------PDAKGSYVE---- 288
           + +++  ++   +++GS  G+S L+K    LQ               P +K   V+    
Sbjct: 370 LTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASSVREAADKEEPPSKKKRVDPTAG 429

Query: 289 ----------------------------------VLERYVNLGPIVDFCVVD---LERQG 311
                                             V +  +N+GP  +  V +   L  + 
Sbjct: 430 WTGGKTVPQDEVDEIEVYGSEAQSGTQLATYSFEVCDSMLNIGPCANAAVGEPAFLSEEF 489

Query: 312 QG------QVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL------------ 353
           Q       ++V CSG  K+G+L +++  I      + EL G   MW++            
Sbjct: 490 QNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEEEEAP 549

Query: 354 ------------RSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFC 401
                       ++  D     FL++S    T IL      E+ E +  GF +Q  T+F 
Sbjct: 550 RAESTEQESTTPKAEEDGRRHGFLILSREDSTMILQTG--QEIMELDTSGFATQGPTVFA 607

Query: 402 HDAIYNQ-LVQVTSGSVRLVSSTSR 425
            +   N+ +VQV+   +RL+   ++
Sbjct: 608 GNIGDNRYIVQVSPLGIRLLEGVNQ 632


>gi|1045574|gb|AAC50293.1| cleavage and polyadenylation specificity factor [Homo sapiens]
          Length = 1442

 Score =  101 bits (252), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 116/465 (24%), Positives = 195/465 (41%), Gaps = 73/465 (15%)

Query: 618  VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQS 672
            V    PF++   P   L   ++GEL I  +           +R IPL      + +  +S
Sbjct: 972  VDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVES 1031

Query: 673  RTFAICSLKNQSCAE-----------------------ESEMHFVRLLDDQTFEFI--ST 707
            + +A+ +  N  CA                        + E   ++L+   ++E I  + 
Sbjct: 1032 KVYAVATSTNTPCARIPRMTGEEKEFETIERDERYIHPQQEAFSIQLISPVSWEAIPNAR 1091

Query: 708  YPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-----IVE 757
              L  +E+   + + S   +  V     Y   GT  +  EE    +GRIL+      + E
Sbjct: 1092 IELQEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVT-CRGRILIMDVIEVVPE 1150

Query: 758  DG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECG 811
             G      K +++ EKE KG V +L   NG L++AI QKI L  W LR     EL     
Sbjct: 1151 PGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFL--WSLR---ASELTGMAF 1205

Query: 812  HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI 871
                +    + +  +FI+  D+MKSISLL Y+ E   +   +RD     + +V+ + D+ 
Sbjct: 1206 IDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNA 1265

Query: 872  YLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLP---- 924
             LG   ++ + NL       E        RL    ++H+G  VN F       R P    
Sbjct: 1266 QLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF------WRTPCRAT 1319

Query: 925  -----DSDVGQIPTVI-FGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQ 978
                  S V +   +  F T++G IG++  +  + Y  L  LQ  L  ++    GLN   
Sbjct: 1320 EGLSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRA 1379

Query: 979  WRSFNNEKKTVD--AKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
            +R  + +++T+    +N LDG+L+  +L LS     E++K +  +
Sbjct: 1380 FRMLHVDRRTLQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTT 1424



 Score = 55.8 bits (133), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 107/567 (18%), Positives = 220/567 (38%), Gaps = 153/567 (26%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAM-----GDVSDRIG 60
            +G + ++   +  G  +D L ++ +  K  V+++D  + +L T ++      ++ D   
Sbjct: 74  FFGNVMSMASVQLAGAKRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPELRDGFV 133

Query: 61  RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPF-------DNKGQLKEA----------FN 103
           +     ++ + DPD R   + +Y     V+PF       +++G + E            +
Sbjct: 134 QNVHTPRVRV-DPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGEGQRSSFLPSYIID 192

Query: 104 IR-LEE--LQVLDIKFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDFVEG-- 153
           +R L+E  L ++D++FL+G  +PT+++L++ N+      A    T  +     +  +   
Sbjct: 193 VRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKVH 252

Query: 154 P--WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAFKAIPI 198
           P  WS  +L       + VP P+ GV++    +++Y +                  A P+
Sbjct: 253 PVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTGTTAFPL 312

Query: 199 RPSITKAYGRVDADGS--------RYLLGDHAGLLHLL-VITHEKEKVTGLKIELLGETS 249
           R   T+   R+  D +        + ++    G +++L +IT     V     +    + 
Sbjct: 313 R---TQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASV 369

Query: 250 IASTISYLDNAVVYIGSSYGDSQLI----KLNLQP--------------------DAKGS 285
           + +++  ++   +++GS  G+S L+    KL   P                    DA   
Sbjct: 370 LTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASAVREAADKEEPPSKKKRVDATAG 429

Query: 286 Y--------------------------------VEVLERYVNLGPIVDFCVVD---LERQ 310
           +                                 EV +  +N+GP  +  V +   L  +
Sbjct: 430 WSAAGKSVPQDEVDEIEVYGSEAQSGTQLATYSFEVCDSILNIGPCANAAVGEPAFLSEE 489

Query: 311 GQG------QVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL-----RSSTDD 359
            Q       ++V CSG  K+G+L +++  I      + EL G   MW++     +   D+
Sbjct: 490 FQNSPEPDLEIVVCSGHGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEEEDN 549

Query: 360 P--------------------FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTL 399
           P                       FL++S    T IL      E+ E +  GF +Q  T+
Sbjct: 550 PKGEGTEQEPSTTPEADDDGRRHGFLILSREDSTMILQTG--QEIMELDTSGFATQGPTV 607

Query: 400 FCHDAIYNQ-LVQVTSGSVRLVSSTSR 425
           F  +   N+ +VQV+   +RL+   ++
Sbjct: 608 FAGNIGDNRYIVQVSPLGIRLLEGVNQ 634


>gi|338728511|ref|XP_001505047.3| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like isoform 1 [Equus caballus]
          Length = 1444

 Score =  101 bits (251), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 112/466 (24%), Positives = 193/466 (41%), Gaps = 74/466 (15%)

Query: 618  VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQS 672
            +    PF++   P   L   ++GEL I  +           +R IPL      + +  +S
Sbjct: 973  IDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVES 1032

Query: 673  RTFAICSLKNQSCAE-----------------------ESEMHFVRLLDDQTFEFI--ST 707
            + +A+ +  N  C                         + E   ++L+   ++E I  + 
Sbjct: 1033 KVYAVATSTNTPCTRIPRMTGEEKEFETIERDDRYIHPQQEAFSIQLISPVSWEAIPNAR 1092

Query: 708  YPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-----IVE 757
              L+ +E+   + + S   +  V     Y   GT  +  EE    +GRIL+      + E
Sbjct: 1093 IELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVT-CRGRILIMDVIEVVPE 1151

Query: 758  DG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECG 811
             G      K +++ EKE KG V +L   NG L++AI QKI L  W LR     EL     
Sbjct: 1152 PGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFL--WSLR---ASELTGMAF 1206

Query: 812  HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI 871
                +    + +  +FI+  D+MKSISLL Y+ E   +   +RD     + +V+ + D+ 
Sbjct: 1207 IDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNA 1266

Query: 872  YLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV 928
             LG   ++ + NL       E        RL    ++H+G  VN F       R P    
Sbjct: 1267 QLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF------WRTPCRGA 1320

Query: 929  GQIPT-----------VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHE 977
             + P+             F T++G IG++  +  + Y  L  LQ  L  ++    GLN  
Sbjct: 1321 AEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPR 1380

Query: 978  QWRSFNNEKKTVD--AKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
             +R  + +++ +    +N LDG+L+  +L LS     E++K +  +
Sbjct: 1381 AFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTMERGELAKKIGTT 1426



 Score = 46.6 bits (109), Expect = 0.069,   Method: Compositional matrix adjust.
 Identities = 61/326 (18%), Positives = 141/326 (43%), Gaps = 60/326 (18%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAM-----GDVSDRIG 60
            +G + ++   +  G  +D L ++ +  K  V+++D  + +L T ++      ++ D   
Sbjct: 77  FFGNVMSMASVQLAGAKRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPELRDGFV 136

Query: 61  RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPF-------DNKGQLKEA----------FN 103
           +     ++ + DPD R   + +Y     V+PF       +++G + E            +
Sbjct: 137 QNVHTPRVRV-DPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLMGEGQRSSFLPSYIID 195

Query: 104 IR-LEE--LQVLDIKFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDFVEG-- 153
           +R L+E  L ++D++FL+G  +PT+++L++ N+      A    T  +     +  +   
Sbjct: 196 VRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKVH 255

Query: 154 P--WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAFKAIPI 198
           P  WS  +L       + VP P+ GV++    +++Y +                  A P+
Sbjct: 256 PVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTGTTAFPL 315

Query: 199 RPSITKAYGRVDADGS--------RYLLGDHAGLLHLL-VITHEKEKVTGLKIELLGETS 249
           R   T+   R+  D +        + ++    G +++L +IT     V     +    + 
Sbjct: 316 R---TQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASV 372

Query: 250 IASTISYLDNAVVYIGSSYGDSQLIK 275
           + +++  ++   +++GS  G+S L+K
Sbjct: 373 LTTSMVTMEPGYLFLGSRLGNSLLLK 398


>gi|444523674|gb|ELV13604.1| Cleavage and polyadenylation specificity factor subunit 1 [Tupaia
            chinensis]
          Length = 1469

 Score =  101 bits (251), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 115/468 (24%), Positives = 197/468 (42%), Gaps = 78/468 (16%)

Query: 618  VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQS 672
            +    PF++   P   L   ++GEL I  +           +R IPL      + +  +S
Sbjct: 998  IDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVES 1057

Query: 673  RTFAICSLKNQSCAE-----------------------ESEMHFVRLLDDQTFEFI--ST 707
            + +A+ +  N  C                         + E   ++L+   ++E I  + 
Sbjct: 1058 KVYAVATSTNAPCTRIPRMTGEEKEFETIERDDRYIHPQQEAFSIQLISPVSWEAIPNAR 1117

Query: 708  YPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-----IVE 757
              L+ +E+   + + S   +  V     Y   GT  +  EE    +GRIL+      + E
Sbjct: 1118 IELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVT-CRGRILIMDVIEVVPE 1176

Query: 758  DG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECG 811
             G      K +++ EKE KG V +L   NG L++AI QKI L  W LR     EL     
Sbjct: 1177 PGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFL--WSLR---ASELTGMAF 1231

Query: 812  HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI 871
                +    + +  +FI+  D+MKSISLL Y+ E   +   +RD     + +V+ + D+ 
Sbjct: 1232 IDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNA 1291

Query: 872  YLG---AENNFNLFTVRKNSEGATDEERGRLEVV--GEYHLGEFVNRFRHGSLVMRLPDS 926
             LG   ++ + NL       E    E  G L ++   ++HLG  VN F       R P  
Sbjct: 1292 QLGFLVSDRDRNLMVYMYLPEA--KESFGGLLLLRRADFHLGAHVNTF------WRTPCR 1343

Query: 927  DVGQIPT-----------VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLN 975
               + P+             F T++G IG++  +  + Y  L  LQ  L  ++    GLN
Sbjct: 1344 GAVEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLN 1403

Query: 976  HEQWRSFNNEKKTVD--AKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
               +R  + +++T+    +N LDG+L+  +L LS     E++K +  +
Sbjct: 1404 PRAFRMLHVDRRTLQNAVRNVLDGELLSRYLYLSTMERSELAKKIGTT 1451



 Score = 47.4 bits (111), Expect = 0.048,   Method: Compositional matrix adjust.
 Identities = 62/323 (19%), Positives = 138/323 (42%), Gaps = 54/323 (16%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAM-----GDVSDRIG 60
            +G + ++   +  G  +D L ++ +  K  V+++D  + +L T ++      ++ D   
Sbjct: 74  FFGNVMSMASVQLAGAKRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPELRDGFV 133

Query: 61  RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPF-------DNKGQLKEA----------FN 103
           +     ++ + DPD R   + +Y     V+PF       +++G + E            +
Sbjct: 134 QNVHTPRVRV-DPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLVGEGQRSSFLPSYIID 192

Query: 104 IR-LEE--LQVLDIKFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDFVE--G 153
           +R L+E  L ++D++FL+G  +PT+++L++ N+      A    T  +     +  +   
Sbjct: 193 VRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQRVH 252

Query: 154 P--WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFK---AIPIRPSITKAYGR 208
           P  WS  +L       + VP P+ GV++    +++Y + +      A+    + T A+  
Sbjct: 253 PVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTAGTTAFPL 312

Query: 209 VDADGSRYLLG-DHAGLLH---------------LLVITHEKEKVTGLKIELLGETSIAS 252
              DG R  L   HA  +                L ++T     V     +    + + +
Sbjct: 313 RTQDGVRLTLDCAHAAFISYDKMVISLKGGEIYVLTLVTDGMRSVRAFHFDKAAASVLTT 372

Query: 253 TISYLDNAVVYIGSSYGDSQLIK 275
           ++  ++   +++GS  G+S L+K
Sbjct: 373 SMVTMEPGYLFLGSRLGNSLLLK 395


>gi|383863556|ref|XP_003707246.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Megachile rotundata]
          Length = 1415

 Score =  101 bits (251), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 125/529 (23%), Positives = 232/529 (43%), Gaps = 84/529 (15%)

Query: 575  TQPITLRTFSS-KNTTHVFAASDRPTVIYSSNKKLLYSNVNLKE--VSHMCPFNSAAFPD 631
            T+   +R FS+      VF  SD P  I+ + +  L ++    +  ++   PFN+   P 
Sbjct: 897  TRHCMMRYFSNIAGYNGVFICSDYPHWIFLTGRGELRTHPMGIDGPITSFAPFNNINCPQ 956

Query: 632  S-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCA 686
              L   ++ EL I  +           +R +PL   P  + +  +S+T+ + +    S A
Sbjct: 957  GFLYFNRKEELRICVLPTHLSYDAPWPVRKVPLRCTPHFVTYHLESKTYCVIT----SIA 1012

Query: 687  EESEMHFVRLLDDQTF-------EFI-----------------STYP-----LDTFEYGC 717
            E  + ++    +D+ F        FI                  T P     LD +E+  
Sbjct: 1013 EPLKSYYRFNGEDKEFTEEDRPDRFIFPSQEQFSIVLFSPVSWETIPNTKIELDQWEHVT 1072

Query: 718  SILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-IVE----------DGKL 761
             + + S + +        Y  +GT Y   E+   ++GRIL+F I+E            + 
Sbjct: 1073 CLKNVSLAYEGTRSGLKGYIVLGTNYNYGEDIT-SRGRILIFDIIEVVPEPGQPLTKNRF 1131

Query: 762  QLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDD---GTRELQSECGHHGHILA 818
            + I  KE KG + ++   +G L++A+ QKI  Y W L+D+   G   + ++   H  +L+
Sbjct: 1132 KQIYAKEQKGPITAITQVSGFLVSAVGQKI--YIWQLKDNDLVGVAFIDTQIYIH-QMLS 1188

Query: 819  LYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG---- 874
            +        I++ D+ KSISLL ++ E   +   +RD+    +  +E L D+  LG    
Sbjct: 1189 I-----KSLILIADVYKSISLLRFQEEYRTLSLVSRDFRPAEVYTIEYLIDNNNLGFLVA 1243

Query: 875  -AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIP- 932
              E+N  LF  +  S  +   +  +L    ++HLG+ VN F      +  P +D      
Sbjct: 1244 DGESNIALFMYQPESRESLGGQ--KLIRKADFHLGQKVNTFFRIRCRISDPANDKKHFSG 1301

Query: 933  -----TVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKK 987
                   ++ +++G +G I  +P + Y  L  LQ  L   I  + GLN + +R++ +  +
Sbjct: 1302 ADKRHVTMYASLDGSLGYILPVPEKTYRRLLMLQNVLVTHICHIAGLNPKAYRTYKSYIR 1361

Query: 988  TV--DAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTR 1034
            T    A+  +DGDL+  +L L      +++K +   V+E+ + + E+ R
Sbjct: 1362 TQGNPARGIIDGDLVWRYLYLPNNEKIDVAKKIGTRVQEIIEDLTEIDR 1410



 Score = 54.3 bits (129), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 66/329 (20%), Positives = 138/329 (41%), Gaps = 61/329 (18%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRP--T 63
           ++G + +++     G  +D L ++    K  V+++D +  +L T ++    +   R   T
Sbjct: 73  LHGNVMSMQAVTLVGSQRDSLLLSFRDAKLSVVEYDQDIHDLRTVSLHYFEEEEIRDGWT 132

Query: 64  DNGQIGII--DPDCRLIGLHLYDGLFKVIPF------------------DNKGQLKEAFN 103
           ++  I I+  DP+ R   + +Y     V+PF                   NK  +  ++ 
Sbjct: 133 NHHHIPIVRVDPEGRCAVMLIYGRKLVVLPFKKDPSLDDGDLLDNSKASSNKTPILSSYM 192

Query: 104 IRLEELQ-----VLDIKFLYGCAKPTIVVLYQDNKD------ARHVKTYEVALK---DKD 149
           I L+ L+     ++D++FL+G  +PT+++LY+  +        R      VA+     + 
Sbjct: 193 IVLKSLEEKMDNIIDLQFLHGYYEPTLLILYEPVRTFSGRIAVRQDTCAMVAISLNIQQR 252

Query: 150 FVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAFKAI 196
                WS +NL       +PV  PL G LI+   +++Y +             A      
Sbjct: 253 VHPIIWSVSNLPFDCYQAVPVKKPLGGTLIMAVNSLIYLNQSIPPYGVSLNSLAETSTNF 312

Query: 197 PIRPSITKAYGRVDADGS--------RYLLGDHAGLLHLL-VITHEKEKVTGLKIELLGE 247
           P++P   +   ++  +GS        R ++   +G L++L +       V G   +    
Sbjct: 313 PLKP---QEGVKISLEGSQVAFISSDRLVISLKSGELYVLSLFADSMRSVRGFHFDKAAA 369

Query: 248 TSIASTISYLDNAVVYIGSSYGDSQLIKL 276
           + + S +   D+  +++GS  G+S L++ 
Sbjct: 370 SVLTSCVCMCDDNYLFLGSRLGNSLLLRF 398



 Score = 44.3 bits (103), Expect = 0.39,   Method: Compositional matrix adjust.
 Identities = 46/172 (26%), Positives = 77/172 (44%), Gaps = 14/172 (8%)

Query: 314 QVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP---------FDTF 364
           ++VT SG  K+G+L +++  I      + EL G + MW++  + ++             F
Sbjct: 505 ELVTTSGYGKNGALCVLQRSIRPQVVTTFELPGCEDMWTVIGALNNDEQVRPEAEGSHAF 564

Query: 365 LVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQ-LVQVTSGSVRLVSST 423
           L++S    T IL      E+ E +  GF +Q  T+F  +   N+ +VQVT   VRL+   
Sbjct: 565 LILSQEDSTMILQTG--QEINEVDQSGFSTQGSTVFAGNLGANRYIVQVTQMGVRLLQGI 622

Query: 424 SRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQ 475
             E         G  +  A+     V L +  G ++ L + +G  T   HAQ
Sbjct: 623 --EQIQHMPIDLGCPIVHASCADPYVSLLSEDGQVMLLTLREGRGTAKLHAQ 672


>gi|338728513|ref|XP_003365689.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Equus caballus]
          Length = 1450

 Score =  101 bits (251), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 112/466 (24%), Positives = 193/466 (41%), Gaps = 74/466 (15%)

Query: 618  VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQS 672
            +    PF++   P   L   ++GEL I  +           +R IPL      + +  +S
Sbjct: 979  IDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVES 1038

Query: 673  RTFAICSLKNQSCAE-----------------------ESEMHFVRLLDDQTFEFI--ST 707
            + +A+ +  N  C                         + E   ++L+   ++E I  + 
Sbjct: 1039 KVYAVATSTNTPCTRIPRMTGEEKEFETIERDDRYIHPQQEAFSIQLISPVSWEAIPNAR 1098

Query: 708  YPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-----IVE 757
              L+ +E+   + + S   +  V     Y   GT  +  EE    +GRIL+      + E
Sbjct: 1099 IELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVT-CRGRILIMDVIEVVPE 1157

Query: 758  DG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECG 811
             G      K +++ EKE KG V +L   NG L++AI QKI L  W LR     EL     
Sbjct: 1158 PGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFL--WSLR---ASELTGMAF 1212

Query: 812  HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI 871
                +    + +  +FI+  D+MKSISLL Y+ E   +   +RD     + +V+ + D+ 
Sbjct: 1213 IDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNA 1272

Query: 872  YLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV 928
             LG   ++ + NL       E        RL    ++H+G  VN F       R P    
Sbjct: 1273 QLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF------WRTPCRGA 1326

Query: 929  GQIPT-----------VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHE 977
             + P+             F T++G IG++  +  + Y  L  LQ  L  ++    GLN  
Sbjct: 1327 AEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPR 1386

Query: 978  QWRSFNNEKKTVD--AKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
             +R  + +++ +    +N LDG+L+  +L LS     E++K +  +
Sbjct: 1387 AFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTMERGELAKKIGTT 1432



 Score = 46.6 bits (109), Expect = 0.072,   Method: Compositional matrix adjust.
 Identities = 61/326 (18%), Positives = 141/326 (43%), Gaps = 60/326 (18%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAM-----GDVSDRIG 60
            +G + ++   +  G  +D L ++ +  K  V+++D  + +L T ++      ++ D   
Sbjct: 77  FFGNVMSMASVQLAGAKRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPELRDGFV 136

Query: 61  RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPF-------DNKGQLKEA----------FN 103
           +     ++ + DPD R   + +Y     V+PF       +++G + E            +
Sbjct: 137 QNVHTPRVRV-DPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLMGEGQRSSFLPSYIID 195

Query: 104 IR-LEE--LQVLDIKFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDFVEG-- 153
           +R L+E  L ++D++FL+G  +PT+++L++ N+      A    T  +     +  +   
Sbjct: 196 VRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKVH 255

Query: 154 P--WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAFKAIPI 198
           P  WS  +L       + VP P+ GV++    +++Y +                  A P+
Sbjct: 256 PVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTGTTAFPL 315

Query: 199 RPSITKAYGRVDADGS--------RYLLGDHAGLLHLL-VITHEKEKVTGLKIELLGETS 249
           R   T+   R+  D +        + ++    G +++L +IT     V     +    + 
Sbjct: 316 R---TQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASV 372

Query: 250 IASTISYLDNAVVYIGSSYGDSQLIK 275
           + +++  ++   +++GS  G+S L+K
Sbjct: 373 LTTSMVTMEPGYLFLGSRLGNSLLLK 398


>gi|241952575|ref|XP_002419009.1| pre-mRNA-splicing factor, putative; pre-spliceosome component,
            putative [Candida dubliniensis CD36]
 gi|223642349|emb|CAX42591.1| pre-mRNA-splicing factor, putative [Candida dubliniensis CD36]
          Length = 1187

 Score =  101 bits (251), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 158/754 (20%), Positives = 308/754 (40%), Gaps = 97/754 (12%)

Query: 350  MWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQL 409
            +++ + S +   D +LV+S    ++ L +++ + +E+ E   F     T+         +
Sbjct: 454  IFTTKLSLESANDEYLVISSSLSSKTLVLSIGEVVEDVEDSEFVLDQSTISVQQVGIASV 513

Query: 410  VQVTSGSVRLVSSTS-RELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG---D 465
            VQV S  ++ + + + ++   +W  P G ++  A+ N  QVL+A    ++VY EI    D
Sbjct: 514  VQVYSNGIKHIRTVNGKKKTTDWFPPAGITITHASTNNQQVLIALSNLNVVYFEIDSTDD 573

Query: 466  GILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHL- 524
             ++    H ++   I+ + I     N   S  A +G  +D ++++ SL + N +  + L 
Sbjct: 574  QLIEYQDHLEISTTITAMAIQE--HNSEKSSFAIIGC-SDETIQVVSLQEDNCLEIKSLQ 630

Query: 525  ---GGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLR 581
                     + +     E  +++   + +G      ++   G L++ +   LG++P+ L 
Sbjct: 631  ALSANSSSLKMLKSSGKE--THVHIGMENGVYARIKIDTINGNLSNSRVKYLGSKPVNLS 688

Query: 582  TFSSKNTTH-VFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFP-DSLAIAKEG 639
                 N    V A S  P + Y        + +   ++++   F S     + +   K+ 
Sbjct: 689  VIKFSNEIEGVLAISSAPWISYLYRDSFKITPLLEIDITNGSSFISEDIGGEGIVGIKDN 748

Query: 640  ELTIGTI-------DDIQKLHIRSIPLGEHPRR-ICHQEQSRTFAI-----------CSL 680
             L I ++       D  Q L I +  L   PR+ I H +  R F             C++
Sbjct: 749  NLVIFSVGKEDSVFDPSQDLTITNTKLRYTPRKMITHGD--RLFISESEYNVQGPFKCNI 806

Query: 681  KNQSCAEESEMHF---------------VRLLDDQTFEFISTYPLDTFEYGCSILSCSFS 725
                     E ++               ++++DD+T + I +   +  E   S+   +F+
Sbjct: 807  NGNVKENVDEDYYEAFGYDRKQDSWASCIQVVDDKTNQVIQSLQFEENESIVSMSDVAFN 866

Query: 726  DD-SNVY---YCVGTAY---VLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNA 778
               S+V+     VG      +LP   E  K  +  F +    LQL+ + E       L  
Sbjct: 867  KSLSSVHASHLVVGVCTNQTILPNSYE--KSYLYTFKIGKKHLQLVHKTELDYIPQVLEN 924

Query: 779  FNGKLLAAINQKIQLYKWMLRDDGTRELQSECG---HHGHILALYVQTRGDFIVVGDLMK 835
            F  KLL A N  IQLY     D G ++L  +          +   +      I+      
Sbjct: 925  FQDKLLVASNNHIQLY-----DIGQKQLLKKSTTIIDFSQNINKIIPQSNRIIICDSHKS 979

Query: 836  SISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---------- 885
            SI    +   +      A D     ++++  LD D  +G +   N+F  R          
Sbjct: 980  SIIFAKFDDSQNQFVPFADDIMKRQITSIMNLDIDTLIGGDKFGNIFVTRIDEDISKQAD 1039

Query: 886  ------KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTV 939
                  K+ EG  +    +L+ + E+H+G+ +     G L       ++    +VI+  +
Sbjct: 1040 DDWTILKSQEGILNSCPYKLQNLIEFHIGDIITSLNLGCL-------NLAGTESVIYTGL 1092

Query: 940  NGVIGVIASLPHEQYL-FLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGD 998
             G IG++  L  +  +  L  LQ  +++    + G +H ++RS+ N       KN +DGD
Sbjct: 1093 QGTIGLLVPLVSKSEVELLFNLQLLMQQFQNNLVGKDHLKFRSYYNP-----IKNVIDGD 1147

Query: 999  LIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            L+E FL+   +   EIS+ +N SV ++ K++ +L
Sbjct: 1148 LLERFLEFDTSLRIEISRKLNKSVNDIEKKLIDL 1181


>gi|334326317|ref|XP_001364707.2| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Monodelphis domestica]
          Length = 1449

 Score =  100 bits (250), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 116/480 (24%), Positives = 200/480 (41%), Gaps = 74/480 (15%)

Query: 618  VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQS 672
            +    PF++   P   L   ++GEL I  +           +R IPL      + +  +S
Sbjct: 978  IDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVES 1037

Query: 673  RTFAICSLKNQSC-------AEESEMHFV----------------RLLDDQTFEFI--ST 707
            + +A+ +  N  C        EE E   +                +L+   ++E I  + 
Sbjct: 1038 KVYAVATSTNALCTRIPRMTGEEKEFETIERDERYIHPLQEAFSIQLISPVSWEAIPNAR 1097

Query: 708  YPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-----IVE 757
              L+ +E+   + + S   +  V     Y   GT  +  EE    +GRIL+      + E
Sbjct: 1098 IELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVT-CRGRILIMDVIEVVPE 1156

Query: 758  DG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECG 811
             G      K +++ EKE KG V +L   NG L++AI QKI L  W LR     EL     
Sbjct: 1157 PGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFL--WSLR---ASELTGMAF 1211

Query: 812  HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI 871
                +    + +  +FI+  D+MKSISLL Y+ E   +   +RD     + +V+ + D  
Sbjct: 1212 IDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDSA 1271

Query: 872  YLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV 928
             LG   ++ + NL       E        RL    ++H+G  VN F       R P    
Sbjct: 1272 QLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF------WRTPCRGA 1325

Query: 929  GQIPT-----------VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHE 977
             + P+             F T++G IG++  +  + Y  L  LQ  L  ++    GLN  
Sbjct: 1326 AEGPSKKSIVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPR 1385

Query: 978  QWRSFNNEKKTVD--AKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRL 1035
             +R  + +++ +    +N LDG+L+  +L LS     E++K +  + + +   + E+ R+
Sbjct: 1386 AFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTMERGELAKKIGTTPDIILDDLLEIDRV 1445



 Score = 44.7 bits (104), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 45/216 (20%), Positives = 99/216 (45%), Gaps = 35/216 (16%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAM-----GDVSDRIG 60
            +G + ++   +  G  +D L ++ +  K  V+++D  + +L T ++      ++ D   
Sbjct: 77  FFGNVMSMASVQLAGAKRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPELRDGFV 136

Query: 61  RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPF-------DNKGQLKEA----------FN 103
           +     ++ + DPD R   + +Y     V+PF       +++G + E            +
Sbjct: 137 QNVHTPRVRV-DPDGRCAVMLIYGTRLVVLPFRRESLAEEHEGLVGEGQKSSFLPSYIID 195

Query: 104 IR-LEE--LQVLDIKFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDFVEGP- 154
           +R L+E  L ++D++FL+G  +PT+++L++ N+      A    T  +     + ++   
Sbjct: 196 VRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNILQKVH 255

Query: 155 ---WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVY 187
              WS  NL       + VP P+ GV+I    +++Y
Sbjct: 256 PVIWSLTNLPFDCTQALAVPKPIGGVVIFAVNSLLY 291


>gi|395860104|ref|XP_003802355.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
            [Otolemur garnettii]
          Length = 1441

 Score =  100 bits (250), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 112/466 (24%), Positives = 193/466 (41%), Gaps = 74/466 (15%)

Query: 618  VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQS 672
            +    PF++   P   L   ++GEL I  +           +R IPL      + +  +S
Sbjct: 970  IDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVES 1029

Query: 673  RTFAICSLKNQSCAE-----------------------ESEMHFVRLLDDQTFEFI--ST 707
            + +A+ +  N  C                         + E   ++L+   ++E I  + 
Sbjct: 1030 KVYAVATSTNTPCTRIPRMTGEEKEFETIERDDRYIHPQQEAFSIQLISPVSWEAIPNAR 1089

Query: 708  YPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-----IVE 757
              L+ +E+   + + S   +  V     Y   GT  +  EE    +GRIL+      + E
Sbjct: 1090 IELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVT-CRGRILIMDVIEVVPE 1148

Query: 758  DG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECG 811
             G      K +++ EKE KG V +L   NG L++AI QKI L  W LR     EL     
Sbjct: 1149 PGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFL--WSLR---ASELTGMAF 1203

Query: 812  HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI 871
                +    + +  +FI+  D+MKSISLL Y+ E   +   +RD     + +V+ + D+ 
Sbjct: 1204 IDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNA 1263

Query: 872  YLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV 928
             LG   ++ + NL       E        RL    ++H+G  VN F       R P    
Sbjct: 1264 QLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF------WRTPCRGA 1317

Query: 929  GQIPT-----------VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHE 977
             + P+             F T++G IG++  +  + Y  L  LQ  L  ++    GLN  
Sbjct: 1318 TEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPR 1377

Query: 978  QWRSFNNEKKTVD--AKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
             +R  + +++ +    +N LDG+L+  +L LS     E++K +  +
Sbjct: 1378 AFRMLHVDRRVLQNAVRNVLDGELLNRYLYLSTMERGELAKKIGTT 1423



 Score = 46.6 bits (109), Expect = 0.074,   Method: Compositional matrix adjust.
 Identities = 62/326 (19%), Positives = 141/326 (43%), Gaps = 60/326 (18%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAM-----GDVSDRIG 60
            +G + ++   +  G  +D L ++ +  K  V+++D  + +L T ++      ++ D   
Sbjct: 74  FFGNVMSMASVQLAGAKRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPELRDGFV 133

Query: 61  RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPF-------DNKGQLKEA----------FN 103
           +     ++ + DPD R   + +Y     V+PF       +++G + E            +
Sbjct: 134 QNVHTPRVRV-DPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGEGQRSSFLPSYIID 192

Query: 104 IR-LEE--LQVLDIKFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDFVEG-- 153
           +R L+E  L ++D++FL+G  +PT+++L++ N+      A    T  +     +  +   
Sbjct: 193 VRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKVH 252

Query: 154 P--WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAFKAIPI 198
           P  WS  +L       + VP P+ GV+I    +++Y +                  A P+
Sbjct: 253 PVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFPL 312

Query: 199 RPSITKAYGRVDADGS--------RYLLGDHAGLLHLL-VITHEKEKVTGLKIELLGETS 249
           R   T+   R+  D +        + ++    G +++L +IT     V     +    + 
Sbjct: 313 R---TQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASV 369

Query: 250 IASTISYLDNAVVYIGSSYGDSQLIK 275
           + +++  ++   +++GS  G+S L+K
Sbjct: 370 LTTSMVTMEPGYLFLGSRLGNSLLLK 395



 Score = 39.7 bits (91), Expect = 8.7,   Method: Compositional matrix adjust.
 Identities = 41/172 (23%), Positives = 73/172 (42%), Gaps = 36/172 (20%)

Query: 288 EVLERYVNLGPIVDFCVVD---LERQGQG------QVVTCSGAYKDGSLRIVRNGIGINE 338
           EV +  +N+GP  +  + +   L  + Q       ++V CSG  K+G+L +++  I    
Sbjct: 463 EVCDSILNIGPCANAAMGEPAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQV 522

Query: 339 QASVELQGIKGMWSLRSST------------------------DDPFDTFLVVSFISETR 374
             + EL G   MW++ +S                         D     FL++S    T 
Sbjct: 523 VTTFELPGCYDMWTVIASVRKEEEETPKGEGTEQESGVPEGEEDGRRHGFLILSREDSTM 582

Query: 375 ILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQ-LVQVTSGSVRLVSSTSR 425
           IL      E+ E +  GF +Q  T+F  +   N+ +VQV+   +RL+   ++
Sbjct: 583 ILQTG--QEIMELDTSGFATQGPTVFAGNIGDNRYIVQVSPLGIRLLEGVNQ 632


>gi|241060959|ref|XP_002408050.1| cleavage and polyadenylation specificity factor, putative [Ixodes
            scapularis]
 gi|215492346|gb|EEC01987.1| cleavage and polyadenylation specificity factor, putative [Ixodes
            scapularis]
          Length = 1241

 Score =  100 bits (250), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 92/323 (28%), Positives = 155/323 (47%), Gaps = 35/323 (10%)

Query: 731  YYCVGTAYVLPEENEPTKGRILVFIVED-----------GKLQLIAEKETKGAVYSLNAF 779
            Y  +GT Y    E+  ++GRI +  + D            K++++  KE KG V +L+  
Sbjct: 916  YLALGTNYCY-GEDVTSRGRITILDIIDVVPEPGQPLTKNKIKIVYSKEQKGPVTALSQV 974

Query: 780  NGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISL 839
             G LL+AI QK  +Y W L+D+G   L         I    V T  + I+VGD+ KS+SL
Sbjct: 975  VGFLLSAIGQK--MYIWQLKDNG---LVGVAFIDTQIYIHSVVTVKNLILVGDVFKSVSL 1029

Query: 840  LIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG-----AENNFNLFTVRKNSEGATDE 894
            L Y+     +   +RD     + AVE   D+  +      +E N  L+  +  S  +   
Sbjct: 1030 LRYQEASRTLSLVSRDVRPLEVFAVEFFIDNSQMSFLVTDSERNMILYMYQPESRESCGG 1089

Query: 895  ERGRLEVVGEYHLGE-----FVNRFRHGSLV---MRLPDSDVGQIPTVIFGTVNGVIGVI 946
            +  RL   G++H+G      F  + R G +     RL  S  G+  T++  T++G +G +
Sbjct: 1090 Q--RLLRRGDFHIGSPVVSMFRIKCRMGEVAKHDRRLAASVDGRHITML-ATLDGSLGYV 1146

Query: 947  ASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTV--DAKNFLDGDLIESFL 1004
              +P + Y  L  LQ  L   +    GLN + +R ++++++ +    KN LDG+LI  F+
Sbjct: 1147 LPVPEKTYRRLLMLQNVLVTNMPHYAGLNPKAFRMYHSQRRVLGNPHKNILDGELIWKFM 1206

Query: 1005 DLSRTRMDEISKTMNVSVEELCK 1027
             LS     E+SK +  +V ++ +
Sbjct: 1207 HLSFMERSELSKKIGTTVTQVVQ 1229


>gi|157110889|ref|XP_001651294.1| cleavage and polyadenylation specificity factor cpsf [Aedes aegypti]
 gi|108883895|gb|EAT48120.1| AAEL000832-PA [Aedes aegypti]
          Length = 1417

 Score =  100 bits (249), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 129/538 (23%), Positives = 237/538 (44%), Gaps = 79/538 (14%)

Query: 561  KTGELTDRKKVSLGTQPITL-RTFSSKNTTH-VFAASDRPTVIYSSNK-----KLLYSNV 613
            + GE  D     +  + I++ R F++ N  + V    ++P ++  +++       LY+  
Sbjct: 886  QEGEKPDEHSTKIMYENISMIRYFNNVNGYNGVAVCGEKPYIMLLTSRGELRAHRLYAKT 945

Query: 614  NLKEVSHMCPFNSAAFPDS-LAIAKEGELTIG------TIDDIQKLHIRSIPLGEHPRRI 666
             +K  +   PFN+   P+  L   ++ EL I       + D I    +R IPL   P++I
Sbjct: 946  IMKGFA---PFNNVNCPNGFLYFDEQYELKIAVFPGYLSYDSIWP--VRKIPLRSSPKQI 1000

Query: 667  CHQEQSRTFAICSLKNQSC-------AEESEM----------------HFVRLLDDQTFE 703
             + ++++ + +     + C        E+ E+                  V L+    +E
Sbjct: 1001 VYHKENKVYCVVMDAEEVCNKYYRFNGEDKELTEENKGERFLYPMAHKFSVVLVTPSAWE 1060

Query: 704  FI--STYPLDTFEYGCSILSCSFSDDS-----NVYYCVGTAYVLPEENEPTKGRILVF-- 754
             I  ++  LD +E+  ++ + S S +        Y  VGT +    E+  ++GR+L++  
Sbjct: 1061 IIPETSINLDEWEHVIALKNVSLSYEGARSGFKEYIAVGTNFNY-SEDITSRGRLLLYDI 1119

Query: 755  ---IVEDGK------LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRE 805
               + E GK       + +  KE KG V ++   +G L+ A+ QK+  Y W L+DD   +
Sbjct: 1120 IEVVPEPGKPLTRYKFKEVIVKEQKGPVSAITHVSGFLVGAVGQKV--YLWQLKDD---D 1174

Query: 806  LQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVE 865
            L        +I    + +    I+V D+ KS+SLL ++ +   +   +RDY    +  +E
Sbjct: 1175 LVGVAFIDTNIFVHQLVSIKSLILVADVYKSVSLLRFQEDYRTLSLVSRDYQPLNVFQIE 1234

Query: 866  ILDDDIYLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR-FR-----H 916
             + D+  LG   ++   N+ T     E        RL    +YH+G+ +N  FR     H
Sbjct: 1235 YVVDNHNLGFLVSDEQCNIITYMYQPESRESFGGQRLLRKCDYHVGQKINSMFRVQCDFH 1294

Query: 917  GSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNH 976
                 R  +S+     T  F T++G IG +  LP + Y  L  LQ  L      + GLN 
Sbjct: 1295 EMDYKR--NSNYECKHTTYFATLDGGIGYVLPLPEKTYRRLFMLQNVLMTHSPHLCGLNP 1352

Query: 977  EQWRSFNNEKKTV--DAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            + +R+    KK     A+  +DGDLI +FL L      E++K +   ++++C  + E+
Sbjct: 1353 KAFRTIKTVKKLPINPARCVVDGDLIWTFLTLPANEKLEVAKKIGTRIDDICADLMEI 1410



 Score = 68.2 bits (165), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 117/548 (21%), Positives = 213/548 (38%), Gaps = 133/548 (24%)

Query: 1   MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMG--DVSDR 58
           M    ++G I +++     G  +D L I+ +  K  V+Q+D ++ EL T ++   +  D 
Sbjct: 68  MATYTLFGNIMSMQSVSLAGSQRDALLISFQDAKLSVVQFDPDNFELKTLSLHYFEEEDI 127

Query: 59  IGRPTDNGQIGII--DPDCRLIGLHLYDGLFKVIPFDNKGQLKE---------------- 100
            G  T +    I+  DPD R   + +Y     V+PF     L E                
Sbjct: 128 KGGWTGHYHTPIVRVDPDNRCAVMLVYGRKLVVLPFRKDSSLDEIEVQDVKPMKKAPTQL 187

Query: 101 --------AFNIRLEELQ-----VLDIKFLYGCAKPTIVVLYQDNKD------ARHVKTY 141
                   ++ I L+E +     V+DI+FL+G  +PT+++LY+  K        R     
Sbjct: 188 IAKTPILASYVIELKESEERIDNVIDIQFLHGYYEPTLLILYEPVKTFPGRIAVRSDTCM 247

Query: 142 EVALK---DKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS--------- 189
            VAL     +      W+ N L       I +  P+ G LI+    ++Y +         
Sbjct: 248 MVALSLNIQQRVHPVIWTVNCLPFDCLQAIAISKPIGGCLILSVNALIYLNQSVPPYGVS 307

Query: 190 ----ANAFKAIPIRPSITKAYGRVDADGS--------RYLLGDHAGLLHLLVITHEKEK- 236
               A+     P++P   +   R+  D +        + +L    G L++L +  +  + 
Sbjct: 308 LNSIADHCTNFPLKP---QDGVRISLDAAQVCFIEPEKLVLSLKGGELYVLTLCADSMRS 364

Query: 237 VTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAK------------- 283
           V          + +   I  ++   +++GS  G+S L++   + ++              
Sbjct: 365 VRSFHFSKAASSVLTCCICVVEEEYLFLGSRLGNSLLLRFKEKDESMVITIDDTEEVVEK 424

Query: 284 -------------------------GSYV-EVLERYVNLGPIVDFCV---VDLERQGQG- 313
                                     SY+ EV +  +N+GPI    V   +  E Q +  
Sbjct: 425 EPKRLRLEQEELEVYGSGQKTSVQLTSYIFEVCDSILNIGPIGHMAVGERISEEEQDENK 484

Query: 314 -----------QVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDPFD 362
                      ++VT SG  K+G+L +++N I      S  L G   +        D   
Sbjct: 485 DVQFVPNKLDLEIVTSSGHGKNGALCVLQNSIKPQVITSFGLSGCLDV--------DDMH 536

Query: 363 TFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQ-LVQVTSGSVRLVS 421
           +F+++S   E   + +   DE+ E E  GF +   T+   +   N+ +VQVT+ S+RL+ 
Sbjct: 537 SFMILS--QEAGTMVLQTGDEINEIENTGFATNVPTIHVGNIGGNRFIVQVTTKSIRLLQ 594

Query: 422 STSRELRN 429
            T R L+N
Sbjct: 595 GT-RLLQN 601


>gi|330932624|ref|XP_003303845.1| hypothetical protein PTT_16226 [Pyrenophora teres f. teres 0-1]
 gi|311319874|gb|EFQ88052.1| hypothetical protein PTT_16226 [Pyrenophora teres f. teres 0-1]
          Length = 1247

 Score =  100 bits (248), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 140/617 (22%), Positives = 243/617 (39%), Gaps = 85/617 (13%)

Query: 155 WSQNNLDNGADLLIPVP---PPLCGVLIIGEETIVY-----CSANAFK-AIPIRPSITKA 205
           WS+  +D  A+ L  VP       GVL  GE++I Y       +N  + AIP R   T+ 
Sbjct: 224 WSEP-VDRTANTLFRVPGGPNAPSGVLCCGEDSITYRRIFNNKSNVHRLAIPRREGATED 282

Query: 206 YGR-----------VDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTI 254
             R           +      YLL    G +  L +      V  +KI+      IA++I
Sbjct: 283 PNRKRMIIAGTLYSLKGGDFFYLLQTEDGDVFKLTVEAPSGTVDKIKIKYFDTIPIATSI 342

Query: 255 SYLDNAVVYIGSSYGDSQLIKL---------------NLQPDAKGSY------------V 287
             L    VY     GD  L +L                   D + S+            +
Sbjct: 343 CILRAGFVYAACESGDRILYELESLGDETEDPVFESDQFPVDPEASFAPPFFKPRALVNL 402

Query: 288 EVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVEL-QG 346
             +E   +L PI+   V +   +   QV T +G     S R  RN + + +     L Q 
Sbjct: 403 TAVESMPSLNPIMGMEVANPALEDAPQVYTINGTSGRSSFRTTRNALEVLDLIESPLPQN 462

Query: 347 IKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIY 406
              +W+ + +++D  DT +V+     +R L + + D++EE    GF   T TL       
Sbjct: 463 ASDVWTTKLTSEDEADTLIVLCL--HSRTLVLKIGDDVEEASNTGFLPDTNTLGVQQFGE 520

Query: 407 NQLVQVTSGSVRLV---------SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGH 457
           + ++Q+    +R +         +  +     +W  P   ++     N  QV +A   G 
Sbjct: 521 DCIIQIHPKGIRHIQGIQFPNDDAGATHANLMDWHPPAHRTIVACATNNRQVAIALSSGQ 580

Query: 458 LVYLEI-GDGILTEVKHA-QLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-P 514
           ++Y E   DG L   +    L+  I+CL +  + E    +   AVG  +D +VRIF+L P
Sbjct: 581 ILYFECDSDGSLAMAEEEISLDSTINCLAMPDVPEGSVRAFFLAVGC-SDQTVRIFNLSP 639

Query: 515 DL--NLITKEHLGGEIIPRSVLLCAF------EGIS-YLLCALGDGHLLNFLLNMKTGEL 565
           D+  N++    +     P S L           G S +L   L  G  +  +L+  TG++
Sbjct: 640 DMDGNILRSISVQALTSPPSDLTINLMTDRSPRGYSQFLHIGLRSGVYIRSVLDEMTGDI 699

Query: 566 TDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKK--LLYSNVNLKEVSHMCP 623
            D ++  LG +PI     +      + A + RP + Y+  +   L  + +N         
Sbjct: 700 GDTRRRFLGPEPIKFAKVTVAGEPGILAMTSRPWLGYTHPRTGVLQLTPLNYIPFKSAWN 759

Query: 624 FNSAAFPDSLAIAKEGELTIGTIDDI------QKLHIRSIP---LGEHPRRICHQEQSRT 674
           F+ + F   + ++   EL I T +D+      + + +R  P   +G H + + +  QS  
Sbjct: 760 FDGSQFKGIICVSAN-ELRIFTFNDLTDNTTYENISLRYTPRKMVGYHDQGVFYVIQSDN 818

Query: 675 FAICSLKNQSCAEESEM 691
             I + + Q    ++ M
Sbjct: 819 NTISADRRQQLIAQANM 835



 Score = 70.9 bits (172), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 73/287 (25%), Positives = 132/287 (45%), Gaps = 33/287 (11%)

Query: 755  IVEDGK-LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTREL--QSECG 811
            I  DG+ L+L  E        +L AF GKL+A + + + LY     D G + L  +++  
Sbjct: 954  ISPDGRELELFNETAVSEPPLALLAFKGKLIAGVGRHLCLY-----DCGMKSLLRKAQAP 1008

Query: 812  HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHE--EGAIEERARDYNANWMSAVEILDD 869
            +      + ++T+G  +VV D  +S++  ++K +     +     D  +   SA E+LD 
Sbjct: 1009 NSVPTRIVDIKTQGSRLVVSDQAQSVTYFVHKDQVHPNRLIPFVDDSISRHTSASEMLDY 1068

Query: 870  DIYLGAENNFNLFTVR---KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDS 926
            D  +G +   N++ VR   + SE + +   G   +V + +LG   NR     L+     +
Sbjct: 1069 DTTIGGDKFGNIWLVRCPKEISESSDESPDGSDLLVDKSYLGGTPNRL---DLIAHYFTN 1125

Query: 927  DV-----------GQIPTVIFGTVNGVIG-VIASLPHEQYLFLEKLQTNLRKVIKGVGGL 974
            D+           G    + +  + G +G +I       +   ++L+ +LR   K + G 
Sbjct: 1126 DIPVSIQKANLISGGDKVIFWAGLQGTLGALIPFTSRRNHKLFQQLELSLRSEDKPLSGR 1185

Query: 975  NHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
            +H  +RS+    K+V     +DGDLIE +L LSR + + I+  M  S
Sbjct: 1186 DHLTYRSYYAPVKSV-----IDGDLIERYLVLSRDKRESIAAQMTGS 1227


>gi|403302917|ref|XP_003942095.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
            [Saimiri boliviensis boliviensis]
          Length = 1390

 Score = 99.8 bits (247), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 113/460 (24%), Positives = 194/460 (42%), Gaps = 62/460 (13%)

Query: 618  VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQS 672
            V    PF++   P   L   ++GEL I  +           +R IPL      + +  +S
Sbjct: 919  VDSFAPFHNINCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVES 978

Query: 673  RTFAICSLKNQSCAE-----------------------ESEMHFVRLLDDQTFEFI--ST 707
            + +A+ +  N  C                         + E   ++L+   ++E I  + 
Sbjct: 979  KVYAVATSTNTPCTRIPRMTGEEKEFETIERDERYIHPQQEAFSIQLISPVSWEAIPNAR 1038

Query: 708  YPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-----IVE 757
              L  +E+   + + S   +  V     Y   GT  +  EE    +GRIL+      + E
Sbjct: 1039 IELQEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVT-CRGRILIMDVIEVVPE 1097

Query: 758  DG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECG 811
             G      K +++ EKE KG V +L   NG L++AI QKI  + W LR     EL     
Sbjct: 1098 PGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKI--FLWSLR---ASELTGMAF 1152

Query: 812  HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI 871
                +    + +  +FI+  D+MKSISLL Y+ E   +   +RD     + +V+ + D+ 
Sbjct: 1153 IDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNA 1212

Query: 872  YLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFR----HGSLVMRLP 924
             LG   ++ + NL       E        RL    ++H+G  VN F      G+      
Sbjct: 1213 QLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRGATEGLSK 1272

Query: 925  DSDVGQIPTVI-FGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFN 983
             S V +   +  F T++G IG++  +  + Y  L  LQ  L  ++    GLN   +R  +
Sbjct: 1273 KSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLH 1332

Query: 984  NEKKTVD--AKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
             +++T+    +N LDG+L+  +L LS     E++K +  +
Sbjct: 1333 VDRRTLQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTT 1372


>gi|432883539|ref|XP_004074300.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Oryzias latipes]
          Length = 1456

 Score = 99.8 bits (247), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 119/483 (24%), Positives = 208/483 (43%), Gaps = 80/483 (16%)

Query: 618  VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQS 672
            +    PF++   P   L   K+GEL I  +           +R IPL      + +  +S
Sbjct: 985  IESFSPFHNINCPKGFLYFNKQGELRISVLPTYLSYDAPWPVRKIPLRCTVHFVSYHVES 1044

Query: 673  RTFAICSLKNQSCA---------------EESEMHFVRLLDDQTFEFIS-----TYP--- 709
            + +A+C+   + C                E  E +   L +  + + IS     T P   
Sbjct: 1045 KVYAVCTSVKELCTRIPRMTGEEKEFETIERDERYINPLQEKFSIQLISPVSWETIPNTR 1104

Query: 710  --LDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-----IVE 757
              L+ +E+   + + +      V     Y   GT  VL  E    +GRIL+      + E
Sbjct: 1105 IDLEEWEHVTCMKTVALRSQETVSGLKGYIAAGTC-VLQGEEVTCRGRILILDVIEVVPE 1163

Query: 758  DG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDD---GTRELQS 808
             G      K +++ EKE KG V +L   +G L++AI QKI L  W L+D+   G   + +
Sbjct: 1164 PGQPLTKNKFKVLYEKEQKGPVTALCHCHGYLVSAIGQKIFL--WALKDNDLTGMAFIDT 1221

Query: 809  ECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILD 868
            +   H  I      +  +FI+  D+MKSISLL Y+ E   +   +RD     + ++E + 
Sbjct: 1222 QLYIHQMI------SIKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSIEFIV 1275

Query: 869  DDIYLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLP- 924
            D+  LG   ++ + NLF      E        RL    +++ G  +N       + R+P 
Sbjct: 1276 DNNQLGFLVSDRDKNLFVYMYLPEAKESFGGMRLLRRADFNAGAHINS------LWRMPC 1329

Query: 925  ----DSDVGQIPT------VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGL 974
                DS   +  T        F T++G IG++  +  + Y  L  LQ  L  ++    GL
Sbjct: 1330 RGALDSGSKKALTWDNKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGL 1389

Query: 975  NHEQWRSFNNEKKTVD--AKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            N + +R  ++ ++++    KN LDG+L+  +L LS     E++K +  + + +   + E+
Sbjct: 1390 NPKAFRMMHSNRRSLQNAVKNILDGELLAKYLYLSTMERSELAKKIGTTQDIILDDLLEI 1449

Query: 1033 TRL 1035
             R+
Sbjct: 1450 DRV 1452



 Score = 49.3 bits (116), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 62/323 (19%), Positives = 136/323 (42%), Gaps = 54/323 (16%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPT-- 63
           ++G + ++   +  G ++D L ++ +  K  V+++D  + +L T ++    +   R    
Sbjct: 74  LFGNVMSMASVQLTGASKDALLLSFKDAKLSVIEYDPGTHDLKTLSLHYFEEPELRDGFF 133

Query: 64  DNGQIGII--DPDCRLIGLHLYDGLFKVIPFDN---------------KGQLKEAFNIRL 106
            N  I I+  DP+ R   + +Y     V+PF                 K     ++ I +
Sbjct: 134 QNVHIPIVRVDPENRCAVMLIYGTKLVVLPFRKDTLSDEQEGGVGEGPKSSFLPSYIIDV 193

Query: 107 EEL-----QVLDIKFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDFVEGP-- 154
            EL      ++D+KFL+G  +PT+++L++ N+      A    T  +     + ++    
Sbjct: 194 RELDEKLLNIIDMKFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNIMQKVHP 253

Query: 155 --WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAFKAIPIR 199
             WS +NL      ++ VP P+ GV++    +++Y +              N   + P+R
Sbjct: 254 VIWSLSNLPFDCTQVMAVPKPIGGVVVFAVNSLLYLNQSVPPYGVSLNSQTNGTTSFPLR 313

Query: 200 PSITKAYGRVDADGSRYLLGD------HAGLLHLL-VITHEKEKVTGLKIELLGETSIAS 252
               +    +D   S ++  D        G +++L +IT     V     +    + + +
Sbjct: 314 VQ-EEVKITLDCCQSDFIAYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLTT 372

Query: 253 TISYLDNAVVYIGSSYGDSQLIK 275
            +  ++   +++GS  G+S L+K
Sbjct: 373 CMVTMEPGYLFLGSRLGNSLLLK 395



 Score = 40.4 bits (93), Expect = 4.9,   Method: Compositional matrix adjust.
 Identities = 42/172 (24%), Positives = 73/172 (42%), Gaps = 41/172 (23%)

Query: 288 EVLERYVNLGPIVDFCVVD---LERQGQG------QVVTCSGAYKDGSLRIVRNGIGINE 338
           EV +  +N+GP  +  + +   L  + Q       ++V CSG  K+G+L +++  I    
Sbjct: 469 EVCDSILNIGPCANASMGEPAFLSEEFQSNPEPDLEIVVCSGYGKNGALSVLQRSIRPQV 528

Query: 339 QASVELQGIKGMWSLRS-----------------------STDDPFDT------FLVVSF 369
             + EL G   MW++ S                        T+ P +       FL++S 
Sbjct: 529 VTTFELPGCHDMWTVISGEDKKESEGGEKEADAEKKEEQDKTEPPLEDDAKKHGFLILSR 588

Query: 370 ISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQ-LVQVTSGSVRLV 420
              T IL      E+ E +  GF +Q  T+F  +   NQ ++QV+   +RL+
Sbjct: 589 EDSTMILQTG--QEIMELDTSGFATQGPTVFAGNIGDNQYIIQVSPMGLRLL 638


>gi|358415280|ref|XP_003583063.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
            [Bos taurus]
          Length = 1490

 Score = 99.8 bits (247), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 111/466 (23%), Positives = 193/466 (41%), Gaps = 74/466 (15%)

Query: 618  VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQS 672
            +    PF++   P   L   ++GEL I  +           +R IPL      + +  +S
Sbjct: 1019 IDSFAPFHNINCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVES 1078

Query: 673  RTFAICSLKNQSCAE-----------------------ESEMHFVRLLDDQTFEFI--ST 707
            + +A+ +  +  C                         + E   ++L+   ++E I  + 
Sbjct: 1079 KVYAVATSTSTPCTRVPRMTGEEKEFETIERDERYVHPQQEAFCIQLISPVSWEAIPNAR 1138

Query: 708  YPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-----IVE 757
              L+ +E+   + + S   +  V     Y   GT  +  EE    +GRIL+      + E
Sbjct: 1139 IELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVT-CRGRILIMDVIEVVPE 1197

Query: 758  DG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECG 811
             G      K +++ EKE KG V +L   NG L++AI QKI L  W LR     EL     
Sbjct: 1198 PGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFL--WSLR---ASELTGMAF 1252

Query: 812  HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI 871
                +    + +  +FI+  D+MKSISLL Y+ E   +   +RD     + +V+ + D+ 
Sbjct: 1253 IDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNA 1312

Query: 872  YLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV 928
             LG   ++ + NL       E        RL    ++H+G  VN F       R P    
Sbjct: 1313 QLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF------WRTPCRGA 1366

Query: 929  GQIPT-----------VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHE 977
             + P+             F T++G IG++  +  + Y  L  LQ  L  ++    GLN  
Sbjct: 1367 AEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPR 1426

Query: 978  QWRSFNNEKKTVD--AKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
             +R  + +++ +    +N LDG+L+  +L LS     E++K +  +
Sbjct: 1427 AFRMLHVDRRVLQNAVRNVLDGELLNRYLYLSTMERGELAKKIGTT 1472



 Score = 46.6 bits (109), Expect = 0.064,   Method: Compositional matrix adjust.
 Identities = 63/326 (19%), Positives = 141/326 (43%), Gaps = 60/326 (18%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAM-----GDVSDRIG 60
            +G + ++   +  G  +D L ++ +  K  V+++D  + +L T ++      ++ D   
Sbjct: 123 FFGNVMSMASVQLAGAKRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPELRDGFV 182

Query: 61  RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPF-------DNKGQLKEA----------FN 103
           +     ++ + DPD R   + +Y     V+PF       +++G + E            +
Sbjct: 183 QNVHTPRVRV-DPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLVGEGQRSSFLPSYIID 241

Query: 104 IR-LEE--LQVLDIKFLYGCAKPTIVVLYQDN-----KDARHVKTYEVALKDKDFVEG-- 153
           +R L+E  L ++D++FL+G  +PT+++L++ N     K A    T  +     +  +   
Sbjct: 242 VRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGKVAVRQDTCSIVAISLNITQKVH 301

Query: 154 P--WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAFKAIPI 198
           P  WS  +L       + VP P+ GV+I    +++Y +                  A P+
Sbjct: 302 PVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFPL 361

Query: 199 RPSITKAYGRVDADGS--------RYLLGDHAGLLHLL-VITHEKEKVTGLKIELLGETS 249
           R   T+   R+  D +        + ++    G +++L +IT     V     +    + 
Sbjct: 362 R---TQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASV 418

Query: 250 IASTISYLDNAVVYIGSSYGDSQLIK 275
           + +++  ++   +++GS  G+S L+K
Sbjct: 419 LTTSMVTMEPGYLFLGSRLGNSLLLK 444


>gi|345482082|ref|XP_001607052.2| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Nasonia vitripennis]
          Length = 1415

 Score = 99.4 bits (246), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 123/519 (23%), Positives = 225/519 (43%), Gaps = 74/519 (14%)

Query: 580  LRTFSS-KNTTHVFAASDRPTVIYSSNKKLLYSN-VNLK-EVSHMCPFNSAAFPDS-LAI 635
            +R FS+      VF   D P  I+ + +  L ++ +N+   V    PFN+   P   L  
Sbjct: 902  MRYFSNIAGYNGVFIGGDYPHWIFLTGRGELRAHPMNIDGPVKSFAPFNNVNCPQGFLYF 961

Query: 636  AKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQSRTFAICS-----LKN---- 682
             ++ EL I  +           +R +PL   P  + +  +S+T+ + +     LK+    
Sbjct: 962  NRKDELRICVLPTHLSYDAPWPVRKVPLRCTPHFVTYHLESKTYCVVTSTAEPLKSYYRF 1021

Query: 683  -----QSCAEESEMHFVRLLDDQ----TFEFIS--TYP-----LDTFEYGCSILSCSFSD 726
                 +   EE    F+    +Q     F  +S  T P     LD +E+   + + S + 
Sbjct: 1022 NGEDKEFTEEERNERFLYPTQEQFSIVLFSPVSWDTIPNTKIDLDQWEHVTCLKNVSLAY 1081

Query: 727  DSNV-----YYCVGTAYVLPEENEPTKGRILVF-IVE----------DGKLQLIAEKETK 770
            +        Y  +GT Y   E+   ++GRI +F I+E            + + I  KE K
Sbjct: 1082 EGTRSGLKGYIVIGTNYNYGEDIT-SRGRIFIFDIIEVVPEPGQPLTKNRFKQIYAKEQK 1140

Query: 771  GAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVV 830
            G V ++   +G L++AI QKI  Y W L+D+   +L         I    + +    I+V
Sbjct: 1141 GPVTAITQVSGFLVSAIGQKI--YIWQLKDN---DLVGVAFIDTQIYVCQMLSIKSLILV 1195

Query: 831  GDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG-----AENNFNLFTVR 885
             D+ KS+SLL ++ E   +   +RD+    + A+E    +  LG      E+N ++F+ +
Sbjct: 1196 ADVYKSVSLLRFQPEYKTLSLVSRDFRTTEIYAIEYFIQNNELGFIVADGESNISIFSYQ 1255

Query: 886  KNSEGATDEERGRLEVVGEYHLGEFVNRFRH--------GSLVMRLPDSDVGQIPTVIFG 937
              S  +   ++  L    + HLG+ +N F           +   +   +D   +   ++ 
Sbjct: 1256 PESSQSLGGQK--LIRKADIHLGQKINTFFRIKCKTTDSANPTKQFSGADKRHV--TMYA 1311

Query: 938  TVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNN--EKKTVDAKNFL 995
            T++G +G I  +P + Y  L  LQ  L   I  + GLN + +R++ +    +   A+  +
Sbjct: 1312 TLDGSLGYILPVPEKTYRRLLMLQNVLVSHIYHIAGLNPKAFRTYKSCVRMQGNPARGII 1371

Query: 996  DGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTR 1034
            DGDL+  +LDL      EI+K +    +E+   + E+ +
Sbjct: 1372 DGDLVRKYLDLPVNEKIEIAKKIGTGAQEIMDDMHEIYK 1410



 Score = 53.5 bits (127), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 122/604 (20%), Positives = 232/604 (38%), Gaps = 137/604 (22%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRP--T 63
           ++G + +++  +  G  +D L ++    K  V+++D E   L T ++    +   +   T
Sbjct: 73  LHGNVMSMQAVQLIGSPRDSLLLSFREAKLSVVEYDPEIHSLRTVSLHYFEEEEIKDGWT 132

Query: 64  DNGQIGII--DPDCRLIGLHLYDGLFKVIPF------------------DNKGQLKEAFN 103
           ++  + I+  DP+ R   + +Y     V+PF                   +K  +  ++ 
Sbjct: 133 NHHHVPIVRVDPEGRCAVMLIYGRKLVVLPFRKDPILDEGDLIENPKSSSHKTPILSSYM 192

Query: 104 IRLEELQ-----VLDIKFLYGCAKPTIVVLYQDNKD------ARHVKTYEVALK---DKD 149
           I L+ L+     ++D++FL+G  +PT+++LY+  +        R      VA+     + 
Sbjct: 193 IVLKSLEEKMDNIIDLQFLHGYYEPTLLILYEPVRTFAGRIAVRQDTCAMVAISLNIQQK 252

Query: 150 FVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVY---------CSANAFK----AI 196
                WS +NL       + V  PL G LI+   +++Y          S N+        
Sbjct: 253 VHPIIWSVSNLPFDCYQAVAVKKPLGGTLIMAVNSLIYLNQSIPPYGVSLNSLTDNCTNF 312

Query: 197 PIRPS----ITKAYGRVD-ADGSRYLLGDHAGLLHLL-VITHEKEKVTGLKIELLGETSI 250
           P++P     I+    +V      R ++    G L++L +       V G   +    + +
Sbjct: 313 PLKPQEGVKISLESSQVAFISPDRLVISLKTGELYVLSLFADSMRSVRGFHFDKAAASVL 372

Query: 251 ASTISYLDNAVVYIGSSYG----------------DSQLIKLNL-------QPDAK---- 283
            S +   D+  +++GS  G                D  +++++L       QP  K    
Sbjct: 373 TSCVCLCDDNYLFLGSRLGNSLLLRFTEKESEKINDISMLEMSLNSSNSQEQPTKKIKLD 432

Query: 284 --------------------------------GSYV-EVLERYVNLGPIVD-------FC 303
                                            SY+ EV +  +N+GP  +       F 
Sbjct: 433 YLEDWMASDVLDIKDPEELEVYGSETQTSIQITSYIFEVCDSLLNIGPCGNISMGEPAFL 492

Query: 304 VVDLERQGQGQV--VTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSST--DD 359
             +     +  V  VT SG  K+G+L +++  I      + +L G + +W++  ST  D+
Sbjct: 493 SEEFSNNSEPDVELVTTSGYGKNGALCVLQRSIRPQVITTFDLPGYENIWTVIDSTVSDN 552

Query: 360 PFDT-------FLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQ-LVQ 411
              T       FL+++    T +L    E   E  +  GF +Q  T+F  +   N+ ++Q
Sbjct: 553 RAKTETEGTHGFLILTQDDSTMVLQTGQEIN-EVVDQSGFSTQGTTIFAGNLGSNRYIIQ 611

Query: 412 VTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEV 471
           VT   VRL+     E         G  +  A+     V L +  G +V L + +G  T  
Sbjct: 612 VTQMGVRLLQGL--EQIQHMPMDLGCPIVHASCADPYVSLLSEDGQVVLLTLREGRGTAR 669

Query: 472 KHAQ 475
            HAQ
Sbjct: 670 LHAQ 673


>gi|440904368|gb|ELR54893.1| Cleavage and polyadenylation specificity factor subunit 1, partial
            [Bos grunniens mutus]
          Length = 1417

 Score = 99.4 bits (246), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 111/466 (23%), Positives = 193/466 (41%), Gaps = 74/466 (15%)

Query: 618  VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQS 672
            +    PF++   P   L   ++GEL I  +           +R IPL      + +  +S
Sbjct: 963  IDSFAPFHNINCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVES 1022

Query: 673  RTFAICSLKNQSCAE-----------------------ESEMHFVRLLDDQTFEFI--ST 707
            + +A+ +  +  C                         + E   ++L+   ++E I  + 
Sbjct: 1023 KVYAVATSTSTPCTRVPRMTGEEKEFETIERDERYVHPQQEAFCIQLISPVSWEAIPNAR 1082

Query: 708  YPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-----IVE 757
              L+ +E+   + + S   +  V     Y   GT  +  EE    +GRIL+      + E
Sbjct: 1083 IELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVT-CRGRILIMDVIEVVPE 1141

Query: 758  DG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECG 811
             G      K +++ EKE KG V +L   NG L++AI QKI L  W LR     EL     
Sbjct: 1142 PGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFL--WSLR---ASELTGMAF 1196

Query: 812  HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI 871
                +    + +  +FI+  D+MKSISLL Y+ E   +   +RD     + +V+ + D+ 
Sbjct: 1197 IDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNA 1256

Query: 872  YLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV 928
             LG   ++ + NL       E        RL    ++H+G  VN F       R P    
Sbjct: 1257 QLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF------WRTPCRGA 1310

Query: 929  GQIPT-----------VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHE 977
             + P+             F T++G IG++  +  + Y  L  LQ  L  ++    GLN  
Sbjct: 1311 AEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPR 1370

Query: 978  QWRSFNNEKKTVD--AKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
             +R  + +++ +    +N LDG+L+  +L LS     E++K +  +
Sbjct: 1371 AFRMLHVDRRVLQNAVRNVLDGELLNRYLYLSPMERGELAKKIGTT 1416



 Score = 42.0 bits (97), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 52/256 (20%), Positives = 110/256 (42%), Gaps = 54/256 (21%)

Query: 71  IDPDCRLIGLHLYDGLFKVIPF-------DNKGQLKEA----------FNIR-LEE--LQ 110
           +DPD R   + +Y     V+PF       +++G + E            ++R L+E  L 
Sbjct: 136 VDPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLVGEGQRSSFLPSYIIDVRALDEKLLN 195

Query: 111 VLDIKFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDFVEG--P--WSQNNLD 161
           ++D++FL+G  +PT+++L++ N+      A    T  +     +  +   P  WS  +L 
Sbjct: 196 IVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKVHPVIWSLTSLP 255

Query: 162 NGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAFKAIPIRPSITKAYGR 208
                 + VP P+ GV+I    +++Y +                  A P+R   T+   R
Sbjct: 256 FDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFPLR---TQEGVR 312

Query: 209 VDADGS--------RYLLGDHAGLLHLL-VITHEKEKVTGLKIELLGETSIASTISYLDN 259
           +  D +        + ++    G +++L +IT     V     +    + + +++  ++ 
Sbjct: 313 ITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLTTSMVTMEP 372

Query: 260 AVVYIGSSYGDSQLIK 275
             +++GS  G+S L+K
Sbjct: 373 GYLFLGSRLGNSLLLK 388


>gi|27807297|ref|NP_777145.1| cleavage and polyadenylation specificity factor subunit 1 [Bos
            taurus]
 gi|1706101|sp|Q10569.1|CPSF1_BOVIN RecName: Full=Cleavage and polyadenylation specificity factor subunit
            1; AltName: Full=Cleavage and polyadenylation specificity
            factor 160 kDa subunit; Short=CPSF 160 kDa subunit
 gi|929007|emb|CAA58152.1| cleavage and polyadenylation specificity factor, 160 kDa subunit [Bos
            taurus]
 gi|296480730|tpg|DAA22845.1| TPA: cleavage and polyadenylation specificity factor subunit 1 [Bos
            taurus]
          Length = 1444

 Score = 99.0 bits (245), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 111/466 (23%), Positives = 193/466 (41%), Gaps = 74/466 (15%)

Query: 618  VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQS 672
            +    PF++   P   L   ++GEL I  +           +R IPL      + +  +S
Sbjct: 973  IDSFAPFHNINCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVES 1032

Query: 673  RTFAICSLKNQSCAE-----------------------ESEMHFVRLLDDQTFEFI--ST 707
            + +A+ +  +  C                         + E   ++L+   ++E I  + 
Sbjct: 1033 KVYAVATSTSTPCTRVPRMTGEEKEFETIERDERYVHPQQEAFCIQLISPVSWEAIPNAR 1092

Query: 708  YPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-----IVE 757
              L+ +E+   + + S   +  V     Y   GT  +  EE    +GRIL+      + E
Sbjct: 1093 IELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVT-CRGRILIMDVIEVVPE 1151

Query: 758  DG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECG 811
             G      K +++ EKE KG V +L   NG L++AI QKI L  W LR     EL     
Sbjct: 1152 PGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFL--WSLR---ASELTGMAF 1206

Query: 812  HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI 871
                +    + +  +FI+  D+MKSISLL Y+ E   +   +RD     + +V+ + D+ 
Sbjct: 1207 IDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNA 1266

Query: 872  YLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV 928
             LG   ++ + NL       E        RL    ++H+G  VN F       R P    
Sbjct: 1267 QLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF------WRTPCRGA 1320

Query: 929  GQIPT-----------VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHE 977
             + P+             F T++G IG++  +  + Y  L  LQ  L  ++    GLN  
Sbjct: 1321 AEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPR 1380

Query: 978  QWRSFNNEKKTVD--AKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
             +R  + +++ +    +N LDG+L+  +L LS     E++K +  +
Sbjct: 1381 AFRMLHVDRRVLQNAVRNVLDGELLNRYLYLSTMERGELAKKIGTT 1426



 Score = 46.6 bits (109), Expect = 0.079,   Method: Compositional matrix adjust.
 Identities = 62/326 (19%), Positives = 141/326 (43%), Gaps = 60/326 (18%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAM-----GDVSDRIG 60
            +G + ++   +  G  +D L ++ +  K  V+++D  + +L T ++      ++ D   
Sbjct: 77  FFGNVMSMASVQLAGAKRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPELRDGFV 136

Query: 61  RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPF-------DNKGQLKEA----------FN 103
           +     ++ + DPD R   + +Y     V+PF       +++G + E            +
Sbjct: 137 QNVHTPRVRV-DPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLVGEGQRSSFLPSYIID 195

Query: 104 IR-LEE--LQVLDIKFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDFVEG-- 153
           +R L+E  L ++D++FL+G  +PT+++L++ N+      A    T  +     +  +   
Sbjct: 196 VRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKVH 255

Query: 154 P--WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAFKAIPI 198
           P  WS  +L       + VP P+ GV+I    +++Y +                  A P+
Sbjct: 256 PVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFPL 315

Query: 199 RPSITKAYGRVDADGS--------RYLLGDHAGLLHLL-VITHEKEKVTGLKIELLGETS 249
           R   T+   R+  D +        + ++    G +++L +IT     V     +    + 
Sbjct: 316 R---TQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASV 372

Query: 250 IASTISYLDNAVVYIGSSYGDSQLIK 275
           + +++  ++   +++GS  G+S L+K
Sbjct: 373 LTTSMVTMEPGYLFLGSRLGNSLLLK 398


>gi|348555854|ref|XP_003463738.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
            isoform 1 [Cavia porcellus]
          Length = 1440

 Score = 99.0 bits (245), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 111/466 (23%), Positives = 193/466 (41%), Gaps = 74/466 (15%)

Query: 618  VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQS 672
            +    PF++   P   L   ++GEL I  +           +R IPL      + +  +S
Sbjct: 969  IDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVES 1028

Query: 673  RTFAICSLKNQSCAE-----------------------ESEMHFVRLLDDQTFEFI--ST 707
            + +A+ +  +  C                         + E   ++L+   ++E I  + 
Sbjct: 1029 KVYAVATSTSTPCTRIPRMTGEEKEFEAIERDDRYIHPQQEAFSIQLISPVSWEAIPNAR 1088

Query: 708  YPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-----IVE 757
              L+ +E+   + + S   +  V     Y   GT  +  EE    +GRIL+      + E
Sbjct: 1089 IELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVT-CRGRILIMDVIEVVPE 1147

Query: 758  DG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECG 811
             G      K +++ EKE KG V +L   NG L++AI QKI L  W LR     EL     
Sbjct: 1148 PGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFL--WSLR---ASELTGMAF 1202

Query: 812  HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI 871
                +    + +  +FI+  D+MKSISLL Y+ E   +   +RD     + +V+ + D+ 
Sbjct: 1203 IDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNA 1262

Query: 872  YLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV 928
             LG   ++ + NL       E        RL    ++H+G  VN F       R P    
Sbjct: 1263 QLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF------WRTPCRGA 1316

Query: 929  GQIPT-----------VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHE 977
             + P+             F T++G IG++  +  + Y  L  LQ  L  ++    GLN  
Sbjct: 1317 TEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPR 1376

Query: 978  QWRSFNNEKKTVD--AKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
             +R  + +++ +    +N LDG+L+  +L LS     E++K +  +
Sbjct: 1377 AFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTMERGELAKKIGTT 1422



 Score = 50.8 bits (120), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 98/495 (19%), Positives = 190/495 (38%), Gaps = 145/495 (29%)

Query: 71  IDPDCRLIGLHLYDGLFKVIPF-------DNKGQLKEA----------FNIR-LEE--LQ 110
           +DPD R   + +Y     V+PF       +++G + E            ++R L+E  L 
Sbjct: 142 VDPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLVGEGQRSSFLPSYIIDVRALDEKLLN 201

Query: 111 VLDIKFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDFVEG--P--WSQNNLD 161
           ++D++FL+G  +PT+++L++ N+      A    T  +     +  +   P  WS  +L 
Sbjct: 202 IVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKVHPVIWSLTSLP 261

Query: 162 NGADLLIPVPPPLCGVLIIGEETIVYCSAN-------------AFKAIPIRPSITKAYGR 208
                 + VP P+ GV+I    +++Y + +                A P+R   T+   R
Sbjct: 262 FDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTLGTTAFPLR---TQEGVR 318

Query: 209 VDADGS--------RYLLGDHAGLLHLL-VITHEKEKVTGLKIELLGETSIASTISYLDN 259
           +  D +        + ++    G +++L +IT     V     +    + + +++  ++ 
Sbjct: 319 ITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLTTSMVTMEP 378

Query: 260 AVVYIGSSYGDSQLIKL--NLQ---------------PDAKGSYVE-------------- 288
             +++GS  G+S L+K    LQ               P +K   V+              
Sbjct: 379 GYLFLGSRLGNSLLLKYTEKLQEPPASTVREAADKEEPPSKKKRVDSTAGWAGSKTVPQD 438

Query: 289 ------------------------VLERYVNLGPIVDFCVVD---LERQGQG------QV 315
                                   V +  +N+GP  +  V +   L  + Q       ++
Sbjct: 439 EVDEIEVYGSEAQSGTQLATYSFEVCDSMLNIGPCANAAVGEPAFLSEEFQNSPEPDLEI 498

Query: 316 VTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL--------------RSSTDDPF 361
           V CSG  K+G+L +++  I      + EL G   MW++                S  +P 
Sbjct: 499 VVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEEEETPKAEGSEQEPS 558

Query: 362 DT----------FLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQ-LV 410
                       FL++S    T IL      E+ E +  GF +Q  T+F  +   N+ +V
Sbjct: 559 APEAEDDGRRHGFLILSREDSTMILQTG--QEIMELDTSGFATQGPTVFAGNIGDNRYIV 616

Query: 411 QVTSGSVRLVSSTSR 425
           QV+   +RL+   ++
Sbjct: 617 QVSPLGIRLLEGVNQ 631


>gi|402588688|gb|EJW82621.1| splicing factor 3b [Wuchereria bancrofti]
          Length = 601

 Score = 99.0 bits (245), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 117/515 (22%), Positives = 209/515 (40%), Gaps = 79/515 (15%)

Query: 7   YGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTDN 65
           +G I +L  FR  G ++D++ I ++  +  +L+++A+      R   +   + G R    
Sbjct: 90  FGIIRSLLAFRLTGGSKDYIVIGSDAGRIVILEYNAQKV-CFERVHQETFGKTGCRRIVP 148

Query: 66  GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YGC 120
           GQ   +DP  R  LIG      L  ++  D    L  +  +   +   +    +    G 
Sbjct: 149 GQFLAVDPKGRAILIGAVERQKLVYIMNRDASANLTISSPLEAHKSHCICYSVVGVDVGF 208

Query: 121 AKPTIVVLYQDNKDARHVKTYEVALK--------DKDFV----EGPWS----------QN 158
             PT   L  D ++  H  T  +A K        + D V    +GP              
Sbjct: 209 ENPTFACLEVDYEEVDHDPTGHLATKIPQTLTFYELDLVPGGQDGPSGVIVCCENYLVYK 268

Query: 159 NLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLL 218
           NL +  D+  PVP     +       I+ C+A     +                   +L+
Sbjct: 269 NLGDQPDIKCPVPRRRNELDDCDRTVIIVCAATHKTKLMYF----------------FLV 312

Query: 219 GDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNL 278
               G +  + +  E + VT LKI+      +++ +  L    ++  S +G+  L ++  
Sbjct: 313 QTDQGDIFKVTLESEHDIVTELKIKYFDTIPVSNAMCILKTGFLFTASEFGNHHLYQIAH 372

Query: 279 QPDAK----------------------GSYV-EVLERYVN-----LGPIVDFCVVD-LER 309
             D                        GS + ++L R++      L      C++D L  
Sbjct: 373 LGDEDDEPEFSSRMQLEEGETFFLLLVGSQIWQLLTRWIAYHRLFLPTFKVHCMIDDLAN 432

Query: 310 QGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTDDPFDTFLVVS 368
           +   Q+ T  G     +++++RNG+ + E A  EL G    +W+++ + DD FD+ +VVS
Sbjct: 433 EDSPQIYTLVGRGALSAVKVLRNGLEVTEMAVSELPGNPNAVWTVKRNIDDKFDSHIVVS 492

Query: 369 FISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELR 428
           F++ T +L++   + +EE    GF   T TL C     + L+QV    +R + +  R   
Sbjct: 493 FVNATLVLSIG--ETVEEVTDSGFLGTTPTLGCALIGDDALLQVYPDGIRHIRADRRV-- 548

Query: 429 NEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI 463
           NEWK+P   ++     N  QV +A  GG LVY E+
Sbjct: 549 NEWKAPGKRTIMKCALNRRQVAIALAGGELVYFEL 583


>gi|307190910|gb|EFN74734.1| Cleavage and polyadenylation specificity factor subunit 1 [Camponotus
            floridanus]
          Length = 1418

 Score = 99.0 bits (245), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 88/332 (26%), Positives = 156/332 (46%), Gaps = 39/332 (11%)

Query: 731  YYCVGTAYVLPEENEPTKGRILVF-IVE----------DGKLQLIAEKETKGAVYSLNAF 779
            Y  +GT Y   E+   ++GRIL+F I+E            + + I  KE KG + ++   
Sbjct: 1093 YIVLGTNYNYGEDIT-SRGRILIFDIIEVVPEPGQPLTKNRFKQIYAKEQKGPITAITQV 1151

Query: 780  NGKLLAAINQKIQLYKWMLRDD---GTRELQSECGHHGHILALYVQTRGDFIVVGDLMKS 836
            +G L++A+ QKI  Y W L+D+   G   + ++   H  +L++        I++ D+ KS
Sbjct: 1152 SGFLVSAVGQKI--YIWQLKDNDLVGVAFIDTQIYIH-QMLSI-----KSLILIADVYKS 1203

Query: 837  ISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG-----AENNFNLFTVRKNSEGA 891
            ISLL ++ E   +   +RD+    +  +E L D+  LG      E+N  LF  +  S  +
Sbjct: 1204 ISLLRFQEEYRTLSLVSRDFRPAEVYTIEYLIDNTNLGFFLADGESNLALFMYQPESRES 1263

Query: 892  TDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIP------TVIFGTVNGVIGV 945
               ++  L    ++HLG+ VN F      +  P +D  Q          ++ T++G +G 
Sbjct: 1264 LGGQK--LIRKADFHLGQKVNTFFRIRCRVSDPANDKKQFSGADKRHVTMYATLDGSLGY 1321

Query: 946  IASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNN---EKKTVDAKNFLDGDLIES 1002
            I  +P + Y  L  LQ  L   I  + GLN + +R         +   A+  +DGDL+  
Sbjct: 1322 ILPVPEKTYRRLLMLQNVLVTHICHIAGLNPKSYRQTYKSYIRNQGNPARGIIDGDLVWR 1381

Query: 1003 FLDLSRTRMDEISKTMNVSVEELCKRVEELTR 1034
            +L L      +++K +   V+E+ + + E+ R
Sbjct: 1382 YLFLPNNEKTDVAKKIGTRVQEIIEDITEIDR 1413



 Score = 55.5 bits (132), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 67/327 (20%), Positives = 139/327 (42%), Gaps = 56/327 (17%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRP--T 63
           ++G I +++     G  +D L ++    K  V+++D +  +L T ++    +   +   T
Sbjct: 73  LHGNIMSMQAVHLIGSQRDSLLLSFRDAKLSVVEYDQDIHDLRTVSLHYFEEEEIKDGWT 132

Query: 64  DNGQIGII--DPDCRLIGLHLYDGLFKVIPF-------------------DNKGQLKEAF 102
           ++  I I+  DP+ R   + +Y     V+PF                    NK  +  ++
Sbjct: 133 NHHHIPIVRVDPEGRCAIMLIYGRKLVVLPFRKDPSLDDGDLLDSAKLTSSNKTPILSSY 192

Query: 103 NIRLEELQ-----VLDIKFLYGCAKPTIVVLYQDNKD------ARHVKTYEVALK---DK 148
            I L+ L+     V+D++FLYG  +PT+++LY+  +        R      VA+     +
Sbjct: 193 MIVLKTLEEKMDNVIDLQFLYGYYEPTLLILYEPVRTFSGRIAVRQDTCAMVAISLNIQQ 252

Query: 149 DFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAFKA 195
                 WS +NL      ++PV  PL G LI+   +++Y +             A+    
Sbjct: 253 RVHPIIWSVSNLPFDCYQVVPVKKPLGGTLIMAVNSLIYLNQSIPPYGVSLNSLADTSTN 312

Query: 196 IPIRP----SITKAYGRVD-ADGSRYLLGDHAGLLHLL-VITHEKEKVTGLKIELLGETS 249
            P++P     ++    +V    G R ++   +G L++L +       V G   +    + 
Sbjct: 313 FPLKPQEGVKMSLEGSQVAFISGDRLVISLKSGELYVLSLFADSMRSVRGFHFDKAAASV 372

Query: 250 IASTISYLDNAVVYIGSSYGDSQLIKL 276
           + S +   ++  +++GS  G+S L++ 
Sbjct: 373 LTSCVCMCEDNYLFLGSRLGNSLLLRF 399



 Score = 42.4 bits (98), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 34/117 (29%), Positives = 58/117 (49%), Gaps = 12/117 (10%)

Query: 314 QVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP---------FDTF 364
           ++VT SG  K+G+L +++  I      + EL G + MW++  + ++             F
Sbjct: 506 ELVTTSGYGKNGALCVLQRSIRPQVVTTFELPGCEDMWTVIGTLNNDEQVKAEAEGSHAF 565

Query: 365 LVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQ-LVQVTSGSVRLV 420
           L++S    T IL      E+ E +  GF +Q  T+F  +   N+ +VQVT   VRL+
Sbjct: 566 LILSQEDSTMILQTG--QEINEVDQSGFSTQGSTVFAGNLGANRYIVQVTQMGVRLL 620


>gi|229335612|ref|NP_001108153.2| cleavage and polyadenylation specificity factor subunit 1 [Danio
            rerio]
          Length = 1449

 Score = 98.6 bits (244), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 114/477 (23%), Positives = 209/477 (43%), Gaps = 68/477 (14%)

Query: 618  VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQS 672
            +    PF++   P   L   K+GEL I  +           +R IPL      + +  +S
Sbjct: 978  IESFSPFHNINCPKGFLYFNKQGELRISVLPTYLSYDAPWPVRKIPLRCTVHYVSYHVES 1037

Query: 673  RTFAICSLKNQSC-------AEESEMHFV----------------RLLDDQTFEFI--ST 707
            + +A+C+   + C        EE E   +                +L+   ++E I  + 
Sbjct: 1038 KVYAVCTSVKEPCTRIPRMTGEEKEFETIERDERYIHPQQDKFSIQLISPVSWEAIPNTR 1097

Query: 708  YPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-----IVE 757
              L+ +E+   + + +      V     Y  +GT  +  EE    +GRIL+      + E
Sbjct: 1098 VDLEEWEHVTCMKTVALKSQETVSGLKGYVALGTCLMQGEEVT-CRGRILILDVIEVVPE 1156

Query: 758  DG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDD---GTRELQS 808
             G      K +++ EKE KG V +L   +G L++AI QKI L  W L+D+   G   + +
Sbjct: 1157 PGQPLTKNKFKVLYEKEQKGPVTALCHCSGFLVSAIGQKIFL--WSLKDNDLTGMAFIDT 1214

Query: 809  ECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILD 868
            +     +I  +Y  +  +FI+  D+MKSISLL Y+ E   +   +RD     + ++E + 
Sbjct: 1215 QL----YIHQMY--SIKNFILAADVMKSISLLRYQPESKTLSLVSRDAKPLEVYSIEFMV 1268

Query: 869  DDIYLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPD 925
            D+  LG   ++ + NL       E        RL    ++++G  VN F        L  
Sbjct: 1269 DNNQLGFLVSDRDKNLMVYMYLPEAKESFGGMRLLRRADFNVGSHVNAFWRMPCRGTLDT 1328

Query: 926  SDVGQIP-----TVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWR 980
            ++   +         F T++G +G++  +  + Y  L  LQ  L  ++    GLN + +R
Sbjct: 1329 ANKKALTWDNKHITWFATLDGGVGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPKAFR 1388

Query: 981  SFNNEKKTVD--AKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRL 1035
              + +++T+    KN LDG+L+  +L LS     E++K +  + + +   + E+ R+
Sbjct: 1389 MLHCDRRTLQNAVKNILDGELLNKYLYLSTMERSELAKKIGTTPDIILDDLLEIERV 1445



 Score = 61.6 bits (148), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 104/565 (18%), Positives = 212/565 (37%), Gaps = 148/565 (26%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRP--T 63
           ++G + ++   +  G  +D L ++ +  K  V+++D  + +L T ++    +   R    
Sbjct: 74  LFGNVMSMASVQLVGTNRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPELRDGFV 133

Query: 64  DNGQIGII--DPDCRLIGLHLYDGLFKVIPF---------------DNKGQLKEAFNIRL 106
            N  I ++  DP+ R   + +Y     V+PF                 K     ++ I +
Sbjct: 134 QNVHIPMVRVDPENRCAVMLVYGTCLVVLPFRKDTLADEQEGIVGEGQKSSFLPSYIIDV 193

Query: 107 EEL-----QVLDIKFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDFVEGP-- 154
            EL      ++D+KFL+G  +PT+++L++ N+      A    T  +     + ++    
Sbjct: 194 RELDEKLLNIIDMKFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNIMQKVHP 253

Query: 155 --WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAFKAIPIR 199
             WS +NL    + ++ VP P+ GV++    +++Y +              N   A P+R
Sbjct: 254 VIWSLSNLPFDCNQVMAVPKPIGGVVVFAVNSLLYLNQSVPPFGVSLNSLTNGTTAFPLR 313

Query: 200 PSITKAYGRVDADGSRYLLGD------HAGLLHLL-VITHEKEKVTGLKIELLGETSIAS 252
           P   +    +D   + ++  D        G +++L +IT     V     +    + + +
Sbjct: 314 PQ-EEVKITLDCSQASFITSDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLTT 372

Query: 253 TISYLDNAVVYIGSSYGDSQLIKLN------------------LQPDAKGSYV------- 287
            +  ++   +++GS  G+S L++                     QP  K   V       
Sbjct: 373 CMMTMEPGYLFLGSRLGNSLLLRYTEKLQETPMEEGKENEEKEKQPPNKKKRVDSNWAGC 432

Query: 288 -------------------------------EVLERYVNLGPIVDFCVVD---LERQGQG 313
                                          EV +  +N+GP     + +   L  + Q 
Sbjct: 433 PGKGNLPDELDEIEVYGSEAQSGTQLATYSFEVCDSILNIGPCASASMGEPAFLSEEFQT 492

Query: 314 ------QVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDPFDT---- 363
                 +VV CSG  K+G+L +++  I      + EL G   MW++    + P       
Sbjct: 493 NPEPDLEVVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCHDMWTVIYCEEKPEKPSAEG 552

Query: 364 ----------------------FLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFC 401
                                 FL++S    T IL      E+ E +  GF +Q  T++ 
Sbjct: 553 DGESPEEEKREPTIEDDKKKHGFLILSREDSTMILQTG--QEIMELDTSGFATQGPTVYA 610

Query: 402 HDAIYNQ-LVQVTSGSVRLVSSTSR 425
            +   N+ ++QV+   +RL+   ++
Sbjct: 611 GNIGDNKYIIQVSPMGIRLLEGVNQ 635


>gi|242021233|ref|XP_002431050.1| Cleavage and polyadenylation specificity factor 160 kDa subunit,
            putative [Pediculus humanus corporis]
 gi|212516279|gb|EEB18312.1| Cleavage and polyadenylation specificity factor 160 kDa subunit,
            putative [Pediculus humanus corporis]
          Length = 1409

 Score = 98.6 bits (244), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 106/438 (24%), Positives = 194/438 (44%), Gaps = 66/438 (15%)

Query: 654  IRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAE------ESEMHFVRLLDDQ------- 700
            +R +PL   P  + +  +S+T+ + +  ++   E      E + H V   DD+       
Sbjct: 978  VRKVPLRCTPHMVTYHLESKTYCLITSSSEPSNEYFRFNGEDKEHSVEDRDDRFPLPLQD 1037

Query: 701  TFEFISTYP------------LDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEE 743
             F  +   P            LD +E+   + + + S +        Y  VGT Y    E
Sbjct: 1038 KFSIVLFSPVSWEVIPNTKMELDEWEHVTCVKTVNLSYEGTRSGLKGYVAVGTNYNY-SE 1096

Query: 744  NEPTKGRILVF-IVE----------DGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQ 792
            +  +KGRIL++ I+E            + + +  KE KG V +L    G L+ A+ QKI 
Sbjct: 1097 DITSKGRILIYDIIEVVPEPGQPLTKNRFKTVYAKEQKGPVTALCHVLGFLVTAMGQKI- 1155

Query: 793  LYKWMLRDD---GTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAI 849
             Y W L+D+   G   + ++   H  I      +    I+V D+ KSISLL ++ E   +
Sbjct: 1156 -YIWQLKDNDLVGIAFIDTQIYIHQMI------SVKSLILVADVYKSISLLRFQEEYRTL 1208

Query: 850  EERARDYNANWMSAVEILDDDIYLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYH 906
               +RD+    + A+E+L D+  +G   ++   N+       E        +L    ++H
Sbjct: 1209 SLVSRDFRPCEVYAIELLLDNTQMGFLISDVEMNIIMYMYKPEDRDSVGGQKLLRKADFH 1268

Query: 907  LGEFVN-----RFRHGSLVMRLPDSDVG--QIPTVIFGTVNGVIGVIASLPHEQYLFLEK 959
            LG+ +N     R R G       D  +G  +    +F T++G +G +  +P + Y  L+ 
Sbjct: 1269 LGQHINSWFRIRCRLGDQAENY-DFPIGAEKRHISMFATLDGALGYLLPIPEKTYRRLQM 1327

Query: 960  LQTNLRKVIKGVGGLNHEQWRSFNNEKKTV--DAKNFLDGDLIESFLDLSRTRMDEISKT 1017
            LQ  L   I  + GLN + +R + + +K +    K  +DG+LI  +L L+     +++K 
Sbjct: 1328 LQNILVYHIPHLAGLNPKAFRIYKSGRKLLGNPCKRIVDGELIWMYLSLTVMEKQDVAKK 1387

Query: 1018 MNVSVEELCKRVEELTRL 1035
            M   ++++ + +  + RL
Sbjct: 1388 MGSKMDDIIEDIAVIERL 1405



 Score = 65.1 bits (157), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 70/331 (21%), Positives = 137/331 (41%), Gaps = 59/331 (17%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMG--DVSDRIGRPT 63
           ++  + +++     G ++D L ++    K CV+++D +S +L T ++   +  D  G  T
Sbjct: 73  LFANVMSMQAVSLAGSSRDALLLSFREAKLCVVEYDPDSHDLRTLSLHYFEEEDMKGGWT 132

Query: 64  DNGQIGII--DPDCRLIGLHLYDGLFKVIPFDNKGQLKE--------------------- 100
           ++  I  +  DP+ R   + +Y     ++PF  + +L +                     
Sbjct: 133 NHYDIPYVRVDPEGRCAAMLVYGRKLVILPFRRESKLDDPDIALLDPHSSSVATAKAPVL 192

Query: 101 -AFNIRLEEL-----QVLDIKFLYGCAKPTIVVLYQDNKD--------ARHVKTYEVALK 146
            ++ I L E+      V+DI+FLYG  +PT+++LY+  K         +       V+L 
Sbjct: 193 SSYTITLREIDEKLENVIDIQFLYGYYEPTLLILYEPLKTFAGRIAVRSDTCAMIAVSLN 252

Query: 147 DKDFVEGP-WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVY---------CSANAFKAI 196
            +  V    WS  NL       IPVP PL G LI     ++Y          S N+    
Sbjct: 253 IQQRVHPAIWSVGNLPFNCTQAIPVPKPLGGTLIFSVNALIYLNQSIPPFGVSVNSIAEN 312

Query: 197 PIRPSITKAYG-RVDADGS--------RYLLGDHAGLLHLLVITHEK-EKVTGLKIELLG 246
                +    G ++  +GS        R +L    G L++L +  +    V G   +   
Sbjct: 313 STNFQLKIQEGVKITLEGSQATFISHDRLVLSLKTGELYVLSLLADNIRSVRGFHFDKAA 372

Query: 247 ETSIASTISYLDNAVVYIGSSYGDSQLIKLN 277
            + + + +   ++  +++GS  G+S L++  
Sbjct: 373 ASVLTTCLCVCEDKYLFLGSRLGNSLLLRFT 403



 Score = 45.8 bits (107), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 52/180 (28%), Positives = 79/180 (43%), Gaps = 22/180 (12%)

Query: 309 RQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL---------RSSTDD 359
           R    ++VT SG  K+G++ +++  I      + EL G   MW++           + DD
Sbjct: 501 RDPDLELVTTSGHGKNGAICVLQRTIRPQVVTTFELPGCLDMWTVIGPQSDSGPTQAEDD 560

Query: 360 PFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQ-LVQVTSGSVR 418
               FL++S    T IL      E+ E +  GF +Q  T+F  +   N+ +VQV+   VR
Sbjct: 561 ISHAFLILSQKDSTMILQTG--QEINEVDHSGFNTQGPTIFAGNLASNKYIVQVSKAGVR 618

Query: 419 LVSSTSRELRNEWKSP--PGYSVNVATANASQVLLATGGGHLVYLEI----GDGILTEVK 472
           L+    R L      P   G SV  A+     V L T  G +V L +    G G L+  K
Sbjct: 619 LL----RGLEQIQHIPLDLGSSVVHASTADPYVALLTEDGQVVLLTLRESRGQGRLSVFK 674


>gi|410911304|ref|XP_003969130.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Takifugu rubripes]
          Length = 1444

 Score = 98.6 bits (244), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 115/467 (24%), Positives = 200/467 (42%), Gaps = 76/467 (16%)

Query: 618  VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQS 672
            +    PF++   P   L   K+GEL I  +           +R IPL      + +  +S
Sbjct: 973  IESFSPFHNINCPKGFLYFNKQGELRISVLPTYLSYDAPWPVRKIPLRCTVHYVSYHVES 1032

Query: 673  RTFAICSLKNQSC-------AEESEMHFV----------------RLLDDQTFEFISTYP 709
            + +A+C+   + C        EE E   +                +L+   ++E I    
Sbjct: 1033 KVYAVCTSLKELCTRIPRMTGEEKEYETIERDERYINPQQDKFSIQLISPVSWEAIPNTR 1092

Query: 710  LD--TFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-----IVE 757
            +D   +EY   + + +      V     Y   GT  +  EE    +GRIL+      + E
Sbjct: 1093 IDLEEWEYVTCMKTVALRSQETVSGLKGYIAAGTCLMQGEEVT-CRGRILILDVIEVVPE 1151

Query: 758  DG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRD-DGTRELQSEC 810
             G      K +++ EKE KG V +L   NG L++AI QKI L  W+L+D D T     + 
Sbjct: 1152 PGQPLTKNKFKVLYEKEQKGPVTALCHCNGYLVSAIGQKIFL--WVLKDNDLTGMAFIDT 1209

Query: 811  GHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDD 870
              H H +     +  +FI+  DLMKS+SLL Y+ E   +   +RD     + ++E + D+
Sbjct: 1210 QLHIHQM----MSIKNFILAADLMKSVSLLRYQEESKTLSLVSRDAKPLEVYSIEFMVDN 1265

Query: 871  IYLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDS- 926
              LG   ++ + NL+      E        RL    +++ G  +N F       R+P   
Sbjct: 1266 NQLGFLVSDRDKNLYVYMYLPEAKESFGGMRLLRRADFNAGANINTF------WRMPCRG 1319

Query: 927  --DVGQIPTVI--------FGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNH 976
              + G    +         F T++G +G++  +  + Y  L  LQ  L  ++    GLN 
Sbjct: 1320 ALEAGSRKAMTWDNKHITWFATLDGGVGLLLPMQEKTYRRLLMLQNALTTMLSHHAGLNP 1379

Query: 977  EQWRSFNNEKKTVD--AKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
            + +R  + +++++    KN LDG+L+  +L LS     E++K +  +
Sbjct: 1380 KAFRMLHCDRRSLQNPVKNILDGELLNKYLYLSMMERSELAKKIGTT 1426



 Score = 51.6 bits (122), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 64/322 (19%), Positives = 135/322 (41%), Gaps = 52/322 (16%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRP--T 63
           ++G + ++E  +  G  +D L ++ +  K  V+++D  + +L T ++    +   R    
Sbjct: 74  LFGNVMSMESVQLVGANRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEELELRDGFV 133

Query: 64  DNGQIGII--DPDCRLIGLHLYDGLFKVIPFDN---------------KGQLKEAFNIRL 106
            N  I I+  DP+ R   + +Y     V+PF                 K      + I +
Sbjct: 134 QNVHIPIVRVDPENRCAVMLIYGTKLVVLPFRKDTLTDEQEVGVGEGPKSSFLPTYIIDV 193

Query: 107 EEL-----QVLDIKFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDFVEGP-- 154
            EL      ++D+KFL+G  +PT+++L++ N+      A    T  +     + ++    
Sbjct: 194 RELDEKLLNIIDMKFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNIMQKVHP 253

Query: 155 --WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAFKAIPIR 199
             WS +NL      ++ VP P+ GV++    +++Y +              N   A P+R
Sbjct: 254 VIWSLSNLPFDCTQVMAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTNGTTAFPLR 313

Query: 200 ----PSITKAYGRVDADG-SRYLLGDHAGLLHLL-VITHEKEKVTGLKIELLGETSIAST 253
                 IT    + D     + ++    G +++L +IT     V     +    + + + 
Sbjct: 314 LQDEVKITLDCSQADFIAYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLTTC 373

Query: 254 ISYLDNAVVYIGSSYGDSQLIK 275
           +  ++   +++GS  G+S L+K
Sbjct: 374 MVTMEPGYLFLGSRLGNSLLLK 395



 Score = 42.4 bits (98), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 41/171 (23%), Positives = 75/171 (43%), Gaps = 35/171 (20%)

Query: 288 EVLERYVNLGPIVDFCVVD---LERQGQG------QVVTCSGAYKDGSLRIVRNGIGINE 338
           EV +  +N+GP  +  + +   L  + Q       +VV CSG  K+G+L +++  I    
Sbjct: 465 EVCDSILNIGPCANASMGEPAFLSEEFQSNPEPDLEVVVCSGHGKNGALSVLQRSIRPQV 524

Query: 339 QASVELQGIKGMWSL-----------------RSSTDDPFDT------FLVVSFISETRI 375
             + EL G   MW++                 +  T+ P +       FL++S    T I
Sbjct: 525 VTTFELPGCHDMWTVISNEPVQKEQEETEREGKEKTEPPAEEDTKKHGFLILSREDSTMI 584

Query: 376 LAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQ-LVQVTSGSVRLVSSTSR 425
           L      E+ E +  GF +Q  T+F  +   N+ ++QV+   +RL+   ++
Sbjct: 585 LQTG--QEIMELDTSGFATQGPTVFAGNIGDNKYIIQVSPMGIRLLEGVTQ 633


>gi|71028348|ref|XP_763817.1| hypothetical protein [Theileria parva strain Muguga]
 gi|68350771|gb|EAN31534.1| hypothetical protein TP04_0182 [Theileria parva]
          Length = 1231

 Score = 98.6 bits (244), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 101/402 (25%), Positives = 183/402 (45%), Gaps = 47/402 (11%)

Query: 217 LLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKL 276
           LL    G L+ + ++H+   V+ + +       +AST+  L +  +++GS  GD +L + 
Sbjct: 304 LLLSEYGDLYKIELSHDDTTVSEIVVRYFDTVDVASTMCILRSGYLFVGSESGDHKLYQF 363

Query: 277 -------------NLQPDAKGSYV----------EVLERYVNLGPIVDFCVVDLERQGQG 313
                        +L PDAK + +           V++R  ++   VD  VVD+      
Sbjct: 364 TALENGDKDVICTSLHPDAKNAIIAFKPRKLQNLAVVDRMNSMALAVDMKVVDVLGLNNY 423

Query: 314 QVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTDDPFDTFLVVSFISE 372
            +    G + D  L+ +R G    E A  EL G  K +++++ S +  +D ++++SF   
Sbjct: 424 DIFVACGRWYDSRLKCLRYGFNTEELAFNELPGRPKAVFTIK-SLESNYDEYIIISFQGN 482

Query: 373 TRILAMNLEDELEETEIEGFCSQTQTLF-CHDAIYNQ------LVQVTSGSVRLVSSTSR 425
           T +L++   + +EE     F +   TL  CH    +        VQV  G  R ++    
Sbjct: 483 TLVLSIG--EAVEEVTDSFFLTSITTLHSCHMGSTSGSVGGGIFVQVHDGGFRYLTG--- 537

Query: 426 ELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGI---LTEVKHAQLEYEISC 482
           ++  EWK      V +A  N +Q++L   GG ++Y ++ +     L EV    L  E++C
Sbjct: 538 DVVKEWKVQTTKRVKLADNNNTQLVLVLTGGEVIYFQLTEADVLELVEVGRRNLSTEVTC 597

Query: 483 LDI-NPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPRSV-LLCAFE 539
           L + + +  N   ++    G   +I VR+  L   L L + + LG   +P SV LL    
Sbjct: 598 LAVQHQVSGNK--AEFCCCGSIDNI-VRVMKLDKTLKLCSSQILGNNSLPESVALLTTSV 654

Query: 540 GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLR 581
              YL   L +G L+   +++  G L+D++   LG +P+ L+
Sbjct: 655 DELYLYVGLNNGVLIRNTVDV-AGNLSDQESRFLGAKPLRLK 695



 Score = 68.6 bits (166), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 76/283 (26%), Positives = 127/283 (44%), Gaps = 34/283 (12%)

Query: 761  LQLIAEKETKGAVYSLNAFNGKLL-AAINQKIQLY----KWMLRDDGTRELQSECGHHGH 815
            ++L+    TKG +   N +  KLL  AI  K+++Y    K ML     R L S    +G 
Sbjct: 967  IKLLHVTNTKGWIRCFNNYENKLLLCAIGSKLRMYALGRKQMLLKGEHRSLTS----NGF 1022

Query: 816  ILALYVQTRGDFIVVGDLMKSISLL---IYKHEEGAIEERARDYNANWMSAVEILDDDIY 872
               + ++  G  I  GD+ +S+ LL    Y  + G  E         W+S +E+LD    
Sbjct: 1023 ---MDIKVVGSRIYCGDIRESVQLLRLKFYGEDLGEFELTTTSTGPRWLSTMELLDYSTV 1079

Query: 873  LGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIP 932
            +  +   +LF  R       +E+  R     EYH     N+F  G +V  L    +  I 
Sbjct: 1080 IAGDKFDSLFVSRV----PHNEDVVRSNYF-EYH-----NQFHLGDIVTSLQRVRINPIH 1129

Query: 933  T--VIFGTVNGVIGVIASLPHEQYL-FLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTV 989
            +  V++ T+ G IGV+     +  L FL+ L+  L   I  V G   + +RS+       
Sbjct: 1130 SEVVLYTTLMGSIGVLIPFVSKDELDFLQHLEMLLCNQIDTVTGREVQMFRSY-----YF 1184

Query: 990  DAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
              +N +DGDL E F  L   + + ++  +N+ V E+ K+++ +
Sbjct: 1185 PVQNIVDGDLCEMFTALGDEKFN-VANQLNLKVAEVVKKLKNI 1226


>gi|84996279|ref|XP_952861.1| splicing factor 3b subunit [Theileria annulata strain Ankara]
 gi|65303859|emb|CAI76236.1| splicing factor 3b subunit, putative [Theileria annulata]
          Length = 1340

 Score = 98.6 bits (244), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 105/428 (24%), Positives = 182/428 (42%), Gaps = 73/428 (17%)

Query: 217 LLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKL 276
           LL    G L+ + ++H+   V+ + I       +  ++  L +  ++IGS  GD +L + 
Sbjct: 357 LLLSEYGDLYKIELSHDDNTVSEVVIRYFDTVDVGISMCILRSGYLFIGSESGDHKLYQF 416

Query: 277 -------------NLQPDAKGSYVE----------VLERYVNLGPIVDFCVVDLERQGQG 313
                        +L PDAK + +           V++R  ++G +VD  V D+      
Sbjct: 417 TSLDNGDKDVICTSLHPDAKNAIIAFKPRVLQNLVVVDRMSSMGLVVDMKVADVMGLNNY 476

Query: 314 QVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTDDPFDTFLVVSFISE 372
            +    G + +  L+ +R G    E A  EL G  K +++++ S +  FD ++++SF   
Sbjct: 477 DIFVACGRWYNSRLKCLRYGFNTEELAFNELPGRPKHVFTIK-SLESNFDEYIIISFQGN 535

Query: 373 TRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQL----------------------- 409
           T +L++   + +EE     F +   TL C    Y                          
Sbjct: 536 TLVLSIG--EAVEEVTDSFFLTSITTLHCCYMSYRNGVSSSGVGGVSGVSGVGTVTNGVP 593

Query: 410 ------------VQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGH 457
                       VQ+  G  R +S    ++  EWK      V +A  N +Q++L   GG 
Sbjct: 594 VTGGTGITRGIYVQIHDGGYRYLSG---DIIKEWKVQSTKRVKLADNNDTQLILVLTGGE 650

Query: 458 LVYLEIGDG---ILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL- 513
           ++Y ++ D     L EV    L  EI+CL I     + + ++    G   +I VRI  L 
Sbjct: 651 IIYFQLTDTEVPELVEVGRRNLSTEITCLAIQH-PNSGTKAEFCCCGSIDNI-VRIMKLD 708

Query: 514 PDLNLITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSL 573
            +L L + + LG   +P SV L   + I YL   L +G L+   L+M  G L D++   +
Sbjct: 709 KNLKLCSSQILGNNSLPESVTLLTNDEI-YLYVGLNNGVLIRNTLDM-IGNLIDQESRFM 766

Query: 574 GTQPITLR 581
           GT+P+ L+
Sbjct: 767 GTKPLKLK 774



 Score = 68.9 bits (167), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 92/381 (24%), Positives = 160/381 (41%), Gaps = 47/381 (12%)

Query: 675  FAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCV 734
            F +  LKN           +R+++    E I+       E   +  +C    +S     V
Sbjct: 979  FEVMELKNYRAGVGKWSSCIRIINPINLETIAKLLFTENEAATTAYTCIL--NSIQLLIV 1036

Query: 735  GT---AYVLPEE---NEPTKGRILVF--------IVEDGKLQLIAEKETKGAVYSLNAFN 780
            GT   A++ P     N+  +  I V+        +V    ++L+    TKG +   N + 
Sbjct: 1037 GTIKNAHLYPTHMDVNDEVESCIRVYEYDSNYNNLVNGFNIKLLHITNTKGWIRCFNNYE 1096

Query: 781  GKLL-AAINQKIQLY----KWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMK 835
             KLL  AI  K+++Y    K ML     R L S    HG    + ++  G  I  GD+ +
Sbjct: 1097 NKLLLCAIGSKLRMYSLGKKQMLLKGEHRSLTS----HGF---MDIKVIGSRIYCGDIRE 1149

Query: 836  SISLL---IYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGAT 892
            S+ LL    Y  + G  E         W+S++E+LD    + A + F+   V +      
Sbjct: 1150 SVQLLRIKFYGEDLGEFELTTTSTGPRWLSSMELLDYSTVI-AGDKFDSIFVSRVPHNED 1208

Query: 893  DEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHE 952
                   E   ++HLG+ V  F+   + +    S+V     V++ T+ G IGV+     +
Sbjct: 1209 VVRSNYFEYHNQFHLGDIVTSFQR--VRINPIHSEV-----VLYTTLMGSIGVLVPFVSK 1261

Query: 953  QYL-FLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRM 1011
              L FL+ L+  +   I  V G   + +RS+         +N +DGDL E ++ L     
Sbjct: 1262 DELDFLQHLEMLMCNQIDTVTGREVQMFRSY-----YFPVQNIVDGDLCEMYMTLDDKY- 1315

Query: 1012 DEISKTMNVSVEELCKRVEEL 1032
              I+  +N+ V E+ K+++ +
Sbjct: 1316 -NIANQLNLKVNEIIKKLKNI 1335


>gi|320581947|gb|EFW96166.1| hypothetical protein HPODL_2449 [Ogataea parapolymorpha DL-1]
          Length = 1203

 Score = 98.2 bits (243), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 173/822 (21%), Positives = 326/822 (39%), Gaps = 122/822 (14%)

Query: 295  NLGPIVDFCVVDLERQGQGQVV-TCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWS 352
            N+ P++D  + +        V+ +  G     +L+++ + +  +E  + EL  + + ++ 
Sbjct: 415  NINPLIDAKLYNSNETTSLPVIYSLCGTGPRSALKVLNHELPFSEVVTQELPSVVQKVFV 474

Query: 353  LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
             + + DD +D ++V+SF+  T  L + + +++EE E  G    T TL         LVQV
Sbjct: 475  SKLNRDDEYDKYIVLSFVDGT--LVLRIGEDVEEVENSGLVLDTNTLGVFQVGSTALVQV 532

Query: 413  TSGSVRLVSSTSRELRN--EWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG--DGIL 468
                VR V       +   +W  P G  V   +A  SQ+ +A     +VY E+   D ++
Sbjct: 533  HPNGVRQVFYVDETPQKTIDWAPPAGIRVLHCSATNSQLAIALSNREIVYFELDDLDKLI 592

Query: 469  TEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGE 527
               +H +L  +I+ L +  +    +      V    D ++ +FS  P+  L   E +G E
Sbjct: 593  EYNEHKELAAQITALTLGEVQAGTARFPYLLVA-CQDKTLTVFSTDPESTL---EIVGEE 648

Query: 528  IIPRSV-LLCAF------------------------EGISYLLCALGDGHLLNFLLNMKT 562
            I+  +   L AF                          + Y+   +  G      ++ +T
Sbjct: 649  ILSSAASSLMAFYMKDTAIVLNKDAGEDDEDEPELATSLLYVHIGMESGVYARLQMDPQT 708

Query: 563  GELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMC 622
            GEL++ +    G +P+ L    +     V   S R  + Y+   +   + +      H C
Sbjct: 709  GELSNPRNKYTGPRPVQLSKIEAVGQNAVCIFSARTYLGYTRPSEFKITPLTKPVFQHAC 768

Query: 623  PFNSAAFPDSLAIAKEG-ELTIGTIDDIQK-LHIRSIPLGEHPRRICHQEQSRTFAICSL 680
             F S   P++  +A  G  LTI TID ++    I SI L   P+ +C             
Sbjct: 769  SFRSEDIPENGVLAVYGNSLTIMTIDQLENDTIIESIALRNTPKYMC------------- 815

Query: 681  KNQSCAEESEMHFVRLLDDQTFEFIST----------YPLDTFEYGCSILSCSFSDDSNV 730
                C +++ M +V   D  T   I            YP     +   I +   +D S V
Sbjct: 816  ---DCTDKNGMMYVLEGDIDTKRDIEEERIEEYQQFGYPRAPGAWASCIQTVVVADKSVV 872

Query: 731  YYC-VG--TAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGA---VYSLNAFNGKL- 783
                +G  TA+     N  ++       V   K Q ++     GA   +Y +++ NG L 
Sbjct: 873  QTIDLGDETAFRACWVNFESRPGEPFLAVSSAKNQQLSPPRNDGAYIHLYRISS-NGTLE 931

Query: 784  -------------LAAINQKIQL---YKWMLRDDGTRELQSECGHHGHILALY-VQTRGD 826
                         L A   K+ +    + +L D G ++L            +  ++T+G 
Sbjct: 932  LFQTTKTEHLSLALTAFQGKLLVGAKNELILYDIGQKQLVKRSSTRLECYEIVDLKTQGF 991

Query: 827  FIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR- 885
             ++V D+  S+   +YK  E +  +   D     ++   +LD D  +  +   N+  +R 
Sbjct: 992  RVIVSDVRDSVRYTVYKPLENSFVDFIDDTMQRHVTRTLLLDYDTVVVGDKFGNISVLRC 1051

Query: 886  -KNSEGATDEE---------RGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVI 935
             +     +DE+         R +L+    Y++G+    F+ GSL        +G   ++I
Sbjct: 1052 PEQISEMSDEDNHGFLVKMRRTKLDNPVNYYVGDMPTFFQKGSLT-------IGGAESII 1104

Query: 936  FGTVNGVIGVIASLPH-EQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNF 994
            +G + G +G +  +    +  F ++LQ   R +I     L   ++  F         KN 
Sbjct: 1105 YGCLQGQMGCLYPMKSLSEINFFKELQ---RLIIHEFTSLTDREYLKFKGYYNP--PKNS 1159

Query: 995  LDGDLIESFLDLS-------RTRMDEISKTMNVSVEELCKRV 1029
            +DGDLIE +  L         T+MD + + ++  + ++  R+
Sbjct: 1160 IDGDLIEEYYRLGPEKRIRIATKMDRLPRDIDRRISDMRSRI 1201


>gi|195056749|ref|XP_001995154.1| GH22991 [Drosophila grimshawi]
 gi|193899360|gb|EDV98226.1| GH22991 [Drosophila grimshawi]
          Length = 1426

 Score = 97.8 bits (242), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 107/435 (24%), Positives = 198/435 (45%), Gaps = 62/435 (14%)

Query: 654  IRSIPLGEHPRRICHQEQSRTFAICSLKNQSC-------AEESEM-------HFVR---- 695
            +R +PL   PR++ +  ++R + + + K +          E+ E+        F+     
Sbjct: 997  VRKVPLRCTPRQLVYHRENRVYCLITQKEEPMTKYYRFNGEDKELSEECRGERFIYPIGS 1056

Query: 696  -----LLDDQTFEFI--STYPLDTFEY--GCSILSCSFSDDSN---VYYCVGTAYVLPEE 743
                 L+  +T+E +  ++   + +E+     I+  S+    +    Y C+GT +    E
Sbjct: 1057 LFEMVLISPETWEIVPDASIQFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNY-SE 1115

Query: 744  NEPTKGRILVF-----IVEDGK------LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQ 792
            +  ++G I ++     + E GK      L+ + +KE KG V +++   G L+  + QKI 
Sbjct: 1116 DITSRGNIHIYDIIEVVPEPGKPMTKFKLKEVFKKEQKGPVSAISDVVGFLVTGLGQKI- 1174

Query: 793  LYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEER 852
             Y W LRD    +L        +I    + T    I + D+ KSISLL ++ E   +   
Sbjct: 1175 -YIWQLRDG---DLIGVAFIDTNIYVHQIITVKSLIFIADVYKSISLLRFQEEHRTLSLA 1230

Query: 853  ARDYNANWMSAVEILDDDIYLG-----AENNFNLFTVRKNSEGATDEERGRLEVVGEYHL 907
            +RD+N   +  +E + D+  LG     AE N  ++  +  +  +   +  +L    +YHL
Sbjct: 1231 SRDFNPMEVFGIEFMVDNSNLGFLVTDAERNLIVYMYQPEARESLGGQ--KLLRKADYHL 1288

Query: 908  GEFVNR-FR----HGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQT 962
            G+ VN  FR       L  R P     +   VI+G+++G +G    LP + Y     LQ 
Sbjct: 1289 GQVVNTMFRVQCHQRGLHQRQPFLYENK-HLVIYGSLDGALGYCLPLPEKVYRRFLMLQN 1347

Query: 963  NLRKVIKGVGGLNHEQWRSFNNEKK--TVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV 1020
             L      + GLN +++R+  + KK     ++  +DGDLI SF  L+ +  +E++K +  
Sbjct: 1348 VLLSYQDHLCGLNPKEYRTIKSVKKLGINPSRCIIDGDLIWSFRMLAHSERNEVAKKIGT 1407

Query: 1021 SVEELCKRVEELTRL 1035
              EE+   + E+ R+
Sbjct: 1408 RTEEILADLLEIERI 1422



 Score = 56.2 bits (134), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 70/323 (21%), Positives = 129/323 (39%), Gaps = 77/323 (23%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
           +YG + +L+     G  +D L I+ +  K  VLQ DA++  L T ++    +   R    
Sbjct: 76  LYGNVMSLQSVSLAGAMRDALLISFKDAKLSVLQLDADTQTLKTLSLHYFEEDDIRGGWT 135

Query: 66  GQIGI----IDPDCRLIGLHLYDGLFKVIPFDNKGQLKE--------------------- 100
           G+  +    +DPD R   + +Y     V+PF     L E                     
Sbjct: 136 GRYHVPVVRVDPDARCAIMLVYGKRLVVLPFRKDNSLDEIELADVKPIKKAPTAMVTRTP 195

Query: 101 ---AFNIRLEEL-----QVLDIKFLYGCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVE 152
              ++ I L +L      VLDI+FL+G  +PT+++LY+       V+T    +K +    
Sbjct: 196 IMASYLIALADLDEKLDNVLDIQFLHGYYEPTLLILYEP------VRTCAGRIKVR---- 245

Query: 153 GPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVY-------------CSANAFKAIPIR 199
                      +D   P+  P+ G L++    ++Y              SA+   A P++
Sbjct: 246 -----------SDTFFPIQKPIGGCLVMTVNAVIYLNQSVPPYGVSLNSSADNSTAFPLK 294

Query: 200 P------SITKA-YGRVDADGSRYLLGDHAGLLHLLVITHEKEK-VTGLKIELLGETSIA 251
           P      S+  A +  +D D  + ++    G L++L +  +  + V          + + 
Sbjct: 295 PQDNVRISLDCANFAFIDVD--KLVISLRTGDLYVLTLCVDSMRTVRNFHFHKAAASVLT 352

Query: 252 STISYLDNAVVYIGSSYGDSQLI 274
           S I       +++GS  G+S L+
Sbjct: 353 SCICVCHTEYIFLGSRLGNSLLL 375



 Score = 41.2 bits (95), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 43/160 (26%), Positives = 70/160 (43%), Gaps = 26/160 (16%)

Query: 288 EVLERYVNLGPIVDFCV---VDLERQGQG-------------QVVTCSGAYKDGSLRIVR 331
           EV +  +N+ PI   C    V+ E  G               ++V  +G  K+G+L +  
Sbjct: 455 EVCDSLINVAPINYMCAGERVEFEEDGATLRPHADNLNDLKIELVAATGHSKNGALSVFV 514

Query: 332 NGIGINEQASVELQGIKGMWSL-----RSSTDDPFDT--FLVVSFISETRILAMNLEDEL 384
           N I      S EL+G   +W++     R +T    D   F+++S  S T +L      E+
Sbjct: 515 NCINPQIITSFELEGCLDVWTVFDDATRKATTARQDQHDFMLLSQRSSTLVLQTG--QEI 572

Query: 385 EETEIEGFCSQTQTLFCHDAIYNQ-LVQVTSGSVRLVSST 423
            E E  GF     T++  +    + +VQVT+  VRL+  T
Sbjct: 573 NEIENTGFTVNQPTIYVGNLGQQRFIVQVTTRHVRLLQGT 612


>gi|296227035|ref|XP_002807684.1| PREDICTED: LOW QUALITY PROTEIN: cleavage and polyadenylation
            specificity factor subunit 1 [Callithrix jacchus]
          Length = 1394

 Score = 97.8 bits (242), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 111/460 (24%), Positives = 190/460 (41%), Gaps = 62/460 (13%)

Query: 618  VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQS 672
            V    PF++   P   L   ++GEL I  +           +R IPL      + +  +S
Sbjct: 923  VDSFAPFHNINCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVES 982

Query: 673  RTFAICSLKNQSCAE-----------------------ESEMHFVRLLDDQTFEFI--ST 707
            + +A+ +  N  C                         + E   ++L+   ++E I  + 
Sbjct: 983  KVYAVATSTNTPCTRIPRMTGEEKEFETIERDERYIHPQQEAFSIQLISPVSWEAIPNAR 1042

Query: 708  YPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVFIVED---- 758
              L  +E+   + + S   +  V     Y   GT  +  EE    +GRIL+  V +    
Sbjct: 1043 IELQEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVT-CRGRILIMDVIEVVTE 1101

Query: 759  -------GKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECG 811
                    K +++ EKE KG V +L   NG L++AI QKI L  W LR     EL     
Sbjct: 1102 PRQTLTXXKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFL--WSLR---ASELTGMAF 1156

Query: 812  HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI 871
                +    + +  +FI+  D+MKSISLL Y+ E   +   +RD     + +V+ + D+ 
Sbjct: 1157 IDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNA 1216

Query: 872  YLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRF-----RHGSLVMRL 923
             LG   ++ + NL       E        RL    ++H+G  VN F     R  +  +  
Sbjct: 1217 QLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRGATEGLSK 1276

Query: 924  PDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFN 983
                        F T++G IG++  +  + Y  L  LQ  L  ++    GLN   +R  +
Sbjct: 1277 KSVMWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLH 1336

Query: 984  NEKKTVD--AKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
             +++T+    +N LDG+L+  +L LS     E++K +  +
Sbjct: 1337 VDRRTLQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTT 1376



 Score = 58.9 bits (141), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 106/527 (20%), Positives = 211/527 (40%), Gaps = 122/527 (23%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAM-----GDVSDRIG 60
            +G + ++   +  G  +D L ++ +  K  V+++D  + +L T ++      ++ D   
Sbjct: 74  FFGNVMSMASVQLAGAKRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPELRDGFV 133

Query: 61  RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPF-------DNKGQLKEA----------FN 103
           +     ++ + DPD R   + +Y     V+PF       +++G + E            +
Sbjct: 134 QNVHTPRVRV-DPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGEGQRSSFLPSYIID 192

Query: 104 IR-LEE--LQVLDIKFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDFVEGP- 154
           +R L+E  L ++D++FL+G  +PT+++L++ N+      A    T  +     +  +   
Sbjct: 193 VRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKVH 252

Query: 155 ---WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAFKAIPI 198
              WS  +L       + VP P+ GV+I    +++Y +                  A P+
Sbjct: 253 PVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFPL 312

Query: 199 RPSITKAYGRVDADGS--------RYLLGDHAGLLHLL-VITHEKEKVTGLKIELLGE-- 247
           R   T+   R+  D +        + ++    G +++L +IT     V     +      
Sbjct: 313 R---TQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASV 369

Query: 248 --TSIASTISYL-------------DNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
             TS++ T  +L             D   VY   +   +QL   +          EV + 
Sbjct: 370 LTTSVSGTEGFLCAAGGKSVPQDEXDEIEVYGSETQSGTQLATYSF---------EVCDS 420

Query: 293 YVNLGPIVDFCVVD---LERQGQG------QVVTCSGAYKDGSLRIVRNGIGINEQASVE 343
            +N+GP  +  + +   L  + Q       ++V CSG  K+G+L +++  I      + E
Sbjct: 421 ILNIGPCANAAMGEPAFLSEEFQNSPEPDLEIVVCSGHGKNGALSVLQKSIRPQVVTTFE 480

Query: 344 LQGIKGMWSL-------------------RSSTDDPFDT-----FLVVSFISETRILAMN 379
           L G   MW++                     ST +  D      FL++S    T IL   
Sbjct: 481 LPGCYDMWTVIAPVRKEEEENPKGEGTEQEPSTPEADDDSRRHGFLILSREDSTMILQTG 540

Query: 380 LEDELEETEIEGFCSQTQTLFCHDAIYNQ-LVQVTSGSVRLVSSTSR 425
              E+ E +  GF +Q  T+F  +   N+ +VQV+   +RL+   ++
Sbjct: 541 --QEIMELDTSGFATQGPTVFAGNIGDNRYIVQVSPLGIRLLEGVNQ 585


>gi|380014171|ref|XP_003691113.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Apis florea]
          Length = 1583

 Score = 97.8 bits (242), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 127/516 (24%), Positives = 229/516 (44%), Gaps = 80/516 (15%)

Query: 575  TQPITLRTFSS-KNTTHVFAASDRPTVIYSSNKKLLYSNVNLKE--VSHMCPFNSAAFPD 631
            T+   +R FS+      VF  SD P  I+ + +  L ++    +  V+   PFN+   P 
Sbjct: 897  TRHCMMRYFSNIAGYNGVFICSDYPHWIFLTGRGELRTHPMGIDGPVTSFAPFNNINCPQ 956

Query: 632  S-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQSRTFAICS-----LK 681
              L   ++ EL I  +           +R +PL   P  + +  +S+T+ + +     LK
Sbjct: 957  GFLYFNRKEELRICVLPTHLSYDAPWPVRKVPLRCTPHFVTYHLESKTYCVITSIAEPLK 1016

Query: 682  N---------QSCAEESEMHFVRLLDDQ----TFEFIS--TYP-----LDTFEYGCSILS 721
            +         +   EE    F+    +Q     F  +S  T P     LD +E+   + +
Sbjct: 1017 SYYRFNGEDKEFTEEERPDRFIYPSQEQFSIVLFSPVSWETIPNTKIELDQWEHVTCLKN 1076

Query: 722  CSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-IVE----------DGKLQLIA 765
             S + +        Y  +GT Y   E+   ++GRIL+F I+E            + + I 
Sbjct: 1077 VSLAYEGTRSGLKGYIVLGTNYNYGEDIT-SRGRILIFDIIEVVPEPGQPLTKNRFKQIY 1135

Query: 766  EKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDD---GTRELQSECGHHGHILALYVQ 822
             KE KG + ++   +G L++A+ QKI  Y W L+D+   G   + ++   H  +L++   
Sbjct: 1136 AKEQKGPITAITQVSGFLVSAVGQKI--YIWQLKDNDLVGVAFIDTQIYIH-QMLSI--- 1189

Query: 823  TRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG-----AEN 877
                 I++ D+ KSISLL ++ E   +   +RD+    +  +E L D+  LG      E+
Sbjct: 1190 --KSLILIADVYKSISLLRFQEEYRTLSLVSRDFRPAEVYTIEYLIDNTNLGFLVADGES 1247

Query: 878  NFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPD--------SDVG 929
            N  LF  +  S  +   +  +L    ++HLG+ VN F    +  R+ D        SD  
Sbjct: 1248 NIALFMYQPESRESLGGQ--KLIRKADFHLGQKVNTFFR--IRCRISDPANDKKHFSDAD 1303

Query: 930  QIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTV 989
            +    ++ +++G +G I  +P + Y  L  LQ  L   I  + GLN + +R++ +  +T 
Sbjct: 1304 KRHVTMYASLDGNLGYILPVPEKTYRRLLMLQNVLVTHICHIAGLNPKAYRTYKSHIRTQ 1363

Query: 990  --DAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVE 1023
               A+  +DGDL+  +L L      +++K +   VE
Sbjct: 1364 GNPARGIIDGDLVWRYLYLPNNEKIDVAKKIAPIVE 1399



 Score = 52.4 bits (124), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 65/329 (19%), Positives = 139/329 (42%), Gaps = 61/329 (18%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRP--T 63
           ++G + +++     G  +D L ++    K  V+++D ++ +L T ++    +   R   T
Sbjct: 73  LHGNVMSMQAVTLVGSQRDSLLLSFRDAKLSVVEYDQDTHDLRTVSLHYFEEEEIRDGWT 132

Query: 64  DNGQIGII--DPDCRLIGLHLYDGLFKVIPF------------------DNKGQLKEAFN 103
           ++  I I+  DP+ R   + +Y     V+PF                   NK  +  ++ 
Sbjct: 133 NHHHIPIVRVDPEGRCAVMLIYGRKLVVLPFKKDPSLDDGDLLDNSKASSNKTPILSSYM 192

Query: 104 IRLEELQ-----VLDIKFLYGCAKPTIVVLYQDNKD------ARHVKTYEVALK---DKD 149
           I L+ L+     ++D++FL+G  +PT+++LY+  +        R      VA+     + 
Sbjct: 193 IVLKCLEEKMDNIIDLQFLHGYYEPTLLILYEPVRTFSGRIAVRQDTCAMVAISLNIQQR 252

Query: 150 FVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAFKAI 196
                WS +NL       +PV  PL G LI+   +++Y +             A      
Sbjct: 253 VHPIIWSVSNLPFDCYQAVPVKKPLGGTLIMAVNSLIYLNQSIPPYGVSLNSLAETSTNF 312

Query: 197 PIRPSITKAYGRVDADGS--------RYLLGDHAGLLHLL-VITHEKEKVTGLKIELLGE 247
           P++P   +   ++  +GS        R ++   +G L++L +       V G   +    
Sbjct: 313 PLKP---QEGVKISLEGSQVAFISSDRLVISLKSGELYVLSLFADSMRSVRGFHFDKAAA 369

Query: 248 TSIASTISYLDNAVVYIGSSYGDSQLIKL 276
           + + S +   ++  +++GS  G+S L++ 
Sbjct: 370 SVLTSCVCMCEDNYLFLGSRLGNSLLLRF 398



 Score = 45.4 bits (106), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 46/172 (26%), Positives = 78/172 (45%), Gaps = 14/172 (8%)

Query: 314 QVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP---------FDTF 364
           ++VT SG  K+G+L ++++ I      + EL G + MW++  + ++             F
Sbjct: 505 ELVTTSGYGKNGALCVLQHSIRPQVVTTFELPGCEDMWTVIGTLNNDEQIRPEAEGSHAF 564

Query: 365 LVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQ-LVQVTSGSVRLVSST 423
           L++S    T IL      E+ E +  GF +Q  T+F  +   N+ +VQVT   VRL+   
Sbjct: 565 LILSQEDSTMILQTG--QEINEVDQSGFSTQGSTIFAGNLGANRYIVQVTQMGVRLLQGI 622

Query: 424 SRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQ 475
             E         G  +  A+     V L +  G ++ L + +G  T   HAQ
Sbjct: 623 --EQIQHMPIDLGCPIVHASCADPYVTLLSEDGQVMLLTLREGRGTARLHAQ 672


>gi|195381337|ref|XP_002049409.1| GJ21566 [Drosophila virilis]
 gi|194144206|gb|EDW60602.1| GJ21566 [Drosophila virilis]
          Length = 1420

 Score = 97.8 bits (242), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 108/435 (24%), Positives = 197/435 (45%), Gaps = 62/435 (14%)

Query: 654  IRSIPLGEHPRRICHQEQSRTFAICSLK-------------NQSCAEESE---------- 690
            +R +PL   PR++ +  ++R + + + K             ++  +EES           
Sbjct: 991  VRKVPLRCTPRQLVYHRENRVYCLITQKEEPMTKYYRFNGEDKELSEESRGERFIYPIGS 1050

Query: 691  MHFVRLLDDQTFEFI--STYPLDTFEY--GCSILSCSFSDDSN---VYYCVGTAYVLPEE 743
            +  + L+  +T+E +  ++   + +E+     I+  S+    +    Y C+GT +    E
Sbjct: 1051 LFEMVLISPETWEIVPDASIQFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNY-SE 1109

Query: 744  NEPTKGRILVF-----IVEDGK------LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQ 792
            +  ++G I ++     + E GK      L+ + +KE KG V +++   G L+  + QKI 
Sbjct: 1110 DITSRGNIHIYDIIEVVPEPGKPMTKFKLKEVFKKEQKGPVSAISDVVGFLVTGLGQKI- 1168

Query: 793  LYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEER 852
             Y W LRD    +L        +I    + T    I + D+ KSISLL ++ E   +   
Sbjct: 1169 -YIWQLRDG---DLIGVAFIDTNIYVHQIITVKSLIFIADVYKSISLLRFQEEYRTLSLA 1224

Query: 853  ARDYNANWMSAVEILDDDIYLG-----AENNFNLFTVRKNSEGATDEERGRLEVVGEYHL 907
            +RD+N   +  +E + D+  LG     AE N  ++  +  +  +   ++  L    +YHL
Sbjct: 1225 SRDFNPLEVFGIEFMVDNSNLGFLVTDAERNLIVYMYQPEARESLGGQK--LLRKADYHL 1282

Query: 908  GEFVNR-FR----HGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQT 962
            G+ VN  FR       L  R P     +   VI+GT++G +G    LP + Y     LQ 
Sbjct: 1283 GQVVNTMFRVQCHQRGLHHRQPFLYENK-HLVIYGTLDGALGYCLPLPEKVYRRFLMLQN 1341

Query: 963  NLRKVIKGVGGLNHEQWRSFNNEKKT--VDAKNFLDGDLIESFLDLSRTRMDEISKTMNV 1020
             L      + GLN +++R+    KK     ++  +DGDLI S+  L+ +   E++K +  
Sbjct: 1342 VLLSYQDHLCGLNPKEYRTIKTVKKMGINPSRCIIDGDLIWSYRMLAHSERSEVAKKIGT 1401

Query: 1021 SVEELCKRVEELTRL 1035
              EE+   + E+ RL
Sbjct: 1402 RTEEILADMLEIERL 1416



 Score = 52.4 bits (124), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 69/323 (21%), Positives = 127/323 (39%), Gaps = 82/323 (25%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
           +YG + +L+     G  +D L I+ +  K  VLQ DA++  L T ++    +   R    
Sbjct: 76  LYGNVMSLQSVSLAGGMRDALLISFKDAKLSVLQLDADTQALKTLSLHYFEEEDIRGGWT 135

Query: 66  GQIGI----IDPDCRLIGLHLYDGLFKVIPFDNKGQLKE--------------------- 100
           G+  +    +DPD R   + +Y     V+PF     L E                     
Sbjct: 136 GRYHVPVVRVDPDARCAVMLVYGKRLVVLPFRKDNSLDEIELADVKPIKKAPTAMVTRTP 195

Query: 101 ---AFNIRLEEL-----QVLDIKFLYGCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVE 152
              ++ I L +L      VLDI+FL+G  +PT+++LY+       V+T    +K      
Sbjct: 196 IMASYLIALADLDEKLDNVLDIQFLHGYYEPTLLILYEP------VRTCAGRIK------ 243

Query: 153 GPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVY-------------CSANAFKAIPIR 199
                         + P+  P+ G L++    I+Y              SA+   + P++
Sbjct: 244 --------------VFPIQKPIGGCLVMTVNAIIYLNQSVPPYGVSLNSSADNSTSFPLK 289

Query: 200 P------SITKA-YGRVDADGSRYLLGDHAGLLHLLVITHEKEK-VTGLKIELLGETSIA 251
           P      S+  A +  +D D  + ++    G L++L +  +  + V          + + 
Sbjct: 290 PQDNVRLSLDCANFAFIDVD--KLVISLRTGDLYVLTLCVDSMRTVRNFHFHKAAASVLT 347

Query: 252 STISYLDNAVVYIGSSYGDSQLI 274
           S I       +++GS  G+S L+
Sbjct: 348 SCICVCHTEYIFLGSRLGNSLLL 370


>gi|224004656|ref|XP_002295979.1| spliceosome associated factor 3b, subunit 3; 130kD spliceosome
           associated protein [Thalassiosira pseudonana CCMP1335]
 gi|209586011|gb|ACI64696.1| spliceosome associated factor 3b, subunit 3; 130kD spliceosome
           associated protein [Thalassiosira pseudonana CCMP1335]
          Length = 1212

 Score = 97.4 bits (241), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 104/410 (25%), Positives = 185/410 (45%), Gaps = 44/410 (10%)

Query: 295 NLGPIVDFCVVDLE-RQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKG-MWS 352
           +L P++   V +L   +   Q+    G     SLR++R+G+ + E A  EL G+ G +++
Sbjct: 386 SLAPVISVLVGELAGNEVSPQLYALCGRGPTSSLRVLRHGLSVTELAVSELPGVPGAVFN 445

Query: 353 LR----SSTDDPFDTFLVVSFISETRILAMNLEDELEETEIE-GFCSQTQTLFCH----- 402
           +R    +     +D ++VVSF   T +L++   + +EE   E GF +   TL C      
Sbjct: 446 VRDDQAAKNGKFYDRYIVVSFADATLVLSVG--ETVEEMGKESGFLTTEPTLACSALGNG 503

Query: 403 --DAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVY 460
                   +VQV  G VR +    R   ++W  P    +  A+AN SQ+L+A  GG L+Y
Sbjct: 504 GNGEQEGGIVQVYPGGVRHIQ---RGSVSQWHVPGIKKIECASANESQILIALVGGELIY 560

Query: 461 LEIG--DGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNL 518
            E+    G L E     +  ++  LD+  + +  S S  AAVG   D +VR+ SL   +L
Sbjct: 561 FELDPLSGNLMEAATKDVGADVCSLDVGAVPKGKSRSLFAAVG-CRDSTVRLLSLAPGSL 619

Query: 519 ITKEHLG--GEIIPRSVLL---CAFEGISYLLCALG--DGHLLNFLLNMKTGEL-TDRKK 570
           + ++     G   P SV L      +G   +  ++G  DG  L   ++  TG + T   +
Sbjct: 620 LEQKSSTTLGTTRPHSVALSNGVGSDGAGEITLSVGLDDGSALRAGVDPITGAMSTSPSR 679

Query: 571 VSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFP 630
             LG +P+ +     + +      S RP +  +   +   + ++   + H C F++ A  
Sbjct: 680 RFLGARPVAVSRVMIEGSPSTLLLSSRPWIGRAGQGRHALAPMSYAPLDHGCSFSNEAVR 739

Query: 631 DS-----------LAIAKEG--ELTIGTIDDIQKLHIRSIPLGEHPRRIC 667
           +            L++ + G  ++ +G  DD +  +   I L   PR++C
Sbjct: 740 EGIVATSGSTLRILSVGESGGNDVVLGAEDD-EAFNSTRIGLRYTPRQMC 788



 Score = 87.4 bits (215), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 75/300 (25%), Positives = 135/300 (45%), Gaps = 36/300 (12%)

Query: 751  ILVFIVEDGKLQLIAEKETK-GAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSE 809
            +L  +V   +LQL+       G V SL  F G+LL  + + ++LY+      G R+L  +
Sbjct: 924  VLYRVVNGERLQLLHRTTVDDGPVLSLVHFQGRLLVGVGKTVRLYEM-----GKRQLLKK 978

Query: 810  CGHHGH-ILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILD 868
            C   G   +   +Q  GD   VGD+M+S+  + Y      +   A+D N   ++  E+LD
Sbjct: 979  CELRGMPTMVKTLQAAGDRAFVGDMMQSMQFIRYDSTANRLVLVAKDRNPRPITCQELLD 1038

Query: 869  -DDIYLGAE-NNFNLFTVRKNSEG--------------ATDEERGRLEVVGEYHLGEFVN 912
             + + +G +  N  +  + + ++               A D+   +LE +  YH+GE V 
Sbjct: 1039 INTVAVGDKFGNVTILRLPRGADAGAIDVTGTRALWDSARDDATPKLETLCTYHVGEVVT 1098

Query: 913  RFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGV 971
                 SLV        G   ++I+ TV G +G        +   F   L+  LR      
Sbjct: 1099 SMTRASLV-------AGGAESLIYVTVTGRVGAFVPFTSRDDVEFYTSLEGFLRTETPRP 1151

Query: 972  GGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEE 1031
             G + + +RS+         K+ +DGDL ++F  L      +I+++++ SV E+ K++E+
Sbjct: 1152 TGRDPQSYRSYY-----APMKHIVDGDLCDAFAQLPYETKQKIAESLDRSVGEVMKKLED 1206


>gi|307191845|gb|EFN75271.1| Cleavage and polyadenylation specificity factor subunit 1
            [Harpegnathos saltator]
          Length = 1214

 Score = 97.4 bits (241), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 85/333 (25%), Positives = 160/333 (48%), Gaps = 42/333 (12%)

Query: 731  YYCVGTAYVLPEENEPTKGRILVF-IVE----------DGKLQLIAEKETKGAVYSLNAF 779
            Y  +GT Y   E+   ++GRIL+F I+E            + + I  KE KG + ++   
Sbjct: 890  YIVLGTNYNYGEDIT-SRGRILIFDIIEVVPEPGQPLTKNRFKQIYAKEQKGPITAITQV 948

Query: 780  NGKLLAAINQKIQLYKWMLRDD---GTRELQSECGHHGHILALYVQTRGDFIVVGDLMKS 836
            +G L+ A+ QKI  Y W L+D+   G   + ++   H  +L++        I++ D+ KS
Sbjct: 949  SGFLVTAVGQKI--YIWQLKDNDLVGIAFIDTQIYIH-QMLSI-----KSLILIADVYKS 1000

Query: 837  ISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG-----AENNFNLFTVRKNSEGA 891
            ISLL ++ +   +   +RD+    +  +E L D+  LG      E+N  LF  +  S  +
Sbjct: 1001 ISLLRFQEKCRTLSLVSRDFRPAEVYTIEYLIDNTNLGFLIADGESNLALFMYQPESRES 1060

Query: 892  TDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPD--------SDVGQIPTVIFGTVNGVI 943
               ++  L    ++HLG+ +N F    +  R+ D        SD  +    ++ +++G +
Sbjct: 1061 LGGQK--LIRKADFHLGQKINTFFR--IKCRVTDVASDKKHFSDADKKHVTMYASLDGSL 1116

Query: 944  GVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNN--EKKTVDAKNFLDGDLIE 1001
            G +  +P + Y  L  LQ  L   I  + GLN + +R++ +    +   A+  +DGDL+ 
Sbjct: 1117 GYVLPVPEKTYRRLLMLQNVLVTHICHIAGLNPKAYRTYKSYVRNQGNPARGIIDGDLVW 1176

Query: 1002 SFLDLSRTRMDEISKTMNVSVEELCKRVEELTR 1034
             +L L      +++K +   V+E+ + + E+ R
Sbjct: 1177 RYLSLPNNEKADVAKKIGTRVQEIIEDITEIDR 1209



 Score = 56.6 bits (135), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 86/417 (20%), Positives = 161/417 (38%), Gaps = 109/417 (26%)

Query: 111 VLDIKFLYGCAKPTIVVLYQDNKD------ARHVKTYEVALK---DKDFVEGPWSQNNLD 161
           V+D++FL+G  +PT+++LY+  +        R      VA+     +      WS +NL 
Sbjct: 4   VIDLQFLHGYYEPTLLILYEPVRTFSGRIAVRQDTCAMVAISLNIQQRVHPIIWSVSNLP 63

Query: 162 NGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAFKAIPIRPS----ITK 204
                 +PV  PL G LI+   +++Y +             A+     P+RP     I+ 
Sbjct: 64  FDCYQAVPVKKPLGGTLIMAVNSLIYLNQSIPPYGVSLNSLADTSTNFPLRPQDGVKISL 123

Query: 205 AYGRVD-ADGSRYLLGDHAGLLHLL-VITHEKEKVTGLKIELLGETSIASTISYLDNAVV 262
              +V      R ++   +G L++L +       V G   +    + + S +   ++  +
Sbjct: 124 EGAQVAFLSADRLVISLKSGELYVLSLFADSMRSVRGFHFDKAAASVLTSCVCMCEDNYL 183

Query: 263 YIGSSYGDSQLIKL----------------------NLQPDAKG---------------- 284
           ++GS  G+S L++                       N +P AK                 
Sbjct: 184 FLGSRLGNSLLLRFTEKEPETIKSLDDGEINIEDNDNEEPPAKKAKQDFLGDWMASDVLD 243

Query: 285 ---------------------SYV-EVLERYVNLGPIVDFCVVDL---------ERQGQG 313
                                SY+ EV +  +N+GP  +  + +           +    
Sbjct: 244 IKDPEELEVYGSETHTSIQITSYIFEVCDSLLNIGPCGNISMGEPAFLSEEFAHNQNPDV 303

Query: 314 QVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP---------FDTF 364
           ++VT SG  K+G+L +++  I      + EL G + MW++  S ++             F
Sbjct: 304 ELVTTSGYGKNGALCVLQRSIRPQVVTTFELPGCEDMWTVIGSLNNDEQVKSETEGSHAF 363

Query: 365 LVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQ-LVQVTSGSVRLV 420
           L++S   E   + +    E+ E +  GF +Q  T+F  +   N+ +VQVT   VRL+
Sbjct: 364 LILS--QEDSTMVLQTGQEINEVDQSGFSTQGSTVFAGNLGANRYIVQVTQMGVRLL 418


>gi|195122290|ref|XP_002005645.1| GI18959 [Drosophila mojavensis]
 gi|193910713|gb|EDW09580.1| GI18959 [Drosophila mojavensis]
          Length = 1431

 Score = 97.1 bits (240), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 107/435 (24%), Positives = 195/435 (44%), Gaps = 62/435 (14%)

Query: 654  IRSIPLGEHPRRICHQEQSRTFAICSLK-------------NQSCAEESE---------- 690
            +R +PL   PR++ +  ++R + + + K             ++  +EES           
Sbjct: 1002 VRKVPLRCTPRQLVYHRENRVYCLITQKEEPMTKYYRFNGEDKELSEESRGERFIYPIGS 1061

Query: 691  MHFVRLLDDQTFEFI--STYPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEE 743
            +  + L+  +T+E +  ++   + +E+  +      S +        Y C+GT +    E
Sbjct: 1062 LFEMVLISPETWEIVPDASIQFEPWEHVTAFKLVKLSYEGTRSGLKEYLCIGTNFNY-SE 1120

Query: 744  NEPTKGRILVF-----IVEDGK------LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQ 792
            +  ++G I ++     + E GK      L+ + +KE KG V +++   G L+  + QKI 
Sbjct: 1121 DITSRGNIHIYDIIEVVPEPGKPMTKFKLKEVFKKEQKGPVSAISDVVGFLVTGLGQKI- 1179

Query: 793  LYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEER 852
             Y W LRD    +L        +I    + T    I + D+ KSISLL ++ E   +   
Sbjct: 1180 -YIWQLRDG---DLIGVAFIDTNIYVHQIITVKSLIFIADVYKSISLLRFQEEYRTLSLA 1235

Query: 853  ARDYNANWMSAVEILDDDIYLG-----AENNFNLFTVRKNSEGATDEERGRLEVVGEYHL 907
            +RD+N   +  +E + D+  LG     AE N  ++  +  +  +   ++  L    +YHL
Sbjct: 1236 SRDFNPLEVFGIEFMVDNSNLGFLVTDAERNIIVYMYQPEARESLGGQK--LLRKADYHL 1293

Query: 908  GEFVNR-FR----HGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQT 962
            G+ VN  FR       L  R P     +   VI+GT++G +G    LP + Y     LQ 
Sbjct: 1294 GQVVNTMFRVQCHQRGLHQRQPFLYENK-HFVIYGTLDGALGYCLPLPEKVYRRFLMLQN 1352

Query: 963  NLRKVIKGVGGLNHEQWRSFNNEKKT--VDAKNFLDGDLIESFLDLSRTRMDEISKTMNV 1020
             L      + GLN +++R+    KK     ++  +DGDLI S+  L+ +   E++K +  
Sbjct: 1353 VLLSYQDHLCGLNPKEYRTIKTVKKMGINPSRCIIDGDLIWSYRMLAHSERSEVAKKIGT 1412

Query: 1021 SVEELCKRVEELTRL 1035
              EE+   + E+ RL
Sbjct: 1413 RTEEILADLLEIERL 1427



 Score = 50.1 bits (118), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 67/323 (20%), Positives = 127/323 (39%), Gaps = 82/323 (25%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
           +YG + +L+     G  +D L ++ +  K  VLQ DA++  L T ++    +   R    
Sbjct: 76  LYGNVMSLQSVSLAGGMRDALLVSFKDAKLSVLQLDADTQTLKTLSLHYFEEDDIRGGWT 135

Query: 66  GQIGI----IDPDCRLIGLHLYDGLFKVIPFDNKGQLKE--------------------- 100
           G+  +    +DPD R   + +Y     V+PF     L E                     
Sbjct: 136 GRYHVPVVRVDPDARCAIMLVYGKRLVVLPFRKDNSLDEIELADVKPIKKAPTALVSRTP 195

Query: 101 ---AFNIRLEEL-----QVLDIKFLYGCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVE 152
              ++ I L +L      VLDI+FL+G  +PT+++LY+       V+T    +K      
Sbjct: 196 IMASYLIALADLDEKLDNVLDIQFLHGYYEPTLLILYEP------VRTCAGRIK------ 243

Query: 153 GPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVY-------------CSANAFKAIPIR 199
                         + P+  P+ G L++    ++Y              SA+   + P++
Sbjct: 244 --------------VFPIQKPIGGCLVMTVNAVIYLNQSVPPYGVSLNSSADNSTSFPLK 289

Query: 200 P------SITKA-YGRVDADGSRYLLGDHAGLLHLLVITHEKEK-VTGLKIELLGETSIA 251
           P      S+  A +  +D D  + ++    G L++L +  +  + V          + + 
Sbjct: 290 PQDNVRLSLDCANFAFIDVD--KLVISLRTGDLYVLTLCVDSMRTVRNFHFHKAAASVLT 347

Query: 252 STISYLDNAVVYIGSSYGDSQLI 274
           S I       +++GS  G+S L+
Sbjct: 348 SCICVCHTEYIFLGSRLGNSLLL 370


>gi|195455711|ref|XP_002074834.1| GK23274 [Drosophila willistoni]
 gi|194170919|gb|EDW85820.1| GK23274 [Drosophila willistoni]
          Length = 1463

 Score = 96.7 bits (239), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 115/477 (24%), Positives = 212/477 (44%), Gaps = 67/477 (14%)

Query: 617  EVSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQ 671
            EV     FN+   P+  L      EL I  +           +R +PL   PR++ +  +
Sbjct: 992  EVRSFAAFNNINIPNGFLFFDTTFELKISVLPSYLSYDSTWPVRKVPLRCTPRQLVYHRE 1051

Query: 672  SRTFAICSL-------------KNQSCAEESE-MHFVR---------LLDDQTFEFI--S 706
            +R + + +              +++  +EES    F+          L+  +T+E +  +
Sbjct: 1052 NRVYCLITQTEEPMTKFYRFNGEDKELSEESRGERFIYPIGSQFDMVLISPETWEIVPDA 1111

Query: 707  TYPLDTFEY--GCSILSCSFSDDSN---VYYCVGTAYVLPEENEPTKGRILVF-----IV 756
            +   + +E+     I+  S+    +    Y C+GT +    E+  ++G I ++     + 
Sbjct: 1112 SIRFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNY-SEDITSRGNIHIYDIIEVVP 1170

Query: 757  EDGK------LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSEC 810
            E GK      L+ + +KE KG V +++   G L+  + QKI  Y W LRD    +L    
Sbjct: 1171 EPGKPMTKFKLKEVFKKEQKGPVSAISDVLGFLVTGLGQKI--YIWQLRDG---DLIGVA 1225

Query: 811  GHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDD 870
                +I    + T    I + D+ KSISLL ++ E   +   +RD+N   +  +E + D+
Sbjct: 1226 FIDTNIYVHQIITVKSLIFIADVYKSISLLRFQEEYRTLSLASRDFNPLEVYGIEFMVDN 1285

Query: 871  IYLG-----AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR-FR----HGSLV 920
              LG     AE+N  ++  +  +  +   ++  L    +YHLG+ VN  FR       L 
Sbjct: 1286 TNLGFLVTDAESNLIVYMYQPEARESLGGQK--LLRKADYHLGQVVNTMFRVQCHQRGLH 1343

Query: 921  MRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWR 980
             R P     +   V++GT++G +G    LP + Y     LQ  L      + GLN +++R
Sbjct: 1344 QRQPFLYENK-HFVVYGTLDGALGYCLPLPEKVYRRFLMLQNVLLSYQDHLCGLNPKEYR 1402

Query: 981  SFNNEKK--TVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRL 1035
            +  + K+     ++  +DGDLI S+  L+ +  +E++K +    EE+   + E+ RL
Sbjct: 1403 TLKSSKRLGINPSRCIIDGDLIWSYRLLANSERNEVAKKIGTRTEEILADLLEIERL 1459



 Score = 54.3 bits (129), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 72/334 (21%), Positives = 136/334 (40%), Gaps = 67/334 (20%)

Query: 6   IYGRIATLEL--FRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPT 63
           +YG + +L+       G  +D L ++ +  K  VLQ D ++  L T ++    +   R  
Sbjct: 76  LYGNVMSLQCVSLAGAGAMRDALLVSFKDAKLSVLQHDPDTYALKTLSLHYFEEEDIRGG 135

Query: 64  DNGQIGI----IDPDCRLIGLHLYDGLFKVIPFDNKGQLKE------------------- 100
             G+  +    +DPD R   + +Y     V+PF     L E                   
Sbjct: 136 WTGRYYVPEVRVDPDARCAVMLIYGKRLVVLPFRKDNSLDEIELADVKPIKKAPTALVTR 195

Query: 101 -----AFNIRLEEL-----QVLDIKFLYGCAKPTIVVLYQDNKD-ARHVKTYE------- 142
                ++ I L +L      VLDI+FL+G  +PT+++LY+  +  A  +K          
Sbjct: 196 TPIMASYLIALRDLDEKIDNVLDIQFLHGYYEPTLLILYEPVRTCAGRIKVRSDTCVLVA 255

Query: 143 VALKDKDFVEG-PWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVY-------------C 188
           ++L  +  V    W+ NNL      L+P+  P+ G L++    ++Y              
Sbjct: 256 ISLNIQQRVHPIIWTVNNLPFDCLRLLPIQKPIGGCLVMTVNAVIYLNQSVPPYGVSLNS 315

Query: 189 SANAFKAIPIRP------SITKA-YGRVDADGSRYLLGDHAGLLHLLVITHEKEK-VTGL 240
           SA+   + P++P      S+  A +  +D D  + ++    G L++L +  +  + V   
Sbjct: 316 SADNSTSFPLKPQDGVRISLDCANFAFIDVD--KLVVSLRTGDLYVLTLCVDSMRTVRNF 373

Query: 241 KIELLGETSIASTISYLDNAVVYIGSSYGDSQLI 274
                  + + S I       +++GS  G+S L+
Sbjct: 374 HFHKAASSVLTSCICVCHMEYIFLGSRLGNSLLL 407



 Score = 45.8 bits (107), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 44/159 (27%), Positives = 72/159 (45%), Gaps = 26/159 (16%)

Query: 288 EVLERYVNLGPIVDFCV---VDLERQGQG-------------QVVTCSGAYKDGSLRIVR 331
           EV +  +N+ PI   C    V+ E  G               ++V  +G  K+G+L +  
Sbjct: 490 EVCDSLINVAPINYMCAGERVEFEEDGTTLRPHAESLQDVKIELVAATGHSKNGALSVFV 549

Query: 332 NGIGINEQASVELQGIKGMWSL------RSSTDDPFDTFLVVSFISETRILAMNLEDELE 385
           N I      S EL+G   +W++      ++S  D  D F+++S  + T  L +    E+ 
Sbjct: 550 NCINPQIITSFELEGCLDVWTVFDDATKKTSRQDQHD-FMLLSQKNST--LVLQTGQEIN 606

Query: 386 ETEIEGFCSQTQTLFCHDAIYNQ-LVQVTSGSVRLVSST 423
           E E  GF     T+F  +   N+ +VQVT+  VRL+  T
Sbjct: 607 EIENTGFTVNQATIFVGNLGQNRFIVQVTTRHVRLLQGT 645


>gi|68476233|ref|XP_717766.1| potential spliceosomal U2 snRNP complex SF3b component [Candida
            albicans SC5314]
 gi|68476422|ref|XP_717672.1| potential spliceosomal U2 snRNP complex SF3b component [Candida
            albicans SC5314]
 gi|74586274|sp|Q5A7S5.1|RSE1_CANAL RecName: Full=Pre-mRNA-splicing factor RSE1
 gi|46439394|gb|EAK98712.1| potential spliceosomal U2 snRNP complex SF3b component [Candida
            albicans SC5314]
 gi|46439495|gb|EAK98812.1| potential spliceosomal U2 snRNP complex SF3b component [Candida
            albicans SC5314]
          Length = 1219

 Score = 96.7 bits (239), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 234/1145 (20%), Positives = 457/1145 (39%), Gaps = 185/1145 (16%)

Query: 24   DFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDNGQIGIIDPD--CRLIGLH 81
            D + I ++     +LQ+D ++ + I++    ++         G+   IDP+  C L+   
Sbjct: 118  DGVVITSDSGNLSILQYDNKTKKFISKIQEPMTKNGWGRNYVGENLAIDPENRCILVAAM 177

Query: 82   LYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKF----------LYGC-----AKPTIV 126
              + LF  I  ++ G  + +  +     QVL +K           L+G       K  I+
Sbjct: 178  EKNKLFYKIESNSSGSKELSSPLEAHSKQVLCLKIVALNTDHNNPLFGALELTPEKKCII 237

Query: 127  VLYQDNKDARHVKTYEVALKDKDFVEGPWSQNN--LDNGADLLIPVPPPLCGVLIIGEET 184
              Y+ ++   HV      +K K     P S N+  L N  + LIP+P  + G+++ G   
Sbjct: 238  NYYELDQGLNHV------VKKK-----PNSSNSDPLPNDVNYLIPLPGHIGGMVVCGTNW 286

Query: 185  IVYCSANA---FKAIPIRPSITKAYGRVD-----ADGSRY--LLGDHAGLLHLLVITHE- 233
              Y   +    +  +P R   T+    V+         ++  LL +  G L  L + ++ 
Sbjct: 287  CFYDKLDGPRIYLPLPRRNGQTQDSIIVNHVTHVLKKKKFFILLQNALGDLFKLTVDYDF 346

Query: 234  -KEKVTGLKIELLGETSIASTISYLDNAVVY---------------IGSSYGDSQLI--- 274
             KE +  + I        A +++   N  ++               +G    + +L+   
Sbjct: 347  DKEIIKNISITYFDTIPPALSLNIFKNGFLFANVLNNDKLLYQFEKLGDDLTEGELVINS 406

Query: 275  ----KLNLQPDAKGSY----------VEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSG 320
                 LN   ++  S+          ++VLE    L PI D  ++D       ++VT S 
Sbjct: 407  SDYESLNSVRESVTSFKLKGLDNLALIDVLE---TLSPITDSKIID------SKLVTLSS 457

Query: 321  AYKDGSLRIVRNGIGINEQASVELQ-GIKGMWSLRSSTDDPFDTFLVVSFISETRILAMN 379
                  ++ + +G+         L      +++ + S +   D +LV+S    ++ L ++
Sbjct: 458  H---SYVKSITHGVPTTTLVESPLPITPTDIFTTKLSLESANDEYLVISSSLSSKTLVLS 514

Query: 380  LEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRN-EWKSPPGYS 438
            + + +E+ E   F     T+         +VQ+ S  ++ V + +   +  +W  P G +
Sbjct: 515  IGEVVEDVEDSEFVLDQPTIAVQQVGIASVVQIYSNGIKHVRTVNGNKKTTDWFPPAGIT 574

Query: 439  VNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEI-SCLDINPIGENPSYSQI 497
            +  AT N  QVL+A     +VY EI D    ++   Q   EI + +    I EN S    
Sbjct: 575  ITHATTNNQQVLIALSNLSVVYFEI-DATDDQLIEYQDRLEIATTITAMAIQENISEKSP 633

Query: 498  AAVGMWTDISVRIFSLPDLNLITKEHL----GGEIIPRSVLLCAFEGISYLLCALGDGHL 553
             A+   +D ++++ SL + N +  + L          + +     E  +++   + +G  
Sbjct: 634  FAIIGCSDETIQVVSLQEHNCLEIKSLQALSANSSSLKMLKSSGKE--THVHIGMENGVY 691

Query: 554  LNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTH-VFAASDRPTVIYSSNKKLLYSN 612
                ++   G L++ +   +G++P++L      N    + A S  P + Y        + 
Sbjct: 692  ARIKIDTINGNLSNSRVKYIGSKPVSLSVIKFSNEIEGILAISSAPWISYLYRDSFKITP 751

Query: 613  VNLKEVSHMCPFNSAAFP-DSLAIAKEGELTIGTI-------DDIQKLHIRSIPLGEHPR 664
            +   ++++   F S     + +   K+  L I ++       D  Q L I +  L   PR
Sbjct: 752  LLEIDITNGSSFISEDIGGEGIVGIKDNNLIIFSVGKEDSVFDPSQDLTIATTKLRYTPR 811

Query: 665  RICHQEQSRTFAI-----------CSLKNQSCAEESEMHF---------------VRLLD 698
            ++     +R F             C++         E ++               ++++D
Sbjct: 812  KMI-TNGNRLFISESEYNVQGPFKCNINGDVKENVDEDYYEAFGYEWKQNSWASCIQVVD 870

Query: 699  DQTFEFISTYPLDTFEYGCSILSCSFSDDSN-----VYYCVGTAY---VLPEENEPTKGR 750
             ++ + I +  LD  E   S+ + SF+  S       +  VG      +LP  N   K  
Sbjct: 871  SKSNQVIQSLQLDGNESIVSMSAVSFNKTSTPSVPASHLVVGVCTNQTILP--NSYDKSY 928

Query: 751  ILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSEC 810
            +  F +    LQL+ + E       L  F  KLL A    I+LY     D G ++L  + 
Sbjct: 929  LYTFKIGKKHLQLVHKTELDHIPQVLENFQDKLLVASGNHIRLY-----DIGQKQLLKKS 983

Query: 811  ----GHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIE--ERARDYNANWMSAV 864
                    +I  +  QT  + I++ D  KS S++  K +E   +    A D     ++++
Sbjct: 984  TTIIDFSTNINKIIPQT--NRIIICDSHKS-SIVFAKFDESQNQFVPFADDVMKRQITSI 1040

Query: 865  EILDDDIYLGAENNFNLFTVR----------------KNSEGATDEERGRLEVVGEYHLG 908
              LD D  +G +   N+F  R                K  +G  +    +L+ + E+H+G
Sbjct: 1041 MNLDIDTLIGGDKFGNIFVTRIDEDISKQADDDWTILKTQDGILNSCPYKLQNLIEFHIG 1100

Query: 909  EFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYL-FLEKLQTNLRKV 967
            + +  F  G L       ++    +VI+  + G IG++  L  +  +  L  LQ  +++ 
Sbjct: 1101 DIITSFNLGCL-------NLAGTESVIYTGLQGTIGLLIPLVSKSEVELLFNLQLYMQQS 1153

Query: 968  IKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCK 1027
               + G +H + RS+ N       KN +DGDL+E FL+   +   EIS+ +N SV ++ K
Sbjct: 1154 QNNLVGKDHLKLRSYYNP-----IKNVIDGDLLERFLEFDISLKIEISRKLNKSVNDIEK 1208

Query: 1028 RVEEL 1032
            ++ +L
Sbjct: 1209 KLIDL 1213


>gi|189208368|ref|XP_001940517.1| pre-mRNA-splicing factor rse1 [Pyrenophora tritici-repentis
           Pt-1C-BFP]
 gi|187976610|gb|EDU43236.1| pre-mRNA-splicing factor rse1 [Pyrenophora tritici-repentis
           Pt-1C-BFP]
          Length = 1247

 Score = 96.3 bits (238), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 138/614 (22%), Positives = 238/614 (38%), Gaps = 80/614 (13%)

Query: 155 WSQNNLDNGADLLIPVP---PPLCGVLIIGEETIVY-----CSANAFK-AIPIRPSITKA 205
           WS+  +D  A+ L  VP       GVL  GE++I Y       +N  + AIP R   T+ 
Sbjct: 224 WSEP-VDRTANTLFRVPGGPNAPSGVLCCGEDSITYRRIFNNKSNVHRLAIPRREGATED 282

Query: 206 YGR-----------VDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTI 254
             R           +      YLL    G +  L +      V  +KI+      I ++I
Sbjct: 283 PNRKRMIIAGTLYSLKGGDFFYLLQTEDGDVFKLTVDAPSGTVEKIKIKYFDTIPIVTSI 342

Query: 255 SYLDNAVVYIGSSYGDSQLIKLN----------------------------LQPDAKGSY 286
             L    VY     GD  L +L                              +P A  ++
Sbjct: 343 CILRAGFVYAACESGDRILYELESLGDETDDPVFESDQFPVDPDTSFAPPFFKPRALVNF 402

Query: 287 VEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVEL-Q 345
             V E   +L PI+   V +   +   QV T +G     S R  RN + + +     L Q
Sbjct: 403 TPV-ESMPSLNPIMGMEVANPALEDAPQVYTINGTSGRSSFRTTRNALEVLDLIESPLPQ 461

Query: 346 GIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAI 405
               +W+ + ++ D  DT +V+     +R L + + D++EE    GF   T TL      
Sbjct: 462 NASDVWTTKLTSGDEADTLIVLCL--HSRTLVLKIGDDVEEASNTGFLPDTNTLGVQQFG 519

Query: 406 YNQLVQVTSGSVRLV---------SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGG 456
            + ++Q+    +R +         +  +     +W  P   ++     N  QV +A   G
Sbjct: 520 EDCIIQIHPKGIRHIQGIQFPNDDADATHANLMDWHPPAHRTIVACATNNRQVAIALSSG 579

Query: 457 HLVYLEI-GDGILTEVKHA-QLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL- 513
            ++Y E   DG L   +    L+  I+CL +  + E    +   AVG  +D +VRIF+L 
Sbjct: 580 QILYFECDSDGSLAMAEEEISLDSTINCLAMPDVPEGSVRAFFLAVGC-SDQTVRIFNLS 638

Query: 514 PDL--NLITKEHLGGEIIPRSVLLCAF------EGIS-YLLCALGDGHLLNFLLNMKTGE 564
           PD+  N++    +     P S L   F       G S +L   L  G  +  +L+  TG+
Sbjct: 639 PDMDGNILRSISVQALTSPPSDLTINFMKDRSPRGESQFLHIGLRSGVYIRSVLDEMTGD 698

Query: 565 LTDRKKVSLGTQPITLRTFSS-KNTTHVFAASDRPTVIYSSNKK--LLYSNVNLKEVSHM 621
           + D ++  LG +PI     +    +  + A + RP + Y+  +   L  + +N       
Sbjct: 699 IGDTRRRFLGPEPIKFAKVNDVAGSPGILAMTSRPWLGYTHPRTGVLQLTPLNYIPFKSA 758

Query: 622 CPFNSAAFPDSLAIAKEGELTIGTIDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAICSL 680
             F+ + F   + ++   EL I T +D+       +I L   PR++    +   F +   
Sbjct: 759 WNFDGSQFKGIICVSTN-ELRIFTFNDLTDNTTYENISLKYTPRKMVGYHEQGVFYVIQS 817

Query: 681 KNQSCAEESEMHFV 694
            N + + +     +
Sbjct: 818 DNNTISADRRQQLI 831



 Score = 68.2 bits (165), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 71/287 (24%), Positives = 130/287 (45%), Gaps = 33/287 (11%)

Query: 755  IVEDGK-LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTREL--QSECG 811
            I  DG+ ++L  E        +L AF GKL+A + + + LY     D G + L  +++  
Sbjct: 954  ISPDGREIELFNETAVSEPPLALLAFKGKLIAGVGRHLCLY-----DCGMKSLLRKAQAP 1008

Query: 812  HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHE--EGAIEERARDYNANWMSAVEILDD 869
            +      + ++T+G  +VV D  +S++  ++K +     +     D  A   SA E+LD 
Sbjct: 1009 NSVPTRIVDIKTQGSRLVVSDQAQSVTYFVHKDQVHPNRLIPFVDDSVARHTSASEMLDY 1068

Query: 870  DIYLGAENNFNLFTVR---KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDS 926
            D  +G +   N++ VR   + SE + +   G   +V + +LG   NR     L+     +
Sbjct: 1069 DTTIGGDKFGNIWLVRCPKEISESSDESPDGSDLLVDKSYLGGTPNRL---DLIAHYFTN 1125

Query: 927  DV-----------GQIPTVIFGTVNGVIGVIASLPHEQ-YLFLEKLQTNLRKVIKGVGGL 974
            D+           G    + +  + G +G +      + +   ++L+  LR   K + G 
Sbjct: 1126 DIPVSIQKANLISGGDKVIFWAGLQGTLGALIPFTSRRTHKLFQQLELTLRSEDKPLSGR 1185

Query: 975  NHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
            +H  +RS+    K+V     +DGDLIE +L LS  + + I+  M  S
Sbjct: 1186 DHLTYRSYYAPVKSV-----IDGDLIERYLVLSSDKRESIAGQMTGS 1227


>gi|159486547|ref|XP_001701300.1| nuclear pre-mRNA splicing factor, component of splicing factor 3b
           [Chlamydomonas reinhardtii]
 gi|158271783|gb|EDO97595.1| nuclear pre-mRNA splicing factor, component of splicing factor 3b
           [Chlamydomonas reinhardtii]
          Length = 1078

 Score = 96.3 bits (238), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 119/518 (22%), Positives = 209/518 (40%), Gaps = 89/518 (17%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
           ++G+I +L   R  G   D L I ++  +  +++++ + +  + +   +   R G R   
Sbjct: 57  VFGQIRSLAAVRLTGSTTDHLAIGSDSGRIVLIKFNKDKNVWV-KVHQETFGRSGCRRIV 115

Query: 65  NGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLY---- 118
            GQ   +DP  R  ++G         V+  D    L    +  LE  +   I F      
Sbjct: 116 PGQFLAVDPKGRACMVGAVEKQKFVYVLNRDAAANL--TISSPLEAHKSHHITFSICGMD 173

Query: 119 -GCAKPTIVVLYQDNKDA-------------RHVKTYEVALKDKDFVEGPWSQNNLDNGA 164
            G   P    +  D  +A             +H+  YE+ L   + V   W++  +DNGA
Sbjct: 174 CGFDNPIFAAIELDYSEADQDPTGEAASLAQKHLTFYEMDLGLNNVVR-KWTEP-VDNGA 231

Query: 165 DLLIPVPPPL---CGVL--------------IIGEETIVYCSANAFKAIPIRPSITKAYG 207
           +LLI VP       GVL              + G+  ++  S    K         KAY 
Sbjct: 232 NLLIAVPGGADGPGGVLDHEEVRAVIPRRSDLPGDRGVLIVSYATHK--------KKAY- 282

Query: 208 RVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSS 267
                 S +L+    G ++ + +++E E VT +K++      + ++I+ L    ++  S 
Sbjct: 283 ------SFFLVQSEYGDIYKVTLSYEGEAVTEVKVKYFDTIPVTASIAVLKTGFLFAASE 336

Query: 268 YGDSQLIKL----NLQPDAKGSYVEV----------------------LERYVNLGPIVD 301
           YG+  L +         D + S  ++                      ++   +L PI D
Sbjct: 337 YGNHALYQFVGTGEDDDDVESSSAQLVATEEGFQPVFFEPRPLKNLLLIDEMASLMPITD 396

Query: 302 FCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIK-GMWSLRSSTDDP 360
             V +L  +   Q+    G     SL ++R G+ + E A   L G    +W++R +  D 
Sbjct: 397 MKVANLLGEEIPQIYALCGRGPRASLSVLRPGLAVTELAVSPLPGAPTAVWTVRRAASDE 456

Query: 361 FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLV 420
           +D ++VVSF + T  L  ++ +E++ET   GF     TL       N ++QV  G +R +
Sbjct: 457 YDAYIVVSFANAT--LVFSIGEEVKETNESGFLGTVPTLHTQLLADNSMLQVYPGGLRHI 514

Query: 421 SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHL 458
               R   NEWK P    +  A +N  QV +A  GG +
Sbjct: 515 RPDRR--INEWKVPGRRVIKAAASNDKQVAIALQGGEV 550



 Score = 67.0 bits (162), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 70/303 (23%), Positives = 131/303 (43%), Gaps = 38/303 (12%)

Query: 751  ILVFIVEDG--KLQLIAEKETKGAVY-SLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQ 807
            I V+ + DG  +L L+ + +  G V  +L  F G+LLA +   ++LY     D G +++ 
Sbjct: 788  IRVYRLGDGGRRLDLLHKTQVDGGVPGALAGFKGRLLAGVGPTLRLY-----DMGKKKML 842

Query: 808  SECGHHG------HILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWM 861
             +C ++       H+       R          +S+ ++ YK  + A    A D    ++
Sbjct: 843  RKCEYNRWTNIFLHVFFYRPYFRSS-------QESVHMMRYKKADNAFYIFADDVAPRYL 895

Query: 862  SAVEILDDDIYLGAENNFNLFTVRKNSEGAT----DEERGRLEVV-----GEYHLGEFVN 912
            SA+  LD D     +   NL  +R   E +     D   G++        G  H  E + 
Sbjct: 896  SALLPLDYDTIATGDKFGNLVILRLPQEASQQVEDDPTGGKMAAASGKLNGAPHKLEELV 955

Query: 913  RFRHGSLVMRLPDSDV--GQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIK 969
            +F  G  +  L  +++  G    +++ TV G IGV+      E   F   L+ +LR+   
Sbjct: 956  KFHVGDTITALQRAEMQAGGQEVLVYSTVMGAIGVVYPFTSREDVDFFSHLEMHLRQENP 1015

Query: 970  GVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRV 1029
             + G +H  +RS          +N +DGDL   +  +   +   I++ M+ +  E+ K++
Sbjct: 1016 PLAGRDHLAYRS-----AYFPVRNCVDGDLCSQYASIPMKKQQMIAEAMDRTTGEMLKKL 1070

Query: 1030 EEL 1032
            E++
Sbjct: 1071 EDI 1073


>gi|449661926|ref|XP_002167992.2| PREDICTED: cleavage and polyadenylation specificity factor subunit
           1-like [Hydra magnipapillata]
          Length = 1122

 Score = 96.3 bits (238), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 119/564 (21%), Positives = 244/564 (43%), Gaps = 111/564 (19%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAM---------GDVS 56
           ++G I  +++ R     +D L +A +  K  ++++D  S +L T +M         G +S
Sbjct: 76  LFGNIINMQVVRLGSNVRDSLLLAFKHAKLSIVEFDPLSHDLKTDSMHYFENDEFKGGLS 135

Query: 57  DRIGRPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEA--------------- 101
             I  P     +  +DP+ R   + +Y+    V+PF +  +L E+               
Sbjct: 136 HNIYLP-----LVRVDPEQRCACMLIYNRHLVVLPFKHDIKLDESEELSDGEHIKSVLPS 190

Query: 102 FNIRLEELQ-----VLDIKFLYGCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGP-- 154
           + I L  L+     + +++FL+G  +PT++ L++  + +    T  VA++   F      
Sbjct: 191 YMIDLHSLEQPLLNITELQFLHGYHQPTLMFLFEPVQTS----TGRVAVRQDTFCVSAIS 246

Query: 155 -----------WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVY---------CSANAFK 194
                      WS  NL     +L P+  P+ GVL+    +++Y          S N+  
Sbjct: 247 LNMTEKVHPVIWSVTNLPFDCHMLRPIEKPIGGVLVFASNSLIYLNQSIPPYGVSLNSIT 306

Query: 195 ----AIPIRPS----ITKAYGRVDADGS-RYLLGDHAGLLHLL-VITHEKEKVTGLKIEL 244
                 P++      IT A    DA  + +++L    G +++L +++     V     E 
Sbjct: 307 EGSTMFPLKIQEDVVITLAESSCDAIATDQFILSLKGGEIYVLSLLSDGLRTVRSFHFEK 366

Query: 245 LGETSIASTISYLDNAVVYIGSSYGDSQLIKL------NLQPDAKGSYVEVL-------- 290
              + +AS + ++++  V++GS  G+S L++       ++   +K + VE +        
Sbjct: 367 AAGSVLASCVCWIEHGFVFLGSRLGNSLLLRYTEKDSASIAEKSKEAKVEKMYGGGVGGG 426

Query: 291 ----ERYVNLGPIVDFCVVDL---------ERQGQGQVVTCSGAYKDGSLRIVRNGIGIN 337
               +  +N+GPI    + +           RQ   ++V CSG  K+G+L +++  I   
Sbjct: 427 IIVCDSLLNIGPITKAALGEPAFLSEEFFGSRQIDLEMVCCSGYGKNGTLTVLQRSIRPQ 486

Query: 338 EQASVELQGIKGMWSL--RSSTD--DPFDTFLVVSFISETRILAMNLEDELEETEIEGFC 393
              + EL G   MW++  +SS +  + + ++L++S    T +L      E+ E +  GF 
Sbjct: 487 VVTTFELPGCVNMWTVCGKSSKESVENYHSYLILSRDDSTMVLKTGA--EITELDNSGFN 544

Query: 394 SQTQTLFCHDAIYNQ-LVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLA 452
            Q  T+F  + + N+ ++QV   S+ L+  T +   N         +   + +   V++ 
Sbjct: 545 VQQPTIFACNHLSNKYILQVCPQSIHLLEDTVQ--INSISLQDTIKITQCSISDPYVVMV 602

Query: 453 TGGGHLVYLEI-----GDGILTEV 471
              G L+YL++     G+G ++ +
Sbjct: 603 DSTGQLIYLQVIEDSEGNGKISSI 626



 Score = 76.3 bits (186), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 80/324 (24%), Positives = 144/324 (44%), Gaps = 28/324 (8%)

Query: 731  YYCVGTAYVLPEENEPTKGRILVF-----IVEDG------KLQLIAEKETKGAVYSLNAF 779
            Y  VGT +   E+    KGRIL+F     + E G      K + + +KE KG V ++ A 
Sbjct: 802  YLVVGTTFNYGED-LACKGRILIFDVLEVVPEPGQPLTKTKCKCVYDKEQKGPVTAICAT 860

Query: 780  NGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISL 839
            +G ++AA+ QKI  +K+  +D+   +L         +  + +    + IV  D+ +SISL
Sbjct: 861  SGYIIAAVGQKIYAFKY--KDN---DLVGVAFVDSQVFTVNLMAIRNVIVAADISRSISL 915

Query: 840  LIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG-----AENNFNLFTVRKNSEGATDE 894
            + ++ E  ++   +RD         E   D   +G     AE N  +F+ +   E     
Sbjct: 916  VRFQVEHKSLALVSRDTKTLEAYTSEFFIDGSQVGFVVSDAERNIVIFSYQ--PEALESF 973

Query: 895  ERGRLEVVGEYHLGEFVNRFRHGSLVM--RLPDSDVGQIPTVIFGTVNGVIGVIASLPHE 952
               RL    + ++G  VN      L+   +       Q   +I  T++G IG++  L  +
Sbjct: 974  GGHRLLQKADINIGSHVNTMMRIKLIQDEQSLSKSSEQRQLIILPTLDGSIGILFPLSEK 1033

Query: 953  QYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVD--AKNFLDGDLIESFLDLSRTR 1010
             +  L  LQ  L   +    GLN   +R+ +   +T+    +N LDG L++ +  LS   
Sbjct: 1034 PFRRLTMLQNKLVDCLPHKAGLNPRAFRALDVPLRTLTNPHRNILDGQLLDKYAQLSFQE 1093

Query: 1011 MDEISKTMNVSVEELCKRVEELTR 1034
              +I+K M  +  ++   + ++ R
Sbjct: 1094 RFDIAKKMGTTSGQILDDMMDIER 1117


>gi|358372791|dbj|GAA89393.1| cleavage and polyadenylation specificity factor subunit A
            [Aspergillus kawachii IFO 4308]
          Length = 1372

 Score = 96.3 bits (238), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 97/429 (22%), Positives = 179/429 (41%), Gaps = 52/429 (12%)

Query: 654  IRSIPLGEHPRRICHQEQSRTFAI--CSLKNQSCAEESEMH-----------------FV 694
            ++ + LGE    + +   S  + +  C   +    ++ E+H                 F+
Sbjct: 940  LKKVHLGEQVDHLAYSTSSGMYVLGTCHATDFKLPDDDELHPEWRNEAISFFPSARGSFI 999

Query: 695  RLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVY-----YCVGTAYVLPEENEPTKG 749
            +L+   T+  I +Y L T EY  +I + S     N +       VGTA+    E+ P++G
Sbjct: 1000 KLVSPNTWSIIDSYSLGTDEYVMAIKNISLEISENTHERKDLIVVGTAFAR-GEDIPSRG 1058

Query: 750  RILVFIV-----------EDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWML 798
             I VF V            D KL+LI ++  KGAV +L+   G+    + Q  +     L
Sbjct: 1059 CIYVFEVVQVVPDPDDPETDRKLKLIGKESVKGAVTALSEIGGQGFVLVAQGQKCMVRGL 1118

Query: 799  RDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNA 858
            ++DG+    +      ++  +         ++GD +K I    Y  E   +   A+D + 
Sbjct: 1119 KEDGSLLPVAFMDMQCYVSVVKELKGTGMCILGDAVKGIWFAGYSEEPYKMSLFAKDLDY 1178

Query: 859  NWMSAVEILDDDIYLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRF- 914
              +SA E L D   L    A+++ N+  ++ + E        +L    ++H G F +   
Sbjct: 1179 LEVSAAEFLPDGRRLFIVVADSDCNIHVLQYDPEDPKSSNGDKLLSRSKFHTGNFASTLT 1238

Query: 915  -------RHGSLVMRLPDSDV---GQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNL 964
                       ++    D D+     +  V+  T NG +G+I  +P E Y  L  LQ+ L
Sbjct: 1239 LLPRTMVSSEKMISNSDDMDIDNQSALHQVLMTTQNGSLGLITCMPEESYRRLSALQSQL 1298

Query: 965  RKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEE 1024
               ++   GLN   +R+   E      +  LDG+L+  ++D+S+ R  EI+  +     E
Sbjct: 1299 TNTLEHPCGLNPRAFRAV--ESDGTAGRGMLDGNLLFKWIDMSKQRKTEIAGRVGAREWE 1356

Query: 1025 LCKRVEELT 1033
            +   +E ++
Sbjct: 1357 IKADLEAIS 1365


>gi|328771718|gb|EGF81757.1| hypothetical protein BATDEDRAFT_34564 [Batrachochytrium dendrobatidis
            JAM81]
          Length = 1248

 Score = 95.9 bits (237), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 161/664 (24%), Positives = 273/664 (41%), Gaps = 79/664 (11%)

Query: 295  NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGI-NEQASVELQGIKGMWSL 353
            N GP +DFC+ D        +   +GA   G ++ +R+G+G+  +  + +  G   +W L
Sbjct: 439  NAGPTLDFCLADTLGNNTDTLYMTAGAAPLGYIQEIRHGVGVLIDDTTKQFDGAIKLWGL 498

Query: 354  RSSTDDPFDTFLVVSFISETRILAMNLEDELEE-TEIEGFCSQTQTL---FCHDAIYNQL 409
            R+S +D  D+ LV SF++ TRI+ M  +DE E+ ++I GF     TL    C  + Y   
Sbjct: 499  RTSCEDTVDSLLVASFVASTRIMYMQ-DDEFEDISDISGFTIDVATLNTAACFVSGY--F 555

Query: 410  VQVTSGSVRLVSSTSRELRN------EWKSPPGYSVNVATANASQVLLA-TGGGHLVYLE 462
            +QV    + +      +  N       W  P  + V  +      VLL  TG   L+ L+
Sbjct: 556  IQVHLHGIIIARPLLDKTINNNTCTANWSPPNDHKVGFSAFYQDCVLLTLTGENSLMLLK 615

Query: 463  IGDGILTEVKHAQLEYEIS-CLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITK 521
            +   I+ E  + Q+E   S  LDI P  +  S          T I+V     P    I K
Sbjct: 616  V---IIQENSNVQIESISSISLDIEPSSQYYS----------TPINV-----PHSICIIK 657

Query: 522  EHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLR 581
                 EI  RS  L +     +LL  + DG LL+F L+ +  ++     V L      L 
Sbjct: 658  ---SKEISSRSNALISKP---HLLIGIRDGALLDFTLDFENDQINFMPPVILQLGDCPLD 711

Query: 582  TFSSKNT----THVFAASDRP-TVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDS-LAI 635
               S +T     +V   +++   +  S    +  S +  K + H   F   A  D+    
Sbjct: 712  LVYSHHTDPLDRYVLGVTNQTWQLSVSPIGGIRISQLLHKPILHAARFLYDADVDTGFLF 771

Query: 636  AKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVR 695
                 L+   +D     HIR+I +G+ PRRI     ++   +    +Q  A+E  +  V+
Sbjct: 772  LTSDTLSFVKVDPESGYHIRNINMGDTPRRILVDPVTKLLVVAG-SSQGDADEI-LTTVK 829

Query: 696  LLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFI 755
            +++  T +  +T  L   E   S++       +  Y C+GT          T GR+LVF 
Sbjct: 830  VMNPDTGQVYATERLAANETIHSLIVWHVK-PTKRYICMGTRI------HATSGRVLVFG 882

Query: 756  VEDG------KLQLIAEKETKGAVYSLNAF-NGKLLAAINQ-----KIQLYKWMLRDDGT 803
            ++        K  L+ +    G V +L  F N  LLA+        KI+     +    +
Sbjct: 883  LKPATKNKHIKFTLMGQYTLNGPVLALCTFVNSYLLASAGSTLYQLKIEAVHRTITAGAS 942

Query: 804  RELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSA 863
             ++ S       I AL  Q     I + +   SIS+  +     A      D  +   S 
Sbjct: 943  IDINSII---TRIHALKTQ-----IFIANTQDSISVYKFDIATKAFAFIKSDVTSRVGSE 994

Query: 864  VEILDDDIYLGAENNFNLFTVRKN----SEGATDEERGRLEVVGEYHLGEFVNRFRHGSL 919
               LDD + +G + + N++ +  N    S+     +   ++   E+H+ + V + + GS+
Sbjct: 995  CFPLDDSLVIGTDRHGNIYGLDTNQGEDSDTHESTDSQSMQTGFEFHILDIVLQLKPGSM 1054

Query: 920  VMRL 923
              RL
Sbjct: 1055 KHRL 1058


>gi|49619065|gb|AAT68117.1| cleavage and polyadenylation specific factor 1 [Danio rerio]
          Length = 1105

 Score = 95.9 bits (237), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 112/475 (23%), Positives = 206/475 (43%), Gaps = 64/475 (13%)

Query: 618  VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQS 672
            +    PF++   P   L   K+GEL I  +           +R IPL      + +  +S
Sbjct: 634  IESFSPFHNINCPKGFLYFNKQGELRISVLPTYLSYDAPWPVRKIPLRCTVHYVSYHVES 693

Query: 673  RTFAICSLKNQSC-------AEESEMHFV----------------RLLDDQTFEFI--ST 707
            + +A+C+   + C        EE E   +                +L+   ++E I  + 
Sbjct: 694  KVYAVCTSVKEPCTRIPRMTGEEKEFETIERDERYIHPQQDKFSIQLISPVSWEAIPNTR 753

Query: 708  YPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-----IVE 757
              L+ +E+   + + +      V     Y  +GT  +  EE    +GRIL+      + E
Sbjct: 754  VDLEEWEHVTCMKTVALKSQETVSGLKGYVALGTCLMQGEEVT-CRGRILILDVIEVVPE 812

Query: 758  DG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRD-DGTRELQSEC 810
             G      K +++ EKE KG V +L   +G L++AI QKI L+     D  G   + ++ 
Sbjct: 813  PGQPLTKNKFKVLYEKEQKGPVTALCHCSGFLVSAIGQKIFLWSLKYNDLTGMAFIDTQL 872

Query: 811  GHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDD 870
                +I  +Y  +  +FI+  D+MKSISLL Y+ E   +   +RD     + ++E + D+
Sbjct: 873  ----YIHQMY--SIKNFILAADVMKSISLLRYQPESKTLSLVSRDAKPLEVYSIEFMVDN 926

Query: 871  IYLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSD 927
              LG   ++ + NL       E        RL    ++++G  VN F        L  ++
Sbjct: 927  NQLGFLVSDRDKNLMVYMYLPEAKESFGGMRLLRRADFNVGSHVNAFWRMPCRGTLDTAN 986

Query: 928  VGQIP-----TVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSF 982
               +         F T++G +G++  +  + Y  L  LQ  L  ++    GLN + +R  
Sbjct: 987  KKALTWDNKHITWFATLDGGVGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPKAFRML 1046

Query: 983  NNEKKTVD--AKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRL 1035
            + +++T+    KN LDG+L+  +L LS     E++K +  + + +   + E+ R+
Sbjct: 1047 HCDRRTLQNAVKNILDGELLNKYLYLSTMERSELAKKIGTTPDIILDDLLEIERV 1101


>gi|342181720|emb|CCC91200.1| putative damage-specific DNA binding protein [Trypanosoma congolense
            IL3000]
          Length = 1274

 Score = 95.5 bits (236), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 178/826 (21%), Positives = 324/826 (39%), Gaps = 131/826 (15%)

Query: 295  NLGPIVDFCV-VDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWS 352
            N GP+ D  V +D  R G   V   +G   +G L  VR  + ++    + + Q ++ +  
Sbjct: 426  NCGPVFDITVALDGPRTG---VFAGTGVDHNGGLSFVRAAVSVSRDVCISDFQDVRCVCV 482

Query: 353  LRSSTDDPFDTFLVVSFISETRILAMNLEDE--LEETEIEGFCSQTQTLF-CHDAIYNQL 409
               S D      ++ SF   +RI    + +   LEE     F +  +TL   +   +   
Sbjct: 483  ---SHD-----IIIFSFPGYSRICRYCVGETTVLEELISSTFDTSRETLLLTYSDEHEAF 534

Query: 410  VQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILT 469
            +QVT+  VR+V         E     G  +  A AN   ++ +         E+    + 
Sbjct: 535  LQVTTAGVRIVKPEEGSYLFECA---GNGIEHAHANEGLIVFSNPA------ELNAFSVR 585

Query: 470  EVKH-AQLE--YEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGG 526
            E  H A L   +E SCL +       S+      G W   +V +++L D  +  K     
Sbjct: 586  EAAHIANLHQVHETSCLWVL---SGSSF----LTGEWGTCAVNLYALVDRAVQHKACFNC 638

Query: 527  EIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSK 586
               P S+ + A    + LL  L +G++++  ++ + G  T   +  +  QP+ L    S 
Sbjct: 639  SATPCSMCVVARPEGNRLLVGLLNGYVVDVPVD-EVGSSTTVTETVVTMQPVRLFNLESH 697

Query: 587  NTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVS--HMCPFNSAAFPDSLAIAKEGELTIG 644
            N   V    + P ++  +  +   + V+L +V+   + P  ++     L    E  L  G
Sbjct: 698  NA--VLCLGEVPLILIVTETRFQLTGVDLNDVALGAVIPNPNSPLRYILFSKTERALIFG 755

Query: 645  TIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAI-----------CSLKNQSCAEESEMH- 692
             I ++QKL+I  I L     R+     S  F +              K ++   +S +  
Sbjct: 756  NISNVQKLNIDFIGLKATVTRLKFMPWSGIFVVSIRRTDKDQLLAVTKQETLCPDSLLEN 815

Query: 693  --FVRLLDDQTFEFISTYPLD-TFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKG 749
               + LL+++   FI +  L  + E+     SC  + + +    +GT +V P E      
Sbjct: 816  PTSLELLENERCVFIESTVLGGSNEWS----SCGDAGNKDSVVLIGTTFVFPHEQLSRSS 871

Query: 750  RILVFIVEDG-------KLQLIAEKETKGAVY---SLNAFNGKLLAAINQKIQLYKWMLR 799
            R     VE G        L+L    + +GA+    S+  + G++   I+  + L+ W   
Sbjct: 872  RFTWCTVEAGGVAPERATLRLQGRTDVEGALQCCCSVPNYAGRIALGISGCVALFSWNPA 931

Query: 800  DDGTRELQSECGHHGHILALYVQT-RGD--FIVVGDLMKSISLLIYKHEEGAIEERARDY 856
            D  +  +  E    G I+   V T +GD  +IV  D+  S   +     +G++   ARD 
Sbjct: 932  D--STFVPEETIRVGTIVTRIVPTVQGDTSYIVACDVRHSCFFIRVDTIQGSLSIAARDP 989

Query: 857  NANWM---SAVEILDDDIYLGAENNFNLFTVRKNSEGATDEE-------RGRLEVVGEYH 906
                +   + ++         A++ FN F+V   +      E         RL    +YH
Sbjct: 990  ELRGVMDGTVIQYPSHHDTCFADDCFNFFSVSHTTPSTDSSEPPPSTITTQRLRTTAQYH 1049

Query: 907  LGEFVNRFRHGSLV--------MRLPDSDVGQI--PTVIFGTVNGVIGVIASLPHEQYLF 956
            +G+ +   + GS          + +P + V  +  P + +GT +G  G I  + +E +L 
Sbjct: 1050 IGDLITAMQQGSFAPCSVINDFVPVPMALVPGVCGPQIAYGTSHGAFGTITPVTNETFLL 1109

Query: 957  LEKLQTNLRKVIKGVGGLNHEQWR---------------SF-----------NNEKKTVD 990
            L+ ++ ++  V+  +GG  H  +R               SF           N  ++   
Sbjct: 1110 LKSIEISVAAVLPPLGGFCHSSYREVLCAGQERGMSRNTSFEVLNPKAAEVMNRRRQKYL 1169

Query: 991  AKNFLDGDLIESFLDLS--------RTRMDEI---SKTMNVSVEEL 1025
             K    GD++E FL +S        +   D+I   S + N S+EE 
Sbjct: 1170 PKCVCSGDVVEMFLRMSDSEKRSTIKLASDQIVRWSPSSNTSLEEF 1215


>gi|194756960|ref|XP_001960738.1| GF11349 [Drosophila ananassae]
 gi|190622036|gb|EDV37560.1| GF11349 [Drosophila ananassae]
          Length = 1455

 Score = 95.5 bits (236), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 108/435 (24%), Positives = 199/435 (45%), Gaps = 62/435 (14%)

Query: 654  IRSIPLGEHPRRICHQEQSRTFAICSL-------------KNQSCAEESE-MHFVR---- 695
            IR +PL   PR++ +  ++R + + +              +++  +EES    F+     
Sbjct: 1026 IRKVPLRCTPRQLVYHRENRVYCLITQNEEPMTKFYRFNGEDKELSEESRGERFIYPIGS 1085

Query: 696  -----LLDDQTFEFI--STYPLDTFEY--GCSILSCSFSDDSN---VYYCVGTAYVLPEE 743
                 L+  +T+E +  ++   + +E+     I+  S+    +    Y C+GT +    E
Sbjct: 1086 QFEMVLISPETWEIVPDASIRFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNY-SE 1144

Query: 744  NEPTKGRILVF-----IVEDGK------LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQ 792
            +  ++G I ++     + E GK      L+ + +KE KG V +++   G L+  + QKI 
Sbjct: 1145 DITSRGNIHIYDIIEVVPEPGKPMTKFKLKEVFKKEQKGPVSAISDVLGFLVTGLGQKI- 1203

Query: 793  LYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEER 852
             Y W LRD    +L        +I    + T    I + D+ KSISLL ++ E   +   
Sbjct: 1204 -YIWQLRDG---DLIGVAFIDTNIYVHQIITVKSLIFIADVYKSISLLRFQEEYRTLSLA 1259

Query: 853  ARDYNANWMSAVEILDDDIYLG-----AENNFNLFTVRKNSEGATDEERGRLEVVGEYHL 907
            +RD+N   +  +E + D+  LG     AE N  ++  +  +  +   ++  L    +YHL
Sbjct: 1260 SRDFNPLEVYGIEFMVDNSNLGFLVTDAERNLIVYMYQPEARESLGGQK--LLRKADYHL 1317

Query: 908  GEFVNR-FR----HGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQT 962
            G+ VN  FR       L  R P     +   V++GT++G +G    LP + Y     LQ 
Sbjct: 1318 GQVVNTMFRVQCHQKGLHQRQPFLYENK-HFVVYGTLDGALGYCLPLPEKLYRRFLMLQN 1376

Query: 963  NLRKVIKGVGGLNHEQWRSFNNEKK--TVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV 1020
             L    + + GLN +++R+    KK     ++  +DGDLI S+  L+ +  +E++K +  
Sbjct: 1377 VLLSYQEHLCGLNPKEYRTIKAVKKQGINPSRCIIDGDLIWSYRLLANSERNEVAKKIGT 1436

Query: 1021 SVEELCKRVEELTRL 1035
              EE+   + E+ RL
Sbjct: 1437 RTEEILSDLLEIERL 1451



 Score = 57.0 bits (136), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 71/332 (21%), Positives = 136/332 (40%), Gaps = 65/332 (19%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
           +YG + +L+     G  +D L I+ +  K  VLQ D ++  L T ++    +   R    
Sbjct: 76  LYGNVMSLQCVSLAGAMRDALLISFKDAKLSVLQHDPDTYALKTLSLHYFEEEDIRGGWT 135

Query: 66  GQIGI----IDPDCRLIGLHLYDGLFKVIPFDNKGQLKE--------------------- 100
           G+  +    +DPD R   + +Y     V+PF     L E                     
Sbjct: 136 GRYFVPVVRVDPDSRCAVMLVYGKRLVVLPFRKDNTLDEIELADVKPIKKAPTAMVTRTP 195

Query: 101 ---AFNIRLEEL-----QVLDIKFLYGCAKPTIVVLYQDNKDAR-HVKTYE-------VA 144
              ++ I L +L      VLDI+FL+G  +PT+++LY+  +     +K          ++
Sbjct: 196 IMASYLIALRDLDEKIDNVLDIQFLHGYYEPTLLILYEPVRTCPGRIKVRSDTCVLVAIS 255

Query: 145 LKDKDFVEG-PWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVY-------------CSA 190
           L  +  V    W+ N+L      + P+  P+ G L++    ++Y              SA
Sbjct: 256 LNIQQRVHPIIWTVNSLPFDCQQVYPIQKPIGGCLVMTVNAVIYLNQSVPPYGVSLNSSA 315

Query: 191 NAFKAIPIRP------SITKA-YGRVDADGSRYLLGDHAGLLHLLVITHEKEK-VTGLKI 242
           +   + P++P      S+  A +  +D D  + ++    G L++L +  +  + V     
Sbjct: 316 DNSTSFPLKPQDGVRISLDCANFAFIDVD--KLVISLRTGDLYVLTLCVDSMRTVRNFHF 373

Query: 243 ELLGETSIASTISYLDNAVVYIGSSYGDSQLI 274
                + + S I  L +  +++GS  G+S L+
Sbjct: 374 HKAAASVLTSCICVLHSEYIFLGSRLGNSLLL 405



 Score = 42.4 bits (98), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 43/159 (27%), Positives = 70/159 (44%), Gaps = 26/159 (16%)

Query: 288 EVLERYVNLGPIVDFCV---VDLERQGQG-------------QVVTCSGAYKDGSLRIVR 331
           EV +  +N+ PI   C    V+ E  G               ++V  +G  K+G+L +  
Sbjct: 485 EVCDSLINVAPINYMCAGERVEFEEDGTTLRPHAENLNDLKIELVAATGHSKNGALSVFV 544

Query: 332 NGIGINEQASVELQGIKGMWSL------RSSTDDPFDTFLVVSFISETRILAMNLEDELE 385
           N I      S EL G   +W++      ++S  D  D F+++S  + T +L      E+ 
Sbjct: 545 NCINPQIITSFELDGCLDVWTVFDDATKKTSRHDQHD-FMLLSQRNSTLVLQTG--QEIN 601

Query: 386 ETEIEGFCSQTQTLFCHDAIYNQ-LVQVTSGSVRLVSST 423
           E E  GF     T+F  +    + +VQVT+  VRL+  T
Sbjct: 602 EIENTGFTVNQPTIFVGNLGQQRFIVQVTTRHVRLLQGT 640


>gi|332018184|gb|EGI58789.1| Cleavage and polyadenylation specificity factor subunit 1 [Acromyrmex
            echinatior]
          Length = 1412

 Score = 95.5 bits (236), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 122/515 (23%), Positives = 221/515 (42%), Gaps = 81/515 (15%)

Query: 575  TQPITLRTFSS-KNTTHVFAASDRPTVIYSSNKKLLYSNVNLKE--VSHMCPFNSAAFPD 631
            T+   +R FS+      VF  SD P  I+ + +  L ++    +  V+   PFN+   P 
Sbjct: 897  TRICVMRYFSNIAGYNGVFICSDYPHWIFLTGRGELRTHPMGIDGPVTSFAPFNNINCPQ 956

Query: 632  S-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCA 686
              L   ++ EL I  +           +R +PL   P  + +  +S+T+ + +    S A
Sbjct: 957  GFLYFNRKEELRICVLPTHLSYDAPWPVRKVPLRCTPHFVTYHLESKTYCVIT----STA 1012

Query: 687  EESE------------------MHFVRLLDDQTFEFIS--TYP-----LDTFEYGCSILS 721
            E  +                    F R+  +  F  +S  T P     LD +E+   + +
Sbjct: 1013 EPLKSYYRFNGEDKVLTKLYYLFQFSRIFMNLLFSPVSWETIPNTKIELDQWEHVTCLKN 1072

Query: 722  CSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-IVE----------DGKLQLIA 765
             S + +        Y  +GT Y   E+   ++GRIL+F I+E            + + I 
Sbjct: 1073 VSLAYEGTRSGLKGYIVLGTNYNYGEDIT-SRGRILIFDIIEVVPEPGQPLTKNRFKQIY 1131

Query: 766  EKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDD---GTRELQSECGHHGHILALYVQ 822
             KE KG + ++   +G L++A+ QKI  Y W L+D+   G   + ++   H  +L++   
Sbjct: 1132 AKEQKGPITAITQVSGFLVSAVGQKI--YIWQLKDNDLVGVAFIDTQIYIH-QMLSI--- 1185

Query: 823  TRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG-----AEN 877
                 I++ D+ KSISLL ++ E   +   +RD+    +  +E L D+  LG      E+
Sbjct: 1186 --KSLILIADVYKSISLLRFQEEYRTLSLVSRDFRPAEVYTIEYLIDNSNLGFIVADGES 1243

Query: 878  NFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIP----- 932
            N  LF  +  S  +   +  +L    ++HLG+ +N F      +  P +D  Q       
Sbjct: 1244 NLALFMYQPESRESLGGQ--KLIRKADFHLGQKINTFFRIKCRITDPANDKKQFSGADKR 1301

Query: 933  -TVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNN---EKKT 988
               ++ +++G +G I  +P + Y  L  LQ  L   I  + GLN + +R         + 
Sbjct: 1302 HVTMYASLDGSLGYILPVPEKTYRRLLMLQNVLVTHICHIAGLNPKAYRHTYKSYVRNQG 1361

Query: 989  VDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVE 1023
              A+  +DGDL+  +L L      +++K +   V+
Sbjct: 1362 NPARGIIDGDLVWRYLFLPNNEKADLAKKIGTRVQ 1396



 Score = 50.8 bits (120), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 65/330 (19%), Positives = 140/330 (42%), Gaps = 62/330 (18%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRP--T 63
           ++G I +++     G  +D L ++    K  V+++D +  +L T ++    +   +   T
Sbjct: 73  LHGNIMSMQAVHLIGSQRDSLLLSFRDAKLSVVEYDQDIHDLRTVSLHYFEEEEIKDGWT 132

Query: 64  DNGQIGII--DPDCRLIGLHLYDGLFKVIPF-------------------DNKGQLKEAF 102
           ++  I I+  DP+ R   + ++     V+PF                    NK  +  ++
Sbjct: 133 NHHHIPIVRVDPEGRCAVMLIFGRKLVVLPFRKDPSLDDGDLLDSAKLTSTNKTPILSSY 192

Query: 103 NIRLEELQ-----VLDIKFLYGCAKPTIVVLYQDNKD------ARHVKTYEVALK---DK 148
            I L+ L+     V+D++FL+G  +PT+++LY+  +        R      VA+     +
Sbjct: 193 MIVLKTLEEKMDNVIDLQFLHGYYEPTLLILYEPVRTFSGRIAVRQDTCAMVAISLNIQQ 252

Query: 149 DFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAFKA 195
                 WS +NL       +PV  PL G LI+   +++Y +             A++   
Sbjct: 253 RVHPIIWSVSNLPFDCYQAVPVKKPLGGTLIMAVNSLIYLNQSIPPYGVSLNSLADSSTN 312

Query: 196 IPIRPSITKAYGRVDADGS--------RYLLGDHAGLLHLL-VITHEKEKVTGLKIELLG 246
            P++P   +   ++  +GS        R ++   +G L++L +       V G   +   
Sbjct: 313 FPLKP---QEGVKMSLEGSQVAFISADRLVISLKSGELYVLSLFADSMRSVRGFHFDKAA 369

Query: 247 ETSIASTISYLDNAVVYIGSSYGDSQLIKL 276
            + + S +   ++  +++GS  G+S L++ 
Sbjct: 370 ASVLTSCVCMCEDNYLFLGSRLGNSLLLRF 399



 Score = 44.7 bits (104), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 41/154 (26%), Positives = 75/154 (48%), Gaps = 20/154 (12%)

Query: 285 SYV-EVLERYVNLGPIVDFCVVD--------LERQGQG-QVVTCSGAYKDGSLRIVRNGI 334
           SY+ EV +  +N+GP  +  + +        L+ Q    ++VT SG  K+G+L +++  I
Sbjct: 467 SYIFEVCDSLLNIGPCGNISMGEPAFLSEEFLQNQDPDVELVTTSGYGKNGALCVLQRSI 526

Query: 335 GINEQASVELQGIKGMWSL-------RSSTDDPFDTFLVVSFISETRILAMNLEDELEET 387
                 + +L G + MW++          T++    FL++S   E   + +    E+ E 
Sbjct: 527 RPQVVTTFQLPGCEDMWTVIGIVNNDEIRTEEGSHAFLILS--QEDSTMVLQTGQEINEV 584

Query: 388 EIEGFCSQTQTLFCHDAIYNQ-LVQVTSGSVRLV 420
           +  GF +Q  T+F  +   N+ +VQVT   VRL+
Sbjct: 585 DQSGFSTQGSTVFAGNLGANRYIVQVTQMGVRLL 618


>gi|157141630|ref|XP_001647735.1| spliceosomal protein sap [Aedes aegypti]
 gi|108867854|gb|EAT32403.1| AAEL015441-PA, partial [Aedes aegypti]
          Length = 645

 Score = 95.1 bits (235), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 128/649 (19%), Positives = 253/649 (38%), Gaps = 112/649 (17%)

Query: 475  QLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSV- 533
            ++  ++ C+ +  +      S   AVG+  D +VR+ SL   + ++   +  + +P +  
Sbjct: 12   KMPSDVMCMALGSVPAGEQRSWFLAVGL-ADNTVRVISLDPSDCLSPRSM--QALPSAAE 68

Query: 534  LLCAFE---------------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPI 578
             LC  E               G  YL   L +G LL  +L+  +G+L D +   LG++P+
Sbjct: 69   SLCIVEMGTGDTNEEGVASSAGCIYLNIGLTNGVLLRTVLDPVSGDLADTRTRYLGSRPV 128

Query: 579  TLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKE 638
             L     + +  V A S R  + Y    +   + ++ + + +   F+S    + +     
Sbjct: 129  KLFRIKMQGSEAVLAMSSRTWLSYYFQNRFHLTPLSYETLEYASGFSSEQCSEGIVAIST 188

Query: 639  GELTIGTIDDIQKLHIR-SIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLL 697
              L I  ++ +  +  + + PL   P+R     ++    I    + +  EE++    + +
Sbjct: 189  NTLRILALEKLGAVFNQITFPLEYTPKRFLIHNETGKLVISETDHNAYTEETKNIRKKQM 248

Query: 698  DDQTFE----------------FIS-TYPLDTF--------------------------- 713
             D+  E                FI+   P D F                           
Sbjct: 249  ADEMKEAAGEDEQELANEMADAFINEVLPEDVFSAPKAGTGMWASQIRVMDPINGHTYSK 308

Query: 714  ------EYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKG---RILVFIVEDGKLQLI 764
                  E   S+    F+ D   Y   G A  L    +   G    +  + V   +L+  
Sbjct: 309  VQLAQNEAVLSMALVRFAVDQKWYVVAGVAKDLQMNPKIANGGFIDVYKYDVHTHQLEHY 368

Query: 765  AEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHI--LALYVQ 822
               E   A  ++  F G++L  + + +++Y     D G ++L  +C  + HI    + +Q
Sbjct: 369  HRTEIDDAPGAIAGFQGRVLVGVGRVLRIY-----DLGKKKLLRKC-ENKHIPNQIVNIQ 422

Query: 823  TRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLF 882
              G  + V D+ +SI  + YK  E  +   A D +  W++   +LD D    A+   N+ 
Sbjct: 423  AMGSRVFVSDVQESIYCIKYKRAENQLIIFADDTHPRWITTSTLLDYDTVATADKFGNIA 482

Query: 883  TVRKNSEGATD-----------EERGRL-------EVVGEYHLGEFVNRFRHGSLVMRLP 924
             +R     + D            +RG L       E +  +HLGE +   +  +L+    
Sbjct: 483  ILRLPHSVSDDVDEDPTGNKALWDRGLLNGASQKAENICTFHLGETIMSLQKATLI---- 538

Query: 925  DSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFN 983
                G   ++I+ T++G +G +      E Y F + L+ ++R     + G +H  +RS+ 
Sbjct: 539  ---PGGSESLIYATMSGTVGALVPFTSREDYDFFQHLEMHMRNENTPLCGRDHLSYRSY- 594

Query: 984  NEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
                    K+ +DGDL E F  +   +   I+  +  +  E+ K++E++
Sbjct: 595  ----YYPVKHVMDGDLCEQFTSMDPAKQKSIASDLGRTPNEVAKKLEDI 639


>gi|156086042|ref|XP_001610430.1| hypothetical protein [Babesia bovis T2Bo]
 gi|154797683|gb|EDO06862.1| conserved hypothetical protein [Babesia bovis]
          Length = 1450

 Score = 95.1 bits (235), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 198/943 (20%), Positives = 339/943 (35%), Gaps = 216/943 (22%)

Query: 292  RYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMW 351
            +  NLGPIVDF        G   ++ C G   +G +  + NG+ I+  AS    G+    
Sbjct: 523  KQTNLGPIVDFTFGPQSDSGAIPILACCGYGAEGRVCSITNGVAIDVMASSPASGVLYTT 582

Query: 352  SLRSSTDDPFDTFLVVSFISETR----ILAMNLEDELEETEIE----------------- 390
            +L    +   +  +  S+ + TR    ++    + +  +T I+                 
Sbjct: 583  ALPLFNEANNNFMICTSYFNRTRFYKVVVPPAPKRDPSKTAIDFVKQPLWVVTSMDPSSH 642

Query: 391  GFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELR------NEWKSPPGYSVNVATA 444
             F    +TL         ++QVT+  + LV+  S  +       N+     G +V     
Sbjct: 643  RFVEHMRTLLMVPYGTQMVLQVTTNGISLVNYKSVAIPRVDHSLNDICKIAGIAVGSGVM 702

Query: 445  NAS---------QVLLATGGGHLVYLEIGDGILTEVKHAQ-LEYEISCLDINPIGE--NP 492
              S          V +       + L++ +G+  ++ H   L  ++S       G+    
Sbjct: 703  PISIVSCHLCDSGVFIGLSNHVFLILDVSNGV--KILHKSVLPKQVSSTAYLSGGDFKTK 760

Query: 493  SYSQIAAVGMWTDISVRIFSLPDLNLI--TKEHLGGEIIPRSVLLCAFEGISYLLCALGD 550
              + + AV  W D  + + S   L ++  TK   G  +  R+V       ++ +  AL D
Sbjct: 761  RRTGLIAVSTWEDTEISLISPEGLQMLHTTKVPCGYGVAIRAVRFGVVGDVALVFAALSD 820

Query: 551  GHLLNFLLNM----KTGELTDRK---------KVSLGTQPITLRTFSSKNTT-------- 589
            G L  + L      K G  T+ +         KVS G  PI L   S   +         
Sbjct: 821  GTLCVYRLKFGDCDKDGLGTNTQLSMVMENVIKVSNG--PIGLDAISMHASVPGSGGTNL 878

Query: 590  ---HVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSA-AFPD--SLAIAKEGELTI 643
                +    D P +IY++  KL Y  VN+  +  +  FN     P   SL    +  + I
Sbjct: 879  LKNRIVTTGDNPMLIYANRGKLEYVPVNVPRIDTVTSFNFVEKNPQGVSLVFTDKKSIHI 938

Query: 644  GTIDDIQKLHIRSIPLGEHPRRIC-HQEQSRTFAIC-----------SLKNQSCAEESEM 691
            G +D   +LH+ +I  G     IC H E       C           S+ + S       
Sbjct: 939  GHMDTALQLHVETICSGRSFETICYHDESDLVVVGCNGELIADCDMPSISDSSVTHNDSS 998

Query: 692  HF--------------------VRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVY 731
             F                    V+ +   T E + T  +       S+ +  F D     
Sbjct: 999  VFRCMDVASCGTIPGVYVLKSCVKFIHLGTKEVVHTLNMPQRHVITSMCTVKF-DGPKTL 1057

Query: 732  YCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFN----------- 780
              +G++ +  ++  P++G I  F+V+     +I     K  V  L   N           
Sbjct: 1058 IALGSSLIQDKDGVPSQGYI--FLVD-----VIKSDAHKWNVVFLRTLNFIDKGVTQMTP 1110

Query: 781  --GKLLAAINQKIQLYKWMLR------DDGTRELQSECGHHG------HILALYVQTRG- 825
               +L+ A+N+ + +   ++R       D  R  + E  H        + L    + +  
Sbjct: 1111 CINQLVVALNETVAVLS-LVRGADPEPSDTLRSYKMEVLHESDESYGQYTLVTRAEYKSC 1169

Query: 826  ----------DFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGA 875
                      D IV+GDLM S+ +L ++  E  + E  +D+N+ + +A   +D    + A
Sbjct: 1170 SYVVSLDAYQDVIVIGDLMNSMRMLQWQGTE--LREVCKDFNSVYCTAAAAIDQTSCVVA 1227

Query: 876  ENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHG-------SLVMRLPDSDV 928
            +++ N +   K      D E  + E VG +H GE +NR R         +L+    D D+
Sbjct: 1228 DSSGNFYVFAKRQVVTNDAEAIKAEDVGLFHHGELINRIRRNPKVQSRVALLNGNSDHDM 1287

Query: 929  GQIPTVI-----FGTVNGVIGVIASLPHE----------------QYLF----------- 956
             + P        F  V G   V A++ H                 +Y F           
Sbjct: 1288 PEFPMKTYCNRPFCCVQG--DVSAAISHSSCNDLVRGLAKPNKFWKYGFKTILTCVTTSG 1345

Query: 957  ----------------LEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLI 1000
                            L  ++  + ++    G +++E WRSF N      AK F+DGD I
Sbjct: 1346 SLLQLCIFDDTKLFCRLAFVEEGINRIQPQAGNISNEHWRSFKNRWMMCAAKGFIDGDAI 1405

Query: 1001 ESFLDLSRTRMDE----ISKT----MNVSVEELCKRVEELTRL 1035
            ES+  L  +   E    +SKT    +  S E L   VE + RL
Sbjct: 1406 ESYNRLDTSLKSEVYNMVSKTDAHGLFYSPELLSLEVEHIQRL 1448


>gi|351713968|gb|EHB16887.1| Cleavage and polyadenylation specificity factor subunit 1
            [Heterocephalus glaber]
          Length = 1440

 Score = 95.1 bits (235), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 109/466 (23%), Positives = 192/466 (41%), Gaps = 74/466 (15%)

Query: 618  VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQS 672
            +    PF++   P   L   ++GEL I  +           +R IPL      + +  +S
Sbjct: 969  IDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVES 1028

Query: 673  RTFAICSLKNQSCAE-----------------------ESEMHFVRLLDDQTFEFI--ST 707
            + +A+ +  +  C                         + E   ++L+   ++E I  + 
Sbjct: 1029 KVYAVATSTSTPCTRIPRMTGEEKEFEAIERDDRYIHPQQEAFSIQLISPVSWEAIPNAR 1088

Query: 708  YPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRI-----LVFIVE 757
              L+ +E+   + + S   +  V     Y   GT  +  EE    +GR+     +  + E
Sbjct: 1089 IELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVT-CRGRVRDWERIEVVPE 1147

Query: 758  DG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECG 811
             G      K +++ EKE KG V +L   NG L++AI QKI L  W LR     EL     
Sbjct: 1148 PGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFL--WSLR---ASELTGMAF 1202

Query: 812  HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI 871
                +    + +  +FI+  D+MKSISLL Y+ E   +   +RD     + +V+ + D+ 
Sbjct: 1203 IDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNA 1262

Query: 872  YLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV 928
             LG   ++ + NL       E        RL    ++H+G  VN F       R P    
Sbjct: 1263 QLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF------WRTPCRGA 1316

Query: 929  GQIPT-----------VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHE 977
             + P+             F T++G IG++  +  + Y  L  LQ  L  ++    GLN  
Sbjct: 1317 SEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPR 1376

Query: 978  QWRSFNNEKKTVD--AKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
             +R  + +++ +    +N LDG+L+  +L LS     E++K +  +
Sbjct: 1377 AFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTMERGELAKKIGTT 1422



 Score = 55.8 bits (133), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 107/565 (18%), Positives = 219/565 (38%), Gaps = 151/565 (26%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAM-----GDVSDRIG 60
            +G + ++   +  G  +D L ++ +  K  V+++D  + +L T ++      ++ D   
Sbjct: 74  FFGNVMSMASVQLAGAKRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPELRDGFV 133

Query: 61  RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPF-------DNKGQLKEA----------FN 103
           +     ++ + DPD R   + +Y     V+PF       +++G + E            +
Sbjct: 134 QNVHTPRVRV-DPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLVGEGQRSSFLPSYIID 192

Query: 104 IR-LEE--LQVLDIKFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDFVEG-- 153
           +R L+E  L ++D++FL+G  +PT+++L++ N+      A    T  +     +  +   
Sbjct: 193 VRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKVH 252

Query: 154 P--WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAFKAIPI 198
           P  WS  +L       + VP P+ GV+I    +++Y +                  A P+
Sbjct: 253 PVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFPL 312

Query: 199 RPSITKAYGRVDADGS--------RYLLGDHAGLLHLL-VITHEKEKVTGLKIELLGETS 249
           R   T+   R+  D +        + ++    G +++L +IT     V     +    + 
Sbjct: 313 R---TQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASV 369

Query: 250 IASTISYLDNAVVYIGSSYGDSQLIKL--NLQ---------------PDAKGSYVE---- 288
           + +++  ++   +++GS  G+S L+K    LQ               P +K   V+    
Sbjct: 370 LTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASTVREAADKEEPPSKKKRVDSAAG 429

Query: 289 ----------------------------------VLERYVNLGPIVDFCVVD---LERQG 311
                                             V +  +N+GP  +  V +   L  + 
Sbjct: 430 WAGNKTVPQDEVDEIEVYGSEAQSGTQLATYSFEVCDSMLNIGPCANAAVGEPAFLSEEF 489

Query: 312 QG------QVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL------------ 353
           Q       ++V CSG  K+G+L +++  I      + EL G   MW++            
Sbjct: 490 QNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEEEETP 549

Query: 354 ------------RSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFC 401
                        +  D     FL++S    T IL      E+ E +  GF +Q  T+F 
Sbjct: 550 KAEGSEQEPSAPEAQDDGRRHGFLILSREDSTMILQTG--QEIMELDTSGFATQGPTVFA 607

Query: 402 HDAIYNQ-LVQVTSGSVRLVSSTSR 425
            +   N+ +VQV+   +RL+   ++
Sbjct: 608 GNIGDNRYIVQVSPLGIRLLEGVNQ 632


>gi|356530945|ref|XP_003534039.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Glycine max]
          Length = 1449

 Score = 94.7 bits (234), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 96/356 (26%), Positives = 164/356 (46%), Gaps = 40/356 (11%)

Query: 706  STYPLDTFEYGCSI----LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFI---VED 758
            +T P+ + E   ++    L  + S ++     +GTAYV   E+   +GRIL+F    + D
Sbjct: 1100 ATIPMQSSENALTVRMVTLLNTTSKENETLLAIGTAYV-QGEDVAARGRILLFSLGKITD 1158

Query: 759  GKLQLIAE---KETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHG- 814
                L++E   KE KGA+ +L +  G LL A   KI L+KW    +GT EL         
Sbjct: 1159 NPQTLVSEVYSKELKGAISALASLQGHLLIASGPKIILHKW----NGT-ELNGIAFFDAP 1213

Query: 815  --HILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIY 872
              H+++L +    +FI++GD+ KSI  L +K +   +   A+D+ +    A E L D   
Sbjct: 1214 PLHVVSLNIVK--NFILIGDIHKSIYFLSWKEQGAQLSLLAKDFGSLDCFATEFLIDGST 1271

Query: 873  LG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRF----------RHGSL 919
            L    +++N N+       + +   +  +L    E+H+G  V +F          R GS+
Sbjct: 1272 LSLMVSDDNRNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLSTSDRAGSV 1331

Query: 920  VMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQW 979
                P SD      ++FGT++G IG IA L    +  L+ LQ  L   +  V GLN   +
Sbjct: 1332 ----PGSDKTNRFALLFGTLDGSIGCIAPLDEITFRRLQSLQRKLVDAVPHVAGLNPRAF 1387

Query: 980  RSF--NNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELT 1033
            R F  N +       + +D +L+  +  L      EI+  +  +  ++   + +L+
Sbjct: 1388 RLFRSNGKAHRPGPDSIVDCELLCHYEMLPLEEQLEIANQIGTTRSQILSNLSDLS 1443



 Score = 74.7 bits (182), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 108/460 (23%), Positives = 176/460 (38%), Gaps = 128/460 (27%)

Query: 66  GQIGIIDPDCRLIGLHLYDGLFKVI--------------PFDNKG----QLKEAFNIRLE 107
           G +  IDP  R  G+ +YD    ++               F + G    +++ ++ I L 
Sbjct: 178 GPVVKIDPQGRCGGVLIYDLQMIILKATQVGSGLVGDDDAFGSSGAVAARIESSYMINLR 237

Query: 108 EL---QVLDIKFLYGCAKPTIVVLYQDNKD-------ARHVKTYEVALKDKDFVEGP--W 155
           +L    V D  F+YG  +P +V+L++             H              + P  W
Sbjct: 238 DLDMRHVKDFTFVYGYIEPVMVILHERELTWAGRVSWTHHTCMISALSISTTLKQHPLIW 297

Query: 156 SQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIR---------PSITKAY 206
           S  NL + A  L+ VP P+ GVL+IG  TI Y S +A  A+ +            I ++ 
Sbjct: 298 SAVNLPHDAYKLLAVPSPIGGVLVIGANTIHYHSQSASCALALNNYAVTLDSSQEIPRSS 357

Query: 207 GRV--DADGSRYLLGDHA------GLLHLLVITHEKEKVTGLKIELLGETSIASTISYLD 258
             V  DA  + +LL D A      G L LL++ ++   V  L +     + ++S I+ + 
Sbjct: 358 FNVELDAANATWLLSDVALLSTKTGELLLLMLVYDGRVVQRLDLSKSKASVLSSGITTIG 417

Query: 259 NAVVYIGSSYG-----------------------------DSQLIKLNLQPD-------- 281
           N++ ++ S  G                             D+   +L   P         
Sbjct: 418 NSLFFLASRLGDSMLVQFSCGSGVSMMSSNLKEEVGDIEVDAPSKRLRRSPSDALQDMVS 477

Query: 282 ----------------AKGSY-VEVLERYVNLGPIVDFC----------VVDLERQGQGQ 314
                           A+ S+   V +  +N+GP+ DF              + +Q   +
Sbjct: 478 GEELSLYGSATNRTESAQKSFSFAVRDSLINVGPLKDFSYGLRINADANATGIAKQSNYE 537

Query: 315 VVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMW-----SLRSST---------DDP 360
           +V CSG  K+GSL ++R  I       VEL G KG+W     S RS           DD 
Sbjct: 538 LVCCSGHGKNGSLCVLRQSIRPEVITEVELPGCKGIWTVYHKSTRSHNADSSKMADDDDE 597

Query: 361 FDTFLVVSFISETRILAMNLEDELEE-TEIEGFCSQTQTL 399
           +  +L++S   E R + +   D L E TE   +  Q +TL
Sbjct: 598 YHAYLIISL--EARTMVLETADLLSEVTESVDYYVQGKTL 635


>gi|71407487|ref|XP_806209.1| damage-specific DNA binding protein [Trypanosoma cruzi strain CL
            Brener]
 gi|70869885|gb|EAN84358.1| damage-specific DNA binding protein, putative [Trypanosoma cruzi]
          Length = 508

 Score = 94.0 bits (232), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 86/340 (25%), Positives = 140/340 (41%), Gaps = 62/340 (18%)

Query: 735  GTAYVLPEENEPTKGRILVFIVEDGKL-------QLIAEKETKGAVYS---LNAFNGKLL 784
            GT +  P+E  P   R + + VE GKL       + I  K+ +GA+     +  + G++ 
Sbjct: 92   GTTFAFPDEQLPRSSRFIWYCVEQGKLISERPQLRQIGSKDVEGALQCCCIVPNYVGRIA 151

Query: 785  AAINQKIQLYKWMLRDDGTRELQSECGHH--GHILALYVQTRGDFIVVGDLMKSISLLIY 842
              IN  I LY W   D      ++ C       +L ++ Q    ++V  D   S   +  
Sbjct: 152  LGINGCIALYSWNAADSVFVAEETICIGTIVTRVLPIF-QGDASYMVAFDARHSCFFIQV 210

Query: 843  KHEEGAIEERARDYNANW-MSAVEILDDDIY--LGAENNFNLFTVRK---------NSEG 890
               +G++E  ARD      M        +IY     ++ +N F + +         N+  
Sbjct: 211  DTIQGSLEIVARDSEPRGVMDGAVFQLGNIYNICFGDDYYNFFCLSRGASTLPSVSNATS 270

Query: 891  ATDEERGRLEVVGEYHLGEFVNRFRHGS--------LVMRLPDSDVGQI--PTVIFGTVN 940
            AT    G+LE   +YHLG+ V   + GS        + + +P + +  I  P V+FGT +
Sbjct: 271  ATVSS-GKLETSAQYHLGDMVTAMQLGSFAPCSVTNIAVPIPTTLIPGICGPQVVFGTSH 329

Query: 941  GVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRS------------------- 981
            G  G I  + +E YLFL+ L+  +  V+  +GG  H  +R                    
Sbjct: 330  GAFGTITPVSNETYLFLKALEVAVSSVVPALGGFEHVTYREVLRAGQERGYSRNASFENA 389

Query: 982  -------FNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEI 1014
                   F+  +K   ++    GDLIESFL  S+T    I
Sbjct: 390  NVVSSGVFDKHRKRYLSRCVCSGDLIESFLTFSQTIQQRI 429


>gi|224120960|ref|XP_002318462.1| predicted protein [Populus trichocarpa]
 gi|222859135|gb|EEE96682.1| predicted protein [Populus trichocarpa]
          Length = 1455

 Score = 94.0 bits (232), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 98/374 (26%), Positives = 170/374 (45%), Gaps = 49/374 (13%)

Query: 694  VRLLD--DQTFEFISTYPLDTFEYGCSILSCSF----SDDSNVYYCVGTAYVLPEENEPT 747
            VR+L+  +  ++  +T P+ T E   ++   S     + ++     VGTAYV   E+   
Sbjct: 1091 VRILEPSNGPWQVKATIPMQTSENALTVRMVSLFNTSTKENETLLAVGTAYV-QGEDVAA 1149

Query: 748  KGRILVFIV----EDGKLQLIAE---KETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRD 800
            +GRIL+F V    E+ ++ L++E   KE KGA+ +L +  G LL A   KI L+KW    
Sbjct: 1150 RGRILLFSVVKNPENSQI-LVSEVYSKELKGAISALASLQGHLLIASGPKIILHKWT--- 1205

Query: 801  DGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANW 860
             GT            +  + +    +FI++GD+ KSI  L +K +   +   A+D+ +  
Sbjct: 1206 -GTELTGVAFSDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLSLLAKDFASLD 1264

Query: 861  MSAVEILDDDIYLG-----AENNFNLFT-VRKNSEGATDEERGRLEVVGEYHLGEFVNRF 914
              + E L D   L       + N  +F    K SE    +   +L    E+H+G  V +F
Sbjct: 1265 CFSTEFLIDGSTLSLVVSDEQKNVQIFYYAPKMSESWKGQ---KLLSRAEFHVGALVTKF 1321

Query: 915  RHGSLVMRL-------------PDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQ 961
                  MRL             P SD      ++FGT++G IG IA L    +  L+ LQ
Sbjct: 1322 ------MRLQMLSPSLDRSGAAPVSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQ 1375

Query: 962  TNLRKVIKGVGGLNHEQWRSFNNEKKT--VDAKNFLDGDLIESFLDLSRTRMDEISKTMN 1019
              L   +  V GLN + +R F ++ K      ++ +D +++  +  +      EI++ + 
Sbjct: 1376 KKLVDAVPHVAGLNPKSFRQFRSDGKAHRPGPESIVDCEMLSYYEMIPLEEQVEIAQQIG 1435

Query: 1020 VSVEELCKRVEELT 1033
             +  ++   + +LT
Sbjct: 1436 TTRAQILSNLNDLT 1449



 Score = 70.1 bits (170), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 122/517 (23%), Positives = 201/517 (38%), Gaps = 144/517 (27%)

Query: 23  QDFLFIATERYKFCVLQWDAESSELITRAM----GDVSDRIGRPTDN---GQIGIIDPDC 75
           +D + +A +  K  VL++D     L T +M    G     + R  ++   G +  +DP  
Sbjct: 130 RDSIILAFKDAKISVLEFDDSIHGLRTSSMHCFEGPDWRHLKRGRESFARGPLVKVDPQG 189

Query: 76  RLIGLHLYDGLFKVIPFDNKG----QLKEAF----------------NIR-LEELQVLDI 114
           R  G+ +YD    ++     G    Q ++AF                N+R L+   V D 
Sbjct: 190 RCGGVLVYDLQMIILKAAQAGSALVQDEDAFGSGAAISAHIASSYIINLRDLDMKHVKDF 249

Query: 115 KFLYGCAKPTIVVLYQDNKDA------RHVKTYEVALKDKDFVEGP---WSQNNLDNGAD 165
            F++   +P +VVL++           +H      AL     ++ P   WS  NL + A 
Sbjct: 250 IFVHDYIEPVVVVLHERELTWAGRVVWKHHTCMISALSISTTLKQPTLIWSIGNLPHDAY 309

Query: 166 LLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIR------------PSITKAYGRVDADG 213
            L+ VP P+ GVL+IG  TI Y S +A  A+ +             P  T +   +DA  
Sbjct: 310 KLLAVPSPIGGVLVIGVNTIHYHSESASCALALNSYAASVDSSQELPRATFSV-ELDAAN 368

Query: 214 SRYLLGDHA------GLLHLLVITHEKEKVTGLKIELLGETSIASTISYL---------- 257
           + +LL D A      G L LL + ++   V  L +     + + S I+ L          
Sbjct: 369 ATWLLKDVALLSTKTGELLLLTLVYDGRVVQRLDLSKSKASVLTSDITTLGNSFFFLGSR 428

Query: 258 --DNAVVYIGSSYGDSQL----------IKLNLQPDAK---------------------- 283
             D+ +V   S  G S L          I+ +L P AK                      
Sbjct: 429 LGDSLLVQFTSGLGSSMLSPGLKEEVGDIEGDL-PSAKRLKVSSSDALQDMVSGEELSLY 487

Query: 284 ---------GSYVEVLERY--------VNLGPIVDFC----------VVDLERQGQGQVV 316
                       V V++ +        +N+GP+ DF              + +Q   ++V
Sbjct: 488 SSAPNNAESSQVVSVIKTFSFTVRDSLINVGPLKDFAYGLRINADANATGISKQSNYELV 547

Query: 317 TCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMW-------------SLRSSTDDPFDT 363
            CSG  K+G+L +++  I       VEL G KG+W             SL+ ++DD +  
Sbjct: 548 CCSGHGKNGALCVLQQSIRPEMITEVELPGCKGIWTVYHKNARIHSVDSLKMASDDEYHA 607

Query: 364 FLVVSFISETRILAMNLEDELEE-TEIEGFCSQTQTL 399
           +L++S   E R + +   D L E TE   +  Q +T+
Sbjct: 608 YLIISM--EARTMVLETADHLTEVTESVDYFVQGRTI 642


>gi|317036382|ref|XP_001398211.2| protein cft1 [Aspergillus niger CBS 513.88]
          Length = 1393

 Score = 94.0 bits (232), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 98/429 (22%), Positives = 179/429 (41%), Gaps = 52/429 (12%)

Query: 654  IRSIPLGEHPRRICHQEQSRTFAI--CSLKNQSCAEESEMH-----------------FV 694
            ++ + LGE    + +   S  + +  C   +    E+ E+H                 F+
Sbjct: 961  LKRVHLGEQVDHLAYSTSSGMYVLGTCHATDFKLPEDDELHPEWRNEAISFFPSARGSFI 1020

Query: 695  RLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVY-----YCVGTAYVLPEENEPTKG 749
            +L+   T+  I ++ L   EY  +I + S     N +       VGTA+    E+ P++G
Sbjct: 1021 KLVSPNTWSIIDSFSLGADEYVMAIKNISLEVSENTHERKDMIVVGTAFAR-GEDIPSRG 1079

Query: 750  RILVFIV-----------EDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWML 798
             I VF V            D KL+LI ++  KGAV +L+   G+    + Q  +     L
Sbjct: 1080 CIYVFEVVQVVPDPDHPETDRKLKLIGKEPVKGAVTALSEIGGQGFVLVAQGQKCMVRGL 1139

Query: 799  RDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNA 858
            ++DG+    +      ++  +         ++GD +K +    Y  E   +   A+D + 
Sbjct: 1140 KEDGSLLPVAFMDMQCYVSVVKELKGTGMCILGDAVKGVWFAGYSEEPYKMSLFAKDLDY 1199

Query: 859  NWMSAVEILDDDIYLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFR 915
              + A E L D   L    A+++ N+  ++ + E        RL    ++H+G F +   
Sbjct: 1200 LEVCAAEFLPDGKRLFIVVADSDCNIHVLQYDPEDPKSSNGDRLLSRSKFHMGNFASTLT 1259

Query: 916  HGSLVM----RLPDSDVG-----QIP--TVIFGTVNGVIGVIASLPHEQYLFLEKLQTNL 964
                 M    ++  S  G     Q P   V+  T NG +G+I  +P E Y  L  LQ+ L
Sbjct: 1260 LLPRTMVSSEKMVSSSDGMDIDNQSPLHQVLMTTQNGSLGLITCIPEESYRRLSALQSQL 1319

Query: 965  RKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEE 1024
               ++   GLN   +R+   E      +  LDG+L+  ++D+S+ R  EI+  +     E
Sbjct: 1320 TNTLEHPCGLNPRAFRAV--ESDGTAGRGMLDGNLLFKWIDMSKQRKTEIAGRVGAREWE 1377

Query: 1025 LCKRVEELT 1033
            +   +E ++
Sbjct: 1378 IKADLEAIS 1386


>gi|66812672|ref|XP_640515.1| CPSF domain-containing protein [Dictyostelium discoideum AX4]
 gi|60468551|gb|EAL66554.1| CPSF domain-containing protein [Dictyostelium discoideum AX4]
          Length = 1628

 Score = 94.0 bits (232), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 115/467 (24%), Positives = 211/467 (45%), Gaps = 77/467 (16%)

Query: 618  VSHMCPFNSAAFPDS-LAIAKEGEL----TIGTIDDIQK-LHIRSIPLGEHPRRICHQEQ 671
            V     FN+ +  D  +  +KE ++    T+ T+ + +  + IR IP      +I +  +
Sbjct: 1180 VETFTSFNNISCQDGFIYFSKEKDVIKICTLSTLMNFENDIAIRRIPTKNSCHKIAYHSE 1239

Query: 672  SRTFA-ICSLKN--QSCAEESEMHF-------VRLLD---DQTFEFISTYPLDTFE--YG 716
            ++ +  I S     Q   E+S+          ++L+D   D  ++FI ++ L   E    
Sbjct: 1240 AKCYVVIVSFPQVTQELQEDSKKPILTDDKFQIKLIDPTIDWNWKFIDSFSLQDRETVLA 1299

Query: 717  CSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-IVE-----------DG 759
              I+S  F++   +     +  +GTA+   E+ +  KGR+LVF IV            + 
Sbjct: 1300 MKIVSLKFTEPDGITRARPFLVIGTAFTFGEDTQ-CKGRVLVFEIVSHKTQFESEELGEK 1358

Query: 760  KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILAL 819
            +L L+ EKE KG V +L++ NG LL  I  K+ + ++      T  L +   +   I   
Sbjct: 1359 RLNLLYEKEQKGPVTALSSVNGLLLMTIGPKLTVNQFY-----TGSLVTLSFYDAQIYIC 1413

Query: 820  YVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGA---- 875
             + T  ++IV+GD+ KS+  L +K +   +   ++DY A  + + E + +   L      
Sbjct: 1414 SICTIKNYIVIGDMYKSVYFLQWK-DNKTLNLLSKDYQALNIFSTEFIVNQKTLSILVSD 1472

Query: 876  -ENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTV 934
             + N  LF+       +   +  + E+ G        N+  +     RLP  +      V
Sbjct: 1473 LDKNILLFSFEPQDPSSRSGQINQ-EINGN-------NKNDN-----RLPKKE----QLV 1515

Query: 935  IFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVD---- 990
            IFGT++G + V+  L  + YL    +Q+ L   +    GLN +Q+RSF +  +       
Sbjct: 1516 IFGTLDGGLNVLRPLDEKIYLLFYHIQSKLY-YLPQTAGLNPKQYRSFKSFSQNFHFSPS 1574

Query: 991  -----AKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
                  K  LDGDLI  FL LS++    IS ++N + +E+ + ++++
Sbjct: 1575 TFHQLPKFILDGDLISKFLSLSQSEKRLISNSINSTSDEIIESLKDV 1621



 Score = 47.0 bits (110), Expect = 0.050,   Method: Compositional matrix adjust.
 Identities = 48/223 (21%), Positives = 90/223 (40%), Gaps = 49/223 (21%)

Query: 94  NKGQLKEAFNIRLEELQVLDIKFLYGCAKPTIVVLYQDNKDARHVKTYEVALKDKDFV-- 151
           N  Q KE  NI +E ++  D  FL+G  +PTI+ L++       ++T+   +  K F   
Sbjct: 273 NNNQDKEKKNIEIENVK--DFCFLHGYYEPTILFLHEP------IQTWTSRIAVKKFTCQ 324

Query: 152 -------------EGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYC--------SA 190
                           W+ +N     ++L+ VP PL G L+I    + Y         + 
Sbjct: 325 MTAISLNLLTKAGSFIWNVSNFPYNCEMLVSVPEPLGGALVITANIMFYVNQTSRYGLAV 384

Query: 191 NAFKAI-------------PIRPSITKAY-----GRVDADGSRYLLGDHAGLLHLLVITH 232
           N + +I             PI  ++   +       V  +  +++     G L +  +  
Sbjct: 385 NEYASIDTSTIIGSQPFDFPIDDTLNLVFTLDRSNFVFLESDKFIGSLKGGELLIFHLIS 444

Query: 233 EKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIK 275
           +   V  + +   G + + S I  L N ++++GS  GDS L++
Sbjct: 445 DGRSVQRIHVSKAGGSVLTSCICVLSNNLIFLGSRLGDSLLLQ 487


>gi|356559917|ref|XP_003548242.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Glycine max]
          Length = 1447

 Score = 93.6 bits (231), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 87/305 (28%), Positives = 144/305 (47%), Gaps = 30/305 (9%)

Query: 706  STYPLDTFEYGCSI----LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIV---ED 758
            +T P+ + E   ++    L  + S ++     +GTAYV   E+   +GRIL+F +    D
Sbjct: 1098 ATIPMQSSENALTVRMVTLVNTTSKENETLLAIGTAYV-QGEDVAARGRILLFSLGKNTD 1156

Query: 759  GKLQLIAE---KETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHG- 814
                L++E   KE KGA+ +L +  G LL A   KI L+KW    +GT EL         
Sbjct: 1157 NPQTLVSEVYSKELKGAISALASLQGHLLIASGPKIILHKW----NGT-ELNGIAFFDAP 1211

Query: 815  --HILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIY 872
              H+++L +    +FI++GD+ KSI  L +K +   +   A+D+ +    A E L D   
Sbjct: 1212 PLHVVSLNIVK--NFILIGDIHKSIYFLSWKEQGAQLSLLAKDFGSLDCFATEFLIDGST 1269

Query: 873  LG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMR------L 923
            L    +++N N+       + +   +  +L    E+H+G  V +F    ++        +
Sbjct: 1270 LSLMVSDDNRNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLSTSDRAGAV 1329

Query: 924  PDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFN 983
            P SD      ++FGT++G IG IA L    +  L+ LQ  L   +  V GLN   +R F 
Sbjct: 1330 PGSDKTNRFALLFGTLDGSIGCIAPLDEITFRRLQSLQRKLVDAVPHVAGLNPRAFRLFR 1389

Query: 984  NEKKT 988
            +  K 
Sbjct: 1390 SNGKA 1394



 Score = 78.2 bits (191), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 122/536 (22%), Positives = 208/536 (38%), Gaps = 140/536 (26%)

Query: 66  GQIGIIDPDCRLIGLHLYDGLFKVIPFDNKG------------------QLKEAFNIRLE 107
           G +  +DP  R  G+ +YD    ++     G                  +++ ++ I L 
Sbjct: 176 GPVVKVDPQGRCGGVLIYDLQMIILKATQAGSGLVGEDDALGSSGAVAARIESSYMINLR 235

Query: 108 EL---QVLDIKFLYGCAKPTIVVLYQ------DNKDARHVKTYEVALK-DKDFVEGP--W 155
           +L    V D  F++G  +P +V+L++           +H      AL       + P  W
Sbjct: 236 DLDMRHVKDFTFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIW 295

Query: 156 SQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIRP---------SITKAY 206
           S  NL + A  L+ VP P+ GVL+I   TI Y S +A  A+ +            I ++ 
Sbjct: 296 SAVNLPHDAYKLLAVPSPIGGVLVISANTIHYHSQSASCALALNSYAVTLDSSQEIPRSS 355

Query: 207 GRV--DADGSRYLLGDHA------GLLHLLVITHEKEKVTGLKIELLGETSIASTISYLD 258
             V  DA  + +LL D A      G L LL + ++   V  L +     + ++S I+ + 
Sbjct: 356 FNVELDAANATWLLSDVALLSTKTGELLLLTLVYDGRVVQRLDLSKSKASVLSSGITTIG 415

Query: 259 NAVVYIGSSYGDSQLIKL------------------NLQPDA------------------ 282
           N++ ++ S  GDS L++                   +++ DA                  
Sbjct: 416 NSLFFLASRLGDSMLVQFSCGSGVSMLSSNLKEEVGDIEADAPSKRLRRSPSDALQDMVS 475

Query: 283 ------------------KGSYVEVLERYVNLGPIVDFC----------VVDLERQGQGQ 314
                             K     V +  +N+GP+ DF              + +Q   +
Sbjct: 476 GEELSLYGSAPNRTESAQKSFSFAVRDSLINVGPLKDFSYGLRINADANATGIAKQSNYE 535

Query: 315 VVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMW-----SLRSST---------DDP 360
           +V CSG  K+GSL ++R  I       VEL G KG+W     S RS           DD 
Sbjct: 536 LVCCSGHGKNGSLCVLRQSIRPEVITEVELPGCKGIWTVYHKSTRSHNADSSKMADDDDE 595

Query: 361 FDTFLVVSFISETRILAMNLEDELEE-TEIEGFCSQTQTLFCHD--------AIYNQLVQ 411
           +  +L++S   E R + +   D L E TE   +  Q +TL   +         +Y +  +
Sbjct: 596 YHAYLIISL--EARTMVLETADLLSEVTESVDYYVQGKTLAAGNLFGRCRVIQVYERGAR 653

Query: 412 VTSGS--VRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD 465
           +  GS   + VS  +  L +   S    +++V+ A+   +L  + G   + L IGD
Sbjct: 654 ILDGSFMTQDVSFGASNLESGSASDSAIALSVSIADPFVLLRMSDGS--IRLLIGD 707


>gi|195150431|ref|XP_002016158.1| GL10645 [Drosophila persimilis]
 gi|194110005|gb|EDW32048.1| GL10645 [Drosophila persimilis]
          Length = 1459

 Score = 93.6 bits (231), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 105/434 (24%), Positives = 191/434 (44%), Gaps = 60/434 (13%)

Query: 654  IRSIPLGEHPRRICHQEQSRTFAICSL-------------KNQSCAEESEMHFVRLLDDQ 700
            +R +PL   PR++ +  ++R + + +              +++  +EES        +  
Sbjct: 1030 VRKVPLRCTPRQLVYHRENRVYCLITQTEEPMTKYYRFNGEDKELSEESRGERFIYPNGS 1089

Query: 701  TFEFISTYP------------LDTFEY--GCSILSCSFSDDSN---VYYCVGTAYVLPEE 743
             FE +   P             + +E+     I+  S+    +    Y C+GT +    E
Sbjct: 1090 QFEMVLISPETWEIVPDASIRFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNY-SE 1148

Query: 744  NEPTKGRILVF-----IVEDGK------LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQ 792
            +  ++G I ++     + E GK      L+ + +KE KG V +++   G L+  + QKI 
Sbjct: 1149 DITSRGNIHIYDIIEVVPEPGKPMTKFKLKEVFKKEQKGPVSAISDVLGFLVTGLGQKI- 1207

Query: 793  LYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEER 852
             Y W LRD    +L        +I    + T    I + D+ KSISLL ++ E   +   
Sbjct: 1208 -YIWQLRDG---DLIGVAFIDTNIYVHQIITVKSLIFIADVYKSISLLRFQEEHRTLSLA 1263

Query: 853  ARDYNANWMSAVEILDDDIYLG-----AENNFNLFTVRKNSEGATDEERGRLEVVGEYHL 907
            +RD+N   +  +E + D+  LG     AE N  ++  +  +  +   +  +L    +YHL
Sbjct: 1264 SRDFNPLEVYGIEFMVDNSNLGFLVTDAERNLIVYMYQPEARESLGGQ--KLIRKADYHL 1321

Query: 908  GEFVNR-FR---HGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTN 963
            G+ VN  FR   H   V +           V++GT++G +G    LP + Y     LQ  
Sbjct: 1322 GQVVNTMFRVQCHQRGVHQRQPFLYENKHFVVYGTLDGGLGYCLPLPEKVYRRFLMLQNV 1381

Query: 964  LRKVIKGVGGLNHEQWRSFNNEKKT--VDAKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
            L      + GLN +++R+  + KK     ++  +DGDLI S+  L  +  +E++K +   
Sbjct: 1382 LLSYQDHLCGLNPKEFRTLKSFKKQGLNPSRCIIDGDLIWSYRLLPNSDRNEVAKKIGTR 1441

Query: 1022 VEELCKRVEELTRL 1035
             EE+   + E+ RL
Sbjct: 1442 TEEILSDLLEIERL 1455



 Score = 58.2 bits (139), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 71/332 (21%), Positives = 136/332 (40%), Gaps = 65/332 (19%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
           +YG + +L+     G  QD L ++ +  K  VLQ D ++  L T ++    +   R    
Sbjct: 76  LYGNVMSLQCVSLAGAMQDALLVSFKDAKLSVLQHDPDTYALKTLSLHYFEEEDIRGGWT 135

Query: 66  GQIGI----IDPDCRLIGLHLYDGLFKVIPFDNKGQLKE--------------------- 100
           G+  +    +DPD R   + +Y     V+PF     L E                     
Sbjct: 136 GRYFVPVVRVDPDARCAVMLVYGKRLVVLPFRKDNSLDEIELTDVKPFKKAPTAMVSRTP 195

Query: 101 ---AFNIRLEEL-----QVLDIKFLYGCAKPTIVVLYQDNKDAR-HVKTYE-------VA 144
              ++ I L+EL      VLDI+FL+G  +PT+++LY+  +     +K          ++
Sbjct: 196 IMASYLITLKELDEKIDNVLDIQFLHGYYEPTLLILYEPVRTCSGRIKVRSDTCVLVAIS 255

Query: 145 LKDKDFVEG-PWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVY-------------CSA 190
           L  +  V    W+ N+L      + P+  P+ G L++    ++Y              SA
Sbjct: 256 LNIQQRVHPIIWTVNSLPFDCFQVYPIQKPIGGCLVMTVNAVIYLNQSVPPYGVSLNSSA 315

Query: 191 NAFKAIPIRP------SITKA-YGRVDADGSRYLLGDHAGLLHLLVITHEKEK-VTGLKI 242
           +   + P++P      S+  A +  +D D  + ++    G L++L +  +  + V     
Sbjct: 316 DNSTSFPLKPQDGVRISLDCANFAFIDVD--KLVISLRTGELYVLTLCVDSMRTVRNFHF 373

Query: 243 ELLGETSIASTISYLDNAVVYIGSSYGDSQLI 274
                + + S I    +  +++GS  G+S L+
Sbjct: 374 HKAAASVLTSCICVCHSEYIFLGSRLGNSLLL 405



 Score = 42.7 bits (99), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 43/159 (27%), Positives = 70/159 (44%), Gaps = 26/159 (16%)

Query: 288 EVLERYVNLGPIVDFCV---VDLERQGQG-------------QVVTCSGAYKDGSLRIVR 331
           EV +  +N+ PI   C    V+ E  G               ++V  +G  K+G+L +  
Sbjct: 488 EVCDSLINVAPINYMCAGERVEFEEDGTTLRPHAENLHDLKIELVAATGHSKNGALSVFV 547

Query: 332 NGIGINEQASVELQGIKGMWSL------RSSTDDPFDTFLVVSFISETRILAMNLEDELE 385
           N I      S EL G   +W++      ++S  D  D F+++S  + T  L +    E+ 
Sbjct: 548 NCINPQIITSFELDGCLDVWTVFDDATKKTSRHDQHD-FMLLSQSNST--LVLQTGQEIN 604

Query: 386 ETEIEGFCSQTQTLFCHDAIYNQ-LVQVTSGSVRLVSST 423
           E E  GF     T+F  +    + +VQVT+  VRL+  T
Sbjct: 605 EIENTGFTVNQATIFVGNLGQQRFIVQVTTRHVRLLQGT 643


>gi|297820284|ref|XP_002878025.1| hypothetical protein ARALYDRAFT_906938 [Arabidopsis lyrata subsp.
            lyrata]
 gi|297323863|gb|EFH54284.1| hypothetical protein ARALYDRAFT_906938 [Arabidopsis lyrata subsp.
            lyrata]
          Length = 454

 Score = 93.2 bits (230), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 95/371 (25%), Positives = 167/371 (45%), Gaps = 58/371 (15%)

Query: 694  VRLLDDQTFEFISTYPLDTFEYGCSILSCSFSD-DSNVYYCVGTAYVLPEENEPTKGRIL 752
            +R+LD +T        L   E   S+ + +F D +      VGT   +  +  P K  + 
Sbjct: 103  IRILDPKTATTTCLLELQDNEAAYSVCTVNFHDKEYGTLLAVGTVKGM--QFWPKKNLVA 160

Query: 753  VFI-----VEDGK-LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTREL 806
             FI     VE+GK L+L+ + + +G   +L  F G+LLA I   ++LY     D G + L
Sbjct: 161  GFIHIYRFVEEGKSLELLHKTQVEGVPLALCQFQGRLLAGIGPVLRLY-----DLGKKRL 215

Query: 807  QSECGHH---GHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSA 863
              +C +      I++  +QT  D I VGD+ +S     Y+ +E  +   A D    W++A
Sbjct: 216  LRKCENKLFPNTIIS--IQTYRDRIYVGDIQESFHYCKYRRDENQLYIFADDCVPRWLTA 273

Query: 864  VEILDDDIYLGAENNFNLFTVR------------------KNSEGATDEERGRLEVVGEY 905
               +D D   GA+   N++ VR                  K  +G  +    +++ + ++
Sbjct: 274  SHHVDFDTMAGADKFGNVYFVRLPQDLSEEIEEDPTGGKIKWEQGKLNGAPNKVDEIVQF 333

Query: 906  HLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVI-ASLPHEQYLFLEKLQTNL 964
            H+G+ V   +  S++        G   ++++GTV G IG + A    +   F   L+ ++
Sbjct: 334  HVGDVVTCLQKASMI-------PGGSESIMYGTVMGSIGALHAFTSRDDVDFFSHLEMHM 386

Query: 965  RKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESF----LDLSRTRMDEISKTMNV 1020
            R+    + G +H  +RS          K+ +DGDL E F    +DL R   DE+ +T   
Sbjct: 387  RQEYPPLCGRDHMAYRS-----AYFPVKDVIDGDLCEQFPTLPMDLQRKIADELDRT--- 438

Query: 1021 SVEELCKRVEE 1031
               E+ K++E+
Sbjct: 439  -PAEILKKLED 448


>gi|198457226|ref|XP_001360595.2| GA10080 [Drosophila pseudoobscura pseudoobscura]
 gi|198135905|gb|EAL25170.2| GA10080 [Drosophila pseudoobscura pseudoobscura]
          Length = 1459

 Score = 93.2 bits (230), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 105/434 (24%), Positives = 191/434 (44%), Gaps = 60/434 (13%)

Query: 654  IRSIPLGEHPRRICHQEQSRTFAICSL-------------KNQSCAEESEMHFVRLLDDQ 700
            +R +PL   PR++ +  ++R + + +              +++  +EES        +  
Sbjct: 1030 VRKVPLRCTPRQLVYHRENRVYCLITQTEEPMTKYYRFNGEDKELSEESRGERFIYPNGS 1089

Query: 701  TFEFISTYP------------LDTFEY--GCSILSCSFSDDSN---VYYCVGTAYVLPEE 743
             FE +   P             + +E+     I+  S+    +    Y C+GT +    E
Sbjct: 1090 QFEMVLISPETWEIVPDASIRFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNY-SE 1148

Query: 744  NEPTKGRILVF-----IVEDGK------LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQ 792
            +  ++G I ++     + E GK      L+ + +KE KG V +++   G L+  + QKI 
Sbjct: 1149 DITSRGNIHIYDIIEVVPEPGKPMTKFKLKEVFKKEQKGPVSAISDVLGFLVTGLGQKI- 1207

Query: 793  LYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEER 852
             Y W LRD    +L        +I    + T    I + D+ KSISLL ++ E   +   
Sbjct: 1208 -YIWQLRDG---DLIGVAFIDTNIYVHQIITVKSLIFIADVYKSISLLRFQEEHRTLSLA 1263

Query: 853  ARDYNANWMSAVEILDDDIYLG-----AENNFNLFTVRKNSEGATDEERGRLEVVGEYHL 907
            +RD+N   +  +E + D+  LG     AE N  ++  +  +  +   +  +L    +YHL
Sbjct: 1264 SRDFNPLEVYGIEFMVDNSNLGFLVTDAERNLIVYMYQPEARESLGGQ--KLIRKADYHL 1321

Query: 908  GEFVNR-FR---HGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTN 963
            G+ VN  FR   H   V +           V++GT++G +G    LP + Y     LQ  
Sbjct: 1322 GQVVNTMFRVQCHQRGVHQRQPFLYENKHFVVYGTLDGGLGYCLPLPEKVYRRFLMLQNV 1381

Query: 964  LRKVIKGVGGLNHEQWRSFNNEKKT--VDAKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
            L      + GLN +++R+  + KK     ++  +DGDLI S+  L  +  +E++K +   
Sbjct: 1382 LLSYQDHLCGLNPKEFRTLKSFKKQGLNPSRCIIDGDLIWSYRLLPNSDRNEVAKKIGTR 1441

Query: 1022 VEELCKRVEELTRL 1035
             EE+   + E+ RL
Sbjct: 1442 TEEILSDLLEIERL 1455



 Score = 57.8 bits (138), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 71/332 (21%), Positives = 136/332 (40%), Gaps = 65/332 (19%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
           +YG + +L+     G  QD L ++ +  K  VLQ D ++  L T ++    +   R    
Sbjct: 76  LYGNVMSLQCVSLAGAMQDALLVSFKDAKLSVLQHDPDTYALKTLSLHYFEEEDIRGGWT 135

Query: 66  GQIGI----IDPDCRLIGLHLYDGLFKVIPFDNKGQLKE--------------------- 100
           G+  +    +DPD R   + +Y     V+PF     L E                     
Sbjct: 136 GRYFVPVVRVDPDARCAVMLVYGKRLVVLPFRKDNSLDEIELTDVKPFKKAPTAMVSRTP 195

Query: 101 ---AFNIRLEEL-----QVLDIKFLYGCAKPTIVVLYQDNKDAR-HVKTYE-------VA 144
              ++ I L+EL      VLDI+FL+G  +PT+++LY+  +     +K          ++
Sbjct: 196 IMASYLITLKELDEKIDNVLDIQFLHGYYEPTLLILYEPVRTCPGRIKVRSDTCVLVAIS 255

Query: 145 LKDKDFVEG-PWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVY-------------CSA 190
           L  +  V    W+ N+L      + P+  P+ G L++    ++Y              SA
Sbjct: 256 LNIQQRVHPIIWTVNSLPFDCFQVYPIQKPIGGCLVMTVNAVIYLNQSVPPYGVSLNSSA 315

Query: 191 NAFKAIPIRP------SITKA-YGRVDADGSRYLLGDHAGLLHLLVITHEKEK-VTGLKI 242
           +   + P++P      S+  A +  +D D  + ++    G L++L +  +  + V     
Sbjct: 316 DNSTSFPLKPQDGVRISLDCANFAFIDVD--KLVISLRTGELYVLTLCVDSMRTVRNFHF 373

Query: 243 ELLGETSIASTISYLDNAVVYIGSSYGDSQLI 274
                + + S I    +  +++GS  G+S L+
Sbjct: 374 HKAAASVLTSCICVCHSEYIFLGSRLGNSLLL 405



 Score = 42.7 bits (99), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 43/159 (27%), Positives = 70/159 (44%), Gaps = 26/159 (16%)

Query: 288 EVLERYVNLGPIVDFCV---VDLERQGQG-------------QVVTCSGAYKDGSLRIVR 331
           EV +  +N+ PI   C    V+ E  G               ++V  +G  K+G+L +  
Sbjct: 488 EVCDSLINVAPINYMCAGERVEFEEDGTTLRPHAENLHDLKIELVAATGHSKNGALSVFV 547

Query: 332 NGIGINEQASVELQGIKGMWSL------RSSTDDPFDTFLVVSFISETRILAMNLEDELE 385
           N I      S EL G   +W++      ++S  D  D F+++S  + T  L +    E+ 
Sbjct: 548 NCINPQIITSFELDGCLDVWTVFDDATKKTSRHDQHD-FMLLSQSNST--LVLQTGQEIN 604

Query: 386 ETEIEGFCSQTQTLFCHDAIYNQ-LVQVTSGSVRLVSST 423
           E E  GF     T+F  +    + +VQVT+  VRL+  T
Sbjct: 605 EIENTGFTVNQATIFVGNLGQQRFIVQVTTRHVRLLQGT 643


>gi|449524573|ref|XP_004169296.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           1-like, partial [Cucumis sativus]
          Length = 741

 Score = 92.4 bits (228), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 86/276 (31%), Positives = 129/276 (46%), Gaps = 32/276 (11%)

Query: 733 CVGTAYVLPEENEPTKGRILVFIV---EDGKLQLIAE---KETKGAVYSLNAFNGKLLAA 786
            VGTAYV   E+   +GR+L+F V    D    L++E   KE KGA+ +L +  G LL A
Sbjct: 423 AVGTAYV-QGEDVAARGRVLLFSVGKDADNSQTLVSEVYSKELKGAISALASLQGHLLIA 481

Query: 787 INQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRG---DFIVVGDLMKSISLLIYK 843
              KI L+KW        EL     +   +  LYV +     +FI++GD+ KSI  L +K
Sbjct: 482 SGPKIILHKWT-----GAELNGIAFYD--VPPLYVVSLNIVKNFILLGDIHKSIYFLSWK 534

Query: 844 HEEGAIEERARDYNANWMSAVEILDDDIYLGA-----ENNFNLFTVRKNSEGATDEERG- 897
            +   +   A+D+ +    A E L D   L       + N  +F     S   T+  +G 
Sbjct: 535 EQGAQLSLLAKDFGSLDCYATEFLIDGSTLSLTVSDDQKNIQIFYYAPKS---TESWKGQ 591

Query: 898 RLEVVGEYHLGEFVNRFRHGSLVMRLPD------SDVGQIPTVIFGTVNGVIGVIASLPH 951
           +L    E+H+G  V +F    ++    D      SD      ++FGT++G IG IA L  
Sbjct: 592 KLLSRAEFHVGAHVTKFLRLQMLSTSSDKACSTVSDKTNRFALLFGTLDGSIGCIAPLDE 651

Query: 952 EQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKK 987
             +  L+ LQ  L   +  VGGLN   +R F++  K
Sbjct: 652 LTFRRLQSLQKKLGDAVPHVGGLNPRSFRQFHSNGK 687


>gi|449470342|ref|XP_004152876.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Cucumis sativus]
          Length = 1504

 Score = 92.4 bits (228), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 86/276 (31%), Positives = 129/276 (46%), Gaps = 32/276 (11%)

Query: 733  CVGTAYVLPEENEPTKGRILVFIV---EDGKLQLIAE---KETKGAVYSLNAFNGKLLAA 786
             VGTAYV   E+   +GR+L+F V    D    L++E   KE KGA+ +L +  G LL A
Sbjct: 1186 AVGTAYV-QGEDVAARGRVLLFSVGKDADNSQTLVSEVYSKELKGAISALASLQGHLLIA 1244

Query: 787  INQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRG---DFIVVGDLMKSISLLIYK 843
               KI L+KW        EL     +   +  LYV +     +FI++GD+ KSI  L +K
Sbjct: 1245 SGPKIILHKWT-----GAELNGIAFYD--VPPLYVVSLNIVKNFILLGDIHKSIYFLSWK 1297

Query: 844  HEEGAIEERARDYNANWMSAVEILDDDIYLGA-----ENNFNLFTVRKNSEGATDEERG- 897
             +   +   A+D+ +    A E L D   L       + N  +F     S   T+  +G 
Sbjct: 1298 EQGAQLSLLAKDFGSLDCYATEFLIDGSTLSLTVSDDQKNIQIFYYAPKS---TESWKGQ 1354

Query: 898  RLEVVGEYHLGEFVNRFRHGSLVMRLPD------SDVGQIPTVIFGTVNGVIGVIASLPH 951
            +L    E+H+G  V +F    ++    D      SD      ++FGT++G IG IA L  
Sbjct: 1355 KLLSRAEFHVGAHVTKFLRLQMLSTSSDKACSTVSDKTNRFALLFGTLDGSIGCIAPLDE 1414

Query: 952  EQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKK 987
              +  L+ LQ  L   +  VGGLN   +R F++  K
Sbjct: 1415 LTFRRLQSLQKKLGDAVPHVGGLNPRSFRQFHSNGK 1450



 Score = 61.6 bits (148), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 77/338 (22%), Positives = 144/338 (42%), Gaps = 60/338 (17%)

Query: 6   IYGRIATLELFRPHG----EAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSD---- 57
           ++G + ++ +    G    + +D + +  +  K  VL++D  +  L T +M         
Sbjct: 108 LHGNVESMAILSSRGGDGSKKRDSIILVFQEAKISVLEFDDSTHSLRTSSMHCFDGPQWL 167

Query: 58  --RIGRPT-DNGQIGIIDPDCRLIGLHLYDGLFKVI---------------PFDNKG--- 96
             + GR +   G +  +DP  R  G+ +Y GL  +I                F N G   
Sbjct: 168 HLKRGRESFARGPVVKVDPQGRCGGVLVY-GLQMIILKASQAGSGLVVDDEAFGNTGAIS 226

Query: 97  -QLKEAFNIRLEELQVLDIK---FLYGCAKPTIVVLYQD------NKDARHVKTYEVALK 146
            +++ ++ I L +L V  +K   F++G  +P +V+L++           +H      AL 
Sbjct: 227 ARVESSYLINLRDLDVKHVKDFVFVHGYIEPVMVILHEQELTWAGRVSWKHHTCMVSALS 286

Query: 147 -DKDFVEGP--WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIRPSIT 203
                 + P  WS +NL + A  L+ VP P+ GVL+I   +I Y S +A   + +     
Sbjct: 287 ISTTLKQHPLIWSASNLPHDAYKLLAVPSPIGGVLVISANSIHYNSQSASCMLALNNYAV 346

Query: 204 KAYG-----------RVDADGSRYLLGDHA------GLLHLLVITHEKEKVTGLKIELLG 246
            A              +DA  + +L+ D A      G L LL + ++   V  L +    
Sbjct: 347 SADSSQDMPRSNFNVELDAANATWLVNDVALLSTKTGELLLLALVYDGRVVQRLDLSKSK 406

Query: 247 ETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKG 284
            + + S I+ + N++ ++GS  GDS L++ +    + G
Sbjct: 407 ASVLTSGIASIGNSLFFLGSRLGDSLLVQFSCGVGSSG 444



 Score = 47.4 bits (111), Expect = 0.041,   Method: Compositional matrix adjust.
 Identities = 32/112 (28%), Positives = 52/112 (46%), Gaps = 24/112 (21%)

Query: 289 VLERYVNLGPIVDFC----------VVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINE 338
           V +  +N+GP+ DF              + +Q   ++V CSG  K+G+L I+R  I    
Sbjct: 556 VRDSLINIGPLKDFSYGLRINADPNATGIAKQSNYELVCCSGHGKNGALCILRQSIRPEM 615

Query: 339 QASVELQGIKGMWSL-----RSST---------DDPFDTFLVVSFISETRIL 376
              VEL G KG+W++     R S          DD +  +L++S  + T +L
Sbjct: 616 ITEVELPGCKGIWTVYHKNTRGSIADSSRMVPDDDEYHAYLIISLEARTMVL 667


>gi|241560031|ref|XP_002400960.1| spliceosomal protein sap, putative [Ixodes scapularis]
 gi|215501812|gb|EEC11306.1| spliceosomal protein sap, putative [Ixodes scapularis]
          Length = 1019

 Score = 92.4 bits (228), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 82/329 (24%), Positives = 150/329 (45%), Gaps = 50/329 (15%)

Query: 729  NVYYCVGTAYVL-----PEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKL 783
            N   C G   +L     P   EP  G          +L+L+     + A  +L  F G+L
Sbjct: 710  NPRVCRGGGLLLTYRLAPNPEEPMAGPT--------QLELVHATPVEEAPTALCPFQGRL 761

Query: 784  LAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY-VQTRGDFIVVGDLMKSISLLIY 842
            LA + + ++LY     D G ++L  +C +     A+  +Q  G+ +VV D+ +S   L Y
Sbjct: 762  LAGVGKCLRLY-----DLGRKKLLRKCENKYIPNAIVSIQAMGNRVVVSDVQESFFFLRY 816

Query: 843  KHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATD--------- 893
            K +E  +   A D    W++A  +LD +   GA+   N+  +R  S  + D         
Sbjct: 817  KRQENQLVIFADDSVPRWITASCMLDYETVAGADKFGNVSIIRLPSSISDDVDEDPTGIK 876

Query: 894  --EERGRL-------EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIG 944
               +RG L       +V+  +H+GE V   +  +L+        G   ++++ T++G +G
Sbjct: 877  SLWDRGWLGGSSQKADVISNFHIGETVLSLQKATLI-------PGGSESLVYVTLSGTVG 929

Query: 945  VIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESF 1003
            V+     HE + F + L+ ++R     + G +H  +RS          KN +DGDL E F
Sbjct: 930  VLVPFTAHEDHDFFQHLEMHMRYENPPLCGRDHLSFRS-----SYFPVKNVIDGDLCEQF 984

Query: 1004 LDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
              L  ++   I++ ++ +  E+ K++E++
Sbjct: 985  NSLDPSKQKSIAEELDRNPSEVSKKLEDI 1013



 Score = 49.3 bits (116), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 28/87 (32%), Positives = 48/87 (55%), Gaps = 3/87 (3%)

Query: 333 GIGINEQASVELQG-IKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEG 391
           G+ ++E A  EL G    +W+++   D+ FD +++VSF++ T +L++   + +EE    G
Sbjct: 428 GLQVSEMAVSELPGNPNAVWTVKRKADEDFDAYIIVSFVNATLVLSIG--ETVEEVTDSG 485

Query: 392 FCSQTQTLFCHDAIYNQLVQVTSGSVR 418
           F   T TL C     + LVQ  + S R
Sbjct: 486 FLGTTPTLSCAQIGDDALVQALAMSSR 512



 Score = 48.1 bits (113), Expect = 0.026,   Method: Compositional matrix adjust.
 Identities = 67/326 (20%), Positives = 126/326 (38%), Gaps = 80/326 (24%)

Query: 216 YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIK 275
           +L     G +  + +  +++ VT +K++      +A+++  L    +++ + +G+  L +
Sbjct: 111 FLAQTEQGDIFKITLETDEDMVTEIKLKYFDTIPVAASMCVLKTGFLFVAAEFGNHCLYQ 170

Query: 276 LN------------------------LQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQG 311
           +                           P A  + + V E   +L P +   + DL  + 
Sbjct: 171 IARLGEEDEEPEFSSAIPLEEGDTFFFAPRALRNLLPV-EELDSLSPAMGCTIADLANED 229

Query: 312 QGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDPFDTFLVVSFIS 371
             Q+    G      LR++R+G+  +  A V L  ++  W +    + P           
Sbjct: 230 TPQLYVACGRGPRSCLRVLRHGLEASVHAEVLLCVLE--WLIDCDQNVP----------- 276

Query: 372 ETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEW 431
              ILA++                               QV    +R + +  R   NEW
Sbjct: 277 ---ILAVHW------------------------------QVYPEGIRHIRADKR--VNEW 301

Query: 432 KSPPGYSVNVATANASQVLLATGGGHLVYLEIGD-GILTE-VKHAQLEYEISCLDIN--P 487
           ++P   ++     N  QV++A  GG LVY E+   G L E     ++  +I C+ +   P
Sbjct: 302 RTPGKKAIVKCAVNQRQVVIALTGGELVYFEMDPTGQLNEYTDRREMSADIICMSLASVP 361

Query: 488 IGENPSYSQIAAVGMWTDISVRIFSL 513
           +GE    S+  AVG+  D +VRI SL
Sbjct: 362 VGEQ--RSRFLAVGL-ADNTVRIISL 384


>gi|350633238|gb|EHA21604.1| hypothetical protein ASPNIDRAFT_51242 [Aspergillus niger ATCC 1015]
          Length = 1406

 Score = 92.0 bits (227), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 98/432 (22%), Positives = 179/432 (41%), Gaps = 55/432 (12%)

Query: 654  IRSIPLGEHPRRICHQEQSRTFAI--CSLKNQSCAEESEMH------------------- 692
            ++ + LGE    + +   S  + +  C   +    E+ E+H                   
Sbjct: 971  LKRVHLGEQVDHLAYSTSSGMYVLGTCHATDFKLPEDDELHPEWRNEDCLAISFFPSARG 1030

Query: 693  -FVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVY-----YCVGTAYVLPEENEP 746
             F++L+   T+  I ++ L   EY  +I + S     N +       VGTA+    E+ P
Sbjct: 1031 SFIKLVSPNTWSIIDSFSLGADEYVMAIKNISLEVSENTHERKDMIVVGTAFAR-GEDIP 1089

Query: 747  TKGRILVFIV-----------EDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYK 795
            ++G I VF V            D KL+LI ++  KGAV +L+   G+    + Q  +   
Sbjct: 1090 SRGCIYVFEVVQVVPDPDHPETDRKLKLIGKEPVKGAVTALSEIGGQGFVLVAQGQKCMV 1149

Query: 796  WMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARD 855
              L++DG+    +      ++  +         ++GD +K +    Y  E   +   A+D
Sbjct: 1150 RGLKEDGSLLPVAFMDMQCYVSVVKELKGTGMCILGDAVKGVWFAGYSEEPYKMSLFAKD 1209

Query: 856  YNANWMSAVEILDDDIYLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVN 912
             +   + A E L D   L    A+++ N+  ++ + E        RL    ++H+G F +
Sbjct: 1210 LDYLEVCAAEFLPDGKRLFIVVADSDCNIHVLQYDPEDPKSSNGDRLLSRSKFHMGNFAS 1269

Query: 913  RFRHGSLVM----RLPDSDVG-----QIP--TVIFGTVNGVIGVIASLPHEQYLFLEKLQ 961
                    M    ++  S  G     Q P   V+  T NG +G+I  +P E Y  L  LQ
Sbjct: 1270 TLTLLPRTMVSSEKMVSSSDGMDIDNQSPLHQVLMTTQNGSLGLITCIPEESYRRLSALQ 1329

Query: 962  TNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
            + L   ++   GLN   +R+   E      +  LDG+L+  ++D+S+ R  EI+  +   
Sbjct: 1330 SQLTNTLEHPCGLNPRAFRAV--ESDGTAGRGMLDGNLLFKWIDMSKQRKTEIAGRVGAR 1387

Query: 1022 VEELCKRVEELT 1033
              E+   +E ++
Sbjct: 1388 EWEIKADLEAIS 1399


>gi|218188544|gb|EEC70971.1| hypothetical protein OsI_02599 [Oryza sativa Indica Group]
          Length = 274

 Score = 92.0 bits (227), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 58/168 (34%), Positives = 78/168 (46%), Gaps = 57/168 (33%)

Query: 845  EEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGE 904
             +G  +E  RDYN  WM+AVE+LDD +Y+GA+N +NLFT  K          GRL V+G+
Sbjct: 148  NKGQTDELNRDYNITWMTAVEMLDDYVYIGADNCYNLFTELK-------RRVGRLLVIGQ 200

Query: 905  YHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNL 964
            YHLG                                                 ++LQ  L
Sbjct: 201  YHLG-------------------------------------------------DQLQLVL 211

Query: 965  RKVIKGVGGLNHEQWRSFNNEKKTVD-AKNFLDGDLIESFLDLSRTRM 1011
               IK VG L H QWR+F  + ++V  A+ F+D DLIESFL L  ++M
Sbjct: 212  ADYIKSVGNLTHAQWRAFYTDGRSVGVAQGFVDNDLIESFLSLEPSKM 259



 Score = 55.1 bits (131), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 31/79 (39%), Positives = 45/79 (56%), Gaps = 5/79 (6%)

Query: 1   MLDVPIYGRIATLELFRPHGEAQ-----DFLFIATERYKFCVLQWDAESSELITRAMGDV 55
           MLD P+YGRIAT+ELFRP   A      D    ++ ++    +  + E+  +I R+  D 
Sbjct: 80  MLDAPVYGRIATIELFRPCVLASIKFSPDSRAKSSCKFVNAFVYLNQENRYVICRSGRDA 139

Query: 56  SDRIGRPTDNGQIGIIDPD 74
           SD IGRPT+ GQ   ++ D
Sbjct: 140 SDHIGRPTNKGQTDELNRD 158


>gi|302814354|ref|XP_002988861.1| hypothetical protein SELMODRAFT_184138 [Selaginella moellendorffii]
 gi|300143432|gb|EFJ10123.1| hypothetical protein SELMODRAFT_184138 [Selaginella moellendorffii]
          Length = 1413

 Score = 91.7 bits (226), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 86/315 (27%), Positives = 145/315 (46%), Gaps = 24/315 (7%)

Query: 733  CVGTAYVLPEENEPTKGRI-LVFIVEDGK-----LQLIAEKETKGAVYSLNAFNGKLLAA 786
             +GT Y L  E+  ++GRI LV + ED        + +  KE KGA+ +L A  G LL A
Sbjct: 1103 AIGTGY-LQGEDVASRGRIILVSLGEDPSDPKVWAKELYSKELKGAISALAALQGHLLLA 1161

Query: 787  INQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEE 846
            I  KI L+ W    +G+ EL         +  + +    +F++ GD  KSI  L +K E 
Sbjct: 1162 IGPKIILHTW----NGS-ELIGTAFFDAPLYVVSLNIVKNFVLFGDFHKSIYFLCWKEEG 1216

Query: 847  GAIEERARDYNANWMSAVEILDDDIYLG-----AENNFNLFT-VRKNSEGATDEERGRLE 900
              +   A+D+ +    A E L D   L      +  N  +F+   KN+E    +   +L 
Sbjct: 1217 AQLVLLAKDFGSLDCYATEFLIDGSTLSLLVSDSRKNIQVFSYAPKNAESWKGQ---KLL 1273

Query: 901  VVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKL 960
               E+HLG  V +F    + ++ P S       + FGT++G IG I  L    +  L+ L
Sbjct: 1274 PRVEFHLGSHVTKFLRLQM-LQTPGSSRTNRFALCFGTLDGGIGYITPLDELTFRRLQTL 1332

Query: 961  QTNLRKVIKGVGGLNHEQWRSF--NNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTM 1018
            Q  L  ++  V GLN + +R F  N E       N +D + +  +  LS  +   I++ +
Sbjct: 1333 QRKLVDLVPHVAGLNPKAYRQFQANGEHHKHGPDNTVDSEQLREYESLSLDKQVAIARQI 1392

Query: 1019 NVSVEELCKRVEELT 1033
              + +++   + +++
Sbjct: 1393 GTTRQQIFANLRDIS 1407



 Score = 58.9 bits (141), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 124/589 (21%), Positives = 220/589 (37%), Gaps = 136/589 (23%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAM----GDVSDRIGR 61
           I   +A L   R  G  +D + +A    KF VL +D  + +L T +M    G     + R
Sbjct: 114 IVDSMAILHRGRDGGRHRDAIILAFPAAKFSVLFFDDATQQLKTSSMHYFEGPEWIHLKR 173

Query: 62  PTDN---GQIGIIDPDCRLIGLHLYDGLFKVI--------------PFDN--KGQLKEAF 102
             +    G +   D   R  G+ +Y     ++              P  N    +++ ++
Sbjct: 174 GREKFPGGPLVRADSQGRCAGVLIYKSQLVMMKAAQEAYGLVEEDDPSGNIVSARIESSY 233

Query: 103 NIRLEELQVLDIK---FLYGCAKPTIVVLYQD----------NKDARHVKTYEVALKDKD 149
            + L+EL ++ +K   FLYG  +P + +L++            +D   V    +    K 
Sbjct: 234 VVNLQELGMMHVKDFVFLYGYIEPVVAILHERELTWAGRVTFRRDTCCVTALSINTNTKK 293

Query: 150 FVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRV 209
                W Q  L   A  L+ VP P+ GVL++   +I+Y S  +   + +    T   G +
Sbjct: 294 HPR-LWFQTGLPYDAYSLLAVPSPIGGVLVLCANSILYYSQVSTCIVAVNELATPPAGSL 352

Query: 210 DADGSRY-----------------LLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIAS 252
           +   S++                 LL    G+L  L +  +   V  L++     + ++S
Sbjct: 353 EMPRSKFSIELDAAHATWLSYDAALLSTKTGMLVHLHLIFDGRNVQRLELSKSKGSVLSS 412

Query: 253 TISYLDNAVVYIGSSYGDSQLI---------------------------KLNLQPDAKGS 285
           ++  + +   ++GS  GDS L+                           ++ L  DA   
Sbjct: 413 SLCTIGDMFFFVGSRLGDSLLVQFGSASTSNSLSQSYDGEDDIMVRPSKRMRLDDDANEQ 472

Query: 286 YV------------------EVLERYVNLGPIVDFCVVDL---ERQGQGQ-VVTCSGAYK 323
            +                   V +   N+GPI D         E+ G  Q ++ C G  K
Sbjct: 473 SLYQYKSAVSDSQKNMNFLFSVRDSLCNIGPIRDITGRSQNPSEQPGSAQDLIACCGHGK 532

Query: 324 DGSLRIVRNGIGIN--EQAS---------------VELQGIKGMWSL--RS----STDDP 360
           +GSL I+   I  +   QA+               V+L G  G+W++  RS    +  D 
Sbjct: 533 NGSLNIISRSIRPDFITQANMSLLFFAVAYALFFQVKLPGCVGVWTVYHRSGQIPAEKDE 592

Query: 361 FDTFLVVSFISETRILAM--NLEDELEETE--IEGFCSQTQTLFCHDAIYNQLVQVTSGS 416
           +  +L++S  S T +L     L +  +  E   EG       LF       ++ QV    
Sbjct: 593 YHAYLIISLESRTMVLETGETLGEVTDSVEYYTEGPSISAGNLFGR----RRIAQVYQKG 648

Query: 417 VRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD 465
           VR++   +R+ ++     PG ++  A+     VLL    G    L +GD
Sbjct: 649 VRILDG-ARQTQDLQVGEPGNAIESASFADPYVLLRMQDGS-CQLVVGD 695


>gi|302761560|ref|XP_002964202.1| hypothetical protein SELMODRAFT_82277 [Selaginella moellendorffii]
 gi|300167931|gb|EFJ34535.1| hypothetical protein SELMODRAFT_82277 [Selaginella moellendorffii]
          Length = 1413

 Score = 91.7 bits (226), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 86/315 (27%), Positives = 145/315 (46%), Gaps = 24/315 (7%)

Query: 733  CVGTAYVLPEENEPTKGRI-LVFIVEDGK-----LQLIAEKETKGAVYSLNAFNGKLLAA 786
             +GT Y L  E+  ++GRI LV + ED        + +  KE KGA+ +L A  G LL A
Sbjct: 1103 AIGTGY-LQGEDVASRGRIILVSLGEDPSDPKVWAKELYSKELKGAISALAALQGHLLLA 1161

Query: 787  INQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEE 846
            I  KI L+ W    +G+ EL         +  + +    +F++ GD  KSI  L +K E 
Sbjct: 1162 IGPKIILHTW----NGS-ELIGTAFFDAPLYVVSLNIVKNFVLFGDFHKSIYFLCWKEEG 1216

Query: 847  GAIEERARDYNANWMSAVEILDDDIYLG-----AENNFNLFT-VRKNSEGATDEERGRLE 900
              +   A+D+ +    A E L D   L      +  N  +F+   KN+E    +   +L 
Sbjct: 1217 AQLVLLAKDFGSLDCYATEFLIDGSTLSLLVSDSRKNIQVFSYAPKNAESWKGQ---KLL 1273

Query: 901  VVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKL 960
               E+HLG  V +F    + ++ P S       + FGT++G IG I  L    +  L+ L
Sbjct: 1274 PRVEFHLGSHVTKFLRLQM-LQTPGSSRTNRFALCFGTLDGGIGYITPLDELTFRRLQTL 1332

Query: 961  QTNLRKVIKGVGGLNHEQWRSF--NNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTM 1018
            Q  L  ++  V GLN + +R F  N E       N +D + +  +  LS  +   I++ +
Sbjct: 1333 QRKLVDLVPHVAGLNPKAYRQFQANGEHHKHGPDNTVDSEQLREYESLSLDKQVAIARQI 1392

Query: 1019 NVSVEELCKRVEELT 1033
              + +++   + +++
Sbjct: 1393 GTTRQQIFANLRDIS 1407



 Score = 61.6 bits (148), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 122/589 (20%), Positives = 222/589 (37%), Gaps = 136/589 (23%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGD------VSDRI 59
           I   +A L   R  G  +D + +A    KF VL +D  + +L T +M        +  + 
Sbjct: 114 IVDSMAILHRGRDGGRHRDAIILAFPAAKFSVLFFDDATQQLKTSSMHYFEGPEWIHLKR 173

Query: 60  GRPTDNGQ-----------IGIIDPDCRLIGLHLYDGLFKVIPFDN------KGQLKEAF 102
           GR    G             G++   C+L+ +      + ++  D+        +++ ++
Sbjct: 174 GREKFPGGPLVRADSQGRCAGVLIYKCQLVMMKAAQEAYGLVEEDDPSGNIVSARIESSY 233

Query: 103 NIRLEELQVLDIK---FLYGCAKPTIVVLYQD----------NKDARHVKTYEVALKDKD 149
            + L+EL ++ +K   FLYG  +P + +L++            +D   V    +    K 
Sbjct: 234 VVNLQELGMMHVKDFVFLYGYIEPVVAILHERELTWAGRVTFRRDTCCVTALSINTNTKK 293

Query: 150 FVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRV 209
                W Q  L   A  L+ VP P+ GVL++   +I+Y S  +   + +    T   G +
Sbjct: 294 HPR-LWFQTGLPYDAYSLLAVPSPIGGVLVLCANSILYYSQVSTCIVAVNELATPPAGSL 352

Query: 210 DADGSRY-----------------LLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIAS 252
           +   S++                 LL    G+L  L +  +   V  L++     + ++S
Sbjct: 353 EMPRSKFSIELDAAHATWLSYDAALLSTKTGMLVHLHLIFDGRNVQRLELSKSKGSVLSS 412

Query: 253 TISYLDNAVVYIGSSYGDSQLI---------------------------KLNLQPDAKGS 285
           ++  + +   ++GS  GDS L+                           ++ L  DA   
Sbjct: 413 SLCTIGDKFFFVGSRLGDSLLVQFGSASTSNSLEHSYDGEDDIMVRPSKRMRLDDDASEQ 472

Query: 286 YV------------------EVLERYVNLGPIVDFCVVDL---ERQGQGQ-VVTCSGAYK 323
            +                   V +   N+GPI D         E+ G  Q ++ C G  K
Sbjct: 473 SLYQYKSGVSDSQKNMNFLFSVRDSLCNIGPIRDITCRSQNPSEQPGSAQDLIACCGHGK 532

Query: 324 DGSLRIVRNGIGIN--EQAS---------------VELQGIKGMWSL--RS----STDDP 360
           +GSL I+   I  +   QA+               V+L G  G+W++  RS    +  D 
Sbjct: 533 NGSLNIISRSIRPDFITQANMSLLFFAVAYALFFQVKLPGCVGVWTVYHRSGQIPAEKDE 592

Query: 361 FDTFLVVSFISETRILAM--NLEDELEETE--IEGFCSQTQTLFCHDAIYNQLVQVTSGS 416
           +  +L++S  S T +L     L +  +  E   EG       LF       ++ QV    
Sbjct: 593 YHAYLIISLESRTMVLETGETLGEVTDSVEYYTEGPSISAGNLFGR----RRIAQVYQKG 648

Query: 417 VRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD 465
           VR++   +R+ ++     PG ++  A+     VLL    G    L +GD
Sbjct: 649 VRILDG-ARQTQDLQVGEPGNAIESASFADPYVLLRMQDGS-CQLVVGD 695


>gi|255539681|ref|XP_002510905.1| cleavage and polyadenylation specificity factor cpsf, putative
            [Ricinus communis]
 gi|223550020|gb|EEF51507.1| cleavage and polyadenylation specificity factor cpsf, putative
            [Ricinus communis]
          Length = 1461

 Score = 91.3 bits (225), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 104/404 (25%), Positives = 185/404 (45%), Gaps = 50/404 (12%)

Query: 664  RRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQT----FEFISTYPLDTFEYGCSI 719
            + + HQ ++   +   L      EE E   VR+L+ +     ++  +T P+ + E   ++
Sbjct: 1068 QEVGHQIENHNLSSDELLQTYSVEEFE---VRILESENGGGPWQTKATIPMQSSENALTV 1124

Query: 720  ----LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIV----EDGKLQLIAE---KE 768
                L  + + ++     +GTAYV   E+   +GR+L+F V    E+ ++ L++E   KE
Sbjct: 1125 RVVTLFNATTKENETLLAIGTAYV-QGEDVAARGRVLLFSVVKSTENSQV-LVSEVYSKE 1182

Query: 769  TKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRG--- 825
             KGA+ +L +  G LL A   KI L+KW     GT EL     +      LYV +     
Sbjct: 1183 LKGAISALASLQGHLLIASGPKIILHKWT----GT-ELNGVAFYDAP--PLYVASMNIVK 1235

Query: 826  DFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG-----AENNFN 880
            +FI++GD+ KSI  L +K +   +   A+D+ +    A E L D   L       + N  
Sbjct: 1236 NFILLGDIHKSIYFLSWKEQGAQLSLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQ 1295

Query: 881  LFTVRKNSEGATDEERG-RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIP------- 932
            +F     +    +  +G +L    E+H+G  + +F   S++    D   G  P       
Sbjct: 1296 IFYY---APKMLESWKGQKLLSRAEFHVGAHITKFIRLSMLSTSSDRS-GAAPGPDKTNR 1351

Query: 933  -TVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKT--V 989
              ++FGT++G IG IA L    +  L+ LQ  L   +  V GLN   +R F ++ K    
Sbjct: 1352 FALLFGTLDGSIGCIAPLDELTFRRLQSLQRKLVDAVPHVAGLNPRSFRQFRSDGKVHRP 1411

Query: 990  DAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELT 1033
              ++ +D +L+  F  L      EI++ +  +  ++   + +L+
Sbjct: 1412 GPESIVDCELLSHFEMLPLEEQLEIAQQVGTTRAQILSNLNDLS 1455



 Score = 57.0 bits (136), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 73/309 (23%), Positives = 130/309 (42%), Gaps = 54/309 (17%)

Query: 23  QDFLFIATERYKFCVLQWDAESSELITRAMGDVSD------RIGRPT-DNGQIGIIDPDC 75
           +D + +A +  K  VL++D     L T +M           + GR +   G +  +DP  
Sbjct: 131 RDSIILAFKDAKISVLEFDDSIHGLRTSSMHCFEGPEWLHLKRGRESFARGPLLKVDPQG 190

Query: 76  RLIGLHLYDGLFKVIPFDN------------------KGQLKEAFNIRLEEL---QVLDI 114
           R  G+ +YD    ++                        +++ ++ I L ++    V D 
Sbjct: 191 RCGGILVYDMQMIILRAAQASSGLVGDDDALSSGGSISARVQSSYVINLRDMDMKHVKDF 250

Query: 115 KFLYGCAKPTIVVLYQ------DNKDARHVKTYEVALKDKDFVEGP---WSQNNLDNGAD 165
            FL+   +P +V+L++           +H      AL     ++ P   WS  NL + A 
Sbjct: 251 IFLHDYIEPVVVILHERELTWAGRVSWKHHTCMISALSISTTLKQPTLIWSVVNLPHDAY 310

Query: 166 LLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIR---------PSITKAYGRVDADGSR- 215
            L+ VPPP+ GVL+I   TI Y S +A  A+ +            + +A   V+ D  + 
Sbjct: 311 KLLAVPPPIGGVLVICANTIHYHSESATYALALNNYAVSIDSSQELPRASFSVELDAVKA 370

Query: 216 -YLLGDHA------GLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSY 268
            +LL D A      G L LL + ++   V  L +     + + S I+ + N++ ++GS  
Sbjct: 371 AWLLNDVALLSAKNGELLLLSLVYDGRVVQRLDLSKSKASVLTSDITTIGNSLFFLGSRL 430

Query: 269 GDSQLIKLN 277
           GDS L++  
Sbjct: 431 GDSLLVQFT 439



 Score = 45.1 bits (105), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 36/148 (24%), Positives = 63/148 (42%), Gaps = 27/148 (18%)

Query: 277 NLQPDAKGSYVEVLERYVNLGPIVDFC----------VVDLERQGQGQVVTCSGAYKDGS 326
           N +   K     V +  +N+GP+ DF              + +Q    +V CSG  K+G+
Sbjct: 494 NTESAQKSFSFAVRDSLINVGPLKDFSYGLRSNYDASATGIAKQSNYDLVCCSGHGKNGT 553

Query: 327 LRIVRNGIGINEQASVELQGIKGMWSL--------------RSSTDDPFDTFLVVSFISE 372
           L I+R  I       V+L G +G+W++               ++  D +  +L++S   E
Sbjct: 554 LCILRQSIRPEMITEVDLPGCRGIWTVYHKNARGHNVDLSKMAAAADEYHAYLIISM--E 611

Query: 373 TRILAMNLEDELEE-TEIEGFCSQTQTL 399
            R + +   D L E TE   +  Q +T+
Sbjct: 612 ARTMVLETADLLSEVTESVDYFVQGRTI 639


>gi|330799483|ref|XP_003287774.1| hypothetical protein DICPUDRAFT_32967 [Dictyostelium purpureum]
 gi|325082229|gb|EGC35718.1| hypothetical protein DICPUDRAFT_32967 [Dictyostelium purpureum]
          Length = 1453

 Score = 91.3 bits (225), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 108/496 (21%), Positives = 210/496 (42%), Gaps = 64/496 (12%)

Query: 584  SSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLK-EVSHMCPFNSAAFPDSLAIAKEGE-- 640
            SSKN   VF    +P  I++    +   +++    +  + PF++A  P+      E +  
Sbjct: 954  SSKNNG-VFIGGQKPVWIFNEKGYIRLHSMDFDGAIVSLKPFHNADCPNGFLYYTEDKQH 1012

Query: 641  LTIGTIDDIQKLH----IRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRL 696
            + IG ++ +        IR +P+     +I +  + + + +     Q   E  E     +
Sbjct: 1013 IKIGYLNGLMNFENEYAIRRVPIKLSAHKIAYHNELKCYVVVVSFPQVTQELEEDSKKPI 1072

Query: 697  LDDQTFE-------------FISTYPLDTFE--YGCSILSCSFSDD-----SNVYYCVGT 736
            L D+ F+             FI ++ L   E      I+S  F +      S  +  +GT
Sbjct: 1073 LTDEKFQIKIIDPTIDWSWRFIDSFSLQDRETVLAMKIVSLKFKESDETIKSKPFLVIGT 1132

Query: 737  AYVLPEENEPTKGRILVFIVEDGKLQ------------LIAEKETKGAVYSLNAFNGKLL 784
            A+   E+ +  KGR+LVF +   K Q            L+ EKE KG V +L++ +G LL
Sbjct: 1133 AFTFGEDTQ-CKGRVLVFEIVSHKTQFESDDLGTKRLNLLYEKEQKGPVTALSSVSGLLL 1191

Query: 785  AAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKH 844
              I  K+ + +++     T +L +   H   I    + T   +IV+GD+ KS+  L +  
Sbjct: 1192 MTIGPKLTVNQFL-----TGQLVTLSFHDAQIYICSISTIKTYIVIGDMYKSVYFLQWNG 1246

Query: 845  EEGAIEERARDYNANWMSAVEILDDDIYLG---AENNFNLFTVRKNSEGATDEERGRLEV 901
            ++  +   ++DY +  + + E + +   L    ++ + N+     +    T  +   L  
Sbjct: 1247 KQ--LVPLSKDYQSLNIFSTEFIVNQQTLSILVSDLDKNILLFSFDPADPTSRQGQMLLC 1304

Query: 902  VGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPT---VIFGTVNGVIGVIASLPHEQYLFLE 958
              ++H+G  + +F    +   +  S  G       V FGT++G + V+  L    Y    
Sbjct: 1305 KADFHIGSNIEKFVRTPMKFNIQSSSNGNNNNDQLVFFGTLDGSLNVLRPLDERMYQLFY 1364

Query: 959  KLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVD---------AKNFLDGDLIESFLDLSRT 1009
             LQ+ L   +    GLN +Q+R+F +  +             K  LDGDL+  F+ L++ 
Sbjct: 1365 HLQSKLY-YLPQPAGLNAKQYRAFKSFSQNFHFSPSTIHQLPKYILDGDLLSKFVKLNQK 1423

Query: 1010 RMDEISKTMNVSVEEL 1025
                ++ ++  + +E+
Sbjct: 1424 ERRLLASSVGSNTDEI 1439



 Score = 41.2 bits (95), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 63/337 (18%), Positives = 125/337 (37%), Gaps = 69/337 (20%)

Query: 1   MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
           ++++ ++G I ++   R  GE +D L +     K  VL ++ +  +   R++    +   
Sbjct: 72  LIELKLFGTIESMASVRYPGENKDSLLLTFRDAKISVLDYNIDIMDFEIRSLHFYENDEF 131

Query: 61  RPTDNGQIGI-------IDPDCRLIGLHLYDGLFKVIPFDNKGQL--------------- 98
           +   NG+I         ID   R   + LYD    V+PF     +               
Sbjct: 132 K---NGRIHFKHPPILKIDTQQRCATMLLYDRNIVVLPFKQISSILDDEDEEEKDEEDEK 188

Query: 99  --KEAFNIRLEELQVL----DIKFLYGCAKPTIVVLYQDNKDARHVKTYEVALK------ 146
               A     EE        +  FLYG  +PTI+ L++         T  +A+K      
Sbjct: 189 ENDNANQDYTEEFDDDDDDNNFCFLYGYYEPTILFLHE----PSQTWTSRIAVKRLTSQL 244

Query: 147 -------DKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYC--------SAN 191
                          W  +N+    D L+ VP PL G L+I    + +         + N
Sbjct: 245 TAISINFSTKLASIIWHTSNMPYNCDQLVSVPEPLSGALVITPNIMFHVNQTSKYGLAVN 304

Query: 192 AFKAI--------PIRPSITKAY-----GRVDADGSRYLLGDHAGLLHLLVITHEKEKVT 238
            +  I        P+  ++   +       V  +  +++     G L +  +  +   V 
Sbjct: 305 EYANIDIGDKFEFPLDETLNLVFTLDRSNFVFLEADKFIGSLKGGELLIFHLISDGRTVQ 364

Query: 239 GLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIK 275
            + +   G + +A+ +  + + ++++GS  GDS L++
Sbjct: 365 RIHVSKAGGSVLATCMCVVSDNLLFLGSRLGDSLLLQ 401


>gi|67472417|ref|XP_652013.1| splicing factor 3B subunit 3 [Entamoeba histolytica HM-1:IMSS]
 gi|56468814|gb|EAL46625.1| splicing factor 3B subunit 3, putative [Entamoeba histolytica
            HM-1:IMSS]
 gi|449709507|gb|EMD48761.1| premRNA-splicing factor rse-1, putative [Entamoeba histolytica KU27]
          Length = 1145

 Score = 91.3 bits (225), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 174/883 (19%), Positives = 343/883 (38%), Gaps = 109/883 (12%)

Query: 217  LLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQL--- 273
            L  +   L  L   T EK ++  + ++       A ++ +           YGDS L   
Sbjct: 299  LQSEFGDLFQLSFDTKEKGQLDNIHLKYFDSIPTALSLQFSKRGHFLCVGEYGDSILYNI 358

Query: 274  -----IKLNLQPDAKGSY--------VEVLERYVNLGPIVDFCVVD-LERQGQGQVVTCS 319
                 I +  + D +  +        +E + R+ +L P++D  V      Q   ++    
Sbjct: 359  ISMENISIPFEKDGRLEFERHKEILNLEEIYRFKSLAPLIDLKVAPATSSQDTTKMYAFC 418

Query: 320  GAYKDGSLRIVRNGIGINEQASVELQGIK-GMWSLRSSTDDPFDTFLVVSFISETRILAM 378
            G     +++I++N +   +   +EL  I   +W L+  TD+ +  +L +S+ + T +L +
Sbjct: 419  GKGNQSTIKILKNQLNTLDAVEIELPAIPISIWPLKKETDE-YHQYLAISYSNITTLLKI 477

Query: 379  NLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYS 438
              EDE+ E         T +L          +QV +  + + S   ++   E +      
Sbjct: 478  T-EDEMSECTTSPILLSTPSLLVSMLFDGTFLQVMTDRIIIYSEPIQQFITEDQK----- 531

Query: 439  VNVATANASQVLLATGGGH---LVYLEIGDGILTEVKHAQLEYEISCLDINPIGENPSYS 495
               A+ N  +++++    +   L+Y +   G L  ++      +I+ L ++    +PS  
Sbjct: 532  YVCASCNGCELIVSVEKNNQTSLIYFQYQSGHLLTMERKDNLSKITALALDQF--HPS-- 587

Query: 496  QIAAVGMWTDISVRIFSL-PD-----LNLITKEHLGGEIIPRSVLLCAFEGISYLLCALG 549
            +  A+G   D SV + SL P+     L+ ++ +     I   S+     +  SYL   L 
Sbjct: 588  KHCAIGC-IDGSVHLLSLIPNETTKALSRVSLQTYECSI--NSLTFNIIDNTSYLFAGLS 644

Query: 550  DGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLL 609
            +G L   + +  +GE+ +     +G++P+TL          V A S R  + Y +  K+ 
Sbjct: 645  NGLLGRSVYDPISGEINESSLNFVGSRPVTLSNVKDCGEDSVLAISGRSLLSYKNGTKIK 704

Query: 610  YSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDI-QKLHIRSIPLGEHPRRICH 668
             + +N++  +  C        +++AI  E  + I TI+     L  ++I +   PR++  
Sbjct: 705  TTPLNIQNTTLACGILVPFVDNAIAIICEKVMKIITIESTSSSLTGKNIQISYTPRKVIT 764

Query: 669  QEQSRTFAICSLKNQSCA------EESEMHFV---RLLDDQTFEFISTYPLDTFEY---G 716
                    I    N SC       + +E ++V     LD    E I     D  ++   G
Sbjct: 765  HPTIPLLYILEGDNNSCKVGNEIIQTNEGNWVGGIHTLDASQDELIQFIDFDNNKHPTGG 824

Query: 717  CSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGR---ILVFIVEDGKLQLIAEKETKGAV 773
            C + S S    +  Y  +G   +   +  P + +   I V+ + +  +      + +  V
Sbjct: 825  CVVRSIS---KNQTYLIIGV--IESYKTRPIQWKSSEIQVYSINERSINYCYSTKVEYPV 879

Query: 774  YSLNAFNGKLLAAINQKIQLY----KWMLRDDGTRELQSECGHHGHILALYVQTRGDFIV 829
             +   F G +LA +   ++LY    K +LR    R+  S+           +   G+ I+
Sbjct: 880  RAFAEFKGMVLAGVGNILRLYDIGLKSLLRKAEKRQFASDIAQ--------LHVIGETIL 931

Query: 830  VGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGA-------------- 875
            +  +    +L+ Y       +  A D    W+     L+    L +              
Sbjct: 932  LTGVSDGFNLIRYNQINHKFDIYA-DSLPRWVVTAAPLNQSTVLASDKFGEIFMYQLPKE 990

Query: 876  --ENNFNLF-TVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIP 932
              E   N F T+ +  +   +    ++    ++ +G+    F   SL+   P        
Sbjct: 991  IEEQALNPFSTLLQPHKTIYEGSSYKMVTATQFFVGDIATSFAQCSLIPGAPS------- 1043

Query: 933  TVIFGTVNGVIGVIASLP---HEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTV 989
              IF   N + G+ A +P        F + L+ ++R     +   NH  +RS       V
Sbjct: 1044 --IFLYSNFMGGLSALIPLQSQNDIDFYQHLEMHMRVHWTNLTDRNHISFRS-----SIV 1096

Query: 990  DAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
              K+ +DGDL E +  L      EI++ M   V E+ K++ +L
Sbjct: 1097 PVKDTVDGDLCELYERLPYEIQQEIAEEMEKEVNEIIKKLHDL 1139


>gi|6671952|gb|AAF23212.1|AC016795_25 hypothetical protein [Arabidopsis thaliana]
 gi|10998135|dbj|BAB03106.1| unnamed protein product [Arabidopsis thaliana]
          Length = 1331

 Score = 91.3 bits (225), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 57/209 (27%), Positives = 106/209 (50%), Gaps = 20/209 (9%)

Query: 295 NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINE--QASVELQGIKGMWS 352
           N+ PI+DF V+D + + + Q+  C G   +GSLRI+R+GI + +  + +   QGI G W+
Sbjct: 440 NIAPILDFSVMDDQNEKRDQIFACCGVTPEGSLRIIRSGINVEKLLKTAPVYQGITGTWT 499

Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
           ++    D + +FLV+SF+ ETR+L++ L  + + T+  GF S   T  C       LVQ+
Sbjct: 500 VKMKLTDVYHSFLVLSFVEETRVLSVGLSFK-DVTDSVGFQSDVCTFACGLVADGLLVQI 558

Query: 413 TSGSVRLVSST----------SRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLE 462
              ++RL   T          S    + W  P   S+++     + ++++T     + + 
Sbjct: 559 HQDAIRLCMPTMDAHSDGIPVSSPFFSSW-FPENVSISLGAVGQNLIVVSTSNPCFLSIL 617

Query: 463 IGDGI------LTEVKHAQLEYEISCLDI 485
               +      + E++   L+YE+SC+ +
Sbjct: 618 GVKSVSSQCCEIYEIQRVTLQYEVSCISV 646



 Score = 52.8 bits (125), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 36/126 (28%), Positives = 60/126 (47%), Gaps = 16/126 (12%)

Query: 905  YHLGEFVNRFRHGSLVMRLPDSDVGQI-----------PTVIFGTVNGVIGVIASLPHEQ 953
            Y++GE     + G  + +LP  DV +             T+I GT+ G I V A +  E+
Sbjct: 1172 YYMGEIAMSIKKGCNIYKLPADDVLRSYGLSKSIDTADDTIIAGTLLGSIFVFAPISSEE 1231

Query: 954  YLFLEKLQTNL--RKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRM 1011
            Y  LE +Q  L    +   V G +H ++R   N  +   A+  LDGD++  FL+L+  + 
Sbjct: 1232 YELLEGVQAKLGIHPLTAPVLGNDHNEFRGRENPSQ---ARKILDGDMLAQFLELTNRQQ 1288

Query: 1012 DEISKT 1017
            + +  T
Sbjct: 1289 ESVLST 1294


>gi|30681985|ref|NP_850565.1| Cleavage and polyadenylation specificity factor (CPSF) A subunit
           protein [Arabidopsis thaliana]
 gi|332641609|gb|AEE75130.1| Cleavage and polyadenylation specificity factor (CPSF) A subunit
           protein [Arabidopsis thaliana]
          Length = 1329

 Score = 91.3 bits (225), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 57/209 (27%), Positives = 106/209 (50%), Gaps = 20/209 (9%)

Query: 295 NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINE--QASVELQGIKGMWS 352
           N+ PI+DF V+D + + + Q+  C G   +GSLRI+R+GI + +  + +   QGI G W+
Sbjct: 410 NIAPILDFSVMDDQNEKRDQIFACCGVTPEGSLRIIRSGINVEKLLKTAPVYQGITGTWT 469

Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
           ++    D + +FLV+SF+ ETR+L++ L  + + T+  GF S   T  C       LVQ+
Sbjct: 470 VKMKLTDVYHSFLVLSFVEETRVLSVGLSFK-DVTDSVGFQSDVCTFACGLVADGLLVQI 528

Query: 413 TSGSVRLVSST----------SRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLE 462
              ++RL   T          S    + W  P   S+++     + ++++T     + + 
Sbjct: 529 HQDAIRLCMPTMDAHSDGIPVSSPFFSSW-FPENVSISLGAVGQNLIVVSTSNPCFLSIL 587

Query: 463 IGDGI------LTEVKHAQLEYEISCLDI 485
               +      + E++   L+YE+SC+ +
Sbjct: 588 GVKSVSSQCCEIYEIQRVTLQYEVSCISV 616



 Score = 52.8 bits (125), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 36/126 (28%), Positives = 60/126 (47%), Gaps = 16/126 (12%)

Query: 905  YHLGEFVNRFRHGSLVMRLPDSDVGQI-----------PTVIFGTVNGVIGVIASLPHEQ 953
            Y++GE     + G  + +LP  DV +             T+I GT+ G I V A +  E+
Sbjct: 1170 YYMGEIAMSIKKGCNIYKLPADDVLRSYGLSKSIDTADDTIIAGTLLGSIFVFAPISSEE 1229

Query: 954  YLFLEKLQTNL--RKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRM 1011
            Y  LE +Q  L    +   V G +H ++R   N  +   A+  LDGD++  FL+L+  + 
Sbjct: 1230 YELLEGVQAKLGIHPLTAPVLGNDHNEFRGRENPSQ---ARKILDGDMLAQFLELTNRQQ 1286

Query: 1012 DEISKT 1017
            + +  T
Sbjct: 1287 ESVLST 1292


>gi|42564075|ref|NP_187802.2| Cleavage and polyadenylation specificity factor (CPSF) A subunit
           protein [Arabidopsis thaliana]
 gi|29824376|gb|AAP04148.1| unknown protein [Arabidopsis thaliana]
 gi|110739103|dbj|BAF01468.1| hypothetical protein [Arabidopsis thaliana]
 gi|332641608|gb|AEE75129.1| Cleavage and polyadenylation specificity factor (CPSF) A subunit
           protein [Arabidopsis thaliana]
          Length = 1379

 Score = 91.3 bits (225), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 57/209 (27%), Positives = 106/209 (50%), Gaps = 20/209 (9%)

Query: 295 NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINE--QASVELQGIKGMWS 352
           N+ PI+DF V+D + + + Q+  C G   +GSLRI+R+GI + +  + +   QGI G W+
Sbjct: 468 NIAPILDFSVMDDQNEKRDQIFACCGVTPEGSLRIIRSGINVEKLLKTAPVYQGITGTWT 527

Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
           ++    D + +FLV+SF+ ETR+L++ L  + + T+  GF S   T  C       LVQ+
Sbjct: 528 VKMKLTDVYHSFLVLSFVEETRVLSVGLSFK-DVTDSVGFQSDVCTFACGLVADGLLVQI 586

Query: 413 TSGSVRLVSST----------SRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLE 462
              ++RL   T          S    + W  P   S+++     + ++++T     + + 
Sbjct: 587 HQDAIRLCMPTMDAHSDGIPVSSPFFSSW-FPENVSISLGAVGQNLIVVSTSNPCFLSIL 645

Query: 463 IGDGI------LTEVKHAQLEYEISCLDI 485
               +      + E++   L+YE+SC+ +
Sbjct: 646 GVKSVSSQCCEIYEIQRVTLQYEVSCISV 674



 Score = 52.4 bits (124), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 36/126 (28%), Positives = 60/126 (47%), Gaps = 16/126 (12%)

Query: 905  YHLGEFVNRFRHGSLVMRLPDSDVGQI-----------PTVIFGTVNGVIGVIASLPHEQ 953
            Y++GE     + G  + +LP  DV +             T+I GT+ G I V A +  E+
Sbjct: 1220 YYMGEIAMSIKKGCNIYKLPADDVLRSYGLSKSIDTADDTIIAGTLLGSIFVFAPISSEE 1279

Query: 954  YLFLEKLQTNL--RKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRM 1011
            Y  LE +Q  L    +   V G +H ++R   N  +   A+  LDGD++  FL+L+  + 
Sbjct: 1280 YELLEGVQAKLGIHPLTAPVLGNDHNEFRGRENPSQ---ARKILDGDMLAQFLELTNRQQ 1336

Query: 1012 DEISKT 1017
            + +  T
Sbjct: 1337 ESVLST 1342


>gi|168021793|ref|XP_001763425.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162685218|gb|EDQ71614.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 1452

 Score = 90.9 bits (224), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 84/322 (26%), Positives = 144/322 (44%), Gaps = 22/322 (6%)

Query: 725  SDDSNVYYCVGTAYVLPEENEPTKGRI-LVFIVEDGK-----LQLIAEKETKGAVYSLNA 778
            +D +     +GT+YV   E+   KGRI LV + +D +      + +  KE KG++ ++ +
Sbjct: 1134 TDQTQTLLAIGTSYV-QGEDVAAKGRIILVSVGKDPQDPGSWAREVYSKELKGSISAIAS 1192

Query: 779  FNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSIS 838
              G LL AI  KI L+ W    +G+ EL         +  + +    +FI+ GD+ KSI 
Sbjct: 1193 LQGHLLIAIGPKIILHSW----NGS-ELNGAAFFDAPLYVVSLNIVKNFILFGDIHKSIY 1247

Query: 839  LLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG-----AENNFNLFTVRKNSEGATD 893
             L +K +   +   A+D+ +    A E L D   L      +  N  +F+    S  +  
Sbjct: 1248 FLCWKEDGAQLTLLAKDFGSLDCYATEFLIDGSTLSLLVSDSRKNLQIFSYAPKSMESWK 1307

Query: 894  EERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQ 953
             ++  L    E+HLG  VN+F H   ++  P S       V+FGT++G I  +A L    
Sbjct: 1308 GQK--LLSRAEFHLGAHVNKF-HRLQMLPTPGSARSNRYAVLFGTLDGAIDYLAPLDELT 1364

Query: 954  YLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKT--VDAKNFLDGDLIESFLDLSRTRM 1011
            +  L  LQ  L   +  V G+N   +R F  + K       N +D +L+  +  L     
Sbjct: 1365 FRRLHTLQRKLVDCVSHVAGVNPRAFRQFRCDGKAHRPGPDNIVDCELLSHYDMLPLDEQ 1424

Query: 1012 DEISKTMNVSVEELCKRVEELT 1033
             EI++ +  +   +   + +L 
Sbjct: 1425 LEIARQIGTTRAHVLSNLRDLA 1446



 Score = 53.1 bits (126), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 86/404 (21%), Positives = 147/404 (36%), Gaps = 107/404 (26%)

Query: 122 KPTIVVLYQDNKD-------ARHVKTYEVALKDKDFVEGP--WSQNNLDNGADLLIPVPP 172
           +P +VVL++ +          RH         +    + P  WS  NL   A  L+ VP 
Sbjct: 273 EPVLVVLHEKDPTWAGRVAVRRHTCAITALSINTTLKQHPHIWSATNLPYDAYKLLAVPA 332

Query: 173 PLCGVLIIGEETIVYCSANAFKAI-----PIRPSITKAYGR----VDADGSRY------- 216
           P+ GVL+    ++ Y S +   A+      + P  +  Y R    V+ D +         
Sbjct: 333 PIGGVLVFCANSLHYHSQSGSCALGLNEFAVAPEGSAEYPRSKMSVELDCAHATWVANEV 392

Query: 217 -LLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIK 275
            L+    G+L  L + +E   V  L++     + + S +  +     ++GS   DS L++
Sbjct: 393 ALISTKNGMLLFLNLVYEGRSVQRLELTKSKASVLTSCMCTIGENFFFLGSRLADSLLVQ 452

Query: 276 LNL-------------------QPDAKGSYVE---------------------------- 288
             L                    P AK    E                            
Sbjct: 453 HTLGSASGRTSSLMGDIEADLSAPAAKRLKREPSEEEEGVSAEEMSLYYSTPTASDISQK 512

Query: 289 ------VLERYVNLGPIVDFC----------VVDLERQGQGQVVTCSGAYKDGSLRIVRN 332
                 V +  VN+ P+ DF              L +Q   ++V CSG  K+GSL ++  
Sbjct: 513 KTFTFTVRDSLVNICPLRDFAYGLRSNADQSATGLGKQSNYELVACSGHGKNGSLSVLHQ 572

Query: 333 GIGINEQASVELQGIKGMWSLR--------------SSTDDPFDTFLVVSFISETRILAM 378
            I  +    V L G  G+W++               +S DD F  +L++S   E+R + +
Sbjct: 573 SIRPDLINKVALPGCSGIWTVYHKTDRDDSNEFDFGTSEDDEFHAYLIISL--ESRTMVL 630

Query: 379 NLEDELEE-TEIEGFCSQTQTLFCHDAIYNQ-LVQVTSGSVRLV 420
              D L E TE   + ++  T+   +    + +VQV    +RL+
Sbjct: 631 ETGDTLGEVTENVEYYTEGNTIAAGNLFGRRFVVQVYQNGLRLL 674


>gi|407044064|gb|EKE42344.1| CPSF A subunit region protein, putative [Entamoeba nuttalli P19]
          Length = 1145

 Score = 90.9 bits (224), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 160/797 (20%), Positives = 315/797 (39%), Gaps = 93/797 (11%)

Query: 287  VEVLERYVNLGPIVDFCVVD-LERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQ 345
            +E + R+ +L P++D  V      Q   ++    G     +++I++N +   +   +EL 
Sbjct: 385  LEEIYRFKSLAPLIDLKVAPATSSQDTTKMYAFCGKGNQSTIKILKNQLNTLDAVEIELP 444

Query: 346  GIK-GMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDA 404
             I   +W L+  TD+ +  +L +S+ + T +L +  EDE+ E         T +L     
Sbjct: 445  AIPISIWPLKKETDE-YHQYLAISYSNITTLLKIT-EDEMSECTTSPILLSTPSLLVSML 502

Query: 405  IYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGH---LVYL 461
                 +QV +  + + S   ++   E +         A+ N S+++++    +   L+Y 
Sbjct: 503  FDGTFLQVMTDRIIIYSEPIQQFITEDQK-----YVCASCNGSELIVSVEKNNQTSLIYF 557

Query: 462  EIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PD----- 515
            +   G L  ++      +I+ L ++    +PS  +  A+G   D SV + SL P+     
Sbjct: 558  QYQSGHLLTMERKDNLSKITALALDQF--HPS--KHCAIGC-IDGSVHLLSLIPNETTKA 612

Query: 516  LNLITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGT 575
            L+ ++ +     I   S+     +  SYL   L +G L   + +  +GE+ +     +G+
Sbjct: 613  LSRVSLQTYECSI--NSLTFNIIDNTSYLFAGLSNGLLGRSVYDPISGEINESSLNFVGS 670

Query: 576  QPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAI 635
            +P+TL          V A S R  + Y +  K+  + +N++  +  C        +++AI
Sbjct: 671  RPVTLSNVKDCGEDSVLAISGRSLLSYKNGTKIKTTPLNIQNTTLACGILVPFVDNAIAI 730

Query: 636  AKEGELTIGTIDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEM--- 691
              E  + I TI+     L  ++I +   PR++          I    N SC   +E+   
Sbjct: 731  ICEKVMKIITIESTSSSLTGKNIQISYTPRKVITHPTIPLLYILEGDNNSCKVGNEIIQT 790

Query: 692  ------HFVRLLDDQTFEFISTYPLDTFEY---GCSILSCSFSDDSNVYYCVGTAYVLPE 742
                    +  LD    E I     D  ++   GC + S S    +  Y  VG   +   
Sbjct: 791  NEGNWVGGIHTLDASQDELIQFIDFDNNKHPTGGCVVRSIS---KNQTYLIVGV--IESY 845

Query: 743  ENEPTKGR---ILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLY----K 795
            +  P + +   I V+ + +  +      + +  V +   F G +LA +   ++LY    K
Sbjct: 846  KTRPIQWKSSEIQVYSINERSINYCYSTKVEYPVRAFAEFKGMVLAGVGNILRLYDIGLK 905

Query: 796  WMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARD 855
             +L+    R+  S+           +   G+ I++  +    +L+ Y       +  A D
Sbjct: 906  SLLKKAEKRQFASDIAQ--------LHVIGETILLTGVSDGFNLIRYNQINHKFDIYA-D 956

Query: 856  YNANWMSAVEILDDDIYLGA----------------ENNFNLF-TVRKNSEGATDEERGR 898
                W+     L+    L +                E   N F T+ +  +   +    +
Sbjct: 957  SLPRWVVTAAPLNQSTVLASDKFGEIFMYQLPKEIEEQALNPFSTLLQPHKTIYEGSSYK 1016

Query: 899  LEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLP---HEQYL 955
            +    ++ +G+    F   SL+   P          IF   N + G+ A +P        
Sbjct: 1017 MVTATQFFVGDIATSFAQCSLIPGAPS---------IFLYSNFMGGLSALIPLQSQNDID 1067

Query: 956  FLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEIS 1015
            F + L+ ++R     +   NH  +RS       V  K+ +DGDL E +  L      EI+
Sbjct: 1068 FYQHLEMHMRVHWTNLTDRNHISFRS-----SIVPVKDTVDGDLCELYERLPYEIQQEIA 1122

Query: 1016 KTMNVSVEELCKRVEEL 1032
            + M   V E+ K++ +L
Sbjct: 1123 EEMEKEVNEIIKKLHDL 1139


>gi|308459872|ref|XP_003092248.1| CRE-CPSF-1 protein [Caenorhabditis remanei]
 gi|308253976|gb|EFO97928.1| CRE-CPSF-1 protein [Caenorhabditis remanei]
          Length = 1448

 Score = 90.9 bits (224), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 84/312 (26%), Positives = 146/312 (46%), Gaps = 29/312 (9%)

Query: 739  VLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWML 798
            V+PE ++PT  R         K++++ +KE KG V  L A NG LL+ + QK+  + W  
Sbjct: 1147 VVPEPDQPTSNR---------KIKVLFDKEQKGPVTGLCAINGLLLSGMGQKV--FIWQF 1195

Query: 799  RDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARD--Y 856
            +D+    L S    H ++  L+  +     +  D  +S+SL+ ++ E  A+   +RD   
Sbjct: 1196 KDNDLMGL-SFLDMHYYVYQLH--SLRTIALACDARESMSLIRFQEENKAMSIASRDDRR 1252

Query: 857  NANWMSAVEILDDDIYLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
             A    A + + D  +LG   ++ N N+     + E        RL V    ++G  VN 
Sbjct: 1253 TAKPPMAAQFVVDGAHLGFLLSDENGNITLFNYSPEAPESNGGERLTVRAAMNIGTNVNA 1312

Query: 914  FR----HGSLVMRLPDSD---VGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRK 966
            F     H SL+    D +   V Q  + IFG+++G  G +  L  + Y  L  LQT +  
Sbjct: 1313 FLRVKGHTSLLNLQSDEEKESVEQRMSTIFGSLDGSFGFVRPLSEKSYRRLHFLQTFIGS 1372

Query: 967  VIKGVGGLNHEQWRSFNNEKKTV---DAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVE 1023
            V   + GL+ +  RS    +  V   +A+N +DGD++E +L LS     ++++ + V   
Sbjct: 1373 VTPQIAGLHIKGARSARPAQPIVNGRNARNLIDGDVVEQYLHLSLYDKTDLARRLGVGRY 1432

Query: 1024 ELCKRVEELTRL 1035
             +   +  L R+
Sbjct: 1433 HIIDDLMHLRRM 1444



 Score = 70.5 bits (171), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 119/527 (22%), Positives = 207/527 (39%), Gaps = 103/527 (19%)

Query: 72  DPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEEL-----QVLDIKFLYGCAKPTIV 126
           DP  R     +Y     ++PF    +   ++ I L+++      V D+ FL G  +PTI+
Sbjct: 145 DPSNRCAASLVYGKHIAILPFHENSKRILSYIIPLKQIDPRLDNVADMVFLDGYYEPTIL 204

Query: 127 VLYQD----------NKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCG 176
            LY+             D   +    V + D+ F    W   NL      L+P+P PL G
Sbjct: 205 FLYEPLQTTPGRACVRYDTMCIMGVSVNIVDRQFAV-VWQTANLPMDCTSLLPIPKPLGG 263

Query: 177 VLIIGEETIVYCSA-------------NAFKAIPIRPSITKAYGRVDADGSRYL------ 217
            L+ G  TIVY +              + F   P++  +      +D   S Y+      
Sbjct: 264 ALVFGSNTIVYLNQAVPPCGVVLNSCYDGFTKFPLK-DMKHLKMTLDCATSVYMEDGRIA 322

Query: 218 LGDHAGLLHL--LVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIK 275
           +G   G+L+L  LV +     V  ++   + ETSIA  ++      ++IGS  GDSQL++
Sbjct: 323 VGGRDGVLYLLRLVTSSGGATVKSMEFSRVWETSIAYCLTVCAPGHLFIGSRLGDSQLVE 382

Query: 276 LNLQPDAK-------------------------GSYVEV-----------------LERY 293
             L    K                         G  +E+                 L+R 
Sbjct: 383 YTLLKMTKESAKRQKIEKDPGEIELDEDDMELYGGAIEMQLNDDEEQILESLEFRELDRL 442

Query: 294 VNLGPIVDFC----------VVDLERQGQ-GQVVTCSGAYKDGSLRIVRNGIGINEQASV 342
            N+GP+   C          + +++R+     +VT SG  K+G+L + +  +      S 
Sbjct: 443 RNVGPVKSMCFGRPNYMSSDLAEMKRRDPVFDLVTASGHGKNGALCVHQRSLRPEIITSS 502

Query: 343 ELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCH 402
            L+G + +W++    ++    +L+VS +  T +L +  E    E ++  F +   T+   
Sbjct: 503 ILEGAEQLWAVGRKENESH-KYLIVSRVRSTLVLELGEELVELEEQL--FVTNEPTVAAG 559

Query: 403 DAIYNQL-VQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYL 461
           +     L VQVTS  + LV  T  +   E      + V  A+     V + T  G  +  
Sbjct: 560 ELSQGALAVQVTSTCIALV--TDGQQMQEVHIDSNFPVVQASIQDPYVAVLTQNGRPLLY 617

Query: 462 EIGDGILTEVKHAQL-EYEISCLDINPIGENPSYSQIAAVGMWTDIS 507
           E     L    + QL E +IS         + + +Q+ ++ ++TD S
Sbjct: 618 E-----LVMAPYVQLREIDISQTPFATFIAHNAATQLTSISVYTDAS 659


>gi|28175094|gb|AAH31197.2| Sf3b3 protein, partial [Mus musculus]
          Length = 494

 Score = 90.5 bits (223), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 88/349 (25%), Positives = 161/349 (46%), Gaps = 45/349 (12%)

Query: 710  LDTFEYGCSILSCSFSDDSNVYYC-VGTAYVLPEENEPTKGRILVF--IVEDG-KLQLIA 765
            L+  E   S+  C FS+    +Y  VG A  L        G  +    +V +G KL+ + 
Sbjct: 159  LEQNEAAFSVAVCRFSNTGEDWYVLVGVAKDLILSPRSVAGGFVYTYKLVNNGEKLEFLH 218

Query: 766  EKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYV---Q 822
            +   +    ++  F G++L  + + +++Y     D G ++L  +C  + HI A Y+   Q
Sbjct: 219  KTPVEEVPAAIAPFQGRVLIGVGKLLRVY-----DLGKKKLLRKC-ENKHI-ANYISGIQ 271

Query: 823  TRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLF 882
            T G  ++V D+ +S   + YK  E  +   A D    W++   +LD D   GA+   N+ 
Sbjct: 272  TIGHRVIVSDVQESFIWVRYKRNENQLIIFADDTYPRWVTTASLLDYDTVAGADKFGNIC 331

Query: 883  TVR--KNSEGATDE---------ERGRL-------EVVGEYHLGEFVNRFRHGSLVMRLP 924
             VR   N+    DE         +RG L       EV+  YH+GE V   +  +L+    
Sbjct: 332  VVRLPPNTNDEVDEDPTGNKALWDRGLLNGASQKAEVIMNYHVGETVLSLQKTTLIP--- 388

Query: 925  DSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFN 983
                G   ++++ T++G IG++     HE + F + ++ +LR     + G +H  +RS+ 
Sbjct: 389  ----GGSESLVYTTLSGGIGILVPFTSHEDHDFFQHVEMHLRSEHPPLCGRDHLSFRSY- 443

Query: 984  NEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
                    KN +DGDL E F  +   +   +S+ ++ +  E+ K++E++
Sbjct: 444  ----YFPVKNVIDGDLCEQFNSMEPNKQKNVSEELDRTPPEVSKKLEDI 488


>gi|193785523|dbj|BAG50889.1| unnamed protein product [Homo sapiens]
          Length = 503

 Score = 90.5 bits (223), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 88/349 (25%), Positives = 161/349 (46%), Gaps = 45/349 (12%)

Query: 710  LDTFEYGCSILSCSFSDDSNVYYC-VGTAYVLPEENEPTKGRILVF--IVEDG-KLQLIA 765
            L+  E   S+  C FS+    +Y  VG A  L        G  +    +V +G KL+ + 
Sbjct: 168  LEQNEAAFSVAVCRFSNTGEDWYVLVGVAKDLILNPRSVAGGFVYTYKLVNNGEKLEFLH 227

Query: 766  EKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYV---Q 822
            +   +    ++  F G++L  + + +++Y     D G ++L  +C  + HI A Y+   Q
Sbjct: 228  KTPVEEVPAAIAPFQGRVLIGVGKLLRVY-----DLGKKKLLRKC-ENKHI-ANYISGIQ 280

Query: 823  TRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLF 882
            T G  ++V D+ +S   + YK  E  +   A D    W++   +LD D   GA+   N+ 
Sbjct: 281  TIGHRVIVSDVQESFIWVRYKRNENQLIIFADDTYPRWVTTASLLDYDTVAGADKFGNIC 340

Query: 883  TVR--KNSEGATDE---------ERGRL-------EVVGEYHLGEFVNRFRHGSLVMRLP 924
             VR   N+    DE         +RG L       EV+  YH+GE V   +  +L+    
Sbjct: 341  VVRLPPNTNDEVDEDPTGNKALWDRGLLNGASQKAEVIMNYHVGETVLSLQKTTLI---- 396

Query: 925  DSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFN 983
                G   ++++ T++G IG++     HE + F + ++ +LR     + G +H  +RS+ 
Sbjct: 397  ---PGGSESLVYTTLSGGIGILVPFTSHEDHDFFQHVEMHLRSEHPPLCGRDHLSFRSY- 452

Query: 984  NEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
                    KN +DGDL E F  +   +   +S+ ++ +  E+ K++E++
Sbjct: 453  ----YFPVKNVIDGDLCEQFNSMEPNKQKNVSEELDRTPPEVSKKLEDI 497


>gi|75145059|sp|Q7XWP1.2|CPSF1_ORYSJ RecName: Full=Probable cleavage and polyadenylation specificity
            factor subunit 1; AltName: Full=Cleavage and
            polyadenylation specificity factor 160 kDa subunit;
            Short=CPSF 160 kDa subunit
 gi|38345987|emb|CAD39979.2| OSJNBa0032B23.5 [Oryza sativa Japonica Group]
          Length = 1441

 Score = 90.5 bits (223), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 87/353 (24%), Positives = 160/353 (45%), Gaps = 33/353 (9%)

Query: 702  FEFISTYPLDTFEYGCSI----LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVF--I 755
            +E  ST P+  FE   ++    L  + + ++     +GTAYVL  E+   +GR+L+F   
Sbjct: 1095 WETKSTIPMQLFENALTVRIVTLHNTTTKENETLLAIGTAYVL-GEDVAARGRVLLFSFT 1153

Query: 756  VEDGKLQLIAE---KETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGH 812
              +    L+ E   KE+KGAV ++ +  G LL A   KI L KW        EL +   +
Sbjct: 1154 KSENSQNLVTEVYSKESKGAVSAVASLQGHLLIASGPKITLNKWT-----GAELTAVAFY 1208

Query: 813  HGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIY 872
               +  + +    +F++ GD+ KSI  L +K +   +   A+D+ +    A E L D   
Sbjct: 1209 DAPLHVVSLNIVKNFVLFGDIHKSIYFLSWKEQGSQLSLLAKDFGSLDCFATEFLIDGST 1268

Query: 873  LG-----AENNFNLFTVRKNSEGATDEERG-RLEVVGEYHLGEFVNRFRHGSLVMRLPDS 926
            L      ++ N  +F     +    +  +G +L    E+H+G  + +F     +  LP  
Sbjct: 1269 LSLVASDSDKNVQIFYY---APKMVESWKGQKLLSRAEFHVGAHITKFLR---LQMLPTQ 1322

Query: 927  DVGQIPT----VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSF 982
             +    T    ++FG ++G IG IA +    +  L+ LQ  L   +  V GLN   +R F
Sbjct: 1323 GLSSEKTNRFALLFGNLDGGIGCIAPIDELTFRRLQSLQRKLVDAVPHVCGLNPRSFRQF 1382

Query: 983  NNEKK--TVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELT 1033
            ++  K       N +D +L+ S+  LS     ++++ +  +  ++     +++
Sbjct: 1383 HSNGKGHRPGPDNIIDFELLCSYEMLSLDEQLDVAQQIGTTRSQILSNFSDIS 1435



 Score = 66.2 bits (160), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 90/393 (22%), Positives = 160/393 (40%), Gaps = 109/393 (27%)

Query: 110 QVLDIKFLYGCAKPTIVVLYQDNKD------ARH----VKTYEVALKDKDFVEGP--WSQ 157
            V D  F++G  +P +V+L++          ++H    +  + +++  K   + P  WS 
Sbjct: 250 HVKDFAFVHGYIEPVLVILHEQEPTWAGRILSKHHTCMISAFSISMTLK---QHPVIWSA 306

Query: 158 NNLDNGADLLIPVPPPLCGVLIIGEETIVY------CSA--NAFKAIPI-RPSITKAYGR 208
            NL + A  L+ VPPP+ GVL+I   +I Y      CS   N F + P   P I+K+  +
Sbjct: 307 ANLPHDAYQLLAVPPPISGVLVICANSIHYHSQSTSCSLDLNNFSSHPDGSPEISKSNFQ 366

Query: 209 VDADGSR--------YLLGDHAGLLHLLVITHEKEKVTGL-----KIELLGE--TSIAST 253
           V+ D ++         +    AG + LL + ++   V  L     K  +L    TSI ++
Sbjct: 367 VELDAAKATWLSNDIVMFSTKAGEMLLLTVVYDGRVVQRLDLMKSKASVLSSAVTSIGNS 426

Query: 254 ISYL-----DNAVVYIGSSYGDSQLIKL-------------------------------- 276
             +L     D+ +V        S L  L                                
Sbjct: 427 FFFLGSRLGDSLLVQFSYCASKSVLQDLTNERSADIEGDLPFSKRLKRIPSDVLQDVTSV 486

Query: 277 ------------NLQPDAKGSYVEVLERYVNLGPIVDFC----------VVDLERQGQGQ 314
                       +L+   K SY+ V +  +N+GP+ DF            +   +Q   +
Sbjct: 487 EELSFQNIIAPNSLESAQKISYI-VRDALINVGPLKDFSYGLRANADPNAMGNAKQSNYE 545

Query: 315 VVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL-------RSSTDDPFDTFLVV 367
           +V CSG  K+GSL +++  I  +    VEL   +G+W++       + + D+ +  +L++
Sbjct: 546 LVCCSGHGKNGSLSVLQQSIRPDLITEVELPSCRGIWTVYYKSYRGQMAEDNEYHAYLII 605

Query: 368 SFISETRILAMNLEDELEE-TEIEGFCSQTQTL 399
           S   E R + +   D+L E TE   +  Q  T+
Sbjct: 606 SL--ENRTMVLETGDDLGEVTETVDYFVQASTI 636


>gi|26351753|dbj|BAC39513.1| unnamed protein product [Mus musculus]
          Length = 463

 Score = 90.5 bits (223), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 88/349 (25%), Positives = 161/349 (46%), Gaps = 45/349 (12%)

Query: 710  LDTFEYGCSILSCSFSDDSNVYYC-VGTAYVLPEENEPTKGRILVF--IVEDG-KLQLIA 765
            L+  E   S+  C FS+    +Y  VG A  L        G  +    +V +G KL+ + 
Sbjct: 128  LEQNEAAFSVAVCRFSNTGEDWYVLVGVAKDLILSPRSVAGGFVYTYKLVNNGEKLEFLH 187

Query: 766  EKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYV---Q 822
            +   +    ++  F G++L  + + +++Y     D G ++L  +C  + HI A Y+   Q
Sbjct: 188  KTPVEEVPAAIAPFQGRVLIGVGKLLRVY-----DLGKKKLLRKC-ENKHI-ANYISGIQ 240

Query: 823  TRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLF 882
            T G  ++V D+ +S   + YK  E  +   A D    W++   +LD D   GA+   N+ 
Sbjct: 241  TIGHRVIVSDVQESFIWVRYKRNENQLIIFADDTYPRWVTTASLLDYDTVAGADKFGNIC 300

Query: 883  TVR--KNSEGATDE---------ERGRL-------EVVGEYHLGEFVNRFRHGSLVMRLP 924
             VR   N+    DE         +RG L       EV+  YH+GE V   +  +L+    
Sbjct: 301  VVRLPPNTNDEVDEDPTGNKALWDRGLLNGASQKAEVIMNYHVGETVLSLQKTTLIP--- 357

Query: 925  DSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFN 983
                G   ++++ T++G IG++     HE + F + ++ +LR     + G +H  +RS+ 
Sbjct: 358  ----GGSESLVYTTLSGGIGILVPFTSHEDHDFFQHVEMHLRSEHPPLCGRDHLSFRSY- 412

Query: 984  NEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
                    KN +DGDL E F  +   +   +S+ ++ +  E+ K++E++
Sbjct: 413  ----YFPVKNVIDGDLCEQFNSMEPNKQKNVSEELDRTPPEVSKKLEDI 457


>gi|391328522|ref|XP_003738737.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Metaseiulus occidentalis]
          Length = 1500

 Score = 90.1 bits (222), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 97/426 (22%), Positives = 178/426 (41%), Gaps = 60/426 (14%)

Query: 654  IRSIPLGEHPRRICHQEQSRTFAICSLKNQSCA----------------EESEMHFVRLL 697
            +R IP+ E P  + +   SRT+ + + K ++                   ES       +
Sbjct: 1067 MRKIPIYETPHSVNYHVDSRTYCVATSKEETATCVPKLANEDKEFEPIERESSRFIPPTV 1126

Query: 698  DDQTFEFIST----------YPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPE 742
            D    E  S            P++ +E    + +   + +           VGT +   E
Sbjct: 1127 DKFALELWSPVSWEAIPNTRMPMEDWEKITCVKNVMIASEGTTSGEKGLIAVGTIHNFGE 1186

Query: 743  ENEPTKGRILVF-----IVEDG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKI 791
            +    KGRIL+      + E G      K++ I  K     V +L +  G L+AA+ QK+
Sbjct: 1187 DIT-AKGRILLIDIIEVVPEPGQPLTRSKVKTILSKPQNAPVTALCSVKGHLMAAVGQKL 1245

Query: 792  QLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEE 851
             L++  L+D+   +L         I  L   +   FI++GD+ KSI+LL Y+ E   +  
Sbjct: 1246 FLFQ--LKDN---DLVGMAFLDTQIYILSAISIKSFILIGDVHKSITLLRYQEESKTLAV 1300

Query: 852  RARDYNANWMSAVEILDDD---IYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLG 908
             ++D     + ++E L D+    +L  +   N+       E        RL   G++++G
Sbjct: 1301 VSKDTKPVQIYSIEYLVDNSQMAFLATDAQCNILVYMYQPENRETFGGQRLIRRGDFNIG 1360

Query: 909  EFVNR-FRHGSLVMRLPDS------DVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQ 961
              +N  FR    +  +P S      D+      ++ +++G  G +  +  + Y  L  LQ
Sbjct: 1361 SRINTMFRIRCRLAEVPRSERRLLSDLEARHVTLYASLDGAFGYLLPISEKTYRRLLMLQ 1420

Query: 962  TNLRKVIKGVGGLNHEQWRSFNNEKKTVD--AKNFLDGDLIESFLDLSRTRMDEISKTMN 1019
              L    + VGGLN + +R    + + +    KN +DGDLI  F+DL+     E+++ + 
Sbjct: 1421 NVLNSYCQHVGGLNPKAFRIMQTDVRALSNPQKNIVDGDLINVFMDLNFNEKAEVARKIG 1480

Query: 1020 VSVEEL 1025
             +V ++
Sbjct: 1481 TTVHQI 1486



 Score = 56.6 bits (135), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 70/329 (21%), Positives = 134/329 (40%), Gaps = 63/329 (19%)

Query: 7   YGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDNG 66
           +G I+ +E  R +      LF+  E  K  ++++D  + EL T A+  +     +     
Sbjct: 66  FGNISGMEKIRLNASRDSLLFVFKET-KISLVEYDPATHELQTLAIRSLEKEEYKEGFYN 124

Query: 67  QIG----IIDPDCRLIGLHLYDGLFKVIPF-------------------DNKGQLKEAFN 103
            +G     +DP  R   + +Y     +IPF                    N     E + 
Sbjct: 125 FVGNTLIKVDPLNRCAAVLIYGKHLAIIPFVKKDATDLSDPIASSKSTQTNTSGFLEYYT 184

Query: 104 IRLEEL-------QVLDIKFLYGCAKPTIVVLYQD----------NKDARHVKTYEVALK 146
           IRL +L        + D+ FL G  +PT+++LY+            +D   +    + + 
Sbjct: 185 IRLIDLDEEKGVNNIHDMTFLNGYYEPTLLLLYEPIRTWTGRVAIRQDTCSIMALSLNVY 244

Query: 147 DKDFVEGP-WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVY-------------CSANA 192
            +  V  P WS + L   +  ++PVP P+ GVLI+    ++Y             C    
Sbjct: 245 QR--VHPPVWSFSGLPFNSFKVLPVPKPIGGVLILSVNALLYLNQSVPAYGVSLNCFTEC 302

Query: 193 FKAIPIR----PSITKAYGRVD-ADGSRYLLGDHAGLLHLL-VITHEKEKVTGLKIELLG 246
             + P++    P +T    R +    ++ LL    G L++L + T     +   + + + 
Sbjct: 303 STSFPLKDQAGPPLTLDCCRCEFLSETKILLSVANGDLYVLSLFTDGMRSINQFEFKKIA 362

Query: 247 ETSIASTISYLDNAVVYIGSSYGDSQLIK 275
            T++A+ IS  +   +++GS  G+S L++
Sbjct: 363 TTTVATCISLCEPGYLFVGSRIGNSLLLR 391



 Score = 52.0 bits (123), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 41/162 (25%), Positives = 76/162 (46%), Gaps = 20/162 (12%)

Query: 277 NLQPDAKGSY---VEVLERYVNLGPIVDFCV--------VDLERQGQGQVVTCSGAYKDG 325
           N + D  G+    +EV +R   LGP+    +        + ++   + ++    G  + G
Sbjct: 526 NCEADCLGTTCYKLEVTDRLQTLGPVTKVAMGEPAFQSELSMKTDTEVEIFAACGHERSG 585

Query: 326 SLRIVRNGIGINEQASVELQGIKGMWSLRSST------DDPFDTFLVVSFISETRILAMN 379
           +L +++  +      + EL G   +W++RSS+      D+    FL++S    T IL   
Sbjct: 586 ALCVLQRTVRPQVITTFELPGCTDLWTVRSSSTRSPDVDEDSHQFLILSRPDSTMILQTG 645

Query: 380 LEDELEETEIEGFCSQTQTLFCHD-AIYNQLVQVTSGSVRLV 420
              E+ E +  GFC+Q+ T+F  + A    ++QV   SVRL+
Sbjct: 646 --QEINELDHSGFCTQSPTIFAGNLADGRYIIQVCPNSVRLL 685


>gi|285999|dbj|BAA02805.1| KIAA0017 [Homo sapiens]
          Length = 399

 Score = 90.1 bits (222), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 88/349 (25%), Positives = 161/349 (46%), Gaps = 45/349 (12%)

Query: 710  LDTFEYGCSILSCSFSDDSNVYYC-VGTAYVLPEENEPTKGRILVF--IVEDG-KLQLIA 765
            L+  E   S+  C FS+    +Y  VG A  L        G  +    +V +G KL+ + 
Sbjct: 64   LEQNEAAFSVAVCRFSNTGEDWYVLVGVAKDLILNPRSVAGGFVYTYKLVNNGEKLEFLH 123

Query: 766  EKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYV---Q 822
            +   +    ++  F G++L  + + +++Y     D G ++L  +C  + HI A Y+   Q
Sbjct: 124  KTPVEEVPAAIAPFQGRVLIGVGKLLRVY-----DLGKKKLLRKC-ENKHI-ANYISGIQ 176

Query: 823  TRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLF 882
            T G  ++V D+ +S   + YK  E  +   A D    W++   +LD D   GA+   N+ 
Sbjct: 177  TIGHRVIVSDVQESFIWVRYKRNENQLIIFADDTYPRWVTTASLLDYDTVAGADKFGNIC 236

Query: 883  TVR--KNSEGATDE---------ERGRL-------EVVGEYHLGEFVNRFRHGSLVMRLP 924
             VR   N+    DE         +RG L       EV+  YH+GE V   +  +L+    
Sbjct: 237  VVRLPPNTNDEVDEDPTGNKALWDRGLLNGASQKAEVIMNYHVGETVLSLQKTTLIP--- 293

Query: 925  DSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFN 983
                G   ++++ T++G IG++     HE + F + ++ +LR     + G +H  +RS+ 
Sbjct: 294  ----GGSESLVYTTLSGGIGILVPFTSHEDHDFFQHVEMHLRSEHPPLCGRDHLSFRSY- 348

Query: 984  NEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
                    KN +DGDL E F  +   +   +S+ ++ +  E+ K++E++
Sbjct: 349  ----YFPVKNVIDGDLCEQFNSMEPNKQKNVSEELDRTPPEVSKKLEDI 393


>gi|326511015|dbj|BAJ91855.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 706

 Score = 90.1 bits (222), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 88/366 (24%), Positives = 159/366 (43%), Gaps = 46/366 (12%)

Query: 694  VRLLDDQTFEFISTYPLDTFEYGCSILSCSFSD-DSNVYYCVGTAYVLPEENEPTKG--- 749
            +R+LD ++ +      L   E   SI + +F D +      VGT   L  +  P +    
Sbjct: 355  IRILDPRSRDTTCLLELQDNEAAVSICTVNFHDKEHGTLLAVGTTKGL--QFWPKRSLAS 412

Query: 750  ---RILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTREL 806
                I  F+ E   L+L+ + + +    SL  F G+LLA +   ++LY     D G R+L
Sbjct: 413  GFIHIYKFVNEGKSLELLHKTQVEEVPLSLCQFQGRLLAGVGSVLRLY-----DLGKRKL 467

Query: 807  QSECGHHGHILALY-VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVE 865
              +C +      +  + T  D I VGD+ +S     Y+ +E  +   A D    W++A  
Sbjct: 468  LRKCENKLFPRTIVSIHTYRDRIYVGDMQESFHYCKYRRDENQLYIFADDSVPRWLTAAN 527

Query: 866  ILDDDIYLGAENNFNLFTVR------------------KNSEGATDEERGRLEVVGEYHL 907
             +D D   GA+   N++  R                  K  +G  +    ++E + ++H+
Sbjct: 528  HIDFDTMAGADKFGNIYFARLPQDLSDEIEEDPTGGKIKWEQGKLNGAPNKVEEIVQFHV 587

Query: 908  GEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIG-VIASLPHEQYLFLEKLQTNLRK 966
            G+ V   +  SL+        G    +I+GTV G +G ++A    E   F   L+ +LR+
Sbjct: 588  GDVVTCLQKASLI-------PGGGECLIYGTVMGSVGALLAFTSREDVDFFSHLEMHLRQ 640

Query: 967  VIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELC 1026
                + G +H  +RS          K+ +DGDL E +  L      +I+  ++ +  E+ 
Sbjct: 641  EHPPLCGRDHMAYRS-----AYFPVKDVIDGDLCEQYPSLPADMQRKIADELDRTPGEIL 695

Query: 1027 KRVEEL 1032
            K++E++
Sbjct: 696  KKLEDI 701



 Score = 83.6 bits (205), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 80/300 (26%), Positives = 131/300 (43%), Gaps = 19/300 (6%)

Query: 407 NQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-D 465
           + L+QV    +R +    R   NEW++P   ++    +N  QV++A  GG L+Y E+   
Sbjct: 2   DSLMQVHPNGIRHIREDGR--VNEWRTPGKKTITKVGSNRLQVVIALSGGELIYFEMDMT 59

Query: 466 GILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHL 524
           G L EV+   +  +++CL I P+ E    S+  AVG + D ++RI SL PD  L      
Sbjct: 60  GQLMEVEKQDMSGDVACLAIAPVPEGRQRSRFLAVGSY-DNTIRILSLDPDDCLQPLSVQ 118

Query: 525 GGEIIPRSVLLCAFE-------GISY-----LLCALGDGHLLNFLLNMKTGELTDRKKVS 572
                P S+L    +       G  Y     L   L +G L    ++M TG+L+D +   
Sbjct: 119 SVSSAPESLLFLEVQASVGGEDGADYPANLFLNAGLQNGVLFRTNVDMVTGQLSDTRSRF 178

Query: 573 LGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDS 632
           LG +P  L      +   +   S RP + Y      L + ++   +     F+S    + 
Sbjct: 179 LGLRPPKLFPCIVSHRQAMLCLSSRPWLGYIHQGHFLLTPLSCDTLESAASFSSDQCSEG 238

Query: 633 LAIAKEGELTIGTIDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAIC-SLKNQSCAEESE 690
           +       L I TI+ + +  +  SIPL   PR+     + +  A+  S K    AE+ E
Sbjct: 239 VVAVAGDALRIFTIERLGETFNETSIPLRYTPRKFVILPKKKYLAVIESDKGAFSAEQRE 298


>gi|449437538|ref|XP_004136549.1| PREDICTED: pre-mRNA-splicing factor RSE1-like [Cucumis sativus]
          Length = 1376

 Score = 89.7 bits (221), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 61/210 (29%), Positives = 107/210 (50%), Gaps = 22/210 (10%)

Query: 295 NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINE--QASVELQGIKGMWS 352
           N+ PI+D  VVD   + Q Q+  C G   +GSLRI+RNGI +    + S   QGI  +W+
Sbjct: 458 NIAPILDMSVVDKHDEKQDQMFACCGMAPEGSLRIIRNGISVENLLRTSPIYQGITSIWT 517

Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
           ++    D + ++LV+SF+ ETR+L++ L   ++ T+  GF S T TL C       ++Q+
Sbjct: 518 IKMKRSDTYHSYLVLSFVEETRVLSVGLS-FIDVTDSVGFQSDTCTLACGLLDDGLVIQI 576

Query: 413 TSGSVRLVSST----------SRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLE 462
              +VRL   T          S      W  P    +++     + ++++T     +++ 
Sbjct: 577 HQNAVRLCLPTKIAHSEGIELSSPACTSW-FPDNIGISLGAVGHNVIVVSTSNPCFLFI- 634

Query: 463 IG-------DGILTEVKHAQLEYEISCLDI 485
           +G       D  + E ++ +L+YE+SC+ I
Sbjct: 635 LGVRKVSGYDYEIYEKQYLRLQYELSCISI 664



 Score = 52.8 bits (125), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 47/201 (23%), Positives = 85/201 (42%), Gaps = 20/201 (9%)

Query: 828  IVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAEN--NFNLFTVR 885
            I VGD    I    Y+ +   +E+   D +   ++   +LD D  + ++   +  + +  
Sbjct: 1139 IAVGDCRDGILFFSYQEDAKKLEQIYSDPSQRLVADCTLLDVDTAVVSDRKGSIAILSCS 1198

Query: 886  KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV---GQIP---------T 933
               E     E   L +   Y++GE     R GS   +LP  D+     +P         T
Sbjct: 1199 DRLEDNASPE-CNLTLNCAYYMGEIAMTLRKGSFSYKLPADDLLRGCAVPGSDFDSSHNT 1257

Query: 934  VIFGTVNGVIGVIASLPHEQYLFLEKLQTNL--RKVIKGVGGLNHEQWRSFNNEKKTVDA 991
            +I  T+ G I +   L  ++Y  LE +Q  L    +   + G +H ++RS  N    +  
Sbjct: 1258 IIASTLLGSIVIFTPLSRDEYELLEAVQAKLAVHPLTSPILGNDHYEYRSRENP---IGV 1314

Query: 992  KNFLDGDLIESFLDLSRTRMD 1012
               LDGD++  FL+L+  + +
Sbjct: 1315 PKILDGDILTQFLELTSMQQE 1335


>gi|449526686|ref|XP_004170344.1| PREDICTED: uncharacterized protein LOC101227016, partial [Cucumis
           sativus]
          Length = 997

 Score = 89.7 bits (221), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 66/236 (27%), Positives = 118/236 (50%), Gaps = 28/236 (11%)

Query: 269 GDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLR 328
           GD  ++KL    + +  Y   ++   N+ PI+D  VVD   + Q Q+  C G   +GSLR
Sbjct: 438 GDGMVLKLE---NGRLIYANPIQ---NIAPILDMSVVDKHDEKQDQMFACCGMAPEGSLR 491

Query: 329 IVRNGIGINE--QASVELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEE 386
           I+RNGI +    + S   QGI  +W+++    D + ++LV+SF+ ETR+L++ L   ++ 
Sbjct: 492 IIRNGISVENLLRTSPIYQGITSIWTIKMKRSDTYHSYLVLSFVEETRVLSVGLS-FIDV 550

Query: 387 TEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSST----------SRELRNEWKSPPG 436
           T+  GF S T TL C       ++Q+   +VRL   T          S      W  P  
Sbjct: 551 TDSVGFQSDTCTLACGLLDDGLVIQIHQNAVRLCLPTKIAHSEGIELSSPACTSW-FPDN 609

Query: 437 YSVNVATANASQVLLATGGGHLVYLEIG-------DGILTEVKHAQLEYEISCLDI 485
             +++     + ++++T     +++ +G       D  + E ++ +L+YE+SC+ I
Sbjct: 610 IGISLGAVGHNVIVVSTSNPCFLFI-LGVRKVSGYDYEIYEKQYLRLQYELSCISI 664


>gi|53136520|emb|CAG32589.1| hypothetical protein RCJMB04_30d19 [Gallus gallus]
          Length = 503

 Score = 89.4 bits (220), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 85/349 (24%), Positives = 162/349 (46%), Gaps = 45/349 (12%)

Query: 710  LDTFEYGCSILSCSFSDDSNVYYC-VGTAY-VLPEENEPTKGRILVFIVEDG--KLQLIA 765
            L+  E   S+  C FS+    +Y  VG A  ++        G +  + + +G  KL+ + 
Sbjct: 168  LEQNEAAFSVAVCRFSNTGEEWYVLVGVAKDLILNPRSVAGGFVYTYKLVNGGEKLEFLH 227

Query: 766  EKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYV---Q 822
            +   +    ++  F G++L  + + +++Y     D G ++L  +C  + HI A Y+   Q
Sbjct: 228  KTPVEEVPAAIAPFQGRVLIGVGKLLRVY-----DLGKKKLLRKC-ENKHI-ANYICGIQ 280

Query: 823  TRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLF 882
            T G  ++V D+ +S   + YK  E  +   A D    W++   +LD D   GA+   N+ 
Sbjct: 281  TIGHRVIVSDVQESFIWVRYKRNENQLIIFADDTYPRWVTTATLLDYDTVAGADKFGNIC 340

Query: 883  TVR--KNSEGATDE---------ERGRL-------EVVGEYHLGEFVNRFRHGSLVMRLP 924
             VR   N+    DE         +RG L       EV+  YH+GE V   +  +L+    
Sbjct: 341  VVRLPPNTNDEVDEDPTGNKALWDRGLLNGASQKAEVIMNYHVGETVLSLQKTTLI---- 396

Query: 925  DSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFN 983
                G   ++++ T++G IG++     HE + F + ++ +LR     + G +H  +RS+ 
Sbjct: 397  ---PGGSESLVYTTLSGGIGILVPFTSHEDHDFFQHVEMHLRSEHPPLCGRDHLSFRSY- 452

Query: 984  NEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
                    KN +DGDL E F  +   +   +++ ++ +  E+ K++E++
Sbjct: 453  ----YFPVKNVIDGDLCEQFNSMEPNKQKNVAEELDRTPPEVSKKLEDI 497


>gi|222628488|gb|EEE60620.1| hypothetical protein OsJ_14038 [Oryza sativa Japonica Group]
          Length = 1441

 Score = 89.4 bits (220), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 86/353 (24%), Positives = 159/353 (45%), Gaps = 33/353 (9%)

Query: 702  FEFISTYPLDTFEYGCSI----LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVF--I 755
            +E  ST P+  FE   ++    L  + + ++     +GTAYVL  E+   +GR+L+F   
Sbjct: 1095 WETKSTIPMQLFENALTVRIVTLHNTTTKENETLLAIGTAYVL-GEDVAARGRVLLFSFT 1153

Query: 756  VEDGKLQLIAE---KETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGH 812
              +    L+ E   KE+KGAV ++ +  G LL A   KI L KW        EL +   +
Sbjct: 1154 KSENSQNLVTEVYSKESKGAVSAVASLQGHLLIASGPKITLNKWT-----GAELTAVAFY 1208

Query: 813  HGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIY 872
               +  + +    +F++ GD+ KSI  L +K +   +   A+D+ +    A E L D   
Sbjct: 1209 DAPLHVVSLNIVKNFVLFGDIHKSIYFLSWKEQGSQLSLLAKDFGSLDCFATEFLIDGST 1268

Query: 873  LG-----AENNFNLFTVRKNSEGATDEERG-RLEVVGEYHLGEFVNRFRHGSLVMRLPDS 926
            L      ++ N  +F     +    +  +G +L    E+H+G  + +F     +  LP  
Sbjct: 1269 LSLVASDSDKNVQIFYY---APKMVESWKGQKLLSRAEFHVGAHITKFLR---LQMLPTQ 1322

Query: 927  DVGQIPT----VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSF 982
             +    T    ++FG ++G IG IA +    +  L+ LQ  L   +  V GLN   +R F
Sbjct: 1323 GLSSEKTNRFALLFGNLDGGIGCIAPIDELTFRRLQSLQRKLVDAVPHVCGLNPRSFRQF 1382

Query: 983  NNEKK--TVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELT 1033
            ++  K       N +D +L+  +  LS     ++++ +  +  ++     +++
Sbjct: 1383 HSNGKGHRPGPDNIIDFELLAHYEMLSLDEQLDVAQQIGTTRSQILSNFSDIS 1435



 Score = 66.2 bits (160), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 90/393 (22%), Positives = 160/393 (40%), Gaps = 109/393 (27%)

Query: 110 QVLDIKFLYGCAKPTIVVLYQDNKD------ARH----VKTYEVALKDKDFVEGP--WSQ 157
            V D  F++G  +P +V+L++          ++H    +  + +++  K   + P  WS 
Sbjct: 250 HVKDFAFVHGYIEPVLVILHEQEPTWAGRILSKHHTCMISAFSISMTLK---QHPVIWSA 306

Query: 158 NNLDNGADLLIPVPPPLCGVLIIGEETIVY------CSA--NAFKAIPI-RPSITKAYGR 208
            NL + A  L+ VPPP+ GVL+I   +I Y      CS   N F + P   P I+K+  +
Sbjct: 307 ANLPHDAYQLLAVPPPISGVLVICANSIHYHSQSTSCSLDLNNFSSHPDGSPEISKSNFQ 366

Query: 209 VDADGSR--------YLLGDHAGLLHLLVITHEKEKVTGL-----KIELLGE--TSIAST 253
           V+ D ++         +    AG + LL + ++   V  L     K  +L    TSI ++
Sbjct: 367 VELDAAKATWLSNDIVMFSTKAGEMLLLTVVYDGRVVQRLDLMKSKASVLSSAVTSIGNS 426

Query: 254 ISYL-----DNAVVYIGSSYGDSQLIKL-------------------------------- 276
             +L     D+ +V        S L  L                                
Sbjct: 427 FFFLGSRLGDSLLVQFSYCASKSVLQDLTNERSADIEGDLPFSKRLKRIPSDVLQDVTSV 486

Query: 277 ------------NLQPDAKGSYVEVLERYVNLGPIVDFC----------VVDLERQGQGQ 314
                       +L+   K SY+ V +  +N+GP+ DF            +   +Q   +
Sbjct: 487 EELSFQNIIAPNSLESAQKISYI-VRDALINVGPLKDFSYGLRANADPNAMGNAKQSNYE 545

Query: 315 VVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL-------RSSTDDPFDTFLVV 367
           +V CSG  K+GSL +++  I  +    VEL   +G+W++       + + D+ +  +L++
Sbjct: 546 LVCCSGHGKNGSLSVLQQSIRPDLITEVELPSCRGIWTVYYKSYRGQMAEDNEYHAYLII 605

Query: 368 SFISETRILAMNLEDELEE-TEIEGFCSQTQTL 399
           S   E R + +   D+L E TE   +  Q  T+
Sbjct: 606 SL--ENRTMVLETGDDLGEVTETVDYFVQASTI 636


>gi|196012166|ref|XP_002115946.1| hypothetical protein TRIADDRAFT_59883 [Trichoplax adhaerens]
 gi|190581722|gb|EDV21798.1| hypothetical protein TRIADDRAFT_59883 [Trichoplax adhaerens]
          Length = 1187

 Score = 89.4 bits (220), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 135/562 (24%), Positives = 234/562 (41%), Gaps = 98/562 (17%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRP--T 63
            YG+I  +E  R     +D LFI     K  ++++DA++S L T ++    D   +   +
Sbjct: 81  FYGKIYGIESVRFRHHHRDSLFICFADAKLSLVEYDADNSNLTTLSLHTFEDDELKNGFS 140

Query: 64  DNGQIGII--DPDCRLIGLHLYDGLFKVIPFDNKGQ-----------------LKEAFNI 104
            N  I II  DPD R   + + +    ++PF ++G                  L  ++ +
Sbjct: 141 RNLSIPIIRVDPDNRCAAMVVSNVHLAILPFRHRGPAEQQVQIDPKNTSGKYPLMPSYVV 200

Query: 105 RLEEL------QVLDIKFLYGCAKPTIVVL------YQDNKDARH--VKTYEVALKDKDF 150
            + +L      +++DI+FL G  +PTI++L      +      R        V+L   D 
Sbjct: 201 DVRDLGNEKVSRLIDIRFLEGYYEPTILILCEILRTWSGRVAVRQDTCSILAVSLNTIDK 260

Query: 151 VEGP-WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRV 209
           V    WS NNL       I VP P+ GVLI     +++ + +       +P   ++   +
Sbjct: 261 VHPVIWSLNNLPFDCLGAITVPRPIGGVLIFAANCLLHLNQS-------KPPYAESLNSI 313

Query: 210 DADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIE-LLGETSIASTISYLDNAVVYIGSSY 268
             + + + + D A L      T + ++ T  K+     +        Y  +       SY
Sbjct: 314 TDESTTFPMHD-ADLAPNTPETQDTDEPTSKKLRTDDEKEDEELEKLYSAHTSCTAKESY 372

Query: 269 GDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDL----ERQGQGQVVTCSGAYKD 324
                         +    EV +R +++GP     +  +    + +   +VV CSG  K+
Sbjct: 373 -------------LRSYTFEVCDRILHVGPCASIAIGQISTFVQEESDVEVVICSGHDKN 419

Query: 325 GSLRIVRNGIGINEQASVELQGIKGMWS-----LRSSTDDPFDT-----FLVVSFISETR 374
           G+L ++  GI     AS +L G   MW+     L    D  F+T     FL++S  + T 
Sbjct: 420 GALSVLNKGIKPQVVASYDLPGCVDMWTVKDIRLNDENDGDFETENTHKFLIISRDNLTM 479

Query: 375 ILAMNLEDELEETEIEGFCSQTQTLFCHDAIY-NQLVQVTSGSVRLVSSTSR----ELRN 429
           IL      E+ E E  GF +QT+T+F  +    N ++QVT   V LVS   +    EL N
Sbjct: 480 ILRTG--KEITEVEQLGFLTQTKTVFAGNLDNGNCIIQVTPYEVILVSKGEKIQQLELEN 537

Query: 430 EWKSPPGYSVNVATANASQVLLATGGG-HLVYLEIGDGILTEVKHAQLEYEISCLDINPI 488
           E  SP    V  +  +    LL  GG   ++  E+ D           E ++  ++  P+
Sbjct: 538 E--SP---IVFCSLQDPYISLLLEGGSIMMLAFELSDN---------GEKQVKLVNTTPL 583

Query: 489 GENPSYSQIAAVGMWTDISVRI 510
               ++S+IAA  ++ D + R+
Sbjct: 584 ----NHSRIAACCLFQDNNGRM 601



 Score = 63.9 bits (154), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 107/470 (22%), Positives = 185/470 (39%), Gaps = 111/470 (23%)

Query: 618  VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDIQKLH------IRSIPLGEHPRRICHQE 670
            V    PFN A  P+  L    E +L I  +D  Q+        +  +PL      I H  
Sbjct: 772  VKCFAPFNIANCPNGFLYFNSEEDLRICVLD--QRFTYDCPWPVHKVPLRNTLHFITHHF 829

Query: 671  QSRTFAICSLKNQSC-------AEESEM-------HFV---------RLLDDQTFEFI-- 705
             ++T+ I S     C        E+ E         F+         +L+  +T+E I  
Sbjct: 830  VTKTYVIISSTMTVCEKMPHITTEDKEFIPVEKGDRFIHAPVEKFCLQLITSETWEIIPD 889

Query: 706  STYPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-----I 755
            +   +  +E+   + S     +  V     +  VGT  V  EE    +GRI++F     +
Sbjct: 890  AEIQMAEWEHVTCLKSVKLKSEETVSGLKEFIAVGTTNVCGEE-VACRGRIVIFDVIEVV 948

Query: 756  VEDGK------LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSE 809
             E GK      ++   +KE KG V ++    G L+ +I QKI  Y W  RD+        
Sbjct: 949  PEPGKPLTKNKIKTYYDKEQKGPVTAITCVEGFLVTSIGQKI--YIWEFRDNKD------ 1000

Query: 810  CGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDD 869
                               ++G  M  I  LIY H       +   +N N+      L  
Sbjct: 1001 -------------------LIG--MAFIDTLIYIHSLD--RHQLEIFNTNFYVNKNQLG- 1036

Query: 870  DIYLGAENNFNLFTVRKN--SEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSD 927
              ++  E++   F VR+     G+      R +V     L +  N  +H +         
Sbjct: 1037 --FVAPESHGGQFLVRRAEIQTGSNAHAFFRTKVRA---LNQRQNENKHITW-------- 1083

Query: 928  VGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKK 987
                    FGT++G IG++  +  ++Y  L  LQ  L   ++   GLN + +R+F + +K
Sbjct: 1084 --------FGTLDGSIGLLLPVDEKEYRRLFSLQAKLSIYLEQNAGLNQKAFRTFRSHQK 1135

Query: 988  TV--DAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRL 1035
             +    +N LDGDL++ +  L      +++K +  + E++   + +LT+L
Sbjct: 1136 KLQNSMRNILDGDLLKRYFHLGFVERRDLAKQIMSTPEQI---INDLTKL 1182


>gi|322792443|gb|EFZ16427.1| hypothetical protein SINV_15375 [Solenopsis invicta]
          Length = 1532

 Score = 89.4 bits (220), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 84/321 (26%), Positives = 149/321 (46%), Gaps = 39/321 (12%)

Query: 731  YYCVGTAYVLPEENEPTKGRILVF-IVE----------DGKLQLIAEKETKGAVYSLNAF 779
            Y  +GT Y   E+   ++GRIL+F I+E            + + I  KE KG + ++   
Sbjct: 1060 YIVLGTNYNYGEDIT-SRGRILIFDIIEVVPEPGQPLTKNRFKQIYAKEQKGPITAITQV 1118

Query: 780  NGKLLAAINQKIQLYKWMLRDD---GTRELQSECGHHGHILALYVQTRGDFIVVGDLMKS 836
            +G L++A+ QKI  Y W L+D+   G   + ++   H  +L++        I++ D+ KS
Sbjct: 1119 SGFLVSAVGQKI--YIWQLKDNDLVGVAFIDTQIYIH-QMLSI-----KSLILIADVYKS 1170

Query: 837  ISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG-----AENNFNLFTVRKNSEGA 891
            ISLL ++ E   +   +RD+    +  +E L D+  LG      E+N  LF  +  S  +
Sbjct: 1171 ISLLRFQEEYRTLSLVSRDFRPAEVYTIEYLIDNTNLGFIVADGESNLALFMYQPESRES 1230

Query: 892  TDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIP------TVIFGTVNGVIGV 945
               +  +L    ++HLG+ VN F      +  P +D  Q          ++ +++G +G 
Sbjct: 1231 LGGQ--KLIRKADFHLGQKVNTFFRIRCRVTDPANDKKQFSGADKRHVTMYASLDGSLGY 1288

Query: 946  IASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNN---EKKTVDAKNFLDGDLIES 1002
            I  +P + Y  L  LQ  L   I  + GLN + +R         +   A+  +DGDL+  
Sbjct: 1289 ILPVPEKTYRRLLMLQNVLVTHICHIAGLNPKSYRHTYKSYIRNQGNPARGIIDGDLVWR 1348

Query: 1003 FLDLSRTRMDEISKTMNVSVE 1023
            +L L      +++K +   V+
Sbjct: 1349 YLFLPNNEKADLAKKIGTRVQ 1369



 Score = 51.2 bits (121), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 67/329 (20%), Positives = 139/329 (42%), Gaps = 60/329 (18%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRP--T 63
           ++G I +++     G  +D L ++    K  V+++D +  +L T ++    +   +   T
Sbjct: 41  LHGNIMSMQAVHLIGSQRDSLLLSFRDAKLSVVEYDQDIHDLRTVSLHYFEEEEIKDGWT 100

Query: 64  DNGQIGII--DPDCRLIGLHLYDGLFKVIPF-------------------DNKGQLKEAF 102
           ++  I I+  DP+ R   + ++     V+PF                    NK  +  ++
Sbjct: 101 NHHHIPIVRVDPEGRCAVMLIFGRKLVVLPFRKDPSLDDGDLLDTAKLTSSNKAPILSSY 160

Query: 103 NIRLEELQ-----VLDIKFLYGCAKPTIVVLYQDNKD------ARHVKTYEVALK---DK 148
            I L+ L+     V+D++FL+G  +PT+++LY+  +        R      VA+     +
Sbjct: 161 MIVLKSLEEKMDNVIDLQFLHGYYEPTLLILYEPVRTFAGRIAVRQDTCAMVAISLNIQQ 220

Query: 149 DFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAFKA 195
                 WS +NL       +PV  PL G LI+   +++Y +             A+    
Sbjct: 221 RVHPIIWSVSNLPFDCYQAVPVKKPLGGTLIMAFNSLIYLNQSIPPYGVSLNSLADTSTN 280

Query: 196 IPIRP------SITKA-YGRVDADGSRYLLGDHAGLLHLL-VITHEKEKVTGLKIELLGE 247
            P++P      S+  A    + AD  R ++   +G L++L +       V G   +    
Sbjct: 281 FPLKPQEGVKMSLEGAQVAFISAD--RLVISLKSGELYVLSLFADSMRSVRGFHFDKAAA 338

Query: 248 TSIASTISYLDNAVVYIGSSYGDSQLIKL 276
           + + S +   ++  +++GS  G+S L++ 
Sbjct: 339 SVLTSCVCMCEDNYLFLGSRLGNSLLLRF 367



 Score = 44.7 bits (104), Expect = 0.31,   Method: Compositional matrix adjust.
 Identities = 45/155 (29%), Positives = 76/155 (49%), Gaps = 21/155 (13%)

Query: 285 SYV-EVLERYVNLGPIVDFCVVD--------LERQGQG-QVVTCSGAYKDGSLRIVRNGI 334
           SY+ EV +  +N+GP  +  + +        L+ Q    ++VT SG  K+G+L +++  I
Sbjct: 435 SYIFEVCDSLLNIGPCGNISMGEPAFLSEEFLQNQDPDVELVTTSGYGKNGALCVLQRSI 494

Query: 335 GINEQASVELQGIKGMWSLRSS-TDDPFDT-------FLVVSFISETRILAMNLEDELEE 386
                 + EL G + MW++  +  +D   T       FL++S    T IL      E+ E
Sbjct: 495 RPQVVTTFELPGCEDMWTVIGTLNNDEIKTEAEGSHAFLILSQEDSTMILQTG--QEINE 552

Query: 387 TEIEGFCSQTQTLFCHDAIYNQ-LVQVTSGSVRLV 420
            +  GF +Q  T+F  +   N+ +VQVT   VRL+
Sbjct: 553 VDQSGFSTQGSTVFAGNLGANRYIVQVTQMGVRLL 587


>gi|12653387|gb|AAH00463.1| SF3B3 protein [Homo sapiens]
 gi|13111947|gb|AAH03146.1| SF3B3 protein [Homo sapiens]
          Length = 399

 Score = 89.4 bits (220), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 88/349 (25%), Positives = 161/349 (46%), Gaps = 45/349 (12%)

Query: 710  LDTFEYGCSILSCSFSDDSNVYYC-VGTAYVLPEENEPTKGRILVF--IVEDG-KLQLIA 765
            L+  E   S+  C FS+    +Y  VG A  L        G  +    +V +G KL+ + 
Sbjct: 64   LEQNEAAFSVAVCRFSNIGEDWYVLVGVAKDLILNPRSVAGGFVYTYKLVNNGEKLEFLH 123

Query: 766  EKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYV---Q 822
            +   +    ++  F G++L  + + +++Y     D G ++L  +C  + HI A Y+   Q
Sbjct: 124  KTPVEEVPAAIAPFQGRVLIGVGKLLRVY-----DLGKKKLLRKC-ENKHI-ANYISGIQ 176

Query: 823  TRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLF 882
            T G  ++V D+ +S   + YK  E  +   A D    W++   +LD D   GA+   N+ 
Sbjct: 177  TIGHRVIVSDVQESFIWVRYKRNENQLIIFADDTYPRWVTTASLLDYDTVAGADKFGNIC 236

Query: 883  TVR--KNSEGATDE---------ERGRL-------EVVGEYHLGEFVNRFRHGSLVMRLP 924
             VR   N+    DE         +RG L       EV+  YH+GE V   +  +L+    
Sbjct: 237  VVRLPPNTNDEVDEDPTGNKALWDRGLLNGASQKAEVIMNYHVGETVLSLQKTTLIP--- 293

Query: 925  DSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFN 983
                G   ++++ T++G IG++     HE + F + ++ +LR     + G +H  +RS+ 
Sbjct: 294  ----GGSESLVYTTLSGGIGILVPFTSHEDHDFFQHVEMHLRSEHPPLCGRDHLSFRSY- 348

Query: 984  NEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
                    KN +DGDL E F  +   +   +S+ ++ +  E+ K++E++
Sbjct: 349  ----YFPVKNVIDGDLCEQFNSMEPNKQKNVSEELDRTPPEVSKKLEDI 393


>gi|9794908|gb|AAF98388.1| cleavage and polyadenylation specificity factor [Drosophila
            melanogaster]
          Length = 813

 Score = 89.4 bits (220), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 106/435 (24%), Positives = 201/435 (46%), Gaps = 62/435 (14%)

Query: 654  IRSIPLGEHPRRICHQEQSRTFAICSL-------------KNQSCAEES-EMHFVR---- 695
            +R +PL   PR++ +  ++R + + +              +++  +EES +  F+     
Sbjct: 384  VRKVPLRCTPRQLVYHRENRVYCLITQTEEPMTKYYRFNGEDKELSEESRDERFIYPIGS 443

Query: 696  -----LLDDQTFEFI--STYPLDTFEY--GCSILSCSFSDDSN---VYYCVGTAYVLPEE 743
                 L+  +T+E +  ++   + +E+     I+  S+    +    Y C+GT +    E
Sbjct: 444  QFEMVLISPETWEIVPDASITFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNY-SE 502

Query: 744  NEPTKGRILVF-----IVEDGK------LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQ 792
            +  ++G I ++     + E GK      ++ I +KE KG V +++   G L+  + QKI 
Sbjct: 503  DITSRGNIHIYDIIEVVPEPGKPMTKFKIKEIFKKEQKGPVSAISDVLGFLVTGLGQKI- 561

Query: 793  LYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEER 852
             Y W LRD    +L        +I    + T    I + D+ KSISLL ++ E   +   
Sbjct: 562  -YIWQLRDG---DLIGVAFIDTNIYVHQIITVKSLIFIADVYKSISLLRFQEEYRTLSLA 617

Query: 853  ARDYNANWMSAVEILDDDIYLG-----AENNFNLFTVRKNSEGATDEERGRLEVVGEYHL 907
            +RD+N   +  +E + D+  LG     AE N  ++  +  +  +   ++  L    +YHL
Sbjct: 618  SRDFNPLEVYGIEFMVDNSNLGFLVTDAERNIIVYMYQPEARESLGGQK--LLRKADYHL 675

Query: 908  GEFVNR-FR----HGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQT 962
            G+ VN  FR       L  R P     +   V++GT++G +G    LP + Y     LQ 
Sbjct: 676  GQVVNTMFRVQCHQKGLHQRQPFLYENK-HFVVYGTLDGALGYCLPLPEKVYRRFLMLQN 734

Query: 963  NLRKVIKGVGGLNHEQWRSFNNEKKT--VDAKNFLDGDLIESFLDLSRTRMDEISKTMNV 1020
             L    + + GLN +++R+  + KK     ++  +DGDLI S+  ++ +  +E++K +  
Sbjct: 735  VLLSYQEHLCGLNPKEYRTLKSSKKQGINPSRCIIDGDLIWSYRLMANSERNEVAKKIGT 794

Query: 1021 SVEELCKRVEELTRL 1035
              EE+   + E+ RL
Sbjct: 795  RTEEILGDLLEIERL 809


>gi|390347522|ref|XP_003726804.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Strongylocentrotus purpuratus]
          Length = 1439

 Score = 89.4 bits (220), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 82/322 (25%), Positives = 144/322 (44%), Gaps = 38/322 (11%)

Query: 731  YYCVGTAYVLPEENEPTKGRILVF-----IVEDG------KLQLIAEKETKGAVYSLNAF 779
            Y  V T +V  E+ + T+G + ++     + E G      KL+ + EK  KG V +L   
Sbjct: 1115 YVVVATTHVYSEDLQ-TRGSVYIYDCIEVVPEPGQPLTKNKLKPLYEKRQKGPVSALCEV 1173

Query: 780  NGKLLAAINQKIQLYKWMLRDD---GTRELQSECGHHGHILALYVQTRGDFIVVGDLMKS 836
             G LL  I QK+  Y W  +D+   G   + ++   H  +      +   FI++ D+MK 
Sbjct: 1174 MGFLLTCIGQKV--YMWQFKDNDLIGLAFIDTQIYIHNAV------SVKQFILITDVMKG 1225

Query: 837  ISLLIYKHEEGAIEERARDYNANWMSAVEILDDD---IYLGAENNFNLFTVRKNSEGATD 893
               L Y+ ++  +   +RD     +   E + DD    +L ++ + NL     + E    
Sbjct: 1226 AYFLQYQAQDRTLSLVSRDARPLEIFGCEFMVDDKQMAFLVSDADKNLIVFHYHPEAPES 1285

Query: 894  EERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQI--------PTVIFGTVNGVIGV 945
                 L   G+ ++G  VN F    +  RL D    Q+          V F T++G +G+
Sbjct: 1286 HGGAYLLRRGDMNIGSAVNTFVR--VRCRLTDPSTEQVLSGPVLRRQVVFFATLDGSLGL 1343

Query: 946  IASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVD--AKNFLDGDLIESF 1003
            +  +  + Y  L  LQ  L   +  VGGLN + +R   +  + ++   +N LDGDL+  +
Sbjct: 1344 LLPMVEKTYRRLLMLQNVLTNGLPHVGGLNPKSYRHVKSHMRNLNNPHRNILDGDLLLKY 1403

Query: 1004 LDLSRTRMDEISKTMNVSVEEL 1025
              LS    +E +K +  SV+++
Sbjct: 1404 CHLSVVERNEFAKKIGTSVDQI 1425



 Score = 58.2 bits (139), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 70/332 (21%), Positives = 139/332 (41%), Gaps = 71/332 (21%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
           I+G++ +++  +  G  +D L ++    K  ++++D    +L T +M    +      D 
Sbjct: 70  IFGKVMSMQSAQVTGSGRDALLLSFMNAKVSIVEYDPNMHDLKTLSMHYFEE------DE 123

Query: 66  GQIGI----------IDPDCRLIGLHLYDGLFKVIPF----------------DNKGQLK 99
            + G+          +DPD R   +  Y     V+PF                  +G L 
Sbjct: 124 TKEGVYRNIFHPVVKVDPDHRCAIMLTYGSKLVVLPFRRDGLVEDLDKSMSASTRRGALM 183

Query: 100 EAFNIRLEEL-----QVLDIKFLYGCAKPTIVVLYQD-----NKDARHVKTYEVALKDKD 149
            ++ IRL E+      VLDI+FL+G  +PT+++LY+       + A    T  +     +
Sbjct: 184 PSYVIRLNEMDDPICNVLDIQFLHGYYEPTLLILYEPLRTWAGRVAVRQDTCSIVALSLN 243

Query: 150 FVEGP----WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVY---------CSANAF--- 193
             +      WSQ++L      +  VP P+ GVLI+   +++Y          S N+    
Sbjct: 244 MAQKVHPIIWSQSSLPYDCMQVQAVPKPIGGVLILAVNSLLYLNQSIPPYGVSLNSLTDW 303

Query: 194 -KAIPIRPSITKAYGRVDADGS--------RYLLGDHAGLLHLL-VITHEKEKVTGLKIE 243
             A P++   T+   ++  D +        R  L    G +++L ++      V G  ++
Sbjct: 304 STAFPLK---TQEGVKLSMDCTQATFISYDRLALSLKDGEIYVLTLLVDGMRSVRGFHLD 360

Query: 244 LLGETSIASTISYLDNAVVYIGSSYGDSQLIK 275
               + + + I  + +  +++GS  G+S L+K
Sbjct: 361 KAAASVLTTCICPMGDGFLFLGSRLGNSLLLK 392


>gi|66361481|ref|XP_627314.1| possible spliceosome factor [Cryptosporidium parvum Iowa II]
 gi|46228697|gb|EAK89567.1| possible spliceosome factor [Cryptosporidium parvum Iowa II]
          Length = 1317

 Score = 89.0 bits (219), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 110/522 (21%), Positives = 209/522 (40%), Gaps = 76/522 (14%)

Query: 176 GVLIIGEETIVYCS---ANAFKAIPIRPSITKAYGRVDADGSRY--------LLGDHAGL 224
           GVL+   + I+YC    +N F   P R   +  YG +    S +        L+    G 
Sbjct: 245 GVLVCDYKGIIYCKMGHSNIFCPYPFRFGDSSEYGTMIVASSLHKLKGFFLILVQTELGD 304

Query: 225 LHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKL-NLQPDAK 283
           ++ + + H +  V  ++I       + +++  L +  ++    +G+    ++ +L  D  
Sbjct: 305 IYRINLIHNEGIVKEMRIYYYDTIPVCNSLLLLRSGFLFASHEFGNHTNYQIVSLGDDKT 364

Query: 284 GSYVEVL----------------------ERYVNLGPIVDFCVVDLERQGQGQVVTCSGA 321
             Y   L                      E  ++L PI D  V+D    G  Q+V   G 
Sbjct: 365 DPYTSSLPDSNDLKRVYFRPRNCQCIRKSEEILSLSPITDIKVIDTNNDGTPQIVATCGR 424

Query: 322 YKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTD-------------DPFDTFLVV 367
               +LR+   G  + E A   L G  + +W+L++  D             D    ++++
Sbjct: 425 GPRSTLRVCSYGKNVEEIAENPLPGRPRCIWTLKNGIDPSLSGSQAEAAILDNIHHYIII 484

Query: 368 SFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAI-YNQLVQVTSGSVRLVSSTSRE 426
           SFI   R L + + + +EET    F     T++    I YN  +QV    V+L+    ++
Sbjct: 485 SFID--RSLVLTIGEHVEETNDTLFTLNETTMYAASMIFYNSFLQVLETHVKLI---IQD 539

Query: 427 LRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-----------GDGILTEVKHAQ 475
              +WK+P    +  A +N  QV LA  GG +V LE+           G G L EV   +
Sbjct: 540 RIYDWKTPDSRKIIAADSNGRQVSLALEGGLIVILELNVNGVSGITNTGMGGLVEVCRRE 599

Query: 476 LEYEISCLDINPIGENPSY-SQIAAVGMWTDISVRIFSLPDLNLITKEHL-----GGEII 529
           +  EI C+ I  +  +    S    VG  T+ ++R++ +       K+           I
Sbjct: 600 ITCEIICIGIQQLSYSGQMRSDYVVVGTSTENALRLYKIDSAEKRLKQTCTQILPNSNSI 659

Query: 530 PRSVLLCAFEGISYLLCALG--DGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKN 587
           P +V L       +L+  +G   G +L+  ++   G ++D +   LG + + +     ++
Sbjct: 660 PENVQLYHSNKYGHLILFVGLTTGVILSCKVDASNGSISDPRSKYLGNRGVNICRIMRED 719

Query: 588 ---TTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNS 626
                 +   S RP ++ S    + ++ +  + +  + P N+
Sbjct: 720 FGGEMSLVCMSSRPWLVDSQTSGVNFTPLQYRCIDSIAPLNT 761



 Score = 82.8 bits (203), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 93/390 (23%), Positives = 164/390 (42%), Gaps = 68/390 (17%)

Query: 694  VRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLP------------ 741
            VR+++ ++ E I   PLDT E   S   C F +       +GT Y +             
Sbjct: 947  VRIVNLKSMETIQLIPLDTNEGCISACVCKFDELDLPCLVLGTVYGMKLNRGYNNNNSNN 1006

Query: 742  ---------EENEPTKGRILVFIVEDGK--LQLIAEKETKGAVYSLNAFNGKLLAAINQK 790
                     +EN  + G  +     D     +L+     + +  +L  + G+LL  IN+ 
Sbjct: 1007 STDSAKNNDKENHDSFGAAIKIFKYDSNYNFELVHITPIENSATALTGWRGRLLVGINKT 1066

Query: 791  IQLY----KWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEE 846
            +++Y    K +LR    R +             +++   D I  GD+   + +  + +  
Sbjct: 1067 LRVYSLGKKRLLRKSEYRNIPQGL--------TWIKVVNDRIFAGDISNGVLVFKFNNTS 1118

Query: 847  GAIEERARDYNANWM-SAVEILD----------DDIYLG-----AENNFNL---FTVRKN 887
                  A+D    W+ SA E+LD          D+I +      A ++F+    FT   N
Sbjct: 1119 NQFILVAKDPMPRWLTSACEVLDYHTIAVSDKFDNIIVSRVPAEASDDFSFVTSFTDNNN 1178

Query: 888  SEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIA 947
            S+ +      ++  V ++HLG+ V   +   L    P S       +I+GTV G IG ++
Sbjct: 1179 SQSSALMRTHQINTVAQFHLGDIVTCLQKSQLT---PTS----AEAIIYGTVLGSIGSLS 1231

Query: 948  S-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDL 1006
              L +E    L KL+  LRK    +   +H  +RS+ +        N +DGD  ++F  L
Sbjct: 1232 PILNNEDIELLSKLEILLRKQKSTLLSRDHLMFRSYYSP-----VHNVIDGDFCQTFTIL 1286

Query: 1007 SRTRMDEISKTMNVSVEELCKRVEEL-TRL 1035
                  EI+  ++V+VEE+ K++++  TRL
Sbjct: 1287 DSQIQSEIASKLDVTVEEIYKKLDDYKTRL 1316


>gi|390358535|ref|XP_789715.3| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Strongylocentrotus purpuratus]
          Length = 1223

 Score = 89.0 bits (219), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 82/322 (25%), Positives = 144/322 (44%), Gaps = 38/322 (11%)

Query: 731  YYCVGTAYVLPEENEPTKGRILVF-----IVEDG------KLQLIAEKETKGAVYSLNAF 779
            Y  V T +V  E+ + T+G + ++     + E G      KL+ + EK  KG V +L   
Sbjct: 899  YVVVATTHVYSEDLQ-TRGSVYIYDCIEVVPEPGQPLTKNKLKPLYEKRQKGPVSALCEV 957

Query: 780  NGKLLAAINQKIQLYKWMLRDD---GTRELQSECGHHGHILALYVQTRGDFIVVGDLMKS 836
             G LL  I QK+  Y W  +D+   G   + ++   H  +      +   FI++ D+MK 
Sbjct: 958  MGFLLTCIGQKV--YMWQFKDNDLIGLAFIDTQIYIHNAV------SVKQFILITDVMKG 1009

Query: 837  ISLLIYKHEEGAIEERARDYNANWMSAVEILDDD---IYLGAENNFNLFTVRKNSEGATD 893
               L Y+ ++  +   +RD     +   E + DD    +L ++ + NL     + E    
Sbjct: 1010 AYFLQYQAQDRTLSLVSRDARPLEIFGCEFMVDDKQMAFLVSDADKNLIVFHYHPEAPES 1069

Query: 894  EERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQI--------PTVIFGTVNGVIGV 945
                 L   G+ ++G  VN F    +  RL D    Q+          V F T++G +G+
Sbjct: 1070 HGGAYLLRRGDMNIGSAVNTFVR--VRCRLTDPSTEQVLSGPVLRRQVVFFATLDGSLGL 1127

Query: 946  IASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVD--AKNFLDGDLIESF 1003
            +  +  + Y  L  LQ  L   +  VGGLN + +R   +  + ++   +N LDGDL+  +
Sbjct: 1128 LLPMVEKTYRRLLMLQNVLTNGLPHVGGLNPKSYRHVKSHMRNLNNPHRNILDGDLLLKY 1187

Query: 1004 LDLSRTRMDEISKTMNVSVEEL 1025
              LS    +E +K +  SV+++
Sbjct: 1188 CHLSVVERNEFAKKIGTSVDQI 1209


>gi|238609081|ref|XP_002397396.1| hypothetical protein MPER_02189 [Moniliophthora perniciosa FA553]
 gi|215471757|gb|EEB98326.1| hypothetical protein MPER_02189 [Moniliophthora perniciosa FA553]
          Length = 180

 Score = 89.0 bits (219), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 54/156 (34%), Positives = 85/156 (54%), Gaps = 6/156 (3%)

Query: 295 NLGPIVDFCVVD-LERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIK-GMWS 352
           +L PI+D  V++ L      Q+    G     SLR +R+G+ + E  S EL GI   +W+
Sbjct: 29  SLCPILDSKVLNILPNSDTPQIFAVCGRGARSSLRTLRHGLEVEESVSSELPGIPNAVWT 88

Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
            +   DDPFD+++++SF++ T +L++   + +EE +  GF S   TL       + L+QV
Sbjct: 89  TKKREDDPFDSYIILSFVNGTLVLSIG--ETIEEVQDTGFLSSAPTLAVQQIGADALLQV 146

Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQ 448
             G +R V + SR   NEW+ P G S+  AT N  Q
Sbjct: 147 HPGGIRHVLADSRV--NEWRVPTGKSIVAATTNKRQ 180


>gi|402579535|gb|EJW73487.1| hypothetical protein WUBG_15606 [Wuchereria bancrofti]
          Length = 254

 Score = 88.6 bits (218), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 75/236 (31%), Positives = 118/236 (50%), Gaps = 19/236 (8%)

Query: 357 TDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGS 416
           T D FD+ +VVSF++ T +L++   + +EE    GF   T TL C     + L+QV    
Sbjct: 9   TLDKFDSHIVVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLGCALIGDDALLQVYPDG 66

Query: 417 VRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKHA 474
           +R + +  R   NEWK+P   ++     N  QV +A  GG LVY E+   G L E  +  
Sbjct: 67  IRHIRADRRV--NEWKAPGKRTIMKCALNRRQVAIALAGGELVYFELDVTGQLNEYTERR 124

Query: 475 QLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPD---LNLITKEHLGGEIIPR 531
           +L  ++ C+ ++ I E    S+   VG+  D +VRI SL     L+ ++ + L  E  P 
Sbjct: 125 ELPADVLCMSLSEIPEGELRSRFLTVGL-ADKTVRIISLDPQDCLSPLSMQALPSE--PE 181

Query: 532 SVLLCAFEGIS-------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITL 580
           S+++    G         +L   L +G LL   ++  TGELTD +   LGT+ + L
Sbjct: 182 SIIVLEMFGTEIQSASTVHLNIGLQNGCLLRTTVDQVTGELTDNRTRYLGTKSVKL 237


>gi|297722899|ref|NP_001173813.1| Os04g0252200 [Oryza sativa Japonica Group]
 gi|255675253|dbj|BAH92541.1| Os04g0252200, partial [Oryza sativa Japonica Group]
          Length = 432

 Score = 88.6 bits (218), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 86/353 (24%), Positives = 159/353 (45%), Gaps = 33/353 (9%)

Query: 702  FEFISTYPLDTFEYGCSI----LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVF--I 755
            +E  ST P+  FE   ++    L  + + ++     +GTAYVL E+    +GR+L+F   
Sbjct: 86   WETKSTIPMQLFENALTVRIVTLHNTTTKENETLLAIGTAYVLGED-VAARGRVLLFSFT 144

Query: 756  VEDGKLQLIAE---KETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGH 812
              +    L+ E   KE+KGAV ++ +  G LL A   KI L KW        EL +   +
Sbjct: 145  KSENSQNLVTEVYSKESKGAVSAVASLQGHLLIASGPKITLNKWT-----GAELTAVAFY 199

Query: 813  HGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIY 872
               +  + +    +F++ GD+ KSI  L +K +   +   A+D+ +    A E L D   
Sbjct: 200  DAPLHVVSLNIVKNFVLFGDIHKSIYFLSWKEQGSQLSLLAKDFGSLDCFATEFLIDGST 259

Query: 873  LG-----AENNFNLFTVRKNSEGATDEERG-RLEVVGEYHLGEFVNRFRHGSLVMRLPDS 926
            L      ++ N  +F     +    +  +G +L    E+H+G  + +F     +  LP  
Sbjct: 260  LSLVASDSDKNVQIFYY---APKMVESWKGQKLLSRAEFHVGAHITKFLR---LQMLPTQ 313

Query: 927  DVGQIPT----VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSF 982
             +    T    ++FG ++G IG IA +    +  L+ LQ  L   +  V GLN   +R F
Sbjct: 314  GLSSEKTNRFALLFGNLDGGIGCIAPIDELTFRRLQSLQRKLVDAVPHVCGLNPRSFRQF 373

Query: 983  NNEKK--TVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELT 1033
            ++  K       N +D +L+  +  LS     ++++ +  +  ++     +++
Sbjct: 374  HSNGKGHRPGPDNIIDFELLAHYEMLSLDEQLDVAQQIGTTRSQILSNFSDIS 426


>gi|260835071|ref|XP_002612533.1| hypothetical protein BRAFLDRAFT_120973 [Branchiostoma floridae]
 gi|229297910|gb|EEN68542.1| hypothetical protein BRAFLDRAFT_120973 [Branchiostoma floridae]
          Length = 1003

 Score = 88.6 bits (218), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 110/470 (23%), Positives = 198/470 (42%), Gaps = 82/470 (17%)

Query: 618  VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQS 672
            V+   PF++   P   L   + GEL I  +           +R +PL   P  + +  + 
Sbjct: 560  VTCFSPFHNVNCPKGFLYFNRGGELRISVLPTHLSYDAPWPVRKVPLRCTPHFVAYHMEC 619

Query: 673  RTFAICSLKNQSC---------------AEESEMHFVRLLDDQTFEFISTYPLD------ 711
            + +A+ +   + C                E+ + +   +LD    + +S    +      
Sbjct: 620  KVYAVAASTFEMCNRIPRMAGDEKEYDAVEKDDRYIYPMLDKFNIQLMSPVSWEIIPNTR 679

Query: 712  --TFEYGCSILSCSFSDDSN-VYYCVGTAYVL------PEENEPTKGRILVFIVEDGKLQ 762
                E   +  +CSF    N V +  G   +L      PE  +P         +   K++
Sbjct: 680  GMQLEENYAECTCSFLVGINFVLFVAGQIVILDVIEVVPEPGQP---------LTKNKIK 730

Query: 763  LIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDD---GTRELQSECGHHGHILAL 819
             +  KE KG V +L   NG LL+AI QKI L  W  R++   G   + ++   H  I   
Sbjct: 731  ELYGKEQKGPVSALCGCNGYLLSAIGQKIFL--WEFRNNDLIGVAFIDTQVYIHTAI--- 785

Query: 820  YVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG----- 874
               +  +++++ D+ KSISLL Y+           D        VE   D+  +G     
Sbjct: 786  ---SIKNYVILADVFKSISLLRYQ-----------DMRPLETYCVEFFVDNAQIGFLVSD 831

Query: 875  AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRF-RHGSLVM-----RLPDSD- 927
            A+ NF L++ +  +  +   +R  L    ++++G  VN F R    +M     R  D+D 
Sbjct: 832  AQKNFLLYSYQPEARESYGGQR--LVRRADFNVGSHVNTFFRVRCKIMDPSGERRRDADT 889

Query: 928  VGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSF-NNEK 986
            V +    +F T++G +G +  +  + Y  L  LQ  L   +    GLN + +R   +N +
Sbjct: 890  VAKRHVTMFATLDGGLGALLPMAEKTYRRLLMLQNTLMTHMPFPAGLNPKAFRMLKHNHR 949

Query: 987  KTVDA-KNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRL 1035
              ++A +N LDG+L+  FL LS     E+++ +  S E + + + ++ RL
Sbjct: 950  SLINACRNILDGELLWKFLHLSVVERSELARKIGTSPETITEDLMDIDRL 999


>gi|67600754|ref|XP_666354.1| CG13900 gene product [Cryptosporidium hominis TU502]
 gi|54657334|gb|EAL36124.1| CG13900 gene product [Cryptosporidium hominis]
          Length = 1318

 Score = 88.6 bits (218), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 124/565 (21%), Positives = 232/565 (41%), Gaps = 81/565 (14%)

Query: 136 RHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVP---PPLCGVLIIGEETIVYCS--- 189
           +H+  +EV L   ++V    SQ   D+ +  LI VP       GVL+   + I+YC    
Sbjct: 204 KHLIFWEVDL-GLNYVSRKSSQVITDS-SHTLISVPGGNDGPSGVLVCDYKGIIYCKVGH 261

Query: 190 ANAFKAIPIRPSITKAYGRVDADGSRY--------LLGDHAGLLHLLVITHEKEKVTGLK 241
           +N F   P R   +  YG +    S +        L+    G ++ + + H +  V  ++
Sbjct: 262 SNIFCPYPFRFGDSSEYGTMIVASSLHKLKGFFLILVQTELGDIYRINLIHNEGIVKEMR 321

Query: 242 IELLGETSIASTISYLDNAVVYIGSSYGDS---QLIKL---------NLQPDA---KGSY 286
           I       + +++  L +  ++    +G+    Q++ L         +  PD+   K  Y
Sbjct: 322 IYYYDTIPVCNSLLLLRSGFLFASHEFGNHTNYQIVSLGDDKTDPCTSSLPDSNDLKRVY 381

Query: 287 --------VEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINE 338
                   +   E  ++L PI D  V+D    G  Q+V   G     +LR+   G  + E
Sbjct: 382 FRPRNCQCIRKSEEILSLSPITDIKVIDTNNDGTPQIVATCGRGPRSTLRVCSYGKNVEE 441

Query: 339 QASVELQG-IKGMWSLRSSTD-------------DPFDTFLVVSFISETRILAMNLEDEL 384
            A   L G  + +W+L++  D             D    ++++SFI   R L + + + +
Sbjct: 442 IAENPLPGRPRCIWTLKNGIDPSLSGSQAEAAILDNIHHYIIISFID--RSLVLTIGEHV 499

Query: 385 EETEIEGFCSQTQTLFCHDAI-YNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVAT 443
           EET    F     T++    I YN  +QV    V+L+    ++   +WK+P    +  A 
Sbjct: 500 EETNDTLFTLNETTMYAASMIFYNSFLQVLETHVKLI---IQDRIYDWKTPDSRKIIAAD 556

Query: 444 ANASQVLLATGGGHLVYLEI-----------GDGILTEVKHAQLEYEISCLDINPIGENP 492
           +N  QV LA  GG +V LE+           G G L EV   ++  EI C+ I  +  + 
Sbjct: 557 SNGRQVSLALEGGLIVILELNVNGVSGITNTGIGGLVEVCRREITCEIICIGIQQLSYSG 616

Query: 493 SY-SQIAAVGMWTDISVRIFSLPDLNLITKEHL-----GGEIIPRSVLLCAFEGISYLLC 546
              S    VG  T+ ++R++ +       K+           IP +V L       +L+ 
Sbjct: 617 QMRSDYVVVGTSTENALRLYKIDSAEKRLKQTCTQILPNSNSIPENVQLYHSNKYGHLIL 676

Query: 547 ALG--DGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKN---TTHVFAASDRPTVI 601
            +G   G +L+  ++   G ++D +   LG + + +     ++      +   S RP ++
Sbjct: 677 FVGLTTGVILSCKVDASNGSISDPRSKYLGNRGVNICRIMREDFGGEMSLVCMSSRPWLV 736

Query: 602 YSSNKKLLYSNVNLKEVSHMCPFNS 626
            S    + ++ +  + +  + P N+
Sbjct: 737 DSQTSGVNFTPLQYRCIDSIAPLNT 761



 Score = 82.8 bits (203), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 93/391 (23%), Positives = 164/391 (41%), Gaps = 69/391 (17%)

Query: 694  VRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLP------------ 741
            VR+++ ++ E I   PLDT E   S   C F +       +GT Y +             
Sbjct: 947  VRIVNLKSMETIQLIPLDTNEGCISACVCKFDELDLPCLVLGTVYGMKLNRGYNNNNNSN 1006

Query: 742  ----------EENEPTKGRILVFIVEDGK--LQLIAEKETKGAVYSLNAFNGKLLAAINQ 789
                      +EN  + G  +     D     +L+     + +  +L  + G+LL  IN+
Sbjct: 1007 NNTDSAKNNDKENHDSFGAAIKIFKYDSNYNFELVHITPIENSATALTGWRGRLLVGINK 1066

Query: 790  KIQLY----KWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHE 845
             +++Y    K +LR    R +             +++   D I  GD+   + +  + + 
Sbjct: 1067 TLRVYSLGKKRLLRKSEYRNIPQGL--------TWIKVVNDRIFAGDISNGVLVFKFNNT 1118

Query: 846  EGAIEERARDYNANWM-SAVEILD----------DDIYLG-----AENNFNL---FTVRK 886
                   A+D    W+ SA E+LD          D+I +      A ++F+    FT   
Sbjct: 1119 SNQFILVAKDPMPRWLTSACEVLDYHTIAVSDKFDNIIVSRVPVEASDDFSFVTSFTDNN 1178

Query: 887  NSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVI 946
            NS+ +      ++  V ++HLG+ V   +   L    P S       +I+GTV G IG +
Sbjct: 1179 NSQSSALMRTHQINTVAQFHLGDIVTCLQKSQLT---PTS----AEAIIYGTVLGSIGSL 1231

Query: 947  AS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLD 1005
            +  L +E    L KL+  LRK    +   +H  +RS+ +        N +DGD  ++F  
Sbjct: 1232 SPILNNEDIELLSKLEILLRKQKSTLLSRDHLMFRSYYSP-----VHNVIDGDFCQTFTI 1286

Query: 1006 LSRTRMDEISKTMNVSVEELCKRVEEL-TRL 1035
            L      EI+  ++V+VEE+ K++++  TRL
Sbjct: 1287 LDSKIQSEIASKLDVTVEEIYKKLDDYKTRL 1317


>gi|242075246|ref|XP_002447559.1| hypothetical protein SORBIDRAFT_06g003570 [Sorghum bicolor]
 gi|241938742|gb|EES11887.1| hypothetical protein SORBIDRAFT_06g003570 [Sorghum bicolor]
          Length = 389

 Score = 88.6 bits (218), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 87/347 (25%), Positives = 155/347 (44%), Gaps = 33/347 (9%)

Query: 707  TYPLDTFEYGCSI----LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILV--FIVEDGK 760
            T P+ +FE   ++    L  + + ++     +GTAYV   E+   +GR+L+  F   +  
Sbjct: 48   TIPMQSFENALTVRIVTLQNTSTKENETLMAIGTAYV-QGEDVAARGRVLLYSFSRSENS 106

Query: 761  LQLIAE---KETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHIL 817
              L+ E   KE+KGAV ++ +  G LL A   KI L KW        EL +   +   + 
Sbjct: 107  QNLVTEVYSKESKGAVSAVASLQGHLLIASGPKITLNKWT-----GSELTAVAFYDAPLH 161

Query: 818  ALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG--- 874
             + +    +F++ GD+ KSI  L +K +   +   A+D+ +    A E L D   L    
Sbjct: 162  VVSLNIVKNFVLFGDIHKSIYFLSWKEQGSQLNLLAKDFGSLDCFATEFLIDGSTLSLVV 221

Query: 875  --AENNFNLFTVRKNSEGATDEERG-RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQI 931
              ++ N  +F     +    +  +G +L    E+H+G  V++F     +  LP   +   
Sbjct: 222  SDSDKNVQIFYY---APKMVESWKGQKLLSRAEFHVGAHVSKFLR---LQMLPTQGLASE 275

Query: 932  PT----VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSF--NNE 985
             T    ++FGT++G IG IA +    +  L+ LQ  L   +  V GLN   +R F  N +
Sbjct: 276  KTNRFALVFGTLDGGIGCIAPVDELTFRRLQSLQRKLVDAVPHVCGLNPRSFRHFKSNGK 335

Query: 986  KKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
                   N +D +L+  +  LS     EI++ +  +  ++     + 
Sbjct: 336  AHRPGPDNIIDFELLSHYEMLSLEEQLEIAQQIGTTRSQILSNFSDF 382


>gi|321475208|gb|EFX86171.1| hypothetical protein DAPPUDRAFT_313209 [Daphnia pulex]
          Length = 1260

 Score = 88.2 bits (217), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 130/611 (21%), Positives = 254/611 (41%), Gaps = 132/611 (21%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGD------VSDRI 59
           ++G++ ++      G +QD + ++    K  ++++D  S  L T ++ +      + + I
Sbjct: 74  LFGKVMSIAAVSLPGSSQDTILMSFAHAKLSLIEYDPVSDNLKTLSLHNFEVVSILDEGI 133

Query: 60  GRPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPF------DNKGQLKEAFNIRLEELQ--- 110
           G      +I + DP+ R   L ++     ++PF      D+   L  ++ I+L +L+   
Sbjct: 134 GSNHKIPEIRV-DPEGRCAALLIFRNTLAILPFRKDSAHDSNVTLS-SYIIKLTDLEERV 191

Query: 111 --VLDIKFLYGCAKPTIVVLYQD----------NKDARHVKTYEVALKDKDFVEGP-WSQ 157
             V+D++FL+G  +PT+++LY+            +D        V+L  +  V    WS 
Sbjct: 192 DNVIDVQFLHGYYEPTLIILYEPVGTFPGRIAVRQDT--CNMVAVSLNTQQRVHPIIWSL 249

Query: 158 NNLDNGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAFKAIPIRPSITK 204
           N+L      L+PVP PL G LI+   +++Y +             A+   + P++P    
Sbjct: 250 NSLPFDCSQLLPVPKPLGGALIMAVNSVIYVNQSVPPYGVSVNSIADHCTSFPLKPYEGS 309

Query: 205 AYGRVDADGS-----RYLLGDHAGLLHLLVITHEKEK-VTGLKIELLGETSIASTISYLD 258
             G   A  +     R +L    G L++L +  +  + V    +E    + + + +   D
Sbjct: 310 RIGLDCARAAFLQYDRVVLSLKGGELYVLTLFADSMRSVRKFHLEKAAASVLTTCLCICD 369

Query: 259 NAVVYIGSSYGDSQLIKLNLQ--------------------------------------- 279
           N  +++GS  G+S L+    +                                       
Sbjct: 370 N-YLFLGSRLGNSLLLAFQTKDYNQYATPFAAKKPKMEQFSLLFDQELDHLDEEEIDNYL 428

Query: 280 -------PDAKG-SY-VEVLERYVNLGPIVDFCV-------VDLERQGQG---QVVTCSG 320
                   D+K  SY  EV +  +N+GP     V        D +++      ++VT SG
Sbjct: 429 YGEDHESTDSKAISYQFEVCDSLLNIGPCGQMAVGEPASTCTDFDKKSPDPDVEIVTTSG 488

Query: 321 AYKDGSLRIVRNGIGINEQASVELQGIKGMWSL---RSSTDDPFDTFLVVSFISETRILA 377
             K+G++ +++  +      + EL  +  M+++   R++ D    T+L++S    T +L 
Sbjct: 489 YGKNGAICVLQRTMKPQVVTTFELPEVSDMFTVFASRNNEDAIMHTYLLLSRADSTMVLQ 548

Query: 378 MNLEDELEETEIEGFCSQTQTLFCHDAIYNQ-LVQVTSGSVRLVSSTSRELRNEWKSPPG 436
                E+ E +  GF   + T+   +   N+ +VQV   SVRL+ +T+  ++ E      
Sbjct: 549 TG--QEINEMDQSGFSVTSPTILAANLGNNRFIVQVCPTSVRLLDATATVIQ-ELVMDSD 605

Query: 437 YSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQ 496
           + +  A+A+   V + T  G +       G+LT V+ +QLE     L  N        S 
Sbjct: 606 FLITSASASDPYVAVLTENGRI-------GLLTFVEGSQLEMIFPVLSKN--------SP 650

Query: 497 IAAVGMWTDIS 507
           +  V ++ DIS
Sbjct: 651 VVCVCLYRDIS 661



 Score = 58.5 bits (140), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 52/182 (28%), Positives = 88/182 (48%), Gaps = 27/182 (14%)

Query: 739  VLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWML 798
            V+PE  +P         +   K++ +  K+ KG V ++++  G L+AAI QKI  Y W L
Sbjct: 1092 VVPEPGQP---------LTKNKIKTLYAKDQKGPVAAISSVCGYLVAAIGQKI--YLWQL 1140

Query: 799  RDD---GTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARD 855
            ++D   G   + +E   H  +L +       FI+  D+ KS+S+L ++ E   +   ARD
Sbjct: 1141 KNDDLVGIAFIDTEIYIH-QLLNI-----KSFILAADVYKSVSILRFQEEYRTLCIVARD 1194

Query: 856  YNANWMSAVEILDDDIYLG-----AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEF 910
            Y    + AV+   D+  LG     AE N  L+  +   E    +   RL    ++H+G+ 
Sbjct: 1195 YQPLEVMAVDYYIDNTQLGFLVSDAEKNLILYMYQ--PEARESQGGHRLIRKADFHVGQV 1252

Query: 911  VN 912
            V+
Sbjct: 1253 VS 1254


>gi|195334368|ref|XP_002033855.1| GM20208 [Drosophila sechellia]
 gi|194125825|gb|EDW47868.1| GM20208 [Drosophila sechellia]
          Length = 1455

 Score = 88.2 bits (217), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 108/436 (24%), Positives = 198/436 (45%), Gaps = 64/436 (14%)

Query: 654  IRSIPLGEHPRRICHQEQSRTFAICSL-------------KNQSCAEESE-MHFVR---- 695
            +R +PL   PR++ +  ++R + + +              +++  +EES    F+     
Sbjct: 1026 VRKVPLRCTPRQLVYHRENRVYCLITQTEEPMTKYYRFNGEDKELSEESRGERFIYPIGS 1085

Query: 696  -----LLDDQTFEFISTYPLDTFE-----YGCSILSCSFSDDSN---VYYCVGTAYVLPE 742
                 L+  +T+E +    + TFE         I+  S+    +    Y C+GT +    
Sbjct: 1086 QFEMVLISPETWEIVPDASI-TFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNY-S 1143

Query: 743  ENEPTKGRILVF-----IVEDGK------LQLIAEKETKGAVYSLNAFNGKLLAAINQKI 791
            E+  ++G I ++     + E GK      ++ I +KE KG V +++   G L+  + QKI
Sbjct: 1144 EDITSRGNIHIYDIIEVVPEPGKPMTKFKIKEIFKKEQKGPVSAISDVLGFLVTGLGQKI 1203

Query: 792  QLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEE 851
              Y W LRD    +L        +I    + T    I + D+ KSISLL ++ E   +  
Sbjct: 1204 --YIWQLRDG---DLIGVAFIDTNIYVHQIITVKSLIFIADVYKSISLLRFQEEYRTLSL 1258

Query: 852  RARDYNANWMSAVEILDDDIYLG-----AENNFNLFTVRKNSEGATDEERGRLEVVGEYH 906
             +RD+N   +  +E + D+  LG     AE N  ++  +  +  +   ++  L    +YH
Sbjct: 1259 ASRDFNPLEVYGIEFMVDNSNLGFLVTDAERNLIVYMYQPEARESLGGQK--LLRKADYH 1316

Query: 907  LGEFVNR-FR----HGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQ 961
            LG+ VN  FR       L  R P     +   V++GT++G +G    LP + Y     LQ
Sbjct: 1317 LGQVVNTMFRVQCHQKGLHQRQPFLYENK-HFVVYGTLDGALGYCLPLPEKVYRRFLMLQ 1375

Query: 962  TNLRKVIKGVGGLNHEQWRSFNNEKKT--VDAKNFLDGDLIESFLDLSRTRMDEISKTMN 1019
              L    + + GLN +++R+  + KK     ++  +DGDLI S+  ++ +  +E++K + 
Sbjct: 1376 NVLLSYQEHLCGLNPKEYRTLKSSKKQGINPSRCIIDGDLIWSYRLMANSERNEVAKKIG 1435

Query: 1020 VSVEELCKRVEELTRL 1035
               EE+   + E+ RL
Sbjct: 1436 TRTEEILGDLLEIERL 1451



 Score = 56.6 bits (135), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 72/332 (21%), Positives = 136/332 (40%), Gaps = 65/332 (19%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
           +YG + +L+     G  +D L I+ +  K  VLQ D ++  L T ++    +   R    
Sbjct: 76  LYGNVMSLQCVSLAGAMRDALLISFKDAKLSVLQHDPDTFALKTLSLHYFEEDDIRGGWT 135

Query: 66  GQIGI----IDPDCRLIGLHLYDGLFKVIPFDNKGQLKE--------------------- 100
           G+  +    +DPD R   + +Y     V+PF     L E                     
Sbjct: 136 GRYFVPTVRVDPDSRCAVMLVYGKRLVVLPFRKDNSLDEIELADVKPIKKAPTAMVSRTP 195

Query: 101 ---AFNIRLEEL-----QVLDIKFLYGCAKPTIVVLYQDNKDAR-HVKTYE-------VA 144
              ++ I L +L      VLDI+FL+G  +PT+++LY+  +     +K          ++
Sbjct: 196 IMASYLIALRDLDEKIDNVLDIQFLHGYYEPTLLILYEPVRTCPGRIKVRSDTCVLVAIS 255

Query: 145 LKDKDFVEG-PWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVY-------------CSA 190
           L  +  V    W+ N+L      + P+  P+ G L++    ++Y              SA
Sbjct: 256 LNIQQRVHPIIWTVNSLPFDCLQVYPIQKPIGGCLVMTVNAVIYLNQSVPPYGVSLNSSA 315

Query: 191 NAFKAIPIRP------SITKA-YGRVDADGSRYLLGDHAGLLHLLVITHEKEK-VTGLKI 242
           +   A P++P      S+  A +  +D D  + ++    G L++L +  +  + V     
Sbjct: 316 DNSTAFPLKPQDGVRISLDCANFAFIDVD--KLVISLRTGDLYVLTLCVDSMRTVRNFHF 373

Query: 243 ELLGETSIASTISYLDNAVVYIGSSYGDSQLI 274
                + + S I  L +  +++GS  G+S L+
Sbjct: 374 HKAAASVLTSCICVLHSEYIFLGSRLGNSLLL 405



 Score = 42.0 bits (97), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 43/159 (27%), Positives = 71/159 (44%), Gaps = 26/159 (16%)

Query: 288 EVLERYVNLGPIVDFCV---VDLERQG-------------QGQVVTCSGAYKDGSLRIVR 331
           EV +  +N+ PI   C    V+ E  G             + ++V  +G  K+G+L +  
Sbjct: 485 EVCDSLMNVAPINYMCAGERVEFEEDGVTLRPHAESLQDLKIELVAATGHSKNGALSVFV 544

Query: 332 NGIGINEQASVELQGIKGMWSL------RSSTDDPFDTFLVVSFISETRILAMNLEDELE 385
           N +      S EL G   +W++      +SS +D  D F+ +S  + T +L      E+ 
Sbjct: 545 NCLNPQIITSFELDGCLDVWTVFDDATKKSSRNDQHD-FMFLSQRNSTLVLQTG--QEIN 601

Query: 386 ETEIEGFCSQTQTLFCHDAIYNQ-LVQVTSGSVRLVSST 423
           E E  GF     T+F  +    + +VQVT+  VRL+  T
Sbjct: 602 EIENTGFTVNQPTIFVGNLGQQRFIVQVTTRHVRLLQGT 640


>gi|45552619|ref|NP_995833.1| cleavage and polyadenylation specificity factor 160, isoform A
            [Drosophila melanogaster]
 gi|18203551|sp|Q9V726.1|CPSF1_DROME RecName: Full=Cleavage and polyadenylation specificity factor subunit
            1; AltName: Full=Cleavage and polyadenylation specificity
            factor 160 kDa subunit; Short=CPSF 160 kDa subunit;
            Short=dCPSF 160
 gi|7303176|gb|AAF58240.1| cleavage and polyadenylation specificity factor 160, isoform A
            [Drosophila melanogaster]
          Length = 1455

 Score = 88.2 bits (217), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 108/436 (24%), Positives = 198/436 (45%), Gaps = 64/436 (14%)

Query: 654  IRSIPLGEHPRRICHQEQSRTFAICSL-------------KNQSCAEESE-MHFVR---- 695
            +R +PL   PR++ +  ++R + + +              +++  +EES    F+     
Sbjct: 1026 VRKVPLRCTPRQLVYHRENRVYCLITQTEEPMTKYYRFNGEDKELSEESRGERFIYPIGS 1085

Query: 696  -----LLDDQTFEFISTYPLDTFE-----YGCSILSCSFSDDSN---VYYCVGTAYVLPE 742
                 L+  +T+E +    + TFE         I+  S+    +    Y C+GT +    
Sbjct: 1086 QFEMVLISPETWEIVPDASI-TFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNY-S 1143

Query: 743  ENEPTKGRILVF-----IVEDGK------LQLIAEKETKGAVYSLNAFNGKLLAAINQKI 791
            E+  ++G I ++     + E GK      ++ I +KE KG V +++   G L+  + QKI
Sbjct: 1144 EDITSRGNIHIYDIIEVVPEPGKPMTKFKIKEIFKKEQKGPVSAISDVLGFLVTGLGQKI 1203

Query: 792  QLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEE 851
              Y W LRD    +L        +I    + T    I + D+ KSISLL ++ E   +  
Sbjct: 1204 --YIWQLRDG---DLIGVAFIDTNIYVHQIITVKSLIFIADVYKSISLLRFQEEYRTLSL 1258

Query: 852  RARDYNANWMSAVEILDDDIYLG-----AENNFNLFTVRKNSEGATDEERGRLEVVGEYH 906
             +RD+N   +  +E + D+  LG     AE N  ++  +  +  +   ++  L    +YH
Sbjct: 1259 ASRDFNPLEVYGIEFMVDNSNLGFLVTDAERNIIVYMYQPEARESLGGQK--LLRKADYH 1316

Query: 907  LGEFVNR-FR----HGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQ 961
            LG+ VN  FR       L  R P     +   V++GT++G +G    LP + Y     LQ
Sbjct: 1317 LGQVVNTMFRVQCHQKGLHQRQPFLYENK-HFVVYGTLDGALGYCLPLPEKVYRRFLMLQ 1375

Query: 962  TNLRKVIKGVGGLNHEQWRSFNNEKK--TVDAKNFLDGDLIESFLDLSRTRMDEISKTMN 1019
              L    + + GLN +++R+  + KK     ++  +DGDLI S+  ++ +  +E++K + 
Sbjct: 1376 NVLLSYQEHLCGLNPKEYRTLKSSKKQGINPSRCIIDGDLIWSYRLMANSERNEVAKKIG 1435

Query: 1020 VSVEELCKRVEELTRL 1035
               EE+   + E+ RL
Sbjct: 1436 TRTEEILGDLLEIERL 1451



 Score = 56.6 bits (135), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 72/332 (21%), Positives = 136/332 (40%), Gaps = 65/332 (19%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
           +YG + +L+     G  +D L I+ +  K  VLQ D ++  L T ++    +   R    
Sbjct: 76  LYGNVMSLQCVSLAGAMRDALLISFKDAKLSVLQHDPDTFALKTLSLHYFEEDDIRGGWT 135

Query: 66  GQIGI----IDPDCRLIGLHLYDGLFKVIPFDNKGQLKE--------------------- 100
           G+  +    +DPD R   + +Y     V+PF     L E                     
Sbjct: 136 GRYFVPTVRVDPDSRCAVMLVYGKRLVVLPFRKDNSLDEIELADVKPIKKAPTAMVSRTP 195

Query: 101 ---AFNIRLEEL-----QVLDIKFLYGCAKPTIVVLYQDNKDAR-HVKTYE-------VA 144
              ++ I L +L      VLDI+FL+G  +PT+++LY+  +     +K          ++
Sbjct: 196 IMASYLIALRDLDEKIDNVLDIQFLHGYYEPTLLILYEPVRTCPGRIKVRSDTCVLVAIS 255

Query: 145 LKDKDFVEG-PWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVY-------------CSA 190
           L  +  V    W+ N+L      + P+  P+ G L++    ++Y              SA
Sbjct: 256 LNIQQRVHPIIWTVNSLPFDCLQVYPIQKPIGGCLVMTVNAVIYLNQSVPPYGVSLNSSA 315

Query: 191 NAFKAIPIRP------SITKA-YGRVDADGSRYLLGDHAGLLHLLVITHEKEK-VTGLKI 242
           +   A P++P      S+  A +  +D D  + ++    G L++L +  +  + V     
Sbjct: 316 DNSTAFPLKPQDGVRISLDCANFAFIDVD--KLVISLRTGDLYVLTLCVDSMRTVRNFHF 373

Query: 243 ELLGETSIASTISYLDNAVVYIGSSYGDSQLI 274
                + + S I  L +  +++GS  G+S L+
Sbjct: 374 HKAAASVLTSCICVLHSEYIFLGSRLGNSLLL 405



 Score = 43.5 bits (101), Expect = 0.54,   Method: Compositional matrix adjust.
 Identities = 44/159 (27%), Positives = 72/159 (45%), Gaps = 26/159 (16%)

Query: 288 EVLERYVNLGPIVDFCV---VDLERQG-------------QGQVVTCSGAYKDGSLRIVR 331
           EV +  +N+ PI   C    V+ E  G             + ++V  +G  K+G+L +  
Sbjct: 485 EVCDSLMNVAPINYMCAGERVEFEEDGVTLRPHAESLQDLKIELVAATGHSKNGALSVFV 544

Query: 332 NGIGINEQASVELQGIKGMWSL------RSSTDDPFDTFLVVSFISETRILAMNLEDELE 385
           N I      S EL G   +W++      +SS +D  D F+++S  + T +L      E+ 
Sbjct: 545 NCINPQIITSFELDGCLDVWTVFDDATKKSSRNDQHD-FMLLSQRNSTLVLQTG--QEIN 601

Query: 386 ETEIEGFCSQTQTLFCHDAIYNQ-LVQVTSGSVRLVSST 423
           E E  GF     T+F  +    + +VQVT+  VRL+  T
Sbjct: 602 EIENTGFTVNQPTIFVGNLGQQRFIVQVTTRHVRLLQGT 640


>gi|397615212|gb|EJK63291.1| hypothetical protein THAOC_16062, partial [Thalassiosira oceanica]
          Length = 322

 Score = 88.2 bits (217), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 80/305 (26%), Positives = 134/305 (43%), Gaps = 41/305 (13%)

Query: 751  ILVFIVEDGKLQLIAEKETK------GAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTR 804
            +L  +V   +LQL+   +        G V +L  F G+LL  I + ++LY+      G R
Sbjct: 29   VLYRVVSGERLQLLHRTKVDDGSSGGGPVLALVHFQGRLLVGIGKSLRLYEM-----GKR 83

Query: 805  ELQSECGHHG-HILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSA 863
            +L  +C   G   +   +Q  GD   VGD+M+S+  + Y      +   ARD +A  ++ 
Sbjct: 84   QLLKKCELRGLPTMVKTLQAAGDRAFVGDMMQSMQFVRYDATANRLVLVARDRSARPITC 143

Query: 864  VEILDDDIYLGAENNFNLFTV---RKNSEGATD-------------EERGRLEVVGEYHL 907
             E+LD +     +   N+ T+   R    GA D             +   +LE +  YH+
Sbjct: 144  QELLDVNTVAVGDKFGNVTTLRLPRGADTGAVDVSGTRALWDSSREDATPKLETLCTYHV 203

Query: 908  GEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRK 966
            GE V      SLV        G   ++I+ TV G IG +      E   F   L++++R 
Sbjct: 204  GEVVTSLTRASLV-------AGGAESLIYVTVTGRIGALVPFTSREDVEFYTSLESHVRS 256

Query: 967  VIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELC 1026
             +    G + + +RSF         K+ +DGDL E++  L     + I+  M  S  E+ 
Sbjct: 257  EVPRPTGRDPQSYRSFY-----CPVKHVIDGDLCEAYGGLPYEARERIADQMERSTGEVM 311

Query: 1027 KRVEE 1031
            K++E+
Sbjct: 312  KKLED 316


>gi|397627714|gb|EJK68584.1| hypothetical protein THAOC_10223, partial [Thalassiosira oceanica]
          Length = 456

 Score = 88.2 bits (217), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 80/305 (26%), Positives = 134/305 (43%), Gaps = 41/305 (13%)

Query: 751  ILVFIVEDGKLQLIAEKETK------GAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTR 804
            +L  +V   +LQL+   +        G V +L  F G+LL  I + ++LY+      G R
Sbjct: 163  VLYRVVSGERLQLLHRTKVDDGSSGGGPVLALVHFQGRLLVGIGKSLRLYEM-----GKR 217

Query: 805  ELQSECGHHG-HILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSA 863
            +L  +C   G   +   +Q  GD   VGD+M+S+  + Y      +   ARD +A  ++ 
Sbjct: 218  QLLKKCELRGLPTMVKTLQAAGDRAFVGDMMQSMQFVRYDATANRLVLVARDRSARPITC 277

Query: 864  VEILDDDIYLGAENNFNLFTV---RKNSEGATD-------------EERGRLEVVGEYHL 907
             E+LD +     +   N+ T+   R    GA D             +   +LE +  YH+
Sbjct: 278  QELLDVNTVAVGDKFGNVTTLRLPRGADTGAVDVSGTRALWDSSREDATPKLETLCTYHV 337

Query: 908  GEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRK 966
            GE V      SLV        G   ++I+ TV G IG +      E   F   L++++R 
Sbjct: 338  GEVVTSLTRASLV-------AGGAESLIYVTVTGRIGALVPFTSREDVEFYTSLESHVRS 390

Query: 967  VIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELC 1026
             +    G + + +RSF         K+ +DGDL E++  L     + I+  M  S  E+ 
Sbjct: 391  EVPRPTGRDPQSYRSFY-----CPVKHVIDGDLCEAYGGLPYEARERIADQMERSTGEVM 445

Query: 1027 KRVEE 1031
            K++E+
Sbjct: 446  KKLED 450


>gi|422294117|gb|EKU21417.1| uv-damaged dna-binding protein [Nannochloropsis gaditana CCMP526]
          Length = 192

 Score = 88.2 bits (217), Expect = 2e-14,   Method: Composition-based stats.
 Identities = 48/106 (45%), Positives = 64/106 (60%), Gaps = 4/106 (3%)

Query: 915  RHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGL 974
            R G  V R    D+G    ++FGT  G+IG I  +  E Y F   L   L KV+KGVGGL
Sbjct: 29   RSGGEVARGHVQDLG----LMFGTQQGMIGSILPISEEDYRFFVALTKCLNKVVKGVGGL 84

Query: 975  NHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV 1020
            +HE++R F  +K   D + F+DGDLIESFL+L   RM+E+ + M V
Sbjct: 85   SHEEYRRFLTDKAISDTQGFVDGDLIESFLELPTQRMEEVVELMRV 130


>gi|24653655|ref|NP_725397.1| cleavage and polyadenylation specificity factor 160, isoform B
            [Drosophila melanogaster]
 gi|15292103|gb|AAK93320.1| LD38533p [Drosophila melanogaster]
 gi|21627189|gb|AAM68553.1| cleavage and polyadenylation specificity factor 160, isoform B
            [Drosophila melanogaster]
          Length = 1420

 Score = 87.8 bits (216), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 108/436 (24%), Positives = 198/436 (45%), Gaps = 64/436 (14%)

Query: 654  IRSIPLGEHPRRICHQEQSRTFAICSL-------------KNQSCAEESE-MHFVR---- 695
            +R +PL   PR++ +  ++R + + +              +++  +EES    F+     
Sbjct: 991  VRKVPLRCTPRQLVYHRENRVYCLITQTEEPMTKYYRFNGEDKELSEESRGERFIYPIGS 1050

Query: 696  -----LLDDQTFEFISTYPLDTFE-----YGCSILSCSFSDDSN---VYYCVGTAYVLPE 742
                 L+  +T+E +    + TFE         I+  S+    +    Y C+GT +    
Sbjct: 1051 QFEMVLISPETWEIVPDASI-TFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNY-S 1108

Query: 743  ENEPTKGRILVF-----IVEDGK------LQLIAEKETKGAVYSLNAFNGKLLAAINQKI 791
            E+  ++G I ++     + E GK      ++ I +KE KG V +++   G L+  + QKI
Sbjct: 1109 EDITSRGNIHIYDIIEVVPEPGKPMTKFKIKEIFKKEQKGPVSAISDVLGFLVTGLGQKI 1168

Query: 792  QLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEE 851
              Y W LRD    +L        +I    + T    I + D+ KSISLL ++ E   +  
Sbjct: 1169 --YIWQLRDG---DLIGVAFIDTNIYVHQIITVKSLIFIADVYKSISLLRFQEEYRTLSL 1223

Query: 852  RARDYNANWMSAVEILDDDIYLG-----AENNFNLFTVRKNSEGATDEERGRLEVVGEYH 906
             +RD+N   +  +E + D+  LG     AE N  ++  +  +  +   ++  L    +YH
Sbjct: 1224 ASRDFNPLEVYGIEFMVDNSNLGFLVTDAERNIIVYMYQPEARESLGGQK--LLRKADYH 1281

Query: 907  LGEFVNR-FR----HGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQ 961
            LG+ VN  FR       L  R P     +   V++GT++G +G    LP + Y     LQ
Sbjct: 1282 LGQVVNTMFRVQCHQKGLHQRQPFLYENK-HFVVYGTLDGALGYCLPLPEKVYRRFLMLQ 1340

Query: 962  TNLRKVIKGVGGLNHEQWRSFNNEKKT--VDAKNFLDGDLIESFLDLSRTRMDEISKTMN 1019
              L    + + GLN +++R+  + KK     ++  +DGDLI S+  ++ +  +E++K + 
Sbjct: 1341 NVLLSYQEHLCGLNPKEYRTLKSSKKQGINPSRCIIDGDLIWSYRLMANSERNEVAKKIG 1400

Query: 1020 VSVEELCKRVEELTRL 1035
               EE+   + E+ RL
Sbjct: 1401 TRTEEILGDLLEIERL 1416



 Score = 52.4 bits (124), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 68/324 (20%), Positives = 126/324 (38%), Gaps = 84/324 (25%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
           +YG + +L+     G  +D L I+ +  K  VLQ D ++  L T ++    +   R    
Sbjct: 76  LYGNVMSLQCVSLAGAMRDALLISFKDAKLSVLQHDPDTFALKTLSLHYFEEDDIRGGWT 135

Query: 66  GQIGI----IDPDCRLIGLHLYDGLFKVIPFDNKGQLKE--------------------- 100
           G+  +    +DPD R   + +Y     V+PF     L E                     
Sbjct: 136 GRYFVPTVRVDPDSRCAVMLVYGKRLVVLPFRKDNSLDEIELADVKPIKKAPTAMVSRTP 195

Query: 101 ---AFNIRLEEL-----QVLDIKFLYGCAKPTIVVLYQDNKDAR-HVKTYEVALKDKDFV 151
              ++ I L +L      VLDI+FL+G  +PT+++LY+  +     +K Y          
Sbjct: 196 IMASYLIALRDLDEKIDNVLDIQFLHGYYEPTLLILYEPVRTCPGRIKVY---------- 245

Query: 152 EGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVY-------------CSANAFKAIPI 198
                            P+  P+ G L++    ++Y              SA+   A P+
Sbjct: 246 -----------------PIQKPIGGCLVMTVNAVIYLNQSVPPYGVSLNSSADNSTAFPL 288

Query: 199 RP------SITKA-YGRVDADGSRYLLGDHAGLLHLLVITHEKEK-VTGLKIELLGETSI 250
           +P      S+  A +  +D D  + ++    G L++L +  +  + V          + +
Sbjct: 289 KPQDGVRISLDCANFAFIDVD--KLVISLRTGDLYVLTLCVDSMRTVRNFHFHKAAASVL 346

Query: 251 ASTISYLDNAVVYIGSSYGDSQLI 274
            S I  L +  +++GS  G+S L+
Sbjct: 347 TSCICVLHSEYIFLGSRLGNSLLL 370



 Score = 43.9 bits (102), Expect = 0.54,   Method: Compositional matrix adjust.
 Identities = 44/159 (27%), Positives = 72/159 (45%), Gaps = 26/159 (16%)

Query: 288 EVLERYVNLGPIVDFCV---VDLERQG-------------QGQVVTCSGAYKDGSLRIVR 331
           EV +  +N+ PI   C    V+ E  G             + ++V  +G  K+G+L +  
Sbjct: 450 EVCDSLMNVAPINYMCAGERVEFEEDGVTLRPHAESLQDLKIELVAATGHSKNGALSVFV 509

Query: 332 NGIGINEQASVELQGIKGMWSL------RSSTDDPFDTFLVVSFISETRILAMNLEDELE 385
           N I      S EL G   +W++      +SS +D  D F+++S  + T +L      E+ 
Sbjct: 510 NCINPQIITSFELDGCLDVWTVFDDATKKSSRNDQHD-FMLLSQRNSTLVLQTG--QEIN 566

Query: 386 ETEIEGFCSQTQTLFCHDAIYNQ-LVQVTSGSVRLVSST 423
           E E  GF     T+F  +    + +VQVT+  VRL+  T
Sbjct: 567 EIENTGFTVNQPTIFVGNLGQQRFIVQVTTRHVRLLQGT 605


>gi|428182571|gb|EKX51431.1| hypothetical protein GUITHDRAFT_134365 [Guillardia theta CCMP2712]
          Length = 1295

 Score = 87.8 bits (216), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 115/493 (23%), Positives = 205/493 (41%), Gaps = 80/493 (16%)

Query: 269 GDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQGQGQ---VVTCSGAYKDG 325
           GDS ++ + L  +     +E LER   LG   DF V+  +    G    V+ CSG  K  
Sbjct: 439 GDSAVMMMRL--NGNEIEIETLERLTMLGVTTDFTVLSDKNVNGGDEFDVLVCSGTGKHA 496

Query: 326 SLRIVRNGIGINEQASVE---LQGIKGMWSLRSSTDDPFD-TFLVVSFISETRILAMNLE 381
           S+R +R G+ +      E     GI G++++  + +   D +FL ++F +  R+L +   
Sbjct: 497 SIRRLRYGLPVETHVRSEKGLCDGIVGLYTI--TFEQKHDQSFLFMAFTTGCRVLTVG-- 552

Query: 382 DELEE-TEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSS------------TSRELR 428
            EL + T+  G      TL         L+QV+   V ++               S+  R
Sbjct: 553 SELSDVTDDLGLNPTALTLHAAANSDGHLIQVSDEEVIVIQQPVLDSMDLDVIPCSQPER 612

Query: 429 NEWKSPPGYSVNVATANASQVLLATGGGHLVYL-----EIGDG----ILTEVKHAQLEYE 479
             W+      ++V+ +  + V+  T GG  +++     + G      I  E+   +   E
Sbjct: 613 AVWRPAESTKISVSASLGNLVMCCTCGGTSIHILDTTRDQGSAGEFQIRGEITSFRSSEE 672

Query: 480 ISCLDINPIGENPSYS-QIAAVGMWTDI------SVRIFSLP------------DLNLIT 520
           +SC+ +  +G + S + +I+ +G + D        ++I+SL             DL + +
Sbjct: 673 VSCMAMTSLGAHGSETRKISIIGTYADALKARAGKIKIYSLGEEKREVELAQSIDLGIYS 732

Query: 521 KEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGEL----TDRKKVSLGTQ 576
              L G +     +L    G  +L+  L +G ++ F LN  +  L    T R+   L   
Sbjct: 733 GSDLSGSVPQSFGILPEPAGGVFLVVGLRNGKVIIFTLNALSLPLVGLGTTRR---LAES 789

Query: 577 PITLRTFSSKNTTHVFA-ASDRPTVIYSSNKKLLYSNVNLKE--VSHMCPFNSAAFPDSL 633
           PIT  + +S     + A   D   ++  S   L +  +  +   +SH  PF   A P  +
Sbjct: 790 PITFVSITSYAGPSLIALVEDVAFLVTPSRASLQFQRIGFQPGGISHAAPFICDACPLGM 849

Query: 634 AIAKEGELTIGTIDDIQKLHIRSIPLG---EH------------PRRIC-HQEQSRTFAI 677
           AI  +G+L + +ID  +KL +++IPL    EH            P R+  H +       
Sbjct: 850 AIVVDGQLKLVSIDRCEKLDVKTIPLALESEHTESESKLEDNFMPARVVWHPDAKLVIVG 909

Query: 678 CSLKNQSCAEESE 690
           C+   + C  E E
Sbjct: 910 CNYSRKVCGAEEE 922


>gi|395740218|ref|XP_002819588.2| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
            [Pongo abelii]
          Length = 1388

 Score = 87.8 bits (216), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 95/368 (25%), Positives = 163/368 (44%), Gaps = 46/368 (12%)

Query: 688  ESEMHFVRLLDDQTFEFI--STYPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVL 740
            + E   ++L+   ++E I  +   L  +E+   + + S   +  V     Y   GT  + 
Sbjct: 1015 QQEAFSIQLISPVSWEAIPNARIELQEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQ 1074

Query: 741  PEENEPTKGRILVF-----IVEDG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQ 789
             EE    +GRIL+      + E G      K +++ EKE KG V +L   NG L++AI Q
Sbjct: 1075 GEEVT-CRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQ 1133

Query: 790  KIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAI 849
            KI  + W LR     EL         +    + +  +FI+  D+MKSISLL Y+ E   +
Sbjct: 1134 KI--FLWSLR---ASELTGMAFIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTL 1188

Query: 850  EERARDYNANWMSAVEILDDDIYLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYH 906
               +RD     + +V+ + D+  LG   ++ + NL       E        RL    ++H
Sbjct: 1189 SLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFH 1248

Query: 907  LGEFVNRFRHGSLVMRLPDSDVGQ---IPTVIFG--------TVNGVIGVIASLPHEQYL 955
            +G  VN F       R P     +     +V++         +V G IG++  +  + Y 
Sbjct: 1249 VGAHVNTF------WRTPCRGAAEGLSKKSVVWENKHITWLVSVRGGIGLLLPMQEKTYR 1302

Query: 956  FLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVD--AKNFLDGDLIESFLDLSRTRMDE 1013
             L  LQ  L  ++    GLN   +R  + +++T+    +N LDG+L+  +L LS     E
Sbjct: 1303 RLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRTLQNAVRNVLDGELLNRYLYLSTMERSE 1362

Query: 1014 ISKTMNVS 1021
            ++K +  +
Sbjct: 1363 LAKKIGTT 1370



 Score = 40.4 bits (93), Expect = 5.9,   Method: Compositional matrix adjust.
 Identities = 41/172 (23%), Positives = 74/172 (43%), Gaps = 36/172 (20%)

Query: 288 EVLERYVNLGPIVDFCVVD---LERQGQG------QVVTCSGAYKDGSLRIVRNGIGINE 338
           EV +  +N+GP  +  + +   L  + Q       ++V CSG  K+G+L +++  I    
Sbjct: 412 EVCDSILNIGPCANAAMGEPAFLSEEFQNSPEPDLEIVVCSGHGKNGALSVLQKSIRPQV 471

Query: 339 QASVELQGIKGMWSL-----RSSTDDP-------------------FDTFLVVSFISETR 374
             + EL G   MW++     +   D+P                      FL++S    T 
Sbjct: 472 VTTFELPGCYDMWTVIAPLRKEEEDNPKGEGTEQEPSTPEADDDGRRHGFLILSREDSTM 531

Query: 375 ILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQ-LVQVTSGSVRLVSSTSR 425
           IL      E+ E +  GF +Q  T+F  +   N+ +VQV+   +RL+   ++
Sbjct: 532 ILQTG--QEIMELDTSGFATQGPTVFAGNIGDNRYIVQVSPLGIRLLEGVNQ 581


>gi|193702313|ref|XP_001945086.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Acyrthosiphon pisum]
          Length = 1335

 Score = 87.8 bits (216), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 89/334 (26%), Positives = 148/334 (44%), Gaps = 43/334 (12%)

Query: 731  YYCVGTAYVLPEENEPTKGRILVFIVED-----------GKLQLIAEKETKGAVYSLNAF 779
            Y  +GT Y    E+  ++GRI +F + D            K+++I  KE KG V ++   
Sbjct: 1010 YIAMGTNYSY-SEDITSRGRIFLFDIIDVVPEPGKPLTKNKIKMIYAKEQKGPVTAITHV 1068

Query: 780  NGKLLAAINQKIQLYKWMLRDD---GTRELQSECGHHGHILALYVQTRGDFIVVGDLMKS 836
             G L+ A+ QKI  Y W L+D+   G   + +E   H  +L++        I+V DL KS
Sbjct: 1069 VGFLVTAVGQKI--YIWQLKDNDLIGIAFIDTEVYVH-QMLSI-----KSLILVADLFKS 1120

Query: 837  ISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI---YLGAENNFNLFTVRKNSEGATD 893
            I+LL ++ E   +    RD     +  +  L D+    +L ++ + NL            
Sbjct: 1121 ITLLRFQEEYRTLSLVCRDSKPLEVFDINFLIDNTELGFLASDRDQNLLLYLYQPMARES 1180

Query: 894  EERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVG----------QIPTVIFGTVNGVI 943
                 L   G++++G  VN F      +R   S V           +    ++ T++G I
Sbjct: 1181 YGGQHLVRRGDFNIGSNVNSF----FRLRCKQSTVAPDRREAIGSDKRHVTMYTTLDGSI 1236

Query: 944  GVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSF--NNEKKTVDAKNFLDGDLIE 1001
            G I  +  + Y  L  LQ  L K I  + GLN + +RSF     ++   A+  +DG+L+ 
Sbjct: 1237 GYIVPIHEKNYRRLLTLQNMLVKNITHLAGLNPKAYRSFKATAPERMNQARRVIDGELVW 1296

Query: 1002 SFLDLSRTRM-DEISKTMNVSVEELCKRVEELTR 1034
             F+     R  +EI+  + V   EL + + EL R
Sbjct: 1297 MFVTCMNARQRNEIANKVGVKTIELLQDIYELDR 1330



 Score = 69.7 bits (169), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 114/553 (20%), Positives = 228/553 (41%), Gaps = 117/553 (21%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
           ++G I  L+       + D L ++    KF ++++D +   L T ++    D   +   N
Sbjct: 65  LFGNIMCLQSVTLCPSSPDALLLSFSEAKFSLVEYDRDMHSLRTLSLHYFEDDKFK---N 121

Query: 66  GQ-------IGIIDPDCRLIGLHLYDGLFKVIPF-----DN--KGQLKEAFNIRLEEL-- 109
           G        +  +DPD R +   +Y   F V+PF     DN    Q+  ++ I + ++  
Sbjct: 122 GHTQHWSPPLIRVDPDGRCVVGLVYGSYFVVLPFGRTIDDNAKSAQVMPSYTIPISKIDP 181

Query: 110 ---QVLDIKFLYGCAKPTIVVLYQDNKDARHVKTY--------------EVALKDKDFVE 152
               ++D  FL+G  +PT+++LY+       VKT+               ++L  +  V 
Sbjct: 182 KMNNIMDFDFLHGYYEPTLLILYEP------VKTFAGRIAVRKDTCAMVAISLNIQQHVH 235

Query: 153 GP-WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGR--- 208
              WS ++L      +I V  P+ GVLI+   +++Y + +         SI K       
Sbjct: 236 PVIWSLDSLPYDCQKVIAVSRPIGGVLIMAVNSLIYLNQSVPPFGVALNSIAKTLTNFPL 295

Query: 209 ---------VDADGSRYLLGDHA------GLLHLLVITHEKEK-VTGLKIELLGETSIAS 252
                    +D   + ++  D        G L+++ +  +  + V     E    + + +
Sbjct: 296 GQQEDINLVLDRATATFISSDKLVTSLCNGDLYVITLYADSMRAVRSFHFEKCASSVLTT 355

Query: 253 TISYLDNAVVYIGSSYGDSQLIKL-------NLQPDAK-----------------GSYV- 287
            I+   ++ +++GS  G+S L++        + +P  K                 GS V 
Sbjct: 356 CITVCLDSYLFLGSRLGNSLLLRYYARSQSNDDEPSIKRKKTDETDEDLVELEVYGSEVQ 415

Query: 288 ----------EVLERYVNLGPIVDFCVVDL--------ERQGQGQVVTCSGAYKDGSLRI 329
                     EV +  +N+GP     + +           +   +++  SG  K+G+L +
Sbjct: 416 TSICLESYSFEVCDSIINIGPCSQASIGEPAYISDEFSSDEHDVELLCTSGHGKNGALSV 475

Query: 330 VRNGIGINEQASVELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEI 389
           +   I      +  L G K MW++    D  F TF++++ +  T IL      E+ E + 
Sbjct: 476 LHRSIKPQLVTTFHLDGYKDMWTVHGEND--FHTFMILTNVDSTLILQTG--QEINELDS 531

Query: 390 EGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRN---EWKSPPGYSVNVATANA 446
            G+ ++  T+F  + +   ++QV   SVRL++  S +L++   ++ SP    ++ ++ N 
Sbjct: 532 SGYATREHTVFVCN-MNKFVIQVLRYSVRLLNG-SEQLQSVSLDFGSP---IIHGSSCNP 586

Query: 447 SQVLLATGGGHLV 459
             VLL   G  +V
Sbjct: 587 YAVLLTEDGQVIV 599


>gi|308449821|ref|XP_003088088.1| hypothetical protein CRE_07177 [Caenorhabditis remanei]
 gi|308249634|gb|EFO93586.1| hypothetical protein CRE_07177 [Caenorhabditis remanei]
          Length = 236

 Score = 87.4 bits (215), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 54/188 (28%), Positives = 99/188 (52%), Gaps = 6/188 (3%)

Query: 3   DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRP 62
           ++PIYG++ T+ L R   E +  L + TE+++  VL +     ++ITR  G ++D+ GR 
Sbjct: 54  EIPIYGQVLTMALVRCKREKRQSLVVVTEKWQMAVLTY--RDGKVITRTAGALADQSGRA 111

Query: 63  TDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL-YGCA 121
           +DN    +      L+ +  Y+G  K+I ++  G    +FN+R +   V D KF+  G  
Sbjct: 112 SDN-LFSLTIHRSGLVAIRAYEGSVKMIQWE-PGTDVRSFNVRFDYPNVSDFKFIDTGVD 169

Query: 122 KPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIG 181
               +    D+   +H++  ++ + DK+       Q ++   A +LIPVP P+ GVL++ 
Sbjct: 170 DTYRIAFIYDDDHGKHLQFSDLNMHDKEL-HTFSRQASIAADASVLIPVPAPISGVLVLA 228

Query: 182 EETIVYCS 189
             +I+Y S
Sbjct: 229 ANSILYKS 236


>gi|195485994|ref|XP_002091320.1| GE12310 [Drosophila yakuba]
 gi|194177421|gb|EDW91032.1| GE12310 [Drosophila yakuba]
          Length = 1455

 Score = 87.4 bits (215), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 106/435 (24%), Positives = 200/435 (45%), Gaps = 62/435 (14%)

Query: 654  IRSIPLGEHPRRICHQEQSRTFAICSL-------------KNQSCAEESE-MHFVR---- 695
            +R +PL   PR++ +  ++R + + +              +++  +EES    F+     
Sbjct: 1026 VRKVPLRCTPRQLVYHRENRVYCLITQTEEPMTKYYRFNGEDKELSEESRGERFIYPIGS 1085

Query: 696  -----LLDDQTFEFI--STYPLDTFEY--GCSILSCSFSDDSN---VYYCVGTAYVLPEE 743
                 L+  +T+E +  ++   + +E+     I+  S+    +    Y C+GT +    E
Sbjct: 1086 QFEMVLISPETWEIVPDASISFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNY-SE 1144

Query: 744  NEPTKGRILVF-----IVEDGK------LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQ 792
            +  ++G I ++     + E GK      ++ I +KE KG V +++   G L+  + QKI 
Sbjct: 1145 DITSRGNIHIYDIIEVVPEPGKPMTKFKIKEIFKKEQKGPVSAISDVLGFLVTGLGQKI- 1203

Query: 793  LYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEER 852
             Y W LRD    +L        +I    + T    I + D+ KSISLL ++ E   +   
Sbjct: 1204 -YIWQLRDG---DLIGVAFIDTNIYVHQIITVKSLIFIADVYKSISLLRFQEEYRTLSLA 1259

Query: 853  ARDYNANWMSAVEILDDDIYLG-----AENNFNLFTVRKNSEGATDEERGRLEVVGEYHL 907
            +RD+N   +  +E + D+  LG     AE N  ++  +  +  +   ++  L    +YHL
Sbjct: 1260 SRDFNPLEVYGIEFMVDNSNLGFLVTDAERNLIVYMYQPEARESLGGQK--LLRKADYHL 1317

Query: 908  GEFVNR-FR----HGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQT 962
            G+ VN  FR       L  R P     +   V++GT++G +G    LP + Y     LQ 
Sbjct: 1318 GQVVNTMFRVQCHQKGLHQRQPFLYENK-HFVVYGTLDGALGYCLPLPEKVYRRFLMLQN 1376

Query: 963  NLRKVIKGVGGLNHEQWRSFNNEKKT--VDAKNFLDGDLIESFLDLSRTRMDEISKTMNV 1020
             L    + + GLN +++R+  + KK     ++  +DGDLI S+  ++ +  +E++K +  
Sbjct: 1377 VLLSYQEHLCGLNPKEYRTLKSFKKQGINPSRCVIDGDLIWSYRLMANSERNEVAKKIGT 1436

Query: 1021 SVEELCKRVEELTRL 1035
              EE+   + E+ RL
Sbjct: 1437 RTEEILADLLEIERL 1451



 Score = 56.2 bits (134), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 72/332 (21%), Positives = 136/332 (40%), Gaps = 65/332 (19%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
           +YG + +L+     G  +D L I+ +  K  VLQ D ++  L T ++    +   R    
Sbjct: 76  LYGNVMSLQCVSLAGAMRDALLISFKDAKLSVLQHDPDTFALKTLSLHYFEEDDIRGGWT 135

Query: 66  GQIGI----IDPDCRLIGLHLYDGLFKVIPFDNKGQLKE--------------------- 100
           G+  +    +DPD R   + +Y     V+PF     L E                     
Sbjct: 136 GRYFVPTVRVDPDSRCAVMLVYGKRLVVLPFRKDNSLDEIELADVKPIKKAPTAMVSRTP 195

Query: 101 ---AFNIRLEEL-----QVLDIKFLYGCAKPTIVVLYQDNKDAR-HVKTYE-------VA 144
              ++ I L +L      VLDI+FL+G  +PT+++LY+  +     +K          ++
Sbjct: 196 IMASYLIALRDLDEKIDNVLDIQFLHGYYEPTLLILYEPVRTCPGRIKVRSDTCVLVAIS 255

Query: 145 LKDKDFVEG-PWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVY-------------CSA 190
           L  +  V    W+ N+L      + P+  P+ G L++    ++Y              SA
Sbjct: 256 LNIQQRVHPIIWTVNSLPFDCLQVYPIQKPIGGCLVMTVNAVIYLNQSVPPYGVSLNSSA 315

Query: 191 NAFKAIPIRP------SITKA-YGRVDADGSRYLLGDHAGLLHLLVITHEKEK-VTGLKI 242
           +   A P++P      S+  A +  +D D  + ++    G L++L +  +  + V     
Sbjct: 316 DNSTAFPLKPQDGVRISLDCANFAFIDVD--KLVISLRTGDLYVLTLCVDSMRTVRNFHF 373

Query: 243 ELLGETSIASTISYLDNAVVYIGSSYGDSQLI 274
                + + S I  L +  +++GS  G+S L+
Sbjct: 374 HKAAASVLTSCICVLHSEYIFLGSRLGNSLLL 405



 Score = 43.5 bits (101), Expect = 0.55,   Method: Compositional matrix adjust.
 Identities = 44/159 (27%), Positives = 71/159 (44%), Gaps = 26/159 (16%)

Query: 288 EVLERYVNLGPIVDFCV---VDLERQGQG-------------QVVTCSGAYKDGSLRIVR 331
           EV +  +N+ PI   C    V+ E  G               ++V  +G  K+G+L +  
Sbjct: 485 EVCDSLMNVAPINYMCAGERVEFEEDGATLRPHAESLQDLKIELVAATGHSKNGALSVFV 544

Query: 332 NGIGINEQASVELQGIKGMWSL------RSSTDDPFDTFLVVSFISETRILAMNLEDELE 385
           N I      S EL G   +W++      +SS +D  D F+++S  + T +L      E+ 
Sbjct: 545 NCINPQIITSFELDGCLDVWTVFDDATKKSSRNDQHD-FMLLSQRNSTLVLQTG--QEIN 601

Query: 386 ETEIEGFCSQTQTLFCHDAIYNQ-LVQVTSGSVRLVSST 423
           E E  GF     T+F  +    + +VQVT+  VRL+  T
Sbjct: 602 EIENTGFTVNQPTIFVGNLGQQRFIVQVTTRHVRLLQGT 640


>gi|147860990|emb|CAN78747.1| hypothetical protein VITISV_022228 [Vitis vinifera]
          Length = 1298

 Score = 87.4 bits (215), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 58/209 (27%), Positives = 103/209 (49%), Gaps = 20/209 (9%)

Query: 295 NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINE--QASVELQGIKGMWS 352
           N+ PI+D  VVD   +   Q+  C G   +GSLRI+R+GI + +  + +   QGI G W+
Sbjct: 503 NIAPILDMSVVDCHDEEHDQMFACCGVTPEGSLRIIRSGISVEKLLRTAPIYQGITGTWT 562

Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
           ++    D + +FLV+SF+ ETR+L++ L    + T+  GF     TL C       LVQ+
Sbjct: 563 VKMKVIDSYHSFLVLSFVEETRVLSVGLSFT-DVTDSVGFQPDVSTLACGVVDDGLLVQI 621

Query: 413 TSGSVRLVSSTS----------RELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLE 462
               V+L   T+            +   W  P   S+++     + +++AT     +++ 
Sbjct: 622 HKNGVKLCLPTTVAHPEGIPLASPICTSW-FPENISISLGAVGYNLIVVATSSPCFLFIL 680

Query: 463 IGDGI------LTEVKHAQLEYEISCLDI 485
               +      + E++H +L+ E+SC+ I
Sbjct: 681 GVRSVSAYQYEIYEMQHVRLQNEVSCISI 709



 Score = 56.2 bits (134), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 41/189 (21%), Positives = 85/189 (44%), Gaps = 7/189 (3%)

Query: 573  LGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDS 632
            +G  P+ L   S      + A SDRP ++ S+   L Y++++ +  +H+ P  S   P  
Sbjct: 882  IGITPVFLVPLSDSLEADIIALSDRPWLLQSARHSLSYTSISFQPSTHVTPVCSMECPMG 941

Query: 633  LAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMH 692
            +    E  L +  +   ++L+++   LG  PR++ +  +SR   +        ++++   
Sbjct: 942  ILFVAENSLHLVEMVHSKRLNVQKFYLGGTPRKVLYHSESRLLLV---MRTELSQDTYSS 998

Query: 693  FVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDS----NVYYCVGTAYVLPEENEPTK 748
             +  +D  +   +S++ L+  E G S+      ++           G A +   E E TK
Sbjct: 999  DICCVDPLSGSVLSSFKLELGETGKSMELVRVVNEQVLVIGTSLSSGPAMMPSGEAESTK 1058

Query: 749  GRILVFIVE 757
            GR++V  +E
Sbjct: 1059 GRLIVLCLE 1067


>gi|320169222|gb|EFW46121.1| cleavage and polyadenylation specificity factor 1 [Capsaspora
            owczarzaki ATCC 30864]
          Length = 1725

 Score = 87.4 bits (215), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 116/495 (23%), Positives = 206/495 (41%), Gaps = 70/495 (14%)

Query: 591  VFAASDRPT-VIYSSNKKLLYSNVNLKE--VSHMCPFNSAAFPDSLA-IAKEGELTI--- 643
            VF    RP  ++ S  +K L +++ L +  VS    FN+ A P        +G L     
Sbjct: 1227 VFVCGRRPLWLLMSPTRKALRAHLMLTDGSVSAFSAFNNNACPGGFVYFTTQGTLRFCQL 1286

Query: 644  -GTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICS----------------------- 679
              T +      +R +PL      I + E  RT+ + +                       
Sbjct: 1287 APTTNHDNPWPVRRVPLRATAHYIGYHEVFRTYVLVTSHPKPYFNLPRLTNDETYTPVPY 1346

Query: 680  -LKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNV-----YYC 733
              K ++     +   ++L+   T+E I ++ L  FE   S+   + +    V     Y  
Sbjct: 1347 TPKPRAIPATFDTFSLQLISPVTWESIHSFDLPAFERVTSVDIAAITSQETVTGLKDYVV 1406

Query: 734  VGTAYVLPEENEPTKGRILVFIVED-----------GKLQLIAEKETKGAVYSLNAFNGK 782
            +GT  V+  E+    GRI+VF + D            KL+ + E+E KGA+ +L+   G 
Sbjct: 1407 IGTT-VIEGEDVTCHGRIIVFEIIDVVPEVNRPQTNRKLKYLMEREQKGAITALSHVCGH 1465

Query: 783  LLAAINQKIQLYKWMLRD--DGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLL 840
            L++ I QKI ++++   D  DG   + ++         + V    +FI+VGDL  S+ LL
Sbjct: 1466 LVSCIGQKIIIWQFASDDTMDGVAFIDTQT------FVVSVSAIKNFILVGDLNNSVFLL 1519

Query: 841  IYKHEEGAIEERARDYNANWMSAVEILDDDIYLG---AENNFNLFTVRKNSEGATDEERG 897
             +      +   ARD++   +++ + L D   LG    +++ NL     N          
Sbjct: 1520 RFNETTKHLGFIARDFDHMSVASTQFLVDGSSLGFLATDSHQNLVVFAYNPLNRESNNGQ 1579

Query: 898  RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTV-----IFGTVNGVIGVIASLPHE 952
            RL    ++H+G  V +     +V R     V +  +V     +  T+ G +  +A +   
Sbjct: 1580 RLLRQLDFHVGSHVQQVLR--MVPRSLPVSVDRGASVKRHIDLLATLEGSLNALAPIGET 1637

Query: 953  QYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKK--TVDAKNFLDGDLIESFLDLSRTR 1010
             +  LE LQ  L   ++   GLN   +R++   +K  T  A N +DG+L+  FL L    
Sbjct: 1638 TFRRLEWLQRQLVG-LQQRAGLNPIGYRAYRFPRKMTTTRAGNVIDGELLSRFLYLGLAE 1696

Query: 1011 MDEISKTMNVSVEEL 1025
              E+++    + E+L
Sbjct: 1697 QRELARQRRNTPEDL 1711



 Score = 47.4 bits (111), Expect = 0.040,   Method: Compositional matrix adjust.
 Identities = 32/113 (28%), Positives = 61/113 (53%), Gaps = 15/113 (13%)

Query: 95  KGQLKEAFNIRLEELQ-----VLDIKFLYGCAKPTIVVLYQDNKDARHVKTYE------- 142
           + +L+ ++ I+L ELQ     V+DI+FL G  +PT+ +L++ N  +   +T +       
Sbjct: 292 RQRLRPSYEIKLTELQRHIHHVIDIEFLTGYFEPTLALLFEPNAPSWTGRTVQRKDTCSM 351

Query: 143 VALK-DKDFVEGP--WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANA 192
           VAL  +      P  WS + L   +  ++ VP P+CG +I+  + I++ S ++
Sbjct: 352 VALSINTSSHSHPVVWSVDKLPFNSMRVMAVPRPVCGTVIVTPDAILHLSQSS 404


>gi|268580265|ref|XP_002645115.1| Hypothetical protein CBG16808 [Caenorhabditis briggsae]
 gi|296439546|sp|A8XPU7.1|CPSF1_CAEBR RecName: Full=Probable cleavage and polyadenylation specificity
            factor subunit 1; AltName: Full=Cleavage and
            polyadenylation specificity factor 160 kDa subunit;
            Short=CPSF 160 kDa subunit
          Length = 1454

 Score = 87.0 bits (214), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 84/315 (26%), Positives = 146/315 (46%), Gaps = 35/315 (11%)

Query: 739  VLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWML 798
            V+PE  +PT  R         K++++ +KE KG V  L A NG LL+ + QK+  + W  
Sbjct: 1153 VVPEPGQPTSNR---------KIKVLYDKEQKGPVTGLCAINGLLLSGMGQKV--FIWQF 1201

Query: 799  RDDGTRELQSECGHHGHILALY-VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYN 857
            +D+    + S    H ++  L+ ++T     +  D  +S+SL+ ++ E  A+   +RD  
Sbjct: 1202 KDNDLMGI-SFLDMHYYVYQLHSIRT---IALALDARESMSLIRFQEENKAMSIASRDDR 1257

Query: 858  --ANWMSAVEILDDDIYLG-----AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEF 910
              A    A E L D +++G        N  LF+    +  +   E  RL V    ++G  
Sbjct: 1258 KCAQAPMASEFLVDGMHIGFLLSDEHGNITLFSYSPEAPESNGGE--RLTVKAAINIGTN 1315

Query: 911  VNRFRHGSLVMRLPDS-------DVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTN 963
            +N F        L DS       ++ Q    IFG+++G  G I  L  + Y  L  LQT 
Sbjct: 1316 INAFLRVKGHTSLLDSSSPEERENIEQRMNTIFGSLDGSFGYIRPLTEKSYRRLHFLQTF 1375

Query: 964  LRKVIKGVGGLNHEQWRSFNNEKKTV---DAKNFLDGDLIESFLDLSRTRMDEISKTMNV 1020
            +  V   + GL+ +  RS    +  V   +A+N +DGD++E +L LS     ++++ + V
Sbjct: 1376 IGSVTPQIAGLHIKGARSSKPSQPIVNGRNARNLIDGDVVEQYLHLSVYDKTDLARRLGV 1435

Query: 1021 SVEELCKRVEELTRL 1035
                +   + +L R+
Sbjct: 1436 GRYHILDDLMQLRRM 1450



 Score = 68.2 bits (165), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 90/399 (22%), Positives = 158/399 (39%), Gaps = 93/399 (23%)

Query: 65  NGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEEL-----QVLDIKFLYG 119
           N  I   DP  R     +Y     ++PF    +   ++ I L+++      V D+ FL G
Sbjct: 138 NPPIVRTDPANRCAASLVYGKHIAILPFHENSKRILSYIIPLKQIDPRLDNVADMVFLEG 197

Query: 120 CAKPTIVVLYQD----------NKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIP 169
             +PTI+ LY+             D   +    V + D+ F    W   NL    + L+ 
Sbjct: 198 YYEPTILFLYEPLQTTPGRACVRYDTMCIMGVSVNIVDRQFAVV-WQTANLPMDCNSLLS 256

Query: 170 VPPPLCGVLIIGEETIVYCSA-------------NAFKAIPIRPSITKAYGRVDADGSRY 216
           +P PL G ++ G  TIVY +              + F   P++  +      +D   S Y
Sbjct: 257 IPKPLGGAVVFGSNTIVYLNQAVPPCGIVLNSCYDGFTKFPLK-DMKHLKMTLDCSTSVY 315

Query: 217 L------LGDHAGLLHL--LVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSY 268
           +      +G   G L+L  LV +     V  L+   + +TSIA T++      +++GS  
Sbjct: 316 MEDGRIAVGSREGDLYLLRLVTSSGGATVKSLEFSKVCDTSIAFTLTVCAPGHLFVGSRL 375

Query: 269 GDSQLIKLNL----QPDAK----------------------GSYVEV------------- 289
           GDSQL++  L    +  AK                      G  +E+             
Sbjct: 376 GDSQLLEYTLLKVTKESAKKQRLEQQNPSEIELDEDDIELYGGAIEMQQNDDDEQISESL 435

Query: 290 ----LERYVNLGPIVDFC----------VVDLERQGQ-GQVVTCSGAYKDGSLRIVRNGI 334
               L+R +N+GP+   C          ++D +R+     +VT SG  K+G+L + +  +
Sbjct: 436 QFRELDRLLNVGPVKSMCFGRPNYMSNDLIDAKRKDPVFDLVTASGHGKNGALCVHQRSM 495

Query: 335 GINEQASVELQGIKGMWSLRSSTDDPFDTFLVVSFISET 373
                 S  L+G + +W++    ++    +L+VS +  T
Sbjct: 496 RPEIITSSLLEGAEQLWAVGRKENESH-KYLIVSRVRST 533


>gi|194883064|ref|XP_001975624.1| GG22421 [Drosophila erecta]
 gi|190658811|gb|EDV56024.1| GG22421 [Drosophila erecta]
          Length = 1455

 Score = 87.0 bits (214), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 106/435 (24%), Positives = 200/435 (45%), Gaps = 62/435 (14%)

Query: 654  IRSIPLGEHPRRICHQEQSRTFAICSL-------------KNQSCAEESE-MHFVR---- 695
            +R +PL   PR++ +  ++R + + +              +++  +EES    F+     
Sbjct: 1026 VRKVPLRCTPRQLVYHRENRVYCLITQTEEPMTKYYRFNGEDKELSEESRGERFIYPIGS 1085

Query: 696  -----LLDDQTFEFI--STYPLDTFEY--GCSILSCSFSDDSN---VYYCVGTAYVLPEE 743
                 L+  +T+E +  ++   + +E+     I+  S+    +    Y C+GT +    E
Sbjct: 1086 QFEMVLISPETWEIVPDASISFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNY-SE 1144

Query: 744  NEPTKGRILVF-----IVEDGK------LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQ 792
            +  ++G I ++     + E GK      ++ I +KE KG V +++   G L+  + QKI 
Sbjct: 1145 DITSRGNIHIYDIIEVVPEPGKPMTKFKIKEIFKKEQKGPVSAISDVLGFLVTGLGQKI- 1203

Query: 793  LYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEER 852
             Y W LRD    +L        +I    + T    I + D+ KSISLL ++ E   +   
Sbjct: 1204 -YIWQLRDG---DLIGVAFIDTNIYVHQIITVKSLIFIADVYKSISLLRFQEEYRTLSLA 1259

Query: 853  ARDYNANWMSAVEILDDDIYLG-----AENNFNLFTVRKNSEGATDEERGRLEVVGEYHL 907
            +RD+N   +  +E + D+  LG     AE N  ++  +  +  +   ++  L    +YHL
Sbjct: 1260 SRDFNPLEVYGIEFMVDNSNLGFLVTDAERNLIVYMYQPEARESLGGQK--LLRKADYHL 1317

Query: 908  GEFVNR-FR----HGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQT 962
            G+ VN  FR       L  R P     +   V++GT++G +G    LP + Y     LQ 
Sbjct: 1318 GQVVNTMFRVQCHQKGLHQRQPFLYENK-HFVVYGTLDGALGYCLPLPEKVYRRFLMLQN 1376

Query: 963  NLRKVIKGVGGLNHEQWRSFNNEKKT--VDAKNFLDGDLIESFLDLSRTRMDEISKTMNV 1020
             L    + + GLN +++R+  + KK     ++  +DGDLI S+  ++ +  +E++K +  
Sbjct: 1377 VLVSYQEHLCGLNPKEYRTLKSFKKQGINPSRCIIDGDLIWSYRLMANSERNEVAKKIGT 1436

Query: 1021 SVEELCKRVEELTRL 1035
              EE+   + E+ RL
Sbjct: 1437 RTEEILGDLLEIERL 1451



 Score = 56.2 bits (134), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 74/337 (21%), Positives = 136/337 (40%), Gaps = 75/337 (22%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAM---------GDVS 56
           +YG + +L+     G  +D L I+ +  K  VLQ D ++  L T ++         G  +
Sbjct: 76  LYGNVMSLQCVSLAGAMRDALLISFKDAKLSVLQHDPDTFALKTLSLHYFEEDDIRGGWT 135

Query: 57  DRIGRPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKE---------------- 100
            R   PT       +DPD R   + +Y     V+PF     L E                
Sbjct: 136 GRYFVPTVR-----VDPDSRCAVMLVYGKRLVVLPFRKDNSLDEIELADVKPIKKAPTAM 190

Query: 101 --------AFNIRLEEL-----QVLDIKFLYGCAKPTIVVLYQDNKDAR-HVKTYE---- 142
                   ++ I L +L      VLDI+FL+G  +PT+++LY+  +     +K       
Sbjct: 191 VSRTPIMASYLIALRDLDEKIDNVLDIQFLHGYYEPTLLILYEPVRTCPGRIKVRSDTCV 250

Query: 143 ---VALKDKDFVEG-PWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVY----------- 187
              ++L  +  V    W+ N+L      + P+  P+ G L++    ++Y           
Sbjct: 251 LVAISLNIQQRVHPIIWTVNSLPFDCLQVYPIQKPIGGCLVMTVNAVIYLNQSVPPYGVS 310

Query: 188 --CSANAFKAIPIRP------SITKA-YGRVDADGSRYLLGDHAGLLHLLVITHEKEK-V 237
              SA+   A P++P      S+  A +  +D D  + ++    G L++L +  +  + V
Sbjct: 311 LNSSADNSTAFPLKPQDGVRISLDCANFAFIDVD--KLVISLRTGDLYVLTLCVDSMRTV 368

Query: 238 TGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLI 274
                     + + S I  L +  +++GS  G+S L+
Sbjct: 369 RNFHFHKAAASVLTSCICVLHSEYIFLGSRLGNSLLL 405



 Score = 42.7 bits (99), Expect = 0.97,   Method: Compositional matrix adjust.
 Identities = 43/159 (27%), Positives = 71/159 (44%), Gaps = 26/159 (16%)

Query: 288 EVLERYVNLGPIVDFCV---VDLERQGQG-------------QVVTCSGAYKDGSLRIVR 331
           EV +  +N+ P+   C    V+ E  G               ++V  +G  K+G+L +  
Sbjct: 485 EVCDSLMNVAPVNYMCAGERVEFEEDGATLRPHAESLQDVKIELVAATGHSKNGALSVFV 544

Query: 332 NGIGINEQASVELQGIKGMWSL------RSSTDDPFDTFLVVSFISETRILAMNLEDELE 385
           N I      S EL G   +W++      +SS +D  D F+++S  + T +L      E+ 
Sbjct: 545 NCINPQIITSFELDGCLDVWTVFDDATKKSSRNDQHD-FMLLSQRNSTLVLQTG--QEIN 601

Query: 386 ETEIEGFCSQTQTLFCHDAIYNQ-LVQVTSGSVRLVSST 423
           E E  GF     T+F  +    + +VQVT+  VRL+  T
Sbjct: 602 EIENTGFTVNQPTIFVGNLGQQRFIVQVTTRHVRLLQGT 640


>gi|154419018|ref|XP_001582526.1| CPSF A subunit region family protein [Trichomonas vaginalis G3]
 gi|121916762|gb|EAY21540.1| CPSF A subunit region family protein [Trichomonas vaginalis G3]
          Length = 1142

 Score = 87.0 bits (214), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 164/814 (20%), Positives = 328/814 (40%), Gaps = 81/814 (9%)

Query: 259  NAVVYIGSSYGDSQ--LIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQGQ-GQV 315
            N+  YI  S  D +   ++    P+ + + +E  E +     +         R  Q G +
Sbjct: 356  NSFYYITESCEDVEEGFVEKTFSPNTENTKIEYFEGHEIKNRLTKLITAPSFRGSQLGDI 415

Query: 316  VTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP---FDTFLVVSFISE 372
            ++  G++   SL+I R G+       +E+ G  G   +++   DP   +DTF+ VS  S 
Sbjct: 416  ISIHGSFNKSSLKITRKGMPTKVLKPIEIGG--GCTFVKAVKKDPLNEYDTFIFVSNEST 473

Query: 373  TRILAMNLED-ELEETEIEGFCSQTQTL--FCHDAIYN--QLVQVTSGSVRLVSSTSREL 427
            TRI     E  EL++++   F +  +T+  F   A  N   L Q+T+  +R+++   +  
Sbjct: 474  TRIFEFFEETRELKDSKESLFVTDKKTIDVFFLPAFNNTSSLAQITTEGMRIINDKEK-- 531

Query: 428  RNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGIL----TEVKHAQLEYEISCL 483
             ++WK      +   TAN SQ+ +      +V  E  +  L    +  K  ++  +I+ +
Sbjct: 532  -HDWKRDESSKIICVTANPSQIAIVYDDNQIVLFETDETSLPKEVSSNKVMEIAGQITSI 590

Query: 484  DINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITK--EHLGGEIIPRSVLLCAF--- 538
             +        Y +  AV    D  + I  + +++  T+       +I+ ++V+   F   
Sbjct: 591  ALPQPQTGVRYVEWLAVSAVND-GLSIVYIVNISKTTEMWSVSSRQILDKTVISMMFLFV 649

Query: 539  EGISYLL-CALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDR 597
             GI  +L     +G L    L+   G L +     LG  P+T      KN   +      
Sbjct: 650  PGIGNILHIGHNEGLLTRTNLDDSNGSLDNATLKFLGNAPVTFSRCEVKNQNSILINCAS 709

Query: 598  PTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDS--LAIAKEGELTIGTIDDIQKLHIR 655
            P       K L    ++       C + +  F D   + +     +   T D    +  +
Sbjct: 710  PWY----TKGLSLVQLSSSPFVSCCQYKAPFFIDDSFIGLTTNNMVLFYTPDKNILIDTQ 765

Query: 656  SIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEY 715
            ++ L   PR+I H   ++   +C L +     + +    +L++    E IS+ PL   + 
Sbjct: 766  TLDLDMTPRQIVHIPNTK--FVCVLMSDLIKGKWKSQ-AKLVNYD--ELISSEPLQLDDN 820

Query: 716  GCSILSCSFSDDSNVYYCVGTAY-VLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVY 774
                 +C   + + V  CV       P+  +   G++L+  +  G  +++   +      
Sbjct: 821  LLVSAACYLENYNLVAVCVAKNLSFFPKRCD--GGQVLLLDIGSGIPKVVQSTDFDDIPQ 878

Query: 775  SLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLM 834
            ++  F   ++  + + + +    L+   T+ L+  C         Y++  G  I+ GD M
Sbjct: 879  AIAPFGEYVILGVQETLMV----LKVGRTKLLKKCCSKAFPHCINYLKAFGTRIIAGDAM 934

Query: 835  KSISLLIYKHEEGAIEERARDY-NANWMSAVEILDDDIYLGAENNFNLFTVRK-----NS 888
            +S   + +  EE  +   A D    + +SA+ +    +  G  + F  F + +     N 
Sbjct: 935  ESFHFVKFDKEEDILSIFADDMVPRHPLSAIGLDRSTVCCG--DKFGSFCILRLPPDIND 992

Query: 889  EGATDEE--------------RGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTV 934
            +   D                + ++E+   YH+G  +         + L   D     ++
Sbjct: 993  DAEIDPSDVGHAFENEKFPGAKNKVELSNMYHIGSPITG-------LCLSQGDFENT-SI 1044

Query: 935  IFGTVNGVIGVIASLPHEQYLFLEKL-QTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKN 993
            ++GTV+G IG+   L  +    L +L +  ++K+IK   G   EQ+RS+    K V+   
Sbjct: 1045 VYGTVDGEIGLFIPLKTDTDARLFRLLEDEMKKLIKSPVGRFIEQFRSYYTPLKCVN--- 1101

Query: 994  FLDGDLIESFLDLSRTRMDEISKTMNVSVEELCK 1027
              D +L+ S++DL      EI++ + V   +L +
Sbjct: 1102 --DSNLLLSYIDLPTNLQKEIAEKLKVKPFDLSR 1133


>gi|225455571|ref|XP_002268371.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Vitis vinifera]
          Length = 1442

 Score = 86.7 bits (213), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 93/369 (25%), Positives = 164/369 (44%), Gaps = 37/369 (10%)

Query: 694  VRLLDDQT----FEFISTYPLDTFEYGCSI----LSCSFSDDSNVYYCVGTAYVLPEENE 745
            VR+L+ +     ++  +T P+ + E   ++    L  + + ++     +GTAYV   E+ 
Sbjct: 1076 VRVLEPEKSGAPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYV-QGEDV 1134

Query: 746  PTKGRILVFIV---EDGKLQLIAE---KETKGAVYSLNAFNGKLLAAINQKIQLYKWMLR 799
              +GR+L+F V    D    L++E   KE KGA+ ++ +  G LL A   KI L+KW   
Sbjct: 1135 AARGRVLLFSVGKNTDNSQNLVSEIYSKELKGAISAVASLQGHLLIASGPKIILHKWT-- 1192

Query: 800  DDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNAN 859
              GT            +  + +    +FI++GD+ +SI  L +K +   +   A+D+ + 
Sbjct: 1193 --GTELNGVAFFDAPPLYVVSLNIVKNFILLGDIHRSIYFLSWKEQGAQLNLLAKDFGSL 1250

Query: 860  WMSAVEILDDDIYLGA-----ENNFNLFT-VRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
               A E L D   L       + N  +F    K SE    +   +L    E+H+G  V +
Sbjct: 1251 DCFATEFLIDGSTLSLIVSDDQKNIQIFYYAPKMSESWKGQ---KLLSRAEFHVGAHVTK 1307

Query: 914  FRHGSLVMRLPD-------SDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRK 966
            F    ++    D       SD      ++FGT++G IG IA L    +  L+ LQ  L  
Sbjct: 1308 FLRLQMLPASSDRTSATQGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVD 1367

Query: 967  VIKGVGGLNHEQWRSF--NNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEE 1024
             +  V GLN   +R F  N +       N +D +L+  +  L      EI++ +  +  +
Sbjct: 1368 AVPHVAGLNPRSFRQFRSNGKAHRPGPDNIVDCELLCHYEMLPFEEQLEIAQQIGTTRMQ 1427

Query: 1025 LCKRVEELT 1033
            +   + +L+
Sbjct: 1428 ILSNLNDLS 1436



 Score = 76.3 bits (186), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 109/503 (21%), Positives = 194/503 (38%), Gaps = 132/503 (26%)

Query: 6   IYGRIATLELFRPHG----EAQDFLFIATERYKFCVLQWDAESSELITRAM----GDVSD 57
           ++G + T+ +    G      +D + +A +  K  VL++D     L T +M    G    
Sbjct: 109 LHGNVETMTVLPSGGGDNSRRRDSIILAFQDAKISVLEFDDSIHGLRTSSMHCFEGPEWF 168

Query: 58  RIGRPTDN---GQIGIIDPDCRLIGLHLYDGLFKVIPFDNKG------------------ 96
            + R  ++   G +  +DP  R  G+ +Y     ++     G                  
Sbjct: 169 HLKRGHESFARGPLVKVDPQGRCSGVLVYGLQMIILKASQAGYGLVGDEEALSSGSAVSA 228

Query: 97  QLKEAFNIRLEEL---QVLDIKFLYGCAKPTIVVLYQ------DNKDARHVKTYEVALK- 146
           +++ ++ I L +L    V D  F++G  +P +V+L++           +H      AL  
Sbjct: 229 RVESSYVISLRDLDMKHVKDFTFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSI 288

Query: 147 DKDFVEGP--WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIRPSITK 204
                + P  WS  NL + A  L+PVP P+ GV++I   +I Y S +A  A+ +      
Sbjct: 289 STTLKQHPLIWSAVNLPHDAYKLLPVPSPIGGVVVISANSIHYHSQSASCALALNNYAVS 348

Query: 205 AYG-----------RVDADGSRYLLGDHA------GLLHLLVITHEKEKVTGLKIELLGE 247
           A              +DA  + +L  D A      G L LL + ++   V  L +     
Sbjct: 349 ADNSQEMPRSSFSVELDAANATWLSNDVAMLSTKTGELLLLTLAYDGRVVHRLDLSKSRA 408

Query: 248 TSIASTISYLDNAVVYIGSSYGDSQLIKLNL----------------QPDAK-------- 283
           + + S I+ + N++ ++GS  GDS L++                    P AK        
Sbjct: 409 SVLTSGIAAIGNSLFFLGSRLGDSLLVQFTSILSSSVKEEVGDIEGDVPSAKRLRKSSSD 468

Query: 284 --------------GSY------------VEVLERYVNLGPIVDFC----------VVDL 307
                         GS               V + ++N+GP+ DF              +
Sbjct: 469 ALQDMVNGEELSLYGSAPNSTETSQKTFSFSVRDSFINVGPLKDFAYGLRINADPKATGI 528

Query: 308 ERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL-------------- 353
            +Q   ++V CSG  K+G+L I++  I       VEL G KG+W++              
Sbjct: 529 AKQSNYELVCCSGHGKNGALCILQQSIRPEMITEVELPGCKGIWTVYHKNTRGHNADSTK 588

Query: 354 RSSTDDPFDTFLVVSFISETRIL 376
            ++ DD +  +L++S  S T +L
Sbjct: 589 MATKDDEYHAYLIISLESRTMVL 611


>gi|296084122|emb|CBI24510.3| unnamed protein product [Vitis vinifera]
          Length = 1448

 Score = 86.7 bits (213), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 93/369 (25%), Positives = 164/369 (44%), Gaps = 37/369 (10%)

Query: 694  VRLLDDQT----FEFISTYPLDTFEYGCSI----LSCSFSDDSNVYYCVGTAYVLPEENE 745
            VR+L+ +     ++  +T P+ + E   ++    L  + + ++     +GTAYV   E+ 
Sbjct: 1082 VRVLEPEKSGAPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYV-QGEDV 1140

Query: 746  PTKGRILVFIV---EDGKLQLIAE---KETKGAVYSLNAFNGKLLAAINQKIQLYKWMLR 799
              +GR+L+F V    D    L++E   KE KGA+ ++ +  G LL A   KI L+KW   
Sbjct: 1141 AARGRVLLFSVGKNTDNSQNLVSEIYSKELKGAISAVASLQGHLLIASGPKIILHKWT-- 1198

Query: 800  DDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNAN 859
              GT            +  + +    +FI++GD+ +SI  L +K +   +   A+D+ + 
Sbjct: 1199 --GTELNGVAFFDAPPLYVVSLNIVKNFILLGDIHRSIYFLSWKEQGAQLNLLAKDFGSL 1256

Query: 860  WMSAVEILDDDIYLGA-----ENNFNLFT-VRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
               A E L D   L       + N  +F    K SE    +   +L    E+H+G  V +
Sbjct: 1257 DCFATEFLIDGSTLSLIVSDDQKNIQIFYYAPKMSESWKGQ---KLLSRAEFHVGAHVTK 1313

Query: 914  FRHGSLVMRLPD-------SDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRK 966
            F    ++    D       SD      ++FGT++G IG IA L    +  L+ LQ  L  
Sbjct: 1314 FLRLQMLPASSDRTSATQGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVD 1373

Query: 967  VIKGVGGLNHEQWRSF--NNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEE 1024
             +  V GLN   +R F  N +       N +D +L+  +  L      EI++ +  +  +
Sbjct: 1374 AVPHVAGLNPRSFRQFRSNGKAHRPGPDNIVDCELLCHYEMLPFEEQLEIAQQIGTTRMQ 1433

Query: 1025 LCKRVEELT 1033
            +   + +L+
Sbjct: 1434 ILSNLNDLS 1442



 Score = 70.5 bits (171), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 109/509 (21%), Positives = 193/509 (37%), Gaps = 138/509 (27%)

Query: 6   IYGRIATLELFRPHG----EAQDFLFIATERYKFCVLQWDAESSELITRAM----GDVSD 57
           ++G + T+ +    G      +D + +A +  K  VL++D     L T +M    G    
Sbjct: 109 LHGNVETMTVLPSGGGDNSRRRDSIILAFQDAKISVLEFDDSIHGLRTSSMHCFEGPEWF 168

Query: 58  RIGRPTDN---GQIGIIDPDCRLIGLHLYDGLFKVIPFDNKG------------------ 96
            + R  ++   G +  +DP  R  G+ +Y     ++     G                  
Sbjct: 169 HLKRGHESFARGPLVKVDPQGRCSGVLVYGLQMIILKASQAGYGLVGDEEALSSGSAVSA 228

Query: 97  QLKEAFNIRLEEL---QVLDIKFLYGCAKPTIVVLYQ------DNKDARHVKTYEVALK- 146
           +++ ++ I L +L    V D  F++G  +P +V+L++           +H      AL  
Sbjct: 229 RVESSYVISLRDLDMKHVKDFTFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSI 288

Query: 147 DKDFVEGP--WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIRPSITK 204
                + P  WS  NL + A  L+PVP P+ GV++I   +I Y S +A  A+ +      
Sbjct: 289 STTLKQHPLIWSAVNLPHDAYKLLPVPSPIGGVVVISANSIHYHSQSASCALALNNYAVS 348

Query: 205 AYG-----------RVDADGSRYLLGDHA------GLLHLLVITHEKEKVTGLKIELLGE 247
           A              +DA  + +L  D A      G L LL + ++   V  L +     
Sbjct: 349 ADNSQEMPRSSFSVELDAANATWLSNDVAMLSTKTGELLLLTLAYDGRVVHRLDLSKSRA 408

Query: 248 TSIASTISYLDNAVVYIGSSYGDSQLIKLNL----------------QPDAK---GSYVE 288
           + + S I+ + N++ ++GS  GDS L++                    P AK    S  +
Sbjct: 409 SVLTSGIAAIGNSLFFLGSRLGDSLLVQFTSILSSSVKEEVGDIEGDVPSAKRLRKSSSD 468

Query: 289 VLERYVN-------------------------------------LGPIVDFC-------- 303
            L+  VN                                     +GP+ DF         
Sbjct: 469 ALQDMVNGEELSLYGSAPNSTETSQVEAQVGKTFSFSVRDSFINVGPLKDFAYGLRINAD 528

Query: 304 --VVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL-------- 353
                + +Q   ++V CSG  K+G+L I++  I       VEL G KG+W++        
Sbjct: 529 PKATGIAKQSNYELVCCSGHGKNGALCILQQSIRPEMITEVELPGCKGIWTVYHKNTRGH 588

Query: 354 ------RSSTDDPFDTFLVVSFISETRIL 376
                  ++ DD +  +L++S  S T +L
Sbjct: 589 NADSTKMATKDDEYHAYLIISLESRTMVL 617


>gi|357162146|ref|XP_003579318.1| PREDICTED: probable cleavage and polyadenylation specificity factor
            subunit 1-like [Brachypodium distachyon]
          Length = 1442

 Score = 86.7 bits (213), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 84/350 (24%), Positives = 160/350 (45%), Gaps = 27/350 (7%)

Query: 702  FEFISTYPLDTFEYGCSI----LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVF--I 755
            +E  ST P+ +FE   ++    L  + + ++     +GTAYV   E+   +GR+L+F   
Sbjct: 1096 WETRSTIPMQSFENALTVRIVTLHNTTTKENETLMAIGTAYV-QGEDVAARGRVLLFSFT 1154

Query: 756  VEDGKLQLIAE---KETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGH 812
              +    L+ E   KE+KGAV ++ +  G L+ A   KI L KW    +G+ EL +   +
Sbjct: 1155 KSENSQNLVTEVYSKESKGAVSAVASLQGHLVIASGPKITLNKW----NGS-ELTAVAFY 1209

Query: 813  HGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIY 872
               +  + +    +F++ GD+ KS+  L +K +   +   A+D+ +    A E L D   
Sbjct: 1210 DAPLHVVSLNIVKNFVLFGDIHKSVYFLSWKEQGSQLTLLAKDFGSLDCFATEFLIDGST 1269

Query: 873  LG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVG 929
            L    ++++ NL       +     +  +L    E H+G  + +F     +  LP   + 
Sbjct: 1270 LSLVVSDSDKNLQIFYYAPKMVESWKGQKLLSRAELHVGAHMTKFLR---LQMLPAQGLA 1326

Query: 930  QIPT----VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSF--N 983
               T    ++FGT++G IG IA +    +  L+ LQ  L   +  V GLN   +R F  N
Sbjct: 1327 SEKTNRFALLFGTLDGSIGCIAPVDELTFRRLQSLQRKLVDAVSHVCGLNPRSFRQFKSN 1386

Query: 984  NEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELT 1033
             +       N +D +L+  +  LS     ++++ +  +  ++     +++
Sbjct: 1387 GKAHRPGPDNIIDFELLTYYEILSLEEQLDMAQQIGTTRAQILSNFSDIS 1436



 Score = 75.1 bits (183), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 90/419 (21%), Positives = 173/419 (41%), Gaps = 110/419 (26%)

Query: 106 LEELQVLDIKFLYGCAKPTIVVLYQDNK------DARH----VKTYEVALKDKDFVEGP- 154
           L+   V D  F++G  +P +V+L++          ++H    +  + +++  K   + P 
Sbjct: 245 LDTNHVKDFTFVHGYIEPVLVILHEREPTWAGRISSKHHTCMISAFSISMTLK---QHPM 301

Query: 155 -WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVY------CS--ANAFKAIPI-RPSITK 204
            WS  N+ + A  ++ VPPP+ GVL+I   +I Y      CS   N F + P   P I K
Sbjct: 302 IWSAANIPHDAYQILSVPPPISGVLVICANSIHYHSQSTSCSLALNNFASQPDGSPEIHK 361

Query: 205 AYGRVDADGSR--------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISY 256
               V+ D ++         +     G + LL + ++   V  L +     + I+S ++ 
Sbjct: 362 VNFHVELDAAKATWLSNDIVMFSAKTGEMLLLTVVYDGRTVQKLDLMKSKASVISSGVTT 421

Query: 257 LDNAVVYIGSSYGDSQLIKL---------------------------------------- 276
           + ++  ++GS  GDS L++                                         
Sbjct: 422 IGSSFFFLGSRVGDSLLVQFSCGVPTSVIPDIADERSADIEGDLPFSKRLKRVPSDILQD 481

Query: 277 -----------NLQPDA-----KGSYVEVLERYVNLGPIVDFC----------VVDLERQ 310
                      N+ P++     K SYV V +  VN+GP+ DF                +Q
Sbjct: 482 VTSVEELSFQNNMLPNSLESAQKISYV-VRDALVNVGPLKDFSYGLRVNADPNATGNAKQ 540

Query: 311 GQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL-------RSSTDDPFDT 363
              ++V CSG  K+G+L +++  I  +    VEL   +G+W++        ++ D+ +  
Sbjct: 541 SNYELVCCSGHGKNGALSVLQQSIRPDLITEVELPSCRGIWTVYYKSSRGHTTEDNEYHA 600

Query: 364 FLVVSFISETRILAMNLEDELEE-TEIEGFCSQTQTLFCHDAI-YNQLVQVTSGSVRLV 420
           +L++S   E+R + +   D+L E TE   +  Q  T+   +     +++QV +   R++
Sbjct: 601 YLIISL--ESRTMVLETGDDLGEVTETVDYYVQGATITAGNLFGRRRVIQVYATGARVL 657


>gi|221055487|ref|XP_002258882.1| CPSF (cleavage and polyadenylation specific factor), subunit A
            [Plasmodium knowlesi strain H]
 gi|193808952|emb|CAQ39655.1| CPSF (cleavage and polyadenylation specific factor), subunit A,
            putative [Plasmodium knowlesi strain H]
          Length = 2478

 Score = 86.3 bits (212), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 122/539 (22%), Positives = 220/539 (40%), Gaps = 96/539 (17%)

Query: 565  LTDRKKVSLGTQ-PITLRTFS----------------SKNTTHVFAASDRPTVIYSS-NK 606
            LT R+K+S  T+ P+  +TF                 SK    +F   D P +IYS+  K
Sbjct: 1907 LTKRRKISTCTKSPVKFKTFLKVNSEKNLIDINMSKLSKKCNFLFVCCDNPIIIYSTLKK 1966

Query: 607  KLLYSNVNLKEV---------SHMCPF-NSAAFP-----DSLAIAKEG-ELTIGTIDDIQ 650
            K+  S ++++ V         +++ PF N  +F      +S  I  +G +L I  +++++
Sbjct: 1967 KISTSKLSIRNVLLVDMFSDFNYLNPFHNFLSFKKKNQNNSYFIFFDGHKLCISYLNEMK 2026

Query: 651  KLHIRSIPLGEHPRRIC-HQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYP 709
            K  +  IP      +I  H +       C ++ +    +     V   D     F  TY 
Sbjct: 2027 KTFMERIPFHRTVEKIAYHADTGLLITACPVEEKHKTNQMMKQIVCFFDPFQNSFKYTYI 2086

Query: 710  LDTFEYGCSI----LSCSFS-------DDSN-----------------VYYCVGTAYVLP 741
            + +     SI    L+ S S       D  N                    CVGTA    
Sbjct: 2087 IPSKFSVSSICIYELASSHSLVADHSVDQMNQVNHPNNKDGALNPPVHTLICVGTANNNE 2146

Query: 742  EENEPTKGRILVFIVEDGKLQ-----LIAEKETKGAVYSLNAFNGKLLAAINQKIQLY-- 794
               EP+ G I VF+ +    Q     +       G +  L  F  K++AA+N  + +   
Sbjct: 2147 RITEPSSGHIYVFVAKKKTNQFEIKHIYTYNVNCGGITHLKQFRDKIVAAVNNTVLILDI 2206

Query: 795  KWMLRDDGT------RELQSECGHHGHILALYVQTR--------GDFIVVGDLMKSISLL 840
            +  L + GT      + ++ E       +A +  +          ++IVVGD+M S++LL
Sbjct: 2207 RNFLTNLGTYIYNASKAMKVESNDAFLEVASFTPSSWIMSLDVVKNYIVVGDIMTSVTLL 2266

Query: 841  IYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLE 900
             Y  E   + E  RDY   W +A   L +D +L ++   N   ++K++    DEE  +L 
Sbjct: 2267 SYDFENAILNEVCRDYANIWCTA---LSEDHFLVSDMESNFLVLQKSNIKFNDEESFKLS 2323

Query: 901  VVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVI-----GVIASL-PHEQY 954
            +V +++ G  VN+    SL   + + +  + P  I      ++     G I++L P   +
Sbjct: 2324 LVSQFNHGSVVNKMLSTSLRNLVDEYESEERPNEIVQKERSILCASSEGSISTLIPFSNF 2383

Query: 955  LFLEK---LQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTR 1010
            +  ++   ++  +   I  +G L+H  +R +     + + K  +DG+L + F  L   R
Sbjct: 2384 IQFKRALCIEIAINDNISSLGNLSHSSYREYKITLASKNCKGVVDGELFKMFFYLPFER 2442



 Score = 48.9 bits (115), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 20/69 (28%), Positives = 43/69 (62%), Gaps = 1/69 (1%)

Query: 286 YVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQ 345
           ++E+L    N+GPI+DFCVV   +  + +++TC+   + G + I+RNG+ ++  + + + 
Sbjct: 829 FIEILSVIQNMGPILDFCVVK-NKNEEKEIITCNSYGRTGCISIIRNGMKVDIISKLNIG 887

Query: 346 GIKGMWSLR 354
            I  M+ ++
Sbjct: 888 KITNMFVVK 896


>gi|67521912|ref|XP_659017.1| hypothetical protein AN1413.2 [Aspergillus nidulans FGSC A4]
 gi|74598221|sp|Q5BDG7.1|CFT1_EMENI RecName: Full=Protein cft1; AltName: Full=Cleavage factor two protein
            1
 gi|40745387|gb|EAA64543.1| hypothetical protein AN1413.2 [Aspergillus nidulans FGSC A4]
 gi|259486722|tpe|CBF84808.1| TPA: Protein cft1 (Cleavage factor two protein 1)
            [Source:UniProtKB/Swiss-Prot;Acc:Q5BDG7] [Aspergillus
            nidulans FGSC A4]
          Length = 1339

 Score = 86.3 bits (212), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 96/438 (21%), Positives = 192/438 (43%), Gaps = 70/438 (15%)

Query: 654  IRSIPLGEHPRRICHQEQSRTFAI--CSLKNQSCAEESEMH-----------------FV 694
            +R++P+G+   ++ +   S T+ +  C        E+ E+H                  +
Sbjct: 907  MRTVPIGQQIDKLTYVSASDTYVLGTCQRCEFRLPEDDELHPEWRNEEISFLPEVNQSSL 966

Query: 695  RLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVY-----YCVGTAYVLPEENEPTKG 749
            +++  +T+  I +YPL+  E+   + + S     N +       VGT+     E+ P++G
Sbjct: 967  KVVSPKTWSVIDSYPLEPAEHIMVMKTMSLEVSENTHERRDMIVVGTSLAR-GEDIPSRG 1025

Query: 750  RILVF----IVEDG-------KLQLIAEKETKGAVYSLNAFNGK--LLAAINQKIQLYKW 796
             I VF    +V D        +L+LI ++  KGAV +L+   G+  L+AA  QK  +   
Sbjct: 1026 CIYVFEVIEVVPDPEQPETNRRLKLIGKEPVKGAVTALSEIGGQGFLIAAQGQKSMVRG- 1084

Query: 797  MLRDDGT----RELQSECGHHGHILALYVQTRGD-FIVVGDLMKSISLLIYKHEEGAIEE 851
             L++DG+      +  +C      +++  + +G    + GD +K +    Y  E   +  
Sbjct: 1085 -LKEDGSLLPVAFMDMQC-----FVSVIKELKGTGMCIFGDAVKGLWFAGYSEEPYKMSL 1138

Query: 852  RARDYNANWMSAVEILDDD---IYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLG 908
             A+D +   + A + L D      + A+++ NL+ ++ + E        +L    ++H G
Sbjct: 1139 FAKDLDYLEVLAADFLPDGNKLFIVVADSDCNLYVLQYDPEDPNSSNGDKLLNRSKFHTG 1198

Query: 909  EFVN-------------RFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYL 955
             F +             R   GS  M +   +   +  V+  + NG IG++  +P E Y 
Sbjct: 1199 NFASTVTLLPRTLVSSERAMSGSDKMDI--DNTAPLHQVLVTSHNGSIGLVTCVPEESYR 1256

Query: 956  FLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEIS 1015
             L  LQ+ L   ++   GLN   +R+  ++      +  LD +L+  +LD+S+ R  EI+
Sbjct: 1257 RLSALQSQLTNTLEHPCGLNPRAYRAVESDASA--GRGMLDSNLLLQYLDMSKQRKAEIA 1314

Query: 1016 KTMNVSVEELCKRVEELT 1033
              +  +  E+   +E ++
Sbjct: 1315 GRVGATEWEIRADLEAIS 1332


>gi|156097003|ref|XP_001614535.1| hypothetical protein [Plasmodium vivax Sal-1]
 gi|148803409|gb|EDL44808.1| hypothetical protein, conserved [Plasmodium vivax]
          Length = 2558

 Score = 85.5 bits (210), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 119/565 (21%), Positives = 226/565 (40%), Gaps = 110/565 (19%)

Query: 552  HLLNFLLNMKTGELTDRKKVSLGTQ-PITLRTF----SSKNTTHV------------FAA 594
            +L ++++  K   LT R+K+S  T  P+  + F    S +N   V            F  
Sbjct: 1962 YLKSYVVKSKKIMLTKRRKISTCTNSPVKFKAFLKVHSERNQIDVNMSKLTKRCNFLFVC 2021

Query: 595  SDRPTVIYSS-NKKLLYSNVNLKEV---------SHMCPFNSAAF------PDSLAIAKE 638
             D P +IYS+  KKL  S ++++ V         +++ PF++          +S  I  +
Sbjct: 2022 CDSPIIIYSTLKKKLSISKLSIRNVHLVDMFSDFNYLNPFHNFLLFKKKNQNNSYFIFFD 2081

Query: 639  G-ELTIGTIDDIQKLHIRSIPLGEHPRRIC-HQEQSRTFAICSLKNQSCAEESEMHFVRL 696
            G +L I  +++++K  +  IP      +I  H +       C ++ +    +     V  
Sbjct: 2082 GNQLCISYLNEMKKTFMERIPFHRTVEKIAYHADTGLLITACPVEEKHKTNQMMKQIVCF 2141

Query: 697  LD--DQTFEF------------ISTYPLDTFEYGCSI--------------------LSC 722
             D    +F++            I  Y L     G S+                    L  
Sbjct: 2142 FDPFQNSFKYTYIIPSKFSVSSICIYELAPSSGGASMGEMEQTSQMGQMEQTNQTNELKP 2201

Query: 723  SFSDDSN------VYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQ-----LIAEKETKG 771
            S  ++           CVGTA       EP+ G I VF+ +    Q     +     + G
Sbjct: 2202 SHPEERTDAPPVRTLICVGTANNNERITEPSSGHIYVFVAKKQTNQFEIKHVYTYNVSCG 2261

Query: 772  AVYSLNAFNGKLLAAINQKI----------QLYKWMLRDDGTRELQSECGH--------H 813
             +  L  F  K++AA+N  +           L  ++       +++S             
Sbjct: 2262 GITHLKQFRDKIVAAVNNTVVILDIGNFLANLGAYIYNSSKAIKIESNDAFLEVASFTPS 2321

Query: 814  GHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYL 873
              I++L V    ++IVVGD+M S++LL Y  E   + E  RDY   W ++V  L ++ +L
Sbjct: 2322 SWIMSLDVVE--NYIVVGDIMTSVTLLSYDFENAILNEVCRDYANIWCTSVSALSENHFL 2379

Query: 874  GAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSD-----V 928
             ++   N   ++K++    DEE  +L +V +++ G  VN+    SL   + D +     +
Sbjct: 2380 VSDMESNFLVLQKSNIKFNDEESFKLSLVSQFNHGSVVNKMFSTSLRNLVDDEERRNEIL 2439

Query: 929  GQIPTVIFGTVNGVIGVIASLPHEQYLFLEK---LQTNLRKVIKGVGGLNHEQWRSFNNE 985
             +  +++  +  G I  +  +P   +L  ++   ++  +   I  +G L+H  +R +   
Sbjct: 2440 QKEQSILCASSEGSISAL--IPFSNFLQFKRALCIEIAINDNISSLGNLSHSSYREYKVS 2497

Query: 986  KKTVDAKNFLDGDLIESFLDLSRTR 1010
              + + K  +DG+L + F  L   R
Sbjct: 2498 LASKNCKGVVDGELFKMFFYLPFER 2522



 Score = 54.7 bits (130), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 23/69 (33%), Positives = 44/69 (63%), Gaps = 1/69 (1%)

Query: 286 YVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQ 345
           ++E+L    N+GPI+DFCVV   + G+ +++TC+   + G + I+RNG+ +N  + + L 
Sbjct: 822 FIEILAVIQNMGPILDFCVVK-NKNGEKEIITCNSYGRTGCISIIRNGMKVNIISKLSLG 880

Query: 346 GIKGMWSLR 354
            I  M+ ++
Sbjct: 881 KITNMFVVK 889


>gi|157872916|ref|XP_001684981.1| putative CPSF-domain protein [Leishmania major strain Friedlin]
 gi|68128051|emb|CAJ06910.1| putative CPSF-domain protein [Leishmania major strain Friedlin]
          Length = 1347

 Score = 85.1 bits (209), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 206/987 (20%), Positives = 378/987 (38%), Gaps = 169/987 (17%)

Query: 157  QNNLDNGADLLIPVPPPLCGVLIIGEETI-----VYCSANAF-KAIPIRPSITK-AYGRV 209
            Q N+D  A  ++P      G+ ++G   +     V      F + IP R +I   +   V
Sbjct: 320  QANVDPTASRIVPSRH---GLFVVGNHLVTLIQQVRPRKGVFSREIPSRQAILDVSCASV 376

Query: 210  DADGSRYLLGDHAGLLHLLVITHE----KEKVTGLKIELLGETSIASTISYLDNAVVYIG 265
             AD S  L+G   G+L  + +       ++     +       +I + +  L   +  + 
Sbjct: 377  SADASELLIGFSDGVLARVTVISRGMGGEDPELAFRFLSHPAPTIPTELVTLHADLYLLC 436

Query: 266  SSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDG 325
            S +  S  ++L+     + S  EVL    N GP++D  +   +      VV  +G ++ G
Sbjct: 437  SRFDSSFTVRLD-----EASCEEVLH---NCGPVLD--MTTHKNGSHNSVVASTGIHRGG 486

Query: 326  SLRIVRNGIGINEQASVEL----QGIKGMWSLRS-STDDPFDTFLVVSFISETRILAMNL 380
             + ++R+ + + EQA + L    Q I     L   +T    + F +VS        A+ L
Sbjct: 487  GISVLRSAVMLREQADIPLRCHPQRILCAGELMCITTPAAANRFFLVS------ARALTL 540

Query: 381  EDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVN 440
            E+       E            DA  NQLV +++  +  +  +   +    + P      
Sbjct: 541  EELAPHVFPEIVSRHLLVELGLDAALNQLVLISARGIAWLRLSGPRVEAVSQLP------ 594

Query: 441  VATANASQVLLATGGGHLVYLEIGDGILTEVKHAQ-LEYEISCLDINPIGENPSYS-QIA 498
               A+  Q+L A     L    + DGI+  V   Q L Y +  L    I      S  + 
Sbjct: 595  -RQADDPQILFAVVRHGLTV--VCDGIVVTVYRGQDLVYTVPSLAGVAISSVSLVSPTLL 651

Query: 499  AVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEGIS-------YLLCALGDG 551
             +G W D  V ++ L          L  + +PR++  CA   +S         L  +  G
Sbjct: 652  VLGQW-DGVVALYELGHRQADVVGLLLLDSVPRTIA-CAVPPLSPSSVVGEAALPMIYIG 709

Query: 552  HLLNFLLN-----MKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNK 606
             L  FL+      ++ GE   R  V +   P+ L   + ++   +   +    V+  +  
Sbjct: 710  TLHGFLVGTTVALLRRGE--KRLSVFVAPHPLELLELAGRHAGLLCLGNVPLVVLCVAGG 767

Query: 607  KLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRI 666
             L  + ++L  VS     ++     +     +G L +G+++ ++K     +PLGE   ++
Sbjct: 768  GLQLTGLHLTGVSAAATVDADLQCYAFYSKHDGRLHVGSLEALEKTSRTFLPLGETITQL 827

Query: 667  CHQEQSRTFAICSLKNQSCAEESEMHF---------------------VRLLDDQTFEFI 705
             H+  +      S++     E  E+ F                     V LL+++   F+
Sbjct: 828  -HETAAWGGYAASVRK---VEGDEVLFLPSAVIQFPWAADLALWRSAAVPLLENERCVFL 883

Query: 706  STYPLDTFEYGCSI------LSCSFSDDSNV-------YYCVGTAYVLPEENEPTKGRIL 752
             T  LD  E G +       +  + SD + V          +G+++  P+E     GRI 
Sbjct: 884  QTVRLDGGEGGGAGERGGEPIDVAASDSAGVSEEEWQHLLLIGSSFTFPDEQRARSGRIT 943

Query: 753  VFIV----EDGKLQLIAEKETKGAVYSLNA---FNGKLLAAINQKIQLYKWMLRDDGTRE 805
               +    +  +L LIA K+  GA+    A   + G++   +N  + LYKW   +D T  
Sbjct: 944  WCALREEHQQQRLHLIASKDIGGALQCCAAVPHYKGRIALGVNGCVCLYKWN-TEDQTFV 1002

Query: 806  LQSECG---HHGHILALYVQTRGDFIVVG-DLMKSISLLIYKHEEGAIEERARDYNANWM 861
             +  C        ++ LY  +    ++V  D+  S   +     +G+++   RD      
Sbjct: 1003 AEERCRVGLTVTKLIPLYHTSLAASVLVALDVRHSAFFIEVDTLQGSLKVLCRDAELRG- 1061

Query: 862  SAVEILDDDIYLGAEN--------NFNLFTV---------RKNSEGATDEERGRLEVVGE 904
                ++D  I   AEN        NF    V            +  A+   + R EV  +
Sbjct: 1062 ----VMDGHIGSDAENLCLFDDSLNFTALRVVPLPVEAGDGDAAAAASVTAQYRFEVRAQ 1117

Query: 905  YHLGEFVNRFRHGSL----VMRLPDSDVGQI----------PTVIFGTVNGVIGVIASLP 950
             HLG+ V   R GS     +M  P S               P ++F T +G  GV+  + 
Sbjct: 1118 CHLGDLVTCVRQGSFAATSLMEAPASCASAQNRLLLPGIAGPQLVFATAHGGFGVVTPVH 1177

Query: 951  HEQYLFLEKLQTNLRKVIKGVGGLNHEQWR--------------------SFNNEK-KTV 989
               YL L  L+ +L + ++ +GGL+H+ +R                    +   E+ +  
Sbjct: 1178 AATYLVLRTLEASLVRTLQPLGGLSHQAFREVLRSGQERGVSYLASKTGCALTRERLRRY 1237

Query: 990  DAKNFLDGDLIESFLDLSRTRMDEISK 1016
            +  N +DGD++E F+ LS +   ++ +
Sbjct: 1238 EPLNTVDGDMVEQFVQLSVSDKKQVCR 1264


>gi|291232722|ref|XP_002736302.1| PREDICTED: cleavage and polyadenylation specific factor 1-like
            [Saccoglossus kowalevskii]
          Length = 984

 Score = 84.7 bits (208), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 123/500 (24%), Positives = 208/500 (41%), Gaps = 97/500 (19%)

Query: 591  VFAASDRPTVIYSSNKKLLYSNVNLKEVSHMC--PFNSAAFPDS-LAIAKEGELTIGTID 647
            VF     P  ++ +++  L S+    + S  C  PF++   P   L   K GEL I  + 
Sbjct: 523  VFVCGPYPHWLFMTSRGALRSHPMHIDGSVTCFAPFHNINCPKGFLYFNKHGELRICVLP 582

Query: 648  DIQKLH----IRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEM------------ 691
                      +R +PL   P  I +  +S+T+A+ +  ++ C    +M            
Sbjct: 583  THLSYDALWPVRKVPLRCTPHFISYHIESKTYAVVTSVSEPCLRICKMTGDDKEFEDVER 642

Query: 692  --HFV---------RLLDDQTFEFISTYPLDT--FEYGCSILSCSFSDDSNV-----YYC 733
               F+         +L    ++E I    +DT  +E+   + +     +  V     +  
Sbjct: 643  DDRFIFPTIEKFSLQLFSPLSWEAIPNTKIDTEDWEHITGLKTVFLKSEGTVSGLKGFIA 702

Query: 734  VGTAYVLPEENEPTKGRILVF-----IVEDG------KLQLIAEKETKGAVYSLNAFNGK 782
            V T  V  EE    +GRIL+F     + E G      KL+L+ +KE KG V +L    G 
Sbjct: 703  VSTTIVYGEEVT-CRGRILIFDVIEVVPEPGQPLTKNKLKLLYDKEQKGPVTTLCDIEGL 761

Query: 783  LLAAINQKIQLYKWMLRDD---GTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISL 839
            L AAI QKI L  W  R++   G   + ++   H H L     T  +FI+  D+ KS+SL
Sbjct: 762  LAAAIGQKIFL--WAFRNNDLIGVAFIDTQI--HIHTLC----TIKNFILAADIRKSVSL 813

Query: 840  LIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR-KNSEGATDE-ERG 897
            L +  E+ ++    R+          +L    +    +  + F +R K S+ AT++   G
Sbjct: 814  LRFSDEDRSLSLVTRES----FGGQRLLRRADFNAGSHVCSFFRMRSKLSDPATEKLLTG 869

Query: 898  RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFL 957
             +E              RH                  +F T++G IG +  +  + Y  L
Sbjct: 870  PMER-------------RH----------------VTMFATLDGSIGYLIPMTEKTYRRL 900

Query: 958  EKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVD--AKNFLDGDLIESFLDLSRTRMDEIS 1015
              LQ  L        GLN + +R   ++ K+++   KN LDGDL+  +  LS     E++
Sbjct: 901  LMLQNALTTQTLHTAGLNPKGFRMVKHQTKSLENTHKNILDGDLLWKYTFLSVNERTELA 960

Query: 1016 KTMNVSVEELCKRVEELTRL 1035
            K +  SVE++   + ++ RL
Sbjct: 961  KKIGTSVEQILDDLMDVERL 980


>gi|294889072|ref|XP_002772683.1| spliceosomal protein sap, putative [Perkinsus marinus ATCC 50983]
 gi|239877121|gb|EER04499.1| spliceosomal protein sap, putative [Perkinsus marinus ATCC 50983]
          Length = 229

 Score = 84.7 bits (208), Expect = 2e-13,   Method: Composition-based stats.
 Identities = 56/199 (28%), Positives = 97/199 (48%), Gaps = 16/199 (8%)

Query: 314 QVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLR---------SSTDDPFDT 363
           Q+   SG     +LRI+R+G+ + E  + EL G    +++++         +  +   D 
Sbjct: 34  QIYALSGRGPRSALRILRHGLTLGEAGASELPGQPNALFTIKPFGASYAPVAEGEVESDR 93

Query: 364 FLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSST 423
           ++VVSF+ +T  L +   D + E    GF  +  TLF         +QV    +R V++ 
Sbjct: 94  YIVVSFVDQTLTLLVT-SDNIHEVTDSGFAKEQPTLFAMRMQDKSAIQVMPTGIRHVAAG 152

Query: 424 SRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG---DGILTEVKHAQLEYEI 480
            R    EW++PPG  V +A +N SQV++A  GG +   E+    +G L+EV    +  E+
Sbjct: 153 RRT--TEWRAPPGRQVTMAASNGSQVVIALSGGEIQLFELDADTNGHLSEVAKRDIGCEV 210

Query: 481 SCLDINPIGENPSYSQIAA 499
           + L + P+    + SQ  A
Sbjct: 211 AALTVQPLSSGRTRSQFMA 229


>gi|313232279|emb|CBY09388.1| unnamed protein product [Oikopleura dioica]
          Length = 1451

 Score = 84.7 bits (208), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 89/373 (23%), Positives = 171/373 (45%), Gaps = 40/373 (10%)

Query: 693  FVRLLDDQTFEFI--STYPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENE 745
            FV +    ++E I  S   +D  E+     S     ++++     Y  +GT+ +  E+ +
Sbjct: 1083 FVEMFSPASWEVIPNSRIEMDAHEHILCCRSVYLKSEASMSGRKQYIAIGTSNICGEDFQ 1142

Query: 746  PTKGRILVFIVED-----------GKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLY 794
             ++GR+++  V D            K + + +   +G V ++++ +G L+AAI QK+ ++
Sbjct: 1143 -SRGRLILLEVIDVVPEPGKPLTRYKYKTVFDASQRGPVSAVDSLDGALIAAIGQKVFIH 1201

Query: 795  KWM---LRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEE 851
             +    LR  G  + Q    H  H    Y        +VGD+ + I+LL ++ E   I +
Sbjct: 1202 AFQDDNLRATGFVDTQLYT-HATHCFKNYA-------LVGDIQQGITLLRHQGERNCISQ 1253

Query: 852  RARDYNANWMSAVEILDDDIYLG-----AENNFNLFTVR---KNSEGATDEERGRLEVVG 903
             +R   A  ++AV IL D   +G      + N  ++  +   K S G     R     +G
Sbjct: 1254 ISRARRAGEVTAVGILLDGNQVGLVSTDMQRNLQVYMYKPDQKESNGGKQLVRQADINLG 1313

Query: 904  EYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTN 963
            +  +  + +  R      ++  ++        +  ++G IG I  +  + +  LE LQT 
Sbjct: 1314 KRVISIWNSLGRQNDTFTKVALTENDARHVTFYAGLDGSIGDIVPVSEKVFRRLEMLQTL 1373

Query: 964  LRKVIKGVGGLNHEQWRSFNNEKKTVD--AKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
            ++  +   GGLN  ++R   NE + ++  AKN +DGDL+E F  LS T   ++S+ + V+
Sbjct: 1374 VQSHLPHYGGLNPREYRYCTNEYRDLENAAKNIIDGDLLERFNGLSFTEQTDLSRKIGVT 1433

Query: 1022 VEELCKRVEELTR 1034
             E L   + ++ R
Sbjct: 1434 REALLDDMMDVQR 1446



 Score = 55.5 bits (132), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 71/339 (20%), Positives = 146/339 (43%), Gaps = 62/339 (18%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSD----RIGR 61
           ++G +  +   R  G  +D L ++ E  K  +++++ ++  L T +M    D    R  R
Sbjct: 81  LWGIVVCMTRLRLAGSVRDSLLLSIEESKCVIVEYEPDTGSLSTISMHFFQDEDLRRGFR 140

Query: 62  PTDNGQIGIIDPDCRLIGLHLYDGLFKVIPF------DNKGQLKEA-------FNIRLEE 108
              +  +  +D   R   + +Y     V+PF      D  GQ  +A       F   + +
Sbjct: 141 KLSSMALARVDGFNRCAAVLVYGSYLAVLPFRRSTERDLSGQRHQAVFYENSGFIANMID 200

Query: 109 LQ--------VLDIKFLYGCAKPTIVVLYQ----------DNKDARHVKTYEVALKDKDF 150
           LQ        VLD +FL G   PTI++LY+          + +D   +    + L D+  
Sbjct: 201 LQSLPVKIASVLDFQFLEGYNDPTILLLYEALPTWTGRVTERQDTCGMVALSINLIDETH 260

Query: 151 VEGP--WSQNNL---DNGADLLIPVPPPLCGVLIIGEETIVYCSAN------AFKAIPIR 199
              P  W    L   +  +  L P+P PL G L+    +++Y   +      A  ++P+ 
Sbjct: 261 ---PVIWQMAGLPFPNPYSSALFPIPKPLGGSLLFATNSLIYLDQSVPPYGVALNSLPLG 317

Query: 200 PS----ITKAYGRVDADGSRY-LLGDHAGLLHL-----LVITHEKEKVTGLK---IELLG 246
            +     T+    ++    +  +L D +  + L      +IT +K+ +  ++   ++ + 
Sbjct: 318 CTNFALKTQDVAPLNLQNCKACMLSDDSICVSLESGDVYIITLKKDSLNNVRRFYLDQVA 377

Query: 247 ETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGS 285
            + I +T+S L + ++++GS  G+S L++   + ++K S
Sbjct: 378 SSVIPTTLSKLSDNLIFLGSRLGNSLLLRYKCKENSKKS 416


>gi|118346970|ref|XP_001006962.1| CPSF A subunit region family protein [Tetrahymena thermophila]
 gi|89288729|gb|EAR86717.1| CPSF A subunit region family protein [Tetrahymena thermophila SB210]
          Length = 1112

 Score = 84.3 bits (207), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 193/935 (20%), Positives = 380/935 (40%), Gaps = 133/935 (14%)

Query: 176  GVLIIGEETIVYCSANAFKAIPIR---PSITKAYGRVDADGSRYLLGDHAGLLHLLVITH 232
            G+++  +  I Y S N    I      P     +  V  D + ++    A     L + +
Sbjct: 234  GLILFSDYKIFYYSYNDKSTIEFTTPFPDEVICHSHV-KDTTYFIATQDAAFQSSLYLVN 292

Query: 233  EKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLN--LQPDAKGSYVEVL 290
                   + I+ LG  S  ++I  L    ++I S   D ++IK++     + +G ++EV 
Sbjct: 293  ILASSRSIDIKYLGRISPPTSIIQLPEQHIFITSVDDDCKVIKVHDKFIKNLEG-HLEVK 351

Query: 291  ERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKG 349
             +  NL  I +  +++ +     Q +  S   K+ S+ I + G+ + ++  ++    I+ 
Sbjct: 352  LKLSNLSSIYNMKLLESDF---NQRLIISSPRKEQSMNIFQKGMSLIQKGEIKFNSPIRE 408

Query: 350  MWSLRSSTDDPFDTFLVVSFISETRIL-----------AMNLEDELEETEIEGFCSQTQT 398
            M  + S        F+VVSF  +T IL            MN+   L         S++Q 
Sbjct: 409  MLVVHS-------VFIVVSFDFQTLILKLDDKKGSFNVVMNINCGLLNVFALESSSESQK 461

Query: 399  LFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHL 458
            L C          V++ S++L+  T   + +E  +     + ++T N + ++ A    +L
Sbjct: 462  LVCL---------VSAQSIKLIDLTRANILSEI-AISNKRIILSTINKNILITADSTNNL 511

Query: 459  VYLEIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNL 518
                  +  +++V+        S   IN      SY    AV +W    + + +  D   
Sbjct: 512  QAFNFLNNQISQVETFSYNSRYSYSAIN------SYGNYIAVALWFQSYIILINASDPTK 565

Query: 519  ITKEHLGGEIIPRSVLLCAFEGISYLLCALGDG--HLLNFLLNMKTGELT--DRKKVSLG 574
              +  +  + + RS++   F   + LL    DG   L+ +  + +   +T  ++K + +G
Sbjct: 566  THQIDIPTQKVVRSLI---FHEENKLLIGYSDGIISLVYYSTDDQNNIVTSNNQKNMVIG 622

Query: 575  TQPIT---LRTFSSKNTTHVFAASDRPTVIY---------------SSNKKLLYSNVNLK 616
             +P+    LR FSS      +  S  P  I                 S ++   +  N++
Sbjct: 623  YKPVKFRQLRLFSS------YCQSSTPACISCISNEVSILSLRKVDESYQRYQKTYFNIQ 676

Query: 617  EVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSI--PLGEHPRRICHQE---- 670
            +V+ +        P  L +AK+  L+IG I+  QK HI S    + E     C  E    
Sbjct: 677  DVNQIDEIEINGVP-YLIMAKKNCLSIGGIEVSQKFHIYSFGKDVKEQNEFRCQTELVDV 735

Query: 671  -QSRTFAICSLKNQS---CAEESEMHFVR---LLDDQTFEFISTYPL----DTFEYGCSI 719
             ++ + +I  ++N+          +H ++   LL    F     Y L    D F  G   
Sbjct: 736  IRNMSISISIVENEQDDVVCSNILLHSIKNQDLLQQLKFNNTCIYSLLVVGDYFFIGGK- 794

Query: 720  LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKET-------KGA 772
                  + +  Y      Y++  EN+  +   L F +   ++Q  +E             
Sbjct: 795  -----QEQNQTYEGFLQVYLVDNENKMIQVDKLAFKIPVMQIQKYSENSIAIILGHENFK 849

Query: 773  VYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGD 832
            +Y +   + +  + + Q  Q +K++      +E+ S+   +   L+L    R   I++ D
Sbjct: 850  IYQMPYTSAQEYSILKQNTQ-HKFL------QEIVSQSNLNITALSLDCNKRNQ-ILIAD 901

Query: 833  LMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGAT 892
              + I L  ++ +   +   A+     + +  + L D+  + +++N N+   R+N     
Sbjct: 902  RTQGIQL--FQEQNKKVMLAAKSPFPVFSTHAKFLGDENIVLSDSNTNIVIFRQNDFAEN 959

Query: 893  DEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQ-------IPTVIFGTVNGVIGV 945
            D ER +L V+   + G+ V      SL  ++   DV         +  VI+GT  G IGV
Sbjct: 960  DFERMKLLVLSAINNGDIV----ITSLRRKVNTKDVKDKYITTEGLDQVIYGTQKGWIGV 1015

Query: 946  IASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNN-EKKTVDAKNFLDGDLIESFL 1004
            I SL  + Y  L+ LQ ++ +  K     ++ +W+S  N  +   D  NF+DG++I   L
Sbjct: 1016 IMSLNEQAYTVLQDLQESILQKFKCPLKFDYNKWKSIRNIPQSKSDKSNFIDGEIIFKVL 1075

Query: 1005 DLSRTRMDEISKTMNV----SVEELCKRVEELTRL 1035
             +S++ + ++   M+     S+ E+ + +E L  L
Sbjct: 1076 KMSQSELVQVLDGMSAMPKPSIAEMEQLIENLDLL 1110


>gi|119484094|ref|XP_001261950.1| cleavage and polyadenylation specificity factor subunit A, putative
            [Neosartorya fischeri NRRL 181]
 gi|148886830|sp|A1DB13.1|CFT1_NEOFI RecName: Full=Protein cft1; AltName: Full=Cleavage factor two protein
            1
 gi|119410106|gb|EAW20053.1| cleavage and polyadenylation specificity factor subunit A, putative
            [Neosartorya fischeri NRRL 181]
          Length = 1400

 Score = 84.3 bits (207), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 88/371 (23%), Positives = 160/371 (43%), Gaps = 37/371 (9%)

Query: 694  VRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVY-----YCVGTAYVLPEENEPTK 748
            ++++  +T+  I +Y L   EY  ++ +       N +       VGTA+    E+ P++
Sbjct: 1027 LKVVSPRTWTVIDSYSLGPAEYVMAVKNMDLEVSENTHERRNMIVVGTAFAW-GEDIPSR 1085

Query: 749  GRILVFIV-----------EDGKLQLIAEKETKGAVYSLNAFNGK--LLAAINQKIQLYK 795
            G I VF V            D KL+LI ++  KGAV +L+   G+  L+AA  QK  +  
Sbjct: 1086 GCIYVFEVIKVVPDPEKPETDRKLKLIGKELVKGAVTALSQIGGQGFLIAAQGQKCMVRG 1145

Query: 796  WMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARD 855
              L++DG+    +      ++  +         ++GD +K +    Y  E   +    +D
Sbjct: 1146 --LKEDGSLLPVAFMDMQCYVNVVKELKGTGMCIMGDAVKGLWFAGYSEEPYKMSLFGKD 1203

Query: 856  YNANWMSAVEILDDD---IYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVN 912
                 + A E L D      L A+++ NL  ++ + E        RL    ++H+G F  
Sbjct: 1204 QGYLEVVAAEFLPDGDKLFILVADSDCNLHVLQYDPEDPKSSNGDRLLARSKFHMGHFAT 1263

Query: 913  RFRHGSLVM-----RLPDSDVGQIPT------VIFGTVNGVIGVIASLPHEQYLFLEKLQ 961
                    M      + D D  +I +      V+  + +G +G++ S+P E Y  L  LQ
Sbjct: 1264 TMTLLPRTMVSSEKAMADPDSMEIDSQTISQQVLITSQSGSVGIVTSVPEESYRRLSALQ 1323

Query: 962  TNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
            + L   ++   GLN   +R+   E      +  LDG+L+  +LD+ + R  EI+  +   
Sbjct: 1324 SQLTNSLEHPCGLNPRAYRAV--ESDGTAGRGMLDGNLLYQWLDMGQHRKMEIAARVGAH 1381

Query: 1022 VEELCKRVEEL 1032
              E+   +E +
Sbjct: 1382 EWEIKADLEAI 1392


>gi|405977622|gb|EKC42064.1| Cleavage and polyadenylation specificity factor subunit 1
            [Crassostrea gigas]
          Length = 1369

 Score = 84.3 bits (207), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 90/329 (27%), Positives = 144/329 (43%), Gaps = 32/329 (9%)

Query: 731  YYCVGTAYVLPEENEPTKGRILV-----FIVEDG------KLQLIAEKETKGAVYSLNAF 779
            Y  +GT   L EE   ++GR+++      + E G      K++ + EKE KG V +L   
Sbjct: 1045 YIVMGTNLSLGEEVT-SRGRVIIADIIEVVPEPGMPLTKHKIKTLYEKEQKGPVTALADI 1103

Query: 780  NGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISL 839
            NG L+ AI QK  LY W L+D+   +L        HI    + T    I+ GD++KS+S+
Sbjct: 1104 NGLLITAIGQK--LYIWQLKDN---DLMGVAFIDTHIYIHTLVTIKHIILAGDILKSVSV 1158

Query: 840  LIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG---AENNFNLFTVRKNSEGATDEER 896
              Y+ E   +   +RD     +   + L D+  L    ++   NL       E       
Sbjct: 1159 YQYQEEHKVLSIVSRDPRPLEVYTADFLIDNTQLCCLVSDRMKNLVVYSYQPEARESHGG 1218

Query: 897  GRLEVVGEYHLGEFVNRFRHGSLVMRLPDSD--VGQIP----TVIFGTVNGVIGVIASLP 950
             RL    +++ G  V+        +  P SD  +   P       F T++G +G +  L 
Sbjct: 1219 QRLIRKADFNAGSNVSSMFRVRCKLYDPSSDKRMTGAPEKRHITYFATLDGSLGFVLPLS 1278

Query: 951  HEQYLFLEKLQTNLRKVIKGVGGLNHEQWR----SFNNEKKTVDAKNFLDGDLIESFLDL 1006
             + Y  L  LQ  L   I  V GLN   +R    +F   +     KN LDG+L+  + +L
Sbjct: 1279 EKVYRRLFMLQNALVTHIPHVAGLNPRSYRHVIGTFPELRNP--QKNILDGELLWKYTNL 1336

Query: 1007 SRTRMDEISKTMNVSVEELCKRVEELTRL 1035
            S     EI+K +  S +++   + E+ RL
Sbjct: 1337 SIMEKIEIAKRLGTSNDQIMDDLMEIDRL 1365



 Score = 45.1 bits (105), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 63/250 (25%), Positives = 112/250 (44%), Gaps = 39/250 (15%)

Query: 241 KIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYV-EVLERYVNLGPI 299
           K ++ G T IAS +S ++N  +Y    YG ++    N       SY  EV +   N+GP 
Sbjct: 359 KKKVEGSTEIASDVSQIEN--LYDLEVYGSAE----NPTSTTITSYTFEVCDNIWNIGPC 412

Query: 300 VDFCVVD---LERQGQG------QVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGM 350
            +  + +   L  +         ++V  SG  K+G+L +++  I      + EL G   M
Sbjct: 413 GNIVMGEPAFLSEEFSSCEDPDIEMVMTSGYGKNGALSVLQRSIRPQVVTTFELPGCLDM 472

Query: 351 WSLRS------------STDDPFD-------TFLVVSFISETRILAMNLEDELEETEIEG 391
           W+++S            S +D  D       +FL++S    + IL      E+ E +  G
Sbjct: 473 WTVKSLVPKEKSEDKENSMEDDSDDNIEGGHSFLILSRSDSSMILETG--QEMNELDHSG 530

Query: 392 FCSQTQTLFCHDAIYNQ-LVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVL 450
           F +QT T+F  +   ++ +VQV+  S+RL+    R++++         V  + A+   VL
Sbjct: 531 FSTQTTTIFAGNIGGDRYIVQVSDTSLRLLEGV-RQIQHIPLDTGSPVVQCSLADPYIVL 589

Query: 451 LATGGGHLVY 460
           L   G  L++
Sbjct: 590 LTQEGQILMF 599



 Score = 44.7 bits (104), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 66/325 (20%), Positives = 131/325 (40%), Gaps = 59/325 (18%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
           ++G I +++  +  G  +D L ++    K  V+++D  + +L T ++    +    P+  
Sbjct: 13  LFGNIMSMKYVKLPGALRDSLLLSFSEAKLSVVEYDPGTHDLQTTSLHFFEE----PSMK 68

Query: 66  GQIGI--------IDPDCRLIGLHLYDGLFKVIPF---------DN-----KGQLKEAFN 103
           G            +DPD R   + +Y     ++PF         DN     K  +  ++ 
Sbjct: 69  GGFFTNYCIPEVRVDPDGRCAAMLVYGTHMVILPFRRDVMVEEGDNLAGTSKSPILSSYI 128

Query: 104 IRLEE-----LQVLDIKFLYGCAKPTIVVLYQD-----NKDARHVKTYEVALKDKDFVEG 153
           I L       + V D +FL+G  +PT+ +L++       + A    T  +     +  E 
Sbjct: 129 IDLRNFDEKIINVKDFQFLHGYYEPTVFILFEPLQTWAGRTAVRADTCSIVAISLNLQEK 188

Query: 154 P----WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVY---------CSANAFKA----I 196
                WS  +L      ++ VP P+ GV+II   +++Y          S N+  A     
Sbjct: 189 VHPVIWSLGSLPFDCCQVLAVPRPIGGVIIIAVNSLLYLNQSVPPYGVSLNSISAQSTLF 248

Query: 197 PIR--PSITKAYGRVDADGSRY---LLGDHAGLLHLLVITHEKEK-VTGLKIELLGETSI 250
           P+R    +  A     A    Y   +L    G L++L +  +  + V     +    + +
Sbjct: 249 PLRVQEGVRIALDCCQAAFMSYDKIVLSLKGGELYVLTLVVDGMRSVRSFNFDKSAASVL 308

Query: 251 ASTISYLDNAVVYIGSSYGDSQLIK 275
            S +   ++  +++GS  G+S L+K
Sbjct: 309 TSCMCICEDGFLFLGSRLGNSLLLK 333


>gi|154342093|ref|XP_001566998.1| putative CPSF-domain protein [Leishmania braziliensis
            MHOM/BR/75/M2904]
 gi|134064323|emb|CAM40524.1| putative CPSF-domain protein [Leishmania braziliensis
            MHOM/BR/75/M2904]
          Length = 1347

 Score = 84.0 bits (206), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 108/465 (23%), Positives = 185/465 (39%), Gaps = 101/465 (21%)

Query: 637  KEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHF--- 693
            ++G L IG+++ ++K     +PLGE   ++        +A+   K     E  E+ F   
Sbjct: 798  QDGRLHIGSLEALEKTSRTFLPLGETVTQLHETAAWGGYAVAVRK----VEGDEVLFLPS 853

Query: 694  ------------------VRLLDDQTFEFISTYPLDTFEYG-----C-------SILSCS 723
                              V LL+++   F+ T  LD  E G     C       +I S  
Sbjct: 854  SVIQSPWTADLGLWRTAAVPLLENERCVFLQTVRLDGSEGGGPAERCGEQIDVPAITSAG 913

Query: 724  FSD-DSNVYYCVGTAYVLPEENEPTKGRILVFIVED----GKLQLIAEKETKGAVYS--- 775
             S+ D      VG+++  P+E     GR++ F + +     +L+LIA K+  GA+     
Sbjct: 914  VSEEDWQHLLLVGSSFTFPDEQRARSGRVMWFALHEERQGQRLRLIASKDIGGALQCCAE 973

Query: 776  LNAFNGKLLAAINQKIQLYKWMLRDDG-------------TR--ELQSECGHHGHILALY 820
            +  + G++   +N  + LYKW   D               TR   L +       ++AL 
Sbjct: 974  VPYYKGRIALGVNGCVCLYKWNTEDQTFVAEERCRVGLTVTRLIPLYNTALAASVLVALD 1033

Query: 821  VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFN 880
            V+    FI V  L  S+ +L     EG +      Y  +    + + DD++   A     
Sbjct: 1034 VRHSAFFIEVDLLQGSLKVLC---REGNLRGVMDGYVGSDAENLCLFDDNLNFTALKVVP 1090

Query: 881  LFT---VRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSL----VMRLPDSDVGQI-- 931
            L         +  A+   + R EV  +YH+G+ V   R GS     +M+ P         
Sbjct: 1091 LPVEPGDGDAAAAASGTPQCRFEVRAQYHVGDLVTCVRPGSFAATSLMKAPTPSSSVPSP 1150

Query: 932  --------PTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWR--- 980
                    P ++F T +G  GV+  L    YL L  L+ +L + +  +GGL+H+ +R   
Sbjct: 1151 LLLPGIAGPQLVFATAHGGFGVVTPLHAATYLVLRALEASLERTLPPLGGLSHQAFREVL 1210

Query: 981  -----------------SFNNEK-KTVDAKNFLDGDLIESFLDLS 1007
                             +   E+ +  ++ N +DGD++E FL LS
Sbjct: 1211 RAGQERGVSYLASKVGCALTRERLRRYESLNTVDGDMVEQFLWLS 1255


>gi|341892673|gb|EGT48608.1| CBN-CPSF-1 protein [Caenorhabditis brenneri]
          Length = 1440

 Score = 84.0 bits (206), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 79/311 (25%), Positives = 142/311 (45%), Gaps = 28/311 (9%)

Query: 739  VLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWML 798
            V+PE  +PT  R         K++++ +KE KG V  L A  G LL+ + QK+  + W  
Sbjct: 1140 VVPEPGQPTSNR---------KIKVLFDKEQKGPVTGLCAMEGLLLSGMGQKV--FIWQF 1188

Query: 799  RDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYN- 857
            +D+    L S    H ++  L+  +     +  D  +S+SL+ ++ E  A+   +RD   
Sbjct: 1189 KDNDLMGL-SFLDMHYYVYQLH--SLRSIALACDARESMSLIRFQEENKAMSVASRDDRK 1245

Query: 858  -ANWMSAVEILDDDIYLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
             A    A + + D  ++G   ++ N N+       E        RL V    ++G  +N 
Sbjct: 1246 CAQAPMAAQFMVDGAHIGFLLSDENGNITLFNYAPEAPESNGGERLTVRAAINIGTNINA 1305

Query: 914  FR----HGSLV--MRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKV 967
            F     H +L+           Q  + IF +++G  G I  L  + Y  L  LQT +  V
Sbjct: 1306 FLRVKGHTALLNLHEFEKEAAEQRMSTIFASLDGSFGFIRPLTEKSYRRLHFLQTFIGSV 1365

Query: 968  IKGVGGLNHEQWRSFNNEKKTVD---AKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEE 1024
             + + GL+ +  RS    +  V+   A+N +DGD++E +L+LS     ++++ + V    
Sbjct: 1366 SQQIAGLHIKGARSAKPPQPIVNGRNARNLIDGDVVEQYLNLSTYDKTDLARRLGVGKYH 1425

Query: 1025 LCKRVEELTRL 1035
            +   + EL R+
Sbjct: 1426 IIDDLMELRRM 1436



 Score = 77.8 bits (190), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 123/532 (23%), Positives = 210/532 (39%), Gaps = 112/532 (21%)

Query: 72  DPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEEL-----QVLDIKFLYGCAKPTIV 126
           DP+ R     +Y     ++PF    +   ++ I L+++      V DI FL G  +PTI+
Sbjct: 144 DPENRCAASLVYGKHIAILPFHENSKRIHSYIIPLKQIDPRLDNVADIVFLDGYYEPTIL 203

Query: 127 VLYQD----------NKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCG 176
            LY+             D   +    V + D+ F    W   NL      L+P+P PL G
Sbjct: 204 FLYEPLQTTPGRACVRYDTMCIMGVSVNIVDRQFAV-VWQTANLPMDCATLLPIPKPLGG 262

Query: 177 VLIIGEETIVYCSA-------------NAFKAIPIRPSITKAYGRVDADGSRYL------ 217
            ++ G  TIVY +              + F   P++  +      +D   S Y+      
Sbjct: 263 AIVFGSNTIVYLNQAVPPCGIVLNSCYDGFTKFPLK-DMKSMKMTLDCSTSVYMEDGRIA 321

Query: 218 LGDHAGLLHL--LVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIK 275
           +G   G L L  LV +     V  L+   + +TSIA T++      +++GS  GDSQL++
Sbjct: 322 VGTRDGELFLLRLVTSSGGATVKSLEFSKVWDTSIAYTLTVCAPGHLFLGSRLGDSQLLE 381

Query: 276 LNL-----------QPDAKGSYVEV---------------------------------LE 291
            +L           + + + ++VE                                  L+
Sbjct: 382 YSLIKTTRESVKRHKMEQEQNHVEAELDEDDLELYGGAIEEQQNDDEEQITESLQFSELD 441

Query: 292 RYVNLGPIVDFCV----------VDLERQGQ-GQVVTCSGAYKDGSLRIVRNGIGINEQA 340
           R  N+GP+   CV          VD +R+     V+T SG  K+GSL + +  +      
Sbjct: 442 RLRNIGPVKSMCVGRPNYMSNDLVDAKRRDPVFDVITASGHGKNGSLCVHQRSLRPEIVT 501

Query: 341 SVELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLF 400
           S  L+G + +W++    ++    +L+VS I  T +L +  E    E  +  F +   T+ 
Sbjct: 502 SSLLEGAEQLWAVGRKENESH-KYLIVSRIRSTLVLELGEELIELEEPL--FVTGQPTVA 558

Query: 401 CHDAIYNQL-VQVTSGSVRLVSSTSR--ELRNEWKSPPGYSVNVATANASQVLLATGGGH 457
             +       VQVTS S+ LV+   +  E++ +   P    V  +  +    +L   G  
Sbjct: 559 AGELSQGAFAVQVTSTSIALVTDGQQLAEVKIDSNFP---VVQASIVDPYVAVLTQNGRL 615

Query: 458 LVYLEIGDGI--LTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDIS 507
           L+Y  + +    L E+  AQ  +              S SQI ++ M+ D S
Sbjct: 616 LLYTLVSNPYMQLQEIDLAQTPFSTFIAQ--------SASQITSISMYADAS 659


>gi|358348136|ref|XP_003638105.1| Pre-mRNA-splicing factor rse1 [Medicago truncatula]
 gi|355504040|gb|AES85243.1| Pre-mRNA-splicing factor rse1 [Medicago truncatula]
          Length = 1370

 Score = 84.0 bits (206), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 61/236 (25%), Positives = 116/236 (49%), Gaps = 28/236 (11%)

Query: 269 GDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLR 328
           GDS ++KL    D +  +  +++   N+ PI D    D   +   Q+  C G   +GSLR
Sbjct: 441 GDSVVLKLK---DGRLCFTNLIQ---NIAPIFDVTSGDYHDEKHDQMFACCGVTPEGSLR 494

Query: 329 IVRNGIGINE--QASVELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEE 386
           ++++GI + +  +     +G+ G W++R    D + +FLV+SF+ ETRIL++ L    + 
Sbjct: 495 VIQSGINVEKLLRTPSTYEGVAGTWTVRMKISDQYHSFLVLSFLGETRILSVGLSFT-DV 553

Query: 387 TEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSST----------SRELRNEWKSPPG 436
           T+  GF     TL C       LVQ+   +V+L   T          S  +   W  P  
Sbjct: 554 TDSVGFQPNVCTLACGLVSDGLLVQIYQSAVKLCLPTKDGHSEGIPLSSPICTSWY-PDN 612

Query: 437 YSVNVATANASQVLLATGGGHLVYLEIGDGILT-------EVKHAQLEYEISCLDI 485
            ++++     + ++++T     +++ +G  +L+       E++H +L+ E+SC+ I
Sbjct: 613 LNISLGAVGHNFIVVSTSNPCFLFI-LGVRMLSAYQYEIYEMQHLELQNEVSCISI 667



 Score = 53.9 bits (128), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 52/205 (25%), Positives = 86/205 (41%), Gaps = 23/205 (11%)

Query: 828  IVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKN 887
            I VGD    I    Y  E   +E+   D +   ++   ++DD+  + ++   ++  +  +
Sbjct: 1134 IAVGDNRDGILFFSYHEEARKLEQLYGDPSQRLVADCILMDDNTAIVSDRKGSIAVLCSD 1193

Query: 888  ----SEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV---GQIP-------- 932
                   A+ E   RL     Y + E     R GS   RLP  D+   G  P        
Sbjct: 1194 HLEAPNNASTECNLRLSCA--YFMAEIAVSIRKGSYSYRLPADDLLSGGIGPKTNVDSLQ 1251

Query: 933  -TVIFGTVNGVIGVIASLPHEQYLFLEKLQTNL--RKVIKGVGGLNHEQWRSFNNEKKTV 989
             T++  T+ G I +   L  E+Y  LE +Q  L    +   V G +H ++RS  N    V
Sbjct: 1252 NTILVSTLLGSIMIFIPLSREEYELLEAVQARLAVHHLTAPVLGNDHNEFRSRENP---V 1308

Query: 990  DAKNFLDGDLIESFLDLSRTRMDEI 1014
                 LDGD++  FL+L+  + + I
Sbjct: 1309 GTPKILDGDMLTQFLELTNMQQNNI 1333



 Score = 46.2 bits (108), Expect = 0.085,   Method: Compositional matrix adjust.
 Identities = 42/193 (21%), Positives = 84/193 (43%), Gaps = 15/193 (7%)

Query: 573  LGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDS 632
            +G  P+ L          + A SDRP +++S+   + Y++++ +  SH  P  S   P  
Sbjct: 817  IGITPVFLVPLDDTLDADIIALSDRPWLLHSARHSISYTSISFQPSSHATPVCSIDCPKG 876

Query: 633  LAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAI--CSLKNQSCAEESE 690
            +    E  L +  +   ++L++R   L   PR++ +  +S+   +    L   +C  +  
Sbjct: 877  ILFVAENSLHLVEMVYSKRLNMRKFHLKGTPRKVLYHNESQMLLVMRTELSIGTCLSD-- 934

Query: 691  MHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTA-YVLPE-----EN 744
               +  +D  +   +S++ L+  E   S+       +  +   VGT+ Y  P      E 
Sbjct: 935  ---ICCVDPLSGSVLSSFRLELGETATSMELIRVGSEQVLV--VGTSLYSGPPAIPSGEA 989

Query: 745  EPTKGRILVFIVE 757
            E  KGR+LV  ++
Sbjct: 990  ESAKGRLLVLCID 1002


>gi|146324727|ref|XP_747211.2| cleavage and polyadenylation specificity factor subunit A, putative
            [Aspergillus fumigatus Af293]
 gi|148886828|sp|Q4WCL1.2|CFT1_ASPFU RecName: Full=Protein cft1; AltName: Full=Cleavage factor two protein
            1
 gi|129556124|gb|EAL85173.2| cleavage and polyadenylation specificity factor subunit A, putative
            [Aspergillus fumigatus Af293]
          Length = 1401

 Score = 83.6 bits (205), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 91/371 (24%), Positives = 159/371 (42%), Gaps = 37/371 (9%)

Query: 694  VRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVY-----YCVGTAYVLPEENEPTK 748
            ++++  +T+  I +Y L   EY  ++ +       N +       VGTA+    E+ P++
Sbjct: 1028 LKVVSPRTWTVIDSYSLGPDEYVMAVKNMDLEVSENTHERRNMIVVGTAFAR-GEDIPSR 1086

Query: 749  GRILVFIV-----------EDGKLQLIAEKETKGAVYSLNAFNGK--LLAAINQKIQLYK 795
            G I VF V            D KL+LI ++  KGAV +L+   G+  L+AA  QK  +  
Sbjct: 1087 GCIYVFEVIKVVPDPEKPETDRKLKLIGKELVKGAVTALSQIGGQGFLIAAQGQKCMVRG 1146

Query: 796  WMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARD 855
              L++DG+    +      ++  L         ++GD +K +    Y  E   +    +D
Sbjct: 1147 --LKEDGSLLPVAFMDMQCYVNVLKELKGTGMCIMGDAVKGLWFAGYSEEPYKMSLFGKD 1204

Query: 856  YNANWMSAVEILDDD---IYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVN 912
                 + A E L D      L A+++ NL  ++ + E        RL    ++H+G F  
Sbjct: 1205 QGYLEVVAAEFLPDGDKLFILVADSDCNLHVLQYDPEDPKSSNGDRLLARSKFHMGHFAT 1264

Query: 913  RFR-------HGSLVMRLPDS---DVGQIPT-VIFGTVNGVIGVIASLPHEQYLFLEKLQ 961
                           M  PDS   D   I   V+  + +G +G++ S+P E Y  L  LQ
Sbjct: 1265 TMTLLPRTMVSSEKAMANPDSMEIDSQTISQQVLITSQSGSVGIVTSVPEESYRRLSALQ 1324

Query: 962  TNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
            + L   ++   GLN   +R+   E      +  LDG+L+  +LD+ + R  EI+  +   
Sbjct: 1325 SQLANSLEHPCGLNPRAYRAV--ESDGTAGRGMLDGNLLYQWLDMGQHRKMEIAARVGAH 1382

Query: 1022 VEELCKRVEEL 1032
              E+   +E +
Sbjct: 1383 EWEIKADLEAI 1393


>gi|159123784|gb|EDP48903.1| cleavage and polyadenylation specificity factor subunit A, putative
            [Aspergillus fumigatus A1163]
          Length = 1401

 Score = 83.6 bits (205), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 91/371 (24%), Positives = 159/371 (42%), Gaps = 37/371 (9%)

Query: 694  VRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVY-----YCVGTAYVLPEENEPTK 748
            ++++  +T+  I +Y L   EY  ++ +       N +       VGTA+    E+ P++
Sbjct: 1028 LKVVSPRTWTVIDSYSLGPDEYVMAVKNMDLEVSENTHERRNMIVVGTAFAR-GEDIPSR 1086

Query: 749  GRILVFIV-----------EDGKLQLIAEKETKGAVYSLNAFNGK--LLAAINQKIQLYK 795
            G I VF V            D KL+LI ++  KGAV +L+   G+  L+AA  QK  +  
Sbjct: 1087 GCIYVFEVIKVVPDPEKPETDRKLKLIGKELVKGAVTALSQIGGQGFLIAAQGQKCMVRG 1146

Query: 796  WMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARD 855
              L++DG+    +      ++  L         ++GD +K +    Y  E   +    +D
Sbjct: 1147 --LKEDGSLLPVAFMDMQCYVNVLKELKGTGMCIMGDAVKGLWFAGYSEEPYKMSLFGKD 1204

Query: 856  YNANWMSAVEILDDD---IYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVN 912
                 + A E L D      L A+++ NL  ++ + E        RL    ++H+G F  
Sbjct: 1205 QGYLEVVAAEFLPDGDKLFILVADSDCNLHVLQYDPEDPKSSNGDRLLARSKFHMGHFAT 1264

Query: 913  RFR-------HGSLVMRLPDS---DVGQIPT-VIFGTVNGVIGVIASLPHEQYLFLEKLQ 961
                           M  PDS   D   I   V+  + +G +G++ S+P E Y  L  LQ
Sbjct: 1265 TMTLLPRTMVSSEKAMANPDSMEIDSQTISQQVLITSQSGSVGIVTSVPEESYRRLSALQ 1324

Query: 962  TNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
            + L   ++   GLN   +R+   E      +  LDG+L+  +LD+ + R  EI+  +   
Sbjct: 1325 SQLANSLEHPCGLNPRAYRAV--ESDGTAGRGMLDGNLLYQWLDMGQHRKMEIAARVGAH 1382

Query: 1022 VEELCKRVEEL 1032
              E+   +E +
Sbjct: 1383 EWEIKADLEAI 1393


>gi|302504585|ref|XP_003014251.1| hypothetical protein ARB_07556 [Arthroderma benhamiae CBS 112371]
 gi|291177819|gb|EFE33611.1| hypothetical protein ARB_07556 [Arthroderma benhamiae CBS 112371]
          Length = 460

 Score = 83.6 bits (205), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 89/353 (25%), Positives = 159/353 (45%), Gaps = 39/353 (11%)

Query: 700  QTFEFISTYPLDTFEYGCSILSCSF-SDDSNVYYCVGTAY-VLPEENEPTKGRILVF-IV 756
             T   +S   L+  E   SI + SF S +   +  VGT   ++      T G I ++   
Sbjct: 96   HTKSVLSNLELEDNEAAVSIAAVSFTSQEDETFLVVGTGKDMVVSPRTFTCGFIHIYRFQ 155

Query: 757  EDGK-LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSEC-GHHG 814
            E+GK L+ I + + +    +L  F G+LLA I   +++Y     D G R+L  +C     
Sbjct: 156  EEGKELEFIHKTKVEQPPLALLGFQGRLLAGIGPDLRIY-----DLGMRQLLRKCQAQIT 210

Query: 815  HILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG 874
              + + +QT+G  I+V D+ +S++ ++YK++E A+   A D    W +   ++D +   G
Sbjct: 211  PRVIVGLQTQGSRIIVSDVQESVTYVVYKYQENALIPFADDIIPRWTTCTTMVDYETVAG 270

Query: 875  AENNFNLFTVR---KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQI 931
             +   N++ +R   K SE A ++  G   +    +L    NR    SLV+     D   I
Sbjct: 271  GDKFGNIWLLRCPTKASEEADEDGSGAHLIHERQYLQGAPNRL---SLVIHFYSQD---I 324

Query: 932  PTVIFGT--------------VNGVIGV-IASLPHEQYLFLEKLQTNLRKVIKGVGGLNH 976
            PT I  T              + G +G+ +  +  +   F + L+  L      + G +H
Sbjct: 325  PTSIQKTQLVAGGRDILVWTGLQGTVGMFVPFITRDDVDFFQTLEMQLASQNPPLAGRDH 384

Query: 977  EQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRV 1029
              +R +         K  +DGDL E+FL L   +   I+  ++ SV E+ +++
Sbjct: 385  LIYRGY-----YAPCKGVIDGDLCETFLLLPNDKKQAIAGELDRSVREIERKI 432


>gi|302654423|ref|XP_003019019.1| hypothetical protein TRV_07032 [Trichophyton verrucosum HKI 0517]
 gi|291182709|gb|EFE38374.1| hypothetical protein TRV_07032 [Trichophyton verrucosum HKI 0517]
          Length = 460

 Score = 83.6 bits (205), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 89/353 (25%), Positives = 159/353 (45%), Gaps = 39/353 (11%)

Query: 700  QTFEFISTYPLDTFEYGCSILSCSF-SDDSNVYYCVGTAY-VLPEENEPTKGRILVF-IV 756
             T   +S   L+  E   SI + SF S +   +  VGT   ++      T G I ++   
Sbjct: 96   HTKSVLSNLELEDNEAAVSIAAVSFTSQEDETFLVVGTGKDMVVSPRTFTCGFIHIYRFQ 155

Query: 757  EDGK-LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSEC-GHHG 814
            E+GK L+ I + + +    +L  F G+LLA I   +++Y     D G R+L  +C     
Sbjct: 156  EEGKELEFIHKTKVEQPPLALLGFQGRLLAGIGPDLRIY-----DLGMRQLLRKCQAQIT 210

Query: 815  HILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG 874
              + + +QT+G  I+V D+ +S++ ++YK++E A+   A D    W +   ++D +   G
Sbjct: 211  PRVIVGLQTQGSRIIVSDVQESVTYVVYKYQENALIPFADDIIPRWTTCTTMVDYETVAG 270

Query: 875  AENNFNLFTVR---KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQI 931
             +   N++ +R   K SE A ++  G   +    +L    NR    SLV+     D   I
Sbjct: 271  GDKFGNIWLLRCPTKASEEADEDGSGAHLIHERQYLQGAPNRL---SLVIHFYSQD---I 324

Query: 932  PTVIFGT--------------VNGVIGV-IASLPHEQYLFLEKLQTNLRKVIKGVGGLNH 976
            PT I  T              + G +G+ +  +  +   F + L+  L      + G +H
Sbjct: 325  PTSIQKTQLVAGGRDILVWTGLQGTVGMFVPFITRDDVDFFQTLEMQLASQNPPLAGRDH 384

Query: 977  EQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRV 1029
              +R +         K  +DGDL E+FL L   +   I+  ++ SV E+ +++
Sbjct: 385  LIYRGY-----YAPCKGVIDGDLCETFLLLPNDKKQAIAGELDRSVREIERKI 432


>gi|358348106|ref|XP_003638090.1| DNA damage-binding protein, partial [Medicago truncatula]
 gi|355504025|gb|AES85228.1| DNA damage-binding protein, partial [Medicago truncatula]
          Length = 642

 Score = 83.6 bits (205), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 61/236 (25%), Positives = 116/236 (49%), Gaps = 28/236 (11%)

Query: 269 GDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLR 328
           GDS ++KL    D +  +  +++   N+ PI D    D   +   Q+  C G   +GSLR
Sbjct: 261 GDSVVLKLK---DGRLCFTNLIQ---NIAPIFDVTSGDYHDEKHDQMFACCGVTPEGSLR 314

Query: 329 IVRNGIGINE--QASVELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEE 386
           ++++GI + +  +     +G+ G W++R    D + +FLV+SF+ ETRIL++ L    + 
Sbjct: 315 VIQSGINVEKLLRTPSTYEGVAGTWTVRMKISDQYHSFLVLSFLGETRILSVGLSFT-DV 373

Query: 387 TEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSST----------SRELRNEWKSPPG 436
           T+  GF     TL C       LVQ+   +V+L   T          S  +   W  P  
Sbjct: 374 TDSVGFQPNVCTLACGLVSDGLLVQIYQSAVKLCLPTKDGHSEGIPLSSPICTSWY-PDN 432

Query: 437 YSVNVATANASQVLLATGGGHLVYLEIGDGILT-------EVKHAQLEYEISCLDI 485
            ++++     + ++++T     +++ +G  +L+       E++H +L+ E+SC+ I
Sbjct: 433 LNISLGAVGHNFIVVSTSNPCFLFI-LGVRMLSAYQYEIYEMQHLELQNEVSCISI 487


>gi|121719617|ref|XP_001276507.1| cleavage and polyadenylation specificity factor subunit A, putative
            [Aspergillus clavatus NRRL 1]
 gi|148886827|sp|A1C3U1.1|CFT1_ASPCL RecName: Full=Protein cft1; AltName: Full=Cleavage factor two protein
            1
 gi|119404719|gb|EAW15081.1| cleavage and polyadenylation specificity factor subunit A, putative
            [Aspergillus clavatus NRRL 1]
          Length = 1401

 Score = 83.6 bits (205), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 96/413 (23%), Positives = 172/413 (41%), Gaps = 56/413 (13%)

Query: 654  IRSIPLGEHPRRICHQEQSRTFAICSLKNQ--SCAEESEMH-----------------FV 694
            ++ + +GEH   + +   S T+ + +  +      E+ E+H                  +
Sbjct: 969  LKKVAIGEHVDHLAYSISSETYVLGTSHSADFKLPEDDELHPEWRNEAISFLPELRQCCL 1028

Query: 695  RLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVY-----YCVGTAYVLPEENEPTKG 749
            +++  +T+  I +Y L   E   ++ + +     N +       VGTA     E+ P +G
Sbjct: 1029 KVVHPKTWTVIDSYTLGPDEEIMAVKNMNLEVSENTHERKNMIVVGTALAR-GEDIPARG 1087

Query: 750  RILVFIV-----------EDGKLQLIAEKETKGAVYSLNAFNGK--LLAAINQKIQLYKW 796
             I VF V            D KL+LI ++  KGAV +L+   G+  L+AA  QK  +   
Sbjct: 1088 CIYVFEVIKVVPDPEKPETDRKLKLIGKELVKGAVTALSEIGGQGFLIAAQGQKCMVRG- 1146

Query: 797  MLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDY 856
             L++DG+    +      ++  L         +VGD  K I    Y  E   +    +D 
Sbjct: 1147 -LKEDGSLLPVAFMDVQCYVNVLKELKGTGMCIVGDAFKGIWFAGYSEEPYKMSLFGKDL 1205

Query: 857  NANWMSAVEILDDD---IYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
                + A + L D      L A+++ NL  ++   E        +L V  ++H+G F + 
Sbjct: 1206 EYPEVVAADFLPDGDKLFILVADSDCNLHVLQYEPEDPMSSNGDKLLVRSKFHMGHFTST 1265

Query: 914  FR-----HGSLVMRLPDSDVGQIPT------VIFGTVNGVIGVIASLPHEQYLFLEKLQT 962
                     S  +   DSD  ++        V+  + +G IG++ S+P E Y  L  LQ+
Sbjct: 1266 LTLLPRTTASYEIPSADSDSMEVDPRITPQQVLITSQSGSIGIVTSIPEESYRRLSALQS 1325

Query: 963  NLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEIS 1015
             L   ++   GLN   +R+   E      +  LDG+L+  +L +S+ R  EI+
Sbjct: 1326 QLANTVEHPCGLNPRAYRAI--ESDGTAGRGMLDGNLLYQWLSMSKQRRMEIA 1376


>gi|358348118|ref|XP_003638096.1| DNA damage-binding protein, partial [Medicago truncatula]
 gi|355504031|gb|AES85234.1| DNA damage-binding protein, partial [Medicago truncatula]
          Length = 822

 Score = 83.6 bits (205), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 61/236 (25%), Positives = 116/236 (49%), Gaps = 28/236 (11%)

Query: 269 GDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLR 328
           GDS ++KL    D +  +  +++   N+ PI D    D   +   Q+  C G   +GSLR
Sbjct: 441 GDSVVLKLK---DGRLCFTNLIQ---NIAPIFDVTSGDYHDEKHDQMFACCGVTPEGSLR 494

Query: 329 IVRNGIGINE--QASVELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEE 386
           ++++GI + +  +     +G+ G W++R    D + +FLV+SF+ ETRIL++ L    + 
Sbjct: 495 VIQSGINVEKLLRTPSTYEGVAGTWTVRMKISDQYHSFLVLSFLGETRILSVGLSFT-DV 553

Query: 387 TEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSST----------SRELRNEWKSPPG 436
           T+  GF     TL C       LVQ+   +V+L   T          S  +   W  P  
Sbjct: 554 TDSVGFQPNVCTLACGLVSDGLLVQIYQSAVKLCLPTKDGHSEGIPLSSPICTSWY-PDN 612

Query: 437 YSVNVATANASQVLLATGGGHLVYLEIGDGILT-------EVKHAQLEYEISCLDI 485
            ++++     + ++++T     +++ +G  +L+       E++H +L+ E+SC+ I
Sbjct: 613 LNISLGAVGHNFIVVSTSNPCFLFI-LGVRMLSAYQYEIYEMQHLELQNEVSCISI 667


>gi|25148482|ref|NP_500157.2| Protein CPSF-1 [Caenorhabditis elegans]
 gi|22096347|sp|Q9N4C2.2|CPSF1_CAEEL RecName: Full=Probable cleavage and polyadenylation specificity
            factor subunit 1; AltName: Full=Cleavage and
            polyadenylation specificity factor 160 kDa subunit;
            Short=CPSF 160 kDa subunit
 gi|373220398|emb|CCD73182.1| Protein CPSF-1 [Caenorhabditis elegans]
          Length = 1454

 Score = 83.6 bits (205), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 76/312 (24%), Positives = 146/312 (46%), Gaps = 29/312 (9%)

Query: 739  VLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWML 798
            V+PE ++PT  R         K++++ +KE KG V  L A NG LL  + QK+  + W  
Sbjct: 1153 VVPEPDQPTSNR---------KIKVLFDKEQKGPVTGLCAINGLLLCGMGQKV--FIWQF 1201

Query: 799  RDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYN- 857
            +D+    + S    H ++  L+  +     +  D  +S+SL+ ++ +  A+   +RD   
Sbjct: 1202 KDNDLMGI-SFLDMHYYVYQLH--SLRTIAIACDARESMSLIRFQEDNKAMSIASRDDRK 1258

Query: 858  -ANWMSAVEILDDDIYLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVN- 912
             A    A +++ D  ++G   ++   N+       E        RL V    ++G  +N 
Sbjct: 1259 CAQPPMASQLVVDGAHVGFLLSDETGNITMFNYAPEAPESNGGERLTVRAAINIGTNINA 1318

Query: 913  --RFRHGSLVMRLPDSD----VGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRK 966
              R R  + +++L + D    + Q  T +F +++G  G +  L  + Y  L  LQT +  
Sbjct: 1319 FVRLRGHTSLLQLNNEDEKEAIEQRMTTVFASLDGSFGFVRPLTEKSYRRLHFLQTFIGS 1378

Query: 967  VIKGVGGLNHEQWRSFNNEKKTV---DAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVE 1023
            V   + GL+ +  RS    +  V   +A+N +DGD++E +L LS     ++++ + V   
Sbjct: 1379 VTPQIAGLHIKGSRSAKPSQPIVNGRNARNLIDGDVVEQYLHLSLYDKTDLARRLGVGRY 1438

Query: 1024 ELCKRVEELTRL 1035
             +   + +L R+
Sbjct: 1439 HIIDDLMQLRRM 1450



 Score = 70.5 bits (171), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 87/395 (22%), Positives = 153/395 (38%), Gaps = 96/395 (24%)

Query: 72  DPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEEL-----QVLDIKFLYGCAKPTIV 126
           DP  R     +Y     ++PF    +   ++ I L+++      + D+ FL G  +PTI+
Sbjct: 145 DPSNRCAACLVYGKHIAILPFHENSKRIHSYVIPLKQIDPRLDNIADMVFLDGYYEPTIL 204

Query: 127 VLYQDNK----------DARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCG 176
            LY+  +          D   +    V + D+ F    W   NL      L+P+P PL G
Sbjct: 205 FLYEPIQTTPGRACVRYDTMCIMGVSVNIVDRQFAV-VWQTANLPMDCSQLLPIPKPLGG 263

Query: 177 VLIIGEETIVYCSA-------------NAFKAIPIRPSITKAYGRVDADGSRYL------ 217
            L+ G  T+VY +              + F   P++  +      +D   S Y+      
Sbjct: 264 ALVFGSNTVVYLNQAVPPCGLVLNSCYDGFTKFPLK-DLKHLKMTLDCSTSVYMEDGRIA 322

Query: 218 LGDHAGLLHL--LVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIK 275
           +G   G L L  L+ +     V  L+   + ETSIA +++      +++GS  GDSQL++
Sbjct: 323 VGSRDGDLFLLRLMTSSGGGTVKSLEFSKVYETSIAYSLTVCAPGHLFVGSRLGDSQLLE 382

Query: 276 LNL---------------QPDAKGSYVEV------------------------------- 289
             L                 D   + +E+                               
Sbjct: 383 YTLLKTTRDCAVKRLKIDNKDPAAAEIELDEDDMELYGGAIEEQQNDDDEQIDESLQFRE 442

Query: 290 LERYVNLGPIVDFCV----------VDLERQGQ-GQVVTCSGAYKDGSLRIVRNGIGINE 338
           L+R  N+GP+   CV          VD +R+     +VT SG  K+G+L + +  +    
Sbjct: 443 LDRLRNVGPVKSMCVGRPNYMSNDLVDAKRRDPVFDLVTASGHGKNGALCVHQRSLRPEI 502

Query: 339 QASVELQGIKGMWSLRSSTDDPFDTFLVVSFISET 373
             S  L+G + +W++    ++    +L+VS +  T
Sbjct: 503 ITSSLLEGAEQLWAVGRKENESH-KYLIVSRVRST 536


>gi|148886829|sp|A2R919.1|CFT1_ASPNC RecName: Full=Protein cft1; AltName: Full=Cleavage factor two protein
            1
 gi|134083776|emb|CAK47110.1| unnamed protein product [Aspergillus niger]
          Length = 1383

 Score = 83.2 bits (204), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 97/438 (22%), Positives = 179/438 (40%), Gaps = 61/438 (13%)

Query: 654  IRSIPLGEHPRRICHQEQSRTFAI--CSLKNQSCAEESEMH-----------------FV 694
            ++ + LGE    + +   S  + +  C   +    E+ E+H                 F+
Sbjct: 942  LKRVHLGEQVDHLAYSTSSGMYVLGTCHATDFKLPEDDELHPEWRNEAISFFPSARGSFI 1001

Query: 695  RLLDDQTFE---------FISTYPLDTFEYGCSILSCSFSDDSNVY-----YCVGTAYVL 740
            +L+ D   +          + ++ L   EY  +I + S     N +       VGTA+  
Sbjct: 1002 KLVWDHHLQRQDSVILIFHLHSFSLGADEYVMAIKNISLEVSENTHERKDMIVVGTAFAR 1061

Query: 741  PEENEPTKGRILVFIV-----------EDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQ 789
              E+ P++G I VF V            D KL+LI ++  KGAV +L+   G+    + Q
Sbjct: 1062 -GEDIPSRGCIYVFEVVQVVPDPDHPETDRKLKLIGKEPVKGAVTALSEIGGQGFVLVAQ 1120

Query: 790  KIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAI 849
              +     L++DG+    +      ++  +         ++GD +K +    Y  E   +
Sbjct: 1121 GQKCMVRGLKEDGSLLPVAFMDMQCYVSVVKELKGTGMCILGDAVKGVWFAGYSEEPYKM 1180

Query: 850  EERARDYNANWMSAVEILDDDIYLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYH 906
               A+D +   + A E L D   L    A+++ N+  ++ + E        RL    ++H
Sbjct: 1181 SLFAKDLDYLEVCAAEFLPDGKRLFIVVADSDCNIHVLQYDPEDPKSSNGDRLLSRSKFH 1240

Query: 907  LGEFVNRFRHGSLVM----RLPDSDVG-----QIP--TVIFGTVNGVIGVIASLPHEQYL 955
            +G F +        M    ++  S  G     Q P   V+  T NG +G+I  +P E Y 
Sbjct: 1241 MGNFASTLTLLPRTMVSSEKMVSSSDGMDIDNQSPLHQVLMTTQNGSLGLITCIPEESYR 1300

Query: 956  FLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEIS 1015
             L  LQ+ L   ++   GLN   +R+   E      +  LDG+L+  ++D+S+ R  EI+
Sbjct: 1301 RLSALQSQLTNTLEHPCGLNPRAFRAV--ESDGTAGRGMLDGNLLFKWIDMSKQRKTEIA 1358

Query: 1016 KTMNVSVEELCKRVEELT 1033
              +     E+   +E ++
Sbjct: 1359 GRVGAREWEIKADLEAIS 1376


>gi|443684051|gb|ELT88095.1| hypothetical protein CAPTEDRAFT_161045 [Capitella teleta]
          Length = 1410

 Score = 82.8 bits (203), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 95/433 (21%), Positives = 180/433 (41%), Gaps = 58/433 (13%)

Query: 654  IRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAE--------ESEMHFVRLLDD------ 699
            +R +PL   P  + +   S+T+++ S +   C +        E E+  V+  D       
Sbjct: 979  VRKVPLRCTPHFVVYHPDSKTYSVVSSQQVPCTQLVRVAGDGEKEIEAVQKDDRFVFPIM 1038

Query: 700  -----QTFEFISTYP-------LDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPE 742
                 Q F  +S  P       L+ +E+   I + +   +  +     Y  VGT      
Sbjct: 1039 NKFNIQLFSPVSWEPIPNTRFDLEEWEHVMCIKTINLKSEGTLSGLKGYVVVGTNLNY-N 1097

Query: 743  ENEPTKGRILVFIVED-----------GKLQLIAEKETKGAVYSLNAFNGKLLAAINQKI 791
            E+  ++G++ ++ V D            K++++  KE KG V +L+   G L+ AI QK+
Sbjct: 1098 EDVSSRGKLTIYDVIDVVPEPGQPLTKNKIKVVYNKEQKGPVTALDGVQGFLVTAIGQKV 1157

Query: 792  QLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEE 851
              Y W L+D+   +L         I    ++   + I++GD+ KSIS+L Y+ +   +  
Sbjct: 1158 --YIWQLKDN---DLAGIAFIDTQIYIHKMEALKNLIIIGDVCKSISVLRYQEDMKVLSL 1212

Query: 852  RARDYNANWMSAVEILDDDIYLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLG 908
             ++D     +  V  L D+  L    A+   N        +    +   RL    + ++G
Sbjct: 1213 VSKDVRPLAVYGVAYLVDETSLAFIVADKLKNFLVYCYQPDLVQSQGGQRLIRKADINIG 1272

Query: 909  EFVNRFRHGSLVMRLP-----DSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTN 963
              VN F      +  P     D  +       + T++G IG +  +    Y  L  LQ  
Sbjct: 1273 SLVNAFFRVKCRVSDPSTSKTDQSLAMKHITYYVTLDGSIGYLLPISESLYRRLYMLQKM 1332

Query: 964  LRKVIKGVGGLNHEQWRSFNNEKKTV--DAKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
            L + ++   GLN + +R+   E + +    +N +DGDL   +L L+     E++K +  +
Sbjct: 1333 LIQQVQQTAGLNPKAYRTCQTEFRQLINIQRNIIDGDLAWKYLALTSHDRAEMAKRIGTT 1392

Query: 1022 VEELCKRVEELTR 1034
              ++   + E+ R
Sbjct: 1393 SHQIEDDLLEIDR 1405



 Score = 57.4 bits (137), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 73/343 (21%), Positives = 142/343 (41%), Gaps = 67/343 (19%)

Query: 3   DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRP 62
           D  + G ++++E     G A+D L +  E  K  +  +D ++ +L T ++    D     
Sbjct: 71  DYELCGNVSSIESISLVGAARDALLLCFEEAKLSLCDYDPDTDDLKTISLHYFEDA---D 127

Query: 63  TDNG--QIGI------IDPDCRLIGLHLYDGLFKVIPFD--------------NKGQLKE 100
            +NG  Q G+      +DP+ R   + +Y     V+PF               +K  +  
Sbjct: 128 LENGCCQRGLHHSEVRVDPEGRCAVMLIYGTHLIVLPFRKESPSDEIDATSCASKSPIMS 187

Query: 101 AFNIRLEEL-----QVLDIKFLYGCAKPTIVVLYQD----------NKDARHVKTYEVAL 145
            + I L  L      V+DI+FL+G  +PT+++LY+            KD   +    + L
Sbjct: 188 TYIIDLRTLDERVTNVVDIQFLHGYYEPTVLILYEPLPTWTCRVAVRKDTCSIVAISLNL 247

Query: 146 KDKDFVEGP--WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVY---------CSANAFK 194
           +DK     P  WS +NL        PVP P+ GV++    +++Y          S N+  
Sbjct: 248 QDKTH---PIIWSHSNLPYDCLRTFPVPKPIGGVIVFAVNSLLYLNQSFPPYGVSLNSLT 304

Query: 195 AIP----------IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEK-VTGLKIE 243
           +            +R S+  A      D  + ++    G L++L +  +  + V    ++
Sbjct: 305 SFNTEFLLKPQEGVRMSLDCAQAEF-IDNDKLVISLKGGELYVLTLVIDSMRAVRSFHLD 363

Query: 244 LLGETSIASTISYLDNAVVYIGSSYGDSQLIKLN-LQPDAKGS 285
               + + + +    +  +++GS  G+S L++    +P+A  S
Sbjct: 364 KAAASVLTTCMCMCGDNYLFLGSRLGNSLLLRYQEKKPEASSS 406



 Score = 39.7 bits (91), Expect = 8.5,   Method: Compositional matrix adjust.
 Identities = 69/278 (24%), Positives = 116/278 (41%), Gaps = 37/278 (13%)

Query: 232 HEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLE 291
             KEK+T L I L+G + + S +  LD   VY      DSQ ++     D      EV +
Sbjct: 415 QRKEKMT-LAIGLVGSSDV-SKLDDLDELEVY----GRDSQAVE---SEDITQFMFEVCD 465

Query: 292 RYVNLGPI-------VDFCVVDLERQGQG--QVVTCSGAYKDGSLRIVRNGIGINEQASV 342
             +N+GP          F   +   Q     ++VT SG  K+G++ I++  I      + 
Sbjct: 466 SIINIGPCGQVEMGEPAFLSEEFSHQEDPDLELVTTSGYGKNGAISILQRQIRPQVVTTF 525

Query: 343 ELQGIKGMWSLRSSTDDPFDT---------FLVVSFISETRILAMNLEDELEETEIEGFC 393
           EL G   +W++  S D+   +         FL++S    + +L      E+ E +  GFC
Sbjct: 526 ELPGCTDVWTVLGSPDEQQGSDEKLAGSHAFLLLSRADSSMVLETG--QEIMELDHSGFC 583

Query: 394 SQTQTLFCHDAIYNQ-LVQVTSGSVRLVSSTSR--ELRNEWKSPPGYSVNVATANASQVL 450
           +   T+   +    + +VQV   ++ L+    R   L  +  SP    V+ + A+   +L
Sbjct: 584 TDAPTVHAANIGNGRYIVQVGPNAIWLLKGVERIQHLALDVSSP---VVSCSLADPHVLL 640

Query: 451 LATGGG--HLVYLEIGDGILTEVKHAQLEYEISCLDIN 486
           L   G   HLV    GD     +   +L  +   + IN
Sbjct: 641 LCEDGQLLHLVLSVQGDDPTLSLLTTKLHQKSKVIAIN 678


>gi|301773406|ref|XP_002922132.1| PREDICTED: LOW QUALITY PROTEIN: cleavage and polyadenylation
            specificity factor subunit 1-like [Ailuropoda
            melanoleuca]
          Length = 1469

 Score = 82.4 bits (202), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 111/464 (23%), Positives = 186/464 (40%), Gaps = 87/464 (18%)

Query: 618  VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQS 672
            +    PF++   P   L   ++GEL I  +           +R IPL      + +  +S
Sbjct: 1015 IDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVES 1074

Query: 673  RTFAICSLKNQSCAE-----------------------ESEMHFVRLLDDQTFEFI--ST 707
            + +A+ +  N  C                         + E   ++L+   ++E I  + 
Sbjct: 1075 KVYAVATSTNMPCTRIPRMTGEEKEFETIERDDRYIHPQQEAFSIQLISPVSWEAIPNAR 1134

Query: 708  YPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-----IVE 757
              L+ +E+   + + S   +  V     Y   GT  +  EE    +GRIL+      + E
Sbjct: 1135 IELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVT-CRGRILIMDVIEVVPE 1193

Query: 758  DG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECG 811
             G      K +++ EKE KG V +L   NG L++AI QKI L  W LR     EL     
Sbjct: 1194 PGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFL--WSLR---ASELTGMAF 1248

Query: 812  HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI 871
                +    + +  +FI+  D+MKSISLL Y+ E   +   +RD     + +V+ + D+ 
Sbjct: 1249 IDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNA 1308

Query: 872  YLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV 928
             LG   ++ + NL       E        RL    ++H+G  VN F       R P    
Sbjct: 1309 QLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF------WRTPCRGA 1362

Query: 929  GQIPT-----------VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHE 977
             + P+             F T++G IG++  LP ++       +TN  +       L H 
Sbjct: 1363 AEGPSKKSVVWENKHITWFATLDGGIGLL--LPMQE-------KTNRLQPAXSPRML-HV 1412

Query: 978  QWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
              R   N       +N LDG+L+  +L LS     E++K +  +
Sbjct: 1413 DRRILQNA-----VRNVLDGELLNRYLYLSTMERGELAKKIGTT 1451



 Score = 46.2 bits (108), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 61/326 (18%), Positives = 141/326 (43%), Gaps = 60/326 (18%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAM-----GDVSDRIG 60
            +G + ++   +  G  +D L ++ +  K  V+++D  + +L T ++      ++ D   
Sbjct: 112 FFGNVMSMASVQLAGAKRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPELRDGFV 171

Query: 61  RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPF-------DNKGQLKEA----------FN 103
           +     ++ + DPD R   + +Y     V+PF       +++G + E            +
Sbjct: 172 QNVHTPRVRV-DPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLMGEGQRSSFLPSYIID 230

Query: 104 IR-LEE--LQVLDIKFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDFVEG-- 153
           +R L+E  L ++D++FL+G  +PT+++L++ N+      A    T  +     +  +   
Sbjct: 231 VRGLDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKVH 290

Query: 154 P--WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAFKAIPI 198
           P  WS  +L       + VP P+ GV++    +++Y +                  A P+
Sbjct: 291 PVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNGLTTGTTAFPL 350

Query: 199 RPSITKAYGRVDADGS--------RYLLGDHAGLLHLL-VITHEKEKVTGLKIELLGETS 249
           R   T+   R+  D +        + ++    G +++L +IT     V     +    + 
Sbjct: 351 R---TQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASV 407

Query: 250 IASTISYLDNAVVYIGSSYGDSQLIK 275
           + +++  ++   +++GS  G+S L+K
Sbjct: 408 LTTSMVTMEPGYLFLGSRLGNSLLLK 433


>gi|389614684|dbj|BAM20371.1| spliceosomal protein sap, partial [Papilio polytes]
          Length = 541

 Score = 82.4 bits (202), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 75/296 (25%), Positives = 131/296 (44%), Gaps = 43/296 (14%)

Query: 759  GKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSEC-GHHGHIL 817
            GKL+L+ +        +L AFNG+L+A + + +++Y     D G R+L  +C   H   L
Sbjct: 259  GKLELVHKTPIDDYPGALAAFNGRLMAGVGRMLRMY-----DIGRRKLLRKCENRHIPNL 313

Query: 818  ALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAEN 877
               ++T G  I V D+ +S+  + YK  E  +   A D N  W++   +LD D  +   +
Sbjct: 314  IADIKTIGQRIFVADVQESVFCVKYKKRENQLIIFADDTNPRWITNTCVLDYDT-VAMSD 372

Query: 878  NFNLFTVRKNSEGATDE------------ERGRL-------EVVGEYHLGEFVNRFRHGS 918
             F    V +     +D+            +RG L       EVV  +H+GE V   +  +
Sbjct: 373  KFGNVAVMRLPHSVSDDVDEDPTGNKALWDRGLLNGASQKGEVVVNFHVGETVTSLQRAT 432

Query: 919  LVMRLPDSDVGQIPTVIFGTVNGVIGVIASLP---HEQYLFLEKLQTNLRKVIKGVGGLN 975
            L+        G    +++ T  G +G +  LP    E +   + L+ ++R     + G +
Sbjct: 433  LI-------PGGSEALLYATAGGALGAL--LPFTSREDHDXFQHLEMHMRSENAPLCGRD 483

Query: 976  HEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEE 1031
            H  +RS+         KN +DGDL E F  L   +   I+  +  +  E  K++E+
Sbjct: 484  HLSFRSY-----YYPVKNVIDGDLCEQFNSLEPAKQKAIAGDLERTPAEGSKKLED 534


>gi|255581562|ref|XP_002531586.1| spliceosomal protein sap, putative [Ricinus communis]
 gi|223528782|gb|EEF30789.1| spliceosomal protein sap, putative [Ricinus communis]
          Length = 1220

 Score = 82.4 bits (202), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 47/131 (35%), Positives = 73/131 (55%), Gaps = 3/131 (2%)

Query: 295 NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINE--QASVELQGIKGMWS 352
           N+ PI+D  VVD   + Q Q+  C G   +GSLRI+R+GI + +  + +   QG+ G W+
Sbjct: 460 NIAPILDMSVVDHHDEKQEQMFGCCGVAPEGSLRIIRSGISVEKLLRTAPIYQGVTGTWA 519

Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
           LR    D + +F+V+SF+ ETR+L++ +    + T+  GF     TL C       LVQ+
Sbjct: 520 LRMKVTDLYHSFVVLSFVEETRVLSVGVSFA-DVTDSAGFLPNVCTLACGLVGDGLLVQI 578

Query: 413 TSGSVRLVSST 423
              +VRL   T
Sbjct: 579 HQTAVRLCLPT 589



 Score = 58.9 bits (141), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 45/193 (23%), Positives = 90/193 (46%), Gaps = 15/193 (7%)

Query: 573 LGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDS 632
           +G  P+ L   +      + A SDRP ++ ++   L Y++++ +  +H  P  S   P  
Sbjct: 725 IGVTPVFLVPLTDSLDADMIALSDRPWLLQTARHGLSYTSISFQPSTHSTPVCSVECPKG 784

Query: 633 LAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAI--CSLKNQSCAEESE 690
           L    E  L +  +   ++L+++   LG  PR++ +  +SR   +    L N +C+ +  
Sbjct: 785 LLFVAENSLHLVEMVHSKRLNVQKFHLGGTPRKVLYHSESRLLLVMRTELSNDTCSSD-- 842

Query: 691 MHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAY-----VLPE-EN 744
              +  +D  +   +S++ L+  E G S+       +  +   VGT+      ++P  E 
Sbjct: 843 ---ICCVDPLSGSVVSSFKLEHGETGKSMELVRVGTEQVLV--VGTSLSSGPAIMPSGEA 897

Query: 745 EPTKGRILVFIVE 757
           E TKGR++V  +E
Sbjct: 898 ESTKGRLIVLCLE 910


>gi|297792471|ref|XP_002864120.1| hypothetical protein ARALYDRAFT_495232 [Arabidopsis lyrata subsp.
            lyrata]
 gi|297309955|gb|EFH40379.1| hypothetical protein ARALYDRAFT_495232 [Arabidopsis lyrata subsp.
            lyrata]
          Length = 1444

 Score = 81.6 bits (200), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 78/300 (26%), Positives = 133/300 (44%), Gaps = 20/300 (6%)

Query: 702  FEFISTYPLDTFEYGCSI----LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVE 757
            +E  +T P+ + E+  ++    L  + + ++     VGTAYV   E+   +GR+L+F   
Sbjct: 1099 WETKATIPMQSSEHALTVRVVTLLNASTGENETLLAVGTAYV-QGEDVAARGRVLLFSFG 1157

Query: 758  ---DGKLQLIAE---KETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECG 811
               D    ++ E   +E KGA+ ++ +  G LL +   KI L+KW    +GT        
Sbjct: 1158 KNGDNSQNVVTEVYSRELKGAISAVASIQGHLLISSGPKIILHKW----NGTELNGVAFF 1213

Query: 812  HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI 871
                +  + +     FI++GD+ KSI  L +K +   +   A+D+ +    A E L D  
Sbjct: 1214 DAPPLYVVSMNVVKTFILLGDVHKSIYFLSWKEQGSQLSLLAKDFGSLDCFATEFLIDGN 1273

Query: 872  YLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV 928
             L    ++   N+       + A   +  +L    E+H+G  V +F    L M    +D 
Sbjct: 1274 TLSLAVSDEQKNIQVFYYAPKMAESWKGQKLLSRAEFHVGSHVTKFLR--LQMVTSGADK 1331

Query: 929  GQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKT 988
                 ++FGT++G  G IA L    +  L+ LQ  L   +  V GLN   +R F    K 
Sbjct: 1332 TNRFALLFGTLDGSFGCIAPLDEVTFRRLQSLQKKLVDAVPHVAGLNPHSFRQFRTSGKA 1391



 Score = 75.5 bits (184), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 95/407 (23%), Positives = 159/407 (39%), Gaps = 116/407 (28%)

Query: 103 NIR-LEELQVLDIKFLYGCAKPTIVVLYQDNKDARHVKTYEVALKDKDFV---------- 151
           N+R LE   V D  FL+G  +P IV+L ++     H     V+ K    V          
Sbjct: 240 NLRDLEMKHVKDFVFLHGYIEPVIVILQEE----EHTWAGRVSWKHHTCVLSALSINTTL 295

Query: 152 -EGP--WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGR 208
            + P  WS  NL + A  L+ VP P+ GVL++   TI Y S +A  A+ +    + A   
Sbjct: 296 KQHPVIWSAINLPHDAYKLLAVPSPIGGVLVLCANTIHYHSQSASCALALNNYASSADSS 355

Query: 209 VDADGSRY-----------------LLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIA 251
            +   S +                 LL   +G L LL + ++   V  L +     + +A
Sbjct: 356 QELPASNFSVELDAAHGTWISSDVALLSTKSGELLLLTLIYDGRAVQRLDLSKSKASVLA 415

Query: 252 STISYLDNAVVYIGSSYGDSQLI------------------------------KLNLQPD 281
           S I+ + N++ ++GS  GDS L+                              +L +  D
Sbjct: 416 SDITSVGNSLFFLGSRLGDSLLVQFSCRSGPAASLPGLRDEDEDIEGEGHQAKRLRISSD 475

Query: 282 -----------------------AKGSY-VEVLERYVNLGPIVDFC----------VVDL 307
                                  A+ S+   V +  VN+GP+ DF              +
Sbjct: 476 TFQDTIGNEELSLFGSTPNNSDSAQKSFSFAVRDSLVNVGPVKDFAYGLRINADANATGV 535

Query: 308 ERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL-------------- 353
            +Q   ++V CSG  K+G+L ++R  +       VEL G KG+W++              
Sbjct: 536 SKQSNYELVCCSGHGKNGALCVLRQSVRPEMITEVELPGCKGIWTVYHKSSRGHNADSSK 595

Query: 354 RSSTDDPFDTFLVVSFISETRILAMNLEDELEE-TEIEGFCSQTQTL 399
            ++ +D +  +L++S   E R + +   D L E TE   +  Q +T+
Sbjct: 596 MAADEDEYHAYLIISV--EARTMVLETADLLTEVTESVDYYVQGRTI 640


>gi|225679191|gb|EEH17475.1| conserved hypothetical protein [Paracoccidioides brasiliensis Pb03]
          Length = 1377

 Score = 81.6 bits (200), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 98/421 (23%), Positives = 182/421 (43%), Gaps = 46/421 (10%)

Query: 655  RSIPLGEHPRRICHQEQSRTFAICSLK--NQSCAEESEMH-------FVRLLDDQTFEFI 705
            R I LGE    + +   S T+ + + +  +    E+ E+H        V+LL+ +T+  I
Sbjct: 954  RKIGLGEQVDSVEYSSSSETYVLGTSQKVDFKLPEDDEIHPEWRNEESVKLLNPRTWSII 1013

Query: 706  STYPLDTFE----YGCSILSCS-FSDDSNVYYCVGTAYVLPEENEPTKGRILVFIV---- 756
             +Y L T E      C  L  S  + +      VGTA    E+    +G I VF V    
Sbjct: 1014 DSYQLRTAERVMCVKCLNLEASEITHERKEMIAVGTALTRGEDIA-ARGCIYVFEVIKVV 1072

Query: 757  -------EDGKLQLIAEKETKGAVYSLNAFNGK--LLAAINQKIQLYKWMLRDDGTRELQ 807
                    + KL+LIA++E KGA+ SL+   G+  L+AA  QK  +    L++DG+    
Sbjct: 1073 PEVDRPETNRKLKLIAKEEVKGAITSLSGIGGQGFLIAAQGQKCIVRG--LKEDGSLLPV 1130

Query: 808  SECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEIL 867
            +      ++  L         ++GD +K +    Y  E   +   ++D  +  + A + L
Sbjct: 1131 AFMDMQCYVSVLKELKGTGMCIMGDALKGLWFAGYSEEPYKLSLFSKDDGSLQVMAADFL 1190

Query: 868  DDD---IYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFR---HGSLVM 921
                    + A+++ N+  ++ + E     +  RL     +H G+F +        S++ 
Sbjct: 1191 PHGKRLFIMVADDDCNIHVLQYDPEDPGSAKGDRLLHRSTFHTGQFASTLTLLPRTSVLS 1250

Query: 922  RLPDSDV--------GQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGG 973
            + P+++         G +  V+  +  G I +I  +    Y  L  LQ+ +   ++   G
Sbjct: 1251 QGPEAEANAMDLDSSGPLHQVLVTSETGSIALITPVSEMAYRRLSALQSQMINTLEHPCG 1310

Query: 974  LNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELT 1033
            LN   +R+   E   +  +  +DGDL++ +LDL   R  EI+  +   V E+   +E + 
Sbjct: 1311 LNPRAFRAV--ESDGIGGRGMVDGDLVQKWLDLGTQRKAEIASRVGADVWEIRADLEAIG 1368

Query: 1034 R 1034
            +
Sbjct: 1369 K 1369


>gi|83314897|ref|XP_730560.1| multisubunit cleavage/polyadenylation specificity factor subunit A
           [Plasmodium yoelii yoelii 17XNL]
 gi|23490318|gb|EAA22125.1| CPSF A subunit region, putative [Plasmodium yoelii yoelii]
          Length = 863

 Score = 81.3 bits (199), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 101/449 (22%), Positives = 179/449 (39%), Gaps = 86/449 (19%)

Query: 552 HLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSS-------------------KNTTHVF 592
           +L  +++N     LT+R+K+++   PI  + F+                    K +  +F
Sbjct: 287 YLKKYIINSDNVLLTNRRKINVCRSPIKFKQFNKVCSEKNKIDINNNNNNTCVKKSNFLF 346

Query: 593 AASDRPTVIYSS-NKKLLYSNVNLKEVSHMCPFNS----AAFPDSLAIAKEGE------- 640
             SD P +IYS   KK+  S V++K +  +  FN       F +  +  K+ +       
Sbjct: 347 ICSDNPIIIYSDIKKKISLSKVSIKNIFLVDIFNDFDYLNPFHNFNSFKKKNQNNLYFIF 406

Query: 641 -----LTIGTIDDIQKLHIRSIPLGEHPRRIC-HQEQSRTFAICSLKNQSCAEESEMHFV 694
                L I  +++I + +I+ IP      +I  H E       C  + +    ++    +
Sbjct: 407 FDGLSLYISHLNEINETYIQKIPFYRTVEKIAYHNESGLLITSCPTEEKHKTNKNLKQII 466

Query: 695 RLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVY---------YCVGTAYVLPEENE 745
              +     F  +Y + + +Y  S + C +  + ++Y          CVGTA +    +E
Sbjct: 467 CFFNPHQNSFKYSYIIPS-KYNVSSI-CIYQINKDIYPNKSNINTLICVGTANINDRVSE 524

Query: 746 PTKGRILVFIVEDGKLQLIAEKE------TKGAVYSLNAFNGKLLAAINQK--------- 790
           P+ G I +F  +  K  L   K         G +  L  F  KL++ IN           
Sbjct: 525 PSSGHIYIFFAK-KKANLFEIKHIYTHNINVGGITHLKQFYDKLISTINNTVIYKCVNKK 583

Query: 791 ------------IQLYKWM--------LRDDGTRELQSECGHHGHILALYVQTRGDFIVV 830
                       I L K++        L +DGT    +       I++L V    ++IVV
Sbjct: 584 LIVVILDISDFLINLDKYVDNTNKPIKLENDGTIVDVASFTPSSWIMSLDVI--ENYIVV 641

Query: 831 GDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEG 890
           GD+M S+++L Y      + E  RDY+  W + V  L    +L ++   N    +K+S  
Sbjct: 642 GDIMTSVTILSYDFNNSTLTEVCRDYSNVWCTFVCALSKSHFLVSDMESNFLVFQKSSIR 701

Query: 891 ATDEERGRLEVVGEYHLGEFVNRFRHGSL 919
             DE+  +L  V  ++ G  VN+    SL
Sbjct: 702 YNDEDSFKLSRVALFNHGHVVNKMLPVSL 730


>gi|47217773|emb|CAG05995.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 1446

 Score = 80.9 bits (198), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 112/498 (22%), Positives = 196/498 (39%), Gaps = 106/498 (21%)

Query: 618  VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQS 672
            +    PF++   P   L   K+GEL I  +           +R IPL      + +  +S
Sbjct: 943  IESFSPFHNINCPKGFLYFNKQGELRISVLPTYLSYDAPWPVRKIPLRCTVHYVSYHVES 1002

Query: 673  RT-------FAICSLKNQSC-------AEESEMHFV----------------RLLDDQTF 702
            +        +A+C+   + C        EE E   +                +L+   ++
Sbjct: 1003 KASLSHCCVYAVCTSVKELCTRIPRMTGEEKEYETIERDERYINPQQDKFSIQLISPVSW 1062

Query: 703  EFISTYPLD--TFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF- 754
            E I    +D   +EY   + + +      V     Y   GT  +  EE    +GRIL+  
Sbjct: 1063 EAIPNTRIDLEEWEYVTCMKTVALRSQETVSGLKGYIAAGTCLMQGEEVT-CRGRILILD 1121

Query: 755  ----IVEDG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTR 804
                + E G      K +++ EKE KG V +L   NG L++AI QKI L  W+L+D+   
Sbjct: 1122 VIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGYLVSAIGQKIFL--WVLKDN--- 1176

Query: 805  ELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAV 864
            +L         +    + +  +FI+  DLMKS+SLL Y+ E   +   +RD     + ++
Sbjct: 1177 DLTGMAFIDTQLYIHQMMSIKNFILAADLMKSVSLLRYQEESKTLSLVSRDAKPLEVYSI 1236

Query: 865  EILDDDIYLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVM 921
            E + D+  LG   ++ + NL+      E        RL    +++ G  +N F       
Sbjct: 1237 EFMVDNSQLGFLVSDRDKNLYVYMYLPEAKESFGGMRLLRRADFNAGANINTF------W 1290

Query: 922  RLPDS---DVGQIPTVI--------FGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKG 970
            R+P     + G    +         F T++G +G++  +  + Y  L  LQ  L  ++  
Sbjct: 1291 RMPCRGALEAGSRKAMTWDNKHITWFATLDGGVGLLLPMQEKTYRRLLMLQNALTTMLSH 1350

Query: 971  VGGLNHEQWRSFNNEKKTV---------------------------DAKNFLDGDLIESF 1003
              GLN + +R    ++ +                              KN LDG+L+  +
Sbjct: 1351 HAGLNPKAFRCVGADRTSAAMLSGMLPDFATSVSRMLHCDRRSLQNPVKNILDGELLNKY 1410

Query: 1004 LDLSRTRMDEISKTMNVS 1021
            L LS     E++K +  +
Sbjct: 1411 LYLSMMERSELAKKIGTT 1428



 Score = 48.5 bits (114), Expect = 0.022,   Method: Compositional matrix adjust.
 Identities = 61/301 (20%), Positives = 131/301 (43%), Gaps = 47/301 (15%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPT-- 63
           ++G + ++E  +  G  +D L ++ +  K  V+++D  + +L T ++    +   R T  
Sbjct: 74  LFGNVMSMESVQLVGANRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPELRDTLT 133

Query: 64  DNGQIGIID-PDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGCAK 122
           D  ++G+ + P    +  ++ D    V   D K             L ++D+KFL+G  +
Sbjct: 134 DEQELGVGEGPKSSFLPTYIID----VRELDEK------------LLNIIDMKFLHGYYE 177

Query: 123 PTIVVLYQDNKD------ARHVKTYEVALKDKDFVE-GP--WSQNNLDNGADLLIPVPPP 173
           PT+++L++ N+        R  +   VA+      +  P  WS +NL      ++ VP P
Sbjct: 178 PTLLILFEPNQTWPGRVAVRQAQCSIVAISLNIMQKVHPVIWSLSNLPFDCTQVMAVPKP 237

Query: 174 LCGVLIIGEETIVYCS-------------ANAFKAIPIR----PSITKAYGRVDADG-SR 215
           + GV++    +++Y +              N   A P+R      IT    + D     +
Sbjct: 238 IGGVVVFAVNSLLYLNQSVPPYGVALNSLTNGTTAFPLRLQDEVKITLDCSQADFIAYDK 297

Query: 216 YLLGDHAGLLHLL-VITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLI 274
            ++    G +++L +IT     V     +    + + + +  ++   +++GS  G+S L+
Sbjct: 298 MVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLTTCMVTMEPGYLFLGSRLGNSLLL 357

Query: 275 K 275
           K
Sbjct: 358 K 358



 Score = 42.7 bits (99), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 42/166 (25%), Positives = 72/166 (43%), Gaps = 35/166 (21%)

Query: 288 EVLERYVNLGPIVDFCVVD---LERQGQG------QVVTCSGAYKDGSLRIVRNGIGINE 338
           EV +  +N+GP  +  + +   L  + QG      +VV CSG  K+G+L +++  I    
Sbjct: 430 EVCDSILNIGPCANASMGEPAFLSEEFQGNPEPDLEVVVCSGHGKNGALSVLQRSIRPQV 489

Query: 339 QASVELQGIKGMWSLRSS-----------------------TDDPFDTFLVVSFISETRI 375
             + EL G   MW++ S+                        D     FL++S    T I
Sbjct: 490 VTTFELPGCHDMWTVISNEVKEDKKVPQSPGSFTATHYSLEEDTKKHGFLILSREDSTMI 549

Query: 376 LAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQ-LVQVTSGSVRLV 420
           L      E+ E +  GF +Q  T+F  +   N+ ++QV+   +RL+
Sbjct: 550 LQTG--QEIMELDTSGFATQGPTVFAGNIGDNKYIIQVSPMGIRLL 593


>gi|384487281|gb|EIE79461.1| hypothetical protein RO3G_04166 [Rhizopus delemar RA 99-880]
          Length = 1468

 Score = 80.5 bits (197), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 134/583 (22%), Positives = 232/583 (39%), Gaps = 117/583 (20%)

Query: 6   IYGRIATLELFRPHG-EAQDFLFIATERYKFCVLQWDAESSELITRAMGDVS-DRIGRP- 62
           I   + T+    P G E  D L +     K  +L+W + ++ +IT ++     D   +  
Sbjct: 111 IITTMGTVRTNSPRGREGCDSLLLGFSDAKMSLLEWSSSTNSIITVSIHYYERDEFKKEF 170

Query: 63  -TDNGQIGI-IDPDCRLIGLHLYDGLFKVIPFDNKGQLKE----------------AFNI 104
            T+     I IDP  R    + YD    V+PF    +L E                +F I
Sbjct: 171 LTNPYPSAIHIDPQQRCAVFNFYDNKLAVLPFRQSDKLDERQGEGEEDEEKWPYYPSFII 230

Query: 105 RLEEL-----QVLDIKFLYGCAKPTIVVLYQ----------DNKDARHVKTYEVALKDKD 149
            L  L      V+D+ FL    +PT+ +L+Q          +NKD   +    + +  K 
Sbjct: 231 DLATLDSRIKNVIDMTFLSDYYEPTLAILFQPEQTWTGRLGNNKDTVSLVVISLDITAKI 290

Query: 150 FVEGP--WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANA--------------- 192
           +   P  +S + L      L+ +P P+ G+L+I   +I++ S  +               
Sbjct: 291 Y---PIIYSIDKLPYDCFKLVAMPKPVTGMLVIAANSILHVSQGSPGMGVAVNGYTKKTT 347

Query: 193 -FKAIPIRPSITK-----------AYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL 240
            F  +   PS+ +           A+G     G R L+    G   L+ +  +  KV G+
Sbjct: 348 DFPGMIYEPSLIELGLSLEGAKALAFG-----GDRCLIFMQNGHWALVEVRRDGNKVVGM 402

Query: 241 KIELLGE---------------TSIASTISYLDNA----VVYIGSSYGDSQLIK-----L 276
            I  +                   +AS  S + N       ++GS  GDS LIK     +
Sbjct: 403 AISEIKHDLPVMEKKPPRFDTPPLLASVPSCVTNVKAGEYFFLGSRVGDSLLIKYDANRV 462

Query: 277 NLQPDAKGSYVEVLERYVNLGPIVDFCVVDL------ERQGQGQVVTCSGAYKDGSLRIV 330
           N Q  A   +  V +  +N GPIVD  V D+      E   Q ++V+ SG  K+G+L + 
Sbjct: 463 NHQSVAPPVF-RVCDTMLNTGPIVDMAVGDVDTVEQQEDWPQLELVSSSGHGKNGALCVF 521

Query: 331 RNGIGINEQASVELQGIKGMWSLR-------SSTDDPFDTFLVVSFISETRILAMNLEDE 383
           +  I      +      + +WS++        + DD FD  L +S    T +L+    DE
Sbjct: 522 QRHIYPQTSFAFHQFDSQAIWSIKCRKNDQQQNEDDDFDKLLFISKSKSTLVLSAG--DE 579

Query: 384 LEETEIEGFCSQTQTLFCHDAI-YNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVA 442
           L+E +  GF ++  T+         ++VQV +  V +++   + ++     P G  +  A
Sbjct: 580 LQEVKT-GFYTRGSTIAVSTLFDATRIVQVYATGVMVLTPEGKRIQT-VPIPRGAKIVEA 637

Query: 443 TANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDI 485
           + +   +LL      ++ L+ GD    ++ H QL   I  + I
Sbjct: 638 SIHDPYILLTLDNNKILALQ-GDASTKDIIHIQLPNHIKDVGI 679



 Score = 76.6 bits (187), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 97/372 (26%), Positives = 156/372 (41%), Gaps = 50/372 (13%)

Query: 688  ESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPE 742
            E E   + L+   T+E +     + FE  C  L C+  D         YY +     L  
Sbjct: 1099 EMEQFSMILVSPVTWEIVDKVEFEEFEQ-CFSLECALLDSKQTSTGRKYYMIIGTGTLKG 1157

Query: 743  ENEPTKGRILVF-IVE----------DGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKI 791
            E+   KG I ++ I+E          + K + +  ++ KGAV ++   +G L A I  K+
Sbjct: 1158 EDTTMKGSIRMYDIIEVVPEPDNPQTNHKFKPVLTEDVKGAVTAMCTVSGHLAACIGSKV 1217

Query: 792  QLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEE 851
             +  W L DD    L         I    + +  +FI++GD  KSI  L ++ E   +  
Sbjct: 1218 IV--WSLEDD--ERLVGVAFIDVQIYVTSMSSIKNFILIGDAQKSIWFLGFQLEPAKLTL 1273

Query: 852  RARDYNANWMSAVEILDDD--IYL--GAEN-NFNLFTVRK---NSEGATDEERGRLEVVG 903
              +DY +  +  V+ + DD  +YL  G  N N +L+        S G       +L   G
Sbjct: 1274 LGKDYQSFDVGCVDFIIDDKSLYLIVGDTNENIDLYQYAPFNLQSFGGQ-----KLMRRG 1328

Query: 904  EYHLGEFVNRFRHGSLVMRLPD-------SDVGQIPTVIFGTVNGVIGVIASLPHEQYLF 956
            ++H+G  V        ++RLP         +  +    + GT NG I VI+S+  + +  
Sbjct: 1329 DFHVGSQVQ------TMVRLPQIEKTEKGFEYSRRHFCLCGTFNGSIAVISSISEKTFKR 1382

Query: 957  LEKLQTNLRKVIKGVGGLNHEQWRSFNNEKK---TVDAKNFLDGDLIESFLDLSRTRMDE 1013
            L  L  +L   ++ V GLN   +R     K+   T   K  LDGDLI  F  LS     E
Sbjct: 1383 LNTLYGHLVNNLQHVAGLNPRAFRLIKGPKQRMSTNRTKAVLDGDLIFEFAGLSIEEQKE 1442

Query: 1014 ISKTMNVSVEEL 1025
             +K +  +V  +
Sbjct: 1443 TTKQIGTTVTRI 1454


>gi|240280498|gb|EER44002.1| pre-mRNA-splicing factor rse1 [Ajellomyces capsulatus H143]
          Length = 305

 Score = 80.5 bits (197), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 76/306 (24%), Positives = 145/306 (47%), Gaps = 35/306 (11%)

Query: 747  TKGRILVF-IVEDGK-LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTR 804
            T G I ++   E+GK L+ I + + +    +L  F G+LLA I   +++Y   ++    R
Sbjct: 9    TAGFIHIYRFQEEGKELEFIHKTKVEQPPMALLGFQGRLLAGIGTDLRIYDLGMKQ-MLR 67

Query: 805  ELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAV 864
            + Q+    H   L + +QT+G  I+V D+ +S++ ++YK++E  +     D  + W +  
Sbjct: 68   KCQASVVPH---LVVGLQTQGSRIIVSDVQESLTYVVYKYQENRLIPFVDDVISRWTTCT 124

Query: 865  EILDDDIYLGAENNFNLFTVR---KNSEGATDEERG---------------RLEVVGEYH 906
             ++D +   G +   NL+ +R   K SE A ++  G               RL +V  ++
Sbjct: 125  TMVDYETVAGGDKFGNLWLLRCPAKASEEADEDGSGAHLIHERQYLQGAPNRLNLVAHFY 184

Query: 907  LGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRK 966
              +     +   LV    D  V    T + GTV+ +I  I+    E+  F + L+  L  
Sbjct: 185  PQDLPTSIQKAQLVTGGRDILVW---TGLQGTVSMLIPFIS---REEVDFFQSLEMQLAA 238

Query: 967  VIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELC 1026
                + G +H  +RS+        AK  +DGDL E++L L   +  +I+  ++ SV E+ 
Sbjct: 239  QNPPLAGRDHLIYRSY-----YAPAKGTIDGDLCETYLLLPNDKKQQIAGELDRSVREIE 293

Query: 1027 KRVEEL 1032
            +++ ++
Sbjct: 294  RKIADM 299


>gi|295665178|ref|XP_002793140.1| cleavage and polyadenylation specificity factor subunit A
            [Paracoccidioides sp. 'lutzii' Pb01]
 gi|226278054|gb|EEH33620.1| cleavage and polyadenylation specificity factor subunit A
            [Paracoccidioides sp. 'lutzii' Pb01]
          Length = 1408

 Score = 80.1 bits (196), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 100/431 (23%), Positives = 185/431 (42%), Gaps = 56/431 (12%)

Query: 655  RSIPLGEHPRRICHQEQSRTFAICSLK--NQSCAEESEMH-----------------FVR 695
            R I LGE    + +   S T+ + + +  +    E+ E+H                  V+
Sbjct: 977  RKIGLGEQVDSVEYSSSSETYVLGTSQKVDFKLPEDDEIHPEWRNEVISFFPQIDKGSVK 1036

Query: 696  LLDDQTFEFISTYPLDTFE----YGCSILSCS-FSDDSNVYYCVGTAYVLPEENEPTKGR 750
            LL+ +T+  I +Y L T E      C  L  S  + +      VGTA    E+    +G 
Sbjct: 1037 LLNPRTWSIIDSYQLRTSERVMCVKCLNLEASEITHERKEMIAVGTALTRGEDIA-ARGC 1095

Query: 751  ILVFIV-----------EDGKLQLIAEKETKGAVYSLNAFNGK--LLAAINQKIQLYKWM 797
            I VF V            + KL+LIA++E KGA+ SL+   G+  L+AA  QK  +    
Sbjct: 1096 IYVFEVIKVVPEVDRPETNRKLKLIAKEEVKGAITSLSGIGGQGFLIAAQGQKCIVRG-- 1153

Query: 798  LRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYN 857
            L++DG+    +      ++  L         ++GD +K +    Y  E   +   ++D  
Sbjct: 1154 LKEDGSLLPVAFMDMQCYVSVLKELKGTGMCIMGDALKGLWFAGYSEEPYKLSLFSKDDG 1213

Query: 858  ANWMSAVEILDDD--IY-LGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRF 914
            +  + A + L D   +Y + A+++ N+  ++ + E     +  RL     +H G+F +  
Sbjct: 1214 SLQVMAADFLPDGKRLYIMVADDDCNIHVLQYDPEDPGSAKGDRLLHRSTFHTGQFASTL 1273

Query: 915  R---HGSLVMRLPDSDV--------GQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTN 963
                  S++ + P+++         G +  V+  +  G I +I  +    Y  L  LQ+ 
Sbjct: 1274 TLLPRTSVLSQGPETEANAMDLDLSGPLHQVLVTSETGSIALITPVSEMAYRRLSALQSQ 1333

Query: 964  LRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVE 1023
            +   ++   GLN   +R+   E   +  +  +DGDL++ +LDL   R  EI+  +   V 
Sbjct: 1334 MINTLEHPCGLNPRAFRAV--ESDGIGGRGMVDGDLVQKWLDLGTQRKAEIASRVGADVW 1391

Query: 1024 ELCKRVEELTR 1034
            E+   +E + +
Sbjct: 1392 EIRADLEAIGK 1402


>gi|154285962|ref|XP_001543776.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
 gi|150407417|gb|EDN02958.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
          Length = 1283

 Score = 79.7 bits (195), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 106/434 (24%), Positives = 182/434 (41%), Gaps = 61/434 (14%)

Query: 655  RSIPLGEHPRRICHQEQSRTFAICSLKNQ----SCAEESEMH-----------------F 693
            R I LGE    + +   S T+ I +  NQ    +  E+ E+H                  
Sbjct: 851  RKIGLGEQVDAVEYSSSSETYVIGT--NQKVDFNLPEDDEIHPEWRNEVISFLPQIDKGS 908

Query: 694  VRLLDDQTFEFISTYPLDTFE----YGCSILSCS-FSDDSNVYYCVGTAYVLPEENEPTK 748
            V+LL  +T+  I +Y L T E      C  L  S  + +      VGTA +   E+   +
Sbjct: 909  VKLLTPRTWSIIDSYNLRTAERIMCVKCLNLEVSEITHERKDTIVVGTA-LTKGEDIAAR 967

Query: 749  GRILVFIV-----------EDGKLQLIAEKETKGAVYSLNAFNGK--LLAAINQKIQLYK 795
            G I +F V            + KL+LIA++E KGAV SL+   G+  L+AA  QK  +  
Sbjct: 968  GCIYIFEVIEVVPEVDRPETNRKLKLIAKEEVKGAVTSLSGIGGQGSLIAAQGQKCIVRG 1027

Query: 796  WMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARD 855
              L++DG+    +      ++  L         ++GD +K +    Y  E   +   ++D
Sbjct: 1028 --LKEDGSLLPVAFMDMQCYVNVLKELKGTGMCIMGDALKGLWFAGYSEEPYKLSLFSKD 1085

Query: 856  YNANWMSAVEILDDD--IY-LGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVN 912
                 + A + L D   +Y L A+++ N+  ++ + E     +  RL     +  G F +
Sbjct: 1086 DGTLQVMAADFLPDGNRLYILVADDDCNIHVLQYDPEDPGSSKGDRLLHRSTFQTGHFAS 1145

Query: 913  RF----RHGSLVMRLPDSD--------VGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKL 960
                  R  +   + PD+D         G +  V+  +  G I +I  +    Y  L  L
Sbjct: 1146 TMTLLPRTATSSSQRPDADPDMMDLDSSGPLHHVLVTSETGSIALITPVSETSYRRLSAL 1205

Query: 961  QTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV 1020
            Q+ L   ++   GLN   +R+   E   +  +  +DGDL++ +LDL   R  EI+  +  
Sbjct: 1206 QSQLTNTLEHPCGLNPRAFRAV--ESDGIGGRGMVDGDLVKRWLDLGTQRKAEIANRVGA 1263

Query: 1021 SVEELCKRVEELTR 1034
             V E+   +E + +
Sbjct: 1264 DVWEIRADLEAIGK 1277


>gi|294654658|ref|XP_456718.2| DEHA2A08932p [Debaryomyces hansenii CBS767]
 gi|218511767|sp|Q6BYK1.2|RSE1_DEBHA RecName: Full=Pre-mRNA-splicing factor RSE1
 gi|199429049|emb|CAG84677.2| DEHA2A08932p [Debaryomyces hansenii CBS767]
          Length = 1256

 Score = 79.3 bits (194), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 83/306 (27%), Positives = 133/306 (43%), Gaps = 52/306 (16%)

Query: 761  LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTREL----QSECGHHGHI 816
            L+ + + E      ++  FNG+LL  ++  ++LY     D G R+L     S   +  +I
Sbjct: 963  LEFVHKTELDYQPTAIIPFNGRLLVGMSNFLRLY-----DLGQRQLLRKASSNIEYLKNI 1017

Query: 817  LALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAE 876
            + L  Q  G  IVVGD   S + + Y   E      A D     ++A+  LD D  +G +
Sbjct: 1018 IRLTHQG-GSRIVVGDSSMSTTFVKYDSTENQFIPFADDIMKRQITALVTLDYDTIIGGD 1076

Query: 877  NNFNLFTVR----------------KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLV 920
               N+F  R                +  E   +    RL+ + E++L +    F  GSLV
Sbjct: 1077 KFGNIFVSRVPETISQQSDKDWSLLRYQESYLNGSGSRLKNICEFYLQDIPTSFTKGSLV 1136

Query: 921  MRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYL-FLEKLQTNLRKVI--------KGV 971
            M       G   ++I+  + G +G++  L  E  + FL  LQ  LRK          K  
Sbjct: 1137 M-------GGKESIIYTGIQGTLGLLLPLSTENEVKFLGDLQLLLRKYFDYNFDDFDKDK 1189

Query: 972  GGLN-----HEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELC 1026
             G N     H ++RS+ N       KN +DGDLIE F +LS++    I   +N +  E+ 
Sbjct: 1190 NGYNLLGKDHLKFRSYYNP-----VKNVMDGDLIERFYELSQSMKIRIGTELNRTPREIE 1244

Query: 1027 KRVEEL 1032
            K++ E+
Sbjct: 1245 KKISEM 1250



 Score = 63.5 bits (153), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 141/680 (20%), Positives = 264/680 (38%), Gaps = 90/680 (13%)

Query: 7   YGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDNG 66
           +G I  ++  R  G  +D++ I ++  K  VL++D E  + +       S    R T  G
Sbjct: 67  FGIIQGIDKIRLVGTQKDYVVITSDSGKLVVLEFDIEKLQFVPLFQEPHSKNGLRRTSPG 126

Query: 67  QIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLK-----EAFNIRLEELQV--LDIKF- 116
           +   +DP  R  LIG    + L   +  +++G+L+     E F+     LQ+  +D  F 
Sbjct: 127 EYLCVDPHNRAILIGAIEKNKLVYKVQSNDEGKLELSSPLETFSKHTLTLQICAMDTGFE 186

Query: 117 --LYGCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEG------PWSQNNLDNGADLLI 168
             ++   +       QDN +        + L   +  +G        S   +   +  LI
Sbjct: 187 NPMFAAIECDYNARQQDNGEEEDAGEASLLLNYYELDQGLNHVVKHKSNEKIPGSSSHLI 246

Query: 169 PVPPPLCGVLIIGEETIVYCSANAFK---AIPIRPS------ITKAYGRVDADGSRYLLG 219
           P+P  + G+L+  + TI+Y   +  K    IPIR +      +     R+  +    L+ 
Sbjct: 247 PLPDFIGGLLVCSKSTIIYAHPSKDKLYLPIPIRSNTNETLIVNHVIHRLKKNNFFILVQ 306

Query: 220 DHAGLLHLLVITHEK--EKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDS---QLI 274
              G    + I H++  E +  + I       ++ +++   +  ++   +  +    Q  
Sbjct: 307 SQLGDCFKITIDHDEVNESIENINITYFDTIPLSQSLNIFKSGFLFANVATNNKLFYQFE 366

Query: 275 KL-------NLQPDAKGSYVEVLERYVN--------------------LGPIVDFCVVDL 307
           KL        LQ      Y  + E  ++                    L PI D  +++ 
Sbjct: 367 KLGDDNNNTTLQSCNFSDYNSIFELDISKRSFKVAGLENLALVDIMETLNPITDGALIET 426

Query: 308 ERQGQGQVVTCSGAYKDGS-LRIVRNGIGINEQASVELQGIK--GMWSLRSSTDDPFDTF 364
            R           A    S L+ + +GI  N   S  L  IK   + + R   +   D +
Sbjct: 427 LRPEVPDPFKQLTALSSHSYLKTLTHGISTNTVVSSPLP-IKPTAIHTTRIFAESANDEY 485

Query: 365 LVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTS 424
           LV+S    ++ L +++ + +EE     F +   T+       + +VQ+ S  +R +  T 
Sbjct: 486 LVISSTLSSQTLVLSIGEVVEEVNDSQFVTNEPTINVQQVGKSSVVQIYSNGIRHIKHTM 545

Query: 425 R-----ELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG--DGILTEVKHAQLE 477
           R     +   +W  P G S+  A+ N  QV++      + Y EI   D  L E +  +LE
Sbjct: 546 RNDTIEKKYTDWYPPAGISIIQASTNNEQVIIGLSNREICYFEIDPHDDQLVEYQE-RLE 604

Query: 478 YE--------ISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPD---LNLITKEHLGG 526
                     IS   I+ +    SY   A VG  +D +++  SL     L ++T + L  
Sbjct: 605 MSGGSISALAISSSSISKLQRKSSY---AIVG-CSDETIQAISLKPHNCLEIVTLQALSA 660

Query: 527 EIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSK 586
                S+ +      + +   + +G  +   ++  TG+L+D +   LG++P+ L      
Sbjct: 661 N--SSSIAMVPHGYSTSVHIGMENGLYVRVTIDEITGKLSDTRIQFLGSKPVQLSVIGLP 718

Query: 587 NTTH--VFAASDRPTVIYSS 604
                 + A S RP + Y S
Sbjct: 719 QLQQNGLLAISSRPWIGYYS 738


>gi|156084934|ref|XP_001609950.1| splicing factor 3b, subunit 3, 130kD [Babesia bovis T2Bo]
 gi|154797202|gb|EDO06382.1| splicing factor 3b, subunit 3, 130kD, putative [Babesia bovis]
          Length = 1169

 Score = 79.3 bits (194), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 61/277 (22%), Positives = 121/277 (43%), Gaps = 22/277 (7%)

Query: 760  KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILAL 819
            +++L+      G V +   + G+LLA++ ++I+LY    +    +     C  HG I   
Sbjct: 906  EIRLLHVTRVGGVVRAFTGYEGRLLASVGKRIRLYALGKKQLLLKAEHRTCSDHGFI--- 962

Query: 820  YVQTRGDFIVVGDLMKSISLL---IYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAE 876
            ++   G  I  GD+ + I +L    Y  E    E         W+++   LD    +  +
Sbjct: 963  WLNAVGSRIFAGDIREGIQILRIKFYSEEAAEFEWVGGATGPRWLTSCAQLDYSTVIAGD 1022

Query: 877  NNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIF 936
               ++F  R   E +T     +LE V ++HLG+        +L              V++
Sbjct: 1023 KFDSIFVTRVPQEEST--RHIQLENVCQFHLGDLPTAMDKAALSQ--------STHVVLY 1072

Query: 937  GTVNGVIGVIASLPHEQYL-FLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFL 995
            GTV G IG +     +  L FL+ L+  +      + G  H  +RS+      V  +  +
Sbjct: 1073 GTVMGSIGALVPFQSKDELDFLQHLEMLMATEAPPLCGREHSFYRSY-----YVPVQQVV 1127

Query: 996  DGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            DGDL E F  L+  +  ++++ ++ +V  + ++++++
Sbjct: 1128 DGDLCEQFRHLTEAQQRKVAQQLDTTVNNVLRKLDDI 1164


>gi|403224220|dbj|BAM42350.1| splicing factor 3b subunit [Theileria orientalis strain Shintoku]
          Length = 1272

 Score = 79.3 bits (194), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 146/670 (21%), Positives = 274/670 (40%), Gaps = 103/670 (15%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
           ++G +  +  FR  G  +D+L + ++  K  +L++  E  +   R   +   + G R   
Sbjct: 58  VFGIVRAISAFRLTGSQKDYLVVGSDSGKLVILEFSLEL-KTFKRVHCETYGKTGVRRIV 116

Query: 65  NGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YG 119
            GQ   +DP  R  ++G         ++  D+K  L  +  +   +   +    +    G
Sbjct: 117 PGQYLGVDPKGRAVMVGAVERQKFVYIMNRDSKANLTISSPLEAHKSHSVCFDLVGLEVG 176

Query: 120 CAKPTIVVLYQ--DNKDARHVK-----TYEVALKDKDFVEGPWSQNNL--------DNGA 164
              P    + Q  +N D+  +      T E   K   F E     N++        D  A
Sbjct: 177 FENPIFASIEQSYENVDSLQIDLDEELTDEALKKGVSFWEMDLGLNHVVKKVTLPVDLTA 236

Query: 165 DLLIPVPPPLCGVLIIGEETIVYCS---ANAFKAIPIRPSITK-------AYG-RVDADG 213
            +L+PVP    GV++  E  +VY +    + + A P R  +++       +Y      D 
Sbjct: 237 HMLVPVPGGPGGVIVCCENFLVYKNLDHGDVYCAYPRRLEVSEHAKLLITSYAVHKMKDF 296

Query: 214 SRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQL 273
              LL    G L+ + + ++  +V  + +       +A+++  L +  +++ S +GD  +
Sbjct: 297 FFILLQSEYGDLYKVDLNYDDAQVKEIVVRYFDTVELATSMCILRSGYLFVASEFGDHHV 356

Query: 274 IKL-------------NLQPDAKGSYVEVLERYV----------NLGPIVDFCVVDLERQ 310
            +              +L P +K + +    R +          +L  IVD  VVD+   
Sbjct: 357 YQFTDLGSNEKDPMCTSLHPHSKSAIIAFKPRVLQNLYETDKLPSLSSIVDMKVVDVMGT 416

Query: 311 GQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTDDPFDTFLVVSF 369
           G  + +   G + +  L+ +R G+   E A  EL G  + +++++S  +  +D F++VSF
Sbjct: 417 GDYEFIMGCGRWYNSRLKSLRYGLSTEELAFNELPGRPRAVFTIKS-LESNYDEFIIVSF 475

Query: 370 ISETRILAMNLEDELEETEIEGFCSQTQTLF-CHDAIYNQ----------------LVQV 412
              T +L++   + +EE     F +   TL  C+ + Y+                  VQV
Sbjct: 476 QGNTLVLSIG--EAVEEVTDSFFLTSITTLHSCYMSNYHATESLEGRFEGGVSDGIFVQV 533

Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
                R    +  ++  EWK P    V +A  N +Q+LL   GG +VY E+ D  L EV 
Sbjct: 534 HDSGFRY---SHGQVVKEWKVPSTKRVKLADNNLNQLLLVLSGGEVVYFELVDNDLEEVA 590

Query: 473 HAQLEYEISCLDI-----NPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGG 526
              L  EI+C+ +     +  GE     +   VG   +I +R+  L   L + + + L  
Sbjct: 591 KRNLSTEITCVALQHTPASKAGERVRKGEFCCVGSIDNI-IRVLKLDKTLKMCSSQLLSN 649

Query: 527 EIIPRSVLLCAFEG---------------ISYLLCALGDGHLLNFLLNMKTGELTDRKKV 571
             +P S+ L   E                 SYL   L +G L+   ++   G L+D++  
Sbjct: 650 NALPESLALVTAETGSGSGGGSSGGLGESESYLYVGLNNGILIRNNVD-SLGNLSDQESR 708

Query: 572 SLGTQPITLR 581
            +GT+ + L+
Sbjct: 709 FMGTKAVKLK 718



 Score = 72.8 bits (177), Expect = 9e-10,   Method: Compositional matrix adjust.
 Identities = 70/282 (24%), Positives = 126/282 (44%), Gaps = 31/282 (10%)

Query: 761  LQLIAEKETKGAVYSLNAFNGKLL-AAINQKIQLYKWMLRDDGTRELQSECGHHGHILAL 819
            ++L+    TKG +   N +  KLL  AI  ++++Y       G ++L  +  H    L  
Sbjct: 1007 IKLLHVTNTKGWIRCFNNYENKLLLCAIGTRLRMYAL-----GKKQLLLKGEHRS--LTN 1059

Query: 820  Y----VQTRGDFIVVGDLMKSISLL---IYKHEEGAIEERARDYNANWMSAVEILDDDIY 872
            Y    ++  G  I  GD+ +S+ LL    Y  E G  E  A      W+S +E+LD    
Sbjct: 1060 YGFMDIKVIGSRIYCGDIRESVQLLRIKFYGEETGEFEMTATSTGPRWLSTMELLDYSTV 1119

Query: 873  LGAENNFNLFTVR-KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQI 931
            + A+   ++F  R  N+E          E   ++HL + V       L     +S  G++
Sbjct: 1120 MAADKFDSVFVARVPNNEDVV--RNNYFEYQNQFHLSDLVTSLSKVKL-----NSVYGEL 1172

Query: 932  PTVIFGTVNGVIGVIASLPHEQYL-FLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVD 990
              V++ T+ G +G   +   +  + FL+ L+  L   +  + G   +  RS+        
Sbjct: 1173 --VVYSTILGSLGAFVTFTSKDEVDFLQHLEMLLANELDTLSGREAQMHRSY-----YFP 1225

Query: 991  AKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
             +N +DGDL E + +LS     +I+  +NV V E+ K+++ +
Sbjct: 1226 VQNVIDGDLCELYFNLSSDLKAKIANQLNVKVAEVVKKLKNI 1267


>gi|12805469|gb|AAH02210.1| Ddb1 protein, partial [Mus musculus]
          Length = 86

 Score = 79.3 bits (194), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 41/86 (47%), Positives = 53/86 (61%), Gaps = 10/86 (11%)

Query: 961  QTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN- 1019
            Q  L KVIK VG + H  WRSF+ E+KT  A  F+DGDLIESFLD+SR +M E+   +  
Sbjct: 1    QNRLNKVIKSVGKIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQY 60

Query: 1020 ---------VSVEELCKRVEELTRLH 1036
                      + ++L K VEELTR+H
Sbjct: 61   DDGSGMKREATADDLIKVVEELTRIH 86


>gi|339253000|ref|XP_003371723.1| cleavage and polyadenylation specificity factor subunit 1
            [Trichinella spiralis]
 gi|316967988|gb|EFV52332.1| cleavage and polyadenylation specificity factor subunit 1
            [Trichinella spiralis]
          Length = 1376

 Score = 79.0 bits (193), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 77/308 (25%), Positives = 128/308 (41%), Gaps = 29/308 (9%)

Query: 739  VLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWML 798
            V+PE + P         +   KL+++  KE KG V SL +  G LL  + QK+  Y W  
Sbjct: 1081 VVPEPDRP---------MTKFKLKVVYSKEQKGPVTSLCSLRGYLLTGMGQKV--YIWQY 1129

Query: 799  RDDGTRELQSECGHHGHILALYVQTRGD---FIVVGDLMKSISLLIYKHEEGAIEERARD 855
            +D+         G     L +YV          +  D    +SLL Y+ E  A+   +RD
Sbjct: 1130 KDNAL------VGISFLDLQVYVHQMASIRYLALTADAFFGVSLLRYQEEYKALSLVSRD 1183

Query: 856  YNANWMSAVEILDDDI---YLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVN 912
               + + AVE L D     +L      ++ T     E        RL    +YH G  VN
Sbjct: 1184 PRPDEVLAVEFLVDRTDLSFLMTSAAGDILTYVYLPESLDSFGGQRLVPQADYHFGSQVN 1243

Query: 913  RFR----HGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVI 968
             F     H   +      +V Q   +IF + +G +  +  LP  +Y  L  LQ+ L  ++
Sbjct: 1244 AFVRMRCHAQEIAGRKRQEVLQRQGLIFASSDGSVNYLLPLPEREYRLLGMLQSLLIDML 1303

Query: 969  KGVGGLNHEQWRS--FNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELC 1026
                GLN + +R+  F N       KN +DG++   +L +   + ++I + +  S  ++ 
Sbjct: 1304 PSFAGLNVDDYRTVRFPNSCLREPTKNIIDGNICMLYLYIDALQQEDIVRQIGSSHSQIM 1363

Query: 1027 KRVEELTR 1034
              +  + R
Sbjct: 1364 LELAYMER 1371



 Score = 63.2 bits (152), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 80/335 (23%), Positives = 140/335 (41%), Gaps = 76/335 (22%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
           +YGR+A++ + R  G   D + +A +  K  V+ +D E+  L+T +M    D      D 
Sbjct: 75  VYGRLASVAIARLTGFQLDVILLAIDDAKLSVVGYDIETHSLVTLSMHYYED------DL 128

Query: 66  GQIGI----------IDPDCRLIGLHLYDGLFKVIPF------------DNKGQLKEAFN 103
            ++G           +DP+ R   + +Y     V+P             D   +    F+
Sbjct: 129 FKLGFTRFEIPPMLRMDPERRCAAMTIYGAHLVVLPLVRESLYESMNIVDPSQRPGWPFS 188

Query: 104 IRLEEL------------QVLDIKFLYGCAKPTIVVLYQDNKDAR-----HVKTYE---V 143
           +RL                V D+ FL+G  +PT+++LY+  +           TY+   V
Sbjct: 189 LRLTSYTVAFNAIDAKMHNVTDMCFLHGFYEPTVLLLYEPTQTTAGRVVVRQDTYQILAV 248

Query: 144 ALKDKDFVEGP-WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYC-------------S 189
           +L  KD      W+  NL   A  L+ +P PL GVL+    +I+Y              +
Sbjct: 249 SLNPKDKTHAVIWTLGNLPFDAFALLALPKPLGGVLLFSVNSIIYLNQSVPCCGILINDN 308

Query: 190 ANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLL------VITHEKEKVTGLKIE 243
              F   P+R    ++   V  DGS   L D A    +L      V++   +++  +K  
Sbjct: 309 GRGFTNYPLR---DRSELMVTLDGSHAALIDSANAALVLRSGLVFVVSLLFDRLNMVKEI 365

Query: 244 LLGETSI----ASTISY-LDNAVVYIGSSYGDSQL 273
           LL  +S+     ST+S  + +  +++GS+ G+S L
Sbjct: 366 LLTASSVRGAAPSTVSACVSSNCLFVGSAIGNSAL 400


>gi|324499955|gb|ADY39993.1| Cleavage and polyadenylation specificity factor subunit 1 [Ascaris
            suum]
          Length = 1434

 Score = 78.6 bits (192), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 73/306 (23%), Positives = 146/306 (47%), Gaps = 25/306 (8%)

Query: 741  PEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRD 800
            PE  +PT            +++ + +KE KG V SL + NG LLA + QK+  + W+ RD
Sbjct: 1139 PEPGQPTSKH---------RIKTLYDKEQKGPVTSLCSCNGYLLAGMGQKV--FIWLFRD 1187

Query: 801  DGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNA-- 858
            +  + + S    H +I  L V  R +  +  D+ +S++LL Y+ E  A+   +RD  A  
Sbjct: 1188 NNLQGI-SFLDMHFYIHQL-VGVR-NLALACDIYRSVALLRYQEEYKALSLASRDMRAVV 1244

Query: 859  -NWMSAVEILDDD--IYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFR 915
               M+A  ++D+    ++ ++   N+       E        RL +  E ++G  VN F 
Sbjct: 1245 QPPMAAQFLIDNRQMAFIMSDEAANIAVFNYLPEALESSGGERLILRSEINIGTNVNSFM 1304

Query: 916  H--GSLVMRLPDSDVGQI--PTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGV 971
               G +     +++   +   +V+F +++G  G +  L  + +  L  LQ  +  ++   
Sbjct: 1305 RVKGHISSGFVENEHYSLNRQSVLFCSLDGSFGFVRPLSEKVFRRLHMLQQLMSSLVAQA 1364

Query: 972  GGLNHEQWRSFNNEKKT--VDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRV 1029
             GLN +  R+   ++    ++ +N +DGD++  +L LS    +++++ +  S   +   +
Sbjct: 1365 AGLNVKGSRAARPQRPNHYLNTRNMVDGDVVFQYLHLSLADKNDLARKLGTSRYHIIDDL 1424

Query: 1030 EELTRL 1035
             E++RL
Sbjct: 1425 TEISRL 1430



 Score = 64.7 bits (156), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 107/511 (20%), Positives = 193/511 (37%), Gaps = 93/511 (18%)

Query: 1   MLDVPIYGRIATLELFR-PHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRI 59
           ++ V +   + +L + R P   +   L +  +  K  V+ + A    L T ++    + +
Sbjct: 110 IIHVRLLAPVKSLAVARIPQNPSCSSLLLGFDTAKLSVVGFSAAERSLKTISLHCFEEEM 169

Query: 60  ---GRPTDNGQIGI-IDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEEL-----Q 110
              G  TD     I +DP  R   + +Y     V+PFD+       + + L  +      
Sbjct: 170 LKDGYVTDLPSPVIRVDPAQRCAVMLIYGRYLAVLPFDDTSPHLHTYTVALSSIDPRLVN 229

Query: 111 VLDIKFLYGCAKPTIVVLYQDNKD--ARHVKTYE------VALKDKDFVEGP-WSQNNLD 161
           ++DI FL G  +PT++ LY+  +    R    Y+      V+L  K+ V    W  NNL 
Sbjct: 230 IIDIAFLDGYYEPTLLFLYEPAQTTAGRACVRYDTVCMLGVSLNTKEQVHASVWQLNNLP 289

Query: 162 NGADLLIPVPPPLCGVLIIGEETIVY-------CSA------NAFKAIPIRP------SI 202
              + ++ +P P+ G LIIG   ++Y       C +      + F   P++       ++
Sbjct: 290 MDCNQVLMIPRPIGGALIIGANELIYLNQSVPPCGSLLNSCMDGFTKFPLKSEKEMALTL 349

Query: 203 TKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVV 262
                 V +     +      L  L ++      V  ++ +   + SI  T++      +
Sbjct: 350 DGCAACVISTNKVVVCARSGALFILTLVVDSTNSVKSIEFKHEFDVSIPHTVTACSPGYL 409

Query: 263 YIGSSYGDSQLIKLNLQ------PDAKGSYVE---------------------------- 288
           ++GS  GDS  I+   +      P  K   VE                            
Sbjct: 410 FVGSRVGDSLFIEYVSELVPVDDPIEKKLKVEVPQDDLEDEDLELYGKALPSVISQDVSV 469

Query: 289 ------VLERYVNLGP---IVDFCVVDLERQGQGQV--------VTCSGAYKDGSLRIVR 331
                 VL+R +N+ P   +   C   L    Q Q         V   G  KD S+ I +
Sbjct: 470 EKMRFRVLDRMLNVAPCKKMTSGCSEGLNSYLQEQPRLDPVFDRVCACGHGKDSSICIFQ 529

Query: 332 NGIGINEQASVELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEG 391
             I  +   S  ++G+   W++    D   DT + +    E   LA+  +++L E E   
Sbjct: 530 QSIRPDIITSSSIEGVIQYWAVGRRED---DTHMYIIASKELGSLALETDNDLVELEAPV 586

Query: 392 FCSQTQTLFCHDAIYNQL-VQVTSGSVRLVS 421
           F +   T+   +     L VQVT+ S+ +V+
Sbjct: 587 FITSESTIAAGELADGGLSVQVTTSSIVVVA 617


>gi|24415580|gb|AAN41460.1| putative cleavage and polyadenylation specificity factor 160 kDa
           subunit [Arabidopsis thaliana]
          Length = 1442

 Score = 78.2 bits (191), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 95/403 (23%), Positives = 160/403 (39%), Gaps = 108/403 (26%)

Query: 103 NIR-LEELQVLDIKFLYGCAKPTIVVLYQDNK------DARHVKTYEVALK-DKDFVEGP 154
           N+R LE   V D  FL+G  +P IV+L ++          +H      AL  +    + P
Sbjct: 240 NLRDLEMKHVKDFVFLHGYIEPVIVILQEEEHTWAGRVSWKHHTCVLSALSINSTLKQHP 299

Query: 155 --WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDAD 212
             WS  NL + A  L+ VP P+ GVL++   TI Y S +A  A+ +    + A    +  
Sbjct: 300 VIWSAINLPHDAYKLLAVPSPIGGVLVLCANTIHYHSQSASCALALNNYASSADSSQELP 359

Query: 213 GSRY-----------------LLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTIS 255
            S +                 LL   +G L LL + ++   V  L +     + +AS I+
Sbjct: 360 ASNFSVELDAAHGTWISNDVALLSTKSGELLLLTLIYDGRAVQRLDLSKSKASVLASDIT 419

Query: 256 YLDNAVVYIGSSYGDSQLIKLNLQ------------------------------------ 279
            + N++ ++GS  GDS L++ + +                                    
Sbjct: 420 SVGNSLFFLGSRLGDSLLVQFSCRSGPAASLPGLRDEDEDIEGEGHQAKRLRMTSDTFQD 479

Query: 280 -------------PD----AKGSY-VEVLERYVNLGPIVDFC----------VVDLERQG 311
                        PD    A+ S+   V +  VN+GP+ DF              + +Q 
Sbjct: 480 TIGNEELSLFGSTPDNSDSAQKSFSFAVRDSLVNVGPVKDFAYGLRINADANATGVSKQS 539

Query: 312 QGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL--------------RSST 357
             ++V CSG  K+G+L ++R  I       VEL G KG+W++               ++ 
Sbjct: 540 NYELVCCSGHGKNGALCVLRQSIRPEMITEVELPGCKGIWTVYHKSSRGHNADSSKMAAD 599

Query: 358 DDPFDTFLVVSFISETRILAMNLEDELEE-TEIEGFCSQTQTL 399
           +D +  +L++S   E R + +   D L E TE   +  Q +T+
Sbjct: 600 EDEYHAYLIISL--EARTMVLETADLLTEVTESVDYYVQGRTI 640



 Score = 78.2 bits (191), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 78/302 (25%), Positives = 137/302 (45%), Gaps = 24/302 (7%)

Query: 702  FEFISTYPLDTFEYGCSI----LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVE 757
            +E  +  P+ T E+  ++    L  + + ++     VGTAYV   E+   +GR+L+F   
Sbjct: 1097 WETKAKIPMQTSEHALTVRVVTLLNASTGENETLLAVGTAYV-QGEDVAARGRVLLFSFG 1155

Query: 758  ---DGKLQLIAE---KETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECG 811
               D    ++ E   +E KGA+ ++ +  G LL +   KI L+KW    +GT        
Sbjct: 1156 KNGDNSQNVVTEVYSRELKGAISAVASIQGHLLISSGPKIILHKW----NGTELNGVAFF 1211

Query: 812  HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDD-- 869
                +  + +     FI++GD+ KSI  L +K +   +   A+D+ +    A E L D  
Sbjct: 1212 DAPPLYVVSMNVVKSFILLGDVHKSIYFLSWKEQGSQLSLLAKDFESLDCFATEFLIDGS 1271

Query: 870  DIYLGAENNFNLFTVRKNSEGATDEERG-RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV 928
             + L   +      V   +    +  +G +L    E+H+G  V++F    L +++  S  
Sbjct: 1272 TLSLAVSDEQKNIQVFYYAPKMIESWKGLKLLSRAEFHVGAHVSKF----LRLQMVSSGA 1327

Query: 929  GQIP--TVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEK 986
             +I    ++FGT++G  G IA L    +  L+ LQ  L   +  V GLN   +R F +  
Sbjct: 1328 DKINRFALLFGTLDGSFGCIAPLDEVTFRRLQSLQKKLVDAVPHVAGLNPLAFRQFRSSG 1387

Query: 987  KT 988
            K 
Sbjct: 1388 KA 1389


>gi|10257491|dbj|BAB11613.1| cleavage and polyadenylation specificity factor subunit [Arabidopsis
            thaliana]
          Length = 1448

 Score = 78.2 bits (191), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 78/302 (25%), Positives = 137/302 (45%), Gaps = 24/302 (7%)

Query: 702  FEFISTYPLDTFEYGCSI----LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVE 757
            +E  +  P+ T E+  ++    L  + + ++     VGTAYV   E+   +GR+L+F   
Sbjct: 1103 WETKAKIPMQTSEHALTVRVVTLLNASTGENETLLAVGTAYV-QGEDVAARGRVLLFSFG 1161

Query: 758  ---DGKLQLIAE---KETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECG 811
               D    ++ E   +E KGA+ ++ +  G LL +   KI L+KW    +GT        
Sbjct: 1162 KNGDNSQNVVTEVYSRELKGAISAVASIQGHLLISSGPKIILHKW----NGTELNGVAFF 1217

Query: 812  HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDD-- 869
                +  + +     FI++GD+ KSI  L +K +   +   A+D+ +    A E L D  
Sbjct: 1218 DAPPLYVVSMNVVKSFILLGDVHKSIYFLSWKEQGSQLSLLAKDFESLDCFATEFLIDGS 1277

Query: 870  DIYLGAENNFNLFTVRKNSEGATDEERG-RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV 928
             + L   +      V   +    +  +G +L    E+H+G  V++F    L +++  S  
Sbjct: 1278 TLSLAVSDEQKNIQVFYYAPKMIESWKGLKLLSRAEFHVGAHVSKF----LRLQMVSSGA 1333

Query: 929  GQIP--TVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEK 986
             +I    ++FGT++G  G IA L    +  L+ LQ  L   +  V GLN   +R F +  
Sbjct: 1334 DKINRFALLFGTLDGSFGCIAPLDEVTFRRLQSLQKKLVDAVPHVAGLNPLAFRQFRSSG 1393

Query: 987  KT 988
            K 
Sbjct: 1394 KA 1395



 Score = 59.3 bits (142), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 56/210 (26%), Positives = 93/210 (44%), Gaps = 27/210 (12%)

Query: 103 NIR-LEELQVLDIKFLYGCAKPTIVVLYQDNK------DARHVKTYEVALK-DKDFVEGP 154
           N+R LE   V D  FL+G  +P IV+L ++          +H      AL  +    + P
Sbjct: 240 NLRDLEMKHVKDFVFLHGYIEPVIVILQEEEHTWAGRVSWKHHTCVLSALSINSTLKQHP 299

Query: 155 --WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDAD 212
             WS  NL + A  L+ VP P+ GVL++   TI Y S +A  A+ +    + A    +  
Sbjct: 300 VIWSAINLPHDAYKLLAVPSPIGGVLVLCANTIHYHSQSASCALALNNYASSADSSQELP 359

Query: 213 GSRY-----------------LLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTIS 255
            S +                 LL   +G L LL + ++   V  L +     + +AS I+
Sbjct: 360 ASNFSVELDAAHGTWISNDVALLSTKSGELLLLTLIYDGRAVQRLDLSKSKASVLASDIT 419

Query: 256 YLDNAVVYIGSSYGDSQLIKLNLQPDAKGS 285
            + N++ ++GS  GDS L++ + +     S
Sbjct: 420 SVGNSLFFLGSRLGDSLLVQFSCRSGPAAS 449



 Score = 48.5 bits (114), Expect = 0.020,   Method: Compositional matrix adjust.
 Identities = 36/136 (26%), Positives = 62/136 (45%), Gaps = 27/136 (19%)

Query: 289 VLERYVNLGPIVDFC----------VVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINE 338
           V +  VN+GP+ DF              + +Q   ++V CSG  K+G+L ++R  I    
Sbjct: 513 VRDSLVNVGPVKDFAYGLRINADANATGVSKQSNYELVCCSGHGKNGALCVLRQSIRPEM 572

Query: 339 QASVELQGIKGMWSL--------------RSSTDDPFDTFLVVSFISETRILAMNLEDEL 384
              VEL G KG+W++               ++ +D +  +L++S   E R + +   D L
Sbjct: 573 ITEVELPGCKGIWTVYHKSSRGHNADSSKMAADEDEYHAYLIISL--EARTMVLETADLL 630

Query: 385 EE-TEIEGFCSQTQTL 399
            E TE   +  Q +T+
Sbjct: 631 TEVTESVDYYVQGRTI 646


>gi|30696088|ref|NP_199979.2| cleavage and polyadenylation specificity factor subunit 1
            [Arabidopsis thaliana]
 gi|290457637|sp|Q9FGR0.2|CPSF1_ARATH RecName: Full=Cleavage and polyadenylation specificity factor subunit
            1; AltName: Full=Cleavage and polyadenylation specificity
            factor 160 kDa subunit; Short=AtCPSF160; Short=CPSF 160
            kDa subunit
 gi|332008729|gb|AED96112.1| cleavage and polyadenylation specificity factor subunit 1
            [Arabidopsis thaliana]
          Length = 1442

 Score = 78.2 bits (191), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 78/302 (25%), Positives = 137/302 (45%), Gaps = 24/302 (7%)

Query: 702  FEFISTYPLDTFEYGCSI----LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVE 757
            +E  +  P+ T E+  ++    L  + + ++     VGTAYV   E+   +GR+L+F   
Sbjct: 1097 WETKAKIPMQTSEHALTVRVVTLLNASTGENETLLAVGTAYV-QGEDVAARGRVLLFSFG 1155

Query: 758  ---DGKLQLIAE---KETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECG 811
               D    ++ E   +E KGA+ ++ +  G LL +   KI L+KW    +GT        
Sbjct: 1156 KNGDNSQNVVTEVYSRELKGAISAVASIQGHLLISSGPKIILHKW----NGTELNGVAFF 1211

Query: 812  HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDD-- 869
                +  + +     FI++GD+ KSI  L +K +   +   A+D+ +    A E L D  
Sbjct: 1212 DAPPLYVVSMNVVKSFILLGDVHKSIYFLSWKEQGSQLSLLAKDFESLDCFATEFLIDGS 1271

Query: 870  DIYLGAENNFNLFTVRKNSEGATDEERG-RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV 928
             + L   +      V   +    +  +G +L    E+H+G  V++F    L +++  S  
Sbjct: 1272 TLSLAVSDEQKNIQVFYYAPKMIESWKGLKLLSRAEFHVGAHVSKF----LRLQMVSSGA 1327

Query: 929  GQIP--TVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEK 986
             +I    ++FGT++G  G IA L    +  L+ LQ  L   +  V GLN   +R F +  
Sbjct: 1328 DKINRFALLFGTLDGSFGCIAPLDEVTFRRLQSLQKKLVDAVPHVAGLNPLAFRQFRSSG 1387

Query: 987  KT 988
            K 
Sbjct: 1388 KA 1389



 Score = 77.0 bits (188), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 95/403 (23%), Positives = 159/403 (39%), Gaps = 108/403 (26%)

Query: 103 NIR-LEELQVLDIKFLYGCAKPTIVVLYQDNK------DARHVKTYEVALK-DKDFVEGP 154
           N+R LE   V D  FL+G  +P IV+L ++          +H      AL  +    + P
Sbjct: 240 NLRDLEMKHVKDFVFLHGYIEPVIVILQEEEHTWAGRVSWKHHTCVLSALSINSTLKQHP 299

Query: 155 --WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDAD 212
             WS  NL + A  L+ VP P+ GVL++   TI Y S +A  A+ +    + A    +  
Sbjct: 300 VIWSAINLPHDAYKLLAVPSPIGGVLVLCANTIHYHSQSASCALALNNYASSADSSQELP 359

Query: 213 GSRY-----------------LLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTIS 255
            S +                 LL   +G L LL + ++   V  L +     + +AS I+
Sbjct: 360 ASNFSVELDAAHGTWISNDVALLSTKSGELLLLTLIYDGRAVQRLDLSKSKASVLASDIT 419

Query: 256 YLDNAVVYIGSSYGDSQLI------------------------------KLNLQPD---- 281
            + N++ ++GS  GDS L+                              +L +  D    
Sbjct: 420 SVGNSLFFLGSRLGDSLLVQFSCRSGPAASLPGLRDEDEDIEGEGHQAKRLRMTSDTFQD 479

Query: 282 -------------------AKGSY-VEVLERYVNLGPIVDFC----------VVDLERQG 311
                              A+ S+   V +  VN+GP+ DF              + +Q 
Sbjct: 480 TIGNEELSLFGSTPNNSDSAQKSFSFAVRDSLVNVGPVKDFAYGLRINADANATGVSKQS 539

Query: 312 QGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL--------------RSST 357
             ++V CSG  K+G+L ++R  I       VEL G KG+W++               ++ 
Sbjct: 540 NYELVCCSGHGKNGALCVLRQSIRPEMITEVELPGCKGIWTVYHKSSRGHNADSSKMAAD 599

Query: 358 DDPFDTFLVVSFISETRILAMNLEDELEE-TEIEGFCSQTQTL 399
           +D +  +L++S   E R + +   D L E TE   +  Q +T+
Sbjct: 600 EDEYHAYLIISL--EARTMVLETADLLTEVTESVDYYVQGRTI 640


>gi|320165260|gb|EFW42159.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
          Length = 2631

 Score = 78.2 bits (191), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 56/205 (27%), Positives = 95/205 (46%), Gaps = 32/205 (15%)

Query: 492  PSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEGISYLLCALGDG 551
            P  + + A+ +W  + V +F LP + L+ +    G+++ RS+ L A EG   L   L DG
Sbjct: 1661 PKATTLVAISLWKTMDVLVFQLPGMELVAQHRFAGDVVVRSLALHAIEGRIKLFVGLADG 1720

Query: 552  HLLNFLLNMKTG--------ELTDRKKVSLGTQPITLRTF---------------SSKNT 588
            HL+ F L  ++         E  +R+ V +G+ P+TL  F               + +  
Sbjct: 1721 HLVQFTLVPRSVVAGTKCELEFAERRAVLVGSSPLTLEVFPLAGPADNDHGESVEAGRPP 1780

Query: 589  THVFAASDRPTVIYS----SNKKLLYSNVNLKEVSHMCPFNSAAFPDS-LAIAKEGELTI 643
              +    D+P ++      S  ++L  N+ L E S  C F+SA F  S L +A +G +++
Sbjct: 1781 VGLLVCCDKPALVLPGKTDSQTQVLPLNIALAEKS--CCFSSALFRQSTLLLAAQGRISV 1838

Query: 644  GTID--DIQKLHIRSIPLGEHPRRI 666
             T    D ++LH   + LG  P  I
Sbjct: 1839 ATFAQLDGKQLHAEKLLLGATPLAI 1863


>gi|124505011|ref|XP_001351247.1| CPSF (cleavage and polyadenylation specific factor), subunit A,
            putative [Plasmodium falciparum 3D7]
 gi|7768292|emb|CAB11136.2| CPSF (cleavage and polyadenylation specific factor), subunit A,
            putative [Plasmodium falciparum 3D7]
          Length = 2870

 Score = 78.2 bits (191), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 108/505 (21%), Positives = 209/505 (41%), Gaps = 79/505 (15%)

Query: 565  LTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSS-NKKLLYSNVNLKEVSHMCP 623
             +++KK+ +    I       K    +F   + P +IYS   KK+  S ++LK +  +  
Sbjct: 2344 FSEKKKIDINQSNII------KKYNFLFVCCESPIIIYSDLKKKINVSKLSLKNIYIVDI 2397

Query: 624  FNS----AAFPDSLAIAKEGE------------LTIGTIDDIQKLHIRSIPLGEHPRRIC 667
            FN       F + L+  K+ +            + I  ++ I+K  ++ IP      +I 
Sbjct: 2398 FNDFNYLNPFHNFLSFKKKNQNNFYFIFYDGSNIHISPLNQIKKTFLKKIPFHRTVEKIA 2457

Query: 668  -HQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCS-------- 718
             H +     A C  + +    E     +   D        TY + + +Y  S        
Sbjct: 2458 YHSDTGLLIAACPSEEKHKTNEMMKQIICFFDPYHDSIKYTYIIPS-KYTVSTIIIYDNE 2516

Query: 719  -ILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKE------TKG 771
             ++  +F   S ++  VGT     +  EPT G I +FI +  K  +   K         G
Sbjct: 2517 KLMKSNFDVTSFIF--VGTCNSNEKYTEPTSGHIHIFIAK-KKANIFEIKHIYTHNINYG 2573

Query: 772  AVYSLNAFNGKLLAAINQKIQL---------YKWMLRDDGTRELQSECGHHGHILALY-- 820
             V +L  ++ K++A IN  + +         Y+  +     + LQ +   +  I+ L   
Sbjct: 2574 GVTNLVPYDDKIVATINNMVVILDINNLIIKYEAFM---DPQNLQPKIEGNNAIVELVSF 2630

Query: 821  --------VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIY 872
                    V   GD+IVVGD+M S+++L Y +E   + E  RDY+  W +++  L     
Sbjct: 2631 TPSSWIMTVDVYGDYIVVGDIMTSVTILQYDYENSQLFEVCRDYSNIWCTSLCALSKSHI 2690

Query: 873  LGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR---FRHGSLVMRLPDSDVG 929
            + ++ + N   ++K+     DE+  +L  V  ++ G  +N+     + +L+    D D  
Sbjct: 2691 VVSDMDANFIILQKSKFKYNDEDSYKLSSVSLFNHGSIINKMLPLSNTNLIEE--DYDKR 2748

Query: 930  QIPT----VIFGTVNGVIGVIASLPHEQYLFLEK---LQTNLRKVIKGVGGLNHEQWRSF 982
             I T    ++  +  G I V+  +P   +   +K   ++  +   I  +G L+H  +R +
Sbjct: 2749 NILTKNDGILCASSEGSISVL--IPFSSFANFKKALCIEIAITDNISSIGNLSHNAYREY 2806

Query: 983  NNEKKTVDAKNFLDGDLIESFLDLS 1007
                ++   K  +DG+L++ F  +S
Sbjct: 2807 KVNFRSKHCKGIVDGELLKMFFHMS 2831



 Score = 43.5 bits (101), Expect = 0.64,   Method: Compositional matrix adjust.
 Identities = 27/108 (25%), Positives = 55/108 (50%), Gaps = 6/108 (5%)

Query: 21  EAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDNGQIGIIDPDCRLIGL 80
           + + +LF+ T +Y   +L++D + ++ IT +  ++ +  G   +   I ++D   + I  
Sbjct: 83  KVKSYLFVLTRKYVLLLLEYDVKENDFITLSKINLCELNGLHLEEDVIFLLDERHKTILF 142

Query: 81  HLYDGLFKVI--PFDNKGQLKEAFNIRLEELQVLDIKFL----YGCAK 122
           + Y  + K I   +DN   L   + +R++E  ++DI FL     GC K
Sbjct: 143 YGYKNILKYIYLDYDNFLNLNNVYTMRIDESLIIDIAFLGTHTMGCNK 190



 Score = 43.1 bits (100), Expect = 0.85,   Method: Compositional matrix adjust.
 Identities = 17/69 (24%), Positives = 38/69 (55%), Gaps = 1/69 (1%)

Query: 286  YVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQ 345
            Y+E+L    N+GPI+D CVV   +  + +++TC+   + G + I+++G+  N    +   
Sbjct: 970  YIEILSVIQNMGPILDMCVVK-NKNNEYEIITCNSYGRTGCVSIIQSGLKTNITCDLNFN 1028

Query: 346  GIKGMWSLR 354
             +   + ++
Sbjct: 1029 KLNNFFVVK 1037


>gi|290981010|ref|XP_002673224.1| CPSF A subunit [Naegleria gruberi]
 gi|284086806|gb|EFC40480.1| CPSF A subunit [Naegleria gruberi]
          Length = 1373

 Score = 77.8 bits (190), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 103/528 (19%), Positives = 218/528 (41%), Gaps = 92/528 (17%)

Query: 591  VFAASDRPTVIYSSNKKL-LYSNVNLKEVSHMCPFNSAAFPDSLAI--------AKEGEL 641
            +F   ++P  +++ +  L ++   +   V+   P++    P              K+ +L
Sbjct: 849  LFKTGEKPFWLFTEHSNLRVHPTQSRDPVTTFTPYHHENCPHGFIYLTDKEQDNKKQSKL 908

Query: 642  TIGTIDDIQKLHI----RSIPLGEHPRRIC-HQEQSRTFAICSLKNQSCAEES------- 689
             I +++   K +     R I L   P  I  HQ+ +   A  S+  ++   +S       
Sbjct: 909  HISSLNANVKFNAYWPQRKILLKSTPNVITFHQDTNTCLAFTSVPVKAILPDSIPFPEGK 968

Query: 690  ------EMHFVRLLDDQTFEFISTYPLDTFEYGCS-----ILSCSFSDDSNVYY------ 732
                  + H V+L     ++ +  +  D  E   +     +    ++DD+++ +      
Sbjct: 969  CPPPAEQKHTVKLFSGHNWQEMDKFEFDLHESAVAAKVVYLSKEEYNDDTDISFEEPLNS 1028

Query: 733  ---------CVGTAYVLPEENEPTKGRILVFIVE-------DGKLQLIAEKETKGAVYSL 776
                      VGTAYV   E E  +GR+L+F ++       + KL LI+    KG + +L
Sbjct: 1029 RKQDLVSVVAVGTAYV-QSERELCRGRLLLFDLDPILGRENEYKLNLISSTSVKGPITTL 1087

Query: 777  NAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKS 836
               +  ++ ++  +I  Y +   D   + +     +        + T  +FI+ GD+ KS
Sbjct: 1088 EQVDRYIICSVGNRIYTYYF---DWEEKRMHITSFYDTQFYTASLNTVRNFIMFGDIYKS 1144

Query: 837  ISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG-----AENNFNLFTVRKNSEGA 891
            +S L +K +   +   A+D     + + E L ++  LG        N  +F+     + +
Sbjct: 1145 VSFLRWKEKGHRLILLAKDNRPLQVVSSEFLVNNDLLGLAVIDTSKNLQIFSYLPQHQES 1204

Query: 892  TDEERGR-LEVVGEYHLGEFVNRFRHGSLVMRLPDSD------VGQIPT----------- 933
             D   GR L  V ++H+G  +N       V  LPD +      V + P            
Sbjct: 1205 ND---GRNLVPVCDFHIGTLINSLIRMK-VRELPDDNTIRLGNVNEKPKQSGKKDITKTN 1260

Query: 934  -----VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKT 988
                 ++FG+V+G IG +A +    +  L  LQ  +   ++   GL+ + +R +   ++T
Sbjct: 1261 PNHQFILFGSVDGAIGYVAPINEVTHRRLFALQLKMYTQLEQAAGLHPKSFRLYKPLERT 1320

Query: 989  V--DAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTR 1034
                 KN +DG LI ++ +++     ++++ +  + + + + ++EL +
Sbjct: 1321 EYNYKKNIIDGQLIWNYANINTILQRDLARQIGTNSDNILRSIQELNQ 1368



 Score = 58.2 bits (139), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 126/668 (18%), Positives = 237/668 (35%), Gaps = 164/668 (24%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATE-RYKFCVLQWDAESSELITRAMGDVSDRIGRPTD 64
           ++G I ++      G  ++ L I  E   K  V+++D + S+L T ++  + D   R   
Sbjct: 58  LFGVIDSIIAVCLQGMKKEMLLINFEDEAKVSVVEFDEKRSDLKTLSLHYLEDDFLREGK 117

Query: 65  ----NGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQ----------------------- 97
               + Q  I+DP  R   + + D    ++PF   G+                       
Sbjct: 118 ARFFHNQPIILDPQNRFATVIICDSKLVILPFRQSGEDVSLSTEDNFLFALSGDQEEANE 177

Query: 98  ------------LKEAFNIRLEELQVLDIK---FLYGCAKPTIVVLYQDNKD-----ARH 137
                       ++    I L +L + ++K   FL G  +PTI+ L+++ +      A  
Sbjct: 178 NVGDQKKHHQPEVQRQVIIDLNDLGIKNVKDYCFLNGYNEPTILFLHENEQTWSGRLAAK 237

Query: 138 VKTYEVALKDKD----FVEGPWSQNNLDNGADLLIPVPPPLCG-VLIIGEETIVYCSANA 192
             T  V     D    +    WS  +L +  + LIP+   + G  L+IG  +I++ +  A
Sbjct: 238 SNTSTVTAVSFDLFRKYYPKIWSVGSLPHDCNKLIPLQEDVAGGALVIGMNSIIHINQCA 297

Query: 193 FKAIPIR------PSITKAYGRVDADG-------------SRYLLGDHAGLLHLLVITHE 233
              +         P+++  +   D                 + L+    G L+ + +   
Sbjct: 298 TYGLSFNDFAVSNPNLSINFNTFDGPALFFDTVAYTFIARDKLLVSLKDGELYTMYLESG 357

Query: 234 KEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKL----------------- 276
             ++  + I+    T+ AS +  L   ++++GS  GDS L +                  
Sbjct: 358 GSRINNINIKKTSNTTPASCMCTLKGNLIFLGSKIGDSVLYEYQEKVEVETSSLDTDEEM 417

Query: 277 --------NLQPDAKG-------SYVEVLER----------------------------Y 293
                   N +P+ K         +   LE+                            +
Sbjct: 418 SSVFAAGENFEPEKKKRKLADDDDFFAALEKDEEPTVIESFSKVSKKETTKVELKIKHVF 477

Query: 294 VNLGPIVDFCVV-----DLE------RQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASV 342
            N+GPI           D+          Q   + CSG  + G L ++   +  + Q+  
Sbjct: 478 TNIGPISHLTAAVTSSFDMSGFKSKTNDNQLSAIACSGIGRHGCLTVLNRSLQPDIQSEA 537

Query: 343 ELQG-IKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFC 401
            L   +K +W++   T+   D +L++S   +T++     +  L E   +      +T   
Sbjct: 538 TLPFLVKQVWTISQKTEH--DLYLILSLEDKTKVFES--KATLAEVTSKSMFVTNETTLN 593

Query: 402 HDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYL 461
              I   +VQVT  SV L+ S  +++ +  K      +  +      VLL    G LV L
Sbjct: 594 IGKIRESIVQVTRKSVMLIGSEPKQVHHSKKE-----IRSSIILDPYVLLHFYDGSLVLL 648

Query: 462 EIGDGILT------EVKHAQLEYEISCLDINP----IGENPSYSQIAAVGMWTDISVRIF 511
              +G +T      E  H ++   +     NP     G N    +      WTD +  I 
Sbjct: 649 THDNGRVTSKQLDIESNHGKIT-AVCLYKTNPEFEFFGINEKEGKYLCCVYWTDGAFEIL 707

Query: 512 SLPDLNLI 519
           S+PD+  +
Sbjct: 708 SVPDMTCV 715


>gi|238508528|ref|XP_002385456.1| cleavage and polyadenylation specificity factor subunit A, putative
            [Aspergillus flavus NRRL3357]
 gi|220688975|gb|EED45327.1| cleavage and polyadenylation specificity factor subunit A, putative
            [Aspergillus flavus NRRL3357]
          Length = 1204

 Score = 77.4 bits (189), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 93/435 (21%), Positives = 176/435 (40%), Gaps = 64/435 (14%)

Query: 654  IRSIPLGEHPRRICHQEQSRTFAICSLKNQ--SCAEESEMH-----------------FV 694
            I+ IP+GE    + +   S  + I +         E+ E+H                  +
Sbjct: 772  IQQIPIGEQVDHLAYSSSSGMYVIGTSHRTEFKLPEDDELHPEWRNEMTSFFPEVQRSSL 831

Query: 695  RLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVY-----YCVGTAYVLPEENEPTKG 749
            +++  +T+  I +Y L   E+  ++ + S     N +       VGTA+   E+   ++G
Sbjct: 832  KVVSPKTWTVIDSYLLSPAEHVMAVKNMSLEISENTHERKDMIVVGTAFARGEDIA-SRG 890

Query: 750  RILVFIV-----------EDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWML 798
             + VF V            D KL+L+ ++  KGAV +L+   G+    + Q  +     L
Sbjct: 891  CVYVFEVIKVVPDPKRPEMDRKLRLVGKEPVKGAVTALSEIGGQGFLIVAQGQKCIVRGL 950

Query: 799  RDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNA 858
            ++DG+    +      H+  +         ++ D +K +    Y  E   +   A+D + 
Sbjct: 951  KEDGSLLPVAFMDVQCHVSVVKELKGTGMCIIADAVKGLWFAGYSEEPYKMSLFAKDLDY 1010

Query: 859  NWMSAVEILDDD---IYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFR 915
              + A + L D      L A+++ NL  ++ + E        RL    ++H G F+    
Sbjct: 1011 LEVLAADFLPDGNKLFILVADSDCNLHVLQYDPEDPKSSNGDRLLSRSKFHTGNFI---- 1066

Query: 916  HGSLVMRLPDSDVG---------------QIP--TVIFGTVNGVIGVIASLPHEQYLFLE 958
              S +  LP + V                +IP   ++  + NG +G++  +  E Y  L 
Sbjct: 1067 --STLTLLPRTSVSSEQMISDVDAMDVDIKIPRHQMLITSQNGSVGLVTCVSEESYRRLS 1124

Query: 959  KLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTM 1018
             LQ+ L   I+   GLN   +R+   E      +  LDG L+  +LD+S+ R  EI+  +
Sbjct: 1125 ALQSQLTNTIEHPCGLNPRAFRAV--ESDGTAGRGMLDGKLLFQWLDMSKQRKVEIASRV 1182

Query: 1019 NVSVEELCKRVEELT 1033
              +  E+    E ++
Sbjct: 1183 GANEWEIKADFEAIS 1197


>gi|317157892|ref|XP_001826637.2| protein cft1 [Aspergillus oryzae RIB40]
 gi|391864317|gb|EIT73613.1| mRNA cleavage and polyadenylation factor II complex, subunit CFT1
            [Aspergillus oryzae 3.042]
          Length = 1389

 Score = 77.0 bits (188), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 93/435 (21%), Positives = 176/435 (40%), Gaps = 64/435 (14%)

Query: 654  IRSIPLGEHPRRICHQEQSRTFAICSLKNQ--SCAEESEMH-----------------FV 694
            I+ IP+GE    + +   S  + I +         E+ E+H                  +
Sbjct: 957  IQQIPIGEQVDHLAYSSSSGMYVIGTSHRTEFKLPEDDELHPEWRNEMTSFFPEVQRSSL 1016

Query: 695  RLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVY-----YCVGTAYVLPEENEPTKG 749
            +++  +T+  I +Y L   E+  ++ + S     N +       VGTA+   E+   ++G
Sbjct: 1017 KVVSPKTWTVIDSYLLSPAEHVMAVKNMSLEISENTHERKDMIVVGTAFARGEDIA-SRG 1075

Query: 750  RILVFIV-----------EDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWML 798
             + VF V            D KL+L+ ++  KGAV +L+   G+    + Q  +     L
Sbjct: 1076 CVYVFEVIKVVPDPKRPEMDRKLRLVGKEPVKGAVTALSEIGGQGFLIVAQGQKCIVRGL 1135

Query: 799  RDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNA 858
            ++DG+    +      H+  +         ++ D +K +    Y  E   +   A+D + 
Sbjct: 1136 KEDGSLLPVAFMDVQCHVSVVKELKGTGMCIIADAVKGLWFAGYSEEPYKMSLFAKDLDY 1195

Query: 859  NWMSAVEILDDD---IYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFR 915
              + A + L D      L A+++ NL  ++ + E        RL    ++H G F+    
Sbjct: 1196 LEVLAADFLPDGNKLFILVADSDCNLHVLQYDPEDPKSSNGDRLLSRSKFHTGNFI---- 1251

Query: 916  HGSLVMRLPDSDVG---------------QIP--TVIFGTVNGVIGVIASLPHEQYLFLE 958
              S +  LP + V                +IP   ++  + NG +G++  +  E Y  L 
Sbjct: 1252 --STLTLLPRTSVSSEQMISDVDAMDVDIKIPRHQMLITSQNGSVGLVTCVSEESYRRLS 1309

Query: 959  KLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTM 1018
             LQ+ L   I+   GLN   +R+   E      +  LDG L+  +LD+S+ R  EI+  +
Sbjct: 1310 ALQSQLTNTIEHPCGLNPRAFRAV--ESDGTAGRGMLDGKLLFQWLDMSKQRKVEIASRV 1367

Query: 1019 NVSVEELCKRVEELT 1033
              +  E+    E ++
Sbjct: 1368 GANEWEIKADFEAIS 1382


>gi|195583398|ref|XP_002081509.1| GD25678 [Drosophila simulans]
 gi|194193518|gb|EDX07094.1| GD25678 [Drosophila simulans]
          Length = 1450

 Score = 77.0 bits (188), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 99/403 (24%), Positives = 183/403 (45%), Gaps = 62/403 (15%)

Query: 654  IRSIPLGEHPRRICHQEQSRTFAICSL-------------KNQSCAEESE-MHFVR---- 695
            +R +PL   PR++ +  ++R + + +              +++  +EES    F+     
Sbjct: 1026 VRKVPLRCTPRQLVYHRENRVYCLITQTEEPMTKYYRFNGEDKELSEESRGERFIYPIGS 1085

Query: 696  -----LLDDQTFEFI--STYPLDTFEY--GCSILSCSFSDDSN---VYYCVGTAYVLPEE 743
                 L+  +T+E +  ++   + +E+     I+  S+    +    Y C+GT +    E
Sbjct: 1086 QFEMVLISPETWEIVPDASITFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNY-SE 1144

Query: 744  NEPTKGRILVF-----IVEDGK------LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQ 792
            +  ++G I ++     + E GK      ++ I +KE KG V +++   G L+  + QKI 
Sbjct: 1145 DITSRGNIHIYDIIEVVPEPGKPMTKFKIKEIFKKEQKGPVSAISDVLGFLVTGLGQKI- 1203

Query: 793  LYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEER 852
             Y W LRD    +L        +I    + T    I + D+ KSISLL ++ E   +   
Sbjct: 1204 -YIWQLRDG---DLIGVAFIDTNIYVHQIITVKSLIFIADVYKSISLLRFQEEYRTLSLA 1259

Query: 853  ARDYNANWMSAVEILDDDIYLG-----AENNFNLFTVRKNSEGATDEERGRLEVVGEYHL 907
            +RD+N   +  +E + D+  LG     AE N  ++  +  +  +   ++  L    +YHL
Sbjct: 1260 SRDFNPLEVYGIEFMVDNSNLGFLVTDAERNLIVYMYQPEARESLGGQK--LLRKADYHL 1317

Query: 908  GEFVNR-FR----HGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQT 962
            G+ VN  FR       L  R P     +   V++GT++G +G    LP + Y     LQ 
Sbjct: 1318 GQVVNTMFRVQCHQKGLHQRQPFLYENK-HFVVYGTLDGALGYCLPLPEKVYRRFLMLQN 1376

Query: 963  NLRKVIKGVGGLNHEQWRSFNNEKKT--VDAKNFLDGDLIESF 1003
             L    + + GLN +++R+  + KK     ++  +DGDLI S+
Sbjct: 1377 VLLSYQEHLCGLNPKEYRTLKSSKKQGINPSRCIIDGDLIWSY 1419



 Score = 56.6 bits (135), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 72/332 (21%), Positives = 136/332 (40%), Gaps = 65/332 (19%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
           +YG + +L+     G  +D L I+ +  K  VLQ D ++  L T ++    +   R    
Sbjct: 76  LYGNVMSLQCVSLAGAMRDALLISFKDAKLSVLQHDPDTFALKTLSLHYFEEDDIRGGWT 135

Query: 66  GQIGI----IDPDCRLIGLHLYDGLFKVIPFDNKGQLKE--------------------- 100
           G+  +    +DPD R   + +Y     V+PF     L E                     
Sbjct: 136 GRYFVPTVRVDPDSRCAVMLVYGKRLVVLPFRKDNSLDEIELADVKPIKKAPTAMVSRTP 195

Query: 101 ---AFNIRLEEL-----QVLDIKFLYGCAKPTIVVLYQDNKDAR-HVKTYE-------VA 144
              ++ I L +L      VLDI+FL+G  +PT+++LY+  +     +K          ++
Sbjct: 196 IMASYLIALRDLDEKIDNVLDIQFLHGYYEPTLLILYEPVRTCPGRIKVRSDTCVLVAIS 255

Query: 145 LKDKDFVEG-PWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVY-------------CSA 190
           L  +  V    W+ N+L      + P+  P+ G L++    ++Y              SA
Sbjct: 256 LNIQQRVHPIIWTVNSLPFDCLQVYPIQKPIGGCLVMTVNAVIYLNQSVPPYGVSLNSSA 315

Query: 191 NAFKAIPIRP------SITKA-YGRVDADGSRYLLGDHAGLLHLLVITHEKEK-VTGLKI 242
           +   A P++P      S+  A +  +D D  + ++    G L++L +  +  + V     
Sbjct: 316 DNSTAFPLKPQDGVRISLDCANFAFIDVD--KLVISLRTGDLYVLTLCVDSMRTVRNFHF 373

Query: 243 ELLGETSIASTISYLDNAVVYIGSSYGDSQLI 274
                + + S I  L +  +++GS  G+S L+
Sbjct: 374 HKAAASVLTSCICVLHSEYIFLGSRLGNSLLL 405



 Score = 42.7 bits (99), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 43/159 (27%), Positives = 72/159 (45%), Gaps = 26/159 (16%)

Query: 288 EVLERYVNLGPIVDFCV---VDLERQG-------------QGQVVTCSGAYKDGSLRIVR 331
           EV +  +N+ PI   C    V+ E  G             + ++V  +G  K+G+L +  
Sbjct: 485 EVCDSLMNVAPINYMCAGERVEFEEDGVTLRPHAESLQDLKIELVAATGHSKNGALSVFV 544

Query: 332 NGIGINEQASVELQGIKGMWSL------RSSTDDPFDTFLVVSFISETRILAMNLEDELE 385
           N +      S EL G   +W++      +SS +D  D F+++S  + T  L +    E+ 
Sbjct: 545 NCLNPQIITSFELDGCLDVWTVFDDATKKSSRNDQHD-FMLLSQRNST--LVLQTGQEIN 601

Query: 386 ETEIEGFCSQTQTLFCHDAIYNQ-LVQVTSGSVRLVSST 423
           E E  GF     T+F  +    + +VQVT+  VRL+  T
Sbjct: 602 EIENTGFTVNQPTIFVGNLGQQRFIVQVTTRHVRLLQGT 640


>gi|239611898|gb|EEQ88885.1| protein CFT1 [Ajellomyces dermatitidis ER-3]
 gi|327352847|gb|EGE81704.1| CFT1 [Ajellomyces dermatitidis ATCC 18188]
          Length = 1402

 Score = 77.0 bits (188), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 106/435 (24%), Positives = 183/435 (42%), Gaps = 63/435 (14%)

Query: 655  RSIPLGEHPRRICHQEQSRTFAICSLK--NQSCAEESEMH-----------------FVR 695
            R I LGE    + +   S T+ I + +  + +  E+ E+H                  V+
Sbjct: 970  RKIGLGEQVDIVEYSSSSETYVIGTSQKVDFNLPEDDEIHPEWRNEVISFLPQIDQGSVK 1029

Query: 696  LLDDQTFEFISTYPLDTFE----YGCSILSCS-FSDDSNVYYCVGTAYVLPEENEPTKGR 750
            LL  +T+  I ++ L T E      C  L  S  + +      VGTA V   E+   +G 
Sbjct: 1030 LLSPRTWSIIDSHTLRTAERIMCVKCLDLEVSEITHERRDMIAVGTA-VTRGEDIAARGC 1088

Query: 751  ILVFIV-----------EDGKLQLIAEKETKGAVYSLNAFNGK--LLAAINQKIQLYKWM 797
            I +F V            + KL+LIA++E KGAV SL+   G+  L+AA  QK  +    
Sbjct: 1089 IYIFEVIEVVPEVDRPETNRKLKLIAKEEVKGAVTSLSGIGGQGFLIAAQGQKCIVRG-- 1146

Query: 798  LRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYN 857
            L++DG+    +      ++  L         ++GD +K I    Y  E   +   ++D  
Sbjct: 1147 LKEDGSLLPVAFMDMQCYVNVLKELKGTGMCIMGDALKGIWFAGYSEEPYKLSLFSKDDG 1206

Query: 858  ANWMSAVEILDDD--IY-LGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRF 914
               + A + L D   +Y L A+++ N+  ++ + E     +  RL     +H G F +  
Sbjct: 1207 TLQVMAADFLPDGKRLYILVADDDCNIHVLQYDPEDPGSSKGDRLLHRSTFHTGHFASTM 1266

Query: 915  RHGSLVMRL-------PDSD--------VGQIPTVIFGTVNGVIGVIASLPHEQYLFLEK 959
               +L+ R        PD++         G +  V+  +  G I +I  L    Y  L  
Sbjct: 1267 ---TLLPRTIIPSAQGPDANPDMMELDSSGPLYHVLVTSETGSIALITPLSETAYRRLSA 1323

Query: 960  LQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN 1019
            LQ+ L   ++   GLN   +R+   E   +  +  +DGDL+  +LDL   R  EI+  + 
Sbjct: 1324 LQSQLINTLEHPCGLNPRAFRAI--ESDGIGGRGMVDGDLLHRWLDLGTQRKAEIAHRVG 1381

Query: 1020 VSVEELCKRVEELTR 1034
              + E+   +E + +
Sbjct: 1382 ADIWEIRADLEAIGK 1396


>gi|261201748|ref|XP_002628088.1| protein CFT1 [Ajellomyces dermatitidis SLH14081]
 gi|239590185|gb|EEQ72766.1| protein CFT1 [Ajellomyces dermatitidis SLH14081]
          Length = 1403

 Score = 77.0 bits (188), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 106/435 (24%), Positives = 183/435 (42%), Gaps = 63/435 (14%)

Query: 655  RSIPLGEHPRRICHQEQSRTFAICSLK--NQSCAEESEMH-----------------FVR 695
            R I LGE    + +   S T+ I + +  + +  E+ E+H                  V+
Sbjct: 971  RKIGLGEQVDIVEYSSSSETYVIGTSQKVDFNLPEDDEIHPEWRNEVISFLPQIDQGSVK 1030

Query: 696  LLDDQTFEFISTYPLDTFE----YGCSILSCS-FSDDSNVYYCVGTAYVLPEENEPTKGR 750
            LL  +T+  I ++ L T E      C  L  S  + +      VGTA V   E+   +G 
Sbjct: 1031 LLSPRTWSIIDSHTLRTAERIMCVKCLDLEVSEITHERRDMIAVGTA-VTRGEDIAARGC 1089

Query: 751  ILVFIV-----------EDGKLQLIAEKETKGAVYSLNAFNGK--LLAAINQKIQLYKWM 797
            I +F V            + KL+LIA++E KGAV SL+   G+  L+AA  QK  +    
Sbjct: 1090 IYIFEVIEVVPEVDRPETNRKLKLIAKEEVKGAVTSLSGIGGQGFLIAAQGQKCIVRG-- 1147

Query: 798  LRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYN 857
            L++DG+    +      ++  L         ++GD +K I    Y  E   +   ++D  
Sbjct: 1148 LKEDGSLLPVAFMDMQCYVNVLKELKGTGMCIMGDALKGIWFAGYSEEPYKLSLFSKDDG 1207

Query: 858  ANWMSAVEILDDD--IY-LGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRF 914
               + A + L D   +Y L A+++ N+  ++ + E     +  RL     +H G F +  
Sbjct: 1208 TLQVMAADFLPDGKRLYILVADDDCNIHVLQYDPEDPGSSKGDRLLHRSTFHTGHFASTM 1267

Query: 915  RHGSLVMRL-------PDSD--------VGQIPTVIFGTVNGVIGVIASLPHEQYLFLEK 959
               +L+ R        PD++         G +  V+  +  G I +I  L    Y  L  
Sbjct: 1268 ---TLLPRTIIPSAQGPDANPDMMELDSSGPLYHVLVTSETGSIALITPLSETAYRRLSA 1324

Query: 960  LQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN 1019
            LQ+ L   ++   GLN   +R+   E   +  +  +DGDL+  +LDL   R  EI+  + 
Sbjct: 1325 LQSQLINTLEHPCGLNPRAFRAI--ESDGIGGRGMVDGDLLHRWLDLGTQRKAEIAHRVG 1382

Query: 1020 VSVEELCKRVEELTR 1034
              + E+   +E + +
Sbjct: 1383 ADIWEIRADLEAIGK 1397


>gi|225558298|gb|EEH06582.1| conserved hypothetical protein [Ajellomyces capsulatus G186AR]
          Length = 1408

 Score = 77.0 bits (188), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 105/434 (24%), Positives = 181/434 (41%), Gaps = 61/434 (14%)

Query: 655  RSIPLGEHPRRICHQEQSRTFAICSLKNQ----SCAEESEMH-----------------F 693
            R I LGE    + +   S T+ I +  NQ    +  E+ E+H                  
Sbjct: 976  RKIGLGEQVDAVEYSSSSETYVIGT--NQKVDFNLPEDDEIHPEWRNEVISFLPQIDKGS 1033

Query: 694  VRLLDDQTFEFISTYPLDTFE----YGCSILSCS-FSDDSNVYYCVGTAYVLPEENEPTK 748
            V+LL  +T+  I +Y L   E      C  L  S  + +      VGTA +   E+   +
Sbjct: 1034 VKLLTPRTWSIIDSYNLRNAERIMCVKCLNLEVSEITHERKDTIVVGTA-LTKGEDIAAR 1092

Query: 749  GRILVFIV-----------EDGKLQLIAEKETKGAVYSLNAFNGK--LLAAINQKIQLYK 795
            G I +F V            + KL+LIA++E KGAV SL+   G+  L+AA  QK  +  
Sbjct: 1093 GCIYIFEVIEVVPEVDRPETNRKLKLIAKEEVKGAVTSLSGIGGQGFLIAAQGQKCIVRG 1152

Query: 796  WMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARD 855
              L++DG+    +      ++  L         ++GD +K +    Y  E   +   ++D
Sbjct: 1153 --LKEDGSLLPVAFMDMQCYVNVLKELKGTGMCIMGDALKGLWFAGYSEEPYKLSLFSKD 1210

Query: 856  YNANWMSAVEILDDD--IY-LGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVN 912
                 + A + L D   +Y L A+++ N+  ++ + E     +  RL     +  G F +
Sbjct: 1211 DGTLQVMAADFLPDGNRLYILVADDDCNIHVLQYDPEDPGSSKGDRLLHRSTFQTGHFAS 1270

Query: 913  RF----RHGSLVMRLPDSD--------VGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKL 960
                  R  +   + PD+D         G +  V+  +  G I +I  +    Y  L  L
Sbjct: 1271 TMTLLPRTATSSSQGPDADPDMMDLDSSGPLHHVLVTSETGSIALITPVSETSYRRLSAL 1330

Query: 961  QTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV 1020
            Q+ L   ++   GLN   +R+   E   +  +  +DGDL++ +LDL   R  EI+  +  
Sbjct: 1331 QSQLTNTLEHPCGLNPRAFRAV--ESDGIGGRGMVDGDLVKRWLDLGTQRKAEIANRVGA 1388

Query: 1021 SVEELCKRVEELTR 1034
             V E+   +E + +
Sbjct: 1389 DVWEIRADLEAIGK 1402


>gi|240277254|gb|EER40763.1| cleavage factor two protein 1 [Ajellomyces capsulatus H143]
          Length = 1408

 Score = 76.6 bits (187), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 105/434 (24%), Positives = 181/434 (41%), Gaps = 61/434 (14%)

Query: 655  RSIPLGEHPRRICHQEQSRTFAICSLKNQ----SCAEESEMH-----------------F 693
            R I LGE    + +   S T+ I +  NQ    +  E+ E+H                  
Sbjct: 976  RKIGLGEQVDAVEYSSSSETYVIGT--NQKVDFNLPEDDEIHPEWRNEVISFLPQIDKGS 1033

Query: 694  VRLLDDQTFEFISTYPLDTFE----YGCSILSCS-FSDDSNVYYCVGTAYVLPEENEPTK 748
            V+LL  +T+  I +Y L   E      C  L  S  + +      VGTA +   E+   +
Sbjct: 1034 VKLLTPRTWSIIDSYNLRNAERIMCVKCLNLEVSEITHERKDTIVVGTA-LTKGEDIAAR 1092

Query: 749  GRILVFIV-----------EDGKLQLIAEKETKGAVYSLNAFNGK--LLAAINQKIQLYK 795
            G I +F V            + KL+LIA++E KGAV SL+   G+  L+AA  QK  +  
Sbjct: 1093 GCIYIFEVIKVVPEVDRPETNRKLKLIAKEEVKGAVTSLSGIGGQGFLIAAQGQKCIVRG 1152

Query: 796  WMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARD 855
              L++DG+    +      ++  L         ++GD +K +    Y  E   +   ++D
Sbjct: 1153 --LKEDGSLLPVAFMDMQCYVNVLKELKGTGMCIMGDALKGLWFAGYSEEPYKLSLFSKD 1210

Query: 856  YNANWMSAVEILDDD--IY-LGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVN 912
                 + A + L D   +Y L A+++ N+  ++ + E     +  RL     +  G F +
Sbjct: 1211 DGTLQVMAADFLPDGNRLYILVADDDCNIHVLQYDPEDPGSSKGDRLLHRSTFQTGHFAS 1270

Query: 913  RF----RHGSLVMRLPDSD--------VGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKL 960
                  R  +   + PD+D         G +  V+  +  G I +I  +    Y  L  L
Sbjct: 1271 TMTLLPRTATSSSQGPDADPDMMDLDSSGPLHHVLVTSETGSIALITPVSETSYRRLSAL 1330

Query: 961  QTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV 1020
            Q+ L   ++   GLN   +R+   E   +  +  +DGDL++ +LDL   R  EI+  +  
Sbjct: 1331 QSQLANTLEHPCGLNPRAFRAV--ESDGIGGRGMVDGDLVKRWLDLGTQRKAEIANRVGA 1388

Query: 1021 SVEELCKRVEELTR 1034
             V E+   +E + +
Sbjct: 1389 DVWEIRADLEAIGK 1402


>gi|212541400|ref|XP_002150855.1| cleavage and polyadenylation specificity factor subunit A, putative
            [Talaromyces marneffei ATCC 18224]
 gi|210068154|gb|EEA22246.1| cleavage and polyadenylation specificity factor subunit A, putative
            [Talaromyces marneffei ATCC 18224]
          Length = 1383

 Score = 76.6 bits (187), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 95/429 (22%), Positives = 176/429 (41%), Gaps = 52/429 (12%)

Query: 654  IRSIPLGEHPRRICHQEQSRTFAICSLKNQ--SCAEESEMH-----------------FV 694
            IR IPL E    + +   S T+ + +   Q     ++ E+H                  +
Sbjct: 951  IRRIPLNEQVDHLAYSTASGTYVVGTTHEQDFKLPDDDELHPEWATEEISLLPKVAYGSI 1010

Query: 695  RLLDDQTFEFISTYPLDTFEYGCSI--LSCSFSDDSNV---YYCVGTAYVLPEENEPTKG 749
            +L++ +T++ I +Y     E   ++  ++   S+ +        VGT Y    E+   +G
Sbjct: 1011 KLINPKTWKVIDSYTFSPAERITAVENINLEISEKTGKRKDMIVVGTTYA-KGEDIAARG 1069

Query: 750  RILVFIVEDG-----------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWML 798
             + VF V D            KL+LI E+  +GAV +++   G+    + Q  +     L
Sbjct: 1070 NVYVFDVIDVVPDPDEPGTNLKLKLIGEESIRGAVTAVSGIGGQGFMIVAQGQKCMVRGL 1129

Query: 799  RDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNA 858
            +DDG+    +      ++  +         ++GD  K +    Y  E   +    +D + 
Sbjct: 1130 KDDGSLLPVAFIDVQCYVSVIKELKGTGMCLIGDAFKGLWFTGYSEEPYKMTLFGKDLDE 1189

Query: 859  NWMSAVEILDD--DIY-LGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFR 915
              +   + L D   +Y L A+ + NL+ ++ + E        RL    ++H+G F +   
Sbjct: 1190 LEVVTADFLPDGKKLYILVADGDCNLYVLQYDPEDPKSSNGDRLLNRCKFHMGHFASTLT 1249

Query: 916  -------HGSLVMRLPDS-DVGQ---IPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNL 964
                      L +   DS D+     +   +  T +G + +I SL  E Y  L  LQ+ L
Sbjct: 1250 LLPRTAVSSELAVMSSDSMDIDSYTPLYQALITTQSGSMALITSLSEESYRRLTALQSQL 1309

Query: 965  RKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEE 1024
               ++   GLN   +RS   E   V  +  +DG L+  +LDLSR+R  EI+  +     E
Sbjct: 1310 SNTLEHPCGLNPRAYRSV--ESDGVVGRGMIDGKLLMRWLDLSRSRKLEIAGRVGADEWE 1367

Query: 1025 LCKRVEELT 1033
            +   +E ++
Sbjct: 1368 IRADLEAVS 1376


>gi|440793679|gb|ELR14857.1| CPSF A subunit region protein [Acanthamoeba castellanii str. Neff]
          Length = 1477

 Score = 76.3 bits (186), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 94/424 (22%), Positives = 174/424 (41%), Gaps = 66/424 (15%)

Query: 654  IRSIPLGEHPRRICHQEQSRTFAIC----------------------------------- 678
            +R +PLG  PR I +   SRT+ +                                    
Sbjct: 1053 VRKVPLGRTPREIAYHPPSRTYVVALATPVTTVVPTPPETDMERQEREREEEESREMGIE 1112

Query: 679  ------SLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDD-SNVY 731
                   +  +  A   E H + L+  +T++ +    L+  E+  ++      D+ S V 
Sbjct: 1113 PEEKQRDMGPREIAMMEERHELHLISPRTWQILHHVELEPKEHVLTLSVLKLGDNYSQVN 1172

Query: 732  YCVGTAYVLPEENEPTKGRILVFIVEDGKLQL-----IAEKETKGAVYSLNAFNGKLLAA 786
              +   ++L  E       I V   E  KL +     + EK  KG V +  +  G L+ A
Sbjct: 1173 RELRPPHLLIYE-------IDVTGEEQCKLTMAYQKPMKEKPMKGPVSAAASLQGYLIIA 1225

Query: 787  INQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEE 846
            +  KI ++ +   D G+ E  +      +I++  ++T  +F++ GD+ KSI  L +K   
Sbjct: 1226 VGPKIWVFNF---DGGSTEAVAFYDAPHYIVS--IKTLKNFVLCGDIYKSIFFLRWKDSA 1280

Query: 847  GAIEERARDYNANWMSAVEILDD--DIYLGAENNFNLFTVRKNSEGATDEERGRLEVV-G 903
              +   A+D     + A E + D  ++ L   +      V   +    +   G+L V  G
Sbjct: 1281 SQLALLAKDVGRVSVFATEYVVDKQNLALLMSDERQNLQVTAYAPHTAESRGGQLLVPRG 1340

Query: 904  EYHLGEFVNRFRHGSLVMRLPDSDVG-QIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQT 962
            ++++G+ +N+F    L M LP      Q   + FGT++G +G +A +    +  L  LQ+
Sbjct: 1341 DFNVGQSINKFVR--LPMTLPSGTTSLQRHALWFGTLSGGVGYLAPMDESVFRRLGMLQS 1398

Query: 963  NLRKVIKGVGGLNHEQWRSFNNEKKTVDAKN-FLDGDLIESFLDLSRTRMDEISKTMNVS 1021
             L   I    GL+ + +R+   E+   + K+  LDG L+  +L L      +I+  +  S
Sbjct: 1399 ALLSAIPHTAGLHPQAYRALQRERLLRNRKHTILDGLLLSRYLALDSATQQQIALKLGTS 1458

Query: 1022 VEEL 1025
             E +
Sbjct: 1459 RERI 1462



 Score = 56.6 bits (135), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 38/121 (31%), Positives = 65/121 (53%), Gaps = 18/121 (14%)

Query: 288 EVLERYVNLGPIVDFCVVD---------LERQGQG--QVVTCSGAYKDGSLRIVRNGIGI 336
           +V +  VN+GPI DF + +          E++GQ   ++VTCSG  K+GSL ++++G+  
Sbjct: 530 KVCDSLVNVGPITDFAIGESFDPASVSMAEQEGQRSVEIVTCSGQGKNGSLCVLQHGVRP 589

Query: 337 N-EQASVELQGIKGMWSL------RSSTDDPFDTFLVVSFISETRILAMNLEDELEETEI 389
               AS +L G K  W+L      R   +  +  +L++S   +TR++A +  DEL   E 
Sbjct: 590 ELVHASADLAGCKAFWTLYHRSEERQGEEAEYHAYLLLSEEEQTRVIAGDGLDELSNEET 649

Query: 390 E 390
           +
Sbjct: 650 D 650



 Score = 52.0 bits (123), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 53/239 (22%), Positives = 101/239 (42%), Gaps = 42/239 (17%)

Query: 1   MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
           +L   ++G I ++   R  G ++D L ++    K  VL++D  +++L T ++    D   
Sbjct: 90  VLSSSLFGNIESMAAVRFPGTSKDALLLSFRDAKISVLEFDIATNDLRTISLHYFEDYKV 149

Query: 61  RPTDNGQIGI----IDPDCRLIGLHLYDGLFKVIPF---------DNKGQ----LKEAFN 103
           +   +  I +    +DP  R   +  +D    V+PF         +N GQ    +K +F 
Sbjct: 150 KEGHDHYIHVPELRVDPQQRCAAMLAFDRKLVVLPFRQHASLMEIENGGQEDQPVKPSFL 209

Query: 104 IRLEELQVLDIK---FLYGCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGP------ 154
           + L  + ++++K   FL G  +PT+++LY+  +      +  VA+     V         
Sbjct: 210 LDLRAMGIINVKDFVFLQGYYEPTLLILYEPTQ----TWSGRVAVNRNTCVAAAVSLNLW 265

Query: 155 ------------WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIRPS 201
                       WS   L      LI VP P+ G L +   +++Y +  +F    I P+
Sbjct: 266 QHRGQTSAHPVVWSAEFLPYDTQRLIAVPGPIGGALALSTNSLLYLNQVSFPYRLILPA 324


>gi|218194461|gb|EEC76888.1| hypothetical protein OsI_15095 [Oryza sativa Indica Group]
          Length = 1503

 Score = 76.3 bits (186), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 92/409 (22%), Positives = 175/409 (42%), Gaps = 112/409 (27%)

Query: 97  QLKEAFNIRLEEL---QVLDIKFLYGCAKPTIVVLYQDNKD------ARH----VKTYEV 143
           +++ ++ I L  L    V D  F++G  +P +V+L++          ++H    +  + +
Sbjct: 234 RIESSYLIDLRALDMNHVKDFAFVHGYIEPVLVILHEQEPTWAGRILSKHHTCMISAFSI 293

Query: 144 ALKDKDFVEGP--WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVY------CSA--NAF 193
           ++  K   + P  WS  NL + A  L+ VPPP+ GVL+I   +I Y      CS   N F
Sbjct: 294 SMTLK---QHPVIWSAANLPHDAYQLLAVPPPISGVLVICANSIHYHSQSTSCSLDLNNF 350

Query: 194 KAIPI-RPSITKAYGRVDADGSR--------YLLGDHAGLLHLLVITHEKEKVTGLKIEL 244
            + P   P I+K+  +V+ D ++         +    AG + LL + ++   V  L +  
Sbjct: 351 SSHPDGSPEISKSNFQVELDAAKATWFSNDIVMFSSKAGEMLLLTVVYDGRVVQRLDLMK 410

Query: 245 LGETSIASTISYLDNAVVYIGS----------SYGDSQLI-------------------- 274
              + ++S ++ + N+  ++GS          SYG S+ +                    
Sbjct: 411 SKASVLSSAVTSIGNSFFFLGSRLGDSLLVQFSYGASKSVLQDLTNERSADIEGDLPFSK 470

Query: 275 -----------------KLNLQ----PDA-----KGSYVEVLERYVNLGPIVDFC----- 303
                            +L+ Q    P++     K SY+ V +  +N+GP+ DF      
Sbjct: 471 RLKRIPSDVLQDVTSVEELSFQNIIAPNSLESAQKISYI-VRDALINVGPLKDFSYGLRA 529

Query: 304 -----VVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL----- 353
                 +   +Q   ++V CSG  K+GSL +++  I  +    VEL   +G+W++     
Sbjct: 530 NADPNAMGNAKQSNYELVCCSGHGKNGSLSVLQQSIRPDLITEVELPSCRGIWTVYYKSY 589

Query: 354 --RSSTDDPFDTFLVVSFISETRILAMNLEDELEE-TEIEGFCSQTQTL 399
             + + D+ +  +L++S   E R + +   D+L E TE   +  Q  T+
Sbjct: 590 RGQMAEDNEYHAYLIISL--ENRTMVLETGDDLGEVTETVDYFVQASTI 636



 Score = 66.2 bits (160), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 90/411 (21%), Positives = 168/411 (40%), Gaps = 87/411 (21%)

Query: 702  FEFISTYPLDTFEYGCSI----LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVF--I 755
            +E  ST P+  FE   ++    L  + + ++     +GTAYVL  E+   +GR+L+F  +
Sbjct: 1095 WETKSTIPMQLFENALTVRIVTLHNTTTKENETLLAIGTAYVL-GEDVAARGRVLLFSFM 1153

Query: 756  VEDGKLQLIAE---KETKGAVYSLNAFNGKLLAA-------------------------- 786
              +    L+ E   KE+KGAV ++ +  G LL A                          
Sbjct: 1154 KSENSQNLVTEVYSKESKGAVSAVASLQGHLLIASGPKITLNKWTGAELTAVAFYDAPLH 1213

Query: 787  ---------------INQKIQLYKW---------MLRDDGTRE-LQSECGHHGHILALY- 820
                           I++ I    W         + +D G+ +   +E    G  L+L  
Sbjct: 1214 VVSLNIVKNFVLFGDIHKSIYFLSWKEQGSQLSLLAKDFGSLDCFATEFLIDGSTLSLVA 1273

Query: 821  ------VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG 874
                  VQ + +F++ GD+ KSI  L +K +   +   A+D+ +    A E L D   L 
Sbjct: 1274 SDSDKNVQVK-NFVLFGDIHKSIYFLSWKEQGSQLSLLAKDFGSLDCFATEFLIDGSTLS 1332

Query: 875  -----AENNFNLFTVRKNSEGATDEERG-RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV 928
                 ++ N  +F     +    +  +G +L    E+H+G  + +F     +  LP   +
Sbjct: 1333 LVASDSDKNVQIFYY---APKMVESWKGQKLLSRAEFHVGAHITKFLR---LQMLPTQGL 1386

Query: 929  GQIPT----VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNN 984
                T    ++FG ++G IG IA +    +  L+ LQ  L   +  V GLN   +R F++
Sbjct: 1387 SSEKTNRFALLFGNLDGGIGCIAPIDELTFRRLQSLQRKLVDAVPHVCGLNPRSFRQFHS 1446

Query: 985  EKK--TVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELT 1033
              K       N +D +L+  +  LS     ++++ +  +  ++     +++
Sbjct: 1447 NGKGHRPGPDNIIDFELLAHYEMLSLDEQLDVAQQIGTTRSQILSNFSDIS 1497


>gi|389609725|dbj|BAM18474.1| spliceosomal protein sap [Papilio xuthus]
          Length = 367

 Score = 75.9 bits (185), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 69/268 (25%), Positives = 124/268 (46%), Gaps = 40/268 (14%)

Query: 747 TKGRILVFIV--EDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTR 804
           T+G + V+ +    GKL+L+ +        +L AFNGKL+A + + +++Y     D G R
Sbjct: 112 TEGSLHVYKIFSNTGKLELVHKTPIDDYPGALAAFNGKLMAGVGRMLRMY-----DIGRR 166

Query: 805 ELQSEC-GHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSA 863
           +L  +C   H   L   ++T G  I V D+ +S+  + YK  E  +   A D N  W++ 
Sbjct: 167 KLLRKCENRHIPNLIADIKTIGQRIFVADVQESVFCVKYKKRENQLIIFADDTNPRWITN 226

Query: 864 VEILDDDIYLGAENNFNLFTVRKNSEGATDE------------ERGRL-------EVVGE 904
             +LD D  +   + F   +V +     +D+            +RG L       E+V  
Sbjct: 227 TCVLDYDT-VAMSDKFGNVSVMRLPHSVSDDVDEDPTGNKALWDRGLLNGASQKGEIVVN 285

Query: 905 YHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLP---HEQYLFLEKLQ 961
           +H+GE V   +  +L+        G    +++ T++G +GV+  LP    E + F + L+
Sbjct: 286 FHVGETVASLQRATLI-------PGGSEALLYATISGALGVL--LPFTSREDHDFFQHLE 336

Query: 962 TNLRKVIKGVGGLNHEQWRSFNNEKKTV 989
            ++R     + G +H  +RS+    K V
Sbjct: 337 MHMRSENSPLCGRDHLSFRSYYYPVKNV 364


>gi|255948500|ref|XP_002565017.1| Pc22g10080 [Penicillium chrysogenum Wisconsin 54-1255]
 gi|211592034|emb|CAP98296.1| Pc22g10080 [Penicillium chrysogenum Wisconsin 54-1255]
          Length = 1392

 Score = 75.5 bits (184), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 84/375 (22%), Positives = 158/375 (42%), Gaps = 45/375 (12%)

Query: 694  VRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVY-----YCVGTAYVLPEENEPTK 748
            ++++  +T+  I +YPLD  E   ++ + +     N +       VGTA ++  E+ P +
Sbjct: 1019 IKVVSPKTWTIIDSYPLDPDEQVTAVKNVNIEVSENTHERRDLIVVGTA-IVKGEDMPAR 1077

Query: 749  GRILVFIV----------EDG-KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWM 797
            G I VF V          E G KL+LI ++  KGAV +L+   G+    + Q  +     
Sbjct: 1078 GTIYVFDVIKVAPDPEKPETGHKLKLIGKESVKGAVTALSGIGGQGFVIVAQGQKCMVRG 1137

Query: 798  LRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYN 857
            L++DG+    +      ++           +++GD +K +    Y  E   +    +D  
Sbjct: 1138 LKEDGSLLPVAFMDMQCYVTVAKELKGTGLVILGDAVKGLWFAGYSEEPYRMTLFGKDPE 1197

Query: 858  ANWMSAVEILDD--DIY-LGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRF 914
               + A + L D   +Y L A+++ NL  ++ + E        RL    +++ G F    
Sbjct: 1198 YLEVVAADFLPDGNKLYMLVADSDCNLHVLQYDPEDPKSSNGDRLLSRSKFYTGNF---- 1253

Query: 915  RHGSLVMRLPDSDVGQIPT-----------------VIFGTVNGVIGVIASLPHEQYLFL 957
               S V  LP + V    T                 V+  + NG + ++ S+  E Y  L
Sbjct: 1254 --ASSVTLLPRTAVSSERTESSEEGMDLDETFARHQVLIASQNGSLALVTSVAEESYRRL 1311

Query: 958  EKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKT 1017
              LQ+ L   +    GLN   +R+   E      +  +DG+L+  +L++ + R  EI+  
Sbjct: 1312 SALQSQLINTVDHPAGLNPRAFRAI--ESDGAAGRGMVDGNLLRLWLNMGKQRQTEIAGR 1369

Query: 1018 MNVSVEELCKRVEEL 1032
            +  +  E+   +E +
Sbjct: 1370 VGATEWEIKADLETI 1384


>gi|384946686|gb|AFI36948.1| cleavage and polyadenylation specificity factor subunit 1 [Macaca
            mulatta]
          Length = 1428

 Score = 75.5 bits (184), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 87/346 (25%), Positives = 142/346 (41%), Gaps = 55/346 (15%)

Query: 618  VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQS 672
            V    PF++   P   L   ++GEL I  +           +R IPL      + +  +S
Sbjct: 971  VDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVES 1030

Query: 673  RTFAICSLKNQSCAE-----------------------ESEMHFVRLLDDQTFEFI--ST 707
            + +A+ +  N  CA                        + E   ++L+   ++E I  + 
Sbjct: 1031 KVYAVATSTNTPCARIPRMTGEEKEFETIERDERYIHPQQEAFSIQLISPVSWEAIPNAR 1090

Query: 708  YPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-----IVE 757
              L  +E+   + + S   +  V     Y   GT  +  EE    +GRIL+      + E
Sbjct: 1091 IELQEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVT-CRGRILIMDVIEVVPE 1149

Query: 758  DG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECG 811
             G      K +++ EKE KG V +L   NG L++AI QKI L  W LR     EL     
Sbjct: 1150 PGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFL--WSLR---ASELTGMAF 1204

Query: 812  HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI 871
                +    + +  +FI+  D+MKSISLL Y+ E   +   +RD     + +V+ + D+ 
Sbjct: 1205 IDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNA 1264

Query: 872  YLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRF 914
             LG   ++ + NL       E        RL    ++H+G  VN F
Sbjct: 1265 QLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF 1310



 Score = 56.6 bits (135), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 107/566 (18%), Positives = 221/566 (39%), Gaps = 152/566 (26%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAM-----GDVSDRIG 60
            +G + ++   +  G  +D L ++ +  K  V+++D  + +L T ++      ++ D   
Sbjct: 74  FFGNVMSMASVQLAGAKRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPELRDGFV 133

Query: 61  RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPF-------DNKGQLKEA----------FN 103
           +     ++ + DPD R   + +Y     V+PF       +++G + E            +
Sbjct: 134 QNVHTPRVRV-DPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGEGQRSSFLPSYIID 192

Query: 104 IR-LEE--LQVLDIKFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDFVEG-- 153
           +R L+E  L ++D++FL+G  +PT+++L++ N+      A    T  +     +  +   
Sbjct: 193 VRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKVH 252

Query: 154 P--WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAFKAIPI 198
           P  WS  +L       + VP P+ GV++    +++Y +                  A P+
Sbjct: 253 PVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTGTTAFPL 312

Query: 199 RPSITKAYGRVDADGS--------RYLLGDHAGLLHLL-VITHEKEKVTGLKIELLGETS 249
           R   T+   R+  D +        + ++    G +++L +IT     V     +    + 
Sbjct: 313 R---TQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASV 369

Query: 250 IASTISYLDNAVVYIGSSYGDSQLI----KLNLQP--------------------DAKGS 285
           + +++  ++   +++GS  G+S L+    KL   P                    DA  S
Sbjct: 370 LTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASAVREAADKEEPPSKKKRVDATAS 429

Query: 286 Y--------------------------------VEVLERYVNLGPIVDFCVVD---LERQ 310
           +                                 EV +  +N+GP  +  + +   L  +
Sbjct: 430 WSAGGKSVPQDEVDEIEVYGSEAQSGTQLATYSFEVCDSILNIGPCANAAMGEPAFLSEE 489

Query: 311 GQG------QVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL-----RSSTDD 359
            Q       ++V CSG  K+G+L +++  I      + EL G   MW++     +   D+
Sbjct: 490 FQNSPEPDLEIVVCSGHGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEEEDN 549

Query: 360 P-------------------FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLF 400
           P                      FL++S    T IL      E+ E +  GF +Q  T+F
Sbjct: 550 PKGEGTEQEARSPEADDDGRRHGFLILSREDSTMILQTG--QEIMELDTSGFATQGPTVF 607

Query: 401 CHDAIYNQ-LVQVTSGSVRLVSSTSR 425
             +   N+ +VQV+   +RL+   ++
Sbjct: 608 AGNIGDNRYIVQVSPLGIRLLEGVNQ 633


>gi|145348791|ref|XP_001418827.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144579057|gb|ABO97120.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 1386

 Score = 74.7 bits (182), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 90/390 (23%), Positives = 165/390 (42%), Gaps = 41/390 (10%)

Query: 665  RICHQEQSRTFAICSLKNQSCAEESEMHF---VRLLDDQTFEFISTYPLDTFEYGCSILS 721
            R   Q+     A+  ++ +      +M     VRLL   + +    + L+  E+   + +
Sbjct: 985  REIEQDNVHGIALAKVRRERAKANDDMELQYSVRLLVPGSLDSAWQHALEPGEHVQCVRN 1044

Query: 722  CSFSDDSN----VYYCVGTAYVLPE-ENEPTKGRILVFIV--------EDG---KLQLIA 765
                D +         VGTA  +P  E+ P +GR+++F +         DG   K Q+  
Sbjct: 1045 VQLRDINTGALLSLLAVGTA--MPGGEDTPCRGRVILFQMVWERDAESMDGYRWKGQVCC 1102

Query: 766  EKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRG 825
             +E K A  +L+A +G L+ A+  K+ ++ W    DG  EL S       I  + +    
Sbjct: 1103 VREAKMACTALSALDGHLIVAVGTKLTVHTW----DGV-ELNSVAFFDTPIHTVSINVVK 1157

Query: 826  DFIVVGDLMKSISLLIYKHE--EGAIEERARDYNANWMSAVEILDDDI---YLGAENNFN 880
            +FI+VGDL K +    +K    E +I + ++D++   + + E L D      LG++ + N
Sbjct: 1158 NFILVGDLEKGLHFFRWKANGFEKSIIQLSKDFDRMDVVSTEFLIDGATLSLLGSDMSGN 1217

Query: 881  LFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIP------TV 934
                  + +     +  +L V   YH+G  ++R    ++      +  G+ P       V
Sbjct: 1218 ARIFGYDPKSLESWKGQKLLVRSAYHVGSPISRMVRFNVEGTTAKAAPGERPKGTNRHAV 1277

Query: 935  IFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRS---FNNEKKTVDA 991
             FGT++G +G+        Y  L  LQ  L   ++   G N   +R+   F  +   + A
Sbjct: 1278 FFGTLDGALGIFMPTDEPTYAKLHALQRELNTTVRSPIGCNPRTFRTPKVFEGKHVQLLA 1337

Query: 992  K-NFLDGDLIESFLDLSRTRMDEISKTMNV 1020
              + LDG L+  F  L+ T    +++   V
Sbjct: 1338 PLDVLDGGLLSKFETLTFTEQRAVAERSGV 1367



 Score = 58.5 bits (140), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 69/331 (20%), Positives = 138/331 (41%), Gaps = 66/331 (19%)

Query: 8   GRIATLELFR-----PHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-- 60
           G I ++ + R     P  + +D L IA    K  V+++DA + ++   +M      +G  
Sbjct: 67  GAIGSMSVLRRRFGAPRNQ-RDALLIAIRERKLSVVEYDAATGDVCCSSMHSFESALGCN 125

Query: 61  ------RPTDNGQIGIIDPDCRLIGLHL--------------YDGLFKVIPFDNKGQLK- 99
                 R +    + + DP+ R   + L               DG   ++  D++G+++ 
Sbjct: 126 PLGTTLRMSREAPLVVSDPEGRCAAVVLREDGVAGKVRVLPSVDGGLGLVANDDEGRVRG 185

Query: 100 ------EAFNIRLEELQVL-DIKFLYGCAKPTIVVLYQD----------NKDARHVKTYE 142
                 E+F + L  ++++ D  FL+G  +P + VLY+           +KD   +    
Sbjct: 186 PAASVRESFPLHLPGVRLIRDACFLHGYGEPALAVLYEKTPTWAGRYNLSKDTCEIVALS 245

Query: 143 VALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIRPSI 202
           V + DK      W + NL + +  L  + PPL G L+  ++ +++ S  +   + +    
Sbjct: 246 VDV-DKQKGTVIWRRQNLPSSSYKLTALLPPLGGALVFSQDFLLHESQESSSVLGLN--- 301

Query: 203 TKAYG--------RVDADGS--------RYLLGDHAGLLHLLVITHEKEKVTGLKIELLG 246
           T  +G         +  DG+        R L+    G L LL +  +   +  + ++  G
Sbjct: 302 TFGHGGPQEGNDAEITLDGAQASVVSEDRVLVTTKTGALLLLALHTDGRSLRRMMLQRAG 361

Query: 247 ETSIASTISYLDNAVVYIGSSYGDSQLIKLN 277
              ++S +  L   ++++GS  GDS L+K  
Sbjct: 362 GAVLSSGMCLLSRDLLFLGSRIGDSLLVKFT 392


>gi|399216612|emb|CCF73299.1| unnamed protein product [Babesia microti strain RI]
          Length = 1195

 Score = 74.3 bits (181), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 162/767 (21%), Positives = 306/767 (39%), Gaps = 97/767 (12%)

Query: 202  ITKAYGRVDADGSR-YLLGDHAGLLHLLVIT--HEKEK-VTGLKIELLGETSIASTIS-- 255
            + ++  R+  D +  Y   D  G L++  IT  HEK+K +   K+  L       TIS  
Sbjct: 350  LQRSLSRIGDDETHLYYACDVTGQLYMFKITIKHEKDKYLINDKMRNLDTIIPLYTISPF 409

Query: 256  ----YLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQG 311
                 L +  +++ +  GDS++++L     + G   E+  +  +L PI+D  V+      
Sbjct: 410  SLLLALSDDCIFVATRVGDSKILQL-----SNGKVSEIWSK-PSLSPIIDATVIG--GNC 461

Query: 312  QGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDPFDTFLVVSFIS 371
              Q++  SG  K  ++ I+  G+  +   ++ L  +  ++ + S    P   FLV   I+
Sbjct: 462  TDQLIAVSGIGKSANISILTIGLFFHNICNISLAKVSKLFIISSQ---PEMIFLVA--IT 516

Query: 372  ETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVS-STSRELRNE 430
             T+     + D+  E       S+   + C     NQL Q+T  S+R+ +    R  +  
Sbjct: 517  STQSYYFKMVDQSIE-----LISRGDKIICAFQYNNQLYQLTPNSIRISTIDNPRITKVL 571

Query: 431  WKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDINPIGE 490
            W S       V       +L+ +GG +L  L I      ++ +  ++   S     PI  
Sbjct: 572  WTSDEPI---VHYLKCENMLIISGGYNLSALVIDKSDPIKITYNHIQSFTS-----PI-- 621

Query: 491  NPSYSQIAAVGMWTDISVRIFSLPDLNLIT---KEHLGG--EIIPRS-------VLLCAF 538
                S IA V     + + + +  ++ LI     + +G   EI  R+       ++    
Sbjct: 622  ----SAIALVENMNKLHILVATFTNVLLIVDCETKSIGTICEIPTRTSCVGVSDLVYVVK 677

Query: 539  EGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKN----TTHVFAA 594
              I Y+   L DG+LL +          +R+ V    +      F SKN      +    
Sbjct: 678  NSIYYIFIGLTDGNLLVYT--------NEREDVRHFLECKGWEFFMSKNFGVYPLYFKHN 729

Query: 595  SDRPTVIYSSNKKLLYSNVNLKEVSHMC----PFNSAAF-PDSLAIAK-EGELTIGTIDD 648
            S R T+I      + +S +  + V  +     P     F  D L IA     L +G + D
Sbjct: 730  SQRLTLIGQHVHFVTFSPITGRIVDFLSLEVPPVRDLEFISDDLLIASLTDHLELGKLSD 789

Query: 649  IQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTY 708
               L +R IP+      I + + S    +   +  S +      ++ +    T   ++TY
Sbjct: 790  CNSLDVRRIPVSGALDLIEYHQPSNCVIVAFPQLTSESIFPSNAYIAVYSFVTGLLVTTY 849

Query: 709  PLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDG--------K 760
             L+    G SI   +F+  S+    +G +     +  P++GRI+  I  +         K
Sbjct: 850  YLEQSFEGSSI--STFTRGSDKTLLIGISST-DSDYLPSQGRIVACIYNNSTFTFKDSYK 906

Query: 761  LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTR------ELQSECGHHG 814
                  +   G V  +      + A +N  + +   +L             + SEC    
Sbjct: 907  FDKDLPEVPSGGVLQICTSKNFVFAGVNHSLVVLTPILHGRSWEGELSPIYIYSECN--- 963

Query: 815  HILALYVQTRGDFIVVGDLMKSISLL-IYKHEEG-AIEERARDYNANWMSAVEILDDDIY 872
              + + +    D +++ D++  + LL I+    G ++EE A+D + +W S  +++D +  
Sbjct: 964  --MIIGIDVCNDLVLLTDIITGVKLLKIFSDTNGDSVEEIAKDTDTSWCSCAKLIDQNHI 1021

Query: 873  LGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSL 919
            L ++   N+  ++   E   DE+   L  + ++HLGE VN+   G  
Sbjct: 1022 LVSDTKGNITLLKLIDEPINDEQCYILTKIAKFHLGECVNKIVIGKF 1068


>gi|242798830|ref|XP_002483249.1| cleavage and polyadenylation specificity factor subunit A, putative
            [Talaromyces stipitatus ATCC 10500]
 gi|218716594|gb|EED16015.1| cleavage and polyadenylation specificity factor subunit A, putative
            [Talaromyces stipitatus ATCC 10500]
          Length = 1382

 Score = 74.3 bits (181), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 95/435 (21%), Positives = 177/435 (40%), Gaps = 64/435 (14%)

Query: 654  IRSIPLGEHPRRICHQEQSRTFAICSLKNQ--SCAEESEMH-----------------FV 694
            IR IPL E    + +   S T+ + +   Q     +  E+H                  +
Sbjct: 950  IRKIPLNEEVDYLAYSTVSGTYVVGTTHEQDFKLPDNDELHPEWANEDISLRPKVAQGSI 1009

Query: 695  RLLDDQTFEFISTYPLDTFEYGCSILSCSF-----SDDSNVYYCVGTAYVLPEENEPTKG 749
            +LL+ +T++ I +Y  +  E   +I + +      + +      VGT +    E+   +G
Sbjct: 1010 KLLNPKTWKVIDSYTFNAAERITAIENINLEISEKTSERKDMIVVGTTFA-KGEDIAARG 1068

Query: 750  RILVFIV--------EDG---KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWML 798
             + VF V        E G   KL+LI E+  +GA+ +++   G+    + Q  +     L
Sbjct: 1069 NVYVFDVINVVPDPDEPGTNLKLKLIGEESVRGALTAVSGIGGQGFLIVAQGQKCMVRGL 1128

Query: 799  RDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNA 858
            +DDG+    +      ++  +         ++GD +K +    Y  E   +    +D + 
Sbjct: 1129 KDDGSLLPVAFIDVQCYVSVIKELKGTGMCLIGDALKGLWFTGYSEEPYKMTLFGKDLDE 1188

Query: 859  NWMSAVEILDD--DIY-LGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFR 915
              +   + L D   +Y L A+++ NL  ++ + E        RL    ++H+G F     
Sbjct: 1189 LEVVTADFLPDGKKLYILVADSDCNLHVLQYDPEDPKSSNGDRLLNRCKFHMGHF----- 1243

Query: 916  HGSLVMRLPDSDVGQ---------------IP--TVIFGTVNGVIGVIASLPHEQYLFLE 958
              S +  LP + V                 IP    +  T +G++ ++ SL  E Y  L 
Sbjct: 1244 -ASTITLLPRTAVSSELAVMNSDSMDIDSYIPLHQALITTQSGLMALVTSLSEESYRRLS 1302

Query: 959  KLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTM 1018
             LQ+ L   ++   GLN   +R+   E   V  +  +DG L+  +LDLSR R  EI+  +
Sbjct: 1303 ALQSQLSNTLEHPCGLNPRAYRAV--ESDGVVGRGMIDGKLLMRWLDLSRPRKLEIAGRV 1360

Query: 1019 NVSVEELCKRVEELT 1033
                 E+   +E ++
Sbjct: 1361 GADEWEIRADLEAVS 1375


>gi|425765419|gb|EKV04111.1| Cleavage and polyadenylation specificity factor subunit A, putative
            [Penicillium digitatum Pd1]
 gi|425767100|gb|EKV05682.1| Cleavage and polyadenylation specificity factor subunit A, putative
            [Penicillium digitatum PHI26]
          Length = 1271

 Score = 73.9 bits (180), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 92/434 (21%), Positives = 176/434 (40%), Gaps = 64/434 (14%)

Query: 654  IRSIPLGEHPRRICHQEQSRTFAICSLKNQ--SCAEESEMH-----------------FV 694
            +R +P+ E    + +   S T+ + + +       E  E+H                  +
Sbjct: 839  LRKVPIEEQVNFLAYSTSSETYVLGTSRQGDFKLPEGDELHPEWRNEELSFCPKIPESSI 898

Query: 695  RLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVY-----YCVGTAYVLPEENEPTKG 749
            +++  +T+  I +YPLD  E   ++ + +     N +       VGTA +   E+ P +G
Sbjct: 899  KVVSPKTWTIIDSYPLDPDEQVTAVKNVNIEVSENTHERMDLIVVGTA-IAKGEDMPARG 957

Query: 750  RILVFIV----------EDG-KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWML 798
             I VF V          E G KL+LI ++  KGAV +L+   G+    + Q  +     L
Sbjct: 958  TIYVFDVIKVAPDPERPETGRKLKLIGKETVKGAVTALSGIGGQGFIIVAQGQKCMVRGL 1017

Query: 799  RDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNA 858
            ++DG+    +      ++  +        +++GD +K +    Y  E   +    +D   
Sbjct: 1018 KEDGSLLPVAFMDMQCYVNVVKELKGTGMVILGDAVKGLWFAGYSEEPYRMTLFGKDPEY 1077

Query: 859  NWMSAVEILDD--DIY-LGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFR 915
              + A + L D   +Y L A+++ NL  ++ + E        RL    +++ G F     
Sbjct: 1078 LEVVAADFLPDGNKLYMLVADSDCNLHVLQYDPEDPKSSNGDRLLSRSKFYTGNF----- 1132

Query: 916  HGSLVMRLPDSDVGQIPT-----------------VIFGTVNGVIGVIASLPHEQYLFLE 958
              S V  LP + V    T                 V+  + NG + ++ S+  E Y  L 
Sbjct: 1133 -ASSVTLLPRTAVSSELTESSEEAMDVDETFAKYQVLIASQNGSLALVTSVAEESYRRLS 1191

Query: 959  KLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTM 1018
             LQ+ L   +    GLN   +R+   E      +  +DG+L+  +L++ + R  EI+  +
Sbjct: 1192 GLQSQLINTVDHPAGLNARAFRA--TESDGAAGRGMVDGNLLRLWLNMGKQRQAEIAGRV 1249

Query: 1019 NVSVEELCKRVEEL 1032
              +  E+   +E +
Sbjct: 1250 GATEWEIKADLETI 1263


>gi|84999522|ref|XP_954482.1| damage-specific DNA binding protein 1 [Theileria annulata]
 gi|65305480|emb|CAI73805.1| damage-specific DNA binding protein 1, putative [Theileria annulata]
          Length = 1501

 Score = 73.9 bits (180), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 62/273 (22%), Positives = 117/273 (42%), Gaps = 39/273 (14%)

Query: 800  DDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNAN 859
            D+ T  L     +  +   + + T+ D I VGDLM S+ +L  K  + ++ E  RD+N  
Sbjct: 1218 DENTPFLDVVASYDSNTFVVSLDTKDDVIFVGDLMTSVKML--KFRDNSLHETCRDFNTL 1275

Query: 860  WMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRF--RHG 917
            W +++  +D+   L ++++ N   ++K     TD++  + + +G +H GE VN+   R  
Sbjct: 1276 WTTSLVAVDNSSCLVSDDSGNFMLLKKVKHPVTDQQSIKFDKIGLFHHGEVVNKILKRTE 1335

Query: 918  SLVMRLPDSDVGQIPTVIFGTVNGVI--------GVIASLPHEQYLF------------- 956
              +    D+ + +     F   + VI           +SL     LF             
Sbjct: 1336 MPIHHEADTTITRNNPREFMVSSRVICESETNSPSETSSLNEHNNLFKGFFTCATTSGSL 1395

Query: 957  --------------LEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIES 1002
                          L  L+  ++ V K +G +     R+F +    +  + F+DGDL+ES
Sbjct: 1396 FQVCFFEDLKMFLKLSLLEHTMQLVQKDLGNIPSRNQRNFEDLHSNIPTRGFVDGDLVES 1455

Query: 1003 FLDLSRTRMDEISKTMNVSVEELCKRVEELTRL 1035
            FL L  +    +  TM ++ ++L  ++  L  L
Sbjct: 1456 FLKLPDSLKKWVFDTMLINSKQLGIKLNSLESL 1488



 Score = 41.2 bits (95), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 33/146 (22%), Positives = 62/146 (42%), Gaps = 10/146 (6%)

Query: 245 LGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER---------YVN 295
           LG   + S++  LD   ++  S  GDS ++ +N   ++   Y +  E+           N
Sbjct: 433 LGNFPVPSSLIKLDEEHIFYTSKMGDSSILSINSILNSSNRYEQNSEQSDRNSTEWAQTN 492

Query: 296 LGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL-R 354
           LGPI DF   +    G+  ++ C G    GS   +  G+        ++ G+  ++S+  
Sbjct: 493 LGPITDFAYREETSSGETTILACCGMGNSGSFCEIYLGLSSEIIHESDVPGVLNLFSVPM 552

Query: 355 SSTDDPFDTFLVVSFISETRILAMNL 380
            S  +   + L +SF   T+    +L
Sbjct: 553 KSLYNSTSSLLCISFFRFTKFYNFSL 578


>gi|164655043|ref|XP_001728653.1| hypothetical protein MGL_4214 [Malassezia globosa CBS 7966]
 gi|159102535|gb|EDP41439.1| hypothetical protein MGL_4214 [Malassezia globosa CBS 7966]
          Length = 1212

 Score = 73.6 bits (179), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 83/378 (21%), Positives = 152/378 (40%), Gaps = 36/378 (9%)

Query: 661  EHPRRICHQEQSRTFAIC-SLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSI 719
            E P R   Q+ +RT+A C +L+      E  +H       +T   +   P+D  + G   
Sbjct: 830  EQPVRDPKQDPTRTYAACGALELLVRVGEPPVHGYEFSACETVSALHMAPMDCLDRGSGR 889

Query: 720  LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDG------------KLQLIAEK 767
             +      +  Y           E+  +KG + VF V +             +LQL+  +
Sbjct: 890  RTFVVVGTTVTY----------GEDRSSKGHMYVFDVVECVPSEGMAASDALRLQLLCTE 939

Query: 768  ETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDF 827
            E +  V +L+  NG L+AA+ QK+ +  W    +    L +       +    +Q   +F
Sbjct: 940  EMRAPVTALHDLNGFLVAAVGQKLLIRSW----EYCEWLVTVAFLDMGMYTTSIQRVKNF 995

Query: 828  IVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG---AENNFNLFTV 884
            +++ D  +S   + ++ +   +    RDY    ++    L D   L     + N  L  +
Sbjct: 996  LLLTDYYQSAYFVAFQEDPARLVLLGRDYIPTSVTCGAFLIDRARLSIVTCDMNGCLRLM 1055

Query: 885  RKNSEGATDEERGRLEVVGEYHL-GEFVN-RFRHGSLVMRLPDSDVGQIPTVIFGTVNGV 942
              +    T     RL    EYH  GE V  R  HG     L  S       ++    NG 
Sbjct: 1056 DYHPSNPTSLGGQRLLARCEYHAPGEVVRARMLHGPY---LATSGECLTSEIVLAKRNGA 1112

Query: 943  IGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRS-FNNEKKTVDAKNFLDGDLIE 1001
            + V+  +  + +  L+  Q+ L ++++   GLN   +R+ FN       AK  LDG L+ 
Sbjct: 1113 VDVLVPVTEKIFPTLQLFQSQLVRMVRHTAGLNPRGFRAVFNQHISRPLAKGILDGTLLH 1172

Query: 1002 SFLDLSRTRMDEISKTMN 1019
            +   +SR ++  + + ++
Sbjct: 1173 TAESMSRPKLTSLVRDLS 1190


>gi|170576536|ref|XP_001893668.1| CPSF A subunit region family protein [Brugia malayi]
 gi|158600196|gb|EDP37499.1| CPSF A subunit region family protein [Brugia malayi]
          Length = 1323

 Score = 73.2 bits (178), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 104/468 (22%), Positives = 200/468 (42%), Gaps = 62/468 (13%)

Query: 591  VFAASDRPTVIYSSNKKLLYSNVNLK-EVSHMCPFNSAAFPDSLAIAKE-------GELT 642
            VF  S  P + +  +       VNL   +     FN+A  P+      E        +L 
Sbjct: 891  VFICSSYPCIFFLESGVPRLHPVNLDGPILSFTTFNNAVCPNGFIYLTERDRFMRVAKLP 950

Query: 643  IGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTF 702
               I D     ++ I +G     + +   S T+A+ + + +   +      V + DD+TF
Sbjct: 951  SDMILD-ASYPVKRINVGATVHSVVYLLHSNTYAVLTSEKRKVTKMC----VLINDDKTF 1005

Query: 703  EFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQ 762
            E       DTF Y           + + Y       V+PE  +PT            +++
Sbjct: 1006 E--EHEKPDTFVY----------PEMDQY----KLQVVPEPGQPTSKH---------RIK 1040

Query: 763  LIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQ 822
             + +KE KG V SL + NG LL  + QK+  + W+ +D+  + + S    H +I  L + 
Sbjct: 1041 TLYDKEQKGPVTSLCSCNGYLLTGMGQKV--FIWLFKDNNLQGI-SFLDMHFYIHQL-IG 1096

Query: 823  TRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMS--AVEILDDDIYLG-----A 875
             R +  +  D+ +S++LL Y+ E  A+   +RD  ++     A + L D+  +G      
Sbjct: 1097 VR-NLALACDMYRSLALLRYQEEYKALSLASRDMRSDVQPPMAAQFLIDNKQMGFIMSDE 1155

Query: 876  ENNFNLFTVRKNSEGATDEERG--RLEVVGEYHLGEFVNRF--RHGSLVMRLPDSDVGQI 931
              N  +F    N    T E  G  +L +  E ++G  VN F    G +     ++++  +
Sbjct: 1156 AANIAIF----NYLPETLESLGGEKLTLRAEINIGTVVNSFIRVKGHISSGFVENELFSL 1211

Query: 932  --PTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKT- 988
               +V+F +++G  G +  L  + +  L  LQ  +  ++    GLN +  R+   ++   
Sbjct: 1212 ERQSVLFASLDGSFGYLRPLTEKVFRRLHMLQQLMSSMVLQPAGLNAKGARAARPQRPNH 1271

Query: 989  -VDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRL 1035
             ++ +N +DGD+   +L LS    +++++ +  S   +   + E+ R+
Sbjct: 1272 YLNTRNLVDGDVAMQYLHLSLPEKNDLARKLGTSRYHIIDDLIEICRV 1319



 Score = 63.5 bits (153), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 92/439 (20%), Positives = 177/439 (40%), Gaps = 94/439 (21%)

Query: 71  IDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEEL-----QVLDIKFLYGCAKPTI 125
           +DP  R   + ++     V+PF++      ++ ++L ++      V+D+ FL G  +PT+
Sbjct: 140 VDPGQRCASMLVFGRYLAVLPFNDSSTQLHSYTVQLSQIDSRLVNVVDMVFLDGYYEPTL 199

Query: 126 VVLYQDNKDA--RHVKTYE------VALKDKDFV-EGPWSQNNLDNGADLLIPVPPPLCG 176
           + LY+  +    R    Y+      V+L  K+ V    W   NL    + ++ +P P+ G
Sbjct: 200 LFLYEPVQTTCGRACVRYDTMCVLGVSLNVKEQVLASVWQLTNLPMDCNQILAIPRPVGG 259

Query: 177 VLIIGEETIVYCS-------------ANAFKAIPIRPSITKAYGRVDADGS--------R 215
           +L++    ++Y +              + F   P++     A   +  DG+        +
Sbjct: 260 ILLVATNELIYLNQSVPPCGISLNSCMDGFTKFPLKDFKHMA---LTLDGAVVTVVSTNK 316

Query: 216 YLLGDHAG-LLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDS--- 271
            LL D  G L  L+++T     V  L+++   ET I  T++      ++IGS   DS   
Sbjct: 317 ILLCDRNGRLFTLILVTDATNSVKSLELKFQFETVIPCTMTSCAPGYLFIGSRLCDSVFL 376

Query: 272 --------------QLIKLNLQPDAKGS------------------------YVEVLERY 293
                         + +KL+ +P+A                            + VL++ 
Sbjct: 377 HCIFEQSTLEESATKKMKLSTEPNANEEDEDFELYGEVLPKVAKPDVTEELLNIRVLDKL 436

Query: 294 VNLGPI---------VDFCVVDLERQGQGQVVTCS-GAYKDGSLRIVRNGIGINEQASVE 343
           +N+GP          V     ++ R+     + C+ G  K GS+ I++  I      S  
Sbjct: 437 LNVGPCKKITGGCPSVSAYFQEITRKDPLFDLVCACGHGKFGSICILQRSIRPEIITSSS 496

Query: 344 LQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHD 403
           ++G+   W++    DD    F+      E   LA+  +++L E E   F +   T+   +
Sbjct: 497 IEGVVQYWAVGRREDDTHMYFIAS---RELGTLALETDNDLVELEAPIFSTSESTIAAGE 553

Query: 404 AIYNQL-VQVTSGSVRLVS 421
                L VQVT+ S+ +V+
Sbjct: 554 LADGGLAVQVTTSSLVMVA 572


>gi|70954357|ref|XP_746229.1| hypothetical protein [Plasmodium chabaudi chabaudi]
 gi|56526771|emb|CAH77136.1| hypothetical protein PC000016.02.0 [Plasmodium chabaudi chabaudi]
          Length = 372

 Score = 72.4 bits (176), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 76/363 (20%), Positives = 154/363 (42%), Gaps = 45/363 (12%)

Query: 694  VRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKG-RIL 752
            +++++    + I    L+  E   S+ +C       +     T   L   N P+   R+ 
Sbjct: 26   IKIINPINLQVIDKVSLELEEAALSVCACELEALHCLIVGTTTNMTLKNRNVPSASLRVY 85

Query: 753  VFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGH 812
             + + + KL L+     +   Y    FNG+++ ++  K+++Y       G ++L  +C +
Sbjct: 86   TYDI-NYKLNLLHITPIEDQPYCFCPFNGRVIVSVGNKLRIYAL-----GKKKLLKKCEY 139

Query: 813  HGHILALY-VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI 871
                 A+  ++  GD I   D+ +S+ +  Y   +  I   + D    W++  EILD   
Sbjct: 140  KDIPEAIVSIKVSGDRIFASDIRESVLIFFYDSNQNVIRLISDDIIPRWITCSEILDHHT 199

Query: 872  YLGAENNFNLFTVRKNSEGATDEERG-------------------RLEVVGEYHLGEFVN 912
             + A+   ++F +R   E A  EE G                   ++E +  +H+GE V 
Sbjct: 200  IMAADKFDSVFILRVPEE-AKQEEYGIANKCWYGGEVISSSTKNRKMEHIMSFHIGEIVT 258

Query: 913  RFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPH---EQYLFLEKLQTNLRKVIK 969
              +   L    P S       +I+ T+ G IG  A +P+   E+    + L+  LR    
Sbjct: 259  SLQKVKLS---PASS----ECIIYSTIMGTIG--AFIPYDNKEELELTQHLEIILRTEKH 309

Query: 970  GVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRV 1029
             + G  H  +RS+ +       ++ +DGDL E F  L      +++  +  + +E+ +++
Sbjct: 310  ALCGREHIFFRSYYHP-----VQHVIDGDLCEQFSSLPFDVQRKVASDLEKTPDEILRKL 364

Query: 1030 EEL 1032
            E++
Sbjct: 365  EDI 367


>gi|428186188|gb|EKX55039.1| hypothetical protein GUITHDRAFT_160593 [Guillardia theta CCMP2712]
          Length = 2290

 Score = 72.4 bits (176), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 93/376 (24%), Positives = 149/376 (39%), Gaps = 52/376 (13%)

Query: 690  EMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNV--------YYCVGTAYVLP 741
            E + V+L    + E + +Y  D F  G   L        N         +  VGT +   
Sbjct: 1209 EKYEVQLRAPYSMELLDSYEFD-FANGEKALCLQVVHLKNTRVKDSLLPFVAVGTGFQNG 1267

Query: 742  E-ENEPTKGRILVFIVE--------DGKLQLIAEK--------ETKGAVYSLNAFNGKLL 784
            E E     GRI VF V         +G+     +K        + K  V +L    G LL
Sbjct: 1268 ESETSRATGRIYVFEVTTVVGEEGYEGRTSFKIKKIFTSADIQDIKAPVSALCQLEGYLL 1327

Query: 785  AAIN--------QKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKS 836
             A           K+ +Y+W+       +L        H+    ++T   FIV GD+  S
Sbjct: 1328 VAQGPNPGMIGGSKLYVYEWV-----DEKLVGRAFFDAHLYITTLKTVKFFIVFGDIRHS 1382

Query: 837  ISLLIYKHEEGAIEERARDYNANWMSAVE--ILDDDIYLGA---ENNFNLFTVRKNSEGA 891
            + LL ++ +   ++  A+D     + A E  ++  +  L A   + N  +F    NS   
Sbjct: 1383 VHLLRWREDIRMLQLLAKDALPLSVYAAEFVVMGSNFGLLASDEQKNVQVFVFNPNS--- 1439

Query: 892  TDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPH 951
             +  R +L    + H+G  +N+F    L  R     +G      + T++G IG I  +P 
Sbjct: 1440 PEYRRQQLICRADLHVGSHINKFIRWPLPFR---PTLGVRTAAHYTTLDGGIGAIIPIPE 1496

Query: 952  EQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNE--KKTVDAKNFLDGDLIESFLDLSRT 1009
            + Y  L  LQ  L   +    GLN   WR +      K   AKNFLDG+L+  +L L   
Sbjct: 1497 QSYRRLLALQNLLVTAMPHYAGLNPRSWRLYKPAMCMKRRYAKNFLDGNLLGRYLHLDLA 1556

Query: 1010 RMDEISKTMNVSVEEL 1025
               ++S  +N + E +
Sbjct: 1557 LQMQLSSALNQTREAI 1572


>gi|146094112|ref|XP_001467167.1| putative CPSF-domain protein [Leishmania infantum JPCM5]
 gi|134071531|emb|CAM70220.1| putative CPSF-domain protein [Leishmania infantum JPCM5]
          Length = 1347

 Score = 71.6 bits (174), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 105/474 (22%), Positives = 188/474 (39%), Gaps = 101/474 (21%)

Query: 637  KEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAE-----ESEM 691
            ++G L IG+++ ++K     +PLGE   ++ H+  +    + S++     E      S +
Sbjct: 798  QDGRLHIGSLEALEKTSRTFLPLGETITQL-HEAAAWGGYVASVRKVEGDEVLFLPSSAI 856

Query: 692  HF-------------VRLLDDQTFEFISTYPLDTFEYGCS------ILSCSFSDDSNV-- 730
             F             V LL+++   F+ T  LD  E G        ++    SD + V  
Sbjct: 857  QFPWAADLSLWRSAAVPLLENERCVFLETVRLDGAEGGGVGERGGDLIDVPASDGAAVSE 916

Query: 731  -----YYCVGTAYVLPEENEPTKGRILVFIVED----GKLQLIAEKETKGAVYSLNA--- 778
                    +G+++  P+E     GRI    + +     +L LIA K+  GA+    A   
Sbjct: 917  EEWQHLLLIGSSFTFPDEQRARSGRITWCTLREERQRQRLHLIASKDIGGALQCCAAVPH 976

Query: 779  FNGKLLAAINQKIQLYKWMLRDDGTRELQSECG---HHGHILALYVQTRGDFIVVG-DLM 834
            + G++   +N  + LY+W   +D T   +  C        ++ LY  +    ++V  D+ 
Sbjct: 977  YKGRIALGVNGCVCLYQWN-TEDQTFVAEERCRVGLTVTKLIPLYHTSLAASVLVALDVR 1035

Query: 835  KSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAEN--------NFNLFTV-- 884
             S   +     +G ++   RD +        I+D  +   AEN        NF    V  
Sbjct: 1036 HSAFFIEVDTLQGNLKVLCRDADLRG-----IMDGHVGSDAENLCLFDDSLNFTALKVVP 1090

Query: 885  -------RKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSL----VMRLP---DSDVGQ 930
                      +  A    + R EV  + HLG+ V   R GS     +M  P    S   Q
Sbjct: 1091 LPVEARDGDAAAAARATAQYRFEVRAQCHLGDLVTCVRQGSFAATSLMEAPAHCSSAQNQ 1150

Query: 931  I-------PTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWR--- 980
            +       P ++F T +G  GV+  +    YL L  L+ +L + +  VGGL+H+ +R   
Sbjct: 1151 LLLPGIAGPQLVFATAHGGFGVVTPVHAATYLVLRALEASLVRTVPPVGGLSHQAFREVL 1210

Query: 981  -----------------SFNNEK-KTVDAKNFLDGDLIESFLDLSRTRMDEISK 1016
                             +   E+ +  +  N +DGD++E F+ LS +    + +
Sbjct: 1211 RAGQERGVSYLASKTGCALTRERLRRYEPLNTVDGDVVEQFVLLSPSDKKHVCR 1264


>gi|440294229|gb|ELP87246.1| splicing factor 3B subunit, putative [Entamoeba invadens IP1]
          Length = 1168

 Score = 71.6 bits (174), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 159/778 (20%), Positives = 313/778 (40%), Gaps = 91/778 (11%)

Query: 308  ERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIK-GMWSLRSSTDDPFDTFLV 366
            +++   + +  +G      ++++++ +   E A + L  I  G++ L+ S D     +LV
Sbjct: 431  DKKTANKYILYTGRGPSSCVQVLKHQMKTLEVAEITLPAIAMGVYPLQRSKD--VSEYLV 488

Query: 367  VSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRE 426
            V F + T IL ++ EDE++ET+       + ++          VQV    + + +S  +E
Sbjct: 489  VPFANATSILKIS-EDEVKETDETPVILSSYSVRIGMMPDGTFVQVLKEKIVVYTSKIKE 547

Query: 427  LRNEWKSPPGYSVNVATANASQVLLATGGGH--LVYLEIGDGILTEVKHAQLEYEISCLD 484
            +     S  G  V  A   +  V+  + G    ++  +  DG L E++      +I  + 
Sbjct: 548  V-----SVVGDIVCAAIVESFVVVGVSKGDDNVIMLFKYEDGGLHEIESKGSFGKIRSVT 602

Query: 485  INPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGG------EIIPRSVLLCAF 538
            I    EN    +  AV    D  +RI +L   N  T + L        E     V     
Sbjct: 603  I----ENKEVPKFCAVA-CEDGGIRILTL--FNTETTKALTRVSVISLESAINDVKFGVL 655

Query: 539  EGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITL-RTFSSKNTTH--VFAAS 595
                 ++  L DG L     +  +GE+ +     +G Q + L R +++  +    +   +
Sbjct: 656  NSKQIIVAGLDDGKLAWCFFDSNSGEIEETNFEFVGVQNVNLGRVYTTSESGEEWMVGYN 715

Query: 596  DRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDI-QKLHI 654
                ++ +SN ++  + ++   +  +  F ++   + +A+     L I   ++    L  
Sbjct: 716  GNGILLSTSNGQIKSTPLDQPNIYGVEGFKASFVDNPIALVSGNILKILVFENTTSSLSG 775

Query: 655  RSIPLGEHPRRICHQEQSR-TFAICSLKNQSCAEESEMH---------FVRLLDDQT-FE 703
              +PL   PR+I   E    TF +C+      A +  +            R+  + T  E
Sbjct: 776  VKVPLCMTPRKIVESETVEDTFVLCADNFSKYANDEIIRGDDGEWVSSIYRMDKEATKIE 835

Query: 704  FISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEEN-EPTKGRILVFIVEDGKLQ 762
             ++ +P + +    ++L+ S S    +Y  VG A        + T   I V+ V    +Q
Sbjct: 836  LVTKFPQNVYALCGTLLTVSDS----IYLVVGLAEKYTSRPIKFTNSSIAVYKVNVDNIQ 891

Query: 763  LIAEKETKGAVYSLNAFNGKLLAAINQKIQLY----KWMLRDDGTRELQSECGHHGHILA 818
             +   +T+  V S+  + G++L  IN+++++Y    K +LR    R+L SE     H++ 
Sbjct: 892  YLYTTQTEYPVRSMAVYGGRVLCGINRQLRMYEIGKKQLLRKAELRQLASEISDI-HVIN 950

Query: 819  LYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGA--- 875
              +Q  G       +   IS + +   +   +  A D    W      L    Y+G+   
Sbjct: 951  NKIQLVG-------ISDGISFVRFNQTKQEFDIYA-DTLPRWTVKSVALTPTSYIGSDKF 1002

Query: 876  -------------ENNFNLFT-VRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVM 921
                         EN  N+F+ +    +   +  + +   + E++LG+    F +  + +
Sbjct: 1003 GQIFVEGLDKETEENASNIFSSILSGEKTIYNGAKYKAVSLNEFYLGDIATGFVNCCVRI 1062

Query: 922  RLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQY---LFLEKLQTNLRKVIKGVGGLNHEQ 978
              P          IF   + + G+ A +P   +    F E+L+ ++R   + V G +H  
Sbjct: 1063 GAPQ---------IFIVSHLLGGISALIPLNGFGEIEFFEQLEMHMRVRHQNVNGRDHIA 1113

Query: 979  WRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRLH 1036
            +RS       V  K   DGDL E F  L+     EI+  M   V E+ K++ +L  + 
Sbjct: 1114 YRS-----SVVPVKGIFDGDLCELFEKLTEEEKKEIAGEMEREVGEIVKKLHDLRNMR 1166


>gi|395324102|gb|EJF56549.1| hypothetical protein DICSQDRAFT_93527 [Dichomitus squalens LYAD-421
            SS1]
          Length = 1433

 Score = 71.6 bits (174), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 112/511 (21%), Positives = 215/511 (42%), Gaps = 57/511 (11%)

Query: 565  LTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNK---KLLYSNVNLKEVSHM 621
            L ++K++S    P        +  + VF   DRP+ I S++K   ++  S  N+      
Sbjct: 923  LAEQKRISRLLLPFVTSPAPGQTYSGVFFTGDRPSWIVSTDKGGVRVFPSGHNVVHAFTT 982

Query: 622  CPFNSAAFPDSLAIAKEGELTIGTIDDI---QKLHIRSIPLGE-HPRRICHQEQSRTFAI 677
            C    +   D L  ++EG   +  + DI     L  RS+P    +   +     S   A 
Sbjct: 983  CSLWESRG-DFLLYSEEGPSLVEWMPDIILDAHLPARSVPRSRPYSHVVFDASSSLIVAA 1041

Query: 678  CSLKNQSCAEESEMHFVRLLDD----------QTFEFIST---YPLDTFEYGCS-ILSC- 722
             S  N+  + + + + V   D            T E IS      +D +E+  +  +SC 
Sbjct: 1042 SSFMNRFASYDEDGNIVWEPDSPNISFPHCETSTLELISPDGWITMDGYEFAANEFVSCV 1101

Query: 723  ------SFSDDSNV--YYCVGTAYVLPEENEPTKGRILVF----IVEDG--------KLQ 762
                  + S +S +  +  VGT  +   E+   KG + +F    +V D         +L+
Sbjct: 1102 VSVPLETVSTESGMKDFIAVGTT-INRGEDLAVKGAVYIFEIVEVVPDASLNIKRWWRLK 1160

Query: 763  LIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQ 822
            L+   + KG V  L   NG L++++ QKI +  + L +        + G    +    ++
Sbjct: 1161 LLCRDDAKGPVSFLCGMNGYLVSSMGQKIFVRAFDLDERLVGVAFLDVG----VYVTSLR 1216

Query: 823  TRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLF 882
               + +V+GD +KS+  + ++ +   +    +D +   ++  ++   D +L         
Sbjct: 1217 AVKNLLVIGDAVKSVWFVAFQEDPYKLVILGKDPHHCCVTRADLFFADGHLSIVTCDEEG 1276

Query: 883  TVRKNSEGATD-EERGRLEVV--GEYHLGEFVNRFRHGSLVMRLPDSDVGQIPT--VIFG 937
             VR  +    D E +G   ++   E+H G+    +R   LV R P +   +IP   +I G
Sbjct: 1277 VVRLYAYDPHDPESKGGQHLLRRTEFH-GQ--TEYRSSLLVARRPKAGDPEIPQARLICG 1333

Query: 938  TVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVD-AKNFLD 996
            +V+G +  +  +    +  L  LQ  L + ++ V  LN + +R   NE  +   +K  LD
Sbjct: 1334 SVDGSLTTLTYVDENAFKRLHLLQGQLIRTVQHVAALNPKAFRMVRNEYVSRPLSKGVLD 1393

Query: 997  GDLIESFLDLSRTRMDEISKTMNVSVEELCK 1027
            G+L+ +F DL   R +E+++ +      + K
Sbjct: 1394 GNLLATFEDLPIGRQNEVTRQIGTDRATVLK 1424


>gi|209881602|ref|XP_002142239.1| CPSF A subunit domain-containing protein [Cryptosporidium muris RN66]
 gi|209557845|gb|EEA07890.1| CPSF A subunit domain-containing protein [Cryptosporidium muris RN66]
          Length = 1509

 Score = 71.6 bits (174), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 108/536 (20%), Positives = 204/536 (38%), Gaps = 145/536 (27%)

Query: 641  LTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICS------LKNQSCAEE------ 688
            L   +ID +Q+ H RS+ L   P RIC+      + +        +KN     +      
Sbjct: 971  LRASSIDILQQYHCRSLQLNFTPERICYISNLNVYVVVGIWEKGLMKNSKSENKFLIQNN 1030

Query: 689  ----SEMHFVR------------LLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYY 732
                S + FV+            +L  +T E  S YP    EY    ++ +  + S  + 
Sbjct: 1031 STLKSSLSFVQALMCVISANNLEILHFKTLES-SVYP-TCLEYIELNINQTPENISESFI 1088

Query: 733  CVGTAYV---------LPEENEPTKGRILVFIVEDGKLQLIAEKET----KGAVYSLNAF 779
             +GT  V          P  + P  G++ +F +     + I ++ET    +G  Y++  F
Sbjct: 1089 VLGTNKVGHINDYSNKSPNIDLPI-GKLTLFSIRKLHSKFIVDEETSINAEGCPYAITRF 1147

Query: 780  N-GKLLAAINQKIQLYKWMLRDDGTRELQSEC---------------------------- 810
              G +L A+   +  Y+ +     +  L+                               
Sbjct: 1148 QLGTILVAVENSVVTYEIIQYSTSSNYLKDTSILPSSSSSSSSCLLLDTLPENEKTPKFK 1207

Query: 811  -----GHHGHILALYVQTRGD-FIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAV 864
                  +  H + ++++   D +++VGDLM+S+ L  Y       EE  RD +  W+   
Sbjct: 1208 FNKINSYSTHTMIVFLKLWKDEYLLVGDLMRSVGLWKYDKVAKQFEEICRDPSLAWVMDG 1267

Query: 865  EILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLP 924
              ++ ++Y+ ++ N N+  + K      DE    L+ +  +H GE V+ F+ G L+M  P
Sbjct: 1268 IFINKNLYMVSDENKNVRILTKPENPINDEMDTVLQSIAHFHSGEIVSTFQKGKLMMPYP 1327

Query: 925  D--SDVGQI----------PTVIFGTVNGVIGVIASLPHEQYLFLE----------KLQT 962
                ++  I            ++FGT  G + VI SL  +  +FL+           ++ 
Sbjct: 1328 RCLQEINNIGLDFCHSLYSEQIVFGTAQGSMSVIFSLNADYKMFLQLIMFEEAIIAAIKK 1387

Query: 963  NLRKVIKG--------------------------VGGLNHE-----QW------RSFNNE 985
            +LR + K                           +G +  E     QW      + +N E
Sbjct: 1388 SLRTIYKDCKVCNLSIKQIKQLYIKESSSSFSGVIGTIGGEDVTPFQWNLSKNYQDYNYE 1447

Query: 986  KKTVDA-KNFLDGDLIESFLDLSRTRMDEISKTM------NVSVEELCKRVEELTR 1034
             +   A + F+ GD+IE FL  S    + + + +      N+++ E  K++E + +
Sbjct: 1448 YECSGASRGFISGDIIELFLKFSNELQEMVIQELKAIQIDNITIPETSKQLEHIIQ 1503



 Score = 56.6 bits (135), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 88/406 (21%), Positives = 169/406 (41%), Gaps = 78/406 (19%)

Query: 3   DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW---------DAESSELITRAMG 53
           ++  Y  I  +  +RP G+  D + I TE+Y+  +++            +SS L    + 
Sbjct: 72  EISTYKNIIQIAPWRPKGQEFDDILIYTEQYELILIRAYKLQNIEDNSTDSSVLYLNVIQ 131

Query: 54  DVSDRIGRPT---DNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEA-----FNIR 105
             S  + RP     N    IID   + I L +Y+G  +V+       LK +       +R
Sbjct: 132 --SKDLFRPNLRKSNNIKMIIDSYIQRIILLIYEGCLQVVDCYTDLSLKMSVFTPPIMLR 189

Query: 106 LEELQVLDIKFLYGCAKPTIV-VLYQD--NKDARHVKTYEVALKDKDF-------VEGPW 155
           L EL +LD+  +   +   ++ +LY    ++D R +K  ++    K +       ++ P 
Sbjct: 190 LSELNILDLCLINSSSNKCLLGILYDSGYSEDPRLLKLIDLPSDLKSWTYNMRIGMKIPS 249

Query: 156 SQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS------------------------AN 191
           +   + N  D      P   G+L+ G+ ++ Y +                        +N
Sbjct: 250 NIMKIVNYEDF--NNDPLSNGLLLFGDGSVFYLTIGEILYNNNENLISQPLFIQCNFPSN 307

Query: 192 AFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-KVTGLKIELLGE 247
             + +      P+I  A   +D    R+L+ D+ G L ++V+  ++  K+  +K+++LG+
Sbjct: 308 YLEKVESGLTLPNIVDAIP-IDGKLKRWLVLDNTGGLFIMVVYKDQNLKIENIKVDILGQ 366

Query: 248 TSIASTISYLDNAVVYIGSSYGDSQLIKL------------NLQPDAKGSYV---EVLER 292
            S  S +  L N + +IGS   DS ++ +            N+ P     ++   E  E 
Sbjct: 367 YSPFSVLLSLKNDIYFIGSKICDSLVLYIYHNKPLIMQIIHNIGPIRDLLFIRDNEYRED 426

Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINE 338
            VN  P+ +     + +     +V   G    GS +I+ NGIG+N+
Sbjct: 427 IVNENPLSN---TQISKNLSLPLVAACGFGNSGSFKIICNGIGLNQ 469


>gi|312069702|ref|XP_003137805.1| hypothetical protein LOAG_02219 [Loa loa]
          Length = 1065

 Score = 71.2 bits (173), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 71/310 (22%), Positives = 146/310 (47%), Gaps = 33/310 (10%)

Query: 741  PEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRD 800
            PE  +PT            +++ + +KE KG V SL + NG LL  + QK+  + W+ +D
Sbjct: 770  PEPGQPTSKH---------RIKTLYDKEQKGPVTSLCSCNGYLLTGMGQKV--FIWLFKD 818

Query: 801  DGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANW 860
            +  + + S    H ++  L +  R +  +  D+ +S++LL Y+ E  A+   +RD  ++ 
Sbjct: 819  NNLQGI-SFLDMHFYVHQL-IGVR-NLALACDMYRSVALLRYQEEYKALSLASRDMRSDV 875

Query: 861  ---MSAVEILDDD----IYLGAENNFNLFTVRKNSEGATDEERG--RLEVVGEYHLGEFV 911
               M+A  I+D+     +      N  +F    N    T E  G  +L +  E ++G  V
Sbjct: 876  QPPMAAQFIIDNKQMGFVMSDEAANIAIF----NYLPETLESLGGEKLTLRAEINIGTVV 931

Query: 912  NRF--RHGSLVMRLPDSDVGQI--PTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKV 967
            N F    G +     ++++  +   +V+F +++G  G +  L  + +  L  LQ  +  +
Sbjct: 932  NSFIRVKGHISSGFVENELFSLERQSVLFASLDGSFGFLRPLTEKVFRRLHMLQQLMSSM 991

Query: 968  IKGVGGLNHEQWRSFNNEKKT--VDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEEL 1025
            +    GLN +  R+    +    ++ +N +DGD++  +L LS    +++++ +  S   +
Sbjct: 992  VPQPAGLNAKGARAARPPRPNHYLNTRNLVDGDMVMQYLHLSLPEKNDLARKLGTSRYHI 1051

Query: 1026 CKRVEELTRL 1035
               + E+ R+
Sbjct: 1052 IDDLIEICRV 1061


>gi|398019848|ref|XP_003863088.1| CPSF-domain protein, putative [Leishmania donovani]
 gi|322501319|emb|CBZ36398.1| CPSF-domain protein, putative [Leishmania donovani]
          Length = 1347

 Score = 71.2 bits (173), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 105/474 (22%), Positives = 187/474 (39%), Gaps = 101/474 (21%)

Query: 637  KEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAE-----ESEM 691
            ++G L IG+++ ++K     +PLGE   ++ H+  +    + S++     E      S +
Sbjct: 798  QDGRLHIGSLEALEKTSRTFLPLGETITQL-HEAAAWGGYVASVRKVEGDEVLFLPSSAI 856

Query: 692  HF-------------VRLLDDQTFEFISTYPLDTFEYGCSI------LSCSFSDDSNV-- 730
             F             V LL+++   F+ T  LD  E G         +    SD + V  
Sbjct: 857  QFPWAADLSLWRSAAVPLLENERCVFLETVRLDGAEGGGVGERGGDPIDVPASDGAGVSE 916

Query: 731  -----YYCVGTAYVLPEENEPTKGRILVFIVED----GKLQLIAEKETKGAVYSLNA--- 778
                    +G+++  P+E     GRI    + +     +L LIA K+  GA+    A   
Sbjct: 917  EEWQHLLLIGSSFTFPDEQRARSGRITWCTLREERQRQRLHLIASKDIGGALQCCAAVPH 976

Query: 779  FNGKLLAAINQKIQLYKWMLRDDGTRELQSECG---HHGHILALYVQTRGDFIVVG-DLM 834
            + G++   +N  + LY+W   +D T   +  C        ++ LY  +    ++V  D+ 
Sbjct: 977  YKGRIALGVNGCVCLYQWN-TEDQTFVAEERCRVGLTVTKLIPLYHTSLAASVLVALDVR 1035

Query: 835  KSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAEN--------NFNLFTV-- 884
             S   +     +G ++   RD +        I+D  +   AEN        NF    V  
Sbjct: 1036 HSAFFIEVDTLQGNLKVLCRDADLRG-----IMDGHVGSDAENLCLFDDSLNFTALKVVP 1090

Query: 885  -------RKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSL----VMRLP---DSDVGQ 930
                      +  A    + R EV  + HLG+ V   R GS     +M  P    S   Q
Sbjct: 1091 LPVEARDGDAAAAARATAQYRFEVRAQCHLGDLVTCVRQGSFAATSLMEAPAHCSSAQNQ 1150

Query: 931  I-------PTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWR--- 980
            +       P ++F T +G  GV+  +    YL L  L+ +L + +  VGGL+H+ +R   
Sbjct: 1151 LLLPGIAGPQLVFATAHGGFGVVTPVHAATYLVLRALEASLVRTVPPVGGLSHQAFREVL 1210

Query: 981  -----------------SFNNEK-KTVDAKNFLDGDLIESFLDLSRTRMDEISK 1016
                             +   E+ +  +  N +DGD++E F+ LS +    + +
Sbjct: 1211 RAGQERGVSYLASKTGCALTRERLRRYEPLNTVDGDVVEQFVLLSPSDKKHVCR 1264


>gi|384253955|gb|EIE27429.1| hypothetical protein COCSUDRAFT_64224 [Coccomyxa subellipsoidea
            C-169]
          Length = 1137

 Score = 70.9 bits (172), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 120/527 (22%), Positives = 210/527 (39%), Gaps = 104/527 (19%)

Query: 591  VFAASDRPTVIYSSNKKLLYSNVNLK-EVSHMCPFNSAAFPDSLAIA----KEGE----- 640
            +F + +RP  + +S   L+   ++++  VS M PF++   P     A     +GE     
Sbjct: 623  MFVSGERPLWLVASRGTLVAHAMDVEGRVSGMTPFHNINCPLGFITACMAENDGETLKIC 682

Query: 641  -LTIGTIDD----IQKLHIRSIPLGEHPRRICHQEQSRTFAICSLK------NQSCAEES 689
             L + T  D    +QK+ +R+ P      R+ +  ++R + +   +      +Q  A + 
Sbjct: 683  QLPMRTRLDTPWPLQKIAVRATP-----HRLAYYAEARLYVLLVSRPVPYREHQEEASDG 737

Query: 690  EMHF---------------------VRLLDDQTFEFISTYPLDTFEYGCSI----LSCSF 724
            + H                      VRLL+   ++ ++ + LD  E  CS+    L  + 
Sbjct: 738  DPHASYSYICADAAAKASGTELGGEVRLLEPGRYQTVARHALDPGEEPCSVAADWLRNAQ 797

Query: 725  SDDSNVYYCVGTAYVLPEENEPTKGRILVFIVE--------------DGKLQLIAEKETK 770
            +     Y  VGTA     E+ P  GRIL+F                   +L L+      
Sbjct: 798  TGALEPYITVGTALNY-GEDYPCSGRILLFKATRTSTSGAEQADPTISWQLTLVHASGFS 856

Query: 771  GAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVV 830
              V  L   +G+L+AA+   +Q+ +  LR      L      H  +    V T   FI++
Sbjct: 857  RPVQGLAVMDGRLVAAVGNNMQVME--LRGS---SLHMISFFHAQLFITSVATIKTFILL 911

Query: 831  GDLMKSISLLIYKHEEG---AIEERARDYNANWMSAVEILDDDIYL-----GAENNFNLF 882
            GD+ K ++  +Y  ++    A+ + ++DYN   + A E L +   L      A  N  LF
Sbjct: 912  GDVHKGLT-FVYADKKANYTALTQLSKDYNDVDVEAAEFLVNGKKLFLLACDAAQNLRLF 970

Query: 883  TVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRL-PDSDVG-QIPTVIFGTVN 940
                  E     +  +L  +G  H+G+ +      SL  R+ P S  G Q+   +FG+  
Sbjct: 971  AYDGGKEQQATWQGKKLLPLGAIHVGQNIC----SSLSHRITPASATGVQLRAAVFGSAA 1026

Query: 941  GVIGVIA----SLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSF--NNEKKTVDAKNF 994
            G I  +A     LP E+ L    LQ  +   +  V GLN   +R    +  K     ++F
Sbjct: 1027 GSIASLAPTWDGLPAEELL---ALQREMVLAVPQVAGLNPVSFRRRYKHGVKALAGGQSF 1083

Query: 995  ---------LDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
                     LD D +  F  L  T    ++   N+S +++   + E+
Sbjct: 1084 EAPVSDDRVLDLDQLNRFQWLPLTEQVALAAKCNLSRQQVLHALREM 1130


>gi|213408090|ref|XP_002174816.1| predicted protein [Schizosaccharomyces japonicus yFS275]
 gi|212002863|gb|EEB08523.1| predicted protein [Schizosaccharomyces japonicus yFS275]
          Length = 1053

 Score = 70.9 bits (172), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 165/773 (21%), Positives = 310/773 (40%), Gaps = 81/773 (10%)

Query: 4   VPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSEL-ITRAMGDVSDRIGRP 62
           +P++     L  +RP    QD+L I T+   +  + WD    ++ +   +   + +   P
Sbjct: 53  IPLHANALHLLAYRPLDLEQDYLLIVTQDGYYFSVYWDDNFGKVALDSPLQMCTTQPSYP 112

Query: 63  -TDNGQIGIIDPDCRLIGLHLYDGLFKVIP---FDNKGQLKEAFNI------RLEELQVL 112
            T      +++ + +L+ +   +   KV+    F  + + ++ F+I       L    V+
Sbjct: 113 NTSWRSFCLVNQEAQLVLIRTSE--MKVLCLSIFSEEQRKQQGFSIGDGAELLLPGYLVM 170

Query: 113 DIKFLYGCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPP 172
           D+ FL     P + VL Q   D+  V  +++ L+   F  G     N  +    L+ V  
Sbjct: 171 DMCFLQHGNPPQVAVLSQ-TVDSTKVTVHKLNLQSDSFDSGSVVLKN--SLYYKLMAVGK 227

Query: 173 PLCGVLIIGEETIVYCSANAFKAIPIR-PSITKA-YGRVDADGSRYLLGDHAGLLHLLVI 230
              G++ +GE+ I Y   N    I IR PS+  + Y  +     +++L    G ++ L  
Sbjct: 228 K--GLIALGEDNISY--FNRGTVISIRAPSVRFSHYCALRGLKRQFVLVSAEGNVYTLSC 283

Query: 231 THEKEKVTGLKIELLGETSI--ASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVE 288
                K+  L +E L  + +   S++  +D   ++  +  G   ++ L        SY  
Sbjct: 284 NPLFHKLYTLMVEELPFSPVFNVSSVMAIDEERLFASNINGSIHILSL-------PSYT- 335

Query: 289 VLERYVNLGPIVDFCVVDLERQGQGQ-VVTCSGAYKDGSLRIVRNGIGINEQASVELQGI 347
           V E     G I D   +     G G  ++ C+G+   GSL  + N +     +  +L G+
Sbjct: 336 VSEITAGHGAIHD---MQFFESGSGNTLLLCAGSLNSGSLIYLCNSLSAEPISLFDLSGV 392

Query: 348 KGMWSLRSSTDDPFDTFL--VVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAI 405
             +W +       F + L   V   +ET++  M     L E E +   S  ++     A 
Sbjct: 393 LHVWPVH------FQSVLKLFVGMANETKVFTMV---SLYEAEFDPELSLDESTLALSAC 443

Query: 406 YNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD 465
            N  VQ+T+ S+R  + TS++       P     +V++  A   ++      ++Y     
Sbjct: 444 ENGYVQITNKSLRYCTETSKD-----AIPLEGITSVSSLGAYAAIVIDNTSLVIY----- 493

Query: 466 GILTEVKHAQLEYEISCLDI-NPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHL 524
            +  E+       E+S +D  +P+           VG W   ++ + ++ D + I     
Sbjct: 494 NMTKEIARTTFSKEVSAVDFASPLR--------IVVGFWNK-TMEMLAIRDNSFICVLRK 544

Query: 525 GGEIIPRSVLLCAFEGISYLLCALG--DGHLLNFLLNMKTGELTDRKKVSLGTQPITLRT 582
               IPR VLL  F     L   +G  +G+L   +  ++   +T+     LG+ PI    
Sbjct: 545 SLNFIPRDVLL-QFSNDDRLHLFMGSDEGNLC--VATVEKSLITNMDYRVLGSTPIHFTR 601

Query: 583 FSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELT 642
                  +V   +D+P ++Y  + KL +  +N+     +  F        L  A    ++
Sbjct: 602 LLISTGAYVICTNDKPHMVYGLHGKLAFMPLNVPSSIDLNAFLDEKLRFCLVSAHSEGVS 661

Query: 643 IGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMH--FVRLLDDQ 700
           I  +D+  KL +  +     P++    +Q+  F+    +     E S  +      + DQ
Sbjct: 662 ILHLDNQPKLSVTRLKTKLLPQKAAIIDQTVLFSGLKQQPNGLGELSAQYSLVCSKVYDQ 721

Query: 701 TFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILV 753
            FE  S + LD FE   S+++ S S      + V TA   P       GR+LV
Sbjct: 722 -FELCS-FKLDEFEECTSVIASSSS-----LFVVTTAKKYPYNYFSVSGRLLV 767


>gi|441648592|ref|XP_004093268.1| PREDICTED: LOW QUALITY PROTEIN: cleavage and polyadenylation
            specificity factor subunit 1 [Nomascus leucogenys]
          Length = 1177

 Score = 70.9 bits (172), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 59/220 (26%), Positives = 106/220 (48%), Gaps = 12/220 (5%)

Query: 814  GHILALYVQTR--GDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI 871
             H +A +V+++   +FI+  D+MKSISLL Y+ E   +   +RD     + +V+ + D+ 
Sbjct: 940  AHYVAYHVESKVCPNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNA 999

Query: 872  YLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFR----HGSLVMRLP 924
             LG   ++ + NL       E        RL    ++H+G  VN F      G+      
Sbjct: 1000 QLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRGATEGLSK 1059

Query: 925  DSDVGQIPTVI-FGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFN 983
             S V +   +  F T++G IG++  +  + Y  L  LQ  L  ++    GLN   +R  +
Sbjct: 1060 KSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLH 1119

Query: 984  NEKKTVD--AKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
             +++T+    +N LDG+L+  +L LS     E++K +  +
Sbjct: 1120 VDRRTLQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTT 1159



 Score = 50.8 bits (120), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 101/489 (20%), Positives = 206/489 (42%), Gaps = 79/489 (16%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAM-----GDVSDRIG 60
            +G + ++   +  G  +D L ++ +  K  V+++D  + +L T ++      ++ D   
Sbjct: 74  FFGNVMSMASVQLAGAKRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPELRDGFV 133

Query: 61  RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPF-------DNKGQLKEA----------FN 103
           +     ++ + DPD R   + +Y     V+PF       +++G + E            +
Sbjct: 134 QNVHTPRVRV-DPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGEGQRSSFLPSYIID 192

Query: 104 IR-LEE--LQVLDIKFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDFVEG-- 153
           +R L+E  L ++D++FL+G  +PT+++L++ N+      A    T  +     +  +   
Sbjct: 193 VRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKVH 252

Query: 154 P--WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAFKAIPI 198
           P  WS  +L       + VP P+ GV++    +++Y +                  A P+
Sbjct: 253 PVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTGTTAFPL 312

Query: 199 RPSITKAYGRVDADGS--------RYLLGDHAGLLHLL-VITHEKEKVTGLKIELLGETS 249
           R   T+   R+  D +        + ++    G +++L +IT     V     +    + 
Sbjct: 313 R---TQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASV 369

Query: 250 IASTISYLDNAVVYIGSSYGDSQLIKL--NLQPDAKGSYVEVL--ERYVNLGPIVDFCVV 305
           + +++  ++   +++GS  G+S L+K    LQ     +  E    E   +    VD   V
Sbjct: 370 LTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASAVREAADKEEPPSKKKRVD-ATV 428

Query: 306 DLERQGQG----QVVTC---SGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTD 358
               +GQ     QVVT     G Y    +  V   +   E+ + + +G +   S   + D
Sbjct: 429 GWSGEGQKSIRPQVVTTFELPGCY---DMWTVIAPVRKEEEDNPKGEGTEQEPSTPEADD 485

Query: 359 D-PFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQ-LVQVTSGS 416
           D     FL++S    T IL      E+ E +  GF +Q  T+F  +   N+ +VQV+   
Sbjct: 486 DCRRHGFLILSREDSTMILQTG--QEIMELDTSGFATQGPTVFAGNIGDNRYIVQVSPLG 543

Query: 417 VRLVSSTSR 425
           +RL+   ++
Sbjct: 544 IRLLEGVNQ 552


>gi|315045910|ref|XP_003172330.1| serine/threonine protein kinase [Arthroderma gypseum CBS 118893]
 gi|311342716|gb|EFR01919.1| serine/threonine protein kinase [Arthroderma gypseum CBS 118893]
          Length = 1397

 Score = 70.5 bits (171), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 99/428 (23%), Positives = 177/428 (41%), Gaps = 52/428 (12%)

Query: 655  RSIPLGEHPRRICHQEQSRTFAICSLKNQ--SCAEESEMHF-----------------VR 695
            R IPLGE    I +   S ++ I +   +     E+ E H                  V+
Sbjct: 965  RKIPLGEQVDCIVYSSASESYVIGTSTKEDFKLPEDDESHTEWRNEFITFLPQLDRGTVK 1024

Query: 696  LLDDQTFEFISTYPLDTFE-YGC-SILSCSFSDDSNV---YYCVGTAYVLPEENEPTKGR 750
            LL+ + +  I  Y ++  E   C  I+    S+ ++       VG+A    E+  P KG 
Sbjct: 1025 LLEPKNWSAIDIYEVEPAERITCIKIIRLEISETTHERKDMVVVGSAVAKGEDIVP-KGC 1083

Query: 751  ILVFIVED-----------GKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLR 799
            I VF + D            KL+L A +E KGAV +++   G+    + Q  +     L+
Sbjct: 1084 IRVFEIIDVVPDPDHPEKNKKLKLFAREEVKGAVTAVSGIGGQGFLIVAQGQKCMVRGLK 1143

Query: 800  DDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNAN 859
            +DG+    +      ++  L         ++GD  K +    Y  E   ++   ++    
Sbjct: 1144 EDGSLLPIAFKDTQCYVNVLKELKGTGMCIIGDAFKGLWFTGYSEEPYKLDLFGKENENL 1203

Query: 860  WMSAVEILDD--DIY-LGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFR- 915
             +   + L D   +Y L A+++ NL  ++ + E  +  +  RL     +H G F +    
Sbjct: 1204 AVVDADFLPDGNKLYILVADDDCNLHVLQYDPEDPSSSKGDRLLHRSVFHTGHFASTMTL 1263

Query: 916  --HGSLVMRLP------DSDVGQIPT---VIFGTVNGVIGVIASLPHEQYLFLEKLQTNL 964
              HGS  +  P      D+D+   P+   V+     G IGVI+ L  + Y  L  LQ+ L
Sbjct: 1264 LPHGSHTLSSPVDEDAMDTDLPPPPSKYQVLITFQTGSIGVISPLNEDSYRRLLALQSQL 1323

Query: 965  RKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEE 1024
               ++   GLN   +R+  ++      +  +DG+L+  +LD+   R  EI+  +   V  
Sbjct: 1324 VNALEHPCGLNPRGYRAVESDGMG-GQRGMIDGNLLLRWLDMGAQRKAEIAGRVGADVGA 1382

Query: 1025 LCKRVEEL 1032
            +   +E+L
Sbjct: 1383 IRIDLEKL 1390


>gi|68071595|ref|XP_677711.1| hypothetical protein [Plasmodium berghei strain ANKA]
 gi|56497932|emb|CAI04454.1| conserved hypothetical protein [Plasmodium berghei]
          Length = 493

 Score = 70.5 bits (171), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 80/318 (25%), Positives = 132/318 (41%), Gaps = 49/318 (15%)

Query: 641 LTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQ 700
           L I  +++I + +I+ IP      +I   ++S       L   SC  E E H       Q
Sbjct: 53  LYISHLNEINETYIQKIPFYRTVEKIAFHKES------GLLITSCPPE-EKHKTNKNLKQ 105

Query: 701 TFEFISTYPLDTFEYGCSILS-------CSFSDDSNVY---------YCVGTAYVLPEEN 744
              F + Y  ++F+Y   I S       C +  + ++Y          CVGTA +    +
Sbjct: 106 IICFFNPYQ-NSFKYSYIIPSKYNVSSICIYQINKDIYPNKSNINTLICVGTANINDRVS 164

Query: 745 EPTKGRILVFIVE--DGKLQL---IAEKETKGAVYSLNAFNGKLLAAINQK--------- 790
           EP+ G I +F  +  D   ++          G +  L  F  KL++ IN           
Sbjct: 165 EPSSGNIYIFFAKKKDNLFEIKHIYTHNVNVGGITHLKQFYDKLISTINNTVVILDISDF 224

Query: 791 -IQLYKWM--------LRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLI 841
            I L K++        L +DGT    +       I++L V    ++IVVGD+M S+++L 
Sbjct: 225 LINLDKYVDNTNKPIKLENDGTIVDVASFTPSSWIMSLDVIE--NYIVVGDIMTSVTILS 282

Query: 842 YKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEV 901
           Y      + E  RDY+  W + V  L    +L ++   N    +K+S    DE+  +L  
Sbjct: 283 YDFNNSTLTEVCRDYSNVWCTFVCALSKSHFLVSDMESNFLVFQKSSIRYNDEDSFKLSR 342

Query: 902 VGEYHLGEFVNRFRHGSL 919
           V  ++ G  VN+    SL
Sbjct: 343 VAFFNHGHVVNKMLPVSL 360


>gi|402590016|gb|EJW83947.1| hypothetical protein WUBG_05142 [Wuchereria bancrofti]
          Length = 374

 Score = 70.5 bits (171), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 72/310 (23%), Positives = 146/310 (47%), Gaps = 33/310 (10%)

Query: 741  PEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRD 800
            PE  +PT            +++ + +KE KG V SL + NG LL  + QK+  + W+ +D
Sbjct: 79   PEPGQPTSKH---------RIKTLYDKEQKGPVTSLCSCNGYLLTGMGQKV--FIWLFKD 127

Query: 801  DGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANW 860
            +  + + S    H +I  L +  R +  +  D+ +S++LL Y+ E  A+   +RD  ++ 
Sbjct: 128  NNLQGI-SFLDMHFYIHQL-IGVR-NLALACDMYRSLALLRYQEEYKALSLASRDMRSDV 184

Query: 861  MS--AVEILDDDIYLG-----AENNFNLFTVRKNSEGATDEERG--RLEVVGEYHLGEFV 911
                A + L D+  +G        N  +F    N    T E  G  +L +  E ++G  V
Sbjct: 185  QPPMAAQFLIDNKQMGFIMSDEAANIAIF----NYLPETLESLGGEKLTLRAEINIGTVV 240

Query: 912  NRFRH--GSLVMRLPDSDVGQI--PTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKV 967
            N F    G +     ++++  +   +V+F +++G  G +  L  + +  L  LQ  +  +
Sbjct: 241  NSFIRVKGHISSGFVENELFSLERQSVLFASLDGSFGYLRPLTEKVFRRLHMLQQLMSSM 300

Query: 968  IKGVGGLNHEQWRSFNNEKKT--VDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEEL 1025
            +    GLN +  R+   ++    ++ +N +DGD++  +L LS    +++++ +  S   +
Sbjct: 301  VLQPAGLNAKGARAARPQRPNHYLNTRNLVDGDVVMQYLHLSLPEKNDLARKLGTSRYHI 360

Query: 1026 CKRVEELTRL 1035
               + E+ R+
Sbjct: 361  IDDLNEICRV 370


>gi|403373826|gb|EJY86839.1| hypothetical protein OXYTRI_09860 [Oxytricha trifallax]
          Length = 1380

 Score = 70.5 bits (171), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 86/379 (22%), Positives = 154/379 (40%), Gaps = 58/379 (15%)

Query: 19  HGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDNGQIGIIDPDCRLI 78
           + + QD++F+ T  +  C+L        ++  + G++SD  G+  +      +  D R I
Sbjct: 112 NKKVQDYIFVLTADFN-CLLFAHDTKQGIVCLSTGNISDVTGQKREPPYSVFLGSDGRYI 170

Query: 79  GLHLYDGLFKVIPF----DNKGQLKEAFNIRL---EELQVLDIKFLYGCAKPTIVVLYQD 131
           GL LY+ + K+IP     + K  L  AFN+R+   E  Q++ +          I V YQ 
Sbjct: 171 GLMLYENIIKIIPLVKGQNGKLALSNAFNVRIKHPEAHQIIPLLDDEQNGHSAIAVFYQQ 230

Query: 132 NKDAR----------HVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIG 181
             +             +K Y + + +KD +E   ++  LD     +I V     G  I  
Sbjct: 231 QPNRTINPNNTYFQLKLKKYYLNILEKD-IEPAQNEQQLDFENMHIIRVKELEFGGFIAF 289

Query: 182 EETIVYCSANAFKA--------IPIRPSITKAYGRVDA---------DGSRYLLGDHAGL 224
           +   ++C      +         P++ +   A  + D             RYL     G 
Sbjct: 290 DPENIHCYKKRINSKIISKKLRKPMKVTSITAIDQYDPLTKRTKQGLQLMRYLFATECGD 349

Query: 225 LHLLVITHE-------------KEKVTGLKIELLGET-SIASTISYLDNAVVYIGSSYGD 270
           L+LL    E              E  + + IE L +     +T++YLDN + + GS   D
Sbjct: 350 LYLLAFNLEYLHLVTSIGNINPHEATSFMMIEFLAQKLPSCTTLNYLDNGLFFYGSKLAD 409

Query: 271 SQLIKLNLQPDAKGS----YVEVLERYVNLGPIVDFCVVDLE--RQGQGQVVTCSGAYKD 324
           S L  L L+ +  G     Y+ + +RY +LG I D  + + +  R  Q  +V   G   +
Sbjct: 410 S-LTAL-LETENTGHKDYLYISIRKRYESLGCISDLAIREQQNIRGQQKDLVVVGGQGHN 467

Query: 325 GSLRIVRNGIGINEQASVE 343
             + ++R G+ +   A+ +
Sbjct: 468 SHISVLRKGVSVKTLAATD 486



 Score = 64.3 bits (155), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 71/360 (19%), Positives = 151/360 (41%), Gaps = 29/360 (8%)

Query: 590  HVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIA---------KEGE 640
            H   + D P +   S+K+ +YS        H    N      ++ I           + +
Sbjct: 817  HKLKSDDGPIIYVQSSKQAIYSLTRQTLQMHFINENDIKQARTMMIEDKCIFVYENTQNQ 876

Query: 641  LTIGTIDDIQKLHIRSIPLGEHPRRICH-QEQSRTFAICSLKNQSCAE--ESEMHFVRLL 697
            +  G +D+     ++ I   +   +I     ++   A+  + N    +     ++ + LL
Sbjct: 877  IIFGQLDNFNSTQLKQIKFDKEVCKISLVPNENLMIALMEVPNYEYLDLNSPAVNQIALL 936

Query: 698  DDQTFEFISTYPLDT---------FEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTK 748
            D  T E +  + L           FE   S  + S  + S  Y  +GTA +  +E  P+K
Sbjct: 937  DSSTMEMLELFSLQKEEIASCLYLFEKKPSSATLSIRNQSR-YVAIGTAIIKTDEYLPSK 995

Query: 749  GRILVFIVEDGKLQLIAEK--ETKGAVYSLNAF--NGKLLAAINQKIQLYKWMLRDDGTR 804
            GR+++F ++  + +LI +   +T G+V S+     +  L+  +N  +QL+   ++    +
Sbjct: 996  GRLMIFEIDTRQRKLIIKHSFQTNGSVQSIAMLRDDKYLVLGVNNDLQLFSMQIKPQEIK 1055

Query: 805  ELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEE--GAIEERARDYNANWMS 862
                +    G  +   +    + +VVGD+M+++    +K ++    + E A   ++ W++
Sbjct: 1056 LQLLQTLQVGTAIQ-QIFAVNNTVVVGDIMRALYTFDFKEQKQNKLLNESAHTNHSIWVN 1114

Query: 863  AVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMR 922
             +  L +   +  +   N+F   KN     D E+ +L +V    +GE   +   GSL ++
Sbjct: 1115 TMLPLSESNTMVFDKECNMFVFNKNLLPTNDHEKFKLNMVACMKIGEEATKAIFGSLAIQ 1174


>gi|121797760|sp|Q2TZ19.1|CFT1_ASPOR RecName: Full=Protein cft1; AltName: Full=Cleavage factor two protein
            1
 gi|83775384|dbj|BAE65504.1| unnamed protein product [Aspergillus oryzae RIB40]
          Length = 1393

 Score = 70.5 bits (171), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 76/331 (22%), Positives = 139/331 (41%), Gaps = 40/331 (12%)

Query: 734  VGTAYVLPEENEPTKGRILVFIV-----------EDGKLQLIAEKETKGAVYSLNAFNGK 782
            VGTA+   E+   ++G + VF V            D KL+L+ ++  KGAV +L+   G+
Sbjct: 1065 VGTAFARGEDIA-SRGCVYVFEVIKVVPDPKRPEMDRKLRLVGKEPVKGAVTALSEIGGQ 1123

Query: 783  LLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIY 842
                + Q  +     L++DG+    +      H+  +         ++ D +K +    Y
Sbjct: 1124 GFLIVAQGQKCIVRGLKEDGSLLPVAFMDVQCHVSVVKELKGTGMCIIADAVKGLWFAGY 1183

Query: 843  KHEEGAIEERARDYNANWMSAVEILDDD---IYLGAENNFNLFTVRKNSEGATDEERGRL 899
              E   +   A+D +   + A + L D      L A+++ NL  ++ + E        RL
Sbjct: 1184 SEEPYKMSLFAKDLDYLEVLAADFLPDGNKLFILVADSDCNLHVLQYDPEDPKSSNGDRL 1243

Query: 900  EVVGEYHLGEFVNRFRHGSLVMRLPDSDVG---------------QIP--TVIFGTVNGV 942
                ++H G F+      S +  LP + V                +IP   ++  + NG 
Sbjct: 1244 LSRSKFHTGNFI------STLTLLPRTSVSSEQMISDVDAMDVDIKIPRHQMLITSQNGS 1297

Query: 943  IGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIES 1002
            +G++  +  E Y  L  LQ+ L   I+   GLN   +R+   E      +  LDG L+  
Sbjct: 1298 VGLVTCVSEESYRRLSALQSQLTNTIEHPCGLNPRAFRAV--ESDGTAGRGMLDGKLLFQ 1355

Query: 1003 FLDLSRTRMDEISKTMNVSVEELCKRVEELT 1033
            +LD+S+ R  EI+  +  +  E+    E ++
Sbjct: 1356 WLDMSKQRKVEIASRVGANEWEIKADFEAIS 1386


>gi|125526594|gb|EAY74708.1| hypothetical protein OsI_02601 [Oryza sativa Indica Group]
          Length = 314

 Score = 70.5 bits (171), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 33/65 (50%), Positives = 45/65 (69%), Gaps = 7/65 (10%)

Query: 846 EGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEY 905
           +G  +E +RDYN  WM+AVE+LDD +Y+GA+N +NLFTV K          GRL V+G+Y
Sbjct: 153 KGQTDELSRDYNITWMTAVEMLDDYVYIGADNCYNLFTVLK-------RRVGRLLVIGQY 205

Query: 906 HLGEF 910
           HLG+ 
Sbjct: 206 HLGDL 210



 Score = 53.5 bits (127), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 30/72 (41%), Positives = 42/72 (58%), Gaps = 5/72 (6%)

Query: 1   MLDVPIYGRIATLELFRPHGEAQ-----DFLFIATERYKFCVLQWDAESSELITRAMGDV 55
           MLD P+YGRIAT+ELFRP   A      D    ++ ++    +  + E+  +I R+  D 
Sbjct: 84  MLDAPVYGRIATIELFRPCVLASIKFSPDSRAKSSCKFVNAFVYLNQENRYVICRSGRDA 143

Query: 56  SDRIGRPTDNGQ 67
           SD IGRPT+ GQ
Sbjct: 144 SDHIGRPTNKGQ 155


>gi|156339124|ref|XP_001620088.1| hypothetical protein NEMVEDRAFT_v1g149219 [Nematostella vectensis]
 gi|156204443|gb|EDO27988.1| predicted protein [Nematostella vectensis]
          Length = 69

 Score = 70.5 bits (171), Expect = 5e-09,   Method: Composition-based stats.
 Identities = 34/67 (50%), Positives = 45/67 (67%), Gaps = 9/67 (13%)

Query: 979  WRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV---------SVEELCKRV 1029
            WRSF+NE+K   A  F+DGDLIESFLDL R RM+E+   + +         +V++L K V
Sbjct: 3    WRSFSNERKMEPAHGFIDGDLIESFLDLPRARMEEVVTGLQIDDGGMKKECTVDDLVKTV 62

Query: 1030 EELTRLH 1036
            EELTR+H
Sbjct: 63   EELTRIH 69


>gi|303321596|ref|XP_003070792.1| CPSF A subunit region family protein [Coccidioides posadasii C735
            delta SOWgp]
 gi|240110489|gb|EER28647.1| CPSF A subunit region family protein [Coccidioides posadasii C735
            delta SOWgp]
          Length = 1394

 Score = 70.5 bits (171), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 87/372 (23%), Positives = 162/372 (43%), Gaps = 37/372 (9%)

Query: 694  VRLLDDQTFEFISTYPLDTFEYGCSILSCSF-----SDDSNVYYCVGTAYVLPEENEPTK 748
            ++LL  +T+  + +Y L   E    + + +      + +      VGTA V  E+  P +
Sbjct: 1022 IKLLSPRTWSVVDSYELGDAERVMCMKTINMEISEITHEMKDMLVVGTATVRGEDITP-R 1080

Query: 749  GRILVF-IVE----------DGKLQLIAEKETKGAVYSLNAFNGK--LLAAINQKIQLYK 795
            G I VF I+E          + KL++ A+ + KGAV +++   G+  L+ A  QK  +  
Sbjct: 1081 GSIYVFEIIEVAPDPDRPETNRKLKIFAKDDVKGAVTAVSGIGGQGFLIMAQGQKCMVRG 1140

Query: 796  WMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARD 855
              L++DG+    +      ++  L         ++GD +K I    Y  E   +    +D
Sbjct: 1141 --LKEDGSLLPVAFMDMQCYVKVLKELQGTGLCIMGDALKGIWFAGYSEEPYRLTLFGKD 1198

Query: 856  YNANWMSAVEILDDD--IY-LGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVN 912
                 + A + L D   +Y L A+++  +  +  + E  T  +  RL     +H+G F +
Sbjct: 1199 NEYLQVIAADFLPDGKRLYILVADDDCTIHVLEYDPEDPTSSKGDRLLHRSSFHMGHFTS 1258

Query: 913  RF----RHGSLVMR----LPDSDVGQIPT---VIFGTVNGVIGVIASLPHEQYLFLEKLQ 961
                  +H S          D DV  +P    V+  +  G IGV+  L  + Y  L  LQ
Sbjct: 1259 TMTLLPQHSSSPSADDPGEDDMDVDYVPKSYQVLVTSQEGSIGVVTPLTEDSYRRLSALQ 1318

Query: 962  TNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
            + L   ++   GLN + +R+   E      +  +DG+L+  +LD+   R  EI+  +   
Sbjct: 1319 SQLVTSMEHPCGLNPKAYRAV--ESDGFGGRGIVDGNLLLRWLDMGVQRKAEIAGRVGAD 1376

Query: 1022 VEELCKRVEELT 1033
            +E +   +E+++
Sbjct: 1377 IESIRVDLEKIS 1388


>gi|320040273|gb|EFW22206.1| hypothetical protein CPSG_00105 [Coccidioides posadasii str.
            Silveira]
          Length = 1387

 Score = 70.1 bits (170), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 87/372 (23%), Positives = 162/372 (43%), Gaps = 37/372 (9%)

Query: 694  VRLLDDQTFEFISTYPLDTFEYGCSILSCSF-----SDDSNVYYCVGTAYVLPEENEPTK 748
            ++LL  +T+  + +Y L   E    + + +      + +      VGTA V  E+  P +
Sbjct: 1015 IKLLSPRTWSVVDSYELGDAERVMCMKTINMEISEITHEMKDMLVVGTATVRGEDITP-R 1073

Query: 749  GRILVF-IVE----------DGKLQLIAEKETKGAVYSLNAFNGK--LLAAINQKIQLYK 795
            G I VF I+E          + KL++ A+ + KGAV +++   G+  L+ A  QK  +  
Sbjct: 1074 GSIYVFEIIEVAPDPDRPETNRKLKIFAKDDVKGAVTAVSGIGGQGFLIMAQGQKCMVRG 1133

Query: 796  WMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARD 855
              L++DG+    +      ++  L         ++GD +K I    Y  E   +    +D
Sbjct: 1134 --LKEDGSLLPVAFMDMQCYVKVLKELQGTGLCIMGDALKGIWFAGYSEEPYRLTLFGKD 1191

Query: 856  YNANWMSAVEILDDD--IY-LGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVN 912
                 + A + L D   +Y L A+++  +  +  + E  T  +  RL     +H+G F +
Sbjct: 1192 NEYLQVIAADFLPDGKRLYILVADDDCTIHVLEYDPEDPTSSKGDRLLHRSSFHMGHFTS 1251

Query: 913  RF----RHGSLVMR----LPDSDVGQIPT---VIFGTVNGVIGVIASLPHEQYLFLEKLQ 961
                  +H S          D DV  +P    V+  +  G IGV+  L  + Y  L  LQ
Sbjct: 1252 TMTLLPQHSSSPSADDPGEDDMDVDYVPKSYQVLVTSQEGSIGVVTPLTEDSYRRLSALQ 1311

Query: 962  TNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
            + L   ++   GLN + +R+   E      +  +DG+L+  +LD+   R  EI+  +   
Sbjct: 1312 SQLVTSMEHPCGLNPKAYRAV--ESDGFGGRGIVDGNLLLRWLDMGVQRKAEIAGRVGAD 1369

Query: 1022 VEELCKRVEELT 1033
            +E +   +E+++
Sbjct: 1370 IESIRVDLEKIS 1381


>gi|238880502|gb|EEQ44140.1| conserved hypothetical protein [Candida albicans WO-1]
          Length = 1168

 Score = 70.1 bits (170), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 221/1100 (20%), Positives = 432/1100 (39%), Gaps = 186/1100 (16%)

Query: 24   DFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDNGQIGIIDPD--CRLIGLH 81
            D + I ++     +LQ+D ++ + I++    ++         G+   IDP+  C L+   
Sbjct: 114  DGVVITSDSGNLSILQYDNKTKKFISKIQEPMTKNGWGRNYVGENLAIDPENRCILVAAM 173

Query: 82   LYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKF----------LYGC-----AKPTIV 126
              + LF  I  ++ G  + +  +     QVL +K           L+G       K  I+
Sbjct: 174  EKNKLFYKIESNSSGSKELSSPLEAHSKQVLCLKIVALNTDHNNPLFGALELTPEKKCII 233

Query: 127  VLYQDNKDARHVKTYEVALKDKDFVEGPWSQNN--LDNGADLLIPVPPPLCGVLIIGEET 184
              Y+ ++   HV      +K K     P S N+  L N  + LIP+P  + G+++ G   
Sbjct: 234  NYYELDQGLNHV------VKKK-----PNSSNSDPLPNDVNYLIPLPGHIGGMVVCGTNW 282

Query: 185  IVYCSANA---FKAIPIRPSITKAYGRVD-----ADGSRY--LLGDHAGLLHLLVITHE- 233
              Y   +    +  +P R   T+    V+         ++  LL +  G L  L + ++ 
Sbjct: 283  CFYDKLDGPRIYLPLPRRNGQTQDSIIVNHVTHVLKKKKFFILLQNALGDLFKLTVDYDF 342

Query: 234  -KEKVTGLKIELLGETSIASTISYLDNAVVY---------------IGSSYGDSQLI--- 274
             KE +  + I        A +++   N  ++               +G    + +L+   
Sbjct: 343  DKEIIKNISITYFDTIPPALSLNIFKNGFLFANVLNNDKLLYQFEKLGDDLTEGELVINS 402

Query: 275  ----KLNLQPDAKGSY----------VEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSG 320
                 LN   ++  S+          ++VLE    L PI D  ++D       ++VT S 
Sbjct: 403  SDYESLNSVRESVTSFKLKGLDNLALIDVLE---TLSPITDSKIID------SKLVTLSS 453

Query: 321  AYKDGSLRIVRNGIGINEQASVELQ-GIKGMWSLRSSTDDPFDTFLVVSFISETRILAMN 379
                  ++ + +G+         L      +++ + S +   D +LV+S    ++ L ++
Sbjct: 454  H---SYVKSITHGVPTTTLVESPLPITPTDIFTTKLSLESANDEYLVISSSLSSKTLVLS 510

Query: 380  LEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGS---VRLVSSTSRELRNEWKSPPG 436
            + + +E+ E   F     T+         +VQ+ S     VR+V+   +    +W  P G
Sbjct: 511  IGEVVEDVEDSEFVLDQPTIAVQQVGIASVVQIYSNGIKHVRMVNGNKK--TTDWFPPAG 568

Query: 437  YSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEI-SCLDINPIGENPSYS 495
             ++  AT N  QVL+A     +VY EI D    ++   Q   EI + +    I EN S  
Sbjct: 569  ITITHATTNNQQVLIALSNLSVVYFEI-DATDDQLIEYQDRLEIATTITAMAIQENISEK 627

Query: 496  QIAAVGMWTDISVRIFSLPDLNLITKEHL----GGEIIPRSVLLCAFEGISYLLCALGDG 551
               A+   +D ++++ SL + N +  + L          + +     E  +++   + +G
Sbjct: 628  SPFAIIGCSDETIQVVSLQEHNCLEIKSLQALSANSSSLKMLKSSGKE--THVHIGMENG 685

Query: 552  HLLNFLLNMKTGELTDRKKVSLGTQPITLRT--FSSKNTTHVFAASDRPTVIYSSNKKLL 609
                  ++   G L++ +   +G++P++L    FSS+    + A S  P + Y       
Sbjct: 686  VYARIKIDTINGNLSNSRVKYIGSKPVSLSVIKFSSE-IEGILAISSAPWISYLYRDSFK 744

Query: 610  YSNVNLKEVSHMCPFNSAAF-PDSLAIAKEGELTIGTI-------DDIQKLHIRSIPLGE 661
             + +   ++++   F S     + +   K+  L I ++       D  Q L I +  L  
Sbjct: 745  ITPLLEIDITNGSSFISEDIGGEGIVGIKDNNLIIFSVGKEDSVFDPSQDLTIVTTKLRY 804

Query: 662  HPRRICHQEQSRTFAI-----------CSLKNQSCAEESEMHF---------------VR 695
             PR++     +R F             C++         E ++               ++
Sbjct: 805  TPRKMI-TYGNRLFISESEYNVQGPFKCNINGDVKENVDEDYYGAFGYEWKQNSWASCIQ 863

Query: 696  LLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSN-----VYYCVGTA---YVLPEENEPT 747
            ++D ++ + I +  LD  E   S+ + SF+  S       +  VG      +LP  N   
Sbjct: 864  VVDSKSNQVIQSLQLDGNESIVSMSAVSFNKTSTPSVPASHLVVGVCTNQTILP--NSYD 921

Query: 748  KGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQ 807
            K  +  F +    LQL+ + E       L  F  KLL A    I+LY     D G ++L 
Sbjct: 922  KSYLYTFKIGKKHLQLVHKTELDHIPQVLENFQDKLLVASGNHIRLY-----DIGQKQLL 976

Query: 808  SEC----GHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIE--ERARDYNANWM 861
             +         +I  +  QT  + I++ D  KS S++  K +E   +    A D     +
Sbjct: 977  KKSTTIIDFSTNINKIIPQT--NRIIICDSHKS-SIVFAKFDESQNQFVPFADDVMKRQI 1033

Query: 862  SAVEILDDDIYLGAENNFNLFTVR----------------KNSEGATDEERGRLEVVGEY 905
            +++  LD D  +G +   N+F  R                K  +G  +    +L+ + E+
Sbjct: 1034 TSIMNLDIDTLIGGDKFGNIFVTRIDEDISKQADDDWTILKTQDGILNSCPYKLQNLIEF 1093

Query: 906  HLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYL-FLEKLQTNL 964
            H+G+ +  F  G L       ++    +VI+  + G IG++  L  +  +  L  LQ  +
Sbjct: 1094 HIGDIITSFNLGCL-------NLAGTESVIYTGLQGTIGLLIPLVSKSEVELLFNLQLYM 1146

Query: 965  RKVIKGVGGLNHEQWRSFNN 984
            ++    + G +H + RS+ N
Sbjct: 1147 QQSQNNLVGKDHLKLRSYYN 1166


>gi|448114553|ref|XP_004202604.1| Piso0_001448 [Millerozyma farinosa CBS 7064]
 gi|359383472|emb|CCE79388.1| Piso0_001448 [Millerozyma farinosa CBS 7064]
          Length = 1248

 Score = 70.1 bits (170), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 133/677 (19%), Positives = 267/677 (39%), Gaps = 97/677 (14%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
           ++  I +LE  R  G  +D+L I ++  K  +LQ+D   +  IT      S    R    
Sbjct: 66  VFSVIQSLEKIRLMGSQKDYLVITSDSGKLAILQYDTGRNRFITMFQEPHSKTGFRRNTP 125

Query: 66  GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YGC 120
           G   + DP  R  LIG    + L   +  D+KG ++ +  +  +    + +       G 
Sbjct: 126 GAYLLTDPQNRAILIGALERNKLIYKVHSDDKGGMQISSPLESQTRHTITLAMCALDTGY 185

Query: 121 AKPTIVVLYQDNK--DARH------------VKTYEVALKDKDFVEGPWSQNNLDNGADL 166
             P  V +  D    D++             + +YE+  +  + V      N L   A  
Sbjct: 186 ENPVFVAIEADYGALDSKEHSIDPQAHQSLLLTSYELD-QGLNHVVRRVVNNKLPMTATH 244

Query: 167 LIPVPPPLCGVLIIGEETIVYCS---ANAFKAIPIRP----SITKAYGRVDADGSRYL-- 217
           LIP+P P  GVLI  E   +Y        +   PIR     ++   +       S YL  
Sbjct: 245 LIPLPSPAGGVLICCESYAIYEPFEYKRLYLPFPIRQDRENTVIINHVSQKFKKSNYLIL 304

Query: 218 ----LGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDS-- 271
               LGD   L   L    E+E++  + +       ++++++ L +  ++  +++G+   
Sbjct: 305 AQTQLGD--CLKFSLHYNDEREQLDDITVSYFDTIPVSNSLNILRSGFLFANTAHGNKLF 362

Query: 272 -QLIKL--------------------------NLQPDAKG----SYVEVLERYVNLGPIV 300
            Q  KL                          + +P  KG    + V+V++   +L P+ 
Sbjct: 363 YQFEKLGEEGNDLTLRCSDHYEQVTTIDHSKRDFKP--KGLENLALVDVMD---SLNPVT 417

Query: 301 DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGM-WSLRSSTDD 359
           D  +++         +          ++I+ +G+ +N+  +  L     + ++ +   + 
Sbjct: 418 DATLLETRTSESPDPLINLFTLSSRFIKILTHGMPVNQLVTSPLPIHPSLIFATKKFNNS 477

Query: 360 PFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRL 419
             D +LV+S    ++ L +++ + +EE     F +    +       + +VQV S  +R 
Sbjct: 478 VNDDYLVISSELSSQTLVLSIGEVVEEVSDSEFVTDQPAIHVQQTGNSSIVQVFSNGIRS 537

Query: 420 VSSTSRELRN---------EWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTE 470
           +    R +RN         +W  P G SV  A+ N++Q+++      + Y EI D +  +
Sbjct: 538 I----RHIRNGDEEIKKTTDWYPPAGISVTKASGNSTQLIIGMSNREVCYFEI-DPVDDQ 592

Query: 471 VKHAQLEYEIS----CLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGG 526
           +   Q   E+S            ++P      A+   +D ++++ SL   N ++   L  
Sbjct: 593 LVEYQERLEVSGGSISSLAILSSKDPEQRSAHAIIGSSDETIQVVSLKRHNCLSVVALQA 652

Query: 527 -EIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTF-- 583
                 S+L+   E   Y+   + +G  +  +++  +G+L+D +   LG++ + L     
Sbjct: 653 LSSKSTSILMLRHEDSIYVHIGMENGLYVCTMIDEVSGKLSDTRIKFLGSKSVRLSPIKV 712

Query: 584 --SSKNTTHVFAASDRP 598
             S+     + A S RP
Sbjct: 713 PGSAGEEDGILALSSRP 729



 Score = 53.9 bits (128), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 69/305 (22%), Positives = 129/305 (42%), Gaps = 50/305 (16%)

Query: 761  LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLY----KWMLRDDGTRELQSECGHHGHI 816
            L  I + E++ A +++  FN +LL      ++LY    K +LR     +  S   +  +I
Sbjct: 955  LLFIHKTESEFAPFAMIEFNNRLLIGTKNLLRLYDIGHKQLLR-----KASSSIDYFENI 1009

Query: 817  LALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAE 876
            + L     G+ I+  D   S + + Y   E      A D     ++++  LD D  +G +
Sbjct: 1010 IKL-TYMGGNRIMAADANISSTFVKYDQVENQFFPLADDIMKRKITSMCSLDYDTIVGGD 1068

Query: 877  NNFNLFTVR----------------KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLV 920
               N+F  R                ++ +   +    RL+ + E++ G+    F  GSL+
Sbjct: 1069 KFGNIFVSRIPEVLSKQADQDWGLIRHQDSYLNGAVSRLKNLCEFYSGDIPTSFSKGSLI 1128

Query: 921  MRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIK----------- 969
            +    S+     T + GTV  +I ++     E   F+E L+  LR               
Sbjct: 1129 L---GSEESIFYTGLMGTVGALIPLVTK--SEVQFFIE-LEAELRGYFDYNFDNFDEQKN 1182

Query: 970  --GVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCK 1027
               + G +H + RS+ N       KN +DGDLIE F ++S      I+  ++ + +++ K
Sbjct: 1183 GYNLLGKDHLKHRSYYNP-----VKNVIDGDLIERFSEVSYNNKIRIANKLDRTPKDIDK 1237

Query: 1028 RVEEL 1032
            ++ E+
Sbjct: 1238 KISEI 1242


>gi|5817204|emb|CAB53699.1| hypothetical protein [Homo sapiens]
          Length = 215

 Score = 70.1 bits (170), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 56/220 (25%), Positives = 102/220 (46%), Gaps = 31/220 (14%)

Query: 832  DLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR--KNSE 889
            D+ +S   + YK  E  +   A D    W++   +LD D   GA+   N+  VR   N+ 
Sbjct: 2    DVQESFIWVRYKRNENQLIIFADDTYPRWVTTASLLDYDTVAGADKFGNICVVRLPPNTN 61

Query: 890  GATDEE---------RGRL-------EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPT 933
               DE+         RG L       EV+  YH+GE V   +  +L+        G   +
Sbjct: 62   DEVDEDPTGNKALWDRGLLNGASQKAEVIMNYHVGETVLSLQKTTLI-------PGGSES 114

Query: 934  VIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAK 992
            +++ T++G IG++     HE + F + ++ +LR     + G +H  +RS+         K
Sbjct: 115  LVYTTLSGGIGILVPFTSHEDHDFFQHVEMHLRSEHPPLCGRDHLSFRSY-----YFPVK 169

Query: 993  NFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            N +DGDL E F  +   +   +S+ ++ +  E+ K++E++
Sbjct: 170  NVIDGDLCEQFNSMEPNKQKNVSEELDRTPPEVSKKLEDI 209


>gi|393907594|gb|EJD74706.1| hypothetical protein LOAG_18016 [Loa loa]
          Length = 398

 Score = 70.1 bits (170), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 67/306 (21%), Positives = 145/306 (47%), Gaps = 25/306 (8%)

Query: 741  PEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRD 800
            PE  +PT            +++ + +KE KG V SL + NG LL  + QK+  + W+ +D
Sbjct: 103  PEPGQPTSKH---------RIKTLYDKEQKGPVTSLCSCNGYLLTGMGQKV--FIWLFKD 151

Query: 801  DGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANW 860
            +  + + S    H ++  L +  R +  +  D+ +S++LL Y+ E  A+   +RD  ++ 
Sbjct: 152  NNLQGI-SFLDMHFYVHQL-IGVR-NLALACDMYRSVALLRYQEEYKALSLASRDMRSDV 208

Query: 861  ---MSAVEILDDDI--YLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRF- 914
               M+A  I+D+    ++ ++   N+       E        +L +  E ++G  VN F 
Sbjct: 209  QPPMAAQFIIDNKQMGFVMSDEAANIAIFNYLPETLESLGGEKLTLRAEINIGTVVNSFI 268

Query: 915  -RHGSLVMRLPDSDVGQI--PTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGV 971
               G +     ++++  +   +V+F +++G  G +  L  + +  L  LQ  +  ++   
Sbjct: 269  RVKGHISSGFVENELFSLERQSVLFASLDGSFGFLRPLTEKVFRRLHMLQQLMSSMVPQP 328

Query: 972  GGLNHEQWRSFNNEKKT--VDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRV 1029
             GLN +  R+    +    ++ +N +DGD++  +L LS    +++++ +  S   +   +
Sbjct: 329  AGLNAKGARAARPPRPNHYLNTRNLVDGDMVMQYLHLSLPEKNDLARKLGTSRYHIIDDL 388

Query: 1030 EELTRL 1035
             E+ R+
Sbjct: 389  IEICRV 394


>gi|448111975|ref|XP_004201977.1| Piso0_001448 [Millerozyma farinosa CBS 7064]
 gi|359464966|emb|CCE88671.1| Piso0_001448 [Millerozyma farinosa CBS 7064]
          Length = 1249

 Score = 70.1 bits (170), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 136/678 (20%), Positives = 272/678 (40%), Gaps = 99/678 (14%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
           ++  I +LE  R  G  +D+L + ++  K  +LQ+D   + L+T      S    R    
Sbjct: 66  VFSVIQSLEKVRLTGSQKDYLVVTSDSGKLAILQYDTGRNRLVTVFQEPHSKTGFRRNTP 125

Query: 66  GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGCAKP 123
           G   + DP  R  LIG    + L   +  D+KG ++ +  +   E Q+     L  CA  
Sbjct: 126 GPYLLTDPQNRAILIGALERNKLIYKVHSDDKGGMQISSPL---ESQIRHTITLAMCALD 182

Query: 124 T-----IVVLYQDNKDARHVKTYEV---ALKDKDFVEGPWSQ-----------NNLDNGA 164
           T     + V  +    A   K Y +   A +   F      Q           N L   A
Sbjct: 183 TGYENPVFVAIEAEYGALDSKEYSIDSQAHQTLLFTSYELDQGLNHVVRRVVNNKLPISA 242

Query: 165 DLLIPVPPPLCGVLIIGEETIVYCSAN---AFKAIPIRPS------ITKAYGRVDADGSR 215
             LIP+P P+ GVLI  E   +Y   +    +   PIR        I   + +       
Sbjct: 243 THLIPLPSPVGGVLICCESYAIYERFDYKRLYLPFPIRQGTENTVIINHVFQKFKKSNYL 302

Query: 216 YLLGDHAG--LLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDS-- 271
            L     G  L   L    E+E++  + +       ++++++ L +  ++  +++G+   
Sbjct: 303 ILAQTQLGDCLKFSLHYNDEREQLDDITVSYFDTIPVSNSLNILRSGFLFANTAHGNKLF 362

Query: 272 -QLIKLNLQPD------------------------AKG----SYVEVLERYVNLGPIVDF 302
            Q  KL  + +                         KG    + V+V++   +L P+ D 
Sbjct: 363 YQFEKLGEEDNDLTLRCSDHYEQMTTIDHSKREFKPKGLENLALVDVMD---SLNPVTDA 419

Query: 303 CVVDLERQGQG-----QVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGM-WSLRSS 356
            +++  R  +       +VT S  +    ++I+ +G+ +N+  +  L     + ++ +  
Sbjct: 420 TLLE-TRMSESPDPLINLVTLSSRF----IKILTHGMPVNQLVTSPLPIHPSLIFTTKKF 474

Query: 357 TDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSG- 415
            +   D +LV+S    ++ L +++ + +EE     F +   T+       + +VQV S  
Sbjct: 475 NESVNDDYLVISSELSSQTLVLSIGEVVEEVSDSEFVTNQPTIHVQQTGKSSIVQVFSNG 534

Query: 416 --SVRLVSSTSRELRN--EWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEV 471
             S+R + +   E++   +W  P G SV  A+ N +Q+++      + Y EI D    ++
Sbjct: 535 IRSIRHIKNGDEEIKKTTDWYPPAGISVIKASGNNTQLIIGMSNREVCYFEI-DSADDQL 593

Query: 472 KHAQLEYEIS----CLDINPIGENPSYSQIAAVGMWTDISVRIFSLPD---LNLITKEHL 524
              Q   E+S            ++P      A+   +D ++++ SL     L+++  + L
Sbjct: 594 IEYQERLEVSGGSISSLAILSSKDPEKRSSHAIIGSSDETIQVVSLKRHNCLSIVALQAL 653

Query: 525 GGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTF- 583
             +    S+L+       Y+   + +G  +  +++  +G+L+D +   LG++ + L    
Sbjct: 654 SSK--STSILMINHNDSVYVHIGMENGLYVRTMIDEVSGKLSDTRIKFLGSKSVRLSPIK 711

Query: 584 ---SSKNTTHVFAASDRP 598
              +S     + A S RP
Sbjct: 712 VPGNSGEENGILALSSRP 729



 Score = 55.8 bits (133), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 80/352 (22%), Positives = 145/352 (41%), Gaps = 63/352 (17%)

Query: 725  SDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGK-----------LQLIAEKETKGAV 773
            S DS+ Y  VGT     ++N   K  +  F V   +           L  I + E++ A 
Sbjct: 911  SSDSSSYLIVGTTRNSFDKN--AKHYLYTFKVAGSRNSSHSQKGQKTLLFIHKTESEFAP 968

Query: 774  YSLNAFNGKLLAAINQKIQLY----KWMLRDDGTRELQSECGHHGHILALYVQTRGDFIV 829
             ++  FN +LL      ++LY    K +LR     +  S   +  +I+ L     G+ I+
Sbjct: 969  LAMIEFNNRLLIGTKNLLRLYDIGHKQLLR-----KASSSIDYFENIIKL-AYMGGNRIM 1022

Query: 830  VGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---- 885
              D   S + + Y   E      A D     ++++  LD D  +G +   N+F  R    
Sbjct: 1023 AADASMSSTFVKYDQVENQFFPLADDVMKRKITSMCPLDYDTIVGGDKFGNVFVSRIPEF 1082

Query: 886  ------------KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPT 933
                        ++ +   +    RL+ + E++ G+    F  GSL++    S+     T
Sbjct: 1083 LSKQVDQDWGLIRHQDSYLNGAASRLKNLCEFYSGDIPTSFSKGSLIL---GSEESIFYT 1139

Query: 934  VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVI-------------KGVGGLNHEQWR 980
             + GTV  +I ++    +E   F+E L+  LR                  + G +H + R
Sbjct: 1140 GLMGTVGALIPLVTK--NEVQFFIE-LEAELRSYFDYNFDNFDEQKNGHNLLGKDHLKHR 1196

Query: 981  SFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            S+ N       KN +DGDLIE F ++S      I+  ++ + +++ K++ E+
Sbjct: 1197 SYYNP-----VKNVIDGDLIERFSEVSYNNKIRIANKLDRTPKDIDKKISEI 1243


>gi|71032257|ref|XP_765770.1| hypothetical protein [Theileria parva strain Muguga]
 gi|68352727|gb|EAN33487.1| hypothetical protein TP01_0243 [Theileria parva]
          Length = 1504

 Score = 69.7 bits (169), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 63/274 (22%), Positives = 113/274 (41%), Gaps = 39/274 (14%)

Query: 799  RDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNA 858
            +D+ T  +     +  +   + + T+ D I VGDLM S+ +L  K  +  + E  RD+N 
Sbjct: 1220 QDETTLFIDVVANYDSNTFVVSLDTKDDVIFVGDLMTSVKML--KFRDNRLLETCRDFNT 1277

Query: 859  NWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRF--RH 916
             W +++  +D+   L +++  N    +K     TD++  R +  G +H GE VN+   R 
Sbjct: 1278 LWTTSLAAVDNSSCLVSDDLGNFLLFKKVQHPTTDQQSIRFDKQGLFHHGEVVNKILKRT 1337

Query: 917  GSLVMRLPDSDVGQIPTVIFGTVNGVIGVIAS-LPHE----------------------- 952
               V  + +S + +     F   N V+    S  P E                       
Sbjct: 1338 QMSVQHVANSRMSRSNPREFMVSNRVVSESESNNPSETLNVNEYTNLFKSFFTCATTSGS 1397

Query: 953  -----------QYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIE 1001
                        +L L  L+  +  V K +G +     R+F +    +  K F+DGDL+E
Sbjct: 1398 LLQVCFFDDLNMFLKLSLLEHTMHLVQKDLGNIPSRNQRNFEDLHSNIPTKGFVDGDLVE 1457

Query: 1002 SFLDLSRTRMDEISKTMNVSVEELCKRVEELTRL 1035
            +FL L  +    + +TM ++   L  ++  L  L
Sbjct: 1458 AFLKLPDSLKKWVFETMLINSRHLGVKLSSLESL 1491


>gi|257215708|emb|CAX83006.1| Cleavage and polyadenylation specificity factor subunit 1
           [Schistosoma japonicum]
          Length = 462

 Score = 69.7 bits (169), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 63/213 (29%), Positives = 97/213 (45%), Gaps = 36/213 (16%)

Query: 111 VLDIKFLYGCAKPTIVVLYQD------NKDARHVKTYEVALK---DKDFVEGPWSQNNLD 161
           VLD++FL+G  +PT++VLY+          AR      VAL     K      W Q +L 
Sbjct: 214 VLDMQFLHGFYEPTLLVLYEPIGTWAGRVSARRDTCCIVALSFNLQKRTNPVIWFQESLP 273

Query: 162 NGADLLIPVPPPLCGVLIIGEETIVY-------------CSANAFKAIPIRPSITKAYGR 208
                +IPVP P+ GV+I+   +I+Y             C A      P+R  +    G 
Sbjct: 274 FDCRSVIPVPQPIGGVVIMAANSILYLKQTLPSCSLPLNCYAQISTNFPMRQDVPSC-GP 332

Query: 209 VDADGSR--------YLLGDHAGLLHLLVITHEK--EKVTGLKIELLGETSIASTISYLD 258
           +  DG R        +L+G  +G L+LL +  E+  + VT L    +G       +  L+
Sbjct: 333 LSIDGCRVVTLNETQFLIGTRSGNLYLLSLWLEQATQTVTSLLFHKVGHAVPPHCMVLLE 392

Query: 259 NAVVYIGSSYGDSQLIKLN---LQPDAKGSYVE 288
           +  ++IGS + DS L+K+N   L  DA G  V+
Sbjct: 393 SKYLFIGSRFCDSVLMKINHSLLCVDASGKEVD 425


>gi|148697643|gb|EDL29590.1| cleavage and polyadenylation specific factor 1, isoform CRA_b [Mus
            musculus]
          Length = 1311

 Score = 69.3 bits (168), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 55/212 (25%), Positives = 96/212 (45%), Gaps = 22/212 (10%)

Query: 826  DFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG---AENNFNLF 882
            +FI+  D+MKSISLL Y+ E   +   +RD     + +V+ + D+  LG   ++ + NL 
Sbjct: 1088 NFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLM 1147

Query: 883  TVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPT--------- 933
                  E        RL    ++H+G  VN F       R P     + P+         
Sbjct: 1148 VYMYLPEAKESFGGMRLLRRADFHVGAHVNTF------WRTPCRGAAEGPSKKSVVWENK 1201

Query: 934  --VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVD- 990
                F T++G IG++  +  + Y  L  LQ  L  ++    GLN   +R  + +++ +  
Sbjct: 1202 HITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQN 1261

Query: 991  -AKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
              +N LDG+L+  +L LS     E++K +  +
Sbjct: 1262 AVRNVLDGELLNRYLYLSTMERSELAKKIGTT 1293



 Score = 57.0 bits (136), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 107/551 (19%), Positives = 214/551 (38%), Gaps = 151/551 (27%)

Query: 20  GEAQDFLFIATERYKFCVLQWDAESSELITRAM-----GDVSDRIGRPTDNGQIGIIDPD 74
           G  +D L ++ +  K  V+++D  + +L T ++      ++ D   +     ++ + DPD
Sbjct: 9   GAKRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPELRDGFVQNVHTPRVRV-DPD 67

Query: 75  CRLIGLHLYDGLFKVIPF-------DNKGQLKEA----------FNIR-LEE--LQVLDI 114
            R   + +Y     V+PF       +++G + E            ++R L+E  L ++D+
Sbjct: 68  GRCAAMLIYGTRLVVLPFRRESLAEEHEGLMGEGQRSSFLPSYIIDVRALDEKLLNIIDL 127

Query: 115 KFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDFVEG--P--WSQNNLDNGAD 165
           +FL+G  +PT+++L++ N+      A    T  +     +  +   P  WS  +L     
Sbjct: 128 QFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKVHPVIWSLTSLPFDCT 187

Query: 166 LLIPVPPPLCGVLIIGEETIVYCS-------------ANAFKAIPIRPSITKAYGRVDAD 212
             + VP P+ GV+I    +++Y +                  A P+R   T+   R+  D
Sbjct: 188 QALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFPLR---TQEGVRITLD 244

Query: 213 GS--------RYLLGDHAGLLHLL-VITHEKEKVTGLKIELLGETSIASTISYLDNAVVY 263
            +        + ++    G +++L +IT     V     +    + + +++  ++   ++
Sbjct: 245 CAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLTTSMVTMEPGYLF 304

Query: 264 IGSSYGDSQLIKL--NLQ---------------PDAKGSYVE------------------ 288
           +GS  G+S L+K    LQ               P +K   VE                  
Sbjct: 305 LGSRLGNSLLLKYTEKLQEPPASSVREAADKEEPPSKKKRVEPAVGWTGGKTVPQDEVDE 364

Query: 289 --------------------VLERYVNLGPIVDFCVVD---LERQGQG------QVVTCS 319
                               V +  +N+GP  +  V +   L  + Q       ++V CS
Sbjct: 365 IEVYGSEAQSGTQLATYSFEVCDSMLNIGPCANAAVGEPAFLSEEFQNSPEPDLEIVVCS 424

Query: 320 GAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL------------------------RS 355
           G  K+G+L +++  I      + EL G   MW++                        ++
Sbjct: 425 GYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEEEETPKAESTEQEPSAPKA 484

Query: 356 STDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQ-LVQVTS 414
             D     FL++S    T IL      E+ E +  GF +Q  T+F  +   N+ +VQV+ 
Sbjct: 485 EEDGRRHGFLILSREDSTMILQTG--QEIMELDTSGFATQGPTVFAGNIGDNRYIVQVSP 542

Query: 415 GSVRLVSSTSR 425
             +RL+   ++
Sbjct: 543 LGIRLLEGVNQ 553


>gi|194474008|ref|NP_001124043.1| cleavage and polyadenylation specificity factor subunit 1 [Rattus
            norvegicus]
 gi|149066087|gb|EDM15960.1| cleavage and polyadenylation specific factor 1, 160kDa (predicted),
            isoform CRA_a [Rattus norvegicus]
          Length = 1386

 Score = 69.3 bits (168), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 55/212 (25%), Positives = 96/212 (45%), Gaps = 22/212 (10%)

Query: 826  DFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG---AENNFNLF 882
            +FI+  D+MKSISLL Y+ E   +   +RD     + +V+ + D+  LG   ++ + NL 
Sbjct: 1163 NFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLM 1222

Query: 883  TVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPT--------- 933
                  E        RL    ++H+G  VN F       R P     + P+         
Sbjct: 1223 VYMYLPEAKESFGGMRLLRRADFHVGAHVNTF------WRTPCRGAAEGPSKKSVMWENK 1276

Query: 934  --VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVD- 990
                F T++G IG++  +  + Y  L  LQ  L  ++    GLN   +R  + +++ +  
Sbjct: 1277 HITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQN 1336

Query: 991  -AKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
              +N LDG+L+  +L LS     E++K +  +
Sbjct: 1337 AVRNVLDGELLNRYLYLSTMERSELAKKIGTT 1368



 Score = 59.3 bits (142), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 106/561 (18%), Positives = 218/561 (38%), Gaps = 147/561 (26%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAM-----GDVSDRIG 60
            +G + ++   +  G  +D L ++ +  K  V+++D  + +L T ++      ++ D   
Sbjct: 74  FFGNVMSMASVQLAGAKRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPELRDGFV 133

Query: 61  RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPF-------DNKGQLKEA----------FN 103
           +     ++ + DPD R   + +Y     V+PF       +++G + E            +
Sbjct: 134 QNVHTPRVRV-DPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLMGEGQRSSFLPSYIID 192

Query: 104 IR-LEE--LQVLDIKFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDFVEG-- 153
           +R L+E  L ++D++FL+G  +PT+++L++ N+      A    T  +     +  +   
Sbjct: 193 VRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKVH 252

Query: 154 P--WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAFKAIPI 198
           P  WS  +L       + VP P+ GV+I    +++Y +                  A P+
Sbjct: 253 PVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFPL 312

Query: 199 RPSITKAYGRVDADGS--------RYLLGDHAGLLHLL-VITHEKEKVTGLKIELLGETS 249
           R   T+   R+  D +        + ++    G +++L +IT     V     +    + 
Sbjct: 313 R---TQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASV 369

Query: 250 IASTISYLDNAVVYIGSSYGDSQLIKL--NLQ---------------PDAKGSYV----- 287
           + +++  ++   +++GS  G+S L+K    LQ               P +K   V     
Sbjct: 370 LTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASSVREAADKEEPPSKKKRVDPTVG 429

Query: 288 -------------------------------EVLERYVNLGPIVD-------FCVVDLER 309
                                          EV +  +N+GP  +       F   +   
Sbjct: 430 WTGGKTQDEVDEIEVYGSEAQSGTQLATYSFEVCDSMLNIGPCANAAVGEPAFLSEENSP 489

Query: 310 QGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL---------------- 353
           +   ++V CSG  K+G+L +++  I      + EL G   MW++                
Sbjct: 490 EPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEEEETPKAES 549

Query: 354 --------RSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAI 405
                   ++  D     FL++S    T IL      E+ E +  GF +Q  T+F  +  
Sbjct: 550 TEQEPSAPKAEEDGRRHGFLILSREDSTMILQTG--QEIMELDTSGFATQGPTVFAGNIG 607

Query: 406 YNQ-LVQVTSGSVRLVSSTSR 425
            N+ +VQV+   +RL+   ++
Sbjct: 608 DNRYIVQVSPLGIRLLEGVNQ 628


>gi|148697642|gb|EDL29589.1| cleavage and polyadenylation specific factor 1, isoform CRA_a [Mus
            musculus]
          Length = 1417

 Score = 69.3 bits (168), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 55/212 (25%), Positives = 96/212 (45%), Gaps = 22/212 (10%)

Query: 826  DFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG---AENNFNLF 882
            +FI+  D+MKSISLL Y+ E   +   +RD     + +V+ + D+  LG   ++ + NL 
Sbjct: 1194 NFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLM 1253

Query: 883  TVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPT--------- 933
                  E        RL    ++H+G  VN F       R P     + P+         
Sbjct: 1254 VYMYLPEAKESFGGMRLLRRADFHVGAHVNTF------WRTPCRGAAEGPSKKSVVWENK 1307

Query: 934  --VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVD- 990
                F T++G IG++  +  + Y  L  LQ  L  ++    GLN   +R  + +++ +  
Sbjct: 1308 HITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQN 1367

Query: 991  -AKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
              +N LDG+L+  +L LS     E++K +  +
Sbjct: 1368 AVRNVLDGELLNRYLYLSTMERSELAKKIGTT 1399



 Score = 58.5 bits (140), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 108/560 (19%), Positives = 218/560 (38%), Gaps = 151/560 (26%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAM-----GDVSDRIG 60
            +G + ++   +  G  +D L ++ +  K  V+++D  + +L T ++      ++ D   
Sbjct: 101 FFGNVMSMASVQLAGAKRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPELRDGFV 160

Query: 61  RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPF-------DNKGQLKEA----------FN 103
           +     ++ + DPD R   + +Y     V+PF       +++G + E            +
Sbjct: 161 QNVHTPRVRV-DPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLMGEGQRSSFLPSYIID 219

Query: 104 IR-LEE--LQVLDIKFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDFVEG-- 153
           +R L+E  L ++D++FL+G  +PT+++L++ N+      A    T  +     +  +   
Sbjct: 220 VRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKVH 279

Query: 154 P--WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAFKAIPI 198
           P  WS  +L       + VP P+ GV+I    +++Y +                  A P+
Sbjct: 280 PVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFPL 339

Query: 199 RPSITKAYGRVDADGS--------RYLLGDHAGLLHLL-VITHEKEKVTGLKIELLGETS 249
           R   T+   R+  D +        + ++    G +++L +IT     V     +    + 
Sbjct: 340 R---TQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASV 396

Query: 250 IASTISYLDNAVVYIGSSYGDSQLIKL--NLQ---------------PDAKGSYVE---- 288
           + +++  ++   +++GS  G+S L+K    LQ               P +K   VE    
Sbjct: 397 LTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASSVREAADKEEPPSKKKRVEPAVG 456

Query: 289 ----------------------------------VLERYVNLGPIVDFCVVD---LERQG 311
                                             V +  +N+GP  +  V +   L  + 
Sbjct: 457 WTGGKTVPQDEVDEIEVYGSEAQSGTQLATYSFEVCDSMLNIGPCANAAVGEPAFLSEEF 516

Query: 312 QG------QVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL------------ 353
           Q       ++V CSG  K+G+L +++  I      + EL G   MW++            
Sbjct: 517 QNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEEEETP 576

Query: 354 ------------RSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFC 401
                       ++  D     FL++S    T IL      E+ E +  GF +Q  T+F 
Sbjct: 577 KAESTEQEPSAPKAEEDGRRHGFLILSREDSTMILQTG--QEIMELDTSGFATQGPTVFA 634

Query: 402 HDAIYNQ-LVQVTSGSVRLV 420
            +   N+ +VQV+   +RL+
Sbjct: 635 GNIGDNRYIVQVSPLGIRLL 654


>gi|221116994|ref|XP_002166173.1| PREDICTED: DNA damage-binding protein 1-like, partial [Hydra
           magnipapillata]
          Length = 107

 Score = 69.3 bits (168), Expect = 1e-08,   Method: Composition-based stats.
 Identities = 28/54 (51%), Positives = 40/54 (74%)

Query: 2   LDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDV 55
            D+ +YGR+A ++LFRP  E QD LFI TERY+  +L + AE+ ++IT+A GDV
Sbjct: 53  FDLNVYGRVAVMQLFRPQNENQDLLFILTERYRTAILAYKAETGDIITKAYGDV 106


>gi|354491124|ref|XP_003507706.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
            isoform 2 [Cricetulus griseus]
          Length = 1388

 Score = 69.3 bits (168), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 55/212 (25%), Positives = 96/212 (45%), Gaps = 22/212 (10%)

Query: 826  DFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG---AENNFNLF 882
            +FI+  D+MKSISLL Y+ E   +   +RD     + +V+ + D+  LG   ++ + NL 
Sbjct: 1165 NFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLM 1224

Query: 883  TVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPT--------- 933
                  E        RL    ++H+G  VN F       R P     + P+         
Sbjct: 1225 VYMYLPEAKESFGGMRLLRRADFHVGAHVNTF------WRTPCRGAAEGPSKKSVMWENK 1278

Query: 934  --VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVD- 990
                F T++G IG++  +  + Y  L  LQ  L  ++    GLN   +R  + +++ +  
Sbjct: 1279 HITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQN 1338

Query: 991  -AKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
              +N LDG+L+  +L LS     E++K +  +
Sbjct: 1339 AVRNVLDGELLNRYLYLSTMERSELAKKIGTT 1370



 Score = 57.4 bits (137), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 106/563 (18%), Positives = 219/563 (38%), Gaps = 149/563 (26%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAM-----GDVSDRIG 60
            +G + ++   +  G  +D L ++ +  K  V+++D  + +L T ++      ++ D   
Sbjct: 74  FFGNVMSMASVQLAGAKRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEESELRDGFV 133

Query: 61  RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPF-------DNKGQLKEA----------FN 103
           +     ++ + DPD R   + +Y     V+PF       +++G + E            +
Sbjct: 134 QNVHTPRVRV-DPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLMGEGQRSSFLPSYIID 192

Query: 104 IR-LEE--LQVLDIKFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDFVEG-- 153
           +R L+E  L ++D++FL+G  +PT+++L++ N+      A    T  +     +  +   
Sbjct: 193 VRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKVH 252

Query: 154 P--WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAFKAIPI 198
           P  WS  +L       + VP P+ GV+I    +++Y +                  A P+
Sbjct: 253 PVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFPL 312

Query: 199 RPSITKAYGRVDADGS--------RYLLGDHAGLLHLL-VITHEKEKVTGLKIELLGETS 249
           R   T+   R+  D +        + ++    G +++L +IT     V     +    + 
Sbjct: 313 R---TQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASV 369

Query: 250 IASTISYLDNAVVYIGSSYGDSQLIKLN--LQ---------------PDAKGSYVE---- 288
           + +++  ++   +++GS  G+S L+K    LQ               P +K   V+    
Sbjct: 370 LTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASSVREAADKEEPPSKKKRVDPTAG 429

Query: 289 ----------------------------------VLERYVNLGPIVDFCVVD---LERQG 311
                                             V +  +N+GP  +  V +   L  + 
Sbjct: 430 WTGGKTVPQDEVDEIEVYGSEAQSGTQLATYSFEVCDSMLNIGPCANAAVGEPAFLSEEN 489

Query: 312 QGQ----VVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL-------------- 353
             +    +V CSG  K+G+L +++  I      + EL G   MW++              
Sbjct: 490 SPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEEEEAPRA 549

Query: 354 ----------RSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHD 403
                     ++  D     FL++S    T IL      E+ E +  GF +Q  T+F  +
Sbjct: 550 ESTEQESTTPKAEEDGRRHGFLILSREDSTMILQTG--QEIMELDTSGFATQGPTVFAGN 607

Query: 404 AIYNQ-LVQVTSGSVRLVSSTSR 425
              N+ +VQV+   +RL+   ++
Sbjct: 608 IGDNRYIVQVSPLGIRLLEGVNQ 630


>gi|148697644|gb|EDL29591.1| cleavage and polyadenylation specific factor 1, isoform CRA_c [Mus
            musculus]
          Length = 1388

 Score = 69.3 bits (168), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 55/212 (25%), Positives = 96/212 (45%), Gaps = 22/212 (10%)

Query: 826  DFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG---AENNFNLF 882
            +FI+  D+MKSISLL Y+ E   +   +RD     + +V+ + D+  LG   ++ + NL 
Sbjct: 1165 NFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLM 1224

Query: 883  TVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPT--------- 933
                  E        RL    ++H+G  VN F       R P     + P+         
Sbjct: 1225 VYMYLPEAKESFGGMRLLRRADFHVGAHVNTF------WRTPCRGAAEGPSKKSVVWENK 1278

Query: 934  --VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVD- 990
                F T++G IG++  +  + Y  L  LQ  L  ++    GLN   +R  + +++ +  
Sbjct: 1279 HITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQN 1338

Query: 991  -AKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
              +N LDG+L+  +L LS     E++K +  +
Sbjct: 1339 AVRNVLDGELLNRYLYLSTMERSELAKKIGTT 1370



 Score = 58.2 bits (139), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 106/563 (18%), Positives = 218/563 (38%), Gaps = 149/563 (26%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAM-----GDVSDRIG 60
            +G + ++   +  G  +D L ++ +  K  V+++D  + +L T ++      ++ D   
Sbjct: 74  FFGNVMSMASVQLAGAKRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPELRDGFV 133

Query: 61  RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPF-------DNKGQLKEA----------FN 103
           +     ++ + DPD R   + +Y     V+PF       +++G + E            +
Sbjct: 134 QNVHTPRVRV-DPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLMGEGQRSSFLPSYIID 192

Query: 104 IR-LEE--LQVLDIKFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDFVEG-- 153
           +R L+E  L ++D++FL+G  +PT+++L++ N+      A    T  +     +  +   
Sbjct: 193 VRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKVH 252

Query: 154 P--WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAFKAIPI 198
           P  WS  +L       + VP P+ GV+I    +++Y +                  A P+
Sbjct: 253 PVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFPL 312

Query: 199 RPSITKAYGRVDADGS--------RYLLGDHAGLLHLL-VITHEKEKVTGLKIELLGETS 249
           R   T+   R+  D +        + ++    G +++L +IT     V     +    + 
Sbjct: 313 R---TQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASV 369

Query: 250 IASTISYLDNAVVYIGSSYGDSQLIKLN--LQ---------------PDAKGSYVE---- 288
           + +++  ++   +++GS  G+S L+K    LQ               P +K   VE    
Sbjct: 370 LTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASSVREAADKEEPPSKKKRVEPAVG 429

Query: 289 ----------------------------------VLERYVNLGPIVD-------FCVVDL 307
                                             V +  +N+GP  +       F   + 
Sbjct: 430 WTGGKTVPQDEVDEIEVYGSEAQSGTQLATYSFEVCDSMLNIGPCANAAVGEPAFLSEEN 489

Query: 308 ERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL-------------- 353
             +   ++V CSG  K+G+L +++  I      + EL G   MW++              
Sbjct: 490 SPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEEEETPKA 549

Query: 354 ----------RSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHD 403
                     ++  D     FL++S    T IL      E+ E +  GF +Q  T+F  +
Sbjct: 550 ESTEQEPSAPKAEEDGRRHGFLILSREDSTMILQTG--QEIMELDTSGFATQGPTVFAGN 607

Query: 404 AIYNQ-LVQVTSGSVRLVSSTSR 425
              N+ +VQV+   +RL+   ++
Sbjct: 608 IGDNRYIVQVSPLGIRLLEGVNQ 630


>gi|412986884|emb|CCO15310.1| predicted protein [Bathycoccus prasinos]
          Length = 1595

 Score = 68.9 bits (167), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 84/346 (24%), Positives = 145/346 (41%), Gaps = 53/346 (15%)

Query: 694  VRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSN----VYYCVGTAYVLPE-ENEPTK 748
            +RLL   T E      ++  E+   + +    D S         +GTA  +P  E+ P +
Sbjct: 1230 IRLLVPGTLETAWQKAVEPGEHILCVKNVQIRDQSTGALLSMLAIGTA--MPGGEDTPCR 1287

Query: 749  GRILVFIV-----EDGKL----QLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLR 799
            GRIL+F +      DG +    +L  EK +K A  ++ + +G  + AI  K+  + W   
Sbjct: 1288 GRILLFAIMWERARDGGVRWRGELKCEKPSKMACSAIESVDGTFMVAIGTKLTAHSW--- 1344

Query: 800  DDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEG--AIEERARDYN 857
             DG + L     +   +    +    +F++ GDL KSI  + +K  +G   + +  +DY 
Sbjct: 1345 -DG-KHLNPIAFYDTPLYTTTLCCVKNFLLCGDLHKSIRFVRWKDSQGEKTLSQLGKDYE 1402

Query: 858  ANWMSAVEILDDD---IYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVN-- 912
                 A E + D      L A+ N N    +   + A   +  +L     YH G  +   
Sbjct: 1403 VLDCIASEFMIDGGTLSLLAADANGNAHVFQYAPKLAESWKGDKLLPKSAYHAGSLIRKM 1462

Query: 913  -RFRHGSLVMRLPDSDVGQIP----TVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKV 967
             RF+ G          VG+       V FG+ +G +G+ + +    +L LEKLQ  +R  
Sbjct: 1463 VRFQIG----------VGEQKQNRHAVFFGSSDGGLGIFSPVDEHTFLNLEKLQDAMRSN 1512

Query: 968  IKGVG------GLNHEQWRSFNNEKKTV----DAKNFLDGDLIESF 1003
            I          GLN + +R+  + + +V      +  +DG L+  F
Sbjct: 1513 IVASSNSINPLGLNSKTYRALKSSEGSVARQTPPRTIVDGGLLSKF 1558



 Score = 42.7 bits (99), Expect = 0.98,   Method: Compositional matrix adjust.
 Identities = 39/147 (26%), Positives = 64/147 (43%), Gaps = 23/147 (15%)

Query: 296 LGPIVDFCV-----VDLERQGQGQVVTCSGAYKDGSLRIVRNGIG---INEQASVELQGI 347
           + P+VD  V     V  +   + ++V   G  K+G+L I+  GI    + E  S  L G+
Sbjct: 679 ISPVVDLTVGASAPVGTDLDPRTELVAACGHGKNGALAILTRGITPELVTEVESGALPGL 738

Query: 348 KGMWSLRSSTDDP-----------FDTFLVVSFISETRILAMNLEDELEETEIE-GFCSQ 395
           +  W+ R+  D+            FD  L++S  S    + +   +EL E   E  F   
Sbjct: 739 RACWATRTEDDNDGTVRPKRKDELFDEHLILSLSSTKTTMVLETGEELREVSKEVDFIVD 798

Query: 396 TQTLFCHDAIYN--QLVQVTSGSVRLV 420
            +TL C + I+N   + QVT   +R  
Sbjct: 799 EETLAC-ERIFNGRAIAQVTKTKIRFT 824



 Score = 39.7 bits (91), Expect = 8.3,   Method: Compositional matrix adjust.
 Identities = 26/92 (28%), Positives = 46/92 (50%), Gaps = 10/92 (10%)

Query: 111 VLDIKFLYGCAKPTIVVLYQDN------KDARHVKTYEVALKDKDFVEGP----WSQNNL 160
           V D  FL+G  +P +++LY+ N      + +  + T ++     D  +      W++  L
Sbjct: 292 VRDCCFLHGYGEPVLMILYESNPPTWSGRLSLRMDTCKLVAVSIDCTKKKYTIVWTREKL 351

Query: 161 DNGADLLIPVPPPLCGVLIIGEETIVYCSANA 192
            + A  L PVP PL GVL++    I+Y S ++
Sbjct: 352 PSAAYSLFPVPNPLGGVLVLSSGHILYESQSS 383


>gi|50292811|ref|XP_448838.1| hypothetical protein [Candida glabrata CBS 138]
 gi|74608786|sp|Q6FLQ6.1|RSE1_CANGA RecName: Full=Pre-mRNA-splicing factor RSE1
 gi|49528151|emb|CAG61808.1| unnamed protein product [Candida glabrata]
          Length = 1296

 Score = 68.9 bits (167), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 80/349 (22%), Positives = 155/349 (44%), Gaps = 58/349 (16%)

Query: 249 SIASTISYLDNAVVYIGSSYGDSQLIKLNLQP-----------DAKGSYVEVLERYVNLG 297
           S +S +   +N  +YI S +     + LN +            D +   + V+ ++ N+ 
Sbjct: 340 SNSSKLHIFNNGSMYINSQFNYDH-VYLNFESIGDNDENYDKIDNENENISVISKHTNIN 398

Query: 298 PI-VDFCVVD------LERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVEL-QGIKG 349
           PI ++ C+++      +  QG  +            + I+RN I + E  S  L QG+  
Sbjct: 399 PIALNLCLMENMPLTFMHFQGGNRTTDSE------KVNIIRNAIPLKEYVSSPLPQGVSN 452

Query: 350 MWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIY--- 406
           ++++++     + +F+ ++ I+ T ++      ++ +  IE +   + T    D +    
Sbjct: 453 IFTIKTQYQ-SYHSFIFLTMINFTTVIL-----KIADDSIEQYIPASDTFKLKDDMTIHV 506

Query: 407 -----NQLVQVTSGSVRLVSSTSRELRN-----EWKSPPGYSVNVATANASQVLLATGGG 456
                N ++QV     R +   S++  N     +W  P G S+  A +N SQ++LA    
Sbjct: 507 ATMGDNSIIQVCKDEFRQILLDSKDEENFKMNLKWYPPAGVSILSAVSNFSQLILALSNN 566

Query: 457 HLVYLEIGDGILTEVKH-AQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPD 515
            +VYL++ +  L E K+  +L   I+ L +  + +N   S+I AVG  +D  V + SL  
Sbjct: 567 EIVYLQLENNTLIEYKNRPELPDVITSLAL--LNDNTKKSEILAVGT-SDNMVNVLSLE- 622

Query: 516 LNLITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGH-LLNFLLNMKTG 563
              I  E +  E    +V+  A + I   L  L  GH L+N  + ++ G
Sbjct: 623 ---IVDEAISFE----TVVFQALDAIPSSLLILNQGHKLVNLHIGVEDG 664


>gi|70945139|ref|XP_742421.1| hypothetical protein [Plasmodium chabaudi chabaudi]
 gi|56521397|emb|CAH76894.1| conserved hypothetical protein [Plasmodium chabaudi chabaudi]
          Length = 435

 Score = 68.6 bits (166), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 73/307 (23%), Positives = 128/307 (41%), Gaps = 38/307 (12%)

Query: 646 IDDIQKLHIRSIPLGEHPRRIC-HQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEF 704
           +++I + +I+ IP      +I  H+E       C  + +    ++    +   +     F
Sbjct: 1   LNEINETYIQKIPFYRTVEKIAYHKESGLLITSCPPEEKHKTNKNLKQIICFFNPHQNSF 60

Query: 705 ISTYPLDTFEYGCSILSCSFSDDSNVY---------YCVGTAYVLPEENEPTKGRILVFI 755
             +Y + + +Y  S + C +  + ++Y          CVGTA +    +EP+ G I +F 
Sbjct: 61  KYSYIIPS-KYNVSSI-CVYQINKDIYPNKSSINTLICVGTANINDRVSEPSSGHIYIFF 118

Query: 756 VEDGKLQLIAEKE------TKGAVYSLNAFNGKLLAAINQK----------IQLYKWM-- 797
            +  K  L   K         G +  L  F  KL+  IN            I L K++  
Sbjct: 119 AK-KKANLFEIKHIYTHNVNVGGITHLKQFYDKLITTINNTVVILDISEFLINLDKYVDN 177

Query: 798 -----LRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEER 852
                L +DGT    +       I++L V    ++IVVGD+M S+++L Y      + E 
Sbjct: 178 TNKPKLENDGTIVDVASFTPSSWIMSLDV--IENYIVVGDIMTSVTILSYDFNNSILTEV 235

Query: 853 ARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVN 912
            RDY+  W + V  L    +L ++   N    +K+S    DE+  +L  V  ++ G  VN
Sbjct: 236 CRDYSNVWCTFVCALSKSHFLVSDMESNFLVFQKSSIKYNDEDSFKLSRVALFNHGHVVN 295

Query: 913 RFRHGSL 919
           +    SL
Sbjct: 296 KMLPVSL 302


>gi|302694047|ref|XP_003036702.1| hypothetical protein SCHCODRAFT_63425 [Schizophyllum commune H4-8]
 gi|300110399|gb|EFJ01800.1| hypothetical protein SCHCODRAFT_63425 [Schizophyllum commune H4-8]
          Length = 1396

 Score = 68.6 bits (166), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 116/508 (22%), Positives = 206/508 (40%), Gaps = 82/508 (16%)

Query: 573  LGTQPITLRTF----SSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAA 628
            L  Q   LRTF     +     VF   DRP  I +++K      V +    H   +  +A
Sbjct: 891  LAEQKKALRTFVPFACAGAPAGVFFTGDRPHWIVATDK----GGVQMYPSGHAAVYAFSA 946

Query: 629  FP------DSLAIAKEGELTIGTIDDIQ---KLHIRSIPLGEHPRRICHQEQSRTF-AIC 678
                    + L  ++EG+     I + +    L +R IP G     I ++  S    A  
Sbjct: 947  CTLWERSTEFLIYSEEGQTLCEWITEYEIGRPLPMRHIPRGRAYSNIVYEPASSMIVAAA 1006

Query: 679  SLK----------NQSCAEES--------EMHFVRLLDDQTFEFISTYPLDTFEYGCSIL 720
            SL+          NQ  A +         E   + L+  + +  +  Y   T E+  + +
Sbjct: 1007 SLRARFASFDEDGNQIWAPDGPGITEPTVECSTLELISPEVWATVDGYEFATNEF-VNTM 1065

Query: 721  SC----SFSDDSNV--YYCVGTAYVLPEENEPTKGRILVF----IVEDG--------KLQ 762
             C    + S ++ V  +  VGT+ V   E+   KG   +F    +V D         +L+
Sbjct: 1066 ECVPLETVSTEAGVKHFIAVGTSIVR-GEDLAVKGATYIFEVVEVVPDQSNGPKRWYRLK 1124

Query: 763  LIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQ 822
            L    + KG V +L   N  L++++ QKI +  + L +        + G    +    ++
Sbjct: 1125 LRCRDDAKGPVTALCGINNYLVSSMGQKIFVRAFDLDERLVGVAFMDVG----VYVTSLR 1180

Query: 823  TRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLF 882
               + +++GD+++ I  + ++ +   +    RD +    + V+      +  AE    + 
Sbjct: 1181 ALKNLLLIGDVVRGIQFVAFQEDPYKLVTLGRDVSRMCATTVD------FFFAEEALAIV 1234

Query: 883  TVRKNS--------EGATDEERGRLEVV-GEYHLGEFVNRFRHGSLVMRLPDSDVGQIPT 933
            T  +N           A D   GRL +   E++L      FR  +L+ R    D   IP 
Sbjct: 1235 TTDENGVMSMYNYDPEAPDSHDGRLLLKQTEFNLH---TDFRTSTLIARRTKDD-PIIPQ 1290

Query: 934  --VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVD- 990
              +IFG  +G +  +  +P +    L+ LQ  L + ++ V GLN +  R   NE  +   
Sbjct: 1291 GILIFGGTDGTLSCLTPVPDDAAKRLQPLQLQLTRNMQHVAGLNPKALRIVRNEHVSRPL 1350

Query: 991  AKNFLDGDLIESFLDLSRTRMDEISKTM 1018
            +K  LDG+LI  F  L  TR DE+++ +
Sbjct: 1351 SKGILDGNLIAYFEHLPITRQDEMTRQI 1378



 Score = 45.1 bits (105), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 63/303 (20%), Positives = 116/303 (38%), Gaps = 55/303 (18%)

Query: 24  DFLFIATERYKFCVLQWDAESSELIT-------RAMGDVSDRIGRPTDNGQIGIIDPDCR 76
           D L ++ +  K  +L+W   + +L+T       RA+  V+  I   T   ++   DP  R
Sbjct: 139 DRLLVSFKDAKIALLEWSTATQDLLTVSIHTYERAIQMVATDISAFTSELRV---DPQSR 195

Query: 77  LIGLHLYDGLFKV-------------IPFDNKGQLKEAFNIRLEELQVLDIKFLYGCAKP 123
              L L    F +             +P+     L     +      V+D  FL G + P
Sbjct: 196 CAALSLPKDAFAILPPCQVSDSVCRDVPYSPSFILNLPSEVESGIRNVIDFTFLPGFSNP 255

Query: 124 TIVVLYQDNKD-----ARHVKTYEVALKDKDFVEGPWSQNNLDNGAD---LLIPVPPPLC 175
           T+ VLY+  +           T ++A    D V   +    L  G     L +   P   
Sbjct: 256 TVAVLYETYQTWTGRLNEQKDTVKMAFFTLDIVNRRYPVIGLATGLPCDCLSVLACPSTG 315

Query: 176 GVLIIGEETIVYCSANAFK---------------AIPIRPSITKAYGRVDADGSRYLLGD 220
           GV+++   +I+Y   +  K               A+P   +  +    ++ +GSR +  D
Sbjct: 316 GVMVVASNSIIYVDQSGRKVVLPVNAWIPRMSDIALPTNLTPEEQARTLELEGSRSIFVD 375

Query: 221 HA--------GLLHLLVITHEKEKVTGLKI-ELLGETSIASTISYLDNAVVYIGSSYGDS 271
                     G ++ + +      V+ L +   L +T+I S +  ++N  + +GS+ GDS
Sbjct: 376 DKTAFIILKDGTIYPVELVTAGRVVSKLALGTPLAKTTIPSILRRINNDYLLVGSASGDS 435

Query: 272 QLI 274
            L+
Sbjct: 436 ALL 438


>gi|114318675|gb|ABI63336.1| putative splicing factor 3b [Hordeum vulgare subsp. vulgare]
          Length = 379

 Score = 68.6 bits (166), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 72/274 (26%), Positives = 118/274 (43%), Gaps = 17/274 (6%)

Query: 433 SPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTEVKHAQLEYEISCLDINPIGEN 491
           +P   ++    +N  QV++A  GG L+Y E+   G L EV+   +  +++CL I P+ E 
Sbjct: 1   TPGKKTITKVGSNRLQVVIALSGGELIYFEMDMTGQLMEVEKQDMSGDVACLAIAPVPEG 60

Query: 492 PSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPRSVLLCAFE-------GISY 543
              S+  AVG + D ++RI SL PD  L           P S+L    +       G  Y
Sbjct: 61  RQRSRFLAVGSY-DNTIRILSLDPDDCLQPLSVQSVSSAPESLLFLEVQASVGGEDGADY 119

Query: 544 -----LLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRP 598
                L   L +G L    ++M TG+L+D +   LG +P  L      +   +   S RP
Sbjct: 120 PANLFLNAGLQNGVLFRTNVDMVTGQLSDTRSRFLGLRPPKLFPCIVSHRQAMLCLSSRP 179

Query: 599 TVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDI-QKLHIRSI 657
            + Y      L + ++   +     F+S    + +       L I TI+ + +  +  SI
Sbjct: 180 WLGYIHQGHFLLTPLSCDTLESAASFSSDQCSEGVVAVAGDALRIFTIERLGETFNETSI 239

Query: 658 PLGEHPRRICHQEQSRTFAIC-SLKNQSCAEESE 690
           PL   PR+     + +  A+  S K    AE+ E
Sbjct: 240 PLRYTPRKFVILPKKKYLAVIESDKGAFSAEQRE 273


>gi|348555856|ref|XP_003463739.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
            isoform 2 [Cavia porcellus]
          Length = 1387

 Score = 68.2 bits (165), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 55/212 (25%), Positives = 96/212 (45%), Gaps = 22/212 (10%)

Query: 826  DFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG---AENNFNLF 882
            +FI+  D+MKSISLL Y+ E   +   +RD     + +V+ + D+  LG   ++ + NL 
Sbjct: 1164 NFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLM 1223

Query: 883  TVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPT--------- 933
                  E        RL    ++H+G  VN F       R P     + P+         
Sbjct: 1224 VYMYLPEAKESFGGMRLLRRADFHVGAHVNTF------WRTPCRGATEGPSKKSVVWENK 1277

Query: 934  --VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVD- 990
                F T++G IG++  +  + Y  L  LQ  L  ++    GLN   +R  + +++ +  
Sbjct: 1278 HITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQN 1337

Query: 991  -AKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
              +N LDG+L+  +L LS     E++K +  +
Sbjct: 1338 AVRNVLDGELLNRYLYLSTMERGELAKKIGTT 1369



 Score = 50.1 bits (118), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 96/493 (19%), Positives = 188/493 (38%), Gaps = 143/493 (29%)

Query: 71  IDPDCRLIGLHLYDGLFKVIPF-------DNKGQLKEA----------FNIR-LEE--LQ 110
           +DPD R   + +Y     V+PF       +++G + E            ++R L+E  L 
Sbjct: 142 VDPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLVGEGQRSSFLPSYIIDVRALDEKLLN 201

Query: 111 VLDIKFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDFVEG--P--WSQNNLD 161
           ++D++FL+G  +PT+++L++ N+      A    T  +     +  +   P  WS  +L 
Sbjct: 202 IVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKVHPVIWSLTSLP 261

Query: 162 NGADLLIPVPPPLCGVLIIGEETIVYCSAN-------------AFKAIPIRPSITKAYGR 208
                 + VP P+ GV+I    +++Y + +                A P+R   T+   R
Sbjct: 262 FDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTLGTTAFPLR---TQEGVR 318

Query: 209 VDADGS--------RYLLGDHAGLLHLL-VITHEKEKVTGLKIELLGETSIASTISYLDN 259
           +  D +        + ++    G +++L +IT     V     +    + + +++  ++ 
Sbjct: 319 ITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLTTSMVTMEP 378

Query: 260 AVVYIGSSYGDSQLIKL--NLQ---------------PDAKGSYVE-------------- 288
             +++GS  G+S L+K    LQ               P +K   V+              
Sbjct: 379 GYLFLGSRLGNSLLLKYTEKLQEPPASTVREAADKEEPPSKKKRVDSTAGWAGSKTVPQD 438

Query: 289 ------------------------VLERYVNLGPIVDFCVVD---LERQGQGQ----VVT 317
                                   V +  +N+GP  +  V +   L  +   +    +V 
Sbjct: 439 EVDEIEVYGSEAQSGTQLATYSFEVCDSMLNIGPCANAAVGEPAFLSEENSPEPDLEIVV 498

Query: 318 CSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL------------------------ 353
           CSG  K+G+L +++  I      + EL G   MW++                        
Sbjct: 499 CSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEEEETPKAEGSEQEPSAP 558

Query: 354 RSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQ-LVQV 412
            +  D     FL++S    T IL      E+ E +  GF +Q  T+F  +   N+ +VQV
Sbjct: 559 EAEDDGRRHGFLILSREDSTMILQTG--QEIMELDTSGFATQGPTVFAGNIGDNRYIVQV 616

Query: 413 TSGSVRLVSSTSR 425
           +   +RL+   ++
Sbjct: 617 SPLGIRLLEGVNQ 629


>gi|119195757|ref|XP_001248482.1| hypothetical protein CIMG_02253 [Coccidioides immitis RS]
 gi|121769680|sp|Q1E5B0.1|CFT1_COCIM RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two protein
            1
 gi|392862316|gb|EAS37050.2| protein CFT1 [Coccidioides immitis RS]
          Length = 1387

 Score = 68.2 bits (165), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 87/372 (23%), Positives = 159/372 (42%), Gaps = 37/372 (9%)

Query: 694  VRLLDDQTFEFISTYPLDTFEYGCSILSCSF-----SDDSNVYYCVGTAYVLPEENEPTK 748
            ++LL  +T+  + +Y L   E    + + +      + +      VGTA V  E+  P +
Sbjct: 1015 IKLLSPRTWSVVDSYELGDAERVMCMKTINMEISEITHEMKDMLVVGTATVRGEDITP-R 1073

Query: 749  GRILVF-IVE----------DGKLQLIAEKETKGAVYSLNAFNGK--LLAAINQKIQLYK 795
            G I VF I+E          + KL++ A+ + KGAV +++   G+  L+ A  QK  +  
Sbjct: 1074 GSIYVFEIIEVAPDPDRPETNRKLKIFAKDDVKGAVTAVSGIGGQGFLIMAQGQKCMVRG 1133

Query: 796  WMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARD 855
              L++DG+    +      ++  L         ++GD +K I    Y  E   +    +D
Sbjct: 1134 --LKEDGSLLPVAFMDMQCYVKVLKELQGTGLCIMGDALKGIWFAGYSEEPYRLTLFGKD 1191

Query: 856  YNANWMSAVEILDDD--IY-LGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVN 912
                 + A + L D   +Y L A+++  +  +  + E  T  +  RL     +H G F +
Sbjct: 1192 NEYLQVIAADFLPDGKRLYILVADDDCTIHVLEYDPEDPTSSKGDRLLHRSSFHTGHFTS 1251

Query: 913  RF-----RHGSLVMRLP---DSDVGQIPT---VIFGTVNGVIGVIASLPHEQYLFLEKLQ 961
                      S     P   D DV  +P    V+  +  G IGV+  L  + Y  L  LQ
Sbjct: 1252 TMTLLPEHSSSPSADDPEEDDMDVDYVPKSYQVLVTSQEGSIGVVTPLTEDSYRRLSALQ 1311

Query: 962  TNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
            + L   ++   GLN + +R+   E      +  +DG+L+  +LD+   R  EI+  +   
Sbjct: 1312 SQLVTSMEHPCGLNPKAYRAV--ESDGFGGRGIVDGNLLLRWLDMGVQRKAEIAGRVGAD 1369

Query: 1022 VEELCKRVEELT 1033
            +E +   +E ++
Sbjct: 1370 IESIRVDLETIS 1381


>gi|167386195|ref|XP_001737658.1| spliceosomal protein sap [Entamoeba dispar SAW760]
 gi|165899468|gb|EDR26056.1| spliceosomal protein sap, putative [Entamoeba dispar SAW760]
          Length = 1079

 Score = 67.8 bits (164), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 148/691 (21%), Positives = 280/691 (40%), Gaps = 82/691 (11%)

Query: 249 SIASTISYLDNAVVYIGSSYGDSQLI------KLNLQPDA----KGSYVEVLERYVNLGP 298
           +I+ ++S+  N  +   +S   + L+       + +QP +    +  Y  +     +  P
Sbjct: 262 TISPSVSWSINGDIIFSASESQNHLVLKVPSLTIPIQPSSTYKPQSLYFNISTSISSHHP 321

Query: 299 IVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTD 358
           +    VVD     Q +     G+    SL+I+ +G+  +E  S+ LQ  + +W+L+   +
Sbjct: 322 LTSVDVVDSHEVFQIRAFVGKGS--RSSLKILTHGLNPDECMSLSLQHPRNIWTLKPYNE 379

Query: 359 DPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVR 418
                ++V+   ++T +L   L D L++    G     QTL     I   LVQ+ +  + 
Sbjct: 380 QTHQ-YVVIGLENQTYVLKT-LPDALKQCPECGIRPNVQTLHAGMFIDGTLVQIHAHGII 437

Query: 419 LVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGH----LVYLEIGDGILTEVKH- 473
            + +   E  +E       ++ V T+++ QV++A         +VY E      TE K  
Sbjct: 438 TILN---EKLSEQDPGAQRTILVGTSSSCQVVIALKNAQSKCEIVYFEYN----TETKKL 490

Query: 474 AQLEYEISCLDINPIG----ENPSYSQIAAV---GMWTDISVRIFSLPDLNLITKEHLGG 526
           A++E  I   +I  +     EN    ++A +   GM   +++   S     L T   + G
Sbjct: 491 AEVERRIIRSNIIAMALGDFENGKAKKVAYICDNGMANILTLEKESKFLQILETNITVEG 550

Query: 527 EIIPRSVLLCAFEGIS--YLLCALG--DGHLLNFLLNMKTGELTDRKKVSLGTQPITLRT 582
           E    S++L  F  IS  +L   +G  DG+   + +N K  EL   K +  G  P+    
Sbjct: 551 E--AESIILTQFNEISEPFLFLGIGMKDGYCYGYKINGKNSELMWCKLI--GNDPVVFGQ 606

Query: 583 FSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELT 642
           +       V  +S     +YS   ++ ++ ++ + +  + PF +    +S+A     ELT
Sbjct: 607 YEFGGNIGVIMSSSTMIFMYSYFSRIEFTPISYQPLKALAPFITDFSGESIAGITNDELT 666

Query: 643 ---IGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDD 699
              IG++DD  +      PL   PR+I       T  +CS       +     +  ++D+
Sbjct: 667 VIAIGSLDD--RFTYDEQPLLYTPRKIIKSNYFPTTVLCS-------DNKSNLYNPIIDN 717

Query: 700 QTFEFIS-TYPLDTFEYG----------CSILSCSFSDDSNVY----------YCVGTAY 738
           +T EF      +D  E G           SI    F  D  +            C  ++ 
Sbjct: 718 ETNEFNERKIGVDLQEDGKWASEVGIINNSIFKSYFKTDEAIICGAHVSFKKKQCFISSQ 777

Query: 739 VLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWML 798
           V  +  +  KG   + +      +LI +   +    +L +   +LLA I   ++LY    
Sbjct: 778 V--KHYKKMKGECWINVYNLKTFKLIHKTPVEDICRALTSCGERLLAGIGTTLRLY---- 831

Query: 799 RDDGTRELQSECGHHGHILAL-YVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYN 857
            D G + L  +C   G    +  ++  GD IVVG +      + Y  +   +    +D  
Sbjct: 832 -DLGKQILIRKCELDGFPSIINSLEVIGDKIVVGTVATGFIYVNYDSDANILSITEKDRI 890

Query: 858 ANWMSAVEILDDDIYLGAENNFNLFTVRKNS 888
            + ++A  ILD+   +G +   ++F    N+
Sbjct: 891 WHSLTASTILDETSTIGFDKLGSVFITETNT 921


>gi|326501484|dbj|BAK02531.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 101

 Score = 67.0 bits (162), Expect = 5e-08,   Method: Composition-based stats.
 Identities = 36/67 (53%), Positives = 44/67 (65%)

Query: 632 SLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEM 691
           SLAIA+EGEL+IG+IDD+QKLHIR+I L E  R I HQEQS T         + A ES  
Sbjct: 4   SLAIAEEGELSIGSIDDVQKLHIRTILLIEQARSIFHQEQSMTRCFAFSSTPTIAGESFA 63

Query: 692 HFVRLLD 698
              RL++
Sbjct: 64  KIHRLME 70


>gi|327287424|ref|XP_003228429.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
            1-like [Anolis carolinensis]
          Length = 1294

 Score = 67.0 bits (162), Expect = 5e-08,   Method: Compositional matrix adjust.
 Identities = 67/227 (29%), Positives = 104/227 (45%), Gaps = 31/227 (13%)

Query: 710  LDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-----IVEDG 759
            L+ +E+   + + S   +  V     Y  VGT  +  EE    +GRIL+      + E G
Sbjct: 1048 LEEWEHVTCMKTVSLKSEETVSGLKGYIAVGTCLMQGEEVT-CRGRILIMDIIEVVPEPG 1106

Query: 760  ------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDD---GTRELQSEC 810
                  K +++ EKE KG V +L   NG L++AI QKI L  W L+D+   G   + ++ 
Sbjct: 1107 QPLTKNKFKVLYEKEQKGPVTALCHCNGYLVSAIGQKIFL--WSLKDNDLTGMAFIDTQL 1164

Query: 811  GHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDD 870
              H  I      +  +FI+  D+MKSISLL Y+ E   +   +RD     +  V+ + D 
Sbjct: 1165 YIHQMI------SVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYCVDFMVDS 1218

Query: 871  IYLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRF 914
              LG   ++ + NL       E        RL    ++H+G  VN F
Sbjct: 1219 CQLGFLVSDRDRNLLVYMYLPEAKESFGGMRLLRRADFHVGAHVNAF 1265



 Score = 47.4 bits (111), Expect = 0.038,   Method: Compositional matrix adjust.
 Identities = 48/215 (22%), Positives = 98/215 (45%), Gaps = 33/215 (15%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
            +G + ++   +  G  +D L ++ +  K  V+++D  + +L T ++    +   R    
Sbjct: 74  FFGNVMSMASVQLAGAKRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPELRDGFV 133

Query: 66  GQIGI----IDPDCRLIGLHLYDGLFKVIPF-------DNKGQLKEA----------FNI 104
             + I    +DPD R   + +Y     V+PF       +++G + E            +I
Sbjct: 134 QNVHIPKVRVDPDGRCAVMLIYGTRLVVLPFRRDTLTDEHEGVVGEGQKSSFLPSYIIDI 193

Query: 105 R-LEE--LQVLDIKFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDFVEGP-- 154
           R L+E  L ++D++FLYG  +PT+++L++ N+      A    T  +     + ++    
Sbjct: 194 RELDEKLLNIIDMQFLYGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNIMQKVHP 253

Query: 155 --WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVY 187
             WS +NL       + VP P+ GV+I    +++Y
Sbjct: 254 VIWSLSNLPFDCTQALAVPKPIGGVVIFAVNSLLY 288


>gi|70952260|ref|XP_745310.1| splicing factor 3b, subunit 3, 130kD [Plasmodium chabaudi chabaudi]
 gi|56525593|emb|CAH80831.1| splicing factor 3b, subunit 3, 130kD, putative [Plasmodium chabaudi
           chabaudi]
          Length = 530

 Score = 66.6 bits (161), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 96/475 (20%), Positives = 193/475 (40%), Gaps = 71/475 (14%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
           I+G I +LE+FR  G  +D++ I ++  +  +L++D E ++ I R   +   + G R   
Sbjct: 58  IFGIIRSLEIFRLTGSNKDYIAIGSDSGRLVILKYDDEKNDFI-RVHCETYGKSGIRRIV 116

Query: 65  NGQIGIIDPDCRLIGLHLYDG--LFKVIPFDNKGQLKEAFNIRLEELQVL---DIKFLYG 119
            G+   +DP  R + L   +      ++  DNK  L  +  +   +   +    +    G
Sbjct: 117 PGEYIAVDPKGRALMLCAIEKQKFVYILNRDNKENLTISSPLEAHKSHSICHAVVGLNVG 176

Query: 120 CAKPTIVVLYQDNKDA-RHVKTYEVAL-----KDKDFVEGP------WSQN--------- 158
              P  V + Q+ +   + V    + L     +D+  +E P      W  +         
Sbjct: 177 FENPMFVSIEQNYETLDKEVTNGNIQLSSNPNEDQQIMEYPKKGLCFWEMDLGLNHVIKK 236

Query: 159 ---NLDNGADLLIPVP---PPLCGVLIIGEETIVY---------CSANAFKAIPIRPSIT 203
               +D  A LLIP+P       G+++  E  +VY         CS      +    +I+
Sbjct: 237 HTIPIDITAHLLIPLPGGQQGPSGLIVCCENYLVYKKIDHDDVYCSYPRRLEVGEEKNIS 296

Query: 204 KAYGRVDADGSRY--LLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAV 261
                +    + +  L+    G L+ + + HE   V  +  +      IA++I  L +  
Sbjct: 297 IVCWTIHRIKTFFFILIQSEYGDLYKIEVNHEDGIVKEIICKYFDTVPIANSICVLKSGA 356

Query: 262 VYIGSSYGDSQLIKLN-------------LQPDAKGSYVE----------VLERYVNLGP 298
           +++ + +G+    + +               P  K + +           ++++  +L P
Sbjct: 357 LFVAAEFGNHFFYQFSGIGNDSNESMCTSNHPSGKNAIIAFKTQKLKNLYLVDQIYSLSP 416

Query: 299 IVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSST 357
           I+D  ++D +     Q+    G     SLRI+++G+ I E A+ EL G  K +W+++   
Sbjct: 417 IIDMKILDAKNSSLPQIYALCGRGPRSSLRILQHGLSIEELANNELPGKPKYIWTIKKDN 476

Query: 358 DDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
              +D +++VSF   T IL +   + +EE       +   T+  +    N  +QV
Sbjct: 477 SSEYDGYIIVSFEGNTLILEIG--ETVEEVYDSLLLTNVTTIHINLLYDNSFIQV 529


>gi|149066088|gb|EDM15961.1| cleavage and polyadenylation specific factor 1, 160kDa (predicted),
            isoform CRA_b [Rattus norvegicus]
          Length = 241

 Score = 66.6 bits (161), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 55/212 (25%), Positives = 96/212 (45%), Gaps = 22/212 (10%)

Query: 826  DFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG---AENNFNLF 882
            +FI+  D+MKSISLL Y+ E   +   +RD     + +V+ + D+  LG   ++ + NL 
Sbjct: 18   NFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLM 77

Query: 883  TVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPT--------- 933
                  E        RL    ++H+G  VN F       R P     + P+         
Sbjct: 78   VYMYLPEAKESFGGMRLLRRADFHVGAHVNTF------WRTPCRGAAEGPSKKSVMWENK 131

Query: 934  --VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVD- 990
                F T++G IG++  +  + Y  L  LQ  L  ++    GLN   +R  + +++ +  
Sbjct: 132  HITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQN 191

Query: 991  -AKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
              +N LDG+L+  +L LS     E++K +  +
Sbjct: 192  AVRNVLDGELLNRYLYLSTMERSELAKKIGTT 223


>gi|85682925|gb|ABC73438.1| CG13900 [Drosophila miranda]
          Length = 341

 Score = 66.6 bits (161), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 64/288 (22%), Positives = 120/288 (41%), Gaps = 37/288 (12%)

Query: 718 SILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVED---GKLQLIAEKETKGAVY 774
           +++  S + D   Y  VG A  L      ++G  L     D     L+ +   E      
Sbjct: 67  AMVKFSVAADGRYYLAVGIARDLQLNPRISQGGYLDIYKIDPTCSSLEFMHRTEIDEIPG 126

Query: 775 SLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSEC-GHHGHILALYVQTRGDFIVVGDL 833
           +L  F G+LLA   + +++Y     D G +++  +C   H     + +Q  G  + V D+
Sbjct: 127 ALCGFQGRLLAGCGRMLRIY-----DLGKKKMLRKCENKHIPYQIVNIQAMGHRVYVSDV 181

Query: 834 MKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATD 893
            +S+  L Y+  E  +   A D +  W++A  +LD D  +   + F   ++++     TD
Sbjct: 182 QESVFFLRYRRAENQLIIFADDTHPRWVTATTLLDYDT-IAIADKFGNLSIQRLPHSVTD 240

Query: 894 E------------ERGRL-------EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTV 934
           +            +RG L       E +  +H+GE +   +  +L+        G    +
Sbjct: 241 DVDEDPTGTKSLWDRGLLSGASQKSENICSFHVGEIIMSLQKATLI-------PGGSEAL 293

Query: 935 IFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRS 981
           I+ T+NG +G        E Y F + L+ ++R     + G +H  +RS
Sbjct: 294 IYSTLNGTVGAFVPFTSREDYDFFQHLEMHMRNENPPLCGRDHLSYRS 341


>gi|308805673|ref|XP_003080148.1| cleavage and polyadenylation specificity factor (ISS) [Ostreococcus
            tauri]
 gi|116058608|emb|CAL54315.1| cleavage and polyadenylation specificity factor (ISS), partial
            [Ostreococcus tauri]
          Length = 1473

 Score = 66.2 bits (160), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 87/360 (24%), Positives = 155/360 (43%), Gaps = 41/360 (11%)

Query: 694  VRLLDDQTFEFISTYPLDTFEYGCSILSCSFSD----DSNVYYCVGTAYVLPE-ENEPTK 748
            VRLL   + + +  + L+  E+   + +    D     S  Y  VGTA  +P  E+ P +
Sbjct: 845  VRLLVPGSLDCVWQHTLEPGEHVQCVRNVQLKDINTGHSLSYLAVGTA--MPGGEDTPCR 902

Query: 749  GRILVFIV--------EDG---KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWM 797
            GR+ +F +         DG   K Q+   +E K A  +L    G L+ A+  K+ ++ W 
Sbjct: 903  GRVYLFNMVWERDSESADGYRWKGQVCCVREAKMACTALEGLGGHLIVAVGTKLTVHTW- 961

Query: 798  LRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHE--EGAIEERARD 855
               DG REL S       I  + +    +FI+VGDL K +    +K    E ++ + ++D
Sbjct: 962  ---DG-RELNSVAFFDTPIHTVSINVVKNFILVGDLEKGLHFFRWKDTGFEKSLIQLSKD 1017

Query: 856  YNANWMSAVEILDDDI---YLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVN 912
            +    + + E L D      LG++ + N  T   + +     +  +L     YH+G  ++
Sbjct: 1018 FERMDVVSSEFLIDGTTLSLLGSDMSGNARTFGYDPKSIESWKGQKLLPRAAYHVGSPIS 1077

Query: 913  R---FRHGSLVMRLPDSDVGQIP-----TVIFGTVNGVIGVIASLPHEQYLFLEKLQTNL 964
            R   F       ++  +D G+        V FGT++G +G+        Y  L  +Q  L
Sbjct: 1078 RMVRFNVEGSKSKMASTD-GKPKGANRFAVFFGTLDGALGIFMPTDPVTYEKLLAIQREL 1136

Query: 965  RKVIKGVGGLNHEQWRS---FNNEKKTVDAK-NFLDGDLIESFLDLSRTRMDEISKTMNV 1020
               ++   G N   +R+   F  +   + A  + LDG L+  F  L+ +   +I+ +  V
Sbjct: 1137 TTAVRSPIGCNPRTFRTPKVFEGKHVQLRAPLDVLDGGLLSKFETLTFSEQVKIASSAQV 1196


>gi|196476797|gb|ACG76262.1| damaged-DNA binding protein DDB p127 subunit [Amblyomma americanum]
          Length = 74

 Score = 66.2 bits (160), Expect = 9e-08,   Method: Composition-based stats.
 Identities = 32/73 (43%), Positives = 48/73 (65%), Gaps = 10/73 (13%)

Query: 974  LNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTM----------NVSVE 1023
            ++H  WRSF+ E+KT     F+DGDLIESFLDLSR +M E+ + +          + +V+
Sbjct: 2    IDHAFWRSFSTERKTEQPVGFIDGDLIESFLDLSRDKMQEVVQGIQMDDGSGMKKDATVD 61

Query: 1024 ELCKRVEELTRLH 1036
            +L K +EEL+R+H
Sbjct: 62   DLIKIIEELSRIH 74


>gi|407851810|gb|EKG05523.1| damage-specific DNA binding protein, putative [Trypanosoma cruzi]
          Length = 313

 Score = 66.2 bits (160), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 43/154 (27%), Positives = 67/154 (43%), Gaps = 36/154 (23%)

Query: 897  GRLEVVGEYHLGEFVNRFRHGS--------LVMRLPDSDVGQI--PTVIFGTVNGVIGVI 946
            G+LE   +YHLG+ V   + GS        + + +P + +  I  P V+FGT +G  G I
Sbjct: 81   GKLETSAQYHLGDMVTAMQLGSFAPCSVTNIAVPIPTTLIPGICGPQVVFGTSHGAFGTI 140

Query: 947  ASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRS------------------------- 981
              + +E YL L+ L+  +  V+  +GG  H  +R                          
Sbjct: 141  TPVSNETYLLLKALEVAVSSVVPALGGFEHVTYREVLRAGQERGYSRNASFENANVVSSG 200

Query: 982  -FNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEI 1014
             F+  +K   ++    GDLIESFL  S+T    I
Sbjct: 201  VFDKRRKRYLSRCVCSGDLIESFLTFSQTIQQRI 234


>gi|198415711|ref|XP_002123169.1| PREDICTED: similar to cleavage and polyadenylation specificity
           factor 1, partial [Ciona intestinalis]
          Length = 1370

 Score = 65.9 bits (159), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 112/592 (18%), Positives = 224/592 (37%), Gaps = 135/592 (22%)

Query: 6   IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
           ++G +  +   R  G   D + ++    K  ++++D  + ++ T ++    D + +    
Sbjct: 74  LFGNVVRMRSVRLAGAKLDSVLLSFAEAKLSIIEFDQATHDIKTTSLHYFEDALYKDGSY 133

Query: 66  GQIGI----IDPDCRLIGLHLYDGLFKVIPFD-NKGQLK--------------------- 99
            +I +    +DP+ R + L L      V+P   N   L                      
Sbjct: 134 QRITLPKIAVDPESRCVALQLTTKSVAVVPLRANTAALATDDGAAPQDNVSLQNKRSTTS 193

Query: 100 -----EAFNIRLEELQVLDIKFLYGCAKPTIVVLYQD-----NKDARHVKTYEVALKDKD 149
                 A + RL+  +++DI+FL+G  +PT++VL++       + A    T  +     +
Sbjct: 194 YTIDLHAVDARLQ--RIIDIQFLHGYNEPTLLVLFESLRTWAGRVAMRQDTCNIVAISLN 251

Query: 150 FVEG----PWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANA------------- 192
             E      WS N L        PVP P+ GVLI    +I+Y + +              
Sbjct: 252 MAEQLHPVVWSLNGLPFDCKYAYPVPKPIGGVLIFAVNSILYLNQSVPPYGTSLNSTTEN 311

Query: 193 FKAIPIRP------SITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLG 246
             + P++P      ++  ++    +  S  +   +  L  L ++      V     +   
Sbjct: 312 STSFPLKPQEDVCMTLDCSHAMFISPESLVISLKNGELYVLTLLVDSMRSVRNFHFDKSA 371

Query: 247 ETSIASTISYLDNAVVYIGSSYGDSQLIK-----------------------LNLQPDAK 283
            + + S ++ LD+  +++GS  G+S L+K                       LN   D  
Sbjct: 372 SSVLTSCLTVLDDGFLFLGSRLGNSLLLKYTEARPVFRNCYHTEEPAAKRKRLNTAADWA 431

Query: 284 GS-----------------------YVEVLERYVNLGPI-------VDFCVVDL--ERQG 311
            S                         EV +  VN+GP          F   +   +R+ 
Sbjct: 432 ASDTNDIDLQMYGKDTVTSEPLSSYKFEVCDSLVNIGPCGAAELGEPAFLSEEFVSQRES 491

Query: 312 QGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRS---STDDPFDT----- 363
             ++   SG  K+G++ +++  +      + EL G   MW+++S    T+ P  T     
Sbjct: 492 DLELAILSGHGKNGAISVLQRSVKPQVVTTFELPGCIDMWTVKSVCEKTELPTKTQQQQH 551

Query: 364 -FLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQ--LVQVTSGSVRLV 420
            +L++S   E   L +    E+ E E  GF ++ Q++F  +   ++  ++QV +  V L+
Sbjct: 552 SYLILS--REESTLILETGKEIMEVENSGFNTREQSVFVGNIGGDKELILQVCASGVWLL 609

Query: 421 SSTS--RELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTE 470
           +     + +  E  SP    +   +      LL T  G L+ L + + + +E
Sbjct: 610 AGVKLLQHIPLELGSP----ITQCSICDPYALLLTSDGDLIMLTLTNDLDSE 657



 Score = 62.4 bits (150), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 103/459 (22%), Positives = 191/459 (41%), Gaps = 63/459 (13%)

Query: 580  LRTFSS-KNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMC--PFNSAAFPDS-LAI 635
            LRTFS+    + VF     P  I+ + +  L  +    + S  C  PF++   P+  L  
Sbjct: 917  LRTFSNIGGYSGVFLCGPYPYWIFVTIRGALCCHPMSVDGSVSCFVPFHNVNCPNGFLYF 976

Query: 636  AKEGELTIGTIDDIQKLH----IRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAE---- 687
              +GEL I  +    K      +R I L      + +  + + +A+ +  ++ C      
Sbjct: 977  NSQGELRICMLPPHMKYDTAWPMRKITLRCSVHFLAYSIEHKVYALVTSVSEPCTRLPYL 1036

Query: 688  --ESEMHF------------------VRLLDDQTFEFISTYPLDT--FEYGCSI----LS 721
              E+E  F                  V+L+   +++ +    LD   FE+   +    LS
Sbjct: 1037 TFENEREFEDLEKGDRFIYPHIDKFSVQLISPASWDLVPNARLDMGEFEHITCMKNVWLS 1096

Query: 722  CSFSDDSNVYYCV-GTAYVLPEENEPTKGRILVF-----IVEDG------KLQLIAEKET 769
            C     +   + V GT  V  EE   ++G+I++      + E G      KL+ I  +E 
Sbjct: 1097 CGQDSSARQNFLVLGTVNVFGEEMS-SRGKIIILEVIEVVPEPGQPLTKNKLKQIYSEEQ 1155

Query: 770  KGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIV 829
            KG V ++    G LL AI QKI ++++    D  + L+       ++   +  +   F +
Sbjct: 1156 KGPVTAVCGLEGNLLTAIGQKIFIWRF----DENQSLRGLAFVDTNVYIHHALSFRSFAL 1211

Query: 830  VGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI---YLGAENNFNLFTVRK 886
            VGD+ +SI+LL Y+ +   +   +RD     +   +++ D     +L +++  NL     
Sbjct: 1212 VGDIQRSITLLRYQTDFKTLSVTSRDVRPLEVYTADLVVDGTGINFLVSDHEKNLVLFAY 1271

Query: 887  NSEGATDEERGRLEVVGEYHLGEFVN-RFRHGSL-VMR---LPDSDVGQIPTVIFGTVNG 941
            + E        RL    + H+G   N  +R  +  V R   LP+     +   + GT++G
Sbjct: 1272 DPEDHESHGGSRLTKRADMHIGSRANCMWRVAACGVDRSTGLPNQPYAGVHITMMGTLDG 1331

Query: 942  VIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWR 980
             I  +  +  + Y  L  LQ  +   ++ + GLN + +R
Sbjct: 1332 SICHVLPVAEKVYRRLLMLQNIMITGLQHIAGLNPKAFR 1370


>gi|401426063|ref|XP_003877516.1| putative CPSF-domain protein [Leishmania mexicana MHOM/GT/2001/U1103]
 gi|322493761|emb|CBZ29051.1| putative CPSF-domain protein [Leishmania mexicana MHOM/GT/2001/U1103]
          Length = 1347

 Score = 65.9 bits (159), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 75/346 (21%), Positives = 138/346 (39%), Gaps = 69/346 (19%)

Query: 734  VGTAYVLPEENEPTKGRILVFIVED----GKLQLIAEKETKGAVYSLNA---FNGKLLAA 786
            +G+++  P+E     GRI    + +     +L LIA K+  GA+    A   + G++   
Sbjct: 925  IGSSFTFPDEQRARSGRITWCALREEHQRQRLHLIASKDIGGALQCCAAVPHYKGRIALG 984

Query: 787  INQKIQLYKWMLRDDGTRELQSECG---HHGHILALYVQTRGDFIVVG-DLMKSISLLIY 842
            +N  + LY W   +D T   +  C        ++ LY  +    ++V  D+  S   +  
Sbjct: 985  VNGCVCLYNWNT-EDQTFVAEERCRIGLTVTKLIPLYDTSLAASVLVALDVRHSAFFIEV 1043

Query: 843  KHEEGAIEERARDYNANWMSAVEILDDDIYLGAEN--------NFNLFTV---------R 885
               +G+++   RD +        ++D  I   AEN        NF    V          
Sbjct: 1044 DTLQGSLKVLCRDADLRG-----VMDGHIGSDAENLCLFDDSLNFTAMKVVPLPVEAGDG 1098

Query: 886  KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSL----VMRLPDS---DVGQI------- 931
              +   +   + R EV  + HLG+ V   R G      +M  P +      ++       
Sbjct: 1099 DAAAAGSVTAQYRFEVRAQCHLGDLVTCVRPGCFAATSLMEAPAACSLSRNRLLLPGIAG 1158

Query: 932  PTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWR----------- 980
            P ++F T +G  GV+  +    YL L  L+ +L + ++ VGGL+H+ +R           
Sbjct: 1159 PQLVFATAHGGFGVVTPVQAATYLVLRALEASLVRTLQPVGGLSHQAFREVLRAGQERGV 1218

Query: 981  ---------SFNNEK-KTVDAKNFLDGDLIESFLDLSRTRMDEISK 1016
                     +   E+ +  +  N +DGD++E F+ L  +    I +
Sbjct: 1219 SYLASKTGCALTRERLRRYEPLNTVDGDMVELFVQLPPSDKKHICR 1264


>gi|313215162|emb|CBY42850.1| unnamed protein product [Oikopleura dioica]
          Length = 228

 Score = 65.5 bits (158), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 56/219 (25%), Positives = 106/219 (48%), Gaps = 10/219 (4%)

Query: 826  DFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG-----AENNFN 880
            ++ +VGD+ + I+LL ++ E   I + +R   A  ++AV IL D   +G      + N  
Sbjct: 5    NYALVGDIQQGITLLRHQGERNCISQISRARRAGEVTAVGILLDGNQVGLVSTDMQRNLQ 64

Query: 881  LFTVR---KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFG 937
            ++  +   K S G     R     +G+  +  + +  R      ++  ++        + 
Sbjct: 65   VYMYKPDQKESNGGKQLVRQADINLGKRVISIWNSLGRQNDTFTKVALTENDARHVTFYA 124

Query: 938  TVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVD--AKNFL 995
             ++G IG I  +  + +  LE LQT ++  +   GGLN  ++R   NE + ++  AKN +
Sbjct: 125  GLDGSIGDIVPVSEKVFRRLEMLQTLVQSHLPHYGGLNPREYRYCTNEYRDLENAAKNII 184

Query: 996  DGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTR 1034
            DGDL+E F  LS T   ++S+ + V+ E L   + ++ R
Sbjct: 185  DGDLLERFNGLSFTEQTDLSRKIGVTREALLDDMMDVQR 223


>gi|392558419|gb|EIW51607.1| hypothetical protein TRAVEDRAFT_176174 [Trametes versicolor FP-101664
            SS1]
          Length = 1431

 Score = 65.5 bits (158), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 111/515 (21%), Positives = 215/515 (41%), Gaps = 65/515 (12%)

Query: 565  LTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNK---KLLYSNVNLKEVSHM 621
            L ++K++S    P        +  + VF   DRP+ I S++K   K+  S  ++ +    
Sbjct: 921  LAEQKRISRLLVPFVTSPAPGQTFSGVFFTGDRPSWILSTDKGGVKVFPSGHSVVQA--- 977

Query: 622  CPFNSAAFPDS----LAIAKEGELTIGTIDDIQ---KLHIRSIPLGE-HPRRICHQEQSR 673
              F +++  +S    L  ++EG   +  + D+Q    L  RS+P    +   +     S 
Sbjct: 978  --FTTSSLWESRGDFLLYSEEGPSLVEWMPDVQLDGHLPARSVPRSRPYSNVVFDASTSL 1035

Query: 674  TFAICSLKNQSCAEESEMHFV----------RLLDDQTFEFIST---YPLDTFEYG---- 716
              A  S +N+  + + + + V           L +  T E IS      +D +E+     
Sbjct: 1036 IVAASSFQNRFASYDEDGNVVWEPDSPNISSPLCECSTLELISPDGWITMDGYEFAPNEF 1095

Query: 717  ----CSILSCSFSDDSNV--YYCVGTAYVLPEENEPTKGRILVF----IVEDG------- 759
                 SI   + S +S +  +  VGT  +   E+   KG + +F    +V D        
Sbjct: 1096 VNCIVSIPLETMSTESGMKDFIAVGTT-INRGEDLAVKGAVYIFEIVEVVPDPSTHVKRW 1154

Query: 760  -KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILA 818
             +L+L+   + KG V  L   NG L++++ QKI +  + L +        + G    +  
Sbjct: 1155 WRLKLLCRDDAKGPVSFLCGINGYLVSSMGQKIFVRAFDLDERLVGVAFLDVG----VYV 1210

Query: 819  LYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENN 878
              ++   + +V+GD +KS+  + ++ +   +    +D     ++  ++   D  L     
Sbjct: 1211 TSLRAVKNLLVIGDAVKSVWFVAFQEDPYKLVVLGKDPQLCCITRADLFFADGQLSIVTC 1270

Query: 879  FNLFTVRKNSEGATDEERG---RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPT-- 933
                 VR  +    D E      L    E+H G+  + +R   LV R P +   +IP   
Sbjct: 1271 DEEGIVRLYAYDPHDPESKSGQHLLRRTEFH-GQ--SEYRSSMLVARRPKNGDPEIPQAR 1327

Query: 934  VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVD-AK 992
            ++ G+V+G +  +  +       L  LQ  L + ++ V  LN + +R   NE  +   +K
Sbjct: 1328 LVCGSVDGSLSTLTYVDEAASKRLHLLQGQLIRTVQHVAALNPKAFRMVRNEYVSRPLSK 1387

Query: 993  NFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCK 1027
              LDG+L+ +F DL   R +E+++ +      + K
Sbjct: 1388 GILDGNLLATFEDLPIARQNEVTRQIGTDRATVLK 1422


>gi|345566738|gb|EGX49680.1| hypothetical protein AOL_s00078g169 [Arthrobotrys oligospora ATCC
            24927]
          Length = 1407

 Score = 65.1 bits (157), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 106/488 (21%), Positives = 187/488 (38%), Gaps = 60/488 (12%)

Query: 591  VFAASDRPTVIYSSNKKL--LYSNVNLKEVSHMCPFNSAAFPDS-LAIAKEGELTIGTID 647
             F A D PT I  S+K L  LY  +    V  + PFN+       L I  +G   +    
Sbjct: 913  AFMAGDTPTFITKSSKTLPKLY-KLQGGMVRSLSPFNTKETERGFLYIDSKGTARVCHFP 971

Query: 648  DIQKLHI---RSIPLGEHPRRICHQEQSRTFAICSLK------------------NQSCA 686
            ++   H    + IPL   P  + + +    + +  L                   +++  
Sbjct: 972  EVSMEHTWLSQRIPLERTPTSLTYYDPKNVYVVSVLSTSKPEVDDEDFQMEEGLVDETLL 1031

Query: 687  EESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSF-----SDDSNVYYCVGTAYVLP 741
             E E   + ++   T+     Y     E    + +        + +  V   VGT  +L 
Sbjct: 1032 PELETGHLVMISPVTWTTTDRYEFPVHEVPFVVKAVELEISEVTKERKVLIAVGTG-LLR 1090

Query: 742  EENEPTKGRILVFIVED-----------GKLQLIAEKETKGAVYSLNAFNGKLLAAINQK 790
             EN P +G + VF V D            K +LI+ +E KG V +L   +G LL    QK
Sbjct: 1091 GENSPARGAVYVFDVIDVVPEIGKPETGKKFKLISREEVKGVVSTLAGMDGYLLITHGQK 1150

Query: 791  IQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIE 850
              +    L++DG+    +    + H      +T    ++ GD++K +S + +  E   + 
Sbjct: 1151 CMIRG--LKEDGSLLPVAFMDMNTHTTV--AKTLEKMVMFGDVLKGVSFVGFSEEPYKMI 1206

Query: 851  ERARDYNANWMSAVEILDDDI---YLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHL 907
               +D     ++A + L       ++ A+   N+  ++ + E        RL   GE + 
Sbjct: 1207 LFGKDPRQLSITAGDFLPAGTACYFVVADAQSNIHVLQYDPENPKSIHGNRLLPKGEIYC 1266

Query: 908  GEFVNRF----RHGSLVMRLPDSDVGQIPT----VIFGTVNGVIGVIASLPHEQYLFLEK 959
            G  V       +  SL     + D+ +        +F T+ GV G ++S+    Y  L  
Sbjct: 1267 GHEVKSICILPKKKSLFTEPDEDDMDEDEDEEFLCMFSTMTGVFGTVSSITESMYRRLNV 1326

Query: 960  LQTNLRKVIKGVGGLNHEQWRS--FNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKT 1017
            +Q  +    + + GLN   +R+  F N   +   +  LDG L+  +L L   R  E++  
Sbjct: 1327 IQGQITNTGEHIAGLNPRAYRAAKFRN-TSSEPMRAILDGKLLVRWLMLGAGRRKELAGR 1385

Query: 1018 MNVSVEEL 1025
               S E L
Sbjct: 1386 AGTSEEML 1393


>gi|50549733|ref|XP_502337.1| YALI0D02717p [Yarrowia lipolytica]
 gi|74634908|sp|Q6CAH5.1|RSE1_YARLI RecName: Full=Pre-mRNA-splicing factor RSE1
 gi|49648205|emb|CAG80525.1| YALI0D02717p [Yarrowia lipolytica CLIB122]
          Length = 1143

 Score = 64.7 bits (156), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 155/735 (21%), Positives = 283/735 (38%), Gaps = 102/735 (13%)

Query: 350  MWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQL 409
            +W++R       D ++V+S+ + T  L + + D + ET   G      TL C  ++ +  
Sbjct: 459  LWTMRDGAGS--DKYIVLSYANAT--LVLEIGDSVVETTSSGLTLDKPTLHC-GSVGSSY 513

Query: 410  VQVTSGSVRLVSSTSRELRNE------WKSPPGYSVNVATANASQVLLATGGGHLVYLEI 463
            VQV +  + ++   SRE  +E      W +P G  V  A++++ QV+L      L Y E 
Sbjct: 514  VQVMTDGMNVIP-MSREGSSESLPATKWTAPSG-QVICASSSSHQVVLGLTSS-LFYFED 570

Query: 464  GDGILTEVKHAQLEYEIS----CLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNL 518
              G  +E+      YE+S     + + P+      S   AV    D +VRI S+ P+   
Sbjct: 571  TPG--SELSAYDGAYELSSPPTAVAVAPVPAGRVRSPFVAVAT-DDETVRIVSVDPESMF 627

Query: 519  ITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDR--KKVSLGTQ 576
             T    G      S+ L +   + YL   L +G  +   L+  TGE+     K V LG  
Sbjct: 628  ETVAVQGLMATASSLALLSVGQVLYLHMGLANGVYVRVELDPLTGEIVGSWSKFVGLGRL 687

Query: 577  PITLRTFSSKNTT------------HVFAASDRPTVIYSSNKKLLYS--NVNLKEVSHMC 622
             +   T   + +             HV A SD   V    N    ++   ++ + +    
Sbjct: 688  SVVPVTCGGEESILVSSRGVKTCLGHVNATSDT-WVPTGGNSAPFFALDAISGEPLDLAH 746

Query: 623  PFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKN 682
             F++   P  +       L I T++  QK     + L    +R+  Q  + T  I    +
Sbjct: 747  SFHTQDCPHGVIGVAGSTLKIFTVNTAQKWTENEVKLEGTAKRLI-QHDATTLTITQNPD 805

Query: 683  QSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPE 742
            +  +          +D+           D      SI    F D    Y+ VG +     
Sbjct: 806  RLVS----------VDNGAVGITK----DLGGPPTSICEVMFGDGKR-YFAVGGSRDGSP 850

Query: 743  ENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDG 802
                T G I +F      L  +   E +    +L A+NG L+A I  +++LY   L+   
Sbjct: 851  GTSGTSGYISIF--SSSSLGHVHTTEVEAPPLALCAYNGLLVAGIGSQVRLYALGLKQV- 907

Query: 803  TRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGA--IEERARDYNANW 860
             R+ Q E       LA +  +  + + VGD+ +S+++ +   E+    I     D  +  
Sbjct: 908  LRKAQIELSKRVTCLAHFAGS--NRVAVGDIRQSVTVCVVLEEDSGHVIYPLVCDKISRQ 965

Query: 861  MSAVEILD-DDIYLGAE-NNFNLFTVRKNSEGATDEERGRLEVV-------GEYHLGEFV 911
            ++ +  +D + + LG     F +  +   +    DE+   + +        G  H     
Sbjct: 966  VTCLFFVDYETVALGDRFGGFTMLRIPSEASKLADEDHNAVHLRQLEPTLNGPAHF---- 1021

Query: 912  NRFRHGSLVMRLPDSDVGQIPTVI------------FGTVNGVIGVIASLPHEQYLFLEK 959
             RF H      +    +  +P  I             GTV+  + V++    +Q   L+ 
Sbjct: 1022 -RFDH------VASFHIEDVPVAIHMYNDYLVVCGLLGTVSAFVPVVSP---KQSRDLKT 1071

Query: 960  LQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN 1019
            ++  +     G+ G +H ++R +      V  K  +DGD++   L +   R +E+ +   
Sbjct: 1072 IEKFVCASDPGLMGRDHGRFRGYY-----VPVKEVVDGDMLREVLVMDEKRREEVGEKTG 1126

Query: 1020 VSVEELCKRVEELTR 1034
            + VE    RV  + +
Sbjct: 1127 LGVEGAVGRVVNVMK 1141


>gi|407039323|gb|EKE39583.1| CPSF A subunit region protein, putative [Entamoeba nuttalli P19]
          Length = 1108

 Score = 64.7 bits (156), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 167/790 (21%), Positives = 316/790 (40%), Gaps = 92/790 (11%)

Query: 298  PIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSST 357
            P+    VVD     Q +     G+    SL+I+ +G+  +E  S+ LQ  + +W+L+   
Sbjct: 350  PLTSIDVVDSHEVLQIRAFVGKGS--RSSLKILTHGLNPDECMSLSLQHPRNIWTLKPYN 407

Query: 358  DDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSV 417
            +     ++V+   ++T +L   L D L++    G     QTL     I   LVQ+ +  +
Sbjct: 408  EQTHQ-YVVIGLENQTYVLK-TLPDALKQCPECGIRPNVQTLHAGMFIDGTLVQIHAHGI 465

Query: 418  RLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGH----LVYLEIGDGILTEVKH 473
              + +   E  +E       ++ V T+++ QV++A         +VY E      TE K 
Sbjct: 466  ITILN---EKLSEQDPGAQRTILVGTSSSCQVVIALKNAQSKCEIVYFEYN----TETKK 518

Query: 474  -AQLEYEISCLDINPIG----ENPSYSQIAAV---GMWTDISVRIFSLPDLNLITKEHLG 525
             A++E  I   +I  +     EN    ++A +   GM   +++   S     L T   + 
Sbjct: 519  LAEVERRIIRSNIIAMALGDFENGKAKKVAYICDNGMANILTLEKESKFLQILETNITVE 578

Query: 526  GEIIPRSVLLCAFEGIS--YLLCALG--DGHLLNFLLNMKTGELTDRKKVSLGTQPITLR 581
            GE    S++L  F  IS  +L   +G  DG    + +  K  EL   K +  GT P+   
Sbjct: 579  GE--AESIVLTQFNEISEPFLFLGIGMKDGCCYGYKVIGKNSELMWCKLI--GTDPVVFG 634

Query: 582  TFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGEL 641
             +       V  +S     +YS   ++ ++ ++ + +  + PF +    +S+A     EL
Sbjct: 635  QYEFGGNIGVIMSSSTMIFMYSYFSRIEFTPISYQPLKALAPFITDFSGESIAGITNDEL 694

Query: 642  T---IGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLD 698
            T   IG++DD  +      PL   PR++       T  +CS       +     +   +D
Sbjct: 695  TVIAIGSLDD--RFTYDEQPLLYTPRKVIKSNYFPTTVLCS-------DNKSNLYNPTID 745

Query: 699  DQTFEFIS-TYPLDTFEYG----------CSILSCSFSDDSNVY----------YCVGTA 737
            ++T EF      +D  E G           SI    F  D  +            C  ++
Sbjct: 746  NETNEFNERKIGIDLQEDGKWASEVGIIDNSIFKSYFKTDEAIICGAHVSFKKKQCFISS 805

Query: 738  YVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWM 797
             V  +  +  KG   + +      +LI +   +   ++L +   +LLA I   ++LY   
Sbjct: 806  QV--KHYKKMKGECWINVYNLKTFKLIHKTPVEDICHALTSCGERLLAGIGTTLRLY--- 860

Query: 798  LRDDGTRELQSECGHHGHILAL-YVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDY 856
              D G + L  +C   G    +  ++  GD IVVG +      + Y  +   +    +D 
Sbjct: 861  --DLGKQILIRKCELDGFPSIINSLEVIGDKIVVGTVATGFIYVNYDSDANILSITEKDR 918

Query: 857  NANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGA----TDEERGRLEVVGEYHLGEFVN 912
              + ++A  ILD+   +G +   ++F    N+       TD      E+V ++++G+ V 
Sbjct: 919  IWHSLTASTILDETSTIGFDKLGSVFITETNTNSNQLNLTDIIPLSNEIV-QWYVGDVVT 977

Query: 913  RFRHGSLVMRLPDSD-----VGQIP----TVIFGTVNGVIGVIASLP-HEQYLFLEKLQT 962
                  +   + D+        QI      +I+  + G IGV+      E   F  KL+ 
Sbjct: 978  SVSVNEIWKGITDNTNEYTFEEQIKENKNVIIYSCLMGRIGVLIPFNFREDVEFFSKLEM 1037

Query: 963  NLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSV 1022
             ++     +   + + +R  N     V     +DGDL + F  +      +I+  + ++ 
Sbjct: 1038 EIKNNYSPLLSNSFDSYRGTNYPGIGV-----IDGDLCDYFNQMDPKLQLQIANNLEMTP 1092

Query: 1023 EELCKRVEEL 1032
             ++  + E+ 
Sbjct: 1093 VQIQLKCEQF 1102


>gi|402591342|gb|EJW85272.1| hypothetical protein WUBG_03818, partial [Wuchereria bancrofti]
          Length = 1025

 Score = 64.7 bits (156), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 94/439 (21%), Positives = 178/439 (40%), Gaps = 94/439 (21%)

Query: 71  IDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEEL-----QVLDIKFLYGCAKPTI 125
           +DP  R   + ++     V+PF++      ++ ++L ++      V+D+ FL G  +PT+
Sbjct: 140 VDPGQRCASMLVFGRYLAVLPFNDSSAQLHSYTVQLSQIDSRLVNVVDMVFLDGYYEPTL 199

Query: 126 VVLYQDNKDA--RHVKTYE------VALKDKDFV-EGPWSQNNLDNGADLLIPVPPPLCG 176
           + LY+  +    R    Y+      V+L  K+ V    W   NL    + ++ +P P+ G
Sbjct: 200 LFLYEPVQTTCGRACVRYDTMCVLGVSLNVKEQVLASVWQLTNLPMDCNQILAIPRPVGG 259

Query: 177 VLIIGEETIVYCS-------------ANAFKAIPIRPSITKAYGRVDADGS--------R 215
           +L++    ++Y +              + F   P++     A   +  DG+        +
Sbjct: 260 ILLVATNELIYLNQSVPPCGISLNSCMDGFTKFPLKDFKHMA---LTLDGAVVTVVSTNK 316

Query: 216 YLLGDHAG-LLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDS--- 271
            LL D  G L  L+++T     V  L+++   ET I  T++      ++IGS   DS   
Sbjct: 317 ILLCDRNGRLFTLILVTDATNSVKSLELKFQFETVIPCTMTSCAPGYLFIGSRLCDSVFL 376

Query: 272 --------------QLIKLNLQPDAKGS------------------------YVEVLERY 293
                         + IKL+ +P+A                            + VL++ 
Sbjct: 377 HCIFEQSTLEESATKKIKLSTEPNANEEDEDFELYGEMLPKVAKPDITEELLNIRVLDKL 436

Query: 294 VNLGP---IVDFC------VVDLERQGQGQVVTCS-GAYKDGSLRIVRNGIGINEQASVE 343
           +N+GP   I   C        ++ R+     + C+ G  K GS+ I++  I      S  
Sbjct: 437 LNVGPCKKITGGCPSISAYFQEITRKDPLFDLVCACGHGKFGSICILQRSIRPEIITSSS 496

Query: 344 LQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHD 403
           ++G+   W++    DD    F+      E   LA+  +++L E E   F +   T+   +
Sbjct: 497 IEGVVQYWAIGRREDDTHMYFIAS---RELGTLALETDNDLVELEAPIFSTSESTIAAGE 553

Query: 404 AIYNQL-VQVTSGSVRLVS 421
                L VQVT+ S+ +V+
Sbjct: 554 LADGGLAVQVTTSSLVMVA 572


>gi|296806499|ref|XP_002844059.1| conserved hypothetical protein [Arthroderma otae CBS 113480]
 gi|238845361|gb|EEQ35023.1| conserved hypothetical protein [Arthroderma otae CBS 113480]
          Length = 1348

 Score = 64.3 bits (155), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 100/434 (23%), Positives = 174/434 (40%), Gaps = 60/434 (13%)

Query: 655  RSIPLGEHPRRICHQEQSRTFAI-CSLKNQ-SCAEESEMHF-----------------VR 695
            R IPLGE    I +   S ++ I  S+K      E+ E H                  V+
Sbjct: 912  RKIPLGEQVDCIVYSSASESYVIGTSVKEDFKLPEDDESHTEWQNEFITFLPQLERGTVK 971

Query: 696  LLDDQTFEFISTYPLDTFEYGCSILSC---------SFSDDSNVYYCVGTAYVLPEENEP 746
            LLD + +      P          ++C           + +      VG+A V  E+  P
Sbjct: 972  LLDPKNWSIADIAPSSHELEPAERITCIEVIRLEISEITHERKDMVVVGSAIVKGEDIVP 1031

Query: 747  TKGRILVFIVED-----------GKLQLIAEKETKGAVYSLNAF--NGKLLAAINQKIQL 793
             KG I VF + D            +L+L A +E KGAV +L+     G L+ A  QK  +
Sbjct: 1032 -KGCIRVFEIIDVVPDPDHSEMNKRLKLFAREEVKGAVTALSGIGSQGFLIVAQGQKCMV 1090

Query: 794  YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
                L++DG+    +      ++  L         +VGD +K +    Y  E   ++   
Sbjct: 1091 RG--LKEDGSLLPVAFKDAQCYVSVLKELKGTGMCIVGDAIKGLWFTGYSEEPYKLDLFG 1148

Query: 854  RDYNANWMSAVEILDDD--IY-LGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEF 910
            ++     + A + L D   +Y L A+++ NL  ++ + E  +  +  RL     +H+G F
Sbjct: 1149 KENENIAVIAADFLPDGNRLYVLVADDDCNLHVLQYDPEDPSSSKGDRLLHRNVFHVGHF 1208

Query: 911  VNRFR---HGSLVMRLP------DSDVGQIPT---VIFGTVNGVIGVIASLPHEQYLFLE 958
             +       GS     P      D+D    P+   ++     G +G+I  L  + Y  L 
Sbjct: 1209 ASTMTLLPQGSHTPHSPADRDAMDTDAPLPPSKYQILMTFQTGSVGIITPLNEDSYRRLL 1268

Query: 959  KLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTM 1018
             LQ+ L   ++   GLN   +R+  ++      +  +DG+L+  +LD+   R  EI+  +
Sbjct: 1269 ALQSQLVNALEHPCGLNPRGYRAVESD-GIGGQRGMIDGNLLLRWLDMGAQRKAEIAGRV 1327

Query: 1019 NVSVEELCKRVEEL 1032
               V  +   +E+L
Sbjct: 1328 GADVGAIRMDLEKL 1341


>gi|237839083|ref|XP_002368839.1| hypothetical protein TGME49_067710 [Toxoplasma gondii ME49]
 gi|211966503|gb|EEB01699.1| hypothetical protein TGME49_067710 [Toxoplasma gondii ME49]
          Length = 2136

 Score = 63.9 bits (154), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 59/224 (26%), Positives = 93/224 (41%), Gaps = 62/224 (27%)

Query: 733  CVGTAYVLPEENEPTKGRILVFIVE---DGKL--QLIAE-KETKGAVYSLNAFNGKLLAA 786
            CVGTA V+  ++EP +G I++  V    DG+L  QL A   E +  V  L  F G LLAA
Sbjct: 1674 CVGTAEVMASDSEPKEGLIILLDVRRGTDGRLVVQLAAHIHELRSGVQQLRPFQGLLLAA 1733

Query: 787  INQKIQLYKWMLRDDGTRELQSE------------------------------------- 809
             N +++L+      +G   L++E                                     
Sbjct: 1734 CNHRVRLFGLRRELEGAAGLEAEMEGDSPQASEVHAGDRGRSSAGRRQGARARDRTEEKG 1793

Query: 810  --------------CGHHGHILALYVQTRGDFIV-VGDLMKSISLLIYKHEEGAIEERAR 854
                          C H  +     + T  D IV +GD++ S  LL +  ++ A +E AR
Sbjct: 1794 RELCEMLKARLELVCSHSSNAFVSSMDTWHDEIVCLGDMVASAVLLRFSPKDNAFKEIAR 1853

Query: 855  DYNANWMSAVEILDDDIYLGAENNFNLFTVRK----NSEGATDE 894
            D NA W  AV  L  +++  A+ + N++ + +     SEG  +E
Sbjct: 1854 DTNACWTLAVSCLSPNLHYLADADRNIWLLERASLLGSEGRREE 1897



 Score = 47.0 bits (110), Expect = 0.054,   Method: Compositional matrix adjust.
 Identities = 25/73 (34%), Positives = 40/73 (54%), Gaps = 1/73 (1%)

Query: 935  IFGTVNGVIGVIASLPHEQ-YLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKN 993
            ++ +  G IG +  +P EQ +  L  LQ  + KV K +G L+   + S      TV +K 
Sbjct: 2050 LWASSEGAIGHLLQIPDEQTFARLAVLQDAVTKVTKSIGKLSAVAFHSVKVGTATVPSKG 2109

Query: 994  FLDGDLIESFLDL 1006
            F+DGD++E FL+ 
Sbjct: 2110 FIDGDILERFLEF 2122



 Score = 47.0 bits (110), Expect = 0.063,   Method: Compositional matrix adjust.
 Identities = 38/137 (27%), Positives = 64/137 (46%), Gaps = 32/137 (23%)

Query: 287 VEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQ-ASVELQ 345
           +++L+ + NLGPIVD C+ ++E   Q  ++   G  +  SLR +R+G+ ++   ASV L 
Sbjct: 751 LQLLQVFPNLGPIVDCCLAEVEGLDQRLLIVACGHGRSTSLRFIRSGLALHASCASVALH 810

Query: 346 G-IKGMWSL-------------------RSSTDDPF-----DTFL------VVSFISETR 374
             +  +W+L                   RS T   F     D  L      V++F  ETR
Sbjct: 811 APVHRLWTLDLSPYAIAASATGMRISGRRSHTRGEFSKTRADALLARGPIAVLAFPHETR 870

Query: 375 ILAMNLEDELEETEIEG 391
           +LA  L+   + +  +G
Sbjct: 871 VLAWTLKPRGDRSNPKG 887



 Score = 40.0 bits (92), Expect = 6.2,   Method: Compositional matrix adjust.
 Identities = 30/107 (28%), Positives = 48/107 (44%), Gaps = 15/107 (14%)

Query: 26  LFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDNGQIGIIDPDCRLIGLHLYDG 85
           L + T+R    +L++D E   ++T    D+ +   R  D G +  + P  R I ++ Y+ 
Sbjct: 186 LLVLTDRQILLLLRYDEEKQTVVTVESIDLKEVSTRFVDGGPLMAVCPTTRCIVVYQYES 245

Query: 86  LFKVIPF--DNKGQLKEAFN-------------IRLEELQVLDIKFL 117
           L + +PF  +    L  A N              RL+E  VLDI FL
Sbjct: 246 LLQYLPFGPEASSSLSPARNHEAGSSLFSGVRLFRLDEQNVLDICFL 292


>gi|393907593|gb|EJD74705.1| CPSF A subunit region family protein [Loa loa]
          Length = 990

 Score = 63.9 bits (154), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 94/438 (21%), Positives = 178/438 (40%), Gaps = 93/438 (21%)

Query: 71  IDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEEL-----QVLDIKFLYGCAKPTI 125
           +DP  R   + ++     V+PF++ G    ++ ++L ++      V+D+ FL G  +PT+
Sbjct: 140 VDPGQRCAAMLVFGRYLAVLPFNDSGAQLHSYTVQLSQIDSRLVNVVDMVFLDGYYEPTL 199

Query: 126 VVLYQDNKDA--RHVKTYE------VALKDKDFV-EGPWSQNNLDNGADLLIPVPPPLCG 176
           + LY+  +    R    Y+      V+L  K+ V    W   NL    + ++ +P P+ G
Sbjct: 200 LFLYEPVQTTCGRACVRYDTMCVLGVSLNVKEQVLASVWQLTNLPMDCNQILAIPRPVGG 259

Query: 177 VLIIGEETIVYCS-------------ANAFKAIPIRPSITKAYGRVDADG--------SR 215
           +L++    ++Y +              + F   P+R      +  +  DG        ++
Sbjct: 260 ILLVATNELIYLNQSVPPCGISLNSCMDGFTKFPLR---DFKHMVLTLDGCVVTVISTNK 316

Query: 216 YLLGDHAG-LLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDS--- 271
            LL D  G L  L+++T     V  L+++   +T I  T++      ++IGS   DS   
Sbjct: 317 ILLCDRNGRLFTLVLVTDATNSVKSLELKFQFKTVIPCTMTSCAPGYLFIGSRLCDSVFL 376

Query: 272 --------------QLIKLNLQ---------------------PDAKGSY--VEVLERYV 294
                         + IKLN +                     PD+      + VL++ +
Sbjct: 377 HCIFEQSTLDESAPKKIKLNTELNANEDEDFELYGEVLPKVAKPDSAEELLNIRVLDKLL 436

Query: 295 NLGP---IVDFC------VVDLERQGQGQVVTCS-GAYKDGSLRIVRNGIGINEQASVEL 344
           N+GP   I   C        ++ R+     + C+ G  K GS+ I +  +      S  +
Sbjct: 437 NVGPCKKITGGCPSISAYFQEVTRKDPLFDLVCACGHGKFGSICIFQRSVRPEIVTSSSI 496

Query: 345 QGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDA 404
           +G+   W++    DD    F+      E   LA+  +++L E E   F +   T+   + 
Sbjct: 497 EGVVQYWAVGRREDDTHMYFIAS---KELGTLALETDNDLVELEAPIFATSEPTIAAGEL 553

Query: 405 IYNQL-VQVTSGSVRLVS 421
               L VQVT+ S+ +V+
Sbjct: 554 ADGGLAVQVTTSSLVMVA 571


>gi|68060004|ref|XP_671977.1| hypothetical protein [Plasmodium berghei strain ANKA]
 gi|56488645|emb|CAI04030.1| hypothetical protein PB301494.00.0 [Plasmodium berghei]
          Length = 346

 Score = 63.9 bits (154), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 52/251 (20%), Positives = 108/251 (43%), Gaps = 29/251 (11%)

Query: 217 LLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKL 276
           L+    G L+ + + HE   V  +  +      IA++I  L +  +++ + +G+    + 
Sbjct: 90  LIQSEYGDLYKIEVNHEDGIVKEIICKYFDTVPIANSICVLKSGALFVAAEFGNHFFYQF 149

Query: 277 N-------------LQPDAKGSYVE----------VLERYVNLGPIVDFCVVDLERQGQG 313
           +               P  K + +           ++++  +L PIVD  ++D +     
Sbjct: 150 SGIGNDSNESMCTSNHPSGKNAIIAFKTQKLKNLYLVDQIYSLSPIVDMKILDAKNSNIP 209

Query: 314 QVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTDDPFDTFLVVSFISE 372
           Q+    G     SLRI+++G+ I E A+ EL G  + +W+++      +D +++VSF   
Sbjct: 210 QIYALCGRGPRSSLRILQHGLSIEELANNELPGKPRYIWTIKKDNSSEYDGYIIVSFEGN 269

Query: 373 TRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWK 432
           T IL +   + +EE       +   T+  +    N  +QV    +R ++    ++  EW 
Sbjct: 270 TLILEIG--ETVEEVYDSLLLTNVTTIHINLLYDNSFIQVYDTGIRHING---KIVQEWV 324

Query: 433 SPPGYSVNVAT 443
            P    ++ AT
Sbjct: 325 PPKNKQIDAAT 335


>gi|374109159|gb|AEY98065.1| FAFR382Wp [Ashbya gossypii FDAG1]
          Length = 1288

 Score = 63.9 bits (154), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 94/389 (24%), Positives = 161/389 (41%), Gaps = 31/389 (7%)

Query: 217 LLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKL 276
           L  +H  L  + ++  EK     + I        A  +    +  ++  S YG+S L + 
Sbjct: 329 LQSNHGDLYKVKILPDEKTASPVVTISYFDTIPQAQNLHIFKHGYMFANSEYGNSYLYQF 388

Query: 277 -NLQPDAKGSYVEVLE-RYVNLGP---IVDFCVVD--------LERQGQGQVVTCSGAYK 323
            NL  + +     V+  R + + P   + +  V D        L  Q   +V        
Sbjct: 389 ENLDDEEESMLTSVMPGRRLIIEPRTVLKNLLVADKLALVNPILSSQLTERVPLTIATST 448

Query: 324 DGSLRIVRNGIGINEQASVELQGIK-GMWSLRSSTDDPFDTFLVVSFISETRIL--AMNL 380
            G +R+   G+   +  S  L      +W++ ++    F   L ++    T IL  A   
Sbjct: 449 LGDVRLFTAGVNFMDIISSPLPAAPLDIWTV-ATNGSRFHKLLFIALQESTMILKIAAGT 507

Query: 381 EDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGS-VRLVSST--SRELRNEWKSPPGY 437
            +ELE        +Q +T+         ++QVT    V ++ +   S E + EW  P G 
Sbjct: 508 VEELELPHNPFVIAQDKTVLIAHMGGQSIIQVTENKMVHIIENRDESYESKLEWFPPAGI 567

Query: 438 SVNVATANASQVLLATGGGHLVYLEIGDG-ILTEVK-HAQLEYEISCLDINPIGENPSYS 495
            +  A++N++Q++LA     +VY EIG    L E++   ++E  I+ L I   G    Y 
Sbjct: 568 CILKASSNSTQLILALSNNEVVYFEIGSNESLNELQDRIEVEERITALAIGN-GNRSDYM 626

Query: 496 QIAAVGMWTDISVRIFSLP---DLNLITKEHLGGEIIPRSVLLCAFEGISYLL-CALGDG 551
            IA+V    D +V+++SL      N +    +   + P S L  A  G S  L   L  G
Sbjct: 627 IIASV----DSTVKVYSLKVQDQANFLEVVSMQVLVSPASSLQLASSGGSLCLHIGLDSG 682

Query: 552 HLLNFLLNMKTGELTDRKKVSLGTQPITL 580
             +   L+  TGEL D +   LGT+P+ +
Sbjct: 683 VYVRSKLDRNTGELFDVRTKYLGTKPVEI 711


>gi|221502136|gb|EEE27880.1| conserved hypothetical protein [Toxoplasma gondii VEG]
          Length = 2131

 Score = 63.9 bits (154), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 59/224 (26%), Positives = 93/224 (41%), Gaps = 62/224 (27%)

Query: 733  CVGTAYVLPEENEPTKGRILVFIVE---DGKL--QLIAE-KETKGAVYSLNAFNGKLLAA 786
            CVGTA V+  ++EP +G I++  V    DG+L  QL A   E +  V  L  F G LLAA
Sbjct: 1669 CVGTAEVIASDSEPKEGLIILLDVRRGTDGRLVVQLAAHIHELRSGVQQLRPFQGLLLAA 1728

Query: 787  INQKIQLYKWMLRDDGTRELQSE------------------------------------- 809
             N +++L+      +G   L++E                                     
Sbjct: 1729 CNHRVRLFGLRRELEGAAGLEAEMEGDSPQASEVHAGDRGRSSAGRRQGARARDRTEEKG 1788

Query: 810  --------------CGHHGHILALYVQTRGDFIV-VGDLMKSISLLIYKHEEGAIEERAR 854
                          C H  +     + T  D IV +GD++ S  LL +  ++ A +E AR
Sbjct: 1789 RELCEMLKARLELVCSHSSNAFVSSMDTWHDEIVCLGDMVASAVLLRFSPKDNAFKEIAR 1848

Query: 855  DYNANWMSAVEILDDDIYLGAENNFNLFTVRK----NSEGATDE 894
            D NA W  AV  L  +++  A+ + N++ + +     SEG  +E
Sbjct: 1849 DTNACWTLAVSCLSPNLHYLADADRNIWLLERASLLGSEGRREE 1892



 Score = 47.0 bits (110), Expect = 0.052,   Method: Compositional matrix adjust.
 Identities = 25/73 (34%), Positives = 40/73 (54%), Gaps = 1/73 (1%)

Query: 935  IFGTVNGVIGVIASLPHEQ-YLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKN 993
            ++ +  G IG +  +P EQ +  L  LQ  + KV K +G L+   + S      TV +K 
Sbjct: 2045 LWASSEGAIGHLLQIPDEQTFARLAVLQDAVTKVTKSIGKLSAVAFHSVKVGTATVPSKG 2104

Query: 994  FLDGDLIESFLDL 1006
            F+DGD++E FL+ 
Sbjct: 2105 FIDGDILERFLEF 2117



 Score = 44.3 bits (103), Expect = 0.32,   Method: Compositional matrix adjust.
 Identities = 23/69 (33%), Positives = 42/69 (60%), Gaps = 2/69 (2%)

Query: 287 VEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQ-ASVELQ 345
           +++L+ + NLGPIVD C+ ++E   Q  ++   G  +  SLR +R+G+ ++   ASV L 
Sbjct: 751 LQLLQVFPNLGPIVDCCLAEVEGLDQRLLIVACGHGRSTSLRFIRSGLALHASCASVALH 810

Query: 346 G-IKGMWSL 353
             +  +W+L
Sbjct: 811 APVHRLWTL 819


>gi|326477251|gb|EGE01261.1| protein kinase subdomain-containing protein [Trichophyton equinum CBS
            127.97]
          Length = 1267

 Score = 63.5 bits (153), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 68/308 (22%), Positives = 130/308 (42%), Gaps = 24/308 (7%)

Query: 739  VLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWML 798
            V+PE ++P K +         KL+L A++E KGAV +L+   G+    + Q  +     L
Sbjct: 963  VVPEPDQPEKSK---------KLKLFAKEEVKGAVTALSGIGGQGFLIVAQGQKCMVRGL 1013

Query: 799  RDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNA 858
            ++DG+    +      ++  L         ++GD  K +  + Y  E   ++   ++   
Sbjct: 1014 KEDGSLLPVAFKDTQCYVNVLKELKGTGMCIIGDAFKGLWFIGYSEEPYKLDLFGKENEN 1073

Query: 859  NWMSAVEILDD--DIY-LGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFR 915
              +   + L D   +Y L A+++ NL  ++ + E  +  +  RL     +H G F +   
Sbjct: 1074 LAVVDADFLPDGNKLYILVADDDCNLHVLQYDPEDPSSSKGDRLLHRSVFHTGHFASTMT 1133

Query: 916  ---HGSLVMRLP----DSDVGQIPT----VIFGTVNGVIGVIASLPHEQYLFLEKLQTNL 964
               HG+     P      D   +P     ++     G I VI  L  + Y  L  LQ+ L
Sbjct: 1134 LLPHGAYTPSAPVDEDAMDTDSLPPSKYQILMTFQTGSIAVITPLSEDSYRRLLALQSQL 1193

Query: 965  RKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEE 1024
               ++    LN   +R+  ++      +  +DG+L+  +LD+   R  EI+  +   V  
Sbjct: 1194 VNALEHPCSLNPRGYRAVESDGMG-GQRGMIDGNLLLRWLDMGAQRKAEIAGRVGADVGA 1252

Query: 1025 LCKRVEEL 1032
            +   +E+L
Sbjct: 1253 IRTDLEKL 1260


>gi|67473926|ref|XP_652712.1| splicing factor 3b subunit 3 [Entamoeba histolytica HM-1:IMSS]
 gi|56469590|gb|EAL47325.1| splicing factor 3b subunit 3, putative [Entamoeba histolytica
            HM-1:IMSS]
          Length = 1108

 Score = 63.5 bits (153), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 164/794 (20%), Positives = 319/794 (40%), Gaps = 100/794 (12%)

Query: 298  PIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSST 357
            P+    VVD     Q +     G+    SL+I+ +G+  +E  S+ LQ  + +W+L+   
Sbjct: 350  PLTSIDVVDSHEVLQIRAFVGKGS--RSSLKILTHGLNPDECMSLSLQHPRNIWTLKPYN 407

Query: 358  DDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSV 417
            +     ++V+   ++T +L   L D L++    G     QTL     I   LVQ+ +  +
Sbjct: 408  EQTHQ-YVVIGLENQTYVLK-TLPDALKQCPECGIRPNVQTLHAGMFIDGTLVQIHAHGI 465

Query: 418  RLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGH----LVYLEIGDGILTEVKH 473
              + +   E  +E       ++ V T+++ QV++A         +VY E      TE K 
Sbjct: 466  ITILN---EKLSEQDPGAQRTILVGTSSSCQVVIALKNAQSKCEIVYFEYN----TETKK 518

Query: 474  -AQLEYEISCLDINPIG----ENPSYSQIAAV---GMWTDISV----RIFSLPDLNLITK 521
             A++E  I   +I  +     EN    ++A +   GM   +++    +   + ++N+  +
Sbjct: 519  LAEVERRIIRSNIIAMALGDFENGKAKKVAYICDNGMANILTLEKESKFLQILEINITVE 578

Query: 522  EHLGGEIIPRSVLLCAFEGIS--YLLCALG--DGHLLNFLLNMKTGELTDRKKVSLGTQP 577
                GE    S++L  F  IS  +L   +G  DG    + +  K  EL   K +  GT P
Sbjct: 579  ----GE--AESIILTQFNEISEPFLFLGIGMKDGCCYGYKVIGKNSELMWCKLI--GTDP 630

Query: 578  ITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAK 637
            +    +       V  +S     +YS   ++ ++ ++ + +  + PF +    +S+A   
Sbjct: 631  VVFGQYEFGGNIGVIMSSSTMIFMYSYFSRIEFTPISYQPLKALAPFITDFSGESIAGIT 690

Query: 638  EGELT---IGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFV 694
              ELT   IG++DD  +      PL   PR++       T  +CS       +     + 
Sbjct: 691  NDELTVIAIGSLDD--RFTYDEQPLLYTPRKVIKSNYFPTTVLCS-------DNKSNLYN 741

Query: 695  RLLDDQTFEFIS-TYPLDTFEYG----------CSILSCSFSDDSNVY----------YC 733
              +D++T EF      +D  E G           S+    F  D  +            C
Sbjct: 742  PTIDNETNEFNERKIGIDLQEDGKWASEVGIIDNSLFKSYFKTDEAIICGAHVSFKKKQC 801

Query: 734  VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
              ++ V  +  +  KG   + +      +LI +   +   ++L +   +LLA I   ++L
Sbjct: 802  FISSQV--KHYKKMKGECWINVYNLKTFKLIHKTPVEDICHALTSCGERLLAGIGTTLRL 859

Query: 794  YKWMLRDDGTRELQSECGHHGHILAL-YVQTRGDFIVVGDLMKSISLLIYKHEEGAIEER 852
            Y     D G + L  +C   G    +  ++  GD IVVG +      + Y  +   +   
Sbjct: 860  Y-----DLGKQILIRKCELDGFPSIINSLEVIGDKIVVGTVATGFIYVNYDSDANILSIT 914

Query: 853  ARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGA----TDEERGRLEVVGEYHLG 908
             +D   + ++A  ILD+   +G +   ++F    N+       TD      E+V ++++G
Sbjct: 915  EKDRIWHSLTASTILDETSTIGFDKLGSVFITETNTNSNQLNLTDIIPLSNEIV-QWYVG 973

Query: 909  EFVNRFRHGSLVMRLPDSD-----VGQIP----TVIFGTVNGVIGVIASLP-HEQYLFLE 958
            + V       +   + D+        QI      +I+  + G IGV+      E   F  
Sbjct: 974  DVVTSVSVNEIWKGITDNTNEYTFEEQIKENKNVIIYSCLMGRIGVLIPFNFREDVEFFS 1033

Query: 959  KLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTM 1018
            KL+  ++     +   + + +R  N     V     +DGDL + F  +      +I+  +
Sbjct: 1034 KLEMEIKNNYSPLLSNSFDSYRGTNYPGIGV-----IDGDLCDYFNQMDPKLQLQIANNL 1088

Query: 1019 NVSVEELCKRVEEL 1032
             ++  ++  + E+ 
Sbjct: 1089 EMTPVQIQLKCEQF 1102


>gi|302308830|ref|NP_985929.2| AFR382Wp [Ashbya gossypii ATCC 10895]
 gi|442570014|sp|Q753D4.2|RSE1_ASHGO RecName: Full=Pre-mRNA-splicing factor RSE1
 gi|299790815|gb|AAS53753.2| AFR382Wp [Ashbya gossypii ATCC 10895]
          Length = 1288

 Score = 63.5 bits (153), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 94/389 (24%), Positives = 161/389 (41%), Gaps = 31/389 (7%)

Query: 217 LLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKL 276
           L  +H  L  + ++  EK     + I        A  +    +  ++  S YG+S L + 
Sbjct: 329 LQSNHGDLYKVKILPDEKTASPVVTISYFDTIPQAQNLHIFKHGYMFANSEYGNSYLYQF 388

Query: 277 -NLQPDAKGSYVEVLE-RYVNLGP---IVDFCVVD--------LERQGQGQVVTCSGAYK 323
            NL  + +     V+  R + + P   + +  V D        L  Q   +V        
Sbjct: 389 ENLDDEEESMLTSVMPGRRLIIEPRTVLKNLLVADKLALVNPILSSQLTERVPLTIATST 448

Query: 324 DGSLRIVRNGIGINEQASVELQGIK-GMWSLRSSTDDPFDTFLVVSFISETRIL--AMNL 380
            G +R+   G+   +  S  L      +W++ ++    F   L ++    T IL  A   
Sbjct: 449 LGDVRLFTAGVNFMDIISSPLPAAPLDIWTV-ATNGSRFHKLLFIALQESTMILKIAAGT 507

Query: 381 EDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGS-VRLVSST--SRELRNEWKSPPGY 437
            +ELE        +Q +T+         ++QVT    V ++ +   S E + EW  P G 
Sbjct: 508 VEELELPHNPFVIAQDKTVLIAHMGGQSIIQVTENKMVHIIENRDESYESKLEWFPPAGI 567

Query: 438 SVNVATANASQVLLATGGGHLVYLEIGDG-ILTEVK-HAQLEYEISCLDINPIGENPSYS 495
            +  A++N++Q++LA     +VY EIG    L E++   ++E  I+ L I   G    Y 
Sbjct: 568 CILKASSNSTQLILALSNNEVVYFEIGSNESLNELQDRIEVEERITALAIGN-GNRSDYM 626

Query: 496 QIAAVGMWTDISVRIFSLP---DLNLITKEHLGGEIIPRSVLLCAFEGISYLL-CALGDG 551
            IA+V    D +V+++SL      N +    +   + P S L  A  G S  L   L  G
Sbjct: 627 IIASV----DSTVKVYSLKVQDQANFLEVVSMQVLVSPASSLQLASSGGSLCLHIGLDSG 682

Query: 552 HLLNFLLNMKTGELTDRKKVSLGTQPITL 580
             +   L+  TGEL D +   LGT+P+ +
Sbjct: 683 VYVRSKLDRNTGELFDVRTKYLGTKPVEI 711


>gi|328773280|gb|EGF83317.1| hypothetical protein BATDEDRAFT_21894 [Batrachochytrium dendrobatidis
            JAM81]
          Length = 1673

 Score = 63.5 bits (153), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 68/314 (21%), Positives = 132/314 (42%), Gaps = 40/314 (12%)

Query: 739  VLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWML 798
            V+P+ N P   R         K + +  ++ +    ++   N  LLAAI  KI +Y+   
Sbjct: 1373 VVPDPNNPQTNR---------KFKHVDSEDDRSPFSAICTVNDYLLAAIGPKIIMYQL-- 1421

Query: 799  RDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNA 858
             +DG  E+        ++    + +  + I + D+ KS+  + ++ E   +    RD + 
Sbjct: 1422 -EDG--EITGVAFLDVNVFVTSLSSVKNLIQICDIQKSVWFVAFQEEPAKLAVLGRDVHP 1478

Query: 859  NWMSAVEILDDDIYLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFR 915
                A  +L DD  L    A+ + NL T+    +        RL   GE HLG+ V++F 
Sbjct: 1479 LQGYAANMLIDDNQLALLVADGDKNLHTMIYAPDNVQSLGGERLIRKGEIHLGQHVSKF- 1537

Query: 916  HGSLVMRLPDSDVGQIPTVIF--------GTVNGVIGVIASLPHEQYLFLEKLQTNLRKV 967
                 +R+    + +   ++F         T++G + +I  +    +  L  L + +   
Sbjct: 1538 -----IRMRRKPLLRNDAIVFSKQYLNVAATLDGALEIITPVSERIFKRLYGLYSRMVTS 1592

Query: 968  IKGVGGLNHEQWRSFNNEKKTVDAKNF---------LDGDLIESFLDLSRTRMDEISKTM 1018
            I+ + GLN   +R   +  + +    F         LDGDL+  ++ LSRT+   ++K +
Sbjct: 1593 IEHIAGLNPRGFRQAQHRVRPITLSGFIGPPGPRGILDGDLLYEYVRLSRTQQRGLAKAI 1652

Query: 1019 NVSVEELCKRVEEL 1032
                + L   + E+
Sbjct: 1653 GSKDDRLMDDLLEV 1666


>gi|449706784|gb|EMD46555.1| splicing factor 3b subunit 3, putative [Entamoeba histolytica KU27]
          Length = 1079

 Score = 63.5 bits (153), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 164/794 (20%), Positives = 319/794 (40%), Gaps = 100/794 (12%)

Query: 298  PIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSST 357
            P+    VVD     Q +     G+    SL+I+ +G+  +E  S+ LQ  + +W+L+   
Sbjct: 321  PLTSIDVVDSHEVLQIRAFVGKGS--RSSLKILTHGLNPDECMSLSLQHPRNIWTLKPYN 378

Query: 358  DDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSV 417
            +     ++V+   ++T +L   L D L++    G     QTL     I   LVQ+ +  +
Sbjct: 379  EQTHQ-YVVIGLENQTYVLK-TLPDALKQCPECGIRPNVQTLHAGMFIDGTLVQIHAHGI 436

Query: 418  RLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGH----LVYLEIGDGILTEVKH 473
              + +   E  +E       ++ V T+++ QV++A         +VY E      TE K 
Sbjct: 437  ITILN---EKLSEQDPGAQRTILVGTSSSCQVVIALKNAQSKCEIVYFEYN----TETKK 489

Query: 474  -AQLEYEISCLDINPIG----ENPSYSQIAAV---GMWTDISV----RIFSLPDLNLITK 521
             A++E  I   +I  +     EN    ++A +   GM   +++    +   + ++N+  +
Sbjct: 490  LAEVERRIIRSNIIAMALGDFENGKAKKVAYICDNGMANILTLEKESKFLQILEINITVE 549

Query: 522  EHLGGEIIPRSVLLCAFEGIS--YLLCALG--DGHLLNFLLNMKTGELTDRKKVSLGTQP 577
                GE    S++L  F  IS  +L   +G  DG    + +  K  EL   K +  GT P
Sbjct: 550  ----GE--AESIILTQFNEISEPFLFLGIGMKDGCCYGYKVIGKNSELMWCKLI--GTDP 601

Query: 578  ITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAK 637
            +    +       V  +S     +YS   ++ ++ ++ + +  + PF +    +S+A   
Sbjct: 602  VVFGQYEFGGNIGVIMSSSTMIFMYSYFSRIEFTPISYQPLKALAPFITDFSGESIAGIT 661

Query: 638  EGELT---IGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFV 694
              ELT   IG++DD  +      PL   PR++       T  +CS       +     + 
Sbjct: 662  NDELTVIAIGSLDD--RFTYDEQPLLYTPRKVIKSNYFPTTVLCS-------DNKSNLYN 712

Query: 695  RLLDDQTFEFIS-TYPLDTFEYG----------CSILSCSFSDDSNVY----------YC 733
              +D++T EF      +D  E G           S+    F  D  +            C
Sbjct: 713  PTIDNETNEFNERKIGIDLQEDGKWASEVGIIDNSLFKSYFKTDEAIICGAHVSFKKKQC 772

Query: 734  VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
              ++ V  +  +  KG   + +      +LI +   +   ++L +   +LLA I   ++L
Sbjct: 773  FISSQV--KHYKKMKGECWINVYNLKTFKLIHKTPVEDICHALTSCGERLLAGIGTTLRL 830

Query: 794  YKWMLRDDGTRELQSECGHHGHILAL-YVQTRGDFIVVGDLMKSISLLIYKHEEGAIEER 852
            Y     D G + L  +C   G    +  ++  GD IVVG +      + Y  +   +   
Sbjct: 831  Y-----DLGKQILIRKCELDGFPSIINSLEVIGDKIVVGTVATGFIYVNYDSDANILSIT 885

Query: 853  ARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGA----TDEERGRLEVVGEYHLG 908
             +D   + ++A  ILD+   +G +   ++F    N+       TD      E+V ++++G
Sbjct: 886  EKDRIWHSLTASTILDETSTIGFDKLGSVFITETNTNSNQLNLTDIIPLSNEIV-QWYVG 944

Query: 909  EFVNRFRHGSLVMRLPDSD-----VGQIP----TVIFGTVNGVIGVIASLP-HEQYLFLE 958
            + V       +   + D+        QI      +I+  + G IGV+      E   F  
Sbjct: 945  DVVTSVSVNEIWKGITDNTNEYTFEEQIKENKNVIIYSCLMGRIGVLIPFNFREDVEFFS 1004

Query: 959  KLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTM 1018
            KL+  ++     +   + + +R  N     V     +DGDL + F  +      +I+  +
Sbjct: 1005 KLEMEIKNNYSPLLSNSFDSYRGTNYPGIGV-----IDGDLCDYFNQMDPKLQLQIANNL 1059

Query: 1019 NVSVEELCKRVEEL 1032
             ++  ++  + E+ 
Sbjct: 1060 EMTPVQIQLKCEQF 1073


>gi|326471884|gb|EGD95893.1| protein kinase subdomain-containing protein [Trichophyton tonsurans
            CBS 112818]
          Length = 1398

 Score = 63.5 bits (153), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 68/308 (22%), Positives = 130/308 (42%), Gaps = 24/308 (7%)

Query: 739  VLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWML 798
            V+PE ++P K +         KL+L A++E KGAV +L+   G+    + Q  +     L
Sbjct: 1094 VVPEPDQPEKSK---------KLKLFAKEEVKGAVTALSGIGGQGFLIVAQGQKCMVRGL 1144

Query: 799  RDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNA 858
            ++DG+    +      ++  L         ++GD  K +  + Y  E   ++   ++   
Sbjct: 1145 KEDGSLLPVAFKDTQCYVNVLKELKGTGMCIIGDAFKGLWFIGYSEEPYKLDLFGKENEN 1204

Query: 859  NWMSAVEILDD--DIY-LGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFR 915
              +   + L D   +Y L A+++ NL  ++ + E  +  +  RL     +H G F +   
Sbjct: 1205 LAVVDADFLPDGNKLYILVADDDCNLHVLQYDPEDPSSSKGDRLLHRSVFHTGHFASTMT 1264

Query: 916  ---HGSLVMRLP----DSDVGQIPT----VIFGTVNGVIGVIASLPHEQYLFLEKLQTNL 964
               HG+     P      D   +P     ++     G I VI  L  + Y  L  LQ+ L
Sbjct: 1265 LLPHGAYTPSAPVDEDAMDTDSLPPSKYQILMTFQTGSIAVITPLSEDSYRRLLALQSQL 1324

Query: 965  RKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEE 1024
               ++    LN   +R+  ++      +  +DG+L+  +LD+   R  EI+  +   V  
Sbjct: 1325 VNALEHPCSLNPRGYRAVESDGMG-GQRGMIDGNLLLRWLDMGAQRKAEIAGRVGADVGA 1383

Query: 1025 LCKRVEEL 1032
            +   +E+L
Sbjct: 1384 IRTDLEKL 1391


>gi|255075065|ref|XP_002501207.1| predicted protein [Micromonas sp. RCC299]
 gi|226516471|gb|ACO62465.1| predicted protein [Micromonas sp. RCC299]
          Length = 1423

 Score = 63.5 bits (153), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 86/353 (24%), Positives = 143/353 (40%), Gaps = 59/353 (16%)

Query: 694  VRLLDDQTFEFISTYPLDTFEYGCSI----LSCSFSDDSNVYYCVGTAYVLPE-ENEPTK 748
            +RLL   T E    Y +D  E+  SI    L  + +        VGTA  LP  E+ P +
Sbjct: 1051 LRLLVPGTLECAWQYTVDPGEHVQSIRNVQLRNTMTGALQSMLVVGTA--LPGGEDAPCR 1108

Query: 749  GRILVFIV--------EDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRD 800
            GR+L+F V           + QL+  ++ K A  +L    G L  AI  K+ ++ W    
Sbjct: 1109 GRVLIFEVVWQMTDRGTKWQGQLVCVRDAKMACTALEGVGGHLAVAIGTKLIVHSW---- 1164

Query: 801  DGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKH--EEGAIEERARDYNA 858
            DG   L         +  + +    +FI++GD+ K      +K   +E  + + A+D+  
Sbjct: 1165 DG-HSLMPVAFFDTPLHTVTMNVVKNFILLGDIQKGAFFFRWKDTPDEKLLVQMAKDFEG 1223

Query: 859  NWMSAVEILDDDIYLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVN--- 912
              + A E L D   L     +   N F    + +     +  +L   G +H+G  V+   
Sbjct: 1224 MDILATEFLVDGSTLSMLTTDMTGNAFIFSYDPKSLESWKGQKLLTKGAFHVGSPVHRMV 1283

Query: 913  RFRHGSLVMRLPDSDVGQIPT------------VIFGTVNGVIGVIASLPHEQYLFLEKL 960
            RFR     ++ P +  GQ  +            V FGT++G +G++  +    +  L+ L
Sbjct: 1284 RFR-----LKAPTAAPGQTISPAEQKAQANRHAVFFGTLDGSLGILVPIEEAAHASLQSL 1338

Query: 961  QTNLRKVIK--GVGGLNHEQWRSFNNEKKTVDAK--------NFLDGDLIESF 1003
            Q  L        + GLN    R      KTV+ +        + LDG L+  +
Sbjct: 1339 QRYLTYATPHAALAGLNARTHR----HPKTVEGRPMRQPAPHSLLDGGLLAVY 1387


>gi|353231025|emb|CCD77443.1| putative cleavage and polyadenylation specificity factor cpsf
           [Schistosoma mansoni]
          Length = 1825

 Score = 63.2 bits (152), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 61/214 (28%), Positives = 96/214 (44%), Gaps = 36/214 (16%)

Query: 110 QVLDIKFLYGCAKPTIVVLYQD------NKDARHVKTYEVALK---DKDFVEGPWSQNNL 160
            VLD++FLYG  +PT++VLY+          AR      VAL     K      W Q +L
Sbjct: 213 NVLDMQFLYGFYEPTLLVLYEPIGTWAGRVSARRDTCCIVALSFNLQKRTNPVIWFQESL 272

Query: 161 DNGADLLIPVPPPLCGVLIIGEETIVY-------------CSANAFKAIPIRPSITKAYG 207
                 +I VP P+ GV+++   +I+Y             C A      P+R  +    G
Sbjct: 273 PFDCRSVISVPQPIGGVVVMAANSILYLKQTLPSCGLPLNCYAQISTNFPMRQDVPSC-G 331

Query: 208 RVDADGSR--------YLLGDHAGLLHLLVITHEK--EKVTGLKIELLGETSIASTISYL 257
            +  DG R        +L+G  +G L+LL +  E+  + VT L    +G       +  L
Sbjct: 332 PLSIDGCRVVTLNETQFLIGTRSGNLYLLSLWLEQATQTVTSLLFHKVGHAVPPHCMVLL 391

Query: 258 DNAVVYIGSSYGDSQLIKLN---LQPDAKGSYVE 288
           ++  ++IGS + DS L+K++   L  DA G  V+
Sbjct: 392 ESKYLFIGSRFCDSVLMKIDYSLLCVDANGKEVD 425


>gi|344300987|gb|EGW31299.1| hypothetical protein SPAPADRAFT_140061 [Spathaspora passalidarum
           NRRL Y-27907]
          Length = 1107

 Score = 63.2 bits (152), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 88/381 (23%), Positives = 166/381 (43%), Gaps = 36/381 (9%)

Query: 295 NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSL 353
           +L PI+D       +Q    ++  S  Y    L+ V  GI   E  S  L      +++ 
Sbjct: 374 SLSPILD------AKQTPSNMLVLSSHY----LKQVTRGIPTTEIVSTPLPFTPTSVYTT 423

Query: 354 RSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVT 413
           + +     D++LVV+    ++ L +++ + +EE       +   TL       + LVQV 
Sbjct: 424 KLTQASVHDSYLVVTSSLASQTLVLSIGEVVEEVPDSKLVTDQHTLSIQQMGKSSLVQVY 483

Query: 414 SGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI--GDGILTEV 471
           +  +R +         +W SP G +V  A+ NA+QV++A     +VY EI   D ++   
Sbjct: 484 TNGIRQIGHKV----TDWFSPAGITVTHASTNANQVIIAMSNCEVVYFEIDVDDQLIEYQ 539

Query: 472 KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPR 531
           +  +L   I+ L I     + + S +A +G   D ++++ SL + + +  + L   +   
Sbjct: 540 QRLELTSSITSLAI-----SDTRSDVAVIGC-ADETIQVVSLAESDCLDVKSLQA-LSAN 592

Query: 532 SVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHV 591
           +  L   E   ++   + +G      L++K  +L D +   LG++P+++   S  +T  V
Sbjct: 593 ASSLVMIENTVHI--GMENGVYARTKLDIK--QLKDTRVQYLGSRPVSMSRISVGDTAGV 648

Query: 592 FAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTI---GTIDD 648
            A S    + Y+ +   +   +++ ++S    FNS      + I    EL I   GT D 
Sbjct: 649 LAISSSAWIGYTHDTWRIMPLLDI-DISSAASFNSEDIEGIVGI-HGNELVIFSLGTEDG 706

Query: 649 IQKLH---IRSIPLGEHPRRI 666
               H   I+   L   PRR+
Sbjct: 707 FDPSHEWTIKKSRLRYTPRRM 727


>gi|256079900|ref|XP_002576222.1| cleavage and polyadenylation specificity factor cpsf [Schistosoma
           mansoni]
          Length = 1958

 Score = 63.2 bits (152), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 61/214 (28%), Positives = 96/214 (44%), Gaps = 36/214 (16%)

Query: 110 QVLDIKFLYGCAKPTIVVLYQD------NKDARHVKTYEVALK---DKDFVEGPWSQNNL 160
            VLD++FLYG  +PT++VLY+          AR      VAL     K      W Q +L
Sbjct: 230 NVLDMQFLYGFYEPTLLVLYEPIGTWAGRVSARRDTCCIVALSFNLQKRTNPVIWFQESL 289

Query: 161 DNGADLLIPVPPPLCGVLIIGEETIVY-------------CSANAFKAIPIRPSITKAYG 207
                 +I VP P+ GV+++   +I+Y             C A      P+R  +    G
Sbjct: 290 PFDCRSVISVPQPIGGVVVMAANSILYLKQTLPSCGLPLNCYAQISTNFPMRQDVPSC-G 348

Query: 208 RVDADGSR--------YLLGDHAGLLHLLVITHEK--EKVTGLKIELLGETSIASTISYL 257
            +  DG R        +L+G  +G L+LL +  E+  + VT L    +G       +  L
Sbjct: 349 PLSIDGCRVVTLNETQFLIGTRSGNLYLLSLWLEQATQTVTSLLFHKVGHAVPPHCMVLL 408

Query: 258 DNAVVYIGSSYGDSQLIKLN---LQPDAKGSYVE 288
           ++  ++IGS + DS L+K++   L  DA G  V+
Sbjct: 409 ESKYLFIGSRFCDSVLMKIDYSLLCVDANGKEVD 442


>gi|66362282|ref|XP_628105.1| contains a UVDDB domain that is present in CPSF_A and damage specific
            DNA binding protein 1 [Cryptosporidium parvum Iowa II]
 gi|46227624|gb|EAK88559.1| contains a UVDDB domain that is present in CPSF_A and damage specific
            DNA binding protein 1 [Cryptosporidium parvum Iowa II]
          Length = 1495

 Score = 62.8 bits (151), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 33/123 (26%), Positives = 60/123 (48%), Gaps = 1/123 (0%)

Query: 805  ELQSECGHHGHILALYVQT-RGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSA 863
            EL+ +  +  H + ++++   G++I+VGDLM+S+ L  +    G   E  RD +  W+  
Sbjct: 1188 ELKRKETYCTHTMIVFIKILNGEYILVGDLMRSVGLWEFDRYTGKFHEVCRDNSIAWVVE 1247

Query: 864  VEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRL 923
               L  D+YL ++ N NL  + +      DE    L  +   H+GE +  F+ G      
Sbjct: 1248 GIFLSKDMYLISDENRNLRVLMRTLNPENDETYTSLSCIAHLHVGESITTFQQGKFTQAY 1307

Query: 924  PDS 926
            P++
Sbjct: 1308 PET 1310



 Score = 51.2 bits (121), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 82/405 (20%), Positives = 162/405 (40%), Gaps = 83/405 (20%)

Query: 4   VPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQ---WDAESSELITRAMGDVSDRIG 60
           V IY  I  ++ FRP  +  D + I T  Y+  +L+    +++S +++  ++ D      
Sbjct: 75  VEIYKDIIEIDRFRPKNQDFDDVLIFTREYELILLRAHLLNSDSEKILCMSILDKVSLYR 134

Query: 61  RPTDNGQI--GIIDPDCRLIGLHLYDGLFKVIPFD-------NKGQLKEAFNIRLEELQV 111
                 Q+   ++  +   I +  Y+G  +V+  +       +K        IRL EL V
Sbjct: 135 ENLRKSQLIKMMVHSEKNRIVILAYEGCLQVVGCEISSKDEEDKAVFTSPLIIRLSELSV 194

Query: 112 LDIKFLYGCAKPTIV-VLYQD--NKDARHVKTYEVALKDKDFVEGPWSQN-NLDNGADLL 167
            DI       + +++ +LY    +KD R +K  E+ +  +      WS N N+ +     
Sbjct: 195 TDICLANTTNERSLLGILYDSGYSKDPRLMKIIELPIDLR-----KWSYNTNIGHVMQQN 249

Query: 168 IPVPPPLC----------GVLIIGEETIVYCSANAFKAIPIRPSIT-------------- 203
           +    PL           G  + G+  + Y S         + S T              
Sbjct: 250 VFKIAPLYYRSKKQKMVKGFFLFGDGIVEYRSIEEIMPNKTKNSETISSTQKIVTSRFNS 309

Query: 204 KAYGRVDA--------------DGSRYLLGDHAGLLHLLVITHEK---EKVTGLKIELLG 246
            A+  +D               DGSR+L+ D+ G L ++++ +++   +KV  +++ +L 
Sbjct: 310 SAFSGMDIGLSPLSITDILNLDDGSRWLVLDNLGRLFIMLVEYDQDDIDKVVDIRLNILN 369

Query: 247 ETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVD 306
             S  S I  L + + ++ S   DS L+       +K +   ++    N+GPI D    +
Sbjct: 370 RYSPFSRIVNLGDDLFFLASKLSDSLLLY------SKNNNFYIISSLPNIGPIKDLIFTN 423

Query: 307 LERQGQGQ---------------VVTCSGAYKDGSLRIVRNGIGI 336
            +R+ + +               ++   G    G+L+ + NGIG+
Sbjct: 424 FDRKEETKYEDIPSQISKRDPLPLIAACGFGSGGALKSICNGIGL 468


>gi|388581811|gb|EIM22118.1| hypothetical protein WALSEDRAFT_28358 [Wallemia sebi CBS 633.66]
          Length = 1259

 Score = 62.4 bits (150), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 63/280 (22%), Positives = 121/280 (43%), Gaps = 14/280 (5%)

Query: 760  KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILAL 819
            KL+++  +ETKGAV ++ + +G  + A+ QK+ +    + +     L S   +      +
Sbjct: 989  KLKMLMREETKGAVSAITSCSGYFVVAVGQKVLIRALEINE----RLISVAFYDAGTYIV 1044

Query: 820  YVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEIL--DDDI-YLGAE 876
             ++   +FI+VGD +KSI+ L ++     + + +RD           L  +D I ++  +
Sbjct: 1045 SLEVLKNFILVGDQVKSITFLAFQESPYKLVQLSRDARQIETCVSNFLAHEDQISFVSND 1104

Query: 877  NNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIF 936
               +L  +  N    T E   +L    E+H G         +  + LP   V     ++ 
Sbjct: 1105 IQGDLRLIDYNPFDPTAEGGEKLIRTTEFHKGS------EATCSLLLPKPSVRPSSELLL 1158

Query: 937  GTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVD-AKNFL 995
            G V+G +  ++ +    +  L  LQ  L + I  +  LN    R   N+  +   +K  L
Sbjct: 1159 GCVDGSLSCLSPVDEITFKALWLLQGALVRQIPHIAALNPRAHRHVRNDYVSRSLSKGIL 1218

Query: 996  DGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRL 1035
            DG L+ ++  +      EI+K +  S  EL   +   + L
Sbjct: 1219 DGLLLSAYQTIDHATQVEIAKRIGYSKAELLGYLRNFSWL 1258


>gi|350584885|ref|XP_003126937.3| PREDICTED: splicing factor 3B subunit 3-like [Sus scrofa]
          Length = 425

 Score = 62.4 bits (150), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 68/288 (23%), Positives = 128/288 (44%), Gaps = 39/288 (13%)

Query: 694 VRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC-VGTAYVLPEENEPTKGRIL 752
           +R+++      +    L+  E   S+  C FS+  + +Y  VG A  L        G  +
Sbjct: 150 IRVMNPIQGNTLDLVQLEQNEAAFSVAVCRFSNTGDDWYVLVGVAKDLILNPRSVAGGFV 209

Query: 753 VF--IVEDG-KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSE 809
               +V +G KL+ + +   +    ++  F G++L  + + +++Y     D G ++L  +
Sbjct: 210 YTYKLVNNGEKLEFLHKTPVEEVPAAIAPFQGRVLIGVGKLLRVY-----DLGKKKLLRK 264

Query: 810 CGHHGHILALYV---QTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEI 866
           C  + HI A Y+   QT G  ++V D+ +S   + YK  E  +   A D    W++   +
Sbjct: 265 C-ENKHI-ANYISGIQTIGHRVIVSDVQESFIWVRYKRNENQLIIFADDTYPRWVTTASL 322

Query: 867 LDDDIYLGAENNFNLFTVR--KNSEGATDE---------ERGRL-------EVVGEYHLG 908
           LD D   GA+   N+  VR   N+    DE         +RG L       EV+  YH+G
Sbjct: 323 LDYDTVAGADKFGNICVVRLPPNTNDEVDEDPTGNKALWDRGLLNGASQKAEVIMNYHVG 382

Query: 909 EFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLF 956
           E V   +  +L+        G   ++++ T++G IG++      + ++
Sbjct: 383 ETVLSLQKTTLIP-------GGSESLVYTTLSGGIGILVPFTSHEVIY 423


>gi|327304811|ref|XP_003237097.1| hypothetical protein TERG_01819 [Trichophyton rubrum CBS 118892]
 gi|326460095|gb|EGD85548.1| hypothetical protein TERG_01819 [Trichophyton rubrum CBS 118892]
          Length = 1398

 Score = 62.0 bits (149), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 69/308 (22%), Positives = 128/308 (41%), Gaps = 24/308 (7%)

Query: 739  VLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWML 798
            V+PE ++P K +         KL+L A++E KGAV +L+   G+    + Q  +     L
Sbjct: 1094 VVPEPDQPEKSK---------KLKLFAKEEVKGAVTALSGIGGQGFLIVAQGQKCMVRGL 1144

Query: 799  RDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNA 858
            ++DG+    +      ++  L         ++GD  K +    Y  E   ++   ++   
Sbjct: 1145 KEDGSLLPVAFKDTQCYVNVLKELKGTGMCIIGDAFKGLWFTGYSEEPYKLDLFGKENEN 1204

Query: 859  NWMSAVEILDD--DIY-LGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEF---VN 912
              +   + L D   +Y L A+++ NL  ++ + E  +  +  RL     +H G F   V 
Sbjct: 1205 LAVVDADFLPDGNKLYILVADDDCNLHVLQYDPEDPSSSKGDRLLRRSVFHTGHFASTVT 1264

Query: 913  RFRHGSLVMRLP--------DSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNL 964
               HG+     P        DS       ++     G I VI  L  + Y  L  LQ+ L
Sbjct: 1265 LLPHGAHTTSSPVDEDAMDTDSPPPSKYQILMTFQTGSIAVITPLSEDSYRRLLALQSQL 1324

Query: 965  RKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEE 1024
               ++    LN   +R+  ++      +  +DG+L+  +LD+   R  EI+  +   V  
Sbjct: 1325 VNALEHPCSLNPRGYRAVESDGMG-GQRGMIDGNLLLRWLDMGAQRKAEIAGRVGADVGA 1383

Query: 1025 LCKRVEEL 1032
            +   +E+L
Sbjct: 1384 IRIDLEKL 1391


>gi|301628217|ref|XP_002943254.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
           1 [Xenopus (Silurana) tropicalis]
          Length = 628

 Score = 61.6 bits (148), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 106/562 (18%), Positives = 216/562 (38%), Gaps = 148/562 (26%)

Query: 1   MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAM-----GDV 55
           M     +G + ++   +  G  +D L ++ +  K  V+++D  + +L T ++      ++
Sbjct: 69  MASFSFFGNVMSMASVQLAGAKRDALLLSFKEAKLSVVEYDPGTHDLKTLSLHYFEEPEL 128

Query: 56  SDRIGRPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPF---------------DNKGQLKE 100
            D   +   N ++ + DP  R   + +Y     V+PF                 K     
Sbjct: 129 RDGFVQNVHNPKVRV-DPSGRCAVMLIYGTQLVVLPFRRDTLAEEHDGLVGEGQKSSFLP 187

Query: 101 AFNIRLEEL-----QVLDIKFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDF 150
           ++ I + EL      ++D++FL+G  +PT+++L++ N+      A    T  +     + 
Sbjct: 188 SYIIDVRELDEKLLNIIDMQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNI 247

Query: 151 VEG--P--WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAF 193
           ++   P  WS  NL       + VP P+ GV+I    +++Y +              N  
Sbjct: 248 MQKVHPVIWSLTNLPYDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVSLNSLTNGT 307

Query: 194 KAIPIRPSITKAYGRVDADGS--------RYLLGDHAGLLHLL-VITHEKEKVTGLKIEL 244
            + P++P   +   RV  D S        + ++    G +++L +IT     V     + 
Sbjct: 308 TSFPLKP---QEGLRVTLDCSQATFISYDKMVISLKGGEIYVLTLITDGMRSVRSFHFDK 364

Query: 245 LGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQ-----------PD------------ 281
              + + ++++ ++   +++GS  G+S L++   +           PD            
Sbjct: 365 AAASVLTTSMTPMEPGYLFLGSRLGNSLLLRYTEKVQDSPAGPSKDPDKQDEPPNKKKRV 424

Query: 282 ---------AKGSYV------------------------EVLERYVNLGPIVD------- 301
                    +KG+ V                        EV +  +N+GP          
Sbjct: 425 DSSLARPGGSKGNMVDEIDEIEVYGSEMQSGTQLSTYSFEVCDSILNIGPCATASMGEPA 484

Query: 302 FCVVDLERQGQG--QVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSS--- 356
           F   + +   +   ++V CSG  K+G+L +++  I      + EL G   MW++ S+   
Sbjct: 485 FLSEEFQESPEPDLEIVLCSGYGKNGALSVLQKSIRPQVVTTFELPGCHDMWTVISNHKK 544

Query: 357 -----------------TDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTL 399
                             D     FL++S    T IL      E+ E +  GF +Q  T+
Sbjct: 545 EEQEGEKEGETPPVEAEEDTNRHGFLILSRDDSTMILQTG--QEIMELDTSGFATQDPTV 602

Query: 400 FCHDAIYNQ-LVQVTSGSVRLV 420
           +  +   N+ +VQV+   +RL+
Sbjct: 603 YAGNIGDNKYIVQVSPRGIRLL 624


>gi|326432241|gb|EGD77811.1| hypothetical protein PTSG_08901 [Salpingoeca sp. ATCC 50818]
          Length = 1506

 Score = 61.6 bits (148), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 79/334 (23%), Positives = 135/334 (40%), Gaps = 61/334 (18%)

Query: 17  RPHGEAQDFLFIATERYKFCVLQWDAESSELIT---RAMGDVSDRIGRPTDNGQIGI-ID 72
           RP G ++D L    +  +  ++++D +  +L T    A  D   ++G      ++ + +D
Sbjct: 75  RPRGNSRDLLIFTFKDARVAIVRFDPKMRDLETVSLHAFEDTDTKLGGWHSEQRLRVCVD 134

Query: 73  PDCRLIGLHLYDGLFKVIPF-----------DNKGQLKEAFNIRLEEL--------QVLD 113
           P  R   L +Y     VI F           D +   +++F  R+ +L        +V D
Sbjct: 135 PLHRCAALMVYGCKLIVISFSSGTATAAPEADTQEDTEQSFTSRVIDLLSLPSTIGRVDD 194

Query: 114 IKFLYGCAKPTIVVLYQ----------DNKDARHVKTYEVALKDKDFVEGP--------- 154
           + FL G   P + +L+Q            KD  HV    +AL +      P         
Sbjct: 195 MAFLDGYDVPCLAILHQPRPAWVGHMAKTKDTAHVTALSLALDEMTARRAPTAPPPPPPP 254

Query: 155 --WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSA--------NAFKAIPIRPSITK 204
             W Q NL +    L PVP PL GV++IG   + Y +         N +        I +
Sbjct: 255 VVWHQENLPSDTFALQPVPAPLGGVVVIGVNVLFYVTQSLVRSLALNGYSRASTNAPIQE 314

Query: 205 AYG-RVDADGSRY--------LLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTIS 255
             G  +D DG+ +        L    +G +HLL I      V GL+++ L  + I S I 
Sbjct: 315 QTGISLDLDGAHHALLTPTQILFALPSGDIHLLTIVCTDVTVDGLRMDKLATSVIGSDIC 374

Query: 256 YLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEV 289
            L    ++I S +  S L++    P +  ++++V
Sbjct: 375 TLGRRHIFIASRHATSLLLEWAPIPLSATTHIDV 408


>gi|296424589|ref|XP_002841830.1| hypothetical protein [Tuber melanosporum Mel28]
 gi|295638079|emb|CAZ86021.1| unnamed protein product [Tuber melanosporum]
          Length = 595

 Score = 61.2 bits (147), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 88/476 (18%), Positives = 189/476 (39%), Gaps = 58/476 (12%)

Query: 247 ETSIASTISYLD-----NAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVD 301
           + SI +  + LD     N ++  G       +  L + P    +  ++ E   N  P+ D
Sbjct: 76  DCSIGTAFASLDFGLNKNDMLVAGGEMSSGGVYLLKIGPHPAETEPKLEESVANWSPVFD 135

Query: 302 FCVVDLERQGQG-------QVVTCSGAYKDGSLRIVRNGIGINEQASV--ELQGIKGMWS 352
           F +V+L  QG         +++ C+G    G++  +R GI    Q S   +++G++ +W 
Sbjct: 136 FELVNLPAQGHASGKVERDRILACTGRGDHGAITELRYGIQARIQGSADHQMRGVRRLWV 195

Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
           L   +D  +  FL  S   ++ +  +  + E E+          +T     A+    VQV
Sbjct: 196 LPDCSDVGY--FLFSSLPDQSNLCFLKPDGEWEDASDLELLDMNETTLAAGAVGKHSVQV 253

Query: 413 TSGSVRLVSSTSRELRN--------EWKSPPGYSVNVATANASQVLLATG-GGHLVYLEI 463
           T  ++ +       L N        E    P  S+         V++A    G+ + +  
Sbjct: 254 TPSTINIAQLKHASLANHREVSRDVEMADEPQLSILRCRCGGGDVIVAAALKGNYIVVAT 313

Query: 464 GDGILTEVKHAQLEYEISCLD-INPIG------ENPSYSQI--------AAVG------- 501
            +G L ++  A +  + +  D +NPIG      E P++  +        AAVG       
Sbjct: 314 RNGFLVKLTLATIVVDENIEDFLNPIGIPTSLAEEPTFISVLEINGRILAAVGTRQATIQ 373

Query: 502 -MWTDISVRIFSLPDLNLITKEHLGGE--IIPRSVLLCAFEGISYLLCALGDGHLLNFLL 558
               D S  +  L + ++++    G +   I    ++    G + LLC L  G ++   +
Sbjct: 374 LFLFDFSNGLVPLLEKSMMSDSCQGEQDLFICECAVIIRANGTARLLCGLRGGTVVVMDI 433

Query: 559 NMKTGELTDR-KKVSLGTQPITLRTFSSKNTTHVFAAS------DRPTVIYSSNKKLLYS 611
             + G   +R   +  G  P+ +     ++ +    A       D P+  + +++ +   
Sbjct: 434 QWQRGLSLERIDGIKFGPTPVQIYPDVGRSDSAFILAGPELFRFDLPSGNFRASQVVFEE 493

Query: 612 NVNLKEVSHMCPFNSAAFPDSLAIA-KEGELTIGTIDDIQKLHIRSIPLGEHPRRI 666
           +     +      +++  P+++ +   + ++    + + +K+ +R + L E PRR+
Sbjct: 494 SEVEPSLVAFVQIDTSPDPENIVVGVTKDQIFFADVGESEKVCVRRLRLYETPRRL 549


>gi|149245793|ref|XP_001527373.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
           YB-4239]
 gi|146449767|gb|EDK44023.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
           YB-4239]
          Length = 1223

 Score = 60.5 bits (145), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 100/473 (21%), Positives = 188/473 (39%), Gaps = 70/473 (14%)

Query: 160 LDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIRPS--------ITKAYGRVDA 211
           L N A+ LIPV   + GVL+ GE  I +  AN    +   P         +      +  
Sbjct: 229 LPNDANYLIPVLGAIGGVLLCGENWIRFVKANKDNIVLPLPRRGNQNTIIVNHVSHFLKK 288

Query: 212 DGSRYLLGDHAGLLHLLVITHE--KEKVTGLKIELLGETSIASTISYL-----------D 258
            G    +    G +  LV  ++  + KV  + I        + +++             +
Sbjct: 289 KGLFIFMQSTLGDILTLVFDYDSDRSKVKDITITYFDTIPPSQSLNIFKSGFLFANSLNN 348

Query: 259 NAVVY----IGSSYGDSQLIKLNLQPDAKG-------------------SYVEVLERYVN 295
           N V+Y    +G+   D+  +KL+   DA                     S ++ LE   +
Sbjct: 349 NQVLYQFMKLGADLSDATSVKLSTVTDANSIDIAKKLHYKFEVRSLQNLSEIDTLE---S 405

Query: 296 LGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQ-GIKGMWSLR 354
           L PI D  ++         + T S  +K   ++ V  GI  +      L      +++ +
Sbjct: 406 LSPITDSKLLP-----NSHLATLSSNHK---MKTVMAGIPTSVLVESPLTLTPTNVFTTK 457

Query: 355 SSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTS 414
            S D P D +LV +    +  L  ++ + LE+     F     T+         +VQ+ +
Sbjct: 458 LSEDSPNDEYLVFTSTLSSETLVFSIGESLEDVTDSKFVLDQPTIAVQQVGKASVVQIYA 517

Query: 415 GSVRLVSS---TSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLE---IGDGIL 468
             +R +S+    S+++  +W  P G S+  A  N  QVL+A     +VY E     D +L
Sbjct: 518 YGLRHISTKIGNSKKV-TDWYPPAGISIVHAATNNKQVLIALSNSEVVYFETDPTDDQLL 576

Query: 469 TEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGE 527
              +  ++   ++ + I+P  E  S++ I      +D ++++ SL P   L  K      
Sbjct: 577 EYQEKLEVTSPVTSMTISP--ERSSFAIIGC----SDETIQVISLQPQSCLEVKSLQALS 630

Query: 528 IIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITL 580
               S+ +   E  +Y+   + +G      ++  +G+L+D +   LG++PI L
Sbjct: 631 SKANSLAMLTNERTTYVHIGMENGVYARTKIDKFSGKLSDTRVKYLGSKPIAL 683



 Score = 40.0 bits (92), Expect = 6.7,   Method: Compositional matrix adjust.
 Identities = 37/123 (30%), Positives = 59/123 (47%), Gaps = 24/123 (19%)

Query: 925  DSDVGQIPTVIFGTVNGVIGVIASL--PHEQYLFLEKLQTNLRKV-------IKGVG--- 972
            DSD G    +++  + G IGV+  L   HE  L    LQ  LRK+       + GV    
Sbjct: 1104 DSDDG---CIVYTGIQGTIGVLILLLTKHEVELLFN-LQLELRKLCNHDDDAVSGVASWI 1159

Query: 973  ---GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRV 1029
               G +H + RS+ N       KN +DGD +E +L L +    +I + ++ S  E+ KR+
Sbjct: 1160 SPLGKDHLKHRSYFNP-----VKNIIDGDFVECYLQLLQNWKVKIGEAIDKSPNEIEKRL 1214

Query: 1030 EEL 1032
             ++
Sbjct: 1215 NDI 1217


>gi|428170663|gb|EKX39586.1| hypothetical protein GUITHDRAFT_143370 [Guillardia theta CCMP2712]
          Length = 1162

 Score = 60.5 bits (145), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 129/649 (19%), Positives = 262/649 (40%), Gaps = 132/649 (20%)

Query: 123 PTIVVLYQDNKDARHVKTYEVALKDKD-----FVEGPWS----------QNNLDNGADLL 167
           P + +L++D K    ++T  +  ++ +      ++GPWS            NL+ GA L 
Sbjct: 216 PLMALLFEDEKGIVDLETCSIKYEESNKLNFNLLQGPWSFSTLHPETSLGRNLEVGAFLY 275

Query: 168 IPVPPPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHL 227
             +P     + I    T  +  A++  A+P  P   +    +      ++LG + G +H+
Sbjct: 276 PHLPSSQWAIRI----TSRFAVASSL-AVPAGPGSNRKKRSM-----HWILGSNDGQMHI 325

Query: 228 LVITHEKEKVTGLKIEL--------LGETSIASTISYLDNAVVYIGSSYGDSQLIKL--- 276
           L ++ E  K   + I +        L    +A+ +   +  +    +  G + ++ +   
Sbjct: 326 LSVSGES-KSANVTINVKRIDTNIPLSNPRVATILRSQERTLQLWCTKTGYTSILDITKL 384

Query: 277 ------NLQPDAKGSYVEVLERYV--NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLR 328
                 N Q D +   VEV+      +LG I D   VD    G+ Q++   G   D  LR
Sbjct: 385 SHHIENNSQTDQEIPGVEVMCSPFEESLGSIKDAIAVDFLGDGEMQLLLACGEGSDSCLR 444

Query: 329 IVRNGIGINE--------QASVELQGIKGMWSLRSSTDD---------PFDTFLVVSF-- 369
           + R+G+ +++            ++  ++G+  L     D          FD+ LV SF  
Sbjct: 445 LCRSGLEVSKIIEEGPEMPECSDIFALRGLHILHVQPHDNGSNIQRLRAFDSHLVFSFAS 504

Query: 370 ISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLV-------SS 422
           I++T++L ++   E     + G C    T+F     +  LVQVT   +RL+       +S
Sbjct: 505 INQTKVLELD-GHEFVPVTLPGLCEDANTVFITSLPHGHLVQVTEMEIRLINMRKSVMNS 563

Query: 423 TSRELRNEWKSPPGYSVNVATANASQVLLATGGG------HLVYLEIG-DGILT------ 469
           +  +  + W +P   ++N+A+       + T G        L  L +  D +L+      
Sbjct: 564 SQEDYLHVW-TPKIGNINMASVVKYDTSIKTDGNQSLTPHQLASLVVAVDSVLSLFEILT 622

Query: 470 --------EVKHA-QLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLIT 520
                   ++KH+ Q E++IS +  N             V M  + S+ +  LPD  L  
Sbjct: 623 HSNKKSEVKLKHSRQFEHQISAVWPN------------EVHMLKNDSMEV-KLPD-KLSY 668

Query: 521 KEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRK------KVSLG 574
           +++            C  + ++ +  + G  H+  F +  KT  + D         + +G
Sbjct: 669 QQYNQH---------CVCDQVAVVGSSTGIAHI--FPIPSKTSGIVDVMTLIEVCSLRVG 717

Query: 575 TQPITLRTFSSKNTTH-VFAASDRPTVIYSSNKKLLYSNVN-LKEVSHMCPFNSAAFPDS 632
              +++    ++   H +F  S+    I+  + +L    +N +++V  +C  ++   P S
Sbjct: 718 ETAVSIHVEETEQAVHGIFLHSNYNAWIHLRSGELEVKRINGIEKVKAVCSLHTELMPTS 777

Query: 633 LAIAKEGE-LTIGTIDDIQK---LHIRSIPLGEHPRRICHQEQSRTFAI 677
           L    E + L  G + + ++   ++ RS+ L E+   + H  + R   +
Sbjct: 778 LVWINEKDCLQFGKLAEDRQTSTIYQRSVKLAENVVSMTHSSRHRCLLL 826


>gi|303285993|ref|XP_003062286.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226455803|gb|EEH53105.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 1469

 Score = 60.1 bits (144), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 75/335 (22%), Positives = 141/335 (42%), Gaps = 44/335 (13%)

Query: 733  CVGTAYVLPE-ENEPTKGRILVFIVE----DGKLQLI--AEKETKGAVYSLNAFNGKLLA 785
             VGTA  +P  E+ P +GR+++F +     DG+ + +         A+ +L+   G L+ 
Sbjct: 1139 AVGTA--MPGGEDTPCRGRVILFEISWQMVDGETRRVPLLLLFFDDALAALSGLEGHLVV 1196

Query: 786  AINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHE 845
            AI  K+ ++ W    DG  EL         +  + +    +F+ +GD+ K      +K +
Sbjct: 1197 AIGTKLIVHAW----DGA-ELIPVAFFDTPVHTVTINVVKNFVCIGDVQKGAYFFRWKDD 1251

Query: 846  ----EGAIEERARDYNANWMSAVEILDDDI---YLGAENNFNLFTVRKNSEGATDEERGR 898
                E  + + A+D+ +  + + E L D      L A+   N +    + + +   +  +
Sbjct: 1252 PRTGEKNLIQLAKDFESMDVLSTEFLVDGSTLSLLAADTAGNAYVFAYDPKSSESWKGQK 1311

Query: 899  LEVVGEYHLGEFVNR---FRHGSLVMRLPDSDVGQIPT----------VIFGTVNGVIGV 945
            L     +H+G  V+R   F+  +      D      P           V FGT++G +G+
Sbjct: 1312 LLTKASFHVGSPVHRMVRFKLKTPTGAGNDGRAAPTPAEIKANANRHAVFFGTLDGSLGI 1371

Query: 946  IASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEK----KTVDAKNFLDGDLIE 1001
            +  +    +  LE LQ  L        GLN   +R+    +    ++    N LDG++++
Sbjct: 1372 LVPMESSTHAKLEVLQRWLNYNTAQNAGLNGRSYRAPKTTEGRAMRSPAPHNLLDGEMLQ 1431

Query: 1002 SFLDLSRTRMDEISKTMNVSVEELCKRVEELTRLH 1036
             F  L+ T+  E +    ++      R E LT LH
Sbjct: 1432 GFESLAWTKQAEAADAAGMT------REEALTYLH 1460



 Score = 59.7 bits (143), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 132/661 (19%), Positives = 245/661 (37%), Gaps = 179/661 (27%)

Query: 8   GRIATLEL-FRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAM---------GDVSD 57
           G IATL   F    E +D L +A    K  V+++D  +  L+  ++         G V  
Sbjct: 113 GSIATLRRRFGAPREQRDALLLAVRESKLSVVEFDPSTLSLVCSSLHSWETPPGAGGVPS 172

Query: 58  RIGRPTDNGQIGIIDPD--CRLIGLHLYDGL-FKVIPFDN-------------KGQ---- 97
            + R      + + DP+  C  + L    G    ++P DN             KG+    
Sbjct: 173 AL-RLAPTPPVVVADPEGRCAAVLLRAEGGTRLALLPTDNDAMDVDGGDGSEGKGRRTLR 231

Query: 98  -----LKEAFNIRL-EELQVL---DIKFLYGCAKPTIVVLYQDN----------KDARHV 138
                +K+++ + L  E+ V    D+ FL+G  +P ++VL+++           KD   +
Sbjct: 232 GTAAAVKKSYVVDLVREMGVRYVRDVCFLHGYGEPVLLVLHEERLTWAARATLVKDTMRL 291

Query: 139 KTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPI 198
               + +  +      W ++ L +    L  +P PL G +++ +  +++ S  +  A+ +
Sbjct: 292 SAISLNVDARKHTV-IWRRSALPHSCYRLTAMPAPLGGAIVLSQNFLLHESQESSAALAL 350

Query: 199 RPSITKAYGRVDADG----------------------SRYLLGDHAGLLHLLVITHEKEK 236
            P      GR D                          + L+   AG L+LL +  E  +
Sbjct: 351 NP--LAGGGRGDDPAAKAAAAASAAALDGAYAAVISEKQALVTTKAGALYLLSLRIEGRR 408

Query: 237 VT---GLKIELLGETSIASTISYLDNAVVYIGSSYGDS---------------------- 271
           +    G+ ++  G   ++S +  +   ++++GS  GDS                      
Sbjct: 409 LATRGGMHLKRAGGAVLSSGMCLVTRRLLFLGSRVGDSLLVSRCSTARASTAAPGRRPRA 468

Query: 272 ------------QLIKLNLQ---------------------PDAKGSYVEVLERYVNLGP 298
                       +L+ +  Q                     PD  G    V +  + + P
Sbjct: 469 AAAAATTAAAEVRLLPIRPQIDGVGGVSAASLRAAAAAHRAPDHPGYTFTVRDSVLGISP 528

Query: 299 IVDFCV-----VDLERQGQGQVVTCSGAYKDGSLRIVRNGIG---INEQASVELQGIKGM 350
           ++D  V     V  +   + +++   G  K+G+L +++ GI    + E  S  L G+KG 
Sbjct: 529 VIDLTVGASASVSGDTIERTELIAACGHGKNGALAVLQRGIQPELVTEVESGTLPGLKGT 588

Query: 351 WS----------LRSSTD-------DPFDTFLVVSFISETRILAMNLEDELEE-TEIEGF 392
           W+          LR S         DP+  +LV+S  S T IL     +EL+E +E    
Sbjct: 589 WTVHHDSADNERLRGSAAAAAAQAVDPYHAYLVISLASSTMILETG--EELKEVSEHVEL 646

Query: 393 CSQTQTLFCHDA--------IYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATA 444
            +   TL   +A        +Y++ V+V +G V++    S EL  +     G  +  A  
Sbjct: 647 VTDAATLCAGNAFGRERIVQVYDKGVRVAAGPVKVQDIASTELVADAGDGEGIEIVAAEI 706

Query: 445 NASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWT 504
           +   VL            + DG L  +K  +    +  LD++ +   P    IA   +  
Sbjct: 707 SFPYVL----------CRLSDGSLAVLKGDEESKTLVKLDVDALARLPPGGGIACATLVD 756

Query: 505 D 505
           D
Sbjct: 757 D 757


>gi|325094074|gb|EGC47384.1| cleavage factor two protein 1 [Ajellomyces capsulatus H88]
          Length = 1377

 Score = 60.1 bits (144), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 104/431 (24%), Positives = 172/431 (39%), Gaps = 86/431 (19%)

Query: 655  RSIPLGEHPRRICHQEQSRTFAICSLKNQ----SCAEESEMH-----------------F 693
            R I LGE    + +   S T+ I +  NQ    +  E+ E+H                  
Sbjct: 976  RKIGLGEQVDAVEYSSSSETYVIGT--NQKVDFNLPEDDEIHPEWRNEVISFLPQIDKGS 1033

Query: 694  VRLLDDQTFEFISTYPLDTFE----YGCSILSCS-FSDDSNVYYCVGTAYVLPEENEPTK 748
            V+LL  +T+  I +Y L   E      C  L  S  + +      VGTA +   E+   +
Sbjct: 1034 VKLLTPRTWSIIDSYNLRNAERIMCVKCLNLEVSEITHERKDTIVVGTA-LTKGEDIAAR 1092

Query: 749  GRILVFIV-----------EDGKLQLIAEKETKGAVYSLNAFNGK--LLAAINQKIQLYK 795
            G I +F V            + KL+LIA++E KGAV SL+   G+  L+AA  QK  +  
Sbjct: 1093 GCIYIFEVIKVVPEVDRPETNRKLKLIAKEEVKGAVTSLSGIGGQGFLIAAQGQKCIVRG 1152

Query: 796  WMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARD 855
              L++DG+            +L +       ++ V   +K           GAI      
Sbjct: 1153 --LKEDGS------------LLPVAFMDMQCYVNVLKELKG----------GAITNCL-- 1186

Query: 856  YNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRF- 914
            ++A     V   DD  +L  EN   +     +  G++  +R  L     +  G F +   
Sbjct: 1187 FSARM--TVPSSDDADFLPDENRLYILVADDDYPGSSKGDR--LLHRSTFQTGHFASTMT 1242

Query: 915  ---RHGSLVMRLPDSD--------VGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTN 963
               R  +   + PD+D         G +  V+  +  G I +I  +    Y  L  LQ+ 
Sbjct: 1243 LLPRTATSSSQGPDADPDMMDLDSSGPLHHVLVTSETGSIALITPVSETSYRRLSALQSQ 1302

Query: 964  LRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVE 1023
            L   ++   GLN   +R+   E   +  +  +DGDL++ +LDL   R  EI+  +   V 
Sbjct: 1303 LANTLEHPCGLNPRAFRAV--ESDGIGGRGMVDGDLVKRWLDLGTQRKAEIANRVGADVW 1360

Query: 1024 ELCKRVEELTR 1034
            E+   +E + +
Sbjct: 1361 EIRADLEAIGK 1371


>gi|302652143|ref|XP_003017931.1| hypothetical protein TRV_08063 [Trichophyton verrucosum HKI 0517]
 gi|291181517|gb|EFE37286.1| hypothetical protein TRV_08063 [Trichophyton verrucosum HKI 0517]
          Length = 429

 Score = 59.7 bits (143), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 70/312 (22%), Positives = 132/312 (42%), Gaps = 27/312 (8%)

Query: 739  VLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWML 798
            V+PE ++P K +         KL+L A++E KGAV +L+   G+    + Q  +     L
Sbjct: 125  VVPEPDQPEKSK---------KLKLFAKEEVKGAVTALSGIGGQGFLIVAQGQKCMVRGL 175

Query: 799  RDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNA 858
            ++DG+    +      ++  L         ++GD  K +  + Y  E   ++   ++   
Sbjct: 176  KEDGSLLPVAFKDTQCYVNVLKELKGTGMCIIGDAFKGLWFIGYSEEPYKLDLFGKENEN 235

Query: 859  NWMSAVEILDD--DIY-LGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFR 915
              +   + L D   +Y L A+++ NL  ++ + E  +  +  RL     +H G F +   
Sbjct: 236  LAVVDADFLPDGNKLYILVADDDCNLHVLQYDPEDPSSSKGDRLLHRSVFHTGHFASTMT 295

Query: 916  ---HGSLVMRLP--------DSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNL 964
               HG+     P        DS       ++     G + VI  L  + Y  L  LQ+ L
Sbjct: 296  LLPHGARTPSSPVDEDAMDTDSPPPSKYQILMTFQTGSVAVITPLGEDSYRRLLALQSQL 355

Query: 965  RKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEE 1024
               ++    LN   +R+  ++      +  +DG+L+  +LD+   R  EI+  +   V  
Sbjct: 356  VNALEHPCSLNPRGYRAVESDGMG-GQRGMIDGNLLLRWLDMGAQRKAEIAGRVGADVGA 414

Query: 1025 LCKRVEELTRLH 1036
            +  RV +L +LH
Sbjct: 415  I--RV-DLEKLH 423


>gi|308504990|ref|XP_003114678.1| hypothetical protein CRE_28194 [Caenorhabditis remanei]
 gi|308258860|gb|EFP02813.1| hypothetical protein CRE_28194 [Caenorhabditis remanei]
          Length = 270

 Score = 59.7 bits (143), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 59/274 (21%), Positives = 119/274 (43%), Gaps = 46/274 (16%)

Query: 791  IQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIE 850
             +++ W     G        G +  +  + +Q+ G  I+V D  +S+  L Y+  +  + 
Sbjct: 5    FEIFPW-----GNHLFDDRMGKNFPVSIVNIQSTGQRIIVSDSQESVHFLRYRKGDNQLV 59

Query: 851  ERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDE--------ERG-- 897
              A D    ++S V +LD      A+   NL  VR   + +E   D+        +RG  
Sbjct: 60   VFADDTTPRYVSCVCVLDYHTVAVADKFGNLAVVRLPERVNEDVQDDPTVSKSVWDRGWL 119

Query: 898  -----RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIAS-LPH 951
                 ++E+V  + +G+ +   +  SL   +P    G    +++ T+ G IG + S +  
Sbjct: 120  NGASQKVELVANFFIGDTITSLQKTSL---MP----GANEALVYTTIGGAIGCLVSFMSK 172

Query: 952  EQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFL-------------DGD 998
            ++  F   L+ ++R     + G +H  +RS+    K     NFL             DGD
Sbjct: 173  DEVDFFTNLEMHVRSEYPPLCGRDHLAYRSYYAPCKV--CFNFLLFRSIVSLFQSVIDGD 230

Query: 999  LIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
            + E F  +  ++  E+++ +  +V E+ K++E++
Sbjct: 231  ICEQFSLMDLSKQKEVAEELGKTVSEISKKLEDI 264


>gi|255725990|ref|XP_002547921.1| hypothetical protein CTRG_02218 [Candida tropicalis MYA-3404]
 gi|240133845|gb|EER33400.1| hypothetical protein CTRG_02218 [Candida tropicalis MYA-3404]
          Length = 1053

 Score = 59.7 bits (143), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 74/340 (21%), Positives = 146/340 (42%), Gaps = 21/340 (6%)

Query: 271 SQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIV 330
           +++I  +L+     + ++ LE    L PI+D   +D       ++VT S       L+ V
Sbjct: 387 TKVITFDLKSLTNLALIDTLE---TLSPILDAKSID------SKLVTLSSH---SYLKTV 434

Query: 331 RNGIGINEQASVELQGI-KGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEI 389
            +GI  +      L  I   +++ + S +   D +LV+S    ++ L +++ + +E+ E 
Sbjct: 435 THGIPTSTMVESPLPVIPTDIFTTKLSFESENDEYLVISSSLSSKTLVLSIGEVVEDVED 494

Query: 390 EGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELR--NEWKSPPGYSVNVATANAS 447
             F     T+       N +VQ+ +  ++ V   + E +   +W  P G ++  A  N  
Sbjct: 495 SEFVLDQPTIAVQQVGKNSVVQIYTNGIKHVRQINGEKKKITDWLPPAGITITHAATNNQ 554

Query: 448 QVLLATGGGHLVYLEI---GDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWT 504
           QVL+A     +VY EI    D ++   +  +L    + L I     N   S  A +G  +
Sbjct: 555 QVLIALSNLEVVYFEIDHLDDQLIEYQERLELSSSTTALAIEEHTNNHHQSPFAIIG-CS 613

Query: 505 DISVRIFSLPDLNLITKEHLGG-EIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTG 563
           D ++++ SL   N +  + +        SV +        +   + +G      ++   G
Sbjct: 614 DETIQVVSLKQQNCLEIQSIQALSSNCSSVKMIKLGKDLMVHIGMNNGVYARIKIDPING 673

Query: 564 ELTDRKKVSLGTQPITLRTFS-SKNTTHVFAASDRPTVIY 602
           +L+D +   LG++P+ L     +K  T V A S +  + Y
Sbjct: 674 KLSDSRIKYLGSKPVKLNIVKINKEITGVLAVSSKSWIGY 713


>gi|302506529|ref|XP_003015221.1| hypothetical protein ARB_06344 [Arthroderma benhamiae CBS 112371]
 gi|291178793|gb|EFE34581.1| hypothetical protein ARB_06344 [Arthroderma benhamiae CBS 112371]
          Length = 1370

 Score = 59.3 bits (142), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 70/312 (22%), Positives = 131/312 (41%), Gaps = 27/312 (8%)

Query: 739  VLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWML 798
            V+PE ++P K +         KL+L A++E KGAV +L+   G+    + Q  +     L
Sbjct: 1036 VVPEPDQPEKNK---------KLKLFAKEEVKGAVTALSGIGGQGFLIVAQGQKCMVRGL 1086

Query: 799  RDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNA 858
            ++DG+    +      ++  L         ++GD  K +  + Y  E   ++   ++   
Sbjct: 1087 KEDGSLLPVAFKDTQCYVNVLKELKGTGMCIIGDAFKGLWFIGYSEEPYKLDLFGKENEN 1146

Query: 859  NWMSAVEILDD--DIY-LGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFR 915
              +   + L D   +Y L A+++ NL  ++ + E  +  +  RL     +H G F +   
Sbjct: 1147 LAVVDADFLPDGNKLYILVADDDCNLHVLQYDPEDPSSSKGDRLLHRSVFHTGHFASTMT 1206

Query: 916  ---HGSLVMRLP--------DSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNL 964
               HG      P        DS       ++     G I +I  L  + Y  L  LQ+ L
Sbjct: 1207 LLPHGGHTPSSPVDEDAMDTDSPPPSKYQILMTFQTGSIAIITPLGEDSYRRLLALQSQL 1266

Query: 965  RKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEE 1024
               ++    LN   +R+  ++      +  +DG+L+  +LD+   R  EI+  +   V  
Sbjct: 1267 VNALEHPCSLNPRGYRAVESDGMG-GQRGMIDGNLLLRWLDMGAQRKAEIAGRVGADVGA 1325

Query: 1025 LCKRVEELTRLH 1036
            +  RV +L +LH
Sbjct: 1326 I--RV-DLEKLH 1334


>gi|169864473|ref|XP_001838845.1| cleavage factor protein [Coprinopsis cinerea okayama7#130]
 gi|116500065|gb|EAU82960.1| cleavage factor protein [Coprinopsis cinerea okayama7#130]
          Length = 1458

 Score = 59.3 bits (142), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 111/517 (21%), Positives = 207/517 (40%), Gaps = 74/517 (14%)

Query: 562  TGELTDRKKVSLGTQPITLRTFSSKNT-----THVFAASDRPTVIYSSNK---KLLYSNV 613
            +G L + KK+     P T++  +S  T     + VF   D+P  I  ++K   ++  S  
Sbjct: 940  SGILAEHKKIQRLFVPFTVKPKTSDGTPSPTYSGVFFTGDKPNWIIGTDKGGVQIYPSGH 999

Query: 614  NLKEVSHMCPF--NSAAFPDSLAIAKEGELTIGTIDDIQKLH---IRSIPLGEHPRRICH 668
            N+      C        F   L   ++G   I  + D    H    RSIP G     +  
Sbjct: 1000 NVVHSFSACSLWEERGEF---LVYTEDGPCLIEWLPDFTYSHPLPARSIPRGRGYSNVVF 1056

Query: 669  QEQSRTFAICSLKNQSCAEESEMHFVR------------LLDDQTFEFIST---YPLDTF 713
             + S    + +   Q+     +   VR            + D    E IS      +D F
Sbjct: 1057 -DPSTCLIVAASSMQARFASYDEDGVRVWEKDGPGVDDPITDTSALELISPNSWITMDGF 1115

Query: 714  EYGC----------SILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVF----IVEDG 759
            E+            ++ + +    S  +  VGT  +   E+   KG   +F    +V D 
Sbjct: 1116 EFATNEYINDISIVTLETAATETGSKDFIAVGTT-IDRGEDLAAKGAAYIFEIVEVVPDP 1174

Query: 760  --------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECG 811
                    KL+L    + KG V ++  F G L++++ QKI +  +    D    L     
Sbjct: 1175 AISPTRWYKLRLRCRDDAKGPVTAVCGFQGYLVSSMGQKIFVRAF----DSDERLVGVAF 1230

Query: 812  HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEIL---D 868
                I    ++   + +++GD +KS+  + ++ +   +   A+D + + ++  +     D
Sbjct: 1231 MDVGIYVTSLRVLKNLLLIGDAVKSVMFVAFQEDPYKLVLLAKDVHLHSVTRADFFFNAD 1290

Query: 869  DDIYLGAENNFNLFTVRKNSEGATDEERGR-LEVVGEYHLGEFVNRFRHGSLVMRLPDSD 927
             D+ L   +   +  + + +    D   GR L +  EYH G+      H S  +   D +
Sbjct: 1291 GDLALIVGDEEGIMRIYEYNPNDPDSRDGRYLLLRTEYH-GQVP---YHTSTTIARRDKE 1346

Query: 928  VGQIPT--VIFGTVNGVIGVIASLPHEQYLF--LEKLQTNLRKVIKGVGGLNHEQWRSFN 983
               IP   ++ G+ +G +  +  +P ++Y F  L+ LQ  L + I+ V GLN + +R   
Sbjct: 1347 DPSIPQSHLLIGSADGSLSSL--VPVDEYAFKRLQLLQGQLTRNIQHVAGLNPKAFRIVK 1404

Query: 984  NEKKTVD-AKNFLDGDLIESFLDLSRTRMDEISKTMN 1019
            N+  +   +K  LDG L+  +  L   R +E++K + 
Sbjct: 1405 NDYVSKPLSKGILDGQLLAQYESLPIPRQNEMTKQIG 1441


>gi|358338426|dbj|GAA28838.2| cleavage and polyadenylation specificity factor subunit 1
           [Clonorchis sinensis]
          Length = 1741

 Score = 59.3 bits (142), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 60/218 (27%), Positives = 93/218 (42%), Gaps = 37/218 (16%)

Query: 111 VLDIKFLYGCAKPTIVVLYQD------NKDARHVKTYEVALK---DKDFVEGPWSQNNLD 161
           VLD++FL G  +PT++VLY+          AR      VAL     K      W Q +L 
Sbjct: 158 VLDMQFLNGFYEPTLLVLYEPIGTWAGRVSARRDTCCIVALSFNLQKRTNPVIWFQESLP 217

Query: 162 NGADLLIPVPPPLCGVLIIGEETIVY-------------CSANAFKAIPIRPSITKAYGR 208
                +  VP P+ GVLI+   +I+Y             C A      P+R  + +  G 
Sbjct: 218 YDCTYVHSVPEPIGGVLILATNSIIYMKQTLPSCGLPLNCYAQVTTNFPMRQDVPQC-GP 276

Query: 209 VDADGSR--------YLLGDHAGLLHLLV--ITHEKEKVTGLKIELLGETSIASTISYLD 258
           +  DG R        +L+    G + LL   + H  + V+ L +  +G +    +++ LD
Sbjct: 277 LTLDGCRIVTMTDSQFLIVTRTGKMCLLSLWVEHTTQTVSSLLLHEIGCSVPPYSVALLD 336

Query: 259 NAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNL 296
              V++GS   DS L+ L     A   +V  L R V+L
Sbjct: 337 KGYVFVGSRLCDSVLLHLT----ASTMFVNTLGRIVDL 370



 Score = 46.2 bits (108), Expect = 0.087,   Method: Compositional matrix adjust.
 Identities = 56/258 (21%), Positives = 105/258 (40%), Gaps = 61/258 (23%)

Query: 650  QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCA----------EESEM-------- 691
            + L +R +PL   P  + +  +S+T+A+   + +SC+          +E E+        
Sbjct: 1245 EHLGMRWVPLELTPYFLQYHIESKTYALVGTRVKSCSSVYHLNAEGNKEEEVLLRPPTCV 1304

Query: 692  -----HFVRLLDDQTFEFISTYPLDTFEYGC------SILSCSFSDD---------SNVY 731
                 ++V  +   +       P     + C       +++C  +           +  Y
Sbjct: 1305 LPSLDYYVLQMYAPSTSLAEATPWQAIPHACIDFEPWEVVTCMITAQLSSEQTFHGTKDY 1364

Query: 732  YCVGTAYVLPEENEPTKGRILVFIVED-----------GKLQLIAEKETKGAVYSLNAFN 780
              +G      EE  P +GRI++  V D            KL+ I + E KG V +L++  
Sbjct: 1365 LALGANLSYGEEI-PVRGRIIILDVIDVVPEPGQPLTRHKLKTIYDGEQKGPVTALSSCQ 1423

Query: 781  GKLLAAINQKIQLYKWMLRDD---GTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSI 837
            G L++AI QK+  Y W L++    G   + SE   H  +         + I+  D++KSI
Sbjct: 1424 GHLVSAIGQKV--YIWTLKNADLVGVAFVDSELYIHSLLCV------KNLILAADVLKSI 1475

Query: 838  SLLIYKHEEGAIEERARD 855
             LL ++ +   +   +RD
Sbjct: 1476 QLLRFQSDLRVLSVVSRD 1493


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.319    0.138    0.404 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 16,396,485,688
Number of Sequences: 23463169
Number of extensions: 723575118
Number of successful extensions: 1575804
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 588
Number of HSP's successfully gapped in prelim test: 596
Number of HSP's that attempted gapping in prelim test: 1567224
Number of HSP's gapped (non-prelim): 3132
length of query: 1036
length of database: 8,064,228,071
effective HSP length: 153
effective length of query: 883
effective length of database: 8,769,330,510
effective search space: 7743318840330
effective search space used: 7743318840330
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 82 (36.2 bits)