BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 001655
(1036 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|225443990|ref|XP_002280735.1| PREDICTED: DNA damage-binding protein 1 isoform 1 [Vitis vinifera]
Length = 1089
Score = 2034 bits (5270), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 993/1037 (95%), Positives = 1020/1037 (98%), Gaps = 1/1037 (0%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAE+SE+ITRAMGDVSDRIG
Sbjct: 53 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAENSEVITRAMGDVSDRIG 112
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC
Sbjct: 113 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 172
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
+KPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPW+QNNLDNGADLLIPVPPPLCGVLII
Sbjct: 173 SKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWAQNNLDNGADLLIPVPPPLCGVLII 232
Query: 181 GEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL 240
GEETIVYCSA+AFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL
Sbjct: 233 GEETIVYCSASAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL 292
Query: 241 KIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIV 300
KIELLGETSIASTISYLDNA VY+GSSYGDSQLIK++LQPDAKGSYVEVLERYVNLGPIV
Sbjct: 293 KIELLGETSIASTISYLDNAFVYVGSSYGDSQLIKIHLQPDAKGSYVEVLERYVNLGPIV 352
Query: 301 DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP 360
DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP
Sbjct: 353 DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP 412
Query: 361 FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLV 420
DTFLVVSFISETRILAMNLEDELEETEIEGFCSQ QTLFCHDA+Y+QLVQVTS SVRLV
Sbjct: 413 HDTFLVVSFISETRILAMNLEDELEETEIEGFCSQVQTLFCHDAVYDQLVQVTSSSVRLV 472
Query: 421 SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEI 480
STSRELRNEWK+P GYSVNVATANA+QVLLATGGGHLVYLEIGDG LTEVKHAQLEY+I
Sbjct: 473 GSTSRELRNEWKAPSGYSVNVATANATQVLLATGGGHLVYLEIGDGTLTEVKHAQLEYDI 532
Query: 481 SCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEG 540
SCLDINPIGENP++SQ+AAVGMWTDISVRIFSLPDLNLITKE+LGGEIIPRSVLLC+FEG
Sbjct: 533 SCLDINPIGENPNFSQLAAVGMWTDISVRIFSLPDLNLITKEYLGGEIIPRSVLLCSFEG 592
Query: 541 ISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV 600
I YLLCALGDGHLLNFLLNM TGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV
Sbjct: 593 IPYLLCALGDGHLLNFLLNMSTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV 652
Query: 601 IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG 660
IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEG+LTIGTIDDIQKLHIRSIPLG
Sbjct: 653 IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGDLTIGTIDDIQKLHIRSIPLG 712
Query: 661 EHPRRICHQEQSRTFAICSLK-NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSI 719
EH RRICHQEQSRTFAICSLK NQS E+SEMHF+RLLDDQTFEFISTYPLDTFEYGCSI
Sbjct: 713 EHARRICHQEQSRTFAICSLKYNQSSTEDSEMHFIRLLDDQTFEFISTYPLDTFEYGCSI 772
Query: 720 LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAF 779
LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAF
Sbjct: 773 LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAF 832
Query: 780 NGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISL 839
NGKLLAAINQKIQLYKWMLRDDGTRELQSE GHHGHILALYVQTRGDFIVVGDLMKSISL
Sbjct: 833 NGKLLAAINQKIQLYKWMLRDDGTRELQSESGHHGHILALYVQTRGDFIVVGDLMKSISL 892
Query: 840 LIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRL 899
LIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFN+FTVRKNSEGATDEERGRL
Sbjct: 893 LIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNIFTVRKNSEGATDEERGRL 952
Query: 900 EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEK 959
EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPH+QY+FLEK
Sbjct: 953 EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHDQYVFLEK 1012
Query: 960 LQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN 1019
LQ NLRKVIKGVGGL+HEQWRSFNNEKKTVDAKNFLDGDLIE+FLDL+RTRMDEISK M
Sbjct: 1013 LQANLRKVIKGVGGLSHEQWRSFNNEKKTVDAKNFLDGDLIETFLDLNRTRMDEISKAMA 1072
Query: 1020 VSVEELCKRVEELTRLH 1036
VSVEELCKRVEELTRLH
Sbjct: 1073 VSVEELCKRVEELTRLH 1089
>gi|224061051|ref|XP_002300334.1| predicted protein [Populus trichocarpa]
gi|222847592|gb|EEE85139.1| predicted protein [Populus trichocarpa]
Length = 1088
Score = 2023 bits (5242), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 984/1036 (94%), Positives = 1014/1036 (97%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAE+SELITRAMGDVSDRIG
Sbjct: 53 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAETSELITRAMGDVSDRIG 112
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL+GC
Sbjct: 113 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLHGC 172
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
+KPTIVVLYQDNKDARHVKTYEVALKDKDF+EGPWSQNNLDNGADLLIPVPPP CGVLII
Sbjct: 173 SKPTIVVLYQDNKDARHVKTYEVALKDKDFIEGPWSQNNLDNGADLLIPVPPPFCGVLII 232
Query: 181 GEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL 240
GEETIVYCSAN F+AIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL
Sbjct: 233 GEETIVYCSANVFRAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL 292
Query: 241 KIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIV 300
KIELLGETSIASTISYLDNA V+IGSSYGDSQL+KLNL PDAKG+YVEVL+RYVNLGPIV
Sbjct: 293 KIELLGETSIASTISYLDNAFVFIGSSYGDSQLVKLNLHPDAKGTYVEVLDRYVNLGPIV 352
Query: 301 DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP 360
DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRS TDDP
Sbjct: 353 DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSLTDDP 412
Query: 361 FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLV 420
FDTFLVVSFISETRILAMN+EDELEETEIEGFCSQ QTLFCH A++NQLVQVTS SVRLV
Sbjct: 413 FDTFLVVSFISETRILAMNIEDELEETEIEGFCSQVQTLFCHCAVFNQLVQVTSSSVRLV 472
Query: 421 SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEI 480
SST+RELR EW +P GYS+NVATANA+QVLLATGGGHLVYLEIGDG LT+ KHAQLE EI
Sbjct: 473 SSTTRELRQEWNAPSGYSINVATANATQVLLATGGGHLVYLEIGDGTLTQAKHAQLECEI 532
Query: 481 SCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEG 540
SCLDINPIGENP+YSQ+AAVGMWTDISVRIFSLPDLNLITKE LGGEIIPRSVLLC+FEG
Sbjct: 533 SCLDINPIGENPNYSQLAAVGMWTDISVRIFSLPDLNLITKEPLGGEIIPRSVLLCSFEG 592
Query: 541 ISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV 600
I+YLLCALGDGHLLNFLLN+ TG L DRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV
Sbjct: 593 IAYLLCALGDGHLLNFLLNLSTGGLKDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV 652
Query: 601 IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG 660
IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGEL+IGTIDDIQKLHIRSIPLG
Sbjct: 653 IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELSIGTIDDIQKLHIRSIPLG 712
Query: 661 EHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSIL 720
EH RRICHQEQSRTF+ICS+KNQS AEESEMHF+RLLDDQTFEFISTYPLDTFEYGCSIL
Sbjct: 713 EHARRICHQEQSRTFSICSMKNQSNAEESEMHFIRLLDDQTFEFISTYPLDTFEYGCSIL 772
Query: 721 SCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFN 780
SCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFN
Sbjct: 773 SCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFN 832
Query: 781 GKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLL 840
GKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLL
Sbjct: 833 GKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLL 892
Query: 841 IYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLE 900
IYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLE
Sbjct: 893 IYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLE 952
Query: 901 VVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKL 960
VVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKL
Sbjct: 953 VVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKL 1012
Query: 961 QTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV 1020
Q+NLRKVIKGVGGL+HEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSR+RMDEISK M +
Sbjct: 1013 QSNLRKVIKGVGGLSHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRSRMDEISKAMEI 1072
Query: 1021 SVEELCKRVEELTRLH 1036
SVEELCKRVEELTRLH
Sbjct: 1073 SVEELCKRVEELTRLH 1088
>gi|356512636|ref|XP_003525024.1| PREDICTED: DNA damage-binding protein 1a-like isoform 1 [Glycine max]
Length = 1089
Score = 1996 bits (5170), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 973/1037 (93%), Positives = 1010/1037 (97%), Gaps = 1/1037 (0%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
MLDVPIYGRIATLELFRPHGEAQD+LFIATERYKFCVLQWD+E++EL+TRAMGDVSDRIG
Sbjct: 53 MLDVPIYGRIATLELFRPHGEAQDYLFIATERYKFCVLQWDSETAELVTRAMGDVSDRIG 112
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC
Sbjct: 113 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 172
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
+KPTIVVLYQDNKDARHVKTYEVALKDKDF+EGPWSQNNLDNGADLLIPVPPPLCGVLII
Sbjct: 173 SKPTIVVLYQDNKDARHVKTYEVALKDKDFLEGPWSQNNLDNGADLLIPVPPPLCGVLII 232
Query: 181 GEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL 240
GEETIVYCSANAFKAIPIRPSITKAYGRVD DGSRYLLGDH GLL LLVITHEKEKVTGL
Sbjct: 233 GEETIVYCSANAFKAIPIRPSITKAYGRVDPDGSRYLLGDHTGLLSLLVITHEKEKVTGL 292
Query: 241 KIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIV 300
KIE LGETSIASTISYLDNA VYIGSSYGDSQLIKLNLQPDAKGSYVE LERYVNLGPIV
Sbjct: 293 KIEPLGETSIASTISYLDNAFVYIGSSYGDSQLIKLNLQPDAKGSYVEGLERYVNLGPIV 352
Query: 301 DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP 360
DFCVVDLERQGQGQVVTCSGAYKDGSLR+VRNGIGINEQASVELQGIKGMWSLRSSTDDP
Sbjct: 353 DFCVVDLERQGQGQVVTCSGAYKDGSLRVVRNGIGINEQASVELQGIKGMWSLRSSTDDP 412
Query: 361 FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLV 420
FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQ QTLFCHDA++NQLVQVTS SVRLV
Sbjct: 413 FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQVQTLFCHDAVHNQLVQVTSSSVRLV 472
Query: 421 SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEI 480
SST+RELRNEW +P GYSVNVATANA+QVLLATGGGHLVYLEIGDGIL EVKHAQLEYEI
Sbjct: 473 SSTTRELRNEWHAPSGYSVNVATANATQVLLATGGGHLVYLEIGDGILQEVKHAQLEYEI 532
Query: 481 SCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEG 540
SCLDINPIGENP++S +AAVGMWTDISVRIFSLPDL+LITKE LGGEIIPRSVLLCAFEG
Sbjct: 533 SCLDINPIGENPNHSHLAAVGMWTDISVRIFSLPDLSLITKEQLGGEIIPRSVLLCAFEG 592
Query: 541 ISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV 600
ISYLLCALGDGHLLNF+LN TGEL DRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV
Sbjct: 593 ISYLLCALGDGHLLNFMLNTSTGELADRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV 652
Query: 601 IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG 660
IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG
Sbjct: 653 IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG 712
Query: 661 EHPRRICHQEQSRTFAICSLK-NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSI 719
EH RRICHQEQSRTFAICSLK N + E+SEMHFVRLLDDQTFEFISTY LDT+EYGC I
Sbjct: 713 EHARRICHQEQSRTFAICSLKYNPASGEDSEMHFVRLLDDQTFEFISTYSLDTYEYGCFI 772
Query: 720 LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAF 779
+SCSFSDD+NVYYCVGTAYVLPEENEPTKGRILVF VEDGKLQLIAEKETKGAVY LNAF
Sbjct: 773 ISCSFSDDNNVYYCVGTAYVLPEENEPTKGRILVFAVEDGKLQLIAEKETKGAVYCLNAF 832
Query: 780 NGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISL 839
NGKLLAAINQKIQLYKW+LRDDGT ELQSECGHHGHILALYVQTRGDFIVVGDLMKSISL
Sbjct: 833 NGKLLAAINQKIQLYKWVLRDDGTHELQSECGHHGHILALYVQTRGDFIVVGDLMKSISL 892
Query: 840 LIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRL 899
LIYKHEEGAIEERARDYNANWMSAVEI+DDDIYLGAEN+FNLFTVRKNSEGATDEERGRL
Sbjct: 893 LIYKHEEGAIEERARDYNANWMSAVEIVDDDIYLGAENSFNLFTVRKNSEGATDEERGRL 952
Query: 900 EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEK 959
EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGT+NGVIGVIASLPHEQY+FLEK
Sbjct: 953 EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTINGVIGVIASLPHEQYVFLEK 1012
Query: 960 LQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN 1019
LQ+NLRKVIKGVGGL+HEQWRSFNNEKKTV+A+NFLDGDLIESFLDL+R++MDEISK ++
Sbjct: 1013 LQSNLRKVIKGVGGLSHEQWRSFNNEKKTVEARNFLDGDLIESFLDLNRSKMDEISKALD 1072
Query: 1020 VSVEELCKRVEELTRLH 1036
VSVEELCKRVEELTRLH
Sbjct: 1073 VSVEELCKRVEELTRLH 1089
>gi|356525401|ref|XP_003531313.1| PREDICTED: DNA damage-binding protein 1-like isoform 1 [Glycine max]
Length = 1089
Score = 1994 bits (5165), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 971/1037 (93%), Positives = 1010/1037 (97%), Gaps = 1/1037 (0%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
MLDVPIYGRIATLELFRPHGEAQD+LFIATERYKFCVLQWD+E+ EL+TRAMGDVSDRIG
Sbjct: 53 MLDVPIYGRIATLELFRPHGEAQDYLFIATERYKFCVLQWDSETGELVTRAMGDVSDRIG 112
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC
Sbjct: 113 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 172
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
+KPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII
Sbjct: 173 SKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 232
Query: 181 GEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL 240
GEETIVYCSANAFKAIPIRPSITKAYGRVD DGSRYLLGDH GL+ LLVI HEKEKVTGL
Sbjct: 233 GEETIVYCSANAFKAIPIRPSITKAYGRVDPDGSRYLLGDHTGLVSLLVIIHEKEKVTGL 292
Query: 241 KIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIV 300
KIE LGETSIASTISYLDNA VY+GSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIV
Sbjct: 293 KIEPLGETSIASTISYLDNAFVYVGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIV 352
Query: 301 DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP 360
DFCVVDLERQGQGQVVTCSGAYKDGSLR+VRNGIGINEQASVELQGIKGMWSLRSSTDDP
Sbjct: 353 DFCVVDLERQGQGQVVTCSGAYKDGSLRVVRNGIGINEQASVELQGIKGMWSLRSSTDDP 412
Query: 361 FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLV 420
FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQ QTLFCHDA++NQLVQVTS SVRLV
Sbjct: 413 FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQVQTLFCHDAVHNQLVQVTSSSVRLV 472
Query: 421 SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEI 480
SST+R+LRNEW +P GYSVNVATANA+QVLLATGGGHLVYLEIGDGIL EVKHAQLEYEI
Sbjct: 473 SSTTRDLRNEWHAPSGYSVNVATANATQVLLATGGGHLVYLEIGDGILQEVKHAQLEYEI 532
Query: 481 SCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEG 540
SCLDINPIGENP++S +AAVGMWTDISVRIFSLPDL+LITKE LGGEIIPRSVLLCAFEG
Sbjct: 533 SCLDINPIGENPNHSNLAAVGMWTDISVRIFSLPDLSLITKEQLGGEIIPRSVLLCAFEG 592
Query: 541 ISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV 600
ISYLLCALGDGHLLNF+LN TGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV
Sbjct: 593 ISYLLCALGDGHLLNFMLNTSTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV 652
Query: 601 IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG 660
IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG
Sbjct: 653 IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG 712
Query: 661 EHPRRICHQEQSRTFAICSLK-NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSI 719
EH RRICHQEQSRTFAICSLK N + E+SEMHFVRLLDDQTFEFISTY LDT+EYGC I
Sbjct: 713 EHARRICHQEQSRTFAICSLKYNPASGEDSEMHFVRLLDDQTFEFISTYSLDTYEYGCFI 772
Query: 720 LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAF 779
+SCSFSDD+NVYYCVGTAYVLPEENEPTKGRI+VF VEDGKLQLIAEKETKGAVY LNAF
Sbjct: 773 ISCSFSDDNNVYYCVGTAYVLPEENEPTKGRIIVFAVEDGKLQLIAEKETKGAVYCLNAF 832
Query: 780 NGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISL 839
NGKLLAAINQKIQLYKW+LRDDGT ELQSECGHHGHILALYVQTRGDFIVVGDLMKSISL
Sbjct: 833 NGKLLAAINQKIQLYKWVLRDDGTHELQSECGHHGHILALYVQTRGDFIVVGDLMKSISL 892
Query: 840 LIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRL 899
LIYKHEEGAIEERARDYNANWMSAVEI+DDDIYLGAEN+FNLFTVRKNSEGATDEERGRL
Sbjct: 893 LIYKHEEGAIEERARDYNANWMSAVEIVDDDIYLGAENSFNLFTVRKNSEGATDEERGRL 952
Query: 900 EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEK 959
EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGT+NGVIGVIASLPHEQY+FLEK
Sbjct: 953 EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTINGVIGVIASLPHEQYVFLEK 1012
Query: 960 LQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN 1019
LQ+NLRKVIKGVGGL+HEQWRSFNNEKKTV+A+NFLDGDLIESFLDL+R++MDEISK ++
Sbjct: 1013 LQSNLRKVIKGVGGLSHEQWRSFNNEKKTVEARNFLDGDLIESFLDLNRSKMDEISKAVD 1072
Query: 1020 VSVEELCKRVEELTRLH 1036
VSVEELCKRVEELTRLH
Sbjct: 1073 VSVEELCKRVEELTRLH 1089
>gi|312283457|dbj|BAJ34594.1| unnamed protein product [Thellungiella halophila]
Length = 1088
Score = 1983 bits (5137), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 958/1036 (92%), Positives = 1006/1036 (97%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
MLDVP+YGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG
Sbjct: 53 MLDVPMYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 112
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL+GC
Sbjct: 113 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLFGC 172
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
AKPTI VLYQDNKDARHVKTYEV+LKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII
Sbjct: 173 AKPTIAVLYQDNKDARHVKTYEVSLKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 232
Query: 181 GEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL 240
GEETIVYCSANAFKAIPIRPSITKAYGRVD DGSRYLLGDHAGL+HLLVITHEKEKVTGL
Sbjct: 233 GEETIVYCSANAFKAIPIRPSITKAYGRVDVDGSRYLLGDHAGLIHLLVITHEKEKVTGL 292
Query: 241 KIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIV 300
KIELLGETSIASTISYLDNAVV++GSSYGDSQL+KLNL PDAKGSYVEVLERYVNLGPIV
Sbjct: 293 KIELLGETSIASTISYLDNAVVFVGSSYGDSQLVKLNLHPDAKGSYVEVLERYVNLGPIV 352
Query: 301 DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP 360
DFCVVDLERQGQGQVVTCSGA+KDGSLRIVRNGIGINEQASVELQGIKGMWSL+SS D+
Sbjct: 353 DFCVVDLERQGQGQVVTCSGAFKDGSLRIVRNGIGINEQASVELQGIKGMWSLKSSIDEA 412
Query: 361 FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLV 420
FDTFLVVSFISETR+LAMNLEDELEETEIEGF SQ QTLFCHDA+YNQLVQVTS SVRLV
Sbjct: 413 FDTFLVVSFISETRVLAMNLEDELEETEIEGFLSQVQTLFCHDAVYNQLVQVTSNSVRLV 472
Query: 421 SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEI 480
SST+RELR+EW +P G++VNVATANASQVLLATGGGHLVYLEIGDG LTEV+HA LEYE+
Sbjct: 473 SSTTRELRDEWHAPAGFTVNVATANASQVLLATGGGHLVYLEIGDGKLTEVQHAVLEYEV 532
Query: 481 SCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEG 540
SCLDINPIG+NP+YSQ+A+VGMWTDISVRIFSLP+L LITKE LGGEIIPRSVLLCAFEG
Sbjct: 533 SCLDINPIGDNPNYSQLASVGMWTDISVRIFSLPELTLITKEQLGGEIIPRSVLLCAFEG 592
Query: 541 ISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV 600
ISYLLCALGDGHLLNF L+ TG+L DRKKVSLGTQPITLRTFSSKN THVFAASDRPTV
Sbjct: 593 ISYLLCALGDGHLLNFQLDTTTGQLKDRKKVSLGTQPITLRTFSSKNATHVFAASDRPTV 652
Query: 601 IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG 660
IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIA+EGELTIGTIDDIQKLHIR+IPLG
Sbjct: 653 IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAREGELTIGTIDDIQKLHIRTIPLG 712
Query: 661 EHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSIL 720
EH RRICHQEQ+RTF ICSL NQ+ AEESEMHFVRLLDDQ+FEF+STYPLD FEYGCSIL
Sbjct: 713 EHARRICHQEQTRTFGICSLGNQTNAEESEMHFVRLLDDQSFEFVSTYPLDAFEYGCSIL 772
Query: 721 SCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFN 780
SCSF+DD NVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKG+VYSLNAFN
Sbjct: 773 SCSFADDKNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGSVYSLNAFN 832
Query: 781 GKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLL 840
GKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLL
Sbjct: 833 GKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLL 892
Query: 841 IYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLE 900
IYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNL TV+KNSEGATDEERGRLE
Sbjct: 893 IYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLLTVKKNSEGATDEERGRLE 952
Query: 901 VVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKL 960
VVGEYHLGEFVNRFRHGSLVMRLPDS++GQIPTVIFGTVNGVIGVIASLP EQY+FLEKL
Sbjct: 953 VVGEYHLGEFVNRFRHGSLVMRLPDSEIGQIPTVIFGTVNGVIGVIASLPQEQYMFLEKL 1012
Query: 961 QTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV 1020
Q++LRKVIKGVGGL+HEQWRSFNNEK+T +A+NFLDGDLIESFLDLSR +M++ISK+MNV
Sbjct: 1013 QSSLRKVIKGVGGLSHEQWRSFNNEKRTAEARNFLDGDLIESFLDLSRNKMEDISKSMNV 1072
Query: 1021 SVEELCKRVEELTRLH 1036
VEELCKRVEELTRLH
Sbjct: 1073 QVEELCKRVEELTRLH 1088
>gi|297809743|ref|XP_002872755.1| UV-damaged DNA-binding protein 1A [Arabidopsis lyrata subsp. lyrata]
gi|297318592|gb|EFH49014.1| UV-damaged DNA-binding protein 1A [Arabidopsis lyrata subsp. lyrata]
Length = 1088
Score = 1982 bits (5134), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 958/1036 (92%), Positives = 1006/1036 (97%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG
Sbjct: 53 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 112
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL+GC
Sbjct: 113 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLFGC 172
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
AKPTI VLYQDNKDARHVKTYEV+LKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII
Sbjct: 173 AKPTIAVLYQDNKDARHVKTYEVSLKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 232
Query: 181 GEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL 240
GEETIVYCSANAFKAIPIRPSITKAYGRVD DGSRYLLGDHAGL+HLLVITHEKEKVTGL
Sbjct: 233 GEETIVYCSANAFKAIPIRPSITKAYGRVDVDGSRYLLGDHAGLIHLLVITHEKEKVTGL 292
Query: 241 KIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIV 300
KIELLGETSIASTISYLDNAVV++GSSYGDSQL+KLNL PDAKGSYVEVLERY+NLGPIV
Sbjct: 293 KIELLGETSIASTISYLDNAVVFVGSSYGDSQLVKLNLHPDAKGSYVEVLERYINLGPIV 352
Query: 301 DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP 360
DFCVVDLERQGQGQVVTCSGA+KDGSLR+VRNGIGINEQASVELQGIKGMWSL+SS D+
Sbjct: 353 DFCVVDLERQGQGQVVTCSGAFKDGSLRVVRNGIGINEQASVELQGIKGMWSLKSSIDEA 412
Query: 361 FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLV 420
FDTFLVVSFISETRILAMNLEDELEETEIEGF SQ QTLFCHDA+YNQLVQVTS SVRLV
Sbjct: 413 FDTFLVVSFISETRILAMNLEDELEETEIEGFLSQVQTLFCHDAVYNQLVQVTSNSVRLV 472
Query: 421 SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEI 480
SST+RELR+EW +P G++VNVATANASQVLLATGGGHLVYLEIGDG LTEV+HA LEYE+
Sbjct: 473 SSTTRELRDEWHAPAGFTVNVATANASQVLLATGGGHLVYLEIGDGKLTEVQHALLEYEV 532
Query: 481 SCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEG 540
SCLDINPIG+NP+YSQ+A+VGMWTDISVRIFSLP+L LITKE LGGEIIPRSVLLCAFEG
Sbjct: 533 SCLDINPIGDNPNYSQLASVGMWTDISVRIFSLPELTLITKEQLGGEIIPRSVLLCAFEG 592
Query: 541 ISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV 600
ISYLLCALGDGHLLNF L+ TG+L DRKKVSLGTQPITLRTFSSK+ THVFAASDRPTV
Sbjct: 593 ISYLLCALGDGHLLNFQLDTTTGQLKDRKKVSLGTQPITLRTFSSKSATHVFAASDRPTV 652
Query: 601 IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG 660
IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIA+EGELTIGTIDDIQKLHIR+IPLG
Sbjct: 653 IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAREGELTIGTIDDIQKLHIRTIPLG 712
Query: 661 EHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSIL 720
EH RRICHQEQ+RTF ICSL NQS AEESEMHFVRLLDDQTFEF+STYPLD+FEYGCSIL
Sbjct: 713 EHARRICHQEQTRTFGICSLGNQSNAEESEMHFVRLLDDQTFEFMSTYPLDSFEYGCSIL 772
Query: 721 SCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFN 780
SCSF+DD NVYYCVGTAYVLPEENEPTKGRILVFIVEDG+LQLIAEKETKGAVYSLNAFN
Sbjct: 773 SCSFTDDKNVYYCVGTAYVLPEENEPTKGRILVFIVEDGRLQLIAEKETKGAVYSLNAFN 832
Query: 781 GKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLL 840
GKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLL
Sbjct: 833 GKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLL 892
Query: 841 IYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLE 900
+YKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNL TV+KNSEGATDEERGRLE
Sbjct: 893 LYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLVTVKKNSEGATDEERGRLE 952
Query: 901 VVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKL 960
VVGEYHLGEFVNRFRHGSLVMRLPDS++GQIPTVIFGTVNGVIGVIASLP EQY FLEKL
Sbjct: 953 VVGEYHLGEFVNRFRHGSLVMRLPDSEIGQIPTVIFGTVNGVIGVIASLPQEQYTFLEKL 1012
Query: 961 QTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV 1020
Q++LRKVIKGVGGL+HEQWRSFNNEK+T +A+NFLDGDLIESFLDLSR +M++ISK+MNV
Sbjct: 1013 QSSLRKVIKGVGGLSHEQWRSFNNEKRTAEARNFLDGDLIESFLDLSRNKMEDISKSMNV 1072
Query: 1021 SVEELCKRVEELTRLH 1036
VEELCKRVEELTRLH
Sbjct: 1073 QVEELCKRVEELTRLH 1088
>gi|225443992|ref|XP_002280744.1| PREDICTED: DNA damage-binding protein 1 isoform 2 [Vitis vinifera]
Length = 1068
Score = 1977 bits (5121), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 972/1037 (93%), Positives = 999/1037 (96%), Gaps = 22/1037 (2%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAE+SE+ITRAMGDVSDRIG
Sbjct: 53 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAENSEVITRAMGDVSDRIG 112
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RPTDNGQ VIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC
Sbjct: 113 RPTDNGQ---------------------VIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 151
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
+KPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPW+QNNLDNGADLLIPVPPPLCGVLII
Sbjct: 152 SKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWAQNNLDNGADLLIPVPPPLCGVLII 211
Query: 181 GEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL 240
GEETIVYCSA+AFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL
Sbjct: 212 GEETIVYCSASAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL 271
Query: 241 KIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIV 300
KIELLGETSIASTISYLDNA VY+GSSYGDSQLIK++LQPDAKGSYVEVLERYVNLGPIV
Sbjct: 272 KIELLGETSIASTISYLDNAFVYVGSSYGDSQLIKIHLQPDAKGSYVEVLERYVNLGPIV 331
Query: 301 DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP 360
DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP
Sbjct: 332 DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP 391
Query: 361 FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLV 420
DTFLVVSFISETRILAMNLEDELEETEIEGFCSQ QTLFCHDA+Y+QLVQVTS SVRLV
Sbjct: 392 HDTFLVVSFISETRILAMNLEDELEETEIEGFCSQVQTLFCHDAVYDQLVQVTSSSVRLV 451
Query: 421 SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEI 480
STSRELRNEWK+P GYSVNVATANA+QVLLATGGGHLVYLEIGDG LTEVKHAQLEY+I
Sbjct: 452 GSTSRELRNEWKAPSGYSVNVATANATQVLLATGGGHLVYLEIGDGTLTEVKHAQLEYDI 511
Query: 481 SCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEG 540
SCLDINPIGENP++SQ+AAVGMWTDISVRIFSLPDLNLITKE+LGGEIIPRSVLLC+FEG
Sbjct: 512 SCLDINPIGENPNFSQLAAVGMWTDISVRIFSLPDLNLITKEYLGGEIIPRSVLLCSFEG 571
Query: 541 ISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV 600
I YLLCALGDGHLLNFLLNM TGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV
Sbjct: 572 IPYLLCALGDGHLLNFLLNMSTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV 631
Query: 601 IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG 660
IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEG+LTIGTIDDIQKLHIRSIPLG
Sbjct: 632 IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGDLTIGTIDDIQKLHIRSIPLG 691
Query: 661 EHPRRICHQEQSRTFAICSLK-NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSI 719
EH RRICHQEQSRTFAICSLK NQS E+SEMHF+RLLDDQTFEFISTYPLDTFEYGCSI
Sbjct: 692 EHARRICHQEQSRTFAICSLKYNQSSTEDSEMHFIRLLDDQTFEFISTYPLDTFEYGCSI 751
Query: 720 LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAF 779
LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAF
Sbjct: 752 LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAF 811
Query: 780 NGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISL 839
NGKLLAAINQKIQLYKWMLRDDGTRELQSE GHHGHILALYVQTRGDFIVVGDLMKSISL
Sbjct: 812 NGKLLAAINQKIQLYKWMLRDDGTRELQSESGHHGHILALYVQTRGDFIVVGDLMKSISL 871
Query: 840 LIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRL 899
LIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFN+FTVRKNSEGATDEERGRL
Sbjct: 872 LIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNIFTVRKNSEGATDEERGRL 931
Query: 900 EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEK 959
EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPH+QY+FLEK
Sbjct: 932 EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHDQYVFLEK 991
Query: 960 LQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN 1019
LQ NLRKVIKGVGGL+HEQWRSFNNEKKTVDAKNFLDGDLIE+FLDL+RTRMDEISK M
Sbjct: 992 LQANLRKVIKGVGGLSHEQWRSFNNEKKTVDAKNFLDGDLIETFLDLNRTRMDEISKAMA 1051
Query: 1020 VSVEELCKRVEELTRLH 1036
VSVEELCKRVEELTRLH
Sbjct: 1052 VSVEELCKRVEELTRLH 1068
>gi|15235577|ref|NP_192451.1| DNA damage-binding protein 1a [Arabidopsis thaliana]
gi|55976605|sp|Q9M0V3.1|DDB1A_ARATH RecName: Full=DNA damage-binding protein 1a; AltName: Full=UV-damaged
DNA-binding protein 1a; Short=DDB1a
gi|7267302|emb|CAB81084.1| UV-damaged DNA binding factor-like protein [Arabidopsis thaliana]
gi|25054828|gb|AAN71904.1| putative UV-damaged DNA binding factor [Arabidopsis thaliana]
gi|332657117|gb|AEE82517.1| DNA damage-binding protein 1a [Arabidopsis thaliana]
Length = 1088
Score = 1974 bits (5114), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 952/1036 (91%), Positives = 1005/1036 (97%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWD ESSELITRAMGDVSDRIG
Sbjct: 53 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDPESSELITRAMGDVSDRIG 112
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL+GC
Sbjct: 113 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLFGC 172
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
AKPTI VLYQDNKDARHVKTYEV+LKDKDFVEGPWSQN+LDNGADLLIPVPPPLCGVLII
Sbjct: 173 AKPTIAVLYQDNKDARHVKTYEVSLKDKDFVEGPWSQNSLDNGADLLIPVPPPLCGVLII 232
Query: 181 GEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL 240
GEETIVYCSA+AFKAIPIRPSITKAYGRVD DGSRYLLGDHAG++HLLVITHEKEKVTGL
Sbjct: 233 GEETIVYCSASAFKAIPIRPSITKAYGRVDVDGSRYLLGDHAGMIHLLVITHEKEKVTGL 292
Query: 241 KIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIV 300
KIELLGETSIASTISYLDNAVV++GSSYGDSQL+KLNL PDAKGSYVEVLERY+NLGPIV
Sbjct: 293 KIELLGETSIASTISYLDNAVVFVGSSYGDSQLVKLNLHPDAKGSYVEVLERYINLGPIV 352
Query: 301 DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP 360
DFCVVDLERQGQGQVVTCSGA+KDGSLR+VRNGIGINEQASVELQGIKGMWSL+SS D+
Sbjct: 353 DFCVVDLERQGQGQVVTCSGAFKDGSLRVVRNGIGINEQASVELQGIKGMWSLKSSIDEA 412
Query: 361 FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLV 420
FDTFLVVSFISETRILAMNLEDELEETEIEGF SQ QTLFCHDA+YNQLVQVTS SVRLV
Sbjct: 413 FDTFLVVSFISETRILAMNLEDELEETEIEGFLSQVQTLFCHDAVYNQLVQVTSNSVRLV 472
Query: 421 SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEI 480
SST+RELR+EW +P G++VNVATANASQVLLATGGGHLVYLEIGDG LTEV+HA LEYE+
Sbjct: 473 SSTTRELRDEWHAPAGFTVNVATANASQVLLATGGGHLVYLEIGDGKLTEVQHALLEYEV 532
Query: 481 SCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEG 540
SCLDINPIG+NP+YSQ+AAVGMWTDISVRIFSLP+L LITKE LGGEIIPRSVLLCAFEG
Sbjct: 533 SCLDINPIGDNPNYSQLAAVGMWTDISVRIFSLPELTLITKEQLGGEIIPRSVLLCAFEG 592
Query: 541 ISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV 600
ISYLLCALGDGHLLNF ++ TG+L DRKKVSLGTQPITLRTFSSK+ THVFAASDRPTV
Sbjct: 593 ISYLLCALGDGHLLNFQMDTTTGQLKDRKKVSLGTQPITLRTFSSKSATHVFAASDRPTV 652
Query: 601 IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG 660
IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIA+EGELTIGTIDDIQKLHIR+IPLG
Sbjct: 653 IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAREGELTIGTIDDIQKLHIRTIPLG 712
Query: 661 EHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSIL 720
EH RRICHQEQ+RTF ICSL NQS +EESEMHFVRLLDDQTFEF+STYPLD+FEYGCSIL
Sbjct: 713 EHARRICHQEQTRTFGICSLGNQSNSEESEMHFVRLLDDQTFEFMSTYPLDSFEYGCSIL 772
Query: 721 SCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFN 780
SCSF++D NVYYCVGTAYVLPEENEPTKGRILVFIVEDG+LQLIAEKETKGAVYSLNAFN
Sbjct: 773 SCSFTEDKNVYYCVGTAYVLPEENEPTKGRILVFIVEDGRLQLIAEKETKGAVYSLNAFN 832
Query: 781 GKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLL 840
GKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLL
Sbjct: 833 GKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLL 892
Query: 841 IYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLE 900
+YKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNL TV+KNSEGATDEERGRLE
Sbjct: 893 LYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLLTVKKNSEGATDEERGRLE 952
Query: 901 VVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKL 960
VVGEYHLGEFVNRFRHGSLVMRLPDS++GQIPTVIFGTVNGVIGVIASLP EQY FLEKL
Sbjct: 953 VVGEYHLGEFVNRFRHGSLVMRLPDSEIGQIPTVIFGTVNGVIGVIASLPQEQYTFLEKL 1012
Query: 961 QTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV 1020
Q++LRKVIKGVGGL+HEQWRSFNNEK+T +A+NFLDGDLIESFLDLSR +M++ISK+MNV
Sbjct: 1013 QSSLRKVIKGVGGLSHEQWRSFNNEKRTAEARNFLDGDLIESFLDLSRNKMEDISKSMNV 1072
Query: 1021 SVEELCKRVEELTRLH 1036
VEELCKRVEELTRLH
Sbjct: 1073 QVEELCKRVEELTRLH 1088
>gi|350537001|ref|NP_001234275.1| DNA damage-binding protein 1 [Solanum lycopersicum]
gi|350539125|ref|NP_001233864.1| UV damaged DNA binding protein 1 [Solanum lycopersicum]
gi|55976440|sp|Q6QNU4.1|DDB1_SOLLC RecName: Full=DNA damage-binding protein 1; AltName: Full=High
pigmentation protein 1; AltName: Full=UV-damaged
DNA-binding protein 1
gi|38455768|gb|AAR20885.1| UV damaged DNA binding protein 1 [Solanum lycopersicum]
gi|42602165|gb|AAS21683.1| UV-damaged DNA binding protein 1 [Solanum lycopersicum]
Length = 1090
Score = 1967 bits (5095), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 959/1038 (92%), Positives = 1004/1038 (96%), Gaps = 2/1038 (0%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
MLDVPIYGRIATLELFRPHGE QD LFIATERYKFCVLQWD E+SE+ITRAMGDVSDRIG
Sbjct: 53 MLDVPIYGRIATLELFRPHGETQDLLFIATERYKFCVLQWDTEASEVITRAMGDVSDRIG 112
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC
Sbjct: 113 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 172
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
KPTIVVLYQDNKDARHVKTYEV+LKDKDF+EGPW+QNNLDNGA LLIPVPPPLCGVLII
Sbjct: 173 PKPTIVVLYQDNKDARHVKTYEVSLKDKDFIEGPWAQNNLDNGASLLIPVPPPLCGVLII 232
Query: 181 GEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL 240
GEETIVYCSA+AFKAIPIRPSIT+AYGRVDADGSRYLLGDH GLLHLLVITHEKEKVTGL
Sbjct: 233 GEETIVYCSASAFKAIPIRPSITRAYGRVDADGSRYLLGDHNGLLHLLVITHEKEKVTGL 292
Query: 241 KIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIV 300
KIELLGETSIASTISYLDNA V+IGSSYGDSQL+KLNLQPD KGSYVEVLERYVNLGPIV
Sbjct: 293 KIELLGETSIASTISYLDNAFVFIGSSYGDSQLVKLNLQPDTKGSYVEVLERYVNLGPIV 352
Query: 301 DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP 360
DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRS+TDDP
Sbjct: 353 DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSATDDP 412
Query: 361 FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLV 420
+DTFLVVSFISETR+LAMNLEDELEETEIEGF SQ QTLFCHDA+YNQLVQVTS SVRLV
Sbjct: 413 YDTFLVVSFISETRVLAMNLEDELEETEIEGFNSQVQTLFCHDAVYNQLVQVTSNSVRLV 472
Query: 421 SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEI 480
SSTSR+L+NEW +P GYSVNVATANA+QVLLATGGGHLVYLEIGDG+L EVK+A+L+Y+I
Sbjct: 473 SSTSRDLKNEWFAPVGYSVNVATANATQVLLATGGGHLVYLEIGDGVLNEVKYAKLDYDI 532
Query: 481 SCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEG 540
SCLDINPIGENP+YS IAAVGMWTDISVRI+SLPDLNLITKE LGGEIIPRSVL+C+FEG
Sbjct: 533 SCLDINPIGENPNYSNIAAVGMWTDISVRIYSLPDLNLITKEQLGGEIIPRSVLMCSFEG 592
Query: 541 ISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV 600
ISYLLCALGDGHLLNF+L+M TGELTDRKKVSLGTQPITLRTFSSK+TTHVFAASDRPTV
Sbjct: 593 ISYLLCALGDGHLLNFVLSMSTGELTDRKKVSLGTQPITLRTFSSKDTTHVFAASDRPTV 652
Query: 601 IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG 660
IYSSNKKLLYSNVNLKEVSHMCPFN AAFPDSLAIAKEGELTIGTID+IQKLHIRSIPLG
Sbjct: 653 IYSSNKKLLYSNVNLKEVSHMCPFNVAAFPDSLAIAKEGELTIGTIDEIQKLHIRSIPLG 712
Query: 661 EHPRRICHQEQSRTFAICSLK-NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSI 719
EH RRI HQEQ+RTFA+CS+K QS A++ EMHFVRLLDDQTFEFISTYPLD FEYGCSI
Sbjct: 713 EHARRISHQEQTRTFALCSVKYTQSNADDPEMHFVRLLDDQTFEFISTYPLDQFEYGCSI 772
Query: 720 LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAF 779
LSCSFSDDSNVYYC+GTAYV+PEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAF
Sbjct: 773 LSCSFSDDSNVYYCIGTAYVMPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAF 832
Query: 780 NGKLLAAINQKIQLYKWMLRDD-GTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSIS 838
NGKLLAAINQKIQLYKW R+D G+RELQ+ECGHHGHILALYVQTRGDFIVVGDLMKSIS
Sbjct: 833 NGKLLAAINQKIQLYKWASREDGGSRELQTECGHHGHILALYVQTRGDFIVVGDLMKSIS 892
Query: 839 LLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGR 898
LLI+KHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEER R
Sbjct: 893 LLIFKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERSR 952
Query: 899 LEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLE 958
LEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPH+QYLFLE
Sbjct: 953 LEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHDQYLFLE 1012
Query: 959 KLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTM 1018
KLQTNLRKVIKGVGGL+HEQWRSF NEKKTVDAKNFLDGDLIESFLDLSR RM+EISK M
Sbjct: 1013 KLQTNLRKVIKGVGGLSHEQWRSFYNEKKTVDAKNFLDGDLIESFLDLSRNRMEEISKAM 1072
Query: 1019 NVSVEELCKRVEELTRLH 1036
+V VEEL KRVEELTRLH
Sbjct: 1073 SVPVEELMKRVEELTRLH 1090
>gi|55976392|sp|Q6E7D1.1|DDB1_SOLCE RecName: Full=DNA damage-binding protein 1; AltName: Full=UV-damaged
DNA-binding protein 1
gi|49484911|gb|AAT66742.1| UV-damaged DNA binding protein 1 [Solanum cheesmaniae]
Length = 1095
Score = 1964 bits (5088), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 959/1038 (92%), Positives = 1004/1038 (96%), Gaps = 2/1038 (0%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
MLDVPIYGRIATLELFRPHGE QD LFIATERYKFCVLQWD E+SE+ITRAMGDVSDRIG
Sbjct: 58 MLDVPIYGRIATLELFRPHGETQDLLFIATERYKFCVLQWDTEASEVITRAMGDVSDRIG 117
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC
Sbjct: 118 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 177
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
KPTIVVLYQDNKDARHVKTYEV+LKDKDF+EGPW+QNNLDNGA LLIPVPPPLCGVLII
Sbjct: 178 PKPTIVVLYQDNKDARHVKTYEVSLKDKDFIEGPWAQNNLDNGASLLIPVPPPLCGVLII 237
Query: 181 GEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL 240
GEETIVYCSA+AFKAIPIRPSIT+AYGRVDADGSRYLLGDH GLLHLLVITHEKEKVTGL
Sbjct: 238 GEETIVYCSASAFKAIPIRPSITRAYGRVDADGSRYLLGDHNGLLHLLVITHEKEKVTGL 297
Query: 241 KIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIV 300
KIELLGETSIASTISYLDNA V+IGSSYGDSQL+KLNLQPD KGSYVEVLERYVNLGPIV
Sbjct: 298 KIELLGETSIASTISYLDNAFVFIGSSYGDSQLVKLNLQPDTKGSYVEVLERYVNLGPIV 357
Query: 301 DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP 360
DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRS+TDDP
Sbjct: 358 DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSATDDP 417
Query: 361 FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLV 420
+DTFLVVSFISETR+LAMNLEDELEETEIEGF SQ QTLFCHDA+YNQLVQVTS SVRLV
Sbjct: 418 YDTFLVVSFISETRVLAMNLEDELEETEIEGFNSQVQTLFCHDAVYNQLVQVTSNSVRLV 477
Query: 421 SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEI 480
SSTSR+L+NEW +P GYSVNVATANA+QVLLATGGGHLVYLEIGDG+L EVK+A+L+Y+I
Sbjct: 478 SSTSRDLKNEWFAPVGYSVNVATANATQVLLATGGGHLVYLEIGDGVLNEVKYAKLDYDI 537
Query: 481 SCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEG 540
SCLDINPIGENP+YS IAAVGMWTDISVRI+SLPDLNLITKE LGGEIIPRSVL+C+FEG
Sbjct: 538 SCLDINPIGENPNYSNIAAVGMWTDISVRIYSLPDLNLITKEQLGGEIIPRSVLMCSFEG 597
Query: 541 ISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV 600
ISYLLCALGDGHLLNF+L+M TGELTDRKKVSLGTQPITLRTFSSK+TTHVFAASDRPTV
Sbjct: 598 ISYLLCALGDGHLLNFVLSMSTGELTDRKKVSLGTQPITLRTFSSKDTTHVFAASDRPTV 657
Query: 601 IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG 660
IYSSNKKLLYSNVNLKEVSHMCPFN AAFPDSLAIAKEGELTIGTID+IQKLHIRSIPLG
Sbjct: 658 IYSSNKKLLYSNVNLKEVSHMCPFNVAAFPDSLAIAKEGELTIGTIDEIQKLHIRSIPLG 717
Query: 661 EHPRRICHQEQSRTFAICSLK-NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSI 719
EH RRI HQEQ+RTFA+CS+K QS A++ EMHFVRLLDDQTFEFISTYPLD FEYGCSI
Sbjct: 718 EHARRISHQEQTRTFALCSVKYTQSNADDPEMHFVRLLDDQTFEFISTYPLDQFEYGCSI 777
Query: 720 LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAF 779
LSCSFSDDSNVYYC+GTAYV+PEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAF
Sbjct: 778 LSCSFSDDSNVYYCIGTAYVMPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAF 837
Query: 780 NGKLLAAINQKIQLYKWMLRDD-GTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSIS 838
NGKLLAAINQKIQLYKW R+D G+RELQ+ECGHHGHILALYVQTRGDFIVVGDLMKSIS
Sbjct: 838 NGKLLAAINQKIQLYKWASREDGGSRELQTECGHHGHILALYVQTRGDFIVVGDLMKSIS 897
Query: 839 LLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGR 898
LLI+KHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEER R
Sbjct: 898 LLIFKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERSR 957
Query: 899 LEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLE 958
LEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPH+QYLFLE
Sbjct: 958 LEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHDQYLFLE 1017
Query: 959 KLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTM 1018
KLQTNLRKVIKGVGGL+HEQWRSF NEKKTVDAKNFLDGDLIESFLDLSR RM+EISK M
Sbjct: 1018 KLQTNLRKVIKGVGGLSHEQWRSFYNEKKTVDAKNFLDGDLIESFLDLSRNRMEEISKAM 1077
Query: 1019 NVSVEELCKRVEELTRLH 1036
+V VEEL KRVEELTRLH
Sbjct: 1078 SVPVEELMKRVEELTRLH 1095
>gi|449435512|ref|XP_004135539.1| PREDICTED: DNA damage-binding protein 1-like [Cucumis sativus]
Length = 1093
Score = 1952 bits (5057), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 950/1041 (91%), Positives = 1000/1041 (96%), Gaps = 5/1041 (0%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWD ESSELITRAMGDVSDRIG
Sbjct: 53 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDTESSELITRAMGDVSDRIG 112
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RPTD+GQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC
Sbjct: 113 RPTDSGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 172
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
++PTIVVLYQDNKDARHVKTYEV LKDKDFVEGPWSQNNLDNGA +LIPVPPPLCGV+II
Sbjct: 173 SRPTIVVLYQDNKDARHVKTYEVVLKDKDFVEGPWSQNNLDNGAAVLIPVPPPLCGVIII 232
Query: 181 GEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL 240
GEETIVYCSA AFKAIP+RPSIT+AYGRVDADGSRYLLGDHAGLLHLLVITHEKE+VTGL
Sbjct: 233 GEETIVYCSATAFKAIPVRPSITRAYGRVDADGSRYLLGDHAGLLHLLVITHEKERVTGL 292
Query: 241 KIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIV 300
KIELLGETSIASTISYLDNA VYIGSSYGDSQL+KLN+QPDAKGSYVEVLERYVNLGPIV
Sbjct: 293 KIELLGETSIASTISYLDNAFVYIGSSYGDSQLVKLNVQPDAKGSYVEVLERYVNLGPIV 352
Query: 301 DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP 360
DFCVVDLERQGQGQVVTCSGAYKDGSLR+VRNGIGINEQASVELQGIKGMWSLRSSTDDP
Sbjct: 353 DFCVVDLERQGQGQVVTCSGAYKDGSLRVVRNGIGINEQASVELQGIKGMWSLRSSTDDP 412
Query: 361 FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLV 420
FDTFLVVSFISETRILAMNLEDELEETEIEGF SQ QTLFCHDA++NQLVQVTS SVRLV
Sbjct: 413 FDTFLVVSFISETRILAMNLEDELEETEIEGFNSQVQTLFCHDALFNQLVQVTSSSVRLV 472
Query: 421 SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEI 480
SST+REL NEW +P YS+NVATANASQVLLATGGG LV+LEI DG+L E KH QLE+EI
Sbjct: 473 SSTTRELLNEWNAPSNYSINVATANASQVLLATGGGVLVHLEICDGLLVEKKHIQLEHEI 532
Query: 481 SCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEG 540
SCLDINPIG+NP+ SQ+AAVGMWTDISVRIFSLPDLNL+TKE LGGEIIPRSVLLC FEG
Sbjct: 533 SCLDINPIGDNPNCSQLAAVGMWTDISVRIFSLPDLNLLTKEQLGGEIIPRSVLLCTFEG 592
Query: 541 ISYLLCALGDGHLLNFLLNMKTG----ELTDRKKVSLGTQPITLRTFSSKNTTHVFAASD 596
ISYLLCALGDGHLLNF+LN + EL DRKKVSLGTQPITLRTFSSKN THVFAASD
Sbjct: 593 ISYLLCALGDGHLLNFILNTNSNSNSCELMDRKKVSLGTQPITLRTFSSKNATHVFAASD 652
Query: 597 RPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRS 656
RPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRS
Sbjct: 653 RPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRS 712
Query: 657 IPLGEHPRRICHQEQSRTFAICSLK-NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEY 715
IPLGEH RRICHQEQSRTFAICSL+ NQS E++EMHF+RLLDDQTFE ISTY LDT+EY
Sbjct: 713 IPLGEHARRICHQEQSRTFAICSLRYNQSGTEDTEMHFIRLLDDQTFESISTYALDTYEY 772
Query: 716 GCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYS 775
GCSILSCSFSDD+NVYYCVGTAYV+PEENEPTKGRILVF+VE+GKLQLIAEKETKG+VYS
Sbjct: 773 GCSILSCSFSDDNNVYYCVGTAYVMPEENEPTKGRILVFVVEEGKLQLIAEKETKGSVYS 832
Query: 776 LNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMK 835
LNAFNGKLLAAINQKIQLYKW LRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMK
Sbjct: 833 LNAFNGKLLAAINQKIQLYKWTLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMK 892
Query: 836 SISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEE 895
SISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAEN FNLFTVRKNSEGATDEE
Sbjct: 893 SISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENYFNLFTVRKNSEGATDEE 952
Query: 896 RGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYL 955
R RLEVVGEYHLGEFVNRF+HGSLVMRLPDSDVGQIPTVIFG+VNGVIGVIASLPH+QY+
Sbjct: 953 RSRLEVVGEYHLGEFVNRFQHGSLVMRLPDSDVGQIPTVIFGSVNGVIGVIASLPHDQYV 1012
Query: 956 FLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEIS 1015
FLE+LQ+NLRKVIKGVGGL+HEQWRSFNNEK+T +AKNFLDGDLIESFLDL+R++M+EIS
Sbjct: 1013 FLERLQSNLRKVIKGVGGLSHEQWRSFNNEKRTAEAKNFLDGDLIESFLDLNRSKMEEIS 1072
Query: 1016 KTMNVSVEELCKRVEELTRLH 1036
+ M+VS EELCKRVEELTRLH
Sbjct: 1073 RAMSVSAEELCKRVEELTRLH 1093
>gi|297799958|ref|XP_002867863.1| hypothetical protein ARALYDRAFT_492777 [Arabidopsis lyrata subsp.
lyrata]
gi|297313699|gb|EFH44122.1| hypothetical protein ARALYDRAFT_492777 [Arabidopsis lyrata subsp.
lyrata]
Length = 1088
Score = 1941 bits (5029), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 935/1036 (90%), Positives = 995/1036 (96%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+LDVP+YGRIATLELFRPHGEAQDFLF+ATERYKFCVLQWD ESSELITRAMGDVSDRIG
Sbjct: 53 ILDVPLYGRIATLELFRPHGEAQDFLFVATERYKFCVLQWDYESSELITRAMGDVSDRIG 112
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC
Sbjct: 113 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 172
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
KPTI VLYQDNKDARHVKTYEV+LK+KDFVEGPWSQNNLDNGADLLIPVP PLCGVLII
Sbjct: 173 TKPTIAVLYQDNKDARHVKTYEVSLKEKDFVEGPWSQNNLDNGADLLIPVPSPLCGVLII 232
Query: 181 GEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL 240
GEETIVYCSANAFKAIPIRPSITKAYGRVD DGSRYLLGDH+GL+HLLVITHEKEKVTGL
Sbjct: 233 GEETIVYCSANAFKAIPIRPSITKAYGRVDLDGSRYLLGDHSGLIHLLVITHEKEKVTGL 292
Query: 241 KIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIV 300
KIELLGETSIAS+ISYLDNAVV++GSSYGDSQLIKLNLQPDA GSYVE+LE+YVNLGPIV
Sbjct: 293 KIELLGETSIASSISYLDNAVVFVGSSYGDSQLIKLNLQPDATGSYVEILEKYVNLGPIV 352
Query: 301 DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP 360
DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL+SS D+
Sbjct: 353 DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLKSSIDEA 412
Query: 361 FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLV 420
FDTFLVVSFISETRILAMN+EDELEETEIEGF SQ QTLFCHDA+YNQLVQVTS SVRLV
Sbjct: 413 FDTFLVVSFISETRILAMNIEDELEETEIEGFLSQVQTLFCHDAVYNQLVQVTSNSVRLV 472
Query: 421 SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEI 480
SST+RELRN+W +P G++VNVATANASQVLLATGGGHLVYLEIGDG LTEVKH LEYE+
Sbjct: 473 SSTTRELRNKWDAPAGFAVNVATANASQVLLATGGGHLVYLEIGDGTLTEVKHVLLEYEV 532
Query: 481 SCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEG 540
SCLDINPIG+NP+YSQ+AAVG+WTDISVRIF LPDL LITKE LGGEIIPRSVLLCAFEG
Sbjct: 533 SCLDINPIGDNPNYSQLAAVGLWTDISVRIFVLPDLTLITKEQLGGEIIPRSVLLCAFEG 592
Query: 541 ISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV 600
ISYLLCALGDGHLLNF L+ +G+L DRKKVSLGTQPITLRTFSSK+ THVFAASDRP V
Sbjct: 593 ISYLLCALGDGHLLNFQLDTSSGKLRDRKKVSLGTQPITLRTFSSKSATHVFAASDRPAV 652
Query: 601 IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG 660
IYS+NKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIA+EGELTIGTIDDIQKLHIR+IP+G
Sbjct: 653 IYSNNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAREGELTIGTIDDIQKLHIRTIPIG 712
Query: 661 EHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSIL 720
EH RRICHQEQ+RTFAIC L+NQ AEESEMHFVRLLD Q+FEF+STYPLD FEYGCSIL
Sbjct: 713 EHARRICHQEQTRTFAICCLRNQPSAEESEMHFVRLLDAQSFEFLSTYPLDAFEYGCSIL 772
Query: 721 SCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFN 780
SCSF+DD NVYYCVGTAYVLPEENEPTKGRILVFIVE+G+LQLI EKETKGAVYSLNAFN
Sbjct: 773 SCSFTDDKNVYYCVGTAYVLPEENEPTKGRILVFIVEEGRLQLITEKETKGAVYSLNAFN 832
Query: 781 GKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLL 840
GKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLL
Sbjct: 833 GKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLL 892
Query: 841 IYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLE 900
IYKHEEGAIEERARDYNANWM+AVEILDDDIYLGA+N FNLFTV+KN+EGATDEER R+E
Sbjct: 893 IYKHEEGAIEERARDYNANWMAAVEILDDDIYLGADNCFNLFTVKKNNEGATDEERARME 952
Query: 901 VVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKL 960
VVGEYH+GEFVNRFRHGSLVMRLPDS++GQIPTVIFGTV+G+IGVIASLP EQY FLEKL
Sbjct: 953 VVGEYHIGEFVNRFRHGSLVMRLPDSEIGQIPTVIFGTVSGMIGVIASLPQEQYAFLEKL 1012
Query: 961 QTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV 1020
QT+LRKVIKGVGGL+HEQWRSFNNEK+T +AK++LDGDLIESFLDLSR +M+EISK M+V
Sbjct: 1013 QTSLRKVIKGVGGLSHEQWRSFNNEKRTAEAKSYLDGDLIESFLDLSRGKMEEISKGMDV 1072
Query: 1021 SVEELCKRVEELTRLH 1036
VEELCKRVEELTRLH
Sbjct: 1073 QVEELCKRVEELTRLH 1088
>gi|356512638|ref|XP_003525025.1| PREDICTED: DNA damage-binding protein 1a-like isoform 2 [Glycine max]
Length = 1068
Score = 1939 bits (5022), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 952/1037 (91%), Positives = 989/1037 (95%), Gaps = 22/1037 (2%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
MLDVPIYGRIATLELFRPHGEAQD+LFIATERYKFCVLQWD+E++EL+TRAMGDVSDRIG
Sbjct: 53 MLDVPIYGRIATLELFRPHGEAQDYLFIATERYKFCVLQWDSETAELVTRAMGDVSDRIG 112
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RPTDNGQ VIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC
Sbjct: 113 RPTDNGQ---------------------VIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 151
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
+KPTIVVLYQDNKDARHVKTYEVALKDKDF+EGPWSQNNLDNGADLLIPVPPPLCGVLII
Sbjct: 152 SKPTIVVLYQDNKDARHVKTYEVALKDKDFLEGPWSQNNLDNGADLLIPVPPPLCGVLII 211
Query: 181 GEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL 240
GEETIVYCSANAFKAIPIRPSITKAYGRVD DGSRYLLGDH GLL LLVITHEKEKVTGL
Sbjct: 212 GEETIVYCSANAFKAIPIRPSITKAYGRVDPDGSRYLLGDHTGLLSLLVITHEKEKVTGL 271
Query: 241 KIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIV 300
KIE LGETSIASTISYLDNA VYIGSSYGDSQLIKLNLQPDAKGSYVE LERYVNLGPIV
Sbjct: 272 KIEPLGETSIASTISYLDNAFVYIGSSYGDSQLIKLNLQPDAKGSYVEGLERYVNLGPIV 331
Query: 301 DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP 360
DFCVVDLERQGQGQVVTCSGAYKDGSLR+VRNGIGINEQASVELQGIKGMWSLRSSTDDP
Sbjct: 332 DFCVVDLERQGQGQVVTCSGAYKDGSLRVVRNGIGINEQASVELQGIKGMWSLRSSTDDP 391
Query: 361 FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLV 420
FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQ QTLFCHDA++NQLVQVTS SVRLV
Sbjct: 392 FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQVQTLFCHDAVHNQLVQVTSSSVRLV 451
Query: 421 SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEI 480
SST+RELRNEW +P GYSVNVATANA+QVLLATGGGHLVYLEIGDGIL EVKHAQLEYEI
Sbjct: 452 SSTTRELRNEWHAPSGYSVNVATANATQVLLATGGGHLVYLEIGDGILQEVKHAQLEYEI 511
Query: 481 SCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEG 540
SCLDINPIGENP++S +AAVGMWTDISVRIFSLPDL+LITKE LGGEIIPRSVLLCAFEG
Sbjct: 512 SCLDINPIGENPNHSHLAAVGMWTDISVRIFSLPDLSLITKEQLGGEIIPRSVLLCAFEG 571
Query: 541 ISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV 600
ISYLLCALGDGHLLNF+LN TGEL DRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV
Sbjct: 572 ISYLLCALGDGHLLNFMLNTSTGELADRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV 631
Query: 601 IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG 660
IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG
Sbjct: 632 IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG 691
Query: 661 EHPRRICHQEQSRTFAICSLK-NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSI 719
EH RRICHQEQSRTFAICSLK N + E+SEMHFVRLLDDQTFEFISTY LDT+EYGC I
Sbjct: 692 EHARRICHQEQSRTFAICSLKYNPASGEDSEMHFVRLLDDQTFEFISTYSLDTYEYGCFI 751
Query: 720 LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAF 779
+SCSFSDD+NVYYCVGTAYVLPEENEPTKGRILVF VEDGKLQLIAEKETKGAVY LNAF
Sbjct: 752 ISCSFSDDNNVYYCVGTAYVLPEENEPTKGRILVFAVEDGKLQLIAEKETKGAVYCLNAF 811
Query: 780 NGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISL 839
NGKLLAAINQKIQLYKW+LRDDGT ELQSECGHHGHILALYVQTRGDFIVVGDLMKSISL
Sbjct: 812 NGKLLAAINQKIQLYKWVLRDDGTHELQSECGHHGHILALYVQTRGDFIVVGDLMKSISL 871
Query: 840 LIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRL 899
LIYKHEEGAIEERARDYNANWMSAVEI+DDDIYLGAEN+FNLFTVRKNSEGATDEERGRL
Sbjct: 872 LIYKHEEGAIEERARDYNANWMSAVEIVDDDIYLGAENSFNLFTVRKNSEGATDEERGRL 931
Query: 900 EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEK 959
EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGT+NGVIGVIASLPHEQY+FLEK
Sbjct: 932 EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTINGVIGVIASLPHEQYVFLEK 991
Query: 960 LQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN 1019
LQ+NLRKVIKGVGGL+HEQWRSFNNEKKTV+A+NFLDGDLIESFLDL+R++MDEISK ++
Sbjct: 992 LQSNLRKVIKGVGGLSHEQWRSFNNEKKTVEARNFLDGDLIESFLDLNRSKMDEISKALD 1051
Query: 1020 VSVEELCKRVEELTRLH 1036
VSVEELCKRVEELTRLH
Sbjct: 1052 VSVEELCKRVEELTRLH 1068
>gi|356525403|ref|XP_003531314.1| PREDICTED: DNA damage-binding protein 1-like isoform 2 [Glycine max]
Length = 1068
Score = 1938 bits (5020), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 950/1037 (91%), Positives = 989/1037 (95%), Gaps = 22/1037 (2%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
MLDVPIYGRIATLELFRPHGEAQD+LFIATERYKFCVLQWD+E+ EL+TRAMGDVSDRIG
Sbjct: 53 MLDVPIYGRIATLELFRPHGEAQDYLFIATERYKFCVLQWDSETGELVTRAMGDVSDRIG 112
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RPTDNGQ VIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC
Sbjct: 113 RPTDNGQ---------------------VIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 151
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
+KPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII
Sbjct: 152 SKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 211
Query: 181 GEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL 240
GEETIVYCSANAFKAIPIRPSITKAYGRVD DGSRYLLGDH GL+ LLVI HEKEKVTGL
Sbjct: 212 GEETIVYCSANAFKAIPIRPSITKAYGRVDPDGSRYLLGDHTGLVSLLVIIHEKEKVTGL 271
Query: 241 KIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIV 300
KIE LGETSIASTISYLDNA VY+GSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIV
Sbjct: 272 KIEPLGETSIASTISYLDNAFVYVGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIV 331
Query: 301 DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP 360
DFCVVDLERQGQGQVVTCSGAYKDGSLR+VRNGIGINEQASVELQGIKGMWSLRSSTDDP
Sbjct: 332 DFCVVDLERQGQGQVVTCSGAYKDGSLRVVRNGIGINEQASVELQGIKGMWSLRSSTDDP 391
Query: 361 FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLV 420
FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQ QTLFCHDA++NQLVQVTS SVRLV
Sbjct: 392 FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQVQTLFCHDAVHNQLVQVTSSSVRLV 451
Query: 421 SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEI 480
SST+R+LRNEW +P GYSVNVATANA+QVLLATGGGHLVYLEIGDGIL EVKHAQLEYEI
Sbjct: 452 SSTTRDLRNEWHAPSGYSVNVATANATQVLLATGGGHLVYLEIGDGILQEVKHAQLEYEI 511
Query: 481 SCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEG 540
SCLDINPIGENP++S +AAVGMWTDISVRIFSLPDL+LITKE LGGEIIPRSVLLCAFEG
Sbjct: 512 SCLDINPIGENPNHSNLAAVGMWTDISVRIFSLPDLSLITKEQLGGEIIPRSVLLCAFEG 571
Query: 541 ISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV 600
ISYLLCALGDGHLLNF+LN TGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV
Sbjct: 572 ISYLLCALGDGHLLNFMLNTSTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV 631
Query: 601 IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG 660
IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG
Sbjct: 632 IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG 691
Query: 661 EHPRRICHQEQSRTFAICSLK-NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSI 719
EH RRICHQEQSRTFAICSLK N + E+SEMHFVRLLDDQTFEFISTY LDT+EYGC I
Sbjct: 692 EHARRICHQEQSRTFAICSLKYNPASGEDSEMHFVRLLDDQTFEFISTYSLDTYEYGCFI 751
Query: 720 LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAF 779
+SCSFSDD+NVYYCVGTAYVLPEENEPTKGRI+VF VEDGKLQLIAEKETKGAVY LNAF
Sbjct: 752 ISCSFSDDNNVYYCVGTAYVLPEENEPTKGRIIVFAVEDGKLQLIAEKETKGAVYCLNAF 811
Query: 780 NGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISL 839
NGKLLAAINQKIQLYKW+LRDDGT ELQSECGHHGHILALYVQTRGDFIVVGDLMKSISL
Sbjct: 812 NGKLLAAINQKIQLYKWVLRDDGTHELQSECGHHGHILALYVQTRGDFIVVGDLMKSISL 871
Query: 840 LIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRL 899
LIYKHEEGAIEERARDYNANWMSAVEI+DDDIYLGAEN+FNLFTVRKNSEGATDEERGRL
Sbjct: 872 LIYKHEEGAIEERARDYNANWMSAVEIVDDDIYLGAENSFNLFTVRKNSEGATDEERGRL 931
Query: 900 EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEK 959
EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGT+NGVIGVIASLPHEQY+FLEK
Sbjct: 932 EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTINGVIGVIASLPHEQYVFLEK 991
Query: 960 LQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN 1019
LQ+NLRKVIKGVGGL+HEQWRSFNNEKKTV+A+NFLDGDLIESFLDL+R++MDEISK ++
Sbjct: 992 LQSNLRKVIKGVGGLSHEQWRSFNNEKKTVEARNFLDGDLIESFLDLNRSKMDEISKAVD 1051
Query: 1020 VSVEELCKRVEELTRLH 1036
VSVEELCKRVEELTRLH
Sbjct: 1052 VSVEELCKRVEELTRLH 1068
>gi|357519461|ref|XP_003630019.1| DNA damage-binding protein [Medicago truncatula]
gi|355524041|gb|AET04495.1| DNA damage-binding protein [Medicago truncatula]
Length = 1171
Score = 1936 bits (5016), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 960/1119 (85%), Positives = 1001/1119 (89%), Gaps = 83/1119 (7%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+LDVP+YGRIATLELFRPHGE QDFLFIATERYKFCVLQWD E SEL+TR+MGDVSDRIG
Sbjct: 53 ILDVPLYGRIATLELFRPHGETQDFLFIATERYKFCVLQWDTEKSELVTRSMGDVSDRIG 112
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC
Sbjct: 113 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 172
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
KPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQN+LDNGADLLIPVPPPLCGVLII
Sbjct: 173 PKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNSLDNGADLLIPVPPPLCGVLII 232
Query: 181 GEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL 240
GEETIVYCSAN FKAIPIR +ITKAYGRVD DGSRYLLGDH GLL LLVITHEKEKVTGL
Sbjct: 233 GEETIVYCSANGFKAIPIRAAITKAYGRVDPDGSRYLLGDHTGLLSLLVITHEKEKVTGL 292
Query: 241 KIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIV 300
KIE LGETSIASTISYLDNA VYIGSSYGDSQLIKLNLQPD KGSYVEVLERYVNLGPIV
Sbjct: 293 KIEPLGETSIASTISYLDNAFVYIGSSYGDSQLIKLNLQPDPKGSYVEVLERYVNLGPIV 352
Query: 301 DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQAS------------------- 341
DFCVVDLERQGQGQVVTCSGAYKDGSLR+VRNGIGINEQ S
Sbjct: 353 DFCVVDLERQGQGQVVTCSGAYKDGSLRVVRNGIGINEQVSPLFSCLSVGSFTQDTTKRK 412
Query: 342 -----------VELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIE 390
+ELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIE
Sbjct: 413 RFFFFLIINNTLELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIE 472
Query: 391 GFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQ-- 448
GFCSQ QTLFCHDA++NQLVQVTS SVRLVSS++REL NEW +P YSVNVATANA+Q
Sbjct: 473 GFCSQVQTLFCHDAVHNQLVQVTSNSVRLVSSSTRELLNEWHAPSDYSVNVATANATQSQ 532
Query: 449 --------------------------------------------------VLLATGGGHL 458
VLLATGGGHL
Sbjct: 533 NHFIFQISSPSLSHGNIMICLSCKDVDNWSSTSSSLSLIYRVLFLLVAASVLLATGGGHL 592
Query: 459 VYLEIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNL 518
VYLEIGDGIL EVKHAQLEYEISCLDINPIGENP+ SQ+AAVGMWTDISVR+FSLP+LNL
Sbjct: 593 VYLEIGDGILQEVKHAQLEYEISCLDINPIGENPNQSQLAAVGMWTDISVRLFSLPELNL 652
Query: 519 ITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPI 578
ITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNF+LN TGELTDRKKVSLGTQPI
Sbjct: 653 ITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFMLNTSTGELTDRKKVSLGTQPI 712
Query: 579 TLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKE 638
TLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKE
Sbjct: 713 TLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKE 772
Query: 639 GELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLK-NQSCAEESEMHFVRLL 697
GELTIGTIDDIQKLHIR+IPLGEH RRICHQEQ+RTFAICSLK N + AEESEMHFVRLL
Sbjct: 773 GELTIGTIDDIQKLHIRTIPLGEHARRICHQEQTRTFAICSLKYNSASAEESEMHFVRLL 832
Query: 698 DDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVE 757
DDQTF+FIS YPLDT+EYGC I+SCSFSDD+NVYYCVGTAYVLPEENEPTKGRILVF VE
Sbjct: 833 DDQTFDFISVYPLDTYEYGCFIISCSFSDDNNVYYCVGTAYVLPEENEPTKGRILVFSVE 892
Query: 758 DGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHIL 817
+GKLQL+AEKETKGAVY LNAFNGKLLAAINQKIQLYKW+LR+DGTRELQSECGHHGHIL
Sbjct: 893 EGKLQLVAEKETKGAVYCLNAFNGKLLAAINQKIQLYKWVLREDGTRELQSECGHHGHIL 952
Query: 818 ALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAEN 877
ALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDD+YLGAEN
Sbjct: 953 ALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDVYLGAEN 1012
Query: 878 NFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFG 937
+FNLFTVRKNSEGATDEERGRLEV GEYHLGEF+NRFRHGSLVMRLPDSDVGQIPTVIFG
Sbjct: 1013 SFNLFTVRKNSEGATDEERGRLEVAGEYHLGEFINRFRHGSLVMRLPDSDVGQIPTVIFG 1072
Query: 938 TVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDG 997
T+NGVIGVIASLPHEQY+FLEKLQ+NLRKVIKGVGGL+HEQWRSFNNEKKTV+A+NFLDG
Sbjct: 1073 TINGVIGVIASLPHEQYVFLEKLQSNLRKVIKGVGGLSHEQWRSFNNEKKTVEARNFLDG 1132
Query: 998 DLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRLH 1036
DLIESFLDL R++MDEISK M VSVEEL KRVEELTRLH
Sbjct: 1133 DLIESFLDLKRSKMDEISKAMEVSVEELAKRVEELTRLH 1171
>gi|297740793|emb|CBI30975.3| unnamed protein product [Vitis vinifera]
Length = 1043
Score = 1920 bits (4974), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 950/1037 (91%), Positives = 974/1037 (93%), Gaps = 47/1037 (4%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAE+SE+ITRAMGDVSDRIG
Sbjct: 53 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAENSEVITRAMGDVSDRIG 112
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC
Sbjct: 113 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 172
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
+KPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPW+QNNLDNGADLLIPVPPPLCGVLII
Sbjct: 173 SKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWAQNNLDNGADLLIPVPPPLCGVLII 232
Query: 181 GEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL 240
GEETIVYCSA+AFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL
Sbjct: 233 GEETIVYCSASAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL 292
Query: 241 KIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIV 300
KIELLGETSIASTISYLDNA VY+GSSYGDSQLIK++LQPDAKGSYVEVLERYVNLGPIV
Sbjct: 293 KIELLGETSIASTISYLDNAFVYVGSSYGDSQLIKIHLQPDAKGSYVEVLERYVNLGPIV 352
Query: 301 DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP 360
DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP
Sbjct: 353 DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP 412
Query: 361 FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLV 420
DTFLVVSFISETRILAMNLEDELEETEIEGFCSQ QTLFCHDA+Y+QLVQVTS SVRLV
Sbjct: 413 HDTFLVVSFISETRILAMNLEDELEETEIEGFCSQVQTLFCHDAVYDQLVQVTSSSVRLV 472
Query: 421 SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEI 480
STSRELRNEWK+P GYSVNVATANA+QVLLATGGGHLVYLEIGDG LTEVKHAQLEY+I
Sbjct: 473 GSTSRELRNEWKAPSGYSVNVATANATQVLLATGGGHLVYLEIGDGTLTEVKHAQLEYDI 532
Query: 481 SCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEG 540
SCLDINPIGENP++SQ+AAVGMWTDISVRIFSLPDLNLITKE+LGGEIIPRSVLLC+FEG
Sbjct: 533 SCLDINPIGENPNFSQLAAVGMWTDISVRIFSLPDLNLITKEYLGGEIIPRSVLLCSFEG 592
Query: 541 ISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV 600
I YLLCALGDGHLLNFLLNM TGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV
Sbjct: 593 IPYLLCALGDGHLLNFLLNMSTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV 652
Query: 601 IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG 660
IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEG+LTIGTIDDIQKLHIRSIPLG
Sbjct: 653 IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGDLTIGTIDDIQKLHIRSIPLG 712
Query: 661 EHPRRICHQEQSRTFAICSLK-NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSI 719
EH RRICHQEQSRTFAICSLK NQS E+SEMHF+RLLDDQTFEFISTYPLDTFEYGCSI
Sbjct: 713 EHARRICHQEQSRTFAICSLKYNQSSTEDSEMHFIRLLDDQTFEFISTYPLDTFEYGCSI 772
Query: 720 LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAF 779
LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAF
Sbjct: 773 LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAF 832
Query: 780 NGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISL 839
NGKLLAAINQKIQLYKWMLRDDGTRELQSE GHHGHILALYVQTRGDFIVVGDLMKSISL
Sbjct: 833 NGKLLAAINQKIQLYKWMLRDDGTRELQSESGHHGHILALYVQTRGDFIVVGDLMKSISL 892
Query: 840 LIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRL 899
LIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFN+FTVRKNSEGATDEERGRL
Sbjct: 893 LIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNIFTVRKNSEGATDEERGRL 952
Query: 900 EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEK 959
EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPH+QY+FLEK
Sbjct: 953 EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHDQYVFLEK 1012
Query: 960 LQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN 1019
LQ NLRKVIKG M
Sbjct: 1013 LQANLRKVIKG----------------------------------------------AMA 1026
Query: 1020 VSVEELCKRVEELTRLH 1036
VSVEELCKRVEELTRLH
Sbjct: 1027 VSVEELCKRVEELTRLH 1043
>gi|15233515|ref|NP_193842.1| DNA damage-binding protein 1b [Arabidopsis thaliana]
gi|73620956|sp|O49552.2|DDB1B_ARATH RecName: Full=DNA damage-binding protein 1b; AltName: Full=UV-damaged
DNA-binding protein 1b; Short=DDB1b
gi|110739453|dbj|BAF01636.1| UV-damaged DNA-binding protein- like [Arabidopsis thaliana]
gi|332659001|gb|AEE84401.1| DNA damage-binding protein 1b [Arabidopsis thaliana]
Length = 1088
Score = 1920 bits (4973), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 925/1036 (89%), Positives = 989/1036 (95%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+LDVP+YGRIAT+ELFRPHGEAQDFLF+ATERYKFCVLQWD ESSELITRAMGDVSDRIG
Sbjct: 53 ILDVPLYGRIATMELFRPHGEAQDFLFVATERYKFCVLQWDYESSELITRAMGDVSDRIG 112
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RPTDNGQIGIIDPDCR+IGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC
Sbjct: 113 RPTDNGQIGIIDPDCRVIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 172
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
KPTI VLYQDNKDARHVKTYEV+LKDK+FVEGPWSQNNLDNGADLLIPVP PLCGVLII
Sbjct: 173 TKPTIAVLYQDNKDARHVKTYEVSLKDKNFVEGPWSQNNLDNGADLLIPVPSPLCGVLII 232
Query: 181 GEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL 240
GEETIVYCSANAFKAIPIRPSITKAYGRVD DGSRYLLGDHAGL+HLLVITHEKEKVTGL
Sbjct: 233 GEETIVYCSANAFKAIPIRPSITKAYGRVDLDGSRYLLGDHAGLIHLLVITHEKEKVTGL 292
Query: 241 KIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIV 300
KIELLGETSIAS+ISYLDNAVV++GSSYGDSQLIKLNLQPDAKGSYVE+LE+YVNLGPIV
Sbjct: 293 KIELLGETSIASSISYLDNAVVFVGSSYGDSQLIKLNLQPDAKGSYVEILEKYVNLGPIV 352
Query: 301 DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP 360
DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL+SS D+
Sbjct: 353 DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLKSSIDEA 412
Query: 361 FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLV 420
FDTFLVVSFISETRILAMN+EDELEETEIEGF S+ QTLFCHDA+YNQLVQVTS SVRLV
Sbjct: 413 FDTFLVVSFISETRILAMNIEDELEETEIEGFLSEVQTLFCHDAVYNQLVQVTSNSVRLV 472
Query: 421 SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEI 480
SST+RELRN+W +P G+SVNVATANASQVLLATGGGHLVYLEIGDG LTEVKH LEYE+
Sbjct: 473 SSTTRELRNKWDAPAGFSVNVATANASQVLLATGGGHLVYLEIGDGTLTEVKHVLLEYEV 532
Query: 481 SCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEG 540
SCLDINPIG+NP+YSQ+AAVGMWTDISVRIF LPDL LITKE LGGEIIPRSVLLCAFEG
Sbjct: 533 SCLDINPIGDNPNYSQLAAVGMWTDISVRIFVLPDLTLITKEELGGEIIPRSVLLCAFEG 592
Query: 541 ISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV 600
ISYLLCALGDGHLLNF L+ G+L DRKKVSLGT+PITLRTFSSK+ THVFAASDRP V
Sbjct: 593 ISYLLCALGDGHLLNFQLDTSCGKLRDRKKVSLGTRPITLRTFSSKSATHVFAASDRPAV 652
Query: 601 IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG 660
IYS+NKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIA+EGELTIGTIDDIQKLHIR+IP+G
Sbjct: 653 IYSNNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAREGELTIGTIDDIQKLHIRTIPIG 712
Query: 661 EHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSIL 720
EH RRICHQEQ+RTFAI L+N+ AEESE HFVRLLD Q+FEF+S+YPLD FE GCSIL
Sbjct: 713 EHARRICHQEQTRTFAISCLRNEPSAEESESHFVRLLDAQSFEFLSSYPLDAFECGCSIL 772
Query: 721 SCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFN 780
SCSF+DD NVYYCVGTAYVLPEENEPTKGRILVFIVE+G+LQLI EKETKGAVYSLNAFN
Sbjct: 773 SCSFTDDKNVYYCVGTAYVLPEENEPTKGRILVFIVEEGRLQLITEKETKGAVYSLNAFN 832
Query: 781 GKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLL 840
GKLLA+INQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFI VGDLMKSISLL
Sbjct: 833 GKLLASINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIAVGDLMKSISLL 892
Query: 841 IYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLE 900
IYKHEEGAIEERARDYNANWM+AVEIL+DDIYLG +N FN+FTV+KN+EGATDEER R+E
Sbjct: 893 IYKHEEGAIEERARDYNANWMTAVEILNDDIYLGTDNCFNIFTVKKNNEGATDEERARME 952
Query: 901 VVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKL 960
VVGEYH+GEFVNRFRHGSLVM+LPDSD+GQIPTVIFGTV+G+IGVIASLP EQY FLEKL
Sbjct: 953 VVGEYHIGEFVNRFRHGSLVMKLPDSDIGQIPTVIFGTVSGMIGVIASLPQEQYAFLEKL 1012
Query: 961 QTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV 1020
QT+LRKVIKGVGGL+HEQWRSFNNEK+T +AK +LDGDLIESFLDLSR +M+EISK M+V
Sbjct: 1013 QTSLRKVIKGVGGLSHEQWRSFNNEKRTAEAKGYLDGDLIESFLDLSRGKMEEISKGMDV 1072
Query: 1021 SVEELCKRVEELTRLH 1036
VEELCKRVEELTRLH
Sbjct: 1073 QVEELCKRVEELTRLH 1088
>gi|62318656|dbj|BAD95136.1| UV-damaged DNA-binding protein- like [Arabidopsis thaliana]
Length = 1088
Score = 1919 bits (4971), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 924/1036 (89%), Positives = 989/1036 (95%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+LDVP+YGRIAT+ELFRPHGEAQDFLF+ATERYKFCVLQWD ESSELITRAMGDVSDRIG
Sbjct: 53 ILDVPLYGRIATMELFRPHGEAQDFLFVATERYKFCVLQWDYESSELITRAMGDVSDRIG 112
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RPTDNGQIGIIDPDCR+IGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC
Sbjct: 113 RPTDNGQIGIIDPDCRVIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 172
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
KPTI VLYQDNKDARHVKTYEV+LKDK+FVEGPWSQNNLDNGADLLIPVP PLCGVLII
Sbjct: 173 TKPTIAVLYQDNKDARHVKTYEVSLKDKNFVEGPWSQNNLDNGADLLIPVPSPLCGVLII 232
Query: 181 GEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL 240
GEETIVYCSANAFKAIPIRPSITKAYGRVD DGSRYLLGDHAGL+HLLVITHEKEKVTGL
Sbjct: 233 GEETIVYCSANAFKAIPIRPSITKAYGRVDLDGSRYLLGDHAGLIHLLVITHEKEKVTGL 292
Query: 241 KIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIV 300
KIELLGETSIAS+ISYLDNAVV++GSSYGDSQLIKLNLQPDAKGSYVE+LE+YVNLGPIV
Sbjct: 293 KIELLGETSIASSISYLDNAVVFVGSSYGDSQLIKLNLQPDAKGSYVEILEKYVNLGPIV 352
Query: 301 DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP 360
DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL+SS D+
Sbjct: 353 DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLKSSIDEA 412
Query: 361 FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLV 420
FDTFLVVSFISETRILAMN+EDELEETEIEGF S+ QTLFCHDA+YNQLVQVTS SVRLV
Sbjct: 413 FDTFLVVSFISETRILAMNIEDELEETEIEGFLSEVQTLFCHDAVYNQLVQVTSNSVRLV 472
Query: 421 SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEI 480
SST+RELRN+W +P G+SVNVATANASQVLLATGGGHLVYLEIGDG LTEVKH LEYE+
Sbjct: 473 SSTTRELRNKWDAPAGFSVNVATANASQVLLATGGGHLVYLEIGDGTLTEVKHVLLEYEV 532
Query: 481 SCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEG 540
SCLDINPIG+NP+YSQ+AAVGMWTDISVRIF LPDL LITKE LGGEIIPRSVLLCAFEG
Sbjct: 533 SCLDINPIGDNPNYSQLAAVGMWTDISVRIFVLPDLTLITKEELGGEIIPRSVLLCAFEG 592
Query: 541 ISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV 600
ISYLLCALGDGHLLNF L+ G+L DRKKVSLGT+PITLRTFSSK+ THVFAASDRP V
Sbjct: 593 ISYLLCALGDGHLLNFQLDTSCGKLRDRKKVSLGTRPITLRTFSSKSATHVFAASDRPAV 652
Query: 601 IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG 660
IYS+NKKLLYSNV+LKEVSHMCPFNSAAFPDSLAIA+EGELTIGTIDDIQKLHIR+IP+G
Sbjct: 653 IYSNNKKLLYSNVSLKEVSHMCPFNSAAFPDSLAIAREGELTIGTIDDIQKLHIRTIPIG 712
Query: 661 EHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSIL 720
EH RRICHQEQ+RTFAI L+N+ AEESE HFVRLLD Q+FEF+S+YPLD FE GCSIL
Sbjct: 713 EHARRICHQEQTRTFAISCLRNEPSAEESESHFVRLLDAQSFEFLSSYPLDAFECGCSIL 772
Query: 721 SCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFN 780
SCSF+DD NVYYCVGTAYVLPEENEPTKGRILVFIVE+G+LQLI EKETKGAVYSLNAFN
Sbjct: 773 SCSFTDDKNVYYCVGTAYVLPEENEPTKGRILVFIVEEGRLQLITEKETKGAVYSLNAFN 832
Query: 781 GKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLL 840
GKLLA+INQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFI VGDLMKSISLL
Sbjct: 833 GKLLASINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIAVGDLMKSISLL 892
Query: 841 IYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLE 900
IYKHEEGAIEERARDYNANWM+AVEIL+DDIYLG +N FN+FTV+KN+EGATDEER R+E
Sbjct: 893 IYKHEEGAIEERARDYNANWMAAVEILNDDIYLGTDNCFNIFTVKKNNEGATDEERARME 952
Query: 901 VVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKL 960
VVGEYH+GEFVNRFRHGSLVM+LPDSD+GQIPTVIFGTV+G+IGVIASLP EQY FLEKL
Sbjct: 953 VVGEYHIGEFVNRFRHGSLVMKLPDSDIGQIPTVIFGTVSGMIGVIASLPQEQYAFLEKL 1012
Query: 961 QTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV 1020
QT+LRKVIKGVGGL+HEQWRSFNNEK+T +AK +LDGDLIESFLDLSR +M+EISK M+V
Sbjct: 1013 QTSLRKVIKGVGGLSHEQWRSFNNEKRTAEAKGYLDGDLIESFLDLSRGKMEEISKGMDV 1072
Query: 1021 SVEELCKRVEELTRLH 1036
VEELCKRVEELTRLH
Sbjct: 1073 QVEELCKRVEELTRLH 1088
>gi|186511557|ref|NP_001118940.1| DNA damage-binding protein 1a [Arabidopsis thaliana]
gi|332657118|gb|AEE82518.1| DNA damage-binding protein 1a [Arabidopsis thaliana]
Length = 1067
Score = 1919 bits (4970), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 931/1036 (89%), Positives = 984/1036 (94%), Gaps = 21/1036 (2%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWD ESSELITRAMGDVSDRIG
Sbjct: 53 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDPESSELITRAMGDVSDRIG 112
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RPTDNGQ VIPFDNKGQLKEAFNIRLEELQVLDIKFL+GC
Sbjct: 113 RPTDNGQ---------------------VIPFDNKGQLKEAFNIRLEELQVLDIKFLFGC 151
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
AKPTI VLYQDNKDARHVKTYEV+LKDKDFVEGPWSQN+LDNGADLLIPVPPPLCGVLII
Sbjct: 152 AKPTIAVLYQDNKDARHVKTYEVSLKDKDFVEGPWSQNSLDNGADLLIPVPPPLCGVLII 211
Query: 181 GEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL 240
GEETIVYCSA+AFKAIPIRPSITKAYGRVD DGSRYLLGDHAG++HLLVITHEKEKVTGL
Sbjct: 212 GEETIVYCSASAFKAIPIRPSITKAYGRVDVDGSRYLLGDHAGMIHLLVITHEKEKVTGL 271
Query: 241 KIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIV 300
KIELLGETSIASTISYLDNAVV++GSSYGDSQL+KLNL PDAKGSYVEVLERY+NLGPIV
Sbjct: 272 KIELLGETSIASTISYLDNAVVFVGSSYGDSQLVKLNLHPDAKGSYVEVLERYINLGPIV 331
Query: 301 DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP 360
DFCVVDLERQGQGQVVTCSGA+KDGSLR+VRNGIGINEQASVELQGIKGMWSL+SS D+
Sbjct: 332 DFCVVDLERQGQGQVVTCSGAFKDGSLRVVRNGIGINEQASVELQGIKGMWSLKSSIDEA 391
Query: 361 FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLV 420
FDTFLVVSFISETRILAMNLEDELEETEIEGF SQ QTLFCHDA+YNQLVQVTS SVRLV
Sbjct: 392 FDTFLVVSFISETRILAMNLEDELEETEIEGFLSQVQTLFCHDAVYNQLVQVTSNSVRLV 451
Query: 421 SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEI 480
SST+RELR+EW +P G++VNVATANASQVLLATGGGHLVYLEIGDG LTEV+HA LEYE+
Sbjct: 452 SSTTRELRDEWHAPAGFTVNVATANASQVLLATGGGHLVYLEIGDGKLTEVQHALLEYEV 511
Query: 481 SCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEG 540
SCLDINPIG+NP+YSQ+AAVGMWTDISVRIFSLP+L LITKE LGGEIIPRSVLLCAFEG
Sbjct: 512 SCLDINPIGDNPNYSQLAAVGMWTDISVRIFSLPELTLITKEQLGGEIIPRSVLLCAFEG 571
Query: 541 ISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV 600
ISYLLCALGDGHLLNF ++ TG+L DRKKVSLGTQPITLRTFSSK+ THVFAASDRPTV
Sbjct: 572 ISYLLCALGDGHLLNFQMDTTTGQLKDRKKVSLGTQPITLRTFSSKSATHVFAASDRPTV 631
Query: 601 IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG 660
IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIA+EGELTIGTIDDIQKLHIR+IPLG
Sbjct: 632 IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAREGELTIGTIDDIQKLHIRTIPLG 691
Query: 661 EHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSIL 720
EH RRICHQEQ+RTF ICSL NQS +EESEMHFVRLLDDQTFEF+STYPLD+FEYGCSIL
Sbjct: 692 EHARRICHQEQTRTFGICSLGNQSNSEESEMHFVRLLDDQTFEFMSTYPLDSFEYGCSIL 751
Query: 721 SCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFN 780
SCSF++D NVYYCVGTAYVLPEENEPTKGRILVFIVEDG+LQLIAEKETKGAVYSLNAFN
Sbjct: 752 SCSFTEDKNVYYCVGTAYVLPEENEPTKGRILVFIVEDGRLQLIAEKETKGAVYSLNAFN 811
Query: 781 GKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLL 840
GKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLL
Sbjct: 812 GKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLL 871
Query: 841 IYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLE 900
+YKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNL TV+KNSEGATDEERGRLE
Sbjct: 872 LYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLLTVKKNSEGATDEERGRLE 931
Query: 901 VVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKL 960
VVGEYHLGEFVNRFRHGSLVMRLPDS++GQIPTVIFGTVNGVIGVIASLP EQY FLEKL
Sbjct: 932 VVGEYHLGEFVNRFRHGSLVMRLPDSEIGQIPTVIFGTVNGVIGVIASLPQEQYTFLEKL 991
Query: 961 QTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV 1020
Q++LRKVIKGVGGL+HEQWRSFNNEK+T +A+NFLDGDLIESFLDLSR +M++ISK+MNV
Sbjct: 992 QSSLRKVIKGVGGLSHEQWRSFNNEKRTAEARNFLDGDLIESFLDLSRNKMEDISKSMNV 1051
Query: 1021 SVEELCKRVEELTRLH 1036
VEELCKRVEELTRLH
Sbjct: 1052 QVEELCKRVEELTRLH 1067
>gi|2911067|emb|CAA17529.1| UV-damaged DNA-binding protein-like [Arabidopsis thaliana]
gi|7268907|emb|CAB79110.1| UV-damaged DNA-binding protein-like [Arabidopsis thaliana]
Length = 1102
Score = 1856 bits (4807), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 904/1050 (86%), Positives = 972/1050 (92%), Gaps = 14/1050 (1%)
Query: 1 MLDVPIYGRIATLELFRPHGEA----QDFLFIATERYKFCVLQWDAESS----------E 46
+LDVP+YGRIAT+ELFRPH FL + + FC+LQ +S
Sbjct: 53 ILDVPLYGRIATMELFRPHVSIVFLLNTFLCLRVKHKTFCLLQLKDINSVFFNGIMSPLS 112
Query: 47 LITRAMGDVSDRIGRPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRL 106
L+ AMGDVSDRIGRPTDNGQIGIIDPDCR+IGLHLYDGLFKVIPFDNKGQLKEAFNIRL
Sbjct: 113 LLQGAMGDVSDRIGRPTDNGQIGIIDPDCRVIGLHLYDGLFKVIPFDNKGQLKEAFNIRL 172
Query: 107 EELQVLDIKFLYGCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADL 166
EELQVLDIKFLYGC KPTI VLYQDNKDARHVKTYEV+LKDK+FVEGPWSQNNLDNGADL
Sbjct: 173 EELQVLDIKFLYGCTKPTIAVLYQDNKDARHVKTYEVSLKDKNFVEGPWSQNNLDNGADL 232
Query: 167 LIPVPPPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLH 226
LIPVP PLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVD DGSRYLLGDHAGL+H
Sbjct: 233 LIPVPSPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDLDGSRYLLGDHAGLIH 292
Query: 227 LLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSY 286
LLVITHEKEKVTGLKIELLGETSIAS+ISYLDNAVV++GSSYGDSQLIKLNLQPDAKGSY
Sbjct: 293 LLVITHEKEKVTGLKIELLGETSIASSISYLDNAVVFVGSSYGDSQLIKLNLQPDAKGSY 352
Query: 287 VEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG 346
VE+LE+YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG
Sbjct: 353 VEILEKYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG 412
Query: 347 IKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIY 406
IKGMWSL+SS D+ FDTFLVVSFISETRILAMN+EDELEETEIEGF S+ QTLFCHDA+Y
Sbjct: 413 IKGMWSLKSSIDEAFDTFLVVSFISETRILAMNIEDELEETEIEGFLSEVQTLFCHDAVY 472
Query: 407 NQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDG 466
NQLVQVTS SVRLVSST+RELRN+W +P G+SVNVATANASQVLLATGGGHLVYLEIGDG
Sbjct: 473 NQLVQVTSNSVRLVSSTTRELRNKWDAPAGFSVNVATANASQVLLATGGGHLVYLEIGDG 532
Query: 467 ILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGG 526
LTEVKH LEYE+SCLDINPIG+NP+YSQ+AAVGMWTDISVRIF LPDL LITKE LGG
Sbjct: 533 TLTEVKHVLLEYEVSCLDINPIGDNPNYSQLAAVGMWTDISVRIFVLPDLTLITKEELGG 592
Query: 527 EIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSK 586
EIIPRSVLLCAFEGISYLLCALGDGHLLNF L+ G+L DRKKVSLGT+PITLRTFSSK
Sbjct: 593 EIIPRSVLLCAFEGISYLLCALGDGHLLNFQLDTSCGKLRDRKKVSLGTRPITLRTFSSK 652
Query: 587 NTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTI 646
+ THVFAASDRP VIYS+NKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIA+EGELTIGTI
Sbjct: 653 SATHVFAASDRPAVIYSNNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAREGELTIGTI 712
Query: 647 DDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFIS 706
DDIQKLHIR+IP+GEH RRICHQEQ+RTFAI L+N+ AEESE HFVRLLD Q+FEF+S
Sbjct: 713 DDIQKLHIRTIPIGEHARRICHQEQTRTFAISCLRNEPSAEESESHFVRLLDAQSFEFLS 772
Query: 707 TYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAE 766
+YPLD FE GCSILSCSF+DD NVYYCVGTAYVLPEENEPTKGRILVFIVE+G+LQLI E
Sbjct: 773 SYPLDAFECGCSILSCSFTDDKNVYYCVGTAYVLPEENEPTKGRILVFIVEEGRLQLITE 832
Query: 767 KETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGD 826
KETKGAVYSLNAFNGKLLA+INQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGD
Sbjct: 833 KETKGAVYSLNAFNGKLLASINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGD 892
Query: 827 FIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRK 886
FI VGDLMKSISLLIYKHEEGAIEERARDYNANWM+AVEIL+DDIYLG +N FN+FTV+K
Sbjct: 893 FIAVGDLMKSISLLIYKHEEGAIEERARDYNANWMTAVEILNDDIYLGTDNCFNIFTVKK 952
Query: 887 NSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVI 946
N+EGATDEER R+EVVGEYH+GEFVNRFRHGSLVM+LPDSD+GQIPTVIFGTV+G+IGVI
Sbjct: 953 NNEGATDEERARMEVVGEYHIGEFVNRFRHGSLVMKLPDSDIGQIPTVIFGTVSGMIGVI 1012
Query: 947 ASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDL 1006
ASLP EQY FLEKLQT+LRKVIKGVGGL+HEQWRSFNNEK+T +AK +LDGDLIESFLDL
Sbjct: 1013 ASLPQEQYAFLEKLQTSLRKVIKGVGGLSHEQWRSFNNEKRTAEAKGYLDGDLIESFLDL 1072
Query: 1007 SRTRMDEISKTMNVSVEELCKRVEELTRLH 1036
SR +M+EISK M+V VEELCKRVEELTRLH
Sbjct: 1073 SRGKMEEISKGMDVQVEELCKRVEELTRLH 1102
>gi|115465791|ref|NP_001056495.1| Os05g0592400 [Oryza sativa Japonica Group]
gi|48475231|gb|AAT44300.1| putative DNA damage binding protein 1 [Oryza sativa Japonica Group]
gi|113580046|dbj|BAF18409.1| Os05g0592400 [Oryza sativa Japonica Group]
gi|215694552|dbj|BAG89545.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222632766|gb|EEE64898.1| hypothetical protein OsJ_19757 [Oryza sativa Japonica Group]
Length = 1090
Score = 1815 bits (4701), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 877/1038 (84%), Positives = 959/1038 (92%), Gaps = 2/1038 (0%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
M+DVPIYGRIATLELFRPH E QDFLFIATERYKFCVLQWD E SEL+TRAMGDVSDRIG
Sbjct: 53 MIDVPIYGRIATLELFRPHNETQDFLFIATERYKFCVLQWDGEKSELLTRAMGDVSDRIG 112
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC
Sbjct: 113 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 172
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
KPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGA LLIPVP PL GV+II
Sbjct: 173 VKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGAGLLIPVPAPLGGVIII 232
Query: 181 GEETIVYCSANA-FKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTG 239
GEETIVYC+AN+ F+AIPI+ SI +AYGRVD DGSRYLLGD+AG+LHLLV+THE+E+VTG
Sbjct: 233 GEETIVYCNANSTFRAIPIKQSIIRAYGRVDPDGSRYLLGDNAGILHLLVLTHERERVTG 292
Query: 240 LKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPI 299
LKIE LGETSIAS+ISYLDN VVY+GS +GDSQL+KLNLQ D GSYVEVLERYVNLGPI
Sbjct: 293 LKIEYLGETSIASSISYLDNGVVYVGSRFGDSQLVKLNLQADPNGSYVEVLERYVNLGPI 352
Query: 300 VDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDD 359
VDFCVVDL+RQGQGQVVTCSGA+KDGSLR+VRNGIGINEQASVELQGIKG+WSL+SS +D
Sbjct: 353 VDFCVVDLDRQGQGQVVTCSGAFKDGSLRVVRNGIGINEQASVELQGIKGLWSLKSSFND 412
Query: 360 PFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRL 419
P+D +LVVSFISETR LAMN+EDELEETEIEGF +QTQTLFC +AI + L+QVT+ SVRL
Sbjct: 413 PYDMYLVVSFISETRFLAMNMEDELEETEIEGFDAQTQTLFCQNAINDLLIQVTANSVRL 472
Query: 420 VSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYE 479
VS TSREL ++W +P G+SVNVA+ANASQVLLATGGGHLVYLEI D L EVKH QLE+E
Sbjct: 473 VSCTSRELVDQWNAPEGFSVNVASANASQVLLATGGGHLVYLEIKDSKLVEVKHIQLEHE 532
Query: 480 ISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFE 539
ISC+D+NPIGENP YS +AAVGMWTDISVRI SLPDL LI KE+LGGEI+PRSVLLC E
Sbjct: 533 ISCVDLNPIGENPQYSSLAAVGMWTDISVRILSLPDLELIRKENLGGEIVPRSVLLCTLE 592
Query: 540 GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPT 599
G+SYLLCALGDGHL +FLLN TGELTDRKKVSLGTQPI+LRTFSSK TTHVFA+SDRPT
Sbjct: 593 GVSYLLCALGDGHLFSFLLNASTGELTDRKKVSLGTQPISLRTFSSKGTTHVFASSDRPT 652
Query: 600 VIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPL 659
VIYSSNKKLLYSNVNLKEV+HMCPFN+AA PDSLAIAKEGEL+IGTIDDIQKLHIR+IPL
Sbjct: 653 VIYSSNKKLLYSNVNLKEVNHMCPFNTAAIPDSLAIAKEGELSIGTIDDIQKLHIRTIPL 712
Query: 660 GEHPRRICHQEQSRTFAICSLK-NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCS 718
E RRICHQEQSRT A CS K NQ+ EESE HFVRLLD QTFEF+S Y LD +E+GCS
Sbjct: 713 NEQARRICHQEQSRTLAFCSFKHNQTSIEESETHFVRLLDHQTFEFLSIYQLDQYEHGCS 772
Query: 719 ILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNA 778
I+SCSFSDD+NVYYCVGTAYVLPEENEP+KGRILVF VEDG+LQLI EKETKGAVYSLNA
Sbjct: 773 IISCSFSDDNNVYYCVGTAYVLPEENEPSKGRILVFAVEDGRLQLIVEKETKGAVYSLNA 832
Query: 779 FNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSIS 838
FNGKLLAAINQKIQLYKWMLR+DG+ ELQSECGHHGHILALY QTRGDFIVVGDLMKSIS
Sbjct: 833 FNGKLLAAINQKIQLYKWMLREDGSHELQSECGHHGHILALYTQTRGDFIVVGDLMKSIS 892
Query: 839 LLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGR 898
LL+YKHEE AIEE ARDYNANWMSAVE+LDD+IY+GAENN+N+FTVRKNS+ ATDEERGR
Sbjct: 893 LLVYKHEESAIEELARDYNANWMSAVEMLDDEIYIGAENNYNIFTVRKNSDAATDEERGR 952
Query: 899 LEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLE 958
LEVVGEYHLGEFVNR RHGSLVMRLPDS++GQIPTVIFGT+NGVIG+IASLPHEQY+FLE
Sbjct: 953 LEVVGEYHLGEFVNRLRHGSLVMRLPDSEMGQIPTVIFGTINGVIGIIASLPHEQYVFLE 1012
Query: 959 KLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTM 1018
KLQ+ L K IKGVG L+HEQWRSF+N+KKT +A+NFLDGDLIESFLDLSR +M+E++K M
Sbjct: 1013 KLQSTLVKFIKGVGNLSHEQWRSFHNDKKTSEARNFLDGDLIESFLDLSRNKMEEVAKGM 1072
Query: 1019 NVSVEELCKRVEELTRLH 1036
V VEEL KRVEELTRLH
Sbjct: 1073 GVPVEELSKRVEELTRLH 1090
>gi|413946716|gb|AFW79365.1| hypothetical protein ZEAMMB73_562969 [Zea mays]
Length = 1089
Score = 1814 bits (4698), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 872/1037 (84%), Positives = 956/1037 (92%), Gaps = 1/1037 (0%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
MLDVPIYGRIAT+ELFRPH E QDFLFIATERYKFCVLQWDAE SELITRAMGDVSDRIG
Sbjct: 53 MLDVPIYGRIATIELFRPHNETQDFLFIATERYKFCVLQWDAEKSELITRAMGDVSDRIG 112
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL+GC
Sbjct: 113 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLHGC 172
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
AKPTIVVLYQDNKD RHVKTYEVALKDKDFVEGPWSQNN+DNGA LLIPVP PL GV+II
Sbjct: 173 AKPTIVVLYQDNKDVRHVKTYEVALKDKDFVEGPWSQNNVDNGAGLLIPVPAPLGGVIII 232
Query: 181 GEETIVYCSANA-FKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTG 239
GEE IVYC+AN+ FKAIPI+ SI +AYGRVD DGSRYLLGD+ G+LHLLV+THE+E+VTG
Sbjct: 233 GEEQIVYCNANSTFKAIPIKQSIIRAYGRVDPDGSRYLLGDNTGILHLLVLTHERERVTG 292
Query: 240 LKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPI 299
LKIE LGETSIAS+ISYLDN VVY+GS +GDSQL+KLNLQ DA GS+VE+LERYVNLGPI
Sbjct: 293 LKIEYLGETSIASSISYLDNGVVYVGSRFGDSQLVKLNLQADASGSFVEILERYVNLGPI 352
Query: 300 VDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDD 359
VDFCVVDL+RQGQGQVVTCSGA+KDGSLR+VRNGIGINEQASVELQGIKG+WSL+SS +D
Sbjct: 353 VDFCVVDLDRQGQGQVVTCSGAFKDGSLRVVRNGIGINEQASVELQGIKGLWSLKSSIND 412
Query: 360 PFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRL 419
PFD +LVVSFISETR LAMN+EDELEETEIEGF +QTQTLFC +AI + L+QVT+ SVRL
Sbjct: 413 PFDMYLVVSFISETRFLAMNMEDELEETEIEGFDAQTQTLFCQNAINDLLIQVTANSVRL 472
Query: 420 VSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYE 479
VS TSREL N+W +P G+SVNVA+ANASQVLLATGGGHLVYLEI D L EVKHAQLE+E
Sbjct: 473 VSCTSRELVNQWNAPAGFSVNVASANASQVLLATGGGHLVYLEIRDAKLVEVKHAQLEHE 532
Query: 480 ISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFE 539
ISCLD+NPIGENP YS +AAVGMWTDISVRIFSLPDL LI KE+LGGEI+PRSVLLC E
Sbjct: 533 ISCLDLNPIGENPQYSSLAAVGMWTDISVRIFSLPDLELIRKENLGGEIVPRSVLLCTLE 592
Query: 540 GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPT 599
G+SYLLCALGDG+L +FLLN TGELTDRKKV+LGTQPI+LRTFSSK TTHVFA+SDRPT
Sbjct: 593 GVSYLLCALGDGNLFSFLLNASTGELTDRKKVTLGTQPISLRTFSSKGTTHVFASSDRPT 652
Query: 600 VIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPL 659
VIYSSNKKLLYSNVNLKEV+HMCPFN+AAFPDSLAIAKEGEL+IGTIDDIQKLHIR+IPL
Sbjct: 653 VIYSSNKKLLYSNVNLKEVNHMCPFNTAAFPDSLAIAKEGELSIGTIDDIQKLHIRTIPL 712
Query: 660 GEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSI 719
E RRICHQEQSRT A CS K EESE H +RLLD QTFE + YPLD +E GCSI
Sbjct: 713 NEQARRICHQEQSRTLAFCSFKYNQSVEESETHLIRLLDHQTFESLCVYPLDQYECGCSI 772
Query: 720 LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAF 779
+SCSF+DDSNVYYCVGTAYV+PEENEPTKGRILVF VEDG LQLI EKETKGAVYSLNAF
Sbjct: 773 ISCSFADDSNVYYCVGTAYVIPEENEPTKGRILVFAVEDGSLQLIVEKETKGAVYSLNAF 832
Query: 780 NGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISL 839
NGKLLAAINQKIQLYKWM R+DG+ ELQSECGHHGHILALY QTRGDFIVVGDLMKSISL
Sbjct: 833 NGKLLAAINQKIQLYKWMSREDGSHELQSECGHHGHILALYTQTRGDFIVVGDLMKSISL 892
Query: 840 LIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRL 899
L+YKHEE AIEERARDYNANWM+AVE+LDD++Y+GAEN++NLFTVRKNS+ ATD+ER RL
Sbjct: 893 LVYKHEESAIEERARDYNANWMTAVEMLDDEVYVGAENSYNLFTVRKNSDAATDDERARL 952
Query: 900 EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEK 959
EVVGEYHLGEFVNRFRHGSLVMRLPDSD+GQIPTVIFGT+NGVIG+IASLPH+QY+FLEK
Sbjct: 953 EVVGEYHLGEFVNRFRHGSLVMRLPDSDIGQIPTVIFGTINGVIGIIASLPHDQYIFLEK 1012
Query: 960 LQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN 1019
LQ+ L K IKGVG L+HEQWRSF+N+KKT +A+NFLDGDLIESFLDLSR++M+E+SK M
Sbjct: 1013 LQSTLVKYIKGVGNLSHEQWRSFHNDKKTAEARNFLDGDLIESFLDLSRSKMEEVSKAMG 1072
Query: 1020 VSVEELCKRVEELTRLH 1036
V VEEL KRVEELTRLH
Sbjct: 1073 VPVEELSKRVEELTRLH 1089
>gi|12082087|dbj|BAB20761.1| UV-damaged DNA binding protein [Oryza sativa Japonica Group]
Length = 1090
Score = 1808 bits (4683), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 875/1038 (84%), Positives = 957/1038 (92%), Gaps = 2/1038 (0%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
M+DVPIYGRIATLELFRPH E QDFLFIATERYKFCVLQWD E SEL+TRAMGDVSDRIG
Sbjct: 53 MIDVPIYGRIATLELFRPHNETQDFLFIATERYKFCVLQWDGEKSELLTRAMGDVSDRIG 112
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC
Sbjct: 113 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 172
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
KPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGA LLIPVP PL GV+II
Sbjct: 173 VKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGAGLLIPVPAPLGGVIII 232
Query: 181 GEETIVYCSANA-FKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTG 239
GEETIVYC+AN+ F+AIPI+ SI +AYGRVD DGSRYLLGD+AG+LHLLV+THE+E+VTG
Sbjct: 233 GEETIVYCNANSTFRAIPIKQSIIRAYGRVDPDGSRYLLGDNAGILHLLVLTHERERVTG 292
Query: 240 LKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPI 299
LKIE LGETSIAS+ISYLDN VVY+GS +GDSQL+KLNLQ D GSYVEVLERYVNLGPI
Sbjct: 293 LKIEYLGETSIASSISYLDNGVVYVGSRFGDSQLVKLNLQADPNGSYVEVLERYVNLGPI 352
Query: 300 VDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDD 359
VDFCVVDL+RQGQGQVVTCSGA+KDGSLR+VRNGIGINEQASVELQGIKG+WSL+SS +D
Sbjct: 353 VDFCVVDLDRQGQGQVVTCSGAFKDGSLRVVRNGIGINEQASVELQGIKGLWSLKSSFND 412
Query: 360 PFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRL 419
P+D +LVVSFISETR LAMN+EDELEETEIEGF +QTQTLFC +AI + L+QVT+ SVRL
Sbjct: 413 PYDMYLVVSFISETRFLAMNMEDELEETEIEGFDAQTQTLFCQNAINDLLIQVTANSVRL 472
Query: 420 VSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYE 479
VS TSREL ++W +P G+SVNVA+ANASQVLLATGGGHLVYLEI D L EVKH QLE+E
Sbjct: 473 VSCTSRELVDQWNAPEGFSVNVASANASQVLLATGGGHLVYLEIKDSKLVEVKHIQLEHE 532
Query: 480 ISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFE 539
ISC+D+NPIGENP YS +AAVGMWTDISVRI S PDL LI KE+LGGEI+PRSVLLC E
Sbjct: 533 ISCVDLNPIGENPQYSSLAAVGMWTDISVRILSFPDLELIRKENLGGEIVPRSVLLCTLE 592
Query: 540 GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPT 599
G+SYLLCALGDGHL +FLLN TGELTDRKKVSLGTQPI+LRTFSSK TTHVFA+SDRPT
Sbjct: 593 GVSYLLCALGDGHLFSFLLNASTGELTDRKKVSLGTQPISLRTFSSKGTTHVFASSDRPT 652
Query: 600 VIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPL 659
VIYSSNKKLLYSNVNLKEV+HMCPFN+AA PDSLAIAKEGEL+IGTIDDIQKLHIR+IPL
Sbjct: 653 VIYSSNKKLLYSNVNLKEVNHMCPFNTAAIPDSLAIAKEGELSIGTIDDIQKLHIRTIPL 712
Query: 660 GEHPRRICHQEQSRTFAICSLK-NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCS 718
RRICHQEQSRT A CS K NQ+ EESE HFVRLLD QTFEF+S Y LD +E+GCS
Sbjct: 713 DHQTRRICHQEQSRTLAFCSFKHNQTSIEESETHFVRLLDHQTFEFLSIYQLDQYEHGCS 772
Query: 719 ILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNA 778
I+SCSFSDD+NVYYCVGTAYVLPEENEP+KGRILVF VEDG+LQLI EKETKGAVYSLNA
Sbjct: 773 IISCSFSDDNNVYYCVGTAYVLPEENEPSKGRILVFAVEDGRLQLIVEKETKGAVYSLNA 832
Query: 779 FNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSIS 838
FNGKLLAAINQKIQLYKWMLR+DG+ ELQSECGHHGHILALY QTRGDFIVVGDLMKSIS
Sbjct: 833 FNGKLLAAINQKIQLYKWMLREDGSHELQSECGHHGHILALYTQTRGDFIVVGDLMKSIS 892
Query: 839 LLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGR 898
LL+YKHEE AIEE ARDYNANWMSAVE+LDD+IY+GAENN+N+FTVRKNS+ ATDEERGR
Sbjct: 893 LLVYKHEESAIEELARDYNANWMSAVEMLDDEIYIGAENNYNIFTVRKNSDAATDEERGR 952
Query: 899 LEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLE 958
LEVVGEYHLGEF NRFRHGSLVMRLPDS++GQIPTVIFGT+NGVIG+IASLPHEQY+FLE
Sbjct: 953 LEVVGEYHLGEFGNRFRHGSLVMRLPDSEMGQIPTVIFGTINGVIGIIASLPHEQYVFLE 1012
Query: 959 KLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTM 1018
KLQ+ L K IKGVG L+HEQWRSF+N+KKT +A+NFLDGDLIESFLDLSR +M+E++K M
Sbjct: 1013 KLQSTLVKFIKGVGNLSHEQWRSFHNDKKTSEARNFLDGDLIESFLDLSRNKMEEVAKGM 1072
Query: 1019 NVSVEELCKRVEELTRLH 1036
V VEEL KRVEELTRLH
Sbjct: 1073 GVPVEELSKRVEELTRLH 1090
>gi|255571318|ref|XP_002526608.1| DNA repair protein xp-E, putative [Ricinus communis]
gi|223534048|gb|EEF35767.1| DNA repair protein xp-E, putative [Ricinus communis]
Length = 1033
Score = 1796 bits (4651), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 906/1036 (87%), Positives = 939/1036 (90%), Gaps = 55/1036 (5%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
MLDVPIYGRIATLELF
Sbjct: 53 MLDVPIYGRIATLELF-------------------------------------------- 68
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RP Q D I Y F V+ +D + RLEELQVLDIKFLYGC
Sbjct: 69 RPHGEAQ------DFLFIATERYK--FCVLQWDAE---TSELITRLEELQVLDIKFLYGC 117
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
+KPTIVVLYQDNKDARHVKTYEVALKDKDF EGPW+QNNLDNGADLLIPVPPPLCGVLII
Sbjct: 118 SKPTIVVLYQDNKDARHVKTYEVALKDKDFGEGPWAQNNLDNGADLLIPVPPPLCGVLII 177
Query: 181 GEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL 240
GEETIVYCSANAFKAIPIRPSIT+AYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL
Sbjct: 178 GEETIVYCSANAFKAIPIRPSITRAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL 237
Query: 241 KIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIV 300
KIELLGETSIASTISYLDNAVVYIGSSYGDSQL+KLNLQPDAKGSYVEVLE YVNLGPIV
Sbjct: 238 KIELLGETSIASTISYLDNAVVYIGSSYGDSQLVKLNLQPDAKGSYVEVLESYVNLGPIV 297
Query: 301 DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP 360
DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP
Sbjct: 298 DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP 357
Query: 361 FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLV 420
FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQ QTLFCH A+YNQLVQVTS SVRLV
Sbjct: 358 FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQVQTLFCHYAVYNQLVQVTSSSVRLV 417
Query: 421 SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEI 480
SST+REL+NEW +P GYS+NVATANA+QVLLATGGGHLVYLEIGDG LT KHAQLE EI
Sbjct: 418 SSTTRELQNEWHAPAGYSINVATANATQVLLATGGGHLVYLEIGDGTLTHTKHAQLECEI 477
Query: 481 SCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEG 540
SCLDINPIGENP+YSQ+AAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLC+FEG
Sbjct: 478 SCLDINPIGENPNYSQLAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCSFEG 537
Query: 541 ISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV 600
ISYLLCALGDGHLLNFLLN+ TGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV
Sbjct: 538 ISYLLCALGDGHLLNFLLNLNTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV 597
Query: 601 IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG 660
IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG
Sbjct: 598 IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG 657
Query: 661 EHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSIL 720
EH RRICHQEQSRTFA+CSLKNQ+ AEESE HF+RLLDDQTFEFISTY LD FEYGCSIL
Sbjct: 658 EHARRICHQEQSRTFAVCSLKNQASAEESETHFIRLLDDQTFEFISTYQLDPFEYGCSIL 717
Query: 721 SCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFN 780
SCSFSDD+N+YYCVGTAYV+PEENEPTKGRILVF+VEDGKLQ+I EKETKGAVYSLN+FN
Sbjct: 718 SCSFSDDNNLYYCVGTAYVMPEENEPTKGRILVFLVEDGKLQVITEKETKGAVYSLNSFN 777
Query: 781 GKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLL 840
GKLLAAINQKIQLYKWMLRDDG+RELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLL
Sbjct: 778 GKLLAAINQKIQLYKWMLRDDGSRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLL 837
Query: 841 IYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLE 900
IYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLE
Sbjct: 838 IYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLE 897
Query: 901 VVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKL 960
VVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQY+FLEKL
Sbjct: 898 VVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYIFLEKL 957
Query: 961 QTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV 1020
Q+NLR+VIKGVGGL+HEQWRSFNNEKKTV+AKNFLDGDLIESFLDLSR RMDEISK + V
Sbjct: 958 QSNLRRVIKGVGGLSHEQWRSFNNEKKTVEAKNFLDGDLIESFLDLSRNRMDEISKAIGV 1017
Query: 1021 SVEELCKRVEELTRLH 1036
SVEELCKRVEELTRLH
Sbjct: 1018 SVEELCKRVEELTRLH 1033
>gi|357132340|ref|XP_003567788.1| PREDICTED: DNA damage-binding protein 1a-like [Brachypodium
distachyon]
Length = 1090
Score = 1795 bits (4649), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 864/1038 (83%), Positives = 959/1038 (92%), Gaps = 2/1038 (0%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+LDVPIYGRIAT+ELFRP EAQDFLFIATERYKFCVLQWDAE SELITR++GDVSDRIG
Sbjct: 53 LLDVPIYGRIATIELFRPPTEAQDFLFIATERYKFCVLQWDAEKSELITRSVGDVSDRIG 112
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC
Sbjct: 113 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 172
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
+PTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGA LLIPVP PL GV+II
Sbjct: 173 LRPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGAGLLIPVPAPLGGVIII 232
Query: 181 GEETIVYCSANA-FKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTG 239
GEETIVYC+AN+ FKAIPI+ SI +AYGRVD DGSRYLLGD+ G+LHLLV+T E+E+VTG
Sbjct: 233 GEETIVYCNANSTFKAIPIKQSIIRAYGRVDPDGSRYLLGDNTGILHLLVLTQERERVTG 292
Query: 240 LKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPI 299
LKIE LGETS+AS+ISYLDN VVY+GS +GDSQL+KLNLQ DA GS+VEVLERYVNLGPI
Sbjct: 293 LKIEHLGETSVASSISYLDNGVVYVGSRFGDSQLVKLNLQADATGSFVEVLERYVNLGPI 352
Query: 300 VDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDD 359
VDFCVVDL+RQGQGQVVTCSGA+KDGS+R+VRNGIGINEQASVELQGIKG+WSL+SS +D
Sbjct: 353 VDFCVVDLDRQGQGQVVTCSGAFKDGSIRVVRNGIGINEQASVELQGIKGLWSLKSSFND 412
Query: 360 PFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRL 419
P+DTFLVVSFISETR LAMN+EDELEETEIEGF +Q QTLFC +AI + L+QVT+ SVRL
Sbjct: 413 PYDTFLVVSFISETRFLAMNMEDELEETEIEGFDAQIQTLFCQNAISDLLIQVTANSVRL 472
Query: 420 VSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYE 479
VS SREL + W +P G+SVNVA+ANASQVLLATGGGHLVYLEI D L EVKHAQLE+E
Sbjct: 473 VSCASRELVDHWNAPDGFSVNVASANASQVLLATGGGHLVYLEIRDAKLVEVKHAQLEHE 532
Query: 480 ISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFE 539
ISC+D+NPIGENP YS +AA+GMWTDISV + SLPDL LI KE+LGGEI+PRSVLLC E
Sbjct: 533 ISCVDLNPIGENPQYSSLAAIGMWTDISVSLLSLPDLELIRKENLGGEIVPRSVLLCTLE 592
Query: 540 GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPT 599
G+SYLLCALGDGHL +FLLN+ TGELTD+KKVSLGTQPI+LRTFSSK TTHVFA+SDRPT
Sbjct: 593 GVSYLLCALGDGHLFSFLLNVSTGELTDKKKVSLGTQPISLRTFSSKGTTHVFASSDRPT 652
Query: 600 VIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPL 659
VIYSSNKKLLYSNVNLKEV+HMCPFN+AAFPDSLAIAKEGEL+IGTIDDIQKLHIR+IPL
Sbjct: 653 VIYSSNKKLLYSNVNLKEVNHMCPFNTAAFPDSLAIAKEGELSIGTIDDIQKLHIRTIPL 712
Query: 660 GEHPRRICHQEQSRTFAICSLK-NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCS 718
E RRICHQEQSRT A CS K N + EESE HF+RLLD QTFEF+ST+PLD +E GCS
Sbjct: 713 NEQARRICHQEQSRTLAFCSFKYNPNSMEESEAHFIRLLDHQTFEFLSTHPLDQYECGCS 772
Query: 719 ILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNA 778
++SCSFSDD+N YYCVGTAYVLPEENEPTKGRILVF VEDG+LQLI EKETKGAVYSLNA
Sbjct: 773 MISCSFSDDNNFYYCVGTAYVLPEENEPTKGRILVFAVEDGRLQLIVEKETKGAVYSLNA 832
Query: 779 FNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSIS 838
FNGKLLAAINQKIQLYKWM R+DG+ ELQSECGHHGHILAL+ QTRGDFIVVGDLMKSIS
Sbjct: 833 FNGKLLAAINQKIQLYKWMTREDGSHELQSECGHHGHILALFTQTRGDFIVVGDLMKSIS 892
Query: 839 LLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGR 898
LL+YKHEE AIEE ARDYNANWM+AVE++DDDIY+GAEN++NLFTVRKNS+ ATDEERGR
Sbjct: 893 LLVYKHEESAIEELARDYNANWMTAVEMIDDDIYVGAENSYNLFTVRKNSDAATDEERGR 952
Query: 899 LEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLE 958
LEVVGEYHLGEFVNRFRHGSLVMRLPD+++GQIPTVIFGT+NGVIG+IASLPH+QY+FLE
Sbjct: 953 LEVVGEYHLGEFVNRFRHGSLVMRLPDTEMGQIPTVIFGTINGVIGIIASLPHDQYVFLE 1012
Query: 959 KLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTM 1018
KLQ+ L K IKGVG L+H+QWRSF+NEKKT +A+NFLDGDLIESFLDL+R++M+E+SK M
Sbjct: 1013 KLQSILGKFIKGVGSLSHDQWRSFHNEKKTAEARNFLDGDLIESFLDLNRSKMEEVSKGM 1072
Query: 1019 NVSVEELCKRVEELTRLH 1036
VSVE L KRVEELTRLH
Sbjct: 1073 GVSVENLSKRVEELTRLH 1090
>gi|218197365|gb|EEC79792.1| hypothetical protein OsI_21216 [Oryza sativa Indica Group]
Length = 1089
Score = 1779 bits (4609), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 863/1037 (83%), Positives = 947/1037 (91%), Gaps = 1/1037 (0%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
M+DVPIYGRIATLELFRPH E QDFLFIATERYKFCVLQWD E SEL+TRAMGDVSDRIG
Sbjct: 53 MIDVPIYGRIATLELFRPHNETQDFLFIATERYKFCVLQWDGEKSELLTRAMGDVSDRIG 112
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC
Sbjct: 113 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 172
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
KPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGA LLIPVP PL GV+II
Sbjct: 173 VKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGAGLLIPVPAPLGGVIII 232
Query: 181 GEETIVYCSANA-FKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTG 239
GEETIVYC+AN+ F+AIPI+ SI +AYGRVD DGSRYLLGD+AG+LHLLV+THE+E+VTG
Sbjct: 233 GEETIVYCNANSTFRAIPIKQSIIRAYGRVDPDGSRYLLGDNAGILHLLVLTHERERVTG 292
Query: 240 LKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPI 299
LKIE LGETSIAS+ISYLDN VVY+GS +GDSQL+KLNLQ D GSYVEVLERYVNLGPI
Sbjct: 293 LKIEYLGETSIASSISYLDNGVVYVGSRFGDSQLVKLNLQADPNGSYVEVLERYVNLGPI 352
Query: 300 VDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDD 359
VDFCVVDL+RQGQGQVVTCSGA+KDGSLR+VRNGIGINEQASVELQGIKG+WSL+SS +D
Sbjct: 353 VDFCVVDLDRQGQGQVVTCSGAFKDGSLRVVRNGIGINEQASVELQGIKGLWSLKSSFND 412
Query: 360 PFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRL 419
P+D +LVVSFISETR LAMN+EDELEETEIEGF +QTQTLFC +AI + L+QVT+ SVRL
Sbjct: 413 PYDMYLVVSFISETRFLAMNMEDELEETEIEGFDAQTQTLFCQNAINDLLIQVTANSVRL 472
Query: 420 VSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYE 479
VS TSREL ++W +P G+SVNVA+ANASQVLLATGGGHLVYLEI D L EVKH QLE+E
Sbjct: 473 VSCTSRELVDQWNAPEGFSVNVASANASQVLLATGGGHLVYLEIKDSKLVEVKHIQLEHE 532
Query: 480 ISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFE 539
ISC+D+NPIGENP YS +AAVGMWTDISVRI SLPDL LI KE+LGGEI+PRSVLLC E
Sbjct: 533 ISCVDLNPIGENPQYSSLAAVGMWTDISVRILSLPDLELIRKENLGGEIVPRSVLLCTLE 592
Query: 540 GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPT 599
G+SYLLCALGDGHL +FLLN TGELTDRKKVSLGTQPI+LRTFSSK TTHVFA+SDRPT
Sbjct: 593 GVSYLLCALGDGHLFSFLLNASTGELTDRKKVSLGTQPISLRTFSSKGTTHVFASSDRPT 652
Query: 600 VIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPL 659
VIYSSNKKLLYSNVNLKEV+HMCPFN+AA PDSLAIAKEGEL+IGTIDDIQKLHIR+IPL
Sbjct: 653 VIYSSNKKLLYSNVNLKEVNHMCPFNTAAIPDSLAIAKEGELSIGTIDDIQKLHIRTIPL 712
Query: 660 GEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSI 719
E + T + NQ+ EESE HFVRLLD QTFEF+S Y LD +E+GCSI
Sbjct: 713 NEQHVAFAIRNVQDTSVLQLQHNQTSIEESETHFVRLLDHQTFEFLSIYQLDQYEHGCSI 772
Query: 720 LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAF 779
+SCSFSDD+NVYYCVGTAYVLPEENEP+KGRILVF VEDG+LQLI EKETKGAVYSLNAF
Sbjct: 773 ISCSFSDDNNVYYCVGTAYVLPEENEPSKGRILVFAVEDGRLQLIVEKETKGAVYSLNAF 832
Query: 780 NGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISL 839
NGKLLAAINQKIQLYKWMLR+DG+ ELQSECGHHGHILALY QTRGDFIVVGDLMKSISL
Sbjct: 833 NGKLLAAINQKIQLYKWMLREDGSHELQSECGHHGHILALYTQTRGDFIVVGDLMKSISL 892
Query: 840 LIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRL 899
L+YKHEE AIEE ARDYNANWMSAVE+LDD+IY+GAENN+N+FTVRKNS+ ATDEERGRL
Sbjct: 893 LVYKHEESAIEELARDYNANWMSAVEMLDDEIYIGAENNYNIFTVRKNSDAATDEERGRL 952
Query: 900 EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEK 959
EVVGEYHLGEFVNR RHGSLVMRLPDS++GQIPTVIFGT+NGVIG+IASLPHEQY+FLEK
Sbjct: 953 EVVGEYHLGEFVNRLRHGSLVMRLPDSEMGQIPTVIFGTINGVIGIIASLPHEQYVFLEK 1012
Query: 960 LQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN 1019
LQ+ L K IKGVG L+HEQWRSF+N+KKT +A+NFLDGDLIESFLDLSR +M+E++K M
Sbjct: 1013 LQSTLVKFIKGVGNLSHEQWRSFHNDKKTSEARNFLDGDLIESFLDLSRNKMEEVAKGMG 1072
Query: 1020 VSVEELCKRVEELTRLH 1036
V VEEL KRVEELTRLH
Sbjct: 1073 VPVEELSKRVEELTRLH 1089
>gi|242089089|ref|XP_002440377.1| hypothetical protein SORBIDRAFT_09g030580 [Sorghum bicolor]
gi|241945662|gb|EES18807.1| hypothetical protein SORBIDRAFT_09g030580 [Sorghum bicolor]
Length = 1783
Score = 1776 bits (4600), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 856/1056 (81%), Positives = 947/1056 (89%), Gaps = 24/1056 (2%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
MLDVPIYGRIAT+ELFRPH E QDFLFIATERYKFCVLQWDAE SELITRAMGDVSDRIG
Sbjct: 53 MLDVPIYGRIATIELFRPHNETQDFLFIATERYKFCVLQWDAEKSELITRAMGDVSDRIG 112
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL+GC
Sbjct: 113 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLHGC 172
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
KPTIVVLYQDNKD RHVKTYEVALKDKDFVEGPWSQNN+DNGA LLIPVP PL GV+II
Sbjct: 173 VKPTIVVLYQDNKDVRHVKTYEVALKDKDFVEGPWSQNNVDNGAGLLIPVPAPLGGVIII 232
Query: 181 GEETIVYCSANA-FKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTG 239
GEE IVYC+AN+ FKAIPI+ SI +AYGRVD DGSRYLLGD+ G+LHLLV+THE+E+VTG
Sbjct: 233 GEEQIVYCNANSTFKAIPIKQSIIRAYGRVDPDGSRYLLGDNTGILHLLVLTHERERVTG 292
Query: 240 LKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPI 299
LKIE LGETSIAS+ISYLDN VVY+GS +GDSQL+KLNLQ DA GS+VE+LERYVNLGPI
Sbjct: 293 LKIEYLGETSIASSISYLDNGVVYVGSRFGDSQLVKLNLQADASGSFVEILERYVNLGPI 352
Query: 300 VDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDD 359
VDFCVVDL+RQGQGQVVTCSGA+KDGSLR+VRNGIGINEQASVELQGIKG+WSL+SS +D
Sbjct: 353 VDFCVVDLDRQGQGQVVTCSGAFKDGSLRVVRNGIGINEQASVELQGIKGLWSLKSSFND 412
Query: 360 PFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRL 419
P+D +LVVSFISETR LAMN+EDELEETEIEGF +QTQTLFC +A + L+QVT+ SVRL
Sbjct: 413 PYDMYLVVSFISETRFLAMNMEDELEETEIEGFDAQTQTLFCQNATNDLLIQVTANSVRL 472
Query: 420 VSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYE 479
VS TSREL ++W +P G+SVNVA+ANASQVLLATGGGHLVYLEI D L EVKHAQLE+E
Sbjct: 473 VSCTSRELVDQWNAPAGFSVNVASANASQVLLATGGGHLVYLEIRDSKLVEVKHAQLEHE 532
Query: 480 ISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFE 539
ISCLD+NPIGENP YS +AAVGMWTDISVRIFSLPDL LI KE+LGGEI+PRSVLLC E
Sbjct: 533 ISCLDLNPIGENPQYSSLAAVGMWTDISVRIFSLPDLELIRKENLGGEIVPRSVLLCTLE 592
Query: 540 GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPT 599
G+SYLLCALGDG+L +FLLN TGELTDRKKV+LGTQPI+LRTFSSK TTHVFA+SDRPT
Sbjct: 593 GVSYLLCALGDGNLFSFLLNASTGELTDRKKVTLGTQPISLRTFSSKGTTHVFASSDRPT 652
Query: 600 VIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPL 659
VIYSSNKKLLYSNVNLKEV+HMCPFN+AAFPDSLAIAKEGEL+IGTIDDIQKLHIR+IPL
Sbjct: 653 VIYSSNKKLLYSNVNLKEVNHMCPFNTAAFPDSLAIAKEGELSIGTIDDIQKLHIRTIPL 712
Query: 660 GEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSI 719
E RRICHQEQS+T A CS K EESE H +RLLD QTFE + YPLD +E+GCSI
Sbjct: 713 NEQARRICHQEQSKTLAFCSFKYNQSVEESETHLIRLLDHQTFESLCVYPLDQYEFGCSI 772
Query: 720 LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAF 779
+SCSF+DD+NVYYCVGTAYV+PEENEPTKGRILVF VEDG LQLI EKETKGAVYSLNAF
Sbjct: 773 ISCSFADDNNVYYCVGTAYVIPEENEPTKGRILVFAVEDGSLQLIVEKETKGAVYSLNAF 832
Query: 780 NGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISL 839
NGKLLAAINQKIQLYKWM R+DG+ ELQSECGHHGHILALY QTRGDFIVVGDLMKSISL
Sbjct: 833 NGKLLAAINQKIQLYKWMSREDGSHELQSECGHHGHILALYTQTRGDFIVVGDLMKSISL 892
Query: 840 LIYKHE-----------------------EGAIEERARDYNANWMSAVEILDDDIYLGAE 876
L+YK E AIEERARDYNANWM+AVE+LDD++Y+GAE
Sbjct: 893 LVYKVVPLTVCLTHIVLSVIFFVSLFVVLESAIEERARDYNANWMTAVEMLDDEVYVGAE 952
Query: 877 NNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIF 936
N +NLFTVRKNS+ ATD+ER RLEVVGEYHLGEFVNRFRHGSLVMRLPDS++GQIPTVIF
Sbjct: 953 NGYNLFTVRKNSDAATDDERARLEVVGEYHLGEFVNRFRHGSLVMRLPDSEIGQIPTVIF 1012
Query: 937 GTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLD 996
GT+NGVIG+IASLPH+QY+FLEKLQ+ L K IKGVG L+HEQWRSF+N+KKT +A+NFLD
Sbjct: 1013 GTINGVIGIIASLPHDQYVFLEKLQSTLVKYIKGVGNLSHEQWRSFHNDKKTAEARNFLD 1072
Query: 997 GDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
GDLIESFLDLSR++M+E+SK M V VEEL KRV+ +
Sbjct: 1073 GDLIESFLDLSRSKMEEVSKAMGVPVEELSKRVDTI 1108
>gi|168066745|ref|XP_001785293.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162663100|gb|EDQ49884.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1090
Score = 1771 bits (4588), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 847/1038 (81%), Positives = 937/1038 (90%), Gaps = 2/1038 (0%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
MLDVPIYGRIATLELFRP GE+QD LFI+ ERYKFCVLQWDAE+ L+TRAMGDVSDRIG
Sbjct: 53 MLDVPIYGRIATLELFRPPGESQDVLFISFERYKFCVLQWDAETGLLVTRAMGDVSDRIG 112
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RPTDNGQIGI+DPDCRLIGLHLYDGLFKVIP DNKGQLKEAFNIRLEELQVLDIKFLYGC
Sbjct: 113 RPTDNGQIGIVDPDCRLIGLHLYDGLFKVIPIDNKGQLKEAFNIRLEELQVLDIKFLYGC 172
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
AKPTI VLYQDNKDARHVKTYEV LK+KDF EGPW QNNLDNGA LLIPVP PL G +II
Sbjct: 173 AKPTIAVLYQDNKDARHVKTYEVQLKEKDFGEGPWLQNNLDNGAGLLIPVPLPLGGAIII 232
Query: 181 GEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL 240
GE+TIVY + + FKAIPIRPSITKAYGRVD+DGSRYLL DH G+L+LLVI+H+KE+V+ L
Sbjct: 233 GEQTIVYYNGSVFKAIPIRPSITKAYGRVDSDGSRYLLSDHNGMLYLLVISHDKERVSAL 292
Query: 241 KIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIV 300
+E LGETS AST+SYLDN VV++GSSYGDSQLI+LN Q DAK SYVEVLE YVNLGPIV
Sbjct: 293 NVEPLGETSAASTLSYLDNGVVFVGSSYGDSQLIRLNHQADAKNSYVEVLESYVNLGPIV 352
Query: 301 DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP 360
D CVVDLERQGQGQVVTCSGA+KDGSLRIVRNGIGINEQAS ELQGIKGMWSLR+S+ D
Sbjct: 353 DLCVVDLERQGQGQVVTCSGAFKDGSLRIVRNGIGINEQASAELQGIKGMWSLRASSSDV 412
Query: 361 FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLV 420
+DTFLVVSFISETRILAMN +DELEETEI+GF S+ QTLFC++A+++QLVQVT+GS+RLV
Sbjct: 413 YDTFLVVSFISETRILAMNTDDELEETEIDGFDSEAQTLFCYNAVHDQLVQVTAGSLRLV 472
Query: 421 SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEI 480
+ +R EWK+P ++NVATANASQVLLATGGG+LVY+EIG G L EV H QLEYEI
Sbjct: 473 DAKTRRQLTEWKAPAPMTINVATANASQVLLATGGGNLVYIEIGQGTLVEVAHVQLEYEI 532
Query: 481 SCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEG 540
SCLDINP+G+NP S + AVGMWTDIS+RIF+LP L LITKE LGGEIIPRSVL C+F+G
Sbjct: 533 SCLDINPVGDNPERSNLVAVGMWTDISIRIFALPSLTLITKEMLGGEIIPRSVLFCSFDG 592
Query: 541 ISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV 600
I+YLLCALGDGHL NF LN TGEL+DRKK+SLGTQPI LRTF SKNTTHVFAASDRPTV
Sbjct: 593 IAYLLCALGDGHLFNFTLNPATGELSDRKKISLGTQPIALRTFRSKNTTHVFAASDRPTV 652
Query: 601 IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG 660
IYSSNKKLLYSNVNLKEV+HMCPFNSA+FPDSLAI KEGELTIGTID+IQKLHIR++PLG
Sbjct: 653 IYSSNKKLLYSNVNLKEVNHMCPFNSASFPDSLAIGKEGELTIGTIDEIQKLHIRTVPLG 712
Query: 661 EHPRRICHQEQSRTFAICSLKNQ--SCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCS 718
EHPRRI HQEQSRTFAICS K S E+ E H+VRL++DQTFE IS +PLD +E GCS
Sbjct: 713 EHPRRIAHQEQSRTFAICSAKYAPGSNGEDMETHYVRLIEDQTFEIISGFPLDPYENGCS 772
Query: 719 ILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNA 778
I++CSF+DDSNVYYCVGTAY LPEE+EP+KGRILVF VEDGK+QL+AEKE KGAVY+LNA
Sbjct: 773 IITCSFTDDSNVYYCVGTAYALPEESEPSKGRILVFSVEDGKIQLVAEKEVKGAVYNLNA 832
Query: 779 FNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSIS 838
FNGKLLA INQKI LYKW LRDDGTRELQ E HHGHILALYVQ+RGDFIVVGDLMKSIS
Sbjct: 833 FNGKLLAGINQKIALYKWTLRDDGTRELQYESSHHGHILALYVQSRGDFIVVGDLMKSIS 892
Query: 839 LLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGR 898
LLIYK EEGAIEERARDYNANWM+AVEILDDD YLGAEN+FNLFTVRKN++ ATDEERGR
Sbjct: 893 LLIYKPEEGAIEERARDYNANWMTAVEILDDDTYLGAENSFNLFTVRKNNDAATDEERGR 952
Query: 899 LEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLE 958
LEVVGEYHLGEFVNRFRHGSLVMRLPDS+ QIPTVIFGTVNGVIGVIASLP +Q+LFL+
Sbjct: 953 LEVVGEYHLGEFVNRFRHGSLVMRLPDSEASQIPTVIFGTVNGVIGVIASLPQDQFLFLQ 1012
Query: 959 KLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTM 1018
KLQ L KVIKGVGGL+HEQWRSF+NE+KTVDA+NFLDGDLIESFLDLSR +M+EI+ ++
Sbjct: 1013 KLQQALVKVIKGVGGLSHEQWRSFSNERKTVDARNFLDGDLIESFLDLSRNKMEEIATSL 1072
Query: 1019 NVSVEELCKRVEELTRLH 1036
VSVEEL K VEELTRLH
Sbjct: 1073 EVSVEELHKAVEELTRLH 1090
>gi|302788810|ref|XP_002976174.1| hypothetical protein SELMODRAFT_151061 [Selaginella moellendorffii]
gi|300156450|gb|EFJ23079.1| hypothetical protein SELMODRAFT_151061 [Selaginella moellendorffii]
Length = 1089
Score = 1771 bits (4586), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 839/1038 (80%), Positives = 940/1038 (90%), Gaps = 3/1038 (0%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+LDVPIYGRIATLELFRP GE QD LF++TERYKFCVLQWD+E++EL+TRAMGDVSDRIG
Sbjct: 53 LLDVPIYGRIATLELFRPPGETQDVLFVSTERYKFCVLQWDSETTELVTRAMGDVSDRIG 112
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RPTDNGQIGI+DP+CRLIGLHLYDGLFKVIP DNKGQLKEAFNIRLEELQVLDIKFLYGC
Sbjct: 113 RPTDNGQIGIVDPECRLIGLHLYDGLFKVIPIDNKGQLKEAFNIRLEELQVLDIKFLYGC 172
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
+KPTI VLYQDNKDARHVKTYE+ LK+KDF EGPW QNNLDNGA +LIPVP PL GV+II
Sbjct: 173 SKPTIAVLYQDNKDARHVKTYEIQLKEKDFGEGPWLQNNLDNGAGMLIPVPTPLGGVIII 232
Query: 181 GEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL 240
GE+TIVY S +AFKAIPIRPSITKAYG+VDADGSRYLL DH G LHLLVITHE+++V GL
Sbjct: 233 GEQTIVYYSGSAFKAIPIRPSITKAYGKVDADGSRYLLSDHTGSLHLLVITHERDRVLGL 292
Query: 241 KIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIV 300
K+ELLGETS AS++SYLDN VVY+GSSYGDSQLIKLN Q D++ SYVEVLE +VNLGPIV
Sbjct: 293 KVELLGETSAASSLSYLDNGVVYVGSSYGDSQLIKLNAQVDSRNSYVEVLESFVNLGPIV 352
Query: 301 DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP 360
D CVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQAS ELQGIKGMWSLR+++ D
Sbjct: 353 DLCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASAELQGIKGMWSLRATSKDV 412
Query: 361 FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLV 420
FD FLVVSFISETRILAMN++DELEETEIEGF S+ QTLFCH+AI++Q++QVTS S+RLV
Sbjct: 413 FDIFLVVSFISETRILAMNMDDELEETEIEGFDSEAQTLFCHNAIHDQIIQVTSTSLRLV 472
Query: 421 SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEI 480
+TSR EW++P G ++NVATANASQVLLATGGG+LVY+EIG G LTEVKH QL+ EI
Sbjct: 473 DATSRRQLTEWRTPSGVAINVATANASQVLLATGGGNLVYIEIGRGSLTEVKHVQLQNEI 532
Query: 481 SCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEG 540
SCLDINPIG +P S IAAVGMWTDISV++FSLP+L ++ KE LGGEIIPRSVL+CAFEG
Sbjct: 533 SCLDINPIGTDPDRSNIAAVGMWTDISVQVFSLPNLEVLAKESLGGEIIPRSVLICAFEG 592
Query: 541 ISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV 600
+SYLLCALGDGHL NFL++ TG+L+DRKK+SLGTQPI LRTF SKN THVFAASDRPTV
Sbjct: 593 VSYLLCALGDGHLFNFLIDTSTGQLSDRKKISLGTQPIMLRTFRSKNATHVFAASDRPTV 652
Query: 601 IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG 660
IYSSNKKLLYSNVNLKEV+HMCPFNSA+FPDSLAI KEGELTIGTIDDIQKLHIR++ LG
Sbjct: 653 IYSSNKKLLYSNVNLKEVNHMCPFNSASFPDSLAIGKEGELTIGTIDDIQKLHIRTVALG 712
Query: 661 EHPRRICHQEQSRTFAICSLK--NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCS 718
EHPRRICHQEQ+RTF +C+ + + E+ E HFV+LLDDQTFE + +Y LDTFE GC+
Sbjct: 713 EHPRRICHQEQTRTFGLCTARFYSNPNGEDHESHFVKLLDDQTFEVLGSYNLDTFENGCT 772
Query: 719 ILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNA 778
I++CSF+DD YYCVGTAY LPEENEP+KGRIL+F VEDGK QL+ EKETKGAVY+LNA
Sbjct: 773 IITCSFTDDPATYYCVGTAYALPEENEPSKGRILIFTVEDGKFQLVTEKETKGAVYNLNA 832
Query: 779 FNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSIS 838
FNGKLLA INQKIQLYKW R D TRELQSECGHHGHILALYVQ+RGDFIVVGDLMKSIS
Sbjct: 833 FNGKLLAGINQKIQLYKWTQR-DSTRELQSECGHHGHILALYVQSRGDFIVVGDLMKSIS 891
Query: 839 LLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGR 898
LL+YK EEGAIEERARDYNANWM+AVEILDDDIYLGAEN+FNLFTVRKNS+ ATDEERGR
Sbjct: 892 LLLYKPEEGAIEERARDYNANWMTAVEILDDDIYLGAENSFNLFTVRKNSDAATDEERGR 951
Query: 899 LEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLE 958
LEVVGEYHLGEFVNRFRHGSLVMRLPD++ QIPTVIFGTVNGVIGV+ASL EQ+ FL+
Sbjct: 952 LEVVGEYHLGEFVNRFRHGSLVMRLPDNETSQIPTVIFGTVNGVIGVVASLQQEQFNFLQ 1011
Query: 959 KLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTM 1018
+LQ L KVIKGVGGL+HEQWRSF++E+K DAKNFLDGDLIESFLDL+R +MDE+S M
Sbjct: 1012 RLQHCLAKVIKGVGGLSHEQWRSFSSERKNADAKNFLDGDLIESFLDLNRAKMDEVSAAM 1071
Query: 1019 NVSVEELCKRVEELTRLH 1036
+VSVEELCKRVEE+TRLH
Sbjct: 1072 SVSVEELCKRVEEMTRLH 1089
>gi|302769568|ref|XP_002968203.1| hypothetical protein SELMODRAFT_145521 [Selaginella moellendorffii]
gi|300163847|gb|EFJ30457.1| hypothetical protein SELMODRAFT_145521 [Selaginella moellendorffii]
Length = 1089
Score = 1770 bits (4584), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 838/1038 (80%), Positives = 939/1038 (90%), Gaps = 3/1038 (0%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+LDVPIYGRIATLELFRP GE QD LF++TERYKFCVLQWD+E++EL+TRAMGDVSDRIG
Sbjct: 53 LLDVPIYGRIATLELFRPPGETQDVLFVSTERYKFCVLQWDSETTELVTRAMGDVSDRIG 112
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RPTDNGQIGI+DP+CRLIGLHLYDGLFKVIP DNKGQLKEAFNIRLEELQVLDIKFLYGC
Sbjct: 113 RPTDNGQIGIVDPECRLIGLHLYDGLFKVIPIDNKGQLKEAFNIRLEELQVLDIKFLYGC 172
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
+KPTI VLYQDNKDARHVKTYE+ LK+KDF EGPWSQNNLDNGA +LIPVP PL GV+II
Sbjct: 173 SKPTIAVLYQDNKDARHVKTYEIQLKEKDFGEGPWSQNNLDNGAGMLIPVPTPLGGVIII 232
Query: 181 GEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL 240
GE+TIVY S +AFKAIPIRPSITKAYG+VDADGSRYLL DH G LHLLVITHE+++V GL
Sbjct: 233 GEQTIVYYSGSAFKAIPIRPSITKAYGKVDADGSRYLLSDHTGSLHLLVITHERDRVLGL 292
Query: 241 KIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIV 300
K+ELLGETS AS++SYLDN VVY+GSSYGDSQLIKLN Q D++ SYVEVLE +VNLGPIV
Sbjct: 293 KVELLGETSAASSLSYLDNGVVYVGSSYGDSQLIKLNAQVDSRNSYVEVLESFVNLGPIV 352
Query: 301 DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP 360
D CVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQAS ELQGIKGMWSLR+++ D
Sbjct: 353 DLCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASAELQGIKGMWSLRATSKDV 412
Query: 361 FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLV 420
FD FLVVSFISETRILAMN++DELEETEIEGF S+ QTLFCH+AI++Q++QVTS S+RLV
Sbjct: 413 FDIFLVVSFISETRILAMNMDDELEETEIEGFDSEAQTLFCHNAIHDQIIQVTSTSLRLV 472
Query: 421 SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEI 480
+TSR EW++P ++NVATANASQVLLATGGG+LVY+EIG G LTEVKH QL+ EI
Sbjct: 473 DATSRRQLTEWRTPSAVAINVATANASQVLLATGGGNLVYIEIGRGSLTEVKHVQLQNEI 532
Query: 481 SCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEG 540
SCLDINPIG +P S IAAVGMWTDISV++FSLP+L ++ KE LGGEIIPRSVL+CAFEG
Sbjct: 533 SCLDINPIGTDPDRSNIAAVGMWTDISVQVFSLPNLEVLAKESLGGEIIPRSVLICAFEG 592
Query: 541 ISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV 600
+SYLLCALGDGHL NFL++ G+L+DRKK+SLGTQPI LRTF SKN THVFAASDRPTV
Sbjct: 593 VSYLLCALGDGHLFNFLIDTSNGQLSDRKKISLGTQPIMLRTFRSKNATHVFAASDRPTV 652
Query: 601 IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG 660
IYSSNKKLLYSNVNLKEV+HMCPFNSA+FPDSLAI KEGELTIGTIDDIQKLHIR++ LG
Sbjct: 653 IYSSNKKLLYSNVNLKEVNHMCPFNSASFPDSLAIGKEGELTIGTIDDIQKLHIRTVALG 712
Query: 661 EHPRRICHQEQSRTFAICSLK--NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCS 718
EHPRRICHQEQ+RTF +C+ + + E+ E HFV+LLDDQTFE + +Y LDTFE GC+
Sbjct: 713 EHPRRICHQEQTRTFGLCTARFYSNPNGEDHESHFVKLLDDQTFEVLGSYNLDTFENGCT 772
Query: 719 ILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNA 778
I++CSF+DD YYCVGTAY LPEENEP+KGRIL+F VEDGK QL+ EKETKGAVY+LNA
Sbjct: 773 IITCSFTDDPATYYCVGTAYALPEENEPSKGRILIFTVEDGKFQLVTEKETKGAVYNLNA 832
Query: 779 FNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSIS 838
FNGKLLA INQKIQLYKW R D TRELQSECGHHGHILALYVQ+RGDFIVVGDLMKSIS
Sbjct: 833 FNGKLLAGINQKIQLYKWTQR-DSTRELQSECGHHGHILALYVQSRGDFIVVGDLMKSIS 891
Query: 839 LLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGR 898
LL+YK EEGAIEERARDYNANWM+AVEILDDDIYLGAEN+FNLFTVRKNS+ ATDEERGR
Sbjct: 892 LLLYKPEEGAIEERARDYNANWMTAVEILDDDIYLGAENSFNLFTVRKNSDAATDEERGR 951
Query: 899 LEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLE 958
LEVVGEYHLGEFVNRFRHGSLVMRLPD++ QIPTVIFGTVNGVIGV+ASL EQ+ FL+
Sbjct: 952 LEVVGEYHLGEFVNRFRHGSLVMRLPDNETSQIPTVIFGTVNGVIGVVASLQQEQFNFLQ 1011
Query: 959 KLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTM 1018
+LQ L KVIKGVGGL+HEQWRSF++E+K DAKNFLDGDLIESFLDL+R +MDE+S M
Sbjct: 1012 RLQHCLAKVIKGVGGLSHEQWRSFSSERKNADAKNFLDGDLIESFLDLNRAKMDEVSAAM 1071
Query: 1019 NVSVEELCKRVEELTRLH 1036
+VSVEELCKRVEE+TRLH
Sbjct: 1072 SVSVEELCKRVEEMTRLH 1089
>gi|168047617|ref|XP_001776266.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162672361|gb|EDQ58899.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1089
Score = 1747 bits (4525), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 836/1038 (80%), Positives = 932/1038 (89%), Gaps = 3/1038 (0%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
MLDVP+YGRIATLELFRP GE+QD LFI+ ERYKFCVLQWDAE+ ITRAMGDVSDR G
Sbjct: 53 MLDVPLYGRIATLELFRPPGESQDVLFISFERYKFCVLQWDAETGSPITRAMGDVSDRTG 112
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RPTDNGQIGI+DPDCRLIGLHLYDG+FKVIP DNKGQLKEAFNIRLEELQVLDIKFLYGC
Sbjct: 113 RPTDNGQIGIVDPDCRLIGLHLYDGMFKVIPIDNKGQLKEAFNIRLEELQVLDIKFLYGC 172
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
A PTI VLYQDNKDARHVKTYEV LK+KDF EGPW QNNLDNGA LLIPVP PL G +II
Sbjct: 173 ANPTIAVLYQDNKDARHVKTYEVNLKEKDFGEGPWLQNNLDNGAGLLIPVPLPLGGAIII 232
Query: 181 GEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL 240
GE+TIVY + + FKAIPIRPSITKAYGRVD+DGSRYLL DH G+L+LLVI+H+KE+V+ L
Sbjct: 233 GEQTIVYYNGSVFKAIPIRPSITKAYGRVDSDGSRYLLSDHNGMLYLLVISHDKERVSAL 292
Query: 241 KIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIV 300
+E LGETS AST+SYLDN VV++GSSYGDSQLI+LN Q D KGSYVEVLE +VNLGPIV
Sbjct: 293 NVEPLGETSAASTLSYLDNGVVFVGSSYGDSQLIRLNHQADVKGSYVEVLESFVNLGPIV 352
Query: 301 DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP 360
D CVVDLERQGQGQVVTCSGA+KDGSLRIVRNGIGINEQASVELQGIKGMWSLR+S+ D
Sbjct: 353 DLCVVDLERQGQGQVVTCSGAFKDGSLRIVRNGIGINEQASVELQGIKGMWSLRASSSDV 412
Query: 361 FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLV 420
+DTFLVVSFISETRILAMN +DELEETEI+GF S+ QTLFCH+A+++QLVQVT+GS+RLV
Sbjct: 413 YDTFLVVSFISETRILAMNTDDELEETEIDGFDSEAQTLFCHNAVHDQLVQVTAGSLRLV 472
Query: 421 SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEI 480
++ +R+ EWK+P ++NVATANASQVLLATGGG+LVY+EIG G LT V H+QLEYEI
Sbjct: 473 NAKTRKQLTEWKAPAPMTINVATANASQVLLATGGGNLVYIEIGQGTLTGVAHSQLEYEI 532
Query: 481 SCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEG 540
SCLDINP+GENP S + AVGMWTDISVRIF+LP L LI KE LGGEIIPRSVL C+F+G
Sbjct: 533 SCLDINPVGENPERSNLVAVGMWTDISVRIFALPSLTLINKEMLGGEIIPRSVLFCSFDG 592
Query: 541 ISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV 600
++YLLCA+GDGHL NF+LN TGEL+DRKK+SLGTQPI LRTF SKNTTHVFAASDRPTV
Sbjct: 593 LAYLLCAVGDGHLFNFMLNPSTGELSDRKKISLGTQPIALRTFRSKNTTHVFAASDRPTV 652
Query: 601 IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG 660
IYSSNKKLLYSNVNLKEV+HMCPFNSA+FPDSLAI KEGELTIGTIDDIQKLHIR++PLG
Sbjct: 653 IYSSNKKLLYSNVNLKEVNHMCPFNSASFPDSLAIGKEGELTIGTIDDIQKLHIRTVPLG 712
Query: 661 EHPRRICHQEQSRTFAICSLKNQSCA--EESEMHFVRLLDDQTFEFISTYPLDTFEYGCS 718
E P RI HQEQSR+FAICS K E+ E H+VRL++DQTFE S + LD +E GCS
Sbjct: 713 ERPCRIAHQEQSRSFAICSAKYSQGPNNEDIETHYVRLIEDQTFEITSGFALDLYEIGCS 772
Query: 719 ILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNA 778
I++CSF+DDSNVYYCVGTAY LPEE+EPTKGRILVF+VEDGKLQL+AEKE KGAVY+LNA
Sbjct: 773 IITCSFTDDSNVYYCVGTAYALPEESEPTKGRILVFLVEDGKLQLVAEKEMKGAVYNLNA 832
Query: 779 FNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSIS 838
FNGKLLA INQKI LYKW LR DGTR L+ E HHGHILALYVQ+RGDFIVVGDLMKSIS
Sbjct: 833 FNGKLLAGINQKIALYKWTLR-DGTRVLEIESSHHGHILALYVQSRGDFIVVGDLMKSIS 891
Query: 839 LLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGR 898
LLIYK EEGAIEERARDYNANWM+AVEILDDD YLGAEN+FNLFTVRKN++ ATDEERGR
Sbjct: 892 LLIYKPEEGAIEERARDYNANWMTAVEILDDDTYLGAENSFNLFTVRKNNDAATDEERGR 951
Query: 899 LEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLE 958
LEVVGEYHLGEFVNRFRHGSLVMRLPDS+ IPTVIFGTVNGVIGVIASLP +++LFL+
Sbjct: 952 LEVVGEYHLGEFVNRFRHGSLVMRLPDSEASLIPTVIFGTVNGVIGVIASLPQDKFLFLQ 1011
Query: 959 KLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTM 1018
KLQ L KVIKGVGGL+HEQWRSF+NE+KTVDA+NFLDGDLIESFLDLSR +M+EI+ +
Sbjct: 1012 KLQQALVKVIKGVGGLSHEQWRSFSNERKTVDARNFLDGDLIESFLDLSRNKMEEIAAPL 1071
Query: 1019 NVSVEELCKRVEELTRLH 1036
+SVEELCKRVEELTRLH
Sbjct: 1072 EISVEELCKRVEELTRLH 1089
>gi|357135348|ref|XP_003569272.1| PREDICTED: DNA damage-binding protein 1a-like [Brachypodium
distachyon]
Length = 1074
Score = 1460 bits (3779), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 707/1036 (68%), Positives = 846/1036 (81%), Gaps = 14/1036 (1%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
M+DVP+YG IATLELFR E QDFLFI+ ERY+ VL WD +SELITR+ GDVSD IG
Sbjct: 53 MVDVPLYGTIATLELFRSRSETQDFLFISMERYRCIVLHWDGRNSELITRSGGDVSDFIG 112
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RPTDNGQIG+IDP RLIGL LYDGLFKVIPFDNKG LKEA NIRL+E VLDIKFLYGC
Sbjct: 113 RPTDNGQIGVIDPQNRLIGLSLYDGLFKVIPFDNKGNLKEALNIRLQEFLVLDIKFLYGC 172
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
A+PT+VVL+QDNKD+RHVKTYEVAL+DKDFVEG WSQ+NLDN A LLIPVP L GV+II
Sbjct: 173 ARPTVVVLHQDNKDSRHVKTYEVALEDKDFVEGSWSQSNLDNSAHLLIPVP--LGGVIII 230
Query: 181 GEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL 240
GE TIVYCSA FKA+ I+ SI +A GRVD DGSRYL GD+ G LHL+VITHE +VT L
Sbjct: 231 GEHTIVYCSATTFKALSIKQSIIRAVGRVDPDGSRYLYGDNTGALHLIVITHEWGRVTDL 290
Query: 241 KIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIV 300
K +GETSIASTISYLD+ +VYIGS +GDSQLIKLN+Q DA S+VE+LE+++N GPIV
Sbjct: 291 KTHYMGETSIASTISYLDSGLVYIGSRFGDSQLIKLNIQADASASFVEILEQFMNTGPIV 350
Query: 301 DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP 360
DFCVVD ER+GQGQV+TCSGAYKDGS+R VRNG+ I +QASVEL+G+KG+WS++SS +DP
Sbjct: 351 DFCVVDTERRGQGQVITCSGAYKDGSIRAVRNGVVITDQASVELRGMKGLWSMKSSLNDP 410
Query: 361 FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLV 420
+DTFLVV+FI+ET LAMN+E+ELEE +I+GF S+TQTL C AI+NQL+QVTS SVRLV
Sbjct: 411 YDTFLVVTFINETHFLAMNMENELEEVDIKGFDSETQTLACGSAIHNQLIQVTSRSVRLV 470
Query: 421 SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEI 480
SS S EL ++W +P +SVNVA ANA+QVLLATG HLVYLEI + VKH QLE+EI
Sbjct: 471 SSVSLELLDQWFAPARFSVNVAAANANQVLLATGNCHLVYLEITSSKIVPVKHIQLEHEI 530
Query: 481 SCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEG 540
SCLDINPIGENP YS +AAVGMWTDISVRIFSLP L LI KEHL GE++PRSVLLC E
Sbjct: 531 SCLDINPIGENPQYSSLAAVGMWTDISVRIFSLPGLKLIRKEHL-GEVVPRSVLLCTIEA 589
Query: 541 ISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV 600
+SYL C LGDGHL +F+LN T EL+DR++VSLG QPI+L FSS+N THVFAASDRP V
Sbjct: 590 VSYLFCGLGDGHLFSFVLNSSTCELSDRRRVSLGAQPISLHIFSSQNRTHVFAASDRPAV 649
Query: 601 IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG 660
IYSS++KLLYS NLKEV+H+CPFN+A FP+S+ +AKE EL+IG I+DI++LHIR+IPL
Sbjct: 650 IYSSDQKLLYSYANLKEVNHVCPFNTAVFPESIVLAKESELSIGEINDIRQLHIRTIPLK 709
Query: 661 EHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSIL 720
E RRICHQEQS+T A+CS K + ES HFVRLLD QTF +ST+ LD FE GCSI+
Sbjct: 710 EQARRICHQEQSQTLALCSFKPKFIHAESGKHFVRLLDYQTFWVLSTHTLDEFECGCSIV 769
Query: 721 SCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFN 780
SCSFSDD N YYCVGTAY+LP E EPTKGRIL+F+VE+ KL+L+AE+ETKGAVYSLNA
Sbjct: 770 SCSFSDDDNFYYCVGTAYILPYEIEPTKGRILIFLVEERKLRLVAERETKGAVYSLNALT 829
Query: 781 GKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLL 840
GKLLAA+NQKI +YKW+ RD+ +LQSEC + G +LAL+ QT G FIVVGD+++S+SLL
Sbjct: 830 GKLLAAVNQKIIVYKWVRRDN-RHQLQSECSYRGCVLALHTQTHGHFIVVGDMVRSVSLL 888
Query: 841 IYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLE 900
YK+EEG IE RD+N W++AV +LDDDIY+GA+N NLFT+ GR
Sbjct: 889 RYKYEEGLIEVVTRDFNTKWITAVAMLDDDIYIGADNCCNLFTLHS----------GRPG 938
Query: 901 VVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKL 960
VVGEYHLG+ VNR HGSLVM DS++GQIPTVIFGT++G IGVIAS P++QY+FLEKL
Sbjct: 939 VVGEYHLGDLVNRMHHGSLVMHHTDSEIGQIPTVIFGTISGAIGVIASFPYDQYVFLEKL 998
Query: 961 QTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV 1020
Q+ L K IK VG L+H +WRSF N +T +A+NF+DGDLIESFL LS ++M+E+S+ M +
Sbjct: 999 QSVLVKFIKSVGNLSHVEWRSFYNVSRTAEARNFVDGDLIESFLSLSPSKMEEVSQVMGL 1058
Query: 1021 SVEELCKRVEELTRLH 1036
+ELCK VEEL +LH
Sbjct: 1059 RADELCKIVEELRKLH 1074
>gi|110741229|dbj|BAF02165.1| UV-damaged DNA binding factor - like protein [Arabidopsis thaliana]
Length = 727
Score = 1358 bits (3515), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 657/727 (90%), Positives = 700/727 (96%)
Query: 310 QGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDPFDTFLVVSF 369
QGQGQVVTCSGA+KDGSLR+VRNGIGINEQASVELQGIKGMWSL+SS D+ FDTFLVVSF
Sbjct: 1 QGQGQVVTCSGAFKDGSLRVVRNGIGINEQASVELQGIKGMWSLKSSIDEAFDTFLVVSF 60
Query: 370 ISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRN 429
ISETRILAMNLEDELEETEIEGF SQ QTLFCHDA+YNQLVQVTS SVRLVSST+RELR+
Sbjct: 61 ISETRILAMNLEDELEETEIEGFLSQVQTLFCHDAVYNQLVQVTSNSVRLVSSTTRELRD 120
Query: 430 EWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDINPIG 489
EW +P G++VNVATANASQVLLATGGGHLVYLEIGDG LTEV+HA LEYE+SCLDINPIG
Sbjct: 121 EWHAPAGFTVNVATANASQVLLATGGGHLVYLEIGDGKLTEVQHALLEYEVSCLDINPIG 180
Query: 490 ENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEGISYLLCALG 549
+NP+YSQ+AAVGMWTDISVRIFSLP+L LITKE LGGEIIPRSVLLCAFEGISYLLCALG
Sbjct: 181 DNPNYSQLAAVGMWTDISVRIFSLPELTLITKEQLGGEIIPRSVLLCAFEGISYLLCALG 240
Query: 550 DGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLL 609
DGHLLNF ++ TG+L DRKKVSLGTQPITLRTFSSK+ THVFAASDRPTVIYSSNKKLL
Sbjct: 241 DGHLLNFQMDTTTGQLKDRKKVSLGTQPITLRTFSSKSATHVFAASDRPTVIYSSNKKLL 300
Query: 610 YSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQ 669
YSNVNLKEVSHMCPFNSAAFPDSLAIA+EGELTIGTIDDIQKLHIR+IPLGEH RRICHQ
Sbjct: 301 YSNVNLKEVSHMCPFNSAAFPDSLAIAREGELTIGTIDDIQKLHIRTIPLGEHARRICHQ 360
Query: 670 EQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSN 729
EQ+RTF ICSL NQS +EESEMHFVRLLDDQTFEF+STYPLD+FEYGCSILSCSF++D N
Sbjct: 361 EQTRTFGICSLGNQSNSEESEMHFVRLLDDQTFEFMSTYPLDSFEYGCSILSCSFTEDKN 420
Query: 730 VYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQ 789
VYYCVGTAYVLPEENEPTKGRILVFIVEDG+LQLIAEKETKGAVYSLNAFNGKLLAAINQ
Sbjct: 421 VYYCVGTAYVLPEENEPTKGRILVFIVEDGRLQLIAEKETKGAVYSLNAFNGKLLAAINQ 480
Query: 790 KIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAI 849
KIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLL+YKHEEGAI
Sbjct: 481 KIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLLYKHEEGAI 540
Query: 850 EERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGE 909
EERARDYNANWMSAVEILDDDIYLGAENNFNL TV+KNSEGATDEERGRLEVVGEYHLGE
Sbjct: 541 EERARDYNANWMSAVEILDDDIYLGAENNFNLLTVKKNSEGATDEERGRLEVVGEYHLGE 600
Query: 910 FVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIK 969
FVNRFRHGSLVMRLPDS++GQIPTVIFGTVNGVIGVIASLP EQY FLEKLQ++LRKVIK
Sbjct: 601 FVNRFRHGSLVMRLPDSEIGQIPTVIFGTVNGVIGVIASLPQEQYTFLEKLQSSLRKVIK 660
Query: 970 GVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRV 1029
GVGGL+HEQWRSFNNEK+T +A+NFLDGDLIESFLDLSR +M++ISK+MNV VEELCKRV
Sbjct: 661 GVGGLSHEQWRSFNNEKRTAEARNFLDGDLIESFLDLSRNKMEDISKSMNVQVEELCKRV 720
Query: 1030 EELTRLH 1036
EELTRLH
Sbjct: 721 EELTRLH 727
>gi|384250802|gb|EIE24281.1| hypothetical protein COCSUDRAFT_28729 [Coccomyxa subellipsoidea
C-169]
Length = 1101
Score = 1320 bits (3417), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 625/1040 (60%), Positives = 800/1040 (76%), Gaps = 9/1040 (0%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRP 62
DV IYGR+AT+ELFRP GE++D LF++TERYKFCVL++D+E+ EL+TRA GD+ D++GRP
Sbjct: 60 DVAIYGRVATMELFRPVGESKDLLFLSTERYKFCVLEYDSETGELVTRANGDIEDQVGRP 119
Query: 63 TDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGCAK 122
DNGQIGI+DP CR+IGLHLYDGLFKVIP D+KGQL EAFN+R++EL V+D+ FL GCAK
Sbjct: 120 CDNGQIGIVDPGCRMIGLHLYDGLFKVIPIDDKGQLHEAFNMRIDELNVIDMIFLEGCAK 179
Query: 123 PTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGE 182
PTI VLYQDNKDARH+KTYEV LK+KD EGPW Q+NLD GA +I VP PL G L++GE
Sbjct: 180 PTIAVLYQDNKDARHIKTYEVVLKEKDLTEGPWRQSNLDAGASRVIAVPEPLGGALVVGE 239
Query: 183 ETIVYCS-ANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLK 241
I Y A K PI+ +I +A+GRVD DGSRYLLGD+ G L+LLV+ H+ E V GLK
Sbjct: 240 SVIAYMGQGQAMKCTPIKATIIRAHGRVDEDGSRYLLGDYVGNLYLLVLQHDGEHVAGLK 299
Query: 242 IELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLN---LQPDAKGSYVEVLERYVNLGP 298
+E LG TS ST++YLDN VV++GSS GDSQL++L+ + P ++VEVLE NLGP
Sbjct: 300 VEPLGRTSAPSTLTYLDNGVVFVGSSGGDSQLVRLHPTPVTPQEPSNFVEVLETMTNLGP 359
Query: 299 IVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTD 358
I+DF VVDLERQGQGQVV CSG DGSLRIVRNGIG+ EQA+VEL GIKGMW+LR+S
Sbjct: 360 IIDFVVVDLERQGQGQVVMCSGIMADGSLRIVRNGIGMIEQATVELPGIKGMWALRASHM 419
Query: 359 DPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVR 418
D FDTFLV+SF+ ETRILA+N +DEL+E E+ GF + QTL C + + + LVQV VR
Sbjct: 420 DAFDTFLVISFVGETRILAINADDELDEAELPGFSADAQTLCCGNTVSDHLVQVAGADVR 479
Query: 419 LVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEY 478
LV +++R+L ++W+ P G ++NVA+ + +Q +T G+LVYLE+G+ + + H +L+
Sbjct: 480 LVDASTRQLTHQWRPPAGLNINVASVSPTQASPSTAHGNLVYLELGESGIEQKGHVKLDA 539
Query: 479 EISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAF 538
E++C+DI+P+ E+ + + AVG W D+ V IFSLP + + E LGGEIIPRSVL AF
Sbjct: 540 EVACVDISPLSEDGEAASLLAVGTW-DMRVHIFSLPAMAPLVSEPLGGEIIPRSVLFAAF 598
Query: 539 EGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRP 598
EG+ YLLCA+GDG L NF ++ TG L DRKK+ LGT+PI LR+F S +HVFAASDRP
Sbjct: 599 EGVPYLLCAMGDGQLYNFHVDEATGALADRKKICLGTKPIMLRSFRSNGQSHVFAASDRP 658
Query: 599 TVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIP 658
TVIYS+NKKLLYSNVN EV+ M FNS++FPDSLA+AKEG +TIG+ID IQKLHIR++P
Sbjct: 659 TVIYSANKKLLYSNVNENEVNFMTSFNSSSFPDSLALAKEGAMTIGSIDQIQKLHIRTVP 718
Query: 659 LGEHPRRICHQEQSRTFAICSLKNQSCA--EESEMHFVRLLDDQTFEFISTYPLDTFEYG 716
LGE PRR+ HQE SR+F + + N +++ VRLLDDQTFE + + L+T E
Sbjct: 719 LGEQPRRLAHQEASRSFLVLTSPNNGATGMDDAGPDSVRLLDDQTFETLDRFGLETNEVC 778
Query: 717 CSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSL 776
C+ S SFSDD YY VGTA + EE EPTKGRILVF + GKL L+ EKE KGA Y+L
Sbjct: 779 CAAASMSFSDDPCPYYVVGTAITVAEEPEPTKGRILVFGAKGGKLSLVCEKEVKGAAYNL 838
Query: 777 NAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKS 836
+ F GKL+A IN ++QL+KW +DG+REL +EC H GH+LALY+ TRGDF++VGDLM+S
Sbjct: 839 HPFQGKLIAGINSRVQLFKWTQSEDGSRELTNECSHVGHVLALYIVTRGDFVIVGDLMRS 898
Query: 837 ISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEER 896
+ LLIY+ +EG +E RARDY +WM+AVE+LDDD YLGAEN+ N+FT+RKN++ A DE+R
Sbjct: 899 LQLLIYRADEGILEVRARDYKTHWMTAVEVLDDDTYLGAENSNNIFTLRKNTDAAADEDR 958
Query: 897 GRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLF 956
RLE VG+YHLG FVNRFRHGSLVM+LPDS+ +IPTV+F T+NG IGVIASLP +Q+ F
Sbjct: 959 NRLETVGQYHLGVFVNRFRHGSLVMKLPDSEAAKIPTVLFVTINGSIGVIASLPQQQFQF 1018
Query: 957 LEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISK 1016
L +LQ LRKVIKGVGGL+H WR+F +E + ++NF+DGDLIE FLDL R M+ +++
Sbjct: 1019 LSRLQDCLRKVIKGVGGLSHVAWRTFQDEHTKMPSQNFVDGDLIEQFLDLKRDSMERVAR 1078
Query: 1017 TM--NVSVEELCKRVEELTR 1034
M V+ E+L + VEEL+R
Sbjct: 1079 EMGEGVTSEDLLRMVEELSR 1098
>gi|428164905|gb|EKX33915.1| hypothetical protein GUITHDRAFT_158867 [Guillardia theta CCMP2712]
Length = 1092
Score = 1223 bits (3165), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 602/1046 (57%), Positives = 775/1046 (74%), Gaps = 14/1046 (1%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+LD IYGRIA +EL+ G ++ L+I TER KFC++++D+ + ELIT+AMGDV D +G
Sbjct: 51 VLDTGIYGRIAAIELYTVAGAERESLYILTERLKFCIVEYDSSTGELITKAMGDVQDSVG 110
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNK-GQLKEAFNIRLEELQVLDIKFLYG 119
RP D G I IDP+ R+IG LYDGLFKVIP D + GQL+EAFNIRLEELQVLD++FLYG
Sbjct: 111 RPVDGGPIAHIDPERRMIGFLLYDGLFKVIPIDTRNGQLREAFNIRLEELQVLDVQFLYG 170
Query: 120 CAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLI 179
A+PTIV+LYQD K+ RH+KTY+V+++DKDF+ GPWSQ ++ GA ++IPVP P+ G ++
Sbjct: 171 YAQPTIVLLYQDPKEMRHLKTYQVSIRDKDFIAGPWSQTGVEIGATMIIPVPTPIGGCIL 230
Query: 180 IGEETIVYCSANA--FKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKV 237
+GE+TI Y + + K I + ++ +A+G++D DG RYLLGDH G L++LV+ + KV
Sbjct: 231 LGEQTISYLNGDKGDTKTIHMDMTVIRAWGKIDEDGRRYLLGDHLGQLYVLVLEFDGNKV 290
Query: 238 TGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLG 297
GLK++ LGETS A TI+YLD+ VV+IGS +GDSQLI+L+ D S +EVLE + NLG
Sbjct: 291 LGLKLDTLGETSSAKTITYLDSGVVFIGSCFGDSQLIRLHPDKDENDSNIEVLESFTNLG 350
Query: 298 PIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSST 357
PI DFCVVDLERQGQGQVVTCSG KDGSLR+VRNGIGINEQA+VEL GIKG+WSLR S
Sbjct: 351 PIQDFCVVDLERQGQGQVVTCSGTLKDGSLRVVRNGIGINEQAAVELPGIKGLWSLRESI 410
Query: 358 DDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSV 417
D +D +L+ SF++ETR+L + ++EL ETEI+GF QT+FC + + + L+Q+T S+
Sbjct: 411 DAQYDKYLIQSFVNETRVLEIA-DEELSETEIDGFDHNAQTIFCSNVLGDCLLQITEVSL 469
Query: 418 RLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLE 477
RLVS+ S++L EW P G + VA N QV+L +G L+YL++ +G +TEVK +++
Sbjct: 470 RLVSTKSKQLLKEWFPPNGERITVAGGNVQQVVLTSGKRTLIYLDVSNGDVTEVKRIEMD 529
Query: 478 YEISCLDINPIGENPSY--SQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLL 535
EI+CL++NP+GE + S AVG W ++S+ + LP + ++ E +GG+ IPRS+LL
Sbjct: 530 QEIACLNLNPLGEKSDHNKSDFVAVGHW-NLSLSMLRLPSMEVVCTESIGGDAIPRSLLL 588
Query: 536 CAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAAS 595
EG+ YLLCALGDG+L F ++ T ++ +RKK+SLGT P+ L F S+ THVFAAS
Sbjct: 589 VTLEGVDYLLCALGDGYLFTFAIDASTAQIGERKKISLGTHPMILSKFMSRGATHVFAAS 648
Query: 596 DRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIR 655
DRPTVIYS+N+KLL+SNVNLKEV+ M PFNS FPDSLAIA E L IG IDDIQKLHIR
Sbjct: 649 DRPTVIYSNNRKLLFSNVNLKEVTQMAPFNSEGFPDSLAIATETSLRIGVIDDIQKLHIR 708
Query: 656 SIPLGEHPRRICHQEQSRTFAIC--SLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTF 713
++ L E PRRICHQE S+TF + S++ EE E F++L DDQTFE + TY L F
Sbjct: 709 TVYLREQPRRICHQESSKTFCVATLSIRINRDGEEVEEQFIKLFDDQTFEILDTYQLQEF 768
Query: 714 EYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAV 773
E CS+ SFSDD +YY VGTA +P+E+EP +GR+LVF V D KL L A KE KGA
Sbjct: 769 ENTCSVECASFSDDPTLYYIVGTATAVPQESEPKEGRLLVFEVIDRKLHLKASKEIKGAP 828
Query: 774 YSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDL 833
Y + FNGKLLA IN KI+L++ D G EL SEC H GHIL LY+QTRGDFIV GDL
Sbjct: 829 YQIKPFNGKLLAGINSKIELFRLSDSDTGHMELVSECCHRGHILVLYLQTRGDFIVAGDL 888
Query: 834 MKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATD 893
M+SISLL YK +G IEE ARD+NANWM+AV+ILDDD +LGAE FNLFTVRKN++ +D
Sbjct: 889 MRSISLLTYKQVDGQIEEIARDFNANWMTAVDILDDDTFLGAEGYFNLFTVRKNTDATSD 948
Query: 894 EERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQ 953
EER RLEVVGEYHLG+ VNRF+ GSLV+R SD T+IFGTVNG+IGVIA L E+
Sbjct: 949 EERARLEVVGEYHLGDMVNRFQRGSLVLR--SSDTPTTDTIIFGTVNGMIGVIAVLSKEE 1006
Query: 954 YLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKK--TVDAKNFLDGDLIESFLDLSRTRM 1011
Y FL K+Q L VIKGVGGL HE WRSF NE+ K F+DGDLIESFLDL R +M
Sbjct: 1007 YEFLLKVQDALNFVIKGVGGLRHEDWRSFENERTQGARAPKGFIDGDLIESFLDLRREKM 1066
Query: 1012 DEISKTM-NVSVEELCKRVEELTRLH 1036
+E+ + +++VEEL +++EEL RLH
Sbjct: 1067 EEVCHAIGSITVEELSRKIEELQRLH 1092
>gi|255080490|ref|XP_002503825.1| predicted protein [Micromonas sp. RCC299]
gi|226519092|gb|ACO65083.1| predicted protein [Micromonas sp. RCC299]
Length = 1114
Score = 1219 bits (3153), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 614/1067 (57%), Positives = 774/1067 (72%), Gaps = 38/1067 (3%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
M DVP+YGRIA ++L+RP GE + L++ATER FCVL +D S + TRAMGD+S+ IG
Sbjct: 52 MHDVPVYGRIAVMKLYRPAGEKRQLLYVATERLMFCVLAYDQTSGAIATRAMGDLSNTIG 111
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RP ++G IG +DP+CRLIG YDGLFKV+P D GQL+EAF++RLEEL V+D+KF++GC
Sbjct: 112 RPCEHGLIGEVDPECRLIGSQAYDGLFKVVPMDRAGQLREAFDVRLEELNVVDVKFMHGC 171
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
A PTI VLY+D K+ARHVKTYEV +K+K +GPWSQ++++ G+ L+IPVP PL G +++
Sbjct: 172 ATPTICVLYEDTKEARHVKTYEVDVKEKTLRDGPWSQSDVEGGSSLIIPVPAPLGGAIVV 231
Query: 181 GEETIVYCSANAFKAIPIRPSITK----AYGRVDADGSRYLLGDHAGLLHLLVITHEKEK 236
GE IVY + + + A+G VDADGSRYLL D G+LHLLV+ H++ +
Sbjct: 232 GESVIVYLNKDGGNGAGGAIATKSVNVMAHGVVDADGSRYLLSDSTGMLHLLVLVHDRRR 291
Query: 237 VTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQP----------DAKG-S 285
V LK+E LG+TSIAST+SYLDN VVY+GS+YGDSQL++L+ QP G +
Sbjct: 292 VHALKLESLGQTSIASTLSYLDNGVVYVGSAYGDSQLVRLHAQPVRCAADQVPATPDGLT 351
Query: 286 YVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQ 345
YVE LE + NLGPIVDF VVDL+R GQGQVVTCSG KDGSLR+VRNG+GI+E+A++EL
Sbjct: 352 YVECLESFTNLGPIVDFAVVDLDRHGQGQVVTCSGVNKDGSLRVVRNGVGIHERAAIELP 411
Query: 346 GIKGMWSLR--SSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHD 403
G+KG WSLR ++ P DT LVVSF ETRILA++ +DEL E E GF + QTL +
Sbjct: 412 GVKGCWSLRRGDASTHPSDTHLVVSFAGETRILAIDDDDELAECEFRGFSANEQTLCVCN 471
Query: 404 AIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI 463
+VQ + VRLV++ E R W P G +V+VA+AN +Q L+AT GG L L +
Sbjct: 472 VDGGFVVQCVASGVRLVNAADGEPRATWSPPGGATVSVASANRTQALVATTGGSLYSLAL 531
Query: 464 GDGILT-EVKHAQLE-YEISCLDINPIGE-NPSYSQIAAVGMWTDISVRIFSLPDLNLIT 520
G L E A L+ EI+CLD P+ + + +++ AVG WT V + ++PDL L+T
Sbjct: 532 GSAALIRETASASLDGKEIACLDCTPLSDPGDAAARLCAVGTWT-AEVFLLTMPDLRLVT 590
Query: 521 KEHLGGE---IIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQP 577
LGG +IPR+VLLC+FEG +LL LGDG L F ++ + G L D K +SLGTQP
Sbjct: 591 TSPLGGGGGGVIPRAVLLCSFEGTPHLLAGLGDGALHTFGVDPEAGTLRDGKSLSLGTQP 650
Query: 578 ITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAK 637
ITLRTF SK THVFA SDRPTVIY +N KL+YSNVNL+EV H CPFN AFPDSLA+A
Sbjct: 651 ITLRTFRSKGATHVFAGSDRPTVIYGNNGKLIYSNVNLREVLHACPFNCDAFPDSLALAS 710
Query: 638 EGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLL 697
E +LTIG IDDIQKLHIR++PLGE PRRI HQ ++RT+A + + +FVRL
Sbjct: 711 ESDLTIGGIDDIQKLHIRTVPLGEQPRRIAHQPETRTYAALTENF-----DENGYFVRLF 765
Query: 698 DDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVE 757
DD TFE + + L+ E S++SC+F+DD VYY VGT Y LPEE EPT+GRILVF E
Sbjct: 766 DDVTFETLCKFRLEPDEQDSSVISCAFADDPRVYYVVGTGYSLPEEPEPTRGRILVFRAE 825
Query: 758 DGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKW--MLRDDG----TRELQSECG 811
DGKLQL+AEKE KGAVY+LNAFNGKLLA IN K++L++ + DG T EL EC
Sbjct: 826 DGKLQLVAEKEVKGAVYNLNAFNGKLLAGINSKVELFRGGDPVGADGAGGSTYELAKECS 885
Query: 812 HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI 871
HHGHI+ALYV RG+FIVVGDLMKS+SLL YK EE IEERARDYNANWM+AV+ILDDD
Sbjct: 886 HHGHIVALYVAVRGEFIVVGDLMKSVSLLAYKPEESVIEERARDYNANWMTAVDILDDDT 945
Query: 872 YLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQI 931
YLGAENNFNLFT+R+ S+ ATDEER RLEVVGEYH+GEFVNRFR GSLVMRLPD + +
Sbjct: 946 YLGAENNFNLFTLRRQSDAATDEERSRLEVVGEYHVGEFVNRFRRGSLVMRLPDQENADV 1005
Query: 932 PTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNE---KKT 988
PT++FGTV+GVIGV+A+LP EQ+ FL LQ L K + GVGGL+H+ WRSF NE +
Sbjct: 1006 PTLLFGTVSGVIGVLATLPREQFEFLSALQAALNKTVSGVGGLSHDAWRSFQNEHRHRAK 1065
Query: 989 VDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRL 1035
A+ F+DGDLIESFLDL + E++ + +SV+EL +RVE+L RL
Sbjct: 1066 DGARGFVDGDLIESFLDLRPEKAREVAAAVKLSVDELTRRVEDLQRL 1112
>gi|303271531|ref|XP_003055127.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226463101|gb|EEH60379.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 1223
Score = 1215 bits (3144), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 634/1160 (54%), Positives = 784/1160 (67%), Gaps = 128/1160 (11%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
M+DVPIYGRIAT+EL RP E + LFI TER FCVL +DA ELITRAMGD++DR+G
Sbjct: 65 MMDVPIYGRIATMELVRPPFEKKPMLFILTERNMFCVLSYDAAKGELITRAMGDLTDRVG 124
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RP++ G IG +DP+CR+ GLHLYDGLFKVIP D GQL+EAF++RLEELQV D+KFL G
Sbjct: 125 RPSECGPIGAVDPECRMYGLHLYDGLFKVIPMDQTGQLREAFSVRLEELQVFDVKFLAGT 184
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
KPTI VLYQD K+ RH+KTYEV LKDKDF GPW+QN++++G+ LI VP PL GV+++
Sbjct: 185 PKPTIAVLYQDTKEGRHIKTYEVCLKDKDFNPGPWAQNDVESGSRFLIAVPAPLGGVVVV 244
Query: 181 GEETIVYCSANA-------------------FKAIPIRPSIT-KAYGRVDADGSRYLLGD 220
GE+ I Y + KAI ++ T YG VD DGSRYLL D
Sbjct: 245 GEKVIAYLNKETTHGVGDGGGGGGGGGGGMIVKAIAMQSDATIMTYGAVDKDGSRYLLSD 304
Query: 221 HAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQP 280
AG LHLLV+ H+K +V LK+E LG+TSIAS++SYLDN VVY+GS+YGDSQL++L+ P
Sbjct: 305 SAGRLHLLVLMHDKTRVRALKLESLGQTSIASSLSYLDNGVVYVGSAYGDSQLVRLHSTP 364
Query: 281 ------------------------------DAKGSYVEVLERYVNLGPIVDFCVVDLERQ 310
DA +YVEVLE + +LGPIVDF VVDL+R
Sbjct: 365 IPIAGGGGGDGDGGGGGGGEIVPVDSGAVTDAP-NYVEVLESFTSLGPIVDFVVVDLDRH 423
Query: 311 GQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDPFDTFLVVSFI 370
GQGQVVTCSG +KDGSLR+VRNG+GI+EQA++EL G+KG W+L+++ D DTFLVV+FI
Sbjct: 424 GQGQVVTCSGVHKDGSLRVVRNGVGIHEQAAIELPGVKGCWALKNADDAVSDTFLVVAFI 483
Query: 371 SETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNE 430
ETRILA+N +DEL+ETE EGF + L C + QVTSG +RLV + LR
Sbjct: 484 GETRILAINDKDELDETEFEGFAGDERALACANVDGGYACQVTSGGIRLVDVATGALRAR 543
Query: 431 WKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDG-----------ILTEVKHAQLEYE 479
W PG V+VA AN +QV++A GG LV + G G +L E+ + +E
Sbjct: 544 WTPEPGERVSVAAANRTQVVVALEGGTLVSVAAGGGGDAMDVDDASPLLRELARVNVGHE 603
Query: 480 ISCLDINPIGE-NPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHL-GGEIIPRSVLLCA 537
I+CLD+ P+ + + S+I AVG+W+ VR+ + L ++ L E+IPR+VLLC+
Sbjct: 604 IACLDVTPLADPRAASSEICAVGLWS-AEVRVLATATLEELSSAPLTDAEVIPRAVLLCS 662
Query: 538 FEGISYLLCALGDGHLLNF-LLNMKTGELTDRKKVSLGTQ--------------PITLRT 582
FEGI YLL LGDG L F L+ ++G + D KK+S+GTQ PITL+T
Sbjct: 663 FEGIPYLLAGLGDGQLFTFALMGERSGIIGDGKKLSVGTQARSIHWFPYDRVGVPITLKT 722
Query: 583 FSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELT 642
F +KNTTHVFA SDRPTVIYS NKKL+YSNVNL+EV HMC FN AFPDSLA+A E ELT
Sbjct: 723 FRNKNTTHVFAGSDRPTVIYSQNKKLIYSNVNLREVLHMCAFNCDAFPDSLALASESELT 782
Query: 643 IGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTF 702
IG IDDIQKLHIR++PLGE PRRI HQ SRTFA+ + E E +VRL DD TF
Sbjct: 783 IGGIDDIQKLHIRTVPLGEQPRRIAHQPASRTFAVLTSHVSDVTNE-ESFYVRLFDDVTF 841
Query: 703 EFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQ 762
E + Y LD E SI+SCSF+DD YY VGTA+ LPEE EP++GRILV ++G+L
Sbjct: 842 ETLFKYRLDVGETDSSIISCSFADDPASYYVVGTAFSLPEEVEPSRGRILVLRADEGRLS 901
Query: 763 LIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDG-------------------- 802
L+AEKE KGAVY+LNAFNGKLLA IN K+QL+KW+ R G
Sbjct: 902 LVAEKEVKGAVYNLNAFNGKLLAGINSKVQLFKWVSRGAGAGAGAGGGAEGGAVAMADGG 961
Query: 803 --------------TRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGA 848
T EL SEC HHGHI+ALYV RGDFIVVGDLMKSISLL+YK +EG
Sbjct: 962 GGGGGGGGAPAAATTCELASECSHHGHIVALYVDVRGDFIVVGDLMKSISLLVYKPDEGV 1021
Query: 849 IEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLG 908
IEERARD+N NWM+AV LDD+ YLGAEN+FNLFTVRKNS+ A DEER RL+V+GEYHLG
Sbjct: 1022 IEERARDFNPNWMTAVCALDDETYLGAENSFNLFTVRKNSDAAADEERSRLDVIGEYHLG 1081
Query: 909 EFVNRFRHGSLVMRLPDSDV-----------GQIPTVIFGTVNGVIGVIASLPHEQYLFL 957
EFVNRFR GSLVMRLP + PT +FGTVNG IGV+ASLP + FL
Sbjct: 1082 EFVNRFRAGSLVMRLPGDGDGAGLGLGLDASNEAPTQLFGTVNGAIGVVASLPESTHTFL 1141
Query: 958 EKLQTNLRKVIKGVGGLNHEQWRSFNNEKKT--VDAKNFLDGDLIESFLDLSRTRMDEIS 1015
LQ + KV+ GVGG +H+ WRSF+NE ++ V+A+ F+DGDLIESFLDL + E++
Sbjct: 1142 AALQKAMNKVVSGVGGFSHDAWRSFHNEHRSRLVEARGFVDGDLIESFLDLRPEKASEVA 1201
Query: 1016 KTMNVSVEELCKRVEELTRL 1035
+ V VEEL KR+EEL R+
Sbjct: 1202 SVVGVGVEELTKRIEELVRI 1221
>gi|307111604|gb|EFN59838.1| hypothetical protein CHLNCDRAFT_29381 [Chlorella variabilis]
Length = 1108
Score = 1204 bits (3114), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 595/1056 (56%), Positives = 758/1056 (71%), Gaps = 28/1056 (2%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+LDVP+YGR+A L+LFRP E +D L + TER KFCVL +D ES EL+TRA GD SDR+G
Sbjct: 61 VLDVPLYGRVAALQLFRPASEPRDLLLLLTERNKFCVLGFDEESGELVTRANGDASDRVG 120
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
R + GQ GI+DP CRLIGLHLYDGLFKVIP D +G L+EAFN+RL+EL+V+DI FL GC
Sbjct: 121 RQVELGQRGIVDPQCRLIGLHLYDGLFKVIPMDERGGLQEAFNMRLDELKVVDIAFLDGC 180
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
A PTI VLY+D K+ RHVKTYEV+L++K+ VEGPW Q+NLD GA +LIPVP G +++
Sbjct: 181 AAPTIAVLYEDTKEQRHVKTYEVSLREKELVEGPWQQSNLDAGASMLIPVPAS-GGAMVV 239
Query: 181 GEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTG- 239
GE + + + ++ I+P++ AYG+VD DGSR+LL D G L+LL++ E+
Sbjct: 240 GESVVTFIAPGVVRSAAIKPTL--AYGQVDDDGSRFLLSDFLGNLYLLLLLREEGAAGAS 297
Query: 240 ------------LKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQP---DAKG 284
LK+E LG T ASTI+YLD+ VV++GSS+GDSQLI+L+ P
Sbjct: 298 SSSGAGAGAVAGLKLEPLGRTPAASTIAYLDSGVVFVGSSFGDSQLIRLHSSPPDPSQPT 357
Query: 285 SYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVEL 344
S+VEVL+ NLGPIVDF VVDLERQGQGQVVTCSG DGSLRIVRNGIG+ EQA VEL
Sbjct: 358 SFVEVLDSQPNLGPIVDFAVVDLERQGQGQVVTCSGTGVDGSLRIVRNGIGVVEQALVEL 417
Query: 345 QGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDA 404
GIK +WSLR + D DT+LV++F ETR+L MN EDEL+E EI GF S TL C +
Sbjct: 418 PGIKDLWSLRKTFMDAHDTYLVLTFSGETRVLGMNAEDELDEAEIPGFNSTALTLCCANT 477
Query: 405 IYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG 464
+++QL+QVT+ + RLV +++L +W+ + +A A+ +Q++ A GG LVYLE+G
Sbjct: 478 VHDQLLQVTATAFRLVDCMTQQLVTQWEPGADGRITIAAASPTQLVAAVGGRTLVYLELG 537
Query: 465 DGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHL 524
DG + E QL+ +I+CLDI P+GE S++ AVG W + IF+LP L + +E L
Sbjct: 538 DGQIEEKGRVQLDADIACLDITPVGELSEASEVVAVGSWA-LEAHIFALPSLAPLFREKL 596
Query: 525 GGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFS 584
++IPRS L FEG +YL+ LGDG L+N+ L+ TDRKK++LGT+PI+LRTF
Sbjct: 597 PTDVIPRSTLFAQFEGDTYLMYGLGDGQLVNYRLDADGP--TDRKKIALGTKPISLRTFR 654
Query: 585 SKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIG 644
S+ VFAASDRPTVIYSSN+KLLYSN+N EVSHM FN+AAFPDSLA+ K+ L IG
Sbjct: 655 SRGAAAVFAASDRPTVIYSSNRKLLYSNLNENEVSHMTSFNTAAFPDSLALGKQEALLIG 714
Query: 645 TIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEF 704
TID+IQKLH+R++PLGE PRRI HQE SRTFA+ + E + VRL+D+QTFE
Sbjct: 715 TIDEIQKLHVRTVPLGEQPRRIAHQETSRTFAVTCTQATISGEGGDS--VRLVDEQTFEL 772
Query: 705 ISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLI 764
+ L E CS+ S DD YY VGTA+ P E EPTKGRI V GKL ++
Sbjct: 773 LDRLQLQQHELACSLCSTQLGDDPATYYVVGTAFAPPNEPEPTKGRIFVLAAAGGKLCVV 832
Query: 765 AEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTR 824
EKET+GAVYSL F G+LLA IN ++Q+YKW+ + +G R L EC H GH+LALY+ TR
Sbjct: 833 CEKETRGAVYSLAEFQGRLLAGINSRVQMYKWLEQGEGGRALVPECSHAGHVLALYLATR 892
Query: 825 GDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTV 884
GD +VVGDLMKSI LL + EEGA+E RARD++ NWMSAV +LDDD Y+GAEN++NLFTV
Sbjct: 893 GDLVVVGDLMKSIQLLAWGEEEGALELRARDFHPNWMSAVTVLDDDTYMGAENSYNLFTV 952
Query: 885 RKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIG 944
R+N++ ATDEER RLE VG YHLGEFVNRF+ GSLVMRLPDS++ QIPTV+FGT+NGVIG
Sbjct: 953 RRNADAATDEERSRLETVGRYHLGEFVNRFQPGSLVMRLPDSELSQIPTVLFGTINGVIG 1012
Query: 945 VIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEK-KTVDAKNFLDGDLIESF 1003
V+ASLPH QY LE LQ +RKV+KGVGG +H QWR+F+N+ A+ F+DGDLIE F
Sbjct: 1013 VVASLPHAQYQLLESLQEAMRKVVKGVGGFDHAQWRAFSNQHMPATPARQFVDGDLIEQF 1072
Query: 1004 LDLSRTRMDEISKTM---NVSVEELCKRVEELTRLH 1036
LDL R + + M +VE + + VEEL+RLH
Sbjct: 1073 LDLKRDSAEAVIAAMAGGGATVESVTQLVEELSRLH 1108
>gi|348681092|gb|EGZ20908.1| hypothetical protein PHYSODRAFT_259403 [Phytophthora sojae]
Length = 1137
Score = 1165 bits (3014), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 586/1085 (54%), Positives = 767/1085 (70%), Gaps = 52/1085 (4%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRP 62
DV +YGRIA E+FR E QD+LFI T+R++FCVL +DA + + +T+A G + D IGR
Sbjct: 54 DVNLYGRIAIFEVFRALNEPQDWLFIVTQRFQFCVLAYDAAAQQFVTKAHGSIRDSIGRS 113
Query: 63 TDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDN-KGQLKEAFNIRLEELQVLDIKFLYGCA 121
++ G IDP+ RLIG++LY+G FKVIP D+ KG LK+ FNIRL+EL+V+DIKFL+G
Sbjct: 114 SEIVTSGNIDPEGRLIGMNLYEGYFKVIPIDSGKGILKDTFNIRLDELRVIDIKFLHGYT 173
Query: 122 KPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIG 181
KPTI VLY+D K ARH+KTY + LK+KDF EGPWSQ+N+++GA LLIPVP P+ GVLI+
Sbjct: 174 KPTICVLYEDYKAARHIKTYHILLKEKDFAEGPWSQSNVESGASLLIPVPAPVGGVLIVS 233
Query: 182 EETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLK 241
+TIVY + + F AIP++ ++ + YG VD DGSR+LL D G L ++ + H ++VTG+
Sbjct: 234 NQTIVYHNGSTFHAIPMQSTVIQVYGAVDKDGSRFLLADQYGTLSVVALQHTGKEVTGVH 293
Query: 242 IELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVD 301
+E+LGET+IAS +SYLDN VV+IGS++GDSQLIKLN D GSY+EVL+ YVN+GPI+D
Sbjct: 294 LEVLGETNIASCLSYLDNGVVFIGSTFGDSQLIKLNADRDENGSYIEVLDTYVNVGPIID 353
Query: 302 FCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDPF 361
FCV+DL+RQGQGQ+VTCSGA KDG+LR++RNGIGINEQAS EL GIKGMW+LR +
Sbjct: 354 FCVMDLDRQGQGQIVTCSGADKDGTLRVIRNGIGINEQASAELPGIKGMWALRETFAAEH 413
Query: 362 DTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVS 421
D +L+ S++SE RILA+ EDE+EE EI F + +TL C + + +QVT VRL+S
Sbjct: 414 DKYLLQSYVSEIRILAIGDEDEMEEKEIPAF-TNVKTLLCRNMYGDVWLQVTESEVRLIS 472
Query: 422 STSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEIS 481
TS L + W P G + VA AN +QV +AT GG LVYLE+ +G +TE ++E+EI+
Sbjct: 473 CTSLSLSSTWSPPLGSRITVAAANPTQVAVATSGGVLVYLEVENGQVTEKTKVKMEHEIA 532
Query: 482 CLDINPIGENPSY-------------------SQIAAVGMWTDISVRIFSLPDLNLITKE 522
C+DI P+ + + S I VG+WT+ SV + LP L +T E
Sbjct: 533 CVDITPLARSQATDGDVAMTGSSTHWDMAALNSSICVVGLWTNFSVSVLKLPTLEKLTTE 592
Query: 523 HLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRT 582
LG +++PRSVL FEG YLL LGDG L+N+ LN+ G L RK+VSLG+QP++L T
Sbjct: 593 SLGTDLLPRSVLCNTFEGKDYLLVGLGDGSLMNYELNVAQGTLGTRKRVSLGSQPLSLST 652
Query: 583 FSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELT 642
F SKN THVFAA DRPTVIYSSN KLLYSN+N KEV+ MCPF+S +FP+ LA++ E ELT
Sbjct: 653 FRSKNMTHVFAACDRPTVIYSSNNKLLYSNINSKEVNVMCPFDSESFPECLALSSEEELT 712
Query: 643 IGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAI--CSLKNQSCAEESEMHFVRLLDDQ 700
IGT+DDIQKLHI++ L E RRI H +S T + S EE + FVRL DDQ
Sbjct: 713 IGTVDDIQKLHIQTFHLNEWARRIAHDPESHTLGVLTVSFTVDDTGEEVDQGFVRLFDDQ 772
Query: 701 TFEFISTYPLDTFEYGCSILSCSFSDDSN--VYYCVGTAYVLPEENEPTKGRILVFIVE- 757
TFE + +Y LD FE S++ C FS DS+ Y+ VGTAY+ +E EP +GRILVF V
Sbjct: 773 TFEVLHSYRLDPFETPSSVVVCPFSGDSSSGSYFVVGTAYIHEDEAEPHQGRILVFAVTG 832
Query: 758 ---DGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHG 814
+ KLQL+ EKE KGAVY LNAFNGK+LA +N K QLYKW D +EL SECGH+G
Sbjct: 833 IHGERKLQLVTEKEVKGAVYCLNAFNGKVLAGVNSKAQLYKWSENTDNEKELVSECGHYG 892
Query: 815 HILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG 874
H L LY+++RGDFIVVGDLMKS+SLL YK +G IEE A+D N+NWMSA+ I+DDD Y+G
Sbjct: 893 HTLVLYMESRGDFIVVGDLMKSVSLLSYKQLDGTIEEIAKDLNSNWMSALGIVDDDTYIG 952
Query: 875 AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSL-------------VM 921
+E +FNLFTV++NS A+DEERGRLE VGE+HLGEFVNRFR+GSL V
Sbjct: 953 SETDFNLFTVQRNSGAASDEERGRLETVGEFHLGEFVNRFRYGSLTPAAAGPTDMVDVVE 1012
Query: 922 RLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRS 981
+ P Q +++FGTV+G+IGVI L +QY FL ++Q L +V+KGVGG +H+ WR
Sbjct: 1013 QAPIVPAAQNQSMLFGTVSGMIGVILPLTKDQYSFLLRVQQALTQVVKGVGGFSHKDWRM 1072
Query: 982 FNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN----------VSVEELCKRVEE 1031
F N + +A+NF+DGDL+ESFLDL + +M ++ +N +VE+L R+EE
Sbjct: 1073 FENRRSVSEARNFIDGDLVESFLDLPKAQMTKVVDKLNSDGMLDGTDQFTVEDLTLRIEE 1132
Query: 1032 LTRLH 1036
L +LH
Sbjct: 1133 LAQLH 1137
>gi|145351726|ref|XP_001420218.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144580451|gb|ABO98511.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 1120
Score = 1161 bits (3004), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 592/1059 (55%), Positives = 772/1059 (72%), Gaps = 30/1059 (2%)
Query: 1 MLDVPIYGRIATLELFRP-HGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRI 59
+LDVPI GRIAT+ L + G+ + L++ TERY F VL +D + EL T A GDV D I
Sbjct: 66 VLDVPINGRIATMSLCQTGSGDGKARLYLTTERYGFTVLSYDEANEELKTEAFGDVQDNI 125
Query: 60 GRPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYG 119
GRP D+GQIGI+D CR IGL LYDGLFKVIP D KG +KEAFNIRLEEL+V DIKFL+G
Sbjct: 126 GRPADDGQIGIVDDTCRAIGLRLYDGLFKVIPCDEKGGVKEAFNIRLEELRVEDIKFLHG 185
Query: 120 CAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLI 179
KPTI VLY+D KDA H+KTYE+ +++K+FV PW+QN+L+ G++ +IPVP P+ GV++
Sbjct: 186 TPKPTIAVLYRDTKDAVHIKTYEIGIREKEFVSSPWAQNDLEGGSNKIIPVPAPIGGVVV 245
Query: 180 IGEETIVYCS-----ANAF-KA-----IPIRPSITKAYGRVDADGSRYLLGDHAGLLHLL 228
+G+E IVY + A+ F KA IP R +IT YG +D DGSRYLLGD G+L+LL
Sbjct: 246 LGQEIIVYLNKFEDDADVFLKAINIPNIPDRTNIT-CYGAIDPDGSRYLLGDADGMLYLL 304
Query: 229 VITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKL-----NLQPDAK 283
VI H+ ++V LKIE LG+TSIAST+SYLDN VV++GS+YGDSQLIKL ++ D
Sbjct: 305 VILHDGKRVRELKIERLGDTSIASTLSYLDNGVVFVGSTYGDSQLIKLHAEKTSIDKDGN 364
Query: 284 GSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVE 343
+YV++LE + NLGPIVDF VDLER GQGQVVTCSGA KDGSLR+VRNGIGI+EQA ++
Sbjct: 365 PTYVQILEEFTNLGPIVDFAFVDLERHGQGQVVTCSGALKDGSLRVVRNGIGIDEQAVIQ 424
Query: 344 LQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAM--NLEDELEETEIEGFCSQTQTLFC 401
L G+KG++SLR S D D +LVV+FI+ETRIL + D L+ETEI GF ++ QTL C
Sbjct: 425 LPGVKGLFSLRDSDDSQMDKYLVVTFINETRILGFVGDEGDTLDETEIAGFDAEAQTLCC 484
Query: 402 HDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYL 461
+ N +QVT VRLVS +L +EWK G + A N +Q+L+A GG L L
Sbjct: 485 GNMQGNVFLQVTHRGVRLVSRGG-DLLDEWKPKDGAEILSAKCNPTQILVAAAGGQLHCL 543
Query: 462 EIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITK 521
+ G + + E EI+CLD P+G+ S S + AVG+W+ + + + S+ DL++ITK
Sbjct: 544 NVAKGKIVLLASKTFENEIACLDCTPMGDGMS-SPVCAVGLWS-MDIVLASMSDLSVITK 601
Query: 522 EHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLR 581
E +IIPRS LLC+FE I YL LGDG L+ ++L+ TG L+ RKK+SLGT+PITL+
Sbjct: 602 ESTDEDIIPRSTLLCSFEDIPYLFVGLGDGQLITYVLDQNTGALSGRKKLSLGTKPITLQ 661
Query: 582 TFSS--KNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEG 639
TF S N + VFAASDRPTVI+S+NKKL+YSNVN++EV H+CPF+S AFPD+LA+A +
Sbjct: 662 TFKSHATNVSSVFAASDRPTVIFSNNKKLIYSNVNVQEVLHVCPFSSEAFPDALALAGDE 721
Query: 640 ELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDD 699
+LTIG IDDIQKLHIR+IPLG HPRRI HQ + TFA+ ++++ + E+ F+RL+DD
Sbjct: 722 DLTIGGIDDIQKLHIRTIPLGGHPRRIAHQVDTNTFAV-AVEHLMSKGDQEL-FIRLIDD 779
Query: 700 QTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDG 759
+F+ + + L+ E S++SCSF+ DS YY VGT + +E+EP++GRILV VE
Sbjct: 780 GSFDTLHQFRLEEHELASSLMSCSFAGDSREYYVVGTGFAYEQEDEPSRGRILVLRVEAD 839
Query: 760 KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILAL 819
L+L++EKE +GAVY+LNAF GKLLA IN K++L+KW R+D EL SEC HHG I+
Sbjct: 840 ALELVSEKEVRGAVYNLNAFKGKLLAGINSKLELFKWTPREDDAHELVSECSHHGQIITF 899
Query: 820 YVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDD-IYLGAENN 878
V+TRGD+I+VGDL+KS+SLL YK EEGAI+E ARD+NANWM+AV +LDDD YLGAEN+
Sbjct: 900 SVKTRGDWILVGDLLKSMSLLQYKPEEGAIDEIARDFNANWMTAVAMLDDDETYLGAENS 959
Query: 879 FNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGT 938
NLFTV +N TDEER RLE+ GEYHLGEFVN F GSLVM L D D ++PT++FGT
Sbjct: 960 LNLFTVARNMNAMTDEERSRLEITGEYHLGEFVNVFSPGSLVMSLKDGDSLEVPTLLFGT 1019
Query: 939 VNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNN--EKKTVDAKNFLD 996
NGVIGV+ASLP + Y F E+LQT++ K I+GVGGL H +WRSF + +K+ ++NF+D
Sbjct: 1020 GNGVIGVLASLPKDAYDFAERLQTSMNKHIQGVGGLKHAEWRSFRHTLRRKSDPSRNFVD 1079
Query: 997 GDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRL 1035
GDL+ESFLDL + D ++ M E+ +RVEEL RL
Sbjct: 1080 GDLVESFLDLKVEQADVVAADMKCDRAEIIRRVEELQRL 1118
>gi|413081953|ref|NP_741992.2| DNA damage-binding protein 1 [Rattus norvegicus]
gi|293344614|ref|XP_002725831.1| PREDICTED: DNA damage-binding protein 1 [Rattus norvegicus]
gi|293356422|ref|XP_002728912.1| PREDICTED: DNA damage-binding protein 1 [Rattus norvegicus]
gi|149062405|gb|EDM12828.1| damage-specific DNA binding protein 1 [Rattus norvegicus]
Length = 1140
Score = 1152 bits (2980), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 580/1094 (53%), Positives = 759/1094 (69%), Gaps = 67/1094 (6%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
+V +YG+IA +ELFRP GE++D LFI T +Y C+L++ ES ++ITRA G+V DRIG
Sbjct: 54 EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RP++ G IGIIDP+CR+IGL LYDGLFKVIP D + +AFNIRLEEL V+D+KFLYGC
Sbjct: 114 RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 173
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PTI +YQD + RHVKTYEV+L++K+F +GPW Q N++ A ++I VP P G +II
Sbjct: 174 QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
G+E+I Y + + + AI I+ S + RVD +GSRYLLGD G L +L++ E++
Sbjct: 233 GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 292
Query: 236 ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
+ L++ELLGETSIA ++YLDN VV++GS GDSQL+KLN+ + +GSYV +E
Sbjct: 293 GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 352
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
+ NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W
Sbjct: 353 FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
LRS D LV+SF+ +TR+L +N E E+EETE+ GF QT FC + + QL+Q+
Sbjct: 413 LRSDPSRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQI 471
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
TS SVRLVS + L +EWK P G +++VA+ N+SQV++A G L YL+I L ++
Sbjct: 472 TSASVRLVSQEPKALVSEWKEPQGKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 530
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
H ++E+E++CLDI P+GE+ S + A+G+WTDIS RI LP L+ KE LGGEIIPRS
Sbjct: 531 HTEMEHEVACLDITPLGESNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 590
Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
+L+ FE YLLCALGDG L F LN++TG L+DRKKV+LGTQP LRTF S +TT+VF
Sbjct: 591 ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
A SDRPTVIYSSN KL++SNVNLKEV++MCP NS +PDSLA+A LTIGTID+IQKL
Sbjct: 651 ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 710
Query: 653 HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
HIR++PL E PR+IC+QE S+ F + S +
Sbjct: 711 HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGTTALRPSASTQALSSSVSSSKL 770
Query: 682 --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
S EE E+H + ++D TFE + + EY S++SC D N Y+
Sbjct: 771 FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 830
Query: 734 VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
VGTA V PEE EP +GRI+VF DGKLQ +AEKE KGAVYS+ FNGKLLA+IN ++L
Sbjct: 831 VGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 890
Query: 794 YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
Y+W +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK EG EE A
Sbjct: 891 YEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 946
Query: 854 RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
RD+N NWMSAVEILDDD +LGAEN FNLF +K+S TDEER L+ VG +HLGEFVN
Sbjct: 947 RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 1006
Query: 914 FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
F HGSLVM+ L ++ +V+FGTVNG+IG++ SL Y L +Q L KVIK VG
Sbjct: 1007 FCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066
Query: 973 GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN----------VSV 1022
+ H WRSF+ E+KT A F+DGDLIESFLD+SR +M E+ + +
Sbjct: 1067 KIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATA 1126
Query: 1023 EELCKRVEELTRLH 1036
++L K VEELTR+H
Sbjct: 1127 DDLIKVVEELTRIH 1140
>gi|149725200|ref|XP_001502072.1| PREDICTED: DNA damage-binding protein 1 [Equus caballus]
Length = 1140
Score = 1151 bits (2978), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 579/1094 (52%), Positives = 760/1094 (69%), Gaps = 67/1094 (6%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
+V +YG+IA +ELFRP GE++D LFI T +Y C+L++ ES ++ITRA G+V DRIG
Sbjct: 54 EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RP++ G IGIIDP+CR+IGL LYDGLFKVIP D + +AFNIRLEEL V+D+KFLYGC
Sbjct: 114 RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 173
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PTI +YQD + RHVKTYEV+L++K+F +GPW Q N++ A ++I VP P G +II
Sbjct: 174 QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
G+E+I Y + + + AI I+ S + RVD +GSRYLLGD G L +L++ E++
Sbjct: 233 GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 292
Query: 236 ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
+ L++ELLGETSIA ++YLDN VV++GS GDSQL+KLN+ + +GSYV +E
Sbjct: 293 GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 352
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
+ NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W
Sbjct: 353 FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
LRS + D LV+SF+ +TR+L +N E E+EETE+ GF QT FC + + QL+Q+
Sbjct: 413 LRSDPNRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQI 471
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
TS SVRLVS + L +EWK P G +++VA+ N+SQV++A G L YL+I L ++
Sbjct: 472 TSASVRLVSQEPKALVSEWKEPQGKNISVASCNSSQVVVAVGRA-LYYLQIHAQELRQIS 530
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
H ++E+E++CLDI P+G++ S + A+G+WTDIS RI LP L+ KE LGGEIIPRS
Sbjct: 531 HTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 590
Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
+L+ FE YLLCALGDG L F LN++TG L+DRKKV+LGTQP LRTF S +TT+VF
Sbjct: 591 ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
A SDRPTVIYSSN KL++SNVNLKEV++MCP NS +PDSLA+A LTIGTID+IQKL
Sbjct: 651 ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 710
Query: 653 HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
HIR++PL E PR+IC+QE S+ F + S +
Sbjct: 711 HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDASGGTTALRPSASTQALSSSVSSSKL 770
Query: 682 --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
S EE E+H + ++D TFE + + EY S++SC D N Y+
Sbjct: 771 FSSSTATHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 830
Query: 734 VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
VGTA V PEE EP +GRI+VF DGKLQ +AEKE KGAVYS+ FNGKLLA+IN ++L
Sbjct: 831 VGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 890
Query: 794 YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
Y+W +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK EG EE A
Sbjct: 891 YEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 946
Query: 854 RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
RD+N NWMSAVEILDDD +LGAEN FNLF +K+S TDEER L+ VG +HLGEFVN
Sbjct: 947 RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 1006
Query: 914 FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
F HGSLVM+ L ++ +V+FGTVNG+IG++ SL Y L +Q L KVIK VG
Sbjct: 1007 FCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066
Query: 973 GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN----------VSV 1022
+ H WRSF+ E+KT A F+DGDLIESFLD+SR +M E+ + +
Sbjct: 1067 KIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATA 1126
Query: 1023 EELCKRVEELTRLH 1036
++L K VEELTR+H
Sbjct: 1127 DDLIKVVEELTRIH 1140
>gi|344295432|ref|XP_003419416.1| PREDICTED: DNA damage-binding protein 1 [Loxodonta africana]
Length = 1140
Score = 1151 bits (2978), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 580/1094 (53%), Positives = 759/1094 (69%), Gaps = 67/1094 (6%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
+V +YG+IA +ELFRP GE++D LFI T +Y C+L++ ES ++ITRA G+V DRIG
Sbjct: 54 EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RP++ G IGIIDP+CR+IGL LYDGLFKVIP D + +AFNIRLEEL V+D+KFLYGC
Sbjct: 114 RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 173
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PTI +YQD + RHVKTYEV+L++K+F +GPW Q N++ A ++I VP P G +II
Sbjct: 174 QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
G+E+I Y + + + AI I+ S + RVD +GSRYLLGD G L +L++ E++
Sbjct: 233 GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 292
Query: 236 ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
+ L++ELLGETSIA ++YLDN VV++GS GDSQL+KLN+ + +GSYV +E
Sbjct: 293 GTVSLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 352
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
+ NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W
Sbjct: 353 FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
LRS + D LV+SF+ +TR+L +N E E+EETE+ GF QT FC + QL+Q+
Sbjct: 413 LRSDPNRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAQQQLIQI 471
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
TS SVRLVS + L +EWK P G +V+VA+ N+SQV++A G L YL+I L ++
Sbjct: 472 TSASVRLVSQEPKALVSEWKEPQGKNVSVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 530
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
H ++E+E++CLDI P+G++ S + A+G+WTDIS RI LP L+ KE LGGEIIPRS
Sbjct: 531 HTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 590
Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
+L+ FE YLLCALGDG L F LN++TG L+DRKKV+LGTQP LRTF S +TT+VF
Sbjct: 591 ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
A SDRPTVIYSSN KL++SNVNLKEV++MCP NS +PDSLA+A LTIGTID+IQKL
Sbjct: 651 ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 710
Query: 653 HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
HIR++PL E PR+IC+QE S+ F + S +
Sbjct: 711 HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGTTALRPSASTQALSSSVSSSKL 770
Query: 682 --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
S EE E+H + ++D TFE + + EY S++SC D N Y+
Sbjct: 771 FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 830
Query: 734 VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
VGTA V PEE EP +GRI+VF DGKLQ +AEKE KGAVYS+ FNGKLLA+IN ++L
Sbjct: 831 VGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 890
Query: 794 YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
Y+W +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK EG EE A
Sbjct: 891 YEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 946
Query: 854 RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
RD+N NWMSAVEILDDD +LGAEN FNLF +K+S TDEER L+ VG +HLGEFVN
Sbjct: 947 RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 1006
Query: 914 FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
F HGSLVM+ L ++ +V+FGTVNG+IG++ SL Y L +Q L KVIK VG
Sbjct: 1007 FCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066
Query: 973 GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN----------VSV 1022
+ H WRSF+ E+KT A F+DGDLIESFLD+SR +M E+ + +
Sbjct: 1067 KIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATA 1126
Query: 1023 EELCKRVEELTRLH 1036
++L K VEELTR+H
Sbjct: 1127 DDLIKVVEELTRIH 1140
>gi|348526664|ref|XP_003450839.1| PREDICTED: DNA damage-binding protein 1-like [Oreochromis niloticus]
Length = 1140
Score = 1151 bits (2977), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 580/1094 (53%), Positives = 759/1094 (69%), Gaps = 67/1094 (6%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWD--AESSELITRAMGDVSDRIG 60
+V +YGRIA +ELFRP GE++D LFI T +Y C+L++ ES ++ITRA G+V DRIG
Sbjct: 54 EVGMYGRIAVMELFRPKGESKDLLFILTSKYNACILEYKQTGESIDIITRAHGNVQDRIG 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RP++ G IGI+DP+CR+IGL LYDGLFKVIP D + +AFNIRLEELQV+D+ FLYGC
Sbjct: 114 RPSETGIIGIVDPECRMIGLRLYDGLFKVIPLDRDNRELKAFNIRLEELQVIDVHFLYGC 173
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PT+ +YQD + RHVKTYEV+L++K+F +GPW Q N++ A ++IPVP P G +II
Sbjct: 174 QAPTVCFIYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIPVPEPFGGAIII 232
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEK--- 234
G+E+I Y + + + AI I+ S + RVD +GSRYLLGD G L +L++ E+
Sbjct: 233 GQESITYHNGDKYLAIAPPTIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEELMD 292
Query: 235 --EKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
+ L +ELLGETSIA ++YLDN VV++GS GDSQL+KLN+ + +GSYV V+E
Sbjct: 293 GTVALKDLHVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVAVMET 352
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
+ NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W
Sbjct: 353 FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
LRS D LV+SF+ +TR+L ++ E E+EETE+ GF QT +C + + QL+Q+
Sbjct: 413 LRSEAGRETDDMLVLSFVGQTRVLMLSGE-EVEETELPGFVDNQQTFYCGNVAHQQLIQI 471
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
TSGSVRLV S+ L +EWK P G +++VA N +QV+LA G L YL+I G L ++
Sbjct: 472 TSGSVRLVLQDSKTLVSEWKEPQGRNISVAACNHTQVVLAVGRA-LYYLQILAGELKQIS 530
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
++E+E++CLDI P+GE S + AVG+WTDIS R+ LP + KE LGGEIIPRS
Sbjct: 531 TTEMEHEVACLDITPLGEGGGESPLCAVGLWTDISARVLKLPCFTALHKEMLGGEIIPRS 590
Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
+L+ FEG YLLCALGDG L F L+++TG L++RKKV+LGTQP LRTF S +T++VF
Sbjct: 591 ILMTTFEGSYYLLCALGDGALFYFGLDLQTGALSERKKVTLGTQPTVLRTFRSLSTSNVF 650
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
A SDRPTVIYSSN KL++SNVNLKEV++MCP NS +PDSLA+A LTIGTID+IQKL
Sbjct: 651 ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSEGYPDSLALANNSTLTIGTIDEIQKL 710
Query: 653 HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
HIR++PL E PRRIC+QE S+ F + S +
Sbjct: 711 HIRTVPLYESPRRICYQEVSQCFGVLSSRVEIQDVSGTTSAVRPSASTQALSSSVSSSKL 770
Query: 682 --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
S EE E+H + ++D TFE + + EY S++SC D +VY+
Sbjct: 771 FPSSTSPHETSFGEEVEVHNLLVVDQHTFEVLHAHQFLPSEYALSLVSCRLGKDPSVYFI 830
Query: 734 VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
VGTA V PEE EP +GRI+VF DGKLQ +AEKE KGAVYS+ FNGK LA+IN ++L
Sbjct: 831 VGTAMVYPEEAEPKQGRIIVFHYTDGKLQTVAEKEVKGAVYSMVEFNGKFLASINSTVRL 890
Query: 794 YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
Y+W +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK EG EE A
Sbjct: 891 YEWT----AEKELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKSMEGNFEEIA 946
Query: 854 RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
RD+N NWMSAVEILDDD +LGAEN FNLF +K+S TDEER L+ VG +HLGEFVN
Sbjct: 947 RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 1006
Query: 914 FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
F HGSLV++ L +S +V+FGTVNG+IG++ SL Y L LQ L KVIK VG
Sbjct: 1007 FCHGSLVLQNLGESSTPTQGSVLFGTVNGMIGLVTSLSEGWYSLLLDLQNRLNKVIKSVG 1066
Query: 973 GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV----------SV 1022
+ H WRSF+ E+KT A F+DGDLIESFLDL R +M E+ T+ + +V
Sbjct: 1067 KIEHSFWRSFHTERKTEQATGFIDGDLIESFLDLGRAKMQEVVSTLQIDDGSGMKREATV 1126
Query: 1023 EELCKRVEELTRLH 1036
+E+ K VEELTR+H
Sbjct: 1127 DEVIKIVEELTRIH 1140
>gi|73983859|ref|XP_533275.2| PREDICTED: DNA damage-binding protein 1 [Canis lupus familiaris]
gi|291409601|ref|XP_002721069.1| PREDICTED: damage-specific DNA binding protein 1 [Oryctolagus
cuniculus]
gi|301781686|ref|XP_002926259.1| PREDICTED: DNA damage-binding protein 1-like [Ailuropoda melanoleuca]
Length = 1140
Score = 1150 bits (2976), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 579/1094 (52%), Positives = 760/1094 (69%), Gaps = 67/1094 (6%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
+V +YG+IA +ELFRP GE++D LFI T +Y C+L++ ES ++ITRA G+V DRIG
Sbjct: 54 EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RP++ G IGIIDP+CR+IGL LYDGLFKVIP D + +AFNIRLEEL V+D+KFLYGC
Sbjct: 114 RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 173
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PTI +YQD + RHVKTYEV+L++K+F +GPW Q N++ A ++I VP P G +II
Sbjct: 174 QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
G+E+I Y + + + AI I+ S + RVD +GSRYLLGD G L +L++ E++
Sbjct: 233 GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 292
Query: 236 ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
+ L++ELLGETSIA ++YLDN VV++GS GDSQL+KLN+ + +GSYV +E
Sbjct: 293 GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 352
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
+ NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W
Sbjct: 353 FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
LRS + D LV+SF+ +TR+L +N E E+EETE+ GF QT FC + + QL+Q+
Sbjct: 413 LRSDPNRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQI 471
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
TS SVRLVS + L +EWK P G +++VA+ N+SQV++A G L YL+I L ++
Sbjct: 472 TSASVRLVSQEPKALVSEWKEPQGKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 530
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
H ++E+E++CLDI P+G++ S + A+G+WTDIS RI LP L+ KE LGGEIIPRS
Sbjct: 531 HTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 590
Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
+L+ FE YLLCALGDG L F LN++TG L+DRKKV+LGTQP LRTF S +TT+VF
Sbjct: 591 ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
A SDRPTVIYSSN KL++SNVNLKEV++MCP NS +PDSLA+A LTIGTID+IQKL
Sbjct: 651 ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 710
Query: 653 HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
HIR++PL E PR+IC+QE S+ F + S +
Sbjct: 711 HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGTTALRPSASTQALSSSVSSSKL 770
Query: 682 --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
S EE E+H + ++D TFE + + EY S++SC D N Y+
Sbjct: 771 FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 830
Query: 734 VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
VGTA V PEE EP +GRI+VF DGKLQ +AEKE KGAVYS+ FNGKLLA+IN ++L
Sbjct: 831 VGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 890
Query: 794 YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
Y+W +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK EG EE A
Sbjct: 891 YEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 946
Query: 854 RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
RD+N NWMSAVEILDDD +LGAEN FNLF +K+S TDEER L+ VG +HLGEFVN
Sbjct: 947 RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 1006
Query: 914 FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
F HGSLVM+ L ++ +V+FGTVNG+IG++ SL Y L +Q L KVIK VG
Sbjct: 1007 FCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066
Query: 973 GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN----------VSV 1022
+ H WRSF+ E+KT A F+DGDLIESFLD+SR +M E+ + +
Sbjct: 1067 KIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATA 1126
Query: 1023 EELCKRVEELTRLH 1036
++L K VEELTR+H
Sbjct: 1127 DDLIKVVEELTRIH 1140
>gi|385865228|gb|AFI92852.1| DNA damage-binding protein 1 [Danio rerio]
Length = 1140
Score = 1150 bits (2976), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 580/1096 (52%), Positives = 768/1096 (70%), Gaps = 71/1096 (6%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
+V +YG+IA +ELFRP GE++D LFI T +Y C+L++ +S ++ITRA G+V DRIG
Sbjct: 54 EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGDSIDIITRAHGNVQDRIG 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RP++ G IGI+DP+CR+IGL LYDGLFKVIP D + + +AFNIRLEELQV+D++FLYGC
Sbjct: 114 RPSETGIIGIVDPECRMIGLRLYDGLFKVIPLDRENRELKAFNIRLEELQVIDVQFLYGC 173
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PT+ +YQD + RHVKTYEV+L++K+F +GPW Q N++ A ++IPVP P G +II
Sbjct: 174 QAPTVCFIYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIPVPEPFGGAIII 232
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKV 237
G+E+I Y + + + A+ I+ S + RVD +GSRYLLGD G L +L++ EKE++
Sbjct: 233 GQESITYHNGDKYLAVAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLL--EKEEL 290
Query: 238 TG-------LKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVL 290
L +ELLGETSIA ++YLDN VV++GS GDSQL+KLN+ + +GSYV V+
Sbjct: 291 MDGAVVLKDLHVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNDQGSYVGVM 350
Query: 291 ERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGM 350
E + NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+
Sbjct: 351 ETFTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGL 410
Query: 351 WSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLV 410
W LRS + D LV+SF+ +TR+L ++ E E+EETE++GF QT FC + + QL+
Sbjct: 411 WPLRSESSRDTDDMLVLSFVGQTRVLMLSGE-EVEETELQGFVDNQQTFFCGNVAHQQLI 469
Query: 411 QVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTE 470
Q+TS SVRLV+ S+ L +EWK P G +++VA+ N +QV+LA G L YL+I G L +
Sbjct: 470 QITSVSVRLVTQDSKALVSEWKEPQGRNISVASCNNTQVVLAVGR-VLYYLQILSGELKQ 528
Query: 471 VKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIP 530
+ ++E+E++CLDI P+GE + S I AVG+WTDIS R+ LP + KE LGGEIIP
Sbjct: 529 ISSTEMEHEVACLDITPLGERTADSCICAVGLWTDISARLLKLPCFTPLHKEMLGGEIIP 588
Query: 531 RSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTH 590
RS+L+ FEG YLLCALGDG L F L+++TG L++RKKV+LGTQP LRTF S +T++
Sbjct: 589 RSILMTTFEGSHYLLCALGDGALFYFGLDIQTGVLSERKKVTLGTQPTVLRTFRSLSTSN 648
Query: 591 VFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQ 650
VFA SDRPTVIYSSN KL++SNVNLKEV++MCP NS +PDSLA+A LTIGTID+IQ
Sbjct: 649 VFACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSEGYPDSLALANNSTLTIGTIDEIQ 708
Query: 651 KLHIRSIPLGEHPRRICHQEQSRTFAICSLK----------------------------- 681
KLHIR++PL E P+RIC+QE S+ F + S +
Sbjct: 709 KLHIRTVPLYESPKRICYQEVSQCFGVLSSRVEMQDASGTTAAVRPSASTQALSSSVSSS 768
Query: 682 ----------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVY 731
S EE E+H + ++D TFE + + EY S++SC D VY
Sbjct: 769 KLFPSSTSPHETSFGEEVEVHSLLVVDQHTFEVLHAHQFLQNEYALSMVSCKLGRDPAVY 828
Query: 732 YCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKI 791
+ VGTA V PEE EP +GRI+VF DGKLQ +AEKE KGAVYS+ FNGKLLA+IN +
Sbjct: 829 FIVGTAMVYPEEAEPKQGRIIVFHYTDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTV 888
Query: 792 QLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEE 851
+LY+W +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK EG+ EE
Sbjct: 889 RLYEWT----AEKELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGSFEE 944
Query: 852 RARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFV 911
ARD+N NWMSAVEILDDD +LGAEN FNLF +K+S TDEER L+ VG +HLGEFV
Sbjct: 945 IARDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFV 1004
Query: 912 NRFRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKG 970
N F HGSLV++ L +S +V+FGTVNG+IG++ SL Y L LQ L KVIK
Sbjct: 1005 NVFSHGSLVLQNLGESSTPTQGSVLFGTVNGMIGLVTSLSEGWYSLLLDLQNRLNKVIKS 1064
Query: 971 VGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV---------- 1020
VG + H WRSF+ E+KT A F+DGDLIESFLDL R +M E+ T+ +
Sbjct: 1065 VGKIEHSFWRSFHTERKTEQATGFIDGDLIESFLDLGRAKMQEVVSTLQIDDGSGMKREA 1124
Query: 1021 SVEELCKRVEELTRLH 1036
+V+E+ K VEELTR+H
Sbjct: 1125 TVDEVIKIVEELTRIH 1140
>gi|7657011|ref|NP_056550.1| DNA damage-binding protein 1 [Mus musculus]
gi|134034087|sp|Q3U1J4.2|DDB1_MOUSE RecName: Full=DNA damage-binding protein 1; AltName: Full=DDB p127
subunit; AltName: Full=Damage-specific DNA-binding
protein 1; AltName: Full=UV-damaged DNA-binding factor
gi|5931596|dbj|BAA84699.1| XPE UV-damaged DNA binding factor [Mus musculus]
gi|16307148|gb|AAH09661.1| Damage specific DNA binding protein 1 [Mus musculus]
gi|74182145|dbj|BAE34102.1| unnamed protein product [Mus musculus]
gi|74196166|dbj|BAE32993.1| unnamed protein product [Mus musculus]
Length = 1140
Score = 1150 bits (2975), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 579/1094 (52%), Positives = 759/1094 (69%), Gaps = 67/1094 (6%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
+V +YG+IA +ELFRP GE++D LFI T +Y C+L++ ES ++ITRA G+V DRIG
Sbjct: 54 EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RP++ G IGIIDP+CR+IGL LYDGLFKVIP D + +AFNIRLEEL V+D+KFLYGC
Sbjct: 114 RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 173
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PTI +YQD + RHVKTYEV+L++K+F +GPW Q N++ A ++I VP P G +II
Sbjct: 174 QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
G+E+I Y + + + AI I+ S + RVD +GSRYLLGD G L +L++ E++
Sbjct: 233 GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 292
Query: 236 ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
+ L++ELLGETSIA ++YLDN VV++GS GDSQL+KLN+ + +GSYV +E
Sbjct: 293 GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 352
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
+ NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W
Sbjct: 353 FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
LRS D LV+SF+ +TR+L +N E E+EETE+ GF QT FC + + QL+Q+
Sbjct: 413 LRSDPGRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQI 471
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
TS SVRLVS + L +EWK P G +++VA+ N+SQV++A G L YL+I L ++
Sbjct: 472 TSASVRLVSQEPKALVSEWKEPQGKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 530
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
H ++E+E++CLDI P+G++ S + A+G+WTDIS RI LP L+ KE LGGEIIPRS
Sbjct: 531 HTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 590
Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
+L+ FE YLLCALGDG L F LN++TG L+DRKKV+LGTQP LRTF S +TT+VF
Sbjct: 591 ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
A SDRPTVIYSSN KL++SNVNLKEV++MCP NS +PDSLA+A LTIGTID+IQKL
Sbjct: 651 ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 710
Query: 653 HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
HIR++PL E PR+IC+QE S+ F + S +
Sbjct: 711 HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDSSGGTTALRPSASTQALSSSVSSSKL 770
Query: 682 --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
S EE E+H + ++D TFE + + EY S++SC D N Y+
Sbjct: 771 FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 830
Query: 734 VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
VGTA V PEE EP +GRI+VF DGKLQ +AEKE KGAVYS+ FNGKLLA+IN ++L
Sbjct: 831 VGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 890
Query: 794 YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
Y+W +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK EG EE A
Sbjct: 891 YEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 946
Query: 854 RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
RD+N NWMSAVEILDDD +LGAEN FNLF +K+S TDEER L+ VG +HLGEFVN
Sbjct: 947 RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 1006
Query: 914 FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
F HGSLVM+ L ++ +V+FGTVNG+IG++ SL Y L +Q L KVIK VG
Sbjct: 1007 FCHGSLVMQNLGEASTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066
Query: 973 GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN----------VSV 1022
+ H WRSF+ E+KT A F+DGDLIESFLD+SR +M E+ + +
Sbjct: 1067 KIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATA 1126
Query: 1023 EELCKRVEELTRLH 1036
++L K VEELTR+H
Sbjct: 1127 DDLIKVVEELTRIH 1140
>gi|354504619|ref|XP_003514371.1| PREDICTED: DNA damage-binding protein 1-like [Cricetulus griseus]
gi|344258340|gb|EGW14444.1| DNA damage-binding protein 1 [Cricetulus griseus]
Length = 1140
Score = 1150 bits (2975), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 579/1094 (52%), Positives = 759/1094 (69%), Gaps = 67/1094 (6%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
+V +YG+IA +ELFRP GE++D LFI T +Y C+L++ ES ++ITRA G+V DRIG
Sbjct: 54 EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RP++ G IGIIDP+CR+IGL LYDGLFKVIP D + +AFNIRLEEL V+D+KFLYGC
Sbjct: 114 RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 173
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PTI +YQD + RHVKTYEV+L++K+F +GPW Q N++ A ++I VP P G +II
Sbjct: 174 QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
G+E+I Y + + + AI I+ S + RVD +GSRYLLGD G L +L++ E++
Sbjct: 233 GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 292
Query: 236 ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
+ L++ELLGETSIA ++YLDN VV++GS GDSQL+KLN+ + +GSYV +E
Sbjct: 293 GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 352
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
+ NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W
Sbjct: 353 FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
LRS D LV+SF+ +TR+L +N E E+EETE+ GF QT FC + + QL+Q+
Sbjct: 413 LRSDPSRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQI 471
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
TS SVRLVS + L +EWK P G +++VA+ N+SQV++A G L YL+I L ++
Sbjct: 472 TSASVRLVSQEPKALVSEWKEPQGKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 530
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
H ++E+E++CLDI P+G++ S + A+G+WTDIS RI LP L+ KE LGGEIIPRS
Sbjct: 531 HTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 590
Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
+L+ FE YLLCALGDG L F LN++TG L+DRKKV+LGTQP LRTF S +TT+VF
Sbjct: 591 ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
A SDRPTVIYSSN KL++SNVNLKEV++MCP NS +PDSLA+A LTIGTID+IQKL
Sbjct: 651 ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 710
Query: 653 HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
HIR++PL E PR+IC+QE S+ F + S +
Sbjct: 711 HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGTTALRPSASTQALSSSVSSSKL 770
Query: 682 --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
S EE E+H + ++D TFE + + EY S++SC D N Y+
Sbjct: 771 FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 830
Query: 734 VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
VGTA V PEE EP +GRI+VF DGKLQ +AEKE KGAVYS+ FNGKLLA+IN ++L
Sbjct: 831 VGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 890
Query: 794 YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
Y+W +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK EG EE A
Sbjct: 891 YEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 946
Query: 854 RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
RD+N NWMSAVEILDDD +LGAEN FNLF +K+S TDEER L+ VG +HLGEFVN
Sbjct: 947 RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 1006
Query: 914 FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
F HGSLVM+ L ++ +V+FGTVNG+IG++ SL Y L +Q L KVIK VG
Sbjct: 1007 FCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066
Query: 973 GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN----------VSV 1022
+ H WRSF+ E+KT A F+DGDLIESFLD+SR +M E+ + +
Sbjct: 1067 KIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATA 1126
Query: 1023 EELCKRVEELTRLH 1036
++L K VEELTR+H
Sbjct: 1127 DDLIKVVEELTRIH 1140
>gi|410974071|ref|XP_003993471.1| PREDICTED: DNA damage-binding protein 1 [Felis catus]
Length = 1193
Score = 1150 bits (2975), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 579/1094 (52%), Positives = 760/1094 (69%), Gaps = 67/1094 (6%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
+V +YG+IA +ELFRP GE++D LFI T +Y C+L++ ES ++ITRA G+V DRIG
Sbjct: 107 EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 166
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RP++ G IGIIDP+CR+IGL LYDGLFKVIP D + +AFNIRLEEL V+D+KFLYGC
Sbjct: 167 RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 226
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PTI +YQD + RHVKTYEV+L++K+F +GPW Q N++ A ++I VP P G +II
Sbjct: 227 QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 285
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
G+E+I Y + + + AI I+ S + RVD +GSRYLLGD G L +L++ E++
Sbjct: 286 GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 345
Query: 236 ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
+ L++ELLGETSIA ++YLDN VV++GS GDSQL+KLN+ + +GSYV +E
Sbjct: 346 GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 405
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
+ NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W
Sbjct: 406 FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 465
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
LRS + D LV+SF+ +TR+L +N E E+EETE+ GF QT FC + + QL+Q+
Sbjct: 466 LRSDPNRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQI 524
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
TS SVRLVS + L +EWK P G +++VA+ N+SQV++A G L YL+I L ++
Sbjct: 525 TSASVRLVSQEPKALVSEWKEPQGKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 583
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
H ++E+E++CLDI P+G++ S + A+G+WTDIS RI LP L+ KE LGGEIIPRS
Sbjct: 584 HTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 643
Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
+L+ FE YLLCALGDG L F LN++TG L+DRKKV+LGTQP LRTF S +TT+VF
Sbjct: 644 ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 703
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
A SDRPTVIYSSN KL++SNVNLKEV++MCP NS +PDSLA+A LTIGTID+IQKL
Sbjct: 704 ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 763
Query: 653 HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
HIR++PL E PR+IC+QE S+ F + S +
Sbjct: 764 HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGTTALRPSASTQALSSSVSSSKL 823
Query: 682 --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
S EE E+H + ++D TFE + + EY S++SC D N Y+
Sbjct: 824 FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 883
Query: 734 VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
VGTA V PEE EP +GRI+VF DGKLQ +AEKE KGAVYS+ FNGKLLA+IN ++L
Sbjct: 884 VGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 943
Query: 794 YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
Y+W +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK EG EE A
Sbjct: 944 YEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 999
Query: 854 RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
RD+N NWMSAVEILDDD +LGAEN FNLF +K+S TDEER L+ VG +HLGEFVN
Sbjct: 1000 RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 1059
Query: 914 FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
F HGSLVM+ L ++ +V+FGTVNG+IG++ SL Y L +Q L KVIK VG
Sbjct: 1060 FCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1119
Query: 973 GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN----------VSV 1022
+ H WRSF+ E+KT A F+DGDLIESFLD+SR +M E+ + +
Sbjct: 1120 KIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATA 1179
Query: 1023 EELCKRVEELTRLH 1036
++L K VEELTR+H
Sbjct: 1180 DDLIKVVEELTRIH 1193
>gi|418316|sp|P33194.1|DDB1_CERAE RecName: Full=DNA damage-binding protein 1; AltName: Full=DDB p127
subunit; AltName: Full=DDBa; AltName:
Full=Damage-specific DNA-binding protein 1; AltName:
Full=UV-damaged DNA-binding protein 1; Short=UV-DDB 1
gi|304026|gb|AAA03021.1| UV-damaged DNA-binding protein [Chlorocebus aethiops]
Length = 1140
Score = 1150 bits (2974), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 578/1094 (52%), Positives = 759/1094 (69%), Gaps = 67/1094 (6%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
+V +YG+IA +ELFRP GE++D LFI T +Y C+L++ ES ++ITRA G+V DRIG
Sbjct: 54 EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RP++ G IGIIDP+CR+IGL LYDGLFKVIP D + +AFNIRLEEL V+D+KFLYGC
Sbjct: 114 RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 173
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PTI +YQD + RHVKTYEV+L++K+F +GPW Q N++ A ++I VP P G +II
Sbjct: 174 QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
G+E+I Y + + + AI I+ S + RVD +GSRYLLGD G L +L++ E++
Sbjct: 233 GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 292
Query: 236 ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
+ L++ELLGETSIA ++YLDN VV++GS GDSQL+KLN+ + +GSYV +E
Sbjct: 293 GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 352
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
+ NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W
Sbjct: 353 FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
LRS + D LV+SF+ +TR+L +N E E+EETE+ GF QT FC + + QL+Q+
Sbjct: 413 LRSDPNRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQI 471
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
TS SVRLVS + L +EWK P +++VA+ N+SQV++A G L YL+I L ++
Sbjct: 472 TSASVRLVSQEPKALVSEWKEPQAKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 530
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
H ++E+E++CLDI P+G++ S + A+G+WTDIS RI LP L+ KE LGGEIIPRS
Sbjct: 531 HTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 590
Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
+L+ FE YLLCALGDG L F LN++TG L+DRKKV+LGTQP LRTF S +TT+VF
Sbjct: 591 ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
A SDRPTVIYSSN KL++SNVNLKEV++MCP NS +PDSLA+A LTIGTID+IQKL
Sbjct: 651 ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 710
Query: 653 HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
HIR++PL E PR+IC+QE S+ F + S +
Sbjct: 711 HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGTTALRPSASTQALSSSVSSSKL 770
Query: 682 --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
S EE E+H + ++D TFE + + EY S++SC D N Y+
Sbjct: 771 FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 830
Query: 734 VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
VGTA V PEE EP +GRI+VF DGKLQ +AEKE KGAVYS+ FNGKLLA+IN ++L
Sbjct: 831 VGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 890
Query: 794 YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
Y+W +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK EG EE A
Sbjct: 891 YEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 946
Query: 854 RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
RD+N NWMSAVEILDDD +LGAEN FNLF +K+S TDEER L+ VG +HLGEFVN
Sbjct: 947 RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 1006
Query: 914 FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
F HGSLVM+ L ++ +V+FGTVNG+IG++ SL Y L +Q L KVIK VG
Sbjct: 1007 FCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066
Query: 973 GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN----------VSV 1022
+ H WRSF+ E+KT A F+DGDLIESFLD+SR +M E+ + +
Sbjct: 1067 KIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATA 1126
Query: 1023 EELCKRVEELTRLH 1036
++L K VEELTR+H
Sbjct: 1127 DDLIKVVEELTRIH 1140
>gi|403255013|ref|XP_003920244.1| PREDICTED: DNA damage-binding protein 1 [Saimiri boliviensis
boliviensis]
Length = 1140
Score = 1150 bits (2974), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 578/1094 (52%), Positives = 759/1094 (69%), Gaps = 67/1094 (6%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
+V +YG+IA +ELFRP GE++D LFI T +Y C+L++ ES ++ITRA G+V DRIG
Sbjct: 54 EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RP++ G IGIIDP+CR+IGL LYDGLFKVIP D + +AFNIRLEEL V+D+KFLYGC
Sbjct: 114 RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 173
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PTI +YQD + RHVKTYEV+L++K+F +GPW Q N++ A ++I VP P G +II
Sbjct: 174 QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
G+E+I Y + + + AI I+ S + RVD +GSRYLLGD G L +L++ E++
Sbjct: 233 GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 292
Query: 236 ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
+ L++ELLGETSIA ++YLDN VV++GS GDSQL+KLN+ + +GSYV +E
Sbjct: 293 GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 352
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
+ NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W
Sbjct: 353 FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
LRS + D LV+SF+ +TR+L +N E E+EETE+ GF QT FC + + QL+Q+
Sbjct: 413 LRSDPNRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQI 471
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
TS SVRLVS + L +EWK P +++VA+ N+SQV++A G L YL+I L ++
Sbjct: 472 TSASVRLVSQEPKALVSEWKEPQAKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 530
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
H ++E+E++CLDI P+G++ S + A+G+WTDIS RI LP L+ KE LGGEIIPRS
Sbjct: 531 HTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 590
Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
+L+ FE YLLCALGDG L F LN++TG L+DRKKV+LGTQP LRTF S +TT+VF
Sbjct: 591 ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
A SDRPTVIYSSN KL++SNVNLKEV++MCP NS +PDSLA+A LTIGTID+IQKL
Sbjct: 651 ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 710
Query: 653 HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
HIR++PL E PR+IC+QE S+ F + S +
Sbjct: 711 HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGTTALRPSASTQALSSSVSSSKL 770
Query: 682 --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
S EE E+H + ++D TFE + + EY S++SC D N Y+
Sbjct: 771 FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 830
Query: 734 VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
VGTA V PEE EP +GRI+VF DGKLQ +AEKE KGAVYS+ FNGKLLA+IN ++L
Sbjct: 831 VGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 890
Query: 794 YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
Y+W +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK EG EE A
Sbjct: 891 YEWT----AEKELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 946
Query: 854 RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
RD+N NWMSAVEILDDD +LGAEN FNLF +K+S TDEER L+ VG +HLGEFVN
Sbjct: 947 RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 1006
Query: 914 FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
F HGSLVM+ L ++ +V+FGTVNG+IG++ SL Y L +Q L KVIK VG
Sbjct: 1007 FCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066
Query: 973 GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN----------VSV 1022
+ H WRSF+ E+KT A F+DGDLIESFLD+SR +M E+ + +
Sbjct: 1067 KIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATA 1126
Query: 1023 EELCKRVEELTRLH 1036
++L K VEELTR+H
Sbjct: 1127 DDLIKVVEELTRIH 1140
>gi|122692537|ref|NP_001073731.1| DNA damage-binding protein 1 [Bos taurus]
gi|426251842|ref|XP_004019630.1| PREDICTED: DNA damage-binding protein 1 [Ovis aries]
gi|134034086|sp|A1A4K3.1|DDB1_BOVIN RecName: Full=DNA damage-binding protein 1; AltName:
Full=Damage-specific DNA-binding protein 1
gi|119223918|gb|AAI26630.1| Damage-specific DNA binding protein 1, 127kDa [Bos taurus]
gi|296471644|tpg|DAA13759.1| TPA: DNA damage-binding protein 1 [Bos taurus]
Length = 1140
Score = 1150 bits (2974), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 579/1094 (52%), Positives = 760/1094 (69%), Gaps = 67/1094 (6%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
+V +YG+IA +ELFRP GE++D LFI T +Y C+L++ ES ++ITRA G+V DRIG
Sbjct: 54 EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RP++ G IGIIDP+CR+IGL LYDGLFKVIP D + +AFNIRLEEL V+D+KFLYGC
Sbjct: 114 RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 173
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PTI +YQD + RHVKTYEV+L++K+F +GPW Q N++ A ++I VP P G +II
Sbjct: 174 QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
G+E+I Y + + + AI I+ S + RVD +GSRYLLGD G L +L++ E++
Sbjct: 233 GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 292
Query: 236 ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
+ L++ELLGETSIA ++YLDN VV++GS GDSQL+KLN+ + +GSYV +E
Sbjct: 293 GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 352
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
+ NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W
Sbjct: 353 FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
LRS + D LV+SF+ +TR+L +N E E+EETE+ GF QT FC + + QL+Q+
Sbjct: 413 LRSDPNRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQI 471
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
TS SVRLVS + L +EWK P G +++VA+ N+SQV++A G L YL+I L ++
Sbjct: 472 TSASVRLVSQEPKALVSEWKEPQGKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 530
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
H ++E+E++CLDI P+G++ S + A+G+WTDIS RI LP L+ KE LGGEIIPRS
Sbjct: 531 HTEMEHEVACLDITPLGDSNGMSPLCAIGLWTDISARIAKLPSFELLHKEMLGGEIIPRS 590
Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
+L+ FE YLLCALGDG L F LN++TG L+DRKKV+LGTQP LRTF S +TT+VF
Sbjct: 591 ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
A SDRPTVIYSSN KL++SNVNLKEV++MCP NS +PDSLA+A LTIGTID+IQKL
Sbjct: 651 ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 710
Query: 653 HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
HIR++PL E PR+IC+QE S+ F + S +
Sbjct: 711 HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGTTALRPSASTQALSSSVSSSKL 770
Query: 682 --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
S EE E+H + ++D TFE + + EY S++SC D N Y+
Sbjct: 771 FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 830
Query: 734 VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
VGTA V PEE EP +GRI+VF DGKLQ +AEKE KGAVYS+ FNGKLLA+IN ++L
Sbjct: 831 VGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 890
Query: 794 YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
Y+W +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK EG EE A
Sbjct: 891 YEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 946
Query: 854 RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
RD+N NWMSAVEILDDD +LGAEN FNLF +K+S TDEER L+ VG +HLGEFVN
Sbjct: 947 RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 1006
Query: 914 FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
F HGSLVM+ L ++ +V+FGTVNG+IG++ SL Y L +Q L KVIK VG
Sbjct: 1007 FCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066
Query: 973 GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN----------VSV 1022
+ H WRSF+ E+KT A F+DGDLIESFLD+SR +M E+ + +
Sbjct: 1067 KIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATA 1126
Query: 1023 EELCKRVEELTRLH 1036
++L K VEELTR+H
Sbjct: 1127 DDLIKVVEELTRIH 1140
>gi|359546285|pdb|4A11|A Chain A, Structure Of The Hsddb1-Hscsa Complex
gi|361132519|pdb|4A0K|C Chain C, Structure Of Ddb1-Ddb2-Cul4a-Rbx1 Bound To A 12 Bp Abasic
Site Containing Dna-Duplex
Length = 1159
Score = 1150 bits (2974), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 578/1094 (52%), Positives = 759/1094 (69%), Gaps = 67/1094 (6%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
+V +YG+IA +ELFRP GE++D LFI T +Y C+L++ ES ++ITRA G+V DRIG
Sbjct: 73 EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 132
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RP++ G IGIIDP+CR+IGL LYDGLFKVIP D + +AFNIRLEEL V+D+KFLYGC
Sbjct: 133 RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 192
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PTI +YQD + RHVKTYEV+L++K+F +GPW Q N++ A ++I VP P G +II
Sbjct: 193 QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 251
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
G+E+I Y + + + AI I+ S + RVD +GSRYLLGD G L +L++ E++
Sbjct: 252 GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 311
Query: 236 ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
+ L++ELLGETSIA ++YLDN VV++GS GDSQL+KLN+ + +GSYV +E
Sbjct: 312 GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 371
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
+ NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W
Sbjct: 372 FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 431
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
LRS + D LV+SF+ +TR+L +N E E+EETE+ GF QT FC + + QL+Q+
Sbjct: 432 LRSDPNRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQI 490
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
TS SVRLVS + L +EWK P +++VA+ N+SQV++A G L YL+I L ++
Sbjct: 491 TSASVRLVSQEPKALVSEWKEPQAKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 549
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
H ++E+E++CLDI P+G++ S + A+G+WTDIS RI LP L+ KE LGGEIIPRS
Sbjct: 550 HTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 609
Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
+L+ FE YLLCALGDG L F LN++TG L+DRKKV+LGTQP LRTF S +TT+VF
Sbjct: 610 ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 669
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
A SDRPTVIYSSN KL++SNVNLKEV++MCP NS +PDSLA+A LTIGTID+IQKL
Sbjct: 670 ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 729
Query: 653 HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
HIR++PL E PR+IC+QE S+ F + S +
Sbjct: 730 HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGTTALRPSASTQALSSSVSSSKL 789
Query: 682 --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
S EE E+H + ++D TFE + + EY S++SC D N Y+
Sbjct: 790 FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 849
Query: 734 VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
VGTA V PEE EP +GRI+VF DGKLQ +AEKE KGAVYS+ FNGKLLA+IN ++L
Sbjct: 850 VGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 909
Query: 794 YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
Y+W +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK EG EE A
Sbjct: 910 YEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 965
Query: 854 RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
RD+N NWMSAVEILDDD +LGAEN FNLF +K+S TDEER L+ VG +HLGEFVN
Sbjct: 966 RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 1025
Query: 914 FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
F HGSLVM+ L ++ +V+FGTVNG+IG++ SL Y L +Q L KVIK VG
Sbjct: 1026 FCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1085
Query: 973 GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN----------VSV 1022
+ H WRSF+ E+KT A F+DGDLIESFLD+SR +M E+ + +
Sbjct: 1086 KIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATA 1145
Query: 1023 EELCKRVEELTRLH 1036
++L K VEELTR+H
Sbjct: 1146 DDLIKVVEELTRIH 1159
>gi|348560393|ref|XP_003465998.1| PREDICTED: DNA damage-binding protein 1-like [Cavia porcellus]
Length = 1140
Score = 1150 bits (2974), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 579/1094 (52%), Positives = 758/1094 (69%), Gaps = 67/1094 (6%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
+V +YG+IA +ELFRP GE +D LFI T +Y C+L++ ES ++ITRA G+V DRIG
Sbjct: 54 EVGMYGKIAVMELFRPKGENKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RP++ G IGIIDP+CR+IGL LYDGLFKVIP D + +AFNIRLEEL V+D+KFLYGC
Sbjct: 114 RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 173
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PTI +YQD + RHVKTYEV+L++K+F +GPW Q N++ A ++I VP P G +II
Sbjct: 174 QAPTICFVYQDPQ-GRHVKTYEVSLREKEFSKGPWKQENVEAEASMVIAVPEPFGGAIII 232
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
G+E+I Y + + + AI I+ S + RVD +GSRYLLGD G L +L++ E++
Sbjct: 233 GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 292
Query: 236 ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
+ L++ELLGETSIA ++YLDN VV++GS GDSQL+KLN+ + +GSYV +E
Sbjct: 293 GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 352
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
+ NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W
Sbjct: 353 FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
LRS D LV+SF+ +TR+L +N E E+EETE+ GF QT FC + + QL+Q+
Sbjct: 413 LRSDPSRETDDTLVLSFVGQTRVLMLNGE-EVEETELVGFVDDQQTFFCGNVAHQQLIQI 471
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
TS SVRLVS + L +EWK P G +++VA+ N+SQV++A G L YL+I L ++
Sbjct: 472 TSASVRLVSQEPKALVSEWKEPQGKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 530
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
H ++E+E++CLDI P+G++ S + A+G+WTDIS RI LP L+ KE LGGEIIPRS
Sbjct: 531 HTEMEHEVACLDITPLGDSCGLSPLCAIGLWTDISARILQLPSFELLHKEMLGGEIIPRS 590
Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
+L+ FE YLLCALGDG L F LN++TG L+DRKKV+LGTQP LRTF S +TT+VF
Sbjct: 591 ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
A SDRPTVIYSSN KL++SNVNLKEV++MCP NS +PDSLA+A LTIGTID+IQKL
Sbjct: 651 ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 710
Query: 653 HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
HIR++PL E PR+IC+QE S+ F + S +
Sbjct: 711 HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGTTVLRPSASTQALSSSVSSSKL 770
Query: 682 --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
S EE E+H + ++D TFE + + EY S++SC D N Y+
Sbjct: 771 FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCRLGKDPNTYFI 830
Query: 734 VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
VGTA V PEE EP +GRI+VF DGKLQ +AEKE KGAVYS+ FNGKLLA+IN ++L
Sbjct: 831 VGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 890
Query: 794 YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
Y+W +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK EG EE A
Sbjct: 891 YEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 946
Query: 854 RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
RD+N NWMSAVEILDDD +LGAEN FNLF +K+S TDEER L+ VG +HLGEFVN
Sbjct: 947 RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 1006
Query: 914 FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
F HGSLVM+ L ++ +V+FGTVNG+IG++ SL Y L +Q L KVIK VG
Sbjct: 1007 FCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066
Query: 973 GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN----------VSV 1022
+ H WRSF+ E+KT A F+DGDLIESFLD+SR +M E+ + +
Sbjct: 1067 KIEHSFWRSFHTERKTEQATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATA 1126
Query: 1023 EELCKRVEELTRLH 1036
++L K VEELTR+H
Sbjct: 1127 DDLIKVVEELTRIH 1140
>gi|432851195|ref|XP_004066902.1| PREDICTED: DNA damage-binding protein 1-like [Oryzias latipes]
Length = 1140
Score = 1149 bits (2973), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 579/1094 (52%), Positives = 764/1094 (69%), Gaps = 67/1094 (6%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
+V +YGRI+ +ELFRP GE++D LFI T +Y C+L++ + +S ++ITRA G+V DRIG
Sbjct: 54 EVGMYGRISVMELFRPKGESKDLLFILTSKYNACILEYKQNGDSIDIITRAHGNVQDRIG 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RP++ G IGI+DP+CR+IGL LYDGLFKVIP D + + +AFNIRLEELQV+D+ FLYGC
Sbjct: 114 RPSETGIIGIVDPECRMIGLRLYDGLFKVIPLDRENRELKAFNIRLEELQVIDVHFLYGC 173
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PT+ +YQD + RHVKTYEV+L++K+F +GPW Q N++ A ++IPVP P G +II
Sbjct: 174 QAPTVCFIYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIPVPEPFGGAIII 232
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEK--- 234
G+E+I Y + + + AI I+ S + RVD +GSRYLLGD G L +L++ E+
Sbjct: 233 GQESITYHNGDKYLAIAPPTIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEELMD 292
Query: 235 --EKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
+ L +ELLGETSIA ++YLDN VV++GS GDSQL+KLN+ + +GSYV V+E
Sbjct: 293 GTVALKDLHVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVTVMET 352
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
+ NLGPI+D CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W
Sbjct: 353 FTNLGPILDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
LRS D LV+SF+ +TR+L ++ E E+EETE+ GF QT +C + + QL+Q+
Sbjct: 413 LRSEAGRESDDMLVLSFVGQTRVLMLSGE-EVEETELPGFVDNQQTFYCGNVAHQQLIQI 471
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
TSGSVRLV S+ L +EWK P G +++VA N +QV+LA G L YL+I G L ++
Sbjct: 472 TSGSVRLVLQDSKALVSEWKEPQGRNISVAACNHTQVVLAVGRA-LYYLQILAGELKQIS 530
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
++E+E++CLDI P+GE+ S S + AVG+WTDIS R+ LP + KE LGGEIIPRS
Sbjct: 531 TTEMEHEVACLDITPLGESSSESPLCAVGLWTDISARMLKLPCFTPLHKEMLGGEIIPRS 590
Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
+L+ FEG YLLCALGDG L F L+++TG L++RKKV+LGTQP LRTF S +T++VF
Sbjct: 591 ILMTTFEGSYYLLCALGDGALFYFGLDLQTGALSERKKVTLGTQPTVLRTFRSLSTSNVF 650
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
A SDRPTVIYSSN KL++SNVNLKEV++MCP NS +PDSLA+A LTIGTID+IQKL
Sbjct: 651 ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSEGYPDSLALANNSTLTIGTIDEIQKL 710
Query: 653 HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
HIR++PL E PRRIC+QE S+ F + S +
Sbjct: 711 HIRTVPLYESPRRICYQEVSQCFGVLSSRVEIQDASGAISAVRPSASTQALSSSVSSSKL 770
Query: 682 --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
S EE E+H + ++D TFE + + EYG S++SC D +VY+
Sbjct: 771 FPSSTSPHETSFGEEVEVHSLLVVDQHTFEVLHAHQFLPSEYGLSLVSCRLGKDPSVYFI 830
Query: 734 VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
VGTA V PEE EP +GRI+VF DGKLQ +AEKE KGAVYS+ FNGKLLA+IN ++L
Sbjct: 831 VGTAMVYPEEPEPKQGRIIVFHYTDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 890
Query: 794 YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
Y+W +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK EG EE A
Sbjct: 891 YEWT----AEKELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 946
Query: 854 RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
RD+N NWMSAVEILDDD +LGAEN FNLF +K+S TDEER L+ VG +HLGEFVN
Sbjct: 947 RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGVFHLGEFVNV 1006
Query: 914 FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
F HGSLV++ L +S +V+FGTVNG+IG++ SL + L LQ L KVIK VG
Sbjct: 1007 FCHGSLVLQNLGESSTPTQGSVLFGTVNGMIGLVTSLSEGWHSLLLDLQNRLNKVIKSVG 1066
Query: 973 GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV----------SV 1022
+ H WRSF E+KT A F+DGDLIESFLDL R +M E+ T+ + +V
Sbjct: 1067 KIEHSFWRSFYTERKTEQATGFIDGDLIESFLDLGRAKMQEVVSTLQIDDGGGMKREATV 1126
Query: 1023 EELCKRVEELTRLH 1036
+E+ K VEELTR+H
Sbjct: 1127 DEVIKIVEELTRIH 1140
>gi|221046711|pdb|3EI1|A Chain A, Structure Of Hsddb1-Drddb2 Bound To A 14 Bp 6-4 Photoproduct
Containing Dna-Duplex
gi|221046715|pdb|3EI2|A Chain A, Structure Of Hsddb1-Drddb2 Bound To A 16 Bp Abasic Site
Containing Dna-Duplex
gi|221046719|pdb|3EI3|A Chain A, Structure Of The Hsddb1-Drddb2 Complex
Length = 1158
Score = 1149 bits (2973), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 578/1094 (52%), Positives = 759/1094 (69%), Gaps = 67/1094 (6%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
+V +YG+IA +ELFRP GE++D LFI T +Y C+L++ ES ++ITRA G+V DRIG
Sbjct: 72 EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 131
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RP++ G IGIIDP+CR+IGL LYDGLFKVIP D + +AFNIRLEEL V+D+KFLYGC
Sbjct: 132 RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 191
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PTI +YQD + RHVKTYEV+L++K+F +GPW Q N++ A ++I VP P G +II
Sbjct: 192 QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 250
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
G+E+I Y + + + AI I+ S + RVD +GSRYLLGD G L +L++ E++
Sbjct: 251 GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 310
Query: 236 ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
+ L++ELLGETSIA ++YLDN VV++GS GDSQL+KLN+ + +GSYV +E
Sbjct: 311 GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 370
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
+ NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W
Sbjct: 371 FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 430
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
LRS + D LV+SF+ +TR+L +N E E+EETE+ GF QT FC + + QL+Q+
Sbjct: 431 LRSDPNRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQI 489
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
TS SVRLVS + L +EWK P +++VA+ N+SQV++A G L YL+I L ++
Sbjct: 490 TSASVRLVSQEPKALVSEWKEPQAKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 548
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
H ++E+E++CLDI P+G++ S + A+G+WTDIS RI LP L+ KE LGGEIIPRS
Sbjct: 549 HTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 608
Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
+L+ FE YLLCALGDG L F LN++TG L+DRKKV+LGTQP LRTF S +TT+VF
Sbjct: 609 ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 668
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
A SDRPTVIYSSN KL++SNVNLKEV++MCP NS +PDSLA+A LTIGTID+IQKL
Sbjct: 669 ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 728
Query: 653 HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
HIR++PL E PR+IC+QE S+ F + S +
Sbjct: 729 HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGTTALRPSASTQALSSSVSSSKL 788
Query: 682 --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
S EE E+H + ++D TFE + + EY S++SC D N Y+
Sbjct: 789 FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 848
Query: 734 VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
VGTA V PEE EP +GRI+VF DGKLQ +AEKE KGAVYS+ FNGKLLA+IN ++L
Sbjct: 849 VGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 908
Query: 794 YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
Y+W +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK EG EE A
Sbjct: 909 YEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 964
Query: 854 RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
RD+N NWMSAVEILDDD +LGAEN FNLF +K+S TDEER L+ VG +HLGEFVN
Sbjct: 965 RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 1024
Query: 914 FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
F HGSLVM+ L ++ +V+FGTVNG+IG++ SL Y L +Q L KVIK VG
Sbjct: 1025 FCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1084
Query: 973 GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN----------VSV 1022
+ H WRSF+ E+KT A F+DGDLIESFLD+SR +M E+ + +
Sbjct: 1085 KIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATA 1144
Query: 1023 EELCKRVEELTRLH 1036
++L K VEELTR+H
Sbjct: 1145 DDLIKVVEELTRIH 1158
>gi|358440066|pdb|4A0A|A Chain A, Structure Of Hsddb1-Drddb2 Bound To A 16 Bp Cpd-Duplex (
Pyrimidine At D-1 Position) At 3.6 A Resolution (Cpd 3)
Length = 1159
Score = 1149 bits (2973), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 578/1094 (52%), Positives = 759/1094 (69%), Gaps = 67/1094 (6%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
+V +YG+IA +ELFRP GE++D LFI T +Y C+L++ ES ++ITRA G+V DRIG
Sbjct: 73 EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 132
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RP++ G IGIIDP+CR+IGL LYDGLFKVIP D + +AFNIRLEEL V+D+KFLYGC
Sbjct: 133 RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 192
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PTI +YQD + RHVKTYEV+L++K+F +GPW Q N++ A ++I VP P G +II
Sbjct: 193 QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPKPFGGAIII 251
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
G+E+I Y + + + AI I+ S + RVD +GSRYLLGD G L +L++ E++
Sbjct: 252 GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 311
Query: 236 ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
+ L++ELLGETSIA ++YLDN VV++GS GDSQL+KLN+ + +GSYV +E
Sbjct: 312 GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 371
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
+ NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W
Sbjct: 372 FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 431
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
LRS + D LV+SF+ +TR+L +N E E+EETE+ GF QT FC + + QL+Q+
Sbjct: 432 LRSDPNRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQI 490
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
TS SVRLVS + L +EWK P +++VA+ N+SQV++A G L YL+I L ++
Sbjct: 491 TSASVRLVSQEPKALVSEWKEPQAKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 549
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
H ++E+E++CLDI P+G++ S + A+G+WTDIS RI LP L+ KE LGGEIIPRS
Sbjct: 550 HTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 609
Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
+L+ FE YLLCALGDG L F LN++TG L+DRKKV+LGTQP LRTF S +TT+VF
Sbjct: 610 ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 669
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
A SDRPTVIYSSN KL++SNVNLKEV++MCP NS +PDSLA+A LTIGTID+IQKL
Sbjct: 670 ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 729
Query: 653 HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
HIR++PL E PR+IC+QE S+ F + S +
Sbjct: 730 HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGTTALRPSASTQALSSSVSSSKL 789
Query: 682 --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
S EE E+H + ++D TFE + + EY S++SC D N Y+
Sbjct: 790 FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 849
Query: 734 VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
VGTA V PEE EP +GRI+VF DGKLQ +AEKE KGAVYS+ FNGKLLA+IN ++L
Sbjct: 850 VGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 909
Query: 794 YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
Y+W +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK EG EE A
Sbjct: 910 YEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 965
Query: 854 RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
RD+N NWMSAVEILDDD +LGAEN FNLF +K+S TDEER L+ VG +HLGEFVN
Sbjct: 966 RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 1025
Query: 914 FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
F HGSLVM+ L ++ +V+FGTVNG+IG++ SL Y L +Q L KVIK VG
Sbjct: 1026 FCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1085
Query: 973 GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN----------VSV 1022
+ H WRSF+ E+KT A F+DGDLIESFLD+SR +M E+ + +
Sbjct: 1086 KIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATA 1145
Query: 1023 EELCKRVEELTRLH 1036
++L K VEELTR+H
Sbjct: 1146 DDLIKVVEELTRIH 1159
>gi|358440070|pdb|4A0B|A Chain A, Structure Of Hsddb1-Drddb2 Bound To A 16 Bp Cpd-Duplex (
Pyrimidine At D-1 Position) At 3.8 A Resolution (Cpd 4)
gi|358440072|pdb|4A0B|C Chain C, Structure Of Hsddb1-Drddb2 Bound To A 16 Bp Cpd-Duplex (
Pyrimidine At D-1 Position) At 3.8 A Resolution (Cpd 4)
Length = 1159
Score = 1149 bits (2973), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 578/1094 (52%), Positives = 759/1094 (69%), Gaps = 67/1094 (6%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
+V +YG+IA +ELFRP GE++D LFI T +Y C+L++ ES ++ITRA G+V DRIG
Sbjct: 73 EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 132
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RP++ G IGIIDP+CR+IGL LYDGLFKVIP D + +AFNIRLEEL V+D+KFLYGC
Sbjct: 133 RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 192
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PTI +YQD + RHVKTYEV+L++K+F +GPW Q N++ A ++I VP P G +II
Sbjct: 193 QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPSPFGGAIII 251
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
G+E+I Y + + + AI I+ S + RVD +GSRYLLGD G L +L++ E++
Sbjct: 252 GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 311
Query: 236 ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
+ L++ELLGETSIA ++YLDN VV++GS GDSQL+KLN+ + +GSYV +E
Sbjct: 312 GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 371
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
+ NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W
Sbjct: 372 FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 431
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
LRS + D LV+SF+ +TR+L +N E E+EETE+ GF QT FC + + QL+Q+
Sbjct: 432 LRSDPNRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQI 490
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
TS SVRLVS + L +EWK P +++VA+ N+SQV++A G L YL+I L ++
Sbjct: 491 TSASVRLVSQEPKALVSEWKEPQAKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 549
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
H ++E+E++CLDI P+G++ S + A+G+WTDIS RI LP L+ KE LGGEIIPRS
Sbjct: 550 HTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 609
Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
+L+ FE YLLCALGDG L F LN++TG L+DRKKV+LGTQP LRTF S +TT+VF
Sbjct: 610 ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 669
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
A SDRPTVIYSSN KL++SNVNLKEV++MCP NS +PDSLA+A LTIGTID+IQKL
Sbjct: 670 ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 729
Query: 653 HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
HIR++PL E PR+IC+QE S+ F + S +
Sbjct: 730 HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGTTALRPSASTQALSSSVSSSKL 789
Query: 682 --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
S EE E+H + ++D TFE + + EY S++SC D N Y+
Sbjct: 790 FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 849
Query: 734 VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
VGTA V PEE EP +GRI+VF DGKLQ +AEKE KGAVYS+ FNGKLLA+IN ++L
Sbjct: 850 VGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 909
Query: 794 YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
Y+W +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK EG EE A
Sbjct: 910 YEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 965
Query: 854 RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
RD+N NWMSAVEILDDD +LGAEN FNLF +K+S TDEER L+ VG +HLGEFVN
Sbjct: 966 RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 1025
Query: 914 FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
F HGSLVM+ L ++ +V+FGTVNG+IG++ SL Y L +Q L KVIK VG
Sbjct: 1026 FCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1085
Query: 973 GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN----------VSV 1022
+ H WRSF+ E+KT A F+DGDLIESFLD+SR +M E+ + +
Sbjct: 1086 KIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATA 1145
Query: 1023 EELCKRVEELTRLH 1036
++L K VEELTR+H
Sbjct: 1146 DDLIKVVEELTRIH 1159
>gi|301121252|ref|XP_002908353.1| DNA damage-binding protein, putative [Phytophthora infestans T30-4]
gi|262103384|gb|EEY61436.1| DNA damage-binding protein, putative [Phytophthora infestans T30-4]
Length = 1150
Score = 1149 bits (2973), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 580/1098 (52%), Positives = 767/1098 (69%), Gaps = 65/1098 (5%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRP 62
D+ +YGRIA E+FR E QD++FI T+R++FCVL +D+ + +++T+A G + D IGR
Sbjct: 54 DINLYGRIAIFEVFRAANEPQDWIFIVTQRFQFCVLAYDSSAQQVVTKAHGSIRDSIGRS 113
Query: 63 TDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDN-KGQLKEAFNIRLEELQVLDIKFLYGCA 121
++ G IDP+ RLIG++LY+G FKVIP D+ KG L++ FNIRL+EL+V+DIKFL+G
Sbjct: 114 SEIVTSGNIDPEGRLIGMNLYEGYFKVIPIDSGKGILRDTFNIRLDELRVIDIKFLHGYN 173
Query: 122 KPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIG 181
KPTI VLY+D K ARHVKTY + LK+KDF EGPWSQ+N+++GA LLIPVP P GVLI+
Sbjct: 174 KPTICVLYEDYKAARHVKTYHILLKEKDFAEGPWSQSNVESGASLLIPVPAPTGGVLIVS 233
Query: 182 EETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLK 241
+TIVY + + F AIP++ ++ + YG VD DGSR+LL D G L ++ + H ++V+G+
Sbjct: 234 NQTIVYHNGSTFHAIPMQSTVIQVYGAVDKDGSRFLLADQYGTLSVVALQHTGKEVSGVH 293
Query: 242 IELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVD 301
+E+LGET+IAS +SYLDN VV+IGS++GDSQLIKLN D GSY+EVL+ YVN+GPI+D
Sbjct: 294 LEVLGETNIASCLSYLDNGVVFIGSTFGDSQLIKLNADRDETGSYIEVLDSYVNVGPIID 353
Query: 302 FCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDPF 361
FCV+DL+RQGQGQ+VTCSGA KDG+LR++RNGIGINEQAS EL GIKGMW+LR +
Sbjct: 354 FCVMDLDRQGQGQIVTCSGADKDGTLRVIRNGIGINEQASAELPGIKGMWALRETFAAEH 413
Query: 362 DTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVS 421
D FL+ S++SE RILA+ EDE+EE EI F + +TL C + + +QVT VRL+S
Sbjct: 414 DKFLLQSYVSEVRILAIGDEDEMEEKEIPAF-TNVKTLLCRNMYGDYWLQVTESEVRLIS 472
Query: 422 STSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEIS 481
+S L + W G + VA AN +QV +AT GG LVYLEI +G + E ++E+E++
Sbjct: 473 CSSFSLSSTWSPASGSRITVAAANPTQVAVATSGGVLVYLEIENGQVVEKTTVKMEHELA 532
Query: 482 CLDINPI-------------GENPSY------SQIAAVGMWTDISVRIFSLPDLNLITKE 522
C+DI P+ G + + S + VG+WT SV + LP L T E
Sbjct: 533 CVDITPLTSSGAGDGDVAMTGSSTHWDMAALRSTLCVVGLWTSFSVCVLQLPTLEKRTTE 592
Query: 523 HLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRT 582
LG +++PRSVL FEG YLL LGDG L+N+ L+++ G L RK+VSLG+QP++L T
Sbjct: 593 ALGTDLLPRSVLCNTFEGKDYLLVGLGDGSLMNYELDVQQGALGTRKRVSLGSQPLSLST 652
Query: 583 FSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELT 642
F SKN THVFAA DRPTVIYS+N KLLYSN+N KEV+ MCPF+S +FP+ LA++ E ELT
Sbjct: 653 FRSKNMTHVFAACDRPTVIYSNNNKLLYSNINSKEVNVMCPFDSESFPECLALSSEEELT 712
Query: 643 IGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAI--CSLKNQSCAEESEMHFVRLLDDQ 700
IGT+DDIQKLHI++ L E RRI H +S T + S EE + FVRL DDQ
Sbjct: 713 IGTVDDIQKLHIQTFHLNEWARRIAHDPESHTLGVLTVSFTVDDTGEEVDQGFVRLFDDQ 772
Query: 701 TFEFISTYPLDTFEYGCSILSCSFSDDS--NVYYCVGTAYVLPEENEPTKGRILVFIVE- 757
TFE + +Y LD FE CS++ C + DS Y+ VGTAY+ EE EP +GRILVF V
Sbjct: 773 TFEVLHSYRLDPFETPCSVVVCPLAGDSVNASYFVVGTAYIHEEEAEPHQGRILVFAVTG 832
Query: 758 ---DGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHG 814
+ KLQL+ EKE KGAVY LN+FNGK+LA +N K QLYKW D +EL SECGH+G
Sbjct: 833 IHGERKLQLVTEKEVKGAVYCLNSFNGKVLAGVNSKAQLYKWSENTDNEKELVSECGHYG 892
Query: 815 HILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG 874
H L LY+++RGDFIVVGDLMKSISLL YK +G IEE A+D N+NWMSAV I+DDD Y+G
Sbjct: 893 HTLVLYMESRGDFIVVGDLMKSISLLSYKQLDGTIEEIAKDLNSNWMSAVGIVDDDTYIG 952
Query: 875 AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMR------------ 922
+E +FNLFTV++NS A+DEERGRLE VGE+HLGEFVNRFR+GSLVM+
Sbjct: 953 SETDFNLFTVQRNSGAASDEERGRLETVGEFHLGEFVNRFRYGSLVMQNSSSTSQTPSGV 1012
Query: 923 --------------LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVI 968
P + V Q +++FGTV+G+IGVI + +QY FL ++Q L V+
Sbjct: 1013 VSTGPTAMVDVGESAPAAPVVQNQSMLFGTVSGMIGVILPISKDQYSFLLRVQQALTHVV 1072
Query: 969 KGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN--------- 1019
KGVGG +H+ WR+F N + +A+NF+DGDL+ESFLDL + +M ++ +N
Sbjct: 1073 KGVGGFSHKDWRTFENRRSVSEARNFIDGDLVESFLDLPKPQMTKVVDKLNSDGMLDGTD 1132
Query: 1020 -VSVEELCKRVEELTRLH 1036
+VE+L R+EEL +LH
Sbjct: 1133 QFTVEDLTLRIEELVQLH 1150
>gi|395852550|ref|XP_003798801.1| PREDICTED: DNA damage-binding protein 1 [Otolemur garnettii]
Length = 1140
Score = 1149 bits (2973), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 578/1094 (52%), Positives = 759/1094 (69%), Gaps = 67/1094 (6%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
+V +YG+IA +ELFRP GE++D LFI T +Y C+L++ ES ++ITRA G+V DRIG
Sbjct: 54 EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RP++ G IGIIDP+CR+IGL LYDGLFKVIP D + +AFNIRLEEL V+D+KFLYGC
Sbjct: 114 RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 173
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PTI +YQD + RHVKTYEV+L++K+F +GPW Q N++ A ++I VP P G +II
Sbjct: 174 QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
G+E+I Y + + + AI I+ S + RVD +GSRYLLGD G L +L++ E++
Sbjct: 233 GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 292
Query: 236 ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
+ L++ELLGETSIA ++YLDN VV++GS GDSQL+KLN+ + +GSYV +E
Sbjct: 293 GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 352
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
+ NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W
Sbjct: 353 FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
LRS + D LV+SF+ +TR+L +N E E+EETE+ GF QT FC + + QL+Q+
Sbjct: 413 LRSEPNRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQI 471
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
TS SVRLV + L +EWK P G +++VA+ N+SQV++A G L YL+I L ++
Sbjct: 472 TSASVRLVCQEPKALVSEWKEPQGKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 530
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
H ++E+E++CLDI P+G++ S + A+G+WTDIS RI LP L+ KE LGGEIIPRS
Sbjct: 531 HTEMEHEVACLDITPLGDSNGMSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 590
Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
+L+ FE YLLCALGDG L F LN++TG L+DRKKV+LGTQP LRTF S +TT+VF
Sbjct: 591 ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
A SDRPTVIYSSN KL++SNVNLKEV++MCP NS +PDSLA+A LTIGTID+IQKL
Sbjct: 651 ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 710
Query: 653 HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
HIR++PL E PR+IC+QE S+ F + S +
Sbjct: 711 HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGTTALRPSASTQALSSSVSSSKL 770
Query: 682 --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
S EE E+H + ++D TFE + + EY S++SC D N Y+
Sbjct: 771 FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 830
Query: 734 VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
VGTA V PEE EP +GRI+VF DGKLQ +AEKE KGAVYS+ FNGKLLA+IN ++L
Sbjct: 831 VGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 890
Query: 794 YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
Y+W +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK EG EE A
Sbjct: 891 YEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 946
Query: 854 RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
RD+N NWMSAVEILDDD +LGAEN FNLF +K+S TDEER L+ VG +HLGEFVN
Sbjct: 947 RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 1006
Query: 914 FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
F HGSLVM+ L ++ +V+FGTVNG+IG++ SL Y L +Q L KVIK VG
Sbjct: 1007 FCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066
Query: 973 GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN----------VSV 1022
+ H WRSF+ E+KT A F+DGDLIESFLD+SR +M E+ + +
Sbjct: 1067 KIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATA 1126
Query: 1023 EELCKRVEELTRLH 1036
++L K VEELTR+H
Sbjct: 1127 DDLIKVVEELTRIH 1140
>gi|74215029|dbj|BAE33503.1| unnamed protein product [Mus musculus]
Length = 1140
Score = 1149 bits (2973), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 578/1094 (52%), Positives = 759/1094 (69%), Gaps = 67/1094 (6%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
+V +YG+IA +ELFRP GE++D LFI T +Y C+L++ ES ++ITRA G+V DRIG
Sbjct: 54 EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RP++ G IGIIDP+CR+IGL LYDGLFKVIP D + +AFNIRLEEL V+D+KFLYGC
Sbjct: 114 RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 173
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PTI +YQD + RHVKTYEV+L++K+F +GPW Q N++ A ++I VP P G +II
Sbjct: 174 QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
G+E+I Y + + + AI I+ S + RVD +GSRYLLGD G L +L++ E++
Sbjct: 233 GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 292
Query: 236 ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
+ L++ELLGETSIA ++YLDN VV++GS GDSQL+KLN+ + +GSYV +E
Sbjct: 293 GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 352
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
+ NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W
Sbjct: 353 FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
LRS D LV+SF+ +TR+L +N E E+EETE+ GF QT FC + + QL+Q+
Sbjct: 413 LRSDPGRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQI 471
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
TS SVRLVS + L +EWK P G +++VA+ N+SQV++A G L YL+I L ++
Sbjct: 472 TSASVRLVSQEPKALVSEWKEPQGKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 530
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
H ++E+E++CLDI P+G++ S + A+G+WTDIS RI LP L+ KE LGGEIIPRS
Sbjct: 531 HTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 590
Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
+L+ FE YLLCALGDG L F LN++TG L+DRKKV+LGTQP LRTF S +TT+VF
Sbjct: 591 ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
A SDRPTVIYSSN KL++SNVNLKEV++MCP NS +PDSLA+A LTIGTID+IQKL
Sbjct: 651 ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 710
Query: 653 HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
HIR++PL E PR+IC+QE S+ F + S +
Sbjct: 711 HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDSSGGTTALRPSASTQALSSSVSSSKL 770
Query: 682 --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
S EE E+H + ++D TFE + + EY S++SC D N Y+
Sbjct: 771 FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 830
Query: 734 VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
VGTA V PEE EP +GRI+VF DGKLQ +AEKE KGAVYS+ FNGKLLA+IN ++L
Sbjct: 831 VGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 890
Query: 794 YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
Y+W +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK EG EE A
Sbjct: 891 YEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 946
Query: 854 RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
RD+N NWMSAVEILDDD +LGAEN FNLF +++S TDEER L+ VG +HLGEFVN
Sbjct: 947 RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQRDSAATTDEERQHLQEVGLFHLGEFVNV 1006
Query: 914 FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
F HGSLVM+ L ++ +V+FGTVNG+IG++ SL Y L +Q L KVIK VG
Sbjct: 1007 FCHGSLVMQNLGEASTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066
Query: 973 GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN----------VSV 1022
+ H WRSF+ E+KT A F+DGDLIESFLD+SR +M E+ + +
Sbjct: 1067 KIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATA 1126
Query: 1023 EELCKRVEELTRLH 1036
++L K VEELTR+H
Sbjct: 1127 DDLIKVVEELTRIH 1140
>gi|361132523|pdb|4A0L|A Chain A, Structure Of Ddb1-Ddb2-Cul4b-Rbx1 Bound To A 12 Bp Abasic
Site Containing Dna-Duplex
gi|361132525|pdb|4A0L|C Chain C, Structure Of Ddb1-Ddb2-Cul4b-Rbx1 Bound To A 12 Bp Abasic
Site Containing Dna-Duplex
Length = 1144
Score = 1149 bits (2972), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 578/1094 (52%), Positives = 759/1094 (69%), Gaps = 67/1094 (6%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
+V +YG+IA +ELFRP GE++D LFI T +Y C+L++ ES ++ITRA G+V DRIG
Sbjct: 58 EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 117
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RP++ G IGIIDP+CR+IGL LYDGLFKVIP D + +AFNIRLEEL V+D+KFLYGC
Sbjct: 118 RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 177
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PTI +YQD + RHVKTYEV+L++K+F +GPW Q N++ A ++I VP P G +II
Sbjct: 178 QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 236
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
G+E+I Y + + + AI I+ S + RVD +GSRYLLGD G L +L++ E++
Sbjct: 237 GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 296
Query: 236 ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
+ L++ELLGETSIA ++YLDN VV++GS GDSQL+KLN+ + +GSYV +E
Sbjct: 297 GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 356
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
+ NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W
Sbjct: 357 FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 416
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
LRS + D LV+SF+ +TR+L +N E E+EETE+ GF QT FC + + QL+Q+
Sbjct: 417 LRSDPNRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQI 475
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
TS SVRLVS + L +EWK P +++VA+ N+SQV++A G L YL+I L ++
Sbjct: 476 TSASVRLVSQEPKALVSEWKEPQAKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 534
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
H ++E+E++CLDI P+G++ S + A+G+WTDIS RI LP L+ KE LGGEIIPRS
Sbjct: 535 HTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 594
Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
+L+ FE YLLCALGDG L F LN++TG L+DRKKV+LGTQP LRTF S +TT+VF
Sbjct: 595 ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 654
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
A SDRPTVIYSSN KL++SNVNLKEV++MCP NS +PDSLA+A LTIGTID+IQKL
Sbjct: 655 ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 714
Query: 653 HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
HIR++PL E PR+IC+QE S+ F + S +
Sbjct: 715 HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGTTALRPSASTQALSSSVSSSKL 774
Query: 682 --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
S EE E+H + ++D TFE + + EY S++SC D N Y+
Sbjct: 775 FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 834
Query: 734 VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
VGTA V PEE EP +GRI+VF DGKLQ +AEKE KGAVYS+ FNGKLLA+IN ++L
Sbjct: 835 VGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 894
Query: 794 YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
Y+W +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK EG EE A
Sbjct: 895 YEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 950
Query: 854 RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
RD+N NWMSAVEILDDD +LGAEN FNLF +K+S TDEER L+ VG +HLGEFVN
Sbjct: 951 RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 1010
Query: 914 FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
F HGSLVM+ L ++ +V+FGTVNG+IG++ SL Y L +Q L KVIK VG
Sbjct: 1011 FCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1070
Query: 973 GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN----------VSV 1022
+ H WRSF+ E+KT A F+DGDLIESFLD+SR +M E+ + +
Sbjct: 1071 KIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATA 1130
Query: 1023 EELCKRVEELTRLH 1036
++L K VEELTR+H
Sbjct: 1131 DDLIKVVEELTRIH 1144
>gi|148529014|ref|NP_001914.3| DNA damage-binding protein 1 [Homo sapiens]
gi|296218432|ref|XP_002807395.1| PREDICTED: LOW QUALITY PROTEIN: DNA damage-binding protein 1
[Callithrix jacchus]
gi|397516558|ref|XP_003828491.1| PREDICTED: DNA damage-binding protein 1 [Pan paniscus]
gi|402893195|ref|XP_003909786.1| PREDICTED: DNA damage-binding protein 1 [Papio anubis]
gi|426368721|ref|XP_004051351.1| PREDICTED: DNA damage-binding protein 1 [Gorilla gorilla gorilla]
gi|12643730|sp|Q16531.1|DDB1_HUMAN RecName: Full=DNA damage-binding protein 1; AltName: Full=DDB p127
subunit; AltName: Full=DNA damage-binding protein a;
Short=DDBa; AltName: Full=Damage-specific DNA-binding
protein 1; AltName: Full=HBV X-associated protein 1;
Short=XAP-1; AltName: Full=UV-damaged DNA-binding factor;
AltName: Full=UV-damaged DNA-binding protein 1;
Short=UV-DDB 1; AltName: Full=XPE-binding factor;
Short=XPE-BF; AltName: Full=Xeroderma pigmentosum group
E-complementing protein; Short=XPCe
gi|203282525|pdb|3E0C|A Chain A, Crystal Structure Of Dna Damage-Binding Protein 1(Ddb1)
gi|695362|gb|AAA62838.1| X-associated protein 1, partial [Homo sapiens]
gi|1052865|gb|AAC50349.1| DDBa p127 [Homo sapiens]
gi|15079750|gb|AAH11686.1| Damage-specific DNA binding protein 1, 127kDa [Homo sapiens]
gi|29792243|gb|AAH50530.1| Damage-specific DNA binding protein 1, 127kDa [Homo sapiens]
gi|30354567|gb|AAH51764.1| Damage-specific DNA binding protein 1, 127kDa [Homo sapiens]
gi|61354161|gb|AAX44048.1| damage-specific DNA binding protein 1, 127kDa [Homo sapiens]
gi|119594341|gb|EAW73935.1| damage-specific DNA binding protein 1, 127kDa, isoform CRA_c [Homo
sapiens]
gi|168275638|dbj|BAG10539.1| DNA damage-binding protein 1 [synthetic construct]
gi|189065506|dbj|BAG35345.1| unnamed protein product [Homo sapiens]
gi|355566436|gb|EHH22815.1| Damage-specific DNA-binding protein 1 [Macaca mulatta]
gi|380784123|gb|AFE63937.1| DNA damage-binding protein 1 [Macaca mulatta]
gi|380808126|gb|AFE75938.1| DNA damage-binding protein 1 [Macaca mulatta]
gi|380810144|gb|AFE76947.1| DNA damage-binding protein 1 [Macaca mulatta]
gi|383408123|gb|AFH27275.1| DNA damage-binding protein 1 [Macaca mulatta]
gi|410305600|gb|JAA31400.1| damage-specific DNA binding protein 1, 127kDa [Pan troglodytes]
gi|410352015|gb|JAA42611.1| damage-specific DNA binding protein 1, 127kDa [Pan troglodytes]
Length = 1140
Score = 1149 bits (2972), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 578/1094 (52%), Positives = 759/1094 (69%), Gaps = 67/1094 (6%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
+V +YG+IA +ELFRP GE++D LFI T +Y C+L++ ES ++ITRA G+V DRIG
Sbjct: 54 EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RP++ G IGIIDP+CR+IGL LYDGLFKVIP D + +AFNIRLEEL V+D+KFLYGC
Sbjct: 114 RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 173
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PTI +YQD + RHVKTYEV+L++K+F +GPW Q N++ A ++I VP P G +II
Sbjct: 174 QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
G+E+I Y + + + AI I+ S + RVD +GSRYLLGD G L +L++ E++
Sbjct: 233 GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 292
Query: 236 ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
+ L++ELLGETSIA ++YLDN VV++GS GDSQL+KLN+ + +GSYV +E
Sbjct: 293 GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 352
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
+ NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W
Sbjct: 353 FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
LRS + D LV+SF+ +TR+L +N E E+EETE+ GF QT FC + + QL+Q+
Sbjct: 413 LRSDPNRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQI 471
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
TS SVRLVS + L +EWK P +++VA+ N+SQV++A G L YL+I L ++
Sbjct: 472 TSASVRLVSQEPKALVSEWKEPQAKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 530
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
H ++E+E++CLDI P+G++ S + A+G+WTDIS RI LP L+ KE LGGEIIPRS
Sbjct: 531 HTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 590
Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
+L+ FE YLLCALGDG L F LN++TG L+DRKKV+LGTQP LRTF S +TT+VF
Sbjct: 591 ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
A SDRPTVIYSSN KL++SNVNLKEV++MCP NS +PDSLA+A LTIGTID+IQKL
Sbjct: 651 ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 710
Query: 653 HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
HIR++PL E PR+IC+QE S+ F + S +
Sbjct: 711 HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGTTALRPSASTQALSSSVSSSKL 770
Query: 682 --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
S EE E+H + ++D TFE + + EY S++SC D N Y+
Sbjct: 771 FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 830
Query: 734 VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
VGTA V PEE EP +GRI+VF DGKLQ +AEKE KGAVYS+ FNGKLLA+IN ++L
Sbjct: 831 VGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 890
Query: 794 YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
Y+W +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK EG EE A
Sbjct: 891 YEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 946
Query: 854 RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
RD+N NWMSAVEILDDD +LGAEN FNLF +K+S TDEER L+ VG +HLGEFVN
Sbjct: 947 RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 1006
Query: 914 FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
F HGSLVM+ L ++ +V+FGTVNG+IG++ SL Y L +Q L KVIK VG
Sbjct: 1007 FCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066
Query: 973 GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN----------VSV 1022
+ H WRSF+ E+KT A F+DGDLIESFLD+SR +M E+ + +
Sbjct: 1067 KIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATA 1126
Query: 1023 EELCKRVEELTRLH 1036
++L K VEELTR+H
Sbjct: 1127 DDLIKVVEELTRIH 1140
>gi|400260815|pdb|4E54|A Chain A, Damaged Dna Induced Uv-Damaged Dna-Binding Protein (Uv-Ddb)
Dimerization And Its Roles In Chromatinized Dna Repair
gi|401871507|pdb|4E5Z|A Chain A, Damaged Dna Induced Uv-Damaged Dna-Binding Protein (Uv-Ddb)
Dimerization And Its Roles In Chromatinized Dna Repair
Length = 1150
Score = 1149 bits (2972), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 578/1094 (52%), Positives = 759/1094 (69%), Gaps = 67/1094 (6%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
+V +YG+IA +ELFRP GE++D LFI T +Y C+L++ ES ++ITRA G+V DRIG
Sbjct: 64 EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 123
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RP++ G IGIIDP+CR+IGL LYDGLFKVIP D + +AFNIRLEEL V+D+KFLYGC
Sbjct: 124 RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 183
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PTI +YQD + RHVKTYEV+L++K+F +GPW Q N++ A ++I VP P G +II
Sbjct: 184 QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 242
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
G+E+I Y + + + AI I+ S + RVD +GSRYLLGD G L +L++ E++
Sbjct: 243 GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 302
Query: 236 ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
+ L++ELLGETSIA ++YLDN VV++GS GDSQL+KLN+ + +GSYV +E
Sbjct: 303 GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 362
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
+ NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W
Sbjct: 363 FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 422
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
LRS + D LV+SF+ +TR+L +N E E+EETE+ GF QT FC + + QL+Q+
Sbjct: 423 LRSDPNRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQI 481
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
TS SVRLVS + L +EWK P +++VA+ N+SQV++A G L YL+I L ++
Sbjct: 482 TSASVRLVSQEPKALVSEWKEPQAKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 540
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
H ++E+E++CLDI P+G++ S + A+G+WTDIS RI LP L+ KE LGGEIIPRS
Sbjct: 541 HTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 600
Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
+L+ FE YLLCALGDG L F LN++TG L+DRKKV+LGTQP LRTF S +TT+VF
Sbjct: 601 ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 660
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
A SDRPTVIYSSN KL++SNVNLKEV++MCP NS +PDSLA+A LTIGTID+IQKL
Sbjct: 661 ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 720
Query: 653 HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
HIR++PL E PR+IC+QE S+ F + S +
Sbjct: 721 HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGTTALRPSASTQALSSSVSSSKL 780
Query: 682 --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
S EE E+H + ++D TFE + + EY S++SC D N Y+
Sbjct: 781 FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 840
Query: 734 VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
VGTA V PEE EP +GRI+VF DGKLQ +AEKE KGAVYS+ FNGKLLA+IN ++L
Sbjct: 841 VGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 900
Query: 794 YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
Y+W +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK EG EE A
Sbjct: 901 YEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 956
Query: 854 RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
RD+N NWMSAVEILDDD +LGAEN FNLF +K+S TDEER L+ VG +HLGEFVN
Sbjct: 957 RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 1016
Query: 914 FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
F HGSLVM+ L ++ +V+FGTVNG+IG++ SL Y L +Q L KVIK VG
Sbjct: 1017 FCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1076
Query: 973 GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN----------VSV 1022
+ H WRSF+ E+KT A F+DGDLIESFLD+SR +M E+ + +
Sbjct: 1077 KIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATA 1136
Query: 1023 EELCKRVEELTRLH 1036
++L K VEELTR+H
Sbjct: 1137 DDLIKVVEELTRIH 1150
>gi|384941436|gb|AFI34323.1| DNA damage-binding protein 1 [Macaca mulatta]
Length = 1140
Score = 1149 bits (2972), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 578/1094 (52%), Positives = 759/1094 (69%), Gaps = 67/1094 (6%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
+V +YG+IA +ELFRP GE++D LFI T +Y C+L++ ES ++ITRA G+V DRIG
Sbjct: 54 EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RP++ G IGIIDP+CR+IGL LYDGLFKVIP D + +AFNIRLEEL V+D+KFLYGC
Sbjct: 114 RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 173
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PTI +YQD + RHVKTYEV+L++K+F +GPW Q N++ A ++I VP P G +II
Sbjct: 174 QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
G+E+I Y + + + AI I+ S + RVD +GSRYLLGD G L +L++ E++
Sbjct: 233 GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 292
Query: 236 ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
+ L++ELLGETSIA ++YLDN VV++GS GDSQL+KLN+ + +GSYV +E
Sbjct: 293 GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 352
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
+ NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W
Sbjct: 353 FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
LRS + D LV+SF+ +TR+L +N E E+EETE+ GF QT FC + + QL+Q+
Sbjct: 413 LRSDPNRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQI 471
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
TS SVRLVS + L +EWK P +++VA+ N+SQV++A G L YL+I L ++
Sbjct: 472 TSASVRLVSQEPKALVSEWKEPQAKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 530
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
H ++E+E++CLDI P+G++ S + A+G+WTDIS RI LP L+ KE LGGEIIPRS
Sbjct: 531 HTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 590
Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
+L+ FE YLLCALGDG L F LN++TG L+DRKKV+LGTQP LRTF S +TT+VF
Sbjct: 591 ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
A SDRPTVIYSSN KL++SNVNLKEV++MCP NS +PDSLA+A LTIGTID+IQKL
Sbjct: 651 ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 710
Query: 653 HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
HIR++PL E PR+IC+QE S+ F + S +
Sbjct: 711 HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGTTALRPSASTQALSSSVSSSKL 770
Query: 682 --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
S EE E+H + ++D TFE + + EY S++SC D N Y+
Sbjct: 771 FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 830
Query: 734 VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
VGTA V PEE EP +GRI+VF DGKLQ +AEKE KGAVYS+ FNGKLLA+IN ++L
Sbjct: 831 VGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 890
Query: 794 YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
Y+W +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK EG EE A
Sbjct: 891 YEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 946
Query: 854 RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
RD+N NWMSAVEILDDD +LGAEN FNLF +K+S TDEER L+ VG +HLGEFVN
Sbjct: 947 RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 1006
Query: 914 FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
F HGSLVM+ L ++ +V+FGTVNG+IG++ SL Y L +Q L KVIK VG
Sbjct: 1007 FCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066
Query: 973 GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN----------VSV 1022
+ H WRSF+ E+KT A F+DGDLIESFLD+SR +M E+ + +
Sbjct: 1067 KIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATA 1126
Query: 1023 EELCKRVEELTRLH 1036
++L K VEELTR+H
Sbjct: 1127 DDLIKVVEELTRIH 1140
>gi|355752055|gb|EHH56175.1| Damage-specific DNA-binding protein 1, partial [Macaca fascicularis]
Length = 1125
Score = 1149 bits (2972), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 578/1094 (52%), Positives = 759/1094 (69%), Gaps = 67/1094 (6%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
+V +YG+IA +ELFRP GE++D LFI T +Y C+L++ ES ++ITRA G+V DRIG
Sbjct: 39 EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 98
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RP++ G IGIIDP+CR+IGL LYDGLFKVIP D + +AFNIRLEEL V+D+KFLYGC
Sbjct: 99 RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 158
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PTI +YQD + RHVKTYEV+L++K+F +GPW Q N++ A ++I VP P G +II
Sbjct: 159 QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 217
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
G+E+I Y + + + AI I+ S + RVD +GSRYLLGD G L +L++ E++
Sbjct: 218 GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 277
Query: 236 ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
+ L++ELLGETSIA ++YLDN VV++GS GDSQL+KLN+ + +GSYV +E
Sbjct: 278 GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 337
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
+ NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W
Sbjct: 338 FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 397
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
LRS + D LV+SF+ +TR+L +N E E+EETE+ GF QT FC + + QL+Q+
Sbjct: 398 LRSDPNRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQI 456
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
TS SVRLVS + L +EWK P +++VA+ N+SQV++A G L YL+I L ++
Sbjct: 457 TSASVRLVSQEPKALVSEWKEPQAKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 515
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
H ++E+E++CLDI P+G++ S + A+G+WTDIS RI LP L+ KE LGGEIIPRS
Sbjct: 516 HTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 575
Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
+L+ FE YLLCALGDG L F LN++TG L+DRKKV+LGTQP LRTF S +TT+VF
Sbjct: 576 ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 635
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
A SDRPTVIYSSN KL++SNVNLKEV++MCP NS +PDSLA+A LTIGTID+IQKL
Sbjct: 636 ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 695
Query: 653 HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
HIR++PL E PR+IC+QE S+ F + S +
Sbjct: 696 HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGTTALRPSASTQALSSSVSSSKL 755
Query: 682 --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
S EE E+H + ++D TFE + + EY S++SC D N Y+
Sbjct: 756 FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 815
Query: 734 VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
VGTA V PEE EP +GRI+VF DGKLQ +AEKE KGAVYS+ FNGKLLA+IN ++L
Sbjct: 816 VGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 875
Query: 794 YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
Y+W +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK EG EE A
Sbjct: 876 YEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 931
Query: 854 RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
RD+N NWMSAVEILDDD +LGAEN FNLF +K+S TDEER L+ VG +HLGEFVN
Sbjct: 932 RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 991
Query: 914 FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
F HGSLVM+ L ++ +V+FGTVNG+IG++ SL Y L +Q L KVIK VG
Sbjct: 992 FCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1051
Query: 973 GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN----------VSV 1022
+ H WRSF+ E+KT A F+DGDLIESFLD+SR +M E+ + +
Sbjct: 1052 KIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATA 1111
Query: 1023 EELCKRVEELTRLH 1036
++L K VEELTR+H
Sbjct: 1112 DDLIKVVEELTRIH 1125
>gi|74138855|dbj|BAE27231.1| unnamed protein product [Mus musculus]
Length = 1140
Score = 1149 bits (2971), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 578/1094 (52%), Positives = 759/1094 (69%), Gaps = 67/1094 (6%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
+V +YG+IA +ELFRP GE++D LFI T +Y C+L++ ES ++ITRA G+V DRIG
Sbjct: 54 EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RP++ G IGIIDP+CR+IGL LYDGLFKVIP D + +AFNIRLEEL V+D+KFLYGC
Sbjct: 114 RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 173
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PTI +YQD + RHVKTYEV+L++K+F +GPW Q N++ A ++I VP P G +II
Sbjct: 174 QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
G+E+I Y + + + AI I+ S + RVD +GSRYLLGD G L +L++ E++
Sbjct: 233 GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 292
Query: 236 ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
+ L++ELLGETSIA ++YLDN VV++GS GDSQL+KLN+ + +GSYV +E
Sbjct: 293 GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 352
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
+ NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W
Sbjct: 353 FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
LRS D LV+SF+ +TR+L +N E E+EETE+ GF QT FC + + QL+Q+
Sbjct: 413 LRSDPGRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQI 471
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
TS SVRLVS + L +EWK P G +++VA+ N+SQV++A G L YL+I L ++
Sbjct: 472 TSASVRLVSQEPKALVSEWKEPQGKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 530
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
H ++E+E++CLDI P+G++ S + A+G+WTDIS RI LP L+ KE LGGEIIPRS
Sbjct: 531 HTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 590
Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
+L+ FE YLLCALGDG L F LN++TG L+DRKKV+LGTQP LRTF S +TT+VF
Sbjct: 591 ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
A SDRPTVIYSSN KL++SNVNLKEV++MCP NS +P+SLA+A LTIGTID+IQKL
Sbjct: 651 ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPNSLALANNSTLTIGTIDEIQKL 710
Query: 653 HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
HIR++PL E PR+IC+QE S+ F + S +
Sbjct: 711 HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDSSGGTTALRPSASTQALSSSVSSSKL 770
Query: 682 --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
S EE E+H + ++D TFE + + EY S++SC D N Y+
Sbjct: 771 FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 830
Query: 734 VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
VGTA V PEE EP +GRI+VF DGKLQ +AEKE KGAVYS+ FNGKLLA+IN ++L
Sbjct: 831 VGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 890
Query: 794 YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
Y+W +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK EG EE A
Sbjct: 891 YEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 946
Query: 854 RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
RD+N NWMSAVEILDDD +LGAEN FNLF +K+S TDEER L+ VG +HLGEFVN
Sbjct: 947 RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 1006
Query: 914 FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
F HGSLVM+ L ++ +V+FGTVNG+IG++ SL Y L +Q L KVIK VG
Sbjct: 1007 FCHGSLVMQNLGEASTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066
Query: 973 GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN----------VSV 1022
+ H WRSF+ E+KT A F+DGDLIESFLD+SR +M E+ + +
Sbjct: 1067 KIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATA 1126
Query: 1023 EELCKRVEELTRLH 1036
++L K VEELTR+H
Sbjct: 1127 DDLIKVVEELTRIH 1140
>gi|441604084|ref|XP_004087862.1| PREDICTED: LOW QUALITY PROTEIN: DNA damage-binding protein 1
[Nomascus leucogenys]
Length = 1140
Score = 1148 bits (2970), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 577/1094 (52%), Positives = 758/1094 (69%), Gaps = 67/1094 (6%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
+V +YG+IA +ELFRP GE++D LFI T +Y C+L++ ES ++ITRA G+V DRIG
Sbjct: 54 EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RP++ G IGIIDP+CR+IGL LYDGLFKVIP D + +AFNIRLEEL V+D+KFLYGC
Sbjct: 114 RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 173
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PTI +YQD + RHVKTYEV+L++K+F +GPW Q N++ A ++I VP P G +II
Sbjct: 174 QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
G+E+I Y + + + AI I+ S + RVD +GSRYLLGD G L +L++ E++
Sbjct: 233 GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 292
Query: 236 ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
+ L++ELLGETSIA ++YLDN VV++GS GDSQL+KLN+ + +GSYV +E
Sbjct: 293 GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVAAMET 352
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
+ NLGP VD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W
Sbjct: 353 FTNLGPXVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
LRS + D LV+SF+ +TR+L +N E E+EETE+ GF QT FC + + QL+Q+
Sbjct: 413 LRSDPNRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQI 471
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
TS SVRLVS + L +EWK P +++VA+ N+SQV++A G L YL+I L ++
Sbjct: 472 TSASVRLVSQEPKALVSEWKEPQAKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 530
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
H ++E+E++CLDI P+G++ S + A+G+WTDIS RI LP L+ KE LGGEIIPRS
Sbjct: 531 HTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 590
Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
+L+ FE YLLCALGDG L F LN++TG L+DRKKV+LGTQP LRTF S +TT+VF
Sbjct: 591 ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
A SDRPTVIYSSN KL++SNVNLKEV++MCP NS +PDSLA+A LTIGTID+IQKL
Sbjct: 651 ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 710
Query: 653 HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
HIR++PL E PR+IC+QE S+ F + S +
Sbjct: 711 HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGTTALRPSASTQALSSSVSSSKL 770
Query: 682 --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
S EE E+H + ++D TFE + + EY S++SC D N Y+
Sbjct: 771 FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 830
Query: 734 VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
VGTA V PEE EP +GRI+VF DGKLQ +AEKE KGAVYS+ FNGKLLA+IN ++L
Sbjct: 831 VGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 890
Query: 794 YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
Y+W +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK EG EE A
Sbjct: 891 YEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 946
Query: 854 RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
RD+N NWMSAVEILDDD +LGAEN FNLF +K+S TDEER L+ VG +HLGEFVN
Sbjct: 947 RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 1006
Query: 914 FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
F HGSLVM+ L ++ +V+FGTVNG+IG++ SL Y L +Q L KVIK VG
Sbjct: 1007 FCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066
Query: 973 GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN----------VSV 1022
+ H WRSF+ E+KT A F+DGDLIESFLD+SR +M E+ + +
Sbjct: 1067 KIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATA 1126
Query: 1023 EELCKRVEELTRLH 1036
++L K VEELTR+H
Sbjct: 1127 DDLIKVVEELTRIH 1140
>gi|259155222|ref|NP_001158852.1| DNA damage-binding protein 1 [Salmo salar]
gi|223647700|gb|ACN10608.1| DNA damage-binding protein 1 [Salmo salar]
Length = 1139
Score = 1148 bits (2969), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 582/1096 (53%), Positives = 762/1096 (69%), Gaps = 72/1096 (6%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
+V +YG+IA +ELFRP GE++D LFI T +Y C+L++ + ES ++ITRA G+V DRIG
Sbjct: 54 EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQNGESIDIITRAHGNVQDRIG 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RP++ G IGI+DP+CR+IGL LYDGLFKVIP D + + +AFNIRLEELQV+D+ FLYGC
Sbjct: 114 RPSETGIIGIVDPECRMIGLRLYDGLFKVIPLDRENRELKAFNIRLEELQVIDVHFLYGC 173
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PT+ +YQD + RHVKTYEV+L++K+F +GPW Q N++ A ++IPVP P G +II
Sbjct: 174 QAPTVCFIYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIPVPEPFGGAIII 232
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKV 237
G+E+I Y + + + AI I+ S + RVD +GSRYLLGD G L +L++ EKE++
Sbjct: 233 GQESITYHNGDKYLAIAPPTIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLL--EKEEL 290
Query: 238 TG-------LKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVL 290
L++ELLGETSIA ++YLDN VV++GS GDSQL+KLN+ + GSYV V+
Sbjct: 291 MDGAVVLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNDSGSYVAVM 350
Query: 291 ERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGM 350
E + NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+
Sbjct: 351 ETFTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGL 410
Query: 351 WSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLV 410
W LRS D LV+SF+ +TR+L ++ E E+EETE+ GF QT +C + + QL+
Sbjct: 411 WPLRSEAGRETDDMLVLSFVGQTRVLMLSGE-EVEETELPGFVDNLQTFYCGNVAHQQLI 469
Query: 411 QVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTE 470
Q+TSG VRLV S+ L +EWK P G +++VA N+SQV+LA G L YL+I G L +
Sbjct: 470 QITSGGVRLVMQDSKALVSEWKEPQGRNISVAACNSSQVVLAVGRA-LYYLQILSGELKQ 528
Query: 471 VKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIP 530
+ ++E+E++CLDI P+GE S + AVG+WTDIS R+ LP + KE LGGEIIP
Sbjct: 529 ISTVEMEHEVACLDITPLGEGGE-SPLCAVGLWTDISARVLKLPCFTALHKEMLGGEIIP 587
Query: 531 RSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTH 590
RS+L+ FEG YLLCALGDG L F L++ TG L++RKKV+LGTQP LRTF S +T++
Sbjct: 588 RSILMTTFEGGYYLLCALGDGALFYFGLDLTTGVLSERKKVTLGTQPTVLRTFRSLSTSN 647
Query: 591 VFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQ 650
VFA SDRPTVIYSSN KL++SNVNLKEV++MCP NS +PDSLA+A LTIGTID+IQ
Sbjct: 648 VFACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSEGYPDSLALANNSTLTIGTIDEIQ 707
Query: 651 KLHIRSIPLGEHPRRICHQEQSRTFAICSLK----------------------------- 681
KLHIR++PL E PRRIC+QE S+ F + S +
Sbjct: 708 KLHIRTVPLYESPRRICYQEVSQCFGVLSSRVEMQDASGTTAAVRPSASTQALSSSVSSS 767
Query: 682 ----------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVY 731
S EE E+H + ++D TFE + + EY S++SC D VY
Sbjct: 768 KLFPSSTSPHETSFGEEVEVHSLLVVDQHTFEVLHAHQFLQSEYALSMVSCRLGRDPAVY 827
Query: 732 YCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKI 791
+ VGTA V PEE EP +GRI+VF DGKLQ +AEKE KGAVYS+ FNGKLLA+IN +
Sbjct: 828 FIVGTAMVYPEEAEPKQGRIIVFHYTDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTV 887
Query: 792 QLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEE 851
+LY+W +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK EG EE
Sbjct: 888 RLYEWT----AEKELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEE 943
Query: 852 RARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFV 911
ARD+N NWMSAVEILDDD +LGAEN FNLF +K+S TDEER L+ VG +HLGEFV
Sbjct: 944 IARDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGVFHLGEFV 1003
Query: 912 NRFRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKG 970
N F HGSLV++ L +S +V+FGTVNG+IG++ SL Y L LQ L KVIK
Sbjct: 1004 NVFSHGSLVLQNLGESSTPTQGSVLFGTVNGMIGLVTSLSEGWYSLLLDLQNRLNKVIKS 1063
Query: 971 VGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV---------- 1020
VG + H WRSF+ E+KT A F+DGDLIESFLDL R +M E+ T+ +
Sbjct: 1064 VGKIEHSFWRSFHTERKTEQATGFIDGDLIESFLDLGRAKMQEVVSTLQIDDGSGMKREA 1123
Query: 1021 SVEELCKRVEELTRLH 1036
+V+E+ K VEELTR+H
Sbjct: 1124 TVDEVIKIVEELTRIH 1139
>gi|311247551|ref|XP_003122699.1| PREDICTED: DNA damage-binding protein 1-like isoform 1 [Sus scrofa]
Length = 1140
Score = 1148 bits (2969), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 578/1094 (52%), Positives = 760/1094 (69%), Gaps = 67/1094 (6%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
+V +YG+IA +ELFRP GE++D LFI T +Y C+L++ ES ++ITRA G+V DRIG
Sbjct: 54 EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RP++ G IGIIDP+CR+IGL LYDGLFKVIP D + +AFNIRLEEL V+D+KFLYGC
Sbjct: 114 RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 173
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PTI +YQD + RHVKTYEV+L++K+F +GPW Q N++ A ++I VP P G +II
Sbjct: 174 QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
G+E+I Y + + + AI I+ S + RVD +GSRYLLGD G L +L++ E++
Sbjct: 233 GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 292
Query: 236 ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
+ L++ELLGETSIA ++YLDN VV++GS GDSQL+KLN+ + +GSYV +E
Sbjct: 293 GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 352
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
+ NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W
Sbjct: 353 FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
LRS + D LV+SF+ +TR+L +N E E+EETE+ GF QT FC + + QL+Q+
Sbjct: 413 LRSDPNRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQI 471
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
TS SVRLVS + L +EWK P G +++VA+ N++QV++A G L YL+I L ++
Sbjct: 472 TSASVRLVSQEPKALVSEWKEPQGKNISVASCNSNQVVVAVGRA-LYYLQIHPQELRQIS 530
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
H ++E+E++CLDI P+G++ S + A+G+WTDIS RI LP L+ KE LGGEIIPRS
Sbjct: 531 HTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARISKLPSFELLHKEMLGGEIIPRS 590
Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
+L+ FE YLLCALGDG L F LN++TG L+DRKKV+LGTQP LRTF S +TT+VF
Sbjct: 591 ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
A SDRPTVIYSSN KL++SNVNLKEV++MCP NS +PDSLA+A LTIGTID+IQKL
Sbjct: 651 ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 710
Query: 653 HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
HIR++PL E PR+IC+QE S+ F + S +
Sbjct: 711 HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGTTALRPSASTQALSSSVSSSKL 770
Query: 682 --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
S EE E+H + ++D TFE + + EY S++SC D N Y+
Sbjct: 771 FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 830
Query: 734 VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
VGTA V PEE EP +GRI+VF DGKLQ +AEKE KGAVYS+ FNGKLLA+IN ++L
Sbjct: 831 VGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 890
Query: 794 YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
Y+W +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK EG EE A
Sbjct: 891 YEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 946
Query: 854 RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
RD+N NWMSAVEILDDD +LGAEN FNLF +K+S TDEER L+ VG +HLGEFVN
Sbjct: 947 RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 1006
Query: 914 FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
F HGSLVM+ L ++ +V+FGTVNG+IG++ SL Y L +Q L KVIK VG
Sbjct: 1007 FCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066
Query: 973 GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN----------VSV 1022
+ H WRSF+ E+KT A F+DGDLIESFLD+SR +M E+ + +
Sbjct: 1067 KIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATA 1126
Query: 1023 EELCKRVEELTRLH 1036
++L K VEELTR+H
Sbjct: 1127 DDLIKVVEELTRIH 1140
>gi|197097564|ref|NP_001126613.1| DNA damage-binding protein 1 [Pongo abelii]
gi|75041202|sp|Q5R649.1|DDB1_PONAB RecName: Full=DNA damage-binding protein 1; AltName:
Full=Damage-specific DNA-binding protein 1
gi|55732122|emb|CAH92767.1| hypothetical protein [Pongo abelii]
Length = 1140
Score = 1147 bits (2968), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 577/1094 (52%), Positives = 758/1094 (69%), Gaps = 67/1094 (6%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
+V +YG+IA +ELFRP GE++D LFI T +Y C+L++ ES ++ITRA G+V DRIG
Sbjct: 54 EVGMYGKIAVMELFRPKGESKDLLFILTAKYNVCILEYKQSGESIDIITRAHGNVQDRIG 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RP++ G IGIIDP+CR+IGL LYDGLFKVIP D + +AFNIRLEEL V+D+KFLYGC
Sbjct: 114 RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 173
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PTI +YQD + RHVKTYEV+L++K+F +GPW Q N++ A ++I VP P G +II
Sbjct: 174 QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
G+E+I Y + + + AI I+ S + RVD +GSRYLLGD G L +L++ E++
Sbjct: 233 GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 292
Query: 236 ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
+ L++ELLGETSIA ++YLDN VV++GS GDSQL+KLN+ + +GSYV +E
Sbjct: 293 GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 352
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
+ NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W
Sbjct: 353 FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
LRS + D LV+SF+ +TR+L +N E E+EETE+ GF QT FC + + QL+Q+
Sbjct: 413 LRSDPNRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQI 471
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
TS SVRLVS + L +EWK P +++VA+ N+SQV++A G L YL+I L ++
Sbjct: 472 TSASVRLVSQEPKALVSEWKEPQAKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 530
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
H ++E+E++CLDI P+G++ S + A+G+WTDIS RI LP L+ KE LGGEIIPRS
Sbjct: 531 HTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 590
Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
+L+ FE YLLCALGDG L F LN++TG L+DRKKV+LGTQP LRTF S +TT+VF
Sbjct: 591 ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
A SDRPTVIYSSN KL++SNVNLKEV++MCP NS +PDSLA+A LTIGTID+IQKL
Sbjct: 651 ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 710
Query: 653 HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
HIR++PL E PR+IC+QE S+ F + S +
Sbjct: 711 HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGTTALRPSASTQALSSSVSSSKL 770
Query: 682 --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
S EE E+H + ++D TFE + + EY S++SC D N Y+
Sbjct: 771 FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 830
Query: 734 VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
VGTA V PEE EP +GRI+VF DGKLQ +AEKE KGAVY + FNGKLLA+IN ++L
Sbjct: 831 VGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYPMVEFNGKLLASINSTVRL 890
Query: 794 YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
Y+W +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK EG EE A
Sbjct: 891 YEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 946
Query: 854 RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
RD+N NWMSAVEILDDD +LGAEN FNLF +K+S TDEER L+ VG +HLGEFVN
Sbjct: 947 RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 1006
Query: 914 FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
F HGSLVM+ L ++ +V+FGTVNG+IG++ SL Y L +Q L KVIK VG
Sbjct: 1007 FCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066
Query: 973 GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN----------VSV 1022
+ H WRSF+ E+KT A F+DGDLIESFLD+SR +M E+ + +
Sbjct: 1067 KIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATA 1126
Query: 1023 EELCKRVEELTRLH 1036
++L K VEELTR+H
Sbjct: 1127 DDLIKVVEELTRIH 1140
>gi|327278830|ref|XP_003224163.1| PREDICTED: DNA damage-binding protein 1-like [Anolis carolinensis]
Length = 1140
Score = 1147 bits (2968), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 579/1094 (52%), Positives = 760/1094 (69%), Gaps = 67/1094 (6%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
+V +YG+IA +ELFRP GE +D LFI T +Y C+L++ ES ++ITRA G+V DRIG
Sbjct: 54 EVGMYGKIAVMELFRPKGENKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RP++ G IGIIDP+CR+IGL LYDGLFKVIP + + + +AFNIRLEELQV+D+KFLYGC
Sbjct: 114 RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLERENKELKAFNIRLEELQVIDVKFLYGC 173
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PTI +YQD + RHVKTYEV+L++K+F +GPW Q N++ A ++I VP P G +II
Sbjct: 174 QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKV 237
G+E+I Y + + + AI I+ S + RVD +GSRYLLGD G L +L++ E++
Sbjct: 233 GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 292
Query: 238 TG-----LKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
G L++ELLGETSIA ++YLDN VV++GS GDSQL+KLN+ + +GSYV +E
Sbjct: 293 GGVSLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 352
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
+ NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W
Sbjct: 353 FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
LRS D LV+SF+ +TR+L +N E E+EETE+ GF QT FC + + QL+Q+
Sbjct: 413 LRSDPHHETDNTLVLSFVGQTRVLMLNGE-EVEETELTGFVDDQQTFFCGNVAHQQLIQI 471
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
TS SVRLV+ + L +EWK P G +++VA+ N+SQV++A G L YLEI L ++
Sbjct: 472 TSASVRLVTQEPKALVSEWKEPNGKNISVASCNSSQVVVAVGRV-LYYLEIHPQELKQIS 530
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
++E+E++CLDI P+G+ S + A+G+WTDIS RI LP +L+ KE LGGEIIPRS
Sbjct: 531 CTEMEHEVACLDITPLGDASGMSPLCAIGLWTDISARILKLPSFDLLHKEMLGGEIIPRS 590
Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
+L+ FE YLLCALGDG L F LN++TG L+DRKKV+LGTQP LRTF S +TT+VF
Sbjct: 591 ILMTTFENSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
A SDRPTVIYSSN KL++SNVNLKEV++MCP NS +PDSLA+A LTIGTID+IQKL
Sbjct: 651 ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 710
Query: 653 HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
HIR++PL E PR+IC+QE S+ F + S +
Sbjct: 711 HIRTVPLSESPRKICYQEVSQCFGVLSSRIEVQDASGGTTALRPSASTQALSSSVSSSKL 770
Query: 682 --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
S EE E+H + ++D TFE + + EY S++SC D N Y+
Sbjct: 771 FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 830
Query: 734 VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
VGTA V P+E EP +GRI+VF DGKLQ +AEKE KGAVYS+ FNGKLLA+IN ++L
Sbjct: 831 VGTAMVYPDEAEPKQGRIVVFHYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 890
Query: 794 YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
Y+W +EL++EC H+ +I+ALYV+T+GDFI+VGDLM+S+ LL YK EG EE A
Sbjct: 891 YEWT----AEKELRTECNHYNNIMALYVKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 946
Query: 854 RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
RD+N NWMSAVEILDDD +LGAEN FNLF +K+S TDEER L+ G +HLGEFVN
Sbjct: 947 RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEFGLFHLGEFVNV 1006
Query: 914 FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
F HGSLVM+ L ++ +V+FGTVNG+IG++ SL Y L +Q L KVIK VG
Sbjct: 1007 FCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDVQNRLNKVIKSVG 1066
Query: 973 GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV----------SV 1022
+ H WRSF+ E+KT A F+DGDLIESFLD+SR +M E+ + + +V
Sbjct: 1067 KIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQIDDGSGMKREATV 1126
Query: 1023 EELCKRVEELTRLH 1036
++L K VEELTR+H
Sbjct: 1127 DDLIKIVEELTRIH 1140
>gi|5353754|gb|AAD42230.1|AF159853_1 damage-specific DNA binding protein 1 [Mus musculus]
Length = 1140
Score = 1147 bits (2968), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 580/1095 (52%), Positives = 759/1095 (69%), Gaps = 69/1095 (6%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
+V +YG+IA +ELFRP GE++D LFI T +Y C+L++ ES ++ITRA G+V DRIG
Sbjct: 54 EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RP++ G IGIIDP+CR+IGL LYDGLFKVIP D + +AFNIRLEEL V+D+KFLYGC
Sbjct: 114 RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 173
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PTI +YQD + RHVKTYEV+L++K+F +GPW Q N++ A ++I VP P G +II
Sbjct: 174 QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKV 237
G+E+I Y + + + AI I+ S + RVD +GSRYLLGD G L +L++ E E++
Sbjct: 233 GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKE-EQM 291
Query: 238 TG------LKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLE 291
G L++ELLGETSIA ++YLDN VV++GS GDSQL+KLN+ + +GSYV +E
Sbjct: 292 DGTVTPKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAME 351
Query: 292 RYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMW 351
+ NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W
Sbjct: 352 TFTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLW 411
Query: 352 SLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQ 411
LRS D LV+SF+ +TR+L +N E E+EETE+ GF QT FC + + QL+Q
Sbjct: 412 PLRSDPGRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQ 470
Query: 412 VTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEV 471
+TS SVRLVS + L +EWK P G +++VA+ N+SQV++A G L YL+I L ++
Sbjct: 471 ITSASVRLVSQEPKALVSEWKEPQGKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQI 529
Query: 472 KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPR 531
H ++E+E++CLDI P+G++ S + A+G+WTDIS RI LP L+ KE LGGEIIPR
Sbjct: 530 SHTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPR 589
Query: 532 SVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHV 591
S+L+ FE YLLCALGDG L F LN++TG L+DRKKV+LGTQP LRTF S +TT+V
Sbjct: 590 SILMTTFESSRYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNV 649
Query: 592 FAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQK 651
FA SDRPTVIYS N KL++SNVNLKEV++MCP NS +PDSLA+A LTIGTID+IQK
Sbjct: 650 FACSDRPTVIYSXNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQK 709
Query: 652 LHIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------ 681
LHIR++PL E PR+IC+QE S+ F + S +
Sbjct: 710 LHIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDSSGGTTALRPSASTQALSSSVSSSK 769
Query: 682 ---------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYY 732
S EE E+H + ++D TFE + + EY S++SC D N Y+
Sbjct: 770 LFSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYF 829
Query: 733 CVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQ 792
VGTA V PEE EP +GRI+VF DGKLQ +AEKE KGAVYS+ FNGKLLA+IN ++
Sbjct: 830 IVGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVR 889
Query: 793 LYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEER 852
LY+W +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK EG EE
Sbjct: 890 LYEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEI 945
Query: 853 ARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVN 912
ARD+N NWMSAVEILDDD +LGAEN FNLF +K+S TDEER L+ VG +HLGEFVN
Sbjct: 946 ARDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVN 1005
Query: 913 RFRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGV 971
F HGSLVM+ L ++ +V+FGTVNG+IG++ SL Y L +Q L KVIK V
Sbjct: 1006 VFCHGSLVMQNLGEASTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSV 1065
Query: 972 GGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN----------VS 1021
G + H WRSF+ E+KT A F+DGDLIESFLD+SR +M E+ + +
Sbjct: 1066 GKIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREAT 1125
Query: 1022 VEELCKRVEELTRLH 1036
++L K VEELTR+H
Sbjct: 1126 ADDLIKVVEELTRIH 1140
>gi|358440058|pdb|4A08|A Chain A, Structure Of Hsddb1-Drddb2 Bound To A 13 Bp Cpd-Duplex (
Purine At D-1 Position) At 3.0 A Resolution (Cpd 1)
gi|358440062|pdb|4A09|A Chain A, Structure Of Hsddb1-Drddb2 Bound To A 15 Bp Cpd-Duplex
(Purine At D-1 Position) At 3.1 A Resolution (Cpd 2)
Length = 1159
Score = 1147 bits (2967), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 577/1094 (52%), Positives = 758/1094 (69%), Gaps = 67/1094 (6%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
+V +YG+IA +ELFRP GE++D LFI T +Y C+L++ ES ++ITRA G+V DRIG
Sbjct: 73 EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 132
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RP++ G IGIIDP+CR+IGL LYDGLFKVIP D + +AFNIRLEEL V+D+KFLYGC
Sbjct: 133 RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 192
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PTI +YQD + RHVKTY V+L++K+F +GPW Q N++ A ++I VP P G +II
Sbjct: 193 QAPTICFVYQDPQ-GRHVKTYAVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 251
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
G+E+I Y + + + AI I+ S + RVD +GSRYLLGD G L +L++ E++
Sbjct: 252 GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 311
Query: 236 ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
+ L++ELLGETSIA ++YLDN VV++GS GDSQL+KLN+ + +GSYV +E
Sbjct: 312 GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 371
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
+ NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W
Sbjct: 372 FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 431
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
LRS + D LV+SF+ +TR+L +N E E+EETE+ GF QT FC + + QL+Q+
Sbjct: 432 LRSDPNRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQI 490
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
TS SVRLVS + L +EWK P +++VA+ N+SQV++A G L YL+I L ++
Sbjct: 491 TSASVRLVSQEPKALVSEWKEPQAKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 549
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
H ++E+E++CLDI P+G++ S + A+G+WTDIS RI LP L+ KE LGGEIIPRS
Sbjct: 550 HTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 609
Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
+L+ FE YLLCALGDG L F LN++TG L+DRKKV+LGTQP LRTF S +TT+VF
Sbjct: 610 ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 669
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
A SDRPTVIYSSN KL++SNVNLKEV++MCP NS +PDSLA+A LTIGTID+IQKL
Sbjct: 670 ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 729
Query: 653 HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
HIR++PL E PR+IC+QE S+ F + S +
Sbjct: 730 HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGTTALRPSASTQALSSSVSSSKL 789
Query: 682 --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
S EE E+H + ++D TFE + + EY S++SC D N Y+
Sbjct: 790 FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 849
Query: 734 VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
VGTA V PEE EP +GRI+VF DGKLQ +AEKE KGAVYS+ FNGKLLA+IN ++L
Sbjct: 850 VGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 909
Query: 794 YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
Y+W +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK EG EE A
Sbjct: 910 YEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 965
Query: 854 RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
RD+N NWMSAVEILDDD +LGAEN FNLF +K+S TDEER L+ VG +HLGEFVN
Sbjct: 966 RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 1025
Query: 914 FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
F HGSLVM+ L ++ +V+FGTVNG+IG++ SL Y L +Q L KVIK VG
Sbjct: 1026 FCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1085
Query: 973 GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN----------VSV 1022
+ H WRSF+ E+KT A F+DGDLIESFLD+SR +M E+ + +
Sbjct: 1086 KIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATA 1145
Query: 1023 EELCKRVEELTRLH 1036
++L K VEELTR+H
Sbjct: 1146 DDLIKVVEELTRIH 1159
>gi|45383688|ref|NP_989547.1| DNA damage-binding protein 1 [Gallus gallus]
gi|82098863|sp|Q805F9.1|DDB1_CHICK RecName: Full=DNA damage-binding protein 1; AltName: Full=DDB p127
subunit; AltName: Full=Damage-specific DNA-binding
protein 1; AltName: Full=UV-damaged DNA-binding factor
gi|28375613|dbj|BAC56999.1| damaged-DNA binding protein DDB p127 subunit [Gallus gallus]
gi|53130071|emb|CAG31438.1| hypothetical protein RCJMB04_6h2 [Gallus gallus]
Length = 1140
Score = 1147 bits (2967), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 576/1094 (52%), Positives = 761/1094 (69%), Gaps = 67/1094 (6%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
+V +YG+ A +ELFRP GE++D LFI T +Y C+L++ + ++ ++ITRA G+V DRIG
Sbjct: 54 EVGMYGKTAVMELFRPKGESKDLLFILTAKYNACILEYKQNGDNIDIITRAHGNVQDRIG 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RP++ G IGIIDP+CR+IGL LYDGLFKVIP D + + +AFNIRLEELQV+D+KFLYGC
Sbjct: 114 RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRENKELKAFNIRLEELQVIDVKFLYGC 173
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PTI +YQD + RHVKTYEV+L++K+F +GPW Q N++ A ++I VP P G +II
Sbjct: 174 QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
G+E+I Y + + + AI I+ S + RVD +GSRYLLGD G L +L++ E++
Sbjct: 233 GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 292
Query: 236 ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
+ L++ELLGETSIA ++YLDN VV++GS GDSQL+KLN+ + +GSYV +E
Sbjct: 293 GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 352
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
+ NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W
Sbjct: 353 FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
LRS + D LV+SF+ +TR+L +N E E+EETE+ GF QT FC + + QL+Q+
Sbjct: 413 LRSDSHREMDNMLVLSFVGQTRVLMLNGE-EVEETELTGFVDDQQTFFCGNVAHQQLIQI 471
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
TS SVRLVS + L +EWK P G +++VA+ N++QV++A G L YLEI L ++
Sbjct: 472 TSASVRLVSQEPKALVSEWKEPNGKNISVASCNSNQVVVAVGRA-LYYLEIRPQELRQIN 530
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
++E+E++CLDI P+G+ S + A+G+WTDIS RI LP L+ KE LGGEIIPRS
Sbjct: 531 CTEMEHEVACLDITPLGDTNGMSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 590
Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
+L+ FE YLLCALGDG L F L+++TG L+DRKKV+LGTQP LRTF S +TT+VF
Sbjct: 591 ILMTTFESSHYLLCALGDGALFYFGLSLETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
A SDRPTVIYSSN KL++SNVNLKEV++MCP NS +PDSLA+A LTIGTID+IQKL
Sbjct: 651 ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 710
Query: 653 HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
HIR++PL E PR+IC+QE S+ F + S +
Sbjct: 711 HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDASGGTTALRPSASTQALSSSVSTSKL 770
Query: 682 --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
S EE E+H + ++D TFE + + EY S++SC D N Y+
Sbjct: 771 FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 830
Query: 734 VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
VGTA V PEE EP +GRI+VF DGKLQ +AEKE KGAVYS+ FNGKLLA+IN ++L
Sbjct: 831 VGTAMVYPEEAEPKQGRIVVFHYSDGKLQSLAEKEVKGAVYSMVEFNGKLLASINSTVRL 890
Query: 794 YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
Y+W +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK EG EE A
Sbjct: 891 YEWT----AEKELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 946
Query: 854 RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
RD+N NWMSAVEILDDD +LGAEN FNLF +K+S TDEER L+ VG HLGEFVN
Sbjct: 947 RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLSHLGEFVNV 1006
Query: 914 FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
F HGSLVM+ L ++ +V+FGTVNG+IG++ SL Y L +Q L KVIK VG
Sbjct: 1007 FCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066
Query: 973 GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV----------SV 1022
+ H WRSF+ E+KT A F+DGDLIESFLD+SR +M E+ + + +V
Sbjct: 1067 KIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQIDDGSGMKREATV 1126
Query: 1023 EELCKRVEELTRLH 1036
++L K VEELTR+H
Sbjct: 1127 DDLIKIVEELTRIH 1140
>gi|223647932|gb|ACN10724.1| DNA damage-binding protein 1 [Salmo salar]
Length = 1139
Score = 1146 bits (2964), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 580/1096 (52%), Positives = 763/1096 (69%), Gaps = 72/1096 (6%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
+V +YG+IA +ELFRP GE++D LFI T +Y C+L++ + ES ++ITRA G+V DRIG
Sbjct: 54 EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQNGESIDIITRAHGNVQDRIG 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RP++ G IGI+DP+CR+IGL LYDGLFKVIP D + + +AFNIRLEELQV+D+ FLYGC
Sbjct: 114 RPSETGIIGIVDPECRMIGLRLYDGLFKVIPLDRENRELKAFNIRLEELQVIDVHFLYGC 173
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PT+ +YQD + RHVKTYEV+L++K+F +GPW Q N++ A ++IPVP P G +II
Sbjct: 174 QAPTVCFIYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIPVPEPFGGAIII 232
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKV 237
G+E+I Y + + + AI I+ S + RVD +GSRYLLGD G L +L++ EKE++
Sbjct: 233 GQESITYHNGDKYLAIAPPTIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLL--EKEEL 290
Query: 238 TG-------LKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVL 290
L++ELLGETSIA ++YLDN VV++GS GDSQL+KLN+ + GSYV V+
Sbjct: 291 MDGAVVLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNDSGSYVAVM 350
Query: 291 ERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGM 350
E + NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+
Sbjct: 351 ETFTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGL 410
Query: 351 WSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLV 410
W LRS D LV+SF+ +TR+L ++ E E+EETE+ GF QT +C + + QL+
Sbjct: 411 WPLRSEAGRETDDMLVLSFVGQTRVLMLSGE-EVEETELPGFVDNLQTFYCGNVAHQQLI 469
Query: 411 QVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTE 470
Q+TSG VRLV S+ L +EWK P G +++VA N+SQV+LA G L YL+I G L +
Sbjct: 470 QITSGGVRLVMQDSKALVSEWKEPQGRNISVAACNSSQVVLAVGRA-LYYLQILSGELKQ 528
Query: 471 VKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIP 530
+ ++E+E++CLDI P+GE+ S + AVG+WTDIS R+ LP + KE LGGEIIP
Sbjct: 529 ISTVEMEHEVACLDITPLGEDGD-SPLCAVGLWTDISARVLKLPCFTALHKEMLGGEIIP 587
Query: 531 RSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTH 590
RS+L+ FE YLLCALGDG L F L++ +G L++RKKV+LGTQP LRTF S +T++
Sbjct: 588 RSILMTTFEASYYLLCALGDGALFYFGLDLTSGVLSERKKVTLGTQPTVLRTFRSLSTSN 647
Query: 591 VFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQ 650
VFA SDRPTVIYSSN KL++SNVNLKEV++MCP NS +PDSLA+A LTIGTID+IQ
Sbjct: 648 VFACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSEGYPDSLALANNSTLTIGTIDEIQ 707
Query: 651 KLHIRSIPLGEHPRRICHQEQSRTFAICSLK----------------------------- 681
KLHIR++PL E PRRIC+QE S+ F + S +
Sbjct: 708 KLHIRTVPLYESPRRICYQEVSQCFGVLSSRVEMQDASGTTAAVRPSASTQALSSSVSSS 767
Query: 682 ----------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVY 731
S EE E+H + ++D TFE + + EY S++SC D +VY
Sbjct: 768 KLFPSSTSPHETSFGEEVEVHSLLVVDQHTFEVLHAHQFLQSEYALSMVSCRLGRDLSVY 827
Query: 732 YCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKI 791
+ VGTA V PEE EP +GRI+VF DGKLQ +AEKE KGAVYS+ FNGKLLA+IN +
Sbjct: 828 FIVGTAMVYPEEAEPKQGRIIVFHYTDGKLQTVAEKEVKGAVYSMMEFNGKLLASINSTV 887
Query: 792 QLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEE 851
+LY+W +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK EG EE
Sbjct: 888 RLYEWT----AEKELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEE 943
Query: 852 RARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFV 911
ARD+N NWMSAVEILDDD +LGAEN FNLF +K+S TDEER L+ VG +HLGEFV
Sbjct: 944 IARDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGVFHLGEFV 1003
Query: 912 NRFRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKG 970
N F HGSLV++ L +S +V+FGTVNG+IG++ SL Y L LQ L KVIK
Sbjct: 1004 NVFSHGSLVLQNLGESSTPTQGSVLFGTVNGMIGLVTSLSEGWYSLLLDLQNRLNKVIKS 1063
Query: 971 VGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV---------- 1020
VG + H WRSF+ E+KT A F+DGDLIESFLDL R +M E+ T+ +
Sbjct: 1064 VGKIEHSFWRSFHTERKTEQATGFIDGDLIESFLDLGRAKMQEVVSTLQIDDGSGMKREA 1123
Query: 1021 SVEELCKRVEELTRLH 1036
+V+E+ K VEELTR+H
Sbjct: 1124 TVDEVIKIVEELTRIH 1139
>gi|224050582|ref|XP_002191856.1| PREDICTED: DNA damage-binding protein 1 [Taeniopygia guttata]
Length = 1140
Score = 1145 bits (2961), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 577/1094 (52%), Positives = 760/1094 (69%), Gaps = 67/1094 (6%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
+V +YG+ A +ELFRP GE++D LFI T +Y C+L++ + +S ++ITRA G+V DRIG
Sbjct: 54 EVGMYGKTAVMELFRPKGESKDLLFILTAKYNACILEYKQNGDSIDIITRAHGNVQDRIG 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RP++ G IGIIDP+CR+IGL LYDGLFKVIP D + +AFNIRLEELQV+D+KFLYGC
Sbjct: 114 RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELQVIDVKFLYGC 173
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PTI +YQD + RHVKTYEV+L++K+F +GPW Q N++ A ++I VP P G +II
Sbjct: 174 QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
G+E+I Y + + + AI I+ S + RVD +GSRYLLGD G L +L++ E++
Sbjct: 233 GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 292
Query: 236 ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
+ L++ELLGETSIA ++YLDN VV++GS GDSQL+KLN+ + +GSYV +E
Sbjct: 293 GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 352
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
+ NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W
Sbjct: 353 FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
LRS D LV+SF+ +TR+L +N E E+EETE+ GF QT FC + + QL+Q+
Sbjct: 413 LRSDPHRETDNTLVLSFVGQTRVLMLNGE-EVEETELTGFVDDQQTFFCGNVAHQQLIQI 471
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
TS SVRLVS + L +EWK P G +++VA+ N++QV++A G L YLEI L ++
Sbjct: 472 TSASVRLVSQEPKSLVSEWKEPNGKNISVASCNSNQVVVAVGRA-LYYLEIRPQELRQIS 530
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
++E+E++CLDI P+G++ S + A+G+WTDIS RI LP L+ KE LGGEIIPRS
Sbjct: 531 CTEMEHEVACLDITPLGDSNGMSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 590
Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
+L+ FE YLLCALGDG L F L+++TG L+DRKKV+LGTQP LRTF S +TT+VF
Sbjct: 591 ILMTTFESSHYLLCALGDGALFYFGLSLETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
A SDRPTVIYSSN KL++SNVNLKEV++MCP NS +PDSLA+A LTIGTID+IQKL
Sbjct: 651 ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 710
Query: 653 HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
HIR++PL E PR+IC+QE S+ F + S +
Sbjct: 711 HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDASGGTTALRPSASTQALSSSVSTSKL 770
Query: 682 --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
S EE E+H + ++D TFE + + EY S++SC D N Y+
Sbjct: 771 FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 830
Query: 734 VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
VGTA V PEE EP +GRI+VF DGKLQ +AEKE KGAVYS+ FNGKLLA+IN ++L
Sbjct: 831 VGTAMVYPEEAEPKQGRIVVFHYSDGKLQSLAEKEVKGAVYSMVEFNGKLLASINSTVRL 890
Query: 794 YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
Y+W +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK EG EE A
Sbjct: 891 YEWT----AEKELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 946
Query: 854 RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
RD+N NWMSAVEILDDD +LGAEN FNLF +K+S TDEER L+ VG HLGEFVN
Sbjct: 947 RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLSHLGEFVNV 1006
Query: 914 FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
F HGSLVM+ L ++ +V+FGTVNG+IG++ SL Y L +Q L KVIK VG
Sbjct: 1007 FCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066
Query: 973 GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV----------SV 1022
+ H WRSF+ E+KT A F+DGDLIESFLD+SR +M E+ + + +V
Sbjct: 1067 KIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQIDDGSGMKREATV 1126
Query: 1023 EELCKRVEELTRLH 1036
++L K VEELTR+H
Sbjct: 1127 DDLIKIVEELTRIH 1140
>gi|90108797|pdb|2B5L|A Chain A, Crystal Structure Of Ddb1 In Complex With Simian Virus 5 V
Protein
gi|90108798|pdb|2B5L|B Chain B, Crystal Structure Of Ddb1 In Complex With Simian Virus 5 V
Protein
gi|90108801|pdb|2B5M|A Chain A, Crystal Structure Of Ddb1
gi|116667897|pdb|2HYE|A Chain A, Crystal Structure Of The Ddb1-cul4a-rbx1-sv5v Complex
gi|1136228|gb|AAA88883.1| UV-damaged DNA binding factor [Homo sapiens]
gi|1588524|prf||2208446A xeroderma pigmentosum group E-binding factor
Length = 1140
Score = 1145 bits (2961), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 578/1095 (52%), Positives = 761/1095 (69%), Gaps = 69/1095 (6%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
+V +YG+IA +ELFRP GE++D LFI T +Y C+L++ ES ++ITRA G+V DRIG
Sbjct: 54 EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RP++ G IGIIDP+CR+IGL LYDGLFKVIP D + +AFNIRLEEL V+D+KFLYGC
Sbjct: 114 RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 173
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PTI +YQD + RHVKTYEV+L++K+F +GPW Q N++ A ++I VP P G +II
Sbjct: 174 QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
G+E+I Y + + + AI I+ S + RVD +GSRYLLGD G L +L++ E++
Sbjct: 233 GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 292
Query: 236 ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
+ L++ELLGETSIA ++YLDN VV++GS GDSQL+KLN+ + +GSYV +E
Sbjct: 293 GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 352
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
+ NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W
Sbjct: 353 FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412
Query: 353 LRSSTD-DPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQ 411
LRS + + +DT LV+SF+ +TR+L +N E E+EETE+ GF QT FC + + QL+Q
Sbjct: 413 LRSDPNRETYDT-LVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQ 470
Query: 412 VTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEV 471
+TS SVRLVS + L +EWK P +++VA+ N+SQV++A G L YL+I L ++
Sbjct: 471 ITSASVRLVSQEPKALVSEWKEPQAKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQI 529
Query: 472 KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPR 531
H ++E+E++CLDI P+G++ S + A+G+WTDIS RI LP L+ KE LGGEIIPR
Sbjct: 530 SHTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPR 589
Query: 532 SVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHV 591
S+L+ FE YLLCALGDG L F LN++TG L+DRKKV+LGTQP LRTF S +TT+V
Sbjct: 590 SILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNV 649
Query: 592 FAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQK 651
FA SDRPTVIYSSN KL++SNVNLKEV++MCP NS +PDSLA+A LTIGTID+IQK
Sbjct: 650 FACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQK 709
Query: 652 LHIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------ 681
LHIR++PL E PR+IC+QE S+ F + S +
Sbjct: 710 LHIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGTTALRPSASTQALSSSVSSSK 769
Query: 682 ---------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYY 732
S EE E+H + ++D TFE + + EY S++SC D N Y+
Sbjct: 770 LFSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYF 829
Query: 733 CVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQ 792
VGTA V PEE EP +GRI+VF DGKLQ +AEKE KGAVYS+ FNGKLLA+IN ++
Sbjct: 830 IVGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVR 889
Query: 793 LYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEER 852
LY+W D +++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK EG EE
Sbjct: 890 LYEWTTEKD----VRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEI 945
Query: 853 ARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVN 912
ARD+N NWMSAVEILDDD +LGAEN FNLF +K+S TDEER L+ VG +HLGEFVN
Sbjct: 946 ARDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVN 1005
Query: 913 RFRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGV 971
F HGSLVM+ L ++ +V+FGTVNG+IG++ SL Y L +Q L KVIK V
Sbjct: 1006 VFCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSV 1065
Query: 972 GGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN----------VS 1021
G + H WRSF+ E+KT A F+DGDLIESFLD+SR +M E+ + +
Sbjct: 1066 GKIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREAT 1125
Query: 1022 VEELCKRVEELTRLH 1036
++L K VEELTR+H
Sbjct: 1126 ADDLIKVVEELTRIH 1140
>gi|270346571|pdb|3I7H|A Chain A, Crystal Structure Of Ddb1 In Complex With The H-Box Motif Of
Hbx
gi|270346573|pdb|3I7K|A Chain A, Crystal Structure Of Ddb1 In Complex With The H-Box Motif Of
Whx
gi|270346575|pdb|3I7L|A Chain A, Crystal Structure Of Ddb1 In Complex With The H-Box Motif Of
Ddb2
gi|270346577|pdb|3I7N|A Chain A, Crystal Structure Of Ddb1 In Complex With The H-Box Motif Of
Wdtc1
gi|270346579|pdb|3I7O|A Chain A, Crystal Structure Of Ddb1 In Complex With The H-Box Motif Of
Iqwd1
gi|270346581|pdb|3I7P|A Chain A, Crystal Structure Of Ddb1 In Complex With The H-Box Motif Of
Wdr40a
gi|270346583|pdb|3I89|A Chain A, Crystal Structure Of Ddb1 In Complex With The H-Box Motif Of
Wdr22
gi|270346585|pdb|3I8C|A Chain A, Crystal Structure Of Ddb1 In Complex With The H-Box Motif Of
Wdr21a
gi|270346587|pdb|3I8E|A Chain A, Crystal Structure Of Ddb1 In Complex With The H-Box Motif Of
Wdr42a
gi|270346588|pdb|3I8E|B Chain B, Crystal Structure Of Ddb1 In Complex With The H-Box Motif Of
Wdr42a
Length = 1143
Score = 1144 bits (2960), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 578/1095 (52%), Positives = 761/1095 (69%), Gaps = 69/1095 (6%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
+V +YG+IA +ELFRP GE++D LFI T +Y C+L++ ES ++ITRA G+V DRIG
Sbjct: 57 EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 116
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RP++ G IGIIDP+CR+IGL LYDGLFKVIP D + +AFNIRLEEL V+D+KFLYGC
Sbjct: 117 RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 176
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PTI +YQD + RHVKTYEV+L++K+F +GPW Q N++ A ++I VP P G +II
Sbjct: 177 QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 235
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
G+E+I Y + + + AI I+ S + RVD +GSRYLLGD G L +L++ E++
Sbjct: 236 GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 295
Query: 236 ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
+ L++ELLGETSIA ++YLDN VV++GS GDSQL+KLN+ + +GSYV +E
Sbjct: 296 GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 355
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
+ NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W
Sbjct: 356 FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 415
Query: 353 LRSSTD-DPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQ 411
LRS + + +DT LV+SF+ +TR+L +N E E+EETE+ GF QT FC + + QL+Q
Sbjct: 416 LRSDPNRETYDT-LVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQ 473
Query: 412 VTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEV 471
+TS SVRLVS + L +EWK P +++VA+ N+SQV++A G L YL+I L ++
Sbjct: 474 ITSASVRLVSQEPKALVSEWKEPQAKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQI 532
Query: 472 KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPR 531
H ++E+E++CLDI P+G++ S + A+G+WTDIS RI LP L+ KE LGGEIIPR
Sbjct: 533 SHTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPR 592
Query: 532 SVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHV 591
S+L+ FE YLLCALGDG L F LN++TG L+DRKKV+LGTQP LRTF S +TT+V
Sbjct: 593 SILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNV 652
Query: 592 FAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQK 651
FA SDRPTVIYSSN KL++SNVNLKEV++MCP NS +PDSLA+A LTIGTID+IQK
Sbjct: 653 FACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQK 712
Query: 652 LHIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------ 681
LHIR++PL E PR+IC+QE S+ F + S +
Sbjct: 713 LHIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGTTALRPSASTQALSSSVSSSK 772
Query: 682 ---------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYY 732
S EE E+H + ++D TFE + + EY S++SC D N Y+
Sbjct: 773 LFSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYF 832
Query: 733 CVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQ 792
VGTA V PEE EP +GRI+VF DGKLQ +AEKE KGAVYS+ FNGKLLA+IN ++
Sbjct: 833 IVGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVR 892
Query: 793 LYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEER 852
LY+W D +++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK EG EE
Sbjct: 893 LYEWTTEKD----VRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEI 948
Query: 853 ARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVN 912
ARD+N NWMSAVEILDDD +LGAEN FNLF +K+S TDEER L+ VG +HLGEFVN
Sbjct: 949 ARDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVN 1008
Query: 913 RFRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGV 971
F HGSLVM+ L ++ +V+FGTVNG+IG++ SL Y L +Q L KVIK V
Sbjct: 1009 VFCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSV 1068
Query: 972 GGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN----------VS 1021
G + H WRSF+ E+KT A F+DGDLIESFLD+SR +M E+ + +
Sbjct: 1069 GKIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREAT 1128
Query: 1022 VEELCKRVEELTRLH 1036
++L K VEELTR+H
Sbjct: 1129 ADDLIKVVEELTRIH 1143
>gi|221046721|pdb|3EI4|A Chain A, Structure Of The Hsddb1-Hsddb2 Complex
gi|221046723|pdb|3EI4|C Chain C, Structure Of The Hsddb1-Hsddb2 Complex
gi|221046725|pdb|3EI4|E Chain E, Structure Of The Hsddb1-Hsddb2 Complex
Length = 1158
Score = 1144 bits (2960), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 578/1095 (52%), Positives = 761/1095 (69%), Gaps = 69/1095 (6%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
+V +YG+IA +ELFRP GE++D LFI T +Y C+L++ ES ++ITRA G+V DRIG
Sbjct: 72 EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 131
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RP++ G IGIIDP+CR+IGL LYDGLFKVIP D + +AFNIRLEEL V+D+KFLYGC
Sbjct: 132 RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 191
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PTI +YQD + RHVKTYEV+L++K+F +GPW Q N++ A ++I VP P G +II
Sbjct: 192 QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 250
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
G+E+I Y + + + AI I+ S + RVD +GSRYLLGD G L +L++ E++
Sbjct: 251 GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 310
Query: 236 ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
+ L++ELLGETSIA ++YLDN VV++GS GDSQL+KLN+ + +GSYV +E
Sbjct: 311 GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 370
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
+ NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W
Sbjct: 371 FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 430
Query: 353 LRSSTD-DPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQ 411
LRS + + +DT LV+SF+ +TR+L +N E E+EETE+ GF QT FC + + QL+Q
Sbjct: 431 LRSDPNRETYDT-LVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQ 488
Query: 412 VTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEV 471
+TS SVRLVS + L +EWK P +++VA+ N+SQV++A G L YL+I L ++
Sbjct: 489 ITSASVRLVSQEPKALVSEWKEPQAKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQI 547
Query: 472 KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPR 531
H ++E+E++CLDI P+G++ S + A+G+WTDIS RI LP L+ KE LGGEIIPR
Sbjct: 548 SHTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPR 607
Query: 532 SVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHV 591
S+L+ FE YLLCALGDG L F LN++TG L+DRKKV+LGTQP LRTF S +TT+V
Sbjct: 608 SILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNV 667
Query: 592 FAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQK 651
FA SDRPTVIYSSN KL++SNVNLKEV++MCP NS +PDSLA+A LTIGTID+IQK
Sbjct: 668 FACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQK 727
Query: 652 LHIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------ 681
LHIR++PL E PR+IC+QE S+ F + S +
Sbjct: 728 LHIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGTTALRPSASTQALSSSVSSSK 787
Query: 682 ---------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYY 732
S EE E+H + ++D TFE + + EY S++SC D N Y+
Sbjct: 788 LFSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYF 847
Query: 733 CVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQ 792
VGTA V PEE EP +GRI+VF DGKLQ +AEKE KGAVYS+ FNGKLLA+IN ++
Sbjct: 848 IVGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVR 907
Query: 793 LYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEER 852
LY+W D +++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK EG EE
Sbjct: 908 LYEWTTEKD----VRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEI 963
Query: 853 ARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVN 912
ARD+N NWMSAVEILDDD +LGAEN FNLF +K+S TDEER L+ VG +HLGEFVN
Sbjct: 964 ARDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVN 1023
Query: 913 RFRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGV 971
F HGSLVM+ L ++ +V+FGTVNG+IG++ SL Y L +Q L KVIK V
Sbjct: 1024 VFCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSV 1083
Query: 972 GGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN----------VS 1021
G + H WRSF+ E+KT A F+DGDLIESFLD+SR +M E+ + +
Sbjct: 1084 GKIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREAT 1143
Query: 1022 VEELCKRVEELTRLH 1036
++L K VEELTR+H
Sbjct: 1144 ADDLIKVVEELTRIH 1158
>gi|147906138|ref|NP_001083624.1| DNA damage-binding protein 1 [Xenopus laevis]
gi|82186503|sp|Q6P6Z0.1|DDB1_XENLA RecName: Full=DNA damage-binding protein 1; AltName:
Full=Damage-specific DNA-binding protein 1
gi|38303806|gb|AAH61946.1| Ddb1 protein [Xenopus laevis]
Length = 1140
Score = 1144 bits (2958), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 579/1094 (52%), Positives = 759/1094 (69%), Gaps = 67/1094 (6%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
+V +YG+IA +ELFRP GE++D LFI T +Y C+L++ +S ++ITRA G+V DRIG
Sbjct: 54 EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGDSIDIITRAHGNVQDRIG 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RP++ G IGIIDPDCR+IGL LYDGLFKVIP + + +AFNIRLEEL V+D+KFLY C
Sbjct: 114 RPSETGIIGIIDPDCRMIGLRLYDGLFKVIPLERDNKELKAFNIRLEELHVIDVKFLYSC 173
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PTI +YQD + RHVKTYEV+L++K+F +GPW Q N++ A ++I VP P G +II
Sbjct: 174 QAPTICFVYQDPQ-GRHVKTYEVSLREKEFSKGPWKQENVEAEASMVIAVPEPFGGAIII 232
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
G+E+I Y + + + AI I+ S + RVD +GSRYLLGD G L +L++ E++
Sbjct: 233 GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDVNGSRYLLGDMEGRLFMLLLEKEEQMD 292
Query: 236 ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
+ L++ELLGETSIA ++YLDN VV++GS GDSQL+KL + + +GSYV V+E
Sbjct: 293 GSVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLTTESNEQGSYVVVMET 352
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
+ NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W
Sbjct: 353 FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
LR + D D LV+SF+ +TR+L + E E+EET++ GF QT FC + + QL+Q+
Sbjct: 413 LRVAADRDTDDTLVLSFVGQTRVLTLTGE-EVEETDLAGFVDDQQTFFCGNVAHQQLIQI 471
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
TS SVRLVS + L +EWK P G V+V + N+ QVLLA G L YLEI G L +
Sbjct: 472 TSASVRLVSQNPQNLVSEWKEPQGRKVSVCSCNSRQVLLAVGR-VLYYLEIHPGELRQTS 530
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
++E+E++CLD+ P+G N + S + A+G+WTDIS RI SLP L+ KE LGGEIIPRS
Sbjct: 531 CTEMEHEVACLDVTPLGGNDTLSSLCAIGLWTDISARILSLPGFQLLHKEMLGGEIIPRS 590
Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
+L+ +FE YLLCALGDG L F LN TG L+DRKKV+LGTQP LRTF S +TT+VF
Sbjct: 591 ILMTSFESSHYLLCALGDGALFYFSLNTDTGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
A SDRPTVIYSSN KL++SNVNLKEV++MCP NS +PDSLA+A LTIGTID+IQKL
Sbjct: 651 ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSEGYPDSLALANNSTLTIGTIDEIQKL 710
Query: 653 HIRSIPLGEHPRRICHQEQSRTFAICSLKNQ------------------------SCA-- 686
HIR++PL E PR+IC+QE S+ F + S + + SC+
Sbjct: 711 HIRTVPLFESPRKICYQEVSQCFGVLSSRIEVQDASGGSSPLRPSASTQALSSSVSCSKL 770
Query: 687 -------------EESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
EE E+H + ++D TFE + T+ EY S++SC D Y+
Sbjct: 771 FSGSTSPHETSFGEEVEVHNLLIIDQHTFEVLHTHQFLQNEYTLSLVSCKLGKDPTTYFV 830
Query: 734 VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
VGTA V P+E EP +GRI+VF DGKLQ +AEKE KGAVYS+ FNGKLLA+IN ++L
Sbjct: 831 VGTAMVYPDEAEPKQGRIVVFQYNDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 890
Query: 794 YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
Y+W +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK EG EE A
Sbjct: 891 YEWT----AEKELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 946
Query: 854 RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
RD+N NWMSAVEILDDD +LGAEN FNLF +K+S TDEER L+ VG +HLGEFVN
Sbjct: 947 RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 1006
Query: 914 FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
F HGSLVM+ L ++ +V+FGTVNG+IG++ SL Y L +Q L KVIK VG
Sbjct: 1007 FCHGSLVMQNLGETSPPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDVQNRLNKVIKSVG 1066
Query: 973 GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV----------SV 1022
+ H WRSF+ E+KT A F+DGDLIESFLD+SR +M E+ + + +V
Sbjct: 1067 KIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVIANLQIDDGSGMKRETTV 1126
Query: 1023 EELCKRVEELTRLH 1036
++L K VEELTR+H
Sbjct: 1127 DDLIKVVEELTRIH 1140
>gi|2632123|emb|CAA05770.1| Xeroderma Pigmentosum Group E Complementing protein [Homo sapiens]
Length = 1140
Score = 1143 bits (2957), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 576/1094 (52%), Positives = 756/1094 (69%), Gaps = 67/1094 (6%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
+V +YG+IA +ELFRP GE++D LFI T +Y C+L++ ES ++ITRA G+V DRIG
Sbjct: 54 EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RP++ G IGIIDP+CR+IGL LYDGLFKVIP D + +AFNIRLEEL V+D+KFLYGC
Sbjct: 114 RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 173
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PTI +YQD + RHVKTYEV+L++K+F +GPW Q N++ A ++I VP P G +II
Sbjct: 174 QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
G+E+I Y + + + AI I+ S + RVD +GSRYLLGD G L +L++ E++
Sbjct: 233 GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 292
Query: 236 ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
+ L++ELLGETSIA ++YLDN VV++GS GDSQL+KLN+ + +GSYV +E
Sbjct: 293 GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 352
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
+ NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W
Sbjct: 353 FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
LRS + D LV+SF+ +TR+L +N E E+EETE+ GF QT FC + + QL+Q+
Sbjct: 413 LRSDPNRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQI 471
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
TS SVRLVS + L +EWK P +++VA+ N+SQV++A G L YL+I L ++
Sbjct: 472 TSASVRLVSQEPKALVSEWKEPQAKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 530
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
H +E+E++CLDI P+G++ S + A+G+WTDIS RI LP L+ KE LGGEIIPRS
Sbjct: 531 HTGMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 590
Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
+L+ FE YLLCALGDG L F LN++TG L+DRKKV+LGTQP LRTF S +TT+VF
Sbjct: 591 ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
A SDRPTVIYSSN KL++SNVNLKEV++MCP NS +PDSLA+A LTIGTID+IQKL
Sbjct: 651 ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 710
Query: 653 HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
HIR++PL E PR+IC+QE S+ F + S +
Sbjct: 711 HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGTTALRPSASTQALSSSVSSSKL 770
Query: 682 --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
S EE E+H + ++D TFE + + EY S++SC D N Y+
Sbjct: 771 FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 830
Query: 734 VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
VGTA V PEE EP +GRI+VF DGKLQ +AEKE KG VYS+ FNGKLLA+IN ++L
Sbjct: 831 VGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGDVYSMVEFNGKLLASINSTVRL 890
Query: 794 YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
Y+W D +++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK EG EE A
Sbjct: 891 YEWTTEKD----VRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 946
Query: 854 RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
RD+N NWMSAVEILDDD +LGAEN FNLF +K+S TDEER L+ VG +HLGEFVN
Sbjct: 947 RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 1006
Query: 914 FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
F HGSLVM+ L ++ +V+FGTVNG+IG++ SL Y L +Q L KVIK VG
Sbjct: 1007 FCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066
Query: 973 GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN----------VSV 1022
+ H WRSF+ E+KT A F+DGDLIESFLD+SR +M E+ + +
Sbjct: 1067 KIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATA 1126
Query: 1023 EELCKRVEELTRLH 1036
++L K VEELTR+H
Sbjct: 1127 DDLIKVVEELTRIH 1140
>gi|431910407|gb|ELK13480.1| DNA damage-binding protein 1 [Pteropus alecto]
Length = 1143
Score = 1143 bits (2957), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 578/1097 (52%), Positives = 760/1097 (69%), Gaps = 70/1097 (6%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
+V +YG+IA +ELFRP GE++D LFI T +Y C+L++ ES ++ITRA G+V DRIG
Sbjct: 54 EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RP++ G IGIIDP+CR+IGL LYDGLFKVIP D + +AFNIRLEEL V+D+KFLYGC
Sbjct: 114 RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 173
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PTI +YQD + RHVKTYEV+L++K+F +GPW Q N++ A ++I VP P G +II
Sbjct: 174 QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
G+E+I Y + + + AI I+ S + RVD +GSRYLLGD G L +L++ E++
Sbjct: 233 GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 292
Query: 236 ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
+ L++ELLGETSIA ++YLDN VV++GS GDSQL+KLN+ + +GSYV +E
Sbjct: 293 GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 352
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
+ NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W
Sbjct: 353 FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
LRS + D LV+SF+ +TR+L +N E E+EETE+ GF QT FC + + QL+Q+
Sbjct: 413 LRSDPNRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQI 471
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
TS SVRLVS + L +EWK P G +++VA+ N+SQV++A G L YL+I L ++
Sbjct: 472 TSASVRLVSQEPKALVSEWKEPQGKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 530
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
H ++E+E++CLDI P+G++ S + A+G+WTDIS RI LP L+ KE LGGEIIPRS
Sbjct: 531 HTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 590
Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
+L+ FE YLLCALGDG L F LN++TG L+DRKKV+LGTQP LRTF S +TT+VF
Sbjct: 591 ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
A SDRPTVIYSSN KL++SNVNLKEV++MCP NS +PDSLA+A LTIGTID+IQKL
Sbjct: 651 ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 710
Query: 653 HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
HIR++PL E PR+IC+QE S+ F + S +
Sbjct: 711 HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGTTALRPSASTQALSSSVSSSKL 770
Query: 682 --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
S EE E+H + ++D TFE + + EY S++SC D N Y+
Sbjct: 771 FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 830
Query: 734 VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
VGTA V PEE EP +GRI+VF DGKLQ +AEKE KGAVYS+ FNGKLLA+IN ++L
Sbjct: 831 VGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 890
Query: 794 YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
Y+W +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK EG EE A
Sbjct: 891 YEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 946
Query: 854 RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
RD+N NWMSAVEILDDD +LGAEN FNLF +K+S TDEER L+ VG +HLGEFVN
Sbjct: 947 RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 1006
Query: 914 FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
F HGSLVM+ L ++ +V+FGTVNG+IG++ SL Y L +Q L KVIK VG
Sbjct: 1007 FCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066
Query: 973 GLNHEQW---RSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN---------- 1019
+ H + RSF+ E+KT A F+DGDLIESFLD+SR +M E+ +
Sbjct: 1067 KIEHSLYPSQRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKRE 1126
Query: 1020 VSVEELCKRVEELTRLH 1036
+ ++L K VEELTR+H
Sbjct: 1127 ATADDLIKVVEELTRIH 1143
>gi|355683071|gb|AER97036.1| damage-specific DNA binding protein 1, 127kDa [Mustela putorius furo]
Length = 1122
Score = 1142 bits (2954), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 570/1062 (53%), Positives = 746/1062 (70%), Gaps = 57/1062 (5%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
+V +YG+IA +ELFRP GE++D LFI T +Y C+L++ ES ++ITRA G+V DRIG
Sbjct: 54 EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RP++ G IGIIDP+CR+IGL LYDGLFKVIP D + +AFNIRLEEL V+D+KFLYGC
Sbjct: 114 RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 173
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PTI +YQD + RHVKTYEV+L++K+F +GPW Q N++ A ++I VP P G +II
Sbjct: 174 QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
G+E+I Y + + + AI I+ S + RVD +GSRYLLGD G L +L++ E++
Sbjct: 233 GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 292
Query: 236 ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
+ L++ELLGETSIA ++YLDN VV++GS GDSQL+KLN+ + +GSYV +E
Sbjct: 293 GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 352
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
+ NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W
Sbjct: 353 FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
LRS + D LV+SF+ +TR+L +N E E+EETE+ GF QT FC + + QL+Q+
Sbjct: 413 LRSDPNRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQI 471
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
TS SVRLVS + L +EWK P G +++VA+ N+SQV++A G L YL+I L ++
Sbjct: 472 TSASVRLVSQEPKALVSEWKEPQGKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 530
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
H ++E+E++CLDI P+G++ S + A+G+WTDIS RI LP L+ KE LGGEIIPRS
Sbjct: 531 HTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 590
Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
+L+ FE YLLCALGDG L F LN++TG L+DRKKV+LGTQP LRTF S +TT+VF
Sbjct: 591 ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
A SDRPTVIYSSN KL++SNVNLKEV++MCP NS +PDSLA+A LTIGTID+IQKL
Sbjct: 651 ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 710
Query: 653 HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
HIR++PL E PR+IC+QE S+ F + S +
Sbjct: 711 HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGTTALRPSASTQALSSSVSSSKL 770
Query: 682 --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
S EE E+H + ++D TFE + + EY S++SC D N Y+
Sbjct: 771 FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 830
Query: 734 VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
VGTA V PEE EP +GRI+VF DGKLQ +AEKE KGAVYS+ FNGKLLA+IN ++L
Sbjct: 831 VGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 890
Query: 794 YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
Y+W +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK EG EE A
Sbjct: 891 YEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 946
Query: 854 RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
RD+N NWMSAVEILDDD +LGAEN FNLF +K+S TDEER L+ VG +HLGEFVN
Sbjct: 947 RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 1006
Query: 914 FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
F HGSLVM+ L ++ +V+FGTVNG+IG++ SL Y L +Q L KVIK VG
Sbjct: 1007 FCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066
Query: 973 GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEI 1014
+ H WRSF+ E+KT A F+DGDLIESFLD+SR +M E+
Sbjct: 1067 KIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEV 1108
>gi|301616502|ref|XP_002937687.1| PREDICTED: DNA damage-binding protein 1-like [Xenopus (Silurana)
tropicalis]
Length = 1140
Score = 1142 bits (2954), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 577/1094 (52%), Positives = 761/1094 (69%), Gaps = 67/1094 (6%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
+V +YG+IA +ELFRP GE++D LFI T +Y C+L++ + +S ++ITRA G+V DRIG
Sbjct: 54 EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQNGDSIDIITRAHGNVQDRIG 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RP++ G IGIIDPDCR+IGL LYDGLFKVIP + + +AFNIRLEEL V+D+KFLY C
Sbjct: 114 RPSETGIIGIIDPDCRMIGLRLYDGLFKVIPLERDNKELKAFNIRLEELHVIDVKFLYSC 173
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PTI +YQD + RHVKTYEV+L++K+F +GPW Q N++ A ++I VP P G +II
Sbjct: 174 QAPTICFVYQDPQ-GRHVKTYEVSLREKEFSKGPWKQENVEAEASMVIAVPEPFGGAIII 232
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
G+E+I Y + + + AI I+ S + RVD +GSRYLLGD G L +L++ E++
Sbjct: 233 GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDVNGSRYLLGDMEGRLFMLLLEKEEQMD 292
Query: 236 ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
+ L++ELLGETSIA ++YLDN VV++GS GDSQL+KL + + +GSYV V+E
Sbjct: 293 GSVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLTAESNEQGSYVVVMET 352
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
+ NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W
Sbjct: 353 FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
LR + D D LV+SF+ +TR+L + E E+EET++ GF QT FC + + QL+Q+
Sbjct: 413 LRVAADRDTDDTLVLSFVGQTRVLTLTGE-EVEETDLAGFVDDQQTFFCGNVAHQQLIQI 471
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
TS SVRLV+ + L +EWK P G V+V + N+ QVLLA G L YLEI G L ++
Sbjct: 472 TSASVRLVAQNPQSLVSEWKEPQGRKVSVCSCNSRQVLLAVGR-VLYYLEIHPGELRQIS 530
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
++E+E++CLD+ P+G + + S + A+G+WTDIS RI SLP L+ KE LGGEIIPRS
Sbjct: 531 CTEMEHEVACLDVTPLGGSDTLSSLCAIGLWTDISARILSLPGFQLLHKEMLGGEIIPRS 590
Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
+L+ +FE YLLCALGDG L F LN TG L+DRKKV+LGTQP LRTF S +TT+VF
Sbjct: 591 ILMTSFESSHYLLCALGDGALFYFSLNTDTGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
A SDRPTVIYSSN KL++SNVNLKEV++MCP NS +PDSLA+A LTIGTID+IQKL
Sbjct: 651 ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSEGYPDSLALANNSTLTIGTIDEIQKL 710
Query: 653 HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------NQSCA-- 686
HIR++PL E PR+IC+QE S+ F + S + + SC+
Sbjct: 711 HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDASGGSSPLRPSASTQALSSSVSCSKL 770
Query: 687 -------------EESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
EE E+H + ++D TFE + T+ EY S++SC D Y+
Sbjct: 771 FSGSTSPHETSFGEEVEVHNLLIIDQHTFEVLHTHQFLQNEYTLSLVSCKLGKDPTTYFV 830
Query: 734 VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
VGTA V P+E EP +GRI+VF DGKLQ +AEKE KGAVYS+ FNGKLLA+IN ++L
Sbjct: 831 VGTAMVYPDEAEPKQGRIVVFQYNDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 890
Query: 794 YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
Y+W +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK EG EE A
Sbjct: 891 YEWT----AEKELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 946
Query: 854 RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
RD+N NWMSAVEILDDD +LGAEN FNLF +K+S TDEER L+ VG +HLGEFVN
Sbjct: 947 RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 1006
Query: 914 FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
F HGSLVM+ L ++ +V+FGTVNG+IG++ SL Y L +Q L KVIK VG
Sbjct: 1007 FCHGSLVMQNLGETSPPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDVQNRLNKVIKSVG 1066
Query: 973 GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV----------SV 1022
+ H WRSF+ E+KT A F+DGDLIESFLD+SR +M E+ + + +V
Sbjct: 1067 KIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVIANLQIDDGSGMKRETTV 1126
Query: 1023 EELCKRVEELTRLH 1036
++L K VEELTR+H
Sbjct: 1127 DDLIKVVEELTRIH 1140
>gi|440893607|gb|ELR46310.1| DNA damage-binding protein 1 [Bos grunniens mutus]
Length = 1143
Score = 1142 bits (2953), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 578/1097 (52%), Positives = 760/1097 (69%), Gaps = 70/1097 (6%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
+V +YG+IA +ELFRP GE++D LFI T +Y C+L++ ES ++ITRA G+V DRIG
Sbjct: 54 EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RP++ G IGIIDP+CR+IGL LYDGLFKVIP D + +AFNIRLEEL V+D+KFLYGC
Sbjct: 114 RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 173
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PTI +YQD + RHVKTYEV+L++K+F +GPW Q N++ A ++I VP P G +II
Sbjct: 174 QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
G+E+I Y + + + AI I+ S + RVD +GSRYLLGD G L +L++ E++
Sbjct: 233 GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 292
Query: 236 ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
+ L++ELLGETSIA ++YLDN VV++GS GDSQL+KLN+ + +GSYV +E
Sbjct: 293 GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 352
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
+ NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W
Sbjct: 353 FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
LRS + D LV+SF+ +TR+L +N E E+EETE+ GF QT FC + + QL+Q+
Sbjct: 413 LRSDPNRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQI 471
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
TS SVRLVS + L +EWK P G +++VA+ N+SQV++A G L YL+I L ++
Sbjct: 472 TSASVRLVSQEPKALVSEWKEPQGKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 530
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
H ++E+E++CLDI P+G++ S + A+G+WTDIS RI LP L+ KE LGGEIIPRS
Sbjct: 531 HTEMEHEVACLDITPLGDSNGMSPLCAIGLWTDISARIAKLPSFELLHKEMLGGEIIPRS 590
Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
+L+ FE YLLCALGDG L F LN++TG L+DRKKV+LGTQP LRTF S +TT+VF
Sbjct: 591 ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
A SDRPTVIYSSN KL++SNVNLKEV++MCP NS +PDSLA+A LTIGTID+IQKL
Sbjct: 651 ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 710
Query: 653 HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
HIR++PL E PR+IC+QE S+ F + S +
Sbjct: 711 HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGTTALRPSASTQALSSSVSSSKL 770
Query: 682 --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
S EE E+H + ++D TFE + + EY S++SC D N Y+
Sbjct: 771 FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 830
Query: 734 VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
VGTA V PEE EP +GRI+VF DGKLQ +AEKE KGAVYS+ FNGKLLA+IN ++L
Sbjct: 831 VGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 890
Query: 794 YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
Y+W +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK EG EE A
Sbjct: 891 YEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 946
Query: 854 RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
RD+N NWMSAVEILDDD +LGAEN FNLF +K+S TDEER L+ VG +HLGEFVN
Sbjct: 947 RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 1006
Query: 914 FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
F HGSLVM+ L ++ +V+FGTVNG+IG++ SL Y L +Q L KVIK VG
Sbjct: 1007 FCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066
Query: 973 GLNHEQW---RSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN---------- 1019
+ H + RSF+ E+KT A F+DGDLIESFLD+SR +M E+ +
Sbjct: 1067 KIEHSLYPSQRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKRE 1126
Query: 1020 VSVEELCKRVEELTRLH 1036
+ ++L K VEELTR+H
Sbjct: 1127 ATADDLIKVVEELTRIH 1143
>gi|281345356|gb|EFB20940.1| hypothetical protein PANDA_015888 [Ailuropoda melanoleuca]
Length = 1124
Score = 1140 bits (2950), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 578/1096 (52%), Positives = 759/1096 (69%), Gaps = 69/1096 (6%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
+V +YG+IA +ELFRP GE++D LFI T +Y C+L++ ES ++ITRA G+V DRIG
Sbjct: 36 EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 95
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RP++ G IGIIDP+CR+IGL LYDGLFKVIP D + +AFNIRLEEL V+D+KFLYGC
Sbjct: 96 RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 155
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PTI +YQD + RHVKTYEV+L++K+F +GPW Q N++ A ++I VP P G +II
Sbjct: 156 QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 214
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
G+E+I Y + + + AI I+ S + RVD +GSRYLLGD G L +L++ E++
Sbjct: 215 GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 274
Query: 236 ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
+ L++ELLGETSIA ++YLDN VV++GS GDSQL+KLN+ + +GSYV +E
Sbjct: 275 GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 334
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
+ NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W
Sbjct: 335 FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 394
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
LRS + D LV+SF+ +TR+L +N E E+EETE+ GF QT FC + + QL+Q+
Sbjct: 395 LRSDPNRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQI 453
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
TS SVRLVS + L +EWK P G +++VA+ N+SQV++A G L YL+I L ++
Sbjct: 454 TSASVRLVSQEPKALVSEWKEPQGKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 512
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
H ++E+E++CLDI P+G++ S + A+G+WTDIS RI LP L+ KE LGGEIIPRS
Sbjct: 513 HTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 572
Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
+L+ FE YLLCALGDG L F LN++TG L+DRKKV+LGTQP LRTF S +TT+VF
Sbjct: 573 ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 632
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
A SDRPTVIYSSN KL++SNVNLKEV++MCP NS +PDSLA+A LTIGTID+IQKL
Sbjct: 633 ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 692
Query: 653 HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
HIR++PL E PR+IC+QE S+ F + S +
Sbjct: 693 HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGTTALRPSASTQALSSSVSSSKL 752
Query: 682 --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
S EE E+H + ++D TFE + + EY S++SC D N Y+
Sbjct: 753 FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 812
Query: 734 VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
VGTA V PEE EP +GRI+VF DGKLQ +AEKE KGAVYS+ FNGKLLA+IN ++L
Sbjct: 813 VGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 872
Query: 794 YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
Y+W +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK EG EE A
Sbjct: 873 YEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 928
Query: 854 RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
RD+N NWMSAVEILDDD +LGAEN FNLF +K+S TDEER L+ VG +HLGEFVN
Sbjct: 929 RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 988
Query: 914 FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
F HGSLVM+ L ++ +V+FGTVNG+IG++ SL Y L +Q L KVIK +
Sbjct: 989 FCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKNIT 1048
Query: 973 -GLNH-EQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN----------V 1020
L H WRSF+ E+KT A F+DGDLIESFLD+SR +M E+ +
Sbjct: 1049 HSLTHLSTWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREA 1108
Query: 1021 SVEELCKRVEELTRLH 1036
+ ++L K VEELTR+H
Sbjct: 1109 TADDLIKVVEELTRIH 1124
>gi|74178494|dbj|BAE32502.1| unnamed protein product [Mus musculus]
Length = 1140
Score = 1139 bits (2947), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 577/1094 (52%), Positives = 755/1094 (69%), Gaps = 67/1094 (6%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
+V +YG+IA +ELFRP GE++D LFI T +Y C+L++ ES ++ITRA G+V DRIG
Sbjct: 54 EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RP++ G IGIIDP+CR+IGL LYDGLFKVIP D + +AFNIRLEEL V+D+KFLYGC
Sbjct: 114 RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 173
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PTI +YQD + RHVKTYEV+L++K+F +GPW Q N++ A ++I VP P G +II
Sbjct: 174 QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
G+E+I Y + + + AI I+ S + RVD +GSRYLLGD G L +L++ E++
Sbjct: 233 GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 292
Query: 236 ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
+ L++ELLGETSIA ++YLDN VV++GS GDSQL+KLN+ + +GSYV +E
Sbjct: 293 GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 352
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
+ NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W
Sbjct: 353 FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
LRS D LV+SF+ +TR+L +N E E+EETE+ GF QT FC + + QL+Q+
Sbjct: 413 LRSDPGRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQI 471
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
TS SVRLVS + L +EWK P G +++VA+ N+SQV++A G L YL+I L ++
Sbjct: 472 TSASVRLVSQEPKALVSEWKEPQGKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 530
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
H ++E+E++CLDI P+G++ S + A+G+WTDIS RI LP L+ KE LGGEIIPRS
Sbjct: 531 HTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 590
Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
+L+ FE LLCALGDG L F LN++TG L+DRKKV+LGTQP LRTF S +TT+VF
Sbjct: 591 ILMTTFESSHCLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
A SDRPTVIYSSN KL++SNVNLKEV++MCP NS +PDSLA+A LTIGTID+IQKL
Sbjct: 651 ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 710
Query: 653 HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
HIR+ PL E PR+IC+QE S+ F + S +
Sbjct: 711 HIRTDPLYESPRKICYQEVSQCFGVLSSRIEVQDSSGGTTALRPSASTQALSSSVSSSKL 770
Query: 682 --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
S EE E+H + ++ TFE + + EY S++SC D N Y+
Sbjct: 771 FSSSTAPHETSFGEEVEVHNLLIIGQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 830
Query: 734 VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
VGTA V PEE EP GRI+VF DGKLQ +AEKE KGAVYS+ FNGKLLA+IN ++L
Sbjct: 831 VGTAMVYPEEAEPKLGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 890
Query: 794 YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
Y+W +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK EG EE A
Sbjct: 891 YEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 946
Query: 854 RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
RD+N NWMSAVEILDDD +LGAEN FNLF +K+S TDEER L+ VG +HLGEFVN
Sbjct: 947 RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 1006
Query: 914 FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
F HGSLVM+ L ++ +V+FGTVNG+IG++ SL Y L +Q L KVIK VG
Sbjct: 1007 FCHGSLVMQNLGEASTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066
Query: 973 GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN----------VSV 1022
+ H WRSF+ E+KT A F+DGDLIESFLD+SR +M E+ + +
Sbjct: 1067 KIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATA 1126
Query: 1023 EELCKRVEELTRLH 1036
++L K VEELTR+H
Sbjct: 1127 DDLIKVVEELTRIH 1140
>gi|260790329|ref|XP_002590195.1| hypothetical protein BRAFLDRAFT_128289 [Branchiostoma floridae]
gi|229275385|gb|EEN46206.1| hypothetical protein BRAFLDRAFT_128289 [Branchiostoma floridae]
Length = 1152
Score = 1139 bits (2945), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 579/1109 (52%), Positives = 767/1109 (69%), Gaps = 85/1109 (7%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
+V YGRIA +ELFRP GE +D LFI T RY C+L++ D + ++TRA G+V DRIG
Sbjct: 54 EVGCYGRIAVMELFRPPGETKDLLFILTARYNACILEYKQDGDDVSIVTRAHGNVQDRIG 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RP++ G IGIIDP+CR+IGL LYDGLFKVIP D + +AFNIRLEEL V+D+KFLYGC
Sbjct: 114 RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNRELKAFNIRLEELNVIDVKFLYGC 173
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PT+V +YQD RHVKTYE++++DK+F +GPW Q+N++ A ++I VP P CG LII
Sbjct: 174 QVPTVVFVYQD-PHGRHVKTYEISVRDKEFSKGPWKQDNVETEASMVIAVPEPFCGSLII 232
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEK- 236
G+E+I Y + + + A+ I+ S +GRVDA+GSRYLLGD +G L +L++ EKE+
Sbjct: 233 GQESITYHNGDKYVAVAPPAIKQSTLICHGRVDANGSRYLLGDMSGRLFMLLL--EKEEL 290
Query: 237 ------VTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVL 290
V LK+ELLGETSIA ++YLDN VVY+GS GDSQLIKLN+ D GSYV V+
Sbjct: 291 IDGSVTVKDLKVELLGETSIAECLTYLDNGVVYLGSRLGDSQLIKLNVDADDSGSYVHVM 350
Query: 291 ERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGM 350
E + NLGPIVD CVVDLERQGQGQ+VTCSGAYK+GSLRI+RNGIGI+E AS++L GIKG+
Sbjct: 351 ETFTNLGPIVDMCVVDLERQGQGQLVTCSGAYKEGSLRIIRNGIGIHEHASIDLPGIKGI 410
Query: 351 WSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLV 410
W L + D LV++F+ +TR+L ++ E E+EETE+ GF QT FC + + QL+
Sbjct: 411 WPLCVDPNGTMDDTLVLAFVGQTRVLLLSGE-EVEETELPGF-EDKQTFFCGNVMGGQLL 468
Query: 411 QVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTE 470
Q+T+ SVRLVS +++L +EWK P +++VA++N +QV+ A G + Y+EI +G L +
Sbjct: 469 QITAASVRLVSRQTKQLVSEWKPPSEKNISVASSNTTQVVCAV-GQVVYYIEIHEGELKQ 527
Query: 471 VKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIP 530
+ + E++CLD+ P+ E + + AVG+WTDIS + LP+L + E LGGEIIP
Sbjct: 528 LGETVMAREVACLDVTPLVEGSDRAHLCAVGLWTDISAHMLQLPNLEPMHVEMLGGEIIP 587
Query: 531 RSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGE------LTDRKKVSLGTQPITLRTFS 584
RS+L+ FEGI YLLCALGDG L F LN +TG+ LT + KV+LGTQP LRTF
Sbjct: 588 RSILMTTFEGIHYLLCALGDGSLFYFNLNPETGKNFSLHILTIKSKVTLGTQPTVLRTFR 647
Query: 585 SKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIG 644
S +TT+VFA SDRPTVIYSSN KL++SNVNLKEV++MCP NS +PDSLA+ + LTIG
Sbjct: 648 SLSTTNVFACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSQGYPDSLALTNDSTLTIG 707
Query: 645 TIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQ--------------------- 683
TID+IQKLHIR++PL E PRRI +QE S+TF + S + +
Sbjct: 708 TIDEIQKLHIRTVPLYESPRRIAYQETSQTFGVLSTRTEVEDASGGSGTQPVRPSASTTA 767
Query: 684 --------------------SCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCS 723
+ EE E+H + ++D TFE + + L E+ S++SC
Sbjct: 768 LSTSVSPNKNALRSSNDSDGAFGEEIEIHSLLVVDQHTFEILHAHQLLKDEFALSMVSCK 827
Query: 724 FSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKL 783
DD N Y+ +GTA V PEE+EP GRI+VF DGKLQ +AEKE KGAVYSL FN KL
Sbjct: 828 LGDDPNTYFIIGTAMVYPEESEPKSGRIIVFQYTDGKLQQVAEKEVKGAVYSLVQFNNKL 887
Query: 784 LAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYK 843
LA+IN ++L++W +EL+ EC H+ +ILALY++T+GDFI+VGDLM+S++LL YK
Sbjct: 888 LASINSTVRLFEWT----AEKELRVECNHYNNILALYLKTKGDFILVGDLMRSVTLLAYK 943
Query: 844 HEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVG 903
EG EE ARD+N NWMSAVEILDDD +LGAEN+FN FT +K+S TDEER L+ VG
Sbjct: 944 PMEGCFEEIARDFNPNWMSAVEILDDDNFLGAENSFNFFTCQKDSAATTDEERQHLQEVG 1003
Query: 904 EYHLGEFVNRFRHGSLVMRLP-DSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQT 962
+HLGEFVN FRHGSLVM+ P ++ +V+FGTVNG +G++ LP + + FL+++Q+
Sbjct: 1004 HFHLGEFVNVFRHGSLVMQHPGETSTPTQGSVLFGTVNGAVGLVTQLPADFFNFLQEVQS 1063
Query: 963 NLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV-- 1020
L +VIK VG + H WRSFN E+KT + F+DGDLIESFLDLSR +M E+ + + V
Sbjct: 1064 KLTRVIKSVGKIEHSFWRSFNTERKTEACQGFIDGDLIESFLDLSRDKMQEVVQGLQVGG 1123
Query: 1021 -------------SVEELCKRVEELTRLH 1036
+VE+L K VEELTR+H
Sbjct: 1124 AIMDDGSGMKRECTVEDLIKLVEELTRIH 1152
>gi|427788481|gb|JAA59692.1| Putative dna damage-binding protein 1 [Rhipicephalus pulchellus]
Length = 1156
Score = 1138 bits (2944), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 575/1110 (51%), Positives = 775/1110 (69%), Gaps = 83/1110 (7%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQ--WDAESSELITRAMGDVSDRIG 60
++ IYGRI+ ++L+RP GE +D LF T +Y +L+ +S E+IT+A G+++D
Sbjct: 54 EIGIYGRISIMKLYRPAGEKKDLLFFLTAKYNAAILECVQYGDSIEIITKAHGNIADTFS 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RP++ G IGIIDP+CR+IGL LYDGLFKVIP D + +AFNIR+EEL V D++FL+GC
Sbjct: 114 RPSETGNIGIIDPECRVIGLRLYDGLFKVIPLDKDNRELKAFNIRMEELTVQDMEFLHGC 173
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PTIV+L+QD++ ARH+KTYEV+LKDK+FV+GPW Q+++++ A+L+I VP P CG LII
Sbjct: 174 KTPTIVLLHQDSQ-ARHMKTYEVSLKDKEFVKGPWKQDHVESEANLVIAVPEPFCGALII 232
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
G+E+I Y + + + I IR S YG+VDA+GSRYLLGD AG L +L++ E +
Sbjct: 233 GQESITYHNGDQYVVITPHLIRQSTIVCYGKVDANGSRYLLGDMAGRLFMLLLEREDKMD 292
Query: 236 ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
V LK+E LGE +IA I+YLDN VVY+GS GDSQLIKL+ + + +GS+VE++E
Sbjct: 293 GTTTVKDLKLEFLGEITIAECITYLDNGVVYVGSRLGDSQLIKLHAERNDQGSFVEIMEV 352
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
+ NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKGMW
Sbjct: 353 FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGMWP 412
Query: 353 LR---------SSTDDPFDTF-----LVVSFISETRILAMNLEDELEETEIEGFCSQTQT 398
LR DP D+ LV+SF+ +TR+L ++ E E+EETE+ GF + QT
Sbjct: 413 LRVGPGVAPHGGDGRDPGDSAERDNTLVLSFVRQTRVLMLSGE-EVEETELAGFDTSQQT 471
Query: 399 LFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHL 458
FC + QL+QVT+ +VRLV S +++L NEWK P +++V T N SQV+ A +
Sbjct: 472 FFCGNVRNKQLIQVTAAAVRLVDSQTKQLLNEWKPPGARNISVVTCNQSQVVCAVRK-EV 530
Query: 459 VYLEIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNL 518
LEIGDG+L ++ +A+LE E++CLDI P+ E + + AVG+WTDIS+RI SLP L
Sbjct: 531 FCLEIGDGVLNQISNAELENEVACLDITPLSEKAEKATLCAVGLWTDISIRILSLPSLQQ 590
Query: 519 ITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPI 578
+ KE++GGEIIPRS+L+ FEGI YLLCALGDG L FLL TG LTDRKKV+LGTQP
Sbjct: 591 LQKENIGGEIIPRSILITTFEGIHYLLCALGDGSLFYFLLEATTGALTDRKKVTLGTQPT 650
Query: 579 TLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKE 638
L+TF S +T++VFA SDRPTVIYSSN KL++SNVNLKEV+HMCP NS +PDSLA+A +
Sbjct: 651 VLKTFKSLSTSNVFACSDRPTVIYSSNHKLVFSNVNLKEVNHMCPLNSEGYPDSLALAND 710
Query: 639 GELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLK----------------- 681
L IGTID+IQKLHIR++PLGE PRRI +QE ++TF + +++
Sbjct: 711 NTLLIGTIDEIQKLHIRTVPLGELPRRIAYQEATQTFGVITIRNDILGSSGLTPVRPSAS 770
Query: 682 ------------------------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGC 717
N +E E+H + ++D TFE + + EY
Sbjct: 771 TQAQNVTHSAQMSSIFKPGSVSTGNDQLGQEVEIHNLLIIDQHTFEVLHAHQFMQTEYAM 830
Query: 718 SILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLN 777
SI+S +D N YY VGTA VLP+E++P +GRI+VF DGKL+ +AE+E KGA YS+
Sbjct: 831 SIVSTRLGNDPNTYYIVGTANVLPDESDPKQGRIVVFHWVDGKLEHVAEQEIKGAPYSML 890
Query: 778 AFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSI 837
FNGKLLAAIN ++L++W + REL++EC H +ILALY++ +GDF++VGDLM+S+
Sbjct: 891 EFNGKLLAAINSTVRLFEW----NAERELRNECSHFNNILALYLRAKGDFVLVGDLMRSM 946
Query: 838 SLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERG 897
SLL YK EG EE ARDY NWMS+VEILDDD +LGAE+ NLF +K+S TDEER
Sbjct: 947 SLLAYKPLEGNFEEIARDYQTNWMSSVEILDDDTFLGAESTTNLFVCQKDSAATTDEERQ 1006
Query: 898 RLEVVGEYHLGEFVNRFRHGSLVMRLP-DSDVGQIPTVIFGTVNGVIGVIASLPHEQYLF 956
L+ VG++HLGEFVN FRHGSLVM+ P ++ +V+FGT++G IG+++ LP + Y F
Sbjct: 1007 HLQEVGQFHLGEFVNVFRHGSLVMQHPGETSSPTQGSVLFGTIHGAIGLVSQLPADFYTF 1066
Query: 957 LEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISK 1016
L ++Q L KVIK VG ++H WRSF+ E+KT A F+DGDLIESFLDLSR +M E+ +
Sbjct: 1067 LSEVQEKLTKVIKSVGKIDHAFWRSFSTERKTEPAVGFIDGDLIESFLDLSRDKMQEVVQ 1126
Query: 1017 TM----------NVSVEELCKRVEELTRLH 1036
+ + SV++L K +EEL+R+H
Sbjct: 1127 GIQMDDGSGMKRDASVDDLIKIIEELSRIH 1156
>gi|444513057|gb|ELV10249.1| DNA damage-binding protein 1 [Tupaia chinensis]
Length = 1146
Score = 1138 bits (2944), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 576/1100 (52%), Positives = 758/1100 (68%), Gaps = 73/1100 (6%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
+V +YG+IA +ELFRP GE++D LFI T +Y C+L++ ES ++ITRA G+V DRIG
Sbjct: 54 EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RP++ G IGIIDP+CR+IGL LYDGLFKVIP D + +AFNIRLEEL V+D+KFLYGC
Sbjct: 114 RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 173
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PTI +YQD + RHVKTYEV+L++K+F +GPW Q N++ A ++I VP P G +II
Sbjct: 174 QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
G+E+I Y + + + AI I+ S + RVD +GSRYLLGD G L +L++ E++
Sbjct: 233 GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 292
Query: 236 ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
+ L++ELLGETSIA ++YLDN VV++GS GDSQL+KLN+ + +GSYV +E
Sbjct: 293 GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 352
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
+ NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W
Sbjct: 353 FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
LRS + D LV+SF+ +TR+L +N E E+EETE+ GF QT FC + + QL+Q+
Sbjct: 413 LRSDPNRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQI 471
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
TS SVRLVS + L +EWK P G +++VA+ N+SQV++A G L YL+I L ++
Sbjct: 472 TSASVRLVSQEPKALVSEWKEPQGKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 530
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
H ++E+E++CLDI P+G++ S + A+G+WTDIS RI LP L+ KE LGGEIIPRS
Sbjct: 531 HTEMEHEVACLDITPLGDSSGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 590
Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
+L+ FE YLLCALGDG L F LN++TG L+DRKKV+LGTQP LRTF S +TT+VF
Sbjct: 591 ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
A SDRPTVIYSSN KL++SNVNLKEV++MCP NS +PDSLA+A LTIGTID+IQKL
Sbjct: 651 ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 710
Query: 653 HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
HIR++PL E PR+IC+QE S+ F + S +
Sbjct: 711 HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGTTALRPSASTQALSSSVSSSKL 770
Query: 682 --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
S EE E+H + ++D TFE + + EY S++SC D N Y+
Sbjct: 771 FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 830
Query: 734 VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
VGTA V PEE EP +GRI+VF DGKLQ +AEKE KGAVYS+ FNGKLLA+IN ++L
Sbjct: 831 VGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 890
Query: 794 YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
Y+W +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK EG EE A
Sbjct: 891 YEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 946
Query: 854 RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
RD+N NWMSAVEILDDD +LGAEN FNLF +K+S TDEER L+ VG +HLGEFVN
Sbjct: 947 RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 1006
Query: 914 FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
F HGSLVM+ L ++ +V+FGTVNG+IG++ SL Y L +Q L KVIK
Sbjct: 1007 FCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKRCF 1066
Query: 973 GL------NHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN------- 1019
+ + WRSF+ E+KT A F+DGDLIESFLD+SR +M E+ +
Sbjct: 1067 QISPNSLTDMSTWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGM 1126
Query: 1020 ---VSVEELCKRVEELTRLH 1036
+ ++L K VEELTR+H
Sbjct: 1127 KREATADDLIKVVEELTRIH 1146
>gi|119594342|gb|EAW73936.1| damage-specific DNA binding protein 1, 127kDa, isoform CRA_d [Homo
sapiens]
Length = 1146
Score = 1137 bits (2941), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 577/1100 (52%), Positives = 757/1100 (68%), Gaps = 73/1100 (6%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
+V +YG+IA +ELFRP GE++D LFI T +Y C+L++ ES ++ITRA G+V DRIG
Sbjct: 54 EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RP++ G IGIIDP+CR+IGL LYDGLFKVIP D + +AFNIRLEEL V+D+KFLYGC
Sbjct: 114 RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 173
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PTI +YQD + RHVKTYEV+L++K+F +GPW Q N++ A ++I VP P G +II
Sbjct: 174 QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
G+E+I Y + + + AI I+ S + RVD +GSRYLLGD G L +L++ E++
Sbjct: 233 GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 292
Query: 236 ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
+ L++ELLGETSIA ++YLDN VV++GS GDSQL+KLN+ + +GSYV +E
Sbjct: 293 GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 352
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
+ NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W
Sbjct: 353 FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
LRS + D LV+SF+ +TR+L +N E E+EETE+ GF QT FC + + QL+Q+
Sbjct: 413 LRSDPNRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQI 471
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
TS SVRLVS + L +EWK P +++VA+ N+SQV++A G L YL+I L ++
Sbjct: 472 TSASVRLVSQEPKALVSEWKEPQAKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 530
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
H ++E+E++CLDI P+G++ S + A+G+WTDIS RI LP L+ KE LGGEIIPRS
Sbjct: 531 HTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 590
Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
+L+ FE YLLCALGDG L F LN++TG L+DRKKV+LGTQP LRTF S +TT+VF
Sbjct: 591 ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
A SDRPTVIYSSN KL++SNVNLKEV++MCP NS +PDSLA+A LTIGTID+IQKL
Sbjct: 651 ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 710
Query: 653 HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
HIR++PL E PR+IC+QE S+ F + S +
Sbjct: 711 HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGTTALRPSASTQALSSSVSSSKL 770
Query: 682 --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
S EE E+H + ++D TFE + + EY S++SC D N Y+
Sbjct: 771 FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 830
Query: 734 VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
VGTA V PEE EP +GRI+VF DGKLQ +AEKE KGAVYS+ FNGKLLA+IN ++L
Sbjct: 831 VGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 890
Query: 794 YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
Y+W +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK EG EE A
Sbjct: 891 YEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 946
Query: 854 RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
RD+N NWMSAVEILDDD +LGAEN FNLF +K+S TDEER L+ VG +HLGEFVN
Sbjct: 947 RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 1006
Query: 914 FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIK--- 969
F HGSLVM+ L ++ +V+FGTVNG+IG++ SL Y L +Q L KVIK
Sbjct: 1007 FCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKRCF 1066
Query: 970 --GVGGLNH-EQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN------- 1019
L H WRSF+ E+KT A F+DGDLIESFLD+SR +M E+ +
Sbjct: 1067 LISTCSLTHPSTWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGM 1126
Query: 1020 ---VSVEELCKRVEELTRLH 1036
+ ++L K VEELTR+H
Sbjct: 1127 KREATADDLIKVVEELTRIH 1146
>gi|410912407|ref|XP_003969681.1| PREDICTED: DNA damage-binding protein 1-like [Takifugu rubripes]
Length = 1140
Score = 1137 bits (2941), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 573/1094 (52%), Positives = 760/1094 (69%), Gaps = 67/1094 (6%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
+V +YG+IA +ELFRP GE++D LFI T +Y C+L++ + ES ++ITRA G+V D IG
Sbjct: 54 EVGMYGKIAVMELFRPKGESKDLLFILTSKYNACILEYKQNGESIDIITRAHGNVKDPIG 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RP++ G IGI+DP+CR+IGL LYDGLFKVIP D + +A+NIRLEELQV+D+ FLYGC
Sbjct: 114 RPSETGIIGIVDPECRMIGLRLYDGLFKVIPLDRDNRELKAYNIRLEELQVIDVHFLYGC 173
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PT+ +YQD + RHVKTYEV+L++K+F +GPW Q N++ A ++IPVP P G +II
Sbjct: 174 QAPTVCFIYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIPVPEPFGGAIII 232
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEK--- 234
G+E+I Y + + + AI I+ S + RVD +GSRYLLGD G L +L++ E+
Sbjct: 233 GQESITYHNGDKYLAIAPPTIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEELMD 292
Query: 235 --EKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
+ L +ELLGETSIA ++YLDN VV++GS GD QL+KLN+ + +GS+V V+E
Sbjct: 293 GTVALKDLHVELLGETSIAECLTYLDNGVVFVGSRLGDPQLVKLNVDSNDQGSFVTVMET 352
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
+ NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W
Sbjct: 353 FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
LRS D LV+SF+ +TR+L ++ E E+EETE+ GF QT +C + +NQL+Q+
Sbjct: 413 LRSEAGRETDDMLVLSFVGQTRVLMLSGE-EVEETELPGFVDNQQTFYCGNVAHNQLIQI 471
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
TSGSVRLV S+ L +EWK P G +++VA N +QV+LA G L YL+I G L ++
Sbjct: 472 TSGSVRLVLQDSKALVSEWKEPQGRNISVAACNHTQVVLAVGRA-LYYLQILAGELKQIS 530
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
++E+E++CLDI P+GE + S + AVG+WTDIS R+ LP + KE LGGEIIPRS
Sbjct: 531 TTEMEHEVACLDITPLGEAGAESPLCAVGLWTDISARVLKLPCFTPLHKEMLGGEIIPRS 590
Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
+L+ FEG YLLCALGDG L F L+++TG L++ KKV+LGTQP LRTF S +T++VF
Sbjct: 591 ILMTTFEGSYYLLCALGDGALFYFGLDLQTGALSECKKVTLGTQPTVLRTFRSLSTSNVF 650
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
A SDRPTVIYSSN KL++SNVNLKEV++MCP NS +PDSLA+A LTIGTID+IQKL
Sbjct: 651 ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSEGYPDSLALANNSTLTIGTIDEIQKL 710
Query: 653 HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
HIR++PL E PRRIC+QE S+ F + S +
Sbjct: 711 HIRTVPLYESPRRICYQEVSQCFGVLSSRVEIQDVSGTTSPVRPSASTQALSSSMSSSKL 770
Query: 682 --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
S EE E+H + ++D TFE + + EY S++SC D +VY+
Sbjct: 771 FPSSTSPHESSFGEEVEIHSLLVVDQHTFEVLHAHQFLPSEYALSMVSCRLGKDPSVYFV 830
Query: 734 VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
VGTA V PEE EP +GRI+VF DGKLQ +AEKE KGAVYS+ FNGKLLA+IN ++L
Sbjct: 831 VGTAMVYPEEAEPKQGRIIVFHYTDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 890
Query: 794 YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
Y+W +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK EG EE A
Sbjct: 891 YEWT----AEKELRTECSHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 946
Query: 854 RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
RD+N NWMSAVEILDDD +LGAEN FNLF +K+S TDE+R L+ VG +HLGEFVN
Sbjct: 947 RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEDRQHLQEVGVFHLGEFVNV 1006
Query: 914 FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
F HGSLV++ L ++ +V+FGTV G+IG++ SL + L LQ L KVIK VG
Sbjct: 1007 FCHGSLVLQNLGETSTPTQGSVLFGTVTGMIGLVTSLSEGWHSLLLDLQNRLNKVIKSVG 1066
Query: 973 GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV----------SV 1022
+ H WRSF+ E+KT AK F+DGDLIESFLDL R +M E+ T+ + +V
Sbjct: 1067 KIEHSFWRSFHTERKTEQAKGFIDGDLIESFLDLGRAKMQEVVSTLQIDDGSGMKREATV 1126
Query: 1023 EELCKRVEELTRLH 1036
+E+ K VEELTR+H
Sbjct: 1127 DEVIKIVEELTRIH 1140
>gi|81868411|sp|Q9ESW0.1|DDB1_RAT RecName: Full=DNA damage-binding protein 1; AltName:
Full=Damage-specific DNA-binding protein 1
gi|9843869|emb|CAB89874.2| damage-specific DNA binding protein 1 [Rattus norvegicus]
Length = 1140
Score = 1137 bits (2940), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 571/1094 (52%), Positives = 756/1094 (69%), Gaps = 67/1094 (6%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
+V +YG+IA +ELFRP GE++D LFI T +Y C+L++ ES ++ITRA G+V DRIG
Sbjct: 54 EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RP++ G IGIIDP+CR+IGL LYDGLFKVIP D + +AFNIRLEEL V+D+KFLYGC
Sbjct: 114 RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 173
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PTI +YQD + RHVKTYEV+L++K+F +GPW Q N++ A ++I VP P G +II
Sbjct: 174 QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
G+E+I Y + + + AI I+ S + RVD +GSRYLLGD G L +L++ E++
Sbjct: 233 GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 292
Query: 236 ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
+ L++ELLGETSIA ++YLDN VV++GS GDSQ +KLN+ + +GSYV +E
Sbjct: 293 GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQPVKLNVDSNEQGSYVVAMET 352
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
+ NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W
Sbjct: 353 FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
LRS + D LV+SF+ +TR+L +N E E+EETE+ GF QT FC + + QL+Q+
Sbjct: 413 LRSDPNRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQI 471
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
TS SVRLVS + L +EWK P +++VA+ N+SQV++A G L YL+I L ++
Sbjct: 472 TSASVRLVSQEPKALVSEWKEPRAKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 530
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
H ++E+E++CLD+ P+G++ S + A+G+WTDIS RI LP L+ KE LGGEIIPRS
Sbjct: 531 HTEMEHEVACLDVTPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 590
Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
+L+ FE YLLCALGDG L F LN++TG L+DRKKV+LGTQP LRTF S +TT+VF
Sbjct: 591 ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
A SDRPTVIYSSN KL++SNVNLKEV++MCP NS +PDSLA+A LTIGT+++IQKL
Sbjct: 651 ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANTSTLTIGTMNEIQKL 710
Query: 653 HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
HIR++P+ E PR+IC+QE S+ F + S +
Sbjct: 711 HIRTVPIYESPRKICYQEVSQCFGVLSTRIEVQDTSGGTTALRPSASTQALSSSVSSSKL 770
Query: 682 --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
S EE E+H + ++D TFE + + EY S++SC D N Y+
Sbjct: 771 FSSSAAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 830
Query: 734 VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
VGTA V PEE EP +GRI+VF GKLQ +AEKE KGAVYS+ FNGKLLA+IN ++L
Sbjct: 831 VGTAMVYPEEAEPKQGRIVVFQYSGGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 890
Query: 794 YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
Y+W +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK EG EE A
Sbjct: 891 YEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 946
Query: 854 RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
RD+N NWMSAVEILDDD +LGAEN FNLF +K+S TDEER L+ VG +HLGEFVN
Sbjct: 947 RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 1006
Query: 914 FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
F HGSLVM+ L ++ +V+ GTVNG+IG++ SL Y L +Q L KVIK VG
Sbjct: 1007 FCHGSLVMQNLGETSTPTQGSVLLGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066
Query: 973 GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN----------VSV 1022
+ H WRSF+ E+KT A F+DGDLIESFLD+SR +M E+ + +
Sbjct: 1067 KIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATA 1126
Query: 1023 EELCKRVEELTRLH 1036
++L K VEELTR+H
Sbjct: 1127 DDLIKVVEELTRIH 1140
>gi|449283451|gb|EMC90093.1| DNA damage-binding protein 1 [Columba livia]
Length = 1140
Score = 1135 bits (2937), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 578/1105 (52%), Positives = 761/1105 (68%), Gaps = 78/1105 (7%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
+V +YG+IA +ELFRP GE++D LFI T +Y C+L++ + +S ++ITRA G+V DRIG
Sbjct: 43 EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQNGDSIDIITRAHGNVQDRIG 102
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RP++ G IGIIDP+CR+IGL LYDGLFKVIP D + + +AFNIRLEELQV+D+KFLYGC
Sbjct: 103 RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRENKELKAFNIRLEELQVIDVKFLYGC 162
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PTI +YQD + RHVKTYEV+L++K+F +GPW Q N++ A ++I VP P G +II
Sbjct: 163 QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 221
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
G+E+I Y + + + AI I+ S + RVD +GSRYLLGD G L +L++ E++
Sbjct: 222 GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 281
Query: 236 ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
+ L++ELLGETSIA ++YLDN VV++GS GDSQL+KLN+ + +GSYV +E
Sbjct: 282 GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 341
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
+ NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W
Sbjct: 342 FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 401
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
LRS D LV+SF+ +TR+L +N E E+EETE+ GF QT FC + + QL+Q+
Sbjct: 402 LRSDAHRETDNMLVLSFVGQTRVLMLNGE-EVEETELTGFVDDQQTFFCGNVAHQQLIQI 460
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
TS SVRLVS + L +EWK P G +++VA+ N+SQV++A G L YLEI L ++
Sbjct: 461 TSASVRLVSQEPKALVSEWKEPNGKNISVASCNSSQVVVAVGRA-LYYLEIRPQELRQIS 519
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
++E+E++CLDI P+G+ S + A+G+WTDIS RI LP L+ KE LGGEIIPRS
Sbjct: 520 CTEMEHEVACLDITPLGDTNGMSPLCAIGLWTDISARILRLPSFELLHKEMLGGEIIPRS 579
Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
+L+ FE YLLCALGDG L F L+++TG L+DRKKV+LGTQP LRTF S +TT+VF
Sbjct: 580 ILMTTFESSHYLLCALGDGALFYFGLSLETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 639
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
A SDRPTVIYSSN KL++SNVNLKEV++MCP NS +PDSLA+A LTIGTID+IQKL
Sbjct: 640 ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 699
Query: 653 HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
HIR++PL E PR+IC+QE S+ F + S +
Sbjct: 700 HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDASGGTTALRPSASTQALSSSVSTSKL 759
Query: 682 --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
S EE E+H + ++D TFE + + EY S++SC D N Y+
Sbjct: 760 FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 819
Query: 734 VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
VGTA V PEE EP +GRI+VF DGKLQ +AEKE KGAVYS+ FNGKLLA+IN ++L
Sbjct: 820 VGTAMVYPEEAEPKQGRIVVFHYSDGKLQSLAEKEVKGAVYSMVEFNGKLLASINSTVRL 879
Query: 794 YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
Y+W +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK EG EE A
Sbjct: 880 YEWT----AEKELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 935
Query: 854 RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
RD+N NWMSAVEILDDD +LGAEN FNLF +K+S TDEER L+ VG HLGEFVN
Sbjct: 936 RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLSHLGEFVNV 995
Query: 914 FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
F HGSLVM+ L ++ +V+FGTVNG+IG++ SL Y L +Q L KVIK VG
Sbjct: 996 FCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1055
Query: 973 GLNH---------EQW--RSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV- 1020
+ H W +SF+ E+KT A F+DGDLIESFLD+SR +M E+ + +
Sbjct: 1056 KIEHSLYPSLVQLRAWASQSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQID 1115
Query: 1021 ---------SVEELCKRVEELTRLH 1036
+V++L K VEELTR+H
Sbjct: 1116 DGSGMKREATVDDLIKIVEELTRIH 1140
>gi|325186344|emb|CCA20849.1| predicted protein putative [Albugo laibachii Nc14]
Length = 1148
Score = 1132 bits (2927), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 566/1096 (51%), Positives = 762/1096 (69%), Gaps = 63/1096 (5%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRP 62
D+ +YGRIA E FRP GE QD++F+ T+R++FCVL +D ++IT+A G + D IGR
Sbjct: 54 DIKMYGRIAIFEKFRPSGETQDWIFLVTQRFQFCVLAYDTTLQQIITKANGSLRDTIGRN 113
Query: 63 TDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQ-LKEAFNIRLEELQVLDIKFLYGCA 121
++ G IDPD RLIG+++Y+G FKVIP DN + LK AFNIRL+EL++LDIKFLYG
Sbjct: 114 SEILTNGNIDPDGRLIGMNIYEGYFKVIPIDNHSKSLKAAFNIRLDELRILDIKFLYGYN 173
Query: 122 KPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIG 181
KPTI VLY+D K ARHVKTY + LK+KDF EGPWSQ+N++ GA+LLIPVP P GVLII
Sbjct: 174 KPTICVLYEDFKAARHVKTYFILLKEKDFAEGPWSQSNVEAGANLLIPVPMPYGGVLIIS 233
Query: 182 EETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLK 241
+TIVY + F AIP++ ++ + YG V DGSR+LL D G LH++ + E ++V +
Sbjct: 234 NQTIVYHNGTYFHAIPMQNTMIQVYGAVGDDGSRFLLADQYGALHVVALQTEGKEVLDVY 293
Query: 242 IELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVD 301
+E+LG+TSIAS +SYLDN VV++GS++GDSQL+KLN + D GSY+EVL+ YVN+GPI+D
Sbjct: 294 LEVLGQTSIASCVSYLDNGVVFVGSTFGDSQLVKLNSKRDESGSYIEVLDSYVNIGPIID 353
Query: 302 FCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDPF 361
FCV+DL+RQGQGQ+VTCSGA KDGSLR++RNGIGINEQAS EL GIKGMW+LR S +
Sbjct: 354 FCVMDLDRQGQGQIVTCSGADKDGSLRVIRNGIGINEQASAELPGIKGMWALRESLASEY 413
Query: 362 DTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVS 421
D +LV S+++E RI+ + DE+EE EIE F +TL+C + + +QVT VR++
Sbjct: 414 DKYLVQSYLNEIRIMTIGDSDEMEEVEIEAFL-DAKTLYCRNVNEDGWLQVTETEVRIID 472
Query: 422 STSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEIS 481
+ + + W PP + VA+AN +Q++LAT L+Y+EI D L E ++E+EI+
Sbjct: 473 AQTTSICCGWIPPPSTRITVASANPTQIVLATSSKVLIYMEILDKQLMEKARKEMEFEIA 532
Query: 482 CLDINPI---------------------------GENPSYSQIAAVGMWTDISVRIFSLP 514
C+D++P N + I AVG WT SV + SLP
Sbjct: 533 CIDLSPRLDQIDPPALTRSSQVMDEAMEDAIVSRDANALRTSICAVGFWTSSSVCVLSLP 592
Query: 515 DLNLITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLG 574
L+ ++KE +++PRSVL +FE YLL LGDG L+ L+ + G L++RK++SLG
Sbjct: 593 SLDELSKETFDKDLLPRSVLCASFEDSDYLLIGLGDGSLVTCHLDREFGTLSERKRISLG 652
Query: 575 TQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLA 634
TQPI+L F SK+ HVFA+ DRPTVIY + KKLLYSNVN KEV+ M F+S FP+ LA
Sbjct: 653 TQPISLSAFWSKDKLHVFASCDRPTVIYYNKKKLLYSNVNAKEVAVMSTFHSEVFPECLA 712
Query: 635 IAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLK--NQSCAEESEMH 692
+A ELTIGT+DDIQKLHI+S L E RRI H +SRT + ++K +E E
Sbjct: 713 LASGSELTIGTVDDIQKLHIQSFHLNEWARRIAHDSRSRTLGVATVKLSKDDGGDEVEQG 772
Query: 693 FVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSN--VYYCVGTAYVLPEENEPTKGR 750
++RL DDQTFE + ++ LD FE CSI++C F+ DS+ YY VGTA+V EE EP +GR
Sbjct: 773 YIRLFDDQTFECLKSFRLDPFESPCSIITCIFTGDSSGGTYYVVGTAFVHEEEAEPHQGR 832
Query: 751 ILVFIVE----DGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTREL 806
ILVF V D +LQL+ EKE KG+VY LNAFNGKLLA +N K+ L+KW ++ EL
Sbjct: 833 ILVFTVSGIHGDRRLQLVTEKEVKGSVYCLNAFNGKLLAGVNSKVYLFKWSESEENGEEL 892
Query: 807 QSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEI 866
SECGHHGH L LY+++RGDFIVVGDLMKSISLL +K +G+IEE ARD N+NWM+AV I
Sbjct: 893 VSECGHHGHTLVLYMESRGDFIVVGDLMKSISLLNHKQLDGSIEEIARDLNSNWMTAVGI 952
Query: 867 LDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDS 926
+DDD Y+G+E +FNLFTV++NS A+DEERGRLE +GEYHLGEFVNRFR+GSLVM+ S
Sbjct: 953 IDDDNYVGSETDFNLFTVQRNSGAASDEERGRLETIGEYHLGEFVNRFRYGSLVMQHNLS 1012
Query: 927 DVGQIP-------------------TVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKV 967
+ P +++FGTV+G+IGVI + E++ FL ++Q+ L +V
Sbjct: 1013 IGAEAPGISLSDDRPESLSPLSVQRSMLFGTVSGMIGVILPISKEKHEFLMRVQSALNQV 1072
Query: 968 IKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN-------V 1020
I+GVGG +H +WR+F N + +++A NF+DGDLIESFLDLS+ M ++ +N
Sbjct: 1073 IQGVGGFSHSEWRTFENRRSSIEAHNFIDGDLIESFLDLSKDEMKQVVDELNRDQLEGKT 1132
Query: 1021 SVEELCKRVEELTRLH 1036
++E L R+EEL+++H
Sbjct: 1133 TLEALAARIEELSQMH 1148
>gi|405970039|gb|EKC34976.1| DNA damage-binding protein 1 [Crassostrea gigas]
Length = 1160
Score = 1131 bits (2926), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 574/1118 (51%), Positives = 772/1118 (69%), Gaps = 95/1118 (8%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQ--WDAESSELITRAMGDVSDRIG 60
+V +YGRIA +ELFRP GE++D +F+ T+RY +L+ + E+ ++ITRA G+V D+IG
Sbjct: 54 EVGVYGRIAVMELFRPPGESKDLMFLLTQRYNAMILECNQNGENIDIITRAHGNVQDKIG 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
R ++ G IGIIDP CR+IGL LYDGLFKVIP + + +AFNIRLEEL V+DI+FL+GC
Sbjct: 114 RASETGIIGIIDPLCRVIGLRLYDGLFKVIPLERDNKELKAFNIRLEELTVIDIQFLHGC 173
Query: 121 AKPTIVVLYQDN--------------------KDARHVKTYEVALKDKDFVEGPWSQNNL 160
PT+++++Q N + RHVKTYE++L+DK+F +GPW Q+N+
Sbjct: 174 TTPTLILIHQANLNCYHLMTLCITNLLSFKQDQHGRHVKTYEISLRDKEFQKGPWKQDNV 233
Query: 161 DNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIRPSITK-----AYGRVDADGSR 215
+ A +LI VP P G LIIG+E+I Y + F IPI P K YG+VDA+GSR
Sbjct: 234 ETEACMLIAVPEPFGGALIIGQESITYHKGDNF--IPIAPPAIKQSTLTCYGKVDANGSR 291
Query: 216 YLLGDHAGLLHLLVITHEKE-----KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGD 270
YLLGD G L +L++ E++ V LK+ELLGET+IA I+YLDNAVVYIGS GD
Sbjct: 292 YLLGDMMGRLFMLMLEKEEKMDSTVTVKDLKVELLGETTIAECITYLDNAVVYIGSRLGD 351
Query: 271 SQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIV 330
SQL+KLN++PD GSYV+ +ER+ NLGPI+D CVVDLERQGQGQ+VTCSGAYK+GSLRI+
Sbjct: 352 SQLVKLNVEPDENGSYVQEMERFTNLGPILDMCVVDLERQGQGQLVTCSGAYKEGSLRII 411
Query: 331 RNGIGINEQASVELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIE 390
RNGIGI+E AS++L GIKG+WSLR ++ + +D +V+SF+ +TR+L +N E E+EETE+
Sbjct: 412 RNGIGIHEHASIDLPGIKGIWSLRVNSPE-YDNMIVLSFVGQTRVLMLNGE-EVEETELS 469
Query: 391 GFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVL 450
G S QT C + ++NQL+Q+T SVRL+S +++L EWK G ++++A++N QV+
Sbjct: 470 GIESDQQTFLCANVVHNQLLQITPQSVRLISCDNQKLLKEWKHSGGKNISLASSNTCQVV 529
Query: 451 LATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRI 510
A G L YLE+ G + +V + +E+E++C+D+ P+ E S + A+G+WTDIS R+
Sbjct: 530 -ACVGSELYYLELLQGDIKQVSTSTMEHEVACVDLTPLREGEEKSHLCAIGLWTDISARV 588
Query: 511 FSLPDLNLITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKK 570
SLPD N + E LGGEIIPRS+L+ FEGI YLLCALGDG L F N+ TG T+++K
Sbjct: 589 LSLPDFNSLHVEMLGGEIIPRSILMTTFEGIHYLLCALGDGSLFYFNFNIDTGYFTEKRK 648
Query: 571 VSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFP 630
V+LGTQP LRTF S +TT+VFA SDRPTVIYSSN KL++SNVNLKEV+HMCP NS +P
Sbjct: 649 VTLGTQPTVLRTFRSLSTTNVFACSDRPTVIYSSNHKLVFSNVNLKEVNHMCPLNSEGYP 708
Query: 631 DSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLK--------- 681
DSLA+A +G LTIGTID+IQKLHIR+IPLGE PRRI +QE S+TF + S++
Sbjct: 709 DSLALANDGTLTIGTIDEIQKLHIRTIPLGESPRRIAYQESSQTFGVISMRMDLQDSNGL 768
Query: 682 ------------------------------NQSCAEESEMHFVRLLDDQTFEFISTYPLD 711
S +E E+H + ++D TFE + ++ L
Sbjct: 769 NPTRPSASTHAAMMSSSSSGKVTMGTSTMGEHSAGDEVEVHSLLIIDQHTFEVLHSHQLM 828
Query: 712 TFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKG 771
E+ S++S +D YY VGTA V PEE EP +GRI++F +GKL IAEKE KG
Sbjct: 829 PNEFATSLISARLGEDPCNYYIVGTALVHPEEAEPKQGRIVIFHFHEGKLNQIAEKEIKG 888
Query: 772 AVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVG 831
A Y+L FNGKLLA+IN ++L++W +EL+ EC + I+ALY++T+GDFI+VG
Sbjct: 889 AAYTLVEFNGKLLASINSTVRLFEWTT----DKELRLECNYFNSIVALYLKTKGDFILVG 944
Query: 832 DLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGA 891
DLM+SI+LL+YK EG EE ARD N NW +AVEILDDD +LGAEN+FNLFT +K+S
Sbjct: 945 DLMRSITLLLYKPMEGTFEEIARDCNPNWTTAVEILDDDNFLGAENSFNLFTCQKDSAST 1004
Query: 892 TDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPT---VIFGTVNGVIGVIAS 948
TDE+R L+ VG +HLGEFVN FRHGSLVM+ S PT V++GTVNG +G++
Sbjct: 1005 TDEDRQNLQEVGMFHLGEFVNVFRHGSLVMQ--HSGETSTPTQGSVLYGTVNGAVGLVTQ 1062
Query: 949 LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSR 1008
+P E Y FL+ +Q+ L KVIK VG + H WRSF+ E+KT + F+DGDLIESFLDL+R
Sbjct: 1063 VPQEFYSFLQDIQSRLAKVIKSVGKIEHSFWRSFHTERKTEACEGFIDGDLIESFLDLNR 1122
Query: 1009 TRMDEISKTMNV----------SVEELCKRVEELTRLH 1036
+M E K + + +V++L K +EELTR+H
Sbjct: 1123 DKMQETVKGLQIDDGSGMKREATVDDLVKTIEELTRIH 1160
>gi|427780151|gb|JAA55527.1| Putative dna damage-binding protein 1 [Rhipicephalus pulchellus]
Length = 1181
Score = 1129 bits (2920), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 576/1135 (50%), Positives = 778/1135 (68%), Gaps = 108/1135 (9%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQ--WDAESSELITRAMGDVSDRIG 60
++ IYGRI+ ++L+RP GE +D LF T +Y +L+ +S E+IT+A G+++D
Sbjct: 54 EIGIYGRISIMKLYRPAGEKKDLLFFLTAKYNAAILECVQYGDSIEIITKAHGNIADTFS 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RP++ G IGIIDP+CR+IGL LYDGLFKVIP D + +AFNIR+EEL V D++FL+GC
Sbjct: 114 RPSETGNIGIIDPECRVIGLRLYDGLFKVIPLDKDNRELKAFNIRMEELTVQDMEFLHGC 173
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PTIV+L+QD++ ARH+KTYEV+LKDK+FV+GPW Q+++++ A+L+I VP P CG LII
Sbjct: 174 KTPTIVLLHQDSQ-ARHMKTYEVSLKDKEFVKGPWKQDHVESEANLVIAVPEPFCGALII 232
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHE---- 233
G+E+I Y + + + I IR S YG+VDA+GSRYLLGD AG L +L++ E
Sbjct: 233 GQESITYHNGDQYVVITPHLIRQSTIVCYGKVDANGSRYLLGDMAGRLFMLLLEREDKMD 292
Query: 234 --------------------KEKVTG------LKIELLGETSIASTISYLDNAVVYIGSS 267
++K+ G LK+E LGE +IA I+YLDN VVY+GS
Sbjct: 293 GTXYLLGDMAGRLFMLLLEREDKMDGTTTVKDLKLEFLGEITIAECITYLDNGVVYVGSR 352
Query: 268 YGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSL 327
GDSQLIKL+ + + +GS+VE++E + NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSL
Sbjct: 353 LGDSQLIKLHAERNDQGSFVEIMEVFTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSL 412
Query: 328 RIVRNGIGINEQASVELQGIKGMWSLR---------SSTDDPFDTF-----LVVSFISET 373
RI+RNGIGI+E AS++L GIKGMW LR DP D+ LV+SF+ +T
Sbjct: 413 RIIRNGIGIHEHASIDLPGIKGMWPLRVGPGVAPHGGDGRDPGDSAERDNTLVLSFVRQT 472
Query: 374 RILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKS 433
R+L ++ E E+EETE+ GF + QT FC + QL+QVT+ +VRLV S +++L NEWK
Sbjct: 473 RVLMLSGE-EVEETELAGFDTSQQTFFCGNVRNKQLIQVTAAAVRLVDSQTKQLLNEWKP 531
Query: 434 PPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDINPIGENPS 493
P +++V T N SQV+ A + LEIGDG+L ++ +A+LE E++CLDI P+ E
Sbjct: 532 PGARNISVVTCNQSQVVCAVRK-EVFCLEIGDGVLNQISNAELENEVACLDITPLSEKAE 590
Query: 494 YSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHL 553
+ + AVG+WTDIS+RI SLP L + KE++GGEIIPRS+L+ FEGI YLLCALGDG L
Sbjct: 591 KATLCAVGLWTDISIRILSLPSLQQLQKENIGGEIIPRSILITTFEGIHYLLCALGDGSL 650
Query: 554 LNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNV 613
FLL TG LTDRKKV+LGTQP L+TF S +T++VFA SDRPTVIYSSN KL++SNV
Sbjct: 651 FYFLLEATTGALTDRKKVTLGTQPTVLKTFKSLSTSNVFACSDRPTVIYSSNHKLVFSNV 710
Query: 614 NLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSR 673
NLKEV+HMCP NS +PDSLA+A + L IGTID+IQKLHIR++PLGE PRRI +QE ++
Sbjct: 711 NLKEVNHMCPLNSEGYPDSLALANDNTLLIGTIDEIQKLHIRTVPLGELPRRIAYQEATQ 770
Query: 674 TFAICSLK-----------------------------------------NQSCAEESEMH 692
TF + +++ N +E E+H
Sbjct: 771 TFGVITIRNDILGSSGLTPVRPSASTQAQNVTHSAQMSSIFKPGSVSTGNDQLGQEVEIH 830
Query: 693 FVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRIL 752
+ ++D TFE + + EY SI+S +D N YY VGTA VLP+E++P +GRI+
Sbjct: 831 NLLIIDQHTFEVLHAHQFMQTEYAMSIVSTRLGNDPNTYYIVGTANVLPDESDPKQGRIV 890
Query: 753 VFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGH 812
VF DGKL+ +AE+E KGA YS+ FNGKLLAAIN ++L++W + REL++EC H
Sbjct: 891 VFHWVDGKLEHVAEQEIKGAPYSMLEFNGKLLAAINSTVRLFEW----NAERELRNECSH 946
Query: 813 HGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIY 872
+ILALY++ +GDF++VGDLM+S+SLL YK EG EE ARDY NWMS+VEILDDD +
Sbjct: 947 FNNILALYLRAKGDFVLVGDLMRSMSLLAYKPLEGNFEEIARDYQTNWMSSVEILDDDTF 1006
Query: 873 LGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLP-DSDVGQI 931
LGAE+ NLF +K+S TDEER L+ VG++HLGEFVN FRHGSLVM+ P ++
Sbjct: 1007 LGAESTTNLFVCQKDSAATTDEERQHLQEVGQFHLGEFVNVFRHGSLVMQHPGETSSPTQ 1066
Query: 932 PTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDA 991
+V+FGT++G IG+++ LP + Y FL ++Q L KVIK VG ++H WRSF+ E+KT A
Sbjct: 1067 GSVLFGTIHGAIGLVSQLPADFYTFLSEVQEKLTKVIKSVGKIDHAFWRSFSTERKTEPA 1126
Query: 992 KNFLDGDLIESFLDLSRTRMDEISKTM----------NVSVEELCKRVEELTRLH 1036
F+DGDLIESFLDLSR +M E+ + + + SV++L K +EEL+R+H
Sbjct: 1127 VGFIDGDLIESFLDLSRDKMQEVVQGIQMDDGSGMKRDASVDDLIKIIEELSRIH 1181
>gi|326919947|ref|XP_003206238.1| PREDICTED: DNA damage-binding protein 1-like [Meleagris gallopavo]
Length = 1079
Score = 1125 bits (2909), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 568/1081 (52%), Positives = 750/1081 (69%), Gaps = 69/1081 (6%)
Query: 18 PH--GEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIGRPTDNGQIGIIDP 73
PH GE++D LFI T +Y C+L++ + ++ ++ITRA G+V DRIGRP++ G IGIIDP
Sbjct: 6 PHTRGESKDLLFILTAKYNACILEYKQNGDNIDIITRAHGNVQDRIGRPSETGIIGIIDP 65
Query: 74 DCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGCAKPTIVVLYQDNK 133
+CR+IGL LYDGLFKVIP D + + +AFNIRLEELQV+D+KFLYGC PTI +YQD +
Sbjct: 66 ECRMIGLRLYDGLFKVIPLDRENKELKAFNIRLEELQVIDVKFLYGCQAPTICFVYQDPQ 125
Query: 134 DARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAF 193
RHVKTYEV+L++K+F +GPW Q N++ A ++I VP P G +IIG+E+I Y + + +
Sbjct: 126 -GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIIIGQESITYHNGDKY 184
Query: 194 KAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-----KVTGLKIELL 245
AI I+ S + RVD +GSRYLLGD G L +L++ E++ + L++ELL
Sbjct: 185 LAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMDGTVTLKDLRVELL 244
Query: 246 GETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVV 305
GETSIA ++YLDN VV++GS GDSQL+KLN+ + +GSYV +E + NLGPIVD CVV
Sbjct: 245 GETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMETFTNLGPIVDMCVV 304
Query: 306 DLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDPFDTFL 365
DLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W LRS + D L
Sbjct: 305 DLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWPLRSDSHHEMDNML 364
Query: 366 VVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSR 425
V+SF+ +TR+L +N E E+EETE+ GF QT FC + + QL+Q+TS SVRLVS +
Sbjct: 365 VLSFVGQTRVLMLNGE-EVEETELTGFVDDQQTFFCGNVAHQQLIQITSASVRLVSQEPK 423
Query: 426 ELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDI 485
L +EWK P G +++VA+ N++QV++A G L YLEI L ++ ++E+E++CLDI
Sbjct: 424 ALVSEWKEPNGKNISVASCNSNQVVVAVGRA-LYYLEIRPQELRQINCTEMEHEVACLDI 482
Query: 486 NPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEGISYLL 545
P+G+ S + A+G+WTDIS RI LP ++ KE LGGEIIPRS+L+ FE YLL
Sbjct: 483 TPLGDTNGMSPLCAIGLWTDISARILKLPSFEMLHKEMLGGEIIPRSILMTTFESSHYLL 542
Query: 546 CALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSN 605
CALGDG L F L+++TG L+DRKKV+LGTQP LRTF S +TT+VFA SDRPTVIYSSN
Sbjct: 543 CALGDGALFYFGLSLETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVFACSDRPTVIYSSN 602
Query: 606 KKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRR 665
KL++SNVNLKEV++MCP NS +PDSLA+A LTIGTID+IQKLHIR++PL E PR+
Sbjct: 603 HKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKLHIRTVPLYESPRK 662
Query: 666 ICHQEQSRTFAICSLK---------------------------------------NQSCA 686
IC+QE S+ F + S + S
Sbjct: 663 ICYQEVSQCFGVLSSRIEVQDASGGTTALRPSASTQALSSSVSTSKLFSSSTAPHETSFG 722
Query: 687 EESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEP 746
EE E+H + ++D TFE + + EY S++SC D N Y+ VGTA V PEE EP
Sbjct: 723 EEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFIVGTAMVYPEEAEP 782
Query: 747 TKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTREL 806
+GRI+VF DGKLQ +AEKE KGAVYS+ FNGKLLA+IN ++LY+W +EL
Sbjct: 783 KQGRIVVFHYSDGKLQSLAEKEVKGAVYSMVEFNGKLLASINSTVRLYEWT----AEKEL 838
Query: 807 QSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEI 866
++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK EG EE ARD+N NWMSAVEI
Sbjct: 839 RTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIARDFNPNWMSAVEI 898
Query: 867 LDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMR-LPD 925
LDDD +LGAEN FNLF +K+S TDEER L+ VG HLGEFVN F HGSLVM+ L +
Sbjct: 899 LDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLSHLGEFVNVFCHGSLVMQNLGE 958
Query: 926 SDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNE 985
+ +V+FGTVNG+IG++ SL Y L +Q L KVIK VG + H WRSF+ E
Sbjct: 959 TSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTE 1018
Query: 986 KKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV----------SVEELCKRVEELTRL 1035
+KT A F+DGDLIESFLD+SR +M E+ + + +V++L K VEELTR+
Sbjct: 1019 RKTEPATGFIDGDLIESFLDISRPKMQEVVANLQIDDGSGMKREATVDDLIKIVEELTRI 1078
Query: 1036 H 1036
H
Sbjct: 1079 H 1079
>gi|413948669|gb|AFW81318.1| hypothetical protein ZEAMMB73_456332 [Zea mays]
Length = 674
Score = 1118 bits (2891), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 540/673 (80%), Positives = 605/673 (89%)
Query: 364 FLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSST 423
+LVVSFISETR LAMN+EDELEETEIEGF +QTQTLFC +AI + L+QVT+ SVRLVS T
Sbjct: 2 YLVVSFISETRFLAMNMEDELEETEIEGFDAQTQTLFCQNAINDLLIQVTANSVRLVSCT 61
Query: 424 SRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEISCL 483
SREL ++W +P G+SVNVA+ANASQVLLATGGGHLVYLEI D L EVKHAQLE+EISCL
Sbjct: 62 SRELVDQWNAPAGFSVNVASANASQVLLATGGGHLVYLEIRDAKLVEVKHAQLEHEISCL 121
Query: 484 DINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEGISY 543
D+NPIGE+ YS +AAVGMWTDISVRIFSLPDL LI KE+LGGEI+PRSVLLC EG+SY
Sbjct: 122 DLNPIGEHLEYSSLAAVGMWTDISVRIFSLPDLVLIRKENLGGEIVPRSVLLCTLEGVSY 181
Query: 544 LLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYS 603
LLCALGDG+L +FLLN TGELTDRKKV+LGTQPI+LRTFSSK TTHVFA+SDRPTVIYS
Sbjct: 182 LLCALGDGNLFSFLLNASTGELTDRKKVTLGTQPISLRTFSSKGTTHVFASSDRPTVIYS 241
Query: 604 SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHP 663
SNKKLLYSNVNLKEV+HMCPFN+AAFPDSLAIAKEGEL+IGTIDDIQKLHIR+IPL E
Sbjct: 242 SNKKLLYSNVNLKEVNHMCPFNTAAFPDSLAIAKEGELSIGTIDDIQKLHIRTIPLNEQA 301
Query: 664 RRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCS 723
RRICHQEQS+T A CS K EESE H +RLLD QTFE + YPLD +E GCSI+SCS
Sbjct: 302 RRICHQEQSKTLAFCSFKYNQSVEESETHLIRLLDHQTFESLCVYPLDQYECGCSIISCS 361
Query: 724 FSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKL 783
F DD+NVYYCVGTAYV+PEENEPTKGRILVF VEDG LQLI EKETKGAVYSLNAFNGKL
Sbjct: 362 FVDDNNVYYCVGTAYVIPEENEPTKGRILVFAVEDGSLQLIVEKETKGAVYSLNAFNGKL 421
Query: 784 LAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYK 843
LAAINQKIQLYKWM R+DG+ ELQSECGHHGHILALY QTRGDFIVVGDLMKSISLL+YK
Sbjct: 422 LAAINQKIQLYKWMSREDGSHELQSECGHHGHILALYTQTRGDFIVVGDLMKSISLLVYK 481
Query: 844 HEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVG 903
HEE AIEERARDYNANWM+AVE+LDD++Y+GAEN +NLFTVRKNS+ ATD+ER +LEVVG
Sbjct: 482 HEESAIEERARDYNANWMTAVEMLDDEVYVGAENGYNLFTVRKNSDAATDDERAKLEVVG 541
Query: 904 EYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTN 963
EYHLGEFVNRFRHGSLVMRLPDS++G+IPTVIFGT+NGVIG+IASLPH+ Y FLEK Q+
Sbjct: 542 EYHLGEFVNRFRHGSLVMRLPDSEIGKIPTVIFGTINGVIGIIASLPHDHYTFLEKFQST 601
Query: 964 LRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVE 1023
L K IKGVG ++HEQWRSF+N+KKT +A+NFLDGDLIESFLDLSR++M+ +SK M VSVE
Sbjct: 602 LVKYIKGVGNMSHEQWRSFHNDKKTAEARNFLDGDLIESFLDLSRSKMEVVSKAMGVSVE 661
Query: 1024 ELCKRVEELTRLH 1036
+L K VEELTRLH
Sbjct: 662 DLSKIVEELTRLH 674
>gi|241260143|ref|XP_002404926.1| DNA repair protein xp-E, putative [Ixodes scapularis]
gi|215496735|gb|EEC06375.1| DNA repair protein xp-E, putative [Ixodes scapularis]
Length = 1148
Score = 1117 bits (2889), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 566/1105 (51%), Positives = 766/1105 (69%), Gaps = 81/1105 (7%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESS----ELITRAMGDVSDR 58
++ IYGRI+ ++LFRP GE +D LF T +Y +L+ + E+IT+A G+++D
Sbjct: 54 EIGIYGRISIMKLFRPPGEKKDLLFFLTAKYNAAILECIQDGGDGGVEIITKAHGNIADS 113
Query: 59 IGRPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLY 118
RP++ G IGIIDP+CR+IGL LYDGLFKVIP D + +AFNIR+EEL V D++FL+
Sbjct: 114 FSRPSETGNIGIIDPECRVIGLRLYDGLFKVIPLDRDNRELKAFNIRMEELTVQDMEFLH 173
Query: 119 GCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVL 178
GC PTIV+L+QD++ ARH+KTYE++LKDK+FV+GPW Q+++++ A ++I VP P C
Sbjct: 174 GCKTPTIVLLHQDSQ-ARHMKTYEISLKDKEFVKGPWKQDHVESEATIVIAVPEPFCDAR 232
Query: 179 IIGEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE 235
IG+E+I Y + + I IR S YG+VDA+GSRYLLGD AG L +L++ E +
Sbjct: 233 CIGQESITYHNGDQDVVITPHLIRQSTIVCYGKVDANGSRYLLGDMAGRLFMLLLEREDK 292
Query: 236 -----KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVL 290
V LK+E LGE +IA ++YLDN VVY+GS GDSQLIKLN + + +GSYVEV+
Sbjct: 293 MDGTTTVKDLKLEFLGEITIAECMTYLDNGVVYVGSRLGDSQLIKLNSERNEQGSYVEVM 352
Query: 291 ERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGM 350
E + NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+
Sbjct: 353 EVFTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGI 412
Query: 351 WSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLV 410
W LR +TD D LV+SF+ +TR+L ++ E E+EETE+ GF QT FC + NQL+
Sbjct: 413 WPLRVNTDSSRDNTLVLSFVGQTRVLMLSGE-EVEETELAGFDISQQTFFCGNVRNNQLI 471
Query: 411 QVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTE 470
QVT+ +VRLV ++ L NEWK P G +++V T N +QV+ A + LEI DG L +
Sbjct: 472 QVTAAAVRLVDGKTKLLLNEWKPPGGKNISVVTCNQNQVVCAVRQ-EVFCLEIADGTLKQ 530
Query: 471 VKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIP 530
+A++E E++CLDI P+ + S + AVG+WTDIS+R+ LP L + KE++GGEIIP
Sbjct: 531 TSNAEMENEVACLDITPLNDGSGKSDLCAVGLWTDISIRVLRLPSLEQLQKENIGGEIIP 590
Query: 531 RSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTH 590
RS+L+ FEG+ YLLCALGDG L FLL+ TG ++DRKKV+LGTQP L+TF S +T++
Sbjct: 591 RSILMTTFEGVHYLLCALGDGSLFYFLLDASTGAVSDRKKVTLGTQPTVLKTFKSLSTSN 650
Query: 591 VFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQ 650
VFA SDRPTVIYSSN KL++SNVNLKEV+HMCP NS +PDSLA+A +G L IGTID+IQ
Sbjct: 651 VFACSDRPTVIYSSNHKLVFSNVNLKEVNHMCPLNSEGYPDSLALANDGTLLIGTIDEIQ 710
Query: 651 KLHIRSIPLGEHPRRICHQEQSRTFAICSLK----------------------------- 681
KLHIR++PLGE RI +QE ++TF + +++
Sbjct: 711 KLHIRTVPLGE---RIAYQEATQTFGVITIRNDIQGTGGLTPVRPSASTQAQNVTYSSTM 767
Query: 682 -----------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNV 730
N +E E+H + ++D TFE + + EY SI+S D NV
Sbjct: 768 SSVFKPGTGSGNDQLGQEVEVHNLLIVDQHTFEVLHAHQFMQTEYAMSIVSTRLGSDPNV 827
Query: 731 YYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQK 790
Y+ VGTA VLP+E++P +GRI++F DGKLQ +AEKE KGA YSL FNGKLLA+IN
Sbjct: 828 YFIVGTAIVLPDESDPKQGRIIIFHWVDGKLQQVAEKEIKGAPYSLLEFNGKLLASINST 887
Query: 791 IQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIE 850
++L++W + REL +EC H +ILALY++T+GDFI+VGDLM+S+SLL YK EG+ E
Sbjct: 888 VRLFEW----NAERELHNECSHFNNILALYLKTKGDFILVGDLMRSMSLLAYKPLEGSFE 943
Query: 851 ERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEF 910
E ARDY NWM AVEILDDD +LGAE+ NLF +K+S TDE+R L+ VG++HLGEF
Sbjct: 944 EIARDYQTNWMCAVEILDDDTFLGAESTTNLFVCQKDSAATTDEDRQHLQEVGQFHLGEF 1003
Query: 911 VNRFRHGSLVMRLP-DSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIK 969
VN FRHGSLVM+ P ++ +V+FGT++G IG++A LP + Y FL ++Q NL KVIK
Sbjct: 1004 VNIFRHGSLVMQHPGEASSPTQGSVLFGTIHGAIGLVAQLPSDFYNFLLEVQGNLTKVIK 1063
Query: 970 GVGGLNHE--------QWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTM--- 1018
VG ++H WRSF+ E+KT A+ F+DGDLIESFLDLSR +M E+ + +
Sbjct: 1064 SVGKIDHTLYPFVRLFTWRSFSTERKTEQAQGFIDGDLIESFLDLSRDKMQEVLQGIQMD 1123
Query: 1019 -------NVSVEELCKRVEELTRLH 1036
+ +V++L K +EEL+R+H
Sbjct: 1124 DGSGMKRDATVDDLIKIIEELSRVH 1148
>gi|351699158|gb|EHB02077.1| DNA damage-binding protein 1 [Heterocephalus glaber]
Length = 1144
Score = 1115 bits (2884), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 567/1098 (51%), Positives = 752/1098 (68%), Gaps = 71/1098 (6%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
+V +YG+IA +ELFRP GE++D LFI T +Y C+L++ ES ++ITRA G+V DRIG
Sbjct: 54 EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RP++ G IGIIDP+CR+IGL LYDGLFKVIP D + +AFNIRLEEL V+D+KFLYGC
Sbjct: 114 RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 173
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PTI +YQD + RHVKTYEV+L++K+F +GPW Q N++ A ++I VP P G +II
Sbjct: 174 QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
G+E+I Y + + + AI I+ S + RVD +GSRYLLGD G L +L++ E++
Sbjct: 233 GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 292
Query: 236 ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
+ L++ELLGETSIA ++YLDN VV++GS GDSQL+KLN+ + +GSYV +E
Sbjct: 293 GTVALKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 352
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
+ NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W
Sbjct: 353 FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
LRS+ D LV+SF+ +TR+L +N E E+EETE+ GF QT FC + + QL+Q+
Sbjct: 413 LRSAPSRETDDTLVLSFVGQTRVLMLNGE-EVEETELVGFVDDQQTFFCGNVAHQQLIQI 471
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
TS SVRLVS + L +EWK P G + +VA+ ++SQV++A G L YL+I L ++
Sbjct: 472 TSASVRLVSQEPKALVSEWKEPQGKNNSVASGSSSQVVVAVGRA-LYYLQIHPRELRQIS 530
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
++E+E++CLDI P+G++ S + A+G+WTDIS RI LP L+ KE LGGEIIPRS
Sbjct: 531 QTEMEHEVACLDITPLGDSSGLSPLCAIGLWTDISARILQLPSFELLHKEMLGGEIIPRS 590
Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
+L+ FE YLLCALGDG L F LN++TG L+DRKKV+LGTQP LRTF S +TT+VF
Sbjct: 591 ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
A SDRPTVIYSSN KL++SNVNLKEV++MCP NS +PDSLA+A LTIGTID+IQKL
Sbjct: 651 ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 710
Query: 653 HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
HIR++PL E PR+IC+QE S+ F + S +
Sbjct: 711 HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGTTALRPSASTQALSSSVSSSKL 770
Query: 682 --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
S EE E+H + ++D TFE + + EY S++SC D N Y+
Sbjct: 771 FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 830
Query: 734 VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
VGTA V PEE EP +GRI+VF D + ++ + KGAVYS+ FNGKLLA+IN ++L
Sbjct: 831 VGTAMVYPEEAEPKQGRIVVFQYSDEEREVSSRGLVKGAVYSMVEFNGKLLASINSTVRL 890
Query: 794 YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
Y+W +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK EG EE A
Sbjct: 891 YEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 946
Query: 854 RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
RD+N NWMSAVEILDDD +LGAEN FNLF +K+S TDEER L+ VG +HLGEFVN
Sbjct: 947 RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 1006
Query: 914 FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
F HGSLVM+ L ++ +V+FGTVNG+IG++ SL Y L +Q L KVIK VG
Sbjct: 1007 FCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066
Query: 973 GLNHEQWR----SFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN--------- 1019
+ H + SF+ E+KT A F+DGDLIESFLD+SR +M E+ +
Sbjct: 1067 KIEHSLYPSRAVSFHTERKTEQATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKR 1126
Query: 1020 -VSVEELCKRVEELTRLH 1036
+ ++L K VEELTR+H
Sbjct: 1127 EATADDLIKVVEELTRIH 1144
>gi|320163506|gb|EFW40405.1| UV-damaged DNA binding protein [Capsaspora owczarzaki ATCC 30864]
Length = 1123
Score = 1114 bits (2881), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 562/1049 (53%), Positives = 730/1049 (69%), Gaps = 76/1049 (7%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+LD +YGRIA ++LFRP QD LFI TERY F VL +DA++ EL+T+A GD+ D+ G
Sbjct: 76 LLDAGMYGRIAIMQLFRPPNAQQDLLFICTERYAFTVLAYDAQTGELVTKANGDLQDKSG 135
Query: 61 RPTDNGQIGIIDPDCRL------------------------------IGLHLYDGLFKVI 90
P D G IG+IDPDCRL IGL LY+G+FKVI
Sbjct: 136 NPADLGPIGVIDPDCRLIGENAALDLWGGGHPPFYFGCSCSLRLVVWIGLRLYNGMFKVI 195
Query: 91 PFDNKGQLKEAFNIRLEELQVLDIKFLYGCAKPTIVVLYQDNKDARHVKTYEVALKDKDF 150
P D GQ K+AFNIRLEELQV DIKFL G +PTI+VLYQD K+ RHVKTY+V LK+K+F
Sbjct: 196 PIDPHGQFKDAFNIRLEELQVFDIKFLRGYDRPTILVLYQDTKETRHVKTYQVLLKEKEF 255
Query: 151 VEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVD 210
EGPW+QNN++ GA LLIPV PL GVLI+GE+TI Y S +AF+++ +RP+I K Y +D
Sbjct: 256 AEGPWAQNNVEGGASLLIPVLMPLGGVLIVGEQTITYHSGSAFRSVAMRPAIIKCYSVID 315
Query: 211 ADGSRYLLGDHAGLLHLLVITHEKE-KVTGLKIELLGETSIASTISYLDNAVVYIGSSYG 269
+ R+LL D G L +++TH+++ KVT +KI+ LG TSI S ++YLDN VV+ GS +G
Sbjct: 316 TN--RFLLADSEGNLLSVLLTHDRQDKVTAIKIDRLGVTSILSCLTYLDNGVVFGGSQFG 373
Query: 270 DSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRI 329
DSQL++L + D GS+V VLE + NLGPI D VVDLERQGQ QVVTCSGA+KDGSLR+
Sbjct: 374 DSQLLRLATERDETGSFVRVLESFSNLGPICDMAVVDLERQGQCQVVTCSGAFKDGSLRV 433
Query: 330 VRNGIGINEQASVELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEI 389
VRNG+GI EQA++EL GIKG+WSL+ + + + LVVSFI ETR+L M+ +ELEE +I
Sbjct: 434 VRNGVGIEEQATIELPGIKGIWSLKPTEAALYRSILVVSFIGETRLLGMSSGEELEEMQI 493
Query: 390 EGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEW-------KSPPGYSVNVA 442
G +QTL C + +Q +QVT+ VRLV+ +++ L W + PG + +A
Sbjct: 494 PGLDQNSQTLHCANVSGDQFLQVTATEVRLVNCSTQALVASWSPASVPDRYAPGTRITMA 553
Query: 443 TANASQVLLATGGGHLVYLEI-GDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVG 501
++N QVL+A GGGHLV L + G L + HA++++EI+C+DI PIG P SQ+ AVG
Sbjct: 554 SSNDFQVLVACGGGHLVCLSVEASGNLVPIGHARMDHEIACVDITPIGGQP-LSQVCAVG 612
Query: 502 MWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMK 561
+WTDI+VR+ S+P L + + L G+IIPRS+L+ FEG LLCALGDG + + ++
Sbjct: 613 LWTDITVRVLSVPTLEQVLVQPLEGQIIPRSILMATFEGQPRLLCALGDGSMHTYSFDVL 672
Query: 562 TGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHM 621
+ +LTD K+VSLGTQPI L F S+ THVFA SDRPTVIYSSN+KLLYSNVNL+EV+H+
Sbjct: 673 SQQLTDHKRVSLGTQPILLSAFVSRGQTHVFACSDRPTVIYSSNRKLLYSNVNLREVTHV 732
Query: 622 CPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLK 681
CPF S +F D LA+ LTIGTID+IQKLH+R+IPLGE PRRI + E +RT+ + ++
Sbjct: 733 CPFTSESFADCLAVVSSTSLTIGTIDEIQKLHVRAIPLGEMPRRIAYHEPTRTYGVATVT 792
Query: 682 ------------NQSCAEES------------------EMHFVRLLDDQTFEFISTYPLD 711
N + ++ + FVRL D QTFE ++ L
Sbjct: 793 LAEPLPVGSNSGNVAARAQNVRPMAFDDGPRSPSDVLEDTSFVRLFDGQTFEIRDSFQLP 852
Query: 712 TFEYGCSILSCSF---SDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKE 768
+ E S +SCSF S DS VY VGTA+V+P E+EP +GRILVF V G L L+ K+
Sbjct: 853 STETIMSFISCSFANDSSDSTVYLVVGTAFVIPSEDEPKRGRILVFDVAGGALHLVTAKD 912
Query: 769 TKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFI 828
KG VYSLNAFNGKLLA IN K+ L+KW L DG REL SEC HHGHIL LY+++RGDFI
Sbjct: 913 VKGCVYSLNAFNGKLLAGINSKVNLFKWNLTGDGIRELVSECSHHGHILTLYLKSRGDFI 972
Query: 829 VVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNS 888
+VGDLM+SISLL+YK +IEE A+D NW++AV++LDDD+++G E++FN+FT R+N
Sbjct: 973 IVGDLMRSISLLMYKSGTSSIEEIAQDTCPNWVTAVDMLDDDVFIGGESSFNIFTCRRNL 1032
Query: 889 EGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQI-PTVIFGTVNGVIGVIA 947
E +TDEER RLEVVGE+H+GEF+N+FR GSLVM+LPD I P+ +FGT NGVIGVIA
Sbjct: 1033 EASTDEERKRLEVVGEFHVGEFINQFRAGSLVMKLPDEQEQPIQPSTLFGTGNGVIGVIA 1092
Query: 948 SLPHEQYLFLEKLQTNLRKVIKGVGGLNH 976
L QY FL+ +Q + KVIKGVGGLNH
Sbjct: 1093 RLTRSQYEFLQLVQAAMAKVIKGVGGLNH 1121
>gi|410045300|ref|XP_508472.4| PREDICTED: DNA damage-binding protein 1 [Pan troglodytes]
Length = 1107
Score = 1109 bits (2869), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 564/1093 (51%), Positives = 736/1093 (67%), Gaps = 105/1093 (9%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
+V +YG+IA +ELFRP GE++D LFI T +Y C+L++ ES ++ITRA G+V DRIG
Sbjct: 61 EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 120
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RP++ G IGIIDP+CR+IGL LYDGLFKVIP D + +AFNIRLEEL V+D+KFLYGC
Sbjct: 121 RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 180
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PTI +YQD + RHVKTYEV+L++K+F +GPW Q N++ A ++I VP P G +II
Sbjct: 181 QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 239
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
G+E+I Y + + + AI I+ S + RVD +GSRYLLGD G L +L++ E++
Sbjct: 240 GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 299
Query: 236 ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
+ L++ELLGETSIA ++YLDN VV++GS GDSQL+KLN+ + +GSYV +E
Sbjct: 300 GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 359
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
+ NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W
Sbjct: 360 FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 419
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
LRS + D LV+SF+ +TR+L +N E E+EETE+ GF QT FC + + QL+Q+
Sbjct: 420 LRSDPNRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQI 478
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
TS SVRLVS + L +EWK P +++VA+ N+SQV++A G L YL+I L ++
Sbjct: 479 TSASVRLVSQEPKALVSEWKEPQAKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 537
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
H ++E+E++CLDI P+G++ S + A+G+WTDIS RI LP L+ KE LGGEIIPRS
Sbjct: 538 HTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 597
Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
+L+ FE YLLCALGDG L F LN++TG L+DRKKV+LGTQP LRTF S +TT+VF
Sbjct: 598 ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 657
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
A SDRPTVIYSSN KL++SNVNLKEV++MCP NS +PDSLA+A LTIGTID+IQKL
Sbjct: 658 ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 717
Query: 653 HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
HIR++PL E PR+IC+QE S+ F + S +
Sbjct: 718 HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGTTALRPSASTQALSSSVSSSKL 777
Query: 682 --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
S EE E+H + ++D TFE + + EY S++SC D N Y+
Sbjct: 778 FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 837
Query: 734 VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
VGTA V PEE EP +GRI+VF DGKLQ +AEKE KGAVYS+ FNGKLLA+IN ++L
Sbjct: 838 VGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 897
Query: 794 YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
Y+W +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK EG EE A
Sbjct: 898 YEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 953
Query: 854 RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
RD+N NWMSAVEILDDD +LGAEN FNLF +K+S TDEER L+ VG +HLGEFVN
Sbjct: 954 RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 1013
Query: 914 FRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGG 973
F HGSLVM Q L KVIK VG
Sbjct: 1014 FCHGSLVM---------------------------------------QNRLNKVIKSVGK 1034
Query: 974 LNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN----------VSVE 1023
+ H WRSF+ E+KT A F+DGDLIESFLD+SR +M E+ + + +
Sbjct: 1035 IEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATAD 1094
Query: 1024 ELCKRVEELTRLH 1036
+L K VEELTR+H
Sbjct: 1095 DLIKVVEELTRIH 1107
>gi|298711490|emb|CBJ26578.1| n/a [Ectocarpus siliculosus]
Length = 1135
Score = 1105 bits (2857), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 560/1065 (52%), Positives = 740/1065 (69%), Gaps = 47/1065 (4%)
Query: 2 LDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGR 61
LDVP+YG IAT+E +R E + +F+ TERY+FCVLQ+D E+ TR+ G V DRIGR
Sbjct: 53 LDVPLYGAIATMEAYRLADENCERIFVLTERYQFCVLQYDVSRQEIRTRSSGSVKDRIGR 112
Query: 62 PTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGCA 121
DN +IG++DP R+IGLHLY+G FKVIP D KGQLK+AFNIRLEEL+VLDI+FL GC
Sbjct: 113 AIDNSKIGVMDPQSRMIGLHLYEGYFKVIPMDAKGQLKDAFNIRLEELEVLDIQFLSGCP 172
Query: 122 KPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIG 181
K TI VLYQD ++ARH+KTY ++ +DK+F GPW+Q N+++ A LIPVP P GVLI+G
Sbjct: 173 KATIAVLYQDQRNARHIKTYTISTRDKEFDTGPWAQLNVEHNASELIPVPAPFGGVLILG 232
Query: 182 EETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHE--KEKVTG 239
+TI Y S AF IPI+ + AYG VDADGSR L+ DH+G LH++++T + V
Sbjct: 233 HQTICYHSGKAFITIPIQNTRMCAYGWVDADGSRLLVSDHSGGLHVVILTPDATNTAVET 292
Query: 240 LKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPI 299
IE LGETS AS+ISYLDN VV+IGS+ GDSQLIKLN + DA+G+Y++VLE Y NLGPI
Sbjct: 293 AHIEALGETSCASSISYLDNGVVFIGSASGDSQLIKLNPEKDAQGTYIQVLETYDNLGPI 352
Query: 300 VDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDD 359
+D CV DL+RQGQGQ VTCSG KDGSLRI+RNGIGINE A++EL GIKGMWSLR S +
Sbjct: 353 LDMCVADLDRQGQGQAVTCSGCSKDGSLRIIRNGIGINEHAAIELAGIKGMWSLRPSNTN 412
Query: 360 PFDTFLVVSFISETRILAMNLEDE-----LEETEIEGFCSQTQTLFCHDAIYNQLVQVTS 414
D +LV +FISETR+LA EDE L E EI GF + TLFC N VQVT
Sbjct: 413 -HDKYLVQAFISETRVLAFE-EDEDGDHQLAEGEIAGF-QEGCTLFCGCVGGNMAVQVTK 469
Query: 415 GSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHA 474
V L+ + + W P ++ VA+ NA++V+LA GGG+LV+LE+ V+ A
Sbjct: 470 RGVVLICCDGLQEIDRWDPPTDLNITVASGNATRVVLALGGGNLVHLEVDATAKKLVQKA 529
Query: 475 --QLEYEISCLDINPIGENP-----------------SYSQIAAVGMWTDISVRIFSLPD 515
QL+ EI+C+ +NP P + AVGMWTD++VR+ SLPD
Sbjct: 530 RVQLDNEIACISLNPPSNQPVSNAEPATTAMECDEESKLDSLVAVGMWTDMTVRLLSLPD 589
Query: 516 LNLITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNM---KTGELTDRKKVS 572
L ++ + LGG+ RSV+L + YL LGDGH+++F L + T L KKV+
Sbjct: 590 LQGVSSQPLGGDTQARSVILATIADVHYLFVGLGDGHVVSFPLEVTAESTLALGTPKKVA 649
Query: 573 LGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDS 632
LGTQP+ L F + VF ASDRP VIY S KLLY+NVN+ EV+ +C F+S+ P
Sbjct: 650 LGTQPVGLACFRNNGMVCVFVASDRPAVIYCSGGKLLYANVNMGEVNSVCSFDSSELPHC 709
Query: 633 LAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCA---EES 689
LA+A E LTIGTIDDIQKLHI+ + LGE P+RI H + R F I + ++ EE
Sbjct: 710 LALASENSLTIGTIDDIQKLHIQKVSLGEAPQRITHHDSGRMFGIITTSYRAVENSDEEE 769
Query: 690 EMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKG 749
E +FV+ LDD FE + +PLD FE G S++SC F++D Y VGT YV +E EP G
Sbjct: 770 EHNFVKFLDDTNFEELYCHPLDAFENGSSMVSCVFANDKKEYLVVGTGYVREDECEPAVG 829
Query: 750 RILVFIVE----DGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRE 805
R+LVF VE + K+ L AE ET+GAVY LN FNGKLLA IN K+QL++W+ +DDG +E
Sbjct: 830 RLLVFSVEGQGAERKVDLAAEVETRGAVYVLNGFNGKLLACINSKVQLFRWIEKDDGIQE 889
Query: 806 LQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVE 865
LQ+ECG+HGHILAL++Q+RGDFI+VGDLM+S+SLL+YK +GAIEE ARDY+ANWM+AVE
Sbjct: 890 LQTECGYHGHILALHMQSRGDFIIVGDLMRSVSLLVYKAVDGAIEEVARDYHANWMTAVE 949
Query: 866 ILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLP- 924
+L+DD+Y+G E + N+FT+R+N++ AT+EER RLE+ GE+HLGEFVN+F GSL+M+
Sbjct: 950 MLNDDVYIGGEADCNIFTLRRNADAATEEERARLEIQGEFHLGEFVNKFCRGSLLMQSSE 1009
Query: 925 -------DSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHE 977
DS + + ++FGTVNG++G I +L + + FL +LQT + KV+KGVGG +H+
Sbjct: 1010 VNSPGGMDSPLVKGQPLLFGTVNGMVGTILTLTEDNHRFLAQLQTAMTKVVKGVGGFSHD 1069
Query: 978 QWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSV 1022
+WRSF N ++T + NF+DGDL+ES+LD+ R +E+ + ++ V
Sbjct: 1070 EWRSFTNGRRTSPSSNFIDGDLVESYLDMPRHNQEEVLRHVDTPV 1114
>gi|156389050|ref|XP_001634805.1| predicted protein [Nematostella vectensis]
gi|156221892|gb|EDO42742.1| predicted protein [Nematostella vectensis]
Length = 1157
Score = 1098 bits (2841), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 566/1114 (50%), Positives = 756/1114 (67%), Gaps = 86/1114 (7%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
MLDV +YG + +ELFRP E QD LFI T +Y+ C+L + E+ +++TRA GDV DRIG
Sbjct: 52 MLDVGLYGSVGVMELFRPPNEPQDLLFILTAKYRVCILGYRKETGDIVTRACGDVQDRIG 111
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPF--DNKGQLKEAFNIRLEELQVLDIKFLY 118
RP+D GQIGIIDP CR+IGL LYDGLFKVIP D+ +LK AFNIRLEEL V+DI+FLY
Sbjct: 112 RPSDTGQIGIIDPSCRVIGLRLYDGLFKVIPLELDSDKELK-AFNIRLEELHVVDIQFLY 170
Query: 119 GCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVL 178
GCA PTIV +YQD RHVKTYE+ L+D +F +GPW Q+N++ A +I VP PL G L
Sbjct: 171 GCANPTIVFIYQD-PHGRHVKTYEINLRDHEFAKGPWKQDNVEVEACRVIAVPNPLGGAL 229
Query: 179 IIGEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEK- 234
IIG+E+I Y + + AI ++ S +G++D +GSRYLLGD G L++L++ ++
Sbjct: 230 IIGQESITYHKGSNYHAIAPPALKQSSLTCHGKIDTNGSRYLLGDMNGRLYMLLLERQEL 289
Query: 235 ----EKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVL 290
+V LK+E+LGETSIA + YLDN VV+IGS GDSQL KL+ +PDA GSYV+V+
Sbjct: 290 IDGTYEVKDLKLEMLGETSIAHCLVYLDNGVVFIGSMLGDSQLAKLSTEPDADGSYVQVM 349
Query: 291 ERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGM 350
E + NLGPIVD VVDLERQGQGQ+VTCSGA K+GSLRI+RNGIGI+E A+++L GI G+
Sbjct: 350 ETFTNLGPIVDMVVVDLERQGQGQLVTCSGAKKEGSLRIIRNGIGIHEHATIDLAGIMGI 409
Query: 351 WSLR-SSTDDPFDTFLVVSFISETR---------ILAMNLEDELEETEIEGFCSQTQTLF 400
W+L+ T +D LV+SF+ ++R +L+++ E E+EETEI GF QT +
Sbjct: 410 WALKLRKTQQEYDDTLVLSFVGQSRRVLCTIHSRVLSLSGE-EVEETEIPGFSDDQQTYY 468
Query: 401 CHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVY 460
+ QL+QVT+ SVRLV+ +R+L +EWK P +++VA+ N QV+ A G L Y
Sbjct: 469 SGNVTGAQLIQVTAASVRLVNCETRQLVSEWKHPSAKNISVASCNTEQVVAAVGS-ELYY 527
Query: 461 LEIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLIT 520
+EI G L ++ LEYE++CLDI P E S + + AVG+WTDISVR+ LP LN +
Sbjct: 528 IEIMPGELRQISQVTLEYEVACLDITPTVEGKSRADMVAVGLWTDISVRVLQLPKLNQLH 587
Query: 521 KEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITL 580
+ LGGEIIPRS+L AFEGI YLLCALGDG L F ++ TG L + KKV+LGTQP L
Sbjct: 588 VQMLGGEIIPRSILKTAFEGIHYLLCALGDGTLFYFTMDPSTGALAECKKVTLGTQPTML 647
Query: 581 RTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGE 640
RTF S +T +VFA SDRPTVIYSSN KL++SNVNLKEV+ MCP NS +PDSLA+A +G
Sbjct: 648 RTFKSLSTVNVFACSDRPTVIYSSNHKLVFSNVNLKEVNFMCPLNSQGYPDSLALANDGS 707
Query: 641 LTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQ----------------- 683
LTIGTID+IQKLHIR++PLGE PRRI +QE ++TF + S++ +
Sbjct: 708 LTIGTIDEIQKLHIRTVPLGESPRRIAYQEATQTFGVISVRIEISEPGTSGTVPLHPSAS 767
Query: 684 --------------------------SCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGC 717
+ +E E+ + ++D TFE + L E
Sbjct: 768 TTAHNVSSSVGTGVSGSASSSAPDGITFGDEMEIGSLLIIDQHTFEVTHAHQLHDNEQAT 827
Query: 718 SILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLN 777
S++SC+ SDD + YYCVGTAYV PEE EP GR+L+F + +GKL +AEKE KGAVYSL
Sbjct: 828 SLMSCTLSDDPHTYYCVGTAYVFPEEPEPKAGRLLLFHLSEGKLVQVAEKEVKGAVYSLV 887
Query: 778 AFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSI 837
FNGK+LA IN + +++W +E + EC ++ +ILALY++T+GDFI+VGDLM+S+
Sbjct: 888 EFNGKVLAGINSTVSIFEWT----ADKEFRYECSYYDNILALYLKTKGDFILVGDLMRSM 943
Query: 838 SLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERG 897
+LL+Y EG+ +E A D++ WM+A+EILDDD +LGAEN++NLFT K+S TDEER
Sbjct: 944 TLLVYLPLEGSFQEIAHDFSPKWMTAIEILDDDTFLGAENSYNLFTCTKDSGATTDEERY 1003
Query: 898 RLEVVGEYHLGEFVNRFRHGSLVMRLP-DSDVGQIPTVIFGTVNGVIGVIASLPHEQYLF 956
L+ G+YHLGEFVN FRHGSLVM P D+ V+FGTVNG IG++A + + + F
Sbjct: 1004 HLQDAGQYHLGEFVNVFRHGSLVMEHPGDASTPFQGCVLFGTVNGRIGIVAQIAQDLFNF 1063
Query: 957 LEKLQTNLRKVIKGVGGLNHEQW-----RSFNNEKKTVDAKNFLDGDLIESFLDLSRTRM 1011
L ++Q L KVIK VG ++H + + ++ +K A F+DGDLIESFLDL R RM
Sbjct: 1064 LIQVQKKLNKVIKSVGKIDHSLYPFPHCSNLSHSRKMEPAHGFIDGDLIESFLDLPRARM 1123
Query: 1012 DEISKTMNV---------SVEELCKRVEELTRLH 1036
+E+ + + +V++L K VEELTR+H
Sbjct: 1124 EEVVTGLQIDDGGMKKECTVDDLVKTVEELTRIH 1157
>gi|345328202|ref|XP_003431248.1| PREDICTED: DNA damage-binding protein 1-like [Ornithorhynchus
anatinus]
Length = 1045
Score = 1097 bits (2837), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 551/1040 (52%), Positives = 726/1040 (69%), Gaps = 65/1040 (6%)
Query: 55 VSDRIGRPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDI 114
+ DRIGRP++ G IGIIDP+CR+IGL LYDGLFKVIP D + +AFNIRLEELQV+D+
Sbjct: 13 LKDRIGRPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELQVIDV 72
Query: 115 KFLYGCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPL 174
KFLYGC PTI +YQD + RHVKTYEV+L++K+F +GPW Q N++ A ++I VP P
Sbjct: 73 KFLYGCQAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPF 131
Query: 175 CGVLIIGEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVIT 231
G +IIG+E+I Y + + + AI I+ S + RVD +GSRYLLGD G L +L++
Sbjct: 132 GGAIIIGQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLE 191
Query: 232 HEKE-----KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSY 286
E++ + L++ELLGETSIA ++YLDN VV++GS GDSQL+KLN+ + +GSY
Sbjct: 192 KEEQMDGAVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSY 251
Query: 287 VEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG 346
V +E + NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L G
Sbjct: 252 VVAMETFTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPG 311
Query: 347 IKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIY 406
IKG+W LRS+++ D LV+SF+ +TR+L +N E E+EETE+ GF QT FC + +
Sbjct: 312 IKGLWPLRSNSNRETDDTLVLSFVGQTRVLMLNGE-EVEETELTGFVDNQQTFFCGNVAH 370
Query: 407 NQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDG 466
QL+Q+TS SVRLV+ + L +EWK P G +++VA+ N+SQV++A G L YL+I
Sbjct: 371 QQLIQITSASVRLVTQEPKALVSEWKEPQGKNISVASCNSSQVVVAVGRA-LYYLQIHPR 429
Query: 467 ILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGG 526
L ++ H ++E+E++CLDI P+G++ S + A+G+WTDIS RI LP L+ KE LGG
Sbjct: 430 ELRQISHTEMEHEVACLDITPLGDSHGMSPLCAIGLWTDISARILKLPSFELLHKEMLGG 489
Query: 527 EIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSK 586
EIIPRS+L+ FE YLLCALGDG L F L+++TG L+DRKKV+LGTQP LRTF S
Sbjct: 490 EIIPRSILMTTFESSHYLLCALGDGALFYFGLSIETGLLSDRKKVTLGTQPTVLRTFRSL 549
Query: 587 NTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTI 646
+TT+VFA SDRPTVIYSSN KL++SNVNLKEV++MCP NS +PDSLA+A LTIGTI
Sbjct: 550 STTNVFACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTI 609
Query: 647 DDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------- 681
D+IQKLHIR++PL E PR+IC+QE S+ F + S +
Sbjct: 610 DEIQKLHIRTVPLYESPRKICYQEGSQCFGVLSSRIEVQDASGGTTALRPSGSTQALSSS 669
Query: 682 --------------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDD 727
S EE E+H + ++D TFE + + EY S++SC D
Sbjct: 670 VSSSKLFSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKD 729
Query: 728 SNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAI 787
+N Y+ VGTA V PEE EP +GRI+VF DGKLQ +AEKE KGAVYS+ FNGKLLA+I
Sbjct: 730 ANTYFIVGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASI 789
Query: 788 NQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEG 847
N ++LY+W +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK EG
Sbjct: 790 NSTVRLYEWT----AEKELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEG 845
Query: 848 AIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHL 907
EE ARD+N NWMSAVEILDDD +LGAEN FNLF +K+S TDEER L+ VG +HL
Sbjct: 846 NFEEIARDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHL 905
Query: 908 GEFVNRFRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRK 966
GEFVN F HGSLVM+ L ++ +V+FGTVNG+IG++ SL Y L +Q L K
Sbjct: 906 GEFVNVFCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNK 965
Query: 967 VIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV------ 1020
VIK VG + H WRSF+ E+KT A F+DGDLIESFLD+SR +M E+ + +
Sbjct: 966 VIKSVGKIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQIDDGSGM 1025
Query: 1021 ----SVEELCKRVEELTRLH 1036
+V++L K VEELTR+H
Sbjct: 1026 KREATVDDLIKIVEELTRIH 1045
>gi|452824087|gb|EME31092.1| DNA damage-binding protein 1 isoform 1 [Galdieria sulphuraria]
Length = 1128
Score = 1094 bits (2829), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 548/1081 (50%), Positives = 756/1081 (69%), Gaps = 48/1081 (4%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
++++PIYGRI+ ++L R + QD LF TE+YKF VL+W+ ++ E T A GDV DRIG
Sbjct: 51 VIELPIYGRISVMKLCRFPDDLQDSLFFLTEKYKFAVLRWNTQTGECDTIAGGDVHDRIG 110
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
PT G IGI DP GLHLYDGLFKVIP D K KEAFNIRLEEL VLDI+FLYG
Sbjct: 111 HPTSAGHIGICDPSMTCFGLHLYDGLFKVIPTDFK---KEAFNIRLEELDVLDIQFLYGH 167
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
+KPTI VLY D+++ RH+KTY V+LKDKDF GP Q NL++GA +LIPVP P+ GV+++
Sbjct: 168 SKPTIAVLYTDSEENRHLKTYTVSLKDKDFGNGPLFQGNLESGASMLIPVPTPIGGVVVL 227
Query: 181 GEETIVYCSANAFK---AIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVI------- 230
G+ET+ Y S + + +IP+ +I +AYGR+D DG+RYLLGD G+L+LLV+
Sbjct: 228 GQETVTYISGSGLRGYHSIPVSATIFRAYGRIDKDGTRYLLGDEKGILYLLVLEQSTSLS 287
Query: 231 --THEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAK-GSYV 287
T + K+TGLKI+ LGETS+ STI YLDN VYIGS +GDSQLI+L +P + GS++
Sbjct: 288 TFTETETKITGLKIQTLGETSLPSTIDYLDNGFVYIGSCHGDSQLIRLLSEPHPETGSFL 347
Query: 288 EVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI 347
EV+E Y NL PIVDFCV+D ERQGQGQVVTCSGA KDGSLRI+RNGIGI+EQASVE+ G+
Sbjct: 348 EVMETYPNLSPIVDFCVMDAERQGQGQVVTCSGAAKDGSLRIIRNGIGIHEQASVEVPGV 407
Query: 348 KGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYN 407
K ++SL+ S+ + L++SF SE+R+L + + + E F Q TL+C + + +
Sbjct: 408 KELFSLKRSSLSSQHSLLLLSFASESRVLELVSTELMAEANFPVFEMQEPTLYCGNVVGD 467
Query: 408 QLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGI 467
+VQ+T RL+S + NEW P G+ ++VA+ N+ Q+++AT GG+L+Y +I
Sbjct: 468 CIVQITPSKARLISCEDMSIVNEWHPPSGFRISVASGNSMQLIIATTGGNLIYFDIDANP 527
Query: 468 --LTEVKHAQLEYEISCLDINPIGE-------NPSYSQIAAVGMWTDISVRIFSLPDLNL 518
+ E + LEYEI LDI+P+G+ S AVGMWT++S+R +SLP L+L
Sbjct: 528 KRIMEKSYKSLEYEICSLDISPLGQAGMNLASQAIPSSFVAVGMWTEVSIRFYSLPSLDL 587
Query: 519 ITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGE-------LTDRKKV 571
I E LG ++I RS+L +G Y L ALGDG LL + L+ + L D++++
Sbjct: 588 IHTEKLGLDVIARSLLFVTMDGEDYFLAALGDGRLLTYRLDKSAKDTDSEKKFLYDQRQM 647
Query: 572 SLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNK--KLLYSNVNLKEVSHMCPFNSAAF 629
S+GTQP +L F ++N HVFAA DRPTVI+SS+ KLL SNVNL+EV+ +C F+S AF
Sbjct: 648 SIGTQPASLSIFETQNALHVFAACDRPTVIHSSSGGGKLLCSNVNLREVTRVCSFSSEAF 707
Query: 630 PDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLK-------- 681
PD LA+ EG L +GT+D+IQKLHIR+IPLGE PRRI H + FA+ + K
Sbjct: 708 PDCLALVTEGSLLLGTVDNIQKLHIRTIPLGEQPRRIAHLDTHHVFAVLTTKQVVTISED 767
Query: 682 -NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNV-----YYCVG 735
N++ +E +E +VRL+DD E + +Y L+ FE CS+++ +F DD+ Y+ VG
Sbjct: 768 GNEALSETTEEGYVRLIDDTMMEIVHSYKLEQFETPCSVITVNFGDDAAAKDNQDYFVVG 827
Query: 736 TAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYK 795
TAY +E EP++GR+LVF V + +L L+AE+ KGA+YS++AFNGK+LA++N ++L +
Sbjct: 828 TAYSYADEPEPSRGRMLVFAVREQRLTLVAERTFKGALYSMDAFNGKILASVNSMLKLVR 887
Query: 796 WMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARD 855
W + G R L EC +HG I L ++ GDFI++GDL++S+SLL YK G IE+ ARD
Sbjct: 888 WSETESGARTLTEECTYHGSIFILQIKCLGDFILIGDLVRSVSLLAYKPMNGTIEDVARD 947
Query: 856 YNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFR 915
+ +W++ +E+LD D Y+ AEN FNLFT+++NS+ +T+EER RLE VGEYHLGE VNR R
Sbjct: 948 IDPSWITVIEMLDLDYYISAENCFNLFTLKRNSDASTEEERSRLEKVGEYHLGELVNRIR 1007
Query: 916 HGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLN 975
HG LV+++P+S + + ++++GT NG +GVIAS+ + + FL LQT L +VIKGVGG+
Sbjct: 1008 HGRLVLQIPESGISILKSLLYGTANGALGVIASIDEKTFQFLHSLQTALNEVIKGVGGIQ 1067
Query: 976 HEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRL 1035
HE WR F +E++ D+KNFLDGDLIE FLDLSR +M+ ++K +NV VEEL K+VEELTR+
Sbjct: 1068 HEDWRRFTSERRIGDSKNFLDGDLIERFLDLSRDKMELVAKKVNVPVEELAKQVEELTRI 1127
Query: 1036 H 1036
H
Sbjct: 1128 H 1128
>gi|449519304|ref|XP_004166675.1| PREDICTED: DNA damage-binding protein 1a-like [Cucumis sativus]
Length = 596
Score = 1091 bits (2822), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 526/593 (88%), Positives = 562/593 (94%), Gaps = 5/593 (0%)
Query: 449 VLLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISV 508
VLLATGGG LV+LEI DG+L E KH QLE+EISCLDINPIG+NP+ SQ+AAVGMWTDISV
Sbjct: 4 VLLATGGGVLVHLEICDGLLVEKKHIQLEHEISCLDINPIGDNPNCSQLAAVGMWTDISV 63
Query: 509 RIFSLPDLNLITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTG----E 564
RIFSLPDLNL+TKE LGGEIIPRSVLLC FEG+ + LCALGDGHLLNF+LN + E
Sbjct: 64 RIFSLPDLNLLTKEQLGGEIIPRSVLLCTFEGVCFCLCALGDGHLLNFILNTNSNSNSCE 123
Query: 565 LTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPF 624
L DRKKVSLGTQPITLRTFSSKN THVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPF
Sbjct: 124 LMDRKKVSLGTQPITLRTFSSKNATHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPF 183
Query: 625 NSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLK-NQ 683
NSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEH RRICHQEQSRTFAICSL+ NQ
Sbjct: 184 NSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHARRICHQEQSRTFAICSLRYNQ 243
Query: 684 SCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEE 743
S E++EMHF+RLLDDQTFE ISTY LDT+EYGCSILSCSFSDD+NVYYCVGTAYV+PEE
Sbjct: 244 SGTEDTEMHFIRLLDDQTFESISTYALDTYEYGCSILSCSFSDDNNVYYCVGTAYVMPEE 303
Query: 744 NEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGT 803
NEPTKGRILVF+VE+GKLQLIAEKETKG+VYSLNAFNGKLLAAINQKIQLYKW LRDDGT
Sbjct: 304 NEPTKGRILVFVVEEGKLQLIAEKETKGSVYSLNAFNGKLLAAINQKIQLYKWTLRDDGT 363
Query: 804 RELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSA 863
RELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSA
Sbjct: 364 RELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSA 423
Query: 864 VEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRL 923
VEILDDDIYLGAEN FNLFTVRKNSEGATDEER RLEVVGEYHLGEFVNRF+HGSLVMRL
Sbjct: 424 VEILDDDIYLGAENYFNLFTVRKNSEGATDEERSRLEVVGEYHLGEFVNRFQHGSLVMRL 483
Query: 924 PDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFN 983
PDSDVGQIPTVIFG+VNGVIGVIASLPH+QY+FLE+LQ+NLRKVIKGVGGL+HEQWRSFN
Sbjct: 484 PDSDVGQIPTVIFGSVNGVIGVIASLPHDQYVFLERLQSNLRKVIKGVGGLSHEQWRSFN 543
Query: 984 NEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRLH 1036
NEK+T +AKNFLDGDLIESFLDL+R++M+EIS+ M+VS EELCKRVEELTRLH
Sbjct: 544 NEKRTAEAKNFLDGDLIESFLDLNRSKMEEISRAMSVSAEELCKRVEELTRLH 596
>gi|297267724|ref|XP_001082958.2| PREDICTED: DNA damage-binding protein 1 [Macaca mulatta]
Length = 1092
Score = 1090 bits (2818), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 546/1024 (53%), Positives = 716/1024 (69%), Gaps = 57/1024 (5%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
+V +YG+IA +ELFRP GE++D LFI T +Y C+L++ ES ++ITRA G+V DRIG
Sbjct: 54 EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RP++ G IGIIDP+CR+IGL LYDGLFKVIP D + +AFNIRLEEL V+D+KFLYGC
Sbjct: 114 RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 173
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PTI +YQD + RHVKTYEV+L++K+F +GPW Q N++ A ++I VP P G +II
Sbjct: 174 QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
G+E+I Y + + + AI I+ S + RVD +GSRYLLGD G L +L++ E++
Sbjct: 233 GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 292
Query: 236 ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
+ L++ELLGETSIA ++YLDN VV++GS GDSQL+KLN+ + +GSYV +E
Sbjct: 293 GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 352
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
+ NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W
Sbjct: 353 FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
LRS + D LV+SF+ +TR+L +N E E+EETE+ GF QT FC + + QL+Q+
Sbjct: 413 LRSDPNRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQI 471
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
TS SVRLVS + L +EWK P +++VA+ N+SQV++A G L YL+I L ++
Sbjct: 472 TSASVRLVSQEPKALVSEWKEPQAKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 530
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
H ++E+E++CLDI P+G++ S + A+G+WTDIS RI LP L+ KE LGGEIIPRS
Sbjct: 531 HTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 590
Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
+L+ FE YLLCALGDG L F LN++TG L+DRKKV+LGTQP LRTF S +TT+VF
Sbjct: 591 ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
A SDRPTVIYSSN KL++SNVNLKEV++MCP NS +PDSLA+A LTIGTID+IQKL
Sbjct: 651 ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 710
Query: 653 HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
HIR++PL E PR+IC+QE S+ F + S +
Sbjct: 711 HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGTTALRPSASTQALSSSVSSSKL 770
Query: 682 --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
S EE E+H + ++D TFE + + EY S++SC D N Y+
Sbjct: 771 FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 830
Query: 734 VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
VGTA V PEE EP +GRI+VF DGKLQ +AEKE KGAVYS+ FNGKLLA+IN ++L
Sbjct: 831 VGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 890
Query: 794 YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
Y+W +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK EG EE A
Sbjct: 891 YEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 946
Query: 854 RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
RD+N NWMSAVEILDDD +LGAEN FNLF +K+S TDEER L+ VG +HLGEFVN
Sbjct: 947 RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 1006
Query: 914 FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
F HGSLVM+ L ++ +V+FGTVNG+IG++ SL Y L +Q L KVIK VG
Sbjct: 1007 FCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066
Query: 973 GLNH 976
+ H
Sbjct: 1067 KIEH 1070
>gi|119594339|gb|EAW73933.1| damage-specific DNA binding protein 1, 127kDa, isoform CRA_a [Homo
sapiens]
Length = 1094
Score = 1089 bits (2817), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 546/1024 (53%), Positives = 716/1024 (69%), Gaps = 57/1024 (5%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
+V +YG+IA +ELFRP GE++D LFI T +Y C+L++ ES ++ITRA G+V DRIG
Sbjct: 54 EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RP++ G IGIIDP+CR+IGL LYDGLFKVIP D + +AFNIRLEEL V+D+KFLYGC
Sbjct: 114 RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 173
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PTI +YQD + RHVKTYEV+L++K+F +GPW Q N++ A ++I VP P G +II
Sbjct: 174 QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
G+E+I Y + + + AI I+ S + RVD +GSRYLLGD G L +L++ E++
Sbjct: 233 GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 292
Query: 236 ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
+ L++ELLGETSIA ++YLDN VV++GS GDSQL+KLN+ + +GSYV +E
Sbjct: 293 GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 352
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
+ NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W
Sbjct: 353 FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
LRS + D LV+SF+ +TR+L +N E E+EETE+ GF QT FC + + QL+Q+
Sbjct: 413 LRSDPNRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQI 471
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
TS SVRLVS + L +EWK P +++VA+ N+SQV++A G L YL+I L ++
Sbjct: 472 TSASVRLVSQEPKALVSEWKEPQAKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 530
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
H ++E+E++CLDI P+G++ S + A+G+WTDIS RI LP L+ KE LGGEIIPRS
Sbjct: 531 HTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 590
Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
+L+ FE YLLCALGDG L F LN++TG L+DRKKV+LGTQP LRTF S +TT+VF
Sbjct: 591 ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
A SDRPTVIYSSN KL++SNVNLKEV++MCP NS +PDSLA+A LTIGTID+IQKL
Sbjct: 651 ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 710
Query: 653 HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
HIR++PL E PR+IC+QE S+ F + S +
Sbjct: 711 HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGTTALRPSASTQALSSSVSSSKL 770
Query: 682 --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
S EE E+H + ++D TFE + + EY S++SC D N Y+
Sbjct: 771 FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 830
Query: 734 VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
VGTA V PEE EP +GRI+VF DGKLQ +AEKE KGAVYS+ FNGKLLA+IN ++L
Sbjct: 831 VGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 890
Query: 794 YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
Y+W +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK EG EE A
Sbjct: 891 YEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 946
Query: 854 RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
RD+N NWMSAVEILDDD +LGAEN FNLF +K+S TDEER L+ VG +HLGEFVN
Sbjct: 947 RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 1006
Query: 914 FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
F HGSLVM+ L ++ +V+FGTVNG+IG++ SL Y L +Q L KVIK VG
Sbjct: 1007 FCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066
Query: 973 GLNH 976
+ H
Sbjct: 1067 KIEH 1070
>gi|221040048|dbj|BAH11787.1| unnamed protein product [Homo sapiens]
Length = 1092
Score = 1089 bits (2817), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 546/1024 (53%), Positives = 715/1024 (69%), Gaps = 57/1024 (5%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
+V +YG+IA +ELFRP GE++D LFI T +Y C+L++ ES ++ITRA G+V DRIG
Sbjct: 54 EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RP++ G IGIIDP+CR+IGL LYDGLFKVIP D + +AFNIRLEEL V+D+KFLYGC
Sbjct: 114 RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 173
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PTI +YQD + RHVKTYEV+L++K+F +GPW Q N++ A ++I VP P G +II
Sbjct: 174 QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
G+E+I Y + + + AI I+ S + RVD GSRYLLGD G L +L++ E++
Sbjct: 233 GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPHGSRYLLGDMEGRLFMLLLEKEEQMD 292
Query: 236 ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
+ L++ELLGETSIA ++YLDN VV++GS GDSQL+KLN+ + +GSYV +E
Sbjct: 293 GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 352
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
+ NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W
Sbjct: 353 FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
LRS + D LV+SF+ +TR+L +N E E+EETE+ GF QT FC + + QL+Q+
Sbjct: 413 LRSDPNRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQI 471
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
TS SVRLVS + L +EWK P +++VA+ N+SQV++A G L YL+I L ++
Sbjct: 472 TSASVRLVSQEPKALVSEWKEPQAKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 530
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
H ++E+E++CLDI P+G++ S + A+G+WTDIS RI LP L+ KE LGGEIIPRS
Sbjct: 531 HTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 590
Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
+L+ FE YLLCALGDG L F LN++TG L+DRKKV+LGTQP LRTF S +TT+VF
Sbjct: 591 ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
A SDRPTVIYSSN KL++SNVNLKEV++MCP NS +PDSLA+A LTIGTID+IQKL
Sbjct: 651 ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 710
Query: 653 HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
HIR++PL E PR+IC+QE S+ F + S +
Sbjct: 711 HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGTTALRPSASTQALSSSVSSSKL 770
Query: 682 --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
S EE E+H + ++D TFE + + EY S++SC D N Y+
Sbjct: 771 FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 830
Query: 734 VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
VGTA V PEE EP +GRI+VF DGKLQ +AEKE KGAVYS+ FNGKLLA+IN ++L
Sbjct: 831 VGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 890
Query: 794 YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
Y+W +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK EG EE A
Sbjct: 891 YEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 946
Query: 854 RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
RD+N NWMSAVEILDDD +LGAEN FNLF +K+S TDEER L+ VG +HLGEFVN
Sbjct: 947 RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNV 1006
Query: 914 FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
F HGSLVM+ L ++ +V+FGTVNG+IG++ SL Y L +Q L KVIK VG
Sbjct: 1007 FCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVG 1066
Query: 973 GLNH 976
+ H
Sbjct: 1067 KIEH 1070
>gi|452824086|gb|EME31091.1| DNA damage-binding protein 1 isoform 2 [Galdieria sulphuraria]
Length = 1150
Score = 1081 bits (2796), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 548/1103 (49%), Positives = 756/1103 (68%), Gaps = 70/1103 (6%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
++++PIYGRI+ ++L R + QD LF TE+YKF VL+W+ ++ E T A GDV DRIG
Sbjct: 51 VIELPIYGRISVMKLCRFPDDLQDSLFFLTEKYKFAVLRWNTQTGECDTIAGGDVHDRIG 110
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
PT G IGI DP GLHLYDGLFKVIP D K KEAFNIRLEEL VLDI+FLYG
Sbjct: 111 HPTSAGHIGICDPSMTCFGLHLYDGLFKVIPTDFK---KEAFNIRLEELDVLDIQFLYGH 167
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
+KPTI VLY D+++ RH+KTY V+LKDKDF GP Q NL++GA +LIPVP P+ GV+++
Sbjct: 168 SKPTIAVLYTDSEENRHLKTYTVSLKDKDFGNGPLFQGNLESGASMLIPVPTPIGGVVVL 227
Query: 181 GEETIVYCSANAFK---AIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVI------- 230
G+ET+ Y S + + +IP+ +I +AYGR+D DG+RYLLGD G+L+LLV+
Sbjct: 228 GQETVTYISGSGLRGYHSIPVSATIFRAYGRIDKDGTRYLLGDEKGILYLLVLEQSTSLS 287
Query: 231 --THEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAK-GSYV 287
T + K+TGLKI+ LGETS+ STI YLDN VYIGS +GDSQLI+L +P + GS++
Sbjct: 288 TFTETETKITGLKIQTLGETSLPSTIDYLDNGFVYIGSCHGDSQLIRLLSEPHPETGSFL 347
Query: 288 EVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI 347
EV+E Y NL PIVDFCV+D ERQGQGQVVTCSGA KDGSLRI+RNGIGI+EQASVE+ G+
Sbjct: 348 EVMETYPNLSPIVDFCVMDAERQGQGQVVTCSGAAKDGSLRIIRNGIGIHEQASVEVPGV 407
Query: 348 KGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYN 407
K ++SL+ S+ + L++SF SE+R+L + + + E F Q TL+C + + +
Sbjct: 408 KELFSLKRSSLSSQHSLLLLSFASESRVLELVSTELMAEANFPVFEMQEPTLYCGNVVGD 467
Query: 408 QLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGI 467
+VQ+T RL+S + NEW P G+ ++VA+ N+ Q+++AT GG+L+Y +I
Sbjct: 468 CIVQITPSKARLISCEDMSIVNEWHPPSGFRISVASGNSMQLIIATTGGNLIYFDIDANP 527
Query: 468 --LTEVKHAQLEYEISCLDINPIGE-------NPSYSQIAAVGMWTDISVRIFSLPDLNL 518
+ E + LEYEI LDI+P+G+ S AVGMWT++S+R +SLP L+L
Sbjct: 528 KRIMEKSYKSLEYEICSLDISPLGQAGMNLASQAIPSSFVAVGMWTEVSIRFYSLPSLDL 587
Query: 519 ITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGE-------LTDRKKV 571
I E LG ++I RS+L +G Y L ALGDG LL + L+ + L D++++
Sbjct: 588 IHTEKLGLDVIARSLLFVTMDGEDYFLAALGDGRLLTYRLDKSAKDTDSEKKFLYDQRQM 647
Query: 572 SLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNK--KLLYSNVNLKEVSHMCPFNSAAF 629
S+GTQP +L F ++N HVFAA DRPTVI+SS+ KLL SNVNL+EV+ +C F+S AF
Sbjct: 648 SIGTQPASLSIFETQNALHVFAACDRPTVIHSSSGGGKLLCSNVNLREVTRVCSFSSEAF 707
Query: 630 PDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLK-------- 681
PD LA+ EG L +GT+D+IQKLHIR+IPLGE PRRI H + FA+ + K
Sbjct: 708 PDCLALVTEGSLLLGTVDNIQKLHIRTIPLGEQPRRIAHLDTHHVFAVLTTKQVVTISED 767
Query: 682 -NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNV-----YYCVG 735
N++ +E +E +VRL+DD E + +Y L+ FE CS+++ +F DD+ Y+ VG
Sbjct: 768 GNEALSETTEEGYVRLIDDTMMEIVHSYKLEQFETPCSVITVNFGDDAAAKDNQDYFVVG 827
Query: 736 TAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYK 795
TAY +E EP++GR+LVF V + +L L+AE+ KGA+YS++AFNGK+LA++N ++L +
Sbjct: 828 TAYSYADEPEPSRGRMLVFAVREQRLTLVAERTFKGALYSMDAFNGKILASVNSMLKLVR 887
Query: 796 WMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARD 855
W + G R L EC +HG I L ++ GDFI++GDL++S+SLL YK G IE+ ARD
Sbjct: 888 WSETESGARTLTEECTYHGSIFILQIKCLGDFILIGDLVRSVSLLAYKPMNGTIEDVARD 947
Query: 856 YNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFR 915
+ +W++ +E+LD D Y+ AEN FNLFT+++NS+ +T+EER RLE VGEYHLGE VNR R
Sbjct: 948 IDPSWITVIEMLDLDYYISAENCFNLFTLKRNSDASTEEERSRLEKVGEYHLGELVNRIR 1007
Query: 916 HGSLVMRLPDSDVGQIPTVIF----------------------GTVNGVIGVIASLPHEQ 953
HG LV+++P+S + + ++++ GT NG +GVIAS+ +
Sbjct: 1008 HGRLVLQIPESGISILKSLLYGMYICFDDNLKELFMHKYRFNLGTANGALGVIASIDEKT 1067
Query: 954 YLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDE 1013
+ FL LQT L +VIKGVGG+ HE WR F +E++ D+KNFLDGDLIE FLDLSR +M+
Sbjct: 1068 FQFLHSLQTALNEVIKGVGGIQHEDWRRFTSERRIGDSKNFLDGDLIERFLDLSRDKMEL 1127
Query: 1014 ISKTMNVSVEELCKRVEELTRLH 1036
++K +NV VEEL K+VEELTR+H
Sbjct: 1128 VAKKVNVPVEELAKQVEELTRIH 1150
>gi|242010743|ref|XP_002426118.1| DNA damage-binding protein, putative [Pediculus humanus corporis]
gi|212510165|gb|EEB13380.1| DNA damage-binding protein, putative [Pediculus humanus corporis]
Length = 1148
Score = 1075 bits (2779), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 540/1106 (48%), Positives = 748/1106 (67%), Gaps = 83/1106 (7%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
++ +Y RI+ ++ FRP E +D LFI T RY +L+ D E+ E+ T+A G+V+DRIG
Sbjct: 54 EIGLYDRISVVKFFRPPHEHKDLLFIVTTRYNAMILECIGDGENIEIKTKAHGNVADRIG 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
+ ++ G I +IDP+ R+IGL LYDGL K+IP +A +IR+EE++V D+ FL+GC
Sbjct: 114 KQSETGIIAVIDPEARVIGLRLYDGLLKIIPLGKDNSELKASSIRMEEVEVQDLNFLHGC 173
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PTI++++QD + RHVKT+E++L+DK+FV+ PW Q+N++ A ++IPVP PLCG +II
Sbjct: 174 QNPTIILIHQD-INGRHVKTHEISLRDKEFVKMPWKQDNVEPDASIVIPVPEPLCGAIII 232
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
G+E+I+Y + + A+ I S Y +VD++GSRYLLGD AG L +L++ E++
Sbjct: 233 GQESILYHNGAGYVAVAPPVINQSTITCYTQVDSNGSRYLLGDMAGHLFMLLLETEEKID 292
Query: 236 -----KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVL 290
K GLK+ELLGE SI I+YLDN V++IGS GDSQL+KLN PD G YV ++
Sbjct: 293 GTPCVKENGLKVELLGEISIPEAITYLDNGVLFIGSRCGDSQLVKLNTSPDENGYYVTIM 352
Query: 291 ERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGM 350
E + NL PI+D VVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI E A+++L GIKGM
Sbjct: 353 EAFTNLAPILDMVVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIQEHATIDLLGIKGM 412
Query: 351 WSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLV 410
W+LR+S D +D +V++F+ +TRIL +N E E+EETEI GF S Q+ +C + N ++
Sbjct: 413 WALRASLDSVYDNTIVLAFVGQTRILTLNGE-EVEETEIPGFLSDQQSFYCGNVENNNMI 471
Query: 411 QVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTE 470
Q+T RL+S +++L +EW+ P G +++V N Q + A G L YLEI L +
Sbjct: 472 QLTPTCARLISVETKQLVSEWRPPAGKTISVVACNTVQAICA-AGSDLYYLEILKNELVQ 530
Query: 471 VKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIP 530
+ LEYE++CLDI P+ E + I AVG+WTDIS RI LPDL + +E+LGGEIIP
Sbjct: 531 KGNTTLEYEVACLDITPLSEGGKTADIIAVGLWTDISARILKLPDLEELNREYLGGEIIP 590
Query: 531 RSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTH 590
RS+L+ FE I+YLLCALGDG + F LN + G L+D+KKV+LGTQP LRTF S +TT+
Sbjct: 591 RSILMTCFENINYLLCALGDGSMFYFSLNNQNGILSDKKKVTLGTQPTVLRTFRSLSTTN 650
Query: 591 VFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQ 650
VFA SDRPTVIYSSN KL++SNVNLKEV+HMC NS A+PDSLA+A + +TIGTID+IQ
Sbjct: 651 VFACSDRPTVIYSSNHKLVFSNVNLKEVNHMCSLNSEAYPDSLALATDSTVTIGTIDEIQ 710
Query: 651 KLHIRSIPLGEHPRRICHQEQSRTFAICSLK------------NQSCA------------ 686
KLHIR++PLGE PRRI +QE ++TF + +++ QS +
Sbjct: 711 KLHIRTVPLGESPRRIAYQETTQTFGVITMRMDVHQRSGLIPVKQSASTQAQSISSSSNI 770
Query: 687 ---------------------EESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFS 725
+E E+H + ++D TFE + + EY S++S
Sbjct: 771 GGAHNLKSGPAASINSFAEFGQEVEVHNLLVIDQHTFEVLHAHQFMQSEYALSLISTKLG 830
Query: 726 DDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLA 785
DD N YY VGTA V P+E+E +GRIL+F ++GKL +AEKE KGA YSL FNGKLLA
Sbjct: 831 DDPNTYYIVGTAMVNPDESESKQGRILIFQFQEGKLYQVAEKEIKGAAYSLVEFNGKLLA 890
Query: 786 AINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHE 845
+IN ++L++W +EL+ EC H +I++LY++T+GDFI+VGDL++S++LL YK
Sbjct: 891 SINSTVRLFEWT----AEQELRLECSHFNNIISLYLKTKGDFILVGDLIRSMTLLQYKTM 946
Query: 846 EGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEY 905
EG EE ARD+N NWM+AVEI+DDD +LGAEN+FNLF +K+S ATDEER ++ VG +
Sbjct: 947 EGCFEEMARDHNPNWMTAVEIIDDDTFLGAENSFNLFVCQKDSAAATDEERQQMHAVGMF 1006
Query: 906 HLGEFVNRFRHGSLVMRLPDSDVGQIPT-----VIFGTVNGVIGVIASLPHEQYLFLEKL 960
HLG+ VN FRHGSLVM+ +VG+ T ++FGTV+G IG++ + Y FL +L
Sbjct: 1007 HLGDMVNVFRHGSLVMQ----NVGETSTPTTGCILFGTVSGAIGLVTQISANFYNFLHEL 1062
Query: 961 QTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV 1020
+ L +VIK VG + H WRSF E KT F+DGDLIESFLDLS +M E++ + +
Sbjct: 1063 ECKLTEVIKSVGKIKHSFWRSFTTEIKTEPCDGFIDGDLIESFLDLSHEKMKEVAAGLQI 1122
Query: 1021 ----------SVEELCKRVEELTRLH 1036
+V++L K VE+LTR+H
Sbjct: 1123 DNGSGMKQEATVDDLVKMVEDLTRIH 1148
>gi|91087281|ref|XP_975549.1| PREDICTED: similar to conserved hypothetical protein [Tribolium
castaneum]
gi|270010588|gb|EFA07036.1| hypothetical protein TcasGA2_TC010010 [Tribolium castaneum]
Length = 1149
Score = 1070 bits (2766), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 533/1105 (48%), Positives = 755/1105 (68%), Gaps = 80/1105 (7%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
+V +YG++A ++LFRP E +D LFI T RY +L+ DA++ ++IT+A G+V+D+IG
Sbjct: 54 EVALYGKVAVMKLFRPQHEKKDLLFIVTMRYNAMILECVNDADNLDIITKAHGNVADKIG 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
+P++ G + +IDP R+IGL LYDGLFK+IP + +A NIR++ELQV D++FL+GC
Sbjct: 114 KPSETGILAVIDPKARVIGLRLYDGLFKIIPLEKDNSELKASNIRIDELQVHDVEFLHGC 173
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
A PT+++++QD + RHVKT+E++L++K+FV+ PW Q+N++ A ++IPVP PL G +II
Sbjct: 174 ANPTLILIHQD-VNGRHVKTHEISLREKEFVKVPWRQDNVETEASMIIPVPSPLGGAIII 232
Query: 181 GEETIVYCSANAFKAIPIRPSITK-----AYGRVDADGSRYLLGDHAGLLHLLVITHEK- 234
G+E I+Y + + + P++ K Y +VD G RYLLGD AG L +L + +
Sbjct: 233 GQENILY--HDGITPVVVAPAVIKQSTIVCYAKVDPGGLRYLLGDMAGHLFMLFLEVDNR 290
Query: 235 ----EKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVL 290
+ V LK+ELLGE + I+YLDN V++IGS GDSQL+KL +P+ GSYV V+
Sbjct: 291 GDGNDVVKDLKVELLGEIATPECITYLDNGVLFIGSRLGDSQLVKLTTKPNESGSYVTVM 350
Query: 291 ERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGM 350
E + NL PI+D CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI E AS++L GIKGM
Sbjct: 351 ESFTNLAPILDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIQEHASIDLPGIKGM 410
Query: 351 WSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLV 410
W+L+ ++D +D LV++F+ +TR+L++N E E+EET+I GF S QT FC + I+ Q+V
Sbjct: 411 WALQVASDGRYDNTLVLAFVGQTRVLSLNGE-EVEETDIAGFASDQQTFFCGNVIHEQIV 469
Query: 411 QVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTE 470
Q+T S RL+S+ ++ L EWK P ++ V N Q++++TG L Y+EI L
Sbjct: 470 QITPISARLISAQNKTLLAEWKPPSDKNIAVVACNTCQIVVSTGS-ILYYIEIHQNELIM 528
Query: 471 VKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIP 530
L+ E++CLDI+P+G+ + S+ AVG+WTDI+ RI +PDL+ TKE+LGGEIIP
Sbjct: 529 KGVTTLDVEVACLDISPLGDGVTTSEYIAVGLWTDITARILRIPDLSEATKEYLGGEIIP 588
Query: 531 RSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTH 590
RSVL+ FEG SYLLCALGDG + F+ + TG L+D+K+V+LGTQP LRTF S +TT+
Sbjct: 589 RSVLMTCFEGHSYLLCALGDGSMFYFVFHKDTGTLSDKKRVTLGTQPTVLRTFRSLSTTN 648
Query: 591 VFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQ 650
VFA SDRPTVIYSSN KL++SNVN+KEV+HMC N+ A+PDSLA+A + +TIGTID+IQ
Sbjct: 649 VFACSDRPTVIYSSNHKLVFSNVNMKEVNHMCSLNAEAYPDSLALATDSSVTIGTIDEIQ 708
Query: 651 KLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQ--------------------------- 683
KLHIR++PL E P+RI +QE+S+TF + + +
Sbjct: 709 KLHIRTVPLQESPKRIAYQEESQTFGVLTARIDIQDSTGLNPARPSASTMAQSTTSSSSV 768
Query: 684 --------------------SCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCS 723
+E E+H + ++D TFE + + L EY SI+S +
Sbjct: 769 GSLSMSKSGSSSLIGGNIIPDYGQEVEVHNLLIIDQHTFEVLHAHQLMQQEYAMSIISTN 828
Query: 724 -FSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGK 782
D N YY VGTA V PEE+EP +GRIL+F D KL ++EKE KGA YSL FNGK
Sbjct: 829 RLGGDMNEYYIVGTATVNPEESEPKQGRILIFQWNDNKLTQVSEKEIKGACYSLAEFNGK 888
Query: 783 LLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIY 842
LLA+IN ++L++W + +EL+ EC H +IL L+++T+GDFI++GDLM+S++LL Y
Sbjct: 889 LLASINSTVRLFEWTVE----KELRLECSHFNNILTLFLKTKGDFILLGDLMRSMTLLQY 944
Query: 843 KHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVV 902
K EG+ EE ARDYN NWM+AVEILDDDI+LGAEN+FN+F +K+S TDEER ++ V
Sbjct: 945 KTMEGSFEEIARDYNPNWMTAVEILDDDIFLGAENSFNIFVCQKDSAATTDEERSQMHEV 1004
Query: 903 GEYHLGEFVNRFRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQ 961
G +H+G+ +N FRHGSLVM+ L ++ V+FGTV+G IG++ + + Y FL +LQ
Sbjct: 1005 GRFHVGDMINVFRHGSLVMQNLGETSTPTTGCVLFGTVSGAIGLVTQITQDFYDFLLELQ 1064
Query: 962 TNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV- 1020
L VIK VG ++H QWR+FN + KT ++ F+DGDLIESFLDLS +M E++ + +
Sbjct: 1065 NKLSTVIKSVGKIDHSQWRAFNTDIKTEPSEGFIDGDLIESFLDLSHDKMKEVADGLQIT 1124
Query: 1021 ---------SVEELCKRVEELTRLH 1036
+V++L K VE+LTR+H
Sbjct: 1125 GEGGMKQDCTVDDLVKMVEDLTRIH 1149
>gi|308808936|ref|XP_003081778.1| putative UV-damaged DNA binding factor (ISS) [Ostreococcus tauri]
gi|116060244|emb|CAL56303.1| putative UV-damaged DNA binding factor (ISS) [Ostreococcus tauri]
Length = 1282
Score = 1069 bits (2765), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 564/1007 (56%), Positives = 711/1007 (70%), Gaps = 83/1007 (8%)
Query: 1 MLDVPIYGRIATLELF--RPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDR 58
++DVPIYGR+AT+ L R G ++ LFI TERY F L +D ES EL T A GDV D
Sbjct: 276 IMDVPIYGRVATMSLCGGRERG-SKGRLFITTERYGFTALSYDEESEELKTEAFGDVRDN 334
Query: 59 IGRPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLY 118
IGRP +NGQIGI+D DCR IGL LYDGLFKVIP D KG++KEAFNIRLEEL+V DI+FL+
Sbjct: 335 IGRPAENGQIGIVDEDCRSIGLQLYDGLFKVIPCDEKGKVKEAFNIRLEELRVEDIQFLH 394
Query: 119 GCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVL 178
G AKPTI VLY+D K+A H+KTYE+ +++K+FV PW+QN+L+ G+ +IPVP P+ GV+
Sbjct: 395 GTAKPTIAVLYRDMKEAVHIKTYEIGVREKEFVSSPWAQNDLEGGSSKIIPVPAPVGGVV 454
Query: 179 IIGEETIVYCSANA------FKAI--PIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVI 230
++GEETIVY + + KAI P R SI YG +D DGSRYLLGDH G L+LLV+
Sbjct: 455 VLGEETIVYLNKTSDDTDVFLKAINIPERSSIV-CYGAIDPDGSRYLLGDHDGTLYLLVL 513
Query: 231 THEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKL-----NLQPDAKGS 285
H+ ++V LKIE LGETSI ST+SYLDN VV++GS+YGDSQLIKL N+ D S
Sbjct: 514 VHDGKRVNELKIERLGETSIPSTVSYLDNGVVFVGSAYGDSQLIKLHSEKTNVDKDGNLS 573
Query: 286 YVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQ 345
Y+++LE + NLGPIVDF VDLER GQGQVVTCSGAYKDGSLR+VRNGIGI+EQA ++L
Sbjct: 574 YIQILEEFTNLGPIVDFAFVDLERHGQGQVVTCSGAYKDGSLRVVRNGIGIDEQAVIQLP 633
Query: 346 GIKGMWSLRSSTDDPFDTFLVVSFISETRILAM--NLEDELEETEIEGFCSQTQTLFCHD 403
G+KG++SLR + D +LVV+FI+ETRIL + D L+ETEI GF ++ QTL C +
Sbjct: 634 GVKGLFSLRDDDESQVDKYLVVTFINETRILGFVGDEGDTLDETEISGFNAEAQTLCCGN 693
Query: 404 AIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI 463
+ + GG L L
Sbjct: 694 MLES-----------------------------------------------GGKLHCLSA 706
Query: 464 GDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEH 523
G L V A E EI+CLD P+GE S S + AVG+W+ + + + S+ DL++I +E
Sbjct: 707 MKGKLALVASATFEDEIACLDCTPMGEATS-SPVCAVGLWS-MEIVLASMSDLSVIKRES 764
Query: 524 LGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTF 583
G +IIPRS LLC+FE I YLL LGDG L+ +LL+ K+GEL+ RKK+SLGT+PITL+TF
Sbjct: 765 TGEDIIPRSTLLCSFESIPYLLVGLGDGQLITYLLDEKSGELSVRKKLSLGTKPITLQTF 824
Query: 584 SS--KNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGEL 641
S N VFAASDRPTVI+S+NKKL+YSNVN++EV H+CPF+SAAFPD+LA+A E +L
Sbjct: 825 KSHATNVHSVFAASDRPTVIFSNNKKLIYSNVNVQEVLHVCPFSSAAFPDALALAGEEDL 884
Query: 642 TIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAIC-----SLKNQSCAEESEMHFVRL 696
TIG IDDIQKLHIR+IPLG PRRI HQ ++ TFA+ S +Q C FVRL
Sbjct: 885 TIGGIDDIQKLHIRTIPLGGQPRRIAHQPETNTFAVVVEHLWSKSSQDC-------FVRL 937
Query: 697 LDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIV 756
+DD +FE +S + L+ E S+ SC+F+ DS YY VGT L E+EP++GRILVF V
Sbjct: 938 VDDGSFETLSQFQLEDQELTSSLTSCTFAGDSTTYYVVGTGIALETEDEPSRGRILVFKV 997
Query: 757 EDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHI 816
+D +L L++EKE +GAVY+LNAF GKLLA IN K++L+KW R+D EL SEC HHG I
Sbjct: 998 DDDQLVLVSEKEVRGAVYNLNAFKGKLLAGINSKLELFKWTPREDEVHELVSECSHHGQI 1057
Query: 817 LALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDD-IYLGA 875
+ V+TRGD+I+VGDLMKS+SLL+YK EEGAI+E ARD+NANWM+AV +LDDD YLGA
Sbjct: 1058 VTFAVKTRGDWILVGDLMKSMSLLLYKPEEGAIDEVARDFNANWMTAVAMLDDDETYLGA 1117
Query: 876 ENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVI 935
EN+ NLFTV +N TDEER RLE+ GEYHLGE VN F GSLVM L D + +PT++
Sbjct: 1118 ENSLNLFTVSRNVNAVTDEERSRLEITGEYHLGELVNAFAPGSLVMSLRDGESLSVPTLL 1177
Query: 936 FGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSF 982
FGT NGVIGV+ASLP + Y F E+LQ ++ K I+GVGGL H WRSF
Sbjct: 1178 FGTANGVIGVLASLPKDVYEFTERLQASINKHIQGVGGLKHADWRSF 1224
>gi|321478515|gb|EFX89472.1| hypothetical protein DAPPUDRAFT_303245 [Daphnia pulex]
Length = 1158
Score = 1063 bits (2749), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 553/1114 (49%), Positives = 741/1114 (66%), Gaps = 89/1114 (7%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
+V IYG+IA ++L++ + +D LFI T RY +L+ + E+ E+ITR G+V+D+IG
Sbjct: 54 EVGIYGKIAVMKLYKSGNDKKDSLFILTARYNAMILECSGEGENMEIITRTQGNVADKIG 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
+P++ G I IIDPDCR+IGL LYDGL K+IP + +A+NIR+EEL + DI FLYGC
Sbjct: 114 KPSETGIIAIIDPDCRVIGLRLYDGLLKIIPLEKDSSEIKAYNIRIEELCIQDIAFLYGC 173
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
A PT+V+++QD RHVKT E++L+DK+F + W Q+N++ A +L+PVP P G LII
Sbjct: 174 ANPTVVIIHQD-AHGRHVKTREISLRDKEFAKTSWKQDNVETEAAMLLPVPEPYGGALII 232
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKV 237
G+E+I Y + + I I+ S YG+VD +GSRYLLGD AG L +LV+ E EK+
Sbjct: 233 GQESITYHNGQNYVTIAPPIIKQSTVTCYGKVDPNGSRYLLGDLAGHLFMLVLEKE-EKM 291
Query: 238 TG------LKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLE 291
G +KIELLGE SI ++YLDN VV+IGS +GDSQL+KLN+ PD SYV V+E
Sbjct: 292 DGTVTVRDIKIELLGEVSIPECLTYLDNGVVFIGSRFGDSQLVKLNVTPDDNNSYVTVME 351
Query: 292 RYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMW 351
+ NL PIVD +VDL+RQGQGQ+VTCSGAYK+GSLRI+RNGIGI+EQAS++L GIKG+W
Sbjct: 352 TFTNLAPIVDMTIVDLDRQGQGQLVTCSGAYKEGSLRIIRNGIGIHEQASIDLPGIKGIW 411
Query: 352 SLR--SSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQL 409
+L+ SS + D +V+SF+ +TR+L +N E E+EETEI G + QT FC + + +
Sbjct: 412 ALKMGSSGNPSVDDTVVLSFVGQTRVLMLNGE-EMEETEIPGLTADQQTFFCGNVGKDSV 470
Query: 410 VQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILT 469
+Q+T+GSVRL+S +++ +EW P G +NV N QVL A G L YLE+ D
Sbjct: 471 LQITTGSVRLISVNTKQKLSEWTPPDGKMLNVVACNHGQVLCA-AGRELYYLEMEDDTQV 529
Query: 470 EVK-HAQLEYEISCLDINPIG------ENPSYSQIAAVGMWTDISVRIFSLPDLNLITKE 522
+K H L+YE++CLD+ PI S +++ AVG+WTDIS R+ LP L KE
Sbjct: 530 VLKTHVTLDYEVACLDLTPISIGSQQNTTVSKAEVCAVGLWTDISARLLKLPTLEEFHKE 589
Query: 523 HLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRT 582
LGGEIIPRS+L+ FEG YLL ALGDG L F +N T L DRKKV+LGTQP LR
Sbjct: 590 PLGGEIIPRSILMALFEGTPYLLVALGDGSLFYFSMNPVTKLLGDRKKVTLGTQPTVLRP 649
Query: 583 FSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELT 642
F S++T +VFA SDRPTVIYSSN KL++SNVNLKEV+HMCP NS A+PDSL +A + +T
Sbjct: 650 FRSQSTVNVFACSDRPTVIYSSNHKLVFSNVNLKEVAHMCPLNSEAYPDSLVLATDTAVT 709
Query: 643 IGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLK--------------------- 681
IGTID+IQKLHIR++PLGE PRRI +QE ++TF + + +
Sbjct: 710 IGTIDEIQKLHIRTVPLGESPRRIAYQENTQTFGVITSRVDIVESSGLTPARQSASTQAQ 769
Query: 682 ------------------NQSC--------AEESEMHFVRLLDDQTFEFISTYPLDTFEY 715
SC +E E++ + ++D +F + + EY
Sbjct: 770 SVSSSSSLGSAIKAPVSTRDSCGSGGGAEIGQEIEVYNLLIVDQHSFAVLHAHQFMQNEY 829
Query: 716 GCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYS 775
SI+S DD N YY VGTA V+PEE+EP +GRI++F DGKL +AEKE KGA YS
Sbjct: 830 ALSIVSTRLGDDVNPYYVVGTALVVPEESEPKQGRIVLFQWADGKLTTVAEKEVKGACYS 889
Query: 776 LNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMK 835
L FN K+LAAIN ++LY+W +EL+ EC + HI+ALY++ +GDFI+VGDLM+
Sbjct: 890 LVDFNSKILAAINNVVRLYEWT----AEKELRLECSNFNHIIALYLKRKGDFILVGDLMR 945
Query: 836 SISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEE 895
SI+LL YK EG+ EE ARD N NWMSAVEILDDD +LGAEN+FNLF +K+S T+EE
Sbjct: 946 SITLLQYKTMEGSFEEMARDSNPNWMSAVEILDDDTFLGAENSFNLFVCQKDSAATTEEE 1005
Query: 896 RGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPT---VIFGTVNGVIGVIASLPHE 952
R +L VG +HLG+ VN FRHGSLVM ++ PT V+FGTV+G IGV+ LP E
Sbjct: 1006 RQQLTEVGRFHLGDMVNVFRHGSLVMDHA-AETLTTPTQGCVLFGTVHGAIGVVTQLPSE 1064
Query: 953 QYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMD 1012
Y FL ++QT + +VIK VG + H WRSF E+K + F+DGDLIESFLDLS +M
Sbjct: 1065 FYHFLSEVQTRMARVIKPVGKIEHSFWRSFATERKVEPCEGFIDGDLIESFLDLSSDKMK 1124
Query: 1013 EISKTMNV----------SVEELCKRVEELTRLH 1036
E++ + + +VE++ K +E+LTR+H
Sbjct: 1125 EVATGLQMDDGSGMKREATVEDIIKLIEDLTRIH 1158
>gi|443707495|gb|ELU03057.1| hypothetical protein CAPTEDRAFT_148808 [Capitella teleta]
Length = 1084
Score = 1048 bits (2711), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 540/1092 (49%), Positives = 723/1092 (66%), Gaps = 119/1092 (10%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
++ IYGRIA LELFRP GE++D LF+ T RY +L+ D + E+ITRA G+V DRIG
Sbjct: 54 EIGIYGRIAVLELFRPPGESKDLLFLLTARYNAMILECSQDGDHIEIITRAHGNVQDRIG 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RP++ G IGI+DP CR+IGL LYDGLFKVIP D + +AFNIRLEEL V+D++FL+GC
Sbjct: 114 RPSETGIIGIVDPLCRIIGLRLYDGLFKVIPLDRDMRELKAFNIRLEELNVIDVQFLHGC 173
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
+ PTI++ +K D + V PP II
Sbjct: 174 STPTIILSITYHK------------------------------GDNYLAVAPP-----II 198
Query: 181 GEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE----- 235
+ TI C G+VDA+G RY+LGD AG L +L++ E+
Sbjct: 199 KQSTITCC------------------GKVDANGCRYMLGDMAGRLFMLLLEKEERIDGSM 240
Query: 236 KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVN 295
V +K+ELLGE++IA I+YLDN VVY+GS GDSQL+KLN++ D GSYV+V+E + N
Sbjct: 241 AVKDIKVELLGESAIAECITYLDNGVVYVGSRLGDSQLVKLNVEADDSGSYVQVMETFTN 300
Query: 296 LGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRS 355
LGP+VD CVVDLERQGQGQ+VTCSG YK+GSLRI+RNGIGI+E A+++L GIKGMW LR
Sbjct: 301 LGPVVDMCVVDLERQGQGQLVTCSGGYKEGSLRIIRNGIGIHEHATIDLPGIKGMWPLR- 359
Query: 356 STDDPF-DTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTS 414
D P D +V+SF+ +TR+L ++ E E+EETE+ GF + QT FC + ++QL+Q+T+
Sbjct: 360 -VDSPTQDDMIVLSFVGQTRVLLLSGE-EVEETELGGFEADQQTFFCGNVAHSQLLQITA 417
Query: 415 GSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHA 474
SVRLV +++L +EW+ P G +++VA N QV+ A + L+I G L
Sbjct: 418 ASVRLVDPRTKKLLSEWRPPSGKNLSVAGCNHCQVVCAVSR-EVFCLDIQQGALILQGST 476
Query: 475 QLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVL 534
+E+E++C+DI P+ + SQ+ AVG+WTDIS RI LP L + +E LGGEIIPRS+L
Sbjct: 477 TMEHEVACVDITPLNDESGPSQLCAVGLWTDISARILQLPSLEQLHQEMLGGEIIPRSIL 536
Query: 535 LCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAA 594
L FEG++Y+LCALGDG L F L TG L+DRKKV+LGTQP LRTF S TT+VFA
Sbjct: 537 LATFEGVNYVLCALGDGALFYFHLQPSTGVLSDRKKVTLGTQPTVLRTFKSPGTTNVFAC 596
Query: 595 SDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHI 654
SDRPTV+YSSN KL++SNVNLKEV++MCP NS +PDSLA+A + LTIGTID+IQKLHI
Sbjct: 597 SDRPTVVYSSNHKLVFSNVNLKEVNYMCPLNSEGYPDSLALANDCTLTIGTIDEIQKLHI 656
Query: 655 RSIPLGEHPRRICHQEQSRTFAICSLKN-------------------------------- 682
R++PLGE PRRI +QE S+TF + +L++
Sbjct: 657 RNVPLGETPRRIAYQEASQTFGVITLRSDLQDSNGSTPARPSASTQALSTSSSSNVKVMA 716
Query: 683 -------QSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVG 735
+ +E E+H + +LD TFE + ++ L +E+ +++S F +D YY VG
Sbjct: 717 ASNANTEHTFGDEVEVHSLLVLDQHTFEVLHSHQLMQYEFATALMSGRFGEDPTTYYVVG 776
Query: 736 TAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYK 795
TA V PEE EP +GRI+VF DGKL +AEKE KGA Y+L FNGKLLA+IN ++L++
Sbjct: 777 TAMVYPEEAEPKQGRIIVFRFHDGKLTQVAEKEIKGAAYTLTEFNGKLLASINSTVRLFE 836
Query: 796 WMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARD 855
W +EL+ EC + +I+ALY++T+GDFI+VGDLM+S++LL YK EG EE ARD
Sbjct: 837 WT----AEKELRVECSYFNNIIALYLKTKGDFILVGDLMRSVTLLSYKPMEGCFEEIARD 892
Query: 856 YNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFR 915
YN NWM+++++LDDD +LGAEN+FN+FT +K+S TDEER L+ VG YHLGEFVN FR
Sbjct: 893 YNPNWMTSIDVLDDDTFLGAENSFNIFTCQKDSAATTDEERQHLQEVGLYHLGEFVNVFR 952
Query: 916 HGSLVMRLPDSDVGQIP-TVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGL 974
HGSLVM+ P +V+FGTVNG +G++ LP E YLFL ++Q L K IK VG +
Sbjct: 953 HGSLVMQHPGECTSPTQGSVLFGTVNGALGLVTQLPQEFYLFLLEVQNKLAKTIKSVGKV 1012
Query: 975 NHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV----------SVEE 1024
H WRSF+ E+KT A F+DGDLIESFLDLSR +M E+ + + + +V++
Sbjct: 1013 EHAFWRSFHTERKTEPATGFIDGDLIESFLDLSRDKMQEVVQGLQMDDGSGMKREAAVDD 1072
Query: 1025 LCKRVEELTRLH 1036
L K +EELTR+H
Sbjct: 1073 LVKMIEELTRIH 1084
>gi|345498295|ref|XP_001607743.2| PREDICTED: DNA damage-binding protein 1-like [Nasonia vitripennis]
Length = 1140
Score = 1044 bits (2700), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 530/1093 (48%), Positives = 737/1093 (67%), Gaps = 68/1093 (6%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
+V IYG+IA ++ FRP E +D LF+ T RY +L+ + E E+IT+A G+V+DRIG
Sbjct: 57 EVGIYGKIAVVKFFRPQHEKKDLLFLLTTRYNAMILECVGEGEDIEIITKAHGNVADRIG 116
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
+ ++ G +IDP R+IGL LYDGLFK+IP D +A +IR++E V D+ FL+GC
Sbjct: 117 KASETGIKAVIDPKARVIGLRLYDGLFKIIPLDKDNPELKASSIRMDEQNVQDVNFLHGC 176
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PT+++++QD + RHVKT+E++L+DK+FV+ PW Q+N++ A ++IPVP P+CG +II
Sbjct: 177 TNPTLILIHQD-INGRHVKTHEISLRDKEFVKIPWRQDNVEREAMMVIPVPSPICGAIII 235
Query: 181 GEETIVYCSANAF-KAIP--IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEK- 236
G+E+I+Y + +P I+ S Y +VD G RYLLGD AG L +L + +K+
Sbjct: 236 GQESILYHDGTTYVTVVPPIIKQSTISCYAKVDNQGLRYLLGDLAGHLFMLFLEQDKKAD 295
Query: 237 ----VTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
+ LK+ELLGE SI I+YLDN V++IGS GDSQLIKLN +PD GSY +E
Sbjct: 296 GSMVIKDLKVELLGEVSIPECITYLDNGVIFIGSRLGDSQLIKLNTKPDENGSYCSTMET 355
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
+ NL PIVD VVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI E AS++L GIKGMW+
Sbjct: 356 FTNLAPIVDMAVVDLERQGQGQIVTCSGAFKEGSLRIIRNGIGIQEHASIDLPGIKGMWA 415
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
L+ + + FD LV+SF+ +TRIL +N E E+EETEI GF + QT + + ++Q+
Sbjct: 416 LKVDSVN-FDNTLVLSFVGQTRILMLNGE-EVEETEIPGFVADEQTFHTGNVTNDVIIQI 473
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
T S RL+S+ S + +EW+ +++V N +QVL ATG L YLEI D +
Sbjct: 474 TPTSARLISNKSSSVISEWEPDNKRTISVVACNGTQVLCATGND-LFYLEINDNQIVSKG 532
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
+ L++E++C+DI+P+ + S ++I AVG+WTDISVRI +LP L I KE LGGEIIPRS
Sbjct: 533 YTTLQHEVACVDISPL-DGSSEAKIVAVGLWTDISVRILTLPKLEEINKELLGGEIIPRS 591
Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
+L+ FEG +YLLCALGDG + F LN + G L D+KKV+LGTQP LRTF S +TT+VF
Sbjct: 592 ILMTCFEGNTYLLCALGDGSMYYFTLNKQNGMLADKKKVTLGTQPTVLRTFRSLSTTNVF 651
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
A SDRPTVIYSSN KL++SNVNLKEV+HMC N+ ++ DSLA+A + +TIGTID+IQKL
Sbjct: 652 ACSDRPTVIYSSNHKLVFSNVNLKEVNHMCSLNAESYSDSLALATDNTVTIGTIDEIQKL 711
Query: 653 HIRSIPLGEHPRRICHQEQSRTFAICSLK-----------------NQSCA--------- 686
HIR++PL E PRRI +QE ++TF + +++ Q+ +
Sbjct: 712 HIRTVPLYESPRRIAYQESTQTFGVITMRVDIQESNGVNIARPSASTQAASISNSNHIPT 771
Query: 687 ------------EESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCV 734
+E E+H + ++D TFE + + L EY S++S +D YY V
Sbjct: 772 HNKPSNTASEIGQEVEIHNLLIVDQHTFEVLHAHTLVPTEYAMSLISTKLGEDPTPYYIV 831
Query: 735 GTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLY 794
GTA + P+E+EP GRIL++ DGKL +AEKE KG+ YSL FNGKLLA+IN ++L+
Sbjct: 832 GTAMINPDESEPKSGRILLYHWNDGKLTQVAEKEIKGSCYSLVEFNGKLLASINSTVRLF 891
Query: 795 KWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERAR 854
+W +EL+ EC H +I+ALY++T+GDF++VGDLM+S++LL YK EG+ EE AR
Sbjct: 892 EWT----AEKELRLECSHFNNIIALYLKTKGDFVLVGDLMRSVTLLQYKTMEGSFEEIAR 947
Query: 855 DYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRF 914
DYN NWM+++EILDDD +LGAEN FNLF +K+S ++EER +++ VG++HLG+ VN F
Sbjct: 948 DYNPNWMTSIEILDDDTFLGAENCFNLFVCQKDSAATSEEERQQMQEVGQFHLGDMVNVF 1007
Query: 915 RHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGG 973
RHGSLVM+ L +S V+FGTV G IG++ +P Y FL L+ L VIK VG
Sbjct: 1008 RHGSLVMQHLGESSTPTHGCVLFGTVCGAIGLVTQIPSTFYEFLRNLEDRLTSVIKSVGK 1067
Query: 974 LNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV----------SVE 1023
+ H WRSFN + K + F+DGDLIESFLDLS +M E++ + + +V+
Sbjct: 1068 IEHNFWRSFNTDLKIEQCEGFIDGDLIESFLDLSHEKMAEVAMGIVIDDGSGMKKEATVD 1127
Query: 1024 ELCKRVEELTRLH 1036
+L K VE+LTR+H
Sbjct: 1128 DLVKIVEDLTRIH 1140
>gi|340714589|ref|XP_003395809.1| PREDICTED: DNA damage-binding protein 1-like [Bombus terrestris]
Length = 1141
Score = 1043 bits (2698), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 533/1094 (48%), Positives = 736/1094 (67%), Gaps = 69/1094 (6%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
+V IYG+IA ++ FRP E +D LF+ T RY +L+ + E+ E+IT+A G+V+DRIG
Sbjct: 57 EVGIYGKIAVVKFFRPPHEKKDLLFLLTTRYNAMILECIGEGENIEIITKAHGNVADRIG 116
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
+ ++ G +IDP R+IGL LYDGLFK+IP D +A +IR+EE QV D+ FL+GC
Sbjct: 117 KASETGIKAVIDPKARVIGLRLYDGLFKIIPLDKDNPELKASSIRMEEHQVQDVNFLHGC 176
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
A PT+++++QD + RHVKT+E++L+DK+FV+ PW Q+N++ A ++IPVP P+CG +II
Sbjct: 177 ANPTLILIHQD-INGRHVKTHEISLRDKEFVKVPWRQDNVEREAMIVIPVPSPICGAIII 235
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEK- 236
G+E+I+Y N + A+ I+ S Y +VD G RYLLGD AG L +L + EK+
Sbjct: 236 GQESILYHDGNTYVAVVPPIIKQSTITCYAKVDNQGLRYLLGDMAGHLFMLFVEQEKKPD 295
Query: 237 ----VTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
V LK+ELLGE SI I+YLDN V+++GS +GDSQL+KL + D GSY +E
Sbjct: 296 GTQVVKDLKVELLGEISIPECITYLDNGVIFVGSRFGDSQLVKLITKADENGSYCVPMET 355
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
+ NL PI+D VVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI E AS++L GIKGMW+
Sbjct: 356 FTNLAPIIDMAVVDLERQGQGQMVTCSGAFKEGSLRIIRNGIGIEEHASIDLPGIKGMWA 415
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
L+ + FD LV+SF+ +TRIL +N E E+EET+I GF + QT + + +Q+
Sbjct: 416 LKVGGGN-FDNTLVLSFVGQTRILTLNGE-EVEETDIPGFVADEQTFHTGNVTNDLFIQI 473
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
T S RL+S ++ + +EW+ +++V N +QVL ATG L Y+EI G +
Sbjct: 474 TPTSARLISHETKTVVSEWEPENKRTISVVACNGTQVLCATGN-DLFYMEISYGQIVPKG 532
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
A L+YE++CLDI+P+ + + ++IAAVG+WT ISVRI +LP L I KE LGGEIIPRS
Sbjct: 533 FATLQYEVACLDISPL-DGHTEAKIAAVGLWTHISVRILTLPALEEINKELLGGEIIPRS 591
Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
+L+ FEG +YLLCALGDG + F L+ + G L+D+KKV+LGTQP LRTF S TT+VF
Sbjct: 592 ILMTCFEGNTYLLCALGDGSMYYFTLHKQNGILSDKKKVTLGTQPTVLRTFRSLFTTNVF 651
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
A SDRPTVIYSSN KL++SNVNLKEV+HMC N+ ++PDSLA+A + +TIGTID+IQKL
Sbjct: 652 ACSDRPTVIYSSNHKLVFSNVNLKEVNHMCSLNAESYPDSLALATDSTVTIGTIDEIQKL 711
Query: 653 HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
HIR++PLGE PRRI +QE S+TF + +++
Sbjct: 712 HIRTVPLGESPRRIAYQESSQTFGVITMRVDIQDSSGVSIVRHSASTQAASTSSSSHIAS 771
Query: 682 --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
+E E+H + ++D TFE + + L EY S++S +D YY
Sbjct: 772 YNKPTGHTASDICQEIEVHNLLIIDQHTFEVLHAHMLMPTEYALSLISTKLGEDPTSYYI 831
Query: 734 VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
VGTA V P+E EP GRIL++ DGKL +AEKE KG+ YSL FNGKLLA+IN ++L
Sbjct: 832 VGTALVHPDETEPKMGRILLYHWSDGKLTQVAEKEIKGSCYSLTEFNGKLLASINSTVRL 891
Query: 794 YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
++W +EL+ EC H +I+ALY++T+GDFI+VGDLM+S++LL YK EG EE A
Sbjct: 892 FEWT----AEKELRLECSHFNNIIALYLKTKGDFILVGDLMRSLTLLQYKTMEGCFEEIA 947
Query: 854 RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
RDYN NWM+A+EILDDD +LGAEN FNLF +K+S +++ER +++ VG++HLG+ VN
Sbjct: 948 RDYNPNWMTAIEILDDDTFLGAENCFNLFVCQKDSAATSEDERQQMQEVGQFHLGDMVNV 1007
Query: 914 FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
FRHGSLVM+ L +S V+FGTV+G IG++ +P Y FL L+ L VIK VG
Sbjct: 1008 FRHGSLVMQNLGESSTPTQGCVLFGTVSGAIGLVTQIPFTFYEFLRNLEERLTGVIKSVG 1067
Query: 973 GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV----------SV 1022
+ H WRSFN E K + F+DGDLIESFLDLS +M +++ + + +V
Sbjct: 1068 KIEHNFWRSFNTELKIEQCEGFIDGDLIESFLDLSPNKMADVASGLMIDDPSGMKKEATV 1127
Query: 1023 EELCKRVEELTRLH 1036
++L K VE+LTR+H
Sbjct: 1128 DDLVKIVEDLTRIH 1141
>gi|159489018|ref|XP_001702494.1| UV-damaged DNA binding complex subunit 1 protein [Chlamydomonas
reinhardtii]
gi|158280516|gb|EDP06273.1| UV-damaged DNA binding complex subunit 1 protein [Chlamydomonas
reinhardtii]
Length = 1147
Score = 1043 bits (2698), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 530/1081 (49%), Positives = 728/1081 (67%), Gaps = 79/1081 (7%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+LDVPIYG +++L+ FRP D LFI TE+YKFCVLQ+D+ +L+TR+ GDV+DR G
Sbjct: 71 VLDVPIYGNVSSLQAFRPRDLQTDLLFILTEKYKFCVLQYDSAKGQLVTRSNGDVADRTG 130
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNK-GQLKEAFNIRLEELQVLDIKFLYG 119
RPTDNGQ+G +DP CR+IGLHLYDG+ KVIP D + G L EA+N+RLEEL VLD+ +L
Sbjct: 131 RPTDNGQLGFVDPACRVIGLHLYDGMLKVIPMDERTGALSEAYNVRLEELCVLDMAWLAP 190
Query: 120 CAK-----------------PTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDN 162
+K P + VL+QD K ARHVKTYE AL K+ VEGPW Q ++D
Sbjct: 191 TSKGSAAGPSGSSAAAASGRPLLCVLHQDPKGARHVKTYEAALAAKELVEGPWQQQHVDA 250
Query: 163 GADLLIPVPPPLCGVLIIGEETIVYCSANAFK-------AIPIRPSITKAYGRVDADGSR 215
GA +LIPVP PL GV+++GE + YC + P+R +I A+ VD DGSR
Sbjct: 251 GAGILIPVPAPLGGVVVVGENVLSYCGGPGAGPGGGAPVSAPLRQTIITAWCPVDPDGSR 310
Query: 216 YLLGDHAGLLHLLVITHEKE-KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLI 274
YLLGD G LHLLV+ H+ +V+GL++E LG TS S +SYLD+ + ++GS GD QL+
Sbjct: 311 YLLGDRLGGLHLLVLAHDGAGRVSGLRVEPLGHTSTPSCLSYLDSGLTFVGSRSGDCQLV 370
Query: 275 KLNLQP-DAKGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNG 333
+++ P + SYVE+++ + NLGPI+DF V+DLERQGQGQ+VTCSG DGSLR++RNG
Sbjct: 371 RISPTPVNQPPSYVELVDSFPNLGPILDFVVMDLERQGQGQLVTCSGVDGDGSLRVIRNG 430
Query: 334 IGINEQASVELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFC 393
IGIN QA+VEL GIKG+WSLR+ D D +LV++F+ ETR+LA+N E+EL+E EI GF
Sbjct: 431 IGINRQATVELAGIKGVWSLRAHYGDEHDAYLVLTFVGETRLLALNAEEELDEAEIPGFD 490
Query: 394 SQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLAT 453
+ QTL+C + + L+QVTS SVRLV S+S L +EW+ P G+S+NVA + +QV++AT
Sbjct: 491 AAAQTLWCGNTACDHLLQVTSSSVRLVDSSSLALVSEWRPPAGFSINVAAGSPTQVVVAT 550
Query: 454 GGGHLVYLEI----GDGILTEVKHAQLEYEISCLDINPI------GENPSYSQIAAVGMW 503
GGGHLVYLE+ G + EV + L+ E++C+D++P+ G + S + AVG W
Sbjct: 551 GGGHLVYLEVQAQAGGAAVVEVANVTLDSEVACVDVSPLLLTAGPGGSAERSGLVAVGRW 610
Query: 504 TDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTG 563
D ++++ ++P L ++ LGGE+IPRSVL EG+ Y + LGDG L + L TG
Sbjct: 611 -DQTLQLLAVPGLTPLSVTPLGGEVIPRSVLCVGLEGVPYCMVGLGDGALHTWRLEPATG 669
Query: 564 ELTDRKKVSLGTQPITLRTFSS-------------------KNTTHVFAASDRPTVIYSS 604
LTDRK+V LGT+PITLRTF + VFAASDRPTV+YSS
Sbjct: 670 ALTDRKRVVLGTKPITLRTFRTAAAAAAAAGGDNGGARGGGGGGVSVFAASDRPTVVYSS 729
Query: 605 NKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPR 664
N+KL+YSN+N +V+H+ F++AAFP SLA+A E LTIGT D +QKLH+R++ LGE PR
Sbjct: 730 NRKLMYSNLNENDVAHLSSFHAAAFPHSLAVASEAALTIGTADQLQKLHVRTVVLGEQPR 789
Query: 665 RICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSF 724
RI H E R + + ++ E ++ RLLDD TF+ +++Y L E C + +
Sbjct: 790 RIAHHEAGRLLGVLTQRDPDGEERGQL---RLLDDTTFDVVASYGLAPGEMPCGGSASAA 846
Query: 725 SDDSNVYYCVGTAYVLPEENEPTKGRILVF----------IVEDGKLQLIAEKETKGAVY 774
S + VGT + P+E EP++GRILV G ++L+ EKE KGA Y
Sbjct: 847 SAAGPATFIVGTVLLRPDEPEPSRGRILVLEYLPGGAGGAAAGGGAVRLVTEKEVKGAAY 906
Query: 775 SLNAFNG-KLLAAINQKIQLYKWMLRDDGTR-------ELQSECGHHGHILALYVQTRGD 826
++ F G K+LA++N K+ +Y+W++R+ EL SEC H G++LALY+ RG
Sbjct: 907 NVRPFAGDKILASVNNKVTVYRWVVREGSGGPGGCGAYELASECHHLGNVLALYLAARGG 966
Query: 827 FIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRK 886
+VVGDLM+S+SLL Y E+G +E RA DYN+ W +AVEILDDD Y+ A+N+ NL+ VR+
Sbjct: 967 LVVVGDLMRSVSLLSYNAEQGVLEHRAADYNSGWTTAVEILDDDNYIAADNHCNLYVVRR 1026
Query: 887 NSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSD-VGQIPTVIFGTVNGVIGV 945
N++ ATDEER RL+VVGE+H G F+N+ R+GSLVMRLPDS+ G P ++F +G +GV
Sbjct: 1027 NADSATDEERARLQVVGEFHTGTFINQMRNGSLVMRLPDSEHAGLPPPLLFAGTDGRLGV 1086
Query: 946 IASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLD 1005
+A LP Y + KLQT +R V++GVGGL+HEQWR+F N+++ +A+ F+DGDLIES LD
Sbjct: 1087 VARLPPALYEWATKLQTAMRSVVRGVGGLDHEQWRAFANDRRCGEARGFVDGDLIESLLD 1146
Query: 1006 L 1006
L
Sbjct: 1147 L 1147
>gi|302837243|ref|XP_002950181.1| UV-damaged DNA binding complex subunit 1 protein [Volvox carteri f.
nagariensis]
gi|300264654|gb|EFJ48849.1| UV-damaged DNA binding complex subunit 1 protein [Volvox carteri f.
nagariensis]
Length = 1104
Score = 1043 bits (2698), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 535/1064 (50%), Positives = 739/1064 (69%), Gaps = 67/1064 (6%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+LDVPIYG I+ L++FRP D LFI TE+YKFCVLQ+DA +L+TRA GDV+DRIG
Sbjct: 50 VLDVPIYGNISALQVFRPRDLHADLLFILTEKYKFCVLQYDAAKGQLLTRANGDVADRIG 109
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNK-GQLKEAFNIRLEELQVLDIKFLYG 119
RP +NGQ+G++DP CRLIGLHLYDG+ KVIP D++ GQL EAF++RLEEL VLD+ +L+
Sbjct: 110 RPAENGQLGVVDPACRLIGLHLYDGMLKVIPMDDRSGQLSEAFSVRLEELSVLDMAWLHP 169
Query: 120 CAKPT-----------IVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLI 168
+ VL+QD K ARHVKTYE+ L D ++GPW Q ++D+GA LLI
Sbjct: 170 QPPGGGGGGGSPGRPLLCVLHQDPKGARHVKTYEMQLGANDLLDGPWQQQHVDSGAALLI 229
Query: 169 PVPPPLCGVLIIGEETIVYCSANAFKA---IPIRPSITKAYGRVDADGSRYLLGDHAGLL 225
PVP PL GV+++GE + Y +A P+R +I A+ VD DGSR+LLG+ G +
Sbjct: 230 PVPSPLGGVVVVGENVVSYLGGPGGQAPVSAPLRQTIVTAWCPVDPDGSRFLLGNRQGGM 289
Query: 226 HLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQP-DAKG 284
LLV+ H+ +V+GL+ E LG T S ++YLD+ + ++GS GDSQL++++ QP +
Sbjct: 290 QLLVLAHDGSRVSGLRTEPLGYTCAPSCLAYLDSGLTFVGSRSGDSQLVRISAQPVNQPP 349
Query: 285 SYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVEL 344
+Y+E+++ + +L PIVDF V+DLERQGQGQ+V CSG DGSLR+VRNGIGIN QA+VEL
Sbjct: 350 TYLELVDSFPSLAPIVDFVVMDLERQGQGQLVMCSGIDSDGSLRVVRNGIGINRQATVEL 409
Query: 345 QGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDA 404
GIKG+WSLRS DD +D +L+++F+ ETR+LA+N E+EL+E E+ GF S +QTL+C +
Sbjct: 410 PGIKGVWSLRSHYDDEYDKYLLLTFVGETRLLALNTEEELDEAELPGFDSGSQTLWCGNM 469
Query: 405 IYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI- 463
+ L+Q + SVRLV + S +L +EW+ PG++++VA + +QV++ATGGGHLVYLE+
Sbjct: 470 ATDHLLQASGSSVRLVDTASLQLVSEWRPAPGFAIHVAAGSPTQVVVATGGGHLVYLEVV 529
Query: 464 --GDGI--LTEVKHAQLEYEISCLDINPI--------GENP-SYSQIAAVGMWTDISVRI 510
+G+ + E+ + L+ E++C+D++P+ GE P S + AVG W D ++++
Sbjct: 530 RRPEGVVEVVEISNVVLDSEVACVDVSPLMLQQQPLAGEWPGGRSSVVAVGRW-DQTMQL 588
Query: 511 FSLPDLNLITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKK 570
S+P L ++ LGGE+IPRSVL A EG+ Y + LGDG L + L+ TG L+D+K+
Sbjct: 589 LSVPSLAPLSSTPLGGEVIPRSVLCTALEGVPYCMVGLGDGALHTWRLDPATGGLSDKKR 648
Query: 571 VSLGTQPITLRTFSSKNT------------------THVFAASDRPTVIYSSNKKLLYSN 612
+ LGT+PI LRTF + VFAASDRPTV+YSSNKKLLYSN
Sbjct: 649 LVLGTKPIMLRTFRTAAAVSGGGGGSLRGSGGGHGGVSVFAASDRPTVVYSSNKKLLYSN 708
Query: 613 VNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQS 672
+N +V+ + F+SAAFP SLA+A EG LTIGT D+IQKLH+R++PLGE+PRRI H E +
Sbjct: 709 LNENDVAFLASFHSAAFPRSLAVASEGALTIGTADEIQKLHVRAVPLGENPRRIAHHEGA 768
Query: 673 RTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDS---- 728
R + +++ S + SE F+RLLDD TF+ +++Y L E CS+ + S +
Sbjct: 769 RMLGVLTMRLDS--DGSERSFLRLLDDTTFDVVASYALAPGEMPCSLAAWPGSSNGTAAV 826
Query: 729 ---NVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAF-NGKLL 784
N + VGTA+++PEE EPTKGRILV ++L+ EKE KGA Y++ F K+L
Sbjct: 827 GALNACFLVGTAFIVPEEPEPTKGRILVL----EHVRLVTEKEVKGAAYNVLPFVKDKIL 882
Query: 785 AAINQKIQLYKWMLRDDGTR-ELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYK 843
A++N K+ L G R EL SEC + G+ILALY+ TRG+ +VVGDLM+S+SLL Y
Sbjct: 883 ASVNSKVPASGCDL--GGVRVELASECSYLGNILALYLATRGNLVVVGDLMRSVSLLSYN 940
Query: 844 HEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVG 903
E+G +E RA DYN+ W ++VE LDDD YL +N+ NL +R+N++ ATDEER RL+VVG
Sbjct: 941 VEQGVLEHRAADYNSGWTTSVEALDDDTYLEGDNHLNLVVLRRNADSATDEERARLQVVG 1000
Query: 904 EYHLGEFVNRFRHGSLVMRLPDSDVGQIPT-VIFGTVNGVIGVIASLPHEQYLFLEKLQT 962
EYH G FVNRFRHGSLVMR PDS+ +P ++FG +G +GVIA LP Y L KLQ+
Sbjct: 1001 EYHTGTFVNRFRHGSLVMRPPDSEFVSLPVPLLFGGTDGRLGVIARLPPGLYEMLTKLQS 1060
Query: 963 NLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDL 1006
LR+V++GVGGL+HE W +F+NE++T DAK F+DGDLIE+FLDL
Sbjct: 1061 ALRQVVRGVGGLSHEAWIAFSNERRTADAKGFVDGDLIETFLDL 1104
>gi|58383228|ref|XP_312466.2| AGAP002472-PA [Anopheles gambiae str. PEST]
gi|55242305|gb|EAA08181.2| AGAP002472-PA [Anopheles gambiae str. PEST]
Length = 1138
Score = 1043 bits (2696), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 519/1092 (47%), Positives = 729/1092 (66%), Gaps = 65/1092 (5%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESS--ELITRAMGDVSDRIG 60
++ I G+IA ++LFRP E +D +FI T RY +LQ + E+IT+A G+V+DR+G
Sbjct: 54 EIGINGKIAVMKLFRPAEETKDLIFILTHRYNAMILQCAVQGDDIEIITKAHGNVADRVG 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
+P + G + +IDP R+IG+ LY+GLFK+IP D +A ++R+EE+ V D++FLYG
Sbjct: 114 KPAETGILAVIDPKARVIGMRLYEGLFKIIPLDKDTNELKATSLRMEEMHVQDVEFLYGT 173
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PT++V++QD + RH+KT+E++LKDK+F + W Q+N++ A +LI VP PL G ++I
Sbjct: 174 THPTLIVIHQD-INGRHIKTHEISLKDKEFTKIAWKQDNVETEATMLIAVPMPLGGAIVI 232
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVI-THEKEK 236
G+E+IVY +++ A+ I+ S Y R+D+ G RYLLG+ AG L ++ + T E K
Sbjct: 233 GQESIVYHDGDSYVAVAPAIIKQSTINCYARIDSKGLRYLLGNMAGNLFMMFLETEENAK 292
Query: 237 ----VTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
V +K+ELLGE +I I+YLDN V++IGS +GDSQL+KLN G+YV ++E
Sbjct: 293 GQTTVRDIKVELLGEITIPECITYLDNGVLFIGSRHGDSQLVKLNTTAGDNGAYVMLMET 352
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
+ NL PIVD CVVDLERQGQGQ++TCSG++K+GSLRI+RNGIGI E A ++L GIKGMW+
Sbjct: 353 FTNLAPIVDMCVVDLERQGQGQMITCSGSFKEGSLRIIRNGIGIQEHACIDLPGIKGMWA 412
Query: 353 LRSSTDD-PFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQ 411
LR DD P+D L++SF+ TR+L ++ DE+EETEI G QT +C + + Q++Q
Sbjct: 413 LRVGIDDSPYDNTLILSFVGHTRVLMLS-GDEVEETEIAGILGDQQTFYCANVSHGQILQ 471
Query: 412 VTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEV 471
VT S RL+S ++ + EWK P + V AN +Q++ A+ + Y+EIGDG L
Sbjct: 472 VTPSSARLISCDNKAMICEWKPPDNKRIGVVGANTTQIVCASAQ-DVYYVEIGDGKLEYK 530
Query: 472 KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPR 531
L YE++CLD++P+ +N + S+ AVG+WTDIS + +P L L+ KE LGGEIIPR
Sbjct: 531 GQTTLSYEVACLDVSPLEDNATRSEFVAVGLWTDISACVLKVPTLELLHKEKLGGEIIPR 590
Query: 532 SVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHV 591
S+L+ AFEGI+YLLCALGDG + F+LN TG LT++KKV+LGTQP L+TF S +TT+V
Sbjct: 591 SILMAAFEGINYLLCALGDGSMYYFVLNKATGCLTEQKKVTLGTQPTILKTFRSLSTTNV 650
Query: 592 FAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQK 651
FA SDRPTVIYSSN KL++SNVNLKEV+HMC N+ A+ DSLA+A + GTID+IQK
Sbjct: 651 FACSDRPTVIYSSNHKLVFSNVNLKEVNHMCSLNAEAYQDSLALATRNSVIFGTIDEIQK 710
Query: 652 LHIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------ 681
LHIR++PLGE PRRI +QE S+TF + + +
Sbjct: 711 LHIRTVPLGESPRRIAYQEASQTFGVITFRMDVQDSSGLTPARQSASTQTNNITQSSGMG 770
Query: 682 -------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCV 734
N +E E+H + ++D TFE + + EY S++S +D N Y+ V
Sbjct: 771 LLKPGASNTEFGQEVEVHNLLIIDQNTFEVLHAHQFMQTEYALSLMSAKLGNDPNTYFIV 830
Query: 735 GTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLY 794
GT V PEE EP GRI+++ D +L+++++KE KGA YSL FNG++LA IN ++LY
Sbjct: 831 GTGLVNPEEPEPKTGRIIIYRYADNELKMVSDKEVKGACYSLVEFNGRVLACINSTVRLY 890
Query: 795 KWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERAR 854
+W DD ++L+ EC H ++LALY +T+GDFI+VGDLM+SI+LL YK EG+ EE AR
Sbjct: 891 EWT--DD--KDLRLECSHFNNVLALYCKTKGDFILVGDLMRSITLLQYKQMEGSFEEIAR 946
Query: 855 DYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRF 914
DY NWM+AVEILDDD +LGA+N+ NLF K+S TDEER ++ V ++HLG+ VN F
Sbjct: 947 DYQPNWMTAVEILDDDAFLGADNSNNLFVCLKDSAATTDEERQQMPEVAQFHLGDMVNVF 1006
Query: 915 RHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGG 973
RHGSLVM+ + + V+FGTV+G IG++ + + Y FL KLQ NL IK VG
Sbjct: 1007 RHGSLVMQNISERSTPTTGCVLFGTVSGAIGLVTQIQSDFYEFLRKLQENLTNTIKSVGK 1066
Query: 974 LNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVE---------E 1024
++H WRSF+ E K + F+DGDL+ESFLDLSR +M E S + + VE +
Sbjct: 1067 IDHSYWRSFHTETKMERCEGFIDGDLVESFLDLSREKMREASLGLEIDVEGTKREATVDD 1126
Query: 1025 LCKRVEELTRLH 1036
+ K VE+LTR+H
Sbjct: 1127 IIKIVEDLTRIH 1138
>gi|350410909|ref|XP_003489174.1| PREDICTED: DNA damage-binding protein 1-like [Bombus impatiens]
Length = 1141
Score = 1042 bits (2694), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 534/1094 (48%), Positives = 735/1094 (67%), Gaps = 69/1094 (6%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
+V IYG+IA ++ FRP E +D LF+ T RY +L+ + E+ E+IT+A G+V+DRIG
Sbjct: 57 EVGIYGKIAVVKFFRPPHEKKDLLFLLTTRYNAMILECIGEGENIEIITKAHGNVADRIG 116
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
+ ++ G +IDP R+IGL LYDGLFK+IP D +A +IR+EE QV D+ FL+GC
Sbjct: 117 KASETGIKAVIDPKARVIGLRLYDGLFKIIPLDKDNPELKASSIRMEEHQVQDVNFLHGC 176
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
A PT+++++QD + RHVKT+E++L+DK+FV+ PW Q+N++ A ++IPVP P+CG +II
Sbjct: 177 ANPTLILIHQD-INGRHVKTHEISLRDKEFVKVPWRQDNVEREAMIVIPVPSPICGAIII 235
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEK- 236
G+E+I+Y N + A+ I+ S Y +VD G RYLLGD AG L +L + EK+
Sbjct: 236 GQESILYHDGNTYVAVVPPIIKQSTITCYAKVDNQGLRYLLGDMAGHLFMLFVEQEKKPD 295
Query: 237 ----VTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
V LK+ELLGE SI I+YLDN V+++GS GDSQL+KL + D GSY +E
Sbjct: 296 GTQVVKDLKVELLGEISIPECITYLDNGVIFVGSRLGDSQLVKLITKADENGSYCVPMET 355
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
+ NL PIVD VVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI E AS++L GIKGMW+
Sbjct: 356 FTNLAPIVDMAVVDLERQGQGQMVTCSGAFKEGSLRIIRNGIGIEEHASIDLPGIKGMWA 415
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
L+ + FD LV+SF+ +TRIL +N E E+EET+I GF + QT + + +Q+
Sbjct: 416 LKVGGGN-FDNTLVLSFVGQTRILTLNGE-EVEETDIPGFVADEQTFHTGNVTNDLFIQI 473
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
T S RL+S ++ + +EW+ +++V N +QVL ATG L Y+EI G +
Sbjct: 474 TPTSARLISHETKTVVSEWEPENKRTISVVACNGTQVLCATGN-DLFYMEISCGQIVPKG 532
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
A L+YE++CLDI+P+ + + ++IAAVG+WT ISVRI +LP L I KE LGGEIIPRS
Sbjct: 533 FATLQYEVACLDISPL-DGHTEAKIAAVGLWTHISVRILTLPALEEINKELLGGEIIPRS 591
Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
+L+ FEG +YLLCALGDG + F L+ + G L+D+KKV+LGTQP LRTF S TT+VF
Sbjct: 592 ILMTCFEGNTYLLCALGDGSMYYFTLHKQNGILSDKKKVTLGTQPTVLRTFRSLFTTNVF 651
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
A SDRPTVIYSSN KL++SNVNLKEV+HMC N+ ++PDSLA+A + +TIGTID+IQKL
Sbjct: 652 ACSDRPTVIYSSNHKLVFSNVNLKEVNHMCSLNAESYPDSLALATDSTVTIGTIDEIQKL 711
Query: 653 HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
HIR++PLGE PRRI +QE S+TF + +++
Sbjct: 712 HIRTVPLGESPRRIAYQESSQTFGVITMRVDIQDSSGVSIVRHSASTQAASTSSSSHIAS 771
Query: 682 --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
+E E+H + ++D TFE + + L EY S++S +D YY
Sbjct: 772 YNKPTGHTASDICQEIEVHNLLIIDQHTFEVLHAHMLMPTEYALSLISTKLGEDPTSYYI 831
Query: 734 VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
VGTA V P+E EP GRIL++ DGKL +AEKE KG+ YSL FNGKLLA+IN ++L
Sbjct: 832 VGTALVHPDETEPKMGRILLYHWSDGKLTQVAEKEIKGSCYSLTEFNGKLLASINSTVRL 891
Query: 794 YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
++W +EL+ EC H +I+ALY++T+GDFI+VGDLM+S++LL YK EG EE A
Sbjct: 892 FEWT----AEKELRLECSHFNNIIALYLKTKGDFILVGDLMRSLTLLQYKTMEGCFEEIA 947
Query: 854 RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
RDYN NWM+A+EILDDD +LGAEN FNLF +K+S +++ER +++ VG++HLG+ VN
Sbjct: 948 RDYNPNWMTAIEILDDDTFLGAENCFNLFVCQKDSAATSEDERQQMQEVGQFHLGDMVNV 1007
Query: 914 FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
FRHGSLVM+ L +S V+FGTV+G IG++ +P Y FL L+ L VIK VG
Sbjct: 1008 FRHGSLVMQNLGESSTPTQGCVLFGTVSGAIGLVTQIPFTFYEFLRNLEERLTGVIKSVG 1067
Query: 973 GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV----------SV 1022
+ H WRSFN E K + F+DGDLIESFLDLS +M +++ + + +V
Sbjct: 1068 KIEHNFWRSFNTELKIEQCEGFIDGDLIESFLDLSPNKMADVASGLMIDDPSGMKKEATV 1127
Query: 1023 EELCKRVEELTRLH 1036
++L K VE+LTR+H
Sbjct: 1128 DDLVKIVEDLTRIH 1141
>gi|170057515|ref|XP_001864517.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167876915|gb|EDS40298.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 1138
Score = 1042 bits (2694), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 524/1096 (47%), Positives = 733/1096 (66%), Gaps = 73/1096 (6%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVL--QWDAESSELITRAMGDVSDRIG 60
+V I G+IA ++LFRP +D LFI T RY +L Q + E+IT+A G+V+DR+G
Sbjct: 54 EVGINGKIAVMKLFRPAEAQKDLLFILTHRYNAMILACQVQGDDIEIITKAHGNVADRVG 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
+P + G + +IDP R+IG+ LY+GLFK+IP D +A ++R+EE+ V D++FLYG
Sbjct: 114 KPAETGILAVIDPKARVIGMRLYEGLFKIIPLDRDTHELKATSLRMEEMHVQDVEFLYGT 173
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
A PT++V++QD + RH+KT+E+ LKDKDF + W Q+N++ A +LIPVP PL G ++I
Sbjct: 174 AHPTLIVIHQD-LNGRHIKTHEINLKDKDFTKIAWKQDNVETEATMLIPVPTPLGGAIVI 232
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEK- 236
G+E++VY +++ A+ I+ S Y RVD+ G RYLLG+ G L ++ + E+
Sbjct: 233 GQESVVYHDGDSYVAVAPAIIKQSTINCYARVDSRGFRYLLGNMIGHLFMMFLETEENTR 292
Query: 237 ----VTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
V +K+ELLGE +I I+YLDN V++IGS +GDSQL+KLN A G+YV V+E
Sbjct: 293 GQLTVKDIKVELLGEITIPECITYLDNGVLFIGSRHGDSQLVKLNTTAAASGAYVTVMET 352
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
+ NL PI+D C+VDLERQGQGQ++TCSG+YK+GSLRI+RNGIGI E A ++L GIKGMW+
Sbjct: 353 FTNLAPIIDMCIVDLERQGQGQMITCSGSYKEGSLRIIRNGIGIQEHACIDLPGIKGMWA 412
Query: 353 LRSSTDD-PFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQ 411
LR DD P+D LV+SF+ TRIL ++ E E+EETEI GF S QT +C + + Q++Q
Sbjct: 413 LRVGIDDSPYDNTLVLSFVGHTRILMLSGE-EVEETEIPGFLSDQQTFYCANVDFGQIIQ 471
Query: 412 VTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEV 471
VT + RL+S ++ + EWK P + V N+ Q++ AT + Y+EI G L
Sbjct: 472 VTPMTARLISCDNKSMICEWKPPDDKRIGVVACNSCQMVCATARD-IYYIEIEAGKLVHK 530
Query: 472 KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPR 531
L+YE++CLDI+P+ E + S++ AVG+WT+IS I LP+L + KE LGGEIIPR
Sbjct: 531 STVTLDYEVACLDISPLDEPATRSELVAVGLWTEISACILRLPNLEFVHKEKLGGEIIPR 590
Query: 532 SVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHV 591
S+L+ FEGI YLLCALGDG + F+++ T LTD+KKV+LGTQP L+TF S +TT+V
Sbjct: 591 SILMACFEGIIYLLCALGDGSMFYFVVDKTTHRLTDQKKVTLGTQPTILKTFRSLSTTNV 650
Query: 592 FAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQK 651
FA SDRPTVIYSSN KL++SNVNLKEV+HMC N+ ++ DSLA+A + + +GTID+IQK
Sbjct: 651 FACSDRPTVIYSSNHKLVFSNVNLKEVNHMCSLNAESYQDSLALATKNSVILGTIDEIQK 710
Query: 652 LHIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------ 681
LHIR++PLGE PRRI +QE S+TF + +++
Sbjct: 711 LHIRTVPLGESPRRIAYQEASQTFGVITVRMDIQDSSGLTPSRQSASTQTSNVTSSSNMG 770
Query: 682 -------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCV 734
N +E E+H + ++D TFE + + EY S++S +D YY V
Sbjct: 771 LLKPGASNTEFGQEVEVHNLLIIDQNTFEVLHAHQFMQTEYVLSLISAKLGNDPATYYIV 830
Query: 735 GTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLY 794
GTA V PEE EP GRI+++ DG L ++EKE KGA YSL FNG++LA IN ++LY
Sbjct: 831 GTAMVNPEEREPKVGRIIIYHYADGALTQVSEKEIKGACYSLVEFNGRVLATINSTVRLY 890
Query: 795 KWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERAR 854
+W DD ++L+ EC H ++LALY +T+GDFI+VGDLM+SI+LL YK EG+ EE AR
Sbjct: 891 EWT--DD--KDLRLECSHFNNVLALYCKTKGDFILVGDLMRSITLLQYKQMEGSFEEIAR 946
Query: 855 DYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRF 914
DY WM+AVEILDDD +LGAEN+ NLF K+S TD+ER ++ V ++HLG+ VN F
Sbjct: 947 DYQPKWMTAVEILDDDAFLGAENSNNLFVCLKDSAATTDDERQQMPEVAQFHLGDMVNVF 1006
Query: 915 RHGSLVMRLPDSDVGQ--IPT---VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIK 969
RHGSLVM+ ++G+ PT V+FGTV+G IG++ +P + Y FL KLQ NL IK
Sbjct: 1007 RHGSLVMQ----NIGERTTPTSGCVLFGTVSGAIGLVTQIPPDYYEFLRKLQENLTNTIK 1062
Query: 970 GVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVE------ 1023
VG ++H WRSF+ E KT +++ F+DGDL+ESFLDL+R +M E + + + VE
Sbjct: 1063 SVGRIDHTYWRSFHTEMKTENSEGFIDGDLVESFLDLTREKMHEAALGLQIDVEGTKKEA 1122
Query: 1024 ---ELCKRVEELTRLH 1036
++ K VE+LTR+H
Sbjct: 1123 NVDDIIKIVEDLTRIH 1138
>gi|198432471|ref|XP_002129229.1| PREDICTED: similar to DNA damage-binding protein 1 (Damage-specific
DNA-binding protein 1) (UV-damaged DNA-binding factor)
(DDB p127 subunit) (DNA damage-binding protein a) (DDBa)
(UV-damaged DNA-binding protein 1) (UV-DDB 1) (Xeroderma
pigmentosum group E-co... isoform 2 [Ciona intestinalis]
Length = 1142
Score = 1041 bits (2693), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 550/1100 (50%), Positives = 724/1100 (65%), Gaps = 77/1100 (7%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESS--ELITRAMGDVSDRIG 60
++ IYGRIA L+ FRP GE +D LFI TERY C+LQ+ A E+IT+A GD+SD +G
Sbjct: 54 EINIYGRIAVLKFFRPEGEDRDLLFILTERYHGCILQYKATDGGCEIITKASGDLSDTVG 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RP + G IGIIDP +LIGL LY+G+FK +P+D + FNIR+EEL V+D KFL+G
Sbjct: 114 RPPETGIIGIIDPTSKLIGLRLYEGVFKFLPYDPTSEELRPFNIRIEELSVIDAKFLHGY 173
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PT+V++YQ N RHVKTY V ++DK+ V GPW Q N+D A+ +I VP PL G +II
Sbjct: 174 TTPTLVIIYQ-NSQGRHVKTYIVDVRDKEVVAGPWKQENIDAEANFIINVPKPLAGSIII 232
Query: 181 GEETIVYCSANAFKAIPIRP-----SITKAYGRVDADGSRYLLGDHAGLLHLLVITHEK- 234
G+E+I Y N K IPI P Y VD DGSRYLLGD AG L +L++ ++
Sbjct: 233 GQESITY--HNGDKYIPIAPLCFFQDTINCYAPVDKDGSRYLLGDLAGHLFILLLESDEM 290
Query: 235 ----EKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVL 290
V LKIELLGE SI ISYLDN VVYIGS GDSQLI+L+ P + + VL
Sbjct: 291 MDGTNTVRDLKIELLGEVSIPEAISYLDNGVVYIGSRLGDSQLIRLHTTP-VEVHLISVL 349
Query: 291 ERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGM 350
+ Y NLGPI+D CVVDL+RQGQGQVVTCSGA+K+GSLRI+RNGIGI E AS++L GIKG+
Sbjct: 350 DTYTNLGPIIDMCVVDLDRQGQGQVVTCSGAFKEGSLRIIRNGIGIQEHASIDLPGIKGL 409
Query: 351 WSLRS-STDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQL 409
W LR T +DT LV+SF+ +RIL ++ E E+EET++ GF ++QT +C + +NQL
Sbjct: 410 WPLRVFDTSRSYDT-LVISFVGHSRILQLSGE-EVEETDLPGFDDESQTFYCSNVCHNQL 467
Query: 410 VQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILT 469
VQ+T S+RL+S T R +EWK ++VAT N SQVLLA G L Y EI G +
Sbjct: 468 VQITEKSIRLISHTERRQVHEWKPKNDRHISVATCNKSQVLLAIGSS-LHYFEIQPGEVI 526
Query: 470 EVKHAQLEYEISCLDINPIGENPS--------YSQIAAVGMWTDISVRIFSLPDLNLITK 521
E L +E++CL I P+ +PS + I AVG+W D + R+ LP L + +
Sbjct: 527 ERACVDLPHEVACLTIEPLVSDPSELEGPDFVTASICAVGLWNDNTARVLKLPTLEEMHQ 586
Query: 522 EHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLR 581
+ L EIIPRS+LL F+GI+YLL LGDG L F LN +TG ++DRKKV LGTQP +L
Sbjct: 587 QKLADEIIPRSILLVQFDGINYLLVTLGDGTLFYFTLNPETGYISDRKKVPLGTQPTSLS 646
Query: 582 TFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGEL 641
F+S + VFA SDRPTV+YSSNKKL++SNVNLKEVSHMCP +S +PDSLA+A + L
Sbjct: 647 VFTSGGSRTVFACSDRPTVVYSSNKKLVFSNVNLKEVSHMCPLDSDGYPDSLALANDNTL 706
Query: 642 TIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSC-AEESEMHFVR----- 695
IGTID+IQKLHIR++PL E PRRI +QE+S+ F + +L+ S A +M R
Sbjct: 707 LIGTIDEIQKLHIRTVPLYESPRRIAYQEESQCFGLVTLRTDSVDATGDKMKITRPSAST 766
Query: 696 -----------------------------LLDDQTFEFISTYPLDTFEYGCSILSCSFSD 726
++D TFE Y LDT E SI+SC
Sbjct: 767 QASVCTKSPPVDGRSVEGFSATADIGSLLIIDQHTFEVHHAYQLDTNEEPLSIMSCKLGS 826
Query: 727 DSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAA 786
D N Y+ VGTA+V EE EP GRILVF D KL L+AEKE KGAV+ L FNG +LAA
Sbjct: 827 DPNSYFVVGTAFVYMEETEPKHGRILVFHYIDNKLTLVAEKEVKGAVFCLCQFNGHVLAA 886
Query: 787 INQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEE 846
IN + +Y+W +EL++EC + +ILALY++ +GDF++VGDLM+S+S+L YKH E
Sbjct: 887 INTSVSIYQWTTE----KELRAECSNQSNILALYLKCKGDFVLVGDLMRSMSILNYKHVE 942
Query: 847 GAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYH 906
G ++E A+DY+ NWM+AVEILDDD +LGAEN +N+F +K+S TDEER +L +H
Sbjct: 943 GNLDEIAKDYSPNWMTAVEILDDDNFLGAENFYNVFICQKDSGATTDEERSKLREAALFH 1002
Query: 907 LGEFVNRFRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLR 965
+G+ +N FRHGSLVM+ + ++ V ++FGTV+G IGVI ++ + Y FL +Q L
Sbjct: 1003 VGDSINTFRHGSLVMQNVGETAVSSKGHILFGTVHGSIGVITTVDEDLYAFLHSIQNRLA 1062
Query: 966 KVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV----- 1020
KVIK VG ++HE WRSF +KT + F+DGDLIE FLDL+R +M E++K + V
Sbjct: 1063 KVIKSVGNIDHESWRSFCTNEKTEAHRGFVDGDLIECFLDLNREKMAEVAKGLMVKEHGT 1122
Query: 1021 ----SVEELCKRVEELTRLH 1036
+V++L K VEE+ R+H
Sbjct: 1123 KREATVDDLIKAVEEMNRIH 1142
>gi|198432469|ref|XP_002129207.1| PREDICTED: similar to DNA damage-binding protein 1 (Damage-specific
DNA-binding protein 1) (UV-damaged DNA-binding factor)
(DDB p127 subunit) (DNA damage-binding protein a) (DDBa)
(UV-damaged DNA-binding protein 1) (UV-DDB 1) (Xeroderma
pigmentosum group E-co... isoform 1 [Ciona intestinalis]
Length = 1150
Score = 1041 bits (2692), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 552/1107 (49%), Positives = 727/1107 (65%), Gaps = 83/1107 (7%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESS--ELITRAMGDVSDRIG 60
++ IYGRIA L+ FRP GE +D LFI TERY C+LQ+ A E+IT+A GD+SD +G
Sbjct: 54 EINIYGRIAVLKFFRPEGEDRDLLFILTERYHGCILQYKATDGGCEIITKASGDLSDTVG 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RP + G IGIIDP +LIGL LY+G+FK +P+D + FNIR+EEL V+D KFL+G
Sbjct: 114 RPPETGIIGIIDPTSKLIGLRLYEGVFKFLPYDPTSEELRPFNIRIEELSVIDAKFLHGY 173
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PT+V++YQ N RHVKTY V ++DK+ V GPW Q N+D A+ +I VP PL G +II
Sbjct: 174 TTPTLVIIYQ-NSQGRHVKTYIVDVRDKEVVAGPWKQENIDAEANFIINVPKPLAGSIII 232
Query: 181 GEETIVYCSANAFKAIPIRPSITK----AYGRVDADGSRYLLGDHAGLLHLLVITHEK-- 234
G+E+I Y N K IPI P K Y VD DGSRYLLGD AG L +L++ ++
Sbjct: 233 GQESITY--HNGDKYIPIAPPQIKDTINCYAPVDKDGSRYLLGDLAGHLFILLLESDEMM 290
Query: 235 ---EKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKL----NLQPDAKGSYV 287
V LKIELLGE SI ISYLDN VVYIGS GDSQLI+L +++ K S +
Sbjct: 291 DGTNTVRDLKIELLGEVSIPEAISYLDNGVVYIGSRLGDSQLIRLPTDSSMEGRPKPSLI 350
Query: 288 EVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI 347
VL+ Y NLGPI+D CVVDL+RQGQGQVVTCSGA+K+GSLRI+RNGIGI E AS++L GI
Sbjct: 351 SVLDTYTNLGPIIDMCVVDLDRQGQGQVVTCSGAFKEGSLRIIRNGIGIQEHASIDLPGI 410
Query: 348 KGMWSLRS-STDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIY 406
KG+W LR T +DT LV+SF+ +RIL ++ E E+EET++ GF ++QT +C + +
Sbjct: 411 KGLWPLRVFDTSRSYDT-LVISFVGHSRILQLSGE-EVEETDLPGFDDESQTFYCSNVCH 468
Query: 407 NQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDG 466
NQLVQ+T S+RL+S T R +EWK ++VAT N SQVLLA G L Y EI G
Sbjct: 469 NQLVQITEKSIRLISHTERRQVHEWKPKNDRHISVATCNKSQVLLAIGSS-LHYFEIQPG 527
Query: 467 ILTEVKHAQLEYEISCLDINPIGENPS--------YSQIAAVGMWTDISVRIFSLPDLNL 518
+ E L +E++CL I P+ +PS + I AVG+W D + R+ LP L
Sbjct: 528 EVIERACVDLPHEVACLTIEPLVSDPSELEGPDFVTASICAVGLWNDNTARVLKLPTLEE 587
Query: 519 ITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPI 578
+ ++ L EIIPRS+LL F+GI+YLL LGDG L F LN +TG ++DRKKV LGTQP
Sbjct: 588 MHQQKLADEIIPRSILLVQFDGINYLLVTLGDGTLFYFTLNPETGYISDRKKVPLGTQPT 647
Query: 579 TLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKE 638
+L F+S + VFA SDRPTV+YSSNKKL++SNVNLKEVSHMCP +S +PDSLA+A +
Sbjct: 648 SLSVFTSGGSRTVFACSDRPTVVYSSNKKLVFSNVNLKEVSHMCPLDSDGYPDSLALAND 707
Query: 639 GELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSC-AEESEMHFVR-- 695
L IGTID+IQKLHIR++PL E PRRI +QE+S+ F + +L+ S A +M R
Sbjct: 708 NTLLIGTIDEIQKLHIRTVPLYESPRRIAYQEESQCFGLVTLRTDSVDATGDKMKITRPS 767
Query: 696 --------------------------------LLDDQTFEFISTYPLDTFEYGCSILSCS 723
++D TFE Y LDT E SI+SC
Sbjct: 768 ASTQASVCTKSPPVDGRSVEGFSATADIGSLLIIDQHTFEVHHAYQLDTNEEPLSIMSCK 827
Query: 724 FSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKL 783
D N Y+ VGTA+V EE EP GRILVF D KL L+AEKE KGAV+ L FNG +
Sbjct: 828 LGSDPNSYFVVGTAFVYMEETEPKHGRILVFHYIDNKLTLVAEKEVKGAVFCLCQFNGHV 887
Query: 784 LAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYK 843
LAAIN + +Y+W +EL++EC + +ILALY++ +GDF++VGDLM+S+S+L YK
Sbjct: 888 LAAINTSVSIYQWTTE----KELRAECSNQSNILALYLKCKGDFVLVGDLMRSMSILNYK 943
Query: 844 HEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVG 903
H EG ++E A+DY+ NWM+AVEILDDD +LGAEN +N+F +K+S TDEER +L
Sbjct: 944 HVEGNLDEIAKDYSPNWMTAVEILDDDNFLGAENFYNVFICQKDSGATTDEERSKLREAA 1003
Query: 904 EYHLGEFVNRFRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQT 962
+H+G+ +N FRHGSLVM+ + ++ V ++FGTV+G IGVI ++ + Y FL +Q
Sbjct: 1004 LFHVGDSINTFRHGSLVMQNVGETAVSSKGHILFGTVHGSIGVITTVDEDLYAFLHSIQN 1063
Query: 963 NLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV-- 1020
L KVIK VG ++HE WRSF +KT + F+DGDLIE FLDL+R +M E++K + V
Sbjct: 1064 RLAKVIKSVGNIDHESWRSFCTNEKTEAHRGFVDGDLIECFLDLNREKMAEVAKGLMVKN 1123
Query: 1021 -----------SVEELCKRVEELTRLH 1036
+V++L K VEE+ R+H
Sbjct: 1124 FNDQHGTKREATVDDLIKAVEEMNRIH 1150
>gi|383863765|ref|XP_003707350.1| PREDICTED: DNA damage-binding protein 1-like [Megachile rotundata]
Length = 1138
Score = 1039 bits (2686), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 536/1091 (49%), Positives = 737/1091 (67%), Gaps = 66/1091 (6%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELI--TRAMGDVSDRIG 60
+V IYG+IA ++ FRP E +D LF+ T RY +L+ E E+ T+A G+V+DRIG
Sbjct: 57 EVGIYGKIAVVKFFRPPHEKKDLLFLLTTRYNAMILECIGEGEEIEIITKAHGNVADRIG 116
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
+ ++ G +IDP R+IGL LYDGLFK+IP D +A +IR++E QV D+ FL+GC
Sbjct: 117 KASETGIKAVIDPKARVIGLRLYDGLFKIIPLDKDNPELKASSIRMDEQQVQDVNFLHGC 176
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
A PT+++++QD + RHVKT+E++L+DK+FV+ PW Q+N++ A ++IPVP P+CG +II
Sbjct: 177 ANPTLILIHQD-INGRHVKTHEISLRDKEFVKIPWRQDNVEREATMVIPVPSPICGAIII 235
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEK--- 234
G+E+I+Y + A+ I+ S Y +VD G RYLLGD AG L +L + EK
Sbjct: 236 GQESILYHDGTTYVAVVPPIIKQSTITCYAKVDNQGLRYLLGDMAGHLFMLFLEQEKNPD 295
Query: 235 --EKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
+ V LK+ELLGE SI I+YLDN V+++GS GDSQLIKL + D GSY +E
Sbjct: 296 GTQVVKDLKVELLGEISIPECITYLDNGVIFVGSRLGDSQLIKLITKADENGSYCVPMET 355
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
+ NL PIVD VVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI E AS++L GIKGMW+
Sbjct: 356 FTNLAPIVDMAVVDLERQGQGQMVTCSGAFKEGSLRIIRNGIGIEEHASIDLPGIKGMWA 415
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
LR + FD LV+SF+ +TRIL +N E E+EET+I GF + QT + + +Q+
Sbjct: 416 LRIGGGN-FDNTLVLSFVGQTRILTLNGE-EVEETDIPGFVADEQTFHTGNVTNDLFIQI 473
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
T S RL+S ++ + +EW+ +++V N +QVL ATG L Y+EI G +
Sbjct: 474 TPTSARLISHETKTVVSEWEPENKRTISVVACNGTQVLCATGN-DLFYMEISCGQIVPKG 532
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
A L+YE++CLDI+P+ N + ++IAAVG+WTDISVRI +LP L I KE LGGEIIPRS
Sbjct: 533 FATLQYEVACLDISPLDGN-TEAKIAAVGLWTDISVRILTLPALEEINKELLGGEIIPRS 591
Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
+L+ FEG +YLLCALGDG + F L+ + G L+D+KKV+LGTQP LRTF S +T +VF
Sbjct: 592 ILMTCFEGNTYLLCALGDGSMYYFTLHKQNGILSDKKKVTLGTQPTVLRTFRSLSTINVF 651
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
A SDRPTVIYSSN KL++SNVNLKEV+HMC N+ ++PDSLA+A + +TIGTID+IQKL
Sbjct: 652 ACSDRPTVIYSSNHKLVFSNVNLKEVNHMCSLNAESYPDSLALATDSTVTIGTIDEIQKL 711
Query: 653 HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
HIR++PLGE PRRI +QE S+TF + +++
Sbjct: 712 HIRTVPLGESPRRIAYQENSQTFGVITMRVDIQDSSGVSIVRHSASTQAASTSSSSHIAS 771
Query: 682 -NQSCA----EESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGT 736
N+ A +E E+H + ++D TFE + + L EY S++S +D YY VGT
Sbjct: 772 HNKPTASDISQEIEVHNLLIIDQHTFEVLHAHMLMPNEYALSLISTKLGEDPTFYYVVGT 831
Query: 737 AYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKW 796
A V P+E EP GRIL++ DGKL +AEKE KG+ YSL FNGKLLA+IN ++L++W
Sbjct: 832 ALVNPDETEPKMGRILLYHWNDGKLTQVAEKEIKGSCYSLVEFNGKLLASINSTVRLFEW 891
Query: 797 MLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDY 856
+EL+ EC H +I+ALY++T+GDF++VGDLM+S++LL YK EG+ EE ARDY
Sbjct: 892 T----AEKELRLECSHFNNIIALYLKTKGDFVLVGDLMRSLTLLQYKTMEGSFEEIARDY 947
Query: 857 NANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRH 916
N NWM+AVEILDDD +LGAEN FNLF +K+S +++ER +++ +G++HLG+ VN FRH
Sbjct: 948 NPNWMTAVEILDDDTFLGAENCFNLFVCQKDSAATSEDERQQMQEIGQFHLGDMVNVFRH 1007
Query: 917 GSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLN 975
GSLVM+ L +S V+FGTV+G IG++ +P Y FL L+ L +VIK VG +
Sbjct: 1008 GSLVMQNLGESSTPTQGCVLFGTVSGAIGLVTQIPFTFYEFLRHLEYRLTEVIKSVGKIE 1067
Query: 976 HEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTM----------NVSVEEL 1025
H WRSFN E K + + F+DGDLIESFLDLS +M E++ + +V++L
Sbjct: 1068 HRFWRSFNTELKVENCEGFIDGDLIESFLDLSPDKMAEVAVDLMMDDSSGMRKEATVDDL 1127
Query: 1026 CKRVEELTRLH 1036
K VE+LTR+H
Sbjct: 1128 VKIVEDLTRIH 1138
>gi|157128864|ref|XP_001655231.1| DNA repair protein xp-e [Aedes aegypti]
gi|108882186|gb|EAT46411.1| AAEL002407-PB [Aedes aegypti]
Length = 1138
Score = 1039 bits (2686), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 518/1096 (47%), Positives = 735/1096 (67%), Gaps = 73/1096 (6%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESS--ELITRAMGDVSDRIG 60
++ I G+IA ++LFRP +D +FI T RY +L+ + E+IT+A G+V+DR+G
Sbjct: 54 EIGINGKIAVMKLFRPAEAQKDLIFILTHRYNAMILECAVQGDDIEIITKAHGNVADRVG 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
+P + G + +IDP R+IG+ LY+GLFK+IP D +A ++R+EE+ V D++FLYG
Sbjct: 114 KPAETGILAVIDPKARVIGMRLYEGLFKIIPLDRDTHELKATSLRMEEVHVQDVEFLYGT 173
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PT++V++QD + RH+KT+E+ LKDKDF + W Q+N++ A +LIPVP PL G ++I
Sbjct: 174 QHPTLIVIHQD-LNGRHIKTHEINLKDKDFTKIAWKQDNVETEATMLIPVPTPLGGAIVI 232
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVI-THEKEK 236
G+E++VY +++ A+ I+ S Y RVD+ G RYLLG+ +G L ++ + T E K
Sbjct: 233 GQESVVYHDGDSYVAVAPAIIKQSTINCYARVDSKGFRYLLGNMSGHLFMMFLETEENSK 292
Query: 237 ----VTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
V +K+ELLG+ +I I+YLDN V++IGS +GDSQL+KLN G+YV V+E
Sbjct: 293 GLLSVKDIKVELLGDITIPECITYLDNGVLFIGSRHGDSQLVKLNTTAGDNGAYVTVMET 352
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
+ NL PI+D C+VDLE+QGQGQ++TCSG+YK+GSLRI+RNGIGI E A ++L GIKGMW+
Sbjct: 353 FTNLAPIIDMCIVDLEKQGQGQMITCSGSYKEGSLRIIRNGIGIQEHACIDLPGIKGMWA 412
Query: 353 LRSSTDD-PFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQ 411
LR DD P+D LV+SF+ TRIL ++ E E+EETEI GF S QT +C + + Q++Q
Sbjct: 413 LRVGIDDSPYDNTLVLSFVGHTRILTLSGE-EVEETEIPGFLSDQQTFYCANVDFGQIIQ 471
Query: 412 VTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEV 471
VT + RL+ ++ + EWK P ++V N+ Q++ AT + Y+EIG+ L
Sbjct: 472 VTPTTARLIQCDNKSMICEWKPPDDKRISVVACNSCQMVCATACD-IYYIEIGESKLVHK 530
Query: 472 KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPR 531
L+YE++CLDI+P+ +N +++++ AVG+WTDIS I LP+L ++ E LGGEIIPR
Sbjct: 531 STVTLDYEVACLDISPLEDNATHAELVAVGLWTDISACILRLPNLEVVHTEKLGGEIIPR 590
Query: 532 SVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHV 591
S+L+ FEGI YLLCALGDG + F+L+ T LTD+KKV+LGTQP L+TF S +TT+V
Sbjct: 591 SILMAHFEGIVYLLCALGDGSMFYFVLDKNTNRLTDQKKVTLGTQPTILKTFRSLSTTNV 650
Query: 592 FAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQK 651
FA SDRPTVIYSSN KL++SNVNLKEV+HMC N+ A+ DSLA+A + + +GTID+IQK
Sbjct: 651 FACSDRPTVIYSSNHKLVFSNVNLKEVNHMCSLNAEAYQDSLALATKNSVILGTIDEIQK 710
Query: 652 LHIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------ 681
LHIR++PLGE PRRI +QE S+TF + +++
Sbjct: 711 LHIRTVPLGESPRRIAYQEASQTFGVITVRTDIQDSSGLTPSRQSASTQTTNVTLSTNMG 770
Query: 682 -------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCV 734
N +E E+H + ++D TFE + + EY S++S +D N YY V
Sbjct: 771 LLKAGASNAEFGQEVEVHNLLIIDQNTFEVLHAHQFMQTEYAMSLISAKLGNDPNTYYIV 830
Query: 735 GTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLY 794
GTA V PEE EP GRI+++ DG L ++EKE KG+ YSL FNG++LA+IN ++LY
Sbjct: 831 GTALVNPEEPEPKVGRIIIYHYADGNLTQVSEKEIKGSCYSLVEFNGRVLASINSTVRLY 890
Query: 795 KWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERAR 854
+W DD ++L+ EC H ++LALY +T+GDFI+VGDLM+SI+LL YK EG+ EE AR
Sbjct: 891 EWT--DD--KDLRLECSHFNNVLALYCKTKGDFILVGDLMRSITLLQYKQMEGSFEEIAR 946
Query: 855 DYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRF 914
DY NWM+AVEILDDD +LGA+N+ NLF K+ TD+ER ++ V + HLG+ VN F
Sbjct: 947 DYQPNWMTAVEILDDDAFLGADNSNNLFVCLKDGAATTDDERQQMPEVAQVHLGDMVNVF 1006
Query: 915 RHGSLVMRLPDSDVGQ--IPT---VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIK 969
RHGSLVM ++G+ PT V+FGTV+G IG++ +P + Y FL KLQ NL IK
Sbjct: 1007 RHGSLVME----NIGERTTPTSGCVLFGTVSGAIGLVTQIPADYYEFLRKLQENLTDTIK 1062
Query: 970 GVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV--------- 1020
VG ++H WRSF+ E KT + F+DGDL+ESFLDLSR +M E + + +
Sbjct: 1063 SVGKIDHAYWRSFHTEMKTERCEGFIDGDLVESFLDLSREKMHEAALGLQIDVDGTKKEA 1122
Query: 1021 SVEELCKRVEELTRLH 1036
+V+++ K VE+LTR+H
Sbjct: 1123 TVDDIIKIVEDLTRIH 1138
>gi|328788389|ref|XP_396048.3| PREDICTED: DNA damage-binding protein 1-like isoform 1 [Apis
mellifera]
Length = 1141
Score = 1038 bits (2683), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 534/1094 (48%), Positives = 733/1094 (67%), Gaps = 69/1094 (6%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
+V IYG+IA ++ FRP E +D LF+ T RY +L+ + E+ E+IT+A G+V+DRIG
Sbjct: 57 EVGIYGKIAVVKFFRPPHEKKDLLFLLTTRYNAMILECIGEGENIEIITKAHGNVADRIG 116
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
+ ++ G +IDP R+IGL LYDGLFK+IP D +A +IR+EE QV D+ FL+GC
Sbjct: 117 KASETGIKAVIDPKARVIGLRLYDGLFKIIPLDKDNPELKASSIRMEEHQVQDVNFLHGC 176
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
A PT+++++QD + RHVKT+E++L+DK+FV+ PW Q+N++ A ++IPVP P+CG +II
Sbjct: 177 ANPTLILIHQD-INGRHVKTHEISLRDKEFVKIPWRQDNVEREAMIVIPVPSPICGAIII 235
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEK- 236
G+E+I+Y N + A+ I+ S Y +VD G RYLLGD AG L +L + EK+
Sbjct: 236 GQESILYHDGNTYVAVVPPIIKQSTITCYAKVDNQGLRYLLGDMAGHLFMLFVEQEKKAD 295
Query: 237 ----VTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
V LK+ELLGE SI I+YLDN V+++GS GDSQL+KL + D GSY +E
Sbjct: 296 GTQVVKDLKVELLGEISIPECITYLDNGVIFVGSRLGDSQLVKLITKADENGSYCVPMET 355
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
+ NL PIVD VVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI E AS++L GIKGMW+
Sbjct: 356 FTNLAPIVDMAVVDLERQGQGQMVTCSGAFKEGSLRIIRNGIGIEEHASIDLPGIKGMWA 415
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
L+ + FD LV+SF+ +TRIL +N E E+EET+I GF + QT + + +Q+
Sbjct: 416 LKIGGGN-FDNTLVLSFVGQTRILTLNGE-EVEETDIPGFVADEQTFHTGNVTNDLFIQI 473
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
T S RL+S ++ + +EW+ +++V N +QVL ATG L YLEI G +
Sbjct: 474 TPTSARLISYETKTVVSEWEPENKRTISVVACNGTQVLCATGN-DLFYLEISCGQILPKG 532
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
L+YE++CLDI+P+ N + ++IAAVG+WT ISV I +LP L I KE LGGEIIPRS
Sbjct: 533 FTTLQYEVACLDISPLDGN-TEAKIAAVGLWTHISVHILTLPALEEINKELLGGEIIPRS 591
Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
+L+ FEG +YLLCALGDG + F L+ + G L+D+KKV+LGTQP LRTF S TT+VF
Sbjct: 592 ILMTCFEGNTYLLCALGDGSMYYFTLHKQNGILSDKKKVTLGTQPTVLRTFRSLFTTNVF 651
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
A SDRPTVIYSSN KL++SNVNLKEV+HMC N+ ++PDSLA+A + +TIGTID+IQKL
Sbjct: 652 ACSDRPTVIYSSNHKLVFSNVNLKEVNHMCSLNAESYPDSLALATDSTVTIGTIDEIQKL 711
Query: 653 HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
HIR++PLGE PRRI +QE S+TF + +++
Sbjct: 712 HIRTVPLGESPRRIAYQESSQTFGVITMRVDIQDSSGVSIVRHSASTQAASTSSSSHIAS 771
Query: 682 --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
+E E+H + ++D TFE + + L EY S++S +D YY
Sbjct: 772 YNKPTGHTASDICQEIEVHNLLIIDQHTFEVLHAHMLMPTEYALSLISTKLGEDPTSYYI 831
Query: 734 VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
VGTA V P+E EP GRIL++ DGKL +AEKE KG+ YSL FNGKLLA+IN ++L
Sbjct: 832 VGTALVHPDETEPKMGRILLYHWSDGKLTQVAEKEIKGSCYSLTEFNGKLLASINSTVRL 891
Query: 794 YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
++W +EL+ EC H +I+ALY++++GDFI+VGDLM+S++LL YK EG EE A
Sbjct: 892 FEWT----AEKELRLECSHFNNIIALYLKSKGDFILVGDLMRSLTLLQYKTMEGCFEEIA 947
Query: 854 RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
RDYN NWM+A+EILDDD +LGAEN FNLF +K+S +++ER +++ VG++HLG+ VN
Sbjct: 948 RDYNPNWMTAIEILDDDTFLGAENCFNLFVCQKDSAATSEDERQQMQEVGQFHLGDMVNV 1007
Query: 914 FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
FRHGSLVM+ L +S V+FGTV+G IG++ +P Y FL L+ L VIK VG
Sbjct: 1008 FRHGSLVMQNLGESSTPTQGCVLFGTVSGAIGLVTQIPFIFYEFLRNLEDRLTSVIKSVG 1067
Query: 973 GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV----------SV 1022
+ H WRSFN E K + F+DGDLIESFLDLS +M E++ + + +V
Sbjct: 1068 KIEHNFWRSFNTELKIEQCEGFIDGDLIESFLDLSPDKMAEVASGLMIDDPSGMKKEATV 1127
Query: 1023 EELCKRVEELTRLH 1036
++L K VE+LTR+H
Sbjct: 1128 DDLVKIVEDLTRIH 1141
>gi|307205760|gb|EFN83990.1| DNA damage-binding protein 1 [Harpegnathos saltator]
Length = 1138
Score = 1035 bits (2676), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 529/1094 (48%), Positives = 735/1094 (67%), Gaps = 69/1094 (6%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
+V IYGRIA ++ FRP E +D LF+ T RY +L+ + E E+IT+A G+V+DRIG
Sbjct: 54 EVGIYGRIAVVKFFRPPHEKKDLLFLLTTRYNAMILECIGEGEDIEIITKAHGNVADRIG 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
+ ++ G +IDP R+IGL LYDGLFK+IP D +A +IR++E QV D+ FL+GC
Sbjct: 114 KASETGIKAVIDPKARVIGLRLYDGLFKIIPLDKDNPELKASSIRMDEQQVQDVNFLHGC 173
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
A PT+++++QD + RHVKT+E++L+DK+FV+ PW Q+N++ A ++IPVP P+CG +II
Sbjct: 174 ANPTLILIHQD-INGRHVKTHEISLRDKEFVKIPWRQDNVEREAMMVIPVPSPICGAIII 232
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEK- 236
G+E+I+Y + A+ I+ S Y +VD G RYLLGD AG L +L + EK+
Sbjct: 233 GQESILYHDGTTYIAVVPPIIKQSTITCYAKVDNQGLRYLLGDMAGHLFMLFLEQEKKPD 292
Query: 237 ----VTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
V LK+ELLGE SI I+YLDN V+++GS GDSQLIKL + D GSY +E
Sbjct: 293 GTQVVKDLKVELLGEISIPECITYLDNGVIFVGSRLGDSQLIKLITKADENGSYCVPMET 352
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
+ NL PIVD VVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI E AS++L GIKGMW+
Sbjct: 353 FTNLAPIVDMAVVDLERQGQGQMVTCSGAFKEGSLRIIRNGIGIQEHASIDLPGIKGMWA 412
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
L+ + FD LV+SF+ +TRIL +N E E+EET+I GF + QT + + +Q+
Sbjct: 413 LKVGGGN-FDNTLVLSFVGQTRILTLNGE-EVEETDIPGFVADEQTFHTGNVTNDLFIQI 470
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
T S RL+S ++ + +EW+ +++V N +QVL ATG L Y+EI +
Sbjct: 471 TPTSARLISHETKIVVSEWEPENKRTISVVACNGTQVLCATGN-DLFYMEIICNQIVPKG 529
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
A L++E++CLDI+P+ + S ++I AVG+WTDISVRI +LP L I KE LGGEIIPRS
Sbjct: 530 FATLQHEVACLDISPL-DGISEAKIVAVGLWTDISVRILTLPGLEEINKELLGGEIIPRS 588
Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
+L+ FEG +YLLCALGDG + F L+ + G L+D+KKV+LGTQP LRTF S +TT+VF
Sbjct: 589 ILMTCFEGNTYLLCALGDGSMYYFTLHKQNGILSDKKKVTLGTQPTVLRTFRSLSTTNVF 648
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
A SDRPTVIYSSN KL++SNVNLKEV+HMC N+ ++PDSLA+A + +TIGTID+IQKL
Sbjct: 649 ACSDRPTVIYSSNHKLVFSNVNLKEVNHMCSLNAESYPDSLALATDSTVTIGTIDEIQKL 708
Query: 653 HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
HIR++PLGE PRRI +QE S+TF + +++
Sbjct: 709 HIRTVPLGESPRRIAYQESSQTFGVITMRVDVQESSGVSIVRHSASTQAASTSSSSHIAS 768
Query: 682 --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
+E E+H + ++D TFE + + L EY S++S +D Y+
Sbjct: 769 HNKPSGHTASEIGQEIEVHNLLIIDQHTFEVLHAHTLMPTEYALSLISTRLGEDPTSYFV 828
Query: 734 VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
VGTA + P+E EP GRIL++ DGKL +AEKE KG+ YSL FNGKLLA+IN ++L
Sbjct: 829 VGTALINPDETEPKMGRILLYHWSDGKLTQVAEKEIKGSCYSLVEFNGKLLASINSTVRL 888
Query: 794 YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
++W +EL+ EC H +I+ALY++T+GDF++VGDLM+S++LL YK EG+ EE A
Sbjct: 889 FEWT----AEKELRLECSHFNNIIALYLKTKGDFVLVGDLMRSLTLLQYKTMEGSFEEIA 944
Query: 854 RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
RDYN NWM+++EILDDD +LGAEN FNLF +K+S +++ER +++ VG++HLG+ VN
Sbjct: 945 RDYNPNWMTSIEILDDDTFLGAENCFNLFVCQKDSAATSEDERQQMQEVGQFHLGDMVNV 1004
Query: 914 FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
FRHGSLVM+ L +S + V+FGTV+G IG++ +P Y FL L+ L VIK VG
Sbjct: 1005 FRHGSLVMQNLGESSTPTLGCVLFGTVSGAIGLVTQIPFAFYEFLRNLEDRLNSVIKSVG 1064
Query: 973 GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV----------SV 1022
+ H WRSFN E K + F+DGDLIESFLDL+ +M E++ + + +V
Sbjct: 1065 KIEHNFWRSFNTELKIEQCEGFIDGDLIESFLDLNHDKMAEVAMGLMIDDGSGMKKEATV 1124
Query: 1023 EELCKRVEELTRLH 1036
++L K VE+LTR+H
Sbjct: 1125 DDLVKVVEDLTRIH 1138
>gi|380025901|ref|XP_003696702.1| PREDICTED: LOW QUALITY PROTEIN: DNA damage-binding protein 1-like
[Apis florea]
Length = 1141
Score = 1035 bits (2676), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 533/1094 (48%), Positives = 732/1094 (66%), Gaps = 69/1094 (6%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
+V IYG+IA ++ FRP E +D LF+ T RY +L+ + E+ E+IT+A G+V+DRIG
Sbjct: 57 EVGIYGKIAVVKFFRPPHEKKDLLFLLTTRYNAMILECIGEGENIEIITKAHGNVADRIG 116
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
+ ++ G +IDP R+IGL LYDGLFK+IP D +A +IR+EE QV D+ FL+GC
Sbjct: 117 KASETGIKAVIDPKARVIGLRLYDGLFKIIPLDKDNPELKASSIRMEEHQVQDVNFLHGC 176
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
A PT+++++QD + RHVKT+E++L+DK+FV+ PW Q+N++ A ++IPVP P+CG +II
Sbjct: 177 ANPTLILIHQD-INGRHVKTHEISLRDKEFVKIPWRQDNVEREAMIVIPVPSPICGAIII 235
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEK- 236
G+E+I+Y N + A+ I+ S Y +VD G RYLLGD AG L +L + EK+
Sbjct: 236 GQESILYHDGNTYVAVVPPIIKQSTITCYAKVDNQGLRYLLGDMAGHLFMLFVEQEKKTD 295
Query: 237 ----VTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
V LK+ELLGE SI I+YLDN V+++GS GDSQL+KL + D GSY +E
Sbjct: 296 GTQVVKDLKVELLGEISIPECITYLDNGVIFVGSRLGDSQLVKLITKADENGSYCVPMET 355
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
+ NL PIVD VVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI E AS++L GIKGMW+
Sbjct: 356 FTNLAPIVDMAVVDLERQGQGQMVTCSGAFKEGSLRIIRNGIGIEEHASIDLPGIKGMWA 415
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
L+ + FD LV+SF+ +TRIL +N E E+EET+I GF + QT + + +Q+
Sbjct: 416 LKIGGGN-FDNTLVLSFVGQTRILTLNGE-EVEETDIPGFVADEQTFHTGNVTNDLFIQI 473
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
T S RL+S ++ + +EW+ +++V N +QVL ATG L YLEI G +
Sbjct: 474 TPTSARLISYETKTVVSEWEPENKRTISVVACNGTQVLCATGN-DLFYLEISCGQILPKG 532
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
L+YE++CLDI+P+ N + ++IAAVG+WT ISV I +LP L I KE LGGEIIPRS
Sbjct: 533 FTTLQYEVACLDISPLDGN-TEAKIAAVGLWTHISVHILTLPALEEINKELLGGEIIPRS 591
Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
+L+ FEG +YLLCALGDG + F L+ + G L+D+KKV+LGTQP LRTF S TT+VF
Sbjct: 592 ILMTCFEGNTYLLCALGDGSMYYFTLHKQNGILSDKKKVTLGTQPTVLRTFRSLFTTNVF 651
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
A SDRPTVIYSSN KL++SNVNLKEV+HMC N+ ++PDSLA+A + +TIGTID+IQKL
Sbjct: 652 ACSDRPTVIYSSNHKLVFSNVNLKEVNHMCSLNAESYPDSLALATDSTVTIGTIDEIQKL 711
Query: 653 HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
HIR++PLGE PRRI +QE S+TF + +++
Sbjct: 712 HIRTVPLGESPRRIAYQESSQTFGVITMRVDIQDSSGVSIVRHSASTQAASTSSSSHIAS 771
Query: 682 --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
+E E+H + ++D TFE + + L EY S++S +D YY
Sbjct: 772 YNKPTGHTASDICQEIEVHNLLIIDQHTFEVLHAHMLMPTEYALSLISTKLGEDPTSYYI 831
Query: 734 VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
VGTA V P+E EP GRIL++ DGKL +AEKE KG+ YSL FNGKLLA+IN ++L
Sbjct: 832 VGTALVHPDETEPKMGRILLYHWSDGKLTQVAEKEXKGSCYSLTEFNGKLLASINSTVRL 891
Query: 794 YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
++W +EL+ EC H +I+ALY++++GDFI+VGDLM+S++LL YK EG EE A
Sbjct: 892 FEWT----AEKELRLECSHFNNIIALYLKSKGDFILVGDLMRSLTLLQYKTMEGCFEEIA 947
Query: 854 RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
RDYN NWM+A+EILDDD +LGAEN FNLF +K+S +++ER +++ VG++HLG+ VN
Sbjct: 948 RDYNPNWMTAIEILDDDTFLGAENCFNLFVCQKDSAATSEDERQQMQEVGQFHLGDMVNV 1007
Query: 914 FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
FRHGSLVM+ L +S V+ GTV+G IG++ +P Y FL L+ L VIK VG
Sbjct: 1008 FRHGSLVMQNLGESSTPTQGCVLXGTVSGAIGLVTQIPFIFYEFLRNLEDRLTSVIKSVG 1067
Query: 973 GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV----------SV 1022
+ H WRSFN E K + F+DGDLIESFLDLS +M E++ + + +V
Sbjct: 1068 KIEHNFWRSFNTELKIEQCEGFIDGDLIESFLDLSPDKMAEVASGLMIDDPSGMKKEATV 1127
Query: 1023 EELCKRVEELTRLH 1036
++L K VE+LTR+H
Sbjct: 1128 DDLVKIVEDLTRIH 1141
>gi|332030156|gb|EGI69950.1| DNA damage-binding protein 1 [Acromyrmex echinatior]
Length = 1138
Score = 1031 bits (2665), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 523/1094 (47%), Positives = 736/1094 (67%), Gaps = 69/1094 (6%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
+V IYG+I+ ++ FRP E +D LF+ T RY +L+ + E E+IT+A G+V+DRIG
Sbjct: 54 EVGIYGKISVIKFFRPPHEKKDLLFLLTTRYNAMILECIGEGEDIEIITKAHGNVADRIG 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
+ ++ G +IDP R+IGL LYDGLFK+IP D +A +IR++E QV D+ FL+GC
Sbjct: 114 KASETGIKAVIDPKARVIGLRLYDGLFKIIPLDKDNPELKASSIRMDEQQVQDVNFLHGC 173
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
A PT+++++QD + RHVKT+E+ L+DK+F + PW Q+N++ A ++IPVP P+CG +II
Sbjct: 174 ANPTLILIHQD-INGRHVKTHEINLRDKEFAKIPWRQDNVEREAMMVIPVPSPICGAIII 232
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEK- 236
G+E+I+Y + A+ I+ S Y +VD G RYLLGD AG L +L + EK+
Sbjct: 233 GQESILYHDGTTYVAVVPPIIKQSTITCYAKVDNQGLRYLLGDMAGHLFMLFLEQEKKPD 292
Query: 237 ----VTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
V LK+ELLGE SI I+YLDN V+Y+GS GDSQLIKL + D GSY +E
Sbjct: 293 GSQVVKDLKVELLGEISIPECITYLDNGVIYVGSRLGDSQLIKLITKADENGSYCVPMET 352
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
+ NL PIVD VVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI E AS++L GIKGMW+
Sbjct: 353 FTNLAPIVDMAVVDLERQGQGQMVTCSGAFKEGSLRIIRNGIGIQEHASMDLPGIKGMWA 412
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
L+ FD LV+SF+ +TRIL +N E E+EET+I GF + QT + + +Q+
Sbjct: 413 LKVGGSH-FDNTLVLSFVGQTRILTLNGE-EVEETDIPGFVADEQTFHTGNVTDDLFIQI 470
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
TS SVRL+S ++ + +EW+ +++V N +QVL ATG L Y+EI +
Sbjct: 471 TSTSVRLISHENKIVVSEWEPQNKRTISVVACNGTQVLCATGND-LFYIEIICNQIVSKG 529
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
L++E++CLDI+P+ + + ++I AVG+WTDISVRI +LP+L I KE LGGEIIPRS
Sbjct: 530 FVTLQHEVACLDISPL-DGVNEAKIVAVGLWTDISVRILTLPNLEEINKELLGGEIIPRS 588
Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
+L+ FEG +YLLCALGDG + F+L+ ++G L+D+KKV+LGTQP LRTF S +TT+VF
Sbjct: 589 ILMTCFEGNTYLLCALGDGSMYYFILHRQSGMLSDKKKVTLGTQPTVLRTFRSLSTTNVF 648
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
A SDRPTVIYSSN KL++SNVNLKEV+HMC N+ ++PDSLA+A + +TIGTID+IQKL
Sbjct: 649 ACSDRPTVIYSSNHKLVFSNVNLKEVNHMCSLNAESYPDSLALATDSTVTIGTIDEIQKL 708
Query: 653 HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
HIR++PLGE PRRI +QE S+TF + +++
Sbjct: 709 HIRTVPLGESPRRIAYQESSQTFGVITMRVDVQESSGVSIVRHSASTQAASTSSSSHVVS 768
Query: 682 --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
+E E+H + ++D TFE + + L EY S++S +D Y+
Sbjct: 769 HNKPSGHTASEIGQEIEVHNLLIIDQHTFEVLHAHTLMATEYALSLISTKLGEDPTSYFV 828
Query: 734 VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
VGTA++ P+E EP GRIL++ +GK +AEKE KG+ YSL FNGKLLA+IN ++L
Sbjct: 829 VGTAFINPDETEPKMGRILLYHWSEGKFTQVAEKEIKGSCYSLVEFNGKLLASINSTVRL 888
Query: 794 YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
++W +EL+ EC H +I+ALY++T+GDF++VGDLM+S++LL YK EG+ EE A
Sbjct: 889 FEWT----AEKELRLECSHFNNIIALYLKTKGDFVLVGDLMRSLTLLQYKTMEGSFEEIA 944
Query: 854 RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
RDYN NWM+++EILDDD +LGAEN FNLF +K+S +++ER +++ +G++HLG+ VN
Sbjct: 945 RDYNPNWMTSIEILDDDTFLGAENCFNLFVCQKDSAATSEDERQQMQEIGQFHLGDMVNV 1004
Query: 914 FRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG 972
FRHGSLVM+ L +S + V+FGTV+G IG++ +P Y FL ++ L VIK VG
Sbjct: 1005 FRHGSLVMQNLGESSTPTLGCVLFGTVSGAIGLVTQIPVTFYEFLRNMEDRLNSVIKSVG 1064
Query: 973 GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV----------SV 1022
+ H WRSFN E K + F+DGDLIESFLDL+ +M E++ + + +V
Sbjct: 1065 KIEHNFWRSFNTELKIEQCEGFIDGDLIESFLDLNHDKMAEVAMGLMIDDGSGMKKEATV 1124
Query: 1023 EELCKRVEELTRLH 1036
++L K VE+LTR+H
Sbjct: 1125 DDLVKIVEDLTRIH 1138
>gi|193644722|ref|XP_001942922.1| PREDICTED: DNA damage-binding protein 1-like [Acyrthosiphon pisum]
Length = 1156
Score = 1030 bits (2664), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 523/1110 (47%), Positives = 744/1110 (67%), Gaps = 85/1110 (7%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVL---QWDAESSELITRAMGDVSDRI 59
++ +YG+I +++FRP + +D LF+ T RY +L Q ++ ++IT+A G++SD+I
Sbjct: 56 EIALYGKIEIMKVFRPKHKDKDLLFVVTARYNTMILECVQTESGDIDIITKAHGNISDQI 115
Query: 60 GRPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYG 119
G+ ++ G + +IDP R+IGL LYDGLFK+IP D +G+LK A+ +R+EE++V DI FLYG
Sbjct: 116 GKISEIGAMAVIDPSARVIGLKLYDGLFKIIPLDKEGELK-AYCLRMEEVEVQDIDFLYG 174
Query: 120 CAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLI 179
CA PTI++++QD RH+K E+++KDK+FV+ PW Q N++ A ++IPVP PLCG +I
Sbjct: 175 CANPTIIIIHQDTM-GRHIKAKELSIKDKEFVKTPWKQENVETEASMIIPVPEPLCGAII 233
Query: 180 IGEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE- 235
IG E+++Y + ++F AI I+ S Y R+D +G+RYLLGD AG L +L++ +EK
Sbjct: 234 IGRESVLYHNGSSFIAISPPVIKQSTIVCYARIDPEGTRYLLGDMAGHLFMLLLNYEKNP 293
Query: 236 ----KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLE 291
K+ K++LLGE SI +++YLDN ++Y+ S GDSQLIKLN +PD GS++ VL+
Sbjct: 294 DGTFKIKDPKVDLLGEISIPESLTYLDNKIIYVASRVGDSQLIKLNKKPDQFGSHITVLD 353
Query: 292 RYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMW 351
++NLGPIVD CV+DLERQGQGQVVTCSGAYK+GSLRI+RNGIGI E A+++L GIKGMW
Sbjct: 354 TFMNLGPIVDMCVIDLERQGQGQVVTCSGAYKEGSLRIIRNGIGIQEVATIDLVGIKGMW 413
Query: 352 SLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQ 411
LR +TD D LV+SF+ +R+LA + E E+EE ++EGF S+ QT +C + N++VQ
Sbjct: 414 PLRITTDSLLDDTLVLSFVGHSRVLAYSGE-EVEEIDLEGFQSELQTFYCGNTSDNKMVQ 472
Query: 412 VTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEV 471
+TS SVRL+ S+ L +EW P G S+NV + N Q + ATG L Y+EIG + +
Sbjct: 473 ITSASVRLICLESKCLVSEWNVPDGKSINVVSCNGHQAVCATGNS-LYYIEIGTDKVDQK 531
Query: 472 KHAQLEYEISCLDINPIGENPSY--SQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEII 529
LE+E+SCLD+ +N Y S + AVG+W DISV+I LPD + +E LG EII
Sbjct: 532 GFITLEHEVSCLDVCSF-KNEFYKNSSLVAVGLWMDISVKILQLPDFVELVREPLGEEII 590
Query: 530 PRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTT 589
PRS+L+ FE I YLLCALGDG L F LN + G L+D++KV+LGTQP +R F S TT
Sbjct: 591 PRSILMVTFENIDYLLCALGDGSLCYFHLNPENGVLSDKRKVNLGTQPTLIRKFQSLATT 650
Query: 590 HVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDI 649
VFA SD PTVIYSSN KL++SNVNL++V+HMC N+ ++PDSLA+A + + IGTID++
Sbjct: 651 SVFACSDHPTVIYSSNNKLIFSNVNLRKVNHMCSLNTKSYPDSLAMATDTAIIIGTIDEM 710
Query: 650 QKLHIRSIPLGEHPRRICHQEQSRTFAICSLK---------------------------- 681
QKLHIR+IPLGE PRRI HQE S++F I +++
Sbjct: 711 QKLHIRTIPLGEAPRRIAHQESSKSFGIITMRIDVHEGINLVPARPSASTSAQNISGAIN 770
Query: 682 -----NQSCAE------------ESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSF 724
N S E E+H + +LD TFE + + L++ EY SI+S
Sbjct: 771 NRMPNNASAVNSNQGPLSSEYGLEVEIHNMLVLDQNTFEVLHAHQLNSNEYALSIISAKL 830
Query: 725 SDDSNVYYCVGTAYVLPEENEPTKGRILVFIVED--GKLQLIAEKETKGAVYSLNAFNGK 782
DD YY +GTA V PE+ +P GRIL+F +D KL I EKE KGA Y + FNGK
Sbjct: 831 GDDPATYYILGTAVVNPEDQDPKLGRILIFHWDDSSSKLTPITEKEVKGACYGMAEFNGK 890
Query: 783 LLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIY 842
LLAA+N ++L++W +EL+ EC H +I+AL+V+T+GDFIV GDLM+S++LL Y
Sbjct: 891 LLAAVNCTVRLFEWT----AEKELRLECSHFNNIVALFVKTKGDFIVCGDLMRSLTLLQY 946
Query: 843 KHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVV 902
K EG+ EE ARDYN W +A+EI+DDD++LGAEN+ NLF + K+S +DE R +L+ +
Sbjct: 947 KTMEGSFEEIARDYNPKWSTAIEIIDDDVFLGAENDKNLFIIHKDSTLTSDEARHQLQEI 1006
Query: 903 GEYHLGEFVNRFRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQ 961
G++H G+ +N FRHGSLVM+ D+ V +++GT +G +G++ L + + FL L+
Sbjct: 1007 GQFHCGDLINVFRHGSLVMQHFTDTYVSVQGGILYGTCSGALGLVTQLTPKMFDFLSDLE 1066
Query: 962 TNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRM---------- 1011
+L V+KGVG +NH+ WRS++ E +T +++F+DGDLIESFLDLS+ M
Sbjct: 1067 KSLATVVKGVGKINHQFWRSYHTEIRTEPSESFVDGDLIESFLDLSKREMIAVVDALQGA 1126
Query: 1012 -----DEISKTMNVSVEELCKRVEELTRLH 1036
+I K +S++++ K VE+LTR+H
Sbjct: 1127 YDHEFKKIPKDTKISLDDVIKLVEDLTRIH 1156
>gi|307186138|gb|EFN71863.1| DNA damage-binding protein 1 [Camponotus floridanus]
Length = 1136
Score = 1025 bits (2649), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 525/1092 (48%), Positives = 728/1092 (66%), Gaps = 67/1092 (6%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
+V IYG+IA ++ FRP E +D LF+ T Y +L+ + E E+IT+A G+V+DRIG
Sbjct: 54 EVGIYGKIAVIKFFRPPHEKKDLLFLLTTSYNAMILECMGEGEDIEIITKAHGNVADRIG 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
+ ++ G +IDP R+IGL LYDGLFK+IP D +A +IR++E QV D+ FL+GC
Sbjct: 114 KASETGIKAVIDPKARVIGLRLYDGLFKIIPLDKDNPELKASSIRMDEQQVQDVNFLHGC 173
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PT+++++QD + RHVKT+E+ L++K+F + PW Q+N++ A ++IPVP P+CG +II
Sbjct: 174 TNPTLILIHQD-INGRHVKTHEINLREKEFSKIPWRQDNVEREAMMVIPVPSPICGAIII 232
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEK- 236
G+E+I+Y + A+ I+ S Y +VD G RYLLGD AG L +L + EK+
Sbjct: 233 GQESILYHDGTTYVAVVPPIIKQSTITCYAKVDNQGLRYLLGDMAGHLFMLFLELEKKPD 292
Query: 237 ----VTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
V LK+ELLGE SI I+YLDN V+Y+GS GDSQLIKL + D GSY +E
Sbjct: 293 GTQVVKDLKVELLGEISIPECITYLDNGVIYVGSRLGDSQLIKLITKADENGSYCVPMET 352
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
+ NL PIVD VVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI E AS++L GIKGMW+
Sbjct: 353 FTNLAPIVDMAVVDLERQGQGQMVTCSGAFKEGSLRIIRNGIGIQEHASIDLPGIKGMWA 412
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
L+ + FD LV+SF+ +TRIL +N E E+EET+I GF + QT + + +Q+
Sbjct: 413 LKVGGGN-FDNTLVLSFVGQTRILTLNGE-EVEETDIPGFVADEQTFHTGNVTNDLFIQI 470
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
T S RL+S S+ + +EW+ +++V N +QVL ATG L Y+EI +
Sbjct: 471 TPTSARLISHESKMVVSEWEPQNKRTISVVACNGTQVLCATGN-DLFYMEISCNQIVPKG 529
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
A L++E++CLDI+P+ + + ++I AVG+WTDISVRI +LP L I KE LGGEIIPRS
Sbjct: 530 FATLQHEVACLDISPL-DGVNEAKIVAVGLWTDISVRILTLPGLEEINKELLGGEIIPRS 588
Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
+L+ FEG +YLLCALGDG + F L + G L+D+K+V+LGTQP LRTF S +TT+VF
Sbjct: 589 ILMTCFEGNTYLLCALGDGSMYYFTLYKQNGVLSDKKRVTLGTQPTVLRTFRSLSTTNVF 648
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
A SDRPTVIYSSN KL++SNVNLKEV+HMC N+ ++PDSLA+A + +TIGTID+IQKL
Sbjct: 649 ACSDRPTVIYSSNHKLVFSNVNLKEVNHMCSLNAESYPDSLALATDSTVTIGTIDEIQKL 708
Query: 653 HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
HIR++PLGE PRRI +QE S+TF + +++
Sbjct: 709 HIRTVPLGESPRRIAYQESSQTFGVITMRVDVQESSGVSIVRHSASTQAASLSSGIASHN 768
Query: 682 ------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVG 735
+E E+H + ++D TFE + + L EY S++S +DS YY VG
Sbjct: 769 KPSGHTASEIGQEIEVHNLLIIDQHTFEVLHAHTLMPTEYALSLISTRLGEDSTSYYVVG 828
Query: 736 TAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYK 795
TA++ P+E EP GRIL+F DGKL +AEKE KG+ YSL FNGKLLA+IN ++L++
Sbjct: 829 TAFINPDETEPKMGRILLFHWSDGKLSQVAEKEIKGSCYSLVEFNGKLLASINSTVRLFE 888
Query: 796 WMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARD 855
W +EL+ EC H +I+ALY++T+ DF++VGDLM+S++LL YK EG+ EE ARD
Sbjct: 889 WT----AEKELRLECSHFNNIIALYLKTKSDFVLVGDLMRSLTLLQYKTMEGSFEEIARD 944
Query: 856 YNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFR 915
YN NWM+++EILDDD +LGAEN FNLF +K+S +++ER +++ VG++HLG+ VN FR
Sbjct: 945 YNPNWMTSIEILDDDTFLGAENCFNLFICQKDSAATSEDERQQMQEVGQFHLGDMVNVFR 1004
Query: 916 HGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGL 974
HGSLVM+ L +S V+FGTV+G IG++ +P Y FL L+ L VIK VG +
Sbjct: 1005 HGSLVMQNLGESSTPTQGCVLFGTVSGAIGLVTQIPFGFYEFLRNLEDKLTSVIKSVGKI 1064
Query: 975 NHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTM----------NVSVEE 1024
H WRSF + K + F+DGDLIESFLDLS +M E++ + +V++
Sbjct: 1065 EHNFWRSFKTDLKIEQCEGFIDGDLIESFLDLSHDKMAEVAMGLMMDDGSGMKKEATVDD 1124
Query: 1025 LCKRVEELTRLH 1036
L K VE+LTR+H
Sbjct: 1125 LVKIVEDLTRIH 1136
>gi|432089478|gb|ELK23419.1| DNA damage-binding protein 1 [Myotis davidii]
Length = 1047
Score = 1008 bits (2606), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 501/942 (53%), Positives = 661/942 (70%), Gaps = 56/942 (5%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
+V +YG+IA +ELFRP GE++D LFI T +Y C+L++ ES ++ITRA G+V DRIG
Sbjct: 54 EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RP++ G IGIIDP+CR+IGL LYDGLFKVIP D + +AFNIRLEEL V+D+KFLYGC
Sbjct: 114 RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 173
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PTI +YQD + RHVKTYEV+L++K+F +GPW Q N++ A ++I VP P G +II
Sbjct: 174 QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
G+E+I Y + + + AI I+ S + RVD +GSRYLLGD G L +L++ E++
Sbjct: 233 GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 292
Query: 236 ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
+ L++ELLGETSIA ++YLDN VV++GS GDSQL+KLN+ + +GSYV +E
Sbjct: 293 GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 352
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
+ NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W
Sbjct: 353 FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
LRS D LV+SF+ +TR+L +N E E+EETE+ GF + QT FC + + QL+Q+
Sbjct: 413 LRSDPTRETDDTLVLSFVGQTRVLMLNGE-EVEETELTGFVDEQQTFFCGNVAHQQLIQI 471
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
TS SVRLVS + L +EWK P G +++VA+ N+SQV++A G L L+I L ++
Sbjct: 472 TSASVRLVSQDPKALVSEWKEPQGKNISVASCNSSQVVVAVGRA-LYCLQIHPQELRQIS 530
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
H ++E+E++CLDI P+G++ S + A+G+WTDIS RI LP L+ KE LGGEIIPRS
Sbjct: 531 HTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 590
Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
+L+ FE YLLCALGDG L F LN++TG L+DRKKV+LGTQP LRTF S +TT+VF
Sbjct: 591 ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
A SDRPTVIYSSN KL++SNVNLKEV++MCP NS +PDSLA+A LTIGTID+IQKL
Sbjct: 651 ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 710
Query: 653 HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
HIR++PL E PR+IC+QE S+ F + S +
Sbjct: 711 HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDSSGGTTALRPSASTQALSSSVSSSKL 770
Query: 682 --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
S EE E+H + ++D TFE + + EY S++SC D N Y+
Sbjct: 771 FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 830
Query: 734 VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
VGTA V PEE EP +GRI+VF DGKLQ +AEKE KGAVYS+ FNGKLLA+IN ++L
Sbjct: 831 VGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 890
Query: 794 YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
Y+W +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK EG EE A
Sbjct: 891 YEWT----AEKELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 946
Query: 854 RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEE 895
RD+N NWMSAVEILDDD +LGAEN FNLF +K+ T+ +
Sbjct: 947 RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDRSFHTERK 988
Score = 60.5 bits (145), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 31/67 (46%), Positives = 42/67 (62%), Gaps = 10/67 (14%)
Query: 980 RSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN----------VSVEELCKRV 1029
RSF+ E+KT A F+DGDLIESFLD+SR +M E+ + + ++L K V
Sbjct: 981 RSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATADDLIKVV 1040
Query: 1030 EELTRLH 1036
EELTR+H
Sbjct: 1041 EELTRIH 1047
>gi|16197726|emb|CAC94909.1| damaged-DNA recognition protein 1 [Mus musculus]
Length = 994
Score = 1008 bits (2605), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 499/935 (53%), Positives = 658/935 (70%), Gaps = 56/935 (5%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
+V +YG+IA +ELFRP GE++D LFI T +Y C+L++ ES ++ITRA G+V DRIG
Sbjct: 54 EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RP++ G IGIIDP+CR+IGL LYDGLFKVIP D + +AFNIRLEEL V+D+KFLYGC
Sbjct: 114 RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 173
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PTI +YQD + RHVKTYEV+L++K+F +GPW Q N++ A ++I VP P G +II
Sbjct: 174 QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
G+E+I Y + + + AI I+ S + RVD +GSRYLLGD G L +L++ E++
Sbjct: 233 GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 292
Query: 236 ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
+ L++ELLGETSIA ++YLDN VV++GS GDSQL+KLN+ + +GSYV +E
Sbjct: 293 GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 352
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
+ NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W
Sbjct: 353 FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
LRS + D LV+SF+ +TR+L +N E E+EETE+ GF QT FC + + QL+Q+
Sbjct: 413 LRSDPNRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQI 471
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
TS SVRLVS + L +EWK P +++VA+ N+SQV++A G L YL+I L ++
Sbjct: 472 TSASVRLVSQEPKALVSEWKEPRAKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 530
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
H ++E+E++CLDI P+G++ S + A+G+WTDIS RI LP L+ KE LGGEIIPRS
Sbjct: 531 HTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 590
Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
+L+ FE YLLCALGDG L F LN++TG L+DRKK++LGTQP LRTF S +TT+VF
Sbjct: 591 ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKMTLGTQPTVLRTFRSLSTTNVF 650
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
A SDRPTVIYSSN KL++SNVN KEV++MCP NS +PDSLA+A LTIGTID+IQKL
Sbjct: 651 ACSDRPTVIYSSNHKLVFSNVNFKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 710
Query: 653 HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
HIR++PL E PR+IC+QE S+ F + S +
Sbjct: 711 HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGTTALRPSASTQALSSSVSSSKL 770
Query: 682 --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
S EE E+H + ++D TFE + + EY S++SC D N Y+
Sbjct: 771 FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 830
Query: 734 VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
VGTA V PEE EP +GRI+VF DGKLQ +AEKE KGAVYS+ FNGKLLA+IN ++L
Sbjct: 831 VGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 890
Query: 794 YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
Y+W +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK EG EE A
Sbjct: 891 YEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 946
Query: 854 RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNS 888
RD+N NWMSAVEILDDD +LGAEN FNLF +K+S
Sbjct: 947 RDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDS 981
>gi|357623954|gb|EHJ74904.1| putative DNA repair protein xp-e [Danaus plexippus]
Length = 1128
Score = 998 bits (2579), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 511/1090 (46%), Positives = 719/1090 (65%), Gaps = 67/1090 (6%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESS---ELITRAMGDVSD 57
M +V +YGR+A ++LFRP E +D +FI T RY +L+W ++ E++TRA G+V+D
Sbjct: 52 MKEVGLYGRVAKMKLFRPPYEQKDLVFILTARYNAMILEWRTGANGELEVVTRAHGNVAD 111
Query: 58 RIGRPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
RIG+P++NG + +IDP R+IGL LYDGLFK+IP D +A ++RLEEL V D++FL
Sbjct: 112 RIGKPSENGILAVIDPQARVIGLRLYDGLFKIIPLDKDSTELKAASLRLEELNVYDLEFL 171
Query: 118 YGCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGV 177
+GC+ PT+++++QD + RH+KT+E+ L+DK+F++ PW Q+N++ A +LIPVP PL G
Sbjct: 172 HGCSNPTLILIHQD-LNGRHIKTHEINLRDKEFMKIPWKQDNVETEASILIPVPSPLGGA 230
Query: 178 LIIGEETIVYCSANAFKAI--PIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE 235
++IG+E+IVY ++ A+ P + Y RVD G RYLLGD AG L +L++ +
Sbjct: 231 IVIGQESIVYHDGQSYVAVAPPQIKTPINCYCRVDVRGLRYLLGDIAGRLFMLLLELSER 290
Query: 236 ----KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLE 291
V LK+ELLG+ I ++YLDN VV++GS GDS L++L D YV+ +E
Sbjct: 291 DGTASVRDLKVELLGDIPIPECMTYLDNGVVFVGSRLGDSALVRLAAVRDDASQYVQPME 350
Query: 292 RYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMW 351
+ +L PIVD CVVDLERQGQ Q++TCSGA+K GSLRI+RNGIGI EQAS++L GIKGMW
Sbjct: 351 TFTSLAPIVDMCVVDLERQGQNQLITCSGAFKMGSLRIIRNGIGIQEQASIDLPGIKGMW 410
Query: 352 SLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQ 411
+L DT LV+SF+ +TR+L +N E E+EETEI+GF S QT F + ++QL+Q
Sbjct: 411 ALTLGQGPHHDT-LVLSFVGQTRVLTLNGE-EVEETEIKGFVSDRQTFFTGNVCHDQLIQ 468
Query: 412 VTSGSVRLVSSTSRELRN--EWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILT 469
VT +RL+ W +P G +V+V + ++ + A G + + I G L
Sbjct: 469 VTDEGIRLIGRGPGGWNGVAAW-APAGRAVSVVSCGETRAV-AAAGLRIYLVAIKQGALE 526
Query: 470 EVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEII 529
+ + E++CLD+ P GE + VG+WTDISVR+ LPDL + E L GEII
Sbjct: 527 LISEVCMNEEVACLDLGPGGEE----ALLGVGLWTDISVRVLKLPDLRPLHTEKLSGEII 582
Query: 530 PRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTT 589
PRS+L+C EG+ YLLCALGDG + F ++ +G LT++KKV+LGTQP LR+F S +TT
Sbjct: 583 PRSLLICVLEGVCYLLCALGDGSMFYFTVDPDSGVLTNKKKVTLGTQPTVLRSFRSLSTT 642
Query: 590 HVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDI 649
++FA SDRPTVI+SSN KL++SNVNLKEV+HMC N+ A+PDSLA+A + +TIGTID+I
Sbjct: 643 NIFACSDRPTVIFSSNHKLVFSNVNLKEVAHMCSLNAVAYPDSLALATDSTVTIGTIDEI 702
Query: 650 QKLHIRSIPLGEHPRRICHQEQSRTFAICSL----------------------------- 680
QKLHIR++PLGE PRRI +QE S+TF + ++
Sbjct: 703 QKLHIRTVPLGETPRRIAYQEASQTFGVITMRVDKVEWTGGCGSLVRPSASTAAASASAA 762
Query: 681 ----KNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGT 736
K+ + E+H + +LD TFE + + L E+ S++SC +DD N YY VGT
Sbjct: 763 APPSKHAPAPLDLELHNLLILDHHTFEVLHAHQLLANEFAMSLVSCKLADDPNHYYAVGT 822
Query: 737 AYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKW 796
A + PEE+EP +GRIL+F +GKL +AEKE KG Y+L FNGKLLA+IN ++L++W
Sbjct: 823 AILNPEESEPKQGRILLFHWCEGKLTQVAEKEIKGGCYTLVEFNGKLLASINSTVRLFEW 882
Query: 797 MLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDY 856
+EL+ EC H +I+ALY++ +GDFI+VGDLM+S+SLL YK EG+ EE ARDY
Sbjct: 883 T----SEKELRLECSHFNNIVALYLKVKGDFILVGDLMRSMSLLQYKQMEGSFEEIARDY 938
Query: 857 NANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRH 916
+ NWM+AVEILDDD +LGAEN+FNLF +K+S TDEER ++ +G++H+G+ VN R
Sbjct: 939 SPNWMTAVEILDDDTFLGAENSFNLFVCQKDSAATTDEERQQMGYMGQFHVGDMVNVMRR 998
Query: 917 GSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNH 976
G+LV +L D+ V+ TV+G I ++ L E + FL +L+ L IK VG + H
Sbjct: 999 GALVAQLADTAAPVARPVLLATVSGAICLVVQLSQELFDFLHQLEERLTHTIKSVGKIPH 1058
Query: 977 EQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV----------SVEELC 1026
WRSFN + KT A+ F+DGDLIESFLDLSR E + + + +V++L
Sbjct: 1059 SFWRSFNTDIKTEPAEGFIDGDLIESFLDLSRDMQQETLQGLQIDDGGGMMRDATVDDLI 1118
Query: 1027 KRVEELTRLH 1036
K VE+LTR+H
Sbjct: 1119 KIVEDLTRIH 1128
>gi|148709424|gb|EDL41370.1| damage specific DNA binding protein 1 [Mus musculus]
Length = 968
Score = 993 bits (2566), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 492/922 (53%), Positives = 648/922 (70%), Gaps = 56/922 (6%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
+V +YG+IA +ELFRP GE++D LFI T +Y C+L++ ES ++ITRA G+V DRIG
Sbjct: 54 EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RP++ G IGIIDP+CR+IGL LYDGLFKVIP D + +AFNIRLEEL V+D+KFLYGC
Sbjct: 114 RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 173
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PTI +YQD + RHVKTYEV+L++K+F +GPW Q N++ A ++I VP P G +II
Sbjct: 174 QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
G+E+I Y + + + AI I+ S + RVD +GSRYLLGD G L +L++ E++
Sbjct: 233 GQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMD 292
Query: 236 ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
+ L++ELLGETSIA ++YLDN VV++GS GDSQL+KLN+ + +GSYV +E
Sbjct: 293 GTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMET 352
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
+ NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W
Sbjct: 353 FTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWP 412
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
LRS D LV+SF+ +TR+L +N E E+EETE+ GF QT FC + + QL+Q+
Sbjct: 413 LRSDPGRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLIQI 471
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
TS SVRLVS + L +EWK P G +++VA+ N+SQV++A G L YL+I L ++
Sbjct: 472 TSASVRLVSQEPKALVSEWKEPQGKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQIS 530
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
H ++E+E++CLDI P+G++ S + A+G+WTDIS RI LP L+ KE LGGEIIPRS
Sbjct: 531 HTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRS 590
Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
+L+ FE YLLCALGDG L F LN++TG L+DRKKV+LGTQP LRTF S +TT+VF
Sbjct: 591 ILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVF 650
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
A SDRPTVIYSSN KL++SNVNLKEV++MCP NS +PDSLA+A LTIGTID+IQKL
Sbjct: 651 ACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKL 710
Query: 653 HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
HIR++PL E PR+IC+QE S+ F + S +
Sbjct: 711 HIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDSSGGTTALRPSASTQALSSSVSSSKL 770
Query: 682 --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
S EE E+H + ++D TFE + + EY S++SC D N Y+
Sbjct: 771 FSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFI 830
Query: 734 VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
VGTA V PEE EP +GRI+VF DGKLQ +AEKE KGAVYS+ FNGKLLA+IN ++L
Sbjct: 831 VGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRL 890
Query: 794 YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
Y+W +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK EG EE A
Sbjct: 891 YEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIA 946
Query: 854 RDYNANWMSAVEILDDDIYLGA 875
RD+N NWMSAVEILDDD + G+
Sbjct: 947 RDFNPNWMSAVEILDDDNFTGS 968
>gi|194901554|ref|XP_001980317.1| GG19434 [Drosophila erecta]
gi|190652020|gb|EDV49275.1| GG19434 [Drosophila erecta]
Length = 1140
Score = 968 bits (2502), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 511/1099 (46%), Positives = 710/1099 (64%), Gaps = 77/1099 (7%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDA--ESSELITRAMGDVSDRIG 60
++ I G IA + FRP +D LFI T RY +L+ ++ ++T+A G+VSD +G
Sbjct: 54 EININGTIAVMRHFRPPDSNKDLLFILTRRYNVMILEARMVNDAITVVTKANGNVSDSVG 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
P++ G I IDP R+IG+ LY GLF +IP D +A N+R++EL V D++FL+GC
Sbjct: 114 VPSEGGVIAAIDPKARVIGMCLYQGLFTIIPLDKDASELKATNLRMDELNVYDVEFLHGC 173
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PT++V+++DN D RHVK++E+ L++K+F++ W Q+N++ A +LIPVP P+ GV++I
Sbjct: 174 MNPTVIVIHKDN-DGRHVKSHEINLREKEFMKIAWKQDNVETEATMLIPVPSPIGGVIVI 232
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVI-THEKEK 236
G E+IVY + + A+ R S Y RV ++G RYLLG+ G L++L + T E K
Sbjct: 233 GRESIVYHDGSNYHAVAPLTFRQSTINCYARVSSNGLRYLLGNMDGQLYMLFLGTAETSK 292
Query: 237 ---VTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERY 293
V +K+E LGE SI I+YLDN +YIG+ +GDSQL++LN + GSYV +E +
Sbjct: 293 GVTVKDIKVEQLGEISIPECITYLDNGFLYIGARHGDSQLVRLNSEA-IDGSYVVPVENF 351
Query: 294 VNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL 353
NL PI+D VVDL+RQGQGQ++TCSG++KDGSLRI+R GIGI E A ++L GIKGMWSL
Sbjct: 352 TNLAPILDIAVVDLDRQGQGQIITCSGSFKDGSLRIIRIGIGIQEHACIDLPGIKGMWSL 411
Query: 354 RSSTDD-PFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
+ D+ P++ LV++F+ TRIL ++ E E+EETEI GF S QT C + Y+QL+QV
Sbjct: 412 KVGVDESPYENTLVLAFVGHTRILTLSGE-EVEETEIPGFASDLQTFLCSNVDYDQLIQV 470
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
TS SVRLVSS ++ L EW+ S+ V + N +Q+++A+ L Y+ I DG L E
Sbjct: 471 TSDSVRLVSSATKALVAEWRPTGDRSIGVVSCNTTQIVVASAC-DLFYIVIEDGTLREQS 529
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
LEYE++CLDI P+ E S + AVG+WTDIS I SLPDL I E L GEIIPRS
Sbjct: 530 RRTLEYEVACLDITPLDETQKKSDMVAVGLWTDISAVIMSLPDLETIYTEKLSGEIIPRS 589
Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
+L+ FEGI YLLCALGDG + F+++ TG+LTD+KKV+LGTQP TLRTF S +TT+VF
Sbjct: 590 ILMTTFEGIHYLLCALGDGSMYYFIMDQTTGQLTDKKKVTLGTQPTTLRTFRSLSTTNVF 649
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
A SDRPTVIYSSN KL++SNVNLKEV+HMC N+ A+PDSLA+A + + +GTID+IQKL
Sbjct: 650 ACSDRPTVIYSSNHKLVFSNVNLKEVNHMCSLNAQAYPDSLALANKNAVILGTIDEIQKL 709
Query: 653 HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
HIR++PLGE PRRI +QE S+TFA+ +L+
Sbjct: 710 HIRTVPLGEGPRRIAYQEASQTFAVSTLRIDVHGRGGAKPLRNSASTQAQNITCSSNFLP 769
Query: 682 ----------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVY 731
N +E ++H + ++D TFE + + + E S++S DD N Y
Sbjct: 770 KPGGGNSTAANAEVGQEIDVHNLLVIDQNTFEVLHAHQFVSPETISSLMSAKLGDDPNTY 829
Query: 732 YCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKI 791
Y V T+ V+PEE EP GRI++F + KL +AE + G Y+L FNGK+LA I +
Sbjct: 830 YVVATSLVIPEEPEPKVGRIIIFHYHENKLTQVAETKVDGTCYALVEFNGKVLAGIGSFV 889
Query: 792 QLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEE 851
+LY+W +EL+ EC I ALY++ +GDFI+VGDLM+SI+LL +K EG E
Sbjct: 890 RLYEWT----NEKELRMECNIQNMIAALYLKAKGDFILVGDLMRSITLLQHKQMEGIFVE 945
Query: 852 RARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFV 911
ARD WM AVEILDDD +LG+E N NLF +K+S TDEER L + +HLG+ V
Sbjct: 946 IARDCEPKWMRAVEILDDDTFLGSETNGNLFVCQKDSAATTDEERQLLPELARFHLGDTV 1005
Query: 912 NRFRHGSLVMRLPDSDVGQIPT-----VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRK 966
N FRHGSLVM+ +VG+ T V++GT NG IG++ +P + Y FL L+ L+K
Sbjct: 1006 NVFRHGSLVMQ----NVGERTTPINGCVLYGTCNGAIGIVTQIPQDFYDFLHGLEERLKK 1061
Query: 967 VIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVS----- 1021
+IK VG + H +R+F K ++ F+DGDLIESFLDL R +M + + + ++
Sbjct: 1062 IIKSVGKIEHTYYRNFQINTKVEPSEGFIDGDLIESFLDLGREKMRDAVQGLELTLNGER 1121
Query: 1022 ----VEELCKRVEELTRLH 1036
VE++ K VE+LTR+H
Sbjct: 1122 KSADVEDVIKIVEDLTRMH 1140
>gi|195500686|ref|XP_002097479.1| GE26244 [Drosophila yakuba]
gi|194183580|gb|EDW97191.1| GE26244 [Drosophila yakuba]
Length = 1140
Score = 965 bits (2495), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 511/1099 (46%), Positives = 712/1099 (64%), Gaps = 77/1099 (7%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDA--ESSELITRAMGDVSDRIG 60
++ I G IA + FRP +D +FI T RY +L+ ++ ++T+A G+VSD +G
Sbjct: 54 EININGTIAVMRHFRPPDSNKDLIFILTRRYNVMILEARMVNDAITVVTKANGNVSDSVG 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
P++ G + IDP R+IG+ LY GLF +IP D +A N+R++EL V D++FL+GC
Sbjct: 114 IPSEGGVMAAIDPKARVIGMCLYQGLFTIIPLDKDASELKATNLRMDELTVYDVEFLHGC 173
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PT++V+++DN D RHVK++E+ L++K+F++ W Q+N++ A +LIPVP P+ GV++I
Sbjct: 174 LNPTVIVIHKDN-DGRHVKSHEINLREKEFMKIAWKQDNVETEATMLIPVPSPIGGVIVI 232
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVI-THEKEK 236
G E+IVY + + A+ R S Y RV ++G RYLLG+ G L++L + T E K
Sbjct: 233 GRESIVYHDGSNYHAVAPLTFRQSTINCYARVSSNGLRYLLGNMDGQLYMLFLGTSETSK 292
Query: 237 ---VTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERY 293
V +K+E LGE SI I+YLDN +YIG+ +GDSQL++LN + GSYV +E +
Sbjct: 293 GVTVKDIKVEQLGEISIPECITYLDNGFLYIGARHGDSQLVRLNSEA-IDGSYVVPVENF 351
Query: 294 VNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL 353
NL PI+D VVDL+RQGQGQ++TCSG++KDGSLRI+R GIGI E A ++L GIKGMWSL
Sbjct: 352 TNLAPILDIAVVDLDRQGQGQIITCSGSFKDGSLRIIRIGIGIQEHACIDLPGIKGMWSL 411
Query: 354 RSSTDD-PFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
+ D+ P++ LV++F+ TRIL ++ E E+EETEI GF S QT C + Y+Q++QV
Sbjct: 412 KVGVDESPYENTLVLAFVGHTRILTLSGE-EVEETEIPGFASDLQTFLCSNVDYDQVIQV 470
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
TS SVRLVSS ++ L EW+ S+ V + N +Q+++A+ + Y+ I DG L E
Sbjct: 471 TSDSVRLVSSATKALVAEWRPTGDRSIGVVSCNTTQIVVASAC-DIFYIVIEDGSLREQS 529
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
LEYE++CLDI P+ E + S + AVG+WTDIS I SLPDL I E L GEIIPRS
Sbjct: 530 RRTLEYEVACLDITPLDETQTKSDLVAVGLWTDISAVIMSLPDLKTIYTEKLSGEIIPRS 589
Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
+L+ FEGI YLLCALGDG + F+++ TG+LTD+KKV+LGTQP TLRTF S +TT+VF
Sbjct: 590 ILMTTFEGIHYLLCALGDGSMYYFIMDQTTGQLTDKKKVTLGTQPTTLRTFRSLSTTNVF 649
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
A SDRPTVIYSSN KL++SNVNLKEV+HMC N+ A+PDSLA+A + + +GTID+IQKL
Sbjct: 650 ACSDRPTVIYSSNHKLVFSNVNLKEVNHMCSLNAQAYPDSLALANKNAVILGTIDEIQKL 709
Query: 653 HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
HIR++PLGE PRRI +QE S+TFA+ +L+
Sbjct: 710 HIRTVPLGEGPRRIAYQEASQTFAVSTLRIDVHGRGGAKPLRNSASTQAQNITCSSNFLP 769
Query: 682 ----------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVY 731
N +E ++H + ++D TFE + + + E S++S DD N Y
Sbjct: 770 KPGGGNSTAANAEVGQEIDVHNLLVIDQNTFEVLHAHHFVSPETISSLMSAKLGDDPNTY 829
Query: 732 YCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKI 791
Y V T+ V+PEE EP GRI++F + KL +AE + G Y+L FNGK+LA I +
Sbjct: 830 YVVATSLVIPEEPEPKVGRIIIFHYHENKLTQVAETKVDGTCYALVEFNGKVLAGIGSFV 889
Query: 792 QLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEE 851
+LY+W +EL+ EC I ALY++ +GDFI+VGDLM+SI+LL +K EG E
Sbjct: 890 RLYEWT----NEKELRMECNIQNMIAALYLKAKGDFILVGDLMRSITLLQHKQMEGIFVE 945
Query: 852 RARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFV 911
ARD WM AVEILDDD +LG+E N NLF +K+S TDEER L + +HLG+ V
Sbjct: 946 IARDCEPKWMRAVEILDDDTFLGSETNGNLFVCQKDSAATTDEERQLLPELARFHLGDTV 1005
Query: 912 NRFRHGSLVMRLPDSDVGQIPT-----VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRK 966
N FRHGSLVM+ +VG+ T V++GT NG IG++ +P + Y FL LQ L+K
Sbjct: 1006 NVFRHGSLVMQ----NVGERTTPINGCVLYGTCNGAIGIVTQIPQDFYDFLHGLQERLKK 1061
Query: 967 VIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDE----ISKTMN--- 1019
+IK VG + H +R+F K ++ F+DGDLIESFLDLSR +M + + T+N
Sbjct: 1062 IIKSVGKIEHTYYRNFQINNKVEPSEGFIDGDLIESFLDLSREKMRDSVQGLELTLNGER 1121
Query: 1020 --VSVEELCKRVEELTRLH 1036
VE++ K VE+LTR+H
Sbjct: 1122 KGADVEDVIKIVEDLTRMH 1140
>gi|21357503|ref|NP_650257.1| piccolo [Drosophila melanogaster]
gi|74872881|sp|Q9XYZ5.1|DDB1_DROME RecName: Full=DNA damage-binding protein 1; Short=D-DDB1; AltName:
Full=Damage-specific DNA-binding protein 1; AltName:
Full=Protein piccolo
gi|4928452|gb|AAD33592.1|AF132145_1 damage-specific DNA binding protein DDBa p127 subunit [Drosophila
melanogaster]
gi|7299719|gb|AAF54901.1| piccolo [Drosophila melanogaster]
gi|220942640|gb|ACL83863.1| DDB1-PA [synthetic construct]
Length = 1140
Score = 962 bits (2486), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 509/1099 (46%), Positives = 708/1099 (64%), Gaps = 77/1099 (7%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESS--ELITRAMGDVSDRIG 60
++ I G IA + FRP +D LFI T RY +L+ + ++T+A G+VSD +G
Sbjct: 54 EININGTIAVMRHFRPPDSNKDLLFILTRRYNVMILEARMVNDVITVVTKANGNVSDSVG 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
P++ G I IDP R+IG+ LY GLF +IP D +A N+R++EL V D++FL+GC
Sbjct: 114 IPSEGGVIAAIDPKARVIGMCLYQGLFTIIPMDKDASELKATNLRMDELNVYDVEFLHGC 173
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PT++V+++D+ D RHVK++E+ L+DK+F++ W Q+N++ A +LIPVP P+ GV++I
Sbjct: 174 LNPTVIVIHKDS-DGRHVKSHEINLRDKEFMKIAWKQDNVETEATMLIPVPSPIGGVIVI 232
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVI-THEKEK 236
G E+IVY + + A+ R S Y RV ++G RYLLG+ G L++L + T E K
Sbjct: 233 GRESIVYHDGSNYHAVAPLTFRQSTINCYARVSSNGLRYLLGNMDGQLYMLFLGTAETSK 292
Query: 237 ---VTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERY 293
V +K+E LGE SI I+YLDN +YIG+ +GDSQL++LN + GSYV +E +
Sbjct: 293 GVTVKDIKVEQLGEISIPECITYLDNGFLYIGARHGDSQLVRLNSEA-IDGSYVVPVENF 351
Query: 294 VNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL 353
NL PI+D VVDL+RQGQGQ++TCSG++KDGSLRI+R GIGI E A ++L GIKGMWSL
Sbjct: 352 TNLAPILDIAVVDLDRQGQGQIITCSGSFKDGSLRIIRIGIGIQEHACIDLPGIKGMWSL 411
Query: 354 RSSTDD-PFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
+ D+ P++ LV++F+ TRIL ++ E E+EETEI GF S QT C + Y+QL+QV
Sbjct: 412 KVGVDESPYENTLVLAFVGHTRILTLSGE-EVEETEIPGFASDLQTFLCSNVDYDQLIQV 470
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
TS SVRLVSS ++ L EW+ ++ V + N +Q+L+A+ + Y+ I DG L E
Sbjct: 471 TSDSVRLVSSATKALVAEWRPTGDRTIGVVSCNTTQILVASAC-DIFYIVIEDGSLREQS 529
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
L YE++CLDI P+ E S + AVG+WTDIS I SLPDL I E L GEIIPRS
Sbjct: 530 RRTLAYEVACLDITPLDETQKKSDLVAVGLWTDISAVILSLPDLETIYTEKLSGEIIPRS 589
Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
+L+ FEGI YLLCALGDG + F+++ TG+LTD+KKV+LGTQP TLRTF S +TT+VF
Sbjct: 590 ILMTTFEGIHYLLCALGDGSMYYFIMDQTTGQLTDKKKVTLGTQPTTLRTFRSLSTTNVF 649
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
A SDRPTVIYSSN KL++SNVNLKEV+HMC N+ A+PDSLA+A + + +GTID+IQKL
Sbjct: 650 ACSDRPTVIYSSNHKLVFSNVNLKEVNHMCSLNAQAYPDSLALANKNAVILGTIDEIQKL 709
Query: 653 HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
HIR++PLGE PRRI +QE S+TFA+ +L+
Sbjct: 710 HIRTVPLGEGPRRIAYQESSQTFAVSTLRIDVHGRGGAKPLRNSASTQAQNITCSSNFLP 769
Query: 682 ----------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVY 731
N +E ++H + ++D TFE + + E S++S DD N Y
Sbjct: 770 KPGGGNSTAANAEVGQEIDVHNLLVIDQNTFEVLHAHQFVAPETISSLMSAKLGDDPNTY 829
Query: 732 YCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKI 791
Y V T+ V+PEE EP GRI++F + KL +AE + G Y+L FNGK+LA I +
Sbjct: 830 YVVATSLVIPEEPEPKVGRIIIFHYHENKLTQVAETKVDGTCYALVEFNGKVLAGIGSFV 889
Query: 792 QLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEE 851
+LY+W +EL+ EC I AL+++ +GDFI+VGDLM+SI+LL +K EG E
Sbjct: 890 RLYEWT----NEKELRMECNIQNMIAALFLKAKGDFILVGDLMRSITLLQHKQMEGIFVE 945
Query: 852 RARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFV 911
ARD WM AVEILDDD +LG+E N NLF +K+S TDEER L + +HLG+ V
Sbjct: 946 IARDCEPKWMRAVEILDDDTFLGSETNGNLFVCQKDSAATTDEERQLLPELARFHLGDTV 1005
Query: 912 NRFRHGSLVMRLPDSDVGQIPT-----VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRK 966
N FRHGSLVM+ +VG+ T V++GT NG IG++ +P + Y FL L+ L+K
Sbjct: 1006 NVFRHGSLVMQ----NVGERTTPINGCVLYGTCNGAIGIVTQIPQDFYDFLHGLEERLKK 1061
Query: 967 VIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVS----- 1021
+IK VG + H +R+F K ++ F+DGDLIESFLDLSR +M + + + ++
Sbjct: 1062 IIKSVGKIEHTYYRNFQINSKVEPSEGFIDGDLIESFLDLSRDKMRDAVQGLELTLNGER 1121
Query: 1022 ----VEELCKRVEELTRLH 1036
VE++ K VE+LTR+H
Sbjct: 1122 KSADVEDVIKIVEDLTRMH 1140
>gi|195449948|ref|XP_002072297.1| GK22405 [Drosophila willistoni]
gi|194168382|gb|EDW83283.1| GK22405 [Drosophila willistoni]
Length = 1140
Score = 961 bits (2485), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 510/1099 (46%), Positives = 712/1099 (64%), Gaps = 77/1099 (7%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWD--AESSELITRAMGDVSDRIG 60
++ I G IA ++ FRP +D LFI T RY +L+ ++ +IT+A G+VSD +G
Sbjct: 54 EININGTIAVMKHFRPPDSNKDLLFILTRRYNVMILEARMVGDTITVITKANGNVSDSVG 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
P++ G I IDP R+IG+ LY GLF +IP + +A N+R++EL V D++FL+GC
Sbjct: 114 IPSEGGVIAAIDPKARVIGMCLYQGLFTIIPMEKDASELKATNLRMDELMVYDVEFLHGC 173
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PT++V+++DN D RHVK++E+ L+DK+F++ W Q+N++ A +LIPVP P+ GV++I
Sbjct: 174 LNPTVIVIHKDN-DGRHVKSHEINLRDKEFMKIAWKQDNVETEATMLIPVPSPIGGVIVI 232
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVI-THEKEK 236
G E+IVY + + A+ R S Y RVD+ G RYLLG+ G L++L + T E K
Sbjct: 233 GRESIVYHDGSNYHAVAPLTFRQSTINCYARVDSKGLRYLLGNMHGQLYMLFLGTSESSK 292
Query: 237 ---VTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERY 293
V +K+E LGE SI I+YLDN +YIG+ +GDSQL++L+ + +GSYV +E +
Sbjct: 293 GITVKDIKVEQLGEISIPECITYLDNGFLYIGARHGDSQLVRLSSEA-IEGSYVIPVENF 351
Query: 294 VNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL 353
NL PI+D VVDL+RQGQGQ++TCSG++KDGSLRI+R GIGI E A ++L GIKGMWSL
Sbjct: 352 TNLAPILDIAVVDLDRQGQGQIITCSGSFKDGSLRIIRIGIGIQEHACIDLPGIKGMWSL 411
Query: 354 RSSTDD-PFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
+ TDD P++ LV++F+ TRIL ++ E E+EETEI GF S QT C + ++QL+QV
Sbjct: 412 KVGTDDSPYENTLVLAFVGHTRILTLSGE-EVEETEIPGFASDLQTFLCANVEHDQLIQV 470
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
TS SVRLV S+S+ L EW+ S+ V + N +Q+++A+ + Y+ I DG L E
Sbjct: 471 TSDSVRLVKSSSKSLVAEWRPEGDRSIGVVSCNGTQIVVASAR-DIFYIVIEDGNLQEKC 529
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
L YE++CLDI P+ E + S++ AVG+WTDIS + SLP L+ I E L GEIIPRS
Sbjct: 530 RKTLAYEVACLDITPLDEKQNKSELVAVGLWTDISAVMLSLPSLDTIYTEKLSGEIIPRS 589
Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
+L+ FEGI YLLCALGDG + F+++ TG+LTD+KKV+LGTQP TLRTF S TT+VF
Sbjct: 590 ILMTTFEGIHYLLCALGDGSMYYFIMDPITGQLTDKKKVTLGTQPTTLRTFRSFATTNVF 649
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
A SDRPTVIYSSN KL++SNVNLKEV+HMC N+ A+PDSLA+A + + +GTID+IQKL
Sbjct: 650 ACSDRPTVIYSSNHKLVFSNVNLKEVNHMCSLNAQAYPDSLALATKNSVILGTIDEIQKL 709
Query: 653 HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
HIR++PLGE PRRI +QE S+TFA+ +L+
Sbjct: 710 HIRTVPLGEGPRRIAYQEASQTFAVSTLRIDVHGRGGPKPLRSSASTRAQNITCSSNILP 769
Query: 682 ----------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVY 731
N +E ++H + ++D TFE + + + E +++S DD N Y
Sbjct: 770 KPGGGNSTAANAEVGQEIDVHNLLVIDQNTFEVLHAHQFVSPETISALMSAKLGDDPNTY 829
Query: 732 YCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKI 791
Y V T+ V+P+E EP GRI++F D KL +AE + G Y+L FNGK+LA I +
Sbjct: 830 YVVATSLVIPDEPEPKVGRIIIFHYHDNKLTQVAETKVDGTCYALVEFNGKVLAGIGSFV 889
Query: 792 QLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEE 851
+LY+W +EL+ EC I ALY++ +GDFI+VGDLM+SI+LL +K EG E
Sbjct: 890 RLYEWT----NEKELRMECNIQNMIAALYLKAKGDFILVGDLMRSITLLQHKQMEGIFVE 945
Query: 852 RARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFV 911
ARD WM AVEILDDD +LG+E N NLF +K+S TDEER L + +HLG+ V
Sbjct: 946 IARDCEPKWMRAVEILDDDTFLGSETNGNLFVCQKDSAATTDEERQLLPELARFHLGDTV 1005
Query: 912 NRFRHGSLVMRLPDSDVGQIPT-----VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRK 966
N FRHGSLVM+ +VG+ T V++GT NG IG++ +P + Y FL L+ L+K
Sbjct: 1006 NVFRHGSLVMQ----NVGERTTPINGCVLYGTCNGAIGIVTQIPQDFYDFLHGLEERLKK 1061
Query: 967 VIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEI---------SKT 1017
+IK VG + H +R+F K ++ F+DGDLIESFLDLSR +M E +
Sbjct: 1062 IIKSVGKIEHTYYRNFQINTKVEPSEGFIDGDLIESFLDLSRDKMREAVLGLELTLYGER 1121
Query: 1018 MNVSVEELCKRVEELTRLH 1036
+ VE++ K VE+LTR+H
Sbjct: 1122 KSADVEDVIKIVEDLTRMH 1140
>gi|195329354|ref|XP_002031376.1| GM24084 [Drosophila sechellia]
gi|194120319|gb|EDW42362.1| GM24084 [Drosophila sechellia]
Length = 1140
Score = 961 bits (2484), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 510/1099 (46%), Positives = 709/1099 (64%), Gaps = 77/1099 (7%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESS--ELITRAMGDVSDRIG 60
++ I G IA + FRP +D LFI T RY +L+ + ++T+A G+VSD +G
Sbjct: 54 EININGTIAVMRHFRPPDSNKDLLFILTRRYNVMILEARMVNDVITVVTKANGNVSDSVG 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
P++ G I IDP R+IG+ LY GLF +IP D +A N+R++EL V D++FL+GC
Sbjct: 114 IPSEGGVIAAIDPKARVIGMCLYQGLFTIIPMDKDASELKATNLRMDELNVYDVEFLHGC 173
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PT++V+++DN D RHVK++E+ L+DK+F++ W Q+N++ A +LIPVP P+ GV++I
Sbjct: 174 LNPTVIVIHKDN-DGRHVKSHEINLRDKEFMKIAWKQDNVETEATMLIPVPSPIGGVIVI 232
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVI-THEKEK 236
G E+IVY + + A+ R S Y RV ++G RYLLG+ G L++L + T E K
Sbjct: 233 GRESIVYHDGSNYHAVAPLTFRQSTINCYARVSSNGLRYLLGNMDGQLYMLFLGTAETSK 292
Query: 237 ---VTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERY 293
V +K+E LGE SI I+YLDN +YIG+ +GDSQL++LN + GSYV +E +
Sbjct: 293 GVTVKDIKVEQLGEISIPECITYLDNGFLYIGARHGDSQLVRLNSEA-IDGSYVVPVENF 351
Query: 294 VNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL 353
NL PI+D VVDL+RQGQGQ++TCSG++KDGSLRI+R GIGI E A ++L GIKGMWSL
Sbjct: 352 TNLAPILDIAVVDLDRQGQGQIITCSGSFKDGSLRIIRIGIGIQEHACIDLPGIKGMWSL 411
Query: 354 RSSTDDP-FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
+ D+ ++ LV++F+ TRIL ++ E E+EETEI GF S QT C + Y+QL+QV
Sbjct: 412 KVGVDESSYENTLVLAFVGHTRILTLSGE-EVEETEIPGFASDLQTFLCSNVDYDQLIQV 470
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
TS SVRLVSS ++ L EW+ S+ V + N +Q+L+A+ + Y+ I DG L E
Sbjct: 471 TSDSVRLVSSATKALVAEWRPTGDRSIGVVSCNTTQILVASAC-DIFYIVIEDGSLREQS 529
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
L YE++CLDI P+ E S + AVG+WTDIS I SLPDL I E L GEIIPRS
Sbjct: 530 RRTLAYEVACLDITPLDETQKKSDLVAVGLWTDISAVILSLPDLETIYTEKLSGEIIPRS 589
Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
+L+ FEGI YLLCALGDG + F+++ TG+LTD+KKV+LGTQP TLRTF S +TT+VF
Sbjct: 590 ILMTTFEGIHYLLCALGDGSMYYFIMDQTTGQLTDKKKVTLGTQPTTLRTFRSLSTTNVF 649
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
A SDRPTVIYSSN KL++SNVNLKEV+HMC N+ A+PDSLA+A + + +GTID+IQKL
Sbjct: 650 ACSDRPTVIYSSNHKLVFSNVNLKEVNHMCSLNAQAYPDSLALANKNAVILGTIDEIQKL 709
Query: 653 HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
HIR++PLGE PRRI +QE S+TFA+ +L+
Sbjct: 710 HIRTVPLGEGPRRIAYQESSQTFAVSTLRIDVHGRGGAKPLRNSASTQAQNITCSSNFLP 769
Query: 682 ----------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVY 731
N +E ++H + ++D TFE + + + E S++S DD N Y
Sbjct: 770 KPGGGNSTAANAEVGQEIDVHNLLVIDQNTFEVLHAHQFVSPETISSLMSAQLGDDPNTY 829
Query: 732 YCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKI 791
Y V T+ V+PEE EP GRI++F + KL +AE + G Y+L FNGK+LA I +
Sbjct: 830 YVVATSLVIPEEPEPKVGRIIIFHYNENKLTQVAETKVDGTCYALVEFNGKVLAGIGSFV 889
Query: 792 QLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEE 851
+LY+W +EL+ EC I ALY++ +GDFI+VGDLM+SI+LL +K EG E
Sbjct: 890 RLYEWT----NEKELRMECNIQNMIAALYLKAKGDFILVGDLMRSITLLQHKQMEGIFVE 945
Query: 852 RARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFV 911
ARD WM AVEILDDD +LG+E N NLF +K+S TDEER L + +HLG+ V
Sbjct: 946 IARDCEPKWMRAVEILDDDTFLGSETNGNLFVCQKDSAATTDEERQLLPELARFHLGDTV 1005
Query: 912 NRFRHGSLVMRLPDSDVGQIPT-----VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRK 966
N FRHGSLVM+ +VG+ T V++GT NG IG++ +P + Y FL L+ L+K
Sbjct: 1006 NVFRHGSLVMQ----NVGERTTPINGCVLYGTCNGAIGIVTQIPQDFYDFLHGLEERLKK 1061
Query: 967 VIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVS----- 1021
+IK VG + H+ +R+F + ++ F+DGDLIESFLDLSR +M + + + ++
Sbjct: 1062 IIKLVGKIGHKFYRNFRIHTQVEPSQGFIDGDLIESFLDLSRDKMRDAVQGLELTLNGER 1121
Query: 1022 ----VEELCKRVEELTRLH 1036
VE++ K VE+LTR+H
Sbjct: 1122 KSADVEDVIKIVEDLTRMH 1140
>gi|195145844|ref|XP_002013900.1| GL24391 [Drosophila persimilis]
gi|194102843|gb|EDW24886.1| GL24391 [Drosophila persimilis]
Length = 1140
Score = 960 bits (2481), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 513/1099 (46%), Positives = 709/1099 (64%), Gaps = 77/1099 (7%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDA--ESSELITRAMGDVSDRIG 60
++ I G +A + FRP +D LFI T RY +L+ +S +IT+A G+VSD G
Sbjct: 54 EININGTVAVMRHFRPPDSNKDLLFILTRRYNVMILEARMVNDSITVITKANGNVSDSAG 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
P++ G I IDP R+IG+ LY GLF +IP D + +A N+R++EL V D++FL+GC
Sbjct: 114 IPSEGGVIAAIDPKARVIGMVLYQGLFTIIPMDKEASELKATNLRMDELNVYDVEFLHGC 173
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PTI+V+++DN D RHVK++E+ L++K+F++ W Q+N++ A +LIPVP P+ GV++I
Sbjct: 174 LNPTIIVIHKDN-DGRHVKSHEINLREKEFMKIAWKQDNVETEATMLIPVPSPIGGVIVI 232
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVI-THEKEK 236
G E+IVY + + A+ R S Y RVD G RYLLG+ G L++L + T E K
Sbjct: 233 GRESIVYHDGSNYHAVAPLTFRQSTINCYARVDGKGLRYLLGNMDGQLYMLFLGTSETSK 292
Query: 237 ---VTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERY 293
V +K+E LGE SI I+YLDN +YIG+ +GDSQL++L+ + +GSYV +E +
Sbjct: 293 GVTVKDIKVEKLGEISIPECITYLDNGFLYIGARHGDSQLVRLSSE-SIEGSYVIPVENF 351
Query: 294 VNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL 353
NL PI+D VVDL+RQGQGQ++TCSG++KDGSLRI+R GIGI E A ++L GIKGMWSL
Sbjct: 352 TNLAPILDIAVVDLDRQGQGQIITCSGSFKDGSLRIIRIGIGIQEHACIDLPGIKGMWSL 411
Query: 354 RSSTDD-PFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
+ D+ P++ LV++F+ TRIL ++ E E+EETEI GF S QT C + ++QL+QV
Sbjct: 412 KVGIDESPYENTLVLAFVGLTRILTLSGE-EVEETEIPGFASDLQTFLCANVEHDQLIQV 470
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
TS SVRLV S ++ L EW+ S+ V + N++Q++LA+ + Y+ I DG L E
Sbjct: 471 TSDSVRLVKSATKALICEWRPEGDRSIGVVSCNSTQIVLASAR-DIFYIVIEDGRLVEKS 529
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
L YE++CLDI P+ E + S + AVG+WTDIS I SLPDL I E L GEIIPRS
Sbjct: 530 RKTLAYEVACLDITPLDETQNKSDLIAVGLWTDISAVILSLPDLETIYTEKLSGEIIPRS 589
Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
+L+ FEGI YLLCALGDG + F+++ TG+LTD+KKV+LGTQP TLRTF S TT+VF
Sbjct: 590 ILMTTFEGIHYLLCALGDGSMYYFIMDQTTGQLTDKKKVTLGTQPTTLRTFRSFATTNVF 649
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
A SDRPTVIYSSN KL++SNVNLKEV+HMC N+ A+PDSLA+A + + +GTID+IQKL
Sbjct: 650 ACSDRPTVIYSSNHKLVFSNVNLKEVNHMCSLNAQAYPDSLALATKNSVILGTIDEIQKL 709
Query: 653 HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
HIR++PLGE PRRI +QE S+TFA+ +L+
Sbjct: 710 HIRTVPLGEGPRRIAYQEASQTFAVSTLRIDVHGRGGAKPLRNSASTQAQNITCSSNILP 769
Query: 682 ----------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVY 731
N +E ++H + ++D TFE + + E S++S DD N Y
Sbjct: 770 KPGGNNSTAANAEVGQEIDVHNLLVIDQNTFEVLHAHQFVAPETISSLMSAKLGDDPNTY 829
Query: 732 YCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKI 791
Y V T+ V+P+E EP GRI++F D KL +AE + G Y+L FNGK+LA I +
Sbjct: 830 YVVATSLVIPDEPEPKVGRIIIFHYHDSKLTQVAETKVDGTCYALVEFNGKVLAGIGSFV 889
Query: 792 QLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEE 851
+LY+W +EL+ EC I ALY++ +GDFI+VGDLM+SI+LL +K EG E
Sbjct: 890 RLYEWT----NEKELRMECNIQNMIAALYLKAKGDFILVGDLMRSITLLQHKQMEGIFVE 945
Query: 852 RARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFV 911
ARD WM AVEILDDD +LG+E N NLF +K+S TDEER L + +HLG+ V
Sbjct: 946 IARDCEPKWMRAVEILDDDTFLGSETNGNLFVCQKDSAATTDEERQLLPELARFHLGDTV 1005
Query: 912 NRFRHGSLVMRLPDSDVGQIPT-----VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRK 966
N FRHGSLVM+ +VG+ T V++GT NG IG++ +P + Y FL L+ L+K
Sbjct: 1006 NVFRHGSLVMQ----NVGERTTPINGCVLYGTCNGAIGIVTQIPQDFYDFLHGLEERLKK 1061
Query: 967 VIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDE----ISKTMN--- 1019
+IK VG + H +R+F K ++ F+DGDLIESFLDLSR +M + + T+N
Sbjct: 1062 IIKSVGKIEHTYYRNFQINTKVEPSEGFIDGDLIESFLDLSRDKMRDAVLGLELTLNGER 1121
Query: 1020 --VSVEELCKRVEELTRLH 1036
VE++ K VE+LTR+H
Sbjct: 1122 KGADVEDVIKIVEDLTRMH 1140
>gi|125774475|ref|XP_001358496.1| GA20574 [Drosophila pseudoobscura pseudoobscura]
gi|54638233|gb|EAL27635.1| GA20574 [Drosophila pseudoobscura pseudoobscura]
Length = 1140
Score = 960 bits (2481), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 513/1099 (46%), Positives = 709/1099 (64%), Gaps = 77/1099 (7%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDA--ESSELITRAMGDVSDRIG 60
++ I G +A + FRP +D LFI T RY +L+ +S +IT+A G+VSD G
Sbjct: 54 EININGTVAVMRHFRPPDSNKDLLFILTRRYNVMILEARMVNDSITVITKANGNVSDSAG 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
P++ G I IDP R+IG+ LY GLF +IP D + +A N+R++EL V D++FL+GC
Sbjct: 114 IPSEGGVIAAIDPKARVIGMVLYQGLFTIIPMDKEASELKATNLRMDELSVYDVEFLHGC 173
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PTI+V+++DN D RHVK++E+ L++K+F++ W Q+N++ A +LIPVP P+ GV++I
Sbjct: 174 LNPTIIVIHKDN-DGRHVKSHEINLREKEFMKIAWKQDNVETEATMLIPVPSPIGGVIVI 232
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVI-THEKEK 236
G E+IVY + + A+ R S Y RVD G RYLLG+ G L++L + T E K
Sbjct: 233 GRESIVYHDGSNYHAVAPLTFRQSTINCYARVDGKGLRYLLGNMDGQLYMLFLGTSETSK 292
Query: 237 ---VTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERY 293
V +K+E LGE SI I+YLDN +YIG+ +GDSQL++L+ + +GSYV +E +
Sbjct: 293 GVTVKDIKVEKLGEISIPECITYLDNGFLYIGARHGDSQLVRLSSE-SIEGSYVIPVENF 351
Query: 294 VNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL 353
NL PI+D VVDL+RQGQGQ++TCSG++KDGSLRI+R GIGI E A ++L GIKGMWSL
Sbjct: 352 TNLAPILDIAVVDLDRQGQGQIITCSGSFKDGSLRIIRIGIGIQEHACIDLPGIKGMWSL 411
Query: 354 RSSTDD-PFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
+ D+ P++ LV++F+ TRIL ++ E E+EETEI GF S QT C + ++QL+QV
Sbjct: 412 KVGIDESPYENTLVLAFVGLTRILTLSGE-EVEETEIPGFASDLQTFLCANVEHDQLIQV 470
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
TS SVRLV S ++ L EW+ S+ V + N++Q++LA+ + Y+ I DG L E
Sbjct: 471 TSDSVRLVKSATKALICEWRPEGDRSIGVVSCNSTQIVLASAR-DIFYIVIEDGRLVEKS 529
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
L YE++CLDI P+ E + S + AVG+WTDIS I SLPDL I E L GEIIPRS
Sbjct: 530 RKTLAYEVACLDITPLDETQNKSDLIAVGLWTDISAVILSLPDLETIYTEKLSGEIIPRS 589
Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
+L+ FEGI YLLCALGDG + F+++ TG+LTD+KKV+LGTQP TLRTF S TT+VF
Sbjct: 590 ILMTTFEGIHYLLCALGDGSMYYFIMDQTTGQLTDKKKVTLGTQPTTLRTFRSFATTNVF 649
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
A SDRPTVIYSSN KL++SNVNLKEV+HMC N+ A+PDSLA+A + + +GTID+IQKL
Sbjct: 650 ACSDRPTVIYSSNHKLVFSNVNLKEVNHMCSLNAQAYPDSLALATKNSVILGTIDEIQKL 709
Query: 653 HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
HIR++PLGE PRRI +QE S+TFA+ +L+
Sbjct: 710 HIRTVPLGEGPRRIAYQEASQTFAVSTLRIDVHGRGGAKPLRNSASTQAQNITCSSNILP 769
Query: 682 ----------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVY 731
N +E ++H + ++D TFE + + E S++S DD N Y
Sbjct: 770 KPGGNNSTAANAEVGQEIDVHNLLVIDQNTFEVLHAHQFVAPETISSLMSAKLGDDPNTY 829
Query: 732 YCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKI 791
Y V T+ V+P+E EP GRI++F D KL +AE + G Y+L FNGK+LA I +
Sbjct: 830 YVVATSLVIPDEPEPKVGRIIIFHYHDNKLTQVAETKVDGTCYALVEFNGKVLAGIGSFV 889
Query: 792 QLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEE 851
+LY+W +EL+ EC I ALY++ +GDFI+VGDLM+SI+LL +K EG E
Sbjct: 890 RLYEWT----NEKELRMECNIQNMIAALYLKAKGDFILVGDLMRSITLLQHKQMEGIFVE 945
Query: 852 RARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFV 911
ARD WM AVEILDDD +LG+E N NLF +K+S TDEER L + +HLG+ V
Sbjct: 946 IARDCEPKWMRAVEILDDDTFLGSETNGNLFVCQKDSAATTDEERQLLPELARFHLGDTV 1005
Query: 912 NRFRHGSLVMRLPDSDVGQIPT-----VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRK 966
N FRHGSLVM+ +VG+ T V++GT NG IG++ +P + Y FL L+ L+K
Sbjct: 1006 NVFRHGSLVMQ----NVGERTTPINGCVLYGTCNGAIGIVTQIPQDFYDFLHGLEERLKK 1061
Query: 967 VIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDE----ISKTMN--- 1019
+IK VG + H +R+F K ++ F+DGDLIESFLDLSR +M + + T+N
Sbjct: 1062 IIKSVGKIEHTYYRNFQINTKVEPSEGFIDGDLIESFLDLSRDKMRDAVLGLELTLNGER 1121
Query: 1020 --VSVEELCKRVEELTRLH 1036
VE++ K VE+LTR+H
Sbjct: 1122 KGADVEDVIKIVEDLTRMH 1140
>gi|330792580|ref|XP_003284366.1| hypothetical protein DICPUDRAFT_86223 [Dictyostelium purpureum]
gi|325085712|gb|EGC39114.1| hypothetical protein DICPUDRAFT_86223 [Dictyostelium purpureum]
Length = 1064
Score = 955 bits (2469), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 505/1062 (47%), Positives = 722/1062 (67%), Gaps = 37/1062 (3%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
M DV IYGRI+ L+LF G+ QD+LFI+TE +KFC+L +D+E E++T+A G+ D IG
Sbjct: 9 MFDVNIYGRISVLKLFSAGGK-QDYLFISTESFKFCILAYDSEKKEIVTKASGNAEDTIG 67
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFD---NKGQLKEAFNIRLEELQVLDIKFL 117
RPT+ GQ+GIIDPD RLI LHLY+GL K+I + N K A N RLEELQV+D+ FL
Sbjct: 68 RPTEAGQLGIIDPDGRLIALHLYEGLLKLINIEKGLNNPIQKTAANTRLEELQVMDMTFL 127
Query: 118 YGCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGV 177
YGC PTI VL++D KD +H+ TYEV+ KD++ GPWSQ+N+ + +L+ VP L GV
Sbjct: 128 YGCKIPTIAVLFKDTKDEKHIVTYEVSQKDQELCPGPWSQSNVGVYSSMLVAVP--LGGV 185
Query: 178 LIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKV 237
L++ + I Y + ++I I + AY RVD DGSRYL GDH G L +LV+ + +++V
Sbjct: 186 LVVADNGITYMNGRTTRSIAIPYTKFLAYDRVDKDGSRYLFGDHFGRLSVLVLLNHQQRV 245
Query: 238 TGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPD-AKGSYVEVLERYVNL 296
T LK E LG TSI S+ISYLD+ VV+IGSS GDSQLI+LN + D A SY+ LE + N+
Sbjct: 246 TELKFETLGRTSIPSSISYLDSGVVFIGSSSGDSQLIRLNTEKDPATDSYISHLENFTNI 305
Query: 297 GPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLR-- 354
GPIVDFC+VD E+QGQ Q+VTCSG Y+DG+LR++RNGIGI E+A +EL+G+KG+W ++
Sbjct: 306 GPIVDFCLVDTEKQGQAQIVTCSGTYRDGTLRVIRNGIGIAEKALIELEGVKGLWPIKEN 365
Query: 355 --SSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDA-IYNQLVQ 411
S +P D +L+VSFI T++L +E+EETE EG S + T+ C + N +VQ
Sbjct: 366 DPSDPLNPKDQYLIVSFIGYTKVLQFQ-GEEIEETEFEGLDSNSSTILCSNIDKENVIVQ 424
Query: 412 VTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTE- 470
VT+ ++ L++ + + ++WKSP G +N+ ++N SQ+ L+ G L Y EI + E
Sbjct: 425 VTNQAINLINPITFKRVDQWKSPSGSPINLVSSNQSQIALSIGKS-LYYFEINEQSRIEL 483
Query: 471 VKHAQLEYEISCLDINPIGE-NPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEII 529
+K +L +EISC+DI+P+ S SQI AVG+WTDI++R+F LP L I KE LGGEII
Sbjct: 484 IKEIELPHEISCIDISPLDSFMDSRSQICAVGLWTDITLRLFKLPTLEEIHKEPLGGEII 543
Query: 530 PRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTG-ELTDRKKVSLGTQPITLRTFSSKNT 588
PRS+L+ +FEGI Y+ C+LGDGHL F +++ +L D++K++LGTQPI L+ F KNT
Sbjct: 544 PRSILMISFEGIDYIFCSLGDGHLFKFKIDIANNWKLFDKRKLTLGTQPIILKKFKLKNT 603
Query: 589 THVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDD 648
++FA SDRPTVIYS+NKKL YS VNLKEV+++ FNS AFP S+AI+ E L IGTID+
Sbjct: 604 INIFALSDRPTVIYSNNKKLFYSVVNLKEVTNVTSFNSDAFPGSMAISSESSLIIGTIDE 663
Query: 649 IQKLHIRSIPL-GEHPRRICHQEQSRTFAICSLK-----------NQSCAEE--SEMHFV 694
IQKLHI++I L GE RRI H E+ +A+ ++K N + +E E+ +V
Sbjct: 664 IQKLHIKTISLNGEMARRIVHLEEYSCYAVITIKTNEDIISGNGENATTIDEVEEEVSYV 723
Query: 695 RLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVF 754
RL DDQTFE +S++ L+ +E G S+ S F DD Y VGT+ +P+ T GR+L+F
Sbjct: 724 RLFDDQTFEPLSSFRLEHYEMGWSLTSTKFDDDPCTYLAVGTSINIPDRQ--TSGRVLLF 781
Query: 755 -IVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWML-RDDGTRELQSECGH 812
I E KL L+ E + V L+ FNG+L+AA+ +++ ++ ++ + + SE H
Sbjct: 782 NINEAKKLVLLEEISFRSGVLYLHQFNGRLIAAVLKRLYSIRYSYSKEKNCKVISSENVH 841
Query: 813 HGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIY 872
GH + L + +RG F++VGD+MKS+SLL + E G++ + A++ W+ ++ +++DD +
Sbjct: 842 KGHTMILKLASRGHFMLVGDMMKSMSLL-GQSENGSLVQIAKNPQPIWIRSIAMINDDYF 900
Query: 873 LGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIP 932
+G+E + N V+KN++ + ER L+ VG YH+GE +N GSLV RLPDSD IP
Sbjct: 901 IGSETSNNFVVVKKNNDSTNELERELLDSVGHYHIGESINSMLCGSLV-RLPDSDAPPIP 959
Query: 933 TVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAK 992
T+++ +VNG IGVIAS+ E Y F KLQ L +V+ G+GG HE WR+F+N+ TV+++
Sbjct: 960 TILYASVNGSIGVIASISKEDYEFFSKLQKGLNRVVNGIGGFTHESWRAFSNDHHTVESR 1019
Query: 993 NFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTR 1034
NF+DGDLIE F DL M ++ + MNV+++E KR+E L +
Sbjct: 1020 NFIDGDLIEMFPDLKIESMAKVIQDMNVTLDETLKRIESLMQ 1061
>gi|195571247|ref|XP_002103615.1| GD18880 [Drosophila simulans]
gi|194199542|gb|EDX13118.1| GD18880 [Drosophila simulans]
Length = 1140
Score = 955 bits (2469), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 510/1099 (46%), Positives = 706/1099 (64%), Gaps = 77/1099 (7%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESS--ELITRAMGDVSDRIG 60
++ I G IA + FRP +D LFI T RY +L+ + ++T+A G+VSD +G
Sbjct: 54 EININGTIAVMRHFRPPDSNKDLLFILTRRYNVMILEARMVNDVITVVTKANGNVSDSVG 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
+ G I ID R+IG+ LY GLF +IP D +A N+R++EL V D++FL+GC
Sbjct: 114 ILSAGGVIAAIDLKARVIGMCLYQGLFTIIPMDKDASELKATNLRMDELNVYDVEFLHGC 173
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PT++V+++DN D RHVK++E+ L+DK+F++ W Q+N++ A +LIPVP P+ GV++I
Sbjct: 174 LNPTVIVIHKDN-DGRHVKSHEINLRDKEFMKIAWKQDNVETEATMLIPVPSPIGGVIVI 232
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVI-THEKEK 236
G E+IVY + + A+ R S Y RV ++G RYLLG+ G L++L + T E K
Sbjct: 233 GRESIVYHDGSNYHAVAPLTFRQSTINCYARVSSNGLRYLLGNMDGQLYMLFLGTAETSK 292
Query: 237 ---VTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERY 293
V +K+E LGE SI I+YLDN +YIG+ +GDSQL++LN + GSYV +E +
Sbjct: 293 GVTVKDIKVEQLGEISIPECITYLDNGFLYIGARHGDSQLVRLNSEA-IDGSYVVPVENF 351
Query: 294 VNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL 353
NL PI+D VVDL+RQGQGQ++TCSG++KDGSLRI+R GIGI E A ++L GIKGMWSL
Sbjct: 352 TNLAPILDIAVVDLDRQGQGQIITCSGSFKDGSLRIIRIGIGIQEHACIDLPGIKGMWSL 411
Query: 354 RSSTDD-PFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
+ D+ P++ LV++F+ TRIL ++ E E+EETEI GF S QT C + Y+QL+QV
Sbjct: 412 KVGVDESPYENTLVLAFVGHTRILTLSGE-EVEETEIPGFASDLQTFLCSNVDYDQLIQV 470
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
TS SVRLVSS ++ L EW+ S+ V + N +Q+L+A+ + Y+ I DG L E
Sbjct: 471 TSDSVRLVSSATKALVAEWRPTGDRSIGVVSCNTTQILVASAC-DIFYIVIEDGNLREQC 529
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
L YE++CLDI P+ E S + AVG+WTDIS I SLPDL I E L GEIIPRS
Sbjct: 530 RRTLAYEVACLDITPLDETQKKSDLVAVGLWTDISAVILSLPDLETIYTEKLSGEIIPRS 589
Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
+L+ FEGI YLLCALGDG + F+++ TG+LTD+KKV+LGTQP TLRTF S +TT+VF
Sbjct: 590 ILMTTFEGIHYLLCALGDGSMYYFIMDQTTGQLTDKKKVTLGTQPTTLRTFRSLSTTNVF 649
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
A SDRPTVIYSSN KL++SNVNLKEV+HMC N+ A+PDSLA+A + + +GTID+IQKL
Sbjct: 650 ACSDRPTVIYSSNHKLVFSNVNLKEVNHMCSLNAQAYPDSLALANKNAVILGTIDEIQKL 709
Query: 653 HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
HIR++PLGE PRRI +QE S+TFA+ +L+
Sbjct: 710 HIRTVPLGEGPRRIAYQESSQTFAVSTLRIDVHGRGGAKPLRNSASTQAQNITCSSNFLP 769
Query: 682 ----------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVY 731
N +E ++H + ++D TFE + + + E S++S DD N Y
Sbjct: 770 KPGGGNSTAANAEVGQEIDVHNLLVIDQNTFEVLHAHQFVSPETISSLMSAKLGDDPNTY 829
Query: 732 YCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKI 791
Y V T+ V+PEE EP GRI++F + KL +AE + G Y+L FNGK+LA I +
Sbjct: 830 YVVATSLVIPEEPEPKVGRIIIFHYNENKLTQVAETKVDGTCYALVEFNGKVLAGIGSFV 889
Query: 792 QLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEE 851
+LY+W +EL+ EC I ALY++ +GDFI+VGDLM+SI+LL +K EG E
Sbjct: 890 RLYEWT----NEKELRMECNIQNMIAALYLKAKGDFILVGDLMRSITLLQHKQMEGIFVE 945
Query: 852 RARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFV 911
ARD WM AVEILDDD +LG+E N NLF +K+S TDEER L + +HLG+ V
Sbjct: 946 IARDCEPKWMRAVEILDDDTFLGSETNGNLFVCQKDSAATTDEERQLLPELARFHLGDTV 1005
Query: 912 NRFRHGSLVMRLPDSDVGQIPT-----VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRK 966
N FRHGSLVM+ +VG+ T V++GT NG IG++ +P + Y FL L+ L+K
Sbjct: 1006 NVFRHGSLVMQ----NVGERTTPINGCVLYGTCNGAIGIVTQIPQDFYDFLHGLEERLKK 1061
Query: 967 VIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVS----- 1021
+IK VG + H +R+F K ++ F+DGDLIESFLDLSR +M + + + ++
Sbjct: 1062 IIKSVGKIEHTYYRNFQINTKVEPSEGFIDGDLIESFLDLSRDKMRDAVQGLELTLNGER 1121
Query: 1022 ----VEELCKRVEELTRLH 1036
VE++ K VE+LTR+H
Sbjct: 1122 KSADVEDVIKIVEDLTRMH 1140
>gi|194741158|ref|XP_001953056.1| GF17579 [Drosophila ananassae]
gi|190626115|gb|EDV41639.1| GF17579 [Drosophila ananassae]
Length = 1140
Score = 954 bits (2465), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 506/1100 (46%), Positives = 710/1100 (64%), Gaps = 79/1100 (7%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESS--ELITRAMGDVSDRIG 60
++ I G IA + FRP +D LFI T RY +L+ + ++T+A G+VSD +G
Sbjct: 54 EININGTIAVMRHFRPPDSNKDLLFILTRRYNVMILEARMVNDVITVVTKANGNVSDSVG 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
P++ G I IDP R+IG+ LY GLF +IP D + +A N+R++EL V D++FL+GC
Sbjct: 114 IPSEGGVIAAIDPKARVIGMCLYQGLFTIIPMDKEASELKATNLRMDELNVYDVEFLHGC 173
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PT++V+++DN D RHVK++E+ L++K+F++ W Q+N++ A +LI VP P+ GV++I
Sbjct: 174 LNPTVIVIHKDN-DGRHVKSHEINLREKEFMKIAWKQDNVETEATMLITVPSPIGGVIVI 232
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVI-THEKEK 236
G E+IVY + + A+ R S Y RVD+ G RYLLG+ G L++L + T E K
Sbjct: 233 GRESIVYHDGSNYHAVAPLTFRQSTINCYARVDSKGFRYLLGNMDGQLYMLFLGTSETSK 292
Query: 237 ---VTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDA-KGSYVEVLER 292
V +K+E LGE SI I+YLDN +YIG+ +GDSQL++L+ DA GSYV +E
Sbjct: 293 GITVKDIKVEQLGEISIPECITYLDNGFLYIGARHGDSQLVRLS--SDAIDGSYVIPVEN 350
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
+ NL PI+D VVDL+RQGQGQ++TCSG++KDGSLRI+R GIGI E A ++L GIKGMWS
Sbjct: 351 FTNLAPILDIAVVDLDRQGQGQIITCSGSFKDGSLRIIRIGIGIQEHACIDLPGIKGMWS 410
Query: 353 LRSSTDD-PFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQ 411
L+ DD P++ LV++F+ TRIL ++ E E+EETEI GF S QT C + ++Q++Q
Sbjct: 411 LKVGIDDSPYENTLVLAFVGHTRILTLSGE-EVEETEIPGFASDLQTFLCANVEHDQIIQ 469
Query: 412 VTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEV 471
VTS SVRLV S +++L EW+ S+ V + N++Q+++A+ + Y+ I DG L E
Sbjct: 470 VTSDSVRLVKSATKDLVAEWRPEGDRSIGVVSCNSTQIVVASAR-DIFYIVIEDGKLVEK 528
Query: 472 KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPR 531
L YE++CLDI P+ E+ + S + AVG+WTDIS I SLP+L I E L GEIIPR
Sbjct: 529 SRKTLAYEVACLDITPLDESQNKSDLVAVGLWTDISAVILSLPELETIYTEKLSGEIIPR 588
Query: 532 SVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHV 591
S+L+ FEGI YLLCALGDG + F+++ TG+LTD+KKV+LGTQP TLRTF S TT+V
Sbjct: 589 SILMTTFEGIHYLLCALGDGSMYYFIMDQTTGQLTDKKKVTLGTQPTTLRTFRSFATTNV 648
Query: 592 FAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQK 651
FA SDRPTVIYSSN KL++SNVNLKEV+HMC N+ A+PDSLA+A + + +GTID+IQK
Sbjct: 649 FACSDRPTVIYSSNHKLVFSNVNLKEVNHMCSLNAQAYPDSLALATKNSVILGTIDEIQK 708
Query: 652 LHIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------ 681
LHIR++PLGE PRRI +QE S+TFA+ +L+
Sbjct: 709 LHIRTVPLGEGPRRIAYQEASQTFAVSTLRIDVHGRGGTKPLRNSASTQAQNITYSSNIL 768
Query: 682 -----------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNV 730
N +E ++H + ++D TFE + + E S++S DD N
Sbjct: 769 PKPGTGTSTATNAEVGQEIDVHNLLVIDQNTFEVLHAHQFVAPETISSLMSAKLGDDPNT 828
Query: 731 YYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQK 790
YY V T+ V+P+E EP GRI++F D KL +AE + G Y+L FNGK+LA I
Sbjct: 829 YYVVATSLVIPDEPEPKVGRIIIFHYHDNKLTQVAETKVDGTCYALVEFNGKVLAGIGSF 888
Query: 791 IQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIE 850
++LY+W +EL+ EC I AL+++ +GDFI+VGDLM+SI+LL +K EG
Sbjct: 889 VRLYEWT----NEKELRMECNIQNMIAALFLKAKGDFILVGDLMRSITLLQHKQMEGIFV 944
Query: 851 ERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEF 910
E ARD WM AVEILDDD +LG+E N NLF +K+S TDEER L + +HLG+
Sbjct: 945 EIARDCEPKWMRAVEILDDDTFLGSETNGNLFVCQKDSAATTDEERQLLPELARFHLGDT 1004
Query: 911 VNRFRHGSLVMRLPDSDVGQIPT-----VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLR 965
VN FRHGSLVM+ +VG+ T V++GT NG IG++ +P + Y FL L+ L+
Sbjct: 1005 VNVFRHGSLVMQ----NVGERTTPINGCVLYGTCNGAIGIVTQIPQDFYDFLHGLEERLK 1060
Query: 966 KVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVS---- 1021
K+IK VG + H +R+F K ++ F+DGDLIESFLDL R +M + + + ++
Sbjct: 1061 KIIKSVGKIEHTYYRNFQINTKVEPSEGFIDGDLIESFLDLGRDKMRDAVQGLEITLNGE 1120
Query: 1022 -----VEELCKRVEELTRLH 1036
VE++ K VE+LTR+H
Sbjct: 1121 RKSADVEDVIKIVEDLTRMH 1140
>gi|195395112|ref|XP_002056180.1| GJ10363 [Drosophila virilis]
gi|194142889|gb|EDW59292.1| GJ10363 [Drosophila virilis]
Length = 1140
Score = 945 bits (2443), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 502/1099 (45%), Positives = 708/1099 (64%), Gaps = 77/1099 (7%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWD--AESSELITRAMGDVSDRIG 60
++ I G+I + FRP +D LFI T R+ +L+ ++ ++T+A G+VSD +G
Sbjct: 54 EININGKITVMRHFRPPDSNKDLLFILTRRFNVMILEARMVGDAVTVVTKANGNVSDSVG 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
++ G I IDP R+IG+ LY GLF +IP D +A N+R++EL V D++FL+GC
Sbjct: 114 ILSEGGFIAAIDPKARVIGMCLYQGLFTIIPLDKDASELKATNLRMDELTVYDVEFLHGC 173
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PT++V+++DN D RHVK++E+ L+DK+F++ W Q+N++ A +LIPVP + GV++I
Sbjct: 174 QNPTVIVIHKDN-DGRHVKSHEINLRDKEFIKVAWKQDNVETEATMLIPVPSSIGGVIVI 232
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVI----THE 233
G E+IVY + + A+ R S Y RVD+ G RYLLG+ G L++L + T +
Sbjct: 233 GRESIVYHDGSNYHAVAPLTFRQSTINCYARVDSKGLRYLLGNMDGQLYMLFLGTTETSK 292
Query: 234 KEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERY 293
V +K+E LGE SI I+YLDN +YIGS +GDSQL++L+ + GSYV +E +
Sbjct: 293 GTTVKDIKVEQLGEISIPECITYLDNGFLYIGSRHGDSQLVRLSSEA-IDGSYVIPVENF 351
Query: 294 VNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL 353
NL PI+D VVDL+RQGQGQ++TCSG++KDGSLRI+R GIGI E A ++L GIKGMWSL
Sbjct: 352 TNLAPILDIAVVDLDRQGQGQIITCSGSFKDGSLRIIRIGIGIQEHACIDLPGIKGMWSL 411
Query: 354 RSSTDD-PFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
+ DD ++ LV++F+ TRIL ++ E E+EETEI GF S QT C + Y+QL+QV
Sbjct: 412 KVGIDDSAYENTLVLAFVGHTRILTLSGE-EVEETEIPGFASDLQTFLCANVDYDQLIQV 470
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
T+ SVRLV S ++ L EWK S+ V + N++Q++ A+ + Y+ I DG L E
Sbjct: 471 TAESVRLVKSATKTLVGEWKPEGDRSIGVVSCNSTQIVAASAR-EIFYISIEDGSLVEKC 529
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
L YE++CLD+ P+ E + S++ AVG+WTDIS I LPDL I E L GEIIPRS
Sbjct: 530 RKTLPYEVACLDVTPLDEKQTKSELVAVGLWTDISAVILRLPDLETIYTEKLSGEIIPRS 589
Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
+L+ FE I+YLLCALGDG + F+L+ TG+LTD+KKV+LGTQP TLRTF S +TT+VF
Sbjct: 590 ILMTTFEDINYLLCALGDGSMYYFILDRTTGQLTDKKKVTLGTQPTTLRTFRSFSTTNVF 649
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
A SDRPTVIYSSN KL++SNVNLKEV+HMC N+ A+PDSLA+A + + +GTID+IQKL
Sbjct: 650 ACSDRPTVIYSSNHKLVFSNVNLKEVNHMCSLNAQAYPDSLALATKNSVILGTIDEIQKL 709
Query: 653 HIRSIPLGEHPRRICHQEQSRTFAICSLK---------------------NQSCA----- 686
HIR++PLGE PRRI +QE S+TFA+ +L+ N SC+
Sbjct: 710 HIRTVPLGEGPRRIAYQEASQTFAVSTLRIDVHGRGGAKPLRNSASTQAQNSSCSSNILP 769
Query: 687 ---------------EESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVY 731
+E ++H + ++D TFE + ++ E S++S DD N Y
Sbjct: 770 KPGGGNSTAANAEVGQEIDVHNLLVIDQNTFEVLHSHQFVPPETISSLMSAKLGDDPNTY 829
Query: 732 YCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKI 791
Y V T+ V P+E EP GRI++F + KL +AE + G Y+L FNGK+LA I +
Sbjct: 830 YVVATSLVFPDEPEPKVGRIIIFHYNENKLTQVAETKVDGTCYALVEFNGKVLAGIGSFV 889
Query: 792 QLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEE 851
+LY+W +EL+ EC I AL+++ +GDFI+VGDLM+SI+LL +K EG E
Sbjct: 890 RLYEWT----NEKELRMECNIQNMIAALFLKAKGDFILVGDLMRSITLLQHKQMEGIFVE 945
Query: 852 RARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFV 911
ARD WM AVEILDDD +LG E + NLF +K+S TDEER L + +HLG+ +
Sbjct: 946 IARDCEPKWMRAVEILDDDTFLGCETHDNLFVCQKDSAATTDEERQLLPELARFHLGDTI 1005
Query: 912 NRFRHGSLVMRLPDSDVGQIPT-----VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRK 966
N FRHGSLVM+ +VG+ T V++GT NG IG++ +P + Y FL L+ L+K
Sbjct: 1006 NVFRHGSLVMQ----NVGERTTPINGCVLYGTCNGAIGIVTQIPQDFYDFLHGLEERLKK 1061
Query: 967 VIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDE----ISKTMN--- 1019
+IK VG ++H +R++ K ++ F+DGDLIESFLDL+R +M E + T+N
Sbjct: 1062 IIKSVGKIDHTYYRNYQINTKVEPSEGFIDGDLIESFLDLNRDKMREAVSGLELTLNGER 1121
Query: 1020 --VSVEELCKRVEELTRLH 1036
VE++ K VE+LTR+H
Sbjct: 1122 KSADVEDVIKIVEDLTRMH 1140
>gi|119594340|gb|EAW73934.1| damage-specific DNA binding protein 1, 127kDa, isoform CRA_b [Homo
sapiens]
Length = 923
Score = 942 bits (2435), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 481/929 (51%), Positives = 632/929 (68%), Gaps = 64/929 (6%)
Query: 166 LLIPVPPPLCGVLIIGEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHA 222
++I VP P G +IIG+E+I Y + + + AI I+ S + RVD +GSRYLLGD
Sbjct: 1 MVIAVPEPFGGAIIIGQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDME 60
Query: 223 GLLHLLVITHEKE-----KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLN 277
G L +L++ E++ + L++ELLGETSIA ++YLDN VV++GS GDSQL+KLN
Sbjct: 61 GRLFMLLLEKEEQMDGTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLN 120
Query: 278 LQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGIN 337
+ + +GSYV +E + NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+
Sbjct: 121 VDSNEQGSYVVAMETFTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIH 180
Query: 338 EQASVELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQ 397
E AS++L GIKG+W LRS + D LV+SF+ +TR+L +N E E+EETE+ GF Q
Sbjct: 181 EHASIDLPGIKGLWPLRSDPNRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQ 239
Query: 398 TLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGH 457
T FC + + QL+Q+TS SVRLVS + L +EWK P +++VA+ N+SQV++A G
Sbjct: 240 TFFCGNVAHQQLIQITSASVRLVSQEPKALVSEWKEPQAKNISVASCNSSQVVVAVGRA- 298
Query: 458 LVYLEIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLN 517
L YL+I L ++ H ++E+E++CLDI P+G++ S + A+G+WTDIS RI LP
Sbjct: 299 LYYLQIHPQELRQISHTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFE 358
Query: 518 LITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQP 577
L+ KE LGGEIIPRS+L+ FE YLLCALGDG L F LN++TG L+DRKKV+LGTQP
Sbjct: 359 LLHKEMLGGEIIPRSILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQP 418
Query: 578 ITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAK 637
LRTF S +TT+VFA SDRPTVIYSSN KL++SNVNLKEV++MCP NS +PDSLA+A
Sbjct: 419 TVLRTFRSLSTTNVFACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALAN 478
Query: 638 EGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLK---------------- 681
LTIGTID+IQKLHIR++PL E PR+IC+QE S+ F + S +
Sbjct: 479 NSTLTIGTIDEIQKLHIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGTTALRPS 538
Query: 682 -----------------------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCS 718
S EE E+H + ++D TFE + + EY S
Sbjct: 539 ASTQALSSSVSSSKLFSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALS 598
Query: 719 ILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNA 778
++SC D N Y+ VGTA V PEE EP +GRI+VF DGKLQ +AEKE KGAVYS+
Sbjct: 599 LVSCKLGKDPNTYFIVGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVE 658
Query: 779 FNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSIS 838
FNGKLLA+IN ++LY+W +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+
Sbjct: 659 FNGKLLASINSTVRLYEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVL 714
Query: 839 LLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGR 898
LL YK EG EE ARD+N NWMSAVEILDDD +LGAEN FNLF +K+S TDEER
Sbjct: 715 LLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQH 774
Query: 899 LEVVGEYHLGEFVNRFRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFL 957
L+ VG +HLGEFVN F HGSLVM+ L ++ +V+FGTVNG+IG++ SL Y L
Sbjct: 775 LQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLL 834
Query: 958 EKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKT 1017
+Q L KVIK VG + H WRSF+ E+KT A F+DGDLIESFLD+SR +M E+
Sbjct: 835 LDMQNRLNKVIKSVGKIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVAN 894
Query: 1018 MN----------VSVEELCKRVEELTRLH 1036
+ + ++L K VEELTR+H
Sbjct: 895 LQYDDGSGMKREATADDLIKVVEELTRIH 923
>gi|195108657|ref|XP_001998909.1| GI23368 [Drosophila mojavensis]
gi|193915503|gb|EDW14370.1| GI23368 [Drosophila mojavensis]
Length = 1140
Score = 941 bits (2432), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 501/1099 (45%), Positives = 704/1099 (64%), Gaps = 77/1099 (7%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWD--AESSELITRAMGDVSDRIG 60
++ I G+I + FRP +D LFI T R+ +L+ ++ ++T+A G+VSD +G
Sbjct: 54 EININGKITVMRHFRPPDSNKDLLFILTRRFNVMILEARMVGDNITVVTKANGNVSDSVG 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
++ G I IDP R+IG+ LY GLF +IP D +A ++R++EL V D++FL+GC
Sbjct: 114 ILSEGGFIAAIDPKARVIGMCLYQGLFTIIPLDKDASELKATSLRMDELIVYDVEFLHGC 173
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PT++V+++DN D RHVK +E+ L+DK+F++ W Q+N++ A +LIPVP P+ GV++I
Sbjct: 174 LNPTVIVIHKDN-DGRHVKCHEINLRDKEFMKLAWKQDNVETEATMLIPVPSPIGGVIVI 232
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVI----THE 233
G E+IVY + + A+ R S Y RVD+ G RYLLG+ G L++L + T +
Sbjct: 233 GRESIVYHDGSNYHAVAPLTFRQSTINCYARVDSKGLRYLLGNMDGQLYMLFLGINETGK 292
Query: 234 KEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERY 293
V +K+E LGE SI I+YLDN +YIGS +GDSQL++L+ +P GSYV +E +
Sbjct: 293 VPTVKDIKVEQLGEISIPECITYLDNGFLYIGSRHGDSQLVRLSSEP-IDGSYVIPVENF 351
Query: 294 VNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL 353
NL PI+D VVDL+RQGQGQ++TCSG++KDGSLRI+R GIGI E A ++L GIKGMWSL
Sbjct: 352 TNLAPILDIAVVDLDRQGQGQIITCSGSFKDGSLRIIRIGIGIQEHACIDLPGIKGMWSL 411
Query: 354 RSSTDDP-FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
+ DD ++ LV++F+ TRIL ++ E E+EETEI GF S Q+ C + Y+QL+QV
Sbjct: 412 KVGIDDSVYENTLVLAFVGHTRILTLSGE-EVEETEIPGFASDLQSFLCANVEYDQLIQV 470
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
T+ SVRLV S+++ L EWK S+ V + N++Q++ A+ + Y+ I DG L E
Sbjct: 471 TAESVRLVKSSTKALVGEWKPEGDRSIGVVSCNSTQIVAASAR-EIFYIAIEDGSLVEKC 529
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
L YE++CLD+ P+ E + S++ AVG+WTDIS I SLPDL I E L GEIIPRS
Sbjct: 530 RKTLAYEVACLDVTPLDEKQNKSELVAVGLWTDISAVILSLPDLETIYTEKLSGEIIPRS 589
Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
+L+ FE I YLLCALGDG + F+L+ TG LTD+KKV+LGTQP TLRTF S TT+VF
Sbjct: 590 ILMTTFEDIHYLLCALGDGSMYYFILDRTTGFLTDKKKVTLGTQPTTLRTFRSFATTNVF 649
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
A SDRPTVIYSSN KL++SNVNLKEV+HMC N+ A+PDSLA+A + + +GTID+IQKL
Sbjct: 650 ACSDRPTVIYSSNHKLVFSNVNLKEVNHMCSLNAQAYPDSLALATKNSVILGTIDEIQKL 709
Query: 653 HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
HIR++PLGE PRRI +QE S+TFA+ +L+
Sbjct: 710 HIRTVPLGEGPRRIAYQEASQTFAVSTLRIDVHGRGGAKPLRNSASTQAQNITCSSNMLP 769
Query: 682 ----------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVY 731
N +E ++H + ++D TFE + + E +++S DD N Y
Sbjct: 770 KPGGGNSTAANAEVGQEIDVHNLLIIDQNTFEVLHAHQFVPPETISALMSAKLGDDPNTY 829
Query: 732 YCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKI 791
Y V T+ V P+E EP GRI++F + KL +AE + G Y+L FNGK+LA I +
Sbjct: 830 YVVATSLVFPDEPEPKVGRIIIFHYHENKLTQVAETKVDGTCYALVEFNGKVLAGIGSFV 889
Query: 792 QLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEE 851
+LY+W +EL+ EC I AL+++ +GDFI+VGDLM+SI+LL +K EG E
Sbjct: 890 RLYEWT----NEKELRMECNIQNMIAALFLKAKGDFILVGDLMRSITLLQHKQMEGIFVE 945
Query: 852 RARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFV 911
ARD WM AVEILDDD +LG E + NLF +K+S TDEER L + +HLG+ +
Sbjct: 946 IARDCEPKWMRAVEILDDDTFLGCETHDNLFVCQKDSAATTDEERQLLPELARFHLGDTI 1005
Query: 912 NRFRHGSLVMRLPDSDVGQIPT-----VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRK 966
N FRHGSLVM+ +VG+ T V++GT NG IG++ +P + Y FL L+ L+K
Sbjct: 1006 NVFRHGSLVMQ----NVGERTTPINGCVLYGTCNGAIGIVTQIPQDFYDFLHGLEERLKK 1061
Query: 967 VIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDE----ISKTMN--- 1019
+IK VG ++H +R++ K ++ F+DGDLIESFLDLSR +M E + T+N
Sbjct: 1062 IIKSVGKIDHTYYRNYQINTKVEPSEGFIDGDLIESFLDLSRDKMHEAVTGLELTLNGER 1121
Query: 1020 --VSVEELCKRVEELTRLH 1036
VE++ K VE+LTR+H
Sbjct: 1122 KSADVEDVIKIVEDLTRMH 1140
>gi|195037449|ref|XP_001990173.1| GH18378 [Drosophila grimshawi]
gi|193894369|gb|EDV93235.1| GH18378 [Drosophila grimshawi]
Length = 1140
Score = 937 bits (2422), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 499/1099 (45%), Positives = 702/1099 (63%), Gaps = 77/1099 (7%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWD--AESSELITRAMGDVSDRIG 60
++ I G+I + FRP +D LFI T R+ +L+ ++ ++T+A G+VSD +G
Sbjct: 54 EININGKITVMRHFRPPDTKKDLLFILTRRFNVMILEARMVGDTITVVTKANGNVSDSVG 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
++ G I IDP R+IG+ LY GLF +IP D +A N+R++EL V D++FL+GC
Sbjct: 114 ILSEGGFIAAIDPKARVIGMCLYQGLFTIIPLDKDASELKATNLRMDELTVYDVEFLHGC 173
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PT++V+++DN D RHVK++E+ L+DK+F++ W Q+N++ A +LIPVP P+CGV++I
Sbjct: 174 LNPTVIVIHRDN-DGRHVKSHEINLRDKEFMKIAWKQDNVETEATMLIPVPSPICGVIVI 232
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVI-THEKEK 236
G E+IVY + + A+ R S Y R+D G RYLLG+ G L++L + T E K
Sbjct: 233 GRESIVYHDGSNYHAVAPLTFRQSTINCYARIDEKGLRYLLGNMDGQLYMLFLGTTETSK 292
Query: 237 ---VTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERY 293
V +K+E LGE SI I+YLDN +YIGS +GDSQL++L+ + G++V +E +
Sbjct: 293 GITVKDIKVEQLGEISIPECITYLDNGFLYIGSRHGDSQLVRLSSEA-IDGTFVIPVENF 351
Query: 294 VNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL 353
NL PI+D VVDL+RQGQGQ++TCSG++KDGSLRI+R GIGI E A ++L GIKGMWSL
Sbjct: 352 TNLAPILDIAVVDLDRQGQGQIITCSGSFKDGSLRIIRIGIGIQEHACIDLPGIKGMWSL 411
Query: 354 RSSTDDP-FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
+ D+ ++ LV++F+ TRIL ++ E E+EET+I GF S Q+ C + Y+QL+QV
Sbjct: 412 KVGIDESSYENTLVLAFVGHTRILTLSGE-EVEETDIPGFASDLQSFLCANVDYDQLIQV 470
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
T+ SVRLV S ++ L EWK S+ V + N +Q++ A+ + Y+ I DG L E
Sbjct: 471 TAESVRLVKSATKALVGEWKPEGDRSIGVVSCNTTQIVAASAR-EIFYISIEDGSLVEKC 529
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
L YE++CLD+ P+ E + S + AVG+WTDIS I SLPDL I E L GEIIPRS
Sbjct: 530 RKILPYEVACLDVTPLDEKQNKSDLVAVGLWTDISAVILSLPDLETIYTEKLSGEIIPRS 589
Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
+L+ FE I YLLCALGDG + F+L+ TG+LT++KKV+LGTQP TLRTF S TT+VF
Sbjct: 590 ILMTTFEDIHYLLCALGDGSMYYFILDKTTGQLTEKKKVTLGTQPTTLRTFRSFATTNVF 649
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
A SDRPTVIYSSN KL++SNVNLKEV+HMC N+ A+PDSLA+A + + +GTID+IQKL
Sbjct: 650 ACSDRPTVIYSSNHKLVFSNVNLKEVNHMCSLNAQAYPDSLALATKNSVILGTIDEIQKL 709
Query: 653 HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
HIR++PLGE PRRI +QE S+TFA+ +L+
Sbjct: 710 HIRTVPLGEGPRRIAYQEASQTFAVSTLRIDVHGRGGAKPLRNSASTQAQNITCGSNILP 769
Query: 682 ----------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVY 731
N +E ++H + ++D TFE + + E +++S DD N Y
Sbjct: 770 KPGGGNSTAANAEVGQEIDVHNLLIIDQNTFEVLHAHQFVPPETISTLMSAKLGDDPNTY 829
Query: 732 YCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKI 791
Y V T+ V PEE EP GRI++F D KL +AE + G Y+L FNGK+LA I +
Sbjct: 830 YVVATSLVYPEEPEPKVGRIIIFHYNDNKLTQVAETKVDGTCYALVEFNGKVLAGIGSFV 889
Query: 792 QLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEE 851
+LY+W +EL+ EC I AL+++ +GDFI+VGDLM+SI+LL +K EG E
Sbjct: 890 RLYEWT----NEKELRMECNIQNMIAALFLKAKGDFILVGDLMRSITLLQHKQMEGIFVE 945
Query: 852 RARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFV 911
ARD WM AVEILDDD +LG E + NLF +K+S TDEER L + +HLG+ +
Sbjct: 946 IARDCEPKWMRAVEILDDDTFLGCETHDNLFVCQKDSAATTDEERQLLPELARFHLGDTI 1005
Query: 912 NRFRHGSLVMRLPDSDVGQIPT-----VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRK 966
N FRHGSLVM+ +VG+ T V++GT NG IG++ +P + Y FL L+ L+K
Sbjct: 1006 NVFRHGSLVMQ----NVGERTTPINGCVLYGTCNGAIGIVTQIPQDFYDFLHGLEERLKK 1061
Query: 967 VIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDE----ISKTMN--- 1019
+IK VG ++H +R++ K ++ F+DGDLIESFLDL+R +M E + TM
Sbjct: 1062 IIKSVGKIDHTYYRNYQINTKVEPSEGFIDGDLIESFLDLNREKMREAVLGLELTMGGER 1121
Query: 1020 --VSVEELCKRVEELTRLH 1036
VE++ K VE+LTR+H
Sbjct: 1122 KAADVEDVIKIVEDLTRMH 1140
>gi|281208174|gb|EFA82352.1| UV-damaged DNA binding protein1 [Polysphondylium pallidum PN500]
Length = 1054
Score = 932 bits (2410), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 487/1054 (46%), Positives = 706/1054 (66%), Gaps = 77/1054 (7%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+LDV IYGRI+ + + G+ QD LFI TE+YK+C+L +++ES EL+T A GD IG
Sbjct: 51 VLDVNIYGRISDMRILTATGDKQDRLFILTEKYKYCILAFNSESRELVTIATGDAEGTIG 110
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RP + GQIGI+DP+CR+IG+HLY+GLF+V+P ++ ++E+F++R+E+LQ++D+ FL C
Sbjct: 111 RPAEAGQIGIVDPECRMIGMHLYEGLFRVVPLEHGQPVRESFSMRIEQLQIVDMVFLKQC 170
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
AKPT+ +L++D +DARH+ TY + + K+ +EG SQ++++ + +L+P+ +LI+
Sbjct: 171 AKPTLALLFKDTRDARHIVTYSIDVVTKELIEGA-SQDSVEENSTMLVPLDNG--AMLIV 227
Query: 181 GEETIVYCSANAFKAIPIRPSITK----AYGRVDADGSRYLLGDHAGLLHLLVITHEKEK 236
GE I Y + + P+ SI AY ++D D R+LL D G
Sbjct: 228 GEMAITYMNLKG-NSQPVTISIDHTHIVAYEQIDRD--RFLLADDCGSF----------- 273
Query: 237 VTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAK-GSYVEVLERYVN 295
VVY+GSS GDSQLI+LN D GSY+ V++++ N
Sbjct: 274 -----------------------GVVYVGSSSGDSQLIRLNSHIDPNTGSYISVIDQFTN 310
Query: 296 LGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRS 355
LGPI DFCVVD+E+QGQGQ+VTCSG ++DGSLRI+RNGIGI EQAS+EL GI+G+WSL +
Sbjct: 311 LGPITDFCVVDVEKQGQGQLVTCSGTFQDGSLRIIRNGIGIAEQASIELPGIRGLWSLSN 370
Query: 356 STDDPF-DTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAI-YNQLVQVT 413
+++ L+VSFI+ T++L + E E+EETEI GF S TL+C + N +Q+
Sbjct: 371 NSNPSSLHRHLIVSFINSTKVLTFSGE-EIEETEIAGFDSNATTLYCGNTTENNHFIQIA 429
Query: 414 SGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKH 473
+ + LV S+S +R + +P S+N+A+ N SQ+L++ G +L YLEI D L K
Sbjct: 430 TSGIYLVDSSSL-MRLDQYTPEKGSINLASCNGSQILISQGS-NLTYLEISDSKLIIKKE 487
Query: 474 AQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSV 533
AQL+YEISCLDI+ + + + S + AVG+WTDISVRI LP+LN + KE LGGEI+PRS+
Sbjct: 488 AQLQYEISCLDISLL-DGFTSSPVCAVGLWTDISVRILQLPNLNEVCKETLGGEILPRSI 546
Query: 534 LLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFA 593
L FEG +YLLC+LGDGHL NF ++ L +RKK+SLGT PI L N+T+VFA
Sbjct: 547 LFITFEGTNYLLCSLGDGHLFNFTFDVVENLLQERKKLSLGTTPILL------NSTNVFA 600
Query: 594 ASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLH 653
+SDRPTVIYS+NKKLLYS +N+K VSH+C FNS AF DS+AIA E L IGTID+IQKLH
Sbjct: 601 SSDRPTVIYSNNKKLLYSAINMKVVSHVCSFNSEAFRDSIAIATESSLVIGTIDEIQKLH 660
Query: 654 IRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESE---------------MHFVRLLD 698
IR++PLGE RRI + E+ ++A+ +++ + ++V+LL+
Sbjct: 661 IRNVPLGEMARRITYVEEYHSYAVITIQRNDGNNNNNDNDNFNNNNNNGVPLTNYVKLLN 720
Query: 699 DQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVED 758
+QTFE S Y L +FE+G SI++C F +D +Y VGTA+ E++ +KGRILVF +ED
Sbjct: 721 EQTFETTSKYALKSFEFGWSIVTCRFKNDDALYVVVGTAFHNEVESQQSKGRILVFRIED 780
Query: 759 GKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILA 818
+L L+ E VY L FNG+LLA IN+++Q + W + T +L + GH L+
Sbjct: 781 NRLILLDEVALPACVYCLLPFNGRLLAGINKRVQAFNWGV---DTNKLTKAESYSGHTLS 837
Query: 819 LYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENN 878
+ +RG F++V DLMKS++LL+ + ++GAI+E AR+ W+S +E++DD+ ++G +N+
Sbjct: 838 HSMVSRGHFVLVADLMKSMTLLV-EDQQGAIKELARNPLPIWLSRIEMIDDETFIGGDNS 896
Query: 879 FNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGT 938
+NL V+KN+E +++ + L+ VG++HLGE +N+F+HGSLV PD D ++PT++FGT
Sbjct: 897 YNLIVVQKNAEASSEIDNELLDTVGQFHLGETINKFKHGSLVTS-PDMDSPKLPTILFGT 955
Query: 939 VNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGD 998
V+G IGVI S+ + Y F EKLQ L +V+ GVGGL E WRSF+ E T+ +KNF+DGD
Sbjct: 956 VSGAIGVIVSISKDDYEFFEKLQKGLNRVVHGVGGLPFENWRSFSTEHMTIPSKNFIDGD 1015
Query: 999 LIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
LIE+FLDL +M E K MN+S+E+ +R+E L
Sbjct: 1016 LIETFLDLRHDKMLEAIKDMNISIEDTYRRIESL 1049
>gi|166240328|ref|XP_637896.2| UV-damaged DNA binding protein1 [Dictyostelium discoideum AX4]
gi|238064940|sp|B0M0P5.1|DDB1_DICDI RecName: Full=DNA damage-binding protein 1; AltName: Full=DNA repair
protein E; AltName: Full=UV-damaged DNA-binding protein 1
gi|165988543|gb|EAL64385.2| UV-damaged DNA binding protein1 [Dictyostelium discoideum AX4]
Length = 1181
Score = 929 bits (2402), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 499/1138 (43%), Positives = 719/1138 (63%), Gaps = 114/1138 (10%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
M DV IYGRI+ L+LF G QD+LFI+TE +KFC+L +D E E+IT+A G+ D IG
Sbjct: 51 MFDVNIYGRISVLKLFSVAGSKQDYLFISTESFKFCILAYDYEKKEIITKASGNAEDTIG 110
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQL---------------------- 98
RPT+ GQ+GIIDPD R++ LHLY+GL K+I DN
Sbjct: 111 RPTEAGQLGIIDPDGRIVALHLYEGLLKLITLDNNNTPNKINNNNNNNNNNNNNNNNNNN 170
Query: 99 --------------------KEAFNIRLEELQVLDIKFLYGCAKPTIVVLYQDNKDARHV 138
K N+RLEELQVLD+ FLYGC PTI VL++D KD +H+
Sbjct: 171 NNINNNNFNINNNNNNSPIQKNVNNVRLEELQVLDMTFLYGCKVPTIAVLFKDTKDEKHI 230
Query: 139 KTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPI 198
TYE++ KD + V GPWSQ+N+ + LL+PVP L GVL++ + I Y + +++ +
Sbjct: 231 STYEISSKDTELVVGPWSQSNVGVYSSLLVPVP--LGGVLVVADNGITYLNGKVTRSVAV 288
Query: 199 RPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLD 258
+ A+ RVD DGSR+L GDH G L +LV+ H+++KV LK E LG SI S+ISYLD
Sbjct: 289 SYTKFLAFTRVDKDGSRFLFGDHFGRLSVLVLIHQQQKVMELKFEQLGRISIPSSISYLD 348
Query: 259 NAVVYIGSSYGDSQLIKLNLQPD-AKGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVT 317
+ VVYIGSS GDSQLI+LN + D SYV LE + N+GP+VDFCVVD E+QGQ Q+VT
Sbjct: 349 SGVVYIGSSSGDSQLIRLNTEKDQTTDSYVTYLEAFTNIGPVVDFCVVDAEKQGQAQIVT 408
Query: 318 CSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL---------------------RSS 356
CSG Y+DGSLRI+RNGIGI EQAS+EL+GIKG++ + +
Sbjct: 409 CSGTYRDGSLRIIRNGIGIAEQASIELEGIKGIFPINNNNNNNNNNNNNNNNNNNNNSNG 468
Query: 357 TDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCH--DAIYNQLVQVTS 414
D D +L+ SFI T++L+ +E+EETE EG S TL+C D + N L+Q+T+
Sbjct: 469 ITDSKDRYLITSFIECTKVLSFQ-GEEIEETEFEGLESNCSTLYCGTIDKL-NLLIQITN 526
Query: 415 GSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI--GDGILTEVK 472
S+ L+ S + + ++W P +N+ + N Q++L+ L+Y +I + + VK
Sbjct: 527 VSINLIDSNTFKRVSQWNVEPSRRINLVSTNQDQIVLSIDKS-LLYFQINSSNKSIQLVK 585
Query: 473 HAQLEYEISCLDINPIGE-NPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPR 531
+L +EISC+DI+P + SQ+ +VG+W DI++RIF LP L I KE LGGEI+PR
Sbjct: 586 EIELPHEISCIDISPFDSFMDTKSQLVSVGLWNDITLRIFKLPTLEEIWKEPLGGEILPR 645
Query: 532 SVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHV 591
S+L+ +F+ I Y+ C+LGDGHL F + + +L D++K++LGTQPI L+ F KNT ++
Sbjct: 646 SILMISFDSIDYIFCSLGDGHLFKFQFDFSSFKLFDKRKLTLGTQPIILKKFKLKNTINI 705
Query: 592 FAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQK 651
FA SDRPTVIYS NKKL YS VNLK+V+++ FNS FP+S+AIA LTIGTID+IQK
Sbjct: 706 FAISDRPTVIYSHNKKLFYSVVNLKDVTNVTSFNSDGFPNSMAIATTNSLTIGTIDEIQK 765
Query: 652 LHIRSIPLGEH-PRRICHQEQSRTFAICSLKNQS---------CAEESEMHFVRLLDDQT 701
LHI++IPL E RRI H E +A+ ++KN C E+ E+ ++R+ +DQT
Sbjct: 766 LHIKTIPLNEEMGRRIVHLEDHSCYAVITVKNNEGLLGGAQDLCEEDEEVSYIRIYNDQT 825
Query: 702 FEFISTYPLDTFEYGCSILSCSFS-DDSNVYYCVGTAYVLPEENEPTKGRILVFIV---- 756
FE IS+Y LD +E G SI C F+ DD N Y VGT+ P + + GR+L+F +
Sbjct: 826 FELISSYKLDPYEMGWSITPCKFAGDDVNTYLAVGTSINTPIK---SSGRVLLFSLSSSS 882
Query: 757 -------------------EDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWM 797
+GKL L+ E + + +VY L +FNG+L+AA+++++ ++
Sbjct: 883 SSNDKDSLDNNNNNNNNSGANGKLTLLEEIKFRSSVYFLLSFNGRLIAAVHKRLFSIRYT 942
Query: 798 -LRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDY 856
++ + + SE H GH + L + +RG FI+VGD+MKS+SLL+ + +G++E+ AR+
Sbjct: 943 HSKEKNCKVISSESVHKGHTMILKLASRGHFILVGDMMKSMSLLV-EQSDGSLEQIARNP 1001
Query: 857 NANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRH 916
W+ +V +++DD ++GAE + N V+KN++ + ER L+ VG YH+GE +N RH
Sbjct: 1002 QPIWIRSVAMINDDYFIGAEASNNFIVVKKNNDSTNELERELLDSVGHYHIGESINSMRH 1061
Query: 917 GSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNH 976
GSLV RLPDSD IPT+++ +VNG IGV+AS+ E ++F KLQ L +V++GVGG +H
Sbjct: 1062 GSLV-RLPDSDQPIIPTILYASVNGSIGVVASISEEDFIFFSKLQKGLNQVVRGVGGFSH 1120
Query: 977 EQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTR 1034
E WR+F+N+ T+D+KNF+DGDLIE+FLDL + + ++ ++ +R+E L +
Sbjct: 1121 ETWRAFSNDHHTIDSKNFIDGDLIETFLDLKYESQLKAVADLGITPDDAFRRIESLMQ 1178
>gi|1399512|gb|AAC47162.1| repE [Dictyostelium discoideum]
Length = 1139
Score = 929 bits (2401), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 499/1138 (43%), Positives = 719/1138 (63%), Gaps = 114/1138 (10%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
M DV IYGRI+ L+LF G QD+LFI+TE +KFC+L +D E E+IT+A G+ D IG
Sbjct: 9 MFDVNIYGRISVLKLFSVAGSKQDYLFISTESFKFCILAYDYEKKEIITKASGNAEDTIG 68
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQL---------------------- 98
RPT+ GQ+GIIDPD R++ LHLY+GL K+I DN
Sbjct: 69 RPTEAGQLGIIDPDGRIVALHLYEGLLKLITLDNNNTPNKINNNNNNNNNNNNNNNNNNN 128
Query: 99 --------------------KEAFNIRLEELQVLDIKFLYGCAKPTIVVLYQDNKDARHV 138
K N+RLEELQVLD+ FLYGC PTI VL++D KD +H+
Sbjct: 129 NNINNNNFNINNNNNNSPIQKNVNNVRLEELQVLDMTFLYGCKVPTIAVLFKDTKDEKHI 188
Query: 139 KTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPI 198
TYE++ KD + V GPWSQ+N+ + LL+PVP L GVL++ + I Y + +++ +
Sbjct: 189 STYEISSKDTELVVGPWSQSNVGVYSSLLVPVP--LGGVLVVADNGITYLNGKVTRSVAV 246
Query: 199 RPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLD 258
+ A+ RVD DGSR+L GDH G L +LV+ H+++KV LK E LG SI S+ISYLD
Sbjct: 247 SYTKFLAFTRVDKDGSRFLFGDHFGRLSVLVLIHQQQKVMELKFEQLGRISIPSSISYLD 306
Query: 259 NAVVYIGSSYGDSQLIKLNLQPD-AKGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVT 317
+ VVYIGSS GDSQLI+LN + D SYV LE + N+GP+VDFCVVD E+QGQ Q+VT
Sbjct: 307 SGVVYIGSSSGDSQLIRLNTEKDQTTDSYVTYLEAFTNIGPVVDFCVVDAEKQGQAQIVT 366
Query: 318 CSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL---------------------RSS 356
CSG Y+DGSLRI+RNGIGI EQAS+EL+GIKG++ + +
Sbjct: 367 CSGTYRDGSLRIIRNGIGIAEQASIELEGIKGIFPINNNNNNNNNNNNNNNNNNNNNSNG 426
Query: 357 TDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCH--DAIYNQLVQVTS 414
D D +L+ SFI T++L+ +E+EETE EG S TL+C D + N L+Q+T+
Sbjct: 427 ITDSKDRYLITSFIECTKVLSFQ-GEEIEETEFEGLESNCSTLYCGTIDKL-NLLIQITN 484
Query: 415 GSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI--GDGILTEVK 472
S+ L+ S + + ++W P +N+ + N Q++L+ L+Y +I + + VK
Sbjct: 485 VSINLIDSNTFKRVSQWNVEPSRRINLVSTNQDQIVLSIDKS-LLYFQINSSNKSIQLVK 543
Query: 473 HAQLEYEISCLDINPIGE-NPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPR 531
+L +EISC+DI+P + SQ+ +VG+W DI++RIF LP L I KE LGGEI+PR
Sbjct: 544 EIELPHEISCIDISPFDSFMDTKSQLVSVGLWNDITLRIFKLPTLEEIWKEPLGGEILPR 603
Query: 532 SVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHV 591
S+L+ +F+ I Y+ C+LGDGHL F + + +L D++K++LGTQPI L+ F KNT ++
Sbjct: 604 SILMISFDSIDYIFCSLGDGHLFKFQFDFSSFKLFDKRKLTLGTQPIILKKFKLKNTINI 663
Query: 592 FAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQK 651
FA SDRPTVIYS NKKL YS VNLK+V+++ FNS FP+S+AIA LTIGTID+IQK
Sbjct: 664 FAISDRPTVIYSHNKKLFYSVVNLKDVTNVTSFNSDGFPNSMAIATTNSLTIGTIDEIQK 723
Query: 652 LHIRSIPLGEH-PRRICHQEQSRTFAICSLKNQS---------CAEESEMHFVRLLDDQT 701
LHI++IPL E RRI H E +A+ ++KN C E+ E+ ++R+ +DQT
Sbjct: 724 LHIKTIPLNEEMGRRIVHLEDHSCYAVITVKNNEGLLGGAQDLCEEDEEVSYIRIYNDQT 783
Query: 702 FEFISTYPLDTFEYGCSILSCSFS-DDSNVYYCVGTAYVLPEENEPTKGRILVFIV---- 756
FE IS+Y LD +E G SI C F+ DD N Y VGT+ P + + GR+L+F +
Sbjct: 784 FELISSYKLDPYEMGWSITPCKFAGDDVNTYLAVGTSINTPIK---SSGRVLLFSLSSSS 840
Query: 757 -------------------EDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWM 797
+GKL L+ E + + +VY L +FNG+L+AA+++++ ++
Sbjct: 841 SSNDKDSLDNNNNNNNNSGANGKLTLLEEIKFRSSVYFLLSFNGRLIAAVHKRLFSIRYT 900
Query: 798 -LRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDY 856
++ + + SE H GH + L + +RG FI+VGD+MKS+SLL+ + +G++E+ AR+
Sbjct: 901 HSKEKNCKVISSESVHKGHTMILKLASRGHFILVGDMMKSMSLLV-EQSDGSLEQIARNP 959
Query: 857 NANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRH 916
W+ +V +++DD ++GAE + N V+KN++ + ER L+ VG YH+GE +N RH
Sbjct: 960 QPIWIRSVAMINDDYFIGAEASNNFIVVKKNNDSTNELERELLDSVGHYHIGESINSMRH 1019
Query: 917 GSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNH 976
GSLV RLPDSD IPT+++ +VNG IGV+AS+ E ++F KLQ L +V++GVGG +H
Sbjct: 1020 GSLV-RLPDSDQPIIPTILYASVNGSIGVVASISEEDFIFFSKLQKGLNQVVRGVGGFSH 1078
Query: 977 EQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTR 1034
E WR+F+N+ T+D+KNF+DGDLIE+FLDL + + ++ ++ +R+E L +
Sbjct: 1079 ETWRAFSNDHHTIDSKNFIDGDLIETFLDLKYESQLKAVADLGITPDDAFRRIESLMQ 1136
>gi|194389106|dbj|BAG61570.1| unnamed protein product [Homo sapiens]
Length = 1009
Score = 928 bits (2399), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 475/916 (51%), Positives = 623/916 (68%), Gaps = 64/916 (6%)
Query: 179 IIGEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE 235
IIG+E+I Y + + + AI I+ S + RVD +GSRYLLGD G L +L++ E++
Sbjct: 100 IIGQESITYHNGDKYLAIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQ 159
Query: 236 -----KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVL 290
+ L++ELLGETSIA ++YLDN VV++GS GDSQL+KLN+ + +GSYV +
Sbjct: 160 MDGTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAM 219
Query: 291 ERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGM 350
E + NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+
Sbjct: 220 ETFTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGL 279
Query: 351 WSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLV 410
W LRS + D LV+SF+ +TR+L +N E E+EETE+ GF QT FC + + QL+
Sbjct: 280 WPLRSDPNRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTFFCGNVAHQQLI 338
Query: 411 QVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTE 470
Q+TS SVRLVS + L +EWK P +++VA+ N+SQV++A G L YL+I L +
Sbjct: 339 QITSASVRLVSQEPKALVSEWKEPQAKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQ 397
Query: 471 VKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIP 530
+ H ++E+E++CLDI P+G++ S + A+G+WTDIS RI LP L+ KE LGGEIIP
Sbjct: 398 ISHTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIP 457
Query: 531 RSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTH 590
RS+L+ FE YLLCALGDG L F LN++TG L+DRKKV+LGTQP LRTF S +TT+
Sbjct: 458 RSILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTN 517
Query: 591 VFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQ 650
VFA SDRPTVIYSSN KL++SNVNLKEV++MCP NS +PDSLA+A LTIGTID+IQ
Sbjct: 518 VFACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQ 577
Query: 651 KLHIRSIPLGEHPRRICHQEQSRTFAICSLK----------------------------- 681
KLHIR++PL E PR+IC+QE S+ F + S +
Sbjct: 578 KLHIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQGTSGGTTALRPSASTQALSSSVSSS 637
Query: 682 ----------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVY 731
S EE E+H + ++D TFE + + EY S++SC D N Y
Sbjct: 638 KLFSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTY 697
Query: 732 YCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKI 791
+ VGTA V PEE EP +GRI+VF DGKLQ +AEKE KGAVYS+ FNGKLLA+IN +
Sbjct: 698 FIVGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTV 757
Query: 792 QLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEE 851
+LY+W +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK EG EE
Sbjct: 758 RLYEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEE 813
Query: 852 RARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFV 911
ARD+N NWMSAVEILDDD +LGAEN FN F +K+S TDEER L+ VG +HLGEFV
Sbjct: 814 IARDFNPNWMSAVEILDDDNFLGAENAFNSFVCQKDSAATTDEERQHLQEVGLFHLGEFV 873
Query: 912 NRFRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKG 970
N F HGSLVM+ L ++ +V+FGTVNG+IG++ SL Y L +Q L KVIK
Sbjct: 874 NVFCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKS 933
Query: 971 VGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN----------V 1020
VG + H WRSF+ E+KT A F+DGDLIESFLD+SR +M E+ +
Sbjct: 934 VGKIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREA 993
Query: 1021 SVEELCKRVEELTRLH 1036
+ ++L K VEELTR+H
Sbjct: 994 TADDLIKVVEELTRIH 1009
Score = 51.2 bits (121), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 37/67 (55%), Gaps = 7/67 (10%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW-------DAESSELITRAMGDV 55
+V +YG+IA +ELFRP GE++ LFI T +Y C+L++ D E IT GD
Sbjct: 54 EVGMYGKIAVMELFRPKGESKGLLFILTAKYNACILEYKQSGESIDIIGQESITYHNGDK 113
Query: 56 SDRIGRP 62
I P
Sbjct: 114 YLAIAPP 120
>gi|412992547|emb|CCO18527.1| predicted protein [Bathycoccus prasinos]
Length = 1275
Score = 922 bits (2382), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 514/1203 (42%), Positives = 718/1203 (59%), Gaps = 172/1203 (14%)
Query: 1 MLDVPIYGRIATLELF--RPHGEAQDF---LFIATE-RYKFCVLQWDAESSELITRAMGD 54
+L+ +YGRIA L+L R +G+ ++ L + TE RY VL +DA ++ L T + GD
Sbjct: 75 VLETTVYGRIAMLKLCEGRKNGDRENIPGRLVVVTEDRYSLAVLSYDAATNTLRTVSSGD 134
Query: 55 VSDRIGRPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDI 114
VSD GRP ++G +G ID + R+I LH+YDGL K IPFD G L +AFN L EL+VLD
Sbjct: 135 VSDLTGRPVEHGMLGDIDRNARVIALHIYDGLIKFIPFDENGNLLQAFNANLNELKVLDF 194
Query: 115 KFL----------------YG--------------------------------------C 120
FL YG
Sbjct: 195 VFLPSTSSSTSGIAIDASEYGNNNNNNNNNNASDMNITLSNVSSLTRSINRENAGGGGLS 254
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
+ P I VL++D KD RHV TY V L+D+D GP+ Q ++ G+ L+ + P+ G +++
Sbjct: 255 SMPVIAVLHKDVKDERHVSTYRVNLRDRDLEPGPFEQREVEIGSKKLLALSSPIGGCVVV 314
Query: 181 GEETIVY-----------------------------CSANAFKAIPIR---PSITKAYGR 208
GEETI Y S NA + R S T A R
Sbjct: 315 GEETIAYLNEPAGIMDDHENNNNKNNNTATKNATSTLSNNANSNVMFRGENKSSTSALVR 374
Query: 209 V-----------------DADGSRYL-----LGDHAGLLHLLVI------THEKEKVTGL 240
D + S L + D G L+LL++ T V+ L
Sbjct: 375 AITTPDLLMFSSACLIPSDENVSNQLVDRFLIADEDGDLYLLMLSKGTTSTMSTSIVSKL 434
Query: 241 KIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQ--------PDAKGSYVEVLER 292
IE LG+TS AS +SYLDN+VV+IGS+YGDSQ+IKL+ + + SYVEVLE
Sbjct: 435 TIERLGKTSSASAMSYLDNSVVFIGSAYGDSQIIKLHTEQVNTMMSASNELPSYVEVLEE 494
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
+ NLGPIVDFCV+DLER GQGQ+VTCSG GSLR++RNGIGI EQA + L GIKG++S
Sbjct: 495 FTNLGPIVDFCVMDLERHGQGQLVTCSGVGTSGSLRVIRNGIGIREQAQISLSGIKGLYS 554
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
+ P D++L+VSFI+ETRILA+N DELEE GF + +QT+ C + N + QV
Sbjct: 555 CKRDETMPLDSYLIVSFIAETRILAINDNDELEEAVFPGFDASSQTIECANISGNVICQV 614
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
TS V + ++S EL W + ++A+ + + +AT GG+L YL G ++E
Sbjct: 615 TSKGVYVCDASSGELVASWVPSDESPITASSASENTIAVATVGGNLHYLSFQGGKISETG 674
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
+ EISC+D G+ + AVG+W+ + + + + E+L +++PRS
Sbjct: 675 KMTFDAEISCIDKTQCGDR----NVCAVGLWSTKVLLVELGSGMKVCHTENLSLDVVPRS 730
Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTG----ELTDRKKVSLGTQPITLRTFSSKNT 588
L C FE YLL LGDGHL+ +++ + L+DRK VSLGTQP+TL+ F S+++
Sbjct: 731 TLFCFFEDTIYLLTGLGDGHLITNVVDSSSSAGGFALSDRKSVSLGTQPVTLKLFKSQHS 790
Query: 589 THVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDD 648
HVFA SDRPT+IYS++KKL+YSNVNL EV H+ PFN AFPD+LA+A LTIG +DD
Sbjct: 791 MHVFAGSDRPTIIYSNSKKLVYSNVNLNEVLHVAPFNCDAFPDALALASGEHLTIGAVDD 850
Query: 649 IQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTY 708
IQKLHIR++PL E PRRI HQ +++T A+ ++K + E FVRL D++TFE ++ Y
Sbjct: 851 IQKLHIRTVPLREQPRRIAHQPETKTLAVLTMKESDVPGQEEEFFVRLFDNKTFETLAKY 910
Query: 709 PLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLP-EENEPTKGRILVFIVE---------- 757
PL+ E SI+SCSF D ++Y+ VGTA+ P E E ++GRILVF V
Sbjct: 911 PLEPNENDASIISCSFDGDDDIYFVVGTAFADPHSEPESSRGRILVFKVSNTSSSGGGNA 970
Query: 758 --------DGK-----------LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWML 798
DG+ L L+ EKET+GAVY+LNAF GKLLA IN ++L+ W +
Sbjct: 971 VVNGNDHGDGRASASSSVLQKSLTLVCEKETRGAVYNLNAFCGKLLAGINSLVKLFNWGV 1030
Query: 799 RDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNA 858
+ REL EC H GHI+AL V+T+ + IVVGDLMKSI+LL Y+ E G IEE A D+++
Sbjct: 1031 SKENKRELVHECSHMGHIIALKVETKDNLIVVGDLMKSITLLQYQRESGRIEEVAHDFSS 1090
Query: 859 NWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGS 918
NWM+AVEILDD+ YLGAE+++NLFTV++N++ T+++RG LE+ G +HLG+ VNRFR GS
Sbjct: 1091 NWMTAVEILDDNTYLGAESSYNLFTVQRNADADTEDKRGTLELCGAFHLGDSVNRFRRGS 1150
Query: 919 LVMRLPD-----SDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGG 973
LVMR+PD S + +I T +FGT++G +GV+A+LP ++ L K+Q ++KV+ GVG
Sbjct: 1151 LVMRMPDLSDDTSSLSEISTWLFGTISGGLGVVATLPKRDFMLLNKVQEAMQKVVTGVGN 1210
Query: 974 LNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVS-VEELCKRVEEL 1032
+H +RSF+N +++V+ +NF+DGDL+E FLDLS+ +S+ VS E+L K++EE+
Sbjct: 1211 FSHSDFRSFHNVQRSVEMRNFIDGDLVEIFLDLSKEDQVAVSELSGVSNSEDLVKKIEEI 1270
Query: 1033 TRL 1035
+RL
Sbjct: 1271 SRL 1273
>gi|119594343|gb|EAW73937.1| damage-specific DNA binding protein 1, 127kDa, isoform CRA_e [Homo
sapiens]
Length = 896
Score = 917 bits (2371), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 466/886 (52%), Positives = 608/886 (68%), Gaps = 61/886 (6%)
Query: 206 YGRVDADGSRYLLGDHAGLLHLLVITHEKE-----KVTGLKIELLGETSIASTISYLDNA 260
+ RVD +GSRYLLGD G L +L++ E++ + L++ELLGETSIA ++YLDN
Sbjct: 17 HNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMDGTVTLKDLRVELLGETSIAECLTYLDNG 76
Query: 261 VVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSG 320
VV++GS GDSQL+KLN+ + +GSYV +E + NLGPIVD CVVDLERQGQGQ+VTCSG
Sbjct: 77 VVFVGSRLGDSQLVKLNVDSNEQGSYVVAMETFTNLGPIVDMCVVDLERQGQGQLVTCSG 136
Query: 321 AYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNL 380
A+K+GSLRI+RNGIGI+E AS++L GIKG+W LRS + D LV+SF+ +TR+L +N
Sbjct: 137 AFKEGSLRIIRNGIGIHEHASIDLPGIKGLWPLRSDPNRETDDTLVLSFVGQTRVLMLNG 196
Query: 381 EDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVN 440
E E+EETE+ GF QT FC + + QL+Q+TS SVRLVS + L +EWK P +++
Sbjct: 197 E-EVEETELMGFVDDQQTFFCGNVAHQQLIQITSASVRLVSQEPKALVSEWKEPQAKNIS 255
Query: 441 VATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAV 500
VA+ N+SQV++A G L YL+I L ++ H ++E+E++CLDI P+G++ S + A+
Sbjct: 256 VASCNSSQVVVAVGRA-LYYLQIHPQELRQISHTEMEHEVACLDITPLGDSNGLSPLCAI 314
Query: 501 GMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNM 560
G+WTDIS RI LP L+ KE LGGEIIPRS+L+ FE YLLCALGDG L F LN+
Sbjct: 315 GLWTDISARILKLPSFELLHKEMLGGEIIPRSILMTTFESSHYLLCALGDGALFYFGLNI 374
Query: 561 KTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSH 620
+TG L+DRKKV+LGTQP LRTF S +TT+VFA SDRPTVIYSSN KL++SNVNLKEV++
Sbjct: 375 ETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVFACSDRPTVIYSSNHKLVFSNVNLKEVNY 434
Query: 621 MCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSL 680
MCP NS +PDSLA+A LTIGTID+IQKLHIR++PL E PR+IC+QE S+ F + S
Sbjct: 435 MCPLNSDGYPDSLALANNSTLTIGTIDEIQKLHIRTVPLYESPRKICYQEVSQCFGVLSS 494
Query: 681 K---------------------------------------NQSCAEESEMHFVRLLDDQT 701
+ S EE E+H + ++D T
Sbjct: 495 RIEVQDTSGGTTALRPSASTQALSSSVSSSKLFSSSTAPHETSFGEEVEVHNLLIIDQHT 554
Query: 702 FEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKL 761
FE + + EY S++SC D N Y+ VGTA V PEE EP +GRI+VF DGKL
Sbjct: 555 FEVLHAHQFLQNEYALSLVSCKLGKDPNTYFIVGTAMVYPEEAEPKQGRIVVFQYSDGKL 614
Query: 762 QLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYV 821
Q +AEKE KGAVYS+ FNGKLLA+IN ++LY+W +EL++EC H+ +I+ALY+
Sbjct: 615 QTVAEKEVKGAVYSMVEFNGKLLASINSTVRLYEWTTE----KELRTECNHYNNIMALYL 670
Query: 822 QTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNL 881
+T+GDFI+VGDLM+S+ LL YK EG EE ARD+N NWMSAVEILDDD +LGAEN FNL
Sbjct: 671 KTKGDFILVGDLMRSVLLLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLGAENAFNL 730
Query: 882 FTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMR-LPDSDVGQIPTVIFGTVN 940
F +K+S TDEER L+ VG +HLGEFVN F HGSLVM+ L ++ +V+FGTVN
Sbjct: 731 FVCQKDSAATTDEERQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGSVLFGTVN 790
Query: 941 GVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLI 1000
G+IG++ SL Y L +Q L KVIK VG + H WRSF+ E+KT A F+DGDLI
Sbjct: 791 GMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERKTEPATGFIDGDLI 850
Query: 1001 ESFLDLSRTRMDEISKTMN----------VSVEELCKRVEELTRLH 1036
ESFLD+SR +M E+ + + ++L K VEELTR+H
Sbjct: 851 ESFLDISRPKMQEVVANLQYDDGSGMKREATADDLIKVVEELTRIH 896
>gi|322787057|gb|EFZ13281.1| hypothetical protein SINV_13198 [Solenopsis invicta]
Length = 986
Score = 914 bits (2361), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 462/943 (48%), Positives = 639/943 (67%), Gaps = 59/943 (6%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
+V IYG+IA ++ FRP E +D LF+ T RY +L+ + E E+IT+A G+V+DRIG
Sbjct: 54 EVGIYGKIAIIKFFRPPHEKKDLLFLLTTRYNAMILECIGEGEDIEIITKAHGNVADRIG 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
+ ++ G +IDP R+IGL LYDGLFK+IP D +A +IR++E QV D+ FL+GC
Sbjct: 114 KASETGIKAVIDPKARVIGLRLYDGLFKIIPLDKDNPELKASSIRMDEQQVQDVNFLHGC 173
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
A PT+++++QD + RHVKT+E+ L+DK+F + PW Q+N++ A ++IPVP P+CG +II
Sbjct: 174 ANPTLILIHQD-INGRHVKTHEINLRDKEFSKIPWRQDNVEREAMMVIPVPSPMCGAIII 232
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-- 235
G+E+I+Y N++ A+ I+ S Y +VD G RYLLGD AG L +L + EK+
Sbjct: 233 GQESILYHDGNSYVAVVPPIIKQSTITCYAKVDNQGLRYLLGDMAGHLFMLFLEQEKKPD 292
Query: 236 ---KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
V LK+ELLGE SI I+YLDN V+Y+GS GDSQLIKL + D GSY +E
Sbjct: 293 GTLSVKDLKVELLGEISIPECITYLDNGVIYVGSRLGDSQLIKLITKADENGSYCVPMET 352
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
+ NL PIVD VVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI E AS++L GIKGMW+
Sbjct: 353 FTNLAPIVDMAVVDLERQGQGQMVTCSGAFKEGSLRIIRNGIGIQEHASIDLPGIKGMWA 412
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
L+ FD LV+SF+ +TRIL +N E E+EET+I GF + QT + + +Q+
Sbjct: 413 LKVGGSH-FDNTLVLSFVGQTRILTLNGE-EVEETDIPGFVADEQTFHTGNVTNDLFIQI 470
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
T SVRL+S S+ + +EW+ +++V N +QVL ATG L Y+EI +
Sbjct: 471 TPTSVRLISYESKMVISEWEPQNKRTISVVACNGTQVLCATGND-LFYIEISCNQIVSKG 529
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
L++E++CLDI+P+ N + +I AVG+WTDISVRI LP+L I KE LGGEIIPRS
Sbjct: 530 FVTLQHEVACLDISPLDGNEA--KIVAVGLWTDISVRILILPNLEEINKELLGGEIIPRS 587
Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
+L+ FEG +YLLCALGDG + F+L+ +TG L+D+KKV+LGTQP LRTF S +TT+VF
Sbjct: 588 ILMTCFEGNTYLLCALGDGSMYYFILHKQTGILSDKKKVTLGTQPTVLRTFRSLSTTNVF 647
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
A SDRPTVIYSSN KL++SNVNLKEV+HMC N+ ++PDSLA+A + +TIGTID+IQKL
Sbjct: 648 ACSDRPTVIYSSNHKLVFSNVNLKEVNHMCSLNAESYPDSLALATDSTVTIGTIDEIQKL 707
Query: 653 HIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------- 681
HIR++PLGE PRRI +QE S+TF + +++
Sbjct: 708 HIRTVPLGESPRRIAYQESSQTFGVITMRVDVQESSGVSIVRHSASTQATSTSSSSHIAS 767
Query: 682 --------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
+E E+H + ++D TFE + + L EY S++S +D Y+
Sbjct: 768 HNKPSGHTASEIGQEIEVHNLLIIDQHTFEVLHAHTLMATEYALSLISTRLGEDPTSYFV 827
Query: 734 VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
VGTA++ P+E EP GRIL++ +GK +AEKE KG+ YSL FNGKLLA+IN ++L
Sbjct: 828 VGTAFINPDETEPKMGRILLYHWSEGKFTQVAEKEIKGSCYSLVEFNGKLLASINSTVRL 887
Query: 794 YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
++W +EL+ EC H +I+ALY++T+GDF++VGDLM+S++LL YK EG+ EE A
Sbjct: 888 FEWT----AEKELRLECSHFNNIIALYLKTKGDFVLVGDLMRSLTLLQYKTMEGSFEEIA 943
Query: 854 RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEER 896
RDYN NWM+++EILDDD +LGAEN FNLF +K+S +++ER
Sbjct: 944 RDYNPNWMTSIEILDDDTFLGAENCFNLFVCQKDSAATSEDER 986
>gi|195996153|ref|XP_002107945.1| hypothetical protein TRIADDRAFT_18324 [Trichoplax adhaerens]
gi|190588721|gb|EDV28743.1| hypothetical protein TRIADDRAFT_18324 [Trichoplax adhaerens]
Length = 1134
Score = 912 bits (2357), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 490/1101 (44%), Positives = 715/1101 (64%), Gaps = 83/1101 (7%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+LDV +YGRIA+++L RP E D LFI T RY+ C+LQ+ E+ +ITRA GD+ +R+
Sbjct: 52 LLDVGVYGRIASMQLIRPENENCDLLFILTCRYRVCILQYKPETKSIITRAYGDMKNRVS 111
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFD-NKGQLKEAFNIRLEELQVLDIKFLYG 119
RP++ G IGI+DPDC++I L LYDG K+IP + + + A ++RLEELQVLD+KFLYG
Sbjct: 112 RPSETGLIGIVDPDCKVICLKLYDGWLKLIPLELDTDKEMSAEDVRLEELQVLDVKFLYG 171
Query: 120 CAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLI 179
+PTI ++Y+ ++ R++KTYE++L++ D PW+ ++ A +++PVPPP CG+++
Sbjct: 172 FTEPTIALIYESGQN-RYLKTYEISLQNADIHRQPWNIGKVEEEAFMILPVPPPSCGMVV 230
Query: 180 IGEETIVYCSANAFKAIPIRPSITK----AYGRVDADGSRYLLGDHAGLLHLLVITHEKE 235
IG +I Y ++ I P+ K +GRVD++G RYLLGD++G L +L++ E
Sbjct: 231 IGAGSISYYKGQ--DSLHITPASLKDRITCFGRVDSNGCRYLLGDYSGRLFMLILVQEHS 288
Query: 236 ----KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIK-LNLQPDAK-GSYVEV 289
KV L +E LGETSI S I+YLDNA YIGSS GDSQLIK LN PD++ SY++V
Sbjct: 289 QSGIKVKDLCLEYLGETSIPSCITYLDNAFAYIGSSCGDSQLIKVLNTSPDSETDSYIDV 348
Query: 290 LERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKG 349
++ + NLGPI+D VDL++QGQ Q+VTCSG K+ SLR++RNGIGI+E A+++L I G
Sbjct: 349 IDNFTNLGPIIDMVSVDLDKQGQSQLVTCSGFGKNASLRVLRNGIGIHELANIDLDHICG 408
Query: 350 MWSLR--SSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYN 407
+W LR S + +D LV+SF +R L + E+EET+I GF +T F + ++
Sbjct: 409 IWRLRTVSRSISEYDDVLVLSFAGHSRFLKFDGR-EVEETDISGFDDYKETDFAANVAFD 467
Query: 408 QLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGI 467
Q+VQ+++ SVRL R L EWK P G +++ +TA +Q+++A+G L YLEIG+G
Sbjct: 468 QIVQISNESVRLAGCDGRGLLQEWKPPNGKTISKSTAGNTQIMVASGC-ELFYLEIGEGE 526
Query: 468 LTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGE 527
L +V + LE++I+C+DI+ + ++ +QI AVG+W D+S R+ LP+L L+ E LGG+
Sbjct: 527 LKQVSNISLEHDIACIDIS-LKDDNERAQICAVGLWVDMSARLLLLPNLQLMLTESLGGD 585
Query: 528 IIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKN 587
IIPRS++L F+ YLL A+GDG L +L+N T LT+RK V+LG L TF S +
Sbjct: 586 IIPRSIMLNRFDNEIYLLVAMGDGTLAYYLVNTTTCSLTNRKSVNLGVVHSNLYTFKSGS 645
Query: 588 TTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTID 647
++VFA SDRPTVIY +N KL++SNVNLK+V+ M PF+S +FP+SLA+ + IGTID
Sbjct: 646 ISNVFACSDRPTVIYINNHKLVFSNVNLKKVNFMSPFHSESFPNSLALVNDSGFIIGTID 705
Query: 648 DIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQ------------------------ 683
+IQKLHIR+ PLGE R QE+S++F I + + +
Sbjct: 706 EIQKLHIRTKPLGETTR----QEESQSFGIITCRTEVPSEDDKNFVPTHQSASLLVSNRT 761
Query: 684 SCAEESE---------------MHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDS 728
C E+S+ + V ++D + + L E+G S++SC+F +D
Sbjct: 762 MCPEQSDNSSSTFDSDTLSEKNIDSVLIIDQHSLDAQCALQLQDCEWGMSLISCTFENDP 821
Query: 729 NVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAIN 788
YYCVGTA+V E+ EPTKG I + +GK+Q + KE GAVY + AFNG+LLA++N
Sbjct: 822 EAYYCVGTAFVNLEDKEPTKGNIRILKYFEGKIQQVHSKEVSGAVYCMVAFNGRLLASVN 881
Query: 789 QKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGA 848
+ +Y+W +EL E H ++LALY++T+GDFI++GDLM+SISL Y+
Sbjct: 882 STVSVYEWT----SNKELVEETSFHNNVLALYLKTKGDFILIGDLMRSISLCAYRPMNNE 937
Query: 849 IEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLG 908
IE ++ + NWM+AVEI+DDD YLG EN+ NLFT +KNS +++EE+ L VG YH+G
Sbjct: 938 IELICKNNDPNWMTAVEIIDDDSYLGGENSHNLFTCQKNS-SSSEEEQKHLPTVGVYHVG 996
Query: 909 EFVNRFRHGSLVMR----LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNL 964
EFVN FR GSLVM+ +PDS G I +FGTV+G +GV+ +L + F+ + L
Sbjct: 997 EFVNVFRQGSLVMQNTVDIPDSVQGSI---LFGTVSGAVGVVVTLAPAMFEFVSAIANKL 1053
Query: 965 RKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVS--- 1021
V+KGVG + H+ WRSF+N++KT ++F+DGDL+ESFLDLS M ++ + +
Sbjct: 1054 STVVKGVGKIEHQFWRSFSNDRKTEPCQSFVDGDLVESFLDLSPEDMQRVANGLTIQTAD 1113
Query: 1022 ------VEELCKRVEELTRLH 1036
VE++ K VEEL+R+H
Sbjct: 1114 GTRPAMVEDVLKTVEELSRIH 1134
>gi|47230701|emb|CAF99894.1| unnamed protein product [Tetraodon nigroviridis]
Length = 953
Score = 910 bits (2351), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 487/1018 (47%), Positives = 651/1018 (63%), Gaps = 135/1018 (13%)
Query: 88 KVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGCAKPTIVVLYQDNKDARHVKTYEVALKD 147
K+IP D + +A+NIRLEELQV+D+ FLYGC PT+ +YQD + RHVKTYEV+L D
Sbjct: 2 KMIPLDGDNREWKAYNIRLEELQVIDVHFLYGCQAPTVCFIYQDPQ-GRHVKTYEVSLAD 60
Query: 148 KDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIP---IRPSITK 204
K+F +GPW Q N++ A ++IPVP P G +IIG+E+I Y + + + AI I+ S
Sbjct: 61 KEFNKGPWKQENVEAEASMVIPVPEPFGGAIIIGQESITYHNGDKYLAIAPPTIKQSTIV 120
Query: 205 AYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTG-------LKIELLGETSIASTISYL 257
+ RVD +GSRYLLGD G L +L++ EKE++ L +ELLGETSIA ++YL
Sbjct: 121 CHNRVDPNGSRYLLGDMEGRLFMLLL--EKEELMDGTVALKDLHVELLGETSIAECLTYL 178
Query: 258 DNAVVYIGSSYGDSQLIK---------LNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLE 308
DN VV++GS GDSQL+K LN+ + +GS+V V+E + NLGPIVD CVVDLE
Sbjct: 179 DNGVVFVGSRLGDSQLVKVRVTHSLSELNVDSNDQGSFVTVMETFTNLGPIVDMCVVDLE 238
Query: 309 RQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDPFDTFLVVS 368
RQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W LRS D LV+S
Sbjct: 239 RQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWPLRSEAGRETDDMLVLS 298
Query: 369 FISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELR 428
F+ +TR+L ++ E E+EETE+ GF QT +C + +NQL+Q+TSGSVRLV S+ L
Sbjct: 299 FVGQTRVLMLSGE-EVEETELPGFVDNQQTFYCGNVAHNQLIQITSGSVRLVLQDSKALV 357
Query: 429 NEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDINPI 488
+EW+ P G +++VA N +QV+LA G L YL+I G L ++ ++E+E++CLDI P+
Sbjct: 358 SEWREPQGRNISVAACNHTQVVLAVGRA-LYYLQILAGELKQISTTEMEHEVACLDITPL 416
Query: 489 GENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEGISYLLCAL 548
GE + S + AVG+WTDIS R+ LP + KE LGGEIIPRS+L+ FEG YLLCAL
Sbjct: 417 GEAGAESPLCAVGLWTDISARVLKLPCFTALHKEMLGGEIIPRSILMTTFEGSYYLLCAL 476
Query: 549 GDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKL 608
GDG L F L+++TG L++RKKV+LGTQP LRTF S +T++VFA SDRPTVIYSSN KL
Sbjct: 477 GDGALFYFGLDLQTGALSERKKVTLGTQPTVLRTFRSLSTSNVFACSDRPTVIYSSNHKL 536
Query: 609 LYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICH 668
++SNVNLKEV++MCP NS +PDSLA+A LTIGTID+IQKLHIR++PL E PRRIC+
Sbjct: 537 VFSNVNLKEVNYMCPLNSEGYPDSLALANNSTLTIGTIDEIQKLHIRTVPLYESPRRICY 596
Query: 669 QEQSRTFAICSLKNQ---------------------------------------SCAEES 689
QE S+ F + S + + S EE
Sbjct: 597 QEVSQCFGVLSSRMEIQDVSGTTSPVRPSASTQALSSSVSSSKLFPSSTSPHESSFGEEV 656
Query: 690 EMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKG 749
E+H + ++D TFE + + EY S++SC D ++ +P E
Sbjct: 657 EVHNLLVVDQHTFEVLHAHQFLPSEYALSLVSCRLGKDPSL-------QTIPNE------ 703
Query: 750 RILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSE 809
VF +++LY+W +EL++E
Sbjct: 704 --CVF-----------------------------------QVRLYEWT----AEKELRTE 722
Query: 810 CGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDD 869
C H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK EG EE ARD+N NWMSA+EILDD
Sbjct: 723 CNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIARDFNPNWMSAIEILDD 782
Query: 870 DIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMR-LPDSDV 928
D +LGAEN FNLF +K+S TDEER L+ VG +HLGEFVN F HGSLV++ L ++
Sbjct: 783 DNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGVFHLGEFVNVFCHGSLVLQNLGETST 842
Query: 929 GQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKT 988
+V+FGTVNG+IG++ SL Y L LQ L KVIK WRSF+ E+KT
Sbjct: 843 PTQGSVLFGTVNGMIGLVTSLSEGWYSLLLDLQNRLNKVIKTT-------WRSFHTERKT 895
Query: 989 VDAKNFLDGDLIESFLDLSRTRMDEISKTMNV----------SVEELCKRVEELTRLH 1036
A F+DGDLIESFLDL R +M E+ T+ + +V+E+ K VEELTR+H
Sbjct: 896 EQATGFIDGDLIESFLDLGRAKMQEVVSTLQIDDGSGMKREATVDEVIKIVEELTRIH 953
>gi|328874742|gb|EGG23107.1| UV-damaged DNA binding protein1 [Dictyostelium fasciculatum]
Length = 1116
Score = 907 bits (2344), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 492/1084 (45%), Positives = 714/1084 (65%), Gaps = 80/1084 (7%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+L+ IYGRI+ ++LF HGE Q+ LFI TER+K+CVL +DAE ELITRA G+ D IG
Sbjct: 52 ILETNIYGRISNMKLFSVHGEKQELLFILTERFKYCVLSYDAEKKELITRASGEADDSIG 111
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RP D+GQ+ IIDPD +++ LHLY+GL K+IP + KG LK A+N+RL+ELQV+DI F+
Sbjct: 112 RPADSGQLAIIDPDSKVLALHLYEGLLKIIPIE-KGTLKTAYNVRLQELQVIDIVFIAPT 170
Query: 121 A-KPTIVVLYQDNKDARHVKTYEVALKDKDFVEGP--WSQNNLDNGADLLIPVPPPLCGV 177
+PT+ VL +D +D+++V TY ++LKDK+ +E W+Q+N+++GA LI G+
Sbjct: 171 NNQPTLAVLSKDTRDSKNVNTYILSLKDKELIESANTWAQSNVEDGASKLIAHQN---GL 227
Query: 178 LIIGEETIVYCS-ANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVIT-HEKE 235
L++GE + + ++ + IPI + A GRVD D ++L GDH G L+ L+++ +EKE
Sbjct: 228 LVVGETLVSFLKPTSSPRTIPIAGTRITAVGRVDKD--KFLFGDHLGQLYFLLLSQNEKE 285
Query: 236 KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAK-GSYVEVLERYV 294
L+ E LGET ASTISYLD+ VV+IGS+ GDSQLI+L D SYV VL+ +
Sbjct: 286 ----LRFEKLGETCTASTISYLDSGVVFIGSALGDSQLIRLLSDRDPNTNSYVTVLDTFA 341
Query: 295 NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLR 354
NLGPI DFC+VD+E+QGQ Q+V CSG +K+GSLR++RNGIGI EQAS++L GIK +WSL
Sbjct: 342 NLGPIPDFCLVDIEKQGQNQIVACSGGFKEGSLRVIRNGIGITEQASIDLPGIKAIWSLA 401
Query: 355 SSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTS 414
+D +L++SFIS T++L ED +EETEI GF Q+ TL+C + Q++Q+++
Sbjct: 402 RGSD----RYLILSFISSTKVLEFQGED-IEETEIAGFDLQSPTLYCGNVADKQILQIST 456
Query: 415 GSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHA 474
+ LV + + WK P S+N+A+ +Q+L++ G L+Y EI D + ++K
Sbjct: 457 SGIYLVDHETNLNYDVWK-PSSGSINLASHQGNQILISFGKT-LIYFEIKDQKIIKLKEL 514
Query: 475 QLEYEISCLDINP--IGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
++E+EISCLDI+ +GE S+I VG+WTDIS+RI SLP L + KE+LGGE+IPRS
Sbjct: 515 EMEFEISCLDISSFQVGER---SKICVVGLWTDISLRILSLPTLEQVHKENLGGEVIPRS 571
Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
VL+ AFEGI+YLLC+LGDGHL NF+++ L +RKKVSLG QPI L F K T +VF
Sbjct: 572 VLMIAFEGINYLLCSLGDGHLFNFIVDTINNTLHERKKVSLGNQPIILSKFQLKGTVNVF 631
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
A+SDRPTVIYSSN+KLLYS VNLK V +C F+S F D +AIA + L+IG IDDIQ+L
Sbjct: 632 ASSDRPTVIYSSNRKLLYSVVNLKVVVGVCSFDSEVFRDCIAIATDKTLSIGHIDDIQRL 691
Query: 653 HIRSIPL-GEHPRRICHQEQSRTFAICSLK-NQSCAEES--EMHFVRLLDDQTFEFISTY 708
HI+ I L GE RRI H E + +A+ +++ ++S EES E +F+R+++ QTFE I Y
Sbjct: 692 HIKKIDLGGEFARRIAHVEAHQCYALITIRPDESGLEESETEQNFIRIVNQQTFETIVKY 751
Query: 709 PLDTFEYGCSILSCSFS-----------------------------------------DD 727
PL EYG +ILS +F ++
Sbjct: 752 PLKENEYGWTILSTNFKETLTSNSSSTSSSTSTTSTTSTSGRRRLPTTTTTTTTSNDEEE 811
Query: 728 SNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAI 787
+ Y VGT Y + E GRILVF + D +L L+ E +G+++ + AFNG+LL AI
Sbjct: 812 QSEYIVVGTTYHCHDRKEC--GRILVFKMIDSRLILLDETTVRGSIFCMIAFNGQLLVAI 869
Query: 788 NQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEG 847
N+ + Y W D + +L E + GH +LY+ RGDF++VGD+MKS++LL ++
Sbjct: 870 NKSVHRYTWS-GDSSSGKLTGEEIYGGHTASLYLAGRGDFVLVGDMMKSMALLQASGKD- 927
Query: 848 AIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHL 907
++E +R W++ + +DDD YLG++N++NL ++KN+E A + + L+ +G H
Sbjct: 928 -VKELSRSSQPFWLTGLTFIDDDTYLGSDNSYNLILMKKNTETANEVDSQLLDNIGHIHT 986
Query: 908 GEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKV 967
GEF+NRF HG+L L D D + ++IF T++G IGVI+++ + Y F KLQ L +V
Sbjct: 987 GEFINRFHHGTLAT-LTDVDSPKPNSIIFATISGCIGVISTISKQDYDFFSKLQVGLNRV 1045
Query: 968 IKGVGGLNHEQWRSFNNEK-KTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELC 1026
I+G+GG +H++WRSF NE ++++NF+DGDL+E FL L +M E++K M++S+E+
Sbjct: 1046 IRGIGGFSHDRWRSFQNEHISNIESRNFIDGDLVEQFLHLRHDKMLEVTKDMDISIEDTY 1105
Query: 1027 KRVE 1030
K++E
Sbjct: 1106 KKIE 1109
>gi|157128866|ref|XP_001655232.1| DNA repair protein xp-e [Aedes aegypti]
gi|108882187|gb|EAT46412.1| AAEL002407-PA [Aedes aegypti]
Length = 980
Score = 905 bits (2340), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 445/933 (47%), Positives = 632/933 (67%), Gaps = 55/933 (5%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESS--ELITRAMGDVSDRIG 60
++ I G+IA ++LFRP +D +FI T RY +L+ + E+IT+A G+V+DR+G
Sbjct: 54 EIGINGKIAVMKLFRPAEAQKDLIFILTHRYNAMILECAVQGDDIEIITKAHGNVADRVG 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
+P + G + +IDP R+IG+ LY+GLFK+IP D +A ++R+EE+ V D++FLYG
Sbjct: 114 KPAETGILAVIDPKARVIGMRLYEGLFKIIPLDRDTHELKATSLRMEEVHVQDVEFLYGT 173
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PT++V++QD + RH+KT+E+ LKDKDF + W Q+N++ A +LIPVP PL G ++I
Sbjct: 174 QHPTLIVIHQD-LNGRHIKTHEINLKDKDFTKIAWKQDNVETEATMLIPVPTPLGGAIVI 232
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVI-THEKEK 236
G+E++VY +++ A+ I+ S Y RVD+ G RYLLG+ +G L ++ + T E K
Sbjct: 233 GQESVVYHDGDSYVAVAPAIIKQSTINCYARVDSKGFRYLLGNMSGHLFMMFLETEENSK 292
Query: 237 ----VTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
V +K+ELLG+ +I I+YLDN V++IGS +GDSQL+KLN G+YV V+E
Sbjct: 293 GLLSVKDIKVELLGDITIPECITYLDNGVLFIGSRHGDSQLVKLNTTAGDNGAYVTVMET 352
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
+ NL PI+D C+VDLE+QGQGQ++TCSG+YK+GSLRI+RNGIGI E A ++L GIKGMW+
Sbjct: 353 FTNLAPIIDMCIVDLEKQGQGQMITCSGSYKEGSLRIIRNGIGIQEHACIDLPGIKGMWA 412
Query: 353 LRSSTDD-PFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQ 411
LR DD P+D LV+SF+ TRIL ++ E E+EETEI GF S QT +C + + Q++Q
Sbjct: 413 LRVGIDDSPYDNTLVLSFVGHTRILTLSGE-EVEETEIPGFLSDQQTFYCANVDFGQIIQ 471
Query: 412 VTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEV 471
VT + RL+ ++ + EWK P ++V N+ Q++ AT + Y+EIG+ L
Sbjct: 472 VTPTTARLIQCDNKSMICEWKPPDDKRISVVACNSCQMVCATAC-DIYYIEIGESKLVHK 530
Query: 472 KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPR 531
L+YE++CLDI+P+ +N +++++ AVG+WTDIS I LP+L ++ E LGGEIIPR
Sbjct: 531 STVTLDYEVACLDISPLEDNATHAELVAVGLWTDISACILRLPNLEVVHTEKLGGEIIPR 590
Query: 532 SVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHV 591
S+L+ FEGI YLLCALGDG + F+L+ T LTD+KKV+LGTQP L+TF S +TT+V
Sbjct: 591 SILMAHFEGIVYLLCALGDGSMFYFVLDKNTNRLTDQKKVTLGTQPTILKTFRSLSTTNV 650
Query: 592 FAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQK 651
FA SDRPTVIYSSN KL++SNVNLKEV+HMC N+ A+ DSLA+A + + +GTID+IQK
Sbjct: 651 FACSDRPTVIYSSNHKLVFSNVNLKEVNHMCSLNAEAYQDSLALATKNSVILGTIDEIQK 710
Query: 652 LHIRSIPLGEHPRRICHQEQSRTFAICSLK------------------------------ 681
LHIR++PLGE PRRI +QE S+TF + +++
Sbjct: 711 LHIRTVPLGESPRRIAYQEASQTFGVITVRTDIQDSSGLTPSRQSASTQTTNVTLSTNMG 770
Query: 682 -------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCV 734
N +E E+H + ++D TFE + + EY S++S +D N YY V
Sbjct: 771 LLKAGASNAEFGQEVEVHNLLIIDQNTFEVLHAHQFMQTEYAMSLISAKLGNDPNTYYIV 830
Query: 735 GTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLY 794
GTA V PEE EP GRI+++ DG L ++EKE KG+ YSL FNG++LA+IN ++LY
Sbjct: 831 GTALVNPEEPEPKVGRIIIYHYADGNLTQVSEKEIKGSCYSLVEFNGRVLASINSTVRLY 890
Query: 795 KWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERAR 854
+W DD ++L+ EC H ++LALY +T+GDFI+VGDLM+SI+LL YK EG+ EE AR
Sbjct: 891 EWT--DD--KDLRLECSHFNNVLALYCKTKGDFILVGDLMRSITLLQYKQMEGSFEEIAR 946
Query: 855 DYNANWMSAVEILDDDIYLGAENNFNLFTVRKN 887
DY NWM+AVEILDDD +LGA+N+ NLF K+
Sbjct: 947 DYQPNWMTAVEILDDDAFLGADNSNNLFVCLKD 979
>gi|41055225|ref|NP_956920.1| DNA damage-binding protein 1 [Danio rerio]
gi|34784892|gb|AAH56837.1| Zgc:63840 [Danio rerio]
Length = 897
Score = 893 bits (2308), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 445/845 (52%), Positives = 594/845 (70%), Gaps = 56/845 (6%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
+V +YG+IA +ELFRP GE++D LFI T +Y C+L++ +S ++ITRA G+V DRIG
Sbjct: 54 EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGDSIDIITRAHGNVQDRIG 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RP++ G IGI+DP+CR+IGL LYDGLFKVIP D + + +AFNIRLEELQV+D++FLYGC
Sbjct: 114 RPSETGIIGIVDPECRMIGLRLYDGLFKVIPLDRENRELKAFNIRLEELQVIDVQFLYGC 173
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PT+ +YQD + RHVKTYEV+L++K+F +GPW Q N++ A ++IPVP P G +II
Sbjct: 174 QAPTVCFIYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIPVPEPFGGAIII 232
Query: 181 GEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKV 237
G+E+I Y + + + A+ I+ S + RVD +GSRYLLGD G L +L++ EKE++
Sbjct: 233 GQESITYHNGDKYLAVAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLL--EKEEL 290
Query: 238 TG-------LKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVL 290
L +ELLGETSIA ++YLDN VV++GS GDSQL+KLN+ + +GSYV V+
Sbjct: 291 MDGAVVLKDLHVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNDQGSYVGVM 350
Query: 291 ERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGM 350
E + NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+
Sbjct: 351 ETFTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGL 410
Query: 351 WSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLV 410
W LRS + D LV+SF+ +TR+L ++ E E+EETE++GF QT FC + + QL+
Sbjct: 411 WPLRSESSRDTDDMLVLSFVGQTRVLMLSGE-EVEETELQGFVDNQQTFFCGNVAHQQLI 469
Query: 411 QVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTE 470
Q+TS SVRLV+ S+ L +EWK P G +++VA+ N +QV+LA G L YL+I G L +
Sbjct: 470 QITSVSVRLVTQDSKALVSEWKEPQGRNISVASCNNTQVVLAVGR-VLYYLQILSGELKQ 528
Query: 471 VKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIP 530
+ ++E+E++CLDI P+GE + S I AVG+WTDIS R+ LP + KE LGGEIIP
Sbjct: 529 ISSTEMEHEVACLDITPLGERTADSCICAVGLWTDISARLLKLPCFTPLHKEMLGGEIIP 588
Query: 531 RSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTH 590
RS+L+ FEG YLLCALGDG L F L+++TG L++RKKV+LGTQP LRTF S +T++
Sbjct: 589 RSILMTTFEGSHYLLCALGDGALFYFGLDIQTGVLSERKKVTLGTQPTVLRTFRSLSTSN 648
Query: 591 VFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQ 650
VFA SDRPTVIYSSN KL++SNVNLKEV++MCP NS +PDSLA+A LTIGTID+IQ
Sbjct: 649 VFACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSEGYPDSLALANNSTLTIGTIDEIQ 708
Query: 651 KLHIRSIPLGEHPRRICHQEQSRTFAICSLK----------------------------- 681
KLHIR++PL E P+RIC+QE S+ F + S +
Sbjct: 709 KLHIRTVPLYESPKRICYQEVSQCFGVLSSRVEMQDASGTTAAVRPSASTQALSSSVSSS 768
Query: 682 ----------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVY 731
S EE E+H + ++D TFE + + EY S++SC D VY
Sbjct: 769 KLLPSSTSPHETSFGEEVEVHSLLVVDQHTFEVLHAHQFLQNEYALSMVSCKLGRDPAVY 828
Query: 732 YCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKI 791
+ VGTA V PEE EP +GRI+VF DGKLQ +AEKE KGAVYS+ FNGKLLA+IN +
Sbjct: 829 FIVGTAMVYPEEAEPKQGRIIVFHYTDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTV 888
Query: 792 QLYKW 796
+LY+W
Sbjct: 889 RLYEW 893
>gi|290998415|ref|XP_002681776.1| damage-specific DNA binding protein 1 [Naegleria gruberi]
gi|284095401|gb|EFC49032.1| damage-specific DNA binding protein 1 [Naegleria gruberi]
Length = 1103
Score = 879 bits (2272), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 469/1058 (44%), Positives = 689/1058 (65%), Gaps = 45/1058 (4%)
Query: 8 GRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDNGQ 67
GRI T+ LFRP GE QD LFI E F + D + + T + G + D +GR +++G
Sbjct: 59 GRIDTMSLFRPSGEKQDLLFITIEDTFFTLGFIDGK---IETLSSGSIDDPVGRRSESGS 115
Query: 68 IGIIDPDCRLIGLHLYDGLFKVIPFDN-KGQLKEAFNIRLEELQVLDIKFLYGCAK---- 122
I IDP CR + L +Y+GL K+IPF+N K Q KEAFN+RLEEL V+DI FL
Sbjct: 116 ITTIDPLCRAVALSIYEGLLKIIPFENNKHQFKEAFNVRLEELNVIDIAFLESLGSKSKS 175
Query: 123 -PTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIG 181
PT +LYQD+ +RHVKTYEV DKD E +Q N+D+GA++LIPVP PL GV+ +G
Sbjct: 176 GPTFALLYQDHVGSRHVKTYEVKTLDKDMEESSLNQLNVDHGANILIPVPAPLGGVICVG 235
Query: 182 EETIVY---CSANAFKAIPIRPSIT-KAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKV 237
E + Y + N A P + ++YG++D +R+ LGD +G L+LL + +V
Sbjct: 236 EAQVSYINESNKNHSVASPANSRMAIRSYGKLD--NTRWFLGDQSGQLYLLSLQVSDSEV 293
Query: 238 TGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLN--LQPDAKGSYVEVLERYVN 295
TGL ++ LG TSI+S ISYLDN V+IGS+YGDSQ+I+++ L P+ SY EV + Y N
Sbjct: 294 TGLTLKELGVTSISSCISYLDNGYVFIGSNYGDSQVIRISDELNPETN-SYFEVFQTYSN 352
Query: 296 LGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLR 354
+GPIVDFC VD ++QGQGQ+VTCSGA+KDGSLRI++NGIGI E ++ +L G+ +WSL+
Sbjct: 353 IGPIVDFCFVDADKQGQGQIVTCSGAFKDGSLRIIKNGIGIEELTTISDLVGLNRIWSLK 412
Query: 355 SSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTS 414
+ T + + +LV+SF +T I +++ E E+ E +I GF + T+ C I N +QVT
Sbjct: 413 TETGE--EKYLVMSFTGQTLISSVDNE-EIGEAKIPGFDVDSTTVLCDTVIGNNYLQVTD 469
Query: 415 GSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHA 474
+ RLVSS + EL +EWK P ++++A +N +Q++++ G G LVY+EI L +++
Sbjct: 470 KTARLVSSHTLELIDEWK-PSSGTISLAASNPTQLVVSLGEGKLVYIEISAQSLKQIQQT 528
Query: 475 QLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVL 534
+L+YE+SC+DI+P E S + AVGMWT+ISVR+ +LP +++T + LGGEI+PRS+L
Sbjct: 529 KLDYEVSCIDISP-EEGKISSTVCAVGMWTEISVRVLTLPSFDILTVQELGGEILPRSIL 587
Query: 535 LCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTT-HVFA 593
+ FEGI+YL+C LGDGHL F ++ G LTD+K++ G++PI LR F SK+ +VFA
Sbjct: 588 MPTFEGINYLMCGLGDGHLFTFKMDRGMGILTDKKRIVAGSKPIMLRPFISKDRQLNVFA 647
Query: 594 ASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLH 653
AS+RPT+IYS N+KL+YSNVNL EV+ MC FN +FP +AIA E + IG ID+IQKLH
Sbjct: 648 ASERPTIIYSRNQKLVYSNVNLGEVNDMCSFNHESFPFHIAIANENSILIGQIDEIQKLH 707
Query: 654 IRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTF 713
I+++ L P +ICHQ ++ + + + E E + LD+ FE S L+
Sbjct: 708 IKTVQLHAQPTKICHQPSTKAYGLLITQ----FENPETSAIFSLDENNFEKKSEIRLEGN 763
Query: 714 EYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAV 773
E G SI+SC F+DD N Y+ VGTA +E EP+KGRILV V+D KL L AEK+ KGAV
Sbjct: 764 ELGQSIISCKFTDDDNEYFIVGTAITEGDEEEPSKGRILVLQVQDDKLVLKAEKDVKGAV 823
Query: 774 YSLNAFNGKLLAAINQKIQLYKWMLRDDG-TRELQSECGHHGHILALYVQTRGDFIVVGD 832
L++FNGKLLA ++ ++ L+KW DDG ++L EC G I L + + GDFI++GD
Sbjct: 824 MVLHSFNGKLLAGVSGRLMLFKWAESDDGDNKDLVQECSCSGGIYILDIDSHGDFILIGD 883
Query: 833 LMKSISLLIY-----KHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKN 887
+MKS+ L +Y +H G + ++DY +W+S +L++ Y+ + N+ T++KN
Sbjct: 884 MMKSVHLFVYENPEEQHVSGNLRLISKDYQYSWLSCSLMLNESEYVAVDQQGNMITLKKN 943
Query: 888 SEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPD--SDVGQIP--TVIFGTVNGVI 943
E A++EER +L VG+Y+ + VNR + G + MR + SD+ P T +FGT++G I
Sbjct: 944 DEAASEEERKQLVRVGKYYCSDRVNRIQPGFIGMRFANSSSDINTQPVKTALFGTISGGI 1003
Query: 944 GVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESF 1003
GV+A LP E + F+ K+Q + V+ G+ ++ E +R + +E+ D+ F+DGD +ESF
Sbjct: 1004 GVLAQLPPETFAFVTKIQKAMSSVVTGLANISRETYRQYRSERTREDSVGFIDGDFVESF 1063
Query: 1004 LDL----SRTRMDEISKTM--NVSVEELCKRVEELTRL 1035
L+ + ++E+S +++EEL K +E+L+ L
Sbjct: 1064 LEFDFETQQRVIEELSNNHQEQITLEELVKNIEDLSHL 1101
>gi|391335522|ref|XP_003742140.1| PREDICTED: DNA damage-binding protein 1-like [Metaseiulus
occidentalis]
Length = 1154
Score = 873 bits (2256), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 484/1118 (43%), Positives = 677/1118 (60%), Gaps = 101/1118 (9%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSE---LITRAMGDVSDRI 59
+V IYGRI+ ++ FR E++D LF T++Y +L++ AESSE +ITR+ G VSD
Sbjct: 54 EVNIYGRISVMKKFRYPNESKDLLFFLTDKYNVAILEFSAESSENFEVITRSHGCVSDPY 113
Query: 60 GRPTDNGQIGIID-PDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLY 118
RP++ G + ++D P R+I L LYDGL K+IP + + + ++NIR+EE Q+ D+ FL
Sbjct: 114 ARPSEAGNLVVVDQPKARVIALRLYDGLLKMIPLNREAKELRSYNIRVEEAQITDMCFLS 173
Query: 119 GCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVL 178
+ ++ + + + RH+KT+ +AL+DK+ ++GPW Q NLD AD+LIPV GV+
Sbjct: 174 SSSSDPVLAIVYEEQQTRHMKTHVIALRDKELMKGPWGQRNLDLEADMLIPVEDTETGVI 233
Query: 179 IIGEETIVYCSANAFKAIPIRPSITKA-----YGRVDADGSRYLLGDHAGLLHLLVITHE 233
I+G ETIVY + I I+PS + Y R+D + ++LG G L +L + E
Sbjct: 234 IVGGETIVYHYGQDY--ICIQPSFLRTTKISCYCRIDNNRLVFILGGICGRLFILTLRRE 291
Query: 234 KEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERY 293
+KV ++LLG SI +SYLDN VV++GS GDSQLI+++ A+ ++EVLE Y
Sbjct: 292 NKKVVSHSLDLLGSVSIPECLSYLDNGVVFVGSRLGDSQLIRMH----AQEPFIEVLESY 347
Query: 294 VNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL 353
NLG I+D VVDLE+QGQ Q++TCSG GSLRI+RNGIGI+E A VEL GIKG+W+L
Sbjct: 348 TNLGAILDMIVVDLEKQGQDQLITCSGQGACGSLRIIRNGIGIHELACVELSGIKGIWAL 407
Query: 354 RSST-----DDPFDTFLVVSFISETRILAMNLEDELEETEI-EGFCSQTQTLFCHDAIYN 407
R +T D P D LV+SF+ +TR+ + +ELE+ + F +QT + + N
Sbjct: 408 RMNTAQLEEDTPTDDTLVLSFVGQTRVFNCSSTEELEQVTLPAAFDIDSQTFCARNVLGN 467
Query: 408 QLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGI 467
Q++QVT V L+S TS+ ++W P G + N QV LA LVYLEI DG
Sbjct: 468 QVIQVTDKRVNLISVTSKTRVDQWFPPEGEIITQCACNDVQVALALKNV-LVYLEIRDGS 526
Query: 468 LTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGE 527
LTE+K +LEYEI+C+D+N + + + I VG+WTD+S+ + SLPDL + ++ L +
Sbjct: 527 LTEIKRTRLEYEIACMDLNTLDKEGDQTSIITVGLWTDVSILVLSLPDLEQLFRQELPKD 586
Query: 528 IIPRSVLLCAFEGIS-YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSK 586
+IPRSVL FEG + YLLC L DG L + L TGELT +K+V+LGTQP TLR F S+
Sbjct: 587 VIPRSVLKITFEGSTDYLLCTLADGSLFYYHL-ASTGELTGQKRVTLGTQPTTLRKFRSQ 645
Query: 587 NTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTI 646
T VFA SDRPTVIYSS KL++S VNL+EV HMC F+S AFPDSLA+A E E IGTI
Sbjct: 646 RTWSVFACSDRPTVIYSSTSKLVFSKVNLREVKHMCSFSSEAFPDSLALASEDEFVIGTI 705
Query: 647 DDIQKLHIRSIPLGEHPRRICHQEQSRTFAI--------CSLKNQS-------------- 684
D IQKLHIR++PLGE PRRI +QE++ TF + CS + S
Sbjct: 706 DAIQKLHIRTVPLGESPRRIAYQEETGTFGVIVSRSDMACSTRCASLDAPNKSNASPYAW 765
Query: 685 -----------CAEE--------------------SEMHFVRLLDDQTFEFISTYPLDTF 713
CA+ E + ++D TFE +
Sbjct: 766 HKDFSSFGHTQCADRVDSGIPSCSSTSLQRPPSGCDETFSLLIIDQNTFEVLHAMQFCPN 825
Query: 714 EYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAV 773
EYG SI S D N YY VGTA++ EE+EP GRI V DGKL+ IAEKE GA
Sbjct: 826 EYGVSICSAKLGSDPNPYYIVGTAFINQEESEPKVGRIFVLRWHDGKLETIAEKEAAGAP 885
Query: 774 YSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDL 833
YS+ F+ KL AIN ++LY W + ++LQSEC +I+ L+++ GD+I+VGDL
Sbjct: 886 YSIREFHQKLAIAINSTVRLYSW----NAEKDLQSECTPFFNIVILHLKCLGDYILVGDL 941
Query: 834 MKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATD 893
M+S++LL Y + ++EE RDY NW +AVEILD+D +L AE+N NL+ +++ A D
Sbjct: 942 MRSMTLLNYNADITSLEEIGRDYQTNWTTAVEILDEDTFLAAESNLNLYVCKRDPSAA-D 1000
Query: 894 EERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIP-----TVIFGTVNGVIGVIAS 948
+ R + V YHLGE VN GSLVM P G +P + ++G+++G +GVI
Sbjct: 1001 DTRQHMHEVALYHLGEMVNVIVKGSLVMAQP----GDMPLPLNKSFLYGSLHGAVGVIVP 1056
Query: 949 LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSR 1008
+ E Y L ++QTNL K IK VG + H WR+F E+K A F+DGDLIE LDL +
Sbjct: 1057 IKQELYAILNQIQTNLAKTIKSVGKIEHGFWRTFLAERKIEPATGFIDGDLIEQLLDLPK 1116
Query: 1009 TRMDEISKTM----------NVSVEELCKRVEELTRLH 1036
++ +S+++ N++ E+L K VE+LTR+H
Sbjct: 1117 EALESVSQSIKVDEEGGHQRNMTPEDLVKLVEDLTRIH 1154
>gi|449488592|ref|XP_004158102.1| PREDICTED: LOW QUALITY PROTEIN: DNA damage-binding protein 1-like
[Cucumis sativus]
Length = 570
Score = 853 bits (2205), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 420/450 (93%), Positives = 437/450 (97%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWD ESSELITRAMGDVSDRIG
Sbjct: 53 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDTESSELITRAMGDVSDRIG 112
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RPTD+GQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC
Sbjct: 113 RPTDSGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 172
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
++PTIVVLYQDNKDARHVKTYEV LKDKDFVEGPWSQNNLDNGA +LIPVPPPLCGV+II
Sbjct: 173 SRPTIVVLYQDNKDARHVKTYEVVLKDKDFVEGPWSQNNLDNGAAVLIPVPPPLCGVIII 232
Query: 181 GEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL 240
GEETIVYCSA AFKAIP+RPSIT+AYGRVDADGSRYLLGDHAGLLHLLVITHEKE+VTGL
Sbjct: 233 GEETIVYCSATAFKAIPVRPSITRAYGRVDADGSRYLLGDHAGLLHLLVITHEKERVTGL 292
Query: 241 KIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIV 300
KIELLGETSIASTISYLDNA VYIGSSYGDSQL+KLN+QPDAKGSYVEVLERYVNLGPIV
Sbjct: 293 KIELLGETSIASTISYLDNAFVYIGSSYGDSQLVKLNVQPDAKGSYVEVLERYVNLGPIV 352
Query: 301 DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP 360
DFCVVDLERQGQGQVVTCSGAYKDGSLR+VRNGIGINEQASVELQGIKGMWSLRSSTDDP
Sbjct: 353 DFCVVDLERQGQGQVVTCSGAYKDGSLRVVRNGIGINEQASVELQGIKGMWSLRSSTDDP 412
Query: 361 FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLV 420
FDTFLVVSFISETRILAMNLEDELEETEIEGF SQ QTLFCHDA++NQLVQVTS SVRLV
Sbjct: 413 FDTFLVVSFISETRILAMNLEDELEETEIEGFNSQVQTLFCHDALFNQLVQVTSSSVRLV 472
Query: 421 SSTSRELRNEWKSPPGYSVNVATANASQVL 450
SST+REL NEW +P YS+NVATANASQ +
Sbjct: 473 SSTTRELLNEWNAPSNYSINVATANASQCI 502
>gi|340381612|ref|XP_003389315.1| PREDICTED: DNA damage-binding protein 1-like [Amphimedon
queenslandica]
Length = 1142
Score = 823 bits (2126), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 455/1104 (41%), Positives = 679/1104 (61%), Gaps = 81/1104 (7%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+LDV ++GRI ++LFRP GE D LFI T RY ++ +D +++++TRA GD+ +R+G
Sbjct: 52 VLDVDVFGRILAIQLFRPQGERMDLLFILTARYHVAIVGYDPIANDVLTRAYGDIKERVG 111
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIP--FDNKGQLKEAFNIRLEELQVLDIKFLY 118
+PT + ++DP C+LI LHLY G K++P FD+ LK AFNIRLE+L + DI+FL+
Sbjct: 112 KPTLGKNVAMVDPSCQLIALHLYTGQLKIVPLQFDSGSPLK-AFNIRLEDLYITDIQFLH 170
Query: 119 GCAKPTIVVLYQDNKDA--RHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCG 176
G PTI + ++ A R +KT+ ++ +DK+ + GPW N ++ A LL VP P G
Sbjct: 171 GTENPTIAYISEEPSVATGRVLKTFVISQRDKELLPGPWKPNTIEGQASLLCSVPSPYNG 230
Query: 177 VLIIGEETIVYC--SANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEK 234
++++G +++ Y +++ I I+ S+ +D SRYLLGD G L L + +
Sbjct: 231 LIVVGADSVAYFNDTSHTVDPIVIKESVISCIEPLDH--SRYLLGDFRGRLLTLFLEFSE 288
Query: 235 EKVTGL------KIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVE 288
E +G+ K+E+LGE SI T+SYLDN VV++GS+ GDSQL+KL+ P G Y++
Sbjct: 289 EMESGMTNIVNMKLEVLGEISIPHTLSYLDNGVVFVGSTKGDSQLVKLSSSPLENGGYID 348
Query: 289 VLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIK 348
VLE N+GPI+D VVDL++QG+ +V CSG KDG+LRIV++GIGINE AS++L GIK
Sbjct: 349 VLESMTNIGPILDMSVVDLDKQGRDVLVCCSGLGKDGALRIVKSGIGINEAASIDLPGIK 408
Query: 349 GMWSLR-SSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYN 407
G+WSL+ + +D D +V++F+ +T L + E E+EETE+ + QT +C + N
Sbjct: 409 GIWSLKCAGREDELDDTVVLTFVGQTMALRLAGE-EVEETELPALVTDQQTFYCSNVTGN 467
Query: 408 QLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGI 467
++Q+T+ SVRL+ + EL +W P G ++ A N+SQV++A G L YLE+ G
Sbjct: 468 AIIQITTKSVRLMDDKAMELICDWSPPDGRGISTAACNSSQVMVAVGC-DLYYLEVKPGS 526
Query: 468 LTE---VKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHL 524
E + H + +E++CLDINP+ E + S + AVG+WTDISV+I ++P + + L
Sbjct: 527 PGELLLISHTTMSHEVACLDINPLSEAGT-SSLCAVGLWTDISVQILNVPQFEHLFTQPL 585
Query: 525 GGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFS 584
GG+IIPRSVL+ G YLLCALGDG L + ++ +TG+L K+V LGT+PI L+ F
Sbjct: 586 GGDIIPRSVLMVELGGACYLLCALGDGCLHYYTMDSETGQLRGGKRVVLGTKPIVLKQFK 645
Query: 585 SKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIG 644
S T VFA SD PT+I+ SN+KLL+SNVN+KEV+++C NS AF +SLA+ L G
Sbjct: 646 SDGVTSVFACSDHPTIIHFSNQKLLFSNVNVKEVNYICTLNSEAFQNSLALVDSSTLIFG 705
Query: 645 TIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLK----------------------N 682
+D IQ LHI +IPLGE PR I +QE S+TF + + N
Sbjct: 706 CVDQIQMLHIETIPLGESPRCIAYQESSQTFLVGGYRTDKSGPDNTYTPSRQSVSTRTSN 765
Query: 683 QSCA-------------EESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFS--DD 727
S A + EMH + L D TF+ Y L E+ + SC+ + D+
Sbjct: 766 VSVAVVPPQLNIEEFKCPQVEMHSLILFDQTTFDVSHVYQLCPQEHILCVTSCNLTTNDE 825
Query: 728 SNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAI 787
Y VGTA V PEE E + GRILVF V GKL+L+ EK GAV+ + FNGK+L ++
Sbjct: 826 ERSVYVVGTALVKPEEKESSTGRILVFAVNSGKLELLHEKLENGAVFQVLGFNGKILNSV 885
Query: 788 NQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEG 847
N + + + DG L+ EC + +ILALY++T+GDFI+VGD+++S+ LL+YK E G
Sbjct: 886 NSGVFVNALV---DGA--LKEECAYKNNILALYLKTKGDFILVGDILRSLKLLVYKEELG 940
Query: 848 AIEERARDYNAN--WMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEY 905
+EE D+N + + +A+E++DD+ YLGA+ ++F +KN+E ++ + +
Sbjct: 941 -LEEIGVDHNISPCFCTAIEMIDDENYLGADGR-HIFICQKNTEATSEADLLYMVQPSRM 998
Query: 906 HLGEFVNRFRHGSLVMRLPD---SDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQT 962
+ G+ VN F GS VM P S + Q ++FGTV+G IG+I +L + Y L KLQ
Sbjct: 999 YFGDNVNVFSRGSFVMDHPGAGASSLLQGKPILFGTVHGAIGLIGTLNMDTYTLLSKLQQ 1058
Query: 963 NLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTM---- 1018
+ IK VG + HE +RSF+NE ++ F+DGDL+E FL+L R +M +I + +
Sbjct: 1059 KMAANIKSVGNIEHEIYRSFSNEHRSKPFAGFIDGDLVEKFLELPRPQMSQIVQGIKTTD 1118
Query: 1019 ------NVSVEELCKRVEELTRLH 1036
+VSV+++ K +E+L+RLH
Sbjct: 1119 VTGTEVDVSVDDILKLIEDLSRLH 1142
>gi|328770638|gb|EGF80679.1| hypothetical protein BATDEDRAFT_11194 [Batrachochytrium dendrobatidis
JAM81]
Length = 1098
Score = 813 bits (2101), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 428/1051 (40%), Positives = 656/1051 (62%), Gaps = 25/1051 (2%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRP 62
DVPIYGRIA + FRP D +F++TER VL +D ++ L+ A GD +D R
Sbjct: 56 DVPIYGRIACIHAFRPEHRDTDLIFVSTERGSVIVLSFDQQTQRLVPEATGDFNDPSSRL 115
Query: 63 TDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGCAK 122
D GQIG++DP R+IGLH++ G FK+IP + ++N+ + Q + K +
Sbjct: 116 ADPGQIGLVDPQNRMIGLHIHQGNFKIIPMFHSQISLSSWNLLVNSSQSIKGKGVARPPH 175
Query: 123 PTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGE 182
PT+ VL+ D+K++R+V YEV +K+K V + ++NG+ LLIPVP GVL++GE
Sbjct: 176 PTLAVLHMDSKESRYVTQYEVVIKEKALVLKS-AATKVENGSSLLIPVPLAAGGVLVVGE 234
Query: 183 ETIVYCSANAFKAI--PIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL 240
++I S + K I I+ ++ K + +VD G RYLLGD+ G L++L I L
Sbjct: 235 QSIALHSPSLQKPIMLSIKATLIKCFNQVDV-GFRYLLGDYEGQLYILAIIFANGTAREL 293
Query: 241 KIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIV 300
K+ +G+T AS+++YL + +++ S +GDS+L + + G+ + + + + NL PI
Sbjct: 294 KMTPVGQTVQASSLTYLTDGYLFVASHFGDSELYLIVPEDANTGNVLTLCKTFSNLAPIS 353
Query: 301 DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVE-LQGIKGMWSLRSSTDD 359
DFCVVD+E+QGQ Q+V CSGA +DGS+RI+RNGIG+ E ++ ++ + G+W+L+ +
Sbjct: 354 DFCVVDIEKQGQAQIVACSGAQRDGSIRIIRNGIGVEEIGQLDDMEELTGVWALKPYSAA 413
Query: 360 PFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRL 419
D LV+SFI ETR+ ++ + E + F + +TL+C + + +VQ+TS S+ +
Sbjct: 414 RHDNVLVLSFIGETRLQKLDGDSMAEMDMLGNFKTAERTLWCQNLSSDMVVQITSQSITI 473
Query: 420 VSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYE 479
++ EW G SV A+ + +L++ GGG + E D L + Q++ E
Sbjct: 474 LTIEGWTTVAEWCFDLGASVTHASVYQNMILVSLGGGMIHLFEFNDRELVMKRSIQIQVE 533
Query: 480 ISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFE 539
+SCL I + E + AVG W D SVR+ +PDL+ I KE L G+ IPRS+LL F+
Sbjct: 534 VSCLHICKMEELNVC--LCAVGCWEDNSVRLLKIPDLSEIQKEILPGDTIPRSILLVEFD 591
Query: 540 GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPT 599
+ YLL +LGDG L NF + K+ +L DRKK++L TQPITLRTF S THVFAASDRPT
Sbjct: 592 NLPYLLVSLGDGQLFNFRIG-KSLKLADRKKITLATQPITLRTFQSHGRTHVFAASDRPT 650
Query: 600 VIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPL 659
VI+ + +LLYSNVN++E+SH+ PFNS +LA A +G L IGTI+ +QKLHI++I L
Sbjct: 651 VIFVKSGQLLYSNVNVREISHVSPFNSHMAEGALAFASDGALKIGTIETVQKLHIKTIKL 710
Query: 660 GEHPRRICHQEQSRTFAICSL--KNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGC 717
GE PRRI + + S TF + ++ +N + +++ +RLLD Q +E + + L FE
Sbjct: 711 GETPRRIAYHDVSHTFGVLTVFSRNLPNGDLADISCLRLLDGQGYEVLDSIELQPFEIAS 770
Query: 718 SILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGK-LQLIAEKETKGAVYSL 776
S+++ F+DD +YY VGT + P E+EP +GRILVF V D + LQL+ E + +G+ YS
Sbjct: 771 SLITIRFTDDDTLYYTVGTGFAFPHEDEPVRGRILVFKVNDMRLLQLVHEYDIRGSAYSF 830
Query: 777 NAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKS 836
+ +G+L+A +N + + +W D ELQS +HGH+LAL + RGDFI+V DL+KS
Sbjct: 831 VSVHGRLVAGVNSNVMVLRWN-SDTSLLELQSM--NHGHVLALSLAVRGDFILVADLIKS 887
Query: 837 ISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEER 896
I+LL + +++E A D ++NWM+A E++DDD +LGA+++ N+F + K + ++EER
Sbjct: 888 ITLLQFDLATDSLKELAYDADSNWMTAAELIDDDTFLGADSSMNIFALSKQGDQVSEEER 947
Query: 897 GRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVG--QIPTVIFGTVNGVIGVIASLPHEQY 954
RL G +H GE +NRFR GSL + D + IP +++ TV+G IGV+A +P ++
Sbjct: 948 QRLRPKGWFHTGELINRFRKGSLTLHATDETLALPAIPEILYCTVHGAIGVVARIPSDET 1007
Query: 955 L-FLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMD- 1012
L LQ L+ V++GVGGL H WR + E++++ + +DGDLIESFL+L R+ D
Sbjct: 1008 AKILSTLQEALKSVVQGVGGLIHSDWRRYRTERRSIKSAGIIDGDLIESFLELDRSMQDH 1067
Query: 1013 -------EISKTMNVSVEELCKRVEELTRLH 1036
+++ + V++E L K VE+LTR+H
Sbjct: 1068 VFTQVATQVAGSTPVTLETLTKMVEDLTRIH 1098
>gi|324502823|gb|ADY41238.1| DNA damage-binding protein 1, partial [Ascaris suum]
Length = 1129
Score = 805 bits (2080), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 458/1098 (41%), Positives = 662/1098 (60%), Gaps = 86/1098 (7%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRP 62
+VP++GRIA ++ FR E+ D + + T+++ +L WD+ + EL TRA G ++DR+GR
Sbjct: 54 EVPVFGRIAAVKAFRVKNESVDSILVLTQKHHLAILCWDS-NGELRTRASGHIADRVGRH 112
Query: 63 TDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGCAK 122
++ G I + L+ LYDGL KV+ + L+ FNIR E+L ++D+ FL K
Sbjct: 113 SETGIIACVHSS-GLMAFRLYDGLIKVVQWTEGSDLR-GFNIRCEDLYIVDLDFLSDPEK 170
Query: 123 PTIVVLYQDNKDARHVKTYEVALKDKDFVEGP-WSQNNLDNGADLLIPVPPPLCGVLIIG 181
PT+V +Y+D+ + RH+K + L+DK+ P W Q+N++ A ++IP+P P GV+++G
Sbjct: 171 PTVVYIYRDH-NGRHLKAAAINLEDKELSSPPLWKQDNIEAEACMVIPIPQPYGGVIVVG 229
Query: 182 EETIVYCS-ANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAG-----LLHLLVITH 232
E I Y ANA+ AI I S YG++D DG RYLLGD +G LL L V T
Sbjct: 230 HEAISYHKDANAYSAIAPPLIHQSQISCYGKIDRDGQRYLLGDLSGRIFMLLLDLDVATD 289
Query: 233 EKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
V LK+ELLGETSI + YLDN VV+IGS +GDSQL++L +P GSY+ +++
Sbjct: 290 GTASVKDLKVELLGETSIPECVVYLDNGVVFIGSRFGDSQLVRLRSEPCPDGSYISLMDT 349
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS 352
YVNL PI D V++ + GQ QV+TCSGA+KDGSLRI+RNGIGI E ASVEL G+K +++
Sbjct: 350 YVNLAPIRDMVVINAD--GQQQVITCSGAFKDGSLRIIRNGIGIEELASVELPGVKALFT 407
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
L ++ D +LVV F+ ET IL ++ E ELE+T++ GF + QTL+ + Q+
Sbjct: 408 LNVESE--LDDYLVVGFVDETHILKISGE-ELEDTQLPGFSTTEQTLWAGRVGSGGIAQI 464
Query: 413 TSGSVRLVSSTSRELRNE---WKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILT 469
T V L+ LR W+ P SV + QV+LA G L YL + D I T
Sbjct: 465 TPLKVVLI------LRGNTLTWEPPSRISVVSVNELSGQVVLACGN-QLHYLLLTDKI-T 516
Query: 470 EVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEII 529
+ + + E+EI+C+D+ +G+ S++ AV WTD+SV + SLPDLN I +E GGE++
Sbjct: 517 PITNIECEFEIACIDVGCVGDEIE-SKLCAVAYWTDMSVALRSLPDLNEIVREKCGGEML 575
Query: 530 PRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTT 589
RS+L+C EGI YLL ALGDG L + ++M +G LT KK +LGTQP TL+ F S+
Sbjct: 576 ARSLLICMMEGIVYLLVALGDGTLYYYQIDMNSGALTQPKKATLGTQPTTLKKFMSRGAR 635
Query: 590 HVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDI 649
+VFA SDRPTVIYSS++KL++SNVNLK V+HMC NSA + DSL + L IG IDDI
Sbjct: 636 NVFACSDRPTVIYSSSQKLVFSNVNLKLVAHMCALNSATYTDSLVMTDGQTLVIGRIDDI 695
Query: 650 QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQ-----------SCA------------ 686
QKLHIR++PLGE RI +Q ++ T AI +N+ CA
Sbjct: 696 QKLHIRTVPLGESVSRIAYQPETGTIAILVQRNEFVDADGKHHCGHCASKMAVNASSSHP 755
Query: 687 -------------EESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
EE E+ V + D T E + ++ L E SI SC DD YY
Sbjct: 756 SVVTSATTPPIEPEEIEVSSVVVFDANTLEILHSHELGKNELAMSIKSCVLGDDPQPYYA 815
Query: 734 VGTAYVLPEENEPTKGRILVFIV----EDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQ 789
VGTA VL +E E GR+L+F V E G+++L+ +KE KGA YS+ GKL+ AIN
Sbjct: 816 VGTAVVLTDETESKSGRLLIFQVAPSSEGGRMRLVHDKEIKGAAYSIQVLMGKLVVAINS 875
Query: 790 KIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAI 849
++L++W +EL+ EC ++ ALY++T+ D ++VGDLM+S+S+L YK E +
Sbjct: 876 CVRLFEWT----AEKELRLECSDFDNVTALYLRTKNDVVLVGDLMRSLSVLAYKPMESSF 931
Query: 850 EERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGE 909
E+ ARD+ NWM+A EI+D + +LGAE FNLFTV K+ + R +L+ G Y+LGE
Sbjct: 932 EKIARDFVTNWMTACEIIDMETFLGAEIMFNLFTVVKDCSSKDEGIRLQLQETGMYYLGE 991
Query: 910 FVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIK 969
VN F HGSL+ D +++GT +G +GVI L + Y F+ +L+T + V K
Sbjct: 992 SVNAFCHGSLIATHIDLTPSFTTPILYGTSDGGLGVIVQLTPQFYDFVHELETRIAAVTK 1051
Query: 970 GVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV--------- 1020
+ H Q+R+F ++ +T + F+DGDL+E LD+SR ++++ + +
Sbjct: 1052 NCMRIEHGQYRTFESDGRTEQSVGFIDGDLVEGLLDMSRDSVEKLIDGLTLPAAAGQEQK 1111
Query: 1021 --SVEELCKRVEELTRLH 1036
++EE+ K VE+L R+H
Sbjct: 1112 VATIEEVLKVVEDLARIH 1129
>gi|393905247|gb|EJD73911.1| CPSF A subunit region family protein [Loa loa]
Length = 1145
Score = 770 bits (1989), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 445/1108 (40%), Positives = 659/1108 (59%), Gaps = 90/1108 (8%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRP 62
+ P++GRIAT++LFR GE D L I T +Y +++W +SEL TRA G + DR+GRP
Sbjct: 54 ECPVFGRIATIKLFRAPGEDVDSLLILTAKYHLAIIRW-TPTSELRTRASGHIVDRVGRP 112
Query: 63 TDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGCAK 122
++ G I + ++ LYDGL KV+ + N+G+ FN+R ++L ++DI F+ +
Sbjct: 113 SETGMIATVHSSGLMV-FRLYDGLLKVVQW-NEGKDLRGFNVRCDDLYIIDITFMSDPDR 170
Query: 123 PTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGE 182
PT+ +YQD+ + RH+K + ++DK+ W +NL+ A ++I VP P G LI G
Sbjct: 171 PTLAYIYQDD-NGRHIKVVTLNIEDKELSSPLWKHDNLEGEASMVIGVPEPAGGCLIAGP 229
Query: 183 ETIVYCS----ANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE 235
+ I Y A + +P + + Y VD DG RYLL D AG L++L++ K
Sbjct: 230 DAISYHKGGDDALRYAGVPGSRLHNTHPNCYAPVDRDGQRYLLADLAGNLYMLLLEFGKG 289
Query: 236 K---------VTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGS- 285
+ V +K+E LG T IA + YLDN V +IGS +GDSQLI+L+ +P A G+
Sbjct: 290 QEQDESSTVSVKDMKVESLGNTCIAECMCYLDNGVCFIGSRFGDSQLIRLSTEPRADGTG 349
Query: 286 YVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQ 345
Y+ +L+ Y NL PI D V + GQ Q++TCSGAYKDG++RI+RNGIGI E ASVEL+
Sbjct: 350 YISLLDSYTNLAPIRDMTV--MRCNGQQQILTCSGAYKDGTIRIIRNGIGIEELASVELK 407
Query: 346 GIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAI 405
GIK M++LR+ D FD +L++SF S+T +L +N E ELE+T+I GF TL+
Sbjct: 408 GIKNMFTLRTR-DHEFDDYLILSFDSDTHVLLINGE-ELEDTQITGFVVDGATLWAGCLF 465
Query: 406 YNQ-LVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG 464
+ ++QVT G V L+ + ++ WK+ ++ Q+++A G L+YLE
Sbjct: 466 QSTTILQVTHGEVILIDGDNIQI---WKASKWITLVAVNEITGQLVIACGA-LLIYLEAD 521
Query: 465 DGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHL 524
+ + E+EISC+DI PIG S+I AVG WTD+SV + +LP L + +E +
Sbjct: 522 SAGFKLISELECEFEISCIDITPIGNETLRSEICAVGYWTDLSVALRTLPQLMEVVREKI 581
Query: 525 GGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFS 584
G+++ RS++L EG YLL ALGDG + F ++MKTG L D KK +LGTQPI LR F
Sbjct: 582 PGDMLSRSIMLSPMEGHVYLLVALGDGTVHYFQIDMKTGALLDPKKATLGTQPIHLRKFR 641
Query: 585 SKNTT--HVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELT 642
S+ ++ ++F SDRP VIYSSN+KLL+SNVNL+ VS M P + A+PD+L + L
Sbjct: 642 SRCSSVHNIFVCSDRPAVIYSSNQKLLFSNVNLRMVSTMTPLYAEAYPDALVLTDGNSLV 701
Query: 643 IGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAI------------------CSLKN-- 682
IG IDDIQKLHIR++PLGE P RI +Q ++ T A+ C+ KN
Sbjct: 702 IGRIDDIQKLHIRTVPLGESPSRIAYQPETNTIAVTVERLEFVDAMGKHHFGQCASKNAM 761
Query: 683 ----------------QSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSD 726
+ AEE E+ + LLD TFE + ++ L+ E S+ SC +
Sbjct: 762 ETSSSRLSSMRREPTPECLAEEMEVSSILLLDSNTFEILHSHELEGSEMAMSLASCQLGN 821
Query: 727 DSNVYYCVGTAYVLPEENEPTKGRILVFIVEDG--KLQLIAEKETKGAVYSLNAFNGKLL 784
DS Y+ VGTA ++ +E E GRI++F +G +++L+ EKE KGA YS+ + +GKL+
Sbjct: 822 DSQPYFVVGTAVIMSDETESKMGRIMMFQASEGPERMRLVYEKEIKGAAYSIQSMDGKLV 881
Query: 785 AAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKH 844
A+N ++L++W +EL+ EC ++ ALY++T+ D I+VGDLM+S+SLL YK
Sbjct: 882 VAVNSCVRLFEWT----ADKELRLECSDFDNVTALYLKTKNDLILVGDLMRSLSLLSYKS 937
Query: 845 EEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGE 904
E E+ ARD+ NWMSA EI+D D +LGAEN++NLFTV K+S EE RL+ +G
Sbjct: 938 VESTFEKVARDFMTNWMSACEIIDSDSFLGAENSYNLFTVVKDSFTVFKEEGTRLQELGL 997
Query: 905 YHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNL 964
++LGE VN F HGSL D ++++GT +G IGVI +P Y FL +Q L
Sbjct: 998 FYLGEMVNVFCHGSLTATQVDVAPLYHSSILYGTSDGGIGVIVQMPPVLYTFLHDVQKRL 1057
Query: 965 RKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEI---------- 1014
+ ++H Q+R+F EK++ F+DGDLIES LD+ + + +I
Sbjct: 1058 ADYTENCMRISHTQYRTFETEKRSEVPNGFIDGDLIESLLDMGKDSVGQIVNGLKMPLLN 1117
Query: 1015 ------SKTMNVSVEELCKRVEELTRLH 1036
++ ++ + E++ K VE+L+R+H
Sbjct: 1118 IPSSETTELVDATAEDVLKLVEDLSRIH 1145
>gi|194381178|dbj|BAG64157.1| unnamed protein product [Homo sapiens]
Length = 826
Score = 746 bits (1926), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 382/747 (51%), Positives = 499/747 (66%), Gaps = 56/747 (7%)
Query: 340 ASVELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTL 399
AS++L GIKG+W LRS + D LV+SF+ +TR+L +N E E+EETE+ GF QT
Sbjct: 86 ASIDLPGIKGLWPLRSDPNRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQTF 144
Query: 400 FCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLV 459
FC + + QL+Q+TS SVRLVS + L +EWK P +++VA+ N+SQV++A G L
Sbjct: 145 FCGNVAHQQLIQITSASVRLVSQEPKALVSEWKEPQAKNISVASCNSSQVVVAVGRA-LY 203
Query: 460 YLEIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLI 519
YL+I L ++ H ++E+E++CLDI P+G++ S + A+G+WTDIS RI LP L+
Sbjct: 204 YLQIHPQELRQISHTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISARILKLPSFELL 263
Query: 520 TKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPIT 579
KE LGGEIIPRS+L+ FE YLLCALGDG L F LN++TG L+DRKKV+LGTQP
Sbjct: 264 HKEMLGGEIIPRSILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTV 323
Query: 580 LRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEG 639
LRTF S +TT+VFA SDRPTVIYSSN KL++SNVNLKEV++MCP NS +PDSLA+A
Sbjct: 324 LRTFRSLSTTNVFACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNS 383
Query: 640 ELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLK------------------ 681
LTIGTID+IQKLHIR++PL E PR+IC+QE S+ F + S +
Sbjct: 384 TLTIGTIDEIQKLHIRTVPLYESPRKICYQEVSQCFGVLSSRIEVQDTSGGTTALRPSAS 443
Query: 682 ---------------------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSIL 720
S EE E+H + ++D TFE + + EY S++
Sbjct: 444 TQALSSSVSSSKLFSSSTAPHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLV 503
Query: 721 SCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFN 780
SC D N Y+ VGTA V PEE EP +GRI+VF DGKLQ +AEKE KGAVYS+ FN
Sbjct: 504 SCKLGKDPNTYFIVGTAMVYPEEAEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFN 563
Query: 781 GKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLL 840
GKLLA+IN ++LY+W +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL
Sbjct: 564 GKLLASINSTVRLYEWTTE----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLL 619
Query: 841 IYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLE 900
YK EG EE ARD+N NWMSAVEILDDD +LGAEN FNLF +K+S TDEER L+
Sbjct: 620 AYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQ 679
Query: 901 VVGEYHLGEFVNRFRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEK 959
VG +HLGEFVN F HGSLVM+ L ++ +V+FGTVNG+IG++ SL Y L
Sbjct: 680 EVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLD 739
Query: 960 LQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN 1019
+Q L KVIK VG + H WRSF+ E+KT A F+DGDLIESFLD+SR +M E+ +
Sbjct: 740 MQNRLNKVIKSVGKIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQ 799
Query: 1020 ----------VSVEELCKRVEELTRLH 1036
+ ++L K VEELTR+H
Sbjct: 800 YDDGSGMKREATADDLIKVVEELTRIH 826
Score = 45.8 bits (107), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 18/31 (58%), Positives = 25/31 (80%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERY 33
+V +YG+IA +ELFRP GE++D LFI T +Y
Sbjct: 54 EVGMYGKIAVMELFRPKGESKDLLFILTAKY 84
>gi|226510488|ref|NP_001145925.1| uncharacterized protein LOC100279448 [Zea mays]
gi|219884971|gb|ACL52860.1| unknown [Zea mays]
Length = 416
Score = 736 bits (1901), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 343/416 (82%), Positives = 379/416 (91%)
Query: 621 MCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSL 680
MCPFN+AAFPDSLAIAKEGEL+IGTIDDIQKLHIR+IPL E RRICHQEQSRT A CS
Sbjct: 1 MCPFNTAAFPDSLAIAKEGELSIGTIDDIQKLHIRTIPLNEQARRICHQEQSRTLAFCSF 60
Query: 681 KNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVL 740
K EESE H +RLLD QTFE + YPLD +E GCSI+SCSF+DDSNVYYCVGTAYV+
Sbjct: 61 KYNQSVEESETHLIRLLDHQTFESLCVYPLDQYECGCSIISCSFADDSNVYYCVGTAYVI 120
Query: 741 PEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRD 800
PEENEPTKGRILVF VEDG LQLI EKETKGAVYSLNAFNGKLLAAINQKIQLYKWM R+
Sbjct: 121 PEENEPTKGRILVFAVEDGSLQLIVEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMSRE 180
Query: 801 DGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANW 860
DG+ ELQSECGHHGHILALY QTRGDFIVVGDLMKSISLL+YKHEE AIEERARDYNANW
Sbjct: 181 DGSHELQSECGHHGHILALYTQTRGDFIVVGDLMKSISLLVYKHEESAIEERARDYNANW 240
Query: 861 MSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLV 920
M+AVE+LDD++Y+GAEN++NLFTVRKNS+ ATD+ER RLEVVGEYHLGEFVNRFRHGSLV
Sbjct: 241 MTAVEMLDDEVYVGAENSYNLFTVRKNSDAATDDERARLEVVGEYHLGEFVNRFRHGSLV 300
Query: 921 MRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWR 980
MRLPDSD+GQIPTVIFGT+NGVIG+IASLPH+QY+FLEKLQ+ L K IKGVG L+HEQWR
Sbjct: 301 MRLPDSDIGQIPTVIFGTINGVIGIIASLPHDQYIFLEKLQSTLVKYIKGVGNLSHEQWR 360
Query: 981 SFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRLH 1036
SF+N+KKT +A+NFLDGDLIESFLDLSR++M+E+SK M V VEEL KRVEELTRLH
Sbjct: 361 SFHNDKKTAEARNFLDGDLIESFLDLSRSKMEEVSKAMGVPVEELSKRVEELTRLH 416
>gi|395544366|ref|XP_003774082.1| PREDICTED: DNA damage-binding protein 1 [Sarcophilus harrisii]
Length = 1239
Score = 711 bits (1834), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 365/713 (51%), Positives = 476/713 (66%), Gaps = 56/713 (7%)
Query: 374 RILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKS 433
R+L +N E E+EETE+ GF QT FC + + QL+Q+TS SVRLVS + L +EWK
Sbjct: 533 RVLMLNGE-EVEETELTGFVDDQQTFFCGNVAHQQLIQITSASVRLVSQEPKALVSEWKE 591
Query: 434 PPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDINPIGENPS 493
P G +++VA+ N+SQV++A G L YL+I L ++ H ++E+E++CLDI P+G++
Sbjct: 592 PQGKNISVASCNSSQVVVAVGRA-LYYLQIHPQELRQISHTEMEHEVACLDITPLGDSNG 650
Query: 494 YSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHL 553
S + A+G+WTDIS RI LP L+ KE LGGEIIPRS+L+ FE YLLCALGDG L
Sbjct: 651 MSPLCAIGLWTDISARILKLPSFELLHKEMLGGEIIPRSILMTTFESSHYLLCALGDGAL 710
Query: 554 LNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNV 613
F L+++TG L+DRKKV+LGTQP LRTF S +TT+VFA SDRPTVIYSSN KL++SNV
Sbjct: 711 FYFGLSIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVFACSDRPTVIYSSNHKLVFSNV 770
Query: 614 NLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSR 673
NLKEV++MCP NS +PDSLA+A LTIGTID+IQKLHIR++PL E PR+IC+QE S+
Sbjct: 771 NLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKLHIRTVPLYESPRKICYQEVSQ 830
Query: 674 TFAICSLK---------------------------------------NQSCAEESEMHFV 694
F + S + S EE E+H +
Sbjct: 831 CFGVLSSRIEVQDASGGTTALRPSASTQALSSSVSSSKLFSSSTAPHETSFGEEVEVHNL 890
Query: 695 RLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVF 754
++D TFE + + EY S++SC D N Y+ VGTA V PEE EP +GRI+VF
Sbjct: 891 LIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFIVGTAMVYPEEAEPKQGRIVVF 950
Query: 755 IVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHG 814
DGKLQ +AEKE KGAVYS+ FNGKLLA+IN ++LY+W +EL++EC H+
Sbjct: 951 QYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRLYEWT----AEKELRTECNHYN 1006
Query: 815 HILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG 874
+I+ALY++T+GDFI+VGDLM+S+ LL YK EG EE ARD+N NWMSAVEILDDD +LG
Sbjct: 1007 NIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLG 1066
Query: 875 AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMR-LPDSDVGQIPT 933
AEN FNLF +K+S TDEER L+ VG +HLGEFVN F HGSLVM+ L ++ +
Sbjct: 1067 AENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQGS 1126
Query: 934 VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKN 993
V+FGTVNG+IG++ SL Y L +Q L KVIK VG + H WRSF+ E+KT A
Sbjct: 1127 VLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSFHTERKTEPATG 1186
Query: 994 FLDGDLIESFLDLSRTRMDEISKTMN----------VSVEELCKRVEELTRLH 1036
F+DGDLIESFLD+SR +M E+ + + ++L K VEELTR+H
Sbjct: 1187 FIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATADDLIKVVEELTRIH 1239
Score = 410 bits (1054), Expect = e-111, Method: Compositional matrix adjust.
Identities = 198/348 (56%), Positives = 261/348 (75%), Gaps = 11/348 (3%)
Query: 17 RPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIGRPTDNGQIGIIDPD 74
R GE++D LFI T +Y C+L++ ES ++ITRA G+V DRIGRP++ G IGIIDP+
Sbjct: 73 RQQGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIGRPSETGIIGIIDPE 132
Query: 75 CRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGCAKPTIVVLYQDNKD 134
CR+IGL LYDGLFKVIP D + +AFNIRLEELQV+D+KFLYGC PTI +YQD +
Sbjct: 133 CRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELQVIDVKFLYGCQAPTICFVYQDPQ- 191
Query: 135 ARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFK 194
RHVKTYEV+L++K+F +GPW Q N++ A ++I VP P G +IIG+E+I Y + + +
Sbjct: 192 GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIIIGQESITYHNGDKYL 251
Query: 195 AIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-----KVTGLKIELLG 246
AI I+ S + RVD +GSRYLLGD G L +L++ E++ + L++ELLG
Sbjct: 252 AIAPPIIKQSTIVCHNRVDPNGSRYLLGDMEGRLFMLLLEKEEQMDGTVTLKDLRVELLG 311
Query: 247 ETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVD 306
ETSIA ++YLDN VV++GS GDSQL+KLN+ + +GSYV +E + NLGPIVD CVVD
Sbjct: 312 ETSIAECLTYLDNGVVFVGSRLGDSQLVKLNVDSNEQGSYVVAMETFTNLGPIVDMCVVD 371
Query: 307 LERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLR 354
LERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+E AS++L GIKG+W LR
Sbjct: 372 LERQGQGQLVTCSGAFKEGSLRIIRNGIGIHEHASIDLPGIKGLWPLR 419
>gi|328700785|ref|XP_001945395.2| PREDICTED: DNA damage-binding protein 1-like [Acyrthosiphon pisum]
Length = 1072
Score = 699 bits (1803), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 387/1003 (38%), Positives = 601/1003 (59%), Gaps = 86/1003 (8%)
Query: 8 GRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESS--ELITRAMGDVSDRIGRPTDN 65
G+I L++FRP + D +F+ T +Y +L++ S E++T+ +V D G T+
Sbjct: 91 GKIEILKVFRPKSKTTDLIFVVTAQYNAMILEYTQGSDKIEMMTKGHCNVFDHFGVATE- 149
Query: 66 GQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGCAKPTI 125
+GIIDP+ +L+ L L++ +FK+IP D +G+L + ++I +EE + DI FLYG PTI
Sbjct: 150 -FMGIIDPNAKLVMLKLFEKMFKIIPLDKEGEL-DVYSITMEETNIQDIGFLYGFTNPTI 207
Query: 126 VVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETI 185
+++Y+ N R +K ++ +++ A ++IPVP PLCG +IIGE +I
Sbjct: 208 IIIYE-NAMGRTIKIKKIIDSK--------KYKSIEKEASMVIPVPSPLCGAIIIGENSI 258
Query: 186 VY--CSANAFKAIPIRPSI-TKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-----KV 237
Y S N + +PIR I Y RVD +G+RYLLGDH+G L +L + +EK KV
Sbjct: 259 FYHNGSCNIIR-LPIRQKIEIVCYTRVDLEGTRYLLGDHSGCLLMLFLKYEKTLNGKFKV 317
Query: 238 TGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLG 297
T L + GE SI +++YLDN V+Y+ S +GDSQLIKL+ + + GS++ +L++Y+NLG
Sbjct: 318 TDLYLRYFGEISIPISLTYLDNKVIYVASKFGDSQLIKLHYELNETGSHLTILDQYLNLG 377
Query: 298 PIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSST 357
PIVD C+VD++++GQ Q+VTCSGAYKDGSLRI+ NG+GI E A+++L GIKG+WSL +T
Sbjct: 378 PIVDMCLVDIDQRGQEQIVTCSGAYKDGSLRIINNGVGIQEIATIDLLGIKGIWSLSFNT 437
Query: 358 DDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSV 417
D LV+SF+ +++LA + E E EE +EGF S+ QT +C + N++VQVTS SV
Sbjct: 438 KSDLDDTLVLSFVWHSKVLAYDSE-EAEEIYVEGFESELQTFYCGKTLDNKMVQVTSASV 496
Query: 418 RLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLE 477
RL+ S++L +EWK P ++N + N Q + ++G L Y+EIG + + KH LE
Sbjct: 497 RLICMESKKLISEWKVPYFRNINAVSCNGHQAVCSSGHD-LYYIEIGSQKIFQNKHITLE 555
Query: 478 YEISCLDINPIGENPSYSQI-AAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLC 536
+E+SCLDI + + I A+G+WTD S+++ LPD + KE+L I+PRSV
Sbjct: 556 HEVSCLDICLFKDKFGETIILLAIGLWTDTSIKVLKLPDFVELEKENLFEGIVPRSVSFV 615
Query: 537 AFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASD 596
E I YL CALG+G L F LN++TG+L K+ LG +P ++ F + ++ +F SD
Sbjct: 616 TLENIHYLFCALGNGSLCYFYLNIETGKLYKNGKIKLGNRPALIKKFQTASS--IFICSD 673
Query: 597 RPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRS 656
P +I+SSNKKL+++NVN +V+++C N+ +P+SL +A + + IG ID +K H+R+
Sbjct: 674 FPIIIHSSNKKLVFTNVNSIKVNNICMVNTDNYPNSLILATDTAIIIGVIDMEKKHHVRT 733
Query: 657 IPLGEHPRRICHQEQSRTFAICSLKNQSCAE----------------------------- 687
IPLGE PRRI +QE S+TF I ++K E
Sbjct: 734 IPLGESPRRIAYQEASKTFGIITIKKNIKDEMIKGPVHPSASTRTQNITSAMGNRFLIDQ 793
Query: 688 ------------------ESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSN 729
E E+ + +L TFE + Y L + EY SI+S ++D N
Sbjct: 794 HLSSSSEASSLSDSDFNPEVEISSMIILHQDTFEILHVYQLYSNEYALSIISTKLANDPN 853
Query: 730 VYYCVGTAYVLPEENEPTKGRILVFIVED--GKLQLIAEKETKGAVYSLNAFNGKLLAAI 787
YY +GTA + E +P +GRI+VF + KL I+EK G +S+ F+ L+A +
Sbjct: 854 TYYVLGTALMTEECQDPKEGRIVVFHYDTSLSKLTQISEKIVYGGCFSMVTFHDMLIATV 913
Query: 788 NQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEG 847
N +QLY W ++ C + + L+ +V+T G +I+ GDLMKS++L YK +E
Sbjct: 914 NSSVQLYIWT----HEKKFVLRCTQNNNSLSQHVKTNGKYILCGDLMKSLALFKYKTDEP 969
Query: 848 AIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGAT-DEERGRLEVVGEYH 906
+E+ D W SA+EI+DDD+++GAEN+ L+ K+S + D + + +G +H
Sbjct: 970 NLEKIVTDDCLKWSSAIEIIDDDLFIGAENDKYLYVFYKHSNFVSGDYQHNNFKEIGRFH 1029
Query: 907 LGEFVNRFRHGSLVMRLPDSD----VGQIPTVIFGTVNGVIGV 945
LG+ VN FRHGSLVM+ +++ + ++++GT++G +G+
Sbjct: 1030 LGDLVNVFRHGSLVMKQFENEYETQLSVQGSILYGTISGALGL 1072
>gi|449684814|ref|XP_004210722.1| PREDICTED: DNA damage-binding protein 1-like, partial [Hydra
magnipapillata]
Length = 725
Score = 699 bits (1803), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 374/801 (46%), Positives = 514/801 (64%), Gaps = 88/801 (10%)
Query: 248 TSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDL 307
TSI ++YLDN VV+IGS GDSQ++KLN +PD KGS++ +L + NLGPI+D CVVDL
Sbjct: 1 TSIPHCLTYLDNGVVFIGSCLGDSQIVKLNTEPDKKGSFITILRSFTNLGPILDMCVVDL 60
Query: 308 ERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLR--SSTDDPFDTFL 365
ERQGQ Q+VTCSGA+KDGSLRI+RNGIGINE AS++L GI G+W L+ S D DT +
Sbjct: 61 ERQGQDQLVTCSGAFKDGSLRIIRNGIGINELASIDLAGIMGLWCLKVNSINSDLHDT-M 119
Query: 366 VVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSR 425
V+SF+ ++R+L+++ +E+EE EIEGF S QT +C + +NQL+Q
Sbjct: 120 VLSFVGQSRVLSLS-TEEVEEIEIEGFSSDKQTTYCANVNFNQLIQ------------KH 166
Query: 426 ELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDI 485
+ W P ++VA +N+ Q++++ G L+YLE+ D + ++ H LEYE++CLD+
Sbjct: 167 FFFSNWLPPDNKHISVAVSNSFQIVVSLGK-ELIYLEVEDSNIKQISHTVLEYEVACLDL 225
Query: 486 NPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEGISYLL 545
+P G N + S VG+WTDISVRI +LP+L + E L GE+IPRS+L+ FE YLL
Sbjct: 226 SPKGSNETTSDRLCVGLWTDISVRILALPNLEELYVEKLSGEMIPRSILMITFEDKEYLL 285
Query: 546 CALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSN 605
CALGDG L FLLN TG L+D+KKVSLGT+P +++F S ++THVFA SDRPTVIYSSN
Sbjct: 286 CALGDGSLFYFLLNRLTGVLSDQKKVSLGTKPTVIQSFKSGSSTHVFACSDRPTVIYSSN 345
Query: 606 KKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRR 665
KL++SNVNLKEV +M P N+ A+P+S G ID I K+ L H +
Sbjct: 346 NKLVFSNVNLKEVCYMSPLNTQAYPNSFY--------SGHIDSIYKMFF----LVTHSHQ 393
Query: 666 ICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFS 725
E + + CS FS
Sbjct: 394 FLENEWATSLTSCS--------------------------------------------FS 409
Query: 726 DDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLA 785
+D N YYCVGT+ V PEE+EP +G+I++F + +GKL I K GAVY L FNGKLLA
Sbjct: 410 NDPNTYYCVGTSMVYPEESEPKEGKIILFQLFEGKLVQIGSKTVNGAVYVLQGFNGKLLA 469
Query: 786 AINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHE 845
+N + +Y+W +EL+ EC +H ILALY++++GDFI+VGDLM+S++LL YK
Sbjct: 470 GVNSLVSVYEWT----SDKELKQECCYHNTILALYLKSKGDFILVGDLMRSMTLLAYK-P 524
Query: 846 EGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEY 905
G +EE A D++ NWM+AVEI+DDD +LGAEN+FNLF +K++ DEER L+ +G+Y
Sbjct: 525 LGRLEEIAHDFSPNWMTAVEIIDDDTFLGAENSFNLFICQKDNSSVNDEERHHLQTIGKY 584
Query: 906 HLGEFVNRFRHGSLVMRLPDSDVGQI-PTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNL 964
HLG+FVN F+HGSLVM + I ++++GTV G IG++A LP + FL ++Q L
Sbjct: 585 HLGDFVNVFKHGSLVMHHSTEQLTPISSSILYGTVRGAIGLVAGLPKNTFDFLSQVQEKL 644
Query: 965 RKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV---- 1020
K IK VG + HE WRSF N+KKT A +DGDLIES LDL+RT++ E+ + +
Sbjct: 645 SKTIKSVGKIEHEFWRSFYNDKKTDLAVGCVDGDLIESCLDLTRTQLHEVVSGLEIEEAG 704
Query: 1021 -----SVEELCKRVEELTRLH 1036
+V++L K VEEL+R+H
Sbjct: 705 IKRECTVDDLIKVVEELSRIH 725
>gi|313238818|emb|CBY20011.1| unnamed protein product [Oikopleura dioica]
gi|313245836|emb|CBY34826.1| unnamed protein product [Oikopleura dioica]
Length = 1135
Score = 697 bits (1799), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 397/1088 (36%), Positives = 622/1088 (57%), Gaps = 71/1088 (6%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
+YGRIA +E+FR E +D LFI TE C+L++ ++ITRA GD+ D+ + +
Sbjct: 62 LYGRIAVIEVFRYKNEKKDCLFILTESCYACILEY--VDGKIITRAYGDMRDKNYSVSQS 119
Query: 66 GQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGCAKPTI 125
G +DP+ R I L LYDG+ K+I ++ + + R+EE+ V+D+ FL+ KPT+
Sbjct: 120 GMHACVDPEARCIALRLYDGVLKIINLNSSSKHLTSAEQRIEEILVVDMCFLHTANKPTL 179
Query: 126 VVLYQDNKDARHVKTYEVALKDK----DFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIG 181
+LY DN +RH+ T + L + +GP+ ++ L++ VP PL G+L++G
Sbjct: 180 ALLYDDNS-SRHLSTIAITLDNSGSGASIHKGPFRHTQVEQDTILIVAVPEPLAGILLLG 238
Query: 182 EETIVYCSANAFKAIPIRPSITKAYGRVDA-DGSRYLLGDHAGLLHLLVITHEKEKVTGL 240
I Y + I + + V D RYL GD G L LL++ + + ++
Sbjct: 239 HVNITYHDSKNRSTCSIENIVKRTIECVTPIDKHRYLCGDSNGELFLLLLDYNENRIPEE 298
Query: 241 KIEL----LGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNL 296
++ L LG T++ +T+SY+DN VV++GS++GDS+LI++ + + G + L +Y NL
Sbjct: 299 RMRLATKYLGRTTLPNTLSYIDNYVVFVGSTFGDSELIRIEVSDNNSGQHFTSLHQYDNL 358
Query: 297 GPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSS 356
GPI D C+VD E+QGQGQ+VT SG GSLRI+RNG+GI+E AS++L+G+KG+W+L+
Sbjct: 359 GPIKDMCIVDFEKQGQGQLVTASGVGTGGSLRIIRNGVGIHEYASIDLEGVKGLWALKYL 418
Query: 357 TDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGS 416
+ L++SF+ +T L + +D E EI GF + QT++ + Q +Q+T
Sbjct: 419 SSSTKQDSLLLSFVGQTIFLRLEGQDVTEVEEIPGFTNGEQTMYAGNVTDQQFLQITEKQ 478
Query: 417 VRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQL 476
VRL++ S L+ W+ +N+ + N +QVLL G +YLEI D + E
Sbjct: 479 VRLIADES--LKGSWEPEENTQINLCSVNKNQVLLGVGST-AIYLEINDCEIVEKSRHVF 535
Query: 477 EYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLC 536
+ EI+C+DI+P+ + S S +G+W +++V + LP + +I K LGG IIPRSVLL
Sbjct: 536 DSEIACVDISPLQKEMS-SDFFTIGLW-NVTVSVNKLPSMEVIAKMELGGNIIPRSVLLN 593
Query: 537 AF--EGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAA 594
+F YLL ++GDG L LN + + +K++ LGTQP +L F + N + VFA
Sbjct: 594 SFGENNTPYLLVSIGDGALFYIKLN-EDHSFSSKKRIQLGTQPTSLNKFQTSNGSTVFAC 652
Query: 595 SDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHI 654
SDRP VI+S+N+K+ +SNVNLK+V+HMC ++ +P++LA+ E L IG IDDIQKLH
Sbjct: 653 SDRPAVIHSTNEKIFFSNVNLKQVNHMCVLDTEGYPNALALVNENALLIGKIDDIQKLHT 712
Query: 655 RSIPLGEHPRRICHQEQSRTFAI-----------------------------CSLK---- 681
+I L E P + H E+ FA C K
Sbjct: 713 STIRLNETPHSVLHYEEREVFAYLGEFDEEDLRDTRPDQESTKKLFTPLSIQCPYKSGVV 772
Query: 682 ----NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTA 737
+ S + ++ + + D+ TF + D E + + VGTA
Sbjct: 773 ERDDSNSLTHYTMVNTLVICDEITFTPKWAHFFDVGEISSCMCIAKLGKKDEQFIVVGTA 832
Query: 738 YVLPEENEPTKGRILVF-IVEDGKLQLIAEKETKGAVYSLNAFNG-KLLAAINQKIQLYK 795
+ +E E GRI VF ++ KL L++ K+ GAVYS+ A NG K++ AINQ++++++
Sbjct: 833 -ITADEQECKNGRICVFSYSKEEKLTLVSTKQVNGAVYSVKALNGNKIICAINQQLKVFE 891
Query: 796 WMLRDDGTRELQSECGHHGHILALYVQ-TRGDFIVVGDLMKSISLLIYKHEEGAIEERAR 854
+ LQSE HI + V ++ FI+ DLM+SIS+ YK EGA+EE AR
Sbjct: 892 M----NEQTTLQSEAPIANHITCVAVDVSKNGFILSADLMRSISVFSYKPLEGALEEIAR 947
Query: 855 DYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRF 914
DY+ NWM+A++++DDD Y+GAEN+ N+F +N+E +E+R +L G YH+GE +N
Sbjct: 948 DYHPNWMTAIKMIDDDNYIGAENSENIFICTRNTEAPDEEDRQQLLPTGYYHVGEHINTI 1007
Query: 915 RHGSLVMRLP-DSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGG 973
G+LVM + +S + T + G+V+G +G++A P +Q+ FL KL+ +RKVI+GVG
Sbjct: 1008 VEGNLVMDVHVESSITPTRTFLMGSVSGYVGLLAIFPEKQWQFLSKLEAKMRKVIRGVGK 1067
Query: 974 LNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVE-----ELCKR 1028
++HE WR F ++ + D K F+DGDLIE F DL + E+ + + E ++ +
Sbjct: 1068 IDHESWRRFESDSRMEDCKGFVDGDLIEMFQDLRPEKQKEVISELTMDGEPATHDDVVRL 1127
Query: 1029 VEELTRLH 1036
V++L RLH
Sbjct: 1128 VDDLCRLH 1135
>gi|255316764|gb|ACU01763.1| putative DNA damage binding protein [Brachypodium distachyon]
Length = 384
Score = 667 bits (1721), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 313/384 (81%), Positives = 352/384 (91%), Gaps = 1/384 (0%)
Query: 654 IRSIPLGEHPRRICHQEQSRTFAICSLK-NQSCAEESEMHFVRLLDDQTFEFISTYPLDT 712
IR+IPL E RRICHQEQSRT A CS K N + EESE HF+RLLD QTFEF+ST+PLD
Sbjct: 1 IRTIPLNEQARRICHQEQSRTLAFCSFKYNPNSMEESEAHFIRLLDHQTFEFLSTHPLDQ 60
Query: 713 FEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGA 772
+E GCS++SCSFSDD+N YYCVGTAYVLPEENEPTKGRILVF VEDG+LQLI EKETKGA
Sbjct: 61 YECGCSMISCSFSDDNNFYYCVGTAYVLPEENEPTKGRILVFAVEDGRLQLIVEKETKGA 120
Query: 773 VYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGD 832
VYSLNAFNGKLLAAINQKIQLYKWM R+DG+ ELQSECGHHGHILAL+ QTRGDFIVVGD
Sbjct: 121 VYSLNAFNGKLLAAINQKIQLYKWMTREDGSHELQSECGHHGHILALFTQTRGDFIVVGD 180
Query: 833 LMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGAT 892
LMKSISLL+YKHEE AIEE ARDYNANWM+AVE++DDDIY+GAEN++NLFTVRKNS+ AT
Sbjct: 181 LMKSISLLVYKHEESAIEELARDYNANWMTAVEMIDDDIYVGAENSYNLFTVRKNSDAAT 240
Query: 893 DEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHE 952
DEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPD+++GQIPTVIFGT+NGVIG+IASLPH+
Sbjct: 241 DEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDTEMGQIPTVIFGTINGVIGIIASLPHD 300
Query: 953 QYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMD 1012
QY+FLEKLQ+ L K IKGVG L+H+QWRSF+NEKKT +A+NFLDGDLIESFLDL+R++M+
Sbjct: 301 QYVFLEKLQSILGKFIKGVGSLSHDQWRSFHNEKKTAEARNFLDGDLIESFLDLNRSKME 360
Query: 1013 EISKTMNVSVEELCKRVEELTRLH 1036
E+SK M VSVE L KRVEELTRLH
Sbjct: 361 EVSKGMGVSVENLSKRVEELTRLH 384
>gi|219125301|ref|XP_002182922.1| damage-specific DNA binding protein 1 [Phaeodactylum tricornutum CCAP
1055/1]
gi|217405716|gb|EEC45658.1| damage-specific DNA binding protein 1 [Phaeodactylum tricornutum CCAP
1055/1]
Length = 1284
Score = 607 bits (1564), Expect = e-170, Method: Compositional matrix adjust.
Identities = 411/1176 (34%), Positives = 621/1176 (52%), Gaps = 164/1176 (13%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESS-----ELITRAMGDV 55
+ VPI GRI L F+ HG ++F+ T R ++ VL +D +S L+T A G +
Sbjct: 64 LASVPINGRIVGLVPFKVHGSDTSYVFVLTARQQYAVLAYDRTNSGSAAYPLVTLASGTL 123
Query: 56 SDR----IGRPTDNGQIGIIDPDCRLIGLHLYDGLFKVIP----FDNKGQLKEAFNIRLE 107
+ +G+ ++G I ID R I LH+YDGL +IP ++ L F+ R+E
Sbjct: 124 QSQEHAVLGQEAESGPIVAIDHFHRCIALHVYDGLLTIIPSTRTLASQQLLGTPFHCRIE 183
Query: 108 ELQVLDIKFLYGC--AKPTIVVLYQDNKDARHVKTYEVALKDKD-FVEGP--------WS 156
E +L + FL A P + VL+QD + A+H+ ++ + K K+ F+ G W
Sbjct: 184 ERTILHLAFLQIPFEALPQLAVLHQDARGAQHITSHVINWKRKNIFLYGSSSAPAATEWL 243
Query: 157 Q-NNLDNGADLLIPVP---PP--------LCGVLIIGEETIVYCSANAFKAIPIRPSITK 204
Q +N+D G+ L+IPVP PP GVL++G+ + + + N K +PI ++
Sbjct: 244 QKSNVDGGSSLIIPVPAEAPPDFAPAKHRSGGVLVVGQRQLTFINNNVTKVVPIPQALHL 303
Query: 205 AYGRVDADGS----RYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNA 260
+ AD + RYLL D G LH++ I +KV L+I+ LG ++A++I+YL
Sbjct: 304 CVEELPADPNGGLPRYLLADEFGNLHMVTIVLVVDKVVALQIDTLGSCTLATSIAYLREG 363
Query: 261 VVYIGSSYGDSQLIKLNLQP-----------DAKGSYVEVLERYVNLGPIVDFCVVD--- 306
+V++GS+ GD QLI+++ +P A+ SY+ V+E Y +LGPI+DF +V
Sbjct: 364 LVFVGSTLGDPQLIQIHDEPIVDVEDEEDMVGAESSYLSVVEEYTHLGPILDFDLVPTAP 423
Query: 307 ----------LERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSS 356
+ Q QVVT SG+ K GSLR++RNGIG+NE A+VE+ GI+ +WSLR S
Sbjct: 424 GGGGLGQTEGIHGPSQSQVVTASGSSKSGSLRLIRNGIGMNESAAVEIPGIQNVWSLRRS 483
Query: 357 TDDPFDTFLVVSFISETRILAMNLEDELEETEIEG---------------FCSQTQTLFC 401
D DT+LV SF+ ETR+L + D++ + E EG S TL+
Sbjct: 484 FADVDDTYLVQSFVHETRVLGVTTMDDMSQDEKEGDVAPGGTLEEVFLIGLKSSCATLYV 543
Query: 402 HD--AIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATAN-ASQVLLATGGGHL 458
+ A N L+Q+T G VR +T + + W P G ++ V TAN A Q+ +A GG +
Sbjct: 544 GNVQAHQNGLLQITEGEVRF--ATMEAVLDTWLVPSGAAITVGTANEAGQIAVALNGGKV 601
Query: 459 VYLEIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNL 518
+YL+I +G + E Q+E E+SCL++NP S AVG+W D +VR+F L L
Sbjct: 602 LYLKIEEGKIRECSGQQMEREVSCLNLNPFAATSHTSSFLAVGLWDDFTVRLF----LCL 657
Query: 519 ITKEHLGGEIIPRSVLLCAFEG----ISYLLCALGDGHLLNFLLNMKTGEL--TDRKKVS 572
IT + G + + ++ L LGDG L++F + + + +K+V
Sbjct: 658 ITLDFSSGTSGNTTSTSTSLSSTGSGVNMLFVGLGDGTLISFAVVERGASIFVQSKKEVC 717
Query: 573 LGTQPITLRTF-SSKNTTHVFAASDRPTVIYSS----------NKKLLYSNVNLKE---- 617
LGTQ I L + + T V A DRPTVIY + N KL YSNVNL
Sbjct: 718 LGTQRIDLVPLCTEQGGTCVLATGDRPTVIYLAGVGGISANQFNPKLCYSNVNLSAGDDE 777
Query: 618 ---------------VSHMCPFNSAAFPDS---------LAIAKEGELTIGTIDDIQKLH 653
V+ PF+S+ DS L +A + L +G IDDIQKLH
Sbjct: 778 EEDDVSRPPSQQSIVVNVATPFSSSLLFDSATGGSQRYSLCVADDSFLRMGIIDDIQKLH 837
Query: 654 IRSIPLGEHPRRICHQEQSRTFAICSLKN-----QSCAEESEM-HFVRLLDDQTFEFIST 707
+ + LG P RI H R FA+ +++ +E+ M + +R +DD F+ I
Sbjct: 838 VTTCRLGMAPCRIVHCADGRLFAVGCIESGIKQFSLGGDEANMGNCIRFMDDANFDDIHR 897
Query: 708 YPLDTFEYGCSILSCSFSDDSN----------VYYCVGTAYVLPEENEPTKGRILVFIVE 757
L+ FE S++ + S+ + VGTAY +P+E+EP++GRILV+ +
Sbjct: 898 VDLEPFEMILSMVYATLRIPSDGDQSDQPVHRPFLLVGTAYAMPDEDEPSRGRILVYSCQ 957
Query: 758 DGK----------LQLIAEKETKGAVYSLNAF-NGKLLAAINQKIQLYKWMLRDDGTREL 806
+ ++ I E T+G VYS+ F +G L +N K + + ++ D G L
Sbjct: 958 ADEASGTPTSTRAVRQITEMSTQGGVYSICQFYDGNFLCTVNSKTHVVQ-IVADCGVLRL 1016
Query: 807 Q-SECGHHGHILALYVQTRGD-FIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAV 864
+ GHHGHI++L+V++R +VGDLM+S+SL+ Y + +EE ARD+N NW +AV
Sbjct: 1017 EYVGIGHHGHIVSLFVKSRAKPLAIVGDLMRSVSLMQYYPQHETLEEVARDFNPNWTTAV 1076
Query: 865 EILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRL- 923
E+L DD+Y+GAEN NLF +R+N ++E R RL+ +GE+HLGE N+F GSLVM +
Sbjct: 1077 EMLTDDVYIGAENWNNLFCLRRNKAATSEEIRCRLDNIGEFHLGEMCNKFMSGSLVMPVS 1136
Query: 924 PDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFN 983
+S +FGTV G +GVI L F L+ + K I+ VGG +H+ +RS
Sbjct: 1137 SNSTTSSRRATLFGTVEGSLGVILGLDGRTAAFFITLERAIAKTIQPVGGFSHQLYRSCQ 1196
Query: 984 NEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN 1019
E + A F+DGDL+E+FLDL R M+ + MN
Sbjct: 1197 AELRVHPAHGFVDGDLVETFLDLDRRTMEAVVAEMN 1232
>gi|384500266|gb|EIE90757.1| hypothetical protein RO3G_15468 [Rhizopus delemar RA 99-880]
Length = 1057
Score = 604 bits (1558), Expect = e-170, Method: Compositional matrix adjust.
Identities = 363/957 (37%), Positives = 559/957 (58%), Gaps = 51/957 (5%)
Query: 99 KEAFNIRLEELQVLDIKFLYGCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQN 158
++ NI L + +V+ + FL PT+++LY+D + R ++ + + KD+ V G +
Sbjct: 132 QDHINISLPDKKVISLAFLQDTLDPTLLILYEDALEQRLLQMFTI--KDRQLVPGDIILD 189
Query: 159 NLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFK-AIPIRPSITKAYGRVDADGSRYL 217
+ ++ A LLI +PP + GVL++ + I Y N AI IR S ++ ++ +GSR L
Sbjct: 190 HFESDASLLIAMPPAVGGVLLVASKFIRYLKPNQPPIAIGIRSSTINSHCIMNEEGSRVL 249
Query: 218 LGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLN 277
LGD GLL+LL + + V L LG SI S ++YLDN +V++GS+ DSQL+ +
Sbjct: 250 LGDAEGLLYLLALNTTNQCVESLSFIYLGSISIPSCLAYLDNDIVFVGSNLADSQLVYIQ 309
Query: 278 LQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGIN 337
++++E + NLGPI DFCV D + GQ QV+TCSGA K+GSLRI+RNG+G+N
Sbjct: 310 RTTGESEDILQIIETFANLGPITDFCVAD--KGGQTQVITCSGAGKEGSLRIIRNGVGLN 367
Query: 338 EQASVELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCS--- 394
E A + + G+KG+W+L +D L++SF+ +TR+L + + + +++ F +
Sbjct: 368 ELAMIPISGVKGIWALGE-----YD-LLLMSFVHQTRLLQLQKDHTI--VQLDTFSAIDL 419
Query: 395 QTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATG 454
+TL + ++QVT SVRL+ + S L + W S + VA+ N +Q +++ G
Sbjct: 420 NARTLVAGCVVDGMIIQVTDHSVRLMDTMS--LLDVWSSDE--LITVASVNPTQCVISLG 475
Query: 455 GGHLVYLEIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMW-TDISVRIFSL 513
G LV L++ + L + +L +EISC+DI+PIG + S A+G W ++ +V + SL
Sbjct: 476 FGKLVALQVLNRKLNVIGETRLSFEISCIDIHPIGSR-TESAFVALGTWNSNTNVCLLSL 534
Query: 514 PDLNLITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSL 573
DL I ++ LGG ++PRS+L+ FE YLL ALGDG NF L+ +G+L+D+K+ L
Sbjct: 535 SDLQPIAQKSLGGTVVPRSILISQFENTVYLLVALGDGQFYNFKLDSISGQLSDKKRTFL 594
Query: 574 GTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSL 633
G PI L FS THVFAASD+P+V++S N+KL+YSNVNLKEV F AFPD++
Sbjct: 595 GKLPIHLSNFSLNGVTHVFAASDKPSVVHSRNQKLVYSNVNLKEVRCATSFGCHAFPDAV 654
Query: 634 A-IAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICS--LKNQSCAEESE 690
A I KEG L IG +++IQKLHI IP + PRRI +QE +++F I + + + +
Sbjct: 655 ALITKEG-LIIGQMEEIQKLHITKIPTIDTPRRIVYQESTKSFGIITERMISDRYRPSTV 713
Query: 691 MHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGR 750
+LDDQ+F + FE S+ S F DD N YY V + +EN+ GR
Sbjct: 714 TGGFEVLDDQSFTVLDRIYFKEFERPLSVTSVLFEDDPNEYYIVASG----KENDGL-GR 768
Query: 751 ILVF-IVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWM---LRDDGTREL 806
ILV + D KL+LI++ +T G + + GKLLA+I + LY+W L +R L
Sbjct: 769 ILVLQLASDRKLRLISQLKTGGMIDCVRPIEGKLLASIQGTLYLYRWQSQRLVKVSSRRL 828
Query: 807 QSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEI 866
S + + T +FI+ GDL S+ + Y + + E A + A++
Sbjct: 829 PS--------VTRCMTTHENFIMTGDLAYSVVMFQYDRQSDQLLEVAAHEKTKEVLAMKA 880
Query: 867 LDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDS 926
+D ++ +GAE +LF + + + +E L+V+ +HLG+ V+RFR GSL M D
Sbjct: 881 IDSNLVIGAEREGHLFVLEHCQDEVSADE-PLLDVISTWHLGDVVSRFRFGSLGMNNVDP 939
Query: 927 DVGQI-PTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNE 985
D I P++IF T +G IGVIA L E+Y L ++Q N+ +V+KG+G L+H WR+ N
Sbjct: 940 DSSPIAPSLIFATASGAIGVIADLSPERYKLLYQMQCNMCRVVKGIGELSHTDWRNVNIM 999
Query: 986 KKTVDAKNFLDGDLIESFLDLSRTRMDEI------SKTMNVSVEELCKRVEELTRLH 1036
+ +A NF+DGDLIESFLDLS +M + + ++++VE+LCK VEEL +H
Sbjct: 1000 YRKEEAMNFIDGDLIESFLDLSSQQMQNVVDGLHGGRKLDLTVEDLCKVVEELMSIH 1056
>gi|17541566|ref|NP_502299.1| Protein DDB-1 [Caenorhabditis elegans]
gi|74965443|sp|Q21554.2|DDB1_CAEEL RecName: Full=DNA damage-binding protein 1; AltName:
Full=Damage-specific DNA-binding protein 1
gi|5824558|emb|CAA92824.2| Protein DDB-1 [Caenorhabditis elegans]
Length = 1134
Score = 596 bits (1536), Expect = e-167, Method: Compositional matrix adjust.
Identities = 368/1098 (33%), Positives = 603/1098 (54%), Gaps = 81/1098 (7%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRP 62
++PIYG++ T+ L + + + L + TE++ +L + +++TRA G ++D GR
Sbjct: 54 EIPIYGQVLTIALVKCKRDKRHSLIVVTEKWHMAILAY--RDGKVVTRAAGCIADPTGRA 111
Query: 63 TDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL-YGCA 121
TDN + LI + ++G K+I +++ L+ FN+R + V D KF+ G
Sbjct: 112 TDN-LFSLTIHRNGLIAIRAFEGSVKMIQWESGTDLRH-FNVRFDYPNVSDFKFVDTGED 169
Query: 122 KPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIG 181
V D+ +H++ ++ + DK+F Q ++ + +LIPVP + GV+++G
Sbjct: 170 DVYRVAFIYDDDHGKHLQFSDLNMHDKEF-RTYSRQASIAADSSVLIPVPHAIGGVIVLG 228
Query: 182 EETIVY-CSANAFKAIPIRPSITK-----AYGRVDADGSRYLLGDHAG-LLHLLVITHEK 234
+++Y + N + +P S+ + +G VDA G R+LL D G LL LL+ E
Sbjct: 229 SNSVLYKPNDNLGEVVPYTCSLLENTTFTCHGIVDASGERFLLSDTDGRLLMLLLNVTES 288
Query: 235 EK---VTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLE 291
+ V ++I+ LGETSIA +I+Y+DN VV++GS GDSQLI+L +P+ GSY +LE
Sbjct: 289 QSGYTVKEMRIDYLGETSIADSINYIDNGVVFVGSRLGDSQLIRLMTEPNG-GSYSVILE 347
Query: 292 RYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMW 351
Y N+GPI D +V E GQ Q+VTC+GA KDGSLR++RNGIGI+E ASV+L G+ G++
Sbjct: 348 TYSNIGPIRDMVMV--ESDGQPQLVTCTGADKDGSLRVIRNGIGIDELASVDLAGVVGIF 405
Query: 352 SLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQ--- 408
+R D D +++VS ET +L + E ELE+ ++ + T+F
Sbjct: 406 PIR--LDSNADNYVIVSLSDETHVLQITGE-ELEDVKLLEINTDLPTIFASTLFGPNDSG 462
Query: 409 -LVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYL------ 461
++Q T +RL+SS+ L W+ G ++ + NA+ + VYL
Sbjct: 463 IILQATEKQIRLMSSSG--LSKFWEPTNGEIISKVSVNAANGQIVLAARDTVYLLTCIVD 520
Query: 462 EIGDGILTEVKHAQLEYEISCLDINPIGENPS-YSQIAAVGMWTDISVRIFSLPDLNLIT 520
E+G + + E EI+CLD++ G++P+ + + W+ ++ + LPDL +
Sbjct: 521 EMGALDIQLTAEKKFENEIACLDLSNEGDDPNNKATFLVLAFWSTFAMEVIQLPDLITVC 580
Query: 521 KEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITL 580
L +IIPRS++ E + YLL A GDG L+ ++ ++KTG + KK ++GT+P +L
Sbjct: 581 HTDLPTKIIPRSIIATCIEEVHYLLVAFGDGALVYYVFDIKTGTHGEPKKSNVGTRPPSL 640
Query: 581 RTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGE 640
+KN H+F SDRP +I+S++KKL++SNVN+K V +C +S+A+ D L I+
Sbjct: 641 HRVRNKNRQHLFVCSDRPVIIFSASKKLVFSNVNVKLVDTVCSLSSSAYRDCLVISDGNS 700
Query: 641 LTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEE------------ 688
+ GT+DDIQK+H+RSIP+GE RI +Q+ + T+ +CS + +S AE
Sbjct: 701 MVFGTVDDIQKIHVRSIPMGESVLRIAYQKSTSTYGVCSNRTESKAERVFASKNALVTSQ 760
Query: 689 -----------------SEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVY 731
+ +LD TF+ + ++ +E S +S F++DS+ Y
Sbjct: 761 SRPKVASTRADMDESPPNTTSSFMVLDQNTFQVLHSHEFGPWETALSCISGQFTNDSSTY 820
Query: 732 YCVGTAYVLPEENEPTKGRILVFIVED---GKLQLIAEKETKGAVYSLNAFNGKLLAAIN 788
Y VGT + P+E E GRI+VF V+D KL+ + E +G+ ++ NGKL+AAIN
Sbjct: 821 YVVGTGLIYPDETETKIGRIVVFEVDDVERSKLRRVHELVVRGSPLAIRILNGKLVAAIN 880
Query: 789 QKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGA 848
I+L++W +EL+ EC H++AL ++ + + V D+M+S+SLL Y+ EG
Sbjct: 881 SSIRLFEWTT----DKELRLECSSFNHVIALDLKVMNEEVAVADVMRSVSLLSYRMLEGN 936
Query: 849 IEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR-KNSEGATDEERGRLEVVGEYHL 907
EE A+D+N+ WM E + + LG E + NLFTV + TD+ R LE G ++L
Sbjct: 937 FEEVAKDWNSQWMVTCEFITAESILGGEAHLNLFTVEVDKTRPITDDGRYVLEPTGYWYL 996
Query: 908 GEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKV 967
GE +LV++ DS + ++FGT G IG+I + + FL ++ +
Sbjct: 997 GELPKVMTRSTLVIQPEDSIIQYSQPIMFGTNQGTIGMIVQIDDKWKKFLIAIEKAIADS 1056
Query: 968 IKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRT-RMDEISKTMNVSVE--- 1023
+K + H +R+F +K+ F+DGDL+ES LD+ R+ MD +SK + +
Sbjct: 1057 VKNCMHIEHSSYRTFVFQKRAEPPSGFVDGDLVESILDMDRSVAMDILSKVSDKGWDPSL 1116
Query: 1024 -----ELCKRVEELTRLH 1036
E+ K +E+L R+H
Sbjct: 1117 PRDPVEILKVIEDLARMH 1134
>gi|268536658|ref|XP_002633464.1| C. briggsae CBR-DDB-1 protein [Caenorhabditis briggsae]
Length = 1134
Score = 578 bits (1491), Expect = e-162, Method: Compositional matrix adjust.
Identities = 374/1102 (33%), Positives = 598/1102 (54%), Gaps = 85/1102 (7%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+ ++PIYG+++T+ L + E Q L + T+++ VL + ++ITRA G +++ G
Sbjct: 52 ICEIPIYGQVSTISLIKFRREKQHSLVVVTDKFNLAVLGY--RDGKVITRAAGSIAEYSG 109
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
R D + + LI + Y+G K++ ++ ++ FN+R + V D KF+
Sbjct: 110 RSADTA-VAMTIHRSGLIAIIGYEGSVKMVHWEPGADVRH-FNVRFDYPNVSDFKFVDTG 167
Query: 121 AKPTIVVLY-QDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLI 179
T+ V + D+ +H++ ++ + DK+F Q+ + + +LIPVP P+ GV++
Sbjct: 168 DDDTVRVAFIYDDDHGKHLQFSDLNMHDKEF-HTFSKQSCIAADSQVLIPVPAPVGGVIV 226
Query: 180 IGEETIVYCSANA------FKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVIT-- 231
+G + +Y +++ + ++ +I +G VDA G R+LL D G L +L++
Sbjct: 227 LGANSALYKASDVNGDVVPYSCSLLKNTIFTCHGIVDASGDRFLLADTDGRLLMLLLNIG 286
Query: 232 --HEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEV 289
V ++IE LGETS+A +++Y+DN VV++GS GDSQLI+L P+ GSY V
Sbjct: 287 EGRSGTTVKEMRIEYLGETSVADSVNYVDNGVVFVGSRLGDSQLIRLMTAPNG-GSYSVV 345
Query: 290 LERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKG 349
LE Y N GPI D +V E GQ Q+VTCSGA KDGSLR++RNGIGI E ASV+L + G
Sbjct: 346 LETYTNTGPIRDMVLV--ESDGQPQLVTCSGADKDGSLRVIRNGIGIEELASVDLAKVIG 403
Query: 350 MWS--LRSSTDDPFDTFLVVSFISETRILAMN---LED-ELEETEIEGFCSQTQTLFCHD 403
M+ LRS+TD+ F++VS ET +L + LED +L E E E +LF D
Sbjct: 404 MFPIRLRSTTDN----FVIVSLPDETHVLKITGEELEDVQLLEIETERTTMYASSLFGPD 459
Query: 404 AIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYL-- 461
++QVT +R +S + W+ G SV+ + NA + VY
Sbjct: 460 D-SELILQVTEEEIRFMSFQKQV--KIWRPTNGESVSKVSVNAIHGQIVVAARDTVYYLK 516
Query: 462 ----EIGDGILTEVKHAQLEYEISCLDINPIGEN-PSYSQIAAVGMWTDISVRIFSLPDL 516
E G + V + E EI+CLDI+ G++ + +W S+ + L DL
Sbjct: 517 CMVDEAGALDIQIVAERKFEAEIACLDISNEGDDYKKPGTFMVLALWGSFSMEVVQLSDL 576
Query: 517 NLITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQ 576
+ +L +I+PRS++ E + YLL A GDG L+ ++ ++KTG + KK S+GT+
Sbjct: 577 KTACQTNLPSKIVPRSIVATCIEDVHYLLVAFGDGALIYYVFDIKTGTHGEAKKSSVGTR 636
Query: 577 PITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIA 636
P +L +KN H+F SDRP +I+S+NKKL++SNVN+K V+ +C +S ++ D L I+
Sbjct: 637 PPSLHRVRNKNRQHLFVCSDRPVIIFSANKKLVFSNVNVKVVNTVCSLSSTSYRDCLVIS 696
Query: 637 KEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAE--------- 687
+ GT+DDIQK+HIRSIP+GE RI +Q+ + T+ +CS ++++ E
Sbjct: 697 DGNSMVFGTVDDIQKIHIRSIPMGESVMRIAYQKSTGTYGVCSSRSETKPERVYASKNAL 756
Query: 688 ---ESEMHFVR-----------------LLDDQTFEFISTYPLDTFEYGCSILSCSFSDD 727
+S V +LD TF+ + + FE S +S FSDD
Sbjct: 757 ANSQSRAKIVATRNEVNDGPPASTSSFMILDQNTFQVLHAHEFGPFEAAVSCISGQFSDD 816
Query: 728 SNVYYCVGTAYVLPEENEPTKGRILVFIVED---GKLQLIAEKETKGAVYSLNAFNGKLL 784
+ YY VGT + P+E++ GRI+VF V+D KL+ + E +G+ +L NGKL+
Sbjct: 817 ARQYYIVGTGLIYPDESDTKLGRIIVFEVDDVERTKLRRVHELVVRGSPLALRILNGKLV 876
Query: 785 AAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKH 844
AAIN ++L++W + L+ EC + HI+AL ++ + + V DLM+S+SLL Y+
Sbjct: 877 AAINSSVRLFEWT----ADKVLRLECSNFNHIVALDLKVMNEEVAVADLMRSVSLLSYRM 932
Query: 845 EEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR-KNSEGATDEERGRLEVVG 903
EG EE A+D+N+ WM E + + LG E + N+FTV S TD+ R LE G
Sbjct: 933 MEGNFEEVAKDWNSEWMVTCEFITAESILGGEAHLNMFTVEVDKSRPITDDGRYVLEPTG 992
Query: 904 EYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTN 963
++LGE SLV++ DS + ++FGT G IG++ + + FL ++
Sbjct: 993 YWYLGELPKVMVRASLVVQPEDSTIEYSHPIMFGTNQGTIGMLVQIDDKWKKFLVSIEKA 1052
Query: 964 LRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTM----- 1018
+ +K + H +RSF +K+ F+DGDL+ES LD+ R+ +I K +
Sbjct: 1053 ISDSVKNCMQIEHSTYRSFIFQKRIEPPSGFIDGDLVESILDMDRSVAIDILKKVSDKGW 1112
Query: 1019 NVSVE----ELCKRVEELTRLH 1036
+ S+ E+ K +E+L R+H
Sbjct: 1113 DASLPRDPVEMLKVIEDLARMH 1134
>gi|308477185|ref|XP_003100807.1| CRE-DDB-1 protein [Caenorhabditis remanei]
gi|308264619|gb|EFP08572.1| CRE-DDB-1 protein [Caenorhabditis remanei]
Length = 1154
Score = 570 bits (1469), Expect = e-159, Method: Compositional matrix adjust.
Identities = 378/1123 (33%), Positives = 602/1123 (53%), Gaps = 111/1123 (9%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRP 62
++PIYG++ T+ L R E + L + TE+++ VL + ++ITR G ++D+ GR
Sbjct: 54 EIPIYGQVLTMALVRCKREKRQSLVVVTEKWQMAVLTY--RDGKVITRTAGALADQSGRA 111
Query: 63 TDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL-YGCA 121
+DN + L+ + Y+G K+I ++ G +FN+R + V D KF+ G
Sbjct: 112 SDN-LFSLTIHRSGLVAIRAYEGSVKMIQWE-PGTDVRSFNVRFDYPNVSDFKFIDTGVD 169
Query: 122 KPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIG 181
+ D+ +H++ ++ + DK+ Q ++ A +LIPVP P+ GVL++
Sbjct: 170 DTYRIAFIYDDDHGKHLQFSDLNMHDKEL-HTFSRQASIAADASVLIPVPAPISGVLVLA 228
Query: 182 EETIVYCSANA------FKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE 235
+I+Y S++ + + + ++ +G VD G R++L D G L +L++ +
Sbjct: 229 ANSILYKSSDVNGDVVPYASPLLDNTVFTCHGLVDPSGERFILSDTEGRLLMLILNIGEG 288
Query: 236 K----VTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLE 291
+ V ++IE LGETSIA +I+Y+D VV++GS GDSQLI+L P GSY VLE
Sbjct: 289 RSGITVKDMRIEYLGETSIADSINYIDAGVVFVGSRLGDSQLIRLMPTPSG-GSYSVVLE 347
Query: 292 RYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMW 351
Y N+GPI D +V E GQ Q+VTCSGA KDGSLR++RNGIGI E ASVEL G+ G++
Sbjct: 348 TYSNIGPIRDMIMV--ESDGQAQLVTCSGAEKDGSLRVIRNGIGIEELASVELAGVIGIF 405
Query: 352 SLR--SSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQ- 408
+R S+TD+ +++VS ET +L +N E ELE+ ++ C++ T+F I+
Sbjct: 406 PIRLNSTTDN----YVIVSLAEETHVLQINGE-ELEDVQLLQICTEMPTIFA-STIFGPD 459
Query: 409 ----LVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYL--- 461
L+QVT VR ++ + L W+ P G ++ + NA + VY
Sbjct: 460 NSEVLLQVTEKHVRFMAFSG--LSKIWEPPNGELISKVSVNAIHGQIVVAARDTVYFLLC 517
Query: 462 ---EIGDGILTEVKHAQLEYEISCLDINPIGENPSY-SQIAAVGMWTDISVRIFSLPDLN 517
E+G + V + E EI+CLDI+ G++ + + +W+ + + LPDL
Sbjct: 518 VIEEMGGLDINLVAERKFEDEIACLDISNEGDDHTKPGTFMVLALWSTFCMEVVQLPDLK 577
Query: 518 LITKE------------------HLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLN 559
++ E +L +I+PRS++ E + YLL A GDG L+ ++ +
Sbjct: 578 TVSCERRSLLETVLSYLSQVCQTNLPSKIVPRSIVATCIEEVHYLLIAFGDGALVYYVFD 637
Query: 560 MKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVS 619
+KTG + KK S+GT+P TL +KN H+F SDRP +I+SS+KKL++SNVN+K V+
Sbjct: 638 IKTGTHGEAKKSSVGTRPPTLYRVRNKNRQHLFVCSDRPVIIFSSSKKLVFSNVNVKVVN 697
Query: 620 HMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICS 679
+C +S+++ D L I+ + GT+DDIQK+H+RSIP+GE RI +Q S T+ +CS
Sbjct: 698 TVCSLSSSSYCDCLVISDGISMVFGTVDDIQKIHVRSIPMGESVLRIAYQRSSGTYGVCS 757
Query: 680 LKNQSCAE------------------------------ESEMHFVRLLDDQTFEFISTYP 709
+ +S E +S F+ +LD TF+ ++
Sbjct: 758 SRTESKRERIYASKNAIYTSNSRPKITSTRNEANDNPPKSTSSFM-VLDQNTFQRSGSFS 816
Query: 710 LDTFEYGCSILSC---SFSDDSNVYYCVGTAYVLPEENEPTKGRILVF---IVEDGKLQL 763
+ I SC F++DS VYY VGT + PEE + GRI+VF VE KL+
Sbjct: 817 -NNIRKTYMIFSCISGQFTNDSKVYYIVGTGLIYPEETDTKFGRIVVFEVDEVERSKLRR 875
Query: 764 IAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQT 823
+ + +G+ +L NGKL+AAIN ++L++W + +EL+ EC + HI+AL ++
Sbjct: 876 VHDLVCRGSPLALRILNGKLVAAINSSVRLFEWTM----DKELRLECSNFNHIMALDLKV 931
Query: 824 RGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFT 883
+ + V D+M+S+SLL Y+ EG EE A+D+N+ WM E + + LG E + NLFT
Sbjct: 932 MNEEVAVADVMRSVSLLSYRMLEGNFEEVAKDWNSEWMVTCEFITAEQILGGEAHLNLFT 991
Query: 884 VR-KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGV 942
V S TD+ R LE G Y+LGE SLV + D + ++FGT G
Sbjct: 992 VEVDKSRPITDDGRYVLEPTGYYYLGELPRVMVRSSLVAQPDDCSIQYSQPIMFGTNQGS 1051
Query: 943 IGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIES 1002
IG++ + + FL ++ + +K + H +RSF +K+ F+DGDL+ES
Sbjct: 1052 IGMVVQIDDKWKKFLIAVEKAIADSVKNCMHIEHTTYRSFIFQKRLESPTGFIDGDLVES 1111
Query: 1003 FLDLSRTRMDEI-----SKTMNVSVE----ELCKRVEELTRLH 1036
LD+ R+ I K + S+ E+ K +E+L R+H
Sbjct: 1112 ILDMDRSAAIAILYKVSDKGWDASLPRDPIEILKVIEDLARMH 1154
>gi|341884150|gb|EGT40085.1| CBN-DDB-1 protein [Caenorhabditis brenneri]
Length = 1134
Score = 567 bits (1462), Expect = e-159, Method: Compositional matrix adjust.
Identities = 364/1100 (33%), Positives = 598/1100 (54%), Gaps = 87/1100 (7%)
Query: 4 VPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPT 63
+PIYG++ +L L + +++ L + TE++ +L + ++TRA G + D GR
Sbjct: 55 IPIYGQVLSLALVKAKRDSRQSLVVLTEKWHMSILAY--RDGAIVTRAAGCILDPTGRTV 112
Query: 64 DNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGCAKP 123
DN + LI + ++G+ K+I ++ L+ FN+R + V + KF+
Sbjct: 113 DNF-FTLSVHRSGLIAIKAFEGIVKLIQWEAGADLRH-FNVRFDFPNVSEFKFVDTTEDD 170
Query: 124 T--IVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIG 181
+++D+ + +H++ ++ + DK+F Q ++ + +LIPVP P+ GV+++G
Sbjct: 171 VYRAAFIFEDD-NGKHLQFSDLNMHDKEF-RPHLRQTSIAADSSMLIPVPSPISGVVVLG 228
Query: 182 EETIVYCSA-NAFKAIP-----IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE 235
+++Y S+ N + +P + +I ++ VD G R+++ D G L +L++ +
Sbjct: 229 THSLLYKSSENDGEVVPYSSPLLENTIFTSHSIVDPTGERFIVSDTDGRLLMLLLNAVEN 288
Query: 236 K----VTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLE 291
+ V ++I+LLG+TS+A +I+Y+DN VV+IGS +GDSQLI+L L SY+ VL+
Sbjct: 289 QSGLSVKEIRIDLLGDTSVAESINYIDNGVVFIGSRFGDSQLIRL-LSEKTNSSYISVLD 347
Query: 292 RYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMW 351
Y N+GPI D +V E GQ Q+VTCSGA KDGSLR++RNGIGI E A+V+L G+ G++
Sbjct: 348 TYYNIGPIRDMIMV--ESDGQPQLVTCSGAEKDGSLRVIRNGIGIEELATVDLPGVVGIF 405
Query: 352 SLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAI----YN 407
+R D D +++VS + ET +L + E ELE+ + + T+F
Sbjct: 406 PIR--LDSSADNYVIVSLVEETHVLQITGE-ELEDVQFLQIDTALPTMFAGTLFGPNDSG 462
Query: 408 QLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANA--SQVLLATGGGHLVYL---- 461
+VQVT VRL+S+ L W+ G ++ NA QV +A H+ +L
Sbjct: 463 LVVQVTERQVRLMSNGG--LSKFWEPANGEMISKVAVNAVSGQVCVA-ARDHVYFLSCIV 519
Query: 462 -EIGDGILTEVKHAQLEYEISCLDINPIGENPSY-SQIAAVGMWTDISVRIFSLPDLNLI 519
E+G ++ + Q E EI+CLDI+ G++ + + W S+ + LPDL +
Sbjct: 520 DEMGALDISVIAEKQFEDEIACLDISNEGDDADKPATFMVLAFWRTFSMEVVQLPDLKTV 579
Query: 520 TKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPIT 579
+ L I+PRS++ E + YLL A GDG L ++ ++KTG + KK S+GT+P +
Sbjct: 580 CQTDLPSRIVPRSIIATCIEDVHYLLVAFGDGALTYYVFDIKTGTHGEPKKSSVGTRPPS 639
Query: 580 LRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEG 639
L +KN H+F SDRP +I+SS+KKL++SNVN+K V+ +C +S + D L I+
Sbjct: 640 LHRVRNKNRQHLFVCSDRPMIIFSSSKKLVFSNVNVKVVNTVCSMSSRVYRDCLVISDGN 699
Query: 640 ELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAE------------ 687
+ GT+DDIQK+H+R+IP+GE R+ HQ+ S T+ +CS + + E
Sbjct: 700 CMVFGTVDDIQKVHVRTIPMGESVLRVAHQKSSGTYGVCSSRTDTRFERITASKSAIGKS 759
Query: 688 ------------------ESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSN 729
+S F+ +LD TF+ + ++ E S++S F+DD+N
Sbjct: 760 LSKPKVTATRNDMNETPPKSTSSFI-VLDHNTFQVLHSHTFGPHETAVSVISGQFNDDNN 818
Query: 730 VYYCVGTAYVLPEENEPTKGRILVFIVED---GKLQLIAEKETKGAVYSLNAFNGKLLAA 786
YY VGTA V P+E+E GRI+VF V++ KL+ + E +GA + NGKL+AA
Sbjct: 819 SYYIVGTALVYPDESETKIGRIIVFEVDETDKTKLRFMTEIVVRGAPMGIRILNGKLVAA 878
Query: 787 INQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEE 846
IN +++++W +EL+ EC HI A+ ++ + I V D+M+S+SLL Y+ E
Sbjct: 879 INSSVRMFEWT----AEKELRVECSTFNHIAAVDLKVLNEEIAVADVMRSVSLLSYRTLE 934
Query: 847 GAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR-KNSEGATDEERGRLEVVGEY 905
G EE A+D+N+ WM E + + LG E + N+FTV S TD+ R LE G +
Sbjct: 935 GNFEEVAKDWNSEWMVTCEFITAESILGGEAHLNMFTVEVDKSRPVTDDGRYVLEPTGYW 994
Query: 906 HLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLR 965
+LGE LV + D+ + +++GT G +G++ + FL ++ +
Sbjct: 995 YLGELTKVMIRAVLVPQPDDNSIRYTQPIMYGTNQGSLGLVVQIDDMYKKFLLSIEKAIS 1054
Query: 966 KVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEI-----SKTMNV 1020
K + H +RSF K+ F+DGDLIES LD+ R+R EI +K +
Sbjct: 1055 DAEKNCMQIEHSTYRSFTYNKRIEPPSGFIDGDLIESILDMDRSRAIEILEKANTKGWDP 1114
Query: 1021 SVE----ELCKRVEELTRLH 1036
S+ E+ K +++L+R H
Sbjct: 1115 SIPKDPVEILKIIDDLSRTH 1134
>gi|390366809|ref|XP_780126.3| PREDICTED: DNA damage-binding protein 1-like isoform 1
[Strongylocentrotus purpuratus]
Length = 630
Score = 559 bits (1441), Expect = e-156, Method: Compositional matrix adjust.
Identities = 293/565 (51%), Positives = 375/565 (66%), Gaps = 59/565 (10%)
Query: 527 EIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSK 586
EIIPRS+LL FEG +Y+LCALGDG L F LN +TG +TDRKKV LGTQP L+TF S
Sbjct: 70 EIIPRSILLTTFEGQNYILCALGDGSLFYFQLNAETGYMTDRKKVILGTQPTVLKTFKSL 129
Query: 587 NTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTI 646
+T +VFA SDRPTVIYSSN KL++SNVNLKEVS+MCP NS +PDSLA+ + L IG+I
Sbjct: 130 STVNVFACSDRPTVIYSSNHKLVFSNVNLKEVSYMCPLNSDGYPDSLALCNDTTLMIGSI 189
Query: 647 DDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKN----------------QSCA---- 686
D+IQKLHIR++PLGE P RI +QE S+TF I S + QS +
Sbjct: 190 DEIQKLHIRTVPLGETPLRITYQEPSQTFGIISTRTDVVDSSGTTASMGQTRQSASTSAL 249
Query: 687 -------------------------EESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILS 721
+E E+H + ++D TFE + + + EY S++S
Sbjct: 250 NITKSGGNKGMAGQAGGSGEGSSFGDEVEVHSLLVIDQHTFEVLHAHHFGSSEYATSLIS 309
Query: 722 CSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNG 781
C +D N YY VG A V P+E EP GRI+VF DGKLQ IAEKE KGA YSL FNG
Sbjct: 310 CKLCNDPNWYYIVGLANVHPDEAEPKSGRIVVFQYSDGKLQEIAEKEIKGAPYSLVEFNG 369
Query: 782 KLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLI 841
KLLA++N ++L++W L+ EC H+ ++LALY++T+GDFIVVGDLM+SI+LL
Sbjct: 370 KLLASVNSVVRLFEWTPE----HSLRVECSHYNNVLALYLKTKGDFIVVGDLMRSITLLA 425
Query: 842 YKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEV 901
YK EG +EE ARDY+ NWMSAVEILDDD +LGAEN+ NLFT +K+S TDEER L+
Sbjct: 426 YKPMEGCLEEIARDYSPNWMSAVEILDDDTFLGAENSSNLFTCQKDSAATTDEERRHLQE 485
Query: 902 VGEYHLGEFVNRFRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKL 960
VG +HLGEFVN FRHGSLVM+ + +S + +V+FGTV+G +G++ L E Y FL ++
Sbjct: 486 VGLFHLGEFVNVFRHGSLVMQNIGESTIPTTGSVLFGTVSGSVGLVTQLNEEFYRFLLEV 545
Query: 961 QTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV 1020
Q L KVIK VG + H WRSF +E+KT NF+DGDL+ESFLDLSR MDE+++ + +
Sbjct: 546 QNKLTKVIKSVGKIKHSFWRSFYSERKTEPMDNFIDGDLLESFLDLSRDTMDEVAQGLQI 605
Query: 1021 S---------VEELCKRVEELTRLH 1036
+L K VEELTR+H
Sbjct: 606 DDGGMKRDCMANDLIKIVEELTRIH 630
Score = 105 bits (261), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 49/71 (69%), Positives = 57/71 (80%)
Query: 565 LTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPF 624
+TDRKKV LGTQP L+TF S +T +VFA SDRPTVIYSSN KL++SNVNLKEVS+MCP
Sbjct: 1 MTDRKKVILGTQPTVLKTFKSLSTVNVFACSDRPTVIYSSNHKLVFSNVNLKEVSYMCPL 60
Query: 625 NSAAFPDSLAI 635
NS +PD I
Sbjct: 61 NSDGYPDRYEI 71
>gi|74208347|dbj|BAE26370.1| unnamed protein product [Mus musculus]
Length = 599
Score = 549 bits (1415), Expect = e-153, Method: Compositional matrix adjust.
Identities = 286/548 (52%), Positives = 361/548 (65%), Gaps = 54/548 (9%)
Query: 539 EGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRP 598
E YLLCALGDG L F LN++TG L+DRKKV+LGTQP LRTF S +TT+VFA SDRP
Sbjct: 56 ESSHYLLCALGDGALFYFGLNIETGLLSDRKKVTLGTQPTVLRTFRSLSTTNVFACSDRP 115
Query: 599 TVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIP 658
TVIYSSN KL++SNVNLKEV++MCP NS +PDSLA+A LTIGTID+IQKLHIR++P
Sbjct: 116 TVIYSSNHKLVFSNVNLKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKLHIRTVP 175
Query: 659 LGEHPRRICHQEQSRTFAICSLK------------------------------------- 681
L E PR+IC+QE S+ F + S +
Sbjct: 176 LYESPRKICYQEVSQCFGVLSSRIEVQDSSGGTTALRPSASTQALSSSVSSSKLFSSSTA 235
Query: 682 --NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYV 739
S EE E+H + ++D TFE + + EY S++SC D N Y+ VGTA V
Sbjct: 236 PHETSFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFIVGTAMV 295
Query: 740 LPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLR 799
PEE EP +GRI VF DGKLQ +AEKE KGAVYS+ FNGKLLA+IN ++LY+W
Sbjct: 296 YPEEAEPKQGRIAVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRLYEWTTE 355
Query: 800 DDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNAN 859
+EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK EG EE ARD+N N
Sbjct: 356 ----KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIARDFNPN 411
Query: 860 WMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSL 919
WMSAVEILDDD +LGAEN FNLF +K+S TDEER L+ VG +HLGEFVN F HGSL
Sbjct: 412 WMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVFCHGSL 471
Query: 920 VMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQ 978
VM+ L ++ +V+FGTVNG+IG++ SL Y L +Q L KVIK VG + H
Sbjct: 472 VMQNLGEASTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSF 531
Query: 979 WRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN----------VSVEELCKR 1028
WRSF+ E+KT A F+DGDLIESFLD+SR +M E+ + + ++L K
Sbjct: 532 WRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATADDLIKV 591
Query: 1029 VEELTRLH 1036
VEELTR+H
Sbjct: 592 VEELTRIH 599
>gi|339235331|ref|XP_003379220.1| DNA damage-binding protein 1 [Trichinella spiralis]
gi|316978142|gb|EFV61158.1| DNA damage-binding protein 1 [Trichinella spiralis]
Length = 1329
Score = 548 bits (1411), Expect = e-153, Method: Compositional matrix adjust.
Identities = 335/1117 (29%), Positives = 585/1117 (52%), Gaps = 97/1117 (8%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
++GRI +LF P GE + + I T + ++++D + + T A ++S+ GRP N
Sbjct: 224 MFGRIGAAKLFTPKGENKALMVIVTLKQDVAIVEYD--NGRIKTLASRNISENFGRPASN 281
Query: 66 GQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGCAKPTI 125
G + + PD +IGL + FK I ++ +++ + D FL+G P I
Sbjct: 282 GILLSVHPDGEVIGLRIMSSTFKCITWNRATSKLSTYSLNYSLTHLSDFVFLHGFQFPVI 341
Query: 126 VVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETI 185
++Y D RHV T ++L +++F GPWS+ +++ A LI VPPPLCGV+++G ++
Sbjct: 342 ALIYGD-LVGRHVITCRISLDEQEFENGPWSRGHIEWEAHTLIAVPPPLCGVIVVGCSSL 400
Query: 186 VYCSANAFKAIPIRPSITKAYGRV-DA--DGSRYLLGDHAGLLHLLVITHE-----KEKV 237
+Y N+ + P ++K+ DA DG Y LG G L LL + E K +
Sbjct: 401 LYIRDNSTISTVSPPFLSKSIVNCYDAAPDGLTYFLGQLDGTLSLLKLDIETDAEGKVTL 460
Query: 238 TGLKIELLGETSIASTISYLDN-AVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNL 296
+ ++ +LG TS ++SY+ +++++GS DS+L++LN ++ EV E + NL
Sbjct: 461 SRMRATILGVTSPPDSLSYMHKESLLFVGSRIADSKLLRLNSLAICDETWTEV-ESFPNL 519
Query: 297 GPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSS 356
PIVD +VD+ +QGQG++++CSG KDGSL+++R+GIG+ E A +++ + +W+LR
Sbjct: 520 APIVDMVLVDMAKQGQGEIISCSGYGKDGSLKVIRSGIGLYELARLDIPFVNRVWALRYI 579
Query: 357 TDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLF---CHDAIYN-QLVQV 412
T++P+D ++ + + ++ E++ E+ ++G + QT CH A +VQV
Sbjct: 580 TNEPYDNLFILGVVGNSTMVIKFQENQAEQISVDGLETAEQTFVAANCHMADGALAIVQV 639
Query: 413 TSGSVRLVSST--SRELRNEWKSPPGYSVNVATANASQVLLATGGGHLV-YLEIGDGILT 469
++L ++ R+L +EWK P G V++A + L H++ Y + D L
Sbjct: 640 VRSKIQLADASLDGRKL-DEWKFPDGSEVSLAACEGTSGRLLVACRHILHYFNVRDDRLQ 698
Query: 470 EVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEII 529
V +E E SC++ + + + VG WT I +++ +P L + +L +
Sbjct: 699 LVTTRTMENEASCVEFGCLSDEGV--GVCIVGHWTKICLQLLFVPSLEPVQTVYLEDSSV 756
Query: 530 PRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTT 589
RS L+C FE YL +L DG+L+ + ++ + L D KKVSLGT I+L+ F +KNT
Sbjct: 757 IRSALMCKFESNMYLFVSLADGNLIFYSIDENSFALIDGKKVSLGTDSISLKMFKTKNTL 816
Query: 590 HVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDI 649
+ A +++PTVIY +N KL +S+++ + +M P ++A+ + + LTIG +D+I
Sbjct: 817 SILACTEKPTVIYMNNNKLQFSSLDSSAIYYMTPLFTSAYDNGVLFTNGQCLTIGVVDEI 876
Query: 650 QKLHIRSIPLGEHPRRICHQEQSRTFAICSLK---------------NQSCAEES----- 689
+KLHIRS+ LGE PRRI Q +++ + +++ N++ S
Sbjct: 877 KKLHIRSVVLGETPRRIVWQPENKLVGVLTIRIMDLPGNSSSESSASNRALHSASAPPLR 936
Query: 690 ------------------------------EMHFVRLLDDQTFEFISTYPLDTFEYGCSI 719
E H LLD +F+ + Y ++ E SI
Sbjct: 937 RTTLTVDPNSAINEGAVNIGAEDTGVDRSIETHSFLLLDQNSFDVLHAYSMNPLEECTSI 996
Query: 720 LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVE-DGK----LQLIAEKETKGAVY 774
LSC+ +D N ++ + A + +E EP +GR+L+ E DG+ L L+ EKE G VY
Sbjct: 997 LSCTMGEDQNPFFILSAAVITADETEPLQGRLLMLRYERDGQGNSSLNLVHEKEVNGCVY 1056
Query: 775 SLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLM 834
++ +F KLL A+N + L++W D +L S C + A++++ R + I+VGD+
Sbjct: 1057 AMASFKSKLLVAMNSSVLLFEW--SDVTGLQLVSSCSL--FVTAMHLKVRDEVILVGDIQ 1112
Query: 835 KSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDE 894
+SI++L Y E + E ARDY+ NW+SA+E++D+D ++ AEN+ N+ +K+ +
Sbjct: 1113 RSIAVLRYVPSESSFVEEARDYHPNWISAIEVIDNDYFMAAENSLNITVSQKDLQQQPVS 1172
Query: 895 ERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPT-VIFGTVNGVIGVIASLPHEQ 953
E ++ G HLGE++N F+HG+L M + ++ GT G I + +
Sbjct: 1173 ESQVVKSAGRLHLGEYINVFKHGALSMYSYAGISSLVSNPIMIGTAEGSILIYCQIHDSH 1232
Query: 954 YLFLEKLQTNLRKVIK-GVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMD 1012
+ L LQ ++ VG + ++ +R + +K A F+DGDLIE L++ R
Sbjct: 1233 FRVLNDLQRCFSDIVPDNVGCIAYDSYRRYVVYEKNAPAFGFIDGDLIEQLLEMPRQEAI 1292
Query: 1013 EI------------SKTMNV-SVEELCKRVEELTRLH 1036
+ + N+ + +++ +EEL+R+H
Sbjct: 1293 RLINGWIAAGRISPERCSNITTAQQIIDLIEELSRIH 1329
>gi|223994993|ref|XP_002287180.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220976296|gb|EED94623.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 1517
Score = 541 bits (1393), Expect = e-150, Method: Compositional matrix adjust.
Identities = 446/1380 (32%), Positives = 628/1380 (45%), Gaps = 369/1380 (26%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG----R 61
I GRI +L + D LF TER + ++ +D + ++L A+ + G +
Sbjct: 84 INGRITSLAPIKYSSSNTDCLFFTTERGCYALISYDEQLAQLQFNAVTSTTSSTGGEETK 143
Query: 62 PTDN----------------------------------GQIGIIDPDCRLIGLHLYDGLF 87
D+ G I +DP R I LH+YDG
Sbjct: 144 DHDDSAIIGLHYPIQTHATGSFASYSSSVIAGGCQAECGPIMTVDPLNRCIALHVYDGWV 203
Query: 88 KVIPFD-----------------------NKGQLKEAFNIRLEELQVLDIKFLYGCAK-- 122
VIP + + G +AF+ R+EE +L + FL +
Sbjct: 204 TVIPINRGYDVGSARASSSSSRGGKPRGLSTGPFGDAFHARVEERTLLSMTFLRPKSNAG 263
Query: 123 --------------PTIVVLYQDNKDARHVKTYEVALKDKDFV----------------- 151
P +VVL+QD + +HV T+ V L K V
Sbjct: 264 GRGNSGGGVSSYYHPQLVVLHQDARGFQHVITHGVDLAKKGLVLHGSPNDSAVATMMGVK 323
Query: 152 --EG--PWS----------QNNLDNGADLLIPVPP----PLCGVLIIGEETIVY--CSAN 191
EG P S ++ +D G+ LI +PP L GVLI+G+ I Y
Sbjct: 324 SDEGKMPSSTMPPANERLKKSRIDGGSAALIAIPPSKTSALGGVLILGQRQITYHHTGEG 383
Query: 192 AFKAIPIRPSITKAYGRV-DADGSR----------YLLGDHAGLLHLLVI----THEKEK 236
+ +PI S+ +Y V D D R Y+LGD G +HLL I T E +
Sbjct: 384 ITRTLPIGSSLLYSYCLVADKDADRNINNNAPVLKYILGDSTGRIHLLTILLNVTPEGKG 443
Query: 237 ---VTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQP-------DAKG-- 284
VT L +E LG S AS + YL +++GS +GDSQL+K+ P D +G
Sbjct: 444 DGCVTTLLMETLGTASSASAVVYLGKGCLFVGSQFGDSQLLKILDAPVPLSGGRDDEGEE 503
Query: 285 -------SYVEVLERYVNLGPIVDFCV---VDLERQGQGQ---------------VVTCS 319
+Y+ +L+ Y NLGPIVDF + D G G VVTCS
Sbjct: 504 KNPLEETTYLRLLDEYTNLGPIVDFDLRPCSDATSGGHGNAKNDSKSRDQHRQSLVVTCS 563
Query: 320 GAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMN 379
G KDG++R+VRNG+G+ E A VE+ GIKGMWSLR + D D+FLV SF+ ETRIL +
Sbjct: 564 GVGKDGTVRLVRNGVGMREHAEVEMPGIKGMWSLRRTFADEDDSFLVQSFVRETRILGVQ 623
Query: 380 ----------------LEDELEETEIEGFCSQTQTLFCHDAIYNQ---LVQVTSGSVRLV 420
L E I GF S TLF + + L+QV SVRLV
Sbjct: 624 SGGEAEMEEEDDEDQEEGGALAEVTIPGFSSSKSTLFAGNLLVGSSDLLLQVVEDSVRLV 683
Query: 421 SSTSRELRNEWK------------SPPGYSVNVATANAS-QVLLATGGGHLVYLEI-GDG 466
+S + EL EW P G+ + VA+AN+S Q+++A GG LVYL + GD
Sbjct: 684 NSETLELVTEWSPFSNDNEGSDDDEPLGF-ITVASANSSGQIVVALRGGTLVYLVVEGDA 742
Query: 467 I--LTEVKHAQLEYEISCLDINP-----IGENP------SYSQIAAVGMWTDISVRIFSL 513
+ +K L+ EISC+D+NP +G N +S++ AVG+W D SVR+ L
Sbjct: 743 SPSIRRLKRVTLDREISCIDLNPFDDTAVGGNTMDVDGGHHSKLVAVGLWDDFSVRLLDL 802
Query: 514 PD----------LNLITKEHLGG----------------EIIPRSVLLC----------- 536
+NL GG ++ RS+ L
Sbjct: 803 SSSHSVLDQVLQINLGVGNDNGGGKSAEEDDDELGESSQHMMARSLCLVTLDSHSSNSSS 862
Query: 537 -------AFEG---ISYLLCALGDGHLLNFLL-----NMKTGELTDRKKVSLGTQPITLR 581
+F G + LL LGDG L++F + + K + RK+VSLGTQ I L
Sbjct: 863 LINKKSDSFTGSNKVDMLLVGLGDGKLISFAVVHSAASSKKWFVHSRKEVSLGTQGIHLI 922
Query: 582 TFSS-KNTTHVFAASDRPTVIY--------SSNKKLLYSNVNLK------EVSHMCPFNS 626
SS N T V A DRPTV+Y +SN KL YS ++L E H N
Sbjct: 923 PMSSGSNGTCVLATGDRPTVVYLTGGNGGGNSNPKLCYSTISLTVEDDDDEDGHASHRNV 982
Query: 627 AA--------------------FPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRI 666
+ SL +A E L +G IDDIQKLH+ S LG PRRI
Sbjct: 983 SVNVASSFRSSLLFSSSNSTSNHNYSLCVADETTLRLGMIDDIQKLHVTSYKLGMTPRRI 1042
Query: 667 CHQEQSRTFAICSL--------KNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCS 718
+ E R + + + NQ AE + + VR DD TFE I+ L+ FE S
Sbjct: 1043 AYHEAGRVYCVGCIDGNAKGGNNNQVGAEINMGNCVRFFDDSTFEEINQIDLEPFETILS 1102
Query: 719 ILSCSFSDDSNV---------------YYCVGTAYVLPEENEPTKGRILVFIVEDGK--- 760
++S S S Y +GTAY P+E+EPT+GRILV G+
Sbjct: 1103 LVSVSLCTSSQTLTQSNSKQDTSEYKPYILIGTAYAYPDEDEPTQGRILVVECNSGEAEP 1162
Query: 761 ---------------LQLIAEKETKGAVYSLNAF-NGKLLAAINQKIQLYKWMLRDDGTR 804
++ + + T+G VYS++ F G +LA +N K L + + D
Sbjct: 1163 HLKSDDDMEDTYSRYVRHVTQMPTRGGVYSISPFYGGTVLATVNSKTHLCRLSIGCDQIG 1222
Query: 805 ELQ-SECGHHGHILALYVQTRG----------------DFIVVGDLMKSISLLIYKHEEG 847
EL+ GHHGH+L+L+V++ +VGDLM+SISL+ Y+ +
Sbjct: 1223 ELKFVGAGHHGHMLSLFVKSLAGSESESESSGTNRQAKQLAIVGDLMRSISLVEYQPKHN 1282
Query: 848 AIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHL 907
IEE ARDYNAN+ +AVE+L + YLG+E NLF +R N+ +++E R RL+ VGEYHL
Sbjct: 1283 VIEELARDYNANFCTAVEMLTNGTYLGSEGFNNLFVLRHNANASSEEARVRLDTVGEYHL 1342
Query: 908 GEFVNRFRHGSLVMRLPDSDVGQI--------PTVIFGTVNGVIGVIASLPHEQYLFLEK 959
GE N+F GSL+M P + G + +FGTV+G IG + L + FL
Sbjct: 1343 GEMTNKFMGGSLIM--PSNSGGIMGAQNAYVGSQTLFGTVDGSIGSVLGLDGPTFAFLAC 1400
Query: 960 LQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN 1019
LQ + ++K VG ++HE++R+F E++ ++ F+DGDLIE+FLDL+R M+ I K MN
Sbjct: 1401 LQRAILSIVKTVGDISHEEYRAFRAERQVRPSRGFIDGDLIETFLDLNRPTMERIVKYMN 1460
>gi|345570887|gb|EGX53705.1| hypothetical protein AOL_s00006g33 [Arthrobotrys oligospora ATCC
24927]
Length = 1133
Score = 532 bits (1370), Expect = e-148, Method: Compositional matrix adjust.
Identities = 353/1101 (32%), Positives = 582/1101 (52%), Gaps = 92/1101 (8%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPTD 64
+YGR+ L RP D LF+ T+RY++ + WD+++ + R DV+DR RP
Sbjct: 55 VYGRVTGLLSLRPQQSTLDHLFLGTDRYEYFTVSWDSQTGTIRNERKAHDVTDRFQRPAQ 114
Query: 65 NGQIGIIDPDCRLIGLHLYDGLFKVIPFDNK---------------GQLKEAFNIRLEEL 109
G + + DP RL+GL+LY+G+F IP + G L + IR+ EL
Sbjct: 115 VGHLYLADPGGRLLGLYLYEGIFTAIPIKRQSKGRGRHAQLPEAEIGNLDDPCPIRMNEL 174
Query: 110 QVLDIKFLYGCAKPTIVVLYQDNKDARHVKTYEV-----ALKDKDFVEGPWSQNNLDNGA 164
+V+++ FLYG + P I VLY D+K H+ TYE+ A+KD +F + NNLD+GA
Sbjct: 175 KVINMVFLYGTSVPVIAVLYTDSKKLVHLITYELNVAKRAVKDPEFAQWGIKANNLDHGA 234
Query: 165 DLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIR-----PSITKAYGRVDADGSRYLLG 219
LLIPV P G+L+IGE+ + Y + +P++ P+ +G++D + RYLL
Sbjct: 235 KLLIPVDNPTGGILVIGEQVVSYFHPE--RTVPMKKPLHEPTSFVTHGKIDPE--RYLLS 290
Query: 220 DHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQ 279
D G L+LL++ E K+ ++IE LGE A I YLDN V++GS +GDS L++++
Sbjct: 291 DELGHLYLLLLIIENNKLINMRIENLGEVCQARAIVYLDNGYVFLGSHFGDSTLVRIS-- 348
Query: 280 PDAKGSYVEVLERYVNLGPIVDFCVVDLE---------RQGQGQVVTCSGAYKDGSLRIV 330
+KG +EV++ NL PI DF V+ E GQ ++TCSG + DG LR V
Sbjct: 349 --SKGPRIEVVQSLPNLAPISDFIVLGTEVGGVEIHQYSAGQTMILTCSGGFYDGGLRSV 406
Query: 331 RNGIGINEQASV-ELQGIKGMWSL-RSSTDDPFDTFLVVSFISETRILAMNLEDELEETE 388
R+G+GI + + E+ G++ MW+L R+ D+ FD L+ SF +E+R A + E+EE +
Sbjct: 407 RSGVGIRDIGLLGEMSGVQNMWALKRAILDNGFDDTLLFSFANESRAFAFGADGEVEEVD 466
Query: 389 -IEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANAS 447
E F T TL + ++LVQVT V +V + +L N W P G + +A+ + +
Sbjct: 467 TFENFFLDTTTLEAGNVGNDKLVQVTPFKVIVVEKATSKLWN-WAPPVGAKIVMASLSGA 525
Query: 448 QVLLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDIS 507
++++ G + ++ + ++ + E EISC+ I P + S VG W S
Sbjct: 526 RLVVVLNGRICLLFDLSSEPIKQIANRTFENEISCIHI-PTKQ----SDFLVVGFWMPAS 580
Query: 508 VRIFSLPDLNLITKEHLG---GEIIPRSVLLCAFEG--ISYLLCALGDGHLLNFLLNMKT 562
+ + + DL + +EHL G + PRSV++ EG S L + DG ++++ +
Sbjct: 581 LALLRIADLETMKEEHLAVFEGSV-PRSVMVANMEGDGPSSLFVGMADGEVISYTITEGP 639
Query: 563 GELTDRKKVSLGTQPITLRTFSSK---NTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVS 619
G L D+K++ LGTQ +T K +++ V A +RPT++Y + +YS + L + S
Sbjct: 640 GILDDQKRIRLGTQTVTFEALPRKTGDDSSCVIATGERPTMVYGEEGRTVYSAITLNQAS 699
Query: 620 HMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICS 679
+ FN+ AFPD++ +A + + I ID+ + H R PL + RR+ ++ + + + +
Sbjct: 700 SVVAFNAEAFPDTVVVATDESVFIAKIDEARTTHTRMSPLCQFARRVAFSKEKKAYGVAT 759
Query: 680 LKNQ---SCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVY---YC 733
++N + ES ++ ++D+ ++ I Y L E S+L S ++ +
Sbjct: 760 IRNSIDTTTGMESSSCYIHVIDENFYDKIDAYELYPNELVESLLCASLANPDGTISEKFV 819
Query: 734 VGTAYVLPEENEPTKGRILVF-IVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQ 792
VGTA + + +E GR+L + D L+LI E E GA +SL G +LA +++ I
Sbjct: 820 VGTA-IGNDSDESEHGRLLFLELGADKMLRLITELELPGACHSLAIVKGYILAGLSKSID 878
Query: 793 LYKW-MLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLL--IYKHEEG-- 847
LY++ R +Q L + + G + VGDL+K + +L + EG
Sbjct: 879 LYRFSYTRGSLGASIQQISSIRAATLPVSLSVYGKRVFVGDLVKGVMVLEVVEGGGEGND 938
Query: 848 AIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHL 907
+ E R Y +W++A+E LD+D + A+++ NL +R+ S GATDE+ R+ + E L
Sbjct: 939 KLVEVCRQYGVSWVTALEALDEDTCISADSDGNLVLLRRESTGATDEDTRRMRPLSEIRL 998
Query: 908 GEFVNRFRHGSLVMRLPDSDV-GQI--PTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNL 964
GE VN R R+ D G + P GTV+G + ++ + + + L K Q N+
Sbjct: 999 GEMVNCIR------RVNDPITQGYVVQPKAYLGTVDGGLFMLGLIHPDYFDILMKCQVNM 1052
Query: 965 RKVIKGVGGLNHEQWRSFNNEK-KTVDAKNFLDGDLIESFLDLSRTRM--------DEIS 1015
KVIKG+G L+ ++R++N + + + F+DG+L+E FLDL M D+ +
Sbjct: 1053 AKVIKGIGDLDFNRYRAYNTKGIQPEEPFRFVDGELVEKFLDLDEDAMRMVIDGANDDDN 1112
Query: 1016 KTMNVSVEELCKRVEELTRLH 1036
+ +V E+ VE L RLH
Sbjct: 1113 SQIECTVGEMKNIVETLKRLH 1133
>gi|18377609|gb|AAL66955.1| putative UV-damaged DNA binding factor [Arabidopsis thaliana]
Length = 270
Score = 527 bits (1358), Expect = e-146, Method: Compositional matrix adjust.
Identities = 250/270 (92%), Positives = 264/270 (97%)
Query: 767 KETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGD 826
KETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGD
Sbjct: 1 KETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGD 60
Query: 827 FIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRK 886
FIVVGDLMKSISLL+YKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNL TV+K
Sbjct: 61 FIVVGDLMKSISLLLYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLLTVKK 120
Query: 887 NSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVI 946
NSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDS++GQIPTVIFGTVNGVIGVI
Sbjct: 121 NSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSEIGQIPTVIFGTVNGVIGVI 180
Query: 947 ASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDL 1006
ASLP EQY FLEKLQ++LRKVIKGVGGL+HEQWRSFNNEK+T +A+NFLDGDLIESFLDL
Sbjct: 181 ASLPQEQYTFLEKLQSSLRKVIKGVGGLSHEQWRSFNNEKRTAEARNFLDGDLIESFLDL 240
Query: 1007 SRTRMDEISKTMNVSVEELCKRVEELTRLH 1036
SR +M++ISK+MNV VEELCKRVEELTRLH
Sbjct: 241 SRNKMEDISKSMNVQVEELCKRVEELTRLH 270
>gi|297815870|ref|XP_002875818.1| hypothetical protein ARALYDRAFT_323319 [Arabidopsis lyrata subsp.
lyrata]
gi|297321656|gb|EFH52077.1| hypothetical protein ARALYDRAFT_323319 [Arabidopsis lyrata subsp.
lyrata]
Length = 987
Score = 523 bits (1347), Expect = e-145, Method: Compositional matrix adjust.
Identities = 357/1050 (34%), Positives = 552/1050 (52%), Gaps = 131/1050 (12%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRP 62
+ P+YGR+ +EL + +D LFIATE K C+ WD ++S + +
Sbjct: 53 EFPMYGRVVFMELSHLNNLRKDVLFIATETEKLCMFDWDVQTSNVFS------------- 99
Query: 63 TDNGQIGIIDPDCRLIGLHLY---DGLFKVIPFDNKGQLKEAFNI-RLEELQVLDIKFLY 118
+D + L +Y D ++ F G + + I L ++D+KFL
Sbjct: 100 --------LDTIQYVPNLEMYQIEDESYEYYDFCQGGTIFDGKTIVLLSSTNIVDVKFLK 151
Query: 119 GCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVL 178
+P + +LY+ LLIP L L
Sbjct: 152 SYLRPGLALLYRTIMPT------------------------------LLIP---NLFSYL 178
Query: 179 IIGEETIVYCSANAFKAIPIRPSITKAYGRVDADG--SRYLLGDHAGLLHLLVIT-HEKE 235
IIG++ +VY S N K + + YGRV+ RY+LG H G + +L I + E
Sbjct: 179 IIGKKGVVYYSDNEIKTKLMGSLDIQTYGRVEETRHMCRYILGGHDGHISVLEIKINTAE 238
Query: 236 KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNL----QPDAKGSYVEVLE 291
+++GL + LG+ S+ S+I+YL + +V++GS Y DSQL+K++ + ++VL
Sbjct: 239 RLSGLTFKPLGKISVPSSITYLRDDIVFVGSRYADSQLVKMHQLHGPSLNTNTPALQVLT 298
Query: 292 RYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMW 351
Y N GP++DF + + + G ++V C G ++D SL V G+GI+ + E+ I+ +W
Sbjct: 299 TYTNSGPVIDFSLCN-KPIGVRKLVMCCGTHEDSSLEGVSTGVGISVKFFAEIPNIQRIW 357
Query: 352 SLRSSTDDPFDTFLVVSFISETRILAMNLED--ELEETEIEGFCSQTQTLFCHDAIYNQL 409
SL+SS+ + FLV+ I+ L ++D + GF + T+ CHDA
Sbjct: 358 SLKSSSLNDNHPFLVLQPIAVAPALVFTIDDTGAFMSHTMVGFAYDSVTICCHDAADRLF 417
Query: 410 VQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILT 469
+QVT SVRL+ S + LR+EW + + A+A QVLLAT LVYL+I +GIL
Sbjct: 418 IQVTEKSVRLIDSKTGVLRSEWLASS--KILFASATIKQVLLATLDKQLVYLDIRNGILV 475
Query: 470 EVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEII 529
+ K Q Y++SCLDI P+G N +YS++AAVG+W + S+ + LP+L ++T EH+ +
Sbjct: 476 QAK-VQKGYDVSCLDIGPVGSNTTYSKLAAVGLW-EKSILVLKLPELEVLTVEHVHAHV- 532
Query: 530 PRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTT 589
PR +LLCAFE +SYLLCAL DG LL F LN G+L ++ L P TLRTF S +
Sbjct: 533 PRCILLCAFEQVSYLLCALADGSLLTFKLNEDDGQLQEKNTKELEGYPKTLRTFISNGSV 592
Query: 590 HV-FAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDD 648
F+ S PT+I++ +KL Y+ +NL ++ MCPF A + LAI + + I T+ D
Sbjct: 593 RACFSTSKVPTLIFTMKQKLQYNRLNLTDIEDMCPFQRADISEGLAIVTKDGVVICTMGD 652
Query: 649 IQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTY 708
++ +I+ IP+ + RR+CHQE+S+T+A+ E+ H ++LLD + S +
Sbjct: 653 MET-NIK-IPMFSYGRRLCHQEESQTYAVL------VEEKDGRHSLQLLDKELNTLFS-F 703
Query: 709 PLDTFEYGCSILSCSFSDDSNVYYCVGTAY-VLPEENEPTKGRILVFIVEDGKLQLIAEK 767
L+ E+G SI+SCSF+ D YYCVGT+ VL EEN T+GRILVF+V DG+L +I +
Sbjct: 704 ELNEHEHGLSIVSCSFAGDDQAYYCVGTSVNVLDEEN--TQGRILVFLVRDGELNIIFQY 761
Query: 768 ETKGAVYSLNAFNGKLLAAIN-QKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGD 826
T GAVY L NG LL ++ I ++WML DD REL
Sbjct: 762 PTNGAVYYLKPLNGGLLVTVSVHTIHYFEWMLHDDA-RELH------------------- 801
Query: 827 FIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRK 886
SI+ + E+G E+R ++ + Y A+ +++
Sbjct: 802 ---------SINPEVVGIEDGGWEQRFAVTSSRIQKS--------YFYADGLYSI----H 840
Query: 887 NSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVI 946
+++G D G VG+++LG+ + + G+L RL D D +P+V+F TV+G +G+I
Sbjct: 841 SADGVND--HGYPNTVGQWYLGQSITHMKAGTL-SRLSDEDSAIVPSVMFSTVSGAVGMI 897
Query: 947 ASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDL 1006
SLP Y FL KL+ +R+ I+G+G N + R + ++ FLDGD IESF L
Sbjct: 898 VSLPPVIYEFLHKLEQYMREQIRGIGCPNEDFPRMCVELQSLSESTEFLDGDFIESFQIL 957
Query: 1007 SRTRMDEISKTMNVSVEELCKRVEELTRLH 1036
+E+++ M V+VE++ K VE L LH
Sbjct: 958 DIDDQEEVAEEMEVTVEDIRKIVEYLKLLH 987
>gi|358338734|dbj|GAA31211.2| DNA damage-binding protein 1, partial [Clonorchis sinensis]
Length = 1515
Score = 511 bits (1315), Expect = e-141, Method: Compositional matrix adjust.
Identities = 337/1035 (32%), Positives = 539/1035 (52%), Gaps = 151/1035 (14%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
DVPI +I LFR D LFI T + F ++ + + +T G V D
Sbjct: 33 DVPINAKIVAACLFRRKERQTDSLFILTHKAGFALIDCARKGDDVQFVTVVSGSVEDCGA 92
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQ------------LKEAFNIRLEE 108
R D G ++DP + + LY GL ++IP + G+ +N+R+EE
Sbjct: 93 RLIDQGFDVLVDPGANCVVVRLYHGLLRIIPLNGIGEKLTTDSLEVNQYAANTYNVRIEE 152
Query: 109 LQVLDIKFLYGCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLI 168
++D+ FL+G PT ++Y+D H+KTYE++ ++ + ++++ + LLI
Sbjct: 153 GNIVDMAFLHGYTLPTFAMIYEDEL-VLHMKTYEISGREPALRNVQLTLDSIEPDSKLLI 211
Query: 169 PVPPPLCGVLIIGEETIVYCSANA---FKAIP-IRPSITKAYGRVDADGSRYLLGDHAG- 223
PVP P GV+++G+ I Y + + + IP + S Y VDA RYLLGD AG
Sbjct: 212 PVPKPFGGVILVGDNIIYYHTKDGPHISQYIPQAKASQVLCYAAVDAQ--RYLLGDMAGR 269
Query: 224 --LLHLLVITHEKE----------------KVTGLKIELLGETSIASTISYLDNAVVYIG 265
++HLL H ++ ++IELLGET+ +I+Y+DN VV+IG
Sbjct: 270 LYMVHLLAEDHTPSGNGLLGSTSSAAVPSARIGSIRIELLGETATPESIAYVDNGVVFIG 329
Query: 266 SSYGDSQLIKLNLQPDA-KGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKD 324
+ GDSQLI+LN PD + SY+ VLE Y N+GPIVD +V LE +GQ Q++TCSGAYK+
Sbjct: 330 CTLGDSQLIRLNPDPDPERNSYITVLENYTNIGPIVD--MVLLESKGQNQLITCSGAYKE 387
Query: 325 GSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDEL 384
G+LRI+RNGIGI+E A+++ IKG W +D +D +VVS + +T++L + +D++
Sbjct: 388 GTLRIIRNGIGIHEHATIDQDLIKGAWCFPLESDR-YDDSIVVSMVGQTQLLRLT-DDDI 445
Query: 385 EETEIEGFCSQTQTLFCH-----DAI----------------YNQLVQVTSGSVRLVSS- 422
+EGF + QT++C DA ++ L+Q T+ +RL+
Sbjct: 446 TALHLEGFKTDEQTVYCATLSPMDACTSDNPESSSRSYSVFQHSLLMQATTSGIRLIGIH 505
Query: 423 --TSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDG---ILTEVKHAQLE 477
EW+SP G ++ +++ + +++A+G V +G + H +
Sbjct: 506 HLNGSGCLAEWRSPSGRGISCLSSHGALIVVASGPELYVLRVVGPANQPSFEQTAHRTMS 565
Query: 478 YEISCLDINP---------------IGENPSYS--QIAAVGMWTDISVRIFSLPDLNLIT 520
E++C+D+ P I E Y+ Q+ AVG+W + + LP+L L+
Sbjct: 566 NEVACIDLTPFDHKRAAYAASQTSTIDEPVDYTVPQLVAVGLWLGYGLALLRLPNLELVH 625
Query: 521 KEHL-------GGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGE--LTDRKKV 571
+E L G ++PRSVLL E ++YL A+GDG L + + L D K+V
Sbjct: 626 EEPLPETTASTGTALLPRSVLLAQLEDMAYLFAAMGDGTLYFYTVCPSADGIVLRDAKRV 685
Query: 572 SLGTQP-ITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFP 630
+ GT P + L+ + S+ +VF S+ P VIYS KL+++N+N+KEV+ M P N A +
Sbjct: 686 NAGTGPSMFLKQWRSQCKVNVFVCSNHPCVIYSIKNKLIFANLNMKEVNFMAPLNGAFYR 745
Query: 631 DSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESE 690
D +A+ L IG++D+IQKLH+R++PL E P+R+ Q+++ + + + + + E S
Sbjct: 746 DCIALVTPTALVIGSVDEIQKLHVRTVPLEETPKRLALQDETGSLGVITYRQEVFQEGSG 805
Query: 691 MHFVR----------------------------------------LLDDQTFEFI---ST 707
VR + + T E + S
Sbjct: 806 FKPVRSSISLSQKVPKSTSRLPKTAPSSVSATERKFREVEVSSLLIFNKSTMELMFAHSF 865
Query: 708 YPLDTF-EYGCSILSCSFSDDS-NVYYCVGTAYVLPEENEPTKGRILVFI--VEDGKLQL 763
Y T E SI S +D S ++ Y VGTA+++ EE EP+KGRI +F E +L+
Sbjct: 866 YFSQTLVEVAVSIASIEPTDGSKSMLYAVGTAFLVEEEVEPSKGRIHLFHWDPETARLET 925
Query: 764 IAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQT 823
+ + GAVY L FNG+LLAAIN ++L+ +++D L+ C + +I+AL+++
Sbjct: 926 VLVHDVNGAVYRLLDFNGRLLAAINSSVRLFD--IKEDS---LRLACSFNENIIALFLRR 980
Query: 824 RGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFT 883
+GDF++VGDLM+S++LL+Y+ E R N W + +EILDD+ +L AE +LF
Sbjct: 981 KGDFVLVGDLMRSLTLLLYRPNVNNFEAIGRHRNPRWTTCIEILDDEHFLAAEVENSLFV 1040
Query: 884 VRKNSEGATDEERGR 898
V ++S T E R
Sbjct: 1041 VSRDSPENTQEPTVR 1055
Score = 89.0 bits (219), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 48/130 (36%), Positives = 76/130 (58%), Gaps = 2/130 (1%)
Query: 895 ERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQI--PTVIFGTVNGVIGVIASLPHE 952
E RL H G+F+N F G+LVM+ + I P+ ++GTV+G IG+I +
Sbjct: 1149 EVQRLADCAYIHTGDFINVFVRGNLVMQNNEERWQAIGHPSHLYGTVSGSIGLIIQVSPV 1208
Query: 953 QYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMD 1012
+ FL+++++ L K++ VGG H+ WR+F E+ A NF+DGDLIE+ LDLS
Sbjct: 1209 LFAFLKEIESRLAKLVNPVGGFAHDMWRAFKAERIVRMAHNFVDGDLIETLLDLSSEDRA 1268
Query: 1013 EISKTMNVSV 1022
++ K + + V
Sbjct: 1269 KLVKGLRIPV 1278
>gi|353232348|emb|CCD79703.1| putative dna repair protein xp-E [Schistosoma mansoni]
Length = 1329
Score = 504 bits (1297), Expect = e-139, Method: Compositional matrix adjust.
Identities = 331/1028 (32%), Positives = 531/1028 (51%), Gaps = 144/1028 (14%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
DVPI +I LFR D LF+ T + +++ + +S E +T A G V DR
Sbjct: 54 DVPINAKIVAASLFRRKDRETDSLFLLTHKAGVAIIECVRNNDSVEFVTVASGSVEDRSA 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQ-LKEAF--NIRLEELQVLDIKFL 117
R D G +IDP I + LY GL K+I G+ + F ++EE ++D+ F+
Sbjct: 114 RIIDQGFDVLIDPGANYIVVRLYHGLLKIILLQCIGEKIGTDFLDTNQIEEGNIVDMAFI 173
Query: 118 YGCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGV 177
YG + PT ++Y+D H+KTYE+ ++ + ++++ + LLIPVP P GV
Sbjct: 174 YGYSLPTFAMIYEDEL-VLHMKTYEIYGREPVLRNVQLTLDSIEPDSKLLIPVPKPYGGV 232
Query: 178 LIIGEETIVYCSANA---FKAIP-IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHE 233
+++G+ I Y + + + IP + S Y VDA RYLLGD AG L+++ + E
Sbjct: 233 ILVGDNIICYHTKDGPHISQYIPQAKASQVLCYAAVDA--QRYLLGDMAGRLYMVHLLSE 290
Query: 234 -----------------KEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKL 276
++ ++IELLGET+ +I+YLDN VV+IGS+ GDSQLI+L
Sbjct: 291 DISAAANNGTSNSDSLSAVRIGSIRIELLGETATPESIAYLDNGVVFIGSTLGDSQLIRL 350
Query: 277 NLQPDA-KGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIG 335
N PD + SY+ +LE Y N+GPIVD +V LE +GQ Q++TCSGAYK+GSLR++RNGIG
Sbjct: 351 NPDPDPERNSYITILETYTNIGPIVD--MVLLETKGQNQLITCSGAYKEGSLRVIRNGIG 408
Query: 336 INEQASVELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQ 395
I+E A+++ IKG W +D FD +VVS + +T++L + +D++ +EGF +
Sbjct: 409 IHEHATIDQDLIKGAWCFPIESDR-FDDTIVVSMVGQTQLLHL-ADDDITALHLEGFKTD 466
Query: 396 TQTLFC--------------------HDAIYNQLVQVTSGSVRLV---SSTSRELRNEWK 432
QT++C ++ + L+Q T+ +RL+ S S+ EWK
Sbjct: 467 EQTVYCATLSPANDSPRPESMMIDENNNTLDPLLLQATTSGLRLIGIQSLCSKGCLTEWK 526
Query: 433 SPPGYSVNVATANASQVLLATGGGHLVYLEIGDG---ILTEVKHAQLEYEISCLDINPIG 489
P G ++ ++ +++A+G V +G+ +V H Q+ +E++C+D+ P
Sbjct: 527 PPTGRGISCLSSFRHTIVVASGTELYVLKVVGESNNPKFEQVSHRQMSHEVACIDLTPFN 586
Query: 490 ENPSYSQI-----------------------AAVGMWTDISVRIFSLPDLNLITKEHL-- 524
+ + + I AVG+W + + LP+L L+ +E L
Sbjct: 587 RDRAIAAINAVSSNHLDSTSSSTPDKSVPYLVAVGLWLGHGLALLKLPNLELVHEEPLPE 646
Query: 525 -----GGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGE--LTDRKKVSLGTQP 577
G ++PRSVL+ E I+YL A+GDG L + ++ L D K+VS GT P
Sbjct: 647 TTASTGTALLPRSVLIAQLEDIAYLFAAMGDGTLYFYTIDPSEDHVCLRDPKRVSAGTGP 706
Query: 578 -ITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIA 636
+ LR + S+ +VF S+ P VIYS KL+++N+NLKEV+ M P N + D +A+
Sbjct: 707 SMFLRQWRSQRKVNVFVCSNHPCVIYSIKNKLIFANLNLKEVNFMTPLNGLFYSDCIALV 766
Query: 637 KEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVR- 695
L IG++D+IQKLH+R++PL E P+R+ Q ++ + + + + + E VR
Sbjct: 767 TPTALIIGSVDEIQKLHVRTLPLEETPKRLALQSETSSLGVITYRQEMFQEGMGFKPVRS 826
Query: 696 -----------------------LLDDQTFEFISTYPLDTF------------------- 713
++ F I L F
Sbjct: 827 SISLSQKIPKSASRLPKTAPSSVSATERKFREIEVSSLLIFNQSNLEIQFAHNFYFSQTL 886
Query: 714 -EYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFI--VEDGKLQLIAEKETK 770
E SI S + + VGTA+++ +E EP+KGRI +F E +L + +
Sbjct: 887 VEVAVSIASVQSDIHNGPLFAVGTAFLVEDEVEPSKGRIHLFRWDPESSRLDTVLVHDVN 946
Query: 771 GAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVV 830
G+VY + FNG+LLAAIN ++L+ +++D L+ C + +I+ L+++ +GDF++V
Sbjct: 947 GSVYRIVDFNGRLLAAINSSVRLFD--IKEDS---LRLACSFNENIIVLFLRRKGDFVLV 1001
Query: 831 GDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEG 890
GDLM+S++LL++K E R + W + +EILDD+ +L AE NLF V ++
Sbjct: 1002 GDLMRSLTLLLFKSNVNNFEAIGRHRHPRWTTCIEILDDEHFLAAEVENNLFVVSRDLPE 1061
Query: 891 ATDEERGR 898
T E R
Sbjct: 1062 NTKEPSFR 1069
Score = 91.3 bits (225), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 58/180 (32%), Positives = 95/180 (52%), Gaps = 32/180 (17%)
Query: 885 RKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQI--PTVIFGTVNGV 942
+ NS G + E + RL +H G+FVN F G+LV++ + + P ++GTVNG
Sbjct: 1154 KPNSSGVSGEMQ-RLVDCAYFHTGDFVNVFVRGNLVLQNAEERWTAVGYPGHLYGTVNGG 1212
Query: 943 IGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIES 1002
+G++ + + FL++++ L ++ VGG +H+ WR+F +++ A NF+DGDLIE+
Sbjct: 1213 LGLLVQVSPVLFAFLKEVEFRLSNLVVPVGGFSHDTWRAFKADREIKMAHNFVDGDLIET 1272
Query: 1003 FLDLSRTRMDEISK--------------------------TMNVSVEELCKRVEELTRLH 1036
DLS MD+ +K T +VE+L K VEE++RLH
Sbjct: 1273 VTDLS---MDDKAKLVKGLRIPVNMNEFGTAGSTCTTDPETRECTVEDLVKVVEEMSRLH 1329
>gi|169611218|ref|XP_001799027.1| hypothetical protein SNOG_08717 [Phaeosphaeria nodorum SN15]
gi|160702249|gb|EAT83885.2| hypothetical protein SNOG_08717 [Phaeosphaeria nodorum SN15]
Length = 1140
Score = 504 bits (1297), Expect = e-139, Method: Compositional matrix adjust.
Identities = 336/1081 (31%), Positives = 567/1081 (52%), Gaps = 69/1081 (6%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPTD 64
IYG++ L+ RP D LF+ T+R+ + L W+AE +L T ++ V+D R +
Sbjct: 79 IYGKVTMLQKLRPALSPTDHLFVGTDRFMYFTLSWNAEKKQLQTEKSFDSVADNAARESQ 138
Query: 65 NGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQ----------LKEAFNIRLEELQVLDI 114
G+ +DP R + + +Y+G+ VIP +G+ L E +RL E+ V
Sbjct: 139 TGERCHVDPSGRFMTVEVYEGIITVIPLVQRGKKRKQEPDIAHLGEPQPVRLPEMFVRSS 198
Query: 115 KFLYGCA---KPTIVVLYQDNKDARHVKTYEVALKDKDFVE---GPWSQNNLDNGADLLI 168
FL A KP I +L++D ++ E+ D VE G + L+ GA LI
Sbjct: 199 AFLRPKAFDDKPKIALLFEDTHSQVKLRLREILFAGGDSVELEDGEACRAELELGASHLI 258
Query: 169 PVPPPLCGVLIIGEETIVYC--SANAFKAIPI-RPSITKAYGRVDADGSRYLLGDHAGLL 225
P+ P G+++IGE +I Y +N + P+ +I A+ R+DA R++L D G L
Sbjct: 259 PIEAPSHGLIVIGETSIAYYDEESNDLQTEPLDEATIFVAWERIDA--QRFVLADDYGRL 316
Query: 226 HL-LVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKG 284
+L ++I EK+KV K++++G+TS AS + YLD V++GS GDSQ+I++ Q
Sbjct: 317 YLFMLILDEKKKVQSWKLDIIGQTSRASVLVYLDAGYVFVGSHQGDSQVIRITEQS---- 372
Query: 285 SYVEVLERYVNLGPIVDFCVVDL-ERQGQGQ----------VVTCSGAYKDGSLRIVRNG 333
+E+++ + N+ P++DF ++D+ R G+GQ +VT SGAY+DGSLR VR+G
Sbjct: 373 --MEIVQTFANIAPVLDFTIMDMGNRSGEGQTNEYSSGQARIVTGSGAYQDGSLRSVRSG 430
Query: 334 IGINEQASV-ELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEET-EIEG 391
+G+ + + E++ I ++ L+S+ + L+V+F++ETRI + + E+EE E
Sbjct: 431 VGLEDLGVLGEMEHISDLFGLKSNASAEYADTLLVTFVNETRIFRFDPQGEVEEVDEFAS 490
Query: 392 FCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLL 451
TL + ++VQVT G VR+ + +EW P G ++ A+ N S VLL
Sbjct: 491 VALDETTLAAANISQGRVVQVTGGRVRVSDLDGGMITSEWVPPSGETITAASVNDSHVLL 550
Query: 452 ATGGGHLVYLEI-GDGILTEVKHAQLEYEISCLDINPIGENPSYSQ-IAAVGMWTDISVR 509
+ GG ++ L + G +LTE K +E +++C+ + PS S + +G W + +
Sbjct: 551 SLGGVSVITLNMDGLKVLTEKKFG-VESQVACIAL------PSTSSSMCFIGFWKNSQLA 603
Query: 510 IFSLPDLNLITKEHLGGEIIPRSVLLCAF--EGISYLLCALGDGHLLNFLLNMKTGELTD 567
I SL L + + + +PRS+LL + L A+ DG+++ + + GEL+
Sbjct: 604 ICSLDTLEAVKTVQVSEDSVPRSLLLTQIFPDQPPTLFAAMADGNVVTYSFDTSNGELSG 663
Query: 568 RKKVSLGTQPITLRTFSSKNTT-HVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNS 626
RK + LGT+ T R N +VFA + P++IY+S +L+YS V + + +CPF+S
Sbjct: 664 RKSIVLGTREATFRALPRGNGLFNVFATCEHPSLIYASEGRLVYSAVTAENATTVCPFDS 723
Query: 627 AAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCA 686
A+P S+AIA +L I +D + H++++ + E RRI + + F + ++K A
Sbjct: 724 EAYPGSVAIATSDDLRIALVDTERTTHVQTLKVDETVRRIAYSPGLKAFGLGTVKRILKA 783
Query: 687 EESEM--HFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSN---VYYCVGTAYVLP 741
E M HF +L+D+ F+ + TY L+ E ++ C +D S + +GTAY+
Sbjct: 784 GEEIMLSHF-KLVDEIQFKELDTYALNEEELVECVMRCDLADGSGGTAERFVIGTAYLDD 842
Query: 742 EENEPTKGRILVF-IVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRD 800
+ + +GRIL+ + + L+L+ E KG L GK++AA+ + I +Y R
Sbjct: 843 QNSTVERGRILILEVTPERVLKLVTEIAVKGGCRCLAMCEGKIVAALIKTIVVYDIEYRT 902
Query: 801 DGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEER----ARDY 856
+L + + G I + DLMKS+ ++ Y+ E + ++ AR +
Sbjct: 903 QSKPDLVKAATFRCSTAPIDITVNGTQIAIADLMKSMVVVEYQRGETGLPDKLVEVARHF 962
Query: 857 NANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRH 916
W +AV +D++ YL ++ NL + ++ +G TD+++ RL V E LGE VNR R
Sbjct: 963 QVTWATAVAEVDENTYLESDAEGNLLVLYRDPKGVTDDDKRRLNVSSEMLLGEMVNRIRR 1022
Query: 917 GSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNH 976
V PD+ V +P GTV G I + A + L LQ+NL ++ G ++
Sbjct: 1023 ID-VATAPDAVV--VPRAFMGTVEGSIYLFALISQNYLDLLITLQSNLGNLVVSPGNMDF 1079
Query: 977 EQWRSFNNEKKTVDAKN-FLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRL 1035
++R+F N+ +T + N F+DG+LIE FLD + + + V +E++ VE L RL
Sbjct: 1080 AKFRAFKNQVRTEEEPNRFVDGELIERFLDCEEDVQRKAIEGLGVELEDIRSLVEGLRRL 1139
Query: 1036 H 1036
H
Sbjct: 1140 H 1140
>gi|256088964|ref|XP_002580590.1| DNA repair protein xp-E [Schistosoma mansoni]
Length = 1329
Score = 503 bits (1296), Expect = e-139, Method: Compositional matrix adjust.
Identities = 331/1028 (32%), Positives = 531/1028 (51%), Gaps = 144/1028 (14%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
DVPI +I LFR D LF+ T + +++ + +S E +T A G V DR
Sbjct: 54 DVPINAKIVAASLFRRKDRETDSLFLLTHKAGVAIIECVRNNDSVEFVTVASGSVEDRSA 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQ-LKEAF--NIRLEELQVLDIKFL 117
R D G +IDP I + LY GL K+I G+ + F ++EE ++D+ F+
Sbjct: 114 RIIDQGFDVLIDPGANYIVVRLYHGLLKIILLQCIGEKIGTDFLDTNQIEEGNIVDMAFI 173
Query: 118 YGCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGV 177
YG + PT ++Y+D H+KTYE+ ++ + ++++ + LLIPVP P GV
Sbjct: 174 YGYSLPTFAMIYEDEL-VLHMKTYEIYGREPVLRNVQLTLDSIEPDSKLLIPVPKPYGGV 232
Query: 178 LIIGEETIVYCSANA---FKAIP-IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHE 233
+++G+ I Y + + + IP + S Y VDA RYLLGD AG L+++ + E
Sbjct: 233 ILVGDNIICYHTKDGPHISQYIPQAKASQVLCYAAVDA--QRYLLGDMAGRLYMVHLLSE 290
Query: 234 -----------------KEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKL 276
++ ++IELLGET+ +I+YLDN VV+IGS+ GDSQLI+L
Sbjct: 291 DISAAANNGTSNSDSLSAVRIGSIRIELLGETATPESIAYLDNGVVFIGSTLGDSQLIRL 350
Query: 277 NLQPDA-KGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIG 335
N PD + SY+ +LE Y N+GPIVD +V LE +GQ Q++TCSGAYK+GSLR++RNGIG
Sbjct: 351 NPDPDPERNSYITILETYTNIGPIVD--MVLLETKGQNQLITCSGAYKEGSLRVIRNGIG 408
Query: 336 INEQASVELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQ 395
I+E A+++ IKG W +D FD +VVS + +T++L + +D++ +EGF +
Sbjct: 409 IHEHATIDQDLIKGAWCFPIESDR-FDDTIVVSMVGQTQLLHL-ADDDITALHLEGFKTD 466
Query: 396 TQTLFC--------------------HDAIYNQLVQVTSGSVRLV---SSTSRELRNEWK 432
QT++C ++ + L+Q T+ +RL+ S S+ EWK
Sbjct: 467 EQTVYCATLSPANDSPRPESMMIDENNNTLDPLLLQATTSGLRLIGIQSLCSKGCLTEWK 526
Query: 433 SPPGYSVNVATANASQVLLATGGGHLVYLEIGDG---ILTEVKHAQLEYEISCLDINPIG 489
P G ++ ++ +++A+G V +G+ +V H Q+ +E++C+D+ P
Sbjct: 527 PPTGRGISCLSSFRHTIVVASGTELYVLKVVGESNNPKFEQVSHRQMSHEVACIDLTPFN 586
Query: 490 ENPSYSQI-----------------------AAVGMWTDISVRIFSLPDLNLITKEHL-- 524
+ + + I AVG+W + + LP+L L+ +E L
Sbjct: 587 RDRAIAAINAVSSNHLDSTSSSTPDKSVPYLVAVGLWLGHGLALLKLPNLELVHEEPLPE 646
Query: 525 -----GGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGE--LTDRKKVSLGTQP 577
G ++PRSVL+ E I+YL A+GDG L + ++ L D K+VS GT P
Sbjct: 647 TTASTGTALLPRSVLIAQLEDIAYLFAAMGDGTLYFYTIDPSEDHVCLRDPKRVSAGTGP 706
Query: 578 -ITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIA 636
+ LR + S+ +VF S+ P VIYS KL+++N+NLKEV+ M P N + D +A+
Sbjct: 707 SMFLRQWRSQRKVNVFVCSNHPCVIYSIKNKLIFANLNLKEVNFMTPLNGLFYSDCIALV 766
Query: 637 KEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVR- 695
L IG++D+IQKLH+R++PL E P+R+ Q ++ + + + + + E VR
Sbjct: 767 TPTALIIGSVDEIQKLHVRTLPLEETPKRLALQSETSSLGVITYRQEMFQEGMGFKPVRS 826
Query: 696 -----------------------LLDDQTFEFISTYPLDTF------------------- 713
++ F I L F
Sbjct: 827 SISLSQKIPKSASRLPKTAPSSVSATERKFREIEVSSLLIFNQSNLEIQFAHNFYFSQTL 886
Query: 714 -EYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFI--VEDGKLQLIAEKETK 770
E SI S + + VGTA+++ +E EP+KGRI +F E +L + +
Sbjct: 887 VEVAVSIASVQSDIHNGPLFAVGTAFLVEDEVEPSKGRIHLFRWDPESSRLDTVLVHDVN 946
Query: 771 GAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVV 830
G+VY + FNG+LLAAIN ++L+ +++D L+ C + +I+ L+++ +GDF++V
Sbjct: 947 GSVYRIVDFNGRLLAAINSSVRLFD--IKEDS---LRLACSFNENIIVLFLRRKGDFVLV 1001
Query: 831 GDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEG 890
GDLM+S++LL++K E R + W + +EILDD+ +L AE NLF V ++
Sbjct: 1002 GDLMRSLTLLLFKSNVNNFEAIGRHRHPRWTTCIEILDDEHFLAAEVENNLFVVSRDLPE 1061
Query: 891 ATDEERGR 898
T E R
Sbjct: 1062 NTKEPSFR 1069
Score = 91.3 bits (225), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 58/180 (32%), Positives = 95/180 (52%), Gaps = 32/180 (17%)
Query: 885 RKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQI--PTVIFGTVNGV 942
+ NS G + E + RL +H G+FVN F G+LV++ + + P ++GTVNG
Sbjct: 1154 KPNSSGVSGEMQ-RLVDCAYFHTGDFVNVFVRGNLVLQNAEERWTAVGYPGHLYGTVNGG 1212
Query: 943 IGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIES 1002
+G++ + + FL++++ L ++ VGG +H+ WR+F +++ A NF+DGDLIE+
Sbjct: 1213 LGLLVQVSPVLFAFLKEVEFRLSNLVVPVGGFSHDTWRAFKADREIKMAHNFVDGDLIET 1272
Query: 1003 FLDLSRTRMDEISK--------------------------TMNVSVEELCKRVEELTRLH 1036
DLS MD+ +K T +VE+L K VEE++RLH
Sbjct: 1273 VTDLS---MDDKAKLVKGLRIPVNMNEFGTAGSTCTTDPETRECTVEDLVKVVEEMSRLH 1329
>gi|407923753|gb|EKG16818.1| Cleavage/polyadenylation specificity factor A subunit [Macrophomina
phaseolina MS6]
Length = 1129
Score = 501 bits (1290), Expect = e-139, Method: Compositional matrix adjust.
Identities = 346/1095 (31%), Positives = 574/1095 (52%), Gaps = 84/1095 (7%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPTD 64
+YG++ L RP E + LF+ T+RY + + W E +L T + D++D+ R
Sbjct: 55 VYGKVTMLNRLRPMNEPTEHLFVGTDRYYYFTMAWSPEEKQLKTVKTYQDLADKAARDAQ 114
Query: 65 NGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLK---------EAFNIRLEELQVLDIK 115
G ++DP R + L LY+G+ V+P KG+ K E R+EE+ V
Sbjct: 115 TGDRSLLDPTGRFMTLELYEGIVTVVPLTEKGKRKGDPEVSALGEPVPSRIEEMFVRSSA 174
Query: 116 FLYGCA----KPTIVVLYQDNKDAR-----------HVKTYEVALKDKDFVEGPWSQNNL 160
FL+ + KP + +LY++++D++ T E ++ + VEG + L
Sbjct: 175 FLHRKSPESEKPLVALLYEEDEDSKIRLRLRQLAFQTAGTEEQSVAALEPVEG--LKEEL 232
Query: 161 DNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIRP----SITKAYGRVDADGSRY 216
D GA LIPVP P GVL++GE I Y + + KA+ +P +I A+ ++D R+
Sbjct: 233 DLGASHLIPVPGPCYGVLVLGETCITYFN-DYTKALVKKPLQDSTIFVAWEQID--NQRF 289
Query: 217 LLGDHAGLLHLLVITHEKEK--VTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLI 274
LL D G L+L ++ + V G +++ +GETS AS + YLD V++GS GDSQ+I
Sbjct: 290 LLADDFGGLYLFMLLLDDNSGVVEGWRLDKIGETSRASVLVYLDAGHVFVGSHEGDSQVI 349
Query: 275 KLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDL-ERQGQGQ----------VVTCSGAYK 323
++ +GS +EV++ + N+ PI+DF ++D+ R G+GQ +VT SGA+K
Sbjct: 350 RIT-----EGS-IEVVQTFHNIAPILDFTIMDMGNRSGEGQSNEYSSGQARIVTGSGAFK 403
Query: 324 DGSLRIVRNGIGINEQASV-ELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLED 382
DGSLR VR+G+G+ +Q ++ +L I +++ S+ +D L+VS ++E+RI ++ +
Sbjct: 404 DGSLRSVRSGVGLEDQGAIGDLGSISNIFAFSSTASGEYDDTLLVSLVNESRIFCVDADG 463
Query: 383 ELEETE-IEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNV 441
EL+E E +G TL + +L+QVT +VR++ + + +EW +P G +
Sbjct: 464 ELDERESFKGLQLDESTLMTKNVPNYRLLQVTQFAVRIMDLENDMVVSEWHAPEGQVITD 523
Query: 442 ATANASQVLLATGGGHLVYLEIGDGI-LTEVKHAQLEYEISCLDINPIGENPSYSQIAAV 500
+N ++L+ GG + L I D I K + +ISC+ + +N I V
Sbjct: 524 VDSNDRFIILSIGGVRAIILSIADTIHAVAEKDFGADSQISCVAMPQAADN-----ICFV 578
Query: 501 GMWTDISVRIFSLPDLNLITKEHLG--GEIIPRSVLLCAF--EGISYLLCALGDGHLLNF 556
G W + +V I L L I L G +PRS+LL L A DG+++ F
Sbjct: 579 GFWQNCTVAILGLDGLQTIQTISLSEDGSSVPRSLLLTQIFANQPPTLFIATADGYVVTF 638
Query: 557 LLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTH-VFAASDRPTVIYSSNKKLLYSNVNL 615
++ T L+ RK LGTQ R N + VFA + P++IY S +L++S V
Sbjct: 639 SMDPSTFALSGRKSTVLGTQQANFRALPRGNGLYNVFATCEHPSLIYGSEGRLVFSAVTA 698
Query: 616 KEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTF 675
++ + +CPFN+ A+P S+AIA GEL + +D+ ++ H++++ + E RRI + Q + F
Sbjct: 699 EKATCVCPFNAEAYPRSIAIAASGELHLAVVDEERRTHVQTLHVNETVRRIAYSPQLKAF 758
Query: 676 AICSLKN--QSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSD---DSNV 730
+ ++K + E + HF RL D+ F+ + + ++ +E + D ++
Sbjct: 759 GLGTIKRVLRDREEVVQGHF-RLADEVIFKELDNFEMNEYEIVECAIRAELDDGDGETAE 817
Query: 731 YYCVGTAYVLPEENE-PTKGRILVF-IVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAIN 788
+ VGT++++ EE + T+GRILVF + ED +L++IAE TKGA L + K++A +
Sbjct: 818 RFIVGTSHLVEEEEQGSTRGRILVFEVTEDRRLKVIAEISTKGACRCLAMVDNKIVAGLI 877
Query: 789 QKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEG- 847
+ + +Y + T L + + + G+ I V DL+KS+S+L YK G
Sbjct: 878 KTVVIYSFEYSTPSTPFLVKKASFRTSTAPIDITVTGNQIAVADLIKSVSVLEYKPGAGD 937
Query: 848 ---AIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGE 904
++E AR +W A+ +D++ YL A+ NL + ++ G T+E+R RL + G+
Sbjct: 938 QSDELKEVARHVQVSWSMALAEVDENTYLQADAEGNLILLERDVSGVTEEDRKRLMLRGD 997
Query: 905 YHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNL 964
LGE VNR R + SD IP F TV G I + A + + L +LQ+ L
Sbjct: 998 MLLGEQVNRIRRIDMATV---SDAPVIPRAFFATVEGSIYLFALIAPAKVDLLIRLQSQL 1054
Query: 965 RKVIKGVGGLNHEQWRSFNNEKKTVDAKN-FLDGDLIESFLDLSRTRMDEISKTM--NVS 1021
++ G ++R+F N+ + D N F+DGDLIE FLDL +E+ K + ++
Sbjct: 1055 ADFVRSPGHYPFLRYRAFRNQVREEDEPNRFVDGDLIERFLDLKPREQEEVVKGVYADLD 1114
Query: 1022 VEELCKRVEELTRLH 1036
VEE+ VE L RLH
Sbjct: 1115 VEEVRSLVETLRRLH 1129
>gi|449019082|dbj|BAM82484.1| UV-damaged DNA binding protein [Cyanidioschyzon merolae strain 10D]
Length = 1372
Score = 498 bits (1281), Expect = e-137, Method: Compositional matrix adjust.
Identities = 380/1216 (31%), Positives = 577/1216 (47%), Gaps = 228/1216 (18%)
Query: 21 EAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDNGQIGIIDPDCRLIGL 80
++ D +FI TER F V+++D + T G ++ ++GR G+ +DP R + +
Sbjct: 185 QSTDAVFILTERLDFAVIKYDTRLQGVATLVAGSLASKLGRRCPLGEFVALDPSARCLAV 244
Query: 81 HLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGCAKPTIVVLYQDNKDARHVKT 140
+ YDGL KV+P G EAF +RL+ +QV ++FL G PT+ +++ D RH+ +
Sbjct: 245 YAYDGLLKVVPGSLMG---EAFEVRLDVVQVQSMEFLIGTEHPTLAIIHTDYLHNRHLVS 301
Query: 141 YEVALKDKDFVEGPWSQNNLDNGADLLIPVPPP--------LCGVLIIGEET-------- 184
YE+ L +D EGP +Q LD A+L+IPVPP CG ++ +T
Sbjct: 302 YEILLGRQDVREGPIAQQYLDASAELIIPVPPAPNDSANDGACGGFVVACDTSLALHPGR 361
Query: 185 ------IVYCSANAFKAI----PIR------PSITKAYGRVD----------ADG--SRY 216
+++ F + PI S + R D A G + Y
Sbjct: 362 SGVAKQVIHFHGGQFAPMRCFCPISGIGGSSKSASVEANRQDHFETGLEMLAAAGYHAHY 421
Query: 217 LLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKL 276
LLGD G L++L + E G+++E + E+S+AS I+YL + +V++GS+ GDSQL++L
Sbjct: 422 LLGDRDGFLYVLSVGAE----VGMRLECVCESSVASAIAYLGSDLVFVGSALGDSQLVRL 477
Query: 277 -------------NLQPDAKGS-----------------YVEVLERYVNLGPIVDFCVVD 306
L A S + +V++ Y N+GPI D V
Sbjct: 478 LPSQTESPELVQTTLFATASASSHERRESRAERAECQLWFGQVVQCYTNIGPIQDLLVTG 537
Query: 307 LERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDPFDTFLV 366
+ G ++TCSG + GSLRI+RNGIG E A+VEL GIK +++L S T +D FLV
Sbjct: 538 DDSFSDGHIITCSGVSRMGSLRIIRNGIGFVEHAAVELDGIKALFTLPSLTSPEWDEFLV 597
Query: 367 VSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQL-VQVTSGSVRLVSSTSR 425
VSF +ETR+L + DEL E+E T+ QL + VT+ V LV T
Sbjct: 598 VSFTAETRVLRLAAHDEL--VEVESLAVDEATILAMRLPAEQLALWVTASYVGLVDLTLL 655
Query: 426 ELRN-EWKSPPGYSVNVATANASQ--VLLATGGGHLVYLEIGDGILTEVKHAQLEYEISC 482
E W P + A + VL++T L L+ L V L E++C
Sbjct: 656 ERPAVAWTPPSSEQITNAVFDELHKLVLVSTSNAQLYVLQQRKTSLVPVGSQTLPAEVAC 715
Query: 483 LDINPIGENPSYSQIAAVGMWTDISVRIFSLP-------DLNLITKEHLGGEIIPRSVLL 535
+ + +Y I A+G W + +R+F L +L I + L +PRSVLL
Sbjct: 716 I-------HAAYG-IVALGTWAESRIRLFRLHEDTGNRWELECIRESALPSTSVPRSVLL 767
Query: 536 CAFE----------GISY--LLCALGDGHLLNFLLNM----KTG-----------ELTDR 568
+ G ++ LL A+GDG L F ++ ++G +L
Sbjct: 768 TYLDDHGGLMADHGGRAHLCLLVAVGDGRLFAFNVSQPDAKRSGPEPEDLSKCELQLQHP 827
Query: 569 KKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAA 628
+++ LG++P L +VFAA R +VI++ + L NVNL++V+ F++
Sbjct: 828 RQLRLGSRPAALNNLQLHGMRYVFAACGRASVIHAHHGTLFCGNVNLRDVTRAVRFHTKG 887
Query: 629 FPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEE 688
FPDS+A+A E L +G I+ IQ+LHIR L E PRRI H R + IC L S
Sbjct: 888 FPDSIAVATEQGLALGGIEHIQQLHIRRHDLREQPRRIAHL---RGY-IC-LLTVSIVFG 942
Query: 689 SEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTK 748
E H VRLL+D+T E I+++ L E+G S+++ D + VGTAYVLP E EP++
Sbjct: 943 EERHHVRLLNDETLETITSHDLAMNEHGLSLVAIPERD----VFVVGTAYVLPSEMEPSR 998
Query: 749 GRILVFIVEDGKLQLIAEKETKGAVYSLNAF-------------NGKLLAA-INQKIQLY 794
GRILVF +L L+ E T GAVY+++A + LAA +N + LY
Sbjct: 999 GRILVF--SREELLLLNELYTPGAVYTMSALADPSDRTCRFPASAARFLAAGVNNVVILY 1056
Query: 795 KWMLRDDG-TRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEG------ 847
W G EL+ H GH+L L ++ RGD ++VGDLMKS+ +L EG
Sbjct: 1057 DWGQSGHGDDYELREVARHLGHVLVLRLEARGDQLLVGDLMKSLCVLQLVLPEGETSDGA 1116
Query: 848 --AIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEY 905
++ A DY W++A L++D YL A+N++NL ++++N E R L G +
Sbjct: 1117 SPCLKAVAWDYETAWITACAFLNEDTYLAADNSYNLLSLQRNPHETRSEFRHALNRAGAF 1176
Query: 906 HLGEFVNRFRHGSLVMRLPDS------------------DVGQIP------------TVI 935
HLG+ VN FR G LV + DV + T++
Sbjct: 1177 HLGDLVNVFRRGKLVTEASGNEEAGTGNGHSTIDTESTRDVARASTGTTTADNVSRQTLL 1236
Query: 936 FGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIK-------------------------G 970
F T G IG+I L Q+ L +++ LR + G
Sbjct: 1237 FATTAGAIGIIVPLDPAQHRMLSRVEKALRSLTDHPAEAGAADNGHERVRRSPLTSIRLG 1296
Query: 971 VGGLNHEQWRSFNNEKK----------TVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV 1020
V G H WR+ +E+ + + F+DGDL+E F +L T M+ ++ +
Sbjct: 1297 VEGFAHADWRTPLSERALNSLHSMTCFDLPRRGFVDGDLVERFRELEPTEMEFVAAQVGA 1356
Query: 1021 SVEELCKRVEELTRLH 1036
S+E + V++L RLH
Sbjct: 1357 SIEHIKLLVDDLARLH 1372
>gi|390342012|ref|XP_793599.3| PREDICTED: uncharacterized protein LOC588842 [Strongylocentrotus
purpuratus]
Length = 1161
Score = 494 bits (1271), Expect = e-136, Method: Compositional matrix adjust.
Identities = 254/513 (49%), Positives = 347/513 (67%), Gaps = 42/513 (8%)
Query: 44 SSELITRAMGDVSDRIGRPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFN 103
+ E+I +A G+V DRIGRP++ G IGIIDP+CR+IGL LYDGLFK+IP D + +AFN
Sbjct: 28 TEEIIPKAQGNVQDRIGRPSETGPIGIIDPECRMIGLRLYDGLFKIIPLDRDNKELKAFN 87
Query: 104 IRLEELQVLDIKFLYGCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNG 163
IRLEEL V+D++FLYGC +PTIV L+QD RHVKTYEV L++K+F GPW Q+N++
Sbjct: 88 IRLEELNVIDVQFLYGCHQPTIVFLHQD-PHGRHVKTYEVNLREKEFNRGPWKQDNVETE 146
Query: 164 ADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGD 220
A ++I VP P G LIIG+E+I Y + + AI I+ S YGR+D +GSRYLLGD
Sbjct: 147 ATMVIAVPQPYGGALIIGQESITYHKGDNYVAIAPPTIKNSTLVCYGRLDNNGSRYLLGD 206
Query: 221 HAGLLHLLVITHEKE-----KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIK 275
G L LL++ E+ V LK+E LGETSIA ++YLDN VV+IGS GDSQL++
Sbjct: 207 LTGRLFLLLLDKEESMDGAATVKDLKLEFLGETSIAECLTYLDNGVVFIGSRLGDSQLVR 266
Query: 276 LNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIG 335
LN + D GSYV ++E + NLGPIVD VVDL+RQGQGQ+VTCSGAYK+GSLRI+RNGIG
Sbjct: 267 LNTESDESGSYVTMMETFTNLGPIVDMAVVDLDRQGQGQLVTCSGAYKEGSLRIIRNGIG 326
Query: 336 INEQASVELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQ 395
I+E AS++L GIKG+W L+ + FD LV+SF+ +TR+ +
Sbjct: 327 IHEHASIDLPGIKGIWPLKVDMNSQFDDTLVLSFVGQTRVKSF----------------- 369
Query: 396 TQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGG 455
+ +T SVRLVS+ ++ + +EW+ G +++VA+ N+ QV+ A G
Sbjct: 370 -------------INYITGASVRLVSTPTKRMVSEWRPQSGKNISVASCNSKQVVCA-AG 415
Query: 456 GHLVYLEIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPD 515
+ YLEI +G + +V +E+E++CLDI P+ + ++ + AVG+WTDIS RI +P
Sbjct: 416 SDIFYLEIFEGEVRQVSTVTMEHEVACLDITPLTGD--FTDLCAVGLWTDISARILKIPT 473
Query: 516 LNLITKEHLGGEIIPRSVLLCAFEGISYLLCAL 548
+ + E LGGEIIPRS+LL FEG +Y+LCAL
Sbjct: 474 MESMHVELLGGEIIPRSILLMTFEGQNYILCAL 506
>gi|430814207|emb|CCJ28534.1| unnamed protein product [Pneumocystis jirovecii]
Length = 904
Score = 491 bits (1263), Expect = e-135, Method: Compositional matrix adjust.
Identities = 309/909 (33%), Positives = 511/909 (56%), Gaps = 69/909 (7%)
Query: 122 KPTIVVLYQDNKDARHVKTYEVAL--KDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLI 179
+PTI VLYQD+K A+H+ +YE+ + ++K EGP +LD G+ LLIP+ G++I
Sbjct: 8 RPTIAVLYQDSKYAKHLSSYEIIVQSREKRMKEGPLKSRDLDIGSSLLIPLLDS--GIII 65
Query: 180 IGEETIVYCSANA-FKA---IPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE 235
+GE+T++Y N+ K+ IPI P++ +Y ++ ++++L D G + +L +++ +
Sbjct: 66 VGEQTLMYIHPNSEIKSKFIIPI-PTVFSSYTTIN--NNKHILADDYGRIFMLTLSNSYK 122
Query: 236 KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVN 295
+KI +G TSIAS + YL N+ ++IGS YGDSQL+ + PD VL+ + N
Sbjct: 123 DSNFMKISQIGITSIASVLVYLPNSYLFIGSHYGDSQLVNI---PDCL-----VLQSFPN 174
Query: 296 LGPIVDFCVVDLERQGQGQ-VVTCSGAYKDGSLRIVRNGIGINEQASVE-LQGIKGMWSL 353
+ PI DFC V R+G+ + +VTCSGAYKDGSLRI+R + +N+ + L GI G+W L
Sbjct: 175 ISPISDFCFV--TREGRNEFIVTCSGAYKDGSLRILRYNVEMNKTLEISNLNGIYGIWGL 232
Query: 354 RSSTDDPFDTFLVVSFISETRILAM--NLEDELEETEIEGFC---SQTQTLFCHDAIYNQ 408
+ + T LV+SF++ETRIL + N+ +E E E + F ++ TL + N
Sbjct: 233 YLQNEFEY-TALVLSFVNETRILKVFHNILNEPEIEEWDNFAIPNAKLPTLVAKNVNNNL 291
Query: 409 LVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGIL 468
+++ S+ L+ S + EW + A + ++ G +V + + +
Sbjct: 292 FCFISNKSICLIDWKSNLVLKEWIPTADDIITCACLDTEFASVSLTKGKIVVFSLKNMTI 351
Query: 469 TEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEI 528
E+ + YE+SC+DI S + + +VG+W S+ I S+P + L+ L G +
Sbjct: 352 VEIGEYKFNYEVSCIDI-------SNNALISVGLWIVPSIHILSIPSMELLLSHSLLGTV 404
Query: 529 IPRSVLLCAFEGIS--YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSK 586
+PRS+ + + ++ +L +GDG LL++ L+ + GT PI L F +
Sbjct: 405 VPRSICIVSLASMNKPVILVGMGDGTLLSYGLD----------GLDKGTLPINLSKFITP 454
Query: 587 NTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTI 646
+VFA SDR +IY S+ KL +S++NLKE++ M F S+ F ++ + E + IG+I
Sbjct: 455 IGMNVFAISDRSIIIYGSSGKLSFSSINLKEINCMSSFISSIFSSTIVVVSENIIKIGSI 514
Query: 647 DDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLK---NQSCAEESEMHFVRLLDDQTFE 703
D +Q+L I++IPLGE PRRIC+ ++ + F + ++K S E + ++++LD +F+
Sbjct: 515 DSLQQLQIQTIPLGELPRRICYHDKQKVFGVLTIKLSLEASNGNEVQTSYLKILDVTSFD 574
Query: 704 FISTYPLDTFEY---GCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGK 760
I LD+F+ C S + D+ + VGT + LPEE E +KGRI++F V + K
Sbjct: 575 GI----LDSFQLELNECVQCITSVTIDNQDIFVVGTGFSLPEEEESSKGRIILFGVTNKK 630
Query: 761 LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY 820
+ + +E + AVY + + K++A IN + +Y + D + + L L
Sbjct: 631 IWVFSEIQVNDAVYCIGIIDNKIIAGINALVHIYAY---DSSLKNFNVIATYRSTTLCLS 687
Query: 821 VQTRGDFIVVGDLMKSISLLIYKHEEGA--IEERARDYNANWMSAVEILDDDIYLGAENN 878
+ G +++GDLMKS+SLL + + E ++E A+D N WM+ V LD+D+Y+GAE
Sbjct: 688 LAVHGTHVIIGDLMKSVSLLAFINTENGPRLKEVAKDCNPLWMTCVAALDNDLYIGAEAE 747
Query: 879 FNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGT 938
NL K+ T E +L+++ E GE VN+ + G++ L + IP F T
Sbjct: 748 GNLSLFWKDFN--TTFEENKLQIISEIKWGELVNQIKPGTI---LYSENSIIIPKATFVT 802
Query: 939 VNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSF-NNEKKTVDAKNFLDG 997
V+G IG+I ++ E FL LQ+N+ K+I G+G LNH WR+F N KK+ + K F+DG
Sbjct: 803 VDGSIGIIFTVKREYLEFLVNLQSNMGKIISGIGCLNHSNWRAFCNRRKKSNEPKCFIDG 862
Query: 998 DLIESFLDL 1006
D +E F++L
Sbjct: 863 DFVEIFINL 871
>gi|170589357|ref|XP_001899440.1| CPSF A subunit region family protein [Brugia malayi]
gi|158593653|gb|EDP32248.1| CPSF A subunit region family protein [Brugia malayi]
Length = 655
Score = 483 bits (1244), Expect = e-133, Method: Compositional matrix adjust.
Identities = 264/658 (40%), Positives = 387/658 (58%), Gaps = 71/658 (10%)
Query: 445 NASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWT 504
N Q+++A G L+YLE + + E+EISC+DI PIG+ S+I AVG WT
Sbjct: 3 NLGQLVVACGA-LLIYLEANSAGFKVITEIECEFEISCIDITPIGKGTLRSEICAVGYWT 61
Query: 505 DISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGE 564
D+SV + +LP L + +E + G+++ RS++L EG YLL ALGDG + F ++MKTG
Sbjct: 62 DLSVALRALPQLVEVVREKIVGDMLSRSIMLSPMEGHVYLLVALGDGTVHYFQIDMKTGA 121
Query: 565 LTDRKKVSLGTQPITLRTFSSKNTT--HVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMC 622
L D KK +LGTQPI LR F S+ + ++F SDRP VIYSSN+KLL+SNVNL+ VS M
Sbjct: 122 LLDPKKATLGTQPIHLRKFRSRCSPVHNIFVCSDRPAVIYSSNQKLLFSNVNLRMVSTMT 181
Query: 623 PFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAI----- 677
P + A+PD+L + L IG IDDIQKLHIR++PLGE P RI +Q ++ T A+
Sbjct: 182 PLYAEAYPDALVLTDGHSLVIGRIDDIQKLHIRTVPLGESPSRIAYQPETNTIAVIVERL 241
Query: 678 ----------------------CSLKN------------------QSCAEESEMHFVRLL 697
C+ KN + AEE E+ V LL
Sbjct: 242 EVILFLFFYVFVDAMGKHHFGQCASKNAMETSSSRLSSMRREPTPECLAEEMEVSSVLLL 301
Query: 698 DDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVE 757
D TFE + ++ L+ E S+ SC DDS Y+ VGTA ++ +E E GRI++F
Sbjct: 302 DSNTFEILHSHELEGSEMAMSLASCQLGDDSQPYFVVGTAVIMSDETESKMGRIMMFQAS 361
Query: 758 DG--KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGH 815
+G +++L+ EKE KGA YS+ + +GKL+ A+N ++L++W +EL+ EC +
Sbjct: 362 EGPERMRLVYEKEIKGAAYSIQSMDGKLVVAVNSCVRLFEW----TADKELRLECSDFDN 417
Query: 816 ILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGA 875
+ ALY++T+ D I+VGDLM+S+SLL YK E E+ ARD+ NWMSA EI+D D +LGA
Sbjct: 418 VTALYLKTKNDLILVGDLMRSLSLLSYKSMESTFEKVARDFMTNWMSACEIIDSDNFLGA 477
Query: 876 ENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVI 935
EN++NLFTV K+S EE RL+ +G ++LGE VN F HGSL D +++
Sbjct: 478 ENSYNLFTVMKDSFTVFKEEGTRLQELGLFYLGEMVNVFCHGSLTATQVDVAPLYHSSIL 537
Query: 936 FGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFL 995
+GT +G IGVI +P Y FL+ +Q L + + ++H Q+R+F EK++ F+
Sbjct: 538 YGTSDGGIGVIVQMPPVLYTFLQDVQKRLAEYAENCMRISHTQYRTFETEKRSEAPNGFI 597
Query: 996 DGDLIESFLDLSRTRMDEI-----------------SKTMNVSVEELCKRVEELTRLH 1036
DGDLIES LD+ + ++++ ++ ++ E++ K VE+L+R+H
Sbjct: 598 DGDLIESLLDMGKDSVEQVVNGLKMPLLNSISSSETTELVDALAEDVLKLVEDLSRIH 655
>gi|452003044|gb|EMD95501.1| hypothetical protein COCHEDRAFT_1165632 [Cochliobolus heterostrophus
C5]
Length = 1116
Score = 483 bits (1243), Expect = e-133, Method: Compositional matrix adjust.
Identities = 325/1078 (30%), Positives = 558/1078 (51%), Gaps = 63/1078 (5%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPTD 64
IYG++ L+ RP D LFI T+R+ + L W+AE +L T ++ V+D R +
Sbjct: 55 IYGKVTMLQKLRPALSPTDHLFIGTDRFMYFTLSWNAEKKQLQTEKSFASVADNAARESQ 114
Query: 65 NGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQ----------LKEAFNIRLEELQVLDI 114
G+ IDP R + + +Y+G+ VIP +G+ L + +RL E+ V
Sbjct: 115 TGERCHIDPTGRFMTIEVYEGILTVIPLVQRGKKRKQEADIAHLGDPQPVRLPEMFVRSS 174
Query: 115 KFLYGCA---KPTIVVLYQDNKDARHVKTYEVAL--KDKDFVEGPWSQNNLDNGADLLIP 169
FL + KP + +LY+D ++ E+ + D EG ++ L+ G+ LIP
Sbjct: 175 AFLRPRSTDDKPKMALLYEDTHAQVKLRLRELTYVGDEVDLQEGEACKSELELGSSHLIP 234
Query: 170 VPPPLCGVLIIGEETIVYC--SANAFKAIPI-RPSITKAYGRVDADGSRYLLGDHAGLLH 226
+ P G+++IGE +I Y + + P+ +I A+ R+DA R++L D G L+
Sbjct: 235 LEEPSHGLVVIGETSIGYFDDESGDLQTEPLDEATIFVAWERIDA--QRFVLADDYGRLY 292
Query: 227 L-LVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGS 285
+ +++ + + +V K++++G+TS AST+ YLD V++GS GDSQ+I++ A+ S
Sbjct: 293 MFMLVLNAQGRVQSWKLDVIGQTSRASTLVYLDAGYVFVGSHQGDSQVIRI-----AEKS 347
Query: 286 YVEVLERYVNLGPIVDFCVVDL-ERQGQGQ----------VVTCSGAYKDGSLRIVRNGI 334
+E+++ + N+ PI+DF ++D+ R G+GQ +VT SGAY+DGSLR VR+G+
Sbjct: 348 -MEIVQTFSNIAPILDFTIMDMGNRSGEGQTNEYSSGQARIVTGSGAYQDGSLRSVRSGV 406
Query: 335 GINEQASV-ELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEET-EIEGF 392
G+ + + E++ I ++ LRS+ + L+VSF+ E+R+ + + E+EE E
Sbjct: 407 GLEDLGVLGEMEHISELFCLRSAPSAQYTDTLLVSFVGESRVFRFDSQGEVEEVDEFVSL 466
Query: 393 CSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLA 452
TL + ++VQVT+G R+ + +EW G ++ A+ NA+ VL++
Sbjct: 467 ALDETTLAAANIPQGRIVQVTNGRARICDLDGGMMTSEWVPADGKTITAASVNATHVLVS 526
Query: 453 TGGGHLVYLEIGDGI-LTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIF 511
GG +V L + DG+ + + K E +++C+ + P G S VG W + + I
Sbjct: 527 LGGVTIVSLSMADGLQVVKEKTFGAESQVACVTV-PSGT----SSTCFVGFWNNSQLAIC 581
Query: 512 SLPDLNLITKEHLGGEIIPRSVLLCAF--EGISYLLCALGDGHLLNFLLNMKTGELTDRK 569
SL L + + + +PRS+LL + L AL DG+++ + + EL+ RK
Sbjct: 582 SLDTLEAVKTVQISDDSVPRSLLLTQIFPDQPPSLFVALADGNVVTYTFDPSNHELSGRK 641
Query: 570 KVSLGTQPITLRTFSSKNTT-HVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAA 628
+ LGT+ T R + +VFA + P++IY+S +L+YS V ++ + +CPF+S A
Sbjct: 642 SIVLGTREATFRALPRGDGLFNVFATCEHPSLIYASEGRLVYSAVTAEKATTVCPFDSEA 701
Query: 629 FPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKN-QSCAE 687
+P S+AIA +L I +D + H++++ + E RRI + + F + ++K E
Sbjct: 702 YPSSVAIATSQDLRIALVDTERTTHVQTLKVDETVRRIAYSPSLKAFGLGTIKRILKHGE 761
Query: 688 ESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSN---VYYCVGTAYVLPEEN 744
E + +L+D+ F+ + TY L+ E ++ C D S + VGTAYV +
Sbjct: 762 EIMLSHFKLVDEIQFKELDTYALNEEELVECVMRCELPDGSGGLAERFVVGTAYVDDQNT 821
Query: 745 EPTKGRILVFIVEDGK-LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGT 803
+GRI+V V + L+L+ E KG L GK++AA+ + I +Y + R T
Sbjct: 822 TSERGRIIVLEVTPERILKLVTEIAVKGGCRCLAICQGKIVAALIKTIVVYDFEYRAPST 881
Query: 804 RELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEER----ARDYNAN 859
+L + V G I + DLMKS+ ++ Y E + ++ AR Y
Sbjct: 882 PDLVKLASFRCSTAPIDVTVNGSLIAIADLMKSLVIVEYTKGETGLPDKLVEVARHYQIT 941
Query: 860 WMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSL 919
W +AV +D + YL ++ NL + ++ G TD+++ RL + E LGE VNR R
Sbjct: 942 WATAVAEVDTNTYLESDAEGNLVVLYRDPNGVTDDDKRRLNISSEMLLGEMVNRIRR--- 998
Query: 920 VMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQW 979
V L +D +P GTV G I + + + L LQ+NL ++ G + ++
Sbjct: 999 VDVLTAADAVIVPKAFVGTVEGSIYLFGLISPDYQNLLMTLQSNLGSLVAAPGDMQFAKF 1058
Query: 980 RSFNNE-KKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRLH 1036
R+F N ++ + F+DG+L+E FLD+S + + V +EE+ +E L RLH
Sbjct: 1059 RAFKNSVREEEEPMRFVDGELVERFLDVSEEVQHKAIDGLGVELEEVKGLIEALRRLH 1116
>gi|312076590|ref|XP_003140929.1| CPSF A subunit region family protein [Loa loa]
Length = 655
Score = 481 bits (1239), Expect = e-133, Method: Compositional matrix adjust.
Identities = 261/652 (40%), Positives = 381/652 (58%), Gaps = 69/652 (10%)
Query: 450 LLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVR 509
L+ G L+YLE + + E+EISC+DI PIG S+I AVG WTD+SV
Sbjct: 8 LVIACGALLIYLEADSAGFKLISELECEFEISCIDITPIGNETLRSEICAVGYWTDLSVA 67
Query: 510 IFSLPDLNLITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRK 569
+ +LP L + +E + G+++ RS++L EG YLL ALGDG + F ++MKTG L D K
Sbjct: 68 LRTLPQLMEVVREKIPGDMLSRSIMLSPMEGHVYLLVALGDGTVHYFQIDMKTGALLDPK 127
Query: 570 KVSLGTQPITLRTFSSKNTT--HVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSA 627
K +LGTQPI LR F S+ ++ ++F SDRP VIYSSN+KLL+SNVNL+ VS M P +
Sbjct: 128 KATLGTQPIHLRKFRSRCSSVHNIFVCSDRPAVIYSSNQKLLFSNVNLRMVSTMTPLYAE 187
Query: 628 AFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAI---------- 677
A+PD+L + L IG IDDIQKLHIR++PLGE P RI +Q ++ T A+
Sbjct: 188 AYPDALVLTDGNSLVIGRIDDIQKLHIRTVPLGESPSRIAYQPETNTIAVTVERLEFVDA 247
Query: 678 --------CSLKN------------------QSCAEESEMHFVRLLDDQTFEFISTYPLD 711
C+ KN + AEE E+ + LLD TFE + ++ L+
Sbjct: 248 MGKHHFGQCASKNAMETSSSRLSSMRREPTPECLAEEMEVSSILLLDSNTFEILHSHELE 307
Query: 712 TFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDG--KLQLIAEKET 769
E S+ SC +DS Y+ VGTA ++ +E E GRI++F +G +++L+ EKE
Sbjct: 308 GSEMAMSLASCQLGNDSQPYFVVGTAVIMSDETESKMGRIMMFQASEGPERMRLVYEKEI 367
Query: 770 KGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIV 829
KGA YS+ + +GKL+ A+N ++L++W +EL+ EC ++ ALY++T+ D I+
Sbjct: 368 KGAAYSIQSMDGKLVVAVNSCVRLFEWT----ADKELRLECSDFDNVTALYLKTKNDLIL 423
Query: 830 VGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSE 889
VGDLM+S+SLL YK E E+ ARD+ NWMSA EI+D D +LGAEN++NLFTV K+S
Sbjct: 424 VGDLMRSLSLLSYKSVESTFEKVARDFMTNWMSACEIIDSDSFLGAENSYNLFTVVKDSF 483
Query: 890 GATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASL 949
EE RL+ +G ++LGE VN F HGSL D ++++GT +G IGVI +
Sbjct: 484 TVFKEEGTRLQELGLFYLGEMVNVFCHGSLTATQVDVAPLYHSSILYGTSDGGIGVIVQM 543
Query: 950 PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRT 1009
P Y FL +Q L + ++H Q+R+F EK++ F+DGDLIES LD+ +
Sbjct: 544 PPVLYTFLHDVQKRLADYTENCMRISHTQYRTFETEKRSEVPNGFIDGDLIESLLDMGKD 603
Query: 1010 RMDEI-------------------------SKTMNVSVEELCKRVEELTRLH 1036
+ +I ++ ++ + E++ K VE+L+R+H
Sbjct: 604 SVGQIVNGLKMPLLNIIRVITLLPVPSSETTELVDATAEDVLKLVEDLSRIH 655
>gi|452838792|gb|EME40732.1| hypothetical protein DOTSEDRAFT_177898 [Dothistroma septosporum
NZE10]
Length = 1138
Score = 481 bits (1238), Expect = e-133, Method: Compositional matrix adjust.
Identities = 332/1105 (30%), Positives = 561/1105 (50%), Gaps = 98/1105 (8%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPTD 64
+YG++ L +P D LF+ T+RY + + W+ ++++L T +A D++++ R +
Sbjct: 58 LYGKVTLLHKLKPATSQTDHLFVGTDRYHYFTISWNLKTNDLNTEKAYVDIAEKAARDSQ 117
Query: 65 NGQIGIIDPDCRLIGLHLYDGLFKVIPF-----------DNK-GQLKEAFNIRLEELQVL 112
G IDP R + + Y+G+ V+P DN+ G+L E +R+ EL V
Sbjct: 118 TGDRVHIDPTSRFMTIECYEGVVNVLPIAHAGKGKRKAADNEIGELHEPIPVRIPELFVR 177
Query: 113 DIKFLY------GCAKPTIVVLYQDNKDARHVKT----YEVALKDKD------FVEGPWS 156
FL+ P +L++D+ + +K Y +L+ D +G
Sbjct: 178 SSCFLHRRQPTTKAVDPLFAILHEDSTNKVRLKIRELEYSPSLRPNDEPAVAELEKGHDV 237
Query: 157 QNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIRP--SITKAYGRVDADGS 214
++ L+ GA +LIP+P P+ G+L++GE +I Y ++ P T D
Sbjct: 238 EDTLELGASILIPLPSPMHGLLVVGETSISYIEEWQYRVANTEPLDEATIFVAWCSLDDQ 297
Query: 215 RYLLGDHAGLLHLLVITHEKE-KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQL 273
RY+L D G L+LL++ +E + G +I++LGETS AST+ YLD V++GS GDSQ+
Sbjct: 298 RYVLADDYGKLYLLMVQQNREGEYHGHQIDILGETSRASTLVYLDGGRVFVGSHQGDSQI 357
Query: 274 IKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQ-----------GQGQVVTCSGAY 322
I+++ Q +EVL+ + N+ PI+DF V+D+ + GQ ++VT SGA+
Sbjct: 358 IQISEQS------IEVLQTFPNIAPILDFTVMDMGNRSSDAPVNEFSSGQARIVTGSGAF 411
Query: 323 KDGSLRIVRNGIGINEQASVELQG--IKGMWSLRSSTDDPFDTFLVVSFISETRILAMNL 380
KDGSLR VR+G+G+ ++ + G I ++SLRSS LVVSF+ +TR+ +
Sbjct: 412 KDGSLRSVRSGVGLEDRGDLGSLGAPISEVFSLRSSAAAKLVDTLVVSFVDQTRVFVFDQ 471
Query: 381 EDELEET-EIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSV 439
E ++EE E GF TLF + ++VQV+S S+ L + S + + W +P G ++
Sbjct: 472 EGDVEERDEFRGFSLSDATLFAGNLPDGRVVQVSSTSILLADAESGMITDTWAAPGGANI 531
Query: 440 NVATANASQVLLATGGGHLVYLEI-GDGILTEVKHA-QLEYEISCLDINPIGENPSYSQI 497
A+ S++LL+ G LV L++ G+ I + + E+SC+ ++ S
Sbjct: 532 TAVAADRSKLLLSLSGKALVALDLSGESIAVQARREFSSTEEVSCIALSA-----SIPDT 586
Query: 498 AAVGMWTDISVRIFSLPDLNLITKEHLGGE--IIPRSVLLCAF--EGISYLLCALGDGHL 553
G W D V SL +L IT E + + +PRS+ + + + L L DG++
Sbjct: 587 CVAGFWKDGKVAFLSLHNLEAITTERVADDDLAVPRSLSIATILRDQPATLFVGLADGNV 646
Query: 554 LNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKN-TTHVFAASDRPTVIYSSNKKLLYSN 612
+ + + RK + LGTQ + +VFA + P++IY S +++YS
Sbjct: 647 VTYTIESLQKPFAARKSIILGTQQANFAVLPRGDGLQNVFATCEHPSLIYGSEGRMVYSA 706
Query: 613 VNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQS 672
V + + +C FNS ++ +++AIA EL I +D+ + H++ + + E RR + +
Sbjct: 707 VTAESATSICSFNSNSYGNAIAIASNDELRIAAVDEERTTHVQDLFIHETVRRTAYSAEL 766
Query: 673 RTFAICSLKNQSCA--EESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSD---D 727
+ F + ++ A EE + HF +L+D+ F+ + +Y L+ E S++ C D D
Sbjct: 767 KAFGLGCIQRTLTAGQEEVKSHF-KLVDEVAFKELDSYELNEDELVESVIRCKLDDGSGD 825
Query: 728 SNVYYCVGTAYVLPEENEPTKGRILVF-IVEDGKLQLIAEKETKGAVYSLNAFNGKLLAA 786
+ VGTAY+ +++ +GRIL+ + ED +L+L+ E KGA L GK++AA
Sbjct: 826 GAERFAVGTAYLDDQDSNTARGRILILEVTEDRRLKLVTELSVKGACRCLAVCEGKIVAA 885
Query: 787 INQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEE 846
+ + + +Y + L + + + V G+ I V DLMKS+SL+ YK
Sbjct: 886 LIKTVIIYDFEFAASKA-TLTKKASYRTATAPIDVCVTGNVIAVTDLMKSMSLVEYKKGR 944
Query: 847 GA----IEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVV 902
+ E AR + W +AV + D+ YL ++ NL ++ ++ G ++E+R RL V
Sbjct: 945 TGMPDTLTEIARHFETLWGTAVANVADNTYLQSDAEGNLIVLQHDTNGFSEEDRRRLRVT 1004
Query: 903 GEYHLGEFVNRFR-------HGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQY 954
E LGE VNR R HG+LV IP TV G I + A + P +Q
Sbjct: 1005 SELLLGEMVNRIRRIDVTPTHGALV----------IPRAFLATVEGSIYLFALIVPGKQD 1054
Query: 955 LFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDA---KNFLDGDLIESFLDLSRTRM 1011
L + ++Q N+ ++K G + +R F N+ + A F+DG+LIE FLD +
Sbjct: 1055 LLM-RMQNNMASLVKSPGHVEFATFRGFKNQVRDEGANGPSRFVDGELIERFLDCGQDIQ 1113
Query: 1012 DEISKTMNVSVEELCKRVEELTRLH 1036
+EI + + + +EE VE L R+H
Sbjct: 1114 EEIIRDLGIELEEARGMVESLRRIH 1138
>gi|451856466|gb|EMD69757.1| hypothetical protein COCSADRAFT_32431 [Cochliobolus sativus ND90Pr]
Length = 1116
Score = 479 bits (1233), Expect = e-132, Method: Compositional matrix adjust.
Identities = 323/1078 (29%), Positives = 559/1078 (51%), Gaps = 63/1078 (5%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPTD 64
IYG++ L+ RP D LF+ T+R+ + L W+ E +L T ++ V+D R +
Sbjct: 55 IYGKVTMLQKLRPALSPTDHLFVGTDRFMYFTLSWNTEKKQLQTEKSFASVADNAARESQ 114
Query: 65 NGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQ----------LKEAFNIRLEELQVLDI 114
G+ IDP R + + +Y+G+ VIP +G+ L + +RL E+ V
Sbjct: 115 TGERCHIDPTGRFMTIEVYEGILTVIPLVQRGKKRKQEADIAHLGDPQPVRLPEMFVRSS 174
Query: 115 KFLYGCA---KPTIVVLYQDNKDARHVKTYEV--ALKDKDFVEGPWSQNNLDNGADLLIP 169
FL + KP + +LY+D ++ E+ A + D EG ++ L+ G+ LIP
Sbjct: 175 AFLRPRSTDDKPKMALLYEDTHAQVKLRLRELTYAGDEVDLQEGETCKSELELGSSHLIP 234
Query: 170 VPPPLCGVLIIGEETIVYC--SANAFKAIPI-RPSITKAYGRVDADGSRYLLGDHAGLLH 226
+ P G++IIGE +I Y + + P+ +I A+ R+DA R++L D G L+
Sbjct: 235 LEEPSHGLVIIGETSIGYFDDESGELQTEPLDEATIFVAWERIDA--QRFVLADDYGRLY 292
Query: 227 L-LVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGS 285
+ +++ + + +V K++++G+TS AST+ YLD V++GS GDSQ+I++ A+ S
Sbjct: 293 MFMLVLNAQGRVQSWKLDVIGQTSRASTLVYLDAGYVFVGSHQGDSQVIRI-----AEKS 347
Query: 286 YVEVLERYVNLGPIVDFCVVDL-ERQGQGQ----------VVTCSGAYKDGSLRIVRNGI 334
+E+++ + N+ PI+DF ++D+ R G+GQ +VT SGAY+DGSLR VR+G+
Sbjct: 348 -MEIVQTFSNIAPILDFTIMDMGNRSGEGQTNEYSSGQARIVTGSGAYQDGSLRSVRSGV 406
Query: 335 GINEQASV-ELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEET-EIEGF 392
G+ + + E++ I ++SL+S+ + L+VSF+ E+R+ + + E+EE E
Sbjct: 407 GLEDLGVLGEMEHISELFSLKSTASAQYIDTLLVSFVDESRVFRFDSQGEVEEVDEFVSL 466
Query: 393 CSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLA 452
TL + ++VQVT+G R+ + +EW G ++ A+ N + VL++
Sbjct: 467 ALDETTLAAANIPQGRIVQVTNGRARICDLDGGMITSEWVPTDGKTITAASVNETHVLVS 526
Query: 453 TGGGHLVYLEIGDGI-LTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIF 511
GG +V L + +G+ + + K E +++C+ + P G S VG W + + I
Sbjct: 527 LGGVTIVSLSMANGLQVVKEKTFGAESQVACVTL-PSGT----SSTCFVGFWNNSQLAIC 581
Query: 512 SLPDLNLITKEHLGGEIIPRSVLLCAF--EGISYLLCALGDGHLLNFLLNMKTGELTDRK 569
SL L + + + +PRS+LL + L AL DG+++ + + EL+ RK
Sbjct: 582 SLDTLEAVKTVQISDDSVPRSLLLTQIFPDQPPSLFVALADGNVVTYTFDQSNHELSGRK 641
Query: 570 KVSLGTQPITLRTFSSKNTT-HVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAA 628
+ LGT+ T R + +VFA + P++IY+S +L+YS V ++ + +CPF+S A
Sbjct: 642 SIVLGTREATFRALPRGDGLFNVFATCEHPSLIYASEGRLVYSAVTAEKATTVCPFDSEA 701
Query: 629 FPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKN-QSCAE 687
+P S+AIA +L I +D + H++++ + E RRI + + F + ++K E
Sbjct: 702 YPGSVAIATSQDLRIALVDTERTTHVQTLKVDETVRRIAYSPSLKAFGLGTIKRILKHGE 761
Query: 688 ESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSN---VYYCVGTAYVLPEEN 744
E + +L+D+ F+ + TY L+ E ++ C D S + VGTAYV +
Sbjct: 762 EIMLSHFKLVDEIQFKELDTYALNEEELVECVMRCELPDGSGGLAERFVVGTAYVDDQNT 821
Query: 745 EPTKGRILVFIVEDGK-LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGT 803
+GRI+V V + L+L+ E KG L GK++AA+ + I +Y + R T
Sbjct: 822 TSERGRIIVLEVTPERILKLVTEVAVKGGCRCLAICQGKIVAALIKTIVVYDFEYRTPST 881
Query: 804 RELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEER----ARDYNAN 859
+L + + G I + DLMKS+ ++ Y E + ++ AR Y
Sbjct: 882 PDLVKLASFRCSTAPIDLTVNGSLIAIADLMKSLVVVEYTKGETGLPDKLVEVARHYQIT 941
Query: 860 WMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSL 919
W +AV +D ++YL ++ NL + ++ G TDE++ RL + E LGE VNR R
Sbjct: 942 WATAVAEVDTNMYLESDAEGNLVVLYRDPNGVTDEDKRRLNISSEMLLGEMVNRIRR--- 998
Query: 920 VMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQW 979
+ L +D +P GTV G I + + + L LQ+NL ++ G + ++
Sbjct: 999 IDVLTAADAVIVPKAFVGTVEGSIYLFGLISPDYQNLLMTLQSNLGSLVAAPGDMQFAKF 1058
Query: 980 RSFNNE-KKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRLH 1036
R+F N ++ + F+DG+LIE FLD+S + + V +EE+ +E L RLH
Sbjct: 1059 RAFKNSVREEEEPMRFVDGELIERFLDVSEEVQSKAIDGLGVELEEVKGLIEALRRLH 1116
>gi|330935579|ref|XP_003305038.1| hypothetical protein PTT_17772 [Pyrenophora teres f. teres 0-1]
gi|311318228|gb|EFQ86975.1| hypothetical protein PTT_17772 [Pyrenophora teres f. teres 0-1]
Length = 1115
Score = 479 bits (1232), Expect = e-132, Method: Compositional matrix adjust.
Identities = 336/1083 (31%), Positives = 565/1083 (52%), Gaps = 74/1083 (6%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPTD 64
IYG++ L+ RP + D LF+ T+R+ + L WDAE +L T ++ V+D R +
Sbjct: 55 IYGKVTMLQKLRPALSSTDHLFVGTDRFMYFTLSWDAEQGQLQTEKSFASVADNAARESQ 114
Query: 65 NGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQ----------LKEAFNIRLEELQVLDI 114
G+ IDP R + + +Y+G+ VIP +G+ L E +RL E+ +
Sbjct: 115 TGERCHIDPLGRFMTVEVYEGIITVIPLVQRGKKRKQEADIAHLGEPQPVRLPEMFIRSS 174
Query: 115 KFLYGCA---KPTIVVLYQDNKDARHVKTYEV--ALKDKDFVEGPWSQNNLDNGADLLIP 169
FL + KP + +LY+D +K E+ A + D EG ++ L+ G+ LIP
Sbjct: 175 AFLRPRSPDEKPKMALLYEDTHSQVKLKLRELTYAGDEVDLQEGETCKSELELGSSHLIP 234
Query: 170 VPPPLCGVLIIGEETIVYC--SANAFKAIPI-RPSITKAYGRVDADGSRYLLGDHAGLLH 226
V P G+++IGE +I Y + + P+ +I A+ R+DA R++L D G L+
Sbjct: 235 VEKPSHGLVVIGETSIGYYDDESGEIHSEPLDEATIFVAWERIDA--QRFVLADDYGRLY 292
Query: 227 L-LVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGS 285
+ +++ + KV K++++G+TS AST+ YLD VY+GS GDSQ+IK+ A+ S
Sbjct: 293 MFMLVLSAQGKVQSWKLDIIGQTSRASTLVYLDAGYVYVGSHQGDSQVIKI-----AEKS 347
Query: 286 YVEVLERYVNLGPIVDFCVVDLERQ-----------GQGQVVTCSGAYKDGSLRIVRNGI 334
+E+++ + N+ PI+DF ++D+ + GQ ++VT SGAY+DGSLR VR+G+
Sbjct: 348 -MEIVQTFSNIAPILDFTIMDMGNRSAEGQTNEYSSGQARIVTASGAYQDGSLRSVRSGV 406
Query: 335 GINEQASV-ELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEET-EIEGF 392
G+ + + E++ I ++SL+S+ + L+VSF++E+RI + + E+EE E
Sbjct: 407 GLEDLGVLSEMEHISDLFSLKSTASAQYADTLLVSFVNESRIFRFDPQGEVEEVDEFASL 466
Query: 393 CSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLA 452
TL + ++VQVT+G R+ + +EW G ++ A+ N S VL++
Sbjct: 467 ALDETTLAAANISQGRIVQVTNGRARICDLDGGMIISEWLPMGGPTITAASINESHVLVS 526
Query: 453 TGGGHLVYLEIGDGI-LTEVKHAQLEYEISCLDINPIGENPS-YSQIAAVGMWTDISVRI 510
GG +V + + DG+ + + K E +++C+ + PS S I VG W + + I
Sbjct: 527 LGGVTVVSISMADGLQVVKEKTFNAESQVACIAL------PSDSSSICFVGFWNNSGLAI 580
Query: 511 FSLPDLNLITKEHLGGEIIPRSVLLCAF--EGISYLLCALGDGHLLNFLLNMKTGELTDR 568
SL L + + + +PRS+LL L AL DG+++ + + T EL+ R
Sbjct: 581 CSLDTLEPVKTVQISDDSVPRSLLLTQIFPAQPPSLFVALADGNVVTYAFDPSTHELSGR 640
Query: 569 KKVSLGTQPITLRTFSSKNTT-HVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSA 627
K + LGT+ T R N +VFA + P++IYSS +L+YS V ++ + +CPF+S
Sbjct: 641 KSIVLGTREATFRALPRGNGLFNVFATCEHPSLIYSSEGRLVYSAVTAEKATTVCPFDSE 700
Query: 628 AFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAE 687
A+P S+AIA +L I +D + H++++ + E RRI + + F + ++K +
Sbjct: 701 AYPGSVAIATSEDLRIALVDTERTTHVQTLKVDETVRRIAYSPSLKAFGLGTIKRILKND 760
Query: 688 ESEM--HFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVY---YCVGTAYVLPE 742
E M HF +L+D+ F+ + +Y L+ E ++ C D S + +GTAY+ +
Sbjct: 761 EELMASHF-KLVDEIQFKELDSYALNEEELIECVMRCELPDGSGGLAECFVIGTAYLDDQ 819
Query: 743 ENEPTKGRILVFIVEDGK-LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDD 801
+GRIL+ V + L+L+ E KG L GK++AA+ + I +Y +
Sbjct: 820 NTTSERGRILILEVTPERILKLVTEIAVKGGCRCLATCEGKIVAALIKTIVIYD-VEYPT 878
Query: 802 GTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEER----ARDYN 857
T L + + G I + DLMKS+ ++ Y E + ++ AR Y
Sbjct: 879 QTPFLTKLATFRCSTAPIDITVNGSKIAIADLMKSLVVVEYTKGEAGLPDKLVEVARHYQ 938
Query: 858 ANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHG 917
W +AV +D ++YL ++ NL + ++ G TD+++ RL V E LGE VNR R
Sbjct: 939 ITWATAVAEVDTNMYLESDAEGNLMVLYRDPNGVTDDDKRRLNVSSEMLLGEMVNRIRR- 997
Query: 918 SLVMRLPDSDVGQIPTVIFGTVNGVI---GVIASLPHEQYLFLEKLQTNLRKVIKGVGGL 974
+ L SD IP GTV G I G+I+ P Q L + LQ+NL ++ G +
Sbjct: 998 --IDVLTASDAVVIPRAFVGTVEGSIYLFGLIS--PAHQNLLM-TLQSNLGALVPAPGDM 1052
Query: 975 NHEQWRSFNNE-KKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELT 1033
+ ++R+F N ++ + F+DG+ +E FLD S + + + V +EE+ VE L
Sbjct: 1053 DFAKFRAFKNGVREEEEPMRFVDGEFVERFLDQSEEVQTKAIEGLGVGLEEVRGLVEGLR 1112
Query: 1034 RLH 1036
RLH
Sbjct: 1113 RLH 1115
>gi|189205943|ref|XP_001939306.1| DNA damage-binding protein 1 [Pyrenophora tritici-repentis Pt-1C-BFP]
gi|187975399|gb|EDU42025.1| DNA damage-binding protein 1 [Pyrenophora tritici-repentis Pt-1C-BFP]
Length = 1115
Score = 477 bits (1227), Expect = e-131, Method: Compositional matrix adjust.
Identities = 336/1083 (31%), Positives = 569/1083 (52%), Gaps = 74/1083 (6%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPTD 64
IYG++ L+ RP + D LF+ T+R+ + L WDAE +L T ++ V+D R +
Sbjct: 55 IYGKVTMLQKLRPALSSTDHLFVGTDRFMYFTLSWDAEKRQLQTEKSFASVADNAARESQ 114
Query: 65 NGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQ----------LKEAFNIRLEELQVLDI 114
G+ IDP R + + +Y+G+ VIP +G+ L E +RL E+ +
Sbjct: 115 TGERCHIDPSGRFMTVEVYEGIITVIPLVQRGKKRKQEADIAHLGEPQPVRLPEMFIRSS 174
Query: 115 KFLYGCA---KPTIVVLYQDNKDARHVKTYEV--ALKDKDFVEGPWSQNNLDNGADLLIP 169
FL + KP + +LY+D +K E+ A + D EG +++L+ G+ LIP
Sbjct: 175 AFLRPRSPDEKPKMALLYEDTHSQVKLKLRELTYAGDEVDLQEGEICESDLELGSSHLIP 234
Query: 170 VPPPLCGVLIIGEETIVYC--SANAFKAIPI-RPSITKAYGRVDADGSRYLLGDHAGLLH 226
V P G+++IGE +I Y + + P+ +I A+ R+DA R++L D G L+
Sbjct: 235 VEKPSHGLVVIGETSIGYYDDESGEIHSEPLDEATIFVAWERIDA--QRFVLADDYGRLY 292
Query: 227 L-LVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGS 285
+ +++ + KV K++++G+TS AST+ YLD VY+GS GDSQ+IK+ A+ S
Sbjct: 293 MFMLVLSAQGKVQSWKLDIIGQTSRASTLVYLDAGYVYVGSHQGDSQVIKI-----AEKS 347
Query: 286 YVEVLERYVNLGPIVDFCVVDL-ERQGQGQ----------VVTCSGAYKDGSLRIVRNGI 334
+E+++ + N+ PI+DF ++D+ R G+GQ +VT SGAY+DGSLR VR+G+
Sbjct: 348 -MEIVQTFSNIAPILDFTIMDMGNRSGEGQTNEYSSGQARIVTASGAYQDGSLRSVRSGV 406
Query: 335 GINEQASV-ELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEET-EIEGF 392
G+ + + E++ I ++SL+S+ + L+VSF++E+RI + + E+EE E
Sbjct: 407 GLEDLGVLGEMEHISDLFSLKSTASAQYADTLLVSFVNESRIFRFDPQGEVEEVDEFASL 466
Query: 393 CSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLA 452
TL + +++QVT+G R+ + +EW G ++ A+ N S VL++
Sbjct: 467 ALDETTLAAANISQGRIIQVTNGRARICDLDGGMITSEWLPMGGPTITAASINESHVLVS 526
Query: 453 TGGGHLVYLEIGDGI-LTEVKHAQLEYEISCLDINPIGENPS-YSQIAAVGMWTDISVRI 510
GG +V + + +G+ + + K E +++C+ + PS S I VG W + + I
Sbjct: 527 LGGVTVVSISMANGLQVVKEKTFGAESQVACIAL------PSDSSSICFVGFWNNSGLAI 580
Query: 511 FSLPDLNLITKEHLGGEIIPRSVLLCA-FEG-ISYLLCALGDGHLLNFLLNMKTGELTDR 568
SL L + + + +PRS+LL F G L AL DG+++ + + T EL+ R
Sbjct: 581 CSLDTLEPVKTVQISDDSVPRSLLLTQIFPGQPPSLFVALADGNVVTYAFDPSTHELSGR 640
Query: 569 KKVSLGTQPITLRTFSSKNTT-HVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSA 627
K + LGT+ T R + +VFA + P++IYSS +L+YS V ++ + +CPF+S
Sbjct: 641 KSIVLGTREATFRALPRGDGLFNVFATCEHPSLIYSSEGRLVYSAVTAEKATTVCPFDSE 700
Query: 628 AFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKN--QSC 685
A+P S+AIA +L I +D + H++++ + E RRI + + F + ++K ++
Sbjct: 701 AYPGSVAIATPEDLRIALVDTERTTHVQTLKVDETVRRIAYSPSLKAFGLGTIKRILKNG 760
Query: 686 AEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSN---VYYCVGTAYVLPE 742
E HF +L+D+ F+ + +Y L+ E ++ C D S + +GTAY+ +
Sbjct: 761 EEVMASHF-KLVDEIQFKELDSYALNEEELIECVMRCELPDGSGGVAERFVIGTAYLDDQ 819
Query: 743 ENEPTKGRILVFIVEDGK-LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDD 801
+GRIL+ V + L+L+ E KG L GK++AA+ + I +Y +
Sbjct: 820 STTSERGRILILEVTPERILKLVMEIAVKGGCRCLATCEGKIVAALIKTIVIYD-VEYPT 878
Query: 802 GTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEER----ARDYN 857
T L + + G IV+ DLMKS+ ++ Y E + ++ AR Y
Sbjct: 879 QTPFLTKLATFRCSTAPIDITVNGPKIVIADLMKSLVVVEYTKGEAGLPDKLVEVARHYQ 938
Query: 858 ANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHG 917
W +AV +D ++YL ++ NL + ++ G TD+++ RL V E LGE VNR R
Sbjct: 939 ITWATAVAEVDTNMYLESDAEGNLMVLYRDPNGVTDDDKRRLNVSSEMLLGEMVNRIRR- 997
Query: 918 SLVMRLPDSDVGQIPTVIFGTVNGVI---GVIASLPHEQYLFLEKLQTNLRKVIKGVGGL 974
+ L SD IP GTV G I G+I+ P Q L + LQ+NL +I G +
Sbjct: 998 --IDVLTASDAVVIPRAFVGTVEGSIYLFGLIS--PAHQNLLM-TLQSNLGALIPAPGDM 1052
Query: 975 NHEQWRSFNNE-KKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELT 1033
+ ++R+F N ++ + F+DG+ +E FLD + + + V +EE+ +E L
Sbjct: 1053 DFAKFRAFKNGVRQEEEPMRFVDGEFVERFLDQGEEVQVKAIEGLGVGLEEVRGLIEGLR 1112
Query: 1034 RLH 1036
RLH
Sbjct: 1113 RLH 1115
>gi|449295711|gb|EMC91732.1| hypothetical protein BAUCODRAFT_116696 [Baudoinia compniacensis UAMH
10762]
Length = 1148
Score = 476 bits (1226), Expect = e-131, Method: Compositional matrix adjust.
Identities = 327/1109 (29%), Positives = 559/1109 (50%), Gaps = 96/1109 (8%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPTD 64
+YG++ L RP D LF+ T+ Y + L WDA + +L T ++ D++++ R +
Sbjct: 58 LYGKVTLLHKLRPATSQTDHLFVGTDHYHYFTLSWDASTKQLRTEKSYVDIAEKSARDSQ 117
Query: 65 NGQIGIIDPDCRLIGLHLYDGLFKVIPF-----------DNK-GQLKEAFNIRLEELQVL 112
G IDP R + L Y+G+ V+P DN+ G+L++ +R+ EL V
Sbjct: 118 TGDRVHIDPTSRFLSLECYEGVINVLPIAHAGKGKRKAADNEIGELQDPIPVRIPELFVR 177
Query: 113 DIKFLYG------CAKPTIVVLYQDNKDARHVKTYEVALKDK----------DFVEGPWS 156
F++ A P + VL++D+ + +K E+ + +G +
Sbjct: 178 STCFVHKRQAGSKLANPELAVLWEDSTNKVRLKVRELEFTPSLRPAEEPPTAELEKGKDA 237
Query: 157 QNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIRP----SITKAYGRVDAD 212
Q ++ GA LIP+PPP+ G+L++GE I Y ++ +P +I A+ VD
Sbjct: 238 QGEIELGASHLIPLPPPMYGMLVVGETRIAYVDEWEYRITDTQPLDEATIFVAWCAVDEQ 297
Query: 213 GSRYLLGDHAGLLHLL-VITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDS 271
RY+L D G L+LL V+ + + G ++++LG+TS A+T+ YLD +++GS GDS
Sbjct: 298 --RYVLADDYGKLYLLFVLQNNTGEYAGHRLDILGQTSRANTLVYLDAGRIFVGSHQGDS 355
Query: 272 QLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQ-----------GQGQVVTCSG 320
Q+I+++ Q +EV++ + N+ PI+DF ++D+ + GQ ++VT SG
Sbjct: 356 QVIQISEQS------MEVVQTFANIAPILDFTIMDMGNRSSDAPVNEFSSGQARIVTGSG 409
Query: 321 AYKDGSLRIVRNGIGINEQASVELQG--IKGMWSLRSSTDDPFDTFLVVSFISETRILAM 378
AYKDGSLR VR+G+G+ + S+ G + M+SLRSS + F LVV F+S TR+
Sbjct: 410 AYKDGSLRSVRSGVGLEDVGSIGEMGAPVSAMFSLRSSPESHFVDTLVVGFVSYTRVFRF 469
Query: 379 NLEDELEETE-IEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGY 437
+++ E+EE + + GF +L+ + ++VQVT +V + + +R + + W P G
Sbjct: 470 DVDGEVEEVDHLGGFELAAASLYAGNTSDGRIVQVTGSAVIVSHTETRTITSSWSLPDGR 529
Query: 438 SVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQL------EYEISCLDINPIGEN 491
S+ A +L++ GG LV L++ + + +E A+ E ++SC+ + +
Sbjct: 530 SITAVAAEGDSLLVSIGGAELVVLDLSN-VSSERLEARTRRTFESEEQVSCIAL-----S 583
Query: 492 PSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEI----IPRSVLLCAF--EGISYLL 545
+ + VG W + V SL DL I E + +PRSV+L + L
Sbjct: 584 KAVRDVCVVGFWQESRVAFLSLHDLQPIATERVADSFDTSAVPRSVVLANILQDAPPTLF 643
Query: 546 CALGDGHLLNFLLNMKTGELTDRKKVSLGTQP--ITLRTFSSKNTTHVFAASDRPTVIYS 603
L DG+++ + + T RK LGTQ TL +VFA + P++IY
Sbjct: 644 VGLADGNVVTYTVQSPQQPFTSRKSTILGTQQANFTLLPRGDGVLDNVFATCEHPSLIYG 703
Query: 604 SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIA-KEGELTIGTIDDIQKLHIRSIPLGEH 662
+ +YS V + +C F+S A+ ++AIA +EGEL + +D+ + H++++ +GE
Sbjct: 704 QEGRTVYSAVTAETAQSICSFDSEAYSGAIAIATEEGELKLAMVDEERTTHVQTLRVGET 763
Query: 663 PRRICHQEQSRTFAICSLKN--QSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSIL 720
RRI + + + F + ++K ++ EE F +L+D+ F+ + T L+ E ++
Sbjct: 764 VRRIAYSTELKAFGLGTIKRVLRAGVEEVTSSF-KLVDEVAFQELHTVALNEDELVECVM 822
Query: 721 SCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-IVEDGKLQLIAEKETKGAVY 774
C D S + VGTAY+ + TKGRILV + E+ +L+++AE KGA
Sbjct: 823 RCQLDDGSGTGETAERFVVGTAYLDDAPQQQTKGRILVLEVTEERRLKVVAELGLKGACR 882
Query: 775 SLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLM 834
L G+++AA+ + + +Y + L + + + + G I V DLM
Sbjct: 883 CLAVVLGRIVAALVKTVVIYALEYQTPSHPFLVKKAAYRTSTAPIDICVTGSTIAVTDLM 942
Query: 835 KSISLLIYKHEEGAI----EERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEG 890
KS+SL+ YK G + E AR Y W +A+ + ++ YL A+ NL ++ G
Sbjct: 943 KSVSLVSYKPGRGGVPDTLSEIARHYETLWGTAIANVAENTYLEADAEGNLVVLQHEVNG 1002
Query: 891 ATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLP 950
+DE+R RL V E LGE VNR R S+ P + +P TV G I + A +
Sbjct: 1003 YSDEDRRRLRPVSEMLLGEMVNRIRSISV---QPTATAVVVPRAFLATVEGSIYLFALIS 1059
Query: 951 HEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAK---NFLDGDLIESFLDLS 1007
+ L +LQ L + +K G + +WR F ++ + + + F+DG+L+E +L+
Sbjct: 1060 PGKQDLLMRLQALLAERVKSPGHVPFAKWRGFRSQVRDMGGEGPTRFVDGELVERYLEAP 1119
Query: 1008 RTRMDEISKTMNVSVEELCKRVEELTRLH 1036
+++ + VEEL VE L R+H
Sbjct: 1120 VEVQVDVASELGREVEELRGMVEGLRRMH 1148
>gi|398391687|ref|XP_003849303.1| hypothetical protein MYCGRDRAFT_87400 [Zymoseptoria tritici IPO323]
gi|339469180|gb|EGP84279.1| hypothetical protein MYCGRDRAFT_87400 [Zymoseptoria tritici IPO323]
Length = 1143
Score = 473 bits (1218), Expect = e-130, Method: Compositional matrix adjust.
Identities = 326/1101 (29%), Positives = 547/1101 (49%), Gaps = 87/1101 (7%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPTD 64
+YG+ L RP D LF+ T+RY + L WDAE EL T ++ D++++ R +
Sbjct: 60 LYGKATLLHKLRPATSPTDHLFVGTDRYNYLTLSWDAELKELKTEKSYVDIAEKAARDSQ 119
Query: 65 NGQIGIIDPDCRLIGLHLYDGLFKVIPF------------DNKGQLKEAFNIRLEELQVL 112
G IDP R + L Y+G+ ++P D G+L + +R+ EL V
Sbjct: 120 TGDRVHIDPTSRFMTLECYEGVVNILPIAHAGKGKRKAADDEIGELADPIQVRIPELFVR 179
Query: 113 DIKFLY------GCAKPTIVVLYQDNKDARHVKTYEVAL----------KDKDFVEGPWS 156
FL+ AKP ++Y+D+++ +K E+ D EG
Sbjct: 180 STCFLHRRQAGAKHAKPAFALMYEDSQNKVRIKVRELEYGQSLRPSEEPSTADLDEGKEV 239
Query: 157 QNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVD---ADG 213
L+ GA LIP+PPP+ G+L++GE +I Y + +P + +A VD D
Sbjct: 240 SGQLELGASHLIPLPPPMYGMLVVGETSISYVEEFEHQVKETQP-LEEATVFVDWCQIDD 298
Query: 214 SRYLLGDHAGLLHLLVITHEKE-KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQ 272
RY L D G L+LL+I + + +I++LGETS AST+ YLD V++GS GDSQ
Sbjct: 299 QRYALADDYGKLYLLMIQQTSNGEYSSHQIDILGETSRASTLVYLDEGRVFVGSHQGDSQ 358
Query: 273 LIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQ-----------GQGQVVTCSGA 321
+I++ Q +EVL+ + N+ PI+DF V+D+ + GQ ++VT SGA
Sbjct: 359 IIQILPQK------IEVLQTFPNIAPILDFAVMDMGNRSSDAPVNEFSSGQARIVTGSGA 412
Query: 322 YKDGSLRIVRNGIGINEQASV-ELQG-IKGMWSLRSSTDDPFDTFLVVSFISETRILAMN 379
+KDGSLR VR+G+G+ ++ + L I +++LRS LVVSF+S TR+
Sbjct: 413 FKDGSLRSVRSGVGLEDRGDLGSLDAPISSLFALRSRPAPAVVNTLVVSFVSHTRVFVFT 472
Query: 380 LEDELEETE-IEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYS 438
E ++EE E GF TL + +++Q+T+ SV L + + W+SP G S
Sbjct: 473 AEGDVEEQEQYRGFELGESTLHAGNLADGRVIQITNSSVVLADQEGDMVTDRWQSPSGTS 532
Query: 439 VNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEY----EISCLDINPIGENPSY 494
+ +AN + +L++ G L+ L++ + HAQ E+ ++SC+ ++P S
Sbjct: 533 ITAVSANGNNILVSLKGASLLVLDVSGSTIK--VHAQREFGSDEQVSCIALSP-----SI 585
Query: 495 SQIAAVGMWTDISVRIFSLPDLNLITKEHLGGE---IIPRSVLLCA-FEG-ISYLLCALG 549
G W D V L L E + E +PRS+ + EG + L L
Sbjct: 586 PTQCVAGFWKDGKVAFLDLETLKTQASETVTEEDSLAVPRSLTVAKILEGQPATLFVGLA 645
Query: 550 DGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKN-TTHVFAASDRPTVIYSSNKKL 608
DG+++ + + + T RK + LGTQ N +VFA + P++IY S ++
Sbjct: 646 DGNVVTYSIQSASNPFTARKSIILGTQQANFAVLPRDNGLENVFATCEHPSLIYGSEGRI 705
Query: 609 LYSNVNLKEVSHMCPFNS-AAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRIC 667
++S V + + +C F+S + +++AIA EL I +D+ + H++ + L + RRI
Sbjct: 706 VFSAVTAENATCICSFDSFEPYGNAIAIASNDELKIAVVDEERTTHVQDLFLNQTVRRIA 765
Query: 668 HQEQSRTFAI-CSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSD 726
+ + F + C + S E + +L+D+ F+ + +Y L+ E S++ C D
Sbjct: 766 YSADMKAFGLGCIRRTLSAGREEVVSCFKLVDEIAFKELDSYQLNEDELVESVIRCKLDD 825
Query: 727 DSN---VYYCVGTAYVLPEENEPTKGRILVF-IVEDGKLQLIAEKETKGAVYSLNAFNGK 782
S + +GTAY+ ++ KGRILV + ED +L+L+ E +GA L +G+
Sbjct: 826 GSGGDAERFVIGTAYLDDQDASNAKGRILVLEVTEDRRLKLVTEISVRGACRCLAVSHGR 885
Query: 783 LLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIY 842
++AA+ + + +Y + + + + + + + GD I V DLMKS+SL+ +
Sbjct: 886 IVAALIKTVIIYSFEYETPSSPAMVKKAAYRTSTAPIDMCVTGDIIAVTDLMKSMSLVQH 945
Query: 843 KHEEGA----IEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGR 898
+ + E AR ++ W +AV +D++IYL ++ NL + + +G ++E+R R
Sbjct: 946 TLGQAGGPDNLTEVARHFDTLWGTAVANVDENIYLESDAEGNLVVLEHDVKGFSEEDRRR 1005
Query: 899 LEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLE 958
L V E LGE VNR R + P + IP TV G I + A + + L
Sbjct: 1006 LRVTSEILLGEMVNRIRRIDVS---PTPNATVIPRAFLATVEGSIYLFALIAEGKQDLLI 1062
Query: 959 KLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAK---NFLDGDLIESFLDLSRTRMDEIS 1015
++Q + ++++ G + ++R F + + + + F+DG+LIE FLD E++
Sbjct: 1063 RMQNKMAEMVQSPGHVPFAKFRGFKTQVRDMGEEGPSRFVDGELIERFLDCDEDVQAEVA 1122
Query: 1016 KTMNVSVEELCKRVEELTRLH 1036
K + VEEL VE L R+H
Sbjct: 1123 KELGGDVEELKIMVEGLKRIH 1143
>gi|242803623|ref|XP_002484212.1| UV-damaged DNA binding protein, putative [Talaromyces stipitatus ATCC
10500]
gi|218717557|gb|EED16978.1| UV-damaged DNA binding protein, putative [Talaromyces stipitatus ATCC
10500]
Length = 1140
Score = 459 bits (1182), Expect = e-126, Method: Compositional matrix adjust.
Identities = 341/1104 (30%), Positives = 549/1104 (49%), Gaps = 91/1104 (8%)
Query: 6 IYGRIATL-ELFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPT 63
+Y +I+ L L P D L + T++ + L+WD+E + T R+ D++D+ RP
Sbjct: 55 LYAKISMLARLPTPTNSTTDHLLVGTDQNTYFTLKWDSEKQRVRTERSYVDLADKASRPC 114
Query: 64 DNGQIGIIDPDCRLIGLHLYDGLFKVIP------------------FDNK-------GQL 98
NG +IDP R + L +++G+ VIP + N G+L
Sbjct: 115 QNGDRCLIDPSGRFMTLEMFEGIITVIPIIQPHKKRGKPPVLKTSHYSNPDEPVPQIGEL 174
Query: 99 KEAFNIRLEELQVLDIKFLY--GCAKPTIVVLYQDNKDARHVKTYEV---ALKDKDFVEG 153
E R++EL V FL+ A P + +L++DN+ +K E+ A + F E
Sbjct: 175 GEPMPTRIDELMVRSSAFLHVESKAAPRLALLHEDNQRKVRLKIRELHFEASTEVVFQET 234
Query: 154 PWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIRP--SITKAYGRVDA 211
LD GA LIPVP PL G+L++GE I Y + I RP T V
Sbjct: 235 EDFTEELDLGASHLIPVPAPLGGLLVLGETCIKYIDDANNETIS-RPLDEATIFVAWVQV 293
Query: 212 DGSRYLLGDHAG-LLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGD 270
DG R+LL D G L L+++ + +V G KI+ LG S AS + YL + +IGS GD
Sbjct: 294 DGQRWLLADDYGRLFFLMLVLDSRNEVEGWKIDYLGSASRASVLIYLGAGMTFIGSHQGD 353
Query: 271 SQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQ-----------GQGQVVTCS 319
SQ+I++ ++GS+ E+++ N+ PI+DF ++DL + GQ ++VT S
Sbjct: 354 SQVIRI-----SEGSF-EIIQTISNIAPILDFTIMDLGTREGDNYTHEFSSGQARIVTGS 407
Query: 320 GAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAM 378
GA+ DG+LR VR+G+G+ E + E++ I MW+L+ S+ D F LVV+F++ETRI
Sbjct: 408 GAFNDGTLRSVRSGVGMEELGVLGEMEHITDMWALQVSSTDEFSDTLVVTFVNETRIFQF 467
Query: 379 NLEDELEE-TEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGY 437
+ + E+EE E G TL ++ +++ VT V + + S + ++W SP
Sbjct: 468 SSDGEVEELDEFLGLNLAENTLLSNNLPGGRIIHVTERGVSIADTDSGMVISKW-SPDEQ 526
Query: 438 SVNVATANASQVLLATGGGHLVYLEI-GDGILTEVKHAQLEYEISCLDINPIGENPSYSQ 496
++ A N ++++ TGG +V L+I GD + K + ++S + I P + +Q
Sbjct: 527 TITSAACNDERLVVVTGGQVIVVLDITGDLKVLSQKDFDKDNQVSGVTIPP-----ASTQ 581
Query: 497 IAAVGMWTDISVRIFSLPDLNLITKEHLG--GEIIPRSVLLCAF--EGISYLLCALGDGH 552
V + S D + + LG E PR+VLL + S L ++ DG
Sbjct: 582 AFIAAFPQKAQVAVLSFQDFKELHSQSLGIASEAFPRAVLLAEILEDSPSTLFVSMADGS 641
Query: 553 LLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTH-VFAASDRPTVIYSSNKKLLYS 611
++ F + LT + K+ LG++ T + N + VFA + P++IY S +++YS
Sbjct: 642 VVTFFYDSDNHSLTSKNKLILGSEQPTFKKLPRGNGLYNVFATCEHPSLIYGSEGRIIYS 701
Query: 612 NVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQ 671
VN + S +C FN+ A+PD++A+A +L I +D + I+++P+ RR+ +
Sbjct: 702 AVNSEGASRVCHFNAEAYPDAIAVATSKDLKIALVDKERTTQIQTLPIEATVRRVAYSPT 761
Query: 672 SRTFAICSLKN--QSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSN 729
+ F I ++K Q AE E FV L D+ F + ++ L E S++ S+
Sbjct: 762 EKAFGIGTIKRRLQDGAEIVESQFV-LADEIMFRKLDSFDLKPDELVESVIRAQMVVGSD 820
Query: 730 VY--------YCVGTAYVLPEENEPTKGRILVFIVEDG-KLQLIAEKETKGAVYSLNAFN 780
Y + VGTAY+ E E +GRIL+F V+ KL L E KGA +L
Sbjct: 821 AYNKPIYKDRFIVGTAYLDDETAESIRGRILLFEVDSNRKLSLFLEHPVKGACRALAMMG 880
Query: 781 GKLLAAINQKIQLYKWMLRDD-GTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISL 839
K++AA+ + + ++ + G L+ + + + IVV DLMKSIS
Sbjct: 881 NKIVAALVKTVVIFDVERKSQLGKHALKKVAAYRTSTAPVDIAVTDSTIVVADLMKSIS- 939
Query: 840 LIYKHEEGAI----EERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEE 895
++ H+ A+ +E AR + W +AV + + +L ++ NL +R+N +G T+E+
Sbjct: 940 IVESHKTDALTVEAKEVARHFATVWTTAVADIGSNQWLVSDAEGNLIVLRRNVDGVTEED 999
Query: 896 RGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYL 955
R RLEV E LGE VNR R ++ L S V P GTV G I + A + E
Sbjct: 1000 RRRLEVTSELLLGEMVNRIRPVNI---LQTSTVAVNPKAFLGTVEGSIYLFALINPEHQD 1056
Query: 956 FLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAK-NFLDGDLIESFLDLSRTRMDEI 1014
FL +LQT + + G + ++R+F + + D F+DG+LIE FLD R +EI
Sbjct: 1057 FLMRLQTAITAYVDSPGYMPFSKFRAFRSSVREGDEPFRFVDGELIERFLDCDRPVQEEI 1116
Query: 1015 SKTM--NVSVEELCKRVEELTRLH 1036
+ +E + K +E L RLH
Sbjct: 1117 LGVVGSGYDLESVQKMIEALRRLH 1140
>gi|453081643|gb|EMF09692.1| DNA damage-binding protein 1 [Mycosphaerella populorum SO2202]
Length = 1151
Score = 457 bits (1175), Expect = e-125, Method: Compositional matrix adjust.
Identities = 318/1090 (29%), Positives = 555/1090 (50%), Gaps = 91/1090 (8%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPTD 64
IYG++ L+ RP D LF+ T+RY + L WDA++ L T ++M D++D+ R +
Sbjct: 60 IYGKVTMLQKLRPATSPTDHLFVGTDRYNYFTLSWDADTRRLKTEKSMVDIADKSARDSQ 119
Query: 65 NGQIGIIDPDCRLIGLHLYDGLFKVIPF-----------DNK-GQLKEAFNIRLEELQVL 112
G IDP R + L Y+G+ V+P DN+ G+L + +R+ EL V
Sbjct: 120 TGDRVHIDPTARFMTLECYEGVVNVLPIAHAGKGKRRAADNEIGELGDPIPVRIPELYVR 179
Query: 113 DIKFLYG------CAKPTIVVLYQDNKDARHVKT----YEVALKDKD------FVEGPWS 156
FL+ A P VL++D+++ +K Y+ +L+ + G
Sbjct: 180 SSCFLHKRHGGTKIADPVFAVLHEDSQNMPRIKIRELEYQPSLRPNEEPATAELERGQEV 239
Query: 157 QNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIRP----SITKAYGRVDAD 212
Q ++ GA LIP+P P G LIIGE +I Y ++ + +P +I A+ ++D
Sbjct: 240 QGKIEMGASHLIPLPAPTYGFLIIGETSISYVDEWKYEIVDTQPLDEATIFVAWCQIDE- 298
Query: 213 GSRYLLGDHAGLLHLLVITHEKE--KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGD 270
R++L D G L+LL++ H+K + +I++LGETS AST+ YLD V++GS GD
Sbjct: 299 -QRFVLADDYGKLYLLMV-HQKADGEYQSQQIDVLGETSRASTLVYLDEGRVFVGSHQGD 356
Query: 271 SQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQ-----------GQGQVVTCS 319
SQ+I++ Q +EVL+ + N+ PI+DF V+D+ + GQ ++VT S
Sbjct: 357 SQIIQILPQ------RIEVLQTFSNIAPILDFTVMDMGNRSADAPVNEFSSGQARIVTGS 410
Query: 320 GAYKDGSLRIVRNGIGINEQASVELQG--IKGMWSLRSSTDDPFDTFLVVSFISETRILA 377
GA+KDGSLR VR+G+G+ ++ S+ G I ++SLRS L+ SF+S + +
Sbjct: 411 GAFKDGSLRSVRSGVGLEDKGSLGDLGEPISAVFSLRSGPGVQAVDTLIASFVSHSSAII 470
Query: 378 MNLEDELE-ETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPG 436
+ ++E E GF TL+ + + VQVTS +V L + + + W++P G
Sbjct: 471 FGSDGDIEARDEFRGFDLTQSTLYAGELPNGRAVQVTSSTVLLTDTEGDMITDRWEAPDG 530
Query: 437 YSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQL--EYEISCLDINPIGENPSY 494
S+ +A+ VL++ G L+ L++ +T H QL E +ISCL + + S
Sbjct: 531 SSITSVSADGDHVLVSLGSAALIVLDLSGSSITVGAHRQLGNEEQISCLSL-----SRSL 585
Query: 495 SQIAAVGMWTDISVRIFSLPDLNLITKEHLGGE---IIPRSVLLCAF--EGISYLLCALG 549
VG W D V + SL DL I E + + +PRS+++ + + + L L
Sbjct: 586 PGACVVGFWKDSKVSVLSLSDLQPIASESVAEDDNLAVPRSLIVASILRDQPATLFVGLA 645
Query: 550 DGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFS-SKNTTHVFAASDRPTVIYSSNKKL 608
DG+++ + + + RK + LGTQ S +VFA + P++IY S ++
Sbjct: 646 DGNVVTYSVESLQRPFSARKSIILGTQQANFTVLPRSDGLQNVFATCEHPSLIYGSEGRM 705
Query: 609 LYSNVNLKEVSHMCPFNS-AAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRIC 667
+YS V + +C F+S + +S+AIA EL + ++D+ + H++ +P+ E RRI
Sbjct: 706 VYSAVTADSATSICAFDSFGDYANSIAIATGSELKLSSVDEERTTHVQDLPVYETVRRIA 765
Query: 668 HQEQSRTFAICSLKNQSCA--EESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFS 725
+ + + F + +K A EE HF +L+D+ F+ + ++ L+ E S++ C
Sbjct: 766 YSSELKAFGLGCIKRTLAAGVEEVRSHF-KLVDEVAFKALDSWALNEDELVESVIRCPLD 824
Query: 726 DDSNV---YYCVGTAYVLPEENEPTKGRILVF-IVEDGKLQLIAEKETKGAVYSLNAFNG 781
D + + + VGTAY+ ++ +GR+LVF + ED +++L+ E KGA L G
Sbjct: 825 DGTGLDAERFVVGTAYLDDQDANTARGRVLVFEVTEDRRIKLVTEMAVKGACRCLAVCKG 884
Query: 782 KLLAAINQKIQLYKWMLR-DDGTRELQSECGHHGHI--LALYVQTRGDFIVVGDLMKSIS 838
+++AA+ + + + + + +L + + + ++ + I + DLMKS++
Sbjct: 885 RIVAALVKTVVILAYEFSPPKSSPQLIKKASYRTSTAPIDIFASSLDGLIAISDLMKSLT 944
Query: 839 LLIYKH----EEGAIEERARDYNANWMSAVE-ILDDDIYLGAENNFNLFTVRKNSEGATD 893
L+ Y + ++ E AR ++ W +AV I Y+ ++ NL + + G +
Sbjct: 945 LVKYTPGRTGQPDSLVEIARHFDTLWGTAVAPIPGTHSYIQSDAEGNLVVLEHDPTGFSA 1004
Query: 894 EERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQ 953
E+R RL V E LGE VNR R + V+ P ++ IP TV G + V ++ +
Sbjct: 1005 EDRRRLRVTSEMCLGEMVNRIRPITTVI-TPSANAVVIPKAFIATVEGSVYVFGTIAQQY 1063
Query: 954 YLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAK---NFLDGDLIESFLDLSRTR 1010
L +LQ ++ +++K G + ++R F + + + + F+DG++IE FL LS
Sbjct: 1064 QDLLIRLQGSMAEMVKSPGFVRFNRFRGFKTQVRDMGEEGPVRFVDGEIIEGFLGLSAEV 1123
Query: 1011 MDEISKTMNV 1020
+ ++K + V
Sbjct: 1124 QESVAKDLGV 1133
>gi|396465264|ref|XP_003837240.1| similar to DDB1B (Damaged DNA Binding Protein 1 B); damaged DNA
binding / protein binding [Leptosphaeria maculans JN3]
gi|312213798|emb|CBX93800.1| similar to DDB1B (Damaged DNA Binding Protein 1 B); damaged DNA
binding / protein binding [Leptosphaeria maculans JN3]
Length = 1089
Score = 456 bits (1173), Expect = e-125, Method: Compositional matrix adjust.
Identities = 321/1107 (28%), Positives = 555/1107 (50%), Gaps = 102/1107 (9%)
Query: 13 LELFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPTDNGQIGII 71
L+ RP D LF+ T+R+ + L WD++ +L T ++ V+D R + G+ +
Sbjct: 2 LQKLRPALSPTDHLFVGTDRFMYFTLSWDSDKKQLQTEKSFASVADNAARESQTGERCHV 61
Query: 72 DPDCRLIGLHLYDGLFKVIPFDNKGQ----------LKEAFNIRLEELQVLDIKFLYGCA 121
DP R + + +Y+G+ VIP +G+ L E +RL E+ V FL +
Sbjct: 62 DPTGRFMTVEVYEGIMTVIPLVQRGKKRKQEPDIAHLGEPQPVRLSEMFVRSSAFLRPRS 121
Query: 122 ---KPTIVVLYQDNKDARHVKTYEVALKDKDFVE---GPWSQNNLDNGADLLIPVPPPLC 175
KP I +LY+D+ +K E+ D VE G + L+ G+ LIP+ P
Sbjct: 122 FDDKPKIALLYEDSHSQVKLKLRELTFAGGDAVELEEGETCRAELELGSSHLIPLEEPTH 181
Query: 176 GVLIIGEETIVYC--SANAFKAIPIR-PSITKAYGRVDADGSRYLLGDHAGLLHL-LVIT 231
G+++I E +I Y + + P+ +I A+ R+DA R++L D G L++ ++I
Sbjct: 182 GLIVIAETSIGYYDDESGELQTEPLEEATIFVAWERIDA--QRFVLADDYGRLYMFMLIL 239
Query: 232 HEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLE 291
+ KV K++++G+TS AS + YLD V++GS GDSQ+IK+ + +E+++
Sbjct: 240 DARHKVRSWKLDIIGKTSRASVLVYLDAGYVFVGSHQGDSQVIKIAERS------MEIVQ 293
Query: 292 RYVNLGPIVDFCVVDL-ERQGQGQ----------VVTCSGAYKDGSLRIVRNGIGINEQA 340
+ N+ PI+DF ++D+ R G+GQ +VT SGAY+DGSLR VR+G+G+ +
Sbjct: 294 TFSNIAPILDFTIMDMGNRSGEGQTNEYSSGQARIVTGSGAYQDGSLRSVRSGVGLEDLG 353
Query: 341 SV-ELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCS---QT 396
+ E++ I ++SL+S + L+V+F++E+R+ + + E+EE +E F S +
Sbjct: 354 VLGEMEHISNLFSLKSDATAQYADTLLVTFVNESRVFKFDPQGEVEE--VEEFASLALEE 411
Query: 397 QTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGG 456
TL + ++VQVTSG R+ + +EW ++ A+AN + VL++ GG
Sbjct: 412 TTLVAANISQGRVVQVTSGRARICDLDGGMIVSEWMPMGDQTITAASANDTHVLISLGGV 471
Query: 457 HLVYLEIGDGI-LTEVKHAQLEYEISCLDINPIGENPS-YSQIAAVGMWTDISVRIFSLP 514
+V L + DG+ + + K E +++C+ + PS S + +G W + + I SL
Sbjct: 472 TVVSLNMTDGLQVAQQKTFGTESQVACVAL------PSDSSSVCFLGFWMNSQLAICSLD 525
Query: 515 DLNLITKEHLGGEIIPRSVLLCAF--EGISYLLCALGDGHLLNFLLNMKTGELTDRKKVS 572
L I + E +PRS+LL + L A+ DG+++ + + + EL +K +
Sbjct: 526 TLETIKTVTISDESVPRSLLLTQVFPDQPPTLFAAMADGNVITYTFDPSSYELLGKKSIV 585
Query: 573 LGTQPITLRTFSSKNTTH-VFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPD 631
LGT+ T R + + VFA + P++IY+S +L+YS V ++ + +CPF+S A+P
Sbjct: 586 LGTREATFRALPRGDGIYNVFATCEHPSLIYASEGRLVYSAVTAEKATAVCPFDSEAYPG 645
Query: 632 SLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKN-QSCAEESE 690
S+AIA +L I +D + H++++ + E RRI + + F + ++K EE
Sbjct: 646 SVAIATSEDLRIALVDTERTTHVQTLKVDETVRRIAYSPSLKAFGLGTIKRILKNGEEIM 705
Query: 691 MHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSN---VYYCVGTAYVLPEENEPT 747
+ +L+D+ F+ + TY L+ E ++ C +D S + VGTAY+ +
Sbjct: 706 LSHFKLVDEIQFKELDTYALNEEELVECVMRCELADGSGGLAERFVVGTAYLDDQNATAE 765
Query: 748 KGRILVF-IVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTREL 806
+GRIL+ + + L+L+ E KG L GK++AA+ + I LY + L
Sbjct: 766 RGRILILEVTPERVLKLVTELAVKGGCRCLAMCEGKIVAALIKTIVLYDVEFKTQSKPTL 825
Query: 807 QSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEER----ARDYNANWMS 862
+ + G I + DLMKS++++ Y+ EG + ++ AR Y W +
Sbjct: 826 VKAATFRCSTAPIDITVNGPVITIADLMKSLAMVKYQKGEGGLPDKLVELARHYQVTWAT 885
Query: 863 AVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMR 922
AV +D + +L ++ NL + +N EG TD+++ + + E LGE VNR R V
Sbjct: 886 AVAEIDTETFLESDAEGNLMVLSRNVEGVTDDDKRHMLISSEMLLGEMVNRIRRID-VQT 944
Query: 923 LPDSDVGQIPTVIFGTVN--------------------------GVIGVIASLPHEQYLF 956
PD+ V IP GTV G I + +
Sbjct: 945 APDAVV--IPRAFVGTVRISPLLSSLPSSYPNNPPTNHQHTQVEGSIYLFGLIAPSHLHL 1002
Query: 957 LEKLQTNLRKVIKGVGGLNHEQWRSFNNE--KKTVDAKNFLDGDLIESFLDLSRTRMDEI 1014
L LQ+NL ++ G ++ ++R+F N+ + + K F+DG+L+E FLDL +
Sbjct: 1003 LMTLQSNLAALVPAPGNMDFAKFRAFRNQVRQDEEEPKRFVDGELVERFLDLGAEAQAKA 1062
Query: 1015 SKTMNVS-----VEELCKRVEELTRLH 1036
+ + + VE + VE L RLH
Sbjct: 1063 VQGLRLGEAELDVEAVRGLVEGLRRLH 1089
>gi|350629921|gb|EHA18294.1| damage-specific DNA binding protein [Aspergillus niger ATCC 1015]
Length = 1140
Score = 450 bits (1158), Expect = e-123, Method: Compositional matrix adjust.
Identities = 334/1105 (30%), Positives = 554/1105 (50%), Gaps = 93/1105 (8%)
Query: 6 IYGRIATL-ELFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPT 63
++ ++ L L P D LF+ T+RY +C L WD E +++ T R D+SD R
Sbjct: 55 LFAKVTMLARLPAPANSPTDHLFVGTDRYTYCTLSWDGERNQIRTERNYVDISDPSSREA 114
Query: 64 DNGQIGIIDPDCRLIGLHLYDGLFKVIPF--------------------DNKGQLKEAFN 103
G +IDP R + L +Y+G+ V+P G+L E
Sbjct: 115 QTGNRCLIDPSGRFMTLEVYEGVIAVVPIVQLPSKKRGRQVAPPSGPDAPRVGELGEPTT 174
Query: 104 IRLEELQVLDIKFLY-GCAKPTIVVLYQDNKDARHVKT----YEVALK----DKDFVEG- 153
R++EL V FL+ P + +LY+DN+ +K Y A D F E
Sbjct: 175 ARIDELFVRSSAFLHVQSGPPRLALLYEDNQKKVRLKVRALHYSAATASTGADAAFEESL 234
Query: 154 -PWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIRP----SITKAYGR 208
+SQ LD GA LIPVP PL G+L++GE +I Y ++ + + RP +I A+ +
Sbjct: 235 DGFSQE-LDLGASHLIPVPAPLGGLLVLGETSIKYVDTDSNEIVS-RPLDEATIFVAWEQ 292
Query: 209 VDADGSRYLLGDHAG-LLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSS 267
VD+ R+LL D G L L+++ +V K++ LG T+ AS + YL V+++GS
Sbjct: 293 VDS--QRWLLADDYGRLFFLMLVLDSNNQVQSWKLDHLGNTARASVLIYLGGGVIFVGSH 350
Query: 268 YGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDL-----ERQ------GQGQVV 316
GDSQ++++ GS +EV++ N+ PI+DF ++DL E Q GQ ++V
Sbjct: 351 QGDSQVLRI-----GNGS-LEVIQTLSNIAPILDFTIMDLGNRTSESQTHEFSSGQARIV 404
Query: 317 TCSGAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLRSSTDDPFDTFLVVSFISETRI 375
T SGA+ DG+LR VR+G+G+ E + E++ I ++ L+ +T + L+V+F+ ETR+
Sbjct: 405 TGSGAFDDGTLRSVRSGVGMEELGVLGEMELITDLFGLQLATKGGYLDTLLVTFVDETRV 464
Query: 376 LAMNLEDELEETE-IEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSP 434
N + E+EE + G TL + +++QVT V + + NEW P
Sbjct: 465 FQFNFDGEVEELDSFLGLSLSENTLLAMNLPGGRILQVTEQRVLIADIEGGMVTNEWTPP 524
Query: 435 PGYSVNVATANASQVLLATGGGHLVYLEIGDGI-LTEVKHAQLEYEISCLDINPIGENPS 493
+ A+AN ++L +GG + L+I + + + K + +IS + + P+ S
Sbjct: 525 DNLVITSASANNDSIVLVSGGQLMTVLDINNDVRVISQKDFGADSQISGVTV-PL----S 579
Query: 494 YSQIAAVGMWTDISVRIFSLPDLNLITKEHLG--GEIIPRSVLLCAF--EGISYLLCALG 549
+ + VG V + L L+ + LG GE PRSVL+ S L ++
Sbjct: 580 SAGVCIVGFPQLAKVSVLDLGRLSELHTTSLGPAGEAFPRSVLVADVLANSPSTLFISMA 639
Query: 550 DGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTH-VFAASDRPTVIYSSNKKL 608
DG ++ + + LT ++ LG++ T + + + VFA + P++IY S ++
Sbjct: 640 DGSVITYSFDASNYSLTGMNRLILGSEQPTFKKLPRGDGLYNVFATCENPSLIYGSEGRI 699
Query: 609 LYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICH 668
+YS VN + S +C FNS A+P S+A+A +L I +D + I+++P+GE RR+ +
Sbjct: 700 IYSAVNSEGASRVCHFNSEAYPGSIAVATRHDLKIALVDKERTTQIQTLPMGETARRVAY 759
Query: 669 QEQSRTFAICSL--KNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFS- 725
+ F I ++ K + AE + FV L D+ F + + L E S++ FS
Sbjct: 760 SPSEKAFGIGTIERKLKDGAEMVQSRFV-LADEIMFRRLDAFDLRPEELVESVIRAEFSA 818
Query: 726 ----DDSNVY---YCVGTAYVLPEENEPTKGRILVFIVEDG-KLQLIAEKETKGAVYSLN 777
+ +V+ + VGTAY+ E E +GRILVF +++G KL +AE KGA +L
Sbjct: 819 GKDENGRDVFKDRFVVGTAYLDDENEESIRGRILVFEIDNGRKLTKVAELPVKGACRALA 878
Query: 778 AFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSI 837
K++AA+ + + +Y + D G +L+ + + V G+ I V DLMKS+
Sbjct: 879 MLGEKIVAALVKTVVIYGVVNNDFGAMKLEKLASYRTSTAPVDVTVTGNVIAVADLMKSV 938
Query: 838 SLLIYKHEEG----AIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATD 893
L+ Y E ++ E AR + W + V + D +L + NL +R+N G +
Sbjct: 939 CLVEYSEGENGMPDSLTEVARHFQTVWATGVSCIAKDTFLETDAEGNLIVLRRNLTGVEE 998
Query: 894 EERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQ 953
+++ RLEV GE LGE VNR R ++ + V P GTV G I + A + E
Sbjct: 999 DDKRRLEVTGEISLGEMVNRIRPVNIQQL---ASVTVTPRAFLGTVEGSIYLFAIINPEH 1055
Query: 954 YLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNE-KKTVDAKNFLDGDLIESFLDLSRTRMD 1012
FL +LQ + ++ +G + ++R F + ++T + F+DG+LIE FL + +
Sbjct: 1056 QDFLMRLQATMAGKVESLGNIPFNEFRGFRSMVRETKEPYRFVDGELIERFLTCEPSLQE 1115
Query: 1013 EISKTMN-VSVEELCKRVEELTRLH 1036
EI ++ ++V+E+ +E L RLH
Sbjct: 1116 EIVDSVGMMNVDEVKIMIEALRRLH 1140
>gi|452979181|gb|EME78944.1| hypothetical protein MYCFIDRAFT_43692 [Pseudocercospora fijiensis
CIRAD86]
Length = 1149
Score = 449 bits (1156), Expect = e-123, Method: Compositional matrix adjust.
Identities = 325/1111 (29%), Positives = 547/1111 (49%), Gaps = 98/1111 (8%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPTD 64
IYG++ L RP D LF+ T+RY + L WD E +L T ++M D++D+ R +
Sbjct: 57 IYGKVTLLHKLRPATSQTDHLFVGTDRYNYFTLSWDGERQDLKTEKSMVDIADKAARDSQ 116
Query: 65 NGQIGIIDPDCRLIGLHLYDGLFKVIPF-----------DNK-GQLKEAFNIRLEELQVL 112
G IDP R + L +Y+G+ V+P DN+ G+L + +R+ EL V
Sbjct: 117 TGDRVHIDPTSRFMTLEVYEGVVNVLPIAHAGKGKRKAADNEIGELGDPIQVRIPELFVR 176
Query: 113 DIKFLY-------GCAKPTIVVLYQDNKDARHVKTYEVALKDK----------DFVEGPW 155
FLY G P VLY+D+K+ +K E+ + +G
Sbjct: 177 SSCFLYRRDTGPRGKTDPMFAVLYEDSKNQVRIKIRELEYSPSLRPNEEPATAELEKGVD 236
Query: 156 SQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIRP----SITKAYGRVDA 211
+L+ G+ +IP+P P G+L++GE I Y K +P ++ A+ ++D
Sbjct: 237 VTGSLEMGSSFMIPLPAPTYGLLVVGETRITYVDEWEHKVQDSQPLEDATVFVAWCKLDE 296
Query: 212 DGSRYLLGDHAGLLHLLVI-THEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGD 270
RY+L D L+LL++ T L+I++LGETS AST+ YLD V++GS GD
Sbjct: 297 --QRYILADDYKKLYLLMVNTSPHGDYQSLEIDVLGETSRASTLVYLDEGRVFVGSHQGD 354
Query: 271 SQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQ-----------GQGQVVTCS 319
SQ+I+++ + ++VL+ + N+ PI+DF V+D+ + GQ ++VT S
Sbjct: 355 SQVIQISPK------RIDVLQSFPNIAPILDFTVMDMGNRSLDAPVNEFSSGQARIVTGS 408
Query: 320 GAYKDGSLRIVRNGIGINEQASVELQG--IKGMWSLRSSTDDPFDTFLVVSFISETRILA 377
GA+ DGSLR VR+G+G+ ++ S+ G I ++SLR+S + L+VS +S T ++
Sbjct: 409 GAFNDGSLRSVRSGVGLEDRGSLGDLGAPISSIFSLRTSAAAQYVDTLIVSHVSHTNVVV 468
Query: 378 MNLEDELEETE-IEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPG 436
+ ++E + GF + TLF D + VQVTS SV L S + + W++P G
Sbjct: 469 FTEDGDIEARDSFRGFELKDATLFAGDLRDGRAVQVTSSSVLLTDSEGDMVTDRWQAPSG 528
Query: 437 YSVNVATANASQVLLATGGGHLVYLEIGDG---ILTEVKHAQLEYEISCLDINPIGENPS 493
S+ A+ ++L++ G LV L++ + E K E ++SC+ + +PS
Sbjct: 529 TSITAVAADGEKILVSLQGAVLVVLDLSASNIQVQAERKFGS-EEQVSCIAL-----SPS 582
Query: 494 YSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGE---IIPRSVLLCAF--EGISYLLCAL 548
G W D V SL DL E + E +PRS+++ + + L L
Sbjct: 583 IPNACVAGFWKDSKVSFLSLQDLQPTASERVAEEESLAVPRSLIVANILKDQPATLFVGL 642
Query: 549 GDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFS-SKNTTHVFAASDRPTVIYSSNKK 607
DG+++ + + + RK + LGTQ + +VFA + P++IY S+ +
Sbjct: 643 ADGNVVTYSVTSAQQPFSARKCIILGTQQANFAALPRADGLQNVFATCEHPSLIYGSDGR 702
Query: 608 LLYSNVNLKEVSHMCPFNS-AAFPDSLAIAK-----EGELTIGTIDDIQKLHIRSIPLGE 661
L+YS V + + + F+S + ++AIA E EL + +D+ + H++ + + E
Sbjct: 703 LVYSAVTAENATCIASFDSFGDYAGAIAIATTDENGENELKLAVVDEERTTHVQDLFIHE 762
Query: 662 HPRRICHQEQSRTFAICSLKNQSCA--EESEMHFVRLLDDQTFEFISTYPLDTFEYGCSI 719
RRI + + + F + +K A EE HF +L+D+ F+ + T+ L+ E +
Sbjct: 763 TVRRIAYSAELKAFGLGCIKRTLSAGNEEVASHF-KLVDEVAFKELDTWALNEDELVECV 821
Query: 720 LSCSFSDDSN---VYYCVGTAYVLPEENEPTKGRILVF-IVEDGKLQLIAEKETKGAVYS 775
+ C D S + VGTAY+ ++ KGRILV I ED +++L+ E +GA
Sbjct: 822 IRCYLDDGSGEEAERFVVGTAYLDDQDANNAKGRILVLEITEDRRIKLVTELAVRGACRC 881
Query: 776 LNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMK 835
L G+++AA+ + I +Y + + T L + + + + + I V DLMK
Sbjct: 882 LAVCQGRIVAALVKTIVVYDFEYQTPSTPALTKKASYRTATAPIDICVTNNTIAVTDLMK 941
Query: 836 SISLLIYKH----EEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGA 891
S+SLL +K + + E AR + W +A + ++ YL ++ NL ++ + G
Sbjct: 942 SLSLLEFKAGRQGQPDTLIEIARHFETLWGTACARVSENTYLESDAEGNLIVLQHDINGF 1001
Query: 892 TDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPH 951
+ E+R RL V E+ LGE VNR R + P P T +G I V +
Sbjct: 1002 SQEDRRRLRVTSEFLLGEMVNRIRP---ITVQPSPGAVVTPQAFLATTDGSIYVYCEIGK 1058
Query: 952 EQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAK---NFLDGDLIESFLDLSR 1008
+ L ++QT + ++K GG+ ++R F + + + F+DG+LIE FLD+
Sbjct: 1059 PRQDLLMRMQTLMADMVKSPGGVRFAKFRGFKTLVRDMGEEGPVRFVDGELIERFLDMPE 1118
Query: 1009 TRMDEISKTMN---VSVEELCKRVEELTRLH 1036
+E+ K ++ V +E L VE L R+H
Sbjct: 1119 VLQNEVVKGLDGTGVDLEGLRGMVEGLRRIH 1149
>gi|167517245|ref|XP_001742963.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163778062|gb|EDQ91677.1| predicted protein [Monosiga brevicollis MX1]
Length = 1670
Score = 448 bits (1152), Expect = e-123, Method: Compositional matrix adjust.
Identities = 317/1091 (29%), Positives = 542/1091 (49%), Gaps = 77/1091 (7%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
++D +YG + + + R G +D +F+ T ++F +L D E + ++T G+
Sbjct: 95 VMDKDLYGSVLAMNVIRLPGWERDAVFLLTSTFRFFILASD-EDNGVVTVIKGNALSSCQ 153
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RP D G + P I + +Y G +IPFD G+ EA+++ + +LD F+ G
Sbjct: 154 RPADCGVHVAVHPKGNCIFVSVYPGNALIIPFDASGEPMEAYSVFVPVSSLLDATFVNGA 213
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
T+ +L D+ + +K + + ++++ VE + + ++ + L+P+ GVL++
Sbjct: 214 HDFTLALLSDDDTNFTSLKMFHLDVEERTLVEEQLADSTINTYSSRLLPLWNLDSGVLVL 273
Query: 181 GEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-KVTG 239
GEE + + + + S+ A G +D DGSR L+GD G LHLLV+ E +VT
Sbjct: 274 GEELCHVVTPSGIISSNLSESLPVAAGIIDTDGSRILIGDELGDLHLLVLEGIAERRVTS 333
Query: 240 LKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKL--NLQPDAKGSYVEVLERYVNLG 297
+ + LG S S I YLDN VVYIGS D QLI+L ++Q +A V+VL+ Y N+G
Sbjct: 334 IVRQHLGRISTPSAIVYLDNGVVYIGSDQADCQLIQLLSHVQAEADNK-VKVLQEYPNIG 392
Query: 298 PIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSST 357
PIVDF +VDL+ GQ QVV+C G+ +DG LRI+R G+GI+ AS++L+G++ +W LRS++
Sbjct: 393 PIVDFEMVDLDGHGQQQVVSCCGSNQDGCLRILRKGVGIDVLASLDLEGLQDLWCLRSAS 452
Query: 358 ---DDPFDTFLVVSFISETRILAMNLED--------------ELEETEIEGFCSQTQTLF 400
+D D L + F+ +T L++ ++ EL+ ++ G ++ L
Sbjct: 453 NLGEDQHDV-LALKFLEQTAFLSLAGDEVCLLYSTPTSHSYTELDGVDVAGANTELPALH 511
Query: 401 CHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGG-HLV 459
C + + VTS RL+ + R W P G ++V + + +A+G + +
Sbjct: 512 CGNVRDGMWLVVTSQDARLLDAVDRTEVTRWSPPNGKGIDVCASTGDLLAVASGSDLYAL 571
Query: 460 YLEIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLI 519
L +G L ++K+A L++EI+CL I G + I A G+WTD S+R FS L
Sbjct: 572 SLSRTEG-LHDMKNATLDHEIACLSIRASGPDQGAGTILA-GLWTDFSLRAFSTRTLEEQ 629
Query: 520 TKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPIT 579
K + +++ SV EG Y GDG L + + + V +G+ P+
Sbjct: 630 AKVEVPTQVVSSSVASVTMEGTCYFFIGHGDGKLAYGVFDPLSSTFGAPHVVQVGSLPVK 689
Query: 580 LRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEG 639
LR VF A+DRP V+ S KLL+ NV+ N+ A+ D LA ++
Sbjct: 690 LRACKRGKDEFVFVATDRPMVVSSRRGKLLFCNVSAGACRTADVLNAEAYVDCLAYVEQD 749
Query: 640 ELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESE-MHFVRLLD 698
L G ++++Q L IR IPL E P + + + S F + + ++C E + ++RL+D
Sbjct: 750 RLVFGKMENMQNLQIRKIPLDETPLGVTYHKSSGAFCVATDAARACPTPQEPICYLRLID 809
Query: 699 DQTFEFISTYPLDTFE--YGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIV 756
Q+FE ++ L+ E +G S+ + +DS Y VGTA P P +GRILV V
Sbjct: 810 AQSFEVRDSFKLEQAESLFGHSLHTMQLRNDSTEYIVVGTAMHDPNRPLPKQGRILVLRV 869
Query: 757 -EDGKLQLIAEKETK-GAVYSLNAFNGKLLAAINQKIQLYKWMLRD-DGTRELQSECGHH 813
+DGKL+L+ G ++SL AF ++A IN +++ + + E+ S+
Sbjct: 870 NDDGKLELVVSHAIHDGGIFSLQAFRDGVVAGINGRLEYFSLESTPLERKVEVASQTVFR 929
Query: 814 GHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYL 873
G + G+ ++VGD+++S++ + Y + D + ++ + +D +L
Sbjct: 930 GMQTVSCLGVCGNTVLVGDILQSVTAVNYSEQRNRFVVGPGDPESRYLLTCFLPAEDRFL 989
Query: 874 GAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVN--------------------- 912
+++ NL + + + + + G H+G+ +N
Sbjct: 990 FCDSDQNL-VLGMPPVDTVENDASLMHLAGRIHIGDNINSYVICACIHVWTPYLLCPDST 1048
Query: 913 ----------RFRHGSLVMRLP-DSDVGQI----------PTVIFGTVNGVIGVIASLPH 951
RF GSL + ++ G+ P ++F TV G +G+I +
Sbjct: 1049 FCFAALFVTSRFAFGSLSLSYERPAEAGEAGEDGAKQQSSPPIVFTTVLGGVGMILEVQQ 1108
Query: 952 EQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKK--TVDAKNFLDGDLIESFLDLSRT 1009
+ F+ ++Q L + VGGL HE +RS N K+ A+ F+DG+LIESFL+L+
Sbjct: 1109 KHLWFMHEMQRRLADMGNAVGGLTHEDYRSTKNGKRESVTPARCFVDGNLIESFLELTPE 1168
Query: 1010 RMDEISKTMNV 1020
M+E+ K ++
Sbjct: 1169 EMEEVMKEFHI 1179
>gi|367044684|ref|XP_003652722.1| hypothetical protein THITE_2114471 [Thielavia terrestris NRRL 8126]
gi|346999984|gb|AEO66386.1| hypothetical protein THITE_2114471 [Thielavia terrestris NRRL 8126]
Length = 1187
Score = 446 bits (1147), Expect = e-122, Method: Compositional matrix adjust.
Identities = 330/1146 (28%), Positives = 552/1146 (48%), Gaps = 128/1146 (11%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPTD 64
+ G IA L+ RP D LF+ T+R+++ L W+ E+S+L T D +R R +
Sbjct: 55 VNGTIAILQKLRPKDARTDLLFVGTDRFEYFTLAWNPETSQLDTINPFNDPGERHMRDSQ 114
Query: 65 NGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFN----IRLEELQVLDIKFLYG- 119
+ ++DP R + +HL++G+ ++ N+ + +RL EL + FLY
Sbjct: 115 SQDRCLVDPSGRFLAMHLWEGVLTILRLGNRKNTATVLDWMGQVRLSELFIKASTFLYTE 174
Query: 120 CAKPTIVVLYQDNKDAR--HVKTYEVALKDKDFVEGPWSQNN--------LDNGADLLIP 169
P I +LYQ D+ + TY + D++ + N D A +LIP
Sbjct: 175 TGHPKIALLYQSRADSSDAQLATYRLTSDDRNTELSRFDPNRDREIDAEIHDPSASMLIP 234
Query: 170 V------------------PPPLCGVLIIGEETIVY---CSANAFKAIPIRPSITKAYGR 208
V + G++++GE ++Y + ++ SI A+
Sbjct: 235 VRKVEEQVKRHNVRNVESAKAHIGGLIVVGETRLLYIDEVTKTTVESALTEASIFVAWAE 294
Query: 209 VDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSY 268
D Y L D G LHLL + E VTG+ + +G+TS AS + YL + ++++ S Y
Sbjct: 295 YDV--KHYFLADDYGNLHLLTLETEDVVVTGMIVNRIGKTSRASNLVYLGDNLLFVASHY 352
Query: 269 GDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQ------------GQGQVV 316
GDSQL +L+L+ D V++++ N+GPI+DF ++DL + GQ ++V
Sbjct: 353 GDSQLFRLDLENDDARQLVQLVQTLPNIGPILDFEIMDLGNRGDEGQLANEYSSGQARIV 412
Query: 317 TCSGAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLRSSTDDPFDTFLVVSFISETRI 375
TCSG +KDG+LR VR+G+G+ + + +L+ +G++ L SS P L VSF++ETR+
Sbjct: 413 TCSGVHKDGTLRSVRSGVGLEDVGILADLEHCRGLFPL-SSYGSPKTDTLAVSFLTETRV 471
Query: 376 LAMNLEDELEETE-IEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSP 434
+ ++EE E G QTL + QL+QVT + L+ + S W
Sbjct: 472 FKFDSHGDVEEVESFSGMTFDQQTLLAMNLPKGQLLQVTPAAASLLDAESGVTIASWAPE 531
Query: 435 PGYSVNVATANASQVLLATGGGHLVYLEIGDGILT-EVKHAQLEYEISCLDINPIGENPS 493
++ A+AN +LL+ GG LV L I + T + K + +++C+ + P
Sbjct: 532 GERTIISASANPRWLLLSVGGTELVSLSIANDFQTVQAKDMNQQDQVACIHV-----APG 586
Query: 494 YSQIAAVGMWTDISVRIFSLPDLNLITKEHL----GGEIIPRSVLLCAFEGISY----LL 545
+ VG WT +V I L L I E L IPR + L S L
Sbjct: 587 LDDVGVVGFWTSGTVSIIDLHTLEPIHGESLRTSKDDASIPRDLALVQMLPPSASGPTLF 646
Query: 546 CALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTT-HVFAASDRPTVIYSS 604
A+ DG+++ F ++ K L+ RK+V LG + + +FA ++ P++IY S
Sbjct: 647 VAMQDGNVVTFNVS-KDLALSGRKRVILGMRQARFHLLPQPDGIFSIFATTEHPSLIYGS 705
Query: 605 NKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPR 664
+++YS V +E +++CPF++ AFPD + +A + ++ I ID ++ H++ + +GE R
Sbjct: 706 EGRIVYSAVTAEEATYICPFDTEAFPDCIVLATDAQIRISQIDRERRTHVKPLQMGEMVR 765
Query: 665 RICHQEQSRTFAI-CSLKNQSCAEESEMHFVRLLDDQTFEFIS-TYPLDTFEYGCSILSC 722
RI + + + F + C ++ EE +L+D+ F+ + ++PL + Y +
Sbjct: 766 RIAYSPREKVFGLGCIKRDLVAGEEVVQSSFKLVDEIIFDRVGRSFPLGSPSYTELVECV 825
Query: 723 SFSDDSNVY------YCVGTAYVL-PEENEPT--KGRILVFIVE-DGKLQLIAEKETKGA 772
++ + Y + VGT+Y+ P+ T +GRILVF V+ D L+ E KGA
Sbjct: 826 VRAELPDSYGAPAERFLVGTSYLADPDLGAGTDARGRILVFGVDADRNPYLVLSHELKGA 885
Query: 773 VYSLNAF-NGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVG 831
L +GK++A + + + + ++ T EL + + + RG+ I V
Sbjct: 886 CRCLAVMDDGKIVAGLTKTVVVCRYEETSSTTAELTRLASYRPSTYPVELCVRGNTIAVA 945
Query: 832 DLMKSISLLIY---------------KHEEGAIEERARDYNANWMSAVEILDDDIYLGAE 876
DLMKS++L+ + K E + E+AR + + W +AV + DD +L A+
Sbjct: 946 DLMKSVALVEFVPAGADETGAAGPSSKRGEAKLVEKARHFGSVWATAVSHVQDDSWLEAD 1005
Query: 877 NNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIF 936
NL +R+N EG T E++ R+EV E +LGE VNR R G V P + V +P
Sbjct: 1006 AQGNLMVLRQNLEGVTAEDKKRMEVTSEMNLGEMVNRIR-GIEVETTPGAIV--VPKAFL 1062
Query: 937 GTVN-------------------------GVIGVIASLPHEQYLFLEKLQTNLRKVIKGV 971
GTV+ G+ +PH Q L L + Q L VIK
Sbjct: 1063 GTVSLRCGESLFGMNERRFANTGQWQVEGGIYMFATVVPHAQDLLL-RFQAKLADVIKTA 1121
Query: 972 GGLNHEQWRSFNNEKKTVDAK-NFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVE 1030
GG+ +R+F N ++ D F+DG+L+E FLD+ + I + + +VE++ VE
Sbjct: 1122 GGIEFRTYRAFRNAEREGDGPFRFIDGELLERFLDVDEATQEVICQGLGPTVEDMRNLVE 1181
Query: 1031 ELTRLH 1036
EL R+H
Sbjct: 1182 ELRRMH 1187
>gi|326426696|gb|EGD72266.1| hypothetical protein PTSG_00286 [Salpingoeca sp. ATCC 50818]
Length = 1104
Score = 444 bits (1143), Expect = e-122, Method: Compositional matrix adjust.
Identities = 302/1068 (28%), Positives = 543/1068 (50%), Gaps = 51/1068 (4%)
Query: 3 DVPIYGRIATL-ELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGR 61
D P+YG++A + ++ P + D L + T+ +F VL ++ + +L T G++ +
Sbjct: 54 DYPVYGKVACIRKMTFPQMKGLDSLLVMTDDMRFFVLTFNEHTKKLETHTNGNLLNPSSF 113
Query: 62 PTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGCA 121
P + G + +DPDCR+I + +Y GL K++ +EA RL+ QVL + L+GCA
Sbjct: 114 PMETGPLVAVDPDCRVIVMAMYPGLVKILRIHGCKFEEEASEARLDADQVLSMAMLHGCA 173
Query: 122 KPTIVVLYQDNKDARHVKTYEVALKDKDF-VEGPWS---QNNLDNGADLLIPVPPPLCGV 177
+PT+ VL Q D R VK V +++ V W + + + A L PVP G
Sbjct: 174 EPTLAVLCQ-RADQRFVKLMRVTSRNELVEVSSVWKTRLEGAVPDSAHFLHPVPNTSEGC 232
Query: 178 LIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDH-AGLLHLLVITHEKEK 236
L+ G + +VY AN +KA I + +A VD G+R+L+GD G L LLV+ E+++
Sbjct: 233 LVFGADAVVYADANGYKAASIPEMMVQAVADVDDSGARFLIGDSLRGGLALLVLEREEDQ 292
Query: 237 VTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNL 296
V + E GET ST++YLDN+VV++GS GDSQ++KL D + + VL+ + N+
Sbjct: 293 VQQIVYEPFGETVAPSTLAYLDNSVVFVGSVVGDSQVVKLETDDDNQNKII-VLDTHENI 351
Query: 297 GPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSS 356
G ++D C + G+ ++VT SGA KDGSLR+++ G+ I A+V+L ++ +W L
Sbjct: 352 GSVLDMCSLPQASFGETRLVTASGAGKDGSLRVIQRGVNITSSATVQLDDLQRVWVLTKP 411
Query: 357 TDDP----FDTFLVVSFISETRILAMNLEDELEETEIEGF----CSQTQTLFCHDAIYNQ 408
+ + ++FL +SF L + E E+ GF L C + NQ
Sbjct: 412 SGEAEAATEESFLALSFAGGLAFL------QFEGEELCGFEVPSAPSDPALLCSNVAENQ 465
Query: 409 LVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGIL 468
+ VT V LV + + EWK+ G ++ A A + + L+ + G L+ EI G L
Sbjct: 466 WLFVTEDEVVLVCAETLARVAEWKAAEGQAIG-ACACSEKQLVVSSGRQLLIFEIAKGKL 524
Query: 469 TEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEI 528
T+ K LE+E+SCLDI P+ ++ + +A G W ++ +++ LP + + + +
Sbjct: 525 TQTKDTTLEHELSCLDILPVNDDGTSVMVA--GTW-NVEAKVYHLPSMRVASSAPFKAGV 581
Query: 529 IPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNT 588
I S + + +GDG ++ ++ + +T+++ V G QP++L + +
Sbjct: 582 IATSCAITRLGDQNVAFFGMGDGSVVRYIFAEGSWHMTNQRHVHAGKQPVSLVSCKFASG 641
Query: 589 THVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDD 648
V A S+ + Y+ + ++ +S +N ++ + P ++ A+PDSL + L IG I
Sbjct: 642 PAVVALSNTSLLFYADSGRVTFSTLNEANLTCVAPLSTPAYPDSLVFSTPASLGIGQIGR 701
Query: 649 IQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTY 708
+ LHI + LG P + + ++ + + + + +R+ D T E ++++
Sbjct: 702 MNNLHINKVALGFSPVSLTTISANPSYVVV-VGHVDQEDGGIASAIRVFDGTTLEMVASH 760
Query: 709 PLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGK-LQLIAEK 767
L E ++ F DD+ Y+ +GTA+V P E +P++GRIL+ +E+ K + ++ E
Sbjct: 761 ELPAPEAVNCVIQHKFKDDNTEYFIIGTAFVDPTETQPSRGRILISKLENKKEIAIVHEC 820
Query: 768 ETKGAVYSLNAFNGK----LLAAINQKIQLYKW-MLRDDGTRELQSECGHHGHILALYVQ 822
E G+VY L GK L+A IN ++ +K+ D ++L++ G+ + +
Sbjct: 821 EAAGSVYCLTKMCGKDTDDLVAGINNQVVHFKYDATGQDAAKKLRAVSGNQNFGAVVSLD 880
Query: 823 TRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLF 882
+ D ++VGD++ ++ + + + ++ A ANW+S+ ++++ ++L A + +L
Sbjct: 881 SCDDIVLVGDMLNAV--FVMQKAQDKLQLVAGSQTANWVSSCALVNETVFLVASHAHSLS 938
Query: 883 TVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSL-VMRLPDSDVGQIPT-VIFGTVN 940
++ E + + L E +LGE V F +L DS + T +FGT+
Sbjct: 939 VCQREFEPGSTMQ--TLNAKFEIYLGETVTSFVRAALGSAAAVDSSMPLRNTFFVFGTMG 996
Query: 941 GVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKN--FLDGD 998
G + + L Q L L+ + + I G+GGL+H ++R+ +E++ N +DGD
Sbjct: 997 GGLACLLPLTPPQTELLTALECRMEEKIGGLGGLDHREFRTARDEQRMAQQVNPRLVDGD 1056
Query: 999 LIESFLDLSRTRMDEISKTMN----------VSVEELCKRVEELTRLH 1036
L+E+FL L E+ M+ V+ E +E + RLH
Sbjct: 1057 LVETFLQLPEEEQKELVAGMSLMGEDGRPYVVTAEVAKAALEAMARLH 1104
>gi|70992271|ref|XP_750984.1| UV-damaged DNA binding protein [Aspergillus fumigatus Af293]
gi|66848617|gb|EAL88946.1| UV-damaged DNA binding protein, putative [Aspergillus fumigatus
Af293]
gi|159124553|gb|EDP49671.1| UV-damaged DNA binding protein, putative [Aspergillus fumigatus
A1163]
Length = 1140
Score = 441 bits (1134), Expect = e-121, Method: Compositional matrix adjust.
Identities = 334/1104 (30%), Positives = 544/1104 (49%), Gaps = 91/1104 (8%)
Query: 6 IYGRIATL-ELFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPT 63
IY R+ L L P D LF+ T+RY + L WD+ + T R D+SD R +
Sbjct: 55 IYARVTMLARLPAPANSPTDHLFVGTDRYTYFTLSWDSSEKRVRTERDYVDMSDPSSRES 114
Query: 64 DNGQIGIIDPDCRLIGLHLYDGLFKVIPF--------------------DNKGQLKEAFN 103
G +IDP R + L +Y+G+ VIP G+L E
Sbjct: 115 QTGNRCLIDPSGRFMTLEIYEGVIAVIPIIQLPSKKRGRAVALPSGPDAPRVGELGEPTT 174
Query: 104 IRLEELQVLDIKFLY-GCAKPTIVVLYQDN-KDAR--------HVKTYEVALKDKDFVEG 153
R+EEL V FL+ P + +LY+DN K R H T + D F E
Sbjct: 175 ARIEELFVRSSAFLHVQEGLPRLALLYEDNQKKVRLRVRELQYHAATGSNSTADATFGEP 234
Query: 154 PWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIRP----SITKAYGRV 209
+L+ G+ LIPVP PL G+LI+GE +I Y A+ + I RP +I A+ +V
Sbjct: 235 ADFTQDLELGSSHLIPVPAPLGGLLILGEMSIKYVDADNNEIIS-RPLDEATIFVAWEQV 293
Query: 210 DADGSRYLLGDHAG-LLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSY 268
D+ R+LL D G L L+++ +V K++ LG TS AS + YL ++++GS
Sbjct: 294 DS--QRWLLADDYGRLFFLMLVLDSDSQVESWKLDHLGNTSRASVLVYLGGGILFLGSHQ 351
Query: 269 GDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDL-----ERQ------GQGQVVT 317
GDSQ+++++ P +EV++ N+ PI+DF ++DL E Q GQ ++VT
Sbjct: 352 GDSQVLRISNGP------LEVVQTLSNIAPILDFTIMDLGNRSSESQTHEFSSGQARIVT 405
Query: 318 CSGAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLRSSTDDPFDTFLVVSFISETRIL 376
SGA+ DG+LR VR+G+G+ E + ++ I +W L+ + F L+V+F+ ETR+
Sbjct: 406 GSGAFDDGTLRSVRSGVGMEELGVLGDMDHITDLWGLQVGSSGDFLDTLLVTFVDETRVF 465
Query: 377 AMNLEDELEETE-IEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPP 435
+ + E+EE + G TL + +++QVT V + + EW P
Sbjct: 466 RFSSDGEVEEMDHFLGLSLSESTLLATNLPGGRILQVTEQRVLIAEIEGGMVIYEWTPPN 525
Query: 436 GYSVNVATANASQVLLATGGGHLVYLEIGDGI-LTEVKHAQLEYEISCLDINPIGENPSY 494
+ + A+AN ++L GG + L I + + + K + +IS + + +P+
Sbjct: 526 QFIITAASANDDSIVLVAGGELVTVLNITNEVQVVTQKDFGADSQISGVTVP---SSPTG 582
Query: 495 SQIAAVGMWTDISVRIFSLPDLNLI--TKEHLGGEIIPRSVLLCAF--EGISYLLCALGD 550
IA +SV L DL+ + T + GE PRSVL+ + L ++ D
Sbjct: 583 VCIAGFPQLAKVSV--LKLQDLSELHTTSVGMAGEAFPRSVLVANVLTDSPPTLFVSMAD 640
Query: 551 GHLLNFLLNMKTGELTDRKKVSLGT-QPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLL 609
G ++ + N LT K+ LG+ QP + +VFA + P++IY S +++
Sbjct: 641 GSVITYSFNTNNFSLTGMTKLILGSEQPTFKKLPRGSGLFNVFATCENPSMIYGSEGRII 700
Query: 610 YSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQ 669
YS VN + S +C FNS A+P+S+A+A +L I +D + I+++P+G RR+ +
Sbjct: 701 YSAVNSEGASRICHFNSEAYPESIAVATSHDLKIALVDKERTTQIQTLPIGATVRRVAYS 760
Query: 670 EQSRTFAICSLKNQ--SCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFS-- 725
+ F I +++ + E + FV L D+ F + + L E S++ F
Sbjct: 761 PSEKAFGIGTIERKLVDGTEIVKSRFV-LADEILFRRLDAFELRPEELVESVIRAEFPAG 819
Query: 726 ------DDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDG-KLQLIAEKETKGAVYSLNA 778
D+ + VGTAY+ E +E +GRIL+F V++G KL +AE KGA +L
Sbjct: 820 KGANDRDEVKDRFIVGTAYLDDEGDESIRGRILIFEVDNGRKLTQVAELPVKGACRALAM 879
Query: 779 FNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSIS 838
K++AA+ + + +YK + + G L+ + + V G+ I V DLMKS+
Sbjct: 880 LGDKIVAALVKTVVVYKVINNNFGAMRLEKLASYRTSTAPVDVTVTGNLIAVSDLMKSMC 939
Query: 839 LLIYKHEEGA----IEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDE 894
L+ YK E + E AR + W + V + D +L ++ NL + +N+ G ++
Sbjct: 940 LVEYKEGENGTPDTMTEVARHFQTVWATGVANIAPDTFLESDAEGNLIVLHRNTTGVEED 999
Query: 895 ERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQY 954
++ RLEV GE LGE VNR R + + +L + V P GTV G I + A + +
Sbjct: 1000 DKRRLEVTGEISLGEMVNRIRPVN-IQQL--ASVAVTPRAFLGTVEGSIYLFAIINPDHQ 1056
Query: 955 LFLEKLQTNLRKVIKGVGGLNHEQWRSFNNE-KKTVDAKNFLDGDLIESFLDLSRTRMDE 1013
FL +LQ + ++ VG + ++R F + ++ + F+DG+LIE FL + +E
Sbjct: 1057 DFLMRLQATIAGKVELVGNIPFNEFRGFRSMVREAKEPYRFVDGELIERFLTCEPSLQEE 1116
Query: 1014 ISKTM-NVSVEELCKRVEELTRLH 1036
I T+ +SV+E+ +E L RLH
Sbjct: 1117 IVSTVGKMSVDEVKGMIEALRRLH 1140
>gi|296411833|ref|XP_002835634.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295629420|emb|CAZ79791.1| unnamed protein product [Tuber melanosporum]
Length = 1053
Score = 440 bits (1131), Expect = e-120, Method: Compositional matrix adjust.
Identities = 310/977 (31%), Positives = 491/977 (50%), Gaps = 76/977 (7%)
Query: 5 PIYGRIATLELFRPHGEAQDFLFIATER--YKFCVLQWDAESSELITRAMGDVSDRIGRP 62
PIYGR+ L FRP A D L I + Y++ + WD + + T M P
Sbjct: 54 PIYGRVIALAAFRPVKSATDHLLIVIGKVNYQYFTISWDPVAKKPKTEHMAVNFSEYNAP 113
Query: 63 TDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNK--------------GQLKEAFNIRLEE 108
+ + DP ++G+H+Y G+F VIP + G + +RL+E
Sbjct: 114 LSDSFQCLADPGKNMLGIHVYKGIFLVIPQIQQSIKGSRRSRADLDVGNIGNPCVVRLKE 173
Query: 109 LQVLDIKFLYGCAKPTIVVLYQDN-KDARHVKTYEVALKDKDFVEGPWSQNNLDNG--AD 165
L++LD+KFL+G P + VLY+ + D V TYE+++K + W +L G A
Sbjct: 174 LEILDLKFLFGTISPVLAVLYKPSGADEMAVNTYELSVKSGEVKLLDWRIRDLKGGREAL 233
Query: 166 LLIPVPPPLCGVLIIGEETIVYCSANAFKA-IPIRPSITKAYGRVDADGSRYLLGDHAGL 224
LIPV PP G+L+IG I Y K +P+ P + + + RY+LGD AG
Sbjct: 234 FLIPVRPPSNGLLLIGVTKIQYFDNYGNKTFLPVDPPMVWVTWEMLSP-ERYILGDEAGG 292
Query: 225 LHLLVITHE-KEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAK 283
LH+L ++ + GL ++L+G SI + +L+ ++++GS GDSQL L L +
Sbjct: 293 LHMLTLSAGLMDTKVGLHLKLVGNASIPEILVHLNQGLLFLGSHSGDSQL--LQLYQNGI 350
Query: 284 GSYVEVLERYVNLGPIVDFCVVDLE--------RQ---GQGQVVTCSGAYKDGSLRIVRN 332
V V + N+GPIVDF V+DL+ RQ G + ++ SG + G LR +R+
Sbjct: 351 KPMVRVQQVLRNIGPIVDFRVMDLDYSRSDEVMRQYSPGHIRFLSASGGHTQGHLRTIRS 410
Query: 333 GIGINEQASV-ELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEET-EIE 390
G+G+ + + E+ GI+G+WSLRS FD LVVSFI ETRI + E+EE E
Sbjct: 411 GVGLYDLGFLGEMSGIRGLWSLRSIPGSSFDDVLVVSFIEETRIFKFDNSGEIEELYEFM 470
Query: 391 GFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQ-V 449
GF +T+ H + + +QVT+ +V+LV S L E S P S+ + A+A+Q +
Sbjct: 471 GFALNQRTILAHSVVGGRFLQVTATAVKLVDVRSNTLIAE--SYPDKSLTITIASANQDL 528
Query: 450 LLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVR 509
L+ G LV L + L E E EISCL++ S S I AVG WT V
Sbjct: 529 LIYAMGPTLVLLNLARD-LEEHIRTTFENEISCLNMPS-----SPSTICAVGFWTVSLVL 582
Query: 510 IFSLPDLNLITKEHLGGE---IIPRSVLLCAF--EGISYLLCALGDGHLLNFLLNMKTGE 564
I S+ +++++E L E PRS+L G LL ALGDG + F LN T
Sbjct: 583 ILSVQSFSILSQEILSQEDSAATPRSLLFARLLENGPPTLLVALGDGSMFTFALNETTCG 642
Query: 565 LTDRKKVSLGTQPITLRTFSSKNT-THVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCP 623
L++RK + LG QPI ++ N VFA D P+VIY S+ +++Y++V + +++
Sbjct: 643 LSERKHIILGAQPIRFQSIPGGNGGVTVFATCDHPSVIYGSDGRIVYASVTADKPTYVTS 702
Query: 624 FNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQ 683
FNS +FPD++ IA E +L + +D ++ +H++S+P+G+ RRI + ++ AI ++
Sbjct: 703 FNSPSFPDAVVIASEDDLKLSVVDPVRTMHVQSLPVGDVVRRIAYSKEKNIIAIVTVSKV 762
Query: 684 SCAEESEMHF---VRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVY---YCVGTA 737
+ + +RL+D+ F + +Y L+ E S+ S + + + GT
Sbjct: 763 PDTRTGDYLYSSCIRLVDNTAFSVVDSYELNQLELVESLASGKICGGNGLLSEGFLAGTV 822
Query: 738 YVLPEENEPTKGRILVF-IVEDGKLQLIAEKETKGAVYSLNAFN-GKLLAAINQKIQLYK 795
Y +E KGRI+VF E +++LI +T G+V + GK +AAI ++I LY
Sbjct: 823 YPGGGRDESEKGRIIVFNASETKRIKLIVSYDTPGSVNGIQIVGEGKFVAAIGREIHLYS 882
Query: 796 WMLRDDGTRELQSE------------CGHHGHILALYVQTRGDFIVVGDLMKSISLLIY- 842
D ++ SE H L V D I V D M S+L +
Sbjct: 883 LKYTDIQSKTAASESVGATGCTITKLASFKAHSTPLDVAVYNDIIAVCDFMHGPSILQHI 942
Query: 843 ---KHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRL 899
+++ E AR +W++A+E+LD+ A+ + NL ++ G T+++R +L
Sbjct: 943 EDKENKSSEFVEVARAPKPSWLTALELLDEKTVFCADTDGNLVVWQRQLSGVTEDDRKQL 1002
Query: 900 EVVGEYHLGEFVNRFRH 916
+ + +GE ++R R
Sbjct: 1003 QQIASMKIGEDIDRIRR 1019
>gi|317031116|ref|XP_001392900.2| UV-damaged DNA binding protein [Aspergillus niger CBS 513.88]
Length = 1124
Score = 439 bits (1130), Expect = e-120, Method: Compositional matrix adjust.
Identities = 329/1104 (29%), Positives = 549/1104 (49%), Gaps = 107/1104 (9%)
Query: 6 IYGRIATL-ELFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPT 63
++ ++ L L P D LF+ T+RY +C L WD E +++ T R D+SD R
Sbjct: 55 LFAKVTMLARLPAPANSPTDHLFVGTDRYTYCTLSWDGERNQIRTERNYVDISDPSSREA 114
Query: 64 DNGQIGIIDPDCRLIGLHLYDGLFKVIPF--------------------DNKGQLKEAFN 103
G +IDP R + L +Y+G+ V+P G+L E
Sbjct: 115 QTGNRCLIDPSGRFMTLEVYEGVIAVVPIVQLPSKKRGRQVAPPSGPDAPRVGELGEPTT 174
Query: 104 IRLEELQVLDIKFLYGCAKPTIVVLYQDNKDARHVKT----YEVALK----DKDFVEG-- 153
R++EL + +LY+DN+ +K Y A D F E
Sbjct: 175 ARIDEL---------------LALLYEDNQKKVRLKVRALHYSAATASTGADAAFEESLD 219
Query: 154 PWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIRP----SITKAYGRV 209
+SQ LD GA LIPVP PL G+L++GE +I Y ++ + + RP +I A+ +V
Sbjct: 220 GFSQE-LDLGASHLIPVPAPLGGLLVLGETSIKYVDTDSNEIVS-RPLDEATIFVAWEQV 277
Query: 210 DADGSRYLLGDHAG-LLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSY 268
D+ R+LL D G L L+++ +V K++ LG T+ AS + YL V+++GS
Sbjct: 278 DS--QRWLLADDYGRLFFLMLVLDSNNQVQSWKLDHLGNTARASVLIYLGGGVIFVGSHQ 335
Query: 269 GDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDL-----ERQ------GQGQVVT 317
GDSQ++++ GS +EV++ N+ PI+DF ++DL E Q GQ ++VT
Sbjct: 336 GDSQVLRI-----GNGS-LEVIQTLSNIAPILDFTIMDLGNRTSESQTHEFSSGQARIVT 389
Query: 318 CSGAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLRSSTDDPFDTFLVVSFISETRIL 376
SGA+ DG+LR VR+G+G+ E + E++ I ++ L+ +T + L+V+F+ ETR+
Sbjct: 390 GSGAFDDGTLRSVRSGVGMEELGVLGEMELITDLFGLQLATKGGYLDTLLVTFVDETRVF 449
Query: 377 AMNLEDELEETE-IEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPP 435
N + E+EE + G TL + +++QVT V + + NEW P
Sbjct: 450 QFNFDGEVEELDSFLGLSLSENTLLAMNLPGGRILQVTEQRVLIADIEGGMVTNEWTPPD 509
Query: 436 GYSVNVATANASQVLLATGGGHLVYLEIGDGI-LTEVKHAQLEYEISCLDINPIGENPSY 494
+ A+AN ++L +GG + L+I + + + K + +IS + + P+ S
Sbjct: 510 NLVITSASANNDSIVLVSGGQLMTVLDINNDVRVISQKDFGADSQISGVTV-PL----SS 564
Query: 495 SQIAAVGMWTDISVRIFSLPDLNLITKEHLG--GEIIPRSVLLCAF--EGISYLLCALGD 550
+ + VG V + L L+ + LG GE PRSVL+ S L ++ D
Sbjct: 565 AGVCIVGFPQLAKVSVLDLGRLSELHTTSLGPAGEAFPRSVLVADVLANSPSTLFISMAD 624
Query: 551 GHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTH-VFAASDRPTVIYSSNKKLL 609
G ++ + + + LT ++ LG++ T + + + VFA + P++IY S +++
Sbjct: 625 GSVITYSFDARNYSLTGMNRLILGSEQPTFKKLPRGDGLYNVFATCENPSLIYGSEGRII 684
Query: 610 YSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQ 669
YS VN + S +C FNS A+P S+A+A +L I +D + I+++P+GE RR+ +
Sbjct: 685 YSAVNSEGASRVCHFNSEAYPGSIAVATRHDLKIALVDKERTTQIQTLPMGETARRVAYS 744
Query: 670 EQSRTFAICSL--KNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFS-- 725
+ F I ++ K + AE + FV L D+ F + + L E S++ FS
Sbjct: 745 PSEKAFGIGTIERKLKDGAEMVQSRFV-LADEIMFRRLDAFDLRPEELVESVIRAEFSAG 803
Query: 726 ---DDSNVY---YCVGTAYVLPEENEPTKGRILVFIVEDG-KLQLIAEKETKGAVYSLNA 778
+ +V+ + VGTAY+ E E +GRILVF +++G KL +AE KGA +L
Sbjct: 804 KDENGRDVFKDRFVVGTAYLDDENEESIRGRILVFEIDNGRKLTKVAELPVKGACRALAM 863
Query: 779 FNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSIS 838
K++AA+ + + +Y + D G +L+ + + V G+ I V DLMKS+
Sbjct: 864 LGEKIVAALVKTVVIYGVVNNDFGAMKLEKLASYRTSTAPVDVTVTGNVIAVADLMKSVC 923
Query: 839 LLIYKHEEG----AIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDE 894
L+ Y E ++ E AR + W + V + D +L + NL +R+N G ++
Sbjct: 924 LVEYSEGENGSPDSLTEVARHFQTVWATGVSCIAKDTFLETDAEGNLIVLRRNLTGVEED 983
Query: 895 ERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQY 954
++ RLEV GE LGE VNR R ++ + V P GTV G I + A + E
Sbjct: 984 DKRRLEVTGEISLGEMVNRIRPVNIQQL---ASVTVTPRAFLGTVEGSIYLFAIINPEHQ 1040
Query: 955 LFLEKLQTNLRKVIKGVGGLNHEQWRSFNNE-KKTVDAKNFLDGDLIESFLDLSRTRMDE 1013
FL +LQ + ++ +G + ++R F + ++ + F+DG+LIE FL + +E
Sbjct: 1041 DFLMRLQATMAGKVESLGNIPFNEFRGFRSMVREAKEPYRFVDGELIERFLTCEPSLQEE 1100
Query: 1014 ISKTMN-VSVEELCKRVEELTRLH 1036
I ++ ++V+E+ +E L RLH
Sbjct: 1101 IVDSVGMMNVDEVKIMIEALRRLH 1124
>gi|119471789|ref|XP_001258220.1| UV-damaged DNA binding protein, putative [Neosartorya fischeri NRRL
181]
gi|119406372|gb|EAW16323.1| UV-damaged DNA binding protein, putative [Neosartorya fischeri NRRL
181]
Length = 1140
Score = 439 bits (1130), Expect = e-120, Method: Compositional matrix adjust.
Identities = 335/1104 (30%), Positives = 545/1104 (49%), Gaps = 91/1104 (8%)
Query: 6 IYGRIATL-ELFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPT 63
IY R+ L L P D LF+ T+RY + L WD+ + T R D+SD R +
Sbjct: 55 IYARVTMLARLPAPANSPTDHLFVGTDRYTYFTLSWDSSEKRVRTERDYVDMSDPSSRES 114
Query: 64 DNGQIGIIDPDCRLIGLHLYDGLFKVIPF--------------------DNKGQLKEAFN 103
G +IDP R + L +Y+G+ VIP G+L E
Sbjct: 115 QTGNRCLIDPSGRFMTLEIYEGVIAVIPIIQLPSKKRGRAVALPSGPDAPRVGELGEPTT 174
Query: 104 IRLEELQVLDIKFLY-GCAKPTIVVLYQDN-KDAR--------HVKTYEVALKDKDFVEG 153
R+EEL V FL+ P + +LY+DN K R H T + D F E
Sbjct: 175 ARIEELFVRSSAFLHVQEGLPRLALLYEDNQKKVRLRVRELQYHAATGSNSTADATFGEP 234
Query: 154 PWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIRP----SITKAYGRV 209
+L+ G+ LIPVP PL G+LI+GE +I Y A+ + I RP +I A+ +V
Sbjct: 235 ADFTQDLELGSSHLIPVPAPLGGLLILGEMSIKYVDADNNEIIS-RPLDEATIFVAWEQV 293
Query: 210 DADGSRYLLGDHAG-LLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSY 268
D+ R+LL D G L L+++ +V K++ LG TS AS + YL V+++GS
Sbjct: 294 DS--RRWLLADDYGRLFFLMLVLDSDSQVESWKLDHLGNTSRASVLVYLGGGVLFLGSHQ 351
Query: 269 GDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDL-----ERQ------GQGQVVT 317
GDSQ+++++ P +EV++ N+ PI+DF ++DL E Q GQ ++VT
Sbjct: 352 GDSQVLRISNGP------LEVVQTLSNIAPILDFTIMDLGNRSSESQTHEFSSGQARIVT 405
Query: 318 CSGAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLRSSTDDPFDTFLVVSFISETRIL 376
SGA+ DG+LR VR+G+G+ E + +++ I +W L+ + F L+V+F+ ETR+
Sbjct: 406 GSGAFDDGTLRSVRSGVGMEELGVLGDMEHITDLWGLQVGSIGDFLDTLLVTFVDETRVF 465
Query: 377 AMNLEDELEETE-IEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPP 435
+ + E+EE + G TL + +++QVT V + + EW P
Sbjct: 466 RFSSDGEVEEMDHFLGLSLSESTLLATNLPGGRILQVTEQRVLIAEVEGGMVIYEWTPPN 525
Query: 436 GYSVNVATANASQVLLATGGGHLVYLEIGDGI-LTEVKHAQLEYEISCLDINPIGENPSY 494
+ A+AN ++L GG + L I + + + K + +IS + + +P+
Sbjct: 526 QLIITAASANDDSIVLVAGGELVTVLNITNEVQIVTQKDFGADSQISGVTVP---SSPTG 582
Query: 495 SQIAAVGMWTDISVRIFSLPDLNLI--TKEHLGGEIIPRSVLLCAF--EGISYLLCALGD 550
IA +SV L DL+ + T L GE PRSVL+ + L ++ D
Sbjct: 583 VCIAGFPQLAKVSV--LKLQDLSELHTTSVGLAGEAFPRSVLVANVLTDSPPTLFVSMAD 640
Query: 551 GHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSK-NTTHVFAASDRPTVIYSSNKKLL 609
G ++ + N LT K+ LG++ T + +VFA + P++IY S +++
Sbjct: 641 GSVITYSFNTDDYSLTGMTKLILGSEQPTFKKLPRGIGLFNVFATCENPSMIYGSEGRII 700
Query: 610 YSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQ 669
YS VN + S +C FNS A+P+S+A+A +L I +D + I+++P+G RR+ +
Sbjct: 701 YSAVNSEGASRICHFNSEAYPESIAVATSHDLKIALVDKERTTQIQTLPIGATVRRVAYS 760
Query: 670 EQSRTFAICSLKNQ--SCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFS-- 725
+ F I +++ + AE + FV L D+ F + + L E S++ F
Sbjct: 761 PSEKAFGIGTIERKLVDGAEIVKSRFV-LADEILFRRLDAFELRPEELVESVIRAEFPAG 819
Query: 726 ------DDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDG-KLQLIAEKETKGAVYSLNA 778
D+ + VGTAY+ E +E +GRIL+F V++G KL +AE KGA +L
Sbjct: 820 KGANDRDEVKDRFIVGTAYLDDEGDESIRGRILIFEVDNGRKLTQVAELPVKGACRALAM 879
Query: 779 FNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSIS 838
K++AA+ + + +Y+ + + G L+ + + V G+ I V DLMKS+
Sbjct: 880 LGDKIVAALVKTVVVYRVINNNFGAMRLEKLASYRTSTAPVDVTVTGNLIAVSDLMKSMC 939
Query: 839 LLIYKHEEGA----IEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDE 894
L+ YK E + E AR + W + V + D +L ++ NL + +N+ G ++
Sbjct: 940 LVEYKEGENGTPDTMTEVARHFQTVWATGVANIAPDTFLESDAEGNLIVLHRNTTGVEED 999
Query: 895 ERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQY 954
++ RLEV GE LGE VNR R + + +L + V P GTV G I + A + +
Sbjct: 1000 DKRRLEVTGEISLGEMVNRIRPVN-IQQL--ASVAVTPRAFLGTVEGSIYLFAIINPDHQ 1056
Query: 955 LFLEKLQTNLRKVIKGVGGLNHEQWRSFNNE-KKTVDAKNFLDGDLIESFLDLSRTRMDE 1013
FL +LQ + ++ VG + ++R F + ++ + F+DG+LIE FL + +E
Sbjct: 1057 DFLMRLQATIAGKVELVGNMPLNEFRGFRSMVREAKEPYRFVDGELIERFLTCEPSLQEE 1116
Query: 1014 ISKTM-NVSVEELCKRVEELTRLH 1036
I T+ +SV+E+ +E L RLH
Sbjct: 1117 IVSTVGKMSVDEVKGMIEALRRLH 1140
>gi|212539802|ref|XP_002150056.1| UV-damaged DNA binding protein, putative [Talaromyces marneffei ATCC
18224]
gi|210067355|gb|EEA21447.1| UV-damaged DNA binding protein, putative [Talaromyces marneffei ATCC
18224]
Length = 1139
Score = 438 bits (1127), Expect = e-120, Method: Compositional matrix adjust.
Identities = 326/1103 (29%), Positives = 551/1103 (49%), Gaps = 90/1103 (8%)
Query: 6 IYGRIATL-ELFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPT 63
+Y +++ L L P D LF+ T++ + L+WD+E + T R+ D++D+ R
Sbjct: 55 LYAKVSMLARLPAPANSTTDHLFVGTDQNTYFTLKWDSEKRRVRTERSYVDLADKASRAC 114
Query: 64 DNGQIGIIDPDCRLIGLHLYDGLFKVIP------------------FDNK-------GQL 98
NG +IDP + + L +++G+ V+P + N G+L
Sbjct: 115 QNGDRCLIDPSGKYMTLEIFEGIITVLPIIQLHKKRGKPPVLKTSHYSNPDEPTPQIGEL 174
Query: 99 KEAFNIRLEELQVLDIKFLY--GCAKPTIVVLYQDNKDARHVKTYEVALK---DKDFVEG 153
E R++EL V FL+ A P + +L++DN+ +K E++ + + F E
Sbjct: 175 GEPMMTRIDELMVRSSAFLHVESKAHPRLALLHEDNQRKVKLKIRELSFEASAEPVFQET 234
Query: 154 PWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYC--SANAFKAIPIRPSITKAYGRVDA 211
LD GA LIPVP PL G+L++GE I Y + N + P+ + T V
Sbjct: 235 EDFTEELDLGASHLIPVPAPLGGLLVLGETCIKYIDDAKNETISNPLDEA-TIFVAWVQV 293
Query: 212 DGSRYLLGDHAG-LLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGD 270
DG R+LL D G L L+++ + +V G K++ LGE S AS + YL + +IGS GD
Sbjct: 294 DGQRWLLADDYGRLFFLMLVLDSQNEVEGWKLDYLGEASRASVLIYLGAGMTFIGSHQGD 353
Query: 271 SQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQ-----------GQGQVVTCS 319
SQ+I++ ++GS+ E+++ N+ PI+DF ++DL + GQ ++VT S
Sbjct: 354 SQVIRI-----SEGSF-EIIQTISNIAPILDFTIMDLGAREGENYTHEFSSGQARIVTGS 407
Query: 320 GAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAM 378
GA+ DG+LR VR+G+G++E + E++ I MW+L+ S+ F LVV+F++ETR+
Sbjct: 408 GAFNDGTLRSVRSGVGMDELGVLGEMEHITDMWALQISSPGDFSDTLVVTFVNETRVFRF 467
Query: 379 NLEDELEE-TEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGY 437
+ + E+EE E G TL + +++ VT V + + S + ++W SP G+
Sbjct: 468 SSDGEVEELDEFLGLNLAENTLLSSNLPGGRIIHVTESGVSIADTDSGMVTSKW-SPDGH 526
Query: 438 SVNVATANASQVLLATGGGHLVYLEI-GDGILTEVKHAQLEYEISCLDINPIGENPSYSQ 496
++ N ++++ TGG + L+I GD + K + ++S + I P + SQ
Sbjct: 527 TITSVACNDERLVVVTGGQIIATLDITGDLKVLSQKDFGKDNQVSGVTIPP-----APSQ 581
Query: 497 IAAVGMWTDISVRIFSLPDLNLITKEHLG--GEIIPRSVLLCAF--EGISYLLCALGDGH 552
+ + V + L + + LG E PR+VLL + S L ++ DG
Sbjct: 582 VCIAAFPQNALVAVLDLHRFEELHSQSLGVASEAFPRTVLLAEILADSSSTLFVSMADGS 641
Query: 553 LLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTH-VFAASDRPTVIYSSNKKLLYS 611
++ F + ++ LT K+ LG++ T + + + VF + P++IY+S +++YS
Sbjct: 642 VVTFSYDAESHSLTGTNKLILGSEQPTFKKLPRGDGLYNVFTTCEHPSLIYASEGRIIYS 701
Query: 612 NVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQ 671
VN + S +C FN+ A+PD++A+A +L I +D + I+++P+G RR+ +
Sbjct: 702 AVNSEGASRVCHFNTEAYPDAIAVATSRDLKIALVDKERTTQIQTLPIGATVRRVAYSPT 761
Query: 672 SRTFAICSLKN--QSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSN 729
+ F I ++K + AE E FV L D+ F + ++ L E S++ +
Sbjct: 762 EKAFGIGTIKRRLEDGAEIVESQFV-LADEIMFRKLDSFDLKPDELVESVIRAQMVVGKD 820
Query: 730 VY--------YCVGTAYVLPEENEPTKGRILVFIVEDG-KLQLIAEKETKGAVYSLNAFN 780
Y + VGTAY+ E E +GRIL+F V+ KL L E KGA +L
Sbjct: 821 RYDEPIYKDRFIVGTAYLDDETAESIRGRILLFEVDSNRKLSLFLEHPVKGACRALAMMG 880
Query: 781 GKLLAAINQKIQLYKWMLR-DDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISL 839
++AA+ + + +++ + G LQ + + + IVV DLMKSIS+
Sbjct: 881 DYIVAALVKTVVIFEVTGQPQTGKYSLQKAAVYRTSTAPVDIAVTDKTIVVADLMKSISI 940
Query: 840 LIYKHEEG---AIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEER 896
+ + +E AR + W +AV + + +L ++ NL +R+N +G T+E+R
Sbjct: 941 VESNKTDALTMEAKEVARHFATVWTTAVADIGSNQWLVSDAEGNLIVLRRNVDGMTEEDR 1000
Query: 897 GRLEVVGEYHLGEFVNRFRHGSLVMRLPD-SDVGQIPTVIFGTVNGVIGVIASLPHEQYL 955
RLEV E LGE VNR R + +P S + P GTV G I + A + E
Sbjct: 1001 RRLEVTSELLLGEMVNRIRP----VNIPQTSTMAVTPKAFLGTVEGSIYLFALINPEHQD 1056
Query: 956 FLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAK-NFLDGDLIESFLDLSRTRMDEI 1014
FL +LQT + + G + ++R+F + + + F+DG+LIE FLD R +EI
Sbjct: 1057 FLMRLQTAISAYVDSPGLMPFNKFRAFRSTVREAEEPFRFVDGELIERFLDCDRAVQEEI 1116
Query: 1015 SKTMNVS-VEELCKRVEELTRLH 1036
+ +E + K +E L RLH
Sbjct: 1117 LGVVGSGDLESVQKMIEALRRLH 1139
>gi|358366432|dbj|GAA83053.1| UV-damaged DNA binding protein [Aspergillus kawachii IFO 4308]
Length = 1643
Score = 436 bits (1121), Expect = e-119, Method: Compositional matrix adjust.
Identities = 326/1081 (30%), Positives = 538/1081 (49%), Gaps = 91/1081 (8%)
Query: 18 PHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPTDNGQIGIIDPDCR 76
P D LF+ T+RY +C L WD E +++ T R D+SD R G +IDP R
Sbjct: 8 PANSPTDHLFVGTDRYTYCTLSWDGERNQIRTERNYVDISDPSSREAQTGNRCLIDPSGR 67
Query: 77 LIGLHLYDGLFKVIPF--------------------DNKGQLKEAFNIRLEELQVLDIKF 116
+ L +Y+G+ V+P G+L E R++EL V F
Sbjct: 68 FMTLEVYEGVIAVVPIVQLPSKKRGRQVAPPSGPDAPRVGELGEPTTARIDELFVRSSAF 127
Query: 117 LY-GCAKPTIVVLYQDNKDARHVKT----YEVALK----DKDFVEG--PWSQNNLDNGAD 165
L+ P + +LY+DN+ +K Y A D F E +SQ LD GA
Sbjct: 128 LHVQSGPPRLALLYEDNQKKVRLKVRALHYSAATSSTGADAAFEESLDGFSQE-LDLGAS 186
Query: 166 LLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIRP----SITKAYGRVDADGSRYLLGDH 221
LIPVP PL G+L++GE +I Y ++ I RP +I A+ +VD+ R+LL D
Sbjct: 187 HLIPVPAPLGGLLVLGETSIKYVDTDS-NEIVSRPLDEATIFVAWEQVDSQ--RWLLADD 243
Query: 222 AG-LLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQP 280
G L L+++ +V K++ LG T+ AS + YL V+++GS GDSQ++++
Sbjct: 244 YGRLFFLMLVLDSNNQVQSWKLDHLGNTARASVLIYLGGGVIFVGSHQGDSQVLRI---- 299
Query: 281 DAKGSYVEVLERYVNLGPIVDFCVVDL-----ERQ------GQGQVVTCSGAYKDGSLRI 329
GS +EV++ N+ PI+DF ++DL E Q GQ ++VT SGA+ DG+LR
Sbjct: 300 -GNGS-LEVIQTLSNIAPILDFTIMDLGNRTSESQTHEFSSGQARIVTGSGAFDDGTLRS 357
Query: 330 VRNGIGINEQASV-ELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETE 388
VR+G+G+ E + E++ I ++ L+ +T + L+V+F+ ETR+ N + E+EE +
Sbjct: 358 VRSGVGMEELGVLGEMELITDLFGLQLATKGGYLDTLLVTFVDETRVFQFNFDGEVEELD 417
Query: 389 -IEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANAS 447
G TL + +++QVT V + + NEW P + A+AN
Sbjct: 418 SFLGLSLSENTLLAMNLPGGRILQVTEQRVLIADIEGGMVTNEWTPPDRLVITSASANND 477
Query: 448 QVLLATGGGHLVYLEIGDGI-LTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDI 506
++L +GG L L+I + + + K + +IS + + P+ S + + VG
Sbjct: 478 SIVLVSGGQILTVLDINNDVRVVAQKDFGADSQISGVTV-PL----SSAGVCIVGFPQLA 532
Query: 507 SVRIFSLPDLNLITKEHLG--GEIIPRSVLLCAF--EGISYLLCALGDGHLLNFLLNMKT 562
V + L L+ + LG GE PRSVL+ S L ++ DG ++ + +
Sbjct: 533 KVSVLDLGRLSELHTTSLGPAGEAFPRSVLVADVLANSPSTLFISMADGSVITYAFDASD 592
Query: 563 GELTDRKKVSLGTQPITLRTFSSKNTTH-VFAASDRPTVIYSSNKKLLYSNVNLKEVSHM 621
LT ++ LG++ T + + + VFA + P++IY S +++YS VN + S +
Sbjct: 593 YSLTGMNRLILGSEQPTFKKLPRGDGLYNVFATCENPSLIYGSEGRIIYSAVNSEGASRV 652
Query: 622 CPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSL- 680
C FNS A+P S+A+A + +L I +D + I+++P+GE RR+ + + F I ++
Sbjct: 653 CHFNSEAYPGSIAVATKHDLKIALVDKERTTQIQTLPMGETARRVAYSPSEKAFGIGTIE 712
Query: 681 -KNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFS-----DDSNVY--- 731
K + AE + FV L D+ F + + L E S++ FS + +V+
Sbjct: 713 RKLKDGAEMVQSRFV-LADEIMFRRLDAFDLRPEELVESVIRAEFSAGKDENGRDVFKDR 771
Query: 732 YCVGTAYVLPEENEPTKGRILVFIVEDG-KLQLIAEKETKGAVYSLNAFNGKLLAAINQK 790
+ VGTAY+ E E +GRILVF +++G KL +AE KGA +L K++AA+ +
Sbjct: 772 FVVGTAYLDDENEESIRGRILVFEIDNGRKLTKVAELPVKGACRALAMLGEKIVAALVKT 831
Query: 791 IQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEG--- 847
+ +Y + D G +L+ + + V G+ I + DLMKS+ L+ Y E
Sbjct: 832 VVIYGVVNNDFGAMKLEKLASYRTSTAPVDVTVTGNVIAIADLMKSVCLVEYSEGENGMP 891
Query: 848 -AIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYH 906
++ E AR + W + V + D +L + NL +R+N G ++++ RLEV GE
Sbjct: 892 DSLTEVARHFQTVWATGVVCIAKDTFLETDAEGNLIVLRRNLTGVEEDDKRRLEVTGEIS 951
Query: 907 LGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRK 966
LGE VNR R ++ + V P GTV G I + A + E FL +LQ +
Sbjct: 952 LGEMVNRIRPVNIQQL---ASVTVTPRAFLGTVEGSIYLFAIINPEHQDFLMRLQATMAG 1008
Query: 967 VIKGVGGLNHEQWRSFNNE-KKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEEL 1025
++ +G + ++R F + ++ + F+DG+LIE FL + +EI ++ + +
Sbjct: 1009 KVESLGNIPFNEFRGFRSMVREAKEPYRFVDGELIERFLTCEPSLQEEIVDSVGMMNKGS 1068
Query: 1026 C 1026
C
Sbjct: 1069 C 1069
>gi|121699866|ref|XP_001268198.1| UV-damaged DNA binding protein, putative [Aspergillus clavatus NRRL
1]
gi|119396340|gb|EAW06772.1| UV-damaged DNA binding protein, putative [Aspergillus clavatus NRRL
1]
Length = 1140
Score = 435 bits (1119), Expect = e-119, Method: Compositional matrix adjust.
Identities = 331/1103 (30%), Positives = 544/1103 (49%), Gaps = 91/1103 (8%)
Query: 7 YGRIATL-ELFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPTD 64
Y R+ L L P D LF+ T+RY + L WD+ +++ T R D+SD R +
Sbjct: 56 YARVTMLARLPAPANSPTDHLFVGTDRYTYFTLSWDSSKNQVRTEREYVDISDPSSRESQ 115
Query: 65 NGQIGIIDPDCRLIGLHLYDGLFKVIPF--------------------DNKGQLKEAFNI 104
G +IDP R + L +Y+G+ V+P G+L E
Sbjct: 116 TGNRCLIDPSGRFMTLEIYEGVIAVVPIIQLPSKKRGRNVALPSGPDAPRVGELGEPATA 175
Query: 105 RLEELQVLDIKFLY-GCAKPTIVVLYQDN-KDAR--------HVKTYEVALKDKDFVEGP 154
R+EEL V FL+ P + +LY+DN K R H T + D F E
Sbjct: 176 RIEELFVRSSAFLHVQEGLPRLALLYEDNQKKVRLRVRELQYHAATGSTNIADATFGEPA 235
Query: 155 WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIRP----SITKAYGRVD 210
L+ G+ LIPVP PL G+LI+GE +I Y A+ + I RP +I A+ +VD
Sbjct: 236 DFTQELELGSSHLIPVPAPLGGLLILGETSIKYVDADNNEIIS-RPLDEATIFVAWEQVD 294
Query: 211 ADGSRYLLGDHAG-LLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYG 269
+ R+LL D G L L+++ +V K++LLG+TS AS + YL V+++GS G
Sbjct: 295 S--QRWLLADDYGRLFFLMLVLDSDNQVESWKLDLLGKTSRASVLVYLGGGVLFLGSHQG 352
Query: 270 DSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDL-----ERQ------GQGQVVTC 318
DSQ++++ + GS V+V++ N+ PI+DF ++DL E Q GQ ++VT
Sbjct: 353 DSQVLRI-----SNGS-VDVVQTLPNIAPILDFTIMDLGSRTSESQTHEFSSGQARIVTG 406
Query: 319 SGAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILA 377
SGA+ DG+LR VR+G+G+ E + E+ I +W L+ + F L+V+F+ ETR+
Sbjct: 407 SGAFDDGTLRSVRSGVGMEELGVLGEMDHITDLWGLQVGSKGDFLDILLVTFVDETRVFH 466
Query: 378 MNLEDELEETE-IEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPG 436
+ + E+EE + G TL + +++QVT V + + EW P
Sbjct: 467 FSPDGEVEEMDHFLGLSLSENTLLAANLPGGRILQVTERRVLIAEVEGGMVTYEWTPPNQ 526
Query: 437 YSVNVATANASQVLLATGGGHLVYLEIGDGI-LTEVKHAQLEYEISCLDINPIGENPSYS 495
+ A+ N ++L GG + L+IG+ + L K +IS + + +P+
Sbjct: 527 LIITAASTNNDMIVLVAGGELVTVLDIGNEVRLVTQKDFGANSQISGVTVP---ASPTGV 583
Query: 496 QIAAVGMWTDISVRIFSLPDLNLITKEHLG--GEIIPRSVLLCAF--EGISYLLCALGDG 551
IA +SV L DL + +G GE PRSVL+ + L ++ DG
Sbjct: 584 FIAGFPQLAKVSV--LKLQDLAEVQSTSVGPAGEAFPRSVLVADVLSDSPPTLFISMADG 641
Query: 552 HLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTT-HVFAASDRPTVIYSSNKKLLY 610
++ + N LT K+ LG++ T + + +VFA + P++IY S +++Y
Sbjct: 642 SVITYSFNPDDVSLTGMTKLILGSEQPTFKKLPRGDGLFNVFATCENPSMIYGSEGRIIY 701
Query: 611 SNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQE 670
S VN + S +C NS A+P S+A+A +L I +D + I+++P+G RR+ +
Sbjct: 702 SAVNSEGASRICHLNSEAYPGSIAVATLHDLKIALVDKERTTQIQTLPIGATVRRVAYSP 761
Query: 671 QSRTFAICSLKNQ--SCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSF---S 725
+ F I +++ + AE + F+ L D+ F + + L E S++ F
Sbjct: 762 SEKAFGIGTIERKLVDGAEIVKSEFM-LADEILFRRLDAFELRPEELVESVIRAEFPVGK 820
Query: 726 DDSNVY-----YCVGTAYVLPEENEPTKGRILVFIVEDG-KLQLIAEKETKGAVYSLNAF 779
D+ N + VGTA++ E +E +GRIL+F V++G KL +AE KGA +L
Sbjct: 821 DEKNGATVKDRFIVGTAFLDDEGDESIRGRILIFEVDNGRKLTQVAELPVKGACRALAML 880
Query: 780 NGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISL 839
+++AA+ + + +YK + + G +L+ + + V G+ I V DLMKS+ L
Sbjct: 881 GNRIVAALVKTVVVYKAVSNNFGAMKLEKLASYRTSTAPVDVTVTGNLIAVSDLMKSVCL 940
Query: 840 LIYKHEEGAIE----ERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEE 895
+ YK E + E AR + W + V + D +L ++ NL + +N+ G +++
Sbjct: 941 VEYKEGEDGLPDTLTEVARHFQTVWATGVACIAQDTFLESDAEGNLIILCRNTTGVEEDD 1000
Query: 896 RGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYL 955
+ RLEV GE LGE VNR R ++ + V P TV G I + A + +
Sbjct: 1001 KRRLEVTGEISLGEMVNRIRPVNIQQL---TSVAVTPRAFLATVEGSIYLFAMINPDHQD 1057
Query: 956 FLEKLQTNLRKVIKGVGGLNHEQWRSFNNE-KKTVDAKNFLDGDLIESFLDLSRTRMDEI 1014
FL +LQ + ++ VG + ++R F++ ++ + F+DG+LIE FL + +EI
Sbjct: 1058 FLMRLQATIAGKVELVGNMPFNEFRGFHSMVREAQEPYRFVDGELIERFLACEPSVQEEI 1117
Query: 1015 -SKTMNVSVEELCKRVEELTRLH 1036
S ++V+E+ +E L RLH
Sbjct: 1118 VSIVGKMTVDEVKSMIEALRRLH 1140
>gi|406865227|gb|EKD18269.1| CPSF A subunit region [Marssonina brunnea f. sp. 'multigermtubi'
MB_m1]
Length = 1146
Score = 435 bits (1118), Expect = e-119, Method: Compositional matrix adjust.
Identities = 336/1106 (30%), Positives = 552/1106 (49%), Gaps = 88/1106 (7%)
Query: 5 PIYGRIATLELFRPH-GEAQDFLFIATERYKFCVLQWDAESSELITRA-MGDVSDRIGRP 62
PIYG I L RP + D LF+ T R+++ L ++ E+ L TR DVS+R
Sbjct: 55 PIYGTITMLAKIRPEIAHSADQLFVGTSRFQYFTLAYNQETQALETRQNFVDVSERHMID 114
Query: 63 TDNGQIGIIDPDCRLIGLHLYDGLFKVI-PF-DNKGQ---LKEAFNIRLEELQVLDIKFL 117
+ + + ++DP + + L L++G+ P KG+ L + +R+ EL+V FL
Sbjct: 115 SQSRDLVLVDPAGKYVVLELFEGILSCFKPLRPRKGREDVLDKPEQVRITELRVRATTFL 174
Query: 118 YG-CAKPTIVVLYQDNKDAR-HVKTYEVALKDKDFVEGPWSQN------NLDNGADLLIP 169
Y +P I +L++D + TY + + S++ +LD GA LIP
Sbjct: 175 YTETQQPKIALLFEDGSGGEVRMATYRLVDEKGTLSRFDPSKDRENEISDLDPGASHLIP 234
Query: 170 VP-----------------PPLCGVLIIGEETIVYC---SANAFKAIPIRPSITKAYGRV 209
+P L GV+++GE Y S AF +I A+ +V
Sbjct: 235 IPKNAGQKRYIVRNSTVAKAHLGGVVVVGETKFTYLDDESKAAFDYPLNDAAIFVAWVKV 294
Query: 210 DADGSRYLLGDHAGLLHLL-VITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSY 268
D RYLLGD G LHLL ++++ +V G+ + LLG S A+T+ L + V ++GS
Sbjct: 295 D--DQRYLLGDDYGNLHLLSILSNHDGEVMGMDLMLLGTISKATTMVNLGDGVFFVGSHE 352
Query: 269 GDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDL-----ERQ------GQGQVVT 317
+SQ+I+++L D K Y+ +++ N+ PI+D V+D+ E Q GQ ++VT
Sbjct: 353 AESQVIRVDL--DRKDHYITIIQTMQNIAPILDLAVMDMGNREGESQSNEYSTGQTRLVT 410
Query: 318 CSGAYKDGSLRIVRNGIGINEQASV--ELQGIKGMWSLRSSTDDPFDTFLVVSFISETRI 375
SGA++ GSLR VR+G+G+ + + E+ I+ ++S+RS+ FD LVVS +ETR+
Sbjct: 411 GSGAFQSGSLRSVRSGVGLEDIGILVDEIGDIRDVYSMRSTAGTHFDDILVVSLPTETRV 470
Query: 376 LAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPP 435
E E E E G QTL ++QVT SV+++ + +W P
Sbjct: 471 FTFLGEIE-EVAEFRGLELNCQTLLASGLSNGMMLQVTESSVKILGPGPSYVAAKWTPPA 529
Query: 436 GYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEY--EISCLDINPIGENPS 493
G + A+AN S VL++ G LV L+IG G L EV LE +++C+ + +
Sbjct: 530 GEYITDASANDSYVLVSVSGTTLVSLDIGQG-LKEVAVQPLEAADQVACVYVPR-----N 583
Query: 494 YSQIAAVGMWTDISVRIFSLPDLNLITKEHL---GGEIIPRSVLLC----AFEGISYLLC 546
S I VG W S+ I +L +L +I E L IPR ++L A L
Sbjct: 584 LSDIGVVGFWKSGSISILNLSNLEIILSEDLRRKNNASIPRHIILAQLLPAVAAGPTLFV 643
Query: 547 ALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKN-TTHVFAASDRPTVIYSSN 605
A+ DG +L F ++ T L+ RK + LGT+ + +V A + PT+IY+S
Sbjct: 644 AMEDGVVLTFNVDKSTFSLSGRKSLVLGTEHAKFHLLPREGGLNNVLATCEHPTLIYASE 703
Query: 606 KKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRR 665
+++YS V + CPFNS +P SL +A + L I IDD ++ H+R++ +G+ RR
Sbjct: 704 GRIVYSAVTADDAKCACPFNSEEYPGSLVVATQRSLKISKIDDERQTHVRTVHIGKTVRR 763
Query: 666 ICHQEQSRTFAI-CSLKNQSCAEESEMHFVRLLDDQTF-EFISTYPLDT------FEYGC 717
I + R F I C + + EE L++D F E + PLD EY
Sbjct: 764 IAYSGAERAFGIGCIERTMADKEERYTSTFSLVEDVKFAEVGNPVPLDDENGTELIEYII 823
Query: 718 SILSCSFSDDSNVYYCVGTAYVLPEENEPT-KGRILVFIVEDGK-LQLIAEKETKGAVYS 775
+ +D + VGT+++ E +P KGRILVF ++ K L+A K A
Sbjct: 824 RTEVLNAREDLAERFIVGTSFLDEESADPNIKGRILVFGIDPKKNPYLVASLNLKCACRR 883
Query: 776 LNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMK 835
+ +GK++A +N+ + ++K++ + E + + + + + I + D+M+
Sbjct: 884 VAMLDGKIVAVLNKTVAMFKYVEITEKAGEFKKLATFRSSTVPIDIAITENIIAITDMMQ 943
Query: 836 SISLLIYKH-EEG---AIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGA 891
S+S++ Y +EG +E+ ARDY W +AV + D+ +L ++++ NL +++N +G
Sbjct: 944 SVSIVQYTPGKEGMPDKLEQVARDYQTCWGTAVTDIGDNSWLESDHHGNLLVLQRNIDGI 1003
Query: 892 TDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPH 951
T E++ RL + GE +LGE VN R ++ P +P T G I + +++
Sbjct: 1004 TLEDKQRLRITGEMNLGEQVNMIRKIAID---PSPTAMVVPKAFLATTEGSIYLFSTILD 1060
Query: 952 EQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNN-EKKTVDAKNFLDGDLIESFLDLSRTR 1010
L +LQ N+ + + +G L+ + +RSF + E+ T + F+DG+LIE FLD S
Sbjct: 1061 GSQDLLLRLQENITECVDTLGRLDFKTYRSFKSAERTTEEPYRFVDGELIERFLDESEDM 1120
Query: 1011 MDEISKTMNVSVEELCKRVEELTRLH 1036
+I + + +VE + VE L RLH
Sbjct: 1121 QQQICEGLGYTVEAIRDVVENLKRLH 1146
>gi|296803967|ref|XP_002842836.1| DNA damage-binding protein 1a [Arthroderma otae CBS 113480]
gi|238846186|gb|EEQ35848.1| DNA damage-binding protein 1a [Arthroderma otae CBS 113480]
Length = 1143
Score = 435 bits (1118), Expect = e-119, Method: Compositional matrix adjust.
Identities = 331/1113 (29%), Positives = 559/1113 (50%), Gaps = 106/1113 (9%)
Query: 6 IYGRIATLE-LFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPT 63
+YGRI L+ L RP D LF+ T++Y + L WDA ++L T R D++D R
Sbjct: 55 VYGRITALQKLPRPDPALTDVLFVGTDQYAYFSLTWDAVHNQLRTERKYIDLADGSLREA 114
Query: 64 DNGQIGIIDPDCRLIGLHLYDGLFKVIPF-----DNKG---------------QLKEAFN 103
+ ++DP + L +Y+G+ V+P NK QL E
Sbjct: 115 HSDDRCLLDPSGSFLTLEVYEGVVSVVPLVSADTHNKRSRSAAAHVTPSATLEQLGEPLQ 174
Query: 104 IRLEELQVLDIKFLYGCAK--PTIVVLYQDNKDARHVKTYEVA---------------LK 146
+R+EEL V FL A P + +LY+D++ +K ++ LK
Sbjct: 175 VRIEELMVRSSAFLDQEASHAPRLALLYEDSQGKARLKLRDLKYTHAALSGDGGSAAELK 234
Query: 147 DKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYC--SANAFKAIPIRPSITK 204
D + G LD GA LLIP+P PL G+LI+GE TI Y S N + P+ S T
Sbjct: 235 DVTTLSG-----ELDLGASLLIPIPRPLGGLLILGESTITYVDVSQNEIISRPLAES-TV 288
Query: 205 AYGRVDADGSRYLLGDHAG-LLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVY 263
V DG R+LL D G L L+++ V K++LLG+TS AS + YLD +V+
Sbjct: 289 FVAWVQVDGQRWLLADDYGRLFFLMLVLDPDNAVEAWKVDLLGQTSRASVLVYLDGGLVF 348
Query: 264 IGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQ---------GQGQ 314
+GS GDSQ+I++ +G + ++++ N+ PI+DF V+DL + GQ +
Sbjct: 349 VGSHQGDSQVIQIR-----EGGF-DLVQTIANIAPILDFTVMDLGDRSGEAREFSSGQTR 402
Query: 315 VVTCSGAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLRSSTDDPFDTFLVVSFISET 373
+VT SGA+ DGSLR VR+G+GI + + ++ I +W LR++ +PF L+VSF++E+
Sbjct: 403 IVTGSGAFGDGSLRSVRSGVGIEDLGVLASMEHITDLWGLRAACPEPFSDTLLVSFVNES 462
Query: 374 RILAMNLEDELEETEIEGFCS---QTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNE 430
R+ + E ++EE E EGF TL + N+++QVT +++ S
Sbjct: 463 RVFHFSPEGDVEEKE-EGFLGLVFSQSTLLAANLPGNRIIQVTENMAKIIDLDSS--MTT 519
Query: 431 WKSP-PGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDINPIG 489
W+S ++ A+AN ++L GG L+ + + K + + ++S + I
Sbjct: 520 WQSSHEDSAITSASANDDYLVLVFGGIRLICVSLSSYEEVGSKDFEADNQVSGMTIPA-- 577
Query: 490 ENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLG--GEIIPRSVLLCA--FEGISYLL 545
S +Q V + + I LPDL + K+ LG GE IPRSV++ ++ L
Sbjct: 578 ---SPAQACIVCLPQSAEIVILDLPDLKVQNKQTLGEPGEAIPRSVIVAEILYDQSPTLF 634
Query: 546 CALGDGHLLNFLLNMKTGELTDRKKVSLGT-QPITLRTFSSKNTTHVFAASDRPTVIYSS 604
++ DG + +F +++ +++ K++LG+ QP+ + +VFA D P++IY+S
Sbjct: 635 VSMADGTVFSFSFSLEAFTISNSSKITLGSEQPLFKKLPRGNGQYNVFATCDHPSLIYAS 694
Query: 605 NKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPR 664
+++YS V+ S +C N+ A+P S+A++ + EL I +D+ + I ++P+ R
Sbjct: 695 EGRIVYSAVDSDSASRICNLNTQAYPGSIALSNQRELKIAIVDEERTTQIHTLPMHASVR 754
Query: 665 RICHQEQSRTFAICSLKN--QSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSC 722
R+ + + F + ++ + E+ FV L D+ F +ST+ L + E S++
Sbjct: 755 RLAYSPVEKAFGLGTVTRTISNGVEKVSSSFV-LADEILFRPLSTHELRSDELVESVIRS 813
Query: 723 SFSDDSN-----VY---YCVGTAYVLPEENEPTKGRILVFIVEDGK-LQLIAEKETKGAV 773
D + V+ + VGTA++ ++ +GRIL F V + L L+ +K GA
Sbjct: 814 QIPDGEDEVGNTVFRDLFFVGTAFLDDVGDDNVRGRILAFEVNRSRELALLVDKPVLGAC 873
Query: 774 YSLNAF-NGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGD 832
+L N KL+A + + + +++ + G EL + + + V + I V D
Sbjct: 874 RTLAVMDNDKLVAGLVKSVSIFRIVRDSFGNIELLKQTAYRTSTAPIDVSVTENTIAVAD 933
Query: 833 LMKSISLLIY-KHEEGAI----EERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKN 887
+MKS+SL+ Y EEGA+ EE AR Y W +AV +++++YL AE NL +++N
Sbjct: 934 VMKSVSLVQYTPAEEGALEPKFEEIARHYQTLWSTAVGHIEENVYLLAEAEGNLVVLQRN 993
Query: 888 SEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIA 947
+ G T+ +R RL+ E LGE VNR + ++ P + + TV+G I +
Sbjct: 994 TTGVTESDRKRLQPTSEMRLGEMVNRIH--PITVQAP-AKAAVSARALLATVDGSIYLFG 1050
Query: 948 SLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAK-NFLDGDLIESFLDL 1006
+ L +LQ + V G + ++R+F + D F+DG+LIE FL
Sbjct: 1051 LINPTYIDLLLRLQAIMASVTVSPGEIPFTKYRAFRTTVRQSDEPFRFVDGELIERFLGC 1110
Query: 1007 SRTRMDEISKTM---NVSVEELCKRVEELTRLH 1036
+ + +EI+ + N++V L + ++EL R+H
Sbjct: 1111 APSTQEEIASRLDDQNITVASLKEMIDELRRMH 1143
>gi|440639387|gb|ELR09306.1| hypothetical protein GMDG_03874 [Geomyces destructans 20631-21]
Length = 1138
Score = 434 bits (1117), Expect = e-118, Method: Compositional matrix adjust.
Identities = 338/1103 (30%), Positives = 553/1103 (50%), Gaps = 91/1103 (8%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPTD 64
++ RI L RP LF+ T R + L WDA + L T A D S++ R
Sbjct: 55 LHARITMLAAIRPPTSCTAHLFVGTTRAHYFTLAWDAATRRLETVHAFVDASEKHMRDAA 114
Query: 65 NGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFN-------IRLEELQVLDIKFL 117
G+ +DP R + L L++G+ + ++ A + IR+ EL V FL
Sbjct: 115 GGERCAVDPSGRQLCLSLFEGVLSFVKVMKPRKVAAAGSYLDDPEQIRITELFVRATVFL 174
Query: 118 YG-CAKPTIVVLYQDNKDARHVKTYEVA-----LKDKDFVEGPWSQNNLDNGADLLIPVP 171
+ P + +LYQD +D + TY V D + + ++ GA LIPVP
Sbjct: 175 HTESTSPKVALLYQDGRDRVGLATYRVTDGRGQYGGFDPRKAREDELGVEVGASHLIPVP 234
Query: 172 ------------------PPLCGVLIIGEETIVYC---SANAFKAIPIRPSITKAYGRVD 210
L GV+++GE ++Y S + + SI A+
Sbjct: 235 KGEGVQRRYVVRNNASLKAQLGGVIVVGETRMLYLDDESKATVEHVLDEASIFVAW--TA 292
Query: 211 ADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGD 270
DG RYLLGD G LHLL + + E VTG+ I+ S ST+ L++ +++IGS GD
Sbjct: 293 YDGLRYLLGDEYGWLHLLTLVVDAEVVTGMTIKKFVRISRPSTMVCLEDDLLFIGSHDGD 352
Query: 271 SQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQ-----------GQGQVVTCS 319
SQ++KL+L DAK EV++ N+ PIVDF V+D+ + GQ ++VT S
Sbjct: 353 SQVLKLDL--DAK--VAEVVQTLDNIAPIVDFTVMDMGSRSEEARANEFSSGQARIVTGS 408
Query: 320 GAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAM 378
GA+++GSLR VR+G+G+ + + E+ IKG+++L+++ + DT LV+SF +ETR+
Sbjct: 409 GAFQEGSLRSVRSGVGLEDIGQLGEMDNIKGLYTLQTNNSEFHDT-LVISFSTETRVFRF 467
Query: 379 N-LEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGY 437
+ + E E G + TL + +++Q+T L+ S S W+ PG
Sbjct: 468 DSEGEVEEVEEFLGLSFEEHTLLAANVSNGRILQITPSKALLIDSESGVAVASWQPAPGE 527
Query: 438 SVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQL--EYEISCLDINPIGENPSYS 495
+ A+ N LL+ G L+ L + D L+E+ +++C+D+ P E P
Sbjct: 528 IITAASTNEDYALLSADGKSLISLNL-DNDLSEIARQDFGDTDQVACVDV-PNTETP--- 582
Query: 496 QIAAVGMWTDISVRIFSLPDLNLITKEHLGGE---IIPRSVLLCAFEGISY---LLCALG 549
I VG+W SV I L L+ I + L + +PRS++L ++ LL A+
Sbjct: 583 -IGLVGLWQSGSVSIIDLRTLHPIQGDTLRNDDTAAVPRSMVLAQILPKTFGPTLLVAMS 641
Query: 550 DGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFS-SKNTTHVFAASDRPTVIYSSNKKL 608
DG + ++ + T LT+RK V LGTQ LR + +VFA + ++IYSS ++
Sbjct: 642 DGVVHSYSVTPCTFALTNRKSVVLGTQQANLRVLPRAGGLMNVFATCEHSSLIYSSEGRI 701
Query: 609 LYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICH 668
+YS V ++ + +CPFN+AA+PD++ +A E+ I ID ++ H+R++P+GE RR+ +
Sbjct: 702 IYSAVTAEDATFICPFNAAAYPDAIVVATASEIKISQIDTERRTHVRTLPMGETVRRVTY 761
Query: 669 QEQSRTFAICSLKNQSC-AEESEMHFVRLLDDQTF-EFISTYPLDTFEYGCSILSCSFSD 726
+ F + ++K + EE RL+D+ F E + L T + + + +
Sbjct: 762 SPSEKVFGLGAIKRELIDGEEVIESSFRLVDEIVFSELGKPFQLGTSQGEELVEAVIRAP 821
Query: 727 DSNVY------YCVGTAYVLPEENEPTKGRILVFIVEDGKLQL-IAEKETKGAVYSLNAF 779
N Y + VGT+++ +GRILVF V+ + IAE + KGA L
Sbjct: 822 LPNTYGTPQERFIVGTSFLDDNPGLSYRGRILVFGVDSSRNPYKIAEYKVKGACRCLGVI 881
Query: 780 NGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISL 839
+GK++AA+ + I ++++ + ++ + + + G+ I V DLMKS+SL
Sbjct: 882 DGKIVAALVKTIVVFEYTELSGTSARIEKVASYRTSTCPVDLAIEGNTIAVADLMKSVSL 941
Query: 840 LIYKH----EEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEE 895
+ Y+ E + E AR + + W +AV +D+ +L A+ + NL +R+N T E+
Sbjct: 942 VEYRAGTSGEAPTLVEVARHFQSVWATAVAHVDEG-WLEADADGNLIVLRRNEAAVTFED 1000
Query: 896 RGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQ-IPTVIFGTVNGVIGVIASLPHEQY 954
R ++EV GE+HLGE VNR R +R+ S+ +P T G + + S+
Sbjct: 1001 RKKMEVTGEFHLGEQVNRIRK----IRVDASEGATVVPRAFLATTEGSLFLYGSVAPASQ 1056
Query: 955 LFLEKLQTNLRKVIKGVGGLNHEQWRSFNN-EKKTVDAKNFLDGDLIESFLDLSRTRMDE 1013
L +LQ L + ++ G + +RSF N E++T + F+DG+LIE FLDL R +
Sbjct: 1057 DLLLRLQQRLAENVETPGNIPFTTYRSFRNAERETEEPYRFIDGELIERFLDLDEERQEV 1116
Query: 1014 ISKTMNVSVEELCKRVEELTRLH 1036
+ K + VEE+ VEEL R+H
Sbjct: 1117 VCKGL-AKVEEVRDLVEELRRMH 1138
>gi|115397303|ref|XP_001214243.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
gi|114192434|gb|EAU34134.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
Length = 1140
Score = 434 bits (1116), Expect = e-118, Method: Compositional matrix adjust.
Identities = 329/1105 (29%), Positives = 545/1105 (49%), Gaps = 93/1105 (8%)
Query: 6 IYGRIATL-ELFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPT 63
+Y R+ L L P D LF+ T+RY +C L WD+ +++ T R DVSD R +
Sbjct: 55 LYARVTMLARLPAPANSPTDHLFVGTDRYTYCTLSWDSSQNQIRTERNYVDVSDPSSRES 114
Query: 64 DNGQIGIIDPDCRLIGLHLYDGLFKVIPF--------------------DNKGQLKEAFN 103
G +IDP R + L +Y+GL VIP G+L E
Sbjct: 115 QTGNRCLIDPSGRFMTLEVYEGLVAVIPIVQLPGKKRGRAPAMPSGPDAPQVGELGELTT 174
Query: 104 IRLEELQVLDIKFLY-GCAKPTIVVLYQDNKDARHVKTYEVAL--------KDKDFVEGP 154
R++EL V FL+ P + +LY+DN+ +K E+ D +
Sbjct: 175 ARIDELFVRSSAFLHVQSGLPRLALLYEDNQKKVRLKVRELQYTAATSSSGADATLADLA 234
Query: 155 WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPI---RPSITKAYGRVDA 211
LD GA LIPVP PL G+LI+GE +I Y + + + +I A+ +VD+
Sbjct: 235 DFAQELDLGASHLIPVPAPLGGLLILGETSIKYVDDDNNEIVSRLLDEATIFVAWEQVDS 294
Query: 212 DGSRYLLGDHAGLLHLLVITHEKE-KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGD 270
R+LL D G L L++ + E +V G +++ LG TS AST+ YL V+++GS GD
Sbjct: 295 --QRWLLADDYGRLFFLMLVLDSENQVQGWQLDHLGNTSRASTLVYLGGGVIFVGSHQGD 352
Query: 271 SQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDL-----ERQ------GQGQVVTCS 319
SQ++++ GS+ E+++ N+ PI+DF ++DL E Q GQ ++VT S
Sbjct: 353 SQVLRV-----GDGSF-EIIQTLPNIAPILDFTIMDLGNRTSESQTHEFSSGQARIVTGS 406
Query: 320 GAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAM 378
GA+ DG+LR VR+G+G+ E + E++ I +W L+ F L+V+FI ETR+
Sbjct: 407 GAFDDGTLRSVRSGVGMEELGVLGEMEHITDLWGLQFKAKGDFLDTLLVTFIDETRVFHF 466
Query: 379 NLEDELEE-TEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGY 437
+ + E+EE + G TL + +++QVT V + S + EW
Sbjct: 467 SSDGEVEELDQFLGLSLSENTLLAANLPGGRILQVTERRVLIADMESEMVTYEWTPSDQL 526
Query: 438 SVNVATANASQVLLATGGGHLVYLEIGDGI-LTEVKHAQLEYEISCLDI--NPIGENPSY 494
+ A+AN V+L GG + +I + + + K+ + +IS + + +P G
Sbjct: 527 IITSASANEDSVVLVAGGELMTVFDIRNNVQIVTQKNFGADSQISGVTVPSSPTG----- 581
Query: 495 SQIAAVGMWTDISVRIFSLPDLNLITKEHLG--GEIIPRSVLLC--AFEGISYLLCALGD 550
+ VG V + L DL + LG GE PRSVL+ + + ++ D
Sbjct: 582 --VCIVGFPQSAKVSVLKLQDLTELHATSLGPEGEAFPRSVLVANVLVDSPPTIFISMAD 639
Query: 551 GHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTH-VFAASDRPTVIYSSNKKLL 609
G ++ + N LT K+ LG++ T + + + VFA + P++IY S +++
Sbjct: 640 GSVITYSFNANDYSLTGMNKLILGSEQPTFKKLPRGDGLYNVFATCENPSLIYGSEGRII 699
Query: 610 YSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQ 669
YS VN + S +C FNS A+P+S+A+A EL I +D + I+++P+G RR+ +
Sbjct: 700 YSAVNSEGASRVCHFNSEAYPESIAVATSHELKIALVDKERTTQIQTLPMGATVRRVAYS 759
Query: 670 EQSRTFAICSL--KNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFS-- 725
+ F I ++ K + AE + FV L D+ F + + L + E S++ F
Sbjct: 760 PSEKAFGIGTIERKLEEGAEIVKSQFV-LADEILFRRLDAFDLRSEELVESVIRAEFPVG 818
Query: 726 ------DDSNVYYCVGTAYVLPEEN-EPTKGRILVFIVEDG-KLQLIAEKETKGAVYSLN 777
D + VGTAY+ +E+ + +GRIL+F V++G KL +AE KGA +L
Sbjct: 819 KDEKGRDMFKDRFVVGTAYLDEDEDRDSIRGRILMFEVDNGRKLTKVAELAVKGACRALA 878
Query: 778 AFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSI 837
K++AA+ + + +YK + G +L+ + + + + I V DLMKS
Sbjct: 879 MLGDKVVAALVKTVVIYKVTGNNFGAMKLEKLASYRTSTAPVDITVTDNVIAVSDLMKSS 938
Query: 838 SLLIY-KHEEG---AIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATD 893
L+ Y + E+G +++E AR + W + + + YL ++ NL +R+N G +
Sbjct: 939 CLVEYIEGEDGLPDSLKEVARHFQTVWATGIACIAPHTYLESDAEGNLIILRRNLSGVEE 998
Query: 894 EERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQ 953
+++ RLEV GE LGE VNR R ++ + V P GTV G I + A + E
Sbjct: 999 DDKRRLEVTGEISLGEMVNRIRPVNIQQL---ASVTVTPRAFLGTVEGSIYLYAIINPEH 1055
Query: 954 YLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNE-KKTVDAKNFLDGDLIESFLDLSRTRMD 1012
FL +LQ + I+ +G + ++R F + ++ + F+DG+LIE FL + +
Sbjct: 1056 QDFLMRLQATMAGKIESLGDMPFNEFRGFRSMVREAKEPYRFVDGELIERFLTCEPSVQE 1115
Query: 1013 EISKTMN-VSVEELCKRVEELTRLH 1036
+I ++ ++V ++ +E L RLH
Sbjct: 1116 DIVNSVGMMNVHDVKVMIEALRRLH 1140
>gi|303313681|ref|XP_003066852.1| CPSF A subunit region family protein [Coccidioides posadasii C735
delta SOWgp]
gi|240106514|gb|EER24707.1| CPSF A subunit region family protein [Coccidioides posadasii C735
delta SOWgp]
gi|320031496|gb|EFW13458.1| UV-damaged DNA binding protein [Coccidioides posadasii str. Silveira]
Length = 1144
Score = 432 bits (1110), Expect = e-118, Method: Compositional matrix adjust.
Identities = 328/1115 (29%), Positives = 555/1115 (49%), Gaps = 109/1115 (9%)
Query: 6 IYGRIATLE-LFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPT 63
IYG+I+ L+ + R H A D LF+ T+RY + L WD + +L T + D++D R
Sbjct: 55 IYGKISVLQKVPRSHSSATDLLFVGTDRYAYFTLSWDPSTCQLHTEQKYLDIADPSLRDN 114
Query: 64 DNGQIGIIDPDCRLIGLHLYDGLFKVIPF-------------------DNKGQLKEAFNI 104
+G +DP + + + +Y+G+ VIP + + L E
Sbjct: 115 QSGDRSWVDPSGKFLTMEIYEGIISVIPIAQEPLKRPSPSASRSSGTSEQREYLGEPLQT 174
Query: 105 RLEELQVLDIKFLY--GCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEG-PWSQ---- 157
R+EEL V FL+ P I +LY++ + +K L+D + G P +
Sbjct: 175 RIEELIVRSTAFLHHDPTKPPRIAILYENTQGKVKLK-----LRDLIYSRGIPGGEASAA 229
Query: 158 ---------NNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIRP----SITK 204
++L+ GAD+L+PVP PL GVLI+GE+ I Y + I RP +I
Sbjct: 230 EFRDVDDLYDDLELGADILVPVPLPLGGVLILGEKFIKYIDTVKNETI-TRPLENNTIFV 288
Query: 205 AYGRVDADGSRYLLGDHAG-LLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVY 263
A+ ++D R+LL D G L ++I + V K+ LLGETS AS + +L VV+
Sbjct: 289 AWEQLD--NQRWLLADDYGRLFFFMLILNSANAVQSWKVGLLGETSRASALVHLGGGVVF 346
Query: 264 IGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQG----------QG 313
+GS GDS +I++ +GS+ E+++ N+GPI+DF V+DL +G Q
Sbjct: 347 LGSHQGDSHVIRIT-----EGSF-EIIQTLSNIGPILDFTVMDLGNRGETPTHEFSSGQA 400
Query: 314 QVVTCSGAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLRSSTDDPFDTFLVVSFISE 372
++VT SGA++DGSLR VR+G+G+ + + ++ I +W + + + F L++SF+ E
Sbjct: 401 RIVTGSGAFRDGSLRSVRSGVGMEDLGVLGAMEHITDLWGVSAFCPEGFCDTLLLSFVDE 460
Query: 373 TRILAMNLEDELEETE-IEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEW 431
+R+ + + E+EE + G T+ + +++QVT R+ SR EW
Sbjct: 461 SRVFHFSPDGEVEEKDDFLGLLLGEPTIHAANLPSRRILQVTEHGARVTDVESRMTLWEW 520
Query: 432 KSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGI-LTEVKHAQLEYEIS--CLDINPI 488
+ + A++N ++L GG L+ +IGD I ++ K + + ++S L +PI
Sbjct: 521 SAVESRKITAASSNDRHLVLMVGGQKLMVFDIGDDIKMSSTKTFEADKQVSGVALTSSPI 580
Query: 489 GENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLG--GEIIPRSVLL-CAF-EGISYL 544
Q + V I L DLN+ E LG G+ +PRSVL+ C F + L
Sbjct: 581 -------QACILCFPQSAEVTIIDLTDLNIRHTETLGEPGDAVPRSVLVACMFSDRAPTL 633
Query: 545 LCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTH-VFAASDRPTVIYS 603
A+ DG + +F LN+ L+D K+ LG++ + + + +FA D P++IY+
Sbjct: 634 FVAMADGSVFSFSLNVANYSLSDANKLVLGSEAPVFKLLPRADGLYNIFATCDHPSLIYA 693
Query: 604 SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHP 663
S +++YS VN + + +C FN+ A+P ++A+A EL I +D + I+++ + E
Sbjct: 694 SEDRIVYSAVNSDKATRICHFNAEAYPGAIAVATPDELKIALVDAERTTQIQTLMINETV 753
Query: 664 RRICHQEQSRTFAICSLKNQ--SCAEESEMHFVRLLDDQTFEFISTYPLDTFEY-GCSIL 720
RR + R F + +++ EE + HF+ L D+ F +S + L+ E C I
Sbjct: 754 RRTSYSSTERAFGLGTIQRTLVQNVEEVKSHFI-LADEIMFRQLSVFDLNPNELVECVIR 812
Query: 721 SCSFSDDSNV-------YYCVGTAYV-LPEENEP-TKGRILVFIVEDGK-LQLIAEKETK 770
+ ++ + + VGT+ + PEE E TKGRIL+F V+ + L+ I + +
Sbjct: 813 TEHPGSNAQMGSSRPRDIFIVGTSVLDTPEEAEARTKGRILIFDVDTNRELRKICDFPVR 872
Query: 771 GAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVV 830
GA +L N K++AA+ + + + + E++ E + + + G+ I V
Sbjct: 873 GACRALAMINNKIVAALMKTVVVLNIKKGNLYNFEIEKEASYRTSTAPVDISVTGNIIAV 932
Query: 831 GDLMKSISLLIYKHEEGA----IEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRK 886
DLMKSISL+ Y EG ++E AR Y W +A + ++ +L A+ NL + +
Sbjct: 933 ADLMKSISLVEYHAGEGGQPDTLKEVARHYQTLWTTAAAPVAENEFLVADAEGNLVVLNR 992
Query: 887 NSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVI 946
N+ G T+++R R++V E LGE VNR H + P+S V IP TV+G I +
Sbjct: 993 NTTGVTEDDRRRMQVTSELRLGEMVNRI-HPMDLQTSPESPV--IPKAFLATVDGSIYLF 1049
Query: 947 ASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAK-NFLDGDLIESFL- 1004
+ L +LQ+ L + G + ++R+F + + + F+DG+LIE FL
Sbjct: 1050 GLISPSAQDTLMRLQSALADFVASPGEIPFNKYRAFKSSVRQAEEPFRFVDGELIEQFLT 1109
Query: 1005 ---DLSRTRMDEISKTMNVSVEELCKRVEELTRLH 1036
D+ + + V+V E+ +E L R+H
Sbjct: 1110 FPPDIQEAVLARMDGGGRVNVIEIKDMIEGLKRMH 1144
>gi|392864500|gb|EAS34654.2| UV-damaged DNA binding protein [Coccidioides immitis RS]
Length = 1144
Score = 429 bits (1102), Expect = e-117, Method: Compositional matrix adjust.
Identities = 327/1115 (29%), Positives = 556/1115 (49%), Gaps = 109/1115 (9%)
Query: 6 IYGRIATLE-LFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPT 63
IYG+I+ L+ + R H A D LF+ T+RY + L WD + +L T + D++D R
Sbjct: 55 IYGKISVLQKVPRSHSSATDLLFVGTDRYAYFTLSWDPSTCQLHTEQKYLDIADPSLRDN 114
Query: 64 DNGQIGIIDPDCRLIGLHLYDGLFKVIPF-------------------DNKGQLKEAFNI 104
+G +DP + + + +Y+G+ VIP + + L E
Sbjct: 115 QSGDRSWVDPSGKFLTMEIYEGIISVIPIAQEPLKRPSPSAGRSSGTSEQREYLGEPLQT 174
Query: 105 RLEELQVLDIKFLY--GCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEG-PWSQ---- 157
R+EEL V FL+ P I +LY++ + +K L+D + G P +
Sbjct: 175 RIEELIVRSTAFLHHDPTKPPRIAILYENTQGKVKLK-----LRDLIYSRGIPGGEASAA 229
Query: 158 ---------NNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIRP----SITK 204
++L+ GAD+L+PVP PL GVLI+GE+ I Y + I RP +I
Sbjct: 230 EFRDVDDLYDDLELGADILVPVPLPLGGVLILGEKFIKYIDTVKNETI-TRPLEHNTIFV 288
Query: 205 AYGRVDADGSRYLLGDHAG-LLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVY 263
A+ ++D R+LL D G L ++I + V K+ LLGETS AS + +L VV+
Sbjct: 289 AWEQLD--NQRWLLADDYGRLFFFMLILNSANAVQSWKVGLLGETSRASALVHLGGGVVF 346
Query: 264 IGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQG----------QG 313
+GS GDS +I++ +GS+ E+++ N+GPI+DF V+DL +G Q
Sbjct: 347 LGSHQGDSHVIRIT-----EGSF-EIIQTLSNIGPILDFTVMDLGNRGETPTHEFSSGQA 400
Query: 314 QVVTCSGAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLRSSTDDPFDTFLVVSFISE 372
++VT SGA++DGSLR VR+G+G+ + + ++ I +W L + + F L++SF+ E
Sbjct: 401 RIVTGSGAFRDGSLRSVRSGVGMEDLGVLGAMEHITDLWGLSAFCPEGFCDTLLLSFVDE 460
Query: 373 TRILAMNLEDELEETE-IEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEW 431
+R+ + + E+EE + G T+ + +++QVT R+ SR EW
Sbjct: 461 SRVFHFSPDGEVEEKDDFLGLLLGEPTIHAANLPSRRILQVTEHGARVTDVESRMTLWEW 520
Query: 432 KSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGI-LTEVKHAQLEYEIS--CLDINPI 488
+ + A++N ++L GG L+ +IGD I ++ K + + ++S L +PI
Sbjct: 521 SAVESRKITAASSNDRHLVLMVGGQKLMVFDIGDDIKMSSTKTFEADKQVSGVALTSSPI 580
Query: 489 GENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLG--GEIIPRSVLL-CAF-EGISYL 544
Q + V I L LN+ E LG G+ +PRSVL+ C F + L
Sbjct: 581 -------QACILCFPQSAEVTIIDLTGLNIRHTETLGEPGDAVPRSVLVACMFSDRAPTL 633
Query: 545 LCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTH-VFAASDRPTVIYS 603
A+ DG + +F LN+ L+D K+ LG++ + + + +FA D P++IY+
Sbjct: 634 FVAMADGSVFSFSLNVANYSLSDANKLVLGSEAPVFKLLPRADGLYNIFATCDHPSLIYA 693
Query: 604 SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHP 663
S +++YS VN + + +C FN+ A+P ++A+A ++ I +D + I+++ + E
Sbjct: 694 SEDRIVYSAVNSDKATRICHFNAEAYPGAIAVATPDDIKIALVDAERTTQIQTLMINETV 753
Query: 664 RRICHQEQSRTFAICSLKNQ--SCAEESEMHFVRLLDDQTFEFISTYPLDTFEY-GCSIL 720
RR + R F + +++ EE + HF+ L D+ F +S + L+ E C I
Sbjct: 754 RRTSYSSTERAFGLGTIQRTLVQNVEEVKSHFI-LADEIMFRQLSVFDLNPNELVECVIR 812
Query: 721 SCSFSDDSNV-------YYCVGTAYV-LPEENEP-TKGRILVFIVEDGK-LQLIAEKETK 770
+ S ++ + + VGT+ + PEE E TKGRILVF V+ + L+ I + +
Sbjct: 813 TEHPSSNAQMGSSRPRDIFIVGTSVLDTPEEAEARTKGRILVFDVDTNRELRKICDFPVR 872
Query: 771 GAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVV 830
GA +L N K++AA+ + + + + E++ E + + + G+ I V
Sbjct: 873 GACRALAMINNKIVAALMKTVVVLNIKKGNLYNFEIEKEASYRTSTAPVDISVTGNIIAV 932
Query: 831 GDLMKSISLLIYKHEEGA----IEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRK 886
DLMKSISL+ Y EG ++E AR Y W +A + ++ +L A+ NL + +
Sbjct: 933 ADLMKSISLVEYHAGEGGQPDTLKEVARHYQTLWTTAAAPVAENEFLVADAEGNLVVLNR 992
Query: 887 NSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVI 946
++ G T+++R R++V E LGE VNR H + P+S V IP TV+G I +
Sbjct: 993 DTTGVTEDDRRRMQVTSELRLGEMVNRI-HPMDLQTSPESPV--IPKAFLATVDGSIYLF 1049
Query: 947 ASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNE-KKTVDAKNFLDGDLIESFL- 1004
+ L +LQ+ L + G + ++R+F + ++ + F+DG+LIE FL
Sbjct: 1050 GLISPSAQDTLMRLQSALADFVASPGEIPFNKYRAFKSSVRQAEEPFRFVDGELIEQFLT 1109
Query: 1005 ---DLSRTRMDEISKTMNVSVEELCKRVEELTRLH 1036
D+ + + V+V E+ +E L R+H
Sbjct: 1110 FPPDIQEAALARMDGGGRVNVIEIKGMIEGLKRMH 1144
>gi|255956643|ref|XP_002569074.1| Pc21g20880 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211590785|emb|CAP96985.1| Pc21g20880 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 1140
Score = 428 bits (1100), Expect = e-117, Method: Compositional matrix adjust.
Identities = 324/1097 (29%), Positives = 540/1097 (49%), Gaps = 99/1097 (9%)
Query: 18 PHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPTDNGQIGIIDPDCR 76
P D LF+ T++Y + + WD ES+++ T R+ D+++ R + +IDP R
Sbjct: 65 PTNSPTDHLFVGTDQYNYFTITWDRESNQIKTARSCVDIAEPSSRESQCAPRCLIDPTGR 124
Query: 77 LIGLHLYDGLFKV---------------------IPFDNKGQLKEAFNIRLEELQVLDIK 115
+ L +Y+G+ V +P G+L + R++EL V
Sbjct: 125 FMTLEVYEGVIVVVPIVQPTKKRGRMSMGGPQADLPL-QVGELDKPTTSRIDELFVRSSA 183
Query: 116 FLYGCAKPTIVVLYQDNKDARHVKTYEV--------ALKDKDF-----VEGPWSQNNLDN 162
FL+ + P + +LY+DN+ ++ E+ L D F +EG LD
Sbjct: 184 FLHSESNPWLALLYEDNQQKVRLRIRELDFTPGTAGTLADATFKEVQKLEGGEFGQELDL 243
Query: 163 GADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPI---RPSITKAYGRVDADGSRYLLG 219
G+ LIP+P PL G++++GE +I Y NA I ++ A+ +VD+ R+LL
Sbjct: 244 GSSHLIPIPAPLGGLIVLGETSIKYIDDNANDVITRNLDEATVFVAWEKVDS--QRWLLA 301
Query: 220 DHAGLLHLL-VITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNL 278
D G L L I + ++ K+E LG+TS AS + YL ++++GS +GDSQ+++L+
Sbjct: 302 DDYGRLFFLSFILNNMGEIDDWKLEYLGKTSRASVLVYLGGGMLFVGSHHGDSQVLRLD- 360
Query: 279 QPDAKGSYVEVLERYVNLGPIVDFCVVDL-----ERQ------GQGQVVTCSGAYKDGSL 327
GS EV++ N+ PI+DF ++DL E Q GQ ++VT SGA+ DG+L
Sbjct: 361 -----GSSFEVIQTLSNIAPILDFTIMDLGNRTSESQTHEFSSGQARIVTGSGAFDDGTL 415
Query: 328 RIVRNGIGINEQASV-ELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEE 386
R VR+G+G+ E + E++ I W L++ + D F L+V+F+ ETR+ + + E+EE
Sbjct: 416 RSVRSGVGMEELGVLGEMEHITDFWGLQTRSKDDFLNTLIVTFVDETRVFQFSTDGEVEE 475
Query: 387 TE-IEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATAN 445
+ G TL ++++QVT V + S + EW + A+AN
Sbjct: 476 LDNFLGLSLTECTLLVTRLQGDRILQVTEQRVLVADLESGMVTFEWAPQDQKLITAASAN 535
Query: 446 ASQVLLATGGGHLVYLEIGDGI-LTEVKHAQLEYEISCLDI--NPIGENPSYSQIAAVGM 502
++L G + +I D + + K + +IS L + +P G + G
Sbjct: 536 EDHLVLVISGQIVASFDIRDNVQIITQKDLGADQQISGLTVPSSPTG-------VFIAGF 588
Query: 503 WTDISVRIFSLPDLNLITKEHLG--GEIIPRSVLLCAFEGISY--LLCALGDGHLLNFLL 558
V I ++ D ++ + LG GE PRSVL+ S+ L ++ DG ++ F L
Sbjct: 589 PQSAKVSIMAIKDFAILQTKSLGPTGESFPRSVLVAEVLADSHPTLFISMADGCVITFSL 648
Query: 559 NMKTGELTDRKKVSLGTQPITLRTFSSKNTTH-VFAASDRPTVIYSSNKKLLYSNVNLKE 617
N + L++ K+ LG++ T + + + VFA + P++IY S +++YS VN +
Sbjct: 649 NPQDYSLSEMNKLILGSEQPTFKKLPKGDGLYNVFATCENPSLIYGSEGRIIYSAVNSEG 708
Query: 618 VSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAI 677
S +C N+ AFPDS+A+A E EL I +D + I+++P+G RR+ + + F I
Sbjct: 709 ASRVCHLNAEAFPDSIAVATEKELKIALVDKERTTQIQTLPMGSTVRRVAYSPSEKAFGI 768
Query: 678 CSLKNQ--SCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSF--SDDSNV--- 730
++ + + AE + HFV L D+ F + L E S++ D N
Sbjct: 769 GTIDRKLVNGAEVVKSHFV-LADEIMFRRLDALELGPDELVESVVRAELPAGKDENGKEI 827
Query: 731 ---YYCVGTAYVLPEENEPTKGRILVFIVEDG-KLQLIAEKETKGAVYSLNAFNGKLLAA 786
+ VGTA+ E++E +GRIL+ V+ G KL +AE GA +L ++AA
Sbjct: 828 MKDRFVVGTAFADEEQDESIRGRILILEVDHGRKLSQVAELPVMGACRALAMMGDCVVAA 887
Query: 787 INQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKH-- 844
+ + + +Y+ + + G +L+ + + V D I V DLMKS+ L+ Y
Sbjct: 888 LVKTVVVYRVKINNVGPMKLEKLAAYRTSTAPVDVIVVDDLIAVADLMKSLCLVRYTPGH 947
Query: 845 --EEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVV 902
E + E R Y W +A+ + D+ +L ++ NL + +N G T +++ RL
Sbjct: 948 AGEPAKLTEVGRHYQTVWSTAIACVGDETFLQSDAEGNLIVLSRNMNGVTAQDKHRLMPT 1007
Query: 903 GEYHLGEFVNRFRHGSLVMRLPD-SDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQ 961
E LGE VNR R + +P S V P TV G I + A + E FL LQ
Sbjct: 1008 SEISLGEMVNRIRP----VNIPQLSSVMVTPRAFMATVEGSIFLFAVINPEHQDFLMTLQ 1063
Query: 962 TNLRKVIKGVGGLNHEQWRSFNNEKKTVDAK-NFLDGDLIESFLDLSRTRMDEISKTMNV 1020
+L I +G L+ +++RSF ++ +A F+DG+LIE FL+ S + +EI + +
Sbjct: 1064 ASLSTKINSLGNLSFDKFRSFRTMVRSAEAPYRFVDGELIEQFLNCSPSMQEEIVQEIGS 1123
Query: 1021 S-VEELCKRVEELTRLH 1036
S V E+ + +E L RLH
Sbjct: 1124 SDVVEVKRMIEALRRLH 1140
>gi|320593036|gb|EFX05445.1| uv-damaged DNA-binding protein [Grosmannia clavigera kw1407]
Length = 1504
Score = 428 bits (1100), Expect = e-117, Method: Compositional matrix adjust.
Identities = 337/1157 (29%), Positives = 562/1157 (48%), Gaps = 146/1157 (12%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPTD 64
I+G I+ L+ RP + LF+ T+R+++ L WD ES L T DV ++ R +
Sbjct: 368 IFGTISMLQSLRPIDSKTELLFVGTDRFEYFTLTWDMESHSLRTVDTFQDVGEKHLRDSQ 427
Query: 65 NGQIGIIDPDCRLIGLHLYDGLFKVIPF---------DNKGQLKEAFNIRLEELQVLDIK 115
+ ++DP R + L L++G+ V+ D+ L +RL EL +
Sbjct: 428 SQDRCLVDPSGRFMALLLWEGVLTVLRLQTRRDRARPDHMRHLVPMDQVRLSELFIKAST 487
Query: 116 FLYG-CAKPTIVVLYQDNKDARHVK--TYEVALKDKDFVEGPWSQNNL--------DNGA 164
FL+ +P I LYQ D K Y + D++ G + N D GA
Sbjct: 488 FLHSETGQPQIAFLYQSQGDRAEAKLSAYRLTAGDRNVDTGRFDANRDREISMDIDDPGA 547
Query: 165 DLLIPV-----------------PPPLCGVLIIGEETIVY--CSANAFKAIPIRP-SITK 204
LLIPV L G+L++GE ++Y + +P+R SI
Sbjct: 548 ALLIPVEKVEPAKRHNVRNTATATANLGGLLVVGETRLLYIDSTTKCTVEVPLRAASIFV 607
Query: 205 AYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYL-DNAVVY 263
A+ R DA + YLL D G LHLL I V L + +G+TS AS + YL D +++
Sbjct: 608 AWARYDA--THYLLADEYGTLHLLTILVSGAVVDNLDVSPIGKTSRASCLVYLPDRRLLF 665
Query: 264 IGSSYGDSQLIKLNLQPDAKGSY-VEVLERYVNLGPIVDFCVVDLERQ------------ 310
+GS GDSQL +L+L G + ++VL+ N+ P++DF V+D+ +
Sbjct: 666 VGSHNGDSQLFRLDLAASPVGLHELQVLQ---NIAPVLDFTVMDMGNREDDQQLANEYAS 722
Query: 311 GQGQVVTCSGAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLRS---STDDPFDTFLV 366
GQ ++VT SG +KDGSLR VR+G+G+ + + +L G++G++SLRS DT LV
Sbjct: 723 GQARIVTGSGVHKDGSLRSVRSGVGLEDIGILGDLGGVRGLFSLRSPQTQQQQQVDT-LV 781
Query: 367 VSFISETRILAMNLEDELEETE-IEGFCSQTQTLFCHD-----AIYNQLVQVTSGSVRLV 420
SF++ETR+ + + E+EE E G TQTL + + ++L+Q+T GSV L
Sbjct: 782 ASFLTETRVFLFDGDGEIEEVEAFPGLNLGTQTLLATNLSSGSSSSSRLLQITPGSVTLA 841
Query: 421 SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEV-KHAQLEYE 479
+ S + W P ++ A+AN+ VLL+ G LV L + + + K +
Sbjct: 842 ETASGTIVASWTPPDDRTIVSASANSRWVLLSVEGTTLVSLSLDNSLAVAAQKEVGTTDQ 901
Query: 480 ISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEI--------IPR 531
I+CL P + VG W V S+ DL + +H G + +PR
Sbjct: 902 IACLHAAP-----QLLDVGVVGQWASGMV---SVVDLATLEPKHGGKSLRRRDDNASVPR 953
Query: 532 SVLLCAF--EGIS--YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTF--SS 585
S++L G++ L A+ DG+++ F ++ L+ RK V LGT+ L S
Sbjct: 954 SIVLAQVLPPGMAGPTLFVAMDDGNVITFAVSPSDLSLSGRKSVVLGTRHARLHPLPQSD 1013
Query: 586 KNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGT 645
+ T +FA ++ P++IY S ++ Y+ V ++ + +C F++AAFP ++A+A + ++ +
Sbjct: 1014 EATYSIFATTEHPSLIYGSEGRIAYAAVTAEDANFVCHFDAAAFPGAIAVATDSQIKLSR 1073
Query: 646 IDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQ-SCAEESEMHFVRLLDDQTFEF 704
D ++ H+R+I +GE RRI + R FA+ +K + + E ++L+D+ F+
Sbjct: 1074 TDTTKQTHVRAIDMGETIRRIAYSATERVFALGCIKRELTQGREVVTSSLKLVDEVAFQP 1133
Query: 705 ISTYPLDTFEYGCSILSCSFS-------DDSNVYYCVGTAYVLP--EENEPTKGRILVFI 755
+ PL G ++ C + + VGT+++ ENE T GRILV
Sbjct: 1134 LGK-PLALEVEGTELVECVVRAELRDALGNPAERFLVGTSFMAAGSSENEHTLGRILVVG 1192
Query: 756 VE-DGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHG 814
V+ D +++ +G L + ++A +++ + L ++ + EL+ +
Sbjct: 1193 VDSDHSPYIVSSHRVRGPCRCLAMVDDLIVAGLSKTVVLSRYTETSSMSGELKKVASYRT 1252
Query: 815 HILALYVQTRGDFIVVGDLMKSISLLIY-------------------KHEEG-------- 847
+ + G I VGD+MKS +L+ Y K +G
Sbjct: 1253 ATYVVDLAVDGHMIAVGDMMKSTALVEYIPATSGDGEDEEDDGAGDNKKGKGKTADRSKT 1312
Query: 848 -----AIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVV 902
+ ERAR Y A+W +AV ++ D++L A+ NL + ++ +G T +++ RL V
Sbjct: 1313 IAEGPKLVERARGYQASWATAVCHVEGDLWLEADGFGNLTMLERDVQGVTADDKRRLRTV 1372
Query: 903 GEYHLGEFVNRFRHGSLVMRLPDSDVGQI--PTVIFGTVNGVIGVIASLPHEQYLFLEKL 960
GE +LGE VNR R ++ ++ G + P TV G I ++ ++ E L L
Sbjct: 1373 GEMYLGEMVNRIRPIAV-----ETSPGAMVHPRAFLATVEGSIYMVGTIAPEAQDLLMNL 1427
Query: 961 QTNLRKVIKGVGGLNHEQWRSFNN-EKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN 1019
QT L ++KG G + +RSF N E+++ + F+DG+L+E FLD+ E+++ +
Sbjct: 1428 QTKLAAIVKGPGNTSFSAYRSFRNAERESTEPFRFVDGELLERFLDVGEDVQKEVAQGLG 1487
Query: 1020 VSVEELCKRVEELTRLH 1036
SVE+L +EEL RLH
Sbjct: 1488 PSVEDLRNIIEELKRLH 1504
>gi|340960602|gb|EGS21783.1| hypothetical protein CTHT_0036510 [Chaetomium thermophilum var.
thermophilum DSM 1495]
Length = 1100
Score = 427 bits (1097), Expect = e-116, Method: Compositional matrix adjust.
Identities = 331/1114 (29%), Positives = 553/1114 (49%), Gaps = 105/1114 (9%)
Query: 13 LELFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPTDNGQIGII 71
L+ RP D LF+AT+R+ + L W+ ++ L T + D +R R + + ++
Sbjct: 2 LQKLRPRDSKTDLLFVATDRFDYFTLLWNPQTFRLETVNSYKDPGERYMRDSQSQDRCLV 61
Query: 72 DPDCRLIGLHLYDGLFKVIPF----DNKGQLKEAFNIRLEELQVLDIKFLYG-CAKPTIV 126
DP R + +HL++G+ V +N L+ +RL EL + FLY P +
Sbjct: 62 DPSGRFMIMHLWEGVLNVWRMGHRKNNAMVLEWMEQVRLSELFIKASTFLYTETGHPKVA 121
Query: 127 VLYQDNKDARHVK--TYEVALKDKDFVEG---PWSQNNLD-----NGADLLIPV------ 170
LYQ DA K TY + D+D V P LD + A +LIPV
Sbjct: 122 FLYQTRADATDAKLATYRLTSDDRDTVASQFNPEKDRELDMEVRDSTASMLIPVRKVEEE 181
Query: 171 ------------PPPLCGVLIIGEETIVY---CSANAFKAIPIRPSITKAYGRVDADGSR 215
L G++++GE ++Y + +++ PSI A+ + +
Sbjct: 182 VKRHNVRHVELAKAHLGGLIVVGETRLLYIDEVTKAKVESVLKEPSIFVAWAEYNV--TH 239
Query: 216 YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIK 275
Y L D G LHLL I VTG+++ +GETS AST+ YL + +++GS YG+SQL++
Sbjct: 240 YFLADDYGHLHLLRIKTNGVVVTGMEVSRIGETSRASTLVYLGDNYLFVGSHYGNSQLLR 299
Query: 276 LNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQ------------GQGQVVTCSGAYK 323
L+L+ ++++E + N+GPI+DF ++D+ + GQ ++VTCSG +K
Sbjct: 300 LDLENQNPKQRLQLIESFQNIGPILDFAIMDMGNRGDSGQPGNEYSSGQARIVTCSGVHK 359
Query: 324 DGSLRIVRNGIGINEQASV-ELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLED 382
DG+LR VR+G+G+ + + +L+ +G++SLRS + LV+SF++ETR+ + +
Sbjct: 360 DGTLRSVRSGVGLEDIGILADLELCRGLFSLRSHGSLKTN-ILVMSFLTETRVFKFDHQG 418
Query: 383 ELEETEIEGFCSQT---QTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSV 439
++E E+ FC T QTL D Q++ VT + L+ + S W G +
Sbjct: 419 DIE--ELSSFCGMTLDQQTLLAVDLPSGQILHVTPAAATLLDTESGVAITSWTPEEGRCI 476
Query: 440 NVATANASQVLLATGGGHLVYLEIGDGI-LTEVKHAQLEYEISCLDINPIGENPSYSQIA 498
A+ANA +LL+ G LV L + + + + + K+ +I+CL + P S I
Sbjct: 477 INASANAEWLLLSVDGVGLVSLSLSNDLRILKEKNLNQSDQIACLHV-----PPQSSGIG 531
Query: 499 AVGMWTDISVRIFSLPDLNLITKEHL----GGEIIPRSV----LLCAFEGISYLLCALGD 550
VG WT +V I L L I E L IP+ + LL LL A D
Sbjct: 532 VVGFWTSGTVSIIDLHTLEPIHGESLRTSKDDTSIPQDLALVQLLPPEVSGPLLLIATQD 591
Query: 551 GHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTH-VFAASDRPTVIYSSNKKLL 609
G++++F ++ + +++V LG +N + V A ++ P++IY S +++
Sbjct: 592 GNVVSFNISSDYS-FSGKRRVVLGVTQAKFHLLPQENNLYSVLATTEHPSLIYGSEGRIV 650
Query: 610 YSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQ 669
YS V +E +++CPF+S AFPD +A+A + ++ + +D +K ++RS+PL E RRI +
Sbjct: 651 YSAVTAEEAAYVCPFDSEAFPDCVALATDSQIKLARLDRERKTYVRSLPLNEMVRRIAYS 710
Query: 670 EQSRTFAICSLKNQSC-AEESEMHFVRLLDDQTFEFI-STYPLDTFEYGCSILSCSFSDD 727
+ + F + +K + EE +L+D+ F+ + + L + Y + + ++
Sbjct: 711 PKEKVFGLGCIKRELVKGEEIVQSSFKLVDEVLFDRVGKSIELGSPSYTELVECVTRAEF 770
Query: 728 SNVY------YCVGTAYV-------LPEENEP-TKGRILVF-IVEDGKLQLIAEKETKGA 772
+ Y + VGT+Y+ P P +GRILV I D I + KGA
Sbjct: 771 IDSYGNPAERFLVGTSYLPDPDYSPAPSHGNPEARGRILVLGIDSDRNPYQILSYQLKGA 830
Query: 773 VYSLNAF-NGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVG 831
L +GK++ + + + + ++ T +L + + + G I V
Sbjct: 831 CRCLAVMDDGKVVVGLTKAVTVCEYKETSSTTAQLTKLASYRPSTYPVEIAIHGRTIAVA 890
Query: 832 DLMKSISLLIY-KHEEGA---IEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKN 887
DLMKSISL+ Y EEG + ERAR Y + W +AV + + ++L A+ NL +R+N
Sbjct: 891 DLMKSISLVDYIPAEEGGQAKLVERARHYQSAWSTAVGYVQNGLWLEADAQGNLQVLRQN 950
Query: 888 SEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIA 947
+G T+++R R+E+ E +LGE VNR R + V P++ + IP GTV G I +
Sbjct: 951 VDGITEDDRKRMELTAEINLGEMVNRIRSIT-VETSPEALI--IPRAFLGTVEGGIYMFG 1007
Query: 948 SL-PHEQYLFLEKLQTNLRKVIKGVG---GLNHEQWRSFNNEKKTVDAK-NFLDGDLIES 1002
++ PH L L + Q + VIK VG N +R+F N ++ FLDG+L+E
Sbjct: 1008 TIAPHALDLLL-RFQEKVADVIKAVGDSDNANFRSYRAFKNAERVGHGPFRFLDGELLER 1066
Query: 1003 FLDLSRTRMDEISKTMNVSVEELCKRVEELTRLH 1036
FLD+ + I + +VE++ VEEL R+H
Sbjct: 1067 FLDVDEAIQEVICSGLGPTVEDMRNIVEELRRMH 1100
>gi|134077422|emb|CAK45676.1| unnamed protein product [Aspergillus niger]
Length = 1133
Score = 426 bits (1095), Expect = e-116, Method: Compositional matrix adjust.
Identities = 326/1103 (29%), Positives = 546/1103 (49%), Gaps = 116/1103 (10%)
Query: 6 IYGRIATL-ELFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPT 63
++ ++ L L P D LF+ T+RY +C L WD E +++ T R D+SD R
Sbjct: 55 LFAKVTMLARLPAPANSPTDHLFVGTDRYTYCTLSWDGERNQIRTERNYVDISDPSSREA 114
Query: 64 DNGQIGIIDPDCRLIGLHLYDGLFKVIPF--------------------DNKGQLKEAFN 103
G +IDP R + L +Y+G+ V+P G+L E
Sbjct: 115 QTGNRCLIDPSGRFMTLEVYEGVIAVVPIVQLPSKKRGRQVAPPSGPDAPRVGELGEPTT 174
Query: 104 IRLEELQVLDIKFLYGCAKPTIVVLYQDNKDARHVKT----YEVALK----DKDFVEG-- 153
R++EL + +LY+DN+ +K Y A D F E
Sbjct: 175 ARIDEL---------------LALLYEDNQKKVRLKVRALHYSAATASTGADAAFEESLD 219
Query: 154 PWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIRP----SITKAYGRV 209
+SQ LD GA LIPVP PL G+L++GE +I Y ++ + + RP +I A+ +V
Sbjct: 220 GFSQE-LDLGASHLIPVPAPLGGLLVLGETSIKYVDTDSNEIVS-RPLDEATIFVAWEQV 277
Query: 210 DADGSRYLLGDHAG-LLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSY 268
D+ R+LL D G L L+++ +V K++ LG T+ AS + YL V+++GS
Sbjct: 278 DS--QRWLLADDYGRLFFLMLVLDSNNQVQSWKLDHLGNTARASVLIYLGGGVIFVGSHQ 335
Query: 269 GDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDL-----ERQ------GQGQVVT 317
GDSQ++++ GS +EV++ N+ PI+DF ++DL E Q GQ ++VT
Sbjct: 336 GDSQVLRI-----GNGS-LEVIQTLSNIAPILDFTIMDLGNRTSESQTHEFSSGQARIVT 389
Query: 318 CSGAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLRSSTDDPFDTFLVVSFISETRIL 376
SGA+ DG+LR VR+G+G+ E + E++ I ++ L+ +T + L+V+F+ ETR+
Sbjct: 390 GSGAFDDGTLRSVRSGVGMEELGVLGEMELITDLFGLQLATKGGYLDTLLVTFVDETRVF 449
Query: 377 AMNLEDELEETE-IEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPP 435
N + E+EE + G TL + +++QVT V + + NEW P
Sbjct: 450 QFNFDGEVEELDSFLGLSLSENTLLAMNLPGGRILQVTEQRVLIADIEGGMVTNEWTPPD 509
Query: 436 GYSVNVATANASQVLLATGGGHLVYLEIGDGI-LTEVKHAQLEYEISCLDINPIGENPSY 494
+ A+AN ++L +GG + L+I + + + K + +IS + + P+ S
Sbjct: 510 NLVITSASANNDSIVLVSGGQLMTVLDINNDVRVISQKDFGADSQISGVTV-PL----SS 564
Query: 495 SQIAAVGMWTDISVRIFSLPDLNLITKEHLG--GEIIPRSVLLCAF--EGISYLLCALGD 550
+ + VG V + L L+ + LG GE PRSVL+ S L ++ D
Sbjct: 565 AGVCIVGFPQLAKVSVLDLGRLSELHTTSLGPAGEAFPRSVLVADVLANSPSTLFISMAD 624
Query: 551 GHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTH-VFAASDRPTVIYSSNKKLL 609
G ++ + + + LT ++ LG++ T + + + VFA + P++IY S +++
Sbjct: 625 GSVITYSFDARNYSLTGMNRLILGSEQPTFKKLPRGDGLYNVFATCENPSLIYGSEGRII 684
Query: 610 YSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQ 669
YS VN + S +C FNS A+P S+A+A +L I +D + I+++P+GE RR+ +
Sbjct: 685 YSAVNSEGASRVCHFNSEAYPGSIAVATRHDLKIALVDKERTTQIQTLPMGETARRVAYS 744
Query: 670 EQSRTFAICSL--KNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFS-- 725
+ F I ++ K + AE + FV L D+ F + + L E S++ FS
Sbjct: 745 PSEKAFGIGTIERKLKDGAEMVQSRFV-LADEIMFRRLDAFDLRPEELVESVIRAEFSAG 803
Query: 726 ---DDSNVY---YCVGTAYVLPEENEPTKGRILVFIVEDG-KLQLIAEKETKGAVYSLNA 778
+ +V+ + VGTAY+ E E +GRILVF +++G KL +AE KGA +L
Sbjct: 804 KDENGRDVFKDRFVVGTAYLDDENEESIRGRILVFEIDNGRKLTKVAELPVKGACRALAM 863
Query: 779 FNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSIS 838
K++AA+ + + +Y + D G +L+ + + V G+ I V DLMKS+
Sbjct: 864 LGEKIVAALVKTVVIYGVVNNDFGAMKLEKLASYRTSTAPVDVTVTGNVIAVADLMKSVC 923
Query: 839 LLIYKHEEG----AIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDE 894
L+ Y E ++ E AR + W + V + D +L + NL +R+N G ++
Sbjct: 924 LVEYSEGENGSPDSLTEVARHFQTVWATGVSCIAKDTFLETDAEGNLIVLRRNLTGVEED 983
Query: 895 ERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQY 954
++ RLEV GE LGE VNR R ++ Q+ +V TV G I + A + E
Sbjct: 984 DKRRLEVTGEISLGEMVNRIRP---------VNIQQLASV---TVEGSIYLFAIINPEHQ 1031
Query: 955 LFLEKLQTNLRKVIKGVGGLNHEQWRSFNNE-KKTVDAKNFLDGDLIESFLDLSRTRMDE 1013
FL +LQ + ++ +G + ++R F + ++ + F+DG+LIE FL + +E
Sbjct: 1032 DFLMRLQATMAGKVESLGNIPFNEFRGFRSMVREAKEPYRFVDGELIERFLTCEPSLQEE 1091
Query: 1014 ISKTMN-VSVEELCKRVEELTRL 1035
I ++ ++V+E + LT L
Sbjct: 1092 IVDSVGMMNVDERIRPTVYLTSL 1114
>gi|67516629|ref|XP_658200.1| hypothetical protein AN0596.2 [Aspergillus nidulans FGSC A4]
gi|40747539|gb|EAA66695.1| hypothetical protein AN0596.2 [Aspergillus nidulans FGSC A4]
gi|259489136|tpe|CBF89158.1| TPA: damaged DNA binding protein (Eurofung) [Aspergillus nidulans
FGSC A4]
Length = 1132
Score = 426 bits (1094), Expect = e-116, Method: Compositional matrix adjust.
Identities = 327/1100 (29%), Positives = 539/1100 (49%), Gaps = 91/1100 (8%)
Query: 6 IYGRIATLE-LFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPT 63
I+ R+ L L P D LF+ T+RY + L WD+ +++ T R D++D R
Sbjct: 55 IFARVTMLACLPAPANSPTDHLFVGTDRYSYFTLSWDSARNQVRTERDYVDIADPSSRDA 114
Query: 64 DNGQIGIIDPDCRLIGLHLYDGLFKVIPF--------------------DNKGQLKEAFN 103
G +IDP R + L +YDG+ VIP G+L E
Sbjct: 115 RTGSRCMIDPSGRFMTLEIYDGMIVVIPIIQLPSKRRGRQVALPTGPDAPRIGELGEPII 174
Query: 104 IRLEELQVLDIKFLY-GCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDFVEGPWSQ 157
R++EL V FL+ P + +LY+DN+ R +K A + +F
Sbjct: 175 TRIDELFVRSSAFLHVQAGSPRLALLYEDNQKKVKLKVRELKYSTAAGAESEFTSIADYA 234
Query: 158 NNLDNGADLLIPVPPPLC---GVLIIGEETIVYCSA--NAFKAIPIR-PSITKAYGRVDA 211
LD GA LIPVP PL G+LI+GE +I Y A N + P+ +I A+ +VD+
Sbjct: 235 QELDLGASHLIPVPAPLAAAGGLLILGETSIKYVDADNNEIVSQPLEEATIFVAWEQVDS 294
Query: 212 DGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDS 271
R+LL D G L L++ +V ++ LG TS AS + YL VV++GS GDS
Sbjct: 295 --QRWLLADDYGRLFFLMLVLRNSEVERWELHSLGNTSRASVLVYLGGGVVFVGSHQGDS 352
Query: 272 QLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDL-----ERQ------GQGQVVTCSG 320
Q+I++ Q +V++ N+ P++DF ++DL E Q GQ ++VT SG
Sbjct: 353 QVIRIGDQS------FQVIQTLSNIAPVLDFTIMDLGNRTSENQMHEFSSGQARIVTGSG 406
Query: 321 AYKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMN 379
A+ DG+LR VR+G+G+ E + +++ I +W L+ + F L+V+F++ETR+ +
Sbjct: 407 AFDDGTLRSVRSGVGLEELGVLGDMEHITDLWGLQVGSRGDFLDTLLVTFVNETRVFRFS 466
Query: 380 LEDELEETE-IEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYS 438
+ E EE E G TL + ++++QVT V + EW
Sbjct: 467 PDGEAEELESFLGLSLSENTLLAANLPGSRILQVTEQRVLIADIECGMTIFEWTPKNQLI 526
Query: 439 VNVATANASQVLLATGGGHLVYLEI-GDGILTEVKHAQLEYEISCLDINPIGENPSYSQI 497
+ A+AN ++L GG H+ L+I + + K + +IS + + + + +
Sbjct: 527 ITAASANDDTIVLVAGGKHVTVLDIQSEARVVSEKDFGADNQISGVTLPT-----TPTDV 581
Query: 498 AAVGMWTDISVRIFSLPDLNLITKEHLG--GEIIPRSVLLCAF--EGISYLLCALGDGHL 553
VG V + L DL+ I+ LG GE PRSVL+ + E L ++ DG +
Sbjct: 582 CIVGFPQLAKVSVLKLQDLSHISSTSLGPAGEAFPRSVLVASVLAENAPTLFISMADGSV 641
Query: 554 LNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKN-TTHVFAASDRPTVIYSSNKKLLYSN 612
+ + N + L+ K+ LG++ T + N ++VFA + P++IY S +++YS
Sbjct: 642 ITYDYNDQDHSLSGMNKLVLGSEQPTFKKLPRGNGLSNVFATCENPSLIYGSEGRIIYSA 701
Query: 613 VNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQS 672
VN + S +C FNS A+P+S+A+A EL IG +D + I+++P+ RR+ +
Sbjct: 702 VNSEGASRICHFNSEAYPESIAVATAQELKIGLVDKERTTQIQTLPIKATVRRVAYSPSE 761
Query: 673 RTFAICSLKNQ--SCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSC-------- 722
+ F + +++ + S E + FV L D+ F + + L+ G I+ C
Sbjct: 762 KAFGMGTIERKLVSGEEIVKSQFV-LADEILFRRLDAFDLE----GEEIVECVIRAEAPE 816
Query: 723 SFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDG-KLQLIAEKETKGAVYSLNAFNG 781
S ++ + VG+AY+ ++ + T G I VF V++G KL +A++ KGA +L
Sbjct: 817 SKDGEAKDRFVVGSAYLGEDDGDSTLGYIRVFEVDNGRKLAKVAQERVKGACRALAVMGD 876
Query: 782 KLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLI 841
K++AA+ + + +++ + R G +LQ + + + + I + DLMKS+ ++
Sbjct: 877 KIVAALVKTVVVFQVVPRSGGL-QLQRLASYRTSTAPVDITVTRNVIAIADLMKSVCVVE 935
Query: 842 YKH-EEGA---IEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERG 897
Y E GA + E AR + W + V + D YL ++ NL +R+N G +++R
Sbjct: 936 YHEGENGAPDKLVEVARHFQTVWATGVTSVAPDTYLESDAEGNLIVLRRNRSGVEEDDRR 995
Query: 898 RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFL 957
RLEV GE L E VNR R + + +LP + V +P TV G I + A + + FL
Sbjct: 996 RLEVTGEICLNEMVNRIRPVN-IQQLPSATV--VPRAFLATVEGSIYLYAIINPDYQDFL 1052
Query: 958 EKLQTNLRKVIKGVGGLNHEQWRSFNN-EKKTVDAKNFLDGDLIESFLDLSRTRMDEISK 1016
+LQ + +GG+ +R+F ++ + F+DG+LIE FL EI
Sbjct: 1053 MRLQATMASRADSLGGIPFTDYRAFRTMTRQATEPYRFVDGELIERFLTCEPAVQKEIVD 1112
Query: 1017 TMNVSVEELCKRVEELTRLH 1036
+ S+EE+ VE L RLH
Sbjct: 1113 IVGSSLEEVRAIVEALRRLH 1132
>gi|238491136|ref|XP_002376805.1| UV-damaged DNA binding protein, putative [Aspergillus flavus
NRRL3357]
gi|220697218|gb|EED53559.1| UV-damaged DNA binding protein, putative [Aspergillus flavus
NRRL3357]
Length = 1117
Score = 425 bits (1092), Expect = e-116, Method: Compositional matrix adjust.
Identities = 332/1110 (29%), Positives = 551/1110 (49%), Gaps = 104/1110 (9%)
Query: 6 IYGRIATL-ELFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPT 63
+Y +++ L L P D LF+ T+RY +C L WD+ + + T R D++D R +
Sbjct: 33 LYAKVSMLARLPAPAHSPTDHLFVGTDRYTYCTLSWDSAQNRIRTERNYVDIADPSSRES 92
Query: 64 DNGQIGIIDPDCRLIGLHLYDGLFKVIPF--------------------DNKGQLKEAFN 103
G +IDP R + L +Y+GL V+P G+L E
Sbjct: 93 QTGNRCLIDPSGRFMTLEVYEGLVAVVPIVQLPARKRGRAPAVPTGPDAPKVGELGELTT 152
Query: 104 IRLEELQVLDIKFLY-GCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQN---- 158
R++EL V FL+ P + +LY+DN+ K + +++ ++ S +
Sbjct: 153 ARIDELFVRSSAFLHVQSGLPRLALLYEDNQ-----KKVRLMVRELNYTSATASTSADAT 207
Query: 159 ---------NLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIRP----SITKA 205
LD GA LIPVP PL G+LI+GE +I Y + + I RP +I A
Sbjct: 208 LTHIADFAQELDLGASHLIPVPAPLGGLLILGETSIKYVDDDNNEIIS-RPLDEATIFVA 266
Query: 206 YGRVDADGSRYLLGDHAG-LLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYI 264
+ VD+ R+LL D G L L+++ +V G K++ LG TS AS + YL +V++
Sbjct: 267 WEGVDS--QRWLLADDYGRLFFLMLVLDSDNQVQGWKLDHLGNTSRASALVYLGGGIVFV 324
Query: 265 GSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDL-----ERQ------GQG 313
GS GDSQ++++ GS+ E+++ N+ PI+DF ++DL E Q GQ
Sbjct: 325 GSHQGDSQVLRI-----GNGSF-EIIQALSNIAPILDFTIMDLGNRTSESQTHEFSSGQA 378
Query: 314 QVVTCSGAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLR-SSTDDPFDTFLVVSFIS 371
++VT SGA+ DG+LR VR+G+G+ E + +++ I +W L+ + D DT L+V+FI
Sbjct: 379 RIVTGSGAFDDGTLRSVRSGVGMEELGVLGDMEHITDLWGLQVQAGGDTLDT-LLVTFID 437
Query: 372 ETRILAMNLEDELEETE-IEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNE 430
ETR+ + + E+EE + G TL + +++QVT V + + E
Sbjct: 438 ETRVFHFSPDGEVEELDHFLGLSLSENTLLAANLPRGRILQVTEQRVLIADLEGGMVVYE 497
Query: 431 WKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD--GILTEVKHAQLEYEISCLDINPI 488
W P + A+AN ++L GG + L+IG ++TE K + ++S + +
Sbjct: 498 WTPPNELVITAASANDDSLVLVIGGELMTVLDIGTEAQVITEKKFGA-DSQVSGVTVPA- 555
Query: 489 GENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLG--GEIIPRSVLLCAF--EGISYL 544
S +++ VG V + L DL + LG GE PRSVL+ + L
Sbjct: 556 ----SPTEVCVVGFPQLAKVSVLRLRDLTEVHTTSLGPAGEAFPRSVLVADVLADSPPTL 611
Query: 545 LCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTH-VFAASDRPTVIYS 603
++ DG ++ + L+ K+ LG++ T + + + VFA + P++IY
Sbjct: 612 FISMADGSVITYSFKTDDYSLSHMNKLILGSEQPTFKKLPRGDGLYNVFATCENPSLIYG 671
Query: 604 SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHP 663
S +++YS VN + S +C FNS A+P S+A+A EL I +D + I+++ +G
Sbjct: 672 SEGRIIYSAVNSEGASRVCHFNSEAYPGSIAVATLHELKIALVDRERTTQIQTLQIGATV 731
Query: 664 RRICHQEQSRTFAICSLKNQSCAEESEMHFVRLL--DDQTFEFISTYPLDTFEYGCSILS 721
RR+ + + F I +++ + A+ +E+ R + D+ F + ++ L E S++
Sbjct: 732 RRVAYSPSEKAFGIGTIERK-LADGAEIVTSRFMLADEVLFRQLDSFELRPEEIVESVIR 790
Query: 722 CSF--SDDSNV------YYCVGTAYVLPEENEPTKGRILVFIVEDG-KLQLIAEKETKGA 772
F D N + VGTAY+ E E +GRIL+F +++G KL +AE KGA
Sbjct: 791 AEFPAGKDENGREMTKDRFVVGTAYLDDEGEESIRGRILMFEIDNGRKLTKVAELPVKGA 850
Query: 773 VYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGD 832
+L K++AA+ + I +YK + + GT +L+ + V G+ IVV D
Sbjct: 851 CRALAMLGDKIVAALVKTIVIYKVVNNNFGTMKLEKLASFRTSTAPVDVTVVGNVIVVSD 910
Query: 833 LMKSISLLIYKHEEG----AIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNS 888
LMKS+ LL +K E ++ E AR + W + V +D D +L ++ NL +R+N
Sbjct: 911 LMKSVCLLEFKEGENGLPDSLTEVARHFQTVWATGVACIDKDTFLESDAEGNLIVLRRNL 970
Query: 889 EGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIAS 948
G +++R RLEV E LGE VNR R ++ + V P GTV G I + A
Sbjct: 971 AGVEEDDRRRLEVTSEISLGEMVNRIRPVNIQQL---ASVTVTPRAFLGTVEGSIYLFAI 1027
Query: 949 LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNE-KKTVDAKNFLDGDLIESFLDLS 1007
+ E FL +LQ + ++ +G + ++R F + ++ + F+DG+LIE FL+
Sbjct: 1028 INPEHQDFLMRLQATMAGKVESLGEMPFNEFRGFRSMVREATEPYRFVDGELIEQFLNCE 1087
Query: 1008 RTRMDEISKTMN-VSVEELCKRVEELTRLH 1036
+EI ++ ++V E+ +E L RLH
Sbjct: 1088 PELQEEIVNSVGMMNVHEVKVMIEALRRLH 1117
>gi|169773185|ref|XP_001821061.1| UV-damaged DNA binding protein [Aspergillus oryzae RIB40]
gi|83768922|dbj|BAE59059.1| unnamed protein product [Aspergillus oryzae RIB40]
Length = 1139
Score = 425 bits (1092), Expect = e-116, Method: Compositional matrix adjust.
Identities = 332/1110 (29%), Positives = 551/1110 (49%), Gaps = 104/1110 (9%)
Query: 6 IYGRIATL-ELFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPT 63
+Y +++ L L P D LF+ T+RY +C L WD+ + + T R D++D R +
Sbjct: 55 LYAKVSMLARLPAPAHSPTDHLFVGTDRYTYCTLSWDSAQNRIRTERNYVDIADPSSRES 114
Query: 64 DNGQIGIIDPDCRLIGLHLYDGLFKVIPF--------------------DNKGQLKEAFN 103
G +IDP R + L +Y+GL V+P G+L E
Sbjct: 115 QTGNRCLIDPSGRFMTLEVYEGLVAVVPIVQLPARKRGRAPAVPTGPDAPKVGELGELTT 174
Query: 104 IRLEELQVLDIKFLY-GCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQN---- 158
R++EL V FL+ P + +LY+DN+ K + +++ ++ S +
Sbjct: 175 ARIDELFVRSSAFLHVQSGLPRLALLYEDNQ-----KKVRLMVRELNYTSATASTSADAT 229
Query: 159 ---------NLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIRP----SITKA 205
LD GA LIPVP PL G+LI+GE +I Y + + I RP +I A
Sbjct: 230 LTHIADFAQELDLGASHLIPVPAPLGGLLILGETSIKYVDDDNNEIIS-RPLDEATIFVA 288
Query: 206 YGRVDADGSRYLLGDHAG-LLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYI 264
+ VD+ R+LL D G L L+++ +V G K++ LG TS AS + YL +V++
Sbjct: 289 WEGVDS--QRWLLADDYGRLFFLMLVLDSDNQVQGWKLDHLGNTSRASALVYLGGGIVFV 346
Query: 265 GSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDL-----ERQ------GQG 313
GS GDSQ++++ GS+ E+++ N+ PI+DF ++DL E Q GQ
Sbjct: 347 GSHQGDSQVLRI-----GNGSF-EIIQALSNIAPILDFTIMDLGNRTSESQTHEFSSGQA 400
Query: 314 QVVTCSGAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLR-SSTDDPFDTFLVVSFIS 371
++VT SGA+ DG+LR VR+G+G+ E + +++ I +W L+ + D DT L+V+FI
Sbjct: 401 RIVTGSGAFDDGTLRSVRSGVGMEELGVLGDMEHITDLWGLQVQAGGDTLDT-LLVTFID 459
Query: 372 ETRILAMNLEDELEETE-IEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNE 430
ETR+ + + E+EE + G TL + +++QVT V + + E
Sbjct: 460 ETRVFHFSPDGEVEELDHFLGLSLSENTLLAANLPRGRILQVTEQRVLIADLEGGMVVYE 519
Query: 431 WKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD--GILTEVKHAQLEYEISCLDINPI 488
W P + A+AN ++L GG + L+IG ++TE K + ++S + +
Sbjct: 520 WTPPNELVITAASANDDSLVLVIGGELMTVLDIGTEAQVITEKKFGA-DSQVSGVTVPA- 577
Query: 489 GENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLG--GEIIPRSVLLCAF--EGISYL 544
S +++ VG V + L DL + LG GE PRSVL+ + L
Sbjct: 578 ----SPTEVCVVGFPQLAKVSVLRLRDLTEVHTTSLGPAGEAFPRSVLVADVLADSPPTL 633
Query: 545 LCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTH-VFAASDRPTVIYS 603
++ DG ++ + L+ K+ LG++ T + + + VFA + P++IY
Sbjct: 634 FISMADGSVITYSFKTDDYSLSHMNKLILGSEQPTFKKLPRGDGLYNVFATCENPSLIYG 693
Query: 604 SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHP 663
S +++YS VN + S +C FNS A+P S+A+A EL I +D + I+++ +G
Sbjct: 694 SEGRIIYSAVNSEGASRVCHFNSEAYPGSIAVATLHELKIALVDRERTTQIQTLQIGATV 753
Query: 664 RRICHQEQSRTFAICSLKNQSCAEESEMHFVRLL--DDQTFEFISTYPLDTFEYGCSILS 721
RR+ + + F I +++ + A+ +E+ R + D+ F + ++ L E S++
Sbjct: 754 RRVAYSPSEKAFGIGTIERK-LADGAEIVTSRFMLADEVLFRQLDSFELRPEEIVESVIR 812
Query: 722 CSF--SDDSNV------YYCVGTAYVLPEENEPTKGRILVFIVEDG-KLQLIAEKETKGA 772
F D N + VGTAY+ E E +GRIL+F +++G KL +AE KGA
Sbjct: 813 AEFPAGKDENGREMTKDRFVVGTAYLDDEGEESIRGRILMFEIDNGRKLTKVAELPVKGA 872
Query: 773 VYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGD 832
+L K++AA+ + I +YK + + GT +L+ + V G+ IVV D
Sbjct: 873 CRALAMLGDKIVAALVKTIVMYKVVNNNFGTMKLEKLASFRTSTAPVDVTVVGNVIVVSD 932
Query: 833 LMKSISLLIYKHEEG----AIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNS 888
LMKS+ LL +K E ++ E AR + W + V +D D +L ++ NL +R+N
Sbjct: 933 LMKSVCLLEFKEGENGLPDSLTEVARHFQTVWATGVACIDKDTFLESDAEGNLIVLRRNL 992
Query: 889 EGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIAS 948
G +++R RLEV E LGE VNR R ++ + V P GTV G I + A
Sbjct: 993 AGVEEDDRRRLEVTSEISLGEMVNRIRPVNIQQL---ASVTVTPRAFLGTVEGSIYLFAI 1049
Query: 949 LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNE-KKTVDAKNFLDGDLIESFLDLS 1007
+ E FL +LQ + ++ +G + ++R F + ++ + F+DG+LIE FL+
Sbjct: 1050 INPEHQDFLMRLQATMAGKVESLGEMPFNEFRGFRSMVREATEPYRFVDGELIEQFLNCE 1109
Query: 1008 RTRMDEISKTMN-VSVEELCKRVEELTRLH 1036
+EI ++ ++V E+ +E L RLH
Sbjct: 1110 PELQEEIVNSVGMMNVHEVKVMIEALRRLH 1139
>gi|116195210|ref|XP_001223417.1| hypothetical protein CHGG_04203 [Chaetomium globosum CBS 148.51]
gi|88180116|gb|EAQ87584.1| hypothetical protein CHGG_04203 [Chaetomium globosum CBS 148.51]
Length = 1127
Score = 424 bits (1090), Expect = e-115, Method: Compositional matrix adjust.
Identities = 322/1141 (28%), Positives = 549/1141 (48%), Gaps = 125/1141 (10%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPTD 64
+ G I+ L+ RP D LF+ TER+++ L W+ E+S+L T D +R R +
Sbjct: 2 VNGTISILQKLRPKDSKTDLLFVGTERFEYFTLAWNPETSQLHTIDPFTDPGERHMRDSQ 61
Query: 65 NGQIGIIDPDCRLIGLHLYDGLFKVIPF---DNKGQLKEAFNIRLEELQVLDIKFLYG-C 120
+ ++DP R + +HL++G+ ++ NK L +RL EL + FLY
Sbjct: 62 SQDRCLVDPSGRYLAMHLWEGVLTMLRLGERKNKRGLAWMDQVRLSELFIKASTFLYTET 121
Query: 121 AKPTIVVLYQDNKDARHVK--TYEVALKDKDFVEGPWSQNN--------LDNGADLLIPV 170
P I LYQ D+ K TY + + D++ + + D A +LIPV
Sbjct: 122 GHPKIAFLYQSRADSADAKLATYRMTIDDRNTEASKFDPDRDREIDTDIPDPSASILIPV 181
Query: 171 ------------------PPPLCGVLIIGEETIVY---CSANAFKAIPIRPSITKAYGRV 209
+ G+L++GE ++Y + ++ + SI A+
Sbjct: 182 RKVEEEVKRHNVRNVESAKAHIGGLLVVGETRLLYIDEVTKATVQSALKQASIFVAWAEY 241
Query: 210 DADGSRYLLGDHAGLLHLLVITHEKEK---VTGLKIELLGETSIASTISYLDNAVVYIGS 266
+ + Y L D G LHLL + E VT +++ +G+TS AS + +L ++++ S
Sbjct: 242 NE--TNYFLADDYGNLHLLTLVRESPDSVVVTSMEVNRIGKTSRASHLVFLGGDMLFVAS 299
Query: 267 SYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLER-----------QGQGQV 315
YGDSQL +L+ Q + +++++ N+GPI+DF ++D+ R GQ ++
Sbjct: 300 HYGDSQLFRLDFQ-NEDVPPIQLVQTLANIGPILDFAIMDMGRGDEGQQGNEYSSGQARI 358
Query: 316 VTCSGAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLRSSTDDPFDTFLVVSFISETR 374
VTCSG +KDGSLR VR+G+G+ + + +L+ +G++SL++ + P L SF++ETR
Sbjct: 359 VTCSGVHKDGSLRSVRSGVGLEDVGILADLEHCRGLFSLKT-YEAPKTNILAASFLTETR 417
Query: 375 ILAMNLEDELEETE-IEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKS 433
+ + + E+EE + G QTL Q++QVT +V L+ + S + W
Sbjct: 418 VFKFDPQGEVEELDSFAGMTFNQQTLLARCLSTAQILQVTPAAVTLLDAESGLTIDSWTP 477
Query: 434 PPGYSVNVATANASQVLLATGGGHLVYLEIGDGI-LTEVKHAQLEYEISCLDINPIGENP 492
S+ A+ N +LL+ G LV I + L K + +I+C+ + P+
Sbjct: 478 GAQKSIISASGNNKSLLLSVDGTELVLFSIDINLRLVRTKEIGYQDQIACIHVPPL---- 533
Query: 493 SYSQIAAVGMWTDISVRIFSLPDLNLITKEHL----GGEIIPRSVLLCAF---EGISYLL 545
+ +A VG W+ +V I LP L + E L IPR + L G L
Sbjct: 534 -RNDLAVVGFWSSGTVSIIDLPSLEPMHGEQLRTSPDDASIPRDLALVKLLDNVGGGTLF 592
Query: 546 CALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHV---FAASDRPTVIY 602
A+ DG+++ F L+ + LT RK+V LG + + H+ FA ++ P++IY
Sbjct: 593 VAMQDGNVITFNLS-ENFRLTGRKRVILGMRQARFHLLPQPDAPHIYSIFATTEHPSLIY 651
Query: 603 SSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEH 662
S +++YS V +E +++CPF++ AFPD + +A + +L I ID ++ H++ +P+ E
Sbjct: 652 GSEGRIVYSAVTAEEATYICPFDTEAFPDCIVLATDNQLKISHIDRERRTHVKPLPMNEM 711
Query: 663 PRRICHQEQSRTFAI-CSLKNQSCAEESEMHFVRLLDDQTFEFI-STYPLD----TFEYG 716
RRI + + + F + C + EE +L+D+ F+ + T+PL T
Sbjct: 712 VRRIAYSPKEKVFGLGCIRRELVNGEEIVESSFKLVDEVIFDRVGKTFPLGVPSRTELVE 771
Query: 717 CSI---LSCSFSDDSNVYYCVGTAYVLPEENEP-------TKGRILVFIVE-DGKLQLIA 765
C + L S+ + + + VGT+++ + P +GR+LV V+ D L+
Sbjct: 772 CVVRAELRDSYGNPAE-RFLVGTSFLPDPDYGPGPGPAADARGRLLVLGVDADRNPYLVL 830
Query: 766 EKETKGAVYSLNAFN--------GKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHIL 817
+ KGA L G ++A + + + + ++ T EL +
Sbjct: 831 SHDLKGACRCLAVLGEDAGPNAAGLIVAGLTKTVVVCRYDETSSTTAELTRLASYRPSSY 890
Query: 818 ALYVQTRGDFIVVGDLMKSISLLIY--------------------KHEEGAIEERARDYN 857
+ RG I V DLMKSISL+ Y + + E AR ++
Sbjct: 891 PAEIAVRGSTIAVADLMKSISLVEYIPAGSSSGSGGGGSGSGESGSDDGPRLVEHARHFS 950
Query: 858 ANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHG 917
+ W +AV +++ +L A+ NL +R+N EG T E++ R+EV E +L E VNR R
Sbjct: 951 SVWATAVGFVEEGSWLEADAQGNLMVLRRNVEGVTAEDKRRMEVTSEINLNEMVNRIRTI 1010
Query: 918 SLVMRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNH 976
V P + + +P GTV G I + ++ PH Q L L + Q+ L V+K G +
Sbjct: 1011 D-VETTPGAMI--VPKAFLGTVEGGIYMFGTVAPHVQDLLL-RFQSRLADVLKTAGDIEF 1066
Query: 977 EQWRSFNNEKKTVDAK-NFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRL 1035
+R+F N ++ D F+DG+L+E FLD+ T + + K + +VE++ VEEL R+
Sbjct: 1067 RTYRAFRNAEREGDGPFRFVDGELLEKFLDVDETTQEAVCKGLGPTVEDMRNLVEELRRM 1126
Query: 1036 H 1036
H
Sbjct: 1127 H 1127
>gi|85107301|ref|XP_962347.1| hypothetical protein NCU06605 [Neurospora crassa OR74A]
gi|28923951|gb|EAA33111.1| conserved hypothetical protein [Neurospora crassa OR74A]
Length = 1158
Score = 423 bits (1087), Expect = e-115, Method: Compositional matrix adjust.
Identities = 323/1122 (28%), Positives = 551/1122 (49%), Gaps = 109/1122 (9%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPTD 64
I G I L+ +P D LF+ T+++++ +WD E+ +L T D +R R +
Sbjct: 55 INGTITILQKLQPKDHPTDLLFVGTDQFEYFTAEWDRETQQLKTLNRFSDPGERHMRDSQ 114
Query: 65 NGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNI----RLEELQVLDIKFLYG- 119
+ I+DP R + +HL++G+ V N+ +I RL EL + FLY
Sbjct: 115 SQNKCIVDPSGRFMAMHLWEGVLSVWRLGNRKNTATTLDILVQVRLSELFIKGSTFLYTE 174
Query: 120 CAKPTIVVLYQDNKDARHVK--TYEVALKDKDFVEGPWSQNN--------LDNGADLLIP 169
P + LY++ ++ K TY + D+ + D GA +LIP
Sbjct: 175 TGIPKVAFLYRNQANSNETKLATYRLTSDDRHTEISKFDPTRDREIDADVEDPGAGILIP 234
Query: 170 VP------------------PPLCGVLIIGEETIVY---CSANAFKAIPIRPSITKAYGR 208
V P + G++++GE ++Y + ++ PSI A+
Sbjct: 235 VKKVEEEVKRHHFRNTEQAKPHVGGLIVVGETRLLYIDEVTKTQVESALREPSIFVAWAE 294
Query: 209 VDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSY 268
D + Y L D G LHLL I E VTG+ + +G+T+ A ++YL + ++++GS Y
Sbjct: 295 YDP--THYFLSDDYGNLHLLTILTEGAVVTGMDVSNIGKTARAHVLTYLGDDMLFVGSHY 352
Query: 269 GDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQ------------GQGQVV 316
G+SQL +LNL + +++++ N+GP+ DF V+D+ + GQ ++V
Sbjct: 353 GNSQLYRLNLLSEDLSEILQLVQVLENIGPVTDFTVMDMGNRENDSQLGNEYSSGQARIV 412
Query: 317 TCSGAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLRSSTDDPFDTFLVVSFISETRI 375
T SG +KDG+LR VR+G+G+ + A + ELQ + ++SL+S DT LV SF+++TRI
Sbjct: 413 TASGVFKDGTLRSVRSGVGLQDIAILGELQHTRALFSLQSYNSPRVDT-LVASFLTDTRI 471
Query: 376 LAMNLEDELEETE-IEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSP 434
+ E+EE + G Q QTL + QL+QVT+ + L+ + S W
Sbjct: 472 FKFDPHGEIEEVDNYYGMDLQHQTLLATNLDNGQLLQVTTAAATLLDAESGVTIASWAPE 531
Query: 435 PGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQL--EYEISCLDINPIGENP 492
+ A+AN +LL+ G LV + I D LT V+ + + +I+C+ + P
Sbjct: 532 GDRQIINASANKHWLLLSVQGTTLVSINI-DNDLTVVQEKDISEQDQIACIHV-----AP 585
Query: 493 SYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEI----IPRSVLLCAF----EGISYL 544
S + VG WT +V I + L I E L IPR ++L G++ L
Sbjct: 586 QLSDVGVVGFWTSGTVSIIDMSTLEPIHGESLRRSADDASIPRDLVLAKVLPNAPGMT-L 644
Query: 545 LCALGDGHLLNFLLNMKTGE---LTDRKKVSLGTQPITLRTFSSKNTTH-VFAASDRPTV 600
A+ DG+++ F GE + RK V LGT+ ++ + +FA ++ P++
Sbjct: 645 FIAMEDGNVVTF----NIGEDLTFSGRKSVILGTREARFHLLPQQDGIYSIFATTEHPSL 700
Query: 601 IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG 660
IY S +++YS V ++ + +CPF+S AFP ++ ++ E E+ I ID ++ H+RS+ LG
Sbjct: 701 IYGSEGRIIYSAVTAEDATCVCPFDSEAFPGAVILSTENEIRISEIDTARQTHVRSLELG 760
Query: 661 EHPRRICHQEQSRTFAICSLKNQSCAEESEMH-FVRLLDDQTFEFIST-YPLDTFEYGCS 718
E RRI + + F + ++ + E +H +L+D+ F + + L T Y
Sbjct: 761 EMVRRIAYSPSEKGFGLGCIRREVVNGEEIIHSSFKLVDEILFARVGKEFMLGTSSYSEL 820
Query: 719 ILSCSFSDDSNVY------YCVGTAYVL-PEENEPT--KGRILVFIVEDGK-LQLIAEKE 768
+ ++ + Y + VGT+++ P+ T +GRILVF ++ + L+ + E
Sbjct: 821 VEDVIRAELPDSYGNLVERFIVGTSFLEDPDRGAGTDKRGRILVFGIDSNRDPYLVLKHE 880
Query: 769 TKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFI 828
KG +L K++AA+++ + + ++ L + + + G+ I
Sbjct: 881 LKGGCRALAVMGSKIVAALHKTVVISQYEETSSTEAHLVKLASYRCTTYPVDIAVHGNMI 940
Query: 829 VVGDLMKSISLLIY--------KHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFN 880
V D+MKS +L+ Y K E + E AR ++ W +AV ++ + +L A+ N N
Sbjct: 941 AVADMMKSATLVEYVPAKTGGEKSEAPKLVECARHRHSAWATAVAHVEGESWLEADANGN 1000
Query: 881 LFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVN 940
L +++N+EG T E++ +L + E +LGE VN+ R V P++ + IP T
Sbjct: 1001 LIVLQRNAEGVTVEDQRQLRITSELNLGEQVNKIRPIK-VETSPNAII--IPRAFLATAE 1057
Query: 941 GVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAK------NF 994
G I + ++ EQ L L + Q L VIK VG L+ +R+F N ++ +A F
Sbjct: 1058 GGIYMFGTIAREQDLLL-RFQDKLAAVIKTVGELDFNSYRAFRNAERGPEADGTTGPVRF 1116
Query: 995 LDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRLH 1036
LDG+L+E FLD+ EI + + SVE++ VEEL R+H
Sbjct: 1117 LDGELLERFLDVDEKTQKEICEGLGPSVEQMRNMVEELRRMH 1158
>gi|358056808|dbj|GAA97158.1| hypothetical protein E5Q_03834 [Mixia osmundae IAM 14324]
Length = 1243
Score = 423 bits (1087), Expect = e-115, Method: Compositional matrix adjust.
Identities = 334/1182 (28%), Positives = 553/1182 (46%), Gaps = 169/1182 (14%)
Query: 2 LDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAE--SSELITRAMG------ 53
L++P++G I L+ R + L + T F VL ++ + + TR++G
Sbjct: 84 LEIPLFGTILDLQPIRFKDRSTSSLLLLTTSLHFSVLTYEPDFLTPASGTRSLGWSNARV 143
Query: 54 ------DVSDRIGRPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKG----------- 96
VS+R GR ++ Q ++DP R + LH+Y GL +V+P +
Sbjct: 144 QDEATLSVSERAGRQSEEAQTILVDPHNRCVLLHIYCGLIRVLPIKSSAGAASKPSRRRS 203
Query: 97 ---------------------------------QLKEAFNIRLEELQVLDIKFLY--GCA 121
+ ++NIRL L V I FL +
Sbjct: 204 SAAAPVGELSSSLKKSSGKARASAAASAESDGIDVFRSYNIRLPYLNVKHIAFLPLPESS 263
Query: 122 KPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNL---DNGADLLIPVPPPLC--- 175
PTI +L+ + + + ++LKDK+ V L D DL IP+ L
Sbjct: 264 LPTIALLHTNFLGQNVLSIHAISLKDKELVNSAKPDFELVLDDETVDLFIPLERNLATGQ 323
Query: 176 --GVLIIGEETI----------------------------VYCSANAFKAIPIRPS---- 201
G+L++G I V +A+ ++ P P+
Sbjct: 324 PLGLLLVGSGQINWLPLPLLDTIASPPISPTTARRASISSVSTNASGRRSAPPSPTNTRK 383
Query: 202 -------------------ITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKI 242
I + Y +D D S L+GD G L + I + E+VT L +
Sbjct: 384 SKGKERASDRRVTSKVPVGIYRTYCAIDHDPSLLLVGDEEGYLVSVRIGLQGERVTSLDV 443
Query: 243 ELLGETSIASTISYLDNAVVYIGSSYGDSQLIKL-NLQPDA-KGSYVEVLERYVNLGPIV 300
LG+ ++++++ + V++GS YGDS L+ + L D + V V+ NL PIV
Sbjct: 444 VDLGKVPSPTSLTHIADEYVFVGSYYGDSSLVAIPALTSDGMECDSVSVISAMANLAPIV 503
Query: 301 DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP 360
DFCVV + GQ +VTCSGA GSLR+VR G+G++ A+++L ++ W L+ ++
Sbjct: 504 DFCVVT-DDVGQSHLVTCSGAKNSGSLRLVRQGVGLSILATIDLPAVQNAWPLKLASSSI 562
Query: 361 FDTFLVVSFISETRILAM---NLEDELEETEIEGFCSQTQTLFCH---DAIYNQLVQVTS 414
++VSF+ +++L++ NL +E+ TL+ DA + T+
Sbjct: 563 KHDAILVSFLDRSQLLSLCNGNL------SELASPALTEPTLYAGTLGDAGAQVAICATA 616
Query: 415 GSV-RLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKH 473
++ RL EL W SP G + A+++ S +LLAT LV + + +
Sbjct: 617 KAILRLSIEKPSEL---WTSPTGDLITAASSDGSSLLLATSSKALVLMTLSPNGIALTTT 673
Query: 474 AQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSV 533
Q E+SCL + SQIAAV W+ ++ ++SLP L +T L + +P+S+
Sbjct: 674 KQTISEVSCL---AAWTTTTGSQIAAVASWSTNAILLYSLPHLEPVTGAELSFDHLPQSM 730
Query: 534 LLCAFEGIS-YLLCALGDGHLLNFLLNMKTGELT--DRKKVSLGTQPITLRTFSSKNTTH 590
L FE + +L LG G L++F ++ +G + RK V+LG +P+ L S+
Sbjct: 731 LFQKFEDDAVHLFVGLGSGDLISFGIDAASGAVLPLSRKSVTLGKKPVLLSQCSAAGQPA 790
Query: 591 VFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQ 650
VFA +DRPTV+ S +L Y++ N + + + ++ F SL + + D +
Sbjct: 791 VFAVTDRPTVVSRSAGRLSYASDNRRTLVAINQIDALRFEQSLMLVSPEGIQFARADGNE 850
Query: 651 KLHIRSIPLGE-HPRRICHQEQSRTFAICSLK---NQSCAEESEMHFVRLLDDQTFEFIS 706
LH+RS+ LGE PR+I H + R +A+ L+ ++S + V+ +D F +
Sbjct: 851 SLHVRSLSLGELQPRKIAHSAELRAYAVLCLQETIDRSTGHLTRAGSVQFVDASDFALLD 910
Query: 707 TYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYV--LPEENEPTKGRILVFIVEDGKLQLI 764
++ L + E+G ++ + S ++ VGTA+ + EP KGR+L F+ + K +
Sbjct: 911 SFDLQSDEHGTALETVSL--HGAAHFAVGTAFSDRTVDAREPKKGRVLTFMRDGDKFEQH 968
Query: 765 AEKETKGAVYSLNAFNGKLLAAI-NQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQT 823
+G V+ L LAAI N +++++ + Q C G LA + +
Sbjct: 969 VHAVLEGGVFGLCQLPNSFLAAIANAQVKVFHVT---EQAHIDQMTC-WAGTFLAQSISS 1024
Query: 824 RGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEIL--DDDIYLGAENNFNL 881
R I+VGDL +S+ LL + + + E AR+++ N MSAVE L DD Y+G E N+
Sbjct: 1025 RDSQIIVGDLYRSVVLLQWDEAKDTLSEVAREHHVNGMSAVEFLGFTDDRYIGTEQELNI 1084
Query: 882 FTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNG 941
FT+ K T E LE G +H+GE+V R R G+LV D+ G P ++FGT +G
Sbjct: 1085 FTLTKTK---TRERIDILETEGMFHIGEYVTRIRKGALVPGYTDTSFGAAPQLLFGTSDG 1141
Query: 942 VIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIE 1001
+GVI + E L L L+ N+R VI+ GGL WR+F + + F+DGD+I
Sbjct: 1142 SLGVIVNCTPEVSLKLFALERNMRAVIRAFGGLEQVDWRAFRAPHRVHEPVGFVDGDMIG 1201
Query: 1002 SFLDLSRTRMDEI-------SKTMNVSVEELCKRVEELTRLH 1036
F +L+ T+++++ S N + E+L + V+EL R+H
Sbjct: 1202 RFAELNETQVNQVLQGASEHSALTNTTAEDLYRLVDELQRMH 1243
>gi|336263557|ref|XP_003346558.1| hypothetical protein SMAC_04731 [Sordaria macrospora k-hell]
gi|380090453|emb|CCC11749.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 1149
Score = 422 bits (1085), Expect = e-115, Method: Compositional matrix adjust.
Identities = 324/1111 (29%), Positives = 543/1111 (48%), Gaps = 110/1111 (9%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPTD 64
I G I L+ +P D LF+ T+++++ +WD E+ +L T D +R R +
Sbjct: 55 INGTITILQKLQPKDHPTDLLFVGTDQFEYFTAEWDHETQQLKTLNRFSDPGERHMRDSQ 114
Query: 65 NGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNI----RLEELQVLDIKFLYG- 119
+ I+DP R + +HL++G+ V N+ +I RL EL + FLY
Sbjct: 115 SQDKCIVDPSGRFMAMHLWEGVLSVWRLGNRKSTATTLDILVQVRLSELFIKGSTFLYTE 174
Query: 120 CAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVP-------- 171
P + LY++ ++ K D D VE P GA +LIPV
Sbjct: 175 TGIPKVAFLYRNQANSNETKLATDREIDAD-VEDP--------GAGILIPVKKVEEEVKR 225
Query: 172 ----------PPLCGVLIIGEETIVY---CSANAFKAIPIRPSITKAYGRVDADGSRYLL 218
P + G+++IGE ++Y + ++ PSI A+ D + Y L
Sbjct: 226 HHFRNTEQAKPHVGGLIVIGETRLLYIDEVTKTQVESALKEPSIFVAWAEYDP--THYFL 283
Query: 219 GDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNL 278
D G LHLL I E VTG+ + +G T+ A ++YL + ++++GS YG+SQL +LNL
Sbjct: 284 ADDYGNLHLLTILTEGAVVTGMDVSNIGRTARAHVLTYLGDDMLFVGSHYGNSQLYRLNL 343
Query: 279 QPDAKGSYVEVLERYVNLGPIVDFCVVDLERQ------------GQGQVVTCSGAYKDGS 326
+ +++++ N+GPI DF ++D+ + GQ ++VT SG +KDG+
Sbjct: 344 LNEDLNEILQLVQVLENIGPITDFTIMDMGNRENDSQLGNEYSSGQARIVTASGIFKDGT 403
Query: 327 LRIVRNGIGINEQASV-ELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELE 385
LR VR+G+G+ + A + ELQ + ++SL+S DT LV SF+++TRI + E+E
Sbjct: 404 LRSVRSGVGLQDIAILGELQHTRALFSLQSYNSSRADT-LVASFLTDTRIFRFDPHGEIE 462
Query: 386 ETEIEGFCS---QTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVA 442
E+ +C Q QTL + QL+QVT+ + L+ + S W + A
Sbjct: 463 --EVADYCGMDLQHQTLLTTNLDNGQLLQVTTAAATLLDAESGVTIASWAPEGDRQIINA 520
Query: 443 TANASQVLLATGGGHLVYLEI-GDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVG 501
+AN +LL+ G LV + I D + + K + +I+C+ + P S + VG
Sbjct: 521 SANKHWLLLSVQGTTLVSINIDNDLTVVQEKDVSEQDQIACIHV-----APQLSDVGVVG 575
Query: 502 MWTDISVRIFSLPDLNLITKEHLGGEI----IPRSVLLCAF----EGISYLLCALGDGHL 553
WT +V I + L I E L IPR ++L G++ L A+ DG++
Sbjct: 576 FWTSGTVSIIDMSTLEPIHGESLRRSADDASIPRDIVLAKVLPNTPGMT-LFIAMEDGNV 634
Query: 554 LNFLLNMKTGE---LTDRKKVSLGTQPITLRTFSSKNTTH-VFAASDRPTVIYSSNKKLL 609
+ F GE + RK V LGT+ ++ + +FA ++ P++IY S +++
Sbjct: 635 VTF----NIGEDLTFSGRKSVILGTREARFHLLPQQDGIYSIFATTEHPSLIYGSEGRII 690
Query: 610 YSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQ 669
YS V ++ + +CPF+S AFP ++ ++ E E+ I ID ++ H+RS+ LGE RRI +
Sbjct: 691 YSAVTAEDATCVCPFDSEAFPGAVVLSTETEIKISEIDTARRTHVRSLELGEMVRRIAYS 750
Query: 670 EQSRTFAI-CSLKNQSCAEESEMHFVRLLDDQTFEFIST-YPLDTFEYGCSILSCSFSDD 727
+ F + C + EE +L+D+ F + L T Y + ++
Sbjct: 751 PSEKGFGLGCIRREMVNGEEIIQSSFKLVDEILFARAGREFRLGTSSYSELVEDVIRAEL 810
Query: 728 SNVY------YCVGTAYVL-PEENEPT--KGRILVFIVEDGK-LQLIAEKETKGAVYSLN 777
+ Y + VGT+++ P+ T +GRILVF ++ + L+ + E +GA +L
Sbjct: 811 PDSYGNLLERFIVGTSFLEDPDRGAGTDKRGRILVFGIDSNRDPYLVLKHELRGACRALA 870
Query: 778 AFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSI 837
K++AA+++ + + ++ L + + + G+ I V D+MKS
Sbjct: 871 VMGSKIVAALHKTVVISQYEETSSTEARLVKLASYRCTTYPIDIAVHGNIIAVADMMKSA 930
Query: 838 SLLIY--------KHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSE 889
+L+ Y K+E + E AR ++ W +AV ++ + +L A+ N NL +++N E
Sbjct: 931 TLVEYVQAKTEEEKYEPAKLVECARHRHSAWATAVAHVEGESWLEADANGNLVVLQRNVE 990
Query: 890 GATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASL 949
G T E++ +L + E +LGE VN+ R V P++ + IP T G I + ++
Sbjct: 991 GVTAEDQRQLRITSELNLGEQVNKIRPIK-VETSPNTII--IPRAFLATAEGGIYLFGTI 1047
Query: 950 PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKK------TVDAKNFLDGDLIESF 1003
EQ L L + Q L VIK VG L+ +R+F N ++ T FLDG+L+E F
Sbjct: 1048 AREQDLLL-RFQDKLAAVIKTVGELDFNSYRAFRNAERGPETDGTTGPVRFLDGELLERF 1106
Query: 1004 LDLSRTRMDEISKTMNVSVEELCKRVEELTR 1034
LD+ T EI + + SVE++ VEEL R
Sbjct: 1107 LDVDETTQKEICEGLGPSVEQMRNMVEELRR 1137
>gi|347838030|emb|CCD52602.1| similar to DDB1B (Damaged DNA Binding protein 1 B); damaged DNA
binding / protein binding [Botryotinia fuckeliana]
Length = 1157
Score = 420 bits (1079), Expect = e-114, Method: Compositional matrix adjust.
Identities = 334/1125 (29%), Positives = 568/1125 (50%), Gaps = 116/1125 (10%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPTD 64
IYGRI+ L+ +P G D LF+ T R ++ + W+ ++ +L T ++ DVS R ++
Sbjct: 55 IYGRISMLQKIQPAGSKTDHLFVGTVRAQYFTVMWNPQTHKLDTMQSFVDVSQEHMRDSE 114
Query: 65 NGQIGIIDPDCRLIGLHLYDGLFKVI----PFDNKGQ-LKEAFNIRLEELQVLDIKFLYG 119
+ ++DP RL+ + LY+G+ ++ P K L+ +R+ E++V FLY
Sbjct: 115 SRDRCLVDPTGRLLVMELYEGVLNLVKIVKPRGGKTDYLENPEQVRISEMKVRASAFLYT 174
Query: 120 CAK-PTIVVLYQDNKDARHVKTYEVALKDK-----DFVEGPWSQNNLDN---GADLLIPV 170
K P + +LYQD ++ + TY + L DK F +N++D+ GA +IPV
Sbjct: 175 DTKQPKLALLYQDARENVKLATYRM-LDDKGQLILQFDPKKNRENDVDDLCVGAMHIIPV 233
Query: 171 PPP-------------------LCGVLIIGEETIVYCSANAFKAIPIRPSITKAY---GR 208
P L G++++GE Y + KAI + ++ +A
Sbjct: 234 PKARDEASKRYIVRNATTAKANLGGLVVLGETKFTYLDDES-KAI-VEYALDEAVLWAAW 291
Query: 209 VDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSY 268
D YLLGD G L++L I + VTGL++ LG+ S +++ L N V YI S
Sbjct: 292 EPIDERTYLLGDDYGFLYILTILVDGATVTGLQVVKLGQVSKPTSLENLGNGVFYIASHE 351
Query: 269 GDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDL-----ERQ------GQGQVVT 317
D+Q+++++L+ G V +L+ N+ PI+DF V+D+ E Q GQ ++VT
Sbjct: 352 ADNQVVQIDLESPEHG--VTLLQTLPNIAPILDFTVMDMGGREGETQLNEYSSGQARLVT 409
Query: 318 CSGAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLRSSTDDPFDTFLVVSFISETRIL 376
SG ++ GSLR VR+G+G+++ A + E++GI+ +++L S P DT LVVSF +ETR
Sbjct: 410 GSGGFEGGSLRSVRSGVGLDDTAILAEMEGIRKVFALHSGPTLPNDT-LVVSFSTETRFF 468
Query: 377 AMNLEDELEETE-IEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPP 435
+ + ++EE E ++ S ++TL ++ ++QVT V + + + W+ P
Sbjct: 469 KFDTQGDIEEVETVKNLSSTSETLLTYNLDEGCILQVTQHEVSIYGKSPG---HRWQPPN 525
Query: 436 GYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQL-EYEISCLDINPIGENPSY 494
G + A+ N + +LL++ G L L I L EV +L + +++C+ + +
Sbjct: 526 GQIITAASGNQNYILLSSNGRTLFTLSIQQN-LAEVAFQELGDDQVACIHVPQV-----M 579
Query: 495 SQIAAVGMWTDISVRIFSLPDLNLITKEHL---GGEIIPRSVLLCA-----FEGISYLLC 546
I VG+W SV + L LN I E L G IPR + L G + L
Sbjct: 580 GDIGVVGLWKSGSVSLLDLATLNTIVSEDLRRADGASIPRDIALTQILPPELSGPT-LFV 638
Query: 547 ALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTT-HVFAASDRPTVIYSSN 605
++ DG +L+F ++ L+ RK + LGTQ L+ NTT +VFA + P++IY S
Sbjct: 639 SMEDGIVLSFNVDKSDCSLSGRKSIVLGTQQAQLQILPRDNTTFNVFATCEHPSLIYGSE 698
Query: 606 KKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRR 665
+ +YS V ++ +C NS A+P S+ +A EL + ID+ ++ H+R++P+GE RR
Sbjct: 699 GRTVYSAVTAEDAIAVCSLNSVAYPGSVVVATINELKLAVIDNERRTHVRTLPIGETVRR 758
Query: 666 ICHQEQSRTFAICSLKNQ-SCAEESEMHFVRLLDDQTF-EFISTYPL----DTFEYGCSI 719
+ + + ++FAI ++K + + +E RL+D+ F E Y L + E
Sbjct: 759 VAYSAKEKSFAIGAIKRELTKGQEVVTTSFRLVDEVVFGELGEPYYLPPNNEIIETVVRA 818
Query: 720 LSCSFSDDSNVY--YCVGTAYVLPEENEPTKGRILVFIVE-DGKLQLIAEKETKGAVYSL 776
+ D + + VGT++ L EE +GR+L+F V D +IA KG+ +
Sbjct: 819 ELPTRHGDGELVERFLVGTSF-LHEEEANVRGRLLIFGVNADRAPYMIASHNLKGSCRCI 877
Query: 777 NAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKS 836
+GK++AA+N+ + +Y + + L+ + + + + I V D+MKS
Sbjct: 878 GVLDGKIVAALNKTVVMYDYEETSSTSATLKKLATYRCSTCPIDIDITDNIIAVADIMKS 937
Query: 837 ISLLIYKHEEGA------IEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEG 890
I+L+ Y GA +EE AR + ++V +D D YL +++ NL +++N EG
Sbjct: 938 IALVEYT--PGADGLPDKLEEVARHAQQVFSTSVAEVDTDTYLETDHDGNLILLKRNREG 995
Query: 891 ATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLP 950
T E++ R+EV E +LGE VNR + ++ D IP GT G I + + +P
Sbjct: 996 VTREDKTRMEVTCEMNLGEMVNRVKRINVET---SKDALLIPRAFLGTTEGSIYLFSLIP 1052
Query: 951 HEQYLFLEKLQTNLRKV-----IKGV-------------GGLNHEQWRSF-NNEKKTVDA 991
+ L +LQ+ L + I+G G L+ ++RS+ + ++T +
Sbjct: 1053 PQNQDLLMRLQSRLASLPSASSIRGSSDSTSPHQIELSPGNLDFNKYRSYISATRETSEP 1112
Query: 992 KNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRLH 1036
F+DG+LIE FLDL + +++ + V E+L VE L RLH
Sbjct: 1113 FRFVDGELIERFLDLEVEVQEHVAEGLGVKAEDLRGMVEGLRRLH 1157
>gi|336470789|gb|EGO58950.1| hypothetical protein NEUTE1DRAFT_145059 [Neurospora tetrasperma FGSC
2508]
gi|350291855|gb|EGZ73050.1| hypothetical protein NEUTE2DRAFT_156600 [Neurospora tetrasperma FGSC
2509]
Length = 1158
Score = 419 bits (1078), Expect = e-114, Method: Compositional matrix adjust.
Identities = 323/1122 (28%), Positives = 550/1122 (49%), Gaps = 109/1122 (9%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPTD 64
I G I L+ +P D LF+ T+++++ +WD E+ +L T D +R R +
Sbjct: 55 INGTITILQKLQPKDHPTDLLFVGTDQFEYFTAEWDRETQQLKTLNRFSDPGERHMRDSQ 114
Query: 65 NGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNI----RLEELQVLDIKFLYG- 119
+ I+DP R + +HL++G+ V N+ +I RL EL + FLY
Sbjct: 115 SQNKCIVDPSGRFMAMHLWEGVLSVWRLGNRKNTATTLDILVQVRLSELFIKGSTFLYTE 174
Query: 120 CAKPTIVVLYQDNKDARHVK--TYEVALKDKDFVEGPWSQNN--------LDNGADLLIP 169
P + LY++ ++ K TY + D+ + D GA +LIP
Sbjct: 175 TGIPKVAFLYRNQANSNETKLATYRLTSDDRHTEISKFDPTRDREIDADVEDPGAGILIP 234
Query: 170 VP------------------PPLCGVLIIGEETIVY---CSANAFKAIPIRPSITKAYGR 208
V P + G++++GE ++Y + ++ SI A+
Sbjct: 235 VKKVEEEVKRHHFRNTEQAKPHVGGLIVVGETRLLYIDEVTKTQVESALREASIFVAWAE 294
Query: 209 VDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSY 268
D + Y L D G LHLL I E VTG+ + +G+T+ A ++YL + ++++GS Y
Sbjct: 295 YDP--THYFLSDDYGNLHLLTILTEGAVVTGMDVSNIGKTARAHVLTYLGDDMLFVGSHY 352
Query: 269 GDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQ------------GQGQVV 316
G+SQL +LNL + +++++ N+GPI DF V+D+ + GQ ++V
Sbjct: 353 GNSQLYRLNLLSEDLSEILQLVQVLENIGPITDFTVMDMGNRENDSQLGNEYSSGQARIV 412
Query: 317 TCSGAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLRSSTDDPFDTFLVVSFISETRI 375
T SG +KDG+LR VR+G+G+ + A + ELQ + ++SL+S DT LV SF+++TRI
Sbjct: 413 TASGVFKDGTLRSVRSGVGLQDIAILGELQHTRALFSLQSYNSPRVDT-LVASFLTDTRI 471
Query: 376 LAMNLEDELEETE-IEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSP 434
+ E+EE + G Q QTL + QL+QVT+ + L+ + S W
Sbjct: 472 FKFDPHGEIEEVDNYYGMDLQHQTLLATNLDNGQLLQVTTAAATLLDAESGVTIASWAPE 531
Query: 435 PGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQL--EYEISCLDINPIGENP 492
+ A+AN +LL+ G LV + I D LT V+ + + +I+C+ + P
Sbjct: 532 GDRQIINASANKHWLLLSVQGTTLVSINI-DNDLTVVQEKDISEQDQIACIHV-----AP 585
Query: 493 SYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEI----IPRSVLLCAF----EGISYL 544
S + VG WT +V I + L I E L IPR ++L G++ L
Sbjct: 586 QLSDVGVVGFWTSGTVSIIDMSTLEPIHGESLRRSADDASIPRDLVLAKVLPNAPGMT-L 644
Query: 545 LCALGDGHLLNFLLNMKTGE---LTDRKKVSLGTQPITLRTFSSKNTTH-VFAASDRPTV 600
A+ DG+++ F GE + RK V LGT+ ++ + +FA ++ P++
Sbjct: 645 FIAMEDGNVVTF----NIGEDLTFSGRKSVILGTREARFHLLPQQDGIYSIFATTEHPSL 700
Query: 601 IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG 660
IY S +++YS V ++ + +CPF+S AFP ++ ++ E E+ I ID ++ H+RS+ LG
Sbjct: 701 IYGSEGRIIYSAVTAEDATCVCPFDSEAFPGAVILSTENEIRISEIDTARQTHVRSLELG 760
Query: 661 EHPRRICHQEQSRTFAICSLKNQSCAEESEMH-FVRLLDDQTFEFIST-YPLDTFEYGCS 718
E RRI + + F + ++ + E +H +L+D+ F + + L T Y
Sbjct: 761 EMVRRIAYSPSEKGFGLGCIRREVVNGEEIIHSSFKLVDEILFARVGKEFMLGTSSYSEL 820
Query: 719 ILSCSFSDDSNVY------YCVGTAYVL-PEENEPT--KGRILVFIVEDGK-LQLIAEKE 768
+ ++ + Y + VGT+++ P+ T +GRILVF ++ + L+ + E
Sbjct: 821 VEDVIRAELPDSYGNLVERFIVGTSFLEDPDRGAGTDKRGRILVFGIDSNRDPYLVLKHE 880
Query: 769 TKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFI 828
KG +L K++AA+++ + + ++ L + + + G+ I
Sbjct: 881 LKGGCRALAVMGSKIVAALHKTVVISQYEETSSTEAHLVKLASYRCTTYPVDIAVHGNII 940
Query: 829 VVGDLMKSISLLIY--------KHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFN 880
V D+MKS +L+ Y K E + E AR ++ W +AV ++ + +L A+ N N
Sbjct: 941 AVADMMKSATLVEYVPAKTGGEKSEAPKLVECARHRHSAWATAVAHVEGESWLEADANGN 1000
Query: 881 LFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVN 940
L +++N+EG T E++ +L + E +LGE VN+ R V P++ + IP T
Sbjct: 1001 LIVLQRNAEGVTVEDQRQLRITSELNLGEQVNKIRPIK-VETSPNAII--IPRAFLATAE 1057
Query: 941 GVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAK------NF 994
G I + ++ EQ L L + Q L VIK VG L+ +R+F N ++ +A F
Sbjct: 1058 GGIYMFGTIVREQDLLL-RFQDKLAAVIKTVGELDFNSYRAFRNAERGPEADGTTGPVRF 1116
Query: 995 LDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRLH 1036
LDG+L+E FLD+ EI + + SVE++ VEEL R+H
Sbjct: 1117 LDGELLERFLDVDEKTQKEICEGLGPSVEQMRNMVEELRRMH 1158
>gi|171691144|ref|XP_001910497.1| hypothetical protein [Podospora anserina S mat+]
gi|170945520|emb|CAP71632.1| unnamed protein product [Podospora anserina S mat+]
Length = 1158
Score = 419 bits (1077), Expect = e-114, Method: Compositional matrix adjust.
Identities = 326/1117 (29%), Positives = 550/1117 (49%), Gaps = 100/1117 (8%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSEL-ITRAMGDVSDRIGRPTD 64
+ G IA L+ RP D LF+ T+R+++ L W+ E+S++ T A D + R +
Sbjct: 56 VNGTIAMLQKLRPKDAETDLLFVGTDRFEYFTLYWNPETSQMETTNATRDPGEHFMRNSQ 115
Query: 65 NGQIGIIDPDCRLIGLHLYDGLFKVIPF----DNKGQLKEAFNIRLEELQVLDIKFLYG- 119
+ I+DP R I +HL++G+ + N QL IRL EL + FL+
Sbjct: 116 SLDRAIVDPSGRFIAMHLWEGVMTIARLGTRKTNAAQLDWMGQIRLAELFIKASTFLHNE 175
Query: 120 CAKPTIVVLYQDNKDARHVK--TYEVALKDKDFVEGPWSQNN--------LDNGADLLIP 169
PT+ LYQ + +A+ K TY + D++ V ++ D GA++LIP
Sbjct: 176 TGHPTVAFLYQTSANAQDSKLATYRLTSDDRNTVASEFNAQKHRIIDITIADAGANMLIP 235
Query: 170 V------------------PPPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRV-- 209
V P L GV+++GE ++Y + KA + + KA V
Sbjct: 236 VRKVEEEVKRHNFRNTGSAKPHLGGVIVVGETRLLYID-DVTKAT-VESKLDKASIFVKW 293
Query: 210 -DADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSY 268
+ + Y L D G LHLL I + +V G+ + +G TS AS + YL N ++++ S +
Sbjct: 294 AEYNVQTYFLADDYGSLHLLTINTDGAEVKGMVLTKIGVTSRASELVYLGNEMLFVASHH 353
Query: 269 GDSQLIKLNLQPD--AKGSYVEVLERYVNLGPIVDFCVVDLERQG-------------QG 313
GDS+L +L+L D A ++ +++ N+GPI+DF V+D+ +G Q
Sbjct: 354 GDSRLFQLDLSADKPADKPFLTLIQTISNIGPIMDFAVMDMGNRGGEDSQLGNEYSSGQA 413
Query: 314 QVVTCSGAYKDGSLRIVRNGIGINEQASVEL----QGIKGMWSLRSSTDDPFDTFLVVSF 369
++V SG YKDGSLR VR+G+G+ E + L Q ++G++SLR + + L VSF
Sbjct: 414 RIVCGSGVYKDGSLRSVRSGVGL-EDVGLLLEDLGQHVRGVFSLRGAVGEGKMDTLAVSF 472
Query: 370 ISETRILAMNLEDELEET-EIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELR 428
++ETR+ + E +EE + GF QTL + ++QVT+ V L+ + S
Sbjct: 473 LTETRVFKFDSEGGVEEVGDFMGFGLDCQTLLARNLGGGMILQVTTRGVVLIDAESGVTV 532
Query: 429 NEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQL--EYEISCLDIN 486
W ++ A+AN +LL+ G LV + L VK + + +++C+ +
Sbjct: 533 ATWVPRDENTIINASANGEWLLLSVEGTGLVSISTAGNELRLVKEKDISQQDQVACIHVA 592
Query: 487 PIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHL----GGEIIPRSVLLC--AFEG 540
P I VG WT +V I L L + E L IPR V+L A
Sbjct: 593 P-----QLQGIGVVGFWTSGTVSIIDLNTLEPMHGESLRQSQDDASIPREVVLVQVASPK 647
Query: 541 IS--YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTT--HVFAASD 596
+S L A+ DGH++ F ++ E + +K+V LGT+ L N + + A ++
Sbjct: 648 VSGPTLFIAMEDGHVVTFNISADF-EFSGKKQVILGTRQARLHLLPQDNDSIYSILATTE 706
Query: 597 RPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRS 656
P++IY +++YS V +E +CPF++ AFPDS+ +A + ++ I ID ++ H+R
Sbjct: 707 HPSLIYGEENRIVYSAVTAEEAMFICPFDTEAFPDSIIVATDTQIKISKIDRTRRTHVRE 766
Query: 657 IPLGEHPRRICHQEQSRTFAI-CSLKNQSCAEESEMHFVRLLDDQTFEFI-STYPLDTFE 714
+P+GE RRI + + + F + C ++ +E RL+D+ F+ + T+ L+
Sbjct: 767 LPMGEMVRRIAYSPKEKVFGLGCIKRSLVDGDEVVQSSFRLVDEVIFQPVGKTFQLERTN 826
Query: 715 YGCSILSCSFSDDSNVY------YCVGTAYVLPEENEPT----KGRILVFIVEDGK-LQL 763
Y + + ++ + Y + VGT++ LP+ + +GRILVF ++D K L
Sbjct: 827 YVELVEAVVRAELPDSYGNPAERFIVGTSF-LPDPDYAMTGEHRGRILVFGIDDNKDPYL 885
Query: 764 IAEKETKGAVYSLNAFNG-KLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQ 822
I TKG L +G K++A + + + + ++ T L + + +
Sbjct: 886 ILSHLTKGVCRCLEVLDGNKIVAGLAKTVAIARYDETSTTTATLTRLASYKPSTHPIQIA 945
Query: 823 TRGDFIVVGDLMKSISLLIY-KHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNL 881
+G+ I V D+MKS++L+ Y ++ + E AR + + +A+ +D D +L A++ NL
Sbjct: 946 AQGNIIGVADVMKSMTLVEYMPGDKDRLVEVARHWQSAAGTALCHVDGDDWLEADDQGNL 1005
Query: 882 FTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQ-IPTVIFGTVN 940
+R+N++ E+R + V E +LGE VNR R +R+ S +P GTVN
Sbjct: 1006 MMLRRNADAVVMEDRKIMSVTAEMNLGEMVNRIR----AVRVETSRGAMVVPRAFLGTVN 1061
Query: 941 GVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNN-EKKTVDAKNFLDGDL 999
G I + ++ E L + Q L +V+ G + +R+F N E++ + FLDG+L
Sbjct: 1062 GGIYMFGTVAPEAQDLLLRFQEKLARVVHTAGEIEFNCYRAFRNAEREGSEPVRFLDGEL 1121
Query: 1000 IESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRLH 1036
+E FLD EI + + S+E + VEEL R+H
Sbjct: 1122 LERFLDQDEATQREICEGLGPSLEHMRNVVEELRRMH 1158
>gi|224587439|gb|ACN58665.1| DNA damage-binding protein 1 [Salmo salar]
Length = 444
Score = 419 bits (1076), Expect = e-114, Method: Compositional matrix adjust.
Identities = 221/446 (49%), Positives = 284/446 (63%), Gaps = 54/446 (12%)
Query: 641 LTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLK------------------- 681
LTIGTID+IQKLHIR++PL E PRRIC+QE S+ F + S +
Sbjct: 3 LTIGTIDEIQKLHIRTVPLYESPRRICYQEVSQCFGVLSSRVEMQDASGTTAAVRPSAST 62
Query: 682 --------------------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILS 721
S EE E+H + ++D TFE + + EY S++S
Sbjct: 63 QALSSSVSSSKLFPSSTSPHETSFGEEVEVHSLLVVDQHTFEVLHAHQFLQSEYALSMVS 122
Query: 722 CSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNG 781
C D +VY+ VGTA V PEE EP +GRI+VF DGKLQ +AEKE KGAVYS+ FNG
Sbjct: 123 CRLGRDLSVYFIVGTAMVYPEEAEPKQGRIIVFHYTDGKLQTVAEKEVKGAVYSMMEFNG 182
Query: 782 KLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLI 841
KLLA+IN ++LY+W +EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL
Sbjct: 183 KLLASINSTVRLYEWT----AEKELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLA 238
Query: 842 YKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEV 901
YK EG EE ARD+N NWMSAVEILDDD +LGAEN FNLF +K+S TDEER L+
Sbjct: 239 YKPMEGNFEEIARDFNPNWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQE 298
Query: 902 VGEYHLGEFVNRFRHGSLVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKL 960
VG +HLGEFVN F HGSLV++ L +S +V+FGTVNG+IG++ SL Y L L
Sbjct: 299 VGVFHLGEFVNVFSHGSLVLQNLGESSTPTQGSVLFGTVNGMIGLVTSLSEGWYSLLLDL 358
Query: 961 QTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV 1020
Q L KVIK VG + H WRSF+ E+KT A F+DGDLIESFLDL R +M E+ T+ +
Sbjct: 359 QNRLNKVIKSVGKIEHSFWRSFHTERKTEQATGFIDGDLIESFLDLGRAKMQEVVSTLQI 418
Query: 1021 ----------SVEELCKRVEELTRLH 1036
+V+E+ K VEELTR+H
Sbjct: 419 DDGSGMKREATVDEVIKIVEELTRIH 444
>gi|358380497|gb|EHK18175.1| hypothetical protein TRIVIDRAFT_80808 [Trichoderma virens Gv29-8]
Length = 1161
Score = 417 bits (1073), Expect = e-113, Method: Compositional matrix adjust.
Identities = 317/1124 (28%), Positives = 555/1124 (49%), Gaps = 110/1124 (9%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSEL--ITRAMGDVSDRIGRPT 63
I+G I L+ +P A D LFI T+R ++ + W+ E+++L + + + D ++ R +
Sbjct: 55 IHGTIDMLQRLQPKDSATDLLFIGTDRLQYFNIAWNPETNQLDAVEQTIHDSAEPYMRQS 114
Query: 64 DNGQIGIIDPDCRLIGLHLYDGLFKVIPFD-NKG---QLKEAFNIRLEELQVLDIKFLYG 119
+ ++DP + + +HL++G+ V KG +L+ +RL EL + FL+
Sbjct: 115 QSQNRCLVDPTGKFMAMHLWEGVLNVFRLRIRKGLTTRLEVLDQVRLTELWMKSSVFLHS 174
Query: 120 -CAKPTIVVLYQ---DNKDARHVKTYEVALKDKDFVEG---PWSQNNLDN-----GADLL 167
P I LY+ D ++AR V Y + DK V P LD A +L
Sbjct: 175 RTGHPRIAFLYKNQLDREEAR-VAVYRLTEDDKGGVSSKFDPLKDRELDQVIPDPYASML 233
Query: 168 IPVP----------------PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAY-GRVD 210
IPVP L G+L++GE + Y + + ++ K Y +
Sbjct: 234 IPVPVYEEKRYHVRNNEGARAHLGGLLVVGETLLTYFDSLTYSSVCSSIEDPKIYVAWAE 293
Query: 211 ADGSRYLLGDHAGLLHLLVITHEKEK----VTGLKIELLG------ETSIASTISYLDNA 260
DG+ Y L D G L LL I E VTG+++ + TS AS++ Y+ N
Sbjct: 294 YDGTHYFLADDYGRLDLLEIKTTNESTGVVVTGMEVHPITFQDSSRYTSRASSLVYMGNN 353
Query: 261 VVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQ---------- 310
+++IGS +GDSQL+ ++++ + S V+VL N PI+DF ++DL +
Sbjct: 354 LLFIGSHHGDSQLLHIDIETQ-QMSLVKVLS---NNAPIMDFTIMDLGNREGDAQSGNTF 409
Query: 311 --GQGQVVTCSGAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLRSSTDDPFDTFLVV 367
GQ ++V GAY+DGSLR +R+G+G+ ++ + E QG +G+++LRS + DT +++
Sbjct: 410 SSGQARIVAGCGAYQDGSLRSIRSGVGLEDRGLLDEFQGTRGLFTLRSVESEKVDT-VII 468
Query: 368 SFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSREL 427
S ++ TR+L ++ E +G +++TL + QL+Q+T V ++ S
Sbjct: 469 STLTGTRVLRFEPDNIEELYSFQGIDLESETLLAANLPNGQLLQITPRVVNVLDPDSGTS 528
Query: 428 RNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEY------EIS 481
W+SP G + A+AN LL+ G LV L + DG+ VK+A + +IS
Sbjct: 529 LCSWQSPEGKVITAASANTKWALLSIDGSILVSLNLLDGLKAVVKNASQDSVSGQPDQIS 588
Query: 482 CLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHL----GGEIIPRSVLLCA 537
CL +P +G WT ++ + L L I E L +PR V L
Sbjct: 589 CLHA---ARDPQ--DFGVIGWWTSGTISVVDLATLTPIHGEPLRQTDDSSSVPRDVALVQ 643
Query: 538 FE--GIS--YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKN-TTHVF 592
IS +L AL DG++++F +++K ++ RK V+LG+ P L + +VF
Sbjct: 644 LHPPDISGPTMLVALEDGNVISFNVSVKGFSISGRKTVTLGSGPARLHVLPQADGICNVF 703
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
A ++ ++IYSS +++YS + + + PF+S AFPDS+ ++ E + I +D+ +
Sbjct: 704 ATTEHASLIYSSEGRVVYSATTADDATFVAPFDSEAFPDSIVLSTEEHIRICHVDNERLT 763
Query: 653 HIRSIPLGEHPRRICHQEQSRTFAI-CSLKNQSCAEESEMHFVRLLDD-------QTFEF 704
H++++P+ E RR+ + + F + C K EE +RL+D+ + FE
Sbjct: 764 HVKALPMSETVRRVAYSPGLKAFGLGCIRKELIENEEVITSTIRLVDEIIFQELGKPFEL 823
Query: 705 ISTYPLDTFEYGCSILSCSFSDDSNV---YYCVGTAYVL---PEENEPTKGRILVFIVED 758
+ L+ E C I+ D + V + VGT++V EE T+GRILV V++
Sbjct: 824 NGSASLELVE--C-IIRAELPDSNGVPAERFLVGTSFVADPGTEEAGETRGRILVLGVDE 880
Query: 759 GK-LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHIL 817
+ L IA KG L + ++A + + + +Y + L
Sbjct: 881 SRQLYQIASHNLKGVCRCLAIMDDYIVAGLTKTVVVYGYTQETSTAASLNKLASFRPASF 940
Query: 818 ALYVQTRGDFIVVGDLMKSISLLIYKH----EEGAIEERARDYNANWMSAVEILDDDIYL 873
+ + G+ I +GDLM+S++L+ + ++ +EE+AR Y W ++V LD+ +L
Sbjct: 941 PIDLDVSGNIIGIGDLMQSLTLVEFTPGQDGKKPTLEEKARHYQQAWTTSVSALDESRWL 1000
Query: 874 GAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPT 933
A+ N+ +R+N E T+++R +LE+ E ++GE +NR R + P + IP
Sbjct: 1001 EADAQGNIIVLRQNQEAPTEQDRSQLEITSELNIGEQINRIRK---IQVAPAENAIVIPK 1057
Query: 934 VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAK- 992
G++ G + + + + L Q+ L++ I+ G L+ + WR+F N+ + +A
Sbjct: 1058 AFLGSIEGTLYLYGDIAPKYQDLLMTFQSRLQEYIQTPGNLSFDTWRAFRNQARDGEAPF 1117
Query: 993 NFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRLH 1036
F+DG++IE FLDL + + + + + SVE++ +EEL R+H
Sbjct: 1118 RFVDGEMIERFLDLDEKQQELVCEGLGPSVEDMRNMIEELRRMH 1161
>gi|325094412|gb|EGC47722.1| DNA damage-binding protein 1a [Ajellomyces capsulatus H88]
Length = 1201
Score = 417 bits (1073), Expect = e-113, Method: Compositional matrix adjust.
Identities = 328/1133 (28%), Positives = 544/1133 (48%), Gaps = 135/1133 (11%)
Query: 16 FRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPTDNGQIGIIDPD 74
FRP D LFI T+R + + WD ++ L T R D++D R + G +IDP
Sbjct: 92 FRPQ---TDILFIGTDRASYFTVSWDQKTKNLRTERKYVDLADPSARASQLGDRCLIDPS 148
Query: 75 CRLIGLHLYDGLFKVIP----FDNKG--------------------------------QL 98
R + L LY+G+ VIP ++G +L
Sbjct: 149 GRFLTLELYEGIVTVIPIGQPLRSRGSGRRLGKRAANASASAAYANQDAAAGAAAADLEL 208
Query: 99 KEAFNIRLEELQVLDIKFLYGCAK--PTIVVLYQDNKDARHVKTYEV------------- 143
E R+EEL V FL+ A P + +LY+D +K E+
Sbjct: 209 GEPCQARVEELLVRSSTFLHTQADTLPRMALLYEDTMGKVRLKVRELEVMHGGMGMGGGT 268
Query: 144 --------------ALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYC- 188
LK D + + L+ GA L+PVP PL G+L++GE +I Y
Sbjct: 269 GSGGGIGADGGIVAVLKGLDLL-----KEELEMGASFLVPVPAPLGGLLVLGETSIRYLD 323
Query: 189 -SANAFKAIPIR-PSITKAYGRVDADGSRYLLGDHAG-LLHLLVITHEKEKVTGLKIELL 245
++N + P++ +I A+ +VD G R+LL D G L L+++ V K++LL
Sbjct: 324 DASNECISQPLKEATIFVAWEQVD--GQRWLLADDYGRLFFLMLVLDTDNAVQSWKLDLL 381
Query: 246 GETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVV 305
G+ AS + Y+ + +IGS GDS+LI++ +GS+ EV++ + N+ PI+DF ++
Sbjct: 382 GDIPRASVLVYMGGGITFIGSHQGDSELIRIT-----EGSF-EVIQTFANIAPILDFTIM 435
Query: 306 DL------ERQ------GQGQVVTCSGAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWS 352
DL E Q GQ ++VT SGA+ DGSLR VR+G+G+ E + ++ I +W+
Sbjct: 436 DLGGRAIGESQTHEFSSGQARIVTGSGAFNDGSLRSVRSGVGMEEVGVLGAMKHITDLWA 495
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETE-IEGFCSQTQTLFCHDAIYNQLVQ 411
LR + F L+VSF+ ETR+ + E+EE E G TL + + +++Q
Sbjct: 496 LRVACPQEFSDTLLVSFVDETRVFYFTPDGEVEEKEEFMGLGLAESTLLAANLPHGRILQ 555
Query: 412 VTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTE 470
VT +VR+ + EW ++ A++N V+L GG L+ +I GD +T
Sbjct: 556 VTEWNVRVAELDGGMVIWEWSPEQQKAITAASSNDDHVVLMVGGQVLMIFDISGDINITG 615
Query: 471 VKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLG--GEI 528
K ++ ++S + + +P+ + I + ++ V SLPDL + LG G+
Sbjct: 616 EKDFGVDTQVSGVTVT---TSPARACILCLPQTAEVVV--MSLPDLAIRRSTSLGEPGDA 670
Query: 529 IPRSVLLCAF--EGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGT-QPITLRTFSS 585
+PRSVL+ + L ++ DG + +F N + LT K++LG+ QP +
Sbjct: 671 VPRSVLVAEVLPNNPATLFVSMADGSVFSFSFNSEDFSLTSMSKLTLGSEQPSFKKLPRG 730
Query: 586 KNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGT 645
+VFA ++P++IY+ +++YS VN + S +C FNS A+P S+A+A EL I
Sbjct: 731 DGLYNVFATCEQPSLIYAVEGRIVYSAVNSDQASRICHFNSEAYPGSIALATPSELKITL 790
Query: 646 IDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKN--QSCAEESEMHFVRLLDDQTFE 703
+D + I+++ +GE RR+ + R F I ++K + AE F+ L D+ F
Sbjct: 791 VDAERTTQIQTLEVGETVRRVAYSAPERAFGIGTIKRTLEDGAEVIASRFM-LADEIMFR 849
Query: 704 FISTYPLDTFEYGCSILSCSFSD-----DSNVY---YCVGTAYVLPEENEPTKGRILVF- 754
+ Y L+ E S++ F D ++++ + VGT+Y+ +GRIL F
Sbjct: 850 ELDIYDLNKDELVESVIRAQFPDGIGSEGNDLFKDLFVVGTSYLDDFGEGSIRGRILAFE 909
Query: 755 IVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHG 814
+ + +L +AE KGA +L K++AA+ + + +Y L +
Sbjct: 910 VTANRQLAKVAEMPVKGACRALAIVQDKIVAALMKTVVVYTLSKGQFADYTLSKTASYRT 969
Query: 815 HILALYVQTRGDFIVVGDLMKSISLLIYKHEEG----AIEERARDYNANWMSAVEILDDD 870
+ + G+ I V DLMKS+S++ Y+ ++ E AR + W +AV + +D
Sbjct: 970 STAPVDIAVTGNLIAVADLMKSVSIVEYQQGANGLPDSLTEVARHFQTLWSTAVAPVAED 1029
Query: 871 IYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQ 930
+L ++ NL + +N G TD++R RLEV E LGE VNR R + + ++
Sbjct: 1030 TWLESDAEGNLVMLHRNVNGVTDDDRRRLEVTSEILLGEMVNRIRPVN-IQGSQGAEAAI 1088
Query: 931 IPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNE-KKTV 989
P GTV G I + + L +LQ+ + ++ GG+ ++R+F N ++T
Sbjct: 1089 SPRAFLGTVEGSIYLFGIINPTYQDLLMRLQSAMAGMVVTPGGMPFNKFRAFRNTIRQTE 1148
Query: 990 DAKNFLDGDLIESFLDLSRTRMDEI------SKTMNVSVEELCKRVEELTRLH 1036
+ F+DG+LIE FL+ +EI V+VE + VEEL R+H
Sbjct: 1149 EPYRFVDGELIERFLNCGVELQEEIVGKVIADGVAGVTVESVKGLVEELRRMH 1201
>gi|225558618|gb|EEH06902.1| DNA damage-binding protein 1a [Ajellomyces capsulatus G186AR]
Length = 1201
Score = 416 bits (1070), Expect = e-113, Method: Compositional matrix adjust.
Identities = 329/1133 (29%), Positives = 540/1133 (47%), Gaps = 135/1133 (11%)
Query: 16 FRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPTDNGQIGIIDPD 74
FRP D LFI T+R + + WD ++ L T R D++D R + G +IDP
Sbjct: 92 FRPQ---TDILFIGTDRASYFTVSWDQKTKNLRTERKYVDLADPSARASQLGDRCLIDPS 148
Query: 75 CRLIGLHLYDGLFKVIP----FDNKG--------------------------------QL 98
R + L LY+G+ VIP ++G +L
Sbjct: 149 GRFLTLELYEGIVTVIPIGQPLRSRGSGRRLGKRAANASASAAYANQDAAAGAAAADLEL 208
Query: 99 KEAFNIRLEELQVLDIKFLYGCAK--PTIVVLYQDNKDARHVKTYEV------------- 143
E R+EEL V FL+ A P + +LY+D +K E+
Sbjct: 209 GEPCQARVEELLVRSSTFLHTQADTLPRMALLYEDTMGKVRLKVRELEVMHGGMGMGGGT 268
Query: 144 --------------ALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYC- 188
LK D + + L+ GA L+PVP PL G+L++GE +I Y
Sbjct: 269 GSGGGIGADGGIVAVLKGLDLL-----KEELEMGASFLVPVPAPLGGLLVLGETSIRYLD 323
Query: 189 -SANAFKAIPIR-PSITKAYGRVDADGSRYLLGDHAG-LLHLLVITHEKEKVTGLKIELL 245
++N + P++ +I A+ +VD G R+LL D G L L+++ V K++LL
Sbjct: 324 DASNECISQPLKEATIFVAWEQVD--GQRWLLADDYGRLFFLMLVLDTDNAVQSWKLDLL 381
Query: 246 GETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVV 305
G+ AS + Y+ + +IGS GDS+LI++ +GS+ EV++ + N+ PI+DF ++
Sbjct: 382 GDIPRASVLVYMGGGITFIGSHQGDSELIRIT-----EGSF-EVIQTFANIAPILDFTIM 435
Query: 306 DL------ERQ------GQGQVVTCSGAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWS 352
DL E Q GQ ++VT SGA+ DGSLR VR+G+G+ E + ++ I +W+
Sbjct: 436 DLGGRAIGESQTHEFSSGQARIVTGSGAFNDGSLRSVRSGVGMEEVGVLGAMKHITDLWA 495
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETE-IEGFCSQTQTLFCHDAIYNQLVQ 411
LR + F L+VSF+ ETR+ + E+EE E G TL + + +++Q
Sbjct: 496 LRVACPQEFSDTLLVSFVDETRVFYFTPDGEVEEKEEFMGLGLAESTLLAANLPHGRILQ 555
Query: 412 VTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTE 470
VT +VR+ + EW ++ A++N ++L GG L+ +I GD +T
Sbjct: 556 VTEWNVRVAELDGGMVIWEWSPEQTKAITAASSNDDHLVLMVGGQVLMIFDISGDINITG 615
Query: 471 VKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLG--GEI 528
K ++ ++S + + +P+ + I + ++ V SLPDL + LG G+
Sbjct: 616 EKDFGVDTQVSGVTVT---TSPARACILCLPQTAEVVV--MSLPDLAIRRSTSLGEPGDA 670
Query: 529 IPRSVLLCAF--EGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGT-QPITLRTFSS 585
+PRSVL+ + L ++ DG + +F N + LT K++LG+ QP +
Sbjct: 671 VPRSVLVAEVLPNNPATLFVSMADGSVFSFSFNSEDFSLTSMSKLTLGSEQPSFKKLPRG 730
Query: 586 KNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGT 645
+VFA ++P++IY+ +++YS VN + S +C FNS A+P S+A+A EL I
Sbjct: 731 DGLYNVFATCEQPSLIYAVEGRIVYSAVNSDQASRICHFNSEAYPGSIALATPSELKITL 790
Query: 646 IDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKN--QSCAEESEMHFVRLLDDQTFE 703
+D + I+++ +GE RR+ + R F I ++K + AE F+ L D+ F
Sbjct: 791 VDAERTTQIQTLEVGETVRRVAYSAPERAFGIGTIKRTLEDGAEVIASRFM-LADEIMFR 849
Query: 704 FISTYPLDTFEYGCSILSCSFSD--DS------NVYYCVGTAYVLPEENEPTKGRILVF- 754
+ Y L+ E S++ F D DS + VGT+Y+ +GRIL F
Sbjct: 850 ELDIYDLNKDELVESVIRAQFPDGIDSEGNDLFKDLFVVGTSYLDDFGEGSIRGRILAFE 909
Query: 755 IVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHG 814
+ + +L +AE KGA +L K++AA+ + + +Y L +
Sbjct: 910 VTANRQLAKVAEMPVKGACRALAIVQDKIVAALMKTVVVYTLSKGQFADYTLSKTASYRT 969
Query: 815 HILALYVQTRGDFIVVGDLMKSISLLIYKHEEG----AIEERARDYNANWMSAVEILDDD 870
+ + G+ I V DLMKS+S++ Y+ ++ E AR + W +AV + +D
Sbjct: 970 STAPVDIAVTGNLIAVADLMKSVSIVEYQQGANGLPDSLTEVARHFQTLWSTAVAPVAED 1029
Query: 871 IYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQ 930
+L ++ NL + +N G TD++R RLEV E LGE VNR R + + ++
Sbjct: 1030 TWLESDAEGNLVMLHRNVNGVTDDDRRRLEVTSEISLGEMVNRIRPVN-IQGSQGAEAAI 1088
Query: 931 IPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNE-KKTV 989
P GTV G I + + L +LQ+ + ++ GG+ ++R+F N ++
Sbjct: 1089 SPRAFLGTVEGSIYLFGIINPTYQDLLMRLQSAMAGMVVTPGGMPFNKFRAFRNTIRQAE 1148
Query: 990 DAKNFLDGDLIESFLDLSRTRMDEI------SKTMNVSVEELCKRVEELTRLH 1036
+ F+DG+LIE FL S +EI V+VE VEEL R+H
Sbjct: 1149 EPYRFVDGELIERFLGCSVELQEEIVGKVIADGVAGVTVESAKGLVEELRRMH 1201
>gi|239613967|gb|EEQ90954.1| UV-damaged DNA binding protein [Ajellomyces dermatitidis ER-3]
gi|327353314|gb|EGE82171.1| UV-damaged DNA binding protein [Ajellomyces dermatitidis ATCC 18188]
Length = 1199
Score = 416 bits (1069), Expect = e-113, Method: Compositional matrix adjust.
Identities = 328/1129 (29%), Positives = 536/1129 (47%), Gaps = 131/1129 (11%)
Query: 16 FRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPTDNGQIGIIDPD 74
FRP D LF+ T+R + + WD ++ L T R D++D R + G +IDP
Sbjct: 94 FRPQ---TDILFVGTDRSTYFTVSWDQKTKNLHTERKYVDLADGSARESQLGDRCLIDPS 150
Query: 75 CRLIGLHLYDGLFKVIP----FDNKG----------------------------QLKEAF 102
R + L LY+G+ VIP ++G +L E
Sbjct: 151 GRFLTLELYEGIVTVIPIGQPLRSRGSGRKLGKRAANASASTMQGNQDVGAADLELGEPC 210
Query: 103 NIRLEELQVLDIKFLYGCAK--PTIVVLYQDNKDARHVKTYEV----------------- 143
R+EEL V FL+ A P + +LY+D +K E+
Sbjct: 211 QARVEELLVRSSTFLHTQADMLPRMALLYEDTMGKVRLKVRELEVTYGGMGIGSGSGGGS 270
Query: 144 ----------ALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYC--SAN 191
LK D + + L+ GA L+PVP PL G+L++GE +I Y ++N
Sbjct: 271 GVGADGGSAAVLKGLDML-----KEELEMGASFLVPVPAPLGGLLVLGETSIRYLDDASN 325
Query: 192 AFKAIPIR-PSITKAYGRVDADGSRYLLGDHAG-LLHLLVITHEKEKVTGLKIELLGETS 249
+ P+ +I A+ +VD G R+LL D G L L++I V K++ LG
Sbjct: 326 ECISQPLEEATIFVAWEQVD--GQRWLLADDYGRLFFLMLILDSDNAVQSWKLDRLGNIP 383
Query: 250 IASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDL-- 307
AS + Y+ V +IGS GDSQLI++ +GS+ EV++ + N+ PI+DF ++DL
Sbjct: 384 RASVLVYMGGGVTFIGSHQGDSQLIRIT-----EGSF-EVIQTFANIAPILDFTIMDLGG 437
Query: 308 ----ERQ------GQGQVVTCSGAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLRSS 356
E Q GQ ++VT SGA+ DGSLR VR+G+G+ E + ++ I +W+LR +
Sbjct: 438 RAMGESQTHEFSSGQARIVTGSGAFNDGSLRSVRSGVGMEEVGVLGSMEHITDLWALRVA 497
Query: 357 TDDPFDTFLVVSFISETRILAMNLEDELEET-EIEGFCSQTQTLFCHDAIYNQLVQVTSG 415
+ F L+VSF+ ETR+ + +EE E G TL + +++QVT
Sbjct: 498 CPEEFSDTLLVSFVDETRVFYFTPDGGVEEKDEFMGLGLTESTLIAANLPNGRILQVTER 557
Query: 416 SVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTEVKHA 474
+VR+ + EW ++ A++N ++L GG L+ +I G+ + K
Sbjct: 558 NVRVAELDGGMVMWEWSPVSQKAITAASSNDDHLVLMVGGQVLMIFDIRGEIKVAGEKDF 617
Query: 475 QLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLG--GEIIPRS 532
++ ++S + + +P+ S I + ++ V SL DL + LG G+ +PRS
Sbjct: 618 GVDTQVSGVTVT---ASPARSCILCLPQTAEVIV--MSLADLTIRHSTSLGEPGDAVPRS 672
Query: 533 VLLCAF--EGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGT-QPITLRTFSSKNTT 589
VL+ + L ++ DG + +F N LT K++LG+ QP +
Sbjct: 673 VLVAEVLPNNPATLFVSMADGSVFSFSFNATDFSLTSMSKITLGSEQPSFKKLPRGDGLY 732
Query: 590 HVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDI 649
+VFA ++P++IYS+ +++YS VN + S +C FNS A+P S+A+A EL I +D
Sbjct: 733 NVFATCEQPSLIYSAEGRIVYSAVNSDQASRICHFNSEAYPASIALATPSELKIALVDAE 792
Query: 650 QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKN--QSCAEESEMHFVRLLDDQTFEFIST 707
+ I+++ +GE RR+ + R F I ++K + AE F+ L D+ F +
Sbjct: 793 RTTQIQTLEVGETVRRVAYSAPERAFGIGTIKRTLEDGAEVIASRFM-LADEIMFRELDV 851
Query: 708 YPLDTFEYGCSILSCSFSDDSNV--------YYCVGTAYVLPEENEPTKGRILVF-IVED 758
Y L+ E S++ F DD + + VGT+Y+ +GRIL F + +
Sbjct: 852 YDLNKDELVESVIRAQFPDDKDSEGEELMKDLFIVGTSYLDDFGEGSIRGRILAFEVTPN 911
Query: 759 GKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILA 818
+L +AE KGA +L K++AA+ + + +Y L +
Sbjct: 912 RQLGKVAEMPVKGACRALAIVQDKIVAALMKTVVVYTLSKGQFADYILTKTASYRTSTAP 971
Query: 819 LYVQTRGDFIVVGDLMKSISLLIYKHE----EGAIEERARDYNANWMSAVEILDDDIYLG 874
+ + G+ I V DLMKS+S++ Y+ G++ E AR + W +AV + D +L
Sbjct: 972 IDIAVTGNLIAVADLMKSVSIVEYQQGTDGLSGSLTEVARHFQTLWSTAVAPVAQDTWLE 1031
Query: 875 AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTV 934
++ NL +R+N G T+++R RLEV E LGE VNR R ++ L ++ P
Sbjct: 1032 SDAEGNLVVLRRNVNGVTEDDRRRLEVTSEVLLGEMVNRIRPVNIQASL-GTEAAISPRA 1090
Query: 935 IFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNE-KKTVDAKN 993
GTV G I + + L +LQ+ + ++ GG+ ++R+F N ++ +
Sbjct: 1091 FLGTVEGSIYLFGIINPTYQDLLMRLQSAMAGMVVTPGGMPFNKFRAFRNTVRQAEEPYR 1150
Query: 994 FLDGDLIESFLDLSRTRMDEI------SKTMNVSVEELCKRVEELTRLH 1036
F+DG+LIE FL +EI V+VE + VEEL R+H
Sbjct: 1151 FVDGELIERFLGCGAELQEEIVGKVIADGVAGVTVERVKAIVEELKRMH 1199
>gi|340521192|gb|EGR51427.1| predicted protein [Trichoderma reesei QM6a]
Length = 1161
Score = 414 bits (1064), Expect = e-112, Method: Compositional matrix adjust.
Identities = 314/1125 (27%), Positives = 555/1125 (49%), Gaps = 112/1125 (9%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSEL--ITRAMGDVSDRIGRPT 63
I+G I L+ +P A D LFI T+R ++ L W+ E+++L + + + D ++ R +
Sbjct: 55 IHGTIDMLQRLQPKDSATDLLFIGTDRLQYFNLAWNPETNQLDAVEQTIEDTAEPYMRQS 114
Query: 64 DNGQIGIIDPDCRLIGLHLYDGLFKVIPFD-NKG---QLKEAFNIRLEELQVLDIKFLYG 119
+ ++DP + + +HL++G+ V KG +L+ +RL EL + FLY
Sbjct: 115 QSQNRCLVDPTGKFMAMHLWEGVLNVFRLRIRKGLTTKLEGLDQVRLTELWMKASTFLYS 174
Query: 120 -CAKPTIVVLYQ---DNKDARHVKTYEVALKDKDFVEGPWS-QNNLD-------NGADLL 167
PTI LY+ D ++AR + Y + DK V + Q N D A +L
Sbjct: 175 RTGHPTIAFLYKNQLDREEAR-IAVYRLTEDDKPGVSSKFDPQKNRDFEEVIRDPYASML 233
Query: 168 IPVP----------------PPLCGVLIIGEETIVYCSANAFKAIP---IRPSITKAYGR 208
IPVP L G+L++GE + Y + + ++ P I A+
Sbjct: 234 IPVPVYEEKRYHVRNNEGARAHLGGLLVVGETLLTYFDSLTYSSVCSSLAEPKIYVAWA- 292
Query: 209 VDADGSRYLLGDHAGLLHLLVITHEKEK----VTGLKI------ELLGETSIASTISYLD 258
+ DG+ YLL D G L +L I E VTG+++ +L TS AS++ Y+
Sbjct: 293 -EYDGTHYLLADDYGRLDMLEIKTTNESTGVVVTGMEVHPFAFEDLSRYTSRASSLVYMG 351
Query: 259 NAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQ-------- 310
N +++IGS +GDSQL+++ D + + +L+ N PI+DF ++DL +
Sbjct: 352 NNLLFIGSHHGDSQLLRI----DIETQQMTLLKVLPNNAPIMDFTIMDLGNREGDAQAGN 407
Query: 311 ----GQGQVVTCSGAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLRSSTDDPFDTFL 365
GQ ++V GAY+DGSLR +R+G+G+ ++ + E G +G+++LRS + DT +
Sbjct: 408 TFSSGQARIVAGCGAYQDGSLRSIRSGVGLEDRGLLDEFPGTRGLFTLRSVDSEKVDT-V 466
Query: 366 VVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSR 425
VVS ++ TR+L E E +G +++TL + QL+Q+T V L+ S
Sbjct: 467 VVSTLAGTRVLRFEPESIEEVYSFQGMDLESETLLAANLPNGQLLQITPRVVTLLDPESG 526
Query: 426 ELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEY------E 479
W+SP G + A+AN LL+ G LV L + DG+ V+ A +
Sbjct: 527 ASLCSWQSPDGKVITAASANTKWALLSIDGSLLVSLNLLDGLKAVVQDASENPVSGQPDQ 586
Query: 480 ISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHL----GGEIIPRSVLL 535
ISC I+ + P + VG W ++ + L L + E L +PR V L
Sbjct: 587 ISC--IHAARDPPDF---GVVGWWASGTISVVDLATLTPLHGESLRQTDDSASVPRDVAL 641
Query: 536 CAFEGISY----LLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKN-TTH 590
LL AL DG++++F +++K ++ RK V+LG+ P L + +
Sbjct: 642 VQLHPPDMSGPTLLVALEDGNVISFNVSLKGFSISGRKTVTLGSGPARLHVLPRPDGICN 701
Query: 591 VFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQ 650
VFA ++ ++IYSS +++YS + + + PF+S AFPDS+ ++ + + I +D +
Sbjct: 702 VFATTEHASLIYSSEGRIVYSATTADDATFVAPFDSEAFPDSIVLSTDEHIRICHVDSER 761
Query: 651 KLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCA-EESEMHFVRLLDD-------QTF 702
H++S+P+ E RR+ + + F + +K + EE VRL+D+ Q F
Sbjct: 762 LTHVKSLPMHETVRRVAYSPGLKAFGLGCIKKELVENEEVVTSTVRLVDEIIFQELGQPF 821
Query: 703 EFISTYPLDTFEYGCSILSCSFSDDSNV--YYCVGTAYVL---PEENEPTKGRILVFIVE 757
E ++ L+ E C I + + N+ + VGT++V +E T+GRI+V V+
Sbjct: 822 ELNASASLELVE--CVIRAELPDSNGNMTERFLVGTSFVADPGTDEAGETRGRIVVLGVD 879
Query: 758 DGK-LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHI 816
+ + L IA KG L + ++A +++ + +Y + L +
Sbjct: 880 ESRQLYQIASHNLKGVCRCLAMLDDYIVAGLSKTVVVYSYAQETSTAASLTKVASYRPAS 939
Query: 817 LALYVQTRGDFIVVGDLMKSISLLIYKHEE----GAIEERARDYNANWMSAVEILDDDIY 872
+ + G+ I VGDLM+S++L+ + + +EE+AR Y W ++V LD+ +
Sbjct: 940 FPVDLDVSGNMIGVGDLMQSLTLIEFTPPQDGKMAKLEEKARHYQQAWTTSVCALDETRW 999
Query: 873 LGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIP 932
L A+ N+ +R+ E T+++R +LE+ E ++GE +NR R + P + +P
Sbjct: 1000 LEADAQGNVIVLRQRQEAPTEQDRSQLEITSELNIGEQINRIRKLQVA---PGENAVVVP 1056
Query: 933 TVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAK 992
G++ G + + + + L Q+ L+ I+ G L+ + WR+F N+ + ++
Sbjct: 1057 KAFLGSIEGTLYLYGDIAPKYQDLLMTFQSRLQGYIQTPGNLSFDLWRAFRNQAREGESP 1116
Query: 993 -NFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRLH 1036
F+DG++IE FLDL ++ + + + + +VE++ +EEL R+H
Sbjct: 1117 YRFVDGEMIERFLDLDESQQELVCEGLGPNVEDMRNLIEELRRMH 1161
>gi|156049323|ref|XP_001590628.1| hypothetical protein SS1G_08368 [Sclerotinia sclerotiorum 1980]
gi|154692767|gb|EDN92505.1| hypothetical protein SS1G_08368 [Sclerotinia sclerotiorum 1980 UF-70]
Length = 1153
Score = 414 bits (1064), Expect = e-112, Method: Compositional matrix adjust.
Identities = 331/1118 (29%), Positives = 561/1118 (50%), Gaps = 108/1118 (9%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPTD 64
IYGRI+ L+ +P G D LF+ T R ++ + W+ + +L T ++ D+S R ++
Sbjct: 55 IYGRISMLQKIQPAGSKTDHLFVGTVRAQYFTVMWNPNTHKLDTMQSFVDISQEHMRDSE 114
Query: 65 NGQIGIIDPDCRLIGLHLYDGLFKVI----PFDNKGQ-LKEAFNIRLEELQVLDIKFLYG 119
+ ++DP RL+ + LY+G+ ++ P K L++ +R+ E++V FLY
Sbjct: 115 SRDRCLVDPTGRLLVMELYEGVLNLVKIVKPRGGKTDYLEKPEQVRISEMKVRASAFLYT 174
Query: 120 CAK-PTIVVLYQDNKDARHVKTYEVALKDK-----DFVEGPWSQNNLDN---GADLLIPV 170
K P + +LYQD ++ ++ TY + L DK F +N+LD+ GA +IPV
Sbjct: 175 HTKQPKLALLYQDARENVNLATYRM-LDDKGHPNLQFDPKKNRENDLDDLCVGAMHIIPV 233
Query: 171 P-------------------PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAY---GR 208
P L GV+++GE Y + KAI + ++ +A
Sbjct: 234 PKGEDEGSKRYIVRNATTAKANLGGVVVLGETKFTYLDDES-KAI-VEYALDEAVLWAAW 291
Query: 209 VDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSY 268
D YLLGD G L+L I + VTG+K+ LG+ S +++ L N V Y+ S
Sbjct: 292 EPIDDRNYLLGDDYGFLYLFTILVDGATVTGMKVLKLGQVSKPTSLENLGNGVFYVASHE 351
Query: 269 GDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDL---ERQ--------GQGQVVT 317
D+Q+I+++L + V VL+ N+ PI+DF ++D+ E Q GQ ++VT
Sbjct: 352 ADNQVIQIDL--GSPNYSVTVLQVLPNIAPILDFTIMDMGGREGQTQLNEYSSGQARLVT 409
Query: 318 CSGAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLRSSTDDPFDTFLVVSFISETRIL 376
SG ++ GSLR VR+G+G+++ A + E++GI +++L S P DT LVVSF +ETR
Sbjct: 410 GSGGFEGGSLRSVRSGVGLDDIAILAEMEGIHKVFALHSGPSLPNDT-LVVSFSTETRFF 468
Query: 377 AMNLEDELEETE-IEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPP 435
+ ++EE I + ++TL ++ ++QVT V + + + W+ P
Sbjct: 469 KFDSLGDIEEVNTISNLSAASETLLTYNLDNGCILQVTQHEVAI---HGKSPGHRWQPPD 525
Query: 436 GYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQL-EYEISCLDINPIGENPSY 494
G + A+ N + +LL++ G LV L I L EV +L + +++C+ + +
Sbjct: 526 GQIITAASGNQNYILLSSNGRTLVTLSIEQN-LAEVAFQELGDDQVACIHVPQL-----L 579
Query: 495 SQIAAVGMWTDISVRIFSLPDLNLITKEHL---GGEIIPR-----SVLLCAFEGISYLLC 546
I +G+W S+ + L L I E L G IPR +LL G + L
Sbjct: 580 GDIGVIGLWKSGSISLLDLKTLKTIVSEDLRRNDGASIPRDIALTQILLPELSGPT-LFV 638
Query: 547 ALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTT-HVFAASDRPTVIYSSN 605
++ DG +L+F ++ + L+ RK + LGTQ L+ NTT +VFA + P++IY S
Sbjct: 639 SMEDGIVLSFDVDSENCSLSGRKSIVLGTQQARLQILPRDNTTFNVFATCEHPSLIYGSE 698
Query: 606 KKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRR 665
+ +YS V ++ +C FNS A+PDS+ +A EL + ID+ ++ H+R++P+GE RR
Sbjct: 699 GRTVYSAVTAEDAITVCSFNSRAYPDSVVVATTNELKLSAIDNERRTHVRTLPIGETVRR 758
Query: 666 ICHQEQSRTFAICSLKNQ-SCAEESEMHFVRLLDDQTFEFIST--YPLDTFEYGCSI--- 719
I + + R FAI ++K + + +E RL+D+ F + Y E S+
Sbjct: 759 IAYSAKERAFAIGAIKRELTKGQEIVTTSFRLVDEVVFGELGEPFYLPANDEIIESVIRA 818
Query: 720 -LSCSFSDDSNV-YYCVGTAYVLPEENEPTKGRILVFIV-EDGKLQLIAEKETKGAVYSL 776
L + D V + VGT++ L + +GR+L+F V D +IA KG+ +
Sbjct: 819 ELPTKYGDGELVERFLVGTSF-LHDGEVNIRGRLLIFGVNSDRTPYIIASHTLKGSCRCI 877
Query: 777 NAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKS 836
NGK++AA+N+ + +Y + T L+ + + + RG+ I V D+MKS
Sbjct: 878 GVLNGKIVAALNKTVVMYDYEETSRTTANLRKVATYRCATCPIDIDIRGNIIAVADIMKS 937
Query: 837 ISLLIYKHE----EGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGAT 892
++L+ Y +EE R + +++ +D D YL ++++ NL +++N EG T
Sbjct: 938 VALVEYTPGVDGLPDKLEEVGRHAQQVFATSIAEVDTDTYLESDHDGNLIVLKRNREGVT 997
Query: 893 DEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHE 952
E++ RLEV+ E +LGE VN+ + ++ D IP T G I + + +P +
Sbjct: 998 REDKLRLEVLCEMNLGEMVNKIKRINVET---SKDALLIPRAFVATTEGSIYLFSLIPPQ 1054
Query: 953 QYLFLEKLQTNLRKVIK--------------GVGGLNHEQWRSFNNE-KKTVDAKNFLDG 997
L +LQ+ L + G L+ +++RS+ + ++T + F+DG
Sbjct: 1055 NQDLLMRLQSRLASLPARSLTDSSFSAPIEFSPGNLDFDKYRSYVSAVRETNEPFRFVDG 1114
Query: 998 DLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRL 1035
+LIE FLDL + I + V E+L VE L RL
Sbjct: 1115 ELIERFLDLDGAIQENICDGLGVRAEDLRGVVEGLRRL 1152
>gi|391865638|gb|EIT74917.1| damage-specific DNA binding complex, subunit DDB1 [Aspergillus oryzae
3.042]
Length = 1135
Score = 414 bits (1063), Expect = e-112, Method: Compositional matrix adjust.
Identities = 329/1110 (29%), Positives = 548/1110 (49%), Gaps = 108/1110 (9%)
Query: 6 IYGRIATL-ELFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPT 63
+Y +++ L L P D LF+ T+RY +C L WD+ + + T R D++D R +
Sbjct: 55 LYAKVSMLARLPAPAHSPTDHLFVGTDRYTYCTLSWDSAQNRIRTERNYVDIADPSSRES 114
Query: 64 DNGQIGIIDPDCRLIGLHLYDGLFKVIPF--------------------DNKGQLKEAFN 103
G +IDP R + L +Y+GL V+P G+L E
Sbjct: 115 QTGNRCLIDPSGRFMTLEVYEGLVAVVPIVQLPARKRGRAPAVPTGPDAPKVGELGELTT 174
Query: 104 IRLEELQVLDIKFLY-GCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQN---- 158
R++EL V FL+ P + +LY+DN+ K + +++ ++ S +
Sbjct: 175 ARIDELFVRSSAFLHVQSGLPRLALLYEDNQ-----KKVRLMVRELNYTSATASTSADAT 229
Query: 159 ---------NLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIRP----SITKA 205
LD GA LIPVP +LI+GE +I Y + + I RP +I A
Sbjct: 230 LTHIADFAQELDLGASHLIPVP----DLLILGETSIKYVDDDNNEIIS-RPLDEATIFVA 284
Query: 206 YGRVDADGSRYLLGDHAG-LLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYI 264
+ VD+ R+LL D G L L+++ +V G K++ LG TS AS + YL +V++
Sbjct: 285 WEGVDS--QRWLLADDYGRLFFLMLVLDSDNQVQGWKLDHLGNTSRASALVYLGGGIVFV 342
Query: 265 GSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDL-----ERQ------GQG 313
GS GDSQ++++ GS+ E+++ N+ PI+DF ++DL E Q GQ
Sbjct: 343 GSHQGDSQVLRI-----GNGSF-EIIQALSNIAPILDFTIMDLGNRTSESQTHEFSSGQA 396
Query: 314 QVVTCSGAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLR-SSTDDPFDTFLVVSFIS 371
++VT SGA+ DG+LR VR+G+G+ E + +++ I +W L+ + D DT L+V+FI
Sbjct: 397 RIVTGSGAFDDGTLRSVRSGVGMEELGVLGDMEHITDLWGLQVQAGGDTLDT-LLVTFID 455
Query: 372 ETRILAMNLEDELEETE-IEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNE 430
ETR+ + + E+EE + G TL + +++QVT V + + E
Sbjct: 456 ETRVFHFSPDGEVEELDHFLGLSLSENTLLAANLPRGRILQVTEQRVLIADLEGEMVVYE 515
Query: 431 WKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD--GILTEVKHAQLEYEISCLDINPI 488
W P + A+AN ++L GG + L+IG ++TE K + ++S + +
Sbjct: 516 WTPPNELVITAASANDDSLVLVIGGELMTVLDIGTEAQVITEKKFGA-DSQVSGVTVPA- 573
Query: 489 GENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLG--GEIIPRSVLLCAF--EGISYL 544
S +++ VG V + L DL + LG GE PRSVL+ + L
Sbjct: 574 ----SPTEVCVVGFPQLAKVSVLRLRDLTEVHTTSLGPAGEAFPRSVLVADVLADSPPTL 629
Query: 545 LCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTH-VFAASDRPTVIYS 603
++ DG ++ + L+ K+ LG++ T + + + VFA + P++IY
Sbjct: 630 FISMADGSVITYSFKTDDYSLSHMNKLILGSEQPTFKKLPRGDGLYNVFATCENPSLIYG 689
Query: 604 SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHP 663
S +++YS VN + S +C FNS A+P S+A+A EL I +D + I+++ +G
Sbjct: 690 SEGRIIYSAVNSEGASRVCHFNSEAYPGSIAVATLHELKIALVDRERTTQIQTLQIGATV 749
Query: 664 RRICHQEQSRTFAICSLKNQSCAEESEMHFVRLL--DDQTFEFISTYPLDTFEYGCSILS 721
RR+ + + F I +++ + A+ +E+ R + D+ F + ++ L E S++
Sbjct: 750 RRVAYSPSEKAFGIGTIERK-LADGAEIVTSRFMLADEVLFRQLDSFELRPEEIVESVIR 808
Query: 722 CSF--SDDSNV------YYCVGTAYVLPEENEPTKGRILVFIVEDG-KLQLIAEKETKGA 772
F D N + VGTAY+ E E +GRIL+F +++G KL +AE KGA
Sbjct: 809 AEFPAGKDENGREMTKDRFVVGTAYLDDEGEESIRGRILMFEIDNGRKLTKVAELPVKGA 868
Query: 773 VYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGD 832
+L K++AA+ + I +YK + + GT +L+ + V G+ IVV D
Sbjct: 869 CRALAMLGDKIVAALVKTIVIYKVVNNNFGTMKLEKLASFRTSTAPVDVTVVGNVIVVSD 928
Query: 833 LMKSISLLIYKHEEG----AIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNS 888
LMKS+ LL +K E ++ E AR + W + V +D D +L ++ NL +R+N
Sbjct: 929 LMKSVCLLEFKEGENGLPDSLTEVARHFQTVWATGVACIDKDTFLESDAEGNLIVLRRNL 988
Query: 889 EGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIAS 948
G +++R RLEV E LGE VNR R ++ + V P GTV G I + A
Sbjct: 989 AGVEEDDRRRLEVTSEISLGEMVNRIRPVNIQQL---ASVTVTPRAFLGTVEGSIYLFAI 1045
Query: 949 LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNE-KKTVDAKNFLDGDLIESFLDLS 1007
+ E FL +LQ + ++ +G + ++R F + ++ + F+DG+LIE FL+
Sbjct: 1046 INPEHQDFLMRLQATMAGKVESLGEMPFNEFRGFRSMVREATEPYRFVDGELIEQFLNCE 1105
Query: 1008 RTRMDEISKTMN-VSVEELCKRVEELTRLH 1036
+EI ++ ++V E+ +E L RLH
Sbjct: 1106 PELQEEIVNSVGMMNVHEVKVMIEALRRLH 1135
>gi|367033707|ref|XP_003666136.1| hypothetical protein MYCTH_2310607 [Myceliophthora thermophila ATCC
42464]
gi|347013408|gb|AEO60891.1| hypothetical protein MYCTH_2310607 [Myceliophthora thermophila ATCC
42464]
Length = 1043
Score = 414 bits (1063), Expect = e-112, Method: Compositional matrix adjust.
Identities = 310/1047 (29%), Positives = 525/1047 (50%), Gaps = 94/1047 (8%)
Query: 70 IIDPDCRLIGLHLYDGLFKVIPF---DNKGQLKEAFNIRLEELQVLDIKFLYG-CAKPTI 125
++DP R + +HL++G+ ++ NK L +RL EL + FLY P I
Sbjct: 11 LVDPTGRFLAMHLWEGVLTMMRLGDRKNKMVLTWMDQVRLSELFIRASTFLYTETGHPKI 70
Query: 126 VVLYQDNKDARHVK--TYEVALKDKDFVEGPWSQNN--------LDNGADLLIPV----- 170
LYQ D+ K TY + D++ ++ D+ A +LIPV
Sbjct: 71 AFLYQSRADSGDAKLATYRMTADDRNTEASKFNPERDREIDMDVPDSSASMLIPVRKVEE 130
Query: 171 -------------PPPLCGVLIIGEETIVYCSANAFKAIPIRPSITKA---YGRVDADGS 214
+ G++++GE ++Y + + ++ ++ +A + +
Sbjct: 131 GVKRHNVRNVESAKAHIGGLVVVGETRLLYI--DEVTKVTVQSALKEASIFVAWAEYNAV 188
Query: 215 RYLLGDHAGLLHLLVITHEKEK---VTGLKIELLGETSIASTISYLDNAVVYIGSSYGDS 271
Y L D G LHLL + E VT L + +G+TS AS + YL N ++++GS YGDS
Sbjct: 189 NYFLADDYGNLHLLTLVTENADSVVVTSLDVNWIGKTSRASHLVYLGNDLLFVGSHYGDS 248
Query: 272 QLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQG------------QGQVVTCS 319
QL +L+LQ + + +++++ N+GPI+DF ++D+ +G Q ++VTCS
Sbjct: 249 QLFRLDLQNEDPKALLQLVQTLHNIGPILDFAIMDMGNRGDEGQLGNEYSSGQARIVTCS 308
Query: 320 GAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAM 378
G +KDGSLR VR+G+G+ + + +L+ +G++SL+S P L VSF++ETR+
Sbjct: 309 GVHKDGSLRSVRSGVGLEDVGILADLEHCRGLFSLKS-YGAPKTDILAVSFLTETRVFKF 367
Query: 379 NLEDELEETE-IEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGY 437
+ + ++EE E G QTL D +L+QVT + ++ + S WK
Sbjct: 368 DAQGDVEELESFAGLAFDQQTLLATDLQGGRLLQVTPAAATVLDTESGLTVASWKPGDQR 427
Query: 438 SVNVATANASQVLLATGGGHLVYLEIGDGI-LTEVKHAQLEYEISCLDINPIGENPSYSQ 496
S+ A+AN+ +LL+ G LV L I D + + + + +I+C+ + +N
Sbjct: 428 SIINASANSKSLLLSVDGTELVSLSISDDLQVVRTRDISKQDQIACIHVPAQPDN----- 482
Query: 497 IAAVGMWTDISVRIFSLPDLNLITKEHL----GGEIIPRSVLLCAF--EGIS--YLLCAL 548
+ VG W+ +V I L L I E L IPR + L G+S L A+
Sbjct: 483 VGVVGFWSSGTVSIIDLHTLEPIHGESLRTSQDDASIPRDLALVQILPPGVSGPTLFVAM 542
Query: 549 GDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTH-VFAASDRPTVIYSSNKK 607
DG+++ F + K L+ RK+V LG + L + + +FA ++ P++IY S +
Sbjct: 543 QDGNVVTFNIT-KDLALSGRKRVILGMRQARLHLLPQPDGIYSIFATTEHPSLIYGSEGR 601
Query: 608 LLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRIC 667
++YS V +E +++CPF++ AFP+ +A+A + +L I ID ++ H++ + + E RRI
Sbjct: 602 IVYSAVTAEEATYICPFDAQAFPECIALATDKQLKICHIDRERRTHVKPLAMNEMVRRIA 661
Query: 668 HQEQSRTFAICSLKNQSCA-EESEMHFVRLLDDQTFEFIS-TYPLDTFEYG----CSI-- 719
+ + + F + +K + A EE +L+D+ F+ + ++PL Y C +
Sbjct: 662 YSPKEKVFGLGCIKRELVAGEEVVQSSFKLVDEVIFDRVGRSFPLGAPSYTELVECVVRA 721
Query: 720 -LSCSFSDDSNVYYCVGTAYVLPEENEP---TKGRILVFIVE-DGKLQLIAEKETKGAVY 774
L S+ + + + VGT+++ + P +GR+L F V+ D LI E KGA
Sbjct: 722 ELRDSYGNPAE-RFIVGTSFLPDPDYGPGTDVRGRMLFFGVDADRNPYLILSHELKGACR 780
Query: 775 SLNAF-NGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDL 833
+ +G ++A + + + + ++ T EL + + RG+ I V DL
Sbjct: 781 CVAVMEDGTIVAGLTKTVVVCRYEETSSTTGELTRLASYRPSSYPAEICVRGNTIAVADL 840
Query: 834 MKSISLLIYKHEEGA--IEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGA 891
MKS+SL+ Y +G + ERAR + + W +AV + + +L A+ NL +R+N EG
Sbjct: 841 MKSVSLVEYVPGDGTPRLVERARHFGSVWATAVGFVREGSWLEADAQGNLMVLRRNVEGV 900
Query: 892 TDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASL-P 950
T E+R R+E+ E +LGE VNR R V P + V +P GTV G I + ++ P
Sbjct: 901 TAEDRRRMEITSEMNLGEMVNRIREVE-VETTPGALV--VPRAFLGTVEGGIYMFGTVAP 957
Query: 951 HEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAK-NFLDGDLIESFLDLSRT 1009
H Q L L + Q L V+K G + +R+F N ++ D FLDG+L+E FLD+
Sbjct: 958 HVQDLLL-RFQGKLADVLKTAGDIPFRSYRAFRNAEREGDGPFRFLDGELLERFLDVDEA 1016
Query: 1010 RMDEISKTMNVSVEELCKRVEELTRLH 1036
+ I K + SVE++ VEEL R+H
Sbjct: 1017 TQEIICKGLGPSVEDMRNMVEELRRMH 1043
>gi|302503607|ref|XP_003013763.1| hypothetical protein ARB_07875 [Arthroderma benhamiae CBS 112371]
gi|291177329|gb|EFE33123.1| hypothetical protein ARB_07875 [Arthroderma benhamiae CBS 112371]
Length = 1148
Score = 412 bits (1059), Expect = e-112, Method: Compositional matrix adjust.
Identities = 326/1117 (29%), Positives = 544/1117 (48%), Gaps = 109/1117 (9%)
Query: 6 IYGRIATLELFRPHGEA-----QDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRI 59
IYGRI L+ P A D LF+ T++Y + + WD+ +L T R D++D
Sbjct: 55 IYGRITLLKKL-PRSTAGGLALTDALFVGTDQYAYFSITWDSVHEQLRTERKYIDLADGS 113
Query: 60 GRPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKG---------------------QL 98
R + IDP + L +Y+G+ + P QL
Sbjct: 114 LREAHSDDRCQIDPSGSFLTLEVYEGVVSIFPLVTADSHKRAKSAVSASASSASTPVEQL 173
Query: 99 KEAFNIRLEELQVLDIKFLYGCAK--PTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWS 156
E +R+EEL V FL A P +LY+D + +K ++ P S
Sbjct: 174 GEPLQVRIEELMVRSSAFLDQEASNTPRFALLYEDTQGKVKLKLRDLKYTHAIITSDPGS 233
Query: 157 QNNL----------DNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIRP----SI 202
L D GA +LIPVP PL G+LIIGE +I Y + + I RP ++
Sbjct: 234 AAELKDVTTLSDEVDLGASILIPVPRPLGGLLIIGESSIKYVDVSRNETIS-RPLAESTV 292
Query: 203 TKAYGRVDADGSRYLLGDHAG-LLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAV 261
A+ +VD G R+LL D G L L+++ V K++ LG TS AS + YLD +
Sbjct: 293 FVAWEQVD--GQRWLLADDYGRLFFLMLVLDADNAVDTWKVDFLGVTSRASVLVYLDGGI 350
Query: 262 VYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQ---------GQ 312
V++GS GDSQ+I++ +G + ++++ N+ PI+DF V+D+ + GQ
Sbjct: 351 VFVGSHQGDSQVIQIK-----EGGF-DLVQTISNIAPILDFTVMDMGDRSGATREFSSGQ 404
Query: 313 GQVVTCSGAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLRSSTDDPFDTFLVVSFIS 371
++VT SGA+ DGSLR VR+G+GI E + ++ I +W+LRS+ +PF L+VSF++
Sbjct: 405 TRIVTGSGAFGDGSLRSVRSGVGIEELGVLASMEHITDLWTLRSACPEPFFDTLLVSFVN 464
Query: 372 ETRILAMNLEDELEETE--IEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRN 429
ETR+ + + ++EE E G TL + N+++QVT + R + S +
Sbjct: 465 ETRVFHFSADGDVEEKEDGFLGLVFSQSTLLATNIPGNRILQVTESTSRAIDVDSGMII- 523
Query: 430 EWKSP-PGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDINPI 488
W+S +++ A+AN ++L GG LV + + L + + + ++S + I
Sbjct: 524 -WRSSYEEFTITSASANDDYLVLVLGGVRLVCISLSTFELVGSRDFEADNQVSGMTIPA- 581
Query: 489 GENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLG--GEIIPRSVLLCAF--EGISYL 544
S +Q V + + I +LP+L + K+ LG GE IPRSV++ L
Sbjct: 582 ----SPTQACIVCLPQSAEIIILNLPELEVKNKQALGEPGEAIPRSVIVAEILPNKPPTL 637
Query: 545 LCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTH-VFAASDRPTVIYS 603
++ DG + +F ++ + + K++LG++ + + N + VFA D P++I++
Sbjct: 638 FVSMADGTVFSFSFDVHAFAIFNSSKITLGSEQPSFKKLPRGNGQYNVFATCDHPSLIHA 697
Query: 604 SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHP 663
S +++YS V+ S +C N+ A+P S+A++ + EL I +D+ + I ++P+
Sbjct: 698 SEGRIVYSAVDSASASRICSLNTQAYPGSIALSSQNELKIAIVDEERTTQIHTLPMHASV 757
Query: 664 RRICHQEQSRTFAICSLKNQ--SCAEESEMHFVRLLDDQTFEFISTYPL----------- 710
RR+ + + F + ++K + + EE FV L D+ F +STY L
Sbjct: 758 RRLAYSPMEKAFGLGTVKRKISNGVEEVSSSFV-LADEILFRPLSTYDLRPDELVECVIR 816
Query: 711 DTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGK-LQLIAEKET 769
YG + S S D + VGTA++ ++ +GRIL+F V + L LI EK
Sbjct: 817 SQLNYGKDEVGNSISKD---LFFVGTAFLDDVGDDHIRGRILIFEVNRSRELSLIVEKSL 873
Query: 770 KGAVYSLNAFNGKLL-AAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFI 828
GA +L + LL A + + + ++K G L+ + + + GD +
Sbjct: 874 MGACRTLAVMDHTLLVAGLVKSVSVFKLARDRFGNILLEKHTAYRTSTAPIDISVVGDTV 933
Query: 829 VVGDLMKSISLLIY-KHEEG----AIEERARDYNANWMSAVEILDDDIYLGAENNFNLFT 883
V D+MKS+SL+ Y + EEG EE AR Y W +AV +++++YL A+ NL
Sbjct: 934 AVADVMKSMSLVQYTQAEEGEQEPKFEEVARHYQTLWSTAVAPIEENVYLLADAEGNLVV 993
Query: 884 VRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVI 943
+++N G T+ +R RL+ E LGE VNR H +V ++ V ++ V+G I
Sbjct: 994 LQQNITGVTESDRKRLQPTSEIRLGEMVNRI-HPIVVQTHTETAVSA-RALLATQVDGSI 1051
Query: 944 GVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAK-NFLDGDLIES 1002
+ + L +LQT + + G + ++R+F + D F+DG+LIE
Sbjct: 1052 YLFGLINPAYIDLLLRLQTAMGSITISPGEIPFPKYRAFRTTVRQSDEPFRFVDGELIER 1111
Query: 1003 FLDLSRTRMDEISKTMN---VSVEELCKRVEELTRLH 1036
FL + +EI ++ V+V L + +EEL R+H
Sbjct: 1112 FLSCTPAMQEEIVGRLDDSTVTVSSLKEMIEELRRMH 1148
>gi|310796681|gb|EFQ32142.1| CPSF A subunit region [Glomerella graminicola M1.001]
Length = 1163
Score = 411 bits (1056), Expect = e-111, Method: Compositional matrix adjust.
Identities = 320/1128 (28%), Positives = 546/1128 (48%), Gaps = 117/1128 (10%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPTD 64
+YG I L+ RP D LF+ T+R+++ +W + L T + + D ++ R
Sbjct: 56 VYGTILLLQRLRPKDSNADLLFVGTDRFQYFTARWGPGTQRLHTEQVIEDAAEPHMRDAQ 115
Query: 65 NGQIGIIDPDCRLIGLHLYDGLFKVIPFDN-KGQ---LKEAF-NIRLEELQVLDIKFL-Y 118
+ ++DP R + +HL++G+ V+ N KGQ L A+ +RL EL V F+
Sbjct: 116 SQDKCLVDPTGRFMAMHLWEGVMNVMRLGNRKGQFARLDGAWEQVRLSELFVKASAFVPT 175
Query: 119 GCAKPTIVVLYQ---DNKDARHVKTYEVALKDKDF--------VEGPWSQNNLDNGADLL 167
PTI LYQ D +DAR + Y + DK+ + + + D A ++
Sbjct: 176 DTGNPTIAFLYQSNIDKEDAR-LAIYRLMGDDKNTNVSRFDPSKDREYELDIKDPYARII 234
Query: 168 IPVP------------------PPLCGVLIIGEETIVYCSANAFKAIP---IRPSITKAY 206
IPVP L G++++GE +VY + P+I A+
Sbjct: 235 IPVPIIEDDVKRYHKRDTTGSKAQLGGLVVVGETLLVYVDTLTGTVVEEGLDSPAIFVAW 294
Query: 207 GRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGS 266
D + Y L D G LHLL I E VTGL + +LG TS AS + Y+ + ++++GS
Sbjct: 295 AAYDH--TNYFLSDDYGNLHLLTIQTEGVVVTGLFMRMLGVTSRASCLVYMGDGMLFLGS 352
Query: 267 SYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLER-----------QGQGQV 315
YGDSQL+ ++++ + ++++ N+ PI+DF ++DL GQ ++
Sbjct: 353 HYGDSQLLSVSVETET----TKLVQTIPNIAPILDFSIMDLGNAGDSQVGNAFSSGQARI 408
Query: 316 VTCSGAYKDGSLRIVRNGIGINEQASVE-LQGIKGMWSLRSSTDDPFDTFLVVSFISETR 374
V G +++GSLR +R+ +G+ + ++ L+ ++G++SLRS DT LVVSFI+ETR
Sbjct: 409 VAGCGVHQNGSLRSIRSSVGLEDIGVLDDLRDVRGLFSLRSYGSAKVDT-LVVSFITETR 467
Query: 375 ILAMNLEDELEET-EIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKS 433
+ + E +EE + +G TL L+Q+T+ +V ++ S S + W
Sbjct: 468 VFRFDPEGGIEEVFDFQGLVLDRPTLVATTLPSGHLLQITATAVTVLDSESGVTLSTWAV 527
Query: 434 PPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKH-----AQLEYEISCL----D 484
P G ++ A+AN LL+ G LV L + D + + + E +ISC+ D
Sbjct: 528 PDGKTIVNASANNKWALLSISGTTLVSLNLSDNLSAQEQVLCRPIGGQEDQISCIHAARD 587
Query: 485 INPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGE----IIPRSVLLCAFEG 540
++ IG VG W SV I L L+ + E + +PR + L
Sbjct: 588 LDDIG---------VVGFWATGSVSIIHLGTLHALQGETVKQTDDSVSVPRDLALVQLHP 638
Query: 541 ISYL----LCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTF---SSKNTTHVFA 593
L A+ DG +++F ++ + L+ RK V+LG+Q L + +++FA
Sbjct: 639 PHLLGPTLFVAMEDGQVVSFNVSKEDFSLSSRKSVTLGSQQAGLHILPRPGDEGISNIFA 698
Query: 594 ASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLH 653
++ ++IYSS +++YS ++V+++ PF+S AFPD++ +A + I ID ++ H
Sbjct: 699 TTEHSSLIYSSEARVMYSAATAEDVTYIAPFDSEAFPDAIFLATNRNIRIAHIDPERRTH 758
Query: 654 IRSIPLGEHPRRICHQEQSRTFAICSLKNQ-SCAEESEMHFVRLLDDQTFEFIST-YPLD 711
+ + L E RR+ + R F I ++ + + EE +L+D+ + + LD
Sbjct: 759 VNPLQLRETVRRVAYSPALRAFGIGTIHRELTNNEEVVTSSFQLVDEIVLGVVGKPFHLD 818
Query: 712 ---TFEYGCSILSCSFSDDSNV---YYCVGTAYVLP---EENEPTKGRILVFIVE-DGKL 761
+ E S++ D + VGT+Y+ +E+ TKGRILV V+ D
Sbjct: 819 GSISAEMVESVIRAELPDSMGQPAERFIVGTSYLADPDIDESGDTKGRILVLGVDSDKNP 878
Query: 762 QLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYV 821
LI E KGA SL KL+A +++ + +Y ++ + L+ + +
Sbjct: 879 YLIVSHELKGACRSLGVMGEKLVAGLSKTVVVYDYVEESTTSGALRKLATFRPSTFPVDI 938
Query: 822 QTRGDFIVVGDLMKSISLLIY----KHEEGAIEERARDYNANWMSAVEILDDDIYLGAEN 877
G+ I + DLM+S++L+ + + + ERAR + W ++V L+ +L A+
Sbjct: 939 DVHGNMIGIADLMQSLTLVEFVPAQDGNKAKLVERARHFQYIWATSVCHLEGHSWLEADA 998
Query: 878 NFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFG 937
NL +R+N T+ +R ++EV GE+HLGE VN+ R + P+ + IP
Sbjct: 999 QGNLMVLRRNPNAPTEHDRKQMEVTGEFHLGEQVNKIRSLDIT---PNENDPIIPKAFLA 1055
Query: 938 TVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG--------GLNHEQWRSFNNEKKTV 989
TV G + V A + E L + Q L +V++ +G GL+ WR F N K+
Sbjct: 1056 TVEGSLYVFADIKSEYQSLLIQFQERLAEVVRALGQADGEPGSGLSFTTWRGFRNAKRAA 1115
Query: 990 DAK-NFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRLH 1036
+ F+DG+LIE FLDL + + + + + +VE + VEEL R+H
Sbjct: 1116 EGPFRFVDGELIERFLDLDEAKQEAVVQGLGPTVESMRNLVEELKRMH 1163
>gi|302894051|ref|XP_003045906.1| predicted protein [Nectria haematococca mpVI 77-13-4]
gi|256726833|gb|EEU40193.1| predicted protein [Nectria haematococca mpVI 77-13-4]
Length = 1162
Score = 410 bits (1055), Expect = e-111, Method: Compositional matrix adjust.
Identities = 315/1125 (28%), Positives = 549/1125 (48%), Gaps = 111/1125 (9%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSEL--ITRAMGDVSDRIGRPT 63
I+G I+ L+ RP G D LFI T+R ++ L W+ E +L I R + D+S+ R +
Sbjct: 55 IHGTISMLQRLRPKGSETDLLFIGTDRLQYFNLAWNPEMKQLDTIERVIEDLSEPYMRHS 114
Query: 64 DNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNK----GQLKEAFNIRLEELQVLDIKFLYG 119
+ ++DP R + +HL++G+ V + +L+ +RL EL + F++
Sbjct: 115 QSQNKCLVDPTARFLAMHLWEGVLNVFRLPTRKGSTNKLEILDQVRLTELFMKASTFIHS 174
Query: 120 -CAKPTIVVLYQ---DNKDARHVKTYEVALKDKDFVEG---PWSQNNLD-----NGADLL 167
PTI LY+ + ++AR + Y + DK V P LD A +L
Sbjct: 175 RTGHPTIAFLYKSQMEQEEAR-LAIYRLTHDDKGGVVSKFDPHKDRELDVVIPDPYASML 233
Query: 168 IPVP----------------PPLCGVLIIGEETIVYCSANAFKAIPIR---PSITKAYGR 208
IPVP L G+LI+GE + Y +++ P I A+
Sbjct: 234 IPVPLDEEKRYHVRNTEGAKAHLGGLLIVGETLLTYYDGLTHRSVSSSLKDPRIFVAWA- 292
Query: 209 VDADGSRYLLGDHAGLLHLLVITHEKEK----VTGLKI--ELLGE----TSIASTISYLD 258
+ DG+ Y L D G L +L I E VTG+ + LGE TS AS++ Y+
Sbjct: 293 -EYDGTHYFLADDYGRLDMLTIETNVEATGVVVTGMTLVPMKLGESPALTSRASSLVYMG 351
Query: 259 NAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQ-------- 310
N +++ S +GDSQL +++ + +A + +++ N PI+DF ++D+ +
Sbjct: 352 NNTLFVASHHGDSQLYQIDPETNA----MLLIKSLSNNAPILDFSIMDMGNREGDAQAGN 407
Query: 311 ----GQGQVVTCSGAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLRSSTDDPFDTFL 365
GQ ++V GAY+DGSLR +R+G+G+ E+ + EL G +G+++LRS D DT L
Sbjct: 408 AFSSGQSRIVAGCGAYQDGSLRSIRSGVGLEERGILDELDGTRGLFTLRSYNSDLVDT-L 466
Query: 366 VVSFISETRILAMNLEDELEET-EIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTS 424
VVS I+ETRIL+ + + +EE +G T+TL + QL+Q+T SV L+ S
Sbjct: 467 VVSSITETRILSFDTDGGIEEIYSFQGMEQDTETLLASNLPNGQLLQITPKSVVLLDPES 526
Query: 425 RELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEY------ 478
+ W P G S+ A+AN LL+ G LV L + + V+ Q E
Sbjct: 527 GVAVSRWDVPTGKSITRASANTKWALLSVDGTSLVSLNLLQNLAVNVQQTQAEPGSQQPD 586
Query: 479 EISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHL----GGEIIPRSVL 534
+ISC I+ + P I VG W+ + + + + E + +PR +
Sbjct: 587 QISC--IHAARDPP---DIGVVGWWSSGRISLIDMATFQPLHGESMRQTDDSATVPRDIA 641
Query: 535 LCAFEG--IS--YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKN-TT 589
L IS LL A+ DG ++ F +++K ++ RK V+LG+ P L + T+
Sbjct: 642 LVQLHPPEISGPTLLVAMEDGTVVTFNVSIKGFAVSGRKSVTLGSNPARLHVLPQDDGTS 701
Query: 590 HVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDI 649
VF ++ ++IYSS ++++S + + + PF+S AFPDS+ ++ + + I +D
Sbjct: 702 SVFVTTEHASLIYSSEGRIIFSATTADDATFVAPFDSHAFPDSIVLSTDSHIRICHVDKE 761
Query: 650 QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQ-SCAEESEMHFVRLLDDQTFEFISTY 708
+ H++++P+ E RR+ + + F + S+K + EE RL+D+ F+ +
Sbjct: 762 RLTHVKALPVHETVRRVAYSPGLKAFGLGSIKKELVLNEEVVTSSFRLVDEIVFKELGEP 821
Query: 709 PLDTFEYGCSILSCSFSDD-----SNV--YYCVGTAYV---LPEENEPTKGRILVFIVED 758
L I+ C + N+ + VGT+++ +E T GRILV V++
Sbjct: 822 FLLNASSTLEIVECVIRAELPDAGGNLTERFIVGTSFINDGQVQEANGTLGRILVLGVDE 881
Query: 759 GK-LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHIL 817
+ + I KG L + ++A +++ + +Y + + L+ + L
Sbjct: 882 HRQVYQIVSHNLKGPCRCLGMMDDYIVAGLSKTVVVYNYSQDTSSSGSLEKLAAYRPAAL 941
Query: 818 ALYVQTRGDFIVVGDLMKSISLLIY----KHEEGAIEERARDYNANWMSAVEILDDDIYL 873
+ + G+ I VGDLM+S+SL+ + + +EERAR Y W +++ LD++ +L
Sbjct: 942 PVDLDISGNMIGVGDLMQSLSLVEFIPAQDGRKAKLEERARHYEPIWTTSLCHLDEERWL 1001
Query: 874 GAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQI-P 932
A++ NL +++N++ T+++R RLEV E +GE +NR R + +P D + P
Sbjct: 1002 EADSQGNLIVLQRNADAPTEQDRSRLEVTSEIGIGEQINRIRK----LHVPAGDNSIVHP 1057
Query: 933 TVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAK 992
+ G + + + + L Q+ + + I G + + WRSF NE + D
Sbjct: 1058 RAFLASAEGSLYLYGDIAPQYQDLLMTFQSKMEEYIHAPGNIEFKLWRSFRNENRESDGP 1117
Query: 993 -NFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRLH 1036
F+DG+++E FLD+ + + + + + SVE++ +EEL R+H
Sbjct: 1118 YRFIDGEMVERFLDMDEGKQELVCEGLGPSVEDMRNLIEELRRMH 1162
>gi|358400469|gb|EHK49795.1| hypothetical protein TRIATDRAFT_146031 [Trichoderma atroviride IMI
206040]
Length = 1161
Score = 410 bits (1054), Expect = e-111, Method: Compositional matrix adjust.
Identities = 306/1123 (27%), Positives = 550/1123 (48%), Gaps = 108/1123 (9%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSEL--ITRAMGDVSDRIGRPT 63
I+G I L+ +P A D LFI T+R ++ + W+ E+++L + +A+ D ++ R +
Sbjct: 55 IHGTIDMLQRLQPKDSATDLLFIGTDRLQYFNVAWNPETNQLDAVEQAIEDTAEPYMRQS 114
Query: 64 DNGQIGIIDPDCRLIGLHLYDGLFKVIPFD-NKG---QLKEAFNIRLEELQVLDIKFLYG 119
+ ++DP + + +HL++G+ V KG +L+ +RL EL + FL+
Sbjct: 115 QSQNRCLVDPTGKFMAMHLWEGVLNVFRLRIRKGSTTRLEGLDQVRLTELWMKTSTFLHS 174
Query: 120 -CAKPTIVVLYQ---DNKDARHVKTYEVALKDKDFVEG---PWSQNNLDNG-----ADLL 167
P I LY+ D ++AR + Y + DK V P LD A +L
Sbjct: 175 RTGHPRIAFLYKNQLDREEAR-IAVYRLTEDDKGGVSSKFDPQKDRELDQVIPDPYASML 233
Query: 168 IPVP----------------PPLCGVLIIGEETIVYCSANAFKAIPIR---PSITKAYGR 208
IPVP L G+L++GE + Y + + + P I A+
Sbjct: 234 IPVPVYEEKRYHVRNNEGARAHLGGLLVVGETLLTYFDSLTYSNVCSTLDDPKIYVAWA- 292
Query: 209 VDADGSRYLLGDHAGLLHLLVITHEKEK----VTGLKIELLG------ETSIASTISYLD 258
+ DG+RY L D G L LL I E VTG+ + + TS AS++ Y+
Sbjct: 293 -ECDGTRYFLADDYGRLDLLEIKTTNESTGVVVTGMTVHPMAFGDSSRYTSRASSLVYMG 351
Query: 259 NAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQ-------- 310
N +++IGS +GDSQL+ + D + + +++ N PI+DF ++DL +
Sbjct: 352 NNLLFIGSHHGDSQLLHV----DIESQQMTLIKVISNNAPIMDFTIMDLGNREGDAQSGN 407
Query: 311 ----GQGQVVTCSGAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLRSSTDDPFDTFL 365
GQ ++V SGAY+DGSLR +R+G+G+ ++ + E++G +G+++LRS DT +
Sbjct: 408 TFSSGQARIVAGSGAYQDGSLRSIRSGVGLEDRGLLDEIEGTRGLFTLRSVDSAKADT-V 466
Query: 366 VVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSR 425
V+S ++ TR+L ++ E +G ++ TL + Q++Q+T +V L+ S
Sbjct: 467 VISTLAGTRVLRFEPDNIEELFSFQGMDLESPTLLAANLPSGQILQITPQAVNLLDPDSG 526
Query: 426 ELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEY------E 479
+ W++P G ++ A+AN LL+ G LV L + D + K A + +
Sbjct: 527 VSVSSWQAPEGKTITAASANTKWALLSVDGSILVSLNLLDNLKATTKDASQDSVSGRPDQ 586
Query: 480 ISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHL----GGEIIPRSVLL 535
ISC+ +P +G WT ++ + + L + E L +PR V L
Sbjct: 587 ISCIHA---ARDPQ--DFGVIGWWTSGTISVVDMATLTPLHGEPLRQTDDSASVPRDVAL 641
Query: 536 CAF----EGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKN-TTH 590
+L AL DG++++F +++K ++ RK V+LG+ P L ++ +
Sbjct: 642 VQLHPPESSGPTMLVALEDGNVISFNVSVKGFAISGRKTVTLGSGPARLHVLPREDGICN 701
Query: 591 VFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQ 650
VFA ++ ++IYSS +++YS + + + PF+S AFPDS+ ++ E + I +D+ +
Sbjct: 702 VFATTEHASLIYSSQGRIVYSATTAADATFVAPFDSEAFPDSIVLSTEDHIRICHVDNER 761
Query: 651 KLHIRSIPLGEHPRRICHQEQSRTFAI-CSLKNQSCAEESEMHFVRLLDDQTFEFISTYP 709
H++++P+ E RR+ + R F + C K EE V+L+D+ F+ + T
Sbjct: 762 LTHVKALPMYETVRRVAYSPGLRAFGLGCIKKELVNNEEVITSTVKLVDEIIFQELGTPF 821
Query: 710 LDTFEYGCSILSCSFS---DDSN----VYYCVGTAYVLPE---ENEPTKGRILVFIVEDG 759
L ++ C D+N + VGT++V + E+ TKGRILV V++
Sbjct: 822 LLNASSSLELVECVIRAELPDANGNLAERFLVGTSFVADQGTAESGETKGRILVLGVDES 881
Query: 760 K-LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILA 818
+ L I KG L N ++A + + + Y + + L
Sbjct: 882 RQLYQITSHSLKGVCRCLVMMNNYIVAGLTKTVVAYSYNQDTSTSASLTKLASFRPASFP 941
Query: 819 LYVQTRGDFIVVGDLMKSISLLIYKH----EEGAIEERARDYNANWMSAVEILDDDIYLG 874
+ + G+ I +GDLM+S++L+ + ++ +EE+AR Y W +++ LDD +L
Sbjct: 942 VDLDVSGNIIGIGDLMQSLTLVEFTPPQDGKKAKLEEKARHYQQAWTTSISALDDSRWLE 1001
Query: 875 AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTV 934
A+ N+ +R+N E T++++ +L+V+ E ++GE +NR R + P + +P
Sbjct: 1002 ADAQGNVIVLRQNLEAPTEQDQSQLQVISELNIGEQINRIRKLQVA---PGENAIVVPKA 1058
Query: 935 IFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAK-N 993
G+ G + + + + L Q+ L++ I G L+ + WR+F N+ + +A
Sbjct: 1059 FLGSTEGTLYLYGDIAPKYQDLLMTFQSRLQEYISTPGNLSFDLWRAFRNQSREGEAPFR 1118
Query: 994 FLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRLH 1036
F+DG++IE FLDL + + + + + SVE++ +EEL R+H
Sbjct: 1119 FVDGEMIERFLDLDEGKQELVCEGLGPSVEDMRNLIEELRRMH 1161
>gi|342885673|gb|EGU85655.1| hypothetical protein FOXB_03801 [Fusarium oxysporum Fo5176]
Length = 1160
Score = 410 bits (1053), Expect = e-111, Method: Compositional matrix adjust.
Identities = 308/1118 (27%), Positives = 552/1118 (49%), Gaps = 99/1118 (8%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSEL--ITRAMGDVSDRIGRPT 63
I+G IA L+ RP G D LFI T+R + L W+ + +L I R + D+++ R +
Sbjct: 55 IHGSIAMLQCLRPKGSETDLLFIGTDRLHYFNLVWNPLTKQLETIERVIEDLAEPYMRHS 114
Query: 64 DNGQIGIIDPDCRLIGLHLYDGLFKVIPFD-NKG---QLKEAFNIRLEELQVLDIKFLYG 119
+ ++DP R + +HL++G+ V KG +L+ +RL EL + F++
Sbjct: 115 SSQNKCLVDPTGRFLAMHLWEGVLNVFKLPIRKGSTNKLERLDQVRLTELFMKASTFIHS 174
Query: 120 -CAKPTIVVLYQ---DNKDARHVKTYEVALKDKDFVEG---PWSQNNLD-----NGADLL 167
PTI LY+ + ++AR V Y + DK P LD A +L
Sbjct: 175 RTGHPTIAFLYKTQLEQEEARLV-IYRLTHDDKGNTVSKFDPHKDRELDVVIPDPYASML 233
Query: 168 IPVP----------------PPLCGVLIIGEETIVY---CSANAFKAIPIRPSITKAYGR 208
IPVP L G+L+IGE + Y + + ++ P I ++
Sbjct: 234 IPVPLDEEKRYHVRNTEGAKAHLGGLLVIGETLLTYFDGLTHRSVSSVLQDPRIFVSWA- 292
Query: 209 VDADGSRYLLGDHAGLLHLLVITHEKEK----VTGLKIELL------GETSIASTISYLD 258
+ DG+ YLL D G L LL I E VTG+ +E L TS AS + YL
Sbjct: 293 -EYDGTHYLLADDYGRLDLLTIDTNLETTGVVVTGMTLEPLKIGRSPAITSRASNLVYLG 351
Query: 259 NAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQ-------- 310
++ +++ S +GDSQL ++ D + + V +++ + N PI+DF ++D+ +
Sbjct: 352 DSTLFVASHHGDSQLYQI----DVESATVTLVQSFSNNAPILDFSIMDMGNREGDAQAGN 407
Query: 311 ----GQGQVVTCSGAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLRSSTDDPFDTFL 365
GQ ++V GAY+DGSLR +R+G+G+ ++ + EL+G +G+++LRS D DT L
Sbjct: 408 AFSSGQSRIVAGCGAYRDGSLRSIRSGVGLEDRGVLDELEGTRGLFTLRSYGSDLVDT-L 466
Query: 366 VVSFISETRILAMNLEDELEET-EIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTS 424
VVS I+ETR+L+ + E +EE +G T+TL + QL+Q+T SV L+
Sbjct: 467 VVSAITETRVLSFDREGGIEEIYSFQGMSLDTETLLASNLPNGQLLQITPRSVVLLDPEG 526
Query: 425 RELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLD 484
+ ++W P G S+ A+AN+ LL+ G LV L + + V+ +Q
Sbjct: 527 GTVTSKWDVPSGKSITRASANSKWALLSVDGTSLVSLNLLQNLAVNVQQSQNNSGSQADQ 586
Query: 485 INPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHL----GGEIIPRSVLLCAFEG 540
I+ I + VG W+ + + + L + E + +PR + L
Sbjct: 587 ISCIHAARDPPDLGVVGWWSSGQISLIDMASLKPLHGESMRQTEDSATVPRDIALVQLHP 646
Query: 541 --IS--YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKN-TTHVFAAS 595
IS LL A+ DG+++ F ++ K ++ RK V+LG+ P L ++ T++VF +
Sbjct: 647 PEISGPTLLVAMEDGNVVTFNVSTKDFAVSGRKSVTLGSNPARLHILPQQDGTSNVFVTT 706
Query: 596 DRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIR 655
+ ++IYS+ ++++S + + + PF+S AFPDS+ ++ + + I +D + H++
Sbjct: 707 EHASLIYSAEGRVIFSATTADDATFVAPFDSHAFPDSVILSTDQHIRICHVDKERLTHVK 766
Query: 656 SIPLGEHPRRICHQEQSRTFAICSLKNQSCA-EESEMHFVRLLDDQTF-EFISTYPLD-- 711
++P+ E RR+ + + F + S+K + EE RL+D+ F E S +PLD
Sbjct: 767 ALPVNETVRRVAYSPGLKAFGLGSIKKELVGNEEVVSSSFRLVDEIVFKELGSPFPLDAS 826
Query: 712 -TFEYGCSILSCSFSDDSNVY---YCVGTAYVLPEENEP--TKGRILVFIVEDGK-LQLI 764
+ E ++ D + + VGT+++ +P T GRILV V+ + + I
Sbjct: 827 SSLEIVECVIRAELPDVGGNHVERFIVGTSFISDGVEDPNGTGGRILVLGVDSNRQVYQI 886
Query: 765 AEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTR 824
KG L + ++A +++ + Y ++ + LQ + L + +
Sbjct: 887 VSHNLKGPCRCLGMIDDNIIAGLSKTVVAYSFLQETSSSGSLQKLAVYRPAALPVDLDIS 946
Query: 825 GDFIVVGDLMKSISLLIY----KHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFN 880
G+ I V DLM+S+SL+ + + +EERAR + W ++V ++ + +L A++ N
Sbjct: 947 GNMIGVVDLMQSLSLVEFIPAQDGNKAKLEERARHFEPLWATSVCHIEGERWLEADSKGN 1006
Query: 881 LFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQI-PTVIFGTV 939
L +++N + T+++R RLE+ E ++GE +NR R + +P ++ G + P +
Sbjct: 1007 LVVLQRNVDAPTEQDRSRLEITSEMNIGEQINRIRK----LHVPMAENGIVHPRAFLASA 1062
Query: 940 NGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAK-NFLDGD 998
G + + + + L Q+ + + I G + + WRSF NE + + F+DG+
Sbjct: 1063 EGSLYLYGDIAPQYQDLLMTFQSKMEEYIHVPGSVEFKLWRSFRNENRESEGPFRFIDGE 1122
Query: 999 LIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRLH 1036
++E FLD+ + + + + + S+E++ +EEL R+H
Sbjct: 1123 MVERFLDMDEGKQELVCEGLGPSIEDMRNLIEELRRMH 1160
>gi|19114492|ref|NP_593580.1| damaged DNA binding protein Ddb1 [Schizosaccharomyces pombe 972h-]
gi|46395602|sp|O13807.1|DDB1_SCHPO RecName: Full=DNA damage-binding protein 1; AltName:
Full=Damage-specific DNA-binding protein 1
gi|2330717|emb|CAB11219.1| damaged DNA binding protein Ddb1 [Schizosaccharomyces pombe]
Length = 1072
Score = 410 bits (1053), Expect = e-111, Method: Compositional matrix adjust.
Identities = 295/1072 (27%), Positives = 549/1072 (51%), Gaps = 96/1072 (8%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITR-AMGDVSDRIGRPTD 64
I+ +I ++ F+P D + +AT+ +++ L WDA + + + D S+R R +
Sbjct: 55 IFAKIVNVKAFKPVSSPTDHIIVATDSFRYFTLFWDANDNTVSNGIKIQDCSERSLRESQ 114
Query: 65 NGQIGIIDPDCRLIGLHLYDGLFKVIPF----------DNKGQLKEAFNIRLEELQVLDI 114
+G + ++DP R+I LH+Y GL +IP N L + F++R++EL V+DI
Sbjct: 115 SGPLLLVDPFQRVICLHVYQGLLTIIPIFKSKKRFMTSHNNPSLHDNFSVRIQELNVVDI 174
Query: 115 KFLYGCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPL 174
LY ++P++ VLY+D+K H+ TY++ +++++ E ++++ G +P
Sbjct: 175 AMLYNSSRPSLAVLYKDSKSIVHLSTYKINVREQEIDEDDVVCHDIEEGK----LIPSEN 230
Query: 175 CGVLIIGEETIVYCSAN--------AFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLH 226
GV + GE + Y S + + PSI+ D S Y++ D +G+L+
Sbjct: 231 GGVFVFGEMYVYYISKDIQVSKLLLTYPITAFSPSISND-PETGLDSSIYIVADESGMLY 289
Query: 227 LLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSY 286
E V+ +++E LGE+SIAS + L + +++GS + +S L++L
Sbjct: 290 KFKALFTDETVS-MELEKLGESSIASCLIALPDNHLFVGSHFNNSVLLQLP-SITKNNHK 347
Query: 287 VEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG 346
+E+L+ +VN+ PI DF + D Q ++TCSGAYKDG+LRI+RN I I A +E++G
Sbjct: 348 LEILQNFVNIAPISDFIIDD--DQTGSSIITCSGAYKDGTLRIIRNSINIENVALIEMEG 405
Query: 347 IKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDA-- 404
IK +S+ + +D ++ +S I ETR + ++ EG S L C ++
Sbjct: 406 IKDFFSVSFRAN--YDNYIFLSLICETRAIIVSP---------EGVFSANHDLSCEESTI 454
Query: 405 ----IY--NQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHL 458
IY +Q++Q+T+ +RL + W SP S+ ++ A V +A GG +
Sbjct: 455 FVSTIYGNSQILQITTKEIRLFDGKK---LHSWISP--MSITCGSSFADNVCVAVAGGLI 509
Query: 459 VYLEIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWT-DISVRIFSLPDLN 517
++ E GI TEV Q + E+S L EN Y VG+W+ DI + + ++
Sbjct: 510 LFFE---GI-TEVGRYQCDTEVSSLCFTE--ENVVY-----VGLWSADIIMLTYCQDGIS 558
Query: 518 LITKEHLGGEIIPRSVLLCAFEGIS--YLLCALGDGHLLNFLLNMKTGELTDR--KKVSL 573
L L IPRS++ G L + +G++L F N + G++ + ++ L
Sbjct: 559 LTHSLKLTD--IPRSIVYSQKYGDDGGTLYVSTNNGYVLMF--NFQNGQVIEHSLRRNQL 614
Query: 574 GTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSL 633
G PI L+ F SK +FA ++P ++Y + KL+ + ++ E+ ++ + + + ++
Sbjct: 615 GVAPIILKHFDSKEKNAIFALGEKPQLMYYESDKLVITPLSCTEMLNISSYVNPSLGVNM 674
Query: 634 AIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCA--EESEM 691
+++ + +I+ L+++++ + PRRIC F +C +S E+ +
Sbjct: 675 LYCTNSYISLAKMSEIRSLNVQTVSVKGFPRRICSNSLFY-FVLCMQLEESIGTQEQRLL 733
Query: 692 HFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRI 751
F+R+ + T I+ + + +E SI+ +DD V VGT + P+++ P GR+
Sbjct: 734 SFLRVYEKNTLSEIAHHKFNEYEMVESII--LMNDDKRV--VVGTGFNFPDQDAPDSGRL 789
Query: 752 LVF-IVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSEC 810
+VF + D +++ AE + +G+V +L + ++A IN + ++++ + GT +++
Sbjct: 790 MVFEMTSDNNIEMQAEHKVQGSVNTLVLYKHLIVAGINASVCIFEY---EHGTMHVRNSI 846
Query: 811 GHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDD 870
+ + + V D I+ DLMKSI++L + ++ + E ARDY+ W ++VEIL +
Sbjct: 847 RTPTYTIDISVNQ--DEIIAADLMKSITVLQFIDDQ--LIEVARDYHPLWATSVEILSER 902
Query: 871 IYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQ 930
Y E + N + +++ +R +L +++LGE +N+ RH + + P
Sbjct: 903 KYFVTEADGNAVILLRDNVSPQLSDRKKLRWYKKFYLGELINKTRHCTFIE--PQDKSLV 960
Query: 931 IPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVD 990
P ++ TV+G + ++ L +LQ N+RKVI GGL+H++W+ + E +T
Sbjct: 961 TPQLLCATVDGSLMIVGDAGMSNTPLLLQLQDNIRKVIPSFGGLSHKEWKEYRGENET-S 1019
Query: 991 AKNFLDGDLIESFLDLSRTRMDEI------SKTMNVSVEELCKRVEELTRLH 1036
+ +DG LIES L L ++EI +++SV++L +E L +LH
Sbjct: 1020 PSDLIDGSLIESILGLREPILNEIVNGGHEGTKLDISVQDLKSIIENLEKLH 1071
>gi|327301962|ref|XP_003235673.1| UV-damaged DNA binding protein [Trichophyton rubrum CBS 118892]
gi|326461015|gb|EGD86468.1| UV-damaged DNA binding protein [Trichophyton rubrum CBS 118892]
Length = 1147
Score = 407 bits (1047), Expect = e-110, Method: Compositional matrix adjust.
Identities = 325/1119 (29%), Positives = 540/1119 (48%), Gaps = 114/1119 (10%)
Query: 6 IYGRIATLELFRPHGEAQ-----DFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRI 59
IYGR+ L+ P A D LF+ T++Y + + WD +L T R D++D
Sbjct: 55 IYGRVTLLKKL-PRSTAAGLALTDALFVGTDQYAYFSITWDPVHEQLRTERKYIDLADGS 113
Query: 60 GRPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKG---------------------QL 98
R + IDP + L +Y+G+ + P QL
Sbjct: 114 LREAHSDDRCQIDPSGSFLTLEVYEGVVSIFPLVTADSHKRAKSAVSASAFSASTSVEQL 173
Query: 99 KEAFNIRLEELQVLDIKFLYGCAK--PTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWS 156
E +R+EEL V FL A P +LY+D + +K ++ P S
Sbjct: 174 GEPLQVRIEELMVRSSAFLDQEASNTPRFALLYEDTQGKVRLKLRDLKYTHAIITSDPGS 233
Query: 157 QNNL----------DNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIRP----SI 202
L D GA +LIPVP PL G+LIIGE +I Y + + I RP ++
Sbjct: 234 AAELKDVTTLSDEVDLGASILIPVPRPLGGLLIIGESSIKYVDVSRNETIS-RPLAESTV 292
Query: 203 TKAYGRVDADGSRYLLGDHAG-LLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAV 261
A+ +VD G R+LL D G L L+++ V K++ LG TS AS + YLD +
Sbjct: 293 FVAWEQVD--GQRWLLADDYGRLFFLMLVLDADNAVESWKVDFLGVTSRASVLVYLDGGI 350
Query: 262 VYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQ---------GQ 312
V++GS GDSQ+I++ +G + ++++ N+ PI+DF ++D+ + GQ
Sbjct: 351 VFVGSHQGDSQVIQIK-----EGGF-DLVQTISNIAPILDFTIMDMGDRSGATREFSSGQ 404
Query: 313 GQVVTCSGAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLRSSTDDPFDTFLVVSFIS 371
++VT SGA+ DGSLR VR+G+GI E + ++ I +W+LRS+ +PF L+VSF++
Sbjct: 405 TRIVTGSGAFGDGSLRSVRSGVGIEELGVLASMEHITDLWALRSACPEPFFDTLLVSFVN 464
Query: 372 ETRILAMNLEDELEETE--IEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRN 429
ETR+ + + ++EE E G TL + N+++QVT R + S +
Sbjct: 465 ETRVFHFSADGDVEEKEDGFLGLVFSESTLLATNIPGNRILQVTESISRAIDVDSGMII- 523
Query: 430 EWKSP-PGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDI--N 486
W+S +++ A+AN ++L GG LV + + L + + + ++S + I +
Sbjct: 524 -WRSSYEEFTITSASANDDYLVLVLGGTRLVCISLSTFELVGSRDFEADNQVSGMTIPAS 582
Query: 487 PIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLG--GEIIPRSVLLCAF--EGIS 542
PI Q V + + I LP L + K+ LG GE IPRSV++
Sbjct: 583 PI-------QACIVCLPQSAEIIILDLPGLEVKNKQTLGEPGEAIPRSVIVAEILPNKPP 635
Query: 543 YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTH-VFAASDRPTVI 601
L ++ DG + +F ++ + + K++LG++ + + N + VFA D P++I
Sbjct: 636 TLFVSMADGTVFSFSFDVHAFAIFNSSKITLGSEQPSFKKLPRGNGQYNVFATCDHPSLI 695
Query: 602 YSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGE 661
++S +++YS V+ S +C N+ A+P S+A++ + EL I +D+ + I ++P+
Sbjct: 696 HASEGRIVYSAVDSASASRICSLNTQAYPGSIALSSQHELKIAIVDEERTTQIHTLPMHA 755
Query: 662 HPRRICHQEQSRTFAICSLKNQ--SCAEESEMHFVRLLDDQTFEFISTYPL--------- 710
RR+ + + F + ++K + + EE FV L D+ F +STY L
Sbjct: 756 SVRRLAYSPTEKAFGLGTVKRKISNGVEEVSSSFV-LADEILFRPLSTYDLRPDELVECV 814
Query: 711 --DTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGK-LQLIAEK 767
+G + S S D + VGTA++ ++ +GRILVF V + L LI EK
Sbjct: 815 IRSQLNHGKDEVGNSISKD---LFFVGTAFLDDVGDDHIRGRILVFEVNRSRELSLIVEK 871
Query: 768 ETKGAVYSLNAFNGKLL-AAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGD 826
GA +L + LL A + + + ++K G L+ + + + GD
Sbjct: 872 SLMGACRTLAVMDHTLLVAGLVKSVSVFKLARDRFGNILLEKHTAYRTSTAPIDISVVGD 931
Query: 827 FIVVGDLMKSISLLIY-KHEEG----AIEERARDYNANWMSAVEILDDDIYLGAENNFNL 881
+ V D+MKS+SL+ Y + EEG EE AR Y W +AV +++++YL A+ NL
Sbjct: 932 TVAVADVMKSMSLVQYTQAEEGEREPKFEEIARHYQTLWSTAVAPIEENVYLLADAEGNL 991
Query: 882 FTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNG 941
+++N G T+ +R RL+ E LGE VNR H ++ ++ V + TV+G
Sbjct: 992 VVLQQNITGVTESDRKRLQPTSEIRLGEMVNRI-HPIVIQTYTETAVSA--RALLATVDG 1048
Query: 942 VIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAK-NFLDGDLI 1000
I + + L +LQT + + G + ++R+F D F+DG+LI
Sbjct: 1049 SIYLFGLINPTYIDLLLRLQTAMGSITISPGEIPFSKYRAFRTTVHQSDEPFRFVDGELI 1108
Query: 1001 ESFLDLSRTRMDEISKTMN---VSVEELCKRVEELTRLH 1036
E FL + +EI ++ V+V L + +EEL R+H
Sbjct: 1109 ERFLSCTPGMQEEIVSRLDDSTVTVSSLKEMIEELRRMH 1147
>gi|295667673|ref|XP_002794386.1| DNA damage-binding protein 1a [Paracoccidioides sp. 'lutzii' Pb01]
gi|226286492|gb|EEH42058.1| DNA damage-binding protein 1a [Paracoccidioides sp. 'lutzii' Pb01]
Length = 1195
Score = 405 bits (1041), Expect = e-110, Method: Compositional matrix adjust.
Identities = 326/1131 (28%), Positives = 534/1131 (47%), Gaps = 131/1131 (11%)
Query: 15 LFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPTDNGQIGIIDP 73
LF P D LFI T+R + + W+ +S+L T R D++D R + G +IDP
Sbjct: 87 LFLPQ---TDVLFIGTDRATYFTVSWNPVTSQLRTERKYVDLADPSSRESQLGDRCLIDP 143
Query: 74 DCRLIGLHLYDGLFKVIPFDNKG-----------------------------------QL 98
+ I L LY+G+ VIP +L
Sbjct: 144 SGKFITLELYEGIITVIPIGQPQRTARQGGRKYGKRAVTAQNQHDSSHIGNTNAAGEVEL 203
Query: 99 KEAFNIRLEELQVLDIKFLYGCAK--PTIVVLYQDNKDARHVKTYEV------------- 143
E R++EL V FL+ A P + LY+D +K E+
Sbjct: 204 GEPCQARVDELLVRSSAFLHTQADMLPRMAFLYEDTMGQVRLKVRELEFTYGGMGIGTGG 263
Query: 144 ----------ALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYC--SAN 191
LK D + + L+ GA LIPVP PL G+L++GE +I Y + N
Sbjct: 264 GAGQDTGCIAVLKALDLL-----KEELEMGASFLIPVPAPLGGLLVLGETSIRYLDDATN 318
Query: 192 AFKAIPI-RPSITKAYGRVDADGSRYLLGDHAG-LLHLLVITHEKEKVTGLKIELLGETS 249
++P+ +I A+ +VD G R+LL D G L L++I E V K++LLG
Sbjct: 319 ECISLPLDEATIFVAWEQVD--GQRWLLADDYGRLFFLMLILDEDNAVQSWKLDLLGNIP 376
Query: 250 IASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDL-- 307
AS + YL V +IGS GDSQLI++ +GS+ EV++ + N+ PI+DF ++DL
Sbjct: 377 RASVLVYLGGGVTFIGSHQGDSQLIRIT-----EGSF-EVIQTFSNIAPILDFTIMDLGG 430
Query: 308 ---ERQ------GQGQVVTCSGAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLRSST 357
E Q GQ ++VT SGA+ DGSLR VR+G+G+ E + ++ I +W+LR +
Sbjct: 431 RAGENQTHDFSSGQARIVTGSGAFDDGSLRSVRSGVGMEEVGVLGAMEHITDLWALRVAC 490
Query: 358 DDPFDTFLVVSFISETRILAMNLEDELEET-EIEGFCSQTQTLFCHDAIYNQLVQVTSGS 416
+ F L+VSF+ E+R+ + E+EE E G TL + +++QVT +
Sbjct: 491 QEGFSDTLLVSFVDESRVFHFTQDGEVEEKDEFMGLGLAESTLLAANLPNGRILQVTERN 550
Query: 417 VRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTEVKHAQ 475
VR+ L W P ++ A++N ++L GG L+ +I G+ L K
Sbjct: 551 VRVADLDDGMLLWNWSPPSQKAITAASSNDDHLVLVVGGQVLMCFDIQGEIKLAGKKDFG 610
Query: 476 LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLG--GEIIPRSV 533
+ ++S + + +P+ I + +++V SL D+ + +G GE PRSV
Sbjct: 611 DDTQVSGVTV---ASSPATYCILCLPQTAEVAV--MSLADMTIRHSTSVGEPGEAFPRSV 665
Query: 534 LLCAF--EGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGT-QPITLRTFSSKNTTH 590
L+ + L ++ DG + +F N LT K+ LG+ QP + +
Sbjct: 666 LVAEVLPNQPATLFVSMADGGVFSFSFNADEFTLTKMSKLVLGSEQPSFKKLPRGDGLYN 725
Query: 591 VFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQ 650
+FA ++P++IY++ +++YS V+ + S +C FNS A+P S+A+A EL I +D +
Sbjct: 726 IFATCEQPSLIYATEGRIIYSAVHSDQASRICHFNSEAYPGSIALATPTELKIAHVDSER 785
Query: 651 KLHIRSIPLGEHPRRICHQEQSRTFAICSLKN--QSCAEESEMHFVRLLDDQTFEFISTY 708
I ++ +GE RR+ + + F I ++K ++ AE F+ L D+ F + Y
Sbjct: 786 TTQIHTLEIGETVRRVAYSAAEKAFGIGTIKRTLENGAEVITSRFM-LADEIMFRELDDY 844
Query: 709 PLDTFEYGCSILSCSF-----SDDSNVY---YCVGTAYVLPEENEPTKGRILVFIVEDGK 760
L E S++ F SD + + + VGT+Y+ +GRIL F V +
Sbjct: 845 SLRPDELVESVIQAQFPEGKDSDGNESFKDMFVVGTSYLDDVGEGSIRGRILAFEVTGSR 904
Query: 761 -LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILAL 819
L +AE KGA +L K++AA+ + + +Y + L + +
Sbjct: 905 QLAKVAELPVKGACRALAVMQDKIVAALMKTVVIYSIAKGELSDYTLNKTASYRTSTAPI 964
Query: 820 YVQTRGDFIVVGDLMKSISLLIYKHEEG----AIEERARDYNANWMSAVEILDDDIYLGA 875
+ G+ I V DLMKS+S++ +K E ++ E AR + W +AV + ++++L +
Sbjct: 965 DIAVTGNLIAVADLMKSVSIIEFKQGENDQPDSLTEVARHFQTLWSTAVAPIAENMFLES 1024
Query: 876 ENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQI---P 932
+ NL + +N G TD+++ RLEV E LGE VNR R S+ LP + + P
Sbjct: 1025 DAEGNLVVLNQNVNGVTDDDKRRLEVTSEILLGEMVNRIRPVSIQGSLPATGPREAVISP 1084
Query: 933 TVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNE-KKTVDA 991
GTV G I + + L +LQ+ + ++ G + ++R+F N ++ +
Sbjct: 1085 KAFLGTVEGSIYLFGLINPAYQDLLMRLQSAMAGLVVTPGAMPFNKFRAFKNAVRQAEEP 1144
Query: 992 KNFLDGDLIESFLDLSRTRMDEI------SKTMNVSVEELCKRVEELTRLH 1036
F+DG+LIE F +EI V+VE+ + VEEL R+H
Sbjct: 1145 YRFVDGELIERFWTCEAALQEEIVGLVVAGGVAGVTVEKTKRIVEELRRMH 1195
>gi|46121747|ref|XP_385428.1| hypothetical protein FG05252.1 [Gibberella zeae PH-1]
Length = 1161
Score = 404 bits (1039), Expect = e-109, Method: Compositional matrix adjust.
Identities = 302/1123 (26%), Positives = 548/1123 (48%), Gaps = 108/1123 (9%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSEL--ITRAMGDVSDRIGRPT 63
I+G I+ L+ RP G D LFI T+R ++ L W+ + +L I R + D+S+ R +
Sbjct: 55 IHGTISMLQRLRPKGSETDLLFIGTDRLQYFNLIWNDATKQLETIERVIEDLSEPYMRQS 114
Query: 64 DNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNK----GQLKEAFNIRLEELQVLDIKFLYG 119
+ ++DP R + +HL++G+ V + +L+ +RL EL + F Y
Sbjct: 115 QSQNKCLVDPTGRFLAMHLWEGVLNVFKLPTRKGSTNKLEVLDQVRLTELFMKASTFTYS 174
Query: 120 CAK-PTIVVLYQDNKDARHVK--TYEVALKDKDFVEG---PWSQNNLD-----NGADLLI 168
PTI LY+ D + Y + DK P LD A +LI
Sbjct: 175 YTNHPTIAFLYKTQMDQEETRLVIYRLTHDDKGNTVSKFDPHKDRELDIEIPDPYASMLI 234
Query: 169 PVP----------------PPLCGVLIIGEETIVYCSANAFKAIPI---RPSITKAYGRV 209
PVP P L G++++GE + Y +++ P I A+
Sbjct: 235 PVPLDEEKRYNVRNNQGAKPHLGGLVVVGETLLTYYDGQTHRSVSAGLKDPRIFVAWAEY 294
Query: 210 DADGSRYLLGDHAGLLHLLVITHEKEK----VTGLKIELL------GETSIASTISYLDN 259
D+ +RYLL D G L LL I E VTG+ +E TS AS++ YL N
Sbjct: 295 DS--TRYLLADDYGRLDLLTIRTNLEPTGVVVTGMTVEPFEFDNSPAITSRASSLVYLGN 352
Query: 260 AVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQ--------- 310
+++ S +GDSQL ++N+ D K + + ++ + N PI+DF ++D+ +
Sbjct: 353 GNLFVASHHGDSQLYQINI--DTKAAML--VQSFSNNAPILDFSIMDMGNREGDTQSGNV 408
Query: 311 ---GQGQVVTCSGAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLRSSTDDPFDTFLV 366
GQ ++V GAY+DG+LR +R+G+G+ ++ + EL+G +G+++L S + DT LV
Sbjct: 409 FSSGQSRIVAGCGAYRDGTLRSIRSGVGLEDRGVLDELEGTRGLFTLHSYGSEMVDT-LV 467
Query: 367 VSFISETRILAMNLEDELEET-EIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSR 425
VS I+ETRIL+ + E +EE +G ++TL + QL+Q+T SV L+
Sbjct: 468 VSSITETRILSFDFEGGIEEIYSFQGMSLDSETLLASNLPSGQLLQITPKSVVLLDPEGG 527
Query: 426 ELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEY-----EI 480
++W P G ++ A+AN+ LL+ G LV L + + + + +I
Sbjct: 528 TTISKWDVPDGKTITRASANSKWALLSVDGTSLVSLNLLQNLAVNAQQINNDSTSQPDQI 587
Query: 481 SCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHL----GGEIIPRSVLLC 536
SC I+ ++P + VG W+ + + + L + E + +PR V L
Sbjct: 588 SC--IHAARDSP---DLGVVGWWSSGQISLIDMASLKPLHGESMRQTEDSATVPRDVALV 642
Query: 537 AFE--GIS--YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKN-TTHV 591
IS LL A+ DG+++ F ++ K ++ RK V+LG+ P L ++ T++V
Sbjct: 643 QLHPPDISGPTLLVAMEDGNIVTFNVSTKGFSVSGRKSVTLGSNPARLHILPQEDGTSNV 702
Query: 592 FAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQK 651
F ++ ++IYSS ++++S + + + PF+S AFPDS+ ++ + + I +D +
Sbjct: 703 FVTTEHASLIYSSEGRIIFSATTADDATFVAPFSSHAFPDSVVLSTDQHIRICHVDRERL 762
Query: 652 LHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSC-AEESEMHFVRLLDDQTF-EFISTYP 709
H++++P+ E RR+ + + F + S+K + EE RL+D+ F E S +P
Sbjct: 763 THVKALPVNETVRRVAYSPGLKAFGLGSIKRELVNNEEVVTSSFRLVDEIVFKELGSPFP 822
Query: 710 LD---TFEYGCSILSCSFSD---DSNVYYCVGTAYVLP---EENEPTKGRILVFIVE-DG 759
L+ T E ++ D + + VGT+++ E+ T GRILV V+ +
Sbjct: 823 LNGSSTLELVECVIRAELPDVGGNPTERFIVGTSFISDGGVEDPNGTLGRILVLGVDANR 882
Query: 760 KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILAL 819
++ I KG L ++A +++ + +Y + + LQ + + +
Sbjct: 883 QVYQIVSHNLKGPCRCLGMIEDNIVAGLSKTVVVYSFSQETSSSGSLQKLAAYRPAAIPI 942
Query: 820 YVQTRGDFIVVGDLMKSISLLIY----KHEEGAIEERARDYNANWMSAVEILDDDIYLGA 875
+ G+ I VGDLM+S+SL+ + + ++ERAR + + W +AV ++ + +L A
Sbjct: 943 DIDVSGNMIGVGDLMQSLSLVEFIPAQDGNKAQLQERARHFESLWTTAVCHIEGERWLEA 1002
Query: 876 ENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQI-PTV 934
++ NL +++N T+++R R+E+ E +GE +NR R + +P ++ + P
Sbjct: 1003 DSRGNLVVLQRNVNAPTEQDRRRMEITSEMGIGEQINRIRK----LNVPAAENNIVHPRA 1058
Query: 935 IFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAK-N 993
+ G + + + + L Q+ + + + +G + + WRSF N+ + +
Sbjct: 1059 FLASAEGSLYLYGDIAPQYQDLLMTFQSRMEECVLALGNVEFKLWRSFRNDNRESEGPYR 1118
Query: 994 FLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRLH 1036
F+DG++IE FLD+ + + + + +VE++ +EEL R+H
Sbjct: 1119 FIDGEMIERFLDMGEEQQELVCDGLGPTVEDMRNMIEELRRMH 1161
>gi|378732825|gb|EHY59284.1| DNA damage-binding protein 1 [Exophiala dermatitidis NIH/UT8656]
Length = 1185
Score = 402 bits (1032), Expect = e-109, Method: Compositional matrix adjust.
Identities = 335/1150 (29%), Positives = 554/1150 (48%), Gaps = 138/1150 (12%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPTD 64
+YG I LE RP D LF+ T+RY + W+AE +L T ++ D++D + R +
Sbjct: 55 VYGYITILEKLRPASSQTDHLFVGTDRYHYFTCSWNAEIKQLRTEQSYVDLADNVLRNSK 114
Query: 65 NGQIGIIDPDCRLIGLHLYDGLFKVIPF---DNK------------GQLKEAFNIRLEEL 109
IDP R + L LYDG+ V+PF NK G L E +R+EEL
Sbjct: 115 EVDRCHIDPTGRYMTLELYDGVISVLPFVQPSNKRVKRETTTNSPVGSLGEPVQVRVEEL 174
Query: 110 QVLDIKFLYGCA----KPTIVVLYQDNKDARHVKTYEVAL---KDKDFVE----GPWSQN 158
FL P + +L++DN + +K E+ D+ E +
Sbjct: 175 LTRSSAFLETDPDTRENPKLAILWEDNLENPQLKLRELKYYPGGDQATAELETVAELHGD 234
Query: 159 NLDNGADLLIPVPPPLCGVLIIGEETIVYCS---ANAF-KAIPIRPSITKAYGRVDADGS 214
LD G LIPV P G LI+G+ I Y N +A+ +I + ++D
Sbjct: 235 QLDKGVSHLIPVSAPYGGFLILGDHAIRYVDRGLTNVITQALEDNATIWTCWTKID--DR 292
Query: 215 RYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLI 274
R+LL D G L L+I +V+ +++ +G S A+ + YLD V++GS GDSQ++
Sbjct: 293 RWLLADDFGRLFFLMIEVSGGEVSSWRLDTVGVASKATCLVYLDEGYVFVGSHSGDSQVV 352
Query: 275 KLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLER-------------QGQGQVVTCSGA 321
+ + +G V V++ + N+ PI+DF ++DL R GQ ++VT SGA
Sbjct: 353 HI----EEEG--VRVVQSFANIAPILDFTIMDLGRGAEAGGQQALEFSSGQARIVTASGA 406
Query: 322 YKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNL 380
++DG++R VR+G+G+ E ++ EL I +W L S+ LVVSF++ETR+ +
Sbjct: 407 WQDGTIRSVRSGVGLEELGTIGELSHITDLWGLSSTGQSDVQDVLVVSFVTETRVFKFDS 466
Query: 381 EDELEET-EIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSP-PGYS 438
E +EE E TL + +L Q+ +R+ S L EWK P
Sbjct: 467 EAAVEEVDEFHHLEFSQPTLLAANLPDRKLAQIYETGLRITDLESNMLTLEWKPPDSAAK 526
Query: 439 VNVATANASQVLLATGGG--HLVYLEIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQ 496
+ A AN+ +L+ GG H+ + + T K L+ +IS + + S S
Sbjct: 527 ITAACANSVHLLMVEGGHTLHVFHASNNEAKPTVSKSFPLDSQISSVTVAD-----SQSN 581
Query: 497 IAAVGMWTDISVRIFSLPDLNLITKEHLG--GEIIPRSVLLCAF--EGISYLLCALGDGH 552
+ V W SV I L L+ + + LG G IPRSVL+ + L A+ DG
Sbjct: 582 VCIVSFWQTASVAILDLHSLDTLYTQTLGTPGTDIPRSVLVAHVLPDAPPTLFIAMADGT 641
Query: 553 LLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNT-----THVFAASDRPTVIYSSNKK 607
+L++ ++ L+ ++ LGT+P+ + N+ ++VFA+ ++P++IYSS +
Sbjct: 642 VLSYSFDLAKHSLSGMTRILLGTEPVVFKQLPKDNSPDSNLSNVFASCEQPSLIYSSEGR 701
Query: 608 LLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRIC 667
++YS VN + S +C FN A+P+++A+A EL + I + ++++ +GE R +
Sbjct: 702 IIYSAVNAESASRVCNFNCEAYPEAIAMATPTELKLALIGRERTTQLQTLSIGETVRCLT 761
Query: 668 HQEQSRTFAI-CSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEY-GCSILSCSF- 724
++ ++ FA+ C + E+ + V++ D+ +F+ + + L E C + + SF
Sbjct: 762 YEPTAKMFAMGCIRRIMEGGTEALLSSVKIADEVSFKELDSVELQDRELVECIVSTGSFD 821
Query: 725 -SDDSNVY---YCVGTAYVLPEEN---EPTKGRILVFIV-EDGKLQLIAEKETKGAVYSL 776
D+S Y + VGT+ +L E++ E TKGRILVF V ++ KL+ + + + KGA SL
Sbjct: 822 TEDESKGYGDMFVVGTS-ILEEDDAGEEVTKGRILVFEVNKEKKLKAVTQIKVKGACRSL 880
Query: 777 NAFNGKLLAAINQKIQLYKWM---LRDDGTRELQSECGHHGHI--LALYV----QTRGDF 827
+GK++A + + + LY + R + + ELQ + L+L V Q
Sbjct: 881 AMCDGKIVAGLVKTVVLYGLIPSSTRGEHSLELQKLATYRTSTNPLSLAVTPATQDSPAL 940
Query: 828 IVVGDLMKSISLLIY-----KHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLF 882
I V DLMKS+S+L + E + E +R + W SA +L D ++ A+ NL
Sbjct: 941 IAVADLMKSLSVLQVLPPDSSNSEYRLHETSRHFATLWSSATAVLRHDEWVVADMEGNLI 1000
Query: 883 TVRKNSEGATDE----ERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDS----------DV 928
T+R+ S A + R RLEV GE+ LGE VN+ +V LP DV
Sbjct: 1001 TLRQGSLDAENNGAVAARRRLEVTGEFRLGEVVNK-----IVSILPPGSQRQSDQKMVDV 1055
Query: 929 G-------------QIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLN 975
G P GTV G I ++A++ L LQ++L ++ G +
Sbjct: 1056 GLDGGKKPARTGPLVTPRTFLGTVEGAIYMLATINPAYVNVLLLLQSSLATRVQAPGYMP 1115
Query: 976 HEQWRSFNNEKKTVDAK-NFLDGDLIES-FLDLSRTRMDEI-------SKTMNVSVEELC 1026
++R++ E D +DG+++E + L ++++ + +NVSVEE+
Sbjct: 1116 WAKYRAWKTEVLEKDEPFRVVDGEMLEQGLVALDDRELEDVLREAGLMEEALNVSVEEVR 1175
Query: 1027 KRVEELTRLH 1036
+EL RL+
Sbjct: 1176 GWADELRRLY 1185
>gi|326474493|gb|EGD98502.1| UV-damaged DNA binding protein [Trichophyton tonsurans CBS 112818]
gi|326481558|gb|EGE05568.1| UV-damaged DNA binding protein [Trichophyton equinum CBS 127.97]
Length = 1147
Score = 400 bits (1029), Expect = e-108, Method: Compositional matrix adjust.
Identities = 323/1116 (28%), Positives = 541/1116 (48%), Gaps = 108/1116 (9%)
Query: 6 IYGRIATLELFRPHGEAQ-----DFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRI 59
IYGRI L+ P A D LF+ T++Y + + WD + L T R D++D
Sbjct: 55 IYGRITLLKRL-PRSTAAGLALTDALFVGTDQYAYFSITWDPDHELLRTERKYIDLADGS 113
Query: 60 GRPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKG---------------------QL 98
R + IDP + L +Y+G+ + P QL
Sbjct: 114 LREAHSDDRCQIDPSGNFLTLEVYEGVVSIFPLVTTDSHKRAKSAVSASASAASTSIEQL 173
Query: 99 KEAFNIRLEELQVLDIKFLYGCAK--PTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWS 156
E +R EEL V FL A P +LY+D + + ++ P S
Sbjct: 174 GEPLQVRTEELMVRSSAFLDQEASNTPRFALLYEDTQGQVRLTLRDLKYTHAIVTSDPGS 233
Query: 157 ----------QNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIRP----SI 202
+ ++ GA +LIPVP PL G+L+IGE +I Y + + I RP ++
Sbjct: 234 AAELKYVTTLSDEIELGASILIPVPRPLGGLLVIGESSIKYVDVSKNETIS-RPLAESTV 292
Query: 203 TKAYGRVDADGSRYLLGDHAG-LLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAV 261
A+ +VD G R+LL D G L L+++ V K++ LG TS AS + YLD +
Sbjct: 293 FVAWEQVD--GQRWLLADDYGRLFFLMLVLDTDNAVESWKVDYLGVTSRASVLVYLDGGI 350
Query: 262 VYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQ---------GQ 312
V++GS GDSQ+I++ +G + ++++ N+ PI+DF ++D+ + GQ
Sbjct: 351 VFVGSHQGDSQVIQIK-----EGGF-DLIQTISNIAPILDFTIMDMGDRLGATREFSSGQ 404
Query: 313 GQVVTCSGAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLRSSTDDPFDTFLVVSFIS 371
++VT SGA+ DGSLR VR+G+GI E + ++ I +W+LRS+ +PF L+VSF++
Sbjct: 405 TRIVTGSGAFGDGSLRSVRSGVGIEELGVLASMEHITDLWALRSACPEPFSDTLLVSFVN 464
Query: 372 ETRILAMNLEDELEETE--IEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRN 429
ETR+ + + ++EE E G TL + N+++QVT S R++ +
Sbjct: 465 ETRVFHFSADGDVEEKEDGFLGLVFSQSTLLATNIPGNRILQVTESSSRVIDIDGGMII- 523
Query: 430 EWKSP-PGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDI--N 486
W+S +++ A+AN ++L GG LV + + L + + + ++S + I +
Sbjct: 524 -WRSTYEEFTITSASANDDYLVLVLGGIRLVCISLSTFELIGSREFEADNQVSGMTIPAS 582
Query: 487 PIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLG--GEIIPRSVLLCAF--EGIS 542
PI Q V + + I LP L + K+ LG GE IPRSV++
Sbjct: 583 PI-------QACIVCLPQSAEIVILGLPGLEVKNKQTLGEPGEAIPRSVIVAEILPNNPP 635
Query: 543 YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTH-VFAASDRPTVI 601
L ++ DG + +F ++ + + K++LG++ + + N + VFA D P++I
Sbjct: 636 TLFVSMADGTVFSFSFDVHAFAIFNSSKITLGSEQPSFKKLPRGNGQYNVFATCDHPSLI 695
Query: 602 YSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGE 661
++S +++YS V+ S +C N+ A+P S+A++ + EL I +D+ + I ++P+
Sbjct: 696 HASEGRIVYSAVDSASASRICSLNTQAYPGSIALSSQHELKIAIVDEERTTQIHTLPMHA 755
Query: 662 HPRRICHQEQSRTFAICSLKNQ--SCAEESEMHFVRLLDDQTFEFISTYPLDTFEY-GCS 718
RR+ + + F + ++K + + EE FV L D+ F +STY L E C
Sbjct: 756 SVRRLAYSPVEKAFGLGTVKRKISNGVEEVSSSFV-LADEILFRPLSTYDLRPDELVECV 814
Query: 719 ILS-CSFSDD------SNVYYCVGTAYVLPEENEPTKGRILVFIVEDGK-LQLIAEKETK 770
I S ++ D S + VGTA++ ++ +GRILVF V + L LI EK
Sbjct: 815 IRSQLNYGKDEVGNLISKDLFFVGTAFLDDVGDDHIRGRILVFEVNRSRELSLIVEKSLM 874
Query: 771 GAVYSLNAFNGKLL-AAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIV 829
GA +L + LL A + + + ++K G L+ + + + + +
Sbjct: 875 GACRTLAVMDQTLLVAGLVKSVSVFKLARGRFGNILLEKHAAYRTSTAPIDISVVDNTVA 934
Query: 830 VGDLMKSISLLIY-KHEEGA----IEERARDYNANWMSAVEILDDDIYLGAENNFNLFTV 884
V D+MKS+SL+ Y + EEG EE AR Y W +AV +++++YL A+ NL +
Sbjct: 935 VADVMKSMSLVQYTQAEEGGQEPKFEEIARHYQTLWSTAVAPIEENVYLLADAEGNLVVL 994
Query: 885 RKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIG 944
++N G T+ +R RL+ E LGE VNR H +V ++ V + TV+G I
Sbjct: 995 QQNITGVTESDRRRLQPTSEIRLGEMVNRI-HPIVVQTHTETAVSA--RALLATVDGSIY 1051
Query: 945 VIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAK-NFLDGDLIESF 1003
+ + L +LQT + + G + ++R+F + D F+DG+LIE F
Sbjct: 1052 LFGLINPNYVDLLLRLQTAMGSIATSPGEIPFSKYRAFRTTVRQSDEPFRFVDGELIERF 1111
Query: 1004 LDLSRTRMDEISKTMN---VSVEELCKRVEELTRLH 1036
L + EI ++ V+V L + +EEL R+H
Sbjct: 1112 LSCTLAMQQEIVGRLDDSTVTVSSLKEMIEELRRMH 1147
>gi|226291941|gb|EEH47369.1| DNA damage-binding protein 1a [Paracoccidioides brasiliensis Pb18]
Length = 1209
Score = 400 bits (1028), Expect = e-108, Method: Compositional matrix adjust.
Identities = 323/1126 (28%), Positives = 533/1126 (47%), Gaps = 121/1126 (10%)
Query: 15 LFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPTDNGQIGIIDP 73
LF P D LFI T+R + + W+ +S+L T R D++D R + G +IDP
Sbjct: 101 LFLPQ---TDVLFIGTDRATYFTVSWNPVTSQLRTERKYVDLADPSSRESQLGDRCLIDP 157
Query: 74 DCRLIGLHLYDGLFKVIPFDNKG-----------------------------------QL 98
+ I L LY+G+ VIP +L
Sbjct: 158 SGKFITLELYEGIITVIPIGQPQRTARQSGRKYGKRAVTAQNQHDSSHIGNTNAAGEVEL 217
Query: 99 KEAFNIRLEELQVLDIKFLYGCAK--PTIVVLYQDNKDARHVKTYEV------------- 143
E R++EL V FL+ A P + LY+D +K E+
Sbjct: 218 GEPCQARVDELLVRSSAFLHTQADMLPRMAFLYEDTMGQVRLKVRELEFTYGGMGIGTGG 277
Query: 144 -ALKDKD----FVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYC--SANAFKAI 196
A +D F + L+ GA LIPVP PL G+L++GE +I Y + N ++
Sbjct: 278 GAWQDTGCIAVFKALDLLKEELEMGASFLIPVPAPLGGLLVLGETSIRYLDDATNECISL 337
Query: 197 PI-RPSITKAYGRVDADGSRYLLGDHAG-LLHLLVITHEKEKVTGLKIELLGETSIASTI 254
P+ +I A+ +VD G R+LL D G L L++I E V K++LLG AS +
Sbjct: 338 PLDEATIFVAWEQVD--GQRWLLADDYGRLFFLMLILDEDNAVQSWKLDLLGNIPRASVL 395
Query: 255 SYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDL-----ER 309
YL V +IGS GDSQLI++ +GS+ E+++ + N+ PI+DF ++DL E
Sbjct: 396 VYLGGGVTFIGSHQGDSQLIRIT-----EGSF-EIIQTFSNIAPILDFTIMDLGGRAGEN 449
Query: 310 Q------GQGQVVTCSGAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLRSSTDDPFD 362
Q GQ ++VT SGA+ DGSLR VR+G+G+ E + ++ I +W+LR + + F
Sbjct: 450 QTHDFSSGQARIVTGSGAFDDGSLRSVRSGVGMEEVGVLGAMEHITDLWALRVACQEGFS 509
Query: 363 TFLVVSFISETRILAMNLEDELEET-EIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVS 421
L+VSF+ ETR+ + E+EE E G TL + +++QVT +VR+
Sbjct: 510 DTLLVSFVDETRVFHFTQDGEVEEKDEFMGLGLAESTLLAANLPNGRILQVTERNVRVAD 569
Query: 422 STSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTEVKHAQLEYEI 480
L W P ++ A++N ++L GG L+ +I G+ L K + ++
Sbjct: 570 LDDGMLLWNWSPPSQKAITAASSNDDHLVLVVGGQVLMCFDIQGEVKLAGKKDFGDDTQV 629
Query: 481 SCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLG--GEIIPRSVLLCAF 538
S + + +P+ I + ++ V +L D+ + +G GE PRSVL+
Sbjct: 630 SGVTVT---SSPATYCILCLPQTAEVVV--MNLEDMTIRHSTSVGEPGEAFPRSVLVAEV 684
Query: 539 --EGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGT-QPITLRTFSSKNTTHVFAAS 595
+ L ++ DG + +F N L K+ LG+ QP + ++FA
Sbjct: 685 LPNQPATLFVSMADGRVFSFSFNADEFTLKKMSKLVLGSEQPSFKKLPRGDGLYNIFATC 744
Query: 596 DRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIR 655
++P++IY++ +++YS V+ + S +C FNS A+P S+A+A EL I +D + I
Sbjct: 745 EQPSLIYATEGRIIYSAVHSDQASRICHFNSEAYPGSIALATPTELKIAHVDSERTTQIH 804
Query: 656 SIPLGEHPRRICHQEQSRTFAICSLKN--QSCAEESEMHFVRLLDDQTFEFISTYPLDTF 713
++ + E RR+ + + F I ++K ++ AE F+ L D+ F + Y L
Sbjct: 805 TLEIDETVRRVAYSAAEKAFGIGTIKRTLENGAEVITSRFM-LADEIMFRELDDYSLRPD 863
Query: 714 EYGCSILSCSF-----SDDSNVY---YCVGTAYVLPEENEPTKGRILVFIVEDGK-LQLI 764
E S++ F SD + + + VGT+Y+ +GRIL F V + L +
Sbjct: 864 ELVESVIQAQFPEGKDSDGNESFKDIFVVGTSYLDDVGEGSIRGRILAFEVTGSRQLAKV 923
Query: 765 AEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTR 824
AE KGA +L K++AA+ + + +Y + L + + +
Sbjct: 924 AELPVKGACRALAVVQDKIVAALMKTVVIYSIAKGELSDYTLNKTASYRTSTAPIDIAVT 983
Query: 825 GDFIVVGDLMKSISLLIYKHEEG----AIEERARDYNANWMSAVEILDDDIYLGAENNFN 880
G+ I V DLMKS+S++ +K E ++ E AR + W +AV + ++++L ++ N
Sbjct: 984 GNLIAVADLMKSVSIIEFKQGENDQPDSLTEVARHFQTLWSTAVAPIAENMFLESDAEGN 1043
Query: 881 LFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQI---PTVIFG 937
L + +N G TD+++ RLEV E LGE VNR R S+ LP + + P G
Sbjct: 1044 LVVLNRNVNGVTDDDKRRLEVTSEILLGEMVNRIRPVSIQGSLPATGPREAVISPKAFLG 1103
Query: 938 TVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNE-KKTVDAKNFLD 996
TV G I + + L +LQ+ + ++ G + ++R+F N ++ + F+D
Sbjct: 1104 TVEGSIYLFGLINPAYQDLLMRLQSAMAGLVVTPGAMPFNKFRAFKNAVRQAEEPYRFVD 1163
Query: 997 GDLIESFLDLSRTRMDEI------SKTMNVSVEELCKRVEELTRLH 1036
G+LIE F +EI +V+VE++ + VEEL R+H
Sbjct: 1164 GELIERFWACETALQEEIVGLVVAGGVADVTVEKVKRIVEELRRMH 1209
>gi|402083318|gb|EJT78336.1| hypothetical protein GGTG_03437 [Gaeumannomyces graminis var. tritici
R3-111a-1]
Length = 1155
Score = 400 bits (1028), Expect = e-108, Method: Compositional matrix adjust.
Identities = 302/1115 (27%), Positives = 545/1115 (48%), Gaps = 98/1115 (8%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPTD 64
I+G I L+ +P + + + T+R+++ + W+ E+ EL T + D ++ R
Sbjct: 55 IHGTIVMLQALQPKDSPTEMILVGTDRFRYFSVLWNQETRELETVQDFLDHGEQFVRDVT 114
Query: 65 NGQIGIIDPDCRLIGLHLYDGLFKVIPFDN-KGQLKEAFNI---RLEELQVLDIKFLYG- 119
++DP R I + ++ G+ V+ KG+ ++ + R+ EL + D FLY
Sbjct: 115 TRHRCLVDPSGRFIVMLVWSGVINVMCLQRRKGKQRDLLLLDQSRISELYIKDAAFLYSE 174
Query: 120 CAKPTIVVLYQDNKDARHVK--TYEVALKDKDFVEGPWS--------QNNLDNGADLLIP 169
A P + LYQ + K Y + DKD + + D GA +LIP
Sbjct: 175 DAHPRVAFLYQTKPNVPESKLVAYRLCSDDKDLEVSRFDPLKDRIIDIDIPDPGAVMLIP 234
Query: 170 VP---------------PPLCGVLIIGEETIVYC---SANAFKAIPIRPSITKAYGRVDA 211
V L G+L++GE ++Y + + + P + A+ + DA
Sbjct: 235 VERVESKRHHFRNPGADARLGGLLVVGETQVLYIDDMTQATVETALMSPMVFVAWAKCDA 294
Query: 212 DGSRYLLGDHAGLLHLLVITHEKEKVT--GLKIELLGETSIASTISYLDNAVVYIGSSYG 269
S Y+L D G LHLL + +++ VT + + L+G+ S A+ + LD+ +++GS YG
Sbjct: 295 --SHYILADDYGGLHLLTLDVKEDGVTVKDIIVCLIGKASRANELVCLDSGYLFVGSHYG 352
Query: 270 DSQLIKLNL-QPDAKGSYVEVLERYVNLGPIVDFCVVDLERQ------------GQGQVV 316
DS+L+ LNL + D VE ++ ++ PI+DF V+D+ + GQ ++V
Sbjct: 353 DSELLCLNLVREDTDRPLVETVQTLPSIAPILDFTVMDMGNREADNTVGNEYASGQARIV 412
Query: 317 TCSGAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLRSSTDDPFDTFLVVSFISETRI 375
T SGA++DGSLR VR+G+ +++ + ++ G++G++ L S+ + LV SF++ETR+
Sbjct: 413 TGSGAHRDGSLRSVRSGVRLDDIGILADMPGVRGLFPLESNGET---NALVASFLTETRV 469
Query: 376 LAMNLEDELEETE-IEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSP 434
+ E ++EE E +G ++L + + ++Q+T ++ + + W+ P
Sbjct: 470 FTFDAEGDIEEVEAFQGLDLSVRSLLVTNLGDDSILQITDHGASVIGPEAGITISSWEPP 529
Query: 435 PGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDINPIGENPSY 494
G + + N +LL+ G LV L I D ++ + + +ISCL +P
Sbjct: 530 TGRVITGCSTNGKWLLLSIDGKTLVSLRIPDLTVSTQRESSPNDQISCL-----AASPHL 584
Query: 495 SQIAAVGMWTDISVRIFSLPDLNLITKEHL----GGEIIPRSVLLC----AFEGISYLLC 546
I AVG W+D SV I +L L++I E L I+PR ++L A L
Sbjct: 585 PDIGAVGFWSDGSVSIINLCTLDVIHSETLRRTKDAAIVPRDIVLARVLPAEVAGPTLFV 644
Query: 547 ALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSK-NTTHVFAASDRPTVIYSSN 605
++ DG ++ F LN G+L+ RK V LGT+ + ++F S+ ++IY S+
Sbjct: 645 SMDDGEVVTFTLN-DVGKLSARKSVVLGTRQARFHVLPQPGDVCNIFVTSEHSSLIYGSD 703
Query: 606 KKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRR 665
K+++YS V + +++CPF++A FP SLA+A + EL I ID ++ H++S+P+GE+ R
Sbjct: 704 KRIIYSAVTAESATYVCPFDTAVFPGSLAVATDSELKISKIDPQRQSHVQSLPMGENVRS 763
Query: 666 ICHQEQSRTFAI-CSLKNQSCAEESEMHFVRLLDDQTFEFI-STYPLDTFEYGCSILSCS 723
I + +R F + C + S E RL+D+ + + + + L+ E +++
Sbjct: 764 IAYSAPTRVFGLGCIRREISKGVEKASSTFRLVDEVVLQPLGNPFELNEGEVVETVIRAQ 823
Query: 724 FSDDSNVY---YCVGTAYVLPEENEP---TKGRILVFIVEDGKLQL-IAEKETKGAVYSL 776
D + VGT +++ E P +KGR+LVF V++ + I K L
Sbjct: 824 LRDTFGRLAERFIVGTRFLVDENLVPGSNSKGRVLVFGVDEERSPFQIVSHPLKSGCRRL 883
Query: 777 NAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKS 836
++ A+ + + + ++ + +L + A+ V G I VGD+MKS
Sbjct: 884 AVMEEMIVVALTKTVVVARYEELTSTSGKLIKVASYQTTSYAIDVAVEGRLIAVGDIMKS 943
Query: 837 ISLLIY--------------KHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLF 882
+SL+ + + + E R Y ++W +AV + + +L A+ + N+
Sbjct: 944 MSLVEFVPPTTVAGDGKAGETKKPAQLIEVCRHYQSSWSTAVAHFEGESWLEADADGNVM 1003
Query: 883 TVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGV 942
+ +N+ G T E+R R+E+ E +LGE +NR + S V P++ + P T G
Sbjct: 1004 VLGRNTTGVTLEDRRRMEITSEINLGENINRIQKIS-VETGPNAPIH--PKAFLSTTEGS 1060
Query: 943 IGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAK-NFLDGDLIE 1001
I ++ ++ + L LQ L + +G + + +RSF N ++ D F+DG+ IE
Sbjct: 1061 IYLVGAIAPQMRDLLLNLQDRLEDYVGTLGNIPFKNFRSFRNAEREADGPVRFIDGEYIE 1120
Query: 1002 SFLDLSRTRMDEISKTMNVSVEELCKRVEELTRLH 1036
FLD++ ++ + + SVE++ VEEL +H
Sbjct: 1121 RFLDMNEETQSQVCRDLGPSVEDMRNLVEELKNMH 1155
>gi|154286506|ref|XP_001544048.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
gi|150407689|gb|EDN03230.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
Length = 1158
Score = 400 bits (1027), Expect = e-108, Method: Compositional matrix adjust.
Identities = 317/1090 (29%), Positives = 523/1090 (47%), Gaps = 92/1090 (8%)
Query: 16 FRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPTDNGQIGIIDPD 74
FRP D LFI T+R + + WD ++ L T R D++D R + G +IDP
Sbjct: 92 FRPQ---TDILFIGTDRASYFTVSWDQKTKNLRTERKYVDLADPSARASQLGDRCLIDPS 148
Query: 75 CRLIGLHLYDGLFKVIP---------------FDNKGQLKEAFNIRLEELQ------VLD 113
R + L LY+G+ VIP +L + R+EEL L
Sbjct: 149 GRFLTLELYEGIVTVIPTLRPHDPQGLGGLGAAAAGQELGDPCQARVEELSDTMGKVRLK 208
Query: 114 IKFLYGCAKPTIVVLYQDNKDARHVKTYEVA-LKDKDFVEGPWSQNNLDNGADLLIPVPP 172
++ L + VA LK D + + L+ GA L+PVP
Sbjct: 209 VRELEVMHGGMRMGGGTGGGGGIGADGGIVAVLKGLDLL-----KEELEMGASFLVPVPA 263
Query: 173 PLCGVLIIGEETIVYC--SANAFKAIPIR-PSITKAYGRVDADGSRYLLGDHAG-LLHLL 228
PL G+L++GE +I Y ++N + P++ +I A+ +VD G R+LL D G L L+
Sbjct: 264 PLGGLLVLGETSIRYLDDASNECISQPLKEATIFVAWEQVD--GQRWLLADDYGRLFFLM 321
Query: 229 VITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVE 288
++ V K++LLG+ AS + Y+ + +IGS GD +LI++ +GS+ E
Sbjct: 322 LVLDTDNAVQSWKLDLLGDIPRASVLVYMGGGITFIGSHQGDPELIRIT-----EGSF-E 375
Query: 289 VLERYVNLGPIVDFCVVDL------ERQ------GQGQVVTCSGAYKDGSLRIVRNGIGI 336
V++ + N+ PI+DF ++DL E Q GQ ++VT SGA+ DGSLR VR+G+G+
Sbjct: 376 VIQTFANIAPILDFTIMDLGGRAIGESQTHEFSSGQARIVTGSGAFNDGSLRTVRSGVGM 435
Query: 337 NEQASV-ELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETE-IEGFCS 394
E + ++ I +W+LR + F L+VSF+ ETR+ + E+EE E G
Sbjct: 436 EEVGVLGAMKHITDLWALRVACPQEFSDTLLVSFVDETRVFYFTPDGEVEEKEEFMGLGL 495
Query: 395 QTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATG 454
TL + + +++QVT +VR+ + EW ++ A++N ++L G
Sbjct: 496 AESTLLAANLPHGRILQVTEWNVRVAELDGGMVIWEWSPEQQKAITAASSNDDHLVLMVG 555
Query: 455 GGHLVYLEI-GDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL 513
G L+ +I GD +T K + ++S + + +P + I + ++ V SL
Sbjct: 556 GQVLMIFDIRGDINITGEKDFGFDTQVSGVTVT---TSPVRACILCLPQTAEVVV--MSL 610
Query: 514 PDLNLITKEHLG--GEIIPRSVLLCAF--EGISYLLCALGDGHLLNFLLNMKTGELTDRK 569
DL + LG G+ +PRSVL+ + L ++ DG + +F N++ LT
Sbjct: 611 SDLAIRRSTSLGEPGDAVPRSVLVAEVLSNNPATLFVSMADGSVFSFSFNLEDFSLTSMS 670
Query: 570 KVSLGT-QPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAA 628
K++LG+ QP + +VFA ++P +IY+ +++YS VN S +C FNS A
Sbjct: 671 KLTLGSEQPSFKKLPRGDGLYNVFATCEQPRLIYAVEGRIVYSAVNSGLASRICHFNSEA 730
Query: 629 FPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKN--QSCA 686
+P S+A+A EL I +D + I+++ +GE RR+ + R F I ++K + A
Sbjct: 731 YPGSIALATPSELKITLVDAERTTQIQTLEVGETVRRVAYSAPERAFGIGTIKRTLEDGA 790
Query: 687 EESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNV--------YYCVGTAY 738
E F+ L D+ F + Y L+ E S++ F D + + VGT+Y
Sbjct: 791 EVIASRFM-LADEIMFRELDIYDLNKDELVESVIRAQFPDGIDREGNDLFKDLFVVGTSY 849
Query: 739 VLPEENEPTKGRILVF-IVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWM 797
+ +GRIL F + + +L +AE KGA +L K++AA+ + + +Y
Sbjct: 850 LDDFGEGSIRGRILAFEVTANRQLAKVAEMPVKGACRALAIVQDKIVAALMKTVVVYTIS 909
Query: 798 LRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEG----AIEERA 853
L + + + G+ I V DLMKS+S++ Y+ ++ E A
Sbjct: 910 KGQFADYTLSKTASYRTSTAPIDIAVTGNLIAVADLMKSVSIVEYQQGSNGLPDSLTEVA 969
Query: 854 RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
R + W +AV + +D +L ++ NL + +N G TD++R RLEV E LGE VNR
Sbjct: 970 RHFQTLWSTAVAHVAEDTWLESDAEGNLVMLHRNVNGVTDDDRRRLEVTSEILLGEMVNR 1029
Query: 914 FRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGG 973
R + + ++ P GTV G I + + L +LQ+ + ++ GG
Sbjct: 1030 IRPVN-IQGSQGAEAAISPRAFLGTVEGSIYLFGIINPTYQDLLMRLQSAMAGMVVTPGG 1088
Query: 974 LNHEQWRSFNNE-KKTVDAKNFLDGDLIESFLDLSRTRMDEI------SKTMNVSVEELC 1026
+ ++R+F N ++ + F+DG+LIE FL S +EI V+VE +
Sbjct: 1089 MPFNKFRAFRNTIRQAEEPYRFVDGELIERFLSCSVELQEEIVGKVIADGVTGVTVESVK 1148
Query: 1027 KRVEELTRLH 1036
VEEL R+H
Sbjct: 1149 GLVEELRRMH 1158
>gi|261193401|ref|XP_002623106.1| UV-damaged DNA binding protein [Ajellomyces dermatitidis SLH14081]
gi|239588711|gb|EEQ71354.1| UV-damaged DNA binding protein [Ajellomyces dermatitidis SLH14081]
Length = 1168
Score = 399 bits (1026), Expect = e-108, Method: Compositional matrix adjust.
Identities = 321/1120 (28%), Positives = 526/1120 (46%), Gaps = 144/1120 (12%)
Query: 16 FRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPTDNGQIGIIDPD 74
FRP D LF+ T+R + + WD ++ L T R D++D R + G +IDP
Sbjct: 94 FRPQ---TDILFVGTDRSTYFTVSWDQKTKNLHTERKYVDLADGSARESQLGDRCLIDPS 150
Query: 75 CRLIGLHLYDGLFKVIP----FDNKG----------------------------QLKEAF 102
R + L LY+G+ VIP ++G +L E
Sbjct: 151 GRFLTLELYEGIVTVIPIGQPLRSRGSGRKLGKRAANASASTMQGNQDVGAADLELGEPC 210
Query: 103 NIRLEELQVLDIKFLYGCAK--PTIVVLYQDNKDARHVKTYEV----------------- 143
R+EEL V FL+ A P + +LY+D +K E+
Sbjct: 211 QARVEELLVRSSTFLHTQADMLPRMALLYEDTMGKVRLKVRELEVTYGGMGIGSGSGGGS 270
Query: 144 ----------ALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYC--SAN 191
LK D + + L+ GA L+PVP PL G+L++GE +I Y ++N
Sbjct: 271 GVGADGGSAAVLKGLDML-----KEELEMGASFLVPVPAPLGGLLVLGETSIRYLDDASN 325
Query: 192 AFKAIPIR-PSITKAYGRVDADGSRYLLGDHAG-LLHLLVITHEKEKVTGLKIELLGETS 249
+ P+ +I A+ +VD G R+LL D G L L++I V K++ LG
Sbjct: 326 ECISQPLEEATIFVAWEQVD--GQRWLLADDYGRLFFLMLILDSDNAVQSWKLDRLGNIP 383
Query: 250 IASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDL-- 307
AS + Y+ V +IGS GDSQLI++ +GS+ EV++ + N+ PI+DF ++DL
Sbjct: 384 RASVLVYMGGGVTFIGSHQGDSQLIRIT-----EGSF-EVIQTFANIAPILDFTIMDLGG 437
Query: 308 ----ERQ------GQGQVVTCSGAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLRSS 356
E Q GQ ++VT SGA+ DGSLR VR+G+G+ E + ++ I +W+LR +
Sbjct: 438 RAMGESQTHEFSSGQARIVTGSGAFNDGSLRSVRSGVGMEEVGVLGSMEHITDLWALRVA 497
Query: 357 TDDPFDTFLVVSFISETRILAMNLEDELEET-EIEGFCSQTQTLFCHDAIYNQLVQVTSG 415
+ F L+VSF+ ETR+ + +EE E G TL + +++QVT
Sbjct: 498 CPEEFSDTLLVSFVDETRVFYFTPDGGVEEKDEFMGLGLTESTLIAANLPNGRILQVTER 557
Query: 416 SVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTEVKHA 474
+VR+ + EW ++ A++N ++L GG L+ +I G+ + K
Sbjct: 558 NVRVAELDGGMVMWEWSPVSQKAITAASSNDDHLVLMVGGQVLMIFDIRGEIKVAGEKDF 617
Query: 475 QLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLG--GEIIPRS 532
++ ++S + + +P+ S I + ++ V SL DL + LG G+ +PRS
Sbjct: 618 GVDTQVSGVTVT---ASPARSCILCLPQTAEVIV--MSLADLTIRHSTSLGEPGDAVPRS 672
Query: 533 VLLCAF--EGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGT-QPITLRTFSSKNTT 589
VL+ + L ++ DG + +F N LT K++LG+ QP +
Sbjct: 673 VLVAEVLPNNPATLFVSMADGSVFSFSFNATDFSLTSMSKITLGSEQPSFKKLPRGDGLY 732
Query: 590 HVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDI 649
+VFA ++P++IYS+ +++YS VN + S +C FNS A+P S+A+A EL I +D
Sbjct: 733 NVFATCEQPSLIYSAEGRIVYSAVNSDQASRICHFNSEAYPASIALATPSELKIALVDAE 792
Query: 650 QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKN--QSCAEESEMHFVRLLDDQTFEFIST 707
+ I+++ +GE RR+ + R F I ++K + AE F+ L D+ F +
Sbjct: 793 RTTQIQTLEVGETVRRVAYSAPERAFGIGTIKRTLEDGAEVIASRFM-LADEIMFRELDV 851
Query: 708 YPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEK 767
Y L+ E S++ F DD + ++G L+ L ++AE
Sbjct: 852 YDLNKDELVESVIRAQFPDDKD-----------------SEGEELM-----KDLFIVAEM 889
Query: 768 ETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDF 827
KGA +L K++AA+ + + +Y L + + + G+
Sbjct: 890 PVKGACRALAIVQDKIVAALMKTVVVYTLSKGQFADYILTKTASYRTSTAPIDIAVTGNL 949
Query: 828 IVVGDLMKSISLLIYKHE----EGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFT 883
I V DLMKS+S++ Y+ G++ E AR + W +AV + D +L ++ NL
Sbjct: 950 IAVADLMKSVSIVEYQQGTDGLSGSLTEVARHFQTLWSTAVAPVAQDTWLESDAEGNLVV 1009
Query: 884 VRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVI 943
+R+N G T+++R RLEV E LGE VNR R ++ L ++ P GTV G I
Sbjct: 1010 LRRNVNGVTEDDRRRLEVTSEVLLGEMVNRIRPVNIQASL-GTEAAISPRAFLGTVEGSI 1068
Query: 944 GVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNE-KKTVDAKNFLDGDLIES 1002
+ + L +LQ+ + ++ GG+ ++R+F N ++ + F+DG+LIE
Sbjct: 1069 YLFGIINPTYQDLLMRLQSAMAGMVVTPGGMPFNKFRAFRNTVRQAEEPYRFVDGELIER 1128
Query: 1003 FLDLSRTRMDEI------SKTMNVSVEELCKRVEELTRLH 1036
FL +EI V+VE + VEEL R+H
Sbjct: 1129 FLGCGAELQEEIVGKVIADGVAGVTVERVKAIVEELKRMH 1168
>gi|342318986|gb|EGU10938.1| DNA damage-binding protein 1b [Rhodotorula glutinis ATCC 204091]
Length = 1102
Score = 398 bits (1023), Expect = e-108, Method: Compositional matrix adjust.
Identities = 316/1121 (28%), Positives = 533/1121 (47%), Gaps = 153/1121 (13%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESS-ELITRAMGDVSDRI 59
+L+VP++ IA+++ G+A L + T + L D S + T A +++
Sbjct: 50 LLEVPLFSTIASMQ----SGQATASLVVLTTCLRLFALVHDPTSPYNVKTVASVSIAEPF 105
Query: 60 GRPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPF---DNKGQ------------------- 97
GR + I ++DP R + +H Y GL +V+PF D Q
Sbjct: 106 GRLAEYQDI-LVDPANRCLLVHAYAGLVRVVPFFIGDTPAQRGSRTATRLAGSPHGSPLD 164
Query: 98 LKEAFNIRLEELQVLDIKFLYGCA--KPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPW 155
++N+RL L V + + A P + ++ ++ R + T+ ++LKDKD EGP
Sbjct: 165 FSYSYNVRLPILNVASLAIVASSADDPPILAGIHIEHTGTRTLSTFAISLKDKDLCEGPV 224
Query: 156 SQNNL-DNGADLLIPVPPP-LCGVLIIGEETIVY--------CSANAFKA---------I 196
+ L D G++L++PV G+L+ GEET+ + S+ A K +
Sbjct: 225 AAFALADPGSELVLPVNEANAPGILVAGEETVTWVSLEERNGASSTASKGKGRAASQMLV 284
Query: 197 PIRPSITKAYGRVDADGSRYLLGDHAG-LLHLLVITHEKEKVTGLKIELLGETSIASTIS 255
R + + + R LLGD G L + + E V+G+ + +G+ + AS I
Sbjct: 285 SCRLPVARITAWSWRNNDRLLLGDIYGKLFEVNLRRAENGVVSGISAQDVGDAASASAIV 344
Query: 256 YLDNAVVYIGSSYGDSQLIKLNLQPD---AKGSYVEVLERYVNLGPIVDFCVVDLERQGQ 312
L VY+ S +GDSQ+++ D A + + +++ N+ PIVD C+V Q
Sbjct: 345 PLGATTVYLASRFGDSQIVRFVSGEDEGMAGPAELALVDSISNIAPIVDLCIVSAHGQAA 404
Query: 313 GQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDPFDTFLVVSFISE 372
VTCSGAYK GSLR++R G+G++E A++E G++ WSL S + LV+ F SE
Sbjct: 405 NYAVTCSGAYKTGSLRVIRRGVGLSELAALEADGVQQAWSLTSPAPST-EPILVLGFFSE 463
Query: 373 TRILAMNLED-------ELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSR 425
TR+ M+ D +++E +++ F S T+F + + + LV+VT+ +V S
Sbjct: 464 TRVFRMSAGDPAGAEAVQIDEMDLQLFASPGPTIFAGN-VGSMLVRVTAEAVMYASDRGE 522
Query: 426 ELRNEWKS-PPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLD 484
E WK+ G A A ++LA GG L L+ +G TE + E++I+ +
Sbjct: 523 E---SWKATSAGKITAAAAAGGDHLVLAFDGGFLQLLKAKNGTFTESGNIAFEHDIASVS 579
Query: 485 INPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAF-EGISY 543
+ S A VG+WT +V + +PDL + + + + RS L F +G
Sbjct: 580 L----AETSAGAFAVVGLWTSQAVHLVGIPDLAVYASQKITSTFLIRSAALINFGDGDYT 635
Query: 544 LLCALGDGHLLNFLLNMKTGELTDR--KKVSLGTQPITLRTFSSKNTTHVFAASDRPTVI 601
L LGDG L ++ +++ + D K ++LG +P+ L ++ + A SDRPTVI
Sbjct: 636 LFVGLGDGTLASYRVDLAAPAVIDSTGKMIALGRRPLLLSEIGPESAKVLLAVSDRPTVI 695
Query: 602 YSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGE 661
+ +L YS+ VS + + A D +A+ + IG +D +QK+ +R++PL E
Sbjct: 696 SKARDRLNYSS----GVSSVANISHPALGDLVALVSREGVQIGRMDTVQKVDVRTVPLAE 751
Query: 662 -HPRRICHQEQSRTFAI-CSLK--NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGC 717
PRRI + SR F + CS + ++ +++ VRL+ + +FE S + L+ E
Sbjct: 752 DEPRRIVYDPVSRKFCVACSRRDIDRHTGQQAVTSVVRLVSEDSFETSSVFELERGE--- 808
Query: 718 SILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAV-YSL 776
+ Q +A E++G V +S
Sbjct: 809 ------------------------------------------EAQSLALVESQGTVAWSS 826
Query: 777 NAFNG-----KLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRG--DFIV 829
+ F K A ++ + + W + +G L+S G +A Y R D ++
Sbjct: 827 SPFQKLETCVKFRAPVSAAVAV--WTIDSEGALALKSTW--SGAFVA-YTLARAPDDTLI 881
Query: 830 VGDLMKSISLLIY-KHEEGAIEERARDYNANWMSAVEILDDDI-----YLGAENNFNLFT 883
VGD ++S+++L Y + + E A+DY + +M VE L D Y+GAE + NLF
Sbjct: 882 VGDALRSLTVLRYTASPQPKLLEVAKDYRSRYMVGVEDLGRDSTGADRYIGAETDLNLFA 941
Query: 884 VRKNSEGATDE--ERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNG 941
V ++ + A L+ G +HLGE V RFR GSL + D++ +P +++ T G
Sbjct: 942 VSRHPQQAAGNLANAATLQDAGAFHLGELVTRFRTGSLGDLIGDANTTVVPRLVYSTSAG 1001
Query: 942 VIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIE 1001
IGVIA L L L+ N+R+ +KGVGGL E+ + +K + F+DGD ++
Sbjct: 1002 TIGVIADLDAASSRILSDLERNMREFVKGVGGLEQEELVAGVADKVKTPSTGFIDGDFVQ 1061
Query: 1002 SFLDLSRTRMDEISK------TMNVSVEELCKRVEELTRLH 1036
SFLDLS+ +++ + + V E+ + +EE+ R+H
Sbjct: 1062 SFLDLSKDVQEQVMQGKSEHERLTVDKAEIVRLLEEVARVH 1102
>gi|429850956|gb|ELA26181.1| DNA damage-binding protein 1 [Colletotrichum gloeosporioides Nara
gc5]
Length = 1409
Score = 397 bits (1020), Expect = e-107, Method: Compositional matrix adjust.
Identities = 321/1127 (28%), Positives = 533/1127 (47%), Gaps = 139/1127 (12%)
Query: 4 VPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRP 62
V IYG + L+ P D LF+ T+R+++ +WD++ L+T + + D + R
Sbjct: 52 VSIYGTVLLLQRLTPKDSKTDLLFVGTDRFQYFTARWDSDLQRLVTEQVIEDSGEPHMRE 111
Query: 63 TDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFN-----IRLEELQVLDIKFL 117
+ ++DP R + +HL++G+ V+ + + +RL EL + F+
Sbjct: 112 AQSQDKCLVDPSGRFMAMHLWEGVMNVMRLGARAPHASRLDPLWEQVRLSELSMKASTFV 171
Query: 118 -YGCAKPTIVVLYQ---DNKDAR-----------HVKTYEV-ALKDKDFVEGPWSQNNLD 161
PTI LYQ D +DAR + KT A KD++F LD
Sbjct: 172 PTETGHPTIAFLYQSKIDKEDARLAIYRLMSDDRNTKTSNFDAEKDREF-----DMEILD 226
Query: 162 NGADLLIPVP------------------PPLCGVLIIGEETIVYC---SANAFKAIPIRP 200
A +LIPV L G++++GE +VY + A P
Sbjct: 227 PYARILIPVSVEEDQVKRYHKRDTSAAKAQLGGLIVVGETLLVYVDTLTKTKVSAGIDNP 286
Query: 201 SITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNA 260
+I AY D Y L D G L LL I + VTGL++ LG TS AS + Y+ +
Sbjct: 287 TIFVAYAEYDV--RNYFLSDDYGNLQLLTIDTDGVVVTGLRVTHLGLTSRASCLVYMGDG 344
Query: 261 VVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLER----------- 309
++++GS YGDSQL+K+++ A ++++ N+ PI+DF ++DL
Sbjct: 345 MLFLGSHYGDSQLLKVDVT--AGSETTQIVQTIPNIAPILDFAIMDLGNAGDSQVGNAFS 402
Query: 310 QGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVE-LQGIKGMWSLRSSTDDPFDTFLVVS 368
GQ ++V G +++GSLR +R+ +G+ + +E L ++G++SLRS+ D DT LVVS
Sbjct: 403 SGQARIVAGCGVHQNGSLRSIRSSVGLEDIGVLEDLSDVRGLFSLRSNGSDKDDT-LVVS 461
Query: 369 FISETRILAMNLEDELEET-EIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSREL 427
FI++TR+ + E+ +EE + +G TL QL+QVT+ S L+
Sbjct: 462 FITDTRVFRFDAEEGIEEVFQFQGLGLDKPTLIASTLPNGQLLQVTANSAALLDPEGGIT 521
Query: 428 RNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-----------GDGILTEVKHAQL 476
+ W +P G + A++NA VLL+ G LV L + G G+ E
Sbjct: 522 LSTWSAPEGKVIVNASSNAKWVLLSIDGTTLVSLSLLENLAAREQVLGSGVGGEGD---- 577
Query: 477 EYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGE----IIPRS 532
+ISC+ +N + VG W S+ I L L+ + E + +PR+
Sbjct: 578 --QISCIHAATDLDN-----VGVVGFWATGSISIIDLQTLDAMHGETIKQTDDSVSVPRN 630
Query: 533 VLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNT---T 589
+ L +++F ++ L+ RK V LG++ L +T +
Sbjct: 631 IAL-----------------VVSFDVSKDNYSLSGRKSVILGSKQSGLHILPRPDTDGIS 673
Query: 590 HVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDI 649
+VFA ++ ++IYSS +++YS ++V+++ PF+S AFPD++ +A + + I ID
Sbjct: 674 NVFATTEHSSLIYSSEGRIIYSAATAEDVTYIAPFDSEAFPDAIVLATDKNVRIAHIDVE 733
Query: 650 QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSC-AEESEMHFVRLLDDQTFEFI-ST 707
++ H+ +PL + RR+ + + F I +++ + EE +L+D+ +
Sbjct: 734 RRTHVNPLPLRQTVRRVAYSPALKAFGIGTIRRELFNNEEMVTSSFQLVDEIVLGVVGKP 793
Query: 708 YPLD---TFEYGCSILSCSFSDDSNV---YYCVGTAYVL-PE--ENEPTKGRILVFIVE- 757
+ LD T E S++ D S + VGT+Y+ PE EN KGRILV V+
Sbjct: 794 FHLDGAATTELVESVIRAELPDSSGQPAERFIVGTSYLADPEMDENSEVKGRILVLGVDS 853
Query: 758 DGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHIL 817
D I E KGA SL KL+A +++ + +Y + + L
Sbjct: 854 DKNPYQIVSHELKGACRSLAVMGDKLVAGLSKTVVVYDYAEESSTSGSLLKLATFRPSTF 913
Query: 818 ALYVQTRGDFIVVGDLMKSISLLIY----KHEEGAIEERARDYNANWMSAVEILDDDIYL 873
+ + G+ I V DLM+S++L+ + + + ERAR + W +AV L+ D+++
Sbjct: 914 PVDLDVNGNMIGVADLMQSMTLIEFIPAQDGNKARLVERARHFQYIWATAVCHLEQDLWI 973
Query: 874 GAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPT 933
A+ NL +R+N T+ ++ ++EV+ E+HLGE +N+ R +V D P
Sbjct: 974 EADAQGNLMVLRRNPNAPTEHDKKQMEVISEFHLGEQINKIRPLDVVSGENDPIE---PK 1030
Query: 934 VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG-------GLNHEQWRSFNNEK 986
T+ G I V A + E L + Q L VIK +G GL+ WR F N K
Sbjct: 1031 AFLATIEGSIYVFADIKPEYQSLLLQFQERLAGVIKTLGQADEPGAGLSFMSWRGFRNAK 1090
Query: 987 KTVDAK-NFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
++ D F+DG+LIE FLDL R + + + + +VE + VEEL
Sbjct: 1091 RSADGPFRFVDGELIERFLDLDAGRQEAVVQGLGPTVERMRDLVEEL 1137
>gi|408393339|gb|EKJ72604.1| hypothetical protein FPSE_07241 [Fusarium pseudograminearum CS3096]
Length = 1161
Score = 397 bits (1019), Expect = e-107, Method: Compositional matrix adjust.
Identities = 297/1118 (26%), Positives = 540/1118 (48%), Gaps = 98/1118 (8%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSEL--ITRAMGDVSDRIGRPT 63
I+G I+ L+ RP G D LFI T+R ++ L W+ + +L I R + D+S+ R +
Sbjct: 55 IHGTISMLQRLRPKGSETDLLFIGTDRLQYFNLIWNDATKQLETIERVIEDLSEPYMRQS 114
Query: 64 DNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNK----GQLKEAFNIRLEELQVLDIKFLYG 119
+ ++DP R + +HL++G+ V + +L+ +RL EL + F Y
Sbjct: 115 QSQNKCLVDPTGRFLAMHLWEGVLNVFKLPTRKGSTNKLEVLDQVRLTELFMKASTFTYS 174
Query: 120 CAK-PTIVVLYQDNKDARHVK--TYEVALKDKDFVEG---PWSQNNLD-----NGADLLI 168
PTI LY+ D + Y + DK P LD A +LI
Sbjct: 175 YTNHPTIAFLYKTQMDQEETRLVIYRLTHDDKGNTVSKFDPHKDRELDIEIPDPYASMLI 234
Query: 169 PVP----------------PPLCGVLIIGEETIVYCSANAFKAIPI---RPSITKAYGRV 209
PVP P L G++++GE + Y +++ P I A+
Sbjct: 235 PVPLDEEKRYHVRNNQGAKPHLGGLVVVGETLLTYYDGQTHRSVSAGLKDPRIFVAWAEY 294
Query: 210 DADGSRYLLGDHAGLLHLLVITHEKEK----VTGLKIELL------GETSIASTISYLDN 259
D+ +RYLL D G L LL I E VTG+ +E TS AS++ YL +
Sbjct: 295 DS--TRYLLADDYGRLDLLTIRTNLEPTGVVVTGMTLEPFEFDKSPAITSRASSLVYLGD 352
Query: 260 AVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQ--------- 310
+++ S +GDSQL ++N+ A V +++ + N PI+DF ++D+ +
Sbjct: 353 GNLFVASHHGDSQLYQINIDTKA----VMLVQSFSNNAPILDFSIMDMGNREGDTQSGNV 408
Query: 311 ---GQGQVVTCSGAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLRSSTDDPFDTFLV 366
GQ ++V GAY+DG+LR +R+G+G+ ++ + EL+G +G+++L S DT LV
Sbjct: 409 FSSGQSRIVAGCGAYRDGTLRSIRSGVGLEDRGVLDELEGTRGLFTLHSYGSKMVDT-LV 467
Query: 367 VSFISETRILAMNLEDELEET-EIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSR 425
VS I+ETRIL+ +LE +EE +G ++TL + QL+Q+T SV L+
Sbjct: 468 VSSITETRILSFDLEGGIEEIYSFQGMSLDSETLLASNLPSGQLLQITPKSVVLLDPEGG 527
Query: 426 ELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDI 485
++W P G ++ A+AN+ LL+ G LV L + + + + I
Sbjct: 528 TTISKWDVPDGKTITRASANSKWALLSVDGTSLVSLNLLQNLAVNAQQINNDSTSQPDQI 587
Query: 486 NPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHL----GGEIIPRSVLLCAFE-- 539
+ I + G W+ + + + L + E + +PR V L
Sbjct: 588 SCIHAARDPPDLGVAGWWSSGQISLIDMASLKPLHGESMRQTEDSATVPRDVALVQLHPP 647
Query: 540 GIS--YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKN-TTHVFAASD 596
IS LL A+ DG+++ F ++ K ++ RK V+LG+ P L ++ T++VF ++
Sbjct: 648 NISGPTLLVAMEDGNVVTFNVSTKGFSVSGRKSVTLGSNPARLHILPQEDGTSNVFVTTE 707
Query: 597 RPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRS 656
++IYSS ++++S + + + PF+S AFPDS+ ++ + + I +D + H+++
Sbjct: 708 HASLIYSSEGRIIFSATTADDATFVAPFSSHAFPDSVVLSTDQHIRICHVDRERLTHVKA 767
Query: 657 IPLGEHPRRICHQEQSRTFAICSLKNQSC-AEESEMHFVRLLDDQTF-EFISTYPLD--- 711
+P+ E RR+ + + F + S+K + EE RL+D+ F E S + L+
Sbjct: 768 LPVNETVRRVAYSPGLKAFGLGSIKRELVNNEEVVTSSFRLVDEIVFKELGSPFALNGSS 827
Query: 712 TFEYGCSILSCSFSD---DSNVYYCVGTAYVLP---EENEPTKGRILVFIVE-DGKLQLI 764
+ E ++ D + + VGT+++ E+ T GRILV V+ + ++ I
Sbjct: 828 SLELVECVIRAELPDVGGNPTERFIVGTSFISDGGVEDPNGTLGRILVLGVDANRQVYQI 887
Query: 765 AEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTR 824
KG L ++A +++ + +Y + + LQ + + + +
Sbjct: 888 VSHNLKGPCRCLGMIEDNIVAGLSKTVVVYSFSQETSSSGSLQKLAAYRPAAIPIDIDVS 947
Query: 825 GDFIVVGDLMKSISLLIY----KHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFN 880
G+ I VGDLM+S+SL+ + + ++ERAR + + W +AV ++ + +L A++ N
Sbjct: 948 GNMIGVGDLMQSLSLVEFIPAQDGNKAQLQERARHFESLWTTAVCHIEGERWLEADSRGN 1007
Query: 881 LFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQI-PTVIFGTV 939
L +++N + T+++R RLE+ E +GE +NR R + +P ++ + P +
Sbjct: 1008 LVVLQRNVDAPTEQDRRRLEITSEMGIGEQINRIRK----LNVPAAENNIVHPRAFLASA 1063
Query: 940 NGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAK-NFLDGD 998
G + + + + L Q+ + + + +G + + WRSF N+ + + F+DG+
Sbjct: 1064 EGSLYLYGDVAPQYQDLLMTFQSRMEECVLALGNVEFKLWRSFRNDNRESEGPYRFIDGE 1123
Query: 999 LIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRLH 1036
+IE FLD+ + + + + +VE++ +EEL R+H
Sbjct: 1124 MIERFLDMDEEQQELVCDGLGPTVEDMRNLIEELRRMH 1161
>gi|425777692|gb|EKV15851.1| UV-damaged DNA binding protein, putative [Penicillium digitatum Pd1]
gi|425779888|gb|EKV17916.1| UV-damaged DNA binding protein, putative [Penicillium digitatum
PHI26]
Length = 1140
Score = 396 bits (1018), Expect = e-107, Method: Compositional matrix adjust.
Identities = 314/1096 (28%), Positives = 533/1096 (48%), Gaps = 100/1096 (9%)
Query: 18 PHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPTDNGQIGIIDPDCR 76
P D +F+ T++Y + + WD E++++ T R D+++ R + +IDP R
Sbjct: 68 PTNSPTDHIFVGTDQYNYFTITWDRETNDIKTARTCVDIAEPSSRESQCAPRCLIDPTGR 127
Query: 77 LIGLHLYDGLFKV---------------------IPFDNKGQLKEAFNIRLEELQVLDIK 115
+ L +Y+G+ V P G+L + R++EL V
Sbjct: 128 FMTLEVYEGVIVVVPIVQPTKKRGRMSMAGSQPDFPL-QVGELDKPTTSRIDELFVRSSA 186
Query: 116 FLYGCAKPTIVVLYQDNKDARHVKTYEV--------ALKDKDF-----VEGPWSQNNLDN 162
FL+ + P + +LY+DN+ ++ E+ L D F +EG LD
Sbjct: 187 FLHSESSPWLALLYEDNQQKVRLRIRELDFTPGSSGTLADATFKEVQKLEGGEFGQELDL 246
Query: 163 GADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPI---RPSITKAYGRVDADGSRYLLG 219
G+ LIP+P PL G++++GE +I Y NA I ++ A+ +VD+ R+LL
Sbjct: 247 GSSHLIPIPAPLGGLIVLGETSIKYIDDNANDVITRHLEEATVFVAWEKVDS--QRWLLA 304
Query: 220 DHAGLLHLL-VITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNL 278
D G L L I + ++ K+E LG+T+ AS + YL ++++GS +GDSQ+++L+
Sbjct: 305 DDYGRLFFLSFILNNLGEIDDWKLESLGKTARASVLVYLGGGMLFVGSHHGDSQVLRLD- 363
Query: 279 QPDAKGSYVEVLERYVNLGPIVDFCVVDL-----ERQ------GQGQVVTCSGAYKDGSL 327
G+ EV++ N+ PI+DF ++DL E Q GQ ++VT SGA+ DG+L
Sbjct: 364 -----GTSFEVIQTLSNIAPILDFTIMDLGNRTNESQTHEFSSGQARIVTGSGAFDDGTL 418
Query: 328 RIVRNGIGINEQASV-ELQGIKGMWSLRS-STDDPFDTFLVVSFISETRILAMNLEDELE 385
R +R+G+G+ + + E++ I +W L++ ST D DT L+V+F+ ETR+ + + E+E
Sbjct: 419 RSLRSGVGMEDLGVLGEMEHISDLWGLQTRSTGDYLDT-LIVTFVDETRVFQFSPDGEVE 477
Query: 386 ETE-IEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATA 444
E + G TL +++QVT + S + EW + +
Sbjct: 478 ELDNFLGLSLTESTLLATRLQGGRILQVTEQRALVADLESGMVIFEWVPQDQKLITAVST 537
Query: 445 NASQVLLATGGGHLVYLEIGDGI-LTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMW 503
N ++L G + +I D + + ++K + +IS L + P+ IA G
Sbjct: 538 NEDHLVLVISGQIVASFDIRDNVQIIKLKDLGADQQISGLTVP---STPTGVFIA--GFP 592
Query: 504 TDISVRIFSLPDLNLITKEHLG--GEIIPRSVLLCAFEGISY--LLCALGDGHLLNFLLN 559
V I + D ++ + LG GE PRSVL+ S L ++ DG ++ F L+
Sbjct: 593 QSAKVSILDIKDFAVLQTKSLGPPGESFPRSVLVAEVLANSQPTLFISMADGCVITFSLD 652
Query: 560 MKTGELTDRKKVSLGT-QPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEV 618
+ L++ K+ LG+ QPI + +VFA + P++IY S +++YS VN
Sbjct: 653 SRDCSLSEMNKLILGSEQPIFKKLPKGDGLYNVFATCENPSLIYGSEGRIIYSAVNSDGA 712
Query: 619 SHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAIC 678
S +C N+ A+PDS+A+A E EL I +D + I+++P+G RR+ + + F I
Sbjct: 713 SRVCHLNAEAYPDSIAVATEKELKIALVDRERTTQIQTLPMGSTVRRVAYSPSEKAFGIG 772
Query: 679 SL--KNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSF--SDDSNV---- 730
++ K ++ AE + HFV L D+ F + L E S++ D N
Sbjct: 773 TIDRKLENGAEVVKSHFV-LADEIMFRRLDALELGPDELVESVVRAELPAGKDENGKEIV 831
Query: 731 --YYCVGTAYVLPEENEPTKGRILVFIVEDG-KLQLIAEKETKGAVYSLNAFNGKLLAAI 787
+ VGTA+ ++ E +GRIL+ V+ G KL +AE GA +L ++AA+
Sbjct: 832 KDRFVVGTAFADEDQEESIRGRILILEVDHGRKLSQVAELPVMGACRALAMMGDCIVAAL 891
Query: 788 NQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKH--- 844
+ +Y+ + + G +L+ + + V + I V DLMKS+ L+ Y
Sbjct: 892 ---VVVYRVKINNVGPMKLEKLAAYRTSTAPVDVTVVDNLIAVADLMKSLCLIRYTPGHT 948
Query: 845 -EEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVG 903
E + E R Y W +A+ + D+ +L ++ NL + +N+ G T +++ RL
Sbjct: 949 GEPAKLTEVGRHYQTVWSTAIACVGDETFLQSDAEGNLIVLSRNTNGVTAQDKHRLIPTS 1008
Query: 904 EYHLGEFVNRFRHGSLVMRLPD-SDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQT 962
E LGE VNR R + +P V P TV G I + A + E FL LQ
Sbjct: 1009 EISLGEMVNRIRP----VHIPQLCSVMVTPRAFMATVEGSIFLFAVINPEHQDFLMTLQA 1064
Query: 963 NLRKVIKGVGGLNHEQWRSFNNEKKTVDAK-NFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
L + + +G L+ +++R F ++ A F+DG+LIE FL + + +EI++ + S
Sbjct: 1065 ALSQKLNSLGNLSFDKFRGFRTMVRSAAAPYRFVDGELIEQFLKCTPSMQEEIAQEVGSS 1124
Query: 1022 -VEELCKRVEELTRLH 1036
V E+ + +E L RLH
Sbjct: 1125 DVGEVKRLIEALRRLH 1140
>gi|402595041|gb|EJW88967.1| hypothetical protein WUBG_00126 [Wuchereria bancrofti]
Length = 621
Score = 395 bits (1016), Expect = e-107, Method: Compositional matrix adjust.
Identities = 236/579 (40%), Positives = 355/579 (61%), Gaps = 29/579 (5%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRP 62
+ P++GRIAT++LFR GE D L I T +Y +++W +SEL TRA G + DR+GRP
Sbjct: 54 ECPVFGRIATIKLFRAPGEDVDSLLILTAKYHLAIIRW-TPTSELRTRASGHIVDRVGRP 112
Query: 63 TDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGCAK 122
++ G I + ++ LYDGL K++ + N+G+ FN+R ++L ++DI F+ +
Sbjct: 113 SETGMIATVHSSGLMV-FRLYDGLLKIVQW-NEGKDLRGFNVRCDDLYIVDITFMSDPDR 170
Query: 123 PTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGE 182
PT+ +YQD+ + RH+K + + DK+ W +NL+ A+++I VP P+ G LI G
Sbjct: 171 PTLAYIYQDD-NGRHIKVVTLNIDDKELSSPLWKHDNLEGEANIVISVPEPVGGCLIAGP 229
Query: 183 ETIVYCS----ANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE 235
+ I Y A + +P + + Y VD DG RYLL D AG L++L++ K+
Sbjct: 230 DAISYHKGGDDALRYAGVPGSRLHNTHPNCYAPVDRDGQRYLLADLAGNLYMLLLELGKD 289
Query: 236 K---------VTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGS- 285
+ V +K+E LGET IA + YLDN V +IGS +GDSQLI+L+ +P A G+
Sbjct: 290 QEQDENSAVIVRDMKVESLGETCIAECMCYLDNGVCFIGSRFGDSQLIRLSTEPRADGTG 349
Query: 286 YVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQ 345
Y+ +L+ Y NL PI D V + GQ Q++TCSGAYKDG++RI+RNGIGI E ASVEL+
Sbjct: 350 YISLLDSYTNLAPIRDMTV--MRCNGQQQILTCSGAYKDGTIRIIRNGIGIEELASVELK 407
Query: 346 GIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAI 405
GIK M++LR+ D+ FD +L++SF SET +L +N E ELE+TEI GF TL+
Sbjct: 408 GIKNMFTLRTRGDE-FDDYLILSFDSETHVLFINGE-ELEDTEITGFAVDGATLWAGCLF 465
Query: 406 YNQ-LVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG 464
+++ ++QVT G V L+ + ++ WKSP ++ ++ L+ G L+YLE
Sbjct: 466 HSKTILQVTHGEVILIDGDNIQV---WKSPKWITLLNYDERSTGQLVVACGALLIYLEAN 522
Query: 465 DGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHL 524
+ + E+EISC+DI PIG+ S+I AVG WTD+SV + +LP L + +E +
Sbjct: 523 SAGFKVITQIECEFEISCIDITPIGKGTLRSEICAVGYWTDLSVALRALPQLVEVVREKI 582
Query: 525 GGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTG 563
G+++ RS++L EG YLL ALGDG + F ++MKTG
Sbjct: 583 AGDMLSRSIMLSPMEGHVYLLVALGDGTVHYFQIDMKTG 621
>gi|258572939|ref|XP_002540651.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
gi|237900917|gb|EEP75318.1| conserved hypothetical protein [Uncinocarpus reesii 1704]
Length = 1144
Score = 395 bits (1016), Expect = e-107, Method: Compositional matrix adjust.
Identities = 314/1113 (28%), Positives = 555/1113 (49%), Gaps = 105/1113 (9%)
Query: 6 IYGRIATLE-LFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPT 63
+YG+I+ L+ L RP+ D LF+ T+R + L W++ + +L T + D++D R +
Sbjct: 55 VYGKISVLQKLPRPNPSETDLLFVGTDRDAYFTLSWNSATGQLHTEQKYVDMADPSLRDS 114
Query: 64 DNGQIGIIDPDCRLIGLHLYDGLFKVIPF------------------DNKGQLKEAFNIR 105
+G +DP + + L +Y+G+ VIP + L E R
Sbjct: 115 QSGDRSWVDPSGKFLTLEIYEGIITVIPIAQEPLKRSSLSGPSLGPPKERAHLGEPVQAR 174
Query: 106 LEELQVLDIKFLYGCAK--PTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWS------- 156
+EEL + FL+ + P I +LY+ +K ++ + + V G S
Sbjct: 175 IEELAIRSTAFLHQDSSRVPRIAILYESTDGRVKLKLRDL-IYTRGVVNGEASVAEFHNV 233
Query: 157 ---QNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS--ANAFKAIPIR-PSITKAYGRVD 210
+NL+ GA++L+PVP PL G+LI+GE+ I Y +N +P+ ++ A+ ++D
Sbjct: 234 DELYDNLELGAEILVPVPLPLGGILILGEKCIKYVDTISNETITLPLEYNTVFVAWEQLD 293
Query: 211 ADGSRYLLGDHAG-LLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYG 269
R+LL D G L L+++ V K++LLGETS AS + +L VV++GS G
Sbjct: 294 --NQRWLLADDYGRLFFLMLVLDSANAVRTWKVDLLGETSRASVLVHLGGGVVFLGSHQG 351
Query: 270 DSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQG----------QGQVVTCS 319
DS +I++ +GS+ E+++ N+ PI+DF V+DL +G Q ++VT S
Sbjct: 352 DSHVIRIT-----EGSF-EIIQTLSNIAPILDFTVMDLGHRGDTLTHEFSSGQARIVTGS 405
Query: 320 GAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAM 378
GA+ DGSLR VR+G+G+ + + ++ I +W L + + L+++F++E+R+
Sbjct: 406 GAFHDGSLRSVRSGVGMEDLGVLGAMEHITDLWGLSAFCAEENCDTLLLAFVNESRVFHF 465
Query: 379 NLEDELEET-EIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGY 437
+ + E+EE E G TL + +++QVT + R+ S + W SP G+
Sbjct: 466 SPDGEVEEKDEFLGLLLGETTLHASNLAGCRILQVTERTARITDVESELVIWHW-SPSGH 524
Query: 438 S-VNVATANASQVLLATGGGHLVYLEIGDGI-LTEVKHAQLEYEISCLDINPIGENPSYS 495
+ A N ++L GG V +I I ++ K + ++S + + +P+ +
Sbjct: 525 QKITAAAVNEQYLVLMIGGQEAVIFDIASDIQVSGPKTFKANKQVSGVTLT---SSPAQA 581
Query: 496 QIAAVGMWTDISVRIFSLPDLNLITKEHLG--GEIIPRSVLLCAF--EGISYLLCALGDG 551
I +IS I +L DL + E LG G+ +PRSVL+ L ++ DG
Sbjct: 582 CIFCFPQSAEIS--IVNLTDLTIRHTETLGEPGDAVPRSVLVANMIPSKPPSLFVSMADG 639
Query: 552 HLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTT-HVFAASDRPTVIYSSNKKLLY 610
+ +F LN + L++ K+ LG++ + + +VFA D P++IY+S +++Y
Sbjct: 640 SVFSFSLNAEDYSLSNANKLVLGSEAPVFKLLPRGDGLFNVFATCDHPSLIYASEDRIVY 699
Query: 611 SNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQE 670
S VN + + +C FN+ A+P ++A+A E+ I +D + I+++ + RRI +
Sbjct: 700 SAVNSDKATRICHFNAEAYPGAIAVATPDEIKIALVDAERTTQIQTLMINGTVRRIAYSA 759
Query: 671 QSRTFAICSLKNQ--SCAEESEMHFVRLLDDQTFEFISTYPLDTFEY-GCSILS--CSFS 725
R F + +++ AEE + +F+ L D+ F +S + L++ E C I + +F+
Sbjct: 760 AERAFGLGTVRRSLVQNAEEVKSYFI-LSDEIMFRQLSVFDLNSNELVECVIRTEHPAFN 818
Query: 726 DDSNV-----YYCVGTAYVLPEE--NEPTKGRILVFIVE-DGKLQLIAEKETKGAVYSLN 777
N + VGT+ + P E TKGRIL+F V + +L++++E +GA +L
Sbjct: 819 GQQNNGRPKDIFIVGTSVLDPAEAPESQTKGRILIFDVGVNRELRMVSEFPVRGACRALA 878
Query: 778 AFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSI 837
NGK++AA+ + + + + + ++ E + + + + IVV DLMKSI
Sbjct: 879 MVNGKIVAALMKSVVILSMKKGNSYSIDIGKESSYRTSTAPVDLSVTDNIIVVADLMKSI 938
Query: 838 SLLIYKHEEG----AIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATD 893
SLL Y+ E +++E AR Y W + + ++ +L ++ NL + +N+ G T+
Sbjct: 939 SLLEYQAGEAGQPDSLKEVARHYQTLWTTTAAPIAENAFLVSDAEGNLVVLNRNTTGVTE 998
Query: 894 EERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQ-IPTVIFGTVNGVI---GVIASL 949
+++ R+++ E LG VNR R M L S IP T +G I GVIA
Sbjct: 999 DDKRRMQITSELRLGTMVNRIRR----MDLQASQSSPVIPKAFLATTDGSIYLFGVIAQF 1054
Query: 950 PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAK-NFLDGDLIESFLDLSR 1008
+ L +LQ+ L + GG+ +R+F + + D F+DG+L+E FLD
Sbjct: 1055 AQD---LLMRLQSALASFVASPGGIPFSGYRAFKSATRQADEPFRFVDGELVEQFLDCPL 1111
Query: 1009 TRMDEISKTM-----NVSVEELCKRVEELTRLH 1036
+ + M +V++ +L VE L R+H
Sbjct: 1112 EVQEAVLAKMDGGGRDVTLSQLKDIVERLKRMH 1144
>gi|346970653|gb|EGY14105.1| hypothetical protein VDAG_00787 [Verticillium dahliae VdLs.17]
Length = 1160
Score = 387 bits (993), Expect = e-104, Method: Compositional matrix adjust.
Identities = 308/1131 (27%), Positives = 533/1131 (47%), Gaps = 124/1131 (10%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPTD 64
+ G I L+ RP D LF+ T+ +++ +WD +L+T + + D+++ R
Sbjct: 54 VCGTITLLQTLRPKASETDILFVGTDLFQYFTARWDPAQGKLVTEQVIQDIAEPHMREAQ 113
Query: 65 NGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQL-----KEAFNIRLEELQVLDIKFL-Y 118
+ ++DP R + +HL++G+ ++ + K+ +RL EL + F+
Sbjct: 114 SQDKCLVDPAGRFLAMHLWEGVMNIMQLGTRKGANFRLDKDWAQVRLSELFMKASTFVPT 173
Query: 119 GCAKPTIVVLYQ---DNKDARHVKTYEVALKDKDFVE--GPWSQNNL-----DNGADLLI 168
PTI LYQ D +DA H+ Y + V P L D A LI
Sbjct: 174 ETGHPTIAFLYQSSIDREDA-HLAVYRLMEDGNTNVSKFDPLKNRELELEIPDPFARTLI 232
Query: 169 PVP------------------PPLCGVLIIGEETIVYC----SANAFKAIPIRPSITKAY 206
PV L G+++ GE ++Y KA+ P I ++
Sbjct: 233 PVSIVESDVKRYHRRDTTNASAQLGGLIVAGETMLIYVDTLTKVKISKALD-EPRIFVSW 291
Query: 207 GRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGS 266
+ D +RYLL D G LHLL + + VTGL ++ +G+TS AS + Y+ N ++++GS
Sbjct: 292 AKYDV--TRYLLADDYGNLHLLTLEVDGVIVTGLSLKTIGKTSRASCLVYMGNEILFLGS 349
Query: 267 SYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLER-----------QGQGQV 315
+GDSQL L+L + +++ N+ PI+DF ++DL GQ ++
Sbjct: 350 HHGDSQLFTLDL----CAHTIRLIQTIPNIAPILDFSIMDLGNAGDSGVGNAFSSGQARI 405
Query: 316 VTCSGAYKDGSLRIVRNGIGINEQASVE-LQGIKGMWSLRSSTDDPFDTFLVVSFISETR 374
V G + +GSLR +R+ +G+ + ++ +Q ++G++SL+S + DT LVVSF++ETR
Sbjct: 406 VAGCGVHHNGSLRSIRSSVGLEDIGILDDIQDVRGLFSLKSYGSEKVDT-LVVSFLTETR 464
Query: 375 ILAMNLEDELEE-TEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKS 433
+ + + +EE T +G TLF +Q+T+ S L S S N W S
Sbjct: 465 VFKFDADGGVEELTAFQGLTLDQPTLFAGSLANGHTLQITASSALLQDSESGVTINSWTS 524
Query: 434 PPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLE-------YEISCLDIN 486
P G S+ A+ N LL+ GG LV L + +V+ L +ISC
Sbjct: 525 PNGGSIVNASVNEKYALLSVGGSTLVSLNLTQNF--DVREQALGNGIGGDGSQISC---- 578
Query: 487 PIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEII---------PRSVLLCA 537
I + + I A+G W+ SV + +NL T + L GE I PR +
Sbjct: 579 -IHASRDFPDIGAIGFWSTGSVSV-----INLGTLQALHGESIKQNDDNVSVPRDLAFVQ 632
Query: 538 FE----GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTF---SSKNTTH 590
L +L DG +++ ++ LT RK V+LG++ L + ++
Sbjct: 633 LHPPQLAGPTLFVSLEDGQVVSINVSKDDYSLTARKSVTLGSREAGLHVLPRPGAPGQSN 692
Query: 591 VFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQ 650
VFA ++ ++IYSS +++YS ++V+++ PF+S FPD++ +A + + I +D +
Sbjct: 693 VFATTEHSSLIYSSEGRIIYSAATAEDVTYIAPFDSEGFPDAIFLATDKNVRIANVDTER 752
Query: 651 KLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPL 710
+ H+ + +GE RR+ + + F I +++ + +E + L D+ P
Sbjct: 753 RTHVNPLHIGETVRRVAYSPALKAFGIGTIRKELLHDEEVVSSAFQLVDEIVLGKVGRPF 812
Query: 711 -----DTFEYGCSILSCSFSDDSNV---YYCVGTAYVL-PEENEP--TKGRILVFIVE-D 758
+ E +++ D + + +GT+Y+ P+ NE KGRILV V+ D
Sbjct: 813 ALGGEASVELVEAVIRAELDDSTGQPAERFIIGTSYLADPDVNESGDVKGRILVLGVDSD 872
Query: 759 GKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILA 818
LI E KGA L KL+A +++ + +Y ++ + +L+
Sbjct: 873 RNPYLIVSHELKGACRCLGVMGDKLVAGLSKTVVVYDYIEDSTTSGKLEKLTTFRPSTFP 932
Query: 819 LYVQTRGDFIVVGDLMKSISLLIY----KHEEGAIEERARDYNANWMSAVEILDDDIYLG 874
+ + G+ I V DLM+S++L+ + + + ERAR + W +AV L ++ +L
Sbjct: 933 VDLDISGNMIGVADLMQSMTLVEFLPAKDGRKAKLIERARHFEYIWATAVCSLGEESWLE 992
Query: 875 AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTV 934
A+ NL + + + T+ ++ ++ E HLGE VN+ R L + ++D+ +P
Sbjct: 993 ADAQGNLMILERQPDAPTEHDQKQMRTTSEMHLGEQVNKIR--PLQITATENDI-IVPKA 1049
Query: 935 IFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVG--------GLNHEQWRSFNNEK 986
TV+G + V+A++ E L Q L +++ +G G + QWR F N K
Sbjct: 1050 FLATVDGSLYVLANISAEYQSILLPFQERLAGIVRYLGQAAPEDNEGPSFSQWRGFRNAK 1109
Query: 987 KTVDAK-NFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRLH 1036
+ A F+DG+LIE FLDL R + + + + SVE + VEEL R+H
Sbjct: 1110 RMAAAPFRFVDGELIERFLDLDELRQEAVVEGLGPSVEAMRNMVEELRRMH 1160
>gi|393212467|gb|EJC97967.1| hypothetical protein FOMMEDRAFT_162310 [Fomitiporia mediterranea
MF3/22]
Length = 1161
Score = 382 bits (981), Expect = e-103, Method: Compositional matrix adjust.
Identities = 308/1085 (28%), Positives = 518/1085 (47%), Gaps = 103/1085 (9%)
Query: 6 IYGRIATLELFRPHGE-AQDFLFIATERY--KFCVLQW----DAESSELITRAMGDVSDR 58
I+GRI +L+ E +Q L + T+ K +L++ L T + +R
Sbjct: 56 IWGRITSLQAVPTDDENSQHHLLVLTDHPDPKLFLLEYVQSNAGSGPSLKTLKTVSLHER 115
Query: 59 IGRPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLY 118
RPT+ ++D ++ Y G +V+ + KG + F+ + EL ++ + FL
Sbjct: 116 NARPTEYVSKCLVDHKGKVAVACSYTGKLRVLELE-KGLINSEFDTAVRELNIVSLCFLR 174
Query: 119 GC-AKPT-IVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQ----NNLDNGADLLIPVPP 172
A+ T + +LY+D V +++++L D + P S + D ++LLIPVPP
Sbjct: 175 TTNARATALAILYKDYMQKLSVTSHDLSLADLELSPSPSSFLPEFHVPDEDSNLLIPVPP 234
Query: 173 PL-------CGVLIIGEETIVYCSAN--------------AFKAIPIRP------SITKA 205
+ GVL++G TI + S + + IP IT A
Sbjct: 235 QIKSSWNVNGGVLVLGGSTIAFYSIDRKQKKKNSSSQSKSSTSKIPQAEVNWPYFDIT-A 293
Query: 206 YGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIG 265
+ ++D DG RYLLGD G L LL I + L I LLGE S ++++ L + +Y+G
Sbjct: 294 WAQIDEDGLRYLLGDSFGRLALLAI---NPQYAYLDIVLLGEVSPPTSLTPLASQYIYVG 350
Query: 266 SSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDG 325
S +GDSQLI++ + + GSY+E+ + + N+ PI+D D + GQ ++TCSG G
Sbjct: 351 SHFGDSQLIRVTSERSSNGSYLEISDTFKNIAPIMDAVFEDTDDSGQPTIITCSGGESTG 410
Query: 326 SLRIVRNGIGINEQASVE-LQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAM-NLEDE 383
SLR++RNG NE A +E + I GMW +R DD F +++V+ + T +L + N + E
Sbjct: 411 SLRVIRNGANFNEDARIEGIANITGMWPIRRQYDDTFHHYMLVTTDTNTHLLELPNSQQE 470
Query: 384 LEETEIEGFCSQT---QTLFCHDAIYNQLVQ--------------VTSGSVRLVSSTSRE 426
+ F T +TL + + + + V+ SV L+++ +
Sbjct: 471 TAVSRSNDFSDLTIDSRTLVAGNMLTRLMSESGKSEYVSSSYVVQVSRDSVILLNTRTGL 530
Query: 427 LRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQL------EYEI 480
++W PG + +A + SQ+ +A GG +V L + + E Q EI
Sbjct: 531 REDQWSPGPGNKIVLADISPSQICVAISGGTVVLLNLFADKINEQSRKQFYSPDGSSSEI 590
Query: 481 SCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAF-- 538
S L I+P+ ++S A+G W+ V++ LP I + +PRS+LLC F
Sbjct: 591 SALSISPMKRGANFSSFVALGFWSSHEVKVLRLPTFEQIDAVPVVMPHLPRSLLLCDFSE 650
Query: 539 ---EGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAAS 595
+ YL+ L +G +++ K G L ++K LG P++L + VFA+
Sbjct: 651 EESKPHRYLMVGLANGTVVSMPFAEK-GVLGEKKFFGLGGAPVSLSRCEVNDKPAVFASG 709
Query: 596 DRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIR 655
R + Y S L +S V +K+V+ +S +F S+A++ L +G + ++ KLHIR
Sbjct: 710 ARSALFYRSKDTLSHSPVLIKDVTFASAIHSESFTSSMALSTLDGLVVGRVLELDKLHIR 769
Query: 656 SIPLG-EHPRRICHQEQSRTFAICSLK------NQSCAEESEMHFVRLLDDQTFEFISTY 708
++ +G E+P ++ + ++ + LK + S V +D +T IS
Sbjct: 770 TVFMGKENPTKLAYHSAAKLLGVGCLKLDRSTFKGAPVAASSFKLVDTVDFRTTHEISLE 829
Query: 709 PLDTFEYGCSI-LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQ---LI 764
P + + L C D+ ++ VG+ Y E EP++GRIL+ + Q ++
Sbjct: 830 PNEEITAVALVSLDCGIGWDT--FFAVGSVYFDETEREPSRGRILIISTGSKRNQTPHIL 887
Query: 765 AEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTR 824
A E KGAV +L GKL+ AIN + +++ D+ + H+ ++ V
Sbjct: 888 ASTEVKGAVNALTCIQGKLVVAINTSVDVFRLKHGDNTVLTAVTSWNHNYLVITAVVM-- 945
Query: 825 GDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTV 884
D IV+GD + S+++L K E+ + ARDY+ W + D+ +GA N+ NLF+
Sbjct: 946 DDLIVIGDAVSSLAVL--KLEDDKLTTFARDYSPLWPLCIGAFDNKTVIGANNDNNLFSY 1003
Query: 885 RKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIG 944
R +G+ + L+ G Y++ E VN+ G P+ Q ++F T G I
Sbjct: 1004 RLQKQGS----KMLLDQDGLYNIEEIVNKCVPGCF-DSTPNKFGLQTKQLLF-TSTGRIA 1057
Query: 945 VIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAK----NFLDGDLI 1000
V+ ++ + L L+ N+ VIKG G L+H WR+ +N + DA FLDGDL+
Sbjct: 1058 VVVNVEDDLSRLLSSLERNMAGVIKGPGDLDHTLWRAPSNSRGRSDANGAATGFLDGDLL 1117
Query: 1001 ESFLD 1005
E FLD
Sbjct: 1118 ERFLD 1122
>gi|400600376|gb|EJP68050.1| CPSF A subunit region [Beauveria bassiana ARSEF 2860]
Length = 1174
Score = 382 bits (980), Expect = e-103, Method: Compositional matrix adjust.
Identities = 304/1129 (26%), Positives = 545/1129 (48%), Gaps = 111/1129 (9%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELI--TRAMGDVSDRIGRPT 63
++G I L+ RP D LF+ TER ++ + W+ +S+E++ + D+++ R +
Sbjct: 59 VHGTICMLQRLRPRDATTDLLFVGTERLQYFNVSWNPKSNEMVLTNETIHDMAEPYMRES 118
Query: 64 DNGQIGIIDPDCRLIGLHLYDGLFKVIPFDN----KGQLKEAFNIRLEELQVLDIKFLYG 119
+ ++DP + + +HL++G+ + N +LK+ IRL EL + D F++
Sbjct: 119 QSQNKCLVDPTAKYMVMHLWEGVLNIFRLPNTPRASKKLKKMEQIRLTELWMKDSAFIHS 178
Query: 120 -CAKPTIVVLYQ---DNKDARHVKTYEVALKDK--DFVE-GPWSQNNLDN-----GADLL 167
P I LY+ D ++AR V Y + DK D + P LD A +L
Sbjct: 179 RTGHPRIAFLYKTQLDQEEAR-VAVYRLTTDDKGTDVSKFDPHKDRELDQVIADPYASML 237
Query: 168 IPVP----------------PPLCGVLIIGEETIVYCSANAFKAIPI---RPSITKAYGR 208
IPVP L G+L++GE + Y + + ++ P I A+
Sbjct: 238 IPVPVAEEKRYHVRNTEGTRAHLGGLLVVGETLLTYFDSLTYSSVSSALDEPKIYVAWAM 297
Query: 209 VDADGSRYLLGDHAGLLHLLVI-THEKEK---VTGLKIELLGET------SIASTISYLD 258
D + Y L D G L L+ + T+ + K VTG+ +E L + S AS + Y+
Sbjct: 298 YD--DTHYFLADDYGRLDLMTLKTYTEPKGVIVTGMTVEPLKISDSRELISRASNLVYMG 355
Query: 259 NAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQ-------- 310
+ ++++ S +G+SQL++++L D++ Y+ ++ N GPI+DF ++D+ +
Sbjct: 356 DGMLFLASHHGNSQLLRIDL--DSRTLYI--IKSLSNNGPILDFAIMDMGNRDGDNHQGN 411
Query: 311 ----GQGQVVTCSGAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLRSSTDDPFDTFL 365
GQ ++V GAY+DGSLR +R+G+G+ + + EL+ +G++ ++S D L
Sbjct: 412 VFSSGQTRIVAGCGAYRDGSLRSIRSGVGLEDCVILDELENTRGLFMIKSRGSRLVDV-L 470
Query: 366 VVSFISETRILAMNLEDELEET-EIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTS 424
V SF++ETRI M LEDE+EE + G + +TL L+QVT V +
Sbjct: 471 VASFVTETRIFLM-LEDEVEELYDWCGLNLEVETLIAATLPGYDLLQVTRTDVAIFHFDD 529
Query: 425 RELRNEWKSPPGY------SVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEY 478
R + W + + A+ N VLL G +V L I + V+ Q
Sbjct: 530 RPEIDSWVKTKSWVNESEGEITSASYNDKWVLLCIDGTTIVSLNIEKNLEACVQRYQGTD 589
Query: 479 EISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEI----IPRSVL 534
++ + +P VG W + I L LN E+L + +PR +
Sbjct: 590 PSQSDQVSCLHASPDVMDFGVVGWWNSGKITIVDLATLNAKHGENLQQTMDAASVPRDIA 649
Query: 535 LCAFE--GIS--YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTF-----SS 585
L IS LL AL DG + +F ++++ EL+ +K V+LG+ P L +S
Sbjct: 650 LVQLHPPDISGPTLLVALEDGTVTSFDVSVENLELSGKKTVTLGSSPARLHILPEPDDTS 709
Query: 586 KNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGT 645
T+ +FA ++ ++IYSS +L+YS + + PF+S FP + ++ + + I
Sbjct: 710 DGTSSIFATTEHSSLIYSSEGRLVYSATTADNATCVVPFDSYGFPHCILVSTDKNVRICR 769
Query: 646 IDDIQKLHIRSIPLGEHPRRICHQEQSRTFAI-CSLKNQSCAEESEMHFVRLLDDQTFEF 704
+D + H++S+P+ E RR+ + ++ FA+ C K+ E V+L+D+ F+
Sbjct: 770 VDKERLTHVKSLPVHETVRRVAYAPGAKAFALGCIKKDLIQNAEVITSSVKLVDEIMFQE 829
Query: 705 IST-YPL---DTFEYGCSILSCSFSDDSNVY---YCVGTAYV----LPEENEPTKGRILV 753
+ T PL T E S++ D + + VGT++V + E E T+GRILV
Sbjct: 830 LGTPLPLAASSTLEMVESVIRAELPDPTGALVERFVVGTSFVNDAEVGEAGE-TRGRILV 888
Query: 754 FIVEDGK-LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGH 812
V++ + L I KGA L + ++A + + + +Y + + LQ +
Sbjct: 889 LGVDEKRQLYTIVSHNLKGACRCLGILDEYIVACLAKTVVVYSYTEENSTEGYLQKLAAY 948
Query: 813 HGHILALYVQTRGDFIVVGDLMKSISLLIYKH----EEGAIEERARDYNANWMSAVEILD 868
+ + G+ I V D+M+S+SL+ + E G +EE+AR + + W ++V L
Sbjct: 949 RPASFPVALDISGNMIGVADIMQSLSLVEFTPPKDGEPGKLEEKARHFQSAWATSVCHLG 1008
Query: 869 DDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV 928
+ +L + N+ + +N + T+ +R RLE+ E +LGE +N+ + ++ P +V
Sbjct: 1009 GERWLETDAQGNIIVLARNPDAPTEHDRSRLEITSEMNLGEQINKIQRLNVA---PADNV 1065
Query: 929 GQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKT 988
P ++ G + + + + L LQT + K +K GG++ + WR+F N+ +
Sbjct: 1066 VVSPKAFLASIEGTLYLYGDIAPKYQDLLITLQTTIEKYVKTTGGISFDAWRAFRNQARE 1125
Query: 989 VDAK-NFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRLH 1036
D F+DG+++E FLDL + + + + ++VE++ +EEL R+H
Sbjct: 1126 ADGPFRFVDGEMVERFLDLRKQTQAALCQDLGLNVEDVRNMIEELRRMH 1174
>gi|302655264|ref|XP_003019424.1| hypothetical protein TRV_06556 [Trichophyton verrucosum HKI 0517]
gi|291183145|gb|EFE38779.1| hypothetical protein TRV_06556 [Trichophyton verrucosum HKI 0517]
Length = 1027
Score = 377 bits (968), Expect = e-101, Method: Compositional matrix adjust.
Identities = 291/993 (29%), Positives = 482/993 (48%), Gaps = 103/993 (10%)
Query: 6 IYGRIATLELFRPHGEA-----QDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRI 59
IYGR+ L+ P A D LF+ T++Y + + WD +L T R D++D
Sbjct: 55 IYGRVTLLKKL-PRSTAGGLALTDALFVGTDQYAYFSITWDPVHEQLRTERKYIDLADGS 113
Query: 60 GRPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKG---------------------QL 98
R + IDP + L +Y+G+ + P QL
Sbjct: 114 LREAHSDDRCQIDPSGSFLTLEVYEGVVSIFPLVTADSHKRAKSAVSASASSASTSVEQL 173
Query: 99 KEAFNIRLEELQVLDIKFLYGCAK--PTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWS 156
E +R+EEL V FL A P +LY+D + +K ++ P S
Sbjct: 174 GEPLQVRIEELMVRSSAFLDQEASNTPRFALLYEDTQGKVKLKLRDLKYTHAIITGDPGS 233
Query: 157 QNNL----------DNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIRP----SI 202
L D GA +LIPVP PL G+LIIGE +I Y + + I RP ++
Sbjct: 234 AAELKDVTTLSDEVDLGASILIPVPRPLGGLLIIGESSIKYVDVSRNETIS-RPLAESTV 292
Query: 203 TKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEK-VTGLKIELLGETSIASTISYLDNAV 261
A+ +VD G R+LL D G L L++ + E V K++ LG TS AS + YLD +
Sbjct: 293 FVAWEQVD--GQRWLLADDYGRLFFLMLVLDAENAVDTWKVDFLGVTSRASVLVYLDGGI 350
Query: 262 VYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQ---------GQ 312
V++GS GDSQ+I++ +G + ++++ N+ PI+DF V+D+ + GQ
Sbjct: 351 VFVGSHQGDSQVIQIK-----EGGF-DLVQTISNIAPILDFTVMDMGDRSGATREFSSGQ 404
Query: 313 GQVVTCSGAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLRSSTDDPFDTFLVVSFIS 371
++VT SGA+ DGSLR VR+G+GI E + ++ I +W+LRS+ +PF L+VSF++
Sbjct: 405 TRIVTGSGAFGDGSLRSVRSGVGIEELGVLASMEHITDLWTLRSACPEPFLDTLLVSFVN 464
Query: 372 ETRILAMNLEDELEETE--IEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRN 429
ETR+ + + ++EE E G TL + N+++QVT + R + S +
Sbjct: 465 ETRVFHFSADGDVEEKEDGFLGLVFSQSTLLATNIPGNRIIQVTESTSRAIDVDSGMII- 523
Query: 430 EWKSP-PGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDINPI 488
W+S +++ A+AN ++L GG LV + + L + + + ++S + I
Sbjct: 524 -WRSSYEEFTITSASANDDYLVLVLGGIRLVCMSLSTFELVGSRDFEADNQVSGMTIPA- 581
Query: 489 GENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLG--GEIIPRSVLLCAF--EGISYL 544
S +Q V + + I LP+L + K+ LG GE IPRSV++ L
Sbjct: 582 ----SPTQACIVCLPQSAEIIILDLPELEVKNKQALGEPGEAIPRSVIVAEILPNKPPTL 637
Query: 545 LCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTH-VFAASDRPTVIYS 603
++ DG + +F ++ L + K++LG++ + + N + VFA D P++I++
Sbjct: 638 FVSMADGTVFSFSFDVHAFSLFNSSKITLGSEQPSFKELPRGNGQYNVFATCDHPSLIHA 697
Query: 604 SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHP 663
S +++YS V+ S +C N+ A+P S+A++ + EL I +D+ + I ++P+
Sbjct: 698 SEGRIVYSAVDSASASRICSLNTQAYPGSIALSSQHELKIAIVDEERTTQIHTLPMHASV 757
Query: 664 RRICHQEQSRTFAICSLKNQ--SCAEESEMHFVRLLDDQTFEFISTYPL----------- 710
RR+ + + F + ++K + + EE FV L D+ F +STY L
Sbjct: 758 RRLAYSPMEKAFGLGTVKRKISNGVEEVSSSFV-LADEMLFRPLSTYDLRPDELVECVIR 816
Query: 711 DTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGK-LQLIAEKET 769
YG + S S D + VGTA++ ++ +GRIL+F V + L LI EK
Sbjct: 817 SQLNYGKDEVGNSISKD---LFFVGTAFLDDVGDDHIRGRILIFEVNRSRELSLIVEKSL 873
Query: 770 KGAVYSLNAFNGKLL-AAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFI 828
GA +L + LL A + + + ++K G L+ + + + GD +
Sbjct: 874 MGACRTLAVMDHTLLVAGLVKSVSVFKLARDRFGNILLEKHTAYRTSTAPIDISVVGDTV 933
Query: 829 VVGDLMKSISLLIYKH-----EEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFT 883
V D+MKS+SL+ Y +E EE AR Y W +AV +++++YL A+ NL
Sbjct: 934 AVADVMKSMSLVQYTQPEEGEQEPKFEEVARHYQTLWSTAVAPIEENVYLLADAEGNLVV 993
Query: 884 VRKNSEGATDEERGRLEVVGEYHLGEFVNRFRH 916
+++N G T+ +R RL+ E LGE + H
Sbjct: 994 LQQNITGVTESDRKRLQPTSEIRLGEMLFLHEH 1026
>gi|440487047|gb|ELQ66855.1| DNA damage-binding protein 1a [Magnaporthe oryzae P131]
Length = 1213
Score = 376 bits (965), Expect = e-101, Method: Compositional matrix adjust.
Identities = 321/1142 (28%), Positives = 528/1142 (46%), Gaps = 125/1142 (10%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITR-AMGDVSDRIGRPTD 64
++G+I L+ RP D LF+ T+R+K+ ++D ++ EL+TR A+ D+ ++ R
Sbjct: 86 VFGKIVMLQAVRPKDSETDMLFVGTDRFKYFTAEYDPDTRELVTRQAISDLGEQFVREVS 145
Query: 65 NGQIGIIDPDCRLIGLHLYDGLFKVIPF-DNKGQLKEAFNIRLE--------ELQVLDIK 115
+ I+DP R + L L+ G+ V KGQ K RLE EL + D
Sbjct: 146 SRNRCIVDPSGRYMVLLLWSGIMHVWRLHKRKGQQKGQLQTRLELMDQARISELYIKDAV 205
Query: 116 FLYG-CAKPTIVVLYQDNKDARHVK--TYEVALKDKDFVEGPWSQ-------NNLDNGAD 165
FL+ A P I LYQ + K +Y + D+D + Q N D G+
Sbjct: 206 FLHSETAHPRIAFLYQPRPNEPDCKFASYRLCTDDRDRELSRFEQKDREFTFNVPDPGST 265
Query: 166 LLIPVP------------PP-----LCGVLIIGEETIVYCSANAFKAIPI---RPSITKA 205
++IPV P L GV+++GE ++Y ++ + A
Sbjct: 266 MMIPVERVETERRHNFRNPARDECHLGGVIVVGESRMLYIDDQSWTWTETALKNAMVFVA 325
Query: 206 YGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTG---LKIELLGETSIASTISYLD-NAV 261
+ + D + YLL D G LHLL I ++ T + +G TS A+ + Y + N
Sbjct: 326 WAKFD--NTHYLLADDYGGLHLLTIQVKQNSDTAVDHMSTVQIGTTSRATKLVYSETNRT 383
Query: 262 VYIGSSYGDSQLIKLNLQPDA-KG-SYVEVLERYVNLGPIVDFCVVDLERQ--------- 310
+++ S YGDSQ +NL DA KG S++E+ + N+ PI+DF V+D+ +
Sbjct: 384 LFVASHYGDSQFYDVNLFADAAKGESFLELRQTIENIAPILDFAVMDMGNREGDSQLGNE 443
Query: 311 ---GQGQVVTCSGAYKDGSLRIVRNGIGINEQASV--ELQGIKGMWSLRSSTDDPFDTFL 365
GQ ++VT SGA KDGSLR VR+G+G+ + + E+ G+ G++SL+S D DT L
Sbjct: 444 YSSGQARIVTASGAQKDGSLRSVRSGVGLEDIGVITDEISGVTGLFSLKSYGSDVEDT-L 502
Query: 366 VVSFISETRILAMNLEDELEE-TEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTS 424
VVSF++ETR+ + + E+EE ++++G TL ++ VT L +
Sbjct: 503 VVSFLTETRVFRFDKQGEVEELSQLQGLDISQPTLLVLGLDNGHVLYVTEEKATLFDAEG 562
Query: 425 RELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTEVKHAQLEYEISCL 483
+ W G + A++N VLL+ G LV L IG D ++ + E +ISC
Sbjct: 563 GVTISSWSPTSGKPITHASSNGRWVLLSVDGRKLVSLNIGLDLKVSAESEERDEDQISC- 621
Query: 484 DINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHL----GGEIIPRSVLLC--- 536
+ +P + AVG W+ ++ I L L E L ++ R V+L
Sbjct: 622 ----VNASPHLLDVGAVGFWSSGTISIIDLKTLEATQTEKLRRNEDDAVVAREVVLARVL 677
Query: 537 -AFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKN-TTHVFAA 594
A L + DG ++ F+ N G L+ RK V LGT+ R N +F
Sbjct: 678 PAEVANPTLFVSKDDGEVMTFVYN-DNGTLSSRKSVVLGTREARFRVLPQPNGLCSIFVT 736
Query: 595 SDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEG------ELTIGTIDD 648
+ ++I+ + ++++YS V +++CPF++AAF D LA+A E EL I ID
Sbjct: 737 CEHSSLIHGAERRIVYSAVTAHSAAYVCPFDTAAFRDCLAVATESAIDRRMELKISRIDR 796
Query: 649 IQKLHIRSIPLGEHPRRICHQEQSRTFAI-CSLKNQSCAEESEMHFVRLLDDQTFEFI-S 706
++ + + P+GE+ R I + + F + C + S E +L D+ FE +
Sbjct: 797 QRQCQMMTRPMGENVRSIAYSSADKVFGLGCIRRVLSRGIEKVYGTFKLFDEVIFEPKGN 856
Query: 707 TYPLDTFEYGCSILSCSFSD---DSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQ- 762
+ L+ E + D + + VGT Y L GR+LVF V++ +
Sbjct: 857 VFALEDGEVPECVTRAPLLDSYGEQAERFIVGTRY-LSGTGSGHGGRVLVFGVDESRSPY 915
Query: 763 LIAEKETKGAVYSLNAFNGKLLA-AINQKIQLYKWMLRDDGTRELQSECGHHGHILALYV 821
LI TK + + LL A+ + + L ++ + + A+ V
Sbjct: 916 LIHAHSTKSGCRRIATMDDDLLVIALTKTVVLVRYSETSTTSAKFLKVAAFQTSSYAVDV 975
Query: 822 QTRGDFIVVGDLMKSISLLIY--------------------KHEEGAIE----ERARDYN 857
G I V D+MKSI+LL Y K EG+ + E RDY
Sbjct: 976 TVHGKLIAVADIMKSITLLEYIPGVGKSAKTGGKDKATRSDKEVEGSKQAKLVEVCRDYQ 1035
Query: 858 ANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHG 917
A W +AV L+ D ++ A+ + NL + +N+ G T E++ R+++ E+ LGE VN+ +
Sbjct: 1036 AMWSTAVSHLEGDSWIVADGDGNLVVLLRNTAGVTLEDKRRMQMTSEFGLGECVNKIQK- 1094
Query: 918 SLVMRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRK-VIKGVGGLN 975
VM ++ + T G I + ++ P Q L ++ Q N+ V +G L
Sbjct: 1095 --VMVETSANAPIVAKAFLSTTEGSIYLFGTVAPKFQSLLMD-FQANMEAHVSSPLGELQ 1151
Query: 976 HEQWRSFNN-EKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTR 1034
QWRSF N E++ + FLDG+ +E FLD+ +I + ++ + E++ + E+
Sbjct: 1152 FNQWRSFRNPEREGAGPERFLDGEFLEMFLDMEENTQIDICQGLSYTAEDMRNLIGEMKN 1211
Query: 1035 LH 1036
+H
Sbjct: 1212 MH 1213
>gi|389629928|ref|XP_003712617.1| hypothetical protein MGG_16867 [Magnaporthe oryzae 70-15]
gi|351644949|gb|EHA52810.1| hypothetical protein MGG_16867 [Magnaporthe oryzae 70-15]
gi|440464739|gb|ELQ34110.1| DNA damage-binding protein 1a [Magnaporthe oryzae Y34]
Length = 1183
Score = 375 bits (964), Expect = e-101, Method: Compositional matrix adjust.
Identities = 319/1141 (27%), Positives = 525/1141 (46%), Gaps = 123/1141 (10%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITR-AMGDVSDRIGRPTD 64
++G+I L+ RP D LF+ T+R+K+ ++D ++ EL+TR A+ D+ ++ R
Sbjct: 56 VFGKIVMLQAVRPKDSETDMLFVGTDRFKYFTAEYDPDTRELVTRQAISDLGEQFVREVS 115
Query: 65 NGQIGIIDPDCRLIGLHLYDGLFKVIPF-DNKGQLKEAFNIRLE--------ELQVLDIK 115
+ I+DP R + L L+ G+ V KGQ K RLE EL + D
Sbjct: 116 SRNRCIVDPSGRYMVLLLWSGIMHVWRLHKRKGQQKGQLQTRLELMDQARISELYIKDAV 175
Query: 116 FLYG-CAKPTIVVLYQDNKDARHVK--TYEVALKDKDFVEGPWSQ-------NNLDNGAD 165
FL+ A P I LYQ + K +Y + D+D + Q N D G+
Sbjct: 176 FLHSETAHPRIAFLYQPRPNEPDCKFASYRLCTDDRDRELSRFEQKDREFTFNVPDPGST 235
Query: 166 LLIPVP------------PP-----LCGVLIIGEETIVYCSANAFKAIPI---RPSITKA 205
++IPV P L GV+++GE ++Y ++ + A
Sbjct: 236 MMIPVERVETERRHNFRNPARDECHLGGVIVVGESRMLYIDDQSWTWTETALKNAMVFVA 295
Query: 206 YGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTG---LKIELLGETSIASTISYLD-NAV 261
+ + D + YLL D G LHLL I ++ T + +G TS A+ + Y + N
Sbjct: 296 WAKFD--NTHYLLADDYGGLHLLTIQVKQNSDTAVDHMSTVQIGTTSRATKLVYSETNRT 353
Query: 262 VYIGSSYGDSQLIKLNLQPDA-KG-SYVEVLERYVNLGPIVDFCVVDLERQ--------- 310
+++ S YGDSQ +NL DA KG S++E+ + N+ PI+DF V+D+ +
Sbjct: 354 LFVASHYGDSQFYDVNLFADAAKGESFLELRQTIENIAPILDFAVMDMGNREGDSQLGNE 413
Query: 311 ---GQGQVVTCSGAYKDGSLRIVRNGIGINEQASV--ELQGIKGMWSLRSSTDDPFDTFL 365
GQ ++VT SGA KDGSLR VR+G+G+ + + E+ G+ G++SL+S D DT L
Sbjct: 414 YSSGQARIVTASGAQKDGSLRSVRSGVGLEDIGVITDEISGVTGLFSLKSYGSDVEDT-L 472
Query: 366 VVSFISETRILAMNLEDELEE-TEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTS 424
VVSF++ETR+ + + E+EE ++++G TL ++ VT L +
Sbjct: 473 VVSFLTETRVFRFDKQGEVEELSQLQGLDISQPTLLVLGLDNGHVLYVTEEKATLFDAEG 532
Query: 425 RELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTEVKHAQLEYEISCL 483
+ W G + A++N VLL+ G LV L IG D ++ + E +ISC
Sbjct: 533 GVTISSWSPTSGKPITHASSNGRWVLLSVDGRKLVSLNIGLDLKVSAESEERDEDQISC- 591
Query: 484 DINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHL----GGEIIPRSVLLC--- 536
+ +P + AVG W+ ++ I L L E L ++ R V+L
Sbjct: 592 ----VNASPHLLDVGAVGFWSSGTISIIDLKTLEATQTEKLRRNEDDAVVAREVVLARVL 647
Query: 537 -AFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKN-TTHVFAA 594
A L + DG ++ F+ N G L+ RK V LGT+ R N +F
Sbjct: 648 PAEVANPTLFVSKDDGEVMTFVYN-DNGTLSSRKSVVLGTREARFRVLPQPNGLCSIFVT 706
Query: 595 SDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEG------ELTIGTIDD 648
+ ++I+ + ++++YS V +++CPF++AAF D LA+A E EL I ID
Sbjct: 707 CEHSSLIHGAERRIVYSAVTAHSAAYVCPFDTAAFRDCLAVATESAIDRRMELKISRIDR 766
Query: 649 IQKLHIRSIPLGEHPRRICHQEQSRTFAI-CSLKNQSCAEESEMHFVRLLDDQTFEFI-S 706
++ + + P+GE+ R I + + F + C + S E +L D+ FE +
Sbjct: 767 QRQCQMMTRPMGENVRSIAYSSADKVFGLGCIRRVLSRGIEKVYGTFKLFDEVIFEPKGN 826
Query: 707 TYPLDTFEYGCSILSCSFSD---DSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQ- 762
+ L+ E + D + + VGT Y L GR+LVF V++ +
Sbjct: 827 VFALEDGEVPECVTRAPLLDSYGEQAERFIVGTRY-LSGTGSGHGGRVLVFGVDESRSPY 885
Query: 763 LIAEKETKGAVYSLNAFNGKLLA-AINQKIQLYKWMLRDDGTRELQSECGHHGHILALYV 821
LI TK + + LL A+ + + L ++ + + A+ V
Sbjct: 886 LIHAHSTKSGCRRIATMDDDLLVIALTKTVVLVRYSETSTTSAKFLKVAAFQTSSYAVDV 945
Query: 822 QTRGDFIVVGDLMKSISLLIY--------------------KHEEGAIE----ERARDYN 857
G I V D+MKSI+LL Y K EG+ + E RDY
Sbjct: 946 TVHGKLIAVADIMKSITLLEYIPGVGKSAKTGGKDKATRSDKEVEGSKQAKLVEVCRDYQ 1005
Query: 858 ANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHG 917
A W +AV L+ D ++ A+ + NL + +N+ G T E++ R+++ E+ LGE VN+ +
Sbjct: 1006 AMWSTAVSHLEGDSWIVADGDGNLVVLLRNTAGVTLEDKRRMQMTSEFGLGECVNKIQK- 1064
Query: 918 SLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRK-VIKGVGGLNH 976
VM ++ + T G I + ++ + L Q N+ V +G L
Sbjct: 1065 --VMVETSANAPIVAKAFLSTTEGSIYLFGTVAPKFQSLLMDFQANMEAHVSSPLGELQF 1122
Query: 977 EQWRSFNN-EKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRL 1035
QWRSF N E++ + FLDG+ +E FLD+ +I + ++ + E++ + E+ +
Sbjct: 1123 NQWRSFRNPEREGAGPERFLDGEFLEMFLDMEENTQIDICQGLSYTAEDMRNLIGEMKNM 1182
Query: 1036 H 1036
H
Sbjct: 1183 H 1183
>gi|194377326|dbj|BAG57611.1| unnamed protein product [Homo sapiens]
Length = 451
Score = 375 bits (962), Expect = e-101, Method: Compositional matrix adjust.
Identities = 191/364 (52%), Positives = 247/364 (67%), Gaps = 15/364 (4%)
Query: 684 SCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEE 743
S EE E+H + ++D TFE + + EY S++SC D N Y+ VGTA V PEE
Sbjct: 92 SFGEEVEVHNLLIIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFIVGTAMVYPEE 151
Query: 744 NEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGT 803
EP +GRI+VF DGKLQ +AEKE KGAVYS+ FNGKLLA+IN ++LY+W
Sbjct: 152 AEPKQGRIVVFQYSDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRLYEWTTE---- 207
Query: 804 RELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSA 863
+EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK EG EE ARD+N NWMSA
Sbjct: 208 KELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIARDFNPNWMSA 267
Query: 864 VEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMR- 922
VEILDDD +LGAEN FNLF +K+S TDEER L+ VG +HLGEFVN F HGSLVM+
Sbjct: 268 VEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVFCHGSLVMQN 327
Query: 923 LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSF 982
L ++ +V+FGTVNG+IG++ SL Y L +Q L KVIK VG + H WRSF
Sbjct: 328 LGETSTPTQGSVLFGTVNGMIGLVTSLSESWYNLLLDMQNRLNKVIKSVGKIEHSFWRSF 387
Query: 983 NNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN----------VSVEELCKRVEEL 1032
+ E+KT A F+DGDLIESFLD+SR +M E+ + + ++L K VEEL
Sbjct: 388 HTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQYDDGSGMKREATADDLIKVVEEL 447
Query: 1033 TRLH 1036
TR+H
Sbjct: 448 TRIH 451
>gi|322706594|gb|EFY98174.1| DNA damage-binding protein 1 [Metarhizium anisopliae ARSEF 23]
Length = 1121
Score = 373 bits (958), Expect = e-100, Method: Compositional matrix adjust.
Identities = 307/1137 (27%), Positives = 552/1137 (48%), Gaps = 130/1137 (11%)
Query: 13 LELFRPHGEAQDFLFIATERYKFCVLQWDAESSEL--ITRAMGDVSDRIGRPTDNGQIGI 70
L+ +P D LFI T+R+++ + W+ E+++L + + + DV+++ R + + +
Sbjct: 2 LQRLKPKDSETDLLFIGTDRFQYFNVAWNPETNQLDTVEQTIEDVAEQYMRHSQSQNKCL 61
Query: 71 IDPDCRLIGLHLYDGLFKV--IPFDNKG---QLKEAFNIRLEELQVLDIKFLYG-CAKPT 124
+DP + + +HL++G+ V +P KG +L+ +RL EL + FL+ P
Sbjct: 62 VDPTGKFMAMHLWEGVLNVFRLPM-RKGMTTKLEALDQVRLTELWMKASTFLHSQTGHPK 120
Query: 125 IVVLYQDNKDARHVKTYEVALKDKDFVEG------PWSQNNLDN-----GADLLIPVP-- 171
I LY+ D + L KD G P + LD A +LIPVP
Sbjct: 121 IAFLYKTQTDQEEARIAVYRLT-KDDSRGNVARFDPHRERELDQIISDPYASMLIPVPFR 179
Query: 172 --------------PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAY-GRVDADGSRY 216
L G+L++GE I Y + + + K Y + D Y
Sbjct: 180 EEKRYHVRHNEGAKAHLGGLLVVGETLITYFDSLTYSRVSSTLQDPKIYVAWTEYDDVHY 239
Query: 217 LLGDHAGLLHLLVITHEKEK----VTGLKIELL------GETSIASTISYLDNAVVYIGS 266
LL D G L +L I E VTG+ I + G TS AS++ Y+ N ++++ S
Sbjct: 240 LLADDYGRLDILTIETATESTGIVVTGMAISPMRFPDSSGCTSRASSLVYMGNDMLFLAS 299
Query: 267 SYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQ------------GQGQ 314
+GDSQL+++++ DA+ + V + N PI+DF ++D+ + GQ +
Sbjct: 300 HHGDSQLLRIDI--DAQ--VMVVAKALSNNAPILDFAIMDMGNREGDSQFGNAFSSGQAR 355
Query: 315 VVTCSGAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLRSSTDDPFDTFLVVSFISET 373
+V GAY DGSLR +R+G+G+ +Q + E+Q KG+++LRS DT LVVS +++T
Sbjct: 356 IVAGCGAYHDGSLRSIRSGVGLEDQGILDEIQDTKGLFTLRSHQSSHVDT-LVVSSVADT 414
Query: 374 RILAMNLEDELEET-EIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWK 432
R+L + +EE +G +TL + QL+QVT S L+ S S W
Sbjct: 415 RVLRFDSAGGIEEVYAFQGLTLDMETLLAVNISDGQLLQVTPKSAVLLDSESGVNLCSWD 474
Query: 433 SPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEY------EISCLDIN 486
+P G ++ A+AN LL+ G LV L + D + ++ + +ISCL +
Sbjct: 475 APSGKAITAASANKGWALLSIDGSLLVSLNLHDNLAAVLRDTSGDESSTQPDQISCL--H 532
Query: 487 PIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHL----GGEIIPRSVLLCAFEGIS 542
++P I VG W ++ I L L + E L +PR + L
Sbjct: 533 AARDSP---DIGVVGWWASGTISIVDLATLQPLHGEPLRQTEDSASVPRDIALVQLHPPQ 589
Query: 543 Y----LLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKN-TTHVFAASDR 597
LL AL DG+++ F ++++ ++ RK V+LG+ P L ++ T +VFA ++
Sbjct: 590 VSGPTLLIALEDGNVVTFDMSIQGYTISGRKSVTLGSSPARLHVLPQEDGTCNVFATTEH 649
Query: 598 PTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSI 657
++IYS+ +++YS + +++ PF+S AFP+S+ ++ + + + ID + H++++
Sbjct: 650 ASLIYSAEGRIIYSATTADDATYVAPFDSEAFPNSIVLSTDSHIRLSHIDKERLTHVKTL 709
Query: 658 PLGEHPRRICHQEQSRTFAICSLKNQSCA-EESEMHFVRLLDDQTFE-------FISTYP 709
+ E RR+ + + F + +K + EE R++D+ F+ F ++
Sbjct: 710 SVKETVRRVAYSPTLKVFGLGCIKKELIQNEEVITSSFRIVDEIIFQELGKPFIFNTSTS 769
Query: 710 LDTFEYGCSILSCSFSDD-SNV--YYCVGTAYVLP----EENEPTKGRILVFIV-EDGKL 761
L+ E +++ D N+ + +GT+++ EEN+ T+GRILV V E+ ++
Sbjct: 770 LEMVE---TVIRAELPDSMGNLAERFIIGTSFITDDDAIEEND-TRGRILVLGVDENRQV 825
Query: 762 QLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYV 821
I KGA L ++A +++ + +Y ++ LQ + L +
Sbjct: 826 YQIVSHNLKGACRCLGTLGEHIVAGLSKTVVVYHYVEETTVFGSLQKLAAYRPASFPLSL 885
Query: 822 QTRGDFIVVGDLMKSISLLIY-KHEEGA---IEERARDYNANWMSAVEILDDDIYLGAEN 877
G+ I V DLM+S++L+ + E+G+ +EE AR Y W ++V LD + +L A+
Sbjct: 886 DISGNIIGVVDLMQSLTLVEFIPSEDGSRAKLEETARHYQPGWATSVAHLDGERWLEADA 945
Query: 878 NFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSL-----VMRLPDSDVGQI- 931
N+ +++N E T+++R +LEV E ++GE +N+ R + + P + +G +
Sbjct: 946 QGNIIVLQRNPEAPTEQDRSKLEVTSEMNIGEQINQIRKLHVASNENAVVSPKAFLGSVG 1005
Query: 932 --PTVI--------FGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWR 980
T+I + G + + + P+ Q L L Q+ L+ I G ++ WR
Sbjct: 1006 LSETIITCWNQLLMLVQIEGTLYLFGEIAPNYQDLLL-TFQSRLQDYIYAPGNVSFNLWR 1064
Query: 981 SFNNEKKTVDAK-NFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRLH 1036
+F N+ + D F+DG+++E FLDL + + + + + SVE++ +EEL R+H
Sbjct: 1065 AFRNKAREGDGPFRFVDGEMVERFLDLDEAKQELVCEGLGPSVEDMRNMIEELRRMH 1121
>gi|346321204|gb|EGX90804.1| DNA damage-binding protein 1 [Cordyceps militaris CM01]
Length = 1160
Score = 370 bits (951), Expect = 2e-99, Method: Compositional matrix adjust.
Identities = 294/1125 (26%), Positives = 533/1125 (47%), Gaps = 113/1125 (10%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSE--LITRAMGDVSDRIGRPT 63
++G I L+ RP D LF+ TER ++ + W+++ +E L + D ++ R +
Sbjct: 55 VHGTICMLQRLRPKDANTDLLFVGTERLQYFNVSWNSQRNEMELTHETIHDTAEPYMRES 114
Query: 64 DNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNK----GQLKEAFNIRLEELQVLDIKFLYG 119
+ ++DP + + +HL++G+ V + +L+E IRL EL + D F++
Sbjct: 115 QSQNKCLVDPTAKYMVMHLWEGVLNVFRLPARRGVSDKLEEMDQIRLTELWMKDSAFIHS 174
Query: 120 -CAKPTIVVLYQ---DNKDARHVKTYEVALKDK--DFVE-GPWSQNNLDN-----GADLL 167
P I LY+ D ++AR V Y + DK D + P LD A +L
Sbjct: 175 RTGHPRIAFLYKTQLDEEEAR-VAVYRLTKDDKGSDVSKFDPHKDRELDQVIADPCASML 233
Query: 168 IPVP----------------PPLCGVLIIGEETIVYCSANAFKAIPI---RPSITKAYGR 208
IPVP L G+L++GE + Y + + ++ P I A+
Sbjct: 234 IPVPVAEEKRYHVRNNEGTRAHLGGLLVVGETLLTYFDSLTYSSVSSVLDEPKIYVAWAM 293
Query: 209 VDADGSRYLLGDHAGLLHLLVI-THEKEK---VTGLKIELL------GETSIASTISYLD 258
D + Y L D G L LL + TH ++ V + +E L TS AS++ Y+
Sbjct: 294 YDE--THYFLADDYGRLDLLTLQTHSEQTGVVVNRMTVEPLKFPDSKNLTSRASSLVYMG 351
Query: 259 NAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQ-------- 310
++ + S +GDSQL LQ D + ++ N GPI+DF ++D+ +
Sbjct: 352 GGMLVVASHHGDSQL----LQIDIDSRTMHPIKLLSNNGPILDFAIMDMGNRDGDNLQGN 407
Query: 311 ----GQGQVVTCSGAYKDGSLRIVRNGIGINEQASVE-LQGIKGMWSLRSSTDDPFDTFL 365
GQ ++V GAY+DGSLR +R+G+G+ + +E ++ + +++++S DT L
Sbjct: 408 LFSSGQARIVAGCGAYRDGSLRSIRSGVGLEDLGILEEIKNTRRLFTIKSRRSKKVDT-L 466
Query: 366 VVSFISETRILAMNLEDELEETE-IEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTS 424
V +I +TR+ + +++E G + ++L + +L+QVT V +
Sbjct: 467 VACYILDTRVFLFGADGDIQEVNNFPGLNLRVESLLVVNLPNGRLLQVTRMDVTIKDPNE 526
Query: 425 RELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQ------LEY 478
+ W +P G ++ A+ N +LL G +V L I + V+ Q
Sbjct: 527 QAKALRWVAP-GANITSASCNGKWILLCVDGSKIVSLNIDKNLEACVQQYQGADDSNQSN 585
Query: 479 EISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHL----GGEIIPRSVL 534
++SCL + ++ +G W + I L L+ E L +PR +
Sbjct: 586 QVSCLH-----ASTTFQDYGVIGWWYPGRITIIDLATLSAKHVETLQQTTDAASVPRDIS 640
Query: 535 LCAFE----GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKN-TT 589
L LL AL DG ++ F ++ + L+ +K+V+LG+ P L + T+
Sbjct: 641 LVQLHPRHASGPTLLVALEDGTVITFNVS-EDLTLSGKKRVTLGSSPACLHVLPEADGTS 699
Query: 590 HVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDI 649
+FA ++ ++IYSS +++YS + + + PF+S AFP+S+ ++ + + + ID
Sbjct: 700 KIFATTENASLIYSSEGRIIYSATTADDATCVVPFDSEAFPNSILLSTDKHVRLCHIDKE 759
Query: 650 QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCA-EESEMHFVRLLDDQTFEFISTY 708
+ H++S+P+ E RR+ + + FA+ +K + EE V+L+D+ F+ +
Sbjct: 760 RLTHVKSLPVYETVRRVAYAPGVKAFALGCIKKELIRNEEVITSSVKLVDEIIFQELGA- 818
Query: 709 PL-----DTFEYGCSILSCSFSDDSNVY---YCVGTAYVLPE---ENEPTKGRILVFIV- 756
PL T E S++ D + + VGT+++ E T+GRILV V
Sbjct: 819 PLQLNASSTLEMVESVIRAELPDHTGALVERFIVGTSFITDADVGEASETRGRILVLGVD 878
Query: 757 EDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHI 816
E+ +L I KGA L+ + ++A +++ + +Y++ LQ +
Sbjct: 879 EERQLYTIVTHNLKGACRCLSVLDEYIVAGLSKTVVVYRYTEETSTEGSLQKLAAYRPAS 938
Query: 817 LALYVQTRGDFIVVGDLMKSISLLIYKH----EEGAIEERARDYNANWMSAVEILDDDIY 872
+ + G+ I VGDLM+S+SL+ + E ++E+AR + + W ++V LD + +
Sbjct: 939 FPVALDVSGNMIGVGDLMQSLSLVEFTPPKDGEPAKLQEKARHFQSAWATSVCHLDGERW 998
Query: 873 LGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIP 932
L + N+ + +N E T+++RGRLE+ E +LGE +N+ R ++ P + P
Sbjct: 999 LETDAQGNIMVLARNPEAPTEQDRGRLEITSEMNLGEQINKIRKLNVA---PADNAVVSP 1055
Query: 933 TVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAK 992
++ G + + + + L LQ+N+ + +K G ++ WRSF N+ + D
Sbjct: 1056 KAFLASIEGTLYLYGDIAPKYQDLLITLQSNIEQYVKTTGDISFNAWRSFRNQTREADGP 1115
Query: 993 -NFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRLH 1036
F+DG+++E FLDL E+ K + SVE++ +EEL R+H
Sbjct: 1116 FRFVDGEMVERFLDLDELTQVELCKDLGPSVEDVRNMIEELRRMH 1160
>gi|90108802|pdb|2B5N|A Chain A, Crystal Structure Of The Ddb1 Bpb Domain
gi|90108803|pdb|2B5N|B Chain B, Crystal Structure Of The Ddb1 Bpb Domain
gi|90108804|pdb|2B5N|C Chain C, Crystal Structure Of The Ddb1 Bpb Domain
gi|90108805|pdb|2B5N|D Chain D, Crystal Structure Of The Ddb1 Bpb Domain
Length = 323
Score = 369 bits (946), Expect = 6e-99, Method: Compositional matrix adjust.
Identities = 183/323 (56%), Positives = 241/323 (74%), Gaps = 4/323 (1%)
Query: 330 VRNGIGINEQASVELQGIKGMWSLRSSTD-DPFDTFLVVSFISETRILAMNLEDELEETE 388
+RNGIGI+E AS++L GIKG+W LRS + + +DT LV+SF+ +TR+L +N E E+EETE
Sbjct: 4 MRNGIGIHEHASIDLPGIKGLWPLRSDPNRETYDT-LVLSFVGQTRVLMLNGE-EVEETE 61
Query: 389 IEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQ 448
+ GF QT FC + + QL+Q+TS SVRLVS + L +EWK P +++VA+ N+SQ
Sbjct: 62 LMGFVDDQQTFFCGNVAHQQLIQITSASVRLVSQEPKALVSEWKEPQAKNISVASCNSSQ 121
Query: 449 VLLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISV 508
V++A G L YL+I L ++ H ++E+E++CLDI P+G++ S + A+G+WTDIS
Sbjct: 122 VVVAVGRA-LYYLQIHPQELRQISHTEMEHEVACLDITPLGDSNGLSPLCAIGLWTDISA 180
Query: 509 RIFSLPDLNLITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDR 568
RI LP L+ KE LGGEIIPRS+L+ FE YLLCALGDG L F LN++TG L+DR
Sbjct: 181 RILKLPSFELLHKEMLGGEIIPRSILMTTFESSHYLLCALGDGALFYFGLNIETGLLSDR 240
Query: 569 KKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAA 628
KKV+LGTQP LRTF S +TT+VFA SDRPTVIYSSN KL++SNVNLKEV++MCP NS
Sbjct: 241 KKVTLGTQPTVLRTFRSLSTTNVFACSDRPTVIYSSNHKLVFSNVNLKEVNYMCPLNSDG 300
Query: 629 FPDSLAIAKEGELTIGTIDDIQK 651
+PDSLA+A LTIGTID+IQK
Sbjct: 301 YPDSLALANNSTLTIGTIDEIQK 323
>gi|322700871|gb|EFY92623.1| DNA damage-binding protein 1 [Metarhizium acridum CQMa 102]
Length = 1121
Score = 367 bits (942), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 301/1137 (26%), Positives = 548/1137 (48%), Gaps = 130/1137 (11%)
Query: 13 LELFRPHGEAQDFLFIATERYKFCVLQWDAESSEL--ITRAMGDVSDRIGRPTDNGQIGI 70
L+ +P D LFI T+R + + W++E+++L + + + DV+++ R + + +
Sbjct: 2 LQRLQPKDSETDLLFIGTDRLNYFNVAWNSETNQLDTVEQTIEDVAEQYMRHSQSQNKCL 61
Query: 71 IDPDCRLIGLHLYDGLFKV--IPFDNKG---QLKEAFNIRLEELQVLDIKFLYG-CAKPT 124
+DP + + +HL++G+ V +P KG +L +RL EL + FL+ P
Sbjct: 62 VDPTGKFMAMHLWEGVLNVFRLPM-RKGITTKLVALDQVRLTELWMKASTFLHSQTGHPK 120
Query: 125 IVVLYQDNKDARHVKTYEVALKDKDFVEG------PWSQNNLDNG-----ADLLIPVP-- 171
I LY+ D + L KD G P + LD A +LIPVP
Sbjct: 121 IAFLYKTQTDQEEARIAVYRLT-KDDNRGNVASFDPHRERELDQVISDPYASMLIPVPFR 179
Query: 172 --------------PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAY-GRVDADGSRY 216
L G+L++GE I Y + + + K Y + D Y
Sbjct: 180 EEKRYHVRNNEGAKAHLGGLLVVGETLITYFDSLTYSRVSSTLQDPKIYVAWTEYDDVHY 239
Query: 217 LLGDHAGLLHLLVITHEKEK----VTGLKIELL------GETSIASTISYLDNAVVYIGS 266
LL D G L +L + E VTG+ + + TS AS++ Y+ + ++++ S
Sbjct: 240 LLADDYGRLDILTVETTTESTGIVVTGMAVSPMRFPDSSAYTSRASSLVYMGDDMLFLAS 299
Query: 267 SYGDSQLIKLNLQPDAKGSYVEVLERYV-NLGPIVDFCVVDLERQ------------GQG 313
+GDSQL+++++ DA+ V VL + + N PI+DF ++D+ + GQ
Sbjct: 300 HHGDSQLLRIDI--DAQ---VMVLVKTLSNNAPILDFAIMDMGNREGDSQFGNAFSSGQA 354
Query: 314 QVVTCSGAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLRSSTDDPFDTFLVVSFISE 372
++V GAY DGSLR +R+G+G+ +Q + E+Q KG+++LRS DT LVVS +++
Sbjct: 355 RIVAGCGAYHDGSLRSIRSGVGLEDQGILDEIQDTKGLFTLRSHESSHVDT-LVVSSVAD 413
Query: 373 TRILAMNLEDELEET-EIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEW 431
TR+L + ++EE +G +TL + +L+QVT S L+ S S W
Sbjct: 414 TRVLRFDSAGDIEEVYAFQGLTLDMETLLAVNISDGRLLQVTPKSAVLLDSESGVNLCSW 473
Query: 432 KSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEY------EISCLDI 485
+P G ++ A+AN LL+ G LV L + D + ++ + +ISCL
Sbjct: 474 DAPSGKAITAASANKGWALLSIDGSSLVSLNLHDNLAAVLRDTSGDESSGQPDQISCL-- 531
Query: 486 NPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHL----GGEIIPRSVLLCAFEGI 541
+ ++P I VG WT ++ I L L + E L +PR + L
Sbjct: 532 HAARDSP---DIGVVGWWTSGTISIVDLATLQPLHGEPLRQTEDSASVPRDIALVQLHPP 588
Query: 542 SY----LLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKN-TTHVFAASD 596
LL AL DG+++ F ++M+ ++ RK V+LG+ P L ++ T +VFA ++
Sbjct: 589 QVSGPTLLIALEDGNVVTFDMSMQGYTISGRKSVTLGSSPARLHVLPQEDGTCNVFATTE 648
Query: 597 RPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRS 656
++IYS+ +++YS + +++ PF+S AFP+S+ ++ + + + ID + H+++
Sbjct: 649 HASLIYSAEGRIIYSATTADDATYVAPFDSEAFPNSIVLSTDSHIRLSHIDKERLTHVKT 708
Query: 657 IPLGEHPRRICHQEQSRTFAICSLKNQSC-AEESEMHFVRLLDDQTFE-------FISTY 708
+ + E RR+ + + F + +K + EE R++D+ F+ F ++
Sbjct: 709 LSVKETVRRVAYSPTLKVFGLGCIKKELIHNEEVITSSFRIVDEIIFQELGKPFIFNTST 768
Query: 709 PLDTFEYGCSILSCSFSDDSNVY---YCVGTAYVLP----EENEPTKGRILVFIV-EDGK 760
L+ E +++ D + + +GT++V EEN+ T+GRILV V E+ +
Sbjct: 769 SLEMVE---TVIRAELPDSTGNLAERFIIGTSFVTDDDAIEEND-TRGRILVLGVDENRQ 824
Query: 761 LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY 820
+ I KGA L+ ++A +++ + +Y ++ LQ + L
Sbjct: 825 VYQIVSHNLKGACRCLSTLGEHIVAGLSKTVVVYNYVEETTVFGSLQKLAAYRPASFPLG 884
Query: 821 VQTRGDFIVVGDLMKSISLLIY-KHEEGA---IEERARDYNANWMSAVEILDDDIYLGAE 876
+ G+ I V DLM+S++L+ + ++G+ +EE AR Y W ++V LD + +L A+
Sbjct: 885 LDISGNIIGVVDLMQSLTLVEFIPSKDGSRAKLEEVARHYQPGWATSVTNLDGERWLEAD 944
Query: 877 NNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSL-----VMRLPDSDVGQI 931
N+ +++N E T+++R +LEV E ++GE +N+ R + + P + +G +
Sbjct: 945 AQGNIIVLQRNPEAPTEQDRSKLEVTSEINIGEQINQIRRLHVASNENAVVSPKAFLGSV 1004
Query: 932 -----------PTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWR 980
+I + G + + + + L Q L+ I G ++ WR
Sbjct: 1005 GLSETTINCWTQLLILVQIEGTLYLFGEIAPKYQDLLLTFQARLQDYIYAPGNVSFNLWR 1064
Query: 981 SFNNEKKTVDAK-NFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRLH 1036
+F N+ + D F+DG+++E FLDL + + + + + SVE++ +EEL R+H
Sbjct: 1065 AFRNKAREGDGPFRFVDGEMVERFLDLDEAKQELVCEGLGPSVEDMRNMIEELRRMH 1121
>gi|384080885|dbj|BAM11105.1| damage-specific DNA binding protein 1, 127kDa, partial
[Siebenrockiella crassicollis]
Length = 364
Score = 359 bits (922), Expect = 4e-96, Method: Compositional matrix adjust.
Identities = 186/367 (50%), Positives = 237/367 (64%), Gaps = 44/367 (11%)
Query: 615 LKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRT 674
LKEV++MCP NS +PDSLA+A LTIGTID+IQKLHIR++PL E PR+IC+QE S+
Sbjct: 2 LKEVNYMCPLNSDGYPDSLALANNSTLTIGTIDEIQKLHIRTVPLYESPRKICYQEVSQC 61
Query: 675 FAICSLK---------------------------------------NQSCAEESEMHFVR 695
F + S + S EE E+H +
Sbjct: 62 FGVLSSRIEVQDASGGTTALRPSASTQALSSSVSSSKLFSSSTAPHETSFGEEVEVHNLL 121
Query: 696 LLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFI 755
++D TFE + + EY S++SC D N Y+ VGTA V PEE EP +GRI+VF
Sbjct: 122 IIDQHTFEVLHAHQFLQNEYALSLVSCKLGKDPNTYFIVGTAMVYPEEAEPKQGRIVVFH 181
Query: 756 VEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGH 815
DGKLQ +AEKE KGAVYS+ FNGKLLA+IN ++LY+W +EL++EC H+ +
Sbjct: 182 YSDGKLQSLAEKEVKGAVYSMVEFNGKLLASINSTVRLYEWT----AEKELRTECNHYNN 237
Query: 816 ILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGA 875
I+ALYV+T+GDFI+VGDLM+S+ LL YK EG EE ARD+N NWMSAVEILDDD +LGA
Sbjct: 238 IMALYVKTKGDFILVGDLMRSVLLLAYKPMEGNFEEIARDFNPNWMSAVEILDDDNFLGA 297
Query: 876 ENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMR-LPDSDVGQIPTV 934
EN FNLF +K+S TDEER L+ VG +HLGEFVN F HGSLVM+ L ++ +V
Sbjct: 298 ENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVFCHGSLVMQNLGETSTPTQDSV 357
Query: 935 IFGTVNG 941
+FGTVNG
Sbjct: 358 LFGTVNG 364
>gi|154303693|ref|XP_001552253.1| hypothetical protein BC1G_08731 [Botryotinia fuckeliana B05.10]
Length = 1087
Score = 357 bits (915), Expect = 2e-95, Method: Compositional matrix adjust.
Identities = 269/882 (30%), Positives = 454/882 (51%), Gaps = 76/882 (8%)
Query: 212 DGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDS 271
D YLLGD G L++L I + VTGL++ LG+ S +++ L N V YI S D+
Sbjct: 225 DERTYLLGDDYGFLYILTILVDGATVTGLQVVKLGQVSKPTSLENLGNGVFYIASHEADN 284
Query: 272 QLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDL-----ERQ------GQGQVVTCSG 320
Q+++++L+ G V +L+ N+ PI+DF V+D+ E Q GQ ++VT SG
Sbjct: 285 QVVQIDLESPEHG--VTLLQTLPNIAPILDFTVMDMGGREGETQLNEYSSGQARLVTGSG 342
Query: 321 AYKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMN 379
++ GSLR VR+G+G+++ A + E++GI+ +++L S P DT LVVSF +ETR +
Sbjct: 343 GFEGGSLRSVRSGVGLDDTAILAEMEGIRKVFALHSGPTLPNDT-LVVSFSTETRFFKFD 401
Query: 380 LEDELEETE-IEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYS 438
+ ++EE E ++ S ++TL ++ ++QVT V + + + W+ P G
Sbjct: 402 TQGDIEEVETVKNLSSTSETLLTYNLDEGCILQVTQHEVSIYGKSPG---HRWQPPNGQI 458
Query: 439 VNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQL-EYEISCLDINPIGENPSYSQI 497
+ A+ N + +LL++ G L L I L EV +L + +++C+ + + I
Sbjct: 459 ITAASGNQNYILLSSNGRTLFTLSIQQN-LAEVAFQELGDDQVACIHVPQV-----MGDI 512
Query: 498 AAVGMWTDISVRIFSLPDLNLITKEHL---GGEIIPRSVLLCA-----FEGISYLLCALG 549
VG+W SV + L LN I E L G IPR + L G + L ++
Sbjct: 513 GVVGLWKSGSVSLLDLATLNTIVSEDLRRADGASIPRDIALTQILPPELSGPT-LFVSME 571
Query: 550 DGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTT-HVFAASDRPTVIYSSNKKL 608
DG +L+F ++ L+ RK + LGTQ L+ NTT +VFA + P++IY S +
Sbjct: 572 DGIVLSFNVDKSDCSLSGRKSIVLGTQQAQLQILPRDNTTFNVFATCEHPSLIYGSEGRT 631
Query: 609 LYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICH 668
+YS V ++ +C NS A+P S+ +A EL + ID+ ++ H+R++P+GE RR+ +
Sbjct: 632 VYSAVTAEDAIAVCSLNSVAYPGSVVVATINELKLAVIDNERRTHVRTLPIGETVRRVAY 691
Query: 669 QEQSRTFAICSLKNQ-SCAEESEMHFVRLLDDQTF-EFISTYPL----DTFEYGCSILSC 722
+ ++FAI ++K + + +E RL+D+ F E Y L + E
Sbjct: 692 SAKEKSFAIGAIKRELTKGQEVVTTSFRLVDEVVFGELGEPYYLPPNNEIIETVVRAELP 751
Query: 723 SFSDDSNVY--YCVGTAYVLPEENEPTKGRILVFIVE-DGKLQLIAEKETKGAVYSLNAF 779
+ D + + VGT++ L EE +GR+L+F V D +IA KG+ +
Sbjct: 752 TRHGDGELVERFLVGTSF-LHEEEANVRGRLLIFGVNADRAPYMIASHNLKGSCRCIGVL 810
Query: 780 NGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISL 839
+GK++AA+N+ + +Y + + L+ + + + + I V D+MKSI+L
Sbjct: 811 DGKIVAALNKTVVMYDYEETSSTSATLKKLATYRCSTCPIDIDITDNIIAVADIMKSIAL 870
Query: 840 LIYKHEEGA------IEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATD 893
+ Y GA +EE AR + ++V +D D YL +++ NL +++N EG T
Sbjct: 871 VEYT--PGADGLPDKLEEVARHAQQVFSTSVAEVDTDTYLETDHDGNLILLKRNREGVTR 928
Query: 894 EERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQ 953
E++ R+EV E +LGE VNR + ++ D IP GT G I + + +P +
Sbjct: 929 EDKTRMEVTCEMNLGEMVNRVKRINVET---SKDALLIPRAFLGTTEGSIYLFSLIPPQN 985
Query: 954 YLFLEKLQTNLRKV-----IKGV-------------GGLNHEQWRSF-NNEKKTVDAKNF 994
L +LQ+ L + I+G G L+ ++RS+ + ++T + F
Sbjct: 986 QDLLMRLQSRLASLPSASSIRGSSDSTSPHQIELSPGNLDFNKYRSYISATRETSEPFRF 1045
Query: 995 LDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRLH 1036
+DG+LIE FLDL + +++ + V E+L VE L RLH
Sbjct: 1046 VDGELIERFLDLEVEVQEHVAEGLGVKAEDLRGMVEGLRRLH 1087
Score = 69.7 bits (169), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 49/172 (28%), Positives = 90/172 (52%), Gaps = 10/172 (5%)
Query: 629 FPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQ-SCAE 687
+P S+ +A EL + ID+ ++ H+R++P+GE RR+ + + ++FAI ++K + + +
Sbjct: 5 YPGSVVVATINELKLAVIDNERRTHVRTLPIGETVRRVAYSAKEKSFAIGAIKRELTKGQ 64
Query: 688 ESEMHFVRLLDDQTF-EFISTYPL----DTFEYGCSILSCSFSDDSNVY--YCVGTAYVL 740
E RL+D+ F E Y L + E + D + + VGT++ L
Sbjct: 65 EVVTTSFRLVDEVVFGELGEPYYLPPNNEIIETVVRAELPTRHGDGELVERFLVGTSF-L 123
Query: 741 PEENEPTKGRILVFIVE-DGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKI 791
EE +GR+L+F V D +IA KG+ + +GK++AA+N+ +
Sbjct: 124 HEEEANVRGRLLIFGVNADRAPYMIASHNLKGSCRCIGVLDGKIVAALNKTV 175
>gi|393243160|gb|EJD50676.1| hypothetical protein AURDEDRAFT_112250 [Auricularia delicata
TFB-10046 SS5]
Length = 1140
Score = 354 bits (909), Expect = 1e-94, Method: Compositional matrix adjust.
Identities = 294/1007 (29%), Positives = 488/1007 (48%), Gaps = 113/1007 (11%)
Query: 83 YDGLFKVIPFDNK-GQL-KEAFNIRLEELQVLDIKFL-YGCAKPTIVVLYQDNKDARHVK 139
Y G + D++ G L +E F+ L EL VL + P + +L+ D+ D R V
Sbjct: 126 YKGKLHALRLDDRRGVLGRECFDCNLRELTVLSACVVPVESDLPVLTILHTDSDDKRRVT 185
Query: 140 TYEVALKDKDF---VEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANA---- 192
T V L + D E +++ DN L+P+ C + + G VY A+A
Sbjct: 186 TRPVLLDESDVGERTEHLFTKVISDN-THTLVPLRNHGCILTMGGTSLRVYKVAHADVRL 244
Query: 193 ----------------FKAIPIRPSITKA-------YGRVDA----DGSRYLLGDHAGLL 225
KA P+IT+A +G + A D R L+GD G L
Sbjct: 245 SPAHKKSRKRQSTTGKGKAAADEPAITEAAVSEPWPWGEIVAWTMVDDERILIGDAYGRL 304
Query: 226 HLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGS 285
L+ ++ + + LLG+TS A+T +YLDN ++Y+GS +GDSQL++L+ DA GS
Sbjct: 305 VLVTVSLNENAAFTISPVLLGQTSPATTFTYLDNGILYVGSHFGDSQLVRLSTAADASGS 364
Query: 286 YVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVE-L 344
Y+ V++ Y NL PIVD +VD+ Q QVVTCSG Y GSLR+V+ + + A ++
Sbjct: 365 YISVVKAYSNLAPIVDAVIVDINDSNQPQVVTCSGGYNTGSLRVVQKSAELEQLAIIDAF 424
Query: 345 QGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDA 404
+ ++ LR+S+D DT ++VS +T+ + NLE E E F ++ TL +
Sbjct: 425 PHTENIFPLRASSDAKEDTRMLVSSHMDTKFV--NLEGEDLSVEESPFVTREPTLAASNF 482
Query: 405 IYNQLV------QVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHL 458
++ V Q+T+ V LV++ + + P G + +A+ + +Q LLA G L
Sbjct: 483 VHKDQVETPYVLQITTKRVVLVNTVVDMEIHSHEYPEG--IMLASCSRTQALLALSDGAL 540
Query: 459 VYLEIGDGILTEVKHAQLEYEI----SCLDINPIGENPSYSQIAAVGMWTDISVRIFSLP 514
VY+ + + + K +LE+ + S L I P + + W + L
Sbjct: 541 VYIRLENDNINATKGPKLEWTLRGQWSALCIEPADARGGPTDAVVIAKWDN----TVQLR 596
Query: 515 DLNLITKEHLGGEIIPRSVLLCAFEGISYLLCALGDG----HLLNFLLN--------MKT 562
+ + ++ + + + L C S LL G G H+L N K
Sbjct: 597 RMGALFRDDIAHATLQLASLAC-----SVLLVDFGQGRENRHVLAGCANGTVVAARVTKD 651
Query: 563 GELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMC 622
L + + LG+ P+ LR ++ V A TV+Y + +L +S + +K++++
Sbjct: 652 WNLEAPRTIGLGSLPVRLR----RDGDRVVACGSLVTVLYWDSGRLQHSALAVKDITNAV 707
Query: 623 PFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG-EHPRRICHQEQSRTFAICSLK 681
P ++ AF ++ G L I + +Q+LHI++IP+G E+PRRI + ++ F + ++
Sbjct: 708 PLHTPAFENTTVFISPGSLLIARVKQVQRLHIKTIPMGSENPRRIAYNSHAKAFGVGTMH 767
Query: 682 NQSCA---EESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNV-YYCVGTA 737
+Q ES ++L+ TF+ +S+ LD E S+ S +S + VGTA
Sbjct: 768 SQPVGVGEPESVTSSFKILNQDTFDTLSSMQLDDNEEITSVASLPIMPESRTEMFVVGTA 827
Query: 738 YVLPEENEPTKGRILVF-IVED---GKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
Y+ E EP++GRILVF +ED G L A + GAV SL + +G ++A +N + L
Sbjct: 828 YIKDSEMEPSRGRILVFGSLEDSGTGGSWLTAFLQVTGAVLSLTSVDGLIVAGVNTAVIL 887
Query: 794 YKWMLRDDGTREL---------QSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKH 844
Y+ LR + E Q + +H +++ + RGD I +GD + SI++L +KH
Sbjct: 888 YE--LRRNTLSEAERASHLTLRQKKEWNHNYVVT-SLAARGDTIYIGDSVASIAILRWKH 944
Query: 845 EEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGE 904
E + AR + + A++++ + A ++NL T + E TD +LE+ G
Sbjct: 945 E--TLHTIARHFGPIFPLALDVMSSGSVITANIDYNLHTFHQ--ESPTDR---KLEIDGS 997
Query: 905 YHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNL 964
YHLG+ VN+F G L P + +F T G IG++A + L L+ N+
Sbjct: 998 YHLGDQVNKFIPGRL--SAPTVGASIVLEQVFVTSLGRIGIVAEADKDASWALSALERNI 1055
Query: 965 RKVIKGVGGLNHEQWRSFNNEKKTVDAK----NFLDGDLIESFLDLS 1007
KV+ G H+ WR+ ++E DA+ NF+DGD++E F D++
Sbjct: 1056 EKVLDQ-GAPKHDLWRAPHSEHGVSDAQRSAANFIDGDMLERFYDVA 1101
>gi|315039795|ref|XP_003169275.1| hypothetical protein MGYG_08821 [Arthroderma gypseum CBS 118893]
gi|311337696|gb|EFQ96898.1| hypothetical protein MGYG_08821 [Arthroderma gypseum CBS 118893]
Length = 1076
Score = 351 bits (900), Expect = 1e-93, Method: Compositional matrix adjust.
Identities = 309/1094 (28%), Positives = 510/1094 (46%), Gaps = 154/1094 (14%)
Query: 6 IYGRIATLE-LFRPHGEA---QDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIG 60
IYGRI L+ L RP D LF+ T++Y + L WDA +L T R D++D
Sbjct: 74 IYGRITLLQKLSRPTTAGPVLTDVLFVGTDQYAYFSLTWDAAHDQLRTERKYIDLADGSL 133
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPF----------------------DNKGQL 98
R + IDP + L +Y+G+ VIP N+
Sbjct: 134 REAHSDDRCQIDPSGSFLTLEVYEGVVSVIPLATADSHKRAKSAASTSASTAPTSNRTAR 193
Query: 99 KEAFNIRLE-ELQVLDIKFLYGCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQ 157
+ A N + + L++ D+K + V+ D A +K L D+
Sbjct: 194 RAATNSQGKVRLKLRDLKHTHA-------VVTGDGGSAAELKDV-TTLSDE--------- 236
Query: 158 NNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIRP----SITKAYGRVDADG 213
LD GA +LIPVP PL G+LIIGE +I Y + + I RP ++ A+ +VD G
Sbjct: 237 --LDLGASILIPVPRPLGGLLIIGESSIKYVDVSRNETIS-RPLAESTVFVAWEQVD--G 291
Query: 214 SRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQL 273
R+LL D G L L++ + DNAV +
Sbjct: 292 QRWLLADDYGRLFFLMLVLDA-----------------------DNAV----------ES 318
Query: 274 IKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNG 333
K++L LG +F GQ ++VT SGA+ DGSLR VR+G
Sbjct: 319 WKVDL-----------------LGTTREFS------SGQTRIVTGSGAFGDGSLRSVRSG 355
Query: 334 IGINEQASV-ELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIE-- 390
+GI + + ++ I +W LRS+ +PF L+VSF++ETR+ + + E+EE E +
Sbjct: 356 VGIEDLGVLASMEHITDLWGLRSACPEPFYDTLLVSFVNETRVFHFSPDGEVEEKEDDFL 415
Query: 391 GFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVL 450
G TL + N+++QVT + +++ S + W+S S+ A+AN ++
Sbjct: 416 GLVFSRSTLLATNIPENRILQVTESTAKVIDLDSGMVI--WQSSHNESITSASANDDYLV 473
Query: 451 LATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDI--NPIGENPSYSQIAAVGMWTDISV 508
L GG LV + + + +++ ++S + I +PI Q V + +
Sbjct: 474 LVLGGIRLVCISLSTFEQVGSRDFEVDNQVSGMTIPASPI-------QACIVCLPQSAEI 526
Query: 509 RIFSLPDLNLITKEHLG--GEIIPRSVLLCAFEG--ISYLLCALGDGHLLNFLLNMKTGE 564
I LP L + K+ LG GE IPRSV++ L ++ DG + +F ++ T
Sbjct: 527 VILDLPGLEVKNKQALGEPGEAIPRSVIVAEILAGKPPTLFVSMADGTVFSFSFDINTFT 586
Query: 565 LTDRKKVSLGTQPITLRTFSSKNTTH-VFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCP 623
++ K++LG++ + + N + VFA D P++I++S +++YS V+ S +C
Sbjct: 587 ISGSSKITLGSEQPSFKKLPRGNGQYNVFATCDHPSLIHASEGRIVYSAVDSASSSRICS 646
Query: 624 FNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKN- 682
N+ A+P S+A++ + EL I +D+ + I ++P+ RR+ + + F + ++K
Sbjct: 647 LNTQAYPGSIALSNQNELKIALVDEERTTQIHTLPMHASVRRLAYSPLEKAFGLGTVKRT 706
Query: 683 -QSCAEESEMHFVRLLDDQTFEFISTYPLDTFEY-GCSILS---CSFSDDSNVY----YC 733
+ EE FV L D+ F +STY L E C I S C + N+ +
Sbjct: 707 ISNGVEEVSSSFV-LADEIHFRPLSTYDLRPDELVECVIRSQVNCGKDEVGNLMSKDLFF 765
Query: 734 VGTAYVLPEENEPTKGRILVFIVEDGK-LQLIAEKETKGAVYSLNAFNGKLL-AAINQKI 791
VGTA++ ++ +GRIL+F V + L LI +K GA +L + LL A + + +
Sbjct: 766 VGTAFLDDVGDDHIRGRILIFEVNKSRELSLIVDKSLMGACRTLAVMDPSLLVAGLVKSV 825
Query: 792 QLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYK-----HEE 846
++K G L+ + + + GD + V D+MKS+SL+ Y +E
Sbjct: 826 SVFKLARDRFGNIFLEKHTAYRTSTAPIDISVTGDTVAVADVMKSMSLVQYTPAEKDEQE 885
Query: 847 GAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYH 906
EE AR Y W +AV ++++ +YL A+ NL +++N+ G T+ +R RL+ E
Sbjct: 886 PKFEEIARHYQTLWSTAVTLIEEHVYLLADAEGNLVVLQQNTTGVTESDRKRLQPTSEIR 945
Query: 907 LGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRK 966
LGE VNR H V ++ V + TV+G I + + L +LQT +
Sbjct: 946 LGEMVNRI-HPITVQTHTETAVSA--RALLATVDGSIYLFGLINPAYIDLLLRLQTAMAS 1002
Query: 967 VIKGVGGLNHEQWRSFNNEKKTVDAK-NFLDGDLIESFLDLSRTRMDEISKTM---NVSV 1022
+ G + ++R+F + D F+DG+LIE FL + T +EI+ + NV+V
Sbjct: 1003 ITVSPGEIPFSKYRAFRTTVRQSDEPFRFVDGELIERFLSCTPTMQEEIANRLDDSNVTV 1062
Query: 1023 EELCKRVEELTRLH 1036
L + +EEL R+H
Sbjct: 1063 SSLKEMIEELRRMH 1076
>gi|402223178|gb|EJU03243.1| hypothetical protein DACRYDRAFT_115454 [Dacryopinax sp. DJM-731 SS1]
Length = 1175
Score = 346 bits (888), Expect = 3e-92, Method: Compositional matrix adjust.
Identities = 315/1087 (28%), Positives = 512/1087 (47%), Gaps = 128/1087 (11%)
Query: 58 RIGRPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
R GR + Q I+ P R+ Y G KV F G+ K+ ++R+ E +L FL
Sbjct: 109 RAGRIAEYCQTSIVHPSGRVAVTSAYTGSLKVTVFGEDGKGKD-IDVRIREQNLLSFAFL 167
Query: 118 -YGCAKP-TIVVLYQDNKDARHVKTYEV----------ALKDKDFVEGPWSQN---NLDN 162
+ P T+ +L+ D RH+ T + +L D E P + NLD+
Sbjct: 168 PLSTSSPFTLALLHIDQNGKRHLVTRSLKHVPEPGSTKSLASLDISE-PTAHIPDINLDD 226
Query: 163 G--ADLLIPVPPPL----CGVLIIGEETIVYCSAN-----------------------AF 193
A ++ VPPP GV++ GE + +
Sbjct: 227 EWEATSVVGVPPPRPGARAGVMVFGEGQARWYAVKEEGAPDPKGKRRSSTAGSPTQEKKG 286
Query: 194 KAIPIRPSITKA---YGRVDA----DGSRYLLGDHAGLLHLLVITHEKEK--VTGLKIEL 244
K+ P R K +GR+ A D + L+ D G L+LL + + + V ++ E
Sbjct: 287 KSKPPRVEEAKVGLPWGRIAATCAIDEEKLLVSDEYGKLYLLALHRRQSEGPVVSIQREA 346
Query: 245 LGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCV 304
LGE S S+I++L ++ ++ S GDSQLI L + VL+ ++NL PI DF V
Sbjct: 347 LGEISSPSSITHLTSSYFFVTSKCGDSQLISLTPSAICGTGNIAVLDTHLNLAPISDFVV 406
Query: 305 VDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVE-LQGIKGMWSLRSSTDDPFDT 363
D E+ GQ Q+VT SG GSLR+VR+G+ E +E + + ++ L+ + D +
Sbjct: 407 TDPEKIGQQQIVTASGRMNTGSLRVVRSGVAFEELGLLEQVMNTRSIFPLKINFDAEYHL 466
Query: 364 FLVVSFISETRILAMNLED------ELEETEIEGFCSQTQTLFCHDAIYNQL-VQVTSGS 416
+LV S + T + +M +D E++E+E G Q TL + + + VQVT+GS
Sbjct: 467 YLVTSSLHRTTLYSMRGDDSSPQFEEVDESEARGIVRQVPTLAAGNIREDDIFVQVTAGS 526
Query: 417 VRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYL--EIGDGILTEVKHA 474
V + + E R+ WK P + A N +V+ GG+LV+L + I+ +
Sbjct: 527 VVAIDLITWEQRSIWK--PHVEIVCAAVNGQRVVAGLKGGYLVHLIPQRDQFIVANDWKS 584
Query: 475 QLEY---EISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEII-- 529
+ EISC+ ++P + AA+G W V+IF L D +E E I
Sbjct: 585 PGSWGLTEISCIALDPT-LSAELGNYAAIGFWGTNQVKIFYLGDETHRFEELKLAEPISP 643
Query: 530 ----PRSVLLCAFEGIS---YLLCALGDGHLLNFLLNMKT--GE----LTDRKKVSLGTQ 576
P S +L F G S +LL LG+G + ++ L + GE TD+K SLGT+
Sbjct: 644 EDHLPVSTILTTF-GTSNRPHLLVGLGNGTISSYALKTEIVLGEPSVRATDKKIFSLGTK 702
Query: 577 PITLRTFSS-KNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAI 635
PI L + +++FA DRP +++ N +L S + L+++ + FP S
Sbjct: 703 PIMLNACTDLGRESNIFACGDRPALLFLKNDRLTASPIKLRDIHAGSVLHIPQFPSSFIF 762
Query: 636 AKEGELTIGTIDDIQKLHIRSIPLG-EHPRRICHQEQSRTFA-ICSLKNQSCAEESEMH- 692
A L IG I + QK+ +R+I LG + P R+ + R + +C K + ++ E++
Sbjct: 763 ASASTLLIGQIRESQKIDVRTISLGLDTPIRLTYHRGLRAYGVVCQRKELNREDDREIYS 822
Query: 693 -FVRLLDDQTFEFISTY---PLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTK 748
+L DD TFE+++ + P + +I + +DS+ + VGT E + +K
Sbjct: 823 SSFKLFDDITFEYLNNFTARPDEQMMCVTTIPDSTGEEDSD-FLVVGTYEATGAEEDVSK 881
Query: 749 GRILVF-IVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQ 807
GRIL+F V + KL+L+ + G VY++ L AAIN +Q++ D R ++
Sbjct: 882 GRILIFEEVPNRKLKLVVSHDVGGCVYAVTNVGANLAAAINGTLQVFSLHRSHDDIR-IE 940
Query: 808 SECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEIL 867
S +A + RG+ ++VGD M+++ +L + + +E DY + W+ +E +
Sbjct: 941 SVAKWSSAYVASSLICRGNTLLVGDAMRAVCILRWTGAK--LETLYHDYASLWIQTLESI 998
Query: 868 DDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLP--D 925
D+ +GAE N N+ T RK+ G+LE G ++ GE +NRFR SL P
Sbjct: 999 DEGGVIGAELNNNIVTWRKD---------GKLERDGMWYFGEGINRFRRASLNAAAPGAG 1049
Query: 926 SDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNE 985
+ ++F T G IG++ASL + + L LQ N+ VI+G G + H + R+ +
Sbjct: 1050 GNNAGRGNLVFCTNTGRIGIVASLDEDLSMQLSNLQRNIGSVIQGPGKVEHSRRRAPQST 1109
Query: 986 K-------KTVDAKNFLDGDLIESFLDLSRTRMDEISKTM--NVSVEELCKR-------V 1029
+ A F+DGD +E FL L + DE+ + + E+L R +
Sbjct: 1110 PGLPTILTQNTHAVGFIDGDFVEKFLYLDPSS-DEVMRILEGKNKYEQLGNRYHDIVQML 1168
Query: 1030 EELTRLH 1036
EEL RLH
Sbjct: 1169 EELQRLH 1175
>gi|328858656|gb|EGG07768.1| hypothetical protein MELLADRAFT_105631 [Melampsora larici-populina
98AG31]
Length = 1216
Score = 344 bits (883), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 323/1194 (27%), Positives = 535/1194 (44%), Gaps = 191/1194 (15%)
Query: 4 VPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPT 63
+P++G I ++ + L + T L++ +S +IT A ++ RP
Sbjct: 53 LPVFGLIDHIQSYIKQPNQTATLLVLTSELNLFTLRYCPQSKSVITTATISLNQPGARPA 112
Query: 64 DNGQIGIIDPDCRLIGLHLYDGLFKVIPF------------------------------- 92
D Q I+DP R + LH +G+ +IP
Sbjct: 113 DYLQTSIVDPHSRCLVLHALNGILHLIPLEPSSKPKPKHLEPSLGGRRKRSTISLKSNHP 172
Query: 93 ---------DNKGQLKEAFNIRLEELQV--LDIKFLYGCAKPTIVVLYQDNKDARHVKTY 141
D +L + +RL E+ V LD L PT+++LY ++ R +++
Sbjct: 173 PNPDLDTNIDPDSELYQIVQLRLNEVNVHALDFAALPAHLPPTLLILYSNHMGQRVLRSR 232
Query: 142 EVALK----DKDFVEGPWSQNNL--DNGADLLIPVPPPLCGVLIIGEETI--VYCS---- 189
+ L D+D N+L D L+IP+P VL++GE+++ VY +
Sbjct: 233 SIDLITANCDQDICP-----NHLCPDLDTSLIIPIPDDQGSVLLVGEDSVELVYLTDRLP 287
Query: 190 ------ANAFKAIPIRPSITKA----------------------YGRVDADGSRYLLGDH 221
+ +++ R T+A + +V+ +L+GD
Sbjct: 288 NPTGKGKSRDRSVDSRTGATRAPESDKLLKHVTSINIPLGSYTCFCKVEDQPDVWLIGDL 347
Query: 222 AGLLHLLVITHEKEKVTG--LKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQ 279
G H+LVI E+ + L+ + G+ S + Y+ + +Y+ S YGDSQL++++
Sbjct: 348 YG--HVLVIRLERPEGGKPLLRYQQAGQVSSPEALVYISDRFIYLASHYGDSQLLRISAL 405
Query: 280 PDA--KGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGIN 337
A + EV+ Y NL PI D CVVD Q+VTCSGAY+DGSLRI+ +GI +
Sbjct: 406 DSALAQDCQPEVVANYPNLAPISDVCVVDQSEGFDHQLVTCSGAYQDGSLRIITHGITLT 465
Query: 338 EQASVELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAM---NLEDELEETE-IEGFC 393
+ + + G + +WS+ T L+V F +ETR L + ++EE + GF
Sbjct: 466 DLGMLPIAGAEHIWSI--DTPHSPQVTLIVGFRNETRFLIIENDQFSQDVEELDSFSGFK 523
Query: 394 SQTQTLFCHDAIYNQ------LVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANAS 447
S +T+ I +QVT V +V R W+ + VA AS
Sbjct: 524 SDRRTILAGQIISQGHHVQAFPIQVTQEEV-IVGEMFR-----WEPTSNDLITVAAIGAS 577
Query: 448 QVLLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDIS 507
++A ++ L + D +L + + + E+SCL I+P ++ AVG W S
Sbjct: 578 LTVVAL-QREVLLLHVKDNLLVQSESIRFPNEVSCLAIDP------SQKVLAVGQWISNS 630
Query: 508 VRIFSLPDLNLITKEHLGGEIIPRSVLLCAF-EGIS-YLLCALGDGHLLNFLLN-MKTGE 564
+ + SL +N+ T G + SVL F EGI +LL + DG L++ L ++ G
Sbjct: 631 IALVSLDQMNVSTTIETGSDFGVHSVLFTNFGEGIDPHLLAGMDDGELISIKLQGLENGS 690
Query: 565 LT---DRKKVSLGTQPITLR-------TFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVN 614
L R+ V LG +PIT+ T V+ S+ PT++ + +L Y V
Sbjct: 691 LVQEQSRRTVVLGHRPITMSRIVLDIPTDQQLGKPAVWIHSEHPTLMIEVDGRLKYRPVA 750
Query: 615 LKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPL-GEHPRRICHQEQSR 673
L + + + + EG + +G ID ++K+HI IPL GE P+RI H + R
Sbjct: 751 LGDATDAIKIGTRGVG---VVTSEG-IRLGRIDSLEKIHISKIPLGGEQPKRIAHSKSMR 806
Query: 674 TFAICSLK---NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNV 730
+ + + NQ E + V+++DD TFE + + L + E G SI + D
Sbjct: 807 AYGVVCVSQKVNQQTGELDKGSSVKIIDDVTFELLFDFQLLSIEQGTSIAAIELGKDLIE 866
Query: 731 YYCVGTAYVLPEENEPTKGRILVFIV-----EDGKLQLIAEK---ETKGAVYSLNAF-NG 781
+ +GT +V P E++ GRIL + ++G L+ K + KG V+ L G
Sbjct: 867 DFIIGTGFVNPNESQSNTGRILTIGLSSKHDQEGNLREFKLKRMTKVKGTVHGLGGLPGG 926
Query: 782 KLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLI 841
K +A+ N + + ++ + G + + V T ++I+VGDL KSI +L
Sbjct: 927 KFVASANAFVHAFGINEEEEDEGFEVLDTWGGGFV-SQTVLTEKNWIIVGDLYKSIVVLE 985
Query: 842 YKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTV--RKNSEGATDEER--- 896
+ ++ +++ RDY+A + + ++ D +++ A+ FNLFTV R+ +G +E+
Sbjct: 986 FDLKKFSLKVLGRDYSAMSVRPIGMISDRVFVAADTEFNLFTVEMRERQKGLKEEDEDEE 1045
Query: 897 -----------------------------GRLEVVGEYHLGEFVNRFRHGSLVMRLPD-- 925
L+ VG +HLGE VN F+ GSLV L
Sbjct: 1046 GLSVEEEKGDDDEWEEEERRMRVEKVFNDDHLDTVGGFHLGENVNHFKAGSLVKSLKHFY 1105
Query: 926 -SDVGQIPTVIFGTVNGVIGVIASLPHEQ-YLFLEKLQTNLRKVIKGVGGLNHEQWRSFN 983
D+ +IF + G IGVI L + Y L+ L+ L+K I +GGL+ ++R F
Sbjct: 1106 GQDLKYGGKLIFVSSTGGIGVIIKLEDLKIYKHLKALEDRLKKEILSIGGLDSTEFRKFK 1165
Query: 984 NEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN-VSVEELCKRVEELTRLH 1036
N+ K VD FLDGDLI+ + S R I + + V ++E+ VEEL+ H
Sbjct: 1166 NKWKKVDGCEFLDGDLIQRLYNDSFKRKSLIKEDQDEVGIDEV---VEELSWFH 1216
>gi|403415203|emb|CCM01903.1| predicted protein [Fibroporia radiculosa]
Length = 1267
Score = 341 bits (874), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 278/967 (28%), Positives = 450/967 (46%), Gaps = 157/967 (16%)
Query: 205 AYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYI 264
A+ D G R+++GD G L +L + GL + LGETS A +++YL + V+Y+
Sbjct: 323 AWCPADDQGRRFIIGDTFGRLAMLAF----DDANGLLLIPLGETSSAVSLTYLSSQVLYL 378
Query: 265 GSSYGDSQLIKLNLQP----DA-------------------------------------- 282
GS +G+SQL++++ P DA
Sbjct: 379 GSHFGESQLLRIHQSPFTERDAETLPIPDHVKSISAVSLPGGKGKERLDDSLEPSKKERG 438
Query: 283 ------KGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGI 336
KG +VEVL+ Y N+ PI+D +VDL+ GQ Q++ CSG G+L+I+R G
Sbjct: 439 GQVVLGKGKFVEVLDSYENIAPIMDAVMVDLDGSGQPQIIACSGGRNSGALKIIRTGADF 498
Query: 337 NEQASVE-LQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEI--EGFC 393
EQA + ++ + +W +RS +D D+ L+ + + ET + +++ + + + GF
Sbjct: 499 QEQAVIRGIENVTDIWPIRSHYEDIIDSHLIATTLHETLVFSLDGRNAVTHMDPSDHGFI 558
Query: 394 SQTQTL----------------FCHDAIYNQLV-QVTSGSVRLVSSTSR-----ELRNEW 431
+++ TL + + LV QV+ VR++ + + + W
Sbjct: 559 TRSPTLAVGNIPRRATTQNGGRVVSSYVDSSLVLQVSQEKVRVIEHDAALGLFVPVGDGW 618
Query: 432 -KSPPGYSVNVATANASQVLLATGGGHLVYLEIGDG-ILTEVKHAQL------EYEISCL 483
+ G ++ A N+SQ +L GG L+ L +G+ + E+K+ EIS +
Sbjct: 619 DATKEGRTIVAAAINSSQFVLGLSGGRLILLNLGENEQIQELKYRDFGDRTYGMLEISAV 678
Query: 484 DINPIGENPSYSQIAAVGMWTDISVRIFSLPD----LNLITK-------------EHLGG 526
+P Y+ V W + I SL L I + + G
Sbjct: 679 SCSPFDTTKKYATCIGVSFWGTNRIAILSLESATSYLTTICESDPLPSLPRSLLLHNFGQ 738
Query: 527 EIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSK 586
+ P+ + ++L L DG L++F K +L D+K +LG P+ L +
Sbjct: 739 DHNPKGA-----DYHPHVLAGLVDGTLVSFSFREK--DLKDKKTFALGDVPVALAKSTVD 791
Query: 587 NTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTI 646
+FA+ R +V+Y ++L S V +K+ +S+AFP L +A L IG +
Sbjct: 792 GKPAIFASGSRASVLYWDRQRLHQSPVMIKDAVRGASISSSAFPSCLILATRSTLMIGNV 851
Query: 647 DDIQKLHIRSIPLG-EHPRRICHQEQSRTFAICSLK---NQSCAEESEMHFVRLLDDQTF 702
+ K+ I S+ +G ++PRRI H R FA+ ++ N+ + ++LLDD TF
Sbjct: 852 RGVDKMQINSVRMGLDNPRRIAHHPTLRVFAVACIQTKPNRIGEPQETTSSLKLLDDTTF 911
Query: 703 EFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDG--- 759
I+++ ++ E S+L+ S SD S+ +CVGT P E EP+ GRIL+F + G
Sbjct: 912 NRIASFACESDEEVTSVLTLSSSDVSSARFCVGTVQFKPGETEPSSGRILLFSLNTGPES 971
Query: 760 KLQLIAEKETKGAVYSLNAFNGKLLAAIN------------------QKIQLYKWMLRDD 801
QL++ G VY L + G + AA+N K+ L+K ++
Sbjct: 972 SFQLVSSTPVSGCVYQLVSIQGMIAAAVNTSVRTAYVFIAFDPNMTIHKVILFKPEKLNN 1031
Query: 802 GTRELQ--SECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNAN 859
T L SE H+ + L V G ++VGD + SIS + K ++ +E ARDY+
Sbjct: 1032 STVVLTKVSEWNHNYSVTGLVVH--GCMLIVGDAISSISFV--KVDDTTLESIARDYSPL 1087
Query: 860 WMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSL 919
W +VE +D D +GA ++ NLFT G R LE G Y+LG+ VN+F GSL
Sbjct: 1088 WPVSVEAMDGDGVIGANSDCNLFTFALQRSG----HRSTLERNGSYYLGDMVNKFLRGSL 1143
Query: 920 VMRLPDSDVGQI-PTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQ 978
+ S+ I P +F T G IGVI + + L + LQ N+ K I G GG++H
Sbjct: 1144 T-NIDISERKSIEPKHLFFTSTGRIGVILEMNDKISLHMTGLQRNMGKRIIGPGGVHHAT 1202
Query: 979 WRSFNNEKKTVDAK---NFLDGDLIESFL------DLSRTRMDEISKTMNVSVEELCKRV 1029
+R+ N K DA+ FLDGD +E +L DL + MD TM+ S ++ +
Sbjct: 1203 FRAPANSKGHSDAEAAFGFLDGDFLEQYLGYADYTDLLQGDMDAEQVTMSHS--QIQDVL 1260
Query: 1030 EELTRLH 1036
E+L LH
Sbjct: 1261 EQLQSLH 1267
Score = 47.0 bits (110), Expect = 0.051, Method: Compositional matrix adjust.
Identities = 41/179 (22%), Positives = 76/179 (42%), Gaps = 12/179 (6%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATER-----YKFCVLQWDAESSELITRAMGDVSDRIG 60
I+GRI +L+ Q L + T+ ++ D ++ L+++ D+ DR
Sbjct: 54 IWGRIVSLKAIPSKDSKQSNLLVLTDHPDPKLIPLTLVLDDTGNASLLSKETIDLHDRYA 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RP + +DP + + Y G KVI F N+ + +I + EL +L + LY
Sbjct: 114 RPAEFVTDVYVDPTGEVAVVSCYTGRLKVIEFKNRERTGIPLDISIPELYILAMTLLYTT 173
Query: 121 AKP-TIVVLYQDNKDARHVKTYEVALKDKD------FVEGPWSQNNLDNGADLLIPVPP 172
T+ +L+ D++ + + ++ L++ S N + LIPV P
Sbjct: 174 KDLYTLSILHYDHQGRLQLLSRDLDLRELSASPSTLLFSTILSSNVPSDSTPFLIPVLP 232
>gi|392591958|gb|EIW81285.1| hypothetical protein CONPUDRAFT_56293 [Coniophora puteana RWD-64-598
SS2]
Length = 1245
Score = 339 bits (869), Expect = 6e-90, Method: Compositional matrix adjust.
Identities = 327/1158 (28%), Positives = 532/1158 (45%), Gaps = 185/1158 (15%)
Query: 48 ITRAMGDVSDRIGRPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLE 107
IT+ + +++R GR + ++DP +L + Y G KV+ D+ G+ F++
Sbjct: 104 ITKKLS-LAERTGRIAEFFVDLLVDPTGQLAIVSTYAGKLKVVQLDD-GEYDSDFDVSTT 161
Query: 108 ELQVLDIKFLYGCAKP-TIVVLYQDNKDARHVKTYEVALKDKDFVEGPWS---------- 156
EL +L + + T+ +L+ D + + T +++LK+ P +
Sbjct: 162 ELNLLSLATILPSPDDLTVALLHVDYQQRLQLLTRDLSLKELQLSPLPSAVLPSVPLAAK 221
Query: 157 QNNLDNGADLLIPVPPP-------LCGVLIIGEETI-VYCSANAFK-------------- 194
++D LI VP GVL+IG I +Y A A K
Sbjct: 222 HFSVDEEVPKLIHVPISEKDEESFAGGVLVIGGSKIFLYDLAPAEKQRKQKEKQRRTQKK 281
Query: 195 -----------------AIPIRPSI------TKAYGRVDADGSRYLLGDHAGLLHLLVIT 231
R S+ A+ +VD G R+LLGD G L +L +
Sbjct: 282 KASADSDEVMKDEGKEKKKKARASVEWPWAEVSAWCQVDDVGRRFLLGDKFGRLVMLCV- 340
Query: 232 HEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQP----------- 280
+ + L + LGETS A+T++YL N VVY+GS YGDSQL++++
Sbjct: 341 -DISSNSSLTLIALGETSPATTLTYLSNQVVYLGSHYGDSQLLQISPSRLSDSDTPTLPV 399
Query: 281 ----------------------------DAKGSYVEVLERYVNLGPIVDFCVVDLERQGQ 312
+ +GS++ ++R+ N+ PI D +VD + GQ
Sbjct: 400 PGDIHTVTHIPHSSPGSDYDRRHRGFIVNGRGSHLTEVDRFQNIAPIYDAALVDPDNSGQ 459
Query: 313 GQVVTCSGAYKDGSLRIVRNGIGINEQASVE-LQGIKGMWSLRSSTDDPFDTFLVVSFIS 371
+VVTCSG GS+R+VR+G E A++ + I +W +RS DP T +V S
Sbjct: 460 YEVVTCSGGQNTGSIRLVRSGADFQEAAAIGGIPNITNIWPVRSLYSDPIHTHIVASTPQ 519
Query: 372 ETRILAMNLEDELEETEIEGFCS-QTQTLFCHDA--------------IYNQLV-QVTS- 414
ET + ++ + + GF + + +TL + + + LV QVTS
Sbjct: 520 ETYVFRIDNKHTVSHVADSGFVTGKKRTLVVQNLQKKVPKEAGQGLTYMDSSLVLQVTSD 579
Query: 415 -----------GSVRLVSSTSRELRNEWKSPPGY---SVNVATANASQVLLATGGGHLVY 460
G V ++ PG+ + A+ANASQV+L G L
Sbjct: 580 ELVLLNLDKGLGEFTRVGDGVCKMGQLAGGKPGWIDREIVAASANASQVVLGLSFGRLAV 639
Query: 461 LEIGDGILTEVKHAQLE------YEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL- 513
+ D V + EIS + P ++ S+S +AAV W V I SL
Sbjct: 640 VNFADNKFKLVGYRDFTNPSGGIAEISAVSCTPADQSKSFSPMAAVAFWQTHRVEIISLG 699
Query: 514 -PDLNLITKEHLGGEIIPRSVLLCAFEGIS-----YLLCALGDGHLLNFLLNMKTGELTD 567
P L L +PRS+L+ F G S +LL DG + F+L K L +
Sbjct: 700 SPFPTLCASVSLPS--LPRSLLMHRFAGESANAPPHLLVGRADGVVATFVLKDKA--LVE 755
Query: 568 RKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSA 627
+K++ +G P+T T +K T VFA R +V++ +L +S + LKEV+ ++
Sbjct: 756 QKQIPVGNLPVTFHTCKAKGRTAVFACGSRTSVLFWEKDRLRHSPLILKEVAAAASLHTH 815
Query: 628 AFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG-EHPRRICHQEQSRTFAICSLKNQSC- 685
+ SL +A L IG + +++KLHIRSI G ++PRRI H + A+ +++
Sbjct: 816 DYRSSLVLATSEGLVIGDVQNLEKLHIRSIHTGLDNPRRISHSPVHKALAVGCVRHTPVR 875
Query: 686 --AEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSI--LSCSFSDDSNVYYCVGTAYVLP 741
E V+L +D T + + LD E +I LS ++++ + VGT +
Sbjct: 876 VGEPEISRGSVQLYNDTTLDKLGQVVLDHDEEPMAIKALSVRVAEEAKDCFVVGTVIIDS 935
Query: 742 EENEPTKGRILVFIVEDGK----LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWM 797
ENE + GR+L+ + + + + A ++ KG VY++ A +G ++AA+N + +Y +
Sbjct: 936 LENESSSGRLLLVEPDYSRGESFVAVSASEKVKGCVYAVAAVDGLVVAAVNSAVVIYS-I 994
Query: 798 LRDDGTREL---QSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERAR 854
DD TR L + +H +++A V +RG+ ++VGD + S++LL Y E GA++ AR
Sbjct: 995 EADDHTRALSFVKKVEWNHNYVVANLV-SRGNLLLVGDAISSVTLLQY--ERGALQNVAR 1051
Query: 855 DYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRF 914
DY+ W ++VE+LD+ +GA+N+ NLF +GA ER LE G Y+ G+ VN+F
Sbjct: 1052 DYSPLWPTSVEMLDERNVIGADNDCNLFMFTLQ-DGA---ERKVLERNGHYYFGDMVNKF 1107
Query: 915 RHGSLVMRLPD---SDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKG- 970
G + L SD+ P +F T G IGV+ + E L + LQ NL
Sbjct: 1108 IPGEIYRALSSFEASDIEVEPKQLFFTTTGSIGVVIDMSDELSLHMSSLQRNLSTYFAAQ 1167
Query: 971 VGGLNHEQWRSFNNEKKTVDAKN----FLDGDLIESFL------DLSRTRMDEISKTMNV 1020
GG +H ++R+ N + DA N FLDGDL+E FL + R ++ ++ +
Sbjct: 1168 PGGASHTKYRAPKNARGRSDADNSSFGFLDGDLLERFLLFGDDEEAVRKVLEGSTEAEQL 1227
Query: 1021 SV--EELCKRVEELTRLH 1036
S+ E + K +E L +H
Sbjct: 1228 SIAPERIIKVLERLQSMH 1245
>gi|170090007|ref|XP_001876226.1| predicted protein [Laccaria bicolor S238N-H82]
gi|164649486|gb|EDR13728.1| predicted protein [Laccaria bicolor S238N-H82]
Length = 1275
Score = 337 bits (865), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 335/1235 (27%), Positives = 532/1235 (43%), Gaps = 214/1235 (17%)
Query: 4 VPIYGRIATLELFRPHGEAQD--FLFIA---TERYKFCVLQWDAESSELITRAMGDVSDR 58
V I+G++ T++ G A+ L IA E F + ++ L + +R
Sbjct: 53 VEIWGKVLTVKAIPQIGSARSTIVLMIAHPDPELVFFTYIDRQGDNPALEVSKKLSLFER 112
Query: 59 IGRPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLY 118
I RP + ++ P +L + Y G K+I G + ++++L E+ V + FL
Sbjct: 113 IARPAEFFNDVLVHPSGKLALVSCYSGKLKIIRLKC-GTYDKDYDVQLPEVNVFSVTFLS 171
Query: 119 GC-AKPTIVVLYQDNKDARHVKTYEVALKDKDFV----------EGPWSQNNLD---NGA 164
+ + +LY D ++ + ++ + D P S L
Sbjct: 172 SFNGEYAVAILYLDYQERVQLVARDILVNGDDITVSEEPSTLLHPTPISNKTLPFPTESV 231
Query: 165 DLLIPVPPP------------LCGVLIIGEETIVYCSANAFKA----------IPIRPS- 201
L+ VPP L GVL++G I+ ++ K+ + R S
Sbjct: 232 PQLVSVPPEELDEDTDVDDAFLGGVLVVGGRQILLFELSSEKSQEKQRGKRKRLESRKSS 291
Query: 202 -----ITKAY-------GR--------------------VDADGSRYLLGDHAGLLHLLV 229
I KA GR +D RYL+GD G L +L
Sbjct: 292 KDVTEIAKAREKEKEREGRRRKPKSSIEWPFSELAAWCAIDKAPYRYLIGDSFGRLSILS 351
Query: 230 ITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIK-------------L 276
+ +K + GL + GE S ++++YL N +VY+GS GDSQLI+ L
Sbjct: 352 L--DKVQRIGLVLIPFGEASSPTSLTYLTNQLVYVGSHSGDSQLIRLSPLPISSSESPTL 409
Query: 277 NLQPDAK--------------------------------------------GSYVEVLER 292
+ P+ K GS++EVL
Sbjct: 410 PIPPEIKTVSPNLLEALGHRKGKSKSVAEDHMDEDHDDEDDASQGYIVETHGSFIEVLSS 469
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVE-LQGIKGMW 351
Y NL PI+D +VD + GQ +VTCSGA GS+ IVRNG E V L G+ G+W
Sbjct: 470 YKNLAPILDAILVDTDGSGQQHIVTCSGARSTGSINIVRNGADFQEIGHVPGLTGVVGVW 529
Query: 352 SLRSSTDDPFDTFLVVSFISETRILAMNLEDE------LEETEIEGFCSQTQTLFCHD-- 403
S+R+ +D D +++VS T + ++ ++ T I + TL +
Sbjct: 530 SVRTMLEDTTDRYILVSTNRSTHLFEIDDSGSTSTITPVDSTTIRSLVTTEATLAFSNLA 589
Query: 404 --------AIYNQ---LVQVTSGSVRLVSSTSR----ELRNEWKS----PPG---YSVNV 441
++Y VQV + RL+ S + EL ++ P G V
Sbjct: 590 RRSSVGGSSVYKNSPLAVQVVASGARLLKSNTAFGGYELVAQYSMLSSVPYGQRPLEVVA 649
Query: 442 ATANASQVLLATGGGHLVYLEIGDGI--LTEVKHAQLEY--EISCLDINPIGENPSYSQI 497
A+ANASQ+++A GG L +G+ + L + Q + EIS + P+ S
Sbjct: 650 ASANASQLVMAASGGKLTLWRLGENVDALEMIAGCQRKEGPEISAVSCAPLNTTKRTSPT 709
Query: 498 AAVGMWTDISVRIF--SLPDLNLITKEHLGGEIIPRSVLLCAFEGIS---------YLLC 546
V W ++ I SL L + K ++ RSVLL F + YLL
Sbjct: 710 IIVSYWQSNTIEILQVSLKGLESVYKSPTLPALV-RSVLLYNFGSDTNPKGTDHHPYLLA 768
Query: 547 ALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNK 606
L +G + +F K +LTD+K + LG P+ L + VFAA +R TV+ NK
Sbjct: 769 GLANGTVASF--RWKDKQLTDKKIIPLGHAPVNLMPCQVEGRHAVFAAGNRATVLSFENK 826
Query: 607 KLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG-EHPRR 665
+L++S + LK++S N+ F SL ++ L IG + + KLHIRSIP G ++PR+
Sbjct: 827 RLVHSPIMLKDISSAARLNTPTFASSLILSTPTGLFIGRVQGLGKLHIRSIPFGFDNPRK 886
Query: 666 ICHQEQSRTFAICSL---KNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSC 722
I H+ + F + N+ E +LLDD +F + + D E +++S
Sbjct: 887 IAHEPSIKAFGVAFTTMEPNRVNDPEISRSSFKLLDDTSFANLCQFNCDPDEETTAVVSF 946
Query: 723 S--FSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDG-----KLQLIAEKETKGAVYS 775
S + ++CVGT E EP+ GR+++F L L+A + G VY+
Sbjct: 947 SQKIAGKPMPFFCVGTYVYKAGEVEPSAGRLMIFTATTSTSSNLALSLMASTKVPGCVYA 1006
Query: 776 LNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGH-HGHILALYVQTRGDFIVVGDLM 834
L +++AA+N + L++ D + H + L + + D +VVGD
Sbjct: 1007 LTVVQNQIVAAVNSSVMLFRLESSSDSLSPSLIKVSEWHHNYLVTSLGSYADRVVVGDQP 1066
Query: 835 KSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDE 894
SISLL + + + +ARDY W VE LD+ +GA ++ NLFT + E A
Sbjct: 1067 SSISLL--QVTQSKLISQARDYGPLWPVCVEALDERHIIGANDSLNLFTF--SLEKAMG- 1121
Query: 895 ERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQY 954
R RLE G YH+ + V +F GSL + +F T +G IGV+ + E+
Sbjct: 1122 -RSRLERDGCYHVADLVTKFLRGSLSSSDASTTSPLTSEAMFFTSSGRIGVVVDVKDEEL 1180
Query: 955 -LFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKN----FLDGDLIESFLD--LS 1007
L L +Q NL VI+GVGG +H ++R+ + T DA + FLDGD +E FL LS
Sbjct: 1181 SLQLTNMQRNLANVIQGVGGSSHSKYRAPKTTRGTSDADSGAIGFLDGDFLEQFLTHVLS 1240
Query: 1008 RTRMDEI------SKTMNVSVEELCKRVEELTRLH 1036
+ +++ + + ++ E L +E+L LH
Sbjct: 1241 PQQSEKVIQGQSPPERLTITREALQMVIEDLQSLH 1275
>gi|166158025|ref|NP_001107422.1| damage-specific DNA binding protein 1, 127kDa [Xenopus (Silurana)
tropicalis]
gi|157422734|gb|AAI53474.1| Zgc:63840 protein [Danio rerio]
gi|163916541|gb|AAI57552.1| LOC100135265 protein [Xenopus (Silurana) tropicalis]
Length = 306
Score = 337 bits (864), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 172/309 (55%), Positives = 220/309 (71%), Gaps = 15/309 (4%)
Query: 739 VLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWML 798
V PEE EP +GRI+VF DGKLQ +AEKE KGAVYS+ FNGKLLA+IN ++LY+W
Sbjct: 2 VCPEEAEPKQGRIIVFHYTDGKLQTVAEKEVKGAVYSMVEFNGKLLASINSTVRLYEWT- 60
Query: 799 RDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNA 858
+EL++EC H+ +I+ALY++T+GDFI+VGDLM+S+ LL YK EG+ EE ARD+N
Sbjct: 61 ---AEKELRTECNHYNNIMALYLKTKGDFILVGDLMRSVLLLAYKPMEGSFEEIARDFNP 117
Query: 859 NWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGS 918
NWMSAVEILDDD +LGAEN FNLF +K+S TDEER L+ VG +HLGEFVN F HGS
Sbjct: 118 NWMSAVEILDDDNFLGAENAFNLFVCQKDSAATTDEERQHLQEVGLFHLGEFVNVFSHGS 177
Query: 919 LVMR-LPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHE 977
LV++ L +S +V+FGTVNG+IG++ SL Y L LQ L KVIK VG + H
Sbjct: 178 LVLQNLGESSTPTQGSVLFGTVNGMIGLVTSLSEGWYSLLLDLQNRLNKVIKSVGKIEHS 237
Query: 978 QWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV----------SVEELCK 1027
WRSF+ E+KT A F+DGDLIESFLDL + +M E+ T+ + +V+E+ K
Sbjct: 238 FWRSFHTERKTEQATGFIDGDLIESFLDLGQAKMQEVVSTLQIDDGSGMKREATVDEVIK 297
Query: 1028 RVEELTRLH 1036
VEELTR+H
Sbjct: 298 IVEELTRIH 306
>gi|392566425|gb|EIW59601.1| hypothetical protein TRAVEDRAFT_167065 [Trametes versicolor FP-101664
SS1]
Length = 1263
Score = 337 bits (864), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 272/916 (29%), Positives = 418/916 (45%), Gaps = 133/916 (14%)
Query: 205 AYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYI 264
A+ VDA R LLGD G L +L ++ T L LGETS A++++YL + V+Y+
Sbjct: 329 AWSAVDARCRRVLLGDAFGRLAMLAFNNDAVAFTLLP---LGETSPATSLTYLSSQVLYV 385
Query: 265 GSSYGDSQLIKLNLQPDA------------------------------------------ 282
GS +GDSQ ++++ P A
Sbjct: 386 GSHFGDSQQLRIHPAPIANASVDTLPIPKGVSTVSPSALASSPSKGKGRSMDLDDGLVRE 445
Query: 283 ------KGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGI 336
KG+Y+EVL+ + N+ PI+D + D++ GQ QV+T SGA GSLR++R
Sbjct: 446 GRVIQTKGTYLEVLQTHDNVAPIMDAVLADIDGSGQPQVITASGARNTGSLRVIRTEADF 505
Query: 337 NEQASVE-LQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETE--IEGFC 393
EQA ++ L GI +W +R +D T LVVS ET +L+ +D + E + GF
Sbjct: 506 QEQAKLDGLPGITDIWPVRPRFNDATHTHLVVSTSRETLVLSFAGQDTITHVEPSVAGFA 565
Query: 394 SQTQTLFCHDAIYNQ-----------------LVQVTSGSVRLVS-----STSRELRNEW 431
+ T + Q + Q+TS VRLV + W
Sbjct: 566 THVPTYAIGNVPRRQTTTSGGRTTSSYVDSPLVAQITSEGVRLVEYDPTLNGFTPFGAGW 625
Query: 432 -------KSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTEVKHAQLEYEISCL 483
S G + A + SQ ++ GG L L +G + L VK EI +
Sbjct: 626 YPKKSGDASMAGRDIVAAAMSPSQFVVGLSGGRLALLNLGANDALQVVKTRDFPDEICAI 685
Query: 484 DINPIGENPSYSQIAAVGMWTDISVRIFSL--PDLNLITKEHLGGEIIPRSVLLCAFEGI 541
P +Y+ AV W+ + + SL P L + +PRS+LL F
Sbjct: 686 SCTPFDPTKNYATHIAVSFWSSNKIAVLSLDSPASYLGPICEVSLPTLPRSILLHNFGAG 745
Query: 542 S---------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
+ +LL L DG+++ F L + GEL D K SLG P +L + T V
Sbjct: 746 ARPKDADFRPHLLAGLADGNVMTFAL--RDGELHDSKASSLGNAPASLSLCAVDGRTVVL 803
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
A R V+Y +++ S V +K ++ N+AAFP LAIA L IG + + K+
Sbjct: 804 AGGARSNVLYWDRQRIRPSPVGVKYMARGATLNTAAFPSCLAIATSSALLIGNVRGVDKM 863
Query: 653 HIRSIPLG-EHPRRICHQEQSRTFAICSLKNQSCA---EESEMHFVRLLDDQTFEFISTY 708
IR+IPLG ++PR++ + + F + + E +LLD +F+ + +
Sbjct: 864 QIRTIPLGMDNPRKMAYHAGQQVFGVACARTAPGRVGDNEDITGSFKLLDAHSFQQLHHF 923
Query: 709 PLDTFEYGCSILSCSFSDD-SNVYYCVGTAYVLPEENEPTKGRILVFIV--EDG--KLQL 763
D E S+L+ S+ +C+GTA + PEE EP+ GRIL+F + E+G L
Sbjct: 924 QCDPDEEPASVLALPGEGSASSPAFCLGTAVIRPEEREPSNGRILLFSLSSENGVRSLTT 983
Query: 764 IAEKETKGAVYSLNAFN-GKLLAAINQKIQLYKWMLRD-------DGTRELQSECGHHGH 815
+A + +G VY+L + G + AAIN + LYK +R+ D + +E H+
Sbjct: 984 VASHKVRGCVYALQHVSEGVIAAAINTSVLLYK--IREGNLGEGFDRVLDKAAEWNHNHF 1041
Query: 816 ILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGA 875
+ +L G F++VGD + S+S+L + +E ARDY W A+E + +GA
Sbjct: 1042 VTSLVWD--GQFLLVGDAISSVSVLRVADDATKLESVARDYAPLWPVAIESTGNGGVIGA 1099
Query: 876 ENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTV- 934
++ NLF+ +R LE G YH+ + VN+ G+ L +DV Q V
Sbjct: 1100 NSDCNLFSFALQR----GPQRNGLEKNGVYHIDDVVNKLIKGA----LSSADVSQDQAVK 1151
Query: 935 ---IFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDA 991
+F T G IG I + L + LQ N+ K + G GG+NH + R+ + DA
Sbjct: 1152 AGHVFFTSTGRIGAILDMNDTMSLHMTALQRNMAKSLIGPGGVNHTKRRAPATPRGHTDA 1211
Query: 992 K---NFLDGDLIESFL 1004
+ FLDGD +E+FL
Sbjct: 1212 EASYGFLDGDFLETFL 1227
Score = 44.3 bits (103), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 35/148 (23%), Positives = 72/148 (48%), Gaps = 7/148 (4%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATE--RYKFCVLQWDAES--SELITRAMGDVSDRIGR 61
++GRI L+ + Q L + T+ K ++ + E + L + + D+ R
Sbjct: 56 MWGRIVGLQAVPARTQGQSNLLVMTDLPDAKLVLMTYSTEGGVAALTSGTSHSLMDQAAR 115
Query: 62 PTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGCA 121
P + +DP +++ Y G KV+ ++ G+L +AF++ L E+ +L FLY
Sbjct: 116 PAEFVTDFQVDPTGQVVIASCYAGKLKVVRIED-GELGQAFDVSLPEINLLAFTFLYTDD 174
Query: 122 KP--TIVVLYQDNKDARHVKTYEVALKD 147
+ T+ +L+ ++K + + +V L+D
Sbjct: 175 EDRFTLALLHLNHKRQIQLLSRDVLLED 202
>gi|336369683|gb|EGN98024.1| hypothetical protein SERLA73DRAFT_109335 [Serpula lacrymans var.
lacrymans S7.3]
gi|336382464|gb|EGO23614.1| hypothetical protein SERLADRAFT_449959 [Serpula lacrymans var.
lacrymans S7.9]
Length = 1257
Score = 336 bits (862), Expect = 3e-89, Method: Compositional matrix adjust.
Identities = 303/1150 (26%), Positives = 511/1150 (44%), Gaps = 200/1150 (17%)
Query: 70 IIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGCAKP---TIV 126
++DP RL + Y G K++ +G F++ + EL +L ++F+ ++P T+
Sbjct: 125 LVDPTGRLAVVSPYTGKLKIVLITPEGTYDTDFDVSIAELNLLALEFI--SSEPDLYTLA 182
Query: 127 VLYQDNKDARHVKTYEVALKDKDFV--------EGPWSQNNLDNG--ADLLIPVPPPL-- 174
+++ D+ + + +++L + P S + G L+ +PP +
Sbjct: 183 IMHIDHLKRLQLLSRDISLDEYQLSPTPSPALPSTPLSAKSFPIGDKPPSLVYIPPFIDD 242
Query: 175 -----CGVLIIGEETIVYCSANA------------------------------------F 193
G+L+IG I++ ++ +
Sbjct: 243 DEDFIGGMLVIGGTKILFYDTSSRESQGNRKQKQRRMDKGKDSTVKELAKAKQKEDEREW 302
Query: 194 KAIPIRPSITKAYGRV-------DADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLG 246
+ R S+ + + D G +L+GD G L +L I + T L + LG
Sbjct: 303 RKKKARASVDWPWSEITAWCAVGDELGRSFLIGDKYGRLAMLSIGINEG--TTLTLVALG 360
Query: 247 ETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQP-------------------------- 280
ETS A+T++YL + V+Y+GS +GDSQL+++N P
Sbjct: 361 ETSSATTLTYLTSQVLYVGSHFGDSQLLRINTTPVQSLESPTLPVPSDINTTTSSSLAAK 420
Query: 281 ---------------DAKGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDG 325
+ +GS++ V+E Y N+ PI+D +VD++ GQ VVTCSG G
Sbjct: 421 GKMEESSDRSGGCIINGRGSFLSVIESYKNIAPIIDAALVDVDNSGQHAVVTCSGGQNTG 480
Query: 326 SLRIVRNGIGINEQASVE-LQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAM---NLE 381
S+ IVRNG A++E + + +W LR++ D DT + VS T IL+ N+
Sbjct: 481 SISIVRNGADFKAMANMEGIVDVTNIWPLRANYYDTVDTHVAVSTFKATHILSFDGRNVA 540
Query: 382 DELEETEIEGFCSQTQTLFCHDAI---------------YNQLVQVTSGSVRLVSSTSR- 425
+ +GF + + TL + + + +VQ+TS + L+
Sbjct: 541 SHVNPVS-KGFITTSPTLVIANVLGRPKAPGQATNSYVDSSLVVQITSRGIALLEYAVEL 599
Query: 426 ----ELRNEWKSP-------PGYS---VNVATANASQVLLATGGGHLVYLEIGDGILTEV 471
+ + W PG+ + A+AN SQ +LA G +V L + D +
Sbjct: 600 GEYARVGDIWTPAKLSSTNGPGWENREIVAASANGSQFVLALNSGRIVLLNLDDRNCFDC 659
Query: 472 KH-------AQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL---PDLNLITK 521
+ + EIS + P+ + +S V W+ + I S I K
Sbjct: 660 LNWRDLDGSSNYPTEISAISCVPLNPSKKFSMYIVVSFWSSNHIEILSANAASHFASICK 719
Query: 522 EHLGGEI--------IPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSL 573
+ + RS + E YL+ LGDG ++++ K EL +K S+
Sbjct: 720 TPVLPSLPRSLLLFNFTRSDIPKQTEYNQYLIVGLGDGSVVSY--EFKKEELHGQKIFSV 777
Query: 574 GTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSL 633
G P+TL + VFA R ++++ ++L +S + L EVS + N+ F S+
Sbjct: 778 GNVPVTLHPCDVEGRRTVFACGSRTSILFWDKERLHHSPLMLNEVSAVSRLNTTVFDSSV 837
Query: 634 AIAKEGELTIGTIDDIQKLHIRSIPLG-EHPRRICHQEQSRTFAI-CSLKNQSCAEESEM 691
+A L IG++ ++ KL+IRSIPLG ++PRRI H R +A+ C + +++E
Sbjct: 838 ILATSTGLVIGSVKNLDKLYIRSIPLGYDNPRRILHVPSLRAYAVGCISITPTRIDKAEY 897
Query: 692 --HFVRLLDDQTFEFISTYPLDTFE--YGCSILSCSFSDDSNVYYCVGTAYVLPEENEPT 747
++L D TF + + D E ++S + Y C GT Y +E EP+
Sbjct: 898 TSSSMQLFDHTTFARLGQFRCDPNEEITALGVVSVTLERSIGTYICAGT-YKYVDEVEPS 956
Query: 748 KGRILVFIVEDG-----KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDG 802
+GR+LVF EDG K+ + E +G VY++ + NG ++AAIN + +Y+ + D
Sbjct: 957 QGRLLVFDAEDGSLLREKITMAVSLEVRGCVYAVGSVNGMIIAAINSSVVVYRPEI-DAS 1015
Query: 803 TRELQ----SECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNA 858
T+ L +E H + L + RGD I+VGD + SIS L + E I+ ARDY +
Sbjct: 1016 TQLLALHKITEWNH--NYLVTNLVCRGDKILVGDAINSISFL--RMVESQIQCLARDYGS 1071
Query: 859 NWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGS 918
W VE+LD +GA +++NLFT E R LE G Y++G+ VN+F G+
Sbjct: 1072 LWPVCVEMLDQSSIIGANSDYNLFTFALQET----ELRKSLERDGSYYIGDMVNKFIPGA 1127
Query: 919 LVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQ 978
L D+ P +F T G IGVI + E L + LQ N+ + G+ H +
Sbjct: 1128 LTAHDVSVDMPLEPKQLFFTSTGCIGVIVDMGDELSLHMTALQRNMSTYLSQTKGVTHTK 1187
Query: 979 WRSFNNEKKTVDAK----NFLDGDLIESFLDL------SRTRMDEI--SKTMNVSVEELC 1026
+R+ N DA+ FLDGD +E F+ ++ MD ++ + +SV+ +
Sbjct: 1188 FRAPKNAYGRSDAEATSFGFLDGDFLEKFMQFLNDPGPLKSIMDGQGEAERLTISVDRIH 1247
Query: 1027 KRVEELTRLH 1036
+ +E L +H
Sbjct: 1248 RVLERLQSMH 1257
>gi|302423344|ref|XP_003009502.1| DNA damage-binding protein 1b [Verticillium albo-atrum VaMs.102]
gi|261352648|gb|EEY15076.1| DNA damage-binding protein 1b [Verticillium albo-atrum VaMs.102]
Length = 1119
Score = 328 bits (842), Expect = 7e-87, Method: Compositional matrix adjust.
Identities = 284/1119 (25%), Positives = 504/1119 (45%), Gaps = 141/1119 (12%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPTD 64
+ G I L+ RP D LF+ T+ + + +WD +L+T + + D+++ R
Sbjct: 54 VCGTITLLQTLRPKASETDILFVGTDLFHYFTARWDPTQGKLVTEQVIQDIAEPHMREAQ 113
Query: 65 NGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQL-----KEAFNIRLEELQVLDIKFL-Y 118
+ ++DP R + +HL++G+ ++ + K+ +RL EL + F+
Sbjct: 114 SQDKCLVDPAGRFVAMHLWEGVMNIMQLGTRKGANLRLDKDWAQVRLSELFMKASTFVPT 173
Query: 119 GCAKPTIVVLYQ---DNKDARHVKTYEV------------ALKDKDF-VEGPWSQNNLDN 162
PTI LYQ D +DA H+ Y + LK+++ +E P D
Sbjct: 174 ETGHPTIAFLYQNSIDREDA-HLAVYRLMEDGNTNVSKFDPLKNRELELEIP------DP 226
Query: 163 GADLLIPVP------------------PPLCGVLIIGEETIVYCSANAFKAIPI-----R 199
A LIPV L G+++ GE ++Y + + I
Sbjct: 227 FARTLIPVSIVESDVKRYHRRDTTNASAQLGGLIVAGETMLIYV--DTLTKVKISKALDE 284
Query: 200 PSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDN 259
P I ++ + D +RYLL D G LHLL + + VTGL ++ +G+TS AS + Y+ N
Sbjct: 285 PRIFVSWAKYDV--TRYLLADDYGNLHLLTLEVDGVIVTGLSLKTIGKTSRASCLVYMGN 342
Query: 260 AVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLER---------- 309
++++GS +GDSQL L+L + +++ N+ PI+DF ++DL
Sbjct: 343 EILFLGSHHGDSQLFTLDLC----AQTIRLIQTIPNIAPILDFSIMDLGNAGDSGVGNAF 398
Query: 310 -QGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVE-LQGIKGMWSLRSSTDDPFDTFLVV 367
GQ ++V G + +GSLR +R+ +G+ + ++ +Q ++G++SL+S + DT LVV
Sbjct: 399 SSGQARIVAGCGVHHNGSLRSIRSSVGLEDIGILDGIQDVRGLFSLKSYGSEKVDT-LVV 457
Query: 368 SFISETRILAMNLEDELEE-TEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRE 426
SFI+ETR+ + +EE + +G TLF +Q+T+ S L+ S S
Sbjct: 458 SFITETRVFKFDAYGSVEELADFQGLTLDQPTLFAGSLANGHTLQITASSALLLDSESSV 517
Query: 427 LRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDIN 486
N W S G S+ A+ N +TG + + +
Sbjct: 518 TINSWTSSNGGSIVNASVNEKA---STGRRY--------------------WGATAARSR 554
Query: 487 PIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEGIS-YLL 545
++ I A+G W+ SV + T G+ R+ F GIS +
Sbjct: 555 AFTHRETFPDIGAIGFWSTGSVCGHQPRN----TASTAWGKASNRTTTTSLFLGISLWSS 610
Query: 546 CALGDGHLLNFLLNMKTGEL---TDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIY 602
C + ++T + T +K L + R + + DR +
Sbjct: 611 CTRLSSQAPRYSFLLRTAKWFQSTSQKMTILSRRGKVSRWGAGRQA--CMCCRDRVPL-- 666
Query: 603 SSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEH 662
+++YS ++V+H+ PF+S FPD++ +A + + I +D ++ H+ + +GE
Sbjct: 667 ---GRIIYSAATAEDVTHIAPFDSEGFPDAIFLATDKNVRIANVDTERRTHVNPLHIGET 723
Query: 663 PRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPL-----DTFEYGC 717
RR+ + + F I +++ + +E + L D+ P + E
Sbjct: 724 VRRVAYSPALKAFGIGTIRKELLHDEEIVSSAFQLVDEIVLGKVGRPFALGGEASVELVE 783
Query: 718 SILSCSFSDDSNV---YYCVGTAYVL-PEENEP--TKGRILVFIVE-DGKLQLIAEKETK 770
+++ D + + +GT+Y+ P+ N+ KGRILV V+ D LI E K
Sbjct: 784 AVIRAELDDSTGQPAERFIIGTSYLADPDVNDSGDVKGRILVLGVDSDRNPYLIVSHELK 843
Query: 771 GAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVV 830
GA L KL+A +++ + +Y ++ + +L+ + + G+ I V
Sbjct: 844 GACRCLGVMGDKLVAGLSKTVVVYDYIEDSTTSGKLEKLTTFRPSTFPVDLDISGNMIGV 903
Query: 831 GDLMKSISLLIY----KHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRK 886
DLM+S++L+ + + + ERAR + W +AV L ++ +L A+ NL + +
Sbjct: 904 ADLMQSMTLVEFVPAKDGRKAKLIERARHFEYIWATAVCSLGEESWLEADAQGNLMILER 963
Query: 887 NSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVI 946
+ T+ ++ ++ E HLGE VN+ R L + +SD+ +P TV+G + V+
Sbjct: 964 QPDAPTEHDQKQMRTTSEMHLGEQVNKIR--PLQITATESDI-IVPKAFLATVDGSLYVL 1020
Query: 947 ASLPHEQYLFLEKLQTNLRKVIKGVG--------GLNHEQWRSFNNEKKTVDAK-NFLDG 997
A++ E L Q L +++ +G G + QWR F N K+ A F+DG
Sbjct: 1021 ANISAEYQSILLPFQERLAGIVRYLGQAAPEDNEGPSFSQWRGFRNAKRMAGAPFRFVDG 1080
Query: 998 DLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRLH 1036
+LIE FLDL R + + + + SVE + VEEL R+H
Sbjct: 1081 ELIERFLDLDELRQEAVVEGLGPSVEAMRNMVEELRRMH 1119
>gi|213407660|ref|XP_002174601.1| damaged DNA binding protein Ddb1 [Schizosaccharomyces japonicus
yFS275]
gi|212002648|gb|EEB08308.1| damaged DNA binding protein Ddb1 [Schizosaccharomyces japonicus
yFS275]
Length = 1078
Score = 328 bits (841), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 274/1073 (25%), Positives = 501/1073 (46%), Gaps = 90/1073 (8%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITR--AMGDVSDRIGRPT 63
++ ++ +++ FRP +D L + + + + WD + + I A S
Sbjct: 54 VFSKVRSIKSFRPPLSTKDHLIVTVSPFSYFTVTWDPDKKQFINEVTAQPPASTFFLNKQ 113
Query: 64 DNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNK-----------GQLKEAFNIRLEELQVL 112
N ++ +IDP R+I H + GL IP + +++ F +R+ EL VL
Sbjct: 114 RNPKL-LIDPSNRVICSHAFQGLLTFIPITQRITKKRQKDVPSQSVQDNFTVRIMELDVL 172
Query: 113 DIKFLYGCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEG-PWSQNNLDNGADLLIPVP 171
DI FLY PT+ VLY+D++ H+K Y V L +K+ E ++ ++++ G +
Sbjct: 173 DIVFLYTSKTPTLAVLYKDSRSVIHLKAYTVNLAEKELNEDESFACHDIEEGR----LIA 228
Query: 172 PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYG----------RVDADGSRYLLGDH 221
GV + GE + Y S + + AY V + +Y++ D
Sbjct: 229 HYDGGVFVFGEVYVYYVSREKKSSKMLVSYPVTAYSAVFQSFQNGTEVCINERKYIVSDE 288
Query: 222 AGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPD 281
+G +++ I + + +E +GE IAS I L N +++GS +GDS+L + +P
Sbjct: 289 SGAVYMFNIGNN----FNVTLEKIGEAPIASCIVALPNDQIFLGSHFGDSKLYRFQNKPG 344
Query: 282 AKGSYVEVLE--RYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQ 339
GS VE+LE + NL PI DFC+ G +V C AY +GSLR +++GI I++
Sbjct: 345 VAGS-VELLEVQSFDNLAPISDFCID--HGNGGSFMVACCNAYNEGSLREMKSGIDISDY 401
Query: 340 ASVELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTL 399
+E+ + ++S++ + +L V ISET + ++ E++ ++ C + T+
Sbjct: 402 GVIEMPNVCSLYSVQLQSS--VTKYLFVGSISETSVFEISQSGEMDL--VDSLCLEEPTI 457
Query: 400 FC-HDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHL 458
F + + L V SV L ++L + W + G ++ A + V L+ +
Sbjct: 458 FVGATSDSSCLYHVARSSVCLFDG--KQL-SYWFADGG-AITCAAVYDNAVCLSMTNNQI 513
Query: 459 VYLEIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNL 518
V+L L V Q++ +++ L + ++ VG+W + + +
Sbjct: 514 VFLHK----LQVVSKLQVDSDVTALSF------LNDARTICVGLWNQ-KLLLLDSDNTTC 562
Query: 519 ITKEHLGGEIIPRSVLLCAFEGIS--YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQ 576
+E L +S++ GI+ L + G L+ F ++ T + R L
Sbjct: 563 TVRETLEVNEKAQSLVCVKLAGIASVALYMSTESGKLITFDIDPVTKGIKQRLTYPLTAV 622
Query: 577 PITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIA 636
P+ + TF + N + + P +Y N+ L ++ + ++ + P + A
Sbjct: 623 PLNMNTFQTPNGPVLITLGEHPYAVYGENQHLSFAYIGNSDIVCLSHLQHPGIPANTVYA 682
Query: 637 KEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESE-MHFVR 695
L + ++ +QKLHIR IP+ PRRI ++ L+++ ++ S M +
Sbjct: 683 TRNALKLSNVNMLQKLHIRRIPVAGIPRRIAATKEHYFVLSVDLQDKLASQGSSIMSSLH 742
Query: 696 LLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVF- 754
+ T+E + + + +E I+ C+ + N VGT + P+ +EP GR++VF
Sbjct: 743 VFKKLTYETVLQHEFEDYE----IVECALTLPDNERVVVGTGFNYPDRDEPDGGRLIVFR 798
Query: 755 IVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKW---MLRDDGTRELQSECG 811
+ E KL A +T+GA++S+ GKLL +N + +++ LR G+ + C
Sbjct: 799 LDEQEKLVTEAVYKTQGAIFSVEYQEGKLLVGMNAVLCTFRYENKTLRVVGSTRTPTYC- 857
Query: 812 HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI 871
L + ++ D +VVGD+MKS++L Y E+ EE ARD+ A W+++V+ L + +
Sbjct: 858 -----LNIAASSK-DIVVVGDMMKSLTL--YNTEKDTAEEVARDFGALWVTSVQPLSETL 909
Query: 872 YLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQI 931
+ + T+ +++ ER +L Y LG+ VNR R G V+ P V
Sbjct: 910 FFCTTADGEAVTMLWDTKAPQSVERKKLRWKSCYRLGDMVNRTRRGCFVLSSPSRLVK-- 967
Query: 932 PTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQW--RSFNNEKKTV 989
P ++ TV G I +I L L+ +Q N + + +GGL+ +W R F + +
Sbjct: 968 PELMCVTVEGGILLIGDASQHADLLLQ-IQHNFLEAVPPLGGLDFYKWHERLF-PARASA 1025
Query: 990 DAKNFLDGDLIESFLDLSRTRMDEI------SKTMNVSVEELCKRVEELTRLH 1036
K+F+DGDL+ES DL + + +I +++N+SV +L + +L RLH
Sbjct: 1026 ANKDFIDGDLLESIEDLPESTLQKIVQGTNGGQSLNISVPDLLGIISDLKRLH 1078
>gi|389744702|gb|EIM85884.1| hypothetical protein STEHIDRAFT_121882 [Stereum hirsutum FP-91666
SS1]
Length = 1255
Score = 327 bits (839), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 313/1201 (26%), Positives = 522/1201 (43%), Gaps = 208/1201 (17%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERY-KFCVL--QWDAESSELITRAMGDVSDRIGRP 62
+YGRI+ + + + L + K VL +W + L+T + ++I RP
Sbjct: 54 LYGRISVIRTITRKDDPDNILLMTDHPAPKLVVLSPKWISNRLSLVTVQTESLYEQIARP 113
Query: 63 TDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGCAK 122
++ +DP + H Y G K N G L++ ++ + EL +L + + +
Sbjct: 114 SEFVHNVFLDPYGYVGLAHCYTGKAKAFTI-NDGLLEDFCDVLVPELNILSLTLVSSPEQ 172
Query: 123 P-------TIVVLYQDNKDARHVKTYEVALKDKDFVEGP---WSQNNLDNGADLLIPVP- 171
+ +L+ D++ + ++ + + + P ++ + + + PVP
Sbjct: 173 AIDTPDPSALAILHLDHRQNIQLLLRDLNISEYELSPSPSLMFTPATISSKVFEMAPVPE 232
Query: 172 --------PPL---------------CGVLIIGEETIVY--------------------- 187
PPL G+L+IG + ++
Sbjct: 233 NPPIIVNIPPLQLPNPEAMEGTSDFNGGLLVIGGKKLLLFELTSKEFQRKHHKRLRKQES 292
Query: 188 ----CSANAFKAIP---------IRP--SITKAYGRVDA------DGSRYLLGDHAGLLH 226
+ KA P +P S+ + V A +G+RYL+GD G L
Sbjct: 293 RRSTSGKDKAKAEPKEEKEGKSRSKPIASVAWPWSEVTACCQANDEGTRYLVGDAYGRLA 352
Query: 227 LLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQP------ 280
+L + ++ GL + LGETS A+T++YLD+ +Y+GS +GDSQLIK++ QP
Sbjct: 353 MLSLDSLYDQ--GLILIPLGETSPATTLTYLDSQFLYVGSHFGDSQLIKIHRQPYGYTNT 410
Query: 281 --------------------------------------DAKGSYVEVLERYVNLGPIVDF 302
KG Y+EV+ + N+ PI+D
Sbjct: 411 DTLPLLGILPLEPTNLASSSGKGKAKAAAGRKSDGRILGMKGKYLEVVHSWRNIAPILDA 470
Query: 303 CVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVE-LQGIKGMWSLRSSTDDPF 361
D + G +VTCSG GSLR++RNG E+A V+ L + G+W+LRS D
Sbjct: 471 VKADTDGSGLPHIVTCSGGKSTGSLRVIRNGADFQEKAVVDGLDNMTGLWALRSHNSDKD 530
Query: 362 DTFLVVSFISETRILAMNLEDELEETEIEG---FCSQTQTLFCH---DAIYNQLVQVTSG 415
++LV S ET + + L E F TL D + ++QVTS
Sbjct: 531 HSYLVASDKDETHLFRFEGPESLRRVESSSESYFRMNKPTLAVANVGDLDSSLILQVTSD 590
Query: 416 SVRLVSSTSR----ELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDG----- 466
V+L+ + L + W G + +A+ + +QVLLA GG +V L +
Sbjct: 591 KVQLLEFDAVLEIFHLSSHWSPESGKQITLASISPTQVLLALDGGTVVLLGLDKSRNMTV 650
Query: 467 ILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL----PDLNLITKE 522
+ T + +A++ +SC N +G AV W + + SL P L+ +T
Sbjct: 651 LSTRLYNAEIS-ALSCALPNALG---------AVSFWDSDRIELVSLGGSTPLLSTVTSR 700
Query: 523 HLGGEIIPRSVLLCAFEGIS---YLLCALGDGHLLNFLL---NMKTGELTDRKKVSLGTQ 576
+G RS+LL F+ S LL L +G ++ F++ KT EL DRK ++G
Sbjct: 701 EIGSSARARSILLHKFDSTSSEMQLLVGLTNGIMVAFVVASSETKTPELQDRKTFAMGNM 760
Query: 577 PITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIA 636
P+ L T V A+ R V++ +L S V L + P ++ P S+ +A
Sbjct: 761 PVLLVKVMVDGHTSVLASGSRSVVVFWEKGRLKTSPVMLNNIFAGAPVDTPWSPSSVVLA 820
Query: 637 KEGELTIGTIDDIQKLHIRSI---PLG-EHPRRICHQEQSRTFAICSLKNQSC---AEES 689
+L IG++ + K+HI + P G ++PRRI + + FA+ + + EE
Sbjct: 821 TGSKLIIGSVHGLDKMHIVPVTATPFGLDNPRRIVYDSHASVFAVACTRIEPAPVFGEEV 880
Query: 690 EMHFVRLLDDQTFEFISTYPL----DTFEYGCSILSCSFSDDSNV--YYCVGTAYVLPEE 743
+LLDD TFE I P D + +I + + + +YCVG +E
Sbjct: 881 SRGSFKLLDDITFELIGEVPFAENTDQMQEITAIHAFQHVEGNIAISHYCVGLVTYQLDE 940
Query: 744 NEPTKGRILVF----IVEDGKLQL----IAEKETKGAVYSLNAFNGKLLAAINQKIQLYK 795
EP+ G + +F DG ++ + ++E G VY+L +F G ++AAIN + +K
Sbjct: 941 KEPSHGELHIFSKGGTEPDGTVKADLLEVVKQEVSGCVYALASFEGYIVAAINSTVSFFK 1000
Query: 796 WMLRDDGTREL---QSECGHHGHILALYVQTRGDFIVVGDLMKSISLL--IYKHEEGAIE 850
L GT + E H+ + +L V G ++++GD + S+S+L I E G I
Sbjct: 1001 --LDTSGTEATLVKKHEWNHNYLVTSLVVS--GSYLLIGDAISSVSVLQVIQVDENGEIT 1056
Query: 851 ER----ARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYH 906
E+ ARDY W +++ + +GA ++ NLF+ ++ L+ G +H
Sbjct: 1057 EKLKTVARDYGPLWPVSLQGWGKEGVIGANSDCNLFSFTLQ---PVTPQKTVLDRDGHFH 1113
Query: 907 LGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRK 966
L + VN+F HG++ D+ +F T +G IG++ + E L + LQ NL
Sbjct: 1114 LDDHVNKFLHGTVHSSEKAEDLDIEARSLFFTASGRIGLVLDMGKELSLHMTALQRNLNG 1173
Query: 967 VIKGVGGLNHEQWRSFNNEKKTVDAK----NFLDGDLIESFLDLSRTRMDEISKTMNVSV 1022
V+K V G H++WR+ + + DA FLDGD IE FL+ DE+S M +
Sbjct: 1174 VVKDVSGTTHKRWRAPVSGRGPSDADAEAYGFLDGDFIEQFLN-----YDELSPEMKRIM 1228
Query: 1023 E 1023
E
Sbjct: 1229 E 1229
>gi|384490729|gb|EIE81951.1| hypothetical protein RO3G_06656 [Rhizopus delemar RA 99-880]
Length = 967
Score = 327 bits (837), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 276/1034 (26%), Positives = 468/1034 (45%), Gaps = 152/1034 (14%)
Query: 2 LDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGR 61
L V IYG I L++ + ++ LFI T + ++ + ++ ++T G ++ R
Sbjct: 32 LIVCIYGTIRNLQVIQLPDKSTCSLFILTVHQCYTIITYSLKTQSIVTEFSGQLNITNAR 91
Query: 62 PTDNGQIGIIDPDCRLIGLHLYDGLFKVIPF--------------DNKGQLKEAFNIRLE 107
TD + +D +I + + G IPF D K IR
Sbjct: 92 ETDQQVVVTVDKTSEMIFVSAFTGYVIAIPFGKPAPGAKLTSKNRDTKASRFMQIPIRTN 151
Query: 108 ELQVLDIKFLYGCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLL 167
E + + L + VL + +D + +KT++ + KD +E S ++ L
Sbjct: 152 EFDFISVAALQQGGY--LSVLVGEMEDLKTIKTFKYRDEYKDLLERNKSTIKVEASTHAL 209
Query: 168 IPVPPPLCGVLIIGEETIVYCS--ANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLL 225
+PVP PL G+L+IGE I Y N + + I P+ A+ + + +RYLLGD G L
Sbjct: 210 VPVPEPLGGLLVIGEYIITYFDPLTNTNRELSIDPARVTAWEFMKDESNRYLLGDEEGYL 269
Query: 226 HLLVITHEKEKVTGLKIELLGET----------------SIASTISYLDNAVVYIGSSYG 269
++ I KV L +G+ S S I L N + YIGS++G
Sbjct: 270 YVFSIETSHNKVVNLSSTFIGQVPSFNQNIESKANHPQVSRPSCIVDLGNLMFYIGSTHG 329
Query: 270 DSQLIKLNLQPDAKGSY-VEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLR 328
DS LI+L ++ K Y V+VL Y LGPIVDFC+ D +QG+ + C+G KD S+R
Sbjct: 330 DSCLIQL-IKGQEKSKYTVKVLSTYSCLGPIVDFCLYDYNKQGKQTMACCAGVEKDASIR 388
Query: 329 IVRNGIGINEQASVELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETE 388
IV NGIG +++ +++ + MW+L S D D+ L+ + + +T +L + ++ELE T+
Sbjct: 389 IVENGIGFSKKYALDFPLVYAMWTL--SLDGDRDSLLISTAL-DTVLLKPSDQEELEVTQ 445
Query: 389 IEGFCS--QTQTLFCHDAIYNQLVQVTSGSVRLVSSTSR-ELRNEWKSPPGYSVNVATAN 445
+ + +Q D + +VQ TS VR++++ +L EWK P G S+ +A
Sbjct: 446 HTSYSALDTSQMTLAADMFNSFIVQATSSFVRMMTNDEYGQLIGEWKPPTGTSIAIAKIK 505
Query: 446 ASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDINPIGENPS-YSQIAAVGMWT 504
S ++ G ++YLE+ + E QL+ SC+ I+ EN + Y + A +
Sbjct: 506 DSHCVVCCEGDMIIYLEMTNKGFIEKSKRQLK-NASCISISTRKENETLYDYVVAGTCGS 564
Query: 505 DISVRIFSLPDLNLITK-EHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTG 563
+ SV LPDL ++ + + + P +L+ E + YL+ LGDG + ++ + + +
Sbjct: 565 NPSVVFLQLPDLEVVLEHKDMPSTTGPNDLLVVTMEKVLYLMVLLGDGQMFSYHMEVGSE 624
Query: 564 E--LTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHM 621
+ L ++ +GT + + VF A RPTVI S ++ L +VNL
Sbjct: 625 DVILESETEIMVGTYCTAMYPYQHGQEKRVFVAGQRPTVISSFHQTLFVYSVNL------ 678
Query: 622 CPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLK 681
+ + E L I I DI+ L + S P + + +
Sbjct: 679 ----------TYELPGEMPLRIEYISDIKALAVASCTNVHDPNKNIIERTGK-------- 720
Query: 682 NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLP 741
VRLLD QTF+
Sbjct: 721 ------------VRLLDAQTFQ-------------------------------------- 730
Query: 742 EENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQ-LYKWMLRD 800
F +E+ + +LI + G VY + + ++AA++ KI LY +
Sbjct: 731 -----------AFTIENSRCELIDAVDMPGVVYRMESIKNTIIAAVDGKIYGLYNFKPDL 779
Query: 801 DGTRELQSECGHHGHILALYVQT-RGDFIVVGDLMKSISLLIYKHEEGAIE--ERARDYN 857
++ + H +++AL + T D ++VGDLM+S+SLL + +E +++ A D
Sbjct: 780 LKGERIEFKFLLHNNVVALDMDTDNNDTLLVGDLMESMSLLKVEKDEESLKLSLEAVDNK 839
Query: 858 ANWMSAVEILDDDIYLGAENNFNLFTVRK---NSEGATDEERGRLEVVGEYHLGEFVNRF 914
WM+AV+ +++++ +GA++ NLFT+ K EG T +LE+ G YHLG VNRF
Sbjct: 840 QVWMTAVKFVNENVLIGADDRHNLFTMIKPEIRQEGKT----CKLELEGGYHLGTLVNRF 895
Query: 915 RHGSLVMRLPDSDVGQIPTV-------IFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKV 967
R ++R ++ I ++ F TVNG IG + ++ E + F + +Q + +
Sbjct: 896 RKD--ILRDVENASDNIDSISKYESEFTFATVNGSIGTVKTISRESFEFFKGIQEGILNI 953
Query: 968 IKGVGGLNHEQWRS 981
+ G L+H W S
Sbjct: 954 LPNNGNLDHGLWLS 967
>gi|449704103|gb|EMD44407.1| DNA-repair binding protein, putative [Entamoeba histolytica KU27]
Length = 1088
Score = 324 bits (831), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 280/1050 (26%), Positives = 509/1050 (48%), Gaps = 75/1050 (7%)
Query: 22 AQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDNGQIGIIDPDCRLIGLH 81
++D+L T + + Q + ++ EL ++ DR+GR G G++ + L+
Sbjct: 76 SKDYLIFVTNNQQLIIAQLN-DNKELSLVYSNNIEDRLGRTAFYGISGVVIGNSYLLLHL 134
Query: 82 LYDGLFKVIPFDNKGQ-LKEA-FNIRLEELQVLDIKFLYGCAKPTIVVLYQDNKDARHVK 139
+ K++P + + L EA FN+R++E +V+ + L K I + ++ D + +
Sbjct: 135 YNHLI-KIVPLPREEENLPEAPFNLRIDEQRVVGLYCLQLDQKQVIAIHHEIKNDVKVIT 193
Query: 140 TYEVALKDKDFVEGPWSQNNLDNGA-DLLIPVPPPLCGVLIIGEETIVYC-SANAFKAIP 197
YE + K +++ D DL IP G I + + + +++ ++
Sbjct: 194 FYEFNIDSKTL--SLYNKQIEDTSTIDLFIPYLGNE-GYFKINSQIVSFVPNSSQISSVS 250
Query: 198 IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVI----THEKE-KVTGLKIELLGETSIAS 252
+RP + Y +D R LL D+ G +HL+ + HE E L+ + L +I S
Sbjct: 251 MRPIHLETYCFLDE--HRLLLSDNQGKMHLITLKENNNHEIELYYYPLEYQAL---TIPS 305
Query: 253 TISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQGQ 312
T+ YLDN+V++ GS GDS LIK+N KG E+LE + N GPI+D + E +
Sbjct: 306 TVLYLDNSVLFWGSKGGDSHLIKIN----EKGC--EILETFENRGPILDMTAIHDEITNK 359
Query: 313 GQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDPFDTFLVVSFISE 372
++ C Y+ G+L+++ +G+GI+ E++GI ++ + +L++S+
Sbjct: 360 DNLLMCCNTYQQGTLKLISSGVGIDIICQNEMKGITHLYQAEMGHKE----YLIISYGDG 415
Query: 373 TRILAMNLED---ELEETEIEGFCSQTQTLFCHDAIY------NQLVQVTSGSVRLVSST 423
+++ E E EI+GF + +T+ C I + VQVT + + S
Sbjct: 416 SKVFESQQEVNQLRFNEIEIKGFNRKEETI-CSGIIEIGEMKESCFVQVTREEINIFDSE 474
Query: 424 SRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEY----E 479
E + + S + + L + + +L E K + Y +
Sbjct: 475 QLEFIQQIRFNKFIS---HSCIYDEKLYISQSNEI-------NVLNENKEIKQVYSGEND 524
Query: 480 ISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFE 539
IS I G + + W ++ I ++ + N + G I +S+ + +
Sbjct: 525 ISAFTIGKEGT----EDVIGICYWDSNTMEIINMENGNKNHIREINGNIYGKSIEIINNQ 580
Query: 540 GISYLLCALGDGHLLNF----LLNMKTGELTDRKKV-SLGTQPITLRTFSSKNTTHVFAA 594
Y++ +GDG +L + L+ ++ GE+ KV +G I L+T ++
Sbjct: 581 EGMYVVIGMGDGRVLVYQMEELMEIE-GEIPKNYKVLDIGLSGIELKTLEINGMKYIMCL 639
Query: 595 SDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHI 654
DRPT++ K+ +VN E+ + PF +SL +A + + IG ID+IQ +H
Sbjct: 640 CDRPTLVSICKGKIKTLSVNCSEIVSIVPFKMKECMNSLVVATKENIIIGNIDEIQSIHT 699
Query: 655 RSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEES-EMHFVRLLDDQTFEFISTYPLDTF 713
+S+ +G RI +Q + S + + E+ E H V+L+D + E ++ L
Sbjct: 700 KSLSIGVFVSRIVVSKQENKALLLSSEIKELKEKRVETHSVKLIDFRKMEIEDSHELKEN 759
Query: 714 EYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAV 773
E+ SI D N + +GTA+ P E EP+ GRIL+ ++DG+L++I EK+ GAV
Sbjct: 760 EHALSIEQIVI--DENEMFVIGTAFAKPNEVEPSSGRILIVQIKDGRLEIIFEKDVNGAV 817
Query: 774 YSLNAFNGKLLA-AINQKIQLYKWM-LRDDGTRE--LQSECGHHGHILALYVQTRGDFIV 829
YS+ K LA +I +K+ ++++ + +G E LQ + + ++ LYV+T G+ I+
Sbjct: 818 YSMKTLLKKYLAMSIEKKLVVFEYQRVITNGEFEVKLQEKGSCNVKLIGLYVKTLGNKIL 877
Query: 830 VGDLMKSISLLIYK---HEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRK 886
VGDLMKSIS+ + + + + E +RD+ A++ +A+E +D+D YL +++N N+
Sbjct: 878 VGDLMKSISVYSFDNNGNNKNCLTEVSRDFYASYTTAIEFVDEDCYLSSDSNSNILIFNT 937
Query: 887 NSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVI 946
NS G + ER RL H+GE +N GS+ + Q ++FG V G IG I
Sbjct: 938 NSTG-NESERFRLNNCAHIHVGECINVMCKGSIAPTHSTYETVQKKCILFGGVTGYIGGI 996
Query: 947 ASLPHEQYLFLEKLQTNLRKVIKG-VGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLD 1005
+P+E Y L K+Q + +KG V + W+ ++ K + + N +DG ++ES+L+
Sbjct: 997 CEIPNEIYDVLIKVQNQILLQMKGIVECTTPDNWKKVIDDWKRMPSSNIIDGSIVESYLE 1056
Query: 1006 LSRTRMDEISKTMNVSVEELCKRVEELTRL 1035
+S+ + EI+ V+ E++ +E + L
Sbjct: 1057 MSKEKQCEIAHLSGVNEEQISDIIENMISL 1086
>gi|183232997|ref|XP_653855.2| damaged DNA binding protein [Entamoeba histolytica HM-1:IMSS]
gi|169801778|gb|EAL48469.2| damaged DNA binding protein, putative [Entamoeba histolytica
HM-1:IMSS]
Length = 1088
Score = 323 bits (829), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 280/1050 (26%), Positives = 509/1050 (48%), Gaps = 75/1050 (7%)
Query: 22 AQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDNGQIGIIDPDCRLIGLH 81
++D+L T + + Q + ++ EL ++ DR+GR G G++ + L+
Sbjct: 76 SKDYLIFVTNNQQLIIAQLN-DNKELSLVYSNNIEDRLGRTAFYGISGVVIGNSYLLLHL 134
Query: 82 LYDGLFKVIPFDNKGQ-LKEA-FNIRLEELQVLDIKFLYGCAKPTIVVLYQDNKDARHVK 139
+ K++P + + L EA FN+R++E +V+ + L K I + ++ D + +
Sbjct: 135 YNHLI-KIVPLPREEENLPEAPFNLRIDEQRVVGLYCLQLDQKQVIAIHHEIKNDVKVIT 193
Query: 140 TYEVALKDKDFVEGPWSQNNLDNGA-DLLIPVPPPLCGVLIIGEETIVYC-SANAFKAIP 197
YE + K +++ D DL IP G I + + + +++ ++
Sbjct: 194 FYEFNIDSKTL--SLYNKQIEDTSTIDLFIPYLGNE-GYFKINSQIVSFVPNSSQISSVS 250
Query: 198 IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVI----THEKE-KVTGLKIELLGETSIAS 252
+RP + Y +D R LL D+ G +HL+ + HE E L+ + L +I S
Sbjct: 251 MRPIHLETYCFLDE--HRLLLSDNQGKMHLITLKENNNHEIELYYYPLEYQAL---TIPS 305
Query: 253 TISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQGQ 312
T+ YLDN+V++ GS GDS LIK+N KG E+LE + N GPI+D + E +
Sbjct: 306 TVLYLDNSVLFWGSKGGDSHLIKIN----EKGC--EILETFENRGPILDMTAIHDEITNK 359
Query: 313 GQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDPFDTFLVVSFISE 372
++ C Y+ G+L+++ +G+GI+ E++GI ++ + +L++S+
Sbjct: 360 DNLLMCCNTYQQGTLKLISSGVGIDIICQNEMKGITHLYQAEMGHKE----YLIISYGDG 415
Query: 373 TRILAMNLED---ELEETEIEGFCSQTQTLFCHDAIY------NQLVQVTSGSVRLVSST 423
+++ E E EI+GF + +T+ C I + VQVT + + S
Sbjct: 416 SKVFESQQEVNQLRFNEIEIKGFNRKEETI-CSGIIEIGEMKESCFVQVTREEINIFDSE 474
Query: 424 SRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEY----E 479
E + + S + + L + + +L E K + Y +
Sbjct: 475 QLEFIQQIRFNKFIS---HSCIYDEKLYISQSNEI-------NVLNENKEIKQVYSGEND 524
Query: 480 ISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFE 539
IS I G + + W ++ I ++ + N + G I +S+ + +
Sbjct: 525 ISAFTIGKEGT----EDVIGICYWDSNTMEIINMENGNKNHIREINGNIYGKSIEIINNQ 580
Query: 540 GISYLLCALGDGHLLNF----LLNMKTGELTDRKKV-SLGTQPITLRTFSSKNTTHVFAA 594
Y++ +GDG +L + L+ ++ GE+ KV +G I L+T ++
Sbjct: 581 EGMYVVIGMGDGRVLVYQMEELMEIE-GEIPKNYKVLDIGLSGIELKTLEINGMKYIMCL 639
Query: 595 SDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHI 654
DRPT++ K+ +VN E+ + PF +SL +A + + IG ID+IQ +H
Sbjct: 640 CDRPTLVSICKGKIKTLSVNCSEIVSIVPFKMKECMNSLVVATKENIIIGNIDEIQSIHT 699
Query: 655 RSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEES-EMHFVRLLDDQTFEFISTYPLDTF 713
+S+ +G RI +Q + S + + E+ E H V+L+D + E ++ L
Sbjct: 700 KSLSIGVFVSRIVVSKQENKALLLSSEIKELKEKRVETHSVKLIDFRKMEIEDSHELKEN 759
Query: 714 EYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAV 773
E+ SI D N + +GTA+ P E EP+ GRIL+ ++DG+L++I EK+ GAV
Sbjct: 760 EHALSIEQIVI--DENEMFVIGTAFAKPNEVEPSSGRILIVQIKDGRLEIIFEKDVNGAV 817
Query: 774 YSLNAFNGKLLA-AINQKIQLYKWM-LRDDGTRE--LQSECGHHGHILALYVQTRGDFIV 829
YS+ K LA +I +K+ ++++ + +G E LQ + + ++ LYV+T G+ I+
Sbjct: 818 YSMKTLLKKYLAMSIEKKLVVFEYQRVITNGEFEVKLQEKGSCNVKLIGLYVKTLGNKIL 877
Query: 830 VGDLMKSISLLIYK---HEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRK 886
VGDLMKSIS+ + + + + E +RD+ A++ +A+E +D+D YL +++N N+
Sbjct: 878 VGDLMKSISVYSFDNNGNNKNCLTEVSRDFYASYTTAIEFVDEDCYLSSDSNSNILIFNT 937
Query: 887 NSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVI 946
NS G + ER RL H+GE +N GS+ + Q ++FG V G IG I
Sbjct: 938 NSTG-NESERFRLNNCAHIHVGECINVMCKGSIAPTHSTYETVQKKCILFGGVTGYIGGI 996
Query: 947 ASLPHEQYLFLEKLQTNLRKVIKG-VGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLD 1005
+P+E Y L K+Q + +KG V + W+ ++ K + + N +DG ++ES+L+
Sbjct: 997 CEIPNEIYDVLIKVQNQILLQMKGIVECTTPDDWKKVIDDWKRMPSSNIIDGSIVESYLE 1056
Query: 1006 LSRTRMDEISKTMNVSVEELCKRVEELTRL 1035
+S+ + EI+ V+ E++ +E + L
Sbjct: 1057 MSKEKQCEIAHLSGVNEEQISGIIENMISL 1086
>gi|407044103|gb|EKE42371.1| DNA damage-binding protein, putative [Entamoeba nuttalli P19]
Length = 1088
Score = 323 bits (827), Expect = 4e-85, Method: Compositional matrix adjust.
Identities = 277/1051 (26%), Positives = 507/1051 (48%), Gaps = 77/1051 (7%)
Query: 22 AQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDNGQIGIIDPDCRLIGLH 81
++D+L T + + Q + ++ EL ++ DR+GR G G++ + L+
Sbjct: 76 SKDYLIFVTNNQQLIIAQLN-DNKELSLVYSNNIEDRLGRTAFYGISGVVIGNSYLLLHL 134
Query: 82 LYDGLFKVIPFDNKGQ-LKEA-FNIRLEELQVLDIKFLYGCAKPTIVVLYQDNKDARHVK 139
+ K++P + + L EA +N+R++E +V+ + L K I + ++ D + +
Sbjct: 135 YNHLI-KIVPLPREEENLPEAPYNLRIDEQRVVGLYCLQLDQKQVIAIHHEIKNDVKVIT 193
Query: 140 TYEVALKDKDFVEGPWSQNNLDNGA-DLLIPVPPPLCGVLIIGEETIVYC-SANAFKAIP 197
YE + K +++ D DL IP G I + + + +++ ++
Sbjct: 194 FYEFNIDSKTL--SLYNKQIEDTSTIDLFIPYLGNE-GYFKINSQIVSFVPNSSQISSVS 250
Query: 198 IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELL------GETSIA 251
+RP + Y +D R LL D+ G +HL+ + + +IEL +I
Sbjct: 251 MRPIHLETYCFLDE--HRLLLSDNQGKMHLITLKENNKH----EIELYYYPLEYQALTIP 304
Query: 252 STISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQG 311
ST+ YLDN+V++ GS GDS LIK+N KG E+LE + N GPI+D + E
Sbjct: 305 STVLYLDNSVLFWGSKGGDSHLIKIN----EKGC--EILETFENRGPILDMTAIHDEITN 358
Query: 312 QGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDPFDTFLVVSFIS 371
+ ++ C Y+ G+L+++ +G+GI+ E++GI ++ + +L++S+
Sbjct: 359 KDNLLMCCNTYQQGTLKLISSGVGIDIICQNEMKGITHLYQAEMGHKE----YLIISYGD 414
Query: 372 ETRILAMNLED---ELEETEIEGFCSQTQTLFCHDAIY------NQLVQVTSGSVRLVSS 422
+++ E E EI+GF + +T+ C I + VQVT + + S
Sbjct: 415 GSKVFESQQEVNQLRFNEIEIKGFNRKEETI-CSGIIEIGEMKESCFVQVTREEINIFDS 473
Query: 423 TSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEY---- 478
EL + + S + + L + + +L E K + Y
Sbjct: 474 EQLELIQQIRLNKFIS---HSCIYDEKLYISQSNEI-------NVLNENKEIKQVYSGEN 523
Query: 479 EISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAF 538
+IS I G + + W ++ I + + N + G I +S+ +
Sbjct: 524 DISAFTIGKEGT----EDVIGICYWDSNTMEIIKMENGNKNHIREINGNIYGKSIEIINN 579
Query: 539 EGISYLLCALGDGHLLNF----LLNMKTGELTDRKKV-SLGTQPITLRTFSSKNTTHVFA 593
+ Y++ +GDG +L + L+ ++ GE+ KV +G I L+T ++
Sbjct: 580 QEGMYVVIGMGDGRVLVYQMEELMEIE-GEIPKNYKVLDIGLSGIELKTLEINGMKYIMC 638
Query: 594 ASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLH 653
DRPT++ K+ +VN E+ + PF +SL +A + + IG ID+IQ +H
Sbjct: 639 LCDRPTLVSICKGKIKTLSVNCSEIVSIVPFKMKECMNSLVVATKENIIIGNIDEIQSIH 698
Query: 654 IRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEES-EMHFVRLLDDQTFEFISTYPLDT 712
+S+ +G RI +Q + S + + E+ E H V+L+D + E ++ L
Sbjct: 699 TKSLSIGVFVSRIVVSKQENKALLLSSEIKELKEKRVETHSVKLIDFRKMEIEDSHELKE 758
Query: 713 FEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGA 772
E+ SI D N + +GTA+ P E EP+ GRIL+ ++DG+L+++ EK+ GA
Sbjct: 759 NEHALSIEQIVI--DENEMFVIGTAFAKPNEVEPSSGRILIVQIKDGRLEIVFEKDVNGA 816
Query: 773 VYSLNAFNGKLLA-AINQKIQLYKWM-LRDDGTRE--LQSECGHHGHILALYVQTRGDFI 828
VYS+ K LA +I +K+ ++++ + +G E LQ + + ++ LYV+T G+ I
Sbjct: 817 VYSMKTLLKKYLAISIEKKLVVFEYQRVITNGEFEVKLQEKGSCNVKLIGLYVKTLGNKI 876
Query: 829 VVGDLMKSISLLIYK---HEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR 885
+VGDLMKSIS+ + + + + E +RD+ A++ +A+E +D+D YL +++N N+
Sbjct: 877 LVGDLMKSISVYSFDNNGNNKNCLTEVSRDFYASYTTAIEFVDEDCYLSSDSNSNILIFN 936
Query: 886 KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGV 945
NS G + ER RL H+GE +N GS+ + Q ++FG V G IG
Sbjct: 937 TNSTG-NESERFRLNNCAHIHVGECINVMCKGSIAPTHSTYETVQKKCILFGGVTGYIGG 995
Query: 946 IASLPHEQYLFLEKLQTNLRKVIKG-VGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFL 1004
I +P+E Y L K+Q + +KG V + W+ ++ K + + N +DG ++ES+L
Sbjct: 996 ICEIPNEIYDILIKVQNQILLQMKGIVECTTPDDWKKVIDDWKRMPSSNIIDGSIVESYL 1055
Query: 1005 DLSRTRMDEISKTMNVSVEELCKRVEELTRL 1035
++S+ + EI+ V+ E++ +E + L
Sbjct: 1056 EMSKEKQCEIAHLSGVNEEKISDIIENMISL 1086
>gi|390603312|gb|EIN12704.1| hypothetical protein PUNSTDRAFT_97523 [Punctularia strigosozonata
HHB-11173 SS5]
Length = 1268
Score = 319 bits (817), Expect = 6e-84, Method: Compositional matrix adjust.
Identities = 273/925 (29%), Positives = 417/925 (45%), Gaps = 141/925 (15%)
Query: 217 LLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKL 276
L+GD G + +L I + + GL + LGETS ++I+YL + V Y+GS GDSQL+++
Sbjct: 344 LIGDAYGRMSMLSI--DPIRGNGLTLLPLGETSPPTSIAYLTSQVFYLGSHMGDSQLLRI 401
Query: 277 --------------------------------------------------NLQPDAK--- 283
++ PDA+
Sbjct: 402 SPTPHSLIDKPTLPIDPRISTVLPSQLSSYNAGESVKGKGKRRADLDSDIDVDPDAQSGR 461
Query: 284 --------GSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIG 335
GSY+EVL Y NL P+ D +++ + GQ Q+VTCSG GSL++VR G
Sbjct: 462 KGKVVRSDGSYLEVLATYQNLAPVTDALLMNADGSGQPQIVTCSGGANAGSLKVVRKGAD 521
Query: 336 INEQASVE-LQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEE--TEIEGF 392
A VE L G +W +R D D+++V S + T++L + D + F
Sbjct: 522 FKTAAVVESLPGTVSVWPVRKRYYDNTDSYIVASTLRCTQVLLLEEHDTVNPLVAASTDF 581
Query: 393 CSQTQTLFCHDAIYNQLV-------------QVTSGSVRL-----VSSTSRELRNEWK-S 433
+ T+ + + +LV QVT +RL + L + WK S
Sbjct: 582 ATSGPTISVANILRRRLVNGKSEYEDSSLVVQVTPSKMRLLEHDMIGPVEFRLVDTWKPS 641
Query: 434 PPG-YSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQ-LEYEISCLDINPIGEN 491
PG + A+ N +Q+ L GG L + +++ Q +IS + P+
Sbjct: 642 GPGPQEIVTASINPTQIALGLRGGRLCVFRLAPNDHFDLRFTQHFSNDISAVSCLPLNPG 701
Query: 492 PSYSQIAAVGMWTDISVRIF----SLPDLNLITKEHLGGEIIPRSVLLCAFEGISY---- 543
S AVG W +V I ++ +L L T +PRS+LL F G S+
Sbjct: 702 NLISAFIAVGFWGSNNVMILCQKGNILELELQTDPL---PALPRSLLLYNF-GTSFGKKD 757
Query: 544 ------LLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDR 597
LL L DG L+++ K EL D+K V LG P++L + +F R
Sbjct: 758 PNYHAHLLIGLADGSLVSYAYARK--ELKDKKVVPLGASPVSLVPCEANGKKAIFCCGTR 815
Query: 598 PTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSI 657
V+Y +L + LK V CP NS +F S + TIG ++ LH+RS+
Sbjct: 816 AAVVYWDRDRLQNAPALLKNVVTACPLNSTSFEQSTILVTGSGFTIGHFGNVAGLHVRSV 875
Query: 658 PLG-EHPRRICHQEQSRTFAICSLKNQSCA----EESEMHFVRLLDDQTFEFISTYPLDT 712
PLG + P+RI + +S + ++ + E + RLLDD +F + + L+
Sbjct: 876 PLGVDVPKRITYNNESHLLGVACIRKEPHRIGDDEGTIRSSFRLLDDTSFGELDRFDLEA 935
Query: 713 FE--YGCSILSCSFSDDSNVYYCVGTA-YVLPEENEPTKGRILVFIVEDGKLQLIAEKET 769
E +LS ++ ++C+GTA + ++ E +KGR++VF L +A +
Sbjct: 936 DEDITSAVVLSLGTAEAYTSHFCIGTADFTSDDQLEVSKGRLVVFDPSTKVLSPVATLDV 995
Query: 770 KGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGT-----RELQSECGHHGHILALYVQTR 824
G VY+L + G + AA+N + +Y+ L DG R +Q +H + + V TR
Sbjct: 996 NGCVYALASIQGLVAAAVNSAVIVYR--LETDGPTFSSKRLVQLANWNHNYFVTNLV-TR 1052
Query: 825 GDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTV 884
G I VGD + S+S+L + A++ ARDY W A+E D +GA+ FNLFT
Sbjct: 1053 GSRIFVGDAISSVSILELTGQ--ALQTVARDYGPLWPVAIESTGPDSVIGADGEFNLFTF 1110
Query: 885 RKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQI--PTVIFGTVNGV 942
+ + G+LE G YHLGE VN+F G L+ P P T +G
Sbjct: 1111 KLS--------EGKLERDGSYHLGEQVNKFVPGGLLAADPAHTTQTTCKPIQTLFTSSGR 1162
Query: 943 IG-VIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDA---KNFLDGD 998
I VI L E L L L N+ KVI G+ G+ +E WRS + DA + FLDGD
Sbjct: 1163 IAIVIDVLDPEVSLHLTNLHRNMSKVITGLTGIQNETWRSPATSRGRTDAETSRGFLDGD 1222
Query: 999 LIESFLDLSRT--RMDEISKTMNVS 1021
+E+FLD+ RT + EI + NV+
Sbjct: 1223 FLEAFLDVPRTSPSLAEILEGENVA 1247
>gi|395330962|gb|EJF63344.1| hypothetical protein DICSQDRAFT_153890 [Dichomitus squalens LYAD-421
SS1]
Length = 1263
Score = 318 bits (816), Expect = 8e-84, Method: Compositional matrix adjust.
Identities = 309/1199 (25%), Positives = 506/1199 (42%), Gaps = 227/1199 (18%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERY--KFCVLQWDAES--SELITRAMGDVSDRIGR 61
++GR+ L+ E + + T+ + +L +D + + L ++DR R
Sbjct: 56 MWGRVVGLQAVPAEEEGRSHILAMTDHPDPELILLSYDVKDGVASLTPAGHHSLADRAAR 115
Query: 62 PTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGCA 121
+ +DP + Y G KV+ + G+L AF++ L EL +L FL+
Sbjct: 116 QAEFVTDFQVDPSGTVAVASCYTGKLKVVTVSD-GKLDTAFDVSLPELNLLAFTFLHTGQ 174
Query: 122 --KPTIVVLYQDNKD-----ARHVKTYE-------------VALKDKDF---------VE 152
+ ++ +L+ +++ R V E L DK F V
Sbjct: 175 EDRYSLALLHLNHRRQIQLLCRDVDLAEFELSPVHSNILLTATLSDKTFPSLESPLMLVP 234
Query: 153 GPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVY------------------------- 187
P + + D G D P GVLI+G I++
Sbjct: 235 IPQHETDADEGDD---DAPAHRGGVLILGGRKILFYEHSTEEQQETRKEKHRRLSKRLAS 291
Query: 188 ------------CSANAFKAIPIRPSI------TKAYGRVDADGSRYLLGDHAGLLHLLV 229
+ I R S+ A+ VDA+ R LLGD G L ++
Sbjct: 292 EDEAKAQEAKKKEKERESRRIKPRASVKWPWAAVTAWTAVDAERRRVLLGDAYGRLAMIA 351
Query: 230 ITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQP--------- 280
K L + LGE S + +SYL + V+Y+GS YGDSQL++++ P
Sbjct: 352 FDDGK---MCLNLIPLGEASSPTCLSYLSSQVLYLGSHYGDSQLLRIHPTPFANATVDTL 408
Query: 281 ------------------------------------------DAKGSYVEVLERYVNLGP 298
+ +G+++EVL+ + N+ P
Sbjct: 409 PIPRGVSSVPPSSLAGSGKGKAKATSGVGDAESGSAREGRVVNTQGTFLEVLQNFDNIAP 468
Query: 299 IVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVE-LQGIKGMWSLRSST 357
I+D + D++ GQ QV+T SG GSLR++R EQA ++ L G+ +W +++ +
Sbjct: 469 IMDAALADIDGSGQPQVITSSGGRNTGSLRVIRTEADFQEQARLDGLIGVTDIWPVKTHS 528
Query: 358 DDPFDTFLVVSFISETRILAMNLEDELE--ETEIEGFCSQTQTL---------------- 399
+P T LVVS + ET + A +D + + + GF + T
Sbjct: 529 AEPIHTHLVVSTLRETHVFAFEGKDAIAHLDPSVAGFTTHAPTFVLGNIPRRVVSASGTS 588
Query: 400 -FCHDAIYNQLVQVTSGSVRLVSSTS--------------RELRNEWKSPPGYSVNVATA 444
+ H ++ +VQ+TS ++LV +++ E+ G + A
Sbjct: 589 SYEHSSL---VVQITSEGIQLVEYEPTLFAFGKVGPGWYPKQVGGEYA---GREIVAAAM 642
Query: 445 NASQVLLATGGGHLVYLEIGDGILTEVKHAQ-LEYEISCLDINPIGENPSYSQIAAVGMW 503
+ SQ ++ GG L +G ++ + EI + P + +Y+ AV W
Sbjct: 643 SPSQFVVGLSGGRLALFNLGQKDTVQLLTTRNFPDEICAISCQPYDSSKNYASAIAVSFW 702
Query: 504 TDISVRIFSL-PDLNLITKEHLGGEIIPRSVLLCAF---------EGISYLLCALGDGHL 553
V + +L P L T +PR VLL F + +LL L DG L
Sbjct: 703 GSNKVAVLALDPSSPLATVCDTALPTLPRVVLLHNFGTGGRPKDPDFHPHLLVGLADGTL 762
Query: 554 LNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNV 613
+ + L + G+L DRK+ LG P +L T VFA+ R +V++ +++ S V
Sbjct: 763 VTYAL--RDGKLHDRKQSGLGNAPASLSVCDVDGRTVVFASGARSSVLFWDRQRVRPSPV 820
Query: 614 NLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG-EHPRRICHQEQS 672
++K++ N+AAFP LAIA L IGT+ + K+ IR+IPLG + PRR+ +
Sbjct: 821 SVKDMIKGVTLNTAAFPSCLAIATSSALLIGTVRGLDKMQIRTIPLGLDSPRRLGYHAGQ 880
Query: 673 RTFAICSLKNQSCA----EESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSC-SFSDD 727
F + + EE F +LLD +F+ + + E S+L+ +
Sbjct: 881 GVFGVACTRTTPDRIGDYEEVTGSF-KLLDAHSFQQLHNFLCQADEEPSSVLTLPAHGSG 939
Query: 728 SNVYYCVGTAYVLPEENEPTKGRILVFIVEDGK---------LQLIAEKETKGAVYSLNA 778
++ + +GT Y+ PEE EP+KGRIL+F V + L +A G VY+L
Sbjct: 940 ASPAFALGTVYIRPEEREPSKGRILLFSVSSTEGARGANVRSLHTLASVNVGGCVYALAN 999
Query: 779 FNGKLL-AAINQKIQLYKWMLRDDG-----TRELQSECGHHGHILALYVQTRGDFIVVGD 832
+ L+ AAIN + L+K + G + E +E H+ + + V G+ I+VGD
Sbjct: 1000 LSENLIVAAINTSVVLFKSTENEAGESTPLSLEKVTEWNHNHFVTNVVVD--GERILVGD 1057
Query: 833 LMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGAT 892
+ S+S+L + +E ARDY W A+E + + +GA + NLF+ S
Sbjct: 1058 AISSVSVLKWNERLERLESIARDYGPLWPIAIEGTGNGL-IGANADCNLFSFSLQSV--- 1113
Query: 893 DEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTV----IFGTVNGVIGVIAS 948
R LE G YHL + N+F G+L +DV + V +F T G IG I
Sbjct: 1114 -PHRTYLEKDGVYHLNDVTNKFVRGALT----STDVAEDQVVKASHVFFTSTGCIGAILD 1168
Query: 949 LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAK---NFLDGDLIESFL 1004
+ L + LQ N+ K + G GG NH + R+ + + DA+ FLDGD +E +L
Sbjct: 1169 MNDVTSLHMTALQRNMAKTLTGPGGDNHTKLRAPSTPRGHTDAEASYGFLDGDFLEQYL 1227
>gi|167390599|ref|XP_001739420.1| DNA damage-binding protein [Entamoeba dispar SAW760]
gi|165896898|gb|EDR24200.1| DNA damage-binding protein, putative [Entamoeba dispar SAW760]
Length = 1088
Score = 315 bits (807), Expect = 9e-83, Method: Compositional matrix adjust.
Identities = 282/1056 (26%), Positives = 513/1056 (48%), Gaps = 87/1056 (8%)
Query: 22 AQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDNGQIGIIDPDCRLIGLH 81
++D+L T + + Q + ++ EL ++ DR+GR G G++ + L+
Sbjct: 76 SKDYLIFVTNNQQLVIAQLN-DNKELSLVYSNNIEDRLGRTAFYGISGVVISNSHLLLHL 134
Query: 82 LYDGLFKVIPFDNKGQ-LKEA-FNIRLEELQVLDIKFLYGCAKPTIVVLYQDNKDARHVK 139
+ K+IP + + L EA FN+R++E +V+ + L K I + ++ D + +
Sbjct: 135 YNHLI-KIIPLPREEENLPEAPFNLRIDEQRVVGLYCLQLDQKQVIGIHHEIKNDIKVIT 193
Query: 140 TYEVALKDKDFVEGPWSQNNLD-NGADLLIPVPPPLCGVLIIGEETIVYC-SANAFKAIP 197
YE ++ K +++ D + DL IP G I +T+ + ++N ++
Sbjct: 194 FYEFNIESKTL--NLYNKQIEDTSNIDLFIPYLGNE-GYFKINSQTVSFVPNSNQISSVS 250
Query: 198 IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELL------GETSIA 251
+RP + Y +D R LL D+ G +HL+ + + +IEL +I
Sbjct: 251 MRPIHLETYCFLDE--KRLLLSDNKGKMHLITLKENNKH----EIELYYYPLEYQALTIP 304
Query: 252 STISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQG 311
STI YLDN+V++ GS GDS LIK+N + E+LE + N GPI+D + E
Sbjct: 305 STILYLDNSVLFWGSKGGDSHLIKINEKR------CEILETFENRGPILDMTAIHDEITN 358
Query: 312 QGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDPFDTFLVVSFIS 371
+ ++ C Y+ G+L+++ +G+GI+ E++GI ++ + + +L++S+
Sbjct: 359 KDDLLMCCNTYQQGTLKLISSGVGIDIICQNEMKGITHLYQVEMGNKE----YLIISYSD 414
Query: 372 ETRILAMNLED---ELEETEIEGFCSQTQTLFCHDAIYNQ------LVQVTSGSVRLVSS 422
+++ E+ + E EI+GF + +T+ C I VQVT V + S
Sbjct: 415 NSKVFESQQENNQLQFNEIEIKGFNRKEETI-CSGIIEIGEMKEICFVQVTREEVNIFDS 473
Query: 423 TSRELRNEWKSPPGYS--------VNVATANASQVLLATGGGHLVYLEIGDGILTEVKHA 474
+L + + S + ++ +N VL E+KH
Sbjct: 474 EQLKLIQQIRVNKFISHSCIYDEKLYISQSNEINVLNEK---------------KEIKHI 518
Query: 475 QL-EYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSV 533
E +IS I G + + W + I L + N + G I +S+
Sbjct: 519 YSGENDISAFTIGKEGT----EDVIGICYWDSNKMEIIKLENGNKNHIREINGNIYGKSI 574
Query: 534 LLCAFEGISYLLCALGDGHLLNF----LLNMKTGELTDRKKV-SLGTQPITLRTFSSKNT 588
+ + Y++ +GDG +L + L+ ++ GE+T KV +G I L+T
Sbjct: 575 EIINNKEGMYIVIGMGDGRVLVYQMEELMEIE-GEITKNYKVLDIGLSGIELKTLEINGM 633
Query: 589 THVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDD 648
++ DRPT++ K+ +VN E+ + PF +SL +A + + IG I++
Sbjct: 634 KYIMCLCDRPTLVSIYKGKIKTLSVNCSEIVSIVPFKMKECMNSLIVATKENIIIGNIEE 693
Query: 649 IQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEES-EMHFVRLLDDQTFEFIST 707
IQ +H +S+ +G RI +Q + S + + E+ E H ++L+D + E +
Sbjct: 694 IQSIHTKSLSIGVFISRIVISKQENKALLLSSEIKELKEKRIESHSIKLIDFRKMEIEDS 753
Query: 708 YPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEK 767
+ L E+ SI D + +GTA+ P E EP+ GRIL+ ++DGKL+++ EK
Sbjct: 754 HELKENEHALSIEEIVV--DEKEMFVIGTAFAKPNEVEPSSGRILIVQIKDGKLEIVFEK 811
Query: 768 ETKGAVYSLNAFNGKLLA-AINQKIQLYKWM-LRDDGTRE--LQSECGHHGHILALYVQT 823
+ GAVYS+ K LA +I +K+ ++++ + +G E LQ + + ++ LYV+T
Sbjct: 812 DVNGAVYSIKTLLKKYLAMSIEKKLVIFEYQRIITNGEFEVKLQEKGSCNVKLIGLYVKT 871
Query: 824 RGDFIVVGDLMKSISLLIY---KHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFN 880
G+ I+VGDLMKSIS+ + + + + E +RD+ A++ +A+E +D++ YL +++N N
Sbjct: 872 MGNKILVGDLMKSISVYSFDNNGNNKNCLNEVSRDFYASYTTAIEFVDENCYLSSDSNSN 931
Query: 881 LFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVN 940
L NS G + ER RL H+GE +N GS+ + Q ++FG V
Sbjct: 932 LLVFNTNSTG-NESERFRLNNCAHIHVGECINVMCKGSIAPTHSTYETIQKKCILFGGVT 990
Query: 941 GVIGVIASLPHEQYLFLEKLQTNLRKVIKG-VGGLNHEQWRSFNNEKKTVDAKNFLDGDL 999
G IG I +P+E Y L K+Q + +KG V ++W+ ++ K + + N +DG++
Sbjct: 991 GYIGGICEIPNEIYDILIKVQNQILLQMKGIVECTTPDEWKKVIDDWKRMPSSNIIDGNI 1050
Query: 1000 IESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRL 1035
+ES+L++S+ + EI+ V+ E++ +E + L
Sbjct: 1051 VESYLEMSKEKQCEIAHLSGVNEEQINDIIENMISL 1086
>gi|119191318|ref|XP_001246265.1| hypothetical protein CIMG_00036 [Coccidioides immitis RS]
Length = 1072
Score = 312 bits (799), Expect = 8e-82, Method: Compositional matrix adjust.
Identities = 228/773 (29%), Positives = 388/773 (50%), Gaps = 86/773 (11%)
Query: 6 IYGRIATLE-LFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPT 63
IYG+I+ L+ + R H A D LF+ T+RY + L WD + +L T + D++D R
Sbjct: 55 IYGKISVLQKVPRSHSSATDLLFVGTDRYAYFTLSWDPSTCQLHTEQKYLDIADPSLRDN 114
Query: 64 DNGQIGIIDPDCRLIGLHLYDGLFKVIPF-------------------DNKGQLKEAFNI 104
+G +DP + + + +Y+G+ VIP + + L E
Sbjct: 115 QSGDRSWVDPSGKFLTMEIYEGIISVIPIAQEPLKRPSPSAGRSSGTSEQREYLGEPLQT 174
Query: 105 RLEELQVLDIKFLY--GCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEG-PWSQ---- 157
R+EEL V FL+ P I +LY++ + +K L+D + G P +
Sbjct: 175 RIEELIVRSTAFLHHDPTKPPRIAILYENTQGKVKLK-----LRDLIYSRGIPGGEASAA 229
Query: 158 ---------NNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIRP----SITK 204
++L+ GAD+L+PVP PL GVLI+GE+ I Y + I RP +I
Sbjct: 230 EFRDVDDLYDDLELGADILVPVPLPLGGVLILGEKFIKYIDTVKNETI-TRPLEHNTIFV 288
Query: 205 AYGRVDADGSRYLLGDHAG-LLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVY 263
A+ ++D R+LL D G L ++I + V K+ LLGETS AS + +L VV+
Sbjct: 289 AWEQLD--NQRWLLADDYGRLFFFMLILNSANAVQSWKVGLLGETSRASALVHLGGGVVF 346
Query: 264 IGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQG----------QG 313
+GS GDS +I++ +GS+ E+++ N+GPI+DF V+DL +G Q
Sbjct: 347 LGSHQGDSHVIRIT-----EGSF-EIIQTLSNIGPILDFTVMDLGNRGETPTHEFSSGQA 400
Query: 314 QVVTCSGAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLRSSTDDPFDTFLVVSFISE 372
++VT SGA++DGSLR VR+G+G+ + + ++ I +W L + + F L++SF+ E
Sbjct: 401 RIVTGSGAFRDGSLRSVRSGVGMEDLGVLGAMEHITDLWGLSAFCPEGFCDTLLLSFVDE 460
Query: 373 TRILAMNLEDELEETE-IEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEW 431
+R+ + + E+EE + G T+ + +++QVT R+ SR EW
Sbjct: 461 SRVFHFSPDGEVEEKDDFLGLLLGEPTIHAANLPSRRILQVTEHGARVTDVESRMTLWEW 520
Query: 432 KSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGI-LTEVKHAQLEYEIS--CLDINPI 488
+ + A++N ++L GG L+ +IGD I ++ K + + ++S L +PI
Sbjct: 521 SAVESRKITAASSNDRHLVLMVGGQKLMVFDIGDDIKMSSTKTFEADKQVSGVALTSSPI 580
Query: 489 GENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLG--GEIIPRSVLL-CAF-EGISYL 544
Q + V I L LN+ E LG G+ +PRSVL+ C F + L
Sbjct: 581 -------QACILCFPQSAEVTIIDLTGLNIRHTETLGEPGDAVPRSVLVACMFSDRAPTL 633
Query: 545 LCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTH-VFAASDRPTVIYS 603
A+ DG + +F LN+ L+D K+ LG++ + + + +FA D P++IY+
Sbjct: 634 FVAMADGSVFSFSLNVANYSLSDANKLVLGSEAPVFKLLPRADGLYNIFATCDHPSLIYA 693
Query: 604 SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHP 663
S +++YS VN + + +C FN+ A+P ++A+A ++ I +D + I+++ + E
Sbjct: 694 SEDRIVYSAVNSDKATRICHFNAEAYPGAIAVATPDDIKIALVDAERTTQIQTLMINETV 753
Query: 664 RRICHQEQSRTFAICSLKNQ--SCAEESEMHFVRLLDDQTFEFISTYPLDTFE 714
RR + R F + +++ EE + HF+ L D+ F +S + L+ E
Sbjct: 754 RRTSYSSTERAFGLGTIQRTLVQNVEEVKSHFI-LADEIMFRQLSVFDLNPNE 805
Score = 105 bits (261), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 71/241 (29%), Positives = 120/241 (49%), Gaps = 12/241 (4%)
Query: 805 ELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGA----IEERARDYNANW 860
E++ E + + + G+ I V DLMKSISL+ Y EG ++E AR Y W
Sbjct: 835 EIEKEASYRTSTAPVDISVTGNIIAVADLMKSISLVEYHAGEGGQPDTLKEVARHYQTLW 894
Query: 861 MSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLV 920
+A + ++ +L A+ NL + +++ G T+++R R++V E LGE VNR H +
Sbjct: 895 TTAAAPVAENEFLVADAEGNLVVLNRDTTGVTEDDRRRMQVTSELRLGEMVNRI-HPMDL 953
Query: 921 MRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWR 980
P+S V IP TV+G I + + L +LQ+ L + G + ++R
Sbjct: 954 QTSPESPV--IPKAFLATVDGSIYLFGLISPSAQDTLMRLQSALADFVASPGEIPFNKYR 1011
Query: 981 SFNNE-KKTVDAKNFLDGDLIESFL----DLSRTRMDEISKTMNVSVEELCKRVEELTRL 1035
+F + ++ + F+DG+LIE FL D+ + + V+V E+ +E L R+
Sbjct: 1012 AFKSSVRQAEEPFRFVDGELIEQFLTFPPDIQEAALARMDGGGRVNVIEIKGMIEGLKRM 1071
Query: 1036 H 1036
H
Sbjct: 1072 H 1072
>gi|380481704|emb|CCF41690.1| CPSF A subunit region, partial [Colletotrichum higginsianum]
Length = 932
Score = 308 bits (788), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 256/968 (26%), Positives = 449/968 (46%), Gaps = 115/968 (11%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPTD 64
+YG I L+ RP D LF+ T+R+++ +WD + L T + + D ++ R
Sbjct: 7 VYGTILFLQRLRPKDTKADLLFVGTDRFQYFTARWDPATQRLQTEQVIEDAAEPHMRDAQ 66
Query: 65 NGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKG----QLKEAF-NIRLEELQVLDIKFL-Y 118
+ ++DP + + +HL++G+ V+ + +L +++ +RL EL V F+
Sbjct: 67 SQDRCLVDPTGKFMAMHLWEGVLNVMRLGTRKGTFTRLDDSWERVRLSELFVKASTFVPT 126
Query: 119 GCAKPTIVVLYQDN--KDARHVKTYEVALKDKDFVEGPWSQNN--------LDNGADLLI 168
PT+ LYQ K+ H+ Y + DK+ + + D A ++I
Sbjct: 127 ETGHPTVAYLYQSQIEKEDAHLAIYRLMSDDKNTTVSRFDPSRDREFELEIKDPYARIVI 186
Query: 169 PVP------------------PPLCGVLIIGEETIVYCSANAFKAIPI---RPSITKAYG 207
PVP L G++++GE +VY + P+I A+
Sbjct: 187 PVPIVEDEVKRYHKRDTTGAKAQLGGLIVVGETLLVYVDTLTRTVVESGLNSPAIFVAWA 246
Query: 208 RVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSS 267
D + Y L D G LHLL I E VT L + LLG TS AS + ++ N ++++GS
Sbjct: 247 AYD--DTNYFLSDDYGNLHLLTIETEGVVVTNLSLRLLGVTSRASCLVHMGNGLLFLGSH 304
Query: 268 YGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLER-----------QGQGQVV 316
YGDSQL+++N++ K V+ + ++ PI+DF ++DL GQ ++V
Sbjct: 305 YGDSQLLQINME-SLKTRLVQTIP---SIAPILDFSIMDLGNAGDSQVGNAFSSGQARIV 360
Query: 317 TCSGAYKDGSLRIVRNGIGINEQASVE-LQGIKGMWSLRSSTDDPFDTFLVVSFISETRI 375
G +++GSLR +R+ +G+ + +E LQ ++G++SLRS DT LVVSFI+ETRI
Sbjct: 361 AGCGVHQNGSLRSIRSSVGLEDIGVLEDLQDVRGLFSLRSHGSPKVDT-LVVSFITETRI 419
Query: 376 LAMNLEDELEET-EIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSP 434
+ + E +EE E +G TL +L+QVTS +V L+ S N W P
Sbjct: 420 FSFDPEGGIEEVFEFQGLALDRPTLVATTLPDGRLLQVTSTTVTLLESERGITLNTWAVP 479
Query: 435 PGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKH-----AQLEYEISCLDINPIG 489
G ++ A+AN VLL+ G LV L + + + + + E +ISC I
Sbjct: 480 DGKAIVNASANNKWVLLSINGTTLVSLNLLNNLSAQEQVLGRDIGGHEDQISC-----IH 534
Query: 490 ENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGE----IIPRSVLLCAFEGISYL- 544
+ VG W SV I L L+ + E + +PR + L L
Sbjct: 535 AASDLDDVGVVGFWATGSVSIIDLRTLDALHGETIKQTDDSVSVPRDLALVQLHPPHLLG 594
Query: 545 ---LCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTF---SSKNTTHVFAASDRP 598
A+ DG +++F ++ + L+ RK V+LG+Q L + +++FA ++
Sbjct: 595 PTLFVAMEDGQVVSFNVSKQDFSLSSRKSVTLGSQQAGLHVLPRPGGEGISNIFATTEHS 654
Query: 599 TVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIP 658
++IYSS +++YS ++V+++ PF+S AFPD++ +A + + I ID ++ H+ +P
Sbjct: 655 SLIYSSEGRIIYSAATAEDVTYIAPFDSEAFPDAIFLATDQNIRIAHIDAERRTHVNPLP 714
Query: 659 LGEHPRRICHQEQSRTFAICSLKNQSC-AEESEMHFVRLLDDQTFEFIST-YPLDTFEYG 716
L E RR+ + R F I ++ + EE RL+D+ + + LD
Sbjct: 715 LRETVRRVAYSPALRAFGIGTISRELVNNEEVVTSSFRLVDEIVLGVVGKPFHLDG---- 770
Query: 717 CSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSL 776
S + + V A + +P AE+ GA SL
Sbjct: 771 --------STSTEMVESVIRAELPDSMGQP------------------AERFIIGACRSL 804
Query: 777 NAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKS 836
KL+A +++ + +Y++ + L+ + + G+ I + DLM+S
Sbjct: 805 GIMGEKLVAGLSKTVVVYEYAEESSTSGALRKLATFRPSTFPVDIDVNGNMIGIADLMQS 864
Query: 837 ISLLIY----KHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGAT 892
++L+ + + + ERAR + W ++V L+ +L A+ NL +R+N + T
Sbjct: 865 MTLVEFIPAQDGNKAKLVERARHFQYIWATSVCHLEGHSWLEADAQGNLMVLRRNPDAPT 924
Query: 893 DEERGRLE 900
+ ++ ++E
Sbjct: 925 EHDQKQME 932
>gi|449549048|gb|EMD40014.1| hypothetical protein CERSUDRAFT_63520 [Ceriporiopsis subvermispora B]
Length = 1265
Score = 303 bits (776), Expect = 4e-79, Method: Compositional matrix adjust.
Identities = 239/797 (29%), Positives = 383/797 (48%), Gaps = 84/797 (10%)
Query: 283 KGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASV 342
KG ++EVL+ + N+ PIVD + DL+ GQ Q++TCSG G+L++VR G E A V
Sbjct: 450 KGQFIEVLDTFQNIAPIVDAVLADLDDSGQSQIITCSGGRNSGALKVVRTGADFQELARV 509
Query: 343 E-LQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEE--TEIEGFCSQTQTL 399
+ GI +W +RS + D+ LV S +ET + + D + + + +GF + TL
Sbjct: 510 NGITGITSLWPVRSRFEHSTDSHLVASTETETYVFKFDSSDVITQLDSSADGFITTAPTL 569
Query: 400 FCHD--------------AIY---NQLVQVTSGSVRLVSSTSR----ELRNEWKSPPGYS 438
+ + Y + ++QVT + L+ + L E P
Sbjct: 570 AAANIPRRVSTNAGGRVSSSYVDSSLVIQVTPEMITLLEYDAALGLFSLVGEGWDPKSQG 629
Query: 439 -------VNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQL-------EYEISCLD 484
+ A NASQ+++ GG + L + D +V+ ++ ++S +
Sbjct: 630 AIGGRRRIVAADVNASQIVVGLDGGKVCLLNLADNSRFQVQRSRGFADPVYGPLDVSAVS 689
Query: 485 INPIGENPSYSQIAAVGMWTDISVRIFSLP--DLNLITKEHLGG-EIIPRSVLLCAF--- 538
P +++ AV W V+I S+ D L T + G +PRSVLL F
Sbjct: 690 CVPFDRTKNFATNIAVAFWGTNKVQILSISSQDATLATVCEVSGLPSLPRSVLLHNFGTG 749
Query: 539 ------EGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
+ ++L L DG +++F ++ EL ++K SLG P++L VF
Sbjct: 750 RTKKEPDFHPHVLVGLVDGSVISF--SVVENELKEKKVFSLGIAPVSLSRCEVDGKITVF 807
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
A R +V+Y ++L S V +K+++ N++ F SL +A L IGTI + K+
Sbjct: 808 AVGSRTSVLYWDKQRLTNSPVMIKDMAVGTSLNTSYFRSSLVLAASSGLIIGTIRGVDKM 867
Query: 653 HIRSIPLG-EHPRRICHQEQSRTFAI-C--SLKNQSCAEESEMHFVRLLDDQTFEFISTY 708
IRSIP G PRRI + + + F + C ++ + + +++D TF + +
Sbjct: 868 QIRSIPFGLSDPRRIAYHSRLKLFGVGCNRTMPLRPGEFQGTTSSFKIIDATTFNGLWDF 927
Query: 709 PLDTFEYGCSILSCS-FSDDSNVYYCVGTAYVLPEENEPTKGRILVFIV--------EDG 759
L E S+++ +D + +CVGT + EE EPT GR+L+F + DG
Sbjct: 928 ELQANEEVSSVMALPDGTDGRSPCFCVGTVFFEVEETEPTSGRLLLFAIGSDGATSSADG 987
Query: 760 KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTR-ELQSECGHHGHILA 818
+L+L+ ++ KG V+ + + N + AAIN + L+ LRD + LQ + +
Sbjct: 988 ELRLVTTQDVKGCVFQITSVNNFIAAAINSNVVLFA--LRDTNKQYALQQVADWNHNYFV 1045
Query: 819 LYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENN 878
+ + GD ++VGD + S+SLL + IE +RDY+ W AVE ++ +GA ++
Sbjct: 1046 TNLASHGDRLIVGDAISSVSLL--RVSVARIECLSRDYSPLWPVAVEATAENQIIGANSD 1103
Query: 879 FNLFTVR-KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQI--PTVI 935
NLF+ ++ +G R LE G YHL + VN+F G LV DS G P +
Sbjct: 1104 CNLFSFALQHIDG-----RKVLERDGSYHLDDIVNKFAPGGLVA--ADSSTGYTLRPRQL 1156
Query: 936 FGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAK--- 992
F + +G IGVI + E L L LQ N+ K IKG G NH +WR+ N + DA+
Sbjct: 1157 FFSSSGRIGVIIDVDDELSLPLTSLQMNMAKRIKGPGDTNHTEWRAPTNARGRTDAEASA 1216
Query: 993 -NFLDGDLIESFLDLSR 1008
FLDGD IE FL R
Sbjct: 1217 FGFLDGDFIEQFLTHPR 1233
Score = 63.2 bits (152), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 34/78 (43%), Positives = 50/78 (64%), Gaps = 5/78 (6%)
Query: 205 AYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYI 264
A+ +D + R+L+GD G L LLV+ TGL + LGETS +T+SYLD+ VVY+
Sbjct: 318 AWCPLDEESRRFLVGDAYGRLALLVL-----DTTGLIVSPLGETSPPTTLSYLDSQVVYV 372
Query: 265 GSSYGDSQLIKLNLQPDA 282
GS G+ QL++++ P A
Sbjct: 373 GSHMGNPQLLRIHTSPIA 390
>gi|170589359|ref|XP_001899441.1| Xeroderma Pigmentosum Group E Complementing protein [Brugia malayi]
gi|158593654|gb|EDP32249.1| Xeroderma Pigmentosum Group E Complementing protein, putative
[Brugia malayi]
Length = 521
Score = 303 bits (775), Expect = 4e-79, Method: Compositional matrix adjust.
Identities = 186/446 (41%), Positives = 277/446 (62%), Gaps = 29/446 (6%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
++GRIAT++LFR GE D L I T +Y +++W +SEL TRA G + DR+GRP++
Sbjct: 57 VFGRIATIKLFRAPGENVDSLLILTAKYHLAIIRW-TPTSELRTRASGHIVDRVGRPSET 115
Query: 66 GQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGCAKPTI 125
G I + ++ LYDGL KV+ + N+G+ FN+R ++L ++DI F+ +PT+
Sbjct: 116 GMIATVHSSGLMV-FRLYDGLLKVVQW-NEGKDLRGFNVRCDDLYIVDITFMSDPDRPTL 173
Query: 126 VVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETI 185
+YQD+ + RH+K + + DK+ W +NL+ A+++I VP P+ G LI G + I
Sbjct: 174 AYIYQDD-NGRHIKVVALNIDDKELSSPLWKHDNLEGEANIVIGVPEPVGGCLIAGPDAI 232
Query: 186 VYCS----ANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEK-- 236
Y A + +P + + Y VD DG RYLL D AG L++L++ K++
Sbjct: 233 SYHKGGDDALRYAGVPGSRLHNTHPNCYAPVDRDGQRYLLADLAGNLYMLLLELGKDQEQ 292
Query: 237 -------VTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGS-YVE 288
V +K+E LGET IA + YLBN V +IGS +GDSQLI+L+ +P A G+ Y+
Sbjct: 293 DENSAMYVRDMKVESLGETCIAECMCYLBNGVCFIGSRFGDSQLIRLSTEPRADGTGYIS 352
Query: 289 VLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIK 348
+L+ Y NL PI D V + GQ Q++TCSGAYKDG++RI+RNGIGI E ASVEL+GIK
Sbjct: 353 LLDSYTNLAPIRDMTV--MRCNGQQQILTCSGAYKDGTIRIIRNGIGIEELASVELKGIK 410
Query: 349 GMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQ 408
M++LR+ DD FD +L++SF SET +L +N E ELE+TEI GF TL+ +++
Sbjct: 411 NMFTLRTR-DDEFDDYLILSFDSETHVLLINGE-ELEDTEITGFTVDGATLWAGCLFHSK 468
Query: 409 -LVQVTSGSVRLVSSTSRELRNEWKS 433
++QVT G V L+ + ++ WKS
Sbjct: 469 TILQVTHGEVILIDGDNIQV---WKS 491
>gi|312076588|ref|XP_003140928.1| xeroderma Pigmentosum Group E Complementing protein [Loa loa]
Length = 516
Score = 295 bits (754), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 183/449 (40%), Positives = 273/449 (60%), Gaps = 29/449 (6%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRP 62
+ P++GRIAT++LFR GE D L I T +Y +++W +SEL TRA G + DR+GRP
Sbjct: 54 ECPVFGRIATIKLFRAPGEDVDSLLILTAKYHLAIIRW-TPTSELRTRASGHIVDRVGRP 112
Query: 63 TDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGCAK 122
++ G I + ++ LYDGL KV+ + N+G+ FN+R ++L ++DI F+ +
Sbjct: 113 SETGMIATVHSSGLMV-FRLYDGLLKVVQW-NEGKDLRGFNVRCDDLYIIDITFMSDPDR 170
Query: 123 PTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGE 182
PT+ +YQD+ + RH+K + ++DK+ W +NL+ A ++I VP P G LI G
Sbjct: 171 PTLAYIYQDD-NGRHIKVVTLNIEDKELSSPLWKHDNLEGEASMVIGVPEPAGGCLIAGP 229
Query: 183 ETIVYCS----ANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE 235
+ I Y A + +P + + Y VD DG RYLL D AG L++L++ K
Sbjct: 230 DAISYHKGGDDALRYAGVPGSRLHNTHPNCYAPVDRDGQRYLLADLAGNLYMLLLEFGKG 289
Query: 236 K---------VTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGS- 285
+ V +K+E LG T IA + YLDN V +IGS +GDSQLI+L+ +P A G+
Sbjct: 290 QEQDESSTVSVKDMKVESLGNTCIAECMCYLDNGVCFIGSRFGDSQLIRLSTEPRADGTG 349
Query: 286 YVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQ 345
Y+ +L+ Y NL PI D V + GQ Q++TCSGAYKDG++RI+RNGIGI E ASVEL+
Sbjct: 350 YISLLDSYTNLAPIRDMTV--MRCNGQQQILTCSGAYKDGTIRIIRNGIGIEELASVELK 407
Query: 346 GIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAI 405
GIK M++LR+ D FD +L++SF S+T +L +N E ELE+T+I GF TL+
Sbjct: 408 GIKNMFTLRTR-DHEFDDYLILSFDSDTHVLLINGE-ELEDTQITGFVVDGATLWAGCLF 465
Query: 406 YN-QLVQVTSGSVRLVSSTSRELRNEWKS 433
+ ++QVT G V L+ + ++ WK+
Sbjct: 466 QSTTILQVTHGEVILIDGDNIQI---WKA 491
>gi|449540702|gb|EMD31691.1| hypothetical protein CERSUDRAFT_109269 [Ceriporiopsis subvermispora
B]
Length = 1265
Score = 288 bits (737), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 234/801 (29%), Positives = 383/801 (47%), Gaps = 92/801 (11%)
Query: 283 KGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASV 342
KG ++EVL+ + N+ PI+D + DL+ GQ Q++TCSG G+L++VR G E A V
Sbjct: 450 KGQFIEVLDTFQNMAPIMDAVLADLDDSGQSQIITCSGGRNSGALKVVRAGADFQELARV 509
Query: 343 E-LQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEE--TEIEGFCSQTQTL 399
+ GI +W +RS ++ D+ LV S +ET + + D + + + + F + TL
Sbjct: 510 NGITGITSLWPVRSRSEHSTDSHLVASTETETYVFKFDSSDVITQLDSSADEFITTAPTL 569
Query: 400 FCHDAIYNQLVQVTSGSVRLVSSTSRE-------------------------LRNEWKSP 434
A+ N +V++ + VSS+ + L E P
Sbjct: 570 ----AVANIPRRVSTNASGRVSSSYVDSSLVIQVTPEKITLLEYGAALGLFSLVGEGWDP 625
Query: 435 PGYS-------VNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQ-------LEYEI 480
+ A NASQ+++ GG + L + D +V+ ++ ++
Sbjct: 626 KSQGAIGGRRRIVAADVNASQIVVGLDGGKVCLLNLADNSRFQVQRSRDFADPVYGPLDV 685
Query: 481 SCLDINPIGENPSYSQIAAVGMWTDISVRIFSLP--DLNLITKEHLGG-EIIPRSVLLCA 537
S + P +++ AV W V+I S+ D L T + G +PRSVLL
Sbjct: 686 SAVSCVPFDRTKNFATNIAVAFWGTNKVQILSMSSQDATLATVCEVSGLPSLPRSVLLHN 745
Query: 538 F---------EGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNT 588
F + ++L L DG +++F + EL ++K SLG P++L +
Sbjct: 746 FGTGRTKKEPDYHPHVLVGLVDGSVISF--KVVENELKEKKVFSLGIAPVSLSRCEVDDK 803
Query: 589 THVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDD 648
VFA R +V+Y ++L S V +K+++ N++ F SL +A L IGTI
Sbjct: 804 ITVFAVGSRTSVLYWDKQRLTNSPVMIKDMTVGASLNTSYFRSSLVLAASSGLIIGTIRG 863
Query: 649 IQKLHIRSIPLG-EHPRRICHQEQSRTFAI-C--SLKNQSCAEESEMHFVRLLDDQTFEF 704
+ K+ IRSIP G +PR I + + + F + C ++ + + +++D TF+
Sbjct: 864 VDKMQIRSIPFGLTNPRWITYHSRLKLFGVGCNHTMPLRLGEFQGTTSSFKIVDATTFDG 923
Query: 705 ISTYPLDTFEYGCSILSCS-FSDDSNVYYCVGTAYVLPEENEPTKGRILVFIV------- 756
+ + L E S+++ +D + +CVGTA++ EE EP GR+L+F +
Sbjct: 924 LWDFELQANEEVTSVMALPDGTDGRSPSFCVGTAFLEVEETEPRSGRLLLFAIGSDGATS 983
Query: 757 -EDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTR-ELQSECGHHG 814
DG+L+L+A ++ KG V+ + + N + AAI+ + L+ LR+ + LQ +
Sbjct: 984 SADGELRLVATQDVKGCVFQITSVNSFIAAAISSNVVLFA--LRNTNKQYALQQVADWNH 1041
Query: 815 HILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG 874
+ + + GD ++VGD + S+SLL + + IE +RDY AVE ++ +G
Sbjct: 1042 NYFVTNLASHGDLLIVGDAISSVSLL--RVSDSRIECLSRDYGPLRPVAVEATAENQIIG 1099
Query: 875 AENNFNLFTVR-KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQI-- 931
A + NLF+ ++ +G R LE G YHL + V +F G LV DS G
Sbjct: 1100 ANSYCNLFSFALQHIDG-----RKVLERDGSYHLDDIVKKFVPGGLVA--ADSSTGYTLR 1152
Query: 932 PTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDA 991
P +F + +G IGVI + E L L LQ N+ K IKG G NH +WR+ N + DA
Sbjct: 1153 PRQLFFSSSGRIGVIIDVDDELSLPLTSLQMNMAKRIKGPGDTNHTEWRAPTNARGRTDA 1212
Query: 992 K----NFLDGDLIESFLDLSR 1008
+ FLDGD IE FL R
Sbjct: 1213 EASAFGFLDGDFIEQFLTHPR 1233
Score = 63.9 bits (154), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 70/296 (23%), Positives = 127/296 (42%), Gaps = 65/296 (21%)
Query: 47 LITRAMGDVSDRIGRPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEA-FNIR 105
L+T+ + +R GR + + P + + Y G+ KV+ F A F++
Sbjct: 100 LVTKGHVSLQERGGRVAEFVTDVFVHPGGEVAVVSCYAGMLKVVQFKEGNIDTHAHFDVS 159
Query: 106 LEELQVLDIKFLYGCAKPTIVVLYQ-DNKDARHVKTYEVALKDKDFVEGP--------WS 156
+ E+ +L + FL+ I+ + D++ + + ++ L++ + P S
Sbjct: 160 IPEMNILALNFLHSDDNQYILAIVNYDHQRRVQLLSRKLDLENYELDPSPSVVLRSTQLS 219
Query: 157 QNNLDN--GADLLIPVPPP-----LCGVLIIGEETIVY---CSANAFK------------ 194
++ + LL+P+ P L GVL++G I++ S N +
Sbjct: 220 SSHFPSIETPPLLVPILPEGDGGHLGGVLVLGGRKILFFEHTSKNRQQIKRWKQIRLESR 279
Query: 195 --------------------AIPIRPSIT--------KAYGRVDADGSRYLLGDHAGLLH 226
A ++P + A+ +D + R+L+GD G L
Sbjct: 280 LSSKDPMEVAKAREKEKERDARKVKPKFSVKWPWSDVTAWCPLDEECRRFLVGDAYGRLA 339
Query: 227 LLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDA 282
LLV+ TGL + LGETS +T+SYLD+ VVY+GS G+ QL++++ P A
Sbjct: 340 LLVL-----DTTGLIVSPLGETSPPTTLSYLDSQVVYVGSHMGNPQLLRIHTSPIA 390
>gi|387219217|gb|AFJ69317.1| dna damage-binding protein 1a, partial [Nannochloropsis gaditana
CCMP526]
Length = 383
Score = 288 bits (736), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 169/380 (44%), Positives = 224/380 (58%), Gaps = 31/380 (8%)
Query: 405 IYNQLVQVTSGSVRLV--SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLE 462
+ LVQVT SVRL+ +S S L +EW S G + VA NA QV+LA GG +VYLE
Sbjct: 4 VGEMLVQVTDASVRLLDLASASSPLLSEWISAHGTRITVAAGNAMQVVLALSGGEVVYLE 63
Query: 463 I--GDGILTEVKHAQLEYEISCLDINPI------GEN---------------PSYSQIAA 499
+ L E LE+E+SCL ++P+ GE+ P + + A
Sbjct: 64 LKANQRALEEKARIHLEHEVSCLSVHPLIPGPVPGEDGAEEAMQVEDGREEAPPSAFLVA 123
Query: 500 VGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEG-ISYLLCALGDGHLLNFLL 558
VG WTD+SVR+ +LP L + + LG + RSVLL + I YLL LGDG L++F +
Sbjct: 124 VGTWTDLSVRLLALPSLQSLHRCELGSDTQARSVLLITLQADIHYLLVGLGDGFLVSFAV 183
Query: 559 NM--KTGELTDRKKVSLGTQPITLRTFSSKNTTH-VFAASDRPTVIYSSNK-KLLYSNVN 614
+ KT L RKKVSLGTQP++L FSS VF S+RPTVI+ S KLLYSNVN
Sbjct: 184 ALEGKTPALGPRKKVSLGTQPLSLTPFSSTAAEPCVFVCSERPTVIHVSKADKLLYSNVN 243
Query: 615 LKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRT 674
EV+ M PF+SA FPD LA+A E L IGT+DDIQKL I+++PLGE PRRI H Q+
Sbjct: 244 TSEVTLMAPFHSALFPDCLALASETGLRIGTVDDIQKLRIQTVPLGESPRRIAHIPQAGV 303
Query: 675 FAICSLKNQSCAE-ESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
A+ + K E E ++VR LDD +FE + + L E CS+ +C+F+ D Y
Sbjct: 304 LAVLTAKYAVGENGEEETNYVRFLDDASFEAVGAFELRPMELACSVAACTFAKDPREYLV 363
Query: 734 VGTAYVLPEENEPTKGRILV 753
VGT L +E+EP +GR++V
Sbjct: 364 VGTCMALEDEDEPREGRLIV 383
>gi|225680146|gb|EEH18430.1| DNA damage-binding protein [Paracoccidioides brasiliensis Pb03]
Length = 1138
Score = 282 bits (721), Expect = 8e-73, Method: Compositional matrix adjust.
Identities = 235/819 (28%), Positives = 383/819 (46%), Gaps = 116/819 (14%)
Query: 15 LFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPTDNGQIGIIDP 73
LF P D LFI T+R + + W+ +S+L T R D++D R + G +IDP
Sbjct: 113 LFLPQ---TDVLFIGTDRATYFTVSWNPVTSQLRTERKYVDLADPSSRESQLGDRCLIDP 169
Query: 74 DCRLIGLHLYDGLFKVIPFDNKG-----------------------------------QL 98
+ I L LY+G+ VIP +L
Sbjct: 170 SGKFITLELYEGIITVIPIGQPQRTARQSGRKYGKRAVTAQNQHDSSHIGNTNAAGEVEL 229
Query: 99 KEAFNIRLEELQVLDIKFLYGCAK--PTIVVLYQDNKDARHVKTYEV------------- 143
E R++EL V FL+ A P + LY+D +K E+
Sbjct: 230 GEPCQARVDELLVRSSAFLHTQADMLPRMAFLYEDTMGQVRLKVRELEFTYGGMGIGTGG 289
Query: 144 ----------ALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYC--SAN 191
K D + + L+ GA LIPVP PL G+L++GE +I Y + N
Sbjct: 290 GAGQDTGCIAVFKALDLL-----KEELEMGASFLIPVPAPLGGLLVLGETSIRYLDDATN 344
Query: 192 AFKAIPI-RPSITKAYGRVDADGSRYLLGDHAG-LLHLLVITHEKEKVTGLKIELLGETS 249
++P+ +I A+ +VD G R+LL D G L L++I E V K++LLG
Sbjct: 345 ECISLPLDEATIFVAWEQVD--GQRWLLADDYGRLFFLMLILDEDNAVQSWKLDLLGNIP 402
Query: 250 IASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDL-- 307
AS + YL V +IGS GDSQLI++ +GS+ E+++ + N+ PI+DF ++DL
Sbjct: 403 RASVLVYLGGGVTFIGSHQGDSQLIRIT-----EGSF-EIIQTFSNIAPILDFTIMDLGG 456
Query: 308 ---ERQ------GQGQVVTCSGAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLRSST 357
E Q GQ ++VT SGA+ DGSLR VR+G+G+ E + ++ I +W+LR +
Sbjct: 457 RAGENQTHDFSSGQARIVTGSGAFDDGSLRSVRSGVGMEEVGVLGAMEHITDLWALRVAC 516
Query: 358 DDPFDTFLVVSFISETRILAMNLEDELEET-EIEGFCSQTQTLFCHDAIYNQLVQVTSGS 416
+ F L+VSF+ ETR+ + E+EE E G TL + +++QVT +
Sbjct: 517 QEGFSDTLLVSFVDETRVFHFTQDGEVEEKDEFMGLGLAESTLLAANLPNGRILQVTERN 576
Query: 417 VRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTEVKHAQ 475
VR+ L W P ++ A++N ++L GG L+ +I G+ L K
Sbjct: 577 VRVADLDDGMLLWNWSPPSQKAITAASSNDDHLVLVVGGQVLMCFDIQGEVKLAGKKDFG 636
Query: 476 LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLG--GEIIPRSV 533
+ ++S + + +P+ I + ++ V +L D+ + +G GE PRSV
Sbjct: 637 DDTQVSGVTVT---SSPATYCILCLPQTAEVVV--MNLEDMTIRHSTSVGEPGEAFPRSV 691
Query: 534 LLCAF--EGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGT-QPITLRTFSSKNTTH 590
L+ + L ++ DG + +F N L K+ LG+ QP + +
Sbjct: 692 LVAEVLPNQPATLFVSMADGRVFSFSFNADEFTLKKMSKLVLGSEQPSFKKLPRGDGLYN 751
Query: 591 VFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQ 650
+FA ++P++IY++ +++YS V+ + S +C FNS A+P S+A+A EL I +D +
Sbjct: 752 IFATCEQPSLIYATEGRIIYSAVHSDQASRICHFNSEAYPGSIALATPTELKIAHVDSER 811
Query: 651 KLHIRSIPLGEHPRRICHQEQSRTFAICSLKN--QSCAEESEMHFVRLLDDQTFEFISTY 708
I ++ + E RR+ + + F I ++K ++ AE F+ L D+ F + Y
Sbjct: 812 TTQIHTLEIDETVRRVAYSAAEKAFGIGTIKRTLENGAEVITSRFM-LADEIMFRELDDY 870
Query: 709 PLDTFEYGCSILSCSFSD--DSNV------YYCVGTAYV 739
L E S++ F + DSN + VGT+Y+
Sbjct: 871 SLRPDELVESVIQAQFPEGKDSNGNESFKDIFVVGTSYL 909
Score = 109 bits (273), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 71/231 (30%), Positives = 119/231 (51%), Gaps = 14/231 (6%)
Query: 820 YVQTRGDFIVVGDLMKSISLLIYKHEEG----AIEERARDYNANWMSAVEILDDDIYLGA 875
Y+ G+ I V DLMKS+S++ +K E ++ E AR + W +AV + ++++L +
Sbjct: 908 YLDDVGNLIAVADLMKSVSIIEFKQGENDQPDSLTEVARHFQTLWSTAVAPIAENMFLES 967
Query: 876 ENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQI---P 932
+ NL + +N G TD+++ RLEV E LGE VNR R S+ LP + + P
Sbjct: 968 DAEGNLVVLNRNVNGVTDDDKRRLEVTSEILLGEMVNRIRPVSIQGSLPATGPREAVISP 1027
Query: 933 TVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNE-KKTVDA 991
GTV G I + + L +LQ+ + ++ G + ++R+F N ++ +
Sbjct: 1028 KAFLGTVEGSIYLFGLINPAYQDLLMRLQSAMAGLVVTPGAMPFNKFRAFKNAVRQAEEP 1087
Query: 992 KNFLDGDLIESFLDLSRTRMDEI------SKTMNVSVEELCKRVEELTRLH 1036
F+DG+LIE F +EI +V+VE++ + VEEL R+H
Sbjct: 1088 YRFVDGELIERFWTCETALQEEIVGLVVAGGVADVTVEKVKRIVEELRRMH 1138
>gi|409049568|gb|EKM59045.1| hypothetical protein PHACADRAFT_181065 [Phanerochaete carnosa
HHB-10118-sp]
Length = 1268
Score = 277 bits (709), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 267/947 (28%), Positives = 412/947 (43%), Gaps = 155/947 (16%)
Query: 189 SANAFKAIPIRPSITKAYGRVDADGSR----------YLLGDHAGLLHLLVITHEKEKVT 238
+A K + R ++ + V A SR + +GD G L LL I + E
Sbjct: 308 AARELKKVKPRATVKWPWSEVTASASRWCPADDEMRRFFVGDVFGRLSLLTINDDPE--- 364
Query: 239 GLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQP------------------ 280
L I LGE S +T+SYL + V+Y+GS G+SQL++++ P
Sbjct: 365 -LIIIPLGEISSPTTLSYLSSQVLYVGSVLGNSQLLRISPSPVGDIDSDTLPIPGGIHTI 423
Query: 281 ----------------------------------DAKGSYVEVLERYVNLGPIVDFCVVD 306
+ KG ++E L +Y N+ PIVD + D
Sbjct: 424 KPAELSTLRAESPDEDYDMRDAFDTPEGRGGKIVNCKGRFIEELTQYANIAPIVDAVMAD 483
Query: 307 LERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWSLRSSTDDPFDTFL 365
+ GQ Q++TCSG GSL +VR G E A + E+ + +W +R+S D P DT++
Sbjct: 484 PDESGQPQIITCSGGANTGSLNVVRTGADFQELAVLNEIPNVTNIWPIRTSFDGPADTYV 543
Query: 366 VVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYN-----------------Q 408
+ + + E+ +L + D + T I+ ++ T A+ N
Sbjct: 544 LATTLYESFVLRFDDGDSV--TRIDPSATELVTNRPTIAVANIPRRTTQQNASTYINSSL 601
Query: 409 LVQVTSGSVRLVS---------------STSRELRNEWKSPPGYSVNVATANASQVLLAT 453
+VQVT + LV S ++ W++ + A+ N SQ +A
Sbjct: 602 VVQVTPQGLNLVEYDVALGAFNKVGDGWSLQKQENPLWRA---KEIVAASINPSQFAVAL 658
Query: 454 GGGHLVYLEIG-DGILTEVKHAQLE-YEISCLDINPIGENPSYSQIAAVGMWTDISVRIF 511
GG + + DG L ++ + EI+ L P+ + ++S AV W +V +
Sbjct: 659 NGGTFLLFNLSPDGQLNLLQTREFHGKEIAALSCAPLDPSKNFSPFVAVSFWGSNTVTLL 718
Query: 512 SLPDLNLITKEHLGGEIIP---RSVLLCAF-EGIS--------YLLCALGDGHLLNFLLN 559
S D L T H +P S+LL F G S Y++ L DG + ++
Sbjct: 719 STKDPALGT--HTESAPLPALAHSLLLHNFGAGRSTRDADFQPYVVAGLVDGTVA--CVS 774
Query: 560 MKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVS 619
+ EL D+K +LG P++L + V A R V Y ++L S V LK V+
Sbjct: 775 FRNNELRDQKLFALGAAPVSLAVSTVDGARMVLATGSRAAVFYWDRQRLRQSPVMLKNVA 834
Query: 620 HMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG-EHPRRICHQEQSRTFAIC 678
N+AAFP +A L +G I + K+ IRS LG E PRRI + F +
Sbjct: 835 VGAGLNTAAFPACQILATPSSLVVGQIRGVDKMQIRSFGLGHESPRRIAYHSDLNLFGVS 894
Query: 679 SLKNQSC----AEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCV 734
K+ AE E F + D TF IS + + E ++L+ + + V
Sbjct: 895 IAKSAPARVGEAELQESTF-EIKDSVTFSTISLFTAEPDEEITAVLA--LPAGAPGCFVV 951
Query: 735 GTAYVLPEENEPTKGRILVFIVE---DG--KLQLIAEKETKGAVYSLNAFNGKLLAAINQ 789
GT E EP+ GR+++F V DG +L+ +AE E G VY+L A + AA+N
Sbjct: 952 GTVKHQHGEFEPSAGRLILFGVVPSVDGGRELKKLAEAEANGCVYALAAVENGVAAAVNT 1011
Query: 790 KIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLL-IYKHEEGA 848
+ LY M+ +G L+ + + + RG ++ GD + S+S+L + +
Sbjct: 1012 SVDLYG-MVEHEGAHALEKVAAWNHNYFVTSLVARGGRLIAGDAISSVSVLEVLRGSH-- 1068
Query: 849 IEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLG 908
+ ARDY W AVE D +GA + NLFT + R LE G YHLG
Sbjct: 1069 LRTIARDYGPVWPVAVEATKDGGVIGANTDGNLFTF-----ALPEGPRAVLERNGHYHLG 1123
Query: 909 EFVNRFRHGSLVMRLPDSDVGQIPTV-----IFGTVNGVIGVIASLPHEQY-LFLEKLQT 962
E VN+F G+LV G ++ +F T +G +G++ + + L L LQ
Sbjct: 1124 ELVNKFIPGALVGNNSGGGSGGEGSLFETEQLFVTSSGRVGLVHHVCDDGVALALTALQR 1183
Query: 963 NLRKVIKGVGGLN-HEQWRSFNNEKKTVDAKN----FLDGDLIESFL 1004
NL V+ G H QWR+ N + DA++ FLDGDL+ESFL
Sbjct: 1184 NLGGVLGDGPGGTAHGQWRAPANARGRSDAEDGATGFLDGDLLESFL 1230
Score = 47.4 bits (111), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 34/133 (25%), Positives = 61/133 (45%), Gaps = 6/133 (4%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERY--KFCVLQWDAESSELITRAMGDVSDRIGRPT 63
I+GR+ ++ E L + T+ K + ++ E+ EL + D+ DR R
Sbjct: 56 IWGRVLSVCAVPAEAEGVSNLLVLTDHPYPKLILFAFNGETRELDEKWFADLHDRNARHA 115
Query: 64 DNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYG---C 120
+ ++ P R + Y G KV+ F KG + + F + L EL +L + FLY
Sbjct: 116 EYLNDIVVHPAGRAAAVSCYAGKLKVVTF-KKGGVDKHFEVILPELNLLALSFLYSEPNA 174
Query: 121 AKPTIVVLYQDNK 133
T+ +++ D+K
Sbjct: 175 HTHTLAIMHIDHK 187
>gi|299743623|ref|XP_001835883.2| hypothetical protein CC1G_02971 [Coprinopsis cinerea okayama7#130]
gi|298405741|gb|EAU85948.2| hypothetical protein CC1G_02971 [Coprinopsis cinerea okayama7#130]
Length = 1282
Score = 277 bits (708), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 273/980 (27%), Positives = 441/980 (45%), Gaps = 178/980 (18%)
Query: 212 DGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDS 271
D SR+L+ D G L LL + + K G+ + LG TS ++++YL N V+++GS GDS
Sbjct: 326 DDSRFLISDVCGGLSLLGV--DNVKTNGMTLLPLGMTSPPTSLTYLTNQVIFVGSHLGDS 383
Query: 272 QLIK------------LNLQPDAK------------------------------------ 283
QL++ L++ P+ K
Sbjct: 384 QLVQVSSTPNNQDGPMLDILPEIKTVARNMPAPSRDKGKGRASDEAMDVDDEDDDDISRG 443
Query: 284 ------GSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGIN 337
GS++ VL + N+ PI D VVD+E GQ ++VTCSG Y GSL IVR+G +
Sbjct: 444 RIIKPEGSHLHVLHSFKNIAPINDAVVVDVEGNGQNEIVTCSGGYTSGSLNIVRSGAEYH 503
Query: 338 EQASVE-LQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDE--------LEETE 388
E A++ + + +W+++S+ +D + +V S + R L ++D+ L+ T
Sbjct: 504 EAATLPGVCNVNSLWTIKSNFEDTIHSHIVAS--THDRTLLFRIKDDGRNTTFTLLDSTA 561
Query: 389 IEGFCSQTQTLFCHD----------AIY---NQLVQVTSGSVRLVSST----SRELRNEW 431
F + T+ + ++Y N +VQVT V L+ R W
Sbjct: 562 ARDFITDQPTVALANVRKRVSVERKSVYRDCNWVVQVTDNVVNLLEHDVVLGGFNKRASW 621
Query: 432 KSPPGYS-----VNVATANASQVLLATGGGHLVYLEIGD-GILTE-VKHAQLEYEISCLD 484
P + + A N +QV+LA GG LV L + G E V EIS +
Sbjct: 622 SPPSSVAPRPVEIVAADINPTQVVLALSGGRLVVLRHNEEGTAFELVAEKNTLREISAVS 681
Query: 485 INPIGENPSYSQIAAVGMWTDIS---------VRIFSL----------PDLNL---ITKE 522
Y+++ VG W ++ V I L P L ++K+
Sbjct: 682 CQAADTKTPYTKVFLVGYWEQVAEADTDRDTVVEILELERRSNPGSGTPSLTCLVKVSKK 741
Query: 523 HLGGEIIPRSVLLCAFEGI--------------SYLLCALGDGHLLNFLLNMKTG-ELTD 567
++ +PRS+LL +F G+ ++L C L DG + +F++ G +TD
Sbjct: 742 YVPA--LPRSLLLYSF-GVPDSEPTNLKPNSQPTHLFCGLADGSVAHFVVWKDGGLNVTD 798
Query: 568 RKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSA 627
K V LGT P+ V A +R ++ K++ +S V LK+++ P N+
Sbjct: 799 SKIVPLGTTPVKFSACVVDGKRCVLAVGNRASIFSYERKRMAHSPVMLKDLNAAYPLNTH 858
Query: 628 AFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG-EHPRRICHQEQSRTFAI-CSLKNQSC 685
FP S +A LTIG++ +I K+ IR+IPLG ++P+RI H R +A+ C+
Sbjct: 859 TFPTSFILANHQGLTIGSVKEIGKISIRTIPLGYDNPQRIVHIPLLRAYAVACATYTPVR 918
Query: 686 AEESEMH--FVRLLDDQTFEFISTYPLDTFEYGCSILSCS--FSDDSNVYYCVGTAYVLP 741
++E ++LLDD TF+ +S Y D+ E ++ + + S VGT+
Sbjct: 919 VGDAEAFKGSLKLLDDLTFKQLSQYNCDSDEVISALTTFTEEISGKETPLLVVGTS---- 974
Query: 742 EENEPTKGRILVFIVEDG----KLQLIAEKETKGA-VYSLNAFNGKLLAAINQKIQLYKW 796
++ R+LVF V +L LI E G V SL +LAA++ + YK+
Sbjct: 975 ---SSSQARLLVFSVASSEACQELTLITSLEVNGQHVNSLCVMGNYVLAAVDCAVFSYKF 1031
Query: 797 ----MLRDDGTRELQSECGHHGHILALYVQTRGDF---IVVGDLMKSISLLIYKHEEGAI 849
D + EL+ E G H V++ G F +V+GD+ S+SL+ +G
Sbjct: 1032 KGSSDDTDSQSSELK-EVGEWNH--NYIVRSLGSFNNSLVIGDIASSVSLV--NVNKGQF 1086
Query: 850 EERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGE 909
ARDY + A+E L ++ +G + NLFT S G R LE G + LG+
Sbjct: 1087 TPIARDYAPLFPYALEALSENALIGGNDASNLFTF---SLGQGGMGRKVLERDGSFFLGD 1143
Query: 910 FVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQY-LFLEKLQTNLRKVI 968
+F GS+ + + P IF T +G IG + + +Q L L LQ NL ++
Sbjct: 1144 LATKFIRGSITTDYTAIEALE-PIAIFFTGSGRIGAVIDIKDQQLALHLSGLQRNLSALV 1202
Query: 969 KGVGGLNHEQWRSFNNEKKTVDAK----NFLDGDLIESFLDL--SRTRMDEI------SK 1016
+GVG H ++R+ N + DA+ F+DGD +E+FL + + +D++ +
Sbjct: 1203 QGVGASTHTKFRAPRNNRGRTDAEAAGFGFIDGDFVETFLGMLGDKGMVDKVMTGQSAPE 1262
Query: 1017 TMNVSVEELCKRVEELTRLH 1036
+ SV+E K +E L LH
Sbjct: 1263 KLEFSVDEYQKTLETLQGLH 1282
>gi|343429611|emb|CBQ73184.1| related to UV-damaged DNA-binding protein [Sporisorium reilianum
SRZ2]
Length = 1505
Score = 275 bits (703), Expect = 9e-71, Method: Compositional matrix adjust.
Identities = 246/978 (25%), Positives = 426/978 (43%), Gaps = 174/978 (17%)
Query: 217 LLGDHAGLLHLLVITHEKE---------KVTGLKIELLGETSIAS---TISYLDNAVVYI 264
L +G L+LL IT + +++E LG TS + +SYL + ++ +
Sbjct: 515 LFACSSGALNLLRITMPSGGQITPSSPLQPKSMRVETLGTTSQPAGPQALSYLGDGLICV 574
Query: 265 GSSYGDSQLIK---------LNLQPDAK-------------------------------G 284
GS+ GDS L K + L P +
Sbjct: 575 GSATGDSCLYKILQQGAEQDMPLSPKEQLLTPPSSPTQSRRRRSSQVVGASLDNTELPTA 634
Query: 285 SYVEVLERYVNLGPIVDFCVVD-------LERQGQGQVVTCSGAYKDGSLRIVRNGIGIN 337
+ V+E + NLGP+V+F V D Q ++VTCSGA GS+R R+G +
Sbjct: 635 GSLAVVETWQNLGPVVEFVVDDGAGGDPTCSSGAQARIVTCSGAGPSGSIREARSGASVQ 694
Query: 338 EQASVELQGIKGMWSLRSSTD-DPFDTFLVVSFISETRILAMNLEDELEE---------T 387
+ +S+ + + +W + + D + L++ F + T L + + +L + T
Sbjct: 695 DVSSLSIPNAQQIWPVHAGNDASKYSVGLLLGFATSTAYLHFDAKGDLADATDRLAATGT 754
Query: 388 EIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVS--STSRELRNEWKSPPGYSVNVATAN 445
++ T++ D QL+++T LVS S L ++W P G V A+AN
Sbjct: 755 DLTLPTLAASTVWSSDH-QPQLLRITRSEACLVSLNDESASLLHQWVPPKGLEVTAASAN 813
Query: 446 A-SQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWT 504
QV+LA L++L + G L E QLE+E+SC+D++P+ +Q+AA G W
Sbjct: 814 VHGQVVLALSDKTLLHLSVESGALVEKGKVQLEHEVSCVDVSPLVAG-KAAQLAACGFWQ 872
Query: 505 DISVRIFSLPDLNLITKEHLGGE---IIPRSVLLCAFEG--------------------- 540
+++I+SLP+L + + + + +PRS+LL F
Sbjct: 873 TRTIQIYSLPELAAVGQSSVVQQRFPAVPRSILLHRFASKQTDGGSDTQSKGSLSNRDAL 932
Query: 541 ISYLLCALGDGHLLNFLLNM-------KTGELTDRKKVSLGTQPITLRTFSSKNTTHVFA 593
+LL LGDG L+++ L++ KT L+D K VSLGTQ + L + V A
Sbjct: 933 TPHLLIGLGDGTLVSYSLSLPTHDSYSKTVGLSDAKTVSLGTQALKLDALETAAGARVVA 992
Query: 594 AS-DRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
S RPT++Y+ +K+ Y+ + K+ + ++ A A + + +I +++
Sbjct: 993 VSGSRPTLVYADSKRFSYNALKYKDQRTVATLHAGQERVFAAFALSDSVELASIGALRQR 1052
Query: 653 HIRSIPLG-EHPRRICHQEQSRTFAICSL----KNQSCAEESEMHFVRLLDDQTFEFIST 707
IR+ PLG P I FA+C+ + + + VR+LD TFE +
Sbjct: 1053 DIRTFPLGLNQPLAITQWANRGVFAVCTWAFLPRGTASKSDGPRGAVRILDQTTFETLDE 1112
Query: 708 YPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIV---------ED 758
L+ E I + VGT +V + +E T+GR++ F V E
Sbjct: 1113 IRLEPDERPNCI--TVLQAQGHEILVVGTGFVSEQASETTQGRLVGFDVSNGSSRTKEER 1170
Query: 759 GKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWM-LRDDGTRELQSECGHHGHIL 817
G+L+ + E G VYS+ + N +L AA+N ++++Y + R + G
Sbjct: 1171 GRLRKLFEHSETGNVYSVQSINNRLAAAVNSEVKIYSVVDPRPSEVPAPRIRVRQRGSWA 1230
Query: 818 ALYVQT-----RGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDD-- 870
++ D IVVGD ++S+++L + E ARD + W SA ++LDD+
Sbjct: 1231 CSFIACNLSVIEPDRIVVGDALRSMNVLHVHPYTARLTELARDCDPFWTSATDLLDDESQ 1290
Query: 871 IYLGAENNFNLFTVR------------------------------KNSEGA--TDEERGR 898
Y+GA+ +FNL+T + + + G D
Sbjct: 1291 TYIGADISFNLYTTQRVPLSEEVKARIRRARERESERTVVQTIDPRTTRGPDMVDRYAHV 1350
Query: 899 LEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLE 958
++ +H G+ +N+F SLV P S+ P ++F T G IG++A + ++ L
Sbjct: 1351 MQRNAVWHYGDMINKFCRRSLVPD-PGSNAAVRPRLLFCTAAGAIGIVAHVRDDEAHLLA 1409
Query: 959 KLQTNLRKVIKG---------VGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLD--LS 1007
K++ N+ +++ VG + H WR+ + + FLD ++++ F+D L
Sbjct: 1410 KVERNILSLLESSSTAAAAGVVGNIAHSDWRTLRTDHRVQAPAGFLDANVLQMFVDGRLD 1469
Query: 1008 RTRMDEISKTMNVSVEEL 1025
R + D++ N E L
Sbjct: 1470 RQQRDKVLAGPNSETEAL 1487
>gi|409078983|gb|EKM79345.1| hypothetical protein AGABI1DRAFT_39860 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 1236
Score = 275 bits (702), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 254/925 (27%), Positives = 404/925 (43%), Gaps = 160/925 (17%)
Query: 199 RPSITKAY--GRVDA----DG-SRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIA 251
RP T A+ + A DG SR++LGD G +++++ + GL + LGE S
Sbjct: 307 RPKATVAWPWSEISAWCCIDGLSRFILGDAYG--KMVLLSLDNLSDLGLLLIPLGEVSPP 364
Query: 252 STISYLDNAVVYIGSSYGDSQLIKL----------------------------------- 276
+T++YL N V+Y+GS GDSQL++L
Sbjct: 365 TTLTYLTNQVIYVGSHLGDSQLVQLTSTPTTSTDKPTLPIPCDVHVIPSSSFDISAFKKG 424
Query: 277 ------------------NLQPDAKGSYVEVLERYV-------NLGPIVDFCVVDLERQG 311
+ P G+ VE Y+ N+ PI+D C+VD + G
Sbjct: 425 KARATSPEFDAMDLDGCDTVSPQESGNIVETRGSYLNVLERFKNIAPILDACLVDPD-SG 483
Query: 312 QGQVVTCSGAYKDGSLRIVRNGIGINEQASVE-LQGIKGMWSLRSSTDDPFDTFLVVSFI 370
Q Q+VTCSG GS+ VRNG E ++ + + +W+L+S ++ D+FL++SF
Sbjct: 484 QRQIVTCSGGKNTGSINAVRNGADFEEIVNIPGVPHVLKIWALKSRLEEAEDSFLLLSFC 543
Query: 371 SETRILAMN--------------LEDELEETEIE-GFCSQTQTLFCHDA----IYNQLVQ 411
+ + ++ +N + L E F + Q + D + + LV
Sbjct: 544 NSSSLIKINDSGDNISFAPFDNAIASGLVTVEPTIAFANVAQRVKGQDGKARYMNSSLVV 603
Query: 412 VTSGSVRLVSSTSRELR-----NEWKSPPGYS------VNVATANASQVLLATGGGHLVY 460
+GS + + L+ +EW + + + A+ N+SQV LA GG L
Sbjct: 604 QVTGSGAFLLELDQGLQTYIRIDEWNAKKNVADGITPEIVAASINSSQVALAISGGKLAL 663
Query: 461 LEIGDG-----ILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPD 515
L + + +++ + + EIS + +P+ ++SQ V W + +F+ D
Sbjct: 664 LSVAEDKKLRVVVSTLNDCNRQPEISAISCSPLNPARAFSQHIVVSYWESNIIEVFTPSD 723
Query: 516 LNLITKEHLGGEIIPRSVLLCAFEGI------------SYLLCALGDGHLLNFLLNMKTG 563
K +P V F YLL L DG + F +
Sbjct: 724 SGF--KSVCKSSPLPSLVSSLRFYNFGSDQSSKGADYHPYLLAGLSDGSVATF--RWQDQ 779
Query: 564 ELTDRKKVSLGTQPITLRTFSS-KNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMC 622
EL +RK VSLG P++L S + V AA DR V +L+YS + L+ ++
Sbjct: 780 ELKERKIVSLGHAPVSLAVCSILGDAKTVLAAGDRAVVFAYERGRLVYSPILLRNIAAAS 839
Query: 623 PFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG-EHPRRICHQEQSRTFAICSLK 681
P N+++ P SL A L IG I D+ KLHIRSIP G ++PR+I HQ F + +
Sbjct: 840 PLNTSSLPMSLVAATAEGLHIGKIKDLNKLHIRSIPFGLDNPRKITHQPLLSAFGVALVY 899
Query: 682 NQSC---AEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDS--NVYYCVGT 736
+ A E F LLDD +FE ++ + D E SI+ S D ++ +GT
Sbjct: 900 TEPAEIGAHEVRKSFFSLLDDASFEVLAKFDCDADEEIVSIIPFSAVVDGILTPFFVLGT 959
Query: 737 AYVLPEENEPTKGRILVFIV----------EDGKLQLIAEKETKGAVYSLNAF-----NG 781
PEE EP KGRIL+ + + +L L A E +G VY+L
Sbjct: 960 FIFRPEEFEPDKGRILLISISTTNNPRNPRQGYQLSLAASIEVRGCVYALTPIVDDKPVA 1019
Query: 782 KLLAAINQKIQLYKWMLRDDGTR------ELQSECGHHGHILALYVQTRGDFIVVGDLMK 835
+++AA+N + L+ D T+ L+ + + L + G+ + VGD +
Sbjct: 1020 RIVAAVNSSVNLFSL---DIDTKIYPAGLHLRKMAEWNHNYLVTGLGAVGNHVFVGDQIS 1076
Query: 836 SISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEE 895
S+SLL K E + ARDY W +VE +D+ + A + N+FT + +
Sbjct: 1077 SVSLL--KCTEEKFQTVARDYGPRWPVSVEAIDEKNVIAANDALNIFTFTL----SRNLG 1130
Query: 896 RGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQY- 954
R L+ G YH+ + V +F GSLV P +F T +G IGVI ++ +E
Sbjct: 1131 RSVLDCTGNYHIADLVTKFIRGSLVTIEQSEKADLEPEELFFTSSGCIGVIVNVKNEDVS 1190
Query: 955 LFLEKLQTNLRKVIKGVGGLNHEQW 979
+ L LQ N+ +I VGG +H ++
Sbjct: 1191 IHLSGLQRNMSAIIPNVGGTSHARY 1215
>gi|426195893|gb|EKV45822.1| hypothetical protein AGABI2DRAFT_72896 [Agaricus bisporus var.
bisporus H97]
Length = 1229
Score = 273 bits (699), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 246/916 (26%), Positives = 396/916 (43%), Gaps = 174/916 (18%)
Query: 212 DG-SRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGD 270
DG SR++LGD G +++++ + GL + LGE S +T++YL N V+Y+GS GD
Sbjct: 319 DGFSRFILGDAYG--KMVLLSLDNLSDLGLLLIPLGEVSPPTTLTYLTNQVIYVGSHLGD 376
Query: 271 SQLIKL-----------------------------------------------------N 277
SQL++L
Sbjct: 377 SQLVQLTSTPTTSTDKPTLPIPCDVHVIPSSSFDISAFKKGKARATSPEFDAMDLDGSDT 436
Query: 278 LQPDAKGSYVEVLERYV-------NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIV 330
+ P G+ VE Y+ N+ PI+D C+VD + GQ Q+VTCSG GS+ V
Sbjct: 437 VSPQESGNIVETRGSYLNVLERFKNIAPILDACLVDPD-SGQRQIVTCSGGKNTGSINAV 495
Query: 331 RNGIGINEQASVE-LQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMN---------- 379
RNG E ++ + + +W+++S ++ D+FL++SF + + ++ +N
Sbjct: 496 RNGADFEEIVNIPGVPHVLKIWAVKSRLEEAEDSFLLLSFCNSSSLIKINDSGDNISFAP 555
Query: 380 ----LEDELEETEIE-GFCSQTQTLFCHDA----IYNQLVQVTSGSVRLVSSTSRELR-- 428
+ L E F + Q + D + + LV +GS + + L+
Sbjct: 556 FDNAIASGLVTVEPTIAFANVAQRVKGQDGKARYMNSSLVVQVTGSGAFLLELDQGLQTY 615
Query: 429 ---NEWKSPPGYS------VNVATANASQVLLATGGGHLVYLEIGDG-----ILTEVKHA 474
+EW + + + A+ N+SQV LA GG L L + + +++ +
Sbjct: 616 IRIDEWNAKKNVADGITPEIVAASINSSQVALAISGGKLALLSVAEDKKLRVVVSTLNDC 675
Query: 475 QLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLIT-------------- 520
+ EIS + +P+ ++SQ V W + +F+ D +
Sbjct: 676 NRQPEISAISCSPLNPARAFSQHIVVSYWESNIIEVFTPSDSGFRSVCKSSPLPSLVSSL 735
Query: 521 --------KEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVS 572
+ G + P YLL L DG + F + EL +RK VS
Sbjct: 736 RFYNFGSDQSSKGADYHP------------YLLAGLSDGSVATF--RWQDQELKERKIVS 781
Query: 573 LGTQPITLRTFSS-KNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPD 631
LG P++L S + V AA DR V +L+YS + L+ ++ P N+++ P
Sbjct: 782 LGHAPVSLAVCSILGDAKTVLAAGDRAVVFAYERGRLVYSPILLRNIAAASPLNTSSLPM 841
Query: 632 SLAIAKEGELTIGTIDDIQKLHIRSIPLG-EHPRRICHQEQSRTFAICSLKNQSC---AE 687
SL A L IG I D+ KLHIRSIP G ++PR+I HQ F + + + A
Sbjct: 842 SLVAATAEGLHIGKIKDLNKLHIRSIPFGLDNPRKITHQPLLNAFGVAFVYTEPAEIGAH 901
Query: 688 ESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDS--NVYYCVGTAYVLPEENE 745
E F LLDD +FE ++ + D E SI+ S D ++ +GT PEE E
Sbjct: 902 EVRKSFFSLLDDASFEVLAKFDCDADEEIVSIIPFSAVVDGILTPFFVLGTFIFRPEEFE 961
Query: 746 PTKGRILVFIV----------EDGKLQLIAEKETKGAVYSLNAF-----NGKLLAAINQK 790
P KGRIL+ + + +L L A E +G VY+L +++AA+N
Sbjct: 962 PDKGRILLISISTTNNPRNPRQGYQLSLAASIEVRGCVYALTPIVDDKPVARIVAAVNSS 1021
Query: 791 IQLYKWMLRDDGTR------ELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKH 844
+ L+ D T+ L+ + + L + G + VGD + S+SLL K
Sbjct: 1022 VNLFSL---DIDTKIYPAGLHLRKMAEWNHNYLVTGLGAVGSHVFVGDQISSVSLL--KC 1076
Query: 845 EEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGE 904
E + ARDY W +VE +D+ + A + N+FT + + R L+ G
Sbjct: 1077 TEEKFQTVARDYGPRWPVSVEAIDEKNVIAANDALNIFTFTL----SRNLGRSVLDCTGN 1132
Query: 905 YHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQY-LFLEKLQTN 963
YH+ + V +F GSLV P +F T +G IGVI ++ +E + L LQ N
Sbjct: 1133 YHIADLVTKFIRGSLVTIEQSEKADLEPEELFFTSSGCIGVIVNVKNEDVSIHLSGLQRN 1192
Query: 964 LRKVIKGVGGLNHEQW 979
+ +I VGG +H ++
Sbjct: 1193 MSAIIPNVGGTSHARY 1208
>gi|224104491|ref|XP_002333932.1| predicted protein [Populus trichocarpa]
gi|222839243|gb|EEE77594.1| predicted protein [Populus trichocarpa]
Length = 221
Score = 270 bits (689), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 147/223 (65%), Positives = 163/223 (73%), Gaps = 28/223 (12%)
Query: 52 MGDVSDRIGRPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQV 111
M DVS RIGRPT+NGQIGIIDPDCRLIGLHLYDGLFKVIP DNKGQLK+AF++R E L
Sbjct: 1 MEDVSGRIGRPTNNGQIGIIDPDCRLIGLHLYDGLFKVIPSDNKGQLKQAFSMRHENLDG 60
Query: 112 LDIKFLYGCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVP 171
I FL QDNKDAR+VKTYEVALKDKDF+EG WSQNNLDNGA+LLIPVP
Sbjct: 61 FFILFLS----------LQDNKDARYVKTYEVALKDKDFIEGRWSQNNLDNGANLLIPVP 110
Query: 172 PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVIT 231
P CGVLIIGEET+VYC+AN F+AIPI+P L H L L+V+
Sbjct: 111 LPFCGVLIIGEETVVYCNANVFRAIPIKPM---------GHWWNCLSPKHMEELMLMVL- 160
Query: 232 HEKEKVTGLKIELLGETSIAST-ISYLDNAVVYIGSSYGDSQL 273
GLKI+LLGETSIAST ISYLDNA V+IGSSY DSQ+
Sbjct: 161 -------GLKIKLLGETSIASTIISYLDNASVFIGSSYKDSQV 196
>gi|443894313|dbj|GAC71661.1| hypothetical protein PANT_5d00006 [Pseudozyma antarctica T-34]
Length = 1625
Score = 265 bits (677), Expect = 9e-68, Method: Compositional matrix adjust.
Identities = 257/979 (26%), Positives = 425/979 (43%), Gaps = 196/979 (20%)
Query: 217 LLGDHAGLLHLLVITHEKEKVT----------GLKIELLGETSIAS---TISYLDNAVVY 263
L +G L++L +T E V+ +K+ G +S + ISYL VV
Sbjct: 633 LFACSSGALNVLTLTLPAEAVSESSCIPIQPRSMKVATFGYSSQPAGPQAISYLGEGVVC 692
Query: 264 IGSSYGDS-------------------QLIKLNLQPDAK----GSY-------------- 286
IGSS GD+ QLI P GS+
Sbjct: 693 IGSSTGDTLLHQVVPRDVDEAADPTGAQLISPPSSPTQSRRRSGSFQLASSSQGRTELPS 752
Query: 287 ---VEVLERYVNLGPIVDFCVVD-------LERQGQGQVVTCSGAYKDGSLRIVRNGIGI 336
+ +E + NLGPIVDF V D Q ++VTCSGA GS+R VR G +
Sbjct: 753 AGTLHTIESWQNLGPIVDFVVDDGAGGDPSASSGAQARIVTCSGAGPSGSIREVRTGASV 812
Query: 337 NEQASVELQGIKGMWSLRS-STDDPFDTFLVVSFISETRILAMNLEDELEET--EIEGFC 393
E ++ + + +W + + T + ++V F + T L +++ L + + C
Sbjct: 813 QEVCTLPIPNAQHIWPVHAGPTGSKQTSGILVGFATSTAYLYFDVDGNLVDATDRLAAVC 872
Query: 394 SQTQ--TLFCHDAIYNQ----LVQV--TSGSVRLVSSTSRELRNEWKSPPGYSVNVATAN 445
T TL + L++V T S+ S L ++W P G + A+AN
Sbjct: 873 RSTANPTLAASSLVDESGSALLLRVAHTDFSLFRCQGGSIALLHQWSPPSGCEITTASAN 932
Query: 446 -ASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWT 504
Q ++A LVYL LTE LE+E+SCLDI+ + +QIAA+G W
Sbjct: 933 DLGQCVIALSDKSLVYLTAEAESLTERGKTSLEHEVSCLDISCLSPG-EPAQIAAIGFWQ 991
Query: 505 DISVRIFSLPDLNLITKEHLGGE---IIPRSVLLCAF------EGIS------------- 542
+++I SLPDL+ + + + + +PRS+LL F +G S
Sbjct: 992 ARTIQILSLPDLSPVGQSSVVQQQLPAVPRSLLLHRFASKQTRQGASRQSRTSLRNRDAL 1051
Query: 543 --YLLCALGDGHLLNFLLNMKTGE-------LTDRKKVSLGTQPITLRTFSSKNTTHVFA 593
+LL LGDG L+ + +++ T E L D K +SLGTQ + L + + V A
Sbjct: 1052 SPHLLIGLGDGTLITYSVSLPTDESFSRTVGLFDCKTLSLGTQALKLDAIETSSGARVVA 1111
Query: 594 AS-DRPTVIYSSNKKLLYSNVNLKEVSHMCP----------FNSAAFPDSLAIAKEGELT 642
S RPT++++ +K+ Y+ L+ C N+ + DS+ +A G L
Sbjct: 1112 VSGSRPTLVFADSKRFSYNA--LQHGDQRCVATLYVGAEQVLNAFSLGDSVKLASVGTL- 1168
Query: 643 IGTIDDIQKLHIRSIPLG-EHPRRICHQEQSRTFAICSL----KNQSCAEESEMHFVRLL 697
Q+ +R+ PLG + P I + FA+C+ + + + VR+L
Sbjct: 1169 -------QQRDVRTFPLGLDQPLAIAQWPARQAFAVCTWAFLPRGSASGGDKPRGAVRIL 1221
Query: 698 DDQTFEFISTYPLDTFEY-GCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIV 756
D TFE + L+ E C + ++ + +GT Y+ + E GR++ F V
Sbjct: 1222 DQTTFEVLDEIRLEPDERPNCIAVLHAYGTE---MLVIGTGYIDSQSQETVSGRLVGFDV 1278
Query: 757 ---------EDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWM--LRDDGTR- 804
E G+L+ + E + G VYS+ + +L AA+N ++++Y + R D +
Sbjct: 1279 SPGSSRTKEERGRLRRLFEHDENGNVYSVQSIGNRLAAAVNSEVKIYSVIDPRRGDASSP 1338
Query: 805 --ELQSECGHHGHILALYVQT-RGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWM 861
+++ +A + D IVVGD ++S+++L + + E ARD + W
Sbjct: 1339 KIKIKQRGSWASSFIACSLSVVEPDRIVVGDALRSMNVLHVHPQTARVSEIARDCDPFWT 1398
Query: 862 SAVEILDD--DIYLGAENNFNLFTVRK--------------------NSEGATDEERGR- 898
SA E+LDD Y+GA+ +FNL+T ++ + A+D + G
Sbjct: 1399 SATELLDDASQTYIGADISFNLYTTQRVLLSDEVKTRMRRAREREQERTVSASDSQGGGR 1458
Query: 899 ---------LEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASL 949
++ +H G+ +++F SLV ++ P +IF T G IGVIA +
Sbjct: 1459 DDGDQYAHVMQRSAVWHYGDMISKFCRKSLVPSA-GANAAIQPKLIFCTAAGAIGVIAHV 1517
Query: 950 PHEQYLFLEKLQTNLRKVIKG------------VGGLNHEQWRSFNNEKKTVDAKNFLDG 997
E+ L K++ N +I+ VGG+ HE+WR+ + + FLD
Sbjct: 1518 RDEEAQLLAKVERNTLSLIESGGGESNAASAGVVGGIRHEEWRTLRTDHRVQSPAGFLDA 1577
Query: 998 DLIESFLD--LSRTRMDEI 1014
+++ F+D L R + D I
Sbjct: 1578 GVLKMFIDGRLDRAQRDRI 1596
>gi|402582698|gb|EJW76643.1| CPSF A subunit region family protein [Wuchereria bancrofti]
Length = 418
Score = 264 bits (674), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 147/352 (41%), Positives = 206/352 (58%), Gaps = 46/352 (13%)
Query: 470 EVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEII 529
E+K A Y I +D G+ S+I AVG WTD+SV + +LP L + +E + G+++
Sbjct: 56 EIKGAA--YSIQSMD----GKGTLRSEICAVGYWTDLSVALRALPQLVEVVREKIAGDML 109
Query: 530 PRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTT 589
RS++L EG YLL ALGDG + F ++MKTG L D KK +LGTQPI LR F S+ +
Sbjct: 110 SRSIMLSPMEGHVYLLVALGDGTVHYFQIDMKTGALLDPKKATLGTQPIHLRKFRSRCSP 169
Query: 590 --HVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTID 647
++F SDRP VIYSSN+KLL+SNVNL+ VS M P + A+PD+L + L IG ID
Sbjct: 170 VHNIFVCSDRPAVIYSSNQKLLFSNVNLRMVSTMTPLYAEAYPDALVLTDGHSLVIGRID 229
Query: 648 DIQKLHIRSIPLGEHPRRICHQEQSRTFAI------------------CSLKN------- 682
DIQKLHIR++PLGE P RI +Q ++ T A+ C+ KN
Sbjct: 230 DIQKLHIRTVPLGESPSRIAYQPETNTIAVIVERLEFVDAMGKHHFGQCASKNAMETSSS 289
Query: 683 -----------QSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVY 731
+ AEE E+ V LLD TFE + ++ L+ E S+ SC +DS Y
Sbjct: 290 RLSSMRREPTPECLAEEMEVSSVLLLDSNTFEILHSHELEGSEMAMSLTSCQLGNDSQPY 349
Query: 732 YCVGTAYVLPEENEPTKGRILVFIVEDG--KLQLIAEKETKGAVYSLNAFNG 781
+ VGTA ++ +E E GRI++F +G +++L+ EKE KGA YS+ + +G
Sbjct: 350 FVVGTAVIMSDETESKMGRIMMFQASEGPERMRLVYEKEIKGAAYSIQSMDG 401
Score = 63.5 bits (153), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 28/67 (41%), Positives = 44/67 (65%), Gaps = 2/67 (2%)
Query: 718 SILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDG--KLQLIAEKETKGAVYS 775
S+ SC +DS Y+ VGTA ++ +E E GRI++F +G +++L+ EKE KGA YS
Sbjct: 4 SLTSCQLGNDSQPYFVVGTAVIMSDETESKMGRIMMFQASEGPERMRLVYEKEIKGAAYS 63
Query: 776 LNAFNGK 782
+ + +GK
Sbjct: 64 IQSMDGK 70
>gi|353235645|emb|CCA67655.1| hypothetical protein PIIN_01483 [Piriformospora indica DSM 11827]
Length = 1145
Score = 262 bits (670), Expect = 6e-67, Method: Compositional matrix adjust.
Identities = 273/1051 (25%), Positives = 465/1051 (44%), Gaps = 107/1051 (10%)
Query: 37 VLQWDAESSELITRAMGDVSDRIGRPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNK- 95
VL++ + L+ + + GRP I PD R + L+ G ++ F +
Sbjct: 84 VLEYSPKQRNLVVSSSQKLVLPYGRPLSGCVNCAISPDGRHAAIALFQGTMTILSFTREK 143
Query: 96 --GQLKEAFNIRLEELQVLDIKFLYGCAKPTIVV--LYQDNKDARHVKTYEVALKDKDF- 150
++ + + EL +L FL A+ I + L++ +RH+ T V L D
Sbjct: 144 GVARIARKVDSAIRELLLLSFAFLPLPAQDGIAISLLHKSYTGSRHISTRGVDLDTLDIS 203
Query: 151 VEGPWSQNNLDNGADLLIPVPPPLC---GVLIIGEETIVYCSANAFKAI--PIR------ 199
EG + ++ +I VP G+L+ G + ++ A+ + P +
Sbjct: 204 AEGSSIELREEDEPTRMICVPRAASKPGGLLLFGPDCTIFYQADKLHKMQSPAKSRRTST 263
Query: 200 --PSITKAYGRVDADG------SRYLLGDHAGLLHLLVITHEKEKVTG--LKIELLGETS 249
P + + + D G SR LL D G L LL + ++ +K+ + I LLGE S
Sbjct: 264 RQPDLYEEWKYSDVSGYGFIDESRLLLSDKYGKLVLLALDNDPKKIPAGAINIHLLGEAS 323
Query: 250 IASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLER 309
S I+YL+ V++IGS GDSQL+K+ +EV++ + N PI D + DL+
Sbjct: 324 AGSCIAYLNAGVIFIGSETGDSQLMKIT-----SSGKLEVIDTFSNTAPIADAVLADLDN 378
Query: 310 QGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQ-GIKGMWSLRSSTDDPFDTFLVVS 368
G VVTCSG + GSLR +R+G I E AS+E IK ++ L+ T ++++S
Sbjct: 379 TGDHVVVTCSGNGRTGSLRTIRSGANIEELASMETSIPIKNIFPLQE-TSGTSHLYMLIS 437
Query: 369 FISETRIL----AMNLEDELEETEIEGFCSQTQTLFCHD---AIYNQL---VQVTSGSVR 418
+ ET+++ A L EL + G + TL + I++ VQVT+ +
Sbjct: 438 YDQETKLVDAREAPRLS-ELSAHQFPGVAREFPTLAAGNVRRTIFDTTTLAVQVTTRAAI 496
Query: 419 LVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEY 478
L S W S+ A+ + V L GG ++ L++ V A+ +
Sbjct: 497 LFDVQSGAEYCRWSG----SITTASVSGDAVCLGLRGGKVIALKVDIEAAKLVTQAERNF 552
Query: 479 --EISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLC 536
EIS L I PI S + +G W D V+I L +L + E + P SVL
Sbjct: 553 DKEISTLSIEPIQSGEVTSNVVVIGFWEDFLVKICQLHNLAQV-GEDIETPHTPHSVLAW 611
Query: 537 AF----EGISYLLCALGDGHLLNFLLNMKTGEL--TDRKKVSLGTQPITLRTFSSKNTTH 590
F EG+ YLL G+GH+L+ L + T R+ V LG +P+ L S
Sbjct: 612 NFGDRKEGL-YLLVGTGNGHILSVKLKETKNRVLATSRRTVVLGDRPVLLHRCSIAGAEV 670
Query: 591 VFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQ 650
+ A R ++ SN ++ +VN+K + + PF S+AF D+L L+I I ++
Sbjct: 671 IMATGSRAMLLSWSNGRIAQHHVNIKNIESVAPFTSSAFGDALIFKLTKGLSIARIGKLE 730
Query: 651 KLHIRSIPLG-EHPRRICHQEQSRTFAICSLKNQSC---AEESEMHFVRLLDDQTFEFIS 706
KL I S+ LG + P + + + FA+ L+ + A ++ +L+D TFEF++
Sbjct: 731 KLKIDSVSLGYDVPNTLAYHPDIKAFAVGCLRTEPSLNGASDNIGSSFKLIDALTFEFLN 790
Query: 707 TYPLDTFEYGCSIL--SCSFSDDSNVYYCVGTAYVLPEE-NEPTKGRILVFIVEDGK--- 760
++ E ++ D Y VGTA EE +EPTKGRIL+F K
Sbjct: 791 SHSFPPNEEVTKVIVGDLVMGDKQERYIIVGTAIWEDEEGSEPTKGRILLFRASLSKGMQ 850
Query: 761 ------------LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQS 808
L L+ E++ G+ L + +L +N + +Y+ + R + L+
Sbjct: 851 VGSGAANAPPPVLTLVLEQDIPGSAVGLAVVDHRLAIIVNTIVVVYE-LRRTQTAQGLEL 909
Query: 809 ECGH---HGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVE 865
+ H + + V +V+GD ++S + L + + +E A+D+ ++++
Sbjct: 910 KAVDQWIHNYAIWSIVPAGDSRVVIGDALQSATTLRWNGTK--LEVVAKDWTT--VNSLN 965
Query: 866 ILDDDIY-LGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLP 924
+ D+ Y + ++ + NL + + E L+ G YH GE ++ GS+ R
Sbjct: 966 VTADETYVIQSDIDGNLMSYKP--------EPPILQQTGHYHFGETISCLVPGSIRSRSN 1017
Query: 925 DSDVGQIPTVIFGTVNGVIGVIAS-LPHEQYLFLEKLQTNLRKVIKG------VGGLN-- 975
+ + T G I +I L E+ + L ++ N+ +K VG
Sbjct: 1018 QNGSIVAAKHVLLTPGGRISLIQEILDEEKEMTLLAIERNMSAALKAERDHYDVGSWRAP 1077
Query: 976 HEQWRSFNNEKKTVDAKNFLDGDLIESFLDL 1006
H R ++ + + FLDGD++ L+L
Sbjct: 1078 HVNTRRVITSEEPLQSYGFLDGDILSHALEL 1108
>gi|388853409|emb|CCF53029.1| related to UV-damaged DNA-binding protein [Ustilago hordei]
Length = 1508
Score = 262 bits (670), Expect = 7e-67, Method: Compositional matrix adjust.
Identities = 260/985 (26%), Positives = 423/985 (42%), Gaps = 182/985 (18%)
Query: 217 LLGDHAGLLHLLVITHEKEKVT--------------GLKIELLGETSIAS---TISYLDN 259
L G ++G L++L +T ++ + +++E LG TS + +SYL +
Sbjct: 512 LFGCNSGALNVLQLTLPQDAASEVVPSHSAAAPQPKSMRVETLGTTSQPAGPQALSYLGD 571
Query: 260 AVVYIGSSYGDSQLIKLNLQ---------PDAK--------------------------- 283
+V +GS+ GDS L K+ Q P A+
Sbjct: 572 GLVCVGSATGDSCLYKILQQDTSEEMDTGPGAQLLSPPSSPTQSRRRRSSLAANTTPKST 631
Query: 284 ----GSYVEVLERYVNLGPIVDFCVVD-------LERQGQGQVVTCSGAYKDGSLRIVRN 332
G + +E + NLGP+VDF V D Q ++VTCSGA GS+R VR+
Sbjct: 632 ELPCGGSLVTIETWQNLGPVVDFTVDDGAGGDPTTSASAQARIVTCSGAGPSGSIREVRS 691
Query: 333 GIGINEQASVELQGIKGMWSLRSSTDDPFDTF-LVVSFISETRILAM----NLEDELEET 387
G + + +S+ + + +WS+ + T L++ F + T L NL D +
Sbjct: 692 GASVQDVSSLPIPNAQQIWSVEAGDATSKQTAGLLIGFATSTAYLHFDANGNLADATDRL 751
Query: 388 EIEGFCSQTQTLFCH---DAIYNQ-LVQVTSGSVRLVS--STSRELRNEWKSPPGYSVNV 441
G + TL DA LV+V + LV+ + L +WK G +
Sbjct: 752 SAVGVDTTLPTLTASTVFDASQGPLLVRVAKDAASLVNLQDEAATLVQQWKPSAGLEITT 811
Query: 442 ATANA-SQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAV 500
A+ N Q+++A+ L YL + +G L E QL++E+SCLD + + +Q AA
Sbjct: 812 ASVNPYGQLVIASSDKSLSYLVVEEGALIERNKIQLDHEVSCLDTSSVTAG-QAAQFAAC 870
Query: 501 GMWTDISVRIFSLPDLNLITKE---HLGGEIIPRSVLLCAFEG----------------- 540
G W S++IF+LP+L + + G +PRS+LL F
Sbjct: 871 GFWQTRSIQIFALPELAPVGESLVVQQGFAAVPRSILLHRFASKQSEASSQTKSSVGNRD 930
Query: 541 --ISYLLCALGDGHLLNFLLNMKTGE-------LTDRKKVSLGTQPITLRTFSSKNTTHV 591
+ +LL LGDG L++F L++ + L+D K VSLG Q + L S V
Sbjct: 931 ALMPHLLIGLGDGTLVSFSLSLPRDDSYSKIVGLSDCKTVSLGKQALKLDAIESWAGARV 990
Query: 592 FAAS-DRPTVIYSSNKKLLYSNVNLKEVSH--MCPFNSAAFPDSLAIAKEGELTIGTIDD 648
A S RPT++Y+ +K+ Y+ + K M + A+A+ EL +I
Sbjct: 991 VAVSGSRPTLVYADSKRFSYNALKHKSQRSVTMLHVGPGRVLGAFALAESVEL--ASIGA 1048
Query: 649 IQKLHIRSIPLG-EHPRRICHQEQSRTFAICS---LKNQSCAEESEMH-FVRLLDDQTFE 703
+++ IR+ PLG + P I + FA+C+ L S + S+ +R+LD TFE
Sbjct: 1049 LRQRDIRTFPLGLDQPLAIAQWPNRKVFAVCTWAFLPRGSATKASKSRGAIRILDQSTFE 1108
Query: 704 FISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIV------- 756
+ + L+ E I + VGT Y+ E+E GR+L F V
Sbjct: 1109 TLDEFRLEVDERPNCI--TVLRAQGHEMLVVGTGYISDGEHEVISGRLLGFDVSAGSIRG 1166
Query: 757 --EDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWM--LRDDGTRELQSECGH 812
E G+L+ + KE G VYS+ + N +L A+N ++++Y + D +
Sbjct: 1167 KEERGRLRKLFVKEQAGNVYSVQSINNRLATAVNSEVKIYSVVDPRASDEVSAPRINVVQ 1226
Query: 813 HGHILALYVQ-----TRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEIL 867
G ++ D IVVGD ++SI++L + E ARD + W SA E+L
Sbjct: 1227 RGSWACSFIACNLSVVEPDQIVVGDALRSINVLHVHPYTARLTEIARDCDPFWTSATELL 1286
Query: 868 DD--DIYLGAENNFNLFTVRK-------NSEGATDEERGRLEVVGE-------------- 904
D+ Y+GA+ +FNL+T ++ + ER VGE
Sbjct: 1287 DEASQTYIGADISFNLYTTQRVPLSEQVKARIRRTREREIERSVGEVNRLTTRDPNQVDR 1346
Query: 905 ----------YHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQY 954
+H G+ +N+F SLV P + P ++F T G IGVIA + EQ
Sbjct: 1347 YAHVMQRNAVWHYGDMINKFYRKSLVAD-PGTAAEVHPRLLFCTAAGAIGVIAHVQEEQA 1405
Query: 955 LFLEKLQTN------------LRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIES 1002
L K++ N V ++H+ WR+ + + FLD ++++
Sbjct: 1406 RLLAKVERNILSLIELSSSISTSNSSSVVTSISHQDWRTLRTDHRVQAPAGFLDANILQM 1465
Query: 1003 FLD--LSRTRMDEISKTMNVSVEEL 1025
F+D L + D++ + N +E L
Sbjct: 1466 FVDGRLDSKQRDKVVQGPNSEMEAL 1490
>gi|403177920|ref|XP_003888718.1| hypothetical protein PGTG_22575 [Puccinia graminis f. sp. tritici CRL
75-36-700-3]
gi|375173263|gb|EHS64847.1| hypothetical protein PGTG_22575 [Puccinia graminis f. sp. tritici CRL
75-36-700-3]
Length = 1326
Score = 262 bits (669), Expect = 9e-67, Method: Compositional matrix adjust.
Identities = 268/1016 (26%), Positives = 444/1016 (43%), Gaps = 175/1016 (17%)
Query: 26 LFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDNGQIGIIDPDCRLIGLHLYDG 85
L + T L++ +S+ +IT A + RP D Q I+DP R + LH +G
Sbjct: 75 LLVLTADLNLFTLRFCPKSATIITTASISLHQIGARPADYVQTSIVDPHGRCVVLHALNG 134
Query: 86 LFKVIPF---------------------------------------DNKGQLKEAFNIRL 106
+ VIP D ++ +F +RL
Sbjct: 135 ILHVIPLVPGCLSNLRNLDPVLSKRKKANVSTGLASHSRNLHPSHNDPDCEVYRSFQLRL 194
Query: 107 EEL--QVLDIKFLYGCAKPTIVVLYQDNKDARHVKTYEVALK----DKDFVEGPWSQNNL 160
E+ Q L+ L PT++V+Y ++ R +++ ++ L+ +++F G +
Sbjct: 195 NEVNVQALNFATLSPNYPPTLLVVYSNHLGDRVLRSRKIDLQAAHCEQEFFRG---YHCC 251
Query: 161 DNGADLLIPVPPP----------LCGVLIIGEET--IVYCSANAFKAIPIRPSITKA--- 205
D L+IP G ++IGEET +V + A R +I +A
Sbjct: 252 DPATALIIPFSLTGEAGEDSDGVNNGAILIGEETAQLVRFGLVQYSASDDRGNIREAEAT 311
Query: 206 ---------------------------YGRVDADGSRYLLGDHAGLLHLLVITHEKEKVT 238
+ VD + +LLGD G L L + K+ T
Sbjct: 312 AGQSNQLTEQGVIGGHVLRLPLGHYTCFCPVDGVPNCWLLGDLYGNLIFLFL---KQSTT 368
Query: 239 GLKIEL----LGETSIASTISYLDNAVVYIGSSYGDSQLIKL-----NLQPDAKGS-YVE 288
G L G + Y+ + V++ S YGDSQL+KL +L PD S E
Sbjct: 369 GTPPSLHYFHAGHVPSPEALVYITSGFVFLASHYGDSQLLKLPSPTSSLNPDNTASAQPE 428
Query: 289 VLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIK 348
V+ + NL PI DFCV + + Q+VTCSG+++DGSLR++++GIGI E S+E+ G++
Sbjct: 429 VITTFPNLAPISDFCVTEDRKSLVNQIVTCSGSHRDGSLRVIKHGIGIRESGSLEVGGVQ 488
Query: 349 GMWSLRSSTD----DPFDTFLVVSFISETRILAMNLEDELEETEI-EGFCSQTQTLFC-- 401
+W+LRSST + FD LV+S TR LA+N + +EE + GF S T+
Sbjct: 489 RLWALRSSTHVNSVEDFDDRLVLSCADCTRFLALNEDGTIEEIGLFNGFESDVPTILAGN 548
Query: 402 ----HDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGH 457
D+ +QVT S ++++ + W+ S+ A + + ++
Sbjct: 549 LLDGSDSTTRYSIQVT--SRKIIAGDAL----VWEPDDAKSITRAALSVTTCAVSL-KEQ 601
Query: 458 LVYLEIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLN 517
+V L I DG E +L +IS L I+ EN A W + I S+ +
Sbjct: 602 VVVLCIKDGKFVEKGTYKLLNDISSLAIDQ-SEN-----FVAAAQWVTNCIEIISVSSSS 655
Query: 518 LITKEHLGGEIIPRSVLLCAFEGIS----YLLCALGDGHLLNFLLNMKTG---ELTDRKK 570
I + + + + S+ + FEG LL LGDG ++N L TG E +
Sbjct: 656 TICRVNTDSDFMVNSLKMTNFEGTESDGCRLLIGLGDGKIMNVALG-PTGMHVEGDSPRF 714
Query: 571 VSLGTQPITLRTFSSKNTTHVFAASDRPTVI--YSSNKKLLYSNVNLK--EVSHMCPFNS 626
+LG +PI + + ++A SD+PT+I +N + Y+ V ++ VS ++
Sbjct: 715 TTLGIRPIEFVSMRNATGEFLWANSDQPTIIDRIQNNGRFAYTPVTVQGGSVSSATGLHA 774
Query: 627 AAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG-EHPRRICHQEQSRTFAICSLKNQSC 685
F DSL +A E+ IG ++ +K++I I LG E PRRI H E + + + + +
Sbjct: 775 RFFQDSLVLASNDEIRIGKLNTTEKMNILKISLGNEQPRRIAHSEDMKAYGVVCARLELD 834
Query: 686 AEESEMHFV---RLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPE 742
+ +H + ++ DD+TF+ + TY E G SI + ++ ++ +GT +
Sbjct: 835 QDTGTIHRIGTFKVFDDETFQLLYTYNFGPMEQGSSIAAVKLGEEMIEHFVIGTGVIKST 894
Query: 743 ENEPTKGRILVFIVEDGKLQLIAEK---------ETKGAVYSLNAF-NGKLLAAINQKIQ 792
E E T GRIL + K L A+K + GAV + NG +A+ N +
Sbjct: 895 EAEATIGRILAIRELNSKQDLTAKKRHFELTNVGKLSGAVGGVGGLPNGMFVASANAFVH 954
Query: 793 LYKWMLRD--------------------DGTRELQSECGHHGHILALYVQTRGDFIVVGD 832
+ D DG L G G ++ V T G ++VGD
Sbjct: 955 AFGLKKGDSGRAFPSGTDTVLAGSVPEMDGGFRLLDTWG--GGFVSQTVVTDGTKVLVGD 1012
Query: 833 LMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNS 888
L KS+ LL + E + +ARD++A + + + + ++ A++ FN+FTV+ +S
Sbjct: 1013 LYKSVVLLEFDLEHLELAVKARDFSAMSVRPIGAISEREFVAADSEFNMFTVQYDS 1068
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 39/117 (33%), Positives = 61/117 (52%), Gaps = 2/117 (1%)
Query: 917 GSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLN 975
GSLV R +S + +IF T G IG+IA + ++ L + Q++L K+ VG L
Sbjct: 1151 GSLVPRFIESSLIGETKLIFVTSTGGIGLIAKIHSKKKTKQLARFQSDLSKISTSVGNLA 1210
Query: 976 HEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDE-ISKTMNVSVEELCKRVEE 1031
H +R F E + + + FLDGD +E LDL+ ++ + K M + EE + E
Sbjct: 1211 HSAYRMFKTESRKIPSMGFLDGDFLEGCLDLTPDEVENLVKKMMALKAEEAQAKATE 1267
>gi|384253371|gb|EIE26846.1| hypothetical protein COCSUDRAFT_52476 [Coccomyxa subellipsoidea
C-169]
Length = 1205
Score = 260 bits (664), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 291/1171 (24%), Positives = 482/1171 (41%), Gaps = 171/1171 (14%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
++G I +L FR G ++D++ + ++ + +L++ E + R
Sbjct: 57 VFGCIRSLAPFRLTGASRDYVIVGSDSGRIVILEYIKEQNYFRKIHQETYGKSGCRRIVP 116
Query: 66 GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL-----Y 118
GQ +DP R LIG L V+ D L + LE + +I F
Sbjct: 117 GQYLAVDPKGRACLIGAMEKQKLVYVLNRDQAANL--TISSPLEAHKSHNIVFSVIGLDM 174
Query: 119 GCAKPTIVVLYQDNKDARHVKTYEVA--------LKDKDF----VEGPWSQNNLDNGADL 166
G P + D DA T E A L + D V WSQ +DNGA+L
Sbjct: 175 GFDNPVFAAIELDYADADQDPTGEAAGNAQKQLTLYELDLGLNHVTRKWSQE-VDNGANL 233
Query: 167 LIPVPPPL---CGVLIIGEETIVYCS---ANAFKAIPIRPSITKAYGRVDADGSR----- 215
L+PVP GVL+ E I+Y + A+ IP R S+ G + +
Sbjct: 234 LVPVPGGADGPGGVLVCAENFIIYSNQDHADVRAVIPRRTSLLIDRGVLIVSYAMHKQKS 293
Query: 216 ---YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDS- 271
+L+ G L+ + + + + VT LKI+ +I L ++ S +G+
Sbjct: 294 LFFFLVQSEYGDLYKVTLAYSGDTVTELKIKYFDTIPPCISICVLKTGFLFAASEFGNHA 353
Query: 272 --QLIKLNLQPDA-KGSYVEVLE------------------RYVN----LGPIVDFCVVD 306
Q + + DA + S E+ E R V+ L PI+DF V +
Sbjct: 354 LYQFLSIGDDDDAVEASSAELTETQEGFQPVFFDPRPLRNLRLVDEVESLSPIMDFKVAN 413
Query: 307 LERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTDDPFDTFL 365
L ++ Q+ G +LR++R G+ E A L G +W+L+ S +DPFD ++
Sbjct: 414 LLKEEIPQLYAMCGRGARSTLRVLRPGLAATEIAVSPLPGNPTAVWTLKRSVNDPFDAYI 473
Query: 366 VVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSR 425
+VSF + T +L++ + + E GF TL + ++QV +R + + R
Sbjct: 474 IVSFTNATLVLSIG--ETVVEVNDSGFLGTVPTLRTQLLTDDSMLQVHPSGLRHIRADRR 531
Query: 426 ELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD-GILTEVKHAQLEYEISCLD 484
NEW++P ++ A N QV++A GG L+Y E+ G L EV+ ++ ++ CLD
Sbjct: 532 V--NEWRAPGRRTIVKAATNEQQVVIALSGGELIYFELSPTGQLMEVEKKEMAGDVVCLD 589
Query: 485 INPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGG-EIIPRSVLL-------- 535
I P+ E S+ AV + D SVRI SL ++++ + +P S+L
Sbjct: 590 IAPVPEGRQRSRFLAVASY-DSSVRILSLDPEDMMSALAVQMVSAVPESLLFIDSPAADI 648
Query: 536 ------CAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTT 589
+ G +L L +G LL ++ TG+L+D + LGT+P L S +
Sbjct: 649 AGKGEDASGAGGLFLNIGLLNGVLLRTEVDKVTGQLSDTRTRFLGTRPPKLFAVSVRGKR 708
Query: 590 HVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDI 649
+ A S RP + YS + + ++ + + + F S P+ + L I T++ +
Sbjct: 709 SMLALSSRPWLGYSDMGRYTLAPLSYEALDYASGFASDQCPEGFCAVSKSMLRILTLERL 768
Query: 650 -QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSC----------------------- 685
+ + + L PR+ +S + + +
Sbjct: 769 GEAFNQQVTRLRYTPRKFVVHPESNMLIVAEADHAAVPLAERRAVEDGMEMDAALTEGIE 828
Query: 686 -----AEESEMH------------FVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSD-- 726
A E E H +R+LD + + S LD E S+ FS+
Sbjct: 829 FDEERAAEEEQHGAPKNSTGRWASCIRVLDPTSLQTSSVLELDGNEAAVSLCLLRFSNWP 888
Query: 727 DSNVYYCVGTAYVL---PEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKL 783
+ + VGT L P + R+ F +L+LI + T G +L AF G+L
Sbjct: 889 EEGMVLAVGTVQGLAFYPRTADEGYIRLYRFRDSGRQLELIHKTPTGGIPGALAAFKGRL 948
Query: 784 LAAINQKIQLYKWMLRDDGTRELQSECGHHG---HILALYVQTRGDFIVVGDLMKSISLL 840
LA + +++Y + G ++L +C H HI L T GD I VGDL +S+
Sbjct: 949 LAGVGPTLRIY-----EAGKKKLLRKCEHRKLPTHIATL--ATSGDRIFVGDLQESMHYF 1001
Query: 841 IYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERG--- 897
YK E A+ E A D ++A LD D GA+ N+F R + +T E
Sbjct: 1002 RYKANENALYEYADDIAPRHLTAALPLDYDTVAGADKFCNIFVTRLPRDVSTQVEEDPTG 1061
Query: 898 ---------------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGV 942
+LE V +H+G+ V + L G +++ TV G
Sbjct: 1062 GKFAGAAGLLNGAPHKLEDVVNFHVGDLVTSLQRAVL-------QPGGREVLLYATVMGA 1114
Query: 943 IGVIASLP-HEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIE 1001
IG + P E F L+ +LR+ +GG +H +R K+ +DGDL E
Sbjct: 1115 IGAMLPFPSREDVDFFSHLEMHLRQEHPPMGGRDHMSYRG-----SYFPVKDVIDGDLCE 1169
Query: 1002 SFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
F L + I+ + + E+ K++E++
Sbjct: 1170 HFSQLPAAKQKSIADELERTPGEILKKLEDI 1200
>gi|440300137|gb|ELP92626.1| DNA damage-binding protein, putative [Entamoeba invadens IP1]
Length = 1086
Score = 259 bits (662), Expect = 6e-66, Method: Compositional matrix adjust.
Identities = 246/1037 (23%), Positives = 492/1037 (47%), Gaps = 53/1037 (5%)
Query: 23 QDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDNGQIGIIDPDCRLIGLHL 82
+DF+ I T ++ + + + + +++ V DR+GR G +G++ + LI L+L
Sbjct: 77 RDFVIILTSNHQLTITRLEGNTLKVVFGI--SVEDRLGRKAFYGALGVVMQNKYLI-LYL 133
Query: 83 YDGLFKVI--PFDNKGQLKEAF-NIRLEELQVLDIKFLYGCAKPTIVVLYQDNKDARHVK 139
Y+ K+I P ++ L ++ ++R+++ +++ + + + L+ D + V
Sbjct: 134 YNHQIKIISLPQGDEELLSQSVKSVRIDQERIVGMHII----GDNQIALHSDQNTKKTVV 189
Query: 140 TYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIR 199
YEV K DF ++ D + G ++ ++ Y K+I +
Sbjct: 190 FYEV--KTDDFSVSNSTEYQPIENVDFFVAHNN---GFFVVQNGSLQYHYNTNTKSINVA 244
Query: 200 -PSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLD 258
P IT A D +R LL D+ G HL+ + + E+V + + SI S + YLD
Sbjct: 245 LPQITIA-TTCFLDNNRLLLCDNTGKSHLITLRLDVEQVHYYPLPF-AKLSIPSKVVYLD 302
Query: 259 NAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTC 318
N V++ GSS G+S LIK+ + +EVLE + N GPI+D + E + ++ C
Sbjct: 303 NGVLFWGSSGGNSYLIKIGEER------MEVLETFENRGPILDMITLHDEISKKDDLLIC 356
Query: 319 SGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSS----TDDPFDTFLVVSFISETR 374
S Y G+L+++R+G+G+N VE +GI+ MW + +DD + +++
Sbjct: 357 SNTYHQGTLKLLRSGVGVNILGEVEYRGIEKMWKCFENAGDMSDDYKEMYVIEGAFGSNF 416
Query: 375 ILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSP 434
+ + +++ EI + + +LV VT+G + + + ++ S
Sbjct: 417 VHIEGRKGDVQIVEIANSVTVKGRIVGVGDFDGKLVVVTNGGISIARVHDTIVEEQFVST 476
Query: 435 PGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDINPIGENPSY 494
+ V+ ++ +G + + +++++K + E EIS + + +
Sbjct: 477 EEITHCVSEGKD---VVYSGRNTVKLFDAEKCVMSDIK--RYEEEISSIGCYTVSSENTR 531
Query: 495 SQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLL 554
+ VG W + I + D + G+++ +S+ ++ LGDG ++
Sbjct: 532 ERYIGVGRWEHKEIEI--IDDQGDVLDRVYIGDVVSKSIKFVGTPSTLKVIIGLGDGRVV 589
Query: 555 NFLLNMKTGELTDR-----KKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLL 609
+ K + K + +G +++ + + +RPT++ + K+
Sbjct: 590 VSNIRKKNSTQMEEEHKVGKVIEVGMGGVSIDEMAIDGKMYEICVCERPTLMSLDDDKIK 649
Query: 610 YSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQ 669
+VN+ E + P+S+ IA + L IG+I++++ ++ +S+ LG R+
Sbjct: 650 MMSVNIGESVGFLGVHITGIPNSVLIASKESLMIGSIEEVKAINTKSLELGVFVSRVVVS 709
Query: 670 EQSRTFAICSLK-NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDS 728
R+ + S + ++ + ++E+HF+ L+D + E I LD E+G +I +
Sbjct: 710 SDGRSGVLLSSEIEETRSGKNEVHFINLIDLRKMEIIDKVRLDKDEHGMAIDVKEIEEKE 769
Query: 729 NVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFN---GKLLA 785
Y VGTAY E EP++GR ++F + + K+ ++ + GAVYS+ F G +A
Sbjct: 770 --LYIVGTAYAKLGEVEPSRGRFIIFEIHEEKIIEVSNRYVDGAVYSVKRFENDVGNYIA 827
Query: 786 AINQK----IQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLI 841
A QK Q+ + ++ ++ + G + ++ L+V+T G I+VGDLMKSIS+
Sbjct: 828 ATIQKKVVVYQIERKIVDGKFAVTIEEKGGANVKLIGLFVKTLGHEILVGDLMKSISVFK 887
Query: 842 Y--KHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRL 899
+ K A+ E RD+ A++ +AVE +D+ ++ +++ NL +N+ + E+ +L
Sbjct: 888 FDEKATRNAVVETCRDFYASYTTAVEFMDEHCFMSSDSQGNLLVFTENTTTTNENEKFKL 947
Query: 900 EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEK 959
+ H+GE +N GS+ + Q ++FG + G IG I + E Y L
Sbjct: 948 QNEAHIHVGECINVMCKGSIAVMNNAMWETQKKCMLFGGICGSIGGITEINLETYKKLFA 1007
Query: 960 LQTNLRKVIKGVGGL-NHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTM 1018
L++ + + +KGV + QW+ ++ K ++A+N +DG+++E FLDL + I++ +
Sbjct: 1008 LESEMLREMKGVIECESFGQWKMVFDDWKRMEAQNVIDGNVVELFLDLPKESQKHIAEKI 1067
Query: 1019 NVSVEELCKRVEELTRL 1035
+ EEL +E + +
Sbjct: 1068 GYAGEELVTVLESMNTI 1084
>gi|71016864|ref|XP_758932.1| hypothetical protein UM02785.1 [Ustilago maydis 521]
gi|46098463|gb|EAK83696.1| hypothetical protein UM02785.1 [Ustilago maydis 521]
Length = 1549
Score = 253 bits (646), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 254/1002 (25%), Positives = 407/1002 (40%), Gaps = 217/1002 (21%)
Query: 240 LKIELLGETSIAS---TISYLDNAVVYIGSSYGDSQLIKL----------------NLQP 280
L++E LG TS + +SYL +V +GS+ GD+ L ++ L P
Sbjct: 551 LRVEALGTTSQPAGPQALSYLGQGLVCVGSATGDNCLYRIIGQNASQKMPASSSEQVLTP 610
Query: 281 DAK------------------------GSYVEVLERYVNLGPIVDFCVVD-------LER 309
+ GS V LE + NLGP+VDF V D
Sbjct: 611 PSSPTQSRSMLSHAASSSRDHSELSTGGSLVN-LETWQNLGPVVDFVVDDGAGGDPTYAS 669
Query: 310 QGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP--FDTFLVV 367
Q ++VTCSG GS+R R+G + + AS+ + + +WS+ + DD L+V
Sbjct: 670 SAQARIVTCSGLGPTGSVREARSGASVRDIASLPIPNAEQIWSVDAGVDDASKLTIGLLV 729
Query: 368 SFISETRILAMNLEDELEETEIEGFCS---------QTQTLFCHDAIYNQLVQV--TSGS 416
F + T L N + +L + + + T+ HD QL++V T S
Sbjct: 730 GFATSTAYLHFNADGDLTDATDQLVAAGANVSLPTIAATTVLFHDR-SPQLLRVDRTGAS 788
Query: 417 VRLVSSTSRELRNEWKSPPGYSVNVATANA-SQVLLATGGGHLVYLEIGDGILTEVKHAQ 475
+ V S + ++W+ P V A+ N+ Q +LA L+YL DG L E
Sbjct: 789 LFSVQDASISILDQWRPPNELEVTSASVNSVGQAILALSDKSLLYLTDEDGALIERNKKM 848
Query: 476 LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEI--IPRSV 533
LE E+SC+DI+P+ + +Q+ A G W ++LPDL ++ + E +PRS+
Sbjct: 849 LEDEVSCVDISPLIAGKA-AQLVACGFWGRDVFDFYNLPDLEVVPR-GFAAEFSSVPRSI 906
Query: 534 LLCAFEGI---------------------SYLLCALGDGHLLNFLLNMKTGE-------- 564
LL FE +YLL LGDG L++F L +
Sbjct: 907 LLHRFESSQPEKASDAEFEFWGFNPNPLDAYLLIGLGDGTLVSFRLGGIVADGNAYVPVS 966
Query: 565 LTDRKKVSLGTQPITLRTFSSKNTTHVFAAS-DRPTVIYSSNKKLLYSNVNLKE---VSH 620
L D K ++LGTQ + L + V A S RPT++Y +K+ YS + K+ V+
Sbjct: 967 LYDAKTITLGTQALKLDAIKTSTGARVVAISGSRPTLVYHDSKRFSYSALKYKDPRSVAT 1026
Query: 621 MCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG-EHPRRICHQEQSRTFAICS 679
+C F + L + +I + + IR+ PLG P I + FA+C+
Sbjct: 1027 VCAGPGRVF---AVVVLTDSLELTSISALGQRDIRTFPLGLNQPLAIAQWADRKVFAVCT 1083
Query: 680 L----KNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVG 735
+ + +E VR+LD TFE + L+ E I + VG
Sbjct: 1084 WTFLPRGSASKQEGSRGAVRILDHTTFELLDEIRLEPDERPNCITLLDLP--GHKMLVVG 1141
Query: 736 TAYVLPEENEPTKGRILVFIV---------EDGKLQLIAEKETKGAVYSLNAFNGKLLAA 786
T YV + +E +GR++ F V E G+L+ + E G VY + + L AA
Sbjct: 1142 TGYVSKQSSETVRGRLVAFDVSTGSSRTKEERGRLRQLFECSENGNVYQVQSTRYHLCAA 1201
Query: 787 INQKIQLYKWM-------LRDDGTRELQSECGHHGHILALYVQT-----RGDFIVVGDLM 834
+N +I+ Y R G E G ++ D +VVGD +
Sbjct: 1202 VNSEIKTYSITNCKDMSDKRPSGGPRRPYEVRQEGSWACSFIACNLSAIEPDRLVVGDAL 1261
Query: 835 KSISLLIYKHEEGAIEERARDYNANWMSAVEILDDD--IYLGAENNFNLFTVRKNSEGAT 892
+S+++L G + E ARD + +W SA E+LD + Y+GA+ +FNL+T ++ +
Sbjct: 1262 RSMNVLNVDRYTGRLTEIARDCDPSWTSATELLDSETQTYIGADISFNLYTTQRMP--MS 1319
Query: 893 DEERGR-------------------------------LEVVGEYHLGEFVNRFRHGSL-V 920
+E R ++ +H G+ +N+FR V
Sbjct: 1320 EEVRTSIRRARERETERSVSLAYPRTLRDTDDCYAHVMQRKAVWHYGDMINKFRQSEFWV 1379
Query: 921 MRL------------------------------------PDSDVGQIPTVIFGTVNGVIG 944
+RL P D P ++F T G IG
Sbjct: 1380 VRLVPRHPIPSHPILSNLTLTLIFSFARRLISTASLVSDPGPDAAVRPKLVFCTAAGAIG 1439
Query: 945 VIASLPHEQYLFLEKLQTNLRKVIKG---------VGGLNHEQWRSFNNEKKTVDAKNFL 995
VIA + ++ L K++ N+ +I+ +G + H WR+ + + FL
Sbjct: 1440 VIAHVRDDEAQILAKVERNILSLIESPTEAASAGVIGNIAHSDWRTLRTDHRVQAPAGFL 1499
Query: 996 DGDLIESFLD--LSRTRMDEISKTMNVSVEELCKRVEELTRL 1035
D D+++ FLD L + + N E L R E + +L
Sbjct: 1500 DADVLQMFLDGRLDHNQRYRVVHGPNSEAEALGVRSEVVEQL 1541
>gi|328869269|gb|EGG17647.1| CPSF domain-containing protein [Dictyostelium fasciculatum]
Length = 1194
Score = 249 bits (637), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 275/1165 (23%), Positives = 503/1165 (43%), Gaps = 173/1165 (14%)
Query: 7 YGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTDN 65
+G I ++ FR G ++D+L + ++ + +L+++ S + + + R G R
Sbjct: 58 FGVIRSIAPFRLTGGSKDYLIVGSDSGRVVILEYNP-SKNVFEKVHQETFGRSGCRRIVP 116
Query: 66 GQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLY-----GC 120
GQ DP R + + V + Q K + + LE + I F G
Sbjct: 117 GQYISTDPKGRAFMIGAIEKQKLVYILNRDSQAKLSISSPLEAHKAHTIVFSMCGVDVGF 176
Query: 121 AKP---TIVVLYQDNKDARHVKT---------YEVALKDKDFVEGPWSQNNLDNGADLLI 168
P TI V Y + + V+ YE+ L + V WS+ +D A+L++
Sbjct: 177 ENPIFATISVDYSEETNIEDVEETHNTKVLTFYELDLGLNNVVR-KWSEE-VDRSANLVV 234
Query: 169 PVP-----PPLCGVLIIGEETIVYCS---ANAFKAIPIRPSITKAYGRV--------DAD 212
VP P GVL+ + + Y + A+ +IP R +T+ + D
Sbjct: 235 SVPGGSDGP--GGVLVCAQGRVYYRNIGHADISVSIPRRNGMTEEKSLMIVSHASHKQRD 292
Query: 213 GSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQ 272
+L+ G L+ + + + E V+G++I A+ I+ L N +++ S +GD
Sbjct: 293 MFFFLVQSEYGDLYKITLDYSGEMVSGMQIAYFDTFPTANCITMLKNGFLFVASEFGDHG 352
Query: 273 L-----IKLNLQPDAKGS-------YVEVLE--------RYVNLGPIVDFCVVDLERQGQ 312
L + L+ P A + + V E +L PIVDF V DL ++G
Sbjct: 353 LYLFKSLGLDDAPTASSAGNTEMVFFEPVFEPRNLVLTATISSLSPIVDFKVADLAQEGT 412
Query: 313 GQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSLRSS--------------T 357
Q+ SG + +LR++R+G+ I + +L G G+W++ S
Sbjct: 413 PQMYALSGVSERANLRVLRHGLPITQMVDSQLPGTPAGIWTIPQSLTTMRNPQYQGIGTV 472
Query: 358 DDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHD--AIYNQLVQVTSG 415
+ P D ++VVSF+ T +L + + +EE + G S T T+ A + +VQ+ +
Sbjct: 473 ESPADRYIVVSFVGSTLVLGVG--ETVEEVQDSGILSTTTTILIRSMGANLDSIVQIFAQ 530
Query: 416 SVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD-GILTEVKHA 474
+R +++ R +EW++P ++++A AN QV++ GG ++Y E+ G LTEV
Sbjct: 531 GIRHINAERRV--SEWRAPGRKTISIAAANQQQVIICIGGSEIIYFELDPAGNLTEVFKK 588
Query: 475 QLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVL 534
+ +I+C++ PI + S+ A+ + D VRI SL N++ + + +
Sbjct: 589 DMRKDINCIEFAPIPRGRTMSRFVAISDY-DGPVRILSLERDNMLNQVSMVDTDRQQVEQ 647
Query: 535 LCAFE------------------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQ 576
LC E G +L L +G L +L+ TGE++D + LG +
Sbjct: 648 LCVAELMVHEPGVDQQIVNTQRQGTLFLHIGLKNGVLKRAVLDGLTGEISDMRTRILGRR 707
Query: 577 PITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIA 636
PI K + V A S R + YS+ + + ++++ + H +S P+ +
Sbjct: 708 PIKFFRVKIKGSPAVLALSTRVWMCYSNLGRYEITPLSVEPLDHAASLSSDQCPEGIVAT 767
Query: 637 KEGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQEQSRTFAICSLKNQSCAEESEM--- 691
E L I +I+ + L + IPL P+R + H + + +I + N S E E+
Sbjct: 768 SENNLKIFSIEKLGDLFNQVQIPLSCTPKRFVVHPQTNYIVSIETEHNFSTQFEPELAQD 827
Query: 692 ------------HFVRLLD---DQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGT 736
+R+LD +QT + +S L E SI + F + V VG
Sbjct: 828 LEMPRAGKGKWKSVIRILDPVSNQTLDLVS---LTNNEAAFSIGTVMF-QEGEVMLAVGC 883
Query: 737 A---YVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
A + P + F+ KL L+ ET +++ +F GKL+A + ++L
Sbjct: 884 AKDVQLQPRGFSSASIHLYQFVENGQKLALVHTTETDLIPHAVGSFQGKLIAGVGNVLRL 943
Query: 794 Y----KWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAI 849
Y K +LR R++ ++ +Q++GD IVV D+ +S+ Y+ E +
Sbjct: 944 YELGKKKLLRKCENRKVPNQITS--------IQSQGDRIVVSDVQESVHFFTYRRSENVL 995
Query: 850 EERARDYNANWMSAVEILD---------------------DDIYLGAENNFNLFTVRKNS 888
A D ++++ +LD D + EN ++ T+ S
Sbjct: 996 VLFADDTAPRFVTSCVMLDFDTVAIADKFGTIAVVRMPPSSDTDIDIENELDIGTLTTKS 1055
Query: 889 EGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIAS 948
+ A + +LE+V +++GE SL+ VG +I+ T+ G IG +
Sbjct: 1056 KLALNGAANKLELVASFYVGETCCNLTRTSLM-------VGGSEVIIYTTLGGAIGALIP 1108
Query: 949 L-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLS 1007
E+ F +L+ +R+ + + G +H +RS+ KN +DGDL E + L
Sbjct: 1109 FSSREEVDFFAQLEMQMRQEKESLCGRDHLSYRSY-----YFPVKNVIDGDLCEQYSTLD 1163
Query: 1008 RTRMDEISKTMNVSVEELCKRVEEL 1032
T+ +++ + + E+ K++E++
Sbjct: 1164 ATKQRSVAEELIRNPSEVLKKLEDI 1188
>gi|331247470|ref|XP_003336363.1| DNA damage-binding protein 1 [Puccinia graminis f. sp. tritici CRL
75-36-700-3]
Length = 962
Score = 246 bits (627), Expect = 6e-62, Method: Compositional matrix adjust.
Identities = 249/928 (26%), Positives = 411/928 (44%), Gaps = 136/928 (14%)
Query: 209 VDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIEL----LGETSIASTISYLDNAVVYI 264
VD + +LLGD G L L + K+ TG L G + Y+ + V++
Sbjct: 7 VDGVPNCWLLGDLYGNLIFLFL---KQSTTGTPPSLHYFHAGHVPSPEALVYITSGFVFL 63
Query: 265 GSSYGDSQLIKL-----NLQPDAKGS-YVEVLERYVNLGPIVDFCVVDLERQGQGQVVTC 318
S YGDSQL+KL +L PD S EV+ + NL PI DFCV + + Q+VTC
Sbjct: 64 ASHYGDSQLLKLPSPTSSLNPDNTASAQPEVITTFPNLAPISDFCVTEDRKSLVNQIVTC 123
Query: 319 SGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTD----DPFDTFLVVSFISETR 374
SG+++DGSLR++++GIGI E S+E+ G++ +W+LRSST + FD LV+S TR
Sbjct: 124 SGSHRDGSLRVIKHGIGIRESGSLEVGGVQRLWALRSSTHVNSVEDFDDRLVLSCADCTR 183
Query: 375 ILAMNLEDELEETEI-EGFCSQTQTLFC------HDAIYNQLVQVTSGSVRLVSSTSREL 427
LA+N + +EE + GF S T+ D+ +QVT S ++++ +
Sbjct: 184 FLALNEDGTIEEIGLFNGFESDVPTILAGNLLDGSDSTTRYSIQVT--SRKIIAGDAL-- 239
Query: 428 RNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDINP 487
W+ S+ A + + ++ +V L I DG E +L +IS L I+
Sbjct: 240 --VWEPDDAKSITRAALSVTTCAVSL-KEQVVVLCIKDGKFVEKGTYKLLNDISSLAID- 295
Query: 488 IGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEGIS----Y 543
A W + I S+ + I + + + + S+ + FEG
Sbjct: 296 -----QSENFVAAAQWVTNCIEIISVSSSSTICRVNTDSDFMVNSLKMTNFEGTESDGCR 350
Query: 544 LLCALGDGHLLNFLLNMKTG---ELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV 600
LL LGDG ++N L TG E + +LG +PI + + ++A SD+PT+
Sbjct: 351 LLIGLGDGKIMNVALG-PTGMHVEGDSPRFTTLGIRPIEFVSMRNATGEFLWANSDQPTI 409
Query: 601 I--YSSNKKLLYSNVNLK--EVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRS 656
I +N + Y+ V ++ VS ++ F DSL +A E+ IG ++ +K++I
Sbjct: 410 IDRIQNNGRFAYTPVTVQGGSVSSATGLHARFFQDSLVLASNDEIRIGKLNTTEKMNILK 469
Query: 657 IPLG-EHPRRICHQEQSRTFAICSLKNQSCAEESEMHFV---RLLDDQTFEFISTYPLDT 712
I LG E PRRI H E + + + + + + +H + ++ DD+TF+
Sbjct: 470 ISLGNEQPRRIAHSEDMKAYGVVCARLELDQDTGTIHRIGTFKVFDDETFQC-------- 521
Query: 713 FEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEK----- 767
SI + ++ ++ +GT + E E T GRIL + K L A+K
Sbjct: 522 ----SSIAAVKLGEEMIEHFVIGTGVIKSTEAEATIGRILAIRELNSKQDLTAKKRHFEL 577
Query: 768 ----ETKGAVYSLNAF-NGKLLAAINQKIQLYKWMLRD--------------------DG 802
+ GAV + NG +A+ N + + D DG
Sbjct: 578 TNVGKLSGAVGGVGGLPNGMFVASANAFVHAFGLKKGDSGRAFPSGTDTVLAGSVPEMDG 637
Query: 803 TRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMS 862
L G G ++ V T G ++VGDL KS+ LL + E + +ARD++A +
Sbjct: 638 GFRLLDTWG--GGFVSQTVVTDGTKVLVGDLYKSVVLLEFDLEHLELAVKARDFSAMSVR 695
Query: 863 AVEILD--DDIYLGAENNFNL---------FTVRKNSEGATDEERGRLEVVG-------- 903
+ + D G++ + V + G +DE+ + +
Sbjct: 696 PIGAISERDPTSPGSDGKSDRPNQDAHREDHGVHLDDGGGSDEDEEQALELEQEEEEEED 755
Query: 904 -EYHLGEFVNRFRH-----------------GSLVMRLPDSDVGQIPTVIFGTVNGVIGV 945
+Y+ + + + GSLV R +S + +IF T G IG+
Sbjct: 756 QDYNQSSYHDHLQPPHSPTFDQDQNPAAYGGGSLVPRFIESSLIGETKLIFVTSTGGIGL 815
Query: 946 IASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFL 1004
IA + ++ L + Q++L K+ VG L H +R F E + + + FLDGD +E L
Sbjct: 816 IAKIHSKKKTKQLARFQSDLSKISTSVGNLAHSAYRMFKTESRKIPSMGFLDGDFLEGCL 875
Query: 1005 DLSRTRMDE-ISKTMNVSVEELCKRVEE 1031
DL+ ++ + K M + EE + E
Sbjct: 876 DLTPDEVENLVKKMMALKAEEAQAKATE 903
>gi|18410222|ref|NP_567015.1| splicing factor 3B subunit 3 [Arabidopsis thaliana]
gi|18410226|ref|NP_567016.1| putative splicing factor [Arabidopsis thaliana]
gi|7019653|emb|CAB75754.1| spliceosomal-like protein [Arabidopsis thaliana]
gi|7019655|emb|CAB75756.1| spliceosomal-like protein [Arabidopsis thaliana]
gi|332645831|gb|AEE79352.1| splicing factor 3B subunit 3 [Arabidopsis thaliana]
gi|332645833|gb|AEE79354.1| putative splicing factor [Arabidopsis thaliana]
Length = 1214
Score = 244 bits (622), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 277/1187 (23%), Positives = 499/1187 (42%), Gaps = 192/1187 (16%)
Query: 4 VPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RP 62
V ++G I +L FR G +D++ + ++ + +L+++ E + + + + + G R
Sbjct: 55 VEVFGAIRSLAQFRLTGAQKDYIVVGSDSGRIVILEYNKEKN-VFDKVHQETFGKSGCRR 113
Query: 63 TDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL--- 117
GQ +DP R +IG L V+ D +L + + + +
Sbjct: 114 IVPGQYVAVDPKGRAVMIGACEKQKLVYVLNRDTTARLTISSPLEAHKSHTICYSLCGVD 173
Query: 118 YGCAKPTIVVLYQDNKDA-------------RHVKTYEVALKDKDFVEGPWSQNNLDNGA 164
G P + D +A +H+ YE+ L + V WS N +DNGA
Sbjct: 174 CGFDNPIFAAIELDYSEADQDPTGQAASEAQKHLTFYELDLG-LNHVSRKWS-NPVDNGA 231
Query: 165 DLLIPVPPPL---CGVLIIGEETIVYCS---ANAFKAIPIRPSITKAYGRVDADGSR--- 215
++L+ VP GVL+ E ++Y + + IP R + G + +
Sbjct: 232 NMLVTVPGGADGPSGVLVCAENFVIYMNQGHPDVRAVIPRRTDLPAERGVLVVSAAVHKQ 291
Query: 216 -----YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGD 270
+L+ G + + + H + V+ LK++ +AS+I L ++ S +G+
Sbjct: 292 KTMFFFLIQTEYGDVFKVTLDHNGDHVSELKVKYFDTIPVASSICVLKLGFLFSASEFGN 351
Query: 271 SQLIKLNL---QPDAKGSYVEVLE----------------------RYVNLGPIVDFCVV 305
L + +PD + S ++E + +L P++D V+
Sbjct: 352 HGLYQFQAIGEEPDVESSSSNLMETEEGFQPVFFQPRRLKNLVRIDQVESLMPLMDMKVL 411
Query: 306 DLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTDDPFDTF 364
++ + Q+ + G SLRI+R G+ I E A +L G +W+++ + D FD +
Sbjct: 412 NIFEEETPQIFSLCGRGPRSSLRILRPGLAITEMAVSQLPGQPSAVWTVKKNVSDEFDAY 471
Query: 365 LVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTS 424
+VVSF + T +L++ +++EE GF T +L + L+QV +R +
Sbjct: 472 IVVSFTNATLVLSIG--EQVEEVNDSGFLDTTPSLAVSLIGDDSLMQVHPNGIRHIREDG 529
Query: 425 RELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTEVKHAQLEYEISCL 483
R NEW++P S+ N QV++A GG L+Y E G L EV+ ++ +++CL
Sbjct: 530 R--INEWRTPGKRSIVKVGYNRLQVVIALSGGELIYFEADMTGQLMEVEKHEMSGDVACL 587
Query: 484 DINPIGENPSYSQIAAVGMWTDISVRIFSL-PD--LNLITKEHLGGEIIPRSVLLCAFEG 540
DI P+ E S+ AVG + D +VRI SL PD L +++ + + P S+L +
Sbjct: 588 DIAPVPEGRKRSRFLAVGSY-DNTVRILSLDPDDCLQILSVQSVSSA--PESLLFLEVQA 644
Query: 541 IS------------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNT 588
+L L +G L +++M TG+L+D + LG +P L + S +
Sbjct: 645 SIGGDDGADHPANLFLNSGLQNGVLFRTVVDMVTGQLSDSRSRFLGLKPPKLFSISVRGR 704
Query: 589 THVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDD 648
+ + S RP + Y + ++ + + PF+S + + L I ID
Sbjct: 705 SAMLCLSSRPWLGYIHRGHFHLTPLSYETLEFAAPFSSDQCAEGVVSVAGDALRIFMIDR 764
Query: 649 I-QKLHIRSIPLGEHPRR-ICHQEQSRTFAICSLKNQSCAEESE---------------- 690
+ + + +PL PR+ + H ++ I S + AEE E
Sbjct: 765 LGETFNETVVPLRYTPRKFVLHPKRKLLVIIESDQGAFTAEEREAARKECFEAGGVGENG 824
Query: 691 -----------------------------------MHFVRLLDDQTFEFISTYPLDTFEY 715
+ +R+LD +T L E
Sbjct: 825 NGNADQMENGADDEDKEDPLSDEQYGYPKAESEKWVSCIRVLDPKTATTTCLLELQDNEA 884
Query: 716 GCSILSCSFSD-DSNVYYCVGTAYVLPEENEPTKGRILVFI-----VEDGK-LQLIAEKE 768
S+ + +F D + VGT + + P K + FI VEDGK L+L+ + +
Sbjct: 885 AYSVCTVNFHDKEYGTLLAVGTVKGM--QFWPKKNLVAGFIHIYRFVEDGKSLELLHKTQ 942
Query: 769 TKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHI-LALYVQTRGDF 827
+G +L F G+LLA I ++LY D G + L +C + + +QT D
Sbjct: 943 VEGVPLALCQFQGRLLAGIGPVLRLY-----DLGKKRLLRKCENKLFPNTIISIQTYRDR 997
Query: 828 IVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR-- 885
I VGD+ +S Y+ +E + A D W++A +D D GA+ N++ VR
Sbjct: 998 IYVGDIQESFHYCKYRRDENQLYIFADDCVPRWLTASHHVDFDTMAGADKFGNVYFVRLP 1057
Query: 886 ----------------KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVG 929
K +G + +++ + ++H+G+ V + S++ G
Sbjct: 1058 QDLSEEIEEDPTGGKIKWEQGKLNGAPNKVDEIVQFHVGDVVTCLQKASMI-------PG 1110
Query: 930 QIPTVIFGTVNGVIGVI-ASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKT 988
++++GTV G IG + A + F L+ ++R+ + G +H +RS
Sbjct: 1111 GSESIMYGTVMGSIGALHAFTSRDDVDFFSHLEMHMRQEYPPLCGRDHMAYRS-----AY 1165
Query: 989 VDAKNFLDGDLIESF----LDLSRTRMDEISKTMNVSVEELCKRVEE 1031
K+ +DGDL E F +DL R DE+ +T E+ K++E+
Sbjct: 1166 FPVKDVIDGDLCEQFPTLPMDLQRKIADELDRT----PAEILKKLED 1208
>gi|291000406|ref|XP_002682770.1| predicted protein [Naegleria gruberi]
gi|284096398|gb|EFC50026.1| predicted protein [Naegleria gruberi]
Length = 1216
Score = 243 bits (621), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 257/1033 (24%), Positives = 453/1033 (43%), Gaps = 155/1033 (15%)
Query: 127 VLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIV 186
V YQ K + + YE+ L V+ S +D A+LLI +P PL GVL+ E I
Sbjct: 206 VAYQSLK--KMLTIYELDLGKNTIVKK--SSEEIDRSANLLIQIPEPLGGVLVCSENYIT 261
Query: 187 YCSA---NAFKAIPIRPSITKAYG--------RVDADGSRYLLG-DHAGLLHLLVITHEK 234
Y + IP R + + G +V G +LL + G L+ + +
Sbjct: 262 YKKEGQDDVRSPIPRRSGMPQERGLMIINSNLKVKKGGDIFLLAQNELGDLYRVSFNKDG 321
Query: 235 EKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKL------------------ 276
+V+ + ++ ++++ L N ++ S +GD + K+
Sbjct: 322 NRVSSVSLQYFDTIPVSNSFLALKNGNLFCASEFGDHKCYKITKIEPNEYPETYESIGSE 381
Query: 277 ------------------NLQPDAKG-SYVEVLERYVNLGPIVDFCVVDLERQGQGQVVT 317
N QP + +E++ +L P+ D V DL +G Q+
Sbjct: 382 TFPTFTPPMNRFEALMSDNDQPSTTNVTCLELVNEIKSLAPVTDMKVTDLMGEGTPQIYC 441
Query: 318 CSGAYKDGSLRIVRNGIGINEQASVEL-QGIKGMWSLRSSTDDPFDTFLVVSFISETRIL 376
+G SLR +R G+ +NE+ + L Q +++++ S +D FD ++++SF T +L
Sbjct: 442 LNGRGPTSSLRSLRYGLPVNEEVAAPLDQQATAIFTVKESMNDTFDKYIILSFSEFTMVL 501
Query: 377 AMNLEDELEETEIEGFCSQTQTLFCHD-AIYNQLVQVTSGSVRLVSSTSRELRNEWKSPP 435
++ E+ E TE GF + T+T++ + + VQV +R + + NEW S
Sbjct: 502 SVG-ENVAEVTE-SGFLTTTKTIYASNIGESGEFVQVHPKGIRHIHP---DRVNEWNSG- 555
Query: 436 GYSVNVATANASQVLLATGGGHLVYLE--IGDGILTEVKHAQLEYEISCLDINPIGENPS 493
+ A N Q++++ GG ++Y E G L E + L +++CL ++PI + +
Sbjct: 556 NKIIEKAAVNGYQIVVSLSGGEIIYFEYDTSSGNLIETERNDLSQDVACLALSPIQDGRT 615
Query: 494 YSQIAAVGMWTDISVRIFSLPD---LNLITKEHLGGEIIPRSVLLCAF-------EGISY 543
+ AVG + D +VR+ SL + +++++++ L + P S+ L E Y
Sbjct: 616 RGRFLAVGFY-DKTVRLISLGEYDMMSILSRQALPAD--PESLSLIELQSGHSRDETSLY 672
Query: 544 LLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYS 603
L L +G LL ++ TGEL+D + LGT+ + LR + + A S +P + S
Sbjct: 673 LNIGLSNGILLRSTVDSSTGELSDTRSRFLGTKGVKLRNVKVRGDNAILALSSKPWLGNS 732
Query: 604 SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDD-IQKLHIRSIPLGEH 662
N K+ + ++ ++ C F+S D L L I TI + K IPL
Sbjct: 733 INGKIEMTPLSYPALNSACNFSSEQIRDGLVSITAEHLRIITISSLLDKFTPEIIPLNYT 792
Query: 663 PRR-ICHQEQSRTFAICSLKNQSCAEESEM-HFVR---------LLDDQTFEFISTYP-- 709
PR+ + H E S + + N S ++ +++ L D + I T P
Sbjct: 793 PRKFVVHDETSHMIILQTDHNVSKENSTDRPDYIKVDESQKDPSLSKDVEYGAIKTKPKS 852
Query: 710 -----------------LDTFEY-----GCSILSCSFS------DDSNVYYCVGTAYVLP 741
LD FE SI +C FS + VGTA +
Sbjct: 853 NLWASYIRVYSPKKQANLDEFEIEDNEAAFSITTCKFSTSLSGAKSNESLIIVGTAKNM- 911
Query: 742 EENEPTK----GRILVF-IVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKW 796
+ PT+ G I VF I EDGKLQLI + E + Y+L+AF G+LL + +++Y
Sbjct: 912 -KLYPTRTCDCGYINVFQISEDGKLQLIHKTEVEDVPYALHAFRGRLLVGVKNMLRIY-- 968
Query: 797 MLRDDGTRELQSECGHHGHI-LALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARD 855
D G ++L +C + + G+ I VGD+ +S + + E ++ A +
Sbjct: 969 ---DLGKKKLLRKCENKSFPNFITSIAVDGNRIFVGDITESFHFVKFNSSENSLTIFADN 1025
Query: 856 YNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDE-------------ERGRLEVV 902
W++A ++D + G + N F R S+ +DE ERG L
Sbjct: 1026 TTPRWLTASALVDHNTIAGGDKFGNFFISRLPSD-VSDELEDSSTGKEKWIWERGLLN-- 1082
Query: 903 GEYHLGEFVNRFRHGSLVMRLPDSDV--GQIPTVIFGTVNGVIGVIASLPHEQYL-FLEK 959
G + +F GS++ + + + G +I+ T+ G +GV ++ + F +
Sbjct: 1083 GAPQKATEIVKFYVGSMITSIYKTSLIAGGPSILIYTTITGAVGVFFPFTSKKDIEFFTQ 1142
Query: 960 LQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN 1019
L+ +LR+ + G +H +RS+ K+V +DGDLIE F D+ +IS+ +
Sbjct: 1143 LEMHLREKNPPLCGRDHLAYRSYYFPVKSV-----VDGDLIEQFNDVDLQTKTKISEDLQ 1197
Query: 1020 VSVEELCKRVEEL 1032
++ E+ K++E++
Sbjct: 1198 RTINEIAKKIEDM 1210
>gi|374095609|gb|AEY85032.1| spliceosomal-like protein [Camellia sinensis]
Length = 1212
Score = 243 bits (621), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 275/1187 (23%), Positives = 502/1187 (42%), Gaps = 186/1187 (15%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+L V I+G I +L FR G +D++ + ++ + +L+++ E + + + + + G
Sbjct: 52 ILSVEIFGAIRSLAQFRLTGSQKDYIVVGSDSGRIVILEYNKEKN-VFDKVHQETFGKSG 110
Query: 61 -RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
R GQ IDP R +IG L V+ D +L + + + +
Sbjct: 111 CRRIVPGQYLAIDPKGRAVMIGACEKQKLVYVLNRDTAARLTISSPLEAHKSHTIFYSIC 170
Query: 118 ---YGCAKPTIVVLYQDNKDA-------------RHVKTYEVALKDKDFVEGPWSQNNLD 161
G P + D +A +H+ YE+ L + V WS+ +D
Sbjct: 171 GVDCGFDNPIFASIELDYSEADQDSTGQAAAEAQKHLTFYELDLG-LNHVSRKWSEQ-VD 228
Query: 162 NGADLLIPVP---PPLCGVLIIGEETIVYCS---ANAFKAIPIRPSITKAYGRVDADGSR 215
NGA++L+ VP GVL+ E ++Y + + IP R + G + +
Sbjct: 229 NGANMLVTVPGGGDGPSGVLVCAENFVIYKNQGHPDVRAVIPRRYDLPAERGVLIVSAAM 288
Query: 216 --------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSS 267
+LL G + + + H+ ++VT L+I+ + +++ L + ++ S
Sbjct: 289 HKQKSMFFFLLQTEYGDIFKVTLDHDNDRVTELRIKYFDTIPVTASLCVLKSGFLFAASE 348
Query: 268 YGDSQLIKLNL---QPDAKGSYVEVLE----------------------RYVNLGPIVDF 302
+G+ L + +PD + S ++E + +L PI+D
Sbjct: 349 FGNHALYQFQAIGDEPDVESSSATLMETEEGFQPVFFQPRRLKNLVRIDQVESLMPIMDM 408
Query: 303 CVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSLRSSTDDPF 361
+++L + Q+ T G SLRI+R G+ I+E A +L G+ +W+++ + +D F
Sbjct: 409 KIINLFEEETPQIFTLCGRGPRSSLRILRPGLAISEMAVSQLPGVPSAVWTVKKNVNDEF 468
Query: 362 DTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVS 421
D ++VVSF + T +L++ + +EE GF T +L + L+QV +R +
Sbjct: 469 DAYIVVSFANATLVLSIG--ETVEEVSDSGFLDTTPSLAVSLIGDDSLMQVHPSGIRHIR 526
Query: 422 STSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTEVKHAQLEYEI 480
R NEW++P ++ +N QV++A GG ++Y E+ G L EV+ ++ ++
Sbjct: 527 EDGR--INEWRTPGKRTIVKVGSNRLQVVIALSGGEIIYFEVDMTGQLMEVEKQEMSGDV 584
Query: 481 SCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PD--LNLITKEHLGGEIIPRSVLLCA 537
+CLDI P+ E S+ AVG + D +RI SL PD + +++ + + P S+L
Sbjct: 585 ACLDIAPVPEGRQRSRFLAVGSY-DNCIRILSLDPDDCMQVLSLQSVSSP--PESLLFLE 641
Query: 538 FEGIS------------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSS 585
+ +L L +G L +++M TG+L+D + LG + L +
Sbjct: 642 VQASVGGEDGADHPASLFLNAGLQNGVLFRTVVDMVTGQLSDARSRFLGLRAPKLFSVII 701
Query: 586 KNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGT 645
+ + S RP + Y L + ++ + + F+S + + L + T
Sbjct: 702 RGRRAMLCLSSRPWLGYIHQGHFLLTPLSYETLEFAASFSSDQCAEGVVAVAGDALRVFT 761
Query: 646 IDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAICSL------------KNQSCAEESEM- 691
I+ + + + +IPL PR+ Q + + I + C E++ M
Sbjct: 762 IERLGETFNETAIPLRYTPRKFVLQPKRKLLVIIESDQGAYAAEQRENAKKECFEDAGMG 821
Query: 692 -------------------------------------HFVRLLDDQTFEFISTYPLDTFE 714
+R+LD +T L E
Sbjct: 822 ENGKVEQMENGGDDEDKEDPLSDEQYGYPKVESDRWVSCIRVLDPRTANTTCLLELQDNE 881
Query: 715 YGCSILSCSFSD-DSNVYYCVGTAYVLP--EENEPTKGRILVF-IVEDGK-LQLIAEKET 769
SI +F D + VGTA L + + G I ++ VEDGK L+L+ + +
Sbjct: 882 AAFSICLVNFHDKEYGTLLAVGTAKGLQFWPKRSISSGYIHIYRFVEDGKSLELLHKTQV 941
Query: 770 KGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY-VQTRGDFI 828
+L F GKLLA + ++LY D G R+L +C + + + T D I
Sbjct: 942 DDVPLALCQFQGKLLAGVGSVLRLY-----DLGKRKLLRKCENKLFPNTITSIHTYRDRI 996
Query: 829 VVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR--- 885
VGD+ +S Y+ +E + A D W++A +D D GA+ N++ VR
Sbjct: 997 YVGDIQESFHYCKYRRDENQLYIFADDCVPRWLTASYHIDFDTMAGADKFGNIYFVRLAQ 1056
Query: 886 ---------------KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQ 930
K +G + ++E + ++H+G+ V + SL+ P
Sbjct: 1057 DVSDEIEEDPTGGKIKWEQGKLNGAPNKVEEIVQFHVGDVVTCLQKASLI---PSGG--- 1110
Query: 931 IPTVIFGTVNGVIG-VIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTV 989
VI+GTV G +G ++A + F L+ ++R+ + G +H +RS
Sbjct: 1111 -ECVIYGTVMGSLGALLAFTSRDDVDFFSHLEMHMRQENPPLCGRDHMAYRS-----AYF 1164
Query: 990 DAKNFLDGDLIESF----LDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
K+ +DGDL E F +D+ R DE+ +T E+ K++EE+
Sbjct: 1165 PVKDVIDGDLCEQFPTLPMDMQRKIADELDRTPG----EILKKLEEV 1207
>gi|356576847|ref|XP_003556541.1| PREDICTED: splicing factor 3B subunit 3-like [Glycine max]
Length = 1214
Score = 243 bits (619), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 282/1190 (23%), Positives = 498/1190 (41%), Gaps = 190/1190 (15%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+L V I+G I +L FR G +D++ + ++ + +L+++ E + + + + + G
Sbjct: 52 ILSVEIFGAIRSLAQFRLMGAQKDYIVVGSDSGRIVILEYNKEKN-VFDKIHQETFGKSG 110
Query: 61 -RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
R GQ IDP R +IG L V+ D +L + + + L
Sbjct: 111 CRRIVPGQYLAIDPKGRAVMIGACEKQKLVYVLNRDTAARLTISSPLEAHKSHTLVYSIC 170
Query: 118 ---YGCAKPTIVVLYQDNKDA-------------RHVKTYEVALKDKDFVEGPWSQNNLD 161
G P + D +A +H+ YE+ L + V WS+ +D
Sbjct: 171 GVDCGFENPIFAAIELDYSEADQDSTGLAASEAQKHLTFYELDL-GLNHVSRKWSEQ-VD 228
Query: 162 NGADLLIPVP---PPLCGVLIIGEETIVY---CSANAFKAIPIRPSITKAYGRVDADGSR 215
NGA+LL+ VP GVL+ E ++Y IP R + G + +
Sbjct: 229 NGANLLVTVPGGGDGPSGVLVCAENFVIYKNQAHPEVRAVIPRRTDLPAERGVLIVSAAM 288
Query: 216 --------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSS 267
+LL G + + + H ++V+ LKI+ + +++ L + ++ S
Sbjct: 289 HKLKNMFFFLLQTEYGDIFKVTLEHNNDRVSELKIKYFDTIPVTASMCVLKSGFLFAASE 348
Query: 268 YGDSQLIKLNL---QPDAKGSYVEVLE----------------------RYVNLGPIVDF 302
+G+ L + + D + S ++E + +L PI+D
Sbjct: 349 FGNHALYQFKSIGDEDDVEASSATLMETEQGFQPVFFQPRRLKNLVRIDQVESLMPIMDM 408
Query: 303 CVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSLRSSTDDPF 361
V +L + Q+ T G SLRI+R G+ ++E A +L GI +W+++ + D F
Sbjct: 409 KVSNLFEEETPQIYTLCGRGPRSSLRILRTGLAVSEMAVSKLPGIPSAVWTVKKNAIDEF 468
Query: 362 DTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVS 421
D ++VVSF + T +L++ + +EE GF T +L + L+QV +R +
Sbjct: 469 DAYIVVSFTNATLVLSIG--ETVEEVSDSGFLDTTPSLAVSLIGDDSLMQVHPNGIRHIR 526
Query: 422 STSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTEVKHAQLEYEI 480
R NEW++P +++ +N QV++A GG L+Y E+ G L EV+ ++ ++
Sbjct: 527 EDGR--INEWRTPGKRTISKVGSNRLQVVIALSGGELIYFEVDVTGQLMEVEKHEMSGDV 584
Query: 481 SCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPRSVLLCAFE 539
+CLDI P+ E S+ AVG + D ++RI SL PD + P S+L +
Sbjct: 585 ACLDIAPVPEGRQRSRFLAVGSY-DKTIRILSLDPDDCMQALSVQSVSSAPESLLFLEVQ 643
Query: 540 GIS------------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKN 587
+L L +G + +++M TG+L+D + LG + L +
Sbjct: 644 ASVGGEDGADHPASLFLNAGLQNGVMFRTVVDMVTGQLSDSRSRFLGLRAPKLFPIIVRG 703
Query: 588 TTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTID 647
+ S RP + Y L + ++ + + + F+S + + L I TI+
Sbjct: 704 KRAMLCLSSRPWLGYIHQGHFLLTPLSYETLEYAASFSSDQCVEGVVAVAGEALRIFTIE 763
Query: 648 DI-QKLHIRSIPLGEHPRRICHQEQSRTFA-ICSLKNQSCAEESEM-------------- 691
+ + + IPL PR+ Q + + I S + AEE E
Sbjct: 764 RLGETFNETVIPLRYTPRKFVLQPKRKLLVMIESDQGALTAEEREAARKECFESAQAGEN 823
Query: 692 ------------------------HF-------------VRLLDDQTFEFISTYPLDTFE 714
H+ +R+LD +T L E
Sbjct: 824 GTESADQMENGGEDEDKDDPLSDEHYGYPKAESDKWASCIRVLDPRTGNTTCLLELQENE 883
Query: 715 YGCSILSCSFSD-DSNVYYCVGTA---YVLPEENEPTKGRILVF-IVEDGK-LQLIAEKE 768
SI + +F D + VGTA LP+ T G I ++ VEDG+ L+L+ + +
Sbjct: 884 AAFSICTINFHDKEYGTLLAVGTAKGLQFLPKRT-ITAGFIHIYRFVEDGRSLELLHKTQ 942
Query: 769 TKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHH---GHILALYVQTRG 825
+G +L F G+LLA I ++LY D G R L +C + I++++
Sbjct: 943 VEGVPLALCQFQGRLLAGIGPVLRLY-----DLGKRRLLRKCENKLFPNTIVSIHAYR-- 995
Query: 826 DFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR 885
D I VGD+ +S Y+ +E + A D W++A +D D GA+ N++ VR
Sbjct: 996 DRIYVGDVQESFHYCKYRRDENQLYIFADDCVPRWLTASYHIDFDTMAGADKFGNIYFVR 1055
Query: 886 ------------------KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSD 927
K +G + ++E + ++H+G+ V + SL+
Sbjct: 1056 LPQDVSDEIEEDPTGGRIKWEQGKLNGAPNKVEEIVQFHIGDVVTCLQKASLI------- 1108
Query: 928 VGQIPTVIFGTVNGVIGVI-ASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEK 986
G ++FGTV G +G + A + F L+ ++R+ + G +H +RS
Sbjct: 1109 PGGGECIVFGTVMGSVGALHAFTSRDDVDFFSHLEMHMRQDHPPLCGRDHMAYRS----- 1163
Query: 987 KTVDAKNFLDGDLIESF----LDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
K+ +DGDL E + +DL R DE+ +T E+ K++EE+
Sbjct: 1164 AYFPVKDVIDGDLCEQYPTLPMDLQRKIADELDRTPG----EILKKLEEV 1209
>gi|255588145|ref|XP_002534515.1| spliceosomal protein sap, putative [Ricinus communis]
gi|223525135|gb|EEF27867.1| spliceosomal protein sap, putative [Ricinus communis]
Length = 1214
Score = 242 bits (618), Expect = 6e-61, Method: Compositional matrix adjust.
Identities = 278/1187 (23%), Positives = 499/1187 (42%), Gaps = 185/1187 (15%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+L V I+G I +L FR G +D++ + ++ + +L+++ E + + + + + G
Sbjct: 53 ILSVEIFGAIRSLAQFRLTGSQKDYIVVGSDSGRIVILEYNKERN-VFDKIHQETFGKSG 111
Query: 61 -RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
R GQ IDP R +IG L V+ D +L + + + +
Sbjct: 112 CRRIVPGQYLAIDPKGRAVMIGACEKQKLVYVLNRDTAARLTISSPLEAHKSHTIVYSIC 171
Query: 118 ---YGCAKPTIVVLYQDNKDARHVKTYEVALKDK------------DFVEGPWSQNNLDN 162
G P + D +A T + A + + + V WS+ +DN
Sbjct: 172 GVDCGFDNPIFAAIELDYSEADLDSTGQAANEAQKVLTFYELDLGLNHVSRKWSE-QVDN 230
Query: 163 GADLLIPVP---PPLCGVLIIGEETIVYCS---ANAFKAIPIRPSITKAYGR--VDADGS 214
GA++L+ VP GVL+ E ++Y + + IP R + G V A
Sbjct: 231 GANMLVTVPGGGDGPSGVLVCAENFVIYKNEGHPDVRAVIPRRADLPAERGVLIVSAATH 290
Query: 215 R------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSY 268
R +LL G + + + HE +KV LKI+ + S++ + ++ S +
Sbjct: 291 RQKSMFFFLLQTEYGDIFKVTLDHENDKVKELKIKYFDTIPVTSSMCVMKLGFLFAASEF 350
Query: 269 GDSQLIKLNL---QPDAKGSYVEVLE----------------------RYVNLGPIVDFC 303
G+ L + + D + S ++E + +L PI+D
Sbjct: 351 GNHALYQFQAIGEEADVEASSATLMETEEGFQPVFFQPRGLKNLVRIDQVESLMPIMDMK 410
Query: 304 VVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSLRSSTDDPFD 362
+ +L + Q+ + G SLRI+R G+ I+E A +L G+ +W+++ + +D FD
Sbjct: 411 IANLFDEETPQIFSLCGRGPRSSLRILRPGLAISEMAVSQLPGVPSAVWTVKKNVNDEFD 470
Query: 363 TFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSS 422
+++VSF + T +L++ + +EE GF T +L + L+QV +R +
Sbjct: 471 AYIIVSFNNATLVLSIG--ETVEEVNNSGFLDTTPSLAVSLIGDDSLMQVHPNGIRHIRE 528
Query: 423 TSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTEVKHAQLEYEIS 481
R NEW++P ++ +N QV++A GG L+Y E+ G L EV+ ++ +++
Sbjct: 529 DGR--INEWRTPGKRTIVKVGSNRVQVVIALSGGELIYFEVDITGQLMEVEKHEMSGDVA 586
Query: 482 CLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PD--LNLITKEHLGGEIIPRSVLLCAF 538
CLDI P+ E S+ AVG + D ++RI SL PD + +++ + + P S+L
Sbjct: 587 CLDIAPVPEGRQRSRFLAVGSF-DNTIRILSLDPDDCMQILSVQSVSSP--PESLLFLEV 643
Query: 539 EGIS------------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSK 586
+ +L L G L L++M TG+L+D + LG + L + +
Sbjct: 644 QASVGREDVADHPASLFLNAGLQSGVLFRTLVDMVTGQLSDSRSRFLGLRAPKLFSILVR 703
Query: 587 NTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTI 646
+ S RP + Y L + ++ + + F+S + + L I TI
Sbjct: 704 GRRAMLCLSSRPWLGYIHQGHFLLTPLSYETLEFAASFSSDQCAEGVVAVAGDALRIFTI 763
Query: 647 DDI-QKLHIRSIPLGEHPRRICHQEQSRTFAIC-SLKNQSCAEESE-------------- 690
+ + + + +IPL PR+ Q + + I S + AEE E
Sbjct: 764 ERLGETFNETAIPLRYTPRKFVLQPKKKLLVIVESDQGAYTAEEREAAKKECFEAAGMGE 823
Query: 691 ------------------------------------MHFVRLLDDQTFEFISTYPLDTFE 714
+ +R+LD +T L E
Sbjct: 824 NGSANAEQMENGDDEDKDDPLSDEQYGYPKAEAEKWVSCIRVLDPRTAATTCLLELQDNE 883
Query: 715 YGCSILSCSFSD-DSNVYYCVGTAYVLP--EENEPTKGRILVF-IVEDGK-LQLIAEKET 769
S+ + +F D + VGTA L + + G I ++ V+DG+ L+L+ + +
Sbjct: 884 AAFSVCTVNFHDKEHGTLLAVGTAKGLQFWPKRSLSAGFIHIYKFVDDGRALELLHKTQV 943
Query: 770 KGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY-VQTRGDFI 828
+G +L+ F G+LLA I ++LY D G + L +C + ++ +QT D I
Sbjct: 944 EGVPLALSQFQGRLLAGIGPVLRLY-----DLGKKRLLRKCENKLFPNSIVSIQTYRDRI 998
Query: 829 VVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR--- 885
VGD+ +S Y+ +E + A D W++A +D D GA+ N++ VR
Sbjct: 999 YVGDIQESFHFCKYRRDENQLYIFADDCVPRWLTASHHVDFDTMAGADKFGNIYFVRLPQ 1058
Query: 886 ---------------KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQ 930
K +G + ++E + ++H+G+ V SL+ G
Sbjct: 1059 DVSDEIEEDPTGGKIKWEQGKLNGAPNKVEEIVQFHIGDVVTSLSKASLI-------PGG 1111
Query: 931 IPTVIFGTVNGVIGVIASLPHEQYL-FLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTV 989
+I+GTV G +G + + F L+ +LR+ + G +H +RS
Sbjct: 1112 GECIIYGTVMGSVGALLPFTSRDDVDFFSHLEMHLRQDHPPLCGRDHMAYRS-----AYF 1166
Query: 990 DAKNFLDGDLIESF----LDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
K+ +DGDL E F LD R DE+ +T E+ K++EE+
Sbjct: 1167 PVKDVIDGDLCEQFPTLPLDAQRKIADELDRTPG----EILKKLEEV 1209
>gi|168064351|ref|XP_001784126.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664326|gb|EDQ51050.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1214
Score = 242 bits (617), Expect = 9e-61, Method: Compositional matrix adjust.
Identities = 278/1189 (23%), Positives = 509/1189 (42%), Gaps = 188/1189 (15%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+L V ++G I +L FR G +D++ + ++ + +L+++ E + L + + + G
Sbjct: 52 LLSVEVFGAIRSLAQFRLTGSQKDYIVVGSDSGRIVILEYNKEKN-LFEKVHQETFGKSG 110
Query: 61 -RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
R GQ +DP R +IG L V+ DN +L + + + +
Sbjct: 111 CRRIVPGQYLAVDPKGRAVMIGACEKQKLVYVLNRDNAARLTISSPLEAHKSHTIVYSIT 170
Query: 118 ---YGCAKPTIVVLYQD-------------NKDARHVKTYEVALKDKDFVEGPWSQNNLD 161
G P + D N+ +H+ YE+ L V W++ +D
Sbjct: 171 GVDCGFDNPIFAAIELDYSEADQDSTGQAANEAQKHLTFYELDLGLNHVVR-KWTEP-ID 228
Query: 162 NGADLLIPVP-----PPLCGVLIIGEETIVYCSAN---AFKAIPIRPSITKAYGRVDADG 213
NGA++L+ VP P GVL+ E ++Y + N IP R + G +
Sbjct: 229 NGANMLVTVPGGGDGP--SGVLVCAENFVIYKNQNQPDVRAVIPRRQDLPPNRGVLIVSA 286
Query: 214 SR--------YLLGDHAGLLHLLVITHEKE-KVTGLKIELLGETSIASTISYLDNAVVYI 264
+ +LL G + + + ++K+ +VT LKI+ + S + L ++
Sbjct: 287 ATHKQKALFFFLLQTEYGDIFKVTLDYDKDDQVTELKIKYFDTIPVTSAMCVLKAGFLFA 346
Query: 265 GSSYGDSQLIK---LNLQPDAKGS---YVEVLERYV-------------------NLGPI 299
S +G+ L + + P+ + S VE E Y +L PI
Sbjct: 347 ASEFGNHSLYQFQSIGEDPEVESSSSTLVETEEGYQPVFFQPRKLKNLVQIDDIESLMPI 406
Query: 300 VDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSLRSSTD 358
+D V +L + Q+ + G SLRI+R G+ + E A L G+ +W+++ +
Sbjct: 407 MDMKVANLFEEETPQIFSLCGRGPRSSLRILRPGLAVTEMAVSPLPGVPSAVWTVKKHAN 466
Query: 359 DPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVR 418
D FD ++VVSF++ T +L++ + +EE GF T +L + L+QV +R
Sbjct: 467 DEFDAYIVVSFVNATLVLSIG--ETVEEVSDSGFLDTTPSLAISLLGDDSLMQVHPSGIR 524
Query: 419 LVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTEVKHAQLE 477
+ + R NEWK+P ++ N QV++A GG L+Y E+ G L E++ +
Sbjct: 525 HIRADGR--INEWKTPGKKTIVKVGYNRMQVVIALSGGELIYFEMDMSGQLMEIEKRDMT 582
Query: 478 YEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PD--LNLITKEHLGGEIIPRSVL 534
+++CLDI P+ E S+ AVG + D ++RI SL PD + +++ + + P S+L
Sbjct: 583 GDVACLDIAPVPEGRQRSRFLAVGSY-DSTIRILSLDPDDCMQILSVQAVSSP--PESLL 639
Query: 535 LCAFEGIS------------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRT 582
L + + +L L +G LL ++M TG+L+D + LG + L +
Sbjct: 640 LLEVQASTGGEDGADHPASVFLNAGLQNGVLLRTEVDMVTGQLSDTRTRFLGLRAPKLFS 699
Query: 583 FSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELT 642
+ + S RP + Y L + ++ + + + F+S + + L
Sbjct: 700 ALVRGRRAMLCLSSRPWLGYIHQGHFLLTPLSYETLEYAASFSSDQCAEGVVAVAGDALR 759
Query: 643 IGTIDDI-QKLHIRSIPLGEHPRR-ICHQEQSRTFAICSLKNQSCAEESEMH-------- 692
+ TI+ + + + +PL PR+ I H +Q+ + S + E+ E +
Sbjct: 760 VFTIERLGETFNQTVVPLRYTPRKFILHPKQNSLIILESDQGAFNTEDREANKKEAVEAA 819
Query: 693 -----------------------------------------FVRLLDDQTFEFISTYPLD 711
+R+LD +T L
Sbjct: 820 GYHENGKADGEDEDMADGEDEEPLPDEQYGYPKAESNKWASCIRVLDPKTSTTTCLLELQ 879
Query: 712 TFEYGCSILSCSFSDDSNV--YYCVGTAYVLP--EENEPTKGRILVF-IVEDGK-LQLIA 765
E S+ + +F D+ + VGTA L + E + G I ++ ++GK L+L+
Sbjct: 880 ENEAAFSLCAVNFHDNKELGTLIAVGTAKDLQFMPKKEASGGFIHIYRFADEGKVLELVH 939
Query: 766 EKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGH---ILALYVQ 822
+ G +L F G+LL + Q +++Y D G R+L +C + I+A++
Sbjct: 940 KTPVDGVPTALCQFQGRLLVGVGQVLRIY-----DLGKRKLLRKCENKNFPNTIIAIH-- 992
Query: 823 TRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLF 882
T GD I VGD+ +S + Y+ +E + A D W++A +D D GA+ N++
Sbjct: 993 TYGDRIYVGDIQESFHYVKYRRDENQLYTFADDSCPRWLTASLHIDFDTMAGADKFGNIY 1052
Query: 883 TVRKNSE-----------GATDEERGRL-------EVVGEYHLGEFVNRFRHGSLVMRLP 924
+R + G E+GRL E + ++H+GE V + SL+
Sbjct: 1053 VMRLPQDVSEEIEDDPTGGKIKWEQGRLNGAPNKVEEIIQFHVGEVVTSLQKASLI---- 1108
Query: 925 DSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFN 983
G +V++GT+ G +G + E F L+ +LR+ + G +H +RS
Sbjct: 1109 ---PGGGESVLYGTIMGSMGALLPFSSREDVDFFSHLEMHLRQENPPLCGRDHMGFRS-- 1163
Query: 984 NEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
K+ +DGDL E + L+ +I+ ++ + E+ K++E++
Sbjct: 1164 ---AYFPVKDVIDGDLCEQYPMLTSELQKKIADDLDRTPGEILKKLEDI 1209
>gi|449459948|ref|XP_004147708.1| PREDICTED: splicing factor 3B subunit 3-like [Cucumis sativus]
gi|449513493|ref|XP_004164340.1| PREDICTED: splicing factor 3B subunit 3-like [Cucumis sativus]
Length = 1214
Score = 241 bits (614), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 277/1189 (23%), Positives = 503/1189 (42%), Gaps = 188/1189 (15%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+L V I+G I +L FR G +D++ + ++ + +L+++ + + + + + + G
Sbjct: 52 LLSVEIFGAIRSLAQFRLTGSQKDYIVVGSDSGRIVILEYNKDKN-VFDKIHQETFGKSG 110
Query: 61 -RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
R GQ IDP R +IG L V+ D +L + + + +
Sbjct: 111 CRRIVPGQYLAIDPKGRAVMIGACEKQKLVYVLNRDTAARLTISSPLEAHKSHTIVYSIC 170
Query: 118 ---YGCAKPTIVVLYQDNKDA-------------RHVKTYEVALKDKDFVEGPWSQNNLD 161
G P + D +A +H+ YE+ L + V WS+ +D
Sbjct: 171 GIDCGFDNPIFAAIELDYSEADQDSTGVAASEAQKHLTFYELDLG-LNHVSRKWSEP-VD 228
Query: 162 NGADLLIPVP---PPLCGVLIIGEETIVYCS---ANAFKAIPIRPSITKAYGRVDADGSR 215
NGA++L+ VP GVL+ E ++Y + + IP R + G + +
Sbjct: 229 NGANMLVTVPGGGDGPSGVLVCAENFVIYKNQGHPDVRAVIPRRADLPAERGVLIVSAAM 288
Query: 216 --------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSS 267
+LL G + + + H + V LKI+ + +++ L + ++ S
Sbjct: 289 HKQKTMFFFLLQTEYGDIFKVTLEHNNDSVKELKIKYFDTIPVTASMCVLKSGFLFAASE 348
Query: 268 YGDSQLIKLNL---QPDAKGSYVEVLE----------------------RYVNLGPIVDF 302
+G+ L + D + S ++E + +L PI+D
Sbjct: 349 FGNHSLYQFQAIGEDADVESSSATLMETEEGFQPVFFQPRRLKNLMRIDQVESLMPIMDM 408
Query: 303 CVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSLRSSTDDPF 361
+++L + Q+ T G SLRI+R G+ I+E A EL G+ +W+++ + +D F
Sbjct: 409 KIINLFEEETPQIFTLCGRGPRSSLRILRPGLAISEMAVSELPGVPSAVWTVKKNINDEF 468
Query: 362 DTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVS 421
D ++VVSF + T +L++ + +EE GF T +L + L+QV +R +
Sbjct: 469 DAYIVVSFANATLVLSIG--ETVEEVSDSGFLDTTPSLAVSLIGDDSLMQVHPNGIRHIR 526
Query: 422 STSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTEVKHAQLEYEI 480
R NEW++P ++ +N QV++A GG L+Y E+ G L EV+ ++ ++
Sbjct: 527 EDGR--INEWRTPGKRTIVKVGSNRLQVVIALSGGELIYFEVDMTGQLMEVEKHEMSGDV 584
Query: 481 SCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PD--LNLITKEHLGGEIIPRSVL--- 534
+CLDI P+ E S+ AVG + D ++RI SL PD + +++ + + P S+L
Sbjct: 585 ACLDIAPVPEGRQRSRFLAVGSY-DNTIRILSLDPDDCMQILSVQSVSAA--PESLLFLE 641
Query: 535 -LCAFEGIS--------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSS 585
L + G +L AL G L +++M TG+L+D + LG + L +
Sbjct: 642 VLASVGGEDGADHPASLFLNAALHSGVLFRTVVDMVTGQLSDSRSRFLGLRAPKLFSVVL 701
Query: 586 KNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGT 645
+ + S RP + Y L + ++ + + + F+S + + L + T
Sbjct: 702 RGRRAILCLSSRPWLGYIHQGHFLLTPLSYETLEYASSFSSDQCAEGVVAVAGNFLRVFT 761
Query: 646 IDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAIC-SLKNQSCAEESEM------------ 691
I+ + + + IPL PR+ Q + + + S + AEE E
Sbjct: 762 IERLGETFNETVIPLRYTPRKFVLQPRRKLLVVIESDQGAFTAEEREAAKKECFEAAGAG 821
Query: 692 --------------------------HF-------------VRLLDDQTFEFISTYPLDT 712
H+ +R+LD ++ L
Sbjct: 822 ENGNGTMDQMENGGDDEDKDDPLSDEHYGYPKAESEKWVSCIRVLDPRSATTTCLLELQD 881
Query: 713 FEYGCSILSCSFSD-DSNVYYCVGTAYVLP--EENEPTKGRILVF-IVEDGK-LQLIAEK 767
E S+ + +F D + VGTA L + G I ++ +EDGK L+L+ +
Sbjct: 882 NEAAFSVCTVNFHDKEYGTLLAVGTAKGLQFFPKRSLVAGYIHIYRFLEDGKSLELLHKT 941
Query: 768 ETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY-VQTRGD 826
+ +G +L F G+LLA + ++LY D G R L +C + + +QT D
Sbjct: 942 QVEGVPLALAQFQGRLLAGLGSVLRLY-----DLGKRRLLRKCENKLFPNTIVSIQTYRD 996
Query: 827 FIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR- 885
I VGD+ +S Y+ +E + A D W++A +D D GA+ N++ VR
Sbjct: 997 RIYVGDIQESFHYCKYRRDENQLYIFADDSVPRWLTASYHVDFDTMAGADKFGNIYFVRL 1056
Query: 886 -----------------KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV 928
K +G + ++E + ++H+G+ V + SL+
Sbjct: 1057 PQDVSDEIEEDPTGGKIKWEQGKLNGAPNKVEEIIQFHIGDVVTSLQKASLI-------P 1109
Query: 929 GQIPTVIFGTVNGVIGVI-ASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKK 987
G +++GTV G +G + A + F L+ ++R+ + G +H +RS
Sbjct: 1110 GGGECILYGTVMGSLGALHAFTSRDDVDFFSHLEMHMRQEHPPLCGRDHMGYRS-----A 1164
Query: 988 TVDAKNFLDGDLIESF----LDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
K+ +DGDL E F LD+ R DE+ +T E+ K++EE+
Sbjct: 1165 YFPVKDVIDGDLCEQFPSLPLDMQRKIADELDRTPG----EILKKLEEV 1209
>gi|224100909|ref|XP_002312063.1| predicted protein [Populus trichocarpa]
gi|222851883|gb|EEE89430.1| predicted protein [Populus trichocarpa]
Length = 1213
Score = 241 bits (614), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 273/1189 (22%), Positives = 500/1189 (42%), Gaps = 189/1189 (15%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+L V I+G I +L FR G +D++ + ++ + +L+++ E + + + + + G
Sbjct: 52 VLSVEIFGAIRSLAQFRLTGAQKDYIVVGSDSGRIVILEYNKERN-VFDKIHQETFGKSG 110
Query: 61 -RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
R GQ +DP R +IG L V+ D +L + + + +
Sbjct: 111 CRRIVPGQYLAVDPKGRAVMIGACEKQKLVYVLNRDTVARLTISSPLEAHKSHTICYSVC 170
Query: 118 ---YGCAKPTIVVLYQDNKDARHVKTYEVALKDK------------DFVEGPWSQNNLDN 162
G P + D +A T + A + + + V WS+ +DN
Sbjct: 171 GVDCGFDNPIFAAIELDYSEADQDSTGQAAGEAQKNLTFYELDLGLNHVSRKWSE-QVDN 229
Query: 163 GADLLIPVP---PPLCGVLIIGEETIVYCS---ANAFKAIPIRPSITKAYGRVDADGSR- 215
GA++L+ VP G+L+ E ++Y + + IP R + G + +
Sbjct: 230 GANMLVTVPGGGDGPSGILVCAENFVIYKNQGHPDVRAVIPRRADLPAERGVLIVSAATH 289
Query: 216 -------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSY 268
+LL G + + + HE +KV LKI+ + S+I L + ++ S +
Sbjct: 290 KQKSMFFFLLQTEYGDIFKVTLDHENDKVKELKIKYFDTIPVTSSICVLKSGFLFAASEF 349
Query: 269 GDSQLIKLNL---QPDAKGSYVEVLE----------------------RYVNLGPIVDFC 303
G+ L + + D + S ++E + +L P++D
Sbjct: 350 GNHALYQFQAIGEEEDVEASSATLMETEEGFQPVFFQPRGLKNLVRIDQVESLMPVMDMK 409
Query: 304 VVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSLRSSTDDPFD 362
V ++ + Q+ + G SLRI+R G+ I+E A +L G+ +W+++ + +D FD
Sbjct: 410 VANIFDEETPQIFSLCGRGPRSSLRILRPGLAISEMAVSQLPGVPSAVWTVKRNANDEFD 469
Query: 363 TFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSS 422
++VVSF + T +L++ + +EE GF T +L + L+Q+ +R +
Sbjct: 470 AYIVVSFNNATLVLSIG--ETVEEVSDSGFLDTTPSLAVSLIGDDSLMQIHPNGIRHIRE 527
Query: 423 TSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTEVKHAQLEYEIS 481
R NEW++P ++ +N QV++A GG L+Y E+ G L EV+ ++ +++
Sbjct: 528 DGR--INEWRTPGKRTIVKVGSNRLQVVIALSGGELIYFEVDMTGQLMEVEKHEMSGDVA 585
Query: 482 CLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PD--LNLITKEHLGGEIIPRSVLLCAF 538
CLDI P+ E S+ AVG + D ++R+ SL PD + +++ + + P S+L
Sbjct: 586 CLDIAPVPEGRQRSRFLAVGSY-DNTIRVLSLDPDDCMQILSVQSVSAP--PESLLFLEV 642
Query: 539 EGIS------------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSK 586
+ +L L G L +++M TG+L+D + LG + L + +
Sbjct: 643 QASIGGEDGADHPASLFLNAGLQTGVLFRTVVDMVTGQLSDSRSRFLGLRAPKLFAINVR 702
Query: 587 NTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTI 646
+ S RP + Y L + ++ + + + F+S + + L I TI
Sbjct: 703 GRRAMLCLSSRPWLGYIHQGHFLLTPLSYETLEYAASFSSDQCAEGVVAVAGDALRIFTI 762
Query: 647 DDI-QKLHIRSIPLGEHPRRICHQEQSRTFAIC-SLKNQSCAEESE-------------- 690
+ + + + +IPL PR+ Q + + I S + AEE E
Sbjct: 763 ERLGETFNETAIPLRYTPRKFVLQPKRKLLVIIESDQGAYTAEEREAAKKECFEAAGMGE 822
Query: 691 ------------------------------------MHFVRLLDDQTFEFISTYPLDTFE 714
+ +R+LD ++ L E
Sbjct: 823 NGSANAEKMENGDDDDKDDPLSDEQYGYPKAEADRWVSCIRVLDPRSATTTCLLELQDNE 882
Query: 715 YGCSILSCSFSD-DSNVYYCVGTAYVLPEENEPTKGRILVFI-----VEDGK-LQLIAEK 767
S+ + +F D + VGTA L + P + I FI V+DGK L+L+ +
Sbjct: 883 AAFSVCTVNFHDKEHGTLLAVGTAKGL--QFWPKRSLIAGFIHIYKFVDDGKSLELLHKT 940
Query: 768 ETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY-VQTRGD 826
+ +G +L F G+LLA I ++LY D G + L +C + ++ + T D
Sbjct: 941 QVEGVPLALCQFQGRLLAGIGSVLRLY-----DLGKKRLLRKCENKLFPNSIVSIHTYRD 995
Query: 827 FIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR- 885
I VGD+ +S Y+ +E + A D W++A +D D GA+ N++ VR
Sbjct: 996 RIYVGDIQESFHFCKYRRDENQLYIFADDSVPRWLTASYHVDFDTMAGADKFGNIYFVRL 1055
Query: 886 -----------------KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV 928
K +G + ++E + ++H+G+ VN + SL+
Sbjct: 1056 PQDVSDEIEEDPTGGKIKWEQGKLNGAPNKVEEIVQFHIGDVVNSLQKASLI-------P 1108
Query: 929 GQIPTVIFGTVNGVIGVIASLPHEQYL-FLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKK 987
G +++GTV G +G + + F L+ +LR+ + G +H +RS
Sbjct: 1109 GGGECIMYGTVMGSVGALLPFTSRDDVDFFSHLEMHLRQDHPPLCGRDHMAYRS-----A 1163
Query: 988 TVDAKNFLDGDLIESF----LDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
K+ +DGDL E F LD R DE+ +T E+ K++EE+
Sbjct: 1164 YFPVKDVIDGDLCEQFPTLPLDAQRKIADELDRTPG----EILKKLEEV 1208
>gi|168046759|ref|XP_001775840.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162672847|gb|EDQ59379.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1214
Score = 241 bits (614), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 275/1191 (23%), Positives = 508/1191 (42%), Gaps = 192/1191 (16%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+L V I+G I +L FR G +D++ + ++ + +L+++ E + + + + + G
Sbjct: 52 LLSVEIFGAIRSLAQFRLTGSQKDYIVVGSDSGRIVILEYNKEKN-IFEKVHQETFGKSG 110
Query: 61 -RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
R GQ +DP R +IG L V+ DN +L + + + +
Sbjct: 111 CRRIVPGQYLAVDPKGRAVMIGACEKQKLVYVLNRDNAARLTISSPLEAHKSHTIVYSIT 170
Query: 118 ---YGCAKPTIVVLYQD-------------NKDARHVKTYEVALKDKDFVEGPWSQNNLD 161
G P + D N+ +H+ YE+ L V W++ +D
Sbjct: 171 GVDCGFDNPIFAAIELDYSEADQDSTGQAANEAQKHLTFYELDLGLNHVVR-KWTEP-ID 228
Query: 162 NGADLLIPVP-----PPLCGVLIIGEETIVYCSANA---FKAIPIRPSITKAYGRVDADG 213
NGA++L+ VP P GVL+ E ++Y + N IP R + G +
Sbjct: 229 NGANMLVTVPGGGDGP--SGVLVCAENFVIYKNQNQPDIRAVIPRRQDLPPNRGVLIVSA 286
Query: 214 SR--------YLLGDHAGLLHLLVITHEKE-KVTGLKIELLGETSIASTISYLDNAVVYI 264
+ +LL G + + + ++K+ +VT LKI+ + S + L + ++
Sbjct: 287 ATHKQKALFFFLLQTEYGDIFKVTLDYDKDDQVTELKIKYFDTIPVTSAMCVLKSGFLFA 346
Query: 265 GSSYGDSQLIKLN--------------------------LQPDAKGSYVEVLERYVNLGP 298
S +G+ L + QP + V++ + +L P
Sbjct: 347 ASEFGNHSLYQFQSIGEEPEVESSSSTLVETEEGYQPIFFQPRKLKNLVQI-DDIESLMP 405
Query: 299 IVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSLRSST 357
I+D V ++ + Q+ + G SLRI+R G+ + E A L G+ +W+++
Sbjct: 406 IMDMKVANIFEEETPQIFSLCGRGPRSSLRILRPGLAVTEMAVSPLPGVPSAVWTVKKHA 465
Query: 358 DDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSV 417
+D FD ++VVSF++ T +L++ + +EE GF T +L + L+QV +
Sbjct: 466 NDEFDAYIVVSFVNATLVLSIG--ETVEEVSDSGFLDTTPSLAISLLGDDSLMQVHPSGI 523
Query: 418 RLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTEVKHAQL 476
R + + R NEWK+P ++ N QV++A GG L+Y E+ G L E++ +
Sbjct: 524 RHIRADGR--INEWKTPGKKTIVKVGYNRMQVVIALSGGELIYFEMDMSGQLMEIEKRDM 581
Query: 477 EYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PD--LNLITKEHLGGEIIPRSV 533
+++CL I P+ E S+ AVG + D ++RI SL PD + +++ + + P S+
Sbjct: 582 TGDVACLHIAPVPEGRQRSRFLAVGSY-DSTIRILSLDPDDCMQILSVQAVSSP--PESL 638
Query: 534 LLCAFEGIS------------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLR 581
LL + + +L L +G LL ++M TG+L+D + LG + L
Sbjct: 639 LLLEVQASTGGEDGADHPASVFLNAGLQNGVLLRTEVDMVTGQLSDTRTRFLGLRAPKLF 698
Query: 582 TFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGEL 641
+ + + S RP + Y L + ++ + + + F+S + + L
Sbjct: 699 SARVRGRRAMLCLSSRPWLGYIHQGHFLLTPLSYETLEYAASFSSDQCAEGVVAVAGDAL 758
Query: 642 TIGTIDDI-QKLHIRSIPLGEHPRR-ICHQEQSRTFAICSLKNQSCAEESEMH------- 692
+ TI+ + + + +PL PR+ I H +Q + S EE E +
Sbjct: 759 RVFTIERLGETFNQTVVPLRYTPRKFILHPKQKSLIILESDLGAFSTEEREANKKEAVKA 818
Query: 693 ------------------------------------------FVRLLDDQTFEFISTYPL 710
+R+LD +T L
Sbjct: 819 TGGRDDEKANGEDEEMADGENEDPLPDEQYGYPKAESNKWASCIRVLDPKTSTTTCLLEL 878
Query: 711 DTFEYGCSILSCSFSDDSNV--YYCVGTA---YVLPEENEPTKGRILVF-IVEDGK-LQL 763
E S+ + +F D+ + VGTA +P++ E + G I ++ VE+G+ L+L
Sbjct: 879 QENEAAFSLCAVNFHDNKELGTLIAVGTAKNMQFMPKK-ESSGGFIHIYRFVEEGRILEL 937
Query: 764 IAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGH---ILALY 820
+ + G +L F G+LL + Q +++Y D G R+L +C + I+A++
Sbjct: 938 VHKTPVDGVPTALCQFQGRLLVGVGQVLRIY-----DLGKRKLLRKCENKNFPNTIIAIH 992
Query: 821 VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFN 880
T GD I VGD+ +S + Y+ +E + A D W++A +D D GA+ N
Sbjct: 993 --TYGDRIYVGDIQESFHYVKYRRDENQLYTFADDSCPRWLTASLHIDFDTMAGADKFGN 1050
Query: 881 LFTVRKNSE-----------GATDEERGRL-------EVVGEYHLGEFVNRFRHGSLVMR 922
++ +R + G E+GRL E + ++H+GE V + SL+
Sbjct: 1051 VYVMRLPQDVSEEIEDDPTGGKIKWEQGRLNGAPNKVEEIIQFHVGEVVTSLQKASLI-- 1108
Query: 923 LPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRS 981
G +V++GT+ G +G + E F L+ +LR+ + G +H +RS
Sbjct: 1109 -----PGGGESVLYGTIMGSVGALLPFSSREDVDFFSHLEMHLRQENPPLCGRDHMAFRS 1163
Query: 982 FNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
K+ +DGDL E + L+ +I+ ++ + E+ K++E++
Sbjct: 1164 -----AYFPVKDVIDGDLCEQYPMLTSELQRKIADDLDRTPGEVLKKLEDI 1209
>gi|380490733|emb|CCF35810.1| pre-mRNA-splicing factor rse-1 [Colletotrichum higginsianum]
Length = 1212
Score = 240 bits (612), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 280/1168 (23%), Positives = 493/1168 (42%), Gaps = 166/1168 (14%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
I+G I ++ FR G +D+L +AT+ + ++++ + + R
Sbjct: 64 IFGIIRSMAAFRLAGSNKDYLILATDSGRITIIEYIPAQNRFQRLHLETFGKSGVRRVIP 123
Query: 66 GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YGC 120
G+ DP R LI + L V+ +++ +L + + + VL + + G
Sbjct: 124 GEYLACDPKGRACLIASVEKNKLVYVLNRNSQAELTISSPLEAHKPGVLVLSMVALDVGY 183
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDK------------DFVEGPWSQNNLDNGADLLI 168
A P L + +A T E A + + + V WS++ +D A +L
Sbjct: 184 ANPVFAALEIEYTEADQDPTGEAAREAETQLVYYELDLGLNHVVRKWSES-VDPTASMLF 242
Query: 169 PVP-----PPLCGVLIIGEETIVYCSAN--AFKA-IPIRPSITK--AYGRVDADGSRYLL 218
VP P GVL+ GEE I Y +N AF+ IP R T+ + R G + L
Sbjct: 243 QVPGGQDGP--SGVLVCGEENITYRHSNQEAFRVPIPRRRGATEDPSRKRHAVSGVMHKL 300
Query: 219 GDHAGLLHLLVITHEKE------------------KVTGLKIELLGETSIASTISYLDNA 260
AG L+ T + + +V LKI+ ++S++ L +
Sbjct: 301 KGSAGAFFFLLQTEDGDLFKATLDMVEDTDGNPTGEVKRLKIKYFDTIPVSSSLCILKSG 360
Query: 261 VVYIGSSYGDSQLIKL---------------NLQPDAKGSY------------VEVLERY 293
+Y S +G+ Q + + D K Y + ++E
Sbjct: 361 FLYAASQFGNHQFYQFEKLGDDDDELEFSSDDFPTDPKAGYDAVYFHPRPLENLALVESI 420
Query: 294 VNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWS 352
++ P++D V +L + Q+ T G + R++++G+ +NE + EL GI +W+
Sbjct: 421 DSMNPLLDCKVANLTGEDAPQIYTACGNGARSTFRMLKHGLEVNEIVASELPGIPSAVWT 480
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
L+ + D +D ++V+SF + T +L++ + +EE GF + TL + L+QV
Sbjct: 481 LKLNRGDQYDAYIVLSFTNGTLVLSIG--ETVEEVSDSGFLTSVPTLAAQLLGEDGLIQV 538
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTEV 471
+R + NEW +P S+ AT N QV +A G +VY E+ GDG L E
Sbjct: 539 HPKGIRHIRHGHV---NEWAAPQHRSIVAATTNEHQVAVALSSGEIVYFEMDGDGSLAEY 595
Query: 472 KHAQLEY-EISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEII 529
+ + ++CL + + E S AVG D +VR+ SL P+ L +K
Sbjct: 596 DEKKEMFGTVTCLSLGEVPEGRLRSSFLAVGC-DDSTVRVLSLDPESTLESKSVQALTAP 654
Query: 530 PRSVLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTF 583
P S+ + A + S YL L G L +L+ TGELTD ++ LG +P+ L
Sbjct: 655 PSSLAIIAMDDSSSGGSTLYLHIGLHSGVYLRTVLDEITGELTDTRQKFLGPKPVRLFQV 714
Query: 584 SSKNTTHVFAASDRPTVIYSS--NKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGEL 641
+ + T V S RP + YS + L + +N ++ + F+S + + + L
Sbjct: 715 TVQGRTCVIGLSSRPWLGYSDPITRGFLVTPLNYVDLEWVWNFSSEQCEEGVVGIQGQSL 774
Query: 642 TIGTIDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFV------ 694
I I+++ + +SIPL PRR+ + F N + + +
Sbjct: 775 RIFAIENLGDTITQKSIPLSYTPRRLLKHPEHPMFYTIEADNNTLPPDLRAKLIAEPGVV 834
Query: 695 ----RLLDDQTFEF-----------------------ISTYPLDTFEYGCSILSCSF-SD 726
R+L F + + T LD E S SF S
Sbjct: 835 NGDARILPPDEFGYPKGKGRWASCISVIDPLSEEQRVLQTVDLDNNEAAVSAAIVSFASQ 894
Query: 727 DSNVYYCVGTAY-VLPEENEPTKGRILVF-IVEDG-KLQLIAEKETKGAVYSLNAFNGKL 783
D+ + VGT ++ + ++G I V+ EDG +L+ I + + + +L F G+L
Sbjct: 895 DNESFLIVGTGKDMIVNPRQFSEGYIHVYRFSEDGHELEFIHKTKVEEPPSALLGFQGRL 954
Query: 784 LAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYK 843
LA I Q +++Y LR R+ Q++ L + + T+G I+VGD+ I+ ++YK
Sbjct: 955 LAGIGQTLRIYDLGLR-QMLRKAQADVAPQ---LIVSLSTQGSRIIVGDVQHGITYVVYK 1010
Query: 844 HEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDEERG--- 897
+ D + W++ ++D + +G + N+F VR K S+ A +E G
Sbjct: 1011 PTTNKLIPFVDDTISRWVTCTTMVDYESVVGGDKFGNIFLVRCPEKASQEADEESGGLHL 1070
Query: 898 ------------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGV 945
RL ++G + + SLV VG +++ +NG IGV
Sbjct: 1071 LNTRDYLHGTPHRLSLLGHSYTQDVPTSITKTSLV-------VGGQDVLLWSGINGTIGV 1123
Query: 946 -IASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFL 1004
I + E F + L+ ++R + G +H +RS+ V K +DGDL E +
Sbjct: 1124 FIPFVTREDVDFFQNLEQHMRTEDAPLAGRDHLMYRSY-----YVPVKGVIDGDLCERYT 1178
Query: 1005 DLSRTRMDEISKTMNVSVEELCKRVEEL 1032
L + I+ ++ SV E+ +++ ++
Sbjct: 1179 LLPSEKKQMIAGELDRSVREIERKISDI 1206
>gi|302773427|ref|XP_002970131.1| hypothetical protein SELMODRAFT_171237 [Selaginella moellendorffii]
gi|300162642|gb|EFJ29255.1| hypothetical protein SELMODRAFT_171237 [Selaginella moellendorffii]
Length = 1207
Score = 239 bits (611), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 280/1179 (23%), Positives = 500/1179 (42%), Gaps = 175/1179 (14%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+L V I+G + +L FR G +D+L + ++ K +L+++ + +
Sbjct: 52 LLAVEIFGVVRSLAQFRLTGSHKDYLVVGSDSGKIVILEYNRDRNAFDPVHKETFGKSGC 111
Query: 61 RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL- 117
R GQ +DP R +I L V+ DN +L + + + + +
Sbjct: 112 RRIVPGQFLSVDPKGRAVMIAATEKQKLVYVLNRDNAARLTISSPLEAHKSHTVVYSLVG 171
Query: 118 --YGCAKPTIVVLYQDNKDA-------------RHVKTYEVALKDKDFVEGPWSQNNLDN 162
G P + D DA +H+ YE+ L V WS+ +DN
Sbjct: 172 VDCGFENPMFAAIELDYSDADQDATGQAALEAQKHLTFYELDLGLNHVVR-KWSEP-IDN 229
Query: 163 GADLLIPVPPPL---CGVLIIGEETIVYCS-ANAFKA-IPIRPSITKAYGRVDADGSR-- 215
GA++L+ VP GVL+ + ++Y + + +A IP R + G + S
Sbjct: 230 GANMLVTVPGGADGPSGVLVCADNFVIYKNQSQELRALIPRRADLPADRGVLVVSASTHR 289
Query: 216 ------YLLGDHAGLLHLLVITHEKEK--VTGLKIELLGETSIASTISYLDNAVVYIGSS 267
+LL G + + ++ VT LKI+ + S + L + ++ S
Sbjct: 290 QKSVFFFLLQTEYGDVFKVTLSFNDTDMVVTELKIKYFDTFPVTSAMCVLKSGFLFAASE 349
Query: 268 YGDSQLIK---LNLQPDAKGSYVEV----------------------LERYVNLGPIVDF 302
+G+ L + + PD + S + ++ +L PI+D
Sbjct: 350 FGNHALYQFQGIGDDPDVESSSATLEETEDGFKPIFFQPRKLKNLIQIDEGESLMPIMDM 409
Query: 303 CVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSLRSSTDDPF 361
+ +L + Q+ G +LRI+R G+ ++E A +L G+ +W+++ S D F
Sbjct: 410 KIANLFEEETPQIFVACGRGPRSTLRILRPGLAVSEMAVSQLPGVPSAVWTVKKSVTDEF 469
Query: 362 DTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVS 421
D ++VVSF++ T +L++ + +EE GF T +L + L+QV +R +
Sbjct: 470 DAYIVVSFLNATLVLSIG--ETVEEVSDSGFLDTTPSLQVSLLGDDSLMQVYPNGIRHIR 527
Query: 422 STSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD-GILTEVKHAQLEYEI 480
S R NEWK+P ++ +N QV++ GG L+Y E+ G L E++ ++ ++
Sbjct: 528 SLVR--INEWKTPGKKTIVKVGSNRMQVVIGLSGGELIYFEMDPTGQLMEIEKREMSGDV 585
Query: 481 SCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PD--LNLITKEHLGGEIIPRSVLL-- 535
+CL I P+ E S+ AVG + D ++RI SL P+ +++++ + + P S+LL
Sbjct: 586 ACLAIGPVPEGRQRSRFLAVGSY-DNTIRILSLDPNDCMHILSVQMVASP--PESLLLLE 642
Query: 536 ----------CAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSS 585
+ +L L +G LL ++M TG+L+ + LG + L S
Sbjct: 643 VQASTGGEDGAVYPASVFLNAGLQNGVLLRTEVDMVTGQLSGTRTRFLGLRAPKLFATSV 702
Query: 586 KNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGT 645
+ + S RP + Y L + ++ + + + F+S + + L + T
Sbjct: 703 RGRQALLCLSSRPWLGYIHQGHFLLTPLSYETLEYAASFSSDQCAEGVVAVAGEALRVFT 762
Query: 646 IDDI-QKLHIRSIPLGEHPRR-ICH---------------------QEQSRTFA-----I 677
I+ + + + SI L PR+ I H + SR+ A +
Sbjct: 763 IERLGETFNQTSIQLRYTPRKFILHPKRKFLVLIETDQGTFTTEEREHMSRSAAAQDNGM 822
Query: 678 CSLKNQSCAEESEM----------------HFVRLLDDQTFEFISTYPLDTFEYGCSILS 721
+ NQ +EE M +R+LD + L E SI
Sbjct: 823 MEIDNQEKSEEEPMPDEQYGYPKTDSRRWVSCIRVLDPKAAATTCLLELQENEAAFSICC 882
Query: 722 CSFSDDSNV--YYCVGTAYVLPEENEPTKGRILVFI-----VEDGK-LQLIAEKETKGAV 773
+F D+ N+ VGTA L E P + + FI VEDG+ L+L+ + G
Sbjct: 883 VNFHDNKNLGTVLAVGTAKDL--EWWPKRRSMGGFIHIYRFVEDGRSLELVHKTPIDGVP 940
Query: 774 YSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY-VQTRGDFIVVGD 832
+L F G+LLA I Q +++Y D G R+L +C + + + + GD I VGD
Sbjct: 941 TALCQFQGRLLAGIGQILRIY-----DLGKRKLLRKCENKNFPNTITSIHSYGDRIYVGD 995
Query: 833 LMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSE--- 889
+ +S + Y+ +E + A D + W++A +D D + NLF VR +
Sbjct: 996 IQESFHYVKYRRDENQLYAFADDSSPRWLTASLHIDFDTMAAGDKFGNLFFVRLPQDLSE 1055
Query: 890 --------GATDEERGRL-------EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTV 934
G E+GRL E + ++H+GE V + SL+ G +V
Sbjct: 1056 EIEDDPTGGKIKWEQGRLNGAPNKVEEIIQFHVGEVVTCMQKASLI-------PGGGESV 1108
Query: 935 IFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKN 993
I+GTV G +G + E F L+ ++R+ + G +H +RS K+
Sbjct: 1109 IYGTVMGSVGALLPFSSREDVDFFSHLEMHMRQEHPPLCGRDHMAFRS-----AYFPVKD 1163
Query: 994 FLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
+DGDL E + L +I++ ++ + E+ K++E++
Sbjct: 1164 VIDGDLCEQYPTLPPDLQRKIAEELDRTPGEVMKKLEDI 1202
>gi|356536504|ref|XP_003536777.1| PREDICTED: splicing factor 3B subunit 3-like [Glycine max]
Length = 1214
Score = 239 bits (611), Expect = 5e-60, Method: Compositional matrix adjust.
Identities = 281/1191 (23%), Positives = 497/1191 (41%), Gaps = 192/1191 (16%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+L V I+G I +L FR G +D++ + ++ + +L+++ E + + + + + G
Sbjct: 52 ILSVEIFGAIRSLAQFRLMGAQKDYIVVGSDSGRIIILEYNKEKN-VFDKIHQETFGKSG 110
Query: 61 -RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
R GQ IDP R +IG L V+ D +L + + + L
Sbjct: 111 CRRIVPGQYLAIDPKGRAVMIGACEKQKLVYVLNRDTAARLTISSPLEAHKSHTLVYSIC 170
Query: 118 ---YGCAKPTIVVLYQDNKDA-------------RHVKTYEVALKDKDFVEGPWSQNNLD 161
G P + D +A +H+ YE+ L + V WS+ +D
Sbjct: 171 GVDCGFENPIFAAIELDYSEADQDSTGQAASEAQKHLTFYELDL-GLNHVSRKWSEQ-VD 228
Query: 162 NGADLLIPVP---PPLCGVLIIGEETIVYCS---ANAFKAIPIRPSITKAYGRVDADGSR 215
NGA+LL+ VP GVL+ E ++Y + IP R + G + +
Sbjct: 229 NGANLLVTVPGGGDGPSGVLVCAENFVIYKNQGHPEVRAVIPRRADLPAERGVLIVSAAM 288
Query: 216 --------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSS 267
+LL G + + + H ++V+ LKI+ + +++ L + ++ S
Sbjct: 289 HKLKNMFFFLLQTEYGDIFKVTLEHNNDRVSELKIKYFDTIPVTASMCVLKSGFLFAASE 348
Query: 268 YGDSQLIKLN--------------------------LQPDAKGSYVEVLERYVNLGPIVD 301
+G+ L + QP + V + ++ +L PI+D
Sbjct: 349 FGNHALYQFKSIGDDDDVEASSATLMETEEGFQPVFFQPRRLKNLVRI-DQVESLMPIMD 407
Query: 302 FCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSLRSSTDDP 360
V +L + Q+ T G SLRI+R G+ ++E A +L GI +W+++ + D
Sbjct: 408 MKVSNLFEEETPQIYTLCGRGPRSSLRILRTGLAVSEMAVSKLPGIPSAVWTVKKNVIDE 467
Query: 361 FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLV 420
FD ++VVSF + T +L++ + +EE GF T +L + L+QV +R +
Sbjct: 468 FDAYIVVSFTNATLVLSIG--ETVEEVSDSGFLDTTPSLAVSLIGDDSLMQVHPNGIRHI 525
Query: 421 SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTEVKHAQLEYE 479
R NEW++P S++ +N QV++A GG L+Y E+ G L EV+ ++ +
Sbjct: 526 REDGR--INEWRTPGKRSISKVGSNTLQVVIALSGGELIYFEMDVTGQLMEVEKHEMSGD 583
Query: 480 ISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPRSVLLCAF 538
++CLDI P+ E S+ AVG + D ++RI SL PD + P S+L
Sbjct: 584 VACLDIAPVPEGRQRSRFLAVGSY-DKTIRILSLDPDDCMQALSVQSVSSAPESLLFLEV 642
Query: 539 EGIS------------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSK 586
+ +L L +G + +++M TG+L+D + LG + L +
Sbjct: 643 QASVGGEDGADHPASLFLNAGLQNGVMFRTVVDMVTGQLSDSRSRFLGLRAPKLFPIIVR 702
Query: 587 NTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTI 646
+ S RP + Y L + ++ + + + F+S + + L I TI
Sbjct: 703 GKRAMLCLSSRPWLGYIHQGHFLLTPLSYETLEYAASFSSDQCVEGVVAVAGEALRIFTI 762
Query: 647 DDI-QKLHIRSIPLGEHPRRICHQEQSRTFA-ICSLKNQSCAEESEM------------- 691
+ + + + IPL PR+ Q + + I S + AEE E
Sbjct: 763 ERLGETFNETVIPLRYTPRKFVLQPKRKLLVMIESDQGALTAEEREAARKECFEAAQAGE 822
Query: 692 -------------------------HF-------------VRLLDDQTFEFISTYPLDTF 713
H+ +R+LD +T L
Sbjct: 823 NGTGSADQMENGGDDEDKDDPLSDEHYGYPKAESDKWASCIRVLDPRTSNTTCLLELQEN 882
Query: 714 EYGCSILSCSFSD-DSNVYYCVGTA---YVLPEENEPTKGRILVF-IVEDGK-LQLIAEK 767
E SI + +F D + VGTA LP+ T G I ++ VEDG+ L+L+ +
Sbjct: 883 EAAFSICTVNFHDKEYGTLLAVGTAKGLQFLPKRTV-TAGFIHIYRFVEDGRSLELLHKT 941
Query: 768 ETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHH---GHILALYVQTR 824
+ +G +L F G+LLA I ++LY D G + L +C + I++++
Sbjct: 942 QVEGVPLALCQFQGRLLAGIGPVLRLY-----DLGKKRLLRKCENKLFPNTIISIHAYR- 995
Query: 825 GDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTV 884
D I VGD+ +S Y+ +E + A D W++A +D D G + N++ V
Sbjct: 996 -DRIYVGDVQESFHYCKYRRDENQLYIFADDCVPRWLTASYHIDFDTMAGTDKFGNIYFV 1054
Query: 885 R------------------KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDS 926
R K +G + ++E + ++H+G+ V + SL+
Sbjct: 1055 RLPQDVSDEIEEDPTGGRIKWEQGKLNGAPNKVEEIVQFHVGDVVTCLQKASLI------ 1108
Query: 927 DVGQIPTVIFGTVNGVIGVI-ASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNE 985
G ++FGTV G +G + A + F L+ ++R+ + G +H +RS
Sbjct: 1109 -PGGGECIVFGTVMGSVGALHAFTSRDDVDFFSHLEMHMRQDHPPLCGRDHMAYRS---- 1163
Query: 986 KKTVDAKNFLDGDLIESF----LDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
K+ +DGDL E + +DL R DE+ +T E+ K++EE+
Sbjct: 1164 -AYFPVKDVIDGDLCEQYPTLPMDLQRKIADELDRTPG----EILKKLEEV 1209
>gi|302807210|ref|XP_002985318.1| hypothetical protein SELMODRAFT_181612 [Selaginella moellendorffii]
gi|300147146|gb|EFJ13812.1| hypothetical protein SELMODRAFT_181612 [Selaginella moellendorffii]
Length = 1207
Score = 239 bits (610), Expect = 6e-60, Method: Compositional matrix adjust.
Identities = 282/1179 (23%), Positives = 501/1179 (42%), Gaps = 175/1179 (14%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+L V I+G + +L FR G +D+L + ++ K +L+++ + +
Sbjct: 52 LLAVEIFGVVRSLAQFRLTGSHKDYLVVGSDSGKIVILEYNRDRNAFDPVHKETFGKSGC 111
Query: 61 RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL- 117
R GQ +DP R +I L V+ DN +L + + + + +
Sbjct: 112 RRIVPGQFLSVDPKGRAVMIAATEKQKLVYVLNRDNAARLTISSPLEAHKSHTVVYSLVG 171
Query: 118 --YGCAKPTIVVLYQDNKDA-------------RHVKTYEVALKDKDFVEGPWSQNNLDN 162
G P + D DA +H+ YE+ L V WS+ +DN
Sbjct: 172 VDCGFENPMFAAIELDYSDADQDATGQAALEAQKHLTFYELDLGLNHVVR-KWSEP-IDN 229
Query: 163 GADLLIPVPPPL---CGVLIIGEETIVYCS-ANAFKA-IPIRPSITKAYGR--VDADGSR 215
GA++L+ VP GVL+ + ++Y + + +A IP R + G V A R
Sbjct: 230 GANMLVTVPGGADGPSGVLVCADNFVIYKNQSQELRALIPRRADLPADRGVLVVSAATHR 289
Query: 216 ------YLLGDHAGLLHLLVITHEKEK--VTGLKIELLGETSIASTISYLDNAVVYIGSS 267
+LL G + + ++ VT LKI+ + S + L + ++ S
Sbjct: 290 QKSVFFFLLQTEYGDVFKVTLSFNDTDMVVTELKIKYFDTFPVTSAMCVLKSGFLFAASE 349
Query: 268 YGDSQLIK---LNLQPDAKGSYVEV----------------------LERYVNLGPIVDF 302
+G+ L + + PD + S + ++ +L PI+D
Sbjct: 350 FGNHALYQFQGIGDDPDVESSSATLEETEDGFKPIFFQPRKLKNLIQIDEGESLMPIMDM 409
Query: 303 CVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSLRSSTDDPF 361
+ +L + Q+ G +LRI+R G+ ++E A +L G+ +W+++ S D F
Sbjct: 410 KIANLFEEETPQIFVACGRGPRSTLRILRPGLAVSEMAVSQLPGVPSAVWTVKKSVTDEF 469
Query: 362 DTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVS 421
D ++VVSF++ T +L++ + +EE GF T +L + L+QV +R +
Sbjct: 470 DAYIVVSFLNATLVLSIG--ETVEEVSDSGFLDTTPSLQVSLLGDDSLMQVYPNGIRHIR 527
Query: 422 STSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD-GILTEVKHAQLEYEI 480
S R NEWK+P ++ +N QV++ GG L+Y E+ G L E++ ++ ++
Sbjct: 528 SLVR--INEWKTPGKKTIVKVGSNRMQVVIGLSGGELIYFEMDPTGQLMEIEKREMSGDV 585
Query: 481 SCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PD--LNLITKEHLGGEIIPRSVLL-- 535
+CL I P+ E S+ AVG + D ++RI SL P+ +++++ + + P S+LL
Sbjct: 586 ACLAIGPVPEGRQRSRFLAVGSY-DNTIRILSLDPNDCMHILSVQMVASP--PESLLLLE 642
Query: 536 ----------CAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSS 585
+ +L L +G LL ++M TG+L+ + LG + L S
Sbjct: 643 VQASTGGEDGAVYPASVFLNAGLQNGVLLRTEVDMVTGQLSGTRTRFLGLRAPKLFATSV 702
Query: 586 KNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGT 645
+ + S RP + Y L + ++ + + + F+S + + L + T
Sbjct: 703 RGRQALLCLSSRPWLGYIHQGHFLLTPLSYETLEYAASFSSDQCAEGVVAVAGEALRVFT 762
Query: 646 IDDI-QKLHIRSIPLGEHPRR-ICH---------------------QEQSRTFA-----I 677
I+ + + + SI L PR+ I H + SR+ A +
Sbjct: 763 IERLGETFNQTSIQLRYTPRKFILHPKRKFLVLIETDQGTFTTEEREHMSRSAAAQDNGM 822
Query: 678 CSLKNQSCAEESEM----------------HFVRLLDDQTFEFISTYPLDTFEYGCSILS 721
+ NQ +EE M +R+LD + L E SI
Sbjct: 823 MEIDNQEKSEEEPMPDEQYGYPKTDSRRWVSCIRVLDPKAAATTCLLELQENEAAFSICC 882
Query: 722 CSFSDDSNV--YYCVGTAYVLPEENEPTKGRILVFI-----VEDGK-LQLIAEKETKGAV 773
+F D+ N+ VGTA L E P + + FI VEDG+ L+L+ + G
Sbjct: 883 VNFHDNKNLGTVLAVGTAKDL--EWWPKRRSMGGFIHIYRFVEDGRSLELVHKTPIDGVP 940
Query: 774 YSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY-VQTRGDFIVVGD 832
+L F G+LLA I Q +++Y D G R+L +C + + + + GD I VGD
Sbjct: 941 TALCQFQGRLLAGIGQILRIY-----DLGKRKLLRKCENKNFPNTITSIHSYGDRIYVGD 995
Query: 833 LMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSE--- 889
+ +S + Y+ +E + A D + W++A +D D + NLF VR +
Sbjct: 996 IQESFHYVKYRRDENQLYAFADDSSPRWLTASLHIDFDTMAAGDKFGNLFFVRLPQDLSE 1055
Query: 890 --------GATDEERGRL-------EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTV 934
G E+GRL E + ++H+GE V + SL+ G +V
Sbjct: 1056 EIEDDPTGGKIKWEQGRLNGAPNKVEEIIQFHVGEVVTCMQKASLI-------PGGGESV 1108
Query: 935 IFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKN 993
I+GTV G +G + E F L+ ++R+ + G +H +RS K+
Sbjct: 1109 IYGTVMGSVGALLPFSSREDVDFFSHLEMHMRQEHPPLCGRDHMAFRS-----AYFPVKD 1163
Query: 994 FLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
+DGDL E + L +I++ ++ + E+ K++E++
Sbjct: 1164 VIDGDLCEQYPTLPPDLQRKIAEELDRTPGEVMKKLEDI 1202
>gi|74212634|dbj|BAE31055.1| unnamed protein product [Mus musculus]
Length = 261
Score = 238 bits (608), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 114/204 (55%), Positives = 150/204 (73%), Gaps = 5/204 (2%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
+V +YG+IA +ELFRP GE++D LFI T +Y C+L++ ES ++ITRA G+V DRIG
Sbjct: 54 EVGMYGKIAVMELFRPKGESKDLLFILTAKYNACILEYKQSGESIDIITRAHGNVQDRIG 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGC 120
RP++ G IGIIDP+CR+IGL LYDGLFKVIP D + +AFNIRLEEL V+D+KFLYGC
Sbjct: 114 RPSETGIIGIIDPECRMIGLRLYDGLFKVIPLDRDNKELKAFNIRLEELHVIDVKFLYGC 173
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLII 180
PTI +YQD + RHVKTYEV+L++K+F +GPW Q N++ A ++I VP P G +II
Sbjct: 174 QAPTICFVYQDPQ-GRHVKTYEVSLREKEFNKGPWKQENVEAEASMVIAVPEPFGGAIII 232
Query: 181 GEETIVYCSANAFKAIPIRPSITK 204
G+E+I Y N K + I P I K
Sbjct: 233 GQESITY--HNGDKYLAIAPPIIK 254
>gi|145348011|ref|XP_001418451.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144578680|gb|ABO96744.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 1196
Score = 238 bits (607), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 282/1169 (24%), Positives = 482/1169 (41%), Gaps = 173/1169 (14%)
Query: 4 VPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPT 63
V +G ++ R G +D+L + ++ + VLQ+D E + + R
Sbjct: 56 VEAFGIARSIATLRLAGANRDYLVLGSDSGRVVVLQFDKEKNAFVKVHQETYGKTGCRRI 115
Query: 64 DNGQIGIIDPDCRLIGLHLYDG--LFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGCA 121
GQ DP R I L + L V+ DN+ L + LE + I F + CA
Sbjct: 116 VPGQFLCADPKGRAICLGAVEKNKLVYVMNRDNEANL--TISSPLEANKSHTIVF-HMCA 172
Query: 122 ------KPTIVVLYQDNKDA-------------RHVKTYEVALKDKDFVEGPWSQNNLDN 162
P + D DA +H+ YE+ L V WS+ +DN
Sbjct: 173 LDCGLDNPIFAAIELDYGDADADPSGEAVAEAQKHLTYYELDLGLNHVVR-KWSEP-IDN 230
Query: 163 GADLLIPVP-----PPLCGVLIIGEETIVYCSAN---AFKAIPIRPSITKAYGRVDADGS 214
GA+ LIPVP P GVL+ E ++Y N IP R S+ G + +
Sbjct: 231 GANHLIPVPGGSDGP--GGVLVCCENFVIYRHQNHDEVRAVIPRRNSLAADRGVLIVSSA 288
Query: 215 R--------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGS 266
+L G ++ L I + E V+ +K++ ++ L ++ S
Sbjct: 289 SHRSKNSFFFLAQSEYGDIYKLTIDYSGETVSEVKVKYFDTIPPCVSLCVLKTGFLFAAS 348
Query: 267 SYGDSQLIKL------NLQPDAKGSYVEVLERYV-------------------NLGPIVD 301
+G+ L + + + S +E E Y +L P++D
Sbjct: 349 EFGNHALYQFAGIGDDDAVESSSASLMETEEGYEPVFFDPRPLTNLYTIDTIDSLCPVLD 408
Query: 302 FCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTDDP 360
+L + Q+ T G SLRI+R GI +NE A L G +++++ S+ +
Sbjct: 409 MQAHNLTEEEMPQLYTLCGTGARSSLRILRQGIAMNELAMSPLPGQPNAIFTVKKSSSEE 468
Query: 361 FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLV 420
+D++++VSF++ T +L++ D + E G T TL L+Q+ G +R +
Sbjct: 469 YDSYIIVSFLNATLVLSIG--DTVSEVNDSGILGTTTTLQACLMSDESLLQIHPGGLRHI 526
Query: 421 SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD-GILTEVKHAQLEYE 479
+ R NEW++P ++ T N+ QV++A GG ++Y E+ G L EV+ + +
Sbjct: 527 KADKR--INEWRTPGRKQISKCTCNSKQVVIALTGGEVIYFELDSAGQLIEVEKLETSGD 584
Query: 480 ISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PD--LNLITKEHLGGEIIPRSVLLC 536
I+CLDI P+ E ++ A+G + D +VR+ SL PD L + + L G P S+L+
Sbjct: 585 IACLDIGPVPEGALRNRFLAMGSY-DGTVRVMSLNPDDCLQTLAVQALKGSA-PSSLLVL 642
Query: 537 AFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTH 590
G L + +G L+ ++ +G+L+D + LG + L S +
Sbjct: 643 QTAGTESTQGGLLLNVGMANGVLIRATIDQVSGQLSDMRMRFLGARAPKLVRTSVRGAPA 702
Query: 591 VFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDI- 649
+ A S RP + YS + + ++ + +C FNS P+ + L I +I+ +
Sbjct: 703 LVALSSRPWLGYSEKGTFVLAPISYVPLEEVCSFNSEQCPEGVVAISNQTLRIASIERLG 762
Query: 650 QKLHIRSIPLGEHPRRICHQEQSRTFAI-----CSLK------------------NQSCA 686
+ + ++ L PR + ++ A+ C++ N
Sbjct: 763 ENFNQTTVKLRYTPRAMSANPDTKMVALIESDQCTVPVGEREGPEATPADEAPETNDEDE 822
Query: 687 EESEM-----------------HFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSN 729
EE++M VR++D + + L E S+ + +
Sbjct: 823 EEAKMLPVEQFGAPKSSPGTWAACVRIVDPKEAKSTFVLELHKSEAALSLCHVFLTGPNE 882
Query: 730 VYYCVGTAYVLPEENEPTKGRI--LVFIVEDGK-LQLIAEKETKGAVYSLNAFNGKLLAA 786
+ VGTA L G L DG+ L L+ T G V +L + G LLA
Sbjct: 883 LLLAVGTAVNLTFAPRNCDGGFIHLYRYGNDGRTLNLVHSTPTDGPVGALCGYKGHLLAG 942
Query: 787 INQKIQLY----KWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIY 842
+N +++Y K +LR R + I L+ GD I VGD+ +SI + Y
Sbjct: 943 VNNSLRIYDYGKKKLLRKVENRNFPN------FITTLHAA--GDRIYVGDVQESIHYVKY 994
Query: 843 KHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR----------------K 886
K +EG+I A D +++A LD D GA+ N+F R K
Sbjct: 995 KADEGSIYIFADDTKPRYITATLPLDYDTLAGADKFGNIFVNRLPKDVSEDMDDDPTGGK 1054
Query: 887 N--SEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIG 944
N S+G + + E + ++GE V G+L G I +++GT G IG
Sbjct: 1055 NIYSQGVLNGAPNKSETSAQTYIGETVCALTKGAL-------QPGGIEIIMYGTFMGGIG 1107
Query: 945 VIASLPHEQYL-FLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESF 1003
+ + F L+ ++R+ + G +H +RS+ KN +DGDL E F
Sbjct: 1108 CLLPFSSRSEIEFFTHLEMHMRQEAPSIVGRDHMAFRSY-----YAPVKNVIDGDLCEQF 1162
Query: 1004 LDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
L I++ M+ + E+ K++E++
Sbjct: 1163 GALPADVQRRIAEEMDRTPGEILKKLEQV 1191
>gi|224109600|ref|XP_002315251.1| predicted protein [Populus trichocarpa]
gi|222864291|gb|EEF01422.1| predicted protein [Populus trichocarpa]
Length = 1213
Score = 238 bits (606), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 273/1190 (22%), Positives = 496/1190 (41%), Gaps = 191/1190 (16%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+L V I+G I +L FR G +D++ + ++ + +L+++ E + L
Sbjct: 52 VLSVEIFGAIRSLAQFRLTGAQKDYIVVGSDSGRIVILEYNKERNVLDKIHQETFGKSGC 111
Query: 61 RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL- 117
R GQ +DP R +IG L V+ D +L + + + +
Sbjct: 112 RRIVPGQYLAVDPKGRAVMIGACEKQKLVYVLNRDTVARLTISSPLEAHKSHTICYSVCG 171
Query: 118 --YGCAKPTIVVLYQDNKDARHVKTYEVALKDK------------DFVEGPWSQNNLDNG 163
G P + D +A T + A + + + V WS+ +DNG
Sbjct: 172 VDCGFDNPIFAAIELDYSEADQDSTGQSASEAQKNLTFYELDLGLNHVSRKWSEQ-VDNG 230
Query: 164 ADLLIPVP---PPLCGVLIIGEETIVYCS---ANAFKAIPIRPSITKAYGRVDADGSR-- 215
A++L+ VP GVL+ E ++Y + + IP R + G + +
Sbjct: 231 ANMLVTVPGGGDGPSGVLVCVENFVIYKNQGHPDVRAVIPRRADLPAERGVLIVSAATHK 290
Query: 216 ------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYG 269
+LL G + +++ HE +KV LKI+ + S++ L + ++ S +G
Sbjct: 291 QKSMFFFLLQTEYGDIFKVMLDHENDKVKELKIKYFDTIPVTSSMCVLKSGFLFAASEFG 350
Query: 270 DSQLIKLNL---QPDAKGSYVEVLE----------------------RYVNLGPIVDFCV 304
+ L + + D + S ++E + +L PI+D V
Sbjct: 351 NHALYQFQAIGEEEDVEASSATLMETEEGFQPVFFQPRGLKNLVRIDQVESLMPIMDMKV 410
Query: 305 VDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSLRSSTDDPFDT 363
+L + Q+ + G SLRI+R G+ I+E A +L G+ +W+++ + D FD
Sbjct: 411 ANLFDEETPQIFSLCGRGPRSSLRILRPGLAISEMAVSQLPGVPSAVWTVKKNIYDEFDA 470
Query: 364 FLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSST 423
++VVSF + T +L++ + +EE GF T +L + L+Q+ +R +
Sbjct: 471 YIVVSFNNATLVLSIG--ETVEEVSDSGFLDTTPSLAVSLIGDDSLMQIHPNGIRHIRED 528
Query: 424 SRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTEVKHAQLEYEISC 482
R NEW++P ++ +N QV++A GG L+Y E+ G L EV+ ++ +++C
Sbjct: 529 GR--INEWRTPAKRTIVKVGSNRLQVVIALSGGELIYFEVDMTGQLMEVEKHEMSGDVAC 586
Query: 483 LDINPIGENPSYSQIAAVGMWTDISVRIFSL-PD--LNLITKEHLGGEIIPRSVLLCAFE 539
LDI P+ E S+ AVG + D ++R+ SL PD + +++ + + P S+L +
Sbjct: 587 LDIAPVPEGRQRSRFLAVGSY-DNTIRVLSLDPDDCMQILSVQSVSAP--PESLLFLEVQ 643
Query: 540 GIS------------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKN 587
+L L G L +++M TG+L+D + LG + L + + +
Sbjct: 644 ASIGGEDGADHPASLFLNAGLQTGVLFRTVVDMVTGQLSDSRSRFLGLRAPKLFSINVRG 703
Query: 588 TTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTID 647
+ S RP + Y L + ++ + + + F+S + + L I TI+
Sbjct: 704 RRAMLCLSSRPWLGYIHQGHFLLTPLSYETLEYAASFSSDQCAEGVVSVAGDALRIFTIE 763
Query: 648 DI-QKLHIRSIPLGEHPRRICHQEQSRTFAIC------------SLKNQSCAEESEM--- 691
+ + + +IPL PR+ Q + + I + C E S M
Sbjct: 764 RLGETFNETAIPLRYTPRKFVLQPKRKLLVIIESDQGAYTAEEREAAKKECFEASGMGEN 823
Query: 692 ------------------------------------HFVRLLDDQTFEFISTYPLDTFEY 715
+R+LD ++ L E
Sbjct: 824 GSASAEQMENGDDDDKDDPLSDEQYGYPKAESDKWVSCIRVLDPRSAATTCLLELQDNEA 883
Query: 716 GCSILSCSFSD-DSNVYYCVGTAYVLPEENEPTKGRILVFI-----VEDGK-LQLIAEKE 768
S+ + +F D + VGTA L + P + + FI V+DGK L+L+ + +
Sbjct: 884 AFSLCTVNFHDKEHGTLLAVGTAKGL--QFWPKRSLVTGFIHIYKFVDDGKSLELLHKTQ 941
Query: 769 TKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHH---GHILALYVQTRG 825
+G +L F G+LLA I ++LY D G + L +C + I++++ T
Sbjct: 942 VEGVPLALCQFQGRLLAGIGSVLRLY-----DLGKKRLLRKCENKLFPNTIVSIH--TYR 994
Query: 826 DFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR 885
D I VGD+ +S Y+ +E + A D W+++ +D D GA+ N++ R
Sbjct: 995 DRIYVGDIQESFHFCKYRRDENQLYIFADDSVPRWLTSSYHVDFDSMAGADKFGNIYFAR 1054
Query: 886 ------------------KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSD 927
K +G + ++E + ++H+G+ VN + SL+
Sbjct: 1055 LPQDVSDEIEEDPTGGKIKWEQGKLNGAPNKVEEIVQFHIGDVVNSLQKASLI------- 1107
Query: 928 VGQIPTVIFGTVNGVIGVIASLPHEQYL-FLEKLQTNLRKVIKGVGGLNHEQWRSFNNEK 986
G +I+GTV G +G + + F L+ +LR+ + G +H +RS
Sbjct: 1108 PGGGECIIYGTVMGSVGALLPFTSRDDVDFFSHLEMHLRQDHPPLCGRDHMSYRS----- 1162
Query: 987 KTVDAKNFLDGDLIESF----LDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
K+ +DGDL E F LD R DE+ +T E+ K++EE+
Sbjct: 1163 AYFPVKDVIDGDLCEQFPTLPLDAQRKIADELDRTPG----EILKKLEEV 1208
>gi|348667612|gb|EGZ07437.1| hypothetical protein PHYSODRAFT_565381 [Phytophthora sojae]
Length = 1197
Score = 235 bits (600), Expect = 9e-59, Method: Compositional matrix adjust.
Identities = 283/1177 (24%), Positives = 481/1177 (40%), Gaps = 181/1177 (15%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
++ ++G + L+ FR G +D+L + ++ K VL+ + S + R
Sbjct: 52 VISTQVFGVVRALQPFRLTGGERDYLVVGSDSGKIVVLEVNPSSGQFEARQSETYGKTGC 111
Query: 61 RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLK-----EAFNIRLEELQV-- 111
R GQ DP R L+G L V+ D +L EA L V
Sbjct: 112 RRITPGQYLAADPKGRALLVGAVEKQQLVYVMNRDASSRLTISSPLEAHRSNAIHLGVVG 171
Query: 112 LDIKF---LYGC---------AKPTIVVLYQDNKDA-RHVKTYEVALKDKDFVEGPWSQN 158
LD+ F ++ C A PT Q +DA + + YE+ L + V WS+
Sbjct: 172 LDVGFENPIFACLELDYAEADADPT----GQAARDAVKSLVYYELDL-GLNHVTRRWSEQ 226
Query: 159 NLDNGADLLIPVPPPLCG---VLIIGEETIVY---------CSANAFKAIPIRPSITKAY 206
+ A++L+ VP G VL++GE T+ Y C+ + P I A
Sbjct: 227 VV-RSANMLVAVPGGGDGPGGVLVLGENTVQYKNEGHPELTCAIPRRQGEPRDIVIVSAA 285
Query: 207 GRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGS 266
D LL G L+ + + + V +KI+ +AS++ +++ S
Sbjct: 286 THKQRDLFFVLLQSELGDLYKISLDYSGNAVEEIKIQFFDTVPVASSMCITKTGLLFCAS 345
Query: 267 SYGDSQLIK----------------------LNLQPDAKGSYVEVLERYVNLGPIVDFCV 304
+ + L + L+ P K + +++ +L P+ V
Sbjct: 346 EFSNHYLFQFLSIGEGDDTAKCSSLAMDPTELSTFPLRKLTNLQLASSMPSLSPVTQLLV 405
Query: 305 VDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSLRSSTDDPFDT 363
DL + Q+ G SLR++R+G+ I E A+ L G+ K +W L+ S DP+D
Sbjct: 406 DDLANEQTPQMYALCGNSNRSSLRVLRHGLPITEMAASALPGVAKAVWCLKESYADPYDK 465
Query: 364 FLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSST 423
++VVSF E L + + + +EE GF +L + +Q+ + +R V
Sbjct: 466 YIVVSF--EDATLVLEVGETVEEVTQSGFLRDHGSLLVALLEDDSKLQIHANGIRHVPKF 523
Query: 424 SRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD-GILTEVKHAQLEYEISC 482
+ EWK+P + AN+ QV+++ GG ++Y E+G G L E L +E+
Sbjct: 524 --QPVTEWKAPGKKVIEHCAANSRQVVISLAGGEIIYFELGQSGELAEKGKLDLGFEVCS 581
Query: 483 LDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPRSVLLCAFEGI 541
LD+ + E Q AVG W D +VRI SL P+ K L P ++ L +
Sbjct: 582 LDLGEVPEGRQRFQFMAVGSW-DNTVRILSLDPNDLFRQKSTLALTSHPHTLCLAQLQSE 640
Query: 542 S------------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTT 589
+L L +G L L++ T LTD + LGT P+ L + +
Sbjct: 641 PSTPDSEHSSQSLFLSIGLDNGVLQQSLIDPITANLTDSRSRFLGTNPVKLFRVAVEGKR 700
Query: 590 HVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSL-AIAKEGELTIGTIDD 648
+ A S R + Y + + ++ + +S+ FNS P + A+ EG + I T+D
Sbjct: 701 SILALSSRSWISYFHQTRRHLTPLSCELLSYASSFNSEQCPGGIVALTNEG-MKILTVDQ 759
Query: 649 I------QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQ-----------------SC 685
+ QK ++R P + + H R I S N+ S
Sbjct: 760 LGDTFNQQKCNLRYTPR----KAVVHAASRRLVVIESDHNEYGAAYKRQHGLRIPDIRSA 815
Query: 686 AEESEMH------------------------FVRLLDDQTFEFISTYPLDTFEYGCSILS 721
AE+ + + VR++D + + + LD E SI S
Sbjct: 816 AEQDDEYEDEINEALLFPRGPLPAEKDKWASCVRIVDPVSCQTVVCEELDVDERARSIAS 875
Query: 722 CSFSD-DSNVYYCVGTA--YVLPEENEPTKGRILVF-IVEDGKLQLIAEKETKGAVYSLN 777
C F D + VGT L + P G + V+ +VE +L L+ E +++
Sbjct: 876 CVFHDRGGEAFIIVGTVKKMQLHPQKAPAGGYLRVYRVVEGMQLVLVHTTEIDDIPHAMC 935
Query: 778 AFNGKLLAAINQKIQLY----KWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDL 833
F G+LL ++ + +++Y K MLR R S + + ++ GD I D+
Sbjct: 936 EFQGRLLVSVGRALRIYDLGKKKMLRKCENRNFPS--------ILVELKAAGDRIYASDM 987
Query: 834 MKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSE---- 889
+S + YK +E + A D ++++ +LD D GA+ N+F R SE
Sbjct: 988 HESFHFVKYKKDENQLVIFADDCVPRFITSSVLLDYDTLCGADKFGNVFVSRLPSEVSDE 1047
Query: 890 -------------GATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIF 936
G + +LE V ++H+G+ V SLV G I +I+
Sbjct: 1048 IDNPTANRILWDSGLLNGAPNKLEQVAQFHVGDVVTSMVRTSLVP-------GGIEAIIY 1100
Query: 937 GTVNGVIGVIASLPHEQYL-FLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFL 995
T+ G IG + Q + F L+ +R+ + G +H +RS+ + KN
Sbjct: 1101 ATIMGRIGALIPFTSRQDVDFYTHLEMYMRQEQPPLCGRDHLSYRSY-----YIPVKNVT 1155
Query: 996 DGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
DGDL E F LS + +++ ++ + E+ K++E++
Sbjct: 1156 DGDLCEQFSSLSVEKQASVAEDLDRTPAEVLKKLEDI 1192
>gi|310793065|gb|EFQ28526.1| CPSF A subunit region [Glomerella graminicola M1.001]
Length = 1212
Score = 234 bits (598), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 280/1173 (23%), Positives = 490/1173 (41%), Gaps = 166/1173 (14%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+L I+G I ++ FR G +D+L +AT+ + ++++ + +
Sbjct: 59 LLSHDIFGIIRSMAAFRLAGSNKDYLILATDSGRITIIEYIPAQNRFQRLHLETFGKSGV 118
Query: 61 RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL- 117
R G+ DP R LI + L V+ +++ +L + + + VL + +
Sbjct: 119 RRVIPGEYLACDPKGRACLIASVEKNKLVYVLNRNSQAELTISSPLEAHKPGVLVLSMVA 178
Query: 118 --YGCAKPTIVVLYQDNKDARHVKTYEVALKDK------------DFVEGPWSQNNLDNG 163
G A P L + +A T E A + + + V WS+ +D
Sbjct: 179 LDVGYANPVFAALEIEYTEADQDPTGEAAREAETQLVYYELDLGLNHVVRKWSEP-VDPT 237
Query: 164 ADLLIPVP-----PPLCGVLIIGEETIVYCSAN--AFKA-IPIRPSITKAYGRVD--ADG 213
A LL VP P GVL+ GEE I Y +N AF+ IP R T+ R G
Sbjct: 238 ASLLFQVPGGQDGP--SGVLVCGEENITYRHSNQEAFRVPIPRRRGATEDPSRKRHVVSG 295
Query: 214 SRYLLGDHAGLLHLLVITHEKE------------------KVTGLKIELLGETSIASTIS 255
+ L AG L+ T + + +V LKI+ ++S++
Sbjct: 296 VMHKLKGSAGAFFFLIQTEDGDLFKATIDMVEDADGNPTGEVKRLKIKYFDTIPVSSSLC 355
Query: 256 YLDNAVVYIGSSYGDSQLIKL---------------NLQPDAKGSY------------VE 288
L + +Y S +G+ Q + + D K Y +
Sbjct: 356 ILKSGFLYAASQFGNHQFYQFEKLGDDDEELEFSSDDFPTDPKAGYDAVYFHPRPLENLA 415
Query: 289 VLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI- 347
++E ++ P++D V +L + Q+ T G + R++++G+ +NE + EL GI
Sbjct: 416 LVESIDSMNPLLDCKVANLTGEDAPQIYTACGNGARSTFRMLKHGLEVNEIVASELPGIP 475
Query: 348 KGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYN 407
+W+L+ + D +D ++V+SF + T +L++ + +EE GF + TL +
Sbjct: 476 SAVWTLKLNRGDQYDAYIVLSFTNGTLVLSIG--ETVEEVSDSGFLTSVPTLAAQLLGED 533
Query: 408 QLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDG 466
L+QV +R + NEW +P S+ AT N QV +A G +VY E+ GDG
Sbjct: 534 GLIQVHPKGIRHIRQGQV---NEWAAPQHRSIVAATTNEHQVAVALSSGEIVYFEMDGDG 590
Query: 467 ILTEVKHAQLEY-EISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHL 524
L E + + ++CL + + E S AVG D +VR+ SL P+ L +K
Sbjct: 591 SLAEYDEKKEMFGTVTCLSLGEVPEGRLRSSFLAVGC-DDSTVRVLSLDPESTLESKSVQ 649
Query: 525 GGEIIPRSVLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPI 578
P S+ + A + S YL L G L +L+ TGELTD ++ LG +P+
Sbjct: 650 ALTAPPSSLAIIAMDDSSSGGSTLYLHIGLHSGVYLRTVLDEVTGELTDTRQKFLGPKPV 709
Query: 579 TLRTFSSKNTTHVFAASDRPTVIYSS--NKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIA 636
L + + T V S RP + YS + L + +N ++ + F+S + +
Sbjct: 710 RLFQVTVQGRTCVIGLSSRPWLGYSDPITRGFLVTPLNYVDLEWVWNFSSEQCEEGVVGI 769
Query: 637 KEGELTIGTIDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFV- 694
+ L I I+++ + +SIPL PRR+ + F N + + +
Sbjct: 770 QGQSLRIFAIENLGDTITQKSIPLSYTPRRLLKHPEYPMFYTIEADNNTLPPDLRAKLIA 829
Query: 695 ---------RLLDDQTFEF-----------------------ISTYPLDTFEYGCSILSC 722
R+L F + + T LD E S
Sbjct: 830 EPGVVNGDARILPPDEFGYPKGKGRWASCISVIDPLAEDQRVLQTIDLDNNEAAVSAAIV 889
Query: 723 SF-SDDSNVYYCVGTAY-VLPEENEPTKGRILVF-IVEDG-KLQLIAEKETKGAVYSLNA 778
SF S DS + VGT ++ + ++G I V+ EDG +L+ I + + + +L
Sbjct: 890 SFASQDSESFLIVGTGKDMVVNPRQFSEGYIHVYRFGEDGHELEFIHKTKVEEPPSALLG 949
Query: 779 FNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSIS 838
F G+LLA I + +++Y LR R+ Q++ L + + T+G I+VGD+ I+
Sbjct: 950 FQGRLLAGIGKTLRIYDLGLR-QMLRKAQADVTPQ---LIVSLSTQGSRIIVGDVQHGIT 1005
Query: 839 LLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDEE 895
++YK + D + W++ ++D + G + N+F VR K ++ A DE
Sbjct: 1006 YVVYKPTTNKLIPFVDDTVSRWVTCTTMVDYESVAGGDKFGNMFLVRCSEKATQEADDES 1065
Query: 896 RG---------------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVN 940
G RL ++ + + SLV VG +++ +N
Sbjct: 1066 GGLHLINTRDYLHGTPHRLSLLAHSYTQDVPTSITKTSLV-------VGGQDVLLWSGIN 1118
Query: 941 GVIGV-IASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDL 999
G IGV I + E F + L+ ++R + G +H +R + V K +DGDL
Sbjct: 1119 GTIGVFIPFVTREDVDFFQNLEQHMRTEDAPLAGRDHLMYRGY-----YVPVKGVIDGDL 1173
Query: 1000 IESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
E + L + I+ ++ SV E+ +++ ++
Sbjct: 1174 CERYTLLPSEKKQMIAGELDRSVREIERKISDI 1206
>gi|308805058|ref|XP_003079841.1| spliceosomal-like protein (ISS) [Ostreococcus tauri]
gi|116058298|emb|CAL53487.1| spliceosomal-like protein (ISS) [Ostreococcus tauri]
Length = 1166
Score = 234 bits (598), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 277/1161 (23%), Positives = 473/1161 (40%), Gaps = 193/1161 (16%)
Query: 7 YGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDNG 66
+G + ++ R G +D+L + ++ + +LQ+ E + + R G
Sbjct: 59 FGVVRSICALRLAGANRDYLCVGSDSGRVVILQFKKERNAFVKVHQETFGKSGCRRVVPG 118
Query: 67 QIGIIDPDCRLIGLHLYD--GLFKVIPFDNKGQLK-----------------EAFNIRLE 107
Q DP R I L + L V+ DN+ L A + L+
Sbjct: 119 QFVCADPKGRAICLGAMERSKLVYVMNRDNEANLTISSPLEANKSHAITYHMCALDCGLD 178
Query: 108 ELQVLDIKFLYGCAK--PTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGAD 165
I+ YG PT V+ + K H+ YE+ L V WS+ +DNGA+
Sbjct: 179 NPVFAAIELDYGEVDDDPTGEVVAETQK---HLTYYELDLGLNHVVR-KWSEP-IDNGAN 233
Query: 166 LLIPVP-----PPLCGVLIIGEETIVYCSAN---AFKAIPIRPS--------ITKAYGRV 209
LIPVP P GVL+ E I+Y + IP R S I + G
Sbjct: 234 HLIPVPGGSDGPG--GVLVCCENFIIYRHQDHEEVRAVIPRRTSLPGDRGVLIVSSAGHK 291
Query: 210 DADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYG 269
+L G ++ L I + +KV+ +K++ ++ L ++ S +G
Sbjct: 292 SKKSFFFLAQSEYGDIYKLSIEYSGDKVSEVKVKYFDTIPPCVSMCVLKTGFLFAASEFG 351
Query: 270 DSQLIKLNLQPD------AKGSYVEVLERYV-------------------NLGPIVDFCV 304
+ L + D + S +E + Y +L PI+D
Sbjct: 352 NHALYQFAGIGDDDAVESSSASLMETEQGYEPVFFDPRRLTNLYPIDKIDSLCPILDMQA 411
Query: 305 VDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTDDPFDT 363
+L + Q+ T G SLRI+R GIG+NE A L G +++++ + D +D
Sbjct: 412 HNLTEEDTPQLYTLCGTGARSSLRILRQGIGMNELAMSSLPGQPNAIFTVKKKSSDQYDG 471
Query: 364 FLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSST 423
++VVSF++ T +LA+ D + E G T TL + L+QV +G +R + +
Sbjct: 472 YIVVSFLNATLVLAIG--DTVTEVSDSGILGTTMTLQVCLMNDDSLLQVHAGGLRHIKAD 529
Query: 424 SRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD-GILTEVKHAQLEYEISC 482
R NEW++P ++ T N+ QV++A GG ++Y E+ G L EV+ + +I+C
Sbjct: 530 KR--INEWRTPGRKQISKCTCNSKQVVIALTGGEVIYFELDSAGQLIEVEKLETNGDIAC 587
Query: 483 LDINPIGENPSYSQIAAVGMWTDISVRIFSLPD---LNLITKEHLGGEIIPRSVLLC--- 536
+DI P+ E S+ A+G + D +VR+ SL L + + L G P S+L+
Sbjct: 588 VDIAPVPEGALRSRFLAMGSY-DGTVRVMSLNSDDCLQTLAVQALKGST-PSSLLILQTA 645
Query: 537 ---AFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFA 593
+ +G L + +G L+ L+ +G+L+D + LGT+ L S + + + A
Sbjct: 646 GTESLQGSLLLNVGMANGVLMRATLDQVSGQLSDMRVRFLGTRAPKLVRTSVRGESALVA 705
Query: 594 ASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDI-QKL 652
S RP + YS + S ++ + +C F+S A P+ + L I +++ + +
Sbjct: 706 LSSRPWLGYSEKGTFVLSPISYVALEEVCSFSSEACPEGVVAISNQTLRIASVERLGENF 765
Query: 653 HIRSIPLGEHPRRICHQEQSRTFAI-----CSL---------------KNQSCAEESEMH 692
H +I L PR + ++ A+ CS+ +++ +E E++
Sbjct: 766 HQTTIKLRYTPRAMSANPDTKMVALIESDQCSVPVGERESPEATSADEASEANGDEDELN 825
Query: 693 F-------------------VRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC 733
VR++D + + L E SI + + +
Sbjct: 826 MLPVEQYGAPKSSPGTWAACVRIIDPKDAKSQYVLELGKNESAISICHVYLTGPNELLLA 885
Query: 734 VGTAYVL---PEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQK 790
VGTA L P + + + + L L+ T+G V +L
Sbjct: 886 VGTAQNLTFAPRNCDGGFIHLYRYGTDSKTLSLVHSTPTEGPVGAL-------------- 931
Query: 791 IQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIE 850
CG+ GH+LA D I VGD+ +SI + YK +EG++
Sbjct: 932 -------------------CGYRGHLLAXXXXXXXDRIYVGDVQESIHFVKYKADEGSMY 972
Query: 851 ERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR----------------KN--SEGAT 892
A D +M+A LD D GA+ N+F R KN S+G
Sbjct: 973 IFADDTKPAYMTAALPLDFDTLAGADKFGNIFVNRLPKDISEDMDEDPTGGKNIYSQGVL 1032
Query: 893 DEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHE 952
+ + E +GE V G+L G I +++GT G IG +
Sbjct: 1033 NGAPNKSETCARTFIGETVCALTKGAL-------QPGGIDIIMYGTFLGGIGALVPFQTR 1085
Query: 953 QYL-FLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRM 1011
+ F L+ ++R+ + G +H +RS+ KN +DGDL E F L+
Sbjct: 1086 SEIDFFITLEMHMRQEASSIVGRDHMAFRSY-----YAPVKNVIDGDLCEQFGALAPEVQ 1140
Query: 1012 DEISKTMNVSVEELCKRVEEL 1032
I++ M+ + E+ KR+E++
Sbjct: 1141 RRIAEDMDRTPGEILKRLEQI 1161
>gi|409045147|gb|EKM54628.1| hypothetical protein PHACADRAFT_210427 [Phanerochaete carnosa
HHB-10118-sp]
Length = 1213
Score = 234 bits (597), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 281/1182 (23%), Positives = 486/1182 (41%), Gaps = 188/1182 (15%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
++G I +L FR G +D+ + ++ + +L +D ++S + R
Sbjct: 58 VFGSIRSLTAFRLTGGTKDYAIVGSDSGRIVILDYDPKTSSFVKVHQETYGKSGARRIVP 117
Query: 66 GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLK-----EAF--NIRLEELQVLDIKF 116
GQ DP R +I L ++ D L EA N + + LD+ F
Sbjct: 118 GQYLATDPKGRSVMIAATEKAKLVYILNRDAAANLTISSPLEAHKNNAIIHHIVGLDVGF 177
Query: 117 LYGCAKPTIVVLYQD-------------NKDARHVKTYEVALKDKDFVEGPWSQNNLDNG 163
PT L D N + + YE+ L V WS+ D
Sbjct: 178 ----ENPTFAALEVDYSESDQDPTGEAFNNAEKLLTYYELDLGLNHVVRK-WSEPT-DPR 231
Query: 164 ADLLIPVP------------PPLCGVLIIGEETIVYCSANAFK---AIPIRPS------- 201
A+LL+ VP P GVL+ E+ I+Y +A + IP R
Sbjct: 232 ANLLVQVPGGQLASSDRYDGP--SGVLVCCEDHIIYRHMDAPQHRVPIPRRRHPLEDKER 289
Query: 202 ---ITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLD 258
IT A +LL G L+ + I HE++ V LKI+ +AS++ L
Sbjct: 290 GLIITAAVMHKMKGAFFFLLQSEEGDLYKVTIEHEEQDVKALKIKYFDTVPVASSLCILK 349
Query: 259 NAVVYIGSSYGDSQLIK---------------------------LNLQP---DAKG-SYV 287
+ +++ + +G+ L + ++L P +G +
Sbjct: 350 SGFLFVAAEFGNHHLYQFQKLGDDDNEPEFSSTMYPNFGMANPAISLPPAYFSPRGLDNL 409
Query: 288 EVLERYVNLGPIVDFCVVDL-ERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG 346
+++ +L P+VD V++L Q+ G SLR++R+G+ + E S +L G
Sbjct: 410 TLVDELESLDPVVDAKVLNLLPNSDSPQIFAACGRGGRSSLRMLRHGLEVEEVVSSDLPG 469
Query: 347 I-KGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAI 405
I +W+ + DDPFD+++++SF++ T +L++ + +EE + GF S + TL
Sbjct: 470 IPNAVWTTKLREDDPFDSYIILSFVNGTLVLSIG--ETIEEVQDTGFLSSSPTLAVQQLG 527
Query: 406 YNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG- 464
+ L+QV +R V R NEW+ P G ++ AT N QV++A LVY E+
Sbjct: 528 ADALLQVHPHGIRHVLVDRRV--NEWRVPQGKTIVCATTNRRQVVVALSSAELVYFELDL 585
Query: 465 DGILTEVKHAQ-LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEH 523
+G L E + + + + L + + E + AVG D +VRI SL + +
Sbjct: 586 EGQLNEYQERKAMGSTVLALSVGEVPEGRQRTPFLAVGC-EDQTVRIVSLDPESTLETIS 644
Query: 524 LGGEIIPRSVLLCA---------FEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLG 574
L P S + A + +++ L +G LL +L+ G+LTD + LG
Sbjct: 645 LQALTAPPSAICIADMLDAGINKVQPTTFVNIGLQNGVLLRTVLDPINGQLTDTRTRFLG 704
Query: 575 TQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLA 634
T+PI L + + A S R + Y+ + ++ + + + F++ P+ L
Sbjct: 705 TRPIKLVRVQIQRNPGILALSSRSWLNYTHQNLMHFTPLIFDNLDYAWSFSAELCPEGLI 764
Query: 635 IAKEGELTIGTIDDI-QKLHIRSIPLGEHPRR-ICHQ----------------EQSRTFA 676
L I I + KL SIPL PR+ I H E+++
Sbjct: 765 GISGSVLRIFQIPKLGTKLKQDSIPLSYTPRKMIVHPTNGFFYMIEGDHRVMGEEAKQKK 824
Query: 677 ICSLKNQSCAEESEM----------------------HFVRLLDDQTFEFISTYPLDTFE 714
I L+ Q + EM V +D QT FI PLD E
Sbjct: 825 IEELRQQGKQVDEEMLNLPPEVFGRPKAPAGTWASGIRIVSPVDAQTVNFI---PLDNNE 881
Query: 715 YGCSILSCSFS-DDSNVYYCVGTA---YVLPEENEPTKGRILVFIVEDGKLQLIAEKETK 770
SI F+ + VGTA ++ P R F+ + ++L+ + ET
Sbjct: 882 AAFSIAVVPFTARGGELTLVVGTAQDTFLAPRSCTSGFLRTYRFLDDGRDIELLHKTETN 941
Query: 771 GAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY-VQTRGDFIV 829
++ AF G+L A I + +++Y D G ++L + A+ + T+G I+
Sbjct: 942 DVPLAIMAFQGRLAAGIGKALRIY-----DIGKKKLLRKVESKNFSNAIVTLNTQGSRII 996
Query: 830 VGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---K 886
VGD+ +SI +YK E + A D W++AV ++D + + N+F R K
Sbjct: 997 VGDMQESIFYAVYKPPENRLLIFADDAQPRWITAVTMIDYNTVAAGDRFGNVFVNRLDPK 1056
Query: 887 NSEGATDEERG---------------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQI 931
S+ D+ G + ++ +H+G+ V SLV G
Sbjct: 1057 ISDQVDDDPTGAGILHEKGILMGAPHKTAMIAHFHVGDIVTSIHKVSLV-------AGGR 1109
Query: 932 PTVIFGTVNGVIGVIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVD 990
+++ ++G +G++ + E F+ L+ ++R + G +H WR + V
Sbjct: 1110 ELLLYTGLHGTVGILVPFVSKEDVDFISTLEQHMRTEQLSLVGRDHLAWRGY-----YVP 1164
Query: 991 AKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
K +DGDL E F L ++ I+ ++ +V E+ K++E+L
Sbjct: 1165 VKAVVDGDLCEMFARLPASKQSSIATELDRTVGEVLKKLEQL 1206
>gi|389740093|gb|EIM81285.1| hypothetical protein STEHIDRAFT_86633 [Stereum hirsutum FP-91666 SS1]
Length = 1213
Score = 234 bits (596), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 270/1182 (22%), Positives = 485/1182 (41%), Gaps = 188/1182 (15%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
++G I +L FR G +D+ + ++ + +L++D ++S + R
Sbjct: 58 VFGSIRSLAAFRLTGGTKDYAIVGSDSGRIVILEYDPKTSSFVKVHQETYGKSGARRIVP 117
Query: 66 GQIGIIDPDCRLIGLHLYDG--LFKVIPFDNKGQLKEAFNIR-------LEELQVLDIKF 116
GQ DP R + + + L ++ D L + + + + LD+ F
Sbjct: 118 GQYLATDPKGRSVMVAAVEKAKLVYILNRDAAANLTISSPLEAHKNAAIIHHIVGLDVGF 177
Query: 117 LYGCAKPTIVVLYQD-------------NKDARHVKTYEVALKDKDFVEGPWSQNNLDNG 163
P L D N + + YE+ L V WS+ D
Sbjct: 178 ----ENPMFAALEVDYTESDQDPTGEAFNNADKVLTYYELDLGLNHVVRK-WSEPT-DPR 231
Query: 164 ADLLIPVP------------PPLCGVLIIGEETIVYCSANAFK---AIPIR------PS- 201
A+LL+ VP P GVL+ E+ I+Y + + IP R P
Sbjct: 232 ANLLVQVPGGQLASSDRYDGP--SGVLVCCEDHIIYRHTDVPQHRVPIPRRRHPLEDPDR 289
Query: 202 ---ITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLD 258
IT A +LL G L+ + I H++E+V +KI+ +AS++ L
Sbjct: 290 GLIITSAVMHKMKGAFFFLLQSEEGDLYKVTIEHQEEEVKAVKIKYFDTVPVASSLCILK 349
Query: 259 NAVVYIGSSYGDSQLIKL------NLQPDAKGS-------------------------YV 287
+ +++ S +G+ QL + + +P+ + +
Sbjct: 350 SGFLFVASEFGNHQLYQFQKLGDDDNEPEFSSTDYPSFGMADPSAALPRAHFRPHPLDNL 409
Query: 288 EVLERYVNLGPIVDFCVVDL-ERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG 346
+ + +L P++D V++L Q+ T G + R +R+G+ + E S +L G
Sbjct: 410 ALADELESLAPVIDSKVMNLLPNSDTPQIFTACGRGARSTFRTLRHGLEVEETVSSDLPG 469
Query: 347 I-KGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAI 405
I +W+ ++ D P+D+++++SF++ T +L++ + +EE + GF S TL
Sbjct: 470 IPNAVWTTKTREDAPYDSYIILSFVNGTLVLSIG--ETIEEVQDTGFLSSAPTLAVQQIG 527
Query: 406 YNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG- 464
+ L+QV +R V + R NEW+ P G ++ AT N QV++A LVY E+
Sbjct: 528 ADALLQVHPHGIRHVLADRRV--NEWRVPQGKTIVSATTNKRQVVVALSSAELVYFELDL 585
Query: 465 DGILTEVKHAQ-LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEH 523
DG L E + + + + L I + E + AVG D +VRI SL + +
Sbjct: 586 DGQLNEYQDRKAMGSTVLALSIGEVPEGRQRTPYLAVGC-EDQTVRIISLDPESTLETIS 644
Query: 524 LGGEIIPRSVLLCA---------FEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLG 574
L P S L A + ++ L +G LL +L+ G+LTD + LG
Sbjct: 645 LQALTAPPSALCIADMLDAGINKSQPTMFVNIGLQNGVLLRTVLDPVNGQLTDTRTRFLG 704
Query: 575 TQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLA 634
T+P+ L ++ + A S R + Y+ + ++ + + F++ P+ L
Sbjct: 705 TRPVRLIRVQIQHQPAILALSSRTWLNYTHQNLMHFTPLIFDNLDFAWSFSAELCPEGLI 764
Query: 635 IAKEGELTIGTIDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAI---------------- 677
L I + + KL SIPL PR+ + F I
Sbjct: 765 GITGSVLRIFQMPKLGTKLKQDSIPLSYTPRKFASHPSNSYFYIIEADHRVTGPDATAKK 824
Query: 678 -CSLKNQSCAEESEM-------------------HFVRLLDDQTFEFISTYPLDTFEYGC 717
L++Q + EM +R++D + ++ P+D E
Sbjct: 825 VAELRSQGKMVDDEMLELPAEVFGRSKAPAGTWASCIRIVDPAEGKSVAEIPIDNNEAAF 884
Query: 718 SILSCSFSDDSNVYY-CVGTA---YVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAV 773
S+ FS +N Y+ VGTA ++ P R F+ + L+L+ + ET
Sbjct: 885 SLAIVPFSVRNNEYHLVVGTAQDTFLAPRSCTSGFLRTYKFVDDGAGLELLHKTETDDIP 944
Query: 774 YSLNAFNGKLLAAINQKIQLY----KWMLRDDGTRELQSECGHHGHILALYVQTRGDFIV 829
SL AF G+L+A I + +++Y K +LR ++ S + + T+G I+
Sbjct: 945 MSLLAFQGRLVAGIGKALRIYDIGKKKLLRKAESKTFASAI--------ISLNTQGSRII 996
Query: 830 VGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---K 886
VGD+ +SI+ +YK E + A D A W++ ++D + N+F R K
Sbjct: 997 VGDMQESIAYAVYKAPENKLLVFADDTQARWVTCSTMVDYTTVAAGDRFGNIFINRLDSK 1056
Query: 887 NSEGATDEERG---------------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQI 931
S+ D+ G + ++ +H+G+ V SLV G
Sbjct: 1057 VSDQVDDDPTGAGILHEKGILMGAPHKTAMLAHFHVGDLVTSIHKVSLV-------AGGR 1109
Query: 932 PTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVD 990
+++ ++G IG++ L E F+ L+ ++R + G +H WR + V
Sbjct: 1110 EVLLYTGLHGTIGMLVPLVSKEDVDFISTLEQHIRTEQTSLVGRDHLAWRGY-----YVP 1164
Query: 991 AKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
K +DGDL E+F L + I+ ++ +V E+ K++++L
Sbjct: 1165 VKAVVDGDLCETFARLPAAKQSMIAGELDRTVSEVLKKLDQL 1206
>gi|169848339|ref|XP_001830877.1| pre-mRNA-splicing factor RSE1 [Coprinopsis cinerea okayama7#130]
gi|116508046|gb|EAU90941.1| pre-mRNA-splicing factor RSE1 [Coprinopsis cinerea okayama7#130]
Length = 1213
Score = 233 bits (595), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 265/1174 (22%), Positives = 486/1174 (41%), Gaps = 174/1174 (14%)
Query: 7 YGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDNG 66
+G + +L FR G +D+ + ++ + +L +D +++ + R G
Sbjct: 59 FGSVRSLAAFRLTGGTKDYAILGSDSGRIVILDYDPKTTSFVKLHQETFGKSGARRIVPG 118
Query: 67 QIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YGCA 121
Q DP R +IG L ++ D L + + + + + G
Sbjct: 119 QYLATDPKGRAVMIGAMEKAKLVYILNRDAAANLTISSPLEAHKNSAIIHHIVGVDVGFE 178
Query: 122 KPTIVVLYQD-------------NKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLI 168
P L D N + + YE+ L V WS+ D A+LL+
Sbjct: 179 NPLFAALEVDYSESDQDPTGEAFNNAEKMLTYYELDLGLNHVVRK-WSEPT-DPRANLLV 236
Query: 169 PVP------------PPLCGVLIIGEETIVYCSANAFK---AIPIR------PS----IT 203
VP P GVL+ E+ I+Y +A + IP R P IT
Sbjct: 237 QVPGGQLASSDRYDGP--SGVLVCCEDHIIYRHMDAPQHRVPIPRRRHPLEDPERGVIIT 294
Query: 204 KAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVY 263
A +LL + G L + I HE+E V LKI+ +AS++ L + ++
Sbjct: 295 AAVMHKMKGAFFFLLQNEDGDLFKVTIEHEEEDVKALKIKYFDTVPVASSLCILKSGFLF 354
Query: 264 IGSSYGDSQLIKLNL------QPDAKG-SY------------------------VEVLER 292
+ S +G+ L + +P+ SY + + +
Sbjct: 355 VASEFGNHHLYQFQKLGDDDDEPEVTSTSYPSFGMADPTAALPRAYFKPRPLDNLALADE 414
Query: 293 YVNLGPIVDFCVVDL-ERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGM 350
+L PI+D V++L Q+ G +LR +R+G+ + E S EL GI +
Sbjct: 415 LESLDPILDSKVLNLLPNSDAPQIFAACGRGARSTLRTLRHGLEVEESVSSELPGIPNAV 474
Query: 351 WSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLV 410
W+ + + +DP+D+++++SF++ T +L++ + +EE + GF S + T+ + L+
Sbjct: 475 WTTKKTEEDPYDSYIILSFVNGTLVLSIG--ETIEEVQDTGFLSSSPTIAVQQIGADALL 532
Query: 411 QVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILT 469
QV +R V + R NEW+ P G ++ AT N QV++A LVY E+ DG L
Sbjct: 533 QVHPHGIRHVLADRRV--NEWRVPQGKTIVTATTNKRQVVVALSSAELVYFELDLDGQLN 590
Query: 470 EVKHAQ-LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGE 527
E + + + + L I + E + AVG D +VRI SL P+ L T
Sbjct: 591 EYQDRKAMGSTVLALSIGEVPEGRQRTPYLAVGC-EDQTVRIISLDPETTLETLSLQALT 649
Query: 528 IIPRSVLLCAFEGIS--------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPIT 579
P S+ + S ++ L +G LL +L+ G+LTD + LGT+PI
Sbjct: 650 APPSSICIADMLDASINKSQPTMFVNIGLQNGVLLRTVLDPINGQLTDTRTRFLGTRPIK 709
Query: 580 LRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEG 639
L + A S R + Y+ + ++ + + + F++ P+ L
Sbjct: 710 LIRVPVHKNPAILALSSRSWINYTHQNMMHFTPLIYDNLDYAWSFSAELSPEGLIGIAGN 769
Query: 640 ELTIGTIDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESE-------- 690
L I TI + KL ++PL PR++ ++ F + ++ +EE+
Sbjct: 770 VLRIFTIPKLGSKLKQDTLPLSYTPRKLITHPENNYFYLIESDHRVYSEEATKAKLDELQ 829
Query: 691 ----------------------------MHFVRLLDDQTFEFISTYPLDTFEYGCSILSC 722
+R++D + ++ +PLD E SI
Sbjct: 830 KKGKKIDEEIISLPPSEFGRPKAPAGTWASNIRIIDPVENKTVAVFPLDNNEAAFSIAIV 889
Query: 723 SFS-DDSNVYYCVGTA---YVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNA 778
FS + ++ VGTA V P E R F L+L+ + ET +L A
Sbjct: 890 PFSARNGELHLVVGTAKDTTVSPRTCESGFLRTYKFTENGTGLELLHKTETDDVPMALLA 949
Query: 779 FNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY-VQTRGDFIVVGDLMKSI 837
F G+L A + + +++Y D G ++L + + A+ + T+G I+VGD+ +S+
Sbjct: 950 FQGRLAAGVGKALRIY-----DIGKKKLLRKVENKSFTTAIVTLTTQGSRILVGDMQESV 1004
Query: 838 SLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDE 894
++YK E + A D W++A+ ++D + + + N+F R K S+ ++
Sbjct: 1005 QYVVYKQPENRLLTFADDTQPRWVTAITMVDYNTIVAGDRFGNIFVNRLDSKVSDQVDED 1064
Query: 895 ERG---------------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTV 939
G + +++ +H+G+ + SLV G +++ +
Sbjct: 1065 PTGAGILHEKPILMGAPHKTKMIAHFHVGDIITSLHKVSLV-------AGGREVIVYTGL 1117
Query: 940 NGVIGVIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGD 998
+G IG++ + E F+ L+ ++R + G + +R + V K +DGD
Sbjct: 1118 HGTIGILMPFISKEDVDFISTLEQHMRTEQPSLVGRDQLAYRGY-----YVPVKAVVDGD 1172
Query: 999 LIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
L E++ L ++ I+ ++ +V E+ K++E++
Sbjct: 1173 LCETYAHLPASKQSSIANELDRTVGEVLKKLEQM 1206
>gi|395333071|gb|EJF65449.1| hypothetical protein DICSQDRAFT_178021 [Dichomitus squalens LYAD-421
SS1]
Length = 1213
Score = 233 bits (594), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 283/1185 (23%), Positives = 487/1185 (41%), Gaps = 194/1185 (16%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
++G I +L FR G +D++ + ++ + +L +D ++S + R
Sbjct: 58 VFGSIRSLAAFRLTGGTKDYVIVGSDSGRIVILDYDPKTSSFVKVHQETFGKSGARRIVP 117
Query: 66 GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLK-----EAF--NIRLEELQVLDIKF 116
GQ DP R +I L ++ D L EA N + + LD+ F
Sbjct: 118 GQYLATDPKGRSVMIAAMEKAKLVYILNRDAAANLTISSPLEAHKNNAIIHHIVGLDVGF 177
Query: 117 LYGCAKPTIVVLYQD-------------NKDARHVKTYEVALKDKDFVEGPWSQNNLDNG 163
PT L D N + + YE+ L V WS+ D
Sbjct: 178 ----ENPTFAALEVDYSESDQDPSGEAFNNAEKMLTYYELDLGLNHVVRK-WSEPT-DPR 231
Query: 164 ADLLIPVP------------PPLCGVLIIGEETIVYCSANAFK---AIPIR------PS- 201
A+LL+ VP P GVL+ E+ I+Y +A + IP R P
Sbjct: 232 ANLLVQVPGGQLATSDRYDGP--SGVLVCCEDHIIYRHMDAPQHRVPIPRRRHPLEDPER 289
Query: 202 ---ITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLD 258
IT A +LL G L+ + I HE++ V LKI+ +AS + L
Sbjct: 290 GLIITAAVMHKMKGAFFFLLQSEEGDLYKVTIEHEEQDVKALKIKYFDTVPVASGLCILK 349
Query: 259 NAVVYIGSSYGDS---QLIKL---NLQPD-AKGSY------------------------V 287
+ +++ S +G+ Q KL + +P+ + G Y +
Sbjct: 350 SGFLFVASEFGNHYFYQFQKLGDDDNEPEFSSGDYPSYGMADPTAALPRAYFRPRALDNL 409
Query: 288 EVLERYVNLGPIVDFCVVDL-ERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG 346
+++ +L PI+D V++L Q+ G + R +R+G+ + E S +L G
Sbjct: 410 TLVDELESLCPILDSKVMNLLPNSDTPQIFAACGRGARSTFRTLRHGLEVEEVVSSDLPG 469
Query: 347 I-KGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAI 405
I +W+ + DDP+D+++++SF++ T +L + + +EE + GF S TL
Sbjct: 470 IPNAVWTTKLKEDDPYDSYIILSFVNGTLVLCIG--ETIEEVQDTGFLSSAPTLAVQQIG 527
Query: 406 YNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG- 464
+ L+QV +R V S R NEW+ P G ++ AT N QV++A LVY E+
Sbjct: 528 ADALLQVHPHGIRHVLSDRRV--NEWRVPSGKTIVCATTNKRQVVVALSSAELVYFELDL 585
Query: 465 DGILTEVKHAQ-LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEH 523
DG L E + + + + L I + E + AVG D +VRI SL + +
Sbjct: 586 DGQLNEYQDRKAMGSTVLALSIAEVPEGRQRTPYLAVGC-EDQTVRIVSLDPESTLETIS 644
Query: 524 LGGEIIPRSVLLCA---------FEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLG 574
L P S + A + +++ L +G LL +L+ G+LTD + LG
Sbjct: 645 LQALTAPPSAICIADMLDAGINKTQPTTFVNIGLQNGVLLRTVLDPVNGQLTDTRTRFLG 704
Query: 575 TQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLA 634
T+PI L + + + A S RP + Y+ + ++ + + + + F++ + L
Sbjct: 705 TRPIRLLRVNIQRNPAILALSSRPWLNYTHQNFMHFTPLIFENLDYAWSFSAELCQEGLI 764
Query: 635 IAKEGELTIGTIDDI-QKLHIRSIPLGEHPRRI-----------------CHQEQSRTFA 676
L I I + KL SIPL PR++ E++
Sbjct: 765 GISGSLLRIFQIPKLGTKLKQDSIPLSYTPRKLIPHPHNGLLYLIEGDHRVMSEEAAAKQ 824
Query: 677 ICSLKNQSCAEESEM----------------------HFVRLLDDQTFEFISTYPLDTFE 714
+ L+ A + EM + LD QT I LD E
Sbjct: 825 LQQLRESGRAVDEEMVNLPPEQFGRPKAPAGTWASCIRIISPLDAQTVNVIH---LDNNE 881
Query: 715 YGCSILSCSFSDDSN-VYYCVGTA---YVLPEENEPTKGRILVFIVEDGKLQLIAEKETK 770
S+ F+ +N ++ VGTA ++ P R F + L+L+ + ET
Sbjct: 882 AAFSLAIVPFAAKNNELHLVVGTAQDTFLAPRSCTSGFLRTYRFTDDGRNLELLHKTETN 941
Query: 771 GAVYSLNAFNGKLLAAINQKIQLY----KWMLRDDGTRELQSECGHHGHILALYVQTRGD 826
++ AF GKL+A + + ++LY K +LR + L S + + T+G
Sbjct: 942 DVPLAIMAFQGKLVAGVGKALRLYDIGKKKLLRKVENKTLGSTI--------VTLNTQGS 993
Query: 827 FIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR- 885
I++GD+ +S+ +YK E + A D W++A +LD + + ++ N+F R
Sbjct: 994 RIIIGDMQESVFYAVYKPPENRLLVFADDVQPRWVTATTMLDYNTVVASDRFGNVFVNRL 1053
Query: 886 --KNSEGATDEERG---------------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV 928
K S+ D+ G + ++ +H+G+ V SLV
Sbjct: 1054 DAKISDQIDDDPTGAGILHEKGVLFGAPHKTAMLAHFHIGDIVTSLNKISLV-------A 1106
Query: 929 GQIPTVIFGTVNGVIGVIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKK 987
G +++ ++G IG++ + E L L+ ++R + G +H WR +
Sbjct: 1107 GGREVILYTCLHGTIGILVPFVSKEDVDLLTTLEQHMRTEQLSLVGRDHLAWRGY----- 1161
Query: 988 TVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
V K +DGDL ESF L + I+ ++ +V E+ K++E+L
Sbjct: 1162 YVPVKAVVDGDLCESFAKLPANKQSSIAGELDRTVGEVLKKLEQL 1206
>gi|346971485|gb|EGY14937.1| pre-mRNA-splicing factor RSE1 [Verticillium dahliae VdLs.17]
Length = 1230
Score = 232 bits (592), Expect = 8e-58, Method: Compositional matrix adjust.
Identities = 269/1054 (25%), Positives = 453/1054 (42%), Gaps = 169/1054 (16%)
Query: 119 GCAKPTIVVLYQDNKDARHVKTYEVALKDK------------DFVEGPWSQNNLDNGADL 166
G A P L D +A T + AL + + V WS+ +DN A L
Sbjct: 200 GYANPVFAALETDYTEADQDPTGQAALDVETQLVYYELDLGLNHVVRKWSEP-VDNTASL 258
Query: 167 LIPVP-----PPLCGVLIIGEETIVYCSAN--AFKA-IPIRPSITK--AYGRVDADGSRY 216
L VP P GVL+ GEE I Y +N AF+ +P R T+ + R G +
Sbjct: 259 LFQVPGGNDGP--SGVLVCGEENITYRHSNQEAFRVPVPRRRGATEDPSRKRCIVAGVMH 316
Query: 217 LLGDHAGLLHLLVITHEKE------------------KVTGLKIELLGETSIASTISYLD 258
L AG L+ T + + +V LKI+ +AS++ L
Sbjct: 317 KLKGSAGAFFFLLQTEDGDLFKITIDMIEDRDGNPTGEVKRLKIKYFDTIPVASSLCILK 376
Query: 259 NAVVYIGSSYGDSQLIKL---------------NLQPDAKGSY------------VEVLE 291
+ +Y+ S +G+ Q + + D K SY + ++E
Sbjct: 377 SGFLYVASQFGNYQFYQFEKLGDDDEELEFSSDDFPTDPKQSYEAVFFHPRELENLALVE 436
Query: 292 RYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGM 350
++ P++D V +L + Q+ T G + RI+++G+ +NE + EL GI +
Sbjct: 437 SIDSMNPLIDCKVANLTGEDAPQIYTACGNGARSTFRILKHGLEVNEIVASELPGIPSAV 496
Query: 351 WSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLV 410
W+L+ S D +D ++V+SF + T +L++ + +EE GF + TL L+
Sbjct: 497 WTLKLSRGDQYDAYIVLSFTNATLVLSIG--ETVEEVNDSGFLTSVPTLAAQLLGGEGLI 554
Query: 411 QVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILT 469
QV +R + + NEW +P S+ AT N QV +A G +VY E+ DG L
Sbjct: 555 QVHPKGIRHIRNGQV---NEWAAPQHRSIVAATTNEHQVAVALSSGEIVYFEMDSDGSLA 611
Query: 470 EVKHAQLEY-EISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGE 527
E + + ++CL + + E S AVG D +VR+ SL P+ L +K
Sbjct: 612 EYDEKKEMFGTVTCLSLGEVPEGRLRSSFLAVGC-DDSTVRVLSLDPETTLESKSVQALT 670
Query: 528 IIPRSVLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLR 581
P S+ + A E S YL L G L +L+ TGELTD ++ LG +P+ L
Sbjct: 671 APPTSLAVIAMEDSSSGGSTLYLHIGLHSGVYLRTVLDEVTGELTDTRQKFLGPKPVKLF 730
Query: 582 TFSSKNTTHVFAASDRPTVIYSS--NKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEG 639
+ + T V S RP + Y+ K + + +N ++ F+S + + +
Sbjct: 731 QVTVQGRTCVLGLSSRPWLGYADPITKGFVVTPLNYVDLEWGWNFSSEQCEEGVVGIQGQ 790
Query: 640 ELTIGTIDDI-QKLHIRSIPLGEHPRRIC-HQEQSRTFAICS--------LKNQSCAEES 689
L I I+ + L +SIPL PRR+ H E + I S L+ Q A+ S
Sbjct: 791 SLRIFAIEKLGDTLTQKSIPLTYTPRRMVKHPEHPMFYTIESDNNTLPPELRAQLLADPS 850
Query: 690 ------------EMHFVR----------LLD--DQTFEFISTYPLDTFEYGCSILSCSF- 724
E + R ++D + + + T LD E S F
Sbjct: 851 VVNGDARTLPPAEFGYPRAKGRWASCISVIDPLSEELQTLQTVDLDNNEAAVSAAIVPFT 910
Query: 725 SDDSNVYYCVGTAY-VLPEENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVYSLNAFNG 781
S D+ + VGT ++ + T+G I ++ EDG+ L+ I + + + +L AF G
Sbjct: 911 SQDNESFLVVGTGKDMIVNPRQFTEGYIHIYRFSEDGRELEFIHKTKVEEPPTALLAFQG 970
Query: 782 KLLAAINQKIQLY----KWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSI 837
+L+A + + +++Y K ML R+ Q++ L + + T+G IVVGD+ + +
Sbjct: 971 RLVAGVGKTLRIYDLGQKQML-----RKAQADVAPQ---LIVSLSTQGSRIVVGDVQQGV 1022
Query: 838 SLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDE 894
+ ++YK + D A WM+ ++D + G + N+F VR K S+ A +E
Sbjct: 1023 TYVVYKPLSNKLIPFVDDTVARWMTCTTMVDYESVAGGDKFGNIFIVRAPEKASQEADEE 1082
Query: 895 ERG---------------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTV 939
G RL +V ++ + + SLV VG +++ +
Sbjct: 1083 GAGLHLTNTRDYLHGTPHRLSLVSHFYSQDVLTSITKTSLV-------VGGQDVLLWSGI 1135
Query: 940 NGVIGV-IASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGD 998
+G IGV I + E F + L+ +LR + G +H +R + K V +DGD
Sbjct: 1136 SGTIGVFIPFVTREDADFFQTLEQHLRTEDAPLAGRDHLMYRGYYAPVKGV-----IDGD 1190
Query: 999 LIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
L E + L + I+ ++ SV E+ +++ ++
Sbjct: 1191 LCERYTLLPNDKKQMIAGELDRSVREIERKISDI 1224
>gi|322693432|gb|EFY85292.1| Pre-mRNA-splicing factor RSE1 [Metarhizium acridum CQMa 102]
Length = 1221
Score = 231 bits (588), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 289/1166 (24%), Positives = 496/1166 (42%), Gaps = 168/1166 (14%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
++G I +L FR G +D+L IA++ + ++++ + + R
Sbjct: 64 VFGIIRSLAAFRLAGSNKDYLIIASDSGRITIVEYLPAQNRFSRLHLETFGKSGVRRVIP 123
Query: 66 GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YGC 120
G+ DP R LI + L V+ + +L + + + VL I + G
Sbjct: 124 GEYLACDPKGRACLIASTEKNKLVYVLNRSAQAELTISSPLEAHKPGVLVISMVALDVGY 183
Query: 121 AKPTIVVLYQDNKDA---------RHVKT----YEVALKDKDFVEGPWSQNNLDNGADLL 167
A P L D D V+T YE+ L V WS+ +D A LL
Sbjct: 184 ANPVFAALEMDYSDVDQDSSGQALEEVETQLVYYELDLGLNHVVRK-WSEP-VDPTASLL 241
Query: 168 IPVP-----PPLCGVLIIGEETIVYCSAN--AFKA-IPIRPSITK--AYGRVDADGSRYL 217
VP P GVL+ GEE+I Y +N AF+ IP R T+ + R+ G +
Sbjct: 242 FQVPGGNDGP--SGVLVCGEESITYRHSNQDAFRVPIPRRKGATEDPSRKRIIVSGVMHK 299
Query: 218 LGDHAGLLHLLVITHEKE------------------KVTGLKIELLGETSIASTISYLDN 259
L AG L+ T + + +V LKI+ +A+++ L +
Sbjct: 300 LKGSAGAFFFLLQTEDGDLFKVTIDMVEDEEGSPTGEVRRLKIKYFDTVPVATSLCILKS 359
Query: 260 AVVYIGSSYGD---SQLIKL------------NLQPDAKGSY------------VEVLER 292
+YI S +G+ Q KL + D + SY + ++E
Sbjct: 360 GFLYIASQFGNFSFYQFEKLGDDDEELEFSSDDFPVDPQASYDPVYFHPRPAENLALVES 419
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMW 351
++ P++D V +L + Q+ T G + R++R+G+ +NE + EL GI +W
Sbjct: 420 IPSMNPLLDCQVANLTGEDAPQIYTVCGNGARSTFRMLRHGLEVNEIVASELPGIPSAVW 479
Query: 352 SLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQ 411
+L+ + + +D ++V+SF + T +L++ + +EE GF + TL + L+Q
Sbjct: 480 TLKLNRGEQYDAYIVLSFTNGTLVLSIG--ETVEEVSDSGFSTSVPTLAAQLLGDDGLIQ 537
Query: 412 VTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTE 470
V +R V + NEW +P S+ A+ NA QV +A G +VY E+ DG L E
Sbjct: 538 VHPKGIRHVRNGKV---NEWDAPQHRSIVAASTNAHQVAIALSSGEIVYFEMDSDGSLAE 594
Query: 471 VKHAQLEY-EISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEI 528
+ + ++CL + + E + S AVG D +VRI SL P+ L +K
Sbjct: 595 YDEKKEMFGTVTCLSLGEVPEGRARSSFLAVGC-DDCTVRILSLDPESTLESKSVQALTA 653
Query: 529 IPRSVLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRT 582
P S+ + A + S YL L G L +L+ TGELTD ++ LG + + L
Sbjct: 654 APSSLAIIAMDDSSSGGSTLYLHIGLHSGVYLRTVLDEVTGELTDTRQKFLGPKQVRLFQ 713
Query: 583 FSSKNTTHVFAASDRPTVIYSS--NKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGE 640
+ + TT V S RP + YS K + + +N ++ F+S + + +
Sbjct: 714 VTVQGTTCVLGLSSRPWLGYSDPITKGFVVTPLNYVDLEWGWNFSSEQCEEGIVGIQGQS 773
Query: 641 LTIGTIDDI-QKLHIRSIPLGEHPRR-ICHQEQSRTFAICS--------LKNQSCAEESE 690
L I +ID + L ++I L P++ I H EQ + I S L+ Q A+ +
Sbjct: 774 LRIFSIDRLGDTLTQKAISLTYTPKKLIKHPEQPLFYTIESDNNTLPPDLRAQLIADPAV 833
Query: 691 MHF-VRLLDDQTF-----------------------EFISTYPLDTFEYGCSILSCSF-S 725
++ ++L + F + + T L+ E S F S
Sbjct: 834 VNGDTKVLPPEEFGYPKGNRRWASCINVIDPVSEEPQVVQTVDLENNEAAVSAAIVPFAS 893
Query: 726 DDSNVYYCVGTA---YVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGK 782
D+ + VGT V P + F E +L+ I + + + +L F GK
Sbjct: 894 QDNESFLIVGTGKDIVVNPRNFSEAYIYVYRFQEEGRELEFIHKTKIEEPALALIPFQGK 953
Query: 783 LLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIY 842
LLA + + +++Y +R R+ Q+E I++L T+G I+VGD+ + ++ + Y
Sbjct: 954 LLAGVGKTLRVYDLGMR-QMLRKAQAEVAPQ-QIVSL--NTQGSRIIVGDVQQGVTYVTY 1009
Query: 843 KHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDEERG-- 897
K + A D A W++ ++D + G + N+F VR K SE A +E+ G
Sbjct: 1010 KPTTNKLIPFADDIIARWITCTTMVDYESVAGGDKFGNMFIVRCPPKASEEADEEQSGLH 1069
Query: 898 -------------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIG 944
RL+++ ++ + SLV VG +++ + G IG
Sbjct: 1070 LMNARDYLHGTSQRLDLMCHFYTQDIPTSMAKTSLV-------VGGQDVLLWSGLMGTIG 1122
Query: 945 V-IASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESF 1003
V I + E F + L+++LR + G +H +RS+ K +DGDL E +
Sbjct: 1123 VFIPLISREDADFFQSLESHLRTEDPPLAGRDHLMYRSY-----YAPVKGIIDGDLCERY 1177
Query: 1004 LDLSRTRMDEISKTMNVSVEELCKRV 1029
L + I+ ++ SV E+ +++
Sbjct: 1178 TLLPNDKKQMIAGELDRSVREIERKI 1203
>gi|66811906|ref|XP_640132.1| CPSF domain-containing protein [Dictyostelium discoideum AX4]
gi|74854972|sp|Q54SA7.1|SF3B3_DICDI RecName: Full=Probable splicing factor 3B subunit 3
gi|60468134|gb|EAL66144.1| CPSF domain-containing protein [Dictyostelium discoideum AX4]
Length = 1256
Score = 231 bits (588), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 245/1065 (23%), Positives = 456/1065 (42%), Gaps = 192/1065 (18%)
Query: 133 KDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPL---CGVLIIGEETIVYCS 189
K + + YE+ L + V WS + +D+ A++++ VP GVL+ E+ IVY +
Sbjct: 213 KKKKLLTYYELDLGLNNVVR-KWS-DQVDDSANIVMTVPGGTEGPGGVLVASEDYIVYRN 270
Query: 190 ANAFKAIPIRPSITKAYGRVDADGSR--------------YLLGDHAGLLHLLVITHEKE 235
+ + +R I + YG G +L+ G L+ + + ++ +
Sbjct: 271 QDHAE---VRSRIPRRYGSDPNKGVLIISHSSHKQKGMFFFLVQSEHGDLYKITLDYQGD 327
Query: 236 KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLI--------------------- 274
+V+ + + +A+ ++ L N ++ S +GD L
Sbjct: 328 QVSEVNVNYFDTIVLANCLTVLKNGFLFAASEFGDHTLYFFKSIGDEEEEGQAKRLEDKD 387
Query: 275 -KLNLQP-DAKGSYVEVLE------RYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGS 326
L P ++ G+ +E L+ +L PI+DF V+DL R+ Q+ + G + S
Sbjct: 388 GHLWFTPRNSCGTKMEELKNLEPTSHLSSLSPIIDFKVLDLVREENPQLYSLCGTGLNSS 447
Query: 327 LRIVRNGIGINEQASVELQGI-KGMWSLRSSTD----DPFDTFLVVSFISETRILAMNLE 381
L+++R+G+ + + L G+ G+W++ ST D D ++VVSF+ T +L++
Sbjct: 448 LKVLRHGLSVTTITTANLPGVPSGIWTVPKSTSPNAIDQTDKYIVVSFVGTTSVLSVG-- 505
Query: 382 DELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNV 441
D ++E G T TL + ++QV R + S R NEW++P ++
Sbjct: 506 DTIQENHESGILETTTTLLVKSMGDDAIIQVFPTGFRHIKSDLR--INEWRAPGRKTIVR 563
Query: 442 ATANASQVLLATGGGHLVYLEIGDGI-LTEVKHAQLEYEISCLDINPIGENPSYSQIAAV 500
A+AN SQ+ +A GG ++Y E+ L E+ L +I+C++I+PI + + ++ AV
Sbjct: 564 ASANQSQLAIALSGGEIIYFELDQASNLIEIIKKDLRRDIACIEISPIPKGRNMARFIAV 623
Query: 501 GMWTDISVRIFSLPDLNLITKEHL--GGEIIPRSVLLCAFE----GIS------------ 542
W + +R+ SL N + + + ++ S+ + + GI
Sbjct: 624 SDW-EGPIRVLSLDRDNCLGQVSMLDTDKVYIESLSIIEMQLNEMGIETKKSQSQTGQTT 682
Query: 543 -------------------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTF 583
+L L +G + L+ TGEL+D + LG +P+ L
Sbjct: 683 TTTTSTSSASSSVTSGGSLFLFVGLKNGVVKRATLDSVTGELSDIRTRLLGRKPVKLFKV 742
Query: 584 SSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTI 643
+ + + A S R + Y + KL ++++ + + +S +S+ E ++ I
Sbjct: 743 KVRGSNAMLALSSRVWLNYINQGKLDIVPLSIEPLENASNLSSEQSAESIVATSENKIII 802
Query: 644 GTIDDIQKL-HIRSIPLGEHPRR-ICHQEQSRTFAICSLKNQSC--------AEESEM-- 691
+ID + L + +I L P+R I H + S + + N + E+SE
Sbjct: 803 FSIDKLGDLFNQETIKLNATPKRFIIHPQTSYIIILETETNYNTDNIDIDKINEQSEKLL 862
Query: 692 -----------------------------------------HFVRLLDDQTFEFISTYPL 710
+++++D T E + + L
Sbjct: 863 LEKQKELQQEMDIDDDDQNNNNEIEPFKKLFKPKAGKGKWKSYIKIMDPITHESLESLML 922
Query: 711 DTFEYGCSILSCSFSDDSNVYYCVG--TAYVL-PEENEPTKGRILVFIVEDGKLQLIAEK 767
+ E G S+ +CSF + ++ VG T VL P+ ++ + FI KL+L+ +
Sbjct: 923 EDGEAGFSVCTCSFGESGEIFLVVGCVTDMVLNPKSHKSAHLNLYRFIDGGKKLELLYKT 982
Query: 768 ETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY-VQTRGD 826
E + VY++ F GKL+ + + I++Y D G ++L +C + + + GD
Sbjct: 983 EVEEPVYAMAQFQGKLVCGVGKSIRIY-----DMGKKKLLRKCETKNLPNTIVNIHSLGD 1037
Query: 827 FIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR- 885
+VVGD+ +SI + YK E + A D WM++ +LD D GA+ N+F +R
Sbjct: 1038 RLVVGDIQESIHFIKYKRSENMLYVFADDLAPRWMTSSVMLDYDTVAGADKFGNIFVLRL 1097
Query: 886 -----------------KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV 928
K G + +L+ + + +G+ V SLV V
Sbjct: 1098 PLLISDEVEEDPTGTKLKFESGTLNGAPHKLDHIANFFVGDTVTTLNKTSLV-------V 1150
Query: 929 GQIPTVIFGTVNGVIG-VIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKK 987
G +++ T++G IG +I E F L+ N+R + G +H +RS+
Sbjct: 1151 GGPEVILYTTISGAIGALIPFTSREDVDFFSTLEMNMRSDCLPLCGRDHLAYRSY----- 1205
Query: 988 TVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
KN +DGDL E F L+ + IS+ ++ S E+ K++EE+
Sbjct: 1206 YFPVKNIIDGDLCEQFSTLNYQKQLSISEELSRSPSEVIKKLEEI 1250
>gi|429859776|gb|ELA34542.1| pre-mRNA-splicing factor rse1 [Colletotrichum gloeosporioides Nara
gc5]
Length = 1212
Score = 230 bits (587), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 281/1173 (23%), Positives = 489/1173 (41%), Gaps = 166/1173 (14%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+L I+G I ++ FR G +D+L +AT+ + ++++ + +
Sbjct: 59 LLSHDIFGIIRSMAAFRLAGSNKDYLILATDSGRITIVEYIPAQNRFQRLHLETFGKSGV 118
Query: 61 RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL- 117
R G+ DP R LI + L V+ + + +L + + + VL + +
Sbjct: 119 RRVIPGEYLACDPKGRACLIASVEKNKLVYVLNRNAQAELTISSPLEAHKPGVLVLSMVA 178
Query: 118 --YGCAKPTIVVLYQDNKDARHVKTYEVALKDK------------DFVEGPWSQNNLDNG 163
G A P L + +A T E A + + + V WS+ +D
Sbjct: 179 LDVGYANPVFAALEIEYTEADQDPTGEAAREAETQLVYYELDLGLNHVVRKWSEP-VDPT 237
Query: 164 ADLLIPVP-----PPLCGVLIIGEETIVYCSAN--AFKA-IPIRPSITKAYGRVD--ADG 213
A LL VP P GVL+ GEE I Y +N AF+ IP R T+ R G
Sbjct: 238 ASLLFQVPGGQDGP--SGVLVCGEENITYRHSNQEAFRVPIPRRRGATEDPSRKRHIVSG 295
Query: 214 SRYLLGDHAGLLHLLVITHEKE------------------KVTGLKIELLGETSIASTIS 255
+ L AG L+ T + + +V LKI+ ++S++
Sbjct: 296 VMHKLKGSAGAFFFLLQTEDGDLFKAVIDMVEDADGNPTGEVKRLKIKYFDTVPVSSSLC 355
Query: 256 YLDNAVVYIGSSYGDSQLIKL---------------NLQPDAKGSY------------VE 288
L + +Y S +G+ Q + + D K Y +
Sbjct: 356 ILKSGFLYAASQFGNHQFYQFEKLGDDDEEKEFSSDDFPADPKAGYDAVYFYPRPLENLA 415
Query: 289 VLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI- 347
++E ++ P++D V +L + Q+ T G + R++++G+ +NE + EL GI
Sbjct: 416 LVESIDSMNPLLDCKVANLTGEDAPQIYTACGNGARSTFRMLKHGLEVNEIVASELPGIP 475
Query: 348 KGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYN 407
+W+L+ S D +D ++V+SF + T +L++ + +EE GF + TL +
Sbjct: 476 SAVWTLKLSRGDQYDAYIVLSFTNGTLVLSIG--ETVEEVSDSGFLTSVPTLAAQLLGED 533
Query: 408 QLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDG 466
L+QV +R + NEW +P S+ AT N QV +A G +VY E+ GDG
Sbjct: 534 GLIQVHPKGIRHIRHGQI---NEWAAPQHRSIVAATTNEHQVAVALSSGEIVYFEMDGDG 590
Query: 467 ILTEVKHAQLEY-EISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHL 524
L E + + ++CL + + E S AVG D +VR+ SL PD L +K
Sbjct: 591 SLAEYDEKKEMFGTVTCLSLGEVPEGRLRSSFLAVGC-DDSTVRVLSLDPDSTLESKSVQ 649
Query: 525 GGEIIPRSVLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPI 578
P S+ + A + S YL L G L +L+ TGELTD ++ LG +P+
Sbjct: 650 ALTAPPSSLAIIAMDDSSSGGSALYLHIGLHSGVYLRTVLDEVTGELTDTRQKFLGPKPV 709
Query: 579 TLRTFSSKNTTHVFAASDRPTVIYSS--NKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIA 636
L + + T V S RP + YS + + + +N ++ + F+S + +
Sbjct: 710 RLFQVTVQGRTCVIGLSSRPWLGYSDPITRGFVVTPLNYVDLEWVWNFSSEQCEEGVVGI 769
Query: 637 KEGELTIGTIDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFV- 694
+ L I I+++ + +SIPL PRR+ F N + + +
Sbjct: 770 QGQSLRIFAIENLGDTITQKSIPLTYTPRRLLKHPDHPMFYTVESDNNTLPPDLRAKLIA 829
Query: 695 ---------RLLDDQTFEF-----------------------ISTYPLDTFEYGCSILSC 722
+L F + + T LD E S
Sbjct: 830 EPGVVNGDATVLPPDEFGYPKGKGRWASCISVIDPLSEEQRVLQTVDLDDNEAAVSAAIV 889
Query: 723 SF-SDDSNVYYCVGTAY-VLPEENEPTKGRILVF-IVEDG-KLQLIAEKETKGAVYSLNA 778
SF S D+ + VGT ++ + T+G I V+ EDG +L+ I + + + +L A
Sbjct: 890 SFASQDNENFLIVGTGKDMIVNPRQFTEGYIHVYRFGEDGHELEFIHKTKVEEPPTALLA 949
Query: 779 FNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSIS 838
F G+LLA + + +++Y LR R+ Q++ L + + T+G IVVGD+ ++
Sbjct: 950 FQGRLLAGVGKTLRIYDLGLR-QMLRKSQADVAPQ---LIVSLSTQGSRIVVGDVQHGVT 1005
Query: 839 LLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDEE 895
++YK + D A W++ ++D + G + N+F VR K S+ A +E+
Sbjct: 1006 YVVYKPTTNKLIPFVDDTIARWVTCTTMVDYESVAGGDKFGNMFLVRCPEKASQEADEEQ 1065
Query: 896 RG---------------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVN 940
G RL ++ + + SLV VG +++ +N
Sbjct: 1066 AGLHLLNTRDYLHGAPHRLNLLSHSYTQDVPTSITKTSLV-------VGGQDVLLWSGIN 1118
Query: 941 GVIGV-IASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDL 999
G IGV I + E F + L+ ++R + G +H +R + V K +DGDL
Sbjct: 1119 GTIGVFIPFVTREDVDFFQNLEQHMRTEDAPLAGRDHLMYRGY-----YVPVKGVIDGDL 1173
Query: 1000 IESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
E + L + I+ ++ SV E+ +++ ++
Sbjct: 1174 CERYTLLPNDKKLMIAGELDRSVREIERKISDI 1206
>gi|336371417|gb|EGN99756.1| hypothetical protein SERLA73DRAFT_88390 [Serpula lacrymans var.
lacrymans S7.3]
gi|336384183|gb|EGO25331.1| hypothetical protein SERLADRAFT_355643 [Serpula lacrymans var.
lacrymans S7.9]
Length = 1216
Score = 230 bits (586), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 270/1174 (22%), Positives = 484/1174 (41%), Gaps = 169/1174 (14%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
++G I +L FR G +D+ + ++ + +L +D +S + R
Sbjct: 58 VFGSIRSLAAFRLTGGTKDYAIVGSDSGRIVILDYDPRTSSFVKLHQETFGKSGSRRIVP 117
Query: 66 GQIGIIDPDCRLIGLHLYDG--LFKVIPFDNKGQLKEAFNIR-------LEELQVLDIKF 116
GQ DP R + + + L ++ D L + + + + LD+ F
Sbjct: 118 GQYLATDPKGRSVMISAMEKAKLVYILNRDAAANLTISSPLEAHKNAAIIHHIVGLDVGF 177
Query: 117 ---LYGCAKPTIVVLYQD------NKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLL 167
L+ + QD N + + YE+ L V WS+ D A+LL
Sbjct: 178 ENPLFAALEVDYTESDQDPTGEAFNNAEKMLTYYELDLGLNHVVRK-WSEPT-DPRANLL 235
Query: 168 IPVP------------PPLCGVLIIGEETIVYCSANAFK-AIPI-----------RPSIT 203
+ VP P GVL+ E+ I+Y + IPI R I
Sbjct: 236 VQVPGGQLASSERFDGP--SGVLVCCEDHIIYRHMGVPQHRIPIPRRRHPLEDPERGIII 293
Query: 204 KAYGRVDADGSRY-LLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVV 262
A G+ + LL G L+ + I HE ++V LKI+ +AS++ L + +
Sbjct: 294 VAAVMHKMKGAFFFLLQSEDGDLYKVTIEHEDDEVKALKIKYFDTVPVASSLCILKSGFL 353
Query: 263 YIGSSYGDSQLIKLNLQPD-------AKGSY------------------------VEVLE 291
++ S +G+ L + D + SY + + +
Sbjct: 354 FVASEFGNHYLYQFQKLGDDDNEPEFSSTSYPSFGMAEPSMPLPHAHFHPRALDNLALAD 413
Query: 292 RYVNLGPIVDFCVVD-LERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KG 349
+L PI+D V++ L Q+ G SLR++R+G+ + E S EL GI
Sbjct: 414 EMESLDPILDSKVMNILPNSDTPQIFAACGRGARSSLRMLRHGLDVEESVSSELPGIPNA 473
Query: 350 MWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQL 409
+W+ + DDPFD+++++SF++ T +L++ + +EE + GF S TL + L
Sbjct: 474 VWTTKRKEDDPFDSYIILSFVNGTLVLSIG--ETIEEVQDTGFLSSAPTLAVQQIGADAL 531
Query: 410 VQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGIL 468
+QV +R V + R NEW+ P G ++ A N QV++A LVY E+ DG L
Sbjct: 532 LQVHPQGIRHVLADRRV--NEWRVPQGKTIVSAATNKRQVVVALSSAELVYFELDLDGQL 589
Query: 469 TEVKHAQ-LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGE 527
E + + + + L + + E + AVG D +VRI SL + + L
Sbjct: 590 NEYQDRKAMGSTVLALSVGEVPEGRQRTPYLAVGC-EDQTVRIISLDPESTLETISLQAL 648
Query: 528 IIPRSVLLCA---------FEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPI 578
P S + A + ++ L +G LL +L+ G+LTD + LGT+PI
Sbjct: 649 TAPPSAICIADMLDASINKTQPTMFVNIGLQNGVLLRTVLDPINGQLTDTRTRFLGTRPI 708
Query: 579 TLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKE 638
L + A S R + Y+ + ++ + + + + F++ P+ L
Sbjct: 709 RLVRVQIHGNPAILALSSRSWLNYTHQNLMHFTPLIFENLDYAWSFSAELSPEGLIGIAG 768
Query: 639 GELTIGTIDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEES------EM 691
L I I + KL +IPL PR+ +R F + ++ E++ EM
Sbjct: 769 SVLRIFQIPKLGTKLKQDTIPLSYTPRKFISHPTNRYFYLIEGDHRVLGEDAAAKRLDEM 828
Query: 692 H------------------------------FVRLLDDQTFEFISTYPLDTFEYGCSILS 721
+R++D + +S LD+ E S+
Sbjct: 829 RQRGKMVDDNMLDLPPEVFGRPKAPAGTWASAIRIIDPVEAKTLSMITLDSNECAFSLAI 888
Query: 722 CSFSDDSN-VYYCVGTA---YVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLN 777
FS N ++ VGTA ++ P R F + L+L+ + ET +L
Sbjct: 889 VPFSARGNELHLVVGTAADTFLAPRSCSSGFLRTYKFTEDGTGLELLHKTETDDVPLALM 948
Query: 778 AFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSI 837
AF G+L+A + + +++Y + + R+++++ + + T+G I+VGD+ +SI
Sbjct: 949 AFQGRLVAGVGKALRIYD-IGKKKLLRKVENKARATFSTAIVTLNTQGSRIIVGDMQESI 1007
Query: 838 SLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDE 894
S + YK E + A D W++A ++D + N+F R K SE D+
Sbjct: 1008 SFVAYKAPENRLLVFADDNQPRWITATTMVDYTTIAAGDRFGNIFVNRLDPKVSEQVDDD 1067
Query: 895 ERG---------------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTV 939
G + +++ +H+G+ V SLV G +++ +
Sbjct: 1068 PTGAGILHEKGLLMGAPHKTKMLAHFHIGDLVTSINKVSLV-------AGGREVLLYTGL 1120
Query: 940 NGVIGVIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGD 998
+G IG++ + E F+ L+ ++R + G + WR + K+V +DGD
Sbjct: 1121 HGTIGILVPFVSKEDVDFISTLEQHMRTEQGSLVGRDQLSWRGYYTPVKSV-----VDGD 1175
Query: 999 LIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
L E++ L T+ I+ ++ +V E+ K++E+L
Sbjct: 1176 LCETYARLPGTKQSAIAGELDRTVGEVLKKLEQL 1209
>gi|403411971|emb|CCL98671.1| predicted protein [Fibroporia radiculosa]
Length = 1212
Score = 230 bits (586), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 282/1179 (23%), Positives = 487/1179 (41%), Gaps = 183/1179 (15%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
++G I +L FR G +D+ + ++ + +L +D ++S R
Sbjct: 58 VFGSIRSLAAFRLTGGTKDYAVVGSDSGRIVILDYDPKTSSFTKVHQETYGKSGARRIVP 117
Query: 66 GQIGIIDPDCRLIGLHLYDG--LFKVIPFDNKGQLK-----EAF--NIRLEELQVLDIKF 116
GQ DP R + + + L ++ D L EA N + + LD+ F
Sbjct: 118 GQYLATDPKGRSVMISAVEKAKLVYILNRDAAANLTISSPLEAHKNNSIIHHIVGLDVGF 177
Query: 117 LYGCAKPTIVVLYQDNKDARHVKT-------------YEVALKDKDFVEGPWSQNNLDNG 163
P L D D+ T YE+ L V WS+ D
Sbjct: 178 ----DNPMFAALEVDYSDSDQDPTGEAFNNAEKMLTYYELDLGLNHVVRK-WSEPT-DPR 231
Query: 164 ADLLIPVP------------PPLCGVLIIGEETIVYCSANAFK---AIPIRPS------- 201
A+LL+ VP P GVL+ E+ I+Y +A + IP R
Sbjct: 232 ANLLVQVPGGQLASSDRYDGP--SGVLVCCEDHIIYRHMDAPQHRIPIPRRQHPLADTHR 289
Query: 202 ---ITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIAST----- 253
IT A +LL G L+ + I HE + V LKI+ IAS
Sbjct: 290 GLIITAAVMHKMKGAFFFLLQSEEGDLYKVTIEHEDQDVRALKIKYFDTVPIASIEYNFN 349
Query: 254 ----ISYL---------------DNAVVYIGSSYGD-------SQLIKLNLQPDAKGSYV 287
+SY+ DN + +S+ S L +P + +
Sbjct: 350 CRICVSYIILESYLYQFQKLGDDDNEPEFSSTSFPSFGMADPSSPLPHAYFKPHILDN-L 408
Query: 288 EVLERYVNLGPIVDFCVVDL-ERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG 346
+++ +L P++D V++L QV G + R++R+G+ + E S +L G
Sbjct: 409 TLVDEMESLSPVLDAKVMNLLPNSDTPQVFAACGRGGRSTFRMLRHGLEVEEVVSSDLPG 468
Query: 347 I-KGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAI 405
I +W+ + DDP+D+++++SF++ T +L + + +EE + GF S TL
Sbjct: 469 IPNAVWTTKLKEDDPYDSYIILSFVNGTLVLCIG--ETIEEVQDTGFLSSAPTLAVQQIG 526
Query: 406 YNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG- 464
Y+ L+QV +R V S R NEW+ P G ++ AT N QV++A LVY E+
Sbjct: 527 YDALLQVHPHGIRHVLSDKRV--NEWRVPQGKTIVAATTNKRQVVVALSSAELVYFELDL 584
Query: 465 DGILTEVKHAQ-LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEH 523
DG L E + + + + L I + E + AVG D +VRI SL + +
Sbjct: 585 DGQLNEYQDRKAMGSTVLALSIAEVPEGRQRTPYLAVGC-EDQTVRIVSLDPESTLETIS 643
Query: 524 LGGEIIPRSVLLCA---------FEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLG 574
L P S + A + S++ L +G LL +L+ G+LTD + LG
Sbjct: 644 LQALTAPPSAICIADMLDAGINKTQPTSFVNIGLQNGVLLRTVLDPVNGQLTDTRTRFLG 703
Query: 575 TQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLA 634
T+PI L + + A S R + Y+ + ++ + + + + F++ P+ L
Sbjct: 704 TRPIKLIRVLVQRNPGILALSSRSWLNYTYQNLMHFTPLIFENLDYAWSFSAELCPEGLI 763
Query: 635 IAKEGELTIGTIDDI-QKLHIRSIPLGEHPRR-ICH----------------QEQSRTFA 676
L I I + KL +IPL PR+ I H +++ A
Sbjct: 764 GISGSVLRIFQIPKLGTKLKQDAIPLSYTPRKFISHPTNGLFYLIEGDHRVRSDEASAKA 823
Query: 677 ICSLKNQSCAEESEM-------------------HFVRLLDDQTFEFISTYPLDTFEYGC 717
+ L+ Q + E+ +R+++ + ++ PLD E
Sbjct: 824 LGELRQQGKMVDDELVNLPPETFGRQKAPAGTWASCIRIINPVDAKTVNIIPLDNNEAAF 883
Query: 718 SILSCSFSDDSN-VYYCVGTA---YVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAV 773
S+ SFS S ++ VGTA ++ P R F + L+L+ + ET
Sbjct: 884 SLAVVSFSARSGELHLVVGTAQDTFLAPRSCTSGFLRTYRFTDDGTNLELLHKTETNDVP 943
Query: 774 YSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY-VQTRGDFIVVGD 832
++ F G+L+A + + ++LY D G ++L + + A+ + T+G I+VGD
Sbjct: 944 LAVLGFQGRLVAGVGKALRLY-----DMGKKKLLRKVENKTFASAIVSLATQGSRILVGD 998
Query: 833 LMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSE 889
+ +S+S +YK E + A D W SA+ ++D + A+ N++ R K SE
Sbjct: 999 MQESVSFAVYKPPENKLLVFADDTQPRWTSAMTMVDYNTVASADRFGNIYVNRLDPKVSE 1058
Query: 890 GATDEERG---------------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTV 934
D+ G + E++ +H+G+ V SLV G +
Sbjct: 1059 QVDDDPTGAGILHEKGLLAGAPHKTELLSHFHVGDIVTSINKVSLV-------AGGREVL 1111
Query: 935 IFGTVNGVIGVIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKN 993
++ ++G IG++ + E F+ L+ ++R + G +H WR + V K
Sbjct: 1112 LYTGLHGTIGILVPFVSKEDVDFISTLEQHMRTEQLSLVGRDHLTWRGY-----YVPVKA 1166
Query: 994 FLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
+DGDL E+F L ++ I+ ++ +V E+ K++E+L
Sbjct: 1167 VVDGDLCETFARLPASKQSAIAGELDRTVGEVLKKLEQL 1205
>gi|440636768|gb|ELR06687.1| pre-mRNA-splicing factor rse1 [Geomyces destructans 20631-21]
Length = 1212
Score = 230 bits (586), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 285/1168 (24%), Positives = 489/1168 (41%), Gaps = 166/1168 (14%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
++G I TL FR G +D++ I ++ + ++++ + + R
Sbjct: 64 VFGIIRTLAAFRLAGSNKDYIIIGSDSGRITIVEFVPAQNRFNRLHLETFGKSGVRRVIP 123
Query: 66 GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YGC 120
GQ +DP R LI + L V+ +++ +L + + + Q L + G
Sbjct: 124 GQYLAVDPKGRACLIASVEKNKLVYVLNRNSQAELTISSPLEAHKAQTLVFALVALDVGY 183
Query: 121 AKPTIVVLYQDNKDARHVKT-------------YEVALKDKDFVEGPWSQNNLDNGADLL 167
A P L D D+ T YE+ L V WS + +D A +L
Sbjct: 184 ANPVFAALEVDYGDSDQDPTGQAFEEIEKTLVYYELDLGLNHVVR-KWS-DPVDRTASIL 241
Query: 168 IPVPPPL---CGVLIIGEETIVYCSAN--AFK-AIPIRPSITKAYGRV------------ 209
VP GVL+ GE+ I Y +N AF+ AIP R T+ R
Sbjct: 242 FQVPGGTDGPSGVLVCGEDNITYRHSNQEAFRVAIPRRKGATEDPQRKRSIVAGVMHKMR 301
Query: 210 DADGSRYLL--GDHAGLLHLLV--ITHEKEKVTG----LKIELLGETSIASTISYLDNAV 261
A G+ + L D L + + I + + TG LKI+ IA+++ L +
Sbjct: 302 GAAGAFFFLLQSDDGDLFKITIEMIEDDNGQPTGEVRRLKIKYFDTVPIATSLCILKSGF 361
Query: 262 VYIGSSYGDSQLIKL---------------NLQPDAKGSYVEV------------LERYV 294
+++ S +G+ Q + N D Y V +E
Sbjct: 362 LFVASEFGNHQFYQFEKLGDDDDETEYISDNFPTDPAEPYTPVYFHPRPAENLNLVESID 421
Query: 295 NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSL 353
++ P++D V +L + Q+ + G + R +++G+ +NE EL G+ +W+
Sbjct: 422 SMNPLMDCKVANLTEEDAPQIYSICGTGARSTFRTLKHGLEVNEIVESELPGVPSAVWTT 481
Query: 354 RSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVT 413
+ + D +D +++++F + T +L++ + +EE GF S TL + L+QV
Sbjct: 482 KLTRGDEYDAYIILAFSNGTLVLSIG--ETVEEVTDTGFLSSATTLAVQQLGEDGLIQVH 539
Query: 414 SGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTEV- 471
+R + + R NEW +P S+ AT NA QV +A G +VY E+ DG L E
Sbjct: 540 PKGIRHIRADRR--VNEWAAPQHRSIVAATTNAQQVAVALSSGEIVYFEMDSDGSLAEYD 597
Query: 472 KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIP 530
+ ++ ++CL + + E S AVG D +VRI SL PD L K P
Sbjct: 598 EKREMSGTVTCLSLGEVPEGRVRSNFLAVGC-DDSTVRILSLDPDSTLENKSVQALTSAP 656
Query: 531 RSVLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFS 584
S+ + A S YL L G L +L+ TGEL+D + LG +P L S
Sbjct: 657 SSLSIMAMTDSSSGGSTLYLHIGLYSGVYLRTVLDEITGELSDTRTRFLGPKPAKLFRVS 716
Query: 585 SKNTTHVFAASDRPTVIYSS--NKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELT 642
+ T V A S RP + YS K + + ++ + F+S P+ + + L
Sbjct: 717 VQGQTAVLALSSRPWLGYSDPVTKGFMLTPLDYAGLEWGWNFSSEQCPEGMVGIQGQNLR 776
Query: 643 IGTIDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQT 701
I +I+ + + L SIPL PRR + F + N ++ ++ +LL+D +
Sbjct: 777 IFSIEKLTENLLQHSIPLSYTPRRFVKHPEHPYFYVIQSDNNILSKSTKQ---KLLEDPS 833
Query: 702 FEFISTYPLDTFEYG-----------------------------------CSILSCSF-S 725
+ + L E+G S+ + SF S
Sbjct: 834 LQNGDSAVLPAEEFGYPRGRGHWASCIQIVDPIREKKVLQQIDLEDNEAAVSMATVSFAS 893
Query: 726 DDSNVYYCVGTAY-VLPEENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVYSLNAFNGK 782
+ V+ VGT ++ + G I V+ EDGK ++ I + + + +L F G+
Sbjct: 894 QEDEVFLVVGTGKDMVASPRSSSGGFIHVYRFHEDGKEIEFIHKTKVEEPPLALLGFQGR 953
Query: 783 LLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIY 842
LL I +++++Y +R R+ Q+E L + +QT+G I+V D+ +SI+ ++Y
Sbjct: 954 LLVGIGRELRIYDLGMR-QLLRKAQTEIAAS---LIVGLQTQGSRIIVSDVQESITFVVY 1009
Query: 843 KHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDEERGRL 899
K +E + A D A W + ++D + G + NL+ +R K SE A +E G
Sbjct: 1010 KFQENKLIPFADDTIARWTTCTTMVDYETVAGGDKFGNLWLLRCPTKASEEADEEGSGAH 1069
Query: 900 EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGT-----------VNGVIGVIAS 948
V H +++ H V + + IPT I T +G+ G IA
Sbjct: 1070 LV----HERQYLQGAPH--RVALMAHNFANDIPTSIQKTNLVAGGRDCLLWSGLQGTIAI 1123
Query: 949 L----PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFL 1004
+ E F + L+ +LR + G +H +RS+ V K +DGDL E +
Sbjct: 1124 MIPFVSREDVDFFQTLEQHLRTEDAPLAGRDHLIYRSY-----YVPVKGVIDGDLCERYT 1178
Query: 1005 DLSRTRMDEISKTMNVSVEELCKRVEEL 1032
L + I+ + SV E+ +++ ++
Sbjct: 1179 LLPTDKKMMIAGEFDRSVREIERKISDM 1206
>gi|449546293|gb|EMD37262.1| hypothetical protein CERSUDRAFT_83017 [Ceriporiopsis subvermispora B]
Length = 1213
Score = 229 bits (585), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 270/1178 (22%), Positives = 490/1178 (41%), Gaps = 180/1178 (15%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
++G I +L FR G +D+ + ++ + +L+++ ++S R
Sbjct: 58 VFGSIRSLAAFRLTGGTKDYAIVGSDSGRIVILEYEPKTSSFTKVHQETFGKSGARRIVP 117
Query: 66 GQIGIIDPDCRLIGLHLYDG--LFKVIPFDNKGQLK-----EAF--NIRLEELQVLDIKF 116
GQ DP R + + + L ++ D L EA N + + LD+ F
Sbjct: 118 GQYLATDPKGRSVMIAAVEKAKLVYILNRDAAANLTISSPLEAHKNNAIIHHIVGLDVGF 177
Query: 117 LYGCAKPTIVVLYQDNKDARHVKT-------------YEVALKDKDFVEGPWSQNNLDNG 163
P L DN ++ T YE+ L V WS+ D
Sbjct: 178 ----DNPMFAALEVDNSESDQDPTGEAFNNAEKMLTYYELDLGLNHVVRK-WSEPT-DPR 231
Query: 164 ADLLIPVP------------PPLCGVLIIGEETIVYCSANAFK---AIPIRPS------- 201
A+LL+ VP P GVL+ E+ I+Y +A + IP R
Sbjct: 232 ANLLVQVPGGQLAASDRYDGP--SGVLVCCEDHIIYRHMDAPQHRVPIPRRRHPLQDADR 289
Query: 202 ---ITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLD 258
IT A +LL G L+ + I HE++ V LKI+ +AS++ L
Sbjct: 290 GLIITAAVMHKMKGAFFFLLQSEEGDLYKVTIDHEEQDVKALKIKYFDTVPVASSLCILK 349
Query: 259 NAVVYIGSSYGDSQLIKL------NLQPD-AKGSY------------------------V 287
+ +++ S +G+ L + + +P+ + SY +
Sbjct: 350 SGFLFVASEFGNHYLYQFQKLGDDDFEPEFSSASYPSFGMADPSIPLPPVYFRPRALDNL 409
Query: 288 EVLERYVNLGPIVDFCVVDL-ERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG 346
+++ +L P+VD V++L Q+ T G S R++R+G+ + E S +L G
Sbjct: 410 TLVDEMESLSPVVDSKVMNLLPNSDTPQIFTACGRGPRSSFRMLRHGLDVEEVVSSDLPG 469
Query: 347 I-KGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAI 405
I +W+ + DDP+D+++++SF++ T +L + + +EE + GF S TL
Sbjct: 470 IPNAVWTTKLKEDDPYDSYIILSFVNGTLVLCIG--ETIEEVQDTGFLSSAPTLAVQQIG 527
Query: 406 YNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG- 464
+ L+QV +R V + R NEW+ P G ++ AT N QV++A LVY E+
Sbjct: 528 ADALLQVHPHGIRHVLADKRV--NEWRVPQGKTIVCATTNKRQVVVALSSAELVYFELDL 585
Query: 465 DGILTEVKHAQ-LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEH 523
DG L E + + + + L I + E + AVG D +VRI SL + +
Sbjct: 586 DGQLNEYQDRKAMGSTVLALSIAEVPEGRQRTPYLAVGC-EDQTVRIISLDPESTLDTIS 644
Query: 524 LGGEIIPRSVLLCA---------FEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLG 574
L P S + A + +++ L +G LL +L+ G+LTD + LG
Sbjct: 645 LQALTAPPSAICIADMLDAGINKTQQTTFVNIGLQNGVLLRTVLDPINGQLTDTRTRFLG 704
Query: 575 TQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLA 634
T+PI L + + A S RP + Y+ + ++ + + + + F++ P+ L
Sbjct: 705 TRPIKLIRVLVQRNPGILALSSRPWLNYTHQNLMHFTPLIFENLDYAWSFSAELCPEGLI 764
Query: 635 IAKEGELTIGTIDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHF 693
L I I + KL +IPL PR+ + + ++ E++
Sbjct: 765 GISGSVLRIFHIPKLGTKLKQDAIPLLYTPRKFIPHPTNSLLYMIEGDHRVMGEDAAARK 824
Query: 694 V-------RLLDDQTFEF-----------------------------ISTYPLDTFEYGC 717
+ R +DD+ + ++ LD E
Sbjct: 825 LDELRQKGREIDDEVVQLPPELFGRPKAPAGTWASCIRIINPVDAKTVNVIHLDNNEMAF 884
Query: 718 SILSCSF-SDDSNVYYCVGTA---YVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAV 773
S+ F + ++ VGTA ++ P R F + L+L+ + ET
Sbjct: 885 SLAVVPFAARGGELHLVVGTAQDTFLTPRSCTSGFLRTYRFSDDGQNLELLHKTETNDVP 944
Query: 774 YSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDL 833
++ AF GKL+A + + +++Y M + R+++++ I+AL+ T+G I VGD+
Sbjct: 945 LAVMAFQGKLVAGVGKSLRIYD-MGKKKLLRKVENKT-FSAAIVALH--TQGSHIYVGDM 1000
Query: 834 MKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEG 890
+S+ +YK E + A D W++A+ ++D + A+ N+F R K SE
Sbjct: 1001 QESVFYAVYKAPENRLLVFADDTQPRWITAMTMIDYNTVAAADRFGNVFVNRLDPKVSEQ 1060
Query: 891 ATDEERG---------------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVI 935
D+ G + ++ +H+G+ + +LV G ++
Sbjct: 1061 VDDDPTGAGILHEKSILFGAPHKTSMLAHFHVGDLITSINKVALV-------AGGREVLL 1113
Query: 936 FGTVNGVIGVIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNF 994
+ ++G IG++ + E F+ L+ ++R + G +H WR + V K
Sbjct: 1114 YTGLHGTIGILMPFVSKEDVDFISTLEQHMRTEQLSLVGRDHLTWRGY-----YVPVKAV 1168
Query: 995 LDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
+DGDL E+F L ++ I+ ++ +V E+ K++E+L
Sbjct: 1169 VDGDLCETFARLPASKQSAIAGELDRTVGEVLKKLEQL 1206
>gi|168045572|ref|XP_001775251.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162673464|gb|EDQ59987.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1201
Score = 229 bits (585), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 248/1039 (23%), Positives = 449/1039 (43%), Gaps = 168/1039 (16%)
Query: 132 NKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVP-----PPLCGVLIIGEETIV 186
N+ +H+ YE+ L V W++ +DNGA++L+ VP P GVL+ E ++
Sbjct: 188 NEAQKHLTFYELDLGLNHVVR-KWTEP-IDNGANMLVTVPGGGDGP--SGVLVCAENFVI 243
Query: 187 YCSAN---AFKAIPIRPSITKAYGRVDADGSR--------YLLGDHAGLLHLLVITHEKE 235
Y + N IP R + G + + +LL G + + + ++K+
Sbjct: 244 YKNQNHPDIRAVIPRRQDLPPNRGVLIVSAATHKQKALFFFLLQTEYGDIFKVTLDYDKD 303
Query: 236 -KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPD------AKGSYVE 288
+VT LK++ + S + L + ++ S +G+ L + D + + VE
Sbjct: 304 DQVTELKVKYFDTIPVTSAMCVLKSGFLFAASEFGNHSLYQFQSIGDDPEVESSSSTLVE 363
Query: 289 VLERYV-------------------NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRI 329
E Y +L PI+D V ++ + Q+ + G SLRI
Sbjct: 364 TEEGYQPVFFQPRKLKNLVQIDDIESLMPIMDMKVANIFEEETPQIFSLCGRGPRSSLRI 423
Query: 330 VRNGIGINEQASVELQGI-KGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETE 388
+R G+ + E A L G+ +W+++ +D FD ++VVSF++ T +L++ + +EE
Sbjct: 424 LRPGLAVTEMAVSPLPGVPSAVWTVKKHANDEFDAYIVVSFVNATLVLSIG--ETVEEVS 481
Query: 389 IEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQ 448
GF T +L + L+QV +R + S R NEWK+P ++ N Q
Sbjct: 482 DSGFLDTTPSLAISLLGDDSLMQVHPSGIRHIRSDGR--INEWKTPGKKTIVKVGYNRMQ 539
Query: 449 VLLATGGGHLVYLEIG-DGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDIS 507
V++A GG L+Y E+ G L E++ + +++CLDI P+ E S+ AVG + D +
Sbjct: 540 VVIALSGGELIYFEMDMSGQLMEIEKRDMTGDVACLDIAPVPEGRQRSRFLAVGSY-DST 598
Query: 508 VRIFSL-PD--LNLITKEHLGGEIIPRSVLLCAFEGIS------------YLLCALGDGH 552
+RI SL PD + +++ + + P S+LL + + +L L +G
Sbjct: 599 IRILSLDPDDCMQILSVQAVSSP--PESLLLLEVQASTGGEDGADHPASVFLNAGLQNGV 656
Query: 553 LLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSN 612
LL ++M TG+L+D + LG + L + + + S RP + Y L +
Sbjct: 657 LLRTEVDMVTGQLSDTRTRFLGLRAPKLFSALVRGRRAMLCLSSRPWLGYIHQGHFLLTP 716
Query: 613 VNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDI-QKLHIRSIPLGEHPRR-ICHQE 670
++ + + + F+S + + L + TI+ + + + +PL PR+ I H +
Sbjct: 717 LSYETLEYAASFSSDQCAEGVVAVAGDALRVFTIERLGETFNQTVVPLRYTPRKFILHPK 776
Query: 671 QSRTFAICSLKNQSCAEESEMH-------------------------------------- 692
Q + S + EE E +
Sbjct: 777 QKTLIILESDEGAFSTEEREANKKEAVEATGGHENGKADGEDEEMADEESEDPLPDEQYG 836
Query: 693 -----------FVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNV--YYCVGTAYV 739
+R+LD +T L E SI + +F D+ + VGTA
Sbjct: 837 YPKAEANKWVSCIRVLDPKTSTTTCLLELQENEAAFSICTVNFHDNKELGTLIAVGTAKD 896
Query: 740 LP--EENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYK 795
L E + G I ++ E+G+ L+L+ + G +L F G+LL + Q +++Y
Sbjct: 897 LQFMPRKEASGGFIHIYRFAEEGRVLELVHKTPVDGVPTALCQFQGRLLVGVGQVLRIY- 955
Query: 796 WMLRDDGTRELQSECGHHGH---ILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEER 852
D G R+L +C + I+A++ T GD I VGD+ +S + Y+ +E +
Sbjct: 956 ----DLGKRKLLRKCENKNFPNTIIAIH--TYGDRIYVGDIQESFHYVKYRRDENQLYTF 1009
Query: 853 ARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSE-----------GATDEERGRL-- 899
A D W++A +D D GA+ N++ +R + G E+GRL
Sbjct: 1010 ADDSCPRWLTASLHIDFDTMAGADKFGNVYVMRLPQDVSEEIEDDPTGGKIKWEQGRLNG 1069
Query: 900 -----EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQ 953
+ + ++H+GE V + SL+ G ++++GTV G +G + E
Sbjct: 1070 APNKVDEIIQFHVGEVVTSLQKASLI-------PGGGESMLYGTVMGSMGALLPFSSRED 1122
Query: 954 YLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDE 1013
F L+ +LR+ + G +H +RS K+ +DGDL E + L+ +
Sbjct: 1123 VDFFSHLEMHLRQENPPLCGRDHMAFRS-----AYFPVKDVIDGDLCEQYSMLTSELQKK 1177
Query: 1014 ISKTMNVSVEELCKRVEEL 1032
I+ ++ + E+ K++E++
Sbjct: 1178 IADDLDRTPGEIVKKLEDI 1196
>gi|170083859|ref|XP_001873153.1| predicted protein [Laccaria bicolor S238N-H82]
gi|164650705|gb|EDR14945.1| predicted protein [Laccaria bicolor S238N-H82]
Length = 1213
Score = 228 bits (582), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 268/1174 (22%), Positives = 476/1174 (40%), Gaps = 172/1174 (14%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
++G I +L FR G +D+ + ++ + +L +D ++S + R
Sbjct: 58 VFGSIRSLAAFRLTGGTKDYAIVGSDSGRIIILDYDPKTSSFVKLHQETYGKSGARRIVP 117
Query: 66 GQIGIIDPDCRLIGLHLYDG--LFKVIPFDNKGQLKEAFNIR-------LEELQVLDIKF 116
GQ DP R + + + L ++ D L + + + + LD+ F
Sbjct: 118 GQYLATDPKGRSVMISAMEKAKLVYILNRDAAANLTISSPLEAHKNSAIIHHIVGLDVGF 177
Query: 117 ---LYGCAKPTIVVLYQDN-----KDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLI 168
++ + QD K+A + TY + V WS+ D A+LL+
Sbjct: 178 ENPMFAALEVDYAESDQDPTGDAFKNAEKMLTYYELDLGLNHVVRKWSEPT-DPRANLLV 236
Query: 169 PVP------------PPLCGVLIIGEETIVYCSANAFK---AIPIRPS----------IT 203
VP P GVL+ E I+Y A+ + IP R IT
Sbjct: 237 QVPGGQLASSERFDGP--SGVLVCCENHIIYRHADVPQHRVPIPRRRHPLEDLNRGLIIT 294
Query: 204 KAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVY 263
A +LL G L+ + I HE E+V LKI+ +AS + L + ++
Sbjct: 295 AAVMHKMKGAFFFLLQSEDGDLYKVTIEHEDEEVKSLKIKYFDTVPVASNLCILKSGFLF 354
Query: 264 IGSSYGDSQLIKLNLQPD-------AKGSY---------VEVLERYVNLGPIVDFCVVD- 306
+ S +G+ L + D + SY V + + L P+ + V D
Sbjct: 355 VASEFGNHYLYQFQKLGDDDDEPEFSSTSYPSFGMADPSVPLPHAFFRLHPLDNLVVADE 414
Query: 307 ---------------LERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGM 350
L Q+ G + R +R+G+ + E S +L GI +
Sbjct: 415 LESLDPILDSKVMNLLPNSDTPQIFAACGRGARSTFRTLRHGLEVEESVSSDLPGIPNAV 474
Query: 351 WSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLV 410
W+ + + DDPFD+++++SF++ T +L++ + +EE + GF S TL + L+
Sbjct: 475 WTTKRTEDDPFDSYIILSFVNGTLVLSIG--ETIEEVQDTGFLSSAPTLAVQQIGADALL 532
Query: 411 QVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILT 469
QV +R V + R NEW+ P G ++ AT N QV++A LVY E+ DG L
Sbjct: 533 QVHPHGIRHVLADRRV--NEWRVPQGKTIVSATTNKRQVVVALSSAELVYFELDLDGQLN 590
Query: 470 EVKHAQ-LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEI 528
E + + + + L I + E + AVG D +VRI SL + L
Sbjct: 591 EYQDRKAMGSTVLALSIGDVPEGRQRTPYLAVGC-EDQTVRIISLDPETTLETISLQALT 649
Query: 529 IPRSVLLCA---------FEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPIT 579
P S + A + ++ L +G LL +L+ G+LTD + LGT+PI
Sbjct: 650 APPSAICIADMLDASINKSQPTMFVNIGLQNGVLLRTVLDPINGQLTDTRTRFLGTRPIK 709
Query: 580 LRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEG 639
L + + A S R + Y+ + ++ + + + F++ P+ L
Sbjct: 710 LVRVQIQRNPGILALSSRSWLNYTHQNLMHFTPLIYDNLDYAWSFSAELSPEGLIGIAGS 769
Query: 640 ELTIGTIDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEES--------- 689
L I + + KL SIPL PR+ + F + +++ EE+
Sbjct: 770 VLRIFQVPKLGTKLKQDSIPLSYTPRKFITHPGNNYFYLIEGDHRTLGEEAASTKLDEFR 829
Query: 690 ---------EMHF------------------VRLLDDQTFEFISTYPLDTFEYGCSILSC 722
E++ +R++D + ++ P+D+ E S+
Sbjct: 830 RQGKSFDDGEINIPPEVFGRPKAPAGTWASCIRIIDPVEAKTVAVIPMDSNEAAFSLAIV 889
Query: 723 SFS-DDSNVYYCVGTA---YVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNA 778
FS + ++ VGTA V P R F E L+ + E +L A
Sbjct: 890 PFSARNGELHLVVGTAADTIVSPRSCTSGYLRTYKFTNEGAGLEFQHKTEVDDVPLALLA 949
Query: 779 FNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY-VQTRGDFIVVGDLMKSI 837
F G+L+A + + +++Y D G ++L + + A+ + T+G I+VGD+ +SI
Sbjct: 950 FQGRLVAGVGKALRIY-----DIGKKKLLRKVENKTFSSAIVTLSTQGSRIIVGDMQESI 1004
Query: 838 SLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDE 894
+YK E + A D W++A ++D + + + N+F R K SE D+
Sbjct: 1005 QFAVYKPPENRLLVFADDSQPRWITASSMVDYNTIVAGDRFGNIFVNRLDPKVSEQVDDD 1064
Query: 895 ERG---------------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTV 939
G + +++ +HLG+ V SLV G +++ +
Sbjct: 1065 PTGAGILHEKGILMGAPHKTKMLAHFHLGDLVTSIHKVSLV-------AGGREVLMYTGL 1117
Query: 940 NGVIGVIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGD 998
+G IG++ + E F+ L+ ++R + G + WR + K V +DGD
Sbjct: 1118 HGTIGILVPFVSKEDVDFISTLEQHMRTEQSSLVGRDQLSWRGYYAPVKAV-----VDGD 1172
Query: 999 LIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
L E++ L + I+ ++ +V E+ K++E+L
Sbjct: 1173 LCETYARLPAAKQSSIAGELDRTVGEVLKKLEQL 1206
>gi|347829304|emb|CCD45001.1| similar to pre-mRNA-splicing factor rse1 [Botryotinia fuckeliana]
Length = 1212
Score = 228 bits (580), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 275/1162 (23%), Positives = 491/1162 (42%), Gaps = 154/1162 (13%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
++G I + FR G +D++ I ++ + ++++ ++ + R
Sbjct: 64 VFGIIRAIAAFRLAGSNKDYIIITSDSGRITIVEFVPAQNKFNRLHLETFGKSGVRRVVP 123
Query: 66 GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YGC 120
GQ +DP R L + L V+ +++ +L + + + Q L + G
Sbjct: 124 GQYLAVDPKGRACLTASVEKNKLVYVLNRNSQAELTISSPLEAHKAQTLVFALVALDVGY 183
Query: 121 AKPTIVVLYQDNKDARHVKT-------------YEVALKDKDFVEGPWSQNNLDNGADLL 167
A P L D ++ T YE+ L V WS+ +D A++L
Sbjct: 184 ANPVFAALEIDYGESDQDPTGQAYDEIEKQLVYYELDLGLNHVVRK-WSEP-VDRTANIL 241
Query: 168 IPVPPPL---CGVLIIGEETIVYCSAN--AFK-AIPIRPSITKAYGRVD--ADGSRYLLG 219
VP GVL+ GE+ I Y +N AF+ AIP R T+ R G + L
Sbjct: 242 FQVPGGTDGPSGVLVCGEDNITYRHSNQEAFRVAIPRRRGATEDPQRKRNIVAGVMHKLK 301
Query: 220 DHAGLLHLLVITHEKE------------------KVTGLKIELLGETSIASTISYLDNAV 261
AG L+ T + + +V LKI+ +A+++ L +
Sbjct: 302 GAAGAFFFLLQTDDGDLFKITIEMVEDDNGQPTGEVRRLKIKYFDTVPVATSLCILKSGF 361
Query: 262 VYIGSSYGDSQLIKL---------------NLQPDAKGSYVEV------------LERYV 294
+++ S +G+ Q + + A SY + +E
Sbjct: 362 LFVASEFGNHQFYQFEKLGDDDEETEFVSDDFPTGAHESYTPIYFHPRPAENLSLVESID 421
Query: 295 NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSL 353
++ P++D V +L + Q+ + G + R +++G+ ++E EL G+ +W+
Sbjct: 422 SMNPLMDCKVANLTDEDAPQIYSICGTGARSTFRTLKHGLEVSEIVESELPGVPSAVWTT 481
Query: 354 RSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVT 413
+ + D +D ++++SF + T +L++ + +EE GF S TL + L+QV
Sbjct: 482 KLTRGDTYDAYIILSFSNGTLVLSIG--ETVEEVTDTGFLSSAPTLAVQQLGEDSLIQVH 539
Query: 414 SGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTEV- 471
+R + + R NEW +P S+ AT N QV +A G +VY E+ DG L E
Sbjct: 540 PKGIRHIRADHR--VNEWAAPQHRSIVAATTNERQVAVALSSGEIVYFEMDSDGSLAEYD 597
Query: 472 KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIP 530
+ ++ ++CL + + E SQ AVG D +VRI SL PD L K P
Sbjct: 598 EKKEMSGTVTCLSLGEVPEGRQRSQFLAVGC-DDSTVRILSLDPDSTLENKSVQALTSPP 656
Query: 531 RSVLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFS 584
++ + A S YL L G L +L+ TGEL+D + LG +P+ L +
Sbjct: 657 NALSIMAMSDSSSGGSTLYLHIGLYSGVYLRTVLDEVTGELSDTRTRFLGPKPVKLFRVA 716
Query: 585 SKNTTHVFAASDRPTVIYSS--NKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELT 642
+ T V A S RP + YS K + + ++ + F+S + + + L
Sbjct: 717 VQGQTAVLALSSRPWLGYSDPVTKGFMLTPLDYPALEWGWNFSSEQCTEGMVGIQGQNLR 776
Query: 643 IGTIDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQT 701
I +I+ + L SIPL PRR + F + N + ++ +LL+D +
Sbjct: 777 IFSIEKLTDNLLQESIPLTYTPRRFVRHPEHACFYVIEADNNILSPATKQ---KLLEDPS 833
Query: 702 F-----------EF------------------------ISTYPLDTFEYGCSILSCSF-S 725
EF +ST L+ E SI +F S
Sbjct: 834 VTNGDATVLPPEEFGYPRGTNHWASCISVVDPVTEKKVLSTIHLEDNECAVSIAVVAFAS 893
Query: 726 DDSNVYYCVGTAY-VLPEENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVYSLNAFNGK 782
+ + CVGT ++ + G I V+ EDGK L+ I + + + +L AF G+
Sbjct: 894 QEDETFLCVGTGKDMVVSPRSFSAGFIHVYRFHEDGKELEFIHKTKVEEPPMALLAFQGR 953
Query: 783 LLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIY 842
LLA + + +++Y +R R+ QSE + + + +QT+G I+V D+ +SI++++Y
Sbjct: 954 LLAGVGKDLRIYDLGMR-QLLRKAQSEVAPN---MIVGLQTQGSRIIVSDVQESITMVVY 1009
Query: 843 KHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDEERG-- 897
K +E + D A W S ++D + G + NL+ +R K SE A +E G
Sbjct: 1010 KFQENRLIPFVDDTIARWTSCTTMVDYETVAGGDKFGNLWLLRCPAKASEEADEEGSGAH 1069
Query: 898 ----RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV--GQIPTVIFGTVNGVIGV-IASLP 950
R + G H + + M + +++ G +++ + G +G+ I +
Sbjct: 1070 LLHERQYLAGAPHRLTLMAHNFSQDIPMSIQKTNLVAGGRDCLLWSGLQGTLGILIPFVS 1129
Query: 951 HEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTR 1010
E F + L+ +LR + G +H +RS+ V K +DGDL E + L +
Sbjct: 1130 REDVDFFQTLEQHLRSEDPPLAGRDHLIYRSY-----YVPVKGVIDGDLCERYTLLPTDK 1184
Query: 1011 MDEISKTMNVSVEELCKRVEEL 1032
I+ ++ SV E+ +++ ++
Sbjct: 1185 KQMIAGELDRSVREIERKISDI 1206
>gi|302680006|ref|XP_003029685.1| hypothetical protein SCHCODRAFT_58785 [Schizophyllum commune H4-8]
gi|300103375|gb|EFI94782.1| hypothetical protein SCHCODRAFT_58785 [Schizophyllum commune H4-8]
Length = 1213
Score = 227 bits (579), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 273/1175 (23%), Positives = 471/1175 (40%), Gaps = 174/1175 (14%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
++G I ++ FR G +D L + ++ + +L +D +S+ + R
Sbjct: 58 VFGSIRSIAAFRLTGGTKDHLILGSDSGRIVILDYDPKSTSFVKLHQETYGKSGARRIVP 117
Query: 66 GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YGC 120
GQ DP R +IG L ++ D L + + + + + G
Sbjct: 118 GQYLATDPKGRAVMIGAMEKSKLVYILNRDAAANLTISSPLEAHKNSAIIHHIVGVDVGF 177
Query: 121 AKPTIVVLYQD-------------NKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLL 167
P L D N A+ + YE+ L V WS+ D A+LL
Sbjct: 178 ENPIFAALEVDYSESDQDPTGEAFNNAAKMLTYYELDLGLNHVVRK-WSEPT-DPRANLL 235
Query: 168 IPVP------------PPLCGVLIIGEETIVYCSANAFK---AIPIRPS----------I 202
+ VP P GVLI E+ I+Y +A + IP R I
Sbjct: 236 VQVPGGQLASSDRFDGP--SGVLICCEDHIIYRHMDAPQHRVPIPRRKHPLEDPERGIII 293
Query: 203 TKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVV 262
A +LL G L + I HE E V +KI+ +AS + L + +
Sbjct: 294 VAAVMHKMKGAFFFLLQSEDGDLFKVTIEHEDEDVKEVKIKYFDTVPVASALCILKSGFL 353
Query: 263 YIGSSYGDSQLIKLNLQPD-------AKGSYVE------------------------VLE 291
++ S +G+ L + D + SY + + +
Sbjct: 354 FVASEFGNHYLYQFQKLGDDDDEPEFSSSSYPQFGMADSSMPLPHVHFKPHPLDNLALAD 413
Query: 292 RYVNLGPIVDFCVVDL-ERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KG 349
+L PI+D V++L Q+ G SLR +R+G+ + E S +L GI
Sbjct: 414 EVESLDPIIDSKVLNLMPNSDTPQIFAACGRGPRSSLRTLRHGLEVEESVSSDLPGIPNA 473
Query: 350 MWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQL 409
+W+ + DD FD+++++SF++ T +L++ + +EE + GF S TL + L
Sbjct: 474 VWTTKKKEDDAFDSYIILSFVNGTLVLSIG--ETIEEVQDTGFLSSAPTLAVQQIGADAL 531
Query: 410 VQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGIL 468
+QV +R V S R NEW+ P G S+ AT N QV++A LVY E+ DG L
Sbjct: 532 LQVHPQGIRHVLSDRRV--NEWRVPQGKSIVQATTNKRQVVVALSSAELVYFELDLDGQL 589
Query: 469 TEVKHAQ-LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGG 526
E + + + + L I + E + AVG D +VRI SL P+ L T
Sbjct: 590 NEYQDRKAMGSTVLALSIGEVPEGRQRTPFLAVGC-EDQTVRIISLDPESTLDTISLQAL 648
Query: 527 EIIPRSVLLCAF--------EGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPI 578
P S+ + ++ L +G LL +L+ TG+LTD + LGT+P+
Sbjct: 649 TAPPSSICIAEMLDAAINKTHPTMFVNIGLQNGVLLRTVLDPMTGQLTDTRTRFLGTRPV 708
Query: 579 TLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKE 638
L + + + A S R + Y+ + ++ + + + F++ P+ L
Sbjct: 709 KLVRVAIQRNPAIMALSSRSWLNYTHQNMVHFTPLIFENLDFAWSFSAELSPEGLIGITG 768
Query: 639 GELTIGTIDDIQ-KLHIRSIPLGEHPR------------------RICHQEQSRTFAICS 679
L I I + KL S+PL PR R+ E +R +
Sbjct: 769 SVLRIFQIPKLGVKLKQDSLPLSYTPRKFITHPNNHYFYMIESDHRVYGDETAREKILDQ 828
Query: 680 LKNQSCAEESEMHF------------------VRLLDDQTFEFISTYPLDTFEYGCSILS 721
+ +E ++ +R++D ++ PLD E SI
Sbjct: 829 MSRGKAVDEDVVNLPVTEFGRVKAPAGTWGSCIRIIDPTQNSTVAVIPLDNNEAAFSIAV 888
Query: 722 CSFS-DDSNVYYCVGTA---YVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLN 777
FS + ++ VGTA V P R F + L+L + ET +L
Sbjct: 889 VPFSARNGELFLVVGTAANTRVSPRTCSSGYLRTYQFTNDGAGLELHHKTETDDVPLALL 948
Query: 778 AFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGH-ILALYVQTRGDFIVVGDLMKS 836
AF G+L A + + +++Y D G ++L + + G + + T+G I+ GD+ +S
Sbjct: 949 AFQGRLAAGVGKALRIY-----DIGKKKLLRKAENKGFGTTIVTLNTQGSRIIAGDMQES 1003
Query: 837 ISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATD 893
+ +YK E + A D W+SA ++D + N+F R K SE D
Sbjct: 1004 LFYAVYKAPENRLLVFADDSQPRWISAATMVDYYTVAAGDRFGNVFVNRLDYKVSEQVDD 1063
Query: 894 EERG---------------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGT 938
+ G + +++ +H+G+ + +LV G +++
Sbjct: 1064 DPTGAGILHEKGILMGAPHKTKLLCHFHVGDLITSIHKVALV-------AGGREVLLYTG 1116
Query: 939 VNGVIGVIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDG 997
++G IG++ + E F+ L+ ++R + G +H WR + V K +DG
Sbjct: 1117 LHGTIGMLVPFVSKEDVDFISTLEQHMRSEQSSLVGRDHLSWRGY-----YVPVKAVVDG 1171
Query: 998 DLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
DL E+F L ++ I+ ++ +V E+ K+++ L
Sbjct: 1172 DLCETFAKLPASKQSAIANELDRTVGEVLKKLDSL 1206
>gi|302916981|ref|XP_003052301.1| predicted protein [Nectria haematococca mpVI 77-13-4]
gi|256733240|gb|EEU46588.1| predicted protein [Nectria haematococca mpVI 77-13-4]
Length = 1212
Score = 226 bits (577), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 282/1174 (24%), Positives = 489/1174 (41%), Gaps = 168/1174 (14%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+L I+G I +L FR G +D+L +AT+ + ++++ + +
Sbjct: 59 LLSHDIFGIIRSLAAFRLAGSNKDYLILATDSGRITIIEYLPAQNRFQRLHLETFGKSGV 118
Query: 61 RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL- 117
R G+ DP R LI + L V+ +++ +L + + + VL I +
Sbjct: 119 RRVIPGEYLACDPKGRACLIASTEKNKLVYVLNRNSQAELTISSPLEAHKPGVLVISMVA 178
Query: 118 --YGCAKPTIVVLYQDNKDARHVKT-------------YEVALKDKDFVEGPWSQNNLDN 162
G + P L D + + YE+ L V WS + +D
Sbjct: 179 LDVGYSNPVFAALEIDYSEVDQDSSGQAMEDLETQLVYYELDLGLNHVVRR-WS-DPVDP 236
Query: 163 GADLLIPVP-----PPLCGVLIIGEETIVYCSAN--AFK-AIPIRPSITK--AYGRVDAD 212
A LL VP P GVL+ GEE I Y +N AF+ AIP R T+ R
Sbjct: 237 TASLLFQVPGGNDGP--SGVLVCGEENITYRHSNQEAFRVAIPRRRGATEDPNRKRTIVS 294
Query: 213 GSRYLLGDHAGLLHLLVITHEKE------------------KVTGLKIELLGETSIASTI 254
G + L AG L+ T + + +V LKI+ +AS++
Sbjct: 295 GIMHKLKGSAGAFFFLLQTEDGDLFKLSIDMVEDEDGNPTGEVKRLKIKYFDTVPVASSL 354
Query: 255 SYLDNAVVYIGSSYGDS---------------QLIKLNLQPDAKGSY------------V 287
L + +Y+ S +G+ + I + D + SY +
Sbjct: 355 CILKSGFLYVASQFGNYSFYQFEKLGDDDEELEFISDDFPADPRASYDPVYFHPRPTENL 414
Query: 288 EVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI 347
++E + P++D V +L + Q+ T G + R++++G+ +NE + EL GI
Sbjct: 415 ALVESIPAMNPLLDCKVANLTGEDAPQIYTICGNGARSTFRMLKHGLEVNEIVASELPGI 474
Query: 348 -KGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIY 406
+W+L+ + + +D ++V+SF + T +L++ + +EE GF + TL
Sbjct: 475 PSAVWTLKLNRSEQYDAYIVLSFTNGTLVLSIG--ETVEEVSDSGFLTSVPTLAAQLLGE 532
Query: 407 NQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GD 465
+ L+QV +R V + NEW +P S+ A+ANA QV +A G +VY E+ D
Sbjct: 533 DGLIQVHPKGIRHVRNGHI---NEWAAPQHRSIVAASANAHQVAIALSSGEIVYFEMDAD 589
Query: 466 GILTEVKHAQLEY-EISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEH 523
G L E + + ++CL + + E S AVG D +VRI SL PD L K
Sbjct: 590 GSLAEYDEKKEMFGTVTCLSLGDVPEGRLRSSFLAVGC-DDCTVRILSLDPDSTLENKSV 648
Query: 524 LGGEIIPRSVLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQP 577
P S+ + A E S YL L G L +L+ TGELTD ++ LG +
Sbjct: 649 QALTAAPTSLAIIAMEDSSSGGSTLYLHIGLHSGVYLRTVLDEVTGELTDTRQKFLGPKE 708
Query: 578 ITLRTFSSKNTTHVFAASDRPTVIYSS--NKKLLYSNVNLKEVSHMCPFNSAAFPDSLAI 635
+ L + + T V S RP + Y K + + +N ++ F+S + +
Sbjct: 709 VRLFQVTVQGKTCVLGLSSRPWLGYPDPITKGFVVTPLNYVDLEWGWNFSSEQCEEGIVG 768
Query: 636 AKEGELTIGTIDDIQKLHI-RSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFV 694
+ L I ID + I +SIPL P+++ + F N + E +
Sbjct: 769 IQGQSLRIFNIDRLGDTLIQKSIPLTYTPKKLVKHPEQPLFYTIEADNNTLPPELRAQLL 828
Query: 695 ----------RLLDDQTF-----------------------EFISTYPLDTFEYGCSILS 721
++L + F + + T + E S
Sbjct: 829 ADPNVVNGDSQVLPPEDFGYPRANRRWASCINVVDPLSEEGQVLQTVHFENNEAAVSATV 888
Query: 722 CSF-SDDSNVYYCVGTAY-VLPEENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVYSLN 777
SF S D+ + VGT ++ ++G + ++ VEDG+ L+ I + + + +L
Sbjct: 889 VSFASQDNENFLVVGTGKDMIVNPQSYSEGYLYIYRFVEDGRELEFIHKTKIEEPPLALL 948
Query: 778 AFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSI 837
F GK+ A+ ++++Y +R R+ Q+E I++L T+G IVVGD+ + +
Sbjct: 949 PFQGKVAVAVGTQLRIYDLGMR-QMLRKAQAEVSAQ-RIVSL--NTQGSRIVVGDVQQGV 1004
Query: 838 SLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDE 894
+L++YK + A D A W + ++D + G + N+F VR K SE A +E
Sbjct: 1005 TLVVYKSATNKLIPFADDTVARWTTCTTMVDYESIAGGDKFGNMFIVRCPEKASEEADEE 1064
Query: 895 ERG---------------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTV 939
+ G RL ++ ++ + SLV VG +++ +
Sbjct: 1065 QSGLHLINARDYLHGTPHRLGLMCHFYTQDVPTSITKTSLV-------VGGQEILLWSGI 1117
Query: 940 NGVIGV-IASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGD 998
G IGV I + E F + L+ +LR + G +H +R + K +DGD
Sbjct: 1118 MGTIGVFIPFVSREDADFFQNLEQHLRTEDPPLAGRDHLMYRGY-----YAPVKGVIDGD 1172
Query: 999 LIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
L E + L + I+ ++ SV E+ +++ ++
Sbjct: 1173 LCERYNLLPNDKKQMIAGELDRSVREIERKISDI 1206
>gi|240275059|gb|EER38574.1| DNA damage-binding protein 1a [Ajellomyces capsulatus H143]
Length = 1134
Score = 226 bits (576), Expect = 5e-56, Method: Compositional matrix adjust.
Identities = 163/556 (29%), Positives = 275/556 (49%), Gaps = 29/556 (5%)
Query: 508 VRIFSLPDLNLITKEHLG--GEIIPRSVLLCAF--EGISYLLCALGDGHLLNFLLNMKTG 563
V + SLPDL + LG G+ +PRSVL+ + L ++ DG + +F N +
Sbjct: 581 VVVMSLPDLAIRRSTSLGEPGDAVPRSVLVAEVLPNNPATLFVSMADGSVFSFSFNSEDF 640
Query: 564 ELTDRKKVSLGT-QPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMC 622
LT K++LG+ QP + +VFA ++P++IY+ +++YS VN + S +C
Sbjct: 641 SLTSMSKLTLGSEQPSFKKLPRGDGLYNVFATCEQPSLIYAVEGRIVYSAVNSDQASRIC 700
Query: 623 PFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKN 682
FNS A+P S+A+A EL I +D + I+++ +GE RR+ + R F I ++K
Sbjct: 701 HFNSEAYPGSIALATPSELKITLVDAERTTQIQTLEVGETVRRVAYSAPERAFGIGTIKR 760
Query: 683 --QSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSF-----SDDSNVY---Y 732
+ AE F+ L D+ F + Y L+ E S++ F S+ ++++ +
Sbjct: 761 TLEDGAEVIASRFM-LADEIMFRELDIYDLNKDELVESVIRAQFPDGIGSEGNDLFKDLF 819
Query: 733 CVGTAYVLPEENEPTKGRILVF-IVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKI 791
VGT+Y+ +GRIL F + + +L +AE KGA +L K++AA+ + +
Sbjct: 820 VVGTSYLDDFGEGSIRGRILAFEVTANRQLAKVAEMPVKGACRALAIVQDKIVAALMKTV 879
Query: 792 QLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEG---- 847
+Y L + + + G+ I V DLMKS+S++ Y+
Sbjct: 880 VVYTLSKGQFADYTLSKTASYRTSTAPVDIAVTGNLIAVADLMKSVSIVEYQQGANGLPD 939
Query: 848 AIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHL 907
++ E AR + W +AV + +D +L ++ NL + +N G TD++R RLEV E L
Sbjct: 940 SLTEVARHFQTLWSTAVAPVAEDTWLESDAEGNLVMLHRNVNGVTDDDRRRLEVTSEILL 999
Query: 908 GEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKV 967
GE VNR R + + ++ P GTV G I + + L +LQ+ + +
Sbjct: 1000 GEMVNRIRPVN-IQGSQGAEAAISPRAFLGTVEGSIYLFGIINPTYQDLLMRLQSAMAGM 1058
Query: 968 IKGVGGLNHEQWRSFNNE-KKTVDAKNFLDGDLIESFLDLSRTRMDEI------SKTMNV 1020
+ GG+ ++R+F N ++T + F+DG+LIE FL+ +EI V
Sbjct: 1059 VVTPGGMPFNKFRAFRNTIRQTEEPYRFVDGELIERFLNCGVELQEEIVGKVIADGVAGV 1118
Query: 1021 SVEELCKRVEELTRLH 1036
+VE + VEEL R+H
Sbjct: 1119 TVESVKGLVEELRRMH 1134
Score = 182 bits (462), Expect = 8e-43, Method: Compositional matrix adjust.
Identities = 140/456 (30%), Positives = 218/456 (47%), Gaps = 99/456 (21%)
Query: 16 FRPHGEAQDFLFIATERYKFCVLQWDAESSELIT-RAMGDVSDRIGRPTDNGQIGIIDPD 74
FRP D LFI T+R + + WD ++ L T R D++D R + G +IDP
Sbjct: 92 FRPQ---TDILFIGTDRASYFTVSWDQKTKNLRTERKYVDLADPSARASQLGDRCLIDPS 148
Query: 75 CRLIGLHLYDGLFKVIP----FDNKG--------------------------------QL 98
R + L LY+G+ VIP ++G +L
Sbjct: 149 GRFLTLELYEGIVTVIPIGQPLRSRGSGRRLGKRAANASASAAYANQDAAAGAAAADLEL 208
Query: 99 KEAFNIRLEELQVLDIKFLYGCAK--PTIVVLYQDNKDARHVKTYEV------------- 143
E R+EEL V FL+ A P + +LY+D +K E+
Sbjct: 209 GEPCQARVEELLVRSSTFLHTQADTLPRMALLYEDTMGKVRLKVRELEVMHGGMGMGGGT 268
Query: 144 --------------ALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYC- 188
LK D + + L+ GA L+PVP PL G+L++GE +I Y
Sbjct: 269 GSGGGIGADGGIVAVLKGLDLL-----KEELEMGASFLVPVPAPLGGLLVLGETSIRYLD 323
Query: 189 -SANAFKAIPIR-PSITKAYGRVDADGSRYLLGDHAG-LLHLLVITHEKEKVTGLKIELL 245
++N + P++ +I A+ +VD G R+LL D G L L+++ V K++LL
Sbjct: 324 DASNECISQPLKEATIFVAWEQVD--GQRWLLADDYGRLFFLMLVLDTDNAVQSWKLDLL 381
Query: 246 GETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVV 305
G+ AS + Y+ + +IGS GDS+LI++ +GS+ EV++ + N+ PI+DF ++
Sbjct: 382 GDIPRASVLVYMGGGITFIGSHQGDSELIRIT-----EGSF-EVIQTFANIAPILDFTIM 435
Query: 306 DL------ERQ------GQGQVVTCSGAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWS 352
DL E Q GQ ++VT SGA+ DGSLR VR+G+G+ E + ++ I +W+
Sbjct: 436 DLGGRAIGESQTHEFSSGQARIVTGSGAFNDGSLRSVRSGVGMEEVGVLGAMKHITDLWA 495
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETE 388
LR + F L+VSF+ ETR+ + E+EE E
Sbjct: 496 LRVACPQEFSDTLLVSFVDETRVFYFTPDGEVEEKE 531
>gi|392570042|gb|EIW63215.1| hypothetical protein TRAVEDRAFT_161375 [Trametes versicolor FP-101664
SS1]
Length = 1213
Score = 225 bits (574), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 277/1179 (23%), Positives = 482/1179 (40%), Gaps = 182/1179 (15%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
++G I +L FR G +D++ + ++ + +L +D +S + R
Sbjct: 58 VFGSIRSLAAFRLTGGTKDYVIVGSDSGRIVILDYDPRTSTFVKIHQETYGKSGARRIVP 117
Query: 66 GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLK-----EAF--NIRLEELQVLDIKF 116
GQ DP R +IG L ++ D L EA N + + LD+ F
Sbjct: 118 GQYLATDPKGRSVMIGSVEKAKLVYILNRDAAANLTISSPLEAHKNNAIIHHIVGLDVGF 177
Query: 117 LYGCAKPTIVVLYQD-------------NKDARHVKTYEVALKDKDFVEGPWSQNNLDNG 163
PT L D N + + YE+ L V WS+ D
Sbjct: 178 ----ENPTFAALEVDYSESDQDPTGEAFNNAEKMLTYYELDLGLNHVVRK-WSEPT-DPR 231
Query: 164 ADLLIPVP------------PPLCGVLIIGEETIVYCSANAFK---AIPIRPS------- 201
A+LL+ VP P GVL+ E+ I+Y ++ + IP R
Sbjct: 232 ANLLVQVPGGQLAASERYDGP--SGVLVCCEDHIIYRHMDSPQHRVPIPRRKHPLEDTER 289
Query: 202 ---ITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLD 258
IT A +LL G L+ + I HE++ V +KI+ +AS++ L
Sbjct: 290 GLIITAAVMHKMKGAFFFLLQSEEGDLYKVTIEHEEQDVKAVKIKYFDTVPVASSLCILK 349
Query: 259 NAVVYIGSSYGDS---QLIKLN---------------------LQPDAKGSY-------V 287
+ +++ S +G+ Q KL P +G + +
Sbjct: 350 SGFLFVASEFGNHYFYQFQKLGDDDNEPEWSSADYPSYGMADPSTPLPRGYFRPRALDNL 409
Query: 288 EVLERYVNLGPIVDFCVVDL-ERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG 346
+++ +L PI+D V++L Q+ T G + R +R+G+ + E S +L G
Sbjct: 410 TLVDELESLCPIIDSKVMNLLPNSDTPQIFTACGRGARSTFRTLRHGLEVEEVVSSDLPG 469
Query: 347 I-KGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAI 405
I +W+ + DD +D+++++SF++ T +L++ + +EE + GF S TL
Sbjct: 470 IPNAVWTTKLKEDDLYDSYIILSFVNGTLVLSIG--ETIEEVQDTGFLSSAPTLAVQQIG 527
Query: 406 YNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG- 464
+ L+QV +R V + R NEWK P G ++ AT N QV++A LVY E+
Sbjct: 528 ADALLQVYPHGIRHVLADRRV--NEWKVPSGKTIVCATTNKRQVVVALSSAELVYFELDL 585
Query: 465 DGILTEVKHAQ-LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEH 523
DG L E + + + + L + + E + AVG D +VRI SL + +
Sbjct: 586 DGQLNEYQDRKAMGSTVLALSVAEVPEGRQRTPYLAVGC-EDQTVRIVSLDPESTLETIS 644
Query: 524 LGGEIIPRSVLLCA---------FEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLG 574
L P S + A + +++ L +G LL +L+ G+LTD + LG
Sbjct: 645 LQALTAPPSAICIADMLDASINKSQPTTFVNIGLQNGVLLRTVLDPVNGQLTDTRTRFLG 704
Query: 575 TQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLA 634
T+PI L + + + A S RP + Y+ + ++ + + + + F++ + L
Sbjct: 705 TRPIRLLRVNIQQNPAILALSSRPWLNYTYQNFMHFTPLIFENLDYAWSFSAELCTEGLI 764
Query: 635 IAKEGELTIGTIDDI-QKLHIRSIPLGEHPRRIC--------------HQEQSRTFAICS 679
L I I + KL S+PL PR+ H+ S A
Sbjct: 765 GISGSLLRIFQIPKLGTKLKQDSLPLSYTPRKFMPHPTNGLLYLIEGDHRVMSEEAASKK 824
Query: 680 LKNQSCAEESEMHFVRLLDDQTF----------------------EFISTYPLDTFEYGC 717
L+ E V LL + F + LD E
Sbjct: 825 LQEMRARGERIDEEVLLLPPEQFGRPKAPAGTWASCIRIINPLESSTVKVIHLDNNEAAF 884
Query: 718 SILSCSFSDDSN-VYYCVGTA---YVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAV 773
S+ F+ N ++ VGTA ++ P R FI + L+ + + ET
Sbjct: 885 SMAIVPFAARGNELHLVVGTAQDTFLSPRSCTSGFLRTYRFIDDGRDLEFLHKTETSDVP 944
Query: 774 YSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY-VQTRGDFIVVGD 832
++ AF GKL+A + + ++LY D G ++L + + G A+ + T+G I+VGD
Sbjct: 945 LAVMAFQGKLIAGVGKSLRLY-----DVGKKKLLRKVENKGFPAAIVTLNTQGSRIIVGD 999
Query: 833 LMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSE--- 889
+ +S+ +YK E + A D W++A +LD + + + N+F R +S+
Sbjct: 1000 MQESVFYAVYKAPENRLLVFADDAQPRWVTATTMLDYNTVVAGDRFGNVFVNRLDSKISD 1059
Query: 890 --------GATDEERGRL-------EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTV 934
E+G L ++ +H+G+ V +LV G +
Sbjct: 1060 QIDDDPTGAGILHEKGVLFGAPHKSVMLSHFHVGDIVTSLHKVALV-------AGGREVL 1112
Query: 935 IFGTVNGVIGVIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKN 993
++ ++G IG++ + E L L+ ++R + G +H WR + V K+
Sbjct: 1113 LYTCLHGTIGILVPFVSKEDVDLLTTLEQHMRTEQLSLVGRDHLTWRGY-----YVPVKS 1167
Query: 994 FLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
+DGDL ESF L + I+ ++ +V E+ K++E+L
Sbjct: 1168 VVDGDLCESFAKLPANKQSTIAGELDRTVGEVLKKLEQL 1206
>gi|409075182|gb|EKM75565.1| hypothetical protein AGABI1DRAFT_64324 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 1213
Score = 225 bits (574), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 275/1177 (23%), Positives = 484/1177 (41%), Gaps = 178/1177 (15%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
++G I +L FR G +D+ + ++ + +L++DA++S R
Sbjct: 58 VFGSIRSLASFRLTGATKDYAILGSDSGRIIILEYDAKTSSFAKLHQETFGKSGARRIVP 117
Query: 66 GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIR-------LEELQVLDIKF 116
GQ DP R +I L ++ D L + + + + LD+ F
Sbjct: 118 GQYLATDPKGRSVMIAAMEKAKLVYILNRDAAANLTISSPLEAHKNSSIIHHIVGLDVGF 177
Query: 117 ---LYGCAKPTIVVLYQDN-----KDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLI 168
+Y + QD K A + TY + V WS+ D A+LL+
Sbjct: 178 ENPVYAALEVDYSESDQDPTGEAFKRAEKMLTYYELDLGLNHVVRKWSEPT-DPRANLLV 236
Query: 169 PVP------------PPLCGVLIIGEETIVYCSANAFK---AIPIRPS----------IT 203
VP P GVL+ E+ I+Y ++ + IP R IT
Sbjct: 237 QVPGGQLASSDRFDGP--SGVLVCCEDHIIYRHMDSPQHRVPIPRRQHPLEDKERGVIIT 294
Query: 204 KAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVY 263
A +LL G L + I HE E+V LKI+ +AS++ L + ++
Sbjct: 295 SAVMHKMKGAFFFLLQSEDGDLFKVTIEHEDEEVKALKIKYFDTVPVASSLCILKSGFLF 354
Query: 264 IGSSYGDSQLIKLN---------------------LQPDAKGSYVEVLERYVN------- 295
+ S +G+ L + +P A V R ++
Sbjct: 355 VASEFGNHYLYQFQKLGDDDEEPEFSSTSFPSSGMAEPQAALPRVYFKPRPLDNLALADE 414
Query: 296 ---LGPIVDFCVVDL-ERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGM 350
L PI+D V++L Q+ G SLR +++G+ + E S +L GI +
Sbjct: 415 LESLDPIIDSKVLNLLPNSDTPQIFAACGRGARSSLRTLQHGLEVEESVSSDLPGIPNAV 474
Query: 351 WSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLV 410
W+ + + DDP+D+++++SF++ T +L++ + +EE + GF S TL + L+
Sbjct: 475 WTTKRNEDDPYDSYIILSFVNGTLVLSIG--ETIEEVQDTGFLSSAPTLAVQQIGSDALL 532
Query: 411 QVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILT 469
QV +R V + R NEW+ P ++ AT N QV++A LVY E+ DG L
Sbjct: 533 QVHPHGIRHVLADRRV--NEWRVPSNKTIVAATTNKRQVVVALSSAELVYFELDLDGQLN 590
Query: 470 EVKHAQ-LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEI 528
E + + + + L I + E + AVG D +VRI SL + + L
Sbjct: 591 EYQDRKAMGSTVLALSIGDVPEGRQRTPYLAVGC-EDQTVRIISLDPESTLETISLQALT 649
Query: 529 IPRSVLLCA---------FEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPIT 579
P S + A + ++ L +G LL +L+ G+LTD + LGT+PI
Sbjct: 650 APPSAICIADMLDASINKAQPTMFVNIGLQNGVLLRTVLDPINGQLTDTRTRFLGTRPIK 709
Query: 580 LRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEG 639
L + A S R + Y+ + ++ + + + + F++ P+ L
Sbjct: 710 LVRVLIHKHPAILALSSRSWLNYTYQNFMHFTPLIYENLDYAWSFSAELSPEGLIGISGS 769
Query: 640 ELTIGTIDDIQ-KLHIRSIPLGEHPRRICHQEQSRTFAIC-----------------SLK 681
L I + + KL SIPL PR+ + F + L+
Sbjct: 770 VLRIFHVPRLGIKLKQDSIPLSYTPRKFITYPLNNLFYLIEGDHRVMGQDAVDKKLNELR 829
Query: 682 NQSCAEESEM-------------------HFVRLLDDQTFEFISTYPLDTFEYGCSILSC 722
Q+ A + E+ +R++D + IS PLD E S+
Sbjct: 830 QQNRAIDQEVLNLSPEVFGRPKAANGTWASNIRIIDPVEGKTISVVPLDGNESAFSLAVV 889
Query: 723 SFSDDSN-VYYCVGTAY---VLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNA 778
FS N ++ VGTA + P R+ F+ + +L+L+ + E +L A
Sbjct: 890 PFSAKGNELHLVVGTAADTKLSPRTCSTGFLRVYKFLEDGRQLELVHKTEIDDVPLALMA 949
Query: 779 FNGKLLAAINQKIQLY----KWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLM 834
F G+L+A + + +++Y K MLR ++ S + + T+G I+VGD+
Sbjct: 950 FQGRLVAGVGKALRIYDIGKKKMLRKVENKQFGSAI--------VTLSTQGSRILVGDMQ 1001
Query: 835 KSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGA--T 892
+SI +YK E + A D W+SA ++D + + A+ N+F R + +
Sbjct: 1002 ESIFFAVYKAPENRLLIFADDSQPRWISAATMVDYNTVVAADRFGNIFVNRLDPRVSDQV 1061
Query: 893 DE---------ERG-------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIF 936
DE E+G + +++ +H+G+ + SLV G +++
Sbjct: 1062 DEDPTGAGILHEKGLYMGAPHKTKMICHFHVGDLITSIHKVSLV-------AGGREVLLY 1114
Query: 937 GTVNGVIGVIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFL 995
++G IG++ + E F+ L+ ++R + G +H WR + V K +
Sbjct: 1115 TGLHGTIGILVPFVTKEDVDFISTLEQHMRTEQVSLVGRDHLGWRGY-----YVPVKAVV 1169
Query: 996 DGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
DGDL E + L ++ I+ ++ S+ E+ K++E+L
Sbjct: 1170 DGDLCEMYAKLPGSKQSAIAGELDRSIGEVLKKLEQL 1206
>gi|322707263|gb|EFY98842.1| Pre-mRNA-splicing factor rse-1 [Metarhizium anisopliae ARSEF 23]
Length = 1212
Score = 224 bits (571), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 288/1169 (24%), Positives = 497/1169 (42%), Gaps = 168/1169 (14%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
++G I +L FR G +D+L IA++ + V+++ + + R
Sbjct: 64 VFGIIRSLAAFRLAGSNKDYLIIASDSGRITVVEYLPAQNRFSRLHLETFGKSGVRRVIP 123
Query: 66 GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YGC 120
G+ DP R LI + L V+ + +L + + + VL I + G
Sbjct: 124 GEYLACDPKGRACLIASTEKNKLVYVLNRSAQAELTISSPLEAHKPGVLVISMVALDVGY 183
Query: 121 AKPTIVVLYQD----NKDA-----RHVKT----YEVALKDKDFVEGPWSQNNLDNGADLL 167
A P L D ++D+ V+T YE+ L V WS+ +D A LL
Sbjct: 184 ANPVFAALEMDYSEVDQDSSGQALEEVETQLVYYELDLGLNHVVRK-WSEP-VDPTASLL 241
Query: 168 IPVP-----PPLCGVLIIGEETIVYCSAN--AFKA-IPIRPSITK--AYGRVDADGSRYL 217
VP P GVL+ GEE+I Y +N AF+ IP R T+ + R G +
Sbjct: 242 FQVPGGNDGP--SGVLVCGEESITYRHSNQDAFRVPIPRRKGATEDPSRKRTIVSGVMHK 299
Query: 218 LGDHAGLLHLLVITHEKE------------------KVTGLKIELLGETSIASTISYLDN 259
L AG L+ T + + +V LKI+ +A+++ L +
Sbjct: 300 LKGSAGAFFFLLQTEDGDLFKVTIDMVEDEEGSPTGEVKRLKIKYFDTVPVATSLCILKS 359
Query: 260 AVVYIGSSYGD---SQLIKL------------NLQPDAKGSY------------VEVLER 292
+YI S +G+ Q KL + D + SY + ++E
Sbjct: 360 GFLYIASQFGNFSFYQFEKLGDDDAELEFSSDDFPVDPQASYDPVYFHPRPAENLALVES 419
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMW 351
++ P++D V +L + Q+ T G + R++R+G+ +NE + EL GI +W
Sbjct: 420 IPSMNPLLDCQVANLTGEDAPQIYTVCGNGARSTFRMLRHGLEVNEIVASELPGIPSAVW 479
Query: 352 SLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQ 411
+L+ + + +D ++V+SF + T +L++ + +EE GF + TL + L+Q
Sbjct: 480 TLKLNRGEQYDAYIVLSFTNGTLVLSIG--ETVEEVSDSGFSTSVPTLAAQLLGDDGLIQ 537
Query: 412 VTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTE 470
V +R + + NEW +P S+ A+ NA QV +A G +VY E+ DG L E
Sbjct: 538 VHPKGIRHIRNGKV---NEWDAPQHRSIVAASTNAHQVAIALSSGEIVYFEMDSDGSLAE 594
Query: 471 VKHAQLEY-EISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEI 528
+ + ++CL + + E S AVG D +VRI SL P+ L +K
Sbjct: 595 YDEKKEMFGTVTCLSLGEVPEGRVRSSFLAVGC-DDCTVRILSLDPESTLESKSVQALTA 653
Query: 529 IPRSVLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRT 582
P S+ + A + S YL L G L +L+ TGELTD ++ LG + + L
Sbjct: 654 APSSLAIIAMDDSSSGGSTLYLHIGLHSGVYLRTVLDEVTGELTDTRQKFLGPKQVRLFQ 713
Query: 583 FSSKNTTHVFAASDRPTVIYSS--NKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGE 640
+ + TT V S RP + YS K + + +N ++ F+S + + +
Sbjct: 714 VTVQGTTCVLGLSSRPWLGYSDPITKGFVVTPLNYVDLEWGWNFSSEQCEEGVVGIQGQS 773
Query: 641 LTIGTIDDI-QKLHIRSIPLGEHPRR-ICHQEQSRTFAICS--------LKNQSCAEESE 690
L I +ID + L +++ L P++ I H EQ + I S L+ Q A+ +
Sbjct: 774 LRIFSIDRLGDTLTQKAVSLTYTPKKLIKHPEQPLFYTIESDNNTLPPDLRAQLMADPAV 833
Query: 691 MHF-VRLLDDQTF-----------------------EFISTYPLDTFEYGCSILSCSF-S 725
++ +L + F + + T L+ E S F S
Sbjct: 834 VNGDATVLPPEDFGYPKGNRRWASCINVIDPVSEEPQVVQTVDLENNEAAVSAAIVPFAS 893
Query: 726 DDSNVYYCVGTA---YVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGK 782
D+ + VGT V P + F E +L+ I + + + +L F GK
Sbjct: 894 QDNESFLIVGTGKDVVVNPRNFSEAYIYVYRFQEEGRELEFIHKTKIEEPALALIPFQGK 953
Query: 783 LLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIY 842
LLA + + +++Y +R R+ Q+E I++L T+G I+VGD+ + ++ + Y
Sbjct: 954 LLAGVGKTLRVYDLGMR-QMLRKAQAEVAPQ-QIVSL--NTQGSRIIVGDIQQGVTYVTY 1009
Query: 843 KHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDEERG-- 897
K + A D A W + ++D + G + N+F VR K SE A +E+ G
Sbjct: 1010 KPTTNKLIPFADDTIARWTTCTTMVDYESVAGGDKFGNMFIVRCPPKASEEADEEQSGLH 1069
Query: 898 -------------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIG 944
RL+++ ++ + SLV VG +++ + G IG
Sbjct: 1070 LMNARDYLHGTSQRLDLMCHFYTQDIPTSMAKTSLV-------VGGQDVLLWSGLMGTIG 1122
Query: 945 V-IASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESF 1003
V I + E F + L+++LR + G +H +RS+ K +DGDL E +
Sbjct: 1123 VFIPLISREDADFFQSLESHLRTEDPPLAGRDHLMYRSY-----YAPVKGIIDGDLCERY 1177
Query: 1004 LDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
L + I+ ++ SV E+ +++ ++
Sbjct: 1178 TLLPNDKKQMIAGELDRSVREIERKISDI 1206
>gi|426192113|gb|EKV42051.1| hypothetical protein AGABI2DRAFT_229642 [Agaricus bisporus var.
bisporus H97]
Length = 1213
Score = 224 bits (570), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 275/1177 (23%), Positives = 483/1177 (41%), Gaps = 178/1177 (15%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
++G I +L FR G +D+ + ++ + +L++DA++S R
Sbjct: 58 VFGSIRSLASFRLTGATKDYAILGSDSGRIIILEYDAKTSSFAKLHQETFGKSGARRIVP 117
Query: 66 GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIR-------LEELQVLDIKF 116
GQ DP R +I L ++ D L + + + + LD+ F
Sbjct: 118 GQYLATDPKGRSVMIAAMEKAKLVYILNRDAAANLTISSPLEAHKNSSIIHHIVGLDVGF 177
Query: 117 ---LYGCAKPTIVVLYQDN-----KDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLI 168
+Y + QD K A + TY + V WS+ D A+LL+
Sbjct: 178 ENPVYAALEVDYSESDQDPTGEAFKRAEKMLTYYELDLGLNHVVRKWSEPT-DPRANLLV 236
Query: 169 PVP------------PPLCGVLIIGEETIVYCSANAFK---AIPIRPS----------IT 203
VP P GVL+ E+ I+Y ++ + IP R IT
Sbjct: 237 QVPGGQLASSDRFDGP--SGVLVCCEDHIIYRHMDSPQHRVPIPRRQHPLEDKERGVIIT 294
Query: 204 KAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVY 263
A +LL G L + I HE E+V LKI+ +AS++ L + ++
Sbjct: 295 SAVMHKMKGAFFFLLQSEDGDLFKVTIEHEDEEVKALKIKYFDTVPVASSLCILKSGFLF 354
Query: 264 IGSSYGDSQLIKLN---------------------LQPDAKGSYVEVLERYVN------- 295
+ S +G+ L + +P A V R ++
Sbjct: 355 VASEFGNHYLYQFQKLGDDDEEPEFSSTSFPSSGMAEPQAALPRVYFKPRPLDNLALADE 414
Query: 296 ---LGPIVDFCVVDL-ERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGM 350
L PI+D V++L Q+ G SLR +++G+ + E S +L GI +
Sbjct: 415 LESLDPIIDSKVLNLLPNSDTPQIFAACGRGARSSLRTLQHGLEVEESVSSDLPGIPNAV 474
Query: 351 WSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLV 410
W+ + + DDP+D+++++SF++ T +L++ + +EE + GF S TL + L+
Sbjct: 475 WTTKRNEDDPYDSYIILSFVNGTLVLSIG--ETIEEVQDTGFLSSAPTLAVQQIGSDALL 532
Query: 411 QVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILT 469
QV +R V + R NEW+ P + AT N QV++A LVY E+ DG L
Sbjct: 533 QVHPHGIRHVLADRRV--NEWRVPSNKIIVAATTNKRQVVVALSSAELVYFELDLDGQLN 590
Query: 470 EVKHAQ-LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEI 528
E + + + + L I + E + AVG D +VRI SL + + L
Sbjct: 591 EYQDRKAMGSTVLALSIGDVPEGRQRTPYLAVGC-EDQTVRIISLDPESTLETISLQALT 649
Query: 529 IPRSVLLCA---------FEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPIT 579
P S + A + ++ L +G LL +L+ G+LTD + LGT+PI
Sbjct: 650 APPSAICIADMLDASINKAQPTMFVNIGLQNGVLLRTVLDPINGQLTDTRTRFLGTRPIK 709
Query: 580 LRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEG 639
L + A S R + Y+ + ++ + + + + F++ P+ L
Sbjct: 710 LVRVLIHKHPAILALSSRSWLNYTYQNFMHFTPLIYENLDYAWSFSAELSPEGLIGISGS 769
Query: 640 ELTIGTIDDIQ-KLHIRSIPLGEHPRRICHQEQSRTFAIC-----------------SLK 681
L I + + KL SIPL PR+ + F + L+
Sbjct: 770 VLRIFHVPRLGIKLKQDSIPLSYTPRKFITYPLNNLFYLIEGDHRVMGQDAVDKKLNELR 829
Query: 682 NQSCAEESEM-------------------HFVRLLDDQTFEFISTYPLDTFEYGCSILSC 722
Q+ A + E+ +R++D + IS PLD E S+
Sbjct: 830 QQNKAIDQEVLNLSPEVFGRPKAANGTWASNIRIIDPVEGKTISVVPLDGNESAFSLAVV 889
Query: 723 SFSDDSN-VYYCVGTAY---VLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNA 778
FS N ++ VGTA + P R+ F+ + +L+L+ + E +L A
Sbjct: 890 PFSAKGNELHLVVGTAADTKLSPRTCSTGFLRVYKFLEDGRQLELVHKTEIDDVPLALMA 949
Query: 779 FNGKLLAAINQKIQLY----KWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLM 834
F G+L+A + + +++Y K MLR ++ S + + T+G I+VGD+
Sbjct: 950 FQGRLVAGVGKALRIYDIGKKKMLRKVENKQFGSAI--------VTLSTQGSRILVGDMQ 1001
Query: 835 KSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGA--T 892
+SI +YK E + A D W+SA ++D + + A+ N+F R + +
Sbjct: 1002 ESIFFAVYKAPENRLLIFADDSQPRWISAATMVDYNTVVAADRFGNIFVNRLDPRVSDQV 1061
Query: 893 DE---------ERG-------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIF 936
DE E+G + +++ +H+G+ + SLV G +++
Sbjct: 1062 DEDPTGAGILHEKGLYMGAPHKTKMICHFHVGDLITSIHKVSLV-------AGGREVLLY 1114
Query: 937 GTVNGVIGVIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFL 995
++G IG++ + E F+ L+ ++R + G +H WR + V K +
Sbjct: 1115 TGLHGTIGILVPFVTKEDVDFISTLEQHMRTEQVSLVGRDHLGWRGY-----YVPVKAVV 1169
Query: 996 DGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
DGDL E + L ++ I+ ++ S+ E+ K++E+L
Sbjct: 1170 DGDLCEMYAKLPGSKQSAIAGELDRSIGEVLKKLEQL 1206
>gi|221044336|dbj|BAH13845.1| unnamed protein product [Homo sapiens]
Length = 222
Score = 223 bits (569), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 113/208 (54%), Positives = 151/208 (72%), Gaps = 7/208 (3%)
Query: 223 GLLHLLVITHEKE-----KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLN 277
G L +L++ E++ + L++ELLGETSIA ++YLDN VV++GS GDSQL+KLN
Sbjct: 3 GRLFMLLLEKEEQMDGTVTLKDLRVELLGETSIAECLTYLDNGVVFVGSRLGDSQLVKLN 62
Query: 278 LQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGIN 337
+ + +GSYV +E + NLGPIVD CVVDLERQGQGQ+VTCSGA+K+GSLRI+RNGIGI+
Sbjct: 63 VDSNEQGSYVVAMETFTNLGPIVDMCVVDLERQGQGQLVTCSGAFKEGSLRIIRNGIGIH 122
Query: 338 EQASVELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQ 397
E AS++L GIKG+W LRS + D LV+SF+ +TR+L +N E E+EETE+ GF Q
Sbjct: 123 EHASIDLPGIKGLWPLRSDPNRETDDTLVLSFVGQTRVLMLNGE-EVEETELMGFVDDQQ 181
Query: 398 TLFCHDAIYNQLVQVTSGSVRLVSSTSR 425
T FC + + QL+Q GS RLV +
Sbjct: 182 TFFCGNVAHQQLIQ-PPGSGRLVKPEGK 208
>gi|406868052|gb|EKD21089.1| pre-mRNA-splicing factor rse1 [Marssonina brunnea f. sp.
'multigermtubi' MB_m1]
Length = 1236
Score = 223 bits (567), Expect = 6e-55, Method: Compositional matrix adjust.
Identities = 279/1163 (23%), Positives = 482/1163 (41%), Gaps = 162/1163 (13%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
++G I L FR G +D++ I ++ + ++++ ++ + R
Sbjct: 64 VFGIIRALASFRLAGSNKDYIIITSDSGRITIVEFLPAQNKFNRLHLETFGKSGVRRVIP 123
Query: 66 GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YGC 120
GQ +DP R LI + L V+ +++ +L + + + Q L G
Sbjct: 124 GQYLAVDPKGRACLIASVEKNKLVYVLNRNSQAELTISSPLEAHKPQTLVFAMCALDVGY 183
Query: 121 AKPTIVVLYQDNKDARHVKT-------------YEVALKDKDFVEGPWSQNNLDNGADLL 167
A P L D D+ T YE+ L V WS + +D A +L
Sbjct: 184 ANPVFAALEVDYGDSDQDPTGQAYDEIEKSLVYYELDLGLNHVVRK-WS-DPVDRTASVL 241
Query: 168 IPVPPPL---CGVLIIGEETIVYCSAN--AFK-AIPIRPSITK--AYGRVDADGSRYLLG 219
VP GVL+ G++ I Y +N AF+ AIP R T+ RV G + L
Sbjct: 242 FQVPGGTDGPSGVLVCGDDNITYRHSNQEAFRVAIPRRRGATEDPNRKRVIVGGVMHKLK 301
Query: 220 DHAGLLHLLVITHEKE------------------KVTGLKIELLGETSIASTISYLDNAV 261
AG L+ T + + +V LKI+ IA+++ L +
Sbjct: 302 GAAGAFFFLLQTDDGDLFKVTIEMVENDEGAPTGEVKRLKIKYFDTVPIATSLCILKSGF 361
Query: 262 VYIGSSYGDSQLIKL---------------NLQPDAKGSYVEV------------LERYV 294
+++ S +G+ Q + N D Y V +E
Sbjct: 362 LFVASEFGNHQFYQFEKLGDDDEETEFISDNFPTDPLEPYTPVYFHPRPAENLSLVESID 421
Query: 295 NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSL 353
++ P++D V +L + Q+ + G + R +R+G+ ++E EL G+ +W+
Sbjct: 422 SMNPLMDCKVANLTEEDAPQIYSICGTGARSTFRTLRHGLEVSEIVESELPGVPSAVWTT 481
Query: 354 RSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVT 413
+ + +D +D ++++SF + T +L++ + +EE GF S TL + L+QV
Sbjct: 482 KLTRNDTYDAYIILSFTNGTLVLSIG--ETVEEVTDTGFLSSAPTLAVQQLGDDGLIQVH 539
Query: 414 SGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTEV- 471
+R + + R NEW +P S+ A NA QV +A G +VY E+ DG L E
Sbjct: 540 PKGIRHIRADRRV--NEWAAPQHRSIVAAATNARQVAVALSSGEIVYFEMDSDGSLAEYD 597
Query: 472 KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIP 530
+ ++ ++CL + + E SQ AVG D +VRI SL PD L K P
Sbjct: 598 EKKEMSGTVTCLSLGEVPEGRVRSQFLAVGC-DDSTVRILSLDPDSTLENKSVQALTSAP 656
Query: 531 RSVLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFS 584
++ + A S YL L G L +L+ TGEL+D + LG +P L S
Sbjct: 657 SALSIMAMSDSSSGGSTLYLHIGLYSGVYLRTVLDEVTGELSDTRTRFLGPKPARLFRVS 716
Query: 585 SKNTTHVFAASDRPTVIYSS--NKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELT 642
+ V A S RP + YS K + + +N + F+S + + + L
Sbjct: 717 VQGQAAVLALSSRPWLGYSDPVTKGFMLTPLNYPGLEWGWNFSSEQCTEGMVGIQGQNLR 776
Query: 643 IGTIDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQT 701
I +I+ + L +I L PRR + F N + ++ +LL+D +
Sbjct: 777 IFSIEKLTDNLLQDTISLTYTPRRFLRHPEHPLFYTIESDNNILSPATK---AKLLEDPS 833
Query: 702 F-----------EFISTYP--------------------------LDTFEYGCSILSCSF 724
EF YP LD E S+ + SF
Sbjct: 834 IVNGDAAVLPAEEF--GYPRGKNHWASCISVVDPVTEKRVLQKIDLDDNEAAVSMAAVSF 891
Query: 725 SD-DSNVYYCVGTAY-VLPEENEPTKGRILVFIVEDG--KLQLIAEKETKGAVYSLNAFN 780
S D V+ +GT ++ T G I V+ D +++ I + + + +L F
Sbjct: 892 SSQDDEVFLVIGTGKDMIVSPRSSTAGFIHVYRFHDNGKEIEFIHKTKVEEPPMALLGFQ 951
Query: 781 GKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLL 840
G+LL I + +++Y +R R+ Q+E + L + +QT+G IVV D+ +SI ++
Sbjct: 952 GRLLVGIGKDLRIYDLGMR-QLLRKAQAEVAPN---LIVGLQTQGSRIVVSDVQESIIMI 1007
Query: 841 IYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDE--- 894
+YK +E + D + W S ++D + G + NL+ +R K SE A +E
Sbjct: 1008 VYKFQENKLIPFVDDTISRWTSCTTMVDYETVAGGDKFGNLWLLRCPTKASEEADEEGSA 1067
Query: 895 -----ERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV--GQIPTVIFGTVNGVIGV-I 946
ER L+ G H + F + M + +++ G +++ + G +G+ I
Sbjct: 1068 SHLVHERSYLQ--GSPHRLTLMAHFFTQDIPMSIQKTNLVAGGRDCILWSGIQGTLGILI 1125
Query: 947 ASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDL 1006
+ E F + L+ +LR + G +H +RS+ V K +DGDL E + L
Sbjct: 1126 PFVSREDVDFFQTLEQHLRSEDAPLAGRDHLIYRSY-----YVPVKGVIDGDLCERYTLL 1180
Query: 1007 SRTRMDEISKTMNVSVEELCKRV 1029
+ I+ ++ SV E+ +++
Sbjct: 1181 PTDKKQMIAGELDRSVREIERKI 1203
>gi|393217872|gb|EJD03361.1| hypothetical protein FOMMEDRAFT_108572 [Fomitiporia mediterranea
MF3/22]
Length = 1213
Score = 222 bits (566), Expect = 7e-55, Method: Compositional matrix adjust.
Identities = 270/1177 (22%), Positives = 472/1177 (40%), Gaps = 180/1177 (15%)
Query: 7 YGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDNG 66
+G I +L FR G +D+ + ++ + +L++D ++S I R G
Sbjct: 59 FGSIRSLAAFRLTGGTKDYAILGSDSGRIVILEYDPKTSSFIKLHQETYGKSGARRIVPG 118
Query: 67 QIGIIDPDCRLIGLHLYDG--LFKVIPFDNKGQLKEAFNIR-------LEELQVLDIKF- 116
Q DP R + + + L ++ D L + + + + +D+ F
Sbjct: 119 QYLATDPKGRSVMISAMEKAKLVYILNRDAAANLTISSPLEAHKNAAIIHHIVGVDVGFE 178
Query: 117 --LYGCAKPTIVVLYQD------NKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLI 168
LY + QD N + + YE+ L V WS+ D A+LL+
Sbjct: 179 NPLYAALEVDYTESDQDPTGEAFNNAEKLLTFYELDLGLNHVVRK-WSEPT-DPRANLLV 236
Query: 169 PVP------------PPLCGVLIIGEETIVYCSANAFK---AIPIR--PSITKAYGRVDA 211
VP P GVL+ E+ I+Y + ++ + IP R P K G +
Sbjct: 237 QVPGGQLATSDRFDGP--SGVLVCCEDHIIYRNMDSPQHRVPIPRRRHPLEDKERGLIIV 294
Query: 212 DGSR--------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVY 263
+LL G L+ + I HE E V LKI+ +AS++ L + ++
Sbjct: 295 SAVMHKMKGAFFFLLQSEEGDLYKVTIDHEDEDVRALKIKYFDTVPVASSLCILKSGFLF 354
Query: 264 IGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLG-------------------------- 297
+ S +G+ L + D Y N G
Sbjct: 355 VASEFGNHHLYQFQKLGDDDNEPEFSSTDYPNFGMADPTVPLPAAYFKPRPLDNLVLADE 414
Query: 298 -----PIVDFCVVD-LERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGM 350
PI+D V++ L Q+ G S R +R+G+ + E S +L GI +
Sbjct: 415 IESLDPIIDSKVMNILPNSDTPQIFAACGRGARSSFRTLRHGLEVEEVVSSDLPGIPNAV 474
Query: 351 WSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLV 410
W+ R DDP+D+++++SF++ T +L++ + +EE + GF S TL + L+
Sbjct: 475 WTTRIKEDDPYDSYIILSFVNGTLVLSIG--ETIEEVQDTGFLSSAPTLAVQQIGSDALL 532
Query: 411 QVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILT 469
QV +R V + R NEW+ P G ++ AT N QV++A LVY E+ +G L
Sbjct: 533 QVHPQGIRHVLADRRV--NEWRVPQGRTIVAATTNKRQVVVALSSAELVYFELDLEGQLN 590
Query: 470 EVKHAQ-LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEI 528
E + + + + L + + + AVG D +VRI SL + + L
Sbjct: 591 EYQDRKAMGSTVLALSVGEVPPGRQRTPYLAVGC-EDQTVRIVSLDPESTLETISLQALT 649
Query: 529 IPRSVLLCA---------FEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPIT 579
P S + A + ++ L +G LL +L+ G+LTD + LGT+P+
Sbjct: 650 APPSAICIADMLDSSIDKNQPTMFVNIGLQNGVLLRTVLDPVNGQLTDTRTRFLGTRPVK 709
Query: 580 LRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEG 639
L + + + A S R + Y+ L ++ + + + H F++ P+ L
Sbjct: 710 LMRVQVQRNSSILALSSRSWLNYTHQGLLHFTPLIFETLDHAWEFSAELCPEGLIGISGS 769
Query: 640 ELTIGTIDDI-QKLHIRSIPLGEHPRRICHQEQSRTF-----------------AICSLK 681
L I I + KL S+PL PR+ ++ F + L+
Sbjct: 770 VLRIFQIPRLGTKLKQDSMPLTYTPRKFIPHPMNQYFYMIEADHRVMGDDAAKEKLAELR 829
Query: 682 NQSCAEESEM-------------------HFVRLLDDQTFEFISTYPLDTFEYGCSILSC 722
+ + E+ +R+LD + LD E SI
Sbjct: 830 QRGVKYDQEVVDLPPEVFGRPKAPAGTWGSCIRILDPINKATVKVVHLDNNEAAFSIAIV 889
Query: 723 SF-SDDSNVYYCVGTA---YVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNA 778
F + +S ++ CVGTA ++ P R F L+L+ + E +L A
Sbjct: 890 PFAARNSELFLCVGTASSTFLAPRSCSSGFIRTYAFTNGGADLELVHKTEADDVPMALMA 949
Query: 779 FNGKLLAAINQKIQLY----KWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLM 834
F G+L A + + +++Y K +LR T+ S + + T+G I+VGD+
Sbjct: 950 FQGRLCAGVGKSLRIYEIGKKKLLRKVETKTYGSAI--------VTLNTQGSRIIVGDMQ 1001
Query: 835 KSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSE--GAT 892
+SI ++K E + A D W ++ ++D + N+F R +S+
Sbjct: 1002 ESIVYAVFKPPENRLLIFADDSQPRWTTSAVMVDYTTIAAGDKFGNVFINRLDSKISDQV 1061
Query: 893 DE---------ERGRLE-------VVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIF 936
DE E+G L ++ +H+G+ V SLV G +++
Sbjct: 1062 DEDPTGAGILHEKGLLMGAPHKTGMIAHFHVGDIVTSIHKISLV-------AGGREVLLY 1114
Query: 937 GTVNGVIGVIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFL 995
++G IG++ + E F+ L+ ++R + G +H WR + V K +
Sbjct: 1115 TCLHGTIGILVPFVSKEDVDFISTLEQHMRSEKLSLVGRDHLAWRGY-----YVPVKAVV 1169
Query: 996 DGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
DGDL E F L + I+ ++ +V E+ K++E+L
Sbjct: 1170 DGDLCEQFARLPANKQSAIAVELDRTVGEVLKKLEQL 1206
>gi|301110252|ref|XP_002904206.1| pre-mRNA-splicing factor RSE1 [Phytophthora infestans T30-4]
gi|262096332|gb|EEY54384.1| pre-mRNA-splicing factor RSE1 [Phytophthora infestans T30-4]
Length = 1197
Score = 222 bits (565), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 284/1175 (24%), Positives = 468/1175 (39%), Gaps = 177/1175 (15%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
++ ++G + L+ FR G +D+L + ++ K VL+ + SS R
Sbjct: 52 VISTQVFGVVRALQPFRLTGGDRDYLVVGSDSGKIVVLEVNPTSSRFEARQSETYGKTGC 111
Query: 61 RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLK-----EAFNIRLEELQVLD 113
R GQ DP R LIG L V+ D +L EA L V+
Sbjct: 112 RRITPGQYLAADPKGRAVLIGAVEKQQLVYVMNRDASSRLTISSPLEAHRSNAIHLGVVG 171
Query: 114 IKFLYGCAKPTIVVLYQDNKDARH----------VKT---YEVALKDKDFVEGPWSQNNL 160
+ G P L D +A VKT YE+ L + V WS+ +
Sbjct: 172 LDV--GFENPIFASLELDYAEADADPSGQAAREVVKTLVYYELDL-GLNHVTRRWSEQVV 228
Query: 161 DNGADLLIPVPPPLCG---VLIIGEETIVYCSANAFK---AIPIRPS------ITKAYGR 208
A+ L+ VP G VL++GE T+ Y + + AIP R I A
Sbjct: 229 -RSANRLVAVPGGGDGPGGVLVLGENTVQYKNEGHPELTCAIPRREGEHRDIIIVSAATH 287
Query: 209 VDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSY 268
D LL G L+ + + + V +KI+ +AS++ +++ S +
Sbjct: 288 KQRDLFFVLLQSELGDLYKISLDYSGNVVEEIKIQFFDTIPVASSMCITKTGLLFCASEF 347
Query: 269 GDSQLIKL-------------NLQPDAKGSYVEVLERYVNLG---------PIVDFCVVD 306
+ L + +L D L + NL P+ V D
Sbjct: 348 SNHYLFQFLSIGEGDDAAKCSSLAMDPTEFSTFPLRKLTNLALASSSASLSPVTQLLVDD 407
Query: 307 LERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSLRSSTDDPFDTFL 365
L + Q+ G SLR++R+G+ I E A+ L G+ K +W L+ S DP+D ++
Sbjct: 408 LANEQTPQMYALCGNNNRSSLRVLRHGLPITEMAASALPGVAKAVWCLKESYADPYDKYI 467
Query: 366 VVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSR 425
VVSF E L + + + +EE GF +L + +Q+ + +R V
Sbjct: 468 VVSF--EDATLVLEVGETVEEVAQSGFLRDHGSLLVALLEDDSKLQIHTNGIRHVPKF-- 523
Query: 426 ELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD-GILTEVKHAQLEYEISCLD 484
+ EWK+P + AN+ QV+++ GG ++Y E+G G L E L +E+ LD
Sbjct: 524 QPVTEWKAPGKKVIEHCAANSRQVVISLAGGEIIYFELGQSGELAEKGKLDLGFEVCSLD 583
Query: 485 INPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPRSVLLCAFEG--- 540
+ + E Q AVG W D +VRI SL P+ K L P ++ L +
Sbjct: 584 LGEVPEGRQRFQFMAVGSW-DNTVRILSLDPNELFRQKSTLALTSHPHTLCLAQLQNEPS 642
Query: 541 ---------ISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHV 591
+L L +G L L++ T L D + LGT P+ L + + +
Sbjct: 643 TPDSEHSSQALFLSIGLDNGVLQQSLIDPITATLADSRSRFLGTNPVKLFRVAVEGKRSI 702
Query: 592 FAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSL-AIAKEGELTIGTIDDI- 649
A S R + Y + + ++ + +S+ FNS P + A+ EG + I T+D +
Sbjct: 703 LALSSRAWISYFHQTRRHLTPLSCELLSYASSFNSEQCPGGIVALTNEG-MKILTVDQLG 761
Query: 650 -----QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMH------------ 692
QK ++R P + + H R I S N+ A H
Sbjct: 762 DTFNQQKCNLRYTPR----KAVVHSPSRRLVVIESDHNEYGAAYKRQHGLQIPDIRSAND 817
Query: 693 -----------------------------FVRLLDDQTFEFISTYPLDTFEYGCSILSCS 723
VR++D + + + LD E SI +C
Sbjct: 818 LEDEAEDEINDALLFPRGPLPAEKDKWASCVRIIDPASCQTVVCEELDVDERARSIAACV 877
Query: 724 FSD-DSNVYYCVGTA--YVLPEENEPTKGRILVF-IVEDGKLQLIAEKETKGAVYSLNAF 779
F D + VGT L + P G + V+ +VE +L L+ E +++ F
Sbjct: 878 FHDRGGEAFIIVGTVKKMQLHPQKAPAGGYLRVYRVVEGMQLVLVHTTEIDDIPHAMCEF 937
Query: 780 NGKLLAAINQKIQLY----KWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMK 835
G+LL ++ + +++Y K MLR R S + + ++ GD I D+ +
Sbjct: 938 QGRLLVSVGRALRIYDLGKKKMLRKCENRNFPS--------ILVELKAAGDRIYASDMHE 989
Query: 836 SISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSE------ 889
S + YK +E + A D ++++ +LD D GA+ N+F R SE
Sbjct: 990 SFHFVKYKKDENQLVIFADDCVPRFITSSVLLDYDTLCGADKFGNVFVSRLPSEVSDEID 1049
Query: 890 -----------GATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGT 938
G + +LE V ++H+G+ V SLV G VI+ T
Sbjct: 1050 NPTGNRILWDSGLLNGAPNKLEQVAQFHVGDVVTSMVRSSLVP-------GGTEAVIYAT 1102
Query: 939 VNGVIG-VIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDG 997
+ G IG +I E F L+ +R+ + G +H +RS+ + KN DG
Sbjct: 1103 IMGRIGALIPFTSREDVDFYTHLEMYMRQEQPPLCGRDHLSYRSY-----YIPVKNITDG 1157
Query: 998 DLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
DL E F LS + +++ ++ + E+ K++E++
Sbjct: 1158 DLCEQFSSLSVEKQASVAEDLDRTPAEVLKKLEDI 1192
>gi|392593521|gb|EIW82846.1| hypothetical protein CONPUDRAFT_81012 [Coniophora puteana RWD-64-598
SS2]
Length = 1213
Score = 221 bits (564), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 266/1177 (22%), Positives = 481/1177 (40%), Gaps = 178/1177 (15%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
++G I +L FR G +D+ + ++ + +L +D ++S + R
Sbjct: 58 VFGSIRSLAAFRLTGSTKDYAIVGSDSGRIVILDYDPKTSSFVKLHQETFGKSGARRIVP 117
Query: 66 GQIGIIDPDCRLIGLHLYDG--LFKVIPFDNKGQLKEAFNIRLEELQVLD---IKFLYGC 120
GQ DP R + + + L ++ D L + + + + + G
Sbjct: 118 GQYLATDPKGRSVMISAMEKAKLVYILNRDAAANLTISSPLEAHKSSAIIHHIVGLDVGF 177
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDK------------DFVEGPWSQNNLDNGADLLI 168
P L D +A T E A + + V WS+ D A+LL+
Sbjct: 178 ENPLFAALEVDYAEADQDPTGEAAQNAEKMLTYYELDLGLNHVVRKWSEPT-DPRANLLV 236
Query: 169 PVP------------PPLCGVLIIGEETIVYCSANAFK---AIPIRPS----------IT 203
VP P GVL+ E+ I+Y + + IP R I
Sbjct: 237 QVPGGQVASSERFDGP--SGVLVCCEDHIIYRHMDKPQHRVPIPRRSHPLEDPKRGVIIV 294
Query: 204 KAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVY 263
A +LL G L + I H++++V LKI+ +AS++ L + ++
Sbjct: 295 AAVMHKMKGAFFFLLQSEDGDLFKVTIDHDEDEVKSLKIKYFDTVPVASSLCILKSGFLF 354
Query: 264 IGSSYGDSQLIKLNL------QPDAKGSYVE---VLERYV-------------------- 294
+ S +G+ L + +P+ + + E ++
Sbjct: 355 VASEFGNHYLYQFQKLGDDDDEPEFSSTSFPSFGMAESFIPLPHAHFRPRGLDNLALADE 414
Query: 295 --NLGPIVDFCVVD-LERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGM 350
+L PI+D V++ L Q+ T G + R++R+G+ + E S EL GI +
Sbjct: 415 IESLDPILDAKVMNILPNSDTPQIFTACGRGSRSTFRMLRHGLEVEESVSSELPGIPNAV 474
Query: 351 WSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLV 410
W+ + + DDP+D+++++SF++ T +L++ + +EE + GF S TL + L+
Sbjct: 475 WTTKRTEDDPYDSYIILSFVNGTLVLSIG--ETIEEVQDTGFLSSAPTLAVQQIGSDALL 532
Query: 411 QVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILT 469
QV +R V S R NEW+ P G ++ AT N QV++A LVY E+ DG L
Sbjct: 533 QVHPQGIRHVLSDRRV--NEWRVPQGKTIVCATTNKRQVVVALSSAELVYFELDLDGQLN 590
Query: 470 EVKHAQ-LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEI 528
E + + + + L + + E + AVG D +VRI SL + + L
Sbjct: 591 EYQDWKAMGSTVLALSVGEVPEGRQRTPYLAVGC-EDQTVRIISLDPESTLETISLQALT 649
Query: 529 IPRSVLLCA---------FEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPIT 579
P S + A + ++ L +G LL +L+ G+LTD + LGT+PI
Sbjct: 650 APPSAICIADMLDASINKSQPTMFVNIGLQNGVLLRTVLDPINGQLTDTRTRFLGTRPIR 709
Query: 580 LRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEG 639
L + + + A S R + Y+ + ++ + + + + F++ P+ L
Sbjct: 710 LVRVTVQKNPGILALSSRSWLNYTHQSLMHFTPLIFENLDYAWSFSAELSPEGLIGITGS 769
Query: 640 ELTIGTIDDI-QKLHIRSIPLGEHPRRIC--------------HQEQSRTFAICSL---- 680
L I I + KL ++PL PR+ H+ S + A L
Sbjct: 770 VLRIFQIPKLGMKLKQDAVPLSYTPRKFVSHPANQYLYLIQGDHRVMSESAAEKKLQEMR 829
Query: 681 -KNQSCAEE-----------------SEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSC 722
K Q EE + + ++D I T LD E S+
Sbjct: 830 TKGQKVDEEILQLPVEVFGRPKAPAGTWASAICIIDPIEARTIHTVELDNNESAFSVAVV 889
Query: 723 SF-SDDSNVYYCVGTA---YVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNA 778
F + D+ ++ VGTA + P R F E L+L+ + ET ++ A
Sbjct: 890 PFAARDNELHLVVGTAADTLLTPRSCRSGYLRTYRFTDEGRSLELLHKTETDDVPLAVMA 949
Query: 779 FNGKLLAAINQKIQLY----KWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLM 834
F G+L+A + + ++LY K +LR + S + + T+G I+VGD+
Sbjct: 950 FQGRLIAGVGKSLRLYEIGKKKLLRKAENKSFASAI--------VTLNTQGSRIIVGDMQ 1001
Query: 835 KSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGA 891
+S+ YK E + A D W++A+ ++D + N+F R + S+
Sbjct: 1002 ESVHFAAYKAPENRLLIFADDMQPRWVTALTMVDYTTIAVGDRFGNVFINRLDMRVSDQV 1061
Query: 892 TDE--------ERGRL-------EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIF 936
D+ E+G+L +++ +H+G+ + SLV G +++
Sbjct: 1062 DDDPTGAGILHEKGQLSGAPHKTKLLCHFHVGDLITSIHKVSLV-------AGGREVLLY 1114
Query: 937 GTVNGVIGVIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFL 995
++G IG++ + E F+ L+ ++R + G + WR + K V +
Sbjct: 1115 TGIHGTIGILVPFVSKEDVDFISTLEQHMRSEQSSLVGRDQLSWRGYYTPVKAV-----V 1169
Query: 996 DGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
DGDL E+F L+ ++ I+ ++ +V E+ K++E+L
Sbjct: 1170 DGDLCEAFARLTGSKQSAIAGELDRTVGEVLKKLEQL 1206
>gi|345563727|gb|EGX46712.1| hypothetical protein AOL_s00097g460 [Arthrobotrys oligospora ATCC
24927]
Length = 1308
Score = 220 bits (561), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 265/1145 (23%), Positives = 481/1145 (42%), Gaps = 138/1145 (12%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
++G I L FR G +D++ I ++ + +L++ ++ + R
Sbjct: 178 VFGIIRCLAAFRLAGSGKDYIIIGSDSGRITILEYIPTQNKFKRLHLETFGKSGVRRVVP 237
Query: 66 GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YGC 120
GQ DP R LI + L V+ + + +L + + + L G
Sbjct: 238 GQYLACDPRGRACLIASVEKNKLVYVLNRNAQAELTISSPLEAHKPHCLVFALCALDVGY 297
Query: 121 AKPTIVVLYQDNKDARHVKT---YEVALKDKDFVE---------GPWSQNNLDNGADLLI 168
P VL D + T Y A K + E WS+ ++ ++ L+
Sbjct: 298 ENPIFAVLEMDYSETDQDPTGEAYREAEKSLTYYELDLGLNHVVRKWSEP-VERSSNFLL 356
Query: 169 PVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRV-DADGSRY------ 216
VP P GVL+ E+ + Y N + +R I + G D + R+
Sbjct: 357 HVPGGSDGP--SGVLVCSEDAVTYRHMNQPE---LRVPIPRRRGACEDPNRKRFITCGVM 411
Query: 217 ---------LLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSS 267
LL G L + + ++ V L+I+ IA+++ L + +++ S
Sbjct: 412 HKMRGAFFFLLQSEDGDLFKVGLDFDENSVVRLRIKYFDTVPIATSLCILKSGFLFVASE 471
Query: 268 YGDSQLIKLNLQPD------------AKGSY--------VEVLERYVNLGPIVDFCVVDL 307
G+ + D +Y + ++E ++ ++D + +L
Sbjct: 472 NGNHHFYQFEKLGDDDNETEFTSDDEINAAYFSPRLAENLALMESVDSMNSLLDCQIANL 531
Query: 308 ERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIK-GMWSLRSSTDDPFDTFLV 366
Q+ T GA S R +++G+ ++E S EL G +W+ + ++ D +D ++V
Sbjct: 532 TNDDAPQIYTICGANARSSFRTLKHGLEVSEIVSSELPGGPVAVWTTKLASQDEYDAYIV 591
Query: 367 VSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRE 426
+SF + T +L++ + +EE GF S TL + L+QV +R + + R
Sbjct: 592 LSFTNGTLVLSIG--ETVEEVTDTGFLSSAPTLAVQQLGEDGLLQVHPKGIRHIRADRRV 649
Query: 427 LRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTEVK-HAQLEYEISCLD 484
NEW +P S+ A+ N+ QV +A G +VY E+ DG L E + ++ ++CL
Sbjct: 650 --NEWPAPQHRSIVAASTNSRQVAIALSSGEIVYFELDSDGQLAEYEDKKEMSGTVTCLS 707
Query: 485 INPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPRSVLLCAFEGIS- 542
+ + E S AVG D +VRI SL PD L +K P ++ + + +
Sbjct: 708 LGDVPEGKMRSSFLAVGC-DDSTVRILSLDPDSTLESKSVQALTSAPTALRIMSMPDSAG 766
Query: 543 --------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAA 594
+L L G L +L+ TGELTD + LG QP+ L + S+ TT + A
Sbjct: 767 GDSQRSTLFLHIGLYSGVYLRTVLDTITGELTDTRTRFLGPQPVKLFSVSAAGTTAILAL 826
Query: 595 SDRPTVIYSSNK-KLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDI-QKL 652
S +P + Y++ + L + ++ + +++ C F+S P+ + + L I T++ I + L
Sbjct: 827 SSKPWLGYTNGSGQYLLTPLSYQSLTYGCTFSSEQCPEGMVGIQGNNLRIFTVEKISENL 886
Query: 653 HIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEES----------------------- 689
SIPL PR++ + F + S + + ES
Sbjct: 887 QQESIPLSYTPRKMAKHPTAPYFYVVSSDADTLSPESRNNLIAKAQNGDSTELPPSEFGY 946
Query: 690 ---EMHF---VRLLDDQTFEFISTYPLDTFEYGCSILSCSFSD-DSNVYYCVGTAY-VLP 741
E H+ V ++D E L E SI FS D + VGTA ++
Sbjct: 947 PRCEGHWASCVEVVDPINKEVTHKIDLGENEAALSIAVVPFSSQDDEHFLVVGTAKDLVL 1006
Query: 742 EENEPTKGRILVFIVEDG--KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLR 799
+ G V+ + DG +Q I + + + A + AF+G+LLA I Q +++Y ++
Sbjct: 1007 NPRSSSCGYAHVYRLSDGGKTIQFIHKTKLEYAPLAFAAFHGRLLAGIGQDLRIYDLGMK 1066
Query: 800 DDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNAN 859
R+ Q+ + L + T+ + I+V D+ +S++ ++YK +E + A D
Sbjct: 1067 Q-LLRKTQATVAPN---LITGLTTQNNRIIVSDIQESVTYVVYKQQENKLIPFADDMVPR 1122
Query: 860 WMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDEERGRLEVVGEYHLGEFVNRFRH 916
W + +LD + G + NL+ VR K SE A ++ + + +L NR
Sbjct: 1123 WTTCTALLDYESVAGGDKFGNLWVVRAPPKASEEADEDPSSGYLLHEKSYLQGAPNRVNL 1182
Query: 917 GS--LVMRLPDS------DVGQIPTVIFGTVNGVIGVIAS-LPHEQYLFLEKLQTNLRKV 967
S +P S G V++ + G IG++ + E F + L+ ++R
Sbjct: 1183 MSHFFTQDVPTSIQKVNLVAGGRDCVVWSGLQGTIGIMVPFISREDVDFFQTLEQHMRTE 1242
Query: 968 IKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCK 1027
+ G +H +RS+ V K +DGDL E F L + I+ ++ S+ E+ +
Sbjct: 1243 DAPLAGRDHLIYRSY-----YVPVKGVIDGDLCERFALLPYDKKQMIAAELDRSIREVER 1297
Query: 1028 RVEEL 1032
+V ++
Sbjct: 1298 KVADM 1302
>gi|255081708|ref|XP_002508076.1| predicted protein [Micromonas sp. RCC299]
gi|226523352|gb|ACO69334.1| predicted protein [Micromonas sp. RCC299]
Length = 1199
Score = 220 bits (560), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 267/1165 (22%), Positives = 471/1165 (40%), Gaps = 165/1165 (14%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
++G I +L FR G +D+L ++ + VLQ+D E + R
Sbjct: 57 VFGVIRSLAAFRLTGANRDYLVCGSDSGRIVVLQYDKERNRFDKVHQETFGKSGCRRIVP 116
Query: 66 GQIGIIDPDCRLIGLHLYDG--LFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLY---GC 120
GQ +DP R I + + L V+ D + L + + + V+ G
Sbjct: 117 GQFLCVDPKGRAIMIAAIEKAKLVYVLNRDTEANLTISSPLEANKSHVVAFHVAALDCGL 176
Query: 121 AKPTIVVLYQDNKDARHVKTYEVAL---KDKDFVE--------GPWSQNNLDNGADLLIP 169
P + D DA T E A K F E S +DNGA+ LI
Sbjct: 177 DNPVFAAIELDYADADQDSTGEAAAEAQKHLTFYELDLGLNHVARKSSEPIDNGANHLIA 236
Query: 170 VPPPLCG---VLIIGEETIVYCS---ANAFKAIPIRPSITKAYGRVDADGSR-------- 215
VP G VL+ E I+Y + + IP R +++ G + +
Sbjct: 237 VPGGGDGPGGVLVCAENFIIYKNQGHPDVRAVIPRRNALSGDRGVLIVSSATHRTKQQFF 296
Query: 216 YLLGDHAGLLHLLVITHEK--EKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQL 273
+L G ++ + + ++K E V+ +KI+ +I L ++ S +G+ L
Sbjct: 297 FLAQSEYGDIYKVTVEYQKGTEFVSEVKIKYFDTIPPCVSICVLKTGFLFAASEFGNHAL 356
Query: 274 IKLNLQPD-------AKGSYVEVLERYV-------------------NLGPIVDFCVVDL 307
+ D + + VE E Y +L P++D +L
Sbjct: 357 YQFQGIGDDEDDVESSSATLVETDEGYQPVFFEPRALRNLHPIDDVESLCPVLDMQCHNL 416
Query: 308 ERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTDDPFDTFLV 366
+ Q+ G +LR++R G+ ++E A L G ++++R S D FD ++V
Sbjct: 417 VAEETPQLYALCGTGPRSTLRVLRQGVALSEMAVSPLPGNPNAVFTVRKSASDEFDAYIV 476
Query: 367 VSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRE 426
VSF + T +L++ + +EE GF TL + L+QV G +R + + R
Sbjct: 477 VSFTNATLVLSIG--ETVEEVSDSGFLGTVPTLSASLLGDDSLLQVHPGGLRHIRADKR- 533
Query: 427 LRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD-GILTEVKHAQLEYEISCLDI 485
NEW++P +V T NA QV++A GG L+Y E+ G L EV+ + +++CLDI
Sbjct: 534 -INEWRTPGRKTVTRVTTNARQVIIALSGGELIYFELDQTGQLMEVEKLETSGDVACLDI 592
Query: 486 NPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEII---PRSVLLCAFEGIS 542
P+ E ++ AVG + D +VRI SL + + + +G + + P S+L+ +G
Sbjct: 593 GPVPEGHLRNRFLAVGSF-DSTVRILSLGAEDCL--QTMGVQALAAAPNSLLMLRDDGSG 649
Query: 543 --YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTV 600
YL L +G LL ++ TG+L+D + LG +P L S + + A S RP +
Sbjct: 650 SIYLNVGLTNGVLLRADVDSVTGQLSDTRARFLGARPPKLSAVSVQGKAAMMALSSRPWL 709
Query: 601 -IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDI-QKLHIRSIP 658
+ + S ++ + + H PF+S A P+ + L I ++ + ++ + R+
Sbjct: 710 GHFDLANRFALSPLSYEPLEHAAPFSSDACPEGVVAVAGNTLRIVAVERLGEQFNQRTCK 769
Query: 659 LGEHPRRICHQEQSRTFAICSLKNQSC--------------------------------- 685
L PR++ T A+ S
Sbjct: 770 LRYTPRQMSVNVDRNTLAVVECDQSSVPYDERTGLEGAAGEKDTKMDVEGEEDDEEEDEV 829
Query: 686 ----AEE---------SEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYY 732
AE+ S VR++D + + E F ++
Sbjct: 830 TMTPAEQFGAPKAPPGSWASCVRIVDPASASTKQIVEMTGNEAALCCCHVYFPQADELFL 889
Query: 733 CVGTAYVLPEENEPTKGRI--LVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQK 790
VG+A L ++G L +DG ++L + G ++ F G+LL +
Sbjct: 890 AVGSAVSLTFSPRDSEGGFIHLYRYTQDGGIELFHKTPLDGVPGAMCGFKGRLLVGVGNT 949
Query: 791 IQLY----KWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEE 846
++LY K +LR R + + +G+ I VGD+ +S + YK E+
Sbjct: 950 LRLYDFGKKKLLRKVENRNFPN--------FIKTIHAQGERIYVGDVQESFHYVRYKRED 1001
Query: 847 GAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR------------------KNS 888
G++ A D ++A LD D G + N+F R
Sbjct: 1002 GSMYIVADDVQPRHVTAACPLDYDTIAGGDRFGNVFVSRLAQDVSDEIEEDPTGGKTAYG 1061
Query: 889 EGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIAS 948
+GA + ++ V ++H+GE V G+L G + ++I+ T+ G +G +
Sbjct: 1062 QGALNGASHKINQVTQFHVGETVCALTKGTL-------QAGGLESMIYATLMGTLGALMP 1114
Query: 949 LPH-EQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLS 1007
+ E F L+ ++R+ + + G +H +RS K+ +DGDL E + L
Sbjct: 1115 FGNREDVDFCTHLEMHMRQELPPLLGRDHLAFRS-----SYFPVKDVIDGDLCEMYTVLP 1169
Query: 1008 RTRMDEISKTMNVSVEELCKRVEEL 1032
+++ M+ +V E+ K++E+L
Sbjct: 1170 HEAQRRVAEDMDRTVSEVLKKLEDL 1194
>gi|348503892|ref|XP_003439496.1| PREDICTED: splicing factor 3B subunit 3-like [Oreochromis niloticus]
Length = 1217
Score = 220 bits (560), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 267/1190 (22%), Positives = 492/1190 (41%), Gaps = 189/1190 (15%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+L V ++G I +L FR G +D++ + ++ + +L++ S + + + + G
Sbjct: 53 LLTVEVFGIIRSLMAFRLTGGTKDYIVVGSDSGRIVILEYHP-SKNMFEKIHQETFGKSG 111
Query: 61 -RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
R GQ +DP R +IG L ++ D +L + + + L +
Sbjct: 112 CRRIVPGQFLAVDPKGRAVMIGAIEKQKLVYILNRDAAARLTISSPLEAHKANTLVYHVV 171
Query: 118 ---YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNG 163
G P L D ++A + T E A + F E N+ L+
Sbjct: 172 GVDVGFENPMFACLEMDYEEADNDPTGEAAANTQQTLTFYELDLGLNHVVRKYSEALEEH 231
Query: 164 ADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR--- 215
+ LI VP P GVLI E I Y N IR I + +D D R
Sbjct: 232 GNFLITVPGGSDGP--SGVLICSENYITY--KNFGDQPDIRCPIPRRRNDLD-DPERGMI 286
Query: 216 --------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAV 261
+L G + + + ++E VT ++++ +A+ + L
Sbjct: 287 FVCSATHKTKSMFFFLAQTEQGDIFKVTLETDEEMVTEIRLKYFDTIPVATAMCVLKTGF 346
Query: 262 VYIGSSYGDSQLIKLN------------------------LQPDAKGSYVEVLERYVNLG 297
+++ S +G+ L ++ QP + V V E+ NL
Sbjct: 347 LFVASEFGNHYLYQIAHLGDDDDEPEFSSAMPLEEGDTFFFQPRPLKNLVLVDEQE-NLS 405
Query: 298 PIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSS 356
PI+ + DL + Q+ G +LR++R+G+ ++E A EL G +W++R
Sbjct: 406 PIMSCQIADLANEDTPQLYVACGRGPRSTLRVLRHGLEVSEMAVSELPGNPNAVWTVRRH 465
Query: 357 TDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGS 416
+D FD +++VSF++ T +L++ + +EE GF T TL C + LVQV
Sbjct: 466 VEDEFDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSLLGEDALVQVYPDG 523
Query: 417 VRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKHA 474
+R + + R NEWK+P ++ N QV++A GG LVY E+ G L E +
Sbjct: 524 IRHIRADKR--VNEWKTPGKKTIVRCAVNQRQVVIALTGGELVYFEMDPSGQLNEYTERK 581
Query: 475 QLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPD---LNLITKEHLGGEIIPR 531
++ ++ C+ + + S+ AVG+ D +VRI SL L ++ + L + P
Sbjct: 582 EMSADVVCMSLANVPPGEQRSRFLAVGL-ADNTVRIISLDPSDCLQPLSMQALPAQ--PE 638
Query: 532 SVLLCAFEGIS--------------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQP 577
S+ + G+ YL L +G LL +L+ TG+L+D + LG++P
Sbjct: 639 SLCIVEMGGVEKQDELGDKGTIGFLYLNIGLQNGVLLRTVLDPVTGDLSDTRTRYLGSRP 698
Query: 578 ITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAK 637
+ L + V A S R + YS + + ++ + + + F S P+ +
Sbjct: 699 VKLFRVRMQGQEAVLAMSSRSWLSYSYQSRFHLTPLSYETLEYASGFASEQCPEGIVAIS 758
Query: 638 EGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQE-----------QSRTFAICSLKNQS 684
L I ++ + + + + PL PR+ + H E + T A + + Q
Sbjct: 759 TNTLRILALEKLGAVFNQVAFPLQYTPRKFVIHPETNNLILIETDHNAYTEATKAQRKQQ 818
Query: 685 CAEE-------------SEM-------------------------HFVRLLDDQTFEFIS 706
AEE +EM VRL++ +
Sbjct: 819 MAEEMVEAAGEDERELAAEMAAAFLNENLPEAIFGSPKAGAGQWASLVRLVNPIQGSTLD 878
Query: 707 TYPLDTFEYGCSILSCSFSDDSNVYYC-VGTAY-VLPEENEPTKGRILVFIVEDG--KLQ 762
L+ E S+ +C F++ + +Y VG A ++ G I + + G KL+
Sbjct: 879 QVQLEQNEAAFSVAACRFTNTGDDWYVLVGVARDMILNPRSVGGGFIYTYRLVSGGEKLE 938
Query: 763 LIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSEC-GHHGHILALYV 821
+ + + ++ F G+ L + + +++Y D G ++L +C H L +
Sbjct: 939 FVHKTPVEDVPLAIAPFQGRALVGVGKLLRIY-----DLGKKKLLRKCENKHVPNLVTGI 993
Query: 822 QTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNL 881
T G ++V D+ +S+ + Y+ E + A D W++ +LD D A+ N+
Sbjct: 994 HTIGQRVIVSDVQESLFWVRYRRNENQLIIFADDTYPRWVTTACLLDYDTMAAADKFGNI 1053
Query: 882 FTVR--KNSEGATDE---------ERGRL-------EVVGEYHLGEFVNRFRHGSLVMRL 923
VR N+ DE +RG L E++ YH+GE V + +L+
Sbjct: 1054 SIVRLPPNTSDDVDEDPTGNKALWDRGLLNGASQKAEIIMNYHVGETVLSLQKTTLI--- 1110
Query: 924 PDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSF 982
G ++++ T++G IG++ HE + F + L+ ++R + G +H +RS+
Sbjct: 1111 ----PGGSESLVYTTLSGGIGILVPFTSHEDHDFFQHLEMHMRSEFPPLCGRDHLSFRSY 1166
Query: 983 NNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
KN +DGDL E F + + +++ ++ + E+ K++E++
Sbjct: 1167 -----YFPVKNVIDGDLCEQFNSMDPHKQKSVAEELDRTPPEVSKKLEDI 1211
>gi|46125735|ref|XP_387421.1| hypothetical protein FG07245.1 [Gibberella zeae PH-1]
Length = 1208
Score = 220 bits (560), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 274/1169 (23%), Positives = 479/1169 (40%), Gaps = 168/1169 (14%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
++G I +L FR G +D+L IA++ + ++++ + + R
Sbjct: 64 VFGIIRSLAAFRLAGSNKDYLIIASDSGRITIIEYLPAQNRFQRLHLETFGKSGVRRVIP 123
Query: 66 GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YGC 120
G+ DP R LI + L V+ +++ +L + + + VL I + G
Sbjct: 124 GEYLACDPKGRACLIASTEKNKLVYVLNRNSQAELTISSPLEAHKPGVLVISMVALDVGY 183
Query: 121 AKPTIVVLYQDNKDARHVKT-------------YEVALKDKDFVEGPWSQNNLDNGADLL 167
+ P L D + T YE+ L V WS + +D A +L
Sbjct: 184 SNPVFAALEIDYSEVDQDSTGQAMEELDTQLVYYELDLGLNHVVRK-WS-DPVDPTASIL 241
Query: 168 IPVP-----PPLCGVLIIGEETIVYCSAN--AFK-AIPIRPSITK--AYGRVDADGSRYL 217
VP P GVL+ GEE I Y +N AF+ AIP R T+ R G +
Sbjct: 242 FQVPGGNDGP--SGVLVCGEENITYRHSNQEAFRVAIPRRRGATEDPNRKRTIVSGIMHK 299
Query: 218 LGDHAGLLHLLVITHEKE------------------KVTGLKIELLGETSIASTISYLDN 259
L + G L+ T + + +V LK++ +AS++ L +
Sbjct: 300 LKGNTGAFFFLLQTDDGDLFKLSIDMIEDEEGNPTGEVKRLKVKYFDTVPVASSLCILKS 359
Query: 260 AVVYIGSSYGDSQLIKL---------------NLQPDAKGSY------------VEVLER 292
+Y+ + +G+ + + D K SY + ++E
Sbjct: 360 GFLYVATQFGNYSFYQFEKLGDDDEELEFYSDDFPADPKASYEPVYFHPRPTENLALVES 419
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMW 351
+ P++D V +L + Q+ T G S R++++G+ +NE + EL GI +W
Sbjct: 420 IPAMNPLLDCKVANLTGEDAPQIYTICGNGPRSSFRMLKHGLEVNEIVASELPGIPSAVW 479
Query: 352 SLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQ 411
+L+ + + +D ++V+SF + T +L++ + +EE GF + TL + L+Q
Sbjct: 480 TLKLNRSEQYDAYIVLSFTNGTLVLSIG--ETVEEVSDSGFLTSVPTLAAQLLGDDGLIQ 537
Query: 412 VTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTE 470
V +R + + + NEW +P S+ ATANA QV +A G +VY E+ DG L E
Sbjct: 538 VHPKGIRHIRNGNV---NEWAAPQHRSIVAATANAHQVAVALSSGEIVYFEMDADGSLAE 594
Query: 471 VKHAQLEY-EISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEI 528
+ + ++CL + + E S AVG D +VRI SL P+ L K
Sbjct: 595 YDEKKEMFGTVTCLSLGDVPEGRLRSSFLAVGC-DDCTVRILSLDPESTLENKSVQALTA 653
Query: 529 IPRSVLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRT 582
P S+ + A E S YL L G L +L+ TGELTD ++ LG + + L
Sbjct: 654 APTSLAIIAMEDSSSGGSTLYLHIGLHSGVYLRTVLDEITGELTDTRQKFLGPKEVRLFQ 713
Query: 583 FSSKNTTHVFAASDRPTVIYSS--NKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGE 640
+ + T V S RP + Y+ K + + +N ++ F+S + + +
Sbjct: 714 VTVQGKTCVLGLSSRPWLGYADPITKGFVVTPLNYVDLEWGWNFSSEQCEEGIVGIQGQS 773
Query: 641 LTIGTIDDIQKLHI-RSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFV----- 694
L I ID + + I +SIPL P+++ F N + E +
Sbjct: 774 LRIFNIDRLGETLIQKSIPLTYTPKKLVKHPDQPLFYTIEADNNTLPPELRAQLLADPGV 833
Query: 695 -----RLLDDQTF-----------------------EFISTYPLDTFEYGCSILSCSFSD 726
R+L + F + + T L+ E S FS
Sbjct: 834 VNGDSRVLPPEDFGYPKGTRRWASCINVIDPLSEEGQVLQTIDLENNEAAVSAAIVPFSS 893
Query: 727 -DSNVYYCVGTA---YVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGK 782
D+ + +GT V P I F+ +L+ I + + + +L AF G+
Sbjct: 894 QDNESFLVIGTGKDMVVNPRSFSEGYLHIYRFLEGGRELEFIHKTKVEEPPLALLAFQGR 953
Query: 783 LLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIY 842
+L A+ +++Y +R R+ Q+E I++L T+G I+VGD+ + ++ ++Y
Sbjct: 954 VLVAVGTSLRIYDLGMR-QMLRKSQAEVATQ-QIVSL--NTQGSRIIVGDVQQGVTYVVY 1009
Query: 843 KHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDEERG-- 897
K + D A W + ++D + G + N+F VR K SE A +E+ G
Sbjct: 1010 KPASNKLIPFVDDTIARWTTCTTMVDYESVAGGDKFGNMFIVRCPEKASEEADEEQSGLH 1069
Query: 898 -------------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIG 944
R+ ++ ++ + SLV VG +++ + G IG
Sbjct: 1070 LINARDYLHGTPHRVSLMCHFYTQDIPTSITKASLV-------VGGQEVLLWSGIMGTIG 1122
Query: 945 V-IASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESF 1003
V I + E F + L+ +LR + G +H +R + K +DGDL E +
Sbjct: 1123 VFIPFVSREDADFFQNLEQHLRTEDPPLAGRDHLMYRGY-----YAPVKGVIDGDLCERY 1177
Query: 1004 LDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
L + I+ ++ SV E+ +++ L
Sbjct: 1178 NLLPNDKKQMIAGELDRSVREIERKISYL 1206
>gi|390601867|gb|EIN11260.1| hypothetical protein PUNSTDRAFT_118747 [Punctularia strigosozonata
HHB-11173 SS5]
Length = 1214
Score = 219 bits (559), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 265/1175 (22%), Positives = 483/1175 (41%), Gaps = 173/1175 (14%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
++G I +L FR G +D+ + ++ + +L +D ++S + R
Sbjct: 58 VFGSIRSLAAFRLTGGTKDYAIVGSDSGRIVILDYDPKTSSFVKLHQETYGKSGARRIVP 117
Query: 66 GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIR-------LEELQVLDIKF 116
GQ DP R +IG L ++ D L + + + + LD+
Sbjct: 118 GQYLATDPKGRAVMIGAMEKAKLVYILNRDAAANLTISSPLEAHKNAAIIHHIVGLDV-- 175
Query: 117 LYGCAKPTIVVLYQD-------------NKDARHVKTYEVALKDKDFVEGPWSQNNLDNG 163
G P L D N + + YE+ L V S +D
Sbjct: 176 --GYENPMFAALEVDYTESDQDPSGEAFNNAEKMLTYYELDLGLNHVVRK--SSEPVDPR 231
Query: 164 ADLLIPVP----------PPLCGVLIIGEETIVYCSANAFK---AIPIR------PS--- 201
A+LL+ VP P GVLI E+ I+Y + + IP R P
Sbjct: 232 ANLLVQVPGGQTGDRFDGP--SGVLICCEDHIIYRHPSGSQHRVPIPRRRHPLDDPERGV 289
Query: 202 -ITKA-YGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDN 259
IT A ++ D L + L + + + E V LKI+ +A+ + L +
Sbjct: 290 IITFALMHKIKGDFFFLLQSEEGDLFKVTIDYNGDEDVNALKIKYFDTVPVATGLCILKS 349
Query: 260 AVVYIGSSYGDSQLIKLNLQPD-------AKGSYVE------------------------ 288
+++ S +GD L + D + Y +
Sbjct: 350 GFLFVASEFGDHHLYQFQKLGDDDDEPEFSSADYPQRGMAYPSQALPRAWFRPRPLDNLA 409
Query: 289 VLERYVNLGPIVDFCVVDL-ERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI 347
+L+ +L PI+D V++L Q+ G + R +R+G+ + E S +L GI
Sbjct: 410 LLDTVESLSPIIDARVMNLLPNSDTPQIFAACGRGARSTFRTLRHGLEVEETVSSDLPGI 469
Query: 348 -KGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIY 406
+W+ + DD D+++++SF++ T +L++ + +EE + GF S T+
Sbjct: 470 PNAVWTTKLKEDDAHDSYIILSFVNGTLVLSIG--ETIEEVQDTGFLSSAPTIAVQQIGA 527
Query: 407 NQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-D 465
+ L+QV +R V + R NEW+ P G ++ AT N QV++A LVY E+ +
Sbjct: 528 DALLQVHPQGIRHVLADRRV--NEWRVPAGKTIVTATTNKRQVVVALSSAELVYFELDLE 585
Query: 466 GILTEVKHAQ-LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHL 524
G L E + + + + L I + E + AVG D +VRI SL + + L
Sbjct: 586 GQLNEYQDRKAMGSTVLALSIGEVPEGRQRTPFLAVGC-EDQTVRIISLDPESTLETISL 644
Query: 525 GGEIIPRSVLLCA---------FEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGT 575
P + + A ++ L +G LL +L+ G+LTD + LGT
Sbjct: 645 QALTAPPTAICIADMLDASINKVHPTMFVNIGLQNGVLLRTVLDPINGQLTDTRTRFLGT 704
Query: 576 QPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAI 635
+P+ L + + + A S R + Y+ + + ++ + + + F++ PD L
Sbjct: 705 RPVRLIRVNVQRNPAILALSSRSWLNYTHHNLMHFTPLIFENLDFAWSFSAELCPDGLIG 764
Query: 636 AKEGELTIGTIDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEES----- 689
L I I + KL ++PL PR+ +R F + +++ EE+
Sbjct: 765 IAGSVLRIFQIPRLGTKLKQDAMPLTYTPRKFIPHPANRYFYLIEGDHRTWGEEAVQKKV 824
Query: 690 ------------EM-------------------HFVRLLDDQTFEFISTYPLDTFEYGCS 718
EM +R++D + ++T P+D E S
Sbjct: 825 AELRQAGKQVDEEMLSLPADTFGRPKAPAGTWASLIRIIDPVDAKTVATIPIDNNECAFS 884
Query: 719 ILSCSF-SDDSNVYYCVGTA---YVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVY 774
I + + ++ VGTA ++ P R F E L+L+ + ET
Sbjct: 885 IAVVPWAARGGELHLVVGTAQDTFLAPRSCTSGFLRTYAFTNEGAGLELLHKTETDDVPM 944
Query: 775 SLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY-VQTRGDFIVVGDL 833
++ AF G+L+A + + +++Y D G ++L + + A+ + T+G I+VGD+
Sbjct: 945 AVMAFQGRLVAGVGKSLRIY-----DIGKKKLLRKVENKSFATAIVTLATQGSRILVGDM 999
Query: 834 MKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEG 890
+S++ +YK E + A D W+S+ ++D + + + N+F R K SE
Sbjct: 1000 QESMAYAVYKPPENRLLVFADDVQPRWISSSTMVDYNTVIAGDRFGNVFVNRLDAKVSEQ 1059
Query: 891 ATDEERG------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIP----TVIFGTVN 940
D+ G + ++G H + + + G +V L S V +P +++ ++
Sbjct: 1060 VDDDPTGAGILHEKGLLMGAPHKTKMLCHYHVGDIVTSL--SKVALVPGGREVILYTGLH 1117
Query: 941 GVIGVIAS-LPHEQYLFLEKLQTNLRKVIK--GVGGLNHEQWRSFNNEKKTVDAKNFLDG 997
G IGV+ + E F+ L+ ++R + G+ G +H WR + V K +DG
Sbjct: 1118 GTIGVLVPFVSKEDVDFISTLEQHMRTELTQFGLVGRDHLSWRGY-----YVPVKAVVDG 1172
Query: 998 DLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
DL E+F L + I+ ++ +V E+ K++++L
Sbjct: 1173 DLCEAFATLPAPKQSSIAGELDRTVGEVLKKLDQL 1207
>gi|408400551|gb|EKJ79630.1| hypothetical protein FPSE_00190 [Fusarium pseudograminearum CS3096]
Length = 1212
Score = 219 bits (557), Expect = 8e-54, Method: Compositional matrix adjust.
Identities = 274/1174 (23%), Positives = 483/1174 (41%), Gaps = 168/1174 (14%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+L ++G I +L FR G +D+L IA++ + ++++ + +
Sbjct: 59 LLSHDVFGIIRSLAAFRLAGSNKDYLIIASDSGRITIIEYLPAQNRFQRLHLETFGKSGV 118
Query: 61 RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL- 117
R G+ DP R LI + L V+ +++ +L + + + VL I +
Sbjct: 119 RRVIPGEYLACDPKGRACLIASTEKNKLVYVLNRNSQAELTISSPLEAHKPGVLVISMVA 178
Query: 118 --YGCAKPTIVVLYQDNKDARHVKT-------------YEVALKDKDFVEGPWSQNNLDN 162
G + P L D + T YE+ L V WS + +D
Sbjct: 179 LDVGYSNPVFAALEIDYSEVDQDSTGQAMEELDTQLVYYELDLGLNHVVRK-WS-DPVDP 236
Query: 163 GADLLIPVP-----PPLCGVLIIGEETIVYCSAN--AFK-AIPIRPSITK--AYGRVDAD 212
A +L VP P GVL+ GEE I Y +N F+ AIP R T+ R
Sbjct: 237 TASILFQVPGGNDGP--SGVLVCGEENITYRHSNQETFRVAIPRRRGATEDPNRKRTIVS 294
Query: 213 GSRYLLGDHAGLLHLLVITHEKE------------------KVTGLKIELLGETSIASTI 254
G + L + G L+ T + + +V LK++ +AS++
Sbjct: 295 GIMHKLKGNTGAFFFLLQTDDGDLFKLSIDMIEDEEGNPTGEVKRLKVKYFDTVPVASSL 354
Query: 255 SYLDNAVVYIGSSYGDSQLIKL---------------NLQPDAKGSY------------V 287
L + +Y+ + +G+ + + D K SY +
Sbjct: 355 CILKSGFLYVATQFGNYSFYQFEKLGDDDEELEFYSDDFPADPKASYEPVYFHPRPTENL 414
Query: 288 EVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI 347
++E + P++D V +L + Q+ T G S R++++G+ +NE + EL GI
Sbjct: 415 ALVESIPAMNPLLDCKVANLTGEDAPQIYTICGNGPRSSFRMLKHGLEVNEIVASELPGI 474
Query: 348 -KGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIY 406
+W+L+ + + +D ++V+SF + T +L++ + +EE GF + TL
Sbjct: 475 PSAVWTLKLNRSEQYDAYIVLSFTNGTLVLSIG--ETVEEVSDSGFLTSVPTLAAQLLGD 532
Query: 407 NQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GD 465
+ L+QV +R + + + NEW +P S+ ATANA QV +A G +VY E+ D
Sbjct: 533 DGLIQVHPKGIRHIRNGNV---NEWAAPQHRSIVAATANAHQVAVALSSGEIVYFEMDAD 589
Query: 466 GILTEVKHAQLEY-EISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEH 523
G L E + + ++CL + + E S AVG D +VRI SL P+ L K
Sbjct: 590 GSLAEYDEKKEMFGTVTCLSLGDVPEGRLRSSFLAVGC-DDCTVRILSLDPESTLENKSV 648
Query: 524 LGGEIIPRSVLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQP 577
P S+ + A E S YL L G L +L+ TGELTD ++ LG +
Sbjct: 649 QALTAAPTSLAIIAMEDSSSGGSTLYLHIGLHSGVYLRTVLDEITGELTDTRQKFLGPKE 708
Query: 578 ITLRTFSSKNTTHVFAASDRPTVIYSS--NKKLLYSNVNLKEVSHMCPFNSAAFPDSLAI 635
+ L + + T V S RP + Y+ K + + +N ++ F+S + +
Sbjct: 709 VRLFQVTVQGKTCVLGLSSRPWLGYADPITKGFVVTPLNYVDLEWGWNFSSEQCEEGIVG 768
Query: 636 AKEGELTIGTIDDIQKLHI-RSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFV 694
+ L I ID + + I +SIPL P+++ F N + E +
Sbjct: 769 IQGQSLRIFNIDRLGETLIQKSIPLTYTPKKLVKHPDQPLFYTIEADNNTLPPELRAQLL 828
Query: 695 ----------RLLDDQTF-----------------------EFISTYPLDTFEYGCSILS 721
++L + F + + T L+ E S
Sbjct: 829 ADPGVVNGDSKVLPPEDFGYPKGTRRWASCINVIDPLSEEGQVLQTIDLENNEAAVSAAI 888
Query: 722 CSFSD-DSNVYYCVGTA---YVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLN 777
SFS D+ + +GT V P I F+ +L+ I + + + +L
Sbjct: 889 VSFSSQDNESFLVIGTGKDMVVNPRSFSEGYLHIYRFLEGGRELEFIHKTKVEEPPLALL 948
Query: 778 AFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSI 837
AF G++L A+ +++Y +R R+ Q+E I++L T+G I+VGD+ + +
Sbjct: 949 AFQGRVLVAVGTSLRIYDLGMR-QMLRKSQAEVATQ-QIVSL--NTQGSRIIVGDVQQGV 1004
Query: 838 SLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDE 894
+ ++YK + D A W + ++D + G + N+F VR K SE A +E
Sbjct: 1005 TYVVYKPASNKLIPFVDDTIARWTTCTTMVDYESVAGGDKFGNMFIVRCPEKASEEADEE 1064
Query: 895 ERG---------------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTV 939
+ G R+ ++ ++ + SLV VG +++ +
Sbjct: 1065 QSGLHLINARDYLHGTPHRVSLMCHFYTQDIPTSITKASLV-------VGGQEVLLWSGI 1117
Query: 940 NGVIGV-IASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGD 998
G IGV I + E F + L+ +LR + G +H +R + K V +DGD
Sbjct: 1118 MGTIGVFIPFVSREDADFFQNLEQHLRTEDPPLAGRDHLMYRGYYAPVKGV-----IDGD 1172
Query: 999 LIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
L E + L + I+ ++ SV E+ +++ ++
Sbjct: 1173 LCERYNLLPNDKKQMIAGELDRSVREIERKISDI 1206
>gi|367027320|ref|XP_003662944.1| hypothetical protein MYCTH_2304190 [Myceliophthora thermophila ATCC
42464]
gi|347010213|gb|AEO57699.1| hypothetical protein MYCTH_2304190 [Myceliophthora thermophila ATCC
42464]
Length = 1211
Score = 218 bits (556), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 281/1168 (24%), Positives = 490/1168 (41%), Gaps = 167/1168 (14%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
++G I + FR G +D++ +AT+ + ++++ +++ + R
Sbjct: 64 VFGIIRAIASFRLAGSHKDYIILATDSGRIAIIEYQPKTNRFSRIHLETFGKSGVRRVVP 123
Query: 66 GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YGC 120
GQ DP R LI + L V+ + + +L + + + VL + + G
Sbjct: 124 GQYLAADPKGRACLIASIEKNKLVYVLNRNAQAELTISSPLEAHKPGVLVLSLVALDVGY 183
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDK------------DFVEGPWSQNNLDNGADLLI 168
A P L D +A T E A + + + V WS + +D + LL
Sbjct: 184 ANPVFAALEIDYTEADQDPTGEAAKETEAQLVYYELDLGLNHVVRKWS-DVVDPTSSLLF 242
Query: 169 PVP-----PPLCGVLIIGEETIVYCSAN--AFKA-IPIRPSITK--AYGRVDADGSRYLL 218
VP P GVL+ GEE I Y +N AF+ IP R T+ R G + L
Sbjct: 243 QVPGGNDGP--SGVLVCGEENITYRHSNQEAFRVPIPRRRGATEDPQRKRTIVSGVMHKL 300
Query: 219 GDHAGLLHLLVITHEKE------------------KVTGLKIELLGETSIASTISYLDNA 260
AG LV T + + +V LKI+ IA ++ L +
Sbjct: 301 KGSAGAFFFLVQTEDGDLMKVTLDMVEDNDGNPTGEVRRLKIKYFDTIPIAKSLCILKSG 360
Query: 261 VVYIGSSYGDSQLIKL---------------NLQPDAKGSYVEV------LERYV----- 294
++ S +G+ + + D + SY V LE V
Sbjct: 361 FLFAASEFGNHHFYQFEKLGDDDDEQEFTSDDFPTDPRASYNPVYFRPRPLENLVLVESI 420
Query: 295 -NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKG-MWS 352
++ P+VD V +L + Q+ G + R++++G+ ++E + EL G +W+
Sbjct: 421 DSMNPLVDCKVANLTGEDAPQIYAVCGNRARSTFRMLKHGLEVSEIVASELPGTPAAVWT 480
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
+ + D +D ++V+SF + T +L++ + +EE GF + TL L+QV
Sbjct: 481 TKLTKYDEYDGYIVLSFTNATLVLSIG--ETVEEVTESGFLTSVPTLAVQQMGEEGLIQV 538
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTEV 471
+R + NEW +P S+ A N +QV++A G +VY E+ DG L E
Sbjct: 539 HPKGIRHIVQGRV---NEWPAPQHRSIVAAATNENQVVIALSSGEIVYFEMDADGSLAEY 595
Query: 472 -KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEII 529
+ ++ ++ L I + E S AVG D +VRI SL P+ L K
Sbjct: 596 DEKKEMSGTVTSLSIGKVPEGLRRSSFLAVGC-DDCTVRILSLDPESTLEMKSIQALTSA 654
Query: 530 PRSVLLCAFE----GIS-YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFS 584
P S+ + + E G++ YL L G L +L+ TGELTD ++ LG +P L S
Sbjct: 655 PSSLSIMSMEDSTGGMTLYLHIGLHSGVYLRTVLDEVTGELTDTRQKFLGPKPTKLFQVS 714
Query: 585 SKNTTHVFAASDRPTVIYSS--NKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELT 642
+N V A S RP + Y+ K + + +N E+ + FNS + + L
Sbjct: 715 VQNQACVLALSSRPWLGYTDPITKNFVMTPLNYSELEYGWNFNSEQCLEGMVGIHANFLR 774
Query: 643 IGTIDDIQKLHI-RSIPLGEHPRRIC-HQEQSRTFAICS--------LKNQSCAEE---- 688
I TID + + I +SIPL P+R+ H EQ + I S L+ Q A+
Sbjct: 775 IFTIDKLGETMIQKSIPLTYTPKRLVKHPEQPYFYTIESDNNTIPPELRAQLLAQSGAVN 834
Query: 689 ---------------------SEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSF-SD 726
S + V L D+ + + L+ E S F S
Sbjct: 835 GDATVLPPEDFGYPRATGRWASCISIVDPLGDEP-KVLQRIDLEGNEAAVSAAVVPFASQ 893
Query: 727 DSNVYYCVGTAY-VLPEENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVYSLNAFNGKL 783
+ + VGT ++ + T+G I V+ EDG+ L+ I + + + +L F G+L
Sbjct: 894 EGESFLVVGTGKDMVLNPRKFTEGYIHVYRFHEDGRELEFIHKTKVEEPPLALIPFQGRL 953
Query: 784 LAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYK 843
LA I + +++Y LR + L+ G L + +QT+G I+VGD+ + ++ ++YK
Sbjct: 954 LAGIGKMLRVYDLGLR----QLLRKAQGEVAPQLIVTLQTQGSRIIVGDVQQGVTYVVYK 1009
Query: 844 HEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR--KNSEGATDEERGRLEV 901
E + A D W + ++D + G + N++ +R + + +DE +++
Sbjct: 1010 PESNKLLVFADDTINRWTTCTTMVDYESVAGGDKFGNVWILRCPERASQESDEPGSEIQL 1069
Query: 902 VGEYHLGEFVNRFRHGS----------LVMRLPDS------DVGQIPTVIFGTVNGVIGV 945
+ ++ HG+ LP S VG +++ + G +GV
Sbjct: 1070 L-------HARKYLHGAPNRLDLMVHFYTQDLPTSIVKTNLVVGGQDVLVWSGIQGTVGV 1122
Query: 946 -IASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFL 1004
I + E F + L++++R + G +H +R + V K +DGDL E F
Sbjct: 1123 LIPFVSREDVDFFQSLESHMRAEDPPLAGRDHLIYRGY-----YVPVKGVIDGDLCERFS 1177
Query: 1005 DLSRTRMDEISKTMNVSVEELCKRVEEL 1032
L + I+ ++ SV E+ +++ ++
Sbjct: 1178 LLPNDKKQMIAGELDRSVREIERKISDI 1205
>gi|402592185|gb|EJW86114.1| CPSF A subunit region family protein [Wuchereria bancrofti]
Length = 278
Score = 218 bits (556), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 110/265 (41%), Positives = 170/265 (64%), Gaps = 6/265 (2%)
Query: 752 LVFIVEDG--KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSE 809
++F +G +++L+ EKE KGA YS+ + +GKL+ A+N ++L++W +EL+ E
Sbjct: 1 MMFQASEGPERMRLVYEKEIKGAAYSIQSMDGKLVVAVNSCVRLFEWT----ADKELRLE 56
Query: 810 CGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDD 869
C ++ ALY++T+ D I+VGDLM+S+SLL YK E E+ ARD+ NWMSA EI+D
Sbjct: 57 CSDFDNVTALYLKTKNDLILVGDLMRSLSLLSYKSMESTFEKVARDFMTNWMSACEIIDS 116
Query: 870 DIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVG 929
D +LGAEN++NLFTV K+S EE RL+ +G ++LGE VN F HGSL D
Sbjct: 117 DNFLGAENSYNLFTVMKDSFTVFKEEGTRLQELGLFYLGEMVNVFCHGSLTATQVDVAPL 176
Query: 930 QIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTV 989
++++GT +G IGVI +P Y FL+ +Q L + + ++H Q+R+F EK++
Sbjct: 177 YHSSILYGTSDGGIGVIVQMPPVLYTFLQDVQKRLAEYAENCMRISHTQYRTFETEKRSE 236
Query: 990 DAKNFLDGDLIESFLDLSRTRMDEI 1014
F+DGDLIES LD+ + ++++
Sbjct: 237 APNGFIDGDLIESLLDMGKDSVEQV 261
>gi|428180158|gb|EKX49026.1| hypothetical protein GUITHDRAFT_68305 [Guillardia theta CCMP2712]
Length = 1202
Score = 218 bits (555), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 274/1169 (23%), Positives = 477/1169 (40%), Gaps = 172/1169 (14%)
Query: 7 YGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDNG 66
+G I ++ FR G +D+L + + + VL++ E ++ + R G
Sbjct: 58 FGLIRSMASFRLPGSNKDYLVLGADSGRISVLEFSKERNQFERVHLETYGKSGCRRIVPG 117
Query: 67 QIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL-----YGCA 121
Q DP R + + + V F+ K + LE + I F G
Sbjct: 118 QFLASDPKGRAVMISAIEKQKLVYVFNRDASSKLTISSPLEAHKASTIHFSIVGVDVGFD 177
Query: 122 KPTIVVLYQDNKDARHVKT-------------YEVALKDKDFVEGPWSQNNLDNGADLLI 168
P L D DA +T YE+ L V + +D +++LI
Sbjct: 178 NPIFAALEMDYSDADADETGQSAEEFNKVLTFYELDLGLNHVVRK--ASEPIDAASNMLI 235
Query: 169 PVPPPL---CGVLIIGEETIVYCSAN---AFKAIPIR-------PSITKAYGRV-DADGS 214
PVP GVL+ E I Y + IP R P + Y + DG
Sbjct: 236 PVPGDTDGPSGVLVCAENKIAYKKPDHEDVVALIPRRQGMPLDQPLLITGYSHLKQKDGF 295
Query: 215 RYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLI 274
+LL G L+ L +T+ E+V+ + I +A +I+ L +++ S +G+ L
Sbjct: 296 FFLLQSEIGDLYRLTLTYNDEEVSEINITYFDTVPVAQSITILKTGFLFVASEFGNHALY 355
Query: 275 KL---------NLQP---DAKGSYVEV-------------LERYVNLGPIVDFCVVDLER 309
+ ++ P + +G +E+ ++ +L PI+D V+DL
Sbjct: 356 QFLSIKGSDESDMMPVEVEIEGETIEIPHFAPRPLKNLLLVDEMESLSPILDMRVLDLAG 415
Query: 310 QGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIK-GMWSLRSSTDDPFDTFLVVS 368
+ Q+ G +LR +R+G+ + E A EL +W+++ S+ D D ++VV+
Sbjct: 416 EETPQIYALCGKGPRSTLRTLRHGLAVAEMAVSELPSNPLAVWTVKGSSKDAADKYIVVT 475
Query: 369 FISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELR 428
F + T +L++ D +EE GF + +TL + L+QV +R V S R
Sbjct: 476 FANATIVLSIG--DTVEEVTDSGFLATNKTLSVSLLGDDSLLQVHPNGLRTVRSDKR--I 531
Query: 429 NEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEY---EISCLDI 485
+++ P +N+ N QV++A ++Y E+ D + + A+ E +I+ LDI
Sbjct: 532 SQFTPPNKGVINLVAVNQQQVVVALADHTMIYFEL-DAVGQLQEKAKPEIGGGQIAALDI 590
Query: 486 NPIGENPSYSQIAAVGMWTDIS--VRIFSLPD---LNLITKEHLGGEIIPRSVLLCAFEG 540
+P+G S + AVG D S VRI SL +++++++ L + P S LC E
Sbjct: 591 SPLGAGRSRGRFLAVGASVDGSWFVRILSLDPGSFMHIVSRQALPAK--PES--LCLIE- 645
Query: 541 IS-------------YLLCALGDGHLLNFLLNMKTGELTDRKKVS-LGTQPITLRTFSSK 586
IS +L L +G L+ ++ TG+L + LGT+P+ L +
Sbjct: 646 ISIGAAQSDGGTPTLFLFAGLENGVLMRITVDPITGQLAPEFRTRFLGTKPVKLFKVLVQ 705
Query: 587 NTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTI 646
V A S R + Y+ + + ++ + + + F S P+ L I T+
Sbjct: 706 EQPAVLALSSRSWLAYNFQGRYQITPLSYETLEYASGFASDQCPEGFVCVAANTLRILTV 765
Query: 647 DDI-QKLHIRSIPLGEHPRRICHQEQSRTFAI----------------CSLK-------- 681
+ + + + S+ L PR+ + S F + C +
Sbjct: 766 ERLGEVFNQHSMKLSFTPRKSALLKDSGCFVVLETDHNSDMEKAKEARCDVAEDEEEENE 825
Query: 682 ---------NQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSD-DSNVY 731
Q E VR++D E LD E S+ +F D + +
Sbjct: 826 NKLPHSIYGEQRAGEAKWASRVRVIDPNERETKQIIELDPNEAALSVCVATFYDRKGHTF 885
Query: 732 YCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKI 791
C GTA + G + + V +L + + G +L +F G+LL + +
Sbjct: 886 LCFGTAVGHKVGSRTGSGFLHTYSVVGSQLTFVHKTPIDGVPRALCSFQGRLLVGVGSAL 945
Query: 792 QLYKWMLRDDGTRELQSECGHHG-HILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIE 850
+LY+ G R+L +C + L + + T GD I VGD+ +SIS L Y +
Sbjct: 946 RLYEM-----GKRKLLRKCENRNIPNLVVTISTMGDRIYVGDVAESISFLKYNRILNELV 1000
Query: 851 ERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR--KNSEGATDEERG----------- 897
A D + WM+A +D D GA+ N+F R N EE G
Sbjct: 1001 IFADDTHPRWMTAACPVDYDTVAGADKFGNIFLTRLPDNVSDEISEEPGAVGMFEGNDLQ 1060
Query: 898 ----RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIAS-LPHE 952
+ E + +YH+GE V + +L G +I+GT+ G IG + + E
Sbjct: 1061 GAHYKAEEIVQYHVGETVCSLQKATL-------SPGGSDAIIYGTMYGGIGALQPFVSRE 1113
Query: 953 QYLFLEKLQTNLRKVI---------KGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESF 1003
F L+ +LR +G+ G + +RS+ K+ +DGDL E+F
Sbjct: 1114 DVDFFLHLEMHLRGAAGAREHKPAGEGICGRDQLSFRSY-----YFPVKDVVDGDLCETF 1168
Query: 1004 LDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
LS +R +I++ ++ + E+ K++E++
Sbjct: 1169 NYLSPSRQKQIAEDLDRTPGEVAKKLEDM 1197
>gi|342885857|gb|EGU85809.1| hypothetical protein FOXB_03657 [Fusarium oxysporum Fo5176]
Length = 1189
Score = 218 bits (555), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 277/1163 (23%), Positives = 481/1163 (41%), Gaps = 169/1163 (14%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+L ++G I +L FR G +D+L IA++ + ++++ + +
Sbjct: 59 LLSHDVFGIIRSLAAFRLAGSNKDYLIIASDSGRITIIEYLPAQNRFHRLHLETFGKSGV 118
Query: 61 RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL- 117
R G+ DP R LI + L V+ +++ +L + + + VL I +
Sbjct: 119 RRVIPGEYLACDPKGRACLIASTEKNKLVYVLNRNSQAELTISSPLEAHKPGVLVISMVA 178
Query: 118 --YGCAKPTIVVLYQDNKDARHVKT-------------YEVALKDKDFVEGPWSQNNLDN 162
G + P L D + T YE+ L V WS + +D
Sbjct: 179 LDVGYSNPVFAALEIDYSEIDQDSTGQAMEELDTQLVYYELDLGLNHVVRK-WS-DPVDP 236
Query: 163 GADLLIPVP-----PPLCGVLIIGEETIVYCSAN--AFK-AIPIRPSITK--AYGRVDAD 212
A +L VP P GVL+ GEE I Y +N AF+ AIP R T+ R
Sbjct: 237 TASILFQVPGGNDGP--SGVLVCGEENITYRHSNQEAFRVAIPRRRGATEDPNRKRTIVS 294
Query: 213 GSRYLLGDHAGLLHLLVITHEKE------------------KVTGLKIELLGETSIASTI 254
G + L AG L+ T + + +V LKI+ +AS++
Sbjct: 295 GIMHKLKGSAGAFFFLLQTDDGDLFKLSIDMIEDEEGNPTGEVKRLKIKYFDTVPVASSL 354
Query: 255 SYLDNAVVYIGSSYGDSQLIKL---------------NLQPDAKGSY------------V 287
L + +Y+ S +G+ + + D + SY +
Sbjct: 355 CILKSGFLYVASQFGNYSFYQFEKLGDDDEELEFYSDDFPADPRASYEPVYFHPRPTENL 414
Query: 288 EVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI 347
++E + P++D V +L + Q+ T G S R++++G+ +NE + EL GI
Sbjct: 415 ALVESIPAMNPLLDCKVANLTGEDAPQIYTICGNGPRSSFRMLKHGLEVNEIVASELPGI 474
Query: 348 -KGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIY 406
+W+L+ + + +D ++V+SF + T +L++ + +EE GF + TL
Sbjct: 475 PSAVWTLKLNRSEQYDAYIVLSFTNGTLVLSIG--ETVEEVSDSGFLTSVPTLAAQLLGD 532
Query: 407 NQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GD 465
+ L+QV +R V + NEW +P S+ ATANA QV +A G +VY E+ D
Sbjct: 533 DGLIQVHPKGIRHVRNGHV---NEWAAPQHRSIVAATANAHQVAVALSSGEIVYFEMDAD 589
Query: 466 GILTEVKHAQLEY-EISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEH 523
G L E + + ++CL + + E S AVG D +VRI SL P+ L K
Sbjct: 590 GSLAEYDEKKEMFGTVTCLSLGDVPEGRLRSSFLAVGC-DDCTVRILSLDPESTLENKSV 648
Query: 524 LGGEIIPRSVLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQP 577
P S+ + A + S YL L G L +L+ TGELTD ++ LG +
Sbjct: 649 QALTAAPTSLAIIAMDDSSSGGSTLYLHIGLHSGVYLRTVLDEVTGELTDTRQKFLGPKE 708
Query: 578 ITLRTFSSKNTTHVFAASDRPTVIYSS--NKKLLYSNVNLKEVSHMCPFNSAAFPDSLAI 635
+ L + + T V S RP + Y+ K + + +N ++ F+S + +
Sbjct: 709 VRLFQVTVQGKTCVLGLSSRPWLGYADPITKGFVVTPLNYVDLEWGWNFSSEQCEEGIVG 768
Query: 636 AKEGELTIGTIDDIQKLHI-RSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFV 694
+ L I ID + I +SIPL P+++ F N + E +
Sbjct: 769 IQGQSLRIFNIDRLGDTLIQKSIPLTYTPKKLVKHPDQPLFYTIEADNNTLPPELRAQLL 828
Query: 695 ----------RLLDDQTF-----------------------EFISTYPLDTFEYGCSILS 721
R+L + F + + T L+ E S
Sbjct: 829 ADPKIVNGDSRVLPPEDFGYPKGTRRWASCINVIDPLSEEGQVVQTIDLENNEAAVSAAI 888
Query: 722 CSFSD-DSNVYYCVGTAY-VLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAF 779
SFS D+ + VGT ++ ++G + ++ +DG G +L AF
Sbjct: 889 VSFSSQDNESFLVVGTGKDMVVNPRSYSEGYLHIYRFQDG-----------GENLTLLAF 937
Query: 780 NGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISL 839
G++ A+ ++++Y +R R+ Q+E I++L T+G I+VGD+ + ++
Sbjct: 938 QGRVAVAVGTQLRIYDLGMR-QMLRKSQAEVAAQ-QIVSL--NTQGSRIIVGDVQQGVTY 993
Query: 840 LIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDEER 896
++YK + D A W + ++D + G + N+F VR K SE A DEE+
Sbjct: 994 VVYKPASNKLIPFVDDTIARWTTCTTMVDYESVAGGDKFGNMFIVRCPEKASEEA-DEEQ 1052
Query: 897 GRLEVVGEYHLGEFVNRFRHGSLVMRLPDS------DVGQIPTVIFGTVNGVIGV-IASL 949
L ++ + HG+ +P S VG +++ + G IGV I +
Sbjct: 1053 TGLHLIN-------AREYLHGTPHRDIPTSITKTSLVVGGQEILLWSGIMGTIGVFIPFI 1105
Query: 950 PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRT 1009
E F + L+ +LR + G +H +R + K +DGDL E + L
Sbjct: 1106 SREDADFFQNLEQHLRTEDPPLAGRDHLMYRGY-----YAPVKGVIDGDLCERYNLLPND 1160
Query: 1010 RMDEISKTMNVSVEELCKRVEEL 1032
+ I+ ++ SV E+ +++ ++
Sbjct: 1161 KKLMIAGELDRSVREIERKISDI 1183
>gi|400597418|gb|EJP65151.1| CPSF A subunit region [Beauveria bassiana ARSEF 2860]
Length = 1212
Score = 218 bits (554), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 279/1170 (23%), Positives = 496/1170 (42%), Gaps = 170/1170 (14%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
I+G + ++ +FR G ++D++ +AT+ + VL++ S +R + + G R
Sbjct: 64 IFGVLRSIAVFRLAGSSKDYIILATDSGRITVLEY-LPSPNRFSRLHMETFGKTGIRRVV 122
Query: 65 NGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YG 119
G+ DP R LI + L V+ +++ +L + + + VL I G
Sbjct: 123 PGEYLACDPKGRACLISAVEKNKLVYVLNRNSQAELTISSPLEAHKPGVLVIALTALDVG 182
Query: 120 CAKPTIVVLY-------QDNKDA------RHVKTYEVALKDKDFVEGPWSQNNLDNGADL 166
A P L QDN H+ YE+ L V WS + +D A L
Sbjct: 183 YANPVFAALEIDYTEVDQDNTGEALSEVETHLVYYELDLGLNHVVRK-WS-DPVDPTASL 240
Query: 167 LIPVP-----PPLCGVLIIGEETIVYCSAN--AFKA-IPIRPSITK--AYGRVDADGSRY 216
L VP P GVL+ GEE + Y +N A + IP R T+ + R G +
Sbjct: 241 LFQVPGGNDGP--SGVLVCGEENVTYRHSNQDALRVPIPRRRGATEDPSRKRNIVAGVMH 298
Query: 217 LLGDHAGLLHLLVITHEKE------------------KVTGLKIELLGETSIASTISYLD 258
L AG L+ T + + +V +KI+ +A+++ L
Sbjct: 299 KLKGSAGAFFFLLQTDDGDLFKITIDMVEDEEGAPTGEVQRMKIKYFDTVPVATSLCILK 358
Query: 259 NAVVYIGSSYGDS---QLIKL------------NLQPDAKGSY------------VEVLE 291
+ +Y+ S +G+ Q KL + D +Y + +++
Sbjct: 359 SGFLYVASQFGNYAFYQFEKLGDDDDELEFSSDDFPVDPLAAYEPVYFYPRPAENLALVD 418
Query: 292 RYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGM 350
+ P++D V +L + Q+ + G + R +++G+ +NE + EL G+ +
Sbjct: 419 SIPAMNPLLDCKVANLTGEDAPQIYSICGNGARSTFRTIKHGLEVNEIVASELPGVPSAV 478
Query: 351 WSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLV 410
W+L+ ++D+ +DT++V+SF + T +L++ + +EE GF + T+ + L+
Sbjct: 479 WTLKLNSDEQYDTYIVLSFTNGTLVLSIG--ETVEEVSDSGFLTSVPTIAAQLLGTDGLI 536
Query: 411 QVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILT 469
QV +R + + + NEW +P S+ A+ NA QV +A G +VY E+ DG L
Sbjct: 537 QVHPRGIRHIRNGNV---NEWSAPQHRSIVAASTNAHQVAIALSSGEIVYFEMDSDGSLA 593
Query: 470 EVKHAQLEY-EISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGE 527
E + + ++ L + + E S AVG D +VRI SL P+ L K
Sbjct: 594 EYDEKKEMFGTVTALSLGEVPEGRVRSSFLAVGC-DDATVRILSLDPESTLENKSVQALT 652
Query: 528 IIPRSVLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLR 581
P S+ + + S YL L G L +L+ TG+L+D ++ LG +P+ L
Sbjct: 653 AAPTSLAIIPMDDSSSGGSTLYLHIGLHSGVYLRTVLDEITGDLSDTRQKFLGPKPVRLF 712
Query: 582 TFSSKNTTHVFAASDRPTVIYSS--NKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEG 639
S T V S RP + Y+ + +N ++ F+S D + +
Sbjct: 713 QVSVGGRTCVLGLSSRPWLGYADPITNSFEITPLNYVDLEWGWNFSSEQCEDGVVGIQGQ 772
Query: 640 ELTIGTIDDIQKLHIR-SIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFV---- 694
L I +ID + + I+ SIPL P++ F N + + + +
Sbjct: 773 SLRIFSIDRLGETLIQSSIPLTYTPKKFVKHPSEPLFYTIEADNHTLPPDLQAKLLADPA 832
Query: 695 ------RLLDDQTF----------------EFISTYP-------LDTFEYGCSILSCSF- 724
++L + F + +S P + E S SF
Sbjct: 833 AVNGDAKVLPPEEFGHPRGNRRWASCISVVDPVSEEPSVLQKVDFENNEAAVSAAVVSFA 892
Query: 725 SDDSNVYYCVGTAY-VLPEENEPTKGRILVFIVEDG--KLQLIAEKETKGAVYSLNAFNG 781
S D+ + VGT ++ ++ I ++ ++G +L+ I + + + +L AF G
Sbjct: 893 SQDNESFLVVGTGKDMILNPRSSSEAYIYIYRFQEGGRELEFIHKTKIEEPAMALLAFQG 952
Query: 782 KLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLI 841
KLLA I + +++Y +R R+ Q+E I++L T+G IVVGD+ + ++L++
Sbjct: 953 KLLAGIGKTLRMYDLGMR-QLLRKAQAEVVPQ-QIVSL--NTQGSRIVVGDVQQGVTLVV 1008
Query: 842 YKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDEERG- 897
YK + A D A W + ++D + G + N+F VR K SE A +E+ G
Sbjct: 1009 YKPASNKLIPFADDTIARWTTCTTMVDYESVAGGDKFGNMFIVRSPAKASEEADEEQAGL 1068
Query: 898 --------------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVI 943
RLE++ + + SLV VG +++ + G I
Sbjct: 1069 HLVNARDYLHGAQHRLELMCHFFTQDVPTSINKTSLV-------VGGQDVLLWSGIMGTI 1121
Query: 944 GV-IASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIES 1002
GV I + E F + L+ +LR + G +H +RS+ K V +DGDL E
Sbjct: 1122 GVFIPFVSREDADFFQSLEQHLRTEDAPLAGRDHLMYRSYYAPVKGV-----IDGDLCER 1176
Query: 1003 FLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
F L + ++ ++ SV E+ +++ ++
Sbjct: 1177 FAALPNDKKQMMAGELDRSVREIERKISDI 1206
>gi|223647430|gb|ACN10473.1| Splicing factor 3B subunit 3 [Salmo salar]
Length = 1217
Score = 217 bits (553), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 268/1190 (22%), Positives = 492/1190 (41%), Gaps = 189/1190 (15%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+L + ++G I +L FR G +D++ + ++ + +L++ S + + + + G
Sbjct: 53 LLTMEVFGIIRSLMAFRLTGGTKDYIVVGSDSGRIVILEYHP-SKNMFEKIHQETFGKSG 111
Query: 61 -RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
R GQ +DP R +IG L ++ D +L + + + L +
Sbjct: 112 CRRIVPGQFLAVDPKGRAVMIGAIEKQKLVYILNRDAAARLTISSPLEAHKANTLVYHVV 171
Query: 118 ---YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNG 163
G P L D ++A + T E A + F E N+ L+
Sbjct: 172 GVDVGFENPMFACLEMDYEEADNDPTGEAAANTQQTLTFYELDLGLNHVVRKYSEALEEH 231
Query: 164 ADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR--- 215
+ LI VP P GVLI E I Y N IR I + +D D R
Sbjct: 232 GNFLITVPGGSDGP--SGVLICSENYITY--KNFGDQPDIRCPIPRRRNDLD-DPERGMI 286
Query: 216 --------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAV 261
+L G + + + ++E VT ++++ +A+++ L
Sbjct: 287 FVCSATHKTKSMFFFLAQTEQGDIFKVTLETDEEMVTEIRMKYFDTIPVATSMCVLKTGF 346
Query: 262 VYIGSSYGDSQLIKLN------------------------LQPDAKGSYVEVLERYVNLG 297
+++ S +G+ L ++ QP + V V E+ NL
Sbjct: 347 LFVSSEFGNHYLYQIAHLGDDDEEPEFSSAMPLEEGDTFFFQPRPLKNLVLVDEQE-NLS 405
Query: 298 PIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSS 356
PI+ + DL + Q+ G +LR++R+G+ ++E A EL G +W++R
Sbjct: 406 PIMSCQIADLANEDTPQLYVACGRGPKSTLRVLRHGLEVSEMAVSELPGNPNAVWTVRRH 465
Query: 357 TDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGS 416
+D FD +++VSF++ T +L++ + +EE GF T TL C + LVQV
Sbjct: 466 IEDEFDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSLLGEDALVQVYPDG 523
Query: 417 VRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKHA 474
+R + + R NEWK+P ++ N QV++A GG LVY E+ G L E +
Sbjct: 524 IRHIRADKR--VNEWKTPGKKTIVRCAVNQRQVVIALTGGELVYFEMDPSGQLNEYTERK 581
Query: 475 QLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPD---LNLITKEHLGGEIIPR 531
++ ++ C+ + + S+ AVG+ D +VRI SL L ++ + L + P
Sbjct: 582 EMSADVVCMSLANVPPGEQRSRFLAVGL-VDNTVRIISLDPSDCLQPLSMQALPAQ--PE 638
Query: 532 SVLLCAFEGIS--------------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQP 577
S+ + G+ YL L +G LL +L+ TG+L+D + LG++P
Sbjct: 639 SLCIVEMGGVEKEDELGVKGTVGFLYLNIGLQNGVLLRTVLDPVTGDLSDTRTRYLGSRP 698
Query: 578 ITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAK 637
+ L + V A S R + YS + + ++ + + + F S P+ +
Sbjct: 699 VKLFRVRMQGQEAVLAMSSRSWLSYSYQSRFHLTPLSYETLEYASGFASEQCPEGIVAIS 758
Query: 638 EGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQE-----------QSRTFAICSLKNQS 684
L I ++ + + + + PL PR+ + H E + T A + + Q
Sbjct: 759 TNTLRILALEKLGAVFNQVAFPLQYTPRKFVIHPETNNLILIETDHNAYTEATKAQRKQQ 818
Query: 685 CAEE-------------SEM-------------------------HFVRLLDDQTFEFIS 706
AEE +EM VRL++ +
Sbjct: 819 MAEEMVEAAGEDERELAAEMAAAFLNENLPEAIFGAPKAGSGQWASLVRLVNPIQGTTLD 878
Query: 707 TYPLDTFEYGCSILSCSFSDDSNVYYC-VGTAY-VLPEENEPTKGRILVFIVEDG--KLQ 762
L+ E S+ C F+ +Y VG A ++ G I + + G KL+
Sbjct: 879 LVQLEQNEAAFSVTVCRFASGGEDWYVLVGVARDMILNPRSVGGGFIYTYRLGGGGDKLE 938
Query: 763 LIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSEC-GHHGHILALYV 821
+ + + ++ F G++L + + +++Y D G ++L +C H L +
Sbjct: 939 FMHKTPVEDVPLAIAPFQGRVLVGVGKLLRIY-----DLGKKKLLRKCENKHVPNLVTGI 993
Query: 822 QTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNL 881
T G ++V D+ +S+ + YK E + A D + W++ +LD D A+ N+
Sbjct: 994 HTIGQRVIVTDVQESLFWVRYKRNENQLIIFADDTHPRWVTTACLLDYDTMASADKFGNI 1053
Query: 882 FTVR--KNSEGATDE---------ERGRL-------EVVGEYHLGEFVNRFRHGSLVMRL 923
VR N+ DE +RG L EV+ YH+GE V + +L+
Sbjct: 1054 SIVRLPPNTSDDVDEDPTGNKALWDRGLLNGASQKAEVIMNYHIGETVLSLQKTTLI--- 1110
Query: 924 PDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSF 982
G ++++ T++G IG++ HE + F + L+ ++R + G +H +RS+
Sbjct: 1111 ----PGGSESLVYTTLSGGIGILVPFTSHEDHDFFQHLEMHMRSEFPPLCGRDHLSFRSY 1166
Query: 983 NNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
KN +DGDL E F + + +++ ++ + E+ K++E++
Sbjct: 1167 -----YFPVKNVIDGDLCEQFNSMDPHKQKSVAEELDRTPPEVSKKLEDI 1211
>gi|443694993|gb|ELT96001.1| hypothetical protein CAPTEDRAFT_155561 [Capitella teleta]
Length = 1215
Score = 217 bits (553), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 267/1187 (22%), Positives = 502/1187 (42%), Gaps = 187/1187 (15%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+L ++G I +L FR G ++D++ + ++ + +L++ A S + + + + G
Sbjct: 53 LLTSEVFGVIRSLMPFRLTGGSKDYIILGSDSGRIVILEYVA-SKNIFEKIQQETYGKSG 111
Query: 61 -RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
R GQ IDP R ++G L ++ D + +L + + + L +
Sbjct: 112 CRRIVPGQQLAIDPRGRAVMVGAVEKQKLVYILNRDAQARLTISSPLEAHKANTLGYYMV 171
Query: 118 ---YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNG 163
G PT L D ++A T E A K + + E N+ L+
Sbjct: 172 GVDVGFENPTFACLEVDYEEADSDPTGESASKLQQMLTYYELDLGLNHVVRKYSEPLEEH 231
Query: 164 ADLLIPVP---PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR----- 215
A++LI VP GVLI E I Y N IR I + +D D R
Sbjct: 232 ANMLISVPGGSEGPSGVLICSENYITY--KNFGDQPDIRCPIPRRRNDLD-DPERGMIFV 288
Query: 216 ------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVY 263
+L G + + + +++ VT ++++ +AS++ L ++
Sbjct: 289 CNATHKTKSMFFFLTQTEQGDVFKITLETDEDMVTEVRLKYFDTVPVASSMCVLKTGFLF 348
Query: 264 IGSSYGDSQLIKL-------------NLQPDAKGSYVEVLERYV----------NLGPIV 300
I S +G+ L ++ + P +G R + +L PI+
Sbjct: 349 IASEFGNHYLYQIAHLGDDDDEPEFSSAMPLEEGDTFFFAPRPLKNLVMVDEMDSLSPIM 408
Query: 301 DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTDD 359
+ DL + Q+ G +LR++R+G+ ++E A EL G +W+++ + +D
Sbjct: 409 HCQIADLANEDTPQLFAMCGRGPRSTLRVLRHGLEVSEMAVSELPGNPNAVWTVKRNIED 468
Query: 360 PFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRL 419
FD +++VSF++ T +L++ + +EE GF T TL C + LVQ+ +R
Sbjct: 469 EFDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSQLGDDALVQIYPDGIRH 526
Query: 420 VSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD-GILTE-VKHAQLE 477
+ S R NEWK+P ++ N QV++A GG LVY E+ G L E + +L
Sbjct: 527 IRSDKR--VNEWKTPGKKTIVRCAVNQRQVVIALSGGELVYFEMDPTGQLNEYTERKELS 584
Query: 478 YEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSV-LLC 536
++ C+ + + + AVG+ D +VRI SL + ++ L + +P S LC
Sbjct: 585 SDVVCMGLGRVPSGEQRCRFLAVGL-NDNTVRIISLDPTDCLSP--LSMQALPASPESLC 641
Query: 537 ------AFEGIS--------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRT 582
A EG +L L +G LL +L+ TG+L+D + LG++P+ L
Sbjct: 642 IVEMGGAVEGQEEAHINHGLFLNIGLQNGVLLRAVLDNVTGDLSDTRTRYLGSRPVKLFR 701
Query: 583 FSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELT 642
S + + V A S R + Y+ + + ++ + + F S P+ + L
Sbjct: 702 ISMQGSDSVLAMSSRSWLSYTYQNRFHLTPLSYDTLEYASGFASEQCPEGIVAISTNTLR 761
Query: 643 IGTIDDIQKLHIR-SIPLGEHPRR-ICHQEQSRTFAICSLKN-----------QSCAEE- 688
I ++ + + + SIPL PR+ + H E I S N Q AEE
Sbjct: 762 ILALEKLGAIFNQVSIPLQYTPRKFVIHAETGNVIVIESDHNAYTEDTKQQRKQQMAEEM 821
Query: 689 ------------SEMHF-------------------------VRLLDDQTFEFISTYPLD 711
+EM +++++ + + PL+
Sbjct: 822 VEAAGEEEQQLAAEMAAAFLQENLPEDVFGAPKAGNGMWASQIQIMNPINGKTLCKLPLE 881
Query: 712 TFEYGCSILSCSFSDDSN-VYYCVGTAY-VLPEENEPTKGRILVFIVEDG--KLQLIAEK 767
E S+ F++ + + VGT ++ + G IL F + L+L+
Sbjct: 882 QNEAAFSVSIVRFANHGDEAFVLVGTVKDLVLSPRSCSCGYILCFRFTNNYEGLELLHRT 941
Query: 768 ETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHH---GHILALYVQTR 824
++ AF G++L + + +++Y D G +++ +C + + A++V
Sbjct: 942 PVDDVPAAIAAFQGRVLIGVGRHLRIY-----DLGKKKMLRKCENKFLPNFVTAIHVN-- 994
Query: 825 GDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTV 884
G+ I+V D+ S + YK E + A D W++ +LD + GA+ N+ V
Sbjct: 995 GNRIMVSDIQDSFHFVRYKRHENQLIVFADDTCPRWLTCSCMLDYNTMAGADKFGNICVV 1054
Query: 885 RKNSEGATDE------------ERGRL-------EVVGEYHLGEFVNRFRHGSLVMRLPD 925
R S ATD+ +RG L + + +H+GE + + +L+
Sbjct: 1055 RLPS-NATDDVDEDPTGNKALWDRGSLNGASQKADSIANFHVGEMITSLQKATLI----- 1108
Query: 926 SDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNN 984
G ++++ T++G +G++ HE + F + L+ ++R ++ + G +H +RS+
Sbjct: 1109 --PGGSESLVYTTLSGSLGMLVPFSSHEDHDFFQHLEMHMRSEVQSLCGRDHLAYRSY-- 1164
Query: 985 EKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEE 1031
+N +DGDL E F + +R +IS+ ++ + E+ K++E+
Sbjct: 1165 ---YFPVRNVIDGDLCEMFTSMDVSRQKQISEELDRTPAEVSKKLED 1208
>gi|402222132|gb|EJU02199.1| hypothetical protein DACRYDRAFT_21931 [Dacryopinax sp. DJM-731 SS1]
Length = 1209
Score = 217 bits (552), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 271/1172 (23%), Positives = 482/1172 (41%), Gaps = 172/1172 (14%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
++G I +L FR G A+D+ + ++ + +L++D +++ T+ + + G R
Sbjct: 58 VFGTIRSLVAFRLTGGAKDYAIVGSDSGRLVILEYDPQANAF-TKLHQETYGKSGVRRIV 116
Query: 65 NGQIGIIDPDCRLIGLHLYDG--LFKVIPFDNKGQLKEAFNIR-------LEELQVLDIK 115
GQ +DP R + + + L ++ D L + + + + +D+
Sbjct: 117 PGQYLAVDPKGRSVMIAALEKAKLVYILNRDAAANLTISSPLEAHKNHGIIHSIVGVDVG 176
Query: 116 F---LYGCAKPTIVVLYQDNKDARHVKT------YEVALKDKDFVEGPWSQNNLDNGADL 166
F L+ + QD+ + KT YE+ L V WS+ D A+L
Sbjct: 177 FENPLFAALEVDYTESDQDSTGEAYEKTEKMLTYYELDLGLNHVVRK-WSEPT-DKRANL 234
Query: 167 LIPVP------------PPLCGVLIIGEETIVYCSANAFK-AIPI-----------RPSI 202
L+ VP P GVLI E I++ + + +PI R SI
Sbjct: 235 LLQVPGGQTSSEARHDGP--SGVLICCEGYIIWRHMDTVQHRVPIPKRRHPLDERTRDSI 292
Query: 203 TKAYGRVDADGSRY-LLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAV 261
A G+ + LL G L + I HE E+V +KI+ +AS++ L +
Sbjct: 293 IVAAVMHKMKGAFFFLLQSEDGDLFKVTIDHEDEEVKTMKIKYFDTVPVASSLCILKSGF 352
Query: 262 VYIGSSYGDSQLIKLNLQPD-------AKGSYVE---------------VLERYV----- 294
+++ S +G+ L + D + SY + LE V
Sbjct: 353 LFVASEFGNHYLYQFQKLGDDDDEIEYSSVSYPDNGMADPIPQAYFRPRPLENLVLADEL 412
Query: 295 -NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWS 352
+ PIVD V +L Q+ G S R++R+G+ + E S EL GI +W+
Sbjct: 413 NSFDPIVDAKVTNLLNTDTPQIFAACGRGARSSFRMLRHGLDVEETVSSELPGIPNAVWT 472
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
++ DD +D ++++SF++ T +L++ + +EE GF S + T+ + L+QV
Sbjct: 473 VKLKADDQYDAYIILSFVNGTLVLSIG--ETIEEVSDTGFLSSSPTIAVQQIGEDSLLQV 530
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTEV 471
+R V S R NEW+ P ++ AT N+ QV +A LVY E+ +G L E
Sbjct: 531 YPHGIRHVLSDRRV--NEWRCPQHTTIVAATTNSRQVAIALSSAQLVYFELDLEGQLNEY 588
Query: 472 K-HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEII 529
+ L + + I + E + AVG D +VRI SL PD L
Sbjct: 589 QDRKSLGSGVLAMSIAEVPEGRQRTPYLAVGC-EDQTVRIISLDPDTTLENISLQALTAP 647
Query: 530 PRSVLLCAFEGIS--------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLR 581
P S+ + S ++ L +G LL +L+ G+LTD + LG++P+ L
Sbjct: 648 PSSICVAEIMDASIDKNQPTMFVNIGLQNGVLLRTVLDPVNGQLTDTRTRFLGSRPVRLI 707
Query: 582 TFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGEL 641
+ + A S R + Y+ L ++ + + + F++ PD L L
Sbjct: 708 RVNVHGLPSILALSSRSWLNYTYQNLLHFTPLIFDPLEYAWSFSAELCPDGLIGISGNVL 767
Query: 642 TIGTIDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESE---------- 690
I + ++ QKL IPL PR++ R F + ++ + E+
Sbjct: 768 RIFQVPNLGQKLKQDVIPLSYTPRKMLQHPTERLFYVIESDHRVLSPEAADKKLQKLKST 827
Query: 691 --------------------------MHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSF 724
++++D + PLD E S+ +F
Sbjct: 828 GQRLDQEVIDLPADIFGRPRADAGTWASCIQIIDPANVRSVLEVPLDNNEAAFSLAITTF 887
Query: 725 -SDDSNVYYCVGTAY-VLPEENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVYSLNAFN 780
+ ++ VGTA V+ G + + I EDG+ L+ + + E +L +F
Sbjct: 888 IARPGELFLVVGTAQDVIVSPKSCKSGFLRTYKISEDGRSLEFLHKTEVDDVPLALLSFQ 947
Query: 781 GKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY-VQTRGDFIVVGDLMKSISL 839
G+L+A I + ++++ D G + L +C + A+ + T+G I+VGD+ +SI
Sbjct: 948 GRLVAGIGKALRIF-----DMGKKRLLRKCENKSFATAIVTLSTQGSRIIVGDMAESIYF 1002
Query: 840 LIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDEER 896
YK E + A D W++A ++D D + N+F R K E ++
Sbjct: 1003 ATYKPPENRLLIFADDSQPRWITASAMVDYDTVCAGDKFGNVFVNRLPPKVGEQVDEDPT 1062
Query: 897 G---------------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNG 941
G + ++ Y++G+ + +LV G V++ ++G
Sbjct: 1063 GAGVLHEKGLFMGAPHKTNMLAHYYVGDIITSMHKVALV-------TGGRDIVLYTGLHG 1115
Query: 942 VIGV-IASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLI 1000
IGV I + E F+ L+ ++R + G +H +R + V K +DGDL
Sbjct: 1116 TIGVLIPFISKEDVDFIRTLEQHMRTEAPSLVGRDHLTYRGY-----YVPVKGVVDGDLC 1170
Query: 1001 ESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
E F L + I+ ++ + E+ K++E+L
Sbjct: 1171 ELFSLLPTQKQQSIAGELDRTYSEVLKKLEQL 1202
>gi|358378986|gb|EHK16667.1| hypothetical protein TRIVIDRAFT_40938 [Trichoderma virens Gv29-8]
Length = 1212
Score = 215 bits (548), Expect = 9e-53, Method: Compositional matrix adjust.
Identities = 280/1174 (23%), Positives = 484/1174 (41%), Gaps = 168/1174 (14%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+L I+G I +L FR G +D+L +AT+ + ++++ + + +
Sbjct: 59 LLSHDIFGIIRSLAAFRLAGSNKDYLILATDSGRITIIEYLPKENRFTKLHLETFGKSGV 118
Query: 61 RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL- 117
R G+ DP R LI + L V+ +++ +L + + + VL I +
Sbjct: 119 RRVIPGEYLACDPKGRACLIASIEKNKLVYVLNRNSQAELTISSPLEAHKPGVLVISMVA 178
Query: 118 --YGCAKPTIVVLYQDNKDA---------RHVKT----YEVALKDKDFVEGPWSQNNLDN 162
G A P L D +A R ++T YE+ L V WS+ +D+
Sbjct: 179 LDVGYANPVFAALEMDYTEADQDATGEAMRELETQLVYYELDLGLNHVVRK-WSEP-VDS 236
Query: 163 GADLLIPVP-----PPLCGVLIIGEETIVYCSAN--AFKA-IPIRPSITK--AYGRVDAD 212
A LL VP P GVL+ GEE I Y +N AF+ IP R T+ + R
Sbjct: 237 TASLLFQVPGGNDGP--SGVLVCGEENITYRHSNQEAFRVPIPRRRGATEDPSRKRTIIS 294
Query: 213 GSRYLLGDHAGLLHLLVITHEKE------------------KVTGLKIELLGETSIASTI 254
G + L AG L+ T + + +V LKI+ A+++
Sbjct: 295 GVMHKLKGSAGAFFFLLQTEDGDMFKVTIDMVEDDEGNTTGEVRRLKIKYFDTVPAAASL 354
Query: 255 SYLDNAVVYIGSSYGD---SQLIKL------------NLQPDAKGSYVEV---------- 289
L + +Y+ S +G+ Q KL + DA+ SY V
Sbjct: 355 CILKSGFLYVASQFGNFSFYQFEKLGDDDEELEFTSDDFPVDAQASYNPVYFYPRPLENL 414
Query: 290 --LERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI 347
+E ++ P++D V +L + Q+ T G + R +++G+ INE S EL GI
Sbjct: 415 VLVESIPSMNPLLDCKVANLTNEDAPQIYTICGNGARSTFRTLKHGLEINEIVSSELPGI 474
Query: 348 -KGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIY 406
+W+L+ + + +D ++V+SF + T +L++ + +EE GF + TL
Sbjct: 475 PSAVWTLKLNRTEQYDAYIVLSFTNGTLVLSIG--ETVEEVSDSGFLTSVPTLAAQLLGD 532
Query: 407 NQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GD 465
+ L+QV +R + + N+W +P S+ A+ NA QV +A G +VY E+ D
Sbjct: 533 DGLIQVHPKGIRHIRNGQV---NQWDAPQHRSIVAASTNAHQVAIALSSGEIVYFEMDDD 589
Query: 466 GILTEVKHAQLEY-EISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEH 523
G L E + + ++CL + + E S AVG D +VRI SL P+ L K
Sbjct: 590 GSLAEYDEKKEMFGTVTCLSLGEVPEGRLRSSFLAVGC-DDCTVRILSLDPESTLENKSV 648
Query: 524 LGGEIIPRSVLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQP 577
P S+ + + E S YL L G L +L+ TGELTD ++ LG +
Sbjct: 649 QALTAAPTSLAIISMEDSSSGGSTLYLHIGLYSGVYLRTVLDEVTGELTDTRQKFLGPKQ 708
Query: 578 ITLRTFSSKNTTHVFAASDRPTVIYSS--NKKLLYSNVNLKEVSHMCPFNSAAFPDSLAI 635
+ L + + T V S RP + Y+ K + + +N ++ F S + +
Sbjct: 709 VRLFQVTVQGKTCVLGLSSRPWLGYADPITKTFVVTPLNYVDLEWGWNFTSEQCEEGIVG 768
Query: 636 AKEGELTIGTIDDIQKLHIR-SIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFV 694
+ L I ++D + I+ SIPL P+++ F + N + A E +
Sbjct: 769 IQGQTLRIFSVDRLGDTLIQSSIPLTYTPKKMVKHPDHPLFYVIEADNHTLAPELCAKLL 828
Query: 695 ----------RLLDDQTF-----------------------EFISTYPLDTFEYGCSILS 721
++L + F + + L+ E S+
Sbjct: 829 ADPARVNGDTKILPAEEFGHPRGNRRWASCISVVDPLAEDGQVLQRIDLEENEAAVSVAI 888
Query: 722 CSFSDDSN-VYYCVGTA---YVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLN 777
+F+ N + VGT V P I F + L I + + + ++
Sbjct: 889 VTFASQENETFLVVGTGKDMVVNPRSFSDAFVHIYRFERDGRGLVFIHKTKVEEPPMAMI 948
Query: 778 AFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSI 837
F G++L I + +++Y +R R+ Q+E I++L T+G IVVGD+ + I
Sbjct: 949 PFQGRVLVGIGKTLRIYDLGMR-QLLRKAQAEVAPQ-QIISL--STQGSRIVVGDVQQGI 1004
Query: 838 SLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDE 894
+ +YK + D A W + ++D + G + N+F VR K SE A +E
Sbjct: 1005 TYAVYKQSTNKLIPFVDDTVARWTTCTTMVDYETVAGGDKFGNIFIVRSPQKASEEADEE 1064
Query: 895 ERG---------------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTV 939
+ G RL+++ + SLV VG +++ +
Sbjct: 1065 QAGLHLLNARDYLHGTSHRLDLMCHLFTQDIPTSIAKTSLV-------VGGQDVLLWSGL 1117
Query: 940 NGVIGV-IASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGD 998
G IGV I + E F + L+ ++R + G +H +RS+ K +DGD
Sbjct: 1118 MGTIGVLIPFITREDTDFFQSLEQHMRAEDPPLAGRDHLMYRSY-----YAPMKGIIDGD 1172
Query: 999 LIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
L E + L + I+ ++ SV E+ +++ ++
Sbjct: 1173 LCERYALLPNDKKQMIAGELDRSVREIERKISDI 1206
>gi|348572800|ref|XP_003472180.1| PREDICTED: LOW QUALITY PROTEIN: splicing factor 3B subunit 3-like
[Cavia porcellus]
Length = 1215
Score = 215 bits (548), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 272/1194 (22%), Positives = 493/1194 (41%), Gaps = 199/1194 (16%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+L V ++G I +L FR G +D++ + ++ + +L++ S + + + + G
Sbjct: 53 LLTVEVFGVIRSLMAFRLTGGTKDYIVVGSDSGRIVILEYQP-SKNMFEKIHQETFGKSG 111
Query: 61 -RPTDNGQIGIIDPDCRLIGLHLYDG--LFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
R GQ +DP R + + + L ++ D +L + + + L +
Sbjct: 112 CRRIVPGQFLAVDPKGRAVMISAIEKQKLVYILNRDAAARLTISSPLEAHKANTLVYHVV 171
Query: 118 ---YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNG 163
G P L D ++A + T E A + F E N+ L+
Sbjct: 172 GVDVGFENPMFACLEMDYEEADNDPTGEAAANTQQTLTFYELDLGLNHVVRKYSEPLEEH 231
Query: 164 ADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR--- 215
+ LI VP P GVLI E I Y N IR I + +D D R
Sbjct: 232 GNFLITVPGGSDGP--SGVLICSENYITY--KNFGDQPDIRCPIPRRRNDLD-DPERGMI 286
Query: 216 --------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAV 261
+L G + + + +++ VT ++++ +A+ + L
Sbjct: 287 FVCSATHKTKSMFFFLAQTEQGDIFKITLETDEDMVTEIRLKYFDTVPVAAAMCVLKTGF 346
Query: 262 VYIGSSYGDSQLIKLN------------------------LQPDAKGSYVEVLERYVNLG 297
+++ S +G+ L ++ QP + V +++ +L
Sbjct: 347 LFVASEFGNHYLYQIAHLGDDDEEPEFSSAMPLEEGDTFFFQPRPLKNLV-LVDELDSLS 405
Query: 298 PIVDFC-VVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRS 355
PI+ FC + DL + Q+ G SLR++R+G+ ++E A EL G +W++R
Sbjct: 406 PIL-FCQIADLANEDTPQLYVACGRGPRSSLRVLRHGLEVSEMAVSELPGNPNAVWTVRR 464
Query: 356 STDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSG 415
+D FD +++VSF++ T +L++ + +EE GF T TL C + LVQV
Sbjct: 465 HIEDEFDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSLLGDDALVQVYPD 522
Query: 416 SVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKH 473
+R + + R NEWK+P ++ N QV++A GG LVY E+ G L E +
Sbjct: 523 GIRHIRADKR--VNEWKTPGKKTIVKCAVNQRQVVIALTGGELVYFEMDPSGQLNEYTER 580
Query: 474 AQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPD--LNLITKEHLGGEIIPR 531
++ ++ C+ + + S+ G+WT RI S P L ++ + L + P
Sbjct: 581 KEMSADVVCMSLANVSPGEQRSRFPGCGLWT--YCRIISDPSDCLQPLSMQALPAQ--PE 636
Query: 532 SVLLCAFE----------------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGT 575
S LC E G YL L +G LL +L+ TG+L+D + LG+
Sbjct: 637 S--LCIVEMGGTEKQDELGERGSIGFLYLNIGLQNGVLLRTVLDPVTGDLSDTRTRYLGS 694
Query: 576 QPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAI 635
+P+ L + V A S R + YS + + ++ + + F S P+ +
Sbjct: 695 RPVKLFRVRMQGQEAVLAMSSRSWLSYSYQSRFHLTPLSYETLEFASGFASEQCPEGIVA 754
Query: 636 AKEGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQEQSR-----------TFAICSLKN 682
L I ++ + + + + PL PR+ + H E + T A + +
Sbjct: 755 ISTNTLRILALEKLGAVFNQVAFPLQYTPRKFVIHPESNNLIIIETDHNAYTEATKAQRK 814
Query: 683 QSCAEE-------------SEM-------------------------HFVRLLDDQTFEF 704
Q AEE +EM +R+++
Sbjct: 815 QQMAEEMVEAAGEDERELAAEMAAAFLNENLPESIFGAPKAGNGQWASVIRVMNPIQGNT 874
Query: 705 ISTYPLDTFEYGCSILSCSFSDDSNVYYC-VGTAYVLPEENEPTKGRILVF--IVEDG-K 760
+ L+ E S+ C FS+ +Y VG A L G + +V +G K
Sbjct: 875 LDLVQLEQNEAAFSVAVCRFSNTGEDWYVLVGVAKDLILNPRSVAGGFVYTYKLVNNGEK 934
Query: 761 LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY 820
L+ + + + ++ F G++L + + +++Y D G ++L +C + HI A Y
Sbjct: 935 LEFLHKTPVEEVPAAIAPFQGRVLIGVGKLLRVY-----DLGKKKLLRKC-ENKHI-ANY 987
Query: 821 ---VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAEN 877
+QT G ++V D+ +S + YK E + A D W++ +LD D GA+
Sbjct: 988 ISGIQTIGHRVIVSDVQESFIWVRYKRNENQLIIFADDTYPRWVTTASLLDYDTVAGADK 1047
Query: 878 NFNLFTVR--KNSEGATDE---------ERGRL-------EVVGEYHLGEFVNRFRHGSL 919
N+ VR N+ DE +RG L EV+ YH+GE V + +L
Sbjct: 1048 FGNICVVRLPPNTNDEVDEDPTGNKALWDRGLLNGASQKAEVIMNYHVGETVLSLQKTTL 1107
Query: 920 VMRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQ 978
+ G ++++ T++G IG++ HE + F + ++ +LR + G +H
Sbjct: 1108 I-------PGGSESLVYTTLSGGIGILVPFTSHEDHDFFQHVEMHLRSEHPPLCGRDHLS 1160
Query: 979 WRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
+RS+ KN +DGDL E F + + +S+ ++ + E+ K++E++
Sbjct: 1161 FRSY-----YFPVKNVIDGDLCEQFNSMEPNKQKNVSEELDRTPPEVSKKLEDI 1209
>gi|149640712|ref|XP_001506454.1| PREDICTED: splicing factor 3B subunit 3 [Ornithorhynchus anatinus]
gi|395508661|ref|XP_003758628.1| PREDICTED: splicing factor 3B subunit 3 [Sarcophilus harrisii]
Length = 1217
Score = 215 bits (547), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 273/1193 (22%), Positives = 496/1193 (41%), Gaps = 195/1193 (16%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+L V ++G I +L FR G +D++ + ++ + +L++ S + + + + G
Sbjct: 53 LLTVEVFGVIRSLMAFRLTGGTKDYIVVGSDSGRIVILEYQP-SKNMFEKIHQETFGKSG 111
Query: 61 -RPTDNGQIGIIDPDCRLIGLHLYDG--LFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
R GQ +DP R + + + L ++ D +L + + + L +
Sbjct: 112 CRRIVPGQFLAVDPKGRAVMISAIEKQKLVYILNRDAAARLTISSPLEAHKANTLVYHVV 171
Query: 118 ---YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNG 163
G P L D ++A + T E A + F E N+ L+
Sbjct: 172 GVDVGFENPMFACLEMDYEEADNDPTGEAAANTQQTLTFYELDLGLNHVVRKYSEPLEEH 231
Query: 164 ADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR--- 215
+ LI VP P GVLI E I Y N IR I + +D D R
Sbjct: 232 GNFLITVPGGSDGP--SGVLICSENYITY--KNFGDQPDIRCPIPRRRNDLD-DPERGMI 286
Query: 216 --------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAV 261
+L G + + + +++ VT ++++ +A+ + L
Sbjct: 287 FVCSATHKTKSMFFFLAQTEQGDIFKITLETDEDMVTEIRLKYFDTVPVAAAMCVLKTGF 346
Query: 262 VYIGSSYGDSQLIKLN------------------------LQPDAKGSYVEVLERYVNLG 297
+++ S +G+ L ++ QP + V +++ +L
Sbjct: 347 LFVASEFGNHYLYQIAHLGDDDEEPEFSSAMPLEEGDTFFFQPRPLKNLV-LVDELDSLS 405
Query: 298 PIVDFC-VVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRS 355
PI+ FC + DL + Q+ G SLR++R+G+ ++E A EL G +W++R
Sbjct: 406 PIL-FCQIADLANEDTPQLYVACGRGPRSSLRVLRHGLEVSEMAVSELPGNPNAVWTVRR 464
Query: 356 STDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSG 415
+D FD +++VSF++ T +L++ + +EE GF T TL C + LVQV
Sbjct: 465 HIEDEFDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSLLGDDALVQVYPD 522
Query: 416 SVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKH 473
+R + + R NEWK+P ++ N QV++A GG LVY E+ G L E +
Sbjct: 523 GIRHIRADKR--VNEWKTPGKKTIVKCAVNQRQVVIALTGGELVYFEMDPSGQLNEYTER 580
Query: 474 AQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIP-RS 532
++ ++ C+ + + S+ AVG+ D +VRI SL + + + L + +P +
Sbjct: 581 KEMSADVVCMSLANVPPGEQRSRFLAVGL-VDNTVRIISLDPSDCL--QPLSMQALPAQP 637
Query: 533 VLLCAFE----------------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQ 576
LC E G YL L +G LL +L+ TG+L+D + LG++
Sbjct: 638 ESLCIVEMGGAEKQDELGERGSIGFLYLNIGLQNGVLLRTVLDPVTGDLSDTRTRYLGSR 697
Query: 577 PITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIA 636
P+ L + V A S R + YS + + ++ + + F S P+ +
Sbjct: 698 PVKLFRVRMQGQEAVLAMSSRSWLSYSYQSRFHLTPLSYETLEFASGFASEQCPEGIVAI 757
Query: 637 KEGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQEQSR-----------TFAICSLKNQ 683
L I ++ + + + + PL PR+ + H E + T A + + Q
Sbjct: 758 STNTLRILALEKLGAVFNQVAFPLQYTPRKFVIHPESNNLIIIETDHNAYTEATKAQRKQ 817
Query: 684 SCAEE-------------SEM-------------------------HFVRLLDDQTFEFI 705
AEE +EM VR+++ +
Sbjct: 818 QMAEEMVEAAGEDERELAAEMAAAFLNENLPESIFGAPKAGNGQWASVVRVMNPIQGNTL 877
Query: 706 STYPLDTFEYGCSILSCSFSDDSNVYYC-VGTAYVLPEENEPTKGRILVF--IVEDG-KL 761
L+ E S+ C FS+ + +Y VG A L G + +V G KL
Sbjct: 878 DLVQLEQNEAAFSVAVCRFSNTGDDWYVLVGVAKDLILNPRSVAGGFVYTYKLVNSGEKL 937
Query: 762 QLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY- 820
+ + + + ++ F G++L + + +++Y D G ++L +C + HI A Y
Sbjct: 938 EFLHKTPVEEVPAAIAPFQGRVLIGVGKLLRVY-----DLGKKKLLRKC-ENKHI-ANYI 990
Query: 821 --VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENN 878
+QT G ++V D+ +S + YK E + A D W++ +LD D GA+
Sbjct: 991 SGIQTIGHRVIVSDVQESFIWVRYKRNENQLIIFADDTYPRWVTTASLLDYDTVAGADKF 1050
Query: 879 FNLFTVR--KNSEGATDE---------ERGRL-------EVVGEYHLGEFVNRFRHGSLV 920
N+ VR N+ DE +RG L EV+ YH+GE V + +L+
Sbjct: 1051 GNICVVRLPPNTNDEVDEDPTGNKALWDRGLLNGASQKAEVIMNYHVGETVLSLQKTTLI 1110
Query: 921 MRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQW 979
G ++++ T++G IG++ HE + F + ++ +LR + G +H +
Sbjct: 1111 -------PGGSESLVYTTLSGGIGILVPFTSHEDHDFFQHVEMHLRSEHPPLCGRDHLSF 1163
Query: 980 RSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
RS+ KN +DGDL E F + + +S+ ++ + E+ K++E++
Sbjct: 1164 RSY-----YFPVKNVIDGDLCEQFNSMEPNKQKNVSEELDRTPPEVSKKLEDI 1211
>gi|357478269|ref|XP_003609420.1| Splicing factor 3B subunit [Medicago truncatula]
gi|355510475|gb|AES91617.1| Splicing factor 3B subunit [Medicago truncatula]
Length = 1225
Score = 215 bits (547), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 268/1199 (22%), Positives = 494/1199 (41%), Gaps = 200/1199 (16%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESS---ELITRAMGDVSD 57
+L V ++G I +L FR G +DF+ + ++ + +L ++ + + ++ G
Sbjct: 55 ILSVQVFGTIRSLSQFRLTGAQKDFIVVGSDSGRIVILDYNKQKNVFDKIHQETFGKSGC 114
Query: 58 RIGRPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
R P GQ IDP R + + + V + + + LE + I F
Sbjct: 115 RRIVP---GQYLAIDPKGRAVMIAACEKKKLVYVLNRDSLARLTISSPLEANKSHTIVFS 171
Query: 118 Y-----GCAKPTIVVLYQDNKDA-------------RHVKTYEVALKDKDFVEGPWSQNN 159
G P + D DA +H+ YE+ L + V WS +
Sbjct: 172 ICAVDCGFENPIFAAIELDCSDADQDATGVAASQAQKHLIFYELDLG-LNHVSRKWS-DQ 229
Query: 160 LDNGADLLIPVPPPL---CGVLIIGEETIVYCS---ANAFKAIPIRPSITKAYGRVDADG 213
+DNGA++L+ VP GVL+ E ++Y + + IP R + G +
Sbjct: 230 VDNGANMLVTVPGGADGPSGVLVCAENFVIYKNQGHPDVRAVIPRRADLPAERGVLIVSA 289
Query: 214 SRY---------------LLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLD 258
+ + L ++ + + + ++V+ L I+ ++A +I L
Sbjct: 290 AMHKTKNLKPEEFKLFFLLQTEYGDIFKVTLTDGGGDRVSELNIKYFDTIAVAVSICVLK 349
Query: 259 NAVVYIGSSYGDSQLIKLN---------------------------LQPDAKGSYVEVLE 291
+ ++ S +G+ L + QP + V + +
Sbjct: 350 SGFLFAASEFGNHALYQFKGIGDDDNDVGASSASLMETEEGFQPVFFQPRRLKNLVRI-D 408
Query: 292 RYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGM 350
+ +L P++D V +L + Q+ T G SLRI+R G+ ++E A +L GI +
Sbjct: 409 QVESLMPVMDMKVSNLFEEETPQIFTLCGRGPRSSLRIMRTGLAVSEMAVSKLPGIPSAV 468
Query: 351 WSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLV 410
W+++ + D FD ++VVSF + T +L++ + +E GF +L + L+
Sbjct: 469 WTVKKNVMDEFDAYIVVSFTNATLVLSIG--ETADEVSDSGFLDTAPSLAVSLIGDDSLM 526
Query: 411 QVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILT 469
QV +R + R NEW++ ++ +N QV++A GG L+Y E+ G L
Sbjct: 527 QVHPNGIRHIREDGRT--NEWQTSGKRTIAKVGSNRLQVVIALNGGELIYFEVDVTGQLM 584
Query: 470 EVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEI 528
EV+ ++ +++CLDI P+ + S+ AVG + D ++RI SL PD + T
Sbjct: 585 EVERHEMSGDVACLDIAPVPKGRLRSRFLAVGSY-DKTIRILSLDPDDCMQTLGIQSLSS 643
Query: 529 IPRSVLLCAFEGIS------------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQ 576
P S+L + +L L +G L +++M TG L+D + LG +
Sbjct: 644 APESLLFLEVQASVGGEDGADHPASLFLNAGLQNGVLSRTVVDMVTGLLSDTRSRFLGLK 703
Query: 577 PITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNS-AAFPDSLAI 635
L + + S RP + Y L + ++ + + + F+S F +++
Sbjct: 704 APKLFPIIVRGKRAMLCLSSRPWLGYIHQGHFLLTPLSYETLEYAASFSSDQCFEGVVSV 763
Query: 636 AKEGELTIGTIDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAIC---------------- 678
A E L I T++ + + + IPL PR+ Q + + +
Sbjct: 764 ASEA-LRIFTVERLGETFNQNVIPLRYTPRKFVLQPKRKLLVVIESDQGALTAEEREAAR 822
Query: 679 --------SLKNQSCAEE---------------SEMHF-------------VRLLDDQTF 702
+ +N++ +E+ S+ H+ +R+LD +T
Sbjct: 823 KECFEAAHAGENKTGSEDQMENGGEDEDNDDSLSDEHYGYPKSESDKWVSCIRVLDPRTG 882
Query: 703 EFISTYPLDTFEYGCSILSCSFSD-DSNVYYCVGTAYVL--PEENEPTKGRILVF-IVED 758
L E SI + +F D + VGTA L + T G I ++ ++D
Sbjct: 883 NTTCLLELQENEAAFSICTVNFHDKEYGTLLAVGTAKGLQFTPKRSLTAGFIHIYRFLDD 942
Query: 759 GK-LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHIL 817
G+ L+L+ + + +G +L F G+LLA I ++LY D G R L +C +
Sbjct: 943 GRSLELLHKTQVEGVPLALCQFQGRLLAGIGPVLRLY-----DLGKRRLLRKCENKSFPS 997
Query: 818 ALY-VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAE 876
++ + D I VG + +S Y+ +E + A D W+++ +D D GA+
Sbjct: 998 SIVSIHAYRDRIYVGGIQESFHYCKYRRDENQLYIFADDSVPRWLTSSYHIDFDTMAGAD 1057
Query: 877 NNFNLFTVR------------------KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGS 918
N+F R K +G + ++E + ++H+G+ + + S
Sbjct: 1058 KFGNIFFARLPQDVSDEIEEDPTGGKIKWEQGKLNGAPNKVEEIVQFHVGDVITSLQKAS 1117
Query: 919 LVMRLPDSDVGQIPTVIFGTVNGVIGVI-ASLPHEQYLFLEKLQTNLRKVIKGVGGLNHE 977
LV G +++GTV G +G + A + F L+ ++R+ + G +H
Sbjct: 1118 LV-------PGGGECIVYGTVMGSVGALHAFTSRDDVDFFSHLEMHMRQDNPPLCGRDHM 1170
Query: 978 QWRSFNNEKKTVDAKNFLDGDLIESF----LDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
+RS K+ +DGDL E F +DL R DE+ +T E+ K++EE+
Sbjct: 1171 AYRS-----AYFPVKDVIDGDLCEQFPTLPMDLQRKIADELDRTPG----EILKKLEEV 1220
>gi|357478323|ref|XP_003609447.1| Splicing factor 3B subunit [Medicago truncatula]
gi|355510502|gb|AES91644.1| Splicing factor 3B subunit [Medicago truncatula]
Length = 1225
Score = 215 bits (547), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 272/1199 (22%), Positives = 492/1199 (41%), Gaps = 200/1199 (16%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESS---ELITRAMGDVSD 57
+L V ++G I +L FR G +DF+ + ++ + +L ++ + + ++ G
Sbjct: 55 ILSVQVFGTIRSLSQFRLTGAQKDFIVVGSDSGRIVILDYNKQKNVFDKIHQETFGKSGC 114
Query: 58 RIGRPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
R P GQ IDP R + + + V + + + LE + I F
Sbjct: 115 RRIVP---GQYLAIDPKGRAVMIAACEKKKLVYVLNRDSLARLTISSPLEANKSHTIVFS 171
Query: 118 Y-----GCAKPTIVVLYQDNKDA-------------RHVKTYEVALKDKDFVEGPWSQNN 159
G P + D DA +H+ YE+ L + V WS +
Sbjct: 172 ICAVDCGFENPIFAAIELDCSDADQDATGVAASQAQKHLIFYELDLG-LNHVSRKWS-DQ 229
Query: 160 LDNGADLLIPVPPPL---CGVLIIGEETIVYCS---ANAFKAIPIRPSITKAYGRVDADG 213
+DNGA++L+ VP GVL+ E ++Y + + IP R + G +
Sbjct: 230 VDNGANMLVTVPGGADGPSGVLVCAENFVIYKNQGHPDVRAVIPRRADLPAERGVLIVSA 289
Query: 214 SRY---------------LLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLD 258
+ + L ++ + + + ++V+ L I+ ++A +I L
Sbjct: 290 AMHKTKNLKPEEFKLFFLLQTEYGDIFKVTLTDGGGDRVSELNIKYFDTIAVAVSICVLK 349
Query: 259 NAVVYIGSSYGDSQLIKLNLQPD-------AKGSYVEV-------------LERYVNLGP 298
+ ++ S +G+ L + D + S +E L+ V + P
Sbjct: 350 SGFLFAASEFGNHALYQFKGIGDDDNDVVASSASLMETEEGFQPVFFLPRRLKNLVRIDP 409
Query: 299 ------IVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMW 351
I+D V +L + Q+ T G SLRI+R G+ ++E A +L GI +W
Sbjct: 410 VESLMSIMDMKVSNLFEEETPQIFTLCGRGPRSSLRIMRTGLAVSEMAVSKLPGIPSAVW 469
Query: 352 SLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQ 411
+++ + D FD+++VVSF + T +L++ + +EE GF +L + ++Q
Sbjct: 470 TVKKNVMDEFDSYIVVSFTNATLVLSIG--ETVEEVSNSGFLDTAPSLAVSLIGDDSIMQ 527
Query: 412 VTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE 470
V +R + R NEW++ ++ +N QV++A GG LVY E+ G L E
Sbjct: 528 VHPNGIRHIREDGRT--NEWQTSGKRTIAKVGSNRLQVVIALNGGELVYFEVDVTGQLME 585
Query: 471 VKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEII 529
V+ ++ +++CLDI P+ + S+ AVG + D ++RI SL PD + T
Sbjct: 586 VEKHEMSGDVACLDIAPVPKGRLRSRFLAVGSY-DNTIRILSLDPDDCMQTLSIQSLSSA 644
Query: 530 PRSVLLCAFEGIS------------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQP 577
P S+L + +L L G L +++ TG L+D + LG +
Sbjct: 645 PESLLFLEVQASVGGEDGADHPASLFLSAGLQSGVLSRTVVDTVTGLLSDARSRFLGLKA 704
Query: 578 ITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAK 637
L + S RP + Y L + ++ + + + F+S + +
Sbjct: 705 PKLFPIILGRKCAMLCLSSRPWLGYIHQGHFLLTPLSYETLEYAASFSSDQCLEGVVSVS 764
Query: 638 EGELTIGTIDDI-QKLHIRSIPLGEHPRRICHQEQSRTF--------AICSLKNQSCAEE 688
L I T++ + + + IPL PR++ Q + + A+ + + ++ +E
Sbjct: 765 GEALRIFTVERLGETFNQTVIPLRYTPRKLVLQPKRKLLVVIESDQGALTAKEREAARKE 824
Query: 689 -------------------------------SEMHF-------------VRLLDDQTFEF 704
S+ H+ +R+LD +T
Sbjct: 825 CFEAAQTGENATGSEDQMENDGEDEDQGDSLSDEHYGYPKSESDKWVSCIRVLDPRTGNT 884
Query: 705 ISTYPLDTFEYGCSILSCSFSD-DSNVYYCVGTAYVLPEENEPTKGRILVFI-----VED 758
L E SI + +F D + VGTA L + P + + FI ++D
Sbjct: 885 TCLLELQENEAAFSICTVNFHDKEYGTLLAVGTAKGL--QFTPKRSLTVGFIHIYRFLDD 942
Query: 759 GK-LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHIL 817
G+ L+L+ + + +G +L F G+LLA I ++LY D G R L +C + +
Sbjct: 943 GRSLELLHKTQVEGVPLALCQFQGRLLAGIGPVLRLY-----DLGKRRLLRKCENKSFPI 997
Query: 818 ALY-VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAE 876
++ + D I VGD+ +S Y+ +E + A D W++A +D D GA+
Sbjct: 998 SIVSIHAYRDRIYVGDIQESFHYCKYRRDENQLYIFADDSVPRWLTASYHIDFDTMAGAD 1057
Query: 877 NNFNLFTVR-----------KNSEGATDEERGRL-------EVVGEYHLGEFVNRFRHGS 918
N+F R + G E+G+L E + ++H+G+ + + +
Sbjct: 1058 KFGNIFFARLPQDVSDEVEEDPTSGKIKWEQGKLNGALNKVEEIVQFHVGDVITSLQKAA 1117
Query: 919 LVMRLPDSDVGQIPTVIFGTVNGVIGVI-ASLPHEQYLFLEKLQTNLRKVIKGVGGLNHE 977
LV G +++GTV G +G + A + F L+ ++R+ + G +H
Sbjct: 1118 LV-------PGGGECIVYGTVMGCVGALHAFTSRDDVDFFSHLEMHMRQDNPPLCGRDHM 1170
Query: 978 QWRSFNNEKKTVDAKNFLDGDLIESF----LDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
+RS K+ LDGDL E F +DL R DE+ +T E+ K++EEL
Sbjct: 1171 AYRS-----AYFPVKDVLDGDLCEQFPTLPMDLQRKIADELDRTPG----EILKKLEEL 1220
>gi|73957045|ref|XP_536791.2| PREDICTED: splicing factor 3B subunit 3 isoform 1 [Canis lupus
familiaris]
gi|149699332|ref|XP_001500880.1| PREDICTED: splicing factor 3B subunit 3 [Equus caballus]
gi|301771131|ref|XP_002920989.1| PREDICTED: splicing factor 3B subunit 3-like [Ailuropoda melanoleuca]
gi|344290794|ref|XP_003417122.1| PREDICTED: splicing factor 3B subunit 3-like [Loxodonta africana]
gi|410983914|ref|XP_003998280.1| PREDICTED: splicing factor 3B subunit 3 [Felis catus]
gi|426242169|ref|XP_004014947.1| PREDICTED: splicing factor 3B subunit 3 [Ovis aries]
gi|417406150|gb|JAA49749.1| Putative damage-specific dna binding complex subunit ddb1 [Desmodus
rotundus]
gi|431912451|gb|ELK14585.1| Splicing factor 3B subunit 3 [Pteropus alecto]
Length = 1217
Score = 214 bits (546), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 272/1193 (22%), Positives = 497/1193 (41%), Gaps = 195/1193 (16%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+L V ++G I +L FR G +D++ + ++ + +L++ S + + + + G
Sbjct: 53 LLTVEVFGVIRSLMAFRLTGGTKDYIVVGSDSGRIVILEYQP-SKNMFEKIHQETFGKSG 111
Query: 61 -RPTDNGQIGIIDPDCRLIGLHLYDG--LFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
R GQ +DP R + + + L ++ D +L + + + L +
Sbjct: 112 CRRIVPGQFLAVDPKGRAVMISAIEKQKLVYILNRDAAARLTISSPLEAHKANTLVYHVV 171
Query: 118 ---YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNG 163
G P L D ++A + T E A + F E N+ L+
Sbjct: 172 GVDVGFENPMFACLEMDYEEADNDPTGEAAANTQQTLTFYELDLGLNHVVRKYSEPLEEH 231
Query: 164 ADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR--- 215
+ LI VP P GVLI E I Y N IR I + +D D R
Sbjct: 232 GNFLITVPGGSDGP--SGVLICSENYITY--KNFGDQPDIRCPIPRRRNDLD-DPERGMI 286
Query: 216 --------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAV 261
+L G + + + +++ VT ++++ +A+ + L
Sbjct: 287 FVCSATHKTKSMFFFLAQTEQGDIFKITLETDEDMVTEIRLKYFDTVPVAAAMCVLKTGF 346
Query: 262 VYIGSSYGDSQLIKLN------------------------LQPDAKGSYVEVLERYVNLG 297
+++ S +G+ L ++ QP + V +++ +L
Sbjct: 347 LFVASEFGNHYLYQIAHLGDDDEEPEFSSAMPLEEGDTFFFQPRPLKNLV-LVDELDSLS 405
Query: 298 PIVDFC-VVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRS 355
PI+ FC + DL + Q+ G SLR++R+G+ ++E A EL G +W++R
Sbjct: 406 PIL-FCQIADLANEDTPQLYVACGRGPRSSLRVLRHGLEVSEMAVSELPGNPNAVWTVRR 464
Query: 356 STDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSG 415
+D FD +++VSF++ T +L++ + +EE GF T TL C + LVQV
Sbjct: 465 HIEDEFDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSLLGDDALVQVYPD 522
Query: 416 SVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKH 473
+R + + R NEWK+P ++ N QV++A GG LVY E+ G L E +
Sbjct: 523 GIRHIRADKR--VNEWKTPGKKTIVKCAVNQRQVVIALTGGELVYFEMDPSGQLNEYTER 580
Query: 474 AQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIP-RS 532
++ ++ C+ + + S+ AVG+ D +VRI SL + + + L + +P +
Sbjct: 581 KEMSADVVCMSLANVPPGEQRSRFLAVGL-VDNTVRIISLDPSDCL--QPLSMQALPAQP 637
Query: 533 VLLCAFE----------------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQ 576
LC E G YL L +G LL +L+ TG+L+D + LG++
Sbjct: 638 ESLCIVEMGGTEKQDELGERGSIGFLYLNIGLQNGVLLRTVLDPVTGDLSDTRTRYLGSR 697
Query: 577 PITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIA 636
P+ L + V A S R + YS + + ++ + + F S P+ +
Sbjct: 698 PVKLFRVRMQGQEAVLAMSSRSWLSYSYQSRFHLTPLSYETLEFASGFASEQCPEGIVAI 757
Query: 637 KEGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQEQSR-----------TFAICSLKNQ 683
L I ++ + + + + PL PR+ + H E + T A + + Q
Sbjct: 758 STNTLRILALEKLGAVFNQVAFPLQYTPRKFVIHPESNNLIIIETDHNAYTEATKAQRKQ 817
Query: 684 SCAEE-------------SEM-------------------------HFVRLLDDQTFEFI 705
AEE +EM +R+++ +
Sbjct: 818 QMAEEMVEAAGEDERELAAEMAAAFLNENLPESIFGAPKAGNGQWASVIRVMNPIQGNTL 877
Query: 706 STYPLDTFEYGCSILSCSFSDDSNVYYC-VGTAYVLPEENEPTKGRILVF--IVEDG-KL 761
L+ E S+ C FS+ + +Y VG A L G + +V +G KL
Sbjct: 878 DLVQLEQNEAAFSVAVCRFSNTGDDWYVLVGVAKDLILNPRSVAGGFVYTYKLVNNGEKL 937
Query: 762 QLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY- 820
+ + + + ++ F G++L + + +++Y D G ++L +C + HI A Y
Sbjct: 938 EFLHKTPVEEVPAAIAPFQGRVLIGVGKLLRVY-----DLGKKKLLRKC-ENKHI-ANYI 990
Query: 821 --VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENN 878
+QT G ++V D+ +S + YK E + A D W++ +LD D GA+
Sbjct: 991 SGIQTIGHRVIVSDVQESFIWVRYKRNENQLIIFADDTYPRWVTTASLLDYDTVAGADKF 1050
Query: 879 FNLFTVR--KNSEGATDE---------ERGRL-------EVVGEYHLGEFVNRFRHGSLV 920
N+ VR N+ DE +RG L EV+ YH+GE V + +L+
Sbjct: 1051 GNICVVRLPPNTNDEVDEDPTGNKALWDRGLLNGASQKAEVIMNYHVGETVLSLQKTTLI 1110
Query: 921 MRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQW 979
G ++++ T++G IG++ HE + F + ++ +LR + G +H +
Sbjct: 1111 -------PGGSESLVYTTLSGGIGILVPFTSHEDHDFFQHVEMHLRSEHPPLCGRDHLSF 1163
Query: 980 RSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
RS+ KN +DGDL E F + + +S+ ++ + E+ K++E++
Sbjct: 1164 RSY-----YFPVKNVIDGDLCEQFNSMEPNKQKNVSEELDRTPPEVSKKLEDI 1211
>gi|40788938|dbj|BAA32662.2| KIAA0017 protein [Homo sapiens]
Length = 1253
Score = 214 bits (546), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 272/1193 (22%), Positives = 496/1193 (41%), Gaps = 195/1193 (16%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+L V ++G I +L FR G +D++ + ++ + +L++ S + + + + G
Sbjct: 89 LLTVEVFGVIRSLMAFRLTGGTKDYIVVGSDSGRIVILEYQP-SKNMFEKIHQETFGKSG 147
Query: 61 -RPTDNGQIGIIDPDCRLIGLHLYDG--LFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
R GQ +DP R + + + L ++ D +L + + + L +
Sbjct: 148 CRRIVPGQFLAVDPKGRAVMISAIEKQKLVYILNRDAAARLTISSPLEAHKANTLVYHVV 207
Query: 118 ---YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNG 163
G P L D ++A + T E A + F E N+ L+
Sbjct: 208 GVDVGFENPMFACLEMDYEEADNDPTGEAAANTQQTLTFYELDLGLNHVVRKYSEPLEEH 267
Query: 164 ADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR--- 215
+ LI VP P GVLI E I Y N IR I + +D D R
Sbjct: 268 GNFLITVPGGSDGP--SGVLICSENYITY--KNFGDQPDIRCPIPRRRNDLD-DPERGMI 322
Query: 216 --------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAV 261
+L G + + + +++ VT ++++ +A+ + L
Sbjct: 323 FVCSATHKTKSMFFFLAQTEQGDIFKITLETDEDMVTEIRLKYFDTVPVAAAMCVLKTGF 382
Query: 262 VYIGSSYGDSQLIKLN------------------------LQPDAKGSYVEVLERYVNLG 297
+++ S +G+ L ++ QP + V +++ +L
Sbjct: 383 LFVASEFGNHYLYQIAHLGDDDEEPEFSSAMPLEEGDTFFFQPRPLKNLV-LVDELDSLS 441
Query: 298 PIVDFC-VVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRS 355
PI+ FC + DL + Q+ G SLR++R+G+ ++E A EL G +W++R
Sbjct: 442 PIL-FCQIADLANEDTPQLYVACGRGPRSSLRVLRHGLEVSEMAVSELPGNPNAVWTVRR 500
Query: 356 STDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSG 415
+D FD +++VSF++ T +L++ + +EE GF T TL C + LVQV
Sbjct: 501 HIEDEFDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSLLGDDALVQVYPD 558
Query: 416 SVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKH 473
+R + + R NEWK+P ++ N QV++A GG LVY E+ G L E +
Sbjct: 559 GIRHIRADKR--VNEWKTPGKKTIVKCAVNQRQVVIALTGGELVYFEMDPSGQLNEYTER 616
Query: 474 AQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIP-RS 532
++ ++ C+ + + S+ AVG+ D +VRI SL + + + L + +P +
Sbjct: 617 KEMSADVVCMSLANVPPGEQRSRFLAVGL-VDNTVRIISLDPSDCL--QPLSMQALPAQP 673
Query: 533 VLLCAFE----------------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQ 576
LC E G YL L +G LL +L+ TG+L+D + LG++
Sbjct: 674 ESLCIVEMGGTEKQDELGERGSIGFLYLNIGLQNGVLLRTVLDPVTGDLSDTRTRYLGSR 733
Query: 577 PITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIA 636
P+ L + V A S R + YS + + ++ + + F S P+ +
Sbjct: 734 PVKLFRVRMQGQEAVLAMSSRSWLSYSYQSRFHLTPLSYETLEFASGFASEQCPEGIVAI 793
Query: 637 KEGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQEQSR-----------TFAICSLKNQ 683
L I ++ + + + + PL PR+ + H E + T A + + Q
Sbjct: 794 STNTLRILALEKLGAVFNQVAFPLQYTPRKFVIHPESNNLIIIETDHNAYTEATKAQRKQ 853
Query: 684 SCAEE-------------SEM-------------------------HFVRLLDDQTFEFI 705
AEE +EM +R+++ +
Sbjct: 854 QMAEEMVEAAGEDERELAAEMAAAFLNENLPESIFGAPKAGNGQWASVIRVMNPIQGNTL 913
Query: 706 STYPLDTFEYGCSILSCSFSDDSNVYYC-VGTAYVLPEENEPTKGRILVF--IVEDG-KL 761
L+ E S+ C FS+ +Y VG A L G + +V +G KL
Sbjct: 914 DLVQLEQNEAAFSVAVCRFSNTGEDWYVLVGVAKDLILNPRSVAGGFVYTYKLVNNGEKL 973
Query: 762 QLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY- 820
+ + + + ++ F G++L + + +++Y D G ++L +C + HI A Y
Sbjct: 974 EFLHKTPVEEVPAAIAPFQGRVLIGVGKLLRVY-----DLGKKKLLRKC-ENKHI-ANYI 1026
Query: 821 --VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENN 878
+QT G ++V D+ +S + YK E + A D W++ +LD D GA+
Sbjct: 1027 SGIQTIGHRVIVSDVQESFIWVRYKRNENQLIIFADDTYPRWVTTASLLDYDTVAGADKF 1086
Query: 879 FNLFTVR--KNSEGATDE---------ERGRL-------EVVGEYHLGEFVNRFRHGSLV 920
N+ VR N+ DE +RG L EV+ YH+GE V + +L+
Sbjct: 1087 GNICVVRLPPNTNDEVDEDPTGNKALWDRGLLNGASQKAEVIMNYHVGETVLSLQKTTLI 1146
Query: 921 MRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQW 979
G ++++ T++G IG++ HE + F + ++ +LR + G +H +
Sbjct: 1147 -------PGGSESLVYTTLSGGIGILVPFTSHEDHDFFQHVEMHLRSEHPPLCGRDHLSF 1199
Query: 980 RSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
RS+ KN +DGDL E F + + +S+ ++ + E+ K++E++
Sbjct: 1200 RSY-----YFPVKNVIDGDLCEQFNSMEPNKQKNVSEELDRTPPEVSKKLEDI 1247
>gi|395836972|ref|XP_003791420.1| PREDICTED: splicing factor 3B subunit 3 [Otolemur garnettii]
Length = 1217
Score = 214 bits (545), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 272/1193 (22%), Positives = 496/1193 (41%), Gaps = 195/1193 (16%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+L V ++G I +L FR G +D++ + ++ + +L++ S + + + + G
Sbjct: 53 LLTVEVFGVIRSLMAFRLTGGTKDYIVVGSDSGRIVILEYQP-SKNMFEKIHQETFGKSG 111
Query: 61 -RPTDNGQIGIIDPDCRLIGLHLYDG--LFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
R GQ +DP R + + + L ++ D +L + + + L +
Sbjct: 112 CRRIVPGQFLAVDPKGRAVMISAIEKQKLVYILNRDAAARLTISSPLEAHKANTLVYHVV 171
Query: 118 ---YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNG 163
G P L D ++A + T E A + F E N+ L+
Sbjct: 172 GVDVGFENPMFACLEMDYEEADNDPTGEAAANTQQTLTFYELDLGLNHVVRKYSEPLEEH 231
Query: 164 ADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR--- 215
+ LI VP P GVLI E I Y N IR I + +D D R
Sbjct: 232 GNFLITVPGGSDGP--SGVLICSENYITY--KNFGDQPDIRCPIPRRRNDLD-DPERGMI 286
Query: 216 --------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAV 261
+L G + + + +++ VT ++++ +A+ + L
Sbjct: 287 FVCSATHKTKSMFFFLAQTEQGDIFKITLETDEDMVTEIRLKYFDTVPVAAAMCVLKTGF 346
Query: 262 VYIGSSYGDSQLIKLN------------------------LQPDAKGSYVEVLERYVNLG 297
+++ S +G+ L ++ QP + V +++ +L
Sbjct: 347 LFVASEFGNHYLYQIAHLGDDDEEPEFSSAMPLEEGDTFFFQPRPLKNLV-LVDELDSLS 405
Query: 298 PIVDFC-VVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRS 355
PI+ FC + DL + Q+ G SLR++R+G+ ++E A EL G +W++R
Sbjct: 406 PIL-FCQIADLANEDTPQLYVACGRGPRSSLRVLRHGLEVSEMAVSELPGNPNAVWTVRR 464
Query: 356 STDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSG 415
+D FD +++VSF++ T +L++ + +EE GF T TL C + LVQV
Sbjct: 465 HIEDEFDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSLLGDDALVQVYPD 522
Query: 416 SVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKH 473
+R + + R NEWK+P ++ N QV++A GG LVY E+ G L E +
Sbjct: 523 GIRHIRADKR--VNEWKTPGKKTIVKCAVNQRQVVIALTGGELVYFEMDPSGQLNEYTER 580
Query: 474 AQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIP-RS 532
++ ++ C+ + + S+ AVG+ D +VRI SL + + + L + +P +
Sbjct: 581 KEMSADVVCMSLANVPPGEQRSRFLAVGL-VDNTVRIISLDPSDCL--QPLSMQALPAQP 637
Query: 533 VLLCAFE----------------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQ 576
LC E G YL L +G LL +L+ TG+L+D + LG++
Sbjct: 638 ESLCIVEMGGTEKQDELGERGSIGFLYLNIGLQNGVLLRTVLDPVTGDLSDTRTRYLGSR 697
Query: 577 PITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIA 636
P+ L + V A S R + YS + + ++ + + F S P+ +
Sbjct: 698 PVKLFRVRMQGQEAVLAMSSRSWLSYSYQSRFHLTPLSYETLEFASGFASEQCPEGIVAI 757
Query: 637 KEGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQEQSR-----------TFAICSLKNQ 683
L I ++ + + + + PL PR+ + H E + T A + + Q
Sbjct: 758 STNTLRILALEKLGAVFNQVAFPLQYTPRKFVIHPESNNLIIIETDHNAYTEATKAQRKQ 817
Query: 684 SCAEE-------------SEM-------------------------HFVRLLDDQTFEFI 705
AEE +EM +R+++ +
Sbjct: 818 QMAEEMVEAAGEDERELAAEMAAAFLNENLPESIFGAPKAGSGQWASVIRVMNPIQGNTL 877
Query: 706 STYPLDTFEYGCSILSCSFSDDSNVYYC-VGTAYVLPEENEPTKGRILVF--IVEDG-KL 761
L+ E S+ C FS+ +Y VG A L G + +V +G KL
Sbjct: 878 DLVQLEQNEAAFSVAVCRFSNTGEEWYVLVGVAKDLILNPRSVAGGFVYTYKLVNNGEKL 937
Query: 762 QLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY- 820
+ + + + ++ F G++L + + +++Y D G ++L +C + HI A Y
Sbjct: 938 EFLHKTPVEEVPAAIAPFQGRVLIGVGKLLRVY-----DLGKKKLLRKC-ENKHI-ANYI 990
Query: 821 --VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENN 878
+QT G ++V D+ +S + YK E + A D W++ +LD D GA+
Sbjct: 991 SGIQTIGHRVIVSDVQESFIWVRYKRNENQLIIFADDTYPRWVTTASLLDYDTVAGADKF 1050
Query: 879 FNLFTVR--KNSEGATDE---------ERGRL-------EVVGEYHLGEFVNRFRHGSLV 920
N+ VR N+ DE +RG L EV+ YH+GE V + +L+
Sbjct: 1051 GNICVVRLPPNTNDEVDEDPTGNKALWDRGLLNGASQKAEVIMNYHVGETVLSLQKTTLI 1110
Query: 921 MRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQW 979
G ++++ T++G IG++ HE + F + ++ +LR + G +H +
Sbjct: 1111 -------PGGSESLVYTTLSGGIGILVPFTSHEDHDFFQHVEMHLRSEHPPLCGRDHLSF 1163
Query: 980 RSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
RS+ KN +DGDL E F + + +S+ ++ + E+ K++E++
Sbjct: 1164 RSY-----YFPVKNVIDGDLCEQFNSMEPNKQKNVSEELDRTPPEVSKKLEDI 1211
>gi|54112121|ref|NP_036558.3| splicing factor 3B subunit 3 [Homo sapiens]
gi|118150814|ref|NP_001071319.1| splicing factor 3B subunit 3 [Bos taurus]
gi|296231541|ref|XP_002761182.1| PREDICTED: splicing factor 3B subunit 3 isoform 2 [Callithrix
jacchus]
gi|332846357|ref|XP_511081.3| PREDICTED: splicing factor 3B subunit 3 [Pan troglodytes]
gi|397518709|ref|XP_003829523.1| PREDICTED: splicing factor 3B subunit 3 [Pan paniscus]
gi|402908991|ref|XP_003917214.1| PREDICTED: splicing factor 3B subunit 3 [Papio anubis]
gi|403298333|ref|XP_003939977.1| PREDICTED: splicing factor 3B subunit 3 [Saimiri boliviensis
boliviensis]
gi|426382761|ref|XP_004057969.1| PREDICTED: splicing factor 3B subunit 3 [Gorilla gorilla gorilla]
gi|116242787|sp|Q15393.4|SF3B3_HUMAN RecName: Full=Splicing factor 3B subunit 3; AltName:
Full=Pre-mRNA-splicing factor SF3b 130 kDa subunit;
Short=SF3b130; AltName: Full=STAF130; AltName:
Full=Spliceosome-associated protein 130; Short=SAP 130
gi|125987788|sp|A0JN52.1|SF3B3_BOVIN RecName: Full=Splicing factor 3B subunit 3; AltName:
Full=Pre-mRNA-splicing factor SF3b 130 kDa subunit;
Short=SF3b130; AltName: Full=Spliceosome-associated
protein 130; Short=SAP 130
gi|117306205|gb|AAI26519.1| Splicing factor 3b, subunit 3, 130kDa [Bos taurus]
gi|119572190|gb|EAW51805.1| splicing factor 3b, subunit 3, 130kDa, isoform CRA_c [Homo sapiens]
gi|168274284|dbj|BAG09562.1| splicing factor 3B subunit 3 [synthetic construct]
gi|296478233|tpg|DAA20348.1| TPA: splicing factor 3B subunit 3 [Bos taurus]
gi|355710371|gb|EHH31835.1| Spliceosome-associated protein 130 [Macaca mulatta]
gi|355756944|gb|EHH60552.1| Spliceosome-associated protein 130 [Macaca fascicularis]
gi|380811142|gb|AFE77446.1| splicing factor 3B subunit 3 [Macaca mulatta]
gi|383417057|gb|AFH31742.1| splicing factor 3B subunit 3 [Macaca mulatta]
gi|384946118|gb|AFI36664.1| splicing factor 3B subunit 3 [Macaca mulatta]
gi|410209986|gb|JAA02212.1| splicing factor 3b, subunit 3, 130kDa [Pan troglodytes]
gi|410260956|gb|JAA18444.1| splicing factor 3b, subunit 3, 130kDa [Pan troglodytes]
gi|410355121|gb|JAA44164.1| splicing factor 3b, subunit 3, 130kDa [Pan troglodytes]
Length = 1217
Score = 214 bits (545), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 272/1193 (22%), Positives = 496/1193 (41%), Gaps = 195/1193 (16%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+L V ++G I +L FR G +D++ + ++ + +L++ S + + + + G
Sbjct: 53 LLTVEVFGVIRSLMAFRLTGGTKDYIVVGSDSGRIVILEYQP-SKNMFEKIHQETFGKSG 111
Query: 61 -RPTDNGQIGIIDPDCRLIGLHLYDG--LFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
R GQ +DP R + + + L ++ D +L + + + L +
Sbjct: 112 CRRIVPGQFLAVDPKGRAVMISAIEKQKLVYILNRDAAARLTISSPLEAHKANTLVYHVV 171
Query: 118 ---YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNG 163
G P L D ++A + T E A + F E N+ L+
Sbjct: 172 GVDVGFENPMFACLEMDYEEADNDPTGEAAANTQQTLTFYELDLGLNHVVRKYSEPLEEH 231
Query: 164 ADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR--- 215
+ LI VP P GVLI E I Y N IR I + +D D R
Sbjct: 232 GNFLITVPGGSDGP--SGVLICSENYITY--KNFGDQPDIRCPIPRRRNDLD-DPERGMI 286
Query: 216 --------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAV 261
+L G + + + +++ VT ++++ +A+ + L
Sbjct: 287 FVCSATHKTKSMFFFLAQTEQGDIFKITLETDEDMVTEIRLKYFDTVPVAAAMCVLKTGF 346
Query: 262 VYIGSSYGDSQLIKLN------------------------LQPDAKGSYVEVLERYVNLG 297
+++ S +G+ L ++ QP + V +++ +L
Sbjct: 347 LFVASEFGNHYLYQIAHLGDDDEEPEFSSAMPLEEGDTFFFQPRPLKNLV-LVDELDSLS 405
Query: 298 PIVDFC-VVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRS 355
PI+ FC + DL + Q+ G SLR++R+G+ ++E A EL G +W++R
Sbjct: 406 PIL-FCQIADLANEDTPQLYVACGRGPRSSLRVLRHGLEVSEMAVSELPGNPNAVWTVRR 464
Query: 356 STDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSG 415
+D FD +++VSF++ T +L++ + +EE GF T TL C + LVQV
Sbjct: 465 HIEDEFDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSLLGDDALVQVYPD 522
Query: 416 SVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKH 473
+R + + R NEWK+P ++ N QV++A GG LVY E+ G L E +
Sbjct: 523 GIRHIRADKR--VNEWKTPGKKTIVKCAVNQRQVVIALTGGELVYFEMDPSGQLNEYTER 580
Query: 474 AQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIP-RS 532
++ ++ C+ + + S+ AVG+ D +VRI SL + + + L + +P +
Sbjct: 581 KEMSADVVCMSLANVPPGEQRSRFLAVGL-VDNTVRIISLDPSDCL--QPLSMQALPAQP 637
Query: 533 VLLCAFE----------------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQ 576
LC E G YL L +G LL +L+ TG+L+D + LG++
Sbjct: 638 ESLCIVEMGGTEKQDELGERGSIGFLYLNIGLQNGVLLRTVLDPVTGDLSDTRTRYLGSR 697
Query: 577 PITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIA 636
P+ L + V A S R + YS + + ++ + + F S P+ +
Sbjct: 698 PVKLFRVRMQGQEAVLAMSSRSWLSYSYQSRFHLTPLSYETLEFASGFASEQCPEGIVAI 757
Query: 637 KEGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQEQSR-----------TFAICSLKNQ 683
L I ++ + + + + PL PR+ + H E + T A + + Q
Sbjct: 758 STNTLRILALEKLGAVFNQVAFPLQYTPRKFVIHPESNNLIIIETDHNAYTEATKAQRKQ 817
Query: 684 SCAEE-------------SEM-------------------------HFVRLLDDQTFEFI 705
AEE +EM +R+++ +
Sbjct: 818 QMAEEMVEAAGEDERELAAEMAAAFLNENLPESIFGAPKAGNGQWASVIRVMNPIQGNTL 877
Query: 706 STYPLDTFEYGCSILSCSFSDDSNVYYC-VGTAYVLPEENEPTKGRILVF--IVEDG-KL 761
L+ E S+ C FS+ +Y VG A L G + +V +G KL
Sbjct: 878 DLVQLEQNEAAFSVAVCRFSNTGEDWYVLVGVAKDLILNPRSVAGGFVYTYKLVNNGEKL 937
Query: 762 QLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY- 820
+ + + + ++ F G++L + + +++Y D G ++L +C + HI A Y
Sbjct: 938 EFLHKTPVEEVPAAIAPFQGRVLIGVGKLLRVY-----DLGKKKLLRKC-ENKHI-ANYI 990
Query: 821 --VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENN 878
+QT G ++V D+ +S + YK E + A D W++ +LD D GA+
Sbjct: 991 SGIQTIGHRVIVSDVQESFIWVRYKRNENQLIIFADDTYPRWVTTASLLDYDTVAGADKF 1050
Query: 879 FNLFTVR--KNSEGATDE---------ERGRL-------EVVGEYHLGEFVNRFRHGSLV 920
N+ VR N+ DE +RG L EV+ YH+GE V + +L+
Sbjct: 1051 GNICVVRLPPNTNDEVDEDPTGNKALWDRGLLNGASQKAEVIMNYHVGETVLSLQKTTLI 1110
Query: 921 MRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQW 979
G ++++ T++G IG++ HE + F + ++ +LR + G +H +
Sbjct: 1111 -------PGGSESLVYTTLSGGIGILVPFTSHEDHDFFQHVEMHLRSEHPPLCGRDHLSF 1163
Query: 980 RSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
RS+ KN +DGDL E F + + +S+ ++ + E+ K++E++
Sbjct: 1164 RSY-----YFPVKNVIDGDLCEQFNSMEPNKQKNVSEELDRTPPEVSKKLEDI 1211
>gi|109129162|ref|XP_001107025.1| PREDICTED: splicing factor 3B subunit 3-like isoform 1 [Macaca
mulatta]
gi|297284421|ref|XP_002802591.1| PREDICTED: splicing factor 3B subunit 3-like isoform 2 [Macaca
mulatta]
Length = 1199
Score = 214 bits (545), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 272/1193 (22%), Positives = 496/1193 (41%), Gaps = 195/1193 (16%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+L V ++G I +L FR G +D++ + ++ + +L++ S + + + + G
Sbjct: 35 LLTVEVFGVIRSLMAFRLTGGTKDYIVVGSDSGRIVILEYQP-SKNMFEKIHQETFGKSG 93
Query: 61 -RPTDNGQIGIIDPDCRLIGLHLYDG--LFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
R GQ +DP R + + + L ++ D +L + + + L +
Sbjct: 94 CRRIVPGQFLAVDPKGRAVMISAIEKQKLVYILNRDAAARLTISSPLEAHKANTLVYHVV 153
Query: 118 ---YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNG 163
G P L D ++A + T E A + F E N+ L+
Sbjct: 154 GVDVGFENPMFACLEMDYEEADNDPTGEAAANTQQTLTFYELDLGLNHVVRKYSEPLEEH 213
Query: 164 ADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR--- 215
+ LI VP P GVLI E I Y N IR I + +D D R
Sbjct: 214 GNFLITVPGGSDGP--SGVLICSENYITY--KNFGDQPDIRCPIPRRRNDLD-DPERGMI 268
Query: 216 --------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAV 261
+L G + + + +++ VT ++++ +A+ + L
Sbjct: 269 FVCSATHKTKSMFFFLAQTEQGDIFKITLETDEDMVTEIRLKYFDTVPVAAAMCVLKTGF 328
Query: 262 VYIGSSYGDSQLIKLN------------------------LQPDAKGSYVEVLERYVNLG 297
+++ S +G+ L ++ QP + V +++ +L
Sbjct: 329 LFVASEFGNHYLYQIAHLGDDDEEPEFSSAMPLEEGDTFFFQPRPLKNLV-LVDELDSLS 387
Query: 298 PIVDFC-VVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRS 355
PI+ FC + DL + Q+ G SLR++R+G+ ++E A EL G +W++R
Sbjct: 388 PIL-FCQIADLANEDTPQLYVACGRGPRSSLRVLRHGLEVSEMAVSELPGNPNAVWTVRR 446
Query: 356 STDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSG 415
+D FD +++VSF++ T +L++ + +EE GF T TL C + LVQV
Sbjct: 447 HIEDEFDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSLLGDDALVQVYPD 504
Query: 416 SVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKH 473
+R + + R NEWK+P ++ N QV++A GG LVY E+ G L E +
Sbjct: 505 GIRHIRADKR--VNEWKTPGKKTIVKCAVNQRQVVIALTGGELVYFEMDPSGQLNEYTER 562
Query: 474 AQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIP-RS 532
++ ++ C+ + + S+ AVG+ D +VRI SL + + + L + +P +
Sbjct: 563 KEMSADVVCMSLANVPPGEQRSRFLAVGL-VDNTVRIISLDPSDCL--QPLSMQALPAQP 619
Query: 533 VLLCAFE----------------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQ 576
LC E G YL L +G LL +L+ TG+L+D + LG++
Sbjct: 620 ESLCIVEMGGTEKQDELGERGSIGFLYLNIGLQNGVLLRTVLDPVTGDLSDTRTRYLGSR 679
Query: 577 PITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIA 636
P+ L + V A S R + YS + + ++ + + F S P+ +
Sbjct: 680 PVKLFRVRMQGQEAVLAMSSRSWLSYSYQSRFHLTPLSYETLEFASGFASEQCPEGIVAI 739
Query: 637 KEGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQEQSR-----------TFAICSLKNQ 683
L I ++ + + + + PL PR+ + H E + T A + + Q
Sbjct: 740 STNTLRILALEKLGAVFNQVAFPLQYTPRKFVIHPESNNLIIIETDHNAYTEATKAQRKQ 799
Query: 684 SCAEE-------------SEM-------------------------HFVRLLDDQTFEFI 705
AEE +EM +R+++ +
Sbjct: 800 QMAEEMVEAAGEDERELAAEMAAAFLNENLPESIFGAPKAGNGQWASVIRVMNPIQGNTL 859
Query: 706 STYPLDTFEYGCSILSCSFSDDSNVYYC-VGTAYVLPEENEPTKGRILVF--IVEDG-KL 761
L+ E S+ C FS+ +Y VG A L G + +V +G KL
Sbjct: 860 DLVQLEQNEAAFSVAVCRFSNTGEDWYVLVGVAKDLILNPRSVAGGFVYTYKLVNNGEKL 919
Query: 762 QLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY- 820
+ + + + ++ F G++L + + +++Y D G ++L +C + HI A Y
Sbjct: 920 EFLHKTPVEEVPAAIAPFQGRVLIGVGKLLRVY-----DLGKKKLLRKC-ENKHI-ANYI 972
Query: 821 --VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENN 878
+QT G ++V D+ +S + YK E + A D W++ +LD D GA+
Sbjct: 973 SGIQTIGHRVIVSDVQESFIWVRYKRNENQLIIFADDTYPRWVTTASLLDYDTVAGADKF 1032
Query: 879 FNLFTVR--KNSEGATDE---------ERGRL-------EVVGEYHLGEFVNRFRHGSLV 920
N+ VR N+ DE +RG L EV+ YH+GE V + +L+
Sbjct: 1033 GNICVVRLPPNTNDEVDEDPTGNKALWDRGLLNGASQKAEVIMNYHVGETVLSLQKTTLI 1092
Query: 921 MRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQW 979
G ++++ T++G IG++ HE + F + ++ +LR + G +H +
Sbjct: 1093 -------PGGSESLVYTTLSGGIGILVPFTSHEDHDFFQHVEMHLRSEHPPLCGRDHLSF 1145
Query: 980 RSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
RS+ KN +DGDL E F + + +S+ ++ + E+ K++E++
Sbjct: 1146 RSY-----YFPVKNVIDGDLCEQFNSMEPNKQKNVSEELDRTPPEVSKKLEDI 1193
>gi|197101659|ref|NP_001125500.1| splicing factor 3B subunit 3 [Pongo abelii]
gi|75042033|sp|Q5RBI5.1|SF3B3_PONAB RecName: Full=Splicing factor 3B subunit 3; AltName:
Full=Pre-mRNA-splicing factor SF3b 130 kDa subunit;
Short=SF3b130; AltName: Full=Spliceosome-associated
protein 130; Short=SAP 130
gi|55728259|emb|CAH90875.1| hypothetical protein [Pongo abelii]
Length = 1217
Score = 214 bits (545), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 272/1193 (22%), Positives = 496/1193 (41%), Gaps = 195/1193 (16%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+L V ++G I +L FR G +D++ + ++ + +L++ S + + + + G
Sbjct: 53 LLTVEVFGVIRSLMAFRLTGGTKDYIVVGSDSGRIVILEYQP-SKNMFEKIHQETFGKSG 111
Query: 61 -RPTDNGQIGIIDPDCRLIGLHLYDG--LFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
R GQ +DP R + + + L ++ D +L + + + L +
Sbjct: 112 CRRIVPGQFLAVDPKGRAVMISAIEKQKLVYILNRDAAARLTISSPLEAHKANTLVYHVV 171
Query: 118 ---YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNG 163
G P L D ++A + T E A + F E N+ L+
Sbjct: 172 GVDVGFENPMFACLEMDYEEADNDPTGEAAANTQQTLTFYELDLGLNHVVRKYSEPLEEH 231
Query: 164 ADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR--- 215
+ LI VP P GVLI E I Y N IR I + +D D R
Sbjct: 232 GNFLITVPGGSDGP--SGVLICSENYITY--KNFGDQPDIRCPIPRRRNDLD-DPERGMI 286
Query: 216 --------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAV 261
+L G + + + +++ VT ++++ +A+ + L
Sbjct: 287 FVCSATHKTKSMFFFLAQTEQGDIFKITLETDEDMVTEIRLKYFDTVPVAAAMCVLKTGF 346
Query: 262 VYIGSSYGDSQLIKLN------------------------LQPDAKGSYVEVLERYVNLG 297
+++ S +G+ L ++ QP + V +++ +L
Sbjct: 347 LFVASEFGNHYLYQIAHLGDDDEEPEFSSAMPLEEGDTFFFQPRPLKNLV-LVDELDSLS 405
Query: 298 PIVDFC-VVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRS 355
PI+ FC + DL + Q+ G SLR++R+G+ ++E A EL G +W++R
Sbjct: 406 PIL-FCQIADLANEDTPQLYVACGRGPRSSLRVLRHGLEVSETAVSELPGNPNAVWTVRR 464
Query: 356 STDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSG 415
+D FD +++VSF++ T +L++ + +EE GF T TL C + LVQV
Sbjct: 465 HIEDEFDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSLLGDDALVQVYPD 522
Query: 416 SVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKH 473
+R + + R NEWK+P ++ N QV++A GG LVY E+ G L E +
Sbjct: 523 GIRHIRADKR--VNEWKTPGKKTIVKCAVNQRQVVIALTGGELVYFEMDPSGQLNEYTER 580
Query: 474 AQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIP-RS 532
++ ++ C+ + + S+ AVG+ D +VRI SL + + + L + +P +
Sbjct: 581 KEMSADVVCMSLANVPPGEQRSRFLAVGL-VDNTVRIISLDPSDCL--QPLSMQALPAQP 637
Query: 533 VLLCAFE----------------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQ 576
LC E G YL L +G LL +L+ TG+L+D + LG++
Sbjct: 638 ESLCIVEMGGTEKQDELGERGSIGFLYLNIGLQNGVLLRTVLDPVTGDLSDTRTRYLGSR 697
Query: 577 PITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIA 636
P+ L + V A S R + YS + + ++ + + F S P+ +
Sbjct: 698 PVKLFRVRMQGQEAVLAMSSRSWLSYSYQSRFHLTPLSYETLEFASGFASEQCPEGIVAI 757
Query: 637 KEGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQEQSR-----------TFAICSLKNQ 683
L I ++ + + + + PL PR+ + H E + T A + + Q
Sbjct: 758 STNTLRILALEKLGAVFNQVAFPLQYTPRKFVIHPESNNLIIIETDHNAYTEATKAQRKQ 817
Query: 684 SCAEE-------------SEM-------------------------HFVRLLDDQTFEFI 705
AEE +EM +R+++ +
Sbjct: 818 QMAEEMVEAAGEDERELAAEMAAAFLNENLPESIFGAPKAGSGQWASVIRVMNPIQGNTL 877
Query: 706 STYPLDTFEYGCSILSCSFSDDSNVYYC-VGTAYVLPEENEPTKGRILVF--IVEDG-KL 761
L+ E S+ C FS+ +Y VG A L G + +V +G KL
Sbjct: 878 DLVQLEQNEAAFSVAVCRFSNTGEDWYVLVGVAKDLILNPRSVAGGFVYTYKLVNNGEKL 937
Query: 762 QLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY- 820
+ + + + ++ F G++L + + +++Y D G ++L +C + HI A Y
Sbjct: 938 EFLHKTPVEEVPAAIAPFQGRVLIGVGKLLRVY-----DLGKKKLLRKC-ENKHI-ANYI 990
Query: 821 --VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENN 878
+QT G ++V D+ +S + YK E + A D W++ +LD D GA+
Sbjct: 991 SGIQTIGHRVIVSDVQESFIWVRYKRNENQLIIFADDTYPRWVTTASLLDYDTVAGADKF 1050
Query: 879 FNLFTVR--KNSEGATDE---------ERGRL-------EVVGEYHLGEFVNRFRHGSLV 920
N+ VR N+ DE +RG L EV+ YH+GE V + +L+
Sbjct: 1051 GNICVVRLPPNTNDEVDEDPTGNKALRDRGLLNGASQKAEVIMNYHVGETVLSLQKTTLI 1110
Query: 921 MRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQW 979
G ++++ T++G IG++ HE + F + ++ +LR + G +H +
Sbjct: 1111 -------PGGSESLVYTTLSGGIGILVPFTSHEDHDFFQHVEMHLRSEHPPLCGRDHLSF 1163
Query: 980 RSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
RS+ KN +DGDL E F + + +S+ ++ + E+ K++E++
Sbjct: 1164 RSY-----YFPVKNVIDGDLCEQFNSMEPNKQKNVSEELDRTPPEVTKKLEDI 1211
>gi|425768510|gb|EKV07031.1| Pre-mRNA-splicing factor rse1 [Penicillium digitatum PHI26]
gi|425775700|gb|EKV13954.1| Pre-mRNA-splicing factor rse1 [Penicillium digitatum Pd1]
Length = 1209
Score = 214 bits (545), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 289/1169 (24%), Positives = 490/1169 (41%), Gaps = 171/1169 (14%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
++G I +L FR G +D++ I ++ + ++++ S R + + G R
Sbjct: 64 VFGIIRSLAAFRVAGSNKDYIIIGSDSGRITIVEY-VPSQNRFNRIHMETFGKSGVRRVV 122
Query: 65 NGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YG 119
GQ +DP R LI + L V+ +++ +L + + + Q L + G
Sbjct: 123 PGQYLAVDPKGRACLIASVEKNKLVYVLNRNSQAELTISSPLEAHKPQTLVFAMICLDVG 182
Query: 120 CAKPTIVVLYQDNKDARHVKT-------------YEVALKDKDFVEGPWSQNNLDNGADL 166
P L D ++ T YE+ L V WS + +D A +
Sbjct: 183 YDNPVFAALEVDYSESDQDPTGQAYEEIEKVLVYYELDLGLNHVVRK-WS-DTVDRTASM 240
Query: 167 LIPVP-----PPLCGVLIIGEETIVYCSAN--AFKA-IPIRPSITKAYGRVDADGSR--- 215
L VP P GVL+ GE+ I Y +N AF+ IP R T+ R + S
Sbjct: 241 LFQVPGGDDGP--SGVLVCGEDNITYRHSNQGAFRVPIPRRSGATENPERKRSIVSGVMH 298
Query: 216 -------YLLGDHAG---LLHLLVITHEKEKVTG----LKIELLGETSIASTISYLDNAV 261
+LL G L+L ++ ++ + TG L I+ +++ + L +
Sbjct: 299 KMRGAFFFLLQTEDGDLFKLNLEMVEDKQGQPTGEVKRLTIKYFDTIPVSTNLLILKSGF 358
Query: 262 VYIGSSYGDS---QLIKLNLQPD-----------------------AKGS-YVEVLERYV 294
+Y S G+ Q KL PD +G+ V ++E
Sbjct: 359 LYAASESGNHNFYQFEKLGDDPDELFFSSDSFSADHSVPSAPVFFHPRGAENVNLMESMN 418
Query: 295 NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSL 353
+L P++D +++L Q+ T G+ S R +++G+ ++E +LQ + +W+
Sbjct: 419 SLSPLIDSKILNLSEDDAPQIYTICGSGARSSFRTLKHGLEVSEIVESDLQQVPSAVWTT 478
Query: 354 RSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVT 413
+ + D F T++++SF + T +L++ + +EE GF S TL + LVQV
Sbjct: 479 KLTRADEFHTYIILSFANGTLVLSIG--EIVEEVSDTGFLSSAPTLAVQQLGEDSLVQVH 536
Query: 414 SGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTEV- 471
+R + + R NEW +P S+ A N QV +A G +VY E+ DG L E
Sbjct: 537 PRGIRHILADQRV--NEWPAPQHRSIVAAATNERQVAVALSSGEIVYFEMDADGTLAEYD 594
Query: 472 KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIP 530
+ Q+ ++ L + + E S AVG D +VRI SL PD L K P
Sbjct: 595 ERRQMSGTVTSLSMGEVPEGRMRSSFLAVGC-DDSTVRILSLDPDSTLENKSVQALTSAP 653
Query: 531 RSVLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFS 584
++ + A S YL L G L +L+ TGEL+D + +G +P+ L S
Sbjct: 654 SALQIMAMADSSSGGTTLYLHIGLYSGVYLRTVLDEVTGELSDTRTRFIGAKPVKLSQVS 713
Query: 585 SKNTTHVFAASDRPTVIYS--SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELT 642
K T V A S RP + YS K + + ++ + F+S + + + L
Sbjct: 714 VKGQTAVLALSTRPWLGYSDIQTKSFMLTPLDYVGLEWGWNFSSEQCVEGMIGIQGRSLR 773
Query: 643 IGTIDDIQKLHIR-SIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQ- 700
I T++ + ++ SIPL PRR F + N + + RL+DD
Sbjct: 774 IFTVEKLDNNMLQESIPLSYTPRRFVKHPDQHLFYVIESDNNVLSPATRQ---RLIDDSQ 830
Query: 701 ----------------------------------TFEFISTYPLDTFEYGCSILSCSFSD 726
T IST L+ E S+ + SFS
Sbjct: 831 AQNGEVADLPPADFGYPRATGHWASCVQIVDPITTKSVISTLDLEDNEAAVSLAAVSFSS 890
Query: 727 -DSNVYYCVGTA---YVLPEENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVYSLNAFN 780
D + VGTA V P + + G I ++ EDG+ L+ I + + +L F
Sbjct: 891 QDDETFLVVGTAKDMTVSPPSS--SCGFIHIYRFQEDGRELEFIHKTQVDEPPLALLGFQ 948
Query: 781 GKLLAAINQKIQLYKWMLRDDGTRELQSECGH---HGHILALYVQTRGDFIVVGDLMKSI 837
G+LLA I +++Y D G ++L +C I+ L QT+G I+V D+ +S+
Sbjct: 949 GRLLAGIGPVLRVY-----DLGMKQLLRKCQAPVVPKTIVGL--QTQGSRIIVSDIRESV 1001
Query: 838 SLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDE 894
+ ++YK+++ + A D A W S+ ++D + G + NL+ VR K SE A ++
Sbjct: 1002 TYVVYKYQDNVLIPFADDSIARWTSSTTMVDYETTAGGDKFGNLWLVRCPSKISEQADED 1061
Query: 895 --------ERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV--GQIPTVIFGTVNGVIG 944
E+G L G H E + F + L + + G V++ + G IG
Sbjct: 1062 GSGAHLIHEKGYLH--GTPHRLELMVHFFAQDIPTSLHKTQLVAGGRDIVVWTGLQGTIG 1119
Query: 945 V-IASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESF 1003
+ + + E F + L+T L + G +H +R + K V +DGDL E +
Sbjct: 1120 MFVPFVSREDVDFFQLLETQLASQQPPLAGRDHLMYRGYYAPVKGV-----IDGDLCEMY 1174
Query: 1004 LDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
L L I+ ++ SV E+ +++ ++
Sbjct: 1175 LLLPNDTKLMIAGELDRSVREIERKISDM 1203
>gi|326676028|ref|XP_002667683.2| PREDICTED: splicing factor 3B subunit 3-like [Danio rerio]
gi|123888555|sp|Q1LVE8.1|SF3B3_DANRE RecName: Full=Splicing factor 3B subunit 3
Length = 1217
Score = 214 bits (545), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 265/1190 (22%), Positives = 491/1190 (41%), Gaps = 189/1190 (15%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+L + ++G + +L FR G +D++ + ++ + +L++ S + + + + G
Sbjct: 53 LLTMEVFGVVRSLMAFRLTGGTKDYVVVGSDSGRIVILEYHP-SKNMFEKIHQETFGKSG 111
Query: 61 -RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
R GQ +DP R +IG L ++ D +L + + + L +
Sbjct: 112 CRRIVPGQFLAVDPKGRAVMIGATEKQKLVYILNRDAAARLTISSPLEAHKANTLVYHVV 171
Query: 118 ---YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNG 163
G P L D ++A + T E A + F E N+ L+
Sbjct: 172 GVDVGFENPMFACLEMDYEEADNDPTGEAAANTQQTLTFYELDLGLNHVVRKYSEALEEH 231
Query: 164 ADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR--- 215
+ LI VP P GVLI E I Y N IR I + +D D R
Sbjct: 232 GNFLITVPGGSDGP--SGVLICSENYITY--KNFGDQPDIRCPIPRRRNDLD-DPERGMI 286
Query: 216 --------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAV 261
+L G + + + ++E VT ++++ +A+ + L
Sbjct: 287 FVCSATHKTKSMFFFLAQTEQGDIFKVTLETDEEMVTEIRMKYFDTIPVATAMCVLKTGF 346
Query: 262 VYIGSSYGDSQLIKLN------------------------LQPDAKGSYVEVLERYVNLG 297
+++ S +G+ L ++ QP + V V E+ +L
Sbjct: 347 LFVSSEFGNHYLYQIAHLGDDDEEPEFSSAMPLEEGDTFFFQPRPLKNLVLVDEQE-SLS 405
Query: 298 PIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSS 356
PI+ + DL + Q+ G +LR++R+G+ ++E A EL G +W++R
Sbjct: 406 PIMSCQIADLANEDTPQLYVACGRGPRSTLRVLRHGLEVSEMAVSELPGNPNAVWTVRRH 465
Query: 357 TDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGS 416
+D FD +++VSF++ T +L++ + +EE GF T TL C + LVQV
Sbjct: 466 VEDEFDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSLLGEDALVQVYPDG 523
Query: 417 VRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKHA 474
+R + + R NEWK+P ++ N QV++A GG LVY E+ G L E +
Sbjct: 524 IRHIRADKR--VNEWKTPGKKTIIRCAVNQRQVVIALTGGELVYFEMDPSGQLNEYTERK 581
Query: 475 QLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPD---LNLITKEHLGGEIIPR 531
++ ++ C+ + + S+ AVG+ D +VRI SL L ++ + L + P
Sbjct: 582 EMSADVVCMSLANVPPGEQRSRFLAVGL-VDNTVRIISLDPSDCLQPLSMQALPAQ--PE 638
Query: 532 SVLLCAFEGIS--------------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQP 577
S+ + G+ YL L +G LL +L+ TG+L+D + LG++P
Sbjct: 639 SLCIVEMGGVEKQDELGEKGTIGFLYLNIGLQNGVLLRTVLDPVTGDLSDTRTRYLGSRP 698
Query: 578 ITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAK 637
+ L + V A S R + YS + + ++ + + + F S P+ +
Sbjct: 699 VKLFRVRMQGQEAVLAMSSRSWLSYSYQSRFHLTPLSYETLEYASGFASEQCPEGIVAIS 758
Query: 638 EGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQE-----------QSRTFAICSLKNQS 684
L I ++ + + + + PL PR+ + H E + T A + + Q
Sbjct: 759 TNTLRILALEKLGAVFNQVAFPLQYTPRKFVIHPETNNLILIETDHNAYTEATKAQRKQQ 818
Query: 685 CAEE-------------SEM-------------------------HFVRLLDDQTFEFIS 706
AEE +EM VRL++ +
Sbjct: 819 MAEEMVEAAGEDERELAAEMAAAFLNENLPEAIFGAPKAGSGQWASLVRLINPIQGNTLD 878
Query: 707 TYPLDTFEYGCSILSCSFSDDSNVYYC-VGTAY-VLPEENEPTKGRILVFIVEDG--KLQ 762
L+ E S+ C F + + +Y VG A ++ G I + + G KL+
Sbjct: 879 LVQLEQNEAAFSVAICRFLNGGDDWYVLVGVARDMILNPRSVGGGYIYTYRIVGGGDKLE 938
Query: 763 LIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSEC-GHHGHILALYV 821
+ + + ++ F G++L + + +++Y D G ++L +C H L +
Sbjct: 939 FLHKTPVEDVPLAIAPFQGRVLVGVGKLLRIY-----DLGKKKLLRKCENKHVPNLVTGI 993
Query: 822 QTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNL 881
T G ++V D+ +S+ + Y+ E + A D W++ +LD D A+ N+
Sbjct: 994 HTIGQRVIVSDVQESLFWVRYRRNENQLIIFADDTYPRWITTACLLDYDTMASADKFGNI 1053
Query: 882 FTVR--KNSEGATDE---------ERGRL-------EVVGEYHLGEFVNRFRHGSLVMRL 923
VR N+ DE +RG L E++ YH+GE V + +L+
Sbjct: 1054 CVVRLPPNTSDDVDEDPTGNKALWDRGLLNGASQKAEIIINYHIGETVLSLQKTTLI--- 1110
Query: 924 PDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSF 982
G ++++ T++G IG++ HE + F + L+ ++R + G +H +RS+
Sbjct: 1111 ----PGGSESLVYTTLSGGIGILVPFTSHEDHDFFQHLEMHMRSEFPPLCGRDHLSFRSY 1166
Query: 983 NNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
KN +DGDL E F + + +S+ ++ + E+ K++E++
Sbjct: 1167 -----YFPVKNVIDGDLCEQFNSMDPHKQKSVSEELDRTPPEVSKKLEDI 1211
>gi|19527174|ref|NP_598714.1| splicing factor 3B subunit 3 [Mus musculus]
gi|297207121|ref|NP_001099657.2| splicing factor 3B subunit 3 [Rattus norvegicus]
gi|354477789|ref|XP_003501101.1| PREDICTED: splicing factor 3B subunit 3 [Cricetulus griseus]
gi|81879817|sp|Q921M3.1|SF3B3_MOUSE RecName: Full=Splicing factor 3B subunit 3; AltName:
Full=Pre-mRNA-splicing factor SF3b 130 kDa subunit;
Short=SF3b130; AltName: Full=Spliceosome-associated
protein 130; Short=SAP 130
gi|15030278|gb|AAH11412.1| Splicing factor 3b, subunit 3 [Mus musculus]
gi|26353236|dbj|BAC40248.1| unnamed protein product [Mus musculus]
gi|27503728|gb|AAH42580.1| Splicing factor 3b, subunit 3 [Mus musculus]
gi|148679525|gb|EDL11472.1| splicing factor 3b, subunit 3 [Mus musculus]
gi|187951307|gb|AAI39016.1| Splicing factor 3b, subunit 3 [Mus musculus]
gi|187954163|gb|AAI39017.1| Splicing factor 3b, subunit 3 [Mus musculus]
gi|344248014|gb|EGW04118.1| Splicing factor 3B subunit 3 [Cricetulus griseus]
Length = 1217
Score = 214 bits (545), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 272/1193 (22%), Positives = 496/1193 (41%), Gaps = 195/1193 (16%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+L V ++G I +L FR G +D++ + ++ + +L++ S + + + + G
Sbjct: 53 LLTVEVFGVIRSLMAFRLTGGTKDYIVVGSDSGRIVILEYQP-SKNMFEKIHQETFGKSG 111
Query: 61 -RPTDNGQIGIIDPDCRLIGLHLYDG--LFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
R GQ +DP R + + + L ++ D +L + + + L +
Sbjct: 112 CRRIVPGQFLAVDPKGRAVMISAIEKQKLVYILNRDAAARLTISSPLEAHKANTLVYHVV 171
Query: 118 ---YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNG 163
G P L D ++A + T E A + F E N+ L+
Sbjct: 172 GVDVGFENPMFACLEMDYEEADNDPTGEAAANTQQTLTFYELDLGLNHVVRKYSEPLEEH 231
Query: 164 ADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR--- 215
+ LI VP P GVLI E I Y N IR I + +D D R
Sbjct: 232 GNFLITVPGGSDGP--SGVLICSENYITY--KNFGDQPDIRCPIPRRRNDLD-DPERGMI 286
Query: 216 --------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAV 261
+L G + + + +++ VT ++++ +A+ + L
Sbjct: 287 FVCSATHKTKSMFFFLAQTEQGDIFKITLETDEDMVTEIRLKYFDTVPVAAAMCVLKTGF 346
Query: 262 VYIGSSYGDSQLIKLN------------------------LQPDAKGSYVEVLERYVNLG 297
+++ S +G+ L ++ QP + V +++ +L
Sbjct: 347 LFVASEFGNHYLYQIAHLGDDDEEPEFSSAMPLEEGDTFFFQPRPLKNLV-LVDELDSLS 405
Query: 298 PIVDFC-VVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRS 355
PI+ FC + DL + Q+ G SLR++R+G+ ++E A EL G +W++R
Sbjct: 406 PIL-FCQIADLANEDTPQLYVACGRGPRSSLRVLRHGLEVSEMAVSELPGNPNAVWTVRR 464
Query: 356 STDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSG 415
+D FD +++VSF++ T +L++ + +EE GF T TL C + LVQV
Sbjct: 465 HIEDEFDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSLLGDDALVQVYPD 522
Query: 416 SVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKH 473
+R + + R NEWK+P ++ N QV++A GG LVY E+ G L E +
Sbjct: 523 GIRHIRADKR--VNEWKTPGKKTIVKCAVNQRQVVIALTGGELVYFEMDPSGQLNEYTER 580
Query: 474 AQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIP-RS 532
++ ++ C+ + + S+ AVG+ D +VRI SL + + + L + +P +
Sbjct: 581 KEMSADVVCMSLANVPPGEQRSRFLAVGL-VDNTVRIISLDPSDCL--QPLSMQALPAQP 637
Query: 533 VLLCAFE----------------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQ 576
LC E G YL L +G LL +L+ TG+L+D + LG++
Sbjct: 638 ESLCIVEMGGTEKQDELGERGSIGFLYLNIGLQNGVLLRTVLDPVTGDLSDTRTRYLGSR 697
Query: 577 PITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIA 636
P+ L + V A S R + YS + + ++ + + F S P+ +
Sbjct: 698 PVKLFRVRMQGQEAVLAMSSRSWLSYSYQSRFHLTPLSYETLEFASGFASEQCPEGIVAI 757
Query: 637 KEGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQEQSR-----------TFAICSLKNQ 683
L I ++ + + + + PL PR+ + H E + T A + + Q
Sbjct: 758 STNTLRILALEKLGAVFNQVAFPLQYTPRKFVIHPESNNLIIIETDHNAYTEATKAQRKQ 817
Query: 684 SCAEE-------------SEM-------------------------HFVRLLDDQTFEFI 705
AEE +EM +R+++ +
Sbjct: 818 QMAEEMVEAAGEDERELAAEMAAAFLNENLPESIFGAPKAGNGQWASVIRVMNPIQGNTL 877
Query: 706 STYPLDTFEYGCSILSCSFSDDSNVYYC-VGTAYVLPEENEPTKGRILVF--IVEDG-KL 761
L+ E S+ C FS+ +Y VG A L G + +V +G KL
Sbjct: 878 DLVQLEQNEAAFSVAVCRFSNTGEDWYVLVGVAKDLILSPRSVAGGFVYTYKLVNNGEKL 937
Query: 762 QLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY- 820
+ + + + ++ F G++L + + +++Y D G ++L +C + HI A Y
Sbjct: 938 EFLHKTPVEEVPAAIAPFQGRVLIGVGKLLRVY-----DLGKKKLLRKC-ENKHI-ANYI 990
Query: 821 --VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENN 878
+QT G ++V D+ +S + YK E + A D W++ +LD D GA+
Sbjct: 991 SGIQTIGHRVIVSDVQESFIWVRYKRNENQLIIFADDTYPRWVTTASLLDYDTVAGADKF 1050
Query: 879 FNLFTVR--KNSEGATDE---------ERGRL-------EVVGEYHLGEFVNRFRHGSLV 920
N+ VR N+ DE +RG L EV+ YH+GE V + +L+
Sbjct: 1051 GNICVVRLPPNTNDEVDEDPTGNKALWDRGLLNGASQKAEVIMNYHVGETVLSLQKTTLI 1110
Query: 921 MRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQW 979
G ++++ T++G IG++ HE + F + ++ +LR + G +H +
Sbjct: 1111 -------PGGSESLVYTTLSGGIGILVPFTSHEDHDFFQHVEMHLRSEHPPLCGRDHLSF 1163
Query: 980 RSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
RS+ KN +DGDL E F + + +S+ ++ + E+ K++E++
Sbjct: 1164 RSY-----YFPVKNVIDGDLCEQFNSMEPNKQKNVSEELDRTPPEVSKKLEDI 1211
>gi|158256968|dbj|BAF84457.1| unnamed protein product [Homo sapiens]
Length = 1217
Score = 214 bits (545), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 272/1193 (22%), Positives = 496/1193 (41%), Gaps = 195/1193 (16%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+L V ++G I +L FR G +D++ + ++ + +L++ S + + + + G
Sbjct: 53 LLTVEVFGVIRSLMAFRLTGGTKDYIVVGSDSGRIVILEYQP-SKNMFEKIHQETFGKSG 111
Query: 61 -RPTDNGQIGIIDPDCRLIGLHLYD--GLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
R GQ +DP R + + + L ++ D +L + + + L +
Sbjct: 112 CRRIVPGQFLAVDPKGRAVMISAIEEQKLVYILNRDAAARLTISSPLEAHKANTLVYHVV 171
Query: 118 ---YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNG 163
G P L D ++A + T E A + F E N+ L+
Sbjct: 172 GVDVGFENPMFACLEMDYEEADNDPTGEAAANTQQTLTFYELDLGLNHVVRKYSEPLEEH 231
Query: 164 ADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR--- 215
+ LI VP P GVLI E I Y N IR I + +D D R
Sbjct: 232 GNFLITVPGGSDGP--SGVLICSENYITY--KNFGDQPDIRCPIPRRRNDLD-DPERGMI 286
Query: 216 --------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAV 261
+L G + + + +++ VT ++++ +A+ + L
Sbjct: 287 FVCSATHKTKSMFFFLAQTEQGDIFKITLETDEDMVTEIRLKYFDTVPVAAAMCVLKTGF 346
Query: 262 VYIGSSYGDSQLIKLN------------------------LQPDAKGSYVEVLERYVNLG 297
+++ S +G+ L ++ QP + V +++ +L
Sbjct: 347 LFVASEFGNHYLYQIAHLGDDDEEPEFSSAMPLEEGDTFFFQPRPLKNLV-LVDELDSLS 405
Query: 298 PIVDFC-VVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRS 355
PI+ FC + DL + Q+ G SLR++R+G+ ++E A EL G +W++R
Sbjct: 406 PIL-FCQIADLANEDTPQLYVACGRGPRSSLRVLRHGLEVSEMAVSELPGNPNAVWTVRR 464
Query: 356 STDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSG 415
+D FD +++VSF++ T +L++ + +EE GF T TL C + LVQV
Sbjct: 465 HIEDEFDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSLLGDDALVQVYPD 522
Query: 416 SVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKH 473
+R + + R NEWK+P ++ N QV++A GG LVY E+ G L E +
Sbjct: 523 GIRHIRADKR--VNEWKTPGKKTIVKCAVNQRQVVIALTGGELVYFEMDPSGQLNEYTER 580
Query: 474 AQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIP-RS 532
++ ++ C+ + + S+ AVG+ D +VRI SL + + + L + +P +
Sbjct: 581 KEMSADVVCMSLANVPPGEQRSRFLAVGL-VDNTVRIISLDPSDCL--QPLSMQALPAQP 637
Query: 533 VLLCAFE----------------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQ 576
LC E G YL L +G LL +L+ TG+L+D + LG++
Sbjct: 638 ESLCIVEMGGTEKQDELGERGSIGFLYLNIGLQNGVLLRTVLDPVTGDLSDTRTRYLGSR 697
Query: 577 PITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIA 636
P+ L + V A S R + YS + + ++ + + F S P+ +
Sbjct: 698 PVKLFRVRMQGQEAVLAMSSRSWLSYSYQSRFHLTPLSYETLEFASGFASEQCPEGIVAI 757
Query: 637 KEGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQEQSR-----------TFAICSLKNQ 683
L I ++ + + + + PL PR+ + H E + T A + + Q
Sbjct: 758 STNTLRILALEKLGAVFNQVAFPLQYTPRKFVIHPESNNLIIIETDHNAYTEATKAQRKQ 817
Query: 684 SCAEE-------------SEM-------------------------HFVRLLDDQTFEFI 705
AEE +EM +R+++ +
Sbjct: 818 QMAEEMVEAAGEDERELAAEMAAAFLNENLPESIFGAPKAGNGQWASVIRVMNPIQGNTL 877
Query: 706 STYPLDTFEYGCSILSCSFSDDSNVYYC-VGTAYVLPEENEPTKGRILVF--IVEDG-KL 761
L+ E S+ C FS+ +Y VG A L G + +V +G KL
Sbjct: 878 DLVQLEQNEAAFSVAVCRFSNTGEDWYVLVGVAKDLILNPRSVAGGFVYTYKLVNNGEKL 937
Query: 762 QLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY- 820
+ + + + ++ F G++L + + +++Y D G ++L +C + HI A Y
Sbjct: 938 EFLHKTPVEEVPAAIAPFQGRVLIGVGKLLRVY-----DLGKKKLLRKC-ENKHI-ANYI 990
Query: 821 --VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENN 878
+QT G ++V D+ +S + YK E + A D W++ +LD D GA+
Sbjct: 991 SGIQTIGHRVIVSDVQESFIWVRYKRNENQLIIFADDTYPRWVTTASLLDYDTVAGADKF 1050
Query: 879 FNLFTVR--KNSEGATDE---------ERGRL-------EVVGEYHLGEFVNRFRHGSLV 920
N+ VR N+ DE +RG L EV+ YH+GE V + +L+
Sbjct: 1051 GNICVVRLPPNTNDEVDEDPTGNKALWDRGLLNGASQKAEVIMNYHVGETVLSLQKTTLI 1110
Query: 921 MRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQW 979
G ++++ T++G IG++ HE + F + ++ +LR + G +H +
Sbjct: 1111 -------PGGSESLVYTTLSGGIGILVPFTSHEDHDFFQHVEMHLRSEHPPLCGRDHLSF 1163
Query: 980 RSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
RS+ KN +DGDL E F + + +S+ ++ + E+ K++E++
Sbjct: 1164 RSY-----YFPVKNVIDGDLCEQFNSMEPNKQKNVSEELDRTPPEVSKKLEDI 1211
>gi|321478392|gb|EFX89349.1| hypothetical protein DAPPUDRAFT_303178 [Daphnia pulex]
Length = 1215
Score = 214 bits (544), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 265/1187 (22%), Positives = 489/1187 (41%), Gaps = 185/1187 (15%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+L ++G I ++ FR G +D+L + ++ + +L+++A + + + + + G
Sbjct: 53 LLSQEVFGIIRSIISFRLTGGNKDYLVVGSDSGRIVILEYNA-AKNVFDKVHQETYGKSG 111
Query: 61 -RPTDNGQIGIIDPDCRLIGLHLYDG--LFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
R GQ IDP R + + + L ++ D + L + + + L +
Sbjct: 112 CRRIVPGQYMAIDPKGRAVMISAIEKQKLVYILNRDAQAHLTISSPLEAHKSNTLVYNIV 171
Query: 118 ---YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNG 163
G P L D ++A T E A K + F E N+ LD
Sbjct: 172 GVDVGFENPVFACLEMDYEEADGDPTGEAAQKTQQTLTFYELDLGLNHVVRKYSEPLDEH 231
Query: 164 ADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR--- 215
A+ LI VP P GVL+ E I Y N + IR I + +D D R
Sbjct: 232 ANFLITVPGGNDGP--SGVLVCSENYITY--KNLGEQHDIRCPIPRRRNDLD-DPERGMI 286
Query: 216 --------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAV 261
+L+ G + + + ++E V LKI+ ++S++ L
Sbjct: 287 FVCSASHRTKAMFFFLVQTEQGDIFKITMDMDEEVVRELKIKYFDTVPVSSSMCVLKTGF 346
Query: 262 VYIGSSYGDSQLIKL------NLQPDAKGSY-----------------VEVLERYVNLGP 298
+++ S +G+ L ++ + +P+ + + +++ +L P
Sbjct: 347 LFVASEFGNHYLYQIAHLGDDDDEPEFSSTIPLEEGDTFFFAPRPLKNLVLVDEMESLSP 406
Query: 299 IVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSST 357
++ + DL + Q+ G SLR++R+G+ ++E A EL G +W++R
Sbjct: 407 VLACHIADLANEDTPQLYALCGRSSRSSLRVLRHGLEVSEMAVSELPGNPNAVWTVRKRA 466
Query: 358 DDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSV 417
DD FD ++VVSF++ T +L++ + +EE GF T TL C N L+QV +
Sbjct: 467 DDEFDAYIVVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLGCSSLGDNALLQVYPEGI 524
Query: 418 RLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD-GILTE-VKHAQ 475
R + + R NEWKSP +++ N QV+++ GG LVY E+ G L E + +
Sbjct: 525 RHIRADRR--INEWKSPGKRAISRCAVNQRQVVISLAGGELVYFEMDPTGQLNEYTERKE 582
Query: 476 LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSV-L 534
+ E+ C+ + + S+ AVG+ D +VR+ SL + +T L + +P +
Sbjct: 583 MTAEVVCMALANVPSGEQRSRFLAVGL-ADNTVRVISLDPNDCLTP--LSMQALPATPES 639
Query: 535 LCAFE--------------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITL 580
LC E G +L L +G LL +L+ TG+L D + LG++P+ L
Sbjct: 640 LCIVEMGGHDKDSETTGTSGQLFLNIGLQNGVLLRTVLDPVTGDLADTRTRYLGSRPVKL 699
Query: 581 RTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGE 640
++N V A S R + Y + + ++ + F S P+ +
Sbjct: 700 FRVMTQNHEAVLAISSRTWLSYYFQNRFHLAPLSYDSLEFASGFASEQCPEGVVAIASNT 759
Query: 641 LTIGTIDDIQKLHIR-SIPLGEHPRRICHQEQSRTFAICS-------------------- 679
L I ++ + + + S PL PR+ + R I +
Sbjct: 760 LRILALEKLGAVFNQVSYPLEYTPRKFVVHDSGRMIIIETDHNAYTLETKLERRQQIAQE 819
Query: 680 LKNQSCAEESEM-----------------------------HFVRLLDDQTFEFISTYPL 710
+++ + EE E+ VR++D + + T L
Sbjct: 820 MRDAATEEEQELAHQMADAFLNEDLPESDFGAPKAGAGMWASIVRIMDPTSGNSLHTIRL 879
Query: 711 DTFEYGCSILSCSF--SDDSNVYYCVGTAYVLPEENEPTKGRILV---FIVEDGKLQLIA 765
E SI F D + + VG A L + G + I E L+ +
Sbjct: 880 PQNEAALSIGLARFMNQDPEDYFVLVGVAKDLKLNPKQCDGGFIYTYKLINEWSALEFVH 939
Query: 766 EKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSEC-GHHGHILALYVQTR 824
+ + Y++ A+ G++L + + ++LY D G ++L +C H + +Q
Sbjct: 940 KTPVEDTPYAICAYQGRVLIGVGRLLRLY-----DMGKKKLLRKCENKHLPSCVVNIQAL 994
Query: 825 GDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTV 884
G + V D+ +SI + YK E + A D + +++ + +LD D A+ N+ +
Sbjct: 995 GQRVYVADVQESIHFVRYKRMENQLIIFADDTHPRYVTTMCLLDYDTVAVADKFGNISVL 1054
Query: 885 RKNSEGATDE-----------ERGRL-------EVVGEYHLGEFVNRFRHGSLVMRLPDS 926
R S + D +RG L EV+ H+GE + +L+
Sbjct: 1055 RLPSRTSDDVDEDPTGNKSFWDRGVLNGASNKAEVLCNIHIGETALSLQRATLI------ 1108
Query: 927 DVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNE 985
G ++++ T++G IGV+ E + F + L+ +LR + G +H +RSF
Sbjct: 1109 -PGGSESLVYTTLSGSIGVLVPFTSREDHDFFQALEMHLRSEHPPLCGRDHLAFRSF--- 1164
Query: 986 KKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
KN +DGDL E F + ++ ++ ++ + E+ K++E++
Sbjct: 1165 --YFPVKNVIDGDLCEQFNSIDSSKQKAVADDLDRAPNEVSKKLEDI 1209
>gi|407919154|gb|EKG12409.1| Cleavage/polyadenylation specificity factor A subunit [Macrophomina
phaseolina MS6]
Length = 1210
Score = 214 bits (544), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 269/1178 (22%), Positives = 478/1178 (40%), Gaps = 181/1178 (15%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
++G I L FR G ++D++ +A++ + +L+++ + + + R
Sbjct: 64 VFGVIRALAAFRLAGSSKDYIIVASDSGRIAILEYNPQKNIFTQLKLETFGKSGVRRVIP 123
Query: 66 GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YGC 120
GQ DP R +I + L V+ D+ L + + Q + G
Sbjct: 124 GQYLATDPKGRACMIASIEKNKLVYVLNRDSNANLTISSPLEAHRPQTFVFSLIGLDVGY 183
Query: 121 AKPTIVVLYQDNKDARHVKTYEVAL-KDKDFVE-----------GPWSQNNLDNGADLLI 168
P L D D T EVA K K+ V WS + +D A++L
Sbjct: 184 ENPIFAALEVDYDDVDQDHTGEVAASKQKELVYYELDLGLNHVVRKWS-DAVDRTANMLF 242
Query: 169 PVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVD-ADGSRYLLGDHA 222
VP P GVL+ GEE+I Y N P R I + G + RY+ A
Sbjct: 243 QVPGGQDGP--SGVLVCGEESITYRHINQ---DPFRVPIPRRKGATENPQRKRYIT---A 294
Query: 223 GLLHLL---------------------VITHEKEKVTG----LKIELLGETSIASTISYL 257
G++H + ++ E+ + TG LKI+ +AS++ L
Sbjct: 295 GVMHKMRGAFFFLLQTEDGDLFKVTIDMVEDEQGQPTGDVSRLKIKYFDTVPVASSLCIL 354
Query: 258 DNAVVYIGSSYGDS---------------QLIKLNLQPDAKGSY------------VEVL 290
+ +++ + G+ + I N P+ Y + +
Sbjct: 355 KSGYLFVTAESGNHHFYQFEKLGDDDEELEFISDNFSPEPDAPYEPVFFHPRGAENISLA 414
Query: 291 ERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KG 349
+ ++ P + V +L Q+ T G + + + +G+ ++E EL + +
Sbjct: 415 QSVPSMNPTLGCKVANLTDDDAPQIYTVCGTGARSTFKTLTHGLEVSEIVESELPSVPEA 474
Query: 350 MWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQL 409
+W+ + T D +D ++++SF + T +L++ + +EE GF S T+TL + L
Sbjct: 475 VWTTKIRTGDEYDAYIILSFSNGTLVLSIG--ETVEEVTDTGFLSSTRTLAVQLLGEDAL 532
Query: 410 VQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGIL 468
+QV +R + + R NEW +P ++ AT N QV +A G +VY E+ DG L
Sbjct: 533 IQVHPKGIRHIQAGGR--INEWPAPQHRTIVAATTNERQVAVALSSGEIVYFEMDVDGSL 590
Query: 469 TEV-KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGG 526
E + ++ ++CL + + E S AVG D +VRI SL PD L K
Sbjct: 591 AEYDEKKEMSGTVTCLSLGEVPEGRVRSSFLAVGC-DDSTVRILSLDPDSTLENKSVQAL 649
Query: 527 EIIPRSVLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITL 580
P ++ + A S YL L G L +++ TGELTD + LG + + L
Sbjct: 650 TSAPSALSIMAMADSSSGGSTLYLHIGLYSGVYLRTVIDEITGELTDTRTRFLGAKGVKL 709
Query: 581 RTFSSKNTTHVFAASDRPTVIYS--SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKE 638
+ ++ + + A S R + YS K + + + FNS + +
Sbjct: 710 FRVTVQDQSAILATSSRSWLGYSDQQTKAFTLTPLAYAPLEWGWSFNSEQCTQGMVGIQG 769
Query: 639 GELTIGTIDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFV--- 694
L I TI+++ L +IPL PR + F + N A + +
Sbjct: 770 QHLRIFTIENLSNNLLFENIPLMYTPRNLVRHPTEPLFYVIESDNGVLASSTRQQLLNDP 829
Query: 695 -------RLLDDQTF----------------------EFISTYPLDTFEYGCSILSCSF- 724
++L + F E + T L+ E S+ F
Sbjct: 830 AAVNGDAKILPPEEFGYPRAEGHWASCIQVVDPISSKEVVHTLELEENESAVSVCLAPFT 889
Query: 725 SDDSNVYYCVGTA---YVLPEENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVYSLNAF 779
S + + VGTA V P G + ++ + E+G+ L+ I + + + +L F
Sbjct: 890 SQNDETFLVVGTAKDLVVAPRSY--NCGYVHIYRLQENGRELEFIHKTKMEAPPMALLPF 947
Query: 780 NGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHI--LALYVQTRGDFIVVGDLMKSI 837
GKLL + ++LY D G R+L + + + + +QT+G IV D+ +S+
Sbjct: 948 QGKLLVGVEADLRLY-----DLGLRQLLRKAQALNVVPNILIGLQTQGSRIVCSDVQESV 1002
Query: 838 SLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDE 894
+ ++YKH E + + D W S ++D + G + N++ VR K SE A +E
Sbjct: 1003 TYVVYKHLENRLIQFCDDSIHRWTSCTAMVDYETTAGGDKFGNIWLVRCPPKASEEADEE 1062
Query: 895 ERG---------------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTV 939
G RL+++ ++ + + +LV G +++ +
Sbjct: 1063 GSGLHLINERPYLQGTPNRLDLLAHFYTQDIPTSIQKTALV-------AGGRELLLWSGL 1115
Query: 940 NGVIGV-IASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGD 998
G +G+ I + E F + L+ LR + G +H +RS+ V K +DGD
Sbjct: 1116 QGTLGIFIPFVSREDVDFFQSLEQQLRTEDPPIAGRDHLAYRSY-----YVPVKGVIDGD 1170
Query: 999 LIESFLDLSRTRMDEISKTMNVSVEELCKRV-EELTRL 1035
L E FL L R + + I+ ++ SV E+ +++ ++ TR
Sbjct: 1171 LCERFLRLPRDKKETIAAELDRSVREVERKIGDQRTRF 1208
>gi|323447810|gb|EGB03719.1| hypothetical protein AURANDRAFT_72671 [Aureococcus anophagefferens]
Length = 760
Score = 214 bits (544), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 204/761 (26%), Positives = 349/761 (45%), Gaps = 79/761 (10%)
Query: 302 FCVVD-LERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP 360
CVV+ + +GQG++V C GA G+L IV +G+ + AS I + + SS +
Sbjct: 1 MCVVEPADAEGQGRIVACCGAGDAGTLHIVHDGLDVTGTASRSPTRITNLSLILSSRGE- 59
Query: 361 FDTFLVVSFISETR--ILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSV- 417
++VS + T ++A + + +GF T++C + + + VQVT ++
Sbjct: 60 ---MVIVSTEAATHAFLVANDGSTTAFPSNPQGFDETISTIYCCN-LGDAAVQVTPRAIS 115
Query: 418 RLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLE 477
+ + +R W+ G N + +L++ G LV L D + + A L+
Sbjct: 116 KFYTGENRRKLKTWRPKDGRITAATGNNGAVILISLSNGKLVLL---DCDVEPIASAALD 172
Query: 478 YEISCLDINPIG----ENPSYSQIAAVGMWTDISVRIFSLPD--LNLITKEHLGGE---- 527
E SC+ ++P G ++ + AV +W+D V S+ D LN I + +
Sbjct: 173 DEASCVALSPSGSASFKDLGMAHATAVSVWSD-PVSCVSVYDSILNKIHDPTITWQDKSS 231
Query: 528 -----IIPRSVLLCAFEGISYLLCALGDG--HLLNFLLNMKTGELTDRKKVSLGTQPITL 580
R+++L YL CA GDG + F T + VSLG P +
Sbjct: 232 NEQTLQFVRALVLSEINEKPYLFCASGDGLVTVYQFACGNDVVTTTRLRAVSLGVLPARV 291
Query: 581 RTFSS--KNTTHVFAASDRPTVI----YSSNKKLLYSNVNLKEVSH-----MCPFNSAAF 629
+ + V SDR V+ S + ++ S E + + P + A
Sbjct: 292 LILGAFDRMARSVLCFSDRACVVRRGCASHLQWVVASAKAFGETTEWRYYIVVPLHKKAT 351
Query: 630 PDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEES 689
LA + I ID Q + + S+PL E P ICH QS FA+C++ ++ +
Sbjct: 352 TAILACGSQ--FAICKIDANQGIKVVSVPLAEQPLCICHDLQSHLFAVCTIDHREGDNQG 409
Query: 690 EMHFVRLLDDQTFEFISTYPLDTFE--YGCSILS---CSFSDDSNVYYCVGTAYVLPEEN 744
+ F+R D+ + + L+ E CSI+S S D ++ VGTA+ E +
Sbjct: 410 VIRFLR--DEAPYNDVHREALEPLEIPLCCSIISLDSISTYKDQRAHFVVGTAFAAQEND 467
Query: 745 -EPTKGRILVFIVEDGKL--QLIAEKETKGAVYSLNAFNGKLL-AAINQKIQLYKWMLRD 800
EP GR+++F + ++ E GAVY + A LL A+N I +Y ++RD
Sbjct: 468 FEPCSGRMIIFRSGQANVAPSVLFFVEANGAVYDVAAMRASLLVCAVNHAIHIYDPVVRD 527
Query: 801 DGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANW 860
+ L+ + G ++AL VQ G+ IVVGD+M+S++LL ++ I E A DYN NW
Sbjct: 528 NRRGHLKPRASYDGLVVALKVQCYGNLIVVGDMMRSVTLLNLIRQKMIIVEVACDYNTNW 587
Query: 861 MSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLV 920
+ A+E++ D ++ A+ + +L + ++ G +D+ LE + HLG+ V F GS++
Sbjct: 588 VCALEVIGDGSFIIADASGSLVAL-ESLYGNSDKGY-FLESRAKMHLGDVVTCFARGSIM 645
Query: 921 MR--LPDSDVGQIPT-VIFGT-----------------VNGVIGVIASLPHEQYLFLEKL 960
+ + V ++ T +IFG V+G IG I S+ + LE+L
Sbjct: 646 TQDDWRNPKVSKVATPLIFGCVTSSRVLVLTPLIARYQVSGAIGCIVSIDDVTHSLLERL 705
Query: 961 QTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAK---NFLDGD 998
L + GVG +HE +++ +N T +A +F+D +
Sbjct: 706 SKVLLEFHSGVGDFDHETFQALHNNVATCNAAPMDDFIDAN 746
>gi|350630003|gb|EHA18376.1| hypothetical protein ASPNIDRAFT_38018 [Aspergillus niger ATCC 1015]
Length = 1219
Score = 213 bits (543), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 281/1172 (23%), Positives = 486/1172 (41%), Gaps = 169/1172 (14%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
++G I +L FR G ++D++ I ++ + ++++ S R + + G R
Sbjct: 64 VFGIIRSLAAFRLAGSSKDYIIIGSDSGRITIIEY-VPSQNRFNRIHLETFGKSGVRRVI 122
Query: 65 NGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YG 119
GQ +DP R +I + L V+ +++ +L + + + Q L G
Sbjct: 123 PGQYLAVDPKGRACIIASVEKNKLVYVLNRNSQAELTISSPLEAHKPQTLVFAMTALDVG 182
Query: 120 CAKPTIVVLYQDNKDA-------------RHVKTYEVALKDKDFVEGPWSQNNLDNGADL 166
P L D ++ +H+ YE+ L V WS + +D A L
Sbjct: 183 YENPVFAALEVDYSESDQDPTGQAFDELEKHLVYYELDLGLNHVVRK-WS-DPVDRTASL 240
Query: 167 LIPVPPPL---CGVLIIGEETIVYCSAN--AFKA-IPIRP----------SITKAYGRVD 210
L VP GVL+ E+++ Y +N AF+ IP R SIT
Sbjct: 241 LFQVPGGADGPSGVLVCAEDSVTYRHSNQDAFRVPIPRRSGATENPERKRSITAGVMHKM 300
Query: 211 ADGSRYLLGDHAGLLHLLVITHEKE---KVTG----LKIELLGETSIASTISYLDNAVVY 263
+LL G L L I ++ ++TG LKI+ +AS++ L + +Y
Sbjct: 301 RGAFFFLLQTEDGDLFKLTIDMVEDDNGQLTGEVKRLKIKYFDTVPLASSLLILKSGFLY 360
Query: 264 IGSSYGDSQLIKLNLQPD--------------------------AKGS-YVEVLERYVNL 296
+ S G+ + D +G+ + ++E +L
Sbjct: 361 VASEAGNHHFYQFEKLGDDDEEIEFTSESFSADPSVPCDPIYFRPRGAENLNLVETINSL 420
Query: 297 GPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSLRS 355
P++D VV++ Q+ T SGA S R +++G+ ++E EL + +W+ +
Sbjct: 421 NPLIDSKVVNITEDDAPQIYTISGAGARSSFRTLKHGLEVSEIVESELPSVPSAVWTTKL 480
Query: 356 STDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSG 415
+ D FD ++++SF + T +L++ + +EE GF S TL + L+Q+
Sbjct: 481 TRSDEFDAYIILSFANGTLVLSIG--ETVEEVTDTGFLSTAPTLAVQQLGEDSLIQIHPR 538
Query: 416 SVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTEV-KH 473
+R + + R NEW +P S+ A N QV +A G +VY E+ DG L E +
Sbjct: 539 GIRHILADRR--VNEWPAPQHRSIVAAATNERQVAVALSSGEIVYFEMDSDGSLAEYDER 596
Query: 474 AQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPRS 532
Q+ ++CL + + E S AVG D +VRI SL PD L K P +
Sbjct: 597 RQMSGTVTCLSLGEVPEGRVRSSFLAVGC-DDSTVRILSLDPDSTLENKSVQALTAAPSA 655
Query: 533 VLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSK 586
+ + + S YL L G L +L+ TGEL+D + LG++P+ L S K
Sbjct: 656 LNIMSMADSSSGGTTLYLHIGLYSGVYLRTVLDEVTGELSDTRTRFLGSKPVKLFQVSVK 715
Query: 587 NTTHVFAASDRPTVIYS--SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIG 644
T V S RP + YS K + + ++ + F+S + + + L I
Sbjct: 716 GQTAVLGLSSRPWLGYSDVQTKGFMLTPLDYVGLEWGWNFSSEQCVEGMVGIQGQNLRIF 775
Query: 645 TIDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVR-------- 695
+I+ + + +SIPL PRR + F + N A + +
Sbjct: 776 SIEKLDNNMLQQSIPLSYTPRRFLKHPEQPLFYVIESDNNVLAPSTRAKLLEDSKSRGGD 835
Query: 696 --LLDDQTFEF----------------------ISTYPLDTFEYGCSILSCSF-SDDSNV 730
+L + F + + T L+ E SI + F S D
Sbjct: 836 ETVLPPEDFGYPRATGHWASCIQVVDPLDAKAVVHTIELEENEAAISIAAVPFTSQDDET 895
Query: 731 YYCVGTAYVLPEENEP--TKGRILVF-IVEDGK-LQLIAEKETKGAVYSLNAFNGKLLAA 786
+ VGTA + N P G I ++ EDG+ L+ I + + + +L F G+L+A
Sbjct: 896 FLVVGTAKDM-TVNPPGSAGGYIHIYRFQEDGRELEFIHKTKVEEPPLALLGFQGRLVAG 954
Query: 787 INQKIQLYKWMLRDDGTRELQSECGH---HGHILALYVQTRGDFIVVGDLMKSISLLIYK 843
I +++Y D G ++L +C I+ L QT+G IVV D+ +S++ ++YK
Sbjct: 955 IGSLLRIY-----DLGMKQLLRKCQAPVVPKTIVGL--QTQGSRIVVSDVRESVTYVVYK 1007
Query: 844 HEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDE------ 894
++E + D + W +A ++D + G + NL+ +R K SE A ++
Sbjct: 1008 YQENVLIPFVDDSVSRWTTATTMVDYETTAGGDKFGNLWLLRCPKKTSEEADEDGSGAHL 1067
Query: 895 --ERG-------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGV 945
ERG RLE++ + + LV G +++ G IG+
Sbjct: 1068 IHERGYLQGTPNRLELMIHVYTQDIPTSLHKTQLV-------AGGRDILVWTGFQGTIGM 1120
Query: 946 IAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFL 1004
+ + E F + L+ L + G +H +RS+ K V +DGDL E +
Sbjct: 1121 LVPFIGREDVDFFQNLEMQLAAQHPPLAGRDHLIYRSYYAPVKGV-----IDGDLCEMYF 1175
Query: 1005 DLSRTRMDEISKTMNVSVEELCKRVEELTRLH 1036
L I+ ++ SV E+ +++ + RLH
Sbjct: 1176 LLPNDTKMMIAAELDRSVREIERKISCIYRLH 1207
>gi|358391805|gb|EHK41209.1| hypothetical protein TRIATDRAFT_135379 [Trichoderma atroviride IMI
206040]
Length = 1212
Score = 213 bits (543), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 285/1177 (24%), Positives = 491/1177 (41%), Gaps = 174/1177 (14%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW---DAESSELITRAMGDVSD 57
+L I+G I +L FR G +D+L +AT+ + ++++ + S L G
Sbjct: 59 LLSHDIFGIIRSLAAFRLAGSNKDYLILATDSGRITIIEYLPRENRFSRLHLETFGKSGV 118
Query: 58 RIGRPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIK 115
R P G+ DP R L+ + L V+ +++ +L + + + VL I
Sbjct: 119 RRVIP---GEYLACDPKGRACLVASIEKNKLVYVLNRNSQAELTISSPLEAHKPGVLVIS 175
Query: 116 FL---YGCAKPTIVVLYQD----NKDA-----RHVKT----YEVALKDKDFVEGPWSQNN 159
+ G A P L D ++D+ R ++T YE+ L V WS+
Sbjct: 176 MVALDVGYANPVFAALEIDYTEVDQDSTGEALRELETQLVYYELDLGLNHVVRK-WSEP- 233
Query: 160 LDNGADLLIPVP-----PPLCGVLIIGEETIVYCSAN--AFKA-IPIRPSITK--AYGRV 209
+D+ A LL VP P GVL+ GEE I Y +N AF+ IP R T+ + R
Sbjct: 234 VDSTASLLFQVPGGNDGP--SGVLVCGEENITYRHSNQEAFRVPIPRRRGATEDPSRKRT 291
Query: 210 DADGSRYLLGDHAGLLHLLVITH--------------EKEKVTG----LKIELLGETSIA 251
G + L AG L+ T E+ TG LKI+ A
Sbjct: 292 IVSGVMHKLKGSAGAFFFLLQTEDGDMFKVTIDMVEDEEGNTTGEVRRLKIKYFDTVPAA 351
Query: 252 STISYLDNAVVYIGSSYGD---SQLIKL------------NLQPDAKGSYVEV------- 289
S + L + +Y+ S +G+ Q KL + DA+ SY V
Sbjct: 352 SNLCILKSGFLYVASQFGNFSFYQFEKLGDDDEELEFTSDDFPVDAQASYTPVYFYPRPL 411
Query: 290 -----LERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVEL 344
+E ++ P++D + +L + Q+ T G + R +++G+ INE S EL
Sbjct: 412 ENLVLVESIPSMNPLLDCKIANLTGEDAPQIYTVCGNGARSTFRTLKHGLEINEIVSSEL 471
Query: 345 QGI-KGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHD 403
GI +W+L+ + + +D ++V+SF + T +L++ + +EE GF + TL
Sbjct: 472 PGIPSAVWTLKLNRSEQYDAYIVLSFTNGTLVLSIG--ETVEEVSDSGFLTSVPTLAAQL 529
Query: 404 AIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI 463
+ L+QV +R + + N+W +P S+ A+ NA QV +A G +VY E+
Sbjct: 530 LGDDGLIQVHPKGIRHIRNGQV---NQWDAPQHRSIVAASTNAHQVAIALSSGEIVYFEM 586
Query: 464 -GDGILTEVKHAQLEY-EISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLIT 520
DG L E + + ++CL + + E S AVG D +VRI SL P+ L
Sbjct: 587 DDDGSLAEYDDKKEMFGTVTCLSLGEVPEGRLRSSFLAVGC-DDCTVRILSLDPESTLEN 645
Query: 521 KEHLGGEIIPRSVLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLG 574
K P S+ + + E S YL L G L +L+ TGELTD ++ LG
Sbjct: 646 KSVQALTAAPTSLAIISMEDSSSGGSTLYLHIGLYSGVYLRTVLDEVTGELTDTRQKFLG 705
Query: 575 TQPITLRTFSSKNTTHVFAASDRPTVIYSS--NKKLLYSNVNLKEVSHMCPFNSAAFPDS 632
+ + L + + T V S RP + Y+ K + + ++ ++ F S +
Sbjct: 706 PKQVRLFQVTVQGRTCVLGLSSRPWLGYADPITKSFVVTPLSYVDLEWGWNFTSEQCEEG 765
Query: 633 LAIAKEGELTIGTIDDIQKLHIR-SIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEM 691
+ + L I ++D + I+ SIPL P+++ + F + N + + +
Sbjct: 766 IVGIQGQTLRIFSVDRLGDTLIQNSIPLTYTPKKMVKHPEHPLFYVIEADNHTLSPDLCA 825
Query: 692 HFV----------RLLDDQTF-----------------------EFISTYPLDTFEYGCS 718
+ ++L + F + + LD E S
Sbjct: 826 KLLADPARVNGDAKVLSPEEFGHPRGNRRWASCISVVDPLAEDGQALQKIDLDENEAAVS 885
Query: 719 ILSCSF-SDDSNVYYCVGTA---YVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVY 774
+ +F S D+ + VGT V P I F E L I + + +
Sbjct: 886 LAIVTFASQDNETFLVVGTGKDMVVNPRSFSDAYVHIYRFEQEGRGLVFIHKTKVEEPPM 945
Query: 775 SLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLM 834
++ F G++L I + +++Y +R R+ Q+E L + T+G+ I+VGD+
Sbjct: 946 AIIPFQGRVLVGIGKILRIYDLGMR-QLLRKTQAEVAPQ---LINSLSTQGNRIIVGDVQ 1001
Query: 835 KSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGA 891
+ I+ ++YK + D A W + ++D + G + N+F VR K SE A
Sbjct: 1002 QGITYVVYKQTTNKLIPFVDDTVARWTTCSTMVDYETVAGGDKFGNIFVVRSPQKASEEA 1061
Query: 892 TDEERG---------------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIF 936
+E+ G RL+++ + + SLV VG +++
Sbjct: 1062 DEEQAGLHLLNARDYLHGRSHRLDLMCHLYTQDIPTSITKTSLV-------VGGQDVLLW 1114
Query: 937 GTVNGVIGV-IASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFL 995
+ G IGV I + E F + L+ +LR + G +H +RS+ K V +
Sbjct: 1115 SGLMGTIGVLIPFVTREDTDFFQSLELHLRAEDPPLAGRDHLMYRSYYAPVKGV-----I 1169
Query: 996 DGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
DGDL E + L + I+ ++ SV E+ +++ ++
Sbjct: 1170 DGDLCERYTLLPNDKKQMIAAELDRSVREIERKISDI 1206
>gi|47087273|ref|NP_998668.1| splicing factor 3B subunit 3 [Danio rerio]
gi|28502779|gb|AAH47171.1| Splicing factor 3b, subunit 3 [Danio rerio]
Length = 1217
Score = 213 bits (543), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 264/1190 (22%), Positives = 491/1190 (41%), Gaps = 189/1190 (15%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+L + ++G + +L FR G +D++ + ++ + +L++ S + + + + G
Sbjct: 53 LLTMEVFGVVRSLMAFRLTGGTKDYVVVGSDSGRIVILEYHP-SKNMFEKIHQETFGKSG 111
Query: 61 -RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
R GQ +DP R ++G L ++ D +L + + + L +
Sbjct: 112 CRRIVPGQFLAVDPKGRAVMMGATEKQKLVYILNRDAAARLTISSPLEAHKANTLVYHVV 171
Query: 118 ---YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNG 163
G P L D ++A + T E A + F E N+ L+
Sbjct: 172 GVDVGFENPMFACLEMDYEEADNDPTGEAAANTQQTLTFYELDLGLNHVVRKYSEALEEH 231
Query: 164 ADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR--- 215
+ LI VP P GVLI E I Y N IR I + +D D R
Sbjct: 232 GNFLITVPGGSDGP--SGVLICSENYITY--KNFGDQPDIRCPIPRRRNDLD-DPERGMI 286
Query: 216 --------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAV 261
+L G + + + ++E VT ++++ +A+ + L
Sbjct: 287 FVCSATHKTKSMFFFLAQTEQGDIFKVTLETDEEMVTEIRMKYFDTIPVATAMCVLKTGF 346
Query: 262 VYIGSSYGDSQLIKLN------------------------LQPDAKGSYVEVLERYVNLG 297
+++ S +G+ L ++ QP + V V E+ +L
Sbjct: 347 LFVSSEFGNHYLYQIAHLGDDDEEPEFSSAMPLEEGDTFFFQPRPLKNLVLVDEQE-SLS 405
Query: 298 PIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSS 356
PI+ + DL + Q+ G +LR++R+G+ ++E A EL G +W++R
Sbjct: 406 PIMSCQIADLANEDTPQLYVACGRGPGSTLRVLRHGLEVSEMAVSELPGNPNAVWTVRRH 465
Query: 357 TDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGS 416
+D FD +++VSF++ T +L++ + +EE GF T TL C + LVQV
Sbjct: 466 VEDEFDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSLLGEDALVQVYPDG 523
Query: 417 VRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKHA 474
+R + + R NEWK+P ++ N QV++A GG LVY E+ G L E +
Sbjct: 524 IRHIRADKR--VNEWKTPGKKTIIRCAVNQRQVVIALTGGELVYFEMDPSGQLNEYTERK 581
Query: 475 QLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPD---LNLITKEHLGGEIIPR 531
++ ++ C+ + + S+ AVG+ D +VRI SL L ++ + L + P
Sbjct: 582 EMSADVVCMSLANVPPGEQRSRFLAVGL-VDNTVRIISLDPSDCLQPLSMQALPAQ--PE 638
Query: 532 SVLLCAFEGIS--------------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQP 577
S+ + G+ YL L +G LL +L+ TG+L+D + LG++P
Sbjct: 639 SLCIVEMGGVEKQDELGEKGTICFLYLNIGLQNGVLLRTVLDPVTGDLSDTRTRYLGSRP 698
Query: 578 ITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAK 637
+ L + V A S R + YS + + ++ + + + F S P+ +
Sbjct: 699 VKLFRVRMQGQEAVLAMSSRSWLSYSYQSRFHLTPLSYETLEYASGFASEQCPEGIVAIS 758
Query: 638 EGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQE-----------QSRTFAICSLKNQS 684
L I ++ + + + + PL PR+ + H E + T A + + Q
Sbjct: 759 TNTLRILALEKLGAVFNQVAFPLQYTPRKFVIHPETNNLILIETDHNAYTEATKAQRKQQ 818
Query: 685 CAEE-------------SEM-------------------------HFVRLLDDQTFEFIS 706
AEE +EM VRL++ +
Sbjct: 819 MAEEMVEAAGEDERELAAEMAAAFLNENLPEAIFGAPKAGSGQWASLVRLINPIQGNTLD 878
Query: 707 TYPLDTFEYGCSILSCSFSDDSNVYYC-VGTAY-VLPEENEPTKGRILVFIVEDG--KLQ 762
L+ E S+ C F + + +Y VG A ++ G I + + G KL+
Sbjct: 879 LVQLEQNEAAFSVAICRFLNGGDDWYVLVGVARDMILNPRSVGGGYIYTYRIVGGGDKLE 938
Query: 763 LIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSEC-GHHGHILALYV 821
+ + + ++ F G++L + + +++Y D G ++L +C H L +
Sbjct: 939 FLHKTPVEDVPLAIAPFQGRVLVGVGKLLRIY-----DLGKKKLLRKCENKHVPNLVTGI 993
Query: 822 QTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNL 881
T G ++V D+ +S+ + Y+ E + A D W++ +LD D A+ N+
Sbjct: 994 HTIGQRVIVSDVQESLFWVRYRRNENQLIIFADDTYPRWITTACLLDYDTMASADKFGNI 1053
Query: 882 FTVR--KNSEGATDE---------ERGRL-------EVVGEYHLGEFVNRFRHGSLVMRL 923
VR N+ DE +RG L E++ YH+GE V + +L+
Sbjct: 1054 CVVRLPPNTSDDVDEDPTGNKALWDRGLLNGASQKAEIIINYHIGETVLSLQKTTLI--- 1110
Query: 924 PDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSF 982
G ++++ T++G IG++ HE + F + L+ ++R + G +H +RS+
Sbjct: 1111 ----PGGSESLVYTTLSGGIGILVPFTSHEDHDFFQHLEMHMRSEFPPLCGRDHLSFRSY 1166
Query: 983 NNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
KN +DGDL E F + + +S+ ++ + E+ K++E++
Sbjct: 1167 -----YFPVKNVIDGDLCEQFNSMDPHKQKSVSEELDRTPPEVSKKLEDI 1211
>gi|342320507|gb|EGU12447.1| Pre-mRNA-splicing factor RSE1 [Rhodotorula glutinis ATCC 204091]
Length = 1212
Score = 213 bits (543), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 282/1186 (23%), Positives = 480/1186 (40%), Gaps = 187/1186 (15%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+++ ++G+I +L F+ G ++D++ I ++ + VL++D ++ + + + R G
Sbjct: 55 IVEADVFGQIRSLTAFKLTGGSKDYIIIGSDSGRIVVLEYDPNNN-VFNKLHQETYGRSG 113
Query: 61 -RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLK-----EAFNIR--LEELQ 110
R GQ DP R +IG L ++ D L EA R + +
Sbjct: 114 SRRIVPGQYLATDPKGRAVMIGAMEKSKLVYILNRDAAANLTISSPLEAHKPRAIIHSIV 173
Query: 111 VLDIKFLYGCAKPTIVVLYQDNKDARHVKT-------------YEVALKDKDFVEGPWSQ 157
+D+ F P L D +A T YE+ L V WS+
Sbjct: 174 GVDVGF----ENPMFAALEVDYTEADQDPTGAAFDAAEKMLTYYELDLGLNHVVRK-WSE 228
Query: 158 NNLDNGADLLIPVP------------PPLCGVLIIGEETIVYCSANAFK---AIPIRPS- 201
D A+LL+ VP P GVL+ E+ I+Y A + IP R
Sbjct: 229 AT-DPRANLLVQVPGGQNATMDRFDGP--SGVLVCCEDYIIYKHQGAKEHRVPIPKRAHP 285
Query: 202 ---------ITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIAS 252
IT A +LL G L + I HE+E+VT LKI+ AS
Sbjct: 286 LADPERGVIITAAVMHKMKGAFFFLLQSEEGDLFKVTIEHEEEEVTALKIKYFDTVPSAS 345
Query: 253 TISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLG--------------- 297
++ L + +++ + +G+ +L + D Y NLG
Sbjct: 346 SLCILKSGFLFVATEFGNPRLYQFEKLGDDDDETEFSSTDYDNLGAGTDPLPPALFRPRE 405
Query: 298 --------------PIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVE 343
PI+D V + + Q+ G SLRI+R G+ + E S E
Sbjct: 406 LQNLAIADEVESLAPILDAKVANYLGEDTPQIYAACGRGARSSLRILRQGLEVMEAVSSE 465
Query: 344 LQGIK-GMWS--LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLF 400
L G +W+ LR+ +D ++++SF++ T +LA+ D +EE GF S TL
Sbjct: 466 LPGAPIAVWTTKLRAEGASSYDAYIILSFVNGTLVLAIG--DTIEEVSDTGFISSAPTLG 523
Query: 401 CHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVY 460
+ L+Q+ +R + + R NEWK ++ AT N+ QV++ G LVY
Sbjct: 524 VQQLGDDALLQIYPRGIRHILADKRV--NEWKVGARETIVCATTNSRQVVIGLSTGELVY 581
Query: 461 LEIG-DGILTEVKHAQ-LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNL 518
E+ DG L E + + + EI L I + E + AVG D +VRI SL N
Sbjct: 582 FELDMDGQLNEFQERKPMGAEILALSIAEVPEGRQRTPYLAVGC-ADSTVRIISLDPENT 640
Query: 519 ITKEHLGGEIIPRSVLLCA---------FEGISYLLCALGDGHLLNFLLNMKTGELTDRK 569
+ L P S ++ A + ++ L +G LL +L+ TG L D +
Sbjct: 641 LDSLSLQALTAPPSSIVMAEITDASIDKYHATMFVNIGLNNGVLLRTVLDPLTGSLGDTR 700
Query: 570 KVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAF 629
LG++P+ L + + + A S RP + Y+ L ++ + + + F++
Sbjct: 701 TRFLGSRPVKLARVPVQGSPAILALSSRPWLNYAYRGILQFTPLIFDALDYAWSFSAELC 760
Query: 630 PDSLAIAKEGELTIGTIDDI-QKLHIRSIPLGEHPRRICHQEQS----------RTFAIC 678
P+ L L I T + QK+ I L PR++ S RTF+
Sbjct: 761 PEGLIGIVGNSLRIFTFPRLGQKVQQTVIDLSYTPRQLLTSPHSRLLYTVEADHRTFSPS 820
Query: 679 SL-KNQSCAEESEMHF-------------------------VRLLDDQTFEFISTYPLDT 712
++ K S +EM VR++D T E + L+
Sbjct: 821 AIQKTISDMRMAEMEVDEEVLNLDPKEFGLPRGPAGQWASCVRVIDPVTAETVFKVDLEQ 880
Query: 713 FEYGCSILSCSFSDDSN-VYYCVGTAY---VLPEENEPTKGRILVFIVEDGKLQLIAEKE 768
E S +F N V+ VGT + P + + E +L+L+ + E
Sbjct: 881 NEAAFSAAIVTFHSHPNEVFLVVGTGQDTSLAPRACKQAYLHTYKLLEEGRQLELLHKTE 940
Query: 769 TKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGH-ILALYVQTRGDF 827
+L AF G+L+A + + ++LY D G ++L + + G + + + T+G
Sbjct: 941 VDDIPKALIAFQGRLVAGVGKALRLY-----DLGKKKLLRKAENKGFATMIMTLNTQGTR 995
Query: 828 IVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKN 887
I+VGD +S+ +YK E + A D + W +A I+ D + A + F F V +
Sbjct: 996 IIVGDAQESVYYALYKAPENRLLIFADDISPRWTTA-SIMVDYETVAAGDKFGNFFVNRL 1054
Query: 888 SEGAT----DEERG---------------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV 928
+G + D+ G R ++ YH+G+ + +LV
Sbjct: 1055 PKGVSSDVDDDPTGAGIMHEKPYLMGAPHRTHLLAHYHIGDIITSLHKVALV-------A 1107
Query: 929 GQIPTVIFGTVNGVIGVIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRS-FNNEK 986
G +++ + G +GV+ + +E F L+ +LR + G H +RS + K
Sbjct: 1108 GGRDLLVYTGLMGTVGVLVPFVSNEDVDFFTTLEMHLRSEAPSLCGREHLAYRSAYTPVK 1167
Query: 987 KTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
TV DGDL E + L + +I+ + +V E+ K+++ +
Sbjct: 1168 ATV------DGDLCEVYRSLPMAKQGQIAGELERTVSEVIKKLDNV 1207
>gi|388582014|gb|EIM22320.1| hypothetical protein WALSEDRAFT_60013 [Wallemia sebi CBS 633.66]
Length = 1208
Score = 213 bits (543), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 201/847 (23%), Positives = 378/847 (44%), Gaps = 91/847 (10%)
Query: 258 DNAVVYIGSSYGDSQLIKLN---------LQPDAKGSYVEVLERYVNLGPIVDFCVVDLE 308
DN + Y SSY ++ + L P + V V + +L PI+D V +
Sbjct: 375 DNELEYSSSSYENNGMPSLENPLPIESAYFTPRGLDNLVPV-DEIQSLAPILDAKVQSIY 433
Query: 309 RQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSLRSSTDDPFDTFLVV 367
Q+ T SG SLR++R+G+ + E S EL G+W+L+ + D +D+ +V+
Sbjct: 434 AGDTPQIYTASGVGSRSSLRVMRHGLDVIEAVSSELPAPPNGIWTLKQNAQDMYDSLIVL 493
Query: 368 SFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSREL 427
SF++ T +L + + +EE G + TL + ++QV +R + + R
Sbjct: 494 SFVNGTLVLGIG--ESIEEVSDTGLATSVSTLSVDQLGEDSMIQVFPQGIRHILNDKR-- 549
Query: 428 RNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTEVKHAQ-LEYEISCLDI 485
NEWKSP + +T N+ QV +A G LVY E+ +G L E + + +E ++ + I
Sbjct: 550 VNEWKSPSDTYITASTTNSRQVCVALSNGELVYFEMDNEGQLNEFQERKSMESTVTTMSI 609
Query: 486 NPIGENPSYSQIAAVGMWTDISVRIFSL-PD--LNLITKEHLGG--------EIIPRSVL 534
+ + A+G D ++RI SL P+ L +++ + + EI+ +S
Sbjct: 610 GEVPQGRQRCPYLALGC-DDQTIRIVSLDPENTLEVVSVQAVTAQPSSICVAEILDKS-- 666
Query: 535 LCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAA 594
L + ++ L +G LL +L+ G L D + LG +P+ LR + V A
Sbjct: 667 LDKYNPTLFVNIGLANGVLLRTVLDTVNGSLADTRTRFLGAKPVVLRRVTVDKQQAVMAL 726
Query: 595 SDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDI-QKLH 653
S R + Y+ ++ ++ + + + + FN+ PD L + L I T+ ++ Q++
Sbjct: 727 STRTFLNYAHGDQMYFTPLLYEPLDQVSSFNAELCPDGLIGISDTVLRIFTLPNVGQRMK 786
Query: 654 IRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHF-------------------- 693
S+ L PR+I + F +++ + E +
Sbjct: 787 QDSVALSYTPRKILLHPTAPLFLTIESDHRTISRERQAELLTSKGYNPEEHNFDAVQFGN 846
Query: 694 -----------VRLLDDQTFEFISTYPLDTFEYGCSILSCSFS-DDSNVYYCVGTA---Y 738
VR++D T ++ LD E S ++ + + VGTA
Sbjct: 847 VRMEAGNWASCVRMIDPVTLTTVNKVELDNNEAAFSAAFVQWAGHEDETHLVVGTAKDRM 906
Query: 739 VLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWML 798
++P+ ++ R+ + +D +L+L+ + + Y+++AF G+LLA + + ++LY
Sbjct: 907 MMPQSHKEAYLRVYK-VTQDSQLELLHKTDIDDVPYAIHAFKGRLLAGVGKALRLY---- 961
Query: 799 RDDGTRELQSECGHHGHILALY-VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYN 857
D G + L +C + + + G I VGD+ +S+S +YK E + A D
Sbjct: 962 -DLGKKRLLRKCENKSFAAGIVNLNVVGSRIYVGDMQESVSFAVYKAPENRLLVFADDIM 1020
Query: 858 ANWMSAVEILDDDIYLGAENNFNLFTVR--KNSEGATDEER---GRLEVVGEYH----LG 908
+ W + +D D G + N+F R K++ DE+ G L G YH
Sbjct: 1021 SRWTTTATPVDYDTVAGGDKFGNIFITRVDKSTSEWVDEDESGGGLLHARGLYHGAPNRS 1080
Query: 909 EFVNRFRHGSLVMRLPDSDV--GQIPTVIFGTVNGVIGVIASLPHEQYL-FLEKLQTNLR 965
+ + F G +V + S + G +++ ++G +G+I + + F+ L+ ++R
Sbjct: 1081 KLLAHFYVGDIVTSITKSQLSAGGRDVLVYTCLHGTVGMIIPFASKDDIEFMSTLELHMR 1140
Query: 966 KVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEEL 1025
+ + G +H +RS+ + K F+DGDL E + L T+ I+ ++ + E+
Sbjct: 1141 QESPSLVGRDHLGFRSY-----YIPCKAFVDGDLCELYASLPVTKQQAIANELDRTSGEV 1195
Query: 1026 CKRVEEL 1032
K++E L
Sbjct: 1196 LKKIESL 1202
>gi|405970223|gb|EKC35151.1| Splicing factor 3B subunit 3 [Crassostrea gigas]
Length = 1217
Score = 213 bits (542), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 274/1196 (22%), Positives = 504/1196 (42%), Gaps = 201/1196 (16%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+L V ++G I +L FR G ++D++ + ++ + +L++ A+ + + + + + G
Sbjct: 53 VLSVEVFGMIRSLMPFRLTGGSKDYVVVGSDSGRIVILEYIAQKN-IFEKVHQETFGKSG 111
Query: 61 -RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
R GQ +DP R +IG L ++ D +L + + + L +
Sbjct: 112 CRRIVPGQYLAVDPKGRAVMIGAVEKQKLVYILNRDASARLTISSPLEAHKANTLVYHMV 171
Query: 118 ---YGCAKPTIVVLYQDNKDARHVKTYEVALKDK------------DFVEGPWSQNNLDN 162
G PT L D ++A T E A + + V +S++ L+
Sbjct: 172 GVDVGFENPTFACLEIDYEEADMDHTGEGAKNTQQLLTYYELDLGLNHVVRKYSES-LEE 230
Query: 163 GADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR-- 215
A+ LI VP P GVLI E I Y N IR I + +D D R
Sbjct: 231 HANFLISVPGGSDGP--SGVLICSENYITY--KNLGDQPDIRCPIPRRRNDLD-DPERGM 285
Query: 216 ---------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNA 260
+L G + + + +++ VT + ++ +A+ + L +
Sbjct: 286 IFVCSATHKTKSMFFFLAQTEQGDIFKITLETDEDMVTEIILKYFDTVPVAANMCVLKSG 345
Query: 261 VVYIGSSYGDSQLIKLN------------------------LQPDAKGSYVEVLERYVNL 296
++I S +G+ L ++ P + V+V E +L
Sbjct: 346 FLFIASEFGNHYLYQIAQLGDDDEEPRFSSASQLEEGEPHFFSPRPLKNLVQVDE-MDSL 404
Query: 297 GPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRS 355
PI + DL + Q+ T G +LR++R+G+ ++E A EL G +W+++
Sbjct: 405 SPITHCQIADLANEDTPQLYTLCGRGPRSTLRVLRHGLEVSEMAVSELPGNPNAVWTVKK 464
Query: 356 STDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSG 415
+ DD FD +++VSF++ T +L++ + +EE GF T TL C + LVQ+
Sbjct: 465 NIDDEFDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSQLGDDALVQIYPD 522
Query: 416 SVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD-GILTE-VKH 473
+R + S R NEWK+P S+ N QV++A GG LVY E+ G L E +
Sbjct: 523 GIRHIRSDKR--VNEWKTPGKKSIVKCAVNQRQVVIALTGGELVYFEMDPTGQLNEYTER 580
Query: 474 AQLEYEISCLDIN--PIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPR 531
++ E+ C+ + P+GE + AVG+ +D +VRI SL + ++ L + +P
Sbjct: 581 KEMSSEVVCMALGRVPVGEQ--RCRFLAVGL-SDNTVRIISLDPSDCLSP--LSMQALPE 635
Query: 532 -SVLLCAFE----------------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLG 574
S LC E G +L L +G LL +L+ TG+L+D + LG
Sbjct: 636 PSESLCIVEMGGTEAKEETGEPGTVGGLFLNIGLQNGVLLRTVLDNVTGDLSDTRTRYLG 695
Query: 575 TQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLA 634
+P+ L S + + V A S R + Y+ + + ++ + + + F S P+ +
Sbjct: 696 VRPVKLFRISMQGSEAVLAMSSRSWLSYTYQNRFHLTPLSYETLEYASGFASEQCPEGIV 755
Query: 635 IAKEGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQEQSRTFAICSLKN---------- 682
L I ++ + + + S PL PR+ + H E + I + N
Sbjct: 756 AISTNTLRILALEKLGAVFNQVSHPLQHTPRKFVIHPETNNLIVIETEYNAYTEATKMER 815
Query: 683 ----------QSCAEESEM-----------------------------HFVRLLDDQTFE 703
AEE EM +R+++ +
Sbjct: 816 KQQMAEEMVEMVSAEEREMAAEMAANFLNEERSESIFGAPKAGPGMWASIIRIINPISGN 875
Query: 704 FISTYPLDTFEYGCSILSCSFS---DDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDG- 759
+ L+ E SI F+ DD V V VL T G + ++ +EDG
Sbjct: 876 TLEKIQLEQNESVHSIALLKFASRGDDQFVLVGVARDLVL-NPRSLTGGFLYLYQLEDGG 934
Query: 760 -KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHG---H 815
L+L+ + + ++ +F G++L + + +++Y D G +++ +C + +
Sbjct: 935 ESLKLLHKTPVEDVPGAIASFQGRVLIGVGRYLRIY-----DIGKKKMLRKCENKSIPNY 989
Query: 816 ILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGA 875
+ A++ T G+ I+ D+ S + YK +E + A D N W++A LD + GA
Sbjct: 990 VSAIH--TVGNRIIASDVQDSFHFVRYKRQENQLIVFADDTNPRWITASCSLDYNTVAGA 1047
Query: 876 ENNFNLFTVRKNSEGATDE-----------ERGRL-------EVVGEYHLGEFVNRFRHG 917
+ N+ +R + + D +RG L +V+ +HLGE V +
Sbjct: 1048 DKFGNITIIRLPGDVSDDVDEDPTGNKALWDRGLLNGASQKADVMMNFHLGEMVTSLQKA 1107
Query: 918 SLVMRLPDSDVGQIPTVIFGTVNGVIGV-IASLPHEQYLFLEKLQTNLRKVIKGVGGLNH 976
+L+ G ++++ T++G IG+ +A HE + F + L+ +R + G +H
Sbjct: 1108 TLI-------PGGSESLVYTTLSGGIGMLVAFTSHEDHDFFQHLEMYMRTEHPPLCGRDH 1160
Query: 977 EQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
+RS+ KN +DGDL E F + + +S+ ++ + E+ K++E++
Sbjct: 1161 LSFRSY-----YYPVKNVIDGDLCEMFNSMDPVKQKSVSEELDRTPSEVSKKLEDI 1211
>gi|432114152|gb|ELK36185.1| Splicing factor 3B subunit 3 [Myotis davidii]
Length = 1217
Score = 213 bits (542), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 271/1193 (22%), Positives = 496/1193 (41%), Gaps = 195/1193 (16%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+L V ++G I +L FR G +D++ + ++ + +L++ S + + + + G
Sbjct: 53 LLTVEVFGVIRSLMAFRLTGGTKDYIVVGSDSGRIVILEYQP-SKNMFEKIHQETFGKSG 111
Query: 61 -RPTDNGQIGIIDPDCRLIGLHLYDG--LFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
R GQ +DP R + + + L ++ D +L + + + L +
Sbjct: 112 CRRIVPGQFLAVDPKGRAVMISAIEKQKLVYILNRDAAARLTISSPLEAHKANTLVYHVV 171
Query: 118 ---YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNG 163
G P L D ++A + T E A + F E N+ L+
Sbjct: 172 GVDVGFENPMFACLEMDYEEADNDPTGEAAANTQQTLTFYELDLGLNHVVRKYSEPLEEH 231
Query: 164 ADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR--- 215
+ LI VP P GVLI E I Y N IR I + +D D R
Sbjct: 232 GNFLITVPGGSDGP--SGVLICSENYITY--KNFGDQPDIRCPIPRRRNDLD-DPERGMI 286
Query: 216 --------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAV 261
+L G + + + +++ VT ++++ +A+ + L
Sbjct: 287 FVCSATHKTKSMFFFLAQTEQGDIFKITLETDEDMVTEIRLKYFDTVPVAAAMCVLKTGF 346
Query: 262 VYIGSSYGDSQLIKLN------------------------LQPDAKGSYVEVLERYVNLG 297
+++ S +G+ L ++ QP + V +++ +L
Sbjct: 347 LFVASEFGNHYLYQIAHLGDDDEEPEFSSAMPLEEGDTFFFQPRPLKNLV-LVDELDSLS 405
Query: 298 PIVDFC-VVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRS 355
PI+ FC + DL + Q+ G SLR++R+G+ ++E A EL G +W++R
Sbjct: 406 PIL-FCQIADLANEDTPQLYVACGRGPRSSLRVLRHGLEVSEMAVSELPGNPNAVWTVRR 464
Query: 356 STDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSG 415
+D FD +++VSF++ T +L++ + +EE GF T TL C + LVQV
Sbjct: 465 HIEDEFDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSLLGDDALVQVYPD 522
Query: 416 SVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKH 473
+R + + R NEWK+P ++ N QV++A GG LVY E+ G L E +
Sbjct: 523 GIRHIRADKR--VNEWKTPGKKTIVKCAVNQRQVVIALTGGELVYFEMDPSGQLNEYTER 580
Query: 474 AQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIP-RS 532
++ ++ C+ + + S+ AVG+ D +VRI SL + + + L + +P +
Sbjct: 581 KEMSADVVCMSLANVPPGEQRSRFLAVGL-VDNTVRIISLDPSDCL--QPLSMQALPAQP 637
Query: 533 VLLCAFE----------------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQ 576
LC E G YL L +G LL +L+ TG+L+D + LG++
Sbjct: 638 ESLCIVEMGGTEKQDELGERGSIGFLYLNIGLQNGVLLRTVLDPVTGDLSDTRTRYLGSR 697
Query: 577 PITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIA 636
P+ L + V A S R + YS + + ++ + + F S P+ +
Sbjct: 698 PVKLFRVRMQGQEAVLAMSSRSWLSYSYQSRFHLTPLSYETLEFASGFASEQCPEGIVAI 757
Query: 637 KEGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQEQSR-----------TFAICSLKNQ 683
L I ++ + + + + PL PR+ + H E + T A + + Q
Sbjct: 758 STNTLRILALEKLGAVFNQVAFPLQYTPRKFVIHPESNNLIIIETDHNAYTEATKAQRKQ 817
Query: 684 SCAEE-------------SEM-------------------------HFVRLLDDQTFEFI 705
AEE +EM +R+++ +
Sbjct: 818 QMAEEMVEAAGEDERELAAEMAAAFLNENLPESIFGAPKAGNGQWASVIRVMNPIQGNTL 877
Query: 706 STYPLDTFEYGCSILSCSFSDDSNVYYC-VGTAYVLPEENEPTKGRILVF--IVEDG-KL 761
L+ E S+ C F++ +Y VG A L G + +V +G KL
Sbjct: 878 DLVQLEQNEAAFSVAVCRFANTGEDWYVLVGVAKDLILNPRSVAGGFVYTYKLVNNGEKL 937
Query: 762 QLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY- 820
+ + + + ++ F G++L + + +++Y D G ++L +C + HI A Y
Sbjct: 938 EFLHKTPVEEVPAAIAPFQGRVLIGVGKLLRVY-----DLGKKKLLRKC-ENKHI-ANYI 990
Query: 821 --VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENN 878
+QT G ++V D+ +S + YK E + A D W++ +LD D GA+
Sbjct: 991 SGIQTIGHRVIVSDVQESFIWVRYKRNENQLIIFADDTYPRWVTTASLLDYDTVAGADKF 1050
Query: 879 FNLFTVR--KNSEGATDE---------ERGRL-------EVVGEYHLGEFVNRFRHGSLV 920
N+ VR N+ DE +RG L EV+ YH+GE V + +L+
Sbjct: 1051 GNICVVRLPPNTNDEVDEDPTGNKALWDRGLLNGASQKAEVIMNYHVGETVLSLQKTTLI 1110
Query: 921 MRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQW 979
G ++++ T++G IG++ HE + F + ++ +LR + G +H +
Sbjct: 1111 -------PGGSESLVYTTLSGGIGILVPFTSHEDHDFFQHVEMHLRSEHPPLCGRDHLSF 1163
Query: 980 RSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
RS+ KN +DGDL E F + + +S+ ++ + E+ K++E++
Sbjct: 1164 RSY-----YFPVKNVIDGDLCEQFNSMEPNKQKNVSEELDRTPPEVSKKLEDI 1211
>gi|367050506|ref|XP_003655632.1| hypothetical protein THITE_2119532 [Thielavia terrestris NRRL 8126]
gi|347002896|gb|AEO69296.1| hypothetical protein THITE_2119532 [Thielavia terrestris NRRL 8126]
Length = 1211
Score = 213 bits (541), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 270/1167 (23%), Positives = 483/1167 (41%), Gaps = 165/1167 (14%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
I+G I +L FR G +D++ +AT+ + ++++ + + + R
Sbjct: 64 IFGIIRSLASFRLAGSHKDYIILATDSGRIAIVEYLPKENRFSRIHLETFGKSGVRRVIP 123
Query: 66 GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YGC 120
GQ DP R LI + L V+ + + +L + + + VL + + G
Sbjct: 124 GQYLAADPKGRACLIASIEKNKLVYVLNRNAQAELTISSPLEAHKPGVLVLSLVALDVGY 183
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDK------------DFVEGPWSQNNLDNGADLLI 168
A P L D +A T E + + + V WS + +D + LL
Sbjct: 184 ANPVFAALEIDYSEADQDPTGEAGREAEAQLVYYELDLGLNHVVRKWS-DTVDPTSSLLF 242
Query: 169 PVP-----PPLCGVLIIGEETIVYCSAN--AFK-AIPIRPSITK--AYGRVDADGSRYLL 218
VP P GVL+ GEE + Y +N AF+ IP R T+ R G + L
Sbjct: 243 QVPGGNDGP--SGVLVCGEENVTYRHSNQEAFRVPIPRRRGATEDPQRKRTIVSGVMHKL 300
Query: 219 GDHAGLLHLLVITHEKE------------------KVTGLKIELLGETSIASTISYLDNA 260
AG L+ T + + +V LKI+ IA ++ L +
Sbjct: 301 KGSAGAFFFLLQTDDGDLFKVTIDMVEDSEGNPTGEVRRLKIKYFDTIPIAHSLCILKSG 360
Query: 261 VVYIGSSYGDSQLIKL---------------NLQPDAKGSY------------VEVLERY 293
++ S +G+ + + D + SY + ++E
Sbjct: 361 FLFAASEFGNHHFYQFEKLGDDDEEPEFTSDDFPTDPRASYNPVYFHPRPLENLALVESL 420
Query: 294 VNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWS 352
++ P+VD V +L + Q+ G + R++++G+ ++E + EL G +W+
Sbjct: 421 NSMNPLVDCKVANLTGEDAPQIYAVCGNRARSTFRMLKHGLEVSEIVASELPGTPSAVWT 480
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
+ + D +D ++V+SF + T +L++ + +EE GF + TL L+QV
Sbjct: 481 TKLTKYDEYDGYIVLSFTNATLVLSIG--ETVEEVTESGFLTSVPTLAVQQLGEEGLIQV 538
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTEV 471
+R + NEW +P S+ A N SQV++A G +VY E+ DG L E
Sbjct: 539 HPKGIRHIVQGRV---NEWPAPQHRSIVAAATNESQVVIALSSGEIVYFEMDADGSLAEY 595
Query: 472 -KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEII 529
+ ++ ++ L + + E + S AVG D +VRI L P+ L K
Sbjct: 596 DEKKEMSGTVTSLSLGKVPEGLTRSSFLAVGC-DDCTVRILGLDPESTLEMKSIQALTSA 654
Query: 530 PRSVLLCAFEGIS-----YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFS 584
P S+ + + E + YL L G L +L+ TGELTD + LG +P L S
Sbjct: 655 PSSLSIMSMEDSTGGNTLYLHIGLHSGVYLRTVLDEVTGELTDTRSKFLGPKPTKLFQVS 714
Query: 585 SKNTTHVFAASDRPTVIYSS--NKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELT 642
+N V A S RP + Y +K + + ++ E+ + F+S + + L
Sbjct: 715 VQNQPCVLALSSRPWLGYMDPLSKNFVMTPLSYAELEYGWNFSSEQCLEGMVGIHANFLR 774
Query: 643 IGTIDDIQKLHI-RSIPLGEHPRRIC-HQEQSRTFAI--------CSLKNQSCAEESEMH 692
I TI+ + + I +SIPL P+R+ H EQ + I L+ Q A+ ++
Sbjct: 775 IFTIEKLGETMIQKSIPLTYTPKRLVKHPEQPYFYTIEADNNTLPPELRAQLLAQSGAVN 834
Query: 693 F-VRLLDDQTFEF----------------ISTYP-------LDTFEYGCSILSCSF-SDD 727
+L + F + +S P L+ E S F S +
Sbjct: 835 GDATVLPPEDFGYPRAKGRWASCISVVDPVSEEPRVLQRVDLEGNEAAVSAAVVPFASQE 894
Query: 728 SNVYYCVGTAY-VLPEENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVYSLNAFNGKLL 784
+ VGT ++ + T+G I V+ EDG+ L+ I + + +L F G+LL
Sbjct: 895 GESFLIVGTGKDMVLNPRQFTEGYIHVYRFHEDGRSLEFIHKTRVEEPPMALIPFQGRLL 954
Query: 785 AAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKH 844
A I + +++Y LR + L+ G L + +QT+G+ IVVGD+ + ++ ++YK
Sbjct: 955 AGIGKTLRVYDLGLR----QLLRKAQGEVAQQLIVSLQTQGNRIVVGDVQQGVTYVVYKP 1010
Query: 845 EEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEER-------- 896
E + D W + + ++D + G + NL+ VR + + +
Sbjct: 1011 ESNKLIPFVDDTINRWTTCITMVDYESVAGGDRFGNLWIVRAPERASQESDEPGSEVQLL 1070
Query: 897 ----------GRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGV- 945
RL ++ ++ + +LV VG +++ + G +GV
Sbjct: 1071 HARSYLHGAPNRLNLMAHFYPQDLPTSITKTNLV-------VGGQDVLVWSGIQGTVGVL 1123
Query: 946 IASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLD 1005
I + E F + L++++R + G +H +R + V K +DGDL E F
Sbjct: 1124 IPFVSREDVDFFQNLESHMRSEDPPLAGRDHLIYRGY-----YVPVKGVIDGDLCERFSL 1178
Query: 1006 LSRTRMDEISKTMNVSVEELCKRVEEL 1032
L + I+ ++ SV E+ +++ ++
Sbjct: 1179 LPNDKKQMIAGELDRSVREVERKISDI 1205
>gi|46362557|gb|AAH68974.1| Splicing factor 3b, subunit 3, 130kDa [Homo sapiens]
Length = 1217
Score = 212 bits (540), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 272/1193 (22%), Positives = 495/1193 (41%), Gaps = 195/1193 (16%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+L V ++G I +L FR G +D++ + ++ + +L++ S + + + + G
Sbjct: 53 LLTVEVFGVIRSLMAFRLTGGTKDYIVVGSDSGRIVILEYQP-SKNMFEKIHQETFGKSG 111
Query: 61 -RPTDNGQIGIIDPDCRLIGLHLYDG--LFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
R GQ +DP R + + + L ++ D +L + + + L +
Sbjct: 112 CRRIVPGQFLAVDPKGRAVMISAIEKQKLVYILNRDAAARLTISSPLEAHKANTLVYHVV 171
Query: 118 ---YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNG 163
G P L D ++A + T E A + F E N+ L+
Sbjct: 172 GVDVGFENPMFACLEMDYEEADNDPTGEAAANTQQTLTFYELDLGLNHVVRKYSEPLEEH 231
Query: 164 ADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR--- 215
+ LI VP P GVLI E I Y N IR I + +D D R
Sbjct: 232 GNFLITVPGGSDGP--SGVLICSENYITY--KNFGDQPDIRCPIPRRRNDLD-DPERGMI 286
Query: 216 --------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAV 261
L G + + + +++ VT ++++ +A+ + L
Sbjct: 287 FVCSATHKTKSMFFSLAQTEQGDIFKITLETDEDMVTEIRLKYFDTVPVAAAMCVLKTGF 346
Query: 262 VYIGSSYGDSQLIKLN------------------------LQPDAKGSYVEVLERYVNLG 297
+++ S +G+ L ++ QP + V +++ +L
Sbjct: 347 LFVASEFGNHYLYQIAHLGDDDEEPEFSSAMPLEEGDTFFFQPRPLKNLV-LVDELDSLS 405
Query: 298 PIVDFC-VVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRS 355
PI+ FC + DL + Q+ G SLR++R+G+ ++E A EL G +W++R
Sbjct: 406 PIL-FCQIADLANEDTPQLYVACGRGPRSSLRVLRHGLEVSEMAVSELPGNPNAVWTVRR 464
Query: 356 STDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSG 415
+D FD +++VSF++ T +L++ + +EE GF T TL C + LVQV
Sbjct: 465 HIEDEFDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSLLGDDALVQVYPD 522
Query: 416 SVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKH 473
+R + + R NEWK+P ++ N QV++A GG LVY E+ G L E +
Sbjct: 523 GIRHIRADKR--VNEWKTPGKKTIVKCAVNQRQVVIALTGGELVYFEMDPSGQLNEYTER 580
Query: 474 AQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIP-RS 532
++ ++ C+ + + S+ AVG+ D +VRI SL + + + L + +P +
Sbjct: 581 KEMSADVVCMSLANVPPGEQRSRFLAVGL-VDNTVRIISLDPSDCL--QPLSMQALPAQP 637
Query: 533 VLLCAFE----------------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQ 576
LC E G YL L +G LL +L+ TG+L+D + LG++
Sbjct: 638 ESLCIVEMGGTEKQDELGERGSIGFLYLNIGLQNGVLLRTVLDPVTGDLSDTRTRYLGSR 697
Query: 577 PITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIA 636
P+ L + V A S R + YS + + ++ + + F S P+ +
Sbjct: 698 PVKLFRVRMQGQEAVLAMSSRSWLSYSYQSRFHLTPLSYETLEFASGFASEQCPEGIVAI 757
Query: 637 KEGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQEQSR-----------TFAICSLKNQ 683
L I ++ + + + + PL PR+ + H E + T A + + Q
Sbjct: 758 STNTLRILALEKLGAVFNQVAFPLQYTPRKFVIHPESNNLIIIETDHNAYTEATKAQRKQ 817
Query: 684 SCAEE-------------SEM-------------------------HFVRLLDDQTFEFI 705
AEE +EM +R+++ +
Sbjct: 818 QMAEEMVEAAGEDERELAAEMAAAFLNENLPESIFGAPKAGNGQWASVIRVMNPIQGNTL 877
Query: 706 STYPLDTFEYGCSILSCSFSDDSNVYYC-VGTAYVLPEENEPTKGRILVF--IVEDG-KL 761
L+ E S+ C FS+ +Y VG A L G + +V +G KL
Sbjct: 878 DLVQLEQNEAAFSVAVCRFSNTGEDWYVLVGVAKDLILNPRSVAGGFVYTYKLVNNGEKL 937
Query: 762 QLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY- 820
+ + + + ++ F G++L + + +++Y D G ++L +C + HI A Y
Sbjct: 938 EFLHKTPVEEVPAAIAPFQGRVLIGVGKLLRVY-----DLGKKKLLRKC-ENKHI-ANYI 990
Query: 821 --VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENN 878
+QT G ++V D+ +S + YK E + A D W++ +LD D GA+
Sbjct: 991 SGIQTIGHRVIVSDVQESFIWVRYKRNENQLIIFADDTYPRWVTTASLLDYDTVAGADKF 1050
Query: 879 FNLFTVR--KNSEGATDE---------ERGRL-------EVVGEYHLGEFVNRFRHGSLV 920
N+ VR N+ DE +RG L EV+ YH+GE V + +L+
Sbjct: 1051 GNICVVRLPPNTNDEVDEDPTGNKALWDRGLLNGASQKAEVIMNYHVGETVLSLQKTTLI 1110
Query: 921 MRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQW 979
G ++++ T++G IG++ HE + F + ++ +LR + G +H +
Sbjct: 1111 -------PGGSESLVYTTLSGGIGILVPFTSHEDHDFFQHVEMHLRSEHPPLCGRDHLSF 1163
Query: 980 RSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
RS+ KN +DGDL E F + + +S+ ++ + E+ K++E++
Sbjct: 1164 RSY-----YFPVKNVIDGDLCEQFNSMEPNKQKNVSEELDRTPPEVSKKLEDI 1211
>gi|224064143|ref|XP_002188384.1| PREDICTED: splicing factor 3B subunit 3 [Taeniopygia guttata]
Length = 1217
Score = 212 bits (540), Expect = 8e-52, Method: Compositional matrix adjust.
Identities = 269/1192 (22%), Positives = 494/1192 (41%), Gaps = 193/1192 (16%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+L V ++G I +L FR G +D++ + ++ + +L++ S + + + + G
Sbjct: 53 LLTVEVFGVIRSLMAFRLTGGTKDYIVVGSDSGRIVILEYQP-SKNVFEKIHQETFGKSG 111
Query: 61 -RPTDNGQIGIIDPDCRLIGLHLYDG--LFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
R GQ +DP R + + + L ++ D +L + + + L +
Sbjct: 112 CRRIVPGQYLAVDPKGRAVMISAIEKQKLVYILNRDAAARLTISSPLEAHKANTLVYHVV 171
Query: 118 ---YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNG 163
G P L D ++A + T E A + F E N+ L+
Sbjct: 172 GVDVGFENPMFACLEMDYEEADNDPTGEAAANTQQTLTFYELDLGLNHVVRKYSEPLEEH 231
Query: 164 ADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR--- 215
+ LI VP P GVLI E I Y N IR I + +D D R
Sbjct: 232 GNFLITVPGGSDGP--SGVLICSENYITY--KNFGDQPDIRCPIPRRRNDLD-DPERGMI 286
Query: 216 --------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAV 261
+L G + + + +++ VT ++++ +A+ + L
Sbjct: 287 FVCSATHKTKSMFFFLAQTEQGDIFKITLETDEDMVTEIRLKYFDTVPVAAAMCVLKTGF 346
Query: 262 VYIGSSYGDSQLIKLN------------------------LQPDAKGSYVEVLERYVNLG 297
+++ S +G+ L ++ QP + V +++ +L
Sbjct: 347 LFVASEFGNHYLYQIAHLGDDDEEPEFSSAMPLEEGDTFFFQPRPLKNLV-LVDELDSLS 405
Query: 298 PIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSS 356
PI+ + DL + Q+ G SLR++R+G+ ++E A EL G +W++R
Sbjct: 406 PILCCQIADLANEDTPQLYVACGRGPRSSLRVLRHGLEVSEMAVSELPGNPNAVWTVRRH 465
Query: 357 TDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGS 416
+D FD +++VSF++ T +L++ + +EE GF T TL C + LVQV
Sbjct: 466 VEDEFDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSLLGDDALVQVYPDG 523
Query: 417 VRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKHA 474
+R + + R NEWK+P ++ N QV++A GG LVY E+ G L E +
Sbjct: 524 IRHIRADKR--VNEWKTPGKKTIVKCAVNQRQVVIALTGGELVYFEMDPSGQLNEYTERK 581
Query: 475 QLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIP-RSV 533
++ ++ C+ + + S+ AVG+ D +VRI SL + + + L + +P +
Sbjct: 582 EMSADVVCMSLANVPPGEQRSRFLAVGL-VDNTVRIISLDPSDCL--QPLSMQALPAQPE 638
Query: 534 LLCAFE----------------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQP 577
LC E G YL L +G LL +L+ TG+L+D + LG++P
Sbjct: 639 SLCIVEMGGTEKQDELGERGSIGFLYLNIGLQNGVLLRTVLDPVTGDLSDTRTRYLGSRP 698
Query: 578 ITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAK 637
+ L + V A S R + YS + + ++ + + F S P+ +
Sbjct: 699 VKLFRVRMQGQEAVLAMSSRSWLSYSYQSRFHLTPLSYETLEFASGFASEQCPEGIVAIS 758
Query: 638 EGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQEQSR-----------TFAICSLKNQS 684
L I ++ + + + + PL PR+ + H E + T A + + Q
Sbjct: 759 TNTLRILALEKLGAVFNQVAFPLQYTPRKFVIHPESNNLIIIETDHNAYTEATKAQRKQQ 818
Query: 685 CAEE-------------SEM-------------------------HFVRLLDDQTFEFIS 706
AEE +EM +R+++ +
Sbjct: 819 MAEEMVEAAGEDERELAAEMAAAFLNENLPESIFGAPKAGNGQWASVIRVMNPIQGNTLD 878
Query: 707 TYPLDTFEYGCSILSCSFSDDSNVYYC-VGTAYVLPEENEPTKGRILVF--IVEDG-KLQ 762
L+ E S+ C FS+ + +Y VG A L G + +V G KL+
Sbjct: 879 LVQLEQNEAAFSVAVCRFSNTGDEWYVLVGVAKDLILNPRSVAGGFVYTYKLVNSGEKLE 938
Query: 763 LIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY-- 820
+ + + ++ F G++L + + +++Y D G ++L +C + HI A Y
Sbjct: 939 FLHKTPVEEVPAAIAPFQGRVLIGVGKLLRVY-----DLGKKKLLRKC-ENKHI-ANYIC 991
Query: 821 -VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNF 879
+QT G ++V D+ +S + YK E + A D W++ +LD D GA+
Sbjct: 992 GIQTIGHRVIVSDVQESFIWVRYKRNENQLIIFADDTYPRWVTTATLLDYDTVAGADKFG 1051
Query: 880 NLFTVR--KNSEGATDE---------ERGRL-------EVVGEYHLGEFVNRFRHGSLVM 921
N+ VR N+ DE +RG L EV+ YH+GE V + +L+
Sbjct: 1052 NICVVRLPPNTNDEVDEDPTGNKALWDRGLLNGASQKAEVIMNYHVGETVLSLQKTTLI- 1110
Query: 922 RLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWR 980
G ++++ T++G IG++ HE + F + ++ +LR + G +H +R
Sbjct: 1111 ------PGGSESLVYTTLSGGIGILVPFTSHEDHDFFQHVEMHLRSEHPPLCGRDHLSFR 1164
Query: 981 SFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
S+ KN +DGDL E F + + +++ ++ + E+ K++E++
Sbjct: 1165 SY-----YFPVKNVIDGDLCEQFNSMEPNKQKNVAEELDRTPPEVSKKLEDI 1211
>gi|296417673|ref|XP_002838477.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295634416|emb|CAZ82668.1| unnamed protein product [Tuber melanosporum]
Length = 1202
Score = 212 bits (539), Expect = 9e-52, Method: Compositional matrix adjust.
Identities = 260/1162 (22%), Positives = 481/1162 (41%), Gaps = 154/1162 (13%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+L + ++G I L FR G ++D++ + ++ + +L++ + ++ +
Sbjct: 59 VLTIDVFGIIRCLAAFRLAGSSKDYVIVGSDSGRITILEYQPQVNKFHRLHLETFGKSGV 118
Query: 61 RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL- 117
R GQ DP R LI + L V+ + + +L + + + L +
Sbjct: 119 RRVIPGQFLACDPKGRACLISSVEKNKLVYVLNRNAQAELTISSPLEAHKPHCLVFALVA 178
Query: 118 --YGCAKPTIVVLYQD----NKDARHVKT---------YEVALKDKDFVEGPWSQNNLDN 162
G P L D ++DA V YE+ L V WS + +D
Sbjct: 179 LDVGYDNPVFAALEVDYGESDQDATGVAYENIEKMLVYYELDLGLNHVVRK-WS-DTVDR 236
Query: 163 GADLLIPVPPPL---CGVLIIGEETIVYCS-ANAFKAIPI------------RPSITKAY 206
A++L VP GVL+ E++I Y + +PI R +I
Sbjct: 237 SANILFQVPGGTDGPSGVLVCAEDSIYYRHMGRSTHRVPIPRRRGACEDPNRRRTIISGV 296
Query: 207 GRVDADGSRYLLGDHAGLLHLLVITHEKEK-VTGLKIELLGETSIASTISYLDNAVVYIG 265
+LL G L + + ++ E V+ +K++ IAS++ L + +++
Sbjct: 297 MHKMRGAFFFLLQTEDGDLLKVTLDYDDESGVSRIKVKYFDTVPIASSLCILKSGFLFVA 356
Query: 266 SSYGDSQLIKL---------------NLQPDAKGSYVEV------------LERYVNLGP 298
S G+ + N PD V + +E ++ P
Sbjct: 357 SEGGNHHFYQFEKLGDDDEEIEFSSDNFSPDPLDRTVPIYFHPRAPENLSLVEAIDSMNP 416
Query: 299 IVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSLRSST 357
+VD + +L Q+ T G+ + R +++G+ E S EL G+ +W+ + +
Sbjct: 417 LVDCKIANLTDDDAPQIYTVCGSGPRSTFRTLKHGLETTEIVSSELPGVPSAVWTTKLTN 476
Query: 358 DDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSV 417
D FD ++V+SF + T +L++ + +EE GF S TL + L+QV +
Sbjct: 477 GDEFDAYIVLSFTNGTLVLSIG--ETVEEVTDTGFLSSAPTLAVQQLGEDALLQVHPKGI 534
Query: 418 RLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTEVKHA-Q 475
R + + R NEW +P S+ A+ N+ QV +A G +VY E+ DG L E + +
Sbjct: 535 RHIRADRRV--NEWPAPQHRSIVAASTNSRQVAVALSSGEIVYFEMDSDGQLAEYEEKKE 592
Query: 476 LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPRSVL 534
+ ++CL + + E S AVG D +VRI SL PD L +K P S+
Sbjct: 593 MSGTVTCLSLGEVPEGRVRSSFLAVGC-DDATVRILSLDPDSTLESKSVQALTSAPSSLC 651
Query: 535 LCAFEGIS---------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSS 585
+ + + YL L G L +L+ TGELTD + LG +P+ + +
Sbjct: 652 IMSMPDSASETSRSLTLYLHIGLYSGVYLRTVLDTVTGELTDTRTRFLGPKPVKIFKVIA 711
Query: 586 KNTTHVFAASDRPTVIYSSNK-KLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIG 644
+ V A S RP + YS + + + + + + F+S P+ + + L I
Sbjct: 712 QGLPSVLALSSRPWLGYSDARGQFMLTPLTYPMLEWGWTFSSEQCPEGMVGIQGQNLRIF 771
Query: 645 TIDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAIC-----------------SLKNQSCA 686
TI+ + L SIPL PR++ F + S +N
Sbjct: 772 TIEKLTDNLQQESIPLSYTPRKMLKHPDQPVFYVAEADANTLSVATKQKLMNSAQNGDTN 831
Query: 687 EESEMHF------------VRLLDDQTFEFISTYPLDTFEYGCSILSCSFSD-DSNVYYC 733
E + F + ++D + E L+ E SI + SFS D +
Sbjct: 832 ELAPAEFGNPRGEGHWASCISVVDPISKEVTQRIELEENEAAFSIAAVSFSSQDDEWFLV 891
Query: 734 VGTAY---VLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQK 790
VGT + P ++ I F+ + +L+ I + + +L F G+LLA I +
Sbjct: 892 VGTGKDTCLSPRKSANGYIHIYRFLNDGKELEFIHKTKVDEPPLALLGFQGRLLAGIGKD 951
Query: 791 IQLYKWMLRDDGTRELQSEC-GHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAI 849
+++Y D G ++L + G + +QT+G I+V D+ +S++ ++YK+++ +
Sbjct: 952 LKIY-----DLGMKQLLRKAQGQVAPNVINGLQTQGSRIIVSDVQESVTYVVYKYQDNRL 1006
Query: 850 EERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDEERG--------- 897
A D + + ++D + G + N + VR K S+ + ++ G
Sbjct: 1007 IPFADDMIPRFTTCTTMVDYETVAGGDRFGNFWIVRCPQKASDESDEDPAGGHLIHERSY 1066
Query: 898 ------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIAS-LP 950
RL ++ ++ + + LV G T+++ + G IG++ +
Sbjct: 1067 LQGAPNRLNLMCHFYPQDIPTSVQKAQLV-------TGGRDTLVWTGLQGTIGLMVPFVS 1119
Query: 951 HEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTR 1010
E F + L+ ++R + G +H +RS+ V K +DGDL E++ LSR +
Sbjct: 1120 REDVDFFQTLEQHMRTEDAPIAGRDHLIYRSY-----YVPVKGVIDGDLCENYSLLSRDK 1174
Query: 1011 MDEISKTMNVSVEELCKRVEEL 1032
I+ ++ SV E+ +++ ++
Sbjct: 1175 KAMIAAELDRSVREVERKIADM 1196
>gi|327286386|ref|XP_003227911.1| PREDICTED: splicing factor 3B subunit 3-like [Anolis carolinensis]
Length = 1217
Score = 212 bits (539), Expect = 9e-52, Method: Compositional matrix adjust.
Identities = 270/1193 (22%), Positives = 496/1193 (41%), Gaps = 195/1193 (16%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+L V ++G I +L FR G +D++ + ++ + +L++ S + + + + G
Sbjct: 53 LLTVEVFGIIRSLMAFRLTGGTKDYIVVGSDSGRIVILEYQP-SKNVFEKIHQETFGKSG 111
Query: 61 -RPTDNGQIGIIDPDCRLIGLHLYDG--LFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
R GQ +DP R + + + L ++ D +L + + + L +
Sbjct: 112 CRRIVPGQYLAVDPKGRAVMISAIEKQKLVYILNRDAAARLTISSPLEAHKANTLVYHVV 171
Query: 118 ---YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNG 163
G P L D ++A + T E A + F E N+ L+
Sbjct: 172 GVDVGFENPMFACLEMDYEEADNDPTGEAAANTQQTLTFYELDLGLNHVVRKYSEPLEEH 231
Query: 164 ADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR--- 215
+ LI VP P GVLI E I Y N IR I + +D D R
Sbjct: 232 GNFLITVPGGSDGP--SGVLICSENYITY--KNFGDQPDIRCPIPRRRNDLD-DPERGMI 286
Query: 216 --------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAV 261
+L G + + + +++ VT ++++ +A+ + L
Sbjct: 287 FVCSATHKTKSMFFFLAQTEQGDIFKITLETDEDMVTEIRLKYFDTVPVAAAMCVLKTGF 346
Query: 262 VYIGSSYGDSQLIKLN------------------------LQPDAKGSYVEVLERYVNLG 297
+++ S +G+ L ++ QP + V +++ +L
Sbjct: 347 LFVASEFGNHYLYQIAHLGDDDEEPEFSSAMPLEEGDTFFFQPRPLKNLV-LVDELDSLS 405
Query: 298 PIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSS 356
PI+ + DL + Q+ G SLR++R+G+ ++E A EL G +W++R
Sbjct: 406 PILCCQIADLANEDTPQLYVACGRGPRSSLRVLRHGLEVSEMAVSELPGNPNAVWTVRRH 465
Query: 357 TDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGS 416
+D FD +++VSF++ T +L++ + +EE GF T TL C + LVQV
Sbjct: 466 VEDEFDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSLLGEDALVQVYPDG 523
Query: 417 VRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKHA 474
+R + + R NEWK+P ++ N QV++A GG LVY E+ G L E +
Sbjct: 524 IRHIRADKR--VNEWKTPGKKTIVKCAVNQRQVVIALTGGELVYFEMDPSGQLNEYTERK 581
Query: 475 QLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIP-RSV 533
++ ++ C+ + + S+ AVG+ D +VRI SL + + + L + +P +
Sbjct: 582 EMSADVVCMSLANVPPGEQRSRFLAVGL-VDNTVRIISLDPSDCL--QPLSMQALPAQPE 638
Query: 534 LLCAFE----------------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQP 577
LC E G YL L +G LL +L+ TG+L+D + LG++P
Sbjct: 639 SLCIVEMGGAEKQDELGERGSIGFLYLNIGLQNGVLLRTVLDPVTGDLSDTRTRYLGSRP 698
Query: 578 ITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAK 637
+ L + V A S R + YS + + ++ + + F S P+ +
Sbjct: 699 VKLFRVRMQGQEAVLAMSSRSWLSYSYQSRFHLTPLSYETLEFASGFASEQCPEGIVAIS 758
Query: 638 EGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQEQSR-----------TFAICSLKNQS 684
L I ++ + + + + PL PR+ + H E + T A + + Q
Sbjct: 759 TNTLRILALEKLGAVFNQVAFPLQYTPRKFVIHPESNNLIIIETDHNAYTEATKAQRKQQ 818
Query: 685 CAEE-------------SEM-------------------------HFVRLLDDQTFEFIS 706
AEE +EM VR+++ +
Sbjct: 819 MAEEMVEAAGEDERELAAEMAAAFLNENLPESIFGAPKAGNGQWASVVRVMNPIQGNTLD 878
Query: 707 TYPLDTFEYGCSILSCSFS---DDSNVYYCVGTAYVLPEENEPTKGRILVF-IVEDG-KL 761
L+ E S+ C F+ DD +V V +L G + + +V +G KL
Sbjct: 879 LVQLEQNEAAFSVAVCRFANTGDDWHVLVGVAKDLIL-NPRSVAGGFVYTYKLVNNGEKL 937
Query: 762 QLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY- 820
+ + + + ++ F G++L + + +++Y D G ++L +C + HI A Y
Sbjct: 938 EFMHKTPVEEVPAAIAPFQGRILIGVGKLLRVY-----DLGKKKLLRKC-ENKHI-ANYI 990
Query: 821 --VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENN 878
+QT G ++V D+ +S + YK E + A D + W++ +LD D GA+
Sbjct: 991 CGIQTIGHRVIVSDVQESFIWVRYKRNENQLIIFADDTHPRWVTTACLLDYDTVAGADKF 1050
Query: 879 FNLFTVR--KNSEGATDE---------ERGRL-------EVVGEYHLGEFVNRFRHGSLV 920
N+ VR N+ DE +RG L EV+ YH+GE V + +L+
Sbjct: 1051 GNICVVRLPPNTNDEVDEDPTGNKALWDRGLLNGASQKAEVIMNYHVGETVLSLQKTTLI 1110
Query: 921 MRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQW 979
G ++++ T++G IG++ HE + F + ++ +LR + G +H +
Sbjct: 1111 -------PGGSESLVYTTLSGGIGILVPFTSHEDHDFFQHVEMHLRSEHPPLCGRDHLSF 1163
Query: 980 RSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
RS+ KN +DGDL E F + + +S+ ++ + E+ K++E++
Sbjct: 1164 RSY-----YFPVKNVIDGDLCEQFNSMEPNKQKNVSEELDRTPPEVSKKLEDI 1211
>gi|326927039|ref|XP_003209702.1| PREDICTED: splicing factor 3B subunit 3-like [Meleagris gallopavo]
gi|363738006|ref|XP_001232348.2| PREDICTED: splicing factor 3B subunit 3 [Gallus gallus]
Length = 1217
Score = 212 bits (539), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 267/1192 (22%), Positives = 495/1192 (41%), Gaps = 193/1192 (16%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+L V ++G I +L FR G +D++ + ++ + +L++ S + + + + G
Sbjct: 53 LLTVEVFGVIRSLMAFRLTGGTKDYIVVGSDSGRIVILEYQP-SKNVFEKIHQETFGKSG 111
Query: 61 -RPTDNGQIGIIDPDCRLIGLHLYDG--LFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
R GQ +DP R + + + L ++ D +L + + + L +
Sbjct: 112 CRRIVPGQYLAVDPKGRAVMISAIEKQKLVYILNRDAAARLTISSPLEAHKANTLVYHVV 171
Query: 118 ---YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNG 163
G P L D ++A + T E A + F E N+ L+
Sbjct: 172 GVDVGFENPMFACLEMDYEEADNDPTGEAAANTQQTLTFYELDLGLNHVVRKYSEPLEEH 231
Query: 164 ADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR--- 215
+ LI VP P GVLI E I Y N IR I + +D D R
Sbjct: 232 GNFLITVPGGSDGP--SGVLICSENYITY--KNFGDQPDIRCPIPRRRNDLD-DPERGMI 286
Query: 216 --------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAV 261
+L G + + + +++ VT ++++ +A+ + L
Sbjct: 287 FVCSATHKTKSMFFFLAQTEQGDIFKITLETDEDMVTEIRLKYFDTVPVAAAMCVLKTGF 346
Query: 262 VYIGSSYGDSQLIKLN------------------------LQPDAKGSYVEVLERYVNLG 297
+++ S +G+ L ++ QP + V +++ +L
Sbjct: 347 LFVASEFGNHYLYQIAHLGDDDEEPEFSSAMPLEEGDTFFFQPRPLKNLV-LVDELDSLS 405
Query: 298 PIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSS 356
PI+ + DL + Q+ G SLR++R+G+ ++E A EL G +W++R
Sbjct: 406 PILCCQIADLANEDTPQLYVACGRGPRSSLRVLRHGLEVSEMAVSELPGNPNAVWTVRRH 465
Query: 357 TDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGS 416
+D FD +++VSF++ T +L++ + +EE GF T TL C + LVQV
Sbjct: 466 VEDEFDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSLLGDDALVQVYPDG 523
Query: 417 VRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKHA 474
+R + + R NEWK+P ++ N QV++A GG LVY E+ G L E +
Sbjct: 524 IRHIRADKR--VNEWKTPGKKTIVKCAVNQRQVVIALTGGELVYFEMDPSGQLNEYTERK 581
Query: 475 QLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIP-RSV 533
++ ++ C+ + + S+ AVG+ D +VRI SL + + + L + +P +
Sbjct: 582 EMSADVVCMSLANVPPGEQRSRFLAVGL-VDNTVRIISLDPSDCL--QPLSMQALPAQPE 638
Query: 534 LLCAFE----------------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQP 577
LC E G YL L +G LL +L+ TG+L+D + LG++P
Sbjct: 639 SLCIVEMGGTEKQDELGERGSIGFLYLNIGLQNGVLLRTVLDPVTGDLSDTRTRYLGSRP 698
Query: 578 ITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAK 637
+ L + V A S R + YS + + ++ + + F S P+ +
Sbjct: 699 VKLFRVRMQGQEAVLAMSSRSWLSYSYQSRFHLTPLSYETLEFASGFASEQCPEGIVAIS 758
Query: 638 EGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQEQSR-----------TFAICSLKNQS 684
L I ++ + + + + PL PR+ + H E + T A + + Q
Sbjct: 759 TNTLRILALEKLGAVFNQVAFPLQYTPRKFVIHPESNNLIIIETDHNAYTEATKAQRKQQ 818
Query: 685 CAEE-------------SEM-------------------------HFVRLLDDQTFEFIS 706
AEE +EM +R+++ +
Sbjct: 819 MAEEMVEAAGEDERELAAEMAAAFLNENLPESIFGAPKAGNGQWASVIRVMNPIQGNTLD 878
Query: 707 TYPLDTFEYGCSILSCSFSDDSNVYYC-VGTAY-VLPEENEPTKGRILVFIVEDG--KLQ 762
L+ E S+ C FS+ +Y VG A ++ G + + + +G KL+
Sbjct: 879 LVQLEQNEAAFSVAVCRFSNTGEEWYVLVGVAKDLILNPRSVAGGFVYTYKLVNGGEKLE 938
Query: 763 LIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY-- 820
+ + + ++ F G++L + + +++Y D G ++L +C + HI A Y
Sbjct: 939 FLHKTPVEEVPAAIAPFQGRVLIGVGKLLRVY-----DLGKKKLLRKC-ENKHI-ANYIC 991
Query: 821 -VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNF 879
+QT G ++V D+ +S + YK E + A D W++ +LD D GA+
Sbjct: 992 GIQTIGHRVIVSDVQESFIWVRYKRNENQLIIFADDTYPRWVTTATLLDYDTVAGADKFG 1051
Query: 880 NLFTVR--KNSEGATDE---------ERGRL-------EVVGEYHLGEFVNRFRHGSLVM 921
N+ VR N+ DE +RG L EV+ YH+GE V + +L+
Sbjct: 1052 NICVVRLPPNTNDEVDEDPTGNKALWDRGLLNGASQKAEVIMNYHVGETVLSLQKTTLI- 1110
Query: 922 RLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWR 980
G ++++ T++G IG++ HE + F + ++ +LR + G +H +R
Sbjct: 1111 ------PGGSESLVYTTLSGGIGILVPFTSHEDHDFFQHVEMHLRSEHPPLCGRDHLSFR 1164
Query: 981 SFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
S+ KN +DGDL E F + + +++ ++ + E+ K++E++
Sbjct: 1165 SY-----YFPVKNVIDGDLCEQFNSMEPNKQKNVAEELDRTPPEVSKKLEDI 1211
>gi|301604148|ref|XP_002931705.1| PREDICTED: splicing factor 3B subunit 3 [Xenopus (Silurana)
tropicalis]
Length = 1217
Score = 211 bits (538), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 264/1190 (22%), Positives = 494/1190 (41%), Gaps = 189/1190 (15%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+L V ++G I +L FR G +D++ + ++ + +L++ S + + + + G
Sbjct: 53 LLTVEVFGVIRSLMAFRLTGGTKDYIVVGSDSGRIIILEYHP-SKNMFEKIHQETFGKSG 111
Query: 61 -RPTDNGQIGIIDPDCRLIGLHLYD--GLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
R GQ +DP R + + + L ++ D +L + + + L +
Sbjct: 112 CRRIVPGQFIAVDPKGRAVMISAIEKQKLVYILNRDAAARLTISSPLEAHKANTLVYHVV 171
Query: 118 ---YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNG 163
G P L D ++A + T E A + F E N+ L+
Sbjct: 172 GVDVGFENPMFACLEMDYEEADNDPTGEAAANTQQTLTFYELDLGLNHVVRKYSEPLEEH 231
Query: 164 ADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR--- 215
+ LI VP P GVLI E I Y N IR I + +D D R
Sbjct: 232 GNFLITVPGGSDGP--SGVLICSENYITY--KNFGDQPDIRCPIPRRRNDLD-DPERGMI 286
Query: 216 --------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAV 261
+L G + + + +++ VT ++++ +A+ + L
Sbjct: 287 FVCSATHKTKSMFFFLAQTEQGDIFKITLETDEDMVTEIRLKYFDTVPVAAAMCVLKTGF 346
Query: 262 VYIGSSYGDSQLIKLN------------------------LQPDAKGSYVEVLERYVNLG 297
+++ S +G+ L ++ QP + V V E+ +L
Sbjct: 347 LFVASEFGNHYLYQIAHLGDDDDEPEFSSAMPLEEGDTFFFQPRPLKNLVLVDEQD-SLS 405
Query: 298 PIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSS 356
PI+ + DL + Q+ G SLR++R+G+ ++E A EL G +W++R
Sbjct: 406 PIMSCQIADLANEDTPQLYVACGRGPRSSLRVLRHGLEVSEMAVSELPGNPNAVWTVRRH 465
Query: 357 TDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGS 416
+D +D +++VSF++ T +L++ + +EE GF T TL C + LVQV
Sbjct: 466 IEDEYDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSLLGEDALVQVYPDG 523
Query: 417 VRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKHA 474
+R + + R NEWK+P + N QV++A GG LVY E+ G L E +
Sbjct: 524 IRHIRADKR--VNEWKTPGKKIIVKCAVNQRQVVIALTGGELVYFEMDPSGQLNEYTERK 581
Query: 475 QLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIP-RSV 533
++ ++ C+ + + S+ AVG+ D +VRI SL + + + L + +P +
Sbjct: 582 EMSADVVCMSLANVPPGEQRSRFLAVGL-VDNTVRIISLDPSDCL--QPLSMQALPAQPE 638
Query: 534 LLCAFE----------------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQP 577
LC E G YL L +G LL +L+ TG+L+D + LG++P
Sbjct: 639 ALCIVEMGGAERQDELGERGSIGFLYLNIGLQNGVLLRTVLDPVTGDLSDTRTRYLGSRP 698
Query: 578 ITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAK 637
+ L + V A S R + YS + + ++ + + + F S P+ +
Sbjct: 699 VKLFRVRMQGQEAVLAMSSRSWLSYSYQSRFHLTPLSYETLEYASGFASEQCPEGIVAIS 758
Query: 638 EGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQEQSR-----------TFAICSLKNQS 684
L I ++ + + + + PL PR+ + H E + T A + + Q
Sbjct: 759 TNTLRILALEKLGAVFNQVAFPLQYTPRKFVIHPESNNLIIIETDHNAYTEATKAQRKQQ 818
Query: 685 CAEE-------------SEM-------------------------HFVRLLDDQTFEFIS 706
AEE +EM +R+++ +
Sbjct: 819 MAEEMVEAAGEDERELAAEMAAAFLNENLPESIFGAPKAGNGQWASVIRVMNPIQGNTLD 878
Query: 707 TYPLDTFEYGCSILSCSFSDDSNVYYC-VGTAY-VLPEENEPTKGRILVF-IVEDG-KLQ 762
L+ E S+ C F++ + +Y VG A ++ G + + +V +G KL+
Sbjct: 879 LVQLEQNEAAFSVAVCRFTNTGDDWYVLVGVAKDLILNPRSVAAGLVYTYKLVNNGEKLE 938
Query: 763 LIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSEC-GHHGHILALYV 821
+ + + ++ F G++L + + +++Y D G ++L +C H + +
Sbjct: 939 FVHKTPVEEVPAAIAPFQGRVLIGVGKLLRIY-----DLGKKKLLRKCENKHISNFIVGI 993
Query: 822 QTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNL 881
QT G +++ D+ +S + YK E + A D W++ +LD D GA+ N+
Sbjct: 994 QTIGQRVILSDVQESFIWVRYKRNENQLIIFADDTYPRWVTTSCLLDYDTVAGADKFGNI 1053
Query: 882 FTVR--KNSEGATDE---------ERGRL-------EVVGEYHLGEFVNRFRHGSLVMRL 923
VR N+ DE +RG L EV+ YH+GE V + +L+
Sbjct: 1054 CIVRLPSNTNDDVDEDPTGNKALWDRGLLNGASQKAEVIVNYHVGETVLSVQKTTLI--- 1110
Query: 924 PDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSF 982
G ++++ T++G IG++ HE + F + ++ +LR + G +H +RS+
Sbjct: 1111 ----PGGSESLVYTTLSGGIGILVPFTSHEDHDFFQHVEMHLRSEHPPICGRDHLSFRSY 1166
Query: 983 NNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
KN +DGDL E F + + +++ ++ + E+ K++E++
Sbjct: 1167 -----YFPVKNVIDGDLCEQFNSMEPVKQKSVAEELDRTPPEVSKKLEDI 1211
>gi|6006515|emb|CAB56791.1| spliceosomal protein SAP 130 [Homo sapiens]
Length = 1217
Score = 211 bits (538), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 271/1193 (22%), Positives = 495/1193 (41%), Gaps = 195/1193 (16%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+L V ++G I +L FR G +D++ + ++ + +L++ S + + + + G
Sbjct: 53 LLTVEVFGVIRSLMAFRLTGGTKDYIVVGSDSGRIVILEYQP-SKNMFEKIHQETFGKSG 111
Query: 61 -RPTDNGQIGIIDPDCRLIGLHLYDG--LFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
R GQ +DP R + + + L ++ D +L + + + L +
Sbjct: 112 CRRIVPGQFLAVDPKGRAVMISAIEKQKLVYILNRDAAARLTISSPLEAHKANTLVYHVV 171
Query: 118 ---YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNG 163
G P L D ++A + T E A + F E N+ L+
Sbjct: 172 GVDVGFENPMFACLEMDYEEAGNDPTGEAAANTQQTLTFYELDLGLNHVVRKYSEPLEEH 231
Query: 164 ADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR--- 215
+ LI VP P GVLI E I Y N IR I + +D D R
Sbjct: 232 GNFLITVPGGSDGP--SGVLICSENYITY--KNFGDQPDIRCPIPRRRNDLD-DPERGMI 286
Query: 216 --------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAV 261
+ G + + + +++ VT ++++ +A+ + L
Sbjct: 287 FVCSATHKTKSMFFFWAQTEQGDIFKITLETDEDMVTEIRLKYFDTVPVAAAMCVLKTGF 346
Query: 262 VYIGSSYGDSQLIKLN------------------------LQPDAKGSYVEVLERYVNLG 297
+++ S +G+ L ++ QP + V +++ +L
Sbjct: 347 LFVASEFGNHYLYQIAHLGDDDEEPEFSSAMPLEEGDTFFFQPRPLKNLV-LVDELDSLS 405
Query: 298 PIVDFC-VVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRS 355
PI+ FC + DL + Q+ G SLR++R+G+ ++E A EL G +W++R
Sbjct: 406 PIL-FCQIADLANEDTPQLYVACGRGPRSSLRVLRHGLEVSEMAVSELPGNPNAVWTVRR 464
Query: 356 STDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSG 415
+D FD +++VSF++ T +L++ + +EE GF T TL C + LVQV
Sbjct: 465 HIEDEFDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSLLGDDALVQVYPD 522
Query: 416 SVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKH 473
+R + + R NEWK+P ++ N QV++A GG LVY E+ G L E +
Sbjct: 523 GIRHIRADKR--VNEWKTPGKKTIVKCAVNQRQVVIALTGGELVYFEMDPSGQLNEYTER 580
Query: 474 AQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIP-RS 532
++ ++ C+ + + S+ AVG+ D +VRI SL + + + L + +P +
Sbjct: 581 KEMSADVVCMSLANVPPGEQRSRFLAVGL-VDNTVRIISLDPSDCL--QPLSMQALPAQP 637
Query: 533 VLLCAFE----------------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQ 576
LC E G YL L +G LL +L+ TG+L+D + LG++
Sbjct: 638 ESLCIVEMGGTEKQDELGERGSIGFLYLNIGLQNGVLLRTVLDPVTGDLSDTRTRYLGSR 697
Query: 577 PITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIA 636
P+ L + V A S R + YS + + ++ + + F S P+ +
Sbjct: 698 PVKLFRVRMQGQEAVLAMSSRSWLSYSYQSRFHLTPLSYETLEFASGFASEQCPEGIVAI 757
Query: 637 KEGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQEQSR-----------TFAICSLKNQ 683
L I ++ + + + + PL PR+ + H E + T A + + Q
Sbjct: 758 STNTLRILALEKLGAVFNQVAFPLQYTPRKFVIHPESNNLIIIETDHNAYTEATKAQRKQ 817
Query: 684 SCAEE-------------SEM-------------------------HFVRLLDDQTFEFI 705
AEE +EM +R+++ +
Sbjct: 818 QMAEEMVEAAGEDERELAAEMAAAFLNENLPESIFGAPKAGNGQWASVIRVMNPIQGNTL 877
Query: 706 STYPLDTFEYGCSILSCSFSDDSNVYYC-VGTAYVLPEENEPTKGRILVF--IVEDG-KL 761
L+ E S+ C FS+ +Y VG A L G + +V +G KL
Sbjct: 878 DLVQLEQNEAAFSVAVCRFSNTGEDWYVLVGVAKDLILNPRSVAGGFVYTYKLVNNGEKL 937
Query: 762 QLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY- 820
+ + + + ++ F G++L + + +++Y D G ++L +C + HI A Y
Sbjct: 938 EFLHKTPVEEVPAAIAPFQGRVLIGVGKLLRVY-----DLGKKKLLRKC-ENKHI-ANYI 990
Query: 821 --VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENN 878
+QT G ++V D+ +S + YK E + A D W++ +LD D GA+
Sbjct: 991 SGIQTIGHRVIVSDVQESFIWVRYKRNENQLIIFADDTYPRWVTTASLLDYDTVAGADKF 1050
Query: 879 FNLFTVR--KNSEGATDE---------ERGRL-------EVVGEYHLGEFVNRFRHGSLV 920
N+ VR N+ DE +RG L EV+ YH+GE V + +L+
Sbjct: 1051 GNICVVRLPPNTNDEVDEDPTGNKALWDRGLLNGASQKAEVIMNYHVGETVLSLQKTTLI 1110
Query: 921 MRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQW 979
G ++++ T++G IG++ HE + F + ++ +LR + G +H +
Sbjct: 1111 -------PGGSESLVYTTLSGGIGILVPFTSHEDHDFFQHVEMHLRSEHPPLCGRDHLSF 1163
Query: 980 RSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
RS+ KN +DGDL E F + + +S+ ++ + E+ K++E++
Sbjct: 1164 RSY-----YFPVKNVIDGDLCEQFNSMEPNKQKNVSEELDRTPPEVSKKLEDI 1211
>gi|410297890|gb|JAA27545.1| splicing factor 3b, subunit 3, 130kDa [Pan troglodytes]
Length = 1217
Score = 211 bits (538), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 272/1193 (22%), Positives = 495/1193 (41%), Gaps = 195/1193 (16%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+L V ++G I +L FR G +D++ + ++ + +L++ S + + + + G
Sbjct: 53 LLTVEVFGVIRSLMAFRLTGGTKDYIVVGSDSGRIVILEYQP-SKNMFEKIHQETFGKSG 111
Query: 61 -RPTDNGQIGIIDPDCRLIGLHLYDG--LFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
R GQ DP R + + + L ++ D +L + + + L +
Sbjct: 112 CRRIVPGQFLAGDPKGRAVMISAIEKQKLVYILNRDAAARLTISSPLEAHKANTLVYHVV 171
Query: 118 ---YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNG 163
G P L D ++A + T E A + F E N+ L+
Sbjct: 172 GVDVGFENPMFACLEMDYEEADNDPTGEAAANTQQTLTFYELDLGLNHVVRKYSEPLEEH 231
Query: 164 ADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR--- 215
+ LI VP P GVLI E I Y N IR I + +D D R
Sbjct: 232 GNFLITVPGGSDGP--SGVLICSENYITY--KNFGDQPDIRCPIPRRRNDLD-DPERGMI 286
Query: 216 --------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAV 261
+L G + + + +++ VT ++++ +A+ + L
Sbjct: 287 FVCSATHKTKSMFFFLAQTEQGDIFKITLETDEDMVTEIRLKYFDTVPVAAAMCVLKTGF 346
Query: 262 VYIGSSYGDSQLIKLN------------------------LQPDAKGSYVEVLERYVNLG 297
+++ S +G+ L ++ QP + V +++ +L
Sbjct: 347 LFVASEFGNHYLYQIAHLGDDDEEPEFSSAMPLEEGDTFFFQPRPLKNLV-LVDELDSLS 405
Query: 298 PIVDFC-VVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRS 355
PI+ FC + DL + Q+ G SLR++R+G+ ++E A EL G +W++R
Sbjct: 406 PIL-FCQIADLANEDTPQLYVACGRGPRSSLRVLRHGLEVSEMAVSELPGNPNAVWTVRR 464
Query: 356 STDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSG 415
+D FD +++VSF++ T +L++ + +EE GF T TL C + LVQV
Sbjct: 465 HIEDEFDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSLLGDDALVQVYPD 522
Query: 416 SVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKH 473
+R + + R NEWK+P ++ N QV++A GG LVY E+ G L E +
Sbjct: 523 GIRHIRADKR--VNEWKTPGKKTIVKCAVNQRQVVIALTGGELVYFEMDPSGQLNEYTER 580
Query: 474 AQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIP-RS 532
++ ++ C+ + + S+ AVG+ D +VRI SL + + + L + +P +
Sbjct: 581 KEMSADVVCMSLANVPPGEQRSRFLAVGL-VDNTVRIISLDPSDCL--QPLSMQALPAQP 637
Query: 533 VLLCAFE----------------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQ 576
LC E G YL L +G LL +L+ TG+L+D + LG++
Sbjct: 638 ESLCIVEMGGTEKQDELGERGSIGFLYLNIGLQNGVLLRTVLDPVTGDLSDTRTRYLGSR 697
Query: 577 PITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIA 636
P+ L + V A S R + YS + + ++ + + F S P+ +
Sbjct: 698 PVKLFRVRMQGQEAVLAMSSRSWLSYSYQSRFHLTPLSYETLEFASGFASEQCPEGIVAI 757
Query: 637 KEGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQEQSR-----------TFAICSLKNQ 683
L I ++ + + + + PL PR+ + H E + T A + + Q
Sbjct: 758 STNTLRILALEKLGAVFNQVAFPLQYTPRKFVIHPESNNLIIIETDHNAYTEATKAQRKQ 817
Query: 684 SCAEE-------------SEM-------------------------HFVRLLDDQTFEFI 705
AEE +EM +R+++ +
Sbjct: 818 QMAEEMVEAAGEDERELAAEMAAAFLNENLPESIFGAPKAGNGQWASVIRVMNPIQGNTL 877
Query: 706 STYPLDTFEYGCSILSCSFSDDSNVYYC-VGTAYVLPEENEPTKGRILVF--IVEDG-KL 761
L+ E S+ C FS+ +Y VG A L G + +V +G KL
Sbjct: 878 DLVQLEQNEAAFSVAVCRFSNTGEDWYVLVGVAKDLILNPRSVAGGFVYTYKLVNNGEKL 937
Query: 762 QLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY- 820
+ + + + ++ F G++L + + +++Y D G ++L +C + HI A Y
Sbjct: 938 EFLHKTPVEEVPAAIAPFQGRVLIGVGKLLRVY-----DLGKKKLLRKC-ENKHI-ANYI 990
Query: 821 --VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENN 878
+QT G ++V D+ +S + YK E + A D W++ +LD D GA+
Sbjct: 991 SGIQTIGHRVIVSDVQESFIWVRYKRNENQLIIFADDTYPRWVTTASLLDYDTVAGADKF 1050
Query: 879 FNLFTVR--KNSEGATDE---------ERGRL-------EVVGEYHLGEFVNRFRHGSLV 920
N+ VR N+ DE +RG L EV+ YH+GE V + +L+
Sbjct: 1051 GNICVVRLPPNTNDEVDEDPTGNKALWDRGLLNGASQKAEVIMNYHVGETVLSLQKTTLI 1110
Query: 921 MRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQW 979
G ++++ T++G IG++ HE + F + ++ +LR + G +H +
Sbjct: 1111 -------PGGSESLVYTTLSGGIGILVPFTSHEDHDFFQHVEMHLRSEHPPLCGRDHLSF 1163
Query: 980 RSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
RS+ KN +DGDL E F + + +S+ ++ + E+ K++E++
Sbjct: 1164 RSY-----YFPVKNVIDGDLCEQFNSMEPNKQKNVSEELDRTPPEVSKKLEDI 1211
>gi|302831461|ref|XP_002947296.1| hypothetical protein VOLCADRAFT_73165 [Volvox carteri f. nagariensis]
gi|300267703|gb|EFJ51886.1| hypothetical protein VOLCADRAFT_73165 [Volvox carteri f. nagariensis]
Length = 1221
Score = 211 bits (536), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 248/1050 (23%), Positives = 432/1050 (41%), Gaps = 191/1050 (18%)
Query: 136 RHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPL---CGVLIIGEETIVYCSANA 192
+H+ YE+ L + + W++ +DNGA+LL+ VP GVL+ E I+Y + +
Sbjct: 205 KHLTFYEMDLGLNNVLR-KWTEP-IDNGANLLVAVPGGADGPGGVLVCAENFIIYKNQDH 262
Query: 193 FKAIPIRPSITKAYGRVDADGSR-----------------YLLGDHAGLLHLLVITHEKE 235
+ + P R D G R +L+ G ++ + + +E E
Sbjct: 263 EEVRAVIPR------RSDLPGDRGVLIVSYATHKKKAYSFFLVQSEYGDIYKVTLAYEGE 316
Query: 236 KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKL------------------- 276
VT LKI+ ++I+ L ++ S YG+ L +
Sbjct: 317 AVTELKIKYFDTIPPCTSIAVLKTGFLFAASEYGNHALYQFVGTGEDDEDVESSSAALVQ 376
Query: 277 ---NLQP----DAKGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRI 329
QP + +++ +L PI D V +L + Q+ G SL +
Sbjct: 377 TEEGFQPVFFEPRPLKNLLLIDEMASLMPITDMKVANLLNEEIPQIYALCGHGPRASLSV 436
Query: 330 VRNGIGINEQASVELQGIK-GMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETE 388
+R G+ + E A L G +W++R + D FD F+VVSF + T L ++ +E++ET
Sbjct: 437 LRPGLAVTELAVSPLPGAPTAVWTVRRNATDEFDAFIVVSFANAT--LVFSIGEEVKETN 494
Query: 389 IEGFCSQTQTLFCHDAIYNQLVQVT----------SGSVRLVSSTSRELRNEWKSPPGYS 438
GF TL N ++Q T G +R + R NEWK P +
Sbjct: 495 ESGFLGTVPTLHTQLLADNSMLQATFFDLSEPQVYPGGLRHIKPDRR--INEWKVPGRRN 552
Query: 439 VNVATANASQVLLATGGGHLVY--LEIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQ 496
+ A +N QV +A GG + LE+G G+L E + L ++SCL+I P+ E S
Sbjct: 553 IKAAASNEKQVAIALQGGEVTVFELEVGTGLLVEAEKRDLGEDVSCLEIPPVPEGLVRSS 612
Query: 497 IAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPRSVLLC---------AFEGISYLLC 546
AVG + D +VR+ SL P L +P S+L+ A EG +L
Sbjct: 613 FLAVGTY-DQTVRVLSLDPGQGLKNLSMQALNSVPESILMLYNTGPGTERATEGGLFLHV 671
Query: 547 ALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNK 606
L +G L+ ++ +G+LTD + LGT+P L + + + A S RP + Y+
Sbjct: 672 GLQNGILIRSEVDRISGQLTDSRTRFLGTKPPKLFAAAVRGNRCMLALSSRPWLGYNDQG 731
Query: 607 KLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDI-QKLHIRSIPLGEHPRR 665
+ S ++ + + + F S P+ + L I ++++ + ++ L PRR
Sbjct: 732 RFNLSPLSYEALDYASGFASDQCPEGFVAVVKSTLRILAVENVGDAFNTQACRLRYTPRR 791
Query: 666 ICHQEQSRTFAICSLKNQSC---------------AEES----------------EMHF- 693
+ +++ I N + AEE E F
Sbjct: 792 LLIHPETKLLMIAESDNAAIPLAEREDLQAKLAALAEEEGAPVQGVEFNDELAALEEQFG 851
Query: 694 ------------VRLLDDQTFEFISTYPLDTFEYGCSILSCSFSD-------------DS 728
+RL+D T + +D E +I+S + D
Sbjct: 852 APKGQPGQWAGCLRLVDPATLSTVFVTEVDNNE---AIVSMALVDLVLPAGPGFGGTEKL 908
Query: 729 NVYYCVGTAYVLPEENEPTKGRILVFIVEDG--KLQLIAEKETKGAVY-SLNAFNGKLLA 785
V C +P + E I V+ + DG +L+L+ + G V +L F G+LLA
Sbjct: 909 LVVGCAKGLRYMPTDCE--AAYIRVYRLADGGKRLELVHKTIVDGGVPGALCGFKGRLLA 966
Query: 786 AINQKIQLYKWMLRDDGTRELQSECGHH---GHILALYVQTRGDFIVVGDLMKSISLLIY 842
+ ++LY D G ++L +C ++ I+ + VQ G I VGD +S+ ++ Y
Sbjct: 967 GVGPTLRLY-----DMGKKKLLRKCEYNRLPHQIMNITVQ--GPRIYVGDAQESVHMMRY 1019
Query: 843 KHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTV----RKNSEGATDEERG- 897
K + A A D +++ + LD D L A + F F V R+ S+ D+ G
Sbjct: 1020 KKADNAFYIFADDIAPRYLTTILPLDYDT-LAAGDKFGNFVVLRLPREASQQVEDDPTGG 1078
Query: 898 --------------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVI 943
+LE V ++H+G+ + + + G +++ TV G I
Sbjct: 1079 KMAAASGRLNGAPHKLEEVVKFHVGDTITSLQRAEM-------QAGGQEVLLYSTVMGAI 1131
Query: 944 GVIASLPH-EQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIES 1002
GV+ + E F L+ +LR+ + G +H +RS ++ +DGDL
Sbjct: 1132 GVLYPFTNREDVDFFGHLEMHLRQEHPPLCGRDHLSFRS-----AYFPVRSCVDGDLCGQ 1186
Query: 1003 FLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
+ + + I++ M+ + E+ K++E++
Sbjct: 1187 YASIPAKKQQMIAEAMDRTPGEMLKKLEDI 1216
>gi|261196131|ref|XP_002624469.1| splicing factor 3B subunit 3 [Ajellomyces dermatitidis SLH14081]
gi|239587602|gb|EEQ70245.1| splicing factor 3B subunit 3 [Ajellomyces dermatitidis SLH14081]
gi|239614558|gb|EEQ91545.1| splicing factor 3B subunit 3 [Ajellomyces dermatitidis ER-3]
gi|327356791|gb|EGE85648.1| pre-mRNA-splicing factor rse1 [Ajellomyces dermatitidis ATCC 18188]
Length = 1209
Score = 210 bits (535), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 264/1172 (22%), Positives = 486/1172 (41%), Gaps = 177/1172 (15%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
++G I TL FR G ++D++ I ++ + ++++ S R + + G R
Sbjct: 64 VFGIIRTLAAFRLAGSSKDYIIIGSDSGRITIIEY-VPSQNRFNRIHLETFGKSGIRRVI 122
Query: 65 NGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YG 119
GQ +DP R LI + L V+ + + +L + + + Q L G
Sbjct: 123 PGQYLAVDPKGRACLIASVEKNKLVYVLNRNTQAELTISSPLEAHKPQTLVFALAALDVG 182
Query: 120 CAKPTIVVLYQDNKDA-------------RHVKTYEVALKDKDFVEGPWSQNNLDNGADL 166
P L + +A +H+ YE+ L V WS + +D A +
Sbjct: 183 YDNPIFAALEVEYAEADQDPTGQAYEELEKHLVYYELDLGLNHVVRK-WS-DPVDRSATM 240
Query: 167 LIPVPPPL---CGVLIIGEETIVYCSAN--AFKA-IPIRPSITKAYGRVDADGSRYLLGD 220
L VP G L+ E+ I Y +N AF+ IP R T+ + + RY+
Sbjct: 241 LFQVPGGADGPSGCLVCAEDNITYRHSNQDAFRVPIPRRSGPTE-----NPERKRYIT-- 293
Query: 221 HAGLLHLL---------------------VITHEKEKVTG----LKIELLGETSIASTIS 255
AG++H + ++ + ++TG LK++ IAS++
Sbjct: 294 -AGVMHKMRGAFFFLLQTEDGDLFKVTIDMVEDDNGQITGEVRRLKLKYFDTVPIASSLC 352
Query: 256 YLDNAVVYIGSSYGDSQLIKLNL--------------------QPDAKGSY-------VE 288
L + +++ S G+ + +P A + V
Sbjct: 353 ILKSGFLFVASETGNHHFYQFEKLGDDDDEIEFTSDSYSADPSEPLAPAYFRPRPYENVN 412
Query: 289 VLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI- 347
++E +L P++D V +L Q+ T G S R +++G+ ++E EL +
Sbjct: 413 LVESINSLNPLMDCKVANLTDDDAPQIYTICGTGARSSFRTLKHGLEVSEIVESELPSVP 472
Query: 348 KGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYN 407
+W+ + + +D FD ++++SF + T +L++ + +EE GF S TL +
Sbjct: 473 SAVWTTKLTREDEFDAYIILSFSNGTLVLSIG--ETVEEVTDTGFLSSAPTLAVQQLGED 530
Query: 408 QLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DG 466
L+QV +R + + R NEW +P S+ AT N QV +A G +VY E+ DG
Sbjct: 531 SLIQVHPKGIRHIHADRR--VNEWPAPQHRSIVAATTNERQVAVALSSGEIVYFEMDTDG 588
Query: 467 ILTEV-KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHL 524
L E + ++ ++CL + + + + S AVG D +VRI SL PD L K
Sbjct: 589 SLAEYDEKREMSGTVTCLSLGEVPKGRARSSFLAVGC-DDSTVRILSLDPDSTLENKSVQ 647
Query: 525 GGEIIPRSVLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPI 578
P ++ + A + YL L G L +L+ TGEL+D + LG +P+
Sbjct: 648 ALTSAPSALSIMAMSDSTSGGSTLYLHIGLYSGIYLRTVLDEVTGELSDTRTRFLGLKPV 707
Query: 579 TLRTFSSKNTTHVFAASDRPTVIYS--SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIA 636
L S K + A S RP + YS K + + ++ + F+S + +
Sbjct: 708 KLFQVSVKEQKAILALSSRPWLGYSDLQTKAFMLTPLDYVGLEWGWNFSSEQCIEGMVGI 767
Query: 637 KEGELTIGTIDDIQ-KLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVR 695
+ L I +++ ++ L +IPL PR + F + N + + +
Sbjct: 768 QGQNLRIFSLEKLENNLLQETIPLQYTPRHFIKHPEHPLFYVIEADNNILSPGTRTKLLN 827
Query: 696 ----------LLDDQTFEF----------------------ISTYPLDTFEYGCSILSCS 723
+L + F + IS L+ E S+ + +
Sbjct: 828 DSGAVNGDAMVLPPEEFGYPRGTGHWASCIQIVDPVNSQSVISQIELEENEAAVSVAAVT 887
Query: 724 FSD-DSNVYYCVGTAY-VLPEENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVYSLNAF 779
FS D + VGT ++ + G I V+ E+GK L+ I + + + +L F
Sbjct: 888 FSSQDDETFLVVGTGKDMVVNPRSSSAGFIYVYRFQEEGKELEFIHKTKVEQPPMALLGF 947
Query: 780 NGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISL 839
G+LLA I +++Y +R R+ Q+ H L + +QT+G I+V D+ +S++
Sbjct: 948 QGRLLAGIGTDVRIYDLGMR-QMLRKCQASVVPH---LVVGLQTQGSRIIVSDVQESVTY 1003
Query: 840 LIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDEER 896
++YK++E + D + W + ++D + G + NL+ +R K SE A ++
Sbjct: 1004 VVYKYQENRLIPFVDDVISRWTTCTTMVDYETVAGGDKFGNLWLLRCPEKASEEADEDGS 1063
Query: 897 G---------------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNG 941
G RL +V ++ + + LV G +++ + G
Sbjct: 1064 GAHLIHERQYLQGAPNRLNLVAHFYPQDLPTSIQKAQLV-------TGGRDVLVWTGLQG 1116
Query: 942 VIGV-IASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLI 1000
+ + I + E+ F + L+ L + G +H +RS+ AK +DGDL
Sbjct: 1117 TVAMLIPFVSREEVDFFQSLEMQLSSQNPPLAGRDHLIYRSY-----YAPAKGTIDGDLC 1171
Query: 1001 ESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
E++L L + +I+ ++ SV E+ +++ ++
Sbjct: 1172 ETYLLLPNDKKQQIAGELDRSVREIERKIADM 1203
>gi|70992737|ref|XP_751217.1| nuclear mRNA splicing factor [Aspergillus fumigatus Af293]
gi|74670386|sp|Q4WLI5.1|RSE1_ASPFU RecName: Full=Pre-mRNA-splicing factor rse1
gi|66848850|gb|EAL89179.1| nuclear mRNA splicing factor, putative [Aspergillus fumigatus Af293]
Length = 1225
Score = 210 bits (535), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 273/1166 (23%), Positives = 477/1166 (40%), Gaps = 171/1166 (14%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
++G I TL FR G ++D++ I ++ + ++++ S R + + G R
Sbjct: 64 VFGIIRTLAAFRLAGSSKDYIIIGSDSGRITIIEY-VPSQNRFNRIHLETFGKSGVRRVV 122
Query: 65 NGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YG 119
GQ +DP R LI + L V+ +++ +L + + + Q + G
Sbjct: 123 PGQYLAVDPKGRACLIASVEKNKLVYVLNRNSQAELTISSPLEAHKPQTVVFAMTALDVG 182
Query: 120 CAKPTIVVLYQDNKDARHVKT-------------YEVALKDKDFVEGPWSQNNLDNGADL 166
P L D +A T YE+ L V W+ + +D A +
Sbjct: 183 YENPIFAALEVDYSEADQDPTGRAYEESEKLLVYYELDLGLNHVVRK-WA-DPVDRTASM 240
Query: 167 LIPVPPPL---CGVLIIGEETIVYCSAN--AFKA-IPIRPSITKAYGRVDADGSRYLLGD 220
L VP GVL+ E+ I Y +N AF+ IP R T+ R + + +
Sbjct: 241 LFQVPGGADGPSGVLVCAEDNITYRHSNQDAFRVPIPRRIGATENPERKRSIVAGVMHKM 300
Query: 221 HAGLLHLL-------------VITHEKEKVTG----LKIELLGETSIASTISYLDNAVVY 263
LL ++ + ++TG LKI+ IAS++ L + +Y
Sbjct: 301 RGAFFFLLQTEDGDLFKVTIDMVEDDNGQLTGEVKRLKIKYFDTVPIASSLLILKSGFLY 360
Query: 264 IGSSYGDSQLIKLN----------------------------LQPDAKGSYVEVLERYVN 295
+ S G+ + QP + ++E +
Sbjct: 361 VASEAGNHHFYQFEKLGDDDEETEFNSENFPADLSVPCEPVFFQPRG-AENLNLVETLNS 419
Query: 296 LGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSLR 354
L P++D +V+L Q+ T SG+ S R +++G+ ++E EL + +W+ +
Sbjct: 420 LNPLIDSKIVNLNEDDAPQIYTVSGSGARSSFRTLKHGLEVSEIVESELPSVPSAVWTTK 479
Query: 355 SSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTS 414
+ D FD ++++SF + T +L++ + +EE GF S TL + L+QV
Sbjct: 480 LTRADEFDAYIILSFANGTLVLSIG--ETVEEVTDTGFLSTAPTLAVQQLGEDSLIQVHP 537
Query: 415 GSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTEV-K 472
+R + + R NEW +P S+ A N QV +A G +VY E+ DG L E +
Sbjct: 538 RGIRHILADRR--VNEWPAPQHRSIVAAATNERQVAVALSSGEIVYFEMDADGTLAEYDE 595
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPR 531
Q+ ++CL + + E S AVG D +VRI SL PD L K P
Sbjct: 596 RRQMSGTVTCLSLGEVPEGRVRSSFLAVGC-DDSTVRILSLDPDSTLENKSVQALTSAPS 654
Query: 532 SVLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSS 585
++ + + S YL L G L +L+ TGEL+D + LG +P+ L S
Sbjct: 655 ALNIMSMADSSSGGTTLYLHIGLYSGVYLRTVLDEVTGELSDTRTRFLGAKPVKLFRVSV 714
Query: 586 KNTTHVFAASDRPTVIYS--SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTI 643
K T V A S RP + YS K + + ++ + F+S + + + L I
Sbjct: 715 KGQTAVLALSSRPWLGYSDIQTKGFMLTPLDYVGLEWGWNFSSEQCVEGMVGIQAQNLRI 774
Query: 644 GTIDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVR------- 695
+I+ + + SIPL PRR+ + F + N + + +
Sbjct: 775 FSIEKLDNNILQESIPLSNTPRRMLKHPEQPLFYVIESDNNVLSPATRARLIEDSKARNG 834
Query: 696 ---LLDDQTFEF----------------------ISTYPLDTFEYGCSILSCSFSD-DSN 729
+L + F + IST L+ E S+ + FS D
Sbjct: 835 ETNVLPPEDFGYPRATGHWASCIQIVDPLDAKAVISTIELEENEAAVSMAAVPFSSQDDE 894
Query: 730 VYYCVGTAYVLPEENEPTKGRILVFIV---EDGK-LQLIAEKETKGAVYSLNAFNGKLLA 785
+ VGTA + N P+ + I EDGK L+ I + + + +L F G+LLA
Sbjct: 895 TFLVVGTAKDM-IVNPPSSAGGFIHIYRFQEDGKELEFIHKTKVEEPPLALLGFQGRLLA 953
Query: 786 AINQKIQLYKWMLRDDGTRELQSECGHHGHILALYV---QTRGDFIVVGDLMKSISLLIY 842
I +++Y D G ++L +C +++ + QT+G IVV D+ +S++ ++Y
Sbjct: 954 GIGSTLRIY-----DLGMKQLLRKC--QAQVVSKTIVGLQTQGSRIVVSDVRESVTYVVY 1006
Query: 843 KHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDE----- 894
K+++ + D + W ++ ++D + G + NL+ VR K SE A ++
Sbjct: 1007 KYQDNILIPFVDDSVSRWTTSTTMVDYETVAGGDKFGNLWLVRCPKKASEEADEDGSGAH 1066
Query: 895 ---ERG-------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIG 944
ERG RL+++ + + LV G +++ G IG
Sbjct: 1067 LIHERGYLHGAPNRLDLMIHTYTQDIPTSLHKTQLV-------AGGRDILVWTGFQGTIG 1119
Query: 945 VIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESF 1003
++ + E F + L+ L + G +H +RS+ K V +DGDL E +
Sbjct: 1120 MLVPFVSREDVDFFQNLEMQLASQCPPLAGRDHLIYRSYYAPVKGV-----IDGDLCEMY 1174
Query: 1004 LDLSRTRMDEISKTMNVSVEELCKRV 1029
L I+ ++ SV E+ +++
Sbjct: 1175 FLLPNDTKMMIAAELDRSVREIERKI 1200
>gi|312072035|ref|XP_003138882.1| hypothetical protein LOAG_03297 [Loa loa]
gi|307765956|gb|EFO25190.1| hypothetical protein LOAG_03297 [Loa loa]
Length = 1197
Score = 210 bits (535), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 256/1164 (21%), Positives = 495/1164 (42%), Gaps = 170/1164 (14%)
Query: 7 YGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTDN 65
+G I +L FR G +D++ + ++ + +L+++ + R + + G R
Sbjct: 60 FGIIRSLLAFRLTGGTKDYIVVGSDAGRIVILEYNTQKV-CFERVHQETFGKTGCRRIVP 118
Query: 66 GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YGC 120
GQ IDP R LIG L ++ D L + + + + + G
Sbjct: 119 GQFLAIDPKGRAILIGAVERQKLVYIMNRDASANLTISSPLEAHKSHCICYSVVGVDVGF 178
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALK------------DKDFVEGPWSQNNLDNGADLLI 168
PT L D ++ H T +A K + V +++ +D G ++LI
Sbjct: 179 ENPTFACLEVDYEEVDHDSTGHLATKIPQTLTFYELDLGLNHVVRKYAEPLVDKG-NILI 237
Query: 169 PVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR-------- 215
VP P GV++ E +VY N I+ + + +D D R
Sbjct: 238 SVPGGQDGPS--GVIVCCENYLVY--KNLGDQPDIKCPVPRRRNELD-DCDRTVIIVCAA 292
Query: 216 ---------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGS 266
+L+ G + + + E + VT LKI+ +++ + L ++ S
Sbjct: 293 THKTKLMYFFLVQTDQGDIFKVTLESEHDIVTELKIKYFDTIPVSNAMCILKTGFLFTAS 352
Query: 267 SYGDSQLIKL--------------NLQPDAKGSY---------VEVLERYVNLGPIVDFC 303
+G+ L ++ +Q + ++ + V+++ +L P++
Sbjct: 353 EFGNHHLYQIAHLGDEDDEPEFSSRMQLEEGETFFFAPRGLTNLAVVDQMDSLSPLISSY 412
Query: 304 VVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTDDPFD 362
+ DL + Q+ T G +++++RNG+ + E A EL G +W+++ + DD FD
Sbjct: 413 IDDLANEDSPQIYTLIGRGALSAVKVLRNGLEVTEMAVSELPGNPNAVWTVKRNIDDKFD 472
Query: 363 TFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSS 422
+ +VVSF++ T +L++ + +EE GF T TL C + L+QV +R + +
Sbjct: 473 SHIVVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLGCALIGDDALLQVYPDGIRHIRA 530
Query: 423 TSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKHAQLEYEI 480
R NEWK+P ++ N QV +A GG LVY E+ G L E + +L ++
Sbjct: 531 DRRV--NEWKAPGKRTIMKCALNRRQVAIALAGGELVYFELDVTGQLNEYTERRELPADV 588
Query: 481 SCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPD---LNLITKEHLGGEIIPRSVLLCA 537
C+ ++ I E S+ VG+ D +VRI SL L+ ++ + L E P S+++
Sbjct: 589 LCMSLSEIPEGELRSRFLTVGL-ADKTVRIISLDPQDCLSPLSMQALPSE--PESIIVLE 645
Query: 538 FEGIS-------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTH 590
G +L L +G LL ++ TGELTD + LGT+ + L ++
Sbjct: 646 MFGTETQSASTVHLNIGLQNGCLLRTTVDQVTGELTDNRTRYLGTKSVKLFHVRIQSKDA 705
Query: 591 VFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQ 650
+ AAS R ++Y + + ++ + F+S P+ + E L I +++ +
Sbjct: 706 IMAASSRAWLLYDYQSRFHLTPLSYAALEFAAGFSSEQCPEGIVAIAENTLRILSLEKLG 765
Query: 651 KLHIRSI-PLGEHPRR-ICHQ---------------------EQSRTFAICSLKNQSCAE 687
+ + PL PRR + H+ E+ + A ++ AE
Sbjct: 766 AVFNHVVHPLDYTPRRMVVHKASGNLIIIENDHAAFTVKGKMERRKQLADELMEVAKEAE 825
Query: 688 ESEMHFVRLLDDQTFEFIST--------------YPLDTFEYGCSILSCSFSDDSNVYYC 733
E++ V+ + D ++ ST +P E +I F + S+ +
Sbjct: 826 EADQQAVKEMADAIRKWASTVRVMRSNDGETLSHFPFAEDEAAFAIAMVQFQNQSDTQFV 885
Query: 734 -VGTAYVLP-EENEPTKGRILVFIVEDG--KLQLIAEKETKGAVYSLNAFNGKLLAAINQ 789
VG L + + G I F++ LQL+ T V +++ F G LA + +
Sbjct: 886 LVGCGCELQLKPRKANGGCIYTFLLAANGTTLQLLHRTATDEVVNAIHDFRGMALAGVGK 945
Query: 790 KIQLYKWMLRDDGTRELQSECGHHG-HILALYVQTRGDFIVVGDLMKSISLLIYKHEEGA 848
K++LY D G R+L ++C + + +++ G IVV D +S+ + YK ++G
Sbjct: 946 KVRLY-----DLGKRKLLAKCENRQIPTQVVDIRSMGQRIVVSDSQESVHFMRYKKQDGQ 1000
Query: 849 IEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDE------------ER 896
+ + + +++ V +LD D + + F V + +G T+E +R
Sbjct: 1001 LSIFCDETSPRYVTCVCLLDYDT-VAVGDRFGNIAVLRLPKGVTEEVQEDPTGVRALWDR 1059
Query: 897 G-------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIAS- 948
G +LE + ++G+ + + SLV G + + T++G+IG++
Sbjct: 1060 GNLNGASQKLEAIAHLYIGDAITSMQKTSLV-------PGANDCLCYTTISGIIGILVPF 1112
Query: 949 LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSR 1008
+ +++ F + L+ ++R + G +H +RS+ K+ +DGDL E + +
Sbjct: 1113 MSRDEFEFFQNLEMHMRVEYPPLCGRDHLAYRSY-----YFPVKSIIDGDLCEQYSLMPL 1167
Query: 1009 TRMDEISKTMNVSVEELCKRVEEL 1032
+ + + + E+ K++E++
Sbjct: 1168 DKQKSVGEELGRKSTEIHKKLEDI 1191
>gi|340959394|gb|EGS20575.1| putative pre-mRNA splicing protein [Chaetomium thermophilum var.
thermophilum DSM 1495]
Length = 1213
Score = 210 bits (535), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 267/1169 (22%), Positives = 476/1169 (40%), Gaps = 168/1169 (14%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
I+G I + FR G +D++ +AT+ + ++++ + + + R
Sbjct: 65 IFGIIRAMAAFRLAGSHKDYIILATDSGRIAIIEYLPKENRFQRIHLETFGKSGVRRVIP 124
Query: 66 GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YGC 120
GQ DP R LI + L V+ + + +L + + + V+ + + G
Sbjct: 125 GQYLAADPKGRACLIASVEKNKLVYVLNRNAQAELTISSPLEAHKPGVIVLSLVALDVGY 184
Query: 121 AKPTIVVLYQDNKDARHVKT-------------YEVALKDKDFVEGPWSQNNLDNGADLL 167
+ P L + +A T YE+ L V WS + +D + LL
Sbjct: 185 SNPVFAALEYEYSEADQDPTGQAAKQLEMQLVYYELDLGLNHVVRK-WS-DTVDPTSSLL 242
Query: 168 IPVP-----PPLCGVLIIGEETIVYCSAN--AFK-AIPIRPSITK--AYGRVDADGSRYL 217
VP P GVL+ GEE I Y +N AF+ IP R T+ R G +
Sbjct: 243 FQVPGGNDGP--SGVLVCGEENITYRHSNQEAFRVPIPRRRGATEDPNRKRTIVAGVMHK 300
Query: 218 LGDHAGLLHLLVITH--------------EKEKVTG----LKIELLGETSIASTISYLDN 259
L AG L+ T EK TG +KI+ IA ++ L +
Sbjct: 301 LKGSAGAFFFLLQTEDGDLFKVTIDMVEDEKGNPTGEVKRVKIKYFDTVPIAHSLCILKS 360
Query: 260 AVVYIGSSYGDSQLIKL---------------NLQPDAKGSYVEV------LERYV---- 294
+++ S +G+ + + D Y V LE V
Sbjct: 361 GFLFVASEFGNHHFYQFEKLGDDDDEPEFTSDDFPADWNAPYNPVYFKPRPLENLVLVES 420
Query: 295 --NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMW 351
++ P+V V +L + Q+ G S R++++G+ ++E + EL G +W
Sbjct: 421 IDSMNPLVGCKVANLTGEDAPQIYAICGNGARSSFRMLKHGLEVSEIVASELPGTPSAVW 480
Query: 352 SLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQ 411
+ + + D +D ++V+SF + T +L++ + +EE GF + TL L+Q
Sbjct: 481 TTKLTKYDEYDAYIVLSFTNATLVLSIG--ETVEEVSDSGFLTTVPTLAVQQMGEEGLIQ 538
Query: 412 VTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTE 470
+ +R + NEW +P S+ AT N +QV++A G +VY E+ DG L E
Sbjct: 539 IHPKGIRHIVQGRV---NEWPAPQHRSIVAATTNENQVVIALSSGEIVYFEMDADGSLAE 595
Query: 471 V-KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEI 528
+ Q+ ++ L + + E S AVG D +VRI SL P+ L K
Sbjct: 596 YDEKKQMSGTVTSLSLGKVPEGLRRSSFLAVGC-DDCTVRILSLDPESTLEMKSIQALTA 654
Query: 529 IPRSVLLCAFEGIS-----YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTF 583
P S+L+ + E + YL L G L +L+ TGELTD ++ LG +P L
Sbjct: 655 APSSLLIMSMEDSTGGTTLYLHIGLHSGVYLRTVLDEITGELTDTRQKFLGPKPTKLFQV 714
Query: 584 SSKNTTHVFAASDRPTVIYSS--NKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGEL 641
+ +N T V A S RP + Y++ + + + ++ E+ + FNS + + L
Sbjct: 715 TVQNQTCVLALSSRPWLGYTAPITRNFVMTPLSYTELGYTWSFNSEQCQEGMVGIHANYL 774
Query: 642 TIGTIDDIQKLHI-RSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEE------------ 688
I TI+ + + I +S PL P+R+ + F + N + E
Sbjct: 775 RIFTIEKLGQTMIQKSCPLTYTPKRLVKHPEQPYFYVIEADNNTLPPELRAQLLEQSGAV 834
Query: 689 ----------------------SEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSF-S 725
S + V + ++ + L+ E S F S
Sbjct: 835 NGDATVLPPEDFGYPKARGRWASCIEIVDPVSEEQPRVLKRIELEGNEAAVSAAVVPFAS 894
Query: 726 DDSNVYYCVGTAY-VLPEENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVYSLNAFNGK 782
D + VGT ++ T+G I V+ ++DG+ L+ I + + + F G+
Sbjct: 895 QDGESFLIVGTGKDMVLNPRASTEGAIHVYRFIDDGRDLEFIHKTIIEEPPLAFCPFQGR 954
Query: 783 LLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIY 842
LLA I + +++Y L+ R+ Q+E L + + TR + IVVGD+ ++ ++Y
Sbjct: 955 LLAGIGKMLRIYDLGLK-QLLRKAQAEVSPQ---LIVSLDTRHNRIVVGDVQHGMTYVVY 1010
Query: 843 KHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEER------ 896
K + + A D A W + ++D + G + NL+ VR + + +
Sbjct: 1011 KPDSNKLIPFADDTIARWTTCTTMVDYESVAGGDKFGNLWIVRCPERASLESDEPGSEVQ 1070
Query: 897 ------------GRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIG 944
RL+++ ++ + +LV VG +++ + G +G
Sbjct: 1071 LLHARPYLHGAPNRLDLMAHFYPQDLPTSICKTNLV-------VGGQDVLVWSGIQGTVG 1123
Query: 945 V-IASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESF 1003
V I + E F + L++++R + G +H +R + V K +DGDL E F
Sbjct: 1124 VLIPFVTREDADFFQNLESHMRAEDPPLAGRDHLIYRGY-----YVPVKGVIDGDLCERF 1178
Query: 1004 LDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
L + I+ ++ SV E+ +++ ++
Sbjct: 1179 TLLPNDKKQMIAGELDRSVREIERKISDI 1207
>gi|159130328|gb|EDP55441.1| nuclear mRNA splicing factor, putative [Aspergillus fumigatus A1163]
Length = 1225
Score = 210 bits (534), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 274/1165 (23%), Positives = 480/1165 (41%), Gaps = 169/1165 (14%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
++G I TL FR G ++D++ I ++ + ++++ S R + + G R
Sbjct: 64 VFGIIRTLAAFRLAGSSKDYIIIGSDSGRITIIEY-VPSQNRFNRIHLETFGKSGVRRVV 122
Query: 65 NGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YG 119
GQ +DP R LI + L V+ +++ +L + + + Q + G
Sbjct: 123 PGQYLAVDPKGRACLIASVEKNKLVYVLNRNSQAELTISSPLEAHKPQTVVFAMTALDVG 182
Query: 120 CAKPTIVVLYQDNKDARHVKT-------------YEVALKDKDFVEGPWSQNNLDNGADL 166
P L D +A T YE+ L V W+ + +D A +
Sbjct: 183 YENPIFAALEVDYSEADQDPTGRAYEESEKLLVYYELDLGLNHVVRK-WA-DPVDRTASM 240
Query: 167 LIPVPPPL---CGVLIIGEETIVYCSAN--AFKA-IPIRPSITKAYGRVDADGSRYLLGD 220
L VP GVL+ E+ I Y +N AF+ IP R T+ R + + +
Sbjct: 241 LFQVPGGADGPSGVLVCAEDNITYRHSNQDAFRVPIPRRIGATENPERKRSIVAGVMHKM 300
Query: 221 HAGLLHLL-------------VITHEKEKVTG----LKIELLGETSIASTISYLDNAVVY 263
LL ++ + ++TG LKI+ IAS++ L + +Y
Sbjct: 301 RGAFFFLLQTEDGDLFKVTIDMVEDDNGQLTGEVKRLKIKYFDTVPIASSLLILKSGFLY 360
Query: 264 IGSSYGDSQLIKL---------------NLQPD-----------AKGS-YVEVLERYVNL 296
+ S G+ + N D +G+ + ++E +L
Sbjct: 361 VASEAGNHHFYQFEKLGDDDEETEFNSENFPADLSVPCEPVFFRPRGAENLNLVETLNSL 420
Query: 297 GPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSLRS 355
P++D +V+L Q+ T SG+ S R +++G+ ++E EL + +W+ +
Sbjct: 421 NPLIDSKIVNLNEDDAPQIYTVSGSGARSSFRTLKHGLEVSEIVESELPSVPSAVWTTKL 480
Query: 356 STDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSG 415
+ D FD ++++SF + T +L++ + +EE GF S TL + L+QV
Sbjct: 481 TRADEFDAYIILSFANGTLVLSIG--ETVEEVTDTGFLSTAPTLAVQQLGEDSLIQVHPR 538
Query: 416 SVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTEV-KH 473
+R + + R NEW +P S+ A N QV +A G +VY E+ DG L E +
Sbjct: 539 GIRHILADRR--VNEWPAPQHRSIVAAATNERQVAVALSSGEIVYFEMDADGTLAEYDER 596
Query: 474 AQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPRS 532
Q+ ++CL + + E S AVG D +VRI SL PD L K P +
Sbjct: 597 RQMSGTVTCLSLGEVPEGRVRSSFLAVGC-DDSTVRILSLDPDSTLENKSVQALTSAPSA 655
Query: 533 VLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSK 586
+ + + S YL L G L +L+ TGEL+D + LG +P+ L S K
Sbjct: 656 LNIMSMADSSSGGTTLYLHIGLYSGVYLRTVLDEVTGELSDTRTRFLGAKPVKLFRVSVK 715
Query: 587 NTTHVFAASDRPTVIYS--SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIG 644
T V A S RP + YS K + + ++ + F+S + + + L I
Sbjct: 716 GQTAVLALSSRPWLGYSDIQTKGFMLTPLDYVGLEWGWNFSSEQCVEGMVGIQAQNLRIF 775
Query: 645 TIDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVR-------- 695
+I+ + + SIPL PRR+ + F + N + + +
Sbjct: 776 SIEKLDNNILQESIPLSNTPRRMLKHPEQPLFYVIESDNNVLSPATRARLIEDSKARNGE 835
Query: 696 --LLDDQTFEF----------------------ISTYPLDTFEYGCSILSCSFSD-DSNV 730
+L + F + IST L+ E S+ + FS D
Sbjct: 836 TNVLPPEDFGYPRATGHWASCIQIVDPLDAKAVISTIELEENEAAVSMAAVPFSSQDDET 895
Query: 731 YYCVGTAYVLPEENEPTKGRILVFIV---EDGK-LQLIAEKETKGAVYSLNAFNGKLLAA 786
+ VGTA + N P+ + I EDGK L+ I + + + +L F G+LLA
Sbjct: 896 FLVVGTAKDM-IVNPPSSAGGFIHIYRFQEDGKELEFIHKTKVEEPPLALLGFQGRLLAG 954
Query: 787 INQKIQLYKWMLRDDGTRELQSECGHHGHILALYV---QTRGDFIVVGDLMKSISLLIYK 843
I +++Y D G ++L +C +++ + QT+G IVV D+ +S++ ++YK
Sbjct: 955 IGSTLRIY-----DLGMKQLLRKC--QAQVVSKTIVGLQTQGSRIVVSDVRESVTYVVYK 1007
Query: 844 HEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDE------ 894
+++ + D + W ++ ++D + G + NL+ VR K SE A ++
Sbjct: 1008 YQDNILIPFVDDSVSRWTTSTTMVDYETVAGGDKFGNLWLVRCPKKASEEADEDGSGAHL 1067
Query: 895 --ERG-------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGV 945
ERG RL+++ + + LV G +++ G IG+
Sbjct: 1068 IHERGYLHGAPNRLDLMIHTYTQDIPTSLHKTQLV-------AGGRDILVWTGFQGTIGM 1120
Query: 946 IAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFL 1004
+ + E F + L+ L + G +H +RS+ K V +DGDL E +
Sbjct: 1121 LVPFVSREDVDFFQNLEMQLASQCPPLAGRDHLIYRSYYAPVKGV-----IDGDLCEMYF 1175
Query: 1005 DLSRTRMDEISKTMNVSVEELCKRV 1029
L I+ ++ SV E+ +++
Sbjct: 1176 LLPNDTKMMIAAELDRSVREIERKI 1200
>gi|145240731|ref|XP_001393012.1| pre-mRNA-splicing factor rse1 [Aspergillus niger CBS 513.88]
gi|134077536|emb|CAK96680.1| unnamed protein product [Aspergillus niger]
Length = 1209
Score = 209 bits (532), Expect = 6e-51, Method: Compositional matrix adjust.
Identities = 278/1168 (23%), Positives = 484/1168 (41%), Gaps = 169/1168 (14%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
++G I +L FR G ++D++ I ++ + ++++ S R + + G R
Sbjct: 64 VFGIIRSLAAFRLAGSSKDYIIIGSDSGRITIIEY-VPSQNRFNRIHLETFGKSGVRRVI 122
Query: 65 NGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YG 119
GQ +DP R +I + L V+ +++ +L + + + Q L G
Sbjct: 123 PGQYLAVDPKGRACIIASVEKNKLVYVLNRNSQAELTISSPLEAHKPQTLVFAMTALDVG 182
Query: 120 CAKPTIVVLYQDNKDA-------------RHVKTYEVALKDKDFVEGPWSQNNLDNGADL 166
P L D ++ +H+ YE+ L V WS + +D A L
Sbjct: 183 YENPVFAALEVDYSESDQDPTGQAFDELEKHLVYYELDLGLNHVVRK-WS-DPVDRTASL 240
Query: 167 LIPVPPPL---CGVLIIGEETIVYCSAN--AFKA-IPIRP----------SITKAYGRVD 210
L VP GVL+ E+++ Y +N AF+ IP R SIT
Sbjct: 241 LFQVPGGADGPSGVLVCAEDSVTYRHSNQDAFRVPIPRRSGATENPERKRSITAGVMHKM 300
Query: 211 ADGSRYLLGDHAGLLHLLVITHEKE---KVTG----LKIELLGETSIASTISYLDNAVVY 263
+LL G L L I ++ ++TG LKI+ +AS++ L + +Y
Sbjct: 301 RGAFFFLLQTEDGDLFKLTIDMVEDDNGQLTGEVKRLKIKYFDTVPLASSLLILKSGFLY 360
Query: 264 IGSSYGDSQLIKLNLQPD--------------------------AKGS-YVEVLERYVNL 296
+ S G+ + D +G+ + ++E +L
Sbjct: 361 VASEAGNHHFYQFEKLGDDDEEIEFTSESFSADPSVPCDPIYFRPRGAENLNLVETINSL 420
Query: 297 GPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSLRS 355
P++D VV++ Q+ T SGA S R +++G+ ++E EL + +W+ +
Sbjct: 421 NPLIDSKVVNITEDDAPQIYTISGAGARSSFRTLKHGLEVSEIVESELPSVPSAVWTTKL 480
Query: 356 STDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSG 415
+ D FD ++++SF + T +L++ + +EE GF S TL + L+Q+
Sbjct: 481 TRSDEFDAYIILSFANGTLVLSIG--ETVEEVTDTGFLSTAPTLAVQQLGEDSLIQIHPR 538
Query: 416 SVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTEV-KH 473
+R + + R NEW +P S+ A N QV +A G +VY E+ DG L E +
Sbjct: 539 GIRHILADRR--VNEWPAPQHRSIVAAATNERQVAVALSSGEIVYFEMDSDGSLAEYDER 596
Query: 474 AQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPRS 532
Q+ ++CL + + E S AVG D +VRI SL PD L K P +
Sbjct: 597 RQMSGTVTCLSLGEVPEGRVRSSFLAVGC-DDSTVRILSLDPDSTLENKSVQALTAAPSA 655
Query: 533 VLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSK 586
+ + + S YL L G L +L+ TGEL+D + LG++P+ L S K
Sbjct: 656 LNIMSMADSSSGGTTLYLHIGLYSGVYLRTVLDEVTGELSDTRTRFLGSKPVKLFQVSVK 715
Query: 587 NTTHVFAASDRPTVIYS--SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIG 644
T V S RP + YS K + + ++ + F+S + + + L I
Sbjct: 716 GQTAVLGLSSRPWLGYSDVQTKGFMLTPLDYVGLEWGWNFSSEQCVEGMVGIQGQNLRIF 775
Query: 645 TIDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVR-------- 695
+I+ + + +SIPL PRR + F + N A + +
Sbjct: 776 SIEKLDNNMLQQSIPLSYTPRRFLKHPEQPLFYVIESDNNVLAPSTRAKLLEDSKSRGGD 835
Query: 696 --LLDDQTFEF----------------------ISTYPLDTFEYGCSILSCSF-SDDSNV 730
+L + F + + T L+ E SI + F S D
Sbjct: 836 ETVLPPEDFGYPRATGHWASCIQVVDPLDAKAVVHTIELEENEAAISIAAVPFTSQDDET 895
Query: 731 YYCVGTAYVLPEENEP--TKGRILVF-IVEDGK-LQLIAEKETKGAVYSLNAFNGKLLAA 786
+ VGTA + N P G I ++ EDG+ L+ I + + + +L F G+L+A
Sbjct: 896 FLVVGTAKDM-TVNPPGSAGGYIHIYRFQEDGRELEFIHKTKVEEPPLALLGFQGRLVAG 954
Query: 787 INQKIQLYKWMLRDDGTRELQSECGH---HGHILALYVQTRGDFIVVGDLMKSISLLIYK 843
I +++Y D G ++L +C I+ L QT+G IVV D+ +S++ ++YK
Sbjct: 955 IGSLLRIY-----DLGMKQLLRKCQAPVVPKTIVGL--QTQGSRIVVSDVRESVTYVVYK 1007
Query: 844 HEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDE------ 894
++E + D + W +A ++D + G + NL+ +R K SE A ++
Sbjct: 1008 YQENVLIPFVDDSVSRWTTATTMVDYETTAGGDKFGNLWLLRCPKKTSEEADEDGSGAHL 1067
Query: 895 --ERG-------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGV 945
ERG RLE++ + + LV G +++ G IG+
Sbjct: 1068 IHERGYLQGTPNRLELMIHVYTQDIPTSLHKTQLV-------AGGRDILVWTGFQGTIGM 1120
Query: 946 IAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFL 1004
+ + E F + L+ L + G +H +RS+ K V +DGDL E +
Sbjct: 1121 LVPFIGREDVDFFQNLEMQLAAQHPPLAGRDHLIYRSYYAPVKGV-----IDGDLCEMYF 1175
Query: 1005 DLSRTRMDEISKTMNVSVEELCKRVEEL 1032
L I+ ++ SV E+ +++ ++
Sbjct: 1176 LLPNDTKMMIAAELDRSVREIERKISDM 1203
>gi|307109500|gb|EFN57738.1| hypothetical protein CHLNCDRAFT_56079 [Chlorella variabilis]
Length = 1144
Score = 209 bits (532), Expect = 7e-51, Method: Compositional matrix adjust.
Identities = 269/1152 (23%), Positives = 480/1152 (41%), Gaps = 194/1152 (16%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
++G I +L FR G QD++ ++ + +LQ++ E + TR + + G R
Sbjct: 57 VFGIIRSLAPFRFPGAQQDYVICGSDSGRIVILQYNKEKN-CFTRIHTETFGKSGCRRIV 115
Query: 65 NGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEE-------LQVLDIK 115
G+ +DP R +I V+ DN +L + + + L LD+
Sbjct: 116 PGEYLAVDPKGRACMIAAVEKSKFVYVLNRDNDARLTISSPLEAHKGSNLCYALTGLDMG 175
Query: 116 FLYGCAKPTIVVLYQDNKDA-------------RHVKTYEVALKDKDFVEGPWSQNNLDN 162
F P + D +A +H+ YE+ L V +S+ +DN
Sbjct: 176 F----DNPVFAAIELDYAEADTDPTGEAASEAQKHLTMYELDLGLNHVVR-KYSEP-VDN 229
Query: 163 GADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHA 222
GA++LIPVP G G+ + + +P +G +
Sbjct: 230 GANMLIPVPG--AGDGPGGQRAVWGGPCARAEKLPACMHACSEFGDI------------- 274
Query: 223 GLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLN-LQPD 281
+ + + +E E+V L+I+ A++I + ++ S +GD L + + L D
Sbjct: 275 ---YRVTLDYEGEQVKELRIKYFDSIPPATSICLMRKGFLFAASEFGDHALYQFSSLGDD 331
Query: 282 AKG--------------------------SYVEVLERYVNLGPIVDFCVVDLERQGQGQV 315
+G S +E L+R +L PI+D V +L + Q+
Sbjct: 332 DEGGVESSSATLMETEEGYQPVFFDPRPLSNLEALDRMDSLAPILDMKVANLAGEEIPQI 391
Query: 316 VTCSGAYKDGSLRIVRNGIGI------NEQASVELQG-IKGMWSLRSSTDDPFDTFLVVS 368
G +LR++R G+ + +E A L G +W+++ S D FD ++VVS
Sbjct: 392 YAACGRGSRSTLRVLRPGLAVTGKRAAHEMAVSPLPGNPTAVWTIKRSVGDEFDAYIVVS 451
Query: 369 FISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELR 428
F + T +L++ + +EE GF TL + ++QV +R + R
Sbjct: 452 FSNATLVLSIG--ETVEEVNDSGFNGNVPTLQTQLLADDSMLQVYPNGLRHIRPDRR--I 507
Query: 429 NEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTEVKHAQLEYEISCLDINP 487
NEW++P ++ AT N QV +A GGG ++Y E+ G+L E + ++ +++CLD+ P
Sbjct: 508 NEWRAPGRKTIVKATTNERQVAIALGGGEVIYFELNPQGMLVESEKREMGGDVACLDVAP 567
Query: 488 IGENPSYSQIAAVGMWTDISVRIFSL-PD--LNLITKEHLGGEIIPRSVLLC-------- 536
+ E + AVGM+ D + R+ SL PD L +++ + +G P SVLL
Sbjct: 568 VPEGRTRCAFLAVGMY-DGAARVLSLQPDSTLKVLSTQAVGA--TPESVLLLDSPLMGKD 624
Query: 537 -----AFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHV 591
A G +L L +G LL ++ TG+L+D + LGT+P L F++
Sbjct: 625 GTEEGAGSGALFLQVGLVNGVLLRTEVDRVTGQLSDTRTRFLGTKPPKL--FAAA-VPEG 681
Query: 592 FAASDRPTVIYSSNKKL--LYSNVNLKEVSHMCPFNSAAFPD----SLAIAKEGELTIGT 645
F A R T+ + ++L ++ +L+ P PD ++A A + +
Sbjct: 682 FVAVARNTLRVITLERLGEFFNQQSLR--LRYTPRKFVIHPDLKVLAIAEADHAAIPLAQ 739
Query: 646 IDDIQKLHIRSIPLGEHPRRICHQEQSRTFAIC------SLKNQSCAEESE--------- 690
+D+Q+ RR + + + A + ++ A E +
Sbjct: 740 REDLQQ------------RRDGMEAEQQGGATAAAAAGPEMDEEAAAREDQWGAPKGEPG 787
Query: 691 --MHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSD--DSNVYYCVGTAYVL---PEE 743
VR++D + + ++ E S+ F + VGTA L P+E
Sbjct: 788 QWASCVRIVDPVSLQTTHCIEMEDNEAALSVCLVEFDSHPEHGTLLAVGTAQGLKFYPKE 847
Query: 744 NEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLY----KWMLR 799
+ + F+ + +++L+ + +G ++ AF G+LL ++ ++LY K MLR
Sbjct: 848 CQNGFVHLYRFLDDGKRIELLHKTAVEGVPGAMAAFKGRLLVGVDAVLRLYDMGKKRMLR 907
Query: 800 DDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNAN 859
R L + I L+V G I VGD +S + YK + A D
Sbjct: 908 KCEYRRLPT------RIATLHVS--GSRIYVGDGQESTFFMRYKKGDNQFYIFADDIVPR 959
Query: 860 WMSAVEILDDDIYLGAENNFNLFTVRKNSE-----------GATDEERG-------RLEV 901
++A LD D GA+ N+F R E G E G +L
Sbjct: 960 HVTAALHLDYDTLAGADRFGNVFVSRLPQEVSAQVEDDPTGGKYATETGLLGGAPNKLRT 1019
Query: 902 VGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKL 960
+ +H+GE V + L G +++GT+NG IGV+ E F + L
Sbjct: 1020 INSFHVGETVTALQRAVL-------QPGGRELIVYGTINGAIGVLYPFTSKEDCDFFQHL 1072
Query: 961 QTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV 1020
+ ++R+ + G +H +RSF K+ +DGDL E + L+ + +++ ++
Sbjct: 1073 EMHMRQEHPPLLGRDHLAYRSF-----YFPVKDVVDGDLCEQYPQLAADKARGVAEELDR 1127
Query: 1021 SVEELCKRVEEL 1032
S E+ K++E++
Sbjct: 1128 SPGEVLKKLEDI 1139
>gi|255946770|ref|XP_002564152.1| Pc22g01070 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211591169|emb|CAP97395.1| Pc22g01070 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 1209
Score = 209 bits (532), Expect = 7e-51, Method: Compositional matrix adjust.
Identities = 288/1169 (24%), Positives = 489/1169 (41%), Gaps = 171/1169 (14%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
++G I +L FR G ++D++ I ++ + ++++ S R + + G R
Sbjct: 64 VFGIIRSLAAFRVAGSSKDYIIIGSDSGRITIVEY-VPSQNRFNRIHMETFGKSGVRRVV 122
Query: 65 NGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YG 119
GQ +DP R LI + L V+ +++ +L + + + Q L + G
Sbjct: 123 PGQYLAVDPKGRACLIASVEKNKLVYVLNRNSQAELTISSPLEAHKPQNLVFAMICLDVG 182
Query: 120 CAKPTIVVLYQDNKDARHVKT-------------YEVALKDKDFVEGPWSQNNLDNGADL 166
P L D +A T YE+ L V WS + +D A +
Sbjct: 183 YDNPVFAALEVDYSEADQDPTGQAYEEIEKVLVYYELDLGLNHVVRK-WS-DPVDRTASM 240
Query: 167 LIPVP-----PPLCGVLIIGEETIVYCSAN--AFKA-IPIRPSITKAYGRVDADGSR--- 215
L VP P GVL+ E+ + Y +N AF+ IP R T+ R + S
Sbjct: 241 LFQVPGGDDGP--SGVLVCAEDNVTYRHSNQGAFRVPIPRRSGATENPERKRSIVSGVMH 298
Query: 216 -------YLLGDHAG---LLHLLVITHEKEKVTG----LKIELLGETSIASTISYLDNAV 261
+LL G L+L ++ ++ K TG L I+ +++ + L +
Sbjct: 299 KMRGAFFFLLQTEDGDLFKLNLEMVEDKQGKPTGEVKRLTIKYFDTIPVSTNLLILKSGF 358
Query: 262 VYIGSSYGDS---QLIKLNLQPD-----------------------AKGS-YVEVLERYV 294
+Y S G+ Q KL P +G+ V ++E
Sbjct: 359 LYAASESGNHNFYQFEKLGDDPSEAFFSSDSFSADPSVPLTPIYFHPRGAENVNLMESMN 418
Query: 295 NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSL 353
+L P++D +++L Q+ T G+ + R +++G+ ++E +LQ + +W+
Sbjct: 419 SLSPLIDSKILNLSEDDAPQIYTICGSGARSTFRTLKHGLEVSEIVESDLQQVPSAVWTT 478
Query: 354 RSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVT 413
+ + D F T++++SF + T +L++ + +EE GF S TL + LVQV
Sbjct: 479 KLTRGDEFHTYIILSFANGTLVLSIG--EIVEEVSDTGFLSSAPTLAVQQLGEDSLVQVH 536
Query: 414 SGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTEV- 471
+R + + R NEW +P S+ A AN QV +A G +VY E+ DG L E
Sbjct: 537 PRGIRHILADQRV--NEWPAPQHRSIVAAAANERQVAVALSSGEIVYFEMDADGTLAEYD 594
Query: 472 KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIP 530
+ Q+ ++ L + + E S AVG D +VRI SL PD L K P
Sbjct: 595 ERRQMSGTVTSLSMGEVPEGRMRSSFLAVGC-DDSTVRILSLNPDSTLENKSVQALTSAP 653
Query: 531 RSVLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFS 584
++ + A S YL L G L +L+ TGEL+D + +G +P+ L S
Sbjct: 654 SALQIMAMADSSSGGTTLYLHIGLYSGVYLRTVLDEVTGELSDTRTRFIGAKPVKLSQVS 713
Query: 585 SKNTTHVFAASDRPTVIYS--SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELT 642
K T V A S RP + YS K + + ++ + F+S + + + L
Sbjct: 714 VKGQTAVLALSTRPWLGYSDVQTKSFMLTPLDYVGLEWGWNFSSEQCVEGMIGIQGRSLR 773
Query: 643 IGTIDDIQKLHIR-SIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDD-- 699
I T++ + ++ SIPL PRR + F + N + + RL+DD
Sbjct: 774 IFTVEKLDNNMLQESIPLSYTPRRFVKHPEQPLFYVIESDNNVLSPATRQ---RLIDDSQ 830
Query: 700 ---------------------------QTFEFISTYP------LDTFEYGCSILSCSFSD 726
Q E I+T L+ E S+ + SFS
Sbjct: 831 AQNGEATDLPPADFGYPRATGHWASCIQVVEPITTKSVIFNLDLEDNEAAVSLAAVSFSS 890
Query: 727 -DSNVYYCVGTA---YVLPEENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVYSLNAFN 780
D + VGTA V P + + G I ++ EDG+ L+ I + + +L F
Sbjct: 891 QDDETFLVVGTAKDMTVSPPSS--SCGFIHIYRFQEDGRELEFIHKTQVDEPPLALLGFQ 948
Query: 781 GKLLAAINQKIQLYKWMLRDDGTRELQSECGH---HGHILALYVQTRGDFIVVGDLMKSI 837
G+LLA I +++Y D G ++L +C I+ L QT+G I+V D+ +S+
Sbjct: 949 GRLLAGIGPILRVY-----DLGMKQLLRKCQAPVVPKTIVGL--QTQGSRIIVSDVRESV 1001
Query: 838 SLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDE 894
+ ++YK+++ + A D A W S+ ++D + G + NL+ VR K SE A ++
Sbjct: 1002 TYVVYKYQDNVLIPFADDSIARWTSSTTMVDYETTAGGDKFGNLWLVRCPSKVSEQADED 1061
Query: 895 --------ERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV--GQIPTVIFGTVNGVIG 944
E+G L G H E + F + L + + G V++ G IG
Sbjct: 1062 GSGAHLIHEKGYLH--GTPHRLELMVHFYAQDIPTSLHKTQLVAGGRDIVVWTGFQGTIG 1119
Query: 945 VIASLP-HEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESF 1003
+ E F + L+T L + G +H +R + K V +DGDL E +
Sbjct: 1120 MFVPFASREDVDFFQLLETQLASQQPPLAGRDHLMYRGYYAPVKGV-----IDGDLCEMY 1174
Query: 1004 LDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
L L I+ ++ SV E+ +++ ++
Sbjct: 1175 LLLPNDTKLMIAGELDRSVREIERKISDM 1203
>gi|449282569|gb|EMC89402.1| Splicing factor 3B subunit 3 [Columba livia]
Length = 1225
Score = 209 bits (532), Expect = 7e-51, Method: Compositional matrix adjust.
Identities = 266/1193 (22%), Positives = 489/1193 (40%), Gaps = 187/1193 (15%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+L V ++G I +L FR G +D++ + ++ + +L++ S + + + + G
Sbjct: 53 LLTVEVFGVIRSLMAFRLTGGTKDYIVVGSDSGRIVILEYQP-SKNVFEKIHQETFGKSG 111
Query: 61 -RPTDNGQIGIIDPDCRLIGLHLYDG--LFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
R GQ +DP R + + + L ++ D +L + + + L +
Sbjct: 112 CRRIVPGQYLAVDPKGRAVMISAIEKQKLVYILNRDAAARLTISSPLEAHKANTLVYHVV 171
Query: 118 ---YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNG 163
G P L D ++A + T E A + F E N+ L+
Sbjct: 172 GVDVGFENPMFACLEMDYEEADNDPTGEAAANTQQTLTFYELDLGLNHVVRKYSEPLEEH 231
Query: 164 ADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR--- 215
+ LI VP P GVLI E I Y N IR I + +D D R
Sbjct: 232 GNFLITVPGGSDGP--SGVLICSENYITY--KNFGDQPDIRCPIPRRRNDLD-DPERGMI 286
Query: 216 --------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAV 261
+L G + + + +++ VT ++++ +A+ + L
Sbjct: 287 FVCSATHKTKSMFFFLAQTEQGDIFKITLETDEDMVTEIRLKYFDTVPVAAAMCVLKTGF 346
Query: 262 VYIGSSYGDSQLIKLN------------------------LQPDAKGSYVEVLERYVNLG 297
+++ S +G+ L ++ QP + V +++ +L
Sbjct: 347 LFVASEFGNHYLYQIAHLGDDDEEPEFSSAMPLEEGDTFFFQPRPLKNLV-LVDELDSLS 405
Query: 298 PIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSS 356
PI+ + DL + Q+ G SLR++R+G+ ++E A EL G +W++R
Sbjct: 406 PILCCQIADLANEDTPQLYVACGRGPRSSLRVLRHGLEVSEMAVSELPGNPNAVWTVRRH 465
Query: 357 TDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGS 416
+D FD +++VSF++ T +L++ + +EE GF T TL C + LVQV
Sbjct: 466 VEDEFDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSLLGDDALVQVYPDG 523
Query: 417 VRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD-GILTE-VKHA 474
+R + + R NEWK+P ++ N QV++A GG LVY E+ G L E +
Sbjct: 524 IRHIRADKR--VNEWKTPGKKTIVKCAVNQRQVVIALTGGELVYFEMDPVGQLNEYTERK 581
Query: 475 QLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIP-RSV 533
++ ++ C+ + + S+ AVG+ D +VRI SL + + + L + +P +
Sbjct: 582 EMSADVVCMSLANVPPGEQRSRFLAVGL-VDNTVRIISLDPSDCL--QPLSMQALPAQPE 638
Query: 534 LLCAFE----------------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQP 577
LC E G YL L +G LL +L+ TG+L+D + LG++P
Sbjct: 639 SLCIVEMGGTEKQDELGERGSIGFLYLNIGLQNGVLLRTVLDPVTGDLSDTRTRYLGSRP 698
Query: 578 ITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAK 637
+ L + V A S R + YS + + ++ + + F S P+ +
Sbjct: 699 VKLFRVRMQGQEAVLAMSSRSWLSYSYQSRFHLTPLSYETLEFASGFASEQCPEGIVAIS 758
Query: 638 EGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQEQSR-----------TFAICSLKNQS 684
L I ++ + + + + PL PR+ + H E + T A + + Q
Sbjct: 759 TNTLRILALEKLGAVFNQVAFPLQYTPRKFVIHPESNNLIIIETDHNAYTEATKAQRKQQ 818
Query: 685 CAEE-------------SEM-------------------------HFVRLLDDQTFEFIS 706
AEE +EM +R+++ +
Sbjct: 819 MAEEMVEAAGEDERELAAEMAAAFLNENLPESIFGAPKAGNGQWASVIRVMNPIQGNTLD 878
Query: 707 TYPLDTFEYGCSILSCSFSDDSNVYYC-VGTAYVLPEENEPTKGRILVF--IVEDG-KLQ 762
L+ E S+ C FS+ + +Y VG A L G + +V G KL+
Sbjct: 879 LVQLEQNEAAFSVAVCRFSNTGDEWYVLVGVAKDLILNPRSVAGGFVYTYKLVNSGEKLE 938
Query: 763 LIAEKETKGAVYSLNAFNGKLLAAINQKIQLY----KWMLRDDGTRELQSECGHHGHILA 818
+ + + ++ F G++L + + +++Y K +LR + L H
Sbjct: 939 FLHKTPVEEVPAAIAPFQGRVLIGVGKLLRVYDLGKKKLLRKCENKILSFLSQKHIANYI 998
Query: 819 LYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENN 878
+QT G ++V D+ +S + YK E + A D W++ +LD D GA+
Sbjct: 999 CGIQTIGHRVIVSDVQESFIWVRYKRNENQLIIFADDTYPRWVTTATLLDYDTVAGADKF 1058
Query: 879 FNLFTVR--KNSEGATDE---------ERGRL-------EVVGEYHLGEFVNRFRHGSLV 920
N+ VR N+ DE +RG L EV+ YH+GE V + +L+
Sbjct: 1059 GNICVVRLPPNTNDEVDEDPTGNKALWDRGLLNGASQKAEVIMNYHVGETVLSLQKTTLI 1118
Query: 921 MRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQW 979
G ++++ T++G IG++ HE + F + ++ +LR + G +H +
Sbjct: 1119 -------PGGSESLVYTTLSGGIGILVPFTSHEDHDFFQHVEMHLRSEHPPLCGRDHLSF 1171
Query: 980 RSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
RS+ KN +DGDL E F + + +++ ++ + E+ K++E++
Sbjct: 1172 RSY-----YFPVKNVIDGDLCEQFNSMEPNKQKNVAEELDRTPPEVSKKLEDI 1219
>gi|189044515|sp|Q7RYR4.2|RSE1_NEUCR RecName: Full=Pre-mRNA-splicing factor rse-1
Length = 1209
Score = 209 bits (531), Expect = 8e-51, Method: Compositional matrix adjust.
Identities = 279/1164 (23%), Positives = 487/1164 (41%), Gaps = 161/1164 (13%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
I+G + + FR G +D++ +AT+ + ++++ ++++ + R
Sbjct: 64 IFGIVRAIASFRLAGSHKDYIILATDSGRITIIEYLPKTNKFQRIHLETFGKSGVRRVIP 123
Query: 66 GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YGC 120
GQ DP R LI + L V+ +++ +L + + + VL + + G
Sbjct: 124 GQYLAADPKGRACLISALEKNKLVYVLNRNSQAELTISSPLEAHKPGVLVLSLVALDVGY 183
Query: 121 AKPTIVVLYQDNKDARHVKT-------------YEVALKDKDFVEGPWSQNNLDNGADLL 167
A P L D DA T YE+ L V WS + +D + LL
Sbjct: 184 ANPVFAALELDYTDADQDPTGQAREEVETQLVYYELDLGLNHVVRK-WS-DTVDRTSSLL 241
Query: 168 IPVP-----PPLCGVLIIGEETIVYCSAN--AFKA-IPIRPSITK--AYGRVDADGSRYL 217
VP P GVL+ GEE + Y +N AF+ IP R T+ RV G +
Sbjct: 242 FQVPGGNDGP--SGVLVCGEENVTYRHSNQEAFRVPIPRRSGATEDPQRKRVIVSGVMHK 299
Query: 218 LGDHAGLLHLLVITHEKE------------------KVTGLKIELLGETSIASTISYLDN 259
L AG L+ T + + +V LKI+ +A+++ L +
Sbjct: 300 LKGSAGAFFFLLQTDDGDLFKVTIDMIEDSDGNPTGEVKRLKIKYFDTIPVATSLCILKS 359
Query: 260 AVVYIGSSYGDSQLIKL---------------NLQPDAKGSYVEV------LERYV---- 294
++ S +G+ + + D SY V LE V
Sbjct: 360 GFLFAASEFGNHHFYQFEKLGDDDEELEFSSDDFPTDPTASYNPVYFHPRPLENLVLVES 419
Query: 295 --NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMW 351
++ P VD V +L + Q+ + G + R++++G+ ++E + EL G +W
Sbjct: 420 IDSMNPQVDCKVANLTGEDAPQIYSVCGNGARSTFRMLKHGLEVSEIVASELPGTPSAVW 479
Query: 352 SLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQ 411
+ + + D +D ++V+SF + T +L++ + +EE GF + TL + L+Q
Sbjct: 480 TTKLTKYDQYDAYIVLSFTNGTLVLSIG--ETVEEVSDSGFLTTAPTLAVQQMGEDGLIQ 537
Query: 412 VTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTE 470
V +R + NEW +P S+ ATAN +QV++A G +VY E+ DG L E
Sbjct: 538 VHPKGIRHIVQGRV---NEWPAPQHRSIVAATANENQVVIALSSGEIVYFEMDSDGSLAE 594
Query: 471 V-KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEI 528
+ ++ ++ L + + E S AVG D +VRI SL PD L K
Sbjct: 595 YDEKKEMSGTVTSLSVGQVPEGLKRSSFLAVGC-DDCTVRILSLDPDSTLEMKSIQALTA 653
Query: 529 IPRSVLLCAFE-----GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTF 583
P ++ + + E YL L G L +L+ TGELTD ++ LG +P L
Sbjct: 654 APSALSIMSMEDSFGGSTLYLHIGLHSGVYLRTVLDEVTGELTDTRQKFLGPKPTRLFQV 713
Query: 584 SSKNTTHVFAASDRPTVIYSS--NKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGEL 641
S ++ V A S RP + Y+ K + + ++ E+ + F+S + + L
Sbjct: 714 SVQDQPCVLALSSRPWLGYTDPLTKGFMMTPLSYTELEYGWNFSSEQCLEGMVGIHANYL 773
Query: 642 TIGTIDDIQKLHI-RSIPLGEHPRRIC-HQEQSRTFAICSLKN-------QSCAEESEMH 692
I +I+ + I +SIPL P+ + H EQ + I S N E+
Sbjct: 774 RIFSIEKLGDNMIQKSIPLTYTPKHLVKHPEQPYFYTIESDNNTLPPELRAKLLEQQSNG 833
Query: 693 FVRLLDDQTFEF----------------ISTYP-------LDTFEYGCSILSCSF-SDDS 728
+L + F + IS P LD E S F S +
Sbjct: 834 DATVLPPEDFGYPRAKGRWASCISIIDPISEEPRVLQRIDLDNNEAAVSAAIVPFASQEG 893
Query: 729 NVYYCVGTAY-VLPEENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVYSLNAFNGKLLA 785
+ VGT ++ + + T+G I V+ EDG+ L+ I + + +L F G+LLA
Sbjct: 894 ESFLVVGTGKDMVLDPRQFTEGYIHVYRFHEDGRDLEFIHKTRVEEPPLALIPFQGRLLA 953
Query: 786 AINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHE 845
+ + +++Y L+ R+ Q++ L + +Q++G+ I+VGDL + I+ ++YK E
Sbjct: 954 GVGKTLRIYDLGLK-QLLRKAQADV---TPTLIVSLQSQGNRIIVGDLQQGITYVVYKAE 1009
Query: 846 EGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEY 905
+ A D W + ++D + G + N++ VR + + + E E
Sbjct: 1010 GNRLIPFADDTLNRWTTCTTMVDYESVAGGDKFGNIYIVRCPERVSQETD----EPGSEI 1065
Query: 906 HLGEFVNRFRHGS----------LVMRLPDS------DVGQIPTVIFGTVNGVIGV-IAS 948
HL N + HG+ LP S VG +++ + G +GV I
Sbjct: 1066 HLMHARN-YLHGTPNRLSLQVHFYTQDLPTSICKTSLVVGGQDVLLWSGLQGTVGVFIPF 1124
Query: 949 LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSR 1008
+ E F + L+ ++R + G +H +R + K V +DGDL E F L
Sbjct: 1125 VSREDVDFFQNLENHMRAEDPPLAGRDHLIYRGYYTPVKGV-----IDGDLCERFSLLPN 1179
Query: 1009 TRMDEISKTMNVSVEELCKRVEEL 1032
+ I+ ++ SV E+ +++ ++
Sbjct: 1180 DKKQMIAGELDRSVREIERKISDI 1203
>gi|350290373|gb|EGZ71587.1| Pre-mRNA-splicing factor rse-1 [Neurospora tetrasperma FGSC 2509]
Length = 1209
Score = 209 bits (531), Expect = 9e-51, Method: Compositional matrix adjust.
Identities = 279/1164 (23%), Positives = 487/1164 (41%), Gaps = 161/1164 (13%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
I+G + + FR G +D++ +AT+ + ++++ ++++ + R
Sbjct: 64 IFGIVRAIASFRLAGSHKDYIILATDSGRITIIEYLPKTNKFQRIHLETFGKSGVRRVIP 123
Query: 66 GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YGC 120
GQ DP R LI + L V+ +++ +L + + + VL + + G
Sbjct: 124 GQYLAADPKGRACLISALEKNKLVYVLNRNSQAELTISSPLEAHKPGVLVLSLVALDVGY 183
Query: 121 AKPTIVVLYQDNKDARHVKT-------------YEVALKDKDFVEGPWSQNNLDNGADLL 167
A P L D DA T YE+ L V WS + +D + LL
Sbjct: 184 ANPVFAALELDYTDADQDPTGQAREEVETQLVYYELDLGLNHVVRK-WS-DTVDRTSSLL 241
Query: 168 IPVP-----PPLCGVLIIGEETIVYCSAN--AFK-AIPIRPSITK--AYGRVDADGSRYL 217
VP P GVL+ GEE + Y +N AF+ IP R T+ RV G +
Sbjct: 242 FQVPGGNDGP--SGVLVCGEENVTYRHSNQEAFRVPIPRRSGATEDPQRKRVIVAGVMHK 299
Query: 218 LGDHAGLLHLLVITHEKE------------------KVTGLKIELLGETSIASTISYLDN 259
L AG L+ T + + +V LKI+ +A+++ L +
Sbjct: 300 LKGSAGAFFFLLQTDDGDLFKVTIDMIEDSDGNPTGEVKRLKIKYFDTIPVATSLCILKS 359
Query: 260 AVVYIGSSYGDSQLIKL---------------NLQPDAKGSYVEV------LERYV---- 294
++ S +G+ + + D SY V LE V
Sbjct: 360 GFLFAASEFGNHHFYQFEKLGDDDEELEFSSDDFPTDPTASYNPVYFHPRPLENLVLVES 419
Query: 295 --NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMW 351
++ P VD V +L + Q+ + G + R++++G+ ++E + EL G +W
Sbjct: 420 IDSMNPQVDCKVANLTGEDAPQIYSVCGNGARSTFRMLKHGLEVSEIVASELPGTPSAVW 479
Query: 352 SLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQ 411
+ + + D +D ++V+SF + T +L++ + +EE GF + TL + L+Q
Sbjct: 480 TTKLTKYDQYDAYIVLSFTNGTLVLSIG--ETVEEVSDSGFLTTAPTLAVQQMGEDGLIQ 537
Query: 412 VTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTE 470
V +R + NEW +P S+ ATAN +QV++A G +VY E+ DG L E
Sbjct: 538 VHPKGIRHIVQGRV---NEWPAPQHRSIVAATANENQVVIALSSGEIVYFEMDSDGSLAE 594
Query: 471 V-KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEI 528
+ ++ ++ L + + E S AVG D +VRI SL PD L K
Sbjct: 595 YDEKKEMSGTVTSLSVGQVPEGLKRSSFLAVGC-DDCTVRILSLDPDSTLEMKSIQALTA 653
Query: 529 IPRSVLLCAFE-----GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTF 583
P ++ + + E YL L G L +L+ TGELTD ++ LG +P L
Sbjct: 654 APSALSIMSMEDSFGGSTLYLHIGLHSGVYLRTVLDEVTGELTDTRQKFLGPKPTKLFQV 713
Query: 584 SSKNTTHVFAASDRPTVIYSS--NKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGEL 641
S ++ V A S RP + Y+ K + + ++ E+ + F+S + + L
Sbjct: 714 SVQDQPCVLALSSRPWLGYTDPLTKGFMMTPLSYTELEYGWNFSSEQCLEGMVGIHANYL 773
Query: 642 TIGTIDDIQKLHI-RSIPLGEHPRRIC-HQEQSRTFAICSLKN-------QSCAEESEMH 692
I +I+ + I +SIPL P+ + H EQ + I S N E+
Sbjct: 774 RIFSIEKLGDNMIQKSIPLTYTPKHLVKHPEQPYFYTIESDNNTLPPELRAKLLEQQSNG 833
Query: 693 FVRLLDDQTFEF----------------ISTYP-------LDTFEYGCSILSCSF-SDDS 728
+L + F + IS P LD E S F S +
Sbjct: 834 DATVLPPEDFGYPRAKGRWASCISIIDPISEEPRVLQRIDLDNNEAAVSAAIVPFASQEG 893
Query: 729 NVYYCVGTAY-VLPEENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVYSLNAFNGKLLA 785
+ VGT ++ + + T+G I V+ EDG+ L+ I + + +L F G+LLA
Sbjct: 894 ESFLVVGTGKDMVLDPRQFTEGYIHVYRFHEDGRDLEFIHKTRVEEPPLALIPFQGRLLA 953
Query: 786 AINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHE 845
+ + +++Y L+ R+ Q++ L + +Q++G+ I+VGDL + I+ ++YK E
Sbjct: 954 GVGKTLRIYDLGLK-QLLRKAQADV---TPTLIVSLQSQGNRIIVGDLQQGITYVVYKAE 1009
Query: 846 EGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEY 905
+ A D W + ++D + G + N++ VR + + + E E
Sbjct: 1010 GNRLIPFADDTLNRWTTCTTMVDYESVAGGDKFGNIYIVRCPERVSQETD----EPGSEI 1065
Query: 906 HLGEFVNRFRHGS----------LVMRLPDS------DVGQIPTVIFGTVNGVIGV-IAS 948
HL N + HG+ LP S VG +++ + G +GV I
Sbjct: 1066 HLMHARN-YLHGTPNRLSLQVHFYTQDLPTSICKTSLVVGGQDVLLWSGLQGTVGVFIPF 1124
Query: 949 LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSR 1008
+ E F + L+ ++R + G +H +R + K V +DGDL E F L
Sbjct: 1125 VSREDVDFFQNLENHMRAEDPPLAGRDHLIYRGYYTPVKGV-----IDGDLCERFSLLPN 1179
Query: 1009 TRMDEISKTMNVSVEELCKRVEEL 1032
+ I+ ++ SV E+ +++ ++
Sbjct: 1180 DKKQMIAGELDRSVREIERKISDI 1203
>gi|340520436|gb|EGR50672.1| predicted protein [Trichoderma reesei QM6a]
Length = 1212
Score = 208 bits (530), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 286/1175 (24%), Positives = 487/1175 (41%), Gaps = 180/1175 (15%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW---DAESSELITRAMGDVSDRIGRP 62
I+G I +L FR G +D+L +AT+ + ++++ + S+L G R P
Sbjct: 64 IFGIIRSLAAFRLAGSNKDYLILATDSGRITIIEYLPKENRFSKLHLETFGKSGVRRVIP 123
Query: 63 TDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL--- 117
G+ DP R LI + L V+ +++ +L + + + VL I +
Sbjct: 124 ---GEYLACDPKGRACLIASIEKNKLVYVLNRNSQAELTISSPLEAHKPGVLVISMVALD 180
Query: 118 YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD-------------FVEGPWSQNNLDNGA 164
G A P L D + T E AL+D + V WS+ +D+ A
Sbjct: 181 VGYANPVFAALEIDYTEVDQDSTGE-ALRDLETQLVYYELDLGLNHVVRKWSEP-VDSTA 238
Query: 165 DLLIPVP-----PPLCGVLIIGEETIVYCSAN--AFKA-IPIRPSITK--AYGRVDADGS 214
LL VP P GVL+ GEE I Y +N AF+ IP R T+ + R G
Sbjct: 239 SLLFQVPGGNDGP--SGVLVCGEENITYRHSNQEAFRVPIPRRRGATEDPSRKRTIISGV 296
Query: 215 RYLLGDHAGLLHLLVITHEKE------------------KVTGLKIELLGETSIASTISY 256
+ L AG L+ T + + +V LKI+ A+++
Sbjct: 297 MHKLKGSAGAFFFLLQTEDGDMFKVTIDMVEDDEGNTTGEVRRLKIKYFDTVPPAASLCI 356
Query: 257 LDNAVVYIGSSYGD---SQLIKL------------NLQPDAKGSYVEV------------ 289
L + +Y+ S +G+ Q KL + DA+ SY V
Sbjct: 357 LKSGFLYVASQFGNFSFYQFEKLGDDDEELEFTSDDFPVDAQASYDPVYFYPRPLENLVL 416
Query: 290 LERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-K 348
+E ++ P++D + +L + Q+ T G + R +++G+ INE S EL GI
Sbjct: 417 VESIPSMNPLLDCKIANLTGEDAPQIYTICGNGARSTFRTLKHGLEINEIVSSELPGIPS 476
Query: 349 GMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQ 408
+W+L+ + ++ +D ++V+SF + T +L++ + +EE GF + TL +
Sbjct: 477 AVWTLKLNRNEQYDAYIVLSFTNGTLVLSIG--ETVEEVSDSGFLTSVPTLAAQLLGEDG 534
Query: 409 LVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGI 467
L+QV +R + + N+W +P S+ AT NA QV +A G +VY E+ DG
Sbjct: 535 LIQVHPKGIRHIRNGQV---NQWDAPQHRSIVAATTNAHQVAIALSSGEIVYFEMDDDGS 591
Query: 468 LTEVKHAQLEY-EISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLG 525
L E + + ++CL + + E S AVG D +VRI SL P+ L K
Sbjct: 592 LAEYDEKKEMFGTVTCLSLGEVPEGRLRSSFLAVGC-DDCTVRILSLDPESTLENKSVQA 650
Query: 526 GEIIPRSVLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPIT 579
P S+ + + E S YL L G L +L+ TGELTD ++ LG + +
Sbjct: 651 LTAAPTSLAIISMEDSSSGGSTLYLHIGLYSGVYLRTVLDEVTGELTDTRQKFLGPKQVR 710
Query: 580 LRTFSSKNTTHVFAASDRPTVIYSS--NKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAK 637
L + + T V S RP + Y+ K + + +N ++ F S + + +
Sbjct: 711 LFQVTVQGRTCVLGLSSRPWLGYADPITKTFVVTPLNYVDLEWGWNFTSEQCEEGIVGIQ 770
Query: 638 EGELTIGTIDDIQKLHIR-SIPLGEHPRRICHQEQSRTFAICSLKNQS-----CAE---- 687
L I ++D + I+ SIPL P+++ F + N + CA+
Sbjct: 771 GQTLRIFSVDRLGDTLIQSSIPLTYTPKKMVKHPDQPLFYVIEADNHTLSPALCAQLLAD 830
Query: 688 -------------------------ESEMHFVRLL--DDQTFEFISTYPLDTFEYGCSIL 720
S + V L D Q + I L+ E S+
Sbjct: 831 PARVNGDSKVLPPEEFGHPRGNRRWASCISVVDPLAEDGQVLQRID---LEENEAAVSLA 887
Query: 721 SCSFSDDSN-VYYCVGTAYVL---PEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSL 776
+F+ N + VGT + P I F + L I + + + ++
Sbjct: 888 IVTFASQENETFLVVGTGKDMVLNPRSFSDAFVHIYRFERDGRGLVFIHKTKVEEPPMAM 947
Query: 777 NAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKS 836
F G++L I + +++Y +R R+ Q+E I++L Q G IVVGD+ +
Sbjct: 948 IPFQGRVLVGIGKMLRIYDLGMR-QLLRKSQAEVAPQ-QIVSLNAQ--GSRIVVGDVQQG 1003
Query: 837 ISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATD 893
I+ +++K + + D A W + ++D + G + N+F VR K SE A +
Sbjct: 1004 ITYVVFKQQTNKLIPFVDDTVARWTTCSTMVDYETTAGGDKFGNIFIVRAPQKASEEADE 1063
Query: 894 EERG---------------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGT 938
E G RL+++ + + SLV VG +++
Sbjct: 1064 EPAGLHLLNARSYLHGTSHRLDLMCHLYTQDIPTSITKTSLV-------VGGQEVLLWSG 1116
Query: 939 VNGVIGV-IASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDG 997
+ G IGV I + E F + L+ +LR + G +H +RS+ K V +DG
Sbjct: 1117 LMGTIGVLIPFVTREDADFFQSLEQHLRAEDPPLAGRDHLMYRSYYAPMKGV-----IDG 1171
Query: 998 DLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
DL E + L + I+ ++ SV E+ +++ ++
Sbjct: 1172 DLCERYALLPNDKKQMIAGELDRSVREIERKISDI 1206
>gi|334313376|ref|XP_003339894.1| PREDICTED: splicing factor 3B subunit 3-like [Monodelphis domestica]
Length = 1202
Score = 208 bits (530), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 268/1183 (22%), Positives = 487/1183 (41%), Gaps = 190/1183 (16%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+L V ++G I +L FR G +D++ + ++ + +L++ S + + + + G
Sbjct: 53 LLTVEVFGVIRSLMAFRLTGGTKDYIVVGSDSGRIVILEYQP-SKNMFEKIHQETFGKSG 111
Query: 61 -RPTDNGQIGIIDPDCRLIGLHLYDG--LFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
R GQ +DP R + + + L ++ D +L + + + L +
Sbjct: 112 CRRIVPGQFLAVDPKGRAVMISAIEKQKLVYILNRDAAARLTISSPLEAHKANTLVYHVV 171
Query: 118 ---YGCAKPTIVVLYQDN--KDARHVKTYEVALKDKDFVEGPWSQNN---LDNGADLLIP 169
G P L D K+ ++ + D + N L G+D
Sbjct: 172 GVDVGFENPMFACLEMDYEVKEGTGLENKDKKFGGVDVTHPSYMIQNVTILPGGSD---- 227
Query: 170 VPPPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR-------------- 215
GVLI E I Y N IR I + +D D R
Sbjct: 228 ---GPSGVLICSENYITY--KNFGDQPDIRCPIPRRRKELD-DPERGKIFVCSATPKNKS 281
Query: 216 --YLLGD--HAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDS 271
+ LG + + + T E VT ++++ +A+ + L +++ S +G+
Sbjct: 282 MFFFLGQTGQGDIFQITLETDENMGVTEIRLKYFDTVPVAAAMCVLKTGFLFVASEFGNH 341
Query: 272 QLIKLN------------------------LQPDAKGSYVEVLERYVNLGPIVDFC-VVD 306
L ++ QP + V +++ +L PI+ FC + D
Sbjct: 342 YLYQIAHLGDDDEEPEFSSAMPLEEGDTFFFQPRPLKNLV-LVDELDSLSPIL-FCQIAD 399
Query: 307 LERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTDDPFDTFL 365
L + Q+ G SLR++R+G+ ++E A EL G +W++R +D FD ++
Sbjct: 400 LANEDTPQLYVACGRGPRSSLRVLRHGLEVSEMAVSELPGNPNAVWTVRRHIEDEFDAYI 459
Query: 366 VVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSR 425
+VSF++ T +L++ + +EE GF T TL C + LVQV +R + + R
Sbjct: 460 IVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSLLGDDALVQVYPDGIRHIRADKR 517
Query: 426 ELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKHAQLEYEISCL 483
NEWK+P ++ N QV++A GG LVY E+ G L E + ++ ++ C+
Sbjct: 518 --VNEWKTPGKKTIVKCAVNQRQVVIALTGGELVYFEMDPSGQLNEYTERKEMSADVVCM 575
Query: 484 DINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIP-RSVLLCAFE--- 539
+ + S+ AVG+ D +VRI SL + + + L + +P + LC E
Sbjct: 576 SLANVPPGEQRSRFLAVGL-VDNTVRIISLDPSDCL--QPLSMQALPAQPESLCIVEMGG 632
Query: 540 -------------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSK 586
G YL L +G LL +L+ TG+L+D + LG++P+ L +
Sbjct: 633 AEKQDELGERGSIGFLYLNIGLQNGVLLRTVLDPVTGDLSDTRTRYLGSRPVKLFRVRMQ 692
Query: 587 NTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTI 646
V A S R + YS + + ++ + + F S P+ + L I +
Sbjct: 693 GQEAVLAMSSRSWLSYSYQSRFHLTPLSYETLEFASGFASEQCPEGIVAISTNTLRILAL 752
Query: 647 DDIQKLHIR-SIPLGEHPRR-ICHQEQSR-----------TFAICSLKNQSCAEE----- 688
+ + + + + PL PR+ + H E + T A + + Q AEE
Sbjct: 753 EKLGAVFNQVAFPLQYTPRKFVIHPESNNLIIIETDHNAYTEATKAQRKQQMAEEMVEAA 812
Query: 689 --------SEM-------------------------HFVRLLDDQTFEFISTYPLDTFEY 715
+EM VR+++ + L+ E
Sbjct: 813 GEDERELAAEMAAAFLNENLPESIFGAPKAGNGQWASVVRVMNPIQGNTLDLVQLEQNEA 872
Query: 716 GCSILSCSFSDDSNVYYC-VGTAYVLPEENEPTKGRILVF--IVEDG-KLQLIAEKETKG 771
S+ C FS+ + +Y VG A L G + +V G KL+ + + +
Sbjct: 873 AFSVAVCRFSNTGDDWYVLVGVAKDLILNPRSVAGGFVYTYKLVNSGEKLEFLHKTPVEE 932
Query: 772 AVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY---VQTRGDFI 828
++ F G++L + + +++Y D G ++L +C + HI A Y +QT G +
Sbjct: 933 VPAAIAPFQGRVLIGVGKLLRVY-----DLGKKKLLRKC-ENKHI-ANYISGIQTIGHRV 985
Query: 829 VVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR--K 886
+V D+ +S + YK E + A D W++ +LD D GA+ N+ VR
Sbjct: 986 IVSDVQESFIWVRYKRNENQLIIFADDTYPRWVTTASLLDYDTVAGADKFGNICVVRLPP 1045
Query: 887 NSEGATDE---------ERGRL-------EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQ 930
N+ DE +RG L EV+ YH+GE V + +L+ G
Sbjct: 1046 NTNDEVDEDPTGNKALWDRGLLNGASQKAEVIMNYHVGETVLSLQKTTLI-------PGG 1098
Query: 931 IPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTV 989
++++ T++G IG++ HE + F + ++ +LR + G +H +RS+
Sbjct: 1099 SESLVYTTLSGGIGILVPFTSHEDHDFFQHVEMHLRSEHPPLCGRDHLSFRSY-----YF 1153
Query: 990 DAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
KN +DGDL E F + + +S+ ++ + E+ K++E++
Sbjct: 1154 PVKNVIDGDLCEQFNSMEPNKQKNVSEELDRTPPEVSKKLEDI 1196
>gi|299751161|ref|XP_001830098.2| pre-mRNA-splicing factor rse1 [Coprinopsis cinerea okayama7#130]
gi|298409248|gb|EAU91763.2| pre-mRNA-splicing factor rse1 [Coprinopsis cinerea okayama7#130]
Length = 1205
Score = 208 bits (530), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 262/1172 (22%), Positives = 475/1172 (40%), Gaps = 176/1172 (15%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESS---ELITRAMGDVSDRIGRP 62
++G I +L FR G ++D+ + ++ + +L++D +++ +L G R P
Sbjct: 58 VFGSIRSLAAFRLTGGSKDYAIVGSDSGRIVILEYDPKTNVFNKLHQETYGKSGTRRIVP 117
Query: 63 TDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNI---RLEELQVLDIKFL 117
GQ DP R +I L V+ D L + + R + +
Sbjct: 118 ---GQYLATDPRGRSVMIAAMENSKLVYVLNRDVATNLTISSPLGAHRASTIVHDVVGVD 174
Query: 118 YGCAKPTIVVLYQDNKDARHVKT-------------YEVALKDKDFVEGPWSQNNLDNGA 164
G P L D + T YE+ L V WS+ D A
Sbjct: 175 VGFDNPVYAALEVDYSQSDQDPTGNALRRAEKLLTYYELDLGLNHVVRK-WSEPT-DPRA 232
Query: 165 DLLIPVP------------PPLCGVLIIGEETIVYCSANAFK---AIPIR--PSITKAYG 207
+LL+ VP P GVL+ E I+Y +A IP R P
Sbjct: 233 NLLVQVPGGQSTNGATHDGP--SGVLVCCENHIIYQHGDAPSHRVPIPRRQDPVADPNQD 290
Query: 208 RVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSS 267
+ + + G + + HE V+ LKI+ +AS + L ++++ S
Sbjct: 291 VIIVAAVMHKMKTEDGDVFKATVDHEDGVVSSLKIKYFETVPVASNLCILKQGLLFVASE 350
Query: 268 YGDSQLIKL------------------------NLQPDAKGSY-------VEVLERYVNL 296
+G+ L + QP + S+ + +++ +L
Sbjct: 351 FGNHHLYRFCKLGDDDDQPEFSSSSYPSYGMAEPEQPLPRASFDPRPMENLMLIDELSSL 410
Query: 297 GPIVDFCVVDLERQGQG-QVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI---KGMWS 352
P++D ++ + ++ G S +I+ +G+ ++E AS EL G +WS
Sbjct: 411 NPVLDAKILKPSLDSEAPKIFAACGRGPASSFKILSHGLEVDEMASSELPGFLAPHSLWS 470
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
+ D +D LV+SF + T +L++ + +EE + GF + TL N L+QV
Sbjct: 471 TKRMQTDHYDNLLVMSFQNATIVLSIG--ESMEEVKDSGFLTSMPTLAVQQIGENGLIQV 528
Query: 413 TSGSVR-LVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE 470
+ +R LV S NEWK P G ++ A N QV++A +VY E+ +G L E
Sbjct: 529 HTHGIRHLVDSQV----NEWKVPQGQTIVAANTNRRQVVVALSSAEIVYFELDQEGQLNE 584
Query: 471 VKHAQ-LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEI 528
+ + + + L + + E AVG D +VR+ SL PD L T
Sbjct: 585 YQDMKAMGSTVLALGLGEVPEGRLRFPYLAVGC-EDQTVRVMSLDPDSTLETISLQALTA 643
Query: 529 IPRSVLLCAFEGIS--------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITL 580
P S+ + S ++ L +G LL +L+ G+LTD + LGT+P+ L
Sbjct: 644 PPSSICIAYMLDASINKVQRSMFVNIGLANGVLLRTVLDGTNGQLTDTRTRFLGTKPVRL 703
Query: 581 RTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGE 640
K + A S R + YS +K+ ++ + + F + P+ + +
Sbjct: 704 LRVKLKGEDAILAISSRVWLNYSHQQKMEFTPLACETPECASSFTGESCPEGIISIAKNT 763
Query: 641 LTIGTIDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEM-------- 691
L+I T+ + KL S+PL PR+ +R F + ++ +EE E
Sbjct: 764 LSISTVSKLGMKLKQESVPLAFTPRKFVTHPGNRFFYLIESDQRTLSEEEEKKKLADLNI 823
Query: 692 -----------------------HF---VRLLDDQTFEFISTYPLDTFEYGCSILSCSFS 725
H+ + + D + ++T PL E SI F+
Sbjct: 824 SRDDHPILQLPAKIFGRTRASAGHWASRIHIFDPMEAKTVATLPLKANEAAFSIAVVPFA 883
Query: 726 DDSNVYY-CVGTAY---VLPEENEPTKGRILVFIVEDGKLQLIAEKETKGA--VYSLNAF 779
Y+ VGTA V P + + ++ + E L+L+ E + + +L AF
Sbjct: 884 STGGEYHLVVGTAMHHLVTPPQASASYLKVYKIVNEGTGLELLHETPIQDSELPRALLAF 943
Query: 780 NGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISL 839
G+LLA + + +++Y D G ++L + + + T+G IV+GD+ +S
Sbjct: 944 QGRLLAGVGKALRIY-----DLGKKKLLRKAETKSPTAIVSLATQGSRIVIGDMQESTLF 998
Query: 840 LIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNS----------- 888
+YK E + D W+SA+ ++D + + N+F R +S
Sbjct: 999 AVYKEAENRLLIFGDDTQPRWVSAMTMVDYNTVAVGDKFGNIFVNRLDSTISDQVDEDPT 1058
Query: 889 ------EGAT-DEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNG 941
E AT + + +++ +H+G+ + SLV VG +++ + G
Sbjct: 1059 GAGILHEKATLNGAPHKTKMLAHFHVGDIITSIHKVSLV-------VGGREVLLYTGLQG 1111
Query: 942 VIGVIASLPHEQYL-FLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLI 1000
IG++ L ++ + FL L+ ++R + G +H WR + V K +DGDL
Sbjct: 1112 TIGILVPLTSKEDIEFLTMLEQHIRNEQGSLVGRDHLSWRGY-----YVPVKAVIDGDLC 1166
Query: 1001 ESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
E++ LS ++ I+ ++ +V ++ K+++++
Sbjct: 1167 ETYGGLSSSKQSAIASELDRTVGDVLKKLDQM 1198
>gi|325096432|gb|EGC49742.1| pre-mRNA-splicing factor Rse1 [Ajellomyces capsulatus H88]
Length = 1209
Score = 208 bits (529), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 271/1176 (23%), Positives = 486/1176 (41%), Gaps = 185/1176 (15%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
++G I L FR G ++D++ I ++ + ++++ S R + + G R
Sbjct: 64 VFGIIRALAAFRLAGSSKDYIIIGSDSGRITIIEY-VPSQNRFNRIHLETFGKSGIRRVI 122
Query: 65 NGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YG 119
GQ +DP R LI + L V+ + + +L + + + Q L G
Sbjct: 123 PGQYLAVDPKGRACLIASVEKNKLVYVLNRNTQAELTISSPLEAHKPQTLVFALTALDVG 182
Query: 120 CAKPTIVVLYQDNKDA-------------RHVKTYEVALKDKDFVEGPWSQNNLDNGADL 166
P L + +A +H+ YE+ L V WS + +D A +
Sbjct: 183 YDNPIFAALEVEYTEADQDPTGQAYEELEKHLVYYELDLGLNHVVRK-WS-DPVDRSASM 240
Query: 167 LIPVP-----PPLCGVLIIGEETIVYCSAN--AFKA-IPIRPSITKAYGRVDADGSRYLL 218
L VP P G L+ E+ I Y +N AF+ IP R T+ + + RY+
Sbjct: 241 LFQVPGGADGP--SGCLVCAEDNITYRHSNQDAFRVPIPRRSGPTE-----NPERKRYIT 293
Query: 219 GDHAGLLH---------------------LLVITHEKEKVTG----LKIELLGETSIAST 253
AG++H L ++ + ++TG LK++ IAS+
Sbjct: 294 ---AGVMHKMRGAFFFLLQTEDGDLFKVTLDMVEDDNGQITGEVSRLKLKYFDTVPIASS 350
Query: 254 ISYLDNAVVYIGSSYGDSQLIKL-------------------NLQPDAKGSY-------- 286
+ L + +++ S G+ + +L +Y
Sbjct: 351 LCILKSGFLFVASETGNHHFYQFEKLGDDDDEIEFTSDAYSADLSEPLPPAYFRPRPYEN 410
Query: 287 VEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG 346
V ++E +L P++D + +L Q+ T G S R +++G+ + E EL
Sbjct: 411 VNLVESIDSLNPLMDCKIANLTDDDAPQIYTICGTGARSSFRTLKHGLEVAEIVESELPS 470
Query: 347 I-KGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAI 405
+ +W+ + + +D FD ++++SF + T +L++ + +EE GF S TL
Sbjct: 471 VPSAVWTTKLTREDEFDAYIILSFTNGTLVLSIG--ETVEEVTDTGFLSSAPTLAVQQLG 528
Query: 406 YNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG- 464
+ L+QV +R + + R NEW +P S+ AT N QV +A G +VY E+
Sbjct: 529 EDSLIQVHPKGIRHIHADRR--VNEWPAPQHRSIVAATTNERQVAIALSSGEIVYFEMDT 586
Query: 465 DGILTEVKHAQ-LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKE 522
DG L E + + ++CL + + + S AVG D +VRI SL P+ L K
Sbjct: 587 DGSLAEYDEKRAMSGTVTCLSLGEVPRGRARSSFLAVGC-DDSTVRILSLDPESTLENKS 645
Query: 523 HLGGEIIPRSVLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQ 576
P ++ + A + YL L G L +L+ TGEL+D + LG +
Sbjct: 646 VQALTSAPSALSIMAMSDSTSGGSTLYLHIGLYSGIYLRTVLDEVTGELSDTRTRFLGLK 705
Query: 577 PITLRTFSSKNTTHVFAASDRPTVIYS--SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLA 634
P+ L S K T V A S RP + YS K + + ++ + F+S + +
Sbjct: 706 PVKLFQVSVKEQTAVLALSSRPWLGYSHLQTKAFMLTPLDYVSLEWGWNFSSEQCVEGMV 765
Query: 635 IAKEGELTIGTIDDIQ-KLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHF 693
+ L I +++ ++ L +IPL PR + F + +N + +
Sbjct: 766 GIQGQNLRIFSLEKLENNLLQETIPLQYTPRHFIKHPEHPLFYVIEAENNILSPGTR--- 822
Query: 694 VRLLDDQTFEFISTYPLDTFEYG-----------------------------------CS 718
+LL+D T PL E+G S
Sbjct: 823 TKLLNDSDAVNGDTTPLPPEEFGYPRGTGHWASCIQIVDPINSKRVISQIELEENEAAVS 882
Query: 719 ILSCSFSD-DSNVYYCVGTAY-VLPEENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVY 774
+ + FS D + VGT ++ T G I ++ E+GK L+ I + + +
Sbjct: 883 VAAVPFSSQDDETFLVVGTGKDMVVNPRSCTAGFIHIYRFQEEGKELEFIHKTKVEQPPM 942
Query: 775 SLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLM 834
+L F G+LLA I +++Y ++ R+ Q+ H L + +QT+G I+V D+
Sbjct: 943 ALLGFQGRLLAGIGTDLRIYDLGMK-QMLRKCQASVVPH---LVVGLQTQGSRIIVSDVQ 998
Query: 835 KSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGA 891
+S++ ++YK++E + D + W + ++D + G + NL+ +R K SE A
Sbjct: 999 ESLTYVVYKYQENRLIPFVDDVISRWTTCTTMVDYETVAGGDKFGNLWLLRCPAKASEEA 1058
Query: 892 TDEERG---------------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIF 936
++ G RL +V ++ + + LV D V T +
Sbjct: 1059 DEDGSGAHLIHERQYLQGAPNRLNLVAHFYPQDLPTSIQKAQLVTGGRDILVW---TGLQ 1115
Query: 937 GTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLD 996
GTV+ +I I+ E+ F + L+ L + G +H +RS+ AK +D
Sbjct: 1116 GTVSMLIPFIS---REEVDFFQSLEMQLAAQNPPLAGRDHLIYRSY-----YAPAKGTID 1167
Query: 997 GDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
GDL E++L L + +I+ ++ SV E+ +++ ++
Sbjct: 1168 GDLCETYLLLPNDKKQQIAGELDRSVREIERKIADM 1203
>gi|119473054|ref|XP_001258481.1| nuclear mRNA splicing factor, putative [Neosartorya fischeri NRRL
181]
gi|119406633|gb|EAW16584.1| nuclear mRNA splicing factor, putative [Neosartorya fischeri NRRL
181]
Length = 1209
Score = 208 bits (529), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 271/1168 (23%), Positives = 477/1168 (40%), Gaps = 169/1168 (14%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
++G I +L FR G ++D++ I ++ + ++++ S R + + G R
Sbjct: 64 VFGIIRSLAAFRLAGSSKDYIIIGSDSGRITIIEY-VPSQNRFNRIHLETFGKSGVRRVV 122
Query: 65 NGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YG 119
GQ +DP R LI + L V+ +++ +L + + + Q + G
Sbjct: 123 PGQYLAVDPKGRACLIASVEKNKLVYVLNRNSQAELTISSPLEAHKPQTVVFAMTALDVG 182
Query: 120 CAKPTIVVLYQDNKDARHVKT-------------YEVALKDKDFVEGPWSQNNLDNGADL 166
P L D +A T YE+ L V W+ + +D A +
Sbjct: 183 YENPIFAALEVDYSEADQDPTGRAYEESEKLLVYYELDLGLNHVVRK-WA-DPVDRTASM 240
Query: 167 LIPVPPPL---CGVLIIGEETIVYCSAN--AFKA-IPIRPSITKAYGRVDADGSRYLLGD 220
L VP GVL+ E++I Y +N AF+ IP R T+ R + + +
Sbjct: 241 LFQVPGGADGPSGVLVCAEDSITYRHSNQDAFRVPIPRRIGATENPERKRSIVAGVMHKM 300
Query: 221 HAGLLHLL-------------VITHEKEKVTG----LKIELLGETSIASTISYLDNAVVY 263
LL ++ + ++TG LKI+ IAS + L + +Y
Sbjct: 301 RGAFFFLLQTEDGDLFKVTIDMVEDDSGQLTGEVKRLKIKYFDTVPIASNLLILKSGFLY 360
Query: 264 IGSSYGDSQLIKL---------------NLQPDAKGSY------------VEVLERYVNL 296
+ S G+ + N D Y + ++E +L
Sbjct: 361 VASEAGNHHFYQFEKLGDDDEETEFSSENFPADPSVPYEPVFFRPRGAENLNLVETLNSL 420
Query: 297 GPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSLRS 355
P++D +V+L Q+ T SG S R +++G+ ++E EL + +W+ +
Sbjct: 421 NPLIDSKIVNLNEDDAPQIYTVSGTGARSSFRTLKHGLEVSEIVESELPSVPSAVWTTKL 480
Query: 356 STDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSG 415
+ D FD ++++SF + T +L++ + +EE GF S TL + L+QV
Sbjct: 481 TRADEFDAYIILSFANGTLVLSIG--ETVEEVTDTGFLSTAPTLAVQQLGEDSLIQVHPR 538
Query: 416 SVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTEV-KH 473
+R + + R NEW +P S+ A N QV +A G +VY E+ DG L E +
Sbjct: 539 GIRHILADRR--VNEWPAPQHRSIVAAATNERQVAVALSSGEIVYFEMDADGTLAEYDER 596
Query: 474 AQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPRS 532
Q+ ++CL + + E S AVG D +VRI SL PD L K P +
Sbjct: 597 RQMSGTVTCLSLGEVPEGRVRSSFLAVGC-DDSTVRILSLDPDSTLENKSVQALTSAPSA 655
Query: 533 VLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSK 586
+ + + S YL L G L +L+ TGEL+D + LG + + L S K
Sbjct: 656 LNIMSMADSSSGGTTLYLHIGLYSGVYLRTVLDEVTGELSDTRTRFLGAKQVKLFRVSVK 715
Query: 587 NTTHVFAASDRPTVIYS--SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIG 644
T V A S RP + YS K + + ++ + F+S + + + L I
Sbjct: 716 GQTAVLALSSRPWLGYSDIQTKGFMLTPLDYVGLEWGWNFSSEQCVEGMVGIQAQNLRIF 775
Query: 645 TIDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVR-------- 695
+I+ + + SIPL PRR+ + F + N + + +
Sbjct: 776 SIEKLDNNILQESIPLSNTPRRMLKHPEQPLFYVIESDNNVLSPATRARLIEDSKARNGE 835
Query: 696 --LLDDQTFEF----------------------ISTYPLDTFEYGCSILSCSFSD-DSNV 730
+L + F + IST L+ E S+ + FS D
Sbjct: 836 TNVLPPEDFGYPRATGHWASCIQIVDPLDAKAVISTIELEENEAAVSMAAVPFSSQDDET 895
Query: 731 YYCVGTAYVLPEENEPTKGRILVFIV---EDGK-LQLIAEKETKGAVYSLNAFNGKLLAA 786
+ VGTA + N P+ + I EDGK L+ I + + + +L F G+LLA
Sbjct: 896 FLVVGTAKDM-IVNPPSSAGGFIHIYRFQEDGKELEFIHKTKVEEPPLALLGFQGRLLAG 954
Query: 787 INQKIQLYKWMLRDDGTRELQSECGHHGHILALYV---QTRGDFIVVGDLMKSISLLIYK 843
I +++Y D G ++L +C +++ + QT+G I+V D+ +S++ ++YK
Sbjct: 955 IGSTLRVY-----DLGMKQLLRKC--QAPVVSKTIVGLQTQGSRIIVSDVRESVTYVVYK 1007
Query: 844 HEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDE------ 894
++E + D + W ++ ++D + G + NL+ VR K SE A ++
Sbjct: 1008 YQENVLIPFVDDSVSRWTTSTTMVDYETVAGGDKFGNLWLVRCPKKVSEEADEDGSGAHL 1067
Query: 895 --ERG-------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGV 945
ERG RL+++ + + LV G +++ G IG+
Sbjct: 1068 IHERGYLHGTPNRLDLMIHTYTQDIPTSLHKTQLV-------AGGRDILVWTGFQGTIGM 1120
Query: 946 IAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFL 1004
+ + E F + L+ L + G +H +RS+ K +DGDL E +
Sbjct: 1121 LVPFVSREDVDFFQNLEMQLASQCPPLAGRDHLIYRSY-----YAPVKGVIDGDLCEMYF 1175
Query: 1005 DLSRTRMDEISKTMNVSVEELCKRVEEL 1032
L I+ ++ SV E+ +++ ++
Sbjct: 1176 LLPNDTKMMIAAELDRSVREIERKISDM 1203
>gi|37359734|dbj|BAC97845.1| mKIAA0017 protein [Mus musculus]
Length = 1122
Score = 207 bits (527), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 257/1139 (22%), Positives = 471/1139 (41%), Gaps = 182/1139 (15%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+L V ++G I +L FR G +D++ + ++ + +L++ S + + + + G
Sbjct: 53 LLTVEVFGVIRSLMAFRLTGGTKDYIVVGSDSGRIVILEYQP-SKNMFEKIHQETFGKSG 111
Query: 61 -RPTDNGQIGIIDPDCRLIGLHLYDG--LFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
R GQ +DP R + + + L ++ D +L + + + L +
Sbjct: 112 CRRIVPGQFLAVDPKGRAVMISAIEKQKLVYILNRDAAARLTISSPLEAHKANTLVYHVV 171
Query: 118 ---YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNG 163
G P L D ++A + T E A + F E N+ L+
Sbjct: 172 GVDVGFENPMFACLEMDYEEADNDPTGEAAANTQQTLTFYELDLGLNHVVRKYSEPLEEH 231
Query: 164 ADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR--- 215
+ LI VP P GVLI E I Y N IR I + +D D R
Sbjct: 232 GNFLITVPGGSDGP--SGVLICSENYITY--KNFGDQPDIRCPIPRRRNDLD-DPERGMI 286
Query: 216 --------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAV 261
+L G + + + +++ VT ++++ +A+ + L
Sbjct: 287 FVCSATHKTKSMFFFLAQTEQGDIFKITLETDEDMVTEIRLKYFDTVPVAAAMCVLKTGF 346
Query: 262 VYIGSSYGDSQLIKLN------------------------LQPDAKGSYVEVLERYVNLG 297
+++ S +G+ L ++ QP + V +++ +L
Sbjct: 347 LFVASEFGNHYLYQIAHLGDDDEEPEFSSAMPLEEGDTFFFQPRPLKNLV-LVDELDSLS 405
Query: 298 PIVDFC-VVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRS 355
PI+ FC + DL + Q+ G SLR++R+G+ ++E A EL G +W++R
Sbjct: 406 PIL-FCQIADLANEDTPQLYVACGRGPRSSLRVLRHGLEVSEMAVSELPGNPNAVWTVRR 464
Query: 356 STDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSG 415
+D FD +++VSF++ T +L++ + +EE GF T TL C + LVQV
Sbjct: 465 HIEDEFDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSLLGDDALVQVYPD 522
Query: 416 SVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKH 473
+R + + R NEWK+P ++ N QV++A GG LVY E+ G L E +
Sbjct: 523 GIRHIRADKR--VNEWKTPGKKTIVKCAVNQRQVVIALTGGELVYFEMDPSGQLNEYTER 580
Query: 474 AQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIP-RS 532
++ ++ C+ + + S+ AVG+ D +VRI SL + + + L + +P +
Sbjct: 581 KEMSADVVCMSLANVPPGEQRSRFLAVGL-VDNTVRIISLDPSDCL--QPLSMQALPAQP 637
Query: 533 VLLCAFE----------------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQ 576
LC E G YL L +G LL +L+ TG+L+D + LG++
Sbjct: 638 ESLCIVEMGGTEKQDELGERGSIGFLYLNIGLQNGVLLRTVLDPVTGDLSDTRTRYLGSR 697
Query: 577 PITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIA 636
P+ L + V A S R + YS + + ++ + + F S P+ +
Sbjct: 698 PVKLFRVRMQGQEAVLAMSSRSWLSYSYQSRFHLTPLSYETLEFASGFASEQCPEGIVAI 757
Query: 637 KEGELTIGTIDDIQKLHIR-SIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVR 695
L I ++ + + + + PL PR+ +S I + + E ++ +
Sbjct: 758 STNTLRILALEKLGAVFNQVAFPLQYTPRKFVIHPESNNLIIIETDHNAYTEATKAQRKQ 817
Query: 696 LLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFI 755
ILS VY Y L E
Sbjct: 818 ----------------------QILSPRSVAGGFVY-----TYKLVNNGE---------- 840
Query: 756 VEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGH 815
KL+ + + + ++ F G++L + + +++Y D G ++L +C + H
Sbjct: 841 ----KLEFLHKTPVEEVPAAIAPFQGRVLIGVGKLLRVY-----DLGKKKLLRKC-ENKH 890
Query: 816 ILALY---VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIY 872
I A Y +QT G ++V D+ +S + YK E + A D W++ +LD D
Sbjct: 891 I-ANYISGIQTIGHRVIVSDVQESFIWVRYKRNENQLIIFADDTYPRWVTTASLLDYDTV 949
Query: 873 LGAENNFNLFTVR--KNSEGATDE---------ERGRL-------EVVGEYHLGEFVNRF 914
GA+ N+ VR N+ DE +RG L EV+ YH+GE V
Sbjct: 950 AGADKFGNICVVRLPPNTNDEVDEDPTGNKALWDRGLLNGASQKAEVIMNYHVGETVLSL 1009
Query: 915 RHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGG 973
+ +L+ G ++++ T++G IG++ +E + F + ++ +LR + G
Sbjct: 1010 QKTTLI-------PGGSESLVYTTLSGGIGILVPFTSYEDHDFFQHVEMHLRSEHPPLCG 1062
Query: 974 LNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
+H +RS+ KN +DGDL E F + + +S+ ++ + E+ K++E++
Sbjct: 1063 RDHLSFRSY-----YFPVKNVIDGDLCEQFNSMEPNKQKNVSEELDRTPPEVSKKLEDI 1116
>gi|357528934|sp|Q5B1X8.2|RSE1_EMENI RecName: Full=Pre-mRNA-splicing factor rse1
gi|259485100|tpe|CBF81885.1| TPA: Pre-mRNA-splicing factor rse1
[Source:UniProtKB/Swiss-Prot;Acc:Q5B1X8] [Aspergillus
nidulans FGSC A4]
Length = 1209
Score = 207 bits (527), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 281/1165 (24%), Positives = 477/1165 (40%), Gaps = 163/1165 (13%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
++G I TL FR G +D++ I ++ + ++++ S R + + G R
Sbjct: 64 VFGIIRTLAAFRLAGSNKDYIIIGSDSGRITIIEY-VPSQNRFNRIHLETFGKSGVRRVV 122
Query: 65 NGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLD---IKFLYG 119
GQ +DP R LI + L V+ +++ +L + + + Q L + G
Sbjct: 123 PGQYLAVDPKGRACLIASVEKNKLVYVLNRNSQAELTISSPLEAHKPQTLVYSVVALDAG 182
Query: 120 CAKPTIVVLYQDNKDARHVKT-------------YEVALKDKDFVEGPWSQNNLDNGADL 166
P L D ++ T YE+ L V W+ + +D + +
Sbjct: 183 YENPVFAALEVDYSESDQDPTGRAYEEVEKLLVYYELDLGLNHVVRK-WT-DPVDRTSSM 240
Query: 167 LIPVPPPL---CGVLIIGEETIVYCSAN--AFKA-IPIRPS----------ITKAYGRVD 210
L VP GVL+ E+ I Y +N AF+ IP R IT
Sbjct: 241 LFQVPGGADGPSGVLVCAEDNITYRHSNQDAFRVPIPRRKGAMENPERKRCITAGVMHKM 300
Query: 211 ADGSRYLLGDHAG-LLHLLVITHEKEK------VTGLKIELLGETSIASTISYLDNAVVY 263
+LL G L L + E +K V GLKI+ +AS++ L + +Y
Sbjct: 301 RGAFFFLLQTEDGDLFKLTLDMVEDDKGQLTGEVKGLKIKYFDTVPLASSLLILKSGFLY 360
Query: 264 IGSSYGDSQLIKL-----------------NLQPDAKGSYV----------EVLERYVNL 296
+ + G+ + + P A + V ++E +L
Sbjct: 361 VAAEGGNHHFYQFEKLGDDDEETEFNSDDFSADPAAPCTPVYFQPRGAENLNLVEAINSL 420
Query: 297 GPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSLRS 355
P+VD VV++ Q+ T SG + R +++G+ ++E EL + +W+ +
Sbjct: 421 NPLVDSKVVNISEDDAPQIFTVSGTGARSTFRTLKHGLEVSEIVESELPSVPSAVWTTKL 480
Query: 356 STDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSG 415
+ D FD ++V+SF + T +L++ + +EE GF S TL + L+Q+
Sbjct: 481 TRADEFDAYIVLSFANGTLVLSIG--ETVEEVTDTGFLSSAPTLAVQQLGEDSLIQIHPR 538
Query: 416 SVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTEV-KH 473
+R + + R NEW +P S+ A N QV +A G +VY E+ DG L E +
Sbjct: 539 GIRHILADRR--VNEWPAPQHRSIVAAATNERQVAVALSSGEIVYFELDADGSLAEYDER 596
Query: 474 AQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPRS 532
Q+ ++CL + + E S AVG D +VRI SL PD L K P +
Sbjct: 597 RQMSGTVTCLSLGEVPEGRVRSSFLAVGC-DDSTVRILSLDPDTTLENKSVQALTAAPSA 655
Query: 533 VLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSK 586
+ + A S YL L G L L+ TGEL+D + LG++ + L S
Sbjct: 656 LNIIAMADSSSGGTTLYLHIGLHSGVYLRTALDEVTGELSDTRTRFLGSKAVKLFQVSVT 715
Query: 587 NTTHVFAASDRPTVIYS--SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIG 644
T V A S RP + YS K + + ++ + F+S + + + L I
Sbjct: 716 GQTAVLALSSRPWLGYSDTQTKGFMLTPLDYVGLEWGWNFSSEQCVEGMVGIQGQNLRIF 775
Query: 645 TIDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFE 703
+I+ + + +SIPL PR + F + N + + RLL+D
Sbjct: 776 SIEKLDNNMLQQSIPLAYTPRHFIKHPEEPLFYVIEADNNVLSPATR---ARLLEDSKAR 832
Query: 704 FIST---------YP--------------------------LDTFEYGCSILSCSF-SDD 727
T YP L+ E SI + F S D
Sbjct: 833 GGDTTVLPPEDFGYPRGTGHWASCIQIIDPLDAKAVVGAVELEENEAAVSIAAVPFTSQD 892
Query: 728 SNVYYCVGTAYVLPEENEPTK--GRILVF-IVEDGK-LQLIAEKETKGAVYSLNAFNGKL 783
+ VGTA + N P+ G I ++ EDGK L+ I + + + +L F G+L
Sbjct: 893 DETFLVVGTAKDM-TVNPPSSAGGYIHIYRFQEDGKELEFIHKTKVEEPPLALLGFQGRL 951
Query: 784 LAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY-VQTRGDFIVVGDLMKSISLLIY 842
LA + +++Y D G ++L +C A+ +QT+G IVV D+ +S++ ++Y
Sbjct: 952 LAGVGSVLRIY-----DLGMKQLLRKCQAAVAPKAIVGLQTQGSRIVVSDVRESVTYVVY 1006
Query: 843 KHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDEERGRL 899
K+++ + D A W +A ++D + G + NL+ VR K SE A +E G
Sbjct: 1007 KYQDNVLIPFVDDSIARWTTAATMVDYETTAGGDKFGNLWLVRCPKKASEEADEEGSGAH 1066
Query: 900 EVVGEYHLGEFVNRFRHGSLVMRLPDSDV-----------GQIPTVIFGTVNGVIGVIAS 948
+ +L NR L++ + D+ G +++ G IG++
Sbjct: 1067 LIHDRGYLQGTPNRLE---LMIHVFTQDIPTSLHKTQLVAGGRDILVWTGFQGTIGILVP 1123
Query: 949 -LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLS 1007
+ E F + L+ L + G +H +RS+ K V +DGDL E + LS
Sbjct: 1124 FVSREDVDFFQSLEMQLASQCPPLAGRDHLIYRSYYAPVKGV-----IDGDLCEQYFLLS 1178
Query: 1008 RTRMDEISKTMNVSVEELCKRVEEL 1032
I+ ++ SV E+ +++ ++
Sbjct: 1179 NDTKMMIAAELDRSVREIERKISDM 1203
>gi|225560964|gb|EEH09245.1| pre-mRNA-splicing factor rse1 [Ajellomyces capsulatus G186AR]
Length = 1209
Score = 207 bits (527), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 270/1174 (22%), Positives = 485/1174 (41%), Gaps = 181/1174 (15%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
++G I L FR G ++D++ I ++ + ++++ S R + + G R
Sbjct: 64 VFGIIRALAAFRLAGSSKDYIIIGSDSGRITIIEY-VPSQNRFNRIHLETFGKSGIRRVI 122
Query: 65 NGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YG 119
GQ +DP R LI + L V+ + + +L + + + Q L G
Sbjct: 123 PGQYLAVDPKGRACLIASVEKNKLVYVLNRNTQAELTISSPLEAHKPQTLVFALTALDVG 182
Query: 120 CAKPTIVVLYQDNKDA-------------RHVKTYEVALKDKDFVEGPWSQNNLDNGADL 166
P L + +A +H+ YE+ L V WS + +D A +
Sbjct: 183 YDNPIFAALEVEYTEADQDPTGQAYEELEKHLVYYELDLGLNHVVRK-WS-DPVDRSASM 240
Query: 167 LIPVPPPL---CGVLIIGEETIVYCSAN--AFKA-IPIRPSITKAYGRVDADGSRYLLGD 220
L VP G L+ E+ I Y +N AF+ IP R T+ + + RY+
Sbjct: 241 LFQVPGGADGPSGCLVCAEDNITYRHSNQDAFRVPIPRRSGPTE-----NPERKRYIT-- 293
Query: 221 HAGLLH---------------------LLVITHEKEKVTG----LKIELLGETSIASTIS 255
AG++H L ++ + ++TG LK++ IAS++
Sbjct: 294 -AGVMHKMRGAFFFLLQTEDGDLFKVTLDMVEDDNGQITGEVSRLKLKYFDTVPIASSLC 352
Query: 256 YLDNAVVYIGSSYGDSQLIKL-------------------NLQPDAKGSY--------VE 288
L + +++ S G+ + +L +Y V
Sbjct: 353 ILKSGFLFVASETGNHHFYQFEKLGDDDDEIEFTSDAYSADLSEPLPPAYFRPRPYENVN 412
Query: 289 VLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI- 347
++E +L P++D + +L Q+ T G S R +++G+ + E EL +
Sbjct: 413 LVESIDSLNPLMDCKIANLTDDDAPQIYTICGTGARSSFRTLKHGLEVAEIVESELPSVP 472
Query: 348 KGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYN 407
+W+ + + +D FD ++++SF + T +L++ + +EE GF S TL +
Sbjct: 473 SAVWTTKLTREDEFDAYIILSFTNGTLVLSIG--ETVEEVTDTGFLSSAPTLAVQQLGED 530
Query: 408 QLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DG 466
L+QV +R + + R NEW +P S+ AT N QV +A G +VY E+ DG
Sbjct: 531 SLIQVHPKGIRHIHADRR--VNEWPAPQHRSIVAATTNERQVAIALSSGEIVYFEMDTDG 588
Query: 467 ILTEVKHAQ-LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHL 524
L E + + ++CL + + + S AVG D +VRI SL P+ L K
Sbjct: 589 SLAEYDEKRAMSGTVTCLSLGEVPRGRARSSFLAVGC-DDSTVRILSLDPESTLENKSVQ 647
Query: 525 GGEIIPRSVLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPI 578
P ++ + A + YL L G L +L+ TGEL+D + LG +P+
Sbjct: 648 ALTSAPSALSIMAMSDSTSGGSTLYLHIGLYSGIYLRTVLDEVTGELSDTRTRFLGLKPV 707
Query: 579 TLRTFSSKNTTHVFAASDRPTVIYS--SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIA 636
L S K T V A S RP + YS K + + ++ + F+S + +
Sbjct: 708 KLFQVSVKEQTAVLALSSRPWLGYSHLQTKAFMLTPLDYVGLEWGWNFSSEQCVEGMVGI 767
Query: 637 KEGELTIGTIDDIQ-KLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVR 695
+ L I +++ ++ L +IPL PR + F + +N + + +
Sbjct: 768 QGQNLRIFSLEKLENNLLQETIPLQYTPRHFIKHPEYPLFYVIEAENNILSPGTR---TK 824
Query: 696 LLDDQTFEFISTYPLDTFEYG-----------------------------------CSIL 720
LL+D T PL E+G S+
Sbjct: 825 LLNDSDAVNGDTTPLPPEEFGYPRGTGHWASCIQIVDPINSKRVISQIELEENEAAVSVA 884
Query: 721 SCSFSD-DSNVYYCVGTAY-VLPEENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVYSL 776
+ FS D + VGT ++ T G I ++ E+GK L+ I + + + +L
Sbjct: 885 AVPFSSQDDETFLVVGTGKDMVVNPRSCTAGFIHIYRFQEEGKELEFIHKTKVEQPPMAL 944
Query: 777 NAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKS 836
F G+LLA I +++Y ++ R+ Q+ H L + +QT+G I+V D+ +S
Sbjct: 945 LGFQGRLLAGIGTDLRIYDLGMK-QMLRKCQASVVPH---LVVGLQTQGSRIIVSDVQES 1000
Query: 837 ISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATD 893
++ ++YK++E + D + W + ++D + G + NL+ +R K SE A +
Sbjct: 1001 LTYVVYKYQENRLIPFVDDVISRWTTCTTMVDYETVAGGDKFGNLWLLRCPAKASEEADE 1060
Query: 894 EERG---------------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGT 938
+ G RL +V ++ + + LV D V T + GT
Sbjct: 1061 DGSGAHLIHERQYLQGAPNRLNLVAHFYPQDLPTSIQKAQLVTGGRDILVW---TGLQGT 1117
Query: 939 VNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGD 998
V+ +I I+ E+ F + L+ L + G +H +RS+ AK +DGD
Sbjct: 1118 VSMLIPFIS---REEVDFFQSLEMQLAAQNPPLAGRDHLIYRSY-----YAPAKGTIDGD 1169
Query: 999 LIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
L E++L L + +I+ ++ SV E+ +++ ++
Sbjct: 1170 LCETYLLLPNDKKQQIAGELDRSVREIERKIADM 1203
>gi|302406266|ref|XP_003000969.1| pre-mRNA-splicing factor rse-1 [Verticillium albo-atrum VaMs.102]
gi|261360227|gb|EEY22655.1| pre-mRNA-splicing factor rse-1 [Verticillium albo-atrum VaMs.102]
Length = 1059
Score = 207 bits (526), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 248/989 (25%), Positives = 424/989 (42%), Gaps = 143/989 (14%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
++G I ++ FR G +D+L +AT+ + ++++ + + R
Sbjct: 64 VFGIIRSMAAFRIAGSNKDYLILATDSGRIAIIEYLPAQNRFQRLHLETFGKSGIRRVVP 123
Query: 66 GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEE--LQVLDIKFL-YGC 120
G+ DP R LI + L V+ +++ +L + + + + VL + L G
Sbjct: 124 GEFLACDPKGRACLIASLEKNKLVYVLNRNSQAELTISSPLEAHKPGVHVLSMVALDVGY 183
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDK------------DFVEGPWSQNNLDNGADLLI 168
A P L D +A T + AL + + V WS+ +DN A LL
Sbjct: 184 ANPVFAALETDYTEADQDPTGQAALDVETQLVYYELDLGLNHVVRKWSEP-VDNTASLLF 242
Query: 169 PVP-----PPLCGVLIIGEETIVYCSAN--AFKA-IPIRPSITK--AYGRVDADGSRYLL 218
VP P GVL+ GEE I Y +N AF+ +P R T+ + R G + L
Sbjct: 243 QVPGGNDGP--SGVLVCGEENITYRHSNQEAFRVPVPRRRGATEDPSRKRSIVAGVMHKL 300
Query: 219 GDHAGLLHLLVITHEKE------------------KVTGLKIELLGETSIASTISYLDNA 260
AG L+ T + + +V LKI+ +AS++ L +
Sbjct: 301 KGSAGAFFFLLQTEDGDLFKITIDMIEDRDGNPTGEVKRLKIKYFDTIPVASSLCILKSG 360
Query: 261 VVYIGSSYGDSQLIKL---------------NLQPDAKGSY------------VEVLERY 293
+Y+ S +G+ Q + + D K SY + ++E
Sbjct: 361 FLYVASQFGNYQFYQFEKLGDDDEELEFSSDDFPTDPKQSYEAVFFHPRELENLALVESI 420
Query: 294 VNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWS 352
++ P++D V +L + Q+ T G + RI+++G+ +NE + EL GI +W+
Sbjct: 421 DSMNPLIDCKVANLTGEDAPQIYTACGNGARSTFRILKHGLEVNEIVASELPGIPSAVWT 480
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
L+ S D +D ++V+SF + T +L++ + +EE GF + TL L+QV
Sbjct: 481 LKLSRGDQYDAYIVLSFTNATLVLSIG--ETVEEVNDSGFLTSVPTLAAQLLGGEGLIQV 538
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTEV 471
+R + + NEW +P S+ AT N QV +A G +VY E+ DG L E
Sbjct: 539 HPKGIRHIRNGQV---NEWAAPQHRSIVAATTNEHQVAVALSSGEIVYFEMDSDGSLAEY 595
Query: 472 KHAQLEY-EISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEII 529
+ + ++CL + + E S AVG D +VR+ SL P+ L +K
Sbjct: 596 DEKKEMFGTVTCLSLGEVPEGRLRSSFLAVGC-DDSTVRVLSLDPETTLESKSVQALTAP 654
Query: 530 PRSVLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTF 583
P S+ + A E S YL L G L +L+ TGELTD ++ LG +P+ L
Sbjct: 655 PTSLAVIAMEDSSSGGSTLYLHIGLHSGVYLRTVLDEVTGELTDTRQKFLGPKPVKLFQV 714
Query: 584 SSKNTTHVFAASDRPTVIYSS--NKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGEL 641
+ + T V S RP + Y+ K + + +N ++ F+S + + + L
Sbjct: 715 TVQGRTCVLGLSSRPWLGYADPITKGFVVTPLNYVDLEWGWNFSSEQCEEGVVGIQGQSL 774
Query: 642 TIGTIDDIQ-KLHIRSIPLGEHPRRIC-HQEQSRTFAICS--------LKNQSCAEES-- 689
I I+ + L +SIPL PRR+ H E + I S L+ Q A+ S
Sbjct: 775 RIFAIEKLSDTLTQKSIPLTYTPRRMVKHPEHPMFYTIESDNNTLPPELRAQLLADPSVV 834
Query: 690 ----------EMHFVR----------LLD--DQTFEFISTYPLDTFEYGCSILSCSF-SD 726
E + R ++D + + + T LD E S F S
Sbjct: 835 NGDARTLPPVEFGYPRAKGRWASCISVIDPLSEELQTLQTVDLDNNEAAVSAAIVPFTSQ 894
Query: 727 DSNVYYCVGTAY-VLPEENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVYSLNAFNGKL 783
D+ + VGT ++ + T+G I ++ EDG+ L+ I + + + +L AF G+L
Sbjct: 895 DNESFLVVGTGKDMIVNPRQFTEGYIHIYRFSEDGRELEFIHKTKVEEPPTALLAFQGRL 954
Query: 784 LAAINQKIQLY----KWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISL 839
+A + + +++Y K ML R+ Q++ L + + T+G IVVGD+ + ++
Sbjct: 955 VAGVGKTLRIYDLGQKQML-----RKAQADVAPQ---LIVSLSTQGSRIVVGDVQQGVTY 1006
Query: 840 LIYKHEEGAIEERARDYNANWMSAVEILD 868
++YK + D A WM+ ++D
Sbjct: 1007 VVYKALSNKLIPFVDDTVARWMTCTTMVD 1035
>gi|332263858|ref|XP_003280968.1| PREDICTED: splicing factor 3B subunit 3 [Nomascus leucogenys]
Length = 1271
Score = 207 bits (526), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 270/1199 (22%), Positives = 495/1199 (41%), Gaps = 203/1199 (16%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+L V ++G I +L FR G ++++ + ++ + +L++ S + + + + G
Sbjct: 103 LLIVEVFGVIRSLMAFRVTGGTKEYIVVGSDSGRIVILEYQP-SKNMFEKIHQETFGKSG 161
Query: 61 -RPTDNGQIGIIDPDCRLIGL---HLYDGLFKVIPFDNKGQLKEAFNIRLEELQVL---- 112
R GQ +DP R + + H L ++ D +L + + + L
Sbjct: 162 CRRIVPGQFLAVDPKGRAVMISKCHEKQKLVYILNRDAAARLTISSPLEAHKANTLVYHV 221
Query: 113 ---DIKF---LYGCAK---------PTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQ 157
D+ F ++ C + P I + + YE+ L V
Sbjct: 222 VGVDVGFENPMFACLEMDYEIFDCMPLISFYATVLCSQQTLTFYELDLGLNHVVRK--YS 279
Query: 158 NNLDNGADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDAD 212
L+ + LI VP P GVLI E I Y N IR I + +D D
Sbjct: 280 EPLEEHGNFLITVPGGSDGP--SGVLICSENYITY--KNFGDQPDIRCPIPRRRNDLD-D 334
Query: 213 GSR-----------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTIS 255
R +L G + + + +++ VT ++++ +A+ +
Sbjct: 335 PERGMIFVCSATHKTKSMFFFLAQTEQGDIFKITLETDEDMVTEIRLKYFDTVPVAAAMC 394
Query: 256 YLDNAVVYIGSSYGDSQLIKLN------------------------LQPDAKGSYVEVLE 291
L +++ S +G+ L ++ QP + V +++
Sbjct: 395 VLKTGFLFVASEFGNHYLYQIAHLGDDDEEPEFSSAMPLEEGDTFFFQPRPLKNLV-LVD 453
Query: 292 RYVNLGPIVDFC-VVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKG 349
+L PI+ FC + DL + Q+ G SLR++R+G+ ++E A EL G
Sbjct: 454 ELDSLSPIL-FCQIADLANEDTPQLYVACGRGPRSSLRVLRHGLEVSEMAVSELPGNPNA 512
Query: 350 MWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQL 409
+W++R +D FD +++VSF++ T +L++ + +EE GF T TL C + L
Sbjct: 513 VWTVRRHIEDEFDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSLLGDDAL 570
Query: 410 VQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGIL 468
VQV +R + + R NEWK+P ++ N QV++A GG LVY E+ G L
Sbjct: 571 VQVYPDGIRHIRADKR--VNEWKTPGKKTIVKCAVNQRQVVIALTGGELVYFEMDPSGQL 628
Query: 469 TE-VKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGE 527
E + ++ ++ C+ + + S+ AVG+ D +VRI SL + + + L +
Sbjct: 629 NEYTERKEMSADVVCMSLANVPPGEQRSRFLAVGL-VDNTVRIISLDPSDCL--QPLSMQ 685
Query: 528 IIP-RSVLLCAFE----------------GISYLLCALGDGHLLNFLLNMKTGELTDRKK 570
+P + LC E G YL L +G LL +L+ TG+L+D +
Sbjct: 686 ALPAQPESLCIVEMGGTEKQDELGERGSIGFLYLNIGLQNGVLLRTVLDPVTGDLSDTRT 745
Query: 571 VSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFP 630
LG++P+ L + V A S R + YS + + ++ + + F S P
Sbjct: 746 RYLGSRPVKLFRVRMQGQEAVLAMSSRSWLSYSYQSRFHLTPLSYETLEFASGFASEQCP 805
Query: 631 DSLAIAKEGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQEQSR-----------TFAI 677
+ + L I ++ + + + + PL PR+ + H E + T A
Sbjct: 806 EGIVAISTNTLRILALEKLGAVFNQVAFPLQYTPRKFVIHPESNNLIIIETDHNAYTEAT 865
Query: 678 CSLKNQSCAEE-------------SEM-------------------------HFVRLLDD 699
+ + Q AEE +EM +R+++
Sbjct: 866 KAQRKQQMAEEMVEAAGEDERELAAEMAAAFLNENLPESIFGAPKAGNGQWASVIRVMNP 925
Query: 700 QTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC-VGTAYVLPEENEPTKGRILVF--IV 756
+ L+ E S+ C FS+ +Y VG A L G + +V
Sbjct: 926 IQGNTLDLVQLEQNEAAFSVAVCRFSNTGEDWYVLVGVAKDLILNPRSVAGGFVYTYKLV 985
Query: 757 EDG-KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGH 815
+G KL+ + + + ++ F G++L + + +++Y D G ++L +C + H
Sbjct: 986 NNGEKLEFLHKTPVEEVPAAIAPFQGRVLIGVGKLLRVY-----DLGKKKLLRKC-ENKH 1039
Query: 816 ILALY---VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIY 872
I A Y +QT G ++V D+ +S + YK E + A D W++ +LD D
Sbjct: 1040 I-ANYISGIQTIGHRVIVSDVQESFIWVRYKRNENQLIIFADDTYPRWVTTASLLDYDTV 1098
Query: 873 LGAENNFNLFTVR--KNSEGATDE---------ERGRL-------EVVGEYHLGEFVNRF 914
GA+ N+ VR N+ DE +RG L EV+ YH+GE V
Sbjct: 1099 AGADKFGNICVVRLPPNTNDEVDEDPTGNKALWDRGLLNGASQKAEVIMNYHVGETVLSL 1158
Query: 915 RHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGG 973
+ +L+ G ++++ T++G IG++ HE + F + ++ +LR + G
Sbjct: 1159 QKTTLI-------PGGSESLVYTTLSGGIGILVPFTSHEDHDFFQHVEMHLRSEHPPLCG 1211
Query: 974 LNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
+H +RS+ KN +DGDL E F + + +S+ ++ + E+ K++E++
Sbjct: 1212 RDHLSFRSY-----YFPVKNVIDGDLCEQFNSMEPNKQKNVSEELDRTPPEVSKKLEDI 1265
>gi|384490247|gb|EIE81469.1| hypothetical protein RO3G_06174 [Rhizopus delemar RA 99-880]
Length = 1197
Score = 207 bits (526), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 266/1177 (22%), Positives = 485/1177 (41%), Gaps = 195/1177 (16%)
Query: 7 YGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTDN 65
+G I ++ FR G ++DF+ + ++ + +L+++ + + + + + G R
Sbjct: 59 FGLIRSIAPFRLTGGSKDFIVVGSDSGRIVILEYNPVKN-IFEKVHQETYGKTGCRRIVP 117
Query: 66 GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIR-------LEELQVLDIKF 116
GQ DP R +I L ++ D+ L + + + + +D+ F
Sbjct: 118 GQYLTADPKGRAVMISATEKQKLVYILNRDSAANLTISSPLEAHRSHAIIHHIVAVDVGF 177
Query: 117 LYGCAKPTIVVLYQDNKDARHVKTYEVALKDK------------DFVEGPWSQNNLDNGA 164
PT L D DA T E A + + V WS + +D A
Sbjct: 178 ----ENPTFACLEVDYADADTDSTGEAAQNAEKILTYYELDLGLNHVVRKWS-DVVDRKA 232
Query: 165 DLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLL- 218
+LL+PVP P GVL+ E I Y +A + +R I + ++ D R ++
Sbjct: 233 NLLVPVPGGNDGP--SGVLVCTENFIAYKHPDAEE---LRIPIPRRAQPLE-DPKRGIII 286
Query: 219 ---GDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQ--- 272
G L+ + + H + V GL I+ A ++S L + ++ S +G+ +
Sbjct: 287 IASASEEGDLYKVTMDHVQGTVQGLTIKYFDTIPPAISLSILKSGFLFAASEFGNHRHYF 346
Query: 273 -----------------------------LIKLNLQPDAKGSYVEVLERYVNLGPIVDFC 303
L+ +P S V+ LE ++ P+++
Sbjct: 347 FEGLGDDDDDAEISSAVYMEQETYEGGIPLVYFKPRPLKNLSLVDELE---SMAPLMESK 403
Query: 304 VVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTDDPFD 362
V++L + ++ G + RI+ G+ E A EL G +W+ + DD +
Sbjct: 404 VLNLADEETPRIYAICGRGAQSTFRILNQGVEAAELAVSELPGNPSAVWTTKLRADDQYH 463
Query: 363 TFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSS 422
++VVSF + T +L++ + +EE GF + TL + LVQV +R + +
Sbjct: 464 AYIVVSFANATLVLSIG--ETVEEVTDTGFLTNAPTLAVQQIGEDALVQVHPHGIRHIRA 521
Query: 423 TSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD-GILTE-VKHAQLEYEI 480
R NEW++P G ++ A N+ Q+ +A G +VY E+ + G L E +H Q+ I
Sbjct: 522 DRR--VNEWRAPQGQTIVEAATNSRQIAIALSNGEIVYFEMDNMGQLNEHQEHRQMSAYI 579
Query: 481 SCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PD--LNLITKEHLGGEIIPRSVLLCA 537
+ L + + E ++ AVG D +VRI SL PD L I+ + L G +P S LC
Sbjct: 580 TTLALGEVPEGRVRARYIAVGC-EDQTVRILSLDPDSCLEPISMQALQG--VPSS--LCI 634
Query: 538 FE----------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKN 587
E G YL L +G L +L+ TG+L+D + +G + + L S +
Sbjct: 635 IEMMDTGIEQGHGTQYLNIGLSNGIFLRTILDTITGQLSDTRARFIGAKSVKLFRISIQG 694
Query: 588 TTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTID 647
V A S +P V Y+ +L + ++ + + H F + P+ + L I T++
Sbjct: 695 HPAVLALSTKPWVSYTFQNRLYLTPLSYEMLEHGSAFVTEQCPEGVVAVAGNTLRIFTVE 754
Query: 648 DIQKLHIR-SIPLGEHPRRICHQEQSRTFAIC---------SLKNQSCAEESEMHF---- 693
+ + + SIPL PR+ +RTF + S K + E+ F
Sbjct: 755 KLGNIFNQVSIPLKYTPRKFALHAPTRTFVVIESDHATFSPSEKAKGIIEKENEGFEIDD 814
Query: 694 -----------------------VRLLDDQTFEFISTYPLDTFEYGCSILSCSF------ 724
+RL++ + T L+ E S+ F
Sbjct: 815 DITNLDPLQFGHVRNAAGRWASCIRLIEPFEGRTLETIELEDNEAAFSVAMVQFRQNPHA 874
Query: 725 SDDSNVYYCVGTAY-VLPEENEPTKGRILVFIVEDG-----KLQLIAEKETKGAVYSLNA 778
++ S + VGT V T G + V+ V G +L I + Y++ A
Sbjct: 875 TNSSEQFVIVGTGQNVNLSPRSCTSGYLHVYRVVQGEQGQLRLHFIHKTPIDDVPYAMLA 934
Query: 779 FNGKLLAAINQKIQLY----KWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLM 834
F G+LL + +++Y K MLR T+ + + C I++L+ T+G ++ D+
Sbjct: 935 FQGRLLVGAGKSLRIYDIGKKKMLRKCETKSIPN-C-----IVSLH--TQGHRVIATDVQ 986
Query: 835 KSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR--------- 885
+S+ +IYKH + + A D WM+ ++D + G + N F R
Sbjct: 987 ESVHYVIYKHADNRLVVFADDTIPRWMTGSTMVDYETVAGGDKFGNFFVSRLPGSISREV 1046
Query: 886 ---------KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIF 936
+ +G +++ + EY G+ + +L+ G V+
Sbjct: 1047 DEDTTGNRIYHEKGYLQGAPNKIDSLCEYFTGDIITSLHKTTLLS-------GGREVVLT 1099
Query: 937 GTVNGVIGV-IASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFL 995
+ G I + + L E F + L+ ++R + G +H +RS+ + K+ +
Sbjct: 1100 TSFLGAISIYVPFLSKEDVEFFQMLEMHMRAEAPPLAGRDHLLYRSY-----YIPVKSVI 1154
Query: 996 DGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
DGDL E F L+ + I++ ++ SV ++ K++E++
Sbjct: 1155 DGDLCEQFNTLAAEKKRMIAEELDRSVADVQKKIEDM 1191
>gi|31213215|ref|XP_315551.1| AGAP005549-PA [Anopheles gambiae str. PEST]
gi|21299714|gb|EAA11859.1| AGAP005549-PA [Anopheles gambiae str. PEST]
Length = 1217
Score = 206 bits (524), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 262/1189 (22%), Positives = 477/1189 (40%), Gaps = 187/1189 (15%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+L ++G + +L FR G ++D+ I ++ + +L+++ ++L
Sbjct: 53 LLQTEVFGVVRSLMSFRLTGGSKDYAVIGSDSGRIVILEYNPAKNQLEKVHQETFGKSGC 112
Query: 61 RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL- 117
R GQ IDP R +IG L ++ D++ +L + + + L +
Sbjct: 113 RRIVPGQYLAIDPKGRAVMIGAVEKQKLVYILNRDSEARLTISSPLEAHKSNTLTYHMVG 172
Query: 118 --YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNGA 164
G P L D ++A T E A K + F E N+ L+ A
Sbjct: 173 VDVGFENPMFACLEIDYEEADTDPTGEAATKTQQTLTFYELDLGLNHVVRKYSEPLEEHA 232
Query: 165 DLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR---- 215
+ LI VP P GVLI E + Y N IR I + +D D R
Sbjct: 233 NFLISVPGGNDGP--SGVLICSENYLTY--KNLGDQHDIRCPIPRRRNDLD-DPERGMIF 287
Query: 216 -------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVV 262
+L+ G + + + + + V+ +K++ A+ + L +
Sbjct: 288 ICSATHRTKSMYFFLVQTEQGDIFKVTLETDDDVVSEIKLKYFDTVPPATAMCVLKTGFL 347
Query: 263 YIGSSYGDSQLIKL-------------NLQPDAKGSYVEVLERYV----------NLGPI 299
++ +G+ L ++ + P +G R + + PI
Sbjct: 348 FVACEFGNHYLYQIAHLGDDDDEPEFSSAMPLEEGDTFFFAPRQLKNLVMVDDIPSYAPI 407
Query: 300 VDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTD 358
+ V DL + Q+ G S+R++R+G+ ++E A EL G +W+++ D
Sbjct: 408 LGCQVADLANEDTPQLYLACGRGPRSSIRVLRHGLEVSEMAVSELPGNPNAVWTVKKRID 467
Query: 359 DPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVR 418
D FD +++VSF++ T +L++ D +EE GF T TL C + LVQV +R
Sbjct: 468 DEFDAYIIVSFVNATLVLSIG--DTVEEVTDSGFLGTTPTLCCSALGDDALVQVYPDGIR 525
Query: 419 LVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD-GILTE-VKHAQL 476
+ + R NEWK+P ++ N QV++A GG LVY E+ G L E + ++
Sbjct: 526 HIRADKR--VNEWKAPGKKTIMKCAVNQRQVVIALSGGELVYFEMDPTGQLNEYTERKKM 583
Query: 477 EYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSV-LL 535
E+ C+ + + S AVG+ D +VRI SL + ++ + + +P + L
Sbjct: 584 PSEVMCMALGSVPSGEQRSWFLAVGL-ADNTVRIISLDPTDCLSPRSM--QALPSAAESL 640
Query: 536 CAFE-----------------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPI 578
C E G YL L +G LL +L+ +G+L D + LG++P+
Sbjct: 641 CIVEMGTVETSSEDDGVTITTGCIYLNIGLTNGVLLRTVLDPVSGDLADTRTRYLGSRPV 700
Query: 579 TLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKE 638
L + + V A S R + Y + + ++ + + + F+S + +
Sbjct: 701 KLFRIQMQGSEAVLAMSSRSWLSYYYQNRFHLTPLSYETLEYASGFSSEQCSEGIVAIST 760
Query: 639 GELTIGTIDDIQKLHIR-SIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLL 697
L I ++ + + + + PL P+R +++ I + + EE++ + +
Sbjct: 761 NTLRILALEKLGAVFNQITFPLEYTPKRFAIHQETGKLIISETDHNAYTEETKTVRKKQM 820
Query: 698 DDQTFE----------------FIS-TYPLDTF--------------------------- 713
D+ E FI+ P D F
Sbjct: 821 ADEMREAAGEDEQELANEMADAFINEVLPEDVFSSPKAGTGMWASQIRVMDPINGHTYSK 880
Query: 714 ------EYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRIL-VFIVEDGKLQL--I 764
E S+ F+ D Y G A L + + G + V+ V+ QL +
Sbjct: 881 VQLAQNEAVLSLALVRFAVDQKWYVVAGVAKDLQINPKISGGGFIDVYKVDSQTHQLEHM 940
Query: 765 AEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHI--LALYVQ 822
E A +L F G+LLA I + +++Y D G ++L +C + HI + +Q
Sbjct: 941 HRTEIDDAPGALCPFQGRLLAGIGKVLRIY-----DLGKKKLLRKC-ENKHIPNQIVNIQ 994
Query: 823 TRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLF 882
G + V D+ +S+ + YK E + A D + W+++ +LD D + N+
Sbjct: 995 GMGQRVYVSDVQESVYCIKYKRAENQLIIFADDTHPRWITSASLLDYDTVATGDKFGNIA 1054
Query: 883 TVRKNSEGATD-----------EERGRL-------EVVGEYHLGEFVNRFRHGSLVMRLP 924
+R + D +RG L E + +HLGE V + +L+
Sbjct: 1055 ILRLPHSVSDDVDEDPTGNKALWDRGLLNGASQKAENICTFHLGEIVMSLQKATLI---- 1110
Query: 925 DSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFN 983
G ++I+ T++G +G + E Y F + L+ ++R + G +H +RS+
Sbjct: 1111 ---PGGSESLIYATMSGTVGALVPFTSREDYDFFQHLEMHMRNENPPLCGRDHLSYRSY- 1166
Query: 984 NEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
KN +DGDL E F L + I+ + + E+ K++E++
Sbjct: 1167 ----YYPVKNVMDGDLCEQFTSLDPAKQKSIASDLGRTPSEVAKKLEDI 1211
>gi|324501533|gb|ADY40680.1| Splicing factor 3B subunit 3 [Ascaris suum]
Length = 1214
Score = 206 bits (524), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 264/1183 (22%), Positives = 504/1183 (42%), Gaps = 189/1183 (15%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
++G I +L FR G ++D++ + ++ + +L+++ + R + + G R
Sbjct: 59 VFGIIRSLLAFRLTGGSKDYIVVGSDAGRIVILEYNPQKV-CFERVHQETFGKTGCRRIV 117
Query: 65 NGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YG 119
GQ +DP R LIG L V+ D L + + + + + G
Sbjct: 118 PGQHLAVDPKGRAILIGAVERQKLVYVMNRDAAANLTISSPLEAHKSHCICYSIVGVDVG 177
Query: 120 CAKPTIVVLYQD-------------NKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADL 166
PT L D NK + + YE+ L V +++ +D G ++
Sbjct: 178 FENPTFACLEVDYEEVDHDPSGHLANKIPQTLTFYELDLGLNHVVR-KYAEPLVDKG-NI 235
Query: 167 LIPVP---PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR-------- 215
LI VP GV++ E +V+ N IR I + +D D R
Sbjct: 236 LISVPGGQEGPSGVIVCCENYLVF--KNLGDQPDIRCPIPRRRNELD-DCDRTVMIVCTA 292
Query: 216 ---------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGS 266
+L+ G + + + E + VT LKI+ +A+ + L ++ S
Sbjct: 293 THKTKLMYFFLVQTDQGDIFKVTLESEHDIVTELKIKYFDTIPLANAMCILKTGFLFTAS 352
Query: 267 SYGDSQLIKL--------------NLQPDAKGSY---------VEVLERYVNLGPIVDFC 303
+GD L ++ +Q D ++ + V+++ +L P++
Sbjct: 353 EFGDHHLYQIAHLGDEDDEPEYSSRMQLDEGETFFFAPRGLLNLAVVDQIDSLCPLISCH 412
Query: 304 VVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTDDPFD 362
+ DL + Q+ G +L+++RNG+ + E A EL G +W+++ + DD +D
Sbjct: 413 IDDLANEDAPQLYALCGRSARSALKVLRNGLEVTEMAVSELPGNPNAVWTVKRNIDDKYD 472
Query: 363 TFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSS 422
+ +VVSF++ T +L++ + +EE GF T TL C + L+QV +R + +
Sbjct: 473 SHIVVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLGCGLIGDDALLQVYPDGIRHIRA 530
Query: 423 TSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKHAQLEYEI 480
R NEWK+P ++ N QV +A GG +VY E+ G L E + +L ++
Sbjct: 531 DRRV--NEWKAPGKRTIIKCALNRRQVAIALAGGEIVYFELDVTGQLNEYTERRELPADV 588
Query: 481 SCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPD---LNLITKEHLGGEIIPRSVLLCA 537
C+ ++ I E S+ VG+ D +VRI SL L ++ + L E P S+++
Sbjct: 589 LCMSLSEIPEGELRSRFLTVGL-ADKTVRIISLDPQDCLAPLSMQALPSE--PESIIVLE 645
Query: 538 F---EGIS----YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTH 590
EG S +L L +G LL ++ TGELTD + LGT+ + L ++
Sbjct: 646 MFGGEGQSASTVHLNIGLQNGCLLRTTVDQVTGELTDNRTRYLGTRSVNLFRVRIQSKDA 705
Query: 591 VFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQ 650
+ AAS R ++Y+ + + ++ + + F+S P+ + E L I +++ +
Sbjct: 706 IMAASSRAWLLYNYQSRFHLTPLSYTALEYAAGFSSEQCPEGIVAIAENTLRILSLEKLG 765
Query: 651 KLHIRSI-PLGEHPRR-ICH---------------------QEQSRTFAICSLKNQSCAE 687
+ +++ PL PRR I H QE+ + A ++ A+
Sbjct: 766 AVFNQAVYPLDYTPRRMIIHKASGNLIVIETDHAAFTAKGKQERRKQLANELMEVAKEAD 825
Query: 688 ESEMHFV-------------------------------RLLDDQTFEFISTYPLDTFEYG 716
E++ V R++ E +S +P E
Sbjct: 826 EADEQAVKEMAEAILTEKVNEREMGAPKNQKGKWASTVRIMRSSDGETLSLFPFAEDEAA 885
Query: 717 CSILSCSFSDDSNVYYC-VGTAYVLPEENEPTK---GRILVFIVED--GKLQLIAEKETK 770
S+ F + ++ + VG L + +P K G I F++ + KL+ + T
Sbjct: 886 FSLAMVQFQNQNDAQFILVGCGLGL--QLKPRKSHGGSIYTFLLANNGSKLEFLHRTPTD 943
Query: 771 GAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHG-HILALYVQTRGDFIV 829
V +++ F G L + +KI+LY D G ++L ++C + + + +++ G IV
Sbjct: 944 EVVNAVHDFRGMALVGVGKKIRLY-----DLGKKKLLAKCENKQLPVQVVDIRSMGQRIV 998
Query: 830 VGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSE 889
V D +S+ + YK ++ + D + +++ + ILD D + + F V + +
Sbjct: 999 VSDSQESLHFMRYKKQDNQLSIFCDDTSPRFVTCICILDYDT-VAVGDRFGSIAVLRLPK 1057
Query: 890 GATDE------------ERGRL-------EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQ 930
G T+E +RG L E +G++++G+ V + SLV G
Sbjct: 1058 GVTEEVQEDPTGVRALWDRGNLNGASQKVEHIGQFYVGDTVTSMQKTSLV-------PGA 1110
Query: 931 IPTVIFGTVNGVIGVIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTV 989
+++ T++G IG++ + +++ F + L+ +LR + G +H +RSF K +
Sbjct: 1111 NDCLVYTTISGTIGMLVPFVSRDEFDFFQNLEMHLRVEFPPLCGRDHLAFRSFYFPVKCI 1170
Query: 990 DAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
+DGDL E + + + +++ + E+ K++E++
Sbjct: 1171 -----IDGDLCEQYALMPLDKQKAVAEELGRKPAEIHKKLEDI 1208
>gi|116207186|ref|XP_001229402.1| conserved hypothetical protein [Chaetomium globosum CBS 148.51]
gi|88183483|gb|EAQ90951.1| conserved hypothetical protein [Chaetomium globosum CBS 148.51]
Length = 1211
Score = 206 bits (523), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 273/1168 (23%), Positives = 484/1168 (41%), Gaps = 167/1168 (14%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
I+G I ++ FR G +D++ +AT+ + ++++ +++ + R
Sbjct: 64 IFGIIRSIASFRLAGSHKDYIILATDSGRIAIIEYQPKTNRFSRIHLETFGKSGVRRVVP 123
Query: 66 GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YGC 120
GQ DP R LI + L V+ + + +L + + + VL + + G
Sbjct: 124 GQYLAADPKGRACLIAGVEKNKLVYVLNRNAQAELTISSPLEAHKAGVLVLSLVALDVGY 183
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDK------------DFVEGPWSQNNLDNGADLLI 168
A P L D +A + E + + + V WS + +D + LL
Sbjct: 184 ANPVFAALEIDYSEADQDPSGEAGKEPEAQLVYYELDLGLNHVVRKWS-DAVDPTSSLLF 242
Query: 169 PVP-----PPLCGVLIIGEETIVYCSAN--AFKA-IPIRPSITK--AYGRVDADGSRYLL 218
VP P GVL+ GEE + Y +N AF+ IP R T+ R G + L
Sbjct: 243 QVPGGNDGP--SGVLVCGEENVTYRHSNQEAFRVPIPRRRGATEDPQRKRTIVAGVMHKL 300
Query: 219 GDHAGLLHLLVITHEKE------------------KVTGLKIELLGETSIASTISYLDNA 260
AG L+ T + + +V LK++ IA ++ L +
Sbjct: 301 KGSAGAFFFLLQTDDGDLMKVTLDMVEDNDGNPTGEVRRLKVKYFDTIPIAQSLCILKSG 360
Query: 261 VVYIGSSYGDSQLIKL---------------NLQPDAKGSYVEV------------LERY 293
++ +G+ L + + D++ SY V +E
Sbjct: 361 FLFSAGEFGNHHLYQFEKLGDDDEELEFSSDDFPTDSRASYNPVYFHPRPLENLVLVESM 420
Query: 294 VNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKG-MWS 352
++ P++D + +L + Q+ T G + R++++G+ ++E + EL G +W+
Sbjct: 421 DSMNPLIDCKIANLTGEDAPQIYTVCGNRARSTFRMLKHGLEVSEIVASELPGTPAAVWT 480
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
+ + D +D ++V+SF + T +L++ + +EE GF + TL L+QV
Sbjct: 481 TKLTKYDEYDGYIVLSFTNATLVLSIG--ETVEEVTESGFLTSVPTLAVQQLGEEGLIQV 538
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTEV 471
+R + NEW +P S+ A N +QV++A G +VY E+ DG L E
Sbjct: 539 HPKGIRHIVQGRV---NEWPAPQHRSIVAAATNENQVVIALSSGEIVYFEMDADGSLAEY 595
Query: 472 -KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEII 529
+ ++ ++ L + + E S AVG D +VRI SL P+ L K
Sbjct: 596 DEKKEMSGTVTSLSLGRVPEGLRRSSFLAVGC-DDCTVRILSLDPESTLEMKSIQALTSA 654
Query: 530 PRSVLLCAFE----GIS-YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFS 584
P S+L+ A E G++ YL L G L +L+ TGELTD ++ LG +P L S
Sbjct: 655 PSSLLVMAMEDSTGGMTLYLHIGLHSGVYLRTVLDEVTGELTDTRQKFLGPKPTKLFQVS 714
Query: 585 SKNTTHVFAASDRPTVIYSS--NKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELT 642
+N V A S RP + Y+ K + + E+ F+S + + L
Sbjct: 715 VQNQPCVLALSSRPWLGYTDPITKNFSMTPLIYSELEFGWNFSSEQCLEGMVGIHANFLR 774
Query: 643 IGTIDDIQKLHI-RSIPLGEHPRRIC-HQEQSRTFAICSLKN------------QSCAEE 688
I TI+ + I +SIPL P+R+ H EQ + I S N QS A
Sbjct: 775 IFTIERLGDTMIQKSIPLTYTPKRLVKHPEQPYFYTIESDNNTLPPELRAQLLEQSGAVN 834
Query: 689 SEMHFVRLLD-----------------DQTFEFISTYPLDTFEYGCSILSCSF----SDD 727
+ + D D E S FE + +S + S +
Sbjct: 835 GDAAILPPEDFGYPRATGRWASCISVVDPLGEEPSVLQRIDFEGNEAAVSAAVVPFSSQE 894
Query: 728 SNVYYCVGTAY-VLPEENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVYSLNAFNGKLL 784
+ VGT ++ + ++G I V+ EDG+ L+ I + + + +L F G+LL
Sbjct: 895 GESFLIVGTGKDMVLNPRKFSEGYIHVYRFHEDGRDLEFIHKTKVEEPPMALIPFQGRLL 954
Query: 785 AAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKH 844
A I + +++Y LR + L+ G L + +QT+G I+VGD+ + I+ ++YK
Sbjct: 955 AGIGKTLRVYDLGLR----QLLRKAQGEVAPQLIVSLQTQGSRIIVGDVQQGITYVVYKP 1010
Query: 845 EEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEE--------- 895
E + A D W + ++D + G + N++ +R SE A+ E
Sbjct: 1011 ESNKLLPFADDTINRWTTCTTMVDYESVAGGDKFGNIWILR-CSERASQESDEPGSEIQL 1069
Query: 896 ----------RGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGV 945
+ RL + ++ + +LV VG +++ + G +GV
Sbjct: 1070 LHARNYLHGAQSRLSAMAHFYTQDLPTSIVKTNLV-------VGGQDVLVWSGIQGTVGV 1122
Query: 946 -IASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFL 1004
I + E F + L++++R + G +H +R + V K +DGDL E F
Sbjct: 1123 LIPFVSREDVDFFQNLESHMRAEDPPLAGRDHLIYRGY-----YVPVKGVIDGDLCERFS 1177
Query: 1005 DLSRTRMDEISKTMNVSVEELCKRVEEL 1032
L + I+ ++ SV E+ +++ ++
Sbjct: 1178 LLPNDKKQMIAGELDRSVREIERKISDI 1205
>gi|389638952|ref|XP_003717109.1| pre-mRNA-splicing factor RSE1 [Magnaporthe oryzae 70-15]
gi|148887431|sp|Q52E49.2|RSE1_MAGO7 RecName: Full=Pre-mRNA-splicing factor RSE1
gi|351642928|gb|EHA50790.1| pre-mRNA-splicing factor RSE1 [Magnaporthe oryzae 70-15]
Length = 1216
Score = 206 bits (523), Expect = 7e-50, Method: Compositional matrix adjust.
Identities = 287/1167 (24%), Positives = 487/1167 (41%), Gaps = 160/1167 (13%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
++G I L FR G ++D+L IA++ + ++++ + +R + + G R
Sbjct: 64 VFGIIRDLASFRLAGSSKDYLIIASDSGRITIVEY-LPAQNRFSRIHLETFGKSGVRRVV 122
Query: 65 NGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YG 119
GQ DP R LI L V+ +++ +L + + + L + + G
Sbjct: 123 PGQYLAADPKGRACLIASVERCKLVYVLNRNSQAELTISSPLEAHKNGTLTLSLVALDVG 182
Query: 120 CAKPTIVVLY-------QDNKD--ARHVKT----YEVALKDKDFVEGPWSQNNLDNGADL 166
+ P L QD K A++V+T YE+ L V W + +D+ A +
Sbjct: 183 YSNPVFAALEVDYSEVDQDPKGDAAQNVETVLNYYELDLGLNHVVRK-WF-DVVDSTASM 240
Query: 167 LIPVP-----PPLCGVLIIGEETIVYCSAN--AFKA-IPIRPSITK--AYGRVDADGSRY 216
L VP P GVL+ GEE I Y +N AF+ IP R T+ + R G +
Sbjct: 241 LFQVPGGSDGP--SGVLVCGEENITYRHSNQDAFRVPIPRRKGATEDPSRKRNIVSGVMH 298
Query: 217 LLGDHAGLLHLLVITHEKE------------------KVTGLKIELLGETSIASTISYLD 258
L AG L+ T + + +V LKI+ +++ + L
Sbjct: 299 KLKGSAGAFFFLLQTEDGDLFKVTIDMVEDAEGNPTGEVRRLKIKYFDTIPVSNNLCILK 358
Query: 259 NAVVYIGSSYGDS---QLIKL------------NLQPDAKGSY------------VEVLE 291
+ +++ S +G+ Q KL + D K Y + ++E
Sbjct: 359 SGFLFVASEFGNHLFYQFEKLGDDDEELEFFSSDFPVDPKEPYEPVYFYPRPTENLALVE 418
Query: 292 RYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGM 350
++ P++D V +L + Q+ T SG + R++++G+ +NE + +L G +
Sbjct: 419 SIDSMNPLMDLKVANLTEEDAPQIYTVSGKGARSTFRMLKHGLEVNEIVASQLPGTPSAV 478
Query: 351 WSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLV 410
W+ + DD +D ++V+SF + T +L++ + +EE GF S TL + LV
Sbjct: 479 WTTKLRRDDEYDAYIVLSFTNGTLVLSIG--ETVEEVSDTGFLSSVPTLAVQQLGDDGLV 536
Query: 411 QVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILT 469
QV +R + + + NEW SP S+ A N QV +A G +VY E+ DG L
Sbjct: 537 QVHPKGIRHIRNG---VVNEWSSPQHRSIVAAATNERQVAVALSSGEIVYFEMDTDGSLA 593
Query: 470 EVKHAQLEY-EISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGE 527
E + + ++ L + + E S AVG D +VRI SL P+ L +K
Sbjct: 594 EYDEKKEMFGTVTSLSLGEVPEGRLRSSYLAVGC-DDCTVRILSLDPESTLESKSVQALT 652
Query: 528 IIPRSVLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLR 581
P ++ + + E S YL L G L +L+ TGELTD ++ LG + + L
Sbjct: 653 AAPSALSIMSMEDSSSGGTTLYLHIGLNSGVYLRTVLDEVTGELTDTRQKFLGPKAVRLF 712
Query: 582 TFSSKNTTHVFAASDRPTVIYSS--NKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEG 639
S + T V A S R + +S K + +N +E+ F S + +
Sbjct: 713 QVSVQKRTCVLALSSRSWLGFSDPVTKGFTMTPLNYEELEWGWNFVSEQCEEGMVGVNGQ 772
Query: 640 ELTIGTIDDIQKLHI-RSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFV---- 694
L I I+ + I +SIPL PR++ R F N + A E +
Sbjct: 773 FLRIFAIEKLGDNVIQKSIPLTYTPRKLAKHPTQRIFYTIEADNNTLAPELREQLMAAPT 832
Query: 695 ------RLLDDQTFEF---------------------------ISTYPLDTFEYGCSILS 721
R+L F + + LD E S+
Sbjct: 833 AVNGDARVLPPDEFGYPRGNGRWASCISVVDPLGDGEELEPGVVQRIDLDNNEAALSMAV 892
Query: 722 CSF-SDDSNVYYCVGTAY-VLPEENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVYSLN 777
SF S D + VGT ++ T+G I V+ EDG+ L+ I + + + +L
Sbjct: 893 VSFASQDGESFLVVGTGKDMVVNPRRFTEGYIHVYRFSEDGRELEFIHKTKVEEPPTALL 952
Query: 778 AFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSI 837
F G+L+A I + +++Y LR R+ Q+E L + + T+G I+VGD+ +
Sbjct: 953 PFQGRLVAGIGRMLRIYDLGLR-QLLRKAQAEVAPQ---LIVSLNTQGSRIIVGDVQHGL 1008
Query: 838 SLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR--KNSEGATDEE 895
+ YK E + A D A W + ++D D GA+ NL+ +R + + +DE
Sbjct: 1009 IYVAYKSETNRLIPFADDTIARWTTCTTMVDYDSTAGADKFGNLWILRCPEKASQESDEP 1068
Query: 896 RGRLEVV-GEYHLGEFVNRFRHGSLV--MRLPDS------DVGQIPTVIFGTVNGVIGV- 945
+ +V +L NR + V +P S VG +++G G IGV
Sbjct: 1069 GSEVHLVHSRDYLHGTSNRLALMAHVYTQDIPTSICKTNLVVGGQEVLLWGGFQGTIGVL 1128
Query: 946 IASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLD 1005
I + E F + L+ +LR + G +H +R V K +DGDL E +
Sbjct: 1129 IPFVSREDADFFQSLEQHLRSEDPPLAGRDHLMYRGC-----YVPVKGVIDGDLCERYTM 1183
Query: 1006 LSRTRMDEISKTMNVSVEELCKRVEEL 1032
L + I+ ++ SV E+ +++ ++
Sbjct: 1184 LPNDKKQMIAGELDRSVREIERKISDI 1210
>gi|67538564|ref|XP_663056.1| hypothetical protein AN5452.2 [Aspergillus nidulans FGSC A4]
gi|40743422|gb|EAA62612.1| hypothetical protein AN5452.2 [Aspergillus nidulans FGSC A4]
Length = 1226
Score = 206 bits (523), Expect = 7e-50, Method: Compositional matrix adjust.
Identities = 281/1162 (24%), Positives = 475/1162 (40%), Gaps = 163/1162 (14%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
++G I TL FR G +D++ I ++ + ++++ S R + + G R
Sbjct: 64 VFGIIRTLAAFRLAGSNKDYIIIGSDSGRITIIEY-VPSQNRFNRIHLETFGKSGVRRVV 122
Query: 65 NGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLD---IKFLYG 119
GQ +DP R LI + L V+ +++ +L + + + Q L + G
Sbjct: 123 PGQYLAVDPKGRACLIASVEKNKLVYVLNRNSQAELTISSPLEAHKPQTLVYSVVALDAG 182
Query: 120 CAKPTIVVLYQDNKDARHVKT-------------YEVALKDKDFVEGPWSQNNLDNGADL 166
P L D ++ T YE+ L V W+ + +D + +
Sbjct: 183 YENPVFAALEVDYSESDQDPTGRAYEEVEKLLVYYELDLGLNHVVRK-WT-DPVDRTSSM 240
Query: 167 LIPVPPPL---CGVLIIGEETIVYCSAN--AFKA-IPIRPS----------ITKAYGRVD 210
L VP GVL+ E+ I Y +N AF+ IP R IT
Sbjct: 241 LFQVPGGADGPSGVLVCAEDNITYRHSNQDAFRVPIPRRKGAMENPERKRCITAGVMHKM 300
Query: 211 ADGSRYLLGDHAG-LLHLLVITHEKEK------VTGLKIELLGETSIASTISYLDNAVVY 263
+LL G L L + E +K V GLKI+ +AS++ L + +Y
Sbjct: 301 RGAFFFLLQTEDGDLFKLTLDMVEDDKGQLTGEVKGLKIKYFDTVPLASSLLILKSGFLY 360
Query: 264 IGSSYGDSQLIKL-----------------NLQPDAKGSYV----------EVLERYVNL 296
+ + G+ + + P A + V ++E +L
Sbjct: 361 VAAEGGNHHFYQFEKLGDDDEETEFNSDDFSADPAAPCTPVYFQPRGAENLNLVEAINSL 420
Query: 297 GPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSLRS 355
P+VD VV++ Q+ T SG + R +++G+ ++E EL + +W+ +
Sbjct: 421 NPLVDSKVVNISEDDAPQIFTVSGTGARSTFRTLKHGLEVSEIVESELPSVPSAVWTTKL 480
Query: 356 STDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSG 415
+ D FD ++V+SF + T +L++ + +EE GF S TL + L+Q+
Sbjct: 481 TRADEFDAYIVLSFANGTLVLSIG--ETVEEVTDTGFLSSAPTLAVQQLGEDSLIQIHPR 538
Query: 416 SVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTEV-KH 473
+R + + R NEW +P S+ A N QV +A G +VY E+ DG L E +
Sbjct: 539 GIRHILADRR--VNEWPAPQHRSIVAAATNERQVAVALSSGEIVYFELDADGSLAEYDER 596
Query: 474 AQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPRS 532
Q+ ++CL + + E S AVG D +VRI SL PD L K P +
Sbjct: 597 RQMSGTVTCLSLGEVPEGRVRSSFLAVGC-DDSTVRILSLDPDTTLENKSVQALTAAPSA 655
Query: 533 VLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSK 586
+ + A S YL L G L L+ TGEL+D + LG++ + L S
Sbjct: 656 LNIIAMADSSSGGTTLYLHIGLHSGVYLRTALDEVTGELSDTRTRFLGSKAVKLFQVSVT 715
Query: 587 NTTHVFAASDRPTVIYS--SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIG 644
T V A S RP + YS K + + ++ + F+S + + + L I
Sbjct: 716 GQTAVLALSSRPWLGYSDTQTKGFMLTPLDYVGLEWGWNFSSEQCVEGMVGIQGQNLRIF 775
Query: 645 TIDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFE 703
+I+ + + +SIPL PR + F + N + + RLL+D
Sbjct: 776 SIEKLDNNMLQQSIPLAYTPRHFIKHPEEPLFYVIEADNNVLSPATR---ARLLEDSKAR 832
Query: 704 FIST---------YP--------------------------LDTFEYGCSILSCSF-SDD 727
T YP L+ E SI + F S D
Sbjct: 833 GGDTTVLPPEDFGYPRGTGHWASCIQIIDPLDAKAVVGAVELEENEAAVSIAAVPFTSQD 892
Query: 728 SNVYYCVGTAYVLPEENEPTK--GRILVF-IVEDGK-LQLIAEKETKGAVYSLNAFNGKL 783
+ VGTA + N P+ G I ++ EDGK L+ I + + + +L F G+L
Sbjct: 893 DETFLVVGTAKDM-TVNPPSSAGGYIHIYRFQEDGKELEFIHKTKVEEPPLALLGFQGRL 951
Query: 784 LAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY-VQTRGDFIVVGDLMKSISLLIY 842
LA + +++Y D G ++L +C A+ +QT+G IVV D+ +S++ ++Y
Sbjct: 952 LAGVGSVLRIY-----DLGMKQLLRKCQAAVAPKAIVGLQTQGSRIVVSDVRESVTYVVY 1006
Query: 843 KHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDEERGRL 899
K+++ + D A W +A ++D + G + NL+ VR K SE A +E G
Sbjct: 1007 KYQDNVLIPFVDDSIARWTTAATMVDYETTAGGDKFGNLWLVRCPKKASEEADEEGSGAH 1066
Query: 900 EVVGEYHLGEFVNRFRHGSLVMRLPDSDV-----------GQIPTVIFGTVNGVIGVIAS 948
+ +L NR L++ + D+ G +++ G IG++
Sbjct: 1067 LIHDRGYLQGTPNRLE---LMIHVFTQDIPTSLHKTQLVAGGRDILVWTGFQGTIGILVP 1123
Query: 949 -LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLS 1007
+ E F + L+ L + G +H +RS+ K V +DGDL E + LS
Sbjct: 1124 FVSREDVDFFQSLEMQLASQCPPLAGRDHLIYRSYYAPVKGV-----IDGDLCEQYFLLS 1178
Query: 1008 RTRMDEISKTMNVSVEELCKRV 1029
I+ ++ SV E+ +++
Sbjct: 1179 NDTKMMIAAELDRSVREIERKI 1200
>gi|121700262|ref|XP_001268396.1| nuclear mRNA splicing factor, putative [Aspergillus clavatus NRRL 1]
gi|119396538|gb|EAW06970.1| nuclear mRNA splicing factor, putative [Aspergillus clavatus NRRL 1]
Length = 1209
Score = 205 bits (522), Expect = 9e-50, Method: Compositional matrix adjust.
Identities = 274/1170 (23%), Positives = 482/1170 (41%), Gaps = 173/1170 (14%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
++G I TL FR G +D++ I ++ + ++++ S R + + G R
Sbjct: 64 VFGIIRTLAAFRLAGSNKDYIIIGSDSGRITIIEY-VPSQNRFNRIHLETFGKSGVRRVV 122
Query: 65 NGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YG 119
GQ +DP R LI + L V+ +++ +L + + + Q + + G
Sbjct: 123 PGQYLAVDPKGRACLIASVEKNKLVYVLNRNSQAELTISSPLEAHKPQTVVFAMIALDVG 182
Query: 120 CAKPTIVVLYQDNKDARHVKT-------------YEVALKDKDFVEGPWSQNNLDNGADL 166
P L D ++ T YE+ L V W+ + +D +++
Sbjct: 183 YENPIFAALEVDYSESDQDPTGRAYEESEKVLVYYELDLGLNHVVRK-WA-DPVDRTSNM 240
Query: 167 LIPVP-----PPLCGVLIIGEETIVYCSAN--AFKA-IPIRPSITKAYGRV--------- 209
L VP P GVL+ E+++ Y +N AFK IP R T+ R
Sbjct: 241 LFQVPGGGDGP--SGVLVCAEDSVTYRHSNQDAFKVPIPRRSGATENPERKRTIVAGVMH 298
Query: 210 DADGSRYLL--GDHAGLLHLLV--ITHEKEKVTG----LKIELLGETSIASTISYLDNAV 261
G+ + L D L + + + + ++TG LKI+ +AS + L +
Sbjct: 299 KMRGAFFFLLQTDDGDLFKVTIDMVEDDNGQLTGEVKRLKIKYFDTVPVASNLLILKSGF 358
Query: 262 VYIGSSYGDSQLIKLNLQPD-------AKGSY--------------------VEVLERYV 294
+Y+ S G+ + D + SY + ++E
Sbjct: 359 LYVASEGGNHHFYQFEKLGDDDEEIEFSSESYSADPSVPCDPVFFRPRGAENLNLVESLN 418
Query: 295 NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSL 353
+L P++D + +L Q+ SG + R +++G+ ++E EL + +W+
Sbjct: 419 SLNPLIDSKIANLNEDDAPQIYAVSGTGARSTFRTLKHGLEVSEIVESELPSVPSAVWTT 478
Query: 354 RSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVT 413
+ + D FD ++++SF + T +L++ + +EE GF S TL + L+QV
Sbjct: 479 KLTRGDEFDAYIILSFANGTLVLSIG--ETVEEVTDTGFLSTAPTLAVQQLGEDSLIQVH 536
Query: 414 SGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTEV- 471
+R + + R NEW +P S+ A N QV +A G +VY E+ DG L E
Sbjct: 537 PRGIRHILADRR--VNEWPAPQHRSIVAAATNERQVAVALSSGEIVYFEMDADGSLAEYD 594
Query: 472 KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIP 530
+ Q+ ++CL + + E S AVG D +VRI SL PD L K P
Sbjct: 595 ERRQMSGTVTCLSLGEVPEGRVRSSFLAVGC-DDSTVRILSLDPDSTLENKSVQALTSAP 653
Query: 531 RSVLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFS 584
++ + + S YL L G L +L+ TGEL+D + LG +P+ L S
Sbjct: 654 SALNIMSMADSSSGGTTLYLHIGLYSGVYLRTVLDEVTGELSDTRTRFLGAKPVKLFRVS 713
Query: 585 SKNTTHVFAASDRPTVIYS--SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELT 642
+ T V A S RP + Y K + + ++ + F+S + + + L
Sbjct: 714 VRGQTAVLALSSRPWLGYCDIQTKGFMLTPLDYVGLEWGWNFSSEQCVEGMVGIQGQNLR 773
Query: 643 IGTIDDIQKLHIR-SIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVR------ 695
I +I+ + ++ SIPL PRR+ + F + N + + +
Sbjct: 774 IFSIEKLDNNMLQESIPLSYTPRRLLKHPEQPLFYVIGSDNNVLSPATRARLIEDSKARN 833
Query: 696 ----LLDDQTFEF----------------------ISTYPLDTFEYGCSILSCSFSD-DS 728
L + F + IST L+ E S+ + FS D
Sbjct: 834 GEADTLPPEEFGYPRATGHWASCIQVVDPVNAKAVISTIELEENEAAVSMAAVPFSSQDD 893
Query: 729 NVYYCVGTAYVLPEENEPTKGRILVFIV---EDGK-LQLIAEKETKGAVYSLNAFNGKLL 784
+ VGTA L N P+ + I EDGK L+ I + + + +L AF G+L+
Sbjct: 894 ETFLVVGTAKDL-TVNPPSSAGGFIHIYRFQEDGKELEFIHKTKVEEPPLALLAFQGRLV 952
Query: 785 AAINQKIQLYKWMLRDDGTRELQSECGH---HGHILALYVQTRGDFIVVGDLMKSISLLI 841
A I +++Y D G ++L +C I+ L QT+G I+V D+ +S++ ++
Sbjct: 953 AGIGSILRIY-----DLGMKQLLRKCQAPVVPKTIVGL--QTQGSRIIVSDVRESVTYVV 1005
Query: 842 YKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDE---- 894
YK++E + D + WM++ ++D + G + NL+ VR K SE A ++
Sbjct: 1006 YKYQENVLIPFVDDTVSRWMTSTTMVDYETVAGGDKFGNLWLVRCPKKISEEADEDGSGA 1065
Query: 895 ----ERG-------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVI 943
ERG RLE++ + + LV G +++ G I
Sbjct: 1066 HLIHERGYLHGTPNRLELMIHTYTQDIPTSVHKTQLV-------AGGRDILVWTGFQGTI 1118
Query: 944 GVIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIES 1002
G++ + E F + L+ L + G +H +RS+ K V +DGDL E
Sbjct: 1119 GMLVPFMSREDVDFFQNLEMQLASQCPPLAGRDHLIYRSYYAPVKGV-----IDGDLCEM 1173
Query: 1003 FLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
+ L I+ ++ SV E+ +++ ++
Sbjct: 1174 YFLLPNDTKMMIAAELDRSVREIERKISDM 1203
>gi|440478305|gb|ELQ59147.1| pre-mRNA-splicing factor rse-1 [Magnaporthe oryzae P131]
Length = 1223
Score = 205 bits (522), Expect = 9e-50, Method: Compositional matrix adjust.
Identities = 287/1164 (24%), Positives = 485/1164 (41%), Gaps = 160/1164 (13%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
++G I L FR G ++D+L IA++ + ++++ + +R + + G R
Sbjct: 64 VFGIIRDLASFRLAGSSKDYLIIASDSGRITIVEY-LPAQNRFSRIHLETFGKSGVRRVV 122
Query: 65 NGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YG 119
GQ DP R LI L V+ +++ +L + + + L + + G
Sbjct: 123 PGQYLAADPKGRACLIASVERCKLVYVLNRNSQAELTISSPLEAHKNGTLTLSLVALDVG 182
Query: 120 CAKPTIVVLY-------QDNKD--ARHVKT----YEVALKDKDFVEGPWSQNNLDNGADL 166
+ P L QD K A++V+T YE+ L V W + +D+ A +
Sbjct: 183 YSNPVFAALEVDYSEVDQDPKGDAAQNVETVLNYYELDLGLNHVVRK-WF-DVVDSTASM 240
Query: 167 LIPVP-----PPLCGVLIIGEETIVYCSAN--AFKA-IPIRPSITK--AYGRVDADGSRY 216
L VP P GVL+ GEE I Y +N AF+ IP R T+ + R G +
Sbjct: 241 LFQVPGGSDGP--SGVLVCGEENITYRHSNQDAFRVPIPRRKGATEDPSRKRNIVSGVMH 298
Query: 217 LLGDHAGLLHLLVITHEKE------------------KVTGLKIELLGETSIASTISYLD 258
L AG L+ T + + +V LKI+ +++ + L
Sbjct: 299 KLKGSAGAFFFLLQTEDGDLFKVTIDMVEDAEGNPTGEVRRLKIKYFDTIPVSNNLCILK 358
Query: 259 NAVVYIGSSYGDS---QLIKL------------NLQPDAKGSY------------VEVLE 291
+ +++ S +G+ Q KL + D K Y + ++E
Sbjct: 359 SGFLFVASEFGNHLFYQFEKLGDDDEELEFFSSDFPVDPKEPYEPVYFYPRPTENLALVE 418
Query: 292 RYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGM 350
++ P++D V +L + Q+ T SG + R++++G+ +NE + +L G +
Sbjct: 419 SIDSMNPLMDLKVANLTEEDAPQIYTVSGKGARSTFRMLKHGLEVNEIVASQLPGTPSAV 478
Query: 351 WSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLV 410
W+ + DD +D ++V+SF + T +L++ + +EE GF S TL + LV
Sbjct: 479 WTTKLRRDDEYDAYIVLSFTNGTLVLSIG--ETVEEVSDTGFLSSVPTLAVQQLGDDGLV 536
Query: 411 QVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILT 469
QV +R + + + NEW SP S+ A N QV +A G +VY E+ DG L
Sbjct: 537 QVHPKGIRHIRNG---VVNEWSSPQHRSIVAAATNERQVAVALSSGEIVYFEMDTDGSLA 593
Query: 470 EVKHAQLEY-EISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGE 527
E + + ++ L + + E S AVG D +VRI SL P+ L +K
Sbjct: 594 EYDEKKEMFGTVTSLSLGEVPEGRLRSSYLAVGC-DDCTVRILSLDPESTLESKSVQALT 652
Query: 528 IIPRSVLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLR 581
P ++ + + E S YL L G L +L+ TGELTD ++ LG + + L
Sbjct: 653 AAPSALSIMSMEDSSSGGTTLYLHIGLNSGVYLRTVLDEVTGELTDTRQKFLGPKAVRLF 712
Query: 582 TFSSKNTTHVFAASDRPTVIYSS--NKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEG 639
S + T V A S R + +S K + +N +E+ F S + +
Sbjct: 713 QVSVQKRTCVLALSSRSWLGFSDPVTKGFTMTPLNYEELEWGWNFVSEQCEEGMVGVNGQ 772
Query: 640 ELTIGTIDDIQKLHI-RSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFV---- 694
L I I+ + I +SIPL PR++ R F N + A E +
Sbjct: 773 FLRIFAIEKLGDNVIQKSIPLTYTPRKLAKHPTQRIFYTIEADNNTLAPELREQLMAAPT 832
Query: 695 ------RLLDDQTFEF---------------------------ISTYPLDTFEYGCSILS 721
R+L F + + LD E S+
Sbjct: 833 AVNGDARVLPPDEFGYPRGNGRWASCISVVDPLGDGEELEPGVVQRIDLDNNEAALSMAV 892
Query: 722 CSF-SDDSNVYYCVGTAY-VLPEENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVYSLN 777
SF S D + VGT ++ T+G I V+ EDG+ L+ I + + + +L
Sbjct: 893 VSFASQDGESFLVVGTGKDMVVNPRRFTEGYIHVYRFSEDGRELEFIHKTKVEEPPTALL 952
Query: 778 AFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSI 837
F G+L+A I + +++Y LR R+ Q+E L + + T+G I+VGD+ +
Sbjct: 953 PFQGRLVAGIGRMLRIYDLGLR-QLLRKAQAEVAPQ---LIVSLNTQGSRIIVGDVQHGL 1008
Query: 838 SLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR--KNSEGATDEE 895
+ YK E + A D A W + ++D D GA+ NL+ +R + + +DE
Sbjct: 1009 IYVAYKSETNRLIPFADDTIARWTTCTTMVDYDSTAGADKFGNLWILRCPEKASQESDEP 1068
Query: 896 RGRLEVV-GEYHLGEFVNRFRHGSLV--MRLPDS------DVGQIPTVIFGTVNGVIGV- 945
+ +V +L NR + V +P S VG +++G G IGV
Sbjct: 1069 GSEVHLVHSRDYLHGTSNRLALMAHVYTQDIPTSICKTNLVVGGQEVLLWGGFQGTIGVL 1128
Query: 946 IASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLD 1005
I + E F + L+ +LR + G +H +R V K +DGDL E +
Sbjct: 1129 IPFVSREDADFFQSLEQHLRSEDPPLAGRDHLMYRGC-----YVPVKGVIDGDLCERYTM 1183
Query: 1006 LSRTRMDEISKTMNVSVEELCKRV 1029
L + I+ ++ SV E+ +++
Sbjct: 1184 LPNDKKQMIAGELDRSVREIERKI 1207
>gi|195996829|ref|XP_002108283.1| hypothetical protein TRIADDRAFT_49802 [Trichoplax adhaerens]
gi|190589059|gb|EDV29081.1| hypothetical protein TRIADDRAFT_49802 [Trichoplax adhaerens]
Length = 1208
Score = 205 bits (522), Expect = 9e-50, Method: Compositional matrix adjust.
Identities = 278/1182 (23%), Positives = 495/1182 (41%), Gaps = 192/1182 (16%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
++G I L FR G ++D++ I ++ + +L++ S + + + G R
Sbjct: 58 VFGEIRCLAPFRLTGGSKDYVVIGSDSGRITILEY-IPSKNTFEKIHQETFGKSGCRRIV 116
Query: 65 NGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLK-----EAFNIRLEELQVLDIKFL 117
GQ +DP R +IG L ++ D +L EA V+ I
Sbjct: 117 PGQYLAVDPKGRAVMIGAMEKQKLVYILNRDTSARLTISSPLEAHKSHTVVFSVVGIDV- 175
Query: 118 YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNGADL 166
G P L D ++A + T E AL + + E N+ L + +++
Sbjct: 176 -GFENPVFACLEVDYEEADNDPTGEAALTTRQMLTYYELDLGLNHVVRKFTETLQDFSNM 234
Query: 167 LIPVP---PPLCGVLIIGEETIVYCSANAFKAIPIR-PSITKAYGRVDAD---------G 213
LIPVP G+L+ E I Y N IR P + Y D D G
Sbjct: 235 LIPVPGGNDGPSGILVCSENFITY--KNFGDQPDIRMPIPRRRYDLSDPDRGILFVCYAG 292
Query: 214 SR------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSS 267
+ + + + G + L +T E + V+ +K++ +AS++ L ++ S
Sbjct: 293 HKTKSLFFFFIQNEQGDIFKLTLTVEDDMVSSIKLKYFDTIPVASSMIVLKTGFLFASSE 352
Query: 268 YGDSQLIKL--------------------------NLQPDAKGSYVEVLERYVNLGPIVD 301
+G+ L ++ +L+P + VEV E +L PI
Sbjct: 353 FGNHHLYQIAHLGDNDEETEFSSTMLLDEGETFYYSLRP--LKNLVEVDE-VDSLCPITG 409
Query: 302 FCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTDDP 360
V DL + Q+ G + +LRI+R+G+ + E A EL G G+W++++S
Sbjct: 410 CQVADLANEDTPQLYVSCGRGPNSTLRILRHGLEVTEMAVSELPGNPNGVWTVKTSASAE 469
Query: 361 FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLV 420
+D ++VVSF++ T +L++ + +EE GF T TL C + L+QV + +R V
Sbjct: 470 YDAYIVVSFVNATLVLSIG--ESVEEVSDSGFLGTTPTLHCCQIGDDALLQVYANGIRHV 527
Query: 421 SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKHAQLEY 478
+ R NEWK+P ++ N QV +A GG LVY E+ G L E + +
Sbjct: 528 RADKR--VNEWKAPGKKIISKCAVNNRQVAIALTGGELVYFEMDLSGQLNEYTERREFSS 585
Query: 479 EISCLDIN--PIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPRSVLL 535
E+ C+ I P+GE + AVG+ D +VR+ SL P L +P S+ +
Sbjct: 586 EVICMSIGSVPVGEK--RCRFLAVGL-ADHTVRMISLDPSDCLQPMSMQALPTVPESLCI 642
Query: 536 CAF---------EGI---SYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTF 583
A +G+ YL L +G LL +L+ TG+++D + LG++P+ L
Sbjct: 643 VAMGSGDSSESEQGVLSTYYLNIGLQNGVLLRSVLDSVTGDMSDTRTRYLGSRPVRLFKV 702
Query: 584 SSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTI 643
+N+ V A S R + Y + ++ + + F+S P+ + L I
Sbjct: 703 KIQNSEAVLAISSRSWLGYMFQSVSRLTPLSYDALDYASGFSSDQCPEGVVSIAGDTLRI 762
Query: 644 GTIDDIQKLHIR-SIPLGEHPRRICHQEQSRTFAIC------------SLKNQSCA---- 686
++ + + + +I L PRR +Q+ + S + Q A
Sbjct: 763 LALEKLGAVFNQMTINLKLTPRRFAIDQQNSNLVVVGSDHLCFTDSTKSERKQQMAKEII 822
Query: 687 -----EESEM------HFVR------------------------LLDDQTFEFISTYPLD 711
EE+E+ F+ L DQ+ E LD
Sbjct: 823 ESAGDEEAELAKEVAESFMNEVLPATEFGEPKAGNGQWASCIQLLAPDQSLE------LD 876
Query: 712 TFEYGCSILSCSFS-DDSNVYYCVGTAYVLPEENEPTKGRIL-VFIVEDGKLQLIAEKET 769
E S+ C F+ + VG A L + G ++ + + +G+L+L+ +
Sbjct: 877 QDEAALSVAICRFAYKPDETFVVVGVAKELNLNPSSSSGGLMNTYRMANGQLELVHKTVV 936
Query: 770 KGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY-VQTRGDFI 828
+ ++ AF G+LL + +++Y D G ++L +C + + + + G +
Sbjct: 937 EEVPRAMAAFQGRLLVGTGRILRVY-----DLGRKKLLRKCENKNFPYRIVTISSMGSRV 991
Query: 829 VVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILD-DDIYLGAENNFNLFTVRKN 887
+VGD+ +S+ + Y+ +E + A D + +++A LD D I +G + F + +
Sbjct: 992 IVGDVQESVHFVKYRAKENRLVVFADDVSPRYVTATCFLDYDTIAVG--DKFGSIAILRL 1049
Query: 888 SEGATDE------------ERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV--GQIPT 933
S+ DE +RG L G F G VM L + + G +
Sbjct: 1050 SDDINDEIEEDPTGAKAFWDRGLLN--GASQKANLEASFYIGETVMSLQKTTIIPGGSES 1107
Query: 934 VIFGTVNGVIGVIASLP---HEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVD 990
+I+ T++G IGV+ LP E+ F + L+ +LR + G +H +RS+
Sbjct: 1108 LIYTTLSGSIGVL--LPFTSREEVDFFQHLEMHLRSENAPICGRDHLAYRSY-----YFP 1160
Query: 991 AKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
AKN +DGD+ E F L ++ ++ ++ + E+ K++E++
Sbjct: 1161 AKNVIDGDMCEQFNALDGSKRRTLAMELDRTPPEISKKLEDM 1202
>gi|440473070|gb|ELQ41892.1| pre-mRNA-splicing factor rse-1 [Magnaporthe oryzae Y34]
Length = 1229
Score = 205 bits (521), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 287/1164 (24%), Positives = 485/1164 (41%), Gaps = 160/1164 (13%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
++G I L FR G ++D+L IA++ + ++++ + +R + + G R
Sbjct: 64 VFGIIRDLASFRLAGSSKDYLIIASDSGRITIVEY-LPAQNRFSRIHLETFGKSGVRRVV 122
Query: 65 NGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YG 119
GQ DP R LI L V+ +++ +L + + + L + + G
Sbjct: 123 PGQYLAADPKGRACLIASVERCKLVYVLNRNSQAELTISSPLEAHKNGTLTLSLVALDVG 182
Query: 120 CAKPTIVVLY-------QDNKD--ARHVKT----YEVALKDKDFVEGPWSQNNLDNGADL 166
+ P L QD K A++V+T YE+ L V W + +D+ A +
Sbjct: 183 YSNPVFAALEVDYSEVDQDPKGDAAQNVETVLNYYELDLGLNHVVRK-WF-DVVDSTASM 240
Query: 167 LIPVP-----PPLCGVLIIGEETIVYCSAN--AFKA-IPIRPSITK--AYGRVDADGSRY 216
L VP P GVL+ GEE I Y +N AF+ IP R T+ + R G +
Sbjct: 241 LFQVPGGSDGP--SGVLVCGEENITYRHSNQDAFRVPIPRRKGATEDPSRKRNIVSGVMH 298
Query: 217 LLGDHAGLLHLLVITHEKE------------------KVTGLKIELLGETSIASTISYLD 258
L AG L+ T + + +V LKI+ +++ + L
Sbjct: 299 KLKGSAGAFFFLLQTEDGDLFKVTIDMVEDAEGNPTGEVRRLKIKYFDTIPVSNNLCILK 358
Query: 259 NAVVYIGSSYGDS---QLIKL------------NLQPDAKGSY------------VEVLE 291
+ +++ S +G+ Q KL + D K Y + ++E
Sbjct: 359 SGFLFVASEFGNHLFYQFEKLGDDDEELEFFSSDFPVDPKEPYEPVYFYPRPTENLALVE 418
Query: 292 RYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGM 350
++ P++D V +L + Q+ T SG + R++++G+ +NE + +L G +
Sbjct: 419 SIDSMNPLMDLKVANLTEEDAPQIYTVSGKGARSTFRMLKHGLEVNEIVASQLPGTPSAV 478
Query: 351 WSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLV 410
W+ + DD +D ++V+SF + T +L++ + +EE GF S TL + LV
Sbjct: 479 WTTKLRRDDEYDAYIVLSFTNGTLVLSIG--ETVEEVSDTGFLSSVPTLAVQQLGDDGLV 536
Query: 411 QVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILT 469
QV +R + + + NEW SP S+ A N QV +A G +VY E+ DG L
Sbjct: 537 QVHPKGIRHIRNG---VVNEWSSPQHRSIVAAATNERQVAVALSSGEIVYFEMDTDGSLA 593
Query: 470 EVKHAQLEY-EISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGE 527
E + + ++ L + + E S AVG D +VRI SL P+ L +K
Sbjct: 594 EYDEKKEMFGTVTSLSLGEVPEGRLRSSYLAVGC-DDCTVRILSLDPESTLESKSVQALT 652
Query: 528 IIPRSVLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLR 581
P ++ + + E S YL L G L +L+ TGELTD ++ LG + + L
Sbjct: 653 AAPSALSIMSMEDSSSGGTTLYLHIGLNSGVYLRTVLDEVTGELTDTRQKFLGPKAVRLF 712
Query: 582 TFSSKNTTHVFAASDRPTVIYSS--NKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEG 639
S + T V A S R + +S K + +N +E+ F S + +
Sbjct: 713 QVSVQKRTCVLALSSRSWLGFSDPVTKGFTMTPLNYEELEWGWNFVSEQCEEGMVGVNGQ 772
Query: 640 ELTIGTIDDIQKLHI-RSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFV---- 694
L I I+ + I +SIPL PR++ R F N + A E +
Sbjct: 773 FLRIFAIEKLGDNVIQKSIPLTYTPRKLAKHPTQRIFYTIEADNNTLAPELREQLMAAPT 832
Query: 695 ------RLLDDQTFEF---------------------------ISTYPLDTFEYGCSILS 721
R+L F + + LD E S+
Sbjct: 833 AVNGDARVLPPDEFGYPRGNGRWASCISVVDPLGDGEELEPGVVQRIDLDNNEAALSMAV 892
Query: 722 CSF-SDDSNVYYCVGTAY-VLPEENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVYSLN 777
SF S D + VGT ++ T+G I V+ EDG+ L+ I + + + +L
Sbjct: 893 VSFASQDGESFLVVGTGKDMVVNPRRFTEGYIHVYRFSEDGRELEFIHKTKVEEPPTALL 952
Query: 778 AFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSI 837
F G+L+A I + +++Y LR R+ Q+E L + + T+G I+VGD+ +
Sbjct: 953 PFQGRLVAGIGRMLRIYDLGLR-QLLRKAQAEVAPQ---LIVSLNTQGSRIIVGDVQHGL 1008
Query: 838 SLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR--KNSEGATDEE 895
+ YK E + A D A W + ++D D GA+ NL+ +R + + +DE
Sbjct: 1009 IYVAYKSETNRLIPFADDTIARWTTCTTMVDYDSTAGADKFGNLWILRCPEKASQESDEP 1068
Query: 896 RGRLEVV-GEYHLGEFVNRFRHGSLV--MRLPDS------DVGQIPTVIFGTVNGVIGV- 945
+ +V +L NR + V +P S VG +++G G IGV
Sbjct: 1069 GSEVHLVHSRDYLHGTSNRLALMAHVYTQDIPTSICKTNLVVGGQEVLLWGGFQGTIGVL 1128
Query: 946 IASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLD 1005
I + E F + L+ +LR + G +H +R V K +DGDL E +
Sbjct: 1129 IPFVSREDADFFQSLEQHLRSEDPPLAGRDHLMYRGC-----YVPVKGVIDGDLCERYTM 1183
Query: 1006 LSRTRMDEISKTMNVSVEELCKRV 1029
L + I+ ++ SV E+ +++
Sbjct: 1184 LPNDKKQMIAGELDRSVREIERKI 1207
>gi|327309050|ref|XP_003239216.1| pre-mRNA splicing factor rse1 [Trichophyton rubrum CBS 118892]
gi|326459472|gb|EGD84925.1| pre-mRNA splicing factor rse1 [Trichophyton rubrum CBS 118892]
Length = 1209
Score = 204 bits (520), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 278/1164 (23%), Positives = 491/1164 (42%), Gaps = 161/1164 (13%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
++G I +L FR G ++D++ I ++ + ++++ + R + + G R
Sbjct: 64 VFGIIRSLAAFRLAGSSKDYIIIGSDSGRITIVEY-VPAQNRFNRIHLETFGKSGVRRVV 122
Query: 65 NGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEE-------LQVLDIK 115
GQ +DP R LI + L V+ + + +L + + L LD+
Sbjct: 123 PGQYLAVDPKGRACLIASVEKNKLVYVLNRNAQAELTISSPLEAHRPQTVVFALTALDVG 182
Query: 116 F---LYGCAKPTIVVLYQDNKDARHVKT------YEVALKDKDFVEGPWSQNNLDNGADL 166
+ ++ + + QD + +T YE+ L V W+ + +D A +
Sbjct: 183 YENPIFAALEVEYTEVDQDPTGQAYEETEKMLVYYELDLGLNHVVRR-WA-DPVDRTASM 240
Query: 167 LIPVPPPL---CGVLIIGEETIVYCSAN--AFKA-IPIR--PSITKAYGRVDADGSR--- 215
L VP GVL+ E+ IVY +N AFK IP R P+ R G
Sbjct: 241 LFQVPGGADGPSGVLVCAEDNIVYRHSNQDAFKVPIPRRRGPTENPERKRCITAGVMHKM 300
Query: 216 -----YLLGDHAGLLH---LLVITHEKEKVTG----LKIELLGETSIASTISYLDNAVVY 263
+LL G L + ++ E EK TG LK++ +AS++ L + ++
Sbjct: 301 RGAFFFLLQSEDGDLFKVTMEMVEDENEKATGEVKRLKLKYFDTVPLASSLCILKSGFLF 360
Query: 264 IGSSYGD-----------------------SQLIKLNLQP----DAKGSYVEVLERYVNL 296
+ S G+ S +I L P + ++E +L
Sbjct: 361 VASETGNQHFYQFEKLGDDDDEIEFISDDYSAIISEPLPPVYFRPRPAENLNLVESIASL 420
Query: 297 GPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSLRS 355
P++ + ++ + Q+ T G S R +++G+ ++E EL + +W+ +
Sbjct: 421 NPLMAASIANITEEDAPQIYTLCGTGARSSFRTLKHGLEVSEIVESELPSVPSAVWTTKL 480
Query: 356 STDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSG 415
S +D FD ++V+SF + T +L++ + +EE GF S TL + L+QV
Sbjct: 481 SRNDQFDAYIVLSFSNGTLVLSIG--ETVEEVTDTGFLSSAPTLAVQQLGEDSLIQVHPK 538
Query: 416 SVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTEV-KH 473
+R + + R NEW +P S+ AT+N QV +A G +VY E+ DG L E +
Sbjct: 539 GIRHIHADQR--VNEWPAPQHRSIVAATSNERQVAIALSSGEIVYFEMDTDGSLAEYDEK 596
Query: 474 AQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPRS 532
Q+ ++CL + + E S AVG D +VRI SL PD L K P +
Sbjct: 597 RQMSGTVTCLSLGEVPEGRVRSSFLAVGC-DDSTVRILSLDPDSTLENKSVQALTSAPSA 655
Query: 533 V-LLCAFEGIS-----YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSK 586
+ ++ + S YL L G L +L+ TGEL+D + LG +P+ L + S K
Sbjct: 656 LSIMSMIDSTSGGSTLYLHIGLYSGIYLRTVLDEVTGELSDTRTRFLGVKPVKLFSVSVK 715
Query: 587 NTTHVFAASDRPTVIYS--SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIG 644
V A S R + YS K + +N + F+S + + + L I
Sbjct: 716 EQRAVLALSSRSWLGYSDVQTKSFTLTPLNYVGLEWSWNFSSEQCVEGMVGIQGQNLRIF 775
Query: 645 TIDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQ--SCAEESEM---------- 691
+I+ + L IPL PR + F + N S A ++++
Sbjct: 776 SIEKLDNNLLQEPIPLAYTPRNFVRHPEYPLFYVIGSDNNILSPATKAKLLSESTTVNGD 835
Query: 692 ----------------HF---VRLLDD-QTFEFISTYPLDTFEYGCSILSCSF-SDDSNV 730
H+ ++++D T +S+ L+ E SI + SF S +
Sbjct: 836 SAELPPEGFGYPRGTNHWASSIQVVDPIHTKSVLSSLELEDNEAAVSIAAVSFTSQEDET 895
Query: 731 YYCVGTAY-VLPEENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVYSLNAFNGKLLAAI 787
+ VGT ++ T G I ++ E+GK L+ I + + + +L F G+LLA I
Sbjct: 896 FLVVGTGKDMVVSPRTFTCGFIHIYRFQEEGKELEFIHKTKVEQPPLALLGFQGRLLAGI 955
Query: 788 NQKIQLYKWMLRDDGTRELQSEC-GHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEE 846
+++Y D G R+L +C + + +QT+G I+V D+ +S++ ++YK++E
Sbjct: 956 GPDLRIY-----DLGMRQLLRKCQAQITPRVIVGLQTQGSRIIVSDVQESVTYVVYKYQE 1010
Query: 847 GAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDEERGRLEVVG 903
A+ A D W + ++D + G + N++ +R K SE A ++ G +
Sbjct: 1011 NALISFADDIIPRWTTCTTMVDYETVAGGDKFGNIWLLRCPTKASEEADEDGSGAHLIHE 1070
Query: 904 EYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGT--------------VNGVIGV-IAS 948
+L NR SLV+ D IPT I T + G +G+ +
Sbjct: 1071 RQYLQGAPNRL---SLVIHFYSQD---IPTSIQKTQLVAGGRDILVWTGLQGTVGMFVPF 1124
Query: 949 LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSR 1008
+ + F + L+ L + G +H +R + K +DGDL E+FL L
Sbjct: 1125 ITRDDVDFFQTLEMQLASQNPPLAGRDHLIYRGY-----YAPCKGVIDGDLCETFLLLPN 1179
Query: 1009 TRMDEISKTMNVSVEELCKRVEEL 1032
+ I+ ++ SV E+ +++ ++
Sbjct: 1180 DKKQAIAGELDRSVREIERKISDM 1203
>gi|336257679|ref|XP_003343663.1| hypothetical protein SMAC_08834 [Sordaria macrospora k-hell]
gi|380091896|emb|CCC10625.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 1209
Score = 204 bits (520), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 279/1168 (23%), Positives = 487/1168 (41%), Gaps = 169/1168 (14%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
I+G + + FR G +D++ +AT+ + ++++ ++++ + R
Sbjct: 64 IFGIVRAIASFRLAGSHKDYIILATDSGRITIIEYLPKTNKFQRIHLETFGKSGVRRVIP 123
Query: 66 GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YGC 120
GQ DP R LI + L V+ +++ +L + + + VL + + G
Sbjct: 124 GQYLAADPKGRACLISSLEKNKLVYVLNRNSQAELTISSPLEAHKPGVLVLSLVALDVGY 183
Query: 121 AKPTIVVLYQDNKDARHVKT-------------YEVALKDKDFVEGPWSQNNLDNGADLL 167
A P L D DA T YE+ L V WS + +D + LL
Sbjct: 184 ANPVFAALEIDYSDADQDPTGQAREEVETQLVYYELDLGLNHVVRK-WS-DTVDRTSSLL 241
Query: 168 IPVP-----PPLCGVLIIGEETIVYCSAN--AFKA-IPIRPSITK--AYGRVDADGSRYL 217
VP P GVL+ GEE + Y +N AF+ IP R T+ RV G +
Sbjct: 242 FQVPGGNDGP--SGVLVCGEENVTYRHSNQEAFRVPIPRRKGATEDPQRKRVIVSGVMHK 299
Query: 218 LGDHAGLLHLLVITHEKE------------------KVTGLKIELLGETSIASTISYLDN 259
L AG L+ T + + +V LKI+ +A+++ L +
Sbjct: 300 LKGSAGAFFFLLQTDDGDLFKITIDMIEDADGNPTGEVKRLKIKYFDTIPVATSLCILKS 359
Query: 260 AVVYIGSSYGDSQLIKL---------------NLQPDAKGSYVEV------LERYV---- 294
++ S +G+ + + D SY V LE V
Sbjct: 360 GFLFAASEFGNHHFYQFEKLGDDDEELEFSSDDFPTDPTASYNPVYFHPRPLENLVLVES 419
Query: 295 --NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMW 351
++ P VD V +L Q+ + G + R +++G+ ++E + EL G +W
Sbjct: 420 IDSMNPQVDCKVANLTGDDAPQIYSVCGNGARSTFRSLKHGLEVSEIVASELPGTPSAVW 479
Query: 352 SLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQ 411
+ + + D +D ++V+SF + T +L++ + +EE GF + TL + L+Q
Sbjct: 480 TTKLTKYDQYDAYIVLSFTNGTLVLSIG--ETVEEVSDSGFLTTAPTLAVRQMGEDGLIQ 537
Query: 412 VTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTE 470
V +R + NEW +P S+ ATAN +QV++A G +VY E+ DG L E
Sbjct: 538 VHPKGIRHIVQGRV---NEWPAPQHRSIVAATANENQVVIALSSGEIVYFEMDSDGSLAE 594
Query: 471 V-KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEI 528
+ ++ ++ L + + E S AVG D +VRI SL PD L K
Sbjct: 595 YDEKKEMSGTVTSLSVGQVPEGLKRSSFLAVGC-DDCTVRILSLDPDSTLEMKSIQALTA 653
Query: 529 IPRSVLLCAFE----GIS-YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTF 583
P ++ + + E G + YL L G L +L+ TGELTD ++ LG +P L
Sbjct: 654 APSALSIMSMEDSFGGFTLYLHIGLHSGVYLRTVLDEVTGELTDTRQKFLGPKPTKLFQV 713
Query: 584 SSKNTTHVFAASDRPTVIYSS--NKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGEL 641
S ++ V A S RP + Y+ K + + ++ E+ + F+S + + L
Sbjct: 714 SVQDQPCVLALSSRPWLGYTDPLTKGFMMTPLSYTELEYGWNFSSEQCLEGMVGIHANYL 773
Query: 642 TIGTIDDIQKLHI-RSIPLGEHPRRIC-HQEQSRTFAICSLKNQSCAEESEMHFVRLLDD 699
I +I+ + I +SIPL P+R+ H EQ + I S N E +LL+
Sbjct: 774 RIFSIEKLGDNMIQKSIPLTYTPKRLVKHPEQPYFYTIESDNNTLPPELR----AKLLEQ 829
Query: 700 QTFEFISTYPLDTFEY--------GCSILSCSFSDDSNVYYCVG--------TAYVLP-- 741
Q+ + P + F Y C + SD+ V + +A ++P
Sbjct: 830 QSNGDATILPPEDFGYPKAKGRWASCISIIDPISDEPRVLQRIDLDNNEAAVSAAIVPFA 889
Query: 742 -EENEP-----------------TKGRILVF-IVEDGK-LQLIAEKETKGAVYSLNAFNG 781
+E E T+G I V+ EDG+ L+ I + + +L F G
Sbjct: 890 SQEGESFLVVGTGKDMVLNPRQFTEGYIHVYRFHEDGRDLEFIHKTRVEEPPMALIPFQG 949
Query: 782 KLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLI 841
+LLA + + +++Y L+ R+ Q++ L + +Q++G+ I+VGDL + ++ ++
Sbjct: 950 RLLAGVGKTLRIYDLGLK-QLLRKAQADV---TPTLIVSLQSQGNRIIVGDLQQGVTYVV 1005
Query: 842 YKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEV 901
YK E + D W + ++D + + N+ VR + D + E
Sbjct: 1006 YKAEGNRLIPFVDDTLNRWTTCTTMVDYESVASGDKFGNISIVRCPERVSQDTD----EP 1061
Query: 902 VGEYHLGEFVNRFRHGS----------LVMRLPDS------DVGQIPTVIFGTVNGVIGV 945
E HL N + HG+ LP S VG +++ + G +GV
Sbjct: 1062 GSEIHLMHARN-YLHGTPNRLSLQVHFFTQDLPTSICKTSLVVGGQDVLLWSGLQGTVGV 1120
Query: 946 -IASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFL 1004
I + E F + L+ ++R + G +H +R + K V +DGDL E F
Sbjct: 1121 FIPFVSREDVDFFQNLENHMRAEDPPLAGRDHLIYRGYYTPVKGV-----IDGDLCERFS 1175
Query: 1005 DLSRTRMDEISKTMNVSVEELCKRVEEL 1032
L + I+ ++ SV E+ +++ ++
Sbjct: 1176 LLPNDKKQMIAGELDRSVREIERKISDI 1203
>gi|358366518|dbj|GAA83139.1| nuclear mRNA splicing factor [Aspergillus kawachii IFO 4308]
Length = 1209
Score = 204 bits (520), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 273/1168 (23%), Positives = 485/1168 (41%), Gaps = 169/1168 (14%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
++G I +L FR G ++D++ I ++ + ++++ S R + + G R
Sbjct: 64 VFGIIRSLAAFRLAGSSKDYIIIGSDSGRITIIEY-VPSQNRFNRIHLETFGKSGVRRVI 122
Query: 65 NGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YG 119
GQ +DP R +I + L V+ +++ +L + + + Q L G
Sbjct: 123 PGQYLAVDPKGRACIIASVEKNKLVYVLNRNSQAELTISSPLEAHKPQTLVFAMTALDVG 182
Query: 120 CAKPTIVVLYQDNKDA-------------RHVKTYEVALKDKDFVEGPWSQNNLDNGADL 166
P L D ++ +H+ YE+ L V WS + +D A L
Sbjct: 183 YENPVFAALEVDYSESDQDPTGQAFEELEKHLVYYELDLGLNHVVRK-WS-DPVDRTASL 240
Query: 167 LIPVPPPL---CGVLIIGEETIVYCSAN--AFKA-IPIRPSITKAYGRVDA--------- 211
L VP GVL+ E+++ Y +N AF+ IP R T+ R +
Sbjct: 241 LFQVPGGADGPSGVLVCAEDSVTYRHSNQDAFRVPIPRRSGATENPERKRSITAGVMHKM 300
Query: 212 DGSRYLL--GDHAGLLHLLV--ITHEKEKVTG----LKIELLGETSIASTISYLDNAVVY 263
G+ + L + L L + + + ++TG LKI+ +AS++ L + +Y
Sbjct: 301 RGAFFFLIQTEDGDLFKLTIDMVEDDNGQLTGEVKRLKIKYFDTVPLASSLLILKSGFLY 360
Query: 264 IGSSYGDSQLIKLNLQPD--------------------------AKGS-YVEVLERYVNL 296
+ S G+ + D +G+ + ++E +L
Sbjct: 361 VASEAGNHHFYQFEKLGDDDEEIEFTSEAFSADPSVPCDPIYFRPRGAENLNLVETINSL 420
Query: 297 GPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSLRS 355
P++D VV++ Q+ T SGA S R +++G+ ++E EL + +W+ +
Sbjct: 421 NPLIDSKVVNITEDDAPQIYTISGAGARSSFRTLKHGLEVSEIVESELPSVPSAVWTTKL 480
Query: 356 STDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSG 415
+ D FD ++++SF + T +L++ + +EE GF S TL + L+Q+
Sbjct: 481 TRSDEFDAYIILSFANGTLVLSIG--ETVEEVTDTGFLSTAPTLAVQQLGEDSLIQIHPR 538
Query: 416 SVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTEV-KH 473
+R + + R NEW +P S+ A N QV +A G +VY E+ DG L E +
Sbjct: 539 GIRHILADRR--VNEWPAPQHRSIVAAATNERQVAVALSSGEIVYFEMDSDGSLAEYDER 596
Query: 474 AQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPRS 532
Q+ ++CL + + E S AVG D +VRI SL PD L K P +
Sbjct: 597 RQMSGTVTCLSLGEVPEGRVRSSFLAVGC-DDSTVRILSLDPDSTLENKSVQALTAAPSA 655
Query: 533 VLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSK 586
+ + + S YL L G L +L+ TGEL+D + LG++ + L S K
Sbjct: 656 LNIMSMADSSSGGTTLYLHIGLYSGVYLRTVLDEVTGELSDTRTRFLGSKAVKLFQVSVK 715
Query: 587 NTTHVFAASDRPTVIYS--SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIG 644
T V S RP + YS K + + ++ + F+S + + + L I
Sbjct: 716 GQTAVLGLSSRPWLGYSDVQTKGFMLTPLDYVGLEWGWNFSSEQCVEGMVGIQGQNLRIF 775
Query: 645 TIDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVR-------- 695
+I+ + + +SIPL PRR + F + N + + +
Sbjct: 776 SIEKLDNNMLQQSIPLSYTPRRFLKHPEQPLFYVIESDNNVLSPSTRAKLLEDSKSRGGD 835
Query: 696 --LLDDQTFEF----------------------ISTYPLDTFEYGCSILSCSF-SDDSNV 730
+L + F + + T L+ E SI + F S D
Sbjct: 836 ETVLPPEDFGYPRATGHWASCIQVVDPLDAKAVVHTIELEENEAAISIAAVPFTSQDDET 895
Query: 731 YYCVGTAYVLPEENEPTK--GRILVF-IVEDGK-LQLIAEKETKGAVYSLNAFNGKLLAA 786
+ VGTA + N P G I ++ EDG+ L+ I + + + +L F G+L+A
Sbjct: 896 FLVVGTAKDM-SVNPPKSAGGYIHIYRFQEDGRELEFIHKTKVEEPPLALLGFQGRLVAG 954
Query: 787 INQKIQLYKWMLRDDGTRELQSECGH---HGHILALYVQTRGDFIVVGDLMKSISLLIYK 843
I +++Y D G ++L +C I+ L QT+G IVV D+ +S++ ++YK
Sbjct: 955 IGSLLRIY-----DLGMKQLLRKCQAPVVPKTIVGL--QTQGSRIVVSDVRESVTYVVYK 1007
Query: 844 HEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDE------ 894
++E + D + W +A ++D + G + NL+ +R K SE A ++
Sbjct: 1008 YQENVLIPFVDDSVSRWTTATTMVDYETTAGGDKFGNLWLLRCPKKTSEEADEDGSGAHL 1067
Query: 895 --ERG-------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGV 945
ERG RLE++ + + LV G +++ G IG+
Sbjct: 1068 IHERGYLQGTPNRLELMIHVYTQDIPTSLHKTQLV-------AGGRDILVWTGFQGTIGM 1120
Query: 946 IAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFL 1004
+ + E F + L+ L + G +H +RS+ K V +DGDL E +
Sbjct: 1121 LVPFIGREDVDFFQNLEMQLAAQHPPLAGRDHLIYRSYYAPVKGV-----IDGDLCEMYF 1175
Query: 1005 DLSRTRMDEISKTMNVSVEELCKRVEEL 1032
L I+ ++ SV E+ +++ ++
Sbjct: 1176 LLPNDTKMMIAAELDRSVREIERKISDM 1203
>gi|242772631|ref|XP_002478075.1| nuclear mRNA splicing factor, putative [Talaromyces stipitatus ATCC
10500]
gi|218721694|gb|EED21112.1| nuclear mRNA splicing factor, putative [Talaromyces stipitatus ATCC
10500]
Length = 1209
Score = 204 bits (519), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 273/1169 (23%), Positives = 487/1169 (41%), Gaps = 171/1169 (14%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
++G I +L FR G +D++ I ++ + ++++ S R + + G R
Sbjct: 64 VFGIIRSLAAFRLAGSNKDYIIIGSDSGRITIIEY-VPSQNRFNRIHLETFGKSGVRRVI 122
Query: 65 NGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YG 119
GQ +DP R LI + L V+ +++ +L + + + Q L G
Sbjct: 123 PGQYLAVDPKGRACLIASVEKNKLVYVLNRNSQAELTISSPLEAHKAQTLVFALTALDVG 182
Query: 120 CAKPTIVVLYQDNKDARHVKT-------------YEVALKDKDFVEGPWSQNNLDNGADL 166
P + D + T YE+ L V WS + +D +++
Sbjct: 183 YDNPIFAAIEVDYSECDQDPTGRAYEEVEKQLVYYELDLGLNHVVRR-WS-DVIDRTSNI 240
Query: 167 LIPVPPPL---CGVLIIGEETIVYCSAN--AFKAIPI------------RPSITKAYGRV 209
L VP GVL+ E++I Y +N AF+ +PI + SIT
Sbjct: 241 LFQVPGGADGPSGVLVCAEDSITYRHSNQDAFR-VPIPRRRGPTENPERKRSITAGVMHK 299
Query: 210 DADGSRYLLGDHAGLLHLLVITHEKE---KVTG----LKIELLGETSIASTISYLDNAVV 262
+LL G L L I ++ ++TG LKI+ IA+ + L + +
Sbjct: 300 MRGAFFFLLQTEDGDLFKLTIDMVEDDNGQLTGEVKRLKIKYFDTVPIATNLLILKSGFL 359
Query: 263 YIGSSYGDSQLIKL---------------NLQPDAKGSYVEV------------LERYVN 295
++ S G+ + ++ D + V +E +
Sbjct: 360 FVASETGNHHFYQFEKLGDDDEETEFPSDDVSADLADPIIPVYFNVRDAENLNLVESVNS 419
Query: 296 LGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSLR 354
L PI+D + DL + Q+ T G + R +++G+ ++E EL + +W+ +
Sbjct: 420 LNPIMDCKITDLLAEDAPQIYTICGTGARSTFRTLKHGLDVSEIVESELPSVPSAVWTTK 479
Query: 355 SSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTS 414
+ D FD ++V+SF + T +L++ + +EE GF S TL + L+QV
Sbjct: 480 LTRKDEFDAYIVLSFTNGTLVLSIG--ETVEEVTDTGFLSTAPTLAVQQLGEDSLIQVHP 537
Query: 415 GSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTEV-K 472
+R + + R NEW +P ++ A N QV +A G +VY E+ DG L E +
Sbjct: 538 KGIRHILADHR--VNEWPAPQHRTIVAAATNERQVAVALSSGEIVYFEMDTDGSLAEYDE 595
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPR 531
Q+ ++ L + + E S AVG D +VRI SL PD L K P
Sbjct: 596 KRQMSGTVTSLSLGEVPEGRVRSSFLAVGC-DDSTVRILSLDPDSTLENKSVQALTSAPS 654
Query: 532 SVLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSS 585
++ + + + YL L G L +L+ TGEL+D + LG +P+ L S
Sbjct: 655 ALNIMSMADSTSGGSTLYLHIGLYSGVYLRTVLDEVTGELSDTRTRFLGAKPVKLFGVSV 714
Query: 586 KNTTHVFAASDRPTVIYS--SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTI 643
K + V A S RP + YS K + + ++ + F+S + + + L I
Sbjct: 715 KGQSAVLALSSRPWLGYSDIQTKSFMLTPLDYVGLEWGWNFSSEQCLEGMVGIQGQNLRI 774
Query: 644 GTIDDIQKLHIR-SIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDD--- 699
+I+ + ++ SIPL PR + F + +N A ++ RLL+D
Sbjct: 775 FSIEKLDNNVLQESIPLAYTPRHFVKHPEQPLFYVIESENNVLAPATQ---TRLLEDSKL 831
Query: 700 ----------QTFEF----------------------ISTYPLDTFEYGCSILSCSF-SD 726
+TF F +S L+ E S+ + SF S
Sbjct: 832 QNGEAVIPPAETFGFPRATGHWASCIEVVDPINSKSVLSRLELEENESAVSVAAVSFASQ 891
Query: 727 DSNVYYCVGTAY-VLPEENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVYSLNAFNGKL 783
D+ + VGT V+ + G I ++ EDG+ L+ I + + + +L AF G+L
Sbjct: 892 DNETFLVVGTGKDVVTYPRSFSAGFIHIYRFQEDGRELEFIHKTKIEEPPLALLAFQGRL 951
Query: 784 LAAINQKIQLYKWMLRDDGTRELQSECGHHGHI-LALYVQTRGDFIVVGDLMKSISLLIY 842
+A I + +++Y D G +++ +C L + +QT+G I+V D+ +S++ ++Y
Sbjct: 952 VAGIGKNLRVY-----DLGMKQMLRKCQVEASPNLIVGLQTQGSRIIVSDVQESVTYVVY 1006
Query: 843 KHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDEERG-- 897
K++E + D A W +A ++D + G + NL+ VR K SE + ++ G
Sbjct: 1007 KYQENQLIPFVDDVIARWTTATTMVDYETTAGGDKFGNLWLVRCPKKVSEESDEDGSGAH 1066
Query: 898 -------------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIG 944
RL+++ ++ + +LV VG +++ + G IG
Sbjct: 1067 LIHERSYLQGTPNRLDLMVHFYTQDIPTSLHKTNLV-------VGGRDILVWTGLQGTIG 1119
Query: 945 -VIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESF 1003
+I + E F + L+ L + G H +RS+ V K +DGDL ES+
Sbjct: 1120 MMIPFISREDVDFFQNLEMQLASQNPPLAGREHLIYRSY-----YVPVKGVIDGDLCESY 1174
Query: 1004 LDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
L + I+ ++ SV E+ +++ ++
Sbjct: 1175 FLLPNDKKLMIAGELDRSVREIERKISDM 1203
>gi|195490209|ref|XP_002093045.1| GE20993 [Drosophila yakuba]
gi|194179146|gb|EDW92757.1| GE20993 [Drosophila yakuba]
Length = 1227
Score = 204 bits (519), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 263/1199 (21%), Positives = 470/1199 (39%), Gaps = 197/1199 (16%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+L I+G I L FR G +D++ + ++ + +L+++A + L
Sbjct: 53 LLSTEIFGCIRALMAFRLTGGTKDYIVVGSDSGRIVILEYNAAKNALEKVHQETFGKSGC 112
Query: 61 RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL- 117
R GQ IDP R +IG L ++ D + +L + + + L +
Sbjct: 113 RRIVPGQYFAIDPKGRAVMIGAVEKQKLAYIMNRDTQARLTISSPLEAHKSNTLTYHMVG 172
Query: 118 --YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNGA 164
G P L D ++A + + A + + F E N+ L+ A
Sbjct: 173 VDVGFDNPMFACLEIDYEEADMDPSGDAAQRTQQTLTFYELDLGLNHVVRKYSEPLEEHA 232
Query: 165 DLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR---- 215
+ L+ VP P GVLI E + Y N IR I + +D D R
Sbjct: 233 NFLVSVPGGNDGP--SGVLICSENYLTY--KNLGDQHDIRCPIPRRRNDLD-DPERGMIF 287
Query: 216 -------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVV 262
+LL G + + + + + V+ +K++ A+ + L +
Sbjct: 288 ICSATHRTKSMYFFLLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPATAMCVLKTGFL 347
Query: 263 YIGSSYGDSQLIKLN------------------------LQPDAKGSYVEVLERYVNLGP 298
++ S +G+ L ++ P A + V +++ + P
Sbjct: 348 FVASEFGNHYLYQIAHLGDDDDEPEFSSAMPLEEGETFFFAPRALKNLV-LVDELPSFAP 406
Query: 299 IVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSST 357
I+ V DL + Q+ G +LR++R+G+ ++E A EL G +W+++
Sbjct: 407 IITSQVADLANEDTPQLYVLCGRGPRSTLRVLRHGLEVSEMAVSELPGNPNAVWTVKKRA 466
Query: 358 DDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSV 417
DD FD +++VSF++ T +L++ + +EE GF T TL C + LVQV +
Sbjct: 467 DDEFDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLCCAALGDDALVQVYPDGI 524
Query: 418 RLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKHAQ 475
R + S R NEWK+P S+ N QV++ G LVY E+ G L E + ++
Sbjct: 525 RHIRSDKR--VNEWKAPGKKSITKCAVNQRQVVITLSGRELVYFEMDPTGELNEYTERSE 582
Query: 476 LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLL 535
+ EI C+ + + E S AVG+ D +VRI SL N +T + P L
Sbjct: 583 MPAEIMCMALGTVPEGEQRSWFLAVGL-ADNTVRILSLDPNNCLTPCSMQALPSPAES-L 640
Query: 536 CAFE-------------------------GISYLLCALGDGHLLNFLLNMKTGELTDRKK 570
C E G YL L +G LL +L+ +G+L D +
Sbjct: 641 CLVEMGHTESTTQGALDDDAPAPRSGNNKGTIYLNIGLSNGVLLRTVLDPVSGDLADTRT 700
Query: 571 VSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFP 630
LG++P+ L + V A S R + Y + + ++ + + + F+S
Sbjct: 701 RYLGSRPVKLFRIKMQGAEAVLAMSSRTWLSYYHQNRFHLTPLSYETLEYASGFSSEQCS 760
Query: 631 DSLAIAKEGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQEQSR-----------TFAI 677
+ + L I ++ + + + + PL PR + H + R T
Sbjct: 761 EGIVAISTNTLRILALEKLGAVFNQVAFPLQYTPRTFVIHPDTGRMLIAETDHNAYTEDT 820
Query: 678 CSLKNQSCAEE-------------SEM-------------------------HFVRLLDD 699
S + + AEE EM +R LD
Sbjct: 821 KSARKEQMAEEMRSAAGDEERELAREMANAFINEVLPEDVFSSPKAGLGLWASQIRCLDA 880
Query: 700 QTFEFISTYPLDTFE--YGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVE 757
+ + + PL E ++L S + D Y VG A L ++G +
Sbjct: 881 MHGQTMFSVPLTQNEAIMSMAMLKFSIAADGRYYLAVGIAKDLQLNPRISQGGCIDIYKI 940
Query: 758 D---GKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSEC-GHH 813
D L+ + E +L F G+LLA + +++Y D G +++ +C H
Sbjct: 941 DPTCSSLEFMHRTEIDEIPGALCGFQGRLLAGCGRMLRIY-----DFGKKKMLRKCENKH 995
Query: 814 GHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYL 873
+ +Q G + V D+ +S+ + Y+ E + A D + W++A +LD D +
Sbjct: 996 IPYQIVNIQAMGHRVYVSDVQESVFFIRYRRAENQLIIFADDTHPRWVTATTLLDYDT-I 1054
Query: 874 GAENNFNLFTVRKNSEGATDE------------ERGRL-------EVVGEYHLGEFVNRF 914
+ F ++++ TD+ +RG L E + +H+GE +
Sbjct: 1055 AIADKFGNLSIQRLPHSVTDDVDEDPTGTKSLWDRGLLSGASQKSENICSFHVGEIIMSL 1114
Query: 915 RHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGG 973
+ +L+ G +I+ T++G +G E Y F + L+ ++R + G
Sbjct: 1115 QKATLI-------PGGSEALIYATLSGTVGAFVPFTSREDYDFFQHLEMHMRNENPPLCG 1167
Query: 974 LNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
+H +RS KN LDGDL E +L + + I+ M + ++CK++E++
Sbjct: 1168 RDHLSYRS-----SYYPVKNVLDGDLCEQYLSIEAAKQKSIAGDMFRTPNQICKKLEDI 1221
>gi|357147370|ref|XP_003574320.1| PREDICTED: splicing factor 3B subunit 3-like [Brachypodium
distachyon]
Length = 1228
Score = 204 bits (518), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 270/1190 (22%), Positives = 475/1190 (39%), Gaps = 187/1190 (15%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+L V ++G I +L FR G +D+L + ++ + +L++ + +
Sbjct: 63 LLSVDVFGAIRSLAQFRLTGATKDYLVVGSDSGRLVILEYSPDRNRFDKVHQETFGKSGC 122
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEE---------LQV 111
R GQ+ +DP R + + + V + + + LE L
Sbjct: 123 RRIVPGQLLTVDPKGRALCIAALEKQKLVYVLNRDASARLTISSPLEAHKSHTLTFALTA 182
Query: 112 LDIKF---LYGCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNN--------L 160
LD F ++G I + Y D+ + A K F E N+ +
Sbjct: 183 LDCGFDNPVFG----AIELEYGDSDRDPTGQAASHAQKLLTFYELDLGLNHVSRKVSEPI 238
Query: 161 DNGADLLIPVP-----PPLCGVLIIGEETIVYCS---ANAFKAIPIRPSITKAYGR--VD 210
DNGA+LL+ VP P G+L+ + ++Y + IP R + G V
Sbjct: 239 DNGANLLVTVPGGGDGP--SGLLVCCDNFVLYRNQGHPEVRAVIPRRVDLPAERGVLIVA 296
Query: 211 ADGSR------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYI 264
A R +LL G + + + H + V+ L+I+ +AS I L + ++
Sbjct: 297 AATHRQKSLFFFLLQTEYGDIFKVDLEHNGDSVSELRIKYFDTIPVASAICVLRSGFLFA 356
Query: 265 GSSYGDSQLIKLN--------------------------LQPDAKGSYVEVLERYVNLGP 298
S +G+ L + QP A + V + E +L P
Sbjct: 357 ASEFGNHALYQFRDIGRDVDVESSSATLMETEEGFQPVFFQPRALKNLVRIDE-IESLMP 415
Query: 299 IVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIK-GMWSLRSST 357
I+D + +L + QV T G ++RI+R G+ I+E A L +W+++ +
Sbjct: 416 IMDMRIANLFDEETPQVYTACGRGSRSTMRILRPGLAISEMARSMLPAEPIAVWTVKKNI 475
Query: 358 DDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSV 417
+D FD ++VVSF + T +L++ + +EE F T +L + L+QV +
Sbjct: 476 NDMFDAYIVVSFANVTLVLSIG--ETIEEVSDSQFLDTTHSLAVSLLGEDSLMQVHPNGI 533
Query: 418 RLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTEVKHAQL 476
R + R NEW++P ++ +N QV++A GG L+Y E+ G L EV+ +
Sbjct: 534 RHIREDGRV--NEWRTPGKKTITKVGSNRLQVVIALSGGELIYFEMDMTGQLMEVEKQDM 591
Query: 477 EYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPRSVLL 535
+++CL I P+ E S+ AVG + D ++RI SL PD L P S+L
Sbjct: 592 SGDVACLAIAPVPEGRQRSRFLAVGSY-DNTIRILSLDPDDCLQPLSVQSVSSAPESLLF 650
Query: 536 CAFE-------GISY-----LLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTF 583
+ G Y L L +G L ++M TG+L+D + LG +P L
Sbjct: 651 LEVQASVGGEDGADYPANLFLNAGLQNGVLFRTNVDMVTGQLSDTRSRFLGLRPPKLFPC 710
Query: 584 SSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTI 643
+ + S RP + Y L + ++ + F+S + + L I
Sbjct: 711 IVSHRQAMLCLSSRPWLGYIHQGHFLLTPLSCDTLESAASFSSDQCSEGVVAVAGDALRI 770
Query: 644 GTIDDI-QKLHIRSIPLGEHP------------------------------RRICHQ--- 669
TI+ + + + +IPL P +R C +
Sbjct: 771 FTIERLGETFNETAIPLRYTPRKFVILPKKKYLAVIESDKGAFSAEQREAAKRECLEASG 830
Query: 670 ----------------------EQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFIST 707
E+S T E + +R+LD ++ +
Sbjct: 831 AVENGNGNGDQMENGDGQEDGAEESNTLPDEQYGYPKAESEKWVSCIRILDPRSRDTTCL 890
Query: 708 YPLDTFEYGCSILSCSFSD-DSNVYYCVGTAYVLP--EENEPTKGRILVF-IVEDGK-LQ 762
L E SI + +F D + VGT L + G I ++ V++GK L+
Sbjct: 891 LELQDNEAAVSICTVNFHDKEHGTLLAVGTTKGLQFWPKRSLASGFIHIYKFVDEGKSLE 950
Query: 763 LIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY-V 821
L+ + + + SL F G+LLA + ++LY D G R+L +C + + +
Sbjct: 951 LLHKTQVEEVPLSLCQFQGRLLAGVGSVLRLY-----DLGKRKLLRKCENKLFPRTIVSI 1005
Query: 822 QTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNL 881
T D I VGD+ +S Y+ +E + A D W++A +D D GA+ N+
Sbjct: 1006 HTYRDRIYVGDMQESFHYCKYRRDENQLYIFADDSVPRWLTAANHIDFDTMAGADKFGNI 1065
Query: 882 FTVR------------------KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRL 923
+ R K +G + ++E + ++H+G+ V + SL+
Sbjct: 1066 YFARLPQDLSDEIEEDPTGGKIKWEQGKLNGAPNKVEEIVQFHVGDVVTCLQKASLI--- 1122
Query: 924 PDSDVGQIPTVIFGTVNGVIG-VIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSF 982
G +I+GTV G +G ++A E F L+ +LR+ + G +H +RS
Sbjct: 1123 ----PGGGECLIYGTVMGSVGALLAFTSREDVDFFSHLEMHLRQEHPPLCGRDHMAYRS- 1177
Query: 983 NNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
K+ +DGDL E + L +I+ ++ + E+ K++E++
Sbjct: 1178 ----AYFPVKDVIDGDLCEQYPSLPADMQRKIADELDRTPGEILKKLEDI 1223
>gi|91092128|ref|XP_972649.1| PREDICTED: similar to AGAP005549-PA [Tribolium castaneum]
gi|270004662|gb|EFA01110.1| hypothetical protein TcasGA2_TC010322 [Tribolium castaneum]
Length = 1219
Score = 203 bits (517), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 262/1190 (22%), Positives = 480/1190 (40%), Gaps = 187/1190 (15%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+L V I+G + +L FR G +DF+ + ++ + +L++ S + + + G
Sbjct: 53 LLTVEIFGVVRSLMSFRLTGGTKDFVIVGSDSGRIVILEY-IPSKNNFEKVHQETFGKSG 111
Query: 61 -RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
R GQ DP R +IG L ++ D + +L + + + L +
Sbjct: 112 VRRIVPGQYLATDPRGRAVMIGAVEKQKLAYILNRDVQARLTISSPLEAHKSNTLVYHMV 171
Query: 118 ---YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNG 163
G P L D ++A + E A K + F E N+ L+
Sbjct: 172 GVDVGYDNPMFACLEIDYEEADSDPSGEAAQKTQQTLTFYELDLGVNHVVRKYSEPLEEH 231
Query: 164 ADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR--- 215
A+ L+ VP P GVLI E + Y N IR I + +D D R
Sbjct: 232 ANFLVTVPGGDDGP--SGVLICSENYLTY--KNLGDQHDIRCPIPRRRNDLD-DPERGMI 286
Query: 216 --------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAV 261
+L G + + + +++ VT +K++ +AS + L
Sbjct: 287 FVCSATHKTKSMFFFLAQTEQGDIFKITLETDEDMVTEIKLKYFDTVPVASAMCVLKTGF 346
Query: 262 VYIGSSYGDSQLIKL-------------NLQPDAKGSYVEVLERYV----------NLGP 298
+++ S +G+ L ++ + P +G R + +L P
Sbjct: 347 LFVTSEFGNHYLYQIAHLGDDDDELEFSSAMPLEEGDTFFFAPRSLRNLVLVDEMESLSP 406
Query: 299 IVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSST 357
I+ V DL + Q+ G SLR++R+G+ ++E A EL G +W+++ +
Sbjct: 407 ILSCRVADLAGEDTPQLYMLCGRGPRSSLRVLRHGLEVSEMAVSELPGNPNAVWTVKRRS 466
Query: 358 DDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSV 417
DD +D +++VSF++ T +L++ + +EE GF T TL C + LVQV G +
Sbjct: 467 DDEYDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSALSDDALVQVYPGGI 524
Query: 418 RLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD-GILTEVK-HAQ 475
R + S R NEWK+P ++ N QV++A GG L Y E+ G L E K +
Sbjct: 525 RHICSDKR--VNEWKAPGKKTIVKCAINQRQVVIALSGGELAYFEMDPTGQLHEYKERKR 582
Query: 476 LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEII------ 529
+ ++ C+ + + S AVG+ D +VRI SL + + + G +
Sbjct: 583 MNADVVCMALANVAPGEQLSLFLAVGL-ADSTVRIISLDPSDCLAPRSIQGLPVCAESLC 641
Query: 530 ----------PRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPIT 579
P + + YL L +G LL +L+ +GEL+D + LG++P+
Sbjct: 642 IVEMGCTDREPDNAAAASTTSTLYLNIGLTNGALLRNVLDPVSGELSDTRTRYLGSRPVK 701
Query: 580 LRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEG 639
L + + V A S R + Y + + ++ + + + F+S P+ +
Sbjct: 702 LFRIRMQQSEAVLAMSSRSWLSYYYQSRFYLTPLSYESLEYASGFSSEQCPEGIVAISTN 761
Query: 640 ELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQEQSRTFAICSLKN--------------- 682
L I ++ + + + S PL PR+ I H E + I + N
Sbjct: 762 TLRILALEKLGAVFNQVSFPLEYTPRKFIIHPESNNLLIIETEHNAYTEETKKQRRLQMA 821
Query: 683 ---QSCAEESEMHFVR-----LLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCV 734
+ A E E + L++ E I + P S L V+ C+
Sbjct: 822 EEMKEAAGEEEQELAKEMADAFLNEDLPESIFSAPKAGHGMWASTLKIMDPVQGIVHKCI 881
Query: 735 G--------TAYVLPEENEPTKGRILVFIVEDG-----------------------KLQL 763
++ ++ +N+P + L+ + +L+
Sbjct: 882 RLEQNEAAMSSVLVKFQNQPQQTLFLIVGISKDFQLNPRHCNTGFLDTYKMDPMGRELEF 941
Query: 764 IAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHI--LALYV 821
+ +L A+NG LLA + + ++LY D G ++L +C + HI + +
Sbjct: 942 VHRTPVDEVPMALCAYNGLLLAGVGRMLRLY-----DMGKKKLLRKC-ENKHIPNAIVNI 995
Query: 822 QTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNL 881
Q G I V D+ +S+ ++ YK E + A D + W++ +LD D A+ N+
Sbjct: 996 QAMGKRIFVSDVQESVFMVRYKRAENQLIIFADDTHPRWVTCNCVLDYDTVAVADKFGNI 1055
Query: 882 FTVRKNSEGATDEE-----------RGRL-------EVVGEYHLGEFVNRFRHGSLVMRL 923
+R + D E RG L + + +H+GE V + +L+
Sbjct: 1056 AILRLPPNVSDDVEEDPTGHKSLWDRGLLNGASQKADTIATFHVGETVTWLQKATLI--- 1112
Query: 924 PDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSF 982
G ++I+ T++G +GV+ E + F + L+ ++R + G +H +RS+
Sbjct: 1113 ----PGGWESLIYTTLSGSVGVLVPFTSREDHDFFQHLEMHMRSENSPLCGRDHLSFRSY 1168
Query: 983 NNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
KN +DGDL E + L + I+ ++ + E+ K++E++
Sbjct: 1169 -----YFPVKNVIDGDLCEQYNSLEPVKQKSIASDLDRTPAEVSKKLEDI 1213
>gi|392869416|gb|EJB11761.1| pre-mRNA-splicing factor rse1 [Coccidioides immitis RS]
Length = 1209
Score = 203 bits (517), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 273/1176 (23%), Positives = 483/1176 (41%), Gaps = 185/1176 (15%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
++G I +L FR G ++D++ I ++ + ++++ S R + + G R
Sbjct: 64 VFGIIRSLAAFRLAGSSKDYIIIGSDSGRIAIVEY-VPSQNRFNRIHLETFGKSGIRRVV 122
Query: 65 NGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YG 119
GQ +DP R LI + L V+ + + +L + + Q L G
Sbjct: 123 PGQYLAVDPKGRACLIASVEKNKLVYVLNRNAQAELTISSPLEAHRPQTLVFALTALDVG 182
Query: 120 CAKPTIVVLYQDNKDARHVKT-------------YEVALKDKDFVEGPWSQNNLDNGADL 166
P L D ++ T YE+ L V W+ + +D A +
Sbjct: 183 YENPIFAALEVDYTESDQDPTGAAYQEAEKLLVYYELDLGLNHVVRR-WA-DPVDRSAAM 240
Query: 167 LIPVP-----PPLCGVLIIGEETIVYCSAN--AFKA-IPIRPSITKAYGRVDADGSRYLL 218
L VP P GVL+ GE I Y +N AF+ IP R T+ + + RY+
Sbjct: 241 LFQVPGGADGP--SGVLVCGEGNITYRHSNQDAFRVPIPRRSGPTE-----NPERKRYIT 293
Query: 219 GDHAGLLHLL---------------------VITHEKEKVTG----LKIELLGETSIAST 253
AG++H + ++ E ++TG LK++ +AS+
Sbjct: 294 ---AGVVHKMRRAFFCLLQTEDGDLFKVTMDMVEDENGQLTGEVQRLKLKYFDTVPVASS 350
Query: 254 ISYLDNAVVYIGSSYGDSQLIKLN-----------------------LQP----DAKGSY 286
+ L N +++ S G+ + L P
Sbjct: 351 LCILKNGFLFVASETGNHHFYQFEKLGDDDEETEFSSDDFSADPSEPLAPVYFRPRPAEN 410
Query: 287 VEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG 346
+ ++E +L P++ + +L Q T SG + R +++G+ ++E EL
Sbjct: 411 LNLVESINSLNPLMSCKITNLTEDDAPQFYTLSGTGARSTFRTLKHGLEVSEIVESELPS 470
Query: 347 I-KGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAI 405
+ +W+ + + +D +D ++++SF + T +L++ + +EE GF S TL
Sbjct: 471 VPSAVWTTKLTRNDQYDAYIILSFSNGTLVLSIG--ETVEEVTDTGFLSSAPTLAVQQLG 528
Query: 406 YNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG- 464
+ L+QV +R + + R NEW +P S+ A N QV +A G +VY E+
Sbjct: 529 EDSLIQVHPKGIRHIHADRR--VNEWPAPQHRSIVAAATNERQVAVALSSGEIVYFEMDT 586
Query: 465 DGILTEV-KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKE 522
DG L E + ++ ++CL + + S AVG D +VRI SL PD L K
Sbjct: 587 DGSLAEYDEKREMSGTVTCLSLGEVPPGRVRSSFLAVGC-DDSTVRILSLDPDSTLENKS 645
Query: 523 HLGGEIIPRSV-LLCAFEGIS-----YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQ 576
P ++ ++ + S YL L G L +L+ TGEL+D + LG +
Sbjct: 646 VQALTSAPSALSIMSMVDSTSGGSTLYLHIGLYSGIYLRTVLDEVTGELSDTRTRFLGLK 705
Query: 577 PITLRTFSSKNTTHVFAASDRPTVIYS--SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLA 634
P+ L + S K V A S RP + YS K + + ++ + F+S + +
Sbjct: 706 PVKLFSVSVKEQRAVLALSSRPWLGYSDLQTKNFMLTPLDYVPLEWSWNFSSEQCVEGMV 765
Query: 635 IAKEGELTIGTIDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHF 693
+ L I +I+ + L +IPL PR + F + N + ++
Sbjct: 766 GIQGQNLRIFSIEKLDNNLLQETIPLAYTPRHFVRHPEQPLFYVIEADNNILSPSTKAKL 825
Query: 694 ----------VRLLDDQTFEF----------------------ISTYPLDTFEYGCSILS 721
V L + F + IS L+ E S+ +
Sbjct: 826 LQDSKAANGEVAELPPEDFGYPRGTGHWASCIQVVDPINSKAVISRIELEENEAAVSVAA 885
Query: 722 CSFSD-DSNVYYCVGTA---YVLPEENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVYS 775
FS D + VGT V P + + G I ++ EDGK L+ I + + + ++
Sbjct: 886 VPFSSQDDETFLVVGTGKDMVVYPPSS--SCGFIHIYRFQEDGKELEFIHKTKVESPPHA 943
Query: 776 LNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSEC-GHHGHILALYVQTRGDFIVVGDLM 834
L AF G+LLA I + +++Y D G ++L +C L + +QT+G I+V D+
Sbjct: 944 LLAFQGRLLAGIGRNLRIY-----DLGMKQLLRKCQAEVVPRLIVGLQTQGSRIIVSDVQ 998
Query: 835 KSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGA 891
+S++ ++YK++E + A D A W + ++D + G + NL+ +R K SE A
Sbjct: 999 ESVTYVVYKYQENRLIPFADDVIARWTTCTAMVDYETVAGGDKFGNLWLLRCPQKASEEA 1058
Query: 892 TDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGT------------- 938
++ G + +L NR SL++ D IPT I T
Sbjct: 1059 DEDGSGAHLIHERQYLQGAPNRL---SLMVHFYPQD---IPTSIQKTQLVAGGRDILVWT 1112
Query: 939 -VNGVIGVIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLD 996
+ G +G++ + E F + L+ L + G +H +RS+ AK +D
Sbjct: 1113 GLQGTVGMLVPFVSREDVDFFQSLEMQLTSQTPPLAGRDHLIYRSY-----YAPAKGTID 1167
Query: 997 GDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
GDL E++ L + I+ ++ SV E+ +++ ++
Sbjct: 1168 GDLCETYFTLPNDKKLMIAGELDRSVREIERKISDM 1203
>gi|154277742|ref|XP_001539706.1| hypothetical protein HCAG_05173 [Ajellomyces capsulatus NAm1]
gi|150413291|gb|EDN08674.1| hypothetical protein HCAG_05173 [Ajellomyces capsulatus NAm1]
Length = 1233
Score = 203 bits (517), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 271/1173 (23%), Positives = 483/1173 (41%), Gaps = 185/1173 (15%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
++G I L FR G ++D++ I ++ + ++++ S R + + G R
Sbjct: 64 VFGIIRALAAFRLAGSSKDYIIIGSDSGRITIIEY-VPSQNRFNRIHLETFGKSGIRRVI 122
Query: 65 NGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YG 119
GQ +DP R LI + L V+ + + +L + + + Q L G
Sbjct: 123 PGQYLAVDPKGRACLIASVEKNKLVYVLNRNTQAELTISSPLEAHKPQTLVFALTALDVG 182
Query: 120 CAKPTIVVLYQDNKDA-------------RHVKTYEVALKDKDFVEGPWSQNNLDNGADL 166
P L + +A +H+ YE+ L V WS + +D A +
Sbjct: 183 YDNPIFAALEVEYTEADQDPTGQAYEELEKHLVYYELDLGLNHVVRK-WS-DPVDRSASM 240
Query: 167 LIPVP-----PPLCGVLIIGEETIVYCSAN--AFKA-IPIRPSITKAYGRVDADGSRYLL 218
L VP P G L+ E+ I Y +N AF+ IP R T+ + + RY+
Sbjct: 241 LFQVPGGADGP--SGCLVCAEDNITYRHSNQDAFRVPIPRRSGPTE-----NPERKRYIT 293
Query: 219 GDHAGLLH---------------------LLVITHEKEKVTG----LKIELLGETSIAST 253
AG++H L ++ + ++TG LK++ IAS+
Sbjct: 294 ---AGVMHKMRGAFFFLLQTEDGDLFKVTLDMVEDDNGQITGEVSRLKLKYFDTVPIASS 350
Query: 254 ISYLDNAVVYIGSSYGDSQLIKL-------------------NLQPDAKGSY-------- 286
+ L + +++ S G+ + +L +Y
Sbjct: 351 LCILKSGFLFVASETGNHHFYQFEKLGDDDDEIEFTSDAYSADLSEPLPPAYFRPRPYEN 410
Query: 287 VEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG 346
V ++E +L P++D + +L Q+ T G S R +++G+ + E EL
Sbjct: 411 VNLVESIDSLNPLMDCKIANLTDDDAPQIYTICGTGARSSFRTLKHGLEVAEIVESELPS 470
Query: 347 I-KGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAI 405
+ +W+ + + +D FD ++++SF + T +L++ + +EE GF S TL
Sbjct: 471 VPSAVWTTKLTREDEFDAYIILSFSNGTLVLSIG--ETVEEVTDTGFLSSAPTLAVQQLG 528
Query: 406 YNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG- 464
+ L+QV +R + + R NEW +P S+ AT N QV +A G +VY E+
Sbjct: 529 EDSLIQVHPKGIRHIHADRR--VNEWPAPQHRSIVAATTNERQVAIALSSGEIVYFEMDT 586
Query: 465 DGILTEVKHAQ-LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKE 522
DG L E + + ++CL + + + S AVG D +VRI SL P+ L K
Sbjct: 587 DGSLAEYDEKRAMSGTVTCLSLGEVPRGRARSSFLAVGC-DDSTVRILSLDPESTLENKS 645
Query: 523 HLGGEIIPRSVLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQ 576
P ++ + A + YL L G L +L+ TGEL+D + LG +
Sbjct: 646 VQALTSAPSALSIMAMSDSTSGGSTLYLHIGLYSGIYLRTVLDEVTGELSDTRTRFLGLK 705
Query: 577 PITLRTFSSKNTTHVFAASDRPTVIYS--SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLA 634
P+ L S K T V A S RP + YS K + + ++ + F+S + +
Sbjct: 706 PVKLFQVSVKEQTVVLALSSRPWLGYSHLQTKAFMLTPLDYVSLEWGWNFSSEQCVEGMV 765
Query: 635 IAKEGELTIGTIDDIQ-KLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHF 693
+ L I +++ ++ L +IPL PR + F + +N + +
Sbjct: 766 GIQGQNLRIFSLEKLENNLLQETIPLQYTPRHFIKHPEYPLFYVIEAENNILSPGTR--- 822
Query: 694 VRLLDDQTFEFISTYPLDTFEYG-----------------------------------CS 718
+LL+D T PL E+G S
Sbjct: 823 TKLLNDSDTVNGDTTPLPPEEFGYPRGTGHWASCIQIVDPINSKRVISQIELEENEAAVS 882
Query: 719 ILSCSFSD-DSNVYYCVGTAY-VLPEENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVY 774
+ + FS D + VGT ++ T G I ++ E+GK L+ I + +
Sbjct: 883 VAAVPFSSQDDETFLVVGTGKDMVVNPRSCTAGFIHIYRFQEEGKELEFIHKTMVEQPPM 942
Query: 775 SLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLM 834
+L F G+LLA I +++Y ++ R+ Q+ H L + +QT+G I+V D+
Sbjct: 943 ALLGFQGRLLAGIGTDLRIYDLGMK-QMLRKCQASVVPH---LVVGLQTQGSRIIVSDVQ 998
Query: 835 KSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGA 891
+S++ ++YK++E + D + W + ++D + G + NL+ +R K SE A
Sbjct: 999 ESLTYVVYKYQENRLIPFVDDVISRWTTCTTMVDYETVAGGDKFGNLWLLRCPAKASEEA 1058
Query: 892 TDEERG---------------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIF 936
++ G RL +V ++ + + LV D V T +
Sbjct: 1059 DEDGSGAHLIHERQYLQGAPNRLNLVAHFYPQDLPTSIQKAQLVTGGRDILVW---TGLQ 1115
Query: 937 GTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLD 996
GTV+ +I I+ E+ F + L+ L + G +H +RS+ AK +D
Sbjct: 1116 GTVSMLIPFIS---REEVDFFQSLEMQLAAQNPPLAGRDHLIYRSY-----YAPAKGTID 1167
Query: 997 GDLIESFLDLSRTRMDEISKTMNVSVEELCKRV 1029
GDL E++L L + +I+ ++ SV E+ +++
Sbjct: 1168 GDLCETYLLLPNDKKQQIAGELDRSVREIERKI 1200
>gi|295666353|ref|XP_002793727.1| pre-mRNA-splicing factor rse1 [Paracoccidioides sp. 'lutzii' Pb01]
gi|226278021|gb|EEH33587.1| pre-mRNA-splicing factor rse1 [Paracoccidioides sp. 'lutzii' Pb01]
Length = 1209
Score = 203 bits (517), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 277/1164 (23%), Positives = 488/1164 (41%), Gaps = 161/1164 (13%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
++G I TL FR G ++D++ I ++ + ++++ S R + + G R
Sbjct: 64 VFGIIRTLAAFRLAGGSKDYIIIGSDSGRITIIEY-VPSQNRFNRIHLETFGKSGIRRVI 122
Query: 65 NGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YG 119
GQ +DP R LI + L V+ + + +L + + + Q L G
Sbjct: 123 PGQYLAVDPKGRACLIASVEKNKLVYVLNRNTQAELTISSPLEAHKPQTLVFALTALDVG 182
Query: 120 CAKPTIVVLYQDNKDARHVKT-------------YEVALKDKDFVEGPWSQNNLDNGADL 166
P L + +A T YE+ L V WS + +D A +
Sbjct: 183 YDNPIFAALEVEYTEADQDPTGQAYEELEKQLVYYELDLGLNHVVRK-WS-DPVDRSATM 240
Query: 167 LIPVPPPL---CGVLIIGEETIVYCSAN--AFKA-IPIR--PSITKAYGRVDADGSR--- 215
L VP G L+ E+ I Y +N AF+ IP R P+ R G
Sbjct: 241 LFQVPGGADGPSGCLVCAEDNITYRHSNQDAFRVPIPRRSGPTENPERKRCIVAGVMHKM 300
Query: 216 -----YLLGDHAGLLHLLVITHEKE---KVTG----LKIELLGETSIASTISYLDNAVVY 263
+LL G L + I ++ ++TG LK++ IAS++ L + ++
Sbjct: 301 RGAFFFLLQTEDGDLFKVTIDMVEDDNGQITGEVRRLKLKYFDTVPIASSLCILKSGFLF 360
Query: 264 IGSSYGDS---QLIKLNLQPD----AKGSY--------------------VEVLERYVNL 296
+ S G+ Q KL D S+ V ++E +L
Sbjct: 361 VASETGNHHFYQFEKLGDDDDEIEFTSDSFSANPSEPLPPVYFHPRLSENVHLVESVNSL 420
Query: 297 GPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSLRS 355
P++D + +L Q+ T G S R +++G+ + E EL + +W+ +
Sbjct: 421 NPLMDCKITNLTEDDAPQIYTICGTGARSSFRTLKHGLEVTEIVESELPSVPSAVWTSKL 480
Query: 356 STDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSG 415
+ +D FD ++++SF + T +L++ + +EE GF S TL + L+QV
Sbjct: 481 TREDEFDAYIILSFSNGTLVLSIG--ETVEEVTDTGFLSSAPTLAVQQLGEDSLIQVHPK 538
Query: 416 SVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTEV-KH 473
+R + + R NEW +P S+ AT N QV +A G +VY E+ DG L E +
Sbjct: 539 GIRHIHADRR--VNEWPAPQHRSIVAATTNERQVAVALSSGEIVYFEMDTDGSLAEYDEK 596
Query: 474 AQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPRS 532
++ ++CL + + + + S AVG D +VRI SL PD L K P +
Sbjct: 597 REMSGTVTCLSLGEVPKGRARSSFLAVGC-DDSTVRILSLDPDSTLGNKSVQALTSAPSA 655
Query: 533 VLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSK 586
+ + A + YL L G L +L+ TGEL+D + LG +P+ L S K
Sbjct: 656 LSIMAMADSTSGGSTLYLHIGLYSGIYLRTVLDEVTGELSDTRTRFLGLKPVKLFQVSVK 715
Query: 587 NTTHVFAASDRPTVIYS--SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIG 644
V A S RP + YS K + + ++ + F+S + + + L I
Sbjct: 716 EQKAVLALSSRPWLGYSDLQTKAFMLTPLDYVGLEWGWNFSSEQCVEGMVGIQGQNLRIF 775
Query: 645 TIDDIQ-KLHIRSIPLGEHPRR-ICHQEQSRTFAI------------CSLKNQSCAEESE 690
+I+ ++ L +I L PR+ I H EQ + I L N S A +
Sbjct: 776 SIEKLENNLLQETISLTYTPRQFIKHPEQPLFYVIEADNNILSPATRAKLLNDSDAVNGD 835
Query: 691 M---------------HF---VRLLDD-QTFEFISTYPLDTFEYGCSILSCSFSD-DSNV 730
H+ ++++D + +S L+ E S+ + SFS D
Sbjct: 836 ATVLPSEDFGYPRGTGHWASCIQIVDPVNSKSVVSQIELEENEAAVSVAAVSFSSQDDET 895
Query: 731 YYCVGTAY-VLPEENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVYSLNAFNGKLLAAI 787
+ VGT ++ + G I ++ EDGK L+ I + + + +L F G+LLA I
Sbjct: 896 FLVVGTGKDMVVNPRSCSAGFIHIYRFQEDGKELEFIHKTKVEQPPVALLGFQGRLLAGI 955
Query: 788 NQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEG 847
+++Y +R R+ Q+ H L + +QT+G I+V D+ +S++ ++YK +E
Sbjct: 956 GTDVRIYDLGMR-QMLRKCQASVVPH---LVVGLQTQGSRIIVSDVQESVTYVVYKSQEN 1011
Query: 848 AIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDEERG------- 897
+ D + W + ++D + G + NL+ +R K SE A ++ G
Sbjct: 1012 RLIPFVDDVISRWTTCTTMVDYETVAGGDKFGNLWLLRCPAKASEEADEDGSGAHLIHER 1071
Query: 898 --------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGV-IAS 948
RL +V ++ + + LV G +++ + G +G+ I
Sbjct: 1072 QYLQGAPNRLNLVAHFYPQDLPTSIQKAQLV-------TGGRDILVWTGLQGTVGMLIPF 1124
Query: 949 LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSR 1008
+ E+ F + L+ L + G +H +RS+ AK +DGDL E++L L
Sbjct: 1125 ISREEVDFFQSLEMQLAAQNPPLAGRDHLIYRSY-----YAPAKGTIDGDLCETYLLLPN 1179
Query: 1009 TRMDEISKTMNVSVEELCKRVEEL 1032
+ +I+ ++ SV E+ +++ ++
Sbjct: 1180 DKKQQIAGELDRSVREIERKIADM 1203
>gi|197246825|gb|AAI68883.1| Sf3b3 protein [Rattus norvegicus]
Length = 908
Score = 203 bits (516), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 217/927 (23%), Positives = 401/927 (43%), Gaps = 150/927 (16%)
Query: 228 LVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLN---------- 277
+ + +++ VT ++++ +A+ + L +++ S +G+ L ++
Sbjct: 4 ITLETDEDMVTEIRLKYFDTVPVAAAMCVLKTGFLFVASEFGNHYLYQIAHLGDDDEEPE 63
Query: 278 --------------LQPDAKGSYVEVLERYVNLGPIVDFC-VVDLERQGQGQVVTCSGAY 322
QP + V +++ +L PI+ FC + DL + Q+ G
Sbjct: 64 FSSAMPLEEGDTFFFQPRPLKNLV-LVDELDSLSPIL-FCQIADLANEDTPQLYVACGRG 121
Query: 323 KDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLE 381
SLR++R+G+ ++E A EL G +W++R +D FD +++VSF++ T +L++
Sbjct: 122 PRSSLRVLRHGLEVSEMAVSELPGNPNAVWTVRRHIEDEFDAYIIVSFVNATLVLSIG-- 179
Query: 382 DELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNV 441
+ +EE GF T TL C + LVQV +R + + R NEWK+P ++
Sbjct: 180 ETVEEVTDSGFLGTTPTLSCSLLGDDALVQVYPDGIRHIRADKR--VNEWKTPGKKTIVK 237
Query: 442 ATANASQVLLATGGGHLVYLEIG-DGILTE-VKHAQLEYEISCLDINPIGENPSYSQIAA 499
N QV++A GG LVY E+ G L E + ++ ++ C+ + + S+ A
Sbjct: 238 CAVNQRQVVIALTGGELVYFEMDPSGQLNEYTERKEMSADVVCMSLANVPPGEQRSRFLA 297
Query: 500 VGMWTDISVRIFSLPDLNLITKEHLGGEIIP-RSVLLCAFE----------------GIS 542
VG+ D +VRI SL + + + L + +P + LC E G
Sbjct: 298 VGL-VDNTVRIISLDPSDCL--QPLSMQALPAQPESLCIVEMGGTEKQDELGERGSIGFL 354
Query: 543 YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIY 602
YL L +G LL +L+ TG+L+D + LG++P+ L + V A S R + Y
Sbjct: 355 YLNIGLQNGVLLRTVLDPVTGDLSDTRTRYLGSRPVKLFRVRMQGQEAVLAMSSRSWLSY 414
Query: 603 SSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIR-SIPLGE 661
S + + ++ + + F S P+ + L I ++ + + + + PL
Sbjct: 415 SYQSRFHLTPLSYETLEFASGFASEQCPEGIVAISTNTLRILALEKLGAVFNQVAFPLQY 474
Query: 662 HPRR-ICHQEQSR-----------TFAICSLKNQSCAEE-------------SEM----- 691
PR+ + H E + T A + + Q AEE +EM
Sbjct: 475 TPRKFVIHPESNNLIIIETDHNAYTEATKAQRKQQMAEEMVEAAGEDERELAAEMAAAFL 534
Query: 692 --------------------HFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVY 731
+R+++ + L+ E S+ C FS+ +
Sbjct: 535 NENLPESIFGAPKAGNGQWASVIRVMNPIQGNTLDLVQLEQNEAAFSVAVCRFSNTGEDW 594
Query: 732 YC-VGTAYVLPEENEPTKGRILVF--IVEDG-KLQLIAEKETKGAVYSLNAFNGKLLAAI 787
Y VG A L G + +V +G KL+ + + + ++ F G++L +
Sbjct: 595 YVLVGVAKDLILSPRSVAGGFVYTYKLVNNGEKLEFLHKTPVEEVPAAIAPFQGRVLIGV 654
Query: 788 NQKIQLYKWMLRDDGTRELQSECGHHGHILALY---VQTRGDFIVVGDLMKSISLLIYKH 844
+ +++Y D G ++L +C + HI A Y +QT G ++V D+ +S + YK
Sbjct: 655 GKLLRVY-----DLGKKKLLRKC-ENKHI-ANYISGIQTIGHRVIVSDVQESFIWVRYKR 707
Query: 845 EEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR--KNSEGATDE-------- 894
E + A D W++ +LD D GA+ N+ VR N+ DE
Sbjct: 708 NENQLIIFADDTYPRWVTTASLLDYDTVAGADKFGNICVVRLPPNTNDEVDEDPTGNKAL 767
Query: 895 -ERGRL-------EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVI 946
+RG L EV+ YH+GE V + +L+ G ++++ T++G IG++
Sbjct: 768 WDRGLLNGASQKAEVIMNYHVGETVLSLQKTTLI-------PGGSESLVYTTLSGGIGIL 820
Query: 947 ASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLD 1005
HE + F + ++ +LR + G +H +RS+ KN +DGDL E F
Sbjct: 821 VPFTSHEDHDFFQHVEMHLRSEHPPLCGRDHLSFRSY-----YFPVKNVIDGDLCEQFNS 875
Query: 1006 LSRTRMDEISKTMNVSVEELCKRVEEL 1032
+ + +S+ ++ + E+ K++E++
Sbjct: 876 MEPNKQKNVSEELDRTPPEVSKKLEDI 902
>gi|119173562|ref|XP_001239205.1| hypothetical protein CIMG_10227 [Coccidioides immitis RS]
Length = 1208
Score = 203 bits (516), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 275/1180 (23%), Positives = 483/1180 (40%), Gaps = 185/1180 (15%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
++G I +L FR G ++D++ I ++ + ++++ S R + + G R
Sbjct: 64 VFGIIRSLAAFRLAGSSKDYIIIGSDSGRIAIVEY-VPSQNRFNRIHLETFGKSGIRRVV 122
Query: 65 NGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YG 119
GQ +DP R LI + L V+ + + +L + + Q L G
Sbjct: 123 PGQYLAVDPKGRACLIASVEKNKLVYVLNRNAQAELTISSPLEAHRPQTLVFALTALDVG 182
Query: 120 CAKPTIVVLYQDNKDARHVKT-------------YEVALKDKDFVEGPWSQNNLDNGADL 166
P L D ++ T YE+ L V W+ + +D A +
Sbjct: 183 YENPIFAALEVDYTESDQDPTGAAYQEAEKLLVYYELDLGLNHVVRR-WA-DPVDRSAAM 240
Query: 167 LIPVP-----PPLCGVLIIGEETIVYCSAN--AFKA-IPIRPSITKAYGRVDADGSRYLL 218
L VP P GVL+ GE I Y +N AF+ IP R T+ + + RY+
Sbjct: 241 LFQVPGGADGP--SGVLVCGEGNITYRHSNQDAFRVPIPRRSGPTE-----NPERKRYIT 293
Query: 219 GDHAGLLHLL---------------------VITHEKEKVTG----LKIELLGETSIAST 253
AG++H + ++ E ++TG LK++ +AS+
Sbjct: 294 ---AGVVHKMRRAFFCLLQTEDGDLFKVTMDMVEDENGQLTGEVQRLKLKYFDTVPVASS 350
Query: 254 ISYLDNAVVYIGSSYGDSQLIKLN-----------------------LQP----DAKGSY 286
+ L N +++ S G+ + L P
Sbjct: 351 LCILKNGFLFVASETGNHHFYQFEKLGDDDEETEFSSDDFSADPSEPLAPVYFRPRPAEN 410
Query: 287 VEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG 346
+ ++E +L P++ + +L Q T SG + R +++G+ ++E EL
Sbjct: 411 LNLVESINSLNPLMSCKITNLTEDDAPQFYTLSGTGARSTFRTLKHGLEVSEIVESELPS 470
Query: 347 I-KGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAI 405
+ +W+ + + +D +D ++++SF + T +L++ + +EE GF S TL
Sbjct: 471 VPSAVWTTKLTRNDQYDAYIILSFSNGTLVLSIG--ETVEEVTDTGFLSSAPTLAVQQLG 528
Query: 406 YNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG- 464
+ L+QV +R + + R NEW +P S+ A N QV +A G +VY E+
Sbjct: 529 EDSLIQVHPKGIRHIHADRR--VNEWPAPQHRSIVAAATNERQVAVALSSGEIVYFEMDT 586
Query: 465 DGILTEV-KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKE 522
DG L E + ++ ++CL + + S AVG D +VRI SL PD L K
Sbjct: 587 DGSLAEYDEKREMSGTVTCLSLGEVPPGRVRSSFLAVGC-DDSTVRILSLDPDSTLENKS 645
Query: 523 HLGGEIIPRSV-LLCAFEGIS-----YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQ 576
P ++ ++ + S YL L G L +L+ TGEL+D + LG +
Sbjct: 646 VQALTSAPSALSIMSMVDSTSGGSTLYLHIGLYSGIYLRTVLDEVTGELSDTRTRFLGLK 705
Query: 577 PITLRTFSSKNTTHVFAASDRPTVIYS--SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLA 634
P+ L + S K V A S RP + YS K + + ++ + F+S + +
Sbjct: 706 PVKLFSVSVKEQRAVLALSSRPWLGYSDLQTKNFMLTPLDYVPLEWSWNFSSEQCVEGMV 765
Query: 635 IAKEGELTIGTIDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHF 693
+ L I +I+ + L +IPL PR + F + N + ++
Sbjct: 766 GIQGQNLRIFSIEKLDNNLLQETIPLAYTPRHFVRHPEQPLFYVIEADNNILSPSTKAKL 825
Query: 694 ----------VRLLDDQTFEF----------------------ISTYPLDTFEYGCSILS 721
V L + F + IS L+ E S+ +
Sbjct: 826 LQDSKAANGEVAELPPEDFGYPRGTGHWASCIQVVDPINSKAVISRIELEENEAAVSVAA 885
Query: 722 CSFSD-DSNVYYCVGTA---YVLPEENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVYS 775
FS D + VGT V P + + G I ++ EDGK L+ I + + + ++
Sbjct: 886 VPFSSQDDETFLVVGTGKDMVVYPPSS--SCGFIHIYRFQEDGKELEFIHKTKVESPPHA 943
Query: 776 LNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSEC-GHHGHILALYVQTRGDFIVVGDLM 834
L AF G+LLA I + +++Y D G ++L +C L + +QT+G I+V D+
Sbjct: 944 LLAFQGRLLAGIGRNLRIY-----DLGMKQLLRKCQAEVVPRLIVGLQTQGSRIIVSDVQ 998
Query: 835 KSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGA 891
+S++ ++YK++E + A D A W + ++D + G + NL+ +R K SE A
Sbjct: 999 ESVTYVVYKYQENRLIPFADDVIARWTTCTAMVDYETVAGGDKFGNLWLLRCPQKASEEA 1058
Query: 892 TDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGT------------- 938
++ G + +L NR SL++ D IPT I T
Sbjct: 1059 DEDGSGAHLIHERQYLQGAPNRL---SLMVHFYPQD---IPTSIQKTQLVAGGRDILVWT 1112
Query: 939 -VNGVIGVIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLD 996
+ G +G++ + E F + L+ L + G +H +RS+ AK +D
Sbjct: 1113 GLQGTVGMLVPFVSREDVDFFQSLEMQLTSQTPPLAGRDHLIYRSY-----YAPAKGTID 1167
Query: 997 GDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRLH 1036
GDL E++ L + I+ ++ SV E+ +++ LH
Sbjct: 1168 GDLCETYFTLPNDKKLMIAGELDRSVREIERKISVSPLLH 1207
>gi|361131929|gb|EHL03544.1| putative Pre-mRNA-splicing factor rse1 [Glarea lozoyensis 74030]
Length = 967
Score = 203 bits (516), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 234/957 (24%), Positives = 412/957 (43%), Gaps = 115/957 (12%)
Query: 155 WSQNNLDNGADLLIPVPPPL---CGVLIIGEETIVYCSAN--AFK-AIPIRPSITKAYGR 208
W+ + +D A +L VP GVL+ G++ I Y +N AF+ AIP R T+ R
Sbjct: 41 WA-DAVDRTASVLFQVPGGTDGPSGVLVCGDDNITYRHSNQEAFRVAIPRRRGATEDPQR 99
Query: 209 VDA--DGSRYLLGDHAGLLHLLVITHEKE------------------KVTGLKIELLGET 248
+ G + L AG L+ T + + +V LKI+
Sbjct: 100 KRSIVAGVMHKLKGAAGAFFFLLQTEDGDLFKITIEMVEDDQGQPTGEVKRLKIKYFDTV 159
Query: 249 SIASTISYLDNAVVYIGSSYGDSQLIKL---------------NLQPDAKGSYVEV---- 289
+AS++ L + +++ S +GD Q + + D Y V
Sbjct: 160 PVASSLCILKSGFLFVASEFGDHQFYQFEKLGDDDEETEFVSDDFPTDPTEPYTPVYFHP 219
Query: 290 --------LERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQAS 341
+E ++ P++D V +L Q+ + G + R +++G+ ++E
Sbjct: 220 RQAENLSLVESIDSMNPLMDCKVANLTNADAPQIYSICGTGARSTFRSLKHGLEVSEIVE 279
Query: 342 VELQGI-KGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLF 400
EL G+ +W+ + + D +D ++++SF + T +L++ + +EE GF S TL
Sbjct: 280 SELPGVPSAVWTTKLTQGDTYDAYIILSFSNGTLVLSIG--ETVEEVTDTGFLSTASTLA 337
Query: 401 CHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVY 460
+ L+QV +R + + R NEW +P S+ AT N QV +A G +VY
Sbjct: 338 VQQLGEDGLIQVHPKGIRHIRADRR--VNEWAAPQHRSIVAATTNERQVAVALSSGEIVY 395
Query: 461 LEI-GDGILTEV-KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLN 517
E+ DG L E + ++ ++CL + + E S AVG D +VRI SL PD
Sbjct: 396 FEMDSDGSLAEYDEKKEMSGTVTCLSLGDVPEGRVRSPYLAVGC-DDSTVRILSLDPDST 454
Query: 518 LITKEHLGGEIIPRSVLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKV 571
L K P ++ + A S YL L G L +L+ TGELTD +
Sbjct: 455 LENKSVQALTSAPSALSIMAMADSSSGGSTLYLHIGLYSGVYLRTVLDEVTGELTDTRLR 514
Query: 572 SLGTQPITLRTFSSKNTTHVFAASDRPTVIYSS--NKKLLYSNVNLKEVSHMCPFNSAAF 629
LG +P L S + T V A S RP + Y+ K + + +N + F+S
Sbjct: 515 FLGPKPAKLFKVSVQGQTAVLALSSRPWLGYTDPVTKAFMLTPLNYPALEWGWNFSSEQC 574
Query: 630 PDSLAIAKEGELTIGTIDDI-QKLHIRSIPLGEHPRR-ICHQEQSRTFAICSLKNQSCAE 687
+ + + L I +I+ + L SIPL PRR + H EQ + I S N
Sbjct: 575 TEGMVGIQGQNLRIFSIEKLTDNLLQESIPLTYTPRRFVRHPEQPLFYTIESDNNVLSPA 634
Query: 688 ESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPT 747
+ +LL+D + L E+G ++ C+ + T
Sbjct: 635 TKQ----KLLEDPSVVNGDAAVLPPEEFGYPRGRGHWAS------CISVV------DPVT 678
Query: 748 KGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQ 807
+ ++L I +G + E +L F G+LLA + + +++Y ++ R+ Q
Sbjct: 679 EKKVLHTIHFEGNEAAVEEPPM-----ALLGFQGRLLAGVGKDLRIYDLGMK-QLLRKSQ 732
Query: 808 SECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEIL 867
+E + L + ++T+G I+V D+ +S+ +++YK +E + D A W S ++
Sbjct: 733 AEVVPN---LIVGLRTQGSRIIVSDVQESVVMVVYKFQENRLIPFVDDTIARWTSCSTMV 789
Query: 868 DDDIYLGAENNFNLFTVR---KNSEGATDEERG------RLEVVGEYHLGEFVNRFRHGS 918
D + G + NL+ +R K SE A +E G R + G H + F
Sbjct: 790 DYETVAGGDKFGNLWLLRCPAKASEEADEEGSGAHLLHERQYLQGAPHRLTLMAHFYSQD 849
Query: 919 LVMRLPDSD--VGQIPTVIFGTVNGVIGV-IASLPHEQYLFLEKLQTNLRKVIKGVGGLN 975
+ M + ++ VG +++ + G +G+ I + E F + L+ ++R + G +
Sbjct: 850 IPMSIQKTNLVVGGPDCLLWAGLQGTLGILIPFVGREDVDFFQTLEQHMRNEDAPLAGRD 909
Query: 976 HEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
H +RS+ V K +DGDL E + L + I+ ++ SV E+ +++ ++
Sbjct: 910 HLIYRSY-----YVPVKGMIDGDLCERYTLLPTDKKQMIAGELDRSVREIERKISDI 961
>gi|402077250|gb|EJT72599.1| pre-mRNA-splicing factor RSE1 [Gaeumannomyces graminis var. tritici
R3-111a-1]
Length = 1216
Score = 203 bits (516), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 277/1176 (23%), Positives = 487/1176 (41%), Gaps = 178/1176 (15%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
++G I L FR G +D++ IA++ + ++++ + + R
Sbjct: 64 VFGIIRDLASFRLAGSNKDYIIIASDSGRITIIEYLPAKNRFHRIHLETFGKSGVRRVIP 123
Query: 66 GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YGC 120
GQ DP R LI L V+ +++ +L + + + L + + G
Sbjct: 124 GQYLAADPKGRACLIASVERCKLVYVLNRNSQAELTISSPLEAHKNGTLTLSLVALDVGY 183
Query: 121 AKPTIVVLYQDNKDA---------RHVKT----YEVALKDKDFVEGPWSQNNLDNGADLL 167
+ P L D DA ++V+T YE+ L V W + +D A++L
Sbjct: 184 SNPVFAALEIDYTDADQDSKSDATQNVETVLNYYELDLGLNHVVRK-WF-DVVDPTANML 241
Query: 168 IPVP-----PPLCGVLIIGEETIVYCSAN--AFKA-IPIRPSITKAYGRVD--ADGSRYL 217
VP P GVL+ G+E I Y +N AF+ IP R T+ R G +
Sbjct: 242 FQVPGGSDGP--SGVLVCGQENITYRHSNQDAFRVPIPRRKGATEDPNRKRNIVSGVMHK 299
Query: 218 LGDHAGLLHLLVITHEKE------------------KVTGLKIELLGETSIASTISYLDN 259
L AG L+ T + + +V LKI+ ++S + L +
Sbjct: 300 LKGSAGAFFFLLQTEDGDLFKVTIDMLEDAEGNTTGEVQRLKIKYFDTIPVSSNLCILKS 359
Query: 260 AVVYIGSSYGDSQLIKL---------------NLQPDAKGSY------------VEVLER 292
+++ S +G+ + N D Y + ++E
Sbjct: 360 GFLFVASEFGNHHFYQFEKLGDDDEELEFSSENFPSDPAEPYEPAYFYPRPTENLALVES 419
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMW 351
++ P++D V +L + Q+ T SG + R++++G+ +NE + +L G +W
Sbjct: 420 VESMNPLMDLKVANLTDEDAPQIYTVSGNGARSTFRMLKHGLEVNEIVASQLPGTPSAVW 479
Query: 352 SLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQ 411
+ + + DD +D+++V+SF + T +L++ + +EE GF S TL + LVQ
Sbjct: 480 TTKIARDDQYDSYIVLSFTNGTLVLSIG--ETVEEVSDTGFLSSVSTLAVQQLGEDGLVQ 537
Query: 412 VTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE 470
V +R + S + NEW +P S+ A N QV +A G +VY E+ DG L E
Sbjct: 538 VHPRGIRHIRSG---VVNEWPTPQHRSIVAAATNERQVAVALSSGEIVYFEMDTDGSLAE 594
Query: 471 V-KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEI 528
+ ++ ++ L + + E S AVG D +VRI SL P+ L +K
Sbjct: 595 YDERKEMSGTVTSLSLGEVPEGRLRSSFLAVGC-DDCTVRILSLDPETTLESKSVQALTA 653
Query: 529 IPRSVLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRT 582
P ++ + A E S YL L G L +L+ TGELTD ++ LG + + L
Sbjct: 654 APSALSIMAMEDSSSGGTTLYLHIGLNSGVYLRTVLDEVTGELTDTRQKFLGPKAVRLFQ 713
Query: 583 FSSKNTTHVFAASDRPTVIYSS--NKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGE 640
S + T V A S R + YS +K + ++ +E+ F S + + + +G+
Sbjct: 714 VSVQGMTCVLALSSRSWLGYSDPISKGFTMTPLSYEELEWAWNFRSEQCEEGM-VGVQGQ 772
Query: 641 L----TIGTIDD--IQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFV 694
IG + D IQK SI L P+++ F +N + E +
Sbjct: 773 FLRICAIGKLGDSMIQK----SISLAYTPKKLIKNPTHPIFYTIEAENNTLPPELREQLL 828
Query: 695 ----------RLLDDQTFEF---------------------------ISTYPLDTFEYGC 717
++L + F + + L+ E
Sbjct: 829 AAPTAVNGDTKVLPPEEFGYPRGNGRWASCISVVDPLGDGEEREPSVLQQIHLENNEATV 888
Query: 718 SILSCSF-SDDSNVYYCVGTAY-VLPEENEPTKGRILVF-IVEDGK-LQLIAEKETKGAV 773
S+ F S DS + VGT ++ T+G I V+ +EDG+ L+ I + + +
Sbjct: 889 SVAVVPFASQDSESFLVVGTGKDMVLNPRCFTEGYIHVYRFLEDGRELEFIHKTKVEEPP 948
Query: 774 YSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDL 833
+L AF GKL+A + + +++Y LR R+ QSE + + +QT+G IVVGD
Sbjct: 949 MALLAFQGKLVAGVGRSLRIYDLGLRQL-LRKAQSEVAPR---VIVSLQTQGSRIVVGDS 1004
Query: 834 MKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATD 893
+ + YK E + A D W + ++D + G + N++ +R + + +
Sbjct: 1005 QHGLIYVAYKQEANKLIAFADDSIQRWTTCSTMVDYESTAGGDKFGNIWILRCPEKASQE 1064
Query: 894 EERGRLEVVGEYHLGEFVNRFRHGS-----LVMRLPDSD-----------VGQIPTVIFG 937
++ EV HL + HG+ L+ + D VG +++G
Sbjct: 1065 ADQPGSEV----HLMH-ARDYLHGTSNRLALMAHVYTQDIATSICKTNLVVGGQEVLLWG 1119
Query: 938 TVNGVIGV-IASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLD 996
+ G IGV I + E F + L+ ++R + G +H +RS+ V K +D
Sbjct: 1120 GIQGTIGVLIPFVSREDADFFQTLEQHMRSEDPPLAGRDHLMYRSY-----YVPVKGVID 1174
Query: 997 GDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
GDL E F L + I+ ++ SV E+ +++ ++
Sbjct: 1175 GDLCERFTMLPNDKKQMIAGELDRSVREIERKISDI 1210
>gi|195586770|ref|XP_002083143.1| GD13507 [Drosophila simulans]
gi|194195152|gb|EDX08728.1| GD13507 [Drosophila simulans]
Length = 1227
Score = 203 bits (516), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 261/1199 (21%), Positives = 471/1199 (39%), Gaps = 197/1199 (16%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+L I+G + L FR G +D++ + ++ + +L+++ + L
Sbjct: 53 LLSTEIFGCVRALMAFRLTGGTKDYIVVGSDSGRIVILEYNPSKNALEKVHQETFGKSGC 112
Query: 61 RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL- 117
R GQ IDP R +IG L ++ D + +L + + + L +
Sbjct: 113 RRIVPGQYFAIDPKGRAVMIGAVEKQKLAYIMNRDTQARLTISSPLEAHKSNTLTYHMVG 172
Query: 118 --YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNGA 164
G P L D ++A + + A + + F E N+ L+ A
Sbjct: 173 VDVGFDNPMFACLEIDYEEADMDPSGDAAQRTQQTLTFYELDLGLNHVVRKYSEPLEEHA 232
Query: 165 DLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR---- 215
+ L+ VP P GVLI E + Y N IR I + +D D R
Sbjct: 233 NFLVSVPGGNDGP--SGVLICSENYLTY--KNLGDQHDIRCPIPRRRNDLD-DPERGMIF 287
Query: 216 -------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVV 262
+LL G + + + + + V+ +K++ A+ + L +
Sbjct: 288 ICSATHRTKSMYFFLLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPATAMCVLKTGFL 347
Query: 263 YIGSSYGDSQLIKLN------------------------LQPDAKGSYVEVLERYVNLGP 298
++ S +G+ L ++ P A + V +++ + P
Sbjct: 348 FVASEFGNHYLYQIAHLGDDDDEPEFSSAMPLEEGETFFFAPRALKNLV-LVDELPSFAP 406
Query: 299 IVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSST 357
I+ V DL + Q+ G +LR++R+G+ ++E A EL G +W+++
Sbjct: 407 IITSQVADLANEDTPQLYVLCGRGPRSTLRVLRHGLEVSEMAVSELPGNPNAVWTVKKRA 466
Query: 358 DDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSV 417
DD FD +++VSF++ T +L++ + +EE GF T TL C + LVQV +
Sbjct: 467 DDEFDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLCCAALGDDALVQVYPDGI 524
Query: 418 RLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKHAQ 475
R + S R NEWK+P S+ N QV++ G LVY E+ G L E + ++
Sbjct: 525 RHIRSDKR--VNEWKAPGKKSITKCAVNQRQVVITLSGRELVYFEMDPTGELNEYTERSE 582
Query: 476 LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLL 535
+ EI C+ + + E S AVG+ +D +VRI SL N +T + P L
Sbjct: 583 MPAEIMCMALGTVPEGEQRSWFLAVGL-SDNTVRILSLDPNNCLTPCSMQALPSPAES-L 640
Query: 536 CAFE-------------------------GISYLLCALGDGHLLNFLLNMKTGELTDRKK 570
C E G YL L +G LL +L+ +G+L D +
Sbjct: 641 CLVEMGHTESTTQGGLDDDAPAQRSGNNKGTIYLNIGLSNGVLLRTVLDPVSGDLADTRT 700
Query: 571 VSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFP 630
LG++P+ L + + V A S R + Y + + ++ + + + F+S
Sbjct: 701 RYLGSRPVKLFRIKMQGSEAVLAMSSRTWLSYYHQNRFHLTPLSYETLEYASGFSSEQCS 760
Query: 631 DSLAIAKEGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQEQSR-----------TFAI 677
+ + L I ++ + + + + PL PR + H + R T
Sbjct: 761 EGIVAISTNTLRILALEKLGAVFNQVAFPLQYTPRTFVIHPDTGRMLIAETDHNAYTEDT 820
Query: 678 CSLKNQSCAEE-------------SEM-------------------------HFVRLLDD 699
S + + AEE EM +R LD
Sbjct: 821 KSARKEQMAEEMRSAAGDEERELAREMANAFINEVLPEDVFSSPKAGLGLWASQIRCLDA 880
Query: 700 QTFEFISTYPLDTFE--YGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVE 757
+ + + PL E ++L S + D Y VG A L ++G +
Sbjct: 881 MHGQTMFSVPLTQNEAIMSMAMLKFSIAADGRYYLAVGIAKDLQLNPRISQGGCIDIYKI 940
Query: 758 D---GKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSEC-GHH 813
D L+ + E +L F G+LLA + +++Y D G +++ +C H
Sbjct: 941 DPTCSSLEFMHRTEIDEIPGALCGFQGRLLAGCGRMLRIY-----DFGKKKMLRKCENKH 995
Query: 814 GHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYL 873
+ +Q G + V D+ +S+ + Y+ E + A D + W++A +LD D +
Sbjct: 996 IPYQIVNIQAMGHRVYVSDVQESVFFIRYRRAENQLIIFADDTHPRWVTATTLLDYDT-I 1054
Query: 874 GAENNFNLFTVRKNSEGATDE------------ERGRL-------EVVGEYHLGEFVNRF 914
+ F ++++ TD+ +RG L E + +H+GE +
Sbjct: 1055 AIADKFGNLSIQRLPHSVTDDVDEDPTGTKSLWDRGLLSGASQKSENICSFHVGEIIMSL 1114
Query: 915 RHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGG 973
+ +L+ G +I+ T++G +G E Y F + L+ ++R + G
Sbjct: 1115 QKATLI-------PGGSEALIYATLSGTVGAFVPFTSREDYDFFQHLEMHMRNENPPLCG 1167
Query: 974 LNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
+H +RS KN LDGDL E +L + + I+ M + ++CK++E++
Sbjct: 1168 RDHLSYRS-----SYYPVKNVLDGDLCEQYLSIEAAKQKSIAGDMFRTPNQICKKLEDI 1221
>gi|194864680|ref|XP_001971056.1| GG14635 [Drosophila erecta]
gi|190652839|gb|EDV50082.1| GG14635 [Drosophila erecta]
Length = 1227
Score = 202 bits (515), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 262/1199 (21%), Positives = 469/1199 (39%), Gaps = 197/1199 (16%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+L I+G I L FR G +D++ + ++ + +L+++A + L
Sbjct: 53 LLSTEIFGCIRALMAFRLTGGTKDYIVVGSDSGRIVILEYNASKNALEKVHQETFGKSGC 112
Query: 61 RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL- 117
R GQ IDP R +IG L ++ D + +L + + + L +
Sbjct: 113 RRIVPGQYFAIDPKGRAVMIGAVEKQKLAYIMNRDTQARLTISSPLEAHKSNTLTYHMVG 172
Query: 118 --YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNGA 164
G P L D ++A + + A + + F E N+ L+ A
Sbjct: 173 VDVGFDNPMFACLEIDYEEADMDPSGDAAQRTQQTLTFYELDLGLNHVVRKYSEPLEEHA 232
Query: 165 DLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR---- 215
+ L+ VP P GVLI E + Y N IR I + +D D R
Sbjct: 233 NFLVSVPGGNDGP--SGVLICSENYLTY--KNLGDQHDIRCPIPRRRNDLD-DPERGMIF 287
Query: 216 -------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVV 262
+LL G + + + + + V+ +K++ A+ + L +
Sbjct: 288 ICSATHRTKSMYFFLLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPATAMCVLKTGFL 347
Query: 263 YIGSSYGDSQLIKLN------------------------LQPDAKGSYVEVLERYVNLGP 298
++ S +G+ L ++ P A + V +++ + P
Sbjct: 348 FVASEFGNHYLYQIAHLGDDDDEPEFSSAMPLEEGETFFFAPRALKNLV-LVDELPSFAP 406
Query: 299 IVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSST 357
I+ V DL + Q+ G +LR++R+G+ ++E A EL G +W+++
Sbjct: 407 IITSQVADLANEDTPQLYVLCGRGPRSTLRVLRHGLEVSEMAVSELPGNPNAVWTVKKRA 466
Query: 358 DDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSV 417
DD FD +++VSF++ T +L++ + +EE GF T TL C + LVQV +
Sbjct: 467 DDEFDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLCCAALGDDALVQVYPDGI 524
Query: 418 RLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKHAQ 475
R + S R NEWK+P S+ N QV++ G LVY E+ G L E + ++
Sbjct: 525 RHIRSDKR--VNEWKAPGKKSITKCAVNQRQVVITLSGRELVYFEMDPSGELNEYTERSE 582
Query: 476 LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLL 535
+ EI C+ + + + S AVG+ D +VRI SL N +T + P L
Sbjct: 583 MPAEIMCMALGTVPDGEQRSWFLAVGL-ADNTVRILSLDPNNCLTPCSMQALPSPAES-L 640
Query: 536 CAFE-------------------------GISYLLCALGDGHLLNFLLNMKTGELTDRKK 570
C E G YL L +G LL +L+ +G+L D +
Sbjct: 641 CLVEMGHTESTTQGALDDDAPAQRSGSNKGTIYLNIGLSNGVLLRTVLDPVSGDLADTRT 700
Query: 571 VSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFP 630
LG++P+ L + V A S R + Y + + ++ + + + F+S
Sbjct: 701 RYLGSRPVKLFRIKMQGAEAVLAMSSRTWLSYYHQNRFHLTPLSYETLEYASGFSSEQCS 760
Query: 631 DSLAIAKEGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQEQSR-----------TFAI 677
+ + L I ++ + + + + PL PR + H + R T
Sbjct: 761 EGIVAISTNTLRILALEKLGAVFNQVAFPLQYTPRTFVIHPDTGRMLIAETDHNAYTEDT 820
Query: 678 CSLKNQSCAEE-------------SEM-------------------------HFVRLLDD 699
S + + AEE EM +R LD
Sbjct: 821 KSARKEQMAEEMRSAAGDEERELAREMANAFINEVLPEDVFSSPKAGLGLWASQIRCLDA 880
Query: 700 QTFEFISTYPLDTFE--YGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVE 757
+ + PL E ++L S + D Y VG A L ++G +
Sbjct: 881 MHGQTMFNVPLTQNEAIMSMAMLKFSIAADGRYYLAVGIAKDLQLNPRISQGGCIDIYKI 940
Query: 758 D---GKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSEC-GHH 813
D L+ + E +L F G+LLA + +++Y D G +++ +C H
Sbjct: 941 DPTCSSLEFMHRTEIDEIPGALCGFQGRLLAGCGRMLRIY-----DFGKKKMLRKCENKH 995
Query: 814 GHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYL 873
+ +Q G + V D+ +S+ + Y+ E + A D + W++A +LD D +
Sbjct: 996 IPYQIVNIQAMGHRVYVSDVQESVFFIRYRRAENQLIIFADDTHPRWVTATTLLDYDT-I 1054
Query: 874 GAENNFNLFTVRKNSEGATDE------------ERGRL-------EVVGEYHLGEFVNRF 914
+ F ++++ TD+ +RG L E + +H+GE +
Sbjct: 1055 AIADKFGNLSIQRLPHSVTDDVDEDPTGTKSLWDRGLLSGASQKSENICSFHVGEIIMSL 1114
Query: 915 RHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGG 973
+ +L+ G +I+ T++G +G E Y F + L+ ++R + G
Sbjct: 1115 QKATLI-------PGGSEALIYATLSGTVGAFVPFTSREDYDFFQHLEMHMRNENPPLCG 1167
Query: 974 LNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
+H +RS KN LDGDL E +L + + I+ M + ++CK++E++
Sbjct: 1168 RDHLSYRS-----SYYPVKNVLDGDLCEQYLSIEAAKQKSIAGDMFRTPNQICKKLEDI 1221
>gi|195336406|ref|XP_002034829.1| GM14250 [Drosophila sechellia]
gi|194127922|gb|EDW49965.1| GM14250 [Drosophila sechellia]
Length = 1227
Score = 202 bits (514), Expect = 7e-49, Method: Compositional matrix adjust.
Identities = 261/1199 (21%), Positives = 470/1199 (39%), Gaps = 197/1199 (16%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+L I+G + L FR G +D++ + ++ + +L+++ + L
Sbjct: 53 LLSTEIFGCVRALMAFRLTGGTKDYIVVGSDSGRIVILEYNPSKNALEKVHQETFGKSGC 112
Query: 61 RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL- 117
R GQ IDP R +IG L ++ D + +L + + + L +
Sbjct: 113 RRIVPGQYFAIDPKGRAVMIGAVEKQKLAYIMNRDTQARLTISSPLEAHKSNTLTYHMVG 172
Query: 118 --YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNGA 164
G P L D ++A + + A + + F E N+ L+ A
Sbjct: 173 VDVGFDNPMFACLEIDYEEADMDPSGDAAQRTQQTLTFYELDLGLNHVVRKYSEPLEEHA 232
Query: 165 DLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR---- 215
+ L+ VP P GVLI E + Y N IR I + +D D R
Sbjct: 233 NFLVSVPGGNDGP--SGVLICSENYLTY--KNLGDQHDIRCPIPRRRNDLD-DPERGMIF 287
Query: 216 -------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVV 262
+LL G + + + + + V+ +K++ A+ + L +
Sbjct: 288 ICSATHRTKSMYFFLLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPATAMCVLKTGFL 347
Query: 263 YIGSSYGDSQLIKLN------------------------LQPDAKGSYVEVLERYVNLGP 298
++ S +G+ L ++ P A + V +++ + P
Sbjct: 348 FVASEFGNHYLYQIAHLGDDDDEPEFSSAMPLEEGETFFFAPRALKNLV-LVDELPSFAP 406
Query: 299 IVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSST 357
I+ V DL + Q+ G +LR++R+G+ ++E A EL G +W+++
Sbjct: 407 IITSQVADLANEDTPQLYVLCGRGPRSTLRVLRHGLEVSEMAVSELPGNPNAVWTVKKRA 466
Query: 358 DDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSV 417
DD FD +++VSF++ T +L++ + +EE GF T TL C + LVQV +
Sbjct: 467 DDEFDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLCCAALGDDALVQVYPDGI 524
Query: 418 RLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKHAQ 475
R + S R NEWK+P S+ N QV++ G LVY E+ G L E + ++
Sbjct: 525 RHIRSDKR--VNEWKAPGKKSITKCAVNQRQVVITLSGRELVYFEMDPTGELNEYTERSE 582
Query: 476 LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLL 535
+ EI C+ + + E S AVG+ D +VRI SL N +T + P L
Sbjct: 583 MPAEIMCMALGTVPEGEQRSWFLAVGL-ADNTVRILSLDPNNCLTPCSMQALPSPAES-L 640
Query: 536 CAFE-------------------------GISYLLCALGDGHLLNFLLNMKTGELTDRKK 570
C E G YL L +G LL +L+ +G+L D +
Sbjct: 641 CLVEMGHTESTTQGGLDDDAPAQRSGNNKGTIYLNIGLSNGVLLRTVLDPVSGDLADTRT 700
Query: 571 VSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFP 630
LG++P+ L + + V A S R + Y + + ++ + + + F+S
Sbjct: 701 RYLGSRPVKLFRIKMQGSEAVLAMSSRTWLSYYHQNRFHLTPLSYETLEYASGFSSEQCS 760
Query: 631 DSLAIAKEGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQEQSR-----------TFAI 677
+ + L I ++ + + + + PL PR + H + R T
Sbjct: 761 EGIVAISTNTLRILALEKLGAVFNQVAFPLQYTPRTFVIHPDTGRMLIAETDHNAYTEDT 820
Query: 678 CSLKNQSCAEE-------------SEM-------------------------HFVRLLDD 699
S + + AEE EM +R LD
Sbjct: 821 KSARKEQMAEEMRSAAGDEERELAREMANAFINEVLPEDVFSSPKAGLGLWASQIRCLDA 880
Query: 700 QTFEFISTYPLDTFE--YGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVE 757
+ + + PL E ++L S + D Y VG A L ++G +
Sbjct: 881 MHGQTMFSVPLTQNEAIMSMAMLKFSIAADGRYYLAVGIAKDLQLNPRISQGGCIDIYKI 940
Query: 758 D---GKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSEC-GHH 813
D L+ + E +L F G+LLA + +++Y D G +++ +C H
Sbjct: 941 DPTCSSLEFMHRTEIDEIPGALCGFQGRLLAGCGRMLRIY-----DFGKKKMLRKCENKH 995
Query: 814 GHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYL 873
+ +Q G + V D+ +S+ + Y+ E + A D + W++A +LD D +
Sbjct: 996 IPYQIVNIQAMGHRVYVSDVQESVFFIRYRRAENQLIIFADDTHPRWVTATTLLDYDT-I 1054
Query: 874 GAENNFNLFTVRKNSEGATDE------------ERGRL-------EVVGEYHLGEFVNRF 914
+ F ++++ TD+ +RG L E + +H+GE +
Sbjct: 1055 AIADKFGNLSIQRLPHSVTDDVDEDPTGTKSLWDRGLLSGASQKSENICSFHVGEIIMSL 1114
Query: 915 RHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGG 973
+ +L+ G +I+ T++G +G E Y F + L+ ++R + G
Sbjct: 1115 QKATLI-------PGGSEALIYATLSGTVGAFVPFTSREDYDFFQHLEMHMRNENPPLCG 1167
Query: 974 LNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
+H +RS KN LDGDL E +L + + I+ M + ++CK++E++
Sbjct: 1168 RDHLSYRS-----SYYPVKNVLDGDLCEQYLSIEAAKQKSIAGDMFRTPNQICKKLEDI 1221
>gi|405121632|gb|AFR96400.1| hypothetical protein CNAG_03173 [Cryptococcus neoformans var. grubii
H99]
Length = 1276
Score = 202 bits (514), Expect = 8e-49, Method: Compositional matrix adjust.
Identities = 225/912 (24%), Positives = 407/912 (44%), Gaps = 125/912 (13%)
Query: 186 VYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIEL- 244
V+ S F + I ++ + +G G+ ++GD G L EK G+ +
Sbjct: 353 VWRSRQGFGTV-IAATVIEDHG----SGASVVIGDEYGAFTALGWEFEKGLGAGMDGRVR 407
Query: 245 -----LGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQ------PDAKGSY------- 286
LG +S S+I+YLD++ +++ S+ DS L++L P KG
Sbjct: 408 VLRTYLGASSPPSSITYLDSSHLFVSSAVADSVLLRLPTVESSTNVPSGKGKGREVISPI 467
Query: 287 -----------------------VEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYK 323
+E+LER++N+ P+ D C V E +V SGA +
Sbjct: 468 GDQLDKWEVLYEIGKDRNDTDEGLEILERWMNIAPVKDLCAVKDEGGNLSHLVLASGASE 527
Query: 324 DGSLRIVRNGIGINEQASVE-LQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLED 382
SLR+VR+G+G+ E +++ L ++ MWSL ST P L++S + T + + L+
Sbjct: 528 SNSLRVVRSGVGLEELVTIQGLHDVQKMWSLTDSTAVPR---LLLS--TSTSTVLLQLQP 582
Query: 383 ELEETEIEGFCSQTQTLFCHDAIYNQLV-QVTSGSVRLVSSTS-RELRNEWKSPPGYSVN 440
E+ I +++TL +L+ QVT + L S S +L + K G +
Sbjct: 583 EISVIPIVDVIFKSETLAAGILPGAELLAQVTPRGLSLWSDLSVGQLEAQMKVDKGTEIV 642
Query: 441 VATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAV 500
A A ++A GG+LV + D + ++ E+S + I+ ++ S + A
Sbjct: 643 CAQVTADWAVVAKKGGNLVVFHVSDTGFSPEGTIDVKEEVSAVAISNSSDSSSPIIVIAT 702
Query: 501 GMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEGISY--------LLCALGDG- 551
WT + +++L ++ + G IP + + S+ LL L +G
Sbjct: 703 --WTAKTF-VYTLSQIS----NGVDGLSIPTKSSATSLQLRSHPFYPAGIQLLSGLDNGL 755
Query: 552 ---HLLNFLLNMKTGELT--DRKKVSLGTQPITLRTFSSKNTTHVFAA---SDRPTVIYS 603
H L+ + GEL K SLG +P+ L S + + ++R +VI+
Sbjct: 756 LHIHDLDTSDSGDAGELMVKSSKSASLGLRPLVLHPCESTHGDEKVISVGLTERMSVIFE 815
Query: 604 SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG-EH 662
S ++ +S+VN+K + N+++ P ++ L++ I+ ++KLH+++ G E
Sbjct: 816 SKGRVEFSSVNIKNIMAATAVNTSSGPVFALFSRTSGLSLVKINSLKKLHVQTCDTGNES 875
Query: 663 PRRICHQEQSRTFAICSLKNQSCAEES---EMHFVRLLDDQTFEFISTYPLDTFEYGCSI 719
++ + ++ + A C L ++ + E +FV++ D + E +S++ L E S+
Sbjct: 876 VSKLTYMDEYKAVA-CGLTRRTQLRDGDVEEENFVQIRDGTSLEPLSSFSLRGRELVTSL 934
Query: 720 LSCSFSDDSNVYYCVGTAYVLPEENEPT-----------KGRILVFIVEDGK------LQ 762
S + ++Y VGT + P++ E + +GR+L+ ++G ++
Sbjct: 935 RSVFLT--GSIYLAVGTGILPPDDGEDSSWDEGNLAVVREGRVLLLEFKEGDAGSGWDIK 992
Query: 763 LIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHG-------H 815
+ AE T GAVY+L +G L A K+ +++ D EL+
Sbjct: 993 VKAELATVGAVYALEEIHGFLAVAAGSKLTIHR---LDHNPVELEETSSWASAYVISSLS 1049
Query: 816 ILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDD--DIYL 873
+L + ++VGD M+S+ +L +G I + R+ + ++A+ +L D D +
Sbjct: 1050 VLPPSLMRPEGALIVGDGMRSVIVLNVDEGDGMIYDDERNMATHGVTALGLLKDKGDGVV 1109
Query: 874 GAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPT 933
++ + NL T R N +LE + L E V RF+ GSLV ++ IP
Sbjct: 1110 ISDAHSNLLTYRLNQ---------KLERAATFGLHEEVTRFQSGSLVPTTTAPEI-IIPD 1159
Query: 934 VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKN 993
V+F T G +G+I L L+ LQ N+ K+ KG G + WR + D
Sbjct: 1160 VLFATREGRLGIIGELGTRSSRTLDDLQRNMSKIWKGPGEVGWSNWRRAGSNLVGKDTAG 1219
Query: 994 FLDGDLIESFLD 1005
F+DGD ++ FLD
Sbjct: 1220 FVDGDFVQKFLD 1231
>gi|226293297|gb|EEH48717.1| pre-mRNA-splicing factor rse1 [Paracoccidioides brasiliensis Pb18]
Length = 1208
Score = 202 bits (514), Expect = 8e-49, Method: Compositional matrix adjust.
Identities = 277/1161 (23%), Positives = 486/1161 (41%), Gaps = 161/1161 (13%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
++G I TL FR G ++D++ I ++ + ++++ S R + + G R
Sbjct: 64 VFGIIRTLAAFRLAGGSKDYIIIGSDSGRITIIEY-VPSQNRFNRIHLETFGKSGIRRVI 122
Query: 65 NGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YG 119
GQ +DP R LI + L V+ + + +L + + + Q L G
Sbjct: 123 PGQYLAVDPKGRACLIASVEKNKLVYVLNRNTQAELTISSPLEAHKPQTLVFALTALDVG 182
Query: 120 CAKPTIVVLYQDNKDARHVKT-------------YEVALKDKDFVEGPWSQNNLDNGADL 166
P L + +A T YE+ L V WS + +D A +
Sbjct: 183 YDNPIFAALEVEYTEADQDPTGQAYEELEKQLVYYELDLGLNHVVRK-WS-DPVDRSATM 240
Query: 167 LIPVPPPL---CGVLIIGEETIVYCSAN--AFKA-IPIR--PSITKAYGRVDADGSR--- 215
L VP G L+ E+ I Y +N AF+ IP R P+ R G
Sbjct: 241 LFQVPGGADGPSGCLVCAEDNITYRHSNQDAFRVPIPRRSGPTENPERKRCIVAGVMHKM 300
Query: 216 -----YLLGDHAGLLHLLVITHEKE---KVTG----LKIELLGETSIASTISYLDNAVVY 263
+LL G L + I ++ ++TG LK++ IAS++ L + ++
Sbjct: 301 RGAFFFLLQTEDGDLFKVTIDMVEDDNGQITGEVRRLKLKYFDTVPIASSLCILKSGFLF 360
Query: 264 IGSSYGDS---QLIKLNLQPD----AKGSY--------------------VEVLERYVNL 296
+ S G+ Q KL D S+ V ++E +L
Sbjct: 361 VASETGNHHFYQFEKLGDDDDEIEFTSDSFSANPSEPLPPVYFHPRLSENVHLVENVNSL 420
Query: 297 GPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSLRS 355
P++D + +L Q+ T G S R +++G+ + E EL + +W+ +
Sbjct: 421 NPLMDCKITNLTEDDAPQIYTICGTGARSSFRTLKHGLEVTEIVESELPSVPSAVWTSKL 480
Query: 356 STDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSG 415
+ +D FD ++++SF + T +L++ + +EE GF S TL + L+QV
Sbjct: 481 TREDEFDAYIILSFSNGTLVLSIG--ETVEEVTDTGFLSSAPTLAVQQLGEDSLIQVHPK 538
Query: 416 SVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTEV-KH 473
+R + + R NEW +P S+ AT N QV +A G +VY E+ DG L E +
Sbjct: 539 GIRHIHADRR--VNEWPAPQHRSIVAATTNERQVAVALSSGEIVYFEMDTDGSLAEYDEK 596
Query: 474 AQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPRS 532
++ ++CL + + + + S AVG D +VRI SL PD L K P +
Sbjct: 597 REMSGTVTCLSLGEVPKGRARSSFLAVGC-DDSTVRILSLDPDSTLGNKSVQALTSAPSA 655
Query: 533 VLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSK 586
+ + A + YL L G L +L+ TGEL+D + LG +P+ L S K
Sbjct: 656 LSIMAMADSTSGGSTLYLHIGLYSGIYLRTVLDEVTGELSDTRTRFLGLKPVKLFQVSVK 715
Query: 587 NTTHVFAASDRPTVIYS--SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIG 644
V A S RP + YS K + + ++ + F+S + + + L I
Sbjct: 716 EQKAVLALSSRPWLGYSDLQTKAFMLTPLDYVGLEWGWNFSSEQCVEGMVGIQGQNLRIF 775
Query: 645 TIDDIQ-KLHIRSIPLGEHPRR-ICHQEQSRTFAI------------CSLKNQSCAEESE 690
+I+ ++ L +I L PR+ I H EQ + I L N S A +
Sbjct: 776 SIEKLENNLLQETISLTYTPRQFIKHPEQPLFYVIEADNNILSPATRTKLLNDSDAVNGD 835
Query: 691 M---------------HF---VRLLDD-QTFEFISTYPLDTFEYGCSILSCSF-SDDSNV 730
H+ ++++D + +S L+ E S+ + SF S D
Sbjct: 836 ATVLPPEDFGYPRGTGHWASCIQIVDPVNSKSVVSQIELEENEAAVSVAAVSFSSQDDET 895
Query: 731 YYCVGTAY-VLPEENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVYSLNAFNGKLLAAI 787
+ VGT ++ + G I ++ EDGK L+ I + + + +L F G+LLA I
Sbjct: 896 FLVVGTGKDMVVNPRSCSAGFIHIYRFQEDGKELEFIHKTKVEQPPVALLGFQGRLLAGI 955
Query: 788 NQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEG 847
+++Y +R R+ Q+ H L + +QT+G I+V D+ +S++ ++YK +E
Sbjct: 956 GTDVRIYDLGMR-QMLRKCQASVVPH---LVVGLQTQGSRIIVSDVQESVTYVVYKSQEN 1011
Query: 848 AIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDEERG------- 897
+ D + W + ++D + G + NL+ +R K SE A ++ G
Sbjct: 1012 RLIPFVDDVISRWTTCTTMVDYETVAGGDKFGNLWLLRCPAKASEEADEDGSGAHLIHER 1071
Query: 898 --------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGV-IAS 948
RL +V ++ + + LV G +++ + G +G+ I
Sbjct: 1072 QYLQGAPNRLNLVAHFYPQDLPTSIQKAQLV-------TGGRDILVWTGLQGTVGMLIPF 1124
Query: 949 LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSR 1008
+ E+ F + L+ L + G +H +RS+ AK +DGDL E++L L
Sbjct: 1125 ISREEVDFFQSLEMQLAAQNPPLAGRDHLIYRSY-----YAPAKGTIDGDLCETYLLLPN 1179
Query: 1009 TRMDEISKTMNVSVEELCKRV 1029
+ +I+ ++ SV E+ +++
Sbjct: 1180 DKKQQIAGELDRSVREIERKI 1200
>gi|60677959|gb|AAX33486.1| RE01065p [Drosophila melanogaster]
Length = 1227
Score = 202 bits (514), Expect = 9e-49, Method: Compositional matrix adjust.
Identities = 260/1199 (21%), Positives = 471/1199 (39%), Gaps = 197/1199 (16%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+L I+G + L FR G +D++ + ++ + +L+++ + L
Sbjct: 53 LLSTEIFGCVRALMAFRLTGGTKDYIVVGSDSGRIVILEYNPSKNALEKVHQETFGKSGC 112
Query: 61 RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL- 117
R GQ IDP R +IG L ++ D + +L + + + L +
Sbjct: 113 RRIVPGQYFAIDPKGRAVMIGAVEKQKLAYIMNRDTQARLTISSPLEAHKSNTLTYHMVG 172
Query: 118 --YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNGA 164
G P + L D ++A + + A + + F E N+ L+ A
Sbjct: 173 VDVGFDNPMLACLEIDYEEADMDPSGDAAQRTQQTLTFYELDLGLNHVVRKYSEPLEEHA 232
Query: 165 DLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR---- 215
+ L+ VP P GVLI E + Y N IR I + +D D R
Sbjct: 233 NFLVSVPGGNDGP--SGVLICSENYLTY--KNLGDQHDIRCPIPRRRNDLD-DPERGMIF 287
Query: 216 -------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVV 262
+LL G + + + + + V+ +K++ A+ + L +
Sbjct: 288 ICSATHRTKSMYFFLLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPATAMCVLKTGFL 347
Query: 263 YIGSSYGDSQLIKLN------------------------LQPDAKGSYVEVLERYVNLGP 298
++ S +G+ L ++ P A + V +++ + P
Sbjct: 348 FVASEFGNHYLYQIAHLGDDDDEPEFSSAMPLEEGETFFFAPRALKNLV-LVDELPSFAP 406
Query: 299 IVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSST 357
I+ V DL + Q+ G +LR++R+G+ ++E A EL G +W+++
Sbjct: 407 IITSQVADLANEDTPQLYVLCGRGPRSTLRVLRHGLEVSEMAVSELPGNPNAVWTVKKRA 466
Query: 358 DDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSV 417
DD FD +++VSF++ T +L++ + +EE GF T TL C + LVQV +
Sbjct: 467 DDEFDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLCCAALGDDALVQVYPDGI 524
Query: 418 RLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD-GILTE-VKHAQ 475
R + S R NEWK+P S+ N QV++ G LVY E+ G L E + ++
Sbjct: 525 RHIRSDKR--VNEWKAPGKKSITKCAVNQRQVVITLSGRELVYFEMDPTGELNEYTERSE 582
Query: 476 LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLL 535
+ EI C+ + + E S AVG+ D +VRI SL N +T + P L
Sbjct: 583 MPAEIMCMALGTVPEGEQRSWFLAVGL-ADNTVRILSLDPNNCLTPCSMQALPSPAES-L 640
Query: 536 CAFE-------------------------GISYLLCALGDGHLLNFLLNMKTGELTDRKK 570
C E G YL L +G LL +L+ +G+L D +
Sbjct: 641 CLVEMGHTESTTQGGLDDDAPAQRSGNNKGTIYLNIGLSNGVLLRTVLDPVSGDLADTRT 700
Query: 571 VSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFP 630
LG++P+ L + + V A S R + Y + + ++ + + + F+S
Sbjct: 701 RYLGSRPVKLFRIKMQGSEAVLAMSSRTWLSYYHQNRFHLTPLSYETLEYASGFSSEQCS 760
Query: 631 DSLAIAKEGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQEQSR-----------TFAI 677
+ + L I ++ + + + + PL PR + H + R T
Sbjct: 761 EGIVAISTNTLRILALEKLGAVFNQVAFPLQYTPRTFVIHPDTGRMLIAETDHNAYTEDT 820
Query: 678 CSLKNQSCAEE-------------SEM-------------------------HFVRLLDD 699
S + + AEE EM +R LD
Sbjct: 821 KSARKEQMAEEMRSAAGDEERELAREMANAFINEVLPEDVFSSPKAGLGLWASQIRCLDA 880
Query: 700 QTFEFISTYPLDTFE--YGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVE 757
+ + + PL E ++L S + D Y VG A L ++G +
Sbjct: 881 MHGQTMFSVPLTQNEAIMSMAMLKFSIAADGRYYLAVGIAKDLQLNPRISQGGCIDIYKI 940
Query: 758 D---GKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSEC-GHH 813
D L+ + + +L F G+LLA + +++Y D G +++ +C H
Sbjct: 941 DPTCSSLEFMHRTDIDEIPGALCGFQGRLLAGCGRMLRIY-----DFGKKKMLRKCENKH 995
Query: 814 GHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYL 873
+ +Q G + V D+ +S+ + Y+ E + A D + W++A +LD D +
Sbjct: 996 IPYQIVNIQAMGHRVYVSDVQESVFFIRYRRAENQLIIFADDTHPRWVTATTLLDYDT-I 1054
Query: 874 GAENNFNLFTVRKNSEGATDE------------ERGRL-------EVVGEYHLGEFVNRF 914
+ F ++++ TD+ +RG L E + +H+GE +
Sbjct: 1055 AIADKFGNLSIQRLPHSVTDDVDEDPTGTKSLWDRGLLSGASQKSENICSFHVGEIIMSL 1114
Query: 915 RHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGG 973
+ +L+ G +I+ T++G +G E Y F + L+ ++R + G
Sbjct: 1115 QKATLI-------PGGSEALIYATLSGTVGAFVPFTSREDYDFFQHLEMHMRNENPPLCG 1167
Query: 974 LNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
+H +RS KN LDGDL E +L + + I+ M + ++CK++E++
Sbjct: 1168 RDHLSYRS-----SYYPVKNVLDGDLCEQYLSIEAAKQKSIAGDMFRTPNQICKKLEDI 1221
>gi|326483043|gb|EGE07053.1| pre-mRNA-splicing factor rse1 [Trichophyton equinum CBS 127.97]
Length = 1209
Score = 202 bits (513), Expect = 9e-49, Method: Compositional matrix adjust.
Identities = 277/1164 (23%), Positives = 489/1164 (42%), Gaps = 161/1164 (13%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
++G I +L FR G ++D++ I ++ + ++++ + R + + G R
Sbjct: 64 VFGIIRSLAAFRLAGSSKDYIIIGSDSGRITIVEY-VPAQNRFNRIHLETFGKSGVRRVV 122
Query: 65 NGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEE-------LQVLDIK 115
GQ +DP R LI + L V+ + + +L + + L LD+
Sbjct: 123 PGQYLAVDPKGRACLIASVEKNKLVYVLNRNAQAELTISSPLEAHRPQTVVFALTALDVG 182
Query: 116 F---LYGCAKPTIVVLYQDNKDARHVKT------YEVALKDKDFVEGPWSQNNLDNGADL 166
+ ++ + + QD + +T YE+ L V W+ + +D A +
Sbjct: 183 YENPIFAALEVEYTEVDQDPTGQAYEETEKMLVYYELDLGLNHVVRR-WA-DPVDRTASM 240
Query: 167 LIPVPPPL---CGVLIIGEETIVYCSAN--AFKA-IPIR--PSITKAYGRVDADGSR--- 215
L VP GVL+ E+ IVY +N AF+ IP R P+ R G
Sbjct: 241 LFQVPGGADGPSGVLVCAEDNIVYRHSNQDAFRVPIPRRRGPTENPERKRCITAGVMHKM 300
Query: 216 -----YLLGDHAGLLH---LLVITHEKEKVTG----LKIELLGETSIASTISYLDNAVVY 263
+LL G L + ++ E EK TG LK++ +AS++ L + ++
Sbjct: 301 RGAFFFLLQSEDGDLFKVTMEMVEDENEKATGEVKRLKLKYFDTVPLASSLCILKSGFLF 360
Query: 264 IGSSYGD-----------------------SQLIKLNLQP----DAKGSYVEVLERYVNL 296
+ S G+ S +I L P + ++E +L
Sbjct: 361 VASETGNQHFYQFEKLGDDDDEIEFISDDYSAVISEPLPPVYFRPRPAENLNLVESIASL 420
Query: 297 GPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSLRS 355
P++ + ++ + Q+ T G S R +++G+ ++E EL + +W+ +
Sbjct: 421 NPLMAASITNITEEDAPQIYTLCGTGARSSFRTLKHGLEVSEIVESELPSVPSAVWTTKL 480
Query: 356 STDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSG 415
S +D FD ++V+SF + T +L++ + +EE GF S TL + L+QV
Sbjct: 481 SRNDQFDAYIVLSFSNGTLVLSIG--ETVEEVTDTGFLSSAPTLAVQQLGEDSLIQVHPK 538
Query: 416 SVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTEV-KH 473
+R + + R NEW +P S+ AT N QV +A G +VY E+ DG L E +
Sbjct: 539 GIRHIHADQR--VNEWPAPQHRSIVAATTNERQVAIALSSGEIVYFEMDTDGSLAEYDEK 596
Query: 474 AQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPRS 532
Q+ ++CL + + E S AVG D +VRI SL PD L K P +
Sbjct: 597 RQMSGTVTCLSLGEVPEGRVRSSFLAVGC-DDSTVRILSLDPDSTLENKSVQALTSAPSA 655
Query: 533 V-LLCAFEGIS-----YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSK 586
+ ++ + S YL L G L +L+ TGEL+D + LG +P+ L + S K
Sbjct: 656 LSIMSMIDSTSGGSTLYLHIGLYSGIYLRTVLDEVTGELSDTRTRFLGVKPVKLFSVSVK 715
Query: 587 NTTHVFAASDRPTVIYS--SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIG 644
V A S R + YS K + +N + F+S + + + L I
Sbjct: 716 EQRAVLALSSRSWLGYSDVQTKSFTLTPLNYVGLEWSWNFSSEQCVEGMVGIQGQNLRIF 775
Query: 645 TIDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQ--SCAEESEM---------- 691
+I+ + L IPL PR + F + N S A ++++
Sbjct: 776 SIEKLDNNLLQEPIPLAYTPRNFVRHPEYPLFYVIGSDNNILSPATKAKLLSESTAVNGD 835
Query: 692 ----------------HF---VRLLDD-QTFEFISTYPLDTFEYGCSILSCSF-SDDSNV 730
H+ ++++D T +S L+ E SI + SF S +
Sbjct: 836 SAELPPEDFGYPRGTNHWASSIQVVDPIHTKSVLSNLELEDNEAAVSIAAVSFTSQEDET 895
Query: 731 YYCVGTAY-VLPEENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVYSLNAFNGKLLAAI 787
+ VGT ++ T G I ++ E+GK L+ I + + + +L F G+LLA I
Sbjct: 896 FLVVGTGKDMVVSPRTFTCGFIHIYRFQEEGKELEFIHKTKVEQPPLALLGFQGRLLAGI 955
Query: 788 NQKIQLYKWMLRDDGTRELQSEC-GHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEE 846
+++Y D G R+L +C + + +QT+G I+V D+ +S++ ++YK++E
Sbjct: 956 GPDLRIY-----DLGMRQLLRKCQAQITPRVIVGLQTQGSRIIVSDVQESVTYVVYKYQE 1010
Query: 847 GAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDEERGRLEVVG 903
A+ A D W + ++D + G + N++ +R K SE A ++ G +
Sbjct: 1011 NALIPFADDIIPRWTTCTTMVDYETVAGGDKFGNIWLLRCPTKASEEADEDGSGAHLIHE 1070
Query: 904 EYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGT--------------VNGVIGV-IAS 948
+L NR SLV+ D IPT I T + G +G+ +
Sbjct: 1071 RQYLQGAPNRL---SLVIHFYSQD---IPTSIQKTQLVAGGRDILVWTGLQGTVGMFVPF 1124
Query: 949 LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSR 1008
+ + F + L+ L + G +H +R + K +DGDL E+FL L
Sbjct: 1125 ITRDDVDFFQTLEMQLASQNPPLAGRDHLIYRGY-----YAPCKGVIDGDLCETFLLLPN 1179
Query: 1009 TRMDEISKTMNVSVEELCKRVEEL 1032
+ I+ ++ SV E+ +++ ++
Sbjct: 1180 DKKQAIAGELDRSVREIERKISDM 1203
>gi|413935524|gb|AFW70075.1| hypothetical protein ZEAMMB73_605375 [Zea mays]
Length = 1229
Score = 202 bits (513), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 261/1186 (22%), Positives = 472/1186 (39%), Gaps = 179/1186 (15%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+L V ++G I +L FR G +D+L + ++ + +L++ + + L
Sbjct: 64 LLSVDVFGAIRSLAQFRLTGANKDYLVVGSDSGRLVILEYSPDRNRLDKVHQETFGKSGC 123
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLY-- 118
R GQ+ +DP R + + + V + + + LE + + F
Sbjct: 124 RRIVPGQLLAVDPKGRALCIAALEKQKLVYVLNRDAAARLTISSPLEAHKSNTLTFALAA 183
Query: 119 ---GCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNGA 164
G P + + ++ T + A + + F E N+ +DNGA
Sbjct: 184 LDCGFDNPVFAAIELEYAESDRDPTGQAANQAQKLLTFYELDLGLNHVSRKVSEPIDNGA 243
Query: 165 DLLIPVP-----PPLCGVLIIGEETIVYCSANAFK---AIPIRPSITKAYGRVDADGSR- 215
+LL+ VP P GVL+ + ++Y + + IP R + G + +
Sbjct: 244 NLLVTVPGGGDGP--SGVLVCCDNFVLYRNQGHPEIRAVIPRRADLPAERGVLIVAAATH 301
Query: 216 -------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSY 268
+LL G + + + H + V+ L+I+ + S + L + ++ S +
Sbjct: 302 KQKSMFFFLLQTEYGDIFKVDLDHNGDAVSELRIKYFDTIPVTSAVCVLRSGFLFAASEF 361
Query: 269 GDSQLIKLN--------------------------LQPDAKGSYVEVLERYVNLGPIVDF 302
G+ L + QP A + + + E +L P++D
Sbjct: 362 GNHALYQFRDIGREADVESSSATLMETEEGFQPVFFQPRALKNLMRIDE-IESLMPVMDM 420
Query: 303 CVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIK-GMWSLRSSTDDPF 361
V +L + Q+ T G +LRI+R G+ I+E A L +W+++ + +D F
Sbjct: 421 RVANLFDEETPQLFTACGRGPRSTLRILRPGLAISEMARSMLPAEPIAVWTVKKNINDMF 480
Query: 362 DTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVS 421
D ++VVSF + T +L++ + +EE F T +L + L+QV +R +
Sbjct: 481 DAYIVVSFTNVTLVLSIG--ETIEEVSDSQFLDTTHSLAVTLLGEDSLMQVHPNGIRHIR 538
Query: 422 STSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTEVKHAQLEYEI 480
R NEW++P ++ +N QV++A GG L+Y E+ G L EV+ + ++
Sbjct: 539 EDGRV--NEWRTPGKKTITKVGSNRLQVVIALSGGELIYFEMDMTGQLMEVEKQDMSGDV 596
Query: 481 SCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPRSVLLCAFE 539
+CL I P+ E S+ AVG + D ++RI SL PD L P S+L +
Sbjct: 597 ACLAIAPVPEGRQRSRFLAVGSY-DNTIRILSLDPDDCLQPLSVQSVSSAPESLLFLEVQ 655
Query: 540 -------GISY-----LLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKN 587
G Y L L +G L ++M TG+L+D + LG +P L +
Sbjct: 656 ASVGGEDGADYPANLFLNAGLQNGVLFRTNVDMVTGQLSDTRSRFLGLRPPKLFPCIVSH 715
Query: 588 TTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTID 647
+ S RP + Y L + ++ + F+S + + L I TI+
Sbjct: 716 RQAMLCLSSRPWLGYIHQGHFLLTPLSCDTLESAASFSSDQCSEGVVAVAGDALRIFTIE 775
Query: 648 DI-QKLHIRSIPLGEHPRRICHQEQSRTFAIC-SLKNQSCAEESEMHFVRLLD------- 698
+ + + +IPL PR+ + + A+ S K AEE E LD
Sbjct: 776 RLGETFNETAIPLRYTPRKFVILPKKKYIAVIESDKGAFSAEEREAAKKECLDASGAAEN 835
Query: 699 --------------DQTFEFISTYPLDTFEY----------------------------- 715
E +T+P + + Y
Sbjct: 836 GNANNGDPMENGDGQDGAEEGNTFPDEQYGYPKAESERWVSCIRILDPRSRDTTCLLELQ 895
Query: 716 ----GCSILSCSFSD-DSNVYYCVGTAYVLPEENEPTKG----RILVFIVEDGKLQLIAE 766
SI + +F D + +GTA L T + F+ E L+L+ +
Sbjct: 896 DNEAAVSICTVNFHDKEHGTLLAIGTAKGLQFWPRRTLAGGFIHVYKFVDEGRSLELLHK 955
Query: 767 KETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY-VQTRG 825
+ + +L F G+LLA + ++LY D G R+L +C + L + T
Sbjct: 956 TQVEEVPLALCQFQGRLLAGVGSVLRLY-----DLGKRKLLRKCENKLFPRTLVSIHTYR 1010
Query: 826 DFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR 885
D I VGD+ +S Y+ +E + A D W++ + +D D GA+ N++ R
Sbjct: 1011 DRIYVGDMQESFHYCKYRRDENQLYIFADDSVPRWLTTAQHIDFDTMAGADKFGNIYFAR 1070
Query: 886 ------------------KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSD 927
K +G + ++E + ++H+G+ V + SL+
Sbjct: 1071 LPQDISDEIEEDPTGGKIKWEQGKLNGAPNKVEEIVQFHVGDVVTCLQKASLI------- 1123
Query: 928 VGQIPTVIFGTVNGVIG-VIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEK 986
G +I+GTV G +G ++A E F L+ +LR+ + G +H +RS
Sbjct: 1124 PGGGECLIYGTVMGSVGALLAFTSREDVDFFSHLEMHLRQEHPPLCGRDHMAYRS----- 1178
Query: 987 KTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
K+ +DGDL E + L +I+ ++ + E+ K++E++
Sbjct: 1179 AYFPVKDVIDGDLCEQYPSLPADMQRKIADELDRTPGEILKKLEDI 1224
>gi|317143715|ref|XP_001819645.2| pre-mRNA-splicing factor rse1 [Aspergillus oryzae RIB40]
Length = 1209
Score = 201 bits (512), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 277/1177 (23%), Positives = 492/1177 (41%), Gaps = 187/1177 (15%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
++G I +L FR G +D++ I ++ + ++++ S R + + G R
Sbjct: 64 VFGIIRSLAAFRLAGSNKDYIIIGSDSGRITIIEY-VPSQNRFNRIHLETFGKSGVRRVI 122
Query: 65 NGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YG 119
GQ +DP R LI + L V+ +++ +L + + + Q L G
Sbjct: 123 PGQYLAVDPKGRACLIASVEKNKLVYVLNRNSQAELTISSPLEAHKPQTLVFAMSALDVG 182
Query: 120 CAKPTIVVLYQDNKDARHVKT---YEVALK---------DKDFVEGPWSQNNLDNGADLL 167
P L D ++ T YE A K + V W+ + +D + +L
Sbjct: 183 YENPIFAALEVDYSESDQDPTGQAYEEAEKLLVYYELDLGLNHVVRKWA-DPVDRTSSML 241
Query: 168 IPVP-----PPLCGVLIIGEETIVYCSAN--AFKA-IPIRPSITKAYGRVDADGSRYLLG 219
VP P GVL+ E+++ Y +N AF+ IP R T+ + + R++
Sbjct: 242 FQVPGGADGP--SGVLVCAEDSVTYRHSNQDAFRVPIPRRSGPTE-----NPERKRFIT- 293
Query: 220 DHAGLLHLL---------------------VITHEKEKVTG----LKIELLGETSIASTI 254
AG++H + ++ + ++TG LKI+ +AS++
Sbjct: 294 --AGVMHKMRGAFFFLLQTEDGDLFKLNIDMVEDDNGQLTGEVKRLKIKYFDTVPVASSL 351
Query: 255 SYLDNAVVYIGSSYGDSQLIKL---------------NLQPDAK------------GSYV 287
L + +Y+ S G+ + N D +
Sbjct: 352 LILKSGFLYVASEAGNHHFYQFEKLGDDDEEIEFSSENFSADPSVPLEPIYFRPRSAENL 411
Query: 288 EVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI 347
++E +L P++D V +L + Q+ T SG + R +++G+ ++E EL +
Sbjct: 412 NLVETINSLNPLIDSKVANLSEEDAPQIYTISGTGARSTFRTLKHGLEVSEIVDSELPSV 471
Query: 348 -KGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIY 406
+W+ + + D FD ++++SF + T +L++ + +EE GF S TL
Sbjct: 472 PSAVWTTKLTRADEFDAYIILSFANGTLVLSIG--ETVEEVTDTGFLSSAPTLAVQQLGE 529
Query: 407 NQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GD 465
+ L+Q+ +R + + R NEW +P S+ A N QV +A G +VY E+ D
Sbjct: 530 DSLIQIHPRGIRHIMADRR--VNEWPAPQHRSIVAAATNERQVAVALSSGEIVYFEMDAD 587
Query: 466 GILTEV-KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEH 523
G L E + Q+ ++CL + + E S AVG D +VRI SL PD L K
Sbjct: 588 GTLAEYDERRQMSGTVTCLSLGEVPEGRVRSSFLAVGC-DDSTVRILSLDPDSTLENKSV 646
Query: 524 LGGEIIPRSVLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQP 577
P ++ + + S YL L G L +L+ TGEL+D + LG++P
Sbjct: 647 QALTAAPSALNIMSMSDSSSGGTALYLHIGLYSGVYLRTVLDEVTGELSDTRTRFLGSKP 706
Query: 578 ITLRTFSSKNTTHVFAASDRPTVIYS--SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAI 635
+ L S K T V A S RP + YS K + + ++ + F+S + +
Sbjct: 707 VKLFQVSVKGQTAVLALSSRPWLGYSDIQTKGFMLTPLDYVPLEWGWNFSSEQCLEGMVG 766
Query: 636 AKEGELTIGTIDDI-QKLHIRSIPLGEHPRRIC-HQEQSRTFAICS----LKNQSCAEES 689
+ L I +I+ + + +SI L PRR H EQ + I S L + A+
Sbjct: 767 IQGQNLRIFSIEKLDNNMLQQSISLAYTPRRFLKHPEQPLFYVIESDNNVLSPSTRAKLL 826
Query: 690 EMHFVRLLDDQTF---EF------------------------ISTYPLDTFEYGCSILSC 722
E R D+ EF +ST L+ E S+ +
Sbjct: 827 EDSKARTGDETVLPPEEFGYPRGSGHWASCIQVVDPVHAKAVVSTIELEENEAAVSVAAV 886
Query: 723 SF-SDDSNVYYCVGTAYVLPEENEPTK--GRILVF-IVEDGK-LQLIAEKETKGAVYSLN 777
F S D + VGTA + N P+ G I ++ EDG+ L+ I + + + +L
Sbjct: 887 PFTSQDDETFLVVGTAKDM-NVNPPSSAGGYIHIYRFQEDGRELEFIHKTKVEEPPLALL 945
Query: 778 AFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHH---GHILALYVQTRGDFIVVGDLM 834
F G+L+A I +++Y D G ++L +C I+ L QT+G IVV D+
Sbjct: 946 GFQGRLVAGIGPMLRIY-----DLGMKQLLRKCNAQVVPKTIVGL--QTQGSRIVVSDVR 998
Query: 835 KSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGA 891
+S++ ++YK++E + D + W ++ ++D + G + N++ +R K SE A
Sbjct: 999 ESVTYVVYKYQENVLIPFVDDSVSRWTTSTTMVDYETTAGGDKFGNIWMLRCPKKISEQA 1058
Query: 892 TDE--------ERG-------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIF 936
++ ERG RLE++ + + LV G +++
Sbjct: 1059 DEDGSGAHLIHERGYLHGTPNRLELMIHVYTQDIPTTLHKTQLV-------AGGRDILVW 1111
Query: 937 GTVNGVIGVIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFL 995
+G IG++ + E F + L+ L + G +H +RS+ K V +
Sbjct: 1112 SGFHGTIGMLVPFVSREDVDFFQNLEMQLAAQNPPLAGRDHLIYRSYYAPVKGV-----I 1166
Query: 996 DGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
DGDL E++ L I+ ++ SV E+ +++ ++
Sbjct: 1167 DGDLCETYFLLPNDTKMMIAAELDRSVREIERKISDM 1203
>gi|225683909|gb|EEH22193.1| pre-mRNA-splicing factor rse1 [Paracoccidioides brasiliensis Pb03]
Length = 1209
Score = 201 bits (512), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 276/1164 (23%), Positives = 488/1164 (41%), Gaps = 161/1164 (13%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
++G I TL FR G ++D++ I ++ + ++++ S R + + G R
Sbjct: 64 VFGIIRTLAAFRLAGGSKDYIIIGSDSGRITIIEY-VPSQNRFNRIHLETFGKSGIRRVI 122
Query: 65 NGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YG 119
GQ +DP R LI + L V+ + + +L + + + Q L G
Sbjct: 123 PGQYLAVDPKGRACLIASVEKNKLVYVLNRNTQAELTISSPLEAHKPQTLVFALTALDVG 182
Query: 120 CAKPTIVVLYQDNKDARHVKT-------------YEVALKDKDFVEGPWSQNNLDNGADL 166
P L + +A T YE+ L V WS + +D A +
Sbjct: 183 YDNPIFAALEVEYTEADQDPTGQAYEELEKQLVYYELDLGLNHVVRK-WS-DPVDRSATM 240
Query: 167 LIPVPPPL---CGVLIIGEETIVYCSAN--AFKA-IPIR--PSITKAYGRVDADGSR--- 215
L VP G L+ E+ I Y +N AF+ IP R P+ R G
Sbjct: 241 LFQVPGGADGPSGCLVCAEDNITYRHSNQDAFRVPIPRRSGPTENPERKRCIVAGVMHKM 300
Query: 216 -----YLLGDHAGLLHLLVITHEKE---KVTG----LKIELLGETSIASTISYLDNAVVY 263
+LL G L + I ++ ++TG LK++ IAS++ L + ++
Sbjct: 301 RGAFFFLLQTEDGDLFKVTIDMVEDDNGQITGEVRRLKLKYFDTVPIASSLCILKSGFLF 360
Query: 264 IGSSYGDS---QLIKLNLQPD----AKGSY--------------------VEVLERYVNL 296
+ S G+ Q KL D S+ V ++E +L
Sbjct: 361 VASETGNHHFYQFEKLGDDDDEIEFTSDSFSANPSEPLPPVYFHPRLSENVHLVENVNSL 420
Query: 297 GPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSLRS 355
P++D + +L Q+ T G S R +++G+ + E EL + +W+ +
Sbjct: 421 NPLMDCKITNLTEDDAPQIYTICGTGARSSFRTLKHGLEVTEIVESELPSVPSAVWTSKL 480
Query: 356 STDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSG 415
+ +D FD ++++SF + T +L++ + +EE GF S TL + L+QV
Sbjct: 481 TREDEFDAYIILSFSNGTLVLSIG--ETVEEVTDTGFLSSAPTLAVQQLGEDSLIQVHPK 538
Query: 416 SVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTEV-KH 473
+R + + R NEW +P S+ AT N QV +A G +VY E+ DG L E +
Sbjct: 539 GIRHIHADRR--VNEWPAPQHRSIVAATTNERQVAVALSSGEIVYFEMDTDGSLAEYDEK 596
Query: 474 AQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPRS 532
++ ++CL + + + + S AVG D +VRI SL PD L K P +
Sbjct: 597 REMSGTVTCLSLGEVPKGRARSSFLAVGC-DDSTVRILSLDPDSTLGNKSVQALTSAPSA 655
Query: 533 VLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSK 586
+ + A + YL L G L +L+ TGEL+D + LG +P+ L S K
Sbjct: 656 LSIMAMADSTSGGSTLYLHIGLYSGIYLRTVLDEVTGELSDTRTRFLGLKPVKLFQVSVK 715
Query: 587 NTTHVFAASDRPTVIYS--SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIG 644
V A S RP + YS K + + ++ + F+S + + + L I
Sbjct: 716 EQKAVLALSSRPWLGYSDLQTKAFMLTPLDYVGLEWGWNFSSEQCVEGMVGIQGQNLRIF 775
Query: 645 TIDDIQ-KLHIRSIPLGEHPRR-ICHQEQSRTFAI------------CSLKNQSCAEESE 690
+I+ ++ L +I L PR+ I H EQ + I L N S A +
Sbjct: 776 SIEKLENNLLQETISLTYTPRQFIKHPEQPLFYVIEADNNILSPATRTKLLNDSDAVNGD 835
Query: 691 M---------------HF---VRLLDD-QTFEFISTYPLDTFEYGCSILSCSFSD-DSNV 730
H+ ++++D + +S L+ E S+ + SFS D
Sbjct: 836 ATVLPPEDFGYPRGTGHWASCIQIVDPVNSKSVVSQIELEENEAAVSVAAVSFSSQDDET 895
Query: 731 YYCVGTAY-VLPEENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVYSLNAFNGKLLAAI 787
+ VGT ++ + G I ++ EDGK L+ I + + + +L F G+LLA I
Sbjct: 896 FLVVGTGKDMVVNPRSCSAGFIHIYRFQEDGKELEFIHKTKVEQPPVALLGFQGRLLAGI 955
Query: 788 NQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEG 847
+++Y +R R+ Q+ H L + +QT+G I+V D+ +S++ +++K +E
Sbjct: 956 GTDVRIYDLGMR-QMLRKCQASVVPH---LVVGLQTQGSRIIVSDVQESVTYVVFKSQEN 1011
Query: 848 AIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDEERG------- 897
+ D + W + ++D + G + NL+ +R K SE A ++ G
Sbjct: 1012 RLIPFVDDVISRWTTCTTMVDYETVAGGDKFGNLWLLRCPAKASEEADEDGSGAHLIHER 1071
Query: 898 --------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGV-IAS 948
RL +V ++ + + LV G +++ + G +G+ I
Sbjct: 1072 QYLQGAPNRLNLVAHFYPQDLPTSIQKAQLV-------TGGRDILVWTGLQGTVGMLIPF 1124
Query: 949 LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSR 1008
+ E+ F + L+ L + G +H +RS+ AK +DGDL E++L L
Sbjct: 1125 ISREEVDFFQSLEMQLAAQNPPLAGRDHLIYRSY-----YAPAKGTIDGDLCETYLLLPN 1179
Query: 1009 TRMDEISKTMNVSVEELCKRVEEL 1032
+ +I+ ++ SV E+ +++ ++
Sbjct: 1180 DKKQQIAGELDRSVREIERKIADM 1203
>gi|19115326|ref|NP_594414.1| U2 snRNP-associated protein Sap130 (predicted) [Schizosaccharomyces
pombe 972h-]
gi|73919127|sp|Q9UTT2.1|RSE1_SCHPO RecName: Full=Pre-mRNA-splicing factor prp12; AltName:
Full=Pre-mRNA-processing protein 12; AltName:
Full=Spliceosome-associated protein 130
gi|6451681|dbj|BAA86918.1| Prp12p/SAP130 [Schizosaccharomyces pombe]
gi|7981404|emb|CAB92100.1| U2 snRNP-associated protein Sap130 (predicted) [Schizosaccharomyces
pombe]
Length = 1206
Score = 201 bits (512), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 264/1176 (22%), Positives = 476/1176 (40%), Gaps = 177/1176 (15%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+L+ +G I + R G +D+L + ++ + +L+++ E ++L+
Sbjct: 59 ILNQNCFGIIRNVAPLRLTGFKRDYLVVTSDSGRITILEYNVEKNKLVPIYQETFGKSGI 118
Query: 61 RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLY 118
R G+ ID R +I + L V+ D++ L + LE + +I F
Sbjct: 119 RRVVPGEYLAIDAKGRAAMIASVEKNKLVYVLNRDSEANL--TISSPLEAHKANNICFHL 176
Query: 119 -----GCAKPTIVVLYQDNKDARHVKTYEVALKDK------------DFVEGPWSQNNLD 161
G A P L D + H T E + + V WS+ +D
Sbjct: 177 IGLDTGYANPIFAALEVDYSEIDHDSTREAFTSSEKVLSYYELDLGLNHVVKRWSKV-VD 235
Query: 162 NGADLLIPVP-----PPLCGVLIIGEETIVYCS-ANAFKAIPI-------RPSITKAYGR 208
+ +LIPVP P G L+I I Y AF IPI +I+ + +
Sbjct: 236 RNSYMLIPVPGGNDGP--SGTLVISNGWISYRHLQKAFHQIPILRRQAASANAISTPWNQ 293
Query: 209 VDADGSR------------------YLLGDHAGLLHLLVITHEKE-KVTGLKIELLGETS 249
V+++ + YLL G L L I H+ + V L+++
Sbjct: 294 VNSNSANDGPLIVSAVLHKMKGSFFYLLQTGDGDLLKLTIEHDGQGNVVELRLKYFDTVP 353
Query: 250 IASTISYLDNAVVYIGSSYGDSQLIKL---------------------------NLQPDA 282
+A ++ L +++ + +G+ QL + N+
Sbjct: 354 LAVQLNILKTGFLFVATEFGNHQLYQFENLGIDDDELEITSLDFQAQDNEVGTKNVHFGV 413
Query: 283 KG-SYVEVLERYVNLGPIVDFCVVDLERQGQG-QVVTCSGAYKDGSLRIVRNGIGINEQA 340
+G + ++E +L + D ++ G+ Q+ T G + SLR +R G+ E
Sbjct: 414 RGLQNLSLVEEIPSLYSLTDTLLMKAPSSGEANQLYTVCGRGSNSSLRQLRRGLETTEIV 473
Query: 341 SVELQGIK-GMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTL 399
+ EL G +W+L+ + D +D+++++SF + T +L++ + +EE GF S TL
Sbjct: 474 ASELPGAPIAIWTLKLNQTDVYDSYIILSFTNGTLVLSIG--ETVEEISDSGFLSSVSTL 531
Query: 400 FCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLV 459
+ LVQ+ +R + + + +EWK P V + N Q+++A G LV
Sbjct: 532 NARQMGRDSLVQIHPKGIRYIRANKQT--SEWKLPQDVYVVQSAINDMQIVVALSNGELV 589
Query: 460 YLEIGD----GILTEVKHAQ-LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLP 514
Y E+ D G L E + + L ++ L + P+ E S + D +VR+ SL
Sbjct: 590 YFEMSDDVEGGQLNEYQERKTLTANVTSLALGPVQEGSRRSNFMCLAC-DDATVRVLSL- 647
Query: 515 DLNLITKEHLGGE----------IIPRSVLLCAFEGIS--YLLCALGDGHLLNFLLNMKT 562
DL T E+L + IIP +V G+S YL L +G L ++++ +
Sbjct: 648 DL-YTTLENLSVQALSSPANSLCIIPMNV-----NGVSTLYLHIGLMNGVYLRTVIDVTS 701
Query: 563 GELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMC 622
G+L D + LG + + + + KN V A S R + YS + L S + + H
Sbjct: 702 GQLLDTRTRFLGPRAVKIYPITMKNQNTVLAVSSRTFLAYSYQQNLQLSPIAYSAIDHAS 761
Query: 623 PFNSAAFPDSLAIAKEGELTIGTIDDIQ-KLHIRSIPLGEHPRRICHQ---------EQS 672
F S P+ + ++ L I T+D +Q L PL PR+I +
Sbjct: 762 SFASEQCPEGIVAIQKNTLKIFTVDSLQDDLKSDIYPLICTPRKIVKHPNFPVLYILQSE 821
Query: 673 RTF--------------AICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCS 718
R F + K +S + F+ + D + + I PL E S
Sbjct: 822 RNFDSFKYAQENGDVGSSYTKEKQNEHTSKSWVSFISVFDMISKKIIHESPLGDNEAAFS 881
Query: 719 ILSCSFSDDSNVYYCVGTAYVLPEENEPTKG---RILVFIVEDGKLQLIAEKETKGAVYS 775
+ + F + + G+A + E R+ F E KL+LI+ E G +
Sbjct: 882 MTAAFFKNRDEFFLVAGSATNMDLECRTCSHGNFRVYRFHDEGKKLELISHTEIDGIPMA 941
Query: 776 LNAFNGKLLAAINQKIQLY----KWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVG 831
L F G++LA + + +++Y K MLR + + ++ + IVV
Sbjct: 942 LTPFQGRMLAGVGRFLRIYDLGNKKMLRKGELSAVP--------LFITHITVQASRIVVA 993
Query: 832 DLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR--KNSE 889
D S+ ++YK E+ + A D W + ++D D G + N++ +R ++
Sbjct: 994 DSQYSVRFVVYKPEDNHLLTFADDTIHRWTTTNVLVDYDTLAGGDKFGNIWLLRCPEHVS 1053
Query: 890 GATDEERGRLEVVGEYHLGEFVNRFRHG-SLVMRLPDSDV-----------GQIPTVIFG 937
DEE +++ H F+N H L+ +D+ G +++
Sbjct: 1054 KLADEENSESKLI---HEKPFLNSTPHKLDLMAHFFTNDIPTSLQKVQLVEGAREVLLWT 1110
Query: 938 TVNGVIGVIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLD 996
+ G +GV + E F ++L+ LRK + G +H +RS+ K V +D
Sbjct: 1111 GLLGTVGVFTPFINQEDVRFFQQLEFLLRKECPPLAGRDHLAYRSYYAPVKCV-----ID 1165
Query: 997 GDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
GDL E + L + I+ ++ ++ E+ K++E+
Sbjct: 1166 GDLCEMYYSLPHPVQEMIANELDRTIAEVSKKIEDF 1201
>gi|193786710|dbj|BAG52033.1| unnamed protein product [Homo sapiens]
Length = 897
Score = 201 bits (512), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 217/918 (23%), Positives = 396/918 (43%), Gaps = 150/918 (16%)
Query: 237 VTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLN------------------- 277
VT ++++ +A+ + L +++ S +G+ L ++
Sbjct: 2 VTEIRLKYFDTVPVAAAMCVLKTGFLFVASEFGNHYLYQIAHLGDDDEEPEFSSAMPLEE 61
Query: 278 -----LQPDAKGSYVEVLERYVNLGPIVDFC-VVDLERQGQGQVVTCSGAYKDGSLRIVR 331
QP + V +++ +L PI+ FC + DL + Q+ G SLR++R
Sbjct: 62 GDTFFFQPRPLKNLV-LVDELDSLSPIL-FCQMADLANEDTPQLYVACGRGPRSSLRVLR 119
Query: 332 NGIGINEQASVELQG-IKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIE 390
+G+ ++E A EL G +W++R +D FD +++VSF++ T +L++ + +EE
Sbjct: 120 HGLEVSEMAVSELPGNPNAVWTVRRHIEDEFDAYIIVSFVNATLVLSIG--ETVEEVTDS 177
Query: 391 GFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVL 450
GF T TL C + LVQV +R + + R NEWK+P ++ N QV+
Sbjct: 178 GFLGTTPTLSCSLLGDDALVQVYPDGIRHIRADKR--VNEWKTPGKKTIVKCAVNQRQVV 235
Query: 451 LATGGGHLVYLEIG-DGILTE-VKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISV 508
+A GG LVY E+ G L E + ++ ++ C+ + + S+ AVG+ D +V
Sbjct: 236 IALTGGELVYFEMDPSGQLNEYTERKEMSADVVCMSLANVPPGEQRSRFLAVGL-VDNTV 294
Query: 509 RIFSLPDLNLITKEHLGGEIIP-RSVLLCAFE----------------GISYLLCALGDG 551
RI SL + + + L + +P + LC E G YL L +G
Sbjct: 295 RIISLDPSDCL--QPLSMQALPAQPESLCIVEMGGTEKQDELGERGSIGFLYLNIGLQNG 352
Query: 552 HLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYS 611
LL +L+ TG+L+D + LG++P+ L + V A S R + YS + +
Sbjct: 353 VLLRTVLDPVTGDLSDTRTRYLGSRPVKLFRVRMQGQEAVLAMSSRSWLSYSYQSRFHLT 412
Query: 612 NVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQ 669
++ + + F S P+ + L I ++ + + + + PL PR+ + H
Sbjct: 413 PLSYETLEFASGFASEQCPEGIVAISTNTLRILALEKLGAVFNQVAFPLQYTPRKFVIHP 472
Query: 670 EQSR-----------TFAICSLKNQSCAEE-------------SEM-------------- 691
E + T A + + Q AEE +EM
Sbjct: 473 ESNNLIIIETDHNAYTEATKAQRKQQMAEEMVEAAGEDERELAAEMAAAFLNENLPESIF 532
Query: 692 -----------HFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC-VGTAYV 739
+R+++ + L+ E S+ C FS+ +Y VG A
Sbjct: 533 GAPKAGNGQWASVIRVMNPIQGNTLDLVQLEQNEAAFSVAVCRFSNTGEDWYVLVGVAKD 592
Query: 740 LPEENEPTKGRILVF--IVEDG-KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKW 796
L G + +V +G KL+ + + + ++ F G++L + + +++Y
Sbjct: 593 LILNPRSVAGGFVYTYKLVNNGEKLEFLHKTPVEEVPAAIAPFQGRVLIGVGKLLRVY-- 650
Query: 797 MLRDDGTRELQSECGHHGHILALY---VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
D G ++L +C + HI A Y +QT G ++V D+ +S + YK E + A
Sbjct: 651 ---DLGKKKLLRKC-ENKHI-ANYISGIQTIGHRVIVSDVQESFIWVRYKRNENQLIIFA 705
Query: 854 RDYNANWMSAVEILDDDIYLGAENNFNLFTVR--KNSEGATDE---------ERGRL--- 899
D W++ +LD D GA+ N+ VR N+ DE +RG L
Sbjct: 706 DDTYPRWVTTASLLDYDTVAGADKFGNICVVRLPPNTNDEVDEDPTGNKALWDRGLLNGA 765
Query: 900 ----EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQY 954
EV+ YH+GE V + +L+ G ++++ T++G IG++ HE +
Sbjct: 766 SQKAEVIMNYHVGETVLSLQKTTLI-------PGGSESLVYTTLSGGIGILVPFTSHEDH 818
Query: 955 LFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEI 1014
F + ++ +LR + G +H +RS+ KN +DGDL E F + + +
Sbjct: 819 DFFQHVEMHLRSEHPPLCGRDHLSFRSY-----YFPVKNVIDGDLCEQFNSMEPNKQKNV 873
Query: 1015 SKTMNVSVEELCKRVEEL 1032
S+ ++ + E+ K++E++
Sbjct: 874 SEELDRTPPEVSKKLEDI 891
>gi|238487250|ref|XP_002374863.1| nuclear mRNA splicing factor, putative [Aspergillus flavus NRRL3357]
gi|220699742|gb|EED56081.1| nuclear mRNA splicing factor, putative [Aspergillus flavus NRRL3357]
Length = 1210
Score = 201 bits (512), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 280/1182 (23%), Positives = 493/1182 (41%), Gaps = 188/1182 (15%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
++G I +L FR G +D++ I ++ + ++++ S R + + G R
Sbjct: 64 VFGIIRSLAAFRLAGSNKDYIIIGSDSGRITIIEY-VPSQNRFNRIHLETFGKSGVRRVI 122
Query: 65 NGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YG 119
GQ +DP R LI + L V+ +++ +L + + + Q L G
Sbjct: 123 PGQYLAVDPKGRACLIASVEKNKLVYVLNRNSQAELTISSPLEAHKPQTLVFAMSALDVG 182
Query: 120 CAKPTIVVLYQDNKDARHVKT---YEVALK---------DKDFVEGPWSQNNLDNGADLL 167
P L D ++ T YE A K + V W+ + +D + +L
Sbjct: 183 YENPIFAALEVDYSESDQDPTGQAYEEAEKLLVYYELDLGLNHVVRKWA-DPVDRTSSML 241
Query: 168 IPVP-----PPLCGVLIIGEETIVYCSAN--AFKA-IPIRPSITKAYGRVDADGSRYLLG 219
VP P GVL+ E+++ Y +N AF+ IP R T+ + + R++
Sbjct: 242 FQVPGGADGP--SGVLVCAEDSVTYRHSNQDAFRVPIPRRSGPTE-----NPERKRFIT- 293
Query: 220 DHAGLLHLL---------------------VITHEKEKVTG----LKIELLGETSIASTI 254
AG++H + ++ + ++TG LKI+ +AS++
Sbjct: 294 --AGVMHKMRGAFFFLLQTEDGDLFKLNIDMVEDDNGQLTGEVKRLKIKYFDTVPVASSL 351
Query: 255 SYLDNAVVYIGSSYGDSQLIKL---------------NLQPDAK------------GSYV 287
L + +Y+ S G+ + N D +
Sbjct: 352 LILKSGFLYVASEAGNHHFYQFEKLGDDDEEIEFSSENFSADPSVPLEPIYFRPRSAENL 411
Query: 288 EVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI 347
++E +L P++D V +L + Q+ T SG + R +++G+ ++E EL +
Sbjct: 412 NLVETINSLNPLIDSKVANLSEEDAPQIYTISGTGARSTFRTLKHGLEVSEIVDSELPSV 471
Query: 348 -KGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIY 406
+W+ + + D FD ++++SF + T +L++ + +EE GF S TL
Sbjct: 472 PSAVWTTKLTRADEFDAYIILSFANGTLVLSIG--ETVEEVTDTGFLSSAPTLAVQQLGE 529
Query: 407 NQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GD 465
+ L+Q+ +R + + R NEW +P S+ A N QV +A G +VY E+ D
Sbjct: 530 DSLIQIHPRGIRHIMADRR--VNEWPAPQHRSIVAAATNERQVAVALSSGEIVYFEMDAD 587
Query: 466 GILTEV-KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEH 523
G L E + Q+ ++CL + + E S AVG D +VRI SL PD L K
Sbjct: 588 GTLAEYDERRQMSGTVTCLSLGEVPEGRVRSSFLAVGC-DDSTVRILSLDPDSTLENKSV 646
Query: 524 LGGEIIPRSVLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQP 577
P ++ + + S YL L G L +L+ TGEL+D + LG++P
Sbjct: 647 QALTAAPSALNIMSMSDSSSGGTTLYLHIGLYSGVYLRTVLDEVTGELSDTRTRFLGSKP 706
Query: 578 ITLRTFSSKNTTHVFAASDRPTVIYS--SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAI 635
+ L S K T V A S RP + YS K + + ++ + F+S + +
Sbjct: 707 VKLFQVSVKGQTAVLALSSRPWLGYSDIQTKGFMLTPLDYVPLEWGWNFSSEQCLEGMVG 766
Query: 636 AKEGELTIGTIDDI-QKLHIRSIPLGEHPRRIC-HQEQSRTFAICS----LKNQSCAEES 689
+ L I +I+ + + +SI L PRR H EQ + I S L + A+
Sbjct: 767 IQGQNLRIFSIEKLDNNMLQQSISLAYTPRRFLKHPEQPLFYVIESDNNVLSPSTRAKLL 826
Query: 690 EMHFVRLLDDQTF---EF------------------------ISTYPLDTFEYGCSILSC 722
E R D+ EF +ST L+ E S+ +
Sbjct: 827 EDSKARTGDETVLPPEEFGYPRGSGHWASCIQVVDPVHAKAVVSTIELEENEAAVSVAAV 886
Query: 723 SF-SDDSNVYYCVGTAYVLPEENEPTK--GRILVF-IVEDGK-LQLIAEKETKGAVYSLN 777
F S D + VGTA + N P+ G I ++ EDG+ L+ I + + + +L
Sbjct: 887 PFTSQDDETFLVVGTAKDM-NVNPPSSAGGYIHIYRFQEDGRELEFIHKTKVEEPPLALL 945
Query: 778 AFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHH---GHILALYVQTRGDFIVVGDLM 834
F G+L+A I +++Y D G ++L +C I+ L QT+G IVV D+
Sbjct: 946 GFQGRLVAGIGPMLRIY-----DLGMKQLLRKCNAQVVPKTIVGL--QTQGSRIVVSDVR 998
Query: 835 KSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGA 891
+S++ ++YK++E + D + W ++ ++D + G + N++ +R K SE A
Sbjct: 999 ESVTYVVYKYQENVLIPFVDDSVSRWTTSTTMVDYETTAGGDKFGNIWMLRCPKKISEQA 1058
Query: 892 TDE--------ERG-------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIF 936
++ ERG RLE++ + + LV G +++
Sbjct: 1059 DEDGSGAHLIHERGYLHGTPNRLELMIHVYTQDIPTTLHKTQLV-------AGGRDILVW 1111
Query: 937 GTVNGVIGVIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFL 995
+G IG++ + E F + L+ L + G +H +RS+ K V +
Sbjct: 1112 SGFHGTIGMLVPFVSREDVDFFQNLEMQLAAQNPPLAGRDHLIYRSYYAPVKGV-----I 1166
Query: 996 DGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEE-LTRLH 1036
DGDL E++ L I+ ++ SV E+ +++ TR H
Sbjct: 1167 DGDLCETYFLLPNDTKMMIAAELDRSVREIERKISVCFTRYH 1208
>gi|212531303|ref|XP_002145808.1| nuclear mRNA splicing factor, putative [Talaromyces marneffei ATCC
18224]
gi|210071172|gb|EEA25261.1| nuclear mRNA splicing factor, putative [Talaromyces marneffei ATCC
18224]
Length = 1209
Score = 201 bits (511), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 270/1167 (23%), Positives = 491/1167 (42%), Gaps = 167/1167 (14%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
++G I +L FR G ++D++ I ++ + ++++ S R + + G R
Sbjct: 64 VFGIIRSLAAFRLAGSSKDYIIIGSDSGRITIIEY-VPSQNRFNRIHLETFGKSGVRRVI 122
Query: 65 NGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YG 119
GQ +DP R LI + L V+ +++ +L + + + Q + G
Sbjct: 123 PGQYLAVDPKGRACLIASVEKNKLVYVLNRNSQAELTISSPLEAHKAQTVVFALTALDVG 182
Query: 120 CAKP---TIVVLYQDNKDARHVKTYEVALKDKDFVE---------GPWSQNNLDNGADLL 167
P I V Y ++ + YE K + E WS + + +++L
Sbjct: 183 YDNPIFAAIEVDYSESDQDPTGRAYEEVEKQLVYYELDLGLNHVVRRWS-DVIHRTSNIL 241
Query: 168 IPVPPPL---CGVLIIGEETIVYCSAN--AFKA-IPIR--PSITKAYGRVDADGSR---- 215
VP GVL+ E+++ Y +N AF+ IP R P+ R G
Sbjct: 242 FQVPGGADGPSGVLVCAEDSVTYRHSNQDAFRVPIPRRNGPTENPERKRSIVAGVMHKMR 301
Query: 216 ----YLLGDHAGLLHLLVI---THEKEKVTG----LKIELLGETSIASTISYLDNAVVYI 264
+LL G L L I E ++TG LKI+ IA+ + L + +++
Sbjct: 302 GAFFFLLQTEDGDLFKLTIDMVEDENGQLTGEVKRLKIKYFDTVPIAANLLILKSGFLFV 361
Query: 265 GSSYGDSQLIKLN-----------------------LQP---DAKGS-YVEVLERYVNLG 297
S G+ + + P + +G+ + ++E +L
Sbjct: 362 ASETGNHHFYQFEKLGDDDEETEFSSDDVSADPADPINPVYFNLRGAENLNLVESINSLN 421
Query: 298 PIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSLRSS 356
PI+D + D + Q+ T G + R +++G+ ++E EL + +W+ + +
Sbjct: 422 PIMDCKITDAIGEDAPQIYTICGTGARSTFRTLKHGLEVSEIVESELPSVPSAVWTTKLT 481
Query: 357 TDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGS 416
+D FD ++V+SF + T +L++ + +EE GF S TL + L+QV
Sbjct: 482 RNDEFDAYIVLSFTNGTLVLSIG--ETVEEVTDTGFLSTAPTLAVQQLGEDSLIQVHPKG 539
Query: 417 VRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTEV-KHA 474
+R + + R NEW +P ++ A N QV +A G +VY E+ DG L E +
Sbjct: 540 IRHILADHR--VNEWPAPQHRTIVAAATNERQVAVALSSGEIVYFEMDTDGSLAEYDEKR 597
Query: 475 QLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPRSV 533
Q+ ++ L + + E S AVG D +VRI SL PD L K P ++
Sbjct: 598 QMSGTVTSLSLGEVPEGRVRSSFLAVGC-DDSTVRILSLDPDSTLENKSVQALTSAPSAL 656
Query: 534 -LLCAFEGIS-----YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKN 587
++ + S YL L G L +L+ TGEL+D + LG +P+ L S K
Sbjct: 657 NIMSMIDSTSGGSTLYLHIGLYSGVYLRTVLDEVTGELSDTRTRFLGAKPVKLFGVSVKG 716
Query: 588 TTHVFAASDRPTVIYS--SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGT 645
V A S RP + YS K + ++++ + F+S + + + L I +
Sbjct: 717 QPAVLALSSRPWLGYSDVQTKSFMLTSLDYVGLEWGWNFSSEQCLEGMVGIQGQNLRIFS 776
Query: 646 IDDIQKLHIR-SIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDD----- 699
I+ + ++ SIPL PR + F + +N A ++ RLL++
Sbjct: 777 IEKLDNNVLQESIPLAYTPRHFVKHPEQPLFYVIESENNVLAPATQ---TRLLEESKLQN 833
Query: 700 --------QTFEF----------------------ISTYPLDTFEYGCSILSCSF-SDDS 728
+TF + +S L+ E SI + SF S D+
Sbjct: 834 GEAVIPPAETFGYPRATGHWASCIEVVDPVNAKSVLSRLELEENESAVSIAAVSFASQDN 893
Query: 729 NVYYCVGTAY-VLPEENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVYSLNAFNGKLLA 785
+ VGT V+ + G I ++ EDG+ L+ I + + + +L AF G+L+A
Sbjct: 894 ETFLVVGTGKDVVTYPRSFSAGFIHIYRFQEDGRELEFIHKTKIEEPPLALLAFQGRLVA 953
Query: 786 AINQKIQLYKWMLRDDGTRELQSECGHHG-HILALYVQTRGDFIVVGDLMKSISLLIYKH 844
I + +++Y D G +++ +C L + +QT+G I+V D+ +S++ ++YK+
Sbjct: 954 GIGKNLRIY-----DLGMKQMLRKCQVEAVPNLIVGLQTQGSRIIVSDVQESVTYVVYKY 1008
Query: 845 EEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEER-------- 896
+E + D A W +A ++D + G + NL+ VR + + D +
Sbjct: 1009 QENQLIPFVDDVIARWTTATTMVDYETTAGGDKFGNLWLVRCPQKASDDSDEDGSGAHLI 1068
Query: 897 ----------GRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVI 946
RL ++ Y+ + +LV VG +++ + G IG++
Sbjct: 1069 HERSYLQGTANRLNLMIHYYTQDIPTSLHKTNLV-------VGGRDILVWTGLQGTIGIM 1121
Query: 947 AS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLD 1005
+ E F + L+T L + G +H +RS+ V +K +DGDL ES+
Sbjct: 1122 VPFISREDVDFFQNLETQLASQNPPLAGRDHLIYRSY-----YVPSKGVIDGDLCESYFL 1176
Query: 1006 LSRTRMDEISKTMNVSVEELCKRVEEL 1032
L + I+ ++ SV E+ +++ ++
Sbjct: 1177 LPNDKKLMIAGELDRSVREIERKISDM 1203
>gi|24654874|ref|NP_728546.1| CG13900, isoform A [Drosophila melanogaster]
gi|23092721|gb|AAF47416.2| CG13900, isoform A [Drosophila melanogaster]
gi|60678131|gb|AAX33572.1| LD01809p [Drosophila melanogaster]
gi|220950356|gb|ACL87721.1| CG13900-PA [synthetic construct]
gi|289803030|gb|ADD20765.1| FI04459p [Drosophila melanogaster]
Length = 1227
Score = 201 bits (511), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 260/1199 (21%), Positives = 470/1199 (39%), Gaps = 197/1199 (16%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+L I+G + L FR G +D++ + ++ + +L+++ + L
Sbjct: 53 LLSTEIFGCVRALMAFRLTGGTKDYIVVGSDSGRIVILEYNPSKNALEKVHQETFGKSGC 112
Query: 61 RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL- 117
R GQ IDP R +IG L ++ D + +L + + + L +
Sbjct: 113 RRIVPGQYFAIDPKGRAVMIGAVEKQKLAYIMNRDTQARLTISSPLEAHKSNTLTYHMVG 172
Query: 118 --YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNGA 164
G P L D ++A + + A + + F E N+ L+ A
Sbjct: 173 VDVGFDNPMFACLEIDYEEADMDPSGDAAQRTQQTLTFYELDLGLNHVVRKYSEPLEEHA 232
Query: 165 DLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR---- 215
+ L+ VP P GVLI E + Y N IR I + +D D R
Sbjct: 233 NFLVSVPGGNDGP--SGVLICSENYLTY--KNLGDQHDIRCPIPRRRNDLD-DPERGMIF 287
Query: 216 -------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVV 262
+LL G + + + + + V+ +K++ A+ + L +
Sbjct: 288 ICSATHRTKSMYFFLLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPATAMCVLKTGFL 347
Query: 263 YIGSSYGDSQLIKLN------------------------LQPDAKGSYVEVLERYVNLGP 298
++ S +G+ L ++ P A + V +++ + P
Sbjct: 348 FVASEFGNHYLYQIAHLGDDDDEPEFSSAMPLEEGETFFFAPRALKNLV-LVDELPSFAP 406
Query: 299 IVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSST 357
I+ V DL + Q+ G +LR++R+G+ ++E A EL G +W+++
Sbjct: 407 IITSQVADLANEDTPQLYVLCGRGPRSTLRVLRHGLEVSEMAVSELPGNPNAVWTVKKRA 466
Query: 358 DDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSV 417
DD FD +++VSF++ T +L++ + +EE GF T TL C + LVQV +
Sbjct: 467 DDEFDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLCCAALGDDALVQVYPDGI 524
Query: 418 RLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD-GILTE-VKHAQ 475
R + S R NEWK+P S+ N QV++ G LVY E+ G L E + ++
Sbjct: 525 RHIRSDKR--VNEWKAPGKKSITKCAVNQRQVVITLSGRELVYFEMDPTGELNEYTERSE 582
Query: 476 LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLL 535
+ EI C+ + + E S AVG+ D +VRI SL N +T + P L
Sbjct: 583 MPAEIMCMALGTVPEGEQRSWFLAVGL-ADNTVRILSLDPNNCLTPCSMQALPSPAES-L 640
Query: 536 CAFE-------------------------GISYLLCALGDGHLLNFLLNMKTGELTDRKK 570
C E G YL L +G LL +L+ +G+L D +
Sbjct: 641 CLVEMGHTESTTQGGLDDDAPAQRSGNNKGTIYLNIGLSNGVLLRTVLDPVSGDLADTRT 700
Query: 571 VSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFP 630
LG++P+ L + + V A S R + Y + + ++ + + + F+S
Sbjct: 701 RYLGSRPVKLFRIKMQGSEAVLAMSSRTWLSYYHQNRFHLTPLSYETLEYASGFSSEQCS 760
Query: 631 DSLAIAKEGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQEQSR-----------TFAI 677
+ + L I ++ + + + + PL PR + H + R T
Sbjct: 761 EGIVAISTNTLRILALEKLGAVFNQVAFPLQYTPRTFVIHPDTGRMLIAETDHNAYTEDT 820
Query: 678 CSLKNQSCAEE-------------SEM-------------------------HFVRLLDD 699
S + + AEE EM +R LD
Sbjct: 821 KSARKEQMAEEMRSAAGDEERELAREMANAFINEVLPEDVFSSPKAGLGLWASQIRCLDA 880
Query: 700 QTFEFISTYPLDTFE--YGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVE 757
+ + + PL E ++L S + D Y VG A L ++G +
Sbjct: 881 MHGQTMFSVPLTQNEAIMSMAMLKFSIAADGRYYLAVGIAKDLQLNPRISQGGCIDIYKI 940
Query: 758 D---GKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSEC-GHH 813
D L+ + + +L F G+LLA + +++Y D G +++ +C H
Sbjct: 941 DPTCSSLEFMHRTDIDEIPGALCGFQGRLLAGCGRMLRIY-----DFGKKKMLRKCENKH 995
Query: 814 GHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYL 873
+ +Q G + V D+ +S+ + Y+ E + A D + W++A +LD D +
Sbjct: 996 IPYQIVNIQAMGHRVYVSDVQESVFFIRYRRAENQLIIFADDTHPRWVTATTLLDYDT-I 1054
Query: 874 GAENNFNLFTVRKNSEGATDE------------ERGRL-------EVVGEYHLGEFVNRF 914
+ F ++++ TD+ +RG L E + +H+GE +
Sbjct: 1055 AIADKFGNLSIQRLPHSVTDDVDEDPTGTKSLWDRGLLSGASQKSENICSFHVGEIIMSL 1114
Query: 915 RHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGG 973
+ +L+ G +I+ T++G +G E Y F + L+ ++R + G
Sbjct: 1115 QKATLI-------PGGSEALIYATLSGTVGAFVPFTSREDYDFFQHLEMHMRNENPPLCG 1167
Query: 974 LNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
+H +RS KN LDGDL E +L + + I+ M + ++CK++E++
Sbjct: 1168 RDHLSYRS-----SYYPVKNVLDGDLCEQYLSIEAAKQKSIAGDMFRTPNQICKKLEDI 1221
>gi|17508021|ref|NP_491953.1| Protein TEG-4 [Caenorhabditis elegans]
gi|351060889|emb|CCD68627.1| Protein TEG-4 [Caenorhabditis elegans]
Length = 1220
Score = 201 bits (511), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 259/1185 (21%), Positives = 491/1185 (41%), Gaps = 187/1185 (15%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESS---ELITRAMGDVSDRIGRP 62
I+G + +L FR +DF+ + ++ + +LQ++AE + L G R P
Sbjct: 59 IFGIVRSLLAFRLTAGTRDFIAVGSDSGRIVILQYNAEKTCFERLHQETFGKTGCRRIVP 118
Query: 63 TDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL--- 117
G + DP R +IG L ++ D++ L + + + L +
Sbjct: 119 ---GHFLVGDPRGRALMIGAVERQKLVYIMNRDSEAHLTISSPLEAHKHHTLCYAMVGID 175
Query: 118 YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNGADL 166
G PT L D +DA + T E A + + F E N+ L++ +L
Sbjct: 176 VGFENPTFACLEFDYEDADNDPTGEAAKRTQQTLTFYELDLGLNHVVRKYAEPLNDPGNL 235
Query: 167 LIPVP---PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR-------- 215
LI VP GV++ E +VY N IR I + +D D R
Sbjct: 236 LIAVPGGNDGPSGVIVCCENYLVY--KNLGDQPDIRCPIPRRRNELD-DADRTMLIIATA 292
Query: 216 ---------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGS 266
+L+ G + + + +++ V+ +K++ A+ + L + +++ +
Sbjct: 293 THKTKNMYFFLVQAENGDIFKVTLETDEDLVSEMKLKYFDTVPPANALCILKSGFLFVAA 352
Query: 267 SYGDSQLIKLN-----------------------LQPDAKGSYVEVLERYVNLGPIVDFC 303
+G+ +L ++ +P S + + + +L P+ D
Sbjct: 353 EFGNHELYQIASLGEGDDDEFSSAMGFGENDAAFFEPHELKSLIPI-DSMDSLSPLTDAV 411
Query: 304 VVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTDDPFD 362
+ D+ R+ Q+ + G SL+++RNG+ I+E A +L G +W+++ + +D +D
Sbjct: 412 IGDIAREDAAQIYSLVGRGARSSLKVLRNGLEISEMAVSDLPGNPNAVWTVKKNIEDQYD 471
Query: 363 TFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSS 422
+++VVSF++ T LA+ + D +EE GF T T+ C + LVQ+ S +R + +
Sbjct: 472 SYIVVSFVNAT--LALTIGDTVEEASDSGFLPTTPTIGCAMIGDDSLVQIYSEGIRHIRA 529
Query: 423 TSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTEVKHAQL-EYEI 480
R NEWK+PP + N QV +A GG LVY E+ +G L E +L +I
Sbjct: 530 DKR--INEWKAPPRRQIVKCAVNRRQVAVALTGGELVYFELDLNGTLNEFTERKLFNADI 587
Query: 481 SCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPRSVLLCAF- 538
+C+ + I E S+ A+G D +VRI SL P+ L+ P S+LL
Sbjct: 588 ACMTFSEISEGELNSRFLALGT-VDNAVRIISLDPNDMLMPLSTQSLPCPPESILLIDTP 646
Query: 539 ----EGIS--YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
+G++ +L L +G L ++ TG + D + LGT+P+ L + + +
Sbjct: 647 NEDGKGVAAVHLNIGLQNGCLFRNTVDNVTGAIMDTRTRYLGTRPVKLFKVQCQGRSAIL 706
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTI------GTI 646
S R ++Y ++ + ++ + + F S + + L I G
Sbjct: 707 CTSSRSWLLYHFQRRFHLTPLSYANLEYAASFCSNQCSEGIVAISASTLRIIAAEKLGVA 766
Query: 647 DDIQKLHIRSIP--LGEHPRRIC-----HQEQSRTFAICSLKNQSCAEESEMHFVRLLDD 699
++Q + P + HP C S T ++K A + E +
Sbjct: 767 FNVQSFEHKMTPRRVAVHPSMPCLIVIETDHASYTEVTKNIKRNQMAADVEAMASDETEA 826
Query: 700 QTFEFIST----YPLDTFEYGC------------SILSCSFSDDSNVY--------YCVG 735
Q + I+T LD YG S++S + D + + CV
Sbjct: 827 QLAQEIATNLRERRLDERVYGAPRAARGKWASAISLISATSGDKLSYFELPQDENAKCVA 886
Query: 736 TAYVLPEENE--------------------------PTKGRILVF-IVEDG-KLQLIAEK 767
NE PT+G + F + +G + +
Sbjct: 887 LVQFSKHPNEAMVLVGCGVNEVLNVHDIDPNDTSIRPTRGCVYTFHLSANGDRFDFLHRT 946
Query: 768 ETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY-VQTRGD 826
ET V +++ F G L + +++Y D G ++L ++C + +++ +Q+ G
Sbjct: 947 ETPLPVGAIHDFRGMALVGFGRFLRMY-----DIGQKKLLAKCENKNFPVSIVNIQSTGQ 1001
Query: 827 FIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR- 885
I+V D +S+ L Y+ + + A D +++ V +LD A+ NL VR
Sbjct: 1002 RIIVSDSQESVHFLRYRKGDNQLVVFADDTTPRYVTCVCVLDYHTVAVADKFGNLAVVRL 1061
Query: 886 --KNSEGATDE--------ERG-------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV 928
+ +E D+ +RG ++E+V + +G+ + + SL +P
Sbjct: 1062 PERVNEDVQDDPTVSKSVWDRGWLNGASQKVELVSNFFIGDTITSLQKTSL---MP---- 1114
Query: 929 GQIPTVIFGTVNGVIGVIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKK 987
G +++ T+ G IG + S + ++ F L+ ++R + G +H +RS+ K
Sbjct: 1115 GANEALVYTTIGGAIGCLVSFMSKDEVDFFTNLEMHVRSEYPPLCGRDHLAYRSYYAPCK 1174
Query: 988 TVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
+V +DGD+ E F + + ++++ + +V E+ K++E++
Sbjct: 1175 SV-----IDGDICEQFSLMDTQKQKDVAEELGKTVSEISKKLEDI 1214
>gi|83767504|dbj|BAE57643.1| unnamed protein product [Aspergillus oryzae RIB40]
Length = 1270
Score = 201 bits (511), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 279/1181 (23%), Positives = 494/1181 (41%), Gaps = 188/1181 (15%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
++G I +L FR G +D++ I ++ + ++++ S R + + G R
Sbjct: 64 VFGIIRSLAAFRLAGSNKDYIIIGSDSGRITIIEY-VPSQNRFNRIHLETFGKSGVRRVI 122
Query: 65 NGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YG 119
GQ +DP R LI + L V+ +++ +L + + + Q L G
Sbjct: 123 PGQYLAVDPKGRACLIASVEKNKLVYVLNRNSQAELTISSPLEAHKPQTLVFAMSALDVG 182
Query: 120 CAKPTIVVLYQDNKDARHVKT---YEVALK---------DKDFVEGPWSQNNLDNGADLL 167
P L D ++ T YE A K + V W+ + +D + +L
Sbjct: 183 YENPIFAALEVDYSESDQDPTGQAYEEAEKLLVYYELDLGLNHVVRKWA-DPVDRTSSML 241
Query: 168 IPVP-----PPLCGVLIIGEETIVYCSAN--AFKA-IPIRPSITKAYGRVDADGSRYLLG 219
VP P GVL+ E+++ Y +N AF+ IP R T+ + + R++
Sbjct: 242 FQVPGGADGP--SGVLVCAEDSVTYRHSNQDAFRVPIPRRSGPTE-----NPERKRFIT- 293
Query: 220 DHAGLLHLL---------------------VITHEKEKVTG----LKIELLGETSIASTI 254
AG++H + ++ + ++TG LKI+ +AS++
Sbjct: 294 --AGVMHKMRGAFFFLLQTEDGDLFKLNIDMVEDDNGQLTGEVKRLKIKYFDTVPVASSL 351
Query: 255 SYLDNAVVYIGSSYGDSQLIKL---------------NLQPDAK------------GSYV 287
L + +Y+ S G+ + N D +
Sbjct: 352 LILKSGFLYVASEAGNHHFYQFEKLGDDDEEIEFSSENFSADPSVPLEPIYFRPRSAENL 411
Query: 288 EVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI 347
++E +L P++D V +L + Q+ T SG + R +++G+ ++E EL +
Sbjct: 412 NLVETINSLNPLIDSKVANLSEEDAPQIYTISGTGARSTFRTLKHGLEVSEIVDSELPSV 471
Query: 348 -KGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIY 406
+W+ + + D FD ++++SF + T +L++ + +EE GF S TL
Sbjct: 472 PSAVWTTKLTRADEFDAYIILSFANGTLVLSIG--ETVEEVTDTGFLSSAPTLAVQQLGE 529
Query: 407 NQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GD 465
+ L+Q+ +R + + R NEW +P S+ A N QV +A G +VY E+ D
Sbjct: 530 DSLIQIHPRGIRHIMADRRV--NEWPAPQHRSIVAAATNERQVAVALSSGEIVYFEMDAD 587
Query: 466 GILTEV-KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEH 523
G L E + Q+ ++CL + + E S AVG D +VRI SL PD L K
Sbjct: 588 GTLAEYDERRQMSGTVTCLSLGEVPEGRVRSSFLAVGC-DDSTVRILSLDPDSTLENKSV 646
Query: 524 LGGEIIPRSVLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQP 577
P ++ + + S YL L G L +L+ TGEL+D + LG++P
Sbjct: 647 QALTAAPSALNIMSMSDSSSGGTALYLHIGLYSGVYLRTVLDEVTGELSDTRTRFLGSKP 706
Query: 578 ITLRTFSSKNTTHVFAASDRPTVIYS--SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAI 635
+ L S K T V A S RP + YS K + + ++ + F+S + +
Sbjct: 707 VKLFQVSVKGQTAVLALSSRPWLGYSDIQTKGFMLTPLDYVPLEWGWNFSSEQCLEGMVG 766
Query: 636 AKEGELTIGTIDDI-QKLHIRSIPLGEHPRRIC-HQEQSRTFAICS----LKNQSCAEES 689
+ L I +I+ + + +SI L PRR H EQ + I S L + A+
Sbjct: 767 IQGQNLRIFSIEKLDNNMLQQSISLAYTPRRFLKHPEQPLFYVIESDNNVLSPSTRAKLL 826
Query: 690 EMHFVRLLDDQTF---EF------------------------ISTYPLDTFEYGCSILSC 722
E R D+ EF +ST L+ E S+ +
Sbjct: 827 EDSKARTGDETVLPPEEFGYPRGSGHWASCIQVVDPVHAKAVVSTIELEENEAAVSVAAV 886
Query: 723 SF-SDDSNVYYCVGTAYVLPEENEPTK--GRILVF-IVEDGK-LQLIAEKETKGAVYSLN 777
F S D + VGTA + N P+ G I ++ EDG+ L+ I + + + +L
Sbjct: 887 PFTSQDDETFLVVGTAKDM-NVNPPSSAGGYIHIYRFQEDGRELEFIHKTKVEEPPLALL 945
Query: 778 AFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHH---GHILALYVQTRGDFIVVGDLM 834
F G+L+A I +++Y D G ++L +C I+ L QT+G IVV D+
Sbjct: 946 GFQGRLVAGIGPMLRIY-----DLGMKQLLRKCNAQVVPKTIVGL--QTQGSRIVVSDVR 998
Query: 835 KSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGA 891
+S++ ++YK++E + D + W ++ ++D + G + N++ +R K SE A
Sbjct: 999 ESVTYVVYKYQENVLIPFVDDSVSRWTTSTTMVDYETTAGGDKFGNIWMLRCPKKISEQA 1058
Query: 892 TDE--------ERG-------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIF 936
++ ERG RLE++ + + LV G +++
Sbjct: 1059 DEDGSGAHLIHERGYLHGTPNRLELMIHVYTQDIPTTLHKTQLV-------AGGRDILVW 1111
Query: 937 GTVNGVIGVIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFL 995
+G IG++ + E F + L+ L + G +H +RS+ K V +
Sbjct: 1112 SGFHGTIGMLVPFVSREDVDFFQNLEMQLAAQNPPLAGRDHLIYRSYYAPVKGV-----I 1166
Query: 996 DGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVE-ELTRL 1035
DGDL E++ L I+ ++ SV E+ +++ +TR+
Sbjct: 1167 DGDLCETYFLLPNDTKMMIAAELDRSVREIERKISLRVTRM 1207
>gi|195169735|ref|XP_002025674.1| GL20829 [Drosophila persimilis]
gi|194109167|gb|EDW31210.1| GL20829 [Drosophila persimilis]
Length = 1225
Score = 201 bits (510), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 264/1199 (22%), Positives = 465/1199 (38%), Gaps = 199/1199 (16%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+L I+G I L FR G +D++ + ++ + +L++ + L
Sbjct: 53 LLSTEIFGCIRALMAFRLTGGTKDYIVVGSDSGRIVILEYMPAKNALEKVHQETFGKSGC 112
Query: 61 RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL- 117
R GQ IDP R +IG L ++ D + +L + + + L +
Sbjct: 113 RRIVPGQYFAIDPKGRAVMIGAVEKQKLAYIMNRDTQARLTISSPLEAHKSNTLTYHMVG 172
Query: 118 --YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNGA 164
G P L D +++ T + A + + F E N+ L+ A
Sbjct: 173 VDVGFDNPMFACLEIDYEESDLDPTGDAAQRTQQTLTFYELDLGLNHVVRKYSEPLEEHA 232
Query: 165 DLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR---- 215
+ L+ VP P GVLI E + Y N IR I + +D D R
Sbjct: 233 NFLVSVPGGNDGP--SGVLICSENYLTY--KNLGDQHDIRCPIPRRRNDLD-DPERGMIF 287
Query: 216 -------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVV 262
+LL G + + + + + V+ +K++ AS + L +
Sbjct: 288 ICSATHRTKSMYFFLLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPASAMCVLKTGFL 347
Query: 263 YIGSSYGDSQLIKL-------------NLQPDAKGSYVEVLERYV----------NLGPI 299
++ S +G+ L ++ + P +G R + + PI
Sbjct: 348 FVASEFGNHYLYQIAHLGDDDDEPEFSSAMPLEEGETFFFAPRTLKNLVLVDELPSFAPI 407
Query: 300 VDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTD 358
+ V DL + Q+ G +LR++R+G+ ++E A EL G +W+++ D
Sbjct: 408 ITSQVADLANEDTPQLYVLCGRGPRSTLRVLRHGLEVSEMAVSELPGNPNAVWTVKKRAD 467
Query: 359 DPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVR 418
D FD +++VSF++ T +L++ + +EE GF T TL C + LVQV +R
Sbjct: 468 DEFDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLCCAALGDDALVQVYPDGIR 525
Query: 419 LVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD-GILTE-VKHAQL 476
+ S R NEWK+P S+ N QV++ G LVY E+ G L E + +++
Sbjct: 526 HIRSDKRV--NEWKAPGKKSITKCAVNQRQVVITLSGRELVYFEMDPTGELNEYTERSEM 583
Query: 477 EYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLC 536
EI C+ + + + S AVG+ D +VRI SL N +T + P LC
Sbjct: 584 PAEIMCMALGTVPDGEQRSWFLAVGL-ADNTVRILSLDPNNCLTPCSMQALPSPAES-LC 641
Query: 537 AFE--------------------------GISYLLCALGDGHLLNFLLNMKTGELTDRKK 570
E G YL L +G LL +L+ +G+L D +
Sbjct: 642 LVEMGHTESTTSAGALDDDAPPQRSGSNKGTIYLNIGLSNGVLLRTVLDPVSGDLADTRT 701
Query: 571 VSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFP 630
LG++P+ L + V A S R + Y + + ++ + + + F+S
Sbjct: 702 RYLGSRPVKLFRIKMQGAEAVLAMSSRSWLSYYHQNRFHLTPLSYETLEYASGFSSEQCS 761
Query: 631 DSLAIAKEGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQEQSRTFAICSLKN------ 682
+ + L I ++ + + + + PL PR + H + R + N
Sbjct: 762 EGIVAISTNTLRILALEKLGAVFNQVAFPLQFTPRTFVIHPDTGRMLIAETDHNAYTEDT 821
Query: 683 ------------QSCAEESEMHFVRLLDDQTFEFIS-TYPLDTFEY--------GCSILS 721
+S A + E R + + FI+ P D F G S +
Sbjct: 822 KSARKEQMAEEMRSAAGDEERELAREMANA---FINEVLPEDVFSAPKAGAGVCGASQIR 878
Query: 722 C------------------------SFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVE 757
C S + D Y VG A L ++G L
Sbjct: 879 CPGTPCMGQTMFKTRAIMSMAMVKFSVAADGRYYLAVGIARDLQLNPRISQGGCLDIYKI 938
Query: 758 D---GKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSEC-GHH 813
D L+ + E +L F G+LLA + +++Y D G +++ +C H
Sbjct: 939 DPTCSSLEFMHRTEIDEIPGALCGFQGRLLAGCGRMLRIY-----DLGKKKMLRKCENKH 993
Query: 814 GHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYL 873
+ +Q G + V D+ +S+ L Y+ E + A D + W++A +LD D +
Sbjct: 994 IPYQIVNIQAMGHRVYVSDVQESVFFLRYRRAENQLIIFADDTHPRWVTATTLLDYDT-I 1052
Query: 874 GAENNFNLFTVRKNSEGATDE------------ERGRL-------EVVGEYHLGEFVNRF 914
+ F ++++ TD+ +RG L E + +H+GE +
Sbjct: 1053 AIADKFGNLSIQRLPHSVTDDVDEDPTGTKSLWDRGLLSGASQKSENICSFHVGEIIMSL 1112
Query: 915 RHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGG 973
+ +L+ G +I+ T+NG +G E Y F + L+ ++R + G
Sbjct: 1113 QKATLI-------PGGSEALIYSTLNGTVGAFVPFTSREDYDFFQHLEMHMRNENPPLCG 1165
Query: 974 LNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
+H +RS KN LDGDL E +L + + I+ M + ++CK++E++
Sbjct: 1166 RDHLSYRS-----SYYPVKNVLDGDLCEQYLSIEAAKQKSIAGDMFRTPNQICKKLEDI 1219
>gi|242060436|ref|XP_002451507.1| hypothetical protein SORBIDRAFT_04g003000 [Sorghum bicolor]
gi|241931338|gb|EES04483.1| hypothetical protein SORBIDRAFT_04g003000 [Sorghum bicolor]
Length = 1232
Score = 201 bits (510), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 262/1187 (22%), Positives = 477/1187 (40%), Gaps = 180/1187 (15%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+L V ++G I +L FR G +D+L + ++ + +L++ + + L
Sbjct: 66 LLSVDVFGAIRSLAQFRLTGANKDYLVVGSDSGRLVILEYSPDRNRLDKVHQETFGKSGC 125
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLY-- 118
R GQ+ +DP R + + + V + + + LE + + F
Sbjct: 126 RRIVPGQLLAVDPKGRALCIAALEKQKLVYVLNRDAAARLTISSPLEAHKSNTLTFALAA 185
Query: 119 ---GCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNGA 164
G P + + ++ T + A + + F E N+ +DNGA
Sbjct: 186 LDCGFDNPVFAAIELEYAESDRDPTGQAANQAQKLLTFYELDLGLNHVSRKVSEPIDNGA 245
Query: 165 DLLIPVP-----PPLCGVLIIGEETIVYCSANAFK---AIPIRPSITKAYGRVDADGSR- 215
+LL+ VP P GVL+ + ++Y + + IP R + G + +
Sbjct: 246 NLLVTVPGGGDGP--SGVLVCCDNFVLYRNQGHPEIRAVIPRRADLPAERGVLIVAAATH 303
Query: 216 -------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSY 268
+LL G + + + H + V+ L+I+ + S + L + ++ S +
Sbjct: 304 KQKSMFFFLLQTEYGDIFKVDLDHNGDTVSELRIKYFDTIPVTSAVCVLRSGFLFAASEF 363
Query: 269 GDSQLIKLN--------------------------LQPDAKGSYVEVLERYVNLGPIVDF 302
G+ L + QP A + + + E +L P++D
Sbjct: 364 GNHALYQFRDIGRDADVESSSATLMETEEGFQPVFFQPRALKNLMRIDE-IESLMPVMDM 422
Query: 303 CVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIK-GMWSLRSSTDDPF 361
V +L + Q+ T G +LRI+R G+ I+E A L +W+++ + +D F
Sbjct: 423 RVANLFDEETPQLFTACGRGPRSTLRILRPGLAISEMARSMLPAEPIAVWTVKKNINDMF 482
Query: 362 DTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVS 421
D ++VVSF + T +L++ + +EE F T +L + L+QV +R +
Sbjct: 483 DAYIVVSFTNVTLVLSIG--ETIEEVSDSQFLDTTHSLAVTLLGEDSLMQVHPNGIRHIR 540
Query: 422 STSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTEVKHAQLEYEI 480
R NEW++P ++ +N QV++A GG L+Y E+ G L EV+ + ++
Sbjct: 541 EDGRV--NEWRTPGKKTITKVGSNRLQVVIALSGGELIYFEMDMTGQLMEVEKQDMSGDV 598
Query: 481 SCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPRSVLLCAFE 539
+CL I P+ E S+ AVG + D ++RI SL PD L P S+L +
Sbjct: 599 ACLAIAPVPEGRQRSRFLAVGSY-DNTIRILSLDPDDCLQPLSVQSVSSAPESLLFLEVQ 657
Query: 540 -------GISY-----LLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKN 587
G Y L L +G L ++M TG+L+D + LG +P L +
Sbjct: 658 ASIGGEDGADYPANLFLNAGLQNGVLFRTNVDMVTGQLSDTRSRFLGLRPPKLFPCVVSH 717
Query: 588 TTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTID 647
+ S RP + Y L + ++ + F+S + + L I TI+
Sbjct: 718 RQAMLCLSSRPWLGYIHQGHFLLTPLSCDTLESASSFSSDQCSEGVVAVAGDALRIFTIE 777
Query: 648 DI-QKLHIRSIPLGEHPRRICHQEQSRTFAIC-SLKNQSCAEESEMHFVRLLD------- 698
+ + + +IPL PR+ + + A+ S K AEE E L+
Sbjct: 778 RLGETFNETAIPLRYTPRKFVILPKKKYIAVIESDKGAFSAEEREAAKKECLEASGAAEN 837
Query: 699 ---------------DQTFEFISTYPLDTFEY---------------------------- 715
+ E +T+P + + Y
Sbjct: 838 GNANNGDPMENGDGQEDGAEEGNTFPDEQYGYPKAESERWVSCIRILDPRSRDTTCLLEL 897
Query: 716 -----GCSILSCSFSD-DSNVYYCVGTAYVLPEENEPTKGRILVFI---VEDGK-LQLIA 765
SI + +F D + VGTA L T + I V++G+ L+L+
Sbjct: 898 QENEAAVSICTVNFHDKEHGTLLAVGTAKGLQFWPRRTLAGGFIHIYKFVDEGRSLELLH 957
Query: 766 EKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY-VQTR 824
+ + + +L F G+LLA + ++LY D G R+L +C + + + T
Sbjct: 958 KTQVEEVPLALCQFQGRLLAGVGSVLRLY-----DLGKRKLLRKCENKLFPRTIVSIHTY 1012
Query: 825 GDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTV 884
D I VGD+ +S Y+ +E + A D W++A + +D D GA+ N++
Sbjct: 1013 RDRIYVGDMQESFHYCKYRRDENQLYIFADDSVPRWLTAAQHIDFDTMAGADKFGNIYFA 1072
Query: 885 R------------------KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDS 926
R K +G + ++E + ++H+G+ V + SL+
Sbjct: 1073 RLPQDISDEIEEDPTGGKIKWEQGKLNGAPNKVEEIVQFHVGDVVTCLQKASLI------ 1126
Query: 927 DVGQIPTVIFGTVNGVIG-VIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNE 985
G +I+GTV G +G ++A E F L+ +LR+ + G +H +RS
Sbjct: 1127 -PGGGECLIYGTVMGSVGALLAFTSREDVDFFSHLEMHLRQEHPPLCGRDHMAYRS---- 1181
Query: 986 KKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
K+ +DGDL E + L +I+ ++ + E+ K++E++
Sbjct: 1182 -AYFPVKDVIDGDLCEQYPSLPADMQRKIADELDRTPGEILKKLEDI 1227
>gi|195126264|ref|XP_002007593.1| GI12293 [Drosophila mojavensis]
gi|193919202|gb|EDW18069.1| GI12293 [Drosophila mojavensis]
Length = 1227
Score = 200 bits (509), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 260/1200 (21%), Positives = 471/1200 (39%), Gaps = 199/1200 (16%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
++ I+G I L FR G +D++ + ++ + +L++ S + + + + G
Sbjct: 53 LMSTEIFGCIRALMAFRLTGGTKDYIVVGSDSGRIVILEY-MPSKNALEKVHQETFGKSG 111
Query: 61 -RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
R GQ IDP R +IG L ++ D + +L + + + L +
Sbjct: 112 CRRIVPGQYFAIDPKGRAVMIGAVEKQKLAYIMNRDTQARLTISSPLEAHKSNTLTYHMV 171
Query: 118 ---YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNG 163
G P L D ++A + + A + + F E N+ L+
Sbjct: 172 GVDVGFDNPMFACLEIDYEEADLDPSGDAAQRTQQTLTFYELDLGLNHVVRKYSEPLEEH 231
Query: 164 ADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR--- 215
A+ L+ VP P GVLI E + Y N IR I + +D D R
Sbjct: 232 ANFLVSVPGGNDGP--SGVLICSENYLTY--KNLGDQHDIRCPIPRRRNDLD-DPERGMI 286
Query: 216 --------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAV 261
+LL G + + + + + V+ +K++ A+ + L
Sbjct: 287 FICSATHRTKSMYFFLLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPATAMCVLKTGF 346
Query: 262 VYIGSSYGDSQLIKLN------------------------LQPDAKGSYVEVLERYVNLG 297
+++ S +G+ L ++ P A + V +++ +
Sbjct: 347 LFVASEFGNHYLYQIAHLGDDDDEPEFSSAMPLEEGETFFFAPRALKNLV-LVDELPSFA 405
Query: 298 PIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSS 356
PI+ V DL + Q+ G +LR++R+G+ ++E A EL G +W+++
Sbjct: 406 PIITSQVADLANEDTPQLYVLCGRGPRSTLRVLRHGLEVSEMAVSELPGNPNAVWTVKKR 465
Query: 357 TDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGS 416
DD FD +++VSF++ T +L++ + +EE GF T TL C + LVQV
Sbjct: 466 IDDEFDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLCCAALGDDALVQVYPDG 523
Query: 417 VRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD-GILTE-VKHA 474
+R + S R NEWK+P S+ N QV++ G LVY E+ G L E + +
Sbjct: 524 IRHIRSDKR--VNEWKAPGKKSITKCAVNQRQVVITLSGRELVYFEMDPTGELNEYTERS 581
Query: 475 QLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVL 534
++ EI C+ + + E S AVG+ D +VRI SL N ++ + P
Sbjct: 582 EMPAEIMCMALGTVPEGEQRSWFLAVGL-ADNTVRILSLDPNNCLSPCSMQALPSPAES- 639
Query: 535 LCAFE-------------------------GISYLLCALGDGHLLNFLLNMKTGELTDRK 569
LC E G YL L +G LL +L+ +G+L D +
Sbjct: 640 LCLVEMGHTESTTNAGADDDVPAQRSGSNKGTIYLNIGLSNGVLLRTVLDPVSGDLADTR 699
Query: 570 KVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAF 629
LG++P+ L + V A S R + Y + + ++ + + + F+S
Sbjct: 700 TRYLGSRPVKLFRIKMQGAEAVLAMSSRTWLSYYHQNRFHLTPLSYETLEYASGFSSEQC 759
Query: 630 PDSLAIAKEGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQEQSRTFAICSLKN----- 682
+ + L I ++ + + + + PL PR + H + R + N
Sbjct: 760 SEGIVAISTNTLRILALEKLGAVFNQVAFPLQFTPRAFVIHPDTGRMLIAETDHNAYTEE 819
Query: 683 -------------QSCAEESEMHF-------------------------------VRLLD 698
+S A E E +R LD
Sbjct: 820 TKNARKEQMAEEMRSAAGEEERELAREMANAFINEVLPEDVFSAPKAGLGLWASQIRCLD 879
Query: 699 DQTFEFISTYPLDTFE--YGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIV 756
+ + T PL E ++L S + D Y VG A L ++G +
Sbjct: 880 AMHGQTMFTVPLTQNEAIMSMAMLKFSVAADGRYYLAVGIAKDLQLNPRISQGGCIDIYK 939
Query: 757 ED---GKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSEC-GH 812
D L+ + E + +L F G+LLA + +++Y D G +++ +C
Sbjct: 940 IDPTCSALEFLHRTEIEEIPGALCGFQGRLLAGCGRMLRIY-----DLGKKKMLRKCENK 994
Query: 813 HGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIY 872
H + +Q G + V D+ +S+ + Y+ E + A D + W++A +LD D
Sbjct: 995 HIPYQIVNIQAMGHRVYVSDVQESVFFIRYRRAENQLIIFADDTHPRWVTATTLLDYDT- 1053
Query: 873 LGAENNFNLFTVRKNSEGATDE------------ERGRL-------EVVGEYHLGEFVNR 913
+ + F ++++ TD+ +RG L E + +H+GE +
Sbjct: 1054 IAIADKFGNLSIQRLPHSVTDDVDEDPTGTKSLWDRGLLSGASQKSENICSFHVGEIIMS 1113
Query: 914 FRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVG 972
+ +L+ G +I+ T++G +G E Y F + L+ ++R +
Sbjct: 1114 LQKATLI-------PGGSEALIYATLSGTVGAFVPFTSREDYDFFQHLEMHMRNENPPLC 1166
Query: 973 GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
G +H +RS KN LDGDL E +L + + I+ M + ++CK++E++
Sbjct: 1167 GRDHLSYRS-----SYYPVKNVLDGDLCEQYLSIDAVKQKSIAGDMFRTPNQICKKLEDI 1221
>gi|195428692|ref|XP_002062402.1| GK16677 [Drosophila willistoni]
gi|194158487|gb|EDW73388.1| GK16677 [Drosophila willistoni]
Length = 1273
Score = 200 bits (509), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 259/1202 (21%), Positives = 477/1202 (39%), Gaps = 202/1202 (16%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
++ I+G I L FR G +D++ + ++ + +L++ + L + + + G
Sbjct: 98 LMSTEIFGCIRALMSFRLTGGTKDYIVVGSDSGRIVILEYIPAKNAL-EKVHQETFGKTG 156
Query: 61 -RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
R GQ IDP R +IG L ++ D + +L + + + L +
Sbjct: 157 CRRIVPGQYFAIDPKGRAVMIGAVEKQKLAYIMNRDTQARLTISSPLEAHKSNTLTYHMV 216
Query: 118 ---YGCAKPTIVVLYQDNKDA---------RHVKT----YEVALKDKDFVEGPWSQNNLD 161
G P L D ++A +H + YE+ L V +S+ L+
Sbjct: 217 GVDVGFDNPMFACLEIDYEEADLDPSGDAAQHTQQTLTFYELDLGLNHVVRK-YSEP-LE 274
Query: 162 NGADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR- 215
A+ L+ VP P GVLI E + Y N IR I + +D D R
Sbjct: 275 EHANFLVSVPGGNDGP--SGVLICSENYLTY--KNLGDQHDIRCPIPRRRNDLD-DPERG 329
Query: 216 ----------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDN 259
+LL G + + + + + V+ +K++ A+ + L
Sbjct: 330 MIFICSATHRTKSMYFFLLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPATAMCVLKT 389
Query: 260 AVVYIGSSYGDSQLIKLN------------------------LQPDAKGSYVEVLERYVN 295
+++ S +G+ L ++ P A + V +++ +
Sbjct: 390 GFLFVASEFGNHYLYQIAHLGDDDDEPEFSSAMPLEEGETFFFAPRALKNLV-LVDELPS 448
Query: 296 LGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLR 354
PIV V DL + Q+ G +LR++R+G+ ++E A EL G +W+++
Sbjct: 449 FAPIVTSQVADLANEDTPQLYVLCGRGPRSTLRVLRHGLEVSEMAVSELPGNPNAVWTVK 508
Query: 355 SSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTS 414
DD FD +++VSF++ T +L++ + +EE GF T TL C + LVQV
Sbjct: 509 KRVDDEFDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLCCAALGDDALVQVYP 566
Query: 415 GSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD-GILTE-VK 472
+R + S R NEWK+P S+ N QV++ G LVY E+ G L E +
Sbjct: 567 DGIRHIRSDKR--VNEWKAPGKKSITKCAVNQRQVVITLSGRELVYFEMDPTGELNEYTE 624
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLI------------- 519
+++ EI C+ + + + S +VG+ D +VRI SL N +
Sbjct: 625 RSEMPAEIMCMALGTVPDGEQRSWFLSVGL-ADNTVRILSLDPNNCLSPCSMQALPSPAE 683
Query: 520 ------------TKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTD 567
T +G E +P + +G YL L +G LL +L+ +G+L D
Sbjct: 684 SLCLVEMGHTESTTTAVGDEDVPAQRSGGSNKGTIYLNIGLSNGVLLRTVLDPVSGDLAD 743
Query: 568 RKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSA 627
+ LG++P+ L + + V A S R + Y + + ++ + + + F+S
Sbjct: 744 TRTRYLGSRPVKLFRIKMQGSEAVLAMSSRTWLSYYHQNRFHLTPLSYETLEYASGFSSE 803
Query: 628 AFPDSLAIAKEGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQEQSRTFAICS------ 679
+ + L I ++ + + + + PL PR + H + R +
Sbjct: 804 QCSEGIVAISTNTLRILALEKLGAVFNQVAFPLQYTPRTFVIHPDTGRMLIAETDHNAYT 863
Query: 680 --------------LKNQSCAEESEM-----------------------------HFVRL 696
++N + EE E+ +R
Sbjct: 864 EDTKSARKEQMAEEMRNAAGDEERELAREMANAFISEALPEDVFSAPKAGLGLWASQIRC 923
Query: 697 LDDQTFEFISTYPLDTFE--YGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVF 754
LD + + PL E ++L S + D Y VG A L ++G +
Sbjct: 924 LDAMHGQTMFNVPLTQNEAIMSMAMLKFSIAADGRYYLAVGIAKDLQLNPRISQGGCIDI 983
Query: 755 IVED---GKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSEC- 810
D L+ + E + +L F G+LLA + +++Y D G +++ +C
Sbjct: 984 YKIDPTCSSLEFLHRTEIEEIPGALCGFQGRLLAGCGRMLRIY-----DLGKKKMLRKCE 1038
Query: 811 GHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDD 870
H + +Q G + V D+ +S+ + Y+ E + A D + W++A +LD D
Sbjct: 1039 NKHIPYQIVNIQAMGHRVYVSDVQESVFFIRYRRAENQLIIFADDTHPRWVTATTLLDYD 1098
Query: 871 IYLGAENNFNLFTVRKNSEGATDE------------ERGRL-------EVVGEYHLGEFV 911
+ + F ++++ TD+ +RG L E + +H+GE +
Sbjct: 1099 T-IAIADKFGNLSIQRLPHSVTDDVDEDPTGTKSLWDRGLLSGASQKSENICSFHVGEII 1157
Query: 912 NRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKG 970
+ +L+ G +I+ T++G +G E Y F + L+ ++R
Sbjct: 1158 MSLQKATLI-------PGGSEALIYATLSGTVGAFVPFTSREDYDFFQHLEMHMRNENPP 1210
Query: 971 VGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVE 1030
+ G +H +RS KN LDGDL E +L + + I+ M + ++CK++E
Sbjct: 1211 LCGRDHLSYRS-----SYYPVKNVLDGDLCEQYLSIDAVKQKSIAGDMFRTPNQICKKLE 1265
Query: 1031 EL 1032
++
Sbjct: 1266 DI 1267
>gi|346327528|gb|EGX97124.1| pre-mRNA splicing factor RSE1 [Cordyceps militaris CM01]
Length = 1206
Score = 200 bits (508), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 273/1165 (23%), Positives = 491/1165 (42%), Gaps = 160/1165 (13%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
I+G + ++ +FR G +D++ +AT+ + +L++ + M R
Sbjct: 58 IFGILRSIAVFRLAGSNKDYIILATDSGRITILEYLPGPNRFNRLHMETFGKSGIRRVVP 117
Query: 66 GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YGC 120
G+ DP R LI + L V+ +++ +L + + + VL I + G
Sbjct: 118 GEYLACDPKGRACLISAVEKNKLVYVLNRNSQAELTISSPLEAHKPGVLVIAMVALDVGY 177
Query: 121 AKPTIVVLY-------QD--NKDARHVKT----YEVALKDKDFVEGPWSQNNLDNGADLL 167
A P L QD + V+T YE+ L V WS + +D A LL
Sbjct: 178 ANPVFAALEIEYTEVDQDITGEALSEVETQLVYYELDLGLNHVVRK-WS-DPVDPTASLL 235
Query: 168 IPVP-----PPLCGVLIIGEETIVYCSAN--AFKA-IPIRPSITKAYGR---VDADGSRY 216
VP P GVL+ GEE I Y +N A + IP R T+ R + A
Sbjct: 236 FQVPGGNDGP--SGVLVCGEENITYRHSNQDALRVPIPRRRGATEDPSRKRNIVAGVMHK 293
Query: 217 LLGDHAGLLHLL-------------VITHEKEKVTG----LKIELLGETSIASTISYLDN 259
L G LL ++ E+ TG +KI+ +A+++ L +
Sbjct: 294 LKGSAGAFFFLLQSDDGDLFKITIDMVEDEEGAPTGEVQRMKIKYFDTVPVATSLCILKS 353
Query: 260 AVVYIGSSYGDS---QLIKL------------NLQPDAKGSY------------VEVLER 292
+Y+ S +G+ Q KL + D +Y + +++
Sbjct: 354 GFLYVASQFGNYAFYQFEKLGDDDDEVEFSSEDFPVDPLAAYEPVYFYPRLAENLALVDS 413
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMW 351
+ P++D V +L + Q+ T G + R +++G+ +NE + EL G+ +W
Sbjct: 414 IPAMNPLLDCKVANLTGEDAPQIFTICGNGARSTFRTLKHGLEVNEIVASELPGVPSAVW 473
Query: 352 SLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQ 411
+L+ ++D+ +D ++V+SF + T +L++ + +EE GF + T+ + L+Q
Sbjct: 474 TLKLNSDEQYDAYIVLSFTNGTLVLSIG--ETVEEVSDSGFLTSVPTIAAQLLGTDGLIQ 531
Query: 412 VTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTE 470
V +R + + + NEW +P S+ A+ N+ QV +A G +VY E+ DG L E
Sbjct: 532 VHPRGIRHIRNGNV---NEWSAPQHRSIVAASTNSHQVAIALSSGEIVYFEMDSDGSLAE 588
Query: 471 VKHAQLEY-EISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEI 528
+ + ++ L + + E S AVG D +VRI SL P+ L K
Sbjct: 589 YDEKKEMFGTVTALSLGEVPEGRVRSSFLAVGC-DDSTVRILSLDPESTLENKSVQALTA 647
Query: 529 IPRSVLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRT 582
P S+ + + S YL L G L +L+ TGELTD ++ LG +P+ L
Sbjct: 648 APTSLAIIPMDDSSSGGSTLYLHIGLHSGVYLRTVLDEITGELTDTRQKFLGPKPVRLFQ 707
Query: 583 FSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHM-----CPFNSAAFPDSLAIAK 637
+ T V S RP + Y++ + S+ + +S++ F+S D + +
Sbjct: 708 VTVGGRTCVLGLSSRPWLGYANP---ITSSFEITPLSYVDLEWGWNFSSEQCEDGVVGIQ 764
Query: 638 EGELTIGTIDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFV-- 694
L I +ID + + L SIPL P++ F N + + + +
Sbjct: 765 GQSLRIFSIDRLGETLLQSSIPLTYTPKKFVKHPDEPLFYTIEADNHTLPPDLQAKLLAD 824
Query: 695 --------RLLDDQTF----------------EFISTYP-------LDTFEYGCSILSCS 723
++L + F + +S P + E S+ S
Sbjct: 825 PAAVNGDAKVLPPEDFGHPRGNRRWASCISVVDPVSEEPSVLQRVDFENNEAAVSVAVVS 884
Query: 724 F-SDDSNVYYCVGTAY-VLPEENEPTKGRILVFIVEDG--KLQLIAEKETKGAVYSLNAF 779
F S D+ + VGT ++ ++ I ++ + G +L+ I + + + +L F
Sbjct: 885 FASQDNESFLVVGTGKDIVLNPRSSSEAYIYIYRFQQGGRELEFIHKTKIEEPATALLPF 944
Query: 780 NGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISL 839
GKLLA I + +++Y +R R+ Q+E I++L T+G IVV D+ + ++L
Sbjct: 945 QGKLLAGIGKTLRMYDLGMR-QLLRKAQAEVVPQ-QIVSL--NTQGSRIVVSDVQQGVTL 1000
Query: 840 LIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGA--TDEERG 897
++YK + D A W + ++D + G + N+F VR ++ + DE+
Sbjct: 1001 VVYKSASNKLIPFVDDSIARWSTCTTMVDYESVAGGDKFGNMFIVRSPAKASEEADEDAA 1060
Query: 898 RLEVV-------GEYHLGEFVNRFRHGSLVMRLPDSD--VGQIPTVIFGTVNGVIGV-IA 947
L +V G H E + F ++ + + VG +++ + G IGV I
Sbjct: 1061 GLHLVNARDYLHGTQHRLELMCHFFTQDILTSINKTGLVVGGQDVLLWSGIMGTIGVFIP 1120
Query: 948 SLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLS 1007
+ E F + L+ +LR + G +H +RS+ K V +DGDL E F L
Sbjct: 1121 FVSREDTDFFQSLEQHLRTEDGPLAGRDHLMYRSYYAPVKGV-----IDGDLCERFSILP 1175
Query: 1008 RTRMDEISKTMNVSVEELCKRVEEL 1032
+ I+ ++ SV E+ +++ ++
Sbjct: 1176 NDKKQMIAGELDRSVREIERKISDI 1200
>gi|115390120|ref|XP_001212565.1| splicing factor 3B subunit 3 [Aspergillus terreus NIH2624]
gi|114194961|gb|EAU36661.1| splicing factor 3B subunit 3 [Aspergillus terreus NIH2624]
Length = 1217
Score = 200 bits (508), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 275/1169 (23%), Positives = 476/1169 (40%), Gaps = 177/1169 (15%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
++G I +L FR G +D++ I ++ + ++++ S R + + G R
Sbjct: 64 VFGIIRSLAAFRLAGSNKDYIIIGSDSGRITIIEY-VPSQNRFNRIHLETFGKSGVRRVI 122
Query: 65 NGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YG 119
GQ +DP R +I + L V+ +++ +L + + + Q L G
Sbjct: 123 PGQYLAVDPKGRACIIASVEKNKLVYVLNRNSQAELTISSPLEAHKPQTLVFAMCALDVG 182
Query: 120 CAKPTIVVLYQDNKDARHVKT-------------YEVALKDKDFVEGPWSQNNLDNGADL 166
P L D ++ T YE+ L V W+ + +D A +
Sbjct: 183 YENPVFAALEVDYSESDQDPTGRAFEETEKLLVYYELDLGLNHVVRK-WA-DPVDRTATM 240
Query: 167 LIPVPPPL---CGVLIIGEETIVYCSAN--AFKA-IPIRPSITKAYGRVDADGSR----- 215
L VP GVL+ E+ + Y N AF+ IP R T+ R +
Sbjct: 241 LFQVPGGADGPSGVLVCAEDNVTYRHLNQDAFRVPIPRRRGATENPERKRCITAGVMHKM 300
Query: 216 -----YLLGDHAGLLHLLVITHEKE---KVTG----LKIELLGETSIASTISYLDNAVVY 263
+LL G L L I ++ ++TG LKI+ IAS + L + +Y
Sbjct: 301 RGAFFFLLQTEDGDLFKLTIDMMEDDNGQLTGEVKRLKIKYFDTVPIASNLLILKSGFLY 360
Query: 264 IGSSYGDSQLIKLN----------------------------LQPDAKGSYVEVLERYVN 295
+ S G+ + QP + +++ +
Sbjct: 361 VASEAGNHHFYQFEKLGDDDEEIEFSSEDFSADPSVPCEPVYFQPRG-AENLNLVDTINS 419
Query: 296 LGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSLR 354
L P++D V +L + Q+ T SGA + R +++G+ ++E EL + +W+ +
Sbjct: 420 LNPLIDSKVANLSEEDAPQIYTISGAGARSTFRTLKHGLEVSEIVESELPSVPSAVWTTK 479
Query: 355 SSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTS 414
+ +D FD ++++SF + T +L++ + +EE GF S TL + LVQ+
Sbjct: 480 LTREDEFDAYIILSFANGTLVLSIG--ETVEEVTDTGFLSSAPTLAVQQLGEDSLVQIHP 537
Query: 415 GSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTEV-K 472
+R + R NEW +P S+ A N QV +A G +VY E+ DG L E +
Sbjct: 538 RGIRHIMVDRR--VNEWPAPQHRSIVAAATNERQVAVALSSGEIVYFEMDADGTLAEYDE 595
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPR 531
Q+ ++ L + + E S AVG D +VRI SL PD L K P
Sbjct: 596 RRQMSGTVTSLSLGEVPEGRIRSSFLAVGC-DDSTVRILSLDPDSTLENKSVQALTAAPS 654
Query: 532 SVLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSS 585
++ + + S YL L G L +L+ TGEL+D + LG++P+ L S
Sbjct: 655 ALNIMSMADSSSGGTTLYLHIGLYSGVYLRTVLDEVTGELSDTRTRFLGSKPVKLFQVSV 714
Query: 586 KNTTHVFAASDRPTVIYS--SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTI 643
K T V A S RP + YS K + + ++ + F+S + + + L I
Sbjct: 715 KGQTAVLALSSRPWLGYSDIQTKGFMLTPLDYVPLEWGWNFSSEQCLEGMVGIQGQNLRI 774
Query: 644 GTIDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTF 702
+I+ + + +SIPL PRR + F + N + + RLL+D
Sbjct: 775 FSIEKLDNNMLQQSIPLSYTPRRFVKHPEQPLFYVIESDNNVLSPSTR---ARLLEDSKS 831
Query: 703 -----------EF------------------------ISTYPLDTFEYGCSILSCSF-SD 726
EF + T L+ E S+ + F S
Sbjct: 832 RNGDTTVLPPEEFGYPRATGHWASCIQVVDPLDAKAVVHTVELEDNEAAVSVAAVPFTSQ 891
Query: 727 DSNVYYCVGTAYVLPEENEPTKGRILVFIV---EDGK-LQLIAEKETKGAVYSLNAFNGK 782
D + VGT + N P+ + I EDG+ L+ I + + + +L AF G+
Sbjct: 892 DDETFLVVGTVKDM-TVNPPSSAGGFIHIYRFQEDGRELEFIHKTKVEEPPLALLAFQGR 950
Query: 783 LLAAINQKIQLYKWMLRDDGTRELQSECGHH---GHILALYVQTRGDFIVVGDLMKSISL 839
L + +++Y D G ++L +C H I+ L QT+G IVV D+ +S++
Sbjct: 951 LAVGLGSLLRIY-----DLGMKQLLRKCQAHVVPKTIVGL--QTQGSRIVVSDVRESVTY 1003
Query: 840 LIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDE-- 894
++YK++E + D + W ++ ++D + G + NL+ VR K SE A ++
Sbjct: 1004 VVYKYQENVLIPFVDDSISRWTTSTTMVDYETVAGGDKFGNLWLVRCPKKVSEQADEDGS 1063
Query: 895 ------ERG-------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNG 941
ERG RLE++ + + LV G +++ +G
Sbjct: 1064 GAHLIHERGYLHGTPNRLELMIHVYTQDIPTSLHKTQLV-------AGGRDILVWTGFHG 1116
Query: 942 VIGVIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLI 1000
IG++ + E F + L+ L + G +H +RS+ K V +DGDL
Sbjct: 1117 TIGMLVPFVSREDVDFFQNLEMQLASQHPPLAGRDHLIYRSYYAPVKGV-----IDGDLC 1171
Query: 1001 ESFLDLSRTRMDEISKTMNVSVEELCKRV 1029
E++ L I+ ++ SV E+ +++
Sbjct: 1172 ETYFLLPNDTKMMIAADLDRSVREIERKI 1200
>gi|125977518|ref|XP_001352792.1| GA12611 [Drosophila pseudoobscura pseudoobscura]
gi|54641542|gb|EAL30292.1| GA12611 [Drosophila pseudoobscura pseudoobscura]
Length = 1228
Score = 200 bits (508), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 263/1202 (21%), Positives = 466/1202 (38%), Gaps = 202/1202 (16%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+L I+G I L FR G +D++ + ++ + +L++ + L
Sbjct: 53 LLSTEIFGCIRALMAFRLTGGTKDYIVVGSDSGRIVILEYMPAKNALEKVHQETFGKSGC 112
Query: 61 RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL- 117
R GQ IDP R +IG L ++ D + +L + + + L +
Sbjct: 113 RRIVPGQYFAIDPKGRAVMIGAVEKQKLAYIMNRDTQARLTISSPLEAHKSNTLTYHMVG 172
Query: 118 --YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNGA 164
G P L D +++ T + A + + F E N+ L+ A
Sbjct: 173 VDVGFDNPMFACLEIDYEESDLDPTGDAAQRTQQTLTFYELDLGLNHVVRKYSEPLEEHA 232
Query: 165 DLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR---- 215
+ L+ VP P GVLI E + Y N IR I + +D D R
Sbjct: 233 NFLVSVPGGNDGP--SGVLICSENYLTY--KNLGDQHDIRCPIPRRRNDLD-DPERGMIF 287
Query: 216 -------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVV 262
+LL G + + + + + V+ +K++ AS + L +
Sbjct: 288 ICSATHRTKSMYFFLLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPASAMCVLKTGFL 347
Query: 263 YIGSSYGDSQLIKL-------------NLQPDAKGSYVEVLERYV----------NLGPI 299
++ S +G+ L ++ + P +G R + + PI
Sbjct: 348 FVASEFGNHYLYQIAHLGDDDDEPEFSSAMPLEEGETFFFAPRTLKNLVLVDELPSFAPI 407
Query: 300 VDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTD 358
+ V DL + Q+ G +LR++R+G+ ++E A EL G +W+++ D
Sbjct: 408 ITSQVADLANEDTPQLYVLCGRGPRSTLRVLRHGLEVSEMAVSELPGNPNAVWTVKKRAD 467
Query: 359 DPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVR 418
D FD +++VSF++ T +L++ + +EE GF T TL C + LVQV +R
Sbjct: 468 DEFDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLCCAALGDDALVQVYPDGIR 525
Query: 419 LVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKHAQL 476
+ S R NEWK+P S+ N QV++ G LVY E+ G L E + +++
Sbjct: 526 HIRSDKRV--NEWKAPGKKSITKCAVNQRQVVITLSGRELVYFEMDPTGELNEYTERSEM 583
Query: 477 EYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLC 536
EI C+ + + + S AVG+ D +VRI SL N +T + P LC
Sbjct: 584 PAEIMCMALGTVPDGEQRSWFLAVGL-ADNTVRILSLDPNNCLTPCSMQALPSPAES-LC 641
Query: 537 AFE--------------------------GISYLLCALGDGHLLNFLLNMKTGELTDRKK 570
E G YL L +G LL +L+ +G+L D +
Sbjct: 642 LVEMGHTESTTSAGALDDDAPPQRSGSNKGTIYLNIGLSNGVLLRTVLDPVSGDLADTRT 701
Query: 571 VSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFP 630
LG++P+ L + V A S R + Y + + ++ + + + F+S
Sbjct: 702 RYLGSRPVKLFRIKMQGAEAVLAMSSRSWLSYYHQNRFHLTPLSYETLEYASGFSSEQCS 761
Query: 631 DSLAIAKEGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQEQSRTFAICSLKN------ 682
+ + L I ++ + + + + PL PR + H + R + N
Sbjct: 762 EGIVAISTNTLRILALEKLGAVFNQVAFPLQFTPRTFVIHPDTGRMLIAETDHNAYTEDT 821
Query: 683 ------------QSCAEESEMHFVRLLDDQTFEFIS-TYPLDTFE--------------- 714
+S A + E R + + FI+ P D F
Sbjct: 822 KSARKEQMAEEMRSAAGDEERELAREMANA---FINEVLPEDVFSAPKAGLGLWASQIRC 878
Query: 715 ----YGCSILSCSFSD----------------DSNVYYCVGTAYVLPEENEPTKGRILVF 754
+G ++ S S + D Y VG A L ++G L
Sbjct: 879 LDAMHGQTMFSVSLTQNEAIMSMAMVKFSVAADGRYYLAVGIARDLQLNPRISQGGCLDI 938
Query: 755 IVED---GKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSEC- 810
D L+ + E +L F G+LLA + +++Y D G +++ +C
Sbjct: 939 YKIDPTCSSLEFMHRTEIDEIPGALCGFQGRLLAGCGRMLRIY-----DLGKKKMLRKCE 993
Query: 811 GHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDD 870
H + +Q G + V D+ +S+ L Y+ E + A D + W++A +LD D
Sbjct: 994 NKHIPYQIVNIQAMGHRVYVSDVQESVFFLRYRRAENQLIIFADDTHPRWVTATTLLDYD 1053
Query: 871 IYLGAENNFNLFTVRKNSEGATDE------------ERGRL-------EVVGEYHLGEFV 911
+ + F ++++ TD+ +RG L E + +H+GE +
Sbjct: 1054 T-IAIADKFGNLSIQRLPHSVTDDVDEDPTGTKSLWDRGLLSGASQKSENICSFHVGEII 1112
Query: 912 NRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKG 970
+ +L+ G +I+ T+NG +G E Y F + L+ ++R
Sbjct: 1113 MSLQKATLI-------PGGSEALIYSTLNGTVGAFVPFTSREDYDFFQHLEMHMRNENPP 1165
Query: 971 VGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVE 1030
+ G +H +RS KN LDGDL E +L + + I+ M + ++CK++E
Sbjct: 1166 LCGRDHLSYRS-----SYYPVKNVLDGDLCEQYLSIEAAKQKSIAGDMFRTPNQICKKLE 1220
Query: 1031 EL 1032
++
Sbjct: 1221 DI 1222
>gi|321260749|ref|XP_003195094.1| hypothetical protein CGB_G1120W [Cryptococcus gattii WM276]
gi|317461567|gb|ADV23307.1| Conserved hypothetical protein [Cryptococcus gattii WM276]
Length = 1276
Score = 199 bits (507), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 221/885 (24%), Positives = 395/885 (44%), Gaps = 120/885 (13%)
Query: 213 GSRYLLGDHAGLLHLLVITHEKEKVTGLKIEL------LGETSIASTISYLDNAVVYIGS 266
G+ ++GD G EK G + LG +S S+I+YLD++ +++ S
Sbjct: 375 GATVVIGDEYGAFTAFGWEFEKGSGAGTDGRVRVLRTYLGASSPPSSITYLDSSHLFVSS 434
Query: 267 SYGDSQLIKLNL--------QPDAKGSYV----------------------------EVL 290
+ DS L++ KG E+L
Sbjct: 435 AVADSVLLRFTTVESSSSVSSSKGKGRAATGPIGDQLDKWEVMYENGKERNDTDGGPEIL 494
Query: 291 ERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVE-LQGIKG 349
ER++N+ P+ D CVV E +V SGA + SLR+VR+G+G+ E +++ L ++
Sbjct: 495 ERWMNIAPVKDLCVVKDEGGNLSHLVLASGASESNSLRVVRSGVGLEELVTIQGLHDVQK 554
Query: 350 MWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQL 409
MWSL TD + L++S + T +L + + + T F S+T L
Sbjct: 555 MWSL---TDSTATSRLLLSTSNSTVLLQLQPDISVIPTTDIIFNSETLAAGILPG-AELL 610
Query: 410 VQVTSGSVRLVSS-TSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGIL 468
Q+T + L S S +L + + + A A ++A GG+LV + +
Sbjct: 611 AQITPRGLFLWSDLNSGQLEAQVEVDKETEIVCAQVTAEWAVVAKNGGNLVVFHVSNTGF 670
Query: 469 TEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEI 528
++ E+S + I+ + S S I A+ WT +V +++L ++ + G
Sbjct: 671 NPKGTIDVKEEVSAVAISR--NSDSSSPIIAISTWTAKTV-VYTLSQIS----SGVDGLS 723
Query: 529 IPRSVLLCAFEGISY--------LLCALGDGHLLNFLLNMKTGELTD------RKKVSLG 574
IP + + S+ LL L +G L + L+ D K SLG
Sbjct: 724 IPSKSSATSLQLRSHPSYPAGIQLLSGLDNGLLQIYDLDKDDMGGVDGLVVKSSKTTSLG 783
Query: 575 TQPITLR----TFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFP 630
+P+ L T + V A ++R +VI+ S ++ +S+VN+K + N+++ P
Sbjct: 784 LRPLALHPCEITGGDEKVISV-ALTERMSVIFESKDRIEFSSVNIKNIMAAAAVNTSSGP 842
Query: 631 DSLAIAKEGELTIGTIDDIQKLHIRSIPLG-EHPRRICHQEQSRTFAICSLKNQSCAEES 689
++ L++ I+ ++KLH+++ G E ++ + ++ + A C L ++ +
Sbjct: 843 VLALFSRTSGLSLLKINSLKKLHVQTCDTGNESISKLNYMDEYKAIA-CGLTRRTQLRDG 901
Query: 690 ---EMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEP 746
E +FV++ D + E +S++ L E S+ S ++Y VGTA++ P++ E
Sbjct: 902 DVEEENFVQIRDGTSLEPLSSFTLRERELVTSLRSVFLV--GSMYLAVGTAFLPPDDGED 959
Query: 747 T-----------KGRILVFIVEDGK------LQLIAEKETKGAVYSLNAFNGKLLAAINQ 789
+ +GR+L+ +++G +++ AE T GAVY+L +G L A
Sbjct: 960 SSWDEGNLAVVKEGRVLLLEIKEGDAGGGWDVKIKAELTTVGAVYALEEIHGFLAVAAGS 1019
Query: 790 KIQLYKWMLRDDGTRELQSECGHHG-----HILALY-VQTRGD-FIVVGDLMKSISLLIY 842
K+ +++ D + EL+ + AL+ TR + ++VGD M+S+ +L
Sbjct: 1020 KLTMHR---LDHNSVELEETSSWASAYVISSLSALHPSHTRPEGALIVGDGMRSVIVLNV 1076
Query: 843 KHEEGAIEERARDYNANWMSAVEILDD--DIYLGAENNFNLFTVRKNSEGATDEERGRLE 900
+G I + R+ + ++A+ +L D D + ++ + NL T R N +LE
Sbjct: 1077 DEGDGMIYDDERNMATHGVTALGLLKDKGDAVVVSDAHSNLLTYRLNQ---------KLE 1127
Query: 901 VVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKL 960
+ L E V RF++GSLV ++ IP V+F T G +GVI L L+ L
Sbjct: 1128 RAATFGLHEEVTRFQNGSLVPTTTAPEI-IIPDVLFATREGRLGVIGELGTMSSRTLDDL 1186
Query: 961 QTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLD 1005
Q N+ K+ KG G + WR + D F+DGD ++ FLD
Sbjct: 1187 QRNMGKIWKGPGDVGWSNWRRAGSNLVGKDTAGFVDGDFVQKFLD 1231
>gi|194749950|ref|XP_001957397.1| GF24063 [Drosophila ananassae]
gi|190624679|gb|EDV40203.1| GF24063 [Drosophila ananassae]
Length = 1228
Score = 199 bits (505), Expect = 8e-48, Method: Compositional matrix adjust.
Identities = 258/1201 (21%), Positives = 473/1201 (39%), Gaps = 200/1201 (16%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
++ I+G I L FR G +D++ + ++ + +L++ S + + + + G
Sbjct: 53 LMSTEIFGCIRALMAFRLTGGTKDYIVVGSDSGRIVILEY-IPSKNALEKVHQETFGKSG 111
Query: 61 -RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
R GQ IDP R +IG L ++ D + +L + + + L +
Sbjct: 112 CRRIVPGQYFAIDPKGRAVMIGAVEKQKLAYIMNRDTQARLTISSPLEAHKSNTLTYHMV 171
Query: 118 ---YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNG 163
G P L D ++A + + A + + F E N+ L+
Sbjct: 172 GVDVGFDNPMFACLEIDYEEADLDPSGDAAQRTQQTLTFYELDLGLNHVVRKYSEPLEEH 231
Query: 164 ADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR--- 215
A+ L+ VP P GVLI E + Y N IR I + +D D R
Sbjct: 232 ANFLVSVPGGNDGP--SGVLICSENYLTY--KNLGDQHDIRCPIPRRRNDLD-DPERGMI 286
Query: 216 --------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAV 261
+LL G + + + + + V+ +K++ A+ + L
Sbjct: 287 FICSATHRTKSMYFFLLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPATAMCVLKTGF 346
Query: 262 VYIGSSYGDSQLIKLN------------------------LQPDAKGSYVEVLERYVNLG 297
+++ S +G+ L ++ P A + V +++ +
Sbjct: 347 LFVASEFGNHYLYQIAHLGDDDDEPEFSSAMPLEEGETFFFAPRALKNLV-LVDELPSFA 405
Query: 298 PIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSS 356
PI+ V DL + Q+ G +LR++R+G+ ++E A EL G +W+++
Sbjct: 406 PIITSQVADLANEDTPQLYVLCGRGPRSTLRVLRHGLEVSEMAVSELPGNPNAVWTVKKR 465
Query: 357 TDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGS 416
DD FD +++VSF++ T +L++ + +EE GF T TL C + LVQV
Sbjct: 466 ADDEFDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLCCAALGDDALVQVYPDG 523
Query: 417 VRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD-GILTE-VKHA 474
+R + S R NEWK+P S+ N QV++ G LVY E+ G L E + +
Sbjct: 524 IRHIRSDKR--VNEWKAPGKKSITKCAVNQRQVVITLSGRELVYFEMDPTGELNEYTERS 581
Query: 475 QLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVL 534
++ EI C+ + + E S AVG+ D +VRI SL N +T + P
Sbjct: 582 EMPAEIMCMALGTVPEGEQRSWFLAVGL-ADNTVRILSLDPNNCLTPCSMQALPSPAES- 639
Query: 535 LCAFE--------------------------GISYLLCALGDGHLLNFLLNMKTGELTDR 568
LC E G YL L +G LL +L+ +G+L D
Sbjct: 640 LCLVEMGHTESTTSAGGVNEDAPAQRSGSNKGTIYLNIGLSNGVLLRTVLDPVSGDLADT 699
Query: 569 KKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAA 628
+ LG++P+ L + + V A S R + Y + + ++ + + + F+S
Sbjct: 700 RTRYLGSRPVKLFRIKMQGSEAVLAMSSRTWLSYYHQNRFHLTPLSYETLEYASGFSSEQ 759
Query: 629 FPDSLAIAKEGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQEQSRTFAICS------- 679
+ + L I ++ + + + + PL PR + H + R +
Sbjct: 760 CSEGIVAISTNTLRILALEKLGAVFNQVAFPLQYTPRTFVIHPDTGRMLIAETDHNAYTE 819
Query: 680 -------------LKNQSCAEESEM-----------------------------HFVRLL 697
+++ + EE E+ +R L
Sbjct: 820 DTKNARKEQMAEEMRSAAGDEERELAREMANAFINEVLPEDVFSAPKAGLGLWASQIRCL 879
Query: 698 DDQTFEFISTYPLDTFE--YGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFI 755
D + + PL E ++L S + D Y VG A L ++G +
Sbjct: 880 DAMHGQTMFNVPLTQNEAIMSMAMLKFSIAADGRYYLAVGIAKDLQLNPRISQGGCIDIY 939
Query: 756 VED---GKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSEC-G 811
D L+ + E +L F G+LLA + +++Y D G +++ +C
Sbjct: 940 KIDPTCSSLEFMHRTEIDEIPGALCGFQGRLLAGCGRMLRIY-----DLGKKKMLRKCEN 994
Query: 812 HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI 871
H + +Q G + V D+ +S+ + Y+ E + A D + W++A +LD D
Sbjct: 995 KHIPYQIVNIQAMGHRVYVSDVQESVFFIRYRRAENQLIIFADDTHPRWVTATTLLDYDT 1054
Query: 872 YLGAENNFNLFTVRKNSEGATDE------------ERGRL-------EVVGEYHLGEFVN 912
+ + F ++++ TD+ +RG L E + +H+GE +
Sbjct: 1055 -IAIADKFGNLSIQRLPHSVTDDVDEDPTGTKSLWDRGLLSGASQKSENICSFHVGEIIM 1113
Query: 913 RFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGV 971
+ +L+ G +I+ T++G +G E Y F + L+ ++R +
Sbjct: 1114 SLQKATLI-------PGGSEALIYATLSGTVGAFVPFTSREDYDFFQHLEMHMRNENPPL 1166
Query: 972 GGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEE 1031
G +H +RS KN LDGDL E +L + ++ I+ M + ++CK++E+
Sbjct: 1167 CGRDHLSYRS-----SYYPVKNVLDGDLCEQYLSIDASKQKSIAGDMFRTPNQICKKLED 1221
Query: 1032 L 1032
+
Sbjct: 1222 I 1222
>gi|195012560|ref|XP_001983703.1| GH16029 [Drosophila grimshawi]
gi|193897185|gb|EDV96051.1| GH16029 [Drosophila grimshawi]
Length = 1228
Score = 199 bits (505), Expect = 9e-48, Method: Compositional matrix adjust.
Identities = 257/1200 (21%), Positives = 470/1200 (39%), Gaps = 198/1200 (16%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
++ I+G I L FR G +D++ I ++ + +L++ + L
Sbjct: 53 LMSTEIFGCIRALMAFRLTGGTKDYIVIGSDSGRIVILEYIPSKNSLEKVHQETFGKSGC 112
Query: 61 RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL- 117
R GQ IDP R +IG L ++ D + +L + + + L +
Sbjct: 113 RRIVPGQYFAIDPKGRAVMIGAVEKQKLAYIMNRDTQARLTISSPLEAHKSNTLTYHMVG 172
Query: 118 --YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNGA 164
G P L D ++A + + A + + F E N+ L+ A
Sbjct: 173 VDVGFDNPMFACLEIDYEEADLDPSGDAAQRTQQTLTFYELDLGLNHVVRKYSEPLEEHA 232
Query: 165 DLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR---- 215
+ L+ VP P GVLI E + Y N IR I + +D D R
Sbjct: 233 NFLVSVPGGNDGP--SGVLICSENYLTY--KNLGDQHDIRCPIPRRRNDLD-DPERGMIF 287
Query: 216 -------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVV 262
+LL G + + + + + V+ +K++ A+ + L +
Sbjct: 288 ICSATHRTKSMYFFLLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPATAMCVLKTGFL 347
Query: 263 YIGSSYGDSQLIKLN------------------------LQPDAKGSYVEVLERYVNLGP 298
++ S +G+ L ++ P A + V +++ + P
Sbjct: 348 FVASEFGNHYLYQIAHLGDDDDEPEFSSAMPLEEGETFFFAPRALKNLV-LVDELPSFAP 406
Query: 299 IVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSST 357
I+ V DL + Q+ G +LR++R+G+ ++E A EL G +W+++
Sbjct: 407 IITSQVADLANEDTPQLYVLCGRGPRSTLRVLRHGLEVSEMAVSELPGNPNAVWTVKKRI 466
Query: 358 DDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSV 417
DD FD +++VSF++ T +L++ + +EE GF T TL C + LVQV +
Sbjct: 467 DDEFDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLCCAALGDDALVQVYPDGI 524
Query: 418 RLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD-GILTE-VKHAQ 475
R + S R NEWK+P S+ N QV++ G LVY E+ G L E + ++
Sbjct: 525 RHIRSDKR--VNEWKAPGKKSITKCAVNQRQVVITLSGRELVYFEMDPTGELNEYTERSE 582
Query: 476 LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLL 535
+ EI C+ + + E S AVG+ D +VRI SL N ++ + P L
Sbjct: 583 MPAEIMCMALGTVPEGEQRSWFLAVGL-ADNTVRILSLDPNNCLSPCSMQALPSPAES-L 640
Query: 536 CAFE--------------------------GISYLLCALGDGHLLNFLLNMKTGELTDRK 569
C E G YL L +G LL +L+ +G+L D +
Sbjct: 641 CLVEMGHTESTTNASGADEDVPAQRSGSNKGTIYLNIGLSNGVLLRTVLDPVSGDLADTR 700
Query: 570 KVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAF 629
LG++P+ L + + V A S R + Y + + ++ + + + F+S
Sbjct: 701 TRYLGSRPVKLFRIKMQGSEAVLAMSSRTWLSYYHQNRFHLTPLSYETLEYASGFSSEQC 760
Query: 630 PDSLAIAKEGELTIGTIDDIQKLHIR-SIPLGEHPRRICHQEQSRTFAICSLKNQSCAEE 688
+ + L I ++ + + + + PL PR + I + + EE
Sbjct: 761 SEGIVAISTNTLRILALEKLGAVFNQVAFPLQFTPRAFVIHPDTGRMLIAETDHNAYTEE 820
Query: 689 S----------EMHF----------------------------------------VRLLD 698
+ EM +R LD
Sbjct: 821 TKNERKEQMAEEMRSAAGDEERDLAREMANAFINEVLPEDMFSAPKAGLGLWASQIRCLD 880
Query: 699 DQTFEFISTYPLDTFE--YGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRIL-VFI 755
+ + + PL E ++L S + D Y VG A L ++G + V+
Sbjct: 881 AMHGQTMFSVPLAQNEAIMAMAMLKFSIAADGRYYLAVGIAKDLQLNPRISQGGCIDVYK 940
Query: 756 VED--GKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSEC-GH 812
++ L+ + E + +L F G+LLA + +++Y D G +++ +C
Sbjct: 941 IDPTCSALEFLHRTEIEEIPGALCGFQGRLLAGCGRMLRIY-----DLGKKKMLRKCENK 995
Query: 813 HGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIY 872
H + +Q G + V D+ +S+ + Y+ E + A D + W++A +LD D
Sbjct: 996 HIPYQIVNIQAMGHRVYVSDVQESVFFIRYRRAENQLIIFADDTHPRWVTATTLLDYDT- 1054
Query: 873 LGAENNFNLFTVRKNSEGATDE------------ERGRL-------EVVGEYHLGEFVNR 913
+ + F ++++ TD+ +RG L E + +H+GE +
Sbjct: 1055 IAIADKFGNLSIQRLPHSVTDDVDEDPTGTKSLWDRGLLSGASQKSENICSFHVGEIIMS 1114
Query: 914 FRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVG 972
+ +L+ G +I+ T++G +G E Y F + L+ ++R +
Sbjct: 1115 LQKATLI-------PGGSEALIYSTLSGTVGAFVPFTSREDYDFFQHLEMHMRNENPPLC 1167
Query: 973 GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
G +H +RS KN LDGDL E +L + ++ I+ M + ++CK++E++
Sbjct: 1168 GRDHLSYRS-----SYYPVKNVLDGDLCEQYLSVDASKQKSIAGDMFRTPNQICKKLEDI 1222
>gi|353236335|emb|CCA68332.1| probable splicing factor 3B subunit 3 [Piriformospora indica DSM
11827]
Length = 1243
Score = 198 bits (504), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 258/1188 (21%), Positives = 481/1188 (40%), Gaps = 198/1188 (16%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESS---ELITRAMGDVSDRIGRP 62
++G I +L FR G +D+ + ++ + +L++D + + +L G R P
Sbjct: 86 VFGSIRSLAAFRLTGGTKDYAIVGSDSGRIVILEYDPKQNAFNKLYQETYGKTGARRIVP 145
Query: 63 TDNGQIGIIDPDCRLIGLHLYDG--LFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL--- 117
GQ+ DP R + + + L V+ D L + + + + +
Sbjct: 146 ---GQMLATDPKGRSLMVSAMEKAKLVYVLNRDAAANLTISSPLEAHKPSAIIHHIVGVD 202
Query: 118 YGCAKPTIVVLYQDNKDA-------------RHVKTYEVALKDKDFVEGPWSQNNLDNGA 164
G P L D +A + + YE+ L V WS+ D A
Sbjct: 203 VGFENPLYAALEVDYSEADNDPSGQAIRDTEKMLTFYELDLGLNHVVRK-WSEPT-DRRA 260
Query: 165 DLLIPVP--------PPLC------GVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVD 210
+LL+ VP PP+ GVL+ E+ ++Y + + +R I + ++
Sbjct: 261 NLLVQVPGGQTTTGDPPITRIDGPSGVLVCCEDHVIYRHMDQPQ---LRVPIPRREHPLE 317
Query: 211 ADGSR-----------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIAST 253
SR +LL G L+ + + H+ E V LKI+ S A
Sbjct: 318 DASSRGLLIISAVMHRMKGAFFFLLQSELGDLYKVTLEHQDEDVVALKIKYFDTVSPAVN 377
Query: 254 ISYLDNAVVYIGSSYGDSQL------------------------IKLNLQPDAKGSY--- 286
+ L + +++ + +G+ ++ QP + +
Sbjct: 378 LCILKSGHLFVPAEFGNHAFYQFQKLGDDDKEPEYSSADYPGNGMRTPSQPLPRAYFRPK 437
Query: 287 ----VEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASV 342
+ +L+ +L PI+ V +L T G + R +++G+ + E +
Sbjct: 438 PLENLVLLDELESLDPILAARVQNLP--DTSLFYTACGKGAKSTFRTLKHGLEVEENGNS 495
Query: 343 ELQGI-KGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFC 401
+L I +W+L+ + D +D+++V+SFI+ T +L+ +E+EE GF S TL
Sbjct: 496 DLPSIPNAVWTLKLAETDQYDSYIVLSFINGTLVLSFG--EEIEEIPNSGFLSSEPTLAA 553
Query: 402 HDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYL 461
+ L+QV +R V S R NEW++P G ++ AT N QV++A LVY
Sbjct: 554 QQLGSDALLQVHPRGIRHVLSDKRV--NEWRAPIGMAIVAATTNKRQVVVALSSAELVYF 611
Query: 462 EIG-DGILTEVKHAQ-LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PD--L 516
E+ +G L E + + + + L + + E + AVG D +VRI SL PD L
Sbjct: 612 ELDYEGQLNEFQERKAMGSTVLALSVGEVPEGLQRFKYLAVGC-EDQTVRIISLDPDSTL 670
Query: 517 NLITKEHLGGEIIPRSVLLCAF--------EGISYLLCALGDGHLLNFLLNMKTGELTDR 568
+I+ + + P S+ + +++ L +G LL +L+ G+L D
Sbjct: 671 EMISLQAVTAP--PSSISIADMFDSSIDKHRPTTFVNIGLQNGVLLRTVLDPTNGKLADT 728
Query: 569 KKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAA 628
+ LG +P+ L + V A S R + Y+ + ++ + + C N+
Sbjct: 729 RTRFLGNRPVRLVRTQVHGSPGVLALSSRSWLNYTYQGLVHFTPLAYDRLDGACSVNAEL 788
Query: 629 FPDSLAIAKEGELTIGTIDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAE 687
PD L I + + KL +PL PR+ Q+ F + +++ +E
Sbjct: 789 CPDGFIGIANNTLRIFQVPKLGSKLKQEILPLSYTPRKFVSHPQNSYFYLIESDHRAMSE 848
Query: 688 -----------------ESEM---------HF----------VRLLDDQTFEFISTYPLD 711
+ EM HF +R++D ++ + LD
Sbjct: 849 TMIEERVKAIEMSGEKVDREMLELDPRIYGHFKAPEGVWASCIRIIDPVNLRSVAAFSLD 908
Query: 712 TFEYGCSILSCSF-SDDSNVYYCVGTA---YVLPEENEPTKGRILVFIVEDGKLQLIAEK 767
E SI F + + + VGTA ++ P R+ F L+L+ +
Sbjct: 909 NNEAAFSIAVVPFAARNGELLLVVGTAVDTHLAPRSCSTGYLRVYSFTEGGSGLELLHKT 968
Query: 768 ETKGAVYSLNAFNGKLLAAINQKIQLY----KWMLRDDGTRELQSECGHHGHILALYVQT 823
+ +L AF G+L+A + + ++LY K +LR R+ + + + T
Sbjct: 969 DIDEVPTALMAFQGRLIAGVGKALRLYDIGKKKLLRKAENRQFATAI--------VTLST 1020
Query: 824 RGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFT 883
+G I+ GD+ +SI + YK E + A D +A W++A +LD + + A+ N+F
Sbjct: 1021 QGSRILAGDINQSIYYVAYKAAENRLLIFADDTSARWITASTMLDYNTVVAADKFGNVFV 1080
Query: 884 VRKN---SEGATDEERG---------------RLEVVGEYHLGEFVNRFRHGSLVMRLPD 925
R + S+ D+ G + +++ +H+G+ + + +LV
Sbjct: 1081 NRLDEAVSKQVDDDPTGAGILHEKSVLMGAPHKTKMLTHFHVGDVITSIQRVALV----- 1135
Query: 926 SDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNN 984
G V++ ++G IG++ L + F+ L+ ++R + G +H WR +
Sbjct: 1136 --AGAREVVVYFGLHGTIGILVPLVTKDDVDFISTLEQHMRTENLSLVGRDHLSWRGYYT 1193
Query: 985 EKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
K +DGDL E F L + I+ ++ +V ++ K++E L
Sbjct: 1194 PVKAT-----VDGDLCEYFAKLPTQKQLAIAGELDRTVGDVLKKLESL 1236
>gi|303324325|ref|XP_003072150.1| Splicing factor 3B subunit 3, putative [Coccidioides posadasii C735
delta SOWgp]
gi|240111860|gb|EER30005.1| Splicing factor 3B subunit 3, putative [Coccidioides posadasii C735
delta SOWgp]
Length = 1209
Score = 198 bits (503), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 271/1176 (23%), Positives = 481/1176 (40%), Gaps = 185/1176 (15%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
++G I +L FR G ++D++ I ++ + ++++ S R + + G R
Sbjct: 64 VFGIIRSLAAFRLAGSSKDYIIIGSDSGRIAIVEY-VPSQNRFNRIHLETFGKSGIRRVV 122
Query: 65 NGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YG 119
GQ +DP R LI + L V+ + + +L + + Q L G
Sbjct: 123 PGQYLAVDPKGRACLIASVEKNKLVYVLNRNAQAELTISSPLEAHRPQTLVFALTALDVG 182
Query: 120 CAKPTIVVLYQDNKDARHVKT-------------YEVALKDKDFVEGPWSQNNLDNGADL 166
P L D ++ T YE+ L V W+ + +D A +
Sbjct: 183 YENPIFAALEVDYTESDQDPTGAAYQEAEKLLVYYELDLGLNHVVRR-WA-DPVDRSAAM 240
Query: 167 LIPVP-----PPLCGVLIIGEETIVYCSAN--AFKA-IPIRPSITKAYGRVDADGSRYLL 218
L VP P GVL+ E I Y +N AF+ IP R T+ + + RY+
Sbjct: 241 LFQVPGGADGP--SGVLVCSEGNITYRHSNQDAFRVPIPRRSGPTE-----NPERKRYIT 293
Query: 219 GDHAGLLHLL---------------------VITHEKEKVTG----LKIELLGETSIAST 253
AG++H + ++ E ++TG LK++ +AS+
Sbjct: 294 ---AGVVHKMRRAFFCLLQTEDGDLFKVTMDMVEDENGQLTGEVQRLKLKYFDTVPVASS 350
Query: 254 ISYLDNAVVYIGSSYGDSQLIKLN-----------------------LQP----DAKGSY 286
+ L N +++ S G+ + L P
Sbjct: 351 LCILKNGFLFVASETGNHHFYQFEKLGDDDEETEFSSDDFSADPSEPLAPVYFRPRPAEN 410
Query: 287 VEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG 346
+ ++E +L P++ + +L Q T SG + R +++G+ ++E EL
Sbjct: 411 LNLVESINSLNPLMSCKITNLTEDDAPQFYTLSGTGARSTFRTLKHGLEVSEIVESELPS 470
Query: 347 I-KGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAI 405
+ +W+ + + +D +D ++++SF + T +L++ + +EE GF S TL
Sbjct: 471 VPSAVWTTKLTRNDQYDAYIILSFSNGTLVLSIG--ETVEEVTDTGFLSSAPTLAVQQLG 528
Query: 406 YNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG- 464
+ L+QV +R + + R NEW +P S+ A N QV +A G +VY E+
Sbjct: 529 EDSLIQVHPKGIRHIHADRR--VNEWPAPQHRSIVAAATNERQVAVALSSGEIVYFEMDT 586
Query: 465 DGILTEV-KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKE 522
DG L E + ++ ++CL + + S AVG D +VRI SL PD L K
Sbjct: 587 DGSLAEYDEKREMSGTVTCLSLGEVPPGRVRSSFLAVGC-DDSTVRILSLDPDSTLENKS 645
Query: 523 HLGGEIIPRSV-LLCAFEGIS-----YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQ 576
P ++ ++ + S YL L G L +L+ TGEL+D + LG +
Sbjct: 646 VQALTSAPSALSIMSMVDSTSGGSTLYLHIGLYSGIYLRTVLDEVTGELSDTRTRFLGLK 705
Query: 577 PITLRTFSSKNTTHVFAASDRPTVIYS--SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLA 634
+ L + S K V A S RP + YS K + + ++ + F+S + +
Sbjct: 706 SVKLFSVSVKEQRAVLALSSRPWLGYSDLQTKNFMLTPLDYVPLEWSWNFSSEQCVEGMV 765
Query: 635 IAKEGELTIGTIDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHF 693
+ L I +I+ + L +IPL PR + F + N + ++
Sbjct: 766 GIQGQNLRIFSIEKLDNNLLQETIPLAYTPRHFVRHPEQPLFYVIEADNNILSPSTKAKL 825
Query: 694 ----------VRLLDDQTFEF----------------------ISTYPLDTFEYGCSILS 721
V L + F + IS L+ E S+ +
Sbjct: 826 LQDSKAANGEVAELPPEDFGYPRGTGHWASCIQVVDPINSKAVISRIELEENEAAVSVAA 885
Query: 722 CSFSD-DSNVYYCVGTA---YVLPEENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVYS 775
FS D + VGT V P + + G I ++ EDGK L+ I + + + ++
Sbjct: 886 VPFSSQDDETFLVVGTGKDMVVYPPSS--SCGFIHIYRFQEDGKELEFIHKTKVESPPHA 943
Query: 776 LNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSEC-GHHGHILALYVQTRGDFIVVGDLM 834
L AF G+LLA I + +++Y D G ++L +C L + +QT+G I+V D+
Sbjct: 944 LLAFQGRLLAGIGRNLRIY-----DLGMKQLLRKCQAEVVPRLIVGLQTQGSRIIVSDVQ 998
Query: 835 KSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGA 891
+S++ ++YK++E + A D A W + ++D + G + NL+ +R K SE A
Sbjct: 999 ESVTYVVYKYQENRLIPFADDVIARWTTCTAMVDYETVAGGDKFGNLWLLRCPQKASEEA 1058
Query: 892 TDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGT------------- 938
++ G + +L NR SL++ D IPT I T
Sbjct: 1059 DEDGSGAHLIHERQYLQGAPNRL---SLMVHFYPQD---IPTSIQKTQLVAGGRDILVWT 1112
Query: 939 -VNGVIGVIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLD 996
+ G +G++ + E F + L+ L + G +H +RS+ AK +D
Sbjct: 1113 GLQGTVGMLVPFVSREDVDFFQSLEMQLTSQTPPLAGRDHLIYRSY-----YAPAKGTID 1167
Query: 997 GDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
GDL E++ L + I+ ++ SV E+ +++ ++
Sbjct: 1168 GDLCETYFTLPNDKKLMIAGELDRSVREIERKISDM 1203
>gi|213405251|ref|XP_002173397.1| U2 snRNP-associated protein Sap130 [Schizosaccharomyces japonicus
yFS275]
gi|212001444|gb|EEB07104.1| U2 snRNP-associated protein Sap130 [Schizosaccharomyces japonicus
yFS275]
Length = 1166
Score = 197 bits (502), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 259/1140 (22%), Positives = 483/1140 (42%), Gaps = 145/1140 (12%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELIT---RAMGDVSD 57
M+ YG + + R G +D+L + ++ +F +L++D ++L++ A G
Sbjct: 59 MISQNCYGIVRNIAPLRLTGFKKDYLVLTSDSGRFTILEYDIGKNKLVSVYQEAFGKSGI 118
Query: 58 RIGRPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIK 115
R P G+ +D R ++ + L V+ D++ L + + + +
Sbjct: 119 RRIVP---GEYLALDAKGRAAMVASTEKNKLVYVLNRDSEANLTISSPLEAHKAGTICFD 175
Query: 116 FL---YGCAKPTIVVLYQDNKDARH-------------VKTYEVALKDKDFVEGPWSQNN 159
+ G P L + D H + YE+ L V+ WS+
Sbjct: 176 LVGLDTGYENPIFAALEVEYSDLDHDPLGELYKHSEKVLTYYELDLGLNHVVKR-WSKV- 233
Query: 160 LDNGADLLIPVP-----PPLCGVLIIGEETIVY---CSANAFKAIPIR----------PS 201
+D A LI VP P GV++I I Y + F IP R P
Sbjct: 234 VDRSAYKLIRVPGGNDGP--SGVIVISTGWISYRHLQRQSHFVPIPTRETKATTNTALPI 291
Query: 202 ITKAYGRVDADGSRYLLGDHAG-LLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNA 260
I A D YLL + G LL L + + +V L+I+ A+ ++ L +
Sbjct: 292 IVSAVMHKMRDSFFYLLQNSDGDLLKLTMELDDHSQVKELRIKYFDTIPFAAILNILKSG 351
Query: 261 VVYIGSSYGDSQLIKL------NLQPD---------------AKGSY-------VEVLER 292
+++ G G+ L + + +P+ K +Y + +L+
Sbjct: 352 LLFAGCEGGNHHLYQFESLAIDDDEPEFSSANFSEEQSKHSPKKLTYKLHPLQNISLLDE 411
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIK-GMW 351
+L P+ D V Q+ T G +K+ SLR+++ G+ E EL G +W
Sbjct: 412 IPSLFPLTDAIVTRTSTDANSQLYTLCGRHKEASLRLLKRGVSATEVVLSELPGAPIAIW 471
Query: 352 SLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQ 411
+++ +DP+D ++V+SF + T +L++ + +EE G S TL + +VQ
Sbjct: 472 TVKQKLNDPYDKYMVLSFTNGTLVLSIG--ETVEEVLDSGLLSSVSTLNVRQLGRSSVVQ 529
Query: 412 VTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD--GILT 469
+ S +R +S+ ++E+ EWK+P + + N Q++++ L Y E+ D G L
Sbjct: 530 IHSKGIRCISA-NKEV-TEWKTPADTVITNSAINEQQIVVSLSNDELAYFEMDDEYGQLN 587
Query: 470 EVKHAQL-EYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEI 528
E + +L ++ L + P+ + S + D +VRI SL + +
Sbjct: 588 EYQERKLLTSPVTALALGPVPQGSKRSNFLCLAS-EDSTVRIVSLDPYTTLENLSVQALS 646
Query: 529 IPRSVL------LCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRT 582
P S L + +E + YL L +G L ++++ +G+L D + LG +PI L
Sbjct: 647 APASSLCMVNMEVTGYETL-YLHIGLSNGVYLRTVVDVTSGQLIDTRTRFLGPRPIRLSP 705
Query: 583 FSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELT 642
+ V A ++R + YS+ + L S + + + F S P+ + + L
Sbjct: 706 IIVRGKQSVLAIANRSYLSYSNEQTLQVSPLLYSPLEYADSFASHQCPEGIVGIHQNILK 765
Query: 643 IGTI----DDIQKLHIRSIPLGEHPRRIC-HQEQSRTFAICSLKNQSCAEESEMHFVRLL 697
I T+ DD+++ PL P+R+ H E + + S ++ + + + +
Sbjct: 766 IFTVEATHDDLKQ---DVFPLSCTPKRVIKHPELDILYILQSERHCPNSTNEWITVLSIF 822
Query: 698 DDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPE--ENEPTKGRILVFI 755
D + I TF L +F N + V ++ + + + + G + V+
Sbjct: 823 DMNEKKMIEGL---TFNEASFDLCYAFFRSRNEGFLVCSSAINYDISKQSCSNGVLRVYS 879
Query: 756 VEDGKLQL--IAEKETKGAVYSLNAFNGKLLAAINQKIQLY----KWMLRDDGTRELQSE 809
+ DG L L I+E ET L F+G+L+A + +++Y K +LR R + +
Sbjct: 880 LNDGGLSLKCISETETDSFARVLKPFHGRLIAGVGPFLRVYDLGNKKLLRKSEVRAVPN- 938
Query: 810 CGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDD 869
+QT+G I+V D S +++K E+ A D A W +A ++D
Sbjct: 939 -------FITTIQTQGYRIIVTDAQHSAFFVVFKPEDNRYIVFADDCVARWATATAMVDY 991
Query: 870 DIYLGAENNFNLFTVR--KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSD 927
D +G + NL+ +R ++ DEE +++ H F++ H +M +
Sbjct: 992 DTVVGGDKFSNLWLLRCPESVSQLADEENSGSKLL---HEKPFLHSSPHKLDLM--AHTF 1046
Query: 928 VGQIPT--------------VIFGTVNGVIGVIASLPH-EQYLFLEKLQTNLRKVIKGVG 972
V IPT +++ + G IGV++ + E F ++L++ LR +
Sbjct: 1047 VNDIPTSIQKVQLVEGARDVIMWTGLLGTIGVLSPFVNREDVRFFQQLESLLRAEDLSIV 1106
Query: 973 GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
G +H +RS+ V K +DGDL E + +L R + + I+ ++ +V E+CK++E+L
Sbjct: 1107 GRDHLAYRSY-----YVPVKCVVDGDLCEQYYNLPRDKQESIANELDRTVIEVCKKIEDL 1161
>gi|170041368|ref|XP_001848437.1| splicing factor 3B subunit 3 [Culex quinquefasciatus]
gi|167864946|gb|EDS28329.1| splicing factor 3B subunit 3 [Culex quinquefasciatus]
Length = 1215
Score = 197 bits (502), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 256/1187 (21%), Positives = 473/1187 (39%), Gaps = 185/1187 (15%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+L ++G I +L FR G +D+ + ++ + +L+++A ++L
Sbjct: 53 LLQTEVFGVIRSLMSFRLTGGTKDYAVVGSDSGRIVILEYNAAKNQLEKVHQETFGKSGC 112
Query: 61 RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL- 117
R GQ IDP R +IG L ++ D++ +L + + + L +
Sbjct: 113 RRIVPGQFLAIDPKGRAVMIGAIEKQKLVYILNRDSEARLTISSPLEAHKSSTLTYHMVG 172
Query: 118 --YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNGA 164
G P L D ++A T E A K + F E N+ L+ A
Sbjct: 173 VDVGFENPMFACLEIDYEEADLDPTGEAAAKTQQTLTFYELDLGLNHVVRKYSEPLEEHA 232
Query: 165 DLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR---- 215
+ LI VP P GVLI E + Y N IR I + +D D R
Sbjct: 233 NFLISVPGGNDGP--SGVLICSENYLTY--KNLGDQHDIRCPIPRRRNDLD-DPERGMIF 287
Query: 216 -------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVV 262
+L+ G + + + + + V +K++ A+ + L +
Sbjct: 288 ICSATHRTKSMFFFLVQTEQGDIFKVTLETDDDVVAEIKLKYFDTVPPATAMCVLKTGFL 347
Query: 263 YIGSSYGDSQLIKL-------------NLQPDAKGSYVEVLERYV----------NLGPI 299
++ +G+ L ++ + P +G R + + PI
Sbjct: 348 FVACDFGNHYLYQIAHLGDDDDEPEFSSAMPLEEGDTFFFAPRPLKNLVMVDEIHSFAPI 407
Query: 300 VDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTD 358
+ V DL + Q+ G S+R++R+G+ ++E A EL G +W+++ D
Sbjct: 408 LGCQVADLANEDTPQLYLACGRGPRSSIRVLRHGLEVSEMAVSELPGNPNAVWTVKKRAD 467
Query: 359 DPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVR 418
D FD +++VSF++ T +L++ D +EE GF T TL C + LVQV +R
Sbjct: 468 DEFDAYIIVSFVNATLVLSIG--DTVEEVTDSGFLGTTPTLCCSALGDDALVQVYPDGIR 525
Query: 419 LVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD-GILTE-VKHAQL 476
+ + R NEWK+P ++ N+ QV++A GG LVY E+ G L E + ++
Sbjct: 526 HIRADKR--VNEWKAPGKKTIIKCAVNSRQVVIALSGGELVYFEMDPTGQLNEYTERKKM 583
Query: 477 EYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSV-LL 535
++ C+ + + + AVG+ D +VRI SL + ++ + + +P + L
Sbjct: 584 PSDVMCMALGSVPAGEQRAWFLAVGL-ADNTVRIISLDPADCLSPRSM--QALPSAAESL 640
Query: 536 CAFE---------------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITL 580
C E G YL L +G LL +L+ +G+L D + LG++P+ L
Sbjct: 641 CIVEMGTGESNEEGTVSTAGCFYLNIGLTNGVLLRTVLDPVSGDLADTRTRYLGSRPVKL 700
Query: 581 RTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGE 640
+ + V A S R + Y + + ++ + + + F+S + +
Sbjct: 701 FRIKMQGSEAVLAMSSRTWLSYYYQNRFHLTPLSYETLEYASGFSSEQCAEGIVAISTNT 760
Query: 641 LTIGTIDDIQKLHIR-SIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDD 699
L I ++ + + + + PL P+R ++ I + + EE++ + + D
Sbjct: 761 LRILALEKLGAVFNQITFPLEYTPKRFLIHNETGKLIISETDHNAYTEETKNIRKKQMAD 820
Query: 700 QTFE----------------FIS-TYPLDTF----------------------------- 713
+ E FI+ P D F
Sbjct: 821 EMREAAGEDEQELANEMADAFINEVLPEDQFSSPKAGAGMWASQIRVMDPINGHTYSKVQ 880
Query: 714 ----EYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKG---RILVFIVEDGKLQLIAE 766
E S+ F+ D Y G A L + T G + + +L+
Sbjct: 881 LAQNEAVMSMALVRFTVDQKWYVVAGVAKDLALNPKITNGGFIDVYKYDFHTHQLEHYHR 940
Query: 767 KETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHI--LALYVQTR 824
E A ++ +F G++L I + +++Y D G ++L +C + HI + +Q
Sbjct: 941 TEIDDAPGAICSFQGRVLVGIGKVLRIY-----DLGKKKLLRKC-ENKHIPNQIVNIQAM 994
Query: 825 GDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTV 884
G + V D+ +SI + YK E + A D + W++ +LD D A+ N+ +
Sbjct: 995 GSRVFVSDVQESIYCVKYKRAENQLIIFADDTHPRWITTSTLLDYDTVATADKFGNVSVL 1054
Query: 885 RKNSEGATD-----------EERGRL-------EVVGEYHLGEFVNRFRHGSLVMRLPDS 926
R + D +RG L E V +HLGE + + +L+
Sbjct: 1055 RLPHSVSDDVDEDPTGNKALWDRGLLNGASQKAENVCTFHLGETIMSLQKATLI------ 1108
Query: 927 DVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNE 985
G ++I+ T++G +G + E + F + L+ ++R + G +H +RS+
Sbjct: 1109 -PGGSESLIYATMSGTVGALVPFTSREDFDFFQHLEMHMRNENPPLCGRDHLSFRSY--- 1164
Query: 986 KKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
KN +DGDL E F + + I+ + + E+ K++E++
Sbjct: 1165 --YYPVKNVMDGDLCEQFTSMDPAKQKSIATDLGRTPNEVAKKLEDI 1209
>gi|156051126|ref|XP_001591524.1| hypothetical protein SS1G_06970 [Sclerotinia sclerotiorum 1980]
gi|154704748|gb|EDO04487.1| hypothetical protein SS1G_06970 [Sclerotinia sclerotiorum 1980 UF-70]
Length = 1147
Score = 197 bits (502), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 244/1020 (23%), Positives = 426/1020 (41%), Gaps = 144/1020 (14%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
++G I + FR G +D++ I ++ + ++++ ++ + R
Sbjct: 64 VFGIIRAIAAFRLAGSNKDYIIITSDSGRITIVEFIPAQNKFNRLHLETFGKSGVRRVVP 123
Query: 66 GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YGC 120
GQ +DP R L + L V+ +++ +L + + + Q L + G
Sbjct: 124 GQYLAVDPKGRACLTASVEKNKLVYVLNRNSQAELTISSPLEAHKAQTLVFALVALDVGY 183
Query: 121 AKPTIVVLYQDNKDARHVKT-------------YEVALKDKDFVEGPWSQNNLDNGADLL 167
A P L D ++ T YE+ L V WS + +D A++L
Sbjct: 184 ANPVFAALEVDYGESDQDPTGQAYEDIEKQLVYYELDLGLNHVVRK-WS-DPVDRTANIL 241
Query: 168 IPVPPPL---CGVLIIGEETIVYCSAN--AFK-AIPIRPSITKAYGRVD--ADGSRYLLG 219
VP GVL+ GE+ I Y +N AF+ AIP R T+ R G + L
Sbjct: 242 FQVPGGTDGPSGVLVCGEDNITYRHSNQEAFRVAIPRRRGATEDPQRKRNIVAGVMHKLK 301
Query: 220 DHAGLLHLLVITHEKE------------------KVTGLKIELLGETSIASTISYLDNAV 261
AG L+ T + + +V LKI+ +A+++ L +
Sbjct: 302 GAAGAFFFLLQTDDGDLFKITIEMVEDDNGQPTGEVKRLKIKYFDTVPVAASLCILKSGF 361
Query: 262 VYIGSSYGDSQLIKL---------------NLQPDAKGSYVEV------------LERYV 294
+++ S +G+ Q + + SY V +E
Sbjct: 362 LFVASEFGNHQFYQFEKLGDDDEEIEFVSDDFPTGTNESYTPVYFHPRPAENLSLVESID 421
Query: 295 NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSL 353
++ P++D V +L + Q+ + G + R +++G+ ++E EL G+ +W+
Sbjct: 422 SMNPLMDCKVANLTDEDAPQIYSICGTGARSTFRTLKHGLEVSEIVESELPGVPSAVWTT 481
Query: 354 RSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVT 413
+ + +D +D ++++SF + T +L++ + +EE GF S TL + L+QV
Sbjct: 482 KLTRNDVYDAYIILSFSNGTLVLSIG--ETVEEVTDTGFLSSAPTLAVQQLGEDSLIQVH 539
Query: 414 SGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTEV- 471
+R + + R NEW +P S+ AT N QV +A G +VY E+ DG L E
Sbjct: 540 PKGIRHIRADRR--VNEWAAPQHRSIVAATTNERQVAVALSSGEIVYFEMDSDGSLAEYD 597
Query: 472 KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIP 530
+ ++ ++CL + + E SQ AVG D +VRI SL PD L K P
Sbjct: 598 EKKEMSGTVTCLSLGEVPEGRQRSQFLAVGC-DDSTVRILSLDPDSTLENKSVQALTSPP 656
Query: 531 RSVLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFS 584
++ + A S YL L G L +L+ TGEL+D + LG +P+ L S
Sbjct: 657 NALSIMAMSDSSSGGSTLYLHIGLYSGVYLRTVLDEVTGELSDTRTRFLGPKPVKLFRVS 716
Query: 585 SKNTTHVFAASDRPTVIYSS--NKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELT 642
+ T V A S RP + YS K + + ++ + F+S + + + L
Sbjct: 717 VQGQTAVLALSSRPWLGYSDPVTKGFMLTPLDYPALEWGWNFSSEQCTEGMVGIQGQNLR 776
Query: 643 IGTIDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQT 701
I +I+ + L SIPL PRR + F + N + ++ +LL+D +
Sbjct: 777 IFSIEKLTNNLLQESIPLTYTPRRFVRHPEHPLFYVVEADNNILSPATKQ---KLLEDPS 833
Query: 702 F-----------EFISTYPLDTFEYGCSI-----------LSCSFSDDS----------- 728
EF YP T + I L+ +D+
Sbjct: 834 VVNGDATVLPAEEF--GYPRGTNHWASCISVVDPVTEKKVLATIHLEDNESAVSVAVVAF 891
Query: 729 -----NVYYCVGTAY-VLPEENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVYSLNAFN 780
+ VGT ++ + G I V+ EDGK L+ I + + + +L AF
Sbjct: 892 ASQEDETFLVVGTGKDLVVSPRSSSAGFIHVYRFHEDGKELEFIHKTKVEEPPMALLAFQ 951
Query: 781 GKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLL 840
G+LLA + + +++Y +R R+ QSE + + + +QT+G I+V D+ +SI+++
Sbjct: 952 GRLLAGVGKDLRIYDLGMR-QLLRKAQSEIAPN---MIVGLQTQGSRIIVSDVQESITMV 1007
Query: 841 IYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDEERG 897
+YK +E + D A W S ++D + G + NL+ +R K SE A +E G
Sbjct: 1008 VYKFQENRLIPFVDDTIARWTSCTTMVDYETVAGGDKFGNLWLLRCPPKASEEADEEGSG 1067
>gi|171685748|ref|XP_001907815.1| hypothetical protein [Podospora anserina S mat+]
gi|170942835|emb|CAP68488.1| unnamed protein product [Podospora anserina S mat+]
Length = 1235
Score = 197 bits (502), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 280/1193 (23%), Positives = 496/1193 (41%), Gaps = 193/1193 (16%)
Query: 6 IYGRIATLELFRPHGEAQ---DFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRP 62
++G I + FR G + D++ +AT+ + ++++ +++ + R
Sbjct: 64 VFGIIRAIASFRLAGSHKGNLDYIILATDSGRIAIIEYLPKTNRFNRIHLETFGKSGVRR 123
Query: 63 TDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL--- 117
GQ DP R LI + L V+ + + +L + + + VL + +
Sbjct: 124 VVPGQYLAADPKGRACLIASLEKNKLVYVLNRNAQAELTISSPLEAHKPGVLVLSLVALD 183
Query: 118 YGCAKPTIVVLYQDNKDARHVKT-------------YEVALKDKDFVEGPWSQNNLDNGA 164
G A P L D +A T YE+ L V WS + +D +
Sbjct: 184 VGYANPVFAALELDYTEADQDPTGQAGQEPEAQLVYYELDLGLNHVVRK-WS-DAVDPTS 241
Query: 165 DLLIPVP-----PPLCGVLIIGEETIVYCSAN--AFKA-IPIR--PSITKAYGRVDADGS 214
LL VP P GVL+ GEE + Y +N AF+ IP R P+ R G
Sbjct: 242 SLLFQVPGGSDGP--SGVLVCGEENVTYRHSNQEAFRVPIPRRRGPTEDPQRKRTIVSGV 299
Query: 215 RYLLGDHAGLLHLLVITHEKE------------------KVTGLKIELLGETSIASTISY 256
+ L AG L+ T + + +V LKI+ IA+++
Sbjct: 300 MHKLKGSAGAFFFLLQTEDGDLFKVTLDMVEDDNGNPTGEVKRLKIKYFDTIPIATSLCI 359
Query: 257 LDNAVVYIGSSYGDSQLIKLN---------------LQPDAKGSY------------VEV 289
L + +++ S +G+ L + D++ +Y + +
Sbjct: 360 LKSGFLFVASEFGNHSLYQFEKLGDDDEELEFSSEEFPTDSRAAYNPVYFQPRPLENLTL 419
Query: 290 LERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKG 349
+E ++ P +D V +L + Q+ + G S R++++G+ ++E + EL G
Sbjct: 420 VESIDSMNPQIDCKVANLTGEDAPQIYSVCGNGARSSFRMLKHGLEVSEIVASELPGTPA 479
Query: 350 -MWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQ 408
+W+ + + D +D ++V+SF + T +L++ + +EE GF + TL
Sbjct: 480 AVWTTKLTKYDEYDAYIVLSFTNATLVLSIG--ETVEEVSDSGFLTTVPTLAVQQLGEEG 537
Query: 409 LVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGI 467
L+QV +R + NEW +P S+ A N +QV++A G +VY E+ DG
Sbjct: 538 LIQVHPKGIRHIVQGRV---NEWPAPQHRSIVAAATNENQVVIALSSGEIVYFEMDADGS 594
Query: 468 LTEV-KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLG 525
L E + ++ ++ L + + E S AVG D +VRI SL P+ L K
Sbjct: 595 LAEYDEKKEMSGTVTSLSLGRVPEGLRRSSFLAVGC-DDCTVRILSLDPESTLEMKSIQA 653
Query: 526 GEIIPRSVLLCAFE----GIS-YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITL 580
P S+ + + E G + YL L G L +L+ TGELTD ++ LG +P L
Sbjct: 654 LTAAPASLSIMSMEDSFGGTTLYLHIGLHSGVYLRTVLDEVTGELTDTRQKFLGPKPTKL 713
Query: 581 RTFSSKNTTHVFAASDRPTVIYS-----------------------SNKKLLYSNVNLKE 617
S + V A + RP + Y+ S+++ L V +
Sbjct: 714 FQVSVQGRPCVLALNSRPWLGYTDPITKGFVMTPLSYVDLEYGWNFSSEQCLEGMVGIHA 773
Query: 618 VSHMCPFNSAAFP-------DSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRIC-HQ 669
FN P DS AIA + +G + IQK SIPL P+R+ H
Sbjct: 774 NFLRYVFNLFLLPLLYDDKRDSYAIAMRVDYDLGD-NMIQK----SIPLTYTPKRLVKHP 828
Query: 670 EQSRTFAICS--------LKNQSCAEESEMHFVRLLDDQTFEF----------------I 705
EQ + I + L+ Q A+ + +L + F + +
Sbjct: 829 EQPYFYTIEADNNTLPPELRAQLLAQSNTNGDAAILPPEEFGYPKARNRWASCISIVDPV 888
Query: 706 STYP-------LDTFEYGCSILSCSF-SDDSNVYYCVGTAY-VLPEENEPTKGRILVF-I 755
S P LD E S SF S D + VGT ++ + ++G I V+
Sbjct: 889 SDEPSVLNRVDLDNNEAAISAAVVSFASQDGESFLIVGTGKDMILSPRQFSEGYIHVYRF 948
Query: 756 VEDGK-LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHG 814
+DG+ L+ I + + + +L F G+LLA I + +++Y L+ R+ Q+E
Sbjct: 949 HDDGRDLEFIHKTKIEEPPMALIPFQGRLLAGIGKTLRIYDLGLK-QLLRKAQAEIAPQ- 1006
Query: 815 HILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG 874
L + +QT+G+ IVVGD+ + I+ +YK E + A D W + ++D + G
Sbjct: 1007 --LIVSLQTQGNRIVVGDVQQGITYAVYKPESNKLLAWADDTINRWTTCTAMVDYESVAG 1064
Query: 875 AENNFNLFTVRKNSEGA--TDEERGRLEVVGEYHLGEFVNRFRHGSLVMR------LPDS 926
+ N++ +R + +DE +++V H +++ + + +M LP S
Sbjct: 1065 GDKFGNVWILRAPERASQESDEPGSEIQLV---HAKSYLHGAPNRTALMAHFYTQDLPTS 1121
Query: 927 ------DVGQIPTVIFGTVNGVIGV-IASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQW 979
VG +++ + G +GV I + E F + L++++R + G +H +
Sbjct: 1122 ITKTNLVVGGQDVLLWSGIQGTVGVLIPFVSREDVDFFQSLESHMRAEDPPLAGRDHLIY 1181
Query: 980 RSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
R + V K +DGDL E F L+ + I+ ++ SV E+ +++ ++
Sbjct: 1182 RGY-----YVPVKGVIDGDLCERFALLANDKKQMIAGELDRSVREIERKISDI 1229
>gi|164656549|ref|XP_001729402.1| hypothetical protein MGL_3437 [Malassezia globosa CBS 7966]
gi|159103293|gb|EDP42188.1| hypothetical protein MGL_3437 [Malassezia globosa CBS 7966]
Length = 1207
Score = 197 bits (501), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 266/1171 (22%), Positives = 479/1171 (40%), Gaps = 180/1171 (15%)
Query: 7 YGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTDN 65
+ I + FR G ++D+L + ++ + VL+++ ++ + + R G R
Sbjct: 59 FSNIRSAASFRLMGGSKDYLILGSDAGRIVVLEYNGSLNQF-EKVHEETFGRSGNRRIVP 117
Query: 66 GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNI---RLEELQVLDIKFLYGC 120
GQ DP R LIG L ++ D L + + R + + G
Sbjct: 118 GQYLATDPKGRAVLIGAVEKSKLIYILNRDTDAHLTISSPLEAHRHSAIVQCIVGVDVGY 177
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDK------------DFVEGPWSQNNLDNGADLLI 168
P L D DA T E + + + V W++ + + A LI
Sbjct: 178 ENPLFASLETDYADADADPTGEAYARTEQMLTYYELDLGLNHVVRRWTEP-VASSAHHLI 236
Query: 169 PVPPPL----------CGVLIIGEETIVYCSANAFK---AIPIRPSITKAYGRVDADGSR 215
PVP GVL+ ++ IVY + IP+R + +G + GS
Sbjct: 237 PVPGGYNHATERWDGPSGVLVCMDDYIVYKHPGQPEHRVPIPVR----EQHGFGQSRGSM 292
Query: 216 --------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAV 261
L+ + G L + + HE E+V L+I +A+T+ L +
Sbjct: 293 IVASVLHKMKTSFFLLVQNEDGDLFKVSVDHEDEQVEALRIRYFDTVPVAATLCILRSGF 352
Query: 262 VYIGSSYGDSQLIKLNL------------------QPDAKGSYVEVLERYV--------- 294
+ + S G QL DA S + L +
Sbjct: 353 LLVASETGAQQLYAFQKLGDDDDERFPEYISTDYGSSDAGPSPLPSLPTFCPRPLDNLAL 412
Query: 295 -----NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-K 348
L P++D V + Q+ G S + +R+G+ ++E S +L G+ +
Sbjct: 413 AYELDALDPLLDAKVSNPLHSDVPQIYAACGRGARSSFKRLRHGLEVSEVVSSDLPGVPE 472
Query: 349 GMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQ 408
+WS + + D +D ++V+SF++ T +L++ + +EE E GF + +TL +
Sbjct: 473 AVWSTKLRSSDQYDGYIVLSFVNGTLVLSIG--ETIEEVEDSGFLTTERTLAVQQLGTDA 530
Query: 409 LVQVTSGSVRLVSSTSRELRNEWKSPP-----GYSVNVATANASQVLLATGGGHLVYLEI 463
L+QV +R + S + NEW +P + AT N QV+LA +VY E+
Sbjct: 531 LLQVHPRGIRHILSNKQV--NEWTTPSCQDGSPTQIVAATTNERQVVLALDSQEIVYFEL 588
Query: 464 G-DGILTEVKHAQ-LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLIT 520
DG L E + + +E +I L I+ E + A+ D +VRI SL PD L
Sbjct: 589 DMDGQLNEFQERRDIEADIVALSISACPEGSQRTPYVAIAC-ADQTVRIVSLDPDSTLAP 647
Query: 521 KEHLGGEIIPRSVLLCAF-------EGISYLLC-ALGDGHLLNFLLNMKTGELTDRKKVS 572
P S+ +C ++ +C L +G + +L+ TG+LTD +
Sbjct: 648 MSLQALTAPPSSISVCEMLDASLDRHHLTMFVCIGLANGVYIRTVLDPSTGQLTDTRTRF 707
Query: 573 LGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDS 632
LG +P+ L T + A S R + Y+ + L ++ + L+ ++H+ F++ PD
Sbjct: 708 LGGRPVRLVRTQVHGDTAMMALSTRSWLAYTLHSHLHFTPLMLEALTHVSTFHTELCPDG 767
Query: 633 LAIAKEGELTIGTIDDI-QKLHIRSIPLGEHPRRIC-HQEQSRTFAI----------CSL 680
L + L I T+ + +L + S+ L PR++ H +Q+ F + +
Sbjct: 768 LLGIEGNALRILTVPRLGSELKMDSLRLTYTPRKLAVHPQQASLFYVIESDHRVVPPTDV 827
Query: 681 KNQSCAEESEMHF-------------VRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDD 727
+Q+ + F +R++D T + + Y L+ E S+ F+
Sbjct: 828 VSQARTAWDPVQFGHVRASAGHWASCLRVVDGTTMQTCAEYALEKDEAALSMALVPFAAC 887
Query: 728 SN-VYYCVGTAYVLPEENEPTKGRILVF----IVEDG-KLQLIAEKETKGAVYSLNAFNG 781
N ++ VG+A L + P R + ++G L L+ + E +L AF+G
Sbjct: 888 GNELFLVVGSA--LGVTHAPLTWRAAFLSTYRLTDNGCGLALVHKTEVDHVPLALRAFHG 945
Query: 782 KLLAAINQKIQLYKWMLRDDGTRELQSECGHH---GHILALYVQTRGDFIVVGDLMKSIS 838
+LLA ++++ D GT++L +C +++L VQ G ++VGD+ +S+
Sbjct: 946 RLLAGTGPYVRIF-----DMGTKKLLRKCQSRPFPSKVVSLQVQ--GYRVIVGDMQESVH 998
Query: 839 LLIYKHEEGAIEERARDYNANW-MSAVEILDDDIYLGAENNFNLFTVRKNSEG--ATDE- 894
+YK + A D W SA+ +LD D + + N+F +R +S + DE
Sbjct: 999 YSVYKPATNTLVAFADDIMPRWTTSALLMLDYDTVMAGDKFGNVFVLRIDSSASLSADED 1058
Query: 895 --------ERG-------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTV 939
ER R +++ YH+G+ + SLV G P V++ V
Sbjct: 1059 PTGLMLQNERSYLMGAAHRAQLLAHYHVGDIITSLSMESLV-------PGGRPVVLYTCV 1111
Query: 940 NGVIGVIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGD 998
NG IG + + E L+ ++R+ + G +H +R K V +D D
Sbjct: 1112 NGTIGALVPFISREDVRLFTTLEMHMRQENLSLTGRDHLAYRGHYTPVKAV-----VDAD 1166
Query: 999 LIESFLDLSRTRMDEISKTMNVSVEELCKRV 1029
L E + L + + I+ ++ + ++ K++
Sbjct: 1167 LCELYTALPHEKQESIADELDRTPADIAKKL 1197
>gi|331221690|ref|XP_003323519.1| pre-mRNA-splicing factor RSE1 [Puccinia graminis f. sp. tritici CRL
75-36-700-3]
gi|309302509|gb|EFP79100.1| pre-mRNA-splicing factor RSE1 [Puccinia graminis f. sp. tritici CRL
75-36-700-3]
Length = 1213
Score = 197 bits (501), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 270/1181 (22%), Positives = 479/1181 (40%), Gaps = 186/1181 (15%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
++G I L FR G +D++ +A++ + VL++D S+ I R
Sbjct: 59 VFGTIRALSSFRLTGGTKDYIILASDSGRIVVLEFDPSSNTFIKLHQETYGKSGARRVVP 118
Query: 66 GQIGIIDPDCRLIGLHLYDG--LFKVIPFDNKGQLK-----EAF--NIRLEELQVLDIKF 116
GQ DP R + + + L ++ D+ L EA N + + +D+ F
Sbjct: 119 GQYLATDPKGRAVMIAAIEKSKLVYILNRDSAANLTISSPLEAHKSNAIIHHIVGVDVGF 178
Query: 117 LYGCAKPTIVVLYQDNKDARHVKTYEV-ALKDK-----------DFVEGPWSQNNLDNGA 164
P L D +A T E L +K + V WS+ + A
Sbjct: 179 ----ENPLFAALEVDYGEADQDPTGEAFNLAEKMLTYYELDLGLNHVVRKWSEPT-EPRA 233
Query: 165 DLLIPVP-------------PPLCGVLIIGEETIVYCSANAFK---AIPIRPS------- 201
++L+ VP P GV++ E+ ++Y NA + IP R S
Sbjct: 234 NMLLQVPGGQSTSHPDKFDGP--SGVIVCCEDLLIYKHQNAKEHRVPIPKRDSPFSDPAQ 291
Query: 202 ---ITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLD 258
I A +L+ G L + I HE E+V LKI+ +AS +S L
Sbjct: 292 GVIIVAAVMHKMRGAFFFLIQSEEGDLFKVTIDHEDEEVQALKIKYFDTVPVASNLSILK 351
Query: 259 NAVVYIGSSYGDSQLIKLNLQPD-------AKGSYVEV----------------LERYV- 294
+ +++ + G+ L + D + Y E LE V
Sbjct: 352 SGFLFVAAETGNHALYQFEKLGDDDEETEFSSADYPEFGASGAALPGAFFKPRGLENLVL 411
Query: 295 -----NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIK- 348
+L PI L Q V G SLR++R+G+ ++E + EL G
Sbjct: 412 SDELESLAPITGAKTGHLLNSDSAQTVASCGTGSRSSLRMLRHGLEVSEIVTSELPGPPT 471
Query: 349 GMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQ 408
+W+ R + D FD ++++ F++ T +LA+ + + E G + + T+ N
Sbjct: 472 NVWTTRLAETDEFDRYIILGFLNATLVLAIG--ETIVEVADTGLLTNSPTIAIQQLDSNG 529
Query: 409 LVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGI 467
L+Q+ +R + EWK PPG + V+T+N QV++ GG L+Y E+ DG
Sbjct: 530 LLQIHPTGIRHIHLDGA--ITEWKVPPGRKIVVSTSNRRQVVIGLSGGELIYFELDLDGQ 587
Query: 468 LTEVK-HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLG 525
L E + ++ I+ L ++ + + + A+G+ +++V+I SL P+ L T
Sbjct: 588 LNEYQEQKEMGATITSLSLSEVSKGRQRTPFLAIGL-ENLTVQIISLDPNSVLETISLQA 646
Query: 526 GEIIPRSVLLCAFEGIS--------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQP 577
+P S+ + S ++ L +G LL +L+ G+LTD + LG++P
Sbjct: 647 LTAVPTSICIAELLDSSIDKNSETLFVNIGLANGVLLRTVLDSVNGQLTDTRTRFLGSRP 706
Query: 578 ITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAK 637
+ L N T V S R + Y+ L + + + ++ F++ P+ L
Sbjct: 707 VKLLRVKVDNKTSVIGLSSRTWLNYTFQNLLHFDPLIYDAIDNVHSFSAELCPEGLIGIV 766
Query: 638 EGELTIGTIDDIQ-KLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSC-----AEESEM 691
L I TI + K+ ++PL PR++ S+ ++++ AE +
Sbjct: 767 GSSLRIFTIPKLGVKVKQDAMPLAYTPRKMLLDPSSKHVITIESEHRTMAPGVKAERLAL 826
Query: 692 HFVR-------LLDDQTF-----------------EFISTYPLDTFEYG-------CSIL 720
H + LLD F + I L + G +I+
Sbjct: 827 HKAQGLQVDESLLDQDAFGVPRAEAGKWASCIRISDPIEKTTLVREDLGDNEAATSLAIV 886
Query: 721 SCSFS---DDSNVYYCVGT---AYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVY 774
S +F+ + VG+ A+V P + R+ F+ + ++L+ + E
Sbjct: 887 SFAFAAHHPATEPLLVVGSAKDAFVQPRTCKNGFLRVYRFVNDGKAIELVHKTEVDEMPS 946
Query: 775 SLNAFNGKLLAAINQKIQLY----KWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVV 830
+L F G+L A + + +++Y K +LR + S I++L VQ G I+V
Sbjct: 947 ALVGFQGRLAAGVGKALRIYDLGKKKLLRKVENKSFGSA------IISLSVQ--GSRILV 998
Query: 831 GDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR--KNS 888
GD S+S +YK E + A D W + ++D D G + NL+ R KN
Sbjct: 999 GDSQDSVSYAVYKPAENRLIVFADDVVPRWTTCATMVDYDTVAGGDRFGNLWVSRLPKNV 1058
Query: 889 EGATDE---------ERG-------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIP 932
DE E+G +L+ + +HL + + SLV G
Sbjct: 1059 SDEVDEDPTGAGIMHEKGYLMGAPHKLKNLVHFHLNDIPTSIQKTSLVP-------GGRE 1111
Query: 933 TVIFGTVNGVIGVIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDA 991
+++ V G IG++ + E F + L+ ++R + + G +H +R +
Sbjct: 1112 VLLYTGVQGSIGILVPFISKEDVDFFQTLEMHMRNEMPSLVGRDHLAYRGY-----YFPV 1166
Query: 992 KNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
KN +DGDL ESF L + +++ ++ SV ++ K++E +
Sbjct: 1167 KNCVDGDLCESFALLPSAKQLQVASELDRSVSDVLKKIEAV 1207
>gi|325189950|emb|CCA24429.1| splicing factor putative [Albugo laibachii Nc14]
Length = 1644
Score = 197 bits (501), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 263/1178 (22%), Positives = 462/1178 (39%), Gaps = 175/1178 (14%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
++ ++G I ++ FR G D+L + ++ K V+Q D S+ T + + G
Sbjct: 489 IITTSVFGVIRSIIPFRLTGGTHDYLVVGSDSGKIVVIQVDP-STNTFTACQNETYGKTG 547
Query: 61 -RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLK-----EAFNIRLEELQVL 112
R GQ DP R +IG L V+ D +L EA V+
Sbjct: 548 CRRITPGQYLASDPKGRALMIGAVEKQKLVYVMNRDASNRLTISSPLEAHRSHTVHFDVV 607
Query: 113 DIKFLYGCAKPTIVVLYQDNKDARHVKTYEVALKDK------------DFVEGPWSQNNL 160
+ G P L ++ +A T + A + + V WSQ +
Sbjct: 608 GVDV--GFENPIFASLELNHAEADIDPTGQAAHTSEKVLVYYELDLGLNHVTRRWSQV-V 664
Query: 161 DNGADLLIPVP-----PPLCGVLIIGEETIVYCSANAFK---AIPIRPS----------- 201
A+ L+ VP P GVL+I + I Y + N + P+R +
Sbjct: 665 PRSANKLVAVPGGNDGP--GGVLVIAQGLIQYQNENHPPLSCSFPLRSTGGPNPVQDERK 722
Query: 202 ------ITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTIS 255
I D L+ G L + + + V L+I+ +A +
Sbjct: 723 QGYPMMIVSTATHKQRDLFFVLMQSEWGDLFKISLEYAGSSVQKLRIQYFDTIPVALALC 782
Query: 256 YLDNAVVYIGSSYGDSQLIKL------------------NLQPDAKGSYVEVLERYV--- 294
+++ S + + L + + +P+ V L+
Sbjct: 783 ITKTGLLFAASEFSNHYLFQFLSIGEDDDAAQCVSAAENDQEPEIPSFSVRKLKNLAMIS 842
Query: 295 ---NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGM 350
++ PI V D + Q+ G SLRI+R+G+ + E A+ L G+ K +
Sbjct: 843 NIPSISPITQLLVDDFANEQTPQLYALCGQGNRSSLRILRHGLPVMEMAASALPGVAKAV 902
Query: 351 WSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLV 410
W L+ S D D ++VVSF E L + + D +EE G +L N ++
Sbjct: 903 WCLKESFTDTCDKYIVVSF--EDATLVLEIGDTVEEITDSGLLRDHSSLLVALLEDNSML 960
Query: 411 QVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTE 470
QV R + + + EWK+P + ANA Q++++ GG ++Y E+G L E
Sbjct: 961 QVHQNGFRHIRKS--QPVTEWKAPGKKIIERCAANARQLVVSLAGGEVIYFELGVNELME 1018
Query: 471 VKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEII 529
L +EI+ LD+ + + AVG W D SVR+ SL P+ K L
Sbjct: 1019 KGKLDLGFEITSLDVGQVPQGRQRFPFLAVGSW-DNSVRVLSLDPNDLFRQKSTLALTSH 1077
Query: 530 PRSVLLCAFEGIS-----------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPI 578
P ++ + S +L L +G ++ T L D LGT+P+
Sbjct: 1078 PHTLCFSNLQTESAAGSDHTTQCLFLNIGLHNGVFQQTRMDSITAALADSSARFLGTKPV 1137
Query: 579 TLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKE 638
L + + + A S R + Y + ++ + + + F S P+ LA
Sbjct: 1138 KLFCVTIQGKRAILALSSRAWLSYFYQTRRQLMPLSTEMLHYASSFQSEQCPEGLAALTS 1197
Query: 639 GELTIGTIDDI------QKLHIRSIP--LGEHP--RRICHQEQ----------------- 671
+ I T+D + QK +R P + HP RR+ E
Sbjct: 1198 EGMKILTLDHLGDTFNQQKCFLRYTPRRMVLHPPSRRLIIVESDYGEANAASKQLFGESL 1257
Query: 672 SRTFAICSLKNQSCAEESEMHF---------------VRLLDDQTFEFISTYPLDTFEYG 716
S T I S + + + + + VR+ D T E I+ + L+ E+
Sbjct: 1258 SDTPVIASEEPEEDEDRQALLYPRVERSHEPGKWASCVRIFDPITCETIACHELEDNEHT 1317
Query: 717 CSILSCSFSD-DSNVYYCVGTA--YVLPEENEPTKGRILVF-IVEDGKLQLIAEKETKGA 772
SI +C F D V+ +G+ L + P G + V+ +VE +L L+ G
Sbjct: 1318 RSITTCVFHDRGGEVFVIIGSVKNLRLHPISSPEGGLLRVYRVVEGSQLVLVHTTPVDGI 1377
Query: 773 VYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY-VQTRGDFIVVG 831
Y++ F G+LL ++ + +++Y D G R+L +C + + +++ GD I
Sbjct: 1378 PYAMIEFQGRLLVSVGKVLRIY-----DLGKRKLLRKCENRYFTSPMIDLKSAGDRIYAS 1432
Query: 832 DLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSE-- 889
D+ +SI + YK E+ + A D ++M++ +LD D G + N+F R +E
Sbjct: 1433 DVHESIHFVKYKAEDNQLITFADDCVPHFMTSSTLLDYDTIAGGDKFGNVFVTRLPAEVS 1492
Query: 890 ---------------GATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTV 934
G + +LE + ++H+GE + SLV G + +
Sbjct: 1493 DEIDNPTGNRMLWDTGLLNGAPHKLEQIAQFHVGEVITSVLRTSLV-------PGGMEVI 1545
Query: 935 IFGTVNGVIGVIASLPHEQYL-FLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKN 993
++ T+ G IG + + F L+ +R+ + G +H +RS+ + KN
Sbjct: 1546 LYTTILGRIGALVPFTSRDDVDFYTHLEMYMRQEKAPLCGRDHLSYRSY-----FIPVKN 1600
Query: 994 FLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEE 1031
DGDL E F L + I++ ++ + E+ K++++
Sbjct: 1601 VTDGDLCEQFSSLGPDKQKNIAEDLDRTPTEVVKKLQD 1638
>gi|331238007|ref|XP_003331659.1| pre-mRNA-splicing factor RSE1 [Puccinia graminis f. sp. tritici CRL
75-36-700-3]
gi|309310649|gb|EFP87240.1| pre-mRNA-splicing factor RSE1 [Puccinia graminis f. sp. tritici CRL
75-36-700-3]
Length = 1213
Score = 197 bits (500), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 274/1183 (23%), Positives = 480/1183 (40%), Gaps = 190/1183 (16%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
++G I L FR G +D++ +A++ + VL++D S+ I R
Sbjct: 59 VFGTIRALSSFRLTGGTKDYIILASDSGRIVVLEFDPSSNTFIKLHQETYGKSGARRVVP 118
Query: 66 GQIGIIDPDCRLIGLHLYDG--LFKVIPFDNKGQLK-----EAF--NIRLEELQVLDIKF 116
GQ DP R + + + L ++ D+ L EA N + + +D+ F
Sbjct: 119 GQYLATDPKGRAVMIAAIEKSKLVYILNRDSAANLTISSPLEAHKSNAIIHHIVGVDVGF 178
Query: 117 LYGCAKPTIVVLYQDNKDARHVKT-------------YEVALKDKDFVEGPWSQNNLDNG 163
P L D +A T YE+ L V WS+ +
Sbjct: 179 ----ENPLFAALEVDYGEADQDPTGEAFNLAEKMLTYYELDLGLNHVVR-KWSEPT-EPR 232
Query: 164 ADLLIPVP-------------PPLCGVLIIGEETIVYCSANAFK---AIPIRPS------ 201
A++L+ VP P GV++ E+ ++Y NA + IP R S
Sbjct: 233 ANMLLQVPGGQSTSHPDKFDGP--SGVIVCCEDLLIYKHQNAKEHRVPIPKRDSPFSDPA 290
Query: 202 ----ITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYL 257
I A +L+ G L + I HE E+V LKI+ +AS +S L
Sbjct: 291 QGVIIVAAVMHKMRGAFFFLIQSEEGDLFKVTIDHEDEEVQALKIKYFDTVPVASNLSIL 350
Query: 258 DNAVVYIGSSYGDSQLIKLNLQPD-------AKGSYVEV----------------LERYV 294
+ +++ + G+ L + D + Y E LE V
Sbjct: 351 KSGFLFVAAETGNHALYQFEKLGDDDEETEFSSADYPEFGASGAALPGAFFKPRGLENLV 410
Query: 295 ------NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIK 348
+L PI L Q V G SLR++R+G+ ++E + EL G
Sbjct: 411 LSDELESLAPITGAKTGHLLNSDSAQTVASCGTGSRSSLRMLRHGLEVSEIVTSELPGPP 470
Query: 349 -GMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYN 407
+W+ R + D FD ++++ F++ T +LA+ + + E G + + T+ N
Sbjct: 471 TNVWTTRLAETDEFDRYIILGFLNATLVLAIG--ETIVEVADTGLLTNSPTIAIQQLDSN 528
Query: 408 QLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DG 466
L+Q+ +R + EWK PPG + V+T+N QV++ GG L+Y E+ DG
Sbjct: 529 GLLQIHPTGIRHIHLDGA--ITEWKVPPGRKIVVSTSNRRQVVIGLSGGELIYFELDLDG 586
Query: 467 ILTEVK-HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHL 524
L E + ++ I+ L ++ + + + A+G+ +++V+I SL P+ L T
Sbjct: 587 QLNEYQEQKEMGATITSLSLSEVSKGRQRTPFLAIGL-ENLTVQIISLDPNSVLETISLQ 645
Query: 525 GGEIIPRSVLLCAFEGIS--------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQ 576
+P S+ + S ++ L +G LL +L+ G+LTD + LG++
Sbjct: 646 ALTAVPTSICIAELLDSSIDKNSETLFVNIGLANGVLLRTVLDSVNGQLTDTRTRFLGSR 705
Query: 577 PITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIA 636
P+ L N T V S R + Y+ L + + + ++ F++ P+ L
Sbjct: 706 PVKLLRVKVDNKTSVIGLSSRTWLNYTFQNLLHFDPLIYDAIDNVHSFSAELCPEGLIGI 765
Query: 637 KEGELTIGTIDDIQ-KLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSC-----AEESE 690
L I TI + K+ ++PL PR++ S+ ++++ AE
Sbjct: 766 VGSSLRIFTIPKLGVKVKQDAMPLAYTPRKMLLDPSSKHVITIESEHRTMAPGVKAERLA 825
Query: 691 MHFVR-------LLDDQTF-----------------EFISTYPLDTFEYG-------CSI 719
+H + LLD F + I L + G +I
Sbjct: 826 LHKAQGLQVDESLLDQDAFGVPRAEAGKWASCIRISDPIEKTTLVREDLGDNEAATSLAI 885
Query: 720 LSCSFS---DDSNVYYCVGT---AYVLPEENEPTKGRILVFIVEDGK-LQLIAEKETKGA 772
+S +F+ + VG+ A+V P + R+ F V DGK ++L+ + E
Sbjct: 886 VSFAFAAHHPATEPLLVVGSAKDAFVQPRTCKNGFLRVYRF-VNDGKVIELVHKTEVDEM 944
Query: 773 VYSLNAFNGKLLAAINQKIQLY----KWMLRDDGTRELQSECGHHGHILALYVQTRGDFI 828
+L F G+L A + + +++Y K +LR + S I++L VQ G I
Sbjct: 945 PSALVGFQGRLAAGVGKALRIYDLGKKKLLRKVENKSFGSA------IISLSVQ--GSRI 996
Query: 829 VVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR--K 886
+VGD S+S +YK E + A D W + ++D D G + NL+ R K
Sbjct: 997 LVGDSQDSVSYAVYKPAENRLIVFADDVVPRWTTCATMVDYDTVAGGDRFGNLWVSRLPK 1056
Query: 887 NSEGATDE---------ERG-------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQ 930
N DE E+G +L+ + +HL + + SLV G
Sbjct: 1057 NVSDEVDEDPTGAGIMHEKGYLMGAPHKLKNLVHFHLNDIPTSIQKTSLVP-------GG 1109
Query: 931 IPTVIFGTVNGVIGVIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTV 989
+++ V G IG++ + E F + L+ ++R + + G +H +R +
Sbjct: 1110 REVLLYTGVQGSIGILVPFISKEDVDFFQTLEMHMRNEMPSLVGRDHLAYRGY-----YF 1164
Query: 990 DAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
KN +DGDL ESF L + +++ ++ SV ++ K++E +
Sbjct: 1165 PVKNCVDGDLCESFALLPSAKQLQVASELDRSVSDVLKKIEAV 1207
>gi|393245024|gb|EJD52535.1| hypothetical protein AURDEDRAFT_111199 [Auricularia delicata
TFB-10046 SS5]
Length = 1214
Score = 197 bits (500), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 273/1171 (23%), Positives = 476/1171 (40%), Gaps = 169/1171 (14%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSE---LITRAMGDVSDRIGRP 62
++G + +L FR G +D+ + ++ + +L +D +++E L T G R
Sbjct: 58 VFGAVRSLVAFRLTGGTKDYAIVGSDSGRIVILDYDPKANEFKRLYTETYGKSG---ARR 114
Query: 63 TDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL----- 117
GQ +DP R + + + V + + LE + I F
Sbjct: 115 IVAGQYLAVDPKGRAVMIAAMEKSKLVYILNRDAATNLTISSPLEAHRNQSILFHITGVD 174
Query: 118 YGCAKPTIVVLYQDNKDARHVKT-------------YEVALKDKDFVEGPWSQNNLDNGA 164
G P L D ++ T YE+ L V WS+ D A
Sbjct: 175 VGFENPLFAALEIDYGESDQDPTGEAFHNAEKMLTFYELDLGLNHVVRK-WSEPT-DPRA 232
Query: 165 DLLIPVPPPLC-----------GVLIIGEETIVYCSANAFK---AIPIRPS--------- 201
+ L+ VP GVL+ E I+Y + + IP R +
Sbjct: 233 NFLVQVPGGTSASSPDKFDGPSGVLVCCENHIIYRHMDHPQHRVPIPRRRAPLDSEERGL 292
Query: 202 -ITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNA 260
I A +LL G L+ + I H+ ++VT LKI+ ++S++ L +
Sbjct: 293 IIVAAVMHKMKGAFFFLLQSEEGDLYKVTIDHQDDEVTSLKIKYFDTVPVSSSLCILKSG 352
Query: 261 VVYIGSSYGDSQLIKLNLQPD-------AKGSYVE------------------VLERYV- 294
+++ S +G L + D + SY + +LE +
Sbjct: 353 FLFVASDFGPHHLYQFQKLGDDDAETEFSSSSYPKNGMADPDEALPSAYFQPRLLENLLL 412
Query: 295 -----NLGPIVDFCVVDLER--QGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVEL-QG 346
++ PI+D V++L Q G S R++R+G+ + E S +L G
Sbjct: 413 CDSLDSINPIIDAKVLNLMPTISDTPQFYVACGRGARSSFRLLRHGLEVEENISSDLPSG 472
Query: 347 I-KGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAI 405
I +W+ + DDPFDT++V+SF++ T L + + + +EE + GF S TL
Sbjct: 473 IPNAVWTTKIRADDPFDTYIVLSFVNGT--LVLTIGETIEECQETGFLSAEPTLGVQQIG 530
Query: 406 YNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG- 464
+ L+QV +R + + R NEWK P G ++ A N+ QV++A LVY E+
Sbjct: 531 DDALLQVYPHGIRHILADKRV--NEWKVPSGKTIVQAATNSRQVVVALNSAELVYFELDL 588
Query: 465 DGILTEVKHAQ-LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEH 523
DG L E + + + + L + + E + AVG D +VRI SL N +
Sbjct: 589 DGQLNEYQDRKAMGSVVLALSMAEVPEGCQRTPYLAVGC-EDQTVRIISLDPDNTLETIS 647
Query: 524 LGGEIIPRSVLL------CAFEGIS---YLLCALGDGHLLNFLLNMKTGELTDRKKVSLG 574
L P S + A IS ++ L +G L +L+ TGELTD + LG
Sbjct: 648 LQALTAPPSAICIVAMIDAAVNKISETLFVNIGLTNGLFLRTVLDPVTGELTDTRTRFLG 707
Query: 575 TQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLA 634
T+PI L + + A S R + Y+ + L + + + ++ F + PD
Sbjct: 708 TRPIRLNRVTIAGKPALLALSSRMWLNYAHQEALHLTPLIFDPLDYVSGFTADLCPDGFI 767
Query: 635 IAKEGELTIGTIDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMH- 692
K + I I I Q+L +PL PR+I + F I +++ ++E+ +H
Sbjct: 768 GLKGSTIKIFQITKIGQRLKHEVMPLSYTPRKIAVHPMNGLFYIAESDHRTHSQEA-VHG 826
Query: 693 ----------------------------------FVRLLDDQTFEFISTYPLDTFEYGCS 718
VR+++ I LD E S
Sbjct: 827 ALQQPDLKKYDRDVLELPPEVFGHPRAPAGIWASCVRIVNPVELTTIYRLDLDNNEAAFS 886
Query: 719 ILSCSFS-DDSNVYYCVGT---AYVLPEENEPTKGRILVF-IVEDGK-LQLIAEKETKGA 772
I FS D+ + VGT A++ P T G + + +++DG+ L+L+ + E
Sbjct: 887 IALVPFSARDNELTLVVGTAKDAFLAPRSC--TSGYLRTYRVLDDGRGLELLHKTECDDI 944
Query: 773 VYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY-VQTRGDFIVVG 831
SL AF GKL+A I + +++Y D G ++L + A+ + T+G I+ G
Sbjct: 945 PLSLLAFQGKLVAGIGKCLRIY-----DMGKKKLLRKAESKLFTTAITSLSTQGARIIAG 999
Query: 832 DLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKN---S 888
D +SI +YK E + A D W ++ +LD + + + N+F R + S
Sbjct: 1000 DAQQSIFFCVYKAAENRLLVFADDSQQRWTTSQLMLDYNTVVAGDKFGNVFVNRLSDHVS 1059
Query: 889 EGATDEERG------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV--GQIPTVIFGTVN 940
D+ G + +G H + F G ++M L + + G +++ ++
Sbjct: 1060 NLVDDDPTGAGLLHEKGMFMGAPHKTSMLCHFHVGDIIMSLQRTSLVPGGREVIVYFGLH 1119
Query: 941 GVIGVIASLPHEQYL-FLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDL 999
G IG++ ++ + F L+ ++R + + WR + V K +DGDL
Sbjct: 1120 GTIGMLVPFASKEDVDFFTTLEQHMRSENLSLVSRDILAWRGY-----YVPVKAVVDGDL 1174
Query: 1000 IESFLDLSRTRMDEISKTMNVSVEELCKRVE 1030
E F L + + ++ ++ +V E+ K++E
Sbjct: 1175 CEYFAKLPQGKQGGLAGELDRTVGEVLKKLE 1205
>gi|429329284|gb|AFZ81043.1| CPSF A subunit region domain-containing protein [Babesia equi]
Length = 1175
Score = 197 bits (500), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 207/902 (22%), Positives = 375/902 (41%), Gaps = 119/902 (13%)
Query: 217 LLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKL 276
L+ G ++ + +T+E+ V + ++ ++ L + +++ S +G+ L +
Sbjct: 302 LIQSEFGDVYKVELTYEEGVVKEIVCRYFDTVPVSVSLCILRSGYLFVASEFGNHHLYQF 361
Query: 277 -------------NLQPDAKGSYV----------EVLERYVNLGPIVDFCVVDLERQGQG 313
+L P + + + ++++ +L I D V D++ GQ
Sbjct: 362 TGLGTDERDPLCTSLHPHGRSAIIAFKPRALQNLQLVDELSSLSAITDMKVADIQGLGQH 421
Query: 314 QVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTDDPFDTFLVVSFISE 372
Q+ G SLR++R GI I AS EL G K +W++RSS + +D F++V F E
Sbjct: 422 QIFLGCGRGSRSSLRVLRYGIAIEGLASSELPGRPKSVWTVRSSFESAYDGFIIVGF--E 479
Query: 373 TRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWK 432
L +++ + +EE F + TL +QV +R V + EW+
Sbjct: 480 GNTLVLSVGEAVEEVTDSCFLTSITTLHVALMGDGSFIQVHDAGIRHVYD---QRVKEWR 536
Query: 433 SPPGYSVNVATANASQVLLATGGGHLVYLEIGD-GILTEVKHAQLEYEISCLDINPIGEN 491
+P V VA +N QV+L GG ++Y EI D G L E L EISCLD+ P +
Sbjct: 537 APSSKRVKVAASNDRQVILGLSGGDVIYFEIDDSGNLVEYAKKSLSVEISCLDLQPTPKG 596
Query: 492 PSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPRSVLLCAF----EGISYLLC 546
+ A+G D SVR+ +L L +++ + L P SV + F + YL
Sbjct: 597 RILANFMAIGTL-DNSVRVLTLDKSLKVVSTQILSNNSTPESVCISEFAVGDSSLVYLHV 655
Query: 547 ALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNK 606
L G +L ++ +G L+D++ LG + + R S ++ + A SD+P +IY+
Sbjct: 656 GLNTGVMLRSTVDPISGALSDQESRFLGGRAVKFRRVSLGSSFAIVALSDKPWLIYTHRG 715
Query: 607 KLLYSNVNLKE-------VSHMCPFNSAAF---------------------------PDS 632
LL S +N+ +S +CP A P
Sbjct: 716 ILLVSPINVGTLESADSLISPICPDGFVAVSGNTLRIFRCTSLGETFAESQLPLTYTPRK 775
Query: 633 LAIAKEGELTIGTIDDI--------------QKLHIRSIPLG-EHPRRICHQEQSRTFAI 677
L + ++G+++ + Q L I+++ G E P C F
Sbjct: 776 LVLMPSEAPSVGSLNYMLAIVESDHARYNEDQSLEIKNVHAGIELPSDYCESLDYTDFKA 835
Query: 678 CSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTA 737
K SC +R+++ T E ++ +T E ++ + D VGTA
Sbjct: 836 EPGKWGSC--------LRIVNPLTLETVAKLLFETDE--AAMCAQVVVLDGIQCLVVGTA 885
Query: 738 YVLPEENEP--TKGRILVFIV-EDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLY 794
+ + +P G + V+ + +++L+ G +L + GKL+ A+ +++LY
Sbjct: 886 IGMNLKGDPDSVSGYLRVYAYGANYEIRLLHATPITGVPRALAGYEGKLICALGSRLRLY 945
Query: 795 KWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLL---IYKHEEGAIEE 851
R + C HG I ++ G I GD+ + LL Y + E
Sbjct: 946 ALGKRQLLLKAEHRTCTDHGFI---WISVCGSRIFAGDIREGFQLLRLRFYAEDAAEFEW 1002
Query: 852 RARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFV 911
W+S E LD +G + +LF R E T + + E ++HLG+
Sbjct: 1003 IGHSTGPRWLSCCEQLDYHTVIGGDKFDSLFIARVPQEEFT--KATQFENHAQFHLGDL- 1059
Query: 912 NRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYL-FLEKLQTNLRKVIKG 970
+ + ++ +++ Q P VI+ T+ G IG ++ L ++ L+ +
Sbjct: 1060 -----PTAISKVSFNNMSQ-PIVIYSTILGSIGAFIPYANKDELDLMQHLEMIMANEHPP 1113
Query: 971 VGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVE 1030
+ G H +RS+ +N +DGDL E L +I+ ++ +V L ++V+
Sbjct: 1114 LCGREHAFFRSY-----YYPVQNIVDGDLCEQVKTLPEAVQRKIATQLDTNVHTLLRKVD 1168
Query: 1031 EL 1032
++
Sbjct: 1169 DV 1170
>gi|378730762|gb|EHY57221.1| pre-mRNA-splicing factor rse1 [Exophiala dermatitidis NIH/UT8656]
Length = 1210
Score = 196 bits (499), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 267/1175 (22%), Positives = 484/1175 (41%), Gaps = 184/1175 (15%)
Query: 7 YGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTDN 65
+G I +L FR G ++D+L + ++ + +L++ + + R + + G R
Sbjct: 65 FGIIRSLAAFRLAGSSKDYLIVGSDSGRITILEYVKDQNRF-NRVHLETFGKSGIRRVIP 123
Query: 66 GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLD---IKFLYGC 120
GQ +DP R +I + L V+ + + +L + + + Q L + G
Sbjct: 124 GQYLAVDPKGRACIIASVEKNKLVYVLNRNAQAELTISSPLEAHKPQTLVFDCVALDVGY 183
Query: 121 AKPTIVVLYQDNKDARHVKT-------------YEVALKDKDFVEGPWSQNNLDNGADLL 167
P L D + T YE+ L V W++ + A L
Sbjct: 184 ENPIFAALEVDYTEVDQDPTGQALQEVEKLLVYYELDLGLNHVVRQ-WAEP-VARDATKL 241
Query: 168 IPVP-----PPLCGVLIIGEETIVYCSAN--AFKA-IPIRPSITKAYGRVDADGSRYLLG 219
+P P GVL+ E+ I Y +N AF+ IP R + + + + R+++
Sbjct: 242 FQIPGGSDGP--SGVLVCAEDNITYRHSNQEAFRVPIPRRRGVLE-----NPERKRHIV- 293
Query: 220 DHAGLLH-------LLVITHEKE------------------KVTGLKIELLGETSIASTI 254
AG++H LL+ T + + +V LKI+ +A+++
Sbjct: 294 --AGVMHKMKGQFFLLLQTEDGDLFKVTIDMVEDDNGQLTGEVRALKIKYFDTVPVANSL 351
Query: 255 SYLDNAVVYIGSSYGDSQLIKL---------------NLQPDAKGSY------------V 287
L + +++ S G+ + + D +Y +
Sbjct: 352 HILKSGFLFVASESGNHHFYQFEKLGDDDEETEFTSDDYPADPTAAYTPAFFHVRPLSNL 411
Query: 288 EVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVEL-QG 346
+++ P++D V +L + Q+ + GA S + +++G+ ++E EL
Sbjct: 412 NLVQSVDAANPLLDCKVANLLDEDAPQIYSICGAGARSSFKTLKHGLSVSEIVESELPDK 471
Query: 347 IKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIY 406
+ +W+ + + DD +D ++++SF T L +++ + +EE GF S TL
Sbjct: 472 PEAVWTTKLTRDDQYDAYIILSF--RTGTLVLSIGETVEEVTDTGFLSTAPTLAVQQLGE 529
Query: 407 NQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-D 465
+ LVQV +R + + R NEW +P S+ AT N QV +A G +VY E+ D
Sbjct: 530 DALVQVHPKGIRHIRADKR--VNEWPAPQHRSIVAATTNERQVAVALSSGEIVYFEMDTD 587
Query: 466 GILTEV-KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEH 523
G L E + ++ ++CL + + E S AVG D +VRI SL PD L K
Sbjct: 588 GSLAEYDERREMTGTVTCLSLGDVPEGRVRSSFLAVGC-DDSTVRILSLDPDSTLENKSV 646
Query: 524 LGGEIIPRSVLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQP 577
P ++ + A + YL L G L +L+ TGEL+D + LG +P
Sbjct: 647 QALTSAPSALSIMAMADSTSGGTTMYLHIGLYSGVYLRTVLDEITGELSDTRTRFLGLRP 706
Query: 578 ITLRTFSSKNTTHVFAASDRPTVIYSSNKK--LLYSNVNLKEVSHMCPFNSAAFPDSLAI 635
L S K V A S RP + Y+ + + + ++ + ++ F S P+ +
Sbjct: 707 AKLFRVSVKGQNAVMALSSRPWLGYTDTQTNGFMLTPLDYVPLQYVWNFTSEQCPEGMVG 766
Query: 636 AKEGELTIGTIDDIQK-LHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFV 694
+ L I +I+D+ + L +IPL PR+ F + N A + +
Sbjct: 767 IQGQNLRIFSIEDLSRNLLQENIPLPYTPRKFVKHPDQPLFYVIEADNNVLAPATRQKLL 826
Query: 695 R----------LLDDQTFEF----------------------ISTYPLDTFEYGCSILSC 722
+L + F + + T L+ E SI +
Sbjct: 827 NESTAVNGDAVILPPEEFGYPKATGHWASCIQVVDPVNTKSVVFTLELEDNECATSITTA 886
Query: 723 SF-SDDSNVYYCVGTAY-VLPEENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVYSLNA 778
F S D V+ VGTA ++ + G + V+ EDGK L+ I + + + +L A
Sbjct: 887 PFASQDDEVFLIVGTAKDLVVSPRSFSAGFLHVYRFHEDGKELEFIHKTKVEQPPTALLA 946
Query: 779 FNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSIS 838
F G+LLA + +++Y D G +++ +C L + +QT+G I+V D+ +S++
Sbjct: 947 FQGRLLAGVGPDLRIY-----DLGMKQMLRKCQVTTPNLIVGLQTQGSRIIVSDIQESVT 1001
Query: 839 LLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDE- 894
+YK +E + D A W + ++D + G + NL+ +R K SE A ++
Sbjct: 1002 YCVYKFQENKLIPFCDDVIARWTTCCTMVDYETVAGGDKFGNLWMLRCPQKVSEEADEDN 1061
Query: 895 -------ERG-------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVN 940
ERG RL ++ Y G+ + +LV G V +
Sbjct: 1062 SGVHLLHERGYLNGTPNRLSLMIHYFPGDIPTSIQKTNLV-------AGGRDVVFWTGFQ 1114
Query: 941 GVIGVIAS-LPHEQYLFLEKLQTNLRKV--IKGVGGLNHEQWRSFNNEKKTVDAKNFLDG 997
G +G++ + E F + L+ L + G +H +RS+ K V +DG
Sbjct: 1115 GTLGILVPFVSREDVDFFQSLEMQLASSNGNPPLLGRDHLIYRSYYAPSKGV-----IDG 1169
Query: 998 DLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
DL E+F L + I+ ++ SV E+ +++ ++
Sbjct: 1170 DLCETFFLLPNDKKLMIAGELDRSVREIERKISDM 1204
>gi|195376606|ref|XP_002047087.1| GJ13230 [Drosophila virilis]
gi|194154245|gb|EDW69429.1| GJ13230 [Drosophila virilis]
Length = 1229
Score = 196 bits (499), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 256/1201 (21%), Positives = 468/1201 (38%), Gaps = 199/1201 (16%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
++ I+G I L FR G +D++ + ++ + +L++ + L
Sbjct: 53 LMSTEIFGCIRALMAFRLTGGTKDYIVVGSDSGRIVILEYIPSKNSLEKVHQETFGKSGC 112
Query: 61 RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL- 117
R GQ IDP R +IG L ++ D + +L + + + L +
Sbjct: 113 RRIVPGQYFAIDPKGRAVMIGAVEKQKLAYIMNRDTQARLTISSPLEAHKSNTLTYHMVG 172
Query: 118 --YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNGA 164
G P L D ++A + + A + + F E N+ L+ A
Sbjct: 173 VDVGFDNPMFACLEIDYEEADLDPSGDAAQRTQQTLTFYELDLGLNHVVRKYSEPLEEHA 232
Query: 165 DLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR---- 215
+ L+ VP P GVLI E + Y N IR I + +D D R
Sbjct: 233 NFLVSVPGGNDGP--SGVLICSENYLTY--KNLGDQHDIRCPIPRRRNDLD-DPERGMIF 287
Query: 216 -------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVV 262
+LL G + + + + + V+ +K++ A+ + L +
Sbjct: 288 ICSATHRTKSMYFFLLQTEQGDIFKITLETDDDVVSEIKLKYFDTVPPATAMCVLKTGFL 347
Query: 263 YIGSSYGDSQLIKLN------------------------LQPDAKGSYVEVLERYVNLGP 298
++ S +G+ L ++ P A + V +++ + P
Sbjct: 348 FVASEFGNHYLYQIAHLGDDDDEPEFSSAMPLEEGETFFFAPRALKNLV-LVDELPSFAP 406
Query: 299 IVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSST 357
I+ V DL + Q+ G +LR++R+G+ ++E A EL G +W+++
Sbjct: 407 IITSQVADLANEDTPQLYVLCGRGPRSTLRVLRHGLEVSEMAVSELPGNPNAVWTVKKRI 466
Query: 358 DDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSV 417
DD FD +++VSF++ T +L++ + +EE GF T TL C + LVQV +
Sbjct: 467 DDEFDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLCCAALGDDALVQVYPDGI 524
Query: 418 RLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD-GILTE-VKHAQ 475
R + S R NEWK+P S+ N QV++ G LVY E+ G L E + ++
Sbjct: 525 RHIRSDKRV--NEWKAPGKKSITKCAVNQRQVVITLSGRELVYFEMDPTGELNEYTERSE 582
Query: 476 LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLL 535
+ EI C+ + + E S AVG+ D +VRI SL N ++ + P L
Sbjct: 583 MPAEIMCMALGTVPEGEQRSWFLAVGL-ADNTVRILSLDPNNCLSPCSMQALPSPAES-L 640
Query: 536 CAFE---------------------------GISYLLCALGDGHLLNFLLNMKTGELTDR 568
C E G YL L +G LL +L+ +G+L D
Sbjct: 641 CLVEMGHTESTTNAGGADDDVPAQRSGGSNKGTIYLNIGLSNGVLLRTVLDPVSGDLADT 700
Query: 569 KKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAA 628
+ LG++P+ L + V A S R + Y + + ++ + + + F+S
Sbjct: 701 RTRYLGSRPVKLFRIKMQGAEAVLAMSSRTWLSYYHQNRFHLTPLSYETLEYASGFSSEQ 760
Query: 629 FPDSLAIAKEGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQEQSRTFAICS------- 679
+ + L I ++ + + + + PL PR + H + R +
Sbjct: 761 CSEGIVAISTNTLRILALEKLGAVFNQVAFPLQFTPRAFVIHPDTGRMLIAETDHNAYTE 820
Query: 680 -------------LKNQSCAEESEM-----------------------------HFVRLL 697
+++ + EE E+ +R L
Sbjct: 821 DTKNTRKEQMAEEMRSAAGDEERELAREMANAFINEVLPEDVFSAPKAGLGLWASQIRCL 880
Query: 698 DDQTFEFISTYPLDTFE--YGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFI 755
D + + + PL E ++L S + D Y VG A L ++G +
Sbjct: 881 DAMHGQTMFSVPLTQNEAIMSMTLLKFSVAADGRYYLAVGIAKDLQLNPRISQGGCIDIY 940
Query: 756 VED---GKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSEC-G 811
D L+ + E +L F G+LLA + +++Y D G +++ +C
Sbjct: 941 KIDPTCSALEFLHRTEIDEIPGALCGFQGRLLAGCGRMLRIY-----DLGKKKMLRKCEN 995
Query: 812 HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI 871
H + +Q G + V D+ +S+ + Y+ E + A D + W++A +LD D
Sbjct: 996 KHIPYQIVNIQAMGHRVYVSDVQESVFFIRYRRAENQLIIFADDTHPRWVTATTLLDYDT 1055
Query: 872 YLGAENNFNLFTVRKNSEGATDE------------ERGRL-------EVVGEYHLGEFVN 912
+ + F ++++ TD+ +RG L E + +H+GE +
Sbjct: 1056 -IAIADKFGNLSIQRLPHSVTDDVDEDPTGTKSLWDRGLLSGASQKSENICSFHVGEIIM 1114
Query: 913 RFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGV 971
+ +L+ G +I+ T++G +G E Y F + L+ ++R +
Sbjct: 1115 SLQKATLI-------PGGSEALIYATLSGTVGAFVPFTSREDYDFFQHLEMHMRNENPPL 1167
Query: 972 GGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEE 1031
G +H +RS KN LDGDL E +L + + I+ M + ++CK++E+
Sbjct: 1168 CGRDHLSYRS-----SYYPVKNVLDGDLCEQYLSIDAVKQKSIAGDMFRTPNQICKKLED 1222
Query: 1032 L 1032
+
Sbjct: 1223 I 1223
>gi|351712542|gb|EHB15461.1| Splicing factor 3B subunit 3 [Heterocephalus glaber]
Length = 1240
Score = 196 bits (498), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 280/1218 (22%), Positives = 497/1218 (40%), Gaps = 222/1218 (18%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+L V ++G I +L FR G +D++ + ++ + +L++ S + + + + G
Sbjct: 53 LLTVEVFGVIRSLMAFRLTGGTKDYIVVGSDSGRIVILEYQP-SKNMFEKIHQETFGKSG 111
Query: 61 -RPTDNGQIGIIDPDCRLIGLHLYDG--LFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
R GQ +DP R + + + L ++ D +L + + + L +
Sbjct: 112 CRRIVPGQFLAVDPKGRAVMISAIEKQKLVYILNRDAAARLTISSPLEAHKANTLVYHVV 171
Query: 118 ---YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNG 163
G P L D ++A + T E A + F E N+ L+
Sbjct: 172 GVDVGFENPMFACLEMDYEEADNDPTGEAAANTQQTLTFYELDLGLNHVVRKYSEPLEEH 231
Query: 164 ADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLL 218
+ LI VP P GVLI E I Y N IR I + +D D R
Sbjct: 232 GNFLITVPGGSDGP--SGVLICSENYITY--KNFGDQPDIRCPIPRRRNDLD-DPER--- 283
Query: 219 GDHAGLLHLLVITH----------EKEKVTGLKIELLGETSIASTISY------LDNAVV 262
G++ + TH + E+ KI L + + +T+ +N +
Sbjct: 284 ----GMIFVCSATHKTKSMFFFLAQTEQGDIFKITLETDEDMLATVGMNKSFVSAENCYL 339
Query: 263 YIGSSYGD-------SQLIKLN------LQPDAKGSYVEVLERYVNLGPIVDFC-VVDLE 308
Y + GD S + L QP + V +++ +L PI+ FC + DL
Sbjct: 340 YQIAHLGDDDEEPEFSSAMPLEEGDTFFFQPRPLKNLV-LVDELDSLSPIL-FCQIADLA 397
Query: 309 RQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTDDPFDTFLVV 367
+ Q+ G SLR++R+G+ ++E A EL G +W++R +D FD +++V
Sbjct: 398 NEDTPQLYVACGRGPRSSLRVLRHGLEVSEMAVSELPGNPNAVWTVRRHIEDEFDAYIIV 457
Query: 368 SFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSREL 427
SF++ T +L++ + +EE GF T TL C + LVQV +R + + R
Sbjct: 458 SFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSLLGDDALVQVYPDGIRHIRADKR-- 513
Query: 428 RNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKHAQLEYEISCLDI 485
NEWK+P ++ N QV++A GG LVY E+ G L E + ++ ++ C+ +
Sbjct: 514 VNEWKTPGKKTIVKCAVNQRQVVIALTGGELVYFEMDPSGQLNEYTERKEMSADVVCMSL 573
Query: 486 NPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIP-RSVLLCAFE----- 539
+ S+ AVG+ D +VRI SL + + + L + +P + LC E
Sbjct: 574 ANVPPGEQRSRFLAVGL-VDNTVRIISLDPSDCL--QPLSMQALPAQPESLCIVEMGGTE 630
Query: 540 -----------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNT 588
G YL L +G LL +L+ TG+L+D + LG++P+ L +
Sbjct: 631 KQDELGERGSIGFLYLNIGLQNGVLLRTVLDPVTGDLSDTRTRYLGSRPVKLFRVRMQGQ 690
Query: 589 THVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDD 648
V A S R + YS + + ++ + + F S P+ + L I ++
Sbjct: 691 EAVLAMSSRSWLSYSYQSRFHLTPLSYETLEFASGFASEQCPEGIVAISTNTLRILALEK 750
Query: 649 IQKLHIR-SIPLGEHPRR-ICHQEQSR-----------TFAICSLKNQSCAEE------- 688
+ + + + PL PR+ + H E + T A + + Q AEE
Sbjct: 751 LGAVFNQVAFPLQYTPRKFVIHPESNNLIIIETDHNAYTEATKAQRKQQMAEEMVEAAGE 810
Query: 689 ------SEM-------------------------HFVRLLDDQTFEFISTYPLDTFEYGC 717
+EM +R+++ + L+ E
Sbjct: 811 DERELAAEMAAAFLNENLPESIFGAPKAGNGQWASVIRVMNPIQGNTLDLVQLEQNEAAF 870
Query: 718 SILSCSFSDDSNVYYC-VGTAYVLPEENEPTKGRILVF--IVEDG-KLQLIAEKETKGAV 773
S+ C FS+ +Y VG A L G + +V +G KL+ + + +
Sbjct: 871 SVAVCRFSNTGEDWYVLVGVAKDLILNPRSVAGGFVYTYKLVNNGEKLEFLHKTPVEEVP 930
Query: 774 YSLNAFNGKLLAAINQKIQLY----KWMLR------------------------------ 799
++ F G++L + + +++Y K +LR
Sbjct: 931 AAIAPFQGRVLIGVGKLLRVYDLGKKKLLRKCENKTPVEEVPAAIAPFQGRVLIGVGKLL 990
Query: 800 ---DDGTRELQSECGHHGHILALY---VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
D G ++L +C + HI A Y +QT G ++V D+ +S + YK E + A
Sbjct: 991 RVYDLGKKKLLRKC-ENKHI-ANYISGIQTIGHRVIVSDVQESFIWVRYKRNENQLIIFA 1048
Query: 854 RDYNANWMSAVEILDDDIYLGAENNFNLFTVR--KNSEGATDE---------ERGRL--- 899
D W++ +LD D GA+ N+ VR N+ DE +RG L
Sbjct: 1049 DDTYPRWVTTASLLDYDTVAGADKFGNICVVRLPPNTNDEVDEDPTGNKALWDRGLLNGA 1108
Query: 900 ----EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQY 954
EV+ YH+GE V + +L+ G ++++ T++G IG++ HE +
Sbjct: 1109 SQKAEVIMNYHVGETVLSLQKTTLI-------PGGSESLVYTTLSGGIGILVPFTSHEDH 1161
Query: 955 LFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEI 1014
F + ++ +LR + G +H +RS+ KN +DGDL E F + + +
Sbjct: 1162 DFFQHVEMHLRSEHPPLCGRDHLSFRSY-----YFPVKNVIDGDLCEQFNSMEPNKQKNV 1216
Query: 1015 SKTMNVSVEELCKRVEEL 1032
S+ ++ + E+ K++E++
Sbjct: 1217 SEELDRTPPEVSKKLEDI 1234
>gi|321249291|ref|XP_003191408.1| U2 snRNA binding protein [Cryptococcus gattii WM276]
gi|317457875|gb|ADV19621.1| U2 snRNA binding protein, putative [Cryptococcus gattii WM276]
Length = 1217
Score = 196 bits (497), Expect = 8e-47, Method: Compositional matrix adjust.
Identities = 278/1181 (23%), Positives = 478/1181 (40%), Gaps = 184/1181 (15%)
Query: 7 YGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTDN 65
+G I + FR G +D++ ++ + +L++ + +V + G R
Sbjct: 60 FGTIRNIAGFRLAGMTKDYILATSDSGRLSILEFVISPTPHFESLYQEVFGKSGSRRIVP 119
Query: 66 GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YGC 120
GQ +DP R L+G L V+ + +G+L + + + L + G
Sbjct: 120 GQFLAVDPKGRSCLVGSLEKTKLVYVLNRNTEGKLYPSSPLEAHKNHTLVTHVIGVDQGY 179
Query: 121 AKPTIVVLYQDNKDARHVKT---YEVALKDKDFVE---------GPWSQNNLDNGADLLI 168
P L D ++ T YE A K F E WS+ D A+LL+
Sbjct: 180 DNPLYAALEIDYSESDQDPTGEAYENAQKHLTFYELDLGLNHVVRKWSEPT-DRRANLLV 238
Query: 169 PVP------------PPLCGVLIIGEETIVYCSANA-FKAIPIRPSITKAYGRVDADGSR 215
VP P GVL+ E+ I++ + IPI R D SR
Sbjct: 239 QVPGGQNANSDRFEGP--SGVLVCTEDHIIWKHMDVDAHRIPIPRRRNPLVQR--GDKSR 294
Query: 216 -----------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLD 258
+LL G L+ + I H E V LKI+ +A+++ L
Sbjct: 295 GLIIVSAVMHKIKGAFFFLLQSEDGDLYKVWIEHNGEDVVALKIKYFDTVPVANSLCILK 354
Query: 259 NAVVYIGSSYGDSQLIKLNLQPDAKG-------SYVE----------------------V 289
+Y+ S + D L + + G Y E +
Sbjct: 355 RGYIYVASEFSDQNLYQFQSLAEDDGEQEWSSTDYPENGNIDGPLPFAFFDPQPLRNLLL 414
Query: 290 LERYVNLGPIVDFCVVDL--ERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI 347
++ +L PI D VV+L Q+ G + R +++G+ + E S L G+
Sbjct: 415 VDSVPSLDPITDAHVVNLLGASSDTPQIYAACGRGARSTFRTLKHGLDVTEMVSSPLPGV 474
Query: 348 K-GMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIY 406
+W+L+ + DD +D+++V+SF + T +L++ + +EE GF S TL
Sbjct: 475 PTNVWTLKLTEDDEYDSYIVLSFPNGTLVLSIG--ETIEEVNDTGFLSSGPTLAVQQLGN 532
Query: 407 NQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-D 465
L+QV +R + + R +EW +PPG ++ AT N QV++A LVY E+ +
Sbjct: 533 AGLLQVHPYGLRHIRAADR--VDEWPAPPGQTIVAATTNQRQVVIALSTAELVYFELDPE 590
Query: 466 GILTEVKHAQ-LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEH 523
G L+E + + L +C+ I + E + AVG + +V I SL PD L T
Sbjct: 591 GSLSEYQEKKALPGNATCVTIAEVPEGRRRTPFLAVGC-DNQTVSIISLEPDSTLDTLSL 649
Query: 524 LGGEIIPRSVLLCAFEGIS--------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGT 575
P S+ L S +L L +G LL +++ G L+D + LG
Sbjct: 650 QALTAPPTSICLAEIFDTSIDKNRATMFLNIGLMNGVLLRTVVDPVDGSLSDTRLRFLGA 709
Query: 576 QPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAI 635
+P L + + V A S R ++Y+ L + + + ++A PD L
Sbjct: 710 KPPKLVRANVQGQPSVMAFSSRTWLLYTYQDMLQTQPLIYDTLEYAWSLSAAMCPDGLIG 769
Query: 636 AKEGELTIGTIDDI-QKLHIRSIPLGEHPRR-ICH---------QEQSRTFAICSLKNQS 684
L I +I + +KL S PL PR+ I H + RT++ +++
Sbjct: 770 ISGNTLRIFSIPKLGEKLKQDSTPLTYTPRKFISHPFNPVFYMIEADHRTYSKGAIERIV 829
Query: 685 CAEESE--------------------------MHFVRLLDDQTFEFISTYPLDTFEYGCS 718
+ESE VR+LD E I T+ LD E S
Sbjct: 830 KQKESEGRRVDTLLLDLPANEFGRPRAPAGHWASCVRVLDPLANETIMTFDLDEDEAAFS 889
Query: 719 ILSCSFS-DDSNVYYCVGTAYVLPEENEPTK-GRILVF-IVEDGK-LQLIAEKETKGAVY 774
I F + VGT + + K G + V+ I E G+ L+ + + +T
Sbjct: 890 IAIAYFERGGGEPFLVVGTGVKTTLQPKGCKEGYLRVYAIKEQGRVLEFLHKTKTDDIPL 949
Query: 775 SLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY-VQTRGDFIVVGDL 833
L F G LLA + + ++LY+ G + L +C ++G A+ + +G I+VGD+
Sbjct: 950 CLAGFQGFLLAGVGKSLRLYEM-----GKKALLRKCENNGFPTAVVTINVQGARIIVGDM 1004
Query: 834 MKSISLLIYKHEEGAIEER-----ARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKN- 887
+S +Y+ +I R A D W++ V +D + + N+F R +
Sbjct: 1005 QESTFYCVYR----SIPTRQLLIFADDSQPRWITCVTSVDYETVACGDKFGNIFINRLDP 1060
Query: 888 --SEGATDEERG------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTV----- 934
SE D+ G + ++G H E + + GS+V + + +IP V
Sbjct: 1061 SISEKVDDDPTGATILHEKSFLMGAAHKTEMIAHYNIGSVV-----TSITKIPLVAGGRD 1115
Query: 935 --IFGTVNGVIGVIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDA 991
++ T++G +G + + + F+ L+ ++R + G +H +R + V
Sbjct: 1116 VLVYTTISGAVGALVPFVSPDDIEFMSTLEMHMRTQDISLVGRDHIAYRGY-----YVPI 1170
Query: 992 KNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
K +DGDL ESF L + I+ ++ SV ++ K++E++
Sbjct: 1171 KGVVDGDLCESFSLLPYPKQQAIASDLDRSVGDVLKKLEQM 1211
>gi|391867503|gb|EIT76749.1| splicing factor 3b, subunit 3 [Aspergillus oryzae 3.042]
Length = 1034
Score = 195 bits (495), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 243/1010 (24%), Positives = 430/1010 (42%), Gaps = 168/1010 (16%)
Query: 155 WSQNNLDNGADLLIPVP-----PPLCGVLIIGEETIVYCSAN--AFKA-IPIRPSITKAY 206
W+ + +D + +L VP P GVL+ E+++ Y +N AF+ IP R T+
Sbjct: 55 WA-DPVDRTSSMLFQVPGGADGP--SGVLVCAEDSVTYRHSNQDAFRVPIPRRSGPTE-- 109
Query: 207 GRVDADGSRYLLGDHAGLLHLL---------------------VITHEKEKVTG----LK 241
+ + R++ AG++H + ++ + ++TG LK
Sbjct: 110 ---NPERKRFIT---AGVMHKMRGAFFFLLQTEDGDLFKLNIDMVEDDNGQLTGEVKRLK 163
Query: 242 IELLGETSIASTISYLDNAVVYIGSSYGDSQLIKL---------------NLQPDAK--- 283
I+ +AS++ L + +Y+ S G+ + N D
Sbjct: 164 IKYFDTVPVASSLLILKSGFLYVASEAGNHHFYQFEKLGDDDEEIEFSSENFSADPSVPL 223
Query: 284 ---------GSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGI 334
+ ++E +L P++D V +L + Q+ T SG + R +++G+
Sbjct: 224 EPIYFRPRSAENLNLVETINSLNPLIDSKVANLSEEDAPQIYTISGTGARSTFRTLKHGL 283
Query: 335 GINEQASVELQGI-KGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFC 393
++E EL + +W+ + + D FD ++++SF + T +L++ + +EE GF
Sbjct: 284 EVSEIVDSELPSVPSAVWTTKLTRADEFDAYIILSFANGTLVLSIG--ETVEEVTDTGFL 341
Query: 394 SQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLAT 453
S TL + L+Q+ +R + + R NEW +P S+ A N QV +A
Sbjct: 342 SSAPTLAVQQLGEDSLIQIHPRGIRHIMADRR--VNEWPAPQHRSIVAAATNERQVAVAL 399
Query: 454 GGGHLVYLEI-GDGILTEV-KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIF 511
G +VY E+ DG L E + Q+ ++CL + + E S AVG D +VRI
Sbjct: 400 SSGEIVYFEMDADGTLAEYDERRQMSGTVTCLSLGEVPEGRVRSSFLAVGC-DDSTVRIL 458
Query: 512 SL-PDLNLITKEHLGGEIIPRSVLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGE 564
SL PD L K P ++ + + S YL L G L +L+ TGE
Sbjct: 459 SLDPDSTLENKSVQALTAAPSALNIMSMSDSSSGGTTLYLHIGLYSGVYLRTVLDEVTGE 518
Query: 565 LTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYS--SNKKLLYSNVNLKEVSHMC 622
L+D + LG++P+ L S K T V A S RP + YS K + + ++ +
Sbjct: 519 LSDTRTRFLGSKPVKLFQVSVKGQTAVLALSSRPWLGYSDIQTKGFMLTPLDYVPLEWGW 578
Query: 623 PFNSAAFPDSLAIAKEGELTIGTIDDI-QKLHIRSIPLGEHPRRIC-HQEQSRTFAICS- 679
F+S + + + L I +I+ + + +SI L PRR H EQ + I S
Sbjct: 579 NFSSEQCLEGMVGIQGQNLRIFSIEKLDNNMLQQSISLAYTPRRFLKHPEQPLFYVIESD 638
Query: 680 ---LKNQSCAEESEMHFVRLLDDQTF---EF------------------------ISTYP 709
L + A+ E R D+ EF +ST
Sbjct: 639 NNVLSPSTRAKLLEDSKARTGDETVLPPEEFGYPRGSGHWASCIQVVDPVHAKAVVSTIE 698
Query: 710 LDTFEYGCSILSCSF-SDDSNVYYCVGTAYVLPEENEPTK--GRILVF-IVEDGK-LQLI 764
L+ E S+ + F S D + VGTA + N P+ G I ++ EDG+ L+ I
Sbjct: 699 LEENEAAVSVAAVPFTSQDDETFLVVGTAKDM-NVNPPSSAGGYIHIYRFQEDGRELEFI 757
Query: 765 AEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHH---GHILALYV 821
+ + + +L F G+L+A I +++Y D G ++L +C I+ L
Sbjct: 758 HKTKVEEPPLALLGFQGRLVAGIGPMLRIY-----DLGMKQLLRKCNAQVVPKTIVGL-- 810
Query: 822 QTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNL 881
QT+G IVV D+ +S++ ++YK++E + D + W ++ ++D + G + N+
Sbjct: 811 QTQGSRIVVSDVRESVTYVVYKYQENVLIPFVDDSVSRWTTSTTMVDYETTAGGDKFGNI 870
Query: 882 FTVR---KNSEGATDE--------ERG-------RLEVVGEYHLGEFVNRFRHGSLVMRL 923
+ +R K SE A ++ ERG RLE++ + + LV
Sbjct: 871 WMLRCPKKISEQADEDGSGAHLIHERGYLHGTPNRLELMIHVYTQDIPTTLHKTQLV--- 927
Query: 924 PDSDVGQIPTVIFGTVNGVIGVIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSF 982
G +++ +G IG++ + E F + L+ L + G +H +RS+
Sbjct: 928 ----AGGRDILVWSGFHGTIGMLVPFVSREDVDFFQNLEMQLAAQNPPLAGRDHLIYRSY 983
Query: 983 NNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
K V +DGDL E++ L I+ ++ SV E+ +++ ++
Sbjct: 984 YAPVKGV-----IDGDLCETYFLLPNDTKMMIAAELDRSVREIERKISDM 1028
>gi|167522323|ref|XP_001745499.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163775848|gb|EDQ89470.1| predicted protein [Monosiga brevicollis MX1]
Length = 1172
Score = 195 bits (495), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 261/1177 (22%), Positives = 476/1177 (40%), Gaps = 184/1177 (15%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
I+G I L FR G +D+L ++T+ + +LQ+D +++L M R
Sbjct: 24 IFGLIRDLRAFRLTGGTKDYLIVSTDSGRITILQYDVTANKLERVHMETFGKSGCRRIVP 83
Query: 66 GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVL-------DIKF 116
GQ DP R ++G L ++ D +L + + + L D+ F
Sbjct: 84 GQYLATDPKGRAVMVGAVEKQKLVYILNRDAAARLTISSPLEAHKGHSLVYDMVGVDVGF 143
Query: 117 ---LYGCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNN--------LDNGAD 165
L+ C + + Y+D + + E+ + F E N+ LD+ A+
Sbjct: 144 ENPLFAC----LELDYEDVDEDPSGEALELLAQTLVFYELDLGLNHVVRKESIELDSFAN 199
Query: 166 LLIPVPPPL---CGVLIIGEETIVYCSANAFKAIPI------------RPSITKAYGRVD 210
LIPVP GVL+ G+ I + + I + R +I A+
Sbjct: 200 KLIPVPGGADGPSGVLVCGDGQITWRTVGEHTPISVSIPRRLDPFRQPRSTIINAFCMHK 259
Query: 211 ADGSRY-LLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYG 269
+ + LL G L L + H++++V G+ ++ +A ++ L +++I S +G
Sbjct: 260 TKKTFFFLLQTEEGDLFKLTMVHDEDEVQGMILKYFDTVPVAKSMVLLKIGLLFIASEHG 319
Query: 270 DSQLIK-------------LNLQPDAKGSY--------VEVLERYVNLGPIVDFCVVDLE 308
D QL + L+ +P+ K Y + L+ +L P++D + D
Sbjct: 320 DHQLYQIAQLGDNKDEPSFLSTEPEDKVLYFRPRPLLNLAPLDVIESLAPVMDCQLADAG 379
Query: 309 RQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTDDPFDTFLVV 367
+Q Q+ G SLR++R+G+ ++ A L G +WS++ +D DT++V+
Sbjct: 380 QQDGPQLYALCGRGSGSSLRVLRHGLEVSAMAESPLPGNPSAVWSVKRHVEDEADTYIVM 439
Query: 368 SFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSREL 427
SF+ T +L + + +EE GF TL + L+Q+ + +R + R
Sbjct: 440 SFVDATLVLGIG--ETVEEVTDSGFLGSVATLSASRIGDDALLQIHAMGIRHILHDGR-- 495
Query: 428 RNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKHAQLEYEISCLDI 485
NEWK+P ++ N Q ++A G LVY E+ G L E + ++ +++ L +
Sbjct: 496 INEWKAPDRTKISHCAVNERQAVVALSNGELVYFELDRSGQLIEHSERVEMSSQVTALAL 555
Query: 486 NPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVL-LCAFE----- 539
PI E ++ A+G+ D +VR+ SL + + L + +P + LC E
Sbjct: 556 APIPEGAQTARFLALGL-DDQTVRLMSLDKSDCLAP--LSMQALPGAAASLCMVEVRGTH 612
Query: 540 ----GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAAS 595
G+S L LG+G L+ ++ T +L+D + LG + + L V A S
Sbjct: 613 GEPSGLS-LAIGLGNGVLMRSRVDTLTADLSDTRTRYLGARAVKLFPVKVAEEPAVLALS 671
Query: 596 DRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDI-QKLHI 654
+P + Y + ++ + + F+S P+ +A L I + + Q H
Sbjct: 672 TKPWLSYRYQGHSRITPLSYDALEYASAFSSDQCPEGVAAVAGNTLRIFAFEKLGQVFHQ 731
Query: 655 RSIPLGEHPRRICHQEQSRTFAICSLKNQSC----------------------AEESEMH 692
SIPL RR+ ++ + A + +Q C AE+++
Sbjct: 732 NSIPLEYTGRRLLLDPEA-SLAFIAEGDQGCLSVTAKAERLSRIASHQAGTAMAEQTDPQ 790
Query: 693 --------FVRLLDDQTF----------------------EFISTYPLDTFEYGCSILSC 722
LD + F E PL E S+
Sbjct: 791 EDLTSPAALAEPLDVKQFGQSYAGDNLWASNIRVVNLAQGETTCLVPLAQDEMVMSLARV 850
Query: 723 SFSDDSNVYYCVGTAYV--LPEENEPTKGRILVFIVEDGKLQLIAEKETKGAV-YSLNAF 779
F+ N + V P++ +L F V+ ++ L+ G + +L F
Sbjct: 851 RFASSPNDKHIVAGVVKGWKPKQQSMDGAFLLTFQVQGDQVVLLHRTAVDGGLPCALAEF 910
Query: 780 NGKLLAAINQKIQLYKWMLRDDG----TRELQSECGHHGHILALYVQTRGDFIVVGDLMK 835
GK+LA + ++++ + R+L S+ H + T G I D
Sbjct: 911 AGKVLAGVGNTLRIFDLGKKKLLLKTENRQLPSQVVH--------ITTMGTRICAADQKH 962
Query: 836 SISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDE- 894
S L YK E A+ A D N W + +LD GA + F F V + TD+
Sbjct: 963 SFVWLKYKPAENALTIFADDTNPRWCTRGVLLDYQTVAGA-DKFGNFVVARLGTDLTDQI 1021
Query: 895 -----------ERG-------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIF 936
RG +++++ YH+GE + + +LV G +++
Sbjct: 1022 DDDPTGSKAFWSRGILNGASQKMDILCNYHVGETILSLQKVTLV-------PGGAECILY 1074
Query: 937 GTVNGVIGVIASLPH-EQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFL 995
T++G IG+ + + + F L+ +LR+ + G +H +RS K+V +
Sbjct: 1075 TTMSGGIGLFLPFSNRDDFEFFTSLELHLRQEHAPLCGRDHLHYRSAYFPVKSV-----I 1129
Query: 996 DGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
DGDL E + LS + +E+++ + ++ K++E++
Sbjct: 1130 DGDLCEQYPLLSASVKNEVAEGLERPTTDVIKKIEDI 1166
>gi|427798971|gb|JAA64937.1| Putative damage-specific dna binding complex subunit ddb1, partial
[Rhipicephalus pulchellus]
Length = 1259
Score = 195 bits (495), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 260/1173 (22%), Positives = 479/1173 (40%), Gaps = 194/1173 (16%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
++G I ++ FR G ++D+L + ++ + +L+++ + S + + + G R
Sbjct: 58 VFGIIRSMMSFRLTGGSKDYLVVGSDSGRIVILEYNPQKS-CFEKVHQETFGKSGCRRIV 116
Query: 65 NGQIGIIDPDCRLIGLHLYDG--LFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGCA- 121
GQ +DP R + + + L ++ D +L + + + L +Y C
Sbjct: 117 PGQHLAMDPKGRALMVAAVEKQKLVYILNRDAAARLTISSPLEAHKSSTL----VYHCVG 172
Query: 122 ------KPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNGA 164
P L D ++A + E A F E N+ L+
Sbjct: 173 VDVGFENPVFACLEMDYEEADSDPSGEAANNTHQTLTFYELDLGLNHVVRKYSEPLEEHG 232
Query: 165 DLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR---- 215
+ LI VP P GVLI E I Y N IR I + +D D R
Sbjct: 233 NFLIAVPGGSDGP--SGVLICSENYITY--KNFGDQPDIRCPIPRRRNDLD-DPERGMIF 287
Query: 216 -------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVV 262
+L G + + + +++ VT +K++ +A+++ L +
Sbjct: 288 VCSANHKTKSMFFFLAQTEQGDIFKITLETDEDMVTEIKLKYFDTIPVAASMCVLKTGFL 347
Query: 263 YIGSSYGDSQLIKLN------------------------LQPDAKGSYVEVLERYVNLGP 298
++ + +G+ L ++ P A + + V E +L P
Sbjct: 348 FVAAEFGNHCLYQIARLGEEDEEPEFSSAIPLEEGDTFFFAPRALRNLLPV-EELDSLSP 406
Query: 299 IVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSST 357
+ + DL + Q+ G +R++R+G+ ++E A EL G +W+++
Sbjct: 407 AMGCTIADLANEDTPQLYVACGRGPRSCIRVLRHGLEVSEMAVSELPGNPNAVWTVKRKA 466
Query: 358 DDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSV 417
D+ +D +++VSF++ T +L++ + +EE GF T TL C + LVQV +
Sbjct: 467 DEDYDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCAQIGDDALVQVYPEGI 524
Query: 418 RLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD-GILTE-VKHAQ 475
R + + R NEW++P ++ N QV++A GG LVY E+ G L E +
Sbjct: 525 RHIRADKR--VNEWRTPGKKAIVKCAVNQRQVVIALTGGELVYFEMDPTGQLNEYTDRRE 582
Query: 476 LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGG-EIIPRSVL 534
+ +I C+ + + S+ AVG+ D +VRI SL + ++ + P S+
Sbjct: 583 MSADIICMSLASVPTGEQRSRFLAVGL-ADNTVRIISLDPSDCLSPLSMQALPATPESLS 641
Query: 535 LCAF---------EGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSS 585
+ +GI YL L +G LL +L+ TG+L+D + LG++P+ L
Sbjct: 642 IVEMGMTEVETTGQGILYLNIGLQNGVLLRTVLDQVTGDLSDTRTRYLGSRPVKLFKVRM 701
Query: 586 KNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGT 645
+ + A S R + Y + + ++ + + F+S P+ + L I
Sbjct: 702 QGSDAALAMSSRSWLSYYYQNRFHLTPLSYETLEFASGFSSEQCPEGIVAISTNTLRILA 761
Query: 646 IDDIQKLHIR-SIPLGEHPRRICHQEQSRTFAICSL------------KNQSCAEE---- 688
++ + + + S L PR+ S I + Q AEE
Sbjct: 762 LEKLGAVFNQVSTTLEYTPRKFVVNSDSGHLIIVETDHNAYTEKMKQQRKQQMAEEMVEA 821
Query: 689 ---------SEM-------------------------HFVRLLDDQTFEFISTYPLDTFE 714
+EM +R+L+ + + L+ E
Sbjct: 822 AGEGEQELAAEMAAAFLSEVLPETVFGSPKAGSGMWASVIRVLNPADSQTLCKVALEQNE 881
Query: 715 YGCSILSCSF-SDDSNVYYCVGTAYVLPEE--NEPTKGRILVFIVE---DGKLQLIAEKE 768
S+ F S Y VG A L + + + G +L + + + +L+L+
Sbjct: 882 AALSVALVKFASQPDEQYVVVGAARELSLQPWHARSGGLLLTYRLSHAGETRLELVHATS 941
Query: 769 TKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY-VQTRGDF 827
+ A +L F G+LLA + + ++LY D G ++L +C + A+ +Q+ G+
Sbjct: 942 VEEAPTALCPFQGRLLAGVGKCLRLY-----DLGRKKLLRKCENKYIPSAIVSIQSMGNR 996
Query: 828 IVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR-K 886
+VVGD+ +S L YK +E + A D W++A +LD D GA+ N+ +R
Sbjct: 997 VVVGDVQESFFFLRYKRQENQLVIFADDAVPRWITASCMLDYDTVAGADKFGNVSIIRLP 1056
Query: 887 NSEGATDE------------ERGRL-------EVVGEYHLGEFVNRFRHGSLVMRLPDSD 927
NS +DE +RG L EV+ +H+GE V + +L+
Sbjct: 1057 NS--VSDEVDEDPTGIKSLWDRGWLGGSSQKAEVISNFHIGETVLSLQKATLI------- 1107
Query: 928 VGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEK 986
G ++++ T++G IGV+ HE + F + L+ ++R + G +H +RS
Sbjct: 1108 PGGSESLVYVTLSGTIGVLVPFTAHEDHDFFQHLEMHMRSENPPLCGRDHLSFRS----- 1162
Query: 987 KTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN 1019
KN +DGDL E F L ++ I++ ++
Sbjct: 1163 SYFPVKNVIDGDLCEQFNSLDPSKQKSIAEELD 1195
>gi|391341057|ref|XP_003744848.1| PREDICTED: splicing factor 3B subunit 3-like isoform 1 [Metaseiulus
occidentalis]
Length = 1211
Score = 194 bits (492), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 247/1174 (21%), Positives = 479/1174 (40%), Gaps = 173/1174 (14%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
++G I +L FR G ++D+L + ++ + C+L+++ + L M R
Sbjct: 58 VFGVIRSLMSFRLTGGSKDYLIVGSDSGRICILEFNPAKNVLEKIHMETFGKSGCRRIVP 117
Query: 66 GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YGC 120
GQ +DP R +IG L ++ D+ +L + + + L + G
Sbjct: 118 GQYLAVDPKGRAVMIGAVEKQKLVYILNRDSNARLTISSPLEAHKSNTLVYHMVGVDVGF 177
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNGADLLIP 169
P + D ++A + T E A K + F E N+ L+ ++ LI
Sbjct: 178 ENPVFACIELDYEEADNDHTGEAAKKAQQSLTFYELDLGLNHVVRKYSEPLEEPSNFLIA 237
Query: 170 VPPPL---CGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR----------- 215
VP + GVL+ E I Y N IR I + +D D R
Sbjct: 238 VPGGVDGPSGVLVCSENFITY--KNLGDQADIRCPIPRRRNDLD-DPDRGMLFVCSTQHK 294
Query: 216 ------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYG 269
+L G + + + + + VT +K++ +A T+ L + +++ S +G
Sbjct: 295 TKSMFFFLAQTEQGDIFKITLEFDDDAVTEIKLKYFDSLPVAQTMHVLKSGFLFVASEFG 354
Query: 270 DSQLIKL-------------NLQPDAKGSYVEVLERYV----------NLGPIVDFCVVD 306
+ L ++ ++ P +G L R + +L PI+ V D
Sbjct: 355 NHSLYQIAHLGDNTDEPEFSSIFPLEEGDTFFFLPRELKNLVLVDEMDSLSPIMTARVAD 414
Query: 307 LERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTDDPFDTFL 365
L + Q+ G ++R++R+G+ ++E A EL G +W+++ DD +D ++
Sbjct: 415 LTNEDTPQLYAACGRGPRSTMRVLRHGLEVSEMAVSELPGNPSAVWTVKKRADDEYDAYI 474
Query: 366 VVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSR 425
VVSFI+ T +L++ + +EE GF T TL CH ++ LVQ+ +R + + R
Sbjct: 475 VVSFINATLVLSIG--ETVEEVTDSGFLGTTPTLACHQIGHDALVQIYPEGIRHIRADRR 532
Query: 426 ELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTE-VKHAQLEYEISCL 483
NEW++ + N QV++A GG L+Y E+ G L E + ++ ++ C+
Sbjct: 533 --VNEWRTSGKKLIVKCAVNQRQVVIALTGGELIYFEMDSSGQLNEYAERKEMNSDVLCM 590
Query: 484 DINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGG--EIIPRSVLLCAFEGI 541
+ + ++ AVG +D +V + SL + ++ + G E P S+ + G
Sbjct: 591 ALGSVPAGEQRTKFLAVGS-SDGTVHVISLDPKSCLSILSVQGMTESNPESLAIVDMSGH 649
Query: 542 S--------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFA 593
YL L +G L +L+ TG+++D + LG++ + L + T V A
Sbjct: 650 EEGSGSSALYLNIGLQNGILKRMVLDAVTGDMSDTRTRFLGSRSVKLFKIKMQGTDAVLA 709
Query: 594 ASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLH 653
S R + Y + + ++ + + F S P+ + L I ++ + +
Sbjct: 710 MSSRCWLSYLFQNRFHLTPLSYDSLEYASGFCSEQCPEGIVAIAGNTLRILALEKLGAVF 769
Query: 654 IR-SIPLGEHPRRICHQEQSR------------TFAICSLKNQSCAEE------------ 688
+ S PL PR ++S+ T ++ + Q AEE
Sbjct: 770 NQLSAPLSFTPRSFVIDKKSQKLIVIETDHNAYTDSVKEKRKQHVAEEMIEAAGEAEQEM 829
Query: 689 -------------SEMHF-------------VRLLDDQTFEFISTYPLDTFEYGCSILSC 722
E F +R+L+ + L+ E S+
Sbjct: 830 AAELAAAFLSEDLPESQFGAPRAGKKHWASVIRILNPSDLSTVYKIQLEQNEAAVSVALV 889
Query: 723 SFSDDSNVYYCVGTAYVL---PEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAF 779
+F ++ VG + L P E + + + +L+L+ ++ F
Sbjct: 890 NFDKSTDPILLVGISKDLQLSPRECRNSFINAYRVVKDCTELELVHTTVMDDVPQAMCNF 949
Query: 780 NGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHI--LALYVQTRGDFIVVGDLMKSI 837
++L + + +++Y D G +++ +C + HI L + + G+ IVVGD+ +S
Sbjct: 950 GNRVLIGVGRCLRIY-----DFGKKKMLRKC-ENKHIPNLIVTINAVGNRIVVGDVQESF 1003
Query: 838 SLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR--KNSEGATDE- 894
+ Y+ E + A D+ W +A ++D +G + N++ +R N+ DE
Sbjct: 1004 FFIRYRMLENQLIIFADDFTPRWTTAACMVDYRTVVGGDKFGNVYILRLPGNTSDDVDED 1063
Query: 895 --------ERGRL-------EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTV 939
+RG L EV+ H+GE + + +L+ P++ +++ T+
Sbjct: 1064 PTGVRSLWDRGWLGGAGQKAEVLSMTHVGELIVSLQKTALIPGGPEA-------IVYTTI 1116
Query: 940 NGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGD 998
G +G + + + F + L+ +R + G +H +RS+ K V +DGD
Sbjct: 1117 AGGVGALIPFSSKDDHEFFQHLEMYMRTEHPPICGRDHLSFRSYYFPVKAV-----IDGD 1171
Query: 999 LIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
L E + L + +I+ + E+ K++E++
Sbjct: 1172 LCEQYNSLDANKQKQIADELERLPHEVAKKLEDI 1205
>gi|258570355|ref|XP_002543981.1| pre-mRNA splicing factor rse1 [Uncinocarpus reesii 1704]
gi|237904251|gb|EEP78652.1| pre-mRNA splicing factor rse1 [Uncinocarpus reesii 1704]
Length = 1209
Score = 193 bits (491), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 274/1177 (23%), Positives = 490/1177 (41%), Gaps = 187/1177 (15%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
++G I +L FR G +D++ I ++ + ++++ S R + + G R
Sbjct: 64 VFGIIRSLAAFRLAGSNKDYIIIGSDSGRITIVEY-VPSQNRFNRIHLETFGKSGIRRVV 122
Query: 65 NGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEE-------LQVLDIK 115
GQ +DP R LI + L V+ + + +L + + L LD+
Sbjct: 123 PGQYLAVDPKGRACLIASVEKNKLVYVLNRNAQAELTISSPLEAHRPQTIVFALTALDVG 182
Query: 116 F---LYGCAKPTIVVLYQDNKDARHVKT------YEVALKDKDFVEGPWSQNNLDNGADL 166
+ ++G + QD + + ++ YE+ L V W + +D A +
Sbjct: 183 YENPIFGALEVEYTESDQDPTGSAYEESEKLLVYYELDLGLNHVVRK-WVEP-VDRTAMM 240
Query: 167 LIPVP-----PPLCGVLIIGEETIVYCSAN--AFKA-IPIRPSITKAYGRVDADGSRYLL 218
L VP P GVL+ E I Y +N F+ IP R T+ + D RY+
Sbjct: 241 LFQVPGGADGP--SGVLVCAEGNITYRHSNQDVFRVPIPRRSGPTE-----NPDRKRYIT 293
Query: 219 GDHAGLLHLL---------------------VITHEKEKVTG----LKIELLGETSIAST 253
AG++H + ++ + + TG LK++ IAS+
Sbjct: 294 ---AGVVHKMRRAFFCLLQTEDGDLFKVTIDMVEDDNGQPTGEVRRLKLKYFDTVPIASS 350
Query: 254 ISYLDNAVVYIGSSYGDSQLIKLN--------------------LQPDAKGSY------- 286
+ L N +++ S G+ + L+P A +
Sbjct: 351 LCILKNGFLFVASENGNHHFYQFEKLGDDDEETEFTSDDFSSDPLEPLAPVYFRPRPAEN 410
Query: 287 VEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG 346
+ ++E ++ P++ V +L Q+ T G + R +++G+ ++E EL
Sbjct: 411 LNLVESINSVNPLMSCKVANLTEDDAPQLYTLCGTGARSTFRTLKHGLEVSEIVESELPS 470
Query: 347 I-KGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAI 405
+ +W+ + + +D +D ++++SF + T +L++ + +EE GF S TL
Sbjct: 471 VPSAVWTTKLTRNDQYDAYIILSFTNGTLVLSIG--ETVEEVTDTGFLSSAPTLAVQQLG 528
Query: 406 YNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG- 464
+ L+QV +R + + R NEW +P S+ A N QV +A G +VY E+
Sbjct: 529 EDSLIQVHPKGIRHIHADRR--VNEWPAPQHRSIVAAATNERQVAVALSSGEIVYFEMDT 586
Query: 465 DGILTEV-KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKE 522
DG L E + ++ ++CL + I S AVG D +VRI SL PD L K
Sbjct: 587 DGSLAEYDEKREMSGTVTCLSLGEILPGRVRSSFLAVGC-DDSTVRILSLDPDSTLENKS 645
Query: 523 HLGGEIIPRSV-LLCAFEGIS-----YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQ 576
P ++ ++ + S YL L G L +L+ TGEL+D + LG +
Sbjct: 646 VQALTSAPSALSIMSMIDSTSGGSTLYLHIGLYSGIYLRTVLDEVTGELSDTRTRFLGLK 705
Query: 577 PITLRTFSSKNTTHVFAASDRPTVIYS--SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLA 634
P+ L + S K V A S RP + YS K + + ++ + F+S + +
Sbjct: 706 PVKLFSVSVKEQRAVLALSSRPWLGYSDLQTKNFMLTPLDYVPLEWSWNFSSEQCVEGMV 765
Query: 635 IAKEGELTIGTIDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCA------- 686
+ L I +I+ + L +IPL PR + F + N +
Sbjct: 766 GIQGQNLRIFSIEKLDNNLLQETIPLAYTPRHFVRHPEHPLFYVIGADNNILSPATKAKL 825
Query: 687 ---------EESEM------------HF---VRLLDD-QTFEFISTYPLDTFEYGCSILS 721
E SE+ H+ ++++D + +S L+ E S+ +
Sbjct: 826 LEDSKAVNGEASELPPEDFGYSRGTGHWASCIQVVDPINSKTVLSRIELEENEAAVSVAA 885
Query: 722 CSFSD-DSNVYYCVGTAYVLPEENEPTK--GRILVF-IVEDGK-LQLIAEKETKGAVYSL 776
FS D + VGT + N P+ G I ++ EDGK L+ I + + + +L
Sbjct: 886 VPFSSQDDETFLVVGTGKDM-VVNPPSSSCGYIHIYRFQEDGKELEFIHKTKVESPPQAL 944
Query: 777 NAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYV---QTRGDFIVVGDL 833
AF G+LLA I +++Y D G ++L +C ++ + QT+G I+V D+
Sbjct: 945 LAFQGRLLAGIGTNLRIY-----DLGMKQLLRKC--QAEVVPRMIVGLQTQGSRIIVSDV 997
Query: 834 MKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEG 890
+S++ ++YK++E + A D A W + ++D + G + NL+ +R K SE
Sbjct: 998 QESVTYVVYKYQENRLIPFADDIIARWTTCTTMVDYETVAGGDKFGNLWLLRCPQKASEE 1057
Query: 891 ATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGT------------ 938
A ++ G + +L NR SL++ D IPT I T
Sbjct: 1058 ADEDGSGAHLIHERQYLQGAPNRL---SLMIHFYPQD---IPTSIQKTQLVAGGRDILVW 1111
Query: 939 --VNGVIGV-IASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFL 995
+ G IG+ I + E F + L+ L + G +H +RS+ AK +
Sbjct: 1112 TGLQGTIGMLIPFVSREDVDFFQSLEMQLTSQTPPIAGRDHLIYRSY-----YAPAKGTI 1166
Query: 996 DGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
DGDL E++ L + I+ ++ SV E+ +++ ++
Sbjct: 1167 DGDLCETYFTLPNDKKLMIAGELDRSVREIERKISDM 1203
>gi|390357128|ref|XP_001198237.2| PREDICTED: splicing factor 3B subunit 3-like [Strongylocentrotus
purpuratus]
Length = 949
Score = 193 bits (490), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 207/936 (22%), Positives = 406/936 (43%), Gaps = 144/936 (15%)
Query: 216 YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIK 275
+L G + + + + + VT ++++ +A++++ L ++I S YG+ L +
Sbjct: 33 FLAQTEQGDIFKITLETDDDMVTEIRMKYFDTVPVATSMNVLKTGFLFIASEYGNHYLYQ 92
Query: 276 L-------------NLQPDAKGSY----------VEVLERYVNLGPIVDFCVVDLERQGQ 312
+ + P +G +E +++ +L PI+ + DL +
Sbjct: 93 IAHLGDDDDEPEFSSATPLEEGDTFFFAPRTLRNLEEVDQLESLSPILSCQIADLASEDT 152
Query: 313 GQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTDDPFDTFLVVSFIS 371
Q+ G S+R++R+G+ ++E A EL G +W+++ +DD +D +++VSF++
Sbjct: 153 PQLYVACGRGPRSSMRVLRHGLEVSEMAVSELPGNPNAVWTVKKKSDDEYDAYIIVSFVN 212
Query: 372 ETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEW 431
T +L++ + +EE GF T TL + L+Q+ +R + + R NEW
Sbjct: 213 ATLVLSIG--ETVEEVTDSGFLGTTPTLSSSLIGDDALLQIYPDGIRHIRADKR--VNEW 268
Query: 432 KSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKHAQLEYEISCLDINPIG 489
K+P S+ N QV++A GG +VY E+ G L E + ++ ++ C+ + +
Sbjct: 269 KTPGKKSIVKCAVNQRQVVIALTGGEMVYFEMDPTGQLNEYTERKEMNADVKCMSLATVP 328
Query: 490 ENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSV-LLCAFE--------- 539
++ AVG+ D +VR+ SL + + + L + +P LC E
Sbjct: 329 SGEQRARFLAVGL-DDNTVRVISLDQSDCL--QPLSMQALPAPAEALCIIEMGGTEAREE 385
Query: 540 -------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
G +L L +G LL +L+ TG+L+D + LG++P+ L + + + V
Sbjct: 386 TGERGSSGGLFLNIGLQNGVLLRMVLDSVTGDLSDTRTRYLGSRPVKLFRINMQGSEAVL 445
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
A S R + Y + + ++ + + F S P+ + L I ++ + +
Sbjct: 446 AMSSRSWLSYWYQSRFHLTPLSYETLEFASGFASEQCPEGIVAISTNTLRILALEKLGAV 505
Query: 653 HIR-SIPLGEHPRRIC-HQEQSRTFAICSLKN-----------QSCAEE----------- 688
+ S L PR+ H E + I + N Q AEE
Sbjct: 506 FNQVSTQLKYTPRKFAVHLENNNIVVIETDHNTYTDSTKAQRKQQMAEEMVEAAGEEEKE 565
Query: 689 --SEM-------------------------HFVRLLDDQTFEFISTYPLDTFEYGCSILS 721
+EM +RLL+ + + L+ E S+
Sbjct: 566 LAAEMAEAFLQEELPESTFSSPKAGSGMWASTIRLLNPVQGQTLDMVELEQNEAAFSLAL 625
Query: 722 CSFSD-DSNVYYCVGTAY-VLPEENEPTKGRILVFIV--EDGKLQLIAEKETKGAVYSLN 777
F++ + + VGTA V + G I +++ E KL+ + + S+
Sbjct: 626 VKFANREDETHVLVGTAKDVTLSPRTCSGGAIHTYLLTEEGKKLEFLHKTLVDDVPSSIA 685
Query: 778 AFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHI--LALYVQTRGDFIVVGDLMK 835
AF G++L I + +++Y D G ++L +C + HI + T G+ +++ D+ +
Sbjct: 686 AFQGRVLIGIGRLLRIY-----DLGKKKLLRKC-ENKHIPNFITNILTTGNRVIISDVQE 739
Query: 836 SISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR--KNSEGATD 893
S+ L YK +E + A D N W++A +LD D + A+ N+ +R + D
Sbjct: 740 SLHFLKYKRQENQMVIFADDSNPRWITATCLLDHDTVMCADKFGNITVLRLPTSVNDNLD 799
Query: 894 E---------ERGRL-------EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFG 937
E +RG L +VV + +GE V + +L+ G +V++
Sbjct: 800 EDPTGVKALWDRGLLNGASQKTDVVSSFFIGETVLSLQKATLI-------PGGSESVVYT 852
Query: 938 TVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLD 996
T++G IGV+ HE + F + L+ ++R + G +H +RS+ KN +D
Sbjct: 853 TLSGAIGVLVPFTAHEDHDFFQHLEMHMRSEYAPLCGRDHLSYRSY-----YYPVKNVID 907
Query: 997 GDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
GDL E + + ++ +++ ++ + E+ K++E++
Sbjct: 908 GDLCEQYNSMELSKQRTVAEELDRNPSEVSKKLEDI 943
>gi|312370905|gb|EFR19207.1| hypothetical protein AND_22901 [Anopheles darlingi]
Length = 1287
Score = 192 bits (489), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 264/1197 (22%), Positives = 476/1197 (39%), Gaps = 206/1197 (17%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+L ++G + +L FR G ++D+ I ++ + +L+++ ++L
Sbjct: 126 LLQTEVFGVVRSLMSFRLTGGSKDYAVIGSDSGRIVILEYNPVKNQLEKVHQETFGKSGC 185
Query: 61 RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL- 117
R GQ IDP R +IG L ++ D++ +L + + + L +
Sbjct: 186 RRIVPGQYLSIDPKGRAVMIGAIEKQKLVYILNRDSEARLTISSPLEAHKSNTLTYHMVG 245
Query: 118 --YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNGA 164
G P L D ++A T E A K + F E N+ L+
Sbjct: 246 VDVGFENPMFACLEIDYEEADGDPTGEAAAKTQQTLTFYELDLGLNHVVRKYSEPLEEHV 305
Query: 165 DLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR---- 215
+ LI VP P GVLI E + Y N IR I + +D D R
Sbjct: 306 NFLISVPGGNDGP--SGVLICSENYLTY--KNLGDQHDIRCPIPRRRNDLD-DPERGMIF 360
Query: 216 -------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVV 262
+L+ G + + + + + V+ +K++ A+ + L +
Sbjct: 361 ICSATHRTKSMYFFLVQTEQGDIFKVTLETDDDVVSEIKLKYFDTVPPATAMCVLKTGFL 420
Query: 263 YIGSSYGDSQLIKL-------------NLQPDAKGSYVEVLERYV----------NLGPI 299
++ +G+ L ++ + P +G R + + PI
Sbjct: 421 FVACEFGNHYLYQIAHLGDDDDEPEFSSAMPLEEGDTFFFAPRQLKNLVMVDDIPSYAPI 480
Query: 300 VDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTD 358
+ V DL + Q+ G S+R++R+G+ ++E A EL G +W+++ D
Sbjct: 481 LGCQVADLANEDTPQLYLACGRGPRSSIRVLRHGLEVSEMAVSELPGNPNAVWTVKKRID 540
Query: 359 DPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVR 418
D FD +++VSF++ T +L++ D +EE GF DA LVQV +R
Sbjct: 541 DEFDAYIIVSFVNATLVLSIG--DTVEEVTDSGFL----VPLGDDA----LVQVYPDGIR 590
Query: 419 LVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD-GILTE-VKHAQL 476
+ + R NEWK+P ++ N QV++A GG LVY E+ G L E + ++
Sbjct: 591 HIRADKR--VNEWKAPGKKTIMKCAVNQRQVVIALSGGELVYFEMDPTGQLNEYTERKKM 648
Query: 477 EYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSV-LL 535
E+ C+ + + S AVG+ D +VRI SL + ++ + + +P + L
Sbjct: 649 PSEVMCMALGSVPSGEQRSWFLAVGL-ADNTVRIISLDPSDCLSPRSM--QALPSAAESL 705
Query: 536 CAFE----------------------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSL 573
C E G YL L +G LL +L+ +G+L D + L
Sbjct: 706 CIVEMGTVETAANDDDDDDDAPLKTSGCIYLNIGLTNGVLLRTVLDPVSGDLADTRTRYL 765
Query: 574 GTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSL 633
G++P+ L + + V A S R + Y + + ++ + + + F+S + +
Sbjct: 766 GSRPVKLFRIQMQGSEAVLAMSSRSWLSYYYQNRFHLTPLSYETLEYASGFSSEQCSEGI 825
Query: 634 AIAKEGELTIGTIDDIQKLHIR-SIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMH 692
L I ++ + + + S PL P+R ++ I + + EE++
Sbjct: 826 VAISTNTLRILALEKLGAVFNQISFPLEYTPKRFAIHAETGKLIISETDHNAYTEETKTV 885
Query: 693 FVRLLDDQTFE----------------FIS-TYPLDTF---------------------- 713
R + D+ E FI+ P D F
Sbjct: 886 RKRQMADEMREAAGDDEQELANEMADAFINEVLPEDVFSAPKAGTGMWASQIRVMDPING 945
Query: 714 -----------EYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGR---ILVFIVED- 758
E S+ F+ D Y G A L + P G I V+ V++
Sbjct: 946 HTYSKVQLAQNEAVLSLALVRFAVDQKWYVVAGVAKDL--QINPKIGNGGFIDVYRVDNQ 1003
Query: 759 -GKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHI- 816
+L+ + E A +L AF G+LLA I + +++Y D G ++L +C + HI
Sbjct: 1004 TNELEHVHRTEIDDAPGALCAFQGRLLAGIGKVLRMY-----DLGKKKLLRKC-ENKHIP 1057
Query: 817 -LALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGA 875
+ +Q G + V D+ +S+ L YK E + A D + W+++ +LD D +
Sbjct: 1058 NQIVNIQGMGQRVYVSDVQESVYCLKYKRPENQLIIFADDTHPRWVTSATLLDYDT-VAT 1116
Query: 876 ENNFNLFTVRKNSEGATDE------------ERGRL-------EVVGEYHLGEFVNRFRH 916
+ F V + +D+ +RG L E + +HLGE V +
Sbjct: 1117 GDKFGNIAVLRLPHSVSDDVDEDPTGNKALWDRGLLNGASQKAENICTFHLGEIVMSLQK 1176
Query: 917 GSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLN 975
+L+ G ++I+ T++G +G + E Y F + L+ ++R + G +
Sbjct: 1177 ATLI-------PGGSESLIYATMSGTVGALVPFTSREDYDFFQHLEMHMRNENPPLCGRD 1229
Query: 976 HEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
H +RS+ KN +DGDL E F L + I+ + + E+ K++E++
Sbjct: 1230 HLSYRSY-----YYPVKNVMDGDLCEQFTSLDPAKQKSIASDLGRTPSEVAKKLEDI 1281
>gi|378730761|gb|EHY57220.1| pre-mRNA-splicing factor rse1, variant [Exophiala dermatitidis
NIH/UT8656]
Length = 914
Score = 192 bits (489), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 211/894 (23%), Positives = 379/894 (42%), Gaps = 119/894 (13%)
Query: 236 KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKL---------------NLQP 280
+V LKI+ +A+++ L + +++ S G+ + +
Sbjct: 37 EVRALKIKYFDTVPVANSLHILKSGFLFVASESGNHHFYQFEKLGDDDEETEFTSDDYPA 96
Query: 281 DAKGSY------------VEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLR 328
D +Y + +++ P++D V +L + Q+ + GA S +
Sbjct: 97 DPTAAYTPAFFHVRPLSNLNLVQSVDAANPLLDCKVANLLDEDAPQIYSICGAGARSSFK 156
Query: 329 IVRNGIGINEQASVEL-QGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEET 387
+++G+ ++E EL + +W+ + + DD +D ++++SF T L +++ + +EE
Sbjct: 157 TLKHGLSVSEIVESELPDKPEAVWTTKLTRDDQYDAYIILSF--RTGTLVLSIGETVEEV 214
Query: 388 EIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANAS 447
GF S TL + LVQV +R + + R NEW +P S+ AT N
Sbjct: 215 TDTGFLSTAPTLAVQQLGEDALVQVHPKGIRHIRADKR--VNEWPAPQHRSIVAATTNER 272
Query: 448 QVLLATGGGHLVYLEIG-DGILTEV-KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTD 505
QV +A G +VY E+ DG L E + ++ ++CL + + E S AVG D
Sbjct: 273 QVAVALSSGEIVYFEMDTDGSLAEYDERREMTGTVTCLSLGDVPEGRVRSSFLAVGC-DD 331
Query: 506 ISVRIFSL-PDLNLITKEHLGGEIIPRSVLLCAFEGIS------YLLCALGDGHLLNFLL 558
+VRI SL PD L K P ++ + A + YL L G L +L
Sbjct: 332 STVRILSLDPDSTLENKSVQALTSAPSALSIMAMADSTSGGTTMYLHIGLYSGVYLRTVL 391
Query: 559 NMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKK--LLYSNVNLK 616
+ TGEL+D + LG +P L S K V A S RP + Y+ + + + ++
Sbjct: 392 DEITGELSDTRTRFLGLRPAKLFRVSVKGQNAVMALSSRPWLGYTDTQTNGFMLTPLDYV 451
Query: 617 EVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQK-LHIRSIPLGEHPRRICHQEQSRTF 675
+ ++ F S P+ + + L I +I+D+ + L +IPL PR+ F
Sbjct: 452 PLQYVWNFTSEQCPEGMVGIQGQNLRIFSIEDLSRNLLQENIPLPYTPRKFVKHPDQPLF 511
Query: 676 AICSLKNQSCAEESEMHFVR----------LLDDQTFEF--------------------- 704
+ N A + + +L + F +
Sbjct: 512 YVIEADNNVLAPATRQKLLNESTAVNGDAVILPPEEFGYPKATGHWASCIQVVDPVNTKS 571
Query: 705 -ISTYPLDTFEYGCSILSCSF-SDDSNVYYCVGTAY-VLPEENEPTKGRILVF-IVEDGK 760
+ T L+ E SI + F S D V+ VGTA ++ + G + V+ EDGK
Sbjct: 572 VVFTLELEDNECATSITTAPFASQDDEVFLIVGTAKDLVVSPRSFSAGFLHVYRFHEDGK 631
Query: 761 -LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILAL 819
L+ I + + + +L AF G+LLA + +++Y D G +++ +C L +
Sbjct: 632 ELEFIHKTKVEQPPTALLAFQGRLLAGVGPDLRIY-----DLGMKQMLRKCQVTTPNLIV 686
Query: 820 YVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNF 879
+QT+G I+V D+ +S++ +YK +E + D A W + ++D + G +
Sbjct: 687 GLQTQGSRIIVSDIQESVTYCVYKFQENKLIPFCDDVIARWTTCCTMVDYETVAGGDKFG 746
Query: 880 NLFTVR---KNSEGATDE--------ERG-------RLEVVGEYHLGEFVNRFRHGSLVM 921
NL+ +R K SE A ++ ERG RL ++ Y G+ + +LV
Sbjct: 747 NLWMLRCPQKVSEEADEDNSGVHLLHERGYLNGTPNRLSLMIHYFPGDIPTSIQKTNLV- 805
Query: 922 RLPDSDVGQIPTVIFGTVNGVIGVIAS-LPHEQYLFLEKLQTNLRKVIKG--VGGLNHEQ 978
G V + G +G++ + E F + L+ L + G +H
Sbjct: 806 ------AGGRDVVFWTGFQGTLGILVPFVSREDVDFFQSLEMQLASSNGNPPLLGRDHLI 859
Query: 979 WRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
+RS+ K V +DGDL E+F L + I+ ++ SV E+ +++ ++
Sbjct: 860 YRSYYAPSKGV-----IDGDLCETFFLLPNDKKLMIAGELDRSVREIERKISDM 908
>gi|157131296|ref|XP_001662180.1| spliceosomal protein sap [Aedes aegypti]
gi|108881836|gb|EAT46061.1| AAEL002709-PA [Aedes aegypti]
Length = 1215
Score = 192 bits (488), Expect = 9e-46, Method: Compositional matrix adjust.
Identities = 251/1187 (21%), Positives = 471/1187 (39%), Gaps = 185/1187 (15%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+L ++G I +L FR G +D+ + ++ + +L+++ ++L
Sbjct: 53 LLQTEVFGVIRSLMSFRLTGGTKDYAVVGSDSGRIVILEYNPAKNQLEKVHQETFGKSGC 112
Query: 61 RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL- 117
R GQ IDP R +IG L ++ D++ +L + + + L +
Sbjct: 113 RRIVPGQFLAIDPKGRAVMIGAIEKQKLVYILNRDSEARLTISSPLEAHKSSTLTYHMVG 172
Query: 118 --YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNGA 164
G P L D ++A + E A K + F E N+ L+ A
Sbjct: 173 VDVGFENPMFACLEIDYEEADLDPSGEAAAKTQQTLTFYELDLGLNHVVRKYSEPLEEHA 232
Query: 165 DLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR---- 215
+ LI VP P GVLI E + Y N IR I + +D D R
Sbjct: 233 NFLISVPGGNDGP--SGVLICSENYLTY--KNLGDQHDIRCPIPRRRNDLD-DPERGMIF 287
Query: 216 -------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVV 262
+L+ G + + + + + V+ +K++ A+ + L +
Sbjct: 288 ICSATHRTKSMYFFLVQTEQGDIFKVTLETDDDVVSEIKLKYFDTVPPATAMCVLKTGFL 347
Query: 263 YIGSSYGDSQLIKL-------------NLQPDAKGSYVEVLERYV----------NLGPI 299
++ +G+ L ++ + P +G R + + PI
Sbjct: 348 FVACEFGNHYLYQIAHLGDDDDEPEFSSAMPLEEGDTFFFAPRQLKNLVMVDEIHSYAPI 407
Query: 300 VDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTD 358
+ V DL + Q+ G S+R++R+G+ ++E A EL G +W+++ D
Sbjct: 408 LGCQVADLANEDTPQLYLACGRGPRSSIRVLRHGLEVSEMAVSELPGNPNAVWTVKKRID 467
Query: 359 DPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVR 418
+ FD +++VSF++ T +L++ D +EE GF T TL C + LVQV +R
Sbjct: 468 EEFDAYIIVSFVNATLVLSIG--DTVEEVTDSGFLGTTPTLSCSALGDDALVQVYPDGIR 525
Query: 419 LVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKHAQL 476
+ + R NEWK+P ++ N QV++A GG LVY E+ G L E + ++
Sbjct: 526 HIRADKR--VNEWKAPGKKTITKCAVNQRQVVIALSGGELVYFEMDPTGQLNEYTERKKM 583
Query: 477 EYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSV-LL 535
++ C+ + + S AVG+ D +VR+ SL + ++ + + +P + L
Sbjct: 584 PSDVMCMALGSVPAGEQRSWFLAVGL-ADNTVRVISLDPSDCLSPRSM--QALPSAAESL 640
Query: 536 CAFE---------------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITL 580
C E G YL L +G LL +L+ +G+L D + LG++P+ L
Sbjct: 641 CIVEMGTGDTNEEGVASSAGCIYLNIGLTNGVLLRTVLDPVSGDLADTRTRYLGSRPVKL 700
Query: 581 RTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGE 640
+ + V A S R + Y + + ++ + + + F+S + +
Sbjct: 701 FRIKMQGSEAVLAMSSRTWLSYYFQNRFHLTPLSYETLEYASGFSSEQCSEGIVAISTNT 760
Query: 641 LTIGTIDDIQKLHIR-SIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDD 699
L I ++ + + + + PL P+R ++ I + + EE++ + + D
Sbjct: 761 LRILALEKLGAVFNQITFPLEYTPKRFLIHNETGKLVISETDHNAYTEETKNIRKKQMAD 820
Query: 700 QTFE----------------FIS-TYPLDTF----------------------------- 713
+ E FI+ P D F
Sbjct: 821 EMREAAGEDEQELANEMADAFINEVLPEDVFSAPKAGTGMWASQIRVMDPINGHTYSKVQ 880
Query: 714 ----EYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKG---RILVFIVEDGKLQLIAE 766
E S+ F+ D Y G A L + G + + V +L+
Sbjct: 881 LAQNEAVLSMALVRFAVDQKWYVVAGVAKDLQMNPKIANGGFIDVYKYDVHTHQLEHYHR 940
Query: 767 KETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHI--LALYVQTR 824
E A ++ F G++L + + +++Y D G ++L +C + HI + +Q
Sbjct: 941 TEIDDAPGAIAGFQGRVLVGVGRVLRIY-----DLGKKKLLRKC-ENKHIPNQIVNIQAM 994
Query: 825 GDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTV 884
G + V D+ +SI + YK E + A D + W++ +LD D A+ N+ +
Sbjct: 995 GSRVFVSDVQESIYCIKYKRAENQLIIFADDTHPRWITTSTLLDYDTVATADKFGNIAIL 1054
Query: 885 RKNSEGATD-----------EERGRL-------EVVGEYHLGEFVNRFRHGSLVMRLPDS 926
R + D +RG L E + +HLGE + + +L+
Sbjct: 1055 RLPHSVSDDVDEDPTGNKALWDRGLLNGASQKAENICTFHLGETIMSLQKATLI------ 1108
Query: 927 DVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNE 985
G ++I+ T++G +G + E Y F + L+ ++R + G +H +RS+
Sbjct: 1109 -PGGSESLIYATMSGTVGALVPFTSREDYDFFQHLEMHMRNENTPLCGRDHLSYRSY--- 1164
Query: 986 KKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
K+ +DGDL E F + + I+ + + E+ K++E++
Sbjct: 1165 --YYPVKHVMDGDLCEQFTSMDPAKQKSIASDLGRTPNEVAKKLEDI 1209
>gi|443918546|gb|ELU38987.1| CPSF A subunit region domain-containing protein [Rhizoctonia solani
AG-1 IA]
Length = 1037
Score = 191 bits (485), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 217/845 (25%), Positives = 356/845 (42%), Gaps = 97/845 (11%)
Query: 212 DGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDS 271
DG ++GD G LHL ++ E + + LLG+ S + + +V S G
Sbjct: 219 DGKHIIVGDSFGHLHLATMSMEPQ--FAMSCMLLGDVSTCMILLDPEGSV-----SRGKK 271
Query: 272 QLIKLNLQPDAKGSYVEVL-ERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIV 330
+ QPD + +++EV+ E NL PI+D +V+++ GQ ++ SG G L +V
Sbjct: 272 K------QPDDETTHLEVIVEHNKNLAPILDATLVEIDGSGQPRIALISGDESGGWLSVV 325
Query: 331 RNGIGINEQASVE-LQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDE----LE 385
G E A ++ L ++ ++ L+ D+ ++LV S + T IL+ L D L
Sbjct: 326 HKGASFRELAILDGLGHLENIFPLKKYFDERTHSYLVASTTTSTYILS--LADSSISLLS 383
Query: 386 ETEIEGFCSQTQTLFCHDAIY---NQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVA 442
E+ G T+ + + + VT + L+ + + WK P G A
Sbjct: 384 REELSGISRSETTILASNVPLGGVDAAIHVTPQKIVLIDIITGRAISSWKPPKGDITAAA 443
Query: 443 -TANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVG 501
++S + +AT GHL L I L + + ++IS L IN + I A
Sbjct: 444 LDTSSSTICVATSEGHLFSLNIQSAGLLQT--GKPHWQISSLAIN--------AGIVAAA 493
Query: 502 MWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEGISYL--LCA--LGDGHLLNFL 557
W V+IF LP L + E + E SV+ + GI + CA + + +++
Sbjct: 494 FWGSNEVQIFLLPGLKRV-GESVIQEPSAASVVYLSNFGIPNVPDRCAPRWSNCYTSDYI 552
Query: 558 LNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDR---PTVIYSSNKKLLYSNVN 614
+ P R+ S + + F S T S K +V
Sbjct: 553 QGRTRAQYA-------ACSPTGCRSCSYHDGSRAFRGSGHRCWETFNGSFGCKQAPFSVL 605
Query: 615 LKEVSHMCPFNSAAFPDSLAIAKEGELTIGTID---DIQKLHIRSIPLGEHPRRICHQEQ 671
+ + S F D K G + IG D Q L + + P + H E+
Sbjct: 606 VAPIGEQNGHQSKCFSD----IKSGCMRIGGSQRTWDSQYLFYQYNLGSDTPLCLAHHEE 661
Query: 672 SRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVY 731
+AI ++ ++ E E HFVR DD TF+ + Y L E SI ++ N Y
Sbjct: 662 LSKYAIGCVRPRTDVLE-ERHFVRFHDDSTFKDLGQYKLKYSEMVTSI--GVYTHGGNSY 718
Query: 732 YCVGTAYVLPEENEPTKGRILVFIV-EDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQK 790
GTA + P ENEP GRI++F E+ ++ A K+ +G V S+ +++AAI
Sbjct: 719 ILAGTAIINPGENEPLAGRIILFGQDEENMIKFKASKDVEGGVSSIKQLGARIIAAIGHG 778
Query: 791 IQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGA-- 848
I LY + R + T G+I+ + R + IVV D ++S+S+L +
Sbjct: 779 IYLYN-LGRGEVTISDPVARWERGYIVHDII-VRPNMIVVSDRLRSVSVLRFIERTSTPE 836
Query: 849 ---------------IEERARDYNANWMSAVEIL-DDDIYLGAENNFNLFTVRKNSEGAT 892
E A D +A W ++VE+L D+ + ++ + N+ T
Sbjct: 837 SHEEIETEEDSTILQFETVAMDMHAVWPTSVEVLPDNKTIIASQTDGNILT--------W 888
Query: 893 DEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHE 952
+ E G LE +H GE +++F + S G IF T G IG ++++
Sbjct: 889 ELEDGNLEPRAAFHTGEIIHKF-----IASTAKSSAGPRTVAIFVTNTGRIGTLSTVDDA 943
Query: 953 QYLFLEKLQTNLRKVIKGVGGLNHEQWRS---FNNEKKTVDAKNFLDGDLIESFLDLSRT 1009
L L +L+ L IKG+G + H +WR+ + K + DGD I+ FL+LS
Sbjct: 944 DALQLTRLEMKLGDAIKGLGNIKHPEWRAPKLLHTGTKPPPRRGVTDGDFIKKFLELSSE 1003
Query: 1010 RMDEI 1014
I
Sbjct: 1004 EAKRI 1008
>gi|401883281|gb|EJT47496.1| U2 snRNA binding protein [Trichosporon asahii var. asahii CBS 2479]
Length = 1216
Score = 191 bits (484), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 267/1174 (22%), Positives = 471/1174 (40%), Gaps = 170/1174 (14%)
Query: 7 YGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTDN 65
+G + + FR G +D++ ++++ + +++ + +V + G R T
Sbjct: 59 FGTVRNVAAFRLAGMGKDYIVLSSDSGRLSIIELVISPTPHFESLYQEVYGKSGSRRTIP 118
Query: 66 GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLY---GC 120
GQ +DP R + G L ++ + +G++ + + + L + + G
Sbjct: 119 GQFLAVDPKGRSAMFGAVEKQKLCYILNRNTEGKVYPSSPLEAHKNHTLVVNMIACDTGY 178
Query: 121 AKPTIVVLYQDNKDARHVKT---YEVALKDKDFVE---------GPWSQNNLDNGADLLI 168
P L D D+ H T Y A K F E WS+ D A+LL+
Sbjct: 179 DNPMFAALELDYGDSDHDATGEAYRAAEKHLTFYELDLGLNHVVRKWSEPT-DRRANLLV 237
Query: 169 PVP------------PPLCGVLIIGEETIVY--CSANAFKA-IPIRPSITKAYGRVDA-- 211
VP P GVL+ E+ +++ A A + IP R + G+
Sbjct: 238 QVPGGQNANTDRFDGP--GGVLVCTEDYVIWKHMDAEAHRVPIPRRRNPMAKPGQSSRGI 295
Query: 212 ----------DGSRY-LLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNA 260
GS + LL G L I HE E V L+I+ +A+++ L +
Sbjct: 296 IIVAAVTHKIKGSFFFLLQSEDGDLFKATIEHEGEDVRALRIKYFDTVPVATSLCILKSG 355
Query: 261 VVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLG----------------------- 297
+++ S +GD L + D G Y G
Sbjct: 356 YLFVASEFGDQGLYQFQSLADDDGEREWSSTDYPGFGLGEEHLPYAFFQPRPLQNLLLAD 415
Query: 298 ------PIVDFCVVDL--ERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-K 348
PI+D VV+L Q+ G + R +++G+ IN L G+
Sbjct: 416 TLSSLDPILDAQVVNLLGNASDTPQIYAACGRGPRSTFRSLKHGLDINVLVESPLPGVPN 475
Query: 349 GMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQ 408
+W+L+ S DD +D+++V+SF + T +L++ + +EE GF S TL
Sbjct: 476 AVWTLKLSEDDEYDSYIVLSFPNGTLVLSIG--ETIEEVNDTGFLSSGPTLAVQQLGSAG 533
Query: 409 LVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGI 467
L+QV +R + + R +EW +P G ++ AT N +QV++A LVY E+ +G
Sbjct: 534 LLQVHPAGLRHIRAADRV--DEWNAPAGTTIVSATTNKNQVVIALSTQELVYFELDEEGS 591
Query: 468 LTEVKHAQ-LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLG 525
L E + + L +C+ I + E + AVG + +V I SL PD L T
Sbjct: 592 LNEYQERKSLPGNATCVSIADVPEGRRRTPYLAVGC-DNQTVHIISLEPDNTLETLSLQA 650
Query: 526 GEIIPRSVLLCAFEGIS--------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQP 577
P + L S +L L +G LL +++ G L+D + LG++P
Sbjct: 651 LTAPPSDICLAEIFDTSIDKNRATMFLTIGLLNGVLLRTVVDPVDGSLSDTRLRFLGSKP 710
Query: 578 ITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAK 637
+ + + V A S R ++Y+ L + + + C ++A PD L
Sbjct: 711 PRIVRSAVHGSPAVMAFSSRTWLLYTYQDLLQTQPLIYDMLEYACNLSAAMCPDGLIGIS 770
Query: 638 EGELTIGTIDDI-QKLHIRSIPLGEHPRRICHQEQSRTF-------AICSLKNQSCAEES 689
L I TI + KL S+PL PR+ + F + S + A E
Sbjct: 771 GNTLRIFTIPRLGTKLKQDSMPLSYTPRKFVSYPYNTVFYGIEADHRVWSAQAMEKATEE 830
Query: 690 EMH-----------------------------FVRLLDDQTFEFISTYPLDTFEYGCSIL 720
+ +R++D + T LD E S+
Sbjct: 831 KKREGDEKALAVLELPPAEFGRVRAPAGHWGSLIRVIDPLQNTTLQTLELDEDEAAFSLA 890
Query: 721 SCSFSD-DSNVYYCVGTAY---VLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSL 776
C F VGT ++P+ + R+ E +L+ + + +T ++
Sbjct: 891 VCYFERMGGEPSLVVGTGVKTTLMPKGCKEGWLRVYAIRDEGKRLEFMHKTKTDDVPLAV 950
Query: 777 NAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY-VQTRGDFIVVGDLMK 835
AF G LLA + + ++LY+ G + L +C ++G + + G I+VGDL +
Sbjct: 951 AAFQGYLLAGVGKSLRLYEM-----GKKALLRKCENNGFPTGVATINVVGARIIVGDLQE 1005
Query: 836 SISLLIYKHEEGAIEER-----ARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KN 887
S +Y+ +I R A D +++AV +D D A+ N+F R +
Sbjct: 1006 STFYCVYR----SIPSRQLLIFADDSQPRFLTAVCNVDYDTVCCADKFGNIFVNRLEERV 1061
Query: 888 SEGATDEERGRL------EVVGEYHLGEFVNRFRHGSLVMRLPDSDV--GQIPTVIFGTV 939
SE D+ G + ++G + + + + GS+V L V G V++ T+
Sbjct: 1062 SEKVDDDPTGAVILHEKGFLMGSANKTDLIAHYNVGSVVTSLTKVSVAPGGRDVVVYTTI 1121
Query: 940 NGVIGVIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGD 998
+G +G + + ++ F+ L+ ++R + + G +H +R + K V +DGD
Sbjct: 1122 SGAVGALVPFISNDDVEFMTTLEMHIRSLNTSLVGRDHLAYRGYYAPVKAV-----VDGD 1176
Query: 999 LIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
L ESF L + I+ ++ +V E+ K++E+L
Sbjct: 1177 LCESFNMLPYPQQQAIAADLDRNVGEVLKKLEQL 1210
>gi|430813298|emb|CCJ29330.1| unnamed protein product [Pneumocystis jirovecii]
Length = 1197
Score = 190 bits (482), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 256/1169 (21%), Positives = 482/1169 (41%), Gaps = 174/1169 (14%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+L ++G I TL FR G +D L + ++ + +L++ +S+ ++ + + G
Sbjct: 60 LLSHDVHGIIRTLVGFRLAGTNKDHLIVGSDSGRITILEYKPDSNAF-SKVHQETYGKSG 118
Query: 61 -RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
R GQ +DP R +I + L V+ D+ L + + + L +
Sbjct: 119 VRRVVPGQYLAVDPKGRATMIASIEKNKLVYVLNRDSATNLTISSPLEAHKSCSLVFHLI 178
Query: 118 ---YGCAKPTIVVLYQDNKDA---------RHVKT----YEVALKDKDFVEGPWSQNNLD 161
G P L D +A R ++ YE+ L V WS + +D
Sbjct: 179 GMDVGYENPVFAALEVDYTEAESDPSGKAYREIQKVLTYYELDLGLNHVVR-KWS-DPVD 236
Query: 162 NGADLLIPVP-----PPLCGVLIIGEETIVYC-SANAFKAIPIRPSITKAYGRVDADGSR 215
A+LL+ VP P G L+ E +I Y IPI I G ++ +
Sbjct: 237 RKANLLVTVPGGSDGP--SGALVCTEGSIFYKHKGKKTHRIPIPTRI----GSLENSQKK 290
Query: 216 ----------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDN 259
+LL + G L + I +V LKI+ +++ +S L +
Sbjct: 291 QIIVSSVVHKMRGAFFFLLQNEDGDLFKVTIDSNDGEVESLKIKYFDTVPVSTGLSILKS 350
Query: 260 AVVYIGSSYGDSQLIK--------------------LNLQPDAKGSYVE--------VLE 291
+++ S YG+ L + L+L + SY +++
Sbjct: 351 GFLFVASEYGNHHLYQFEKLGDDNNEIEFSSVDFPVLDLNEGYEPSYFRPRSLENLLLVD 410
Query: 292 RYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIK-GM 350
++ P++D +++L + Q+ G + R +R G+ +NE + L G +
Sbjct: 411 DLNSMNPLMDSKILNLTDEDAPQIYALCGRGPRSTFRTLRYGLEVNEIVASGLPGSPTAV 470
Query: 351 WSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLV 410
W+ + ++ D +D ++V+SF++ T +L++ + +EE GF S + TL + L+
Sbjct: 471 WTTKLTSSDQYDAYIVLSFVNGTLVLSIG--ETVEEVSDTGFLSSSPTLAVQQLGDDALI 528
Query: 411 QVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD-GILT 469
QV +R + + R NEWK+P S+ T N QV++A G +VY E+ D G L
Sbjct: 529 QVHPKGIRHIQADKRV--NEWKTPQHRSIVQVTTNQRQVVVALSNGEIVYFELDDEGNLN 586
Query: 470 EVK-HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEI 528
E + Q+ ++ L I + E AVG D +VRI SL N T E L +
Sbjct: 587 EYQDRKQMTGSVTSLSIGHVPEGRQRHPFLAVGC-DDSTVRIISLEPEN--TLESLSVQA 643
Query: 529 I---PRSVLLCAFEG----ISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLR 581
+ P S+ + + + YL L +G L +L + TG+L D + LG++P+ L
Sbjct: 644 LTAPPNSLCIMSMKDGNIETFYLHIGLFNGVYLRSVLELSTGQLKDTRTRFLGSRPVKLF 703
Query: 582 TFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGEL 641
+ + +N V A S RP + Y N L + + + F+S + + + +L
Sbjct: 704 SVTIQNQLSVIALSSRPWLSYIINASLHLVPLIYDSLEYCWGFSSEQCSEGIVGIQGQDL 763
Query: 642 TIGTIDDIQK-LHIRSIPLGEHPRR-ICHQEQSRTFAICSLKNQSCAEESEMHF------ 693
I ++ + L SI L PRR I H ++ + I S N E +
Sbjct: 764 KIFMVERLDNVLKQDSISLMYTPRRFIKHPDEHIFYIIESDHNVLPLSERQKRVEGLQNG 823
Query: 694 ----------------------VRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSN-V 730
+ +LD + + ++ LD E SI +F + ++ +
Sbjct: 824 DHILLPEDIGLPRGLSGNWASCITILDPLSKKILTRIELDDNEAAFSIAMVTFKNQNDEI 883
Query: 731 YYCVGTA---YVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAI 787
+ +G+ + P+ I FI + ++L+ + E +L F G+LLA +
Sbjct: 884 FLAIGSGKNVILAPKSFSAAYISIYRFIDQGKSIELVHKTEVDDIPLALLGFQGRLLAGL 943
Query: 788 NQKIQLYKWMLRDDGTRELQSECGHHGHILA-----LYVQTRGDFIVVGDLMKSISLLIY 842
+ +++Y+ ++ +C + A + + T+G I++ D+ +SI +Y
Sbjct: 944 GKMLRIYEMGMK---------KCLRKCEVRAVPNCIVQLHTQGSRIIIADIQESIHFAVY 994
Query: 843 KHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDEERG-- 897
K+ E + A D W + +LD + + N + R + SE A ++ G
Sbjct: 995 KYLENRLIVFADDVIPRWTTTSTMLDYETVAAGDKFGNFWINRCPLEVSESADEDPSGAQ 1054
Query: 898 -------------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIG 944
RL+++ +++G+ L+ G +++ + G IG
Sbjct: 1055 LIHEKSYLFGAAKRLKMLAHFYIGDTFTSMHKVQLI-------AGGRDIIVYTGMMGSIG 1107
Query: 945 V-IASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESF 1003
+ + + E F ++L+ +R + G +H +R + V K+ +DGDL E F
Sbjct: 1108 IFLPFVGREDVDFFQQLEALMRTEDLSLIGRDHLMYRGY-----YVPVKSVVDGDLCERF 1162
Query: 1004 LDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
L L + I+ ++ + E+ K++E++
Sbjct: 1163 LMLPYNKKQVIANELDREISEIAKKIEDM 1191
>gi|58258783|ref|XP_566804.1| U2 snRNA binding protein [Cryptococcus neoformans var. neoformans
JEC21]
gi|338819361|sp|P0CR23.1|RSE1_CRYNB RecName: Full=Pre-mRNA-splicing factor RSE1
gi|338819362|sp|P0CR22.1|RSE1_CRYNJ RecName: Full=Pre-mRNA-splicing factor RSE1
gi|57222941|gb|AAW40985.1| U2 snRNA binding protein, putative [Cryptococcus neoformans var.
neoformans JEC21]
Length = 1217
Score = 190 bits (482), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 277/1182 (23%), Positives = 476/1182 (40%), Gaps = 186/1182 (15%)
Query: 7 YGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTDN 65
+G I + FR G +D++ ++ + +L++ + +V + G R
Sbjct: 60 FGTIRNIAGFRLAGMTKDYILATSDSGRLSILEFVISPTPHFESLYQEVFGKSGSRRIVP 119
Query: 66 GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YGC 120
GQ +DP R L+G L V+ + +G+L + + + L + G
Sbjct: 120 GQFLAVDPKGRSCLVGSLEKTKLVYVLNRNTEGKLYPSSPLEAHKNHTLVTHIVGVDQGY 179
Query: 121 AKPTIVVLYQDNKDA-------------RHVKTYEVALKDKDFVEGPWSQNNLDNGADLL 167
P L D ++ +H+ YE+ L V WS+ D A+LL
Sbjct: 180 DNPLYAALETDYSESDQDSTGEAYENTQKHLTFYELDLGLNHVVRK-WSEPT-DRRANLL 237
Query: 168 IPVP------------PPLCGVLIIGEETIVYCSANA-FKAIPIRPSITKAYGRVDADGS 214
+ VP P GVL+ E+ I++ + IPI R D S
Sbjct: 238 VQVPGGQNANSDRFEGP--SGVLVCTEDHIIWKHMDVEAHRIPIPRRRNPLVQR--GDKS 293
Query: 215 R-----------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYL 257
R +LL G L+ + I H E V LKI+ +A+++ L
Sbjct: 294 RGLIIVSAVMHKIKGAFFFLLQSEDGDLYKVWIEHNGEDVVALKIKYFDTVPVANSLCIL 353
Query: 258 DNAVVYIGSSYGDSQLIKLNLQPDAKG-------SYVE---------------------- 288
+Y+ S + D L + + G Y E
Sbjct: 354 KRGYIYVASEFSDQNLYQFQSLAEDDGEQEWSSTDYPENGNIDGPLPFAFFDPQPLRNLL 413
Query: 289 VLERYVNLGPIVDFCVVDL--ERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG 346
+++ +L PI D VV+L Q+ G + R +++G+ + E S L G
Sbjct: 414 LVDTVPSLDPITDAHVVNLLGASSDTPQIYAACGRGARSTFRTLKHGLDVAEMVSSPLPG 473
Query: 347 IK-GMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAI 405
+ +W+L+ + DD +D+++V+SF + T +L++ + +EE GF S TL
Sbjct: 474 VPTNVWTLKLTEDDEYDSYIVLSFPNGTLVLSIG--ETIEEVNDTGFLSSGPTLAVQQLG 531
Query: 406 YNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG- 464
L+QV +R + + R +EW +PPG ++ AT N QV++A LVY E+
Sbjct: 532 NAGLLQVHPYGLRHIRAADRV--DEWPAPPGQTIVAATTNRRQVVIALSTAELVYFELDP 589
Query: 465 DGILTEVKHAQ-LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKE 522
+G L+E + + L +C+ I + E + AVG + +V I SL PD L T
Sbjct: 590 EGSLSEYQEKKALPGNATCVTIAEVPEGRRRTSFLAVGC-DNQTVSIISLEPDSTLDTLS 648
Query: 523 HLGGEIIPRSVLLCAFEGIS--------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLG 574
P S+ L S +L L +G LL +++ G L+D + LG
Sbjct: 649 LQALTAPPTSICLAEIFDTSIDKNRATMFLNIGLMNGVLLRTVVDPVDGSLSDTRLRFLG 708
Query: 575 TQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLA 634
+P L + + V A S R ++Y+ L + + + ++A PD L
Sbjct: 709 AKPPKLVRANVQGQPSVMAFSSRTWLLYTYQDMLQTQPLIYDTLEYAWSLSAAMCPDGLI 768
Query: 635 IAKEGELTIGTIDDI-QKLHIRSIPLGEHPRR-ICHQEQS---------RTFAICSLKNQ 683
L I I + +KL S L PR+ I H S RT++ +++
Sbjct: 769 GISGNTLRIFNIPKLGEKLKQDSTALTYTPRKFISHPFNSVFYMIEADHRTYSKSAIERI 828
Query: 684 SCAEESE--------------------------MHFVRLLDDQTFEFISTYPLDTFEYGC 717
+ESE VR+LD E I T LD E
Sbjct: 829 VKQKESEGRRVDTLLLDLPANEFGRPRAPAGHWASCVRVLDPLANETIMTLDLDEDEAAF 888
Query: 718 SILSCSFS-DDSNVYYCVGTAYVLPEENEPTK-GRILVF-IVEDGK-LQLIAEKETKGAV 773
SI F + VGT + + K G + V+ I E G+ L+ + + +T
Sbjct: 889 SIAIAYFERGGGEPFLVVGTGVKTTLQPKGCKEGYLRVYAIKEQGRILEFLHKTKTDDIP 948
Query: 774 YSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY-VQTRGDFIVVGD 832
L F G LLA I + ++LY+ G + L +C ++G A+ + +G I+VGD
Sbjct: 949 LCLAGFQGFLLAGIGKSLRLYEM-----GKKALLRKCENNGFPTAVVTINVQGARIIVGD 1003
Query: 833 LMKSISLLIYKHEEGAIEER-----ARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKN 887
+ +S +Y+ +I R A D W++ V +D + + N+F R +
Sbjct: 1004 MQESTFYCVYR----SIPTRQLLIFADDSQPRWITCVTSVDYETVACGDKFGNIFINRLD 1059
Query: 888 ---SEGATDEERG------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTV---- 934
SE D+ G + ++G H E + + GS+V + + +IP V
Sbjct: 1060 PSISEKVDDDPTGATILHEKSFLMGAAHKTEMIGHYNIGSVV-----TSITKIPLVAGGR 1114
Query: 935 ---IFGTVNGVIGVIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVD 990
++ T++G +G + + + F+ L+ ++R + G +H +R + V
Sbjct: 1115 DVLVYTTISGAVGALVPFVSSDDIEFMSTLEMHMRTQDISLVGRDHIAYRGY-----YVP 1169
Query: 991 AKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
K +DGDL ESF L + I+ ++ SV ++ K++E++
Sbjct: 1170 IKGVVDGDLCESFSLLPYPKQQAIALDLDRSVGDVLKKLEQM 1211
>gi|391341059|ref|XP_003744849.1| PREDICTED: splicing factor 3B subunit 3-like isoform 2 [Metaseiulus
occidentalis]
Length = 1223
Score = 190 bits (482), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 249/1186 (20%), Positives = 481/1186 (40%), Gaps = 185/1186 (15%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
++G I +L FR G ++D+L + ++ + C+L+++ + L M R
Sbjct: 58 VFGVIRSLMSFRLTGGSKDYLIVGSDSGRICILEFNPAKNVLEKIHMETFGKSGCRRIVP 117
Query: 66 GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YGC 120
GQ +DP R +IG L ++ D+ +L + + + L + G
Sbjct: 118 GQYLAVDPKGRAVMIGAVEKQKLVYILNRDSNARLTISSPLEAHKSNTLVYHMVGVDVGF 177
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNGADLLIP 169
P + D ++A + T E A K + F E N+ L+ ++ LI
Sbjct: 178 ENPVFACIELDYEEADNDHTGEAAKKAQQSLTFYELDLGLNHVVRKYSEPLEEPSNFLIA 237
Query: 170 VPPPL---CGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR----------- 215
VP + GVL+ E I Y N IR I + +D D R
Sbjct: 238 VPGGVDGPSGVLVCSENFITY--KNLGDQADIRCPIPRRRNDLD-DPDRGMLFVCSTQHK 294
Query: 216 ------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYG 269
+L G + + + + + VT +K++ +A T+ L + +++ S +G
Sbjct: 295 TKSMFFFLAQTEQGDIFKITLEFDDDAVTEIKLKYFDSLPVAQTMHVLKSGFLFVASEFG 354
Query: 270 DSQLIKL-------------NLQPDAKGSYVEVLERYV----------NLGPIVDFCVVD 306
+ L ++ ++ P +G L R + +L PI+ V D
Sbjct: 355 NHSLYQIAHLGDNTDEPEFSSIFPLEEGDTFFFLPRELKNLVLVDEMDSLSPIMTARVAD 414
Query: 307 LERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTDDPFDTFL 365
L + Q+ G ++R++R+G+ ++E A EL G +W+++ DD +D ++
Sbjct: 415 LTNEDTPQLYAACGRGPRSTMRVLRHGLEVSEMAVSELPGNPSAVWTVKKRADDEYDAYI 474
Query: 366 VVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSR 425
VVSFI+ T +L++ + +EE GF T TL CH ++ LVQ+ +R + + R
Sbjct: 475 VVSFINATLVLSIG--ETVEEVTDSGFLGTTPTLACHQIGHDALVQIYPEGIRHIRADRR 532
Query: 426 ELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTE-VKHAQLEYEISCL 483
NEW++ + N QV++A GG L+Y E+ G L E + ++ ++ C+
Sbjct: 533 --VNEWRTSGKKLIVKCAVNQRQVVIALTGGELIYFEMDSSGQLNEYAERKEMNSDVLCM 590
Query: 484 DINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGG--EIIPRSVLL-----C 536
+ + ++ AVG +D +V + SL + ++ + G E P S+ + C
Sbjct: 591 ALGSVPAGEQRTKFLAVGS-SDGTVHVISLDPKSCLSILSVQGMTESNPESLAIVELKAC 649
Query: 537 AFEGIS---------------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLR 581
IS YL L +G L +L+ TG+++D + LG++ + L
Sbjct: 650 FLPSISVDMSGHEEGSGSSALYLNIGLQNGILKRMVLDAVTGDMSDTRTRFLGSRSVKLF 709
Query: 582 TFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGEL 641
+ T V A S R + Y + + ++ + + F S P+ + L
Sbjct: 710 KIKMQGTDAVLAMSSRCWLSYLFQNRFHLTPLSYDSLEYASGFCSEQCPEGIVAIAGNTL 769
Query: 642 TIGTIDDIQKLHIR-SIPLGEHPRRICHQEQSR------------TFAICSLKNQSCAEE 688
I ++ + + + S PL PR ++S+ T ++ + Q AEE
Sbjct: 770 RILALEKLGAVFNQLSAPLSFTPRSFVIDKKSQKLIVIETDHNAYTDSVKEKRKQHVAEE 829
Query: 689 -------------------------SEMHF-------------VRLLDDQTFEFISTYPL 710
E F +R+L+ + L
Sbjct: 830 MIEAAGEAEQEMAAELAAAFLSEDLPESQFGAPRAGKKHWASVIRILNPSDLSTVYKIQL 889
Query: 711 DTFEYGCSILSCSFSDDSNVYYCVGTAYVL---PEENEPTKGRILVFIVEDGKLQLIAEK 767
+ E S+ +F ++ VG + L P E + + + +L+L+
Sbjct: 890 EQNEAAVSVALVNFDKSTDPILLVGISKDLQLSPRECRNSFINAYRVVKDCTELELVHTT 949
Query: 768 ETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHI--LALYVQTRG 825
++ F ++L + + +++Y D G +++ +C + HI L + + G
Sbjct: 950 VMDDVPQAMCNFGNRVLIGVGRCLRIY-----DFGKKKMLRKC-ENKHIPNLIVTINAVG 1003
Query: 826 DFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR 885
+ IVVGD+ +S + Y+ E + A D+ W +A ++D +G + N++ +R
Sbjct: 1004 NRIVVGDVQESFFFIRYRMLENQLIIFADDFTPRWTTAACMVDYRTVVGGDKFGNVYILR 1063
Query: 886 --KNSEGATDE---------ERGRL-------EVVGEYHLGEFVNRFRHGSLVMRLPDSD 927
N+ DE +RG L EV+ H+GE + + +L+ P++
Sbjct: 1064 LPGNTSDDVDEDPTGVRSLWDRGWLGGAGQKAEVLSMTHVGELIVSLQKTALIPGGPEA- 1122
Query: 928 VGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEK 986
+++ T+ G +G + + + F + L+ +R + G +H +RS+
Sbjct: 1123 ------IVYTTIAGGVGALIPFSSKDDHEFFQHLEMYMRTEHPPICGRDHLSFRSYYFPV 1176
Query: 987 KTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
K V +DGDL E + L + +I+ + E+ K++E++
Sbjct: 1177 KAV-----IDGDLCEQYNSLDANKQKQIADELERLPHEVAKKLEDI 1217
>gi|406698009|gb|EKD01256.1| U2 snRNA binding protein [Trichosporon asahii var. asahii CBS 8904]
Length = 1216
Score = 190 bits (482), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 266/1174 (22%), Positives = 471/1174 (40%), Gaps = 170/1174 (14%)
Query: 7 YGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTDN 65
+G + + FR G +D++ ++++ + +++ + +V + G R T
Sbjct: 59 FGTVRNVAAFRLAGMGKDYIVLSSDSGRLSIIELVISPTPHFESLYQEVYGKSGSRRTIP 118
Query: 66 GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLY---GC 120
GQ +DP R + G L ++ + +G++ + + + L + + G
Sbjct: 119 GQFLAVDPKGRSAMFGAVEKQKLCYILNRNTEGKVYPSSPLEAHKNHTLVVNMIACDTGY 178
Query: 121 AKPTIVVLYQDNKDARHVKT---YEVALKDKDFVE---------GPWSQNNLDNGADLLI 168
P L D D+ H T Y A K F E WS+ D A+LL+
Sbjct: 179 DNPMFAALELDYGDSDHDATGEAYRAAEKHLTFYELDLGLNHVVRKWSEPT-DRRANLLV 237
Query: 169 PVP------------PPLCGVLIIGEETIVY--CSANAFKA-IPIRPSITKAYGRVDA-- 211
VP P GVL+ E+ +++ A A + IP R + G+
Sbjct: 238 QVPGGQNANTDRFDGP--GGVLVCTEDYVIWKHMDAEAHRVPIPRRRNPMAKPGQSSRGI 295
Query: 212 ----------DGSRY-LLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNA 260
GS + LL G L I HE E V L+I+ +A+++ L +
Sbjct: 296 IIVAAVTHKIKGSFFFLLQSEDGDLFKATIEHEGEDVRALRIKYFDTVPVATSLCILKSG 355
Query: 261 VVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLG----------------------- 297
+++ S +GD L + D G Y G
Sbjct: 356 YLFVASEFGDQGLYQFQSLADDDGEREWSSTDYPGFGLGEEHLPYAFFQPRPLQNLLLAD 415
Query: 298 ------PIVDFCVVDL--ERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-K 348
PI+D VV+L Q+ G + R +++G+ +N L G+
Sbjct: 416 TLSSLDPILDAQVVNLLGNASDTPQIYAACGRGPRSTFRSLKHGLDVNVLVESPLPGVPN 475
Query: 349 GMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQ 408
+W+L+ S DD +D+++V+SF + T +L++ + +EE GF S TL
Sbjct: 476 AVWTLKLSEDDEYDSYIVLSFPNGTLVLSIG--ETIEEVNDTGFLSSGPTLAVQQLGSAG 533
Query: 409 LVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGI 467
L+QV +R + + R +EW +P G ++ AT N +QV++A LVY E+ +G
Sbjct: 534 LLQVHPAGLRHIRAADRV--DEWNAPAGTTIVSATTNKNQVVIALSTQELVYFELDEEGS 591
Query: 468 LTEVKHAQ-LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLG 525
L E + + L +C+ I + E + AVG + +V I SL PD L T
Sbjct: 592 LNEYQERKSLPGNATCVSIADVPEGRRRTPYLAVGC-DNQTVHIISLEPDNTLETLSLQA 650
Query: 526 GEIIPRSVLLCAFEGIS--------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQP 577
P + L S +L L +G LL +++ G L+D + LG++P
Sbjct: 651 LTAPPSDICLAEIFDTSIDKNRATMFLTIGLLNGVLLRTVVDPVDGSLSDTRLRFLGSKP 710
Query: 578 ITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAK 637
+ + + V A S R ++Y+ L + + + C ++A PD L
Sbjct: 711 PRIVRSAVHGSPAVMAFSSRTWLLYTYQDLLQTQPLIYDMLEYACNLSAAMCPDGLIGIS 770
Query: 638 EGELTIGTIDDI-QKLHIRSIPLGEHPRRICHQEQSRTF-------AICSLKNQSCAEES 689
L I TI + KL S+PL PR+ + F + S + A E
Sbjct: 771 GNTLRIFTIPRLGTKLKQDSMPLSYTPRKFVSYPYNTVFYGIEADHRVWSAQAMEKATEE 830
Query: 690 EMH-----------------------------FVRLLDDQTFEFISTYPLDTFEYGCSIL 720
+ +R++D + T LD E S+
Sbjct: 831 KKREGDEKALAVLELPPAEFGRVRAPAGHWGSLIRVIDPLQNTTLQTLELDEDEAAFSLA 890
Query: 721 SCSFSD-DSNVYYCVGTAY---VLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSL 776
C F VGT ++P+ + R+ E +L+ + + +T ++
Sbjct: 891 VCYFERMGGEPSLVVGTGVKTTLMPKGCKEGWLRVYAIRDEGKRLEFMHKTKTDDVPLAV 950
Query: 777 NAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY-VQTRGDFIVVGDLMK 835
AF G LLA + + ++LY+ G + L +C ++G + + G I+VGDL +
Sbjct: 951 AAFQGYLLAGVGKSLRLYEM-----GKKALLRKCENNGFPTGVATINVVGARIIVGDLQE 1005
Query: 836 SISLLIYKHEEGAIEER-----ARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KN 887
S +Y+ +I R A D +++AV +D D A+ N+F R +
Sbjct: 1006 STFYCVYR----SIPSRQLLIFADDSQPRFLTAVCNVDYDTVCCADKFGNIFVNRLEERV 1061
Query: 888 SEGATDEERGRL------EVVGEYHLGEFVNRFRHGSLVMRLPDSDV--GQIPTVIFGTV 939
SE D+ G + ++G + + + + GS+V L V G V++ T+
Sbjct: 1062 SEKVDDDPTGAVILHEKGFLMGSANKTDLIAHYNVGSVVTSLTKVSVAPGGRDVVVYTTI 1121
Query: 940 NGVIGVIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGD 998
+G +G + + ++ F+ L+ ++R + + G +H +R + K V +DGD
Sbjct: 1122 SGAVGALVPFISNDDVEFMTTLEMHIRSLNTSLVGRDHLAYRGYYAPVKAV-----VDGD 1176
Query: 999 LIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
L ESF L + I+ ++ +V E+ K++E+L
Sbjct: 1177 LCESFNMLPYPQQQAIAADLDRNVGEVLKKLEQL 1210
>gi|313235544|emb|CBY10999.1| unnamed protein product [Oikopleura dioica]
Length = 1185
Score = 190 bits (482), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 251/1147 (21%), Positives = 478/1147 (41%), Gaps = 147/1147 (12%)
Query: 7 YGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDNG 66
+ I +L FR G +D++ + ++ + +L+++ + L + R G
Sbjct: 59 FSLIRSLISFRLTGGTKDYIVLGSDSGRISILEYNPDKGSLERVHCETFGKSMARRIVPG 118
Query: 67 QIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL-----YGCA 121
Q DP R I + + V + Q + + LE + I F G
Sbjct: 119 QFLAHDPKGRAIMVAACEKQKLVYILNRDAQARLTISSPLEAHKSSTIHFYPVGLDVGFE 178
Query: 122 KPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNGADLLIPV 170
P V+ D +DA T E A + K F E N+ L+ A++LIPV
Sbjct: 179 NPVFAVIELDYEDADADPTGEAAQEAKQSLTFYELDLGLNHVVRKYTDPLECAANMLIPV 238
Query: 171 P-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR---------- 215
P P GV++ E ++Y N IR I + +D D R
Sbjct: 239 PGGSDGP--GGVIVCAENYLIY--KNFGDQPDIRFPIPRRRNDLD-DPERGMIIVAHATH 293
Query: 216 -------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSY 268
+LL G L + + E++ VT ++++ ++S++ L +++ +
Sbjct: 294 KTRSMFFFLLQTEQGDLFKVTLETEEDIVTEIRLKYFDTVPVSSSLCVLRTGFLFVAGEF 353
Query: 269 GDSQLIKL-------------NLQPDAKGSYVEVLERYV----------NLGPIVDFCVV 305
G+ L ++ + +P +G R + +L P+++ V
Sbjct: 354 GNHNLYQITRLGEDDDEPEFSSAEPLEEGETFFFTPRGLRNLALTDEMDSLSPVLNCEVA 413
Query: 306 DLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTDDPFDTF 364
DL + Q+ G +LR++R+G+ ++E A EL G +W++++S D D++
Sbjct: 414 DLANEDTPQLYVTCGRGPRSTLRVLRHGLEVSEMAVSELPGNPNAVWTVKTSADADHDSY 473
Query: 365 LVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTS 424
++VSF++ T +L++ + +EE GF T TL + LVQ+ +R + S
Sbjct: 474 IIVSFVNATLVLSIG--ETVEEITDSGFLGTTPTLSSGLMGEDALVQIYPEGIRHIRSDR 531
Query: 425 RELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD-GILTE-VKHAQLEYEISC 482
R NEW++P + N QV++A GG +VY E+ G L E + + EI
Sbjct: 532 RV--NEWRAPDRKQIVRCACNRQQVVIALTGGEIVYFEMDPTGQLNEYTERREFGSEIIA 589
Query: 483 LDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPRSVLLCAFEGI 541
LD+ + + AVG+ +D +VRI SL P+ L + +P+ V + +G
Sbjct: 590 LDVGDVPAGEQRCRFLAVGL-SDGTVRIISLDPNDCLQPRTMQALPTVPQDVAITEHKGQ 648
Query: 542 SYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVI 601
L L +G LL ++ TGE++D + LGT+ + L ++ V A S R +
Sbjct: 649 YVLQIGLQNGVLLRTTIDSVTGEISDTRTRYLGTKAVKLYKVVTEGENSVLAVSSRSWLS 708
Query: 602 YSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL--HIRSIPL 659
Y ++ + ++ + + F+S P+ + L I ++ + + +R+ PL
Sbjct: 709 YRHQQRFHLTPLSYEALDSATGFSSEQCPEGIVAIAGNTLRILALEKLGAVFNQVRN-PL 767
Query: 660 GEHPRRICHQEQSRTFAIC---------SLKNQSCAEESEMHFVR-LLDDQTF------- 702
PR+ + I K Q A E F++ L + F
Sbjct: 768 QLTPRKFVIHPDTANLVIAESDHNAFTEETKKQPLAAEMAAAFLKEELPESDFGSPRAGQ 827
Query: 703 ----EFISTYPLDT--------FEYGCSILSCSFSDDSNV-----YYCVGTAYVL---PE 742
IS +T E G I S + +V Y VGT+ + P
Sbjct: 828 GMWASKISIISPETGGTLHHVWLEQGQGIHSMCLTKFQSVGMIDWYLLVGTSNSMILNPR 887
Query: 743 ENEPTKGRILVFIVEDG-KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLY----KWM 797
+++ G + V+ + G + + + T+ A ++ F G++L + +++Y K M
Sbjct: 888 QSQ--GGSLHVYRIHAGERFEYLHTTYTEDAPTAITPFQGRVLIGVGNLLRIYDLGKKKM 945
Query: 798 LRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYN 857
LR R + + + T G I+ D +S + Y+ +E + A D
Sbjct: 946 LRKCELRSIPNRVTK--------IITAGQRIIAFDQQESAFFIKYRVKENQLTIFADDTF 997
Query: 858 ANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDEERG------RLEVVGEYHLG 908
W++A +LD + A+ ++ +R + +E D+ G R + G
Sbjct: 998 PRWVTAACMLDYWTVVIADKFGSVSVIRLPAETNENTQDDPSGAKALWSRGNLNGASQKA 1057
Query: 909 EFVNRFRHGSLVMRLPDSDV--GQIPTVIFGTVNGVIGVIASLP-HEQYLFLEKLQTNLR 965
E +N F G ++ + + + G +++ T++G IG++ +E F + L+ ++R
Sbjct: 1058 ETLNNFFVGDMITSIQKTILIPGGGECIVYATMSGRIGILVPFASNEDADFFQSLEMHMR 1117
Query: 966 KVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEEL 1025
+ + G +H +R + +K++ +DGDL E+F L + + I++ M+ + E+
Sbjct: 1118 QEHTFLTGRDHLAFRGYYYPQKSI-----VDGDLCEAFSLLGQDQQKGIAEEMDRQLAEV 1172
Query: 1026 CKRVEEL 1032
K++E++
Sbjct: 1173 SKKLEDI 1179
>gi|320037168|gb|EFW19106.1| pre-mRNA-splicing factor rse1 [Coccidioides posadasii str. Silveira]
Length = 970
Score = 189 bits (481), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 232/982 (23%), Positives = 410/982 (41%), Gaps = 156/982 (15%)
Query: 176 GVLIIGEETIVYCSAN--AFKA-IPIRPSITKAYGRVDADGSRYLLGDHAGLLHLL---- 228
GVL+ E I Y +N AF+ IP R T+ + + RY+ AG++H +
Sbjct: 14 GVLVCSEGNITYRHSNQDAFRVPIPRRSGPTE-----NPERKRYIT---AGVVHKMRRAF 65
Query: 229 -----------------VITHEKEKVTG----LKIELLGETSIASTISYLDNAVVYIGSS 267
++ E ++TG LK++ +AS++ L N +++ S
Sbjct: 66 FCLLQTEDGDLFKVTMDMVEDENGQLTGEVQRLKLKYFDTVPVASSLCILKNGFLFVASE 125
Query: 268 YGDSQLIKLN-----------------------LQP----DAKGSYVEVLERYVNLGPIV 300
G+ + L P + ++E +L P++
Sbjct: 126 TGNHHFYQFEKLGDDDEETEFSSDDFSADPSEPLAPVYFRPRPAENLNLVESINSLNPLM 185
Query: 301 DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSLRSSTDD 359
+ +L Q T SG + R +++G+ ++E EL + +W+ + + +D
Sbjct: 186 SCKITNLTEDDAPQFYTLSGTGARSTFRTLKHGLEVSEIVESELPSVPSAVWTTKLTRND 245
Query: 360 PFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRL 419
+D ++++SF + T +L++ + +EE GF S TL + L+QV +R
Sbjct: 246 QYDAYIILSFSNGTLVLSIG--ETVEEVTDTGFLSSAPTLAVQQLGEDSLIQVHPKGIRH 303
Query: 420 VSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTEV-KHAQLE 477
+ + R NEW +P S+ A N QV +A G +VY E+ DG L E + ++
Sbjct: 304 IHADRR--VNEWPAPQHRSIVAAATNERQVAVALSSGEIVYFEMDTDGSLAEYDEKREMS 361
Query: 478 YEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPRSV-LL 535
++CL + I S AVG D +VRI SL PD L K P ++ ++
Sbjct: 362 GTVTCLSLGEIPPGRVRSSFLAVGC-DDSTVRILSLDPDSTLENKSVQALTSAPSALSIM 420
Query: 536 CAFEGIS-----YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTH 590
+ S YL L G L +L+ TGEL+D + LG + + L + S K
Sbjct: 421 SMVDSTSGGSTLYLHIGLYSGIYLRTVLDEVTGELSDTRTRFLGLKSVKLFSVSVKEQRA 480
Query: 591 VFAASDRPTVIYS--SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDD 648
V A S RP + YS K + + ++ + F+S + + + L I +I+
Sbjct: 481 VLALSSRPWLGYSDLQTKNFMLTPLDYVPLEWSWNFSSEQCVEGMVGIQGQNLRIFSIEK 540
Query: 649 I-QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHF----------VRLL 697
+ L +IPL PR + F + N + ++ V L
Sbjct: 541 LDNNLLQETIPLAYTPRHFVRHPEQPLFYVIEADNNILSPSTKAKLLQDSKAANGEVAEL 600
Query: 698 DDQTFEF----------------------ISTYPLDTFEYGCSILSCSF-SDDSNVYYCV 734
+ F + IS L+ E S+ + F S D + V
Sbjct: 601 PPEDFGYPRGTGHWASCIQVVDPINSKAVISRIELEENEAAVSVAAVPFSSQDDETFLVV 660
Query: 735 GTA---YVLPEENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVYSLNAFNGKLLAAINQ 789
GT V P + + G I ++ EDGK L+ I + + + ++L AF G+LLA I +
Sbjct: 661 GTGKDMVVYPPSS--SCGFIHIYRFQEDGKELEFIHKTKVESPPHALLAFQGRLLAGIGR 718
Query: 790 KIQLYKWMLRDDGTRELQSEC-GHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGA 848
+++Y D G ++L +C L + +QT+G I+V D+ +S++ ++YK++E
Sbjct: 719 NLRIY-----DLGMKQLLRKCQAEVVPRLIVGLQTQGSRIIVSDVQESVTYVVYKYQENR 773
Query: 849 IEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDEERGRLEVVGEY 905
+ A D A W + ++D + G + NL+ +R K SE A ++ G +
Sbjct: 774 LIPFADDVIARWTTCTAMVDYETVAGGDKFGNLWLLRCPQKASEEADEDGSGAHLIHERQ 833
Query: 906 HLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGT--------------VNGVIGVIAS-LP 950
+L NR SL++ D IPT I T + G +G++ +
Sbjct: 834 YLQGAPNRL---SLMVHFYPQD---IPTSIQKTQLVAGGRDILVWTGLQGTVGMLVPFVS 887
Query: 951 HEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTR 1010
E F + L+ L + G +H +RS+ AK +DGDL E++ L +
Sbjct: 888 REDVDFFQSLEMQLTSQTPPLAGRDHLIYRSY-----YAPAKGTIDGDLCETYFTLPNDK 942
Query: 1011 MDEISKTMNVSVEELCKRVEEL 1032
I+ ++ SV E+ +++ ++
Sbjct: 943 KLMIAGELDRSVREIERKISDM 964
>gi|198420618|ref|XP_002125906.1| PREDICTED: similar to Splicing factor 3B subunit 3
(Spliceosome-associated protein 130) (SAP 130)
(Pre-mRNA-splicing factor SF3b 130 kDa subunit) (SF3b130)
(STAF130) [Ciona intestinalis]
Length = 1216
Score = 189 bits (480), Expect = 8e-45, Method: Compositional matrix adjust.
Identities = 256/1190 (21%), Positives = 479/1190 (40%), Gaps = 190/1190 (15%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
++ V ++G I +L FR G +D++ ++ + +L++ ++ +
Sbjct: 53 LITVEVFGEIRSLMSFRLTGGNKDYVVCGSDSGRISILEYIPNKNKFEKIHLETFGKSGC 112
Query: 61 RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLY 118
R GQ IDP R +IG L ++ D++ +L + + + F+Y
Sbjct: 113 RRIVPGQYLAIDPKGRAVMIGALEKQKLVYILNRDSQARLTISSPLEAHKANT----FVY 168
Query: 119 -------GCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------L 160
G P L D ++A T E + F E N+ L
Sbjct: 169 HMVGIDVGFENPMFACLELDYEEADTDPTGEAVQNTQQTLTFYELDLGLNHVVRKYSEPL 228
Query: 161 DNGADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR 215
+ A+ LI VP P GVL+ E I Y N IR I + +D D R
Sbjct: 229 EERANHLISVPGGNDGP--GGVLVCAENYITY--KNFGDQPDIRTPIPRRRNDLD-DPER 283
Query: 216 -----------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLD 258
+L+ G + + + +++ VT ++++ ++ + L
Sbjct: 284 GMIVVCSATHKTKSMFFFLIQTEQGDIFKVTLETDEDMVTEIRLKYFDTVPVSMAMCVLR 343
Query: 259 NAVVYIGSSYGDSQLIKL-------------NLQPDAKGSYVEVLERYV----------N 295
+++ + G+ L ++ + P +G R + +
Sbjct: 344 TGFLFVAAEMGNHCLYQIAHLGDDDDETEFSSAMPLEEGDTFFYAPRALRNLVLVDELDS 403
Query: 296 LGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLR 354
L PI+ + DL + Q+ G SLR++R+G+ ++E A EL G +W+++
Sbjct: 404 LSPIMTCLISDLANEDTPQLYVTCGRGPRSSLRVLRHGLEVSEMAVSELPGNPNAVWTVK 463
Query: 355 SSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTS 414
++ FD++++VSF++ T +L++ + +EE GF T TL C N LVQV
Sbjct: 464 IKEEEEFDSYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSLLGENALVQVYP 521
Query: 415 GSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VK 472
+R + + R NEWK+P ++ N QV++A GG LVY E+ G L E +
Sbjct: 522 DGIRHIRADKR--VNEWKTPGKKTILRCAVNQRQVVIALTGGELVYFEMDQSGQLNEYTE 579
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGG-EIIPR 531
++ E+ C+D++ + ++ AVG+ D +VRI SL + + + P
Sbjct: 580 RKEMNSEVVCMDLSKVPPTEQRTRFLAVGL-ADNTVRIISLDPTDCLQPLSMQALPATPE 638
Query: 532 SVLLCAFEGIS------------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPIT 579
SV + G L L +G LL L+ TG+L+D + LGT+ +
Sbjct: 639 SVCIIEIGGGEDETGAQKASGGLVLNIGLQNGVLLRTTLDNVTGDLSDTRTRYLGTRAVK 698
Query: 580 LRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEG 639
L ++ + V A S R + Y + + ++ + + F S P+ +
Sbjct: 699 LFRVLTQGSDAVLAMSSRTWLSYQHQNRFHLTPLSYESLEFASGFASEQCPEGIVAISAN 758
Query: 640 ELTIGTIDDIQKL-HIRSIPLGEHPRRICHQEQSRTFAICSL------------KNQSCA 686
L I ++ + + + S L PRR S I + Q A
Sbjct: 759 TLRILALEKLGTVFNQASTGLQYTPRRFAIHYDSGNVIILETDHNAYTEETKENRKQQMA 818
Query: 687 EE-------------SEMHF-------------------------VRLLDDQTFEFISTY 708
EE +EM +RL+ ++ E I
Sbjct: 819 EEMIEAAGEDERELAAEMTAAFLRESLPEDKFGASKAGPGMWSSQIRLMHPRSGETIQKI 878
Query: 709 PLDTFEYGCSILSCSF---SDDSNVYYCVGTAYVLPEENEPTKGRILVFIV--EDGKLQL 763
L+ E SI C F SDD+ V V ++ G I + + E KL+L
Sbjct: 879 CLEQNEAALSIAICKFPNNSDDTFVLVGVACELLINPRQARGGGEIHTYKINEEGNKLEL 938
Query: 764 IAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHI--LALYV 821
+ + ++ + G++L + + +++Y D G ++L +C + HI +
Sbjct: 939 VHKTVVDEVPSAICPYQGRVLIGVGKLLRIY-----DLGKKKLLRKC-ENKHIPNYIASI 992
Query: 822 QTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNL 881
Q G I+V D+ +S+ + Y+ E + A D W+++ +LD + ++ N+
Sbjct: 993 QAVGHRIIVCDVQESVHWVRYRRHENQLVVFADDTYPRWVTSATVLDWNTVAVSDKFGNI 1052
Query: 882 FTVRKNSEGATDEE-----------RG-------RLEVVGEYHLGEFVNRFRHGSLVMRL 923
+R S+ D + RG + EV+ YH+GE V + +L+
Sbjct: 1053 SVLRLPSDVNDDVQDDPSGTKALWTRGILNGAMQKCEVLCMYHVGETVLSLQKTTLI--- 1109
Query: 924 PDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSF 982
G ++++ T++G IG++ HE + F + L+ ++R + G +H +RS+
Sbjct: 1110 ----PGGSESLVYTTLSGSIGMLVPFTSHEDHDFFQHLEMHMRNECPPLLGRDHLAYRSY 1165
Query: 983 NNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
KN +DGDL E F + ++ +++ ++ E+ K++E++
Sbjct: 1166 -----YFPVKNVIDGDLCELFNSMEASKQKSVAEELDRVPSEVSKKLEDI 1210
>gi|405117821|gb|AFR92596.1| pre-mRNA-splicing factor RSE1 [Cryptococcus neoformans var. grubii
H99]
Length = 1217
Score = 188 bits (478), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 276/1182 (23%), Positives = 476/1182 (40%), Gaps = 186/1182 (15%)
Query: 7 YGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTDN 65
+G I + FR G +D++ ++ + +L++ + +V + G R
Sbjct: 60 FGTIRNIAGFRLAGMTKDYILATSDSGRLSILEFVISPTPHFESMYQEVFGKSGSRRIVP 119
Query: 66 GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YGC 120
GQ +DP R L+G L V+ + +G+L + + + L + G
Sbjct: 120 GQFLAVDPKGRSCLVGSLEKTKLVYVLNRNTEGKLYPSSPLEAHKNHTLVTHVVGVDQGY 179
Query: 121 AKPTIVVLYQDNKDA-------------RHVKTYEVALKDKDFVEGPWSQNNLDNGADLL 167
P L D ++ +H+ YE+ L V WS+ D A+LL
Sbjct: 180 DNPLYAALEIDYSESDQDPTGEAYENTQKHLTFYELDLGLNHVVRK-WSEPT-DRRANLL 237
Query: 168 IPVP------------PPLCGVLIIGEETIVYCSANA-FKAIPIRPSITKAYGRVDADGS 214
+ VP P GVL+ E+ I++ + IPI R D S
Sbjct: 238 VQVPGGQSANSDRFEGP--SGVLVCTEDHIIWKHMDVEAHRIPIPRRRNPLVQR--GDKS 293
Query: 215 R-----------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYL 257
R +LL G L+ + I H E V LKI+ +A+++ L
Sbjct: 294 RGLIIVSAVMHKIKGAFFFLLQSEDGDLYKVWIEHNGEDVVALKIKYFDTVPVANSLCIL 353
Query: 258 DNAVVYIGSSYGDSQLIKLNLQPDAKG-------SYVE---------------------- 288
+Y+ S + D L + + G Y E
Sbjct: 354 KRGYIYVASEFSDQNLYQFQSLAEDDGEQEWSSTDYPENGNIDGPLPFAFFDPQPLRNLL 413
Query: 289 VLERYVNLGPIVDFCVVDL--ERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG 346
+++ +L PI D VV+L Q+ G + R +++G+ + E S L G
Sbjct: 414 LVDTVPSLDPITDAHVVNLLGASSDTPQIYAACGRGARSTFRTLKHGLDVAEMVSSPLPG 473
Query: 347 IK-GMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAI 405
+ +W+L+ + DD +D+++V+SF + T +L++ + +EE GF S TL
Sbjct: 474 VPTNVWTLKLTEDDEYDSYIVLSFPNGTLVLSIG--ETIEEVNDTGFLSSGPTLAVQQLG 531
Query: 406 YNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG- 464
L+QV +R + + R +EW +PPG ++ AT N QV++A LVY E+
Sbjct: 532 NAGLLQVHPYGLRHIRAADRV--DEWPAPPGQTIVAATTNQRQVVIALSTAELVYFELDP 589
Query: 465 DGILTEVKHAQ-LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKE 522
+G L+E + + L +C+ I + E + AVG + +V I SL PD L T
Sbjct: 590 EGSLSEYQEKKALPGNATCVTIAEVPEGRRRTPFLAVGC-DNQTVSIISLEPDSTLDTLS 648
Query: 523 HLGGEIIPRSVLLCAFEGIS--------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLG 574
P S+ L S +L L +G LL +++ G L+D + LG
Sbjct: 649 LQALTAPPTSICLAEIFDTSIDKNRATMFLNIGLMNGVLLRTVVDPVDGSLSDTRLRFLG 708
Query: 575 TQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLA 634
+P L + + V A S R ++Y+ L + + + ++A PD L
Sbjct: 709 AKPPKLVRANVQGQPSVMAFSSRTWLLYTYQDMLQTQPLIYDTLEYAWSLSAAMCPDGLI 768
Query: 635 IAKEGELTIGTIDDI-QKLHIRSIPLGEHPRR-ICHQEQS---------RTFAICSLKNQ 683
L I +I + +KL S L PR+ I H S RT++ +++
Sbjct: 769 GISGNTLRIFSIPKLGEKLKQDSTALTYTPRKFISHPFNSVFYMIEADHRTYSKSAIERI 828
Query: 684 SCAEESE--------------------------MHFVRLLDDQTFEFISTYPLDTFEYGC 717
+ESE VR+LD E I T LD E
Sbjct: 829 VKQKESEGRRVDTLLLDLPANEFGRPRAPAGHWASCVRVLDPLANETIMTLDLDEDEAAF 888
Query: 718 SILSCSFS-DDSNVYYCVGTAYVLPEENEPTK-GRILVF-IVEDGK-LQLIAEKETKGAV 773
SI F + VGT + + K G + V+ I E G+ L+ + + +T
Sbjct: 889 SIAIAYFERGGGEPFLVVGTGVKTTLQPKGCKEGYLRVYAIKEQGRVLEFLHKTKTDDIP 948
Query: 774 YSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY-VQTRGDFIVVGD 832
L F G LLA + + ++LY+ G + L +C ++G A+ + +G I+VGD
Sbjct: 949 LCLAGFQGFLLAGVGKSLRLYEM-----GKKALLRKCENNGFPTAVVTINVQGARIIVGD 1003
Query: 833 LMKSISLLIYKHEEGAIEER-----ARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKN 887
+ +S +Y+ +I R A D W++ V +D + + N+F R +
Sbjct: 1004 MQESTFYCVYR----SIPTRQLLIFADDSQPRWITCVTSVDYETVACGDKFGNIFINRLD 1059
Query: 888 ---SEGATDEERG------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTV---- 934
SE D+ G + ++G H E + + GS+V + + +IP V
Sbjct: 1060 PSISEKVDDDPTGATILHEKSFLMGAAHKTEMIAHYNIGSVV-----TSITKIPLVAGGR 1114
Query: 935 ---IFGTVNGVIGVIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVD 990
++ T++G +G + + + F+ L+ ++R G +H +R + V
Sbjct: 1115 DVLVYTTISGAVGALVPFVSSDDIEFMSTLEMHMRTQDISPVGRDHIAYRGY-----YVP 1169
Query: 991 AKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
K +DGDL ESF L + I+ ++ SV ++ K++E++
Sbjct: 1170 IKGVVDGDLCESFSLLPYPKQQAIATDLDRSVGDVLKKLEQM 1211
>gi|164429062|ref|XP_957282.2| pre-mRNA splicing factor RSE1 [Neurospora crassa OR74A]
gi|157072394|gb|EAA28046.2| pre-mRNA splicing factor RSE1 [Neurospora crassa OR74A]
Length = 1192
Score = 188 bits (477), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 276/1164 (23%), Positives = 475/1164 (40%), Gaps = 178/1164 (15%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
I+G + + FR G +D++ +AT+ + ++++ ++++ + R
Sbjct: 64 IFGIVRAIASFRLAGSHKDYIILATDSGRITIIEYLPKTNKFQRIHLETFGKSGVRRVIP 123
Query: 66 GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YGC 120
GQ DP R LI + L V+ +++ +L + + + VL + + G
Sbjct: 124 GQYLAADPKGRACLISALEKNKLVYVLNRNSQAELTISSPLEAHKPGVLVLSLVALDVGY 183
Query: 121 AKPTIVVLYQDNKDARHVKT-------------YEVALKDKDFVEGPWSQNNLDNGADLL 167
A P L D DA T YE+ L V WS + +D + LL
Sbjct: 184 ANPVFAALELDYTDADQDPTGQAREEVETQLVYYELDLGLNHVVRK-WS-DTVDRTSSLL 241
Query: 168 IPVP-----PPLCGVLIIGEETIVYCSAN--AFK-AIPIRPSITK--AYGRVDADGSRYL 217
VP P GVL+ GEE + Y +N AF+ IP R T+ RV G +
Sbjct: 242 FQVPGGNDGP--SGVLVCGEENVTYRHSNQEAFRVPIPRRSGATEDPQRKRVIVSGVMHK 299
Query: 218 LGDHAGLLHLLVITHEKE------------------KVTGLKIELLGETSIASTISYLDN 259
L AG L+ T + + +V LKI+ +A+++ L +
Sbjct: 300 LKGSAGAFFFLLQTDDGDLFKVTIDMIEDSDGNPTGEVKRLKIKYFDTIPVATSLCILKS 359
Query: 260 AVVYIGSSYGDSQLIKL---------------NLQPDAKGSYVEV------LERYV---- 294
++ S +G+ + + D SY V LE V
Sbjct: 360 GFLFAASEFGNHHFYQFEKLGDDDEELEFSSDDFPTDPTASYNPVYFHPRPLENLVLVES 419
Query: 295 --NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMW 351
++ P VD V +L + Q+ + G + R++++G+ ++E + EL G +W
Sbjct: 420 IDSMNPQVDCKVANLTGEDAPQIYSVCGNGARSTFRMLKHGLEVSEIVASELPGTPSAVW 479
Query: 352 SLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQ 411
+ + + D I ET +EE GF + TL + L+Q
Sbjct: 480 TTKLTKYD---------HIGET----------VEEVSDSGFLTTAPTLAVQQMGEDGLIQ 520
Query: 412 VTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTE 470
V +R + NEW +P S+ ATAN +QV++A G +VY E+ DG L E
Sbjct: 521 VHPKGIRHIVQGRV---NEWPAPQHRSIVAATANENQVVIALSSGEIVYFEMDSDGSLAE 577
Query: 471 V-KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEI 528
+ ++ ++ L + + E S AVG D +VRI SL PD L K
Sbjct: 578 YDEKKEMSGTVTSLSVGQVPEGLKRSSFLAVGC-DDCTVRILSLDPDSTLEMKSIQALTA 636
Query: 529 IPRSVLLCAFE-----GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTF 583
P ++ + + E YL L G L +L+ TGELTD ++ LG +P L
Sbjct: 637 APSALSIMSMEDSFGGSTLYLHIGLHSGVYLRTVLDEVTGELTDTRQKFLGPKPTRLFQV 696
Query: 584 SSKNTTHVFAASDRPTVIYSS--NKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGEL 641
S ++ V A S RP + Y+ K + + ++ E+ + F+S + + L
Sbjct: 697 SVQDQPCVLALSSRPWLGYTDPLTKGFMMTPLSYTELEYGWNFSSEQCLEGMVGIHANYL 756
Query: 642 TIGTIDDIQKLHI-RSIPLGEHPRRIC-HQEQSRTFAICSLKN-------QSCAEESEMH 692
I +I+ + I +SIPL P+ + H EQ + I S N E+
Sbjct: 757 RIFSIEKLGDNMIQKSIPLTYTPKHLVKHPEQPYFYTIESDNNTLPPELRAKLLEQQSNG 816
Query: 693 FVRLLDDQTFEF----------------ISTYP-------LDTFEYGCSILSCSF-SDDS 728
+L + F + IS P LD E S F S +
Sbjct: 817 DATVLPPEDFGYPRAKGRWASCISIIDPISEEPRVLQRIDLDNNEAAVSAAIVPFASQEG 876
Query: 729 NVYYCVGTAY-VLPEENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVYSLNAFNGKLLA 785
+ VGT ++ + + T+G I V+ EDG+ L+ I + + +L F G+LLA
Sbjct: 877 ESFLVVGTGKDMVLDPRQFTEGYIHVYRFHEDGRDLEFIHKTRVEEPPLALIPFQGRLLA 936
Query: 786 AINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHE 845
+ + +++Y L+ R+ Q++ L + +Q++G+ I+VGDL + I+ ++YK E
Sbjct: 937 GVGKTLRIYDLGLK-QLLRKAQADV---TPTLIVSLQSQGNRIIVGDLQQGITYVVYKAE 992
Query: 846 EGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEY 905
+ A D W + ++D + G + N++ VR + + + E E
Sbjct: 993 GNRLIPFADDTLNRWTTCTTMVDYESVAGGDKFGNIYIVRCPERVSQETD----EPGSEI 1048
Query: 906 HLGEFVNRFRHGS----------LVMRLPDS------DVGQIPTVIFGTVNGVIGV-IAS 948
HL N + HG+ LP S VG +++ + G +GV I
Sbjct: 1049 HLMHARN-YLHGTPNRLSLQVHFYTQDLPTSICKTSLVVGGQDVLLWSGLQGTVGVFIPF 1107
Query: 949 LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSR 1008
+ E F + L+ ++R + G +H +R + K V +DGDL E F L
Sbjct: 1108 VSREDVDFFQNLENHMRAEDPPLAGRDHLIYRGYYTPVKGV-----IDGDLCERFSLLPN 1162
Query: 1009 TRMDEISKTMNVSVEELCKRVEEL 1032
+ I+ ++ SV E+ +++ ++
Sbjct: 1163 DKKQMIAGELDRSVREIERKISDI 1186
>gi|336469942|gb|EGO58104.1| pre-mRNA splicing factor RSE1 [Neurospora tetrasperma FGSC 2508]
Length = 1192
Score = 188 bits (477), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 276/1164 (23%), Positives = 475/1164 (40%), Gaps = 178/1164 (15%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
I+G + + FR G +D++ +AT+ + ++++ ++++ + R
Sbjct: 64 IFGIVRAIASFRLAGSHKDYIILATDSGRITIIEYLPKTNKFQRIHLETFGKSGVRRVIP 123
Query: 66 GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YGC 120
GQ DP R LI + L V+ +++ +L + + + VL + + G
Sbjct: 124 GQYLAADPKGRACLISALEKNKLVYVLNRNSQAELTISSPLEAHKPGVLVLSLVALDVGY 183
Query: 121 AKPTIVVLYQDNKDARHVKT-------------YEVALKDKDFVEGPWSQNNLDNGADLL 167
A P L D DA T YE+ L V WS + +D + LL
Sbjct: 184 ANPVFAALELDYTDADQDPTGQAREEVETQLVYYELDLGLNHVVRK-WS-DTVDRTSSLL 241
Query: 168 IPVP-----PPLCGVLIIGEETIVYCSAN--AFK-AIPIRPSITK--AYGRVDADGSRYL 217
VP P GVL+ GEE + Y +N AF+ IP R T+ RV G +
Sbjct: 242 FQVPGGNDGP--SGVLVCGEENVTYRHSNQEAFRVPIPRRSGATEDPQRKRVIVAGVMHK 299
Query: 218 LGDHAGLLHLLVITHEKE------------------KVTGLKIELLGETSIASTISYLDN 259
L AG L+ T + + +V LKI+ +A+++ L +
Sbjct: 300 LKGSAGAFFFLLQTDDGDLFKVTIDMIEDSDGNPTGEVKRLKIKYFDTIPVATSLCILKS 359
Query: 260 AVVYIGSSYGDSQLIKL---------------NLQPDAKGSYVEV------LERYV---- 294
++ S +G+ + + D SY V LE V
Sbjct: 360 GFLFAASEFGNHHFYQFEKLGDDDEELEFSSDDFPTDPTASYNPVYFHPRPLENLVLVES 419
Query: 295 --NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMW 351
++ P VD V +L + Q+ + G + R++++G+ ++E + EL G +W
Sbjct: 420 IDSMNPQVDCKVANLTGEDAPQIYSVCGNGARSTFRMLKHGLEVSEIVASELPGTPSAVW 479
Query: 352 SLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQ 411
+ + + D I ET +EE GF + TL + L+Q
Sbjct: 480 TTKLTKYD---------HIGET----------VEEVSDSGFLTTAPTLAVQQMGEDGLIQ 520
Query: 412 VTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTE 470
V +R + NEW +P S+ ATAN +QV++A G +VY E+ DG L E
Sbjct: 521 VHPKGIRHIVQGRV---NEWPAPQHRSIVAATANENQVVIALSSGEIVYFEMDSDGSLAE 577
Query: 471 V-KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEI 528
+ ++ ++ L + + E S AVG D +VRI SL PD L K
Sbjct: 578 YDEKKEMSGTVTSLSVGQVPEGLKRSSFLAVGC-DDCTVRILSLDPDSTLEMKSIQALTA 636
Query: 529 IPRSVLLCAFE-----GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTF 583
P ++ + + E YL L G L +L+ TGELTD ++ LG +P L
Sbjct: 637 APSALSIMSMEDSFGGSTLYLHIGLHSGVYLRTVLDEVTGELTDTRQKFLGPKPTKLFQV 696
Query: 584 SSKNTTHVFAASDRPTVIYSS--NKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGEL 641
S ++ V A S RP + Y+ K + + ++ E+ + F+S + + L
Sbjct: 697 SVQDQPCVLALSSRPWLGYTDPLTKGFMMTPLSYTELEYGWNFSSEQCLEGMVGIHANYL 756
Query: 642 TIGTIDDIQKLHI-RSIPLGEHPRRIC-HQEQSRTFAICSLKN-------QSCAEESEMH 692
I +I+ + I +SIPL P+ + H EQ + I S N E+
Sbjct: 757 RIFSIEKLGDNMIQKSIPLTYTPKHLVKHPEQPYFYTIESDNNTLPPELRAKLLEQQSNG 816
Query: 693 FVRLLDDQTFEF----------------ISTYP-------LDTFEYGCSILSCSF-SDDS 728
+L + F + IS P LD E S F S +
Sbjct: 817 DATVLPPEDFGYPRAKGRWASCISIIDPISEEPRVLQRIDLDNNEAAVSAAIVPFASQEG 876
Query: 729 NVYYCVGTAY-VLPEENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVYSLNAFNGKLLA 785
+ VGT ++ + + T+G I V+ EDG+ L+ I + + +L F G+LLA
Sbjct: 877 ESFLVVGTGKDMVLDPRQFTEGYIHVYRFHEDGRDLEFIHKTRVEEPPLALIPFQGRLLA 936
Query: 786 AINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHE 845
+ + +++Y L+ R+ Q++ L + +Q++G+ I+VGDL + I+ ++YK E
Sbjct: 937 GVGKTLRIYDLGLK-QLLRKAQADV---TPTLIVSLQSQGNRIIVGDLQQGITYVVYKAE 992
Query: 846 EGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEY 905
+ A D W + ++D + G + N++ VR + + + E E
Sbjct: 993 GNRLIPFADDTLNRWTTCTTMVDYESVAGGDKFGNIYIVRCPERVSQETD----EPGSEI 1048
Query: 906 HLGEFVNRFRHGS----------LVMRLPDS------DVGQIPTVIFGTVNGVIGV-IAS 948
HL N + HG+ LP S VG +++ + G +GV I
Sbjct: 1049 HLMHARN-YLHGTPNRLSLQVHFYTQDLPTSICKTSLVVGGQDVLLWSGLQGTVGVFIPF 1107
Query: 949 LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSR 1008
+ E F + L+ ++R + G +H +R + K V +DGDL E F L
Sbjct: 1108 VSREDVDFFQNLENHMRAEDPPLAGRDHLIYRGYYTPVKGV-----IDGDLCERFSLLPN 1162
Query: 1009 TRMDEISKTMNVSVEELCKRVEEL 1032
+ I+ ++ SV E+ +++ ++
Sbjct: 1163 DKKQMIAGELDRSVREIERKISDI 1186
>gi|134113697|ref|XP_774433.1| hypothetical protein CNBG0790 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|50257071|gb|EAL19786.1| hypothetical protein CNBG0790 [Cryptococcus neoformans var.
neoformans B-3501A]
Length = 1276
Score = 188 bits (477), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 222/912 (24%), Positives = 402/912 (44%), Gaps = 125/912 (13%)
Query: 186 VYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIEL- 244
V+ S F + I ++ + +G G+ ++GD G EK G +
Sbjct: 353 VWRSRQGFGTV-IAATVIEDHG----SGASVVIGDEYGAFTAFGWEFEKGSGAGTDGRVR 407
Query: 245 -----LGETSIASTISYLDNAVVYIGSSYGDSQLIKLNL--------------------- 278
LG +S S+I+YLD++ +++ S+ DS L++L
Sbjct: 408 VLRTYLGASSPPSSITYLDSSHLFVSSAVADSVLLRLPKVESSSSVSSGKGKGRAVTSPI 467
Query: 279 ----------------QPDAKGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAY 322
+ D G E+LER++N+ P+ D CVV E +V SGA
Sbjct: 468 GDQADKWEVLYEIGKDRNDTDGGP-EILERWMNIAPVKDLCVVKDEGGNLSHLVLASGAS 526
Query: 323 KDGSLRIVRNGIGINEQASVE-LQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLE 381
+ SLR+VR+G+G+ E +++ L ++ MW L ST P L++S + T + + L+
Sbjct: 527 ESNSLRVVRSGVGLEELVTIQGLHDVQKMWPLTDSTAIPR---LLLS--TSTSTILLQLQ 581
Query: 382 DELEETEIEGFCSQTQTLFCHDAIYNQLV-QVTSGSVRLVSSTS-RELRNEWKSPPGYSV 439
E+ I ++TL +L+ QVT + L S S +L + + +
Sbjct: 582 PEISAIPITDVIFNSETLAAGILPGAELLAQVTPRGLSLWSDLSVGQLEAQVEVDKETEI 641
Query: 440 NVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDINPIGENP------- 492
A A ++A GG LV + + + ++ E+S + I+ +
Sbjct: 642 VCAQVTADWAVVAKKGGSLVVFHVSNTGFSPQGTIDVKEEVSAVAISSSSGSSSPIIVIS 701
Query: 493 SYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGH 552
+++ V M + IS + L I E + RS C G LL L +G
Sbjct: 702 TWTAKTFVYMLSQISNGVDGLS----IQSESSATSLQLRSHPFCP--GGIQLLSGLDNGL 755
Query: 553 LLNFLLNM-KTGE-----LTDRKKVSLGTQPITLRTFSSKNTTHVFAA---SDRPTVIYS 603
L + LN ++GE + K SLG +P+ L S + + ++R +VI+
Sbjct: 756 LHIYDLNTSESGEAEGLTVKSSKTTSLGLRPLVLHPCESTHGDEKVISVGLTERMSVIFE 815
Query: 604 SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG-EH 662
S ++ +S+VN+K V ++++ P ++ L++ ++ ++KLH+++ G E
Sbjct: 816 SKDRIEFSSVNIKNVMAATSVDTSSGPVFALFSRTSGLSLVKVNSLKKLHVQTCDTGNES 875
Query: 663 PRRICHQEQSRTFAICSLKNQSC--AEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSIL 720
++ + ++ + A S + + E +FV++ D + E +S++ L E S+
Sbjct: 876 ISKLTYMDEYKAIACGSTRRTQLRDGDVKEENFVQIRDGTSLEPLSSFSLRGRELVTSLR 935
Query: 721 SCSFSDDSNVYYCVGTAYVLPEENEPT-----------KGRILVFIVEDGK------LQL 763
S + +Y VGTA++ P++ E + +GR+L+ ++G +++
Sbjct: 936 SVFLT--GRIYLAVGTAFLPPDDGEDSSWDEGNLAVVREGRVLLLEFKEGDAGGGWDIKI 993
Query: 764 IAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQ-SECGHHGHILAL--- 819
AE T GAVY+L +G L A K+ +++ D EL+ + ++++
Sbjct: 994 KAELATVGAVYALEEIHGFLAVAAGSKLTIHR---LDHNPVELEETSSWASAYVISSLSV 1050
Query: 820 ----YVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDD--DIYL 873
+++ G IV GD M+S+ +L +G I + R+ + ++A+ +L D D +
Sbjct: 1051 LPPSHIRPEGALIV-GDGMRSVIVLNVDEGDGMIYDDERNMATHGVTALGLLKDKGDGVV 1109
Query: 874 GAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPT 933
++ NL T R N +LE + L E V RF+ GSLV ++ IP
Sbjct: 1110 ISDAYSNLLTYRLNQ---------KLERAATFGLHEEVTRFQSGSLVPTTTAPEI-IIPD 1159
Query: 934 VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKN 993
V+F T G +G+I L L+ LQ N+ K+ KG G + WR + D
Sbjct: 1160 VLFATREGRLGIIGELGTRSSRTLDDLQRNMSKIWKGPGEVGWSNWRRAGSNLVGKDTAG 1219
Query: 994 FLDGDLIESFLD 1005
F+DGD ++ FLD
Sbjct: 1220 FVDGDFVQKFLD 1231
>gi|401407861|ref|XP_003883379.1| putative Splicing factor 3B subunit 3 [Neospora caninum Liverpool]
gi|325117796|emb|CBZ53347.1| putative Splicing factor 3B subunit 3 [Neospora caninum Liverpool]
Length = 1233
Score = 187 bits (475), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 257/1198 (21%), Positives = 456/1198 (38%), Gaps = 198/1198 (16%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
++G I ++ FR G +D+L I ++ + ++Q+ AE +E R + + G R
Sbjct: 58 VFGIIRSIAAFRLTGANRDYLAIGSDSGRLVIVQFSAEKNEF-ERVHCETYGKTGIRRVV 116
Query: 65 NGQIGIIDPDCRLIGLHLYD--GLFKVIPFDNKGQLKEAFNIRLEELQVL--DIKFL-YG 119
G+ +DP R + + + ++ DNK QL + + + + D+ + G
Sbjct: 117 PGEYLAVDPKGRTLMVAAVERQKFVYIVNRDNKAQLTISSPLEAHKSHAICHDLCGIDMG 176
Query: 120 CAKPTIVVLYQ--DNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGV 177
P L Q ++ D + K E N++ A L PVPP +
Sbjct: 177 FDNPLFASLEQNVESTDRKPATPGVTVPKGVCLWEMDLGLNHVIKKATL--PVPPSAHCL 234
Query: 178 LIIGEET-------IVYCSANAFK-----------AIPIRPSITKAYGRVDADGSRYLLG 219
+ + ++ C N AIP R G V + + +
Sbjct: 235 IPVPGGGGADGPSGVLVCCGNFLLYKKPDHEEVSCAIPRRLETGSDRGLVVVAFAVHRMK 294
Query: 220 D--------HAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDS 271
D G ++ + I+H++ V + +A+ + L + +++ S +G+
Sbjct: 295 DFFFILIQTEYGDIYKIEISHDEGVVREIVCRYFDTVPVANALCVLKSGYLFVASEFGNH 354
Query: 272 QLIKLN------LQPDAKGSY-----------------VEVLERYVNLGPIVDFCVVDLE 308
+ P S+ + +++ +L PI D V+D +
Sbjct: 355 LFYQFTGIGSDASDPRCSSSHPLGREAIIAFKPRPLKNLALVDELQSLSPITDLKVLDAQ 414
Query: 309 RQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTDDPFDTFLVV 367
G QV G +LRI+++G+G+ E A EL G + +W+ + S FD +++V
Sbjct: 415 GTGAPQVYVLCGKGPRSALRILQHGLGVEEMADNELPGRARAVWTTKLSHQSAFDGYIIV 474
Query: 368 SFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSREL 427
+F E L + + D +EE F + +L + +QV +R + + R
Sbjct: 475 AF--EGSSLVLQIGDTVEEVTDSLFLTNVSSLLVALMYDDSFIQVHETGIRHILKSKRV- 531
Query: 428 RNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDG-ILTEVKHAQLEYEISCLDIN 486
NEW++P G + A AN Q++++ GG L+ E+ + L E + E +C+ +
Sbjct: 532 -NEWRTPGGRRIKAADANERQLVISLAGGELILFEVDESHTLVETARRNINVETTCMSMQ 590
Query: 487 PIGENPSYSQIAAVGMWTDISVRIFSL-PDLNL--ITKEHLGGEIIPRSVLLCAFEGIS- 542
I + + AVG D VRI SL D NL + + L P SV L G+
Sbjct: 591 AIPKGRLRASFLAVG-GLDNMVRILSLEKDRNLRQLATQLLPNNATPESVCLATLTGLGA 649
Query: 543 --------------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNT 588
YL L G ++ +++ G L D++ LG + + + +
Sbjct: 650 HGKDAGKSEDNGVLYLHVGLNTGVMIRSVVDPVLGTLLDQRSRFLGGRAVRFHAVTLQGQ 709
Query: 589 THVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDD 648
+ A S++ + Y+ KL + +N + + F S D G L I
Sbjct: 710 PAILALSEKSWLCYTFQHKLHCTPLNYDPLECVASFCSEQCTDGFVAIAGGSLRIFRCQR 769
Query: 649 IQKLHIRS-IPLGEHPRRICH--------------------QEQSRTFAICSLKNQSCAE 687
+ + ++ +PL PR + ++ AI + + E
Sbjct: 770 LGETFSQTLLPLSFTPRAMAALPHISATESQGAVDAAGVGPSRRAAALAIVEADHNAYDE 829
Query: 688 ESEMHFVRLL----------------DDQTFEFISTYPLDTFEYGC-------------- 717
++ R L DD E + L +YG
Sbjct: 830 STKAEIRRALRGIKVNQEEEEEKEETDDMQLEEKEQHDLPEDQYGTFKAGPGKWGSCIRI 889
Query: 718 -------SILSCSFSDDSNVYYC------------VGT--AYVLPEENEPTKGRILVFIV 756
+I S D C VGT A L P + I VF
Sbjct: 890 VNPLMAMTIDKVSLETDEAALSCCYCEMEGLPLLVVGTVTAMTLKPRKVP-QASIKVFSY 948
Query: 757 EDG-KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGH 815
+D L L+ + +L F G LLA + K++LY G + L +C +
Sbjct: 949 DDKFSLNLVHSTPVEDYPMALAPFRGMLLAGVGHKLRLYAL-----GKKRLLKKCEYKNL 1003
Query: 816 ILAL-YVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG 874
+ +++ GD + VGDL +S+ ++ Y+ E A D W++ E+LD ++
Sbjct: 1004 PCGVAFIRVAGDRLFVGDLRESVHVMRYRLSENLFYVLADDVVPRWLTKGEVLDYHTFVA 1063
Query: 875 AENNFNLFTVRKNSEGATDE--------ERG----------RLEVVGEYHLGEFVNRFRH 916
A+ ++F R SE DE RG +L+ + +H+GE V
Sbjct: 1064 ADKFDSVFICRVPSEAKQDELGDTTGLRLRGDTTYLTDKCFKLQSLLHFHIGEVVTALER 1123
Query: 917 GSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASL--PHEQYLFLEKLQTNLRKVIKGVGGL 974
+L G ++I+GT+ G IG + HE LF L+ LR +GG
Sbjct: 1124 ATLT-------AGASESIIYGTIMGSIGAFSPFLTKHELDLFTH-LEMVLRSEKPPLGGR 1175
Query: 975 NHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
H +RS+ + AKN +DGDL ES+ L I++ + ++ K +E++
Sbjct: 1176 EHIMFRSYYHP-----AKNTVDGDLCESYALLPYDVQKRIAQDFEKTPADILKHLEDI 1228
>gi|328853180|gb|EGG02320.1| hypothetical protein MELLADRAFT_44871 [Melampsora larici-populina
98AG31]
Length = 1210
Score = 187 bits (475), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 261/1180 (22%), Positives = 486/1180 (41%), Gaps = 189/1180 (16%)
Query: 7 YGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDNG 66
+G I +L FR G +D++ + ++ + VL++D + + R G
Sbjct: 60 FGTIRSLTSFRLTGGTKDYIILGSDSGRIVVLEFDPVLNAFVKVHQETYGKSGARRVVPG 119
Query: 67 QIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLK-----EAF--NIRLEELQVLDIKFL 117
Q DP R ++ L ++ D+ L EA N + + +D+ F
Sbjct: 120 QYLATDPKGRAVMVAAMEKSKLVYILNRDSAANLTISSPLEAHKSNAIIHHIVGVDVGF- 178
Query: 118 YGCAKPTIVVLYQD-------------NKDARHVKTYEVALKDKDFVEGPWSQNNLDNGA 164
P L D N + + YE+ L V WS+ + A
Sbjct: 179 ---ENPLFAALEVDYGESDQDHTGEAFNSAEKMLTYYELDLGLNHVVRK-WSEPT-EPRA 233
Query: 165 DLLIPVP-------------PPLCGVLIIGEETIVYCSANAFK---AIPIRPS------- 201
++L+ VP P GVL+ E+ I+Y N + IP R +
Sbjct: 234 NMLLQVPGGQSATHADKFDGP--SGVLVCCEDMIIYKHQNVKEHRVPIPKRNTPFNDVER 291
Query: 202 ---ITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLD 258
I A +L+ G L + I HE E+V LKI+ ++S+++ L
Sbjct: 292 GVIIVAAVMHKMRGAFFFLVQTEEGDLFKVTIDHEDEEVQALKIKYFDTVPVSSSLTILK 351
Query: 259 NAVVYIGSSYG------------------------------DSQLIKLNLQPDAKGSYVE 288
+ +++ + G D L K +P + V
Sbjct: 352 SGFLFVAAEMGNHYLYQFEKLGDDDDETEFSSEQYPDNGCSDEPLPKAFFKPRPLQNLV- 410
Query: 289 VLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIK 348
+ + +L PI+D +L Q++T G SLRI+R+G+ ++E + +L G
Sbjct: 411 LSDELESLAPIIDAKAANLLNTDSPQILTSCGRGSRSSLRILRHGLDVSEIVTSDLPGPP 470
Query: 349 -GMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYN 407
+W+ + + DD FD ++++ F++ T +L++ + + E GF + + T+ N
Sbjct: 471 TNVWTTKLNDDDLFDRYIILGFLNATLVLSIG--ETIVEVSDTGFLTNSPTISIQQLDKN 528
Query: 408 QLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DG 466
L+Q+ +R + EWK P G + V+T+N QV++ GG L+Y E+ +G
Sbjct: 529 GLLQIHPTGIRHIHLNGG--VTEWKVPAGRKIVVSTSNQRQVVVGLSGGELIYFELDLEG 586
Query: 467 ILTEVK-HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHL 524
L E + ++ I+ L ++ + + + A+G+ +++V+I SL P+ L T
Sbjct: 587 QLNEYQEQKEMGSTITTLSLSEVPKGRQRTPFLAIGL-ENLTVQIISLDPNSVLETISLQ 645
Query: 525 GGEIIPRSVLLCAFEGIS--------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQ 576
+P S+ + S ++ L +G LL +L+ G+LTD + LG++
Sbjct: 646 ALTSVPSSICIAELLDSSIDKNNETLFVNIGLSNGVLLRTVLDSVNGQLTDTRTRFLGSR 705
Query: 577 PITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIA 636
P+ L + T V A S R + Y+ L ++ + + ++ F++ P+ L
Sbjct: 706 PVKLLRVKVDSKTSVIALSSRTWLNYTYQNMLQFNPLIYDSIDNVHSFSAELCPEGLIGI 765
Query: 637 KEGELTIGTIDDIQ-KLHIRSIPLGEHPRR----------ICHQEQSRTFAICSLKNQSC 685
L I TI + K+ S+PL PR+ I + + RT + S+K +
Sbjct: 766 VGSSLRIFTIPKLGIKVKQDSMPLSYTPRKMILDPVTKHVITIESEHRTMS-PSVKAERL 824
Query: 686 AE------ESEMHF-------------------VRLLDDQTFEFISTYPLDTFEYGCSIL 720
AE E + H +R+ D ++ L+ E S
Sbjct: 825 AEHKAQGLEFDEHIFNDEVLGLPRSEPGQWASCIRISDPLEKTTLTKIELENNEAATSAA 884
Query: 721 SCSFS--DDSNVYYCVGT---AYVLPEENEPTKGRILVF-IVEDGK-LQLIAEKETKGAV 773
SF+ D VGT A+V P G + VF ++GK L+L+ + E
Sbjct: 885 IVSFANQDLQGPLLVVGTAKDAFVQPRTCR--NGYLSVFKFTDNGKGLELLHKTEVDDIP 942
Query: 774 YSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILAL-YVQTRGDFIVVGD 832
++ F G+L A I + ++++ D G ++L + + A+ + T+G ++VGD
Sbjct: 943 TAIIGFQGRLAAGIGKALRIF-----DLGKKKLLRKVENKTFSAAISSLSTQGPRLLVGD 997
Query: 833 LMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGAT 892
SIS +YK E + A D + W + ++D D G + N++ V + + +
Sbjct: 998 AQDSISYAVYKPAENRLLVFADDISPRWTTCATMVDYDTCAGGDRFGNVW-VNRIPKTVS 1056
Query: 893 DE------------ERG-------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPT 933
DE E+G +++ + +HL + SLV G
Sbjct: 1057 DEVDDDPTGAGIMHEKGYLMGAPHKVKNLVHFHLNDIPTSMEKTSLV-------PGGREV 1109
Query: 934 VIFGTVNGVIGV-IASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAK 992
+++ + G +G+ I L E F + L+ ++R + + G +H +R + K
Sbjct: 1110 LLYTGLQGTVGILIPFLSKEDVDFFQTLEMHMRSELPSLVGRDHLAYRGY-----YFPVK 1164
Query: 993 NFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
N +DGD+ ESF L ++ +++ ++ SV ++ K++E +
Sbjct: 1165 NCVDGDMCESFALLPMSKQQQVAAELDRSVSDVLKKIEAV 1204
>gi|403370717|gb|EJY85226.1| Spliceosomal protein sap, putative [Oxytricha trifallax]
Length = 1203
Score = 187 bits (475), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 262/1164 (22%), Positives = 475/1164 (40%), Gaps = 160/1164 (13%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
++G I + FR G +DFL + ++ K +L++D E ++ + R
Sbjct: 58 VFGLIRKMIPFRLLGMQKDFLVVGSDSGKIVILEYDGEHNKFVKIHQETFGKTGCRRIVP 117
Query: 66 GQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLY-----GC 120
G+ DP R I + + V + Q K + LE + I F G
Sbjct: 118 GEYIAADPKGRAIMIGAVEKQKFVFILNRDSQNKLTISSPLEAHKPHTITFAMVGVDVGI 177
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNGADLLIP 169
P L D +A + V + + F E N+ +DN A +LI
Sbjct: 178 ENPQFACLEVDYGEADSTYSAVVTGQHQKMLVFYEMDLGLNHVVRKYSTPVDNSAHMLIA 237
Query: 170 VP-PPL--CGVLIIGEETIVYCSANAFK---AIPIRPSITKAYGRVDADGSRY------- 216
VP P G+L++ E I Y + +IPIR G S Y
Sbjct: 238 VPGEPYGPSGILVVCENMISYKKVDHDDRECSIPIRNEQADKNGTFFTAHSTYTSKDMFF 297
Query: 217 -LLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIK 275
LL + G + + + +V G+ I+ T+ ++ L ++ S + + L
Sbjct: 298 FLLSNEFGDIFKINLNFTNAQVHGMTIQYFDTTTPTVCMNILRPGQLFCASEFSNHTLYT 357
Query: 276 L----------------NLQPDAKGSY----------VEVLERYVNLGPIVDFCVVDLER 309
+ +P + + ++ ++ + NL I D V DL
Sbjct: 358 FLDIGENDPNPIITSSADRKPHGQQTLTTYNPREFLNLQAVDEFPNLASINDMKVEDLTG 417
Query: 310 QGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVEL-QGIKGMWSLRSSTDDPFDTFLVVS 368
+G Q+ SG G LR++R+G+ I E+A + Q + +++ T D FD +++VS
Sbjct: 418 EGNPQIYLASGRGAQGCLRVLRHGLTIIEKAVTAMPQKPLNVITVKGKTTDVFDKYMIVS 477
Query: 369 FISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSREL- 427
F +T +L++ ++++ E + GF +TL N +QVT S+ V +
Sbjct: 478 FQQQTLVLSIG-QEKVSEVKDSGFVDNERTLHVGILEDNSYIQVTPKSIIHVKGDQQNRK 536
Query: 428 RNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD--GILTEVKHAQLEYEISCLDI 485
R +W S G V A +N QV ++ GG +VY E+ + G L EV+ + E++C+DI
Sbjct: 537 RAKWDSGQGKIVK-ACSNQRQVAVSIEGGQIVYFELDEMSGTLNEVESRFYDSEVACIDI 595
Query: 486 NPIGENPSYSQIAAVGMWTDISVRIFSL-PD--LNLITKEHLGGEIIPRSVLLCAFEGIS 542
+ E + AVG + D +V+I SL P+ L I+ + L P SV L +F+
Sbjct: 596 ADVPEGRQRCRFLAVG-YADKTVKIMSLDPESCLQRISMQALPAH--PESVALISFQRDE 652
Query: 543 --------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAA 594
+L L +G LL L++ TG L+D + LGT I+L NT + A
Sbjct: 653 VAQQQQQLFLHVGLVNGVLLRTLVDNVTGVLSDSRTRFLGTNSISLAKVRQGNTNALVAL 712
Query: 595 SDRPTVIYS--SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGE-LTIGTIDDI-Q 650
++P + Y+ S K+ + ++ + + F S PD +A G L I T++ + +
Sbjct: 713 CNKPWLCYTHMSTNKVNITPLSYEMLEVASSFCSEKCPDGGIVAISGNTLRIITVERLGE 772
Query: 651 KLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCA-----------------------E 687
+ +PL P +I ++ I + S + +
Sbjct: 773 NFTHKVMPLRYTPTKIQIHRETNYLVILEKDHNSYSYSERLRMKEEIAKNTEDENYLALD 832
Query: 688 ESEMHFVRLLDDQ---TFEFISTYPLDTFEY---------GCSILSCSFSDDSNVYYCVG 735
ES + + R ++ + Y L T E ++ + + + +G
Sbjct: 833 ESRISYPRAGQNKFASCIRIVDPYLLQTLELIEFENNEVVFSHFIATTLGNPGETHLILG 892
Query: 736 TAY-VLPEENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQ 792
TA V + + G I + V +G+ LQL+ + + N + G+L+A + ++
Sbjct: 893 TALNVTFQPRSCSVGFIKTYKFVNNGQSLQLMHSTPCEDIPMAFNEYKGRLIAGVGPILR 952
Query: 793 LY----KWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGA 848
+Y K +LR + Q+ + +Q I GDL +S+ +L YK E+
Sbjct: 953 IYELGQKKLLRKVENKNFQAPI--------IQIQVDEGRIYAGDLQESVHVLKYKPEDVQ 1004
Query: 849 IEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR------------------KNSEG 890
+ + D W+++ +LD D G + N+F R K G
Sbjct: 1005 LYIFSDDILNRWLTSFCLLDHDTIAGVDKFENVFINRLPVGCEDDAEDDPTATKFKWENG 1064
Query: 891 ATDEERGRLEVVGEYHLGEFVNRFRHGSL-VMRLPDSDVGQIPTVIFGTVNGVIGVIASL 949
+ +++ + ++ GE + SL + +S++ ++FGT +G +G +
Sbjct: 1065 YLNGAAFKMDPICQFFTGEVGTCIQKCSLNTLSGTNSEI-----ILFGTTSGSLGALLPF 1119
Query: 950 -PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSR 1008
E+ F L+ LR + + G +H +RS V K+ +DGDL E F L
Sbjct: 1120 ETREEIDFFVHLEMYLRIEAQPLCGRDHVTFRS-----SYVPVKDVVDGDLCEQFASLEF 1174
Query: 1009 TRMDEISKTMNVSVEELCKRVEEL 1032
+ +++ M+ + E+ K++E +
Sbjct: 1175 NKQRVLAEEMDRTPPEVMKKLENM 1198
>gi|296814646|ref|XP_002847660.1| pre-mRNA-splicing factor rse1 [Arthroderma otae CBS 113480]
gi|238840685|gb|EEQ30347.1| pre-mRNA-splicing factor rse1 [Arthroderma otae CBS 113480]
Length = 1235
Score = 187 bits (475), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 272/1164 (23%), Positives = 478/1164 (41%), Gaps = 167/1164 (14%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
++G I +L FR G +D++ I ++ + ++++ + R + + G R
Sbjct: 64 VFGIIRSLAAFRLAGSNKDYIIIGSDSGRITIVEY-VPAQNRFNRIHLETFGKSGVRRVV 122
Query: 65 NGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEE-------LQVLDIK 115
GQ +DP R LI + L V+ + + +L + + L LD+
Sbjct: 123 PGQYLAVDPKGRACLIASVEKNKLVYVLNRNAQAELTISSPLEAHRPQTVVFALTALDVG 182
Query: 116 F---LYGCAKPTIVVLYQDNKDARHVKT------YEVALKDKDFVEGPWSQNNLDNGADL 166
+ ++ + + QD+ + +T YE+ L V W+ + +D A +
Sbjct: 183 YENPIFAALEVEYTEVDQDSTGQAYEETEKMLVYYELDLGLNHVVRK-WA-DPVDRTASM 240
Query: 167 LIPVPPPL---CGVLIIGEETIVYCSAN--AFKA-IPIR--PSITKAYGRVDADGSR--- 215
L VP GVL+ E+ IVY +N AF+ IP R P+ R G
Sbjct: 241 LFQVPGGADGPSGVLVCAEDNIVYRHSNQDAFRVPIPRRRGPTENPERKRCITAGVMHKM 300
Query: 216 -----YLLGDHAGLLHLL---VITHEKEKVTG----LKIELLGETSIASTISYLDNAVVY 263
+LL G L L ++ E E+ TG LK++ +AS++ L + ++
Sbjct: 301 RGAFFFLLQSEDGDLFKLTMEMVEDENEQATGEVKRLKLKYFDTVPLASSLCILKSGFLF 360
Query: 264 IGSSYGDSQLIKLN-----------LQPDAKGSYVEVL----------------ERYVNL 296
+ S G+ + + D E L E +L
Sbjct: 361 VASETGNQHFYQFEKLGDDDDEIEFISDDYSADLTEPLPPVYFRPRPAENLNLVESIASL 420
Query: 297 GPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSLRS 355
P++ + +L + Q+ T G S R +++G+ ++E EL + +W+ +
Sbjct: 421 NPLMSCSIANLTEEDAPQLYTLCGTGARSSFRTLKHGLEVSEIVESELPSVPSAVWTTKL 480
Query: 356 STDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSG 415
+ +D +D ++V+SF + T +L++ + +EE GF S TL + L+QV
Sbjct: 481 TRNDQYDAYIVLSFSNGTLVLSIG--ETVEEVTDTGFLSSAPTLAVQQLGEDSLIQVHPK 538
Query: 416 SVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTEV-KH 473
+R + + R NEW +P S+ AT N QV +A G +VY E+ DG L E +
Sbjct: 539 GIRHIHADQR--VNEWPAPQHRSIVAATTNERQVAIALSSGEIVYFEMDTDGSLAEYDEK 596
Query: 474 AQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPRS 532
Q+ ++CL + + E S AVG D +VRI SL P+ L K P +
Sbjct: 597 RQMSGTVTCLSLGEVPEGRVRSSFLAVGC-DDSTVRILSLDPESTLENKSVQALTSAPSA 655
Query: 533 V-LLCAFEGIS-----YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSK 586
+ ++ + S YL L G L +L+ TGEL+D + LG +P+ L S K
Sbjct: 656 LSIMSMIDSTSGGPTLYLHIGLYSGIYLRTVLDEVTGELSDTRTRFLGVKPVKLFGVSVK 715
Query: 587 NTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVS--HMCPFNSAAFPDSLAIAKEGELTIG 644
V A S R + YS + + ++ L V F+S + + + L I
Sbjct: 716 EQRAVLALSSRSWLGYSDVQTMSFTLTPLNYVGLEWSWNFSSEQCVEGMVGIQGQNLRIF 775
Query: 645 TIDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQT-- 701
+I+ + L SIPL R + + F + N + ++ +LL + T
Sbjct: 776 SIEKLDNNLLQESIPLAYTSRSLVRHPEYPIFYVIGSDNNVLSPSTK---AKLLSESTTV 832
Query: 702 -----------FEF----------------------ISTYPLDTFEYGCSILSCSF-SDD 727
F + +S L+ E SI + SF S +
Sbjct: 833 NGDNAELPPEDFGYPRGTNHWASCIQVVDPINAKAVMSRIELEDNEAAVSIAAVSFTSQE 892
Query: 728 SNVYYCVGTAY-VLPEENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVYSLNAFNGKLL 784
+ VGT ++ T G I ++ E+GK L+ I + + + +L F G+LL
Sbjct: 893 DETFLVVGTGKGMVVSPRSFTCGFIHIYRFQEEGKELEFIHKTKVEQPPLALLGFQGRLL 952
Query: 785 AAINQKIQLYKWMLRDDGTRELQSEC-GHHGHILALYVQTRGDFIVVGDLMKSISLLIYK 843
A I +++Y D G R+L +C + + +QT+G I+V D+ +S++ ++YK
Sbjct: 953 AGIGPDLRIY-----DLGMRQLLRKCQAQITPRVIVGLQTQGSRIIVSDVQESVTYVVYK 1007
Query: 844 HEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDEERGRLE 900
++E + A D W + ++D + G + N++ +R K SE A ++ G
Sbjct: 1008 YQENNLIPFADDIIPRWTTCTTMVDYETVAGGDKFGNIWLLRCPPKASEEADEDGSGAHL 1067
Query: 901 VVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGT--------------VNGVIGV- 945
+ +L NR SLV D IPT I T + G +G+
Sbjct: 1068 IHERQYLQGAPNRL---SLVAHFYSQD---IPTSIQKTQLVAGGRDILVWTGLQGTVGML 1121
Query: 946 IASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLD 1005
I + + F + L+ L + G +H +R + K +DGDL E+F
Sbjct: 1122 IPFVTRDDVDFFQTLEMQLTSQNPPLAGRDHLIYRGY-----YAPCKGVIDGDLCETFFL 1176
Query: 1006 LSRTRMDEISKTMNVSVEELCKRV 1029
L + I+ ++ SV E+ +++
Sbjct: 1177 LPNDKKQAIAGELDRSVREIERKI 1200
>gi|392578232|gb|EIW71360.1| hypothetical protein TREMEDRAFT_71141 [Tremella mesenterica DSM 1558]
Length = 1250
Score = 187 bits (474), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 224/890 (25%), Positives = 375/890 (42%), Gaps = 140/890 (15%)
Query: 210 DADGSRYLLGDHAGLLHLLVITHEKEKVTG--------LKIELLGETSIASTISYLDNAV 261
D D + LLGD AG L L I +E ++ LK +L G + S+++ L ++
Sbjct: 356 DRDNAVVLLGDDAGHLTKLTIESAREDLSSKLSAVMKVLKTDL-GFAAAPSSLTLLGSSH 414
Query: 262 VYIGSSYGDSQLIKLN---------LQPDAKG---------------------SYVEVLE 291
V++GS+ GD+ +I+L L P K + VE++E
Sbjct: 415 VFMGSACGDALIIRLPSVTSSHSTPLSPALKSKGKARADDSLDGFGVEHYHETATVEMVE 474
Query: 292 RYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVE-LQGIKGM 350
R++NL P+ DFC V+ + G +V +GA SLR VR+G+ + +E +QGI+ M
Sbjct: 475 RWMNLAPVKDFCAVEEDGGGLSHLVVAAGASNTNSLRAVRSGVSLETLMDIEGVQGIERM 534
Query: 351 WSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQL- 409
W + + D FL S T + + L E++ + T+ A + ++
Sbjct: 535 WPIMLPSRDQ-GIFLSTS----TSSMLLGLAPEVQALALPESVVNHPTIAA--ASWEEVA 587
Query: 410 VQVTSGSVRLVSSTSRELR-NEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG---- 464
V VT+ +V + S + ++ W + A + +A GG LV L++
Sbjct: 588 VIVTANNVTVWSDLTGSVKVGSWSHGQSRQILAAQISGGLAAIAISGGELVILQVSAHGV 647
Query: 465 DGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHL 524
D ILT QL EI+ + I S I WT+ + +F+L L +
Sbjct: 648 DAILTR----QLGGEIASVAI-----LDGLSPIVVTSTWTN-EIFLFTLDQLR---SPDM 694
Query: 525 GGEIIPR-----SVLLCAFEGISYLLCALGDGHLLNFLLNMKTG---ELTDRKKVSLGTQ 576
G I S+ LC G + LL L DG ++ + + EL++RK VSLGTQ
Sbjct: 695 QGSTIRENSFCASLQLCPLSGGARLLAGLSDGTMVTYHIESSANNALELSERKAVSLGTQ 754
Query: 577 PITLRTFS---SKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSL 633
P+ L S + S+R VI S +++ S+VN + N+ + L
Sbjct: 755 PLRLSPTSLSCGDDRIISVGLSERMCVISESRERIESSSVNRNDTRAAATINTPSHGSCL 814
Query: 634 AIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCA------- 686
A+A +++ ++K+H+R++ LG H+ SR I LK +
Sbjct: 815 ALATSSGISLVKPTSLKKVHVRTLDLG-------HRSVSRLTNISPLKAIAAGSTERPFD 867
Query: 687 -EESEMH---FVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPE 742
E E+H +V L D T E + PL++ E +I + D + VG A +
Sbjct: 868 RETGEIHQSSYVELRDSTTLELLVEKPLESREIVTTIAYVTLGDQN--LLAVGIATFSED 925
Query: 743 ENEPTKGRILVFI-VEDGKLQL---IAEKET-------------KGAVYSLNAFNGKLLA 785
+ + +V I + G+L L + ++++ + AV + L
Sbjct: 926 DEDLPDDLDMVTISAQSGRLVLYEPVVDQDSAEPNLIELTSVGLESAVNDIKVIKNLLAV 985
Query: 786 AINQKIQLYKWMLRDD---GTRELQSECGHHGHILA----LYVQTRGDFIVVGDLMKSIS 838
A + +YK T S ++A L+ + R +VVGD M+SI
Sbjct: 986 ATGSNVTIYKHEKASHLLIPTSRFASAFVAKSLVVAPPDKLHPEER---LVVGDGMRSIF 1042
Query: 839 LLIYKHEEGAIEERARDYNANWMSAVEILDDD----IYLGAENNFNLFTVRKNSEGATDE 894
+L G I RD + + A+E L D I A +N + F +R+
Sbjct: 1043 VLDIDEGTGMIMGDERDMATHSVMAMEGLRDGGQAVIVADAHSNISTFRLREE------- 1095
Query: 895 ERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQY 954
+E + L E ++ FR GSL DV P +IF T++G +G++ L
Sbjct: 1096 ----IETAATFGLHEDISVFRRGSLAPASSAEDVLS-PEIIFATIDGRLGIVGELTPSAA 1150
Query: 955 LFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFL 1004
L+ LQ N+ + I+G G + +R E D F+DGD +++++
Sbjct: 1151 RTLDDLQRNMDRYIRGPGDIAWRSYRRAGTELVQRDTAGFIDGDFVQTYI 1200
>gi|320587625|gb|EFX00100.1| nuclear mRNA splicing factor protein [Grosmannia clavigera kw1407]
Length = 1220
Score = 186 bits (473), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 274/1180 (23%), Positives = 484/1180 (41%), Gaps = 184/1180 (15%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW---DAESSELITRAMGDVSDRIGRP 62
++G I ++ FR G +D++ +AT+ + ++++ D + G R P
Sbjct: 65 VFGIIRSIAAFRLAGSNKDYVILATDSGRITIIEYIPADNRFQRIHLETFGKSGVRRVVP 124
Query: 63 TDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL--- 117
GQ DP R LI + L V+ + + +L + + + V+ I +
Sbjct: 125 ---GQYLAADPKGRACLIASVEKNKLVYVLNRNAQAELTISSPLEAHKPGVIVISLVALD 181
Query: 118 YGCAKPTIVVLYQDNKDA---------RHVKT----YEVALKDKDFVEGPWSQNNLDNGA 164
G A P L D ++ R +T YE+ L V WS + +D+ A
Sbjct: 182 VGYANPVFAALELDYAESDQDPTGEAYREAETLLVYYELDLGLNHVVRR-WS-DTVDSTA 239
Query: 165 DLLIPVP-----PPLCGVLIIGEETIVYCSAN--AFKA-IPIRPSITK--AYGRVDADGS 214
+L VP P GVL+ GEE I Y +N A++ IP R T+ R G
Sbjct: 240 SMLFQVPGGSDGP--SGVLVCGEENITYRHSNQEAYRVPIPRRRGATEDPNRKRTIVSGV 297
Query: 215 RYLLGDHAGLLHLLVITHEKE------------------KVTGLKIELLGETSIASTISY 256
+ L AG L+ T + + +V LKI+ +AS++
Sbjct: 298 MHKLKGSAGAFFFLLQTEDGDLLKLTLDMVEDDDGNPTGEVLRLKIKYFDTIPVASSLCI 357
Query: 257 LDNAVVYIGSSYGDSQLIKL---------------NLQPDAKGSY------------VEV 289
L + ++ S +G+ Q + N + K Y + +
Sbjct: 358 LKSGFLFSASEFGNHQFYQFEKLGDDDDELEFSSDNFPHEPKAPYEPVYFYPRPAENLAL 417
Query: 290 LERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIK- 348
+E ++ P++D V +L Q+ + G + R++++ + +NE + +L G
Sbjct: 418 VESIESMNPMLDCKVANLTDDDVPQIYSVCGNGARSTFRMLKHALEVNEIVASQLPGTPT 477
Query: 349 GMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQ 408
+W+ + D+ +D F+V+SF + T +L++ + +EE GF S TL +
Sbjct: 478 AVWTTKVRRDEEYDAFIVLSFNNGTLVLSIG--ETVEEVTDTGFLSSVPTLAVQQLGDDG 535
Query: 409 LVQVTSGSVRLVSSTSRELR-NEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DG 466
L+QV +R + R+ R NEW +P S+ A N QV++A G +VY E+ DG
Sbjct: 536 LIQVHPKGIRHI----RDGRVNEWAAPQHRSIVAAATNERQVVVALSSGEIVYFEMDTDG 591
Query: 467 ILTEVKHAQLEY-EISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHL 524
L E + + ++CL + + E S AVG D +VRI SL P+ L
Sbjct: 592 SLAEYDEKKEMFGTVTCLSLGAVPEGRLRSSYLAVGC-DDCTVRILSLDPETTLDNMSIQ 650
Query: 525 GGEIIPRSVLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPI 578
P S+L+ A + S YL L G L +L+ TGELTD ++ LG + +
Sbjct: 651 ALTAAPSSLLIMAMDDSSAGGTALYLHIGLHSGVYLRTVLDEVTGELTDTRQRFLGPKLV 710
Query: 579 TLRTFSSKNTTHVFAASDRPTVIY--SSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIA 636
L S + V A S RP + Y S K + ++ E+ F+S + +
Sbjct: 711 RLFQVSVQRRVCVLALSSRPWLGYDDSKAKNFAMTRLDYSELEWGWNFSSEQCEEGMVGI 770
Query: 637 KEGELTIGTIDDIQKLHI-RSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVR 695
L I +I+ + I +S PL PRR+ + F N + + + +
Sbjct: 771 HGNFLRIFSIEKLGDALIQKSFPLTYTPRRLVKHPEFGIFYTIEADNNTLSPDLRRQLIE 830
Query: 696 ----------LLDDQTFEF-----------------------ISTYPLDTFEYGCSILSC 722
+L + F + T L+ E +
Sbjct: 831 APGVTNGDASVLPPEDFGYPRGNGRWASCISVVDPVGEDPGVTQTIELEGNEAAVCMAVA 890
Query: 723 SF-SDDSNVYYCVGTAY-VLPEENEPTKGRILVF-IVEDG-KLQLIAEKETKGAVYSLNA 778
SF S Y VGT ++ + ++G I + V+DG +L+L+ + + + +L A
Sbjct: 891 SFVSRGGESYLIVGTGRNMILSPRQFSEGYIHAYRFVKDGTQLELVHKTKVEEPPTALLA 950
Query: 779 FNGKLLAAINQKIQLY----KWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLM 834
F G++L I +++Y K ML R+ QSE L + +QT+G +VVGD+
Sbjct: 951 FQGRILVGIGNVLRIYDLGIKQML-----RKAQSEVSSK---LIVSLQTQGSRVVVGDVE 1002
Query: 835 KSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDE 894
+ ++ ++YK E + D W + ++D G + NL+ +R + + + D
Sbjct: 1003 EGVTYVVYKPEINKLLPFVDDTIKRWTTCTTMVDYQSVAGGDKFGNLWILRVSDKASQDA 1062
Query: 895 E-----------RG-------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIF 936
+ RG RL ++ + + SLV VG +++
Sbjct: 1063 DEPGSELQLVHARGYLHGAPNRLALMAHVYTQDVPMSICKASLV-------VGGQEVLVW 1115
Query: 937 GTVNGVIGVIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFL 995
+ G +G + + + F + L+T++R+ + G +H +RS+ V K +
Sbjct: 1116 SGLQGTVGALVPFVSRDDADFFQNLETHMRQEDAPLAGRDHLMYRSY-----YVPVKGVI 1170
Query: 996 DGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRL 1035
DGDL E F L R + I+ ++ SV E+ +++ R+
Sbjct: 1171 DGDLCERFNLLPREKKQMIAGELDRSVREIERKISVSARV 1210
>gi|242018509|ref|XP_002429717.1| Splicing factor 3B subunit, putative [Pediculus humanus corporis]
gi|212514723|gb|EEB16979.1| Splicing factor 3B subunit, putative [Pediculus humanus corporis]
Length = 1218
Score = 186 bits (472), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 259/1194 (21%), Positives = 494/1194 (41%), Gaps = 196/1194 (16%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+L V I+G I +L FR G +D++ + ++ + +L++ + L + + + G
Sbjct: 53 LLTVEIFGVIRSLMSFRLTGGTKDYIVVGSDSGRIVILEY-IPAKNLFEKVHQETFGKSG 111
Query: 61 -RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
R GQ IDP R ++G L ++ D + +L + + + L +
Sbjct: 112 CRRIVPGQYLAIDPKGRAVMVGAVEKQKLVYILNRDAEARLTISSPLEAHKSNTLVYHMV 171
Query: 118 ---YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNG 163
G P L D ++A T E A K + F E N+ L+
Sbjct: 172 GVDVGFENPMFACLEIDYEEADTDPTGEAASKTQQTLTFYELDLGLNHVVRKYSEPLEEH 231
Query: 164 ADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR--- 215
A+ L+ VP P GVLI E + Y N IR I + +D D R
Sbjct: 232 ANFLVSVPGGNDGP--SGVLICSENYLTY--KNLGDQHDIRCPIPRRRNDLD-DPERGMI 286
Query: 216 --------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAV 261
+L G + + + +++ VT +K++ +A+++ +
Sbjct: 287 FVCSATHKTKSVFFFLAQTEQGDIFKITLETDEDMVTEIKLKYFDTVPVATSMCVMKTGF 346
Query: 262 VYIGSSYGDSQLIKLN------------------------LQPDAKGSYVEVLERYVNLG 297
+++ S +G+ L ++ P A + V+V E +L
Sbjct: 347 LFVASEFGNHYLYQIAHLGDDDDEPEFSSAMPLEEGDTFFFAPRALRNLVQVDE-MDSLS 405
Query: 298 PIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSS 356
PI+ V DL + Q+ G +LR++R+G+ ++E A EL G +W+++
Sbjct: 406 PIMACQVADLANEDTPQLYMLCGRGPRSTLRVLRHGLEVSEMAVSELPGNPNAVWTVKRR 465
Query: 357 TDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGS 416
++ +D +++VSF++ T +L++ + +EE GF T TL C + LVQV
Sbjct: 466 VEEEYDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSALGDDALVQVYPDG 523
Query: 417 VRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD-GILTE-VKHA 474
+R + + R NEWK+P ++ N QV++A G LVY E+ G L E +
Sbjct: 524 IRHIRADKR--VNEWKAPGKKTIMKCAVNQRQVVIALTAGELVYFEMDPTGQLNEYTERK 581
Query: 475 QLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSV- 533
+ ++SC+ + + S+ AVG+ D +VRI SL + ++ L + +P +
Sbjct: 582 AMPSDVSCMALGNVVPGELRSRFLAVGL-ADNTVRIISLDPSDCLSP--LSMQALPAAAE 638
Query: 534 LLCAFE-GIS----------------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQ 576
LC E G + YL L +G LL +L+ TG+L D + LG++
Sbjct: 639 SLCIVEMGAADKKPDSEESTVTQSNLYLNVGLQNGVLLRTVLDPVTGDLADTRTRYLGSR 698
Query: 577 PITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIA 636
P+ L + + V A S R + Y + + ++ + + + F+S P+ +
Sbjct: 699 PVKLFRIKMQGSEAVLAMSSRSWLSYYYQNRFHLTPLSYESLEYASGFSSEQCPEGIVAI 758
Query: 637 KEGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQE--------------------QSRT 674
L I ++ + + + S PL PR+ + H E Q R
Sbjct: 759 STNTLRILALEKLGAVFNQISFPLEYTPRKFVVHPETGKMILLETEHNAYTEETKKQRRV 818
Query: 675 FAICSLKNQSCAEESEM-----------------------------HFVRLLDDQTFEFI 705
++ + EE E+ +R++D
Sbjct: 819 QMAEEMQEAAGDEEQELAKEMAEAFLSEDLPESIFGAPKAGSGMWASIIRIIDPVEGRTD 878
Query: 706 STYPLDTFEYGCSILSCSFSD-DSNVYYCVGTA--YVLPEENEPTKGRILVF-IVED-GK 760
L+ E SI F++ +++ VG Y L + + G + F I ED
Sbjct: 879 KIVRLEQNEAALSIALVKFNNHPESLFLVVGVVKEYQL-SPRQVSFGYLYTFRINEDVTD 937
Query: 761 LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHH---GHIL 817
L+L+ + A ++ ++G+LL + + ++LY D G ++L +C + I+
Sbjct: 938 LELVHKTTVDEAPAAVCPYHGRLLVGVGRMLRLY-----DLGKKKLLRKCENKYIPNQIV 992
Query: 818 ALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAEN 877
++ G + V D+ +S+ ++ YK +E + A D + W++ ILD D A+
Sbjct: 993 SICAT--GQRVFVSDVQESVYMVRYKRQENQLIIFADDTHPRWITCTTILDYDTVATADK 1050
Query: 878 NFNLFTVRKNS--EGATDE---------ERGRL-------EVVGEYHLGEFVNRFRHGSL 919
N+ +R +S DE +RG L +V+ +H+GE + +L
Sbjct: 1051 FGNIAIIRLSSIITDDVDEDPTGNKALWDRGLLNGASQKADVLANFHVGETCMSLQKATL 1110
Query: 920 VMRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQ 978
+ G ++++ +++G +GV+ E + F + L+ ++R + G +H
Sbjct: 1111 I-------PGGSESLVYTSLSGTVGVLVPFTSREDHDFFQHLEMHMRSEHPPLCGRDHLS 1163
Query: 979 WRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
+RS+ KN +DGDL E + + + I++ ++ + ++ K++E++
Sbjct: 1164 FRSY-----YYPVKNVIDGDLCEQYNSIEPAKQKSIAEDLDRNPSDVSKKLEDI 1212
>gi|357606250|gb|EHJ64976.1| putative Splicing factor 3B subunit [Danaus plexippus]
Length = 1216
Score = 186 bits (471), Expect = 8e-44, Method: Compositional matrix adjust.
Identities = 272/1183 (22%), Positives = 483/1183 (40%), Gaps = 176/1183 (14%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
++ V I+G I ++ FR G +D++ + ++ + +L++ + ++ + + + G
Sbjct: 53 LMKVEIFGVIRSMMSFRLTGGTKDYIVVGSDSGRIVILEY-IPAKNILEKVHQETFGKSG 111
Query: 61 -RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
R GQ IDP R +IG L ++ D + +L + + + L +
Sbjct: 112 CRRIVPGQYLAIDPKGRAVMIGAIEKQKLVYILNRDAEARLTISSPLEAHKSNTLVYHMV 171
Query: 118 ---YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNG 163
G P L D ++A T E A K + F E N+ L+
Sbjct: 172 GVDVGFENPMFACLEIDYEEADSDPTGEAAQKTQQTLTFYELDLGLNHVVRKYSEPLEEH 231
Query: 164 ADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR--- 215
A+ LI VP P GVLI E + Y N IR I + +D D R
Sbjct: 232 ANFLITVPGGNDGP--SGVLICSENYLTY--KNLGDQHDIRCPIPRRRNDLD-DPERGMI 286
Query: 216 --------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAV 261
+L G + + I +++ VT +K++ +A+ + L
Sbjct: 287 FVCSATHKTKSMFFFLAQTEQGDIFKITIETDEDMVTEIKLKYFDTVPVATAMCVLKTGF 346
Query: 262 VYIGSSYGDSQLIKL-------------NLQPDAKGSYVEVLERYV----------NLGP 298
+++ +G+ L ++ + P +G R + +L P
Sbjct: 347 LFVACEFGNHYLYQIAHLGDEDDEPEFSSAMPLEEGDTFFFAPRPLRNLVLVDELDSLSP 406
Query: 299 IVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSST 357
I+ V DL + QV G SLR +R+G+ + E A EL G +W++R
Sbjct: 407 ILACHVADLTGEDTPQVYLACGRGPRSSLRALRHGLEVAEMAVSELPGSPNAVWTVRRHK 466
Query: 358 DDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSV 417
DD +D++++VSF++ T +L++ + +EE GF T TL CH + LVQV +
Sbjct: 467 DDDYDSYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCHALGSDALVQVYPDGI 524
Query: 418 RLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKHAQ 475
R + + R NEWK+P S+ N QV++A GG LVY E+ G L E + +
Sbjct: 525 RHIRADKRV--NEWKAPGKKSIVKCAVNQRQVVIALTGGELVYFEMDPTGQLNEYTERKK 582
Query: 476 LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSV-L 534
L ++SC+ + + + AVG+ D +VRI SL + + + + +P S
Sbjct: 583 LSSDVSCMALGSVATGEQRAWFLAVGL-VDNTVRIISLDPADCLAPRSM--QALPASPES 639
Query: 535 LC----AFEGIS----YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSK 586
LC FE + +L L +G LL L+ +G+L D + LG++P+ L +
Sbjct: 640 LCIVDQPFESGAKSALHLNIGLSNGVLLRTTLDSVSGDLADTRTRYLGSRPVKLFKVRVQ 699
Query: 587 NTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTI 646
+ V A S R + Y + + ++ + + + F+S + + L I +
Sbjct: 700 SAEAVLAVSSRTWLGYQYQNRFHLTPLSYECLEYAAGFSSEQCTEGIVAISSNTLRILAL 759
Query: 647 DDIQKLHIRSI-PLGEHPRRICHQEQSRTFAICSLKNQSCAEE----------SEMHFV- 694
+ + + ++ L PR+ + + + + EE EM
Sbjct: 760 EKLGAVFNQTFQQLDYTPRKFVINSDNNHIIVLETDHNAYTEEMKKQRRVQMAQEMREAA 819
Query: 695 --------RLLDDQTFEFIS-TYPLDTF---EYGCSILSCSFS--DDSNVYYCVGTAYVL 740
+L ++ F+S P + F + G + + D S T +L
Sbjct: 820 AGGTPEEQQLANEMADAFLSDVLPENIFSSPKAGAGMWASQIRILDMSGGVGGCSTVCLL 879
Query: 741 P-EENEPTKGRILV-----------FIVEDGKLQLIAEKE-TKGAVYSLNAFN-GKL--- 783
P E+NE +V +V K L++ + ++G+++ +N GKL
Sbjct: 880 PLEQNEAAVSLCVVRWAALTDNTPHLVVGVAKDALLSPRSCSEGSLHVYKIYNTGKLELV 939
Query: 784 -----------LAAINQKIQL-YKWMLR--DDGTRELQSEC-GHHGHILALYVQTRGDFI 828
LAA N K+ MLR D G R+L +C H L ++T I
Sbjct: 940 HKTPIDEYPGALAAFNGKLLAGVGRMLRLYDIGRRKLLRKCENRHIPNLIADIKTIRQRI 999
Query: 829 VVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR--K 886
V D+ +S+ + YK E + A D N W++ ILD D A+ N+ +R +
Sbjct: 1000 FVSDVQESVFCVKYKKRENQLIIFADDTNPRWITNTCILDYDTVAMADKFGNVAVLRLPQ 1059
Query: 887 NSEGATDE---------ERGRL-------EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQ 930
+ DE +RG L ++ +H+GE V + +L+ G
Sbjct: 1060 SVSDDVDEDPTGNKALWDRGLLNGASQKGDITVNFHVGETVTSLQRATLI-------PGG 1112
Query: 931 IPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTV 989
+++ TV+G +GV E + F + L+ ++R + G +H +RS+
Sbjct: 1113 SEALLYATVSGALGVFLPFTSREDHDFFQHLEMHMRSENSPLCGRDHLSFRSY-----YY 1167
Query: 990 DAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
KN +DGDL E F L + I+ + + E+ K++E++
Sbjct: 1168 PVKNVIDGDLCEQFNSLEPAKQKAIAGDLERTPAEVSKKLEDI 1210
>gi|449664009|ref|XP_004205855.1| PREDICTED: splicing factor 3B subunit 3-like [Hydra magnipapillata]
Length = 1213
Score = 183 bits (465), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 209/941 (22%), Positives = 399/941 (42%), Gaps = 148/941 (15%)
Query: 216 YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIK 275
+L+ G + + + E++ VT ++++ +AS + L ++ S +G+ L +
Sbjct: 291 FLVQSEQGDIFKVTLETEEDMVTEIRVKYFDTIPVASALCVLKTGFLFAASEFGNHALYQ 350
Query: 276 L------NLQPDAKG-------------------SYVEVLERYVNLGPIVDFCVVDLERQ 310
+ + +P+ + +++ +L PI+ V DL +
Sbjct: 351 ITHLGDDDEEPEFSSLMATELDEGETFFFHARDLQNLVLVDEMESLAPIMHCQVADLANE 410
Query: 311 GQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTDDPFDTFLVVSF 369
Q+ G SLR++R+G+ ++E A EL G +W+++ ++ D D ++VVSF
Sbjct: 411 DTPQLYAACGRGPRSSLRVLRHGLEVSEMAVSELPGNPNSVWTVKKNSTDEQDAYIVVSF 470
Query: 370 ISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRN 429
++ T +L++ + +EE GF T TL C + L+Q+ +R + S R N
Sbjct: 471 VNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSLLGEDALLQIYPDGIRHIRSDRR--VN 526
Query: 430 EWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKHAQLEYEISCLDINP 487
EWK+P ++ N Q+++A G LVY E+ G L E + ++ ++ C+ + P
Sbjct: 527 EWKTPGKKNIIQCAVNERQIVIALTGNELVYFELDQSGQLNEYTERKEMSADVVCMALGP 586
Query: 488 IGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS-VLLCAFE------- 539
+ S+ AVG+ D +VRI SL + + E L + +P + LC E
Sbjct: 587 VHAGEQRSRFLAVGL-IDNTVRIISLDPNDCL--EPLSMQALPAAGESLCIIEMGGTEVG 643
Query: 540 --------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHV 591
G +L L +G LL +L+ TG+L+D + LG++P+ L + + V
Sbjct: 644 EKGTAGNAGGLFLNIGLANGVLLRTVLDSVTGDLSDTRIRYLGSKPVKLFNVKTYGCSSV 703
Query: 592 FAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQK 651
A S R + Y+ + + ++ + + + F S P+ + L I ++ +
Sbjct: 704 LAMSSRTWLSYTYQSRFHLTPLSYESLEYASSFASEQCPEGIVAISSNTLRILALEKLGA 763
Query: 652 LHIR-SIPLGEHPRRIC--------------HQEQSRTFAI-------------CSLKNQ 683
+ + + PL PR+ H ++TF + C ++
Sbjct: 764 VFNQVTTPLDFTPRKFVVDHKSHTLIMIETDHNAMTKTFKLDRKQRIAEEMVESCGDNDE 823
Query: 684 SCAEES------------EMHF-------------VRLLDDQTFEFISTYPLDTFEYGCS 718
+ A ES E+ F VR++D E L+ E S
Sbjct: 824 ARAAESQVAANFLNTDIPEIQFGAPKAGIGRWASIVRIMDPIRSETHHEIQLEQDEAAFS 883
Query: 719 ILSCSFSDD-SNVYYCVGTAYVL---PEENEPTKGRILVFIVED-GKLQ--LIAEKETKG 771
+ F++ + + VG A + P ++ D G++Q L+ +
Sbjct: 884 VCLTEFANQPGDRFVIVGCATKMILNPHSSQSCSIHTYQLYQNDRGRMQLELLHKTPVDD 943
Query: 772 AVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHG-HILALYVQTRGDFIVV 830
+L +F G+LL + + +++Y D G ++L +C + + T G I+V
Sbjct: 944 IPGALCSFQGRLLVGVGRILRIY-----DLGKKKLLRKCENKKVSTFITGIDTIGYRIMV 998
Query: 831 GDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEG 890
D+ S L YK ++ + A D N +++ ++D D GA+ N+F VR ++
Sbjct: 999 HDIQDSFHFLKYKRKDNQLVLFADDVNKRFLTCGCLVDYDTMAGADKFGNIFFVRLPADV 1058
Query: 891 ATDEE-----------RGRL-------EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIP 932
D E RG L +VV YH+GE + +L+ G
Sbjct: 1059 KDDIEEDPTGVKALWDRGLLNGASQKADVVSVYHIGETALSLQKATLI-------PGGSE 1111
Query: 933 TVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDA 991
++++ TV+G IG++ E F + L+ ++R+ + G +H +RS
Sbjct: 1112 SLVYTTVSGGIGMLVPFTSREDIDFFQHLEMHMRQDSPPLCGRDHLWFRS-----SFSSV 1166
Query: 992 KNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
KN +DGDL E F L + I++ ++ + E+ K++E++
Sbjct: 1167 KNCVDGDLCEQFNSLEYAKRKAIAEELDRTPAEVSKKLEDI 1207
>gi|355718834|gb|AES06402.1| splicing factor 3b, subunit 3, 130kDa [Mustela putorius furo]
Length = 1101
Score = 182 bits (462), Expect = 9e-43, Method: Compositional matrix adjust.
Identities = 256/1124 (22%), Positives = 461/1124 (41%), Gaps = 197/1124 (17%)
Query: 10 IATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTDNGQI 68
I +L FR G +D++ + ++ + +L++ S + + + + G R GQ
Sbjct: 2 IRSLMAFRLTGGTKDYIVVGSDSGRIVILEYQP-SKNMFEKIHQETFGKSGCRRIVPGQF 60
Query: 69 GIIDPDCRLIGLHLYDG--LFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YGCAKP 123
+DP R + + + L ++ D +L + + + L + G P
Sbjct: 61 LAVDPKGRAVMISAIEKQKLVYILNRDAAARLTISSPLEAHKANTLVYHVVGVDVGFENP 120
Query: 124 TIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNGADLLIPVP- 171
L D ++A + T E A + F E N+ L+ + LI VP
Sbjct: 121 MFACLEMDYEEADNDPTGEAAANTQQTLTFYELDLGLNHVVRKYSEPLEEHGNFLITVPG 180
Query: 172 ----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR------------ 215
P GVLI E I Y N IR I + +D D R
Sbjct: 181 GSDGP--SGVLICSENYITY--KNFGDQPDIRCPIPRRRNDLD-DPERGMIFVCSATHKT 235
Query: 216 ---------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGS 266
+L G + + + +++ VT ++++ +A+ + L +++ S
Sbjct: 236 KSMFLSMFFFLAQTEQGDIFKITLETDEDMVTEIRLKYFDTVPVAAAMCVLKTGFLFVAS 295
Query: 267 SYGDSQLIKLN------------------------LQPDAKGSYVEVLERYVNLGPIVDF 302
+G+ L ++ QP + V +++ +L PI+ F
Sbjct: 296 EFGNHYLYQIAHLGDDDEEPEFSSAMPLEEGDTFFFQPRPLKNLV-LVDELDSLSPIL-F 353
Query: 303 C-VVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTDDP 360
C + DL + Q+ G SLR++R+G+ ++E A EL G +W++R +D
Sbjct: 354 CQIADLANEDTPQLYVACGRGPRSSLRVLRHGLEVSEMAVSELPGNPNAVWTVRRHIEDE 413
Query: 361 FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLV 420
FD +++VSF++ T +L++ + +EE GF T TL C + LVQV +R +
Sbjct: 414 FDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSLLGDDALVQVYPDGIRHI 471
Query: 421 SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKHAQLEY 478
+ R NEWK+P ++ N QV++A GG LVY E+ G L E + ++
Sbjct: 472 RADKR--VNEWKTPGKKTIVKCAVNQRQVVIALTGGELVYFEMDPSGQLNEYTERKEMSA 529
Query: 479 EISCLDIN--PIGENPS-YSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIP-RSVL 534
++ C+ + P GE S + + AVG+ D +VRI SL + + + L + +P +
Sbjct: 530 DVVCMSLANVPPGEQRSRFLAVLAVGL-VDNTVRIISLDPSDCL--QPLSMQALPAQPES 586
Query: 535 LCAFE----------------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPI 578
LC E G YL L +G LL +L+ TG+L+D + LG++P+
Sbjct: 587 LCIVEMGGTEKQDELGERGSIGFLYLNIGLQNGVLLRTVLDPVTGDLSDTRTRYLGSRPV 646
Query: 579 TLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKE 638
L + V A S R + YS + + ++ + + F S P+ +
Sbjct: 647 KLFRVRMQGQEAVLAMSSRSWLSYSYQSRFHLTPLSYETLEFASGFASEQCPEGIVAIST 706
Query: 639 GELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQEQSR-----------TFAICSLKNQSC 685
L I ++ + + + + PL PR+ + H E + T A + + Q
Sbjct: 707 NTLRILALEKLGAVFNQVAFPLQYTPRKFVIHPESNNLIIIETDHNAYTEATKAQRKQQM 766
Query: 686 AEE-------------SEM-------------------------HFVRLLDDQTFEFIST 707
AEE +EM +R+++ +
Sbjct: 767 AEEMVEAAGEDERELAAEMAAAFLNENLPESIFGAPKAGNGQWASVIRVMNPIQGNTLDL 826
Query: 708 YPLDTFEYGCSILSCSFSDDSNVYYC-VGTAYVLPEENEPTKGRILVF--IVEDG-KLQL 763
L+ E S+ C FS+ + +Y VG A L G + +V +G KL+
Sbjct: 827 VQLEQNEAAFSVAVCRFSNTGDDWYVLVGVAKDLILNPRSVAGGFVYTYKLVNNGEKLEF 886
Query: 764 IAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY--- 820
+ + + ++ F G++L + + +++Y D G ++L +C + HI A Y
Sbjct: 887 LHKTPVEEVPAAIAPFQGRVLIGVGKLLRVY-----DLGKKKLLRKC-ENKHI-ANYISG 939
Query: 821 VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFN 880
+QT G ++V D+ +S + YK E + A D W++ +LD D GA+ N
Sbjct: 940 IQTIGHRVIVSDVQESFIWVRYKRNENQLIIFADDTYPRWVTTASLLDYDTVAGADKFGN 999
Query: 881 LFTVR--KNSEGATDE---------ERGRL-------EVVGEYHLGEFVNRFRHGSLVMR 922
+ VR N+ DE +RG L EV+ YH+GE V + +L+
Sbjct: 1000 ICVVRLPPNTNDEVDEDPTGNKALWDRGLLNGASQKAEVIMNYHVGETVLSLQKTTLI-- 1057
Query: 923 LPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLR 965
G ++++ T++G IG++ HE + F + ++ +LR
Sbjct: 1058 -----PGGSESLVYTTLSGGIGILVPFTSHEDHDFFQHVEMHLR 1096
>gi|300120114|emb|CBK19668.2| unnamed protein product [Blastocystis hominis]
Length = 1240
Score = 180 bits (457), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 267/1171 (22%), Positives = 471/1171 (40%), Gaps = 173/1171 (14%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESS---ELITRAMGDVSDRIGRP 62
+YG I L+ FR G DF+ I ++ + VL++D E + ++ G V G P
Sbjct: 57 VYGLIRALKPFRLSGNNTDFVLIGSDSGRIVVLKYDTEKNTFEKIHQETFGKVGVIRGLP 116
Query: 63 TDNGQIGIIDPDCR--LIGLHLY----------DGLFKVIPFDNKGQLKEAFNIRLEELQ 110
GQ +DP+ R +I L+ L V+ D++G + + + + Q
Sbjct: 117 ---GQYLAVDPNGRAFMISGFLFHVSSLGAIEKQNLVYVLNRDSRGNVTISSPLEAHKSQ 173
Query: 111 --VLDIKFL-YGCAKPTIVVLYQDNKDARHVKTYEVALKDK------------DFVEGPW 155
VL + L G P L + +A T E A + + + V W
Sbjct: 174 TIVLTTEGLDVGYDNPMFACLELNYGEADEDSTGEAAQQTEKVLTYYQLDLGLNHVIRKW 233
Query: 156 SQNNLDNGADLLIPVP--PPLCGVLIIGEETIVYCSANAFK---AIPIRPSITKAYGRVD 210
S+ ++ A+LL VP GVL+ E ++YC+ N + IP R + + G +
Sbjct: 234 SK-TVERSANLLARVPGDSGPGGVLVFAENWVMYCNENHPEIRAPIPRREFLPENRGTMI 292
Query: 211 ADGSR--------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVV 262
+ YL+ G L+ + ++ + V L ++ A+T+ +
Sbjct: 293 ITAALFQREGNLFYLVQSEYGDLYKITLSFDSSHVHDLVVKYFDTIPAATTLHITKKGFL 352
Query: 263 YIGSSYGDS--------------QLIKLNLQPDAKGSYVEVLE----------------- 291
S GD Q I L A+ S++ L+
Sbjct: 353 LATSEVGDQYALPLALLTGSCFYQFIALGDDDSAQASFLGSLQPDGSVQVPVFSPRALTN 412
Query: 292 -RYVN----LGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQ- 345
R V+ L P+V DL G Q+ G L++++ G+ I+ + L
Sbjct: 413 LRPVDTLPSLSPLVKIHAEDLRGDGTPQLFALCGRSSRSQLKVLQQGLAISLLSQNPLPY 472
Query: 346 GIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAI 405
G+W+LR + F+V+SF + T +LA+ +E+ GF TL
Sbjct: 473 APSGLWTLRDARTGQ-HRFMVISFNNATIVLAVG--KSVEQVMDTGFKLDESTLATGVLE 529
Query: 406 YNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD 465
N +QV G R + W+ P S+ A N Q+++A G ++Y E+ D
Sbjct: 530 GNSFLQVYPGGFRQIFEDGHT--KVWEPPSRRSITAAAMNLRQIVVALSNGEVLYFEL-D 586
Query: 466 GILTEVKHAQLEY--EISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEH 523
L V+ + + E+ CLD+ + N + AVG + D S R++SL +L+ +
Sbjct: 587 ERLEWVERESMNHREEVICLDLPALAPNSLRAPFLAVG-YGDRSCRVYSLAPNSLLEELS 645
Query: 524 LGG-EIIPRSVLLCAFEGIS------YLLCALG--DGHLLNFLLNMKTGELTD-RKKVSL 573
+ +P ++ L S LL +G +G L+ ++ +G+L++ R L
Sbjct: 646 MQALNAMPSNLTLDTMRMGSGSLARETLLLTVGMENGILMRVEVDPVSGKLSNARSTRFL 705
Query: 574 GTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSL 633
G +P+ L + V A S +P + Y +N L +++ + PF + + +
Sbjct: 706 GPRPVRLFKILAGGNPCVLALSVKPWLCYCANNTLTLTSLVSDPLDLAAPFCNEDCSEGI 765
Query: 634 AIAKEGELTIGTIDDI-QKLHIRSIPLGEHPRR-ICHQEQSRTF-------AICSLKNQS 684
L I IDD+ Q SIPL PR + + Q R A L+ QS
Sbjct: 766 VCVAGTNLNIIRIDDLTQPFTATSIPLSYTPRELVVYPGQPRLLLLETDHNAYSELEKQS 825
Query: 685 CAEESEMHFV--------------------RLLDDQTFEFISTYPLDTFEYGCSILSCSF 724
++ + +V R++D + + + L E S+ C F
Sbjct: 826 FYQQHNVSYVNEYDCGAPIPAEPDKWASCIRVVDAISLQTLERLELADNEAAFSMCVCRF 885
Query: 725 SDDSN-VYYCVGTAYVLP-EENEPTKGRILVF-IVEDGKLQLIAEKETKGAVYSLNAFNG 781
+ + + +GTA L ++G I VF VE LQL+ E +L F+G
Sbjct: 886 ASKGDEPFVVIGTAKNLKIHPRSCSQGFISVFRFVEGHSLQLLHRTEVDEVPAALCEFDG 945
Query: 782 KLLAAINQKIQLYKWMLRDDGTRELQSECGHHG--HILALYVQTRGDFIVVGDLMKSISL 839
KL A I + +++Y D G ++L +C + H + ++ G+ + GDL ++S
Sbjct: 946 KLAAGIGRSVRVY-----DLGKKKLLRKCENKAMPHFVT-KLRAMGERLYAGDLTDNVSF 999
Query: 840 LIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSE---------- 889
+ ++ + E A ++A+++LD + + + NLF R + +
Sbjct: 1000 VKFRKGTNQLVEFADGGIPRSITALDVLDYNTVVCGDKGGNLFVERVDPKVDDDIANPTG 1059
Query: 890 -------GATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGV 942
G + E +LGE V + L+ P D V++GT+ G
Sbjct: 1060 SRSLWNSGLLSAAPNKAEQAASIYLGEIVTSVQKTVLI---PGGD----EVVLYGTIFGT 1112
Query: 943 IGVIASLPHEQYL-FLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIE 1001
IG + +P L L ++ +RK + G + WRS K +DG+L E
Sbjct: 1113 IGALLPMPSRDDLHHLMHIEMYIRKQEPSLVGRDILSWRS-----AYTPMKGIIDGNLCE 1167
Query: 1002 SFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
+F L + + +EI+ + +SV + K++E+L
Sbjct: 1168 TFSMLPQIKQEEIANALVLSVSSIVKKMEDL 1198
>gi|326469377|gb|EGD93386.1| splicing factor 3B subunit 3 [Trichophyton tonsurans CBS 112818]
Length = 1188
Score = 180 bits (457), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 276/1166 (23%), Positives = 478/1166 (40%), Gaps = 186/1166 (15%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
++G I +L FR G ++D++ I ++ + ++++ + R + + G R
Sbjct: 64 VFGIIRSLAAFRLAGSSKDYIIIGSDSGRITIVEY-VPAQNRFNRIHLETFGKSGVRRVV 122
Query: 65 NGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEE-------LQVLDIK 115
GQ +DP R LI + L V+ + + +L + + L LD+
Sbjct: 123 PGQYLAVDPKGRACLIASVEKNKLVYVLNRNAQAELTISSPLEAHRPQTVVFALTALDVG 182
Query: 116 F---LYGCAKPTIVVLYQDNKDARHVKT------YEVALKDKDFVEGPWSQNNLDNGADL 166
+ ++ + + QD + +T YE+ L V W+ + +D A +
Sbjct: 183 YENPIFAALEVEYTEVDQDPTGQAYEETEKMLVYYELDLGLNHVVRR-WA-DPVDRTASM 240
Query: 167 LIPVP-----PPLCGVLIIGEETIVYCSAN--AFKA-IPIR--PSITKAYGRVDADGSR- 215
L VP P GVL+ E+ IVY +N AF+ IP R P+ R G
Sbjct: 241 LFQVPGGADGP--SGVLVCAEDNIVYRHSNQDAFRVPIPRRRGPTENPERKRCITAGVMH 298
Query: 216 -------YLLGDHAGLLH---LLVITHEKEKVTG----LKIELLGETSIASTISYLDNAV 261
+LL G L + ++ E EK TG LK++ +AS++ L +
Sbjct: 299 KMRGAFFFLLQSEDGDLFKVTMEMVEDENEKATGEVKRLKLKYFDTVPLASSLCILKSGF 358
Query: 262 VYIGSSYGD-----------------------SQLIKLNLQP----DAKGSYVEVLERYV 294
+++ S G+ S +I L P + ++E
Sbjct: 359 LFVASETGNQHFYQFEKLGDDDDEIEFISDDYSAVISEPLPPVYFRPRPAENLNLVESIA 418
Query: 295 NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSL 353
+L P++ + ++ + Q+ T G G I+E EL + +W+
Sbjct: 419 SLNPLMAASITNITEEDAPQIYTLCG-----------TGARISEIVESELPSVPSAVWTT 467
Query: 354 RSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVT 413
+ S +D FD ++V+ I ET +EE GF S TL + L+QV
Sbjct: 468 KLSRNDQFDAYIVL--IGET----------VEEVTDTGFLSSAPTLAVQQLGEDSLIQVH 515
Query: 414 SGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTEV- 471
+R + + R NEW +P S+ AT N QV +A G +VY E+ DG L E
Sbjct: 516 PKGIRHIHADQR--VNEWPAPQHRSIVAATTNERQVAIALSSGEIVYFEMDTDGSLAEYD 573
Query: 472 KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIP 530
+ Q+ ++CL + + E S AVG D +VRI SL PD L K P
Sbjct: 574 EKRQMSGTVTCLSLGEVPEGRVRSSFLAVGC-DDSTVRILSLDPDSTLENKSVQALTSAP 632
Query: 531 RSV-LLCAFEGIS-----YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFS 584
++ ++ + S YL L G L +L+ TGEL+D + LG +P+ L + S
Sbjct: 633 SALSIMSMIDSTSGGSTLYLHIGLYSGIYLRTVLDEVTGELSDTRTRFLGVKPVKLFSVS 692
Query: 585 SKNTTHVFAASDRPTVIYS--SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELT 642
K V A S R + YS K + +N + F+S + + + L
Sbjct: 693 VKEQRAVLALSSRSWLGYSDVQTKSFTLTPLNYVGLEWSWNFSSEQCVEGMVGIQGQNLR 752
Query: 643 IGTIDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQ--SCAEESEM-------- 691
I +I+ + L IPL PR + F + N S A ++++
Sbjct: 753 IFSIEKLDNNLLQEPIPLAYTPRNFVRHPEYPLFYVIGSDNNILSPATKAKLLSESTAVN 812
Query: 692 ------------------HF---VRLLDD-QTFEFISTYPLDTFEYGCSILSCSF-SDDS 728
H+ ++++D T +S L+ E SI + SF S +
Sbjct: 813 GDSAELPPEDFGYPRGTNHWASSIQVVDPIHTKSVLSNLELEDNEAAVSIAAVSFTSQED 872
Query: 729 NVYYCVGTAY-VLPEENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVYSLNAFNGKLLA 785
+ VGT ++ T G I ++ E+GK L+ I + + + +L F G+LLA
Sbjct: 873 ETFLVVGTGKDMVVSPRTFTCGFIHIYRFQEEGKELEFIHKTKVEQPPLALLGFQGRLLA 932
Query: 786 AINQKIQLYKWMLRDDGTRELQSEC-GHHGHILALYVQTRGDFIVVGDLMKSISLLIYKH 844
I +++Y D G R+L +C + + +QT+G I+V D+ +S++ ++YK+
Sbjct: 933 GIGPDLRIY-----DLGMRQLLRKCQAQITPRVIVGLQTQGSRIIVSDVQESVTYVVYKY 987
Query: 845 EEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDEERGRLEV 901
+E A+ A D W + ++D + G + N++ +R K SE A ++ G +
Sbjct: 988 QENALIPFADDIIPRWTTCTTMVDYETVAGGDKFGNIWLLRCPTKASEEADEDGSGAHLI 1047
Query: 902 VGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGT--------------VNGVIGV-I 946
+L NR SLV+ D IPT I T + G +G+ +
Sbjct: 1048 HERQYLQGAPNRL---SLVIHFYSQD---IPTSIQKTQLVAGGRDILVWTGLQGTVGMFV 1101
Query: 947 ASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDL 1006
+ + F + L+ L + G +H +R + K +DGDL E+FL L
Sbjct: 1102 PFITRDDVDFFQTLEMQLASQNPPLAGRDHLIYRGY-----YAPCKGVIDGDLCETFLLL 1156
Query: 1007 SRTRMDEISKTMNVSVEELCKRVEEL 1032
+ I+ ++ SV E+ +++ ++
Sbjct: 1157 PNDKKQAIAGELDRSVREIERKISDM 1182
>gi|406602265|emb|CCH46158.1| Pre-mRNA-splicing factor [Wickerhamomyces ciferrii]
Length = 1123
Score = 179 bits (454), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 256/1125 (22%), Positives = 465/1125 (41%), Gaps = 158/1125 (14%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+L+V I+ I ++E F+ G +D+L I ++ F +L+ D + + T
Sbjct: 59 LLNVQIFAIIRSMENFKIAGSGKDYLAITSDSGNFSILELDLNNGKFNTLFNEPYHKSGI 118
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEE------------ 108
R GQ +DP R I L I K +L FN L+
Sbjct: 119 RRLSPGQDLAVDPKGRAI-------LATAIE---KNKLCYVFNRDLDNNLTISSPLEANR 168
Query: 109 ---LQVLDIKFLYGCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNL----- 160
L + I G P + D + +++ + F E N++
Sbjct: 169 SKILTLTSIGLDVGYENPVFATIEIDYSNYEYIENLTEVERFLTFYELDLGLNHIVRRKT 228
Query: 161 ---DNGADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDAD 212
+ ++LL+ VP P GVL+ + I Y + K P + + ++
Sbjct: 229 EKIHDSSNLLLQVPGGNDGP--SGVLVCSKNLISYRNLIGDKVSINIPKLENS-----SN 281
Query: 213 GSRYLLGDHAGLLH-------LLVITHEKEKVTGLKIELLGETSI----------ASTIS 255
+++ AG++H +V T+ + KIELL E SI AS+I
Sbjct: 282 QDSFIV---AGVVHKMKNQFFFIVQTNHGDL---FKIELLEEGSITISYFDTIPLASSII 335
Query: 256 YLDNAVVYIGSSYG------------DSQLIKLNLQPDAKGSYVEVLERYVNL------- 296
L + +Y S YG +S I N +P + ++ E L
Sbjct: 336 ILRSGFLYCDSEYGSKFFYQFEKLGSESPFISSNDEPIEELTFERTEELDNLLLVDVLDS 395
Query: 297 -GPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKG-MWSLR 354
P++D +V+ ++ + SG SL+I++ G+ INE +L GI +W+ +
Sbjct: 396 LNPLIDSKLVN--DDAFTKIYSLSGVKDSSSLKILQYGLSINEIVESDLPGIANKVWTTK 453
Query: 355 SSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTS 414
+ +D FD +LV+SF+ T +L++ + +EE G +T+ N LVQ+ S
Sbjct: 454 LNKNDEFDKYLVISFMDTTLVLSIG--ENVEEITDSGLALNEETIGIQQIGINSLVQIHS 511
Query: 415 GSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDG--ILTEVK 472
+R + + EL NEW+ P G + + Q+ + LVY E+ D ++ +
Sbjct: 512 NGIRNIKNG--ELINEWQPPAGIKILTTSTTNRQIAIGLSNDELVYFEVDDRDRLIEYNE 569
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL---PDLNLITKEHLGGEII 529
+L I L + I E S VG D ++R+ S L L++ + L I
Sbjct: 570 RKELTSRIVSLSLGDIPEGRLRSPFLIVG-CQDSTIRVLSTDPGSTLELLSLQALSS--I 626
Query: 530 PRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTT 589
P +L + ++ L +G + L++ + G+L+D + LG +P++L S
Sbjct: 627 PFDILTLSMNNQLFVNIGLENGVYVRTLIDEQNGQLSDTRIKYLGNKPVSLSKISISGVN 686
Query: 590 HVFAASDRPTVIY-SSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDD 648
V A S++ +++ + N +++ + + FNS D + + L I TIDD
Sbjct: 687 VVLAFSNKTWIVHETKNSTFKINSLLINPLKFGFMFNSEDCIDGIVGVYKKNLIIFTIDD 746
Query: 649 IQK-LHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFIST 707
+ +I SI L P+ + Q I +N S + F+ D Q E +
Sbjct: 747 LDNDFNINSIQLKTTPKNLLKNNQD----IFITQNSSS-----LGFIEKFD-QELELNKS 796
Query: 708 YPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFI-VEDGKLQLIAE 766
L E + F S Y V + + G FI D +L LI
Sbjct: 797 IQLSESETIITSTIVKFESKSTSYLLVSIS------KKHDNGSNEFFINTYDLELNLIHI 850
Query: 767 KETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILA--LYVQTR 824
+ Y++ F GK+L I+ ++L+ D G ++L S+ ++ + ++T+
Sbjct: 851 TKISDIAYAITEFQGKVLIGISNHLRLF-----DMGLKQLLSKANSKIDTISKIVKIETQ 905
Query: 825 GDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTV 884
G +VVGD+ +SI+ L+YK + D ++++++LD + +G + NLF +
Sbjct: 906 GYRVVVGDIRESITFLVYKPKSNEFLTFTDDILPRHITSIKMLDYNTVIGGDKFGNLFIL 965
Query: 885 RKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGS--LVMRLPDSDVGQIPT--------- 933
R + E + + ++ + ++F +G+ +M L + + IPT
Sbjct: 966 RASEESSKISDTNSTFLITK-------DKFLNGAPFKLMNLCNFFIEDIPTSFAKGSFTI 1018
Query: 934 -----VIFGTVNGVIG-VIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKK 987
+I+ + G +G +I L F L+ +R + G + ++R +
Sbjct: 1019 GGKDLIIYTGLQGTVGALIPLLTKSDIKFFINLEKQMRLHKPDLLGRMNLKYRGYYQP-- 1076
Query: 988 TVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
KN +DGDLIE F LS + EIS ++ + E+ K+++E+
Sbjct: 1077 ---VKNVIDGDLIELFNTLSESTKVEISNELDKTPREISKKIDEI 1118
>gi|328770812|gb|EGF80853.1| hypothetical protein BATDEDRAFT_29900 [Batrachochytrium dendrobatidis
JAM81]
Length = 1213
Score = 178 bits (451), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 252/1182 (21%), Positives = 480/1182 (40%), Gaps = 177/1182 (14%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+L ++G I +L FR G ++D+L + ++ + +L+++ +
Sbjct: 53 LLSHQVFGLIRSLAAFRLAGASKDYLVVGSDSGRIVILEYNPAKNTFDKVHQETYGKTGA 112
Query: 61 RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVL---DIK 115
R GQ DP R +IG L ++ D+ +L + + + ++ I
Sbjct: 113 RRVVPGQYLAADPKGRAVMIGAIEKQKLAYILNRDSSLKLTISSPLEAHKTAIICHGIIG 172
Query: 116 FLYGCAKPTIVVLYQDNKDARHVKTYEVALKDK------------DFVEGPWSQNNLDNG 163
G P + D DA T E A + + + W++ +D
Sbjct: 173 VDVGYENPIFASIEVDYSDADQDPTGEAARDAEKMLTYYELDLGLNHIVRKWTEK-IDFS 231
Query: 164 ADLLIPVP-----PPLCGVLIIGEETIV--------YCSANAFKAIPIRPSITKAYGRVD 210
A+ LI +P P GVL+ E I +C + P+ I +V
Sbjct: 232 ANHLIAIPGGDDGP--SGVLVCSEGRITWKHMQKPSFCVPIPQRPDPLTSVIPGTQNKVI 289
Query: 211 ADGS-------RYLLGDHAGLLHLLVITHE--------KEKVTGLKIELLGETSIASTIS 255
S + + + + IT E +V +KI+ +A +S
Sbjct: 290 VVSSVVHRLKRGFFVLAQTEVGDVFKITVECTADSSGNAGQVQAIKIKYFDTIPVAVNLS 349
Query: 256 YLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEV-------------------------- 289
L + ++ S +G+ L ++ D S VE
Sbjct: 350 LLKSGFLFAASEFGNHYLYQIENLGDDDESQVEYSSVDFPQGDSLPDSDSMPVVAFIPRE 409
Query: 290 ------LERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVE 343
++ +L P++D V++L + Q+ G + RI+R+G+ ++E A E
Sbjct: 410 LRNLAPVDEMESLCPLIDAKVLNLTDENSPQIYALCGRGSRSTFRILRHGLDVSEMAVSE 469
Query: 344 LQG-IKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCH 402
L G +W++R S D +D+++++SFI+ T +L++ + +EE G + T T+
Sbjct: 470 LPGNPNAIWTVRRSVSDIYDSYIIISFINATLVLSIG--ETVEEVTDTGVLATTSTITVG 527
Query: 403 DAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLE 462
N LVQV +R + + R +EWK+P S+ A N QV++A +VY E
Sbjct: 528 QLGENALVQVYPQGIRYIRADKRV--SEWKAPTNQSIVSAACNQRQVVVALSNNEIVYFE 585
Query: 463 IG-DGILTEVK-HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PD--LN 517
+ G L E + ++ + CL ++PI ++ A+G D++VRI SL P+ L+
Sbjct: 586 LDVSGHLNEFQDRKEMSSRVLCLSLSPIPTGRLRARFLAIGC-ADLTVRILSLDPESCLH 644
Query: 518 LITKEHLGGEIIPRSVLLCAFE----GIS--YLLCALGDGHLLNFLLNMKTGELTDRKKV 571
++ + L P S+ + G+S +L L +G LL ++ +G ++D +
Sbjct: 645 PLSMQALSAP--PDSLAMIDMPDPTTGVSNLFLNIGLANGVLLKTCVDSGSGNMSDTRMR 702
Query: 572 SLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPD 631
LG++ + L + T + A S RP + ++ + + ++ + + + F S +
Sbjct: 703 FLGSRGVKLFLLKIQGETGLLALSSRPWISFTYHSRSKLMPLSYESLEYGSSFCSEQCTE 762
Query: 632 SLAIAKEGELTIGTIDDIQKLHIR-SIPLGEHPRRICHQEQSRTF--------------- 675
+ L I + + + + SIPL PRR + S+ F
Sbjct: 763 GIVAITGNTLRILNTEKLGSVFKQASIPLKYTPRRFIFDQVSQNFVVIESDHGVFCPSDR 822
Query: 676 ---------------------AICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFE 714
A+ + E +R++ E + LD E
Sbjct: 823 AKILETKATLDADEGTIPEELAVEQFGHSKAGPERWASCIRVISPIHGETLHLEDLDDNE 882
Query: 715 YGCSILSCSF-SDDSNVYYCVGTAY-VLPEENEPTKGRILVF-IVEDG-KLQLIAEKETK 770
I C F S + + VGTA V + T+G + V+ + DG L+ + + K
Sbjct: 883 AAFCISFCIFQSSLTTTHIVVGTASNVNLSSSSFTEGYLRVYKLAPDGTSLEFLHKTPIK 942
Query: 771 GAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY-VQTRGDFIV 829
G + +F G+LL + +++Y D G +++ +C G + + T+G+ I+
Sbjct: 943 GIPKVMCSFQGRLLVGVGSLLRIY-----DLGKKKMLRKCECKGFPTTIVTLHTQGNRII 997
Query: 830 VGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSE 889
+GD +S+ +Y+ + I A D W++A ++D D +G + N+F R ++E
Sbjct: 998 LGDAQESVHYAMYRAFDNRIVIFADDTIPRWVTATCMVDYDTVVGGDKMGNIFVNRLSAE 1057
Query: 890 --GATDE---------ERGRLEVV-------GEYHLGEFVNRFRHGSLVMRLPDSDVGQI 931
DE +RG L+ ++ LGE + SLV G
Sbjct: 1058 VSKGIDEDTTGNQAIFDRGYLQGAPHKVHHEADFFLGETLTSLTKTSLV-------PGGR 1110
Query: 932 PTVIFGTVNGVIG-VIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVD 990
+++ T+ G IG +I + + F + L+ +R + G +H +RSF +
Sbjct: 1111 EILLYTTLMGGIGLLIPFISKDDVDFFQTLEMTMRSECPPLCGRDHLAYRSFYTPVHAI- 1169
Query: 991 AKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
+DGDL E F + + I+++++ SV ++ K++E++
Sbjct: 1170 ----IDGDLCEMFNVMVGDKKRGIAESVDRSVADVGKKLEDM 1207
>gi|58269920|ref|XP_572116.1| hypothetical protein [Cryptococcus neoformans var. neoformans JEC21]
gi|57228352|gb|AAW44809.1| conserved hypothetical protein [Cryptococcus neoformans var.
neoformans JEC21]
Length = 1276
Score = 178 bits (451), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 205/844 (24%), Positives = 379/844 (44%), Gaps = 108/844 (12%)
Query: 245 LGETSIASTISYLDNAVVYIGSSYGDSQLIKLNL-------------------------- 278
LG +S S+I+YLD++ +++ S+ DS L++L
Sbjct: 413 LGASSPPSSITYLDSSHLFVSSAVADSVLLRLPKVESSSSVSSGKGKGRAVTSPIGDQAD 472
Query: 279 -----------QPDAKGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSL 327
+ D G E+LER++N+ P+ D CVV E +V SGA + SL
Sbjct: 473 KWEVLYEIGKDRNDTDGGP-EILERWMNIAPVKDLCVVKDEGGNLSHLVLASGASESNSL 531
Query: 328 RIVRNGIGINEQASVE-LQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEE 386
R+VR+G+G+ E +++ L ++ MW L ST P L++S + T + + L+ E+
Sbjct: 532 RVVRSGVGLEELVTIQGLHDVQKMWPLTDSTAIPR---LLLS--TSTSTILLQLQPEISA 586
Query: 387 TEIEGFCSQTQTLFCHDAIYNQLV-QVTSGSVRLVSSTS-RELRNEWKSPPGYSVNVATA 444
I ++TL +L+ QVT + L S S +L + + + A
Sbjct: 587 IPITDVIFNSETLAAGILPGAELLAQVTPRGLSLWSDLSVGQLEAQVEVDKETEIVCAQV 646
Query: 445 NASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWT 504
A ++A GG LV + + + ++ E+S + I+ + S + + M
Sbjct: 647 TADWAVVAKKGGSLVVFHVSNTGFSPQGTIDVKEEVSAVAISSSSGSSSPIIVISTWM-- 704
Query: 505 DISVRIFSLPDL-NLITKEHLGGEIIPRSVLLCA---FEGISYLLCALGDGHLLNFLLNM 560
+++L + N + + E S+ L + + LL L +G L + LN
Sbjct: 705 -AKTFVYTLSQISNGVDGLSIQSESSATSLQLRSHPFYPAGIQLLSGLDNGLLHIYDLNT 763
Query: 561 K-TGE-----LTDRKKVSLGTQPITLRTFSSKNTTHVFAA---SDRPTVIYSSNKKLLYS 611
+GE + K SLG +P+ L S + + ++R +VI+ S ++ +S
Sbjct: 764 SDSGEAEGLMVKSSKTTSLGLRPLVLHPCESTHGDEKVISVGLTERMSVIFESKDRIEFS 823
Query: 612 NVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG-EHPRRICHQE 670
+VN+K V ++++ P ++ L++ ++ ++KLH+++ G E ++ + +
Sbjct: 824 SVNIKNVMAATSVDTSSGPVFALFSRTSGLSLVKVNSLKKLHVQTCDTGNESISKLTYMD 883
Query: 671 QSRTFAICSLKNQSC--AEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDS 728
+ + A S + + E +FV++ D + E +S++ L E S+ S +
Sbjct: 884 EYKAIACGSTRRTQLRDGDVEEENFVQIRDGTSLEPLSSFSLRGRELVTSLRSVFLT--G 941
Query: 729 NVYYCVGTAYVLPEENEPT-----------KGRILVFIVEDGK------LQLIAEKETKG 771
+Y VGTA++ ++ E + +GR+L+ ++G +++ AE T G
Sbjct: 942 RIYLAVGTAFLPADDGEDSSWDEGNLAVVREGRVLLLEFKEGDAGGGWDIKIKAELATVG 1001
Query: 772 AVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQ-SECGHHGHILAL-------YVQT 823
AVY+L +G L A K+ +++ D EL+ + ++++ +++
Sbjct: 1002 AVYALEEIHGFLAVAAGSKLTIHR---LDHNPVELEETSSWASAYVISSLSVLPPSHIRP 1058
Query: 824 RGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDD--DIYLGAENNFNL 881
G IV GD M+S+ +L +G I + R+ + ++A+ +L D D + ++ NL
Sbjct: 1059 EGALIV-GDGMRSVIVLNVDEGDGMIYDDERNMATHGVTALGLLKDKGDGVVISDAYSNL 1117
Query: 882 FTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNG 941
T R N +LE + L E V RF+ GSLV ++ IP V+F T G
Sbjct: 1118 LTYRLNQ---------KLERAATFGLHEEVTRFQSGSLVPTTTAPEI-IIPDVLFATREG 1167
Query: 942 VIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIE 1001
+G+I L L+ LQ N+ K+ KG G + WR + D F+DGD ++
Sbjct: 1168 RLGIIGELGTRSSRTLDDLQRNMSKIWKGPGEVGWSNWRRAGSNLVGKDTAGFVDGDFVQ 1227
Query: 1002 SFLD 1005
FLD
Sbjct: 1228 KFLD 1231
>gi|145510432|ref|XP_001441149.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124408388|emb|CAK73752.1| unnamed protein product [Paramecium tetraurelia]
Length = 1174
Score = 177 bits (449), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 207/992 (20%), Positives = 409/992 (41%), Gaps = 164/992 (16%)
Query: 159 NLDNGADLLIP-VPPPLCGVLIIGEETIVY-CSANAFKAIPI---------RPSITKAYG 207
++ + A L++P P GV+++ E+ + Y N +P R ++ ++G
Sbjct: 224 HISSHALLMVPNAPDGPGGVIVLCEDFLHYRGPKNQEMKVPYPKNQGMPTDRGAMISSFG 283
Query: 208 RVDADGS-RYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGS 266
S YLL G L L + K++V +++ +A+++ + + ++
Sbjct: 284 FYKQKSSFLYLLQSEYGDLFELSLQFTKDEVHSIRMIYFDTIPVANSLCLMRSKHLFAAC 343
Query: 267 SYGDSQLIKLN----------------------LQPDAKGSYVEVLERYVNLGPIVDFCV 304
G+ K +P K ++ +L+ N I D V
Sbjct: 344 EKGNHCFYKYQREETNTNIICTDSSMALEEQIFFKPQ-KLKHLSLLQELNNFSCISDLKV 402
Query: 305 VDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIK-GMWSLRSSTDDPFDT 363
DL ++G Q+ C A +LR++R+G+ I + A+ LQ G+W+L+ +DP
Sbjct: 403 ADLAKEGNPQIYICCAAGNRSTLRVLRHGLEITQLANTNLQAKPLGIWTLKERYEDPVHK 462
Query: 364 FLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSST 423
++V+S+I++T L + + +++E+ G QT+ I + +Q+ + R +
Sbjct: 463 YIVISYINKT--LVLKIGEKVEQVHDTGLEGTKQTIHVGTLIDDSQIQILTNGYRHIRKN 520
Query: 424 SRELRNEWKSPPGYSVN----VATANASQVLLATGGGHLVYLEIG----DGILTEVKHAQ 475
K P Y ++ +N QV A GG + Y E+ G L E+ Q
Sbjct: 521 --------KPPTDYIIDGKVIKGVSNEKQVAFALAGGDVYYFELDTTSFQGNLIEITKDQ 572
Query: 476 LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSV-L 534
++ EI L++ PI E + V + +D ++R+ SL + + + + +P
Sbjct: 573 MDNEIKALELGPIEEGRQRCKFLCVAL-SDQTIRLLSLEPESCFERGAM--QALPSEAES 629
Query: 535 LCAFEGIS------------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRT 582
LC E S +L L +G L+ ++ +G L+D + LGT+P+
Sbjct: 630 LCMMEMASEQEGQQSFTKQLFLFIGLNNGLLMRTSVDQLSGGLSDTRTRYLGTKPVKCLR 689
Query: 583 FSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELT 642
++ + A S R + Y+++ ++ ++ + + F +AKE +
Sbjct: 690 INANQQQAMLALSSRSWLCYNNSGRIFMQPLSYDYLDYASAF----------LAKEFQGI 739
Query: 643 IGTIDDIQKLHI----------RSIPLGEHPRRICHQEQSRTFAICSLKNQSC------- 685
+GT ++ + +S+ L PR++ E S+ I N+S
Sbjct: 740 VGTNQSTLRIIMPERFGEIFNQQSLDLTYSPRKMIFHEPSKAIFIIESDNRSYNNVEQKV 799
Query: 686 -----------------AEESE-MHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDD 727
AE+ + +R++D E I+ + ++ CSI F+
Sbjct: 800 QEVYQTQEIPEQWNQVQAEQYKWASLIRIVDATKLETINVHQFYENQHACSICYIQFAGY 859
Query: 728 SNVYYCVGTAYVLPEENEPTK----GRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKL 783
Y CVGT L NEPT+ G I FI ++ L+L + Y+L A+ G+L
Sbjct: 860 PEQYVCVGTVKDLV--NEPTRKFSQGFIHTFIYDNKTLKLKHSTQIDEIPYALAAWRGRL 917
Query: 784 LAAINQKIQLYKW----MLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISL 839
L +++Y+ +L+ + L S + + D I V ++ SI L
Sbjct: 918 LVGAGCNLRVYEMGNQRILKKAEIKNLNS--------FITSIMVKEDRIYVAEVSDSIHL 969
Query: 840 LIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR-------------- 885
L Y + E A D +++A +LD + + N+F R
Sbjct: 970 LRYNIRDQTFMELADDILPRYVTASTVLDYHTVIAGDKFENIFVSRVPLDIDEEQEEHPY 1029
Query: 886 ----KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNG 941
K +G + +++ + +++GE + + +LV S+V V++GT G
Sbjct: 1030 EYKMKMDQGCMNGAPFKMDQICNFYVGEVITSLQKIALVS--TSSEV-----VVYGTSMG 1082
Query: 942 VIGVIASLPHEQYL-FLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLI 1000
I + +++ + F L+ LR + + G +H Q+RS K++ +DGDL
Sbjct: 1083 SIAALYPFDNKEDIDFFLHLEMYLRVEHQPLSGRDHMQFRSAYGPCKSI-----IDGDLC 1137
Query: 1001 ESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
E F ++ + +++ + + ++ K++E++
Sbjct: 1138 EQFGNMQYNKQRTVAEEFDRTPADIIKKLEDI 1169
>gi|260819270|ref|XP_002604960.1| hypothetical protein BRAFLDRAFT_126697 [Branchiostoma floridae]
gi|229290289|gb|EEN60970.1| hypothetical protein BRAFLDRAFT_126697 [Branchiostoma floridae]
Length = 1049
Score = 177 bits (448), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 233/1010 (23%), Positives = 409/1010 (40%), Gaps = 162/1010 (16%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+L V I+G I +L FR G +D++ T+ + +L++ ++
Sbjct: 53 LLSVEIFGVIRSLMSFRLTGGTKDYIVCGTDSGRIVILEYLPAKNKFDKVHQETFGKSGC 112
Query: 61 RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL- 117
R GQ +DP R +IG L ++ D +L + + + L +
Sbjct: 113 RRIVPGQYLAVDPKGRALMIGAVEKQKLVYILNRDAAARLTISSPLEAHKSNTLCYHLVG 172
Query: 118 --YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNGA 164
G P L D +DA + T E + + F E N+ L+
Sbjct: 173 VDIGFENPMFACLEIDYEDADNDPTGESSQSTQQCLTFYELDLGLNHVVRKYSEPLEEHG 232
Query: 165 DLLIPVP---PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR------ 215
+ LI VP GVLI E I Y N IR I + +D D R
Sbjct: 233 NFLIAVPGGSEGPSGVLICSENYITY--KNFGDQPDIRSPIPRRRNDLD-DPERGMIFVC 289
Query: 216 -----------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYI 264
+L G + + + +++ VT +K++ +A+++ L +++
Sbjct: 290 SATHKTKSMFFFLAQTEQGDIFKITLETDEDMVTEIKLKYFDTVPVATSMCVLKTGFLFV 349
Query: 265 GSSYGDSQLIKL-------------NLQPDAKGSYVEVLERYV----------NLGPIVD 301
S +G+ L ++ + P +G R + +L PI+
Sbjct: 350 ASEFGNHYLYQIAHLGDDDEEPEFSSAMPLEEGDTFLFTPRGLKNLVLVDEMDSLSPIMS 409
Query: 302 FCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTDDP 360
+ DL + Q+ +G S R++R+G+ ++E A EL G +W+++ + DD
Sbjct: 410 CQIADLANEDTPQLYVANGRGPRSSFRVLRHGLEVSEMAVSELPGNPNAVWTVKKNVDDE 469
Query: 361 FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLV 420
FD ++VVSF++ T +L++ + +EE GF T TL C + L+Q+ +R +
Sbjct: 470 FDAYIVVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSLIGEDALLQIYPEGIRHI 527
Query: 421 SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKHAQLEY 478
S R NEW++P ++ N QV++A GG LVY E+ G L E + ++
Sbjct: 528 RSDKR--VNEWRTPGKKTIVKCAVNQRQVVIALTGGELVYFEMDPTGQLNEYTERKEMSA 585
Query: 479 EISCLDIN--PIGENPSYSQIAAVGMWTDISVRIFSLPDLNLI----------TKEHL-- 524
E+ C+ + P+GE + AVG+ TD +VRI SL + + T E L
Sbjct: 586 EVICMALANVPVGEQ--RCRFLAVGL-TDDTVRIISLDPADCLQPLSMQALPATPESLCI 642
Query: 525 ----GGEII----PRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQ 576
GGE I PR+ +L +L L +G LL +L+ TG+L+D + LG++
Sbjct: 643 LQMGGGEGIDDTGPRNTML-------FLNIGLQNGVLLRTVLDQITGDLSDTRTRYLGSR 695
Query: 577 PITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIA 636
P+ L + V A S R + Y+ + + ++ + + + F S P+ +
Sbjct: 696 PVKLFQVKMQGNEAVLAMSSRSWLSYTYQNRFHLTPLSYETLEYASGFASEQCPEGIVAI 755
Query: 637 KEGELTIGTIDDIQKLHIR-SIPLGEHPRRICHQEQSR------------TFAICSLKNQ 683
L I ++ + + + S PL PR+ QS T A + + Q
Sbjct: 756 SANTLRILALEKLGAVFNQISTPLMYTPRKFVIDTQSNNIIMIETDHNSYTEATKAQRKQ 815
Query: 684 SCAEE-------------SEM-------------------------HFVRLLDDQTFEFI 705
AEE +EM +R+L+ +
Sbjct: 816 QMAEEMVEAAGEEEKELAAEMAAAFINENLPEATFGSPKAGPGMWASVIRVLNPIQGNTL 875
Query: 706 STYPLDTFEYGCSILSCSFSD-DSNVYYCVGTAYVLPEENEPTKGRIL--VFIVEDG-KL 761
L+ E SI C F++ + + VGTA + G L +V +G KL
Sbjct: 876 DLIQLEQNEAAFSIAICKFANRGEDTFVVVGTAKDMTMNPRTCSGGFLHTYQLVNNGEKL 935
Query: 762 QLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSEC-GHHGHILALY 820
+LI + + ++++F G++L I + +++Y D G ++L +C H +
Sbjct: 936 ELIHKTPVEDVPGAISSFQGRILVGIGRLLRIY-----DLGKKKLLRKCENKHVPNFIMS 990
Query: 821 VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDD 870
+ T G I V D+ +S + YK +E + + D N W++ LD D
Sbjct: 991 IHTMGHRIFVSDVQESFHFIRYKRQENQLIIFSDDTNYRWITCSTQLDYD 1040
>gi|118400837|ref|XP_001032740.1| CPSF A subunit region family protein [Tetrahymena thermophila]
gi|89287084|gb|EAR85077.1| CPSF A subunit region family protein [Tetrahymena thermophila SB210]
Length = 1197
Score = 177 bits (448), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 280/1177 (23%), Positives = 478/1177 (40%), Gaps = 194/1177 (16%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
++G I T+ +FR HG+ QD++ + ++ + +L+++ E E + + + G R
Sbjct: 60 VFGIIRTINIFRLHGQTQDYIIVGSDSGRIVILKFNKEKGEF-EKVHQETYGKTGCRRIV 118
Query: 65 NGQIGIIDPD---CRLIGLHLYDGLFKVIPFDNKGQLK-----EA------------FNI 104
GQ +DP C + ++ ++ DN+ +L EA ++
Sbjct: 119 PGQYIAVDPKGRACMFAAIEKQKFVY-ILNRDNEDKLTISSPLEAHKSHTLCYDIIGIDV 177
Query: 105 RLEELQVLDIKFLYGCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGA 164
E Q I+ YG + ++ + + + YE+ V+ S + A
Sbjct: 178 GYENSQFACIECDYGEVEQKDSAVH-TGQIQKQLTIYELDFGLNHVVKK--SSEPIPETA 234
Query: 165 DLLIPVP-----PPLCGVLIIGEETIVYCSANA-FKA-IPIRPSITK----AYGRV---- 209
+L+PVP P GVLI E+ ++Y N K P R S +K ++G +
Sbjct: 235 HMLLPVPGLPDGPG--GVLIACEDMLIYKGNNQELKCRYPQRFSTSKIMINSFGFLKQKS 292
Query: 210 -DADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSY 268
+ + S +LL G L L I +KV GL ++ A +I ++
Sbjct: 293 KNQNLSFFLLQTELGDLFKLQIKSTDDKVHGLTLQYFDSIPTAISICITKTGHLFAACEK 352
Query: 269 GDSQLIKLNL------QPDAKGSY---------------VEVLERYVNLGPIVDFCVVDL 307
GD L + +P S +E ++ NL I D V DL
Sbjct: 353 GDHLLYRFKSLGEQEERPTKTTSADFEQQAIFIPRKLVNLEQVDSLDNLSAISDIKVNDL 412
Query: 308 ERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQ-GIKGMWSLRSSTDDPFDTFLV 366
+G Q+ A +LRI+R+G+ ++E A+ L G+W+++ D+ ++V
Sbjct: 413 TGEGNPQIYALCAAGSRSTLRILRHGLQVSEYATSRLPLRPNGIWTIKQRHDEGLTKYIV 472
Query: 367 VSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRE 426
+S +T +L+ ++D + + +QTL N ++QVT S R +
Sbjct: 473 LSSSKKTYVLS--IKDTISAVNDSSLDTNSQTLHAGILENNCIIQVTPESFRQIRDNRVT 530
Query: 427 LRNEWKSPPGYSVNVATANASQVLLA------TGGGHLVYLE--IGDGILTEVKHAQLEY 478
L +K+ V A +N+ Q+ LA GG ++Y E +G + + A+L
Sbjct: 531 L---YKTESNKFVK-ACSNSRQIALALQGGQGQPGGDIIYFEFDLGGQLKRIEEKAKLTS 586
Query: 479 EISCLDIN--PIGENPSYSQIAAVGMWTDISVRIFSLPD---LNLITKEHLGGEIIPRSV 533
+I C+DI PIG P + + AVG D +VRI SL L ++ + L G IP SV
Sbjct: 587 DIECIDIGEVPIGR-PKF-KFLAVG-CKDQTVRILSLEQDQYLQRLSLQALPG--IPESV 641
Query: 534 LLC-----------AFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRT 582
L A E YL L +G LL ++ TG L+D + L PI
Sbjct: 642 SLIEMKRGTGTEQEAEEYQLYLFVGLQNGILLRATVDQVTGSLSDVRTRVLSAAPIRTCK 701
Query: 583 FSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCP--FNSAAFPDSLAIAKEGE 640
+ + + A S RP + Y+ K ++ + + P FN ++ GE
Sbjct: 702 YVVQGQPALLALSTRPWICYTYLGKYQMVPISYDMLEYAAPAVFNMQGEQKQCIVSTSGE 761
Query: 641 LTIGTIDDIQK----LHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHF--- 693
I + QK L+ + I L PR++ E+S + N+ E +
Sbjct: 762 SV--RIIEPQKYGDLLNQQVIKLRYSPRKMAIHERSHNIVLIESDNKVYNESQKKQLIEQ 819
Query: 694 ---------------------------VRLLDDQTFEFISTYPLDTFEYGCSILSCSFSD 726
VR++D T + ++ Y L+ E+ S+ +F +
Sbjct: 820 YYSNQNKGDLATQVDLLTAPSGNWASCVRIVDPVTRKTLNIYELENNEHALSMCMVNFDN 879
Query: 727 DSNVYYCVGTAYVLPEENEPTKGRILVFI------VEDGKLQLIAEKETKGAVYSLNAFN 780
Y CVGT V + P + + +I + L+L+ + E +L+A
Sbjct: 880 KDETYVCVGT--VKDYQVHPNRNFSVCYINTFQLNEKLNTLELLHKTEIFEIPGALHAHK 937
Query: 781 GKLLAAINQKIQLYKW----MLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKS 836
GKLLA ++ Y+ +L+ + LQS +QT GD I V + S
Sbjct: 938 GKLLAGCGTFLRYYELGKKKLLKKAEIKGLQSPVNG--------IQTFGDRIYVSMVADS 989
Query: 837 ISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEER 896
+ +L Y+ ++ E D WMS+ ++LD Y+G + N F R S + E
Sbjct: 990 VHVLKYRSKDQTFYEVCDDILPRWMSSFQVLDYHTYIGGDKFENAFVCRVPSNADEEMEE 1049
Query: 897 G------------------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGT 938
+ E + ++ GE + + SLV S +I V++GT
Sbjct: 1050 NPMAYKLRWETGYLNGAPFKTEQICHFYAGEVITTLQKASLV-----STGNEI--VLYGT 1102
Query: 939 VNGVIGVIASLP---HEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFL 995
G IG A LP E F L+ LR + + G +H +RSF K+V +
Sbjct: 1103 SMGSIG--AYLPFQTKEDIDFFIHLEMYLRLDVLPLAGRDHVMFRSFYGPVKSV-----I 1155
Query: 996 DGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
DGDL E F+ LS + ++ M+ E+ K+++E+
Sbjct: 1156 DGDLCEQFIKLSSGKQKVLADEMDRHPHEVHKKLDEI 1192
>gi|156406895|ref|XP_001641280.1| predicted protein [Nematostella vectensis]
gi|156228418|gb|EDO49217.1| predicted protein [Nematostella vectensis]
Length = 1171
Score = 174 bits (440), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 260/1183 (21%), Positives = 475/1183 (40%), Gaps = 221/1183 (18%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+L + ++G + L FR G ++DF+ + ++ + +L++ S + + + + G
Sbjct: 53 VLAIEMFGIVRDLIAFRLTGGSKDFIVVGSDSGRIVILEY-IPSKNVFEKVHQETFGKSG 111
Query: 61 -RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
R GQ DP R +IG L ++ D+ +L + + + L +
Sbjct: 112 CRRIVPGQYLAADPKGRAIMIGAIEKQKLVYIMNRDSAARLTISSPLEAHKSHTLVYHMV 171
Query: 118 ---YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNG 163
G P L D ++A T E A + F E N+ L+
Sbjct: 172 GVDAGFENPMFACLEMDYEEADTDHTGEAAQTTQQTLTFYELDLGLNHVVRKYSEPLEEH 231
Query: 164 ADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR--- 215
+LLI VP P GVL+ E + Y N IR I + +D D R
Sbjct: 232 GNLLIAVPGGSDGP--SGVLVCSENYLTY--KNFGDQPDIRMPIPRRKYDLD-DPERGML 286
Query: 216 --------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAV 261
+L+ G + + + +++ VT ++++ A ++ L
Sbjct: 287 FICSATHKTKSMFFFLVQTEQGDIFKITLETDEDMVTEIRMKYFDTVPPAVSMCVLKTGF 346
Query: 262 VYIGSSYGDSQLIKLN------------------------LQPDAKGSYVEVLERYVNLG 297
++ S +G+ L ++ QP + V +++ +L
Sbjct: 347 LFTASEFGNHYLYQIAHLGDDDDELEFSSTMELEEGTTFFFQPRGLKNLV-LVDELESLA 405
Query: 298 PIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSS 356
PI++ + DL + Q+ G SLR++R+G+ ++E A EL G +W+++ +
Sbjct: 406 PIMNCQIADLANEDTPQLFAACGRGSRSSLRVLRHGLEVSEMAVSELPGNPNAVWTVKHT 465
Query: 357 TDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGS 416
T D FD ++VVSFI+ T +L++ + +EE GF T TL C LVQ+
Sbjct: 466 TADEFDAYIVVSFINATLVLSIG--ETVEEVTDSGFLGTTPTLSCSQLGEEALVQIYPDG 523
Query: 417 VRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTE-VKHA 474
+R + + R NEWK+P ++ + N QV++A GG +VY E+ G L E +
Sbjct: 524 IRHIRADKR--VNEWKAPGKKTIVKSAVNERQVVIALTGGEVVYFEMDSSGQLNEYTERK 581
Query: 475 QLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGG-EIIPRSV 533
++ ++ C+ + I S+ A+G+ +D +VRI SL + +T + +P S
Sbjct: 582 EMSADVQCMALGTILHGELRSRFLAIGL-SDNTVRIISLDPQDCLTPLSMQALPAMPES- 639
Query: 534 LLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFA 593
LC E G + LN+ E + L +P V A
Sbjct: 640 -LCIVEMGGKEGAESEGGSMGGLFLNIGLSE------ICLTLEP-----------GEVLA 681
Query: 594 ASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLH 653
S R + Y+ ++ + ++ + + + F+S P+ + L I ++ + +
Sbjct: 682 MSSRSWLSYTHQSRVHLTPLSYETLEYASGFSSEQCPEGIVAIAGNTLRILALEKLGAVF 741
Query: 654 IR-SIPLGEHPRRICHQEQSRTFAICSL------------KNQSCAEE------------ 688
+ + PL PR+ + T + + Q AEE
Sbjct: 742 NQVATPLKYTPRKFVIHSPTNTLVVIETDHNTYTDAAREERKQQMAEEMVEMASEEEKEL 801
Query: 689 -------------SEMHF-------------VRLLDDQTFEFISTYPLDTFEYGCSILSC 722
SE F +R+LD E LD E S+ C
Sbjct: 802 AAQAAAEFLSEELSEQQFGSPKAGTGMWASVIRVLDPIKGETHDVVQLDQNEAAFSVCVC 861
Query: 723 SFS---DDSNVYYCVGTAYVLP-EENEPTKGRILVF-IVEDG----KLQLIAEKETKGAV 773
SFS DD+ + VGTA + + G + + +V+ KL+ + E
Sbjct: 862 SFSARPDDT--FVIVGTAKDMALSPRSCSTGYLYAYRLVQQPSGGIKLEFQHKTEVDDVP 919
Query: 774 YSLNAFNGKLLAAINQKIQLY----KWMLRDDGTRELQSECGHHGHILALYVQTRGDFIV 829
+L F G+LLA I + +++Y K MLR ++L + + + G IV
Sbjct: 920 AALEPFQGRLLAGIGRLLRVYDIGKKKMLRKCENKKLPN--------FICNINSMGTRIV 971
Query: 830 VGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSE 889
V D+ +S + YK + + A D N W++ LD D GA+ N+FTVR S
Sbjct: 972 VSDIQESFHFVKYKPRDNQLVVFADDVNPRWLTCCCYLDYDTLAGADKFGNIFTVRLPS- 1030
Query: 890 GATDE------------ERG-------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQ 930
G++D+ +RG +LE + Y +GE V + +L+ G
Sbjct: 1031 GSSDDVDEDPTGTKAFWDRGLLNGAQQKLETMCNYFVGETVLSLQKATLI-------PGG 1083
Query: 931 IPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTV 989
+++F T++G +G++ E F + L+ +LR +E+ ++
Sbjct: 1084 SESLVFTTLSGGVGMLVPFTSREDIDFFQHLEMHLR------------------SEQPSL 1125
Query: 990 DAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
+ +DGDL E + L ++ I++ ++ + E+ K++E++
Sbjct: 1126 ---SVVDGDLCEHYNSLDYSKRRTIAEELDRTPAEVSKKLEDI 1165
>gi|154295205|ref|XP_001548039.1| pre-mRNA splicing factor 3b [Botryotinia fuckeliana B05.10]
Length = 1020
Score = 173 bits (439), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 224/947 (23%), Positives = 392/947 (41%), Gaps = 137/947 (14%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
++G I + FR G +D++ I ++ + ++++ ++ + R
Sbjct: 64 VFGIIRAIAAFRLAGSNKDYIIITSDSGRITIVEFVPAQNKFNRLHLETFGKSGVRRVVP 123
Query: 66 GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YGC 120
GQ +DP R L + L V+ +++ +L + + + Q L + G
Sbjct: 124 GQYLAVDPKGRACLTASVEKNKLVYVLNRNSQAELTISSPLEAHKAQTLVFALVALDVGY 183
Query: 121 AKPTIVVLYQDNKDARHVKT-------------YEVALKDKDFVEGPWSQNNLDNGADLL 167
A P L D ++ T YE+ L V WS+ +D A++L
Sbjct: 184 ANPVFAALEIDYGESDQDPTGQAYDEIEKQLVYYELDLGLNHVVRK-WSEP-VDRTANIL 241
Query: 168 IPVPPPL---CGVLIIGEETIVYCSAN--AFK-AIPIRPSITKAYGRVD--ADGSRYLLG 219
VP GVL+ GE+ I Y +N AF+ AIP R T+ R G + L
Sbjct: 242 FQVPGGTDGPSGVLVCGEDNITYRHSNQEAFRVAIPRRRGATEDPQRKRNIVAGVMHKLK 301
Query: 220 DHAGLLHLLVITHEKE------------------KVTGLKIELLGETSIASTISYLDNAV 261
AG L+ T + + +V LKI+ +A+++ L +
Sbjct: 302 GAAGAFFFLLQTDDGDLFKITIEMVEDDNGQPTGEVRRLKIKYFDTVPVATSLCILKSGF 361
Query: 262 VYIGSSYGDSQLIKL---------------NLQPDAKGSYVEV------------LERYV 294
+++ S +G+ Q + + A SY + +E
Sbjct: 362 LFVASEFGNHQFYQFEKLGDDDEETEFVSDDFPTGAHESYTPIYFHPRPAENLSLVESID 421
Query: 295 NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSL 353
++ P++D V +L + Q+ + G + R +++G+ ++E EL G+ +W+
Sbjct: 422 SMNPLMDCKVANLTDEDAPQIYSICGTGARSTFRTLKHGLEVSEIVESELPGVPSAVWTT 481
Query: 354 RSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVT 413
+ + D +D ++++SF + T +L++ + +EE GF S TL + L+QV
Sbjct: 482 KLTRGDTYDAYIILSFSNGTLVLSIG--ETVEEVTDTGFLSSAPTLAVQQLGEDSLIQVH 539
Query: 414 SGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTEV- 471
+R + + R NEW +P S+ AT N QV +A G +VY E+ DG L E
Sbjct: 540 PKGIRHIRADHR--VNEWAAPQHRSIVAATTNERQVAVALSSGEIVYFEMDSDGSLAEYD 597
Query: 472 KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIP 530
+ ++ ++CL + + E SQ AVG D +VRI SL PD L K P
Sbjct: 598 EKKEMSGTVTCLSLGEVPEGRQRSQFLAVGC-DDSTVRILSLDPDSTLENKSVQALTSPP 656
Query: 531 RSVLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFS 584
++ + A S YL L G L +L+ TGEL+D + LG +P+ L +
Sbjct: 657 NALSIMAMSDSSSGGSTLYLHIGLYSGVYLRTVLDEVTGELSDTRTRFLGPKPVKLFRVA 716
Query: 585 SKNTTHVFAASDRPTVIYSS--NKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELT 642
+ T V A S RP + YS K + + ++ + F+S + + + L
Sbjct: 717 VQGQTAVLALSSRPWLGYSDPVTKGFMLTPLDYPALEWGWNFSSEQCTEGMVGIQGQNLR 776
Query: 643 IGTIDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQT 701
I +I+ + L SIPL PRR + F + N + ++ +LL+D +
Sbjct: 777 IFSIEKLTDNLLQESIPLTYTPRRFVRHPEHACFYVIEADNNILSPATKQ---KLLEDPS 833
Query: 702 F-----------EF------------------------ISTYPLDTFEYGCSILSCSF-S 725
EF +ST L+ E SI +F S
Sbjct: 834 VTNGDATVLPPEEFGYPRGTNHWASCISVVDPVTEKKVLSTIHLEDNECAVSIAVVAFAS 893
Query: 726 DDSNVYYCVGTAY-VLPEENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVYSLNAFNGK 782
+ + CVGT ++ + G I V+ EDGK L+ I + + + +L AF G+
Sbjct: 894 QEDETFLCVGTGKDMVVSPRSFSAGFIHVYRFHEDGKELEFIHKTKVEEPPMALLAFQGR 953
Query: 783 LLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIV 829
LLA + + +++Y +R R+ QSE + + + +QT+G I+
Sbjct: 954 LLAGVGKDLRIYDLGMR-QLLRKAQSEVAPN---MIVGLQTQGSRII 996
>gi|225448823|ref|XP_002282354.1| PREDICTED: splicing factor 3B subunit 3-like [Vitis vinifera]
Length = 1214
Score = 172 bits (437), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 176/754 (23%), Positives = 329/754 (43%), Gaps = 88/754 (11%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+L V I+G I +L FR G +D++ + ++ + +L+++ E + + + + + G
Sbjct: 52 ILSVEIFGAIRSLAQFRLTGSQKDYIVVGSDSGRIVILEYNKEKN-VFDKIHQETFGKSG 110
Query: 61 -RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
R GQ IDP R +IG L V+ D +L + + + +
Sbjct: 111 CRRIVPGQYLAIDPKGRAVMIGACEKQKLVYVLNRDTVARLTISSPLEAHKSHTIVYSIT 170
Query: 118 ---YGCAKPTIVVLYQDNKDA-------------RHVKTYEVALKDKDFVEGPWSQNNLD 161
G P + D +A +H+ YE+ L + V WS+ +D
Sbjct: 171 GVDCGFDNPIFAAIELDYSEADQDSTGQAASEAQKHLTFYELDLG-LNHVSRKWSEQ-VD 228
Query: 162 NGADLLIPVP---PPLCGVLIIGEETIVYCS---ANAFKAIPIRPSITKAYGR--VDADG 213
NGA++L+ VP GVL+ E ++Y + + IP R + G V A
Sbjct: 229 NGANMLVTVPGGGDGPSGVLVCAENFVIYKNQGHPDVRAVIPRRADLPAERGVLIVSAAT 288
Query: 214 SR------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSS 267
R +LL G + + + HE ++++ LKI+ + S++ L + ++ S
Sbjct: 289 HRQKSMFFFLLQTEYGDVFKVTLEHENDRISELKIKYFDTIPVTSSMCVLKSGFLFAASE 348
Query: 268 YGDSQLIKLN--------------------------LQPDAKGSYVEVLERYVNLGPIVD 301
+G+ L + QP + V + ++ +L PI+D
Sbjct: 349 FGNHGLYQFQAIGDDADVESSSASLMETEEGFQPVFFQPRGLKNLVRI-DQVESLMPIMD 407
Query: 302 FCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSLRSSTDDP 360
V +L + Q+ G S+RI+R G+ I+E A +L G+ +W+++ + +D
Sbjct: 408 MKVSNLFEEETPQIFALCGRGPRSSIRILRPGLAISEMAVSQLPGVPSAVWTVKKNVNDE 467
Query: 361 FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLV 420
FD ++VVSF + T +L++ + +EE GF T +L + L+QV +R +
Sbjct: 468 FDAYIVVSFANATLVLSIG--ETVEEVSDSGFLDTTPSLAVSLIGDDSLMQVHPSGIRHI 525
Query: 421 SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTEVKHAQLEYE 479
R NEW++P ++ +N QV++A GG L+Y E+ G L EV+ ++ +
Sbjct: 526 REDGR--INEWRTPGKRTIVKVGSNRLQVVIALSGGELIYFEVDMTGQLMEVEKHEMSGD 583
Query: 480 ISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PD--LNLITKEHLGGEIIPRSVLLC 536
++CLDI P+ E S+ AVG + D ++RI SL PD + +++ + + P S+L
Sbjct: 584 VACLDIAPVPEGRQRSRFLAVGSY-DNTIRILSLDPDDCMQILSVQSVSSP--PESLLFL 640
Query: 537 AFEGIS------------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFS 584
+ +L L +G L +++M TG+L+D + LG + L +
Sbjct: 641 EVQASVGGEDGADHPASLFLNAGLQNGVLFRTVVDMVTGQLSDARSRFLGLRAPKLFSVI 700
Query: 585 SKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIG 644
+ + S RP + Y L + ++ + + F+S + + L +
Sbjct: 701 VRGRRAMLCLSSRPWLGYIHQGHFLLTPLSYETLEFAASFSSDQCAEGVVAVAGDALRVF 760
Query: 645 TIDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAI 677
TI+ + + + IPL PR+ Q + + +
Sbjct: 761 TIERLGETFNETVIPLRYTPRKFVLQPKRKLLVV 794
Score = 97.1 bits (240), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 100/370 (27%), Positives = 168/370 (45%), Gaps = 54/370 (14%)
Query: 694 VRLLDDQTFEFISTYPLDTFEYGCSILSCSFSD-DSNVYYCVGTAYVLP--EENEPTKGR 750
+R+LD +T L E SI + +F D + VGTA L + G
Sbjct: 863 IRILDPRTATTTCLLELQDNEAAFSICTVNFHDKEYGTLLAVGTAKSLQFWPKRSFDAGY 922
Query: 751 ILVF-IVEDGK-LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQS 808
I ++ +EDGK L+L+ + + +G +L F G+LLA I ++LY D G R L
Sbjct: 923 IHIYRFLEDGKSLELLHKTQVEGVPLALCQFQGRLLAGIGSVLRLY-----DLGKRRLLR 977
Query: 809 ECGHH---GHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVE 865
+C + I++++ T D I VGD+ +S Y+ +E + A D W++A
Sbjct: 978 KCENKLFPNTIVSIH--TYRDRIYVGDIQESFHYCKYRRDENQLYIFADDSVPRWLTASY 1035
Query: 866 ILDDDIYLGAENNFNLFTVR------------------KNSEGATDEERGRLEVVGEYHL 907
+D D GA+ N++ VR K +G + ++E + ++H+
Sbjct: 1036 HIDFDTMAGADKFGNIYFVRLPQDVSDEVEEDPTGGKIKWEQGKLNGAPNKVEEIVQFHV 1095
Query: 908 GEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIG-VIASLPHEQYLFLEKLQTNLRK 966
G+ V + SL+ G +I+GTV G +G ++A + F L+ ++R+
Sbjct: 1096 GDVVTCLQKASLI-------PGGGECIIYGTVMGSLGALLAFTSRDDVDFFSHLEMHMRQ 1148
Query: 967 VIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESF----LDLSRTRMDEISKTMNVSV 1022
+ G +H +RS K+ +DGDL E F LDL R DE+ +T
Sbjct: 1149 EHPPLCGRDHMAYRS-----AYFPVKDVIDGDLCEQFPTLPLDLQRKIADELDRTPG--- 1200
Query: 1023 EELCKRVEEL 1032
E+ K++EE+
Sbjct: 1201 -EILKKLEEV 1209
>gi|145549784|ref|XP_001460571.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124428401|emb|CAK93174.1| unnamed protein product [Paramecium tetraurelia]
Length = 1178
Score = 172 bits (436), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 201/984 (20%), Positives = 404/984 (41%), Gaps = 162/984 (16%)
Query: 166 LLIP-VPPPLCGVLIIGEETIVY-CSANAFKAIPI---------RPSITKAYGRVDADGS 214
L++P P GV+++ E+ + Y N +P R ++ ++G S
Sbjct: 235 LMVPNAPDGPGGVIVLCEDFLHYRGPKNQEMKVPYPKNQGMPADRGAMISSFGFYKQKSS 294
Query: 215 -RYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQL 273
YLL G L L + K++V +++ +A+++ + + ++ G+
Sbjct: 295 FLYLLQSEYGDLFELSLQFTKDEVHSIRMIYFDTIPVANSLCLMRSKHLFAACEKGNHCF 354
Query: 274 IKLNLQPDA---------------------KGSYVEVLERYVNLGPIVDFCVVDLERQGQ 312
K + + K ++ +L+ N I D V DL ++G
Sbjct: 355 YKYQREETSTNIICTDSSMALEEQIFFKPQKLKHLSLLQELNNFSCISDLKVADLAKEGN 414
Query: 313 GQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIK-GMWSLRSSTDDPFDTFLVVSFIS 371
Q+ C A +LR++R+G+ I + A+ LQ G+W+L+ +DP ++V+S+I+
Sbjct: 415 PQIYICCAAGNRSTLRVLRHGLEITQLANTNLQAKPLGIWTLKERYEDPVHKYIVISYIN 474
Query: 372 ETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEW 431
+T L + + +++E+ G QT+ I + +Q+ + R +
Sbjct: 475 KT--LVLKIGEKVEQVHDTGLEGTKQTIHVGTLIDDSQIQILTNGYRHIRKN-------- 524
Query: 432 KSPPGYSVN----VATANASQVLLATGGGHLVYLEIG----DGILTEVKHAQLEYEISCL 483
K P Y ++ +N QV A GG + Y E+ G L E+ Q++ EI L
Sbjct: 525 KPPTDYIIDGKVIKGVSNEKQVAFALAGGDVYYFELDTTSVSGNLIEITKDQMDNEIKAL 584
Query: 484 DINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSV-LLCAFEGIS 542
++ PI E + V + +D ++R+ SL + + + + +P LC E S
Sbjct: 585 ELGPIEEGRQRCKFLCVAL-SDQTIRLLSLEPESCFERGAM--QALPSEAESLCMMEMAS 641
Query: 543 ------------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTH 590
+L L +G L+ ++ +G L+D + LGT+P+ ++
Sbjct: 642 EQEGQQSFTKQLFLFIGLNNGLLMRTSVDQLSGGLSDTRTRYLGTKPVKCLRITANQQQA 701
Query: 591 VFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQ 650
+ A S R + Y+++ ++ ++ + + F +AKE + +GT
Sbjct: 702 MLALSSRSWLCYNNSGRIFMQPLSYDYLDYASAF----------LAKEFQGIVGTNQSTL 751
Query: 651 KLHI----------RSIPLGEHPRRICHQEQSRTFAICSLKNQSC--------------- 685
++ + +S+ L PR++ E S+ I N+S
Sbjct: 752 RIIMPERFGEIFNQQSLDLTYSPRKMIFHEPSKAIFIIESDNRSYNNIQQKVQEVYQTEE 811
Query: 686 ---------AEESE-MHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVG 735
AE+ + +R++D E ++ + ++ CSI F+ Y CVG
Sbjct: 812 LPEQWNQIQAEQYKWASLIRIVDATKLETVNVHQFYENQHACSICYIQFAGYPEQYICVG 871
Query: 736 TAYVLPEENEPTK----GRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKI 791
T L NEP++ G I F+ ++ L+L Y+L A+ G+LL +
Sbjct: 872 TVKDLV--NEPSRKFSQGFIHTFVYDNKTLKLKHSTPIDEIPYALAAWRGRLLVGAGCNL 929
Query: 792 QLYKW----MLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEG 847
++Y+ +L+ + L S + + D I V ++ SI LL Y +
Sbjct: 930 RVYEMGNQRILKKAEIKNLNS--------FITSIMVKEDRIYVAEVADSIHLLRYNIRDQ 981
Query: 848 AIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR------------------KNSE 889
E A D +++A +LD + + N+F R K +
Sbjct: 982 TFMELADDILPRYVTASTVLDYHTVIAGDKFENIFVSRVPLDIDEEQEEHPYEYKMKMDQ 1041
Query: 890 GATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASL 949
G + +++ + +++GE + + +LV S+V V++GT G I +
Sbjct: 1042 GCMNGAPFKMDQICNFYVGEVITSLQKIALVS--TSSEV-----VVYGTSMGSIAALYPF 1094
Query: 950 PHEQYL-FLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSR 1008
+++ + F L+ LR + + G +H Q+RS K++ +DGDL + F ++
Sbjct: 1095 DNKEDIDFFLHLEMYLRVEHQPLSGRDHMQFRSAYGPCKSI-----IDGDLCDQFGNMQY 1149
Query: 1009 TRMDEISKTMNVSVEELCKRVEEL 1032
+ +++ + + ++ K++E++
Sbjct: 1150 NKQRAVAEEFDRTPADIIKKLEDI 1173
>gi|342888540|gb|EGU87812.1| hypothetical protein FOXB_01669 [Fusarium oxysporum Fo5176]
Length = 1408
Score = 172 bits (436), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 199/829 (24%), Positives = 348/829 (41%), Gaps = 124/829 (14%)
Query: 177 VLIIGEETIVYCSANAFKAIPIRPSITKAYGRVD-------------------ADGSRYL 217
VL+ EE+I Y N P+R I + G + AD +L
Sbjct: 592 VLVFAEESITYHHLNQ---DPLRVPIPRRRGATEDLPRKRTIIGGVMHKLKNTADAFFFL 648
Query: 218 LGDHAG-LLHLLVITHEKE-----KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDS 271
L G LL L H+ E +V +KI+ +AS++ L +++ + +G+
Sbjct: 649 LQTEDGDLLKLTFDMHDSEGSLAGEVRRIKIKYFDTVPVASSLCILKTGFLFVAAEFGNH 708
Query: 272 ---QLIKLNLQPDA-------------KGSYVEV------------LERYVNLGPIVDFC 303
Q+ KL DA +G+Y V LE +L P++D
Sbjct: 709 HFYQIEKLGEDDDAEPVITSDDFSTGPRGAYQPVYFDPRPLKNLALLESIDSLSPLLDCE 768
Query: 304 VVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTDDPFD 362
V D +G Q+ + SG ++++G+ IN+ AS +L G + G+W+ R + D FD
Sbjct: 769 VADPTGEGPPQIYSESGNGARSHFWMLKHGLEINKVASSDLPGTVSGVWTTRMTRHDKFD 828
Query: 363 TFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSS 422
++++++ S + M++ DE+E+ GF + TL + +VQ+ S +R + +
Sbjct: 829 SYIILT--SSDGTVVMSVGDEVEQVSDSGFLTTVTTLAIQQIGDDGIVQIHSKGIRHLRA 886
Query: 423 TSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTEVKHAQ-LEYEI 480
NEW +P S+ T N Q+ +A G +VY E+ DG L E + + +
Sbjct: 887 GQI---NEWPAPQHRSIVAVTTNKRQIAIALSSGEIVYFEVDSDGSLAEYDETKVISTTV 943
Query: 481 SCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPRSVLLCAFE 539
+CL + P+ E S + VG D +VRI L PD L + +P ++++ A E
Sbjct: 944 TCLSLGPVPEGRLRSPLLVVGC-EDCTVRILGLNPDSMLESNSIKTLTAVPSALIIMAME 1002
Query: 540 ---GISYLLC---ALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFA 593
S LC L G L +N TGEL+D + LG + I L + K T V A
Sbjct: 1003 HPLTSSSGLCLHIGLSSGVYLRTGMNEITGELSDTQTRFLGLKAIKLFQVTVKGQTCVLA 1062
Query: 594 ASDRPTVIYSSNKK-LLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL 652
+P + Y K+ + + +E+ F+S + + L I +++++
Sbjct: 1063 LGSKPWMGYIDPKRGFTMTPLECEELHWASSFSSEQCQEGIIAIHANFLHIFSVENLHNN 1122
Query: 653 HIR-SIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHF------------------ 693
+R SIPL PR F + N + + E
Sbjct: 1123 VVRKSIPLTYTPRHFVKHPAEPYFYTIEVDNNTLSPELRAQLLAVDDHGDDKVLPPELFG 1182
Query: 694 -----------VRLLDDQTFEFISTYPLDTFEYGCSILSCSF-SDDSNVYYCVGTAY-VL 740
+ ++D + + L+ E SI F S D ++ VGT ++
Sbjct: 1183 YPRGNGRWASCISIIDPIGEQVLQRIDLEGNEAAISIAVAPFTSQDGEIFLLVGTGKDMI 1242
Query: 741 PEENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWML 798
+ + G I V+ ++G+ LQ I + + + +L AF +L + + + +Y
Sbjct: 1243 LNPRQSSGGYIHVYRFHQNGRELQFIHKTKVEEPPMALVAFRDRLAVGLGKDLCIY---- 1298
Query: 799 RDDGTRELQSECGHHGHI-----LALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
D G ++L + HI L + + T+GD IVVGD+ + ++++++ HEE +
Sbjct: 1299 -DLGLKQLLRKA----HIEAAPQLIVSLDTQGDRIVVGDVQQGMTMVMFNHEEQRLIPFV 1353
Query: 854 RDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDEERGRL 899
D A W + ++D + +G + N++ VR K S D R L
Sbjct: 1354 DDIIARWTTCTTMVDYESVVGGDKFGNIWIVRCPKKTSHHVGDYARNYL 1402
>gi|167539942|ref|XP_001741428.1| DNA repair protein xp-E [Entamoeba dispar SAW760]
gi|165894130|gb|EDR22214.1| DNA repair protein xp-E, putative [Entamoeba dispar SAW760]
Length = 1004
Score = 172 bits (436), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 241/1058 (22%), Positives = 472/1058 (44%), Gaps = 153/1058 (14%)
Query: 10 IATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDNGQIG 69
I+ L ++P + D+L I T+ + +L+ + +I+ G + D GR G
Sbjct: 65 ISLLLKYKPLKDITDYLIIVTQSNEIEILKLTTSALHIISS--GSLKDTFGRKAFFGVKA 122
Query: 70 IIDPDCRLIGLHLYDGLFKVI--PFDNKGQLKEAFNIRLEELQVLDIKFLYGCAKPTIVV 127
I P+ +++ L+LY+ L KV+ P K QL A NI++ +LD+ F + +
Sbjct: 123 AISPNNQILLLNLYEQLIKVVLLPQYPKDQLT-ASNIKVNHSHILDMTFCLLNDSYQLAL 181
Query: 128 LYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVY 187
L+++ +D RH+ TY + + +G ++Q N+ + ++ + G+ +I +ET Y
Sbjct: 182 LHENKRDIRHIDTYTLN-SFNELEKGSFNQPNVGSNTSRILGFKDGIIGIFVISDETGCY 240
Query: 188 CSANA---FKAIPIRPSITKAYGRVD----ADGSRYLLGDHAGLLHLLVITHEKEKVTGL 240
+ N + P+ + + D D R L L + I++E +VT
Sbjct: 241 FNGNGKHILCNLNQYPNTACCFIKKDVLIMTDSKRNL----DTLEFKVNISNEIIEVTKK 296
Query: 241 KIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIV 300
+ L + SI+STIS L N +++IGS GD+ ++ LN + +LE++ N G ++
Sbjct: 297 QAPKLLQ-SISSTISDLSNQILFIGSLNGDNLIMNLNGK---------ILEKWSNFGSLM 346
Query: 301 DFCVVDLERQ--GQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTD 358
D RQ + + GS+ ++ G GI E + E+ GIK + S+ ++
Sbjct: 347 D------ARQISNREDYLVAGNGGGKGSIGLMIKGSGIEELGACEIDGIKSIESIEYNSK 400
Query: 359 DPFDTFLVVSFISETRILAMNLEDE---LEETEIEGFCSQTQTLFCHDAIYNQLVQVTSG 415
+++V F E+ I ++ + + +EE+ ++ + + + C I +L+ V
Sbjct: 401 ----KYVIVGFEEESNIWEIHQKSKHIKIEESLLKKIIGKERVICC-GIIKERLIFVCKK 455
Query: 416 SVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQ 475
+ + + +N + + N Q + G L +I + + + K
Sbjct: 456 GIYSIDKHQK--KNTLMKFDSFITHAKFIN--QQIYFIYGTEL--FKINEELKIQ-KMKD 508
Query: 476 LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLL 535
LE +SC D N +I +G+W D S+ IF D+N + K+ + I RSVL+
Sbjct: 509 LEQNVSCFDAN---------EIIVIGLW-DGSILIF---DINGVLKKTEVVDTIGRSVLI 555
Query: 536 CAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSL---GTQPITLRTFSSKNTTHVF 592
+ Y+ C D ++ FL + + + K+ +Q + L+ S
Sbjct: 556 DNNK--IYVGC---DNEIIIFLYHQMEEDCKYKNKLKYQVEASQSVKLKKLYS------- 603
Query: 593 AASDRPTVIYSSNKKLLYSNVNL-----KEVSHMCPFNSAAFPDSLAIAKEGELTIGTID 647
P V+ +NK + L +++ +C A+ + A + + G +
Sbjct: 604 ----IPCVV--ANKTFTFQQTGLVPLAVEDLIDICESPVGAY--GIICATKRGIVFGLMK 655
Query: 648 DIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFIST 707
++ ++ + I GE+ +I + SR + + + L+D +
Sbjct: 656 EMSRVTFKMIHSGENCCKIA-TDGSRGLIV----------GKTIKSINLVD-------GS 697
Query: 708 YPLDTFEYGCSILS-CSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVE-DGKLQLIA 765
+ + T E + L+ C S + N+Y VGTA + E EP+ GRIL+ + +G + ++
Sbjct: 698 FGVTTVELKSNELALCVDSLEDNIY-AVGTAIIRENEIEPSSGRILLIRQDSEGLIYIVG 756
Query: 766 EKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRG 825
++ GAVY L + ++A IN+ + + + +D T++ ++ + +
Sbjct: 757 TEDYDGAVYCLKKYQKGIVAFINRNVHVIEKKGKDLSTKQ-----NMLLPLIGVSLDICK 811
Query: 826 DFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI---YLGAENNFNLF 882
D+I+ GDL +S+S+ Y+++ ++ +D W S V ++ + +L A+ + N+
Sbjct: 812 DYIIAGDLARSVSVYRYRNDIEHLDIVGKDNQIVWSSCVGSIESEYGTSFLVADVSGNIK 871
Query: 883 TVRKNSEG-ATDEERGRLEVVGEYHLGEFVN----RFRHGSLVMRLPDSDVGQIPTVIFG 937
N E TD+E+ + ++ + H+ + +N F G L+ G
Sbjct: 872 IFNSNEEEPKTDDEK--INLISQVHIADSINFIQKSFYKGCLM----------------G 913
Query: 938 TVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDG 997
V+G I I + EQYLFL+K+Q+ L K WR N ++T N +DG
Sbjct: 914 GVHGGIYNICEISKEQYLFLDKIQSKLVK----------PNWRESVNTQQTNPMMNCIDG 963
Query: 998 DLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRL 1035
D IES L+ S + +++ + V +E+ ++++ L L
Sbjct: 964 DKIESVLEWSEKKQILLAQKIGVEYQEMIEKIQSLFSL 1001
>gi|353442065|gb|AER00318.1| xeroderma pigmentosum group E, partial [Hydra vulgaris]
Length = 156
Score = 172 bits (435), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 83/154 (53%), Positives = 110/154 (71%), Gaps = 1/154 (0%)
Query: 859 NWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGS 918
NWM+AVEI+DDD +LGAEN+FNLF +K++ DEER L+ +G+YHLG+FVN F+HGS
Sbjct: 2 NWMTAVEIIDDDTFLGAENSFNLFICQKDNSSVNDEERHHLQTIGKYHLGDFVNVFKHGS 61
Query: 919 LVMRLPDSDVGQI-PTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHE 977
LVM + I ++++GTV G IG++A LP + FL ++Q L K IK VG + HE
Sbjct: 62 LVMHHSTEQLTPISSSILYGTVRGAIGLVAGLPKNTFDFLSQVQEKLSKTIKSVGKIEHE 121
Query: 978 QWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRM 1011
WRSF N+KKT A +DGDLIES LDL+RT+M
Sbjct: 122 FWRSFYNDKKTDLAVGCVDGDLIESCLDLTRTQM 155
>gi|71004436|ref|XP_756884.1| hypothetical protein UM00737.1 [Ustilago maydis 521]
gi|74704394|sp|Q4PGM6.1|RSE1_USTMA RecName: Full=Pre-mRNA-splicing factor RSE1
gi|46095609|gb|EAK80842.1| hypothetical protein UM00737.1 [Ustilago maydis 521]
Length = 1221
Score = 172 bits (435), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 259/1175 (22%), Positives = 483/1175 (41%), Gaps = 171/1175 (14%)
Query: 7 YGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTDN 65
+G I +L FR G ++D++ + ++ + +L++ +++ L + + R G R
Sbjct: 62 FGVIRSLAAFRLTGGSKDYVIVGSDSGRIVILEYQPKTNSL-EKVHQETFGRSGSRRIVP 120
Query: 66 GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIR-------LEELQVLDIKF 116
GQ DP R +IG L ++ D + L + + + + +D+ F
Sbjct: 121 GQYLATDPKGRATMIGAMEKAMLVYILNRDAQANLTISSPLEAHRPSAIIHHIVGVDVGF 180
Query: 117 ---LYGCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVE---------GPWSQNNLDNGA 164
L+ C + V Y D+ + +E A K + E WS+ +D +
Sbjct: 181 ENPLFACLE----VDYSDSDHDPSGRAFEEAAKTLTYYELDLGLNHVVRKWSEP-VDPRS 235
Query: 165 DLLIPVPPPL----------CGVLIIGEETIVYCSANAFK-AIPIRPSITKAYGRVDADG 213
+LL+ VP GVL+ E+ I Y + + +PI + + G
Sbjct: 236 NLLVQVPGGYNQNLEKWDGPSGVLVCSEDYITYKHQDQPEHRVPIPKRLNPVEKLSERRG 295
Query: 214 SR--------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDN 259
+ +L+ G L + + H+ +++ LKI+ +AS + L +
Sbjct: 296 TLIVASVLHKMKNAFFFLVQTEDGDLFKITMEHQDDEIRSLKIKYFDTVPVASGLVILRS 355
Query: 260 AVVYIGSSYGDSQLIKLN------------------------------LQPDAKGSYVEV 289
+++ S YG L P + V+V
Sbjct: 356 GFLFVASEYGAQLLYSFQKLGDDDDLPEYISTDYDENGAGRRRPQLPTFTPRPLDNLVQV 415
Query: 290 LERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-K 348
E +L PI+D ++ Q+ G S +++R+G+ + E S +L G+
Sbjct: 416 DE-MPSLDPILDAKPLNPLAADSPQIFAACGRGARSSFKMLRHGLEVQEAVSSDLPGVPS 474
Query: 349 GMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQ 408
+W+ + + D +D+++++SF++ T +L++ + +EE GF + + TL +
Sbjct: 475 AVWTTKITQQDEYDSYIILSFVNGTLVLSIG--ETIEEVSDSGFLTSSSTLAVQQLGQDA 532
Query: 409 LVQVTSGSVRLVSSTSRELRNEWKSP--PG--YSVNVAT-ANASQVLLATGGGHLVYLEI 463
L+QV +R V +++ NEW +P P + VAT N QV++A LVY E+
Sbjct: 533 LLQVHPHGIRHVL-VDKQI-NEWATPSLPNGRQTTIVATCTNERQVVVALSSNELVYFEL 590
Query: 464 G-DGILTEVKHAQ-LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITK 521
DG L E + + + + + + E + AVG D +VRI SL + +
Sbjct: 591 DMDGQLNEYQERKAMGAGVLTMSMPDCPEGRQRTPYLAVGC-DDSTVRIISLEPNSTLAS 649
Query: 522 EHLGGEIIPRSVLLCA---------FEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVS 572
+ P S + A +++ L +G LL +L+ TG+LTD +
Sbjct: 650 ISIQALTAPASSICMAEMLDATIDRNHATTFVNIGLQNGVLLRTILDAVTGQLTDTRTRF 709
Query: 573 LGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDS 632
LG++ + L V A S R + Y+ +L + + + H F++ P+
Sbjct: 710 LGSKAVRLIRTKVHGQAAVMALSTRTWLSYTYQDRLQFVPLIFDVLDHAWSFSAELCPEG 769
Query: 633 LAIAKEGELTIGTIDDI-QKLHIRSIPLGEHPRRIC-HQEQSRTFAICSLKNQSCA---- 686
L L I TI + KL S+ L PR+I H + F + ++++ +
Sbjct: 770 LIGIVGSTLRIFTIPSLASKLKQDSVALSYTPRKIANHPNEQGLFYVVEAEHRTLSPGAQ 829
Query: 687 --------EESEMHFVRLLDDQTFEF------------------------ISTYPLDTFE 714
+E + H +LD EF +D E
Sbjct: 830 RRRTEMLGKELKPHQRGVLDLNPAEFGAIRAEAGNWASCIRAVDGVQAQTTHRLEMDDNE 889
Query: 715 YGCSILSCSF-SDDSNVYYCVGTAY-VLPEENEPTKGRILVF-IVEDGK-LQLIAEKETK 770
SI F S + V VG+A V+ K + + ++++G+ L+L+ + E
Sbjct: 890 AAFSIAVVPFASAEKEVMLVVGSAVDVVLSPRSCKKAYLTTYRLLDNGRELELLHKTEVD 949
Query: 771 GAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY-VQTRGDFIV 829
L AF G+LLA I + +++Y D G ++L +C + A+ + +G IV
Sbjct: 950 DIPLVLRAFQGRLLAGIGKALRIY-----DLGKKKLLRKCENRSFPTAVVSLDAQGSRIV 1004
Query: 830 VGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR--KN 887
VGD+ +SI YK E + A D +++ +LD D A+ N++ +R N
Sbjct: 1005 VGDMQESIIFASYKPLENRLVTFADDVMPKFVTRCTMLDYDTVAAADKFGNIYVLRLDGN 1064
Query: 888 SEGATDEERGRLEVV-------GEYHLGEFVNRFRHGSLVMRLPDSDV--GQIPTVIFGT 938
+ + DE+ + +V G H V F G ++ L + + G +++
Sbjct: 1065 TSRSVDEDPTGMTIVHEKPVLMGAAHKASLVAHFFVGDIITSLHRTAMVAGGREVLLYTG 1124
Query: 939 VNGVIGVIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDG 997
++G IG + + E L L+++LR+ + G +H +RS K+V +DG
Sbjct: 1125 LSGSIGALVPFVSKEDVDTLSTLESHLRQENNSIVGRDHLAYRSSYAPVKSV-----IDG 1179
Query: 998 DLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
DL E+F LS + + I+ ++ E+ K++ +L
Sbjct: 1180 DLCETFGLLSPAKQNAIAGELDRKPGEINKKLAQL 1214
>gi|315053737|ref|XP_003176243.1| pre-mRNA-splicing factor rse1 [Arthroderma gypseum CBS 118893]
gi|311338089|gb|EFQ97291.1| pre-mRNA-splicing factor rse1 [Arthroderma gypseum CBS 118893]
Length = 1181
Score = 171 bits (433), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 264/1165 (22%), Positives = 472/1165 (40%), Gaps = 191/1165 (16%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
++G I +L FR G +D++ I ++ + ++++ + R + + G R
Sbjct: 64 VFGIIRSLAAFRLAGSNKDYIIIGSDSGRITIVEY-VPAQNRFNRIHLETFGKSGIRRVV 122
Query: 65 NGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEE-------LQVLDIK 115
GQ +DP R LI + L V+ + + +L + + L LD+
Sbjct: 123 PGQYLAVDPKGRACLIASVEKNKLVYVLNRNAQAELTISSPLEAHRPQTVVFALTALDVG 182
Query: 116 F---LYGCAKPTIVVLYQDNKDARHVKT------YEVALKDKDFVEGPWSQNNLDNGADL 166
+ ++ + + QD + T YE+ L V W+ + +D A +
Sbjct: 183 YENPIFAALEVEYTEVDQDPTGQAYDDTEKMLVYYELDLGLNHVVRR-WA-DPVDRTASM 240
Query: 167 LIPVPPPL---CGVLIIGEETIVYCSAN--AFKA-IPIR--PSITKAYGRVDADGSR--- 215
L VP GVL+ E+ IVY +N AF+ IP R P+ R G
Sbjct: 241 LFQVPGGADGPSGVLVCAEDNIVYRHSNQDAFRVPIPRRRGPTENPERKRCITAGVMHKM 300
Query: 216 -----YLLGDHAGLLH---LLVITHEKEKVTG----LKIELLGETSIASTISYLDNAVVY 263
+LL G L + ++ E E TG LK++ +AS++ L + ++
Sbjct: 301 RGAFFFLLQSEDGDLFKVTMEMVEDENENATGEVKRLKLKYFDTVPLASSLCILKSGFLF 360
Query: 264 IGSSYGD-----------------------SQLIKLNLQP----DAKGSYVEVLERYVNL 296
+ S G+ S +I L P + ++E +L
Sbjct: 361 VASETGNQHFYQFEKLGDDDDEIEFISDDYSAVISEPLPPVYFRPRPAENLNLVESIASL 420
Query: 297 GPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSLRS 355
P++ V ++ + Q+ T G S R +++G+ ++E EL + +W+ +
Sbjct: 421 NPLMAASVTNVTEEDAPQIYTLCGTGARSSFRTLKHGLEVSEIVESELPSVPSAVWTTKL 480
Query: 356 STDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSG 415
S +D +D ++V+SF + T +L++ + +EE GF S TL + L+QV
Sbjct: 481 SRNDQYDAYIVLSFSNGTLVLSIG--ETVEEVTDTGFLSSAPTLAVQQLGEDSLIQVHPK 538
Query: 416 SVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTEV-KH 473
+R + + R NEW +P S+ AT N QV +A G +VY E+ DG L E +
Sbjct: 539 GIRHIHADQR--VNEWPAPQHRSIVAATTNERQVAIALSSGEIVYFEMDTDGSLAEYDEK 596
Query: 474 AQLEYEISCLDINPIGENPS----YSQIAAVGMWTD-ISVRIFSLPDLNLITKEHLGGEI 528
Q+ ++CL +GE P + + + D ++R+ D+ L H+G
Sbjct: 597 RQMSGTVTCLS---LGEVPKDDVDFCPVCPIHHVHDRFTLRV----DVIL----HIG--- 642
Query: 529 IPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNT 588
L G L +L+ TGEL+D + LG +P+ L + S K
Sbjct: 643 -------------------LYSGIYLRTVLDEVTGELSDTRTRFLGVKPVKLFSVSVKEQ 683
Query: 589 THVFAASDRPTVIYS--SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTI 646
V A S R + YS K + +N + F+S + + + L I +I
Sbjct: 684 RAVLALSSRSWLGYSDLQTKSFTLTPLNYVGLEWSWNFSSEQCVEGMVGIQGQNLRIFSI 743
Query: 647 DDI-QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHF------------ 693
+ + L IPL PR + F + N + ++
Sbjct: 744 EKLDNNLLQEPIPLAYTPRNFVRHPEYPLFYVIGSDNNVLSPSTKAKLLGESTAVNGDSA 803
Query: 694 -------------------VRLLDD-QTFEFISTYPLDTFEYGCSILSCSF-SDDSNVYY 732
+ ++D T +S L+ E SI + SF S + +
Sbjct: 804 ELPPEDFGYPRGTNHWASCIEVVDPINTKSVLSKLELEDNEAAVSIAAVSFTSQEDETFL 863
Query: 733 CVGTAY-VLPEENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVYSLNAFNGKLLAAINQ 789
VGT ++ T G I ++ E+GK L+ I + + + +L F G+LLA +
Sbjct: 864 VVGTGKDMVVSPRTYTCGFIHIYRFQEEGKELEFIHKTKVEQPPLALLGFQGRLLAGVGP 923
Query: 790 KIQLYKWMLRDDGTRELQSEC-GHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGA 848
+++Y D G R+L +C + + +QT+G I+V D+ +S++ ++YK++E A
Sbjct: 924 DLRIY-----DLGMRQLLRKCQAQITPRVIVGLQTQGSRIIVSDVQESVTYVVYKYQENA 978
Query: 849 IEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDEERGRLEVVGEY 905
+ A D + W + ++D + G + N++ +R K SE A ++ G +
Sbjct: 979 LIPFADDIISRWTTCTTMVDYETVAGGDKFGNIWLLRCPTKASEEADEDGSGAHLIHERQ 1038
Query: 906 HLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGT--------------VNGVIGV-IASLP 950
+L NR SLV+ D IPT I T + G +G+ + +
Sbjct: 1039 YLQGAPNRL---SLVVHFYSQD---IPTSIQKTQLVAGGRDILVWTGLQGTVGMFVPFIT 1092
Query: 951 HEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTR 1010
+ F + L+ L + G +H +R + K +DGDL E+FL L +
Sbjct: 1093 RDDVDFFQTLEMQLASQNAPLAGRDHLIYRGY-----YAPCKGVIDGDLCETFLLLPNDK 1147
Query: 1011 MDEISKTMNVSVEELCKRVEELTRL 1035
I+ ++ SV E+ +++ T L
Sbjct: 1148 KQAIAGELDRSVREIERKISVRTIL 1172
>gi|358060450|dbj|GAA93855.1| hypothetical protein E5Q_00501 [Mixia osmundae IAM 14324]
Length = 1153
Score = 171 bits (432), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 204/915 (22%), Positives = 392/915 (42%), Gaps = 124/915 (13%)
Query: 217 LLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKL 276
L+ G L+ + + ++E+V LKI+ +AS++ L + +++ S +G+ + +
Sbjct: 257 LVQSEEGDLYKVTVERQEEEVLALKIKYFDTVPVASSLCILRSGFLFVASEFGNHNVYQF 316
Query: 277 N------------------LQPDAKGSYVEV--LERYV------NLGPIVDFCVVDLERQ 310
+ ++ + ++ + LE V +L PIVD V ++
Sbjct: 317 DKLGDDGDEVSSADYPSFGMEREEPATFFKPRPLENLVLVDELDSLCPIVDAKVANVLGA 376
Query: 311 GQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIK--GMWSLRSSTDDPFDTFLVVS 368
Q++T G SL+++R+G+ + E S L G + G+W+ + +DD +D+++V+S
Sbjct: 377 DAPQIITACGRGHRSSLKMLRHGLEVEEMVSSGL-GFEPTGLWTTKLKSDDTYDSYIVLS 435
Query: 369 FISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV-TSGSVRLVSSTSREL 427
+ T +L + + +E+ G +TL + +VQV +G +R+ + S+E+
Sbjct: 436 APAATIVLTIG--ESIEQASNSGLVENQRTLCVQQVGDDSIVQVHPAGFLRVRADGSKEV 493
Query: 428 RNEWK-SPPGYSVNVATANASQVLLATGGGHLVYL--EIGDGILTEVKHAQLEYEISCLD 484
W +P ++ VA AN Q++LAT G ++YL E+ I QL ++ +
Sbjct: 494 ---WPLAPERVNLAVACANQRQIVLATTSGDIIYLRCELDGEITVFDDRKQLGVSVTSMS 550
Query: 485 INPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGG-EIIPRSVLLCAF----- 538
I + E+ + VG D +VRI SL + N + + +P S+ +
Sbjct: 551 IGELSEDRLQTDYLCVGC-EDQTVRIISLQEQNCLETISIQALTALPSSICIAEILDSSV 609
Query: 539 ---EGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAAS 595
+++ L +G LL +L+ +G LTD + LGT+P+ LR + +++ V + S
Sbjct: 610 DKTRPTTFVNIGLQNGVLLRTVLDPMSGRLTDTRTRFLGTRPVKLRRLAIGDSSGVLSIS 669
Query: 596 DRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDI-QKLHI 654
R V Y+ L + + + H + P+ + L I T+ I K+ +
Sbjct: 670 SRTWVNYTHQGLLQFDPLISDPLDHATGLRAEVCPEGIIGVTGDTLRIFTLPKIGTKVKM 729
Query: 655 RSIPLGEHPRRICH----------QEQSRTFAICSL--KNQSCAEESEMHF--------- 693
SIPL PRR Q RT + + K + E SE +
Sbjct: 730 DSIPLSLTPRRTAFHPAGTLLYMIQSDHRTLSPITQEEKAKDLMEPSEAMWTAEINGLMR 789
Query: 694 ---------VRLLDDQTFE--FISTYPLDTFEYGCSILSCSFSDDSNVYY-CVGTAY--- 738
+ ++D E ++ LD E S+ F++ ++ VGTA
Sbjct: 790 AEAGQWSSCISIIDPTEPENATVTQIYLDNNEAAFSVAVAQFAERPGKWFLLVGTAQDTT 849
Query: 739 VLPEENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKW 796
V P T G I + I E G+ L+L+ + E S+ AF G+ + + + ++LY
Sbjct: 850 VSPRTC--THGFIRTYEITEAGRSLELLHKTELDDVPLSIAAFQGRAVVGVGRALRLYTM 907
Query: 797 MLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDY 856
+R L+ + +Q +G I D S+ + YK + + A D
Sbjct: 908 ----GKSRLLRKSENKSFPAAVVSLQVQGSRIYASDAQDSVYFVAYKAADNRLLIFADDT 963
Query: 857 NANWMSAVEILDDDIYLGAENNFNLFTVRKN---SEGATDEERG---------------R 898
W++ ++D D + N+F R + SE +++ G R
Sbjct: 964 QQRWITCNTVVDYDTVASGDKFGNVFVSRVDKLVSEDVDEDQTGAGILHEKPLFMGAPHR 1023
Query: 899 LEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGV-IASLPHEQYLFL 957
L+++ +++G+ + + SLV G +++ + G +G+ I + E F
Sbjct: 1024 LQLLTHFNVGDILTCIQKVSLV-------AGGREILLYTCLGGTVGMLIPFISKEDVEFS 1076
Query: 958 EKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKT 1017
L+ ++R + G +H +R + V K +DGDL E+F L + +I+
Sbjct: 1077 STLEMHMRAENPSIVGRDHLAYRGYY-----VPQKATVDGDLCETFALLPMQKQAQIAGE 1131
Query: 1018 MNVSVEELCKRVEEL 1032
++ SV E+ K+++ +
Sbjct: 1132 LDRSVSEVLKKIDSM 1146
>gi|393236715|gb|EJD44262.1| hypothetical protein AURDEDRAFT_137476 [Auricularia delicata
TFB-10046 SS5]
Length = 1244
Score = 170 bits (431), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 252/1115 (22%), Positives = 450/1115 (40%), Gaps = 163/1115 (14%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSE---LITRAMGDVSDRIGRP 62
++G + +L FR G +D+ + ++ + +L +D +++E L T G R
Sbjct: 51 VFGAVRSLVAFRLTGGTKDYAIVGSDSGRIFILDYDPKANEFKRLYTETYGKSG---ARR 107
Query: 63 TDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL----- 117
GQ +DP R + + + V + + LE + I F
Sbjct: 108 IVAGQYFAVDPKGRAVMIAAMEKSKLVYILNRDAATNLTISSPLEVHRNQSILFHITGVD 167
Query: 118 YGCAKPTIVVLYQD-------------NKDARHVKTYEVALKDKDFVEGPWSQNNLDNGA 164
G P L D + + + YE+ L V WS+ D A
Sbjct: 168 VGFENPLFAALEIDYGKSDQDPTGEAFHNAEKMLTFYELDLGLNHVVRK-WSEPT-DPRA 225
Query: 165 DLLIPVP-------------PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDA 211
+ L+ VP P GVL+ E I+Y + + R + + +D+
Sbjct: 226 NFLVQVPGGTSASSPDKFDGP--SGVLVCCENHIIYRHMDHPQH---RVPVPRRRAPLDS 280
Query: 212 DGSRYLLGDHAGLLHLLVIT--HEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYG 269
+ ++ A ++H + +T H+ ++VT LKI+ ++S++ L + +++ S +G
Sbjct: 281 EERGLIIV--AAVMHKMKVTIDHQDDEVTSLKIKYFDTVPVSSSLCILKSGFLFVASDFG 338
Query: 270 DSQLIKLNLQPD-------AKGSYVE------------------------VLERYVNLGP 298
L + D + SY + + + ++ P
Sbjct: 339 PHHLYQFQKLGDDDAETEFSSSSYPKNGMADPDEALPSAYFQPRPLENLLLCDSLDSINP 398
Query: 299 IVDFCVVDLER--QGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVEL-QGI-KGMWSLR 354
I+D V++L Q G S R++R+G+ + E S +L GI +W+ +
Sbjct: 399 IIDAKVLNLMPTISDTPQFYVACGRGARSSFRLLRHGLEVEENISSDLPSGIPNAVWTTK 458
Query: 355 SSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTS 414
DDPFDT++V+SF++ T L + + + +EE + GF S TL + L+QV
Sbjct: 459 IRADDPFDTYIVLSFVNGT--LVLTIGETIEECQETGFLSAEPTLGVQQIGDDALLQVYP 516
Query: 415 GSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTEVKH 473
+R + + R NEWK P G ++ A N+ QV++A LVY E+ DG L E +
Sbjct: 517 HGIRHILADKRV--NEWKVPSGKTIVQAATNSRQVVVALNSAELVYFELDLDGQLNEYQD 574
Query: 474 AQ-LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRS 532
+ + + L + + E + AVG D +VRI SL N + L P S
Sbjct: 575 RKAMGSVVLALSMAEVPEGRQRTPYLAVGC-EDQTVRIISLDPDNTLETISLQALTAPPS 633
Query: 533 VLL------CAFEGIS---YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTF 583
+ A IS ++ L G L +L+ TGELTD + LGT+PI L
Sbjct: 634 AICIVAIIDAAVNKISETLFVNIGLTSGLFLRTVLDPVTGELTDTRTRFLGTRPIRLNRV 693
Query: 584 SSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTI 643
+ + A S R + Y+ + L + + + ++ F + PD K + I
Sbjct: 694 TIAGKPALLALSSRMWLNYAHQEALHLTPLIFDPLDYVSGFTADLCPDGFIGLKGSTIKI 753
Query: 644 GTIDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMH---------- 692
I I Q+L +PL PR+I + F I +++ ++E+ +H
Sbjct: 754 FQITKIGQRLKHEVMPLSYTPRKIAVHPMNGLFYIAESDHRTHSQEA-VHGALQQPDLKK 812
Query: 693 -------------------------FVRLLDDQTFEFISTYPLDTFEYGCSILSCSFS-D 726
VR+++ I LD E I FS
Sbjct: 813 YDRDVLELLPEVFGHPRAPACVWASCVRIVNPVELTTIYRLDLDNNEAAFLIALVPFSAR 872
Query: 727 DSNVYYCVGT---AYVLPEENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVYSLNAFNG 781
D+ + VGT A++ P T G + + +++DG+ L+L+ + E SL AF G
Sbjct: 873 DNELTLVVGTAKDAFLAPRSC--TSGYLRTYRVLDDGRGLELLHKTECDDIPLSLLAFQG 930
Query: 782 KLL-AAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY-VQTRGDFIVVGDLMKSISL 839
LL A I + +++Y D G ++L + A+ + T+G I+ GD +SI
Sbjct: 931 NLLVAGIGKCLRIY-----DMGKKKLLRKAESKLFTTAITSLSTQGARIIAGDAQQSIFF 985
Query: 840 LIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKN---SEGATDEER 896
+YK E + A D W ++ +LD + + + N+F R + S D+
Sbjct: 986 CVYKATENRLLVFADDSQQRWTTSQVMLDYNTVVAGDKFGNVFVNRLSDHVSNLVDDDPT 1045
Query: 897 G------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV--GQIPTVIFGTVNGVIGVIAS 948
G + +G H + F G ++M L + + G +++ ++G IG++
Sbjct: 1046 GTGLLHEKGMFMGAPHKTSMLCHFHVGDIIMSLQRTSLVPGGREVIVYFGLHGTIGMLVP 1105
Query: 949 LPHEQYL-FLEKLQTNLRKVIKGVGGLNHEQWRSF 982
++ + F L+ ++R + + WR +
Sbjct: 1106 FASKEDVDFFTPLEQHMRSENLSLVSRDILAWRGY 1140
>gi|410926437|ref|XP_003976685.1| PREDICTED: splicing factor 3B subunit 3-like, partial [Takifugu
rubripes]
Length = 1020
Score = 170 bits (430), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 223/992 (22%), Positives = 404/992 (40%), Gaps = 137/992 (13%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+L + ++G I +L FR G +D++ I ++ + +L++ S + + + + G
Sbjct: 53 LLTMEVFGIIRSLMAFRLTGGTKDYIVIGSDSGRIVILEYHT-SKNMFEKIHQETFGKSG 111
Query: 61 -RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
R GQ +DP R +IG L ++ D +L + + + L +
Sbjct: 112 CRRIVPGQFLAVDPKGRAVMIGAIEKQKLVYILNRDAAARLTISSPLEAHKANTLVYHVV 171
Query: 118 ---YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNG 163
G P L D ++A + T E A + F E N+ L+
Sbjct: 172 GVDVGFENPMFACLEMDYEEADNDPTGEAAANTQQTLTFYELDLGLNHVVRKYSEALEEH 231
Query: 164 ADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR--- 215
+ LI VP P GVLI E I Y N IR I + +D D R
Sbjct: 232 GNFLITVPGGSDGP--SGVLICSENYITY--KNFGDQPDIRCPIPRRRNDLD-DPERGMI 286
Query: 216 --------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAV 261
+L G + + + ++E VT ++++ +A+ + L
Sbjct: 287 FVCSATHKTKSMFFFLAQTEQGDIFKVTLETDEEMVTEIRLKYFDTIPVATAMCVLKTGF 346
Query: 262 VYIGSSYGDSQLIKLN------------------------LQPDAKGSYVEVLERYVNLG 297
+++ S +G+ L ++ QP + V V E+ +L
Sbjct: 347 LFVASEFGNHYLYQIAHLGDDDEEPEFSSAMPLEEGDTFFFQPRPLKNLVLVDEQE-SLS 405
Query: 298 PIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSS 356
PI+ + DL + Q+ G +LR++R+G+ ++E A EL G +W++R
Sbjct: 406 PIMSCQIADLANEDTPQLYVACGRGPRSTLRVLRHGLEVSEMAVSELPGNPNAVWTVRRH 465
Query: 357 TDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGS 416
+D FD +++VSF++ T +L++ + +EE GF T TL C LVQV
Sbjct: 466 VEDEFDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSLLGDEALVQVYPDG 523
Query: 417 VRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKHA 474
+R + + R NEWK+P ++ N QV++A GG LVY E+ G L E +
Sbjct: 524 IRHIRADKR--VNEWKTPGKKTIVRCAVNQRQVVIALTGGELVYFEMDPTGQLNEYTERK 581
Query: 475 QLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPD---LNLITKEHLGGEIIPR 531
++ ++ C+ + + S+ AVG+ D +VRI SL L ++ + L + P
Sbjct: 582 EMSADVVCMSLANVPPGEQRSRFLAVGL-ADNTVRIISLDPSDCLQPLSMQALPAQ--PE 638
Query: 532 SVLLCAFEGIS--------------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQP 577
S+ + G+ YL L +G LL +L+ TG+L+D + LG++P
Sbjct: 639 SLCIVEMGGVEKQDELGEKGSMGFLYLNIGLQNGVLLRTVLDPVTGDLSDTRTRYLGSRP 698
Query: 578 ITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAK 637
+ L + V A S R + YS + + ++ + + + F S P+ +
Sbjct: 699 VKLFRVRMQGQEAVLAMSSRSWLSYSYQSRFHLTPLSYETLEYASGFASEQCPEGIVAIS 758
Query: 638 EGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQE-----------QSRTFAICSLKNQS 684
L I ++ + + + + PL PR+ + H E + T A + + Q
Sbjct: 759 TNTLRILALEKLGAVFNQVAFPLQYTPRKFVIHPETNNLVLIESDHNAYTEATKAQRKQQ 818
Query: 685 CAEE-------------SEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVY 731
AEE +EM L++ E I P S++ N
Sbjct: 819 MAEEMVEAAGEDERELAAEMA-AAFLNENLPEAIFGAPKAGAGQWASLVRL-----INPI 872
Query: 732 YCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKI 791
V V E+NE + KL+ + + + A ++ F G++L + + +
Sbjct: 873 QGVTLDQVQLEQNEAAFXXXXRLVSGGEKLEFVHKTPVEDAPLAIAPFQGRVLVGVGKLL 932
Query: 792 QLYKWMLRDDGTRELQSECGHHGHI--LALYVQTRGDFIVVGDLMKSISLLIYKHEEGAI 849
++Y D G ++L +C + HI + + T G ++V D+ +S+ + Y+ E +
Sbjct: 933 RIY-----DMGKKKLLRKC-ENKHIPNVVTSIHTSGQRVIVTDVQESLFWVRYRRNENQL 986
Query: 850 EERARDYNANWMSAVEILDDDIYLGAENNFNL 881
A D W++ +LD D A+ N+
Sbjct: 987 IIFADDTYPRWVTTACLLDYDTMASADKFGNI 1018
>gi|134106833|ref|XP_777958.1| hypothetical protein CNBA4270 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|50260658|gb|EAL23311.1| hypothetical protein CNBA4270 [Cryptococcus neoformans var.
neoformans B-3501A]
Length = 1218
Score = 170 bits (430), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 223/924 (24%), Positives = 381/924 (41%), Gaps = 138/924 (14%)
Query: 216 YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIK 275
+LL G L+ + I H E V LKI+ +A+++ L +Y+ S + D L +
Sbjct: 320 FLLQSEDGDLYKVWIEHNGEDVVALKIKYFDTVPVANSLCILKRGYIYVASEFSDQNLYQ 379
Query: 276 LNLQPDAKG-------SYVE----------------------VLERYVNLGPIVDFCVVD 306
+ G Y E +++ +L PI D VV+
Sbjct: 380 FQSLAEDDGEQEWSSTDYPENGNIDGPLPFAFFDPQPLRNLLLVDTVPSLDPITDAHVVN 439
Query: 307 L--ERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIK-GMWSLRSSTDDPFDT 363
L Q+ G + R +++G+ + E S L G+ +W+L+ + D
Sbjct: 440 LLGASSDTPQIYAACGRGARSTFRTLKHGLDVAEMVSSPLPGVPTNVWTLKLTED----- 494
Query: 364 FLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSST 423
V+SF + T +L++ + +EE GF S TL L+QV +R + +
Sbjct: 495 --VLSFPNGTLVLSIG--ETIEEVNDTGFLSSGPTLAVQQLGNAGLLQVHPYGLRHIRAA 550
Query: 424 SRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTEVKHAQ-LEYEIS 481
R +EW +PPG ++ AT N QV++A LVY E+ +G L+E + + L +
Sbjct: 551 DRV--DEWPAPPGQTIVAATTNRRQVVIALSTAELVYFELDPEGSLSEYQEKKALPGNAT 608
Query: 482 CLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPRSVLLCAFEG 540
C+ I + E + AVG + +V I SL PD L T P S+ L
Sbjct: 609 CVTIAEVPEGRRRTSFLAVGC-DNQTVSIISLEPDSTLDTLSLQALTAPPTSICLAEIFD 667
Query: 541 IS--------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
S +L L +G LL +++ G L+D + LG +P L + + V
Sbjct: 668 TSIDKNRATMFLNIGLMNGVLLRTVVDPVDGSLSDTRLRFLGAKPPKLVRANVQGQPSVM 727
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDI-QK 651
A S R ++Y+ L + + + ++A PD L L I I + +K
Sbjct: 728 AFSSRTWLLYTYQDMLQTQPLIYDTLEYAWSLSAAMCPDGLIGISGNTLRIFNIPKLGEK 787
Query: 652 LHIRSIPLGEHPRR-ICHQEQS---------RTFAICSLKNQSCAEESE----------- 690
L S L PR+ I H S RT++ +++ +ESE
Sbjct: 788 LKQDSTALTYTPRKFISHPFNSVFYMIEADHRTYSKSAIERIVKQKESEGRRVDTLLLDL 847
Query: 691 ---------------MHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFS-DDSNVYYCV 734
VR+LD E I T LD E SI F + V
Sbjct: 848 PANEFGRPRAPAGHWASCVRVLDPLANETIMTLDLDEDEAAFSIAIAYFERGGGEPFLVV 907
Query: 735 GTAYVLPEENEPTK-GRILVF-IVEDGK-LQLIAEKETKGAVYSLNAFNGKLLAAINQKI 791
GT + + K G + V+ I E G+ L+ + + +T L F G LLA I + +
Sbjct: 908 GTGVKTTLQPKGCKEGYLRVYAIKEQGRILEFLHKTKTDDIPLCLAGFQGFLLAGIGKSL 967
Query: 792 QLYKWMLRDDGTRELQSECGHHGHILALY-VQTRGDFIVVGDLMKSISLLIYKHEEGAIE 850
+LY+ G + L +C ++G A+ + +G I+VGD+ +S +Y+ +I
Sbjct: 968 RLYEM-----GKKALLRKCENNGFPTAVVTINVQGARIIVGDMQESTFYCVYR----SIP 1018
Query: 851 ER-----ARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKN---SEGATDEERG----- 897
R A D W++ V +D + + N+F R + SE D+ G
Sbjct: 1019 TRQLLIFADDSQPRWITCVTSVDYETVACGDKFGNIFINRLDPSISEKVDDDPTGATILH 1078
Query: 898 -RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTV-------IFGTVNGVIGVIAS- 948
+ ++G H E + + GS+V + + +IP V ++ T++G +G +
Sbjct: 1079 EKSFLMGAAHKTEMIGHYNIGSVV-----TSITKIPLVAGGRDVLVYTTISGAVGALVPF 1133
Query: 949 LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSR 1008
+ + F+ L+ ++R + G +H +R + V K +DGDL ESF L
Sbjct: 1134 VSSDDIEFMSTLEMHMRTQDISLVGRDHIAYRGY-----YVPIKGVVDGDLCESFSLLPY 1188
Query: 1009 TRMDEISKTMNVSVEELCKRVEEL 1032
+ I+ ++ SV ++ K++E++
Sbjct: 1189 PKQQAIALDLDRSVGDVLKKLEQM 1212
>gi|407034933|gb|EKE37449.1| DNA damage-binding protein, putative [Entamoeba nuttalli P19]
Length = 995
Score = 168 bits (425), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 243/1057 (22%), Positives = 459/1057 (43%), Gaps = 151/1057 (14%)
Query: 10 IATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDNGQIG 69
I L ++P + D+L + T+ + +L+ + +I+ G + D GR G
Sbjct: 56 ITLLLKYKPLKDITDYLIVVTQSNEIEILKLTTSALHIISS--GSLKDTFGRKAFFGVKA 113
Query: 70 IIDPDCRLIGLHLYDGLFKVI--PFDNKGQLKEAFNIRLEELQVLDIKFLYGCAKPTIVV 127
+I P+ +++ L+LY+ L KV+ P K QL A NI++ +LD+ F + +
Sbjct: 114 VISPNNQILLLNLYEQLIKVVLLPQYPKDQLT-ASNIKVNHSHILDMTFCLLNDSYQLAL 172
Query: 128 LYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVY 187
L+++ +D RH+ TY + + +G +SQ N+ ++ + G+ +I +E Y
Sbjct: 173 LHENKRDIRHIDTYTLN-SFNELEKGSFSQPNVGPTTSRIVGFKDGIIGIFVISDEIGCY 231
Query: 188 CSANA---FKAIPIRPSITKAYGRVD----ADGSRYLLGDHAGLLHLLVITHEKEKVTGL 240
+ N + P+ + + D D R L L + I +E ++T
Sbjct: 232 FNGNGKHILCNLNQYPNTACCFIKKDVLIMTDSKRNL----DTLEFKVNINNEIIEITKK 287
Query: 241 KIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIV 300
+ L + SI+STISYL N +++IGS GD+ ++ LN + +LE++ N G ++
Sbjct: 288 QSPKLLQ-SISSTISYLSNKILFIGSLNGDNLIMNLNGK---------ILEKWSNFGSLM 337
Query: 301 DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP 360
D + + VV G L I G GI E S E+ IK + S+
Sbjct: 338 D--ARQISNREDYLVVGNGGGKGSIGLMI--KGSGIEELGSCEINDIKSVESIEYDRKK- 392
Query: 361 FDTFLVVSFISETRILAMNLEDE---LEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSV 417
+++V F E+ I ++ + + +EE+ ++ + + + C I +L+ V +
Sbjct: 393 ---YVIVGFEEESNIWEIHQKSKHIKIEESSLKKVIGKERVMCC-GIIKERLIFVCKKGI 448
Query: 418 RLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLE 477
+ + +N + + N Q + G L +I + + + K E
Sbjct: 449 YSIDKHQK--KNTLMKFDSFITHAKFIN--QQIYFIYGTEL--FKITEELKIQ-KMKDFE 501
Query: 478 YEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCA 537
+SC DIN +I +G+W D S+ IF D+N I K+ + I RS+L+
Sbjct: 502 QNVSCFDIN---------EIIVIGLW-DGSILIF---DINGILKKTEVVDTIGRSILI-- 546
Query: 538 FEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSL---GTQPITLRTFSSKNTTHVFAA 594
Y+ C D ++ FL N + T + K+ +Q I L+ S
Sbjct: 547 DHNKIYVGC---DNEIIVFLYNQMEEDCTYKNKLKYHIETSQSIKLKKIYS--------- 594
Query: 595 SDRPTVIYSSNKKLLYSNVNL-----KEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDI 649
P VI +NK + L +++ +C S + A + + G + ++
Sbjct: 595 --IPCVI--ANKTFTFQQTGLVPLAVEDLIDICE--SPVGDYGVICATKRGIVFGLMKEM 648
Query: 650 QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYP 709
++ + I GE+ +I ++K+ + L+D ++
Sbjct: 649 SRVTFKMIHSGENCCKIATDGSYGLLVGKTIKS-----------INLID-------GSFG 690
Query: 710 LDTFEYGCSILS-CSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVE-DGKLQLIAEK 767
+ T E + L+ C S + N+Y VGTA + E EP+ GRIL+ + +G + ++ +
Sbjct: 691 VTTIELKSNELALCVDSLEDNIY-AVGTAIIRENEIEPSSGRILLIRQDTEGLIYIVGTE 749
Query: 768 ETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDF 827
+ GAVY L ++A IN+ + + + +D T++ ++ + + D+
Sbjct: 750 DYDGAVYCLKKCQKGIVAFINRNVHVIEKKGKDLNTKQ-----NMLLPLIGVSLDICKDY 804
Query: 828 IVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI---YLGAE--NNFNLF 882
I+ GDL +S+S+ Y+++ ++ +D W S V ++ + +L A+ N +F
Sbjct: 805 IIAGDLARSLSVYRYRNDIEHLDVVGKDNQIVWSSCVGSIESEYGTSFLVADVSGNIKIF 864
Query: 883 TVRKNSEGATDEERGRLEVVGEYHLGEFVN----RFRHGSLVMRLPDSDVGQIPTVIFGT 938
E TD+E+ + ++ + H+ + +N F G L+ G
Sbjct: 865 N-SDEEEPKTDDEK--ISLISQVHVADSINFIQKSFYKGCLM----------------GG 905
Query: 939 VNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGD 998
V+G I I + E YLFL K+Q+ L K WR N ++T N +DGD
Sbjct: 906 VHGGIYNICEISKEHYLFLNKIQSKLVK----------SNWRESVNTQQTSPMMNCIDGD 955
Query: 999 LIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRL 1035
IES L+ S + +++ + V +E+ ++++ L L
Sbjct: 956 KIESILEWSEKKQMLLAQKIGVEHQEMIEKIQSLFSL 992
>gi|392580116|gb|EIW73243.1| hypothetical protein TREMEDRAFT_37240 [Tremella mesenterica DSM 1558]
Length = 1214
Score = 166 bits (421), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 258/1175 (21%), Positives = 479/1175 (40%), Gaps = 175/1175 (14%)
Query: 7 YGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTDN 65
+G + ++ FR G +D++ ++++ + ++++ + +V + G R
Sbjct: 60 FGTVRSVAGFRLAGMTKDYILLSSDSGRMSIIEFVVTPTPHFESLYQEVFGKSGSRRVVP 119
Query: 66 GQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YGCAK 122
GQ +DP R + L V+ +++G+L + + + L + G
Sbjct: 120 GQFLAVDPKGRSC-MFAASKLVYVLNRNSEGKLFPSSPLEAHKNHALITHVIGVDQGYDN 178
Query: 123 PTIVVLYQDNKDA-------------RHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIP 169
P L D ++ +H+ YE+ L V WS+ D A+LL+
Sbjct: 179 PLYAALEVDYSESDEDPTGAAFEAAEKHLVFYELDLGLNHVVRK-WSEPT-DRRANLLVQ 236
Query: 170 VP------------PPLCGVLIIGEETIVYCSANA-FKAIPI---------RPSITKAYG 207
VP P GVL+ E+ +++ +A +PI R I++
Sbjct: 237 VPGGQNATSDRFEGP--SGVLVCCEDHVIWKHMDAEAHRVPIPRRRNPLAQRGEISRGMI 294
Query: 208 RVDADGSR------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAV 261
V A + +LL + G L+ L I H+ E V +KI+ +++++ L +
Sbjct: 295 IVSAVMHKIRGAFFFLLQNEDGDLYKLWIDHQGEDVVAVKIKYFDTVPVSNSLCILKSGY 354
Query: 262 VYIGSSYGDSQLIKL-NLQ-------------PDAKGSYVEVLERYVN------------ 295
++ S +GD L + NL PD + + + N
Sbjct: 355 LFAASEFGDQNLYQFQNLAEDDDEQEWSSTDYPDNGNTTGALPYAFFNPRPLQNLLLVDT 414
Query: 296 ---LGPIVDFCVVDLERQGQG--QVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KG 349
L PI+D VV+L Q Q+ G + R +++G+ I + + L G+
Sbjct: 415 LSSLDPILDAQVVNLLGQSSDTPQIYAACGRGPRSTFRTLKHGLEIQQIVASPLPGVPNA 474
Query: 350 MWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQL 409
+W+L+ + +D FD+++V+SF + T +L++ + +EE GF S TL L
Sbjct: 475 VWTLKLTEEDEFDSYIVLSFPNGTLVLSIG--ETIEEVNDTGFLSSGPTLAVQQLGDAGL 532
Query: 410 VQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGIL 468
+QV +R + + R +EW PPG ++ AT N QV++A LVY E+ +G L
Sbjct: 533 LQVHPYGLRHIRAADRV--DEWACPPGSAITAATTNKRQVVIALSTAELVYFELDPEGSL 590
Query: 469 TEVKHAQ-LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGG 526
+E + + L +C+ I + E + AVG + +V + SL P+ L T
Sbjct: 591 SEYQDKKSLPGNATCVSIAEVPEGRRRTPFLAVGC-DNQTVHVISLEPESTLTTLSLQAL 649
Query: 527 EIIPRSVLLCAFEGIS--------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPI 578
P S+ L S +L L +G LL +++ G L+D + LG +P
Sbjct: 650 TAPPASICLAEIFDTSIDKNRATMFLNIGLMNGVLLRTVVDPVEGSLSDTRLRFLGAKPP 709
Query: 579 TLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKE 638
L S + V A S R ++Y+ L + + + ++A P+ L
Sbjct: 710 KLVRSSIHGSPSVMAFSSRAWLLYTYQDMLQTQPLIYDALEYASTLSAAMCPEGLIGISG 769
Query: 639 GELTIGTIDDI-QKLHIRSIPLGEHPRRICHQEQSRTF-------------AICSLKNQS 684
L I TI + +KL S+ L PR+ S F AI + +Q
Sbjct: 770 NTLRIFTIPRLGEKLKQDSMSLTYTPRKFISHPFSTIFYMIESDHRVLGPKAIQRIVSQK 829
Query: 685 CAEESEM-----------------------HFVRLLDDQTFEFISTYPLDTFEYGCSILS 721
A + +R+LD T + +ST LD E S+
Sbjct: 830 KAAGDRVDGSILELPPSEFGRPRAGPGHWASLIRILDPLTNQTVSTIELDEDEAAFSLTI 889
Query: 722 CSFSDDSN-VYYCVGTAYVLPEENEPTK-GRILVF-IVEDGK-LQLIAEKETKGAVYSLN 777
F + + VGTA K G + V+ I E+G+ L+ + + + +
Sbjct: 890 AYFENMAGEPSLVVGTAVKTTLTPRGCKEGWLRVYAIKENGRTLEFMHKTKLDEIPLCVA 949
Query: 778 AFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY-VQTRGDFIVVGDLMKS 836
F G LL + ++LY + G + L +C ++ + + G I+VGD+ +S
Sbjct: 950 GFQGYLLVGAGKSLRLY-----EAGKKALLRKCENNSFPTVIATINVIGARIIVGDMQES 1004
Query: 837 ISLLIYKHEEGAIEER-----ARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKN---S 888
+Y+ +I R D +++ V +D D + N+F R + S
Sbjct: 1005 TFFCVYR----SIPTRQLLVFGDDTQPRFLTCVTNVDYDTVACGDKFGNVFVNRMDQAVS 1060
Query: 889 EGATDEERG------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIP----TVIFGT 938
E D+ G + ++G H + ++ GS+V L + V +P +++ T
Sbjct: 1061 EKVDDDPTGAGILHEKGFLMGAAHKTTLIAHYQVGSVVTSL--TKVSLVPGGRDVLVYTT 1118
Query: 939 VNGVIGVIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDG 997
++G +G + + + F+ L+ ++R + G +H +R + V +DG
Sbjct: 1119 ISGAVGALVPFISMDDVEFMTTLEMHMRSQNISLVGRDHLAYRGYYAPVMGV-----VDG 1173
Query: 998 DLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
DL +++ L T+ I+ ++ SV ++ K++E++
Sbjct: 1174 DLCDAYSSLPYTKQSSIANELDRSVGDVLKKLEQM 1208
>gi|440907208|gb|ELR57379.1| Splicing factor 3B subunit 3 [Bos grunniens mutus]
Length = 1133
Score = 166 bits (420), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 185/788 (23%), Positives = 332/788 (42%), Gaps = 136/788 (17%)
Query: 357 TDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGS 416
T D FD +++VSF++ T +L++ + +EE GF T TL C + LVQV
Sbjct: 364 TTDEFDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSLLGDDALVQVYPDG 421
Query: 417 VRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKHA 474
+R + + R NEWK+P ++ N QV++A GG LVY E+ G L E +
Sbjct: 422 IRHIRADKR--VNEWKTPGKKTIVKCAVNQRQVVIALTGGELVYFEMDPSGQLNEYTERK 479
Query: 475 QLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIP-RSV 533
++ ++ C+ + + S+ AVG+ D +VRI SL + + + L + +P +
Sbjct: 480 EMSADVVCMSLANVPPGEQRSRFLAVGL-VDNTVRIISLDPSDCL--QPLSMQALPAQPE 536
Query: 534 LLCAFE----------------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQP 577
LC E G YL L +G LL +L+ TG+L+D + LG++P
Sbjct: 537 SLCIVEMGGTEKQDELGERGSIGFLYLNIGLQNGVLLRTVLDPVTGDLSDTRTRYLGSRP 596
Query: 578 ITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAK 637
+ L + V A S R + YS + + ++ + + F S P+ +
Sbjct: 597 VKLFRVRMQGQEAVLAMSSRSWLSYSYQSRFHLTPLSYETLEFASGFASEQCPEGIVAIS 656
Query: 638 EGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQEQSR-----------TFAICSLKNQS 684
L I ++ + + + + PL PR+ + H E + T A + + Q
Sbjct: 657 TNTLRILALEKLGAVFNQVAFPLQYTPRKFVIHPESNNLIIIETDHNAYTEATKAQRKQQ 716
Query: 685 CAEE-------------SEM-------------------------HFVRLLDDQTFEFIS 706
AEE +EM +R+++ +
Sbjct: 717 MAEEMVEAAGEDERELAAEMAAAFLNENLPESIFGAPKAGNGQWASVIRVMNPIQGNTLD 776
Query: 707 TYPLDTFEYGCSILSCSFSDDSNVYYC-VGTAYVLPEENEPTKGRILVF--IVEDG-KLQ 762
L+ E S+ C FS+ +Y VG A L G + +V +G KL+
Sbjct: 777 LVQLEQNEAAFSVAVCRFSNTGEDWYVLVGVAKDLILNPRSVAGGFVYTYKLVNNGEKLE 836
Query: 763 LIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHG-------H 815
+ + + ++ F G++L + + +++Y D G ++L +C +
Sbjct: 837 FLHKTPVEEVPAAIAPFQGRVLIGVGKLLRVY-----DLGKKKLLRKCENKAFLADGSSS 891
Query: 816 ILALYV------------QTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSA 863
+ +YV QT G ++V D+ +S + YK E + A D W++
Sbjct: 892 VFPMYVAVRHIANYISGIQTIGHRVIVSDVQESFIWVRYKRNENQLIIFADDTYPRWVTT 951
Query: 864 VEILDDDIYLGAENNFNLFTVR--KNSEGATDE---------ERGRL-------EVVGEY 905
+LD D GA+ N+ VR N+ DE +RG L EV+ Y
Sbjct: 952 ASLLDYDTVAGADKFGNICVVRLPPNTNDEVDEDPTGNKALWDRGLLNGASQKAEVIMNY 1011
Query: 906 HLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNL 964
H+GE V + +L+ G ++++ T++G IG++ HE + F + ++ +L
Sbjct: 1012 HVGETVLSLQKTTLI-------PGGSESLVYTTLSGGIGILVPFTSHEDHDFFQHVEMHL 1064
Query: 965 RKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEE 1024
R + G +H +RS+ KN +DGDL E F + + +S+ ++ + E
Sbjct: 1065 RSEHPPLCGRDHLSFRSY-----YFPVKNVIDGDLCEQFNSMEPNKQKNVSEELDRTPPE 1119
Query: 1025 LCKRVEEL 1032
+ K++E++
Sbjct: 1120 VSKKLEDI 1127
Score = 41.6 bits (96), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 76/378 (20%), Positives = 147/378 (38%), Gaps = 52/378 (13%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+L V ++G I +L FR G +D++ + ++ + +L++ S + + + + G
Sbjct: 53 LLTVEVFGVIRSLMAFRLTGGTKDYIVVGSDSGRIVILEYQP-SKNMFEKIHQETFGKSG 111
Query: 61 -RPTDNGQIGIIDPDCRLIGLHLYDG--LFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
R GQ +DP R + + + L ++ D +L + + + L +
Sbjct: 112 CRRIVPGQFLAVDPKGRAVMISAIEKQKLVYILNRDAAARLTISSPLEAHKANTLVYHVV 171
Query: 118 ---YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNG 163
G P L D ++A + T E A + F E N+ L+
Sbjct: 172 GVDVGFENPMFACLEMDYEEADNDPTGEAAANTQQTLTFYELDLGLNHVVRKYSEPLEEH 231
Query: 164 ADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR--- 215
+ LI VP P GVLI E I Y N IR I + +D D R
Sbjct: 232 GNFLITVPGGSDGP--SGVLICSENYITY--KNFGDQPDIRCPIPRRRNDLD-DPERGMI 286
Query: 216 --------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAV 261
+L G + + + +++ VT ++++ +A+ + L
Sbjct: 287 FVCSATHKTKSMFFFLAQTEQGDIFKITLETDEDMVTEIRLKYFDTVPVAAAMCVLKTGF 346
Query: 262 VYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFC-----VVDLERQGQGQVV 316
+++ S +G+ L+K+ + D +Y+ V +VN ++ V D G +
Sbjct: 347 LFVASEFGNQWLVKVLITTDEFDAYIIV--SFVNATLVLSIGETVEEVTDSGFLGTTPTL 404
Query: 317 TCSGAYKDGSLRIVRNGI 334
+CS D +++ +GI
Sbjct: 405 SCSLLGDDALVQVYPDGI 422
>gi|330790247|ref|XP_003283209.1| CPSF domain-containing protein [Dictyostelium purpureum]
gi|325086890|gb|EGC40273.1| CPSF domain-containing protein [Dictyostelium purpureum]
Length = 1233
Score = 165 bits (418), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 175/758 (23%), Positives = 333/758 (43%), Gaps = 104/758 (13%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
++G I ++ FR ++D++ + ++ + +L++++ ++ + + R G R
Sbjct: 57 VFGVIRSIIPFRLTSGSKDYIVVGSDSGRVVILEYNSTKNQF-DKVHQETFGRSGCRRIV 115
Query: 65 NGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLY---- 118
GQ +DP R +IG L ++ D+ +L + LE + I F
Sbjct: 116 PGQYLAVDPKGRAFMIGAIEKQKLVYILNRDSSAKL--TISSPLEAHKSSTIIFSMCGVD 173
Query: 119 -GCAKP---TIVVLY------------QDNKDARHVKTYEVALKDKDFVEGPWSQNNLDN 162
G P TI V Y +DN + + YE+ L + V WS + +D+
Sbjct: 174 VGFENPIFATISVDYSDDSGGNIEEDLEDNGKKKVLTFYELDLGLNNVVR-KWS-DQIDD 231
Query: 163 GADLLIPVPPP---LCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR---- 215
A++++ VP GVL+ E+ IVY + + + IR I + Y G
Sbjct: 232 TANIVMTVPGSNEGPGGVLVTSEDYIVYRNQDHPE---IRAKIPRRYNSNPDKGLLIVSY 288
Query: 216 ----------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIG 265
+L+ G L+ + + + E VT + I +A++++ L N ++
Sbjct: 289 SSHKQKNMFFFLVQSEFGDLYKITLDFKGEMVTAVNISYFDTIPLANSLTVLKNGFLFAA 348
Query: 266 SSYGDSQLIKLN--------------LQPDAKGSYVEVLER-----------YVNLGPIV 300
S +GD L + + K ++ +R +L PIV
Sbjct: 349 SEFGDHSLYFFKSLGDDTEEGNNEGTITMEEKDGFIWFKQREELKNLVQTSTLSSLSPIV 408
Query: 301 DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSL----RS 355
DF V+D+ R+ Q+ + G KD SL+++R+G+ ++ + +L G+ G+W++
Sbjct: 409 DFKVLDIVREENPQLYSLCGTSKDSSLKVLRHGLSVSTITNAKLPGVPSGIWTVPKSNSP 468
Query: 356 STDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSG 415
S D D ++VVSF+ T +L++ + ++E G T TL + ++QV
Sbjct: 469 SAQDQTDKYIVVSFVGSTSVLSVG--ETIQENHESGILESTTTLLIGSMGEDSILQVFPT 526
Query: 416 SVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTEVKHA 474
R + S R NEW++P ++ AT N +Q+++A GG ++Y E+ G LTE+
Sbjct: 527 GFRFIRSDMRT--NEWRAPGRKTIVRATCNQNQLVIALSGGEVIYFELDQSGGLTEIIKK 584
Query: 475 QLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHL--GGEIIPRS 532
+I+C++I+PI + S ++ A+ W + VR+ SL N + + + ++ S
Sbjct: 585 DFRRDIACIEISPIPKGRSMARFLAISDW-EGPVRLLSLDRDNCLGQISMLDTDKVYIES 643
Query: 533 VLLCAFE-----------------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGT 575
+ + + G +L L +G + L+ +GEL+D + LG
Sbjct: 644 LSIVEMQVNEAGVDNFKVSSQNMGGSLFLFVGLKNGVIKRATLDNYSGELSDIRTRLLGR 703
Query: 576 QPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAI 635
+P+ L + + A S R + Y++ KL ++++ + + +S P+S+
Sbjct: 704 RPVKLFKIKVRGGNGILALSSRVWLNYANQGKLEIVPLSVEPLENASNLSSDQSPESIVA 763
Query: 636 AKEGELTIGTIDDIQKL-HIRSIPLGEHPRRICHQEQS 672
E + I TID + L + +I LG PRR Q+
Sbjct: 764 TSEDHIIIFTIDKLGDLFNQETIKLGATPRRFIVHPQT 801
Score = 116 bits (291), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 95/364 (26%), Positives = 167/364 (45%), Gaps = 42/364 (11%)
Query: 694 VRLLDDQTFEFISTYPLDTFEYGCSILSCSFSD--DSNVYYCVG---TAYVLPEENEPTK 748
++++D + + +Y LD E G S+ + SFSD D ++ VG + P+ E
Sbjct: 881 IKIIDPISHQVYESYQLDENEAGFSLCTLSFSDRGDGEIFLVVGCGNNVILNPKSCESAS 940
Query: 749 GRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQS 808
+ F E KLQL+ + E + VY++ F G+L A + + I++Y D G ++L
Sbjct: 941 INLYKFTEEGKKLQLVYKTEAEEPVYAMAPFQGRLCAGVGKNIRIY-----DMGKKKLLR 995
Query: 809 ECGHHGHILALY-VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEIL 867
+C + + + GD +VVGD+ +SI + YK E + A D W+++ +L
Sbjct: 996 KCETKNLPNTIVNIHSLGDRLVVGDIQESIHFIKYKKLENMLYVFADDLAPRWITSSVML 1055
Query: 868 DDDIYLGAENNFNLFTVR------------------KNSEGATDEERGRLEVVGEYHLGE 909
D D GA+ N+F +R K G + +LE + G+
Sbjct: 1056 DYDTVAGADKFGNIFILRLPSNVSDEVEEDPTGSKLKFESGLLNGAPHKLEHIANIFAGD 1115
Query: 910 FVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIG-VIASLPHEQYLFLEKLQTNLRKVI 968
+ SLV VG +++ T++G IG +I + E F L+ LR
Sbjct: 1116 AITTLNKTSLV-------VGGSDVLLYTTISGAIGALIPFVSREDVDFFSSLELQLRNEH 1168
Query: 969 KGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKR 1028
+ G +H +RS+ KN +DGDL E F+ L + +I++ ++ S E+ K+
Sbjct: 1169 APLCGRDHLAYRSY-----YFPVKNIIDGDLCEQFITLDPQKQRQIAEELSRSPSEVLKK 1223
Query: 1029 VEEL 1032
+E++
Sbjct: 1224 LEDI 1227
>gi|183233163|ref|XP_654084.2| damaged DNA binding protein [Entamoeba histolytica HM-1:IMSS]
gi|169801703|gb|EAL48698.2| damaged DNA binding protein, putative [Entamoeba histolytica
HM-1:IMSS]
gi|449708240|gb|EMD47737.1| DNA-repair binding protein, putative [Entamoeba histolytica KU27]
Length = 995
Score = 165 bits (417), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 241/1057 (22%), Positives = 459/1057 (43%), Gaps = 151/1057 (14%)
Query: 10 IATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDNGQIG 69
I L ++P + D+L + T+ + +L+ + +I+ G + D GR G
Sbjct: 56 ITLLLKYKPLKDITDYLIVVTQSNEIEILKLTTSALHIISS--GSLKDTFGRKAFFGVKA 113
Query: 70 IIDPDCRLIGLHLYDGLFKVI--PFDNKGQLKEAFNIRLEELQVLDIKFLYGCAKPTIVV 127
+I P+ +++ L+LY+ L KV+ P K QL A NI++ +LD+ F + +
Sbjct: 114 VISPNNQILLLNLYEQLIKVVLLPQYPKDQLT-ASNIKVNHSHILDMTFCLLNDSYQLAL 172
Query: 128 LYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVY 187
L+++ +D RH+ TY + + +G +SQ N+ + ++ + G+ +I +E Y
Sbjct: 173 LHENKRDIRHIDTYTLN-SFNELEKGSFSQPNVGSTTSRILGFKDGIIGIFVISDEIGCY 231
Query: 188 CSANA---FKAIPIRPSITKAYGRVDA----DGSRYLLGDHAGLLHLLVITHEKEKVTGL 240
+ N + P+ + + D D R L L + I +E ++T
Sbjct: 232 FNGNGKHILCNLNQYPNTACCFIKKDVLIMTDSKRNL----DTLEFKVNINNEILEITKK 287
Query: 241 KIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIV 300
+ L + SI+STISYL N +++IGS GD+ ++ LN + +LE++ N G ++
Sbjct: 288 QSPKLLQ-SISSTISYLSNKILFIGSLNGDNLIMNLNGK---------ILEKWSNFGSLM 337
Query: 301 DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP 360
D + + VV G L I G GI E S E+ IK + S+
Sbjct: 338 D--ARQISNREDYLVVGNGGGKGSIGLMI--KGSGIEELGSCEINDIKSVESIEYDRKK- 392
Query: 361 FDTFLVVSFISETRILAMNLEDE---LEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSV 417
+++V F E+ I + + + +EE+ ++ + + + C I +L+ V +
Sbjct: 393 ---YVIVGFEEESNIWEIYQQSKHIKIEESSLKKVIGKERVICC-GIIKERLIFVCKKGI 448
Query: 418 RLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLE 477
+ + +N + + N Q + G L +I + + + K LE
Sbjct: 449 YSIDKHQK--KNTLMKFDSFITHAKFIN--QQIYFIYGTEL--FKITEELKIQ-KMKDLE 501
Query: 478 YEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCA 537
+SC DIN +I +G+W D ++ IF D+N + K+ + I RS+L+
Sbjct: 502 QNVSCFDIN---------EIIVIGLW-DGNILIF---DINGVLKKTEVVDTIGRSILI-- 546
Query: 538 FEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSL---GTQPITLRTFSSKNTTHVFAA 594
Y+ C D ++ FL N + T + K+ +Q I L+ S
Sbjct: 547 DHNKIYVGC---DNEIIVFLYNQMEEDCTYKNKLKYHIETSQSIKLKKIYS--------- 594
Query: 595 SDRPTVIYSSNKKLLYSNVNL-----KEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDI 649
P VI +NK + L +++ +C S + A + + G + ++
Sbjct: 595 --IPCVI--ANKTFTFQQTGLVPLAVEDLIDICE--SPVGDYGVICATKRGIVFGLMKEM 648
Query: 650 QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYP 709
++ + I GE+ +I ++K+ + L+D ++
Sbjct: 649 SRVTFKMIHSGENCCKIATDGSYGLLVGKTIKS-----------INLID-------GSFG 690
Query: 710 LDTFEYGCSILS-CSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVE-DGKLQLIAEK 767
+ T E + L+ C S + N+Y VGTA + E EP+ GRIL+ + +G + ++ +
Sbjct: 691 VTTIELKSNELALCVDSLEDNIY-AVGTAIIRENEIEPSSGRILLIRQDTEGLIYIVGTE 749
Query: 768 ETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDF 827
+ GAVY L ++A IN+ + + + +D T++ ++ + + D+
Sbjct: 750 DYDGAVYCLKKCQKGIVAFINRNVHVIEKKGKDLNTKQ-----NMLLPLIGVSLDICKDY 804
Query: 828 IVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI---YLGAE--NNFNLF 882
I+ GDL +S+S+ Y+++ ++ +D W S V ++ + +L A+ N +F
Sbjct: 805 IIAGDLARSLSVYRYRNDIEHLDVVGKDNQIVWSSCVGSIESEYGTSFLVADVSGNIKIF 864
Query: 883 TVRKNSEGATDEERGRLEVVGEYHLGEFVN----RFRHGSLVMRLPDSDVGQIPTVIFGT 938
E TD+E+ + ++ + H+ + +N F G L+ G
Sbjct: 865 N-SDEEEPKTDDEK--INLISQVHVADSINFIQKSFYKGCLM----------------GG 905
Query: 939 VNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGD 998
V+G I I + E YLFL K+Q+ L K WR N ++ N +DGD
Sbjct: 906 VHGGIYNICEISKEHYLFLNKIQSKLVK----------SNWRESVNTQQASPMMNCIDGD 955
Query: 999 LIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRL 1035
IES L+ S + +++ + V +E+ ++++ L L
Sbjct: 956 KIESILEWSEKKQMLLAQKIGVEHQEMIEKIQSLFSL 992
>gi|440792421|gb|ELR13643.1| splicing factor 3b subunit 3, putative [Acanthamoeba castellanii
str. Neff]
Length = 1227
Score = 165 bits (417), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 168/730 (23%), Positives = 315/730 (43%), Gaps = 73/730 (10%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+L V ++G I ++ FR G+ +D++ + ++ + +LQ++ E +
Sbjct: 53 VLSVEVFGTIRSIVPFRLTGDTRDYIIVGSDSGRIVILQYNNERNTFDKIHQETFGKSGC 112
Query: 61 RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL- 117
R GQ +DP R +IG L ++ D+ +L + + + ++I +
Sbjct: 113 RRIVPGQYLAVDPKGRAVMIGAVEKQKLVYILNRDSAARLTISSPLEAHKSHTINIHMVG 172
Query: 118 --YGCAKPTIVVLYQDNKDARHVKT---YEVALKDKDFVE---------GPWSQNNLDNG 163
G P L D D H T +E K+ F E WS+ +D
Sbjct: 173 VDVGFENPIFATLEVDYADVDHDPTGKEFEALHKNLTFYELDLGLNHVVRKWSEP-VDIT 231
Query: 164 ADLLIPVP-----PPLCGVLIIGEETIVYCSAN---AFKAIPIRPS--------ITKAYG 207
A++L+ VP P GVL++GE +V+ + A+P R I A
Sbjct: 232 ANMLVAVPGGSDGP--GGVLVMGENFVVWKNQGHQEVRAALPRRKILGEERSVLIVSATA 289
Query: 208 RVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSS 267
D LL G ++ + E++ V+ +KI ++ + L +++ +
Sbjct: 290 HKQKDLFFILLQSEYGDIYKATLAWEEDNVSAIKINYFDTVPTSNAMCILKTGFLFVAAE 349
Query: 268 YGDSQLIKLN--------------------LQPDAKGSYVEVLERYVNLGPIVDFCVVDL 307
+G+ L + P + + + E +L PI+DF V D+
Sbjct: 350 FGNHFLFQFQSIGDEEEESTMDQDEDEIPTFDPQPLKNLIAIDE-IESLAPIMDFKVADM 408
Query: 308 ERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTDDPFDTFLV 366
++ Q +T G SLR++++G+ + E A L G ++++R + D +D++++
Sbjct: 409 VKEETKQFLTLCGRGPRSSLRLLKHGLAVAEMADSPLPGNPNNIFTVRKNVADEYDSYII 468
Query: 367 VSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRE 426
VSF++ T +L++ D +EE + GF TL + LVQV +R + S R
Sbjct: 469 VSFLNATLVLSIG--DNVEEVKDAGFNENASTLNVGLVGDDSLVQVFPTGIRFIRSDKR- 525
Query: 427 LRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTEVKHAQLEYEISCLDI 485
EW +P ++ + N QV++A GG L+Y E+ G L EV + +I+C+DI
Sbjct: 526 -ITEWPTPARRTIVRSAINNKQVVIALTGGELLYFELDITGSLVEVGRKDMGRDIACIDI 584
Query: 486 NPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGG-EIIPRSVLLCAFEGIS-- 542
PI E ++ AVG + + +VR+ SL ++ + + P S+ + +G +
Sbjct: 585 APIPEGRLRARFLAVGDYEN-TVRVLSLDPEDVFSSLAVQALPAPPESLCIVKMKGGTDS 643
Query: 543 -----YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDR 597
+L L +G L +L+ TGEL+D + LG++P+ L + + A + R
Sbjct: 644 SAGTLFLNIGLTNGVLQRTVLDKVTGELSDTRTRFLGSRPVRLLKLRVGDQPAMLALTSR 703
Query: 598 PTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDI-QKLHIRS 656
+ Y+ L + ++ + + F S P+ + L I + + + + H
Sbjct: 704 AWLCYNYQLHLHLTPLSYPALDYASNFCSERCPEGMVATIGNALRIISPERLGEVFHQEL 763
Query: 657 IPLGEHPRRI 666
IPL PR++
Sbjct: 764 IPLRYTPRKM 773
Score = 112 bits (281), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 96/362 (26%), Positives = 168/362 (46%), Gaps = 44/362 (12%)
Query: 694 VRLLDDQTFEFISTYPLDTFEYGCSILSCSFSD-DSNVYYCVGTAYVL---PEENEPTKG 749
VRLLD T + I LD E S +C F D ++ VGTA L P + G
Sbjct: 878 VRLLDVNTKQTIDVVELDNNEAAFSACTCVFHDRGGEIFLVVGTAKGLVLNPRSCD--AG 935
Query: 750 RILVFIVEDG--KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQ 807
I V+ + DG + L+ + + +G ++ F G+LL I + +++Y D G R+L
Sbjct: 936 YIHVYRLLDGGKRFSLVHKTQVEGVPTAVCGFQGRLLVGIGKMLRIY-----DLGKRKLL 990
Query: 808 SECGHHGHILALY-VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEI 866
+C + G + + T+G+ I+VGDL +S + Y+ E + A D N W++A ++
Sbjct: 991 RKCENKGFPHCIQSITTQGERIIVGDLAESFHFVKYRKAENQLNVYADDSNPRWLTASQM 1050
Query: 867 LDDDIYLGAENNFNLFTVRKNSEGATDEERG-----------------RLEVVGEYHLGE 909
LD D GA+ N+F VR SE + E +L+ + +H+G+
Sbjct: 1051 LDYDTMAGADKFGNVFIVRLPSEVNEELEDNPMGNFLMSKQSLNGAAFKLQTLINFHVGD 1110
Query: 910 FVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIG-VIASLPHEQYLFLEKLQTNLRKVI 968
+N SL G +++ T+ G +G ++ + E F L+ ++R +
Sbjct: 1111 TINSMTKASLF-------TGGADVLVYTTLMGGMGALLPFVSREDVDFFSHLEMHMRSEL 1163
Query: 969 KGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKR 1028
+ G +H +RS+ K+ +DGDL E F L + I++ ++ + E+ K+
Sbjct: 1164 PPLCGRDHLAYRSY-----YFPVKDVIDGDLCEQFSLLPPEKQRTIAEELDRTPGEVLKK 1218
Query: 1029 VE 1030
+E
Sbjct: 1219 LE 1220
>gi|388855100|emb|CCF51231.1| probable splicing factor 3B subunit 3 [Ustilago hordei]
Length = 1221
Score = 164 bits (415), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 248/1175 (21%), Positives = 474/1175 (40%), Gaps = 171/1175 (14%)
Query: 7 YGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTDN 65
+G + +L FR G ++D+L + ++ + +L++ +++ + + R G R
Sbjct: 62 FGVVRSLAAFRLTGGSKDYLIVGSDSGRIVILEFQPQTNSF-DKVHQETFGRSGSRRIVP 120
Query: 66 GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLK-----EAF--NIRLEELQVLDIKF 116
GQ DP R +IG L ++ D + L EA N + + +D+ F
Sbjct: 121 GQYLATDPKGRATMIGAMEKAKLVYILNRDAQANLTISSPLEAHRPNGIIHHIVGVDVGF 180
Query: 117 ---LYGCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVE---------GPWSQNNLDNGA 164
L+ C + + Y ++ + Y+ A K + E WS+ +D +
Sbjct: 181 ENPLFAC----LEIDYSESDHDPSGRAYDEAEKTLTYYELDLGLNHVVRKWSEP-VDPRS 235
Query: 165 DLLIPVPPPL----------CGVLIIGEETIVYCSANAFK-AIPIRPSITKAYGRVDADG 213
+LL+ VP GVL+ E+ I Y + + +PI + + G
Sbjct: 236 NLLLQVPGGYNHNLEKWDGPSGVLVCSEDYITYKHQDQPEHRVPIPRRLNPIEKMSERRG 295
Query: 214 SR--------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDN 259
+ +L+ G L + + H+ +++ LKI+ +AS ++ L +
Sbjct: 296 TLVVASVLHKMKNAFFFLVQTEDGDLFKITMEHQDDEIRSLKIKYFDTVPVASGLAILRS 355
Query: 260 AVVYIGSSYGDSQLIKLN------------------------------LQPDAKGSYVEV 289
+++ S +G L P + + V+V
Sbjct: 356 GFLFVASEFGPQLLYSFQKLGDDDDLPEYISTDYDENGAGRKRPQLPTFTPRSLDNLVQV 415
Query: 290 LERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-K 348
E +L PI+D ++ Q+ G S +++R+G+ E S +L G+
Sbjct: 416 DE-MPSLDPILDAKPLNPLASDSPQIFVACGRGARSSFKMLRHGLEAQEAVSSDLPGVPS 474
Query: 349 GMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQ 408
+W+ + + D +D+++++SF++ T +L++ + +EE GF + + TL +
Sbjct: 475 AVWTTKITRQDEYDSYIILSFLNGTLVLSIG--ETIEEVGDSGFLTSSPTLAVQQLGEDA 532
Query: 409 LVQVTSGSVRLVSSTSRELRNEWKSPP---GYSVNVAT--ANASQVLLATGGGHLVYLEI 463
L+QV +R + + NEW +P G + N QV +A LVY E+
Sbjct: 533 LLQVHPYGIRHILVDKQV--NEWATPSLPNGVQTTIVAICTNERQVAVALSSNELVYFEL 590
Query: 464 G-DGILTEVKHAQLE-YEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITK 521
DG L E + + + + + E + A G D +VRI SL + +
Sbjct: 591 DMDGQLNEYQDRKATGATVLTMSMADCPEGRQRTPYLAAGC-DDSTVRIISLEPASTLAS 649
Query: 522 EHLGGEIIPRSVLLCA---------FEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVS 572
+ P S + A + +++ L +G LL +L+ TG+LTD +
Sbjct: 650 ISIQALTAPASSICVAEMKDATVDRNQATTFVNIGLSNGVLLRTVLDAMTGQLTDTRTRF 709
Query: 573 LGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDS 632
LG++ + L + V A S R + ++ +L ++ + + H F++ P+
Sbjct: 710 LGSKAVRLIRTKVHGQSAVMALSTRTWLSFTYQSRLQFTPLIFDALDHAWSFSAELCPEG 769
Query: 633 LAIAKEGELTIGTIDDI-QKLHIRSIPLGEHPRRICH-----------QEQSRTFAICSL 680
L L I TI + KL S+ L PR+I H + RT + +
Sbjct: 770 LIGIVGSTLRIFTIPSLASKLKQDSVALSYTPRKIAHHPDEQGLFYVVEADRRTLSPGAQ 829
Query: 681 KNQSCAEESEMH--------------------------FVRLLDDQTFEFISTYPLDTFE 714
+ + A E E+ VR++D + LD E
Sbjct: 830 RRRVEALEKELKPHQRGVLDLKPAEFGLIRGEAGNWASCVRVVDGPQSQTTHKIELDDNE 889
Query: 715 YGCSILSCSF-SDDSNVYYCVGTAY-VLPEENEPTKGRILVFIVEDG--KLQLIAEKETK 770
S+ F S + + VG+A V+ K + + + +G +L++ + E
Sbjct: 890 AAFSVAIVPFASAEKQSFLVVGSAVDVVLSPRSFKKAYLTTYRLINGGRELEVHHKTEID 949
Query: 771 GAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY-VQTRGDFIV 829
L F G+LLA + + +++Y D G ++L +C + A+ + +G IV
Sbjct: 950 DIPLVLRPFQGRLLAGVGKALRIY-----DLGKKKLLRKCENKSFPTAIVSLDAQGSRIV 1004
Query: 830 VGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR--KN 887
VGD+ +SI YK E + A D +++ +LD D A+ NL+ +R +
Sbjct: 1005 VGDMQESIVFTSYKPLENRLVTFADDVMPKFVTRCTMLDYDTVAAADKFGNLYVLRIDAD 1064
Query: 888 SEGATDEERGRLEVV-------GEYHLGEFVNRFRHGSLVMRLPDSDV--GQIPTVIFGT 938
+ + DE+ + +V G H + + G ++ L + + G +++
Sbjct: 1065 TSRSVDEDPTGMTIVHEKPVLMGAAHKATLLAHYFVGDIITSLNRTVMVPGGREVLMYTG 1124
Query: 939 VNGVIGVIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDG 997
++G IG + + E L LQT LR+ + G +H +RS K+V +DG
Sbjct: 1125 ISGTIGALVPFVSKEDVDTLSTLQTQLRQENNSLVGRDHLAYRSSYAPVKSV-----IDG 1179
Query: 998 DLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
DL E+F L + + I++ ++ E+ K++ +L
Sbjct: 1180 DLCETFGLLQPAKQNAIAQELDRKPSEINKKLAQL 1214
>gi|452820919|gb|EME27955.1| splicing factor 3B subunit 3 [Galdieria sulphuraria]
Length = 1294
Score = 164 bits (415), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 203/870 (23%), Positives = 356/870 (40%), Gaps = 148/870 (17%)
Query: 295 NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIK-GMWSL 353
+L P+ + + D +G Q+V SG GS + +R G E A EL G G++++
Sbjct: 404 SLAPLTCWQIGDFYGEGSPQIVVGSGRRHRGSWKTLRLGYRWTEMAVSELPGYPVGVFTV 463
Query: 354 RSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVT 413
R D +D++++VSF++ + +L++ + +EE GF + T+F N +VQV
Sbjct: 464 REQQQDSYDSYIIVSFVNASLVLSVG--ETVEEVSDSGFSTFVSTIFVQLFSNNSIVQVH 521
Query: 414 SGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD-GILTEVK 472
+ +R + + + +EWK P G + VA NASQ+++ L+Y E+ + G+LTE++
Sbjct: 522 AQGIRHIRPDLQTV-HEWKPPKGLFIQVADGNASQLVIILNSKQLLYFELDETGMLTEIQ 580
Query: 473 HAQL----EYEIS----CLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHL 524
+L YE S C+ I P+ S + AV R++SL + + L
Sbjct: 581 EVELIQDQSYESSSFLPCVSIAPLVAGQSKASFVAVSDGISYYTRLYSLKKESCLKPISL 640
Query: 525 GG-EIIPRSVLLCAFEGIS------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQP 577
+ + SVLL + LL G + L++ TG L++++ +GTQP
Sbjct: 641 QALDTVANSVLLIGLSAPASSRSELVLLIGSHSGIYIRALVDPITGALSEQQSRLVGTQP 700
Query: 578 ITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYS-NVNLKEV-SHMCPFNSAAFPDSLAI 635
+ L+ + S R ++Y N LL + + V H+ F S PD +
Sbjct: 701 VRLQRIEIQGKPVALVVSSRSYLVYLDNHLLLQTLPIGYDWVFDHVVGFCSEQCPDGVIA 760
Query: 636 AKEGELTIGTIDDIQKL-------HIRSIPLGEHPRRICHQEQSRTFAICSL-KNQSCAE 687
L I + ++ + L H + P PRR+ +E + +I + +N+
Sbjct: 761 CCGSSLCILSFEEDRDLFQTTHLFHSDTQPSLYTPRRLVDREDILSSSITNYTRNRILTL 820
Query: 688 ESEMHFVRLLDD-------------------------QTFEFIST--------------- 707
E + V D Q F+
Sbjct: 821 ECDQQIVISPQDIVSHSSSLRQTGYPLGPIGSWSCAIQQMTFMDNQFPNHEDPVESTSCR 880
Query: 708 -------YPLDTFEYGCSILSCSFS-DDSNVYYCVGTA--YVL----PEENEPTKGRI-- 751
Y +T EY SI FS D S++Y CVG Y L P E PTK I
Sbjct: 881 WQQKCEIYNSETKEYISSICCIRFSHDSSSLYLCVGVVMDYALQVTNPLEKNPTKDNIDI 940
Query: 752 ----LVF---------------------IVEDGKLQLIAEKETKGA-------------- 772
+VF ++D + GA
Sbjct: 941 QRESIVFPRSEIRVYRWNETQWEYIHTTPIQDASSHRMEWNYDYGASHLSFHAASRYRNE 1000
Query: 773 -VYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGH---HGHILALYVQTRGDFI 828
+ ++ +F G LL A+ +++Y D G ++L + H H + ++T D I
Sbjct: 1001 IITTMASFQGHLLVAVGTSLRMY-----DLGKKQLLKKTQHPRATPHKITC-IETCYDRI 1054
Query: 829 VVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNS 888
+ D+ +S+ L Y + A DY W + + +LD D + N+ +R
Sbjct: 1055 FLSDVQESVFLYRYSAADNLFLCIADDYLPKWCTTMCLLDYDTVAIGDKMGNISILRLPP 1114
Query: 889 EGAT----DEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSD--VGQIPTVIFGTVNGV 942
E T D G L +H + + GS++ L + G +P + +GT++G
Sbjct: 1115 EAGTFIEQDPTGGLLSKEAPHHF-QLEACYYVGSVIQCLSKVEWTTGDVPLLFYGTLDGA 1173
Query: 943 IGVIASLPHEQYL-FLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIE 1001
IGV+ L + + L+ LR+ + G +H +RS+ ++ +DGDL E
Sbjct: 1174 IGVMIPLRSTLDMELFQALELQLREYRSPLCGRHHLAYRSY-----FFPVRHVIDGDLCE 1228
Query: 1002 SFLDLSRTRMDEISKTMNVSVEELCKRVEE 1031
F LS + ++I K ++ S+ ++ +++E+
Sbjct: 1229 EFYRLSLEQQEKIVKELDRSIVDVHRKLED 1258
>gi|323508292|emb|CBQ68163.1| probable splicing factor 3B subunit 3 [Sporisorium reilianum SRZ2]
Length = 1221
Score = 163 bits (412), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 254/1178 (21%), Positives = 477/1178 (40%), Gaps = 177/1178 (15%)
Query: 7 YGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTDN 65
+G + +L FR G ++D+L + ++ + +L++ + ++ + + R G R
Sbjct: 62 FGVVRSLAAFRLTGGSKDYLIVGSDSGRIVILEYQSRTNSF-DKVHQETFGRSGSRRIVP 120
Query: 66 GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLK-----EAF--NIRLEELQVLDIKF 116
GQ DP R +IG L ++ D + L EA N + + +D+ F
Sbjct: 121 GQYLATDPKGRAAMIGAMEKAKLVYILNRDAQANLTISSPLEAHRPNGIIHHIIGVDVGF 180
Query: 117 ---LYGCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVE---------GPWSQNNLDNGA 164
L+ C + V Y ++ + +E A K + E WS+ +D+ +
Sbjct: 181 ENPLFAC----LEVDYSESDHDPSGRAFEEAEKALTYYELDLGLNHVVRKWSEP-VDSRS 235
Query: 165 DLLIPVPPPL----------CGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGS 214
+LLI VP GVL+ E+ I Y + + R I K V+
Sbjct: 236 NLLIQVPGGYNQNLEKWDGPSGVLVCSEDYITYKHQDQPEH---RVPIPKRLNPVEKSSE 292
Query: 215 R------------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISY 256
R +L+ G L + + H+ +++ LKI+ +AS ++
Sbjct: 293 RRGTLIVASVLHKMKNAFFFLVQSEDGDLFKVTMEHQDDEIRSLKIKYFDTVPVASGLAI 352
Query: 257 LDNAVVYIGSSYGDSQLIKLN------------------------------LQPDAKGSY 286
L + +++ S +G L P +
Sbjct: 353 LRSGFLFVASEFGPQLLYSFQKLGDDDDLPEYSSTDYDENGAGRRRPQLPTFTPRPLDNL 412
Query: 287 VEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG 346
V+V E +L PI+D ++ Q+ G S +++R+G+ E S +L G
Sbjct: 413 VQVDE-MPSLDPILDAKPLNPLASDSPQIFAACGRGARSSFKMLRHGLEAQEAVSSDLPG 471
Query: 347 I-KGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAI 405
+ +W+ + + D D+++++SF++ T +L++ + +EE GF + + TL
Sbjct: 472 VPSAVWTTKITRQDEHDSYIILSFVNGTLVLSIG--ETIEEVSDSGFLTSSSTLAVQQLG 529
Query: 406 YNQLVQVTSGSVRLVSSTSRELRNEWKSP--PG--YSVNVAT-ANASQVLLATGGGHLVY 460
+ L+QV +R + +++ NEW +P P + VAT N QV++A LVY
Sbjct: 530 EDALLQVHPHGIRHIL-VDKQI-NEWATPSLPNGRQTTIVATCTNERQVVVAFSSNELVY 587
Query: 461 LEIG-DGILTEVKHAQ-LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNL 518
E+ DG L E + + + + + + E + AVG D +VRI SL +
Sbjct: 588 FELDMDGQLNEYQERKAMGAAVLTMSMADCPEGRQRTPYLAVGC-DDSTVRIISLEPNST 646
Query: 519 ITKEHLGGEIIPRSVLLCA---------FEGISYLLCALGDGHLLNFLLNMKTGELTDRK 569
+ + P S + A +++ L +G LL +L+ TG+LTD +
Sbjct: 647 LASISIQALTAPASSICMAEMHDATIDRNHATTFVNIGLQNGVLLRTVLDAVTGQLTDTR 706
Query: 570 KVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAF 629
LG++ + L + V A S R + ++ +L + + + H F++
Sbjct: 707 TRFLGSKAVRLIRTKVHGQSAVMALSTRTWLSFTYQDRLQFVPLIFDALDHAWSFSAELC 766
Query: 630 PDSLAIAKEGELTIGTIDDI-QKLHIRSIPLGEHPRRICH-----------QEQSRTFAI 677
P+ L L I T+ + KL S+ L PR+ H + + RT +
Sbjct: 767 PEGLIGIVGSTLRIFTMPSLASKLKQDSVALSYTPRKFAHHPNEQGLFYVVEAEHRTLSP 826
Query: 678 CSLKNQSCAEESEMH--------------------------FVRLLDDQTFEFISTYPLD 711
+ + ++ E E+ VR++D + LD
Sbjct: 827 GAQRRRTEMLEKELKPHQRGVLDLNPAEFGLIRAEAGNWASCVRVVDGVQSQTTHKLELD 886
Query: 712 TFEYGCSILSCSF-SDDSNVYYCVGTAY-VLPEENEPTKGRILVF-IVEDGK-LQLIAEK 767
E S+ F S + VG+A V+ K + + + +G+ L+++ +
Sbjct: 887 DNEAAFSVAVVPFASAEKEAMLVVGSAVDVVLSPRSFKKAYLTTYRLTNNGRELEVLHKT 946
Query: 768 ETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY-VQTRGD 826
E L F G+LLA I + +++Y D G ++L +C + A+ + +G
Sbjct: 947 EVDDIPLVLRPFQGRLLAGIGKALRIY-----DLGKKKLLRKCENKSFATAIVSLDAQGS 1001
Query: 827 FIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR- 885
IVVGD+ +SI YK E + A D +++ +LD D A+ N++ +R
Sbjct: 1002 RIVVGDMQESIIFTSYKPLENRLVTFADDVMPKFVTRCAMLDYDTVAAADKFGNVYVLRI 1061
Query: 886 -KNSEGATDEERGRLEVV-------GEYHLGEFVNRFRHGSLVMRLPDSDV--GQIPTVI 935
++ + DE+ + +V G H V F G +V L + + G ++
Sbjct: 1062 DADTSRSVDEDPTGMTIVHEKPVLMGAAHKATLVAHFFVGDIVTSLNRTVMVPGGREVLL 1121
Query: 936 FGTVNGVIGVIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNF 994
+ V+G IG + + E L L+++LR+ + G +H +RS K+
Sbjct: 1122 YTGVSGTIGALVPFVSKEDVDTLSTLESHLRQENSSLVGRDHLAYRS-----SYAPVKSV 1176
Query: 995 LDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
+DGDL E+F L + + I+ ++ E+ K++ +L
Sbjct: 1177 IDGDLCETFGLLPPAKQNAIATELDRKPSEINKKLAQL 1214
>gi|320164105|gb|EFW41004.1| splicing factor 3B subunit 3 [Capsaspora owczarzaki ATCC 30864]
Length = 1218
Score = 162 bits (410), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 259/1190 (21%), Positives = 467/1190 (39%), Gaps = 200/1190 (16%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
I+G + L FR G ++D+L + ++ + +L+++ E + R
Sbjct: 60 IFGVVRCLTAFRLTGASKDYLVVGSDSGRVAILEYNPTKYEFERVHLETYGKSGSRRIVP 119
Query: 66 GQIGIIDPDCRLIGLHLYDG--LFKVIPFDNKGQLKEAFNIRLEELQVL---DIKFLYGC 120
GQ D R I + + L V+ D+ +L + + + + G
Sbjct: 120 GQYLAADAKGRAIMISAVEKQKLVYVMNRDSTARLTITSPLDAHKAHAVVFATVGLDVGF 179
Query: 121 AKPTIVVLYQDNKDA-------------RHVKTYEVALKDKDFVEGPWSQNNLDNGADLL 167
PT + D ++A +H+ YE+ L V S L A++L
Sbjct: 180 EAPTFACVELDYEEADNDPSGEAVEATQQHLVFYELDLGLNHVVRK--SSEPLPGWANML 237
Query: 168 IPVPPP---LCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHA-- 222
I VP GV++ E +VY S P P+I R G R+ + D +
Sbjct: 238 IQVPGADDGPGGVVVCCEGFLVYKS-------PEDPAIDV---RCPFPGRRHDISDPSKR 287
Query: 223 GLLHLLVITHEKEK---------------------------VTGLKIELLGETSIASTIS 255
G +V+ H K V ++++L +A+++
Sbjct: 288 GAGRGIVVAHASHKTKTKFFFVVQLDNGDMFKVTLDVADDNVVSIRMKLFDTLPVATSLH 347
Query: 256 YLDNAVVYIGSSYGD------SQLIKLNLQPD-----------------AKG-----SYV 287
L + +++ + GD +QL + + +P+ A G ++V
Sbjct: 348 ILRSGHLFVAAELGDHHLYQITQLAENDDEPEFTTRGLVMNGRVVPSSVAHGGVELSAFV 407
Query: 288 EVLERYVNL---------GPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINE 338
+NL P + V DL + Q G + +LRI+R+G+ +++
Sbjct: 408 YTPRPLLNLVLLDVMESTAPTMQCRVEDLLGEDAPQFYLLCGRGPNSTLRILRHGLEVSQ 467
Query: 339 QASVELQGIK-GMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQ 397
A+ EL +WS++ S D DT+ +VS+ S T +L++ D++ G
Sbjct: 468 LAATELTASPVAIWSVKRSIHDIHDTYFIVSWASATVVLSVG--DQVAPVSDSGLLLTVG 525
Query: 398 TLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGH 457
T+ + L+QV + +R + + SR NEW++P + A N QV++A GG
Sbjct: 526 TIAVSRIGEDDLLQVYTDGIRHIRADSRV--NEWRTPGRRQIVRAAINDRQVVIALAGGE 583
Query: 458 LVYLEIG-DGILTE-VKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPD 515
LVY E+ G L E + E+ L I P+ + ++ AV D +VRI SL
Sbjct: 584 LVYFELDITGQLNEFAERFTSSAEVCALAIAPVPADRRRARFLAVAA-EDNTVRIVSLEL 642
Query: 516 LNLITKEHLGGEIIP-RSVLLCAFEGISY--------LLCALGDGHLLNFLLNMKTGELT 566
+ T + LG + + R+ LC F S L L +G LL + +G LT
Sbjct: 643 SD--TLQSLGVQTVADRASSLC-FADPSLDNSSADLVLGVGLQNGILLRTSVEPSSGNLT 699
Query: 567 DRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNS 626
D + LGT+P+ L A S RP + Y + ++ + F++
Sbjct: 700 DTRTRYLGTRPVMLFPSKVHGAAGFLALSSRPWIFYVQQGTPTLTPLSYDALDCAATFSA 759
Query: 627 AAFPDSLAIAKEGELTIGTIDDIQKL-HIRSIPLGEHPRRIC----HQ------------ 669
A P+ L L + + + + + S+PL PRR H+
Sbjct: 760 ANCPNGLVAIAGNTLRVLNLPRLGSVFNQTSMPLKYTPRRFIVDSEHRLLVIAESDHNTF 819
Query: 670 -------EQSRTFAICSLKNQSCAEE-SEMHF-------------VRLLDDQTFEFISTY 708
EQ+R A + + AE + F +R++D Q E +
Sbjct: 820 AAGRKAVEQARVAAETNTTQAAIAESLPDAVFGAPAAGAARWGSCLRIVDAQLRETLELI 879
Query: 709 PLDTFEYGCSILSCSF-SDDSNVYYCVGTAYVLPEENEPTKGRILV---FIVEDGKLQLI 764
LD E S+ SCSF + ++ VGT L + +G L F + +L +
Sbjct: 880 ELDEDEGVFSMTSCSFHGHEGQTFFIVGTTKALNLQTRAHQGGCLYTYRFNPQTRRLDFV 939
Query: 765 AEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHG-HILALYVQT 823
E + ++ F G+LL + +++Y D G R+L +C + + T
Sbjct: 940 HRTEVEDVPGAMYPFQGRLLVGVGSLLRVY-----DLGKRKLLRKCENRSIPNFVSSITT 994
Query: 824 RGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFT 883
G IVV D+ +S L Y+ + + A D N W+++ ++D D + N+F
Sbjct: 995 SGGRIVVTDVQESFHFLRYRPSDNMLAVFADDSNPRWVTSSTMVDYDTVAAGDKFGNVFV 1054
Query: 884 VR-------------------KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLP 924
+R +S+ + + + + +H+G+ V + +L+ P
Sbjct: 1055 LRLPQDLQDDLEDDPTGGRLLGSSKNTLNGAAQKADTIINFHVGDTVTTMQKTALI---P 1111
Query: 925 DSDVGQIPTVIFGTVNGVIGVIASLPHEQYL-FLEKLQTNLRKVIKGVGGLNHEQWRSFN 983
+++ T G IGV+ + + F + L+ ++R+ + G +H +RS
Sbjct: 1112 SGS----ECLVYTTTLGAIGVLIPFTTQSDIDFFKHLEMHMRQENPPICGRDHLAFRS-- 1165
Query: 984 NEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTM-NVSVEELCKRVEEL 1032
+KN +DGDL E F L + I+ + + + E+ K++E+L
Sbjct: 1166 ---HYFPSKNVIDGDLCEQFNSLDPHKKRLIAGDLADRTPSEVSKKLEDL 1212
>gi|47212603|emb|CAF93045.1| unnamed protein product [Tetraodon nigroviridis]
Length = 1171
Score = 161 bits (408), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 235/1076 (21%), Positives = 422/1076 (39%), Gaps = 207/1076 (19%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+L + ++G I +L FR G +D++ I ++ + +L++ + S + + + + G
Sbjct: 53 LLTMEVFGIIRSLMAFRLTGGTKDYIVIGSDSGRIVILEY-SPSKNMFEKIHQETFGKSG 111
Query: 61 -RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
R GQ +DP R +IG L ++ D +L + + + L +
Sbjct: 112 CRRIVPGQFLAVDPKGRAVMIGAIEKQKLVYILNRDAAARLTISSPLEAHKANTLVYHVV 171
Query: 118 ---YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNG 163
G P L D ++A + T E A + F E N+ L+
Sbjct: 172 GVDVGFENPMFACLEMDYEEADNDPTGEAAANTQQTLTFYELDLGLNHVVRKYSEALEEH 231
Query: 164 ADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR--- 215
+ LI VP P GVLI E I Y N IR I + +D D R
Sbjct: 232 GNFLITVPGGSDGP--SGVLICSENYITY--KNFGDQPDIRCPIPRRRNDLD-DPERGMI 286
Query: 216 --------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAV 261
+L G + + + ++E VT ++++ +A+ + L
Sbjct: 287 FVCSATHKTKSMFFFLAQTEQGDIFKVTLETDEEMVTEIRLKYFDTIPVATAMCVLKTGF 346
Query: 262 VYIGSSYGDSQLIKLN------------------------LQPDAKGSYVEVLERYVNLG 297
+++ S +G+ L ++ QP + V V E+ +L
Sbjct: 347 LFVASEFGNHYLYQIAHLGDDDDEPEFSSAMPLEEGDTFFFQPRPLKNLVLVDEQE-SLS 405
Query: 298 PIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSS 356
PI+ + DL + Q+ G +LR++R+G+ ++E A EL G +W++R
Sbjct: 406 PIMSCQIADLANEDTPQLYVACGRGPRSTLRVLRHGLEVSEMAVSELPGNPNAVWTVRRH 465
Query: 357 TDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGS 416
+D FD +++VSF++ T +L++ + +EE GF T TL C + LVQV
Sbjct: 466 VEDEFDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSLLGDDALVQVYPDG 523
Query: 417 VRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKHA 474
+R + + R NEWK+P ++ N QV++A GG LVY E+ G L E +
Sbjct: 524 IRHIRADKR--VNEWKTPGKKTIVRCAVNQRQVVIALTGGELVYFEMDPSGQLNEYTERK 581
Query: 475 QLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPD---LNLITKEHLGGEIIPR 531
++ ++ C+ + + S+ AVG+ D +VRI SL L ++ + L + P
Sbjct: 582 EMSADVVCMSLANVPPGEQRSRFLAVGL-ADNTVRIISLDPSDCLQPLSMQALPAQ--PE 638
Query: 532 SVLLCAFEGIS--------------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQP 577
S+ + G+ YL L +G LL +L+ TG+L+D + LG++P
Sbjct: 639 SLCIVEMGGVEKQDELGEKASIGFLYLNIGLQNGVLLRTVLDPVTGDLSDTRTRYLGSRP 698
Query: 578 ITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAK 637
+ L + V A S R + YS + + ++ + + + F S P+ +
Sbjct: 699 VKLFRVRMQGQEAVLAMSSRSWLSYSYQSRFHLTPLSYETLEYASGFASEQCPEGIVAIS 758
Query: 638 EGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQE-----------QSRTFAICSLKNQS 684
L I ++ + + + + PL PR+ + H E + T A + + Q
Sbjct: 759 TNTLRILALEKLGAVFNQVAFPLQYTPRKFVIHPETNNLVLIESDHNAYTEATKAQRKQQ 818
Query: 685 CAEE-------------SEM-------------------------HFVRLLDDQTFEFIS 706
AEE +EM VRL++ +
Sbjct: 819 MAEEMVEAAGEDERELAAEMAAAFLNENLPEAIFGAPKAGAGQWASLVRLVNPIQGSTLD 878
Query: 707 TYPLDTFEYGCSILSCSFSDDSNVYYC-VGTAY-VLPEENEPTKGRILVFIVEDG--KLQ 762
L+ E S+ C F + + +Y VG A ++ T G I + + G KL+
Sbjct: 879 QVQLEQNEAAFSVAVCRFPNTGDDWYVLVGVARDMILNPKSVTGGFIYTYRLVAGGEKLE 938
Query: 763 LIAEKETKGAVYSLNAFNGKLLAAINQKIQLY----KWMLR-------------DDGTRE 805
+ + + ++ F G++L + + +++Y K +LR D+GT
Sbjct: 939 FVHKTPVEDVPLAIAPFQGRVLVGVGKLLRIYDMGKKKLLRKCENKPARRRRFQDEGTTS 998
Query: 806 LQSECGHHG-----------------------------------HI--LALYVQTRGDFI 828
L + C H G HI L + T G +
Sbjct: 999 L-AHCNHVGPLDGPSSDPGPQLGQLTASGGPMTTSSQPIRQSRQHIPNLVTGIHTTGQRV 1057
Query: 829 VVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTV 884
+V D+ +S+ + Y+ E + A D W++ +LD D A+ N+ V
Sbjct: 1058 IVTDVQESLFWVRYRRNENQLIIFADDTYPRWVTTACLLDYDTMASADKFGNISIV 1113
>gi|168031491|ref|XP_001768254.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162680432|gb|EDQ66868.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1391
Score = 159 bits (401), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 220/990 (22%), Positives = 390/990 (39%), Gaps = 192/990 (19%)
Query: 200 PSITKAYGRVDADG-SRYLLGDHAGLLHLL-VITHEKEKVTGLKIELLG-ETSIASTISY 256
P+IT A+G SR L G +HL +I + V ++++ + S + + +
Sbjct: 378 PTITAWSWEPYAEGQSRLALAMDTGEIHLARLIFESPDDVPRIEVQQRQYKCSPCNVVLW 437
Query: 257 LDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVV 316
++ + GD Q+++ + + K + +++ N+ PI+DF + D + Q Q+
Sbjct: 438 TKGGLLAVFVEMGDGQVLQCS--DNGKLIFKSLIQ---NVAPILDFSLADYHNEKQDQMF 492
Query: 317 TCSGAYKDGSLRIVRNGIGINE--QASVELQGIKGMWSLRSSTDDPFDTFLVVSFISETR 374
CSGA +GSLR++RNGI + + S QG+ G +++R DP+ F V+SF+ ETR
Sbjct: 493 ACSGAGNEGSLRVIRNGISVEKLYTTSPIYQGVTGTYTMRMCCRDPYHAFFVMSFVQETR 552
Query: 375 ILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSR--------- 425
+L++ L + ++ TE GF TL C +VQV S V + T
Sbjct: 553 VLSVGL-NFVDITEAVGFQPCASTLACGTIEDYHVVQVCSKEVIVCVPTKTAHPAGIDSP 611
Query: 426 -ELRNEWKSPPGYSVNV-ATANASQVLLATGGGHLVYLEIGDGI-----LTEVKHAQLEY 478
+ WK P G V++ A A+ + VL + G +V L G L + +L+
Sbjct: 612 LPFCSSWKPPQGLVVSLGAVASKAIVLALSKPGLIVMLGSQRGANGALELCMTQQCELKA 671
Query: 479 EISCLDI---------------------NPIGENPS---YSQIAAVGMWTDISVRIFSL- 513
E+SC+ I P NPS +I VG + SV + S+
Sbjct: 672 ELSCISIPDEEDWTSSPLPPSIVGLVEGTPKSRNPSGVEVGRICVVGT-HEPSVEVLSIV 730
Query: 514 --PDLNLITKEHLG---------GEIIPRSVLLCAFEGISYLLCALGDGHLLNF------ 556
L + H+ +P SV L F+ + Y+L L +G LL +
Sbjct: 731 PGEGLAPLAVGHISLVSCVGTTLSGCVPESVRLAQFDRL-YILAGLRNGMLLRYEWPASS 789
Query: 557 ---------LLNMKTGE---------------LTDRKKV--------SLGTQPITLRTFS 584
LL+ E L D V +G P++L +
Sbjct: 790 TATLPDCTNLLSTSDWENIGITQPNLGGDKDVLEDSSPVLLHLVAVRRMGVSPVSLISLQ 849
Query: 585 SKNTTHVFAASDRPTVIYSS--NKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELT 642
+ + V A SDRP ++ ++ ++++ +++++ SH P NS P+ + + L
Sbjct: 850 ASLSADVIALSDRPWLLQTARHSQRIAHTSISFPSSSHAAPVNSVDCPNGILFVADCSLH 909
Query: 643 IGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTF 702
+ ++ +++L+++ +PLG PRR+ + +S+T + ++ + + V +D +
Sbjct: 910 LVEMEHLKRLNVQKLPLGRTPRRVLYHTESKTLIV--MRTDYGPDGGLVSDVCCVDPLSG 967
Query: 703 EFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAY------VLPEENEPTKGRILVFIV 756
S Y LD E SI + VGT+ + E E KGR+LVF +
Sbjct: 968 ANYSCYTLDAGEVARSIQ--LWKRRQEQLLLVGTSLIGGGGIMSSGEAESAKGRLLVFQL 1025
Query: 757 ------------------------------------------------EDGKLQLIAEKE 768
E +L+L
Sbjct: 1026 LSKHVGTHSQPVMSSTNTPTLSNQSTPGSSAADPMVLSESDESDLTDGEGWELRLKTHII 1085
Query: 769 TKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFI 828
GAV S++++ G+ + A L+ R D + L+ + + I
Sbjct: 1086 LPGAVLSVSSYLGQYVLASAGNC-LFCLGFRPDSPQRLRRMAMVKTRFMITSLSVHLSRI 1144
Query: 829 VVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDD--IYLGAENNFNLFTVRK 886
VGD I Y+ G +E ++ ++D D + NF F+
Sbjct: 1145 AVGDCRDGILFYTYQEVSGQLELLYCGGIQQLVADCVLMDTDTAVVTDRRGNFCTFSSAS 1204
Query: 887 NSEG--------ATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLP---------DSDVG 929
EG + ER L + YH+GE + R S P +D
Sbjct: 1205 TPEGDLNFFFAESVSPER-NLSLGCWYHIGETLMRIHKASFAYESPAEESMKNCGSNDAI 1263
Query: 930 QIPT---VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLR--KVIKGVGGLNHEQWRSFNN 984
PT V+ ++ G + + + E+Y L+ +Q+ L + + G NHE +R
Sbjct: 1264 AHPTHSSVVASSLLGSVFIFIKVTREEYDLLKAVQSRLAHYPITTPLLGNNHEDYR---G 1320
Query: 985 EKKTVDAKNFLDGDLIESFLDLSRTRMDEI 1014
+ LDGD++ FL+L+ + + +
Sbjct: 1321 QGCPAGVCQVLDGDMLCQFLELTSAQQENV 1350
>gi|443922899|gb|ELU42250.1| splicing factor 3B subunit 3 [Rhizoctonia solani AG-1 IA]
Length = 1212
Score = 158 bits (400), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 262/1207 (21%), Positives = 467/1207 (38%), Gaps = 239/1207 (19%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
++G I ++ FR G +D+ + ++ + VL++D +++ R
Sbjct: 58 VFGTIRSIAAFRLTGGTKDYCIVGSDSGRIIVLEYDPKTNSFTKLQQETFGKSGSRRIVP 117
Query: 66 GQIGIIDPDCRLIGLHLYDG--LFKVIPFDNKGQLKEAFNIRLEE--------------- 108
G++ +DP R + + + L V+ D L + +
Sbjct: 118 GEMLAVDPKGRSVMISAVEKSKLVYVLNRDAAANLTISSPLEAHRNGAIIHAIVGVDVGF 177
Query: 109 ----LQVLDIKFLYGCAKPTIVVLYQDNKDARHVKT-YEVALKDKDFVEGPWSQNNLDNG 163
L++ + A PT Q ++A + T YE+ L V S D
Sbjct: 178 ENPLFAALEVDYTESDADPT----GQAFRNAEKMLTFYELDLGLNHVVRK--SSEPTDRR 231
Query: 164 ADLLIPVP------------PPLCGVLIIGEETIVYCSANA-FKAIPIRPSITKAYGRVD 210
A+LL+ VP P GVL+ E+ I+Y + A+ R I +
Sbjct: 232 ANLLVQVPGGQSATTNAFDGP--SGVLVCCEDHIIYRPVDLDGGAVSHRVPIPRRRTPWG 289
Query: 211 ADGSRYLLGDHAGL-------LHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVY 263
D R L+ A + L+ + + HE E V + I+ A+++ L + ++
Sbjct: 290 GDEERGLIITSAVMHKMKVCDLYKVTLEHEDEIVKSMTIKYFDTVPTATSLCILKSGFLF 349
Query: 264 IGSSYGDSQLIKLN-------------------------LQP--------------DAKG 284
+ + G+ L + L P D
Sbjct: 350 VAAEGGNHHLYQFQKLGDDDAEPEFTSNSYSNNGISSSPLPPCYFRPHPLDNLVLADEMT 409
Query: 285 SYVEVLE-RYVNL-GP-----------------IVDFCVVDLERQGQG-QVVTCSGAYKD 324
S+ +L+ R +NL GP IV + +V Q Q G
Sbjct: 410 SFCPILDARILNLFGPTSGGMSAFTQRSDEPSLIVSYRIVPATLQSDTPQFYLACGRGAR 469
Query: 325 GSLRIVRNGIGINEQASVELQGI-KGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDE 383
S R++R+G+ + E S EL G+ G+W+++ + D +D ++++SF++ T +L++ +
Sbjct: 470 SSFRMLRHGLEVEESVSSELPGVPNGVWTVKVNDGDQYDKYIILSFVNGTLVLSIG--ET 527
Query: 384 LEETEIEGFCSQTQTLFCHDAIYNQ-LVQVTSGSVRLVSSTSRELRN-----EWKSPPGY 437
+EE GF S TL N L+QV G VR V + +L + EW++P G
Sbjct: 528 IEEVADTGFLSSAPTLAVQQLADNGGLIQVHPGGVRHVRPSQGDLASTAGVTEWRAPQGR 587
Query: 438 SVNVATANASQVLLATGGGHLVYLEIG-DGILTEVKHAQ-LEYEISCLDIN--PIGENPS 493
V AT N QV +A G LVY E+ +G+L E + + + C+ I P G P+
Sbjct: 588 QVASATTNTRQVCVALNSGELVYFELDLNGVLQEYGEMRAVGSAVVCMSIAEVPEGRQPN 647
Query: 494 YSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSV--LLCAFEGISYLLCALGDG 551
S F P ++L G + R V L+ + ++ A G
Sbjct: 648 GSN-------------HFLGPRVDLGNHLVTG---LSRHVWMLVVGLTNETSVVGAHGTS 691
Query: 552 HLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYS 611
++ F LGT+P+ L S + T + A S R + Y+ L +S
Sbjct: 692 NVYFF----------------LGTRPVRLTRVSVQRNTCILALSSRSWLNYTHQNLLRFS 735
Query: 612 NVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQ-KLHIRSIPLGEHPRRICHQE 670
+ + + H F++ P+ L L I I + KL ++PL PR+I
Sbjct: 736 PLIYENLDHAWSFSAELCPEGLIGIAGSVLRIFQIPKLSDKLKQVTMPLSYTPRKIAVHP 795
Query: 671 QSRTFAICSLKNQSCAEESE----MHFVR--------LLD-------------------- 698
+ + F + +++ E++ VR L+D
Sbjct: 796 EHQLFYVIESDHRTWGSEAKNKRLAELVRAGRQIDQELVDLPAEDFGLPRAGAGQWASCI 855
Query: 699 ---DQTFEFIST----YPLDTFEYGCSILSCSFSDDSN-VYYCVGTAY---VLPEE--NE 745
D T F S LD E S+ F+ N ++ VGTA VLP +
Sbjct: 856 RIIDPTEVFCSATLFKIELDNNESAFSVAVVPFAARENELFLVVGTAKDTNVLPRQCVGA 915
Query: 746 PTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRE 805
T G ++ L + E +L G+L A + + +++Y+ G ++
Sbjct: 916 VTSGSLVKLGWSTHILTRPIQTEVDDVPLALLGIKGRLCAGVGKALRIYEM-----GKKK 970
Query: 806 LQSECGHHGHILALYVQT-RGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAV 864
L + + G A+ T +G I+VG++ +S+ YK E + A D +A W+++
Sbjct: 971 LLRKSENKGFATAIVTLTSQGSRIIVGEMQESVHYATYKPESNRLLVFADDTSARWVTSA 1030
Query: 865 EILDDDIYLGAENNFNLFTVR---KNSEGATDEERG---------------RLEVVGEYH 906
++D D + N+F R S+ D+ G + +++ Y+
Sbjct: 1031 ALVDYDTVAVGDKFGNIFVNRLPANISQQVDDDPTGAGIMHEREFLHGAPHKTKLLAHYN 1090
Query: 907 LGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYL-FLEKLQTNLR 965
+G+ V +LV G V + ++G IGV+ L ++ + F+ L+ ++R
Sbjct: 1091 VGDIVTSVHRAALVP-------GGRDVVAYTGLHGTIGVLIPLASKEDVDFITTLEQHMR 1143
Query: 966 KVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEEL 1025
+ G +H +R + V K +DGDL E F L T+ I+ ++ +V E+
Sbjct: 1144 SEHSSLVGRDHLAYRGY-----YVPVKAVVDGDLCERFAMLPSTKQKSIAGELDRTVGEV 1198
Query: 1026 CKRVEEL 1032
K++E L
Sbjct: 1199 LKKLEGL 1205
>gi|302822731|ref|XP_002993022.1| hypothetical protein SELMODRAFT_136264 [Selaginella moellendorffii]
gi|300139222|gb|EFJ05968.1| hypothetical protein SELMODRAFT_136264 [Selaginella moellendorffii]
Length = 1277
Score = 157 bits (396), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 196/853 (22%), Positives = 345/853 (40%), Gaps = 135/853 (15%)
Query: 269 GDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLR 328
GD ++K+ D + S+ ++ N+ PI+DF +VD + Q Q+ C G ++GS+R
Sbjct: 402 GDGHVLKVE---DGQLSFQSFVQ---NIAPILDFSLVDYYGEKQDQMFACCGGDEEGSVR 455
Query: 329 IVRNGIGINEQASVE--LQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEE 386
I+RNG + + QG+ G+W++R DP+ F ++SF+ ETR+L++ L + ++
Sbjct: 456 IIRNGNSVEKLICTPPVYQGVSGIWTMRYRFKDPYHAFFLISFVEETRVLSVGL-NFVDI 514
Query: 387 TEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSST----------SRELRNEWKSPPG 436
T+ GF SQ TL C + QV V+L S T S L W+ PG
Sbjct: 515 TDAVGFESQVNTLACGLVEDGWVAQVWRYEVKLCSPTKAAHPAGVSGSSPLSTTWRK-PG 573
Query: 437 YSVNVATANASQVLLATGGGHLVYLEIGDGI-------LTEVKHAQLEYEISCLDINPIG 489
Y ++V S+V+LA L+ L +G L E++ +E EISC+ I P G
Sbjct: 574 YPISVGAVCRSRVILALARPGLL-LMLGTTQTSAESFELVELQLCMMEAEISCISI-PQG 631
Query: 490 E--NPSYSQIAAVGMWTDISVRI----------------------------FSLPDLNLI 519
+ P IA + + + F++ ++LI
Sbjct: 632 DISIPVPPTIAGLHAGNTVPAGVDLGNVCVVGTHKPSVELLSIVPGDKFAPFAVGQVSLI 691
Query: 520 TKEHLGGEI---IPRSVLLCAFEGISYLLCALGDGHLLNF--------LLNMKTGELTDR 568
T +G + IP+ V L F+ + Y+L L +G LL + L+ K EL
Sbjct: 692 TS--VGTAVSGCIPQDVRLALFDRL-YILAGLRNGMLLRYEWPEDTPSLVLSKPAELHLI 748
Query: 569 KKVSLGTQPITLRTF-SSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSA 627
LG P+ L + + A SDRP ++ + K++ Y++++ + +H P S
Sbjct: 749 AARRLGVSPVCLVPLEACALRADIIALSDRPWLLQMA-KRISYTSISFQPSTHATPVCSK 807
Query: 628 AFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQ---- 683
P + + L + ++ + L+++ + LG RR+ + +S + L ++
Sbjct: 808 DCPKGIIFLADCSLHLVEMEQSRTLNVQKLRLGCTGRRVLYHPESGVLIVLRLLSEHRSD 867
Query: 684 -SCAEESE-----------------MHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFS 725
C E M ++L D+Q + T DT ++++ +
Sbjct: 868 VCCIEPLSGAVLCVHPFGVGQIVKCMELMKLGDEQLL-LVGTAS-DTRR---AVMATGEA 922
Query: 726 DDSNVYYCVGTAYVLPEEN--------------------------EPTKGRILVFIVEDG 759
+ Y+CV P + E + G +VF +D
Sbjct: 923 ERQAFYFCVSNTGYFPSSSRGVLVVLYLDAPPPPSPHSPMSSPASESSGGASIVFQPDDY 982
Query: 760 KLQLIAEKETKGAVYSLNAFNGK-LLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILA 818
A G V ++ ++ G+ +LA + D+ R + +
Sbjct: 983 CFVPRANVGLPGPVNAVASYLGQYVLACAGNHLFCLGIASMDESPRRWKKLASIKTRFVI 1042
Query: 819 LYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGA--E 876
+ R I VGD + L Y+ + +E D +S ++D D +
Sbjct: 1043 TSISVRFTTIAVGDCRDGVLLFTYREDSKKLEPIRCDPMRRLVSDCTLVDVDTAVVVDRH 1102
Query: 877 NNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFR--HGSLVMRLPDSDVGQIPTV 934
NF + + +EG E+ LE +H+GE R S + S + V
Sbjct: 1103 GNFCALSTNEETEGNGSPEK-NLEAHCWFHIGEVCTTVRKVRTSRFLCKDTSRECSVSCV 1161
Query: 935 IFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNF 994
I T+ G + + + E+Y L+ LQ L + L ++ R F + + K
Sbjct: 1162 IATTLLGSVFIFVRITGEEYSLLQALQRRLSFLPATAPVLGNDHAR-FRGQGRPAGVKEV 1220
Query: 995 LDGDLIESFLDLS 1007
LDGDL+E FL+L+
Sbjct: 1221 LDGDLLEQFLELT 1233
>gi|281202530|gb|EFA76732.1| CPSF domain-containing protein [Polysphondylium pallidum PN500]
Length = 933
Score = 156 bits (395), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 134/577 (23%), Positives = 254/577 (44%), Gaps = 88/577 (15%)
Query: 304 VVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSLRSSTDDPFD 362
V+DL + Q Q+ +G ++R++R G+ I + A L GI G+W++ S +D D
Sbjct: 246 VMDLVNEEQPQIYALTGVADRSAMRVLRYGLPIAQIAGTPLPGIPSGLWTIPRSQEDSID 305
Query: 363 TFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSS 422
++VVSFI T +L++ + +EE G + T ++ + ++QV +R +
Sbjct: 306 KYIVVSFIGSTLVLSVG--ETVEEVVDSGILATTTSILVRPIGADSIIQVFPHGIRHIKG 363
Query: 423 TSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTEVKHAQLEYEIS 481
R NEW++P ++++AT+NA+QV++A GGG ++Y E+ G L+EV + EIS
Sbjct: 364 DRR--INEWRAPGRKTISLATSNATQVVIALGGGEVIYFELDAAGNLSEVAKKEFRREIS 421
Query: 482 CLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITK------EHLGGEIIPRSVLL 535
L+I P+ + ++ A+G W + VR+ SL NL+ + E + E + + +
Sbjct: 422 ALEIAPVRKGRQMARFVAIGDW-EGPVRVLSLDKDNLLNQVSVLDTERMHIESLMMNEMT 480
Query: 536 CAFE-----------------------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVS 572
E + +L L +G + +++ TGEL+D +
Sbjct: 481 IGVEMPEGTASSSSAIAKQQQLQNQSNQVLFLNVGLKNGVMKRAVVDPITGELSDTRTRL 540
Query: 573 LGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDS 632
LG +P+ K+ V A S R + Y + + V+++ + + F+S PD
Sbjct: 541 LGRKPVKFSRVKLKSGNAVLALSSRVWLCYVNQGRYDMVPVSVEPLDNASGFSSEHCPDG 600
Query: 633 LAIAKEGELTIGTIDDI-QKLHIRSIPLGEHPRR-ICHQEQSRTFAICSLKN-------- 682
+ E L I I+ + +K + ++P PRR + H + +I + N
Sbjct: 601 IVATSENNLKIFVIEKLSEKFNQVNVPYKCTPRRFVVHPQTHYIVSIETEHNYLNELPTP 660
Query: 683 -----QSCAEESE---------------------------------MHFVRLLDDQTFEF 704
AE+ + + ++R+L+ ++
Sbjct: 661 KENGSNKMAEDKDPEVKREGDDMDYEVDQFSLNQNDKPPKAGDGKWLSYIRVLEPVSYRT 720
Query: 705 ISTYPLDTFEYGCSILSCSFSD-DSNVYYCVGTAYVL---PEENEPTKGRILVFIVEDGK 760
+ LD E S+ +C F D + V+ VG + P + E + F
Sbjct: 721 LDLVKLDQDEAAYSLATCIFHDREGEVFLAVGCGKGVQLNPRKVESASIHLYRFTNNGQT 780
Query: 761 LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWM 797
LQL+ + E + Y+++ F G+LL I ++++Y+ +
Sbjct: 781 LQLVYKTEVEEVPYAISHFQGRLLVGIANQLRIYEMV 817
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 39/140 (27%), Positives = 67/140 (47%), Gaps = 15/140 (10%)
Query: 896 RGRLEV--VGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIAS-LPHE 952
+GRL V + + E VN SL VG +++ T+NG IG + + E
Sbjct: 800 QGRLLVGIANQLRIYEMVNHISKTSL-------SVGGPEVLVYATLNGTIGALVPFVSRE 852
Query: 953 QYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMD 1012
F L+ +R+ + G +H +RS+ KN +DGDL E ++ L T+
Sbjct: 853 DVDFYTSLELQMRQENPPLCGRDHLAYRSY-----YFPVKNVIDGDLCEQYISLDPTKQQ 907
Query: 1013 EISKTMNVSVEELCKRVEEL 1032
I++ ++ S E+ K++E+L
Sbjct: 908 SIAEELSRSPSEILKKLEDL 927
>gi|399216895|emb|CCF73582.1| unnamed protein product [Babesia microti strain RI]
Length = 1232
Score = 155 bits (391), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 249/1215 (20%), Positives = 462/1215 (38%), Gaps = 233/1215 (19%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
+YG I + FR G ++D+L + ++ + +L+++ + R + + G N
Sbjct: 58 VYGIIRCIAPFRLTGASRDYLVVTSDSGRIVILEFNTAKNNF-DRVHCETYGKSGNNGYN 116
Query: 66 --------GQIGIIDPDCRLIGLHLYD--GLFKVIPFDNKGQLKEAFNIRLEE------- 108
G +DP R + L + ++ DNK L + + +
Sbjct: 117 LGNRRIVPGHFLGVDPKGRALMLAALERQKFVYILNRDNKSNLTISSPLEAHKSHSICHY 176
Query: 109 LQVLDIKFLYGCAKPTIVVLYQD----NKDARHVKTYEVALKDKDFVEGPWSQNNL---- 160
L +D+ F P V + Q+ N D + + K F E N++
Sbjct: 177 LTGIDVGF----DNPIFVTIEQNYATVNGDLSSSSVGDESKKYLSFWEMDLGLNHVTKKI 232
Query: 161 ----DNGADLLIPVPPPLCG-------VLIIGEETIVYCSANAFKAIPIRPSITKAYGRV 209
D A +L+P+ CG VL+ E ++Y P P + AY R
Sbjct: 233 SMQTDATAHMLVPI----CGGTDGPGGVLVCCENYLIYTK-------PDHPELCCAYPRR 281
Query: 210 DADGSRYLLGDHAGLLHLLV----ITHEKEKVTGLKIELLGE--------------TSIA 251
+ +A ++H +V I + E LK+E++ IA
Sbjct: 282 LECSQDTGILINAWVVHKMVDFFFILLQTEYGDLLKVEIVSNDQTVKEIVCRYFDTVPIA 341
Query: 252 STISYLDNAVVYIGSSYGDS---QLIKL---------NLQPDAKGSYVE----------V 289
+++ L +++GS +G+ Q I L +L P+ + V +
Sbjct: 342 NSLCVLKTGYLFVGSEFGNHCFYQFISLGNDETTACTSLHPEGGNAIVAFNPRSPKNLLL 401
Query: 290 LERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-K 348
++ +L P+VD + D ++ G + +I+R GI + E A+ EL G +
Sbjct: 402 VDELTSLSPVVDMKIADARGLDTSEIYLACGRGSRSTFQILRPGISVEELANNELPGYPR 461
Query: 349 GMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYN- 407
++SL+ D +++V F E L +++ D +EE F ++T TL C +Y+
Sbjct: 462 YVFSLKDKNADDCVGYIIVVF--EGNTLVLSVGDAVEEVSNSFFNTETTTL-CAMLMYDD 518
Query: 408 QLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDG- 466
+Q+ G +R + +EWK P + +AN Q+L+A GG ++Y EI +
Sbjct: 519 SFIQIHEGGIRHIIDNHV---SEWKPPTSKRIKCCSANTRQILIALSGGEVIYFEIDESH 575
Query: 467 ILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPD---LNLITKEH 523
L E+ +I+C+ I + + YS A+ ++ V++ SL L ++ +
Sbjct: 576 ALVEIFKRNFGVDITCMAIQAVPTDRVYSSFGAISGLDNV-VKLISLEKEKGLKQLSTQL 634
Query: 524 LGGEIIPRSVLLCAFEGIS------------YLLCALGDGHLLNFLLNMKTGELTDRKKV 571
L SV + + + +L+ L G ++ ++ TG L+D+
Sbjct: 635 LPNNATAESVCIAQIDSLVRDAGHMRRNSSLFLVVGLNTGVMIRSNIDAITGALSDQHSR 694
Query: 572 SLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPD 631
LG + + + A SD+ + Y+ KL S +N V H+ F S+ D
Sbjct: 695 FLGARAVRFSLVKVGDGMGFMAMSDKSWLCYAHQGKLFTSYINYDMVEHVASFCSSHCSD 754
Query: 632 SLAIAKEGELTIGTIDDIQKLHIRSI-PLGEHPRRI------------------------ 666
L I ++ + +S+ PL PR+I
Sbjct: 755 GFVAISGNSLRIYRCLNLGQEFSQSVAPLTYTPRKISILPSLSPITDNGTETGVTKNRHM 814
Query: 667 ---------CHQEQSRTFAICSLKN----QSCAEESEMHF-----------VRLLDDQTF 702
+ E +RT L+N AE+ E+ VR++D T
Sbjct: 815 LAIVECDHNTYDELTRTEIKKGLENIMPEGEQAEDVELGLYKAGEGKWGSCVRIIDPTTL 874
Query: 703 EFISTYPLDTFEYGCSILSCSFSDDSNVYYCVG----TAYVLPEENEPTKGRILVFIVED 758
+ LDT E S +C D Y C+ T + L N + + +
Sbjct: 875 STAAKLLLDTDEAAISCCAC----DLEGYRCLAVGTVTGWNLANSNSNSCHIRMYAYGPN 930
Query: 759 GKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHIL- 817
++ + + G +L A+ G+LLA + + LY G R+L + + G ++
Sbjct: 931 FEITFLHSTKVTGIPRALLAYEGRLLAGVGPDVILYAL-----GKRQLLKKAEYRGGVID 985
Query: 818 -------------------ALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNA 858
+++ G+ I VGD+ +SI++L + + + D
Sbjct: 986 IQGYGVATPRTIGNGGLFGVMWLGASGNRIFVGDIRESITVLKFDEQMAKLSLICDDIRP 1045
Query: 859 NWMSAVEILDDDIYLGAENNFNLFTV-RKNSEGATDE-----ERGRLEVV---------- 902
W++ +LD + + F+ F V R SE + G LE V
Sbjct: 1046 RWITGATVLDHHT-VALVDKFDTFAVCRVPSEASASNLSSALNSGSLEAVMPTILGVGNK 1104
Query: 903 ----GEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIG-VIASLPHEQYLFL 957
++HLG+ +L G V++ T+ G IG +I + ++ L
Sbjct: 1105 FEQEAQFHLGDLSTCIDKVTLCS-------GCTEAVVYATILGSIGALIPFISSDELDTL 1157
Query: 958 EKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKT 1017
+ L+ + + G H +RS+ + V +DGDL E F L I+
Sbjct: 1158 QHLELLMANENPPLSGREHSIYRSYYGPVQHV-----IDGDLCEEFESLDSITQSRIAAK 1212
Query: 1018 MNVSVEELCKRVEEL 1032
++ +V E+ K++ ++
Sbjct: 1213 IDKTVTEIIKKLRDI 1227
>gi|340367933|ref|XP_003382507.1| PREDICTED: splicing factor 3B subunit 3-like isoform 1 [Amphimedon
queenslandica]
Length = 1214
Score = 154 bits (390), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 173/770 (22%), Positives = 333/770 (43%), Gaps = 90/770 (11%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
++ ++G I +L FR G +D++ + ++ + +L++D + + + + + G
Sbjct: 53 LVSTEVFGLIRSLITFRLTGSTKDYIVLGSDSGRIVILEYDPVKN-VFEKVHQETYGKSG 111
Query: 61 -RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVL----- 112
R GQ +DP R +IG L ++ D+ +L + + + L
Sbjct: 112 CRRIVPGQYLAVDPKGRALMIGAVEKQKLVYILNRDSAARLTISSPLEAHKSHTLVYHVV 171
Query: 113 --DIKF---LYGCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNN-------- 159
D+ F L+ C + + Y+++ + + +VA ++ + E N+
Sbjct: 172 GVDVGFENPLFACLE----LDYEESDNDHTGEAAQVAQQNLTYYELDLGLNHVVRKYSEP 227
Query: 160 LDNGADLLIPVPPPL---CGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVD-ADGSR 215
LD+ A++LI VP GVL+ E I+Y N IR I + +D AD S
Sbjct: 228 LDDMANMLITVPGGTDGPSGVLVCCENYIIY--KNFGDQPDIRCPIPRRQNDLDDADRSM 285
Query: 216 ---------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNA 260
+LL G + +++ E + V +KI+ + S++ L +
Sbjct: 286 LIITSAMHKTKHLFFFLLQSEQGDIFKVMLDVEDDIVVRIKIKYFDTVPVCSSLCVLKSG 345
Query: 261 VVYIGSSYGDSQLIKL-NLQPDAKGSY---------------------VEVLERYVNLGP 298
+++ S +G+ L ++ +L D++ + + +++ +L P
Sbjct: 346 LLFCSSEFGNHYLYQIAHLGDDSEREFHSDYPLEEGETFFFGPRPLKNLILVDEMESLSP 405
Query: 299 IVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSST 357
I+ + DL + Q+ G SLR++R+G+ + E A EL G +W+++ +
Sbjct: 406 IMSCQIADLANEDTPQLYAACGRGPRSSLRVLRHGLEVTEMAVSELPGNPHAVWTVKKDS 465
Query: 358 DDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSV 417
+ FD+++VVSF++ T IL++ + +EE GF T TL C + L+Q+ +
Sbjct: 466 KEDFDSYIVVSFMNATLILSIG--ETVEEVTDSGFLGTTPTLACSQLGDDALIQIYPEGI 523
Query: 418 RLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD-GILTE-VKHAQ 475
R + S R NEW+SP + N QV++A GG +VY E+ G L E + +
Sbjct: 524 RHIRSDKRV--NEWRSPGKRLIRQCAVNEHQVVIALSGGEIVYFEMDQSGQLNEYTERKE 581
Query: 476 LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPRSVL 534
+ E+ C+ + + + AVG+ +D +VRI SL P L +P S+
Sbjct: 582 MTAEVICISLGSVPPGQQRCRFLAVGL-SDQTVRIISLDPHDTLQPLSMQALPALPESLC 640
Query: 535 LCAFEG-------IS----YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTF 583
+ G +S +L L +G LL L+ TG+L+D + LGT+P+ L
Sbjct: 641 IVNMSGNVSDDTTVSTGGLFLNIGLQNGVLLRTALDNVTGDLSDTRTRYLGTRPVKLFRV 700
Query: 584 SSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTI 643
+ T V A S R + Y+ + + ++ + + F S P+ + L I
Sbjct: 701 RIQGTEGVIAVSSRTWLNYTYQSRFHLTPLSYDLLEYASSFTSEQCPEGMVAISSNTLRI 760
Query: 644 GTIDDIQKLHIR-SIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMH 692
++ + + + S PL PR++ + + + E +++H
Sbjct: 761 LALEKLGVVFNQVSTPLQYTPRKLLIHPPMSNLVLIETDHNAFTEATKLH 810
Score = 80.1 bits (196), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 81/345 (23%), Positives = 157/345 (45%), Gaps = 44/345 (12%)
Query: 714 EYGCSILSCSFSDDSNV--YYCVGTAY-VLPEENEPTKGRILVF-IVEDG-KLQLIAEKE 768
E S+ C F ++ + VGTA ++ + G ++VF + DG KL+ + +
Sbjct: 882 EAAFSLAVCQFVSKGDLEWFVVVGTAKDMIITPRAISSGSLIVFRLSPDGSKLEHVHTTQ 941
Query: 769 TKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSEC-GHHGHILALYVQTRGDF 827
++ F G+LL + + +++Y D G +++ +C H L + ++ G
Sbjct: 942 LDDVPIAMAPFQGRLLVGVGKLLRIY-----DIGKKKMLRKCENKHLPYLVVDIKVMGRR 996
Query: 828 IVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKN 887
+ VGD+ +++ L Y+ E + A + + + ILD + A+ N+ +R
Sbjct: 997 VYVGDVQEAVHFLYYRPHENQLVIFADEVVPRFCTTSCILDYNTVASADKFGNITILRLP 1056
Query: 888 SEGATDE------------ERGRLE-------VVGEYHLGEFVNRFRHGSLVMRLPDSDV 928
S+ TD+ +RG L V+ YH+GE +N SL+
Sbjct: 1057 SD-VTDQVDEDPSGSRSLWDRGFLNGATQKANVMTSYHVGEGINTLHKVSLI-------P 1108
Query: 929 GQIPTVIFGTVNGVIGVIASLP-HEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKK 987
G +++ T++G IG++ E F + L+ ++R + G +H +RS+
Sbjct: 1109 GGSEVLVYTTLSGSIGILVPFSSKEDSDFFQHLEMHMRSEWSNLVGRDHLSFRSY----- 1163
Query: 988 TVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
V K+ +DGDL E + L ++ EI+ ++ S E+ K++E+L
Sbjct: 1164 YVPVKSVIDGDLCEVYNSLDPSKRREIALDLDRSPSEVAKKLEDL 1208
>gi|432863837|ref|XP_004070177.1| PREDICTED: splicing factor 3B subunit 3-like [Oryzias latipes]
Length = 1146
Score = 154 bits (388), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 207/948 (21%), Positives = 382/948 (40%), Gaps = 157/948 (16%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+L + ++G + +L FR G +D++ + ++ + +L++ S + + + + G
Sbjct: 53 LLTMEVFGIVRSLMAFRLTGGTKDYIVVGSDSGRIVILEYHP-SKNMFEKIHQETFGKSG 111
Query: 61 -RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
R GQ +DP R +IG L ++ D +L + + + L +
Sbjct: 112 CRRIVPGQYLAVDPKGRAVMIGAIEKQKLVYILNRDAAARLTISSPLEAHKANTLVYHVV 171
Query: 118 ---YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNG 163
G P L D ++A + T E A + F E N+ L+
Sbjct: 172 GVDVGFENPMFACLEMDYEEADNDPTGEAAANTQQTLTFYELDLGLNHVVRKYSEALEEH 231
Query: 164 ADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR--- 215
+ LI VP P GVLI E I Y N IR I + +D D R
Sbjct: 232 GNFLITVPGGSDGP--SGVLICSENYITY--KNFGDQPDIRCPIPRRRNDLD-DPERGMI 286
Query: 216 --------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAV 261
+L G + + + ++E VT ++++ +A+ + L
Sbjct: 287 FVCSATHKTKSMFFFLAQTEQGDIFKVTLETDEEMVTEIRLKYFDTIPVATAMCVLKTGF 346
Query: 262 VYIGSSYGDSQLIKLN------------------------LQPDAKGSYVEVLERYVNLG 297
+++ S +G+ L ++ QP + V V E+ N+
Sbjct: 347 LFVASEFGNHYLYQIAHLGDDDEEPEFSSAMPLEEGDTFFFQPRPLKNLVLVDEQE-NMS 405
Query: 298 PIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSS 356
PI+ + DL + Q+ G +LR++R+G+ ++E A EL G +W++R
Sbjct: 406 PIMSCQIADLANEDTPQLYVACGRGPKSTLRVLRHGLEVSEMAVSELPGNPNAVWTVRRH 465
Query: 357 TDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGS 416
+D FD +++VSF++ T +L++ + +EE GF T TL C + LVQV
Sbjct: 466 VEDEFDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSLLGEDALVQVYPDG 523
Query: 417 VRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKHA 474
+R + + R NEWK+P ++ N QV++A GG LVY E+ G L E +
Sbjct: 524 IRHIRADKR--VNEWKTPGKKTIVRCAVNQRQVVIALTGGELVYFEMDPSGQLNEYTERK 581
Query: 475 QLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPD---LNLITKEHLGGEIIPR 531
++ ++ C+ + + S+ AVG+ D +VRI SL L ++ + L + P
Sbjct: 582 EMSADVVCMSLANVPPGEQRSRFLAVGL-VDNTVRIISLDPSDCLQPLSMQALPAQ--PE 638
Query: 532 SVLLCAFEGIS--------------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQP 577
S+ + G+ YL L +G LL +L+ TG+L+D + LG++P
Sbjct: 639 SLCIVEMGGVEKQDELGEKGTMGFLYLNIGLQNGVLLRTVLDPVTGDLSDTRTRYLGSRP 698
Query: 578 ITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAK 637
+ L + V A S R + YS + + ++ + + + F S P+ +
Sbjct: 699 VKLFRVRMQGQEAVLAMSSRSWLSYSYQSRFHLTPLSYETLEYASGFASEQCPEGIVAIS 758
Query: 638 EGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQE-----------QSRTFAICSLKNQS 684
L I ++ + + + + PL PR+ + H E + T A + + Q
Sbjct: 759 TNTLRILALEKLGAVFNQVAFPLQYTPRKFVIHPETNNLVLIETDHNAYTEATKAQRKQQ 818
Query: 685 CAEE-------------SEM-------------------------HFVRLLDDQTFEFIS 706
AEE +EM VRL++ +
Sbjct: 819 MAEEMVEAAGEDERELAAEMAAAFLNENLPESIFGAPKAGAGQWASLVRLVNPIQGTTLD 878
Query: 707 TYPLDTFEYGCSILSCSFSDDSNVYYC-VGTAY-VLPEENEPTKGRILVFIVEDG--KLQ 762
L+ E S+ C F + + +Y VG A ++ + G I + + G KL+
Sbjct: 879 QVQLEQNEAAFSVAVCRFLNTGDDWYVLVGVARDMILNPRSVSGGFIYTYRLTGGGEKLE 938
Query: 763 LIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSEC 810
+ + + ++ F G++L + + +++Y D G ++L +C
Sbjct: 939 FVHKTPVEDVPLAIAPFQGRVLIGVGKLLRIY-----DLGKKKLLRKC 981
Score = 56.2 bits (134), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 34/136 (25%), Positives = 70/136 (51%), Gaps = 13/136 (9%)
Query: 898 RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLF 956
+ EVV YH+GE V + +L+ G ++++ T++G IG++ HE + F
Sbjct: 1017 KAEVVVNYHVGETVLSLQKTTLI-------PGGSESLVYTTLSGGIGILVPFTSHEDHDF 1069
Query: 957 LEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISK 1016
+ L+ ++R + G +H +RS+ KN +DGDL E F + + +++
Sbjct: 1070 FQHLEMHMRSEFPPICGRDHLSFRSY-----YFPVKNVIDGDLCEQFNSMDTHKQKSVAE 1124
Query: 1017 TMNVSVEELCKRVEEL 1032
++ + E+ K++E++
Sbjct: 1125 ELDRTPPEVSKKLEDI 1140
>gi|444722328|gb|ELW63026.1| Splicing factor 3B subunit 3 [Tupaia chinensis]
Length = 840
Score = 153 bits (387), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 196/905 (21%), Positives = 367/905 (40%), Gaps = 162/905 (17%)
Query: 216 YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIK 275
+L G + + + +++ VT ++++ +A+ + L +++ S +G+ L +
Sbjct: 4 FLAQTEQGDIFKITLETDEDMVTEIRLKYFDTVPVAAAMCVLKTGFLFVASEFGNHYLYQ 63
Query: 276 L----NLQPDAKGSYVEVLER----YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSL 327
+ + + + S LE + P+ + +VD E ++ C
Sbjct: 64 IAHLGDDDEEPEFSSAMPLEEGDTFFFQPRPLKNLVLVD-ELDSLSPILFCQ-------- 114
Query: 328 RIVRNGIGINEQASVELQG-IKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEE 386
++E A EL G +W++R +D FD +++VSF++ T +L++ + +EE
Sbjct: 115 --------VSEMAVSELPGNPNAVWTVRRHIEDEFDAYIIVSFVNATLVLSIG--ETVEE 164
Query: 387 TEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANA 446
GF T TL C + LVQV +R + + R NEWK+P ++ N
Sbjct: 165 VTDSGFLGTTPTLSCSLLGDDALVQVYPDGIRHIRADKR--VNEWKTPGKKTIVKCAVNQ 222
Query: 447 SQVLLATGGGHLVYLEIG-DGILTE-VKHAQLEYEISCLDINPIGENPSYSQIAAVGMWT 504
QV++A GG LVY E+ G L E + ++ ++ C+ + + S+ AVG+
Sbjct: 223 RQVVIALTGGELVYFEMDPSGQLNEYTERKEMSADVVCMSLANVPPGEQRSRFLAVGL-V 281
Query: 505 DISVRIFSLPDLNLITKEHLGGEIIP-RSVLLCAFE----------------GISYLLCA 547
D +VRI SL + + + L + +P + LC E G YL
Sbjct: 282 DNTVRIISLDPSDCL--QPLSMQALPAQPESLCIVEMGGTEKQDELGERGSIGFLYLNIG 339
Query: 548 LGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKK 607
L +G LL +L+ TG+L+D + LG++P+ L + V A S R + YS +
Sbjct: 340 LQNGVLLRTVLDPVTGDLSDTRTRYLGSRPVKLFRVRMQGQEAVLAMSSRSWLSYSYQSR 399
Query: 608 LLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIR-SIPLGEHPRR- 665
+ ++ + + F S P+ + L I ++ + + + + PL PR+
Sbjct: 400 FHLTPLSYETLEFASGFASEQCPEGIVAISTNTLRILALEKLGAVFNQVAFPLQYTPRKF 459
Query: 666 ICHQEQSR-----------TFAICSLKNQSCAEE-------------SEM---------- 691
+ H E + T A + + Q AEE +EM
Sbjct: 460 VIHPESNNLIIIETDHNAYTEATKAQRKQQMAEEMVEAAGEDERELAAEMAAAFLNENLP 519
Query: 692 ---------------HFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC-VG 735
+R+++ + L+ E S+ C FS+ +Y VG
Sbjct: 520 ESIFGAPKAGNGQWASVIRVMNPIQGNTLDLVQLEQNEAAFSVAVCRFSNTGEDWYVLVG 579
Query: 736 TAYVLPEENEPTKGRILVF--IVEDG-KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQ 792
A L G + +V +G KL+ + + + ++ F G++L + + ++
Sbjct: 580 VAKDLILNPRSVAGGFVYTYKLVNNGEKLEFLHKTPVEEVPAAIAPFQGRVLIGVGKLLR 639
Query: 793 LYKWMLRDDGTRELQSECGHHGHILALY---VQTRGDFIVVGDLMKSISLLIYKHEEGAI 849
+Y D G ++L +C + HI A Y +QT G ++V D+ +S + YK E +
Sbjct: 640 VY-----DLGKKKLLRKC-ENKHI-ANYISGIQTIGHRVIVSDVQESFIWVRYKRNENQL 692
Query: 850 EERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR--KNSEGATDEERGRLEVVGEYHL 907
A D W++ +LD D GA+ N+ VR N+ DE+
Sbjct: 693 IIFADDTYPRWVTTASLLDYDTVAGADKFGNICVVRLPPNTNDEVDED------------ 740
Query: 908 GEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKV 967
G G +NG + + F + ++ +LR
Sbjct: 741 -------------------PTGNKALWDRGLLNGA-------SQKDHDFFQHVEMHLRSE 774
Query: 968 IKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCK 1027
+ G +H +RS+ KN +DGDL E F + + +S+ ++ + E+ K
Sbjct: 775 HPPLCGRDHLSFRSY-----YFPVKNVIDGDLCEQFNSMEPNKQKNVSEELDRTPPEVSK 829
Query: 1028 RVEEL 1032
++E++
Sbjct: 830 KLEDI 834
>gi|298713790|emb|CBJ27162.1| spliceosomal protein sap, putative [Ectocarpus siliculosus]
Length = 1256
Score = 152 bits (385), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 168/750 (22%), Positives = 306/750 (40%), Gaps = 92/750 (12%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+L P + + +L FR +D+ + T+ K +L++ E+SEL M
Sbjct: 54 ILQQPAFCVVRSLIAFRLTSTHKDYCVVGTDAGKVTILEYLPETSELKAVHMETFGKTGC 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL--- 117
R GQ DP R I + + V + + A + LE + I F
Sbjct: 114 RRIVPGQYLAADPQGRAIMISAVEKQKLVYVLNRDTASRLAISSPLEAHKSHAIIFHTCG 173
Query: 118 --YGCAKPTIVVLYQDNKDARHVKTYEVALKDK------------DFVEGPWSQNNLDNG 163
G P V+ D +A T E A + + V WS + + G
Sbjct: 174 VDVGFDNPIFAVIELDYSEADQDSTGEEAANTEKQLVYYQLDLGLNHVTRLWS-DPISRG 232
Query: 164 ADLLIPVP---PPLCGVLIIGEETIVYCSAN---AFKAIPIRPSITKAYGRVDADGSRYL 217
A LL+PVP GVL+ GE + Y + +P R ++ G + G+
Sbjct: 233 ASLLLPVPGGEAGPSGVLVCGENWVAYKHQDHPEVRAPLPRRKTMPTGRGLLVTSGT--- 289
Query: 218 LGDHAGLL-HL------------LVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYI 264
L GL HL LV+ E+ +VT + IE+ +A+++ ++
Sbjct: 290 LHQQPGLFFHLIQSELGDLYKVSLVVNDEQTEVTDISIEVFDTIPVANSLQITKKGKLFC 349
Query: 265 GSSYGDSQLIKLNLQPDAKGSYVEV--------------------------------LER 292
S + + + + +G+ V L+
Sbjct: 350 ASEFSNHMMFHFHKMGGEEGAVVATQCADPELDDSSESAAQVAPTFTPSATLKNLWGLDE 409
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMW 351
+L P+ D V DL R+ Q+ T G +LRI+R+G+ + E A +L G+ ++
Sbjct: 410 VESLAPLTDMLVGDLAREDTPQMYTLCGRGNRSTLRILRHGLAVTELAENDLPGVPSAVF 469
Query: 352 SLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQ 411
+ + D +D ++V+SF + T +L ++D +EE E GF + + TL N+++Q
Sbjct: 470 AFKERLTDDYDRYIVLSFTNATMVL--EVQDSVEEVENSGFLATSSTLDVKLMANNKILQ 527
Query: 412 VTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGI--LT 469
V S +R++ + EW++P + A AN QV+LA GG +++ E+ + + +
Sbjct: 528 VYSHGLRVI--MKGQPPQEWRAPGKKQIEKACANERQVVLALAGGEIIFFELDEAMQNIQ 585
Query: 470 EVKHAQL-EYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEI 528
EV QL EI+CL++ + ++ + AVG W +V+I L NL+ + +
Sbjct: 586 EVGTKQLGGVEIACLEMGEVPKDRVRAPFLAVGDWNG-NVKILGLSPENLLEQVAMINLP 644
Query: 529 IPRSVLLCAFEGIS----------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPI 578
P L A +L L G ++ G L+D ++ LG++P+
Sbjct: 645 HPAESLCLAHMAAEQAAGGSNEQLFLYIGLASGVCQRVAVDATAGTLSDPRQRFLGSKPV 704
Query: 579 TLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKE 638
L ++ V A S R + Y+ + + ++ + F++ P+ +
Sbjct: 705 KLFRIQVQDKRGVLALSSRSWLSYNYQGRYQMTPLSYDPLDFAAEFSTEMCPEGVVAVSG 764
Query: 639 GELTIGTIDDI-QKLHIRSIPLGEHPRRIC 667
+L I ++ + + + ++PL PR+I
Sbjct: 765 EKLRIFGVERLGEVFNQAALPLKYTPRQIA 794
Score = 92.8 bits (229), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 82/359 (22%), Positives = 158/359 (44%), Gaps = 38/359 (10%)
Query: 694 VRLLDDQTFEFISTYPLDTFEYGCSILSCSFSD-DSNVYYCVGTAYVLPEENEPTKGRIL 752
+RLLD + LD E S+ +F + + + VGTA L +G +
Sbjct: 909 IRLLDPVEGTTVECLDLDDNEAALSVAPVAFHNRNGEAFVAVGTAKSLTFHPRGHEGCFV 968
Query: 753 -VFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSEC- 810
V+ + + +L L+ + E ++ F G+LL + Q +++Y D G ++L +C
Sbjct: 969 HVYRILENRLVLLHKTEVPDVPLAMKEFQGRLLVGVGQSLRMY-----DLGRKKLLRKCE 1023
Query: 811 GHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDD 870
+ + + GD + GD M+S Y+ E + E A D +M+ +LD D
Sbjct: 1024 NKRMPSMVVSLTVTGDRVFAGDQMESCHCFKYRRAENRLVEFADDQVPRFMTKTCLLDYD 1083
Query: 871 IYLGAENNFNLFTVRKNSEGATDEER-----------------GRLEVVGEYHLGEFVNR 913
GA+ N+F +R + + D + +++ ++H+GE V+
Sbjct: 1084 SIAGADKFGNIFVLRVPLDVSDDVDNPTGNRLLWDSGHLSGAPNKVQQQLQFHVGEVVSS 1143
Query: 914 FRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYL-FLEKLQTNLRKVIKGVG 972
R +LV G +++ T+NG IG + + F ++ +R+ +
Sbjct: 1144 LRRTTLV-------PGGAEVLLYSTINGSIGALLPFKSRDDVDFFTHMEMYMRQEKPTLC 1196
Query: 973 GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEE 1031
G +H +RS+ + AK+ +DGDL E F L + ++ ++ +V E+ K++E+
Sbjct: 1197 GRDHISYRSY-----YLPAKDVIDGDLCEQFSSLPFEKQKLVANGLDRTVGEVVKKLED 1250
>gi|406700450|gb|EKD03620.1| hypothetical protein A1Q2_02097 [Trichosporon asahii var. asahii CBS
8904]
Length = 1119
Score = 152 bits (384), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 211/874 (24%), Positives = 345/874 (39%), Gaps = 158/874 (18%)
Query: 217 LLGDHAGLLHLLVITHEKEKVTGLKIEL--LGETSIASTISYLDNAVVYIGSSYGDSQLI 274
LLGD G L + E E + +++ + +G S S+++YLDN ++ S+ GDS L+
Sbjct: 291 LLGDEYG--RLTAVGWEFEDMEKVQVGMIDMGVVSAPSSLTYLDNGYLFSASACGDSLLV 348
Query: 275 KLNL-------QPDAKG----------------------SYVEVLERYVNLGPIVDFCVV 305
L L QP KG V+V ER++N+ P DF +V
Sbjct: 349 YLVLPSPNSTRQPSGKGKEKAHDISDAGAYEIVTAEPQHGRVDVRERWMNIAPAKDFAIV 408
Query: 306 DLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVE-LQGIKGMWSLRSSTDDPFDTF 364
+ VV SG+ S R+VR+G+G ++E + GI+ MW++ ++ D P
Sbjct: 409 KEDDGRVSHVVVASGSASSNSFRVVRSGVGFENLMTIEEIPGIERMWTIPAA-DGPS--- 464
Query: 365 LVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTS 424
L+VSF T IL LE E+ + + T + L+QVT +RL S +
Sbjct: 465 LLVSFAYSTTIL--QLEPEVSVFKAADQVTAVPTFAAGLVDKSLLLQVTPEGIRLWSDLA 522
Query: 425 RELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLD 484
+ NV ++++ A G++ DG +T + +
Sbjct: 523 AGT---------LAGNVDAPEDNRIVTANVRGNIAVAAFRDGTVTLFRASSQ-------- 565
Query: 485 INPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEGI--- 541
G A+ W ++ L+ E + +S+ A+
Sbjct: 566 ----GLQEVVKGFLAIADWCG---------EIELVAVEGADQGTVLKSIREAAYATSLQF 612
Query: 542 -------SYLLCALGDGHLLNFLLNMKTGELTD-RKKVSLGTQPITLRTF--SSKNTTHV 591
+ +L L DG +++ + + E +D R SLG +P+ L T S HV
Sbjct: 613 QESNNEPTRMLAGLSDGTFVSYSVKLNGAECSDNRHASSLGLRPLRLITLDISPNAEEHV 672
Query: 592 FAA--SDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDI 649
AA SDR ++++ S +S+ K + + +
Sbjct: 673 VAAGISDRLSLVFESRDHYEFSSSGKKGI-----------------------VFERLTSL 709
Query: 650 QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEES----EMHFVRLLDDQTFEFI 705
+KL ++++ LG + +S ++ + V L + T E +
Sbjct: 710 KKLQVQTLDLGNRSATRVAALPGYNLVVAGTVTRSMDHQTGDVLQSSSVELRNATTLELL 769
Query: 706 STYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENE----------------PTKG 749
S + L E S+ + + Y VGTA ENE +G
Sbjct: 770 SEFQLPEREAVASVNAVTL--HGRKYILVGTAIF---ENEDALEDATLEDVTSFIATNRG 824
Query: 750 RILVFIVEDG---KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTREL 806
R+L+F + + L L+ G VY +G L A + K+ + + + E
Sbjct: 825 RLLLFQINESAGPSLDLVTSMTFNGPVYDTVVIHGFLAVATSTKVSILRLTTQPPSLEEA 884
Query: 807 QS-ECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANW---MS 862
S H LA+ + +VVGD M+SI +L E G I RD NA+ +S
Sbjct: 885 ASFAFAFETHHLAVVEIDKEKRLVVGDAMRSIIVLSVDPESGDIVGDQRDMNAHLVRCLS 944
Query: 863 AVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMR 922
AV ++ + + A+N NL T R +G T L E V R + G+L
Sbjct: 945 AVHDVEPGVMI-ADNYANLLTFRLG-KGITP--------AASIGLSEDVTRLQPGTLA-- 992
Query: 923 LPDSDVGQI--PTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWR 980
P S G I ++ TVNG +GVI L L+ LQ N+ K+ KG GGL +W+
Sbjct: 993 -PVSAEGDILRADLLCTTVNGRLGVIGELGKGSIRTLDDLQRNMNKLYKGPGGL---EWK 1048
Query: 981 SFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEI 1014
N + F+DGD ++ F L + D+I
Sbjct: 1049 ESGNMLVPRETVGFIDGDFVQRFSSLDSSLQDKI 1082
>gi|42409127|dbj|BAD10377.1| putative splicing factor 3b, subunit 3, 130kDa [Oryza sativa
Japonica Group]
gi|42409258|dbj|BAD10521.1| putative splicing factor 3b, subunit 3, 130kDa [Oryza sativa
Japonica Group]
gi|125538000|gb|EAY84395.1| hypothetical protein OsI_05771 [Oryza sativa Indica Group]
Length = 1234
Score = 150 bits (379), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 179/765 (23%), Positives = 312/765 (40%), Gaps = 83/765 (10%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+L V ++G I +L FR G +D+L + ++ + +L++ + + L
Sbjct: 68 LLSVDVFGAIRSLAQFRLTGATKDYLVVGSDSGRLVILEYSPDRNRLDKVHQETFGKSGC 127
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDG--LFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLY 118
R GQ+ +DP R + + + L V+ D +L + + + L
Sbjct: 128 RRIVPGQLLAVDPKGRALCIAALEKQKLVYVLNRDAAARLTISSPLEAHKSNTLTFSLTA 187
Query: 119 ---GCAKPTIVVL---YQDNKDARHVKTYEVALKDKDFVEGPWSQNN--------LDNGA 164
G P + Y ++ + E A K F E N+ +DNGA
Sbjct: 188 LDCGFDNPVFAAIELEYAESDRDPTGQAAEQAQKHLTFYELDLGLNHVSRKASEPIDNGA 247
Query: 165 DLLIPVP-----PPLCGVLIIGEETIVYCS---ANAFKAIPIRPSITKAYGR--VDADGS 214
+LL+ VP P G+L+ + ++Y + IP R + G V A
Sbjct: 248 NLLVTVPGGGDGP--SGLLVCCDNFVLYRNQGHPEVRAVIPRRADLPAERGVLIVAAATH 305
Query: 215 R------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSY 268
R +LL G + + + H + VT L+I+ + S I L + ++ S +
Sbjct: 306 RQKSLFFFLLQTEYGDIFKVDLEHSNDTVTELRIKYFDTIPVTSAICVLRSGFLFAASEF 365
Query: 269 GDSQLIKLN--------------------------LQPDAKGSYVEVLERYVNLGPIVDF 302
G+ L + QP A + + E +L PI+D
Sbjct: 366 GNHALYQFRDIGRDVDVESSSATLMETDEGFQPVFFQPRALKNLYRIDE-IESLMPIMDM 424
Query: 303 CVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIK-GMWSLRSSTDDPF 361
V +L + QV T G +LRI+R G+ I+E A L +W+++ + +D F
Sbjct: 425 RVANLFDEETPQVFTACGRGPRSTLRILRPGLAISEMARSMLPAEPIAVWTVKKNINDMF 484
Query: 362 DTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVS 421
D ++VVSF + T +L++ + +EE F T +L + L+QV +R +
Sbjct: 485 DAYIVVSFANVTLVLSIG--ETIEEVSDSQFLDTTHSLAVSLLGEDSLMQVHPNGIRHIR 542
Query: 422 STSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTEVKHAQLEYEI 480
R NEW++P ++ +N QV++A GG L+Y E+ G L EV+ + ++
Sbjct: 543 EDGRV--NEWRTPGKKTITKVGSNRLQVVIALSGGELIYFEMDMTGQLMEVEKQDMSGDV 600
Query: 481 SCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPRSVLLCAFE 539
+CL I P+ E S+ AVG + D ++RI S+ PD L P S++ +
Sbjct: 601 ACLAIAPVPEGRQRSRFLAVGSF-DNTIRILSVDPDDCLQPLSVQSVSSAPESLMFLEVQ 659
Query: 540 GIS------------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKN 587
+L L +G L ++M TG+L+D + LG +P L +
Sbjct: 660 ASVGGEDGADHPANLFLNAGLQNGVLFRTNVDMVTGQLSDTRSRFLGLRPPKLFPCIVSH 719
Query: 588 TTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTID 647
+ S RP + Y L + ++ + F+S + + L I TI+
Sbjct: 720 RQAMLCLSSRPWLGYIHQGHFLLTPLSCDTLESAASFSSDQCSEGVVAVAGDALRIFTIE 779
Query: 648 DI-QKLHIRSIPLGEHPRRICHQEQSRTFAIC-SLKNQSCAEESE 690
+ + + +IPL PR+ + + A+ S K AEE E
Sbjct: 780 HLGETFNETAIPLRYTPRKFVILPKKKYLAVIESDKGALSAEERE 824
Score = 93.2 bits (230), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 91/365 (24%), Positives = 166/365 (45%), Gaps = 44/365 (12%)
Query: 694 VRLLDDQTFEFISTYPLDTFEYGCSILSCSFSD-DSNVYYCVGTAYVL---PEENEPTKG 749
+R+LD ++ + L E SI + +F D + VGTA L P+ N + G
Sbjct: 883 IRILDPKSRDTTCLLELQDNEAAVSICTVNFHDKEHGTLLAVGTAKGLQFWPKRNL-SAG 941
Query: 750 RILVF-IVEDGK-LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQ 807
I ++ V++G+ L+L+ + + + +L F G+LLA + ++LY D G R+L
Sbjct: 942 FIHIYKFVDEGRSLELLHKTQVEEVPLALCQFQGRLLAGVGSVLRLY-----DLGKRKLL 996
Query: 808 SECGHHGHILALY-VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEI 866
+C + + + T D I VGD+ +S Y+ +E + A D W++A
Sbjct: 997 RKCENKLFPRTIVSIHTYRDRIYVGDMQESFHYCKYRRDENQLYIFADDSVPRWLTAANH 1056
Query: 867 LDDDIYLGAENNFNLFTVR------------------KNSEGATDEERGRLEVVGEYHLG 908
+D D GA+ N++ R K +G + ++E + ++H+G
Sbjct: 1057 IDFDTMAGADKFGNIYFARLPQDLSDEIEEDPTGGKIKWEQGKLNGAPNKVEEIVQFHVG 1116
Query: 909 EFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIG-VIASLPHEQYLFLEKLQTNLRKV 967
+ V + SL+ G +I+GTV G +G ++A E F L+ +LR+
Sbjct: 1117 DVVTCLQKASLI-------PGGGECLIYGTVMGSVGALLAFTSREDVDFFSHLEMHLRQE 1169
Query: 968 IKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCK 1027
+ G +H +RS K+ +DGDL E F L +I+ ++ + E+ K
Sbjct: 1170 HPPLCGRDHMAYRS-----AYFPVKDVIDGDLCEQFPSLPADMQRKIADELDRTPGEILK 1224
Query: 1028 RVEEL 1032
++E++
Sbjct: 1225 KLEDI 1229
>gi|254572247|ref|XP_002493233.1| Protein involved in pre-mRNA splicing [Komagataella pastoris GS115]
gi|238033031|emb|CAY71054.1| Protein involved in pre-mRNA splicing [Komagataella pastoris GS115]
gi|328352752|emb|CCA39150.1| Pre-mRNA-splicing factor rse1 [Komagataella pastoris CBS 7435]
Length = 1179
Score = 150 bits (379), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 203/797 (25%), Positives = 352/797 (44%), Gaps = 86/797 (10%)
Query: 296 LGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIK-GMWSLR 354
L PI D L Q + Q+V SG SL+++++ + INE AS EL G +++ +
Sbjct: 409 LNPITD----GLLDQEKSQLVLTSGGSSRSSLKLLKHELAINELASSELPGSALNIFTTK 464
Query: 355 SSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTS 414
S D +D ++V+SFI T +L++ + ++E G Q T+ N L QV S
Sbjct: 465 VSHKDQYDKYMVISFIDGTLVLSIG--ESVDEVSDSGLELQVSTIAVQQVGRNSLAQVHS 522
Query: 415 -GSVRL--VSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG--DGILT 469
G V L ++ S E ++ W+ P G + VA+ SQ+ L LVY EI D ++
Sbjct: 523 HGIVHLKGLNEPSFE-KSLWQPPVGAEILVASMTNSQIALGLSNRELVYFEIDEYDQLIE 581
Query: 470 EVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PD--LNLITKEHLGG 526
+ ++ IS L + I + S AVG D++++I S P+ L +++ + L
Sbjct: 582 HKERKEMSGRISALSLGQIPDGRLRSPFLAVG-CNDMTIKILSTDPNSCLEVLSLQALSS 640
Query: 527 EIIPRSVLLCAFE-GISYLL-CALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFS 584
IP + + E G Y + L G + LLN +G+L+D + LG + ++L
Sbjct: 641 --IPSDLQITNLEQGAGYFVHIGLDSGIYIRTLLNRISGQLSDTRVNYLGPKRVSLSPVK 698
Query: 585 SKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGE-LTI 643
V A SD + Y+S + + + ++ C F S P++ + E LTI
Sbjct: 699 IFGKNLVLAFSDTSYLSYNSGFDAKITPLCDGKFTYGCGFISEECPENGVVGVSAENLTI 758
Query: 644 GTIDDIQK-LHIRSIPLGEHPRRIC-----------HQEQSRTFAICSLKNQSCAE---- 687
TID + ++++ IPL PRRI Q ++ + S+ + E
Sbjct: 759 FTIDQLDTDMNVKDIPLTFTPRRITLVSEDVVVVTESQHYTKNPYLNSVATEESRELYQQ 818
Query: 688 ---ESEMHF---VRLLDDQTFEFISTYPLDT--FEYGCSILSCSFSDDSNVYYCVGTAY- 738
E H+ + +D Q+ + I LD +G + +S + D VY+ V +
Sbjct: 819 IGYEKSSHWSSVIETVDIQSQQVIQAINLDKELAIFGSTKVSFASHPD-EVYFIVSCSVE 877
Query: 739 --VLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKW 796
+LP N+ T+ R I E+G LQ + ET VYSL F GKLLA + + LY
Sbjct: 878 QELLPNINKGTQLRTYK-ITEEG-LQFLHSTETDQLVYSLTEFQGKLLAGVGNLLVLYDL 935
Query: 797 MLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDY 856
L+ + L H + +G IV+ D+ +S L + EE D
Sbjct: 936 GLKKLLKKSLTQTSFHQ----ITSIDYQGFRIVISDMKQSSMFLSFNAEENRFISLCDDI 991
Query: 857 NANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDEERGRLEVVGE--------- 904
++ ++LD + + + +R K ++ D + +++GE
Sbjct: 992 TQRHITCSKMLDYSTVVTGDRFGTISVLRCPEKVNKDHLDYIEAKSKMIGECPFKLKQLA 1051
Query: 905 -YHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASL--PHEQYLFL---E 958
Y++ + V GSL VG ++I+ + G IG+++ + P + F+ +
Sbjct: 1052 SYYVQDVVTSLSRGSLT-------VGGKQSIIYTGLQGTIGILSPIQTPSDIDFFVSLED 1104
Query: 959 KLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTM 1018
L+ L K + G +H ++R + V A+ +DGDLI+ F L + IS +
Sbjct: 1105 SLRQELLKSKHLLTGRSHLKYRGY-----YVPAQGVVDGDLIQYFYQLPDSTKQFISTKL 1159
Query: 1019 NVSVEELCKRVEELTRL 1035
+ S E+ +++ + L
Sbjct: 1160 DRSTLEIERKISIMRSL 1176
>gi|74190887|dbj|BAE28225.1| unnamed protein product [Mus musculus]
Length = 937
Score = 149 bits (377), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 178/788 (22%), Positives = 327/788 (41%), Gaps = 100/788 (12%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+L V ++G I +L FR G +D++ + ++ + +L++ S + + + + G
Sbjct: 53 LLTVEVFGVIRSLMAFRLTGGTKDYIVVGSDSGRIVILEYQP-SKNMFEKIHQETFGKSG 111
Query: 61 -RPTDNGQIGIIDPDCRLIGLHLYDG--LFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
R GQ +DP R + + + L ++ D +L + + + L +
Sbjct: 112 CRRIVPGQFLAVDPKGRAVMISAIEKQKLVYILNRDAAARLTISSPLEAHKANTLVYHVV 171
Query: 118 ---YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNG 163
G P L D ++A + T E A + F E N+ L+
Sbjct: 172 GVDVGFENPMFACLEMDYEEADNDPTGEAAANTQQTLTFYELDLGLNHVVRKYSEPLEEH 231
Query: 164 ADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR--- 215
+ LI VP P GVLI E I Y N IR I + +D D R
Sbjct: 232 GNFLITVPGGSDGP--SGVLICSENYITY--KNFGDQPDIRCPIPRRRNDLD-DPERGMI 286
Query: 216 --------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAV 261
+L G + + + +++ VT ++I+ +A+ + L
Sbjct: 287 FVCSATHKTKSMFFFLAQTEQGDIFKITLETDEDMVTEIRIKYFDTVPVAAAMCVLKTGF 346
Query: 262 VYIGSSYGDSQLIKLN------------------------LQPDAKGSYVEVLERYVNLG 297
+++ S +G+ L ++ QP + V +++ +L
Sbjct: 347 LFVASEFGNHYLYQIAHLGDDDEEPEFSSAMPLEEGDTFFFQPRPLKNLV-LVDELDSLS 405
Query: 298 PIVDFC-VVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRS 355
PI+ FC + DL + Q+ G SLR++R+G+ ++E A EL G +W++R
Sbjct: 406 PIL-FCQIADLANEDTPQLYVACGRGPRSSLRVLRHGLEVSEMAVSELPGNPNAVWTVRR 464
Query: 356 STDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSG 415
+D FD +++VSF++ T +L++ + +EE GF T TL C + LVQV
Sbjct: 465 HIEDEFDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSLLGDDALVQVYPD 522
Query: 416 SVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKH 473
+R + + R NEWK+P ++ N QV++A GG LVY E+ G L E +
Sbjct: 523 GIRHIRADKR--VNEWKTPGKKTIVKCAVNQRQVVIALTGGELVYFEMDPSGQLNEYTER 580
Query: 474 AQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIP-RS 532
++ ++ C+ + + S+ AVG+ D +VRI SL + + + L + +P +
Sbjct: 581 KEMSADVVCMSLANVPPGEQRSRFLAVGL-VDNTVRIISLDPSDCL--QPLSMQALPAQP 637
Query: 533 VLLCAFE----------------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQ 576
LC E G YL L +G LL +L+ TG+L+D + LG++
Sbjct: 638 ESLCIVEMGGTEKQDELGERGSIGFLYLNIGLQNGVLLRTVLDPVTGDLSDTRTRYLGSR 697
Query: 577 PITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIA 636
P+ L + V A S R + YS + + ++ + + F S P+ +
Sbjct: 698 PVKLFRVRMQGQEAVLAMSSRSWLSYSYQSRFHLTPLSYETLEFASGFASEQCPEGIVAI 757
Query: 637 KEGELTIGTIDDIQKLHIR-SIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVR 695
L I ++ + + + + PL PR+ +S I + + E ++ +
Sbjct: 758 STNTLRILALEKLGAVFNQVAFPLQYTPRKFVIHPESNNLIIIETDHNAYTEATKAQRKQ 817
Query: 696 LLDDQTFE 703
+ ++ E
Sbjct: 818 QMAEEMVE 825
>gi|341886300|gb|EGT42235.1| hypothetical protein CAEBREN_28831 [Caenorhabditis brenneri]
Length = 1005
Score = 148 bits (374), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 170/735 (23%), Positives = 312/735 (42%), Gaps = 86/735 (11%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESS---ELITRAMGDVSDRIGRP 62
I+G + +L FR +DF+ + ++ + +LQ++A+ + L G R P
Sbjct: 59 IFGIVRSLLAFRLTAGTRDFIAVGSDSGRIVILQYNADKTCFERLHQETFGKTGCRRIVP 118
Query: 63 TDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL--- 117
G DP R +IG L ++ D + L + + + L +
Sbjct: 119 ---GHYLAGDPRGRALMIGAVERQKLVYIMNRDAEAHLTISSPLEAHKQHTLCYAMVGVD 175
Query: 118 YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNGADL 166
G PT L D +DA + T E A + + F E N+ L + +L
Sbjct: 176 VGFENPTFACLEFDYEDADNDPTGEAAKRTQQTLTFYELDLGLNHVVRKYAEPLTDPGNL 235
Query: 167 LIPVP---PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR-------- 215
LI VP GV++ E +VY N IR I + +D D R
Sbjct: 236 LIAVPGGNDGPSGVIVCCENYLVY--KNLGDQPDIRCPIPRRRNELD-DADRTMLIIATA 292
Query: 216 ---------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGS 266
+L+ G + + + +++ VT +K++ A+ + L + +++ +
Sbjct: 293 THKTKNMFFFLVQAENGDVFKVTLETDEDLVTEMKLKYFDTVPPANAMCILKSGFLFVAA 352
Query: 267 SYGDSQLIKLN-----------------------LQPDAKGSYVEVLERYVNLGPIVDFC 303
+G+ +L ++ +P S + + + +L P+ D
Sbjct: 353 EFGNHELYQIASLGEGGDDEFSSAMGFGENDAAFFEPHELRSLIPI-DSMDSLSPLTDAV 411
Query: 304 VVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTDDPFD 362
+ D+ R+ Q+ T G +L+++RNG+ I+E A +L G +W+++ + +D +D
Sbjct: 412 IGDVAREDAAQLYTLVGRGARSNLKVLRNGLEISEMAVSDLPGNPNAVWTVKKNIEDQYD 471
Query: 363 TFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSS 422
+++VVSF++ T LA+ + D +EE GF T T+ C + LVQ+ + +R + +
Sbjct: 472 SYIVVSFVNAT--LALTIGDTVEEASDSGFLPTTPTIGCSMIGDDSLVQIYAEGIRHIRA 529
Query: 423 TSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTEVKHAQL-EYEI 480
R NEWK+PP + N QV +A GG LVY E+ +G L E +L +I
Sbjct: 530 DKR--INEWKAPPRRQIVKCCVNRRQVAVALSGGELVYFELDLNGTLNEFTERKLFNADI 587
Query: 481 SCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPRSVLLCAF- 538
+C+ + I E S+ A+G D +VRI SL P+ L+ P S+LL
Sbjct: 588 ACMTFSEISEGELNSRFLALGT-VDNAVRIISLDPNDMLMPLSTQNLPCPPESILLIDTP 646
Query: 539 ----EGIS--YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
+G++ +L L +G L ++ TG + D + LGT+P+ L + + +
Sbjct: 647 NDDGKGVAAVHLNIGLQNGCLFRNTVDNVTGAIMDTRTRYLGTRPVKLFKVQVQGRSAIL 706
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQ-K 651
S R ++Y ++ + ++ + + F S + + L I + +
Sbjct: 707 CTSSRSWLLYHFQRRFHLTPLSYVNLEYAASFCSNQCAEGIVAISSSTLRIIAAEKLGVA 766
Query: 652 LHIRSIPLGEHPRRI 666
+++S PRRI
Sbjct: 767 FNVQSFEQKMTPRRI 781
>gi|341886293|gb|EGT42228.1| CBN-TEG-4 protein [Caenorhabditis brenneri]
Length = 1006
Score = 148 bits (373), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 167/695 (24%), Positives = 299/695 (43%), Gaps = 92/695 (13%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESS---ELITRAMGDVSDRIGRP 62
I+G + +L FR +DF+ + ++ + +LQ++A+ + L G R P
Sbjct: 59 IFGIVRSLLAFRLTAGTRDFIAVGSDSGRIVILQYNADKTCFERLHQETFGKTGCRRIVP 118
Query: 63 TDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL--- 117
G DP R +IG L ++ D + L + + + L +
Sbjct: 119 ---GHYLAGDPRGRALMIGAVERQKLVYIMNRDAEAHLTISSPLEAHKQHTLCYAMVGVD 175
Query: 118 YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNGADL 166
G PT L D +DA + T E A + + F E N+ L + +L
Sbjct: 176 VGFENPTFACLEFDYEDADNDPTGEAAKRTQQTLTFYELDLGLNHVVRKYAEPLTDPGNL 235
Query: 167 LIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR------ 215
LI VP P GV++ E +VY N IR I + +D D R
Sbjct: 236 LIAVPGGNDGP--SGVIVCCENYLVY--KNLGDQPDIRCPIPRRRNELD-DADRTMLIIA 290
Query: 216 -----------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYI 264
+L+ G + + + +++ VT +K++ A+ + L + +++
Sbjct: 291 TATHKTKNMFFFLVQAENGDVFKVTLETDEDLVTEMKLKYFDTVPPANAMCILKSGFLFV 350
Query: 265 GSSYGDSQLIKLN-----------------------LQPDAKGSYVEVLERYVNLGPIVD 301
+ +G+ +L ++ +P S + + + +L P+ D
Sbjct: 351 AAEFGNHELYQIASLGEGGDDEFSSAMGFGENDAAFFEPHELRSLIPI-DSMDSLSPLTD 409
Query: 302 FCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTDDP 360
+ D+ R+ Q+ T G +L+++RNG+ I+E A +L G +W+++ + +D
Sbjct: 410 AVIGDVAREDAAQLYTLVGRGARSNLKVLRNGLEISEMAVSDLPGNPNAVWTVKKNIEDQ 469
Query: 361 FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLV 420
+D+++VVSF++ T LA+ + D +EE GF T T+ C + LVQ+ + +R +
Sbjct: 470 YDSYIVVSFVNAT--LALTIGDTVEEASDSGFLPTTPTIGCSMIGDDSLVQIYAEGIRHI 527
Query: 421 SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTEVKHAQL-EY 478
+ R NEWK+PP + N QV +A GG LVY E+ +G L E +L
Sbjct: 528 RADKR--INEWKAPPRRQIVKCCVNRRQVAVALSGGELVYFELDLNGTLNEFTERKLFNA 585
Query: 479 EISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPRSVLLCA 537
+I+C+ + I E S+ A+G D +VRI SL P+ L+ P S+LL
Sbjct: 586 DIACMTFSEISEGELNSRFLALGT-VDNAVRIISLDPNDMLMPLSTQNLPCPPESILLID 644
Query: 538 F-----EGIS--YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTH 590
+G++ +L L +G L ++ TG + D + LGT+P+ L + +
Sbjct: 645 TPNDDGKGVAAVHLNIGLQNGCLFRNTVDNVTGAIMDTRTRYLGTRPVKLFKVQVQGRSA 704
Query: 591 VFAASDRPTVIYSSNKKL---LYSNVNLKEVSHMC 622
+ S R ++Y ++ S VNL+ + C
Sbjct: 705 ILCTSSRSWLLYHFQRRFHLTPLSYVNLEYAASFC 739
>gi|170580631|ref|XP_001895346.1| splicing factor 3B subunit 3 [Brugia malayi]
gi|158597745|gb|EDP35799.1| splicing factor 3B subunit 3, putative [Brugia malayi]
Length = 1181
Score = 147 bits (372), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 242/1178 (20%), Positives = 463/1178 (39%), Gaps = 214/1178 (18%)
Query: 7 YGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTDN 65
+G I +L FR G ++D++ I ++ + +L+++A+ R + + G R
Sbjct: 60 FGIIRSLLAFRLTGGSKDYIVIGSDAGRIVILEYNAQKV-CFERVHQETFGKTGCRRIVP 118
Query: 66 GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YGC 120
GQ +DP R LIG L ++ D L + + + + + G
Sbjct: 119 GQFLAVDPKGRAILIGAVERQKLVYIMNRDASANLTISSPLEAHKSHCICYSVVGVDVGF 178
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALK--------DKDFV----EGPWS----------QN 158
PT L D ++ H T +A K + D V +GP
Sbjct: 179 ENPTFACLEVDYEEVDHDPTGHLATKIPQTLTFYELDLVPGGQDGPSGVIVCCENYLVYK 238
Query: 159 NLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLL 218
NL + D+ PVP + I+ CSA + +L+
Sbjct: 239 NLGDQPDIKCPVPRRRNELDDCDRTVIIVCSATHKTKLMYF----------------FLV 282
Query: 219 GDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKL-- 276
G + + + E + VT LKI+ +++ + L ++ S +G+ L ++
Sbjct: 283 QTDQGDIFKVTLESEHDIVTELKIKYFDTIPVSNAMCILKTGFLFTASEFGNHHLYQIAH 342
Query: 277 ------------NLQPDAKGSY---------VEVLERYVNLGPIVDFCV---VDLERQGQ 312
+Q + ++ + ++++ +L P++ F Q
Sbjct: 343 LGDEDDEPEFSSRMQLEEGETFFFAPRGLTNLAIVDQMDSLSPLIFFPTRTRCTFSSQSI 402
Query: 313 GQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDPFDTFLVVSFISE 372
Q + + ++ N + W+++ + DD FD+ +VVSF++
Sbjct: 403 AQWFGGKRKWLSLNCLVIPNAV----------------WTVKRNIDDKFDSHIVVSFVNA 446
Query: 373 TRILAMNLEDELEETEIEGFCSQTQTLFC----HDAIYNQLVQVTSGSVRLVSSTSRELR 428
T +L++ + +EE GF T TL C DA+ + V G + + S R L
Sbjct: 447 TLVLSIG--ETVEEVTDSGFLGTTPTLGCALIGDDALCIRTVFDILGLIGELMS-GRHLV 503
Query: 429 NEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKHAQLEYEISCLDIN 486
N + N QV +A GG LVY E+ G L E + +L ++ C+ ++
Sbjct: 504 N-------VQLLKCALNRRQVAIALAGGELVYFELDVTGQLNEYTERRELPADVLCMSLS 556
Query: 487 PIGENPSYSQIAAVGMWTDISVRIFSL------PDLNLITKEHLGGEIIPRSVLLCAFEG 540
I E S+ VG+ D +VRI SL L LI+ P S+++ G
Sbjct: 557 EIPEGELRSRFLTVGL-ADKTVRIISLDPQDCLSPLQLISFSMQALPSEPESIIVLEMFG 615
Query: 541 IS-------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFA 593
+L L +G LL ++ TGELTD + LGT+ + L ++ + A
Sbjct: 616 TETQSASTVHLNIGLQNGCLLRTTVDQVTGELTDNRTRYLGTKSVKLFRVRIQSKDAIMA 675
Query: 594 ASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLH 653
AS R ++Y + + ++ + F+S P+ + E L I +++ + +
Sbjct: 676 ASSRAWLLYDYQSRFHLTPLSYAALEFAAGFSSEQCPEGIVAIAENTLRILSLEKLGAVF 735
Query: 654 IRSI-PLGEHPRR-ICHQ---------------------EQSRTFAICSLKNQSCAEESE 690
+ PL PRR + H+ E+ + A ++ AEE++
Sbjct: 736 NHVVHPLDYTPRRMVVHKASGNLIIIENDHAAFTVKGKMERRKQLADELMEVAKEAEEAD 795
Query: 691 MHFV-------------------------------RLLDDQTFEFISTYPLDTFEYGCSI 719
V R++ E +S +P E +I
Sbjct: 796 QQAVKEMADAIRTEKVDERVYGSPKNQKGKWASTVRIMRSNDGETLSHFPFAEDEAAFAI 855
Query: 720 LSCSFSDDSNVYYC-VGTAYVLP-EENEPTKGRILVFIVEDG--KLQLIAEKETKGAVYS 775
F + S+ + VG L + + G I F++ LQL+ T V +
Sbjct: 856 AMVQFQNQSDTQFVLVGCGCDLQLKPRKANGGCIYTFLLAANGTTLQLLHRTPTDEVVNA 915
Query: 776 LNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHG-HILALYVQTRGDFIVVGDLM 834
++ F G LA + +K++LY D G R+L ++C + + +++ G IVV D
Sbjct: 916 IHDFRGMALAGVGKKVRLY-----DLGKRKLLAKCENRQIPTQVVDIRSMGQRIVVSDSQ 970
Query: 835 KSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDE 894
+S+ + YK ++G + + + +++ V +LD D + + F V + +G T+E
Sbjct: 971 ESVHFMRYKKQDGQLSIFCDETSPRYVTCVCLLDYDT-VAVGDRFGNVAVLRLPKGVTEE 1029
Query: 895 ------------ERG-------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVI 935
+RG +LE + ++G+ + + SLV G +
Sbjct: 1030 VQEDPTGVRALWDRGNLNGASQKLEAIAHLYIGDAITSMQKTSLV-------PGANDCLS 1082
Query: 936 FGTVNGVIGVIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNF 994
+ T++G+IG++ + +++ F + L+ ++R + G +H +RS+ K+V
Sbjct: 1083 YTTISGIIGILVPFMSRDEFEFFQNLEMHMRVEYPPLCGRDHLAYRSYYFPVKSV----- 1137
Query: 995 LDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
+DGDL E + + + + + + E+ K++E++
Sbjct: 1138 IDGDLCEQYSLMPLDKQKSVGEELGRKPTEIHKKLEDI 1175
>gi|326497839|dbj|BAJ94782.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 1227
Score = 147 bits (372), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 181/767 (23%), Positives = 313/767 (40%), Gaps = 87/767 (11%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+L V ++G I +L FR G +D+L + ++ + +L++ + +
Sbjct: 63 LLSVDVFGAIRSLAQFRLTGATKDYLVVGSDSGRLVILEYSPDRNRFDKVHQETFGKSGC 122
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEE---------LQV 111
R GQ+ +DP R + + + V + + + LE L
Sbjct: 123 RRIVPGQLLSVDPKGRALCIAALEKQKLVYVLNRDASARLTISSPLEAHKSHTLTFALTA 182
Query: 112 LDIKF---LYGCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNN--------L 160
LD F ++G I + Y D+ + A K F E N+ +
Sbjct: 183 LDCGFDNPVFG----AIELEYGDSDRDPTGQAASHAQKLLTFYELDLGLNHVSRKVSEPI 238
Query: 161 DNGADLLIPVPPPL---CGVLIIGEETIVYCS---ANAFKAIPIRPSITKAYGR--VDAD 212
DNGA++L+ VP G+L+ + ++Y + IP R + G V A
Sbjct: 239 DNGANMLVTVPGGADGPSGLLVCCDNFVLYRNQGHPEVRAVIPRRVDLPAERGVLIVAAA 298
Query: 213 GSR------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGS 266
R +LL G + + + H + V+ L+I+ +AS I L + ++ S
Sbjct: 299 THRQKSLFFFLLQTEYGDIFKVDLDHHGDTVSELRIKYFDTIPVASAICVLRSGFLFAAS 358
Query: 267 SYGDSQLIKLN--------------------------LQPDAKGSYVEVLERYVNLGPIV 300
+G+ L + QP A + V + E +L PI+
Sbjct: 359 EFGNHALYQFRDIGRDVDVESSSATLMETEEGFQPVFFQPRALKNLVRIDE-IESLMPIM 417
Query: 301 DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIK-GMWSLRSSTDD 359
D + +L + QV T G +LRI+R G+ I+E A L +W+++ + +D
Sbjct: 418 DMRIANLFDEETPQVYTACGRGSRSTLRILRPGLAISEMARSMLPAEPIAVWTVKKNIND 477
Query: 360 PFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRL 419
FD ++VVSF + T +L++ + +EE F T +L + L+QV +R
Sbjct: 478 MFDAYIVVSFANVTLVLSIG--ETIEEVSDSHFLDTTHSLAVSLLGEDSLMQVHPNGIRH 535
Query: 420 VSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTEVKHAQLEY 478
+ R NEW++P ++ +N QV++A GG L+Y E+ G L EV+ +
Sbjct: 536 IREDGRV--NEWRTPGKKTITKVGSNRLQVVIALSGGELIYFEMDMTGQLMEVEKQDMSG 593
Query: 479 EISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPRSVLLCA 537
+++CL I P+ E S+ AVG + D ++RI S PD L P S+L
Sbjct: 594 DVACLAIAPVPEGRQRSRFLAVGSY-DNTIRILSFDPDDCLQPLSVQSVSSAPESLLFLE 652
Query: 538 FE-------GISY-----LLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSS 585
+ G Y L L +G L ++M TG+L+D + LG +P L
Sbjct: 653 VQASVGGEDGADYPANLFLNAGLQNGVLFRTNVDMVTGQLSDTRSRFLGLRPPKLFPCIV 712
Query: 586 KNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGT 645
+ + S RP + Y L + ++ + F+S + + L I T
Sbjct: 713 SHRQAMLCLSSRPWLGYIHQGHFLLTPLSCDTLESAASFSSDQCSEGVVAVAGDALRIFT 772
Query: 646 IDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAIC-SLKNQSCAEESE 690
I+ + + + SIPL PR+ + + A+ S K AE+ E
Sbjct: 773 IERLGETFNETSIPLRYTPRKFVILPKKKYLAVIESDKGAFSAEQRE 819
Score = 89.7 bits (221), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 88/364 (24%), Positives = 160/364 (43%), Gaps = 42/364 (11%)
Query: 694 VRLLDDQTFEFISTYPLDTFEYGCSILSCSFSD-DSNVYYCVGTAYVL---PEENEPTKG 749
+R+LD ++ + L E SI + +F D + VGT L P+ + +
Sbjct: 876 IRILDPRSRDTTCLLELQDNEAAVSICTVNFHDKEHGTLLAVGTTKGLQFWPKRSLASGF 935
Query: 750 -RILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQS 808
I F+ E L+L+ + + + SL F G+LLA + ++LY D G R+L
Sbjct: 936 IHIYKFVNEGKSLELLHKTQVEEVPLSLCQFQGRLLAGVGSVLRLY-----DLGKRKLLR 990
Query: 809 ECGHHGHILALY-VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEIL 867
+C + + + T D I VGD+ +S Y+ +E + A D W++A +
Sbjct: 991 KCENKLFPRTIVSIHTYRDRIYVGDMQESFHYCKYRRDENQLYIFADDSVPRWLTAANHI 1050
Query: 868 DDDIYLGAENNFNLFTVR------------------KNSEGATDEERGRLEVVGEYHLGE 909
D D GA+ N++ R K +G + ++E + ++H+G+
Sbjct: 1051 DFDTMAGADKFGNIYFARLPQDLSDEIEEDPTGGKIKWEQGKLNGAPNKVEEIVQFHVGD 1110
Query: 910 FVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIG-VIASLPHEQYLFLEKLQTNLRKVI 968
V + SL+ G +I+GTV G +G ++A E F L+ +LR+
Sbjct: 1111 VVTCLQKASLI-------PGGGECLIYGTVMGSVGALLAFTSREDVDFFSHLEMHLRQEH 1163
Query: 969 KGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKR 1028
+ G +H +RS K+ +DGDL E + L +I+ ++ + E+ K+
Sbjct: 1164 PPLCGRDHMAYRS-----AYFPVKDVIDGDLCEQYPSLPADMQRKIADELDRTPGEILKK 1218
Query: 1029 VEEL 1032
+E++
Sbjct: 1219 LEDI 1222
>gi|229577321|ref|NP_001153352.1| splicing factor 3b, subunit 3 [Nasonia vitripennis]
Length = 1216
Score = 147 bits (372), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 177/782 (22%), Positives = 327/782 (41%), Gaps = 96/782 (12%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
ML V ++G I +L FR G +D++ + ++ + +L++ + + + + + G
Sbjct: 53 MLTVEVFGIIRSLMAFRLTGGTKDYIVVGSDSGRIVILEY-VPAKNVFEKVHQETFGKSG 111
Query: 61 -RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
R GQ IDP R +IG L ++ D + +L + + + L +
Sbjct: 112 CRRIVPGQYLAIDPKGRAVMIGAIEKQKLVYILNRDPEARLTISSPLEAHKSNTLVYHMV 171
Query: 118 ---YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNG 163
G P L D ++A T E A+K + F E N+ L+
Sbjct: 172 GVDVGFENPMFACLEIDYEEADTDPTGEAAIKTQQTLTFYELDLGLNHVVRKYSEPLEEH 231
Query: 164 ADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR--- 215
A+ L+ VP P GVLI E + Y N IR I + +D D R
Sbjct: 232 ANFLVSVPGGNDGP--SGVLICSENYLTY--KNLGDQPDIRCPIPRRRNDLD-DPERGMI 286
Query: 216 --------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAV 261
+L G + + + +++ VT +K++ +A+++ L
Sbjct: 287 FVCSATHKTKSMFFFLAQTEQGDIFKITLETDEDVVTEIKLKYFDTVPVANSMCVLKTGF 346
Query: 262 VYIGSSYGDSQLIKL-------------NLQPDAKGSYVEVLERYV----------NLGP 298
+++ S +G+ L ++ + P +G R + +L P
Sbjct: 347 LFVASEFGNHYLYQIAHLGDDDDEPEFSSAMPLEEGDTFFFAPRPLRNLVLVDEMDSLSP 406
Query: 299 IVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSST 357
I+ V DL + Q+ G +LR++R+G+ ++E A EL G +W+++
Sbjct: 407 IMACQVADLANEDTPQLYIACGRGPRSTLRVLRHGLEVSEMAVSELPGNPNAVWTVKRRI 466
Query: 358 DDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSV 417
D+ FD +++VSF++ T +L++ + +EE GF T TL C + LVQV +
Sbjct: 467 DEEFDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSALGEDALVQVYPDGI 524
Query: 418 RLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD-GILTE-VKHAQ 475
R + + R NEWK+P ++ N QV++A GG LVY E+ G L E + +
Sbjct: 525 RHIRADKR--VNEWKAPGKKTIIKCAVNQRQVVIALTGGELVYFEMDPTGQLNEYTERKK 582
Query: 476 LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSV-L 534
+ E+ C+ + + S AVG+ D +VRI SL + + + + +P +
Sbjct: 583 MPSEVMCMALGNVAAGEQRSWFLAVGL-QDNTVRIISLDPSDCLAPRSM--QALPAAAES 639
Query: 535 LCAFE-GIS--------------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPIT 579
LC E GI +L L +G LL +L+ +G+L+D + LG++P+
Sbjct: 640 LCIVEMGIKDDTSEDSNQMQSTLHLNIGLQNGVLLRTVLDPISGDLSDTRTRYLGSRPVK 699
Query: 580 LRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEG 639
L + V A S R + Y + + ++ + + F+S P+ +
Sbjct: 700 LFRIRMQGNQAVLAMSSRSWLSYYYQNRFHLTPLSYESLEFASGFSSEQCPEGIVAISTN 759
Query: 640 ELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQEQSRTFAICSLKNQSCAEESEMHFVRLL 697
L I ++ + + + S P PR+ + H E + I + N E + +++
Sbjct: 760 TLRILALEKLGAVFNQVSFPFEYTPRKFVIHNESAHVLLIETEHNAYTEETKQQRRLQMA 819
Query: 698 DD 699
++
Sbjct: 820 EE 821
Score = 77.0 bits (188), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 66/275 (24%), Positives = 126/275 (45%), Gaps = 39/275 (14%)
Query: 779 FNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHI--LALYVQTRGDFIVVGDLMKS 836
+ G++L + + ++LY D G ++L +C + HI + + G I V D+ +S
Sbjct: 954 YQGRVLVGVGKMLRLY-----DMGKKKLLRKC-ENKHIPNAVICINAIGQRIYVSDVQES 1007
Query: 837 ISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSE--GATDE 894
+ + YK +E + A D + W++ +LD D A+ N+ +R S DE
Sbjct: 1008 VYAVRYKRQENQLIVFADDTHPRWITTTCVLDYDTVATADKFGNIAVIRLASSINDDVDE 1067
Query: 895 ---------ERGRL-------EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGT 938
+RG L + V +H+GE V + +L+ G ++++ T
Sbjct: 1068 DPTGNKALWDRGLLNGASQKADTVACFHVGETVMSLQKATLI-------PGGSESLVYTT 1120
Query: 939 VNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDG 997
++G +GV+ HE + F + L+ ++R + G +H +RS+ KN +DG
Sbjct: 1121 LSGTVGVLVPFTSHEDHDFFQHLEMHMRAEHPPLCGRDHLSFRSY-----YYPVKNVIDG 1175
Query: 998 DLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
DL E F + + IS + + E+ K++E++
Sbjct: 1176 DLCEQFNSIEPGKQKSISGDLERTPSEVSKKLEDI 1210
>gi|443896643|dbj|GAC73987.1| predicted DNA methylase [Pseudozyma antarctica T-34]
Length = 1285
Score = 147 bits (371), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 247/1174 (21%), Positives = 476/1174 (40%), Gaps = 169/1174 (14%)
Query: 7 YGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTDN 65
+G I +L FR G ++D+L + ++ + +L++ SS + + R G R
Sbjct: 126 FGVIRSLAAFRLTGGSKDYLIVGSDSGRIVILEYQP-SSNSFDKVHQETFGRSGSRRIVP 184
Query: 66 GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLK-----EAF--NIRLEELQVLDIKF 116
GQ DP R +IG L ++ D + L EA N + + +D+ F
Sbjct: 185 GQYLATDPKGRATMIGAMEKAKLVYILNRDAEANLTISSPLEAHRPNGIIHHIVGVDVGF 244
Query: 117 ---LYGCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNL--------DNGAD 165
L+ C + V + ++DA + +E A K + E N++ D ++
Sbjct: 245 ENPLFACLE---VDNSESDRDASG-RAFEEAEKTLTYYELDLGLNHVVRRWAEPVDPRSN 300
Query: 166 LLIPVPPPL----------CGVLIIGEETIVYCSANAFK-AIPI--------RPS----- 201
LLI VP GVL+ E+ I Y + + +PI RP+
Sbjct: 301 LLIQVPGGYNQNQEKWDGPSGVLVCSEDYITYKHQDQPEHRVPIPKRLNPIERPNERRGT 360
Query: 202 -ITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNA 260
I + + +L+ G L + + H+ +++ L+I+ +AS + L +
Sbjct: 361 LIVASVLHKMKNAFFFLVQTEDGDLFKVTMDHQDDEIRALRIKYFDTVPVASGLCILRSG 420
Query: 261 VVYIGSSYGDSQLIKLN------------------------------LQPDAKGSYVEVL 290
+++ S +G L P + ++V
Sbjct: 421 FLFVASEFGPQLLYSFQKLGDDDELPEYISTDYDDYGAGRRRPQLPTFTPRPLDNLMQVD 480
Query: 291 ERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KG 349
E +L PI+D ++ Q+ G S +++R+G+ E S +L G+
Sbjct: 481 E-VPSLDPILDATPLNPLASDSPQIFAACGRGARSSFKMLRHGLEALEAVSSDLPGVPSA 539
Query: 350 MWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQL 409
+W+ + + D +D+++V+SF++ T +L++ + +EE G + + TL + L
Sbjct: 540 VWTTKITRRDEYDSYIVLSFVNGTLLLSIG--ETIEEVSDSGLLTSSSTLAVQQLGEDAL 597
Query: 410 VQVTSGSVRLVSSTSRELRNEWKSP--PG--YSVNVAT-ANASQVLLATGGGHLVYLEIG 464
+QV +R + +++ NEW +P P + VAT N QV++A LVY E+
Sbjct: 598 LQVHPHGIRHIL-VDKQI-NEWVTPSLPNGRQTTIVATCTNERQVVVALSSNELVYFELD 655
Query: 465 -DGILTEVKHAQ-LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKE 522
DG L E + + + + + + E + AVG D +VRI SL + +
Sbjct: 656 MDGQLNEYQERKAMGAPVLTMSMPECPEGRQRTAYLAVGCG-DSTVRIVSLEPSSTLASI 714
Query: 523 HLGGEIIPRSVLLCA---------FEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSL 573
+ P S + A +++ L +G LL +L+ TG+LTD + L
Sbjct: 715 SIQALTAPASSICMAEMHDSTVDRHHATTFVNIGLQNGVLLRTVLDGVTGQLTDTRTRFL 774
Query: 574 GTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSL 633
G++ + L T V A S R + ++ ++ ++ + + H F++ PD L
Sbjct: 775 GSKAVRLVRTRVHGQTAVMALSTRTWLSFTYQSRVQFTPLIFDALDHAWSFSAELCPDGL 834
Query: 634 AIAKEGELTIGTIDDI-QKLHIRSIPLGEHPRRIC-HQEQSRTFAICSLKNQSCAEESEM 691
L I TI + KL S+ L PRR+ H + F + +++ + ++
Sbjct: 835 IGIVGSTLRIFTIASLASKLKQDSVALSYTPRRMASHPDGQGLFYVAEADHRTLSPGAQR 894
Query: 692 HFVRLLDDQTFEFI-STYPLDTFEYGC------SILSCSFS--------------DDSNV 730
V LD + LD E+G + SC DD+
Sbjct: 895 RRVEALDKELKPHQRGVLDLDPAEFGSIRAEAGNWASCVRVVDGVNAQTTHRIELDDNEA 954
Query: 731 YYCVGTAYVLPEENE--------------P---TKGRILVFIVEDG--KLQLIAEKETKG 771
+ + + + P K + + + +G +L+++ + E
Sbjct: 955 AFSIAVVPFASADKQLFVVVGSAVEVVMSPRSFKKAYLTTYRLGNGGRELEVVHKTEVDD 1014
Query: 772 AVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY-VQTRGDFIVV 830
L AF G+LLA + + +++Y+ G ++L +C + A+ + +G IVV
Sbjct: 1015 VPLVLRAFQGRLLAGVGKVLRIYEL-----GKKKLLRKCENRSFPTAIVALDAQGSRIVV 1069
Query: 831 GDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR--KNS 888
GD+ +S+ YK E + A D +++ +LD D A+ N++ VR ++
Sbjct: 1070 GDMQESVIFASYKPLENRLVTFADDIMPRYVTRCTMLDYDTVAAADKFGNVYVVRIDADT 1129
Query: 889 EGATDEERGRLE-------VVGEYHLGEFVNRFRHGSLVMRLPDSDV--GQIPTVIFGTV 939
+ DE+ + ++G H + + G ++ L + + G +++ +
Sbjct: 1130 SRSVDEDVTGMTTMHEKPLLMGAAHKATLLAHYFVGDIITSLSRAVMVPGGREVLLYTGI 1189
Query: 940 NGVIGVIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGD 998
+G IG + + E + L+ LR+ + G +H +RS K+ +DGD
Sbjct: 1190 SGTIGALVPFVSKEDVDTMTTLEMQLRQQSDSLVGRDHLAYRS-----SYAPVKHVIDGD 1244
Query: 999 LIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
L ESF L + +++ ++ E+ K++ +L
Sbjct: 1245 LCESFGLLPPAKQSAVAQELDRKPSEVNKKLAQL 1278
>gi|451852814|gb|EMD66108.1| hypothetical protein COCSADRAFT_34693 [Cochliobolus sativus ND90Pr]
Length = 1235
Score = 147 bits (371), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 227/1002 (22%), Positives = 394/1002 (39%), Gaps = 157/1002 (15%)
Query: 155 WSQNNLDNGADLLIPVP---PPLCGVLIIGEETIVY------CSANAFKAIPIRPSITKA 205
WS+ +D A+ L VP GVL GE++I Y S+ AIP R T+
Sbjct: 224 WSEP-VDRTANTLFRVPGGPSAPSGVLCCGEDSITYRRIFNNKSSVRRLAIPRREGATED 282
Query: 206 YGR-----------VDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTI 254
R + YLL G + L + V +KI+ + ++I
Sbjct: 283 PNRKRMIVAGTLYSLKGGDFFYLLQTEDGDVFKLTVDAPNGTVENIKIKYFDTIPVTTSI 342
Query: 255 SYLDNAVVYIGSSYGDSQLIKL---------------NLQPDAKGSY------------V 287
L VY GD L +L D + S+ +
Sbjct: 343 CILRAGFVYAACESGDRILYELESLGDETDDPVFESSQFPVDPEASFAPPFFRPRALVNL 402
Query: 288 EVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVEL-QG 346
+E +L PI+D V + + Q+ T +G S R RN + + + L Q
Sbjct: 403 TAVEAMPSLNPIMDMEVANPALEDAPQIYTINGTGGRSSFRTTRNALEVLDLIESPLPQN 462
Query: 347 IKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIY 406
+W+ + +++D DT +V+ +R L + + D++EE GF T TL
Sbjct: 463 ASDVWTTKLTSEDETDTLIVLCL--HSRTLVLKIGDDVEEASNTGFLPDTNTLGVQQFGE 520
Query: 407 NQLVQVTSGSVRLV---------SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGH 457
+ ++Q+ +R + +S + +W+ P ++ N QV +A G
Sbjct: 521 DCIIQIHPKGIRHIQGIQFPNDDASATHASLTDWQPPAHRTIVACATNNRQVAIALSSGQ 580
Query: 458 LVYLEI-GDGILTEVKHA-QLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-P 514
++Y E DG L + LE I+CL + + E + AVG +D +VRI++L P
Sbjct: 581 ILYFECDSDGSLAMAEEEIVLESTINCLAMPDVPEGSVRAFFLAVGC-SDQTVRIYNLSP 639
Query: 515 DL--NLITKEHLGGEIIPRSVLLC------AFEGIS-YLLCALGDGHLLNFLLNMKTGEL 565
D+ N++ + P S L + G S +L L G + +L+ TG++
Sbjct: 640 DMEGNILRSISVQALTSPPSDLTINLMTDKSSRGYSQFLHIGLRSGVYIRSVLDEMTGDI 699
Query: 566 TDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKK--LLYSNVNLKEVSHMCP 623
D ++ LG +PI + + A + RP + Y+ + L + +N
Sbjct: 700 GDTRRRFLGPEPIKFAKVTVAGEPAILAMTSRPWLGYTHPRTGVLQLTPLNYISFKSAWN 759
Query: 624 FNSAAFPDSLAIAKEGELTIGTIDDI-QKLHIRSIPLGEHPRRIC-HQEQSRTFAICSLK 681
F+ + F + ++ EL I T +D+ SIPL PR++ + +Q + I S
Sbjct: 760 FDGSQFKGIICVSAN-ELRIFTFNDLTDNTTYESIPLKYTPRKMVGYHDQGVFYVIQSDN 818
Query: 682 N------------QSCAEESEMHFVRLLDDQTFEFIST--YPLDTFEY------------ 715
N QSC ++ + + +Q+ + +P F Y
Sbjct: 819 NTISADRRQQLIAQSCGKKEDGTNGSMETEQSNGAADSDEFPAVDFGYPRAQGSWASCIQ 878
Query: 716 ------------------GCSILSCSF----SDDSNVYYCVGTAYVL---PEENEPTKGR 750
S++S + S + + VGTA L P + +
Sbjct: 879 VVDPVTEKTVTHTVELNGNISLVSAALVFFESRNDEAFLAVGTAKDLSFTPYKFSSASIQ 938
Query: 751 ILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSEC 810
I +L+ E +L AF GKLLA + + + LY ++ R+ Q+
Sbjct: 939 IYKINPTGRELEFFHETTVSDPPLALLAFKGKLLAGVGRHLCLYDCGMK-SVLRKAQAPN 997
Query: 811 GHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHE--EGAIEERARDYNANWMSAVEILD 868
I L +T+G +VV D +S++ +++K + + D A +A E+LD
Sbjct: 998 CVPTRITGL--KTQGSRLVVSDQAQSVTYVVHKDQVHPNRLIPFVDDTIARHTTASEMLD 1055
Query: 869 DDIYLGAENNFNLFTVR---KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPD 925
D +G + N++ VR K SE + + G +V + +LG NR L+
Sbjct: 1056 YDTTVGGDKFGNIWLVRCPQKVSESSDESPDGSDLLVDKSYLGGTPNRL---DLIAHYFT 1112
Query: 926 SDV-----------GQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGG 973
+D+ G V + + G +G + + ++L+ LR K + G
Sbjct: 1113 NDIPVSIQKTVLLSGGERVVFWAGLQGTLGALIPFNSRRSHKMFQQLELQLRSDDKPLSG 1172
Query: 974 LNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEIS 1015
+H +RS+ K+V +DGDLIE FL L R + + I+
Sbjct: 1173 RDHLAFRSYFAPVKSV-----IDGDLIERFLVLPRDKRESIA 1209
>gi|428180132|gb|EKX49000.1| hypothetical protein GUITHDRAFT_105085 [Guillardia theta CCMP2712]
Length = 1207
Score = 146 bits (368), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 234/1054 (22%), Positives = 433/1054 (41%), Gaps = 173/1054 (16%)
Query: 124 TIVVLYQDNKD---------ARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPL 174
+I V Y DN + +R V YEV + + +E ++ + +DN ++LLIPV L
Sbjct: 184 SIEVDYVDNPNDEFQDPVVSSRRVCLYEVDMANNS-LEVRYN-DFIDNSSNLLIPVYNHL 241
Query: 175 CG---VLIIGEETIVYCSANAFKAIPIRPS----------ITKAY--GRVDADGSRYLLG 219
G VLI E I S +A + + + P I +Y + D S ++
Sbjct: 242 SGLFGVLICAENKIALKSPDAEEIVLLIPRRSMLPLEQSVIITSYVLMGIQDDLSIFIAQ 301
Query: 220 DHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLN-- 277
+ G L+ + ++ + L IE L +A ++ + + + + S +G+ L +
Sbjct: 302 NDIGDLYKITVSLRCTQDHVLGIEYLDTVPVAQSMIIVRDQYLLLCSEFGNHILFSFSAS 361
Query: 278 -------------LQPDAKG-----------SYVEVLERYVNLGPIVDFCVVDLERQGQG 313
+Q D + Y V+E +L PIV V+DL +G
Sbjct: 362 EVGQVTSTIGLTSVQFDDENIDIPNFLPHNLQYFRVVEEIESLSPIVGMKVMDLYSEGNF 421
Query: 314 QVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIK-GMWSLRSSTDDPFDTFLVVSFI-S 371
QV T G S+R++R+G+ + E A +L + +W+L++++ DP D ++VVS S
Sbjct: 422 QVFTLCGKGPRSSIRVLRHGLSVVEMAISQLPTVPVAVWTLKANSTDPHDRYIVVSLAHS 481
Query: 372 ETRILAMNLEDELEETEIEGFCSQTQTLF------CHDAIYNQLVQVTSGSVRLVSSTSR 425
+ L +++ + +E G +++ C ++ +QV G + S +
Sbjct: 482 SVQTLVLSIGETVEAVANHGLLPHAKSISIATMGDCMIQVHTNGIQVMKGGKNIPSPFAS 541
Query: 426 ELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTEVKHAQLE-YEISCL 483
+P SVN N Q+L+ G YLE+ G G L E + E+ +
Sbjct: 542 ------SNPVILSVN----NEQQMLVTQTDGVTSYLEMEGTGTLREKGKININAQEVCAV 591
Query: 484 DINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEII----PRSV-LLCAF 538
++ P+ + + + A +G + D + + L+ + + G + P S+ LL ++
Sbjct: 592 EVTPLSASQTLGKFAIMGAFMD-NAWFLCVVSLDASSFSSVVGRQVLQARPSSIALLQSY 650
Query: 539 --------EGISYLLCALGDGHLLNFLLNMKTGELTDRKKV-SLGTQPITLRTFSSKNTT 589
+ +L L +G L+ N +TGE++ + SLG+ P+ L +
Sbjct: 651 SRSSREPGRSVFFLYVGLENGVLMRMSFNAETGEISQEFRTRSLGSNPVKLVRVKVQEKE 710
Query: 590 HVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDI 649
+ A S R + Y K ++ + F+S P+ +G L I +++ +
Sbjct: 711 ALLALSSRSWLAYHHQGKQCLDPLSYDMLDFAWNFSSQQCPEGFVCISQGSLRILSLERV 770
Query: 650 QKLHIRSIP-LGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLD----DQTFEF 704
++ ++ L PR+ + + + + + A ++ +D D +
Sbjct: 771 GEIFSQATAKLAHTPRQCAPIKGTTMMMVIESDHNTDANKALQDGANKVDMEGQDGEADK 830
Query: 705 IS--TYPLDTFEYGC-----------SILS---CSFSDD-SNVYYC------------VG 735
+S + + GC S++S ++SDD + + C VG
Sbjct: 831 LSYEMFGVQRAGNGCWASSLRVIDMDSLVSRQIINYSDDEAALSLCSSVGANGEHLVLVG 890
Query: 736 TAYVLPE-ENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLY 794
T+ L E + G + + V L L + G ++ + G++L + +++Y
Sbjct: 891 TSVGLRMGEKHASSGFVYTYSVSGSHLHLEHKTPMDGIPRAICNYQGRVLVGVGSALRMY 950
Query: 795 ----KWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIE 850
K +LR +R+ + L + T+GD I VGD +S SLL + + E
Sbjct: 951 EIGKKKLLRKCESRKFPN--------LICSIHTQGDRIFVGDSAESFSLLRFNSLSNSFE 1002
Query: 851 ERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR-----------KNSEGATDE--ERG 897
A D A W++A LD D GA+ NLF R + TD RG
Sbjct: 1003 LFAEDARARWLTASCPLDFDTVAGADKFGNLFICRIPEEAASELEEEGEAVQTDSVLHRG 1062
Query: 898 ---RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIAS-LPHEQ 953
+L+ V ++GE V + +LV G VI+GT+ G +G + + E
Sbjct: 1063 AKHKLDEVAHQYVGEAVLGLQRSALVQ-------GGTEAVIYGTILGGLGALQPFVSKED 1115
Query: 954 YLFLEKLQTNLRKVIKGVGGLNHEQWRSFNN-----------EKKTVDAKNFLDGDLIES 1002
F +L+ LR + +N +SF+N + +DGDL E+
Sbjct: 1116 VDFFLRLEMLLRGNLGVRESVND---KSFDNPSLCGNDQLGFRSYFAPMRGVVDGDLCET 1172
Query: 1003 FLDLSRTRMDEISKTMNVSVEELCKRVEEL-TRL 1035
F L R +++ + + EE+ K++E++ TRL
Sbjct: 1173 FHTLDSGRQHKVAAELGKTPEEVAKKIEDMRTRL 1206
>gi|396488712|ref|XP_003842924.1| similar to pre-mRNA splicing factor 3b [Leptosphaeria maculans JN3]
gi|312219502|emb|CBX99445.1| similar to pre-mRNA splicing factor 3b [Leptosphaeria maculans JN3]
Length = 1236
Score = 145 bits (365), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 227/1000 (22%), Positives = 395/1000 (39%), Gaps = 152/1000 (15%)
Query: 155 WSQNNLDNGADLLIPVP---PPLCGVLIIGEETIVY-----CSANAFK-AIPIR--PSIT 203
WS+ +D A+ L VP GVL+ GE+ I Y N + AIP R P+
Sbjct: 230 WSEP-IDRTANTLFRVPGGANAPSGVLVCGEDNITYRRIFNQKENVHRLAIPRREGPTED 288
Query: 204 KAYGRVDADGSRYLL--GDHAGLLHL-------LVITHEKEKVTGLKIELLGETSIASTI 254
R+ G+ Y L GD LL L + KV +KI+ + ++I
Sbjct: 289 PNRKRMIVAGTLYSLKGGDFFYLLQTEDGDVFKLTVEASNGKVEKIKIKYFDTIPVCTSI 348
Query: 255 SYLDNAVVYIGSSYGDSQLIKLN-----------------LQPDAKG----------SYV 287
L VY GD L +L + DAK + +
Sbjct: 349 CILRAGFVYAACESGDRILYELESLGDETEDPLFSSDQFPVDTDAKFAPPYFKPRALTNL 408
Query: 288 EVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVEL-QG 346
+E+ +L PI+ V + + Q+ T +G S R RN + + + L Q
Sbjct: 409 TAVEQMPSLNPIMGMEVANPALEDAPQIYTINGTGGRSSFRSTRNALEVLDLIESPLPQN 468
Query: 347 IKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIY 406
+W+ + ++DD DT +V+ +R L + + D++EE GF T TL
Sbjct: 469 ASDVWTTKLTSDDETDTLIVLCL--HSRTLVLKIGDDVEEASNTGFLPDTNTLGVQQFGE 526
Query: 407 NQLVQVTSGSVRLV---------SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGH 457
+ ++Q+ +R + ++ + + +W+ P ++ N QV +A G
Sbjct: 527 DCIIQIHPKGIRHIQGIQFPDNDANATHQSLTDWQPPAHRTIVACATNNRQVAIALSSGQ 586
Query: 458 LVYLEI-GDGILTEVKHA-QLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-P 514
++Y E DG L + L+ I+CL + + E + AVG +D +VRIF+L P
Sbjct: 587 ILYFECDSDGSLAMAEEEIALDSTINCLAMPDVPEGSVRAFFLAVGC-SDQTVRIFNLSP 645
Query: 515 DL--NLITKEHLGGEIIPRSVLLCAF------EGIS-YLLCALGDGHLLNFLLNMKTGEL 565
D+ N++ + P S L G S +L L G + +L+ TG++
Sbjct: 646 DMEGNILRSISVQALTSPPSDLTINLMSDKSPRGYSQFLHIGLRSGVYIRSVLDEMTGDI 705
Query: 566 TDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKK--LLYSNVNLKEVSHMCP 623
D ++ LG +PI + + A + RP + Y+ + L + +N
Sbjct: 706 GDTRRRFLGPEPIKFAKVTVAGEPAILAMTSRPWLGYTHPRTGVLQLTPLNYIAFKSAWN 765
Query: 624 FNSAAFPDSLAIAKEGELTIGTIDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKN 682
F+ + F + ++ EL I T +D+ +IPL PR++ + F I +
Sbjct: 766 FDGSQFKGIICVSAN-ELRIFTFNDLTDNTTYENIPLKYTPRKMVGYHEQGIFYIIESEC 824
Query: 683 QSCAEESEMHFVRLLDDQT----------FEFISTYPLDTFEY--------GC------- 717
+ ++ + ++ Q E P F Y C
Sbjct: 825 NTLNADTRQNLIQKASKQDENGNGEATNGTEESDELPAVDFGYPRAQGRWASCIQAVDPV 884
Query: 718 ---------------SILSCSF----SDDSNVYYCVGTAYVL---PEENEPTKGRILVFI 755
S++S + S + + VGTA L P + +I
Sbjct: 885 SEKAVTHTIELKTNQSLVSAALVFFESRGEDAFLAVGTAKDLSFTPYKYSGASIQIYKIS 944
Query: 756 VEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGH 815
+ + E E +L AF GKL+A + + + LY ++ R+ Q+
Sbjct: 945 PNGREFEFFHETEVGEPPLALLAFKGKLIAGVGRHLCLYDCGMK-SVLRKAQAPNCVATR 1003
Query: 816 ILALYVQTRGDFIVVGDLMKSISLLIYKHE--EGAIEERARDYNANWMSAVEILDDDIYL 873
I ++T+G +VV D +S++ +++K + + A D SA ++LD + +
Sbjct: 1004 ITD--IKTQGSRLVVSDQAQSVTYVVHKDQVHPNRLIPFADDTVPRHTSASDMLDYETTV 1061
Query: 874 GAENNFNLFTVR---KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV-- 928
G + N++ VR K SE + + G +V + LG NR LV +D+
Sbjct: 1062 GGDKFGNIWLVRCPPKVSEASDESPDGSDLLVDKSFLGGTPNRL---DLVAHYFANDIPV 1118
Query: 929 ---------GQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQ 978
G + + + G +G + Q+ ++L+ LR+ K + G +H
Sbjct: 1119 SIQKTVLLSGGERIIFWAGLQGTLGALIPFNSRRQHKMFQQLELQLRQDDKPLSGRDHLA 1178
Query: 979 WRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTM 1018
+RS+ K V +DGDLIE FL LSR + + I+ M
Sbjct: 1179 YRSYFAPVKCV-----IDGDLIERFLVLSRDKRESIAGQM 1213
>gi|147787360|emb|CAN64633.1| hypothetical protein VITISV_043788 [Vitis vinifera]
Length = 1143
Score = 144 bits (364), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 167/753 (22%), Positives = 315/753 (41%), Gaps = 109/753 (14%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+L V I+G I +L FR G +D++ + ++ + +L+++ E + + + + + G
Sbjct: 34 ILSVEIFGAIRSLAQFRLTGSQKDYIVVGSDSGRIVILEYNKEKN-VFDKIHQETFGKSG 92
Query: 61 -RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
R GQ IDP R +IG L V+ D +L + + + +
Sbjct: 93 CRRIVPGQYLAIDPKGRAVMIGACEKQKLVYVLNRDTVARLTISSPLEAHKSHTIVYSIT 152
Query: 118 ---YGCAKPTIVVLYQDNKDA-------------RHVKTYEVALKDKDFVEGPWSQNNLD 161
G P + D +A +H+ YE+ L + V WS+ +D
Sbjct: 153 GVDCGFDNPIFAAIELDYSEADQDSTGQAASEAQKHLTFYELDL-GLNHVSRKWSEQ-VD 210
Query: 162 NGADLLIPVP---PPLCGVLIIGEETIVYCS---ANAFKAIPIRPSITKAYGR--VDADG 213
NGA++L+ VP GVL+ E ++Y + + IP R + G V A
Sbjct: 211 NGANMLVTVPGGGDGPSGVLVCAENFVIYKNQGHPDVRAVIPRRADLPAERGVLIVSAAT 270
Query: 214 SR------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSS 267
R +LL G + + + HE ++++ LKI+ + S++ L + ++ S
Sbjct: 271 HRQKSMFFFLLQTEYGDVFKVTLEHENDRISELKIKYFDTIPVTSSMCVLKSGFLFAASE 330
Query: 268 YGDSQLIKLN--------------------------LQPDAKGSYVEVLERYVNLGPIVD 301
+G+ L + QP + V + ++ +L PI+D
Sbjct: 331 FGNHGLYQFQAIGDDADVESSSASLMETEEGFQPVFFQPRGLKNLVRI-DQVESLMPIMD 389
Query: 302 FCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSLRSSTDDP 360
V +L + Q+ G S+RI+R G+ I+E A +L G+ +W+++ + +D
Sbjct: 390 MKVSNLFEEETPQIFALCGRGPRSSIRILRPGLAISEMAVSQLPGVPSAVWTVKKNVNDE 449
Query: 361 FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLV 420
FD ++VVSF + T +L++ + +EE GF T +L + L+QV +R +
Sbjct: 450 FDAYIVVSFANATLVLSIG--ETVEEVSDSGFLDTTPSLAVSLIGDDSLMQVHPSGIRHI 507
Query: 421 SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEI 480
R NEW++P ++ +N QV++A GG L+Y E
Sbjct: 508 REDGR--INEWRTPGKRTIVKVGSNRLQVVIALSGGELIYFE------------------ 547
Query: 481 SCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PD--LNLITKEHLGGEIIPRSVLLCA 537
++ G+ S+ AVG + D ++RI SL PD + +++ + + P S+L
Sbjct: 548 ----VDMTGQLMERSRFLAVGSY-DNTIRILSLDPDDCMQILSVQSVSSP--PESLLFLE 600
Query: 538 FEGIS------------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSS 585
+ +L L +G L +++M TG+L+D + LG + L +
Sbjct: 601 VQASVGGEDGADHPASLFLNAGLQNGVLFRTVVDMVTGQLSDARSRFLGLRAPKLFSVIV 660
Query: 586 KNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGT 645
+ + S RP + Y L + ++ + + F+S + + L + T
Sbjct: 661 RGRRAMLCLSSRPWLGYIHQGHFLLTPLSYETLEFAASFSSDQCAEGVVAVAGDALRVFT 720
Query: 646 IDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAI 677
I+ + + + IPL PR+ Q + + +
Sbjct: 721 IERLGETFNETVIPLRYTPRKFVLQPKRKLLVV 753
Score = 74.3 bits (181), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 93/351 (26%), Positives = 154/351 (43%), Gaps = 46/351 (13%)
Query: 694 VRLLDDQTFEFISTYPLDTFEYGCSILSCSFSD-DSNVYYCVGTAYVLP--EENEPTKGR 750
+R+LD +T L E SI + +F D + VGTA L + G
Sbjct: 822 IRILDPRTATTTCLLELQDNEAAFSICTVNFHDKEYGTLLAVGTAKSLQFWPKRSFDAGY 881
Query: 751 ILVF-IVEDGK-LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQS 808
I ++ +EDGK L+L+ + + +G +L F G+LLA I ++LY D G R L
Sbjct: 882 IHIYRFLEDGKSLELLHKTQVEGVPLALCQFQGRLLAGIGSVLRLY-----DLGKRRLLR 936
Query: 809 ECGHH---GHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVE 865
+C + I++++ T D I VGD+ +S Y+ +E + A D W++A
Sbjct: 937 KCENKLFPNTIVSIH--TYRDRIYVGDIQESFHYCKYRRDENQLYIFADDSVPRWLTASY 994
Query: 866 ILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPD 925
+D D GA+ N++ VR + +DE V E G + ++ G L
Sbjct: 995 HIDFDTMAGADKFGNIYFVRL-PQDVSDE-------VEEDPTGGKI-KWEQGKL------ 1039
Query: 926 SDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNE 985
P + V +G + + + L + ++R+ + G +H +RS
Sbjct: 1040 ---NGAPNKVEEIVQFHVGDVVTCLQKASLIPGGWRMHMRQEHPPLCGRDHMAYRS---- 1092
Query: 986 KKTVDAKNFLDGDLIESF----LDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
K+ +DGDL E F LDL R DE+ +T E+ K++EE+
Sbjct: 1093 -AYFPVKDVIDGDLCEQFPTLPLDLQRKIADELDRTPG----EILKKLEEV 1138
>gi|303285956|ref|XP_003062268.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226456679|gb|EEH53980.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 1213
Score = 144 bits (364), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 172/733 (23%), Positives = 304/733 (41%), Gaps = 80/733 (10%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
++G + +L FR G +D++ + ++ + VL +D E S + R
Sbjct: 57 VFGVVRSLSAFRLTGANRDYVVVGSDSGRIVVLSFDKEKSAFVKVHQETFGKSGCRRIVP 116
Query: 66 GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLY---GC 120
GQ +DP R +IG + V+ D L + + + V+ G
Sbjct: 117 GQFLAVDPKGRAVMIGAIEKSKIVYVLNRDTSANLTISSPLEANKSHVITFAVCALDCGL 176
Query: 121 AKPTIVVLYQDNKDA-------------RHVKTYEVALKDKDFVEGPWSQNNLDNGADLL 167
P + D DA +H+ YE+ L V WS+ +DNGA+ L
Sbjct: 177 DNPQFAAVELDYGDADQDPTGEAAAEAQKHLVFYELDLGLNHVVR-KWSEA-IDNGANHL 234
Query: 168 IPVPPPLCG---VLIIGEETIVYCS---ANAFKAIPIRPSITKAYGRVDADGSR------ 215
I VP G L+ E I+Y + A+ IP R S++ G + +
Sbjct: 235 IAVPGGGDGPGGCLVCAENFIIYKNEGHADVRAVIPRRSSLSGDRGVLIVSSATHRTKQQ 294
Query: 216 --YLLGDHAGLLHLLVI--THEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDS 271
+L G ++ + + T E V+ +KI+ +++ L ++ S + +
Sbjct: 295 FFFLAQSEYGDVYKVTVEWTPGSETVSEVKIKYFDTIPPCASLCVLKTGFLFAASEFSNH 354
Query: 272 QLIKLN-------LQPDAKGSYVEVLERYV-------------------NLGPIVDFCVV 305
L + + + VE E Y +L P++D
Sbjct: 355 ALYQFQGIGDDDDDVESSSATLVETDEGYQPVFFEPRALRNLHPIDEIESLCPVLDAQCH 414
Query: 306 DLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTDDPFDTF 364
+L + QV GA +LR++R G+ ++E A L G +++++ + D FD +
Sbjct: 415 NLTSEETPQVYALCGAGSRSTLRVLRQGVALSEMAVSPLPGNPNAVFTVKKNIADEFDAY 474
Query: 365 LVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTS 424
+VVSF++ T +L++ + +EE GF T TL + L+Q G +R V +
Sbjct: 475 IVVSFVNATLVLSIG--ETVEEVSDSGFLGTTPTLSASLLGDDSLLQAHPGGLRHVRADK 532
Query: 425 RELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTEVKHAQLEYEISCL 483
R NEW+ P ++ N QV++A GG ++Y E+ G L E++ + +++CL
Sbjct: 533 R--INEWRCPGRKTITRVACNNRQVVIALSGGEIIYFELDSTGQLMEIEKLETNGDVACL 590
Query: 484 DINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPRSVL-LCAFEGI 541
+ P+ E ++ AVG + D +VR+ SL D L T P S+L L EG
Sbjct: 591 HVGPVPEGSLRNRFLAVGSF-DSTVRVLSLSADDCLQTMGVQALAAAPCSLLMLQTREGG 649
Query: 542 SYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVI 601
YL L +G LL ++ TG+L+D + LG +P L + + + A S RP +
Sbjct: 650 LYLNVGLANGVLLRAEVDRVTGQLSDTRARFLGARPPKLHGATVRGQPAMLALSSRPWLG 709
Query: 602 YSS-NKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDI------QKLHI 654
+ ++ + ++ + + H F S P+ + L I +++ + +
Sbjct: 710 HVDLQRRFALAPLSYEALEHAADFTSDQCPEGVVAVAGSTLRIVSVERLGDAFNQTSCKL 769
Query: 655 RSIP--LGEHPRR 665
R P L HP R
Sbjct: 770 RYTPRQLTVHPDR 782
Score = 83.6 bits (205), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 81/332 (24%), Positives = 138/332 (41%), Gaps = 43/332 (12%)
Query: 724 FSDDSNVYYCVGTAYVLPEENEPTKGRI--LVFIVEDGKLQLIAEKETKGAVYSLNAFNG 781
F + V+ VGTA L G L +EDG + L+ + G ++ F G
Sbjct: 897 FPVANEVFLAVGTAVGLTFAPRDCDGGFVHLYRYLEDGTVTLVHKTPLDGVPGAMCGFKG 956
Query: 782 KLLAAINQKIQLY----KWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSI 837
+LL ++LY K +LR R + V GD I VGD+ +S
Sbjct: 957 RLLLGCGNALRLYDFGKKKLLRKVENRNFPN--------FITTVHASGDRIYVGDVQESF 1008
Query: 838 SLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR------------ 885
+ YK E+ ++ A D ++A LD D GA+ N+F R
Sbjct: 1009 HFVKYKREDLSLIIVADDVQPRHITAALPLDYDTMAGADKFGNVFVARLAQDVSADIEED 1068
Query: 886 ----KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNG 941
K S G + ++E V ++H+GE V G+L G + +++ T+ G
Sbjct: 1069 PTGGKASGGTLNGAPRKVEHVAQFHVGETVCALTKGTL-------QAGGLECMLYATLMG 1121
Query: 942 VIGVIASLPHEQYL-FLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLI 1000
+G + + F L+ ++R+ + G +H +RS K+ +DGDL
Sbjct: 1122 TVGALMPFTSRADVDFATHLEMHVRQENPPLLGRDHMAYRS-----AYFPVKDVVDGDLC 1176
Query: 1001 ESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
E ++ L I++ M+ + E+ K++E+L
Sbjct: 1177 EQYVTLPAETQRAIAEEMDRTPAEVMKKLEDL 1208
>gi|326432370|gb|EGD77940.1| splicing factor 3b subunit 3 [Salpingoeca sp. ATCC 50818]
Length = 1232
Score = 144 bits (363), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 172/740 (23%), Positives = 315/740 (42%), Gaps = 87/740 (11%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
IYG + +L+ FR G +D+L + T+ + VL++D +++ L + + + G R
Sbjct: 67 IYGVVRSLQPFRLTGGNKDYLVVGTDSGRITVLEYDTDTN-LFKKVHQETFGKSGCRRIV 125
Query: 65 NGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQV---------LDIK 115
GQ DP R + + ++ V F+ + LE + +D+
Sbjct: 126 PGQYLAADPKGRAVMIGAFEKQKLVYIFNRDAAAHLTISSPLEAHKSHTIVYDTVGVDVG 185
Query: 116 F---LYGCAKPTIVVLYQD-NKDARHVKTYEVALKDKDFVEGPWSQNN---LDNGADLLI 168
F ++ C + + +D +AR T + + D + LD A+ L+
Sbjct: 186 FDNPIFACLEVDYEEVDEDPTGEARQFLTQSLTFYELDLGLNHVVRKESIPLDEFANKLV 245
Query: 169 PVP-----PPLCGVLIIGEETIVYCSANAFKAI---------PIR----PSITKAYGRVD 210
VP P GVL+ I + + + + P+R P +T
Sbjct: 246 SVPGGSDGP--GGVLVCSPGRITWRTYGEHEPVAINLPRRDDPLRTDRAPLVTAVTMHKT 303
Query: 211 ADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGD 270
+L+ G L L + EK +V+GL I+ +A+ + L N ++++ + YG+
Sbjct: 304 KRMFFFLVQTEEGDLFKLTMVAEKGEVSGLIIKYFDTVPVANAMCLLRNGLLFVAAEYGN 363
Query: 271 SQLIK-------------LNLQPDAKGSY--------VEVLERYVNLGPIVDFCVVDLER 309
L + L++ P K Y + +++ +L P++ + DL
Sbjct: 364 HHLYQIASLGDNEDEPSYLSIDPLDKIHYFRPRDLLNLALVDDQESLHPMIACQLADLHE 423
Query: 310 QGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTDDPFDTFLVVS 368
+ Q+ G S R++R+G+ ++E A EL G +WS++ DD DT+LV+S
Sbjct: 424 EETPQLYALCGRGAKSSFRVLRHGLEVSEVAVSELPGNPSAVWSVKRHVDDESDTYLVLS 483
Query: 369 FISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELR 428
F+ +T +L + + +EE + GF + TL + L+QV G +R + R
Sbjct: 484 FVDKTLVLGIG--ETVEEVKDSGFLEEVPTLSASRIGDDSLLQVYPGGIRHIRFDQR--V 539
Query: 429 NEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD-GILTE-VKHAQLEYEISCLDIN 486
EWK+P ++ N QV++ LVY E+ G L E + ++ +++ + +
Sbjct: 540 KEWKAPGSTAITNCAVNERQVVITLSSNELVYFELDRAGQLNEYTERIEMTSKVTAMALA 599
Query: 487 PIGENPSYSQIAAVGMWTDISVRIFSLPD---LNLITKEHLGGEIIPRSVLLCAFE---- 539
P+ E+ SQ A+G+ D +VR+ SL L + + L G P S LC E
Sbjct: 600 PVAEDAFTSQFLALGL-EDNTVRVLSLDPSSCLQPLRMQALPGA--PSS--LCIIEIAGQ 654
Query: 540 ----GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAAS 595
G L L +G + ++ TG+L+D + LG + + L + V A S
Sbjct: 655 AGEPGTLQLHIGLANGVVSRSTMDKVTGDLSDSRTRYLGVREVRLFPVRAHGHPAVLALS 714
Query: 596 DRPTV--IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLH 653
RP + Y ++ + E +HM F S P+ + + L I ++ ++ +
Sbjct: 715 TRPWIAFTYQGQPRMAPLSYEALEYAHM--FCSEQLPEGIVAVAKNTLRILSLQNLGSIF 772
Query: 654 IRS-IPLGEHPRRICHQEQS 672
+S IPL PR+ E++
Sbjct: 773 NQSTIPLAYTPRKFFLDEKT 792
Score = 72.0 bits (175), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 72/306 (23%), Positives = 129/306 (42%), Gaps = 44/306 (14%)
Query: 753 VFIVEDG-----KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQ 807
+F DG KL + E + +L F G+L+A + +++Y D G ++L
Sbjct: 939 LFSAPDGSARLTKLDFVHRTEVEAMPCALTPFAGRLIAGVGNIVRIY-----DMGRKKLL 993
Query: 808 SEC-GHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEI 866
+C H + ++ G +VV D +S+ L YK E + D W +A+ +
Sbjct: 994 RKCENKHLPSRVVDIEVMGTRVVVADQRESVFFLKYKPTENVLSVFCDDTTPRWCTAMLM 1053
Query: 867 LDDDIYLGAENNFNLFTVRKNSEGATDE------------ERG-------RLEVVGEYHL 907
+D A + F +V + + TD RG +L V +++
Sbjct: 1054 VDYSTVCVA-DKFGNVSVLRCPDDVTDTLQEDPSGAKAFWARGYLNGAPQKLVQVANFYI 1112
Query: 908 GEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPH-EQYLFLEKLQTNLRK 966
GE V +L P + + T++G IG + H E F + L+ +LR+
Sbjct: 1113 GEIVQSLHKTTLT---PSG----TECIAYTTLSGSIGALMPFSHKEDAEFFQTLELHLRQ 1165
Query: 967 VIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELC 1026
+ G +H +RS V K+ +DGDL E + LS + +I+ + + +E+
Sbjct: 1166 EHPPICGRDHLAFRS-----AYVPCKSVIDGDLCEEYNMLSASLKSDIADGLERTPQEVA 1220
Query: 1027 KRVEEL 1032
K++EE
Sbjct: 1221 KKLEEF 1226
>gi|308505958|ref|XP_003115162.1| CRE-TEG-4 protein [Caenorhabditis remanei]
gi|308259344|gb|EFP03297.1| CRE-TEG-4 protein [Caenorhabditis remanei]
Length = 1013
Score = 143 bits (361), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 172/742 (23%), Positives = 312/742 (42%), Gaps = 95/742 (12%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESS---ELITRAMGDVSDRIGRP 62
I+G + +L FR +DF+ + ++ + +LQ++AE + L G R P
Sbjct: 59 IFGIVRSLLAFRLTAGTRDFIAVGSDSGRIVILQYNAEKTCFERLHQETFGKTGCRRIVP 118
Query: 63 TDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL--- 117
G DP R +IG L ++ D++ L + + + L +
Sbjct: 119 ---GHYLAGDPRGRALMIGAVERQKLVYIMNRDSEAHLTISSPLEAHKHHTLCYAMVGVD 175
Query: 118 YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNGADL 166
G PT L D +D+ + T E A + + F E N+ L++ +L
Sbjct: 176 VGFENPTFACLEFDYEDSDNDPTGEAAKRTQQTLTFYELDLGLNHVVRKYAEPLNDPGNL 235
Query: 167 LIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR------ 215
LI VP P GV++ E +VY N IR I + +D D R
Sbjct: 236 LIAVPGGNDGP--SGVIVCCENYLVY--KNLGDQPDIRCPIPRRRNELD-DADRTMLIIA 290
Query: 216 -----------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYI 264
+L+ G + + + +++ VT +K++ A+ + L ++I
Sbjct: 291 TATHKTRNMFFFLVQAENGDIFKVTLETDEDLVTEMKLKYFDTVPPANAMCILKAGFLFI 350
Query: 265 GSSYGDSQLIKLN-----------------------LQPDAKGSYVEVLERYVNLGPIVD 301
+ +G+ +L ++ +P S + + + +L P+ D
Sbjct: 351 AAEFGNHELYQIASLGEGGDDEFSSAMGFGENDAAFFEPHELRSLIPI-DSMDSLSPLTD 409
Query: 302 FCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTDDP 360
+ D+ R+ Q+ T G ++++RNG+ I+E A +L G +W+++ + +D
Sbjct: 410 AVIGDIAREDAAQLYTLIGRGSRSHMKVLRNGLEISEMAVSDLPGNPNAVWTVKKNIEDQ 469
Query: 361 FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLV 420
+D+++VVSF++ T LA+ + D +EE GF T T+ C + LVQV ++
Sbjct: 470 YDSYIVVSFVNAT--LALTIGDTVEEASDSGFLPTTPTIGCSMIGDDSLVQVIRN--QIY 525
Query: 421 SSTSRELR-----NEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTEVKHA 474
S R +R NEWK PP + N QV +A GG LVY E+ +G L E
Sbjct: 526 SEGIRHIRADKRINEWKVPPRRQIVKCAVNRRQVAVALSGGELVYFELDLNGTLNEFTER 585
Query: 475 QL-EYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPRS 532
+L +I+C+ + I E S+ A+G D +VRI SL P+ L+ P S
Sbjct: 586 KLFNADIACMTFSEISEGELNSRFLALGT-VDNAVRIISLDPNDMLMPLSTQNLPCPPES 644
Query: 533 VLLCAF-----EGIS--YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSS 585
+LL +G++ +L L +G L ++ TG + D + LGT+P+ L
Sbjct: 645 ILLIDTPNEDGKGVAAVHLNIGLQNGCLFRNTVDNVTGAIMDTRTRYLGTRPVKLFKVQC 704
Query: 586 KNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGT 645
+ + + S R ++Y ++ + ++ + + F S + + L I
Sbjct: 705 QGRSAILCTSSRSWLLYHFQRRFHLTPLSYANLEYAASFCSNQCAEGIVAISASTLRIIA 764
Query: 646 IDDIQ-KLHIRSIPLGEHPRRI 666
+ + +++S PRR+
Sbjct: 765 AEKLGVAFNVQSFEHKLTPRRV 786
>gi|357496593|ref|XP_003618585.1| Splicing factor 3B subunit [Medicago truncatula]
gi|355493600|gb|AES74803.1| Splicing factor 3B subunit [Medicago truncatula]
Length = 702
Score = 143 bits (360), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 173/699 (24%), Positives = 303/699 (43%), Gaps = 76/699 (10%)
Query: 379 NLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYS 438
++ + ++E GF T +L + L+QV +R + NEW++P +
Sbjct: 30 SVRETVKEVTDNGFLDTTPSLSVSLIGDDSLMQVHPNGIRHIREDGH--INEWRTPGKRT 87
Query: 439 VNVATANASQVLLATGGGHLVYLEIG-DGILTEVKHAQLEYEISCLDINPIGENPSYSQI 497
+ +N QV++A GG L+Y E+ G L EV+ ++ +++CLDI P+ E S+
Sbjct: 88 IAKVGSNRLQVVIALNGGELIYFEVDVTGQLMEVEKHEMSGDVACLDIAPVPEGRQRSRF 147
Query: 498 AAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPRSVLLCAFEGISYLLCAL-GDGHLLN 555
AVG + D ++RI SL PD + T LG + + + F + + G+G L
Sbjct: 148 LAVGSY-DKTIRILSLDPDDCMQT---LGIQSLSSASESLLFLEVQASVGGEDGNGVLSR 203
Query: 556 FLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNL 615
+++M TG L+D + LG +P L + + S RP + Y L + ++
Sbjct: 204 TVVDMVTGLLSDSRSPFLGLRPPKLFPIVVRGKRAMLCLSSRPWLGYIHQGHFLLTPLSY 263
Query: 616 KEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDI-QKLHIRSIPLGEHPRRICHQEQSRT 674
+ + F+S + + L I TI+ + + + IPL P + Q + +
Sbjct: 264 ETLEFAASFSSDQCVEGVVALAGEALRIFTIERLGETFNETVIPLRYTPMKFVLQPKRKL 323
Query: 675 FAIC-SLKNQSCAEESEM---------------HF-------------VRLLDDQTFEFI 705
+ S + AEE E H+ +R+LD +T
Sbjct: 324 LVVIESDQGAFTAEEREANGGEDEDKDDPLSDEHYGYPKAESDKWASCIRILDPKTGNTT 383
Query: 706 STYPLDTFEYGCSILSCSFSD-DSNVYYCVGTAYVL--PEENEPTKGRILVF-IVEDGK- 760
L E S + +F D + VGTA L T G I ++ +EDG+
Sbjct: 384 CLLELQDNEAAFSGCTVNFHDKEYGTLLDVGTAKGLQFTPRRSLTAGFIHIYRFLEDGRS 443
Query: 761 LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILA-- 818
L+L+ + + +G +L+ F G+LLA I ++ Y D G R L + + +
Sbjct: 444 LELLHKTQVEGVPLALSQFQGRLLAGIGPVLRFY-----DLGKRRLLRK--YENKLFPNT 496
Query: 819 -LYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAEN 877
+ +QT D I VGD +S Y+ +E + A D W++A +D D G E
Sbjct: 497 IVSIQTYRDRIYVGDTQESFHYCKYRWDENQLYIFADDCVPRWLTASYHIDFDTMAGIEE 556
Query: 878 NFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFG 937
+ + K +G + ++E + ++H+G+ ++ + SL+ G ++ G
Sbjct: 557 DPTGGRI-KWEQGKLNGAPNKVEEIVQFHVGDVISCLQKASLI-------PGGGECILNG 608
Query: 938 TVNGVIGVI-ASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLD 996
TV G IG + A + F L+ ++R+ + G +H +RS K+ +D
Sbjct: 609 TVMGSIGALHAFTSRDDVDFFSHLEMHMRQDNPPLCGRDHMAYRS-----AYFPVKDVID 663
Query: 997 GDLIESF----LDLSRTRMDEISKTMNVSVEELCKRVEE 1031
GDL E F +DL R DE+ +T E+ K++EE
Sbjct: 664 GDLCEQFPTLPMDLQRKIADELDRTRG----EILKKLEE 698
>gi|322797581|gb|EFZ19622.1| hypothetical protein SINV_00421 [Solenopsis invicta]
Length = 1217
Score = 143 bits (360), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 171/775 (22%), Positives = 318/775 (41%), Gaps = 96/775 (12%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+L V ++G I +L FR G +D++ + ++ + +L++ A +
Sbjct: 53 LLTVEVFGIIRSLMAFRLTGGTKDYIVVGSDSGRIVILEYIAAKNTFEKVHQETFGKSGC 112
Query: 61 RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVL---DIK 115
R GQ IDP R +IG L ++ D + +L + + + L +
Sbjct: 113 RRIVPGQYLAIDPKGRAVMIGAIEKQKLVYILNRDAEARLTISSPLEAHKSNTLVYHTVG 172
Query: 116 FLYGCAKPTIVVLYQDNKDARHVKTYEVALKDK------------DFVEGPWSQNNLDNG 163
G P L D ++A T + A+K + + V +S+ L+
Sbjct: 173 VDVGFENPMFACLEIDYEEADSDPTGDAAVKTQQTLTLYELDLGLNHVVRKYSEP-LEEH 231
Query: 164 ADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR--- 215
A+ L+ VP P GVLI E + Y N IR I + +D D R
Sbjct: 232 ANFLVSVPGGNDGP--SGVLICSENYLTY--KNLGDQHDIRCPIPRRRNDLD-DPERGMI 286
Query: 216 --------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAV 261
+L G + + + +++ VT +K++ +A+++ L
Sbjct: 287 FVCSATHKTKSMFFFLAQTEQGDIFKITLETDEDMVTEIKLKYFDTVPVAASMCVLKTGF 346
Query: 262 VYIGSSYGDSQLIKL-------------NLQPDAKGSYVEVLERYV----------NLGP 298
+++ S +G+ L ++ + P +G R + +L P
Sbjct: 347 LFVASEFGNHYLYQIAHLGDDDDEPEFSSAMPLEEGDTFFFAPRPLRNLVLVDEMDSLSP 406
Query: 299 IVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSST 357
I+ V DL + Q+ G +LR++R+G+ ++E A EL G +W+++
Sbjct: 407 IMACQVADLANEDTPQLYIACGRGPRSTLRVLRHGLEVSEMAVSELPGNPNAVWTVKRRI 466
Query: 358 DDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSV 417
D+ +D +++VSF++ T +L++ + +EE GF T TL C + LVQV +
Sbjct: 467 DEEYDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSALGEDALVQVYPDGI 524
Query: 418 RLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD-GILTE-VKHAQ 475
R + + R NEWK+P ++ N QV++A GG LVY E+ G L E + +
Sbjct: 525 RHIRADKR--VNEWKAPGKKTIVKCAVNQRQVVIALTGGELVYFEMDPTGQLNEYTERKK 582
Query: 476 LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSV-L 534
+ E+ C+ + + S AVG+ D +VRI SL + + + + +P +
Sbjct: 583 MPSEVMCMALGNVAVGEQRSWFLAVGL-QDNTVRIISLDPSDCLAPRSM--QALPAAAES 639
Query: 535 LCAFEGIS----------------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPI 578
LC E + YL L +G LL +L+ +G+L D + LG++P+
Sbjct: 640 LCIVEMGAKEADNSEDAAPQQSSLYLNIGLQNGVLLRTVLDPISGDLADTRTRYLGSRPV 699
Query: 579 TLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKE 638
L + V A S R + Y + + ++ + + F+S P+ +
Sbjct: 700 KLFRIRMQGNQAVLAMSSRSWLSYYHQNRFHLTPLSYESLEFASGFSSEQCPEGIVAIST 759
Query: 639 GELTIGTIDDIQKLHIR-SIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMH 692
L I ++ + + + S PL PR+ S + ++ + EE++
Sbjct: 760 NTLRILALEKLGAVFNQVSFPLEYTPRKFAIHADSAHLVVIETEHNAYTEETKQQ 814
Score = 79.3 bits (194), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 66/276 (23%), Positives = 128/276 (46%), Gaps = 41/276 (14%)
Query: 779 FNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHI--LALYVQTRGDFIVVGDLMKS 836
+ G++L + + ++LY D G ++L +C + HI + + G I V D+ +S
Sbjct: 955 YQGRVLVGVGRMLRLY-----DMGKKKLLRKC-ENKHIPNAVVSINAIGQRIYVSDVQES 1008
Query: 837 ISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDE-- 894
+ + YK +E + A D + W++ +LD D A+ N+ +R + G D+
Sbjct: 1009 VYAVRYKRQENQLIVFADDTHPRWITTTCVLDYDTVATADKFGNIAVIRL-ATGINDDVD 1067
Query: 895 ----------ERGRL-------EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFG 937
+RG L + V +H+GE V + +L+ G ++++
Sbjct: 1068 EDPTGNKALWDRGLLNGASQKADTVACFHVGETVMSLQKATLI-------PGGSESLVYT 1120
Query: 938 TVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLD 996
T++G +GV+ HE + F + L+ ++R + G +H +RS+ KN +D
Sbjct: 1121 TLSGTVGVLVPFTSHEDHDFFQHLEMHMRSEHPPLCGRDHLSFRSY-----YYPVKNVID 1175
Query: 997 GDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
GDL E F + T+ IS + + E+ K++E++
Sbjct: 1176 GDLCEQFNSIEPTKQKSISGDLERTPSEVSKKLEDI 1211
>gi|116191283|ref|XP_001221454.1| hypothetical protein CHGG_05359 [Chaetomium globosum CBS 148.51]
gi|88181272|gb|EAQ88740.1| hypothetical protein CHGG_05359 [Chaetomium globosum CBS 148.51]
Length = 979
Score = 143 bits (360), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 190/850 (22%), Positives = 354/850 (41%), Gaps = 126/850 (14%)
Query: 236 KVTGLKIELLGETSIASTISYLDNAVVYIGSSY-----------GDSQL----------I 274
+V +K+ +AS++ L + + + S + GD Q+ +
Sbjct: 112 EVNCIKLRYFDTVPVASSLCILKSGFLLVASEFENHHLYQFEQLGDDQVEFNSNRDIAPV 171
Query: 275 KLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQGQG--QVVTCSGAYKDGSLRIVRN 332
+ QP S VE +E ++ PI+ VV+L R+G Q+ + G+ RI+R+
Sbjct: 172 FFDPQPSGNLSLVETIE---SMNPILASGVVNL-REGDDAPQIYSACGSGTRSHFRILRH 227
Query: 333 GIGINEQASVELQG-IKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEG 391
G+ +NE + EL G + +W+ + ++ D DT+++++ S T L +++ +E+++ G
Sbjct: 228 GLEVNEIVASELPGTVSAVWTTKLTSQDKHDTYIILT--SSTDTLVLSIGEEVKQVSDSG 285
Query: 392 FCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLL 451
F + TL + L QV +R + S+ + + EW +P ++ N QV +
Sbjct: 286 FLTSVSTLAIQQIGDDSLAQVHPKGIRHIRSSDQII--EWPAPQHRTIVATATNQRQVAV 343
Query: 452 ATGGGHLVYLEI--GDGILTEVK-HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISV 508
A G ++Y E+ DG L E ++ ++CL + + E S AVG D ++
Sbjct: 344 ALSSGEIIYFELDDADGSLAEYDGREEMSGTVTCLSLGQVPEGRLRSSFLAVGC-DDRTI 402
Query: 509 RIFSL-PDLNLITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTD 567
RI SL P+ L ++ + + ++ AL + + LL T+
Sbjct: 403 RILSLDPESTLESR---------------SVQALTAAPSALAIIPMDDSLLGGSNSSDTN 447
Query: 568 RKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSS-NKKLLYSNVNLKEVSHMCPFNS 626
K LG +P+ L + K V A S + + + + L+ + E + + P
Sbjct: 448 YK--FLGVRPVRLFPVTVKQKPRVLALSSKAWLGHVDLARGFLFYTIE-SENNTLAPELR 504
Query: 627 AAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQ--S 684
A + GT + P GE P + F + + S
Sbjct: 505 AQL-----------VAAGT----------TAPGGEKPDHVAEILPPEQFGYPRGQGRWAS 543
Query: 685 CAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFS---DDSNVYYCVGTAYVLP 741
C + ++D I T L E S+ SF+ D+S + G VL
Sbjct: 544 C--------ISIVDPIGETVIRTIHLQENEAAASLAIASFTSQDDESFLVVSTGRDMVL- 594
Query: 742 EENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLR 799
+ + G V+ +DG+ L+LI + T +L AF G+L+A I + + +Y L+
Sbjct: 595 NPRQLSGGYSYVYRFHDDGRDLELIHKTGTTEPPTALEAFRGRLVAGIGKTLVVYDLGLK 654
Query: 800 DDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNAN 859
R+ Q+ G I++L QT+G+ IVVGD+ ++++ Y+ E + D A
Sbjct: 655 -QMLRKTQANDVVPGLIVSL--QTQGNRIVVGDVQHGVAMVAYRTESNQLIPFVDDTIAR 711
Query: 860 WMSAVEILDDDIYLGAENNFNLFTVRKNSEGATD-EERG----------------RLEVV 902
W + ++D D G + N + VR + + + +E G RL++
Sbjct: 712 WTTCTTMVDYDSVAGGDKFGNFWIVRTPQQASLEADEPGAHRLLHAREHLHGAPHRLQLT 771
Query: 903 GEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGV-IASLPHEQYLFLEKLQ 961
+H + LV VG +++ G +GV + + E F L+
Sbjct: 772 AHFHTQDIPTGITKTHLV-------VGGQEVLVWSGFQGTVGVFVPFVTREDADFFLALE 824
Query: 962 TNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
++R + G +H +R + AK +DGDL E + L + I+ ++ S
Sbjct: 825 QHMRGEEPSLIGRDHLAYRGYYEP-----AKGVVDGDLCERYQLLPGDKKQRIAAELDRS 879
Query: 1022 VEELCKRVEE 1031
V E+ +++ +
Sbjct: 880 VREVERKISK 889
>gi|383847297|ref|XP_003699291.1| PREDICTED: splicing factor 3B subunit 3-like [Megachile rotundata]
Length = 1217
Score = 143 bits (360), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 172/776 (22%), Positives = 321/776 (41%), Gaps = 98/776 (12%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+L V ++G I +L FR G +D++ + ++ + +L++ + + + + + G
Sbjct: 53 LLTVEVFGIIRSLMAFRLTGGTKDYIVVGSDSGRIVILEY-IPAKNIFEKVHQETFGKSG 111
Query: 61 -RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVL---DI 114
R GQ IDP R +IG L ++ D + +L + + + L +
Sbjct: 112 CRRIVPGQYLAIDPKGRAVMIGAIEKQKLVYILNRDPEARLTISSPLEAHKSNTLVYHTV 171
Query: 115 KFLYGCAKPTIVVLYQDNKDARHVKTYEVALKDK------------DFVEGPWSQNNLDN 162
G P L D ++A T + A+K + + V +S+ L+
Sbjct: 172 GVDVGFENPMFACLEIDYEEADSDPTGDAAVKTQQTLTLYELDLGLNHVVRKYSEP-LEE 230
Query: 163 GADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR-- 215
A+ L+ VP P GVLI E + Y N IR I + +D D R
Sbjct: 231 HANFLVSVPGGNDGP--SGVLICSENYLTY--KNLGDQHDIRCPIPRRRNDLD-DPERGM 285
Query: 216 ---------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNA 260
+L G + + + +++ VT +K++ +A+++ L
Sbjct: 286 IFVCSATHKTKSMFFFLAQTEQGDIFKITLETDEDMVTEIKLKYFDTVPVAASMCVLKTG 345
Query: 261 VVYIGSSYGDSQLIKL-------------NLQPDAKGSYVEVLERYV----------NLG 297
+++ S +G+ L ++ + P +G R + +L
Sbjct: 346 FLFVASEFGNHYLYQIAHLGDDDDEPEFSSAMPLEEGDTFFFAPRPLRNLVLVDEMDSLS 405
Query: 298 PIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSS 356
PI+ V DL + Q+ G +LR++R+G+ ++E A EL G +W+++
Sbjct: 406 PIMACQVADLANEDTPQLYITCGRGPRSTLRVLRHGLEVSEMAVSELPGNPNAVWTVKRR 465
Query: 357 TDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGS 416
D+ +D +++VSF++ T +L++ + +EE GF T TL C + LVQV
Sbjct: 466 VDEEYDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSALGEDALVQVYPDG 523
Query: 417 VRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD-GILTE-VKHA 474
+R + + R NEWK+P ++ N QV++A GG LVY E+ G L E +
Sbjct: 524 IRHIRADKR--VNEWKAPGKKTIVKCAVNQRQVVIALTGGELVYFEMDPTGQLNEYTERK 581
Query: 475 QLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSV- 533
++ E+ C+ + + S AVG+ D +VRI SL + + + + +P +
Sbjct: 582 KMPSEVMCMALGNVAIGEQRSWFLAVGL-QDNTVRIISLDPSDCLAPRSM--QALPAAAE 638
Query: 534 LLCAFE----------------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQP 577
LC E YL L +G LL +L+ +G+L D + LG++P
Sbjct: 639 SLCIVEMGAKDANNSEELSSQQSSLYLNIGLQNGVLLRTVLDPISGDLADTRTRYLGSRP 698
Query: 578 ITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAK 637
+ L + V A S R + Y + + ++ + + F+S P+ +
Sbjct: 699 VKLFRIKMQGNQAVLAMSSRSWLSYYYQNRFHLTPLSYESLEFASGFSSEQCPEGIVAIS 758
Query: 638 EGELTIGTIDDIQKLHIR-SIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMH 692
L I ++ + + + S PL PR+ S I ++ + EE++
Sbjct: 759 TNTLRILALEKLGAVFNQISFPLEYTPRKFAIHSDSAHLIIIETEHNAYTEETKQQ 814
Score = 77.8 bits (190), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 67/294 (22%), Positives = 134/294 (45%), Gaps = 41/294 (13%)
Query: 761 LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHI--LA 818
L+L+ + ++ + G++L + + ++LY D G ++L +C + HI
Sbjct: 937 LELVHKTTLDEVPLAICPYQGRVLVGVGRMLRLY-----DMGKKKLLRKC-ENKHIPNAI 990
Query: 819 LYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENN 878
+ + G I V D+ +S+ + YK +E + A D + W++ +LD D A+
Sbjct: 991 VSINAIGQRIYVSDVQESVYAVRYKRQENQLIVFADDTHPRWITTTCVLDYDTVATADKF 1050
Query: 879 FNLFTVRKNSEGATDE------------ERGRL-------EVVGEYHLGEFVNRFRHGSL 919
N+ +R S G D+ +RG L + V +H+GE V + +L
Sbjct: 1051 GNIAVIRLAS-GINDDVDEDPTGNKALWDRGLLNGASQKADTVACFHVGETVMSLQKATL 1109
Query: 920 VMRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQ 978
+ G ++++ T++G +GV+ HE + F + L+ ++R + G +H
Sbjct: 1110 I-------PGGSESLVYTTLSGTVGVLVPFTSHEDHDFFQHLEMHMRSEHPPLCGRDHLS 1162
Query: 979 WRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
+RS+ KN +DGDL E F + + I+ + + E+ K++E++
Sbjct: 1163 FRSY-----YYPVKNVIDGDLCEQFNSIEPAKQKSIAGDLERTPSEVSKKLEDI 1211
>gi|414883930|tpg|DAA59944.1| TPA: hypothetical protein ZEAMMB73_987949 [Zea mays]
Length = 1355
Score = 143 bits (360), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 149/618 (24%), Positives = 253/618 (40%), Gaps = 91/618 (14%)
Query: 214 SRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQL 273
+R L G H+L T + E E + T + ++ N ++ GD +
Sbjct: 403 ARLLFCLDDGEFHILEFTLDVEGAKLYTFEYIDRTLPCRPLFWMKNRMIIGFVEMGDGMI 462
Query: 274 IKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNG 333
KL + S ++ N+ PI+D + D + Q Q+ C G +GSLR++RNG
Sbjct: 463 FKLGHRRLLHKSTIQ------NVAPILDLAIADYHGEKQDQMFACCGMSPEGSLRVLRNG 516
Query: 334 IGINEQASVE--LQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEG 391
+ ++ E QG+ G+W+LR D + +FLV+SF+ ETRIL++ L + ++ G
Sbjct: 517 VNVDRLLKTEAIYQGVTGLWTLRMKATDAYHSFLVLSFVEETRILSVGLSFN-DISDAVG 575
Query: 392 FCSQTQTLFCHDAIYNQLVQVTSGSVRLVSST----------SRELRNEWKSPPGYSVNV 441
F TL C N LVQ+ S V++ T + + +W P +++V
Sbjct: 576 FQPDVCTLACGLVADNLLVQIYSKGVKVCLPTVYAHPEGAPLTSPICTDWY--PAITISV 633
Query: 442 ATANASQVLLATGGGHLVYLEIGDGI---------LTEVKHAQLEYEISCLDI---NPIG 489
+ V++AT +Y+ G+ L H QL+YE+SC+ I + I
Sbjct: 634 GAVGHNIVVVATSNPCCLYVL---GVRSSSSYQYELYATHHVQLQYEVSCISIPQEDCIH 690
Query: 490 ENPSYS------------------QIAAVGMWTDISVRIFSLPD---LNLIT------KE 522
+N S+S + A +G SV I SL L ++T
Sbjct: 691 DNVSFSCGEGDDICKNPPPKVNVCKFAVIGTHRP-SVEIISLEPGEALRVLTIGTVSVNN 749
Query: 523 HLGGEI---IPRSVLLCAFEGISYLLCALGDGHLLNFLLNMK---TGELTDRKKVS---- 572
LG + IP +V A E Y+L L +G LL F + G L +
Sbjct: 750 ALGAPMSGCIPENVRFVAAERF-YILAGLRNGMLLRFESETRDYLPGFLYKDSSIPSVNT 808
Query: 573 ---------LGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCP 623
+G P+ L + SDRP +++++ L YS+++ SH+ P
Sbjct: 809 FLQLISIRRIGITPVLLVPIHDSANADIIVLSDRPWLLHAARHSLAYSSISFLSASHVTP 868
Query: 624 FNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQ 683
+S P+ L E L + + ++L+ + +G PR++ + +SRT +
Sbjct: 869 VSSVDCPNGLLFVAESCLHLVELVHGKRLNAQKFSIGGTPRKVLYHNESRTLLVLRTGLN 928
Query: 684 SCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDS----NVYYCVGTAYV 739
+ S+ V +D Q +S Y + E + +D G +
Sbjct: 929 GASSSSD---VVQVDPQNGVLLSRYKCEPGETAKCMQIAKIGNDQVLIVGTTKSAGRPMM 985
Query: 740 LPEENEPTKGRILVFIVE 757
E E KGR++V +E
Sbjct: 986 SNGEAESIKGRLIVLSLE 1003
>gi|125580741|gb|EAZ21672.1| hypothetical protein OsJ_05303 [Oryza sativa Japonica Group]
Length = 1224
Score = 142 bits (359), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 179/763 (23%), Positives = 309/763 (40%), Gaps = 85/763 (11%)
Query: 5 PIYGRIATLEL--FRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRP 62
P GR+ TL FR G +D+L + ++ + +L++ + + L R
Sbjct: 60 PETGRLRTLLSVDFRLTGATKDYLVVGSDSGRLVILEYSPDRNRLDKVHQETFGKSGCRR 119
Query: 63 TDNGQIGIIDPDCRLIGLHLYDG--LFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLY-- 118
GQ+ +DP R + + + L V+ D +L + + + L
Sbjct: 120 IVPGQLLAVDPKGRALCIAALEKQKLVYVLNRDAAARLTISSPLEAHKSNTLTFSLTALD 179
Query: 119 -GCAKPTIVVL---YQDNKDARHVKTYEVALKDKDFVEGPWSQNN--------LDNGADL 166
G P + Y ++ + E A K F E N+ +DNGA+L
Sbjct: 180 CGFDNPVFAAIELEYAESDRDPTGQAAEQAQKHLTFYELDLGLNHVSRKASEPIDNGANL 239
Query: 167 LIPVP-----PPLCGVLIIGEETIVYCS---ANAFKAIPIRPSITKAYGR--VDADGSR- 215
L+ VP P G+L+ + ++Y + IP R + G V A R
Sbjct: 240 LVTVPGGGDGP--SGLLVCCDNFVLYRNQGHPEVRAVIPRRADLPAERGVLIVAAATHRQ 297
Query: 216 -----YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGD 270
+LL G + + + H + VT L+I+ + S I L + ++ S +G+
Sbjct: 298 KSLFFFLLQTEYGDIFKVDLEHSNDTVTELRIKYFDTIPVTSAICVLRSGFLFAASEFGN 357
Query: 271 SQLIKLN--------------------------LQPDAKGSYVEVLERYVNLGPIVDFCV 304
L + QP A + + E +L PI+D V
Sbjct: 358 HALYQFRDIGRDVDVESSSATLMETDEGFQPVFFQPRALKNLYRIDE-IESLMPIMDMRV 416
Query: 305 VDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIK-GMWSLRSSTDDPFDT 363
+L + QV T G +LRI+R G+ I+E A L +W+++ + +D FD
Sbjct: 417 ANLFDEETPQVFTACGRGPRSTLRILRPGLAISEMARSMLPAEPIAVWTVKKNINDMFDA 476
Query: 364 FLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSST 423
++VVSF + T +L++ + +EE F T +L + L+QV +R +
Sbjct: 477 YIVVSFANVTLVLSIG--ETIEEVSDSQFLDTTHSLAVSLLGEDSLMQVHPNGIRHIRED 534
Query: 424 SRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTEVKHAQLEYEISC 482
R NEW++P ++ +N QV++A GG L+Y E+ G L EV+ + +++C
Sbjct: 535 GRV--NEWRTPGKKTITKVGSNRLQVVIALSGGELIYFEMDMTGQLMEVEKQDMSGDVAC 592
Query: 483 LDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPRSVLLCAFEGI 541
L I P+ E S+ AVG + D ++RI S+ PD L P S++ +
Sbjct: 593 LAIAPVPEGRQRSRFLAVGSF-DNTIRILSVDPDDCLQPLSVQSVSSAPESLMFLEVQAS 651
Query: 542 S------------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTT 589
+L L +G L ++M TG+L+D + LG +P L +
Sbjct: 652 VGGEDGADHPANLFLNAGLQNGVLFRTNVDMVTGQLSDTRSRFLGLRPPKLFPCIVSHRQ 711
Query: 590 HVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDI 649
+ S RP + Y L + ++ + F+S + + L I TI+ +
Sbjct: 712 AMLCLSSRPWLGYIHQGHFLLTPLSCDTLESAASFSSDQCSEGVVAVAGDALRIFTIEHL 771
Query: 650 -QKLHIRSIPLGEHPRRICHQEQSRTFAIC-SLKNQSCAEESE 690
+ + +IPL PR+ + + A+ S K AEE E
Sbjct: 772 GETFNETAIPLRYTPRKFVILPKKKYLAVIESDKGALSAEERE 814
Score = 93.2 bits (230), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 91/365 (24%), Positives = 166/365 (45%), Gaps = 44/365 (12%)
Query: 694 VRLLDDQTFEFISTYPLDTFEYGCSILSCSFSD-DSNVYYCVGTAYVL---PEENEPTKG 749
+R+LD ++ + L E SI + +F D + VGTA L P+ N + G
Sbjct: 873 IRILDPKSRDTTCLLELQDNEAAVSICTVNFHDKEHGTLLAVGTAKGLQFWPKRNL-SAG 931
Query: 750 RILVF-IVEDGK-LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQ 807
I ++ V++G+ L+L+ + + + +L F G+LLA + ++LY D G R+L
Sbjct: 932 FIHIYKFVDEGRSLELLHKTQVEEVPLALCQFQGRLLAGVGSVLRLY-----DLGKRKLL 986
Query: 808 SECGHHGHILALY-VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEI 866
+C + + + T D I VGD+ +S Y+ +E + A D W++A
Sbjct: 987 RKCENKLFPRTIVSIHTYRDRIYVGDMQESFHYCKYRRDENQLYIFADDSVPRWLTAANH 1046
Query: 867 LDDDIYLGAENNFNLFTVR------------------KNSEGATDEERGRLEVVGEYHLG 908
+D D GA+ N++ R K +G + ++E + ++H+G
Sbjct: 1047 IDFDTMAGADKFGNIYFARLPQDLSDEIEEDPTGGKIKWEQGKLNGAPNKVEEIVQFHVG 1106
Query: 909 EFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIG-VIASLPHEQYLFLEKLQTNLRKV 967
+ V + SL+ G +I+GTV G +G ++A E F L+ +LR+
Sbjct: 1107 DVVTCLQKASLI-------PGGGECLIYGTVMGSVGALLAFTSREDVDFFSHLEMHLRQE 1159
Query: 968 IKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCK 1027
+ G +H +RS K+ +DGDL E F L +I+ ++ + E+ K
Sbjct: 1160 HPPLCGRDHMAYRS-----AYFPVKDVIDGDLCEQFPSLPADMQRKIADELDRTPGEILK 1214
Query: 1028 RVEEL 1032
++E++
Sbjct: 1215 KLEDI 1219
>gi|332026090|gb|EGI66238.1| Splicing factor 3B subunit 3 [Acromyrmex echinatior]
Length = 1217
Score = 142 bits (359), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 172/783 (21%), Positives = 325/783 (41%), Gaps = 98/783 (12%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+L V ++G I +L FR G +D++ + ++ + +L++ + + + + + G
Sbjct: 53 LLTVEVFGIIRSLMAFRLTGGTKDYIVVGSDSGRIVILEY-ISAKNIFEKVHQETFGKSG 111
Query: 61 -RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVL---DI 114
R GQ IDP R +IG L ++ D + +L + + + L +
Sbjct: 112 CRRIVPGQYLAIDPKGRAVMIGAIEKQKLVYILNRDAEARLTISSPLEAHKSNTLVYHTV 171
Query: 115 KFLYGCAKPTIVVLYQDNKDARHVKTYEVALKDK------------DFVEGPWSQNNLDN 162
G P L D ++A T + A+K + + V +S+ L+
Sbjct: 172 GVDVGFENPMFACLEIDYEEADSDPTGDAAVKTQQTLTLYELDLGLNHVVRKYSEP-LEE 230
Query: 163 GADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR-- 215
A+ L+ VP P GVLI E + Y N IR I + +D D R
Sbjct: 231 HANFLVSVPGGNDGP--SGVLICSENYLTY--KNLGDQHDIRCPIPRRRNDLD-DPERGM 285
Query: 216 ---------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNA 260
+L G + + + +++ VT +K++ +A+++ L
Sbjct: 286 IFVCSATHKTKSMFFFLAQTEQGDIFKITLETDEDMVTEIKLKYFDTVPVAASMCVLKTG 345
Query: 261 VVYIGSSYGDSQLIKL-------------NLQPDAKGSYVEVLERYV----------NLG 297
+++ S +G+ L ++ + P +G R + +L
Sbjct: 346 FLFVASEFGNHYLYQIAHLGDDDDEPEFSSAMPLEEGDTFFFAPRPLRNLVLVDEMDSLS 405
Query: 298 PIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSS 356
PI+ V DL + Q+ G +LR++R+G+ ++E A EL G +W+++
Sbjct: 406 PIMACQVADLANEDTPQLYIACGRGPRSTLRVLRHGLEVSEMAVSELPGNPNAVWTVKRR 465
Query: 357 TDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGS 416
D+ +D +++VSF++ T +L++ + +EE GF T TL C + LVQV
Sbjct: 466 IDEEYDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSALGEDALVQVYPDG 523
Query: 417 VRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD-GILTE-VKHA 474
+R + + R NEWK+P ++ N QV++A GG LVY E+ G L E +
Sbjct: 524 IRHIRADKR--VNEWKAPGKKTIVKCAVNQRQVVIALTGGELVYFEMDPTGQLNEYTERK 581
Query: 475 QLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSV- 533
++ E+ C+ + + S AVG+ D +VRI SL + + + + +P +
Sbjct: 582 KMPSEVMCMALGNVAVGEQRSWFLAVGL-QDNTVRIISLDPSDCLAPRSM--QALPAAAE 638
Query: 534 LLCAFEGIS----------------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQP 577
LC E + YL L +G LL +L+ +G+L D + LG++P
Sbjct: 639 SLCIVEMGAKDADNSEDAAPQQSSLYLNIGLQNGVLLRTVLDPISGDLADTRTRYLGSRP 698
Query: 578 ITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAK 637
+ L + V A S R + Y + + ++ + + F+S P+ +
Sbjct: 699 VKLFRIRMQGNQAVLAMSSRSWLSYYHQNRFHLTPLSYESLEFASGFSSEQCPEGIVAIS 758
Query: 638 EGELTIGTIDDIQKLHIR-SIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRL 696
L I ++ + + + S PL PR+ S + ++ + EE++ +
Sbjct: 759 TNTLRILALEKLGAVFNQVSFPLEYTPRKFAIHTDSAHLVVIETEHNAYTEETKQQRRQQ 818
Query: 697 LDD 699
+ D
Sbjct: 819 MAD 821
Score = 79.7 bits (195), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 66/276 (23%), Positives = 128/276 (46%), Gaps = 41/276 (14%)
Query: 779 FNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHI--LALYVQTRGDFIVVGDLMKS 836
+ G++L + + ++LY D G ++L +C + HI + + G I V D+ +S
Sbjct: 955 YQGRVLVGVGRMLRLY-----DMGKKKLLRKC-ENKHIPNAVVSINAIGQRIYVSDVQES 1008
Query: 837 ISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDE-- 894
+ + YK +E + A D + W++ +LD D A+ N+ +R + G D+
Sbjct: 1009 VYAVRYKRQENQLIVFADDTHPRWITTTCVLDYDTVATADKFGNIAVIRL-ATGINDDVD 1067
Query: 895 ----------ERGRL-------EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFG 937
+RG L + V +H+GE V + +L+ G ++++
Sbjct: 1068 EDPTGNKALWDRGLLNGASQKADTVACFHVGETVMSLQKATLI-------PGGSESLVYT 1120
Query: 938 TVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLD 996
T++G +GV+ HE + F + L+ ++R + G +H +RS+ KN +D
Sbjct: 1121 TLSGTVGVLVPFTSHEDHDFFQHLEMHMRSEHPPLCGRDHLSFRSY-----YYPVKNVID 1175
Query: 997 GDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
GDL E F + T+ IS + + E+ K++E++
Sbjct: 1176 GDLCEQFNSIEPTKQKSISGDLERTPSEVSKKLEDI 1211
>gi|328717412|ref|XP_003246201.1| PREDICTED: splicing factor 3B subunit 3-like [Acyrthosiphon pisum]
Length = 1218
Score = 142 bits (359), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 177/778 (22%), Positives = 320/778 (41%), Gaps = 105/778 (13%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+L V ++G I L FR G +D++ + ++ + +L++ S ++ + + + G
Sbjct: 53 LLTVEVFGIIRALMSFRQSGGTKDYIVVGSDSGRIVILEY-LPSKNVLDKVHQETFGKSG 111
Query: 61 -RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQL-------KEAFNIRLEELQ 110
R GQ IDP R +IG L + D + +L N + +
Sbjct: 112 CRRIVPGQYLAIDPKGRAIMIGAVEKQKLVYIPNRDAEARLTISSPLEANKSNTLVYHMV 171
Query: 111 VLDIKFLYGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN-------- 159
+D+ F P L D ++ T E + F E + N+
Sbjct: 172 GIDVAF----DNPIFACLEIDYEEVDADPTGEAVQTTRQTLTFYEVDFGLNHVVRKYSEP 227
Query: 160 LDNGADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGS 214
L+ A+ LI VP P GVLI E I Y N + IR I + +D D
Sbjct: 228 LEEHANSLITVPGDKDGP--GGVLICSENYITY--KNQGEQSDIRCPIPRRRNDLD-DPE 282
Query: 215 R-----------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYL 257
R +L+ G + + + +E VT ++++ +AS + L
Sbjct: 283 RGMILVCSATHKTKSMFFFLVQTEQGDVFKVTLETNEEFVTEIRLKYFDTVPVASAMCVL 342
Query: 258 DNAVVYIGSSYGDSQLIKL-------------NLQPDAKGSYVEVLERYV---------- 294
+++ S +G+ L ++ + P +G R +
Sbjct: 343 KTGFLFVASEFGNHYLYQIANLGDDDDEPEFSSAMPLEEGDTFFFAPRQLRNLVLVDEMD 402
Query: 295 NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSL 353
+L PI+ V DL + Q+ G +LR++R+G+ ++E A EL G +W++
Sbjct: 403 SLSPIMACQVADLAAEDTPQLYMACGRGSRSTLRVLRHGLEVSEMAVSELPGSPNAVWTV 462
Query: 354 RSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVT 413
+ D+ FD +++VSF + T +L++ + +EE GF T TL C + +VQ+
Sbjct: 463 KRRADENFDAYIIVSFSNATLVLSIG--ETVEEVSDSGFLGTTPTLSCSPLGDDAVVQIY 520
Query: 414 SGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD-GILTEVK 472
VR + S R ++WK+P + AN QV++A GGG LVY E+ G L E K
Sbjct: 521 PNGVRHIRSDKR--MHDWKAPEKKKIVKCAANQRQVVIALGGGELVYFEMDPTGHLNEHK 578
Query: 473 -HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGG-EIIP 530
++ ++ C+ + S+ AVG+ TD +VRI SL + + + + +P
Sbjct: 579 DRKEMNSDVLCMALANAPSGEQMSRFLAVGL-TDETVRIISLDTTDCLVQLKMQAIPAMP 637
Query: 531 RSVLLCAFE-----------------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSL 573
S LC E + YL L +G LL +L+ TGE+ D + L
Sbjct: 638 ES--LCIVEMGASDGGSSDEPAMNSLSMLYLNIGLQNGVLLRTVLDGVTGEMADTRARYL 695
Query: 574 GTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSL 633
G +P+ L ++ V A S R + Y + + ++ + + + F+S P+ +
Sbjct: 696 GGKPVKLFKIRTRGNEAVLAMSSRSWLSYYYQNRFHLTPLSYESLEYASGFSSEQCPEGI 755
Query: 634 AIAKEGELTIGTIDDIQKLHIR-SIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESE 690
L I ++ + + + S P+ PR+ S + ++ + E+++
Sbjct: 756 VAISGNTLRILALEKLGAVFNQVSFPVEYTPRKFVIHPDSAHLIVVETEHNAYTEQTK 813
Score = 81.6 bits (200), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 69/293 (23%), Positives = 135/293 (46%), Gaps = 39/293 (13%)
Query: 761 LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHG-HILAL 819
L+L+ + ++ F G+++ I + ++LY D G ++L +C + +L +
Sbjct: 938 LELVHKTTVDNVPTAICGFQGRVIIGIGRILRLY-----DIGKKKLLRKCENKQIPLLIM 992
Query: 820 YVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNF 879
++ G I V D+ +S+ ++ YK E + A D +++A+EILD + A+
Sbjct: 993 GIRVMGCRIYVSDVQESVYMVRYKRNENQLIIFADDTQPRYITAMEILDYNTVATADKCG 1052
Query: 880 NLFTVRKNSEGATDE------------ERGRL-------EVVGEYHLGEFVNRFRHGSLV 920
N+ VR S +DE +RG L + + +H+GE + +L+
Sbjct: 1053 NISVVRLAS-SISDEVDDDPTGNKSLWDRGLLNGASQKADFIVNFHIGEICTSIQKATLI 1111
Query: 921 MRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQW 979
G ++++ TV G IGV+ HE+ F + L+ ++R + G +H +
Sbjct: 1112 -------PGGSESLVYATVTGTIGVLVPFTAHEEQDFFQHLEMHMRSENPPLCGRDHLSY 1164
Query: 980 RSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
RS+ KN DGDL E + + + I+ ++ + E+ KR+E++
Sbjct: 1165 RSY-----YFPVKNVCDGDLCEQYNSIDIAKQKSIAADLDKTPAEVSKRLEDI 1212
>gi|340721347|ref|XP_003399083.1| PREDICTED: splicing factor 3B subunit 3-like [Bombus terrestris]
gi|350406701|ref|XP_003487854.1| PREDICTED: splicing factor 3B subunit 3-like [Bombus impatiens]
Length = 1217
Score = 142 bits (357), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 170/774 (21%), Positives = 320/774 (41%), Gaps = 94/774 (12%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+L V ++G I +L FR G +D++ + ++ + +L++ + + + + + G
Sbjct: 53 LLTVEVFGIIRSLMAFRLTGGTKDYIVVGSDSGRIVILEY-IPAKNVFEKVHQETFGKSG 111
Query: 61 -RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVL---DI 114
R GQ IDP R +IG L ++ D + +L + + + L +
Sbjct: 112 CRRIVPGQYLAIDPKGRAVMIGAIEKQKLVYILNRDPEARLTISSPLEAHKSNTLVYHTV 171
Query: 115 KFLYGCAKPTIVVLYQDNKDARHVKTYEVALKDK------------DFVEGPWSQNNLDN 162
G P L D ++A + T + A+K + + V +S+ L+
Sbjct: 172 GVDVGFENPMFACLEIDYEEADNDPTGDAAVKTQQTLTLYELDLGLNHVVRKYSEP-LEE 230
Query: 163 GADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR-- 215
A+ L+ VP P GVLI E + Y N IR I + +D D R
Sbjct: 231 HANFLVSVPGGNDGP--SGVLICSENYLTY--KNLGDQHDIRCPIPRRRNDLD-DPERGM 285
Query: 216 ---------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNA 260
+L G + + + +++ VT +K++ +A+++ L
Sbjct: 286 IFVCSATHKTKSMFFFLAQTEQGDIFKITLETDEDMVTEIKLKYFDTVPVAASMCVLKTG 345
Query: 261 VVYIGSSYGDSQLIKL-------------NLQPDAKGSYVEVLERYV----------NLG 297
+++ S +G+ L ++ + P +G R + +L
Sbjct: 346 FLFVASEFGNHYLYQIAHLGDDDDEPEFSSAMPLEEGDTFFFAPRPLRNLVLVDEMDSLS 405
Query: 298 PIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSS 356
PI+ V DL + ++ G +LR++R+G+ ++E A EL G +W+++
Sbjct: 406 PIMACQVADLANEDTPELYITCGRGPRSTLRVLRHGLEVSEMAVSELPGNPNAVWTVKRR 465
Query: 357 TDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGS 416
D+ +D +++VSF++ T +L++ + +EE GF T TL C + LVQV
Sbjct: 466 VDEEYDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSALGEDALVQVYPDG 523
Query: 417 VRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD-GILTE-VKHA 474
+R + + R NEWK+P ++ N QV++A GG LVY E+ G L E +
Sbjct: 524 IRHIRADKR--VNEWKAPGKKTIVKCAVNQRQVVIALTGGELVYFEMDPTGQLNEYTERK 581
Query: 475 QLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKE------------ 522
++ E+ C+ + + S AVG+ D +VRI SL + +
Sbjct: 582 KMPSEVMCMALGNVAVGEQRSWFLAVGL-QDNTVRIISLDPSDCLAPRSMQALPAAAESL 640
Query: 523 ---HLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPIT 579
+G + S L + YL L +G LL +L+ +G+L D + LG++P+
Sbjct: 641 CIVEMGAKDANNSEDLSPQQSSLYLNIGLQNGVLLRTVLDPISGDLADTRTRYLGSRPVK 700
Query: 580 LRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEG 639
L + V A S R + Y + + ++ + + F+S P+ +
Sbjct: 701 LFRIKMQGNQAVLAMSSRSWLSYYYQNRFHLTPLSYESLEFASGFSSEQCPEGIVAISTN 760
Query: 640 ELTIGTIDDIQKLHIR-SIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMH 692
L I ++ + + + S PL PR+ S I ++ + EE++
Sbjct: 761 TLRILALEKLGAVFNQISFPLEYTPRKFAIHTDSAHLIIIETEHNAYTEETKQQ 814
Score = 82.0 bits (201), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 69/294 (23%), Positives = 135/294 (45%), Gaps = 41/294 (13%)
Query: 761 LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHI--LA 818
L+L+ + ++ + G++L + + ++LY D G ++L +C + HI
Sbjct: 937 LELVHKTTLDEVPLAICPYQGRVLVGVGRMLRLY-----DMGKKKLLRKC-ENKHIPNAV 990
Query: 819 LYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENN 878
+ + G I V D+ +S+ + YK +E + A D + W++ +LD D A+
Sbjct: 991 VSINAIGQRIYVSDVQESVYAVRYKRQENQLIVFADDTHPRWITTTCVLDYDTVATADKF 1050
Query: 879 FNLFTVRKNSEGATDE------------ERGRL-------EVVGEYHLGEFVNRFRHGSL 919
N+ +R S G D+ +RG L + V +H+GE V + +L
Sbjct: 1051 GNIAVIRLAS-GINDDVDEDPTGNKALWDRGLLNGASQKADTVACFHVGETVMSLQKATL 1109
Query: 920 VMRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQ 978
+ G ++++ T++G +GV+ HE + F + L+ ++R + G +H
Sbjct: 1110 I-------PGGSESLVYTTLSGTVGVLVPFTSHEDHDFFQHLEMHMRSEHPPLCGRDHLS 1162
Query: 979 WRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
+RS+ KN +DGDL E F + T+ IS + + E+ K++E++
Sbjct: 1163 FRSY-----YYPVKNVIDGDLCEQFNSIEPTKQKSISGDLERTASEVSKKLEDI 1211
>gi|401882870|gb|EJT47110.1| hypothetical protein A1Q1_04103 [Trichosporon asahii var. asahii CBS
2479]
Length = 1107
Score = 142 bits (357), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 204/874 (23%), Positives = 339/874 (38%), Gaps = 170/874 (19%)
Query: 217 LLGDHAGLLHLLVITHEKEKVTGLKIEL--LGETSIASTISYLDNAVVYIGSSYGDSQLI 274
LLGD G L + E E + +++ + +G S S+++YLDN ++ S+ GDS L+
Sbjct: 291 LLGDEYG--RLTAVGWEFEDMEKVQVGMIDMGVVSAPSSLTYLDNGYLFSASACGDSLLV 348
Query: 275 KLNL-------QPDAKGS----------------------YVEVLERYVNLGPIVDFCVV 305
L L QP KG V+V ER++N+ P DF +V
Sbjct: 349 YLVLPSPNSTRQPSGKGKEKAHDISDAGAYEIVTAEPQHGRVDVRERWMNIAPAKDFAIV 408
Query: 306 DLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVE-LQGIKGMWSLRSSTDDPFDTF 364
+ VV SG+ S R+VR+G+G ++E + GI+ MW++ ++ D P
Sbjct: 409 KEDDGRVSHVVVASGSASSNSFRVVRSGVGFENLMTIEEIPGIERMWTIPAA-DGPS--- 464
Query: 365 LVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTS 424
L+VSF T IL +E E+ + + T + L+QVT +RL S +
Sbjct: 465 LMVSFAYSTTIL--QIEPEVSVFKAADQVTAVPTFAAGLVDKSLLLQVTPEGIRLWSDLA 522
Query: 425 RELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLD 484
+ + NV +++ A G++ DG +T + +
Sbjct: 523 AGM---------LAGNVDAPEDNRIATANVRGNIAVAAFRDGTVTLFRASSQ-------- 565
Query: 485 INPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEGI--- 541
G A+ W +++L+ E + +S+ A+
Sbjct: 566 ----GLQEVVKGFLAIADWCG---------EIDLVAVEGADQGTVLKSIREAAYATSLQF 612
Query: 542 -------SYLLCALGDGHLLNFLLNMKTGELTD-RKKVSLGTQPITLRTF--SSKNTTHV 591
+ LL L DG +++ + + E +D R SLG +P+ L S HV
Sbjct: 613 QESNNEPTRLLAGLSDGTFVSYSVKLNGAECSDSRHASSLGLRPLRLIALDISPNAEEHV 672
Query: 592 FAA--SDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDI 649
AA SDR ++++ S +S+ K + + +
Sbjct: 673 VAAGISDRLSLVFESRDHYEFSSSGKKGI-----------------------VFERLTSL 709
Query: 650 QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEES----EMHFVRLLDDQTFEFI 705
+KL ++++ LG + +S ++ + V L + T E +
Sbjct: 710 KKLQVQTLDLGNRSATRVAALPGYNLVVAETVTRSMDHQTGDVLQSSSVELRNATTLELL 769
Query: 706 STYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENE----------------PTKG 749
S + L E S+ + + Y VGTA ENE +G
Sbjct: 770 SEFQLPEREAVASVNAVTL--HGRKYILVGTAIF---ENEDALEDATLEDVTSFIATNRG 824
Query: 750 RILVFIVEDG---KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTREL 806
R+L+F + + L L+ G VY +G L A + K+ + + + E
Sbjct: 825 RLLLFQINESAGPSLDLVTSMTFNGPVYDTVVIHGFLAVATSTKVSILRLTTQPPSLEEA 884
Query: 807 QS-ECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANW---MS 862
S H LA+ + +VVGD M+SI +L E G I RD NA+ +S
Sbjct: 885 ASFAFAFETHHLAVVEIDKEKRLVVGDAMRSIIVLSVDPESGDIVGDQRDMNAHLVRCLS 944
Query: 863 AVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMR 922
AV ++ + + ++G T L E V R + G+L
Sbjct: 945 AVHDVEPGVMI--------------ADGITP--------AASIGLSEDVTRLQPGTLA-- 980
Query: 923 LPDSDVGQI--PTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWR 980
P S G I ++ TVNG +GVI L L+ LQ N+ K+ KG GGL +W+
Sbjct: 981 -PVSAEGDILRADLLCTTVNGRLGVIGELGKGSIRTLDDLQRNMNKLYKGPGGL---EWK 1036
Query: 981 SFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEI 1014
N + F+DGD ++ F L + D+I
Sbjct: 1037 ESGNMLVPRETVGFIDGDFVQRFSSLDSSLQDKI 1070
>gi|307205956|gb|EFN84082.1| Splicing factor 3B subunit 3 [Harpegnathos saltator]
Length = 1217
Score = 141 bits (356), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 170/776 (21%), Positives = 322/776 (41%), Gaps = 98/776 (12%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+L V ++G + +L FR G +D++ + ++ + +L++ + + + + + G
Sbjct: 53 LLTVEVFGIVRSLMAFRLTGGTKDYIVVGSDSGRIVILEY-IPAKNIFEKVHQETFGKSG 111
Query: 61 -RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVL---DI 114
R GQ IDP R +IG L ++ D + +L + + + L +
Sbjct: 112 CRRIVPGQYLAIDPKGRAVMIGAIEKQKLVYILNRDAEARLTISSPLEAHKSNTLVYHTV 171
Query: 115 KFLYGCAKPTIVVLYQDNKDARHVKTYEVALKDK------------DFVEGPWSQNNLDN 162
G P L D ++A T + A+K + + V +S+ L+
Sbjct: 172 GVDVGFENPMFACLEIDYEEADSDPTGDAAVKTQQTLTLYELDLGLNHVVRKYSEP-LEE 230
Query: 163 GADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR-- 215
A+ L+ VP P GVLI E + Y N IR I + +D D R
Sbjct: 231 HANFLVSVPGGNDGP--SGVLICSENYLTY--KNLGDQHDIRCPIPRRRNDLD-DPERGM 285
Query: 216 ---------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNA 260
+L G + + + + + VT +K++ +A+++ L
Sbjct: 286 IFVCSATHKTKSMFFFLAQTEQGDIFKITLETDDDMVTEIKLKYFDTVPVAASMCVLKTG 345
Query: 261 VVYIGSSYGDSQLIKL-------------NLQPDAKGSYVEVLERYV----------NLG 297
+++ S +G+ L ++ + P +G R + +L
Sbjct: 346 FLFVASEFGNHYLYQIAHLGDDDDEPEFSSAMPLEEGDTFFFAPRPLRNLVLVDEMDSLS 405
Query: 298 PIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSS 356
PI+ V DL + Q+ G +LR++R+G+ ++E A EL G +W+++
Sbjct: 406 PIMACQVADLANEDTPQLYIACGRGPRSTLRVLRHGLEVSEMAVSELPGNPNAVWTVKRR 465
Query: 357 TDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGS 416
D+ +D +++VSF++ T +L++ + +EE GF T TL C + LVQV
Sbjct: 466 VDEEYDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSALGEDALVQVYPDG 523
Query: 417 VRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD-GILTE-VKHA 474
+R + + R NEWK+P ++ N QV++A GG LVY E+ G L E +
Sbjct: 524 IRHIRADKR--VNEWKAPGKKTIVKCAVNQRQVVIALTGGELVYFEMDPTGQLNEYTERK 581
Query: 475 QLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSV- 533
++ E+ C+ + + S AVG+ D +VRI SL + + + + +P +
Sbjct: 582 KMPSEVMCMALGNVAVGEQRSWFLAVGL-QDNTVRIISLDPSDCLAPRSM--QALPAAAE 638
Query: 534 LLCAFE-GIS---------------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQP 577
LC E G+ YL L +G LL +L+ +G+L D + LG++P
Sbjct: 639 SLCIVEMGVKDADNSEDSAPQQSSLYLNIGLQNGVLLRTVLDPISGDLADTRTRYLGSRP 698
Query: 578 ITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAK 637
+ L + V A S R + Y + + ++ + + F+S P+ +
Sbjct: 699 VKLFRIRMQGNQAVLAMSSRSWLSYYYQNRFHLTPLSYESLEFASGFSSEQCPEGIVAIS 758
Query: 638 EGELTIGTIDDIQKLHIR-SIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMH 692
L I ++ + + + S PL PR+ S + ++ + +E++
Sbjct: 759 TNTLRILALEKLGAVFNQVSFPLEYTPRKFAIHADSAHLVVIETEHNAYTDETKQQ 814
Score = 78.6 bits (192), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 67/294 (22%), Positives = 134/294 (45%), Gaps = 41/294 (13%)
Query: 761 LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHI--LA 818
L+L+ + ++ + G++L + + ++LY D G ++L +C + HI
Sbjct: 937 LELLHKSPLDEVPLAICPYQGRVLVGVGRMLRLY-----DMGKKKLLRKC-ENKHIPNAV 990
Query: 819 LYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENN 878
+ + G I V D+ +S+ + YK +E + A D + W++ +LD D A+
Sbjct: 991 VSINAIGQRIYVSDVQESVYAVRYKRQENQLIVFADDTHPRWITTTCVLDYDTVATADKF 1050
Query: 879 FNLFTVRKNSEGATDE------------ERGRL-------EVVGEYHLGEFVNRFRHGSL 919
N+ +R + G D+ +RG L + V +H+GE V + +L
Sbjct: 1051 GNIAVIRL-ATGINDDVDEDPTGNKALWDRGLLNGASQKADTVACFHVGETVMSLQKATL 1109
Query: 920 VMRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQ 978
+ G ++++ T++G +GV+ HE + F + L+ ++R + G +H
Sbjct: 1110 I-------PGGSESLVYTTLSGTVGVLVPFTSHEDHDFFQHLEMHMRSEHPPLCGRDHLS 1162
Query: 979 WRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
+RS+ KN +DGDL E F + + IS + + E+ K++E++
Sbjct: 1163 FRSY-----YYPVKNVIDGDLCEQFNSIEPAKQKSISGDLERTPSEVSKKLEDI 1211
>gi|268568396|ref|XP_002640241.1| C. briggsae CBR-TAG-203 protein [Caenorhabditis briggsae]
Length = 1218
Score = 139 bits (350), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 170/735 (23%), Positives = 307/735 (41%), Gaps = 88/735 (11%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESS---ELITRAMGDVSDRIGRP 62
I+G + +L FR +DF+ + ++ + +LQ++ E + L G R P
Sbjct: 59 IFG-VRSLLAFRLTAGTRDFIAVGSDSGRIVILQYNPEKTCFERLHQETFGKTGCRRIVP 117
Query: 63 TDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL--- 117
G DP R +IG L ++ D + L + + + L +
Sbjct: 118 ---GHYLAGDPRGRALMIGAVERQKLVYIMNRDAEAHLTISSPLEAHKQHTLCYAMVGID 174
Query: 118 YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNGADL 166
G PT L D +DA + T E A + + F E N+ L + +L
Sbjct: 175 VGFENPTFACLEFDYEDADNDPTGEAAKRTQQTLTFYELDLGLNHVVRKYAEPLTDPGNL 234
Query: 167 LIPVP---PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR-------- 215
LI VP GV++ E IVY N IR I + +D D R
Sbjct: 235 LIAVPGGNEGPSGVIVCCENYIVY--KNLGDQPDIRCPIPRRRNELD-DADRTMLIIATA 291
Query: 216 ---------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGS 266
+L+ G + + + +++ VT +K++ A+ + L +++ +
Sbjct: 292 THKTKNMFFFLIQAENGDIFKVTLETDEDLVTEMKLKYFDTVPPANALCILKAGFLFVAA 351
Query: 267 SYGDSQLIKLN-----------------------LQPDAKGSYVEVLERYVNLGPIVDFC 303
+G+ +L ++ +P S + + + +L P+ D
Sbjct: 352 EFGNHELYQIASLGEGGDDEFSSAMGFGENDAAFFEPHELRSLIPI-DSMDSLSPLTDAV 410
Query: 304 VVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTDDPFD 362
+ D+ R+ Q+ T G ++++RNG+ I+E A +L G +W+++ + +D +D
Sbjct: 411 IGDIAREDAAQLFTLVGRGARSHMKVLRNGLEISEMAVSDLPGNPNAVWTVKKNIEDQYD 470
Query: 363 TFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSS 422
+++VVSF++ T L + + D +EE GF T T+ C + LVQ+ S +R + +
Sbjct: 471 SYIVVSFVNAT--LTLTIGDTVEEASDSGFLPTTPTIGCSMIGDDSLVQIYSEGIRHIRA 528
Query: 423 TSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTEVKHAQL-EYEI 480
R NEWK+PP + N QV +A GG LVY E+ +G L E +L +I
Sbjct: 529 DKR--INEWKAPPRRQIVKCAVNRRQVAVALSGGELVYFELDLNGTLNEFTERKLFNADI 586
Query: 481 SCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPRSVLLCAF- 538
+C+ + I E S+ A+G D +VRI SL P+ L+ P S+LL
Sbjct: 587 ACMTFSEISEGELNSRFLALGT-VDNAVRIISLDPNDMLMPLSTQNLPCPPESILLIDTP 645
Query: 539 ----EGIS--YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
+G++ +L L +G L ++ TG + D + LGT+P L + + +
Sbjct: 646 NEDGKGVASVHLNIGLQNGCLFRNTVDNVTGAIMDTRTRYLGTRP-KLFKVQVQGRSAIL 704
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQ-K 651
S R ++Y ++ + ++ + + F S + + L I + +
Sbjct: 705 CTSSRSWLLYHFQRRFHLTPLSYANLEYAASFCSNQCAEGVVAISASTLRIIAAEKLGVA 764
Query: 652 LHIRSIPLGEHPRRI 666
+++S PRRI
Sbjct: 765 FNVQSFEHKMTPRRI 779
Score = 74.7 bits (182), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 68/309 (22%), Positives = 143/309 (46%), Gaps = 39/309 (12%)
Query: 746 PTKGRILVFIVEDG--KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGT 803
P +G + F + + + ET V +++ F G L + +++Y D G
Sbjct: 921 PVRGCVYTFHLSPNGDRFDFLHRTETPLPVGAIHDFRGMALVGFGKFLRMY-----DIGQ 975
Query: 804 RELQSECGHHGHILALY-VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMS 862
++L ++C + + + +Q+ G I+V D +S+ L Y+ + + A D ++S
Sbjct: 976 KKLLAKCENKNFPVNIVNIQSTGQRIIVSDSQESVHFLRYRKGDNQLVVFADDTTPRYVS 1035
Query: 863 AVEILDDDIYLGAENNFNLFTVR---KNSEGATDE--------ERG-------RLEVVGE 904
V +LD A+ NL VR + +E D+ +RG ++E+V
Sbjct: 1036 CVCVLDYHTVAIADKFGNLSVVRLPERVNEDVQDDPTVSKSVWDRGWLNGASQKVELVAN 1095
Query: 905 YHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIAS-LPHEQYLFLEKLQTN 963
+ +G+ + + SL +P G +++ T+ G IG + S + ++ F L+ +
Sbjct: 1096 FFIGDTITSLQKTSL---MP----GANEALVYTTIGGAIGCLVSFMSKDEVDFFTNLEMH 1148
Query: 964 LRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVE 1023
+R + G +H +RS+ K+V +DGD+ E F + ++ E+++ + +V
Sbjct: 1149 VRSEYPPLCGRDHLSYRSYYAPCKSV-----IDGDICEQFSLMELSKQKEVAEELGKTVS 1203
Query: 1024 ELCKRVEEL 1032
E+ K++E++
Sbjct: 1204 EISKKLEDI 1212
>gi|66553024|ref|XP_623333.1| PREDICTED: splicing factor 3B subunit 3 isoform 1 [Apis mellifera]
gi|380015815|ref|XP_003691890.1| PREDICTED: splicing factor 3B subunit 3-like [Apis florea]
Length = 1217
Score = 139 bits (349), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 170/774 (21%), Positives = 318/774 (41%), Gaps = 94/774 (12%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+L V ++G I +L FR G +D++ + ++ + +L++ + + + + + G
Sbjct: 53 LLTVEVFGIIRSLMAFRLTGGTKDYIVVGSDSGRIVILEY-IPAKNVFEKVHQETFGKSG 111
Query: 61 -RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVL---DI 114
R GQ IDP R +IG L ++ D + +L + + + L +
Sbjct: 112 CRRIVPGQYLAIDPKGRAVMIGAIEKQKLVYILNRDPEARLTISSPLEAHKSNTLVYHTV 171
Query: 115 KFLYGCAKPTIVVLYQDNKDARHVKTYEVALKDK------------DFVEGPWSQNNLDN 162
G P L D ++A T + A+K + + V +S+ L+
Sbjct: 172 GVDVGFENPMFACLEIDYEEADSDPTGDAAVKTQQTLTLYELDLGLNHVVRKYSEP-LEE 230
Query: 163 GADLLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR-- 215
A+ L+ VP P GVLI E + Y N IR I + +D D R
Sbjct: 231 HANFLVSVPGGNDGP--SGVLICSENYLTY--KNLGDQHDIRCPIPRRRNDLD-DPERGM 285
Query: 216 ---------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNA 260
+L G + + + +++ VT +K++ +A+++ L
Sbjct: 286 IFVCSATHKTKSMFFFLAQTEQGDIFKITLETDEDMVTEIKLKYFDTVPVAASMCVLKTG 345
Query: 261 VVYIGSSYGDSQLIKL-------------NLQPDAKGSYVEVLERYV----------NLG 297
+++ S +G+ L ++ + P +G R + +L
Sbjct: 346 FLFVASEFGNHYLYQIAHLGDDDDEPEFSSAMPLEEGDTFFFAPRPLRNLVLVDEMDSLS 405
Query: 298 PIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSS 356
PI+ V DL + Q+ G +LR++R+G+ ++E A EL G +W+++
Sbjct: 406 PIMACQVADLANEDTPQLYITCGRGPRSTLRVLRHGLEVSEMAVSELPGNPNAVWTVKRR 465
Query: 357 TDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGS 416
D+ +D +++VSF++ T +L++ + +EE GF T TL C + LVQV
Sbjct: 466 VDEEYDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSALGEDALVQVYPDG 523
Query: 417 VRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKHA 474
+R + + R NEWK+P ++ N QV++A GG LVY E+ G L E +
Sbjct: 524 IRHIRADKR--VNEWKAPGKKTIVKCAVNQRQVVIALTGGELVYFEMDPTGQLNEYTERK 581
Query: 475 QLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKE------------ 522
++ E+ C+ + + S AVG+ D +VRI SL + +
Sbjct: 582 KMPSEVMCMALGNVAIGEQRSWFLAVGL-QDNTVRIISLDPSDCLAPRSMQALPAAAESL 640
Query: 523 ---HLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPIT 579
+G + S L + YL L +G LL +L+ +G+L D + LG++ +
Sbjct: 641 CIVEMGAKDANNSEELSLQQSSLYLNIGLQNGVLLRTVLDPISGDLADTRTRYLGSRAVK 700
Query: 580 LRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEG 639
L + V A S R + Y + + ++ + + F+S P+ +
Sbjct: 701 LFRIKMQGNQAVLAMSSRSWLSYYYQNRFHLTPLSYESLEFASGFSSEQCPEGIVAISTN 760
Query: 640 ELTIGTIDDIQKLHIR-SIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMH 692
L I ++ + + + S PL PR+ S I ++ + EE++
Sbjct: 761 TLRILALEKLGAVFNQISFPLEYTPRKFAIHSDSAHLIIIETEHNAYTEETKQQ 814
Score = 80.1 bits (196), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 68/294 (23%), Positives = 134/294 (45%), Gaps = 41/294 (13%)
Query: 761 LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHI--LA 818
L+L+ + ++ + G++L + + ++LY D G ++L +C + HI
Sbjct: 937 LELVHKTTLDEVPLAICPYQGRVLVGVGRMLRLY-----DMGKKKLLRKC-ENKHIPNAV 990
Query: 819 LYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENN 878
+ + G I V D+ +S+ + YK +E + A D + W++ +LD D A+
Sbjct: 991 VSINAIGQRIYVSDVQESVYAVRYKRQENQLIVFADDTHPRWITTTCVLDYDTVATADKF 1050
Query: 879 FNLFTVRKNSEGATDE------------ERGRL-------EVVGEYHLGEFVNRFRHGSL 919
N+ +R S G D+ +RG L + V +H+GE V + +L
Sbjct: 1051 GNIAVIRLAS-GINDDVDEDPTGNKALWDRGLLNGASQKADTVACFHVGETVMSLQKATL 1109
Query: 920 VMRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQ 978
+ G ++++ T++G +GV+ HE + F + L+ ++R + G +H
Sbjct: 1110 I-------PGGSESLVYTTLSGTVGVLVPFTSHEDHDFFQHLEMHMRSEHPPLCGRDHLS 1162
Query: 979 WRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
+RS+ KN +DGDL E F + + IS + + E+ K++E++
Sbjct: 1163 FRSY-----YYPIKNVIDGDLCEQFNSIEPAKQKSISSDLERTASEVSKKLEDI 1211
>gi|452002380|gb|EMD94838.1| hypothetical protein COCHEDRAFT_1128717 [Cochliobolus heterostrophus
C5]
Length = 1235
Score = 138 bits (347), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 226/1004 (22%), Positives = 393/1004 (39%), Gaps = 161/1004 (16%)
Query: 155 WSQNNLDNGADLLIPVP---PPLCGVLIIGEETIVY------CSANAFKAIPIRPSITKA 205
WS+ +D A+ L VP GVL GE++I Y S+ AIP R T+
Sbjct: 224 WSEP-VDRTANTLFRVPGGPSAPSGVLCCGEDSITYRRIFNNKSSVRRLAIPRREGATED 282
Query: 206 YGR-----------VDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTI 254
R + YLL G + L + V +KI+ + ++I
Sbjct: 283 PNRKRMIVAGTLYSLKGGDFFYLLQTEDGDVFKLTVDAPNGTVENIKIKYFDTIPVTTSI 342
Query: 255 SYLDNAVVYIGSSYGDSQLIKL---------------NLQPDAKGSY------------V 287
L VY GD L +L D + S+ +
Sbjct: 343 CILRAGFVYAACESGDRILYELESLGDETDDPVFESSQFPVDPEASFAPPFFRPRALVNL 402
Query: 288 EVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVEL-QG 346
+E +L PI+D V + + Q+ T +G S R RN + + + L Q
Sbjct: 403 TAVEAMPSLNPIMDMEVANPALEDAPQIYTINGTGGRSSFRTTRNALEVLDLIESPLPQN 462
Query: 347 IKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIY 406
+W+ + +++D DT +V+ +R L + + D++EE GF T TL
Sbjct: 463 ASDVWTTKLTSEDETDTLIVLCL--HSRTLVLKIGDDVEEASNTGFLPDTNTLGVQQFGE 520
Query: 407 NQLVQVTSGSVRLV---------SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGH 457
+ ++Q+ +R + +S + +W+ P ++ N QV +A G
Sbjct: 521 DCIIQIHPKGIRHIQGIQFPNDDASATHASLTDWQPPAHRTIVACATNNRQVAIALSSGQ 580
Query: 458 LVYLEI-GDGILTEVKHA-QLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-P 514
++Y E DG L + L+ I+CL + + E + AVG +D +VRI++L P
Sbjct: 581 ILYFECDSDGSLAMAEEEIVLDSTINCLAMPDVPEGSVRAFFLAVGC-SDQTVRIYNLSP 639
Query: 515 DL--NLITKEHLGGEIIPRSVLLC------AFEGIS-YLLCALGDGHLLNFLLNMKTGEL 565
D+ N++ + P S L + G S +L L G + +L+ TG++
Sbjct: 640 DMEGNILRSISVQALTSPPSDLTVNLMTDKSSRGYSQFLHIGLRSGVYIRSVLDEMTGDI 699
Query: 566 TDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKK--LLYSNVNLKEVSHMCP 623
D ++ LG +PI + + A + RP + Y+ + L + +N
Sbjct: 700 GDTRRRFLGPEPIKFAKVTVAGEPAILAMTSRPWLGYTHPRTGVLQLTPLNYISFKSAWN 759
Query: 624 FNSAAFPDSLAIAKEGELTIGTIDDI-QKLHIRSIPL--------GEHPRRICHQEQSRT 674
F+ + F + ++ EL I T +D+ SIPL G H + + + QS
Sbjct: 760 FDGSQFKGIICVSAN-ELRIFTFNDLTDNTTYESIPLKYTPRKMVGYHDQGVFYVIQSDN 818
Query: 675 FAICSLKNQSC---------------------------------------AEESEMHFVR 695
I + + Q A+ S ++
Sbjct: 819 NTISADRRQQLIAQSGGKKEDGTNGSMETEQSNGATDSDEFPAVDFGYPRAQGSWASCIQ 878
Query: 696 LLDDQTFEFISTYPLDTFEYGCSILSCSF----SDDSNVYYCVGTAYVL---PEENEPTK 748
++D T E T+ ++ S++S + S + + VGTA L P +
Sbjct: 879 VVDPVT-EKAVTHTIE-LNGNISLVSAALVFFESRNDEAFLAVGTAKDLSFTPYKFSSAS 936
Query: 749 GRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQS 808
+I +L+ E +L AF GKLLA I + + LY ++ R+ Q+
Sbjct: 937 IQIYKINPTGRELEFFHETTVSDPPLALLAFKGKLLAGIGRHLCLYDCGMKS-VLRKAQA 995
Query: 809 ECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHE--EGAIEERARDYNANWMSAVEI 866
I L +T+G +VV D +S++ +++K + + D A +A E+
Sbjct: 996 PNCVPTRITGL--KTQGSRLVVSDQAQSVTYVVHKDQVHPNRLIPFVDDTVARHTTASEM 1053
Query: 867 LDDDIYLGAENNFNLFTVR---KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRL 923
LD D +G + N++ VR K SE + + G +V + +LG NR L+
Sbjct: 1054 LDYDTTVGGDKFGNIWLVRCPQKVSESSDESPDGSDLLVDKSYLGGTPNRL---DLIAHY 1110
Query: 924 PDSDV-----------GQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGV 971
+D+ G V + + G +G + + ++L+ LR K +
Sbjct: 1111 FTNDIPVSIQKTVLLSGGERVVFWAGLQGTLGALIPFNSRRSHKMFQQLELQLRSDDKPL 1170
Query: 972 GGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEIS 1015
G +H +RS+ K+V +DGDLIE FL L R + + I+
Sbjct: 1171 SGRDHLAFRSYFAPVKSV-----IDGDLIERFLVLPRDKRESIA 1209
>gi|452845193|gb|EME47126.1| hypothetical protein DOTSEDRAFT_69180 [Dothistroma septosporum NZE10]
Length = 1223
Score = 138 bits (347), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 272/1197 (22%), Positives = 474/1197 (39%), Gaps = 211/1197 (17%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+LD ++G + + FR G + D L ++++ + V+ +DAE + +
Sbjct: 67 VLDQNVFGIVRGVTAFRIPGTSTDQLIVSSDSGRVAVVNYDAEKNRFNKVHLETYGKSGV 126
Query: 61 RPTDNGQIGIIDPDCRLIGLH-------LY------DGLFKVIPFDNKGQLKE------A 101
R T GQ D R I L +Y DG ++ Q A
Sbjct: 127 RRTIPGQYLASDTRGRCIMLASAEKNKVVYMMNRNADGTIQISSPHEANQWGSLCFAVCA 186
Query: 102 FNIRLEE--LQVLDIKFLYGCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNN 159
+ E L++ + + PT + K + + Y V L V+ WS +
Sbjct: 187 LDTGWEPPVFAALEVDYTEAESDPTGTM---HEKREKQLVYYTVDLGLNHVVKS-WS-DT 241
Query: 160 LDNGADLLIPVP-----PPLCGVLIIGEETIVY-CSANAFKAIPI--RPSITKAYGRVDA 211
+D A+++ VP P GVL+ E+ I Y NA +IPI R T+ D
Sbjct: 242 VDYTANMIFGVPGGQDGP--SGVLVCAEDRIYYRHDKNASLSIPIPRRKGSTE-----DP 294
Query: 212 DGSRYLLGDHAGLLHLLVITHE------KEKVTGLKIEL-LGETSIASTIS--------Y 256
+ R ++ AG LHL HE E K+ + +GE + T S Y
Sbjct: 295 NRKRTIV---AGCLHLAKTRHEFFFLLQTEDGDVFKLSMAMGEDAQGRTTSNPERIILKY 351
Query: 257 LDNAVV------------YIGSSYGDSQLIKLN-----LQPDAKGSYV------EVLERY 293
D V Y+ + G+SQL ++ L+ + ++ E +RY
Sbjct: 352 YDTFPVARQMLLHKKGYLYVATENGNSQLFHIDDLADDLEFEPHNTFTSDDISPEPGDRY 411
Query: 294 -------------------VNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGI 334
L P++ V +L + Q+ G + +R+G+
Sbjct: 412 EPMYFQPRELTMTHLAVDIPGLHPLLKTKVDNLTNEDAPQIYAIQGTGNKSQFKTIRHGL 471
Query: 335 GI-----NEQASVELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEI 389
+ N +V I W+ + + D +L++S +A ++ D +E+ E
Sbjct: 472 DVEVLINNSMGNVPYDNI---WTFKHRSSDDHHKYLLLSSSYGDLTIACSIGDSVEQIES 528
Query: 390 EGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQV 449
F T+ LVQV + +R + + NEW+SP +V VA+AN Q+
Sbjct: 529 SPFLENRATVHAQQMGDATLVQVHARGIRSILESG--AHNEWQSPAHRTVVVASANERQL 586
Query: 450 LLATGGGHLVYLEIG-DGILTEVKH-AQLEYEISCLDINPIGENPSYSQIAAVGMWTDIS 507
LLA L + +G DGIL +++ ++ +I+ L + ++ A VG D +
Sbjct: 587 LLALSSAELAFFFMGDDGILNQLEEMPEMSGKITALSVGQTPRGRQQARYAVVGC-DDCT 645
Query: 508 VRIFSLP-DLNLITKEHLGGEIIPRSVLLCAFEG------ISYLLCALGDGHLLNFLLNM 560
+R+ S+ D L + IP S+ + ++ + L G L +++
Sbjct: 646 IRVLSIELDSPLEPRSVQALSAIPTSLEVVEMLDPASNTIVNVVHIGLQSGLYLRAIIDE 705
Query: 561 KTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSH 620
TGEL D + LGTQP L + + A S RP + ++ + LY+ L
Sbjct: 706 TTGELGDVRTKFLGTQPPRLCPVEVNDQDCILACSSRPWLGFNHPQNNLYTVTPLITE-- 763
Query: 621 MCPFNSA-AF--PD--SLAIAKEGELTIGTIDDIQ-KLHIRSIPLGEHPRRICHQEQSRT 674
P N+A +F PD L + L I T+ I+ +L +I L PR +
Sbjct: 764 --PINAARSFISPDLSGLCAIQGSSLLIFTVPSIEGRLSHSTIDLNYTPRSMTRNPWYPI 821
Query: 675 FAIC-------------SLKNQSCAEES-----EMHF------------VRLLDDQTFE- 703
+ I L+ +S E+ E H ++ +D T
Sbjct: 822 WYIAESDGNALSKATRDQLRGKSIEEDEGATALEKHLGLPRGVQHWASCIQAVDPVTKNA 881
Query: 704 FISTYPLDTFEYGCSILSCSF-SDDSNVYYCVGTA-YVLPEENEPTKGRILVF-IVEDG- 759
+ST L E +F S D V+ VGT ++ P G + ++ +VEDG
Sbjct: 882 VVSTVELAENEAALCCTCVAFESRDWEVFLAVGTGQHMQPGTGVQAVGYVHIYKLVEDGT 941
Query: 760 KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLY----KWMLRD-DGTRELQSECGHHG 814
KL+ + + + VY++ AFNG+L + ++ +Y K MLR GT
Sbjct: 942 KLEFVHKTKFDLPVYTVLAFNGRLALGVGNELFIYDMGIKAMLRKARGT-------ATPN 994
Query: 815 HILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG 874
I++L Q G+ IV GD+ + I+ ++YK + + D W + ++D + G
Sbjct: 995 QIVSLEAQ--GNRIVCGDVSEGITYVVYKPKFNRMIPFVDDVVQRWTTCTTMVDYETAAG 1052
Query: 875 AENNFNLFTVR---KNSEGATDEERG---------------RLEVVGEYHLGEFVNRFRH 916
+ NL+ VR + S+ A +E G RL++ Y + +
Sbjct: 1053 GDKFGNLWVVRCPEQPSQEADEEGAGGFIMNERSYLNGAPYRLDLRAHYFCQDIPTSMQR 1112
Query: 917 GSLVMRLPDSDVGQIPTVIFGTVNGVIGVIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLN 975
+LV G + + + G +G++ + E F +L+ +R + G +
Sbjct: 1113 TALV-------AGGQELLFWSGLQGTLGMLVPFVTREDVEFFTQLEQQMRAEDPPLAGRD 1165
Query: 976 HEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
H +RS+ V K +DGDL E F+ LS +I+ ++ V+E+ K+++E+
Sbjct: 1166 HLMYRSY-----YVPVKGVIDGDLCERFMHLSYDGKQKIAAEVDRGVKEIEKKIQEM 1217
>gi|307166104|gb|EFN60356.1| Splicing factor 3B subunit 3 [Camponotus floridanus]
Length = 1201
Score = 137 bits (346), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 170/771 (22%), Positives = 317/771 (41%), Gaps = 104/771 (13%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+L V ++G I +L FR G +D++ + ++ + +L++ + + + + + G
Sbjct: 53 LLTVEVFGIIRSLMAFRLTGATKDYIVVGSDSGRIVILEY-IPAKNMFDKVHQETFGKSG 111
Query: 61 -RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYG 119
R GQ IDP R + + L P + N + +D+ F
Sbjct: 112 CRRIVPGQYLAIDPKGRAVMIEAR--LTISSPLEAHKS-----NTLVYHTVGVDVGF--- 161
Query: 120 CAKPTIVVLYQDNKDARHVKTYEVALKDK------------DFVEGPWSQNNLDNGADLL 167
P L D ++A T + A+K + + V +S+ L+ A+ L
Sbjct: 162 -DNPMFACLEIDYEEADSDPTGDAAVKTQQTLTLYELDLGLNHVVRKYSEP-LEEHANFL 219
Query: 168 IPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR------- 215
+ VP P GVLI E + Y N IR I + +D D R
Sbjct: 220 VSVPGGNDGP--SGVLICSENYLTY--KNLGDQHDIRCPIPRRRNDLD-DPERGMIFVCS 274
Query: 216 ----------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIG 265
+L G + + + +++ VT +K++ +A+++ L +++
Sbjct: 275 ATHKTKSMFFFLAQTEQGDIFKITLETDEDMVTEIKLKYFDTVPVAASMCVLKTGFLFVA 334
Query: 266 SSYGDSQLIKL-------------NLQPDAKGSYVEVLERYV----------NLGPIVDF 302
S +G+ L ++ + P +G R + +L PI+
Sbjct: 335 SEFGNHYLYQIAHLGDDDDEPEFSSAMPLEEGDTFFFAPRPLRNLVLVDEMDSLSPIMAC 394
Query: 303 CVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTDDPF 361
V DL + Q+ G +LR++R+G+ ++E A EL G +W+++ D+ +
Sbjct: 395 QVADLANEDTPQLYIACGRGPRSTLRVLRHGLEVSEMAVSELPGNPNAVWTVKRRIDEEY 454
Query: 362 DTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVS 421
D +++VSF++ T +L++ + +EE GF T TL C + LVQV +R +
Sbjct: 455 DAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSALGEDALVQVYPDGIRHIR 512
Query: 422 STSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD-GILTE-VKHAQLEYE 479
+ R NEWK+P ++ N QV++A GG LVY E+ G L E + ++ E
Sbjct: 513 ADKR--VNEWKAPGKKTIVKCAVNQRQVVIALTGGELVYFEMDPTGQLNEYTERKKMPSE 570
Query: 480 ISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSV-LLCAF 538
+ C+ + + S AVG+ D +VRI SL + + + + +P + LC
Sbjct: 571 VMCMALGNVAVGEQRSWFLAVGL-QDNTVRIISLDPSDCLAPRSM--QALPAAAESLCIV 627
Query: 539 EGIS----------------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRT 582
E + YL L +G LL +L+ +G+L D + LG++P+ L
Sbjct: 628 EMGAKEADNSEDSAPQQSSLYLNIGLQNGVLLRTVLDPISGDLADTRTRYLGSRPVKLFR 687
Query: 583 FSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELT 642
+ V A S R + Y + + ++ + + F+S P+ + L
Sbjct: 688 IRMQGNQAVLAMSSRSWLSYYHQNRFHLTPLSYESLEFASGFSSEQCPEGIVAISTNTLR 747
Query: 643 IGTIDDIQKLHIR-SIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMH 692
I ++ + + + S PL PR+ S I ++ + EE++
Sbjct: 748 ILALEKLGAVFNQVSFPLEYTPRKFAIHTDSAHLVIIETEHNAYTEETKQQ 798
Score = 74.3 bits (181), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 64/276 (23%), Positives = 127/276 (46%), Gaps = 41/276 (14%)
Query: 779 FNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHI--LALYVQTRGDFIVVGDLMKS 836
+ G++L + + ++LY D G ++L +C + HI + + G I V D+ +S
Sbjct: 939 YQGRVLVGVGRMLRLY-----DMGKKKLLRKC-ENKHIPNAVVSINAIGQRIYVSDVQES 992
Query: 837 ISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDE-- 894
+ + YK +E + A D + +++ +LD D A+ N+ +R + G D+
Sbjct: 993 VYAVRYKRQENQLIVFADDTHPRFITTTCVLDYDTVATADKYGNIAVIRL-ATGINDDVD 1051
Query: 895 ----------ERGRL-------EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFG 937
+RG L + V +H+GE V + +L+ G ++++
Sbjct: 1052 EDPTGNKALWDRGLLNGASQKADTVACFHVGETVMSLQKATLI-------PGGSESLVYT 1104
Query: 938 TVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLD 996
T++G +GV+ HE + F + L+ ++R + G +H +RS+ KN +D
Sbjct: 1105 TLSGTVGVLVPFTSHEDHDFFQHLEMHMRSEHPPLCGRDHLSFRSY-----YYPVKNVID 1159
Query: 997 GDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
GDL E F + + IS + + E+ K++E++
Sbjct: 1160 GDLCEQFNSIEPVKQKSISGDLERTPSEVSKKLEDI 1195
>gi|302654421|ref|XP_003019018.1| hypothetical protein TRV_07031 [Trichophyton verrucosum HKI 0517]
gi|291182708|gb|EFE38373.1| hypothetical protein TRV_07031 [Trichophyton verrucosum HKI 0517]
Length = 802
Score = 137 bits (345), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 175/701 (24%), Positives = 296/701 (42%), Gaps = 100/701 (14%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
++G I +L FR G ++D++ I ++ + ++++ + R + + G R
Sbjct: 64 VFGIIRSLAAFRLAGSSKDYIIIGSDSGRITIVEY-VPAQNRFNRIHLETFGKSGVRRVV 122
Query: 65 NGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEE-------LQVLDIK 115
GQ +DP R LI + L V+ + + +L + + L LD+
Sbjct: 123 PGQYLAVDPKGRACLIASVEKNKLVYVLNRNAQAELTISSPLEAHRPQTVVFALTALDVG 182
Query: 116 F---LYGCAKPTIVVLYQDNKDARHVKT------YEVALKDKDFVEGPWSQNNLDNGADL 166
+ ++ + + QD + +T YE+ L V W+ + +D A +
Sbjct: 183 YENPIFAALEVEYTEVDQDPTGQAYEETEKMLVYYELDLGLNHVVRR-WA-DPVDRTASM 240
Query: 167 LIPVP-----PPLCGVLIIGEETIVYCSAN--AFKA-IPIR--PSITKAYGRVDADGSR- 215
L VP P GVL+ E+ IVY +N AF+ IP R P+ R G
Sbjct: 241 LFQVPGGADGP--SGVLVCAEDNIVYRHSNQDAFRVPIPRRRGPTENPERKRCITAGVMH 298
Query: 216 -------YLLGDHAGLLH---LLVITHEKEKVTG----LKIELLGETSIASTISYLDNAV 261
+LL G L + ++ E EK TG LK++ +AS++ L +
Sbjct: 299 KMRGAFFFLLQSEDGDLFKVTMEMVEDENEKATGEVKRLKLKYFDTVPLASSLCILKSGF 358
Query: 262 VYIGSSYGD-----------------------SQLIKLNLQP----DAKGSYVEVLERYV 294
+++ S G+ S +I L P + ++E
Sbjct: 359 LFVASETGNQHFYQFEKLGDDDDEIEFISDDYSAVISEPLPPVYFRPRPAENLNLVESIA 418
Query: 295 NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSL 353
+L P++ + ++ + Q+ T G S R +++G+ ++E EL + +W+
Sbjct: 419 SLNPLMAASIANITEEDAPQIYTLCGTSARSSFRTLKHGLEVSEIVESELPSVPSAVWTT 478
Query: 354 RSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVT 413
+ S +D FD ++V+SF + T +L++ + +EE GF S TL + L+QV
Sbjct: 479 KLSRNDQFDAYIVLSFSNGTLVLSIG--ETVEEVTDTGFLSSAPTLAVQQLGEDSLIQVH 536
Query: 414 SGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTEV- 471
+R + + R NEW +P S+ AT N QV +A G +VY E+ DG L E
Sbjct: 537 PKGIRHIHADQR--VNEWPAPQHRSIVAATTNERQVAIALSSGEIVYFEMDTDGSLAEYD 594
Query: 472 KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIP 530
+ Q+ ++CL + + E S AVG D +VRI SL PD L K P
Sbjct: 595 EKRQMSGTVTCLSLGEVPEGRGRSSFLAVGC-DDSTVRILSLDPDSTLENKSVQALTSAP 653
Query: 531 RSV-LLCAFEGIS-----YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFS 584
++ ++ + S YL L G L +L+ TGEL+D + LG +P+ L + S
Sbjct: 654 SALSIMSMIDSTSGGSTLYLHIGLYSGIYLRTVLDEVTGELSDTRTRFLGVKPVKLFSVS 713
Query: 585 SKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFN 625
K V A S R L YS+V K + + P N
Sbjct: 714 VKEQRAVLALSSR--------SWLGYSDVQTKSFT-LTPLN 745
>gi|357111224|ref|XP_003557414.1| PREDICTED: DNA damage-binding protein 1-like [Brachypodium
distachyon]
Length = 1356
Score = 137 bits (344), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 122/464 (26%), Positives = 206/464 (44%), Gaps = 74/464 (15%)
Query: 295 NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVE--LQGIKGMWS 352
N+GPI+D + D + Q Q+ +CSG +GSLR++RNGI + + E QG+ G+W+
Sbjct: 478 NVGPILDLAIADYHGEKQDQMFSCSGMCPEGSLRVIRNGINVEKLLRTEPIYQGVTGLWT 537
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
LR D + +FLV++F+ ETRIL++ L + ++ GF TL C LVQ+
Sbjct: 538 LRMKRTDMYHSFLVLAFVEETRILSVGLSFN-DISDAVGFQPDVCTLACGLIADGVLVQI 596
Query: 413 TSGSVRLVSSTSRELRN--EWKSP------PGYSVNVATANASQVLLATGGGHLVYLEIG 464
S V+L T+ SP P +++V + V +AT +Y+
Sbjct: 597 HSKGVKLCLPTAYAHPEGAPLTSPVCVDWYPDVTISVGAVGHNIVAVATSNPCCLYILSV 656
Query: 465 DGI------LTEVKHAQLEYEISCLDI-------NPIGENPSYSQIAAVGMWTDISVRIF 511
+ L E++H QL YE+SC+ I +P+ ++ + + + VR+F
Sbjct: 657 RPLSSFQYELYEIQHVQLRYEVSCISIPEEDSRRSPVAVRRAFGRGKRNNLPAKVDVRMF 716
Query: 512 SL-----PDLNLITKE-----------------HLGGEI---IPRSVLLCAFEGISYLLC 546
++ P + +I+ E G + IP SV A E Y+L
Sbjct: 717 AVIGTHKPSVEVISLEPGEAFMLLSIGSISVNNSFGAPVSGRIPESVRFVASERF-YILA 775
Query: 547 ALGDGHLLNFLLNMKTGE---LTDR--KKVS---------------LGTQPITLRTFSSK 586
L +G LL F +T E L D K+ S +G P+ L S
Sbjct: 776 GLRNGMLLRF--ESETSEEHYLPDSFYKESSTHSVNTLLQLIAMRHIGITPVGLVPLSDS 833
Query: 587 NTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTI 646
+ + SDR ++++S L YS+++ SH+ P +S P L E L + +
Sbjct: 834 ANSDIILLSDRSWLLHASRHSLAYSSISFLPASHVTPVSSMDCPSGLLFVAENCLHLVEM 893
Query: 647 DDIQKLHIRSIPLGEHPRRICHQEQSRTFAI--CSLKNQSCAEE 688
++L+ + + + PR++ + SRT + L SC+ +
Sbjct: 894 VHGKRLNAQKLSIEGTPRKVLYHSDSRTLLVMRTGLTGASCSSD 937
>gi|323454388|gb|EGB10258.1| hypothetical protein AURANDRAFT_23619 [Aureococcus anophagefferens]
Length = 1212
Score = 137 bits (344), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 106/374 (28%), Positives = 176/374 (47%), Gaps = 12/374 (3%)
Query: 299 IVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSST 357
I D DL + QV G SLR++R+G+ ++E A EL G +W++R
Sbjct: 415 ITDVYCGDLAGEEAPQVYALCGKGHRSSLRVLRHGVAVSEMAVSELPGRPSAVWTVRGRH 474
Query: 358 DDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSV 417
D+P+D ++VVSF + T +L++ + +EE GF + TL N L+QV +
Sbjct: 475 DEPYDKYIVVSFTNATLVLSIG--ETVEEVTDSGFLATAPTLDVALLADNALLQVHGEGI 532
Query: 418 RLVSSTSRELR-NEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTEVKHAQ 475
R V +LR +EWK+P ++ A AN QV A GG ++Y E+ G L E+ +
Sbjct: 533 RHVRG---DLRISEWKTPGRKAIEKAAANERQVAAALAGGEVIYFELDASGALAELGTKE 589
Query: 476 LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPRSVL 534
L E++CLD+ + + + A+G W D S+R+ SL PD L+ + SV
Sbjct: 590 LGVEVACLDVGVVPAGRARAPFLALGGW-DGSLRLLSLAPDELLVQVATMQLGARAESVR 648
Query: 535 LC-AFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFA 593
C +G + L +G L ++ TG+L D + LG++ + L + A
Sbjct: 649 FCETPDGRLGVAAGLANGVLQRAAVDASTGQLGDARARFLGSRAVRLFRVDVGGAPGLLA 708
Query: 594 ASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDI-QKL 652
S R + Y+ +L + ++ + H F S P+ + L I D + +
Sbjct: 709 LSSRAWLCYAHAGRLETAPLSYDALEHAAGFKSEQCPEGVVAIAGSTLRIFVPDKLGETF 768
Query: 653 HIRSIPLGEHPRRI 666
+ ++PL PRR+
Sbjct: 769 NQSALPLRYTPRRL 782
>gi|302504587|ref|XP_003014252.1| hypothetical protein ARB_07557 [Arthroderma benhamiae CBS 112371]
gi|291177820|gb|EFE33612.1| hypothetical protein ARB_07557 [Arthroderma benhamiae CBS 112371]
Length = 762
Score = 136 bits (343), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 175/701 (24%), Positives = 296/701 (42%), Gaps = 100/701 (14%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
++G I +L FR G ++D++ I ++ + ++++ + R + + G R
Sbjct: 64 VFGIIRSLAAFRLAGSSKDYIIIGSDSGRITIVEY-VPAQNRFNRIHLETFGKSGVRRVV 122
Query: 65 NGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEE-------LQVLDIK 115
GQ +DP R LI + L V+ + + +L + + L LD+
Sbjct: 123 PGQYLAVDPKGRACLIASVEKNKLVYVLNRNAQAELTISSPLEAHRPQTVVFALTALDVG 182
Query: 116 F---LYGCAKPTIVVLYQDNKDARHVKT------YEVALKDKDFVEGPWSQNNLDNGADL 166
+ ++ + + QD + +T YE+ L V W+ + +D A +
Sbjct: 183 YENPIFAALEVEYTEVDQDPTGQAYEETEKMLVYYELDLGLNHVVRR-WA-DPVDRTASM 240
Query: 167 LIPVP-----PPLCGVLIIGEETIVYCSAN--AFKA-IPIR--PSITKAYGRVDADGSR- 215
L VP P GVL+ E+ IVY +N AF+ IP R P+ R G
Sbjct: 241 LFQVPGGADGP--SGVLVCAEDNIVYRHSNQDAFRVPIPRRRGPTENPERKRCITAGVMH 298
Query: 216 -------YLLGDHAGLLH---LLVITHEKEKVTG----LKIELLGETSIASTISYLDNAV 261
+LL G L + ++ E EK TG LK++ +AS++ L +
Sbjct: 299 KMRGAFFFLLQSEDGDLFKVTMEMVEDENEKATGEVKRLKLKYFDTVPLASSLCILKSGF 358
Query: 262 VYIGSSYGD-----------------------SQLIKLNLQP----DAKGSYVEVLERYV 294
+++ S G+ S +I L P + ++E
Sbjct: 359 LFVASETGNQHFYQFEKLGDDDDEIEFISDDCSAVISEPLPPVYFRPRPAENLNLVESIA 418
Query: 295 NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSL 353
+L P++ + ++ + Q+ T G S R +++G+ ++E EL + +W+
Sbjct: 419 SLNPLMAASIANITEEDAPQIYTLCGTGARSSFRTLKHGLEVSEIVESELPSVPSAVWTT 478
Query: 354 RSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVT 413
+ S +D FD ++V+SF + T +L++ + +EE GF S TL + L+QV
Sbjct: 479 KLSRNDQFDAYIVLSFSNGTLVLSIG--ETVEEVTDTGFLSSAPTLAVQQLGEDSLIQVH 536
Query: 414 SGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTEV- 471
+R + + R NEW +P S+ AT N QV +A G +VY E+ DG L E
Sbjct: 537 PKGIRHIHADQR--VNEWPAPQHRSIVAATTNERQVAIALSSGEIVYFEMDTDGSLAEYD 594
Query: 472 KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIP 530
+ Q+ ++CL + + E S AVG D +VRI SL PD L K P
Sbjct: 595 EKRQMSGTVTCLSLGEVPEGRVRSSFLAVGC-DDSTVRILSLDPDSTLENKSVQALTSAP 653
Query: 531 RSV-LLCAFEGIS-----YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFS 584
++ ++ + S YL L G L +L+ TGEL+D + LG +P+ L + S
Sbjct: 654 SALSIMSMIDSTSGGSTLYLHIGLYSGIYLRTVLDEVTGELSDTRTRFLGVKPVKLFSVS 713
Query: 585 SKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFN 625
K V A S R L YS+V K + + P N
Sbjct: 714 VKEQRAVLALSSR--------SWLGYSDVQTKSFT-LTPLN 745
>gi|452986188|gb|EME85944.1| hypothetical protein MYCFIDRAFT_59215 [Pseudocercospora fijiensis
CIRAD86]
Length = 1223
Score = 136 bits (343), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 228/1017 (22%), Positives = 412/1017 (40%), Gaps = 176/1017 (17%)
Query: 155 WSQNNLDNGADLLIPVP-----PPLCGVLIIGEETIVYCSANAFK-AIPI--RPSITKAY 206
WS+ +D A++LI VP P GVL+ E+ I Y A AIPI R T+
Sbjct: 238 WSEP-VDYSANMLIGVPGGQNGP--SGVLVCCEDRIYYKHDKAASLAIPIPRRKGATE-- 292
Query: 207 GRVDADGSRYLLGDHAGLLHL--------LVITHEKEKVTGLKIELLGETSIAST----- 253
D + RY++ A LHL ++ E V L I + + T
Sbjct: 293 ---DPNRKRYIV---ASCLHLAASRKEFFFLLQTEDGDVFKLNITMAVDAQGRQTADPEQ 346
Query: 254 --ISYLDN------------AVVYIGSSYGDSQLIKLN------------------LQPD 281
+ Y D +YI + G+SQL ++ + PD
Sbjct: 347 IILKYYDTFPIAKQMLLHKKGFLYIATENGNSQLYHVDDLADDLEFEPHNNFTSDGVSPD 406
Query: 282 AKGSY------------VEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRI 329
+Y + L P++ V +L + Q+ G +
Sbjct: 407 PAEAYEPTYFQPRELTMTHLAVDVPGLHPLMRTKVDNLTGEDAPQIYGIQGTGNKSLFKT 466
Query: 330 VRNGIGI-----NEQASVELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDEL 384
+R+G+ + N +V GI W+ + + D +L++S +A ++ D +
Sbjct: 467 IRHGLDVEIMIDNNMGNVPYDGI---WTFKHRSSDEHHKYLIISSSYGDLTVACSIGDSV 523
Query: 385 EETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATA 444
E+ E F T+ + LVQV + +R + T NEW +PP +V A+A
Sbjct: 524 EQIENSPFLENRATVHAQQMGDSTLVQVHARGIRSILETG--AFNEWPTPPHRTVAAASA 581
Query: 445 NASQVLLATGGGHLVYLEIG-DGILTEVKH-AQLEYEISCLDINPIGENPSYSQIAAVGM 502
N Q+LLA L + +G DGIL +++ ++ +I+ + + + ++ A VG
Sbjct: 582 NERQLLLALSSAELAFFFMGEDGILIQLEEMPEMSGKITAISVGQTPKGRQQAKYAVVGC 641
Query: 503 WTDISVRIFSLP-DLNLITKEHLGGEIIPRSV----LLCAFEGISYLLCALG--DGHLLN 555
D ++R+ S+ D L + +P S+ +L G + + +G G L
Sbjct: 642 -EDCTIRVLSIELDSPLEARSVQALSAVPTSLEVVEMLDPASGTTVNVVHIGLRSGLYLR 700
Query: 556 FLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNL 615
+++ TGEL D + LGT+P L V A S RP + Y+ + LY+ L
Sbjct: 701 AIIDETTGELGDVRTKFLGTKPPRLCPVEVDGEDCVLACSSRPWLGYNHPQSKLYTVTPL 760
Query: 616 --KEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQ-KLHIRSIPLGEHPRRICHQEQS 672
+ + F+S AI + L I +I I+ +L S+ L PR +
Sbjct: 761 IAEPMEAARSFSSPELKGLCAI-QGSSLLIFSIPSIEGRLSHSSVSLQYTPRSMTRNPWY 819
Query: 673 RTFAICSLKNQ--SCAEESEMHFVRLLDDQTFEFISTY------------------PLDT 712
+ + + S A ++ + DD + + + P+
Sbjct: 820 PLWYVAQSEGNTLSQATRDQLRGKSIGDDDEAKAMERHLGLPRGNGHWASCIQAVDPIGG 879
Query: 713 FEYGCSI--------LSCSF----SDDSNVYYCVGTA-YVLPEENEPTKGRILVF-IVED 758
E C+I LSC+ S + VY VGT ++ P T G + ++ +++D
Sbjct: 880 REVTCTIELGENEAALSCACVAFESKNWEVYLAVGTGQHMQPGTGVQTAGYVHIYKLLKD 939
Query: 759 G-KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGH---HG 814
G +L+ + + + + VY+L F G+L + ++ +Y D G + L + +
Sbjct: 940 GAELEFVHKTKFELPVYALMPFRGRLALGVGNELFIY-----DMGMKALLRKARNIAVPN 994
Query: 815 HILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG 874
I++L +++G+ I+ GD+ + ++ L+YK + D W + ++D + G
Sbjct: 995 QIVSL--ESQGNRIICGDVSEGVTYLVYKPTFNRLIPFVDDTVQRWTTTTTMVDYETAAG 1052
Query: 875 AENNFNLFTVR---KNSEGATDEERG---------------RLEVVGEYHLGEFVNRFRH 916
+ NL+ VR + S+ A +E G RL++ Y + +
Sbjct: 1053 GDKFGNLWIVRCPEQPSQEADEEGAGGYIMNERSYLNGAPYRLDLRAHYFCQDIPMSMQR 1112
Query: 917 GSLVMRLPDSDVGQIPTVIFGTVNGVIGV-IASLPHEQYLFLEKLQTNLRKVIKGVGGLN 975
+LV G + + + G +G+ I + E F L+ +R + G +
Sbjct: 1113 TALV-------AGGQEVLFWSGLQGTLGILIPFVTREDVEFFTALEQQMRTEDPPLAGRD 1165
Query: 976 HEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
H +RS+ V K +DGDL+E F+ LS +I+ ++ SV+E+ K+V+E+
Sbjct: 1166 HLMYRSY-----YVPVKGVIDGDLLERFMGLSYDTKQKIAAEVDRSVKEIEKKVQEM 1217
>gi|169599248|ref|XP_001793047.1| hypothetical protein SNOG_02443 [Phaeosphaeria nodorum SN15]
gi|160704565|gb|EAT90655.2| hypothetical protein SNOG_02443 [Phaeosphaeria nodorum SN15]
Length = 1246
Score = 135 bits (339), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 222/1003 (22%), Positives = 397/1003 (39%), Gaps = 156/1003 (15%)
Query: 155 WSQNNLDNGADLLIPVP---PPLCGVLIIGEETIVY-----CSANAFK-AIPIRPSITKA 205
WS+ +D A+ L VP GVL G + I Y ++N + AIP R T+
Sbjct: 230 WSEP-VDRTANALYRVPGGPNAPSGVLCCGLDNITYRRIFNTNSNVHRLAIPRREGATED 288
Query: 206 YGR-----------VDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTI 254
R + YLL G + L + V +KI+ +A++I
Sbjct: 289 PNRKRSIVAGTLYSLKGGDFFYLLQTDDGDVFKLTVDAPSGTVERIKIKYFDTIPVATSI 348
Query: 255 SYLDNAVVYIGSSYGDSQLIKLN-----------------LQPDA----------KGSYV 287
L VY GD L +L + P+A + +
Sbjct: 349 CILRAGFVYAACESGDRILYELESLGDETDDPMFESGQFPVDPEAVFAPPFFKPRALTNL 408
Query: 288 EVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVEL-QG 346
+E +L PI+ + + + Q+ T +GA + R RN + + + L Q
Sbjct: 409 TAVETMPSLNPIMGMEIANPMLEDAPQIYTINGAGGRSTFRTTRNALEVLDLIESPLPQN 468
Query: 347 IKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIY 406
+W+ + + +D DT +V+ +R L + + +++EE GF + T TL
Sbjct: 469 ASDVWTTKLTLEDETDTLIVLCL--HSRTLVLKIGEDVEEASNTGFMADTNTLGVQQFGE 526
Query: 407 NQLVQVTSGSVRLVSS---------TSRELRNEWKSPPGYSVNVATANASQVLLATGGGH 457
+ ++QV +R + + +W+ P ++ N QV +A G
Sbjct: 527 DCIIQVHPKGIRHIQGIQFPEDGAVATHTSLTDWQPPAHRTIVACATNNRQVAIALSSGQ 586
Query: 458 LVYLEI-GDGILTEVKHA-QLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-P 514
++Y E DG L + L+ I+CL I + E + AVG +D +VRIF+L P
Sbjct: 587 ILYFECDADGSLAMAEEEIALDSTINCLAIPDVPEGSVRAFFMAVGC-SDQTVRIFNLSP 645
Query: 515 DL--NLITKEHLGGEIIPRSVLLC------AFEGIS-YLLCALGDGHLLNFLLNMKTGEL 565
D+ N++ + P S L + G S +L L G + +L+ TG++
Sbjct: 646 DMEGNILRSISVQALTSPPSGLTINMMSDKSSRGYSQFLHIGLRSGVYIRSVLDEMTGDI 705
Query: 566 TDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKK--LLYSNVNLKEVSHMCP 623
D ++ LG +PI + + A + RP + Y+ + L + +N
Sbjct: 706 GDTRRRFLGPEPIKFAKVTVAGEPAILAMTSRPWLGYTHPRTGVLQLTPLNYIAFKSAWN 765
Query: 624 FNSAAFPDSLAIAKEGELTIGTIDDI------QKLHIRSIP---LGEHPRRICHQEQSRT 674
F + F + ++ EL I T +D+ + + ++ P +G H + + + QS
Sbjct: 766 FEGSQFKGIICVSAN-ELRIFTFNDLTDNTTYETIQLKYTPRKMVGYHEQGVFYVIQSEN 824
Query: 675 FAICSLKNQSCAEES-------------------EMHFVRLLDDQTFEFISTYPLDTFEY 715
+ + Q E+ E R D+ YP +
Sbjct: 825 NTLSADTRQKLIAEADTKQEDEDDAPMEIEGANGETTNGRATSDELPPTAFGYPRAQGNW 884
Query: 716 GC-----------------------SILSCSF----SDDSNVYYCVGTAYVLP-EENEPT 747
S++S + S ++ + VGTA L + +
Sbjct: 885 ASCIQVVDPVTEKAVTHTVEINANQSLVSAALVYFESRGNDAFLAVGTAKDLSFQPYHYS 944
Query: 748 KGRILVF-IVEDGK-LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRE 805
I V+ I DG+ L+ E E +L AF GK++A + + + LY +R R+
Sbjct: 945 SASIQVYKISPDGRELEFFHETEVTEPPTALLAFKGKMVAGVGRNLALYDCGMRS-VLRK 1003
Query: 806 LQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHE--EGAIEERARDYNANWMSA 863
Q+ I+ L +T+G +VV D ++S++ +++K + + D +
Sbjct: 1004 AQATNCVPTRIVDL--KTQGSRLVVSDSVQSVTYVVHKDQIHPNRMIPFIDDTIQRHTTC 1061
Query: 864 VEILDDDIYLGAENNFNLFTVR---KNSEGATDEERGRLEVVGEYHLGEFVNR------F 914
E+LD D +G + NL+ VR K SE + + G +V + +LG NR +
Sbjct: 1062 TEMLDYDTTVGGDKFGNLWLVRCPQKVSESSDESPDGSDILVDKSYLGGTANRLDSVAHY 1121
Query: 915 RHGSLVMRLPDSDV--GQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGV 971
+ + + + + G + + + G +G + Q+ ++L+ LR K +
Sbjct: 1122 FANDIPISIQKTHLLSGAERIIFWAGLQGTLGALIPFNSRRQHKMFQQLELQLRSDDKPM 1181
Query: 972 GGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEI 1014
G +H +RS+ N K+V +DGDLIE FL LSR + + I
Sbjct: 1182 SGRDHLAFRSYFNPVKSV-----IDGDLIERFLVLSRDKRESI 1219
>gi|124359136|gb|ABD32504.2| CPSF A subunit, C-terminal; WD40-like [Medicago truncatula]
Length = 632
Score = 135 bits (339), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 161/647 (24%), Positives = 279/647 (43%), Gaps = 80/647 (12%)
Query: 429 NEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTEVKHAQLEYEISCLDINP 487
NEW++P ++ +N QV++A GG L+Y E+ G L EV+ ++ +++CLDI P
Sbjct: 18 NEWRTPGKRTIAKVGSNRLQVVIALNGGELIYFEVDVTGQLMEVEKHEMSGDVACLDIAP 77
Query: 488 IGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPRSVLLCAFEGISYLLC 546
+ E S+ AVG + D ++RI SL PD + T LG + + + F +
Sbjct: 78 VPEGRQRSRFLAVGSY-DKTIRILSLDPDDCMQT---LGIQSLSSASESLLFLEVQ---- 129
Query: 547 ALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNK 606
+G L +++M TG L+D + LG +P L + + S RP + Y
Sbjct: 130 -ASNGVLSRTVVDMVTGLLSDSRSPFLGLRPPKLFPIVVRGKRAMLCLSSRPWLGYIHQG 188
Query: 607 KLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDI-QKLHIRSIPLGEHPRR 665
L + ++ + + F+S + + L I TI+ + + + IPL P +
Sbjct: 189 HFLLTPLSYETLEFAASFSSDQCVEGVVALAGEALRIFTIERLGETFNETVIPLRYTPMK 248
Query: 666 ICHQEQSRTFAIC-SLKNQSCAEESEM--------------HF-------------VRLL 697
Q + + + S + AEE E H+ +R+L
Sbjct: 249 FVLQPKRKLLVVIESDQGAFTAEEREAAKKDEDKDDPLSDEHYGYPKAESDKWASCIRIL 308
Query: 698 DDQTFEFISTYPLDTFEYGCSILSCSFSD-DSNVYYCVGTAYVL--PEENEPTKGRILVF 754
D +T L E S + +F D + VGTA L T G I ++
Sbjct: 309 DPKTGNTTCLLELQDNEAAFSGCTVNFHDKEYGTLLDVGTAKGLQFTPRRSLTAGFIHIY 368
Query: 755 -IVEDGK-LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGH 812
+EDG+ L+L+ + + +G +L+ F G+LLA I ++ Y D G R L + +
Sbjct: 369 RFLEDGRSLELLHKTQVEGVPLALSQFQGRLLAGIGPVLRFY-----DLGKRRLLRK--Y 421
Query: 813 HGHILA---LYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDD 869
+ + +QT D I VGD +S Y+ +E + A D W++A +D
Sbjct: 422 ENKLFPNTIVSIQTYRDRIYVGDTQESFHYCKYRWDENQLYIFADDCVPRWLTASYHIDF 481
Query: 870 DIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVG 929
D + K +G + ++E + ++H+G+ ++ + SL+ G
Sbjct: 482 DTMAEDPTGGRI----KWEQGKLNGAPNKVEEIVQFHVGDVISCLQKASLI-------PG 530
Query: 930 QIPTVIFGTVNGVIGVI-ASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKT 988
++ GTV G IG + A + F L+ ++R+ + G +H +RS
Sbjct: 531 GGECILNGTVMGSIGALHAFTSRDDVDFFSHLEMHMRQDNPPLCGRDHMAYRS-----AY 585
Query: 989 VDAKNFLDGDLIESF----LDLSRTRMDEISKTMNVSVEELCKRVEE 1031
K+ +DGDL E F +DL R DE+ +T E+ K++EE
Sbjct: 586 FPVKDVIDGDLCEQFPTLPMDLQRKIADELDRTRG----EILKKLEE 628
>gi|218199276|gb|EEC81703.1| hypothetical protein OsI_25307 [Oryza sativa Indica Group]
Length = 1429
Score = 134 bits (336), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 146/610 (23%), Positives = 256/610 (41%), Gaps = 94/610 (15%)
Query: 223 GLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDA 282
G HLL + + E V L E + + ++D +V GD +++L
Sbjct: 414 GEFHLLEFSLDMEGVKVLP-ECVHRGLPCKPLLWMDKGMVVGFVEMGDGMILQLENNRLV 472
Query: 283 KGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASV 342
S ++ N+ PI+D + D + Q Q+ C G +GSLR++RNG+ + +
Sbjct: 473 HKSAIQ------NVAPILDLAIADHHGEKQDQMFACCGMCPEGSLRVIRNGVNVEKLLRT 526
Query: 343 E--LQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLF 400
+ G+ G+W+LR D + +FLV+SF+ ETRIL++ L + + GF + TL
Sbjct: 527 DPIYHGVTGLWTLRMKRTDAYHSFLVLSFVEETRILSVGLSFN-DICDAVGFQTDVCTLA 585
Query: 401 CHDAIYNQLVQVTSGSVRLVSSTS----------RELRNEWKSPPGYSVNVATANASQVL 450
C LVQ+ S V+L T+ + +W P +++V + V+
Sbjct: 586 CGLVADGLLVQIHSKCVKLCLPTACAHPEGTLLPSPVCADWY--PDVTISVGAVGHNVVV 643
Query: 451 LATGGGHLVYLEIGDGILT-------EVKHAQLEYEISCLDINPIG---ENPSYSQIAAV 500
+AT +Y+ +G L+ E++H QL YE+SC+ I +N S S +
Sbjct: 644 VATSNPCCLYI-LGVRSLSSFQYELYEIQHVQLHYEVSCISIPQEDWRLDNSSSSCATSG 702
Query: 501 GMWTDIS--VRIFSL-----PDLNLITKE-----------------HLGGEI---IPRSV 533
D + +R F++ P +++I+ E LG I IP +V
Sbjct: 703 DFRKDFAANIRKFAVIGTHEPSVHIISLEPGEAFQQLAVGHISVNNALGTPISGCIPENV 762
Query: 534 LLCAFEGISYLLCALGDGHLLNFLLNMKTGEL----------TDRKKVSL--------GT 575
A Y+L L +G LL F G T SL G
Sbjct: 763 RFVAAARF-YILAGLRNGMLLRFESQTSKGHCFPGSFYKESSTPCDDTSLMLIAVRRIGI 821
Query: 576 QPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAI 635
P+ L + + DRP +++S+ L YS+++ SH+ P +S P+ L
Sbjct: 822 TPVVLVPLHDRANADIIVLGDRPWLLHSARHSLAYSSISFLPASHVTPVSSTDCPNGLLF 881
Query: 636 AKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAI--CSLKNQSCAEESEMHF 693
E L + + ++L+ + +G PR++ + SRT + L + SC+ +
Sbjct: 882 VSENCLHLVEMVHGKRLNAQKFSIGGTPRKVLYHSDSRTLLVLRTGLTSVSCSSD----- 936
Query: 694 VRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAY-----VLPE-ENEPT 747
+ +D +S + + E + +D + VGT+ ++P E E
Sbjct: 937 IVQIDPSNGALLSRFKCEPGETAKCMQIAKIGNDQVLI--VGTSKSNGRPMMPNGEAESI 994
Query: 748 KGRILVFIVE 757
KGR+++ +E
Sbjct: 995 KGRLILLSLE 1004
>gi|300176205|emb|CBK23516.2| unnamed protein product [Blastocystis hominis]
Length = 702
Score = 133 bits (335), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 117/441 (26%), Positives = 205/441 (46%), Gaps = 53/441 (12%)
Query: 621 MCPFNS------AAFPDS------LAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICH 668
M P +S A+ PD A+ L++G IDD++ +H IPL + +
Sbjct: 227 MAPLDSPPLCSCASLPDEHEHRFFFVSARSSALSLGEIDDVRSVHAARIPLQHNLLALSV 286
Query: 669 QEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGC-----SILSCS 723
Q +++ + + + + ++L D +T+ PL E SI S
Sbjct: 287 QPATQSLLLLLSRAAASQNAA---LLQLRDPKTYAIRDELPLKPSEIALCVASGSIFPLS 343
Query: 724 FSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKL 783
+ + N + VGTA+VLPEENEP++GR+LV + +L+L+AE G S+ F GK+
Sbjct: 344 NAPERNEVFVVGTAFVLPEENEPSQGRLLVLRAVEHRLELVAETMLSGGCLSICLFKGKV 403
Query: 784 LAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYK 843
+ +N ++Q++ + +L SE + +L + I +GD++ S+ ++YK
Sbjct: 404 VCGVNSELQVFDVDEKTSTISKLASEVACIS-VTSLSPNEADETIALGDILYSV--VVYK 460
Query: 844 -------------HEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEG 890
E A E R RD A + + ++ +G + NL ++ E
Sbjct: 461 LVLEVVRGRQLAQLECIASERRRRDVTA--LERLPEAQSEMVVG-DAYGNLMVMQVVEEA 517
Query: 891 ATDEERGRLEVVGE--YHLGEFVNR------FRHGS----LVMRLPDSDVGQIPTVIFGT 938
D + VV + +HL + +NR FR G+ R +S++ + F T
Sbjct: 518 DLDRSNPQKIVVTKESFHLDDQINRFVPVQLFRSGAEDKKKEKRAEESEIAF--NLAFAT 575
Query: 939 VNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGD 998
V+G IG+I +L ++ L ++T + VI VGGL+H+QWR N + +DGD
Sbjct: 576 VSGRIGMIGALNDREFRMLRAIETAMENVITPVGGLDHKQWRCSNTPFGIKNLAYCIDGD 635
Query: 999 LIESFLDLSRTRMDEISKTMN 1019
L+E FL+L +I+ +++
Sbjct: 636 LVEMFLELDDESQAKIADSVS 656
>gi|326510951|dbj|BAJ91823.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 1360
Score = 133 bits (334), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 118/471 (25%), Positives = 207/471 (43%), Gaps = 86/471 (18%)
Query: 295 NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVE--LQGIKGMWS 352
N+GP++D + D + Q Q+ +C G +GSLR++RNG+ + + E QGI G+W+
Sbjct: 480 NVGPMLDLAIADYHGEKQDQLFSCCGMCPEGSLRVIRNGVNVEKLLRTEPIYQGITGLWT 539
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
LR D + +FLV++F+ ETRIL++ L + ++ GF + TL C LVQ+
Sbjct: 540 LRMKRTDTYHSFLVLAFVEETRILSVGLSFN-DISDAVGFQPEVCTLACGLIADGLLVQI 598
Query: 413 TSGSVRLVSSTS----------RELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLE 462
S V+L TS + +W P +++V V +AT +Y+
Sbjct: 599 HSKGVKLCLPTSYAHPEGATLTSPVCVDWY--PDVTISVGAVGHDVVAVATSNPCCLYIL 656
Query: 463 IGDGI------LTEVKHAQLEYEISCLDINPIGENPSYSQIAAVG----------MWTDI 506
+ L E +H QL+YE+SC+ I P ++ + +A+G ++
Sbjct: 657 RVRPLSSLQYELYETQHVQLQYEVSCISI-PEEDSRLRTPSSAIGGDFRERKGNNSVAEV 715
Query: 507 SVRIFSL-----PDLNLITKE-----------------HLGGEI---IPRSVLLCAFEGI 541
+VR+F++ P + +I+ E LG + IP +V + A E
Sbjct: 716 NVRMFAVIGTHKPSVEVISLEPDEAFRLLSTGSISVNNALGAPVSGCIPENVRVVASERF 775
Query: 542 SYLLCALGDGHLLNFLLNMKTGELTDRKKVS----------------------LGTQPIT 579
Y+L L +G LL F ++G D+ +G P+
Sbjct: 776 -YILAGLRNGMLLRF----ESGTSKDQYLPGSFYKESFAPSLNTLLQLVAIRHIGITPVG 830
Query: 580 LRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEG 639
L + + SDR ++++S L YS+++ SH+ P +S P L E
Sbjct: 831 LVPLRDSANSDIIVLSDRSWLLHASRHSLAYSSISFLPASHVIPVSSVDCPSGLLFVAEN 890
Query: 640 ELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAI--CSLKNQSCAEE 688
L + + ++L+ + +G PR++ + SRT + L SC+ +
Sbjct: 891 CLHLVEMVHGKRLNAQKFSIGGTPRKVLYHSDSRTLLVMRTGLTGASCSSD 941
>gi|398407593|ref|XP_003855262.1| hypothetical protein MYCGRDRAFT_69118 [Zymoseptoria tritici IPO323]
gi|339475146|gb|EGP90238.1| hypothetical protein MYCGRDRAFT_69118 [Zymoseptoria tritici IPO323]
Length = 1218
Score = 133 bits (334), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 173/807 (21%), Positives = 340/807 (42%), Gaps = 92/807 (11%)
Query: 296 LGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIK--GMWSL 353
L P++ V +L + Q+ G + +R+G+ + E + + + +W+
Sbjct: 428 LHPLIKTKVDNLTSEDAPQIYAIQGTGNKSQFKTIRHGLNVEEIINNSMGNVPYDNIWTF 487
Query: 354 RSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVT 413
+ + D F +L++S +A ++ D +E+ E F T+ LVQV
Sbjct: 488 KHRSTDEFHRYLLLSSSYGDLTIACSIGDSVEQIENSSFLENRATVHAQQMGDATLVQVH 547
Query: 414 SGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-GDGILTEVK 472
+ +R + + NEW +P +V VA+AN Q+LLA L + + DGIL +++
Sbjct: 548 ARGIRSILESG--ALNEWPTPAHRTVVVASANERQLLLALSSAELAFFFMQDDGILVQLE 605
Query: 473 H-AQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLP-DLNLITKEHLGGEIIP 530
++ +++ L + + ++ A VG D ++R+ S+ D L + +P
Sbjct: 606 EMPEMSGKVTALSVGRTPKGRQQAKYAVVGC-DDCTIRVLSIELDSPLEPRSVQALSAVP 664
Query: 531 RSVLLCAFEG------ISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFS 584
S+ + ++ + L G L +++ TGEL D + LGT+P L
Sbjct: 665 TSLEVVEMHDPSSNSYVNVVHIGLQSGLYLRAVIDETTGELGDVRTKFLGTRPPRLCPVE 724
Query: 585 SKNTTHVFAASDRPTVIYSSNKKLLYSNVNL-------------KEVSHMCPFNSAAFPD 631
+ V A S RP + Y+ + Y+ L ++ +C ++
Sbjct: 725 INDQECVLACSSRPWLGYNHPQSQQYTVTPLIAEPIEAARSFISPDLVGLCAIQGSSLLI 784
Query: 632 SLAIAKEGELTIGTID-DIQKLHIRSIPLGEHPRRICHQEQSRTFAIC---SLKNQSCAE 687
+ EG L+ TID + + PL +P Q T + L+ +S +
Sbjct: 785 MQVPSIEGRLSSSTIDLNYTPRAMDRNPL--YPIWYTAQADGNTLSKAMRDQLRGKSIDD 842
Query: 688 ESEM--------------HF---VRLLDDQTFEFIS-TYPLDTFEYGCSILSCSF-SDDS 728
+ E H+ ++ +D T + ++ T L E I F S
Sbjct: 843 DEEATALEKHLGVSRGTGHWASCIQAIDPITQKAVTHTIELGENEAALCIACVPFTSRQD 902
Query: 729 NVYYCVGTA-YVLPEENEPTKGRILVF-IVEDG-KLQLIAEKETKGAVYSLNAFNGKLLA 785
V+ VGT ++ P ++P+ G + ++ ++E+G KL+ + + +Y+L FNG+L
Sbjct: 903 EVFLAVGTGQHMSPGTSQPSTGFVHIYRLLEEGTKLEFVHRTQFDSPIYALCKFNGRLAL 962
Query: 786 AINQKIQLYKWMLRDDGTRELQSEC-GHHGHILALYVQTRGDFIVVGDLMKSISLLIYKH 844
+ ++ +Y D G + L + G L ++ + G+ ++ D+ +S++ ++YK
Sbjct: 963 GVGNELFIY-----DMGMKHLLRKARGTAVPNLITHITSVGNRLICADVSESVTYVVYKP 1017
Query: 845 EEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATD-EERG------ 897
+ D W +A ++D + GA+ NL+ VR +T+ +E G
Sbjct: 1018 AFNRLIPFVDDTIQRWTTATALVDYETVAGADKFGNLWVVRCPEATSTEADEDGAGGYIM 1077
Query: 898 -----------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVI 946
RLE+ + + + SLV G + + + G +G++
Sbjct: 1078 NERSYLGGAPYRLELRSHFFANDIPTSLQRTSLV-------AGGAEVIFWSGLQGTLGML 1130
Query: 947 AS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLD 1005
+ E F L++ +R+ + G +H +RS+ V K +DGDL E +L
Sbjct: 1131 VPFVSREDVEFFTALESQMRQEDPPLAGRDHLMYRSY-----YVPVKGVIDGDLCERYLG 1185
Query: 1006 LSRTRMDEISKTMNVSVEELCKRVEEL 1032
L +I+ ++ SV+E+ K+V+E+
Sbjct: 1186 LREDVKGKIAAEVDRSVKEVEKKVQEM 1212
>gi|326519701|dbj|BAK00223.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 1360
Score = 132 bits (333), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 118/471 (25%), Positives = 207/471 (43%), Gaps = 86/471 (18%)
Query: 295 NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVE--LQGIKGMWS 352
N+GP++D + D + Q Q+ +C G +GSLR++RNG+ + + E QGI G+W+
Sbjct: 480 NVGPMLDLAIADYHGEKQDQLFSCCGMCPEGSLRVIRNGVNVEKLLRTEPIYQGITGLWT 539
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
LR D + +FLV++F+ ETRIL++ L + ++ GF + TL C LVQ+
Sbjct: 540 LRMKRTDTYHSFLVLAFVEETRILSVGLSFN-DISDAVGFQPEVCTLACGLIADGLLVQI 598
Query: 413 TSGSVRLVSSTS----------RELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLE 462
S V+L TS + +W P +++V V +AT +Y+
Sbjct: 599 HSKGVKLCLPTSYAHPEGATLTSPVCVDWY--PDVTISVGAVGHDVVAVATSNPCCLYIL 656
Query: 463 IGDGI------LTEVKHAQLEYEISCLDINPIGENPSYSQIAAVG----------MWTDI 506
+ L E +H QL+YE+SC+ I P ++ + +A+G ++
Sbjct: 657 RVRPLSSLQYELYETQHVQLQYEVSCISI-PEEDSRLRTPSSAIGGDFRERKGNNSVAEV 715
Query: 507 SVRIFSL-----PDLNLITKE-----------------HLGGEI---IPRSVLLCAFEGI 541
+VR+F++ P + +I+ E LG + IP +V + A E
Sbjct: 716 NVRMFAVIGTHKPSVEVISLEPDEAFRLLSTGSISVNNALGAPVSGCIPENVRVVASERF 775
Query: 542 SYLLCALGDGHLLNFLLNMKTGELTDRKKVS----------------------LGTQPIT 579
Y+L L +G LL F ++G D+ +G P+
Sbjct: 776 -YILAGLRNGMLLRF----ESGTSKDQYLPGSFYKESFAPSLNTLLQLVAIRHIGITPVG 830
Query: 580 LRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEG 639
L + + SDR ++++S L YS+++ SH+ P +S P L E
Sbjct: 831 LVPLRDSANSDIIVLSDRSWLLHASRHSLAYSSISFLPASHVIPVSSVDCPSGLLFVAEN 890
Query: 640 ELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAI--CSLKNQSCAEE 688
L + + ++L+ + +G PR++ + SRT + L SC+ +
Sbjct: 891 CLHLVEMVHGKRLNAQKFSIGGTPRKVLYHSDSRTLLVMRTGLTGASCSSD 941
>gi|412993390|emb|CCO16923.1| predicted protein [Bathycoccus prasinos]
Length = 1273
Score = 132 bits (331), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 167/737 (22%), Positives = 294/737 (39%), Gaps = 109/737 (14%)
Query: 7 YGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDNG 66
+G I ++ FR G +D + I ++ + VL++DAE ++ R G
Sbjct: 58 FGIIRSMAAFRLAGANKDHVVIGSDSGRIVVLRFDAEKNQFEQVHKETFGKSGVRRVVPG 117
Query: 67 QIGIIDPDCRLIGLHLYDGLFKVI---------------PFD-NKGQLKEAFNI-----R 105
Q +DP R + L + KV+ P + NKG+ AF++
Sbjct: 118 QFCCVDPKGRAVMLAALEKQ-KVVYVLNRDSQANLTISSPLEANKGKTI-AFDVCALDNG 175
Query: 106 LEELQVLDIKFLYGCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGAD 165
LE + I+ YG A + + +A+ + TY + V WS+ +DNGA+
Sbjct: 176 LENPGFMAIELFYGEAD--VDDTGEAAAEAQKMLTYYELDLGLNHVARKWSEP-VDNGAN 232
Query: 166 LLIPVP---PPLCGVLIIGEETIVYCSANAFK---AIPIRPSITKAYGRVDA-------- 211
LIPVP GV++ E +VY + + IP R S+ G + A
Sbjct: 233 KLIPVPGGSDGPGGVIVCCENFLVYRAEKHEEIRCVIPRRTSLDAERGVLIASFASHRSK 292
Query: 212 DGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDS 271
+G ++ G + + + KV+ LK++ + S + L ++ GS +G
Sbjct: 293 NGFFFIAQSEYGDCYKVTLDWTNRKVSELKMKYFDTVPVCSALCVLKTGFLFCGSEFGAH 352
Query: 272 QLIKL-----------------------------------------NLQP----DAKGSY 286
L + N QP K
Sbjct: 353 ALFQFIALGDDEESAESSSKTLKKIDNATKKKGRGKNDDEDDEEEDNFQPVFFNPRKLKN 412
Query: 287 VEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG 346
+ +++ +L P + ++ +++ G + R++R G+ ++E A L G
Sbjct: 413 LALIDEIESLSPTIALASNISRKEETPRLLALCGQGPRSTFRVLRQGVPLSEMARSPLPG 472
Query: 347 -IKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCH--- 402
G++++R S D D ++VVSF + T +L++ D +EE G + + TL
Sbjct: 473 NPNGVFTIRKSKSDTTDAYIVVSFTNATLVLSIG--DTVEEVTDTGILATSSTLAVSALG 530
Query: 403 DAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLE 462
D L+Q+ VR V ++ + NEW++P + N Q ++A GG LVY E
Sbjct: 531 DDDDGSLIQIHPSGVRHVRGNNKGV-NEWRAPGRKKITACACNRGQAIVALTGGELVYFE 589
Query: 463 IGD-GILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLIT 520
+ + G L E++ + E+ +DI PI + ++ AAV + D +VR+ SL P L T
Sbjct: 590 LDEAGQLLEIEKIETSSEVVSVDIPPIPDGSLRAKFAAVAGY-DSTVRVLSLNPGEALRT 648
Query: 521 KEHLGGEIIPRSVLLCAFEGIS--------------YLLCALGDGHLLNFLLNMKTGELT 566
P SVLL + +L L +G L+ ++ +G L+
Sbjct: 649 VGVQATPSPPESVLLLEVNNKTTKNKKAGSSAPPSMFLNVGLSNGILVRCEVDRVSGALS 708
Query: 567 DRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNS 626
D + +G +P L + A S RP + ++ N + + + C F S
Sbjct: 709 DARSRFVGQRPPKLNRIEMNEESGFVALSTRPWLGFNENGRFSIVPACHETIDRACGFAS 768
Query: 627 AAFPDSLAIAKEGELTI 643
P+ + +G L I
Sbjct: 769 EQVPEGIVAVVDGSLRI 785
Score = 65.5 bits (158), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 67/301 (22%), Positives = 122/301 (40%), Gaps = 46/301 (15%)
Query: 761 LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTREL--QSECGHHGHILA 818
L+L+ E +GA ++L F GKLL ++ ++LY D G ++L ++EC
Sbjct: 985 LELVHETPCEGAPFALRNFEGKLLVGVDDVLRLY-----DFGKKKLLRKAECAQKFPSFI 1039
Query: 819 LYVQTRGDFIVVGDLMKSISLLIYKHE---EGAIEERARDYNANWMSAVEILDDDIYLGA 875
++ GD V D +S + Y E E ++ A D +++++ LD D +
Sbjct: 1040 NDIRCSGDRFFVTDACESAFFVKYVREDDQECSMHIFADDIAPRYVTSMLPLDRDTVAVS 1099
Query: 876 ENNFNLFTVRKNSE-----------------------GATDEERGRLEVVGEYHLGEFVN 912
+ N +R + GA + +L+ ++H+G+ +
Sbjct: 1100 DKFGNFAALRLPKDVSDEIESDISGGKHAALTSSAALGALNGANNKLQACAQFHVGDVIC 1159
Query: 913 RFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLP-HEQYLFLEKLQTNLRKVIKGV 971
+L G +++ T+ G +G ++ F L+ +LR V
Sbjct: 1160 SLTKCAL-------QTGGSEVIVYATLGGALGAFVPFASKDEADFCTHLEMHLRIEAPPV 1212
Query: 972 GGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEE 1031
G H +RS K +DGDL E F L IS+ M+ + E+ KR+E+
Sbjct: 1213 LGNEHGAFRS-----SYFPVKAVVDGDLCEQFGRLGADAQRRISEEMDRTPSEIVKRLEQ 1267
Query: 1032 L 1032
+
Sbjct: 1268 I 1268
>gi|33146591|dbj|BAC79787.1| putative Splicing factor 3B subunit 3 [Oryza sativa Japonica Group]
gi|222636635|gb|EEE66767.1| hypothetical protein OsJ_23488 [Oryza sativa Japonica Group]
gi|429459546|gb|AFZ84679.1| spotted leaf 5 [Oryza sativa Japonica Group]
Length = 1355
Score = 132 bits (331), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 146/610 (23%), Positives = 254/610 (41%), Gaps = 94/610 (15%)
Query: 223 GLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDA 282
G HLL + + E V L E + + ++D +V GD +++L
Sbjct: 414 GEFHLLEFSLDMEGVKVLP-ECVHRGLPCKPLLWMDKGMVVGFVEMGDGMILQLENNRLV 472
Query: 283 KGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASV 342
S ++ N+ PI+D + D + Q Q+ C G +GSLR++RNG+ + +
Sbjct: 473 HKSAIQ------NVAPILDLAIADHHGEKQDQMFACCGMCPEGSLRVIRNGVNVEKLLRT 526
Query: 343 E--LQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLF 400
+ G+ G+W+LR D + +FLV+SF+ ETRIL++ L + + GF + TL
Sbjct: 527 DPIYHGVTGLWTLRMKRTDAYHSFLVLSFVEETRILSVGLSFN-DICDAVGFQTDVCTLA 585
Query: 401 CHDAIYNQLVQVTSGSVRLVSSTS----------RELRNEWKSPPGYSVNVATANASQVL 450
C LVQ+ S V+L T+ + +W P +++V + V+
Sbjct: 586 CGLVADGLLVQIHSKCVKLCLPTACAHPEGTLLPSPVCADWY--PDVTISVGAVGHNVVV 643
Query: 451 LATGGGHLVYLEIGDGILT-------EVKHAQLEYEISCLDINPIG---ENPSYSQIAAV 500
+AT +Y+ +G L+ E++H QL YE+SC+ I +N S S +
Sbjct: 644 VATSNPCCLYI-LGVRSLSSFQYELYEIQHVQLHYEVSCISIPQEDWRLDNSSSSCATSG 702
Query: 501 GMWTDIS--VRIFSL-----PDLNLITKE-----------------HLGGEI---IPRSV 533
D + +R F++ P +++I+ E LG I IP +V
Sbjct: 703 DFRKDFAANIRKFAVIGTHEPSVHIISLEPGEAFQQLAVGHISVNNALGTPISGCIPENV 762
Query: 534 LLCAFEGISYLLCALGDGHLLNFLLNMKTGEL----------TDRKKVSL--------GT 575
A Y+L L +G LL F G T SL G
Sbjct: 763 RFVAAARF-YILAGLRNGMLLRFESQTSKGHCFPGSFYKESSTPCDDTSLMLIAVRRIGI 821
Query: 576 QPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAI 635
P+ L + + DRP ++ S+ L YS+++ SH+ P +S P L
Sbjct: 822 TPVVLVPLHDRANADIIVLGDRPWLLQSARHSLAYSSISFLPASHVTPVSSTDCPSGLLF 881
Query: 636 AKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAI--CSLKNQSCAEESEMHF 693
E L + + ++L+ + +G PR++ + SRT + L + SC+ +
Sbjct: 882 VSENCLHLVEMVHGKRLNAQKFSIGGTPRKVLYHSDSRTLLVLRTGLTSVSCSSD----- 936
Query: 694 VRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAY-----VLPE-ENEPT 747
+ +D +S + + E + +D + VGT+ ++P E E
Sbjct: 937 IVQIDPSNGALLSRFKCEPGETAKCMQIAKIGNDQVLI--VGTSKSNGRPMMPNGEAESI 994
Query: 748 KGRILVFIVE 757
KGR+++ +E
Sbjct: 995 KGRLILLSLE 1004
>gi|19075704|ref|NP_588204.1| silencing protein Rik1 [Schizosaccharomyces pombe 972h-]
gi|13124796|sp|Q10426.2|RIK1_SCHPO RecName: Full=Chromatin modification-related protein rik1; AltName:
Full=Silencing protein rik1
gi|6016996|emb|CAB57850.1| silencing protein Rik1 [Schizosaccharomyces pombe]
Length = 1040
Score = 131 bits (330), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 226/1060 (21%), Positives = 428/1060 (40%), Gaps = 127/1060 (11%)
Query: 4 VPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPT 63
+P++ + + +RP G +D+LF+ + + WD + ++I V R+ P
Sbjct: 53 IPLFSTVKHIRPYRPPGLDRDYLFVVLNDDTYFSIYWDEDYQKVIVDH-PPVRYRVTFPW 111
Query: 64 DNGQIGIIDPDCRLIGLHLYDGLFKVI---------PFDNKGQLKEAFNIRLEELQVLDI 114
+ D R+ + L +I + F + D+
Sbjct: 112 NRNAKSYCLVDLRMRAIFLSIDEISMICIRILSAEERLKTGRSIDSGFPFSFPVHLIYDM 171
Query: 115 KFLYGCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPL 174
L + PT+VVL+ D D +V + + L K +L G L V P +
Sbjct: 172 CILNDSSTPTLVVLHSDGLDC-YVTAFLLDLSSK----------SLGKGIRLFERVKPSM 220
Query: 175 CGV-----LIIGEETIVYCSANA-FKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLL 228
L++ E ++C F I + + + Y++ D G L +
Sbjct: 221 IMPFGKRGLLVFESLFIHCMYRGNFVTINGPCTTYMHWTPLKGQKMHYIVCDTNGYLFGV 280
Query: 229 VITHEKEKVTGLKIELLG--ETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSY 286
+ + L +E L ++++ + +++IGS +S+LI L+ D
Sbjct: 281 YSSILGKNKWSLVMERLPIPPFDFITSLNSIHEGLLFIGSKNSESKLINLSTLKD----- 335
Query: 287 VEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG 346
++ NLGPI D V L+ + + C+G ++ SL ++ + ++ ++ G
Sbjct: 336 ---VDSIPNLGPIHDLLV--LKNDIEKSFLVCAGTPRNASLIYFQHALKLDILGQTKISG 390
Query: 347 IKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIY 406
I L S + L + F SET +A N++++ + E++ S +
Sbjct: 391 ILRAMVLPSYPEHK----LFLGFPSET--VAFNIKEDFQ-LELDPSLSTKERTIALSGTN 443
Query: 407 NQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDG 466
+ VQVTS + + S R + N A ++ G ++ +
Sbjct: 444 GEFVQVTSTFLCIYDSAKRSRLVYIEK----ITNAACYQEYSAIVINGTALAIFKKD--- 496
Query: 467 ILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLIT-KEHLG 525
TEV E EISCLD + + QI VG W+ V I + D + I+
Sbjct: 497 --TEVARKVFESEISCLDFS------AQFQIG-VGFWSK-QVMILTFSDNSSISCAFQTN 546
Query: 526 GEIIPRSVLLCAFEGI----SYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLR 581
+PR+++L EG+ + LL + G G +++L ++ K GT P++ R
Sbjct: 547 VPSLPRNIIL---EGVGVDRNLLLVSSGSGEFKSYVLFKNNLVFSETKH--FGTTPVSFR 601
Query: 582 TFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGEL 641
F+ T++ +D P ++Y N L Y +++ + +C F + D L G L
Sbjct: 602 RFTMNIGTYIICNNDCPHMVYGFNGALCYMPLSMPQSYDVCQFRDNSGKDFLISVSLGGL 661
Query: 642 TIGTIDDIQKLHIRSIPLGEHP-RRICHQEQSRTFAICSLKN-----QSCAEESEMHFVR 695
++ + +L R + L P + I Q + + +L+N +S E + V
Sbjct: 662 KFLQLNPLPELTPRKVLLEHVPLQAIIFQNK---LLLRTLENRYEDYESYKENYHLELVD 718
Query: 696 LLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILV-- 753
DD +F S + E I S VGT+ + ++ P GR+++
Sbjct: 719 SYDDNSFRVFSFTENERCEKVLKINESSL--------LVGTSIIEQDKLVPVNGRLILLE 770
Query: 754 FIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHH 813
F E L++++ AV L +N + + A Q++ + K + + S
Sbjct: 771 FEKELQSLKVVSSMVLSAAVIDLGVYNDRYIVAFGQQVAIVKL---TEERLMIDSRISLG 827
Query: 814 GHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYL 873
+L L V+ G+ I + D + +++ + ++ + R + N + A + + +Y+
Sbjct: 828 SIVLQLIVE--GNEIAIADSIGRFTIMYFDGQKFIVVARYL-FGENIVKAA-LYEGTVYI 883
Query: 874 GAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQI-- 931
A N+ L +R N + +R E V YHL + V++F++ P ++
Sbjct: 884 IATNSGLLKLLRYNKDAKNFNDRFICESV--YHLHDKVSKFQN------FPITNTNSFLE 935
Query: 932 PTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKV-IKGVGGLNHEQWRSFNNEKKTVD 990
P ++F T G IG I SL ++ L LE+L +RK+ + +++E E +
Sbjct: 936 PKMLFATEIGAIGSIVSLKDKE-LELEELTRKIRKLKFSYLSSMDYESI-----EADLIS 989
Query: 991 AKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVE 1030
F+DGDL+ +D+ R E+ + LC+ VE
Sbjct: 990 PVPFIDGDLV---IDVKRWASSELFR--------LCRSVE 1018
>gi|302820387|ref|XP_002991861.1| hypothetical protein SELMODRAFT_448595 [Selaginella moellendorffii]
gi|300140399|gb|EFJ07123.1| hypothetical protein SELMODRAFT_448595 [Selaginella moellendorffii]
Length = 1292
Score = 131 bits (330), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 121/475 (25%), Positives = 215/475 (45%), Gaps = 73/475 (15%)
Query: 269 GDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLR 328
GD ++K+ D + S+ ++ N+ PI+DF +VD + Q Q+ C G ++GS+R
Sbjct: 413 GDGHVLKVE---DGQLSFQSFVQ---NIAPILDFSLVDYYGEKQDQMFACCGGDEEGSVR 466
Query: 329 IVRNGIGINEQASVE--LQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEE 386
I+RNG + + QG+ G+W++R DP+ F ++SF+ ETR+L++ L + ++
Sbjct: 467 IIRNGNSVEKLICTPPVYQGVSGIWTMRYRFKDPYHAFFLISFVEETRVLSVGL-NFVDI 525
Query: 387 TEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSST----------SRELRNEWKSPPG 436
T+ GF SQ TL C + QV V+L S T S L W+ PG
Sbjct: 526 TDAVGFESQVNTLACGLVEDGWVAQVWRYEVKLCSPTKAAHPAGVSGSNPLSTTWRK-PG 584
Query: 437 YSVNVATANASQVLLATGGGHLVYL----EIGDGI--LTEVKHAQLEYEISCLDI----- 485
Y ++V S+V+LA L+ + + DG L E++ +E EISC+ I
Sbjct: 585 YPISVGAVCRSRVILALARPGLLLMLGATQTSDGSFELVELQLCMMEAEISCISIPQGDI 644
Query: 486 --------------NPIGENPSYSQIAAVGMWTDISVRI-----------FSLPDLNLIT 520
N + + VG SV + F++ ++LIT
Sbjct: 645 SIPVPPTIAGLHAGNTVPAGLDLGNVCVVGTHKP-SVELLSIVPGDKFAPFAVGQVSLIT 703
Query: 521 KEHLGGEI---IPRSVLLCAFEGISYLLCALGDGHLLNF--------LLNMKTGELTDRK 569
+G + IP+ V L F+ + Y+L L +G LL + L+ K +L
Sbjct: 704 S--VGTAVSGCIPQDVRLALFDRL-YILAGLRNGMLLRYEWPEDTPSLVLSKPAQLHLVA 760
Query: 570 KVSLGTQPITLRTF-SSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAA 628
LG P+ L + + A SDRP ++ + K++ Y++++ + +H P S
Sbjct: 761 ARRLGVSPVCLVPLEACALRADIIALSDRPWLLQMA-KRISYTSISFQPSTHATPVCSKD 819
Query: 629 FPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQ 683
P + + L + ++ + L+++ + LG RR+ + +S + L ++
Sbjct: 820 CPKGIIFLADCSLHLVEMEQSRTLNVQKLHLGCTGRRVLYHPESGVLIVLRLLSE 874
Score = 55.8 bits (133), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 51/201 (25%), Positives = 82/201 (40%), Gaps = 15/201 (7%)
Query: 821 VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGA--ENN 878
+ R I VGD + L Y+ + +E D +S ++D D + + N
Sbjct: 1049 ISVRFTTIAVGDCRDGVLLFTYREDSKKLEPIRCDPMRRLVSDCTLVDVDTAVVVDRQGN 1108
Query: 879 FNLFTVRKNSEGATDEERG----RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQ---I 931
F + + +EG D G LE +H+GE R GS + + I
Sbjct: 1109 FCALSANEETEGKCDSNSGSPEKNLEAHCWFHIGEVCTTVRKGSFAFKAVEDSCSVDRLI 1168
Query: 932 P-----TVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEK 986
P VI T+ G + + + E+Y L+ LQ L + L ++ R F +
Sbjct: 1169 PNMGKSCVIATTLLGSVFIFVRMTGEEYSLLQALQRRLSVLPATAPVLGNDHAR-FRGQG 1227
Query: 987 KTVDAKNFLDGDLIESFLDLS 1007
+ K LDGDL+E FL+L+
Sbjct: 1228 RPAGVKEVLDGDLLEQFLELT 1248
>gi|340367935|ref|XP_003382508.1| PREDICTED: splicing factor 3B subunit 3-like isoform 2 [Amphimedon
queenslandica]
Length = 1160
Score = 129 bits (324), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 163/758 (21%), Positives = 322/758 (42%), Gaps = 104/758 (13%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
++ ++G I +L FR G +D++ + ++ + +L++D + + + + + G
Sbjct: 53 LVSTEVFGLIRSLITFRLTGSTKDYIVLGSDSGRIVILEYDPVKN-VFEKVHQETYGKSG 111
Query: 61 -RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVL----- 112
R GQ +DP R +IG L ++ D+ +L + + + L
Sbjct: 112 CRRIVPGQYLAVDPKGRALMIGAVEKQKLVYILNRDSAARLTISSPLEAHKSHTLVYHVV 171
Query: 113 --DIKF---LYGCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNN-------- 159
D+ F L+ C + + Y+++ + + +VA ++ + E N+
Sbjct: 172 GVDVGFENPLFACLE----LDYEESDNDHTGEAAQVAQQNLTYYELDLGLNHVVRKYSEP 227
Query: 160 LDNGADLLIPVPPPL---CGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVD-ADGSR 215
LD+ A++LI VP GVL+ E I+Y N IR I + +D AD S
Sbjct: 228 LDDMANMLITVPGGTDGPSGVLVCCENYIIY--KNFGDQPDIRCPIPRRQNDLDDADRSM 285
Query: 216 ---------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNA 260
+LL G + +++ E + V +KI+ + S++ L +
Sbjct: 286 LIITSAMHKTKHLFFFLLQSEQGDIFKVMLDVEDDIVVRIKIKYFDTVPVCSSLCVLKSG 345
Query: 261 VVYIGSSYGDSQLIKL-NLQPDAKGSY---------------------VEVLERYVNLGP 298
+++ S +G+ L ++ +L D++ + + +++ +L P
Sbjct: 346 LLFCSSEFGNHYLYQIAHLGDDSEREFHSDYPLEEGETFFFGPRPLKNLILVDEMESLSP 405
Query: 299 IVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSST 357
I+ + DL + Q+ G SLR++R+G+ + E A EL G +W+++ +
Sbjct: 406 IMSCQIADLANEDTPQLYAACGRGPRSSLRVLRHGLEVTEMAVSELPGNPHAVWTVKKDS 465
Query: 358 DDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSV 417
+ FD+++VVSF++ T IL++ + +EE GF T TL C + L+Q+ +
Sbjct: 466 KEDFDSYIVVSFMNATLILSIG--ETVEEVTDSGFLGTTPTLACSQLGDDALIQIYPEGI 523
Query: 418 RLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKHAQ 475
R + S R NEW+SP + N QV++A GG +VY E+ G L E + +
Sbjct: 524 RHIRSDKR--VNEWRSPGKRLIRQCAVNEHQVVIALSGGEIVYFEMDQSGQLNEYTERKE 581
Query: 476 LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLL 535
+ E+ C+ + + + AVG+ +D +VRI SL + T + L + +P
Sbjct: 582 MTAEVICISLGSVPPGQQRCRFLAVGL-SDQTVRIISLDPHD--TLQPLSMQALP----- 633
Query: 536 CAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAAS 595
+ LC ++NM +G ++D VS G + + V A S
Sbjct: 634 ----ALPESLC----------IVNM-SGNVSDDTTVSTGGLFLNI----GLQNGGVIAVS 674
Query: 596 DRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIR 655
R + Y+ + + ++ + + F S P+ + L I ++ + + +
Sbjct: 675 SRTWLNYTYQSRFHLTPLSYDLLEYASSFTSEQCPEGMVAISSNTLRILALEKLGVVFNQ 734
Query: 656 -SIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMH 692
S PL PR++ + + + E +++H
Sbjct: 735 VSTPLQYTPRKLLIHPPMSNLVLIETDHNAFTEATKLH 772
Score = 66.2 bits (160), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 76/345 (22%), Positives = 150/345 (43%), Gaps = 60/345 (17%)
Query: 714 EYGCSILSCSFSDDSNV--YYCVGTAY-VLPEENEPTKGRILVF-IVEDG-KLQLIAEKE 768
E S+ C F ++ + VGTA ++ + G ++VF + DG KL+ + +
Sbjct: 844 EAAFSLAVCQFVSKGDLEWFVVVGTAKDMIITPRAISSGSLIVFRLSPDGSKLEHVHTTQ 903
Query: 769 TKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSEC-GHHGHILALYVQTRGDF 827
++ F G+LL + + +++Y D G +++ +C H L + ++ G
Sbjct: 904 LDDVPIAMAPFQGRLLVGVGKLLRIY-----DIGKKKMLRKCENKHLPYLVVDIKVMGRR 958
Query: 828 IVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKN 887
+ VGD+ +++ L Y+ E + A + + + ILD + A+ N+ +R
Sbjct: 959 VYVGDVQEAVHFLYYRPHENQLVIFADEVVPRFCTTSCILDYNTVASADKFGNITILRLP 1018
Query: 888 SEGATDE------------ERGRLE-------VVGEYHLGEFVNRFRHGSLVMRLPDSDV 928
S+ TD+ +RG L V+ YH+GE +N SL+
Sbjct: 1019 SD-VTDQVDEDPSGSRSLWDRGFLNGATQKANVMTSYHVGEGINTLHKVSLI-------P 1070
Query: 929 GQIPTVIFGTVNGVIGVIASLP-HEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKK 987
G +++ T++G IG++ E F + L+ ++R +W + +
Sbjct: 1071 GGSEVLVYTTLSGSIGILVPFSSKEDSDFFQHLEMHMRS-----------EWSNLS---- 1115
Query: 988 TVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
+DGDL E + L ++ EI+ ++ S E+ K++E+L
Sbjct: 1116 ------VIDGDLCEVYNSLDPSKRREIALDLDRSPSEVAKKLEDL 1154
>gi|308812436|ref|XP_003083525.1| UV-damaged DNA binding protein 1 (ISS) [Ostreococcus tauri]
gi|116055406|emb|CAL58074.1| UV-damaged DNA binding protein 1 (ISS), partial [Ostreococcus
tauri]
Length = 265
Score = 128 bits (321), Expect = 2e-26, Method: Composition-based stats.
Identities = 62/103 (60%), Positives = 73/103 (70%)
Query: 868 DDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSD 927
DD+ YLGAEN+ NLFTV +N TDEER RLE+ GEYHLGE VN F GSLVM L D +
Sbjct: 2 DDETYLGAENSLNLFTVSRNVNAVTDEERSRLEITGEYHLGELVNAFAPGSLVMSLRDGE 61
Query: 928 VGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKG 970
+PT++FGT NGVIGV+ASLP + Y F E+LQ VI G
Sbjct: 62 SLSVPTLLFGTANGVIGVLASLPKDVYEFTERLQALGEFVICG 104
>gi|221061705|ref|XP_002262422.1| splicing factor 3b, subunit 3, 130kd [Plasmodium knowlesi strain H]
gi|193811572|emb|CAQ42300.1| splicing factor 3b, subunit 3, 130kd, putative [Plasmodium knowlesi
strain H]
Length = 1276
Score = 127 bits (320), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 150/698 (21%), Positives = 292/698 (41%), Gaps = 81/698 (11%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
I+G I L+ FR G +D++ I ++ + +LQ+ E ++ + R + + G R
Sbjct: 58 IFGIIRCLQTFRLTGSNKDYVVIGSDSGRLVILQFSNEKNDFV-RVHCETYGKSGLRRII 116
Query: 65 NGQIGIIDPDCRLIGLHLYD--GLFKVIPFDNKGQLKEAFNIRLEELQVL--DIKFL-YG 119
G+ +DP R + + + ++ DNK QL + + + + D+ + G
Sbjct: 117 PGEYIAVDPKGRALMICAIERQKFVYILNRDNKEQLTISSPLDAHKSHTICHDVVGMDVG 176
Query: 120 CAKPTIVVLYQDNK--DARHVKTYEVALKDKDFVEGPWSQN------------NLDNGAD 165
P + Q+ + D + T E+ + + W + +D A
Sbjct: 177 FENPMFASIEQNYEMYDKQVTNTTEIDACTRKTLLCLWEMDLGLNHVIRKHTLPIDMSAH 236
Query: 166 LLIPVP---PPLCGVLIIGEETIVYCSA---NAFKAIPIR------PSITKAYGRVDADG 213
LLIP+P GV++ + +VY + + A P R +I+ V
Sbjct: 237 LLIPIPGGQQGPSGVIVCCDNYLVYKKVEHVDVYCAYPRRLETGQEKNISIVCSTVHRIR 296
Query: 214 SRY--LLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYG-- 269
+ L+ G L+ + + H+ V + + +A+ I + + +++ + +G
Sbjct: 297 KFFFILIQSEYGDLYKIEMDHQDGVVKEITCKYFDTVPVANAICVMKSGSLFVAAEFGNH 356
Query: 270 -----------DSQLIKLNLQPDAKGSYVE----------VLERYVNLGPIVDFCVVDLE 308
D++ + + P + + + ++++ +L PI+D ++D +
Sbjct: 357 FFYQFSGIGDDDNEAMCTSKHPSGRNAIIAFRTKKLTNLFLIDQVYSLSPILDMKILDAK 416
Query: 309 RQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTDDPFDTFLVV 367
Q+ G SLRI+++G+ I E A EL G K +W+++ +D +++V
Sbjct: 417 NANSPQIYALCGRGPRSSLRILQHGLSIEELADNELPGRPKYIWTIKKDNASDYDGYIIV 476
Query: 368 SFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSREL 427
SF T IL + + +EE + T+ + N L+QV +R ++ ++
Sbjct: 477 SFEGSTLILEIG--ETVEEVVDTLLLTNVTTIHVNILYDNSLIQVHDTGIRHING---KV 531
Query: 428 RNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDG-ILTEVKHAQLEYEISCLDIN 486
NEW P V AT+NA+Q++++ GG L+Y EI + L E+ L EI CL I
Sbjct: 532 INEWVPPKNKQVKAATSNATQIVISLSGGELIYFEIDESHSLVEIFRKSLNVEILCLSIQ 591
Query: 487 PIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVL---LCAFE---- 539
+ EN + AVG ++ VR+ S+ + L I+P + +C E
Sbjct: 592 EVEENKVRANFLAVGCLDNV-VRLLSIEKEKYFNQ--LSTFILPNNSSAQDICISEMCEL 648
Query: 540 ------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFA 593
+ +L L +G LL +++ TG LT+ LG + + + K +
Sbjct: 649 GNDKERKLLFLNIGLNNGVLLRSVVDPITGTLTNHYSKYLGAKNVKICPVHVKKNPALLV 708
Query: 594 ASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPD 631
++ + Y K +YS +N + + F+S D
Sbjct: 709 LCEKTYLCYVHQGKYIYSPLNYDILEYASSFHSEQCSD 746
Score = 71.6 bits (174), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 82/364 (22%), Positives = 158/364 (43%), Gaps = 47/364 (12%)
Query: 694 VRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC--VGTAYVLPEENE-PTKGR 750
++++ + + I L+ E S+ +C +C VGT L +N T
Sbjct: 930 IKIIHPVSLQTIDKISLEMEEAALSVCACELE----ALHCLIVGTTTNLSLKNRTATTAA 985
Query: 751 ILVFIVE-DGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSE 809
+ V+ + + KL L+ + + + FNG+LLA++ K+++Y G ++L +
Sbjct: 986 LRVYTYDINYKLNLLHITPVEDQPFCFSPFNGRLLASVGNKLRIYAL-----GKKKLLKK 1040
Query: 810 CGHHGHILALY-VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILD 868
C + A+ ++ GD I D+ +S+ + Y A+ + D W++ EILD
Sbjct: 1041 CEYKDIPEAIISIKVSGDRIFASDIRESVLVFFYDANMNALRLISDDIIPRWITCSEILD 1100
Query: 869 DDIYLGAENNFNLFTVRKNSEGATDEERG-------------------RLEVVGEYHLGE 909
+ A+ ++F +R E A EE G RLE + +H+GE
Sbjct: 1101 HHTIMAADKFDSVFVLRVPEE-AKQEEYGISNKCWYGGEMMAGSNKNRRLEHIMNFHVGE 1159
Query: 910 FVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFL-EKLQTNLRKVI 968
V + L P S +I+ T+ G IG +++ L L + L+ LR
Sbjct: 1160 IVTSLQKVKLS---PTSS----ECIIYSTIMGTIGAFIPYDNKEELELTQHLEIILRTEN 1212
Query: 969 KGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKR 1028
+ G H +RS+ + + V +DGDL E F L +++ + + +++ ++
Sbjct: 1213 PPLCGREHIFFRSYYHPVQHV-----IDGDLCEQFSSLPYDIQRKVAADLERTPDDILRK 1267
Query: 1029 VEEL 1032
+E++
Sbjct: 1268 LEDI 1271
>gi|429328598|gb|AFZ80358.1| hypothetical protein BEWA_032110 [Babesia equi]
Length = 1387
Score = 127 bits (320), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 214/1089 (19%), Positives = 417/1089 (38%), Gaps = 234/1089 (21%)
Query: 176 GVLIIGEETIVYCSANAFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLL-VIT 231
VL++G ++ + + N K+I S+++ G + S+++L D G +HL+ ++
Sbjct: 303 SVLLLGAGSVGFFAFNDPKSIKDFVSDFSMSEITGYCQIEHSKFVLSDDNGNIHLMELVL 362
Query: 232 HEKEKVTGLK-----------------------------IELLGETSIASTISYLDNAVV 262
K TGLK I L I S+I + N +
Sbjct: 363 SSKPANTGLKRRIRPINSNNNCDMHAHRGRTIATISDVIITRLYNCDIPSSIINIGNNTI 422
Query: 263 YIGSSYGDSQLIKLNL--QPDAKGSYVEVLERYV-------NLGPIVDFCVVDLERQGQG 313
+ S G++ +++ +P ++ E V NLGPI DF + G
Sbjct: 423 FSTSKVGNTCTFRIDFGNKPSICSDSFKIEEFNVYKTWNQTNLGPIADFTYGEENSHGNK 482
Query: 314 QVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRS--STDDPFDTFLVVSFIS 371
V+ C G G + G+GI+ S + GI ++S+ S S D ++ L SF
Sbjct: 483 SVLACCGMGSAGKFCSITLGVGIDTICSNTILGISNLFSVSSGGSKTDSNESILCCSFYR 542
Query: 372 ETRILAMNLED-------------ELEETEI----------EGFCSQTQTLFCHDAIYNQ 408
T+ ++N ++ E + F +T+ + ++
Sbjct: 543 NTQFYSVNFVSMNNTQGNINSSLLDINEPNVGYVNKMVMDGSTFREDERTILLSNIDKDK 602
Query: 409 LVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNV--ATANASQVLLATGGGHLVYLEIGD- 465
+VQ+T+ S+ ++SS + R +K+ + + ++++ A G +++ + +
Sbjct: 603 IVQITTSSIIILSSNPKLNRIAYKTVEEIMTEIDEGSEERNRIVHACLCGSYIFIILSNH 662
Query: 466 --GILTEVKHAQLEYEISCLDINP---IGENPSYSQIAAVGMWTDISVRIFSLPDLNLIT 520
IL K + E ++I+ I + ++ A+ W S+ I S P+L+++
Sbjct: 663 WAAILDGKKLEMVNVEPLSINISSVAYISHVKNSAEFLALSTWEKSSILILSFPNLDIVR 722
Query: 521 KEHLGGE--IIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMK----------------- 561
+ + I +S+ A + +L DG L+ + + M
Sbjct: 723 RIKIECTFGIYIKSMKFGATNNNLFFFVSLSDGQLMVYKVQMSRNITNAVSKGKSDVFKN 782
Query: 562 --------------TGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKK 607
+ T + ++L I +T + + ++ A + T I+S K
Sbjct: 783 DVEIDFILDSTMHLSNTCTKLEFLNLSNNYIHEQT-CNLDCNNIIAIGKQSTAIFSKRDK 841
Query: 608 LLYSNVNLKEVSHMCPF----NSAAFPDSL-AIAKEGELTIGTIDDIQKLHIRSIPLGEH 662
+ ++ VN+ VS +C S+ F L A L IG +D I++ H+++I G
Sbjct: 842 VEFAKVNISHVSCLCTIPTQRTSSEFNSVLIAYYSYPSLVIGKLDPIEQFHVQNIISGRT 901
Query: 663 PRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFIS---------------- 706
++C+ S+ + S + + +DD F+
Sbjct: 902 FDKVCYHVPSKVAVVGCPPELSATTSDSVPGLNDIDDNNPPFMCLDGSESEKVNSLKLPA 961
Query: 707 ------------TYPLD-TFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILV 753
Y L+ E+ S L D +N +GT+ V + PT+G +
Sbjct: 962 LALFINVSSKEVIYNLNLPREHLVSSLHTMSFDGTNEVILIGTSKVDESHDAPTEG--FL 1019
Query: 754 FIVE-----DGKLQLIAEKETK---GAVYSLNAFNGKLLAAINQKIQL------------ 793
+IV+ + +L + ++ TK G V + + + A+N +
Sbjct: 1020 YIVDISSHSEDRLVMTIQRTTKPFAGGVVEITSCGQNPVIAVNNVVLALSLCSNIGVRNE 1079
Query: 794 -------YKWMLRD--DGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKH 844
Y + D D + L E + + + + + + I+VGDLM S LL +K
Sbjct: 1080 KDRYDLEYLYACDDIIDSRQCLVVEAKYTSNTYVVSLDSCENNIIVGDLMTSAKLLRFK- 1138
Query: 845 EEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGE 904
+ E RD+N+ W +AV +D + + ++++ N KN D + + +V+G
Sbjct: 1139 -DNCFYELCRDFNSIWCTAVASIDKNTSIVSDDSGNFSIFSKNETPTNDAQSAKFQVMGL 1197
Query: 905 YHLGEFVNRF-----------RHGSLVMR-------------LPDSDVGQI--------- 931
+H GE +N+ R SL+ L +D +
Sbjct: 1198 FHHGEIINKIVEHVSKECKYERKKSLIKLGGVASSCKRNFCCLSTADTCPMDPTNGSKST 1257
Query: 932 --------------PTVIFGTVNGVIGVIASLPH-EQYLFLEKLQTNLRKVIKGVGGLNH 976
T + T +G I +++ + +L L L+ + + G + +
Sbjct: 1258 NIHKDAAPTRYRFDKTYLCATSSGSILKMSTFKDMDAFLRLALLEEAIIGIQSDAGNIPN 1317
Query: 977 EQWRSFNNEKKTVDAKNFLDGDLIESFL----DLSRT---RMDEISKTMN---VSVEELC 1026
+WR+F N + + F+DGD +ESFL DL R ++ E SK + S+E L
Sbjct: 1318 SKWRNFQNSHTNIQTRGFIDGDTVESFLHLPMDLKRRVLEKLQESSKDIQGDFDSIETLT 1377
Query: 1027 KRVEELTRL 1035
+E++ R+
Sbjct: 1378 LEIEQIQRV 1386
>gi|68531971|ref|XP_723667.1| hypothetical protein [Plasmodium yoelii yoelii 17XNL]
gi|23478038|gb|EAA15232.1| Drosophila melanogaster CG13900 gene product [Plasmodium yoelii
yoelii]
Length = 1235
Score = 126 bits (316), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 156/700 (22%), Positives = 292/700 (41%), Gaps = 85/700 (12%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
++G I +LE+FR G +D++ I ++ + +L++D E ++ I R + + G R
Sbjct: 58 VFGIIRSLEIFRLLGSNKDYIAIGSDSGRLVILKYDDEKNDFI-RVHCETYGKSGIRRII 116
Query: 65 NGQIGIIDPDCRLIGLHLYDG--LFKVIPFDNKGQLKEAFNIRLEELQVLD---IKFLYG 119
G+ IDP R + L + ++ DNK L + + + + + G
Sbjct: 117 PGEYIAIDPKGRALMLCAIEKQKFVYILNRDNKENLTISSPLEAHKSHSICHAVVALNVG 176
Query: 120 CAKPTIVVLYQ--DNKDARHVKTYEVAL----KDKDFVEGPWSQNN--------LDNGAD 165
P V + Q +N D + + E + K F E N+ +D A
Sbjct: 177 FENPMFVSIEQNYENLDKQVLNANEQIMEYPKKGLCFWEMDLGLNHVIKKHTIPIDITAH 236
Query: 166 LLIPVP-----PPLCGVLIIGEETIVYCSAN---AFKAIPIR------PSITKAYGRVDA 211
LLIP+P P G+++ E +VY + F + P R +I+ +
Sbjct: 237 LLIPLPGGQQGPS--GLIVCCENYLVYKKIDHDDIFCSYPRRLEVGEEKNISIVCWTIHR 294
Query: 212 DGSRY--LLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYG 269
+ + L+ G L+ + + HE V + + IA++I L + +++ + +G
Sbjct: 295 IKTFFFILIQSEYGDLYKIEVNHEDGIVKEIICKYFDTVPIANSICVLKSGALFVAAEFG 354
Query: 270 DSQLIKLN-------------LQPDAKGSYVE----------VLERYVNLGPIVDFCVVD 306
+ + + P K + + ++++ +L PIVD ++D
Sbjct: 355 NHFFYQFSGIGNDSNDAMCTSNHPSGKNAIIAFKTQKLKNLYLVDQIYSLSPIVDMKILD 414
Query: 307 LERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTDDPFDTFL 365
+ Q+ G SLRI+++G+ I E A+ EL G + +W+++ +D ++
Sbjct: 415 AKNSNLPQIYALCGRGPRSSLRILQHGLSIEELANNELPGKPRYIWTVKKDNSSEYDGYI 474
Query: 366 VVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSR 425
+VSF T IL + + +EE + T+ + N +QV +R ++
Sbjct: 475 IVSFEGNTLILEIG--ETVEEVYDSLLLTNVTTIHINLLYDNSFIQVYDTGIRHING--- 529
Query: 426 ELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDG-ILTEVKHAQLEYEISCLD 484
++ EW P +N AT+N SQ++++ GG L+Y EI + LTE+ + EI CL
Sbjct: 530 KIVQEWIPPKNKQINAATSNGSQIVVSLSGGELIYFEIDESHTLTEIFRKNINVEILCLS 589
Query: 485 INPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVL---LCAFE-- 539
I I +N + AVG ++ VR+ S+ + L I+P + +C E
Sbjct: 590 IQQIQQNKLRASFLAVGCLDNV-VRLLSIEKDQYF--KQLSTYILPNNSSPQDICISEMK 646
Query: 540 --------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHV 591
I YL L G LL +++ G L++ LG + + + +
Sbjct: 647 ELGNQKEHTILYLNIGLNTGVLLRSVIDPICGTLSNHYSKYLGAKSVKICHVQVNKNPAL 706
Query: 592 FAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPD 631
S++ + Y K +YS +N + + F S D
Sbjct: 707 LVLSEKTYLCYVYQGKYIYSPLNYDVLEYASSFYSEQCSD 746
Score = 65.9 bits (159), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 78/376 (20%), Positives = 154/376 (40%), Gaps = 57/376 (15%)
Query: 694 VRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKG-RIL 752
+++++ + I L+ E S+ +C + T L N P+ R+
Sbjct: 875 IKIINPINLQIIDKISLELEEAALSVCACELEALHCLIVGTTTNMTLKNRNVPSASLRVY 934
Query: 753 VFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGH 812
+ + + KL L+ + Y FNG+++ ++ K+++Y G ++L +C +
Sbjct: 935 TYDI-NYKLNLLHITPIEDQPYCFCPFNGRVIVSVGNKLRIYAL-----GKKKLLKKCEY 988
Query: 813 HGHILALY-VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI 871
A+ ++ GD I D+ +S+ + Y + I + D W++ EILD
Sbjct: 989 KDIPEAIVSIKVSGDRIFASDIRESVLIFFYDSNQNLIRLISDDIIPRWITCSEILDHHT 1048
Query: 872 YLGAENNFNLFTVRKN-------------SEGATDEERG-------------------RL 899
+ A+ ++F +R + E A EE G ++
Sbjct: 1049 IIAADKFDSVFILRVSLLTFFITPFCHLVPEEAKQEEYGIANKCWYGGEVINSSTKNRKM 1108
Query: 900 EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPH---EQYLF 956
E + +H+GE V + L P S +I+ T+ G IG A +P+ E+
Sbjct: 1109 EHIMSFHIGEIVTSLQKVKLS---PTSS----ECIIYSTIMGTIG--AFIPYDSKEELEL 1159
Query: 957 LEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISK 1016
+ L+ LR + G H +RS+ + + V +DGDL E F L +I
Sbjct: 1160 TQHLEIILRTEKHSLCGREHIFFRSYYHPVQHV-----IDGDLCEQFSSLPFEVQRKIGS 1214
Query: 1017 TMNVSVEELCKRVEEL 1032
+ + +E+ +++E++
Sbjct: 1215 DLEKTPDEILRKLEDI 1230
>gi|146420838|ref|XP_001486372.1| hypothetical protein PGUG_02043 [Meyerozyma guilliermondii ATCC 6260]
Length = 1206
Score = 125 bits (315), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 237/1169 (20%), Positives = 447/1169 (38%), Gaps = 176/1169 (15%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
+ G I ++ R G D L I ++ + +L++D + + S T
Sbjct: 66 VIGVIQNIDRIRKGGSNLDLLVITSDSGRLSILEFDKDELKFFPVVQEPHSKNGMNRTTP 125
Query: 66 GQIGIIDPDCRLIGLHLYDG---LFKVIPFDNKGQLKEAF-NIRLEELQVLDIKFLYGCA 121
G+ +DP R I + + ++K +NK +L ++ L + + G
Sbjct: 126 GEYLCVDPQDRTITIGAIERDKLMYKAQTNNNKLELLSPLESVSKNTLTIQMVSLDTGYE 185
Query: 122 KPTIVVL------------YQDNKDARHVKTYEV------ALKDKDFVEGPWSQNNLDNG 163
P + + Y ++ YE + KD +E P S L
Sbjct: 186 NPMLAAIECNYAHYDASLKYDPQSSNLTLQYYEFEQGLNYVARRKDTLEIPSSSTTL--- 242
Query: 164 ADLLIPVPPPLCGVLIIGEETIVY----CSANAFKAIPIR------PSITKAYGRVDADG 213
+P+P P+ GV++ G I Y + IP+R P + A ++ +
Sbjct: 243 ----VPLPTPIGGVIVAGSSFIFYHNPTIDQQLYLPIPLRAGSSPVPIVCYAVHKLKKNN 298
Query: 214 SRYLLGDHAGLLHLLVITH--EKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDS 271
LL + G ++I + + EKVT L + S +++I+ ++ + D
Sbjct: 299 FFILLHNELGDCFRVLIDYDDDSEKVTELSVGYFDTISPSTSINVFKKGYLFANVTNNDK 358
Query: 272 QLIKLN--------------------LQPDAKGSYVEVLERYVNLGPIVDF-------CV 304
L ++ + K + R + L I+D +
Sbjct: 359 MLYQIEDLGDNDSYISSSQFSSLEDVFDGNKKHEFKPRGLRNLALVQIIDSSNPCFGGAL 418
Query: 305 VDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIK-GMWSLRSSTDDPFDT 363
V + ++ +G L++ +GI I+ S L I +++ R S + D
Sbjct: 419 VKTSESKESRIAMITG---HSHLKLKTHGIPISTLVSSPLPMIATSVFTTRLSAESKNDE 475
Query: 364 FLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSST 423
++V+S + ++ L + + + +EE + F + T+ L+Q+ S +R + T
Sbjct: 476 YMVISSSASSKTLVLAIGEVVEEVQDSSFVTDQPTIGVQQVGLKSLIQIYSNGIRHIRQT 535
Query: 424 SRELR-----NEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEY 478
E + +W P G ++ A+ N QVL+ L Y EI D ++ Q
Sbjct: 536 ETEGKITKKTFDWYPPAGITIISASTNQEQVLIGLSNRELCYFEI-DPTDDQLIEYQERI 594
Query: 479 EISCLDINPIGENPSYSQIA------AVGMWTDISVRIFSLPD---LNLITKEHLGGEII 529
E+S I + S+ + A+ TD +V++ SL L +T + L
Sbjct: 595 EMSGGQICALALASSFVNKSQRKSPFALVACTDETVQVISLQQHNCLETLTFQALSANCT 654
Query: 530 PRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTT 589
++ +++L L +G + L TG+ +D + LG + L S +
Sbjct: 655 SAVIISNETSLVAHL--GLENGLYVRSTLESITGKFSDTRVKYLGADSVKLSAISLRQLQ 712
Query: 590 H--VFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTID 647
V A S P + Y S K + + + F S + + A ELTI TI
Sbjct: 713 QVGVLAVSTYPWLCYRSKNKTRITPLMGANIKCGTSFYSEEIGEGIVGASASELTIFTIG 772
Query: 648 DIQ---------KLHIRSIPLGEHPRRICHQEQSRTFAICS-----LKNQSCAE------ 687
+ + +L I I L P ++ ++Q+ F I S L QS E
Sbjct: 773 EDEDNSEMSIEDELAIEDIRLRYSPTKMILEDQT-AFVIESEYGVVLPYQSGEEVDSDYY 831
Query: 688 ---------ESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAY 738
ES V++LD ++ + + L+ + S+ +F + Y VGTA
Sbjct: 832 SAFGYLRKDESWASCVQILDLESKKITFSTELEDNQKPLSLCRMNFGNQK--YLMVGTAK 889
Query: 739 VLP-EENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLY--- 794
L + N K +I+ F++ +L+L+ E ++ F GKLL + + ++LY
Sbjct: 890 DLTFQLNNNPKYKIITFLINGSELELLHYTEVDHPPAAMIPFEGKLLVGMGKYLRLYELG 949
Query: 795 -KWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
K +LR T + I+ + Q + IVVGD S + L Y + A
Sbjct: 950 KKQLLRKSSTL-----VDYLTKIVQITHQGKQR-IVVGDGSNSTTFLKYDSLDNIFVSFA 1003
Query: 854 RDYNANWMSAVEILDDDIYLGAENNFNLFTVR----------------KNSEGATDEERG 897
D ++A+E LD D +G + N+F R K + +
Sbjct: 1004 DDVMKRHITALECLDHDTVIGGDKFGNVFVNRIPFTLSKQADQEWSLVKYQDHYLNSAGN 1063
Query: 898 RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHE-QYLF 956
RL+ + E+ L + F G+LV G ++ + + G +G L + + F
Sbjct: 1064 RLKGLCEFFLQDIPTLFFKGTLV-------TGGKESIFYTGLCGSLGFFEPLISKLEVSF 1116
Query: 957 LEKLQTNLRKVIK-------------GVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESF 1003
L+ ++RKV+ + G + ++R + N KN +DGD +E +
Sbjct: 1117 FTALENSIRKVLDPNLEEHDKKRLYCQLLGKDQLKFRGYYNP-----VKNVIDGDFVEYY 1171
Query: 1004 LDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
+L +I+ ++ + ++ +++ ++
Sbjct: 1172 FELDPRTKTKIATELDRTPRDIERKIADI 1200
>gi|297598550|ref|NP_001045829.2| Os02g0137400 [Oryza sativa Japonica Group]
gi|255670583|dbj|BAF07743.2| Os02g0137400 [Oryza sativa Japonica Group]
Length = 845
Score = 124 bits (312), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 110/413 (26%), Positives = 187/413 (45%), Gaps = 22/413 (5%)
Query: 295 NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIK-GMWSL 353
+L PI+D V +L + QV T G +LRI+R G+ I+E A L +W++
Sbjct: 28 SLMPIMDMRVANLFDEETPQVFTACGRGPRSTLRILRPGLAISEMARSMLPAEPIAVWTV 87
Query: 354 RSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVT 413
+ + +D FD ++VVSF + T +L++ + +EE F T +L + L+QV
Sbjct: 88 KKNINDMFDAYIVVSFANVTLVLSIG--ETIEEVSDSQFLDTTHSLAVSLLGEDSLMQVH 145
Query: 414 SGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTEVK 472
+R + R NEW++P ++ +N QV++A GG L+Y E+ G L EV+
Sbjct: 146 PNGIRHIREDGR--VNEWRTPGKKTITKVGSNRLQVVIALSGGELIYFEMDMTGQLMEVE 203
Query: 473 HAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPR 531
+ +++CL I P+ E S+ AVG + D ++RI S+ PD L P
Sbjct: 204 KQDMSGDVACLAIAPVPEGRQRSRFLAVGSF-DNTIRILSVDPDDCLQPLSVQSVSSAPE 262
Query: 532 SVLLCAFEGIS------------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPIT 579
S++ + +L L +G L ++M TG+L+D + LG +P
Sbjct: 263 SLMFLEVQASVGGEDGADHPANLFLNAGLQNGVLFRTNVDMVTGQLSDTRSRFLGLRPPK 322
Query: 580 LRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEG 639
L + + S RP + Y L + ++ + F+S + +
Sbjct: 323 LFPCIVSHRQAMLCLSSRPWLGYIHQGHFLLTPLSCDTLESAASFSSDQCSEGVVAVAGD 382
Query: 640 ELTIGTIDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAIC-SLKNQSCAEESE 690
L I TI+ + + + +IPL PR+ + + A+ S K AEE E
Sbjct: 383 ALRIFTIEHLGETFNETAIPLRYTPRKFVILPKKKYLAVIESDKGALSAEERE 435
Score = 93.6 bits (231), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 91/365 (24%), Positives = 166/365 (45%), Gaps = 44/365 (12%)
Query: 694 VRLLDDQTFEFISTYPLDTFEYGCSILSCSFSD-DSNVYYCVGTAYVL---PEENEPTKG 749
+R+LD ++ + L E SI + +F D + VGTA L P+ N + G
Sbjct: 494 IRILDPKSRDTTCLLELQDNEAAVSICTVNFHDKEHGTLLAVGTAKGLQFWPKRNL-SAG 552
Query: 750 RILVF-IVEDGK-LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQ 807
I ++ V++G+ L+L+ + + + +L F G+LLA + ++LY D G R+L
Sbjct: 553 FIHIYKFVDEGRSLELLHKTQVEEVPLALCQFQGRLLAGVGSVLRLY-----DLGKRKLL 607
Query: 808 SECGHHGHILALY-VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEI 866
+C + + + T D I VGD+ +S Y+ +E + A D W++A
Sbjct: 608 RKCENKLFPRTIVSIHTYRDRIYVGDMQESFHYCKYRRDENQLYIFADDSVPRWLTAANH 667
Query: 867 LDDDIYLGAENNFNLFTVR------------------KNSEGATDEERGRLEVVGEYHLG 908
+D D GA+ N++ R K +G + ++E + ++H+G
Sbjct: 668 IDFDTMAGADKFGNIYFARLPQDLSDEIEEDPTGGKIKWEQGKLNGAPNKVEEIVQFHVG 727
Query: 909 EFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIG-VIASLPHEQYLFLEKLQTNLRKV 967
+ V + SL+ G +I+GTV G +G ++A E F L+ +LR+
Sbjct: 728 DVVTCLQKASLI-------PGGGECLIYGTVMGSVGALLAFTSREDVDFFSHLEMHLRQE 780
Query: 968 IKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCK 1027
+ G +H +RS K+ +DGDL E F L +I+ ++ + E+ K
Sbjct: 781 HPPLCGRDHMAYRS-----AYFPVKDVIDGDLCEQFPSLPADMQRKIADELDRTPGEILK 835
Query: 1028 RVEEL 1032
++E++
Sbjct: 836 KLEDI 840
>gi|294875343|ref|XP_002767276.1| spliceosome factor, putative [Perkinsus marinus ATCC 50983]
gi|239868839|gb|EEQ99993.1| spliceosome factor, putative [Perkinsus marinus ATCC 50983]
Length = 1258
Score = 124 bits (311), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 174/744 (23%), Positives = 300/744 (40%), Gaps = 115/744 (15%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
++ V +G I +LE FR G +D+L + ++ + +L+++ + + + + + G
Sbjct: 53 VISVNCFGLIRSLETFRLVGANRDYLLVGSDSGRIVILEYNT-TKNIFDKVHQETYGKTG 111
Query: 61 -RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL-- 117
R T GQ +DP R + + V + Q + + LE + I F
Sbjct: 112 CRRTVPGQYLAVDPKGRSCMVSAVERQKFVYILNRDLQSRLTISSPLEAHKSYTIVFATV 171
Query: 118 ---YGCAKPTIVVL---YQD-----NKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADL 166
G P + Y D + + YE+ L + + A
Sbjct: 172 GLDVGFDNPQFAAIECQYDDFGKKGPNPQKLLTIYEMDLGVNHVTRK--HSDKIPFTAHD 229
Query: 167 LIPVPPPL---CGVLIIGEETIVYCSAN---AFKAIPIRPSITKAYGRVDADGSR----- 215
LIPVP GVL+ E + Y A+P R G + SR
Sbjct: 230 LIPVPGGTDGPGGVLLCCENFLSYYKQGHPVLSCAMPRRLDTPAEKGLMVVCWSRHKLKN 289
Query: 216 ---YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQ 272
+L+ G L+ + ++H++ V+ ++ +A +I L ++ S +G+
Sbjct: 290 FFFFLIQSEYGDLYKVTLSHKEGVVSEIQCSYFDSIPVAISICVLKTGFLFAASEFGNHA 349
Query: 273 LIKL------------NLQPDAKGSYVEVLERYV----------NLGPIVDFCVVD---- 306
L + + P+ + + V R + +L PI D V+D
Sbjct: 350 LYQFASLGDVTPALVTSSHPNRENAVVAFKPRTLKNLTPFDELSSLAPITDMKVMDCFST 409
Query: 307 ----LERQGQG-------------QVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IK 348
L+ G G Q+ SG +LRI+R+G+ + E + EL G
Sbjct: 410 QTQVLQADGSGMQQTVTTGMSVGCQIYALSGRGPRSALRILRHGLTLGEAGASELPGQPN 469
Query: 349 GMWSLRSSTDDPF--------------DTFLVVSFISETRILAMNLEDELEETEIEGFCS 394
+++++ PF D ++VVSF+ +T L + D + E GF
Sbjct: 470 ALFTIK-----PFGASYAPVAEGEVESDRYIVVSFVDQTLTLLVT-SDNIHEVTDSGFAK 523
Query: 395 QTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATG 454
+ TLF +QV +R V++ R EW++PPG V +A +N SQV++A
Sbjct: 524 EQPTLFAMRMQDKSAIQVMPTGIRHVAAGRRT--TEWRAPPGRQVTMAASNGSQVVIALS 581
Query: 455 GGHLVYLEIG---DGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIF 511
GG + E+ +G L+EV + E++ L + P+ + SQ AV D SVR+
Sbjct: 582 GGEIQLFELDADTNGHLSEVAKRDIGCEVAALTVQPLSSGRTRSQFMAVA-GVDSSVRVL 640
Query: 512 SL-PDLNLITKEHLGGEIIPRSVLLCAF-EGIS---YLLCALGDGHLLNFLLNMKTGELT 566
SL D L SV + F EG + YL L DG L+ +++ TG+L+
Sbjct: 641 SLDSDRPLRQLAAQALRTTASSVCMLQFGEGTTATIYLAIGLEDGVLVRSVVDGVTGQLS 700
Query: 567 DRKKVSLGTQPITLRTFS-SKNTTHVFAASDRPTVIYSSNKKLL-------YSNVNLKEV 618
D+++ LG + +TL + + A S RP + + +N Y V L ++
Sbjct: 701 DQRQRVLGPRRVTLCPITVDGGRPAMVAMSSRPFLCFQANDYSAAGASGGQYQCVPLTKI 760
Query: 619 SHMCPFNSAAFPDSLAIAKEGELT 642
+ F+ AA P S+ +EG T
Sbjct: 761 PALPRFDHAA-PFSIENCQEGICT 783
Score = 76.3 bits (186), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 82/366 (22%), Positives = 159/366 (43%), Gaps = 41/366 (11%)
Query: 694 VRLLDDQTFEFISTYPLDTFEYGCSILSCSFSD--DSNVYYCVGTAYVL----PEENEPT 747
+R++D T LD E ++ C F D+ VGTA + P +
Sbjct: 902 IRVVDPLTASTSFKLDLDVDEAATAMTVCYFYQLKDNRPCLVVGTATGVDPHNPSRSAHG 961
Query: 748 KGRILVFIVEDG-KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWM-LRDDGTRE 805
K I ++ ++ LQLI +G ++ F G+LL A+ + + + + G +
Sbjct: 962 KCYIKTYLYDESYNLQLIHVTPLEGVPSAMYPFEGRLLVALRGSPTVAPVLRIYELGKKR 1021
Query: 806 LQSECGHH-----GHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANW 860
L +C + G I+ L V D I D SI +L +++ + ++ + D
Sbjct: 1022 LLKKCEYKFLPESGGIMWLDVNK--DRIFAADSRDSILVLRWRYSDNQMQVISDDTYPRC 1079
Query: 861 MSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGR-----------LEVVGEYHLGE 909
++A +LD + + + N+ +R + GR ++++G +H+GE
Sbjct: 1080 ITAAAVLDYNTIVVGDKFDNIAVLRVPGDAKDAGAWGRDNDYASGNTFKMDLIGHFHVGE 1139
Query: 910 FVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHE-QYLFLEKLQTNLRKVI 968
+ + ++V G VI+ TV G IG + + ++ FL+ L+ ++R
Sbjct: 1140 TITSLQRVTMV-------AGGAEIVIYSTVLGTIGALYPFSSKREHGFLQALEMHMRNTA 1192
Query: 969 KG--VGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELC 1026
+ G H +RSF + KNF+D DL E + L + +I+ M+ + +E+
Sbjct: 1193 ASPSLTGREHVMYRSFYHP-----IKNFVDADLCEVYYQLPAEKQRQIAVDMDKTPQEVM 1247
Query: 1027 KRVEEL 1032
K++E++
Sbjct: 1248 KKLEDI 1253
>gi|147839354|emb|CAN61260.1| hypothetical protein VITISV_043608 [Vitis vinifera]
Length = 831
Score = 124 bits (310), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 59/67 (88%), Positives = 59/67 (88%)
Query: 46 ELITRAMGDVSDRIGRPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIR 105
E I AMGDVSDRIGRPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIR
Sbjct: 763 EKIYGAMGDVSDRIGRPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIR 822
Query: 106 LEELQVL 112
L L
Sbjct: 823 YFFLSFL 829
Score = 103 bits (258), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 47/53 (88%), Positives = 49/53 (92%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMG 53
MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAE+SE+I G
Sbjct: 483 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAENSEVIASIYG 535
>gi|389586447|dbj|GAB69176.1| splicing factor 3B subunit 3 [Plasmodium cynomolgi strain B]
Length = 1286
Score = 124 bits (310), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 145/713 (20%), Positives = 288/713 (40%), Gaps = 111/713 (15%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
++G I L+ FR G +D++ I ++ + +LQ+ E ++ + R + + G R
Sbjct: 58 VFGIIRCLQTFRLTGSNKDYVVIGSDSGRLVILQFSNEKNDFV-RVHCETYGKSGLRRII 116
Query: 65 NGQIGIIDPDCRLIGLHLYD--GLFKVIPFDNKGQLKEAF-------------------- 102
G+ +DP R + + + ++ D K QL +
Sbjct: 117 PGEYIAVDPKGRALMICAIERQKFVYILNRDTKEQLTISSPLDAHKSHTICHDVVGMDVG 176
Query: 103 -----------NIRLEELQVLDIKFLYGCAKPTIVVLYQDNKDARHVKTYEVALKDKDFV 151
N + + QV + + GC + T++ L++ + HV
Sbjct: 177 FENPIFASIEQNYEMYDKQVTNTNEIDGCTRKTLLCLWEMDLGLNHVIRKHTL------- 229
Query: 152 EGPWSQNNLDNGADLLIPVP---PPLCGVLIIGEETIVYCSA---NAFKAIPIR------ 199
+D+ A LLIP+P GV++ + +VY + + A P R
Sbjct: 230 -------PIDSSAHLLIPIPGGQQGPSGVIVCCDNYLVYKKVEHVDVYCAYPRRLETGQE 282
Query: 200 ---PSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISY 256
+ A R+ L+ G L+ + + HE V + + +A+ I
Sbjct: 283 KNISIVCSALHRI-RKFFFILIQSEFGDLYKIEMDHEDGIVKEITCKYFDTVPVANAICV 341
Query: 257 LDNAVVYIGSSYG-------------DSQLIKLNLQPDAKGSYVE----------VLERY 293
+ + +++ + +G D++ + + P + + + ++++
Sbjct: 342 MKSGSLFVAAEFGNHFFYQFSGIGDDDNEAMCTSKHPSGRNAIIAFRTKKLTNLFLIDQV 401
Query: 294 VNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWS 352
+L PI+D ++D + Q+ G SLRI+++G+ I E A EL G K +W+
Sbjct: 402 YSLSPILDMKILDAKNANSPQIYALCGRGPRSSLRILQHGLSIEELADNELPGRPKFIWT 461
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
++ +D +++VSF T IL + + +EE + T+ + N L+QV
Sbjct: 462 IKKDNASDYDGYIIVSFEGSTLILEIG--ETVEEVVDSLLLTNVTTIHVNILYDNTLIQV 519
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDG-ILTEV 471
+R ++ ++ +EW P + AT+N++Q++++ GG L+Y EI + L E+
Sbjct: 520 HDTGIRHING---KVVHEWVPPKNKQIKAATSNSTQIVISLSGGELIYFEIDESHTLVEI 576
Query: 472 KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPR 531
L E CL I + EN + AVG ++ VR+ S+ + L I+P
Sbjct: 577 FRKNLNVETLCLSIQQVEENKLRANFLAVGCLDNV-VRLLSIEKEKYFNQ--LSTFILPN 633
Query: 532 S--------VLLCAF-----EGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPI 578
+ +C + +L L +G LL +++ TG LT+ LG + +
Sbjct: 634 NSSAQDICISQMCELGNDKERKLIFLNIGLNNGVLLRSVVDPITGTLTNHYSKYLGAKNV 693
Query: 579 TLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPD 631
+ K + + ++ + Y K +YS +N + + F+S D
Sbjct: 694 KICPVHLKKNSALLVLCEKTYLCYVHQGKYIYSPLNYDILEYASSFHSEQCSD 746
Score = 70.5 bits (171), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 82/364 (22%), Positives = 152/364 (41%), Gaps = 47/364 (12%)
Query: 694 VRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC--VGTAYVLPEENEPTKGRI 751
++++ + + I L+ E S+ +C +C VGT L +N
Sbjct: 940 IKIIHPISLQTIDKISLEMEEAALSVCACELE----ALHCLIVGTTTNLSLKNRTAPAAA 995
Query: 752 LVFIVED--GKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSE 809
L D KL L+ + + FNG+LLA+I K+++Y G ++L +
Sbjct: 996 LRVYTYDINYKLNLLHITPVEDQPFCFCPFNGRLLASIGNKLRIYAL-----GKKKLLKK 1050
Query: 810 CGHHGHILALY-VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILD 868
C + A+ ++ GD I D+ +S+ + Y + + D W++ EILD
Sbjct: 1051 CEYKDIPEAIISIKVSGDRIFASDIRESVLIFFYDSNMNTLRLISDDIIPRWITCSEILD 1110
Query: 869 DDIYLGAENNFNLFTVRKNSEGATDEERG-------------------RLEVVGEYHLGE 909
+ A+ ++F +R E A EE G RLE + +H+GE
Sbjct: 1111 HHTIMAADKFDSVFVLRVPEE-AKQEEYGISNKCWYGGEIMAGSNKNRRLEHIMSFHVGE 1169
Query: 910 FVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFL-EKLQTNLRKVI 968
V + L P S +I+ T+ G IG +++ L L + L+ LR
Sbjct: 1170 IVTSLQKVKLS---PTSS----ECIIYSTIMGTIGAFIPYDNKEELELTQHLEIILRTEN 1222
Query: 969 KGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKR 1028
+ G H +RS+ + + V +DGDL E F L +++ + + +++ ++
Sbjct: 1223 PPLCGREHIFFRSYYHPVQHV-----IDGDLCEQFSSLPYDVQRKVAADLERTPDDILRK 1277
Query: 1029 VEEL 1032
+E++
Sbjct: 1278 LEDI 1281
>gi|190345965|gb|EDK37945.2| hypothetical protein PGUG_02043 [Meyerozyma guilliermondii ATCC 6260]
Length = 1206
Score = 123 bits (309), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 233/1169 (19%), Positives = 443/1169 (37%), Gaps = 176/1169 (15%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
+ G I ++ R G D L I ++ + +L++D + + S T
Sbjct: 66 VIGVIQNIDRIRKGGSNLDLLVITSDSGRLSILEFDKDELKFFPVVQEPHSKNGMNRTTP 125
Query: 66 GQIGIIDPDCRLIGLHLYDG---LFKVIPFDNKGQLKEAF-NIRLEELQVLDIKFLYGCA 121
G+ +DP R I + + ++K +NK +L ++ L + + G
Sbjct: 126 GEYLCVDPQDRTITIGAIERDKLMYKAQTNNNKLELSSPLESVSKNTLTIQMVSLDTGYE 185
Query: 122 KPTIVVL------------YQDNKDARHVKTYEV------ALKDKDFVEGPWSQNNLDNG 163
P + + Y ++ YE + KD +E P S L
Sbjct: 186 NPMLAAIECNYAHYDASLKYDPQSSNLTLQYYEFEQGLNYVARRKDTLEIPSSSTTL--- 242
Query: 164 ADLLIPVPPPLCGVLIIGEETIVY----CSANAFKAIPIR------PSITKAYGRVDADG 213
+P+P P+ GV++ G I Y + IP R P + A ++ +
Sbjct: 243 ----VPLPTPIGGVIVAGSSFIFYHNPTIDQQLYLPIPSRAGSSPVPIVCYAVHKLKKNN 298
Query: 214 SRYLLGDHAGLLHLLVITH--EKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDS 271
LL + G ++I + + EKVT L + S +++I+ ++ + D
Sbjct: 299 FFILLHNELGDCFRVLIDYDDDSEKVTELSVGYFDTISPSTSINVFKKGYLFANVTNNDK 358
Query: 272 QLIKLN--------------------LQPDAKGSYVEVLERYVNLGPIVDF-------CV 304
L ++ + K + R + L I+D +
Sbjct: 359 MLYQIEDLGDNDSYISSSQFSSLEDVFDGNKKHEFKPRGLRNLALVQIIDSSNPCFGGAL 418
Query: 305 VDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIK-GMWSLRSSTDDPFDT 363
V + ++ +G L++ +GI I+ S L I +++ R S + D
Sbjct: 419 VKTSESKESRIAMITG---HSHLKLKTHGIPISTLVSSPLPMIATSVFTTRLSAESKNDE 475
Query: 364 FLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSST 423
++V+S + ++ L + + + +EE + F + T+ L+Q+ S +R + T
Sbjct: 476 YMVISSSASSKTLVLAIGEVVEEVQDSSFVTDQPTIGVQQVGLKSLIQIYSNGIRHIRQT 535
Query: 424 SRELR-----NEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEY 478
E + +W P G ++ A+ N QVL+ L Y EI D ++ Q
Sbjct: 536 ETEGKITKKTFDWYPPAGITIISASTNQEQVLIGLSNRELCYFEI-DPTDDQLIEYQERI 594
Query: 479 EISCLDINPIGENPSYSQIA------AVGMWTDISVRIFSLPD---LNLITKEHLGGEII 529
E+S I + S+ + A+ TD +V++ SL L +T + L
Sbjct: 595 EMSGGQICALALASSFVNKSQRKSPFALVACTDETVQVISLQQHNCLETLTFQALSANCT 654
Query: 530 PRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTT 589
++ +++L L +G + L TG+ +D + LG + L S + +
Sbjct: 655 SAVIISNETSLVAHL--GLENGLYVRSTLESITGKFSDTRVKYLGADSVKLSAISLRQSQ 712
Query: 590 H--VFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTID 647
V A S P + Y S K + + + F S + + A ELTI TI
Sbjct: 713 QVGVLAVSTYPWLCYRSKNKTRITPLMGANIKCGTSFYSEEIGEGIVGASASELTIFTIG 772
Query: 648 DIQ---------KLHIRSIPLGEHPRRICHQEQSRTFAI--------------------C 678
+ + +L I I L P ++ ++Q+ F I
Sbjct: 773 EDEDNSEMSIEDELAIEDIRLRYSPTKMILEDQT-AFVIESEYGVVSPYQSGEEVDSDYY 831
Query: 679 SLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAY 738
S S +ES V++LD ++ + + L+ + S+ +F + Y VGTA
Sbjct: 832 SAFGYSRKDESWASCVQILDLESKKITFSTELEDNQKPLSLCRMNFGNQK--YLMVGTAK 889
Query: 739 VLP-EENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLY--- 794
L + N K +I+ F++ +L+L+ E ++ F GKLL + + ++LY
Sbjct: 890 DLTFQLNNNPKYKIITFLINGSELELLHYTEVDHPPAAMIPFEGKLLVGMGKYLRLYELG 949
Query: 795 -KWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
K +LR T + I+ + Q + IVVGD S + L Y + A
Sbjct: 950 KKQLLRKSSTL-----VDYLTKIVQITHQGK-QRIVVGDGSNSTTFLKYDSSDNIFVSFA 1003
Query: 854 RDYNANWMSAVEILDDDIYLGAENNFNLFTVR----------------KNSEGATDEERG 897
D ++A+E LD D +G + N+F R K + +
Sbjct: 1004 DDVMKRHITALECLDHDTVIGGDKFGNVFVNRIPFTLSKQADQEWSLVKYQDHYLNSAGN 1063
Query: 898 RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLF 956
R + + E+ L + F G+LV G ++ + + G +G L + F
Sbjct: 1064 RSKGLCEFFLQDIPTSFFKGTLV-------TGGKESIFYTGLCGSLGFFEPLISKSEVSF 1116
Query: 957 LEKLQTNLRKVIK-------------GVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESF 1003
L+ ++RKV+ + G + ++R + N KN +DGD +E +
Sbjct: 1117 FTALENSIRKVLDPNLEEHDKKRLYCQLLGKDQLKFRGYYNP-----VKNVIDGDFVEYY 1171
Query: 1004 LDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
+L +I+ ++ + ++ +++ ++
Sbjct: 1172 FELDPRTKTKIATELDRTPRDIERKIADI 1200
>gi|380488197|emb|CCF37544.1| hypothetical protein CH063_08850 [Colletotrichum higginsianum]
Length = 271
Score = 123 bits (309), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 76/268 (28%), Positives = 134/268 (50%), Gaps = 16/268 (5%)
Query: 782 KLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLI 841
KL+A +++ + +Y++ + L+ + + G+ I + DLM+S++L+
Sbjct: 7 KLVAGLSKTVVVYEYAEESSTSGALRKLATFRPSTFPVDIDVNGNMIGIADLMQSMTLVE 66
Query: 842 Y----KHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERG 897
+ + + ERAR + W ++V L+ +L A+ NL +R+N + T+ ++
Sbjct: 67 FIPAQDGNKAKLVERARHFQYIWATSVCHLEGHSWLEADAQGNLMVLRRNPDAPTEHDQK 126
Query: 898 RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFL 957
++EV E+HLGE VN+ R + P+ + +P TV G + V A + E L
Sbjct: 127 QMEVTSEFHLGEQVNKIRPLDIT---PNENDPIVPKAFLATVEGSLYVFADIKSEYQSLL 183
Query: 958 EKLQTNLRKVIKGVG--------GLNHEQWRSFNNEKKTVDAK-NFLDGDLIESFLDLSR 1008
+ Q L V+K +G GL+ WR F N K+ D F+DG+LIE FLDL
Sbjct: 184 LQFQERLADVVKTLGQAGGDSTSGLSFMAWRGFRNAKRAADGPFRFVDGELIERFLDLDE 243
Query: 1009 TRMDEISKTMNVSVEELCKRVEELTRLH 1036
+ + + + + +VE + VEEL R+H
Sbjct: 244 AKQEAVVQGLGPTVENMRNLVEELKRMH 271
>gi|156095699|ref|XP_001613884.1| Splicing factor 3B subunit 3 [Plasmodium vivax Sal-1]
gi|148802758|gb|EDL44157.1| Splicing factor 3B subunit 3, putative [Plasmodium vivax]
Length = 1230
Score = 123 bits (308), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 150/700 (21%), Positives = 289/700 (41%), Gaps = 85/700 (12%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
I+G I L+ FR G +D++ I ++ + +LQ+ E ++ + R + + G R
Sbjct: 58 IFGIIRCLQTFRLTGSNKDYVVIGSDSGRLTILQFSNEKNDFV-RVHCETYGKSGLRRII 116
Query: 65 NGQIGIIDPDCRLIGLHLYD--GLFKVIPFDNKGQLKEAFNIRLEELQVL--DIKFL-YG 119
G+ +DP R + + + ++ D K QL + + + + D+ + G
Sbjct: 117 PGEYIAVDPKGRALMICAIERQKFVYILNRDTKEQLTISSPLDAHKSHTICHDVVGMDVG 176
Query: 120 CAKPTIVVLYQDNK--DARHVKTYEVALKDKDFVEGPWSQN------------NLDNGAD 165
P + Q+ + D + T E+ + + W + +D A
Sbjct: 177 FENPMFASIEQNYEALDKQVTNTSEIDSYTRKTLLSLWEMDLGLNHVIRKYTFPIDASAH 236
Query: 166 LLIPVP-----PPLCGVLIIGEETIVYCS---ANAFKAIPIRPSI--TKAYGRVDADGSR 215
LLIP+P P GV++ + +VY A+ + A P R K V + R
Sbjct: 237 LLIPIPGGQQGPS--GVIVCCDNFLVYKKVDHADVYCAYPRRLETGQEKNLSIVCSTLHR 294
Query: 216 Y------LLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYG 269
L+ G L+ + + HE V + + +A+ I + + +++ + +G
Sbjct: 295 IRKFFFILIQSELGDLYKIEMEHEDGVVKEITCKYFDTVPVANAICVMKSGSLFVAAEFG 354
Query: 270 -------------DSQLIKLNLQPDAKGSYVE----------VLERYVNLGPIVDFCVVD 306
D++ + + P + + + ++++ +L PI+D V+D
Sbjct: 355 NHFFYQFSGIGDEDNEAMCTSKHPSGRNAIIAFRTKKLTNLFLIDQVYSLSPILDMKVID 414
Query: 307 LERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTDDPFDTFL 365
+ Q+ G SLRI+++G+ I E A EL G K +W+++ +D ++
Sbjct: 415 AKNASSPQIYALCGRGPRSSLRILQHGLSIEELADNELPGRPKFIWTIKKDNASDYDGYI 474
Query: 366 VVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSR 425
+VSF T IL + + +EE + T+ + N L+QV +R ++
Sbjct: 475 IVSFEGSTLILEIG--ETVEEVVDSLLLTNVTTIHVNILYDNSLIQVHDAGIRHING--- 529
Query: 426 ELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDG-ILTEVKHAQLEYEISCLD 484
++ +EW P + AT+N +Q++++ GG L+Y EI + L E L E CL
Sbjct: 530 KVIHEWVPPKNKQIKAATSNCAQIVISLSGGELLYFEIDESHTLVETFRKNLNVETLCLS 589
Query: 485 INPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVL---LCAFE-- 539
I + EN + AVG ++ VR+ S+ + L I+P + +C E
Sbjct: 590 IQQVQENKLRANFLAVGCLDNV-VRLLSIEKEKYFNQ--LSTFILPNNSSAQDICITEMS 646
Query: 540 --------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHV 591
+ +L L +G LL +++ TG LT+ LG + + + K +
Sbjct: 647 ELGNDKERKLLFLNIGLNNGVLLRSVVDPITGTLTNHYSKYLGAKNVKICPVHVKKNAAL 706
Query: 592 FAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPD 631
++ + Y K +YS +N + + F+S D
Sbjct: 707 LVLCEKTYLCYVHQGKYIYSPLNYDILEYASSFHSEQCSD 746
Score = 70.1 bits (170), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 82/364 (22%), Positives = 151/364 (41%), Gaps = 47/364 (12%)
Query: 694 VRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC--VGTAYVLPEENEPTKGRI 751
++++ + I L+ E S+ +C +C VGT L +N
Sbjct: 884 IKIIHPVNLQTIDKISLEMEEAALSVCACELE----ALHCLIVGTTTSLSLKNRSAPAAA 939
Query: 752 LVFIVED--GKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSE 809
L D KL L+ + + FNG+LLA+I K+++Y G ++L +
Sbjct: 940 LRVYTYDINYKLNLLHITPVEDQPFCFCPFNGRLLASIGNKLRIYAL-----GKKKLLKK 994
Query: 810 CGHHGHILALY-VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILD 868
C + A+ ++ GD I D+ +S+ + Y + + D W++ EILD
Sbjct: 995 CEYKDIPEAIISIKVSGDRIFASDIRESVLIFFYDANMNTLRLISDDIIPRWITCSEILD 1054
Query: 869 DDIYLGAENNFNLFTVRKNSEGATDEERG-------------------RLEVVGEYHLGE 909
+ A+ ++F +R E A EE G RLE + +H+GE
Sbjct: 1055 HHTIMAADKFDSVFVLRVPEE-AKQEEYGISNKCWYGGEIMAGSNKNRRLEHIMSFHVGE 1113
Query: 910 FVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFL-EKLQTNLRKVI 968
V + L P S +I+ T+ G IG +++ L L + L+ LR
Sbjct: 1114 IVTSLQKVKLS---PTSS----ECIIYSTIMGTIGAFIPYDNKEELELTQHLEIILRTEN 1166
Query: 969 KGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKR 1028
+ G H +RS+ + + V +DGDL E F L +++ + + +++ ++
Sbjct: 1167 PPLCGREHIFFRSYYHPVQHV-----IDGDLCEQFSSLPYDVQRKVAADLERTPDDILRK 1221
Query: 1029 VEEL 1032
+E++
Sbjct: 1222 LEDI 1225
>gi|164658491|ref|XP_001730371.1| hypothetical protein MGL_2753 [Malassezia globosa CBS 7966]
gi|159104266|gb|EDP43157.1| hypothetical protein MGL_2753 [Malassezia globosa CBS 7966]
Length = 919
Score = 123 bits (308), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 190/827 (22%), Positives = 336/827 (40%), Gaps = 105/827 (12%)
Query: 215 RYLLGDHAGLLHLLVITHEKEKVTGLKIELL--GETSIAST---ISYLDNAVVYIGSSYG 269
R LL G +++L ++ + + LL +S+A++ +S L V++ S+ G
Sbjct: 130 RVLLSTTRGSMYILTVSSSTNAWETIHMHLLRLQMSSVATSPQGLSSLGEDFVFLASTTG 189
Query: 270 DSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRI 329
DS L ++ P Y R+ + PIVD V+D +++T SGA SLR
Sbjct: 190 DSVLWHVDSSPSLHEVY-----RWPSTAPIVDM-VLD-NSSVMDRIITASGAGPTSSLR- 241
Query: 330 VRNGIGINEQASVELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEI 389
I +A IK P D V + S ++ N ++ E
Sbjct: 242 -----AIMYKALTSTLFIK-----------PMDLVSVYALESALVLVHRNSASIVQLNEH 285
Query: 390 EGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNE------WKSPPGYSVNVAT 443
E TQ HDA +++ + + +TS+++R W +P S
Sbjct: 286 E---DPTQISIVHDA---RVLHAGMLAGLYLVATSKDIRTYGRGKTVWTAPDEISA-CHV 338
Query: 444 ANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMW 503
+ V++ L+ L + + + ISCL G S I A G W
Sbjct: 339 LDDKHVVIGLVTPQLLVLHHDHHNWSILHSLRTHAPISCL----FG---CASAIVA-GFW 390
Query: 504 TDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAF--EGISYLLCALGDGHLLNFLLNMK 561
D +V+ + P + H+ +P S+L AF G ++ + DGH+ +
Sbjct: 391 -DRTVQKYVWPSMEG-DPIHIACTSLPTSILQHAFTPHGENHTVVGYADGHVDVY----- 443
Query: 562 TGELTDRKKV----SLGTQPITLRT-----FSSKNTTHVFAASDRPTVIYSSNKKLLYSN 612
D+ K+ +GT+PI L T NT + A+ +Y + S
Sbjct: 444 ----NDQDKIVHTCQVGTEPIHLTTAPLSMMELPNTHAIVASGSYTCFLYPQDSAFRASI 499
Query: 613 VNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPL-GEHPRRICHQEQ 671
++ V ++++ ++ A + L + I + L I+S+ L G+ P I
Sbjct: 500 WLIEHVRAAACVDASSQALTMVCATQFGLEVHAIHTLHHLDIQSLSLKGQQPTSIA---S 556
Query: 672 SRTFAICSLKNQSCAEESE--MHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSN 729
+ + + E E + +R++ + ++++ L +E S +C +
Sbjct: 557 FGPYIVLTTWPMHAWEHHERGLSTIRVMTKAGLKQVTSHSLHAYERPNS--ACVKKFYNT 614
Query: 730 VYYCVGTAYVLPEENEPTKGRILVFIVED--GKLQLIAEKETKGAVYSLNAFNGK-LLAA 786
Y VGT + + + + GR++ + KLQLI + G VY + G L+AA
Sbjct: 615 PYIVVGTGFYVGGRDSTSSGRLIGYTWSTPLSKLQLIWSCDVPGHVYGVTDVPGSYLVAA 674
Query: 787 INQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEE 846
++ ++ Y DG +L G LA + ++G +VVGD M S+++L +
Sbjct: 675 VDAQVHTYALC---DGKLKLCDRWG--CAFLATCLTSQGSTVVVGDAMHSLTVLQVD-PK 728
Query: 847 GAIEERARDYNANWMSAVEIL--DDDIYLGAENNFNLFTV-RKNSEGATDEERGRLEVVG 903
G + E ARD + W +AV +L + Y+G + N+F R + TD
Sbjct: 729 GQLHEIARDVDPYWTTAVGVLSTEQQQYIGTDIAMNMFVAERVHQTDGTDPWS------- 781
Query: 904 EYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTN 963
H+ F HG +V + G+I +FGT +G +G + +P E L +Q
Sbjct: 782 --HVMHRATAFHHGDMVNTIQSVADGRI---LFGTASGSVGTLMYVPDETASMLWCVQEA 836
Query: 964 LRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDL-SRT 1009
L G L+ E WR+ + + ++ LD +L+ +F SRT
Sbjct: 837 LSHQRHAHGDLSWESWRTLRTDMQICAPRHVLDAELLTTFFTCESRT 883
>gi|68075683|ref|XP_679761.1| splicing factor 3b, subunit 3, 130kD [Plasmodium berghei strain
ANKA]
gi|56500578|emb|CAH95367.1| splicing factor 3b, subunit 3, 130kD, putative [Plasmodium berghei]
Length = 1216
Score = 123 bits (308), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 157/699 (22%), Positives = 289/699 (41%), Gaps = 84/699 (12%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
I+G I +LE+FR G +D++ I ++ + +L++D E ++ I R + + G R
Sbjct: 58 IFGIIRSLEIFRLMGSNKDYIAIGSDSGRLVILKYDDEKNDFI-RVHCETYGKSGIRRII 116
Query: 65 NGQIGIIDPDCRLIGLHLYDG--LFKVIPFDNKGQLKEAFNIRLEELQVLD---IKFLYG 119
G+ IDP R + L + ++ DNK L + + + + + G
Sbjct: 117 PGEYIAIDPKGRALMLCAIEKQKFVYILNRDNKENLTISSPLEAHKSHSICHAVVGLNVG 176
Query: 120 CAKPTIVVLYQDNK--DARHVKTYEVAL----KDKDFVEGPWSQNN--------LDNGAD 165
P V + Q+ + D + + T E + K F E N+ +D A
Sbjct: 177 FENPMFVSIEQNYEALDKQVLNTNEQIMEYPKKGLCFWEMDLGLNHVIKKHTIPIDITAH 236
Query: 166 LLIPVP-----PPLCGVLIIGEETIVY---------CSANAFKAIPIRPSITKAYGRVDA 211
LLIP+P P G+++ E +VY CS + +I+ +
Sbjct: 237 LLIPLPGGQQGPS--GLIVCCENYLVYKKIDHDDIYCSYPRRLEVGEEKNISIVCWTIHR 294
Query: 212 DGSRY--LLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYG 269
+ + L+ G L+ + + HE V + + IA++I L + +++ + +G
Sbjct: 295 IKTFFFILIQSEYGDLYKIEVNHEDGIVKEIICKYFDTVPIANSICVLKSGALFVAAEFG 354
Query: 270 DSQLIKLN-------------LQPDAKGSYVEV-LERYVNLG--------PIVDFCVVDL 307
+ + + P K + + ++ NL PIVD ++D
Sbjct: 355 NHFFYQFSGIGNDSNESMCTSNHPSGKNAIIAFKTQKLKNLYLVDQIYSLPIVDMKILDA 414
Query: 308 ERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTDDPFDTFLV 366
+ Q+ G SLRI+++G+ I E A+ EL G + +W+++ +D +++
Sbjct: 415 KNSNIPQIYALCGRGPRSSLRILQHGLSIEELANNELPGKPRYIWTIKKDNSSEYDGYII 474
Query: 367 VSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRE 426
VSF T IL + + +EE + T+ + N +QV +R ++ +
Sbjct: 475 VSFEGNTLILEIG--ETVEEVYDSLLLTNVTTIHINLLYDNSFIQVYDTGIRHING---K 529
Query: 427 LRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDG-ILTEVKHAQLEYEISCLDI 485
+ EW P +N AT+N SQ++++ GG L+Y EI + LTE+ + EI CL I
Sbjct: 530 IVQEWVPPKNKQINAATSNGSQIVISLSGGELIYFEIDESHTLTEIFRKNINVEILCLSI 589
Query: 486 NPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVL---LCAFE--- 539
I +N + AVG ++ VR+ S+ + L I+P + +C E
Sbjct: 590 QQIQQNKLRASFLAVGCLDNV-VRLLSIEKDQYF--KQLSTYILPNNSSPQDICISEMKE 646
Query: 540 -------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVF 592
I YL L G LL +++ G L++ LG + + + +
Sbjct: 647 LGSQKEHTILYLNIGLNTGVLLRSVIDPICGTLSNHYSKYLGAKSVKICHVQVNKNPALL 706
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPD 631
S++ + Y K +YS +N + + F S D
Sbjct: 707 VLSEKTYLCYVYQGKYIYSPLNYDVLEYASSFYSEQCSD 745
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 75/360 (20%), Positives = 149/360 (41%), Gaps = 40/360 (11%)
Query: 694 VRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKG-RIL 752
+++++ + I L+ E S+ +C + T L N P+ R+
Sbjct: 871 IKIINPINLQVIDKISLELEEAALSVCACELEALHCLIVGTTTNMTLKSRNVPSASLRVY 930
Query: 753 VFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGH 812
+ + + KL L+ + Y FNGK++ ++ K+++Y G ++L +C +
Sbjct: 931 TYDI-NYKLNLLHITPIEDQPYCFCPFNGKVIVSVGNKLRIYAL-----GKKKLLKKCEY 984
Query: 813 HGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIY 872
A+ + I D+ +S+ + Y + I + D W++ EILD
Sbjct: 985 KDIPEAIVSIKVSNRIFASDIRESVLIFFYDSNQNVIRLISDDIIPRWITCSEILDHHTI 1044
Query: 873 LGAENNFNLFTVRKNSEGATDEERG-------------------RLEVVGEYHLGEFVNR 913
+ A+ ++F +R E A EE G ++E + +H+GE V
Sbjct: 1045 IAADKFDSVFILRVPEE-AKQEEYGIANKCWYGGEVINSSTKNRKMEHIMSFHIGEIVTS 1103
Query: 914 FRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFL-EKLQTNLRKVIKGVG 972
+ L P S +I+ T+ G IG +++ L L + L+ LR +
Sbjct: 1104 LQKVKLS---PVSS----ECIIYSTIMGTIGAFIPYDNKEELELTQHLEIILRTEKHALC 1156
Query: 973 GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
G H +RS+ + ++ +DGDL E F L +I + + +E+ +++E++
Sbjct: 1157 GREHIFFRSYYHP-----VQHVIDGDLCEQFSSLPFEVQRKIGSDLEKTPDEILRKLEDI 1211
>gi|167380951|ref|XP_001733297.1| DNA repair protein xp-E [Entamoeba dispar SAW760]
gi|165902459|gb|EDR28278.1| DNA repair protein xp-E, putative [Entamoeba dispar SAW760]
Length = 349
Score = 122 bits (306), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 74/235 (31%), Positives = 133/235 (56%), Gaps = 5/235 (2%)
Query: 805 ELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIY---KHEEGAIEERARDYNANWM 861
+LQ + + ++ LYV+T G+ I+VGDLMKSIS+ + + + + E +RD+ A++
Sbjct: 114 KLQEKGSCNVKLIGLYVKTMGNKILVGDLMKSISVYSFDNNGNNKNCLNEVSRDFYASYT 173
Query: 862 SAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVM 921
+A+E +D++ YL +++N NL NS G + ER RL H+GE +N GS+
Sbjct: 174 TAIEFVDENCYLSSDSNSNLLVFNTNSTG-NESERFRLNNCAHIHVGECINVMCKGSIAP 232
Query: 922 RLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKG-VGGLNHEQWR 980
+ Q ++FG V G IG I +P+E Y L K+Q + +KG V ++W+
Sbjct: 233 THSTYETIQKKCILFGGVTGYIGGICEIPNEIYDILIKVQNQILLQMKGIVECTTPDEWK 292
Query: 981 SFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRL 1035
++ K + + N +DG+++ES+L++S+ + EI+ V+ E++ +E + L
Sbjct: 293 KVIDDWKRMPSSNIIDGNIVESYLEMSKEKQCEIAHLSGVNEEQINDIIENMISL 347
>gi|449296290|gb|EMC92310.1| hypothetical protein BAUCODRAFT_151722 [Baudoinia compniacensis UAMH
10762]
Length = 1224
Score = 122 bits (305), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 164/779 (21%), Positives = 330/779 (42%), Gaps = 85/779 (10%)
Query: 296 LGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIK--GMWSL 353
L P++ V +L + Q+ G L+ +R+G+ INE S L ++ +WSL
Sbjct: 433 LNPLMRTKVDNLTGEDAPQIYAIQGTGNSSVLKTMRHGLEINEIVSSPLGNLQYDNLWSL 492
Query: 354 RSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVT 413
+ D + ++L++S + + +++ DE+E E F + T+ + LVQV
Sbjct: 493 KHRAVDDYHSYLLLSSGYGDKTIVLSIGDEVETMENSPFLTNRATITANQMGDATLVQVH 552
Query: 414 SGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTEVK 472
+ + +S NEW +P ++ A++N Q+LL L + +G DG+L +++
Sbjct: 553 ARGI--LSVLESGALNEWPAPAHRTIVAASSNDRQLLLGLSSSELAFFFMGDDGVLNQLE 610
Query: 473 H-AQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLP-DLNLITKEHLGGEIIP 530
++ +I+ L + ++ A VG D ++R++S+ + L + +P
Sbjct: 611 EMPEMSGKITALSVGQTPRGQQQAKYAVVGC-DDCTIRVYSIELETALEPRSVQALSAVP 669
Query: 531 RSVLLCAFEG------ISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFS 584
S+ + ++Y+ L G L +++ TGEL + + LG +P +
Sbjct: 670 TSIEVVEMLDPSSNVLVNYVHIGLQSGLYLRAVIDDVTGELGEVRTRFLGARPTRIFPVE 729
Query: 585 SKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGEL--T 642
+ A+S RP + Y++ + + +V PF + A+ + L T
Sbjct: 730 IAEEQGIVASSSRPWLGYNNKGTYTLTPLVTGQVEAARPFVTEHLRGFCAVQGQSLLIFT 789
Query: 643 IGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAE--------------- 687
+ T D +L + L PR + S + + + A+
Sbjct: 790 LPTTDG--RLSSSEVGLKYTPRAMTRHPYSPVWYVAQSDGNTLAQGTKDQLLSADGAMNG 847
Query: 688 ------------ESEMHFVRLLD--DQTFEFISTYPLDTFEYGCSILSCSF----SDDSN 729
S H+ + D F +T+ ++ F + L C+ S +
Sbjct: 848 VDEDVSKQLGLSRSAGHWASCIQAVDPIFAKAATHTIE-FTDNEAALCCAAVPFESKNWE 906
Query: 730 VYYCVGTAYVL----PEENEPTKGRILVF-IVEDGK-LQLIAEKETKGAVYSLNAFNGKL 783
VY VGT L + + +G + ++ ++EDG+ ++L+ + +Y+++AF G+L
Sbjct: 907 VYLAVGTGQHLRPGVSAKGQAPRGYVHIYKLLEDGRNMELVHKTPFDAPIYAVHAFQGRL 966
Query: 784 LAAINQKIQLYKWMLRDDGTRELQSECGHHGHILA---LYVQTRGDFIVVGDLMKSISLL 840
+ + LY D G + L + G ++A + +RG+ ++ D+ +SI+ +
Sbjct: 967 ALGVGADLFLY-----DVGLKSLLRKS--RGTVVANTITSIDSRGNRLICADVSESITYV 1019
Query: 841 IYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR--KNSEGATDEERGR 898
++K + + D W + ++D + G + NL+ VR + + +DEE
Sbjct: 1020 VFKPKHNRMIGFVDDVIQRWTTTAAMIDYETSAGGDKFGNLWVVRCPEQASKESDEEGLG 1079
Query: 899 LEVVGEY-HLG------EFVNRFRHGSLVMRLPDSDV--GQIPTVIFGTVNGVIGVIAS- 948
+V E +LG E + F + M + + + G + + + G +G++
Sbjct: 1080 GYIVNERSYLGGAPYRLELRSHFFTQDIPMSIQRTALVAGGQEVLFWSGLQGTLGMLVPF 1139
Query: 949 LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLS 1007
+ E F +L+T +R + G +H +RS+ V K +DGDL E F+ LS
Sbjct: 1140 VSREDVEFFTQLETLIRAEEPPLAGRDHLMYRSY-----YVPVKGVIDGDLCERFMKLS 1193
>gi|237837399|ref|XP_002367997.1| splicing factor 3B subunit 3, putative [Toxoplasma gondii ME49]
gi|211965661|gb|EEB00857.1| splicing factor 3B subunit 3, putative [Toxoplasma gondii ME49]
gi|221488748|gb|EEE26962.1| splicing factor 3B subunit, putative [Toxoplasma gondii GT1]
gi|221509241|gb|EEE34810.1| splicing factor 3B subunit, putative [Toxoplasma gondii VEG]
Length = 1233
Score = 122 bits (305), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 117/512 (22%), Positives = 211/512 (41%), Gaps = 66/512 (12%)
Query: 223 GLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGD------------ 270
G ++ + I+HE+ V + +A+ + L + +++ S +G+
Sbjct: 306 GDIYKVEISHEEGVVREVVCRYFDTVPVANALCVLKSGYLFVASEFGNHLFYQFTGIGSD 365
Query: 271 --------------SQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVV 316
+I +P + V+ L+ +L PI D V+D + G QV
Sbjct: 366 ASDPRCSSTHPLGREAIIAFKPRPLRNLALVDELQ---SLSPITDLKVLDAQGTGAPQVY 422
Query: 317 TCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTDDPFDTFLVVSFISETRI 375
G +LRI+++G+G+ E A EL G + +W+ + S + FD ++ V+F +
Sbjct: 423 VLCGKGPRSTLRILQHGLGVEEMADNELPGRARAVWTTKLSHQNAFDGYIFVAFDGSS-- 480
Query: 376 LAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPP 435
L + + D +EE F + +L + +QV +R + + R NEW++P
Sbjct: 481 LVLQIGDTVEEVTDSAFLTNVSSLLVALMYDDSFIQVHETGIRHILKSKRV--NEWRAPG 538
Query: 436 GYSVNVATANASQVLLATGGGHLVYLEIGDG-ILTEVKHAQLEYEISCLDINPIGENPSY 494
G + A AN Q++++ GG LV E+ D L E + E +C+ + I +
Sbjct: 539 GRRIKAAAANERQLVISLAGGELVLFEVDDAHTLVETARRNINVESTCMSMQAIPKGRLR 598
Query: 495 SQIAAVGMWTDISVRIFSL-PDLNL--ITKEHLGGEIIPRSVLLCAFEGIS--------- 542
+ AVG D VRI SL D NL ++ + L + P SV L G+
Sbjct: 599 ASFLAVG-GLDNMVRILSLEKDRNLRQLSTQLLPNDATPESVCLATLTGLGANSTDAGKS 657
Query: 543 ------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASD 596
YL L G ++ +++ G L D++ LG + + + + + A S+
Sbjct: 658 QDNGVLYLHVGLNTGVMIRSVVDPVLGTLLDQRSRFLGGRAVRFHAVTLQGQPAILALSE 717
Query: 597 RPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRS 656
+ + Y+ KL +N + + F S D G L I + + ++
Sbjct: 718 KSWLCYTFQHKLHCIPLNYDPLECVASFCSEQCTDGFVAIAGGSLRIFRCQRLGETFGQT 777
Query: 657 I-PLGEHPRRICHQEQSRTFAICSLKNQSCAE 687
+ PL PR A+ +L + S AE
Sbjct: 778 VLPLSFTPR-----------AMAALPHPSAAE 798
Score = 90.9 bits (224), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 92/364 (25%), Positives = 158/364 (43%), Gaps = 47/364 (12%)
Query: 694 VRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNV-YYCVGT--AYVLPEENEPTKGR 750
+R+++ I L+T E + LSC F + + VGT A L + P
Sbjct: 887 IRIVNPLMAMTIDKVSLETDE---AALSCCFCEMEGLPLLVVGTVTAMTLKPKKVP-HAS 942
Query: 751 ILVFIVEDG-KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSE 809
I VF +D L L+ + +L AF G LLA + K++LY G + L +
Sbjct: 943 IKVFSYDDKFSLSLVHSTPVEDYPMALTAFRGMLLAGVGHKLRLYAL-----GRKRLLKK 997
Query: 810 CGHHGHILAL-YVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILD 868
C + + +++ GD + VGD+ +S+ ++ Y+ E A D W++ E+LD
Sbjct: 998 CEYKNLPCGVAFIRVAGDRLFVGDVRESVHVMRYRLSENLFYVLADDVVPRWLTKGEVLD 1057
Query: 869 DDIYLGAENNFNLFTVRKNSEGATDE--------ERG----------RLEVVGEYHLGEF 910
++ A+ ++F R SE DE RG +L+ + +H+GE
Sbjct: 1058 YHTFVAADKFDSVFICRVPSEAKEDELGDTTGLRLRGDTTYLTDKCFKLQSLLHFHIGEI 1117
Query: 911 VNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASL--PHEQYLFLEKLQTNLRKVI 968
V +L +S +++GT+ G IG + HE LF L+ +R
Sbjct: 1118 VTALERATLTSAASES-------IVYGTIMGSIGSFSPFLTKHELDLFTH-LEMVMRSEK 1169
Query: 969 KGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKR 1028
+ G H +RS+ + AKN +DGDL ES+ L I++ + ++ K
Sbjct: 1170 PPLAGREHIMFRSYYHP-----AKNTVDGDLCESYALLPYEDQKRIAQDFEKTPADILKH 1224
Query: 1029 VEEL 1032
+E++
Sbjct: 1225 LEDI 1228
>gi|297829750|ref|XP_002882757.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297328597|gb|EFH59016.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 1384
Score = 121 bits (304), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 128/549 (23%), Positives = 241/549 (43%), Gaps = 96/549 (17%)
Query: 295 NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINE--QASVELQGIKGMWS 352
N+ PI+DF V+D + + + Q+ C G ++GSLRI+R+GI + + + + QGI G W+
Sbjct: 465 NIAPILDFSVMDDQNEKRDQIFACCGVTREGSLRIIRSGINVEKLLKTAPVYQGITGTWT 524
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
++ D + +FLV+SF+ ETR+L++ L + + T+ GF S TL C LVQ+
Sbjct: 525 VKMKLTDVYHSFLVLSFVEETRVLSVGLSFK-DVTDSVGFQSDVCTLACGLVADGLLVQI 583
Query: 413 TSGSVRLVSST----------SRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLE 462
++RL T S + W P S+++ + ++++T + +
Sbjct: 584 HQDAIRLCMPTMDAHSDGIPVSSPFFSSW-FPDNVSISLGAVGQNLIVVSTSNPCFLSIL 642
Query: 463 IGDGI------LTEVKHAQLEYEISCLDI--------NPIGENPSYSQIAAV--GMWTDI 506
+ + E++ L+YE+SC+ + +P S AA+ GM
Sbjct: 643 GVKSVSSQCCEIYEIQRVTLQYEVSCISVPQKHIGKKRSCASSPDNSCKAAIPSGMEQGY 702
Query: 507 ---------SVRIFSLPD---------LNLITKEHLGGEI----IPRSVLLCAFEGISYL 544
SV + S + L++ + G + IP+ V L + + Y+
Sbjct: 703 SFLIGTHKPSVEVLSFSEDGVGVRVLASGLVSLTNTMGAVISGCIPQDVRLVLVDQL-YV 761
Query: 545 LCALGDGHLLNF---LLNMKTG--------------ELTDRKKVSL------------GT 575
L L +G LL F L + +G ++ KK +L G
Sbjct: 762 LSGLRNGMLLRFEWPLFSNASGLNCPDYFSYCKEEMDIVVGKKDNLPINLLLIATRRIGI 821
Query: 576 QPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAI 635
P+ L FS + + A SDRP ++ ++ + L Y++++ + +H P S+ P +
Sbjct: 822 TPVFLVPFSDSLDSDIIALSDRPWLLQTARQSLSYTSISFQPSTHATPVCSSECPQGILF 881
Query: 636 AKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLK-NQSCAEESEMHFV 694
E L + + ++ + + LG PR++ + +S+ + +C + +
Sbjct: 882 VSENCLHLVEMVHSKRRNAQKFHLGGTPRKVIYHSESKLLIVMRTDLYDTCTSD-----I 936
Query: 695 RLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAY-----VLPE-ENEPTK 748
+D + +S+Y L E G S+ ++ + VGT+ +LP E E TK
Sbjct: 937 CCVDPLSGSVLSSYKLKPGETGKSMELVRVGNEHVLV--VGTSLSSGPAILPSGEAESTK 994
Query: 749 GRILVFIVE 757
GR+++ +E
Sbjct: 995 GRLIILCLE 1003
Score = 53.5 bits (127), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 40/146 (27%), Positives = 67/146 (45%), Gaps = 18/146 (12%)
Query: 882 FTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQI---------- 931
F+ R + E ++ E L Y++GE + G + +LP DV +
Sbjct: 1204 FSPRDDPEYSSPESNLNLNCA--YYMGEIAMAIKKGCNIYKLPADDVLRSYGLSKSIDTA 1261
Query: 932 -PTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNL--RKVIKGVGGLNHEQWRSFNNEKKT 988
T+I GT+ G I V A + E+Y LE +Q L + V G +H ++R N +
Sbjct: 1262 DDTIIAGTLLGSIFVFAPISSEEYELLEAVQAKLGIHPLTAPVLGNDHNEFRGRENPSQ- 1320
Query: 989 VDAKNFLDGDLIESFLDLSRTRMDEI 1014
A LDGD++ FL+L+ + + +
Sbjct: 1321 --ATKILDGDMLAQFLELTNRQQESV 1344
>gi|150863836|ref|XP_001382447.2| hypothetical protein PICST_54680 [Scheffersomyces stipitis CBS 6054]
gi|149385092|gb|ABN64418.2| predicted protein [Scheffersomyces stipitis CBS 6054]
Length = 1228
Score = 120 bits (301), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 245/1184 (20%), Positives = 474/1184 (40%), Gaps = 188/1184 (15%)
Query: 8 GRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDR-IGRPTDNG 66
G I +E R G +D + E K L++D + + + GR + G
Sbjct: 68 GVIHKIEKIRIVGTQKDLAVVVGESGKVVFLEFDVDLHRFVPVLQEPYAKTGFGR-VNPG 126
Query: 67 QIGIIDPDCRLIGLHLYDG---LFKVIPFDNKGQLK-----EAFNIRLEELQV--LDIKF 116
+ +DP R I L + +FKV D++G+++ EA + L V LD +F
Sbjct: 127 EYLAVDPQSRCIFLGAIERNKLIFKV-ETDSQGKVELSSPLEAHSKHTLTLSVVALDTQF 185
Query: 117 ---LYGCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEG------PWSQNNLDNGADLL 167
++ + + D K + + L + +G S N + + A L
Sbjct: 186 SNPVFAAIECDYSNYHSDGKVQFDADSSPLLLNYYELDQGLNHIVKKKSTNTIPSSATHL 245
Query: 168 IPVPPPLCGVLIIGEETIVYCSANA-----FKAIPIRPS------ITKAYGRVDADGSRY 216
IP+P + GV + + I+Y + + + +P+R ++ ++ +
Sbjct: 246 IPLPSHVGGVFVCCKNYIIYDNLHKNLERLYLPLPLRKDSEYTVVVSHVVHKLKKNNFFV 305
Query: 217 LLGDHAGLLHLLVITH--EKEKVTGLKIELLGETSIASTISYL-----------DNAVVY 263
LL G L + + + +KE + ++I ++S+++ L ++ + Y
Sbjct: 306 LLQSSMGDLFKVTVEYNSDKELIEDIQIGYFDTIPVSSSLNILKSGFLFANVLNNDKLYY 365
Query: 264 IGSSYGDS-QLIKLNLQPDAKG----------------SYVEVLERYVNLGPIVDFCVVD 306
GD + I+L PD + ++E + +L PI D +V+
Sbjct: 366 QFEKLGDDDENIQLKASPDISSIDEEDRSNRTFTVKALDNLALVEIFTSLSPITDAGIVE 425
Query: 307 LERQGQG----QVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSLRSSTDDPF 361
G Q++T S L+ + +GI + S L I G+ + + D
Sbjct: 426 SISSGTADSLQQMITASS---HSHLKSLVHGIQTSTLVSSPLPIIPTGVLTTKLFADSRS 482
Query: 362 DTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVS 421
D +LV+S +R L +++ + +EE E F + TL + +VQ+ + +R V
Sbjct: 483 DEYLVISSTVASRTLVLSIGEVVEEVENSQFVNDQPTLAVQQVGTSSVVQIYTNGIRHVK 542
Query: 422 STSRELR--------NEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG--DGILTEV 471
T E + +W P G ++ A+ + QV++A + Y E+ D L E
Sbjct: 543 HTRTEDKEQSISRKITDWYPPAGITIVNASTHREQVIIALSNAEICYFEVDATDDQLIEY 602
Query: 472 KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPR 531
+ ++E S I E + + AV +D ++++ SL N + L +
Sbjct: 603 QD-RVEMSNSITSIAICEETANKKNLFAVVGCSDETIQVLSLQPHNCLETLSLQA-LSAN 660
Query: 532 SVLLCAFEGISYLLCALG--DGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKN-T 588
S L + + + +G +G + + +G L+D + LG++P+TL N +
Sbjct: 661 STSLSMLQNDNTTMVHIGMDNGLYVRTSIEEISGNLSDTRIKYLGSKPVTLSVTKLPNGS 720
Query: 589 THVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNS--------AAFPDSLAIAKEGE 640
+ A S RP + Y + + + + ++ F+S A D+ I
Sbjct: 721 KAILAISSRPWICYYNRSEFKVTPLLGVKILKGASFSSEDIGGEGIVALSDNNLIV---- 776
Query: 641 LTIGTID---DI-QKLHIRSIPLGEHPRRICHQEQSRT------------------FAIC 678
TIG D DI Q ++I I L PR++ + ++ F+
Sbjct: 777 FTIGKEDVEFDINQDVNIEKIRLRYTPRKLIIDDDGKSSKVNYIYALQSEYGTKSPFSPS 836
Query: 679 SLKNQSCAE-------------ESEM----HFVRLLDDQTFEFISTYPLDTFEYGCSILS 721
+L ++ E E+E+ ++++D + I T + E S+
Sbjct: 837 NLNSEDDPESEIDQDYYDAFGYETEVDKWASCIQVVDFENSSIIQTVEFSSNESAISMAK 896
Query: 722 CSF--SDDSNVYYCVGTAYVLPE--ENEPTKGRILVFIVEDG-------KLQLIAEKETK 770
F S N+ + + + +N K + F ++ +L+ + + E
Sbjct: 897 LHFVSSGKGNMEHLIIGVTTDRKFLKNSVGKSYLFTFKIQKNTRKSNKKRLEYLHKTEID 956
Query: 771 GAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTREL----QSECGHHGHILALYVQTRGD 826
+ + FNG+LL + + ++LY D G R+L + + I+ L V T G+
Sbjct: 957 CSPTVMIPFNGRLLVGMGKYLRLY-----DIGHRQLLRKSSTNIDYISSIVDL-VHTGGE 1010
Query: 827 FIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR- 885
I GD SI + E A D ++AV LD D +G + N+F R
Sbjct: 1011 RIAFGDSHSSIVFAKFDSAENRFVPFADDIMKRQITAVAALDYDTVIGGDKFGNVFVSRV 1070
Query: 886 ---------------KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQ 930
K E + R + + E+ L + F GS+ +G
Sbjct: 1071 PDSVSKKSDEDWSLLKVQESYLNASPSRTKNLCEFFLSDTPTSFTKGSMT-------IGG 1123
Query: 931 IPTVIFGTVNGVIGVIASLPHEQYL-FLEKLQTNLRKVIK-GVGGLNHEQWRSFNNEKKT 988
+I+ + G +G++ L + + F+ L+ +LR+ + + G++H ++RS+ N
Sbjct: 1124 HDGIIYTGIQGTVGLLLPLSTKSEVQFINSLEQSLRQQMGFNLLGMDHLKFRSYYNP--- 1180
Query: 989 VDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
KN +DGDLIE + +LS++ +I++ +N + +E+ K++ +L
Sbjct: 1181 --VKNVIDGDLIEKYYELSQSLKIKIARELNRTPKEVEKKISDL 1222
>gi|407416557|gb|EKF37684.1| damage-specific DNA binding protein, putative [Trypanosoma cruzi
marinkellei]
Length = 1276
Score = 120 bits (300), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 195/878 (22%), Positives = 339/878 (38%), Gaps = 155/878 (17%)
Query: 236 KVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVN 295
KV + + + +I +++ L IGS D+ +K G +LE N
Sbjct: 376 KVPVVNLAMSSLRTIPDSLAILSGGHCVIGSRMTDTLWLKW-----CTGESGVLLE---N 427
Query: 296 LGPIVDFCV-VDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLR 354
GP+ D + VD R G V+ +G G + + R+ + A+++ + + L
Sbjct: 428 CGPVFDLTIAVDGPRMG---VIASTGVGMGGGVSLQRSAVNFRHDAAID--KLLNVTRLF 482
Query: 355 SSTDDPFDTFLVVSFISETRILA------MNLEDELEETEIEGFCSQTQTLFCHDAIYNQ 408
++ D ++SF +R+ M EL ET + S+ + + N
Sbjct: 483 AAGD-----IFIISFPGYSRVYQFQTTPNMTTIKELSETSFDK--SKETLSLVYLSERNT 535
Query: 409 LVQVTSGSVRLVSSTSRE---LRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD 465
VQVTS V +V +NE + + A+ANA LL V + +
Sbjct: 536 FVQVTSVGVNVVKGGKGVYIICKNE------FGIQHASANAKLRLLVFSSSRFVSV-VDL 588
Query: 466 GILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLG 525
++E E+SCL I S SQ +G W +V ++ + D
Sbjct: 589 NTRNTRASLEVENEVSCLLI-------SSSQGFVIGEWNSGAVCLYEVQD---------- 631
Query: 526 GEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTD-----------RKKVSLG 574
GEI+ + + C+ S +C L + ++ + G + D ++ +
Sbjct: 632 GEILLKGRIFCSATSCS--MCILSHPYTPRLVVGLLNGYIADISLEAILMGGGARETLIR 689
Query: 575 TQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLA 634
QP+ L S N V + P +I + + ++ +V+ +C + S
Sbjct: 690 MQPVELFNLESHNA--VLCLGEVPLIIILCDTGFQLTGIDFNDVA-VCAIIECSHISSKY 746
Query: 635 I---AKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQ-------- 683
I E L G I D++KL+ + L R+ + F I +N+
Sbjct: 747 IFFSQSENSLAFGNIVDLKKLNSCFVGLKATVTRVKYIAWWNFFVISVRRNEKDQVFLLM 806
Query: 684 ------SCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTA 737
S ++E+ + LL+++ F T L G + D + +GT
Sbjct: 807 GHELANSWVPQNELVSIELLENERCVFFETVVLG----GSNESPDGNEADKDKAILLGTT 862
Query: 738 YVLPEENEPTKGRILVFIVEDGKL-------QLIAEKETKGAVYS---LNAFNGKLLAAI 787
+ P+E P R + + VE GKL + I K+ +GA+ + + G++ I
Sbjct: 863 FAFPDEQLPRSSRFIWYSVEQGKLLSERPQLRQIGSKDVEGALQCCCIVPNYVGRIALGI 922
Query: 788 NQKIQLYKWMLRDDGTRELQSECGHH--GHILALYVQTRGDFIVVGDLMKSISLLIYKHE 845
N I LY W D ++ C +L ++ Q ++V D S +
Sbjct: 923 NGCIALYSWNAADSVFVAEETICIGTIVTRVLPIF-QGDASYMVAFDARHSCFFIQVDTI 981
Query: 846 EGAIEERARDYNANWM--SAVEILDD--DIYLGAENNFNLFTVRK---------NSEGAT 892
+G++E ARD + AV L + DI G ++ +N F + + N+ AT
Sbjct: 982 QGSLEIVARDSEPRGVMDGAVFQLGNIYDICFG-DDYYNFFCLSRGATTLPSASNAISAT 1040
Query: 893 DEERGRLEVVGEYHLGEFVNRFRHGSLV--------MRLPDSDVGQI--PTVIFGTVNGV 942
G+L+ +YHLG+ V + GS + +P + + I P V+FGT +G
Sbjct: 1041 VSS-GKLKTRAQYHLGDMVTAMQLGSFAPCSVTNSAVPIPTTLIPGICGPQVVFGTSHGA 1099
Query: 943 IGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRS--------------------- 981
G I + +E YLFL+ L+ + V+ +GG H +R
Sbjct: 1100 FGTITPVSNETYLFLKALEVAVSSVVPALGGFEHVTYREVLRAGQERGYSRNASFENVNV 1159
Query: 982 -----FNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEI 1014
FN +K ++ GDLIESFL + I
Sbjct: 1160 VSSEVFNKRRKRYLSRCVCSGDLIESFLTFPHSIQQRI 1197
>gi|453087531|gb|EMF15572.1| splicing factor 3B subunit 3 [Mycosphaerella populorum SO2202]
Length = 1223
Score = 120 bits (300), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 176/809 (21%), Positives = 328/809 (40%), Gaps = 96/809 (11%)
Query: 296 LGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGI-----NEQASVELQGIKGM 350
L P++ V +L + Q+ G + +R+G+ + N +V I
Sbjct: 433 LHPLMKTRVDNLTGEDAPQIYGIQGKGNRSQFKTIRHGLDVEVLINNSMGNVPYDNI--- 489
Query: 351 WSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLV 410
W+ + T D +L++S +A ++ D +E+ E F T+ LV
Sbjct: 490 WTFKHRTTDEHHRYLLLSSNYGDLTIACSIGDSVEQIENSNFLENRATVHAEQMGDAVLV 549
Query: 411 QVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILT 469
QV + +R + + NEW +P + VA+AN Q+LL L + +G D IL
Sbjct: 550 QVHARGIRSIYQDGK--LNEWNTPAHRTCVVASANQRQLLLGLSSAELCFFWMGEDEILV 607
Query: 470 EVKH-AQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLP-DLNLITKEHLGGE 527
+++ ++ +I+ + + + ++ A VG D ++R+ S+ D L +
Sbjct: 608 QLEEMPEMSGKITAISVGSTPKGRQQAKYAVVGC-DDCTIRVLSIELDSPLEARSVQALS 666
Query: 528 IIPRSV----LLCAFEGISYLLCALG--DGHLLNFLLNMKTGELTDRKKVSLGTQPITLR 581
+P S+ +L G + + +G G L +++ TGEL D + LGT+ L
Sbjct: 667 AVPTSLEVVEMLDPASGTTVNVVHIGLQSGLYLRAIIDETTGELGDVRTKFLGTKAPRLC 726
Query: 582 TFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPD--SLAIAKEG 639
++ V A S RP V Y+ + Y+ L M S PD L +
Sbjct: 727 PVQVEDEECVLACSTRPWVGYNHPQTGQYTITPLI-AEKMEAARSFISPDLSGLCAIQGS 785
Query: 640 ELTIGTIDDIQ-KLHIRSIPLGEHPRRIC-HQEQSRTFAICSLKNQ-SCAEESEMHFVRL 696
L I + ++ +L S+ L PR + H + + S N S A ++ +
Sbjct: 786 SLLIFQLPSVEGRLSHSSVQLNNTPRGMARHPYYQIWYTVQSDGNTLSKATRDQLRGSPI 845
Query: 697 LDDQTFEFISTY------------------PLD------TFEYG------CSILSCSFSD 726
DD+ + + P++ T E G C S
Sbjct: 846 DDDEEAMALERHLGLPRGNTHWASCIQAIDPINRQAVVSTVEMGDNEAALCCACVAFESR 905
Query: 727 DSNVYYCVGTA-YVLPEENEPTKGRILVF-IVEDG-KLQLIAEKETKGAVYSLNAFNGKL 783
+ V+ VGT ++ P G + +F + EDG KL + + + VY+L F+G+L
Sbjct: 906 NWEVFLAVGTGQHMSPGTGLQAAGYVHIFKLEEDGTKLTFVHKTKFDQPVYALLPFHGRL 965
Query: 784 LAAINQKIQLYKWMLRDDGTRELQSEC-GHHGHILALYVQTRGDFIVVGDLMKSISLLIY 842
+ ++ +Y D G + L + G + +++ G I+ GD+ + ++ ++Y
Sbjct: 966 ALGVGNELFIY-----DIGQKALLRKARGQATPNQIVSLESHGQRIICGDVSEGVTYMVY 1020
Query: 843 KHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDEERG-- 897
K + D W + ++D + G + NL+ VR + S+ A +E G
Sbjct: 1021 KPGYNRLIPFVDDVVQRWTTGTTMIDYETTAGGDKFGNLWVVRCPEQPSQEADEEGAGGF 1080
Query: 898 -------------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIG 944
RL++ Y+ + + +LV G + + + G +G
Sbjct: 1081 IMNERSYLGGAPYRLDLRAHYYCQDIPMSLQRTALV-------AGGQEVLFWSGLQGTLG 1133
Query: 945 VIAS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESF 1003
++ + E F L+ LR + G +H +RS+ V K +DGDL E F
Sbjct: 1134 MLVPFVTREDVEFFTSLEQQLRIEDPPLAGRDHLMYRSY-----YVPVKGVIDGDLCERF 1188
Query: 1004 LDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
+ LS +++ ++ SV+E+ K+V+E+
Sbjct: 1189 MALSYDSKQKVAAEVDRSVKEIEKKVQEM 1217
>gi|11359310|pir||T51288 silencing protein rik1 - fission yeast (Schizosaccharomyces pombe)
gi|4580723|gb|AAD24488.1|AF136156_1 silencing protein Rik1 [Schizosaccharomyces pombe]
Length = 916
Score = 119 bits (299), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 209/944 (22%), Positives = 391/944 (41%), Gaps = 117/944 (12%)
Query: 111 VLDIKFLYGCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPV 170
+ D+ L + PT+VVL+ D D +V + + L K +L G L V
Sbjct: 44 IYDMCILNDSSTPTLVVLHSDGLDC-YVTAFLLDLSSK----------SLGKGIRLFERV 92
Query: 171 PPPLCGV-----LIIGEETIVYCSANA-FKAIPIRPSITKAYGRVDADGSRYLLGDHAGL 224
P + L++ E ++C F I + + + A Y++ D G
Sbjct: 93 KPSMIMPFGKRGLLVFESLFIHCMYRGNFVTINGPCTTYMHWTPLKAQKMHYIVCDTNGY 152
Query: 225 LHLLVITHEKEKVTGLKIELLG--ETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDA 282
L + + + L +E L ++++ + +++IGS +S+LI L+ D
Sbjct: 153 LFGVYSSILGKNKWSLVMERLPIPPFDFITSLNSIHEGLLFIGSKNSESKLINLSTLKD- 211
Query: 283 KGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASV 342
++ NLGPI D V L+ + + C+G ++ SL ++ + ++
Sbjct: 212 -------VDSIPNLGPIHDLLV--LKNDIEKSFLVCAGTPRNASLIYFQHALKLDILGQT 262
Query: 343 ELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCH 402
++ GI L S + L + F SET +A N++++ + E++ S +
Sbjct: 263 KISGILRAMVLPSYPEHK----LFLGFPSET--VAFNIKEDFQ-LELDPSLSTKERTIAL 315
Query: 403 DAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLE 462
+ VQVTS + + S R + N A ++ G ++ +
Sbjct: 316 SGTNGENVQVTSTFLCIYDSAKRSRLVYIEK----ITNAACYQEYSAIVINGTALAIFKK 371
Query: 463 IGDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLIT-K 521
TEV E EISCLD + + QI VG W+ V I + D + I+
Sbjct: 372 D-----TEVARKVFESEISCLDFS------AQFQIG-VGFWSK-QVMILTFSDNSSISCA 418
Query: 522 EHLGGEIIPRSVLLCAFEGI----SYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQP 577
+PR+++L EG+ + LL + G G +++L ++ K GT P
Sbjct: 419 FQTNVPSLPRNIIL---EGVGVDRNLLLVSSGSGEFKSYVLFKNNLVFSETKH--FGTTP 473
Query: 578 ITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAK 637
++ R F+ T++ +D P ++Y N L Y +++ + +C F + D L
Sbjct: 474 VSFRRFTMNIGTYIICNNDCPHMVYGFNGALCYMPLSMPQSYDVCQFRDNSGKDFLISVS 533
Query: 638 EGELTIGTIDDIQKLHIRSIPLGEHP-RRICHQEQSRTFAICSLKN-----QSCAEESEM 691
G L ++ + +L R + L P + I Q + + +L+N +S E +
Sbjct: 534 LGGLKFLQLNPLPELTPRKVLLEHVPLQAIIFQNK---LLLRTLENRYEDYESYKENYHL 590
Query: 692 HFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRI 751
V DD +F S + E I S VGT+ + ++ P GR+
Sbjct: 591 ELVDSYDDNSFRVFSFTENERCEKVLKINESSL--------LVGTSIIEQDKLVPVNGRL 642
Query: 752 LV--FIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSE 809
++ F E L++++ AV L +N + + A Q++ + K + + S
Sbjct: 643 ILLEFEKELQSLKVVSSMVLSAAVIDLGVYNDRYIVAFGQQVAIVKL---TEERLMIDSR 699
Query: 810 CGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDD 869
+L L V+ G+ I + D + +++ + ++ + R + N + A + +
Sbjct: 700 ISLGSIVLQLIVE--GNEIAIADSIGRFTIMYFDGQKFIVVARYL-FGENIVKAA-LYEG 755
Query: 870 DIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVG 929
+Y+ A N+ L +R N + +R E V YHL + V++F++ P ++
Sbjct: 756 TVYIIATNSGLLKLLRYNKDAKNFNDRFICESV--YHLHDKVSKFQN------FPITNTN 807
Query: 930 QI--PTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKV-IKGVGGLNHEQWRSFNNEK 986
P ++F T G IG I SL ++ L LE+L +RK+ + +++E E
Sbjct: 808 SFLEPKMLFATEIGAIGSIVSLKDKE-LELEELTRKIRKLKFSYLSSMDYESI-----EA 861
Query: 987 KTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVE 1030
+ F+DGDL+ +D+ R E+ + LC+ VE
Sbjct: 862 DLISPVPFIDGDLV---IDVKRWASSELFR--------LCRSVE 894
>gi|297739782|emb|CBI29964.3| unnamed protein product [Vitis vinifera]
Length = 1363
Score = 119 bits (297), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 125/537 (23%), Positives = 226/537 (42%), Gaps = 80/537 (14%)
Query: 295 NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINE--QASVELQGIKGMWS 352
N+ PI+D VVD + Q+ C G +GSLRI+R+GI + + + + QGI G W+
Sbjct: 462 NIAPILDMSVVDCHDEEHDQMFACCGVTPEGSLRIIRSGISVEKLLRTAPIYQGITGTWT 521
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
++ D + +FLV+SF+ ETR+L++ L + T+ GF TL C LVQ+
Sbjct: 522 VKMKVIDSYHSFLVLSFVEETRVLSVGLSFT-DVTDSVGFQPDVSTLACGVVDDGLLVQI 580
Query: 413 TSGSVRLVSSTS----------RELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLE 462
V+L T+ + W P S+++ + +++AT +++
Sbjct: 581 HKNGVKLCLPTTVAHPEGIPLASPICTSW-FPENISISLGAVGYNLIVVATSSPCFLFIL 639
Query: 463 IGDGI------LTEVKHAQLEYEISCLDI--NPIGENPSY--------SQIAAVGMWTDI 506
+ + E++H +L+ E+SC+ I + PS S AA+ + +I
Sbjct: 640 GVRSVSAYQYEIYEMQHVRLQNEVSCISIPHKHFDKKPSTFLSNLVDNSSAAALLIGVNI 699
Query: 507 SVRIF-------SLPDLNLITKEHL----GGEIIPRSVLLCAFEGIS------------Y 543
RIF S+ L+ + E L G I + L A G Y
Sbjct: 700 G-RIFVIGTHKPSVEILSFLPDEGLRILASGAISLTNTLGTAVSGCVPQDARLVLVDRFY 758
Query: 544 LLCALGDGHLLNFLLNMKTGELTDRKKVS-------------------LGTQPITLRTFS 584
+L L +G LL F L + + +G P+ L S
Sbjct: 759 VLSGLRNGMLLRFELPAASMVFSSELSSHSPSTNINSPVNLQLIAIRRIGITPVFLVPLS 818
Query: 585 SKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIG 644
+ A SDRP ++ S+ L Y++++ + +H+ P S P + E L +
Sbjct: 819 DSLEADIIALSDRPWLLQSARHSLSYTSISFQPSTHVTPVCSMECPMGILFVAENSLHLV 878
Query: 645 TIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEF 704
+ ++L+++ LG PR++ + +SR + ++++ + +D +
Sbjct: 879 EMVHSKRLNVQKFYLGGTPRKVLYHSESRLLLV---MRTELSQDTYSSDICCVDPLSGSV 935
Query: 705 ISTYPLDTFEYGCSILSCSFSDDS----NVYYCVGTAYVLPEENEPTKGRILVFIVE 757
+S++ L+ E G S+ ++ G A + E E TKGR++V +E
Sbjct: 936 LSSFKLELGETGKSMELVRVVNEQVLVIGTSLSSGPAMMPSGEAESTKGRLIVLCLE 992
Score = 50.4 bits (119), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 44/203 (21%), Positives = 86/203 (42%), Gaps = 17/203 (8%)
Query: 828 IVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAEN--NFNLFTVR 885
I VGD + Y + +E+ D ++ ++D D + ++ + + +
Sbjct: 1122 IAVGDCRDGVVFYSYHEDSRKLEQLYCDPEQRLVADCILMDVDTAVVSDRKGSIAVLSCS 1181
Query: 886 KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV-----GQIPTVIF---- 936
+ E E L + Y++GE + GS +LP DV G + F
Sbjct: 1182 NHLEDNASPE-CNLTLNCSYYMGEIAMSIKKGSFSYKLPADDVLKGCDGSNTIIDFSENS 1240
Query: 937 ---GTVNGVIGVIASLPHEQYLFLEKLQTNL--RKVIKGVGGLNHEQWRSFNNEKKTVDA 991
GT+ G I ++ + E++ LE +Q L ++ + G +H ++RS N +
Sbjct: 1241 IMAGTLLGSIIMLIPISREEHELLEAVQARLAVHQLTAPILGNDHNEFRSRENSVRKAGV 1300
Query: 992 KNFLDGDLIESFLDLSRTRMDEI 1014
LDGD++ FL+L+ + + +
Sbjct: 1301 SKILDGDMLAQFLELTSMQQEAV 1323
>gi|226480826|emb|CAX73510.1| glyceraldehyde 3-phosphate dehydrogenase [Schistosoma japonicum]
Length = 332
Score = 118 bits (295), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 79/249 (31%), Positives = 125/249 (50%), Gaps = 21/249 (8%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW--DAESSELITRAMGDVSDRIG 60
DVPI +I LFR D LF+ T + +++ + +S E +T A G V DR
Sbjct: 54 DVPINAKIVAASLFRRKDRETDSLFLLTHKAGVAIIECVRNNDSVEFVTVASGSVEDRSA 113
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIP------------FDNKGQLKEAFNIRLEE 108
R D G +IDP I + LY GL K+I D +++R+EE
Sbjct: 114 RIIDQGFDVLIDPGANYIVVRLYHGLLKIILLQCIGDKIGTDFLDTNQWTVNTYSVRIEE 173
Query: 109 LQVLDIKFLYGCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLI 168
++D+ F+YG + PT ++Y+D H+KTYE+ ++ + ++++ + LLI
Sbjct: 174 GNIVDMAFIYGYSLPTFAMIYEDEL-VLHMKTYEIYGREPALRNVQLTLDSIEPDSKLLI 232
Query: 169 PVPPPLCGVLIIGEETIVYCSANA---FKAIP-IRPSITKAYGRVDADGSRYLLGDHAGL 224
PVP P GV+++G+ I Y + + + IP + S Y VDA RYLLGD AG
Sbjct: 233 PVPKPYGGVILVGDNIICYHTKDGPHISQYIPQAKASQVLCYAAVDA--QRYLLGDMAGR 290
Query: 225 LHLLVITHE 233
L+++ + E
Sbjct: 291 LYMVHLLSE 299
>gi|124806507|ref|XP_001350742.1| splicing factor 3b, subunit 3, 130kD, putative [Plasmodium
falciparum 3D7]
gi|23496869|gb|AAN36422.1|AE014849_41 splicing factor 3b, subunit 3, 130kD, putative [Plasmodium
falciparum 3D7]
Length = 1329
Score = 116 bits (291), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 123/535 (22%), Positives = 219/535 (40%), Gaps = 57/535 (10%)
Query: 160 LDNGADLLIPVP---PPLCGVLIIGEETIVYCSAN---AFKAIPIRPSI--TKAYGRVDA 211
+D A LLIP+P GVLI E +VY + + A P R I K +
Sbjct: 263 IDITAHLLIPLPGGQQGPSGVLICCENFLVYKKVDHEDIYCAYPRRLEIGQDKNISIICW 322
Query: 212 DGSRY------LLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIG 265
R L+ G L+ + + HE V + + I ++IS L + +++
Sbjct: 323 TMHRIKKFFFILIQSEYGDLYKIEVDHEDGIVKEIVCKYFDTVPIGNSISVLKSGSLFVA 382
Query: 266 SSYGD-------------SQLIKLNLQPDAKGSYVE----------VLERYVNLGPIVDF 302
+ +G+ Q + + P K + + ++++ +L PI+D
Sbjct: 383 AEFGNHYFYQFSGIGDDNKQFMCTSNHPLGKNAIIAFKTNKLKNLYLVDQIYSLSPILDM 442
Query: 303 CVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTDDPF 361
++D + Q+ T G SLRI+++G+ I E A EL G K +W+++ +
Sbjct: 443 KIIDAKNTHTPQIYTLCGRGPRSSLRILQHGLSIEELADNELPGKPKYIWTIKKDNLSEY 502
Query: 362 DTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVS 421
D ++VVSF T IL + + +EE + TL + N +QV +R ++
Sbjct: 503 DGYIVVSFEGNTLILEIG--ESVEEVSDTLLLNNVTTLHINILYDNSFIQVYDTGIRHIN 560
Query: 422 STSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDG-ILTEVKHAQLEYEI 480
++ EW +P + A++N+SQ++++ GG L+Y EI + L E+ L E+
Sbjct: 561 G---KVVQEWVAPKNKQIKAASSNSSQIVISLSGGELIYFEIDESHTLVEIFRKNLNVEV 617
Query: 481 SCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPD---LNLITKEHLGGEIIPRSVLLCA 537
CL I I N + AVG ++ VR+ S+ ++ L P+ + +
Sbjct: 618 LCLSIQQIPPNRVRANFLAVGCLDNV-VRLLSIEKDKYFKQLSTHLLPNNSSPQDICISE 676
Query: 538 F---------EGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNT 588
I +L L G LL +++ G L++ LG + I + +
Sbjct: 677 MNDNGNTMKERNIIFLNIGLNTGVLLRSIIDPVAGTLSNHYSKYLGAKSIKICPVNVNKN 736
Query: 589 THVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTI 643
+ ++ + Y K LYS +N + + F S D L I
Sbjct: 737 PALLVLCEKTYLCYMHQGKFLYSPLNYDMLEYASSFYSPQCSDGYVAISSNSLRI 791
Score = 72.8 bits (177), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 80/363 (22%), Positives = 157/363 (43%), Gaps = 45/363 (12%)
Query: 694 VRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC--VGTAYVLPEENEP-TKGR 750
+++++ + + LD E S+ +C +C VGT L + + T
Sbjct: 983 IKIINPVNLQILDKISLDMEEAALSVCACELE----ALHCLIVGTTTNLSLKTKSLTSAS 1038
Query: 751 ILVFIVE-DGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSE 809
+ V+ + KL L+ + Y ++NGKL+A+I K+++Y G ++L +
Sbjct: 1039 LRVYTYDIQYKLNLLHITPIEEQPYCFCSYNGKLIASIGNKLRIYAL-----GKKKLLKK 1093
Query: 810 CGHHGHILALY-VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILD 868
C + A+ ++ G+ I D+ +S+ + Y + + + D W++ EILD
Sbjct: 1094 CEYKDIPEAIVSIKISGNRIFACDIRESVLIFFYDPNQNTLRLISDDIIPRWITCSEILD 1153
Query: 869 DDIYLGAENNFNLFTVRKNSEGATDE------------------ERGRLEVVGEYHLGEF 910
+ A+ ++F +R E DE + +LE + +H+GE
Sbjct: 1154 HHTIMAADKFDSVFILRVPEEAKQDEYGITNKCWYGGEIMNSSTKNRKLEHMMSFHIGEI 1213
Query: 911 VNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFL-EKLQTNLRKVIK 969
V + L P S +I+ T+ G IG +++ L L + L+ LR
Sbjct: 1214 VTSMQKVRLS---PTSS----ECIIYSTIMGTIGAFIPYDNKEELELTQHLEIILRTEKP 1266
Query: 970 GVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRV 1029
+ G H +RS+ + +N +DGDL E F LS +I+ + + E++ +++
Sbjct: 1267 PLCGREHIFFRSYYHP-----VQNVVDGDLCEQFSSLSYDAQKKIANDLERTPEDILRKL 1321
Query: 1030 EEL 1032
E++
Sbjct: 1322 EDI 1324
>gi|340508225|gb|EGR33979.1| splicing factor subunit 130kda, putative [Ichthyophthirius
multifiliis]
Length = 983
Score = 115 bits (289), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 159/692 (22%), Positives = 283/692 (40%), Gaps = 111/692 (16%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
++G I T+ FR G +D++ I ++ + +L ++ + E R
Sbjct: 59 VFGLIRTISTFRLPGFKKDYIIIGSDSGRIVILNYNKKKGEFEKLHQETYGKTGCRRIVP 118
Query: 66 GQIGIIDPD---CRLIGLHLYDGLFKVIPFDNKGQLKE---------------AFNIRLE 107
GQ +DP C + L +F + ++K + ++ E
Sbjct: 119 GQYIAVDPKGRVCMIAALEKQKFVFILNRENDKLTISSPLEAHKSHTICYDICGIDVGYE 178
Query: 108 ELQVLDIKFLYGCAKPTIVVLYQDNKDA--------RHVKTYEVALKDKDFVEGPWSQNN 159
Q I+ YG +D KD+ + + YE+ V+ S N
Sbjct: 179 NAQFACIECDYG---------EKDQKDSPMNNGIIQKQLTIYELDFGLNHVVKK--SSEN 227
Query: 160 LDNGADLLIPVP---PPLCGVLIIGEETIVYCSANAFK--AIPIRPSI-TKAYGRVDADG 213
+ A LLI +P GV+I+ E+ ++Y + P R S T + ++ G
Sbjct: 228 VPESAHLLISIPGGQEGPGGVVIVCEDFLIYKGTKGERICQFPKRFSADTNSKIMINTFG 287
Query: 214 SR-------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGS 266
+LL G L L I K+ V + ++ + +I + N ++
Sbjct: 288 FHKQKEFFFFLLQTELGDLFKLEIQSTKDDVHSISLQYFDSIPPSISICVMRNGYLFAAC 347
Query: 267 SYGDSQLIKL---------NLQPDA-------------KGSYVEVLERYVNLGPIVDFCV 304
G+ L + +++ D+ K ++ +++ NL I D V
Sbjct: 348 EKGNHLLYRFKSIGEKEANSVRTDSTQDQKMPVLFIPRKLKNLQQVDQLENLSAISDIKV 407
Query: 305 VDLERQGQGQVVT-CSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSLRSSTDDPFD 362
DL +G Q+ T CS Y+ SLRI+R+G+ +NE A+ L G+ G+W+++S D+ F
Sbjct: 408 SDLTGEGHPQIYTLCSAGYRS-SLRILRHGLQVNEVAASRLPGVPTGIWTIKSRYDENFS 466
Query: 363 TFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSS 422
++++SF +T L +++ D +++ G QT+ + N ++QV R +
Sbjct: 467 KYIILSFSKQT--LVLSISDRVQQVTDSGIDLNKQTIHANLLEDNAIIQVMPDGFRHIRV 524
Query: 423 TSR--ELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTEVKHAQLEYE 479
R +L+ E K + A +N Q+ ++ GG ++Y E+ G L EV L+ E
Sbjct: 525 DKRVQQLKTEGK------IVKAVSNQKQIAISLQGGDIIYFELDYAGQLIEVAKTNLQEE 578
Query: 480 ISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPD---LNLITKEHLGGEIIPRSVLLC 536
I C+DI + S+ +VG +D SVRI SL + L I+ + L G I +V L
Sbjct: 579 IECMDIGEVPFGRQKSKFLSVG-CSDHSVRILSLENDTCLQKISIQALPG--IAENVSLI 635
Query: 537 AFEGIS-----------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSS 585
+ S YL L +G LL ++ TG L+D + L P+ +
Sbjct: 636 EMKRGSGLEQEAEQYQLYLYVGLKNGILLRASVDQITGSLSDTRTRVLSGAPVRTCKYQV 695
Query: 586 KNTTHVFA--------ASDRPTVIYSSNKKLL 609
+ + A D P +++ KLL
Sbjct: 696 QGQPALLALKLIHKTEVDDIPGSLHAHKGKLL 727
Score = 89.0 bits (219), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 80/295 (27%), Positives = 135/295 (45%), Gaps = 43/295 (14%)
Query: 761 LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLY----KWMLRDDGTRELQSECGHHGHI 816
L+LI + E SL+A GKLLA ++ Y K +L+ + LQS
Sbjct: 704 LKLIHKTEVDDIPGSLHAHKGKLLAGCGTFLRYYDIGKKKLLKKSEVKGLQSPING---- 759
Query: 817 LALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAE 876
+QT GD I V + ++ ++ +K +E E D WMSA ++LD Y+G +
Sbjct: 760 ----IQTFGDRIFVSMVGDAVHIMKHKQKEQTFYEVCDDVLPRWMSAFQVLDYSTYIGGD 815
Query: 877 NNFNLFTVR--KNSEGATDE---------ERGRL-------EVVGEYHLGEFVNRFRHGS 918
N+F R +N+E DE E G L E + ++++GE V F+
Sbjct: 816 KFENMFVCRIPQNAEEEMDENPMSYKLRWESGYLNGAPYKTEQICQFYVGEVVTTFQKAC 875
Query: 919 LVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYL-FLEKLQTNLRKVIKGVGGLNHE 977
LV G +I+GT G IG ++ + F L+ LR + + G +H
Sbjct: 876 LV------STGN-ECIIYGTSMGSIGAFYPFQTKEDIDFFVHLEMYLRIDVLPLAGRDHV 928
Query: 978 QWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
+R+F K+V +DGDL E F+ + + + +++ M + E+ K+++E+
Sbjct: 929 MFRAFYGPVKSV-----IDGDLCEQFMRIGQGKQKVLAEEMERTPAEVHKKLDEI 978
>gi|335310432|ref|XP_003362030.1| PREDICTED: splicing factor 3B subunit 3-like [Sus scrofa]
Length = 701
Score = 115 bits (289), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 135/579 (23%), Positives = 244/579 (42%), Gaps = 79/579 (13%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+L V ++G I +L FR G +D++ + ++ + +L++ S + + + + G
Sbjct: 53 LLTVEVFGVIRSLMAFRLTGGTKDYIVVGSDSGRIVILEYQP-SKNMFEKIHQETFGKSG 111
Query: 61 -RPTDNGQIGIIDPDCRLIGLHLYDG--LFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
R GQ +DP R + + + L ++ D +L + + + L +
Sbjct: 112 CRRIVPGQFLAVDPKGRAVMISAIEKQKLVYILNRDAAARLTISSPLEAHKANTLVYHVV 171
Query: 118 ---YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNNLDNGAD------ 165
G P L D ++A + T E A + F E N++
Sbjct: 172 GVDVGFENPMFACLEMDYEEADNDPTGEAAANTQQTLTFYELDLGLNHVVRKYSQIIAEF 231
Query: 166 -LLIPVP-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSR---- 215
L+ VP P GVLI E I Y N IR I + +D D R
Sbjct: 232 HFLLTVPGGSDGP--SGVLICSENYITY--KNFGDQPDIRCPIPRRRNDLD-DPERGMIF 286
Query: 216 -------------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVV 262
+L G + + + +++ VT ++++ +A+ + L +
Sbjct: 287 VCSATHKTKSMFFFLAQTEQGDIFKITLETDEDMVTEIRLKYFDTVPVAAAMCVLKTGFL 346
Query: 263 YIGSSYGDSQLIKLN------------------------LQPDAKGSYVEVLERYVNLGP 298
++ S +G+ L ++ QP + V +++ +L P
Sbjct: 347 FVASEFGNHYLYQIAHLGDDDEEPEFSSAMPLEEGDTFFFQPRPLKNLV-LVDELDSLSP 405
Query: 299 IVDFC-VVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSS 356
I+ FC + DL + Q+ G SLR++R+G+ ++E A EL G +W++R
Sbjct: 406 IL-FCQIADLANEDTPQLYVACGRGPRSSLRVLRHGLEVSEMAVSELPGNPNAVWTVRRH 464
Query: 357 TDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGS 416
+D FD +++VSF++ T +L++ + +EE GF T TL C + LVQV
Sbjct: 465 IEDEFDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLSCSLLGDDALVQVYPDG 522
Query: 417 VRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKHA 474
+R + + R NEWK+P ++ N QV++A GG LVY E+ G L E +
Sbjct: 523 IRHIRADKR--VNEWKTPGKKTIVKCAVNQRQVVIALTGGELVYFEMDPSGQLNEYTERK 580
Query: 475 QLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL 513
++ ++ C+ + + S+ AVG+ D +VRI SL
Sbjct: 581 EMSADVVCMSLANVPPGEQRSRFLAVGL-VDNTVRIISL 618
>gi|225441567|ref|XP_002276675.1| PREDICTED: pre-mRNA-splicing factor rse1-like [Vitis vinifera]
Length = 1387
Score = 115 bits (287), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 126/551 (22%), Positives = 229/551 (41%), Gaps = 94/551 (17%)
Query: 295 NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINE--QASVELQGIKGMWS 352
N+ PI+D VVD + Q+ C G +GSLRI+R+GI + + + + QGI G W+
Sbjct: 462 NIAPILDMSVVDCHDEEHDQMFACCGVTPEGSLRIIRSGISVEKLLRTAPIYQGITGTWT 521
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
++ D + +FLV+SF+ ETR+L++ L + T+ GF TL C LVQ+
Sbjct: 522 VKMKVIDSYHSFLVLSFVEETRVLSVGLSFT-DVTDSVGFQPDVSTLACGVVDDGLLVQI 580
Query: 413 TSGSVRLVSSTS----------RELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLE 462
V+L T+ + W P S+++ + +++AT +++
Sbjct: 581 HKNGVKLCLPTTVAHPEGIPLASPICTSW-FPENISISLGAVGYNLIVVATSSPCFLFIL 639
Query: 463 IGDGI------LTEVKHAQLEYEISCLDI--NPIGENPSY--------SQIAAVGMWTDI 506
+ + E++H +L+ E+SC+ I + PS S AA+ + +I
Sbjct: 640 GVRSVSAYQYEIYEMQHVRLQNEVSCISIPHKHFDKKPSTFLSNLVDNSSAAALLIGVNI 699
Query: 507 SVRIF-------SLPDLNLITKEHL----GGEIIPRSVLLCAFEGIS------------Y 543
RIF S+ L+ + E L G I + L A G Y
Sbjct: 700 G-RIFVIGTHKPSVEILSFLPDEGLRILASGAISLTNTLGTAVSGCVPQDARLVLVDRFY 758
Query: 544 LLCALGDGHLLNFLLNMKT---------------------GELTDRKKVS---------- 572
+L L +G LL F L + + K ++
Sbjct: 759 VLSGLRNGMLLRFELPAASMVFSSELSSHSPSVSSCSVNDADTNLSKNINSPVNLQLIAI 818
Query: 573 --LGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFP 630
+G P+ L S + A SDRP ++ S+ L Y++++ + +H+ P S P
Sbjct: 819 RRIGITPVFLVPLSDSLEADIIALSDRPWLLQSARHSLSYTSISFQPSTHVTPVCSMECP 878
Query: 631 DSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESE 690
+ E L + + ++L+++ LG PR++ + +SR + ++++
Sbjct: 879 MGILFVAENSLHLVEMVHSKRLNVQKFYLGGTPRKVLYHSESRLLLV---MRTELSQDTY 935
Query: 691 MHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDS----NVYYCVGTAYVLPEENEP 746
+ +D + +S++ L+ E G S+ ++ G A + E E
Sbjct: 936 SSDICCVDPLSGSVLSSFKLELGETGKSMELVRVVNEQVLVIGTSLSSGPAMMPSGEAES 995
Query: 747 TKGRILVFIVE 757
TKGR++V +E
Sbjct: 996 TKGRLIVLCLE 1006
Score = 44.7 bits (104), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 32/124 (25%), Positives = 58/124 (46%), Gaps = 17/124 (13%)
Query: 905 YHLGEFVNRFRHGSLVMRLPDSDV-----GQIPTVIF-------GTVNGVIGVIASLPHE 952
Y++GE + GS +LP DV G + F GT+ G I ++ + E
Sbjct: 1227 YYMGEIAMSIKKGSFSYKLPADDVLKGCDGSNTIIDFSENSIMAGTLLGSIIMLIPISRE 1286
Query: 953 QYLFLEKLQTNL--RKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTR 1010
++ LE +Q L ++ + G +H ++RS N + LDGD++ FL+L+ +
Sbjct: 1287 EHELLEAVQARLAVHQLTAPILGNDHNEFRSREN---SAGVSKILDGDMLAQFLELTSMQ 1343
Query: 1011 MDEI 1014
+ +
Sbjct: 1344 QEAV 1347
>gi|149038189|gb|EDL92549.1| splicing factor 3b, subunit 3 (predicted) [Rattus norvegicus]
Length = 650
Score = 114 bits (286), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 149/662 (22%), Positives = 275/662 (41%), Gaps = 117/662 (17%)
Query: 466 GILTE-VKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHL 524
G L E + ++ ++ C+ + + S+ AVG+ D +VRI SL + + + L
Sbjct: 5 GQLNEYTERKEMSADVVCMSLANVPPGEQRSRFLAVGL-VDNTVRIISLDPSDCL--QPL 61
Query: 525 GGEIIP-RSVLLCAFE----------------GISYLLCALGDGHLLNFLLNMKTGELTD 567
+ +P + LC E G YL L +G LL +L+ TG+L+D
Sbjct: 62 SMQALPAQPESLCIVEMGGTEKQDELGERGSIGFLYLNIGLQNGVLLRTVLDPVTGDLSD 121
Query: 568 RKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSA 627
+ LG++P+ L + V A S R + YS + + ++ + + F S
Sbjct: 122 TRTRYLGSRPVKLFRVRMQGQEAVLAMSSRSWLSYSYQSRFHLTPLSYETLEFASGFASE 181
Query: 628 AFPDSLAIAKEGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQEQSR-----------T 674
P+ + L I ++ + + + + PL PR+ + H E + T
Sbjct: 182 QCPEGIVAISTNTLRILALEKLGAVFNQVAFPLQYTPRKFVIHPESNNLIIIETDHNAYT 241
Query: 675 FAICSLKNQSCAEE-------------SEM-------------------------HFVRL 696
A + + Q AEE +EM +R+
Sbjct: 242 EATKAQRKQQMAEEMVEAAGEDERELAAEMAAAFLNENLPESIFGAPKAGNGQWASVIRV 301
Query: 697 LDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC-VGTAYVLPEENEPTKGRILVF- 754
++ + L+ E S+ C FS+ +Y VG A L G +
Sbjct: 302 MNPIQGNTLDLVQLEQNEAAFSVAVCRFSNTGEDWYVLVGVAKDLILSPRSVAGGFVYTY 361
Query: 755 -IVEDG-KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGH 812
+V +G KL+ + + + ++ F G++L + + +++Y D G ++L +C
Sbjct: 362 KLVNNGEKLEFLHKTPVEEVPAAIAPFQGRVLIGVGKLLRVY-----DLGKKKLLRKC-E 415
Query: 813 HGHILALYV---QTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDD 869
+ HI A Y+ QT G ++V D+ +S + YK E + A D W++ +LD
Sbjct: 416 NKHI-ANYISGIQTIGHRVIVSDVQESFIWVRYKRNENQLIIFADDTYPRWVTTASLLDY 474
Query: 870 DIYLGAENNFNLFTVR--KNSEGATDE---------ERGRL-------EVVGEYHLGEFV 911
D GA+ N+ VR N+ DE +RG L EV+ YH+GE V
Sbjct: 475 DTVAGADKFGNICVVRLPPNTNDEVDEDPTGNKALWDRGLLNGASQKAEVIMNYHVGETV 534
Query: 912 NRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKG 970
+ +L+ G ++++ T++G IG++ HE + F + ++ +LR
Sbjct: 535 LSLQKTTLI-------PGGSESLVYTTLSGGIGILVPFTSHEDHDFFQHVEMHLRSEHPP 587
Query: 971 VGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVE 1030
+ G +H +RS+ KN +DGDL E F + + +S+ ++ + E+ K++E
Sbjct: 588 LCGRDHLSFRSY-----YFPVKNVIDGDLCEQFNSMEPNKQKNVSEELDRTPPEVSKKLE 642
Query: 1031 EL 1032
++
Sbjct: 643 DI 644
>gi|414882040|tpg|DAA59171.1| TPA: hypothetical protein ZEAMMB73_269016 [Zea mays]
Length = 690
Score = 114 bits (285), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 115/447 (25%), Positives = 196/447 (43%), Gaps = 23/447 (5%)
Query: 278 LQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGIN 337
QP A + + + E +L P++D V +L + Q+ T G +LRI+R G+ I+
Sbjct: 248 FQPRALKNLMRIDE-IESLMPVMDMRVANLFDEETPQLFTACGRGPRSTLRILRPGLAIS 306
Query: 338 EQASVELQGIK-GMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQT 396
E A L +W+++ + +D FD ++VVSF + T +L++ + +EE F T
Sbjct: 307 EMARSMLPAEPIAVWTVKKNINDMFDAYIVVSFTNVTLVLSIG--ETIEEVSDSQFLDTT 364
Query: 397 QTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGG 456
+L + L+QV +R + R NEW++P ++ +N QV++A GG
Sbjct: 365 HSLAVTLLGEDSLMQVHPNGIRHIREDGRV--NEWRTPGKKTITKVGSNRLQVVIALSGG 422
Query: 457 HLVYLEIG-DGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-P 514
L+Y E+ G L EV+ + +++CL I P+ E S+ VG + D ++ I SL P
Sbjct: 423 ELIYFEMDMTGQLMEVEKQDMSGDVACLAIAPVPEGRQRSRFLVVGSY-DNTIGILSLDP 481
Query: 515 DLNLITKEHLGGEIIPRSVLLCAFE-------GISY-----LLCALGDGHLLNFLLNMKT 562
D L P S+L + G Y L L +G L ++M T
Sbjct: 482 DDCLQPLSVQSVSSAPESLLFLEVQASVGGEDGADYPANLFLNAGLQNGVLFRTNVDMVT 541
Query: 563 GELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMC 622
G+L+D + LG +P L + S RP + Y L + ++ +
Sbjct: 542 GQLSDTRSRFLGLRPPKLFPCIVSHRQAKLCLSSRPWLGYIHQGHFLLTPLSCDTLESAS 601
Query: 623 PFNSAAFPDSLAIAKEGELTIGTIDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAIC-SL 680
F+S + + L I T++ + + + +IPL PR+ + + + S
Sbjct: 602 SFSSDQCSEGVVAVAGDALRIFTVERLGETFNETTIPLRYTPRKFVILPKKKYIPVIESD 661
Query: 681 KNQSCAEESEMHFVRLLDDQTFEFIST 707
K AEE E LD F+ +
Sbjct: 662 KGAFSAEEREAAKKECLDGCLRAFVPS 688
>gi|119572189|gb|EAW51804.1| splicing factor 3b, subunit 3, 130kDa, isoform CRA_b [Homo sapiens]
Length = 635
Score = 114 bits (285), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 146/651 (22%), Positives = 270/651 (41%), Gaps = 116/651 (17%)
Query: 476 LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIP-RSVL 534
+ ++ C+ + + S+ AVG+ D +VRI SL + + + L + +P +
Sbjct: 1 MSADVVCMSLANVPPGEQRSRFLAVGL-VDNTVRIISLDPSDCL--QPLSMQALPAQPES 57
Query: 535 LCAFE----------------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPI 578
LC E G YL L +G LL +L+ TG+L+D + LG++P+
Sbjct: 58 LCIVEMGGTEKQDELGERGSIGFLYLNIGLQNGVLLRTVLDPVTGDLSDTRTRYLGSRPV 117
Query: 579 TLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKE 638
L + V A S R + YS + + ++ + + F S P+ +
Sbjct: 118 KLFRVRMQGQEAVLAMSSRSWLSYSYQSRFHLTPLSYETLEFASGFASEQCPEGIVAIST 177
Query: 639 GELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQEQSR-----------TFAICSLKNQSC 685
L I ++ + + + + PL PR+ + H E + T A + + Q
Sbjct: 178 NTLRILALEKLGAVFNQVAFPLQYTPRKFVIHPESNNLIIIETDHNAYTEATKAQRKQQM 237
Query: 686 AEE-------------SEM-------------------------HFVRLLDDQTFEFIST 707
AEE +EM +R+++ +
Sbjct: 238 AEEMVEAAGEDERELAAEMAAAFLNENLPESIFGAPKAGNGQWASVIRVMNPIQGNTLDL 297
Query: 708 YPLDTFEYGCSILSCSFSDDSNVYYC-VGTAYVLPEENEPTKGRILVF--IVEDG-KLQL 763
L+ E S+ C FS+ +Y VG A L G + +V +G KL+
Sbjct: 298 VQLEQNEAAFSVAVCRFSNTGEDWYVLVGVAKDLILNPRSVAGGFVYTYKLVNNGEKLEF 357
Query: 764 IAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYV-- 821
+ + + ++ F G++L + + +++Y D G ++L +C + HI A Y+
Sbjct: 358 LHKTPVEEVPAAIAPFQGRVLIGVGKLLRVY-----DLGKKKLLRKC-ENKHI-ANYISG 410
Query: 822 -QTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFN 880
QT G ++V D+ +S + YK E + A D W++ +LD D GA+ N
Sbjct: 411 IQTIGHRVIVSDVQESFIWVRYKRNENQLIIFADDTYPRWVTTASLLDYDTVAGADKFGN 470
Query: 881 LFTVR--KNSEGATDE---------ERGRL-------EVVGEYHLGEFVNRFRHGSLVMR 922
+ VR N+ DE +RG L EV+ YH+GE V + +L+
Sbjct: 471 ICVVRLPPNTNDEVDEDPTGNKALWDRGLLNGASQKAEVIMNYHVGETVLSLQKTTLI-- 528
Query: 923 LPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRS 981
G ++++ T++G IG++ HE + F + ++ +LR + G +H +RS
Sbjct: 529 -----PGGSESLVYTTLSGGIGILVPFTSHEDHDFFQHVEMHLRSEHPPLCGRDHLSFRS 583
Query: 982 FNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
+ KN +DGDL E F + + +S+ ++ + E+ K++E++
Sbjct: 584 Y-----YFPVKNVIDGDLCEQFNSMEPNKQKNVSEELDRTPPEVSKKLEDI 629
>gi|339259094|ref|XP_003369733.1| splicing factor 3B subunit 3 [Trichinella spiralis]
gi|316965959|gb|EFV50595.1| splicing factor 3B subunit 3 [Trichinella spiralis]
Length = 1241
Score = 114 bits (284), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 165/765 (21%), Positives = 321/765 (41%), Gaps = 92/765 (12%)
Query: 7 YGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTDN 65
+G I +L FR G ++D++ + ++ + +L++ + + + T+ + + G R
Sbjct: 114 FGIIRSLSSFRLTGSSKDYIVVGSDSGRIVILEYIPDKN-IFTKVHQETFGKSGCRRIVP 172
Query: 66 GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YGC 120
GQ +DP R +IG L ++ D+ +L + + + Q + + + G
Sbjct: 173 GQYLTVDPKGRAIMIGATEKQKLVYIMNRDSAARLTISSPLEAHKSQSICFQIVGVDVGF 232
Query: 121 AKPTIVVLYQDNKDARHVKTYEV--ALKDKDFVEGPWSQNN--------LDNGADLLIPV 170
P L D ++ V T EV A++ F E N+ LD A+ LI V
Sbjct: 233 ENPLFACLEVDYEEIDSVNT-EVGKAIQTLTFYELDLGLNHVVRKYAEPLDYFANHLIAV 291
Query: 171 P-----PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVD-------ADGSRYLL 218
P P GV++ E I+Y N + + +R I + ++ + +LL
Sbjct: 292 PGGSEGP--SGVILCCENYIIY--KNLGEQLDVRCPIPRRRNDLEDPDRGILVNAFFFLL 347
Query: 219 GDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVV--YIGSSYGD------ 270
G + + + + VT +KI+ +A++I L + + Y + GD
Sbjct: 348 QTEQGDIFKVTLEVFDDIVTEIKIKYFDTLPVANSICMLKSGFLNLYQIAKLGDNDDETE 407
Query: 271 -SQLIKLN------LQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYK 323
S L+ L+ P + ++V + ++L PI+ F V D+ + Q+ G
Sbjct: 408 FSSLMPLDEGDTFFFAPRGLQNLIQV-DELLSLSPIMSFEVADIVYEDTPQLYAACGRGP 466
Query: 324 DGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTDDPFDTFLVVSFISETRILAM-NLE 381
S++++R+G+ + E A EL G +W+++ D +D ++VVSF++ T +L++
Sbjct: 467 RSSIKVLRHGLEVTEMAISELPGNPNAVWTVKKHQDAEYDAYIVVSFVNATLVLSIGETV 526
Query: 382 DELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNV 441
+E+ ++ + G VQ+ +R + R NEWK P +
Sbjct: 527 EEVTDSVVLGVVD---------------VQIYPEGIRHIRCDRR--VNEWKVPGRRVITN 569
Query: 442 ATANASQVLLATGGGHLVYLEIGD-GILTE-VKHAQLEYEISCLDINPIGENPSYSQIAA 499
N QVL+A GG +VY E+ + G L E + ++ ++ + + I + A
Sbjct: 570 CAVNCRQVLIALNGGEIVYFELDEGGQLNEYTERKEMPCDVVAMALADIPPGEIRCRFLA 629
Query: 500 VGMWTDISVRIFSLPDLNLITKEHLGG-EIIPRSVLLCAFEGI---SYLLCALG------ 549
V + TD +VR+ SL + +T ++ + +S+ + A G S L +G
Sbjct: 630 VAL-TDRTVRVISLDPSDCLTPLNMQSLPAMAQSLCIAAVGGDTMESTLFLNIGLEVNKF 688
Query: 550 -DGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTT--------HVFAASDRPTV 600
+G LL +L+ TGEL D + LG +P+ L + ++ A S R +
Sbjct: 689 TNGVLLRTVLDSVTGELQDTRTRYLGMKPVKLFKVKMHDNIAVMNSVQLNIIAMSSRAWL 748
Query: 601 IYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKL-HIRSIPL 659
+Y + + ++ + + F S P+ + L I I+++ + + S L
Sbjct: 749 LYKYQTRYHLTPLSYIPLEYASNFASDQCPEGVVAIAGNSLRILAIENLGTIFNYSSFQL 808
Query: 660 GEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEF 704
PRR ++ + S +++M +L + E
Sbjct: 809 SHTPRRFVFHPETGNIITVETDHNSLTTQTKMKRREILAKEMIEL 853
Score = 82.8 bits (203), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 71/283 (25%), Positives = 132/283 (46%), Gaps = 27/283 (9%)
Query: 760 KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSEC-GHHGHILA 818
+L+++ AV +L +F G+LLA+ + +++Y D G ++L +C H L
Sbjct: 970 RLEMMHRTTVDEAVTALASFRGRLLASAGKMLRIY-----DLGKKKLLRKCENKHMPNLI 1024
Query: 819 LYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENN 878
++ T G I VGD+ +S+ YK E + A D + + SA+ ILD D +
Sbjct: 1025 THILTMGHRIFVGDVQESVFFYRYKPIENQLVVFADDTHQRFCSAMCILDYDTVALRLPS 1084
Query: 879 FNLFTVRKNS--------EGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQ 930
V+++ +G + + E+V +++GE V + L+ G
Sbjct: 1085 DCTDDVQEDPTGIRALWDKGILNGASQKCEMVATFYVGECVTCLQKAMLI-------PGS 1137
Query: 931 IPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTV 989
++++ T++G+IG + E Y F + L+ +LR + G +H +RSF
Sbjct: 1138 SESLVYSTMSGMIGALVPFSSKEDYEFFQHLEMHLRTEYPPLCGRDHLAYRSF-----YA 1192
Query: 990 DAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
K +DGDL E + L + EIS ++ E+ K++E++
Sbjct: 1193 PVKGVIDGDLCEQYCLLEYGKQKEISNELDRVPSEIMKKLEDI 1235
>gi|296086939|emb|CBI33172.3| unnamed protein product [Vitis vinifera]
Length = 934
Score = 114 bits (284), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 90/353 (25%), Positives = 163/353 (46%), Gaps = 47/353 (13%)
Query: 152 EGPWSQNNLDNGADLLIPVP---PPLCGVLIIGEETIVYCS---ANAFKAIPIRPSITKA 205
+G WS+ +DNGA++L+ VP GVL+ E ++Y + + IP R +
Sbjct: 81 KGKWSEQ-VDNGANMLVTVPGGGDGPSGVLVCAENFVIYKNQGHPDVRAVIPRRADLPAE 139
Query: 206 YGR--VDADGSR------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYL 257
G V A R +LL G + + + HE ++++ LKI+ + S++ L
Sbjct: 140 RGVLIVSAATHRQKSMFFFLLQTEYGDVFKVTLEHENDRISELKIKYFDTIPVTSSMCVL 199
Query: 258 DNAVVYIGSSYGDSQLIKLN--------------------------LQPDAKGSYVEVLE 291
+ ++ S +G+ L + QP + V + +
Sbjct: 200 KSGFLFAASEFGNHGLYQFQAIGDDADVESSSASLMETEEGFQPVFFQPRGLKNLVRI-D 258
Query: 292 RYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGM 350
+ +L PI+D V +L + Q+ G S+RI+R G+ I+E A +L G+ +
Sbjct: 259 QVESLMPIMDMKVSNLFEEETPQIFALCGRGPRSSIRILRPGLAISEMAVSQLPGVPSAV 318
Query: 351 WSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLV 410
W+++ + +D FD ++VVSF + T +L++ + +EE GF T +L + L+
Sbjct: 319 WTVKKNVNDEFDAYIVVSFANATLVLSIG--ETVEEVSDSGFLDTTPSLAVSLIGDDSLM 376
Query: 411 QVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI 463
QV +R + R NEW++P ++ +N QV++A GG L+Y E+
Sbjct: 377 QVHPSGIRHIREDGR--INEWRTPGKRTIVKVGSNRLQVVIALSGGELIYFEV 427
Score = 97.1 bits (240), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 100/370 (27%), Positives = 168/370 (45%), Gaps = 54/370 (14%)
Query: 694 VRLLDDQTFEFISTYPLDTFEYGCSILSCSFSD-DSNVYYCVGTAYVLP--EENEPTKGR 750
+R+LD +T L E SI + +F D + VGTA L + G
Sbjct: 583 IRILDPRTATTTCLLELQDNEAAFSICTVNFHDKEYGTLLAVGTAKSLQFWPKRSFDAGY 642
Query: 751 ILVF-IVEDGK-LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQS 808
I ++ +EDGK L+L+ + + +G +L F G+LLA I ++LY D G R L
Sbjct: 643 IHIYRFLEDGKSLELLHKTQVEGVPLALCQFQGRLLAGIGSVLRLY-----DLGKRRLLR 697
Query: 809 ECGHH---GHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVE 865
+C + I++++ T D I VGD+ +S Y+ +E + A D W++A
Sbjct: 698 KCENKLFPNTIVSIH--TYRDRIYVGDIQESFHYCKYRRDENQLYIFADDSVPRWLTASY 755
Query: 866 ILDDDIYLGAENNFNLFTVR------------------KNSEGATDEERGRLEVVGEYHL 907
+D D GA+ N++ VR K +G + ++E + ++H+
Sbjct: 756 HIDFDTMAGADKFGNIYFVRLPQDVSDEVEEDPTGGKIKWEQGKLNGAPNKVEEIVQFHV 815
Query: 908 GEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIG-VIASLPHEQYLFLEKLQTNLRK 966
G+ V + SL+ G +I+GTV G +G ++A + F L+ ++R+
Sbjct: 816 GDVVTCLQKASLI-------PGGGECIIYGTVMGSLGALLAFTSRDDVDFFSHLEMHMRQ 868
Query: 967 VIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESF----LDLSRTRMDEISKTMNVSV 1022
+ G +H +RS K+ +DGDL E F LDL R DE+ +T
Sbjct: 869 EHPPLCGRDHMAYRS-----AYFPVKDVIDGDLCEQFPTLPLDLQRKIADELDRTPG--- 920
Query: 1023 EELCKRVEEL 1032
E+ K++EE+
Sbjct: 921 -EILKKLEEV 929
>gi|358253751|dbj|GAA53701.1| splicing factor 3B subunit 3, partial [Clonorchis sinensis]
Length = 726
Score = 112 bits (281), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 135/594 (22%), Positives = 248/594 (41%), Gaps = 75/594 (12%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+L +G I + FR G ++D+L + ++ + VL++ S + + + G
Sbjct: 56 LLTCEAFGIIRSFMPFRLTGSSKDYLIVGSDSGRVVVLEY-IPSKNAFEKLQQETFGKSG 114
Query: 61 -RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL 117
R GQ ++P R +IG L ++ D++ +L + + + L +
Sbjct: 115 CRRIVPGQYLAVNPKGRAFMIGAVEKQKLVYIMNRDSQARLTISSPLEAHKSNTLVYHMV 174
Query: 118 ---YGCAKPTIVVLYQDNKDARHVKTYEVALKDKD---FVEGPWSQNN--------LDNG 163
G P + D +DA T E A + E N+ L+
Sbjct: 175 GVDVGFENPLFACIEMDYEDADQDSTGEAARGANQLLTYYELDLGLNHVVRKYSEPLEEH 234
Query: 164 ADLLIPVP---PPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLL-- 218
A+ LI VP G+LI E I Y N IR I + +D D R +L
Sbjct: 235 ANFLIAVPGGNEGPSGILICSENYITY--KNFGDQPDIRCPIPQRRNNLD-DPDRGILFV 291
Query: 219 --GDHA--GLLHLLVITHE-----------KEKVTGLKIELLGETSIASTISYLDNAVVY 263
H L L T + + VT +K++ +AS + L ++
Sbjct: 292 CSASHKTKNLFFFLAQTEQGDIFKINLEVDDDMVTEIKLKYFDTLPVASAMCVLKTGFLF 351
Query: 264 IGSSYGDSQLIKLN------------------------LQPDAKGSYVEVLERYVNLGPI 299
+ S +G+ L ++ P A + +EV + NL PI
Sbjct: 352 VASEFGNHSLYQIAHLGDDDDEPEFSSAMPLEEGDTFYFAPRALKNLIEV-DVLENLSPI 410
Query: 300 VDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTD 358
+ + + D + Q+ G + R++R+G+ ++E A +L G +W+++ +++
Sbjct: 411 MHYHIADFANEDTPQLAVLCGRGPGSTFRLLRHGLEVSEMAKSDLPGNPNAVWTVKRNSE 470
Query: 359 DPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVR 418
+ +D +++VSF++ T +L++ + +EE GF T TL C + LVQV +R
Sbjct: 471 EEYDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLTCSQLGDDALVQVYPDGIR 528
Query: 419 LVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKHAQL 476
+ + R N W++P + N QV++A GG LVY E+ G L E + ++
Sbjct: 529 HIRADKRV--NVWRAPGKKMIVRCAVNRRQVVIALTGGELVYFEMDMTGQLNEYTERKEM 586
Query: 477 EYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEII 529
++ C+ + I + S+ AVG+ D +VRI SL P ++ ++ E ++
Sbjct: 587 PADVICMALGRIPTSEQRSRFLAVGL-ADNTVRILSLDPSVSRVSCERFSTPMV 639
>gi|341893349|gb|EGT49284.1| hypothetical protein CAEBREN_30765 [Caenorhabditis brenneri]
Length = 213
Score = 112 bits (279), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 68/213 (31%), Positives = 106/213 (49%), Gaps = 10/213 (4%)
Query: 834 MKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR-KNSEGAT 892
M+S+SLL Y+ EG EE A+D+N+ WM E + + LG E + N+FTV S T
Sbjct: 1 MRSVSLLSYRTLEGNFEEVAKDWNSEWMVTCEFITAESILGGEAHLNMFTVEVDKSRPVT 60
Query: 893 DEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHE 952
D+ R LE G ++LGE LV + D+ + +++GT G +G++ +
Sbjct: 61 DDGRYVLEPTGYWYLGELTKVMIRAVLVPQPDDNSIRYTQPIMYGTNQGSLGLVVQIDDM 120
Query: 953 QYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMD 1012
FL ++ + K + H +RSF K+ F+DGDLIES LD+ R+R
Sbjct: 121 YKKFLLSIEKAISDAEKNCMQIEHSTYRSFTYNKRIEPPSGFIDGDLIESILDMDRSRAI 180
Query: 1013 EISKTMNV---------SVEELCKRVEELTRLH 1036
EI + N E+ K +++L+R H
Sbjct: 181 EILEKANTRGWDPSIPKDPVEILKIIDDLSRTH 213
>gi|260947152|ref|XP_002617873.1| hypothetical protein CLUG_01332 [Clavispora lusitaniae ATCC 42720]
gi|238847745|gb|EEQ37209.1| hypothetical protein CLUG_01332 [Clavispora lusitaniae ATCC 42720]
Length = 1242
Score = 111 bits (278), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 214/1010 (21%), Positives = 400/1010 (39%), Gaps = 165/1010 (16%)
Query: 156 SQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSA----NAFKAIPIRPS-----ITKAY 206
S+ ++D A+LLIP+P P GVLI + ++Y S + P+R IT Y
Sbjct: 259 SKPDIDASANLLIPLPAPAGGVLICCKSYLIYESTPGGTRKYLQFPVRSDTSEVVITNFY 318
Query: 207 -GRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIG 265
++ L+ G L + ++ E E +T + ++++ ++ ++
Sbjct: 319 VHKLKKKEFFVLVQSSLGDLFKITVSIEGETLTAFYATYFDTIPVCTSLNIFNSGFLFAN 378
Query: 266 SSYGDS---QLIKLNLQPDAKGSYVE--------------------VLERYVNLGPIVDF 302
S+ + Q L D S V ++E +L P+VD
Sbjct: 379 STDNNKFFYQFESLGEADDTTTSSVSEKSDFPTKTFFDSRGLKNLSLVEVLGSLSPLVDS 438
Query: 303 CVVDLERQ----GQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQ-GIKGMWSLRSST 357
+++ Q+VT + + ++ + G+ ++E S + +++ + S
Sbjct: 439 TMLETYSSEFPDPLKQLVTLAS---NSYMKTLTYGVPVSELVSSPVPINPTSIFTTKISK 495
Query: 358 DDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSV 417
D +LV++ + + +++ + +EE GF T+ + LVQV + +
Sbjct: 496 KSINDDYLVLTSSLTMKSMVLSIGEVVEEVSDSGFALDQHTIAVQQIGQHSLVQVHNNGI 555
Query: 418 RLVSS--------TSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI--GDGI 467
R + + TS+ N W P G + A+ N QVL+ + Y EI D
Sbjct: 556 RHIRNFFDDSDNVTSKRETN-WYPPAGIVILHASTNNEQVLVGLSNREVCYFEIDPADDQ 614
Query: 468 LTEVKHAQLEY---EISCLDINP--IGENPSYSQIAAVGMWTDISVRIFSLPD--LNLIT 520
L E + +LE I+ L + + E S A +G + I LP ++T
Sbjct: 615 LIEYQE-RLEVTGGSITALALASKFVTEGDRKSNFAVIGTSEETIQVISLLPKTCFEIVT 673
Query: 521 KEHLGGEIIPRSVLLCAFEGISYLL-CALGDGHLLNFLLNMKTGELTDRKKVSLGTQPIT 579
+ L S+L+ + +Y + + +G +++ +G+L D + LGTQP++
Sbjct: 674 LQALSANCF--SLLMMPLDKDNYFVHIGMINGVYARVRIDVVSGKLGDSRLKYLGTQPVS 731
Query: 580 LRTFSSKN--TTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAF------PD 631
LR+ + N ++ + A S RP + Y +N+ SN L + + + +AF +
Sbjct: 732 LRSLALPNVASSGILAISSRPWIGYFNNE----SNFKLTPLIDISIKDGSAFYSEDIGIE 787
Query: 632 SLAIAKEGELTIGTI--------DDIQKLHIRSIPLGEHPRRICHQEQSRTFAICS---- 679
S+ ++GE+TI T+ D L I SI L PR+ + F +
Sbjct: 788 SVVGIRDGEITIFTVGGEEGEGFDVTNDLTISSIKLRYAPRKQLKDSLTDYFFVIESEFG 847
Query: 680 -----LKNQSC------------AEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSC 722
L+++ E S ++++D T+ E S+ S
Sbjct: 848 VISPYLESKEVDAEYFEVFGYKKKENSWASCLQIVDVAGSGIEYTHEFTNNECTISLASI 907
Query: 723 SFSDDSNVYYCVGTAYVLPEENEPTKGR-ILVFIVEDG---KLQLIAEKET--KGAVYSL 776
D+ Y VGT L G I F ++ K +L+ +T + SL
Sbjct: 908 KIRDEE--YLIVGTTENLVYSPHYYSGNSIYTFRIKRNAKKKPELVYLHKTTIEFPPSSL 965
Query: 777 NAFNGKLLAAINQKIQLY----KWMLRDDGTR-ELQSECGHHGHILALYVQTRGDFIVVG 831
AFN KLL +++LY K +LR T+ + HI GD IVV
Sbjct: 966 CAFNEKLLVGAGNQLRLYDVGRKQLLRKTSTKIDFLRRINKIQHI-------AGDVIVVC 1018
Query: 832 DLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR------ 885
D +S+S + + + D ++A+E+LD + + N+F R
Sbjct: 1019 DSSESVSFMRFDQTKNQFIAFCNDTAKRQITALEVLDSRTVIAGDRFGNIFVSRIPKNVA 1078
Query: 886 ---------KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIF 936
K E RL+ + +++ + V GS V VG ++I+
Sbjct: 1079 EQLENNVLMKFEEETLGASSSRLDKLCDFYTQDIVTSLHKGSFV-------VGGSESIIY 1131
Query: 937 GTVNGVIGVIASLPHEQYL-FLEKLQTNLRKVIK-------------GVGGLNHEQWRSF 982
+ G +G++ L Q + L KL+ +LR + G H ++R +
Sbjct: 1132 TGLQGTVGILLPLATTQEVDLLMKLENSLRDYFNDSFDDFDNTKQGFNLVGREHLKFRGY 1191
Query: 983 NNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
N +N +DGD IE F +L+ + +++ ++ S ++ +++ +L
Sbjct: 1192 YNP-----VENVIDGDFIERFFELNPSAQVKLAGRLDKSPRDIERKIYDL 1236
>gi|312380158|gb|EFR26239.1| hypothetical protein AND_07834 [Anopheles darlingi]
Length = 1503
Score = 111 bits (278), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 121/481 (25%), Positives = 212/481 (44%), Gaps = 67/481 (13%)
Query: 609 LYSNVNLKEVSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDIQKLH----IRSIPLGEHP 663
LY+ +K + PFN+ P+ L ++ +L I + +R IPL P
Sbjct: 978 LYARTVMKAFA---PFNNVNCPNGFLYFDEQYQLKISILPTYLSYDSVWPVRKIPLRSSP 1034
Query: 664 RRICHQEQSRTFAICSLKNQSC-------AEESEM----------------HFVRLLDDQ 700
++I + ++R + + + C E+ E+ V L++
Sbjct: 1035 KQIVYHRENRVYCVVMDAEEICNKYYRFNGEDKELTEENKGERFLYPMGHQFSVVLVNPA 1094
Query: 701 TFEFI--STYPLDTFEYGCSILSCSFSDDS-----NVYYCVGTAYVLPEENEPTKGRILV 753
+E + + L+ +E+ S+ + S + + Y VGT + E+ ++GR+L+
Sbjct: 1095 AWEIVPDTAIALEEWEHVVSLKNVSLAYEGARSGLKEYIAVGTNFNY-SEDITSRGRLLL 1153
Query: 754 F-----IVEDGK------LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDG 802
+ + E GK + + K+ KG V +++ G L+ A+ QK+ Y W ++DD
Sbjct: 1154 YDIIEVVPEPGKPLTKHKFKEVIVKDQKGPVSAISHVCGFLVGAVGQKV--YLWQMKDD- 1210
Query: 803 TRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMS 862
+L +I + + I+V D+ KS+SLL ++ E + +RDY+ +
Sbjct: 1211 --DLVGVAFIDTNIFVHQMVSIKSLILVADVYKSVSLLRFQDEFRTLSLVSRDYHPLNVY 1268
Query: 863 AVEILDDDIYLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVN-RFR--- 915
VE + D+ LG A++ NL T E RL G+YHLG+ VN FR
Sbjct: 1269 QVEYVVDNTNLGFLVADDQANLITYMYQPESRESFGGQRLLRKGDYHLGQRVNAMFRVQC 1328
Query: 916 --HGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGG 973
H S VMR + T F T++G G + LP + Y L LQ L G
Sbjct: 1329 DFHESDVMRR-TLNYDNKHTTFFATLDGGFGFVLPLPEKTYRRLFMLQNVLLTHSPHTCG 1387
Query: 974 LNHEQWRSFNNEKKTV--DAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEE 1031
LN + +R+ + ++ +DGDL+ SFL+L E++K + +EE+C + E
Sbjct: 1388 LNPKAYRTIKQSRALPINPSRCVVDGDLVWSFLELPANEKQEVAKKIGTRIEEICADLME 1447
Query: 1032 L 1032
+
Sbjct: 1448 I 1448
Score = 67.0 bits (162), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 122/554 (22%), Positives = 210/554 (37%), Gaps = 140/554 (25%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
M ++G I +L+ G +D L I+ K V+Q+D ++ +L T ++ D
Sbjct: 81 MASYRLFGNIMSLQSVSLAGSQRDALLISFPDAKLSVVQFDPDNFDLKTLSLHYFEDEDI 140
Query: 61 RPTDNGQIGI----IDPDCRLIGLHLYDGLFKVIPFDNKGQLKE---------------- 100
R G I +DPD R + +Y V+PF L E
Sbjct: 141 RGGWTGHYHIPLVRVDPDNRCAVMLVYGRKLVVLPFRKDSSLDEIEMQDVKPIKKTPTLL 200
Query: 101 --------AFNIRLEEL-----QVLDIKFLYGCAKPTIVVLYQDNKD------ARHVKTY 141
++ I L++L V+D++FL+G +PT+++LY+ + R
Sbjct: 201 IAKTPILASYIIELKDLDEKIDNVIDVQFLHGYYEPTLLILYEPVRTFPGRIAVRSDTCT 260
Query: 142 EVALK---DKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVY----------- 187
VAL + W+ N+L +P+ P+ G L++ +++Y
Sbjct: 261 MVALSLNIQQRVHPVIWTVNSLPFDCLQAVPISKPIGGCLVMCVNSLIYLNQSVPPYGVS 320
Query: 188 --CSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELL 245
SA+ P++P DG R L D A + I EK ++ EL
Sbjct: 321 LNSSADHSTNFPLKPQ----------DGVRISL-DAA---QVCFIESEKLVLSLKGGELY 366
Query: 246 GETSIAST---ISYLDNAVVYIGSSYGDSQLIK-------LNLQPDAKG----------- 284
T A + I + +++GS G+S L++ L + D G
Sbjct: 367 VLTLCADSMRSICVCETEYLFLGSRLGNSLLLRFREKDESLVITIDDSGTVEKEQKRQRL 426
Query: 285 -------------------SYV-EVLERYVNLGPIVDFCVVD---------------LER 309
SY+ EV + +N+GPI V + +
Sbjct: 427 EEEELEVYGSGYKTSVQLTSYIFEVCDSVLNIGPIAHMAVGERICEEEMEEGAEVQFVPN 486
Query: 310 QGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL-------------RSS 356
+ +VVT SG K+G+L ++++ I S L G +W++ R
Sbjct: 487 KLDVEVVTASGHGKNGALCVLQSSIKPQVITSFGLSGCLDVWTVFDEAAGPGGVTGTRKP 546
Query: 357 TDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQ-LVQVTSG 415
D P + E + + +E+ E E GF + T+ + N+ +VQVT+
Sbjct: 547 DDAPPPNHAFMILSQEGATMVLQTGEEINEIENTGFATDVPTIHVGNIGSNRFIVQVTTK 606
Query: 416 SVRLVSSTSRELRN 429
S+RL+ T R L+N
Sbjct: 607 SIRLLQGT-RLLQN 619
>gi|356570929|ref|XP_003553635.1| PREDICTED: uncharacterized protein LOC100799711 [Glycine max]
Length = 1258
Score = 111 bits (277), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 120/506 (23%), Positives = 214/506 (42%), Gaps = 72/506 (14%)
Query: 269 GDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLR 328
GD ++KL D + Y ++ N+ PI+D VVD + Q+ C G +GSLR
Sbjct: 443 GDGMVLKLE---DGRLCYTNPIQ---NIAPILDMEVVDYHDEKHDQMFACCGVAPEGSLR 496
Query: 329 IVRNGIGI---NEQASVELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELE 385
I+RNGI + + AS+ QG+ G W++R D +FLV+SF+ ETRIL++ L +
Sbjct: 497 IIRNGINVENLHRTASI-YQGVSGTWTVRMKVTDSHHSFLVLSFLDETRILSVGLSFT-D 554
Query: 386 ETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATAN 445
T+ GF TL C LVQ+ +V+L T A+ +
Sbjct: 555 VTDSVGFQPNVCTLACGLVTDGLLVQIHRSTVKLCLPTK-----------------ASHS 597
Query: 446 ASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTD 505
++ GG + + G LT N IG S S V + +
Sbjct: 598 EVEIWFFAPGGGITVVACGTISLT----------------NTIGSVKSDSIPQDVRLVSA 641
Query: 506 ISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNM--KTG 563
+ + ++ + E P S I+ + AL +L+N + N K
Sbjct: 642 DKYYVLAGLRNGMLLRFEWPAEPCPSS-------PINMVDTALSSTNLVNSVTNAFDKRN 694
Query: 564 ELTDRKKV----SLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVS 619
+L ++ +G PI L + +DRP +++S+ + L Y++++ + +
Sbjct: 695 DLPSMLQLIAIRRIGITPIFLVPLGDTLDADIIVLADRPWLLHSARQGLSYTSISFQPAT 754
Query: 620 HMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAI-- 677
H+ P + FP + E L + + ++L+++ L PR++ + ++S+ +
Sbjct: 755 HVTPVSCVEFPKGILFVAENSLHLVEMGHGKRLNVQKFHLEGTPRKVLYHDESKMLLVMR 814
Query: 678 CSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTA 737
L C + + +D + +S++ L+ E G S+ S VGT+
Sbjct: 815 TELNCGPCLSD-----ICCVDSLSGSVLSSFRLELGETGKSMELVRVG--SEQVLVVGTS 867
Query: 738 -----YVLPE-ENEPTKGRILVFIVE 757
+ +P E E KGR+LV ++
Sbjct: 868 LSSGPHTMPTGEAESCKGRLLVLCLD 893
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 50/201 (24%), Positives = 82/201 (40%), Gaps = 17/201 (8%)
Query: 828 IVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKN 887
I VGD I L Y E +E D + ++ ++D D + ++ ++ + +
Sbjct: 1023 IAVGDCRDGILLYSYHEEAKKLELLYNDPSQRIVADCILMDADTAVVSDRKGSIAVLCSD 1082
Query: 888 SEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV---GQIP---------TVI 935
+ + + Y + E + GS RLP DV G P T+I
Sbjct: 1083 HLEDNAGAQCNMTLSCAYFMAEIAMSIKKGSYSYRLPADDVLEGGNGPKTNVDSLQNTII 1142
Query: 936 FGTVNGVIGVIASLPHEQYLFLEKLQTNL--RKVIKGVGGLNHEQWRSFNNEKKTVDAKN 993
T+ G I + L E+Y LE +Q L + V G +H ++RS N V
Sbjct: 1143 ASTLLGSIMIFIPLSREEYELLEVVQARLVVHHLTAPVLGNDHHEFRSRENR---VGVPK 1199
Query: 994 FLDGDLIESFLDLSRTRMDEI 1014
LDGD++ FL+L+ + I
Sbjct: 1200 ILDGDILTQFLELTSMQQKMI 1220
>gi|322700233|gb|EFY91989.1| Pre-mRNA-splicing factor rse-1 [Metarhizium acridum CQMa 102]
Length = 1039
Score = 111 bits (277), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 127/583 (21%), Positives = 239/583 (40%), Gaps = 79/583 (13%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+L ++G I ++ FR G ++D++ +A++ + ++++ + +
Sbjct: 59 LLSHDVFGIIRSIAAFRLAGSSKDYIILASDSGRIAIVEYAPTKNSFSRIHLATFGKSGV 118
Query: 61 RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL- 117
R GQ DP R LI + L V+ + + +L + + + +L +
Sbjct: 119 RRVIPGQYLAADPKGRACLIASIEKNKLVYVLNRNAEAELTISSPLEAHKHGILVFSVVS 178
Query: 118 --YGCAKPTIVVLYQDNKDARHVKTYEVALK--------DKDFVEGPWSQNNLDNGADLL 167
G A P L D +A + EV ++ + V WS+ +D + LL
Sbjct: 179 LDVGYATPVFAALEADYTEAEQDDSAEVEIQLVYYELDLGLNHVVRKWSEV-VDPTSSLL 237
Query: 168 IPVP-----PPLCGVLIIGEETIVYCSAN--AFKA-IPIRPSITK--AYGRVDADGSRYL 217
VP P GVL+ GEE + Y +N AF+ IP R T+ R+ G +
Sbjct: 238 FQVPGGSDGP--SGVLVCGEENVTYRHSNQEAFRVPIPRRRGATEDPQRRRIIVSGVMHK 295
Query: 218 LGDHAGLLHLLVITHEKE------------------KVTGLKIELLGETSIASTISYLDN 259
L L+ T + + +VT +KI+ ++++I + +
Sbjct: 296 LKGSTSAFFFLLQTDDGDLFKVTLDMAEDADGNLTGEVTRIKIKYFDTIPVSNSICIMKS 355
Query: 260 AVVYIGSSYGDSQLIKL---------------NLQPDAKGSY------------VEVLER 292
++ +G+ Q + + D + SY + ++E
Sbjct: 356 GFLFTAGEFGNYQFYQFEKLGDDDDETEFTSDDFPTDYQASYHPVYFHPRPLENLTLVES 415
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMW 351
++ P++D ++ + Q+ G +LRI+R+G+ +NE + EL G +W
Sbjct: 416 VSSMAPLMDCKILTQAGEDISQIYAACGNGARSTLRILRHGLEVNELVASELPGTPSAVW 475
Query: 352 SLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQ 411
+ + + D +D +++++F+ +T +L++ + + + GF + TL N L Q
Sbjct: 476 TTKLTQSDDYDAYIILTFLHDTMVLSVG--ETVTQVTDSGFITTVATLAVQQIGKNSLFQ 533
Query: 412 VTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE 470
V S +R + S EW P ++ A N QV +A G +VY E+ DG L E
Sbjct: 534 VYSKGIRHIQSGQ---FTEWPVPQHRTIVAAATNERQVAIALSSGEIVYFEVDEDGSLAE 590
Query: 471 VKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL 513
+ ++ L + P+ E S AVG D +VRI SL
Sbjct: 591 FDERKEIGSVTSLGLGPVPEGRLRSPFLAVGC-DDCTVRILSL 632
Score = 84.3 bits (207), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 84/339 (24%), Positives = 163/339 (48%), Gaps = 25/339 (7%)
Query: 710 LDTFEYGCSILSCSF-SDDSNVYYCVGTAY-VLPEENEPTKGRILVF-IVEDGK-LQLIA 765
L+ E G S F S D + +GT ++ + ++G I V+ +DG+ L+ I
Sbjct: 704 LEGNEAGVSAAVVPFTSQDGESFLIIGTGKDMIVNPRQSSEGFIHVYRFHDDGRSLEFIH 763
Query: 766 EKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRG 825
+ + + +L +F+G+LLA I + +++Y +R R+ Q++ HI++L Q++G
Sbjct: 764 KTKVEEPPTALLSFHGRLLAGIGKTLRIYDLGMRQL-LRKAQADISPQ-HIVSL--QSQG 819
Query: 826 DFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR 885
IVVGD+ +++++Y + D A W + + + D + G + N++ VR
Sbjct: 820 FRIVVGDVQHGVTMVVYNPVSNKLLPFVDDTIARWTTCLAMADYESVAGGDKFGNIWIVR 879
Query: 886 --KNSEGATDEERGRLEVV-GEYHLGEFVNRFRHGS--LVMRLPDS------DVGQIPTV 934
+ DE +++ G+ +L +R + + V +P S VG +
Sbjct: 880 CPDKASAEADEPGSDVQLSNGQSYLHGAAHRLQLMAHMFVQDIPTSICKTSLVVGGQDVL 939
Query: 935 IFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKN 993
++ + G IGV+ L E F + L+ ++R + G +H +R ++ V K
Sbjct: 940 LWSGLQGTIGVLIPLVTRETADFFQTLEMHMRNEDPPLAGRDHLMYRGYH-----VPVKG 994
Query: 994 FLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
+DGDL E F LSR + I+ ++ SV E+ +R+ ++
Sbjct: 995 VIDGDLCERFSLLSREKKQMIAGELDRSVREVERRISDV 1033
>gi|297816810|ref|XP_002876288.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297322126|gb|EFH52547.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 633
Score = 110 bits (274), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 145/634 (22%), Positives = 257/634 (40%), Gaps = 125/634 (19%)
Query: 497 IAAVGMWTDISVRIFSL-PD--LNLITKEHLGGEIIPRSVLLCAFEGIS----------- 542
+ AVG + D +VRI SL PD L +++ + + P S+L +
Sbjct: 20 LLAVGSY-DNTVRILSLDPDDCLQILSVQSVSS--APESLLFLEVQASIGGEDGADHPAN 76
Query: 543 -YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVI 601
+L L +G L +++M TG+L+D + LG +P L + S + + + S RP +
Sbjct: 77 LFLNSGLQNGVLFRTVVDMVTGQLSDSRSRFLGLKPPKLFSISVRGRSAMLCLSSRPWLG 136
Query: 602 YSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDI-QKLHIRSIPLG 660
Y + ++ + + PF+S + + L I ID + + + +PL
Sbjct: 137 YIHRGHFHLTPLSYETLEFAAPFSSDQCAEGVVSVAGDALRIFMIDRLGETFNETVVPLR 196
Query: 661 EHPRRICHQEQSRTFAIC-SLKNQSCAEESE----------------------------- 690
PR+ Q + + I S + AEE E
Sbjct: 197 YTPRKFVLQPKRKLLVIIESDQGAFTAEEREAARKECFEAGGVGENGNGNADQMENGADD 256
Query: 691 ----------------------MHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSD-D 727
+ +R+LD +T L E S+ + +F D +
Sbjct: 257 EDKEDPLSDEQYGYPKAESEKWVSCIRVLDPKTATTTCLLELQDNEAAYSVCTVNFHDKE 316
Query: 728 SNVYYCVGTAYVLPEENEPTKGRILVFI-----VEDGK-LQLIAEKETKGAVYSLNAFNG 781
VGT + + P K + FI VE+GK L+L+ + + +G +L F G
Sbjct: 317 YGTLLAVGTVKGM--QFWPKKNLVAGFIHIYRFVEEGKSLELLHKTQVEGVPLALCQFQG 374
Query: 782 KLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHI-LALYVQTRGDFIVVGDLMKSISLL 840
+LLA I ++LY D G + L +C + + +QT D I VGD+ +S
Sbjct: 375 RLLAGIGPVLRLY-----DLGKKRLLRKCENKLFPNTIISIQTYRDRIYVGDIQESFHYC 429
Query: 841 IYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR--------------- 885
Y+ +E + A D W++A +D D GA+ N++ VR
Sbjct: 430 KYRRDENQLYIFADDCVPRWLTASHHVDFDTMAGADKFGNVYFVRLPQDLSEEIEEDPTG 489
Query: 886 ---KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGV 942
K +G + +++ + ++H+G+ V + S++ G ++++GTV G
Sbjct: 490 GKIKWEQGKLNGAPNKVDEIVQFHVGDVVTCLQKASMI-------PGGSESIMYGTVMGS 542
Query: 943 IGVI-ASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIE 1001
IG + A + F L+ ++R+ + G +H +RS K+ +DGDL E
Sbjct: 543 IGALHAFTSRDDVDFFSHLEMHMRQEYPPLCGRDHMAYRS-----AYFPVKDVIDGDLCE 597
Query: 1002 SF----LDLSRTRMDEISKTMNVSVEELCKRVEE 1031
F +DL R DE+ +T E+ K++E+
Sbjct: 598 QFPTLPMDLQRKIADELDRT----PAEILKKLED 627
>gi|356505471|ref|XP_003521514.1| PREDICTED: uncharacterized protein LOC100806799 [Glycine max]
Length = 1278
Score = 110 bits (274), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 124/524 (23%), Positives = 223/524 (42%), Gaps = 77/524 (14%)
Query: 260 AVVYIGSSY-------GDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQGQ 312
A++++ S Y GD ++KL D + Y+ ++ N+ PI+D VVD + Q
Sbjct: 427 ALLWVESGYLAALVEMGDGMVLKLE---DGRLCYINPIQ---NIAPILDMEVVDYHDEKQ 480
Query: 313 GQVVTCSGAYKDGSLRIVRNGIGI---NEQASVELQGIKGMWSLRSSTDDPFDTFLVVSF 369
Q+ C G +GSLRI+RNGI + + AS+ QG+ G W++R D +FLV+SF
Sbjct: 481 DQMFACCGVAPEGSLRIIRNGINVENLHRTASI-YQGVTGTWTVRMRVTDSHHSFLVLSF 539
Query: 370 ISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRN 429
+ ETRIL++ L + T+ GF TL C LVQ+ +V+L T
Sbjct: 540 VEETRILSVGLSFT-DVTDSVGFQPNVCTLACGLVTDGLLVQIHKSTVKLCLPTK----- 593
Query: 430 EWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDINPIG 489
+S + + A GGG + + G LT + G
Sbjct: 594 -----AAHSEGIPLPSVEIWYFAPGGG-ITVVACGTISLTNTVGTAIS-----------G 636
Query: 490 ENPSYSQIAAVGMWTDISVRIFSLPDLN--LITKEHLGGEIIPRSVLLCAFEGISYLLCA 547
P ++ VG + + L L ++ + E P S I+ + A
Sbjct: 637 CVPQDVRLVFVGKY-------YVLAGLRNGMLLRFEWPAEPCPSS-------PINIVDTA 682
Query: 548 LGDGHLLNFLLNM--KTGELTDRKKV----SLGTQPITLRTFSSKNTTHVFAASDRPTVI 601
L +L+N + N K + ++ +G P+ L + SDRP ++
Sbjct: 683 LSSINLVNSVTNAFDKRNDFPSMLQLIAIRRIGITPVFLVPLGDTLDADIITLSDRPWLL 742
Query: 602 YSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGE 661
+S+ L YS+++ + +H+ P S P + E L + + ++L+++ L
Sbjct: 743 HSARHSLSYSSISFQPSTHVTPVCSVECPKGILFVAENSLHLVEMVHSKRLNMQKFHLEG 802
Query: 662 HPRRICHQEQSRTFAI--CSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSI 719
PR++ + ++S+ + L +C + + ++D + +S++ L+ E G S+
Sbjct: 803 TPRKVLYHDESKMLLVMRTELNCGTCLSD-----ICIMDPLSGSVLSSFRLELGETGKSM 857
Query: 720 LSCSFSDDSNVYYCVGTAY------VLPEENEPTKGRILVFIVE 757
+ + VGT+ + E E KGR+LV ++
Sbjct: 858 ELVRVGSEQVLV--VGTSLSSGPHTMATGEAESCKGRLLVLCLD 899
Score = 53.1 bits (126), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 50/201 (24%), Positives = 82/201 (40%), Gaps = 17/201 (8%)
Query: 828 IVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKN 887
I VGD I L Y E +E D + ++ ++D D + ++ ++ + +
Sbjct: 1029 IAVGDCRDGILLYSYHEEAKKLELLYNDPSLRLVADCILMDADTAVVSDRKGSIAVLCSD 1088
Query: 888 SEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV---GQIP---------TVI 935
+ + + Y + E + GS RLP DV G P T+I
Sbjct: 1089 HLEDNAGAQCNMALSCAYFMAEIAMSIKKGSYSYRLPADDVLQGGNGPKTNVDSLQNTII 1148
Query: 936 FGTVNGVIGVIASLPHEQYLFLEKLQTNL--RKVIKGVGGLNHEQWRSFNNEKKTVDAKN 993
T+ G I + L E+Y LE +Q L + V G +H ++RS N V
Sbjct: 1149 ATTLLGSIMIFIPLSREEYELLEAVQARLVVHHLTAPVLGNDHNEFRSRENR---VGVPK 1205
Query: 994 FLDGDLIESFLDLSRTRMDEI 1014
LDGD++ FL+L+ + I
Sbjct: 1206 ILDGDMLTQFLELTSMQQKMI 1226
>gi|344231825|gb|EGV63707.1| hypothetical protein CANTEDRAFT_134986 [Candida tenuis ATCC 10573]
gi|344231826|gb|EGV63708.1| hypothetical protein CANTEDRAFT_134986 [Candida tenuis ATCC 10573]
Length = 991
Score = 110 bits (274), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 160/701 (22%), Positives = 289/701 (41%), Gaps = 113/701 (16%)
Query: 350 MWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQL 409
+++ + + DD D +LV+S SET +L++ +E+ E F S+ T+ L
Sbjct: 378 IFTTKLTRDDVNDAYLVISS-SETSVLSIG--EEVAEVTDSKF-SKDPTILVQQVGKMAL 433
Query: 410 VQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG--DGI 467
+QV S ++ ++ E +W P G ++ A++N Q+++ ++Y E+ D +
Sbjct: 434 IQVYSNGIKHING---EKVTDWFPPAGINIIKASSNNQQLIIGLTNNEVIYFEVDVDDQL 490
Query: 468 LTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGE 527
+ +L I+ L I S+ AV D +V++ SL N + E
Sbjct: 491 VEYQDKVELPTNITALAI---------SKDFAVAGCADETVQVISLKQQNCL-------E 534
Query: 528 IIPRSVLLCAFEGISY----LLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTF 583
I+ +L I + + + +G + ++ + G+L++ + LGT+PI L
Sbjct: 535 ILSMQMLSSNSSAIEFSEQEVHIGMENGLFVRTNIDAR-GKLSNTRVKYLGTKPIRL--- 590
Query: 584 SSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEG-ELT 642
SK + A S +P V + +N +N +++ F S + G +L
Sbjct: 591 -SKINDSILAISSKPWVGFKTNGNFNIVPLNDIDITDGTSFYSEDIGGEGIVGFRGNDLI 649
Query: 643 IGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTF 702
I TIDD + I S + + E + SL + E
Sbjct: 650 IFTIDDFRNNFIISTEDIPATKLLVDNE------VYSLGKELVRNNKE------------ 691
Query: 703 EFISTYPLDTFEYGCSILS-CSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKL 761
F + S LS C F D Y VG P F+ +
Sbjct: 692 ---------KFAFEESPLSICRFKD----YVVVGVT-------NPN------FLYTFKDM 725
Query: 762 QLIAEKETKGAVYSLNAFNGKLLAAINQKIQLY----KWMLRDDGTRELQSECGHHGHIL 817
+L+ + E + S+ FNG+LL ++ ++ Y K +LR T H I+
Sbjct: 726 KLVHKTEVEMPPRSILEFNGRLLVGMDNLLRTYDLGKKQLLRKSST-----AIKHINKII 780
Query: 818 ALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAEN 877
+ Q + D IVVGD + S + K E + + D ++++ LD D +G +
Sbjct: 781 RIVYQGK-DRIVVGD--SNNSTIFCKFIENSFVPISDDTMNRQITSLSTLDYDTVIGGDK 837
Query: 878 NFNLFTVRKN-------SEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQ 930
N+F R E + GR + + E+ L + F G+LV+ P+
Sbjct: 838 FGNVFVNRIKYDNTYFVEESYLNGSSGRCQTLAEFFLNDIPMSFTKGTLVLGGPE----- 892
Query: 931 IPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVD 990
+I+ + G IG++ + + FL L L K + + G +H ++R + N
Sbjct: 893 --VIIYAGLQGTIGILLPISESDFKFLSNLSIELNKDL--LLGRDHMKFRGYYNS----- 943
Query: 991 AKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEE 1031
N +DGD+IE FL+L+ + +IS +N SV E+ ++ +
Sbjct: 944 THNVIDGDIIEKFLELNASSRIKISNKLNKSVREIENKIND 984
>gi|261329035|emb|CBH12013.1| damage-specific DNA binding protein, putative [Trypanosoma brucei
gambiense DAL972]
Length = 1270
Score = 109 bits (273), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 179/800 (22%), Positives = 313/800 (39%), Gaps = 127/800 (15%)
Query: 295 NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLR 354
N GP+ D V + V+ +G +G L VR+ + + + + + G++ + +
Sbjct: 426 NCGPVFDVTVA--ADGSRTSVIAGTGVGLNGGLSFVRSAVSVRQD--IRVTGLQNVRQIS 481
Query: 355 SSTDDPFDTFLVVSFISETRILAMNLEDE--LEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
S D ++ S +R+ + + LEE F ++ L ++ N +QV
Sbjct: 482 VSEDT-----IIFSLPGYSRVCRYCVGETMVLEEIHNTFFGTEETLLLEYNNERNVFLQV 536
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVAT-----ANASQVLLATGGGHLVYLEIGDGI 467
T+ LRN W Y + V+ A+AS+ LL +Y+
Sbjct: 537 TTAG----------LRNVWPDRGEYVIRVSNNDIGHAHASEGLLVFSNPANLYV-FCMKT 585
Query: 468 LTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGE 527
LT V L ISCL + S VG W +V ++ L D + +K L
Sbjct: 586 LTPVATLCLAEAISCLLVFS-------SNSLLVGTWGSCAVHLYELHDGMIQSKVILQCS 638
Query: 528 IIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKT--GELTDRKKVSLGTQPITLRTFSS 585
P S+ + G LL L +G++ + ++ GE+ + L TQP+ R F+
Sbjct: 639 ATPCSMCVVFRSGGHRLLVGLHNGYVADVPISGANVYGEMVETM---LTTQPV--RLFNL 693
Query: 586 KNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGE--LTI 643
++ V + P ++ +N + ++ ++VS P + E L +
Sbjct: 694 ESHAAVLCLGEIPLILIVTNTGFQLTGIDFRDVSACAIMPKMRSPSRYIFFSKSECALIL 753
Query: 644 GTIDDIQKLH------------IRSIP----LGEHPRRICHQEQSRTFAICSLKNQSCAE 687
G+I +QKL+ ++ +P L RRI ++Q + + N S
Sbjct: 754 GSITSVQKLNTDYVGLKATATCVKYMPWWNILVLSIRRI-EKDQVVSTMGHEVSNLSVLL 812
Query: 688 ESEMHFVRLLDDQTFEFISTYPL-DTFEYGCSILSCS-FSDDSNVYYCVGTAYVLPEENE 745
+ F LL+++ FI L E+G SC + D+ V +GT +V P+E
Sbjct: 813 DEPNSF-ELLENERCAFIDCVALGQANEWG----SCGEVASDAGVVL-IGTTFVFPDEQL 866
Query: 746 PTKGRILVFIVEDGKLQ-------LIAEKETKGAVYS---LNAFNGKLLAAINQKIQLYK 795
R + VE KL+ L K+ +GA+ + + G++ I + LY
Sbjct: 867 SRSSRFMWCTVEVAKLRTEKTLLRLQGSKDVEGALQCCCIVPNYAGRVALGIGGCVVLYS 926
Query: 796 WMLRDDG-TRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERAR 854
W D E + G L +Q +IV D S + +G++ AR
Sbjct: 927 WNAADATFVAEETIQIGTLIVRLIPVMQKEVSYIVASDARHSCFFVRIDTIQGSLNIVAR 986
Query: 855 DYNANWMSAVEILD----DDIYLGAENNFNLFTVR-----KNSEG--ATDEERGRLEVVG 903
D + IL D+ LG ++ FN F V NS G A +L+
Sbjct: 987 DPELRGVMDCAILQYESRHDVCLG-DDLFNFFCVSHVEPLANSSGVSAPAMPTKKLQTSA 1045
Query: 904 EYHLGEFVNRFRHGSL--------VMRLPDSDVGQI--PTVIFGTVNGVIGVIASLPHEQ 953
+YH+G+ + GS V+ +P + V + P +++GT +G G I + E
Sbjct: 1046 QYHMGDLITVMHQGSFAPCSVLNDVVPIPATLVRGVCGPQIVYGTSHGAFGAITPISSET 1105
Query: 954 YLFLEKLQTNLRKVIKGVGGLNHEQWRS--------------------------FNNEKK 987
++ L+ L+ ++ V+ +GG H +R FN ++
Sbjct: 1106 FILLKGLEVSVASVVPPLGGFTHASFREVLRVGQERGASRNASFQVTNPQATELFNRRRR 1165
Query: 988 TVDAKNFLDGDLIESFLDLS 1007
+ GD++E FL LS
Sbjct: 1166 RCVPRGVCCGDVVEMFLTLS 1185
>gi|427780291|gb|JAA55597.1| Putative mrna cleavage and polyadenylation factor ii complex subunit
cft1 cpsf subunit [Rhipicephalus pulchellus]
Length = 1237
Score = 107 bits (267), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 127/507 (25%), Positives = 211/507 (41%), Gaps = 76/507 (14%)
Query: 591 VFAASDRPTVIYSSNKKLLYSNVNLKEVSHMC--PFNSAAFPDS-LAIAKEGELTIGTID 647
VF RP ++ S++ L + + C PF++ P L K+GEL I T+
Sbjct: 735 VFLCGYRPYWLFMSSRGELRCHPMFVDGPIHCFAPFHNVNCPKGFLHFNKQGELRISTLP 794
Query: 648 DI----QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSC-------AEESEMHFVR- 695
+R +PL P + + S+T+ + + + C EE E +
Sbjct: 795 THLTYDAPWPVRKVPLRCTPHFVNYHVDSKTYCVVTSQPDPCNHLVRFTGEEKEYELLER 854
Query: 696 -------LLDDQTFEFIS-----TYP-----LDTFEYGCSILSCSFSDDSNV-----YYC 733
+D + + +S T P LD +E+ + + S + Y
Sbjct: 855 DSRYIFPTMDKFSLQLLSPVSWETIPNTRVDLDEWEHLTCLKNVMLSSEGTTTGMKGYLA 914
Query: 734 VGTAYVLPEE--------------NEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAF 779
+GT Y E+ P G+ L K++++ KE KG V +L+
Sbjct: 915 LGTNYCYGEDVTSRGRIIILDIIDVVPEPGQPLT----KNKIKIVYSKEQKGPVTALSQV 970
Query: 780 NGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISL 839
G LL+AI QKI Y W L+D+ EL I V T + I+VGD+ KS+SL
Sbjct: 971 VGFLLSAIGQKI--YIWQLKDN---ELVGVAFIDTQIYIHSVVTVKNLILVGDVFKSVSL 1025
Query: 840 LIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG-----AENNFNLFTVRKNSEGATDE 894
L Y+ + +RD + AVE D+ + AE N L+ + S +
Sbjct: 1026 LRYQEASRTLSLVSRDVRPLEVYAVEFFIDNTQMSFLVTDAERNLLLYMYQPESRESCGG 1085
Query: 895 ERGRLEVVGEYHLGE-----FVNRFRHGSLVM--RLPDSDVGQIPTVIFGTVNGVIGVIA 947
+R L G++H+G F + R G + R S V + T++G + +
Sbjct: 1086 QR--LLRRGDFHVGSPVVSMFRIKCRMGDIAKYDRRAASIVDGRHITMMATLDGSLAYVL 1143
Query: 948 SLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTV--DAKNFLDGDLIESFLD 1005
+P + Y L LQ L I GLN + +R + ++++ + KN LDG+LI F+
Sbjct: 1144 PVPEKTYRRLLMLQNVLVTNIPHYAGLNPKAYRMYYSQRRFLGNPHKNILDGELIWKFMH 1203
Query: 1006 LSRTRMDEISKTMNVSVEELCKRVEEL 1032
LS E+SK + +V ++ + E+
Sbjct: 1204 LSFMERSELSKKIGTTVTQITDDLLEI 1230
>gi|297810341|ref|XP_002873054.1| hypothetical protein ARALYDRAFT_908112 [Arabidopsis lyrata subsp.
lyrata]
gi|297318891|gb|EFH49313.1| hypothetical protein ARALYDRAFT_908112 [Arabidopsis lyrata subsp.
lyrata]
Length = 114
Score = 107 bits (266), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 54/103 (52%), Positives = 67/103 (65%), Gaps = 6/103 (5%)
Query: 643 IGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTF 702
I ID QK ++ +IPLGE+ ICHQ Q+ T+ IC+ S HFVRLL+DQTF
Sbjct: 13 IIAIDATQKHNVLTIPLGEYASCICHQAQTCTYGICT------GTHSGTHFVRLLEDQTF 66
Query: 703 EFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENE 745
+S Y LD +E G SI+SC F+ DSN YYCVGT YV PEE +
Sbjct: 67 NVLSAYTLDPYERGLSIVSCLFTADSNEYYCVGTEYVFPEEED 109
>gi|340054355|emb|CCC48651.1| putative damage-specific DNA binding protein [Trypanosoma vivax Y486]
Length = 1272
Score = 107 bits (266), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 176/819 (21%), Positives = 322/819 (39%), Gaps = 124/819 (15%)
Query: 295 NLGPIVDFCV-VDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL 353
N GP+ D V VD R V+ +G G L ++R+ + + + A+V G+ + +
Sbjct: 426 NCGPVFDMTVAVDGPRM---SVIASTGVGLSGGLNLLRSAVNVCQFAAV--HGVVNVKRV 480
Query: 354 RSSTDDPFDTFLVVSFISETRILAMNLEDELEETEI--EGFCSQTQTLFCHD-AIYNQLV 410
+ + +++SF +R+ + +E E+ F + +TL D + +
Sbjct: 481 CMAGN-----VIILSFPGYSRVCHFAVGKTMEVVEVYETPFDTLVETLELVDISQGGAFL 535
Query: 411 QVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTE 470
QVT+ V V T +E + PG+ +V A++ GG LV+ +
Sbjct: 536 QVTTSGVNAVQGTMQEYIHR----PGHGKSVEHAHSC-------GGMLVFSSYTSICVLA 584
Query: 471 VK--HAQL----EYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHL 524
V H+ + EYE+S L + + VG W ++ ++ + + +
Sbjct: 585 VNEPHSVITFHSEYEVSSLAM-------VSTTSLLVGEWGSNAISLYDVTAEGVHPRGRF 637
Query: 525 GGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGT-QPITLRTF 583
P SV + + G S LL L +G++ + LL+ G+ + + +L T P+ L
Sbjct: 638 TCSATPCSVSVIPYHGTSRLLVGLLNGYVADVLLDDIAGDSSVKVAETLLTLHPVRLFNL 697
Query: 584 SSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLK--EVSHMCPFNSAAFPDSLAIAKEGEL 641
S N V + P VI S+ + ++L E+ + P + + KE L
Sbjct: 698 QSHNA--VLCLGEVPLVIIVSDNGFQITGIDLYGIELCGIIPSSCSPVRYIFHSKKEQSL 755
Query: 642 TIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQ--------------SCAE 687
G I D+Q+L+ PL + + F + +N+
Sbjct: 756 IFGNITDVQRLNTNFFPLRATVTLVKYMAWWNVFVMSLRRNERDQILTIMGYELWAPSPL 815
Query: 688 ESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPT 747
E + + LL+++ F+ + L G + S N +GT + P+E
Sbjct: 816 RDEQNSLELLENERCVFVESVILG----GGNEWSTGTDALENSAIIIGTTFAFPDEQLSR 871
Query: 748 KGRILVFIVEDGK-------LQLIAEKETKGAVYS---LNAFNGKLLAAINQKIQLYKWM 797
R + VE GK L+ K+ +GA+ + + G++ IN + LY W
Sbjct: 872 SSRFMWCSVEQGKTANEKPQLRQQGSKDVQGALGCCCVVPNYAGRIALGINGCVALYCWN 931
Query: 798 LRDDGTRELQSECGHHGHILALYV---QTRGDFIVVGDLMKSISLLIYKHEEGAIEERAR 854
D + + E G IL + Q ++V D S S++ +G++ AR
Sbjct: 932 AVD--STFIAEETIGVGMILTRLMPVLQHDMSYLVAFDSRHSSSVIAVDTIQGSLSVAAR 989
Query: 855 DYNANWMSAVEILD----DDIYLGAENNFNLFTVRKNSEGATDEERG-------RLEVVG 903
D + +L DD+ G ++ FNLF V AT + +L
Sbjct: 990 DPELRGVMDGTVLHSGARDDMCFG-DDFFNLFCVPHVMPSATSPDAPAAGMPTCKLPTSA 1048
Query: 904 EYHLGEFVNRFRHGSL--------VMRLPDSDVGQI--PTVIFGTVNGVIGVIASLPHEQ 953
+YH+G+ + + GS V+ +P + + P ++FGT +G G I + E
Sbjct: 1049 QYHVGDLITAMQQGSFAPCSLSNGVVPVPKVLIPGVCGPQIVFGTTHGSFGTITPVTSET 1108
Query: 954 YLFLEKLQTNLRKVIKGVGGLNHEQWRSF--NNEKKTVD--------------------- 990
YL L+ ++ + V+ +GG +H +R N+++ V
Sbjct: 1109 YLLLKGIEVAVAAVVPALGGFSHAAYREVLCANQERGVSRNASFEAVNPRAMEVLRERRL 1168
Query: 991 ---AKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELC 1026
A+ GDL+E FL LS + + + ++ C
Sbjct: 1169 KYVARCVCSGDLVEMFLTLSPDTQLRVIEEAAMHIQRWC 1207
>gi|340371789|ref|XP_003384427.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Amphimedon queenslandica]
Length = 1408
Score = 106 bits (264), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 130/575 (22%), Positives = 235/575 (40%), Gaps = 82/575 (14%)
Query: 535 LCAFEGISYLLCALGDGHL-LNF--------LLNMKTGELTDRKKVSLGTQPITLRTFSS 585
L +E Y A+ GHL L F L + + G+L + + P LR FS+
Sbjct: 838 LVIYEAFQYT-SAIHPGHLKLRFSKVQHNVILQDKRVGKLAKHFQQQEFSFPPHLRKFSN 896
Query: 586 -KNTTHVFAASDRPTVIYSSNKKLLYSNVNLKE--VSHMCPFNSAAFPDS-LAIAKEGEL 641
+ VF P I+ + + L + + V PF++ P L KE EL
Sbjct: 897 IAGYSGVFVCGPYPHWIFMAARGHLSIHPMYIDGPVQSFAPFDNVNCPSGFLYFNKESEL 956
Query: 642 TIGTIDDI----QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCA----------- 686
I + +R +PL P + + +S+ I + Q
Sbjct: 957 RISVLPTQLSYDSYWPVRKVPLKATPHFVGYHMESKVHVIIASTPQPVTVIPDPNGETED 1016
Query: 687 -------------EESEMHFVRLLDDQTFEFI--STYPLDTFEYGCSILSCSFSDDSNV- 730
+ E ++++LL ++E I S Y ++ + + +
Sbjct: 1017 ALETVERDGRFVYSQEETYYLQLLSPTSWETIPHSKYEMEAHYHVTDMKVMRLRSQETLS 1076
Query: 731 ----YYCVGTAYVLPEENEPTKGRILVF-----IVEDGK------LQLIAEKETKGAVYS 775
Y VGT EE KG++L+F I E GK L+ + ++E K V
Sbjct: 1077 GRKEYIVVGTMATFGEELS-AKGKVLIFDVSVVIPEPGKPFSQYRLKNLYDQEQKWPVTG 1135
Query: 776 LNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMK 835
L NG +L A+ QKI + W +D+ ++L + Q+ FI+ GD+ +
Sbjct: 1136 LECVNGLILTAMGQKI--FMWQFKDN--KDLLAVAFIDAETYIHTAQSIKGFILTGDVTR 1191
Query: 836 SISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG---AENNFNLFTVRKNSEGAT 892
SI LL Y + ++ ++D N + + + D LG ++++ N+ + E
Sbjct: 1192 SIQLLHYNEDRRSLSLISQDPNPMEVFSTTFMIDGKALGFLVSDSDRNITLFQYQPENPA 1251
Query: 893 DEERGRLEVVGEYHLGEFVNRFRH----------GSLVMRLPDSDVGQIPTVIFGTVNGV 942
L G+ H+G VN F + S M++ +D Q FGT++G
Sbjct: 1252 SSGGANLVRCGDIHVGSLVNVFLNIRCKTSAGLGASREMKIALADKRQ--CTFFGTLDGG 1309
Query: 943 IGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVD--AKNFLDGDLI 1000
IG + +P + Y L LQ + + ++ + GLN + +R+F + + +N LDG L+
Sbjct: 1310 IGCLLPIPEKVYRRLSMLQVKMTQGMRHMAGLNPKAFRTFQTRHQYLHNAQRNILDGTLL 1369
Query: 1001 ESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRL 1035
+L L+ + SK + +V ++ + ++E+ ++
Sbjct: 1370 YQYLSLTAKEKFDFSKQIGTTVAQIMEDLKEIDKV 1404
Score = 50.8 bits (120), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 54/257 (21%), Positives = 103/257 (40%), Gaps = 42/257 (16%)
Query: 2 LDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGR 61
L+ +G I +L+ R +D L ++ K +++++ E++ L T +M D R
Sbjct: 59 LEFSFHGNIQSLDKVRLRHSDRDCLLLSFNDAKLSIVEYNPETNGLKTVSMHQFEDEEIR 118
Query: 62 ----PTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFD---------------NKGQLKEAF 102
D+ + +DP+ R + L+ V PF + + +
Sbjct: 119 GGILHNDSRPVVKVDPEGRCAVMLLFGSHLAVCPFQQDLSIDTPLSPSPSLDTHDILPTY 178
Query: 103 NIRLEELQ-----VLDIKFLYGCAKPTIVVLYQDN----------KDARHVKTYEVALKD 147
I L +L + D+ F+ G PT++ L + + +D+ + + D
Sbjct: 179 TISLRDLPEPLPVIKDMTFIEGYTSPTLLFLSEVSPTWAGRISLRQDSMMLLGLSLNTSD 238
Query: 148 KDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYG 207
K W+ NL + L PVP PL GVL+ G T++Y + ++ P +
Sbjct: 239 KSHTVI-WTLKNLPFDSSYLHPVPKPLGGVLVFGANTLIYLNQSS-------PPYGLSLN 290
Query: 208 RVDADGSRYLLGDHAGL 224
+ +R+LL + L
Sbjct: 291 SITDYTTRFLLKNEGSL 307
>gi|17861814|gb|AAL39384.1| GM01240p [Drosophila melanogaster]
gi|220943184|gb|ACL84135.1| CG13900-PB [synthetic construct]
Length = 688
Score = 105 bits (263), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 146/702 (20%), Positives = 271/702 (38%), Gaps = 125/702 (17%)
Query: 434 PPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTE-VKHAQLEYEISCLDINPIGENP 492
P S + ++L T +++ L+ G+ L E + +++ EI C+ + + E
Sbjct: 3 PAVLSFKINQKPYPRLLNFTTNEYILCLQTGE--LNEYTERSEMPAEIMCMALGTVPEGE 60
Query: 493 SYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFE------------- 539
S AVG+ D +VRI SL N +T + P L C E
Sbjct: 61 QRSWFLAVGL-ADNTVRILSLDPNNCLTPCSMQALPSPAESL-CLVEMGHTESTTQGGLD 118
Query: 540 ------------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKN 587
G YL L +G LL +L+ +G+L D + LG++P+ L +
Sbjct: 119 DDAPAQRSGNNKGTIYLNIGLSNGVLLRTVLDPVSGDLADTRTRYLGSRPVKLFRIKMQG 178
Query: 588 TTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTID 647
+ V A S R + Y + + ++ + + + F+S + + L I ++
Sbjct: 179 SEAVLAMSSRTWLSYYHQNRFHLTPLSYETLEYASGFSSEQCSEGIVAISTNTLRILALE 238
Query: 648 DIQKLHIR-SIPLGEHPRR-ICHQEQSR-----------TFAICSLKNQSCAEE------ 688
+ + + + PL PR + H + R T S + + AEE
Sbjct: 239 KLGAVFNQVAFPLQYTPRTFVIHPDTGRMLIAETDHNAYTEDTKSARKEQMAEEMRSAAG 298
Query: 689 -------SEM-------------------------HFVRLLDDQTFEFISTYPLDTFE-- 714
EM +R LD + + + PL E
Sbjct: 299 DEERELAREMANAFINEVLPEDVFSSPKAGLGLWASQIRCLDAMHGQTMFSVPLTQNEAI 358
Query: 715 YGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVED---GKLQLIAEKETKG 771
++L S + D Y VG A L ++G + D L+ + +
Sbjct: 359 MSMAMLKFSIAADGRYYLAVGIAKDLQLNPRISQGGCIDIYKIDPTCSSLEFMHRTDIDE 418
Query: 772 AVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSEC-GHHGHILALYVQTRGDFIVV 830
+L F G+LLA + +++Y D G +++ +C H + +Q G + V
Sbjct: 419 IPGALCGFQGRLLAGCGRMLRIY-----DFGKKKMLRKCENKHIPYQIVNIQAMGHRVYV 473
Query: 831 GDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEG 890
D+ +S+ + Y+ E + A D + W++A +LD D + + F ++++
Sbjct: 474 SDVQESVFFIRYRRAENQLIIFADDTHPRWVTATTLLDYDT-IAIADKFGNLSIQRLPHS 532
Query: 891 ATDE------------ERGRL-------EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQI 931
TD+ +RG L E + +H+GE + + +L+ G
Sbjct: 533 VTDDVDEDPTGTKSLWDRGLLSGASQKSENICSFHVGEIIMSLQKATLI-------PGGS 585
Query: 932 PTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVD 990
+I+ T++G +G E Y F + L+ ++R + G +H +RS
Sbjct: 586 EALIYATLSGTVGAFVPFTSREDYDFFQHLEMHMRNENPPLCGRDHLSYRS-----SYYP 640
Query: 991 AKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
KN LDGDL E +L + + I+ M + ++CK++E++
Sbjct: 641 VKNVLDGDLCEQYLSIEAAKQKSIAGDMFRTPNQICKKLEDI 682
>gi|354547430|emb|CCE44165.1| hypothetical protein CPAR2_503890 [Candida parapsilosis]
Length = 1171
Score = 105 bits (263), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 217/981 (22%), Positives = 385/981 (39%), Gaps = 156/981 (15%)
Query: 162 NGADLLIPVPPPLCGVLIIGEETIVY---CSANAFKAIPIRPS--------ITKAYGRVD 210
N A+ L+PVP + GVL+ GE I+Y S + +P R S +T +
Sbjct: 231 NDANYLVPVPGHIGGVLVCGENWIIYDKLGSESIALPLPRRKSQSSVIVTHVTHVLKK-K 289
Query: 211 ADGSRYLLGDHAGLLHLLVITHE--KEKVTGLKIELLGETSIASTISYLDNAVVYIGSSY 268
+ G LL + G L LVI ++ +E V ++I + ++S N + + +
Sbjct: 290 SYGFFILLQNEFGDLFRLVIDYDYDRELVKDIEISYFDTIPVCYSLSIFKNGLCF-ANCV 348
Query: 269 GDSQL---------------------IKLNLQPDAKGSYVE----------VLERYVNLG 297
SQL I+++ P K VE +++ +L
Sbjct: 349 NRSQLLYQFEKLGEEIGEEGVGINRRIQMDSIPLTKEKVVEFKLKGLDNLALIDVVESLS 408
Query: 298 PIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINE--QASVELQGIKGMWSLRS 355
PI D +++ +VT S K L+ + +G ++S ++ +++ ++
Sbjct: 409 PITDSTLIN------DTLVTLSTKSK---LKSIVHGTPTTTLVESSFPIKPT-NVFTAKT 458
Query: 356 STDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSG 415
S D D +LV++ + L +++ + +EE F T+ + +VQV S
Sbjct: 459 SADAMDDEYLVITSTLAFKTLVLSIGEVIEEVNDSKFVMDQPTVAVQQVGKSSIVQVYSN 518
Query: 416 SVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQ 475
+R ++ + W P G ++ AT N QVLL LVY E D +++ Q
Sbjct: 519 GLRHINGKKK--VTNWYPPAGITITHATTNNQQVLLGMSNLELVYFET-DPEDDQLREYQ 575
Query: 476 LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLL 535
E+S I + S A VG +D ++ + SL N + + L + L+
Sbjct: 576 DRLEVSS-PIRSMCIPKEQSSFAVVGC-SDETISVVSLQSQNCLQVKSLQALSSSANSLV 633
Query: 536 CAFEGISYLLCALG--DGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFS-SKNTTHVF 592
S L +G +G + ++ G+L+D + LG +P+TL S + T V
Sbjct: 634 MLTHNPSTTLIHIGMDNGVYVRTKIDTFNGKLSDTRIKYLGPKPVTLNELKLSDDITGVL 693
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNS---------AAFPDSLAIAKEGELTI 643
A S +P + Y K + + ++ + F S + D+L I G+
Sbjct: 694 AISTKPWIGYFYQGKYRCTPLLDIDIINGASFKSEDIGGEGIVGIYDDNLVIFSVGKED- 752
Query: 644 GTIDDIQKLHIRSIPLGEHPRRICH-QEQSRTFAICSLKNQSCAEESEM----HFVRLLD 698
D Q+L + + L PR+I + + F A E EM +V L
Sbjct: 753 SIFDPNQELTVTELNLRYTPRKIVKGGDDDKLF----------ASEVEMGINTPYVSNLT 802
Query: 699 DQTFEFIST--YPLDTFEYGCSILSC-------------SFSDDSNVYYCV-----GTAY 738
+ Y FE G SC F D + V Y
Sbjct: 803 KDVENGVDPEYYEAFGFERGPGCASCVQLIENGEIKQSLEFVKDQRIVDMVKIQFNKNQY 862
Query: 739 VLPEENEPTKGRILVFIVEDGK-LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLY--- 794
++ + + + F ++ + LQ I + + K + F +LL A I LY
Sbjct: 863 LIVGVTKQEENLLYTFKIDKKRNLQYIHKTQLKYVPQVMEVFQNRLLVASGNAISLYELG 922
Query: 795 -KWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
+ +LR TR I+ + Q R D I++ D SI + +EE A
Sbjct: 923 QRQLLRKSLTR-----IDFVQSIVKVSPQPR-DRILLADSANSIVFAKFDNEENQFIPVA 976
Query: 854 RDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERG---------------- 897
D ++A + LD D +G + N+F R + E + ++
Sbjct: 977 DDVVKRNITAWKQLDYDTIIGGDKFGNIFVTRLDREESKQVDQDWTVLKQAAQNSSNLNS 1036
Query: 898 ---RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQY 954
+L+ + +Y++ + + F+ GS ++G +I+ V G IGV+ L +
Sbjct: 1037 CVFKLQNLCQYYIPDIITSFQLGSF-------NLGGEECIIYTGVMGTIGVLIPLLSKSE 1089
Query: 955 LFLE---KLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRM 1011
+ L +LQ + V G NH + RS+ N + KN DGD +E + +
Sbjct: 1090 IELVHDLQLQIGIWNDGVNVAGKNHGKLRSYYNPR-----KNVYDGDFLELYFAIPLDVK 1144
Query: 1012 DEISKTMNVSVEELCKRVEEL 1032
+I+K +N SV E+ K++ ++
Sbjct: 1145 VKIAKKLNKSVGEIEKKLNDI 1165
>gi|427795803|gb|JAA63353.1| Putative mrna cleavage and polyadenylation factor ii complex subunit
cft1 cpsf subunit, partial [Rhipicephalus pulchellus]
Length = 726
Score = 105 bits (263), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 127/507 (25%), Positives = 211/507 (41%), Gaps = 76/507 (14%)
Query: 591 VFAASDRPTVIYSSNKKLLYSNVNLKEVSHMC--PFNSAAFPDS-LAIAKEGELTIGTID 647
VF RP ++ S++ L + + C PF++ P L K+GEL I T+
Sbjct: 224 VFLCGYRPYWLFMSSRGELRCHPMFVDGPIHCFAPFHNVNCPKGFLHFNKQGELRISTLP 283
Query: 648 DI----QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSC-------AEESEMHFVR- 695
+R +PL P + + S+T+ + + + C EE E +
Sbjct: 284 THLTYDAPWPVRKVPLRCTPHFVNYHVDSKTYCVVTSQPDPCNHLVRFTGEEKEYELLER 343
Query: 696 -------LLDDQTFEFIS-----TYP-----LDTFEYGCSILSCSFSDDSNV-----YYC 733
+D + + +S T P LD +E+ + + S + Y
Sbjct: 344 DSRYIFPTMDKFSLQLLSPVSWETIPNTRVDLDEWEHLTCLKNVMLSSEGTTTGMKGYLA 403
Query: 734 VGTAYVLPEE--------------NEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAF 779
+GT Y E+ P G+ L K++++ KE KG V +L+
Sbjct: 404 LGTNYCYGEDVTSRGRIIILDIIDVVPEPGQPLT----KNKIKIVYSKEQKGPVTALSQV 459
Query: 780 NGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISL 839
G LL+AI QKI Y W L+D+ EL I V T + I+VGD+ KS+SL
Sbjct: 460 VGFLLSAIGQKI--YIWQLKDN---ELVGVAFIDTQIYIHSVVTVKNLILVGDVFKSVSL 514
Query: 840 LIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG-----AENNFNLFTVRKNSEGATDE 894
L Y+ + +RD + AVE D+ + AE N L+ + S +
Sbjct: 515 LRYQEASRTLSLVSRDVRPLEVYAVEFFIDNTQMSFLVTDAERNLLLYMYQPESRESCGG 574
Query: 895 ERGRLEVVGEYHLGE-----FVNRFRHGSLVM--RLPDSDVGQIPTVIFGTVNGVIGVIA 947
+R L G++H+G F + R G + R S V + T++G + +
Sbjct: 575 QR--LLRRGDFHVGSPVVSMFRIKCRMGDIAKYDRRAASIVDGRHITMMATLDGSLAYVL 632
Query: 948 SLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTV--DAKNFLDGDLIESFLD 1005
+P + Y L LQ L I GLN + +R + ++++ + KN LDG+LI F+
Sbjct: 633 PVPEKTYRRLLMLQNVLVTNIPHYAGLNPKAYRMYYSQRRFLGNPHKNILDGELIWKFMH 692
Query: 1006 LSRTRMDEISKTMNVSVEELCKRVEEL 1032
LS E+SK + +V ++ + E+
Sbjct: 693 LSFMERSELSKKIGTTVTQITDDLLEI 719
>gi|72390667|ref|XP_845628.1| damage-specific DNA binding protein [Trypanosoma brucei TREU927]
gi|62359843|gb|AAX80271.1| damage-specific DNA binding protein, putative [Trypanosoma brucei]
gi|70802163|gb|AAZ12069.1| damage-specific DNA binding protein, putative [Trypanosoma brucei
brucei strain 927/4 GUTat10.1]
Length = 1270
Score = 105 bits (263), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 178/800 (22%), Positives = 313/800 (39%), Gaps = 127/800 (15%)
Query: 295 NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLR 354
N GP+ D V + V+ +G +G L VR+ + + + + + G++ + +
Sbjct: 426 NCGPVFDVTVA--ADGSRTSVIAGTGVGLNGGLSFVRSAVSVRQD--IRVTGLQNVRQIS 481
Query: 355 SSTDDPFDTFLVVSFISETRILAMNLEDE--LEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
S D ++ S +R+ + + LEE F ++ L ++ N +QV
Sbjct: 482 VSEDT-----IIFSLPGYSRVCRYCVGETMVLEEIHNTFFGTEETLLLEYNNERNVFLQV 536
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSV-----NVATANASQVLLATGGGHLVYLEIGDGI 467
T+ LRN W Y + ++ A+AS+ LL +Y+
Sbjct: 537 TTAG----------LRNVWPDRGEYVIRVINNDIGHAHASEGLLVFSNPANLYV-FCMKT 585
Query: 468 LTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGE 527
LT V L ISCL + S VG W +V ++ L D + +K L
Sbjct: 586 LTPVATLCLADAISCLLV-------FSSSSLLVGTWGSCAVHLYELHDGMIQSKVILQCS 638
Query: 528 IIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKT--GELTDRKKVSLGTQPITLRTFSS 585
P S+ + G LL L +G++ + ++ GE+ + L TQP+ R F+
Sbjct: 639 ATPCSMCVVFHSGGHRLLVGLHNGYVADVPISGANVCGEMVETM---LTTQPV--RLFNL 693
Query: 586 KNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGE--LTI 643
++ V + P ++ +N + ++ ++VS P + E L +
Sbjct: 694 ESHAAVLCLGEIPLILIVTNTGFQLTGIDFRDVSACAIMPKMRSPSRYIFFSKSECALIL 753
Query: 644 GTIDDIQKLH------------IRSIP----LGEHPRRICHQEQSRTFAICSLKNQSCAE 687
G+I +QKL+ ++ +P L RRI ++Q + + N S
Sbjct: 754 GSITSVQKLNTDYVGLKATATCVKYMPWWNVLVLSIRRI-EKDQVVSTMGHEVSNLSVLL 812
Query: 688 ESEMHFVRLLDDQTFEFISTYPL-DTFEYGCSILSCS-FSDDSNVYYCVGTAYVLPEENE 745
+ F LL+++ FI L E+G SC + D+ V +GT +V P+E
Sbjct: 813 DEPNSF-ELLENERCAFIDCVALGQANEWG----SCGEVASDAGVVL-IGTTFVFPDEQL 866
Query: 746 PTKGRILVFIVEDGKLQ-------LIAEKETKGAVYS---LNAFNGKLLAAINQKIQLYK 795
R + VE KL+ L K+ +GA+ + + G++ I + LY
Sbjct: 867 SRSSRFMWCTVEVAKLRTEKTLLRLQGSKDVEGALQCCCIVPNYAGRVALGIGGCVVLYS 926
Query: 796 WMLRDDG-TRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERAR 854
W D E + G L +Q +IV D S + +G++ AR
Sbjct: 927 WNAADATFVAEETIQIGTLIVRLIPVMQKEVSYIVASDARHSCFFVRIDTIQGSLNIVAR 986
Query: 855 DYNANWMSAVEILD----DDIYLGAENNFNLFTVR-----KNSEG--ATDEERGRLEVVG 903
D + IL D+ LG ++ FN F V NS G A +L+
Sbjct: 987 DPELRGVMDCAILQYESRHDVCLG-DDLFNFFCVSHVEPLANSSGVSAPAMPTKKLQTTA 1045
Query: 904 EYHLGEFVNRFRHGSL--------VMRLPDSDVGQI--PTVIFGTVNGVIGVIASLPHEQ 953
+YH+G+ + GS V+ +P + V + P +++GT +G G I + E
Sbjct: 1046 QYHMGDLITVMHQGSFAPCSVLNDVVPIPATLVRGVCGPQIVYGTSHGAFGAITPISSET 1105
Query: 954 YLFLEKLQTNLRKVIKGVGGLNHEQWRS--------------------------FNNEKK 987
++ L+ L+ ++ V+ +GG H +R FN ++
Sbjct: 1106 FILLKGLEVSVASVVPPLGGFTHASFREVLRVGQERGASRNASFQVTNPQATELFNRRRR 1165
Query: 988 TVDAKNFLDGDLIESFLDLS 1007
+ GD++E FL LS
Sbjct: 1166 RCVPRGVCCGDVVEMFLTLS 1185
>gi|388579257|gb|EIM19583.1| hypothetical protein WALSEDRAFT_33953 [Wallemia sebi CBS 633.66]
Length = 986
Score = 105 bits (263), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 88/367 (23%), Positives = 172/367 (46%), Gaps = 20/367 (5%)
Query: 544 LLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYS 603
+L DG L F + + D + V++G+ P+ L + N +FA D P+++
Sbjct: 549 VLAGSADGSLYAFRFDEHLKSV-DIQTVAIGSTPVCL---TRSNDGLIFALCDVPSIVTL 604
Query: 604 SNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLG-EH 662
N +L YS++N+ ++ + + + + + + + + ++HI SI +G +
Sbjct: 605 DNTRLRYSSININYINGLTSYKTNDMVNYVFVQNDQLKFSRILSTENRVHIHSIEMGADV 664
Query: 663 PRRICHQEQSRTFAICSLKNQSCAEES---EMHFVRLLDDQTFEFISTYPLDTFEYGCSI 719
PR++ ++E +A+ ++N ++ + V+LLD+ +E ++ ++ E +
Sbjct: 665 PRQVAYKEDR--YAVGCVRNAYRSDRTLYESSSCVKLLDNN-YEQLAQMEMEKDEIVSVV 721
Query: 720 LSCSFSDDSNVYYCVGTAYVLPEEN--EPTKGRILVFIVEDGKLQLIAEKETKGAVYSLN 777
S S ++ + VGT Y E E TKGR ++ +V+D K + + G VY++
Sbjct: 722 ESLSIANME--VFVVGTYYNNETEGTEEATKGRFIILLVKDDKFIIASSFLVPGCVYAVC 779
Query: 778 AFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSI 837
+ KL A+N ++++Y D T +++ + + + + + G +VV D +KS
Sbjct: 780 GIDQKLAVAVNYQVRVYDIESIRDDTYKMRFIASYGNAFVVVSLTSVGKILVVADFLKSA 839
Query: 838 SLLIYKHEEGAIEERARDYNANWMSAVEILDDD-----IYLGAENNFNLFTVRKNSEGAT 892
L E GA+ + D W S V LDD LGA+ F+LF + + S G T
Sbjct: 840 IYLQLDTERGALTQVGYDTAQRWSSLVVALDDGNEETFTTLGADIRFHLFALDRTSSGIT 899
Query: 893 DEERGRL 899
+L
Sbjct: 900 SRTLAQL 906
Score = 79.3 bits (194), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 101/408 (24%), Positives = 181/408 (44%), Gaps = 63/408 (15%)
Query: 2 LDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGR 61
+D I+ +L L H +AQ L I + Y+ V++ E ++++ G+
Sbjct: 20 IDKIIFQTHESLLLTLDHPQAQ--LVILSLSYENGVIKHIVECTKMLVETTGE------- 70
Query: 62 PTDNGQIGIIDPDCRLIGLHLYDG---LFKVIPFDN--KGQLKEAFNIRLEELQVLDIKF 116
P+ I+D ++ HL+ G FK + +D+ K + + N +++ VL + F
Sbjct: 71 PSYEYCNSIVDKSTQIGVSHLWQGQLHAFK-LSYDDRRKTHIIDGRNSQIDHSVVLSMAF 129
Query: 117 LYGCA--KPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDN---GADLLIPVP 171
L KPT+ L Q + +E + +D+V+ + ++ A ++ V
Sbjct: 130 LATDKDEKPTLCRLVQSADVDNPLLVFEHLICKEDYVDISQTVLRIETQCPSAQKIVAVE 189
Query: 172 PPLCGVLIIGEETIVY--CSANAFKAIPIRPSIT---------------------KAYGR 208
VL+IG Y K + + S T + Y
Sbjct: 190 GKKRAVLVIGAFGCEYYEIPKKDLKGVRRKSSTTVTSPQVVDMEHLSVKSPMAAVRGYTA 249
Query: 209 VDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSY 268
V+ + +L+GD G ++ + I++ L+I L+G +S+AST+ +L + +++GS
Sbjct: 250 VNDSCTAWLIGDEKGDIYYISISN------FLEITLVGNSSVASTLQHLGSGFLFLGSQN 303
Query: 269 GDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLR 328
DS+L+ + P + LE Y NL P+ D + + QGQ+V CSG+ K G LR
Sbjct: 304 EDSKLLVVQTNP----VRIVELENYTNLAPVSDMALTHPDGI-QGQLVVCSGSNKTGKLR 358
Query: 329 IVRNGIGINEQASVEL-QGIKGMWSLRSSTDDPFDTFLVVSFISETRI 375
+V GIG+ + V L I ++ L S L+VS++S T+I
Sbjct: 359 VVTTGIGLCDIYQVGLGDSISNVFILGS--------HLLVSYLSTTKI 398
>gi|448528339|ref|XP_003869702.1| hypothetical protein CORT_0D07360 [Candida orthopsilosis Co 90-125]
gi|380354055|emb|CCG23569.1| hypothetical protein CORT_0D07360 [Candida orthopsilosis]
Length = 1170
Score = 105 bits (262), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 218/991 (21%), Positives = 387/991 (39%), Gaps = 177/991 (17%)
Query: 162 NGADLLIPVPPPLCGVLIIGEETIVYCS-ANAFKAIP----------IRPSITKAYGRVD 210
N A+ L PVP + GVL+ G I+Y N +P I +T A + +
Sbjct: 231 NDANYLAPVPGHIGGVLVCGANWIMYDKLGNESILLPLLRRKDQTSVIISHVTHALKKKN 290
Query: 211 ADGSRYLLGDHAGLLHLLVITHE--KEKVTGLKIELLGETSIASTISYLDNAVVYIGSSY 268
G LL + G L L+I ++ +E + ++I + ++ N + + +
Sbjct: 291 Y-GFFILLQNDLGDLFRLIIDYDSNRELIKDIEITYFDTIPVCYNLNIFKNGLCF-ANCI 348
Query: 269 GDSQL---------------IKLN--LQPD----AKGSYVE----------VLERYVNLG 297
SQL I++N +Q D K Y E +++ +L
Sbjct: 349 NRSQLLYQFEKLGEEISEEDIRINKTVQMDNIQLTKEKYFEFKLKGLDNLALIDVVESLS 408
Query: 298 PIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIK--GMWSLRS 355
PI D + D +VT S K L+ + +G +L IK +++ ++
Sbjct: 409 PITDSILND------DTLVTLSTKSK---LKTIVHGTPTTTLVESQLP-IKPTNIFTTKT 458
Query: 356 STDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSG 415
S + D +LV++ + L ++L + +EE F T+ + +VQ+ S
Sbjct: 459 SANAVDDEYLVITSTLSFKTLVLSLGEVIEEVNDSEFVLDQPTVAVQQVGKSSIVQIYSN 518
Query: 416 SVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQ 475
+R ++ + W P G +V AT N QV L LVY E D +++ Q
Sbjct: 519 GLRHINGNKK--VTNWYPPAGITVTHATTNNQQVFLGLSNLELVYFET-DPDDDQLREYQ 575
Query: 476 LEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGG--EIIPRSV 533
E+S I + + S A VG +D ++ + SL N + + L +
Sbjct: 576 DRLEVSS-PIRSMCISKEQSSFAVVGC-SDETISVISLQRQNCLQVKSLQALSSSANSLI 633
Query: 534 LLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFS-SKNTTHVF 592
+L G + + + +G + ++ G+L+D + LG +P+TL S + V
Sbjct: 634 MLTHKPGTTLIHIGMDNGVYVRTKIDTFNGKLSDTRIKYLGPKPVTLNELKLSDDIIGVL 693
Query: 593 AASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGE----LTIGTIDD 648
A S +P + Y K + + ++ + F S + G+ ++G D
Sbjct: 694 AISSKPWIGYFHQGKYRCTPLLDIDIINGAFFKSEDIGGDGIVGIHGDNLVIFSVGKEDS 753
Query: 649 I----QKLHIRSIPLGEHPRRICHQEQSRTF----------------------------- 675
+ Q+ + + L PR+I + + F
Sbjct: 754 LFDPNQEFTVTELNLRYTPRKIVKGDDGKLFVSEVELGIKTPYLSNLTKEVEDTVDPEYY 813
Query: 676 -AICSLKNQSCAEESEMHFVRLLDD----QTFEFISTYPLDTFEYGCSILSCSFSDDSNV 730
A ++ CA V+L+ D QT EF+ + ++ F + N+
Sbjct: 814 EAFGYERDSGCAS-----CVQLVQDGEIKQTLEFVKNQRI------VDMIKIQF--NKNL 860
Query: 731 YYCVGTAYVLPEENEPTKGRILVFIVEDGK-LQLIAEKETKGAVYSLNAFNGKLLAAINQ 789
Y VG E+N + F ++ K LQ I + E K ++ F +LL A
Sbjct: 861 YLVVGVTK--HEDN-----LLYTFKIDKKKNLQYIHKTELKYVPQTMEVFQDRLLVASGN 913
Query: 790 KIQLY----KWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHE 845
I LY + +LR TR I+ + Q R D I++ D SI + E
Sbjct: 914 SISLYELGQRQLLRKSLTR-----IDFIQTIVKVTPQPR-DRILLADSANSIVFAKFDQE 967
Query: 846 EGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERG-------- 897
E A D ++A + LD D +G + N+F R + E + ++
Sbjct: 968 ENQFVSMADDTVKRNITAWKQLDYDTVIGGDKFGNIFVSRLDREESKQIDQNWTVLKQAA 1027
Query: 898 -----------RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVI 946
+L+ + EY++ + + F+ GS ++G +I+ + G IG++
Sbjct: 1028 KNSPNLNSCVYKLQNLCEYYIPDIITSFQLGSF-------NLGGEECIIYTGLTGTIGIL 1080
Query: 947 ASLPHEQYL-FLEKLQTNLR----KVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIE 1001
L + + L LQ + KV V G NH + RS+ N AKN DGD +E
Sbjct: 1081 LPLISKSEIELLHDLQLEISAYNDKV--NVAGKNHAKLRSYYNP-----AKNIFDGDFLE 1133
Query: 1002 SFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
+L+L +I+K +N SV E+ K++ ++
Sbjct: 1134 LYLNLPLDEKLKIAKRLNKSVGEVEKKLNDI 1164
>gi|167395195|ref|XP_001741266.1| pre-mRNA-splicing factor rse-1 [Entamoeba dispar SAW760]
gi|165894222|gb|EDR22283.1| pre-mRNA-splicing factor rse-1, putative [Entamoeba dispar SAW760]
Length = 1145
Score = 105 bits (262), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 230/1134 (20%), Positives = 438/1134 (38%), Gaps = 149/1134 (13%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+L ++ I ++ R EA+D++ I ++ +L + E ++LI ++ + G
Sbjct: 53 LLQESVFCMIRSICKIRIGSEAKDYILIGSDSGSITLLGYSFEQNKLIP-ICNEIFGKSG 111
Query: 61 -RPTDNGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVL---DI 114
R G+ DP R +IG L + DN G++ + + + + I
Sbjct: 112 IRRVVPGEYICSDPMGRAAMIGAIEKQKLVYIFNRDNNGKITISSPLEAHKSNTICYNII 171
Query: 115 KFLYGCAKPTIVVL-------YQDNKDARHVKTYEVALKDKDF--------VEGPWSQNN 159
G P + Y + + R +K Y V L + D V P
Sbjct: 172 ALNVGYDNPIFATIEVDYNEQYLSHIEHRIIKKY-VVLYELDLGLNHVIRKVAEP----- 225
Query: 160 LDNGADLLIPVPP--PLC--GVLIIGEETIVYCSANAFKA-IPI---------RPSITKA 205
+D ++ LIP+ C G+ I E+ I + + K IPI ++ +
Sbjct: 226 IDPTSNYLIPILSLNEECHYGIFICSEDKITWFNIGHDKVCIPIPKYHNQSNTHSTLITS 285
Query: 206 YGRVDADGSRYLL--GDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVY 263
+ G ++L + L LL T EK ++ + I IA ++ + +
Sbjct: 286 HVMRTHKGKHFVLLQSEFGDLFQLLFDTKEKGQLDNIHIRYFDSIPIALSLQFSKRGHLL 345
Query: 264 IGSSYGDSQL--------IKLNLQPDAKGSY--------VEVLERYVNLGPIVDFCVVD- 306
YGDS L I + L+ + + + +E R+ +L P++D V
Sbjct: 346 CIGEYGDSILYNIISMENINVPLEKNGRLEFERHEEILNLEEFYRFKSLAPLIDLKVAPA 405
Query: 307 LERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIK-GMWSLRSSTDDPFDTFL 365
Q ++ G +++I++N + + +EL I +W L+ TD+ + +L
Sbjct: 406 TSSQDTTKMYAFCGKGNQSTIKILKNQLNTLDAVEIELPAIPISVWPLKKETDE-YHQYL 464
Query: 366 VVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSR 425
+S+ + T +L + EDE+ E T +L +QV + + + S +
Sbjct: 465 AISYSNITTLLKIT-EDEMSECNTTPILLSTPSLLVSMLSDGTFLQVMTDRIIVYSEPIQ 523
Query: 426 ELRNEWKSPPGYSVNVATANASQVLLATGGGH---LVYLEIGDGILTEVKHAQLEYEISC 482
+ E + A+ N S+++++ + L+Y + G L ++ + +I+
Sbjct: 524 QFITENQK-----YVCASCNGSELIVSVEKNNQTALIYFQYQSGHLLTMERKENLAKITA 578
Query: 483 LDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPRSVLLCAFEGI 541
L ++ +PS Q A+G D SV + SL P+ + + SV F I
Sbjct: 579 LALDQF--HPS--QHCAIGC-VDGSVHLLSLIPNETTKALSRVSLQTYDCSVNSLTFNII 633
Query: 542 ---SYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRP 598
SYL L +G L + +GE+ + +G++P+TL V A S R
Sbjct: 634 DNKSYLFAGLSNGLLGRSTYDPISGEINESSLNFVGSRPVTLSNVKDCGEDSVLAISGRS 693
Query: 599 TVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDI-QKLHIRSI 657
+ Y + K+ + +N++ + C +++AI E + I TI+ L ++I
Sbjct: 694 LLSYKNGTKIKTTPLNIQNTTLACGILVPFVDNAIAIICEKVMKIITIESTSSSLTGKNI 753
Query: 658 PLGEHPRRICHQEQSRTFAICSLKNQSCAEESEM---------HFVRLLDDQTFEFISTY 708
L PR+I I N SC +E+ + +LD E I
Sbjct: 754 QLSYTPRKIITHPTIPLLYILEGDNNSCKVGNEIIQTNEGNWVGGIHILDASQDELIQFI 813
Query: 709 PLDTFEY---GCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGR---ILVFIVEDGKLQ 762
D ++ GC + S S + Y VG + + P + + I V+I+ + +
Sbjct: 814 DFDNNKHPTAGCVVRSIS---KNQTYLIVGV--IESYKTRPIQWKSSEIQVYIINEHSIN 868
Query: 763 LIAEKETKGAVYSLNAFNGKLLAAINQKIQLY----KWMLRDDGTRELQSECGHHGHILA 818
+ + V + F G +LA + ++LY K +L+ R+L S+
Sbjct: 869 YCYSTKVEYPVRAFAEFKGMVLAGVGNVLRLYDIGLKSLLKKAEKRQLASDVAQ------ 922
Query: 819 LYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENN 878
+ G+ I++ + +L+ Y + A D W+ L+ L ++
Sbjct: 923 --LHVIGETILLTGISDGFNLIRYNQINHKFDIYA-DSLPRWVVTAAPLNQSTVLASDKF 979
Query: 879 FNLF-----------------TVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVM 921
+F T+ + + + ++ ++ +G+ F SL+
Sbjct: 980 GEIFMYQLPKEIEEKALSPFSTLLQPHKTIYEGSNYKMVTATQFFIGDIATSFAQCSLIP 1039
Query: 922 RLPDSDVGQIPTVIFGTVNGVIGVIASLP---HEQYLFLEKLQTNLRKVIKGVGGLNHEQ 978
P IF N + G+ A +P F + L+ ++R + NH
Sbjct: 1040 GAPS---------IFLYSNFMGGLSALIPLQNQNDIDFYQHLEMHMRVHWTNLTDRNHIS 1090
Query: 979 WRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
+RS V K+ +DGDL E + L EI++ M V E+ K++ +L
Sbjct: 1091 FRS-----SMVPVKDTVDGDLCELYERLPYEIQQEIAEEMEKEVNEIIKKLHDL 1139
>gi|158287218|ref|XP_309311.4| AGAP011340-PA [Anopheles gambiae str. PEST]
gi|157019545|gb|EAA05261.4| AGAP011340-PA [Anopheles gambiae str. PEST]
Length = 1434
Score = 105 bits (262), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 117/484 (24%), Positives = 213/484 (44%), Gaps = 73/484 (15%)
Query: 609 LYSNVNLKEVSHMCPFNSAAFPDS-LAIAKEGELTIG------TIDDIQKLHIRSIPLGE 661
LY+ +K + PFN+ P+ L ++ EL I + D + +R IPL
Sbjct: 957 LYARTVMKAFA---PFNNVNCPNGFLYFDEQYELKISIFPTYLSYDSVWP--VRKIPLRS 1011
Query: 662 HPRRICHQEQSRTFAICSLKNQSC-------AEESEM----------------HFVRLLD 698
P++I + +++ + + + C E+ E+ V L+
Sbjct: 1012 SPKQIVYHRENKVYCVVMDAEEICNKYYRFNGEDKELTEENKGERFLYPMGHRFSVVLVT 1071
Query: 699 DQTFEFI--STYPLDTFEYGCSILSCSFSDDS-----NVYYCVGTAYVLPEENEPTKGRI 751
+E + ++ L+ +E+ ++ + S + + Y VGT + E+ ++GR+
Sbjct: 1072 PAAWEVVPETSINLEEWEHVIALKNVSLTYEGARSGLKEYIAVGTNFNY-SEDITSRGRL 1130
Query: 752 LVF-----IVEDGK------LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRD 800
L++ + E GK + + K+ KG V +++ G L+ A+ QK+ Y W ++D
Sbjct: 1131 LLYDIIEVVPEPGKPLTKHKFKEVIVKDQKGPVSAISHVCGFLVGAVGQKV--YLWQMKD 1188
Query: 801 DGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANW 860
D +L +I + + I+V D+ KS+SLL ++ E + +RDY+
Sbjct: 1189 D---DLVGVAFIDTNIFVHQMVSIKSLILVADVYKSVSLLRFQEEYRTLSVVSRDYHPLN 1245
Query: 861 MSAVEILDDDIYLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVN----- 912
+ VE + D+ LG +++ NL T E RL +YHLG+ VN
Sbjct: 1246 VFQVEYVVDNANLGFLVSDDQCNLITYMYQPESRESFGGQRLLRKSDYHLGQQVNCMFRV 1305
Query: 913 --RFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKG 970
F ++ R + D T F T++G IG + LP + Y L LQ L
Sbjct: 1306 QCDFHETDVMKRTLNYDNKH--TTFFATLDGGIGFVLPLPEKTYRRLFMLQNVLLTHSPH 1363
Query: 971 VGGLNHEQWRSFNNEKKTV--DAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKR 1028
GLN + +R+ +K ++ +DGDL+ SFL+L E++K + +EE+C
Sbjct: 1364 TCGLNPKAYRTIKQTRKLPINPSRCVVDGDLVWSFLELPANEKHEVAKKIGTRIEEICAD 1423
Query: 1029 VEEL 1032
+ E+
Sbjct: 1424 LMEI 1427
Score = 77.8 bits (190), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 123/547 (22%), Positives = 217/547 (39%), Gaps = 131/547 (23%)
Query: 8 GRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDNGQ 67
G I +++ G +D L I+ K V+Q+D ++ +L T ++ D R G
Sbjct: 75 GNIKSMQSVSLAGSLRDALLISFPDAKLSVVQFDPDNFDLKTLSLHYFEDEDIRGGWTGH 134
Query: 68 IGI----IDPDCRLIGLHLYDGLFKVIPFDNKGQLKE----------------------- 100
I +DPD R + +Y V+PF L E
Sbjct: 135 YHIPMVRVDPDNRCAVMLVYGRKLVVLPFRKDSSLDEIELQDVKPIKKAPMQLVAKTPIL 194
Query: 101 -AFNIRLEEL-----QVLDIKFLYGCAKPTIVVLYQDNKD------ARHVKTYEVALK-- 146
++ I L++L V+DI+FL+G +PT+++LY+ + R VAL
Sbjct: 195 ASYIIELKDLDEKIDNVIDIQFLHGYYEPTLLILYEPVRTFPGRIAVRSDTCTMVALSLN 254
Query: 147 -DKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVY-------------CSANA 192
+ W+ N+L IP+ P+ G L++ +++Y SA+
Sbjct: 255 IQQRVHPVIWTVNSLPFDCIQAIPINKPIGGCLVMCVNSLIYLNQSVPPYGVSLNSSADH 314
Query: 193 FKAIPIRPSITKAYGRVDADGS--------RYLLGDHAGLLHLLVITHEKEK-VTGLKIE 243
+ P++P + R+ D + + +L G L++L + + + V
Sbjct: 315 STSFPLKP---QDGVRISLDAAQVCFIEPEKLVLSLKGGELYVLTLCADSMRSVRNFHFN 371
Query: 244 LLGETSIASTISYLDNAVVYIGSSYGDSQLIK-------LNLQPDAKG------------ 284
+ + S I ++ +++GS G+S L++ L + D G
Sbjct: 372 KAAASVLTSCICVCEDEYLFLGSRLGNSLLLRFKEKDESLVITIDDSGAVEKEPKRPRLE 431
Query: 285 ------------------SYV-EVLERYVNLGPIVDFCV------VDLERQGQGQ----- 314
SY+ EV + +N+GPI V D E Q Q
Sbjct: 432 EEELEVYGSGYKTSVQLTSYIFEVCDNVLNIGPIAHMAVGERVAEEDAENQPDVQIVQNK 491
Query: 315 ----VVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL------RSSTDDP-FDT 363
VVT SG K+G+L ++++ I S L G +W++ R + D P
Sbjct: 492 LDIEVVTSSGHGKNGALCVLQSSIKPQVITSFGLSGCVDVWTVFDEAVARRAEDGPSTHA 551
Query: 364 FLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQ-LVQVTSGSVRLVSS 422
F+++S E + + +E+ E E GF + T+ + N+ +VQVT+ S+RL+
Sbjct: 552 FMILS--QEGGTMVLQTGEEINEIENTGFATTVPTIHVGNIGTNRFIVQVTTKSIRLLQG 609
Query: 423 TSRELRN 429
T R L+N
Sbjct: 610 T-RLLQN 615
>gi|350413821|ref|XP_003490124.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Bombus impatiens]
Length = 1417
Score = 104 bits (260), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 127/525 (24%), Positives = 233/525 (44%), Gaps = 76/525 (14%)
Query: 575 TQPITLRTFSS-KNTTHVFAASDRPTVIYSSNKKLLYSNVNLKE--VSHMCPFNSAAFPD 631
T+ +R FS+ VF SD P I+ + + L ++ + V+ PFN+ P
Sbjct: 899 TRHCMMRYFSNIAGYNGVFICSDYPHWIFLTGRGELRTHPMGIDGPVTSFAPFNNINCPQ 958
Query: 632 S-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQSRTFAICS-----LK 681
L ++ EL I + +R +PL P + + +S+T+ + + LK
Sbjct: 959 GFLYFNRKEELRICVLPTHLSYDAPWPVRKVPLRCTPHFVTYHLESKTYCVITSIAEPLK 1018
Query: 682 N---------QSCAEESEMHFVRLLDDQ----TFEFIS--TYP-----LDTFEYGCSILS 721
+ + EE F+ +Q F +S T P LD +E+ + +
Sbjct: 1019 SYYRFNGEDKEFTEEERPERFIYPSQEQFSIVLFSPVSWETIPNTKIELDQWEHVTCLKN 1078
Query: 722 CSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-IVE----------DGKLQLIA 765
S + + Y +GT Y E+ ++GRIL+F I+E + + I
Sbjct: 1079 VSLAYEGTRSGLKGYIVLGTNYNYGEDIT-SRGRILIFDIIEVVPEPGQPLTKNRFKQIY 1137
Query: 766 EKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDD---GTRELQSECGHHGHILALYVQ 822
KE KG + ++ +G L++A+ QKI Y W L+D+ G + ++ H +L++
Sbjct: 1138 AKEQKGPITAITQVSGFLVSAVGQKI--YIWQLKDNDLVGVAFIDTQIYIH-QMLSI--- 1191
Query: 823 TRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG-----AEN 877
I++ D+ KSISLL ++ E + +RD+ + +E L D+ LG E+
Sbjct: 1192 --KSLILIADVYKSISLLRFQEEYRTLSLVSRDFRPAEVYTIEYLIDNTNLGFLVADGES 1249
Query: 878 NFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIP----- 932
N LF + S + + +L ++HLG+ VN F + P +D
Sbjct: 1250 NMALFMYQPESRESLGGQ--KLIRKADFHLGQKVNTFFRIKCRVSDPANDKKHFSGADKR 1307
Query: 933 -TVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTV-- 989
++ +++G +G I +P + Y L LQ L I + GLN + +R++ + +T
Sbjct: 1308 HVTMYASLDGSLGYILPVPEKTYRRLLMLQNVLVTHICHIAGLNPKAYRTYKSHIRTQGN 1367
Query: 990 DAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTR 1034
A+ +DGDL+ +L L +++K + V+E+ + + E+ R
Sbjct: 1368 PARGIIDGDLVWRYLYLPNNEKIDVAKKIGTRVQEIIEDLTEIDR 1412
Score = 53.9 bits (128), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 65/330 (19%), Positives = 139/330 (42%), Gaps = 61/330 (18%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRP--T 63
++G + +++ G +D L ++ K V+++D ++ +L T ++ + R T
Sbjct: 73 LHGNVMSMQAVTLVGSQRDSLLLSFRDAKLSVVEYDQDTHDLRTVSLHYFEEEEIRDGWT 132
Query: 64 DNGQIGII--DPDCRLIGLHLYDGLFKVIPF------------------DNKGQLKEAFN 103
++ I I+ DP+ R + +Y V+PF NK + ++
Sbjct: 133 NHHHIPIVRVDPEGRCAVMLIYGRKLVVLPFKKDPSLDDGDLLDNSKALSNKTPILSSYM 192
Query: 104 IRLEELQ-----VLDIKFLYGCAKPTIVVLYQDNKD------ARHVKTYEVALK---DKD 149
I L+ L+ ++D++FL+G +PT+++LY+ + R VA+ +
Sbjct: 193 IVLKSLEEKMDNIIDLQFLHGYYEPTLLILYEPVRTFSGRIAVRQDTCAMVAISLNIQQR 252
Query: 150 FVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAFKAI 196
WS +NL +PV PL G LI+ +++Y + A
Sbjct: 253 VHPIIWSVSNLPFDCYQAVPVKKPLGGTLIMAVNSLIYLNQSIPPYGVSLNSLAETSTNF 312
Query: 197 PIRPSITKAYGRVDADGS--------RYLLGDHAGLLHLL-VITHEKEKVTGLKIELLGE 247
P++P + ++ +GS R ++ +G L++L + V G +
Sbjct: 313 PLKP---QEGVKISLEGSQVAFISSDRLVISLKSGELYVLSLFADSMRSVRGFHFDKAAA 369
Query: 248 TSIASTISYLDNAVVYIGSSYGDSQLIKLN 277
+ + S + ++ +++GS G+S L++
Sbjct: 370 SVLTSCVCMCEDNYLFLGSRLGNSLLLRFT 399
Score = 43.1 bits (100), Expect = 0.85, Method: Compositional matrix adjust.
Identities = 45/172 (26%), Positives = 76/172 (44%), Gaps = 14/172 (8%)
Query: 314 QVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP---------FDTF 364
++VT SG K+G+L +++ I + EL G + MW++ + ++ F
Sbjct: 507 ELVTTSGYGKNGALCVLQRSIRPQVVTTFELPGCEDMWTVIGTLNNDEQIRPEAEGSHAF 566
Query: 365 LVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQ-LVQVTSGSVRLVSST 423
L++S T IL E+ E + GF +Q T+F + N+ +VQVT VRL+
Sbjct: 567 LILSQEDSTMILQTG--QEINEVDQSGFSTQGSTIFAGNLGANRYIVQVTQMGVRLLQGI 624
Query: 424 SRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQ 475
E G + A+ V L + G ++ L + +G T H Q
Sbjct: 625 --EQIQHMPIDLGCPIVHASCADPYVTLLSEDGQVMLLTLREGRGTAKLHVQ 674
>gi|110750698|ref|XP_624382.2| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
[Apis mellifera]
Length = 1415
Score = 104 bits (259), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 129/527 (24%), Positives = 236/527 (44%), Gaps = 80/527 (15%)
Query: 575 TQPITLRTFSS-KNTTHVFAASDRPTVIYSSNKKLLYSNVNLKE--VSHMCPFNSAAFPD 631
T+ +R FS+ VF SD P I+ + + L ++ + V+ PFN+ P
Sbjct: 897 TRHCMMRYFSNIAGYNGVFICSDYPHWIFLTGRGELRTHPMGIDGPVTSFAPFNNINCPQ 956
Query: 632 S-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQSRTFAICS-----LK 681
L ++ EL I + +R +PL P + + +S+T+ + + LK
Sbjct: 957 GFLYFNRKEELRICVLPTHLSYDAPWPVRKVPLRCTPHFVTYHLESKTYCVITSIAEPLK 1016
Query: 682 N---------QSCAEESEMHFVRLLDDQ----TFEFIS--TYP-----LDTFEYGCSILS 721
+ + EE F+ +Q F +S T P LD +E+ + +
Sbjct: 1017 SYYRFNGEDKEFTEEERPDRFIFPSQEQFSIVLFSPVSWETIPNTKIELDQWEHVTCLKN 1076
Query: 722 CSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-IVE----------DGKLQLIA 765
S + + Y +GT Y E+ ++GRIL+F I+E + + I
Sbjct: 1077 VSLAYEGTRSGLKGYIVLGTNYNYGEDIT-SRGRILIFDIIEVVPEPGQPLTKNRFKQIY 1135
Query: 766 EKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDD---GTRELQSECGHHGHILALYVQ 822
KE KG + ++ +G L++A+ QKI Y W L+D+ G + ++ H +L++
Sbjct: 1136 AKEQKGPITAITQVSGFLVSAVGQKI--YIWQLKDNDLVGVAFIDTQIYIH-QMLSI--- 1189
Query: 823 TRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG-----AEN 877
I++ D+ KSISLL ++ E + +RD+ + +E L D+ LG E+
Sbjct: 1190 --KSLILIADVYKSISLLRFQEEYRTLSLVSRDFRPAEVYTIEYLIDNTNLGFLVADGES 1247
Query: 878 NFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPD--------SDVG 929
N LF + S + + +L ++HLG+ VN F + R+ D SD
Sbjct: 1248 NIALFMYQPESRESLGGQ--KLIRKADFHLGQKVNTFFR--IRCRISDPANDKKHFSDAD 1303
Query: 930 QIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTV 989
+ ++ +++G +G I +P + Y L LQ L I + GLN + +R++ + +T
Sbjct: 1304 KRHVTMYASLDGNLGYILPVPEKTYRRLLMLQNVLVTHICHIAGLNPKAYRTYKSHIRTQ 1363
Query: 990 --DAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTR 1034
A+ +DGDL+ +L L +++K + V+E+ + + E+ R
Sbjct: 1364 GNPARGIIDGDLVWRYLYLPNNEKIDVAKKIGTRVQEIIEDLTEIDR 1410
Score = 52.8 bits (125), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 65/329 (19%), Positives = 139/329 (42%), Gaps = 61/329 (18%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRP--T 63
++G + +++ G +D L ++ K V+++D ++ +L T ++ + R T
Sbjct: 73 LHGNVMSMQAVTLVGSQRDSLLLSFRDAKLSVVEYDQDTHDLRTVSLHYFEEEEIRDGWT 132
Query: 64 DNGQIGII--DPDCRLIGLHLYDGLFKVIPF------------------DNKGQLKEAFN 103
++ I I+ DP+ R + +Y V+PF NK + ++
Sbjct: 133 NHHHIPIVRVDPEGRCAVMLIYGRKLVVLPFKKDPSLDDGDLLDNSKASSNKTPILSSYM 192
Query: 104 IRLEELQ-----VLDIKFLYGCAKPTIVVLYQDNKD------ARHVKTYEVALK---DKD 149
I L+ L+ ++D++FL+G +PT+++LY+ + R VA+ +
Sbjct: 193 IVLKCLEEKMDNIIDLQFLHGYYEPTLLILYEPVRTFSGRIAVRQDTCAMVAISLNIQQR 252
Query: 150 FVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAFKAI 196
WS +NL +PV PL G LI+ +++Y + A
Sbjct: 253 VHPIIWSVSNLPFDCYQAVPVKKPLGGTLIMAVNSLIYLNQSIPPYGVSLNSLAETSTNF 312
Query: 197 PIRPSITKAYGRVDADGS--------RYLLGDHAGLLHLL-VITHEKEKVTGLKIELLGE 247
P++P + ++ +GS R ++ +G L++L + V G +
Sbjct: 313 PLKP---QEGVKISLEGSQVAFISSDRLVISLKSGELYVLSLFADSMRSVRGFHFDKAAA 369
Query: 248 TSIASTISYLDNAVVYIGSSYGDSQLIKL 276
+ + S + ++ +++GS G+S L++
Sbjct: 370 SVLTSCVCMCEDNYLFLGSRLGNSLLLRF 398
Score = 45.4 bits (106), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 46/172 (26%), Positives = 78/172 (45%), Gaps = 14/172 (8%)
Query: 314 QVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP---------FDTF 364
++VT SG K+G+L ++++ I + EL G + MW++ + ++ F
Sbjct: 505 ELVTTSGYGKNGALCVLQHSIRPQVVTTFELPGCEDMWTVIGTLNNDEQIRPEAEGSHAF 564
Query: 365 LVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQ-LVQVTSGSVRLVSST 423
L++S T IL E+ E + GF +Q T+F + N+ +VQVT VRL+
Sbjct: 565 LILSQEDSTMILQTG--QEINEVDQSGFSTQGSTIFAGNLGANRYIVQVTQMGVRLLQGI 622
Query: 424 SRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQ 475
E G + A+ V L + G ++ L + +G T HAQ
Sbjct: 623 --EQIQHMPIDLGCPIVHASCADPYVTLLSEDGQVMLLTLREGRGTARLHAQ 672
>gi|350645868|emb|CCD59413.1| Splicing factor 3B subunit 3, 5' fragment [Schistosoma mansoni]
Length = 1073
Score = 104 bits (259), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 92/343 (26%), Positives = 158/343 (46%), Gaps = 33/343 (9%)
Query: 361 FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLV 420
+D +++VSF++ T +L++ + +EE GF T TL C + LVQV +R +
Sbjct: 433 YDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLTCSQLGDDALVQVYPDGIRHI 490
Query: 421 SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKHAQLEY 478
S R N W++P ++ N QV++A GG LVY E+ G L E + ++
Sbjct: 491 RSDKRV--NVWRAPGKKTITKCAVNRRQVVIALTGGELVYFEMDMTGQLNEYTERKEMPA 548
Query: 479 EISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSV-LLCA 537
++ C+ + I N S+ AVG+ D +VRI SL + +T + G IP + LC
Sbjct: 549 DVICMALGRIPANEQRSRFLAVGL-ADNTVRILSLDPSDCLTPLTMQG--IPSTPESLCI 605
Query: 538 FE---------------------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQ 576
E GI Y+ L +G LL +L+ TGEL+D + LGT+
Sbjct: 606 VEMGTNEPSPSTDDGESEATSSGGILYMNIGLINGVLLRVILDPVTGELSDTRTRYLGTR 665
Query: 577 PITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIA 636
P+ L + V + S R + Y+ + ++ + + + F+S P+ +
Sbjct: 666 PVKLFRIMMQGGEAVLSVSSRSWLSYAYQNRFHLIPLSYEALEYASGFSSEQCPEGIVAI 725
Query: 637 KEGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQEQSRTFAI 677
L I ++ + + + S PL PR+ + H + + T+ I
Sbjct: 726 CNNSLRIMALEKLGAVFNQISYPLQYTPRKFVFHPDSNITYII 768
>gi|209875989|ref|XP_002139437.1| spliceosomal protein SAP 130 [Cryptosporidium muris RN66]
gi|209555043|gb|EEA05088.1| spliceosomal protein SAP 130, putative [Cryptosporidium muris RN66]
Length = 1300
Score = 103 bits (258), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 102/412 (24%), Positives = 177/412 (42%), Gaps = 45/412 (10%)
Query: 295 NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSL 353
+L PI+D + D + Q+ G +LRI G + E A L G + +W+L
Sbjct: 401 SLSPIIDMKIADSSGEFGPQIYVACGRGPRSTLRICTYGKAVEEMAETPLPGRPRFIWTL 460
Query: 354 RSS---------------TDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQT 398
+ + + +F+++SFI T +L++ +++EE F T
Sbjct: 461 KKGGTSYILNKETSETLEDEGNYHSFIIISFIDRTLVLSVG--EQVEEISDSPFTLSEST 518
Query: 399 LFCHDA-IYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGH 457
++ + N +Q+ V+L++ E EWK+P G + A +N Q+ LA GG+
Sbjct: 519 IYASSMELKNSYLQILETYVKLIT---EEKIYEWKAPDGRHIVAADSNGRQISLALSGGY 575
Query: 458 LVYLEIG--DGI---------LTEVKHAQLEYEISCLDINP-IGENPSYSQIAAVGMWTD 505
+V LE+ D I LTE+ H ++ Y+I C+ I I + AVG TD
Sbjct: 576 IVILEMNTIDLISGTTNSIMGLTELCHREVSYDIICISIQQLIYPTKLCREYVAVGTSTD 635
Query: 506 ISVRIF--SLPDLNL---ITKEHLGGEIIPRSVLLC--AFEGISYLLCALGDGHLLNFLL 558
SVR++ + D L T+ IP +++L YLL L +G LL ++
Sbjct: 636 NSVRVYWINTADKKLKQTSTQVMPNASSIPENIVLYKPGLNDSLYLLIGLNNGVLLCCIV 695
Query: 559 NMKTGELTDRKKVSLGTQPITLRT--FSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLK 616
+ G L+D + LG + I + N F ++RP ++ + + L Y + +
Sbjct: 696 DELNGTLSDNRSRFLGGKSIKMVRVGIGEYNKLSTFCMANRPWIVNHNGRSLNYIPIQYR 755
Query: 617 EVSHMCPFNSAAFPDSLAIAKEGELTIGTI--DDIQKLHIRSIPLGEHPRRI 666
+ + PF++ F + L I + DD I L PR++
Sbjct: 756 TLDTVAPFHTKQFKNGFVAVSGTTLVIFQVLYDDSGTFTHSIIKLNYTPRKL 807
Score = 65.5 bits (158), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 81/349 (23%), Positives = 149/349 (42%), Gaps = 43/349 (12%)
Query: 710 LDTFEYGCSILSCSFSDDSNVYYCVGTAYVLP-----EENEPTKGRILVFIVEDGK--LQ 762
LD E S C F + VGT Y + EN T G + D L+
Sbjct: 964 LDVNEGCISACVCKFDEMDLPCLVVGTTYGMKLRKEYNENSSTLGATIKVYNYDTNFNLK 1023
Query: 763 LIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILAL-YV 821
L+ + + + G+LL +IN+ +++Y G ++L +C + L ++
Sbjct: 1024 LVHVTPIENVATCMIGWRGRLLVSINKTLRIYSL-----GKKKLLKKCEYRNIPEVLVWL 1078
Query: 822 QTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWM-SAVEILDDDIYLGAENNFN 880
+ D I GD+ + + Y + + A D W+ SA EILD + ++ N
Sbjct: 1079 KVINDRIFAGDIRHGVIIFKYHSVQNRLSIIANDIMPRWLTSACEILDYHTVITSDKFDN 1138
Query: 881 LFTVRKNSEGATDEERG----------------RLEVVGEYHLGEFVNRFRHGSLVMRLP 924
L R +E +++ + ++ V +H+G+ V L
Sbjct: 1139 LIVCRVPTEASSNYDFTSNFNSQTNTSSYMKPYQINPVAHFHVGDLVTCIHKNQL----- 1193
Query: 925 DSDVGQIPTVIFGTVNGVIGVIASLPHEQYL-FLEKLQTNLRKVIKGVGGLNHEQWRSFN 983
S +G + T+IFGT+ G IG + + ++ + L KL+ LR + +H +RS+
Sbjct: 1194 -SPLG-VETLIFGTILGSIGTLTPITNKDDVDLLCKLELLLRNESPSLMSRDHLMFRSY- 1250
Query: 984 NEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
N +DGDL E+F S I+ ++++S++E+ K++++L
Sbjct: 1251 ----YAPVLNVIDGDLCETFTSYSSDVQARIASSLDISIQEIFKKLDDL 1295
>gi|256075924|ref|XP_002574265.1| Splicing factor 3B subunit 3 5' fragment [Schistosoma mansoni]
Length = 982
Score = 103 bits (258), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 92/343 (26%), Positives = 158/343 (46%), Gaps = 33/343 (9%)
Query: 361 FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLV 420
+D +++VSF++ T +L++ + +EE GF T TL C + LVQV +R +
Sbjct: 433 YDAYIIVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLTCSQLGDDALVQVYPDGIRHI 490
Query: 421 SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKHAQLEY 478
S R N W++P ++ N QV++A GG LVY E+ G L E + ++
Sbjct: 491 RSDKRV--NVWRAPGKKTITKCAVNRRQVVIALTGGELVYFEMDMTGQLNEYTERKEMPA 548
Query: 479 EISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSV-LLCA 537
++ C+ + I N S+ AVG+ D +VRI SL + +T + G IP + LC
Sbjct: 549 DVICMALGRIPANEQRSRFLAVGL-ADNTVRILSLDPSDCLTPLTMQG--IPSTPESLCI 605
Query: 538 FE---------------------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQ 576
E GI Y+ L +G LL +L+ TGEL+D + LGT+
Sbjct: 606 VEMGTNEPSPSTDDGESEATSSGGILYMNIGLINGVLLRVILDPVTGELSDTRTRYLGTR 665
Query: 577 PITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIA 636
P+ L + V + S R + Y+ + ++ + + + F+S P+ +
Sbjct: 666 PVKLFRIMMQGGEAVLSVSSRSWLSYAYQNRFHLIPLSYEALEYASGFSSEQCPEGIVAI 725
Query: 637 KEGELTIGTIDDIQKLHIR-SIPLGEHPRR-ICHQEQSRTFAI 677
L I ++ + + + S PL PR+ + H + + T+ I
Sbjct: 726 CNNSLRIMALEKLGAVFNQISYPLQYTPRKFVFHPDSNITYII 768
>gi|71663721|ref|XP_818850.1| damage-specific DNA binding protein [Trypanosoma cruzi strain CL
Brener]
gi|70884123|gb|EAN96999.1| damage-specific DNA binding protein, putative [Trypanosoma cruzi]
Length = 774
Score = 103 bits (258), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 159/705 (22%), Positives = 267/705 (37%), Gaps = 134/705 (19%)
Query: 407 NQLVQVTSGSVRLVSSTSRE---LRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI 463
N VQVTS V V +NE + + A+A+A+ LL V +
Sbjct: 28 NTFVQVTSVGVNFVKGGKGVYIICKNE------FGIQHASADANLRLLVFSSSRFVSVV- 80
Query: 464 GDGILTEVKHAQLEY--EISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITK 521
+ T A LE E+SCL I S SQ +G W +V ++ + D
Sbjct: 81 --DLNTHHTRASLELDNEVSCLVI-------SSSQGFVIGEWNSGAVCLYEVQD------ 125
Query: 522 EHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTD-----------RKK 570
GEI+ + + C+ S +C L H ++ + G + D ++
Sbjct: 126 ----GEILLKGRIFCSATSCS--MCILSHLHTPRLVVGLLNGYIADISLESMLMGSAARE 179
Query: 571 VSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFP 630
+ QP+ L S N V + P +I + + ++ +V+ C +
Sbjct: 180 TFIRMQPVQLFNLESHNA--VLCLGEVPLIIILCDTGFQLTGIDFNDVA-ACAIIEGSHI 236
Query: 631 DSLAI---AKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQ---- 683
S I E L G I D++KL+ + L R+ + F + +N+
Sbjct: 237 SSKYIFLSQSENSLAFGNIVDLKKLNSCFLGLKATVTRVKYIAWWNVFVMSVRRNEKDQI 296
Query: 684 ----------SCAEESEMHF----VRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSN 729
S ++E+ + LL ++ F T L G + D +
Sbjct: 297 LLLMGHELANSWVPQNELLIELFSIELLGNERCVFFETVVLG----GSNESPNGNEADKD 352
Query: 730 VYYCVGTAYVLPEENEPTKGRILVFIVEDGKL-------QLIAEKETKGAVYS---LNAF 779
+GT + P+E P R + + VE GKL + I K+ +GA+ + +
Sbjct: 353 KAILLGTTFAFPDEQLPRSSRFIWYSVEQGKLISERPQLRQIGSKDVEGALQCCCIVPNY 412
Query: 780 NGKLLAAINQKIQLYKWMLRDDGTRELQSECGHH--GHILALYVQTRGDFIVVGDLMKSI 837
G++ IN I LY W D ++ C +L ++ Q ++V D S
Sbjct: 413 VGRIALGINGCIALYSWNAADSVFVAEETICIGTIVTRVLPIF-QGDASYMVAFDARHSC 471
Query: 838 SLLIYKHEEGAIEERARDYNANW-MSAVEILDDDIY--LGAENNFNLFTVRK-------- 886
+ +G++E ARD M +IY ++ +N F + +
Sbjct: 472 FFIQVDTFQGSLEIVARDSEPRGVMDGTVFQLGNIYNICFGDDYYNFFCLSRGASTLPSA 531
Query: 887 -NSEGATDEERGRLEVVGEYHLGEFVNRFRHGS--------LVMRLPDSDVGQI--PTVI 935
N+ AT G+LE +YHLG+ V + GS + + +P + + I P V+
Sbjct: 532 SNATSATVSS-GKLETSAQYHLGDMVTAMQLGSFAPCSVTNIAVPIPTTLIPGICGPQVV 590
Query: 936 FGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRS-------------- 981
FGT +G G I + +E YL L+ L+ + V+ +GG H +R
Sbjct: 591 FGTSHGAFGTITPVSNETYLLLKALEVAVSSVVPALGGFEHVTYREVLRAGQERGYSRNA 650
Query: 982 ------------FNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEI 1014
F+ +K ++ GDLIESFL S+T I
Sbjct: 651 SFENANVVSSGVFDKRRKRYLSRCVCSGDLIESFLTFSQTIQQRI 695
>gi|413948668|gb|AFW81317.1| hypothetical protein ZEAMMB73_752618 [Zea mays]
Length = 122
Score = 103 bits (258), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 47/58 (81%), Positives = 49/58 (84%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDR 58
MLDVPIYGRIAT+ELFRPH E QDFLFIATERYKFCVLQWD E SELITR+ V R
Sbjct: 53 MLDVPIYGRIATIELFRPHNETQDFLFIATERYKFCVLQWDTEKSELITRSHLHVVCR 110
>gi|270003792|gb|EFA00240.1| hypothetical protein TcasGA2_TC003068 [Tribolium castaneum]
Length = 1392
Score = 103 bits (258), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 128/526 (24%), Positives = 228/526 (43%), Gaps = 82/526 (15%)
Query: 578 ITLRTFSS-KNTTHVFAASDRPTVIYSSNKKLLYSNVNL--KEVSHMCPFNSAAFPDS-L 633
I +R F++ VF P I+ S + L ++ EV FN+ P L
Sbjct: 877 IKMRYFTNIAGYNGVFVCGANPHWIFMSARGELRTHPMTIDGEVLSFAAFNNVNCPQGFL 936
Query: 634 AIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEES 689
++ EL IG + +R +PL P + + +S+T+ + + S AE S
Sbjct: 937 YFNRKSELRIGVLPTHLSYDAAWPVRKVPLRCTPHFVTYHLESKTYCLVT----SIAEPS 992
Query: 690 EMHF---------------------------VRLLDDQTFEFI--STYPLDTFEYGCSIL 720
++ + L +++ I + LD +E+ +
Sbjct: 993 NKYYKFNGEDKELSVEDRGDRFPYPLQEKFSLMLFSPVSWDVIPNTKIDLDEWEHVNCLK 1052
Query: 721 SCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-IVE----------DGKLQLI 764
+ S + + Y VGT Y E+ ++GRIL+F I+E + + I
Sbjct: 1053 NVSLAYEGTRSGLKGYIAVGTNYNYGEDVT-SRGRILIFDIIEVVPEPGQPLTKNRFKEI 1111
Query: 765 AEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTR 824
K+ KG V +L+ G L++A+ QKI Y W L+D+ +L I + T
Sbjct: 1112 YAKDQKGPVTALSQVKGFLVSAVGQKI--YIWQLKDN---DLVGVAFIDTQIYTHQILTI 1166
Query: 825 GDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG-----AENNF 879
++V D+ KSISLL ++ E + +RD+ + +VE + D+ +G +E N
Sbjct: 1167 KSLLLVADVYKSISLLRFQEEYRTLSLVSRDFRPCEVFSVEYMIDNTTMGFLVSDSEKNL 1226
Query: 880 NLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRH--------GSLVMRLPDSDVGQI 931
L+ + S + +R L ++HLG+ VN F G L +D I
Sbjct: 1227 VLYMYQPESRESLGGQR--LLRKADFHLGQAVNSFFRIKCKLGELGEDKKNLTGADKRHI 1284
Query: 932 PTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKK--TV 989
++ T++G +G I +P + Y L LQ L + GLN + +R++ + KK T
Sbjct: 1285 --TMYATLDGGLGYIMPVPEKTYRRLLMLQNVLVSQGAHIAGLNPKAFRTYKSWKKLQTN 1342
Query: 990 DAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRL 1035
A++ +DG+L+ ++L LS E+SK + +EEL + ++ ++
Sbjct: 1343 PARSVIDGELVYNYLQLSIPEKLEVSKKIGTKLEELLDDLSDIQKI 1388
Score = 54.7 bits (130), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 105/542 (19%), Positives = 204/542 (37%), Gaps = 147/542 (27%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMG--DVSDRIGRPT 63
++G I +++ +D L +A + K V+++D E+ +L T ++ + D T
Sbjct: 74 LFGNIMSMQSVNLANSPRDALLLAFKDAKLSVVEYDPETHDLKTLSLHYFEEDDMKDGWT 133
Query: 64 DNGQIGII--DPDCRLIGLHLYDGLFKVIPFD---------------------NKGQLKE 100
+ + ++ DP+ R + ++ V+PF +K +
Sbjct: 134 HHYHVPMVRADPENRCAVMTVFGRKLVVLPFRRENAIDDTDADIKPMIGGAYGSKAPILA 193
Query: 101 AFNIRLEEL-----QVLDIKFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDF 150
++ I L++ ++DI+FL+G +PT+++L++ K A T +A +
Sbjct: 194 SYMIVLKDFIDKVDNIIDIQFLHGYYEPTLLILFEPLKTFAGRVAVRTDTCAMAAISLNL 253
Query: 151 VEGP----WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAF 193
+ WS NL +P+ PL G LI ++Y + A
Sbjct: 254 QQKVHPIIWSVANLPFDCVKAVPIKKPLGGTLIFAVNALIYLNQSIPPYGVSLNSIAENS 313
Query: 194 KAIPIRP------SITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEK-VTGLKIELLG 246
P++P S+ A D + +L G L++L + + + V E
Sbjct: 314 TNFPLKPQDDLCISLDCAQATFLEDDT-IVLSLKGGELYVLTLLADNMRYVRSFHFEKAA 372
Query: 247 ETSIASTISYLDNAVVYIGSSYGDSQLIKL------------NLQPDAK----------- 283
+ + + IS +N +++GS G+S L++ ++P AK
Sbjct: 373 ASVLTTCISVCENNFLFLGSRLGNSLLLRFTEKCNEVITLDETIEPSAKRLKASNSTSEN 432
Query: 284 ---------------------------------------GSYV-EVLERYVNLGPIVDFC 303
SYV EV + +N+GP +
Sbjct: 433 EDDKVLDTLNDCMASDVLDIRDPEELEVYGNQKQASLQISSYVFEVCDSLLNIGPCGNIS 492
Query: 304 V-------------VDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGM 350
+ +DL+ ++VT +G K+G+L +++ + + L G M
Sbjct: 493 LGEPAFLSEEFSENLDLDL----ELVTTAGYGKNGALCVLQKSVRPQIVTTFTLPGCSNM 548
Query: 351 WSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLV 410
W++ + D FL++S T IL DE+ E + GF + T++ A NQL
Sbjct: 549 WTVHAGEDK--HAFLILSQEDGTMILQTG--DEINEIDNTGFATHIPTVY---AGINQLQ 601
Query: 411 QV 412
+
Sbjct: 602 HI 603
>gi|91078626|ref|XP_968117.1| PREDICTED: similar to cleavage and polyadenylation specificity factor
cpsf [Tribolium castaneum]
Length = 1413
Score = 103 bits (257), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 128/526 (24%), Positives = 228/526 (43%), Gaps = 82/526 (15%)
Query: 578 ITLRTFSS-KNTTHVFAASDRPTVIYSSNKKLLYSNVNL--KEVSHMCPFNSAAFPDS-L 633
I +R F++ VF P I+ S + L ++ EV FN+ P L
Sbjct: 898 IKMRYFTNIAGYNGVFVCGANPHWIFMSARGELRTHPMTIDGEVLSFAAFNNVNCPQGFL 957
Query: 634 AIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEES 689
++ EL IG + +R +PL P + + +S+T+ + + S AE S
Sbjct: 958 YFNRKSELRIGVLPTHLSYDAAWPVRKVPLRCTPHFVTYHLESKTYCLVT----SIAEPS 1013
Query: 690 EMHF---------------------------VRLLDDQTFEFI--STYPLDTFEYGCSIL 720
++ + L +++ I + LD +E+ +
Sbjct: 1014 NKYYKFNGEDKELSVEDRGDRFPYPLQEKFSLMLFSPVSWDVIPNTKIDLDEWEHVNCLK 1073
Query: 721 SCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-IVE----------DGKLQLI 764
+ S + + Y VGT Y E+ ++GRIL+F I+E + + I
Sbjct: 1074 NVSLAYEGTRSGLKGYIAVGTNYNY-GEDVTSRGRILIFDIIEVVPEPGQPLTKNRFKEI 1132
Query: 765 AEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTR 824
K+ KG V +L+ G L++A+ QKI Y W L+D+ +L I + T
Sbjct: 1133 YAKDQKGPVTALSQVKGFLVSAVGQKI--YIWQLKDN---DLVGVAFIDTQIYTHQILTI 1187
Query: 825 GDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG-----AENNF 879
++V D+ KSISLL ++ E + +RD+ + +VE + D+ +G +E N
Sbjct: 1188 KSLLLVADVYKSISLLRFQEEYRTLSLVSRDFRPCEVFSVEYMIDNTTMGFLVSDSEKNL 1247
Query: 880 NLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRH--------GSLVMRLPDSDVGQI 931
L+ + S + +R L ++HLG+ VN F G L +D I
Sbjct: 1248 VLYMYQPESRESLGGQR--LLRKADFHLGQAVNSFFRIKCKLGELGEDKKNLTGADKRHI 1305
Query: 932 PTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKK--TV 989
++ T++G +G I +P + Y L LQ L + GLN + +R++ + KK T
Sbjct: 1306 --TMYATLDGGLGYIMPVPEKTYRRLLMLQNVLVSQGAHIAGLNPKAFRTYKSWKKLQTN 1363
Query: 990 DAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRL 1035
A++ +DG+L+ ++L LS E+SK + +EEL + ++ ++
Sbjct: 1364 PARSVIDGELVYNYLQLSIPEKLEVSKKIGTKLEELLDDLSDIQKI 1409
Score = 63.2 bits (152), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 118/588 (20%), Positives = 226/588 (38%), Gaps = 150/588 (25%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMG--DVSDRIGRPT 63
++G I +++ +D L +A + K V+++D E+ +L T ++ + D T
Sbjct: 74 LFGNIMSMQSVNLANSPRDALLLAFKDAKLSVVEYDPETHDLKTLSLHYFEEDDMKDGWT 133
Query: 64 DNGQIGII--DPDCRLIGLHLYDGLFKVIPFD---------------------NKGQLKE 100
+ + ++ DP+ R + ++ V+PF +K +
Sbjct: 134 HHYHVPMVRADPENRCAVMTVFGRKLVVLPFRRENAIDDTDADIKPMIGGAYGSKAPILA 193
Query: 101 AFNIRLEEL-----QVLDIKFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDF 150
++ I L++ ++DI+FL+G +PT+++L++ K A T +A +
Sbjct: 194 SYMIVLKDFIDKVDNIIDIQFLHGYYEPTLLILFEPLKTFAGRVAVRTDTCAMAAISLNL 253
Query: 151 VEGP----WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAF 193
+ WS NL +P+ PL G LI ++Y + A
Sbjct: 254 QQKVHPIIWSVANLPFDCVKAVPIKKPLGGTLIFAVNALIYLNQSIPPYGVSLNSIAENS 313
Query: 194 KAIPIRP------SITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEK-VTGLKIELLG 246
P++P S+ A D + +L G L++L + + + V E
Sbjct: 314 TNFPLKPQDDLCISLDCAQATFLEDDT-IVLSLKGGELYVLTLLADNMRYVRSFHFEKAA 372
Query: 247 ETSIASTISYLDNAVVYIGSSYGDSQLIKL------------NLQPDAK----------- 283
+ + + IS +N +++GS G+S L++ ++P AK
Sbjct: 373 ASVLTTCISVCENNFLFLGSRLGNSLLLRFTEKCNEVITLDETIEPSAKRLKASNSTSEN 432
Query: 284 ---------------------------------------GSYV-EVLERYVNLGPIVDFC 303
SYV EV + +N+GP +
Sbjct: 433 EDDKVLDTLNDCMASDVLDIRDPEELEVYGNQKQASLQISSYVFEVCDSLLNIGPCGNIS 492
Query: 304 V-------------VDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGM 350
+ +DL+ ++VT +G K+G+L +++ + + L G M
Sbjct: 493 LGEPAFLSEEFSENLDLDL----ELVTTAGYGKNGALCVLQKSVRPQIVTTFTLPGCSNM 548
Query: 351 WSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHD-AIYNQL 409
W++ + D FL++S T IL DE+ E + GF + T++ + +
Sbjct: 549 WTVHAGEDK--HAFLILSQEDGTMILQTG--DEINEIDNTGFATHIPTVYAGNLGNLKYI 604
Query: 410 VQVTSGSVRLVSSTS--RELRNEWKSPPGYSVNVATANASQVLLATGG 455
VQVTS +VRL+ + + + E SP V+V + + LL T G
Sbjct: 605 VQVTSSAVRLLQGINQLQHIPLELGSP---IVHVTSVDPYISLLTTDG 649
>gi|340710064|ref|XP_003393618.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Bombus terrestris]
Length = 1417
Score = 103 bits (257), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 126/525 (24%), Positives = 232/525 (44%), Gaps = 76/525 (14%)
Query: 575 TQPITLRTFSS-KNTTHVFAASDRPTVIYSSNKKLLYSNVNLKE--VSHMCPFNSAAFPD 631
T+ +R FS+ VF SD P I+ + + L ++ + V+ PFN+ P
Sbjct: 899 TRHCMMRYFSNIAGYNGVFICSDYPHWIFLTGRGELRTHPMGIDGPVTSFAPFNNINCPQ 958
Query: 632 S-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQSRTFAICS-----LK 681
L ++ EL I + +R +PL P + + +S+T+ + + LK
Sbjct: 959 GFLYFNRKEELRICVLPTHLSYDAPWPVRKVPLRCTPHFVTYHLESKTYCVITSIAEPLK 1018
Query: 682 N---------QSCAEESEMHFVRLLDDQ----TFEFIS--TYP-----LDTFEYGCSILS 721
+ + EE F+ +Q F +S T P LD +E+ + +
Sbjct: 1019 SYYRFNGEDKEFTEEERPERFIYPSQEQFSIVLFSPVSWETIPNTKIELDQWEHVTCLKN 1078
Query: 722 CSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-IVE----------DGKLQLIA 765
S + + Y +GT Y E+ ++GRIL+F I+E + + I
Sbjct: 1079 VSLAYEGTRSGLKGYIVLGTNYNYGEDIT-SRGRILIFDIIEVVPEPGQPLTKNRFKQIY 1137
Query: 766 EKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDD---GTRELQSECGHHGHILALYVQ 822
KE KG + ++ +G L++A+ QKI Y W L+D+ G + ++ H +L++
Sbjct: 1138 AKEQKGPITAITQVSGFLVSAVGQKI--YIWQLKDNDLVGVAFIDTQIYIH-QMLSI--- 1191
Query: 823 TRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG-----AEN 877
I++ D+ KSISLL ++ E + +RD+ + +E L D+ LG E+
Sbjct: 1192 --KSLILIADVYKSISLLRFQEEYRTLSLVSRDFRPAEVYTIEYLIDNTNLGFLVADGES 1249
Query: 878 NFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIP----- 932
N LF + S + + +L ++HLG+ VN F + P +D
Sbjct: 1250 NMALFMYQPESRESLGGQ--KLIRKADFHLGQKVNTFFRIRCRLSDPANDKKHFSGADKR 1307
Query: 933 -TVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTV-- 989
++ +++G +G I +P + Y L LQ L I + GLN + +R++ + +T
Sbjct: 1308 HVTMYASLDGSLGYILPVPEKTYRRLLMLQNVLVTHICHIAGLNPKAYRTYKSHIRTQGN 1367
Query: 990 DAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTR 1034
A+ +DGDL+ + L +++K + V+E+ + + E+ R
Sbjct: 1368 PARGIIDGDLVWRYFYLPNNEKIDVAKKIGTRVQEIIEDLTEIDR 1412
Score = 53.9 bits (128), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 65/330 (19%), Positives = 139/330 (42%), Gaps = 61/330 (18%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRP--T 63
++G + +++ G +D L ++ K V+++D ++ +L T ++ + R T
Sbjct: 73 LHGNVMSMQAVTLVGSQRDSLLLSFRDAKLSVVEYDQDTHDLRTVSLHYFEEEEIRDGWT 132
Query: 64 DNGQIGII--DPDCRLIGLHLYDGLFKVIPF------------------DNKGQLKEAFN 103
++ I I+ DP+ R + +Y V+PF NK + ++
Sbjct: 133 NHHHIPIVRVDPEGRCAVMLIYGRKLVVLPFKKDPSLDDGDLLDNSKALSNKTPILSSYM 192
Query: 104 IRLEELQ-----VLDIKFLYGCAKPTIVVLYQDNKD------ARHVKTYEVALK---DKD 149
I L+ L+ ++D++FL+G +PT+++LY+ + R VA+ +
Sbjct: 193 IVLKSLEEKMDNIIDLQFLHGYYEPTLLILYEPVRTFSGRIAVRQDTCAMVAISLNIQQR 252
Query: 150 FVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAFKAI 196
WS +NL +PV PL G LI+ +++Y + A
Sbjct: 253 VHPIIWSVSNLPFDCYQAVPVKKPLGGTLIMAVNSLIYLNQSIPPYGVSLNSLAETSTNF 312
Query: 197 PIRPSITKAYGRVDADGS--------RYLLGDHAGLLHLL-VITHEKEKVTGLKIELLGE 247
P++P + ++ +GS R ++ +G L++L + V G +
Sbjct: 313 PLKP---QEGVKISLEGSQVAFISSDRLVISLKSGELYVLSLFADSMRSVRGFHFDKAAA 369
Query: 248 TSIASTISYLDNAVVYIGSSYGDSQLIKLN 277
+ + S + ++ +++GS G+S L++
Sbjct: 370 SVLTSCVCMCEDNYLFLGSRLGNSLLLRFT 399
Score = 44.7 bits (104), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 46/172 (26%), Positives = 77/172 (44%), Gaps = 14/172 (8%)
Query: 314 QVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP---------FDTF 364
++VT SG K+G+L +++ I + EL G + MW++ + ++ F
Sbjct: 507 ELVTTSGYGKNGALCVLQRSIRPQVVTTFELPGCEDMWTVIGTLNNDEQIRPEAEGSHAF 566
Query: 365 LVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQ-LVQVTSGSVRLVSST 423
L++S T IL E+ E + GF +Q T+F + N+ +VQVT VRL+
Sbjct: 567 LILSQEDSTMILQTG--QEINEVDQSGFSTQGSTIFAGNLGANRYIVQVTQMGVRLLQGI 624
Query: 424 SRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQ 475
E G + A+ V L + G ++ L + +G T HAQ
Sbjct: 625 --EQIQHMPIDLGCPIVHASCADPYVTLLSEDGQVMLLTLREGRGTAKLHAQ 674
>gi|431908146|gb|ELK11749.1| Cleavage and polyadenylation specificity factor subunit 1 [Pteropus
alecto]
Length = 820
Score = 103 bits (257), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 113/466 (24%), Positives = 194/466 (41%), Gaps = 74/466 (15%)
Query: 618 VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQS 672
+ PF++ P L ++GEL I + +R IPL + + +S
Sbjct: 349 IDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVES 408
Query: 673 RTFAICSLKNQSCAE-----------------------ESEMHFVRLLDDQTFEFI--ST 707
+ +A+ + N C + E ++L+ ++E I +
Sbjct: 409 KVYAVATSTNTPCTRIPRMTGEEKEFETIERDDRYIHPQQEAFSIQLISPVSWEAIPNAR 468
Query: 708 YPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-----IVE 757
L+ +E+ + + S + V Y GT + EE +GRIL+ + E
Sbjct: 469 IELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVT-CRGRILIMDVIEVVPE 527
Query: 758 DG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECG 811
G K +++ EKE KG V +L NG L++AI QKI L W LR EL
Sbjct: 528 PGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFL--WSLR---ASELTGMAF 582
Query: 812 HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI 871
+ + + +FI+ D+MKSISLL Y+ E + +RD + +V+ + D+
Sbjct: 583 IDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNA 642
Query: 872 YLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV 928
LG ++ + NL E RL ++H+G VN F R P
Sbjct: 643 QLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF------WRTPCRGA 696
Query: 929 GQIPT-----------VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHE 977
+ P+ F T++G IG++ + + Y L LQ L ++ GLN
Sbjct: 697 AEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPR 756
Query: 978 QWRSFNNEKKTVD--AKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
+R + +++T+ +N LDG+L+ +L LS E++K + +
Sbjct: 757 AFRMLHVDRRTLQNAVRNVLDGELLNRYLYLSTMERGELAKKIGTT 802
>gi|53791734|dbj|BAD53405.1| UV-damaged DNA-binding protein-like [Oryza sativa Japonica Group]
Length = 154
Score = 103 bits (257), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 50/93 (53%), Positives = 64/93 (68%), Gaps = 7/93 (7%)
Query: 845 EEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGE 904
+G +E +RDYN WM+AVE+LDD +Y+GA+N +NLFTV K GRL V+G+
Sbjct: 61 NKGQTDELSRDYNITWMTAVEMLDDYVYIGADNCYNLFTVLK-------RRVGRLLVIGQ 113
Query: 905 YHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFG 937
YHLG+ VNRF G LVM+ P S+V QIP IFG
Sbjct: 114 YHLGDLVNRFHQGPLVMQDPGSEVDQIPAFIFG 146
>gi|357611296|gb|EHJ67409.1| putative cleavage and polyadenylation specific factor 1 [Danaus
plexippus]
Length = 328
Score = 103 bits (256), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 89/337 (26%), Positives = 160/337 (47%), Gaps = 41/337 (12%)
Query: 722 CSFSDDSNV--YYCVGTAYVLPEENEPTKGRILVFIVED-----------GKLQLIAEKE 768
C F S + Y +GT Y E+ ++GRIL++ + D + + I KE
Sbjct: 4 CGFIQLSGLRGYIAIGTNYNYGEDI-TSRGRILIYDIIDVVPEPGQPLTKNRFKEIYAKE 62
Query: 769 TKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDD---GTRELQSECGHHGHILALYVQTRG 825
KG V +L G L++A+ QKI Y W L+D+ G + ++ H +LA+
Sbjct: 63 QKGPVTALTQVLGFLISAVGQKI--YLWQLKDNDLVGVAFIDTQIYVH-RMLAV-----K 114
Query: 826 DFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG-----AENNFN 880
+ I+V D+ KSISLL Y+H+ + +RD + ++ + D+ LG +E NF
Sbjct: 115 NLILVADVYKSISLLRYQHQHRTLSLVSRDLRTAQIYDMQFMIDNTSLGFLVSESEGNFA 174
Query: 881 LFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVN 940
++ + + + +R L +YHLG+ V+ + RL Q +F T++
Sbjct: 175 MYMHQPQARESYGGQR--LIRKCDYHLGQRVH------AMFRLAARGERQTHVTMFTTLD 226
Query: 941 GVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTV---DAKNFLDG 997
G +G + + + Y L LQ + + GLN + +R++ ++ + A+ LDG
Sbjct: 227 GGVGYVLPVSEKVYRRLLMLQNVINNYCCHLAGLNPKAYRTYKVSRRALCGGAARGVLDG 286
Query: 998 DLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTR 1034
DL+ + + RT +I++ + VEE+ + E+ R
Sbjct: 287 DLVSLYTSMPRTEQQDIARKIGTKVEEIMSDLYEIDR 323
>gi|255918233|ref|NP_001157645.1| cleavage and polyadenylation specificity factor subunit 1 isoform 1
[Mus musculus]
Length = 1450
Score = 103 bits (256), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 114/480 (23%), Positives = 199/480 (41%), Gaps = 74/480 (15%)
Query: 618 VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQS 672
+ PF++ P L ++GEL I + +R IPL + + +S
Sbjct: 970 IDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVES 1029
Query: 673 RTFAICSLKNQSCAE-----------------------ESEMHFVRLLDDQTFEFI--ST 707
+ +A+ + N C + E ++L+ ++E I +
Sbjct: 1030 KVYAVATSTNTPCTRIPRMTGEEKEFEAIERDDRYIHPQQEAFSIQLISPVSWEAIPNAR 1089
Query: 708 YPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-----IVE 757
L+ +E+ + + S + V Y GT + EE +GRIL+ + E
Sbjct: 1090 IELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVT-CRGRILIMDVIEVVPE 1148
Query: 758 DG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECG 811
G K +++ EKE KG V +L NG L++AI QKI L W LR EL
Sbjct: 1149 PGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFL--WSLR---ASELTGMAF 1203
Query: 812 HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI 871
+ + + +FI+ D+MKSISLL Y+ E + +RD + +V+ + D+
Sbjct: 1204 IDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNA 1263
Query: 872 YLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV 928
LG ++ + NL E RL ++H+G VN F R P
Sbjct: 1264 QLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF------WRTPCRGA 1317
Query: 929 GQIPT-----------VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHE 977
+ P+ F T++G IG++ + + Y L LQ L ++ GLN
Sbjct: 1318 AEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPR 1377
Query: 978 QWRSFNNEKKTVD--AKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRL 1035
+R + +++ + +N LDG+L+ +L LS E++K + + + R+ +R+
Sbjct: 1378 AFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTTPDITGSRLTSNSRM 1437
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 108/565 (19%), Positives = 220/565 (38%), Gaps = 151/565 (26%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAM-----GDVSDRIG 60
+G + ++ + G +D L ++ + K V+++D + +L T ++ ++ D
Sbjct: 74 FFGNVMSMASVQLAGAKRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPELRDGFV 133
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPF-------DNKGQLKEA----------FN 103
+ ++ + DPD R + +Y V+PF +++G + E +
Sbjct: 134 QNVHTPRVRV-DPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLMGEGQRSSFLPSYIID 192
Query: 104 IR-LEE--LQVLDIKFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDFVEG-- 153
+R L+E L ++D++FL+G +PT+++L++ N+ A T + + +
Sbjct: 193 VRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKVH 252
Query: 154 P--WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAFKAIPI 198
P WS +L + VP P+ GV+I +++Y + A P+
Sbjct: 253 PVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFPL 312
Query: 199 RPSITKAYGRVDADGS--------RYLLGDHAGLLHLL-VITHEKEKVTGLKIELLGETS 249
R T+ R+ D + + ++ G +++L +IT V + +
Sbjct: 313 R---TQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASV 369
Query: 250 IASTISYLDNAVVYIGSSYGDSQLIKL--NLQ---------------PDAKGSYVE---- 288
+ +++ ++ +++GS G+S L+K LQ P +K VE
Sbjct: 370 LTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASSVREAADKEEPPSKKKRVEPAVG 429
Query: 289 ----------------------------------VLERYVNLGPIVDFCVVD---LERQG 311
V + +N+GP + V + L +
Sbjct: 430 WTGGKTVPQDEVDEIEVYGSEAQSGTQLATYSFEVCDSMLNIGPCANAAVGEPAFLSEEF 489
Query: 312 QG------QVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL------------ 353
Q ++V CSG K+G+L +++ I + EL G MW++
Sbjct: 490 QNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEEEETP 549
Query: 354 ------------RSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFC 401
++ D FL++S T IL E+ E + GF +Q T+F
Sbjct: 550 KAESTEQEPSAPKAEEDGRRHGFLILSREDSTMILQTG--QEIMELDTSGFATQGPTVFA 607
Query: 402 HDAIYNQ-LVQVTSGSVRLVSSTSR 425
+ N+ +VQV+ +RL+ ++
Sbjct: 608 GNIGDNRYIVQVSPLGIRLLEGVNQ 632
>gi|417406474|gb|JAA49895.1| Putative mrna cleavage and polyadenylation factor ii complex subunit
cft1 cpsf subunit [Desmodus rotundus]
Length = 1444
Score = 102 bits (255), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 112/466 (24%), Positives = 194/466 (41%), Gaps = 74/466 (15%)
Query: 618 VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQS 672
+ PF++ P L ++GEL I + +R IPL + + +S
Sbjct: 973 IDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVES 1032
Query: 673 RTFAICSLKNQSCAE-----------------------ESEMHFVRLLDDQTFEFI--ST 707
+ +A+ + N C + E ++L+ ++E I +
Sbjct: 1033 KVYAVATSTNTPCTRIPRMTGEEKEFETIERDDRYIHPQQEAFSIQLISPVSWEAIPNAR 1092
Query: 708 YPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-----IVE 757
L+ +E+ + + S + V Y GT + EE +GRIL+ + E
Sbjct: 1093 IELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVT-CRGRILIMDVIEVVPE 1151
Query: 758 DG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECG 811
G K +++ EKE KG V +L NG L++AI QKI L W LR EL
Sbjct: 1152 PGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFL--WSLR---ASELTGMAF 1206
Query: 812 HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI 871
+ + + +FI+ D+MKSISLL Y+ + + +RD + +V+ + D+
Sbjct: 1207 IDTQLYIHQMISVKNFILAADVMKSISLLRYQEDSKTLSLVSRDAKPLEVYSVDFMVDNA 1266
Query: 872 YLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV 928
LG ++ + NL E RL ++H+G VN F R P
Sbjct: 1267 QLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF------WRTPCRGA 1320
Query: 929 GQIPT-----------VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHE 977
+ P+ F T++G IG++ + + Y L LQ L ++ GLN
Sbjct: 1321 AEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPR 1380
Query: 978 QWRSFNNEKKTVD--AKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
+R + +++T+ +N LDG+L+ +L LS E++K + +
Sbjct: 1381 AFRMLHVDRRTLQNAVRNILDGELLNRYLYLSTMERSELAKKIGTT 1426
Score = 47.4 bits (111), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 63/326 (19%), Positives = 141/326 (43%), Gaps = 60/326 (18%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAM-----GDVSDRIG 60
+G + ++ + G +D L ++ + K V+++D + +L T ++ ++ D
Sbjct: 77 FFGNVMSMASVQLAGAKRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPELRDGFV 136
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPF-------DNKGQLKEA----------FN 103
+ ++ + DPD R + +Y V+PF +++G + E +
Sbjct: 137 QNVHTPRVRV-DPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLMGEGQRSSFLPSYIID 195
Query: 104 IR-LEE--LQVLDIKFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDFVEG-- 153
+R L+E L ++D++FL+G +PT+++L++ N+ A T + + +
Sbjct: 196 VRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKVH 255
Query: 154 P--WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAFKAIPI 198
P WS +L + VP P+ GV+I +++Y + + A P+
Sbjct: 256 PVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTSGTTAFPL 315
Query: 199 RPSITKAYGRVDADGS--------RYLLGDHAGLLHLL-VITHEKEKVTGLKIELLGETS 249
R T+ R D + + ++ G +++L +IT V + +
Sbjct: 316 R---TQEGVRTTLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRSFHFDKAAASV 372
Query: 250 IASTISYLDNAVVYIGSSYGDSQLIK 275
+ S++ ++ +++GS G+S L+K
Sbjct: 373 LTSSMVTMEPGYLFLGSRLGNSLLLK 398
>gi|440301427|gb|ELP93813.1| DNA repair protein xp-E, putative [Entamoeba invadens IP1]
Length = 1001
Score = 102 bits (255), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 97/331 (29%), Positives = 147/331 (44%), Gaps = 41/331 (12%)
Query: 700 QTFEFI--STYPLDTFEYGCSILSCSFSDD-SNVYYCVGTAYVLPEENEPTKGRILVFIV 756
Q F I ++YP + C C + S Y +GTA + E EPT GR+LV
Sbjct: 684 QLFNLIDGASYPSEFLLLSCDEEICMCVETISEHLYVIGTAIIKENEVEPTVGRLLVAEE 743
Query: 757 EDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHI 816
DGKL L E +GAVY++ F +LA IN+ + + R + ++
Sbjct: 744 VDGKLVLKCTHEFEGAVYAIKKFKQHVLALINRHLHVMDITNESVLPRNIIEL-----NM 798
Query: 817 LALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA-RDYNANWMSAVEILDDDIYLGA 875
+ + ++ GD+I+VGD MKS+++ + K ++ + E+A D +W++AV +++ G
Sbjct: 799 IGVCMEVLGDYIIVGDFMKSVTVFVCKSDDLSQTEKAWTDIAVSWVNAVGCIEN----GD 854
Query: 876 ENNFNLFTVRKNSEGATD--EERGRLE----VVGEYHLGEFVNRFRHGSLVMRLPDSDVG 929
N F V N + EER +E VG+ H+ E +N F SL
Sbjct: 855 SNKFVSCDVDGNVKIFVKGGEERWYIEDLLKCVGKIHVCECIN-FVEKSLY--------- 904
Query: 930 QIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTV 989
V G V+G + I+ L Y L+K Q L K + WR F K
Sbjct: 905 --KGVTLGGVSGALYNISCLSDNDYQILKKAQDMLVK----------DNWRQFVGTKTKE 952
Query: 990 DAKNFLDGDLIESFLDLSRTRMDEISKTMNV 1020
NF+DGD IE+ LD R K + V
Sbjct: 953 VMTNFIDGDKIETILDWPLERQTSFCKKIGV 983
Score = 98.6 bits (244), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 81/343 (23%), Positives = 166/343 (48%), Gaps = 21/343 (6%)
Query: 9 RIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDNGQI 68
+I L ++P + +++ + T + V++ ++ L + G +SD GR + G
Sbjct: 63 KIEVLLKYKPVKDEDEYIIVVTASVQIDVIKL---TTNLHVVSSGFLSDPFGRLSFYGMK 119
Query: 69 GIIDPDCRLIGLHLYDGLFKVIPFDNKGQLK-EAFNIRLEELQVLDIKFLYGCAKPTIVV 127
++ P+ + + LH+Y+ L K + + K A N ++ +++D F + T+V
Sbjct: 120 AVVSPNGQYLLLHIYEQLIKFVKLPQRPSDKIFATNSKISINRIIDFSFCTIADQETVVF 179
Query: 128 LYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVY 187
L+++++D RHV TY + ++FV+ +SQ N+ + L++ +P + GVL+I +ET Y
Sbjct: 180 LHENSRDTRHVNTY-IIDDHQNFVKSTFSQPNVGSSTRLVLGLPNGVNGVLVISDETACY 238
Query: 188 CSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-KVTGLKIELLG 246
+ + + + + +L D HLL + E V+ + +L
Sbjct: 239 FTGTG-DHVLVNVGKNRITTTTFLNKDMVILTDSNTGCHLLKLKVCVENSVSEILYSVLP 297
Query: 247 ET-SIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVV 305
T S+A T+S + +++ GSS G S L++++ E+ E + N GPI D +
Sbjct: 298 STESVAQTVSVIGGDIIFFGSSSGKSYLMRVS---------GEIFETWENCGPITDLKKI 348
Query: 306 DLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIK 348
+ + V+ C+G ++ L ++ G G+ E S L+ ++
Sbjct: 349 E---NKENYVINCNGGGQE-CLGVIFKGSGVKELGSTSLEHVR 387
>gi|219110831|ref|XP_002177167.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217411702|gb|EEC51630.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 1303
Score = 102 bits (254), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 124/503 (24%), Positives = 212/503 (42%), Gaps = 75/503 (14%)
Query: 233 EKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNL--QPDA-------- 282
+K KV G++I LL + ++ +++ +S+GD L + PDA
Sbjct: 372 DKTKVVGMEIALLDTLPNVNALNVSKLGMLFAAASFGDHHLYQFERIDLPDAPKNTSDEA 431
Query: 283 ------KGSYV-------EVLERYV--------------NLGPIVDFCVVDLE-RQGQGQ 314
GS V E+ ++V N P V +L + Q
Sbjct: 432 TKALTASGSTVLTTALASEIASKFVPTVLQNLRKIHSLDNPSPTTGVLVGELAGNEVSPQ 491
Query: 315 VVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKG----MWSLRSSTDDPFDTFLVVSFI 370
+ +G+ +LR+ R+G + E A EL G+ G + + ++ D F+VVSF
Sbjct: 492 IYALTGSGPTSALRVTRHGASVTELAVSELPGVPGAIFTIGTGNTAGGKKLDQFIVVSFA 551
Query: 371 SETRILAMNLEDELEETEIE-GFCSQTQTLFCHD-AIYNQLVQVTSGSVRLVSSTSRELR 428
T +L++ + +EE E GF + + TL C L QV VR + + +
Sbjct: 552 DATLVLSVG--ETVEEVGKESGFLTNSPTLACSALGSDGALCQVHPVGVRHIQNGQAK-- 607
Query: 429 NEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG--DGILTEVKHAQLEYEISCLDIN 486
+W P + A+AN SQV++A GG ++Y E+ G L E ++ ++ CLD+
Sbjct: 608 -QWHCPGLKRIECASANESQVIVALAGGEIIYFELDPMSGNLMESATREMGADVCCLDVG 666
Query: 487 PIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEII--PRSVLLCAFE-GIS 542
I + S S A VG D +VR+ SL PD I + + P SV L G++
Sbjct: 667 TIPKGRSRSLFAVVG-CRDQTVRVISLEPDKTKILSQRSSTALKARPHSVALQMMNAGVA 725
Query: 543 ---------YLLCALGDGHLLNFLLNMKTGEL-TDRKKVSLGTQPITLRTFSSKNTTHVF 592
L+ L DG L ++ TG + T + LG +P+ + S ++
Sbjct: 726 NSNANVDDLTLIIGLDDGSSLRASIDPITGSIGTSPTRRFLGARPVAVSRISLESNPATL 785
Query: 593 AASDRPTVIY--SSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTID--- 647
S RP + +++ K + + ++ + H C F+S A + + L I +I+
Sbjct: 786 LLSSRPWITRRDAASGKHIMAPLSYAPLDHGCSFSSDAVSEGIVATAGKTLRILSIESSG 845
Query: 648 ----DIQKLHIRSIPLGEHPRRI 666
D + + IPL PR++
Sbjct: 846 MEGGDDEAFNTNKIPLRYTPRQM 868
Score = 73.6 bits (179), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 77/303 (25%), Positives = 139/303 (45%), Gaps = 42/303 (13%)
Query: 751 ILVFIVEDGKLQLIAEKETK-GAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTREL--Q 807
++ IV + QL+ G V SL F G+LL I ++LY+ G R+L +
Sbjct: 1015 VIYRIVNGDRFQLLHRTSVDDGPVLSLAHFQGRLLVGIGTTLRLYEM-----GKRQLLRK 1069
Query: 808 SECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEIL 867
SE + + VQT G+ +GD+M+SI ++ Y + A D + + E+L
Sbjct: 1070 SELRNFPTFVK-TVQTVGERAYIGDMMQSIQIVRYDVSANRLVLIANDASPRPIVCQELL 1128
Query: 868 D-DDIYLGAENNFNLFTVRKNSEGAT-------------DEER----GRLEVVGEYHLGE 909
D + + +G + F +V + GA D R +LE++ +Y++GE
Sbjct: 1129 DWNTVAVG--DKFGNISVMRLPRGADTSAIDVTGQRALWDSSREDMIPKLELLCQYYVGE 1186
Query: 910 FVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYL-FLEKLQTNLRKVI 968
V SLV G ++I+ TV+G IG + + F +L++ LR
Sbjct: 1187 VVTSMTRSSLV-------AGGAESLIYVTVSGRIGAFVPFTNRNDVDFYSQLESELRGDA 1239
Query: 969 KGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKR 1028
G + + +RS+ V +DGDL ++F L + ++I++ ++ +V E+ K+
Sbjct: 1240 SRPTGRDPQSYRSYYAPMMHV-----VDGDLCDAFNSLGPEKQNKIAEKLDRTVGEIMKK 1294
Query: 1029 VEE 1031
+E+
Sbjct: 1295 LED 1297
>gi|402879380|ref|XP_003903320.1| PREDICTED: LOW QUALITY PROTEIN: cleavage and polyadenylation
specificity factor subunit 1 [Papio anubis]
Length = 1389
Score = 102 bits (254), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 114/460 (24%), Positives = 195/460 (42%), Gaps = 62/460 (13%)
Query: 618 VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQS 672
V PF++ P L ++GEL I + +R IPL + + +S
Sbjct: 918 VDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVES 977
Query: 673 RTFAICSLKNQSCAE-----------------------ESEMHFVRLLDDQTFEFI--ST 707
+ +A+ + N CA + E ++L+ ++E I +
Sbjct: 978 KVYAVATSTNTPCARIPRMTGEEKEFETIERDERYIHPQQEAFSIQLISPVSWEAIPNAR 1037
Query: 708 YPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-----IVE 757
L +E+ + + S + V Y GT + EE +GRIL+ + E
Sbjct: 1038 IELQEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVT-CRGRILIMDVIEVVPE 1096
Query: 758 DG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECG 811
G K +++ EKE KG V +L NG L++AI QKI + W LR EL
Sbjct: 1097 PGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKI--FLWSLR---ASELTGMAF 1151
Query: 812 HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI 871
+ + + +FI+ D+MKSISLL Y+ E + +RD + +V+ + D+
Sbjct: 1152 IDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNA 1211
Query: 872 YLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFR----HGSLVMRLP 924
LG ++ + NL E RL ++H+G VN F G+
Sbjct: 1212 QLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRGATEGLSK 1271
Query: 925 DSDVGQIPTVI-FGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFN 983
S V + + F T++G IG++ + + Y L LQ L ++ GLN +R +
Sbjct: 1272 KSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLH 1331
Query: 984 NEKKTVD--AKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
+++T+ +N LDG+L+ +L LS E++K + +
Sbjct: 1332 VDRRTLQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTT 1371
Score = 40.0 bits (92), Expect = 6.7, Method: Compositional matrix adjust.
Identities = 41/172 (23%), Positives = 74/172 (43%), Gaps = 36/172 (20%)
Query: 288 EVLERYVNLGPIVDFCVVD---LERQGQG------QVVTCSGAYKDGSLRIVRNGIGINE 338
EV + +N+GP + + + L + Q ++V CSG K+G+L +++ I
Sbjct: 411 EVCDSILNIGPCANAAMGEPAFLSEEFQNSPEPDLEIVVCSGHGKNGALSVLQKSIRPQV 470
Query: 339 QASVELQGIKGMWSL-----RSSTDDP-------------------FDTFLVVSFISETR 374
+ EL G MW++ + D+P FL++S T
Sbjct: 471 VTTFELPGCYDMWTVIAPVRKEEEDNPKGEGTEQEARSPEADDDGRRHGFLILSREDSTM 530
Query: 375 ILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQ-LVQVTSGSVRLVSSTSR 425
IL E+ E + GF +Q T+F + N+ +VQV+ +RL+ ++
Sbjct: 531 ILQTG--QEIMELDTSGFATQGPTVFAGNIGDNRYIVQVSPLGIRLLEGVNQ 580
>gi|426361048|ref|XP_004047737.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
[Gorilla gorilla gorilla]
Length = 1440
Score = 102 bits (254), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 115/460 (25%), Positives = 195/460 (42%), Gaps = 62/460 (13%)
Query: 618 VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQS 672
V PF++ P L ++GEL I + +R IPL + + +S
Sbjct: 969 VDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVES 1028
Query: 673 RTFAICSLKNQSCAE-----------------------ESEMHFVRLLDDQTFEFI--ST 707
+ +A+ + N CA + E ++L+ ++E I +
Sbjct: 1029 KVYAVATSTNTPCARIPRMTGEEKEFETIERDERYIHPQQEAFSIQLISPVSWEAIPNAR 1088
Query: 708 YPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-----IVE 757
L +E+ + + S + V Y GT + EE +GRIL+ + E
Sbjct: 1089 IELQEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVT-CRGRILIMDVIEVVPE 1147
Query: 758 DG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECG 811
G K +++ EKE KG V +L NG L++AI QKI L W LR EL
Sbjct: 1148 PGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFL--WSLR---ASELTGMAF 1202
Query: 812 HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI 871
+ + + +FI+ D+MKSISLL Y+ E + +RD + +V+ + D+
Sbjct: 1203 IDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNA 1262
Query: 872 YLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFR----HGSLVMRLP 924
LG ++ + NL E RL ++H+G VN F G+
Sbjct: 1263 QLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRGATEGLSK 1322
Query: 925 DSDVGQIPTVI-FGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFN 983
S V + + F T++G IG++ + + Y L LQ L ++ GLN +R +
Sbjct: 1323 KSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLH 1382
Query: 984 NEKKTVD--AKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
+++T+ +N LDG+L+ +L LS E++K + +
Sbjct: 1383 VDRRTLQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTT 1422
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 104/564 (18%), Positives = 219/564 (38%), Gaps = 150/564 (26%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAM-----GDVSDRIG 60
+G + ++ + G +D L ++ + K V+++D + +L T ++ ++ D
Sbjct: 74 FFGNVMSMASVQLAGAKRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPELRDGFV 133
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPF-------DNKGQLKEA----------FN 103
+ ++ + DPD + +Y V+PF +++G + E +
Sbjct: 134 QNVHTPRVRV-DPDGTCAAMLVYGTRLVVLPFRRESLAEEHEGLVGEGQRSSFLPSYIID 192
Query: 104 IR-LEE--LQVLDIKFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDFVEG-- 153
+R L+E L ++D++FL+G +PT+++L++ N+ A T + + +
Sbjct: 193 VRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKVH 252
Query: 154 P--WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAFKAIPI 198
P WS +L + VP P+ GV++ +++Y + A P+
Sbjct: 253 PVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTGTTAFPL 312
Query: 199 RPSITKAYGRVDADGS--------RYLLGDHAGLLHLL-VITHEKEKVTGLKIELLGETS 249
R T+ R+ D + + ++ G +++L +IT V + +
Sbjct: 313 R---TQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASV 369
Query: 250 IASTISYLDNAVVYIGSSYGDSQLI----KLNLQP--------------------DAKGS 285
+ +++ ++ +++GS G+S L+ KL P DA
Sbjct: 370 LTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASAVREAADKEEPPSKKKRVDATAG 429
Query: 286 YV------------------------------EVLERYVNLGPIVDFCVVD---LERQGQ 312
+ +V + +N+GP + + + L + Q
Sbjct: 430 WSGEGRSRAGQERGQVTQGWSGAGAPLTVAVPQVCDSILNIGPCANAAMGEPAFLSEEFQ 489
Query: 313 G------QVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL-----RSSTDDP- 360
++V CSG K+G+L +++ I + EL G MW++ + D+P
Sbjct: 490 NSPEPDLEIVVCSGHGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEEEDNPK 549
Query: 361 ------------------FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCH 402
FL++S T IL E+ E + GF +Q T+F
Sbjct: 550 GEGTEQEPSTPEADDDGRRHGFLILSREDSTMILQTG--QEIMELDTSGFATQGPTVFAG 607
Query: 403 DAIYNQ-LVQVTSGSVRLVSSTSR 425
+ N+ +VQV+ +RL+ ++
Sbjct: 608 NIGDNRYIVQVSPLGIRLLEGVNQ 631
>gi|395512730|ref|XP_003760588.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
[Sarcophilus harrisii]
Length = 1449
Score = 102 bits (254), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 117/480 (24%), Positives = 200/480 (41%), Gaps = 74/480 (15%)
Query: 618 VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQS 672
+ PF++ P L ++GEL I + +R IPL + + +S
Sbjct: 978 IDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVES 1037
Query: 673 RTFAICSLKNQSC-------AEESEMHFV----------------RLLDDQTFEFI--ST 707
+ +A+ + N C EE E + +L+ ++E I +
Sbjct: 1038 KVYAVATSTNALCTRIPRMTGEEKEFETIERDDRYIHPLQEAFSIQLISPVSWEAIPNAR 1097
Query: 708 YPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-----IVE 757
L+ +E+ + + S + V Y GT + EE +GRIL+ + E
Sbjct: 1098 IELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVT-CRGRILIMDVIEVVPE 1156
Query: 758 DG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECG 811
G K +++ EKE KG V +L NG L++AI QKI L W LR EL
Sbjct: 1157 PGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFL--WSLR---ASELTGMAF 1211
Query: 812 HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI 871
+ + + +FI+ D+MKSISLL Y+ E + +RD + +V+ + D
Sbjct: 1212 IDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDSA 1271
Query: 872 YLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV 928
LG ++ + NL E RL ++H+G VN F R P
Sbjct: 1272 QLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF------WRTPCRGA 1325
Query: 929 GQIPT-----------VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHE 977
+ PT F T++G IG++ + + Y L LQ L ++ GLN
Sbjct: 1326 AEGPTKKSIVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPR 1385
Query: 978 QWRSFNNEKKTVD--AKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRL 1035
+R + +++ + +N LDG+L+ +L LS E++K + + + + + E+ R+
Sbjct: 1386 AFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTMERGELAKKIGTTPDIILDDLLEIDRV 1445
Score = 44.7 bits (104), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 45/216 (20%), Positives = 99/216 (45%), Gaps = 35/216 (16%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAM-----GDVSDRIG 60
+G + ++ + G +D L ++ + K V+++D + +L T ++ ++ D
Sbjct: 77 FFGNVMSMASVQLAGAKRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPELRDGFV 136
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPF-------DNKGQLKEA----------FN 103
+ ++ + DPD R + +Y V+PF +++G + E +
Sbjct: 137 QNVHTPRVRV-DPDGRCAVMLIYGTRLVVLPFRRESLAEEHEGLVGEGQKSSFLPSYIID 195
Query: 104 IR-LEE--LQVLDIKFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDFVEGP- 154
+R L+E L ++D++FL+G +PT+++L++ N+ A T + + ++
Sbjct: 196 VRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNILQKVH 255
Query: 155 ---WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVY 187
WS NL + VP P+ GV+I +++Y
Sbjct: 256 PVIWSLTNLPFDCTQALAVPKPIGGVVIFAVNSLLY 291
>gi|355680843|gb|AER96659.1| cleavage and polyadenylation specific factor 1, 160kDa [Mustela
putorius furo]
Length = 1399
Score = 102 bits (254), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 112/466 (24%), Positives = 193/466 (41%), Gaps = 74/466 (15%)
Query: 618 VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQS 672
+ PF++ P L ++GEL I + +R IPL + + +S
Sbjct: 929 IDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVES 988
Query: 673 RTFAICSLKNQSCAE-----------------------ESEMHFVRLLDDQTFEFI--ST 707
+ +A+ + N C + E ++L+ ++E I +
Sbjct: 989 KVYAVATSTNMPCTRIPRMTGEEKEFEAIERDDRYVHPQQEAFSIQLISPVSWEAIPNAR 1048
Query: 708 YPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-----IVE 757
L+ +E+ + + S + V Y GT + EE +GRIL+ + E
Sbjct: 1049 IELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVT-CRGRILIMDVIEVVPE 1107
Query: 758 DG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECG 811
G K +++ EKE KG V +L NG L++AI QKI L W LR EL
Sbjct: 1108 PGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFL--WSLR---ASELTGMAF 1162
Query: 812 HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI 871
+ + + +FI+ D+MKSISLL Y+ E + +RD + +V+ + D+
Sbjct: 1163 IDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNA 1222
Query: 872 YLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV 928
LG ++ + NL E RL ++H+G VN F R P
Sbjct: 1223 QLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF------WRTPCRGA 1276
Query: 929 GQIPT-----------VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHE 977
+ P+ F T++G IG++ + + Y L LQ L ++ GLN
Sbjct: 1277 AEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPR 1336
Query: 978 QWRSFNNEKKTVD--AKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
+R + +++ + +N LDG+L+ +L LS E++K + +
Sbjct: 1337 AFRLLHADRRALQNAVRNVLDGELLNRYLYLSTMERGELAKKIGTT 1382
Score = 52.8 bits (125), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 110/570 (19%), Positives = 218/570 (38%), Gaps = 158/570 (27%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAM-----GDVSDRIG 60
+G + ++ + G +D L ++ + K V+++D + +L T ++ ++ D
Sbjct: 30 FFGNVMSMASVQLAGAKRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPELRDGFV 89
Query: 61 RPTDNGQIGIIDPDCR----LIGLHLYDGLFKVIPF-------DNKGQLKEA-------- 101
+ ++ + DPD R L + +Y V+PF +++G + E
Sbjct: 90 QNVHAPRVRV-DPDGRCAAMLTAMLIYGSRLVVLPFRRESLAEEHEGLMGEGQRSSFLPS 148
Query: 102 --FNIR-LEE--LQVLDIKFLYGCAKPTIVVLYQDNKD------ARHVKTYEVALK---D 147
++R L+E L ++D++FL+G +PT+++L++ N+ R VA+
Sbjct: 149 YIIDVRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCCIVAISLNIT 208
Query: 148 KDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAFK 194
+ WS +L + VP P+ GV++ +++Y +
Sbjct: 209 QKVHPVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNGLTTGTT 268
Query: 195 AIPIRPSITKAYGRVDADGS--------RYLLGDHAGLLHLL-VITHEKEKVTGLKIELL 245
A P+R T+ R+ D + + ++ G +++L +IT V +
Sbjct: 269 AFPLR---TQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKA 325
Query: 246 GETSIASTISYLDNAVVYIGSSYGDSQLIKL--NLQ--------------PDAKGSYV-- 287
+ + +++ ++ +++GS G+S L+K LQ P +K V
Sbjct: 326 AASVLTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEAPAGAVRETDKDEPPSKKKRVES 385
Query: 288 ------------------------------------EVLERYVNLGPIVDFCVVD---LE 308
EV + +N+GP + + + L
Sbjct: 386 AVGWSGGKSAPQDEVDEIEVYGSEAQSGTQLATYSFEVCDSILNIGPCANAAMGEPAFLS 445
Query: 309 RQGQG------QVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWS---------- 352
+ Q ++V CSG K+G+L +++ I + EL G MW+
Sbjct: 446 EEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPARKEQE 505
Query: 353 --------------LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQT 398
L + D FL++S T IL E+ E + GF +Q T
Sbjct: 506 ETPKGDGAEQEPSALEADDDGRRHGFLILSREDSTMILQTG--QEIMELDTSGFATQGPT 563
Query: 399 LFC---HDAIYNQLVQVTSGSVRLVSSTSR 425
+F D Y +VQV+ +RL+ S+
Sbjct: 564 VFAGNIGDGRY--IVQVSPLGIRLLEGVSQ 591
>gi|355698297|gb|EHH28845.1| Cleavage and polyadenylation specificity factor 160 kDa subunit
[Macaca mulatta]
Length = 1436
Score = 102 bits (254), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 115/460 (25%), Positives = 195/460 (42%), Gaps = 62/460 (13%)
Query: 618 VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQS 672
V PF++ P L ++GEL I + +R IPL + + +S
Sbjct: 965 VDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVES 1024
Query: 673 RTFAICSLKNQSCAE-----------------------ESEMHFVRLLDDQTFEFI--ST 707
+ +A+ + N CA + E ++L+ ++E I +
Sbjct: 1025 KVYAVATSTNTPCARIPRMTGEEKEFETIERDERYIHPQQEAFSIQLISPVSWEAIPNAR 1084
Query: 708 YPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-----IVE 757
L +E+ + + S + V Y GT + EE +GRIL+ + E
Sbjct: 1085 IELQEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVT-CRGRILIMDVIEVVPE 1143
Query: 758 DG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECG 811
G K +++ EKE KG V +L NG L++AI QKI L W LR EL
Sbjct: 1144 PGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFL--WSLR---ASELTGMAF 1198
Query: 812 HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI 871
+ + + +FI+ D+MKSISLL Y+ E + +RD + +V+ + D+
Sbjct: 1199 IDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNA 1258
Query: 872 YLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFR----HGSLVMRLP 924
LG ++ + NL E RL ++H+G VN F G+
Sbjct: 1259 QLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRGATEGLSK 1318
Query: 925 DSDVGQIPTVI-FGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFN 983
S V + + F T++G IG++ + + Y L LQ L ++ GLN +R +
Sbjct: 1319 KSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLH 1378
Query: 984 NEKKTVD--AKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
+++T+ +N LDG+L+ +L LS E++K + +
Sbjct: 1379 VDRRTLQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTT 1418
Score = 46.2 bits (108), Expect = 0.098, Method: Compositional matrix adjust.
Identities = 61/326 (18%), Positives = 141/326 (43%), Gaps = 60/326 (18%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAM-----GDVSDRIG 60
+G + ++ + G +D L ++ + K V+++D + +L T ++ ++ D
Sbjct: 74 FFGNVMSMASVQLAGAKRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPELRDGFV 133
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPF-------DNKGQLKEA----------FN 103
+ ++ + DPD R + +Y V+PF +++G + E +
Sbjct: 134 QNVHTPRVRV-DPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGEGQRSSFLPSYIID 192
Query: 104 IR-LEE--LQVLDIKFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDFVEG-- 153
+R L+E L ++D++FL+G +PT+++L++ N+ A T + + +
Sbjct: 193 VRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKVH 252
Query: 154 P--WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAFKAIPI 198
P WS +L + VP P+ GV++ +++Y + A P+
Sbjct: 253 PVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTGTTAFPL 312
Query: 199 RPSITKAYGRVDADGS--------RYLLGDHAGLLHLL-VITHEKEKVTGLKIELLGETS 249
R T+ R+ D + + ++ G +++L +IT V + +
Sbjct: 313 R---TQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASV 369
Query: 250 IASTISYLDNAVVYIGSSYGDSQLIK 275
+ +++ ++ +++GS G+S L+K
Sbjct: 370 LTTSMVTMEPGYLFLGSRLGNSLLLK 395
>gi|410042329|ref|XP_003954555.1| PREDICTED: LOW QUALITY PROTEIN: cleavage and polyadenylation
specificity factor subunit 1 [Pan troglodytes]
Length = 1296
Score = 102 bits (253), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 115/460 (25%), Positives = 195/460 (42%), Gaps = 62/460 (13%)
Query: 618 VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQS 672
V PF++ P L ++GEL I + +R IPL + + +S
Sbjct: 825 VDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVES 884
Query: 673 RTFAICSLKNQSCAE-----------------------ESEMHFVRLLDDQTFEFI--ST 707
+ +A+ + N CA + E ++L+ ++E I +
Sbjct: 885 KVYAVATSTNTPCARIPRMTGEEKEFETIERDERYIHPQQEAFSIQLISPVSWEAIPNAR 944
Query: 708 YPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-----IVE 757
L +E+ + + S + V Y GT + EE +GRIL+ + E
Sbjct: 945 IELQEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVT-CRGRILIMDVIEVVPE 1003
Query: 758 DG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECG 811
G K +++ EKE KG V +L NG L++AI QKI L W LR EL
Sbjct: 1004 PGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFL--WSLR---ASELTGMAF 1058
Query: 812 HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI 871
+ + + +FI+ D+MKSISLL Y+ E + +RD + +V+ + D+
Sbjct: 1059 IDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNA 1118
Query: 872 YLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFR----HGSLVMRLP 924
LG ++ + NL E RL ++H+G VN F G+
Sbjct: 1119 QLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRGATEGLSK 1178
Query: 925 DSDVGQIPTVI-FGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFN 983
S V + + F T++G IG++ + + Y L LQ L ++ GLN +R +
Sbjct: 1179 KSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLH 1238
Query: 984 NEKKTVD--AKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
+++T+ +N LDG+L+ +L LS E++K + +
Sbjct: 1239 VDRRTLQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTT 1278
Score = 46.2 bits (108), Expect = 0.087, Method: Compositional matrix adjust.
Identities = 61/326 (18%), Positives = 141/326 (43%), Gaps = 60/326 (18%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAM-----GDVSDRIG 60
+G + ++ + G +D L ++ + K V+++D + +L T ++ ++ D
Sbjct: 74 FFGNVMSMASVQLAGAKRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPELRDGFV 133
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPF-------DNKGQLKEA----------FN 103
+ ++ + DPD R + +Y V+PF +++G + E +
Sbjct: 134 QNVHTPRVRV-DPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGEGQRSSFLPSYIID 192
Query: 104 IR-LEE--LQVLDIKFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDFVEG-- 153
+R L+E L ++D++FL+G +PT+++L++ N+ A T + + +
Sbjct: 193 VRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKVH 252
Query: 154 P--WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAFKAIPI 198
P WS +L + VP P+ GV++ +++Y + A P+
Sbjct: 253 PVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTGTTAFPL 312
Query: 199 RPSITKAYGRVDADGS--------RYLLGDHAGLLHLL-VITHEKEKVTGLKIELLGETS 249
R T+ R+ D + + ++ G +++L +IT V + +
Sbjct: 313 R---TQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASV 369
Query: 250 IASTISYLDNAVVYIGSSYGDSQLIK 275
+ +++ ++ +++GS G+S L+K
Sbjct: 370 LTTSMVTMEPGYLFLGSRLGNSLLLK 395
>gi|119602512|gb|EAW82106.1| cleavage and polyadenylation specific factor 1, 160kDa, isoform CRA_a
[Homo sapiens]
gi|119602513|gb|EAW82107.1| cleavage and polyadenylation specific factor 1, 160kDa, isoform CRA_a
[Homo sapiens]
gi|119602514|gb|EAW82108.1| cleavage and polyadenylation specific factor 1, 160kDa, isoform CRA_a
[Homo sapiens]
Length = 1365
Score = 102 bits (253), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 114/460 (24%), Positives = 195/460 (42%), Gaps = 62/460 (13%)
Query: 618 VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQS 672
V PF++ P L ++GEL I + +R IPL + + +S
Sbjct: 894 VDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVES 953
Query: 673 RTFAICSLKNQSCAE-----------------------ESEMHFVRLLDDQTFEFI--ST 707
+ +A+ + N CA + E ++L+ ++E I +
Sbjct: 954 KVYAVATSTNTPCARIPRMTGEEKEFETIERDERYIHPQQEAFSIQLISPVSWEAIPNAR 1013
Query: 708 YPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-----IVE 757
L +E+ + + S + V Y GT + EE +GRIL+ + E
Sbjct: 1014 IELQEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVT-CRGRILIMDVIEVVPE 1072
Query: 758 DG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECG 811
G K +++ EKE KG V +L NG L++AI QKI + W LR EL
Sbjct: 1073 PGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKI--FLWSLR---ASELTGMAF 1127
Query: 812 HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI 871
+ + + +FI+ D+MKSISLL Y+ E + +RD + +V+ + D+
Sbjct: 1128 IDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNA 1187
Query: 872 YLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFR----HGSLVMRLP 924
LG ++ + NL E RL ++H+G VN F G+
Sbjct: 1188 QLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRGATEGLSK 1247
Query: 925 DSDVGQIPTVI-FGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFN 983
S V + + F T++G IG++ + + Y L LQ L ++ GLN +R +
Sbjct: 1248 KSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLH 1307
Query: 984 NEKKTVD--AKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
+++T+ +N LDG+L+ +L LS E++K + +
Sbjct: 1308 VDRRTLQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTT 1347
Score = 53.5 bits (127), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 106/553 (19%), Positives = 214/553 (38%), Gaps = 153/553 (27%)
Query: 20 GEAQDFLFIATERYKFCVLQWDAESSELITRAM-----GDVSDRIGRPTDNGQIGIIDPD 74
G +D L ++ + K V+++D + +L T ++ ++ D + ++ + DPD
Sbjct: 10 GAKRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPELRDGFVQNVHTPRVRV-DPD 68
Query: 75 CRLIGLHLYDGLFKVIPF-------DNKGQLKEA----------FNIR-LEE--LQVLDI 114
R + +Y V+PF +++G + E ++R L+E L ++D+
Sbjct: 69 GRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGEGQRSSFLPSYIIDVRALDEKLLNIIDL 128
Query: 115 KFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDFVEG--P--WSQNNLDNGAD 165
+FL+G +PT+++L++ N+ A T + + + P WS +L
Sbjct: 129 QFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKVHPVIWSLTSLPFDCT 188
Query: 166 LLIPVPPPLCGVLIIGEETIVYCS-------------ANAFKAIPIRPSITKAYGRVDAD 212
+ VP P+ GV++ +++Y + A P+R T+ R+ D
Sbjct: 189 QALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTGTTAFPLR---TQEGVRITLD 245
Query: 213 GS--------RYLLGDHAGLLHLL-VITHEKEKVTGLKIELLGETSIASTISYLDNAVVY 263
+ + ++ G +++L +IT V + + + +++ ++ ++
Sbjct: 246 CAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLTTSMVTMEPGYLF 305
Query: 264 IGSSYGDSQLI----KLNLQP--------------------DAKGSY------------- 286
+GS G+S L+ KL P DA +
Sbjct: 306 LGSRLGNSLLLKYTEKLQEPPASAVREAADKEEPPSKKKRVDATAGWSAAGKSVPQDEVD 365
Query: 287 -------------------VEVLERYVNLGPIVDFCVVD---LERQGQG------QVVTC 318
EV + +N+GP + V + L + Q ++V C
Sbjct: 366 EIEVYGSEAQSGTQLATYSFEVCDSILNIGPCANAAVGEPAFLSEEFQNSPEPDLEIVVC 425
Query: 319 SGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL-----RSSTDDP------------- 360
SG K+G+L +++ I + EL G MW++ + D+P
Sbjct: 426 SGHGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEEEDNPKGEGTEQEPSTTP 485
Query: 361 -------FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQ-LVQV 412
FL++S T IL E+ E + GF +Q T+F + N+ +VQV
Sbjct: 486 EADDDGRRHGFLILSREDSTMILQTG--QEIMELDTSGFATQGPTVFAGNIGDNRYIVQV 543
Query: 413 TSGSVRLVSSTSR 425
+ +RL+ ++
Sbjct: 544 SPLGIRLLEGVNQ 556
>gi|16751835|ref|NP_444423.1| cleavage and polyadenylation specificity factor subunit 1 isoform 2
[Mus musculus]
gi|17374611|sp|Q9EPU4.1|CPSF1_MOUSE RecName: Full=Cleavage and polyadenylation specificity factor subunit
1; AltName: Full=Cleavage and polyadenylation specificity
factor 160 kDa subunit; Short=CPSF 160 kDa subunit
gi|11762096|gb|AAG40326.1|AF322193_1 cleavage and polyadenylation specificity factor 1 [Mus musculus]
gi|38614159|gb|AAH56388.1| Cleavage and polyadenylation specific factor 1 [Mus musculus]
Length = 1441
Score = 102 bits (253), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 112/466 (24%), Positives = 193/466 (41%), Gaps = 74/466 (15%)
Query: 618 VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQS 672
+ PF++ P L ++GEL I + +R IPL + + +S
Sbjct: 970 IDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVES 1029
Query: 673 RTFAICSLKNQSCAE-----------------------ESEMHFVRLLDDQTFEFI--ST 707
+ +A+ + N C + E ++L+ ++E I +
Sbjct: 1030 KVYAVATSTNTPCTRIPRMTGEEKEFEAIERDDRYIHPQQEAFSIQLISPVSWEAIPNAR 1089
Query: 708 YPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-----IVE 757
L+ +E+ + + S + V Y GT + EE +GRIL+ + E
Sbjct: 1090 IELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVT-CRGRILIMDVIEVVPE 1148
Query: 758 DG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECG 811
G K +++ EKE KG V +L NG L++AI QKI L W LR EL
Sbjct: 1149 PGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFL--WSLR---ASELTGMAF 1203
Query: 812 HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI 871
+ + + +FI+ D+MKSISLL Y+ E + +RD + +V+ + D+
Sbjct: 1204 IDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNA 1263
Query: 872 YLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV 928
LG ++ + NL E RL ++H+G VN F R P
Sbjct: 1264 QLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF------WRTPCRGA 1317
Query: 929 GQIPT-----------VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHE 977
+ P+ F T++G IG++ + + Y L LQ L ++ GLN
Sbjct: 1318 AEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPR 1377
Query: 978 QWRSFNNEKKTVD--AKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
+R + +++ + +N LDG+L+ +L LS E++K + +
Sbjct: 1378 AFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTT 1423
Score = 59.3 bits (142), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 108/565 (19%), Positives = 220/565 (38%), Gaps = 151/565 (26%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAM-----GDVSDRIG 60
+G + ++ + G +D L ++ + K V+++D + +L T ++ ++ D
Sbjct: 74 FFGNVMSMASVQLAGAKRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPELRDGFV 133
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPF-------DNKGQLKEA----------FN 103
+ ++ + DPD R + +Y V+PF +++G + E +
Sbjct: 134 QNVHTPRVRV-DPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLMGEGQRSSFLPSYIID 192
Query: 104 IR-LEE--LQVLDIKFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDFVEG-- 153
+R L+E L ++D++FL+G +PT+++L++ N+ A T + + +
Sbjct: 193 VRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKVH 252
Query: 154 P--WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAFKAIPI 198
P WS +L + VP P+ GV+I +++Y + A P+
Sbjct: 253 PVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFPL 312
Query: 199 RPSITKAYGRVDADGS--------RYLLGDHAGLLHLL-VITHEKEKVTGLKIELLGETS 249
R T+ R+ D + + ++ G +++L +IT V + +
Sbjct: 313 R---TQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASV 369
Query: 250 IASTISYLDNAVVYIGSSYGDSQLIKL--NLQ---------------PDAKGSYVE---- 288
+ +++ ++ +++GS G+S L+K LQ P +K VE
Sbjct: 370 LTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASSVREAADKEEPPSKKKRVEPAVG 429
Query: 289 ----------------------------------VLERYVNLGPIVDFCVVD---LERQG 311
V + +N+GP + V + L +
Sbjct: 430 WTGGKTVPQDEVDEIEVYGSEAQSGTQLATYSFEVCDSMLNIGPCANAAVGEPAFLSEEF 489
Query: 312 QG------QVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL------------ 353
Q ++V CSG K+G+L +++ I + EL G MW++
Sbjct: 490 QNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEEEETP 549
Query: 354 ------------RSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFC 401
++ D FL++S T IL E+ E + GF +Q T+F
Sbjct: 550 KAESTEQEPSAPKAEEDGRRHGFLILSREDSTMILQTG--QEIMELDTSGFATQGPTVFA 607
Query: 402 HDAIYNQ-LVQVTSGSVRLVSSTSR 425
+ N+ +VQV+ +RL+ ++
Sbjct: 608 GNIGDNRYIVQVSPLGIRLLEGVNQ 632
>gi|348512553|ref|XP_003443807.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Oreochromis niloticus]
Length = 1456
Score = 102 bits (253), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 115/483 (23%), Positives = 208/483 (43%), Gaps = 80/483 (16%)
Query: 618 VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQS 672
+ PF++ P L K+GEL I + +R IPL + + +S
Sbjct: 985 IESFSPFHNINCPKGFLYFNKQGELRISVLPTYLSYDAPWPVRKIPLRCTVHYVSYHVES 1044
Query: 673 RTFAICSLKNQSCAE-----------------------ESEMHFVRLLDDQTFEFISTYP 709
+ +A+C+ + C + E ++L+ ++E I
Sbjct: 1045 KVYAVCTSVKEPCTRIPRMTGEEKEYEVIERDERYIHPQQEKFSIQLISPVSWEAIPNTR 1104
Query: 710 LD--TFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-----IVE 757
+D +E+ + + + V Y GT + EE +GRIL+ + E
Sbjct: 1105 IDLEEWEHVTCMKTVALRSQETVSGLKGYIAAGTCLMQGEEVT-CRGRILILDVIEVVPE 1163
Query: 758 DG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDD---GTRELQS 808
G K +++ EKE KG V +L NG L++AI QKI L W+L+D+ G + +
Sbjct: 1164 PGQPLTKNKFKVLYEKEQKGPVTALCHCNGYLVSAIGQKIFL--WVLKDNDLTGMAFIDT 1221
Query: 809 ECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILD 868
+ H + +FI+ DLMKSISLL Y+ E + +RD + ++E +
Sbjct: 1222 QLYIHQMF------SIKNFILAADLMKSISLLRYQEESKTLSLVSRDAKPLEVYSIEFMV 1275
Query: 869 DDIYLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLP- 924
D+ LG ++ + NL+ E RL +++ G +N F R+P
Sbjct: 1276 DNNQLGFLVSDRDKNLYVYMYLPEAKESFGGMRLLRRADFNAGANINTF------WRMPC 1329
Query: 925 ----DSDVGQIPT------VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGL 974
D+ + T F T++G IG++ + + Y L LQ L ++ GL
Sbjct: 1330 RGALDASSKKALTWDNKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGL 1389
Query: 975 NHEQWRSFNNEKKTVD--AKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
N + +R +++++++ KN LDG+L+ +L LS E++K + + + + + E+
Sbjct: 1390 NPKAFRMLHSDRRSLQNPVKNILDGELLNKYLYLSMMERSELAKKIGTTQDIILDDLLEI 1449
Query: 1033 TRL 1035
R+
Sbjct: 1450 DRV 1452
Score = 49.7 bits (117), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 63/323 (19%), Positives = 136/323 (42%), Gaps = 54/323 (16%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRP--T 63
++G I ++ + G ++D L ++ + K V+++D + +L T ++ + R
Sbjct: 74 LFGNIMSMASVQLVGASRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPELRDGFV 133
Query: 64 DNGQIGII--DPDCRLIGLHLYDGLFKVIPFDN---------------KGQLKEAFNIRL 106
N I ++ DP+ R + +Y V+PF K ++ I +
Sbjct: 134 QNVHIPVVRVDPENRCAVMLVYGTKLVVLPFRKDTLTDEQESGVGEGPKSSFLPSYIIDV 193
Query: 107 EEL-----QVLDIKFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDFVEGP-- 154
EL ++D+KFL+G +PT+++L++ N+ A T + + ++
Sbjct: 194 RELDEKLLNIIDMKFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNIMQKVHP 253
Query: 155 --WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAFKAIPIR 199
WS +NL ++ VP P+ GV++ +++Y + N A P+R
Sbjct: 254 VIWSLSNLPFDCTQVMAVPKPIGGVVVFAVNSLLYLNQSVPPYGVSLNSQTNGTTAFPLR 313
Query: 200 PSITKAYGRVDADGSRYLLGD------HAGLLHLL-VITHEKEKVTGLKIELLGETSIAS 252
+ +D S ++ D G +++L +IT V + + + +
Sbjct: 314 VQ-DEVKLTLDCCQSDFIAYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLTT 372
Query: 253 TISYLDNAVVYIGSSYGDSQLIK 275
+ ++ +++GS G+S L+K
Sbjct: 373 CMVTMEPGYLFLGSRLGNSLLLK 395
>gi|56676371|ref|NP_037423.2| cleavage and polyadenylation specificity factor subunit 1 [Homo
sapiens]
gi|23503048|sp|Q10570.2|CPSF1_HUMAN RecName: Full=Cleavage and polyadenylation specificity factor subunit
1; AltName: Full=Cleavage and polyadenylation specificity
factor 160 kDa subunit; Short=CPSF 160 kDa subunit
gi|16878041|gb|AAH17232.1| Cleavage and polyadenylation specific factor 1, 160kDa [Homo sapiens]
gi|119602516|gb|EAW82110.1| cleavage and polyadenylation specific factor 1, 160kDa, isoform CRA_c
[Homo sapiens]
gi|123993607|gb|ABM84405.1| cleavage and polyadenylation specific factor 1, 160kDa [synthetic
construct]
gi|123999626|gb|ABM87355.1| cleavage and polyadenylation specific factor 1, 160kDa [synthetic
construct]
gi|307684758|dbj|BAJ20419.1| cleavage and polyadenylation specific factor 1, 160kDa [synthetic
construct]
Length = 1443
Score = 102 bits (253), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 115/460 (25%), Positives = 195/460 (42%), Gaps = 62/460 (13%)
Query: 618 VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQS 672
V PF++ P L ++GEL I + +R IPL + + +S
Sbjct: 972 VDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVES 1031
Query: 673 RTFAICSLKNQSCAE-----------------------ESEMHFVRLLDDQTFEFI--ST 707
+ +A+ + N CA + E ++L+ ++E I +
Sbjct: 1032 KVYAVATSTNTPCARIPRMTGEEKEFETIERDERYIHPQQEAFSIQLISPVSWEAIPNAR 1091
Query: 708 YPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-----IVE 757
L +E+ + + S + V Y GT + EE +GRIL+ + E
Sbjct: 1092 IELQEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVT-CRGRILIMDVIEVVPE 1150
Query: 758 DG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECG 811
G K +++ EKE KG V +L NG L++AI QKI L W LR EL
Sbjct: 1151 PGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFL--WSLR---ASELTGMAF 1205
Query: 812 HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI 871
+ + + +FI+ D+MKSISLL Y+ E + +RD + +V+ + D+
Sbjct: 1206 IDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNA 1265
Query: 872 YLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFR----HGSLVMRLP 924
LG ++ + NL E RL ++H+G VN F G+
Sbjct: 1266 QLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRGATEGLSK 1325
Query: 925 DSDVGQIPTVI-FGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFN 983
S V + + F T++G IG++ + + Y L LQ L ++ GLN +R +
Sbjct: 1326 KSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLH 1385
Query: 984 NEKKTVD--AKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
+++T+ +N LDG+L+ +L LS E++K + +
Sbjct: 1386 VDRRTLQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTT 1425
Score = 55.8 bits (133), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 107/567 (18%), Positives = 220/567 (38%), Gaps = 153/567 (26%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAM-----GDVSDRIG 60
+G + ++ + G +D L ++ + K V+++D + +L T ++ ++ D
Sbjct: 74 FFGNVMSMASVQLAGAKRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPELRDGFV 133
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPF-------DNKGQLKEA----------FN 103
+ ++ + DPD R + +Y V+PF +++G + E +
Sbjct: 134 QNVHTPRVRV-DPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGEGQRSSFLPSYIID 192
Query: 104 IR-LEE--LQVLDIKFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDFVEG-- 153
+R L+E L ++D++FL+G +PT+++L++ N+ A T + + +
Sbjct: 193 VRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKVH 252
Query: 154 P--WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAFKAIPI 198
P WS +L + VP P+ GV++ +++Y + A P+
Sbjct: 253 PVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTGTTAFPL 312
Query: 199 RPSITKAYGRVDADGS--------RYLLGDHAGLLHLL-VITHEKEKVTGLKIELLGETS 249
R T+ R+ D + + ++ G +++L +IT V + +
Sbjct: 313 R---TQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASV 369
Query: 250 IASTISYLDNAVVYIGSSYGDSQLI----KLNLQP--------------------DAKGS 285
+ +++ ++ +++GS G+S L+ KL P DA
Sbjct: 370 LTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASAVREAADKEEPPSKKKRVDATAG 429
Query: 286 Y--------------------------------VEVLERYVNLGPIVDFCVVD---LERQ 310
+ EV + +N+GP + V + L +
Sbjct: 430 WSAAGKSVPQDEVDEIEVYGSEAQSGTQLATYSFEVCDSILNIGPCANAAVGEPAFLSEE 489
Query: 311 GQG------QVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL-----RSSTDD 359
Q ++V CSG K+G+L +++ I + EL G MW++ + D+
Sbjct: 490 FQNSPEPDLEIVVCSGHGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEEEDN 549
Query: 360 P--------------------FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTL 399
P FL++S T IL E+ E + GF +Q T+
Sbjct: 550 PKGEGTEQEPSTTPEADDDGRRHGFLILSREDSTMILQTG--QEIMELDTSGFATQGPTV 607
Query: 400 FCHDAIYNQ-LVQVTSGSVRLVSSTSR 425
F + N+ +VQV+ +RL+ ++
Sbjct: 608 FAGNIGDNRYIVQVSPLGIRLLEGVNQ 634
>gi|410987992|ref|XP_004000273.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
[Felis catus]
Length = 1432
Score = 102 bits (253), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 113/480 (23%), Positives = 200/480 (41%), Gaps = 74/480 (15%)
Query: 618 VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQS 672
+ PF++ P L ++GEL I + +R IPL + + +S
Sbjct: 961 IDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVES 1020
Query: 673 RTFAICSLKNQSCAE-----------------------ESEMHFVRLLDDQTFEFI--ST 707
+ +A+ + N C + E ++L+ ++E I +
Sbjct: 1021 KVYAVATSTNMPCTRIPRMTGEEKEFETIERDDRYIHPQQEAFSIQLISPVSWEAIPNAR 1080
Query: 708 YPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-----IVE 757
L+ +E+ + + S + V Y GT + EE +GRIL+ + E
Sbjct: 1081 IELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVT-CRGRILIMDVIEVVPE 1139
Query: 758 DG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECG 811
G K +++ EKE KG V +L NG L++AI QKI + W LR EL
Sbjct: 1140 PGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKI--FLWSLR---ASELTGMAF 1194
Query: 812 HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI 871
+ + + +FI+ D+MKSISLL Y+ E + +RD + +V+ + D+
Sbjct: 1195 IDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNA 1254
Query: 872 YLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV 928
LG ++ + NL E RL ++H+G VN F R P
Sbjct: 1255 QLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF------WRTPCRGA 1308
Query: 929 GQIPT-----------VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHE 977
+ P+ F T++G IG++ + + Y L LQ L ++ GLN
Sbjct: 1309 AEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPR 1368
Query: 978 QWRSFNNEKKTVD--AKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRL 1035
+R + +++ + +N LDG+L+ +L LS E++K + + + + + + E R+
Sbjct: 1369 AFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTMERGELAKKIGTTPDIILEDLLETDRV 1428
Score = 46.6 bits (109), Expect = 0.068, Method: Compositional matrix adjust.
Identities = 61/326 (18%), Positives = 141/326 (43%), Gaps = 60/326 (18%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAM-----GDVSDRIG 60
+G + ++ + G +D L ++ + K V+++D + +L T ++ ++ D
Sbjct: 65 FFGNVMSMASVQLAGAKRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPELRDGFV 124
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPF-------DNKGQLKEA----------FN 103
+ ++ + DPD R + +Y V+PF +++G + E +
Sbjct: 125 QNVHTPRVRV-DPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLMGEGQRSSFLPSYIID 183
Query: 104 IR-LEE--LQVLDIKFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDFVEG-- 153
+R L+E L ++D++FL+G +PT+++L++ N+ A T + + +
Sbjct: 184 VRGLDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKVH 243
Query: 154 P--WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAFKAIPI 198
P WS +L + VP P+ GV++ +++Y + A P+
Sbjct: 244 PVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNGLTTGTTAFPL 303
Query: 199 RPSITKAYGRVDADGS--------RYLLGDHAGLLHLL-VITHEKEKVTGLKIELLGETS 249
R T+ R+ D + + ++ G +++L +IT V + +
Sbjct: 304 R---TQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASV 360
Query: 250 IASTISYLDNAVVYIGSSYGDSQLIK 275
+ +++ ++ +++GS G+S L+K
Sbjct: 361 LTTSMVTMEPGYLFLGSRLGNSLLLK 386
Score = 40.0 bits (92), Expect = 7.2, Method: Compositional matrix adjust.
Identities = 41/172 (23%), Positives = 72/172 (41%), Gaps = 36/172 (20%)
Query: 288 EVLERYVNLGPIVDFCVVD---LERQGQG------QVVTCSGAYKDGSLRIVRNGIGINE 338
EV + +N+GP + + + L + Q ++V CSG K+G+L +++ I
Sbjct: 454 EVCDSILNIGPCANAAMGEPAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQV 513
Query: 339 QASVELQGIKGMW------------------------SLRSSTDDPFDTFLVVSFISETR 374
+ EL G MW +L + D FL++S T
Sbjct: 514 VTTFELPGCYDMWTVIAPVRKEQEETSKGEGAEQEPSTLEAEDDGRRHGFLILSREDSTM 573
Query: 375 ILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQ-LVQVTSGSVRLVSSTSR 425
IL E+ E + GF +Q T+F + N+ +VQV+ +RL+ ++
Sbjct: 574 ILQTG--QEIMELDTSGFATQGPTVFAGNIGDNRYIVQVSPLGIRLLEGVNQ 623
>gi|392306997|ref|NP_001254722.1| cleavage and polyadenylation specificity factor subunit 1 [Macaca
mulatta]
gi|380812168|gb|AFE77959.1| cleavage and polyadenylation specificity factor subunit 1 [Macaca
mulatta]
gi|383417835|gb|AFH32131.1| cleavage and polyadenylation specificity factor subunit 1 [Macaca
mulatta]
Length = 1442
Score = 102 bits (253), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 115/460 (25%), Positives = 195/460 (42%), Gaps = 62/460 (13%)
Query: 618 VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQS 672
V PF++ P L ++GEL I + +R IPL + + +S
Sbjct: 971 VDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVES 1030
Query: 673 RTFAICSLKNQSCAE-----------------------ESEMHFVRLLDDQTFEFI--ST 707
+ +A+ + N CA + E ++L+ ++E I +
Sbjct: 1031 KVYAVATSTNTPCARIPRMTGEEKEFETIERDERYIHPQQEAFSIQLISPVSWEAIPNAR 1090
Query: 708 YPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-----IVE 757
L +E+ + + S + V Y GT + EE +GRIL+ + E
Sbjct: 1091 IELQEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVT-CRGRILIMDVIEVVPE 1149
Query: 758 DG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECG 811
G K +++ EKE KG V +L NG L++AI QKI L W LR EL
Sbjct: 1150 PGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFL--WSLR---ASELTGMAF 1204
Query: 812 HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI 871
+ + + +FI+ D+MKSISLL Y+ E + +RD + +V+ + D+
Sbjct: 1205 IDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNA 1264
Query: 872 YLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFR----HGSLVMRLP 924
LG ++ + NL E RL ++H+G VN F G+
Sbjct: 1265 QLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRGATEGLSK 1324
Query: 925 DSDVGQIPTVI-FGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFN 983
S V + + F T++G IG++ + + Y L LQ L ++ GLN +R +
Sbjct: 1325 KSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLH 1384
Query: 984 NEKKTVD--AKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
+++T+ +N LDG+L+ +L LS E++K + +
Sbjct: 1385 VDRRTLQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTT 1424
Score = 56.6 bits (135), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 107/566 (18%), Positives = 221/566 (39%), Gaps = 152/566 (26%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAM-----GDVSDRIG 60
+G + ++ + G +D L ++ + K V+++D + +L T ++ ++ D
Sbjct: 74 FFGNVMSMASVQLAGAKRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPELRDGFV 133
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPF-------DNKGQLKEA----------FN 103
+ ++ + DPD R + +Y V+PF +++G + E +
Sbjct: 134 QNVHTPRVRV-DPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGEGQRSSFLPSYIID 192
Query: 104 IR-LEE--LQVLDIKFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDFVEG-- 153
+R L+E L ++D++FL+G +PT+++L++ N+ A T + + +
Sbjct: 193 VRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKVH 252
Query: 154 P--WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAFKAIPI 198
P WS +L + VP P+ GV++ +++Y + A P+
Sbjct: 253 PVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTGTTAFPL 312
Query: 199 RPSITKAYGRVDADGS--------RYLLGDHAGLLHLL-VITHEKEKVTGLKIELLGETS 249
R T+ R+ D + + ++ G +++L +IT V + +
Sbjct: 313 R---TQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASV 369
Query: 250 IASTISYLDNAVVYIGSSYGDSQLI----KLNLQP--------------------DAKGS 285
+ +++ ++ +++GS G+S L+ KL P DA S
Sbjct: 370 LTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASAVREAADKEEPPSKKKRVDATAS 429
Query: 286 Y--------------------------------VEVLERYVNLGPIVDFCVVD---LERQ 310
+ EV + +N+GP + + + L +
Sbjct: 430 WSAGGKSVPQDEVDEIEVYGSEAQSGTQLATYSFEVCDSILNIGPCANAAMGEPAFLSEE 489
Query: 311 GQG------QVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL-----RSSTDD 359
Q ++V CSG K+G+L +++ I + EL G MW++ + D+
Sbjct: 490 FQNSPEPDLEIVVCSGHGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEEEDN 549
Query: 360 P-------------------FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLF 400
P FL++S T IL E+ E + GF +Q T+F
Sbjct: 550 PKGEGTEQEARSPEADDDGRRHGFLILSREDSTMILQTG--QEIMELDTSGFATQGPTVF 607
Query: 401 CHDAIYNQ-LVQVTSGSVRLVSSTSR 425
+ N+ +VQV+ +RL+ ++
Sbjct: 608 AGNIGDNRYIVQVSPLGIRLLEGVNQ 633
>gi|354491126|ref|XP_003507707.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
isoform 3 [Cricetulus griseus]
Length = 1449
Score = 102 bits (253), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 112/466 (24%), Positives = 193/466 (41%), Gaps = 74/466 (15%)
Query: 618 VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQS 672
+ PF++ P L ++GEL I + +R IPL + + +S
Sbjct: 970 IDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVES 1029
Query: 673 RTFAICSLKNQSCAE-----------------------ESEMHFVRLLDDQTFEFI--ST 707
+ +A+ + N C + E ++L+ ++E I +
Sbjct: 1030 KVYAVATSTNTPCTRIPRMTGEEKEFEAIERDDRYIHPQQEAFSIQLISPVSWEAIPNAR 1089
Query: 708 YPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-----IVE 757
L+ +E+ + + S + V Y GT + EE +GRIL+ + E
Sbjct: 1090 IELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVT-CRGRILIMDVIEVVPE 1148
Query: 758 DG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECG 811
G K +++ EKE KG V +L NG L++AI QKI L W LR EL
Sbjct: 1149 PGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFL--WSLR---ASELTGMAF 1203
Query: 812 HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI 871
+ + + +FI+ D+MKSISLL Y+ E + +RD + +V+ + D+
Sbjct: 1204 IDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNA 1263
Query: 872 YLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV 928
LG ++ + NL E RL ++H+G VN F R P
Sbjct: 1264 QLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF------WRTPCRGA 1317
Query: 929 GQIPT-----------VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHE 977
+ P+ F T++G IG++ + + Y L LQ L ++ GLN
Sbjct: 1318 AEGPSKKSVMWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPR 1377
Query: 978 QWRSFNNEKKTVD--AKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
+R + +++ + +N LDG+L+ +L LS E++K + +
Sbjct: 1378 AFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTT 1423
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 107/565 (18%), Positives = 220/565 (38%), Gaps = 151/565 (26%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAM-----GDVSDRIG 60
+G + ++ + G +D L ++ + K V+++D + +L T ++ ++ D
Sbjct: 74 FFGNVMSMASVQLAGAKRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEESELRDGFV 133
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPF-------DNKGQLKEA----------FN 103
+ ++ + DPD R + +Y V+PF +++G + E +
Sbjct: 134 QNVHTPRVRV-DPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLMGEGQRSSFLPSYIID 192
Query: 104 IR-LEE--LQVLDIKFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDFVEG-- 153
+R L+E L ++D++FL+G +PT+++L++ N+ A T + + +
Sbjct: 193 VRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKVH 252
Query: 154 P--WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAFKAIPI 198
P WS +L + VP P+ GV+I +++Y + A P+
Sbjct: 253 PVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFPL 312
Query: 199 RPSITKAYGRVDADGS--------RYLLGDHAGLLHLL-VITHEKEKVTGLKIELLGETS 249
R T+ R+ D + + ++ G +++L +IT V + +
Sbjct: 313 R---TQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASV 369
Query: 250 IASTISYLDNAVVYIGSSYGDSQLIKLN--LQ---------------PDAKGSYVE---- 288
+ +++ ++ +++GS G+S L+K LQ P +K V+
Sbjct: 370 LTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASSVREAADKEEPPSKKKRVDPTAG 429
Query: 289 ----------------------------------VLERYVNLGPIVDFCVVD---LERQG 311
V + +N+GP + V + L +
Sbjct: 430 WTGGKTVPQDEVDEIEVYGSEAQSGTQLATYSFEVCDSMLNIGPCANAAVGEPAFLSEEF 489
Query: 312 QG------QVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL------------ 353
Q ++V CSG K+G+L +++ I + EL G MW++
Sbjct: 490 QNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEEEEAP 549
Query: 354 ------------RSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFC 401
++ D FL++S T IL E+ E + GF +Q T+F
Sbjct: 550 RAESTEQESTTPKAEEDGRRHGFLILSREDSTMILQTG--QEIMELDTSGFATQGPTVFA 607
Query: 402 HDAIYNQ-LVQVTSGSVRLVSSTSR 425
+ N+ +VQV+ +RL+ ++
Sbjct: 608 GNIGDNRYIVQVSPLGIRLLEGVNQ 632
>gi|397497327|ref|XP_003819464.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
[Pan paniscus]
gi|410336497|gb|JAA37195.1| cleavage and polyadenylation specific factor 1, 160kDa [Pan
troglodytes]
Length = 1442
Score = 101 bits (252), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 115/460 (25%), Positives = 195/460 (42%), Gaps = 62/460 (13%)
Query: 618 VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQS 672
V PF++ P L ++GEL I + +R IPL + + +S
Sbjct: 971 VDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVES 1030
Query: 673 RTFAICSLKNQSCAE-----------------------ESEMHFVRLLDDQTFEFI--ST 707
+ +A+ + N CA + E ++L+ ++E I +
Sbjct: 1031 KVYAVATSTNTPCARIPRMTGEEKEFETIERDERYIHPQQEAFSIQLISPVSWEAIPNAR 1090
Query: 708 YPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-----IVE 757
L +E+ + + S + V Y GT + EE +GRIL+ + E
Sbjct: 1091 IELQEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVT-CRGRILIMDVIEVVPE 1149
Query: 758 DG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECG 811
G K +++ EKE KG V +L NG L++AI QKI L W LR EL
Sbjct: 1150 PGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFL--WSLR---ASELTGMAF 1204
Query: 812 HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI 871
+ + + +FI+ D+MKSISLL Y+ E + +RD + +V+ + D+
Sbjct: 1205 IDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNA 1264
Query: 872 YLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFR----HGSLVMRLP 924
LG ++ + NL E RL ++H+G VN F G+
Sbjct: 1265 QLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRGATEGLSK 1324
Query: 925 DSDVGQIPTVI-FGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFN 983
S V + + F T++G IG++ + + Y L LQ L ++ GLN +R +
Sbjct: 1325 KSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLH 1384
Query: 984 NEKKTVD--AKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
+++T+ +N LDG+L+ +L LS E++K + +
Sbjct: 1385 VDRRTLQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTT 1424
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 106/566 (18%), Positives = 220/566 (38%), Gaps = 152/566 (26%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAM-----GDVSDRIG 60
+G + ++ + G +D L ++ + K V+++D + +L T ++ ++ D
Sbjct: 74 FFGNVMSMASVQLAGAKRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPELRDGFV 133
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPF-------DNKGQLKEA----------FN 103
+ ++ + DPD R + +Y V+PF +++G + E +
Sbjct: 134 QNVHTPRVRV-DPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGEGQRSSFLPSYIID 192
Query: 104 IR-LEE--LQVLDIKFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDFVEG-- 153
+R L+E L ++D++FL+G +PT+++L++ N+ A T + + +
Sbjct: 193 VRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKVH 252
Query: 154 P--WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAFKAIPI 198
P WS +L + VP P+ GV++ +++Y + A P+
Sbjct: 253 PVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTGTTAFPL 312
Query: 199 RPSITKAYGRVDADGS--------RYLLGDHAGLLHLL-VITHEKEKVTGLKIELLGETS 249
R T+ R+ D + + ++ G +++L +IT V + +
Sbjct: 313 R---TQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASV 369
Query: 250 IASTISYLDNAVVYIGSSYGDSQLI----KLNLQP--------------------DAKGS 285
+ +++ ++ +++GS G+S L+ KL P DA
Sbjct: 370 LTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASAVREAADKEEPPSKKKRVDATAG 429
Query: 286 Y--------------------------------VEVLERYVNLGPIVDFCVVD---LERQ 310
+ EV + +N+GP + + + L +
Sbjct: 430 WSAAGKSVPQDEVDEIEVYGSEAQSGTQLATYSFEVCDSILNIGPCANAAMGEPAFLSEE 489
Query: 311 GQG------QVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL-----RSSTDD 359
Q ++V CSG K+G+L +++ I + EL G MW++ + D+
Sbjct: 490 FQNSPEPDLEIVVCSGHGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEEEDN 549
Query: 360 P-------------------FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLF 400
P FL++S T IL E+ E + GF +Q T+F
Sbjct: 550 PKGEGTEQEPSTPEADDDGRRHGFLILSREDSTMILQTG--QEIMELDTSGFATQGPTVF 607
Query: 401 CHDAIYNQ-LVQVTSGSVRLVSSTSR 425
+ N+ +VQV+ +RL+ ++
Sbjct: 608 AGNIGDNRYIVQVSPLGIRLLEGVNQ 633
>gi|345779232|ref|XP_532356.3| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
[Canis lupus familiaris]
Length = 1460
Score = 101 bits (252), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 111/466 (23%), Positives = 193/466 (41%), Gaps = 74/466 (15%)
Query: 618 VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQS 672
+ PF++ P L ++GEL I + +R IPL + + +S
Sbjct: 989 IDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVES 1048
Query: 673 RTFAICSLKNQSCAE-----------------------ESEMHFVRLLDDQTFEFI--ST 707
+ +A+ + N C + E ++L+ ++E I +
Sbjct: 1049 KVYAVATSTNMPCTRIPRMTGEEKEFETIERDDRYIHPQQEAFSIQLISPVSWEAIPNAR 1108
Query: 708 YPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-----IVE 757
L+ +E+ + + S + V Y GT + EE +GRIL+ + E
Sbjct: 1109 IELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVT-CRGRILIMDVIEVVPE 1167
Query: 758 DG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECG 811
G K +++ EKE KG V +L NG L++AI QKI + W LR EL
Sbjct: 1168 PGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKI--FLWSLR---ASELTGMAF 1222
Query: 812 HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI 871
+ + + +FI+ D+MKSISLL Y+ E + +RD + +V+ + D+
Sbjct: 1223 IDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNA 1282
Query: 872 YLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV 928
LG ++ + NL E RL ++H+G VN F R P
Sbjct: 1283 QLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF------WRTPCRGA 1336
Query: 929 GQIPT-----------VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHE 977
+ P+ F T++G IG++ + + Y L LQ L ++ GLN
Sbjct: 1337 AEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPR 1396
Query: 978 QWRSFNNEKKTVD--AKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
+R + +++ + +N LDG+L+ +L LS E++K + +
Sbjct: 1397 AFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTMERGELAKKIGTT 1442
Score = 46.6 bits (109), Expect = 0.073, Method: Compositional matrix adjust.
Identities = 61/326 (18%), Positives = 141/326 (43%), Gaps = 60/326 (18%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAM-----GDVSDRIG 60
+G + ++ + G +D L ++ + K V+++D + +L T ++ ++ D
Sbjct: 93 FFGNVMSMASVQLAGAKRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPELRDGFV 152
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPF-------DNKGQLKEA----------FN 103
+ ++ + DPD R + +Y V+PF +++G + E +
Sbjct: 153 QNVHTPRVRV-DPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLMGEGQRSSFLPSYIID 211
Query: 104 IR-LEE--LQVLDIKFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDFVEG-- 153
+R L+E L ++D++FL+G +PT+++L++ N+ A T + + +
Sbjct: 212 VRGLDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKVH 271
Query: 154 P--WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAFKAIPI 198
P WS +L + VP P+ GV++ +++Y + A P+
Sbjct: 272 PVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNGLTTGTTAFPL 331
Query: 199 RPSITKAYGRVDADGS--------RYLLGDHAGLLHLL-VITHEKEKVTGLKIELLGETS 249
R T+ R+ D + + ++ G +++L +IT V + +
Sbjct: 332 R---TQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASV 388
Query: 250 IASTISYLDNAVVYIGSSYGDSQLIK 275
+ +++ ++ +++GS G+S L+K
Sbjct: 389 LTTSMVTMEPGYLFLGSRLGNSLLLK 414
Score = 40.0 bits (92), Expect = 6.5, Method: Compositional matrix adjust.
Identities = 41/172 (23%), Positives = 72/172 (41%), Gaps = 36/172 (20%)
Query: 288 EVLERYVNLGPIVDFCVVD---LERQGQG------QVVTCSGAYKDGSLRIVRNGIGINE 338
EV + +N+GP + + + L + Q ++V CSG K+G+L +++ I
Sbjct: 482 EVCDSILNIGPCANAAMGEPAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQV 541
Query: 339 QASVELQGIKGMWS------------------------LRSSTDDPFDTFLVVSFISETR 374
+ EL G MW+ L + D FL++S T
Sbjct: 542 VTTFELPGCYDMWTVIAPVRKEQEETSKGEVAEQESSALEAEDDGRRHGFLILSREDSTM 601
Query: 375 ILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQ-LVQVTSGSVRLVSSTSR 425
IL E+ E + GF +Q T+F + N+ +VQV+ +RL+ ++
Sbjct: 602 ILQTG--QEIMELDTSGFATQGPTVFAGNIGDNRYIVQVSPLGIRLLEGVNQ 651
>gi|156364999|ref|XP_001626630.1| predicted protein [Nematostella vectensis]
gi|156213514|gb|EDO34530.1| predicted protein [Nematostella vectensis]
Length = 1420
Score = 101 bits (252), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 125/531 (23%), Positives = 227/531 (42%), Gaps = 81/531 (15%)
Query: 570 KVSLGTQPITLRTFSSKNT-THVFAASDRPTVIYSSNKKLLYSNVNLKE--VSHMCPFNS 626
K L + LR F+ ++ + +F P I+ +N+ + + + V+ F++
Sbjct: 900 KSGLDPKVAMLRVFNDISSYSGIFVCGSYPFWIFVTNRGAFHWHPMSIDGPVTCFAAFHN 959
Query: 627 AAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQSRTFAICSLK 681
P L GEL I + +R +PL P + + +S+T+AI + +
Sbjct: 960 VNCPKGFLYFNTRGELRISVLPTHLSYDSPWPVRKVPLRYTPHMVSYNRESKTYAIVTSE 1019
Query: 682 NQSC-------AEESEMHFVRLLDDQTFEFIST--------------------YPLDTFE 714
+ C AE+ E FV + D F + ST + LD +E
Sbjct: 1020 QEPCKKIPRVTAEDKE--FVDTIRDARFIYPSTERFVLQLISPISWEVIPNTRHDLDEWE 1077
Query: 715 YGCSILSCSF-SDDSNV----YYCVGTAYVLPEENEPTKGRILVF-IVE----------D 758
+ ++ + S++++ + CVGT + EE +GRIL+F I+E
Sbjct: 1078 HVTTMKNLLLHSEETHTGRKGFICVGTTQLYGEEIA-VRGRILIFDIIEVVPEPGQPLTK 1136
Query: 759 GKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDD---GTRELQSECGHHGH 815
K +L+ EKE KG V +LN NG L++ I QKI Y W D+ G + ++ H
Sbjct: 1137 NKFKLLYEKEQKGPVTALNQVNGYLVSGIGQKI--YIWNFTDNDLVGMAFIDTQLYIHSL 1194
Query: 816 ILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG- 874
+ T +F++ D+ KSI+LL + E + ++D + A + D +G
Sbjct: 1195 V------TIRNFVIAADVCKSITLLRLQEETKTLAFVSKDPKNLEVYAADFFIDGPQIGF 1248
Query: 875 ----AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDS---- 926
E N LFT + E + RL + ++G + F + L S
Sbjct: 1249 LVSDVEKNLVLFTYQ--PEAIESQGGQRLLQRADINVGTHITSFFRIAAKAHLKASGEKS 1306
Query: 927 -DVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNE 985
++ Q+ FGT++G +G++ + + + L LQT L I V GLN + +R
Sbjct: 1307 KEMRQL--TCFGTLDGALGLMLPMTEKTFRRLHMLQTKLVDCIPHVAGLNPKAFRMLQWR 1364
Query: 986 KKTV--DAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTR 1034
K+ + +N LD L+ ++ LS E+++ + + ++ + ++ R
Sbjct: 1365 KRKLCNPHRNVLDWQLLFKYMHLSFMERQEVARKIGTTPAQIMDDMMDIER 1415
Score = 63.9 bits (154), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 110/569 (19%), Positives = 217/569 (38%), Gaps = 163/569 (28%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGR---- 61
++G I +L R G +D L ++ + K ++ +D ++ TR++ D +
Sbjct: 72 LFGNIESLHAIRLAGNTRDSLLMSFKDAKLSIVDYDPGKHDIKTRSLHFFEDEKIKSHCL 131
Query: 62 PTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKE-----------------AFNI 104
D + IDP+ R + Y V+PF +G + + ++ I
Sbjct: 132 AQDRAPVVRIDPERRCAVMLAYGTHLVVLPFRQEGGIDDTAQDSIISSSDRPPVLPSYII 191
Query: 105 RLEEL-----QVLDIKFLYGCAKPTIVVLYQDNKD------------ARHVKTYEVALKD 147
++E+ +LDI+FL+G +PT+++LY+ K A + ++ K
Sbjct: 192 DVKEIDEKTCNILDIQFLHGYYEPTLLILYEPLKTWAGRLAMRNDTCALVAVSLNMSQKA 251
Query: 148 KDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVY---------CSANAF----K 194
V W + L ++PVP P+ GVL+ ++Y S N+
Sbjct: 252 HPVV---WQLSCLPFDCIYVMPVPKPIGGVLVCCMNALLYLNQSVPPYGVSVNSIGENST 308
Query: 195 AIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLL------VITHEKEKVTGLKIELLGET 248
P++P + + +GS + + L+ L V+T + V ++ + +T
Sbjct: 309 VFPLKP---QKGVTITLEGSNAIFIANDKLVFSLKGGEIYVVTLIADGVRSVRNFVFDKT 365
Query: 249 S---IASTISYLDNAVVYIGSSYGDSQLIKLNLQP---------DAKGSYVEVLERY--- 293
+ + S + + +++GS G+S L+K +P +A+ + +ER+
Sbjct: 366 AASVLTSCVCECGDGYLFLGSRLGNSLLVKYTEKPQDIVYGTENNAQSMQCDNIERWQIL 425
Query: 294 --------------------------------------VNLGP----------------- 298
+N+GP
Sbjct: 426 NGSLLLIVDDLDELEVYGAQQEAGVELTSYTFEVCDSLLNIGPCSCMDIGEPAFLSVSSY 485
Query: 299 IVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSST- 357
D +DLE VV+CSG K+G+L +++ I + EL G MW++ S
Sbjct: 486 FADAQELDLE------VVSCSGYGKNGALTVLQRSIRPQVVTTFELPGCTDMWTVFSKDQ 539
Query: 358 --------------------DDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQ 397
++ + +FL++S E + + E E+ E + GF +Q
Sbjct: 540 KKGAQTNAIHRYPSQPCTQGNEKYHSFLILS--REDSSMILKTEQEIMEVDQSGFSTQCA 597
Query: 398 TLFCHD-AIYNQLVQVTSGSVRLVSSTSR 425
T++ + + ++QVT VRL+ ++
Sbjct: 598 TIYAGNFGNGSYILQVTPLGVRLLEGVNQ 626
>gi|344236599|gb|EGV92702.1| Cleavage and polyadenylation specificity factor subunit 1 [Cricetulus
griseus]
Length = 1419
Score = 101 bits (252), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 112/466 (24%), Positives = 193/466 (41%), Gaps = 74/466 (15%)
Query: 618 VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQS 672
+ PF++ P L ++GEL I + +R IPL + + +S
Sbjct: 948 IDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVES 1007
Query: 673 RTFAICSLKNQSCAE-----------------------ESEMHFVRLLDDQTFEFI--ST 707
+ +A+ + N C + E ++L+ ++E I +
Sbjct: 1008 KVYAVATSTNTPCTRIPRMTGEEKEFEAIERDDRYIHPQQEAFSIQLISPVSWEAIPNAR 1067
Query: 708 YPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-----IVE 757
L+ +E+ + + S + V Y GT + EE +GRIL+ + E
Sbjct: 1068 IELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVT-CRGRILIMDVIEVVPE 1126
Query: 758 DG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECG 811
G K +++ EKE KG V +L NG L++AI QKI L W LR EL
Sbjct: 1127 PGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFL--WSLR---ASELTGMAF 1181
Query: 812 HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI 871
+ + + +FI+ D+MKSISLL Y+ E + +RD + +V+ + D+
Sbjct: 1182 IDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNA 1241
Query: 872 YLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV 928
LG ++ + NL E RL ++H+G VN F R P
Sbjct: 1242 QLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF------WRTPCRGA 1295
Query: 929 GQIPT-----------VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHE 977
+ P+ F T++G IG++ + + Y L LQ L ++ GLN
Sbjct: 1296 AEGPSKKSVMWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPR 1355
Query: 978 QWRSFNNEKKTVD--AKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
+R + +++ + +N LDG+L+ +L LS E++K + +
Sbjct: 1356 AFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTT 1401
Score = 59.7 bits (143), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 107/570 (18%), Positives = 220/570 (38%), Gaps = 156/570 (27%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAM-----GDVSDRIG 60
+G + ++ + G +D L ++ + K V+++D + +L T ++ ++ D
Sbjct: 74 FFGNVMSMASVQLAGAKRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEESELRDGFV 133
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPF-------DNKGQLKEA----------FN 103
+ ++ + DPD R + +Y V+PF +++G + E +
Sbjct: 134 QNVHTPRVRV-DPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLMGEGQRSSFLPSYIID 192
Query: 104 IR-LEE--LQVLDIKFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDFVEG-- 153
+R L+E L ++D++FL+G +PT+++L++ N+ A T + + +
Sbjct: 193 VRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKVH 252
Query: 154 P--WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAFKAIPI 198
P WS +L + VP P+ GV+I +++Y + A P+
Sbjct: 253 PVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFPL 312
Query: 199 RPSITKAYGRVDADGS--------RYLLGDHAGLLHLL-VITHEKEKVTGLKIELLGETS 249
R T+ R+ D + + ++ G +++L +IT V + +
Sbjct: 313 R---TQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASV 369
Query: 250 IASTISYLDNAVVYIGSSYGDSQLIKL----------------------NLQPDAKGSYV 287
+ +++ ++ +++GS G+S L+K N +P +K V
Sbjct: 370 LTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASSVREAADKASAHNEEPPSKKKRV 429
Query: 288 --------------------------------------EVLERYVNLGPIVDFCVVD--- 306
EV + +N+GP + V +
Sbjct: 430 DPTAGWTGGKTVPQDEVDEIEVYGSEAQSGTQLATYSFEVCDSMLNIGPCANAAVGEPAF 489
Query: 307 LERQGQG------QVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL------- 353
L + Q ++V CSG K+G+L +++ I + EL G MW++
Sbjct: 490 LSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKE 549
Query: 354 -----------------RSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQT 396
++ D FL++S T IL E+ E + GF +Q
Sbjct: 550 EEEAPRAESTEQESTTPKAEEDGRRHGFLILSREDSTMILQTG--QEIMELDTSGFATQG 607
Query: 397 QTLFCHDAIYNQ-LVQVTSGSVRLVSSTSR 425
T+F + N+ +VQV+ +RL+ ++
Sbjct: 608 PTVFAGNIGDNRYIVQVSPLGIRLLEGVNQ 637
>gi|197245729|gb|AAI68713.1| Cpsf1 protein [Rattus norvegicus]
Length = 1439
Score = 101 bits (252), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 112/466 (24%), Positives = 193/466 (41%), Gaps = 74/466 (15%)
Query: 618 VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQS 672
+ PF++ P L ++GEL I + +R IPL + + +S
Sbjct: 968 IDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVES 1027
Query: 673 RTFAICSLKNQSCAE-----------------------ESEMHFVRLLDDQTFEFI--ST 707
+ +A+ + N C + E ++L+ ++E I +
Sbjct: 1028 KVYAVATSTNTPCTRIPRMTGEEKEFEAIERDDRYIHPQQEAFSIQLISPVSWEAIPNAR 1087
Query: 708 YPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-----IVE 757
L+ +E+ + + S + V Y GT + EE +GRIL+ + E
Sbjct: 1088 IELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVT-CRGRILIMDVIEVVPE 1146
Query: 758 DG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECG 811
G K +++ EKE KG V +L NG L++AI QKI L W LR EL
Sbjct: 1147 PGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFL--WSLR---ASELTGMAF 1201
Query: 812 HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI 871
+ + + +FI+ D+MKSISLL Y+ E + +RD + +V+ + D+
Sbjct: 1202 IDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNA 1261
Query: 872 YLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV 928
LG ++ + NL E RL ++H+G VN F R P
Sbjct: 1262 QLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF------WRTPCRGA 1315
Query: 929 GQIPT-----------VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHE 977
+ P+ F T++G IG++ + + Y L LQ L ++ GLN
Sbjct: 1316 AEGPSKKSVMWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPR 1375
Query: 978 QWRSFNNEKKTVD--AKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
+R + +++ + +N LDG+L+ +L LS E++K + +
Sbjct: 1376 AFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTT 1421
Score = 60.1 bits (144), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 108/563 (19%), Positives = 220/563 (39%), Gaps = 149/563 (26%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAM-----GDVSDRIG 60
+G + ++ + G +D L ++ + K V+++D + +L T ++ ++ D
Sbjct: 74 FFGNVMSMASVQLAGAKRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPELRDGFV 133
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPF-------DNKGQLKEA----------FN 103
+ ++ + DPD R + +Y V+PF +++G + E +
Sbjct: 134 QNVHTPRVRV-DPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLMGEGQRSSFLPSYIID 192
Query: 104 IR-LEE--LQVLDIKFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDFVEG-- 153
+R L+E L ++D++FL+G +PT+++L++ N+ A T + + +
Sbjct: 193 VRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKVH 252
Query: 154 P--WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAFKAIPI 198
P WS +L + VP P+ GV+I +++Y + A P+
Sbjct: 253 PVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFPL 312
Query: 199 RPSITKAYGRVDADGS--------RYLLGDHAGLLHLL-VITHEKEKVTGLKIELLGETS 249
R T+ R+ D + + ++ G +++L +IT V + +
Sbjct: 313 R---TQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASV 369
Query: 250 IASTISYLDNAVVYIGSSYGDSQLIKL--NLQ---------------PDAKGSYV----- 287
+ +++ ++ +++GS G+S L+K LQ P +K V
Sbjct: 370 LTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASSVREAADKEEPPSKKKRVDPTVG 429
Query: 288 -------------------------------EVLERYVNLGPIVDFCVVD---LERQGQG 313
EV + +N+GP + V + L + Q
Sbjct: 430 WTGGKTQDEVDEIEVYGSEAQSGTQLATYSFEVCDSMLNIGPCANAAVGEPAFLSEEFQN 489
Query: 314 ------QVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL-------------- 353
++V CSG K+G+L +++ I + EL G MW++
Sbjct: 490 SPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEEEETPKA 549
Query: 354 ----------RSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHD 403
++ D FL++S T IL E+ E + GF +Q T+F +
Sbjct: 550 ESTEQEPSAPKAEEDGRRHGFLILSREDSTMILQTG--QEIMELDTSGFATQGPTVFAGN 607
Query: 404 AIYNQ-LVQVTSGSVRLVSSTSR 425
N+ +VQV+ +RL+ ++
Sbjct: 608 IGDNRYIVQVSPLGIRLLEGVNQ 630
>gi|354491122|ref|XP_003507705.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
isoform 1 [Cricetulus griseus]
Length = 1441
Score = 101 bits (252), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 112/466 (24%), Positives = 193/466 (41%), Gaps = 74/466 (15%)
Query: 618 VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQS 672
+ PF++ P L ++GEL I + +R IPL + + +S
Sbjct: 970 IDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVES 1029
Query: 673 RTFAICSLKNQSCAE-----------------------ESEMHFVRLLDDQTFEFI--ST 707
+ +A+ + N C + E ++L+ ++E I +
Sbjct: 1030 KVYAVATSTNTPCTRIPRMTGEEKEFEAIERDDRYIHPQQEAFSIQLISPVSWEAIPNAR 1089
Query: 708 YPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-----IVE 757
L+ +E+ + + S + V Y GT + EE +GRIL+ + E
Sbjct: 1090 IELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVT-CRGRILIMDVIEVVPE 1148
Query: 758 DG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECG 811
G K +++ EKE KG V +L NG L++AI QKI L W LR EL
Sbjct: 1149 PGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFL--WSLR---ASELTGMAF 1203
Query: 812 HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI 871
+ + + +FI+ D+MKSISLL Y+ E + +RD + +V+ + D+
Sbjct: 1204 IDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNA 1263
Query: 872 YLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV 928
LG ++ + NL E RL ++H+G VN F R P
Sbjct: 1264 QLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF------WRTPCRGA 1317
Query: 929 GQIPT-----------VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHE 977
+ P+ F T++G IG++ + + Y L LQ L ++ GLN
Sbjct: 1318 AEGPSKKSVMWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPR 1377
Query: 978 QWRSFNNEKKTVD--AKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
+R + +++ + +N LDG+L+ +L LS E++K + +
Sbjct: 1378 AFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTT 1423
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 107/565 (18%), Positives = 220/565 (38%), Gaps = 151/565 (26%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAM-----GDVSDRIG 60
+G + ++ + G +D L ++ + K V+++D + +L T ++ ++ D
Sbjct: 74 FFGNVMSMASVQLAGAKRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEESELRDGFV 133
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPF-------DNKGQLKEA----------FN 103
+ ++ + DPD R + +Y V+PF +++G + E +
Sbjct: 134 QNVHTPRVRV-DPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLMGEGQRSSFLPSYIID 192
Query: 104 IR-LEE--LQVLDIKFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDFVEG-- 153
+R L+E L ++D++FL+G +PT+++L++ N+ A T + + +
Sbjct: 193 VRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKVH 252
Query: 154 P--WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAFKAIPI 198
P WS +L + VP P+ GV+I +++Y + A P+
Sbjct: 253 PVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFPL 312
Query: 199 RPSITKAYGRVDADGS--------RYLLGDHAGLLHLL-VITHEKEKVTGLKIELLGETS 249
R T+ R+ D + + ++ G +++L +IT V + +
Sbjct: 313 R---TQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASV 369
Query: 250 IASTISYLDNAVVYIGSSYGDSQLIKLN--LQ---------------PDAKGSYVE---- 288
+ +++ ++ +++GS G+S L+K LQ P +K V+
Sbjct: 370 LTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASSVREAADKEEPPSKKKRVDPTAG 429
Query: 289 ----------------------------------VLERYVNLGPIVDFCVVD---LERQG 311
V + +N+GP + V + L +
Sbjct: 430 WTGGKTVPQDEVDEIEVYGSEAQSGTQLATYSFEVCDSMLNIGPCANAAVGEPAFLSEEF 489
Query: 312 QG------QVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL------------ 353
Q ++V CSG K+G+L +++ I + EL G MW++
Sbjct: 490 QNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEEEEAP 549
Query: 354 ------------RSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFC 401
++ D FL++S T IL E+ E + GF +Q T+F
Sbjct: 550 RAESTEQESTTPKAEEDGRRHGFLILSREDSTMILQTG--QEIMELDTSGFATQGPTVFA 607
Query: 402 HDAIYNQ-LVQVTSGSVRLVSSTSR 425
+ N+ +VQV+ +RL+ ++
Sbjct: 608 GNIGDNRYIVQVSPLGIRLLEGVNQ 632
>gi|1045574|gb|AAC50293.1| cleavage and polyadenylation specificity factor [Homo sapiens]
Length = 1442
Score = 101 bits (252), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 116/465 (24%), Positives = 195/465 (41%), Gaps = 73/465 (15%)
Query: 618 VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQS 672
V PF++ P L ++GEL I + +R IPL + + +S
Sbjct: 972 VDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVES 1031
Query: 673 RTFAICSLKNQSCAE-----------------------ESEMHFVRLLDDQTFEFI--ST 707
+ +A+ + N CA + E ++L+ ++E I +
Sbjct: 1032 KVYAVATSTNTPCARIPRMTGEEKEFETIERDERYIHPQQEAFSIQLISPVSWEAIPNAR 1091
Query: 708 YPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-----IVE 757
L +E+ + + S + V Y GT + EE +GRIL+ + E
Sbjct: 1092 IELQEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVT-CRGRILIMDVIEVVPE 1150
Query: 758 DG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECG 811
G K +++ EKE KG V +L NG L++AI QKI L W LR EL
Sbjct: 1151 PGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFL--WSLR---ASELTGMAF 1205
Query: 812 HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI 871
+ + + +FI+ D+MKSISLL Y+ E + +RD + +V+ + D+
Sbjct: 1206 IDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNA 1265
Query: 872 YLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLP---- 924
LG ++ + NL E RL ++H+G VN F R P
Sbjct: 1266 QLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF------WRTPCRAT 1319
Query: 925 -----DSDVGQIPTVI-FGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQ 978
S V + + F T++G IG++ + + Y L LQ L ++ GLN
Sbjct: 1320 EGLSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRA 1379
Query: 979 WRSFNNEKKTVD--AKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
+R + +++T+ +N LDG+L+ +L LS E++K + +
Sbjct: 1380 FRMLHVDRRTLQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTT 1424
Score = 55.8 bits (133), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 107/567 (18%), Positives = 220/567 (38%), Gaps = 153/567 (26%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAM-----GDVSDRIG 60
+G + ++ + G +D L ++ + K V+++D + +L T ++ ++ D
Sbjct: 74 FFGNVMSMASVQLAGAKRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPELRDGFV 133
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPF-------DNKGQLKEA----------FN 103
+ ++ + DPD R + +Y V+PF +++G + E +
Sbjct: 134 QNVHTPRVRV-DPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGEGQRSSFLPSYIID 192
Query: 104 IR-LEE--LQVLDIKFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDFVEG-- 153
+R L+E L ++D++FL+G +PT+++L++ N+ A T + + +
Sbjct: 193 VRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKVH 252
Query: 154 P--WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAFKAIPI 198
P WS +L + VP P+ GV++ +++Y + A P+
Sbjct: 253 PVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTGTTAFPL 312
Query: 199 RPSITKAYGRVDADGS--------RYLLGDHAGLLHLL-VITHEKEKVTGLKIELLGETS 249
R T+ R+ D + + ++ G +++L +IT V + +
Sbjct: 313 R---TQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASV 369
Query: 250 IASTISYLDNAVVYIGSSYGDSQLI----KLNLQP--------------------DAKGS 285
+ +++ ++ +++GS G+S L+ KL P DA
Sbjct: 370 LTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASAVREAADKEEPPSKKKRVDATAG 429
Query: 286 Y--------------------------------VEVLERYVNLGPIVDFCVVD---LERQ 310
+ EV + +N+GP + V + L +
Sbjct: 430 WSAAGKSVPQDEVDEIEVYGSEAQSGTQLATYSFEVCDSILNIGPCANAAVGEPAFLSEE 489
Query: 311 GQG------QVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL-----RSSTDD 359
Q ++V CSG K+G+L +++ I + EL G MW++ + D+
Sbjct: 490 FQNSPEPDLEIVVCSGHGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEEEDN 549
Query: 360 P--------------------FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTL 399
P FL++S T IL E+ E + GF +Q T+
Sbjct: 550 PKGEGTEQEPSTTPEADDDGRRHGFLILSREDSTMILQTG--QEIMELDTSGFATQGPTV 607
Query: 400 FCHDAIYNQ-LVQVTSGSVRLVSSTSR 425
F + N+ +VQV+ +RL+ ++
Sbjct: 608 FAGNIGDNRYIVQVSPLGIRLLEGVNQ 634
>gi|338728511|ref|XP_001505047.3| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like isoform 1 [Equus caballus]
Length = 1444
Score = 101 bits (251), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 112/466 (24%), Positives = 193/466 (41%), Gaps = 74/466 (15%)
Query: 618 VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQS 672
+ PF++ P L ++GEL I + +R IPL + + +S
Sbjct: 973 IDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVES 1032
Query: 673 RTFAICSLKNQSCAE-----------------------ESEMHFVRLLDDQTFEFI--ST 707
+ +A+ + N C + E ++L+ ++E I +
Sbjct: 1033 KVYAVATSTNTPCTRIPRMTGEEKEFETIERDDRYIHPQQEAFSIQLISPVSWEAIPNAR 1092
Query: 708 YPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-----IVE 757
L+ +E+ + + S + V Y GT + EE +GRIL+ + E
Sbjct: 1093 IELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVT-CRGRILIMDVIEVVPE 1151
Query: 758 DG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECG 811
G K +++ EKE KG V +L NG L++AI QKI L W LR EL
Sbjct: 1152 PGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFL--WSLR---ASELTGMAF 1206
Query: 812 HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI 871
+ + + +FI+ D+MKSISLL Y+ E + +RD + +V+ + D+
Sbjct: 1207 IDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNA 1266
Query: 872 YLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV 928
LG ++ + NL E RL ++H+G VN F R P
Sbjct: 1267 QLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF------WRTPCRGA 1320
Query: 929 GQIPT-----------VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHE 977
+ P+ F T++G IG++ + + Y L LQ L ++ GLN
Sbjct: 1321 AEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPR 1380
Query: 978 QWRSFNNEKKTVD--AKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
+R + +++ + +N LDG+L+ +L LS E++K + +
Sbjct: 1381 AFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTMERGELAKKIGTT 1426
Score = 46.6 bits (109), Expect = 0.069, Method: Compositional matrix adjust.
Identities = 61/326 (18%), Positives = 141/326 (43%), Gaps = 60/326 (18%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAM-----GDVSDRIG 60
+G + ++ + G +D L ++ + K V+++D + +L T ++ ++ D
Sbjct: 77 FFGNVMSMASVQLAGAKRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPELRDGFV 136
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPF-------DNKGQLKEA----------FN 103
+ ++ + DPD R + +Y V+PF +++G + E +
Sbjct: 137 QNVHTPRVRV-DPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLMGEGQRSSFLPSYIID 195
Query: 104 IR-LEE--LQVLDIKFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDFVEG-- 153
+R L+E L ++D++FL+G +PT+++L++ N+ A T + + +
Sbjct: 196 VRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKVH 255
Query: 154 P--WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAFKAIPI 198
P WS +L + VP P+ GV++ +++Y + A P+
Sbjct: 256 PVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTGTTAFPL 315
Query: 199 RPSITKAYGRVDADGS--------RYLLGDHAGLLHLL-VITHEKEKVTGLKIELLGETS 249
R T+ R+ D + + ++ G +++L +IT V + +
Sbjct: 316 R---TQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASV 372
Query: 250 IASTISYLDNAVVYIGSSYGDSQLIK 275
+ +++ ++ +++GS G+S L+K
Sbjct: 373 LTTSMVTMEPGYLFLGSRLGNSLLLK 398
>gi|444523674|gb|ELV13604.1| Cleavage and polyadenylation specificity factor subunit 1 [Tupaia
chinensis]
Length = 1469
Score = 101 bits (251), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 115/468 (24%), Positives = 197/468 (42%), Gaps = 78/468 (16%)
Query: 618 VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQS 672
+ PF++ P L ++GEL I + +R IPL + + +S
Sbjct: 998 IDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVES 1057
Query: 673 RTFAICSLKNQSCAE-----------------------ESEMHFVRLLDDQTFEFI--ST 707
+ +A+ + N C + E ++L+ ++E I +
Sbjct: 1058 KVYAVATSTNAPCTRIPRMTGEEKEFETIERDDRYIHPQQEAFSIQLISPVSWEAIPNAR 1117
Query: 708 YPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-----IVE 757
L+ +E+ + + S + V Y GT + EE +GRIL+ + E
Sbjct: 1118 IELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVT-CRGRILIMDVIEVVPE 1176
Query: 758 DG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECG 811
G K +++ EKE KG V +L NG L++AI QKI L W LR EL
Sbjct: 1177 PGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFL--WSLR---ASELTGMAF 1231
Query: 812 HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI 871
+ + + +FI+ D+MKSISLL Y+ E + +RD + +V+ + D+
Sbjct: 1232 IDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNA 1291
Query: 872 YLG---AENNFNLFTVRKNSEGATDEERGRLEVV--GEYHLGEFVNRFRHGSLVMRLPDS 926
LG ++ + NL E E G L ++ ++HLG VN F R P
Sbjct: 1292 QLGFLVSDRDRNLMVYMYLPEA--KESFGGLLLLRRADFHLGAHVNTF------WRTPCR 1343
Query: 927 DVGQIPT-----------VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLN 975
+ P+ F T++G IG++ + + Y L LQ L ++ GLN
Sbjct: 1344 GAVEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLN 1403
Query: 976 HEQWRSFNNEKKTVD--AKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
+R + +++T+ +N LDG+L+ +L LS E++K + +
Sbjct: 1404 PRAFRMLHVDRRTLQNAVRNVLDGELLSRYLYLSTMERSELAKKIGTT 1451
Score = 47.4 bits (111), Expect = 0.048, Method: Compositional matrix adjust.
Identities = 62/323 (19%), Positives = 138/323 (42%), Gaps = 54/323 (16%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAM-----GDVSDRIG 60
+G + ++ + G +D L ++ + K V+++D + +L T ++ ++ D
Sbjct: 74 FFGNVMSMASVQLAGAKRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPELRDGFV 133
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPF-------DNKGQLKEA----------FN 103
+ ++ + DPD R + +Y V+PF +++G + E +
Sbjct: 134 QNVHTPRVRV-DPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLVGEGQRSSFLPSYIID 192
Query: 104 IR-LEE--LQVLDIKFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDFVE--G 153
+R L+E L ++D++FL+G +PT+++L++ N+ A T + + +
Sbjct: 193 VRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQRVH 252
Query: 154 P--WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFK---AIPIRPSITKAYGR 208
P WS +L + VP P+ GV++ +++Y + + A+ + T A+
Sbjct: 253 PVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTAGTTAFPL 312
Query: 209 VDADGSRYLLG-DHAGLLH---------------LLVITHEKEKVTGLKIELLGETSIAS 252
DG R L HA + L ++T V + + + +
Sbjct: 313 RTQDGVRLTLDCAHAAFISYDKMVISLKGGEIYVLTLVTDGMRSVRAFHFDKAAASVLTT 372
Query: 253 TISYLDNAVVYIGSSYGDSQLIK 275
++ ++ +++GS G+S L+K
Sbjct: 373 SMVTMEPGYLFLGSRLGNSLLLK 395
>gi|383863556|ref|XP_003707246.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Megachile rotundata]
Length = 1415
Score = 101 bits (251), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 125/529 (23%), Positives = 232/529 (43%), Gaps = 84/529 (15%)
Query: 575 TQPITLRTFSS-KNTTHVFAASDRPTVIYSSNKKLLYSNVNLKE--VSHMCPFNSAAFPD 631
T+ +R FS+ VF SD P I+ + + L ++ + ++ PFN+ P
Sbjct: 897 TRHCMMRYFSNIAGYNGVFICSDYPHWIFLTGRGELRTHPMGIDGPITSFAPFNNINCPQ 956
Query: 632 S-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCA 686
L ++ EL I + +R +PL P + + +S+T+ + + S A
Sbjct: 957 GFLYFNRKEELRICVLPTHLSYDAPWPVRKVPLRCTPHFVTYHLESKTYCVIT----SIA 1012
Query: 687 EESEMHFVRLLDDQTF-------EFI-----------------STYP-----LDTFEYGC 717
E + ++ +D+ F FI T P LD +E+
Sbjct: 1013 EPLKSYYRFNGEDKEFTEEDRPDRFIFPSQEQFSIVLFSPVSWETIPNTKIELDQWEHVT 1072
Query: 718 SILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-IVE----------DGKL 761
+ + S + + Y +GT Y E+ ++GRIL+F I+E +
Sbjct: 1073 CLKNVSLAYEGTRSGLKGYIVLGTNYNYGEDIT-SRGRILIFDIIEVVPEPGQPLTKNRF 1131
Query: 762 QLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDD---GTRELQSECGHHGHILA 818
+ I KE KG + ++ +G L++A+ QKI Y W L+D+ G + ++ H +L+
Sbjct: 1132 KQIYAKEQKGPITAITQVSGFLVSAVGQKI--YIWQLKDNDLVGVAFIDTQIYIH-QMLS 1188
Query: 819 LYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG---- 874
+ I++ D+ KSISLL ++ E + +RD+ + +E L D+ LG
Sbjct: 1189 I-----KSLILIADVYKSISLLRFQEEYRTLSLVSRDFRPAEVYTIEYLIDNNNLGFLVA 1243
Query: 875 -AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIP- 932
E+N LF + S + + +L ++HLG+ VN F + P +D
Sbjct: 1244 DGESNIALFMYQPESRESLGGQ--KLIRKADFHLGQKVNTFFRIRCRISDPANDKKHFSG 1301
Query: 933 -----TVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKK 987
++ +++G +G I +P + Y L LQ L I + GLN + +R++ + +
Sbjct: 1302 ADKRHVTMYASLDGSLGYILPVPEKTYRRLLMLQNVLVTHICHIAGLNPKAYRTYKSYIR 1361
Query: 988 TV--DAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTR 1034
T A+ +DGDL+ +L L +++K + V+E+ + + E+ R
Sbjct: 1362 TQGNPARGIIDGDLVWRYLYLPNNEKIDVAKKIGTRVQEIIEDLTEIDR 1410
Score = 54.3 bits (129), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 66/329 (20%), Positives = 138/329 (41%), Gaps = 61/329 (18%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRP--T 63
++G + +++ G +D L ++ K V+++D + +L T ++ + R T
Sbjct: 73 LHGNVMSMQAVTLVGSQRDSLLLSFRDAKLSVVEYDQDIHDLRTVSLHYFEEEEIRDGWT 132
Query: 64 DNGQIGII--DPDCRLIGLHLYDGLFKVIPF------------------DNKGQLKEAFN 103
++ I I+ DP+ R + +Y V+PF NK + ++
Sbjct: 133 NHHHIPIVRVDPEGRCAVMLIYGRKLVVLPFKKDPSLDDGDLLDNSKASSNKTPILSSYM 192
Query: 104 IRLEELQ-----VLDIKFLYGCAKPTIVVLYQDNKD------ARHVKTYEVALK---DKD 149
I L+ L+ ++D++FL+G +PT+++LY+ + R VA+ +
Sbjct: 193 IVLKSLEEKMDNIIDLQFLHGYYEPTLLILYEPVRTFSGRIAVRQDTCAMVAISLNIQQR 252
Query: 150 FVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAFKAI 196
WS +NL +PV PL G LI+ +++Y + A
Sbjct: 253 VHPIIWSVSNLPFDCYQAVPVKKPLGGTLIMAVNSLIYLNQSIPPYGVSLNSLAETSTNF 312
Query: 197 PIRPSITKAYGRVDADGS--------RYLLGDHAGLLHLL-VITHEKEKVTGLKIELLGE 247
P++P + ++ +GS R ++ +G L++L + V G +
Sbjct: 313 PLKP---QEGVKISLEGSQVAFISSDRLVISLKSGELYVLSLFADSMRSVRGFHFDKAAA 369
Query: 248 TSIASTISYLDNAVVYIGSSYGDSQLIKL 276
+ + S + D+ +++GS G+S L++
Sbjct: 370 SVLTSCVCMCDDNYLFLGSRLGNSLLLRF 398
Score = 44.3 bits (103), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 46/172 (26%), Positives = 77/172 (44%), Gaps = 14/172 (8%)
Query: 314 QVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP---------FDTF 364
++VT SG K+G+L +++ I + EL G + MW++ + ++ F
Sbjct: 505 ELVTTSGYGKNGALCVLQRSIRPQVVTTFELPGCEDMWTVIGALNNDEQVRPEAEGSHAF 564
Query: 365 LVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQ-LVQVTSGSVRLVSST 423
L++S T IL E+ E + GF +Q T+F + N+ +VQVT VRL+
Sbjct: 565 LILSQEDSTMILQTG--QEINEVDQSGFSTQGSTVFAGNLGANRYIVQVTQMGVRLLQGI 622
Query: 424 SRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQ 475
E G + A+ V L + G ++ L + +G T HAQ
Sbjct: 623 --EQIQHMPIDLGCPIVHASCADPYVSLLSEDGQVMLLTLREGRGTAKLHAQ 672
>gi|338728513|ref|XP_003365689.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Equus caballus]
Length = 1450
Score = 101 bits (251), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 112/466 (24%), Positives = 193/466 (41%), Gaps = 74/466 (15%)
Query: 618 VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQS 672
+ PF++ P L ++GEL I + +R IPL + + +S
Sbjct: 979 IDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVES 1038
Query: 673 RTFAICSLKNQSCAE-----------------------ESEMHFVRLLDDQTFEFI--ST 707
+ +A+ + N C + E ++L+ ++E I +
Sbjct: 1039 KVYAVATSTNTPCTRIPRMTGEEKEFETIERDDRYIHPQQEAFSIQLISPVSWEAIPNAR 1098
Query: 708 YPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-----IVE 757
L+ +E+ + + S + V Y GT + EE +GRIL+ + E
Sbjct: 1099 IELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVT-CRGRILIMDVIEVVPE 1157
Query: 758 DG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECG 811
G K +++ EKE KG V +L NG L++AI QKI L W LR EL
Sbjct: 1158 PGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFL--WSLR---ASELTGMAF 1212
Query: 812 HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI 871
+ + + +FI+ D+MKSISLL Y+ E + +RD + +V+ + D+
Sbjct: 1213 IDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNA 1272
Query: 872 YLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV 928
LG ++ + NL E RL ++H+G VN F R P
Sbjct: 1273 QLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF------WRTPCRGA 1326
Query: 929 GQIPT-----------VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHE 977
+ P+ F T++G IG++ + + Y L LQ L ++ GLN
Sbjct: 1327 AEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPR 1386
Query: 978 QWRSFNNEKKTVD--AKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
+R + +++ + +N LDG+L+ +L LS E++K + +
Sbjct: 1387 AFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTMERGELAKKIGTT 1432
Score = 46.6 bits (109), Expect = 0.072, Method: Compositional matrix adjust.
Identities = 61/326 (18%), Positives = 141/326 (43%), Gaps = 60/326 (18%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAM-----GDVSDRIG 60
+G + ++ + G +D L ++ + K V+++D + +L T ++ ++ D
Sbjct: 77 FFGNVMSMASVQLAGAKRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPELRDGFV 136
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPF-------DNKGQLKEA----------FN 103
+ ++ + DPD R + +Y V+PF +++G + E +
Sbjct: 137 QNVHTPRVRV-DPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLMGEGQRSSFLPSYIID 195
Query: 104 IR-LEE--LQVLDIKFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDFVEG-- 153
+R L+E L ++D++FL+G +PT+++L++ N+ A T + + +
Sbjct: 196 VRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKVH 255
Query: 154 P--WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAFKAIPI 198
P WS +L + VP P+ GV++ +++Y + A P+
Sbjct: 256 PVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTGTTAFPL 315
Query: 199 RPSITKAYGRVDADGS--------RYLLGDHAGLLHLL-VITHEKEKVTGLKIELLGETS 249
R T+ R+ D + + ++ G +++L +IT V + +
Sbjct: 316 R---TQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASV 372
Query: 250 IASTISYLDNAVVYIGSSYGDSQLIK 275
+ +++ ++ +++GS G+S L+K
Sbjct: 373 LTTSMVTMEPGYLFLGSRLGNSLLLK 398
>gi|241952575|ref|XP_002419009.1| pre-mRNA-splicing factor, putative; pre-spliceosome component,
putative [Candida dubliniensis CD36]
gi|223642349|emb|CAX42591.1| pre-mRNA-splicing factor, putative [Candida dubliniensis CD36]
Length = 1187
Score = 101 bits (251), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 158/754 (20%), Positives = 308/754 (40%), Gaps = 97/754 (12%)
Query: 350 MWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQL 409
+++ + S + D +LV+S ++ L +++ + +E+ E F T+ +
Sbjct: 454 IFTTKLSLESANDEYLVISSSLSSKTLVLSIGEVVEDVEDSEFVLDQSTISVQQVGIASV 513
Query: 410 VQVTSGSVRLVSSTS-RELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG---D 465
VQV S ++ + + + ++ +W P G ++ A+ N QVL+A ++VY EI D
Sbjct: 514 VQVYSNGIKHIRTVNGKKKTTDWFPPAGITITHASTNNQQVLIALSNLNVVYFEIDSTDD 573
Query: 466 GILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHL- 524
++ H ++ I+ + I N S A +G +D ++++ SL + N + + L
Sbjct: 574 QLIEYQDHLEISTTITAMAIQE--HNSEKSSFAIIGC-SDETIQVVSLQEDNCLEIKSLQ 630
Query: 525 ---GGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLR 581
+ + E +++ + +G ++ G L++ + LG++P+ L
Sbjct: 631 ALSANSSSLKMLKSSGKE--THVHIGMENGVYARIKIDTINGNLSNSRVKYLGSKPVNLS 688
Query: 582 TFSSKNTTH-VFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFP-DSLAIAKEG 639
N V A S P + Y + + ++++ F S + + K+
Sbjct: 689 VIKFSNEIEGVLAISSAPWISYLYRDSFKITPLLEIDITNGSSFISEDIGGEGIVGIKDN 748
Query: 640 ELTIGTI-------DDIQKLHIRSIPLGEHPRR-ICHQEQSRTFAI-----------CSL 680
L I ++ D Q L I + L PR+ I H + R F C++
Sbjct: 749 NLVIFSVGKEDSVFDPSQDLTITNTKLRYTPRKMITHGD--RLFISESEYNVQGPFKCNI 806
Query: 681 KNQSCAEESEMHF---------------VRLLDDQTFEFISTYPLDTFEYGCSILSCSFS 725
E ++ ++++DD+T + I + + E S+ +F+
Sbjct: 807 NGNVKENVDEDYYEAFGYDRKQDSWASCIQVVDDKTNQVIQSLQFEENESIVSMSDVAFN 866
Query: 726 DD-SNVY---YCVGTAY---VLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNA 778
S+V+ VG +LP E K + F + LQL+ + E L
Sbjct: 867 KSLSSVHASHLVVGVCTNQTILPNSYE--KSYLYTFKIGKKHLQLVHKTELDYIPQVLEN 924
Query: 779 FNGKLLAAINQKIQLYKWMLRDDGTRELQSECG---HHGHILALYVQTRGDFIVVGDLMK 835
F KLL A N IQLY D G ++L + + + I+
Sbjct: 925 FQDKLLVASNNHIQLY-----DIGQKQLLKKSTTIIDFSQNINKIIPQSNRIIICDSHKS 979
Query: 836 SISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---------- 885
SI + + A D ++++ LD D +G + N+F R
Sbjct: 980 SIIFAKFDDSQNQFVPFADDIMKRQITSIMNLDIDTLIGGDKFGNIFVTRIDEDISKQAD 1039
Query: 886 ------KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTV 939
K+ EG + +L+ + E+H+G+ + G L ++ +VI+ +
Sbjct: 1040 DDWTILKSQEGILNSCPYKLQNLIEFHIGDIITSLNLGCL-------NLAGTESVIYTGL 1092
Query: 940 NGVIGVIASLPHEQYL-FLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGD 998
G IG++ L + + L LQ +++ + G +H ++RS+ N KN +DGD
Sbjct: 1093 QGTIGLLVPLVSKSEVELLFNLQLLMQQFQNNLVGKDHLKFRSYYNP-----IKNVIDGD 1147
Query: 999 LIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
L+E FL+ + EIS+ +N SV ++ K++ +L
Sbjct: 1148 LLERFLEFDTSLRIEISRKLNKSVNDIEKKLIDL 1181
>gi|334326317|ref|XP_001364707.2| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Monodelphis domestica]
Length = 1449
Score = 100 bits (250), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 116/480 (24%), Positives = 200/480 (41%), Gaps = 74/480 (15%)
Query: 618 VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQS 672
+ PF++ P L ++GEL I + +R IPL + + +S
Sbjct: 978 IDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVES 1037
Query: 673 RTFAICSLKNQSC-------AEESEMHFV----------------RLLDDQTFEFI--ST 707
+ +A+ + N C EE E + +L+ ++E I +
Sbjct: 1038 KVYAVATSTNALCTRIPRMTGEEKEFETIERDERYIHPLQEAFSIQLISPVSWEAIPNAR 1097
Query: 708 YPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-----IVE 757
L+ +E+ + + S + V Y GT + EE +GRIL+ + E
Sbjct: 1098 IELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVT-CRGRILIMDVIEVVPE 1156
Query: 758 DG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECG 811
G K +++ EKE KG V +L NG L++AI QKI L W LR EL
Sbjct: 1157 PGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFL--WSLR---ASELTGMAF 1211
Query: 812 HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI 871
+ + + +FI+ D+MKSISLL Y+ E + +RD + +V+ + D
Sbjct: 1212 IDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDSA 1271
Query: 872 YLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV 928
LG ++ + NL E RL ++H+G VN F R P
Sbjct: 1272 QLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF------WRTPCRGA 1325
Query: 929 GQIPT-----------VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHE 977
+ P+ F T++G IG++ + + Y L LQ L ++ GLN
Sbjct: 1326 AEGPSKKSIVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPR 1385
Query: 978 QWRSFNNEKKTVD--AKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRL 1035
+R + +++ + +N LDG+L+ +L LS E++K + + + + + E+ R+
Sbjct: 1386 AFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTMERGELAKKIGTTPDIILDDLLEIDRV 1445
Score = 44.7 bits (104), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 45/216 (20%), Positives = 99/216 (45%), Gaps = 35/216 (16%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAM-----GDVSDRIG 60
+G + ++ + G +D L ++ + K V+++D + +L T ++ ++ D
Sbjct: 77 FFGNVMSMASVQLAGAKRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPELRDGFV 136
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPF-------DNKGQLKEA----------FN 103
+ ++ + DPD R + +Y V+PF +++G + E +
Sbjct: 137 QNVHTPRVRV-DPDGRCAVMLIYGTRLVVLPFRRESLAEEHEGLVGEGQKSSFLPSYIID 195
Query: 104 IR-LEE--LQVLDIKFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDFVEGP- 154
+R L+E L ++D++FL+G +PT+++L++ N+ A T + + ++
Sbjct: 196 VRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNILQKVH 255
Query: 155 ---WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVY 187
WS NL + VP P+ GV+I +++Y
Sbjct: 256 PVIWSLTNLPFDCTQALAVPKPIGGVVIFAVNSLLY 291
>gi|395860104|ref|XP_003802355.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
[Otolemur garnettii]
Length = 1441
Score = 100 bits (250), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 112/466 (24%), Positives = 193/466 (41%), Gaps = 74/466 (15%)
Query: 618 VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQS 672
+ PF++ P L ++GEL I + +R IPL + + +S
Sbjct: 970 IDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVES 1029
Query: 673 RTFAICSLKNQSCAE-----------------------ESEMHFVRLLDDQTFEFI--ST 707
+ +A+ + N C + E ++L+ ++E I +
Sbjct: 1030 KVYAVATSTNTPCTRIPRMTGEEKEFETIERDDRYIHPQQEAFSIQLISPVSWEAIPNAR 1089
Query: 708 YPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-----IVE 757
L+ +E+ + + S + V Y GT + EE +GRIL+ + E
Sbjct: 1090 IELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVT-CRGRILIMDVIEVVPE 1148
Query: 758 DG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECG 811
G K +++ EKE KG V +L NG L++AI QKI L W LR EL
Sbjct: 1149 PGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFL--WSLR---ASELTGMAF 1203
Query: 812 HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI 871
+ + + +FI+ D+MKSISLL Y+ E + +RD + +V+ + D+
Sbjct: 1204 IDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNA 1263
Query: 872 YLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV 928
LG ++ + NL E RL ++H+G VN F R P
Sbjct: 1264 QLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF------WRTPCRGA 1317
Query: 929 GQIPT-----------VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHE 977
+ P+ F T++G IG++ + + Y L LQ L ++ GLN
Sbjct: 1318 TEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPR 1377
Query: 978 QWRSFNNEKKTVD--AKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
+R + +++ + +N LDG+L+ +L LS E++K + +
Sbjct: 1378 AFRMLHVDRRVLQNAVRNVLDGELLNRYLYLSTMERGELAKKIGTT 1423
Score = 46.6 bits (109), Expect = 0.074, Method: Compositional matrix adjust.
Identities = 62/326 (19%), Positives = 141/326 (43%), Gaps = 60/326 (18%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAM-----GDVSDRIG 60
+G + ++ + G +D L ++ + K V+++D + +L T ++ ++ D
Sbjct: 74 FFGNVMSMASVQLAGAKRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPELRDGFV 133
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPF-------DNKGQLKEA----------FN 103
+ ++ + DPD R + +Y V+PF +++G + E +
Sbjct: 134 QNVHTPRVRV-DPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGEGQRSSFLPSYIID 192
Query: 104 IR-LEE--LQVLDIKFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDFVEG-- 153
+R L+E L ++D++FL+G +PT+++L++ N+ A T + + +
Sbjct: 193 VRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKVH 252
Query: 154 P--WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAFKAIPI 198
P WS +L + VP P+ GV+I +++Y + A P+
Sbjct: 253 PVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFPL 312
Query: 199 RPSITKAYGRVDADGS--------RYLLGDHAGLLHLL-VITHEKEKVTGLKIELLGETS 249
R T+ R+ D + + ++ G +++L +IT V + +
Sbjct: 313 R---TQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASV 369
Query: 250 IASTISYLDNAVVYIGSSYGDSQLIK 275
+ +++ ++ +++GS G+S L+K
Sbjct: 370 LTTSMVTMEPGYLFLGSRLGNSLLLK 395
Score = 39.7 bits (91), Expect = 8.7, Method: Compositional matrix adjust.
Identities = 41/172 (23%), Positives = 73/172 (42%), Gaps = 36/172 (20%)
Query: 288 EVLERYVNLGPIVDFCVVD---LERQGQG------QVVTCSGAYKDGSLRIVRNGIGINE 338
EV + +N+GP + + + L + Q ++V CSG K+G+L +++ I
Sbjct: 463 EVCDSILNIGPCANAAMGEPAFLSEEFQNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQV 522
Query: 339 QASVELQGIKGMWSLRSST------------------------DDPFDTFLVVSFISETR 374
+ EL G MW++ +S D FL++S T
Sbjct: 523 VTTFELPGCYDMWTVIASVRKEEEETPKGEGTEQESGVPEGEEDGRRHGFLILSREDSTM 582
Query: 375 ILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQ-LVQVTSGSVRLVSSTSR 425
IL E+ E + GF +Q T+F + N+ +VQV+ +RL+ ++
Sbjct: 583 ILQTG--QEIMELDTSGFATQGPTVFAGNIGDNRYIVQVSPLGIRLLEGVNQ 632
>gi|241060959|ref|XP_002408050.1| cleavage and polyadenylation specificity factor, putative [Ixodes
scapularis]
gi|215492346|gb|EEC01987.1| cleavage and polyadenylation specificity factor, putative [Ixodes
scapularis]
Length = 1241
Score = 100 bits (250), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 92/323 (28%), Positives = 155/323 (47%), Gaps = 35/323 (10%)
Query: 731 YYCVGTAYVLPEENEPTKGRILVFIVED-----------GKLQLIAEKETKGAVYSLNAF 779
Y +GT Y E+ ++GRI + + D K++++ KE KG V +L+
Sbjct: 916 YLALGTNYCY-GEDVTSRGRITILDIIDVVPEPGQPLTKNKIKIVYSKEQKGPVTALSQV 974
Query: 780 NGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISL 839
G LL+AI QK +Y W L+D+G L I V T + I+VGD+ KS+SL
Sbjct: 975 VGFLLSAIGQK--MYIWQLKDNG---LVGVAFIDTQIYIHSVVTVKNLILVGDVFKSVSL 1029
Query: 840 LIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG-----AENNFNLFTVRKNSEGATDE 894
L Y+ + +RD + AVE D+ + +E N L+ + S +
Sbjct: 1030 LRYQEASRTLSLVSRDVRPLEVFAVEFFIDNSQMSFLVTDSERNMILYMYQPESRESCGG 1089
Query: 895 ERGRLEVVGEYHLGE-----FVNRFRHGSLV---MRLPDSDVGQIPTVIFGTVNGVIGVI 946
+ RL G++H+G F + R G + RL S G+ T++ T++G +G +
Sbjct: 1090 Q--RLLRRGDFHIGSPVVSMFRIKCRMGEVAKHDRRLAASVDGRHITML-ATLDGSLGYV 1146
Query: 947 ASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTV--DAKNFLDGDLIESFL 1004
+P + Y L LQ L + GLN + +R ++++++ + KN LDG+LI F+
Sbjct: 1147 LPVPEKTYRRLLMLQNVLVTNMPHYAGLNPKAFRMYHSQRRVLGNPHKNILDGELIWKFM 1206
Query: 1005 DLSRTRMDEISKTMNVSVEELCK 1027
LS E+SK + +V ++ +
Sbjct: 1207 HLSFMERSELSKKIGTTVTQVVQ 1229
>gi|157110889|ref|XP_001651294.1| cleavage and polyadenylation specificity factor cpsf [Aedes aegypti]
gi|108883895|gb|EAT48120.1| AAEL000832-PA [Aedes aegypti]
Length = 1417
Score = 100 bits (249), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 129/538 (23%), Positives = 237/538 (44%), Gaps = 79/538 (14%)
Query: 561 KTGELTDRKKVSLGTQPITL-RTFSSKNTTH-VFAASDRPTVIYSSNK-----KLLYSNV 613
+ GE D + + I++ R F++ N + V ++P ++ +++ LY+
Sbjct: 886 QEGEKPDEHSTKIMYENISMIRYFNNVNGYNGVAVCGEKPYIMLLTSRGELRAHRLYAKT 945
Query: 614 NLKEVSHMCPFNSAAFPDS-LAIAKEGELTIG------TIDDIQKLHIRSIPLGEHPRRI 666
+K + PFN+ P+ L ++ EL I + D I +R IPL P++I
Sbjct: 946 IMKGFA---PFNNVNCPNGFLYFDEQYELKIAVFPGYLSYDSIWP--VRKIPLRSSPKQI 1000
Query: 667 CHQEQSRTFAICSLKNQSC-------AEESEM----------------HFVRLLDDQTFE 703
+ ++++ + + + C E+ E+ V L+ +E
Sbjct: 1001 VYHKENKVYCVVMDAEEVCNKYYRFNGEDKELTEENKGERFLYPMAHKFSVVLVTPSAWE 1060
Query: 704 FI--STYPLDTFEYGCSILSCSFSDDS-----NVYYCVGTAYVLPEENEPTKGRILVF-- 754
I ++ LD +E+ ++ + S S + Y VGT + E+ ++GR+L++
Sbjct: 1061 IIPETSINLDEWEHVIALKNVSLSYEGARSGFKEYIAVGTNFNY-SEDITSRGRLLLYDI 1119
Query: 755 ---IVEDGK------LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRE 805
+ E GK + + KE KG V ++ +G L+ A+ QK+ Y W L+DD +
Sbjct: 1120 IEVVPEPGKPLTRYKFKEVIVKEQKGPVSAITHVSGFLVGAVGQKV--YLWQLKDD---D 1174
Query: 806 LQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVE 865
L +I + + I+V D+ KS+SLL ++ + + +RDY + +E
Sbjct: 1175 LVGVAFIDTNIFVHQLVSIKSLILVADVYKSVSLLRFQEDYRTLSLVSRDYQPLNVFQIE 1234
Query: 866 ILDDDIYLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR-FR-----H 916
+ D+ LG ++ N+ T E RL +YH+G+ +N FR H
Sbjct: 1235 YVVDNHNLGFLVSDEQCNIITYMYQPESRESFGGQRLLRKCDYHVGQKINSMFRVQCDFH 1294
Query: 917 GSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNH 976
R +S+ T F T++G IG + LP + Y L LQ L + GLN
Sbjct: 1295 EMDYKR--NSNYECKHTTYFATLDGGIGYVLPLPEKTYRRLFMLQNVLMTHSPHLCGLNP 1352
Query: 977 EQWRSFNNEKKTV--DAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
+ +R+ KK A+ +DGDLI +FL L E++K + ++++C + E+
Sbjct: 1353 KAFRTIKTVKKLPINPARCVVDGDLIWTFLTLPANEKLEVAKKIGTRIDDICADLMEI 1410
Score = 68.2 bits (165), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 117/548 (21%), Positives = 213/548 (38%), Gaps = 133/548 (24%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMG--DVSDR 58
M ++G I +++ G +D L I+ + K V+Q+D ++ EL T ++ + D
Sbjct: 68 MATYTLFGNIMSMQSVSLAGSQRDALLISFQDAKLSVVQFDPDNFELKTLSLHYFEEEDI 127
Query: 59 IGRPTDNGQIGII--DPDCRLIGLHLYDGLFKVIPFDNKGQLKE---------------- 100
G T + I+ DPD R + +Y V+PF L E
Sbjct: 128 KGGWTGHYHTPIVRVDPDNRCAVMLVYGRKLVVLPFRKDSSLDEIEVQDVKPMKKAPTQL 187
Query: 101 --------AFNIRLEELQ-----VLDIKFLYGCAKPTIVVLYQDNKD------ARHVKTY 141
++ I L+E + V+DI+FL+G +PT+++LY+ K R
Sbjct: 188 IAKTPILASYVIELKESEERIDNVIDIQFLHGYYEPTLLILYEPVKTFPGRIAVRSDTCM 247
Query: 142 EVALK---DKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS--------- 189
VAL + W+ N L I + P+ G LI+ ++Y +
Sbjct: 248 MVALSLNIQQRVHPVIWTVNCLPFDCLQAIAISKPIGGCLILSVNALIYLNQSVPPYGVS 307
Query: 190 ----ANAFKAIPIRPSITKAYGRVDADGS--------RYLLGDHAGLLHLLVITHEKEK- 236
A+ P++P + R+ D + + +L G L++L + + +
Sbjct: 308 LNSIADHCTNFPLKP---QDGVRISLDAAQVCFIEPEKLVLSLKGGELYVLTLCADSMRS 364
Query: 237 VTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAK------------- 283
V + + I ++ +++GS G+S L++ + ++
Sbjct: 365 VRSFHFSKAASSVLTCCICVVEEEYLFLGSRLGNSLLLRFKEKDESMVITIDDTEEVVEK 424
Query: 284 -------------------------GSYV-EVLERYVNLGPIVDFCV---VDLERQGQG- 313
SY+ EV + +N+GPI V + E Q +
Sbjct: 425 EPKRLRLEQEELEVYGSGQKTSVQLTSYIFEVCDSILNIGPIGHMAVGERISEEEQDENK 484
Query: 314 -----------QVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDPFD 362
++VT SG K+G+L +++N I S L G + D
Sbjct: 485 DVQFVPNKLDLEIVTSSGHGKNGALCVLQNSIKPQVITSFGLSGCLDV--------DDMH 536
Query: 363 TFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQ-LVQVTSGSVRLVS 421
+F+++S E + + DE+ E E GF + T+ + N+ +VQVT+ S+RL+
Sbjct: 537 SFMILS--QEAGTMVLQTGDEINEIENTGFATNVPTIHVGNIGGNRFIVQVTTKSIRLLQ 594
Query: 422 STSRELRN 429
T R L+N
Sbjct: 595 GT-RLLQN 601
>gi|330932624|ref|XP_003303845.1| hypothetical protein PTT_16226 [Pyrenophora teres f. teres 0-1]
gi|311319874|gb|EFQ88052.1| hypothetical protein PTT_16226 [Pyrenophora teres f. teres 0-1]
Length = 1247
Score = 100 bits (248), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 140/617 (22%), Positives = 243/617 (39%), Gaps = 85/617 (13%)
Query: 155 WSQNNLDNGADLLIPVP---PPLCGVLIIGEETIVY-----CSANAFK-AIPIRPSITKA 205
WS+ +D A+ L VP GVL GE++I Y +N + AIP R T+
Sbjct: 224 WSEP-VDRTANTLFRVPGGPNAPSGVLCCGEDSITYRRIFNNKSNVHRLAIPRREGATED 282
Query: 206 YGR-----------VDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTI 254
R + YLL G + L + V +KI+ IA++I
Sbjct: 283 PNRKRMIIAGTLYSLKGGDFFYLLQTEDGDVFKLTVEAPSGTVDKIKIKYFDTIPIATSI 342
Query: 255 SYLDNAVVYIGSSYGDSQLIKL---------------NLQPDAKGSY------------V 287
L VY GD L +L D + S+ +
Sbjct: 343 CILRAGFVYAACESGDRILYELESLGDETEDPVFESDQFPVDPEASFAPPFFKPRALVNL 402
Query: 288 EVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVEL-QG 346
+E +L PI+ V + + QV T +G S R RN + + + L Q
Sbjct: 403 TAVESMPSLNPIMGMEVANPALEDAPQVYTINGTSGRSSFRTTRNALEVLDLIESPLPQN 462
Query: 347 IKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIY 406
+W+ + +++D DT +V+ +R L + + D++EE GF T TL
Sbjct: 463 ASDVWTTKLTSEDEADTLIVLCL--HSRTLVLKIGDDVEEASNTGFLPDTNTLGVQQFGE 520
Query: 407 NQLVQVTSGSVRLV---------SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGH 457
+ ++Q+ +R + + + +W P ++ N QV +A G
Sbjct: 521 DCIIQIHPKGIRHIQGIQFPNDDAGATHANLMDWHPPAHRTIVACATNNRQVAIALSSGQ 580
Query: 458 LVYLEI-GDGILTEVKHA-QLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-P 514
++Y E DG L + L+ I+CL + + E + AVG +D +VRIF+L P
Sbjct: 581 ILYFECDSDGSLAMAEEEISLDSTINCLAMPDVPEGSVRAFFLAVGC-SDQTVRIFNLSP 639
Query: 515 DL--NLITKEHLGGEIIPRSVLLCAF------EGIS-YLLCALGDGHLLNFLLNMKTGEL 565
D+ N++ + P S L G S +L L G + +L+ TG++
Sbjct: 640 DMDGNILRSISVQALTSPPSDLTINLMTDRSPRGYSQFLHIGLRSGVYIRSVLDEMTGDI 699
Query: 566 TDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKK--LLYSNVNLKEVSHMCP 623
D ++ LG +PI + + A + RP + Y+ + L + +N
Sbjct: 700 GDTRRRFLGPEPIKFAKVTVAGEPGILAMTSRPWLGYTHPRTGVLQLTPLNYIPFKSAWN 759
Query: 624 FNSAAFPDSLAIAKEGELTIGTIDDI------QKLHIRSIP---LGEHPRRICHQEQSRT 674
F+ + F + ++ EL I T +D+ + + +R P +G H + + + QS
Sbjct: 760 FDGSQFKGIICVSAN-ELRIFTFNDLTDNTTYENISLRYTPRKMVGYHDQGVFYVIQSDN 818
Query: 675 FAICSLKNQSCAEESEM 691
I + + Q ++ M
Sbjct: 819 NTISADRRQQLIAQANM 835
Score = 70.9 bits (172), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 73/287 (25%), Positives = 132/287 (45%), Gaps = 33/287 (11%)
Query: 755 IVEDGK-LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTREL--QSECG 811
I DG+ L+L E +L AF GKL+A + + + LY D G + L +++
Sbjct: 954 ISPDGRELELFNETAVSEPPLALLAFKGKLIAGVGRHLCLY-----DCGMKSLLRKAQAP 1008
Query: 812 HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHE--EGAIEERARDYNANWMSAVEILDD 869
+ + ++T+G +VV D +S++ ++K + + D + SA E+LD
Sbjct: 1009 NSVPTRIVDIKTQGSRLVVSDQAQSVTYFVHKDQVHPNRLIPFVDDSISRHTSASEMLDY 1068
Query: 870 DIYLGAENNFNLFTVR---KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDS 926
D +G + N++ VR + SE + + G +V + +LG NR L+ +
Sbjct: 1069 DTTIGGDKFGNIWLVRCPKEISESSDESPDGSDLLVDKSYLGGTPNRL---DLIAHYFTN 1125
Query: 927 DV-----------GQIPTVIFGTVNGVIG-VIASLPHEQYLFLEKLQTNLRKVIKGVGGL 974
D+ G + + + G +G +I + ++L+ +LR K + G
Sbjct: 1126 DIPVSIQKANLISGGDKVIFWAGLQGTLGALIPFTSRRNHKLFQQLELSLRSEDKPLSGR 1185
Query: 975 NHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
+H +RS+ K+V +DGDLIE +L LSR + + I+ M S
Sbjct: 1186 DHLTYRSYYAPVKSV-----IDGDLIERYLVLSRDKRESIAAQMTGS 1227
>gi|403302917|ref|XP_003942095.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
[Saimiri boliviensis boliviensis]
Length = 1390
Score = 99.8 bits (247), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 113/460 (24%), Positives = 194/460 (42%), Gaps = 62/460 (13%)
Query: 618 VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQS 672
V PF++ P L ++GEL I + +R IPL + + +S
Sbjct: 919 VDSFAPFHNINCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVES 978
Query: 673 RTFAICSLKNQSCAE-----------------------ESEMHFVRLLDDQTFEFI--ST 707
+ +A+ + N C + E ++L+ ++E I +
Sbjct: 979 KVYAVATSTNTPCTRIPRMTGEEKEFETIERDERYIHPQQEAFSIQLISPVSWEAIPNAR 1038
Query: 708 YPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-----IVE 757
L +E+ + + S + V Y GT + EE +GRIL+ + E
Sbjct: 1039 IELQEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVT-CRGRILIMDVIEVVPE 1097
Query: 758 DG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECG 811
G K +++ EKE KG V +L NG L++AI QKI + W LR EL
Sbjct: 1098 PGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKI--FLWSLR---ASELTGMAF 1152
Query: 812 HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI 871
+ + + +FI+ D+MKSISLL Y+ E + +RD + +V+ + D+
Sbjct: 1153 IDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNA 1212
Query: 872 YLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFR----HGSLVMRLP 924
LG ++ + NL E RL ++H+G VN F G+
Sbjct: 1213 QLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRGATEGLSK 1272
Query: 925 DSDVGQIPTVI-FGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFN 983
S V + + F T++G IG++ + + Y L LQ L ++ GLN +R +
Sbjct: 1273 KSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLH 1332
Query: 984 NEKKTVD--AKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
+++T+ +N LDG+L+ +L LS E++K + +
Sbjct: 1333 VDRRTLQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTT 1372
>gi|432883539|ref|XP_004074300.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Oryzias latipes]
Length = 1456
Score = 99.8 bits (247), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 119/483 (24%), Positives = 208/483 (43%), Gaps = 80/483 (16%)
Query: 618 VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQS 672
+ PF++ P L K+GEL I + +R IPL + + +S
Sbjct: 985 IESFSPFHNINCPKGFLYFNKQGELRISVLPTYLSYDAPWPVRKIPLRCTVHFVSYHVES 1044
Query: 673 RTFAICSLKNQSCA---------------EESEMHFVRLLDDQTFEFIS-----TYP--- 709
+ +A+C+ + C E E + L + + + IS T P
Sbjct: 1045 KVYAVCTSVKELCTRIPRMTGEEKEFETIERDERYINPLQEKFSIQLISPVSWETIPNTR 1104
Query: 710 --LDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-----IVE 757
L+ +E+ + + + V Y GT VL E +GRIL+ + E
Sbjct: 1105 IDLEEWEHVTCMKTVALRSQETVSGLKGYIAAGTC-VLQGEEVTCRGRILILDVIEVVPE 1163
Query: 758 DG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDD---GTRELQS 808
G K +++ EKE KG V +L +G L++AI QKI L W L+D+ G + +
Sbjct: 1164 PGQPLTKNKFKVLYEKEQKGPVTALCHCHGYLVSAIGQKIFL--WALKDNDLTGMAFIDT 1221
Query: 809 ECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILD 868
+ H I + +FI+ D+MKSISLL Y+ E + +RD + ++E +
Sbjct: 1222 QLYIHQMI------SIKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSIEFIV 1275
Query: 869 DDIYLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLP- 924
D+ LG ++ + NLF E RL +++ G +N + R+P
Sbjct: 1276 DNNQLGFLVSDRDKNLFVYMYLPEAKESFGGMRLLRRADFNAGAHINS------LWRMPC 1329
Query: 925 ----DSDVGQIPT------VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGL 974
DS + T F T++G IG++ + + Y L LQ L ++ GL
Sbjct: 1330 RGALDSGSKKALTWDNKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGL 1389
Query: 975 NHEQWRSFNNEKKTVD--AKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
N + +R ++ ++++ KN LDG+L+ +L LS E++K + + + + + E+
Sbjct: 1390 NPKAFRMMHSNRRSLQNAVKNILDGELLAKYLYLSTMERSELAKKIGTTQDIILDDLLEI 1449
Query: 1033 TRL 1035
R+
Sbjct: 1450 DRV 1452
Score = 49.3 bits (116), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 62/323 (19%), Positives = 136/323 (42%), Gaps = 54/323 (16%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPT-- 63
++G + ++ + G ++D L ++ + K V+++D + +L T ++ + R
Sbjct: 74 LFGNVMSMASVQLTGASKDALLLSFKDAKLSVIEYDPGTHDLKTLSLHYFEEPELRDGFF 133
Query: 64 DNGQIGII--DPDCRLIGLHLYDGLFKVIPFDN---------------KGQLKEAFNIRL 106
N I I+ DP+ R + +Y V+PF K ++ I +
Sbjct: 134 QNVHIPIVRVDPENRCAVMLIYGTKLVVLPFRKDTLSDEQEGGVGEGPKSSFLPSYIIDV 193
Query: 107 EEL-----QVLDIKFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDFVEGP-- 154
EL ++D+KFL+G +PT+++L++ N+ A T + + ++
Sbjct: 194 RELDEKLLNIIDMKFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNIMQKVHP 253
Query: 155 --WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAFKAIPIR 199
WS +NL ++ VP P+ GV++ +++Y + N + P+R
Sbjct: 254 VIWSLSNLPFDCTQVMAVPKPIGGVVVFAVNSLLYLNQSVPPYGVSLNSQTNGTTSFPLR 313
Query: 200 PSITKAYGRVDADGSRYLLGD------HAGLLHLL-VITHEKEKVTGLKIELLGETSIAS 252
+ +D S ++ D G +++L +IT V + + + +
Sbjct: 314 VQ-EEVKITLDCCQSDFIAYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLTT 372
Query: 253 TISYLDNAVVYIGSSYGDSQLIK 275
+ ++ +++GS G+S L+K
Sbjct: 373 CMVTMEPGYLFLGSRLGNSLLLK 395
Score = 40.4 bits (93), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 42/172 (24%), Positives = 73/172 (42%), Gaps = 41/172 (23%)
Query: 288 EVLERYVNLGPIVDFCVVD---LERQGQG------QVVTCSGAYKDGSLRIVRNGIGINE 338
EV + +N+GP + + + L + Q ++V CSG K+G+L +++ I
Sbjct: 469 EVCDSILNIGPCANASMGEPAFLSEEFQSNPEPDLEIVVCSGYGKNGALSVLQRSIRPQV 528
Query: 339 QASVELQGIKGMWSLRS-----------------------STDDPFDT------FLVVSF 369
+ EL G MW++ S T+ P + FL++S
Sbjct: 529 VTTFELPGCHDMWTVISGEDKKESEGGEKEADAEKKEEQDKTEPPLEDDAKKHGFLILSR 588
Query: 370 ISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQ-LVQVTSGSVRLV 420
T IL E+ E + GF +Q T+F + NQ ++QV+ +RL+
Sbjct: 589 EDSTMILQTG--QEIMELDTSGFATQGPTVFAGNIGDNQYIIQVSPMGLRLL 638
>gi|358415280|ref|XP_003583063.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
[Bos taurus]
Length = 1490
Score = 99.8 bits (247), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 111/466 (23%), Positives = 193/466 (41%), Gaps = 74/466 (15%)
Query: 618 VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQS 672
+ PF++ P L ++GEL I + +R IPL + + +S
Sbjct: 1019 IDSFAPFHNINCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVES 1078
Query: 673 RTFAICSLKNQSCAE-----------------------ESEMHFVRLLDDQTFEFI--ST 707
+ +A+ + + C + E ++L+ ++E I +
Sbjct: 1079 KVYAVATSTSTPCTRVPRMTGEEKEFETIERDERYVHPQQEAFCIQLISPVSWEAIPNAR 1138
Query: 708 YPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-----IVE 757
L+ +E+ + + S + V Y GT + EE +GRIL+ + E
Sbjct: 1139 IELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVT-CRGRILIMDVIEVVPE 1197
Query: 758 DG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECG 811
G K +++ EKE KG V +L NG L++AI QKI L W LR EL
Sbjct: 1198 PGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFL--WSLR---ASELTGMAF 1252
Query: 812 HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI 871
+ + + +FI+ D+MKSISLL Y+ E + +RD + +V+ + D+
Sbjct: 1253 IDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNA 1312
Query: 872 YLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV 928
LG ++ + NL E RL ++H+G VN F R P
Sbjct: 1313 QLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF------WRTPCRGA 1366
Query: 929 GQIPT-----------VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHE 977
+ P+ F T++G IG++ + + Y L LQ L ++ GLN
Sbjct: 1367 AEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPR 1426
Query: 978 QWRSFNNEKKTVD--AKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
+R + +++ + +N LDG+L+ +L LS E++K + +
Sbjct: 1427 AFRMLHVDRRVLQNAVRNVLDGELLNRYLYLSTMERGELAKKIGTT 1472
Score = 46.6 bits (109), Expect = 0.064, Method: Compositional matrix adjust.
Identities = 63/326 (19%), Positives = 141/326 (43%), Gaps = 60/326 (18%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAM-----GDVSDRIG 60
+G + ++ + G +D L ++ + K V+++D + +L T ++ ++ D
Sbjct: 123 FFGNVMSMASVQLAGAKRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPELRDGFV 182
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPF-------DNKGQLKEA----------FN 103
+ ++ + DPD R + +Y V+PF +++G + E +
Sbjct: 183 QNVHTPRVRV-DPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLVGEGQRSSFLPSYIID 241
Query: 104 IR-LEE--LQVLDIKFLYGCAKPTIVVLYQDN-----KDARHVKTYEVALKDKDFVEG-- 153
+R L+E L ++D++FL+G +PT+++L++ N K A T + + +
Sbjct: 242 VRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGKVAVRQDTCSIVAISLNITQKVH 301
Query: 154 P--WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAFKAIPI 198
P WS +L + VP P+ GV+I +++Y + A P+
Sbjct: 302 PVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFPL 361
Query: 199 RPSITKAYGRVDADGS--------RYLLGDHAGLLHLL-VITHEKEKVTGLKIELLGETS 249
R T+ R+ D + + ++ G +++L +IT V + +
Sbjct: 362 R---TQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASV 418
Query: 250 IASTISYLDNAVVYIGSSYGDSQLIK 275
+ +++ ++ +++GS G+S L+K
Sbjct: 419 LTTSMVTMEPGYLFLGSRLGNSLLLK 444
>gi|345482082|ref|XP_001607052.2| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Nasonia vitripennis]
Length = 1415
Score = 99.4 bits (246), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 123/519 (23%), Positives = 225/519 (43%), Gaps = 74/519 (14%)
Query: 580 LRTFSS-KNTTHVFAASDRPTVIYSSNKKLLYSN-VNLK-EVSHMCPFNSAAFPDS-LAI 635
+R FS+ VF D P I+ + + L ++ +N+ V PFN+ P L
Sbjct: 902 MRYFSNIAGYNGVFIGGDYPHWIFLTGRGELRAHPMNIDGPVKSFAPFNNVNCPQGFLYF 961
Query: 636 AKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQSRTFAICS-----LKN---- 682
++ EL I + +R +PL P + + +S+T+ + + LK+
Sbjct: 962 NRKDELRICVLPTHLSYDAPWPVRKVPLRCTPHFVTYHLESKTYCVVTSTAEPLKSYYRF 1021
Query: 683 -----QSCAEESEMHFVRLLDDQ----TFEFIS--TYP-----LDTFEYGCSILSCSFSD 726
+ EE F+ +Q F +S T P LD +E+ + + S +
Sbjct: 1022 NGEDKEFTEEERNERFLYPTQEQFSIVLFSPVSWDTIPNTKIDLDQWEHVTCLKNVSLAY 1081
Query: 727 DSNV-----YYCVGTAYVLPEENEPTKGRILVF-IVE----------DGKLQLIAEKETK 770
+ Y +GT Y E+ ++GRI +F I+E + + I KE K
Sbjct: 1082 EGTRSGLKGYIVIGTNYNYGEDIT-SRGRIFIFDIIEVVPEPGQPLTKNRFKQIYAKEQK 1140
Query: 771 GAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVV 830
G V ++ +G L++AI QKI Y W L+D+ +L I + + I+V
Sbjct: 1141 GPVTAITQVSGFLVSAIGQKI--YIWQLKDN---DLVGVAFIDTQIYVCQMLSIKSLILV 1195
Query: 831 GDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG-----AENNFNLFTVR 885
D+ KS+SLL ++ E + +RD+ + A+E + LG E+N ++F+ +
Sbjct: 1196 ADVYKSVSLLRFQPEYKTLSLVSRDFRTTEIYAIEYFIQNNELGFIVADGESNISIFSYQ 1255
Query: 886 KNSEGATDEERGRLEVVGEYHLGEFVNRFRH--------GSLVMRLPDSDVGQIPTVIFG 937
S + ++ L + HLG+ +N F + + +D + ++
Sbjct: 1256 PESSQSLGGQK--LIRKADIHLGQKINTFFRIKCKTTDSANPTKQFSGADKRHV--TMYA 1311
Query: 938 TVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNN--EKKTVDAKNFL 995
T++G +G I +P + Y L LQ L I + GLN + +R++ + + A+ +
Sbjct: 1312 TLDGSLGYILPVPEKTYRRLLMLQNVLVSHIYHIAGLNPKAFRTYKSCVRMQGNPARGII 1371
Query: 996 DGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTR 1034
DGDL+ +LDL EI+K + +E+ + E+ +
Sbjct: 1372 DGDLVRKYLDLPVNEKIEIAKKIGTGAQEIMDDMHEIYK 1410
Score = 53.5 bits (127), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 122/604 (20%), Positives = 232/604 (38%), Gaps = 137/604 (22%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRP--T 63
++G + +++ + G +D L ++ K V+++D E L T ++ + + T
Sbjct: 73 LHGNVMSMQAVQLIGSPRDSLLLSFREAKLSVVEYDPEIHSLRTVSLHYFEEEEIKDGWT 132
Query: 64 DNGQIGII--DPDCRLIGLHLYDGLFKVIPF------------------DNKGQLKEAFN 103
++ + I+ DP+ R + +Y V+PF +K + ++
Sbjct: 133 NHHHVPIVRVDPEGRCAVMLIYGRKLVVLPFRKDPILDEGDLIENPKSSSHKTPILSSYM 192
Query: 104 IRLEELQ-----VLDIKFLYGCAKPTIVVLYQDNKD------ARHVKTYEVALK---DKD 149
I L+ L+ ++D++FL+G +PT+++LY+ + R VA+ +
Sbjct: 193 IVLKSLEEKMDNIIDLQFLHGYYEPTLLILYEPVRTFAGRIAVRQDTCAMVAISLNIQQK 252
Query: 150 FVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVY---------CSANAFK----AI 196
WS +NL + V PL G LI+ +++Y S N+
Sbjct: 253 VHPIIWSVSNLPFDCYQAVAVKKPLGGTLIMAVNSLIYLNQSIPPYGVSLNSLTDNCTNF 312
Query: 197 PIRPS----ITKAYGRVD-ADGSRYLLGDHAGLLHLL-VITHEKEKVTGLKIELLGETSI 250
P++P I+ +V R ++ G L++L + V G + + +
Sbjct: 313 PLKPQEGVKISLESSQVAFISPDRLVISLKTGELYVLSLFADSMRSVRGFHFDKAAASVL 372
Query: 251 ASTISYLDNAVVYIGSSYG----------------DSQLIKLNL-------QPDAK---- 283
S + D+ +++GS G D +++++L QP K
Sbjct: 373 TSCVCLCDDNYLFLGSRLGNSLLLRFTEKESEKINDISMLEMSLNSSNSQEQPTKKIKLD 432
Query: 284 --------------------------------GSYV-EVLERYVNLGPIVD-------FC 303
SY+ EV + +N+GP + F
Sbjct: 433 YLEDWMASDVLDIKDPEELEVYGSETQTSIQITSYIFEVCDSLLNIGPCGNISMGEPAFL 492
Query: 304 VVDLERQGQGQV--VTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSST--DD 359
+ + V VT SG K+G+L +++ I + +L G + +W++ ST D+
Sbjct: 493 SEEFSNNSEPDVELVTTSGYGKNGALCVLQRSIRPQVITTFDLPGYENIWTVIDSTVSDN 552
Query: 360 PFDT-------FLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQ-LVQ 411
T FL+++ T +L E E + GF +Q T+F + N+ ++Q
Sbjct: 553 RAKTETEGTHGFLILTQDDSTMVLQTGQEIN-EVVDQSGFSTQGTTIFAGNLGSNRYIIQ 611
Query: 412 VTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEV 471
VT VRL+ E G + A+ V L + G +V L + +G T
Sbjct: 612 VTQMGVRLLQGL--EQIQHMPMDLGCPIVHASCADPYVSLLSEDGQVVLLTLREGRGTAR 669
Query: 472 KHAQ 475
HAQ
Sbjct: 670 LHAQ 673
>gi|440904368|gb|ELR54893.1| Cleavage and polyadenylation specificity factor subunit 1, partial
[Bos grunniens mutus]
Length = 1417
Score = 99.4 bits (246), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 111/466 (23%), Positives = 193/466 (41%), Gaps = 74/466 (15%)
Query: 618 VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQS 672
+ PF++ P L ++GEL I + +R IPL + + +S
Sbjct: 963 IDSFAPFHNINCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVES 1022
Query: 673 RTFAICSLKNQSCAE-----------------------ESEMHFVRLLDDQTFEFI--ST 707
+ +A+ + + C + E ++L+ ++E I +
Sbjct: 1023 KVYAVATSTSTPCTRVPRMTGEEKEFETIERDERYVHPQQEAFCIQLISPVSWEAIPNAR 1082
Query: 708 YPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-----IVE 757
L+ +E+ + + S + V Y GT + EE +GRIL+ + E
Sbjct: 1083 IELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVT-CRGRILIMDVIEVVPE 1141
Query: 758 DG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECG 811
G K +++ EKE KG V +L NG L++AI QKI L W LR EL
Sbjct: 1142 PGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFL--WSLR---ASELTGMAF 1196
Query: 812 HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI 871
+ + + +FI+ D+MKSISLL Y+ E + +RD + +V+ + D+
Sbjct: 1197 IDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNA 1256
Query: 872 YLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV 928
LG ++ + NL E RL ++H+G VN F R P
Sbjct: 1257 QLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF------WRTPCRGA 1310
Query: 929 GQIPT-----------VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHE 977
+ P+ F T++G IG++ + + Y L LQ L ++ GLN
Sbjct: 1311 AEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPR 1370
Query: 978 QWRSFNNEKKTVD--AKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
+R + +++ + +N LDG+L+ +L LS E++K + +
Sbjct: 1371 AFRMLHVDRRVLQNAVRNVLDGELLNRYLYLSPMERGELAKKIGTT 1416
Score = 42.0 bits (97), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 52/256 (20%), Positives = 110/256 (42%), Gaps = 54/256 (21%)
Query: 71 IDPDCRLIGLHLYDGLFKVIPF-------DNKGQLKEA----------FNIR-LEE--LQ 110
+DPD R + +Y V+PF +++G + E ++R L+E L
Sbjct: 136 VDPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLVGEGQRSSFLPSYIIDVRALDEKLLN 195
Query: 111 VLDIKFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDFVEG--P--WSQNNLD 161
++D++FL+G +PT+++L++ N+ A T + + + P WS +L
Sbjct: 196 IVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKVHPVIWSLTSLP 255
Query: 162 NGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAFKAIPIRPSITKAYGR 208
+ VP P+ GV+I +++Y + A P+R T+ R
Sbjct: 256 FDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFPLR---TQEGVR 312
Query: 209 VDADGS--------RYLLGDHAGLLHLL-VITHEKEKVTGLKIELLGETSIASTISYLDN 259
+ D + + ++ G +++L +IT V + + + +++ ++
Sbjct: 313 ITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLTTSMVTMEP 372
Query: 260 AVVYIGSSYGDSQLIK 275
+++GS G+S L+K
Sbjct: 373 GYLFLGSRLGNSLLLK 388
>gi|27807297|ref|NP_777145.1| cleavage and polyadenylation specificity factor subunit 1 [Bos
taurus]
gi|1706101|sp|Q10569.1|CPSF1_BOVIN RecName: Full=Cleavage and polyadenylation specificity factor subunit
1; AltName: Full=Cleavage and polyadenylation specificity
factor 160 kDa subunit; Short=CPSF 160 kDa subunit
gi|929007|emb|CAA58152.1| cleavage and polyadenylation specificity factor, 160 kDa subunit [Bos
taurus]
gi|296480730|tpg|DAA22845.1| TPA: cleavage and polyadenylation specificity factor subunit 1 [Bos
taurus]
Length = 1444
Score = 99.0 bits (245), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 111/466 (23%), Positives = 193/466 (41%), Gaps = 74/466 (15%)
Query: 618 VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQS 672
+ PF++ P L ++GEL I + +R IPL + + +S
Sbjct: 973 IDSFAPFHNINCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVES 1032
Query: 673 RTFAICSLKNQSCAE-----------------------ESEMHFVRLLDDQTFEFI--ST 707
+ +A+ + + C + E ++L+ ++E I +
Sbjct: 1033 KVYAVATSTSTPCTRVPRMTGEEKEFETIERDERYVHPQQEAFCIQLISPVSWEAIPNAR 1092
Query: 708 YPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-----IVE 757
L+ +E+ + + S + V Y GT + EE +GRIL+ + E
Sbjct: 1093 IELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVT-CRGRILIMDVIEVVPE 1151
Query: 758 DG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECG 811
G K +++ EKE KG V +L NG L++AI QKI L W LR EL
Sbjct: 1152 PGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFL--WSLR---ASELTGMAF 1206
Query: 812 HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI 871
+ + + +FI+ D+MKSISLL Y+ E + +RD + +V+ + D+
Sbjct: 1207 IDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNA 1266
Query: 872 YLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV 928
LG ++ + NL E RL ++H+G VN F R P
Sbjct: 1267 QLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF------WRTPCRGA 1320
Query: 929 GQIPT-----------VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHE 977
+ P+ F T++G IG++ + + Y L LQ L ++ GLN
Sbjct: 1321 AEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPR 1380
Query: 978 QWRSFNNEKKTVD--AKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
+R + +++ + +N LDG+L+ +L LS E++K + +
Sbjct: 1381 AFRMLHVDRRVLQNAVRNVLDGELLNRYLYLSTMERGELAKKIGTT 1426
Score = 46.6 bits (109), Expect = 0.079, Method: Compositional matrix adjust.
Identities = 62/326 (19%), Positives = 141/326 (43%), Gaps = 60/326 (18%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAM-----GDVSDRIG 60
+G + ++ + G +D L ++ + K V+++D + +L T ++ ++ D
Sbjct: 77 FFGNVMSMASVQLAGAKRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPELRDGFV 136
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPF-------DNKGQLKEA----------FN 103
+ ++ + DPD R + +Y V+PF +++G + E +
Sbjct: 137 QNVHTPRVRV-DPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLVGEGQRSSFLPSYIID 195
Query: 104 IR-LEE--LQVLDIKFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDFVEG-- 153
+R L+E L ++D++FL+G +PT+++L++ N+ A T + + +
Sbjct: 196 VRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKVH 255
Query: 154 P--WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAFKAIPI 198
P WS +L + VP P+ GV+I +++Y + A P+
Sbjct: 256 PVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFPL 315
Query: 199 RPSITKAYGRVDADGS--------RYLLGDHAGLLHLL-VITHEKEKVTGLKIELLGETS 249
R T+ R+ D + + ++ G +++L +IT V + +
Sbjct: 316 R---TQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASV 372
Query: 250 IASTISYLDNAVVYIGSSYGDSQLIK 275
+ +++ ++ +++GS G+S L+K
Sbjct: 373 LTTSMVTMEPGYLFLGSRLGNSLLLK 398
>gi|348555854|ref|XP_003463738.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
isoform 1 [Cavia porcellus]
Length = 1440
Score = 99.0 bits (245), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 111/466 (23%), Positives = 193/466 (41%), Gaps = 74/466 (15%)
Query: 618 VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQS 672
+ PF++ P L ++GEL I + +R IPL + + +S
Sbjct: 969 IDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVES 1028
Query: 673 RTFAICSLKNQSCAE-----------------------ESEMHFVRLLDDQTFEFI--ST 707
+ +A+ + + C + E ++L+ ++E I +
Sbjct: 1029 KVYAVATSTSTPCTRIPRMTGEEKEFEAIERDDRYIHPQQEAFSIQLISPVSWEAIPNAR 1088
Query: 708 YPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-----IVE 757
L+ +E+ + + S + V Y GT + EE +GRIL+ + E
Sbjct: 1089 IELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVT-CRGRILIMDVIEVVPE 1147
Query: 758 DG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECG 811
G K +++ EKE KG V +L NG L++AI QKI L W LR EL
Sbjct: 1148 PGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFL--WSLR---ASELTGMAF 1202
Query: 812 HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI 871
+ + + +FI+ D+MKSISLL Y+ E + +RD + +V+ + D+
Sbjct: 1203 IDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNA 1262
Query: 872 YLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV 928
LG ++ + NL E RL ++H+G VN F R P
Sbjct: 1263 QLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF------WRTPCRGA 1316
Query: 929 GQIPT-----------VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHE 977
+ P+ F T++G IG++ + + Y L LQ L ++ GLN
Sbjct: 1317 TEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPR 1376
Query: 978 QWRSFNNEKKTVD--AKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
+R + +++ + +N LDG+L+ +L LS E++K + +
Sbjct: 1377 AFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTMERGELAKKIGTT 1422
Score = 50.8 bits (120), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 98/495 (19%), Positives = 190/495 (38%), Gaps = 145/495 (29%)
Query: 71 IDPDCRLIGLHLYDGLFKVIPF-------DNKGQLKEA----------FNIR-LEE--LQ 110
+DPD R + +Y V+PF +++G + E ++R L+E L
Sbjct: 142 VDPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLVGEGQRSSFLPSYIIDVRALDEKLLN 201
Query: 111 VLDIKFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDFVEG--P--WSQNNLD 161
++D++FL+G +PT+++L++ N+ A T + + + P WS +L
Sbjct: 202 IVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKVHPVIWSLTSLP 261
Query: 162 NGADLLIPVPPPLCGVLIIGEETIVYCSAN-------------AFKAIPIRPSITKAYGR 208
+ VP P+ GV+I +++Y + + A P+R T+ R
Sbjct: 262 FDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTLGTTAFPLR---TQEGVR 318
Query: 209 VDADGS--------RYLLGDHAGLLHLL-VITHEKEKVTGLKIELLGETSIASTISYLDN 259
+ D + + ++ G +++L +IT V + + + +++ ++
Sbjct: 319 ITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLTTSMVTMEP 378
Query: 260 AVVYIGSSYGDSQLIKL--NLQ---------------PDAKGSYVE-------------- 288
+++GS G+S L+K LQ P +K V+
Sbjct: 379 GYLFLGSRLGNSLLLKYTEKLQEPPASTVREAADKEEPPSKKKRVDSTAGWAGSKTVPQD 438
Query: 289 ------------------------VLERYVNLGPIVDFCVVD---LERQGQG------QV 315
V + +N+GP + V + L + Q ++
Sbjct: 439 EVDEIEVYGSEAQSGTQLATYSFEVCDSMLNIGPCANAAVGEPAFLSEEFQNSPEPDLEI 498
Query: 316 VTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL--------------RSSTDDPF 361
V CSG K+G+L +++ I + EL G MW++ S +P
Sbjct: 499 VVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEEEETPKAEGSEQEPS 558
Query: 362 DT----------FLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQ-LV 410
FL++S T IL E+ E + GF +Q T+F + N+ +V
Sbjct: 559 APEAEDDGRRHGFLILSREDSTMILQTG--QEIMELDTSGFATQGPTVFAGNIGDNRYIV 616
Query: 411 QVTSGSVRLVSSTSR 425
QV+ +RL+ ++
Sbjct: 617 QVSPLGIRLLEGVNQ 631
>gi|402588688|gb|EJW82621.1| splicing factor 3b [Wuchereria bancrofti]
Length = 601
Score = 99.0 bits (245), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 117/515 (22%), Positives = 209/515 (40%), Gaps = 79/515 (15%)
Query: 7 YGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTDN 65
+G I +L FR G ++D++ I ++ + +L+++A+ R + + G R
Sbjct: 90 FGIIRSLLAFRLTGGSKDYIVIGSDAGRIVILEYNAQKV-CFERVHQETFGKTGCRRIVP 148
Query: 66 GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YGC 120
GQ +DP R LIG L ++ D L + + + + + G
Sbjct: 149 GQFLAVDPKGRAILIGAVERQKLVYIMNRDASANLTISSPLEAHKSHCICYSVVGVDVGF 208
Query: 121 AKPTIVVLYQDNKDARHVKTYEVALK--------DKDFV----EGPWS----------QN 158
PT L D ++ H T +A K + D V +GP
Sbjct: 209 ENPTFACLEVDYEEVDHDPTGHLATKIPQTLTFYELDLVPGGQDGPSGVIVCCENYLVYK 268
Query: 159 NLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLL 218
NL + D+ PVP + I+ C+A + +L+
Sbjct: 269 NLGDQPDIKCPVPRRRNELDDCDRTVIIVCAATHKTKLMYF----------------FLV 312
Query: 219 GDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNL 278
G + + + E + VT LKI+ +++ + L ++ S +G+ L ++
Sbjct: 313 QTDQGDIFKVTLESEHDIVTELKIKYFDTIPVSNAMCILKTGFLFTASEFGNHHLYQIAH 372
Query: 279 QPDAK----------------------GSYV-EVLERYVN-----LGPIVDFCVVD-LER 309
D GS + ++L R++ L C++D L
Sbjct: 373 LGDEDDEPEFSSRMQLEEGETFFLLLVGSQIWQLLTRWIAYHRLFLPTFKVHCMIDDLAN 432
Query: 310 QGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTDDPFDTFLVVS 368
+ Q+ T G +++++RNG+ + E A EL G +W+++ + DD FD+ +VVS
Sbjct: 433 EDSPQIYTLVGRGALSAVKVLRNGLEVTEMAVSELPGNPNAVWTVKRNIDDKFDSHIVVS 492
Query: 369 FISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELR 428
F++ T +L++ + +EE GF T TL C + L+QV +R + + R
Sbjct: 493 FVNATLVLSIG--ETVEEVTDSGFLGTTPTLGCALIGDDALLQVYPDGIRHIRADRRV-- 548
Query: 429 NEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI 463
NEWK+P ++ N QV +A GG LVY E+
Sbjct: 549 NEWKAPGKRTIMKCALNRRQVAIALAGGELVYFEL 583
>gi|307190910|gb|EFN74734.1| Cleavage and polyadenylation specificity factor subunit 1 [Camponotus
floridanus]
Length = 1418
Score = 99.0 bits (245), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 88/332 (26%), Positives = 156/332 (46%), Gaps = 39/332 (11%)
Query: 731 YYCVGTAYVLPEENEPTKGRILVF-IVE----------DGKLQLIAEKETKGAVYSLNAF 779
Y +GT Y E+ ++GRIL+F I+E + + I KE KG + ++
Sbjct: 1093 YIVLGTNYNYGEDIT-SRGRILIFDIIEVVPEPGQPLTKNRFKQIYAKEQKGPITAITQV 1151
Query: 780 NGKLLAAINQKIQLYKWMLRDD---GTRELQSECGHHGHILALYVQTRGDFIVVGDLMKS 836
+G L++A+ QKI Y W L+D+ G + ++ H +L++ I++ D+ KS
Sbjct: 1152 SGFLVSAVGQKI--YIWQLKDNDLVGVAFIDTQIYIH-QMLSI-----KSLILIADVYKS 1203
Query: 837 ISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG-----AENNFNLFTVRKNSEGA 891
ISLL ++ E + +RD+ + +E L D+ LG E+N LF + S +
Sbjct: 1204 ISLLRFQEEYRTLSLVSRDFRPAEVYTIEYLIDNTNLGFFLADGESNLALFMYQPESRES 1263
Query: 892 TDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIP------TVIFGTVNGVIGV 945
++ L ++HLG+ VN F + P +D Q ++ T++G +G
Sbjct: 1264 LGGQK--LIRKADFHLGQKVNTFFRIRCRVSDPANDKKQFSGADKRHVTMYATLDGSLGY 1321
Query: 946 IASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNN---EKKTVDAKNFLDGDLIES 1002
I +P + Y L LQ L I + GLN + +R + A+ +DGDL+
Sbjct: 1322 ILPVPEKTYRRLLMLQNVLVTHICHIAGLNPKSYRQTYKSYIRNQGNPARGIIDGDLVWR 1381
Query: 1003 FLDLSRTRMDEISKTMNVSVEELCKRVEELTR 1034
+L L +++K + V+E+ + + E+ R
Sbjct: 1382 YLFLPNNEKTDVAKKIGTRVQEIIEDITEIDR 1413
Score = 55.5 bits (132), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 67/327 (20%), Positives = 139/327 (42%), Gaps = 56/327 (17%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRP--T 63
++G I +++ G +D L ++ K V+++D + +L T ++ + + T
Sbjct: 73 LHGNIMSMQAVHLIGSQRDSLLLSFRDAKLSVVEYDQDIHDLRTVSLHYFEEEEIKDGWT 132
Query: 64 DNGQIGII--DPDCRLIGLHLYDGLFKVIPF-------------------DNKGQLKEAF 102
++ I I+ DP+ R + +Y V+PF NK + ++
Sbjct: 133 NHHHIPIVRVDPEGRCAIMLIYGRKLVVLPFRKDPSLDDGDLLDSAKLTSSNKTPILSSY 192
Query: 103 NIRLEELQ-----VLDIKFLYGCAKPTIVVLYQDNKD------ARHVKTYEVALK---DK 148
I L+ L+ V+D++FLYG +PT+++LY+ + R VA+ +
Sbjct: 193 MIVLKTLEEKMDNVIDLQFLYGYYEPTLLILYEPVRTFSGRIAVRQDTCAMVAISLNIQQ 252
Query: 149 DFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAFKA 195
WS +NL ++PV PL G LI+ +++Y + A+
Sbjct: 253 RVHPIIWSVSNLPFDCYQVVPVKKPLGGTLIMAVNSLIYLNQSIPPYGVSLNSLADTSTN 312
Query: 196 IPIRP----SITKAYGRVD-ADGSRYLLGDHAGLLHLL-VITHEKEKVTGLKIELLGETS 249
P++P ++ +V G R ++ +G L++L + V G + +
Sbjct: 313 FPLKPQEGVKMSLEGSQVAFISGDRLVISLKSGELYVLSLFADSMRSVRGFHFDKAAASV 372
Query: 250 IASTISYLDNAVVYIGSSYGDSQLIKL 276
+ S + ++ +++GS G+S L++
Sbjct: 373 LTSCVCMCEDNYLFLGSRLGNSLLLRF 399
Score = 42.4 bits (98), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 34/117 (29%), Positives = 58/117 (49%), Gaps = 12/117 (10%)
Query: 314 QVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP---------FDTF 364
++VT SG K+G+L +++ I + EL G + MW++ + ++ F
Sbjct: 506 ELVTTSGYGKNGALCVLQRSIRPQVVTTFELPGCEDMWTVIGTLNNDEQVKAEAEGSHAF 565
Query: 365 LVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQ-LVQVTSGSVRLV 420
L++S T IL E+ E + GF +Q T+F + N+ +VQVT VRL+
Sbjct: 566 LILSQEDSTMILQTG--QEINEVDQSGFSTQGSTVFAGNLGANRYIVQVTQMGVRLL 620
>gi|229335612|ref|NP_001108153.2| cleavage and polyadenylation specificity factor subunit 1 [Danio
rerio]
Length = 1449
Score = 98.6 bits (244), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 114/477 (23%), Positives = 209/477 (43%), Gaps = 68/477 (14%)
Query: 618 VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQS 672
+ PF++ P L K+GEL I + +R IPL + + +S
Sbjct: 978 IESFSPFHNINCPKGFLYFNKQGELRISVLPTYLSYDAPWPVRKIPLRCTVHYVSYHVES 1037
Query: 673 RTFAICSLKNQSC-------AEESEMHFV----------------RLLDDQTFEFI--ST 707
+ +A+C+ + C EE E + +L+ ++E I +
Sbjct: 1038 KVYAVCTSVKEPCTRIPRMTGEEKEFETIERDERYIHPQQDKFSIQLISPVSWEAIPNTR 1097
Query: 708 YPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-----IVE 757
L+ +E+ + + + V Y +GT + EE +GRIL+ + E
Sbjct: 1098 VDLEEWEHVTCMKTVALKSQETVSGLKGYVALGTCLMQGEEVT-CRGRILILDVIEVVPE 1156
Query: 758 DG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDD---GTRELQS 808
G K +++ EKE KG V +L +G L++AI QKI L W L+D+ G + +
Sbjct: 1157 PGQPLTKNKFKVLYEKEQKGPVTALCHCSGFLVSAIGQKIFL--WSLKDNDLTGMAFIDT 1214
Query: 809 ECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILD 868
+ +I +Y + +FI+ D+MKSISLL Y+ E + +RD + ++E +
Sbjct: 1215 QL----YIHQMY--SIKNFILAADVMKSISLLRYQPESKTLSLVSRDAKPLEVYSIEFMV 1268
Query: 869 DDIYLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPD 925
D+ LG ++ + NL E RL ++++G VN F L
Sbjct: 1269 DNNQLGFLVSDRDKNLMVYMYLPEAKESFGGMRLLRRADFNVGSHVNAFWRMPCRGTLDT 1328
Query: 926 SDVGQIP-----TVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWR 980
++ + F T++G +G++ + + Y L LQ L ++ GLN + +R
Sbjct: 1329 ANKKALTWDNKHITWFATLDGGVGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPKAFR 1388
Query: 981 SFNNEKKTVD--AKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRL 1035
+ +++T+ KN LDG+L+ +L LS E++K + + + + + E+ R+
Sbjct: 1389 MLHCDRRTLQNAVKNILDGELLNKYLYLSTMERSELAKKIGTTPDIILDDLLEIERV 1445
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 104/565 (18%), Positives = 212/565 (37%), Gaps = 148/565 (26%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRP--T 63
++G + ++ + G +D L ++ + K V+++D + +L T ++ + R
Sbjct: 74 LFGNVMSMASVQLVGTNRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPELRDGFV 133
Query: 64 DNGQIGII--DPDCRLIGLHLYDGLFKVIPF---------------DNKGQLKEAFNIRL 106
N I ++ DP+ R + +Y V+PF K ++ I +
Sbjct: 134 QNVHIPMVRVDPENRCAVMLVYGTCLVVLPFRKDTLADEQEGIVGEGQKSSFLPSYIIDV 193
Query: 107 EEL-----QVLDIKFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDFVEGP-- 154
EL ++D+KFL+G +PT+++L++ N+ A T + + ++
Sbjct: 194 RELDEKLLNIIDMKFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNIMQKVHP 253
Query: 155 --WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAFKAIPIR 199
WS +NL + ++ VP P+ GV++ +++Y + N A P+R
Sbjct: 254 VIWSLSNLPFDCNQVMAVPKPIGGVVVFAVNSLLYLNQSVPPFGVSLNSLTNGTTAFPLR 313
Query: 200 PSITKAYGRVDADGSRYLLGD------HAGLLHLL-VITHEKEKVTGLKIELLGETSIAS 252
P + +D + ++ D G +++L +IT V + + + +
Sbjct: 314 PQ-EEVKITLDCSQASFITSDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLTT 372
Query: 253 TISYLDNAVVYIGSSYGDSQLIKLN------------------LQPDAKGSYV------- 287
+ ++ +++GS G+S L++ QP K V
Sbjct: 373 CMMTMEPGYLFLGSRLGNSLLLRYTEKLQETPMEEGKENEEKEKQPPNKKKRVDSNWAGC 432
Query: 288 -------------------------------EVLERYVNLGPIVDFCVVD---LERQGQG 313
EV + +N+GP + + L + Q
Sbjct: 433 PGKGNLPDELDEIEVYGSEAQSGTQLATYSFEVCDSILNIGPCASASMGEPAFLSEEFQT 492
Query: 314 ------QVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDPFDT---- 363
+VV CSG K+G+L +++ I + EL G MW++ + P
Sbjct: 493 NPEPDLEVVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCHDMWTVIYCEEKPEKPSAEG 552
Query: 364 ----------------------FLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFC 401
FL++S T IL E+ E + GF +Q T++
Sbjct: 553 DGESPEEEKREPTIEDDKKKHGFLILSREDSTMILQTG--QEIMELDTSGFATQGPTVYA 610
Query: 402 HDAIYNQ-LVQVTSGSVRLVSSTSR 425
+ N+ ++QV+ +RL+ ++
Sbjct: 611 GNIGDNKYIIQVSPMGIRLLEGVNQ 635
>gi|242021233|ref|XP_002431050.1| Cleavage and polyadenylation specificity factor 160 kDa subunit,
putative [Pediculus humanus corporis]
gi|212516279|gb|EEB18312.1| Cleavage and polyadenylation specificity factor 160 kDa subunit,
putative [Pediculus humanus corporis]
Length = 1409
Score = 98.6 bits (244), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 106/438 (24%), Positives = 194/438 (44%), Gaps = 66/438 (15%)
Query: 654 IRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAE------ESEMHFVRLLDDQ------- 700
+R +PL P + + +S+T+ + + ++ E E + H V DD+
Sbjct: 978 VRKVPLRCTPHMVTYHLESKTYCLITSSSEPSNEYFRFNGEDKEHSVEDRDDRFPLPLQD 1037
Query: 701 TFEFISTYP------------LDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEE 743
F + P LD +E+ + + + S + Y VGT Y E
Sbjct: 1038 KFSIVLFSPVSWEVIPNTKMELDEWEHVTCVKTVNLSYEGTRSGLKGYVAVGTNYNY-SE 1096
Query: 744 NEPTKGRILVF-IVE----------DGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQ 792
+ +KGRIL++ I+E + + + KE KG V +L G L+ A+ QKI
Sbjct: 1097 DITSKGRILIYDIIEVVPEPGQPLTKNRFKTVYAKEQKGPVTALCHVLGFLVTAMGQKI- 1155
Query: 793 LYKWMLRDD---GTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAI 849
Y W L+D+ G + ++ H I + I+V D+ KSISLL ++ E +
Sbjct: 1156 -YIWQLKDNDLVGIAFIDTQIYIHQMI------SVKSLILVADVYKSISLLRFQEEYRTL 1208
Query: 850 EERARDYNANWMSAVEILDDDIYLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYH 906
+RD+ + A+E+L D+ +G ++ N+ E +L ++H
Sbjct: 1209 SLVSRDFRPCEVYAIELLLDNTQMGFLISDVEMNIIMYMYKPEDRDSVGGQKLLRKADFH 1268
Query: 907 LGEFVN-----RFRHGSLVMRLPDSDVG--QIPTVIFGTVNGVIGVIASLPHEQYLFLEK 959
LG+ +N R R G D +G + +F T++G +G + +P + Y L+
Sbjct: 1269 LGQHINSWFRIRCRLGDQAENY-DFPIGAEKRHISMFATLDGALGYLLPIPEKTYRRLQM 1327
Query: 960 LQTNLRKVIKGVGGLNHEQWRSFNNEKKTV--DAKNFLDGDLIESFLDLSRTRMDEISKT 1017
LQ L I + GLN + +R + + +K + K +DG+LI +L L+ +++K
Sbjct: 1328 LQNILVYHIPHLAGLNPKAFRIYKSGRKLLGNPCKRIVDGELIWMYLSLTVMEKQDVAKK 1387
Query: 1018 MNVSVEELCKRVEELTRL 1035
M ++++ + + + RL
Sbjct: 1388 MGSKMDDIIEDIAVIERL 1405
Score = 65.1 bits (157), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 70/331 (21%), Positives = 137/331 (41%), Gaps = 59/331 (17%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMG--DVSDRIGRPT 63
++ + +++ G ++D L ++ K CV+++D +S +L T ++ + D G T
Sbjct: 73 LFANVMSMQAVSLAGSSRDALLLSFREAKLCVVEYDPDSHDLRTLSLHYFEEEDMKGGWT 132
Query: 64 DNGQIGII--DPDCRLIGLHLYDGLFKVIPFDNKGQLKE--------------------- 100
++ I + DP+ R + +Y ++PF + +L +
Sbjct: 133 NHYDIPYVRVDPEGRCAAMLVYGRKLVILPFRRESKLDDPDIALLDPHSSSVATAKAPVL 192
Query: 101 -AFNIRLEEL-----QVLDIKFLYGCAKPTIVVLYQDNKD--------ARHVKTYEVALK 146
++ I L E+ V+DI+FLYG +PT+++LY+ K + V+L
Sbjct: 193 SSYTITLREIDEKLENVIDIQFLYGYYEPTLLILYEPLKTFAGRIAVRSDTCAMIAVSLN 252
Query: 147 DKDFVEGP-WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVY---------CSANAFKAI 196
+ V WS NL IPVP PL G LI ++Y S N+
Sbjct: 253 IQQRVHPAIWSVGNLPFNCTQAIPVPKPLGGTLIFSVNALIYLNQSIPPFGVSVNSIAEN 312
Query: 197 PIRPSITKAYG-RVDADGS--------RYLLGDHAGLLHLLVITHEK-EKVTGLKIELLG 246
+ G ++ +GS R +L G L++L + + V G +
Sbjct: 313 STNFQLKIQEGVKITLEGSQATFISHDRLVLSLKTGELYVLSLLADNIRSVRGFHFDKAA 372
Query: 247 ETSIASTISYLDNAVVYIGSSYGDSQLIKLN 277
+ + + + ++ +++GS G+S L++
Sbjct: 373 ASVLTTCLCVCEDKYLFLGSRLGNSLLLRFT 403
Score = 45.8 bits (107), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 52/180 (28%), Positives = 79/180 (43%), Gaps = 22/180 (12%)
Query: 309 RQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL---------RSSTDD 359
R ++VT SG K+G++ +++ I + EL G MW++ + DD
Sbjct: 501 RDPDLELVTTSGHGKNGAICVLQRTIRPQVVTTFELPGCLDMWTVIGPQSDSGPTQAEDD 560
Query: 360 PFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQ-LVQVTSGSVR 418
FL++S T IL E+ E + GF +Q T+F + N+ +VQV+ VR
Sbjct: 561 ISHAFLILSQKDSTMILQTG--QEINEVDHSGFNTQGPTIFAGNLASNKYIVQVSKAGVR 618
Query: 419 LVSSTSRELRNEWKSP--PGYSVNVATANASQVLLATGGGHLVYLEI----GDGILTEVK 472
L+ R L P G SV A+ V L T G +V L + G G L+ K
Sbjct: 619 LL----RGLEQIQHIPLDLGSSVVHASTADPYVALLTEDGQVVLLTLRESRGQGRLSVFK 674
>gi|410911304|ref|XP_003969130.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Takifugu rubripes]
Length = 1444
Score = 98.6 bits (244), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 115/467 (24%), Positives = 200/467 (42%), Gaps = 76/467 (16%)
Query: 618 VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQS 672
+ PF++ P L K+GEL I + +R IPL + + +S
Sbjct: 973 IESFSPFHNINCPKGFLYFNKQGELRISVLPTYLSYDAPWPVRKIPLRCTVHYVSYHVES 1032
Query: 673 RTFAICSLKNQSC-------AEESEMHFV----------------RLLDDQTFEFISTYP 709
+ +A+C+ + C EE E + +L+ ++E I
Sbjct: 1033 KVYAVCTSLKELCTRIPRMTGEEKEYETIERDERYINPQQDKFSIQLISPVSWEAIPNTR 1092
Query: 710 LD--TFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-----IVE 757
+D +EY + + + V Y GT + EE +GRIL+ + E
Sbjct: 1093 IDLEEWEYVTCMKTVALRSQETVSGLKGYIAAGTCLMQGEEVT-CRGRILILDVIEVVPE 1151
Query: 758 DG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRD-DGTRELQSEC 810
G K +++ EKE KG V +L NG L++AI QKI L W+L+D D T +
Sbjct: 1152 PGQPLTKNKFKVLYEKEQKGPVTALCHCNGYLVSAIGQKIFL--WVLKDNDLTGMAFIDT 1209
Query: 811 GHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDD 870
H H + + +FI+ DLMKS+SLL Y+ E + +RD + ++E + D+
Sbjct: 1210 QLHIHQM----MSIKNFILAADLMKSVSLLRYQEESKTLSLVSRDAKPLEVYSIEFMVDN 1265
Query: 871 IYLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDS- 926
LG ++ + NL+ E RL +++ G +N F R+P
Sbjct: 1266 NQLGFLVSDRDKNLYVYMYLPEAKESFGGMRLLRRADFNAGANINTF------WRMPCRG 1319
Query: 927 --DVGQIPTVI--------FGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNH 976
+ G + F T++G +G++ + + Y L LQ L ++ GLN
Sbjct: 1320 ALEAGSRKAMTWDNKHITWFATLDGGVGLLLPMQEKTYRRLLMLQNALTTMLSHHAGLNP 1379
Query: 977 EQWRSFNNEKKTVD--AKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
+ +R + +++++ KN LDG+L+ +L LS E++K + +
Sbjct: 1380 KAFRMLHCDRRSLQNPVKNILDGELLNKYLYLSMMERSELAKKIGTT 1426
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 64/322 (19%), Positives = 135/322 (41%), Gaps = 52/322 (16%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRP--T 63
++G + ++E + G +D L ++ + K V+++D + +L T ++ + R
Sbjct: 74 LFGNVMSMESVQLVGANRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEELELRDGFV 133
Query: 64 DNGQIGII--DPDCRLIGLHLYDGLFKVIPFDN---------------KGQLKEAFNIRL 106
N I I+ DP+ R + +Y V+PF K + I +
Sbjct: 134 QNVHIPIVRVDPENRCAVMLIYGTKLVVLPFRKDTLTDEQEVGVGEGPKSSFLPTYIIDV 193
Query: 107 EEL-----QVLDIKFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDFVEGP-- 154
EL ++D+KFL+G +PT+++L++ N+ A T + + ++
Sbjct: 194 RELDEKLLNIIDMKFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNIMQKVHP 253
Query: 155 --WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAFKAIPIR 199
WS +NL ++ VP P+ GV++ +++Y + N A P+R
Sbjct: 254 VIWSLSNLPFDCTQVMAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTNGTTAFPLR 313
Query: 200 ----PSITKAYGRVDADG-SRYLLGDHAGLLHLL-VITHEKEKVTGLKIELLGETSIAST 253
IT + D + ++ G +++L +IT V + + + +
Sbjct: 314 LQDEVKITLDCSQADFIAYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLTTC 373
Query: 254 ISYLDNAVVYIGSSYGDSQLIK 275
+ ++ +++GS G+S L+K
Sbjct: 374 MVTMEPGYLFLGSRLGNSLLLK 395
Score = 42.4 bits (98), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 41/171 (23%), Positives = 75/171 (43%), Gaps = 35/171 (20%)
Query: 288 EVLERYVNLGPIVDFCVVD---LERQGQG------QVVTCSGAYKDGSLRIVRNGIGINE 338
EV + +N+GP + + + L + Q +VV CSG K+G+L +++ I
Sbjct: 465 EVCDSILNIGPCANASMGEPAFLSEEFQSNPEPDLEVVVCSGHGKNGALSVLQRSIRPQV 524
Query: 339 QASVELQGIKGMWSL-----------------RSSTDDPFDT------FLVVSFISETRI 375
+ EL G MW++ + T+ P + FL++S T I
Sbjct: 525 VTTFELPGCHDMWTVISNEPVQKEQEETEREGKEKTEPPAEEDTKKHGFLILSREDSTMI 584
Query: 376 LAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQ-LVQVTSGSVRLVSSTSR 425
L E+ E + GF +Q T+F + N+ ++QV+ +RL+ ++
Sbjct: 585 LQTG--QEIMELDTSGFATQGPTVFAGNIGDNKYIIQVSPMGIRLLEGVTQ 633
>gi|71028348|ref|XP_763817.1| hypothetical protein [Theileria parva strain Muguga]
gi|68350771|gb|EAN31534.1| hypothetical protein TP04_0182 [Theileria parva]
Length = 1231
Score = 98.6 bits (244), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 101/402 (25%), Positives = 183/402 (45%), Gaps = 47/402 (11%)
Query: 217 LLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKL 276
LL G L+ + ++H+ V+ + + +AST+ L + +++GS GD +L +
Sbjct: 304 LLLSEYGDLYKIELSHDDTTVSEIVVRYFDTVDVASTMCILRSGYLFVGSESGDHKLYQF 363
Query: 277 -------------NLQPDAKGSYV----------EVLERYVNLGPIVDFCVVDLERQGQG 313
+L PDAK + + V++R ++ VD VVD+
Sbjct: 364 TALENGDKDVICTSLHPDAKNAIIAFKPRKLQNLAVVDRMNSMALAVDMKVVDVLGLNNY 423
Query: 314 QVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTDDPFDTFLVVSFISE 372
+ G + D L+ +R G E A EL G K +++++ S + +D ++++SF
Sbjct: 424 DIFVACGRWYDSRLKCLRYGFNTEELAFNELPGRPKAVFTIK-SLESNYDEYIIISFQGN 482
Query: 373 TRILAMNLEDELEETEIEGFCSQTQTLF-CHDAIYNQ------LVQVTSGSVRLVSSTSR 425
T +L++ + +EE F + TL CH + VQV G R ++
Sbjct: 483 TLVLSIG--EAVEEVTDSFFLTSITTLHSCHMGSTSGSVGGGIFVQVHDGGFRYLTG--- 537
Query: 426 ELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGI---LTEVKHAQLEYEISC 482
++ EWK V +A N +Q++L GG ++Y ++ + L EV L E++C
Sbjct: 538 DVVKEWKVQTTKRVKLADNNNTQLVLVLTGGEVIYFQLTEADVLELVEVGRRNLSTEVTC 597
Query: 483 LDI-NPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPRSV-LLCAFE 539
L + + + N ++ G +I VR+ L L L + + LG +P SV LL
Sbjct: 598 LAVQHQVSGNK--AEFCCCGSIDNI-VRVMKLDKTLKLCSSQILGNNSLPESVALLTTSV 654
Query: 540 GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLR 581
YL L +G L+ +++ G L+D++ LG +P+ L+
Sbjct: 655 DELYLYVGLNNGVLIRNTVDV-AGNLSDQESRFLGAKPLRLK 695
Score = 68.6 bits (166), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 76/283 (26%), Positives = 127/283 (44%), Gaps = 34/283 (12%)
Query: 761 LQLIAEKETKGAVYSLNAFNGKLL-AAINQKIQLY----KWMLRDDGTRELQSECGHHGH 815
++L+ TKG + N + KLL AI K+++Y K ML R L S +G
Sbjct: 967 IKLLHVTNTKGWIRCFNNYENKLLLCAIGSKLRMYALGRKQMLLKGEHRSLTS----NGF 1022
Query: 816 ILALYVQTRGDFIVVGDLMKSISLL---IYKHEEGAIEERARDYNANWMSAVEILDDDIY 872
+ ++ G I GD+ +S+ LL Y + G E W+S +E+LD
Sbjct: 1023 ---MDIKVVGSRIYCGDIRESVQLLRLKFYGEDLGEFELTTTSTGPRWLSTMELLDYSTV 1079
Query: 873 LGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIP 932
+ + +LF R +E+ R EYH N+F G +V L + I
Sbjct: 1080 IAGDKFDSLFVSRV----PHNEDVVRSNYF-EYH-----NQFHLGDIVTSLQRVRINPIH 1129
Query: 933 T--VIFGTVNGVIGVIASLPHEQYL-FLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTV 989
+ V++ T+ G IGV+ + L FL+ L+ L I V G + +RS+
Sbjct: 1130 SEVVLYTTLMGSIGVLIPFVSKDELDFLQHLEMLLCNQIDTVTGREVQMFRSY-----YF 1184
Query: 990 DAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
+N +DGDL E F L + + ++ +N+ V E+ K+++ +
Sbjct: 1185 PVQNIVDGDLCEMFTALGDEKFN-VANQLNLKVAEVVKKLKNI 1226
>gi|84996279|ref|XP_952861.1| splicing factor 3b subunit [Theileria annulata strain Ankara]
gi|65303859|emb|CAI76236.1| splicing factor 3b subunit, putative [Theileria annulata]
Length = 1340
Score = 98.6 bits (244), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 105/428 (24%), Positives = 182/428 (42%), Gaps = 73/428 (17%)
Query: 217 LLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKL 276
LL G L+ + ++H+ V+ + I + ++ L + ++IGS GD +L +
Sbjct: 357 LLLSEYGDLYKIELSHDDNTVSEVVIRYFDTVDVGISMCILRSGYLFIGSESGDHKLYQF 416
Query: 277 -------------NLQPDAKGSYVE----------VLERYVNLGPIVDFCVVDLERQGQG 313
+L PDAK + + V++R ++G +VD V D+
Sbjct: 417 TSLDNGDKDVICTSLHPDAKNAIIAFKPRVLQNLVVVDRMSSMGLVVDMKVADVMGLNNY 476
Query: 314 QVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTDDPFDTFLVVSFISE 372
+ G + + L+ +R G E A EL G K +++++ S + FD ++++SF
Sbjct: 477 DIFVACGRWYNSRLKCLRYGFNTEELAFNELPGRPKHVFTIK-SLESNFDEYIIISFQGN 535
Query: 373 TRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQL----------------------- 409
T +L++ + +EE F + TL C Y
Sbjct: 536 TLVLSIG--EAVEEVTDSFFLTSITTLHCCYMSYRNGVSSSGVGGVSGVSGVGTVTNGVP 593
Query: 410 ------------VQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGH 457
VQ+ G R +S ++ EWK V +A N +Q++L GG
Sbjct: 594 VTGGTGITRGIYVQIHDGGYRYLSG---DIIKEWKVQSTKRVKLADNNDTQLILVLTGGE 650
Query: 458 LVYLEIGDG---ILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL- 513
++Y ++ D L EV L EI+CL I + + ++ G +I VRI L
Sbjct: 651 IIYFQLTDTEVPELVEVGRRNLSTEITCLAIQH-PNSGTKAEFCCCGSIDNI-VRIMKLD 708
Query: 514 PDLNLITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSL 573
+L L + + LG +P SV L + I YL L +G L+ L+M G L D++ +
Sbjct: 709 KNLKLCSSQILGNNSLPESVTLLTNDEI-YLYVGLNNGVLIRNTLDM-IGNLIDQESRFM 766
Query: 574 GTQPITLR 581
GT+P+ L+
Sbjct: 767 GTKPLKLK 774
Score = 68.9 bits (167), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 92/381 (24%), Positives = 160/381 (41%), Gaps = 47/381 (12%)
Query: 675 FAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCV 734
F + LKN +R+++ E I+ E + +C +S V
Sbjct: 979 FEVMELKNYRAGVGKWSSCIRIINPINLETIAKLLFTENEAATTAYTCIL--NSIQLLIV 1036
Query: 735 GT---AYVLPEE---NEPTKGRILVF--------IVEDGKLQLIAEKETKGAVYSLNAFN 780
GT A++ P N+ + I V+ +V ++L+ TKG + N +
Sbjct: 1037 GTIKNAHLYPTHMDVNDEVESCIRVYEYDSNYNNLVNGFNIKLLHITNTKGWIRCFNNYE 1096
Query: 781 GKLL-AAINQKIQLY----KWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMK 835
KLL AI K+++Y K ML R L S HG + ++ G I GD+ +
Sbjct: 1097 NKLLLCAIGSKLRMYSLGKKQMLLKGEHRSLTS----HGF---MDIKVIGSRIYCGDIRE 1149
Query: 836 SISLL---IYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGAT 892
S+ LL Y + G E W+S++E+LD + A + F+ V +
Sbjct: 1150 SVQLLRIKFYGEDLGEFELTTTSTGPRWLSSMELLDYSTVI-AGDKFDSIFVSRVPHNED 1208
Query: 893 DEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHE 952
E ++HLG+ V F+ + + S+V V++ T+ G IGV+ +
Sbjct: 1209 VVRSNYFEYHNQFHLGDIVTSFQR--VRINPIHSEV-----VLYTTLMGSIGVLVPFVSK 1261
Query: 953 QYL-FLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRM 1011
L FL+ L+ + I V G + +RS+ +N +DGDL E ++ L
Sbjct: 1262 DELDFLQHLEMLMCNQIDTVTGREVQMFRSY-----YFPVQNIVDGDLCEMYMTLDDKY- 1315
Query: 1012 DEISKTMNVSVEELCKRVEEL 1032
I+ +N+ V E+ K+++ +
Sbjct: 1316 -NIANQLNLKVNEIIKKLKNI 1335
>gi|320581947|gb|EFW96166.1| hypothetical protein HPODL_2449 [Ogataea parapolymorpha DL-1]
Length = 1203
Score = 98.2 bits (243), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 173/822 (21%), Positives = 326/822 (39%), Gaps = 122/822 (14%)
Query: 295 NLGPIVDFCVVDLERQGQGQVV-TCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWS 352
N+ P++D + + V+ + G +L+++ + + +E + EL + + ++
Sbjct: 415 NINPLIDAKLYNSNETTSLPVIYSLCGTGPRSALKVLNHELPFSEVVTQELPSVVQKVFV 474
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
+ + DD +D ++V+SF+ T L + + +++EE E G T TL LVQV
Sbjct: 475 SKLNRDDEYDKYIVLSFVDGT--LVLRIGEDVEEVENSGLVLDTNTLGVFQVGSTALVQV 532
Query: 413 TSGSVRLVSSTSRELRN--EWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG--DGIL 468
VR V + +W P G V +A SQ+ +A +VY E+ D ++
Sbjct: 533 HPNGVRQVFYVDETPQKTIDWAPPAGIRVLHCSATNSQLAIALSNREIVYFELDDLDKLI 592
Query: 469 TEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGE 527
+H +L +I+ L + + + V D ++ +FS P+ L E +G E
Sbjct: 593 EYNEHKELAAQITALTLGEVQAGTARFPYLLVA-CQDKTLTVFSTDPESTL---EIVGEE 648
Query: 528 IIPRSV-LLCAF------------------------EGISYLLCALGDGHLLNFLLNMKT 562
I+ + L AF + Y+ + G ++ +T
Sbjct: 649 ILSSAASSLMAFYMKDTAIVLNKDAGEDDEDEPELATSLLYVHIGMESGVYARLQMDPQT 708
Query: 563 GELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMC 622
GEL++ + G +P+ L + V S R + Y+ + + + H C
Sbjct: 709 GELSNPRNKYTGPRPVQLSKIEAVGQNAVCIFSARTYLGYTRPSEFKITPLTKPVFQHAC 768
Query: 623 PFNSAAFPDSLAIAKEG-ELTIGTIDDIQK-LHIRSIPLGEHPRRICHQEQSRTFAICSL 680
F S P++ +A G LTI TID ++ I SI L P+ +C
Sbjct: 769 SFRSEDIPENGVLAVYGNSLTIMTIDQLENDTIIESIALRNTPKYMC------------- 815
Query: 681 KNQSCAEESEMHFVRLLDDQTFEFIST----------YPLDTFEYGCSILSCSFSDDSNV 730
C +++ M +V D T I YP + I + +D S V
Sbjct: 816 ---DCTDKNGMMYVLEGDIDTKRDIEEERIEEYQQFGYPRAPGAWASCIQTVVVADKSVV 872
Query: 731 YYC-VG--TAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGA---VYSLNAFNGKL- 783
+G TA+ N ++ V K Q ++ GA +Y +++ NG L
Sbjct: 873 QTIDLGDETAFRACWVNFESRPGEPFLAVSSAKNQQLSPPRNDGAYIHLYRISS-NGTLE 931
Query: 784 -------------LAAINQKIQL---YKWMLRDDGTRELQSECGHHGHILALY-VQTRGD 826
L A K+ + + +L D G ++L + ++T+G
Sbjct: 932 LFQTTKTEHLSLALTAFQGKLLVGAKNELILYDIGQKQLVKRSSTRLECYEIVDLKTQGF 991
Query: 827 FIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR- 885
++V D+ S+ +YK E + + D ++ +LD D + + N+ +R
Sbjct: 992 RVIVSDVRDSVRYTVYKPLENSFVDFIDDTMQRHVTRTLLLDYDTVVVGDKFGNISVLRC 1051
Query: 886 -KNSEGATDEE---------RGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVI 935
+ +DE+ R +L+ Y++G+ F+ GSL +G ++I
Sbjct: 1052 PEQISEMSDEDNHGFLVKMRRTKLDNPVNYYVGDMPTFFQKGSLT-------IGGAESII 1104
Query: 936 FGTVNGVIGVIASLPH-EQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNF 994
+G + G +G + + + F ++LQ R +I L ++ F KN
Sbjct: 1105 YGCLQGQMGCLYPMKSLSEINFFKELQ---RLIIHEFTSLTDREYLKFKGYYNP--PKNS 1159
Query: 995 LDGDLIESFLDLS-------RTRMDEISKTMNVSVEELCKRV 1029
+DGDLIE + L T+MD + + ++ + ++ R+
Sbjct: 1160 IDGDLIEEYYRLGPEKRIRIATKMDRLPRDIDRRISDMRSRI 1201
>gi|195056749|ref|XP_001995154.1| GH22991 [Drosophila grimshawi]
gi|193899360|gb|EDV98226.1| GH22991 [Drosophila grimshawi]
Length = 1426
Score = 97.8 bits (242), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 107/435 (24%), Positives = 198/435 (45%), Gaps = 62/435 (14%)
Query: 654 IRSIPLGEHPRRICHQEQSRTFAICSLKNQSC-------AEESEM-------HFVR---- 695
+R +PL PR++ + ++R + + + K + E+ E+ F+
Sbjct: 997 VRKVPLRCTPRQLVYHRENRVYCLITQKEEPMTKYYRFNGEDKELSEECRGERFIYPIGS 1056
Query: 696 -----LLDDQTFEFI--STYPLDTFEY--GCSILSCSFSDDSN---VYYCVGTAYVLPEE 743
L+ +T+E + ++ + +E+ I+ S+ + Y C+GT + E
Sbjct: 1057 LFEMVLISPETWEIVPDASIQFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNY-SE 1115
Query: 744 NEPTKGRILVF-----IVEDGK------LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQ 792
+ ++G I ++ + E GK L+ + +KE KG V +++ G L+ + QKI
Sbjct: 1116 DITSRGNIHIYDIIEVVPEPGKPMTKFKLKEVFKKEQKGPVSAISDVVGFLVTGLGQKI- 1174
Query: 793 LYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEER 852
Y W LRD +L +I + T I + D+ KSISLL ++ E +
Sbjct: 1175 -YIWQLRDG---DLIGVAFIDTNIYVHQIITVKSLIFIADVYKSISLLRFQEEHRTLSLA 1230
Query: 853 ARDYNANWMSAVEILDDDIYLG-----AENNFNLFTVRKNSEGATDEERGRLEVVGEYHL 907
+RD+N + +E + D+ LG AE N ++ + + + + +L +YHL
Sbjct: 1231 SRDFNPMEVFGIEFMVDNSNLGFLVTDAERNLIVYMYQPEARESLGGQ--KLLRKADYHL 1288
Query: 908 GEFVNR-FR----HGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQT 962
G+ VN FR L R P + VI+G+++G +G LP + Y LQ
Sbjct: 1289 GQVVNTMFRVQCHQRGLHQRQPFLYENK-HLVIYGSLDGALGYCLPLPEKVYRRFLMLQN 1347
Query: 963 NLRKVIKGVGGLNHEQWRSFNNEKK--TVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV 1020
L + GLN +++R+ + KK ++ +DGDLI SF L+ + +E++K +
Sbjct: 1348 VLLSYQDHLCGLNPKEYRTIKSVKKLGINPSRCIIDGDLIWSFRMLAHSERNEVAKKIGT 1407
Query: 1021 SVEELCKRVEELTRL 1035
EE+ + E+ R+
Sbjct: 1408 RTEEILADLLEIERI 1422
Score = 56.2 bits (134), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 70/323 (21%), Positives = 129/323 (39%), Gaps = 77/323 (23%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
+YG + +L+ G +D L I+ + K VLQ DA++ L T ++ + R
Sbjct: 76 LYGNVMSLQSVSLAGAMRDALLISFKDAKLSVLQLDADTQTLKTLSLHYFEEDDIRGGWT 135
Query: 66 GQIGI----IDPDCRLIGLHLYDGLFKVIPFDNKGQLKE--------------------- 100
G+ + +DPD R + +Y V+PF L E
Sbjct: 136 GRYHVPVVRVDPDARCAIMLVYGKRLVVLPFRKDNSLDEIELADVKPIKKAPTAMVTRTP 195
Query: 101 ---AFNIRLEEL-----QVLDIKFLYGCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVE 152
++ I L +L VLDI+FL+G +PT+++LY+ V+T +K +
Sbjct: 196 IMASYLIALADLDEKLDNVLDIQFLHGYYEPTLLILYEP------VRTCAGRIKVR---- 245
Query: 153 GPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVY-------------CSANAFKAIPIR 199
+D P+ P+ G L++ ++Y SA+ A P++
Sbjct: 246 -----------SDTFFPIQKPIGGCLVMTVNAVIYLNQSVPPYGVSLNSSADNSTAFPLK 294
Query: 200 P------SITKA-YGRVDADGSRYLLGDHAGLLHLLVITHEKEK-VTGLKIELLGETSIA 251
P S+ A + +D D + ++ G L++L + + + V + +
Sbjct: 295 PQDNVRISLDCANFAFIDVD--KLVISLRTGDLYVLTLCVDSMRTVRNFHFHKAAASVLT 352
Query: 252 STISYLDNAVVYIGSSYGDSQLI 274
S I +++GS G+S L+
Sbjct: 353 SCICVCHTEYIFLGSRLGNSLLL 375
Score = 41.2 bits (95), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 43/160 (26%), Positives = 70/160 (43%), Gaps = 26/160 (16%)
Query: 288 EVLERYVNLGPIVDFCV---VDLERQGQG-------------QVVTCSGAYKDGSLRIVR 331
EV + +N+ PI C V+ E G ++V +G K+G+L +
Sbjct: 455 EVCDSLINVAPINYMCAGERVEFEEDGATLRPHADNLNDLKIELVAATGHSKNGALSVFV 514
Query: 332 NGIGINEQASVELQGIKGMWSL-----RSSTDDPFDT--FLVVSFISETRILAMNLEDEL 384
N I S EL+G +W++ R +T D F+++S S T +L E+
Sbjct: 515 NCINPQIITSFELEGCLDVWTVFDDATRKATTARQDQHDFMLLSQRSSTLVLQTG--QEI 572
Query: 385 EETEIEGFCSQTQTLFCHDAIYNQ-LVQVTSGSVRLVSST 423
E E GF T++ + + +VQVT+ VRL+ T
Sbjct: 573 NEIENTGFTVNQPTIYVGNLGQQRFIVQVTTRHVRLLQGT 612
>gi|296227035|ref|XP_002807684.1| PREDICTED: LOW QUALITY PROTEIN: cleavage and polyadenylation
specificity factor subunit 1 [Callithrix jacchus]
Length = 1394
Score = 97.8 bits (242), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 111/460 (24%), Positives = 190/460 (41%), Gaps = 62/460 (13%)
Query: 618 VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQS 672
V PF++ P L ++GEL I + +R IPL + + +S
Sbjct: 923 VDSFAPFHNINCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVES 982
Query: 673 RTFAICSLKNQSCAE-----------------------ESEMHFVRLLDDQTFEFI--ST 707
+ +A+ + N C + E ++L+ ++E I +
Sbjct: 983 KVYAVATSTNTPCTRIPRMTGEEKEFETIERDERYIHPQQEAFSIQLISPVSWEAIPNAR 1042
Query: 708 YPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVFIVED---- 758
L +E+ + + S + V Y GT + EE +GRIL+ V +
Sbjct: 1043 IELQEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVT-CRGRILIMDVIEVVTE 1101
Query: 759 -------GKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECG 811
K +++ EKE KG V +L NG L++AI QKI L W LR EL
Sbjct: 1102 PRQTLTXXKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFL--WSLR---ASELTGMAF 1156
Query: 812 HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI 871
+ + + +FI+ D+MKSISLL Y+ E + +RD + +V+ + D+
Sbjct: 1157 IDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNA 1216
Query: 872 YLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRF-----RHGSLVMRL 923
LG ++ + NL E RL ++H+G VN F R + +
Sbjct: 1217 QLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRGATEGLSK 1276
Query: 924 PDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFN 983
F T++G IG++ + + Y L LQ L ++ GLN +R +
Sbjct: 1277 KSVMWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLH 1336
Query: 984 NEKKTVD--AKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
+++T+ +N LDG+L+ +L LS E++K + +
Sbjct: 1337 VDRRTLQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTT 1376
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 106/527 (20%), Positives = 211/527 (40%), Gaps = 122/527 (23%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAM-----GDVSDRIG 60
+G + ++ + G +D L ++ + K V+++D + +L T ++ ++ D
Sbjct: 74 FFGNVMSMASVQLAGAKRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPELRDGFV 133
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPF-------DNKGQLKEA----------FN 103
+ ++ + DPD R + +Y V+PF +++G + E +
Sbjct: 134 QNVHTPRVRV-DPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGEGQRSSFLPSYIID 192
Query: 104 IR-LEE--LQVLDIKFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDFVEGP- 154
+R L+E L ++D++FL+G +PT+++L++ N+ A T + + +
Sbjct: 193 VRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKVH 252
Query: 155 ---WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAFKAIPI 198
WS +L + VP P+ GV+I +++Y + A P+
Sbjct: 253 PVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFPL 312
Query: 199 RPSITKAYGRVDADGS--------RYLLGDHAGLLHLL-VITHEKEKVTGLKIELLGE-- 247
R T+ R+ D + + ++ G +++L +IT V +
Sbjct: 313 R---TQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASV 369
Query: 248 --TSIASTISYL-------------DNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER 292
TS++ T +L D VY + +QL + EV +
Sbjct: 370 LTTSVSGTEGFLCAAGGKSVPQDEXDEIEVYGSETQSGTQLATYSF---------EVCDS 420
Query: 293 YVNLGPIVDFCVVD---LERQGQG------QVVTCSGAYKDGSLRIVRNGIGINEQASVE 343
+N+GP + + + L + Q ++V CSG K+G+L +++ I + E
Sbjct: 421 ILNIGPCANAAMGEPAFLSEEFQNSPEPDLEIVVCSGHGKNGALSVLQKSIRPQVVTTFE 480
Query: 344 LQGIKGMWSL-------------------RSSTDDPFDT-----FLVVSFISETRILAMN 379
L G MW++ ST + D FL++S T IL
Sbjct: 481 LPGCYDMWTVIAPVRKEEEENPKGEGTEQEPSTPEADDDSRRHGFLILSREDSTMILQTG 540
Query: 380 LEDELEETEIEGFCSQTQTLFCHDAIYNQ-LVQVTSGSVRLVSSTSR 425
E+ E + GF +Q T+F + N+ +VQV+ +RL+ ++
Sbjct: 541 --QEIMELDTSGFATQGPTVFAGNIGDNRYIVQVSPLGIRLLEGVNQ 585
>gi|380014171|ref|XP_003691113.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Apis florea]
Length = 1583
Score = 97.8 bits (242), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 127/516 (24%), Positives = 229/516 (44%), Gaps = 80/516 (15%)
Query: 575 TQPITLRTFSS-KNTTHVFAASDRPTVIYSSNKKLLYSNVNLKE--VSHMCPFNSAAFPD 631
T+ +R FS+ VF SD P I+ + + L ++ + V+ PFN+ P
Sbjct: 897 TRHCMMRYFSNIAGYNGVFICSDYPHWIFLTGRGELRTHPMGIDGPVTSFAPFNNINCPQ 956
Query: 632 S-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQSRTFAICS-----LK 681
L ++ EL I + +R +PL P + + +S+T+ + + LK
Sbjct: 957 GFLYFNRKEELRICVLPTHLSYDAPWPVRKVPLRCTPHFVTYHLESKTYCVITSIAEPLK 1016
Query: 682 N---------QSCAEESEMHFVRLLDDQ----TFEFIS--TYP-----LDTFEYGCSILS 721
+ + EE F+ +Q F +S T P LD +E+ + +
Sbjct: 1017 SYYRFNGEDKEFTEEERPDRFIYPSQEQFSIVLFSPVSWETIPNTKIELDQWEHVTCLKN 1076
Query: 722 CSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-IVE----------DGKLQLIA 765
S + + Y +GT Y E+ ++GRIL+F I+E + + I
Sbjct: 1077 VSLAYEGTRSGLKGYIVLGTNYNYGEDIT-SRGRILIFDIIEVVPEPGQPLTKNRFKQIY 1135
Query: 766 EKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDD---GTRELQSECGHHGHILALYVQ 822
KE KG + ++ +G L++A+ QKI Y W L+D+ G + ++ H +L++
Sbjct: 1136 AKEQKGPITAITQVSGFLVSAVGQKI--YIWQLKDNDLVGVAFIDTQIYIH-QMLSI--- 1189
Query: 823 TRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG-----AEN 877
I++ D+ KSISLL ++ E + +RD+ + +E L D+ LG E+
Sbjct: 1190 --KSLILIADVYKSISLLRFQEEYRTLSLVSRDFRPAEVYTIEYLIDNTNLGFLVADGES 1247
Query: 878 NFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPD--------SDVG 929
N LF + S + + +L ++HLG+ VN F + R+ D SD
Sbjct: 1248 NIALFMYQPESRESLGGQ--KLIRKADFHLGQKVNTFFR--IRCRISDPANDKKHFSDAD 1303
Query: 930 QIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTV 989
+ ++ +++G +G I +P + Y L LQ L I + GLN + +R++ + +T
Sbjct: 1304 KRHVTMYASLDGNLGYILPVPEKTYRRLLMLQNVLVTHICHIAGLNPKAYRTYKSHIRTQ 1363
Query: 990 --DAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVE 1023
A+ +DGDL+ +L L +++K + VE
Sbjct: 1364 GNPARGIIDGDLVWRYLYLPNNEKIDVAKKIAPIVE 1399
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 65/329 (19%), Positives = 139/329 (42%), Gaps = 61/329 (18%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRP--T 63
++G + +++ G +D L ++ K V+++D ++ +L T ++ + R T
Sbjct: 73 LHGNVMSMQAVTLVGSQRDSLLLSFRDAKLSVVEYDQDTHDLRTVSLHYFEEEEIRDGWT 132
Query: 64 DNGQIGII--DPDCRLIGLHLYDGLFKVIPF------------------DNKGQLKEAFN 103
++ I I+ DP+ R + +Y V+PF NK + ++
Sbjct: 133 NHHHIPIVRVDPEGRCAVMLIYGRKLVVLPFKKDPSLDDGDLLDNSKASSNKTPILSSYM 192
Query: 104 IRLEELQ-----VLDIKFLYGCAKPTIVVLYQDNKD------ARHVKTYEVALK---DKD 149
I L+ L+ ++D++FL+G +PT+++LY+ + R VA+ +
Sbjct: 193 IVLKCLEEKMDNIIDLQFLHGYYEPTLLILYEPVRTFSGRIAVRQDTCAMVAISLNIQQR 252
Query: 150 FVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAFKAI 196
WS +NL +PV PL G LI+ +++Y + A
Sbjct: 253 VHPIIWSVSNLPFDCYQAVPVKKPLGGTLIMAVNSLIYLNQSIPPYGVSLNSLAETSTNF 312
Query: 197 PIRPSITKAYGRVDADGS--------RYLLGDHAGLLHLL-VITHEKEKVTGLKIELLGE 247
P++P + ++ +GS R ++ +G L++L + V G +
Sbjct: 313 PLKP---QEGVKISLEGSQVAFISSDRLVISLKSGELYVLSLFADSMRSVRGFHFDKAAA 369
Query: 248 TSIASTISYLDNAVVYIGSSYGDSQLIKL 276
+ + S + ++ +++GS G+S L++
Sbjct: 370 SVLTSCVCMCEDNYLFLGSRLGNSLLLRF 398
Score = 45.4 bits (106), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 46/172 (26%), Positives = 78/172 (45%), Gaps = 14/172 (8%)
Query: 314 QVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP---------FDTF 364
++VT SG K+G+L ++++ I + EL G + MW++ + ++ F
Sbjct: 505 ELVTTSGYGKNGALCVLQHSIRPQVVTTFELPGCEDMWTVIGTLNNDEQIRPEAEGSHAF 564
Query: 365 LVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQ-LVQVTSGSVRLVSST 423
L++S T IL E+ E + GF +Q T+F + N+ +VQVT VRL+
Sbjct: 565 LILSQEDSTMILQTG--QEINEVDQSGFSTQGSTIFAGNLGANRYIVQVTQMGVRLLQGI 622
Query: 424 SRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQ 475
E G + A+ V L + G ++ L + +G T HAQ
Sbjct: 623 --EQIQHMPIDLGCPIVHASCADPYVTLLSEDGQVMLLTLREGRGTARLHAQ 672
>gi|195381337|ref|XP_002049409.1| GJ21566 [Drosophila virilis]
gi|194144206|gb|EDW60602.1| GJ21566 [Drosophila virilis]
Length = 1420
Score = 97.8 bits (242), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 108/435 (24%), Positives = 197/435 (45%), Gaps = 62/435 (14%)
Query: 654 IRSIPLGEHPRRICHQEQSRTFAICSLK-------------NQSCAEESE---------- 690
+R +PL PR++ + ++R + + + K ++ +EES
Sbjct: 991 VRKVPLRCTPRQLVYHRENRVYCLITQKEEPMTKYYRFNGEDKELSEESRGERFIYPIGS 1050
Query: 691 MHFVRLLDDQTFEFI--STYPLDTFEY--GCSILSCSFSDDSN---VYYCVGTAYVLPEE 743
+ + L+ +T+E + ++ + +E+ I+ S+ + Y C+GT + E
Sbjct: 1051 LFEMVLISPETWEIVPDASIQFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNY-SE 1109
Query: 744 NEPTKGRILVF-----IVEDGK------LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQ 792
+ ++G I ++ + E GK L+ + +KE KG V +++ G L+ + QKI
Sbjct: 1110 DITSRGNIHIYDIIEVVPEPGKPMTKFKLKEVFKKEQKGPVSAISDVVGFLVTGLGQKI- 1168
Query: 793 LYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEER 852
Y W LRD +L +I + T I + D+ KSISLL ++ E +
Sbjct: 1169 -YIWQLRDG---DLIGVAFIDTNIYVHQIITVKSLIFIADVYKSISLLRFQEEYRTLSLA 1224
Query: 853 ARDYNANWMSAVEILDDDIYLG-----AENNFNLFTVRKNSEGATDEERGRLEVVGEYHL 907
+RD+N + +E + D+ LG AE N ++ + + + ++ L +YHL
Sbjct: 1225 SRDFNPLEVFGIEFMVDNSNLGFLVTDAERNLIVYMYQPEARESLGGQK--LLRKADYHL 1282
Query: 908 GEFVNR-FR----HGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQT 962
G+ VN FR L R P + VI+GT++G +G LP + Y LQ
Sbjct: 1283 GQVVNTMFRVQCHQRGLHHRQPFLYENK-HLVIYGTLDGALGYCLPLPEKVYRRFLMLQN 1341
Query: 963 NLRKVIKGVGGLNHEQWRSFNNEKKT--VDAKNFLDGDLIESFLDLSRTRMDEISKTMNV 1020
L + GLN +++R+ KK ++ +DGDLI S+ L+ + E++K +
Sbjct: 1342 VLLSYQDHLCGLNPKEYRTIKTVKKMGINPSRCIIDGDLIWSYRMLAHSERSEVAKKIGT 1401
Query: 1021 SVEELCKRVEELTRL 1035
EE+ + E+ RL
Sbjct: 1402 RTEEILADMLEIERL 1416
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 69/323 (21%), Positives = 127/323 (39%), Gaps = 82/323 (25%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
+YG + +L+ G +D L I+ + K VLQ DA++ L T ++ + R
Sbjct: 76 LYGNVMSLQSVSLAGGMRDALLISFKDAKLSVLQLDADTQALKTLSLHYFEEEDIRGGWT 135
Query: 66 GQIGI----IDPDCRLIGLHLYDGLFKVIPFDNKGQLKE--------------------- 100
G+ + +DPD R + +Y V+PF L E
Sbjct: 136 GRYHVPVVRVDPDARCAVMLVYGKRLVVLPFRKDNSLDEIELADVKPIKKAPTAMVTRTP 195
Query: 101 ---AFNIRLEEL-----QVLDIKFLYGCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVE 152
++ I L +L VLDI+FL+G +PT+++LY+ V+T +K
Sbjct: 196 IMASYLIALADLDEKLDNVLDIQFLHGYYEPTLLILYEP------VRTCAGRIK------ 243
Query: 153 GPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVY-------------CSANAFKAIPIR 199
+ P+ P+ G L++ I+Y SA+ + P++
Sbjct: 244 --------------VFPIQKPIGGCLVMTVNAIIYLNQSVPPYGVSLNSSADNSTSFPLK 289
Query: 200 P------SITKA-YGRVDADGSRYLLGDHAGLLHLLVITHEKEK-VTGLKIELLGETSIA 251
P S+ A + +D D + ++ G L++L + + + V + +
Sbjct: 290 PQDNVRLSLDCANFAFIDVD--KLVISLRTGDLYVLTLCVDSMRTVRNFHFHKAAASVLT 347
Query: 252 STISYLDNAVVYIGSSYGDSQLI 274
S I +++GS G+S L+
Sbjct: 348 SCICVCHTEYIFLGSRLGNSLLL 370
>gi|224004656|ref|XP_002295979.1| spliceosome associated factor 3b, subunit 3; 130kD spliceosome
associated protein [Thalassiosira pseudonana CCMP1335]
gi|209586011|gb|ACI64696.1| spliceosome associated factor 3b, subunit 3; 130kD spliceosome
associated protein [Thalassiosira pseudonana CCMP1335]
Length = 1212
Score = 97.4 bits (241), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 104/410 (25%), Positives = 185/410 (45%), Gaps = 44/410 (10%)
Query: 295 NLGPIVDFCVVDLE-RQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKG-MWS 352
+L P++ V +L + Q+ G SLR++R+G+ + E A EL G+ G +++
Sbjct: 386 SLAPVISVLVGELAGNEVSPQLYALCGRGPTSSLRVLRHGLSVTELAVSELPGVPGAVFN 445
Query: 353 LR----SSTDDPFDTFLVVSFISETRILAMNLEDELEETEIE-GFCSQTQTLFCH----- 402
+R + +D ++VVSF T +L++ + +EE E GF + TL C
Sbjct: 446 VRDDQAAKNGKFYDRYIVVSFADATLVLSVG--ETVEEMGKESGFLTTEPTLACSALGNG 503
Query: 403 --DAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVY 460
+VQV G VR + R ++W P + A+AN SQ+L+A GG L+Y
Sbjct: 504 GNGEQEGGIVQVYPGGVRHIQ---RGSVSQWHVPGIKKIECASANESQILIALVGGELIY 560
Query: 461 LEIG--DGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNL 518
E+ G L E + ++ LD+ + + S S AAVG D +VR+ SL +L
Sbjct: 561 FELDPLSGNLMEAATKDVGADVCSLDVGAVPKGKSRSLFAAVG-CRDSTVRLLSLAPGSL 619
Query: 519 ITKEHLG--GEIIPRSVLL---CAFEGISYLLCALG--DGHLLNFLLNMKTGEL-TDRKK 570
+ ++ G P SV L +G + ++G DG L ++ TG + T +
Sbjct: 620 LEQKSSTTLGTTRPHSVALSNGVGSDGAGEITLSVGLDDGSALRAGVDPITGAMSTSPSR 679
Query: 571 VSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFP 630
LG +P+ + + + S RP + + + + ++ + H C F++ A
Sbjct: 680 RFLGARPVAVSRVMIEGSPSTLLLSSRPWIGRAGQGRHALAPMSYAPLDHGCSFSNEAVR 739
Query: 631 DS-----------LAIAKEG--ELTIGTIDDIQKLHIRSIPLGEHPRRIC 667
+ L++ + G ++ +G DD + + I L PR++C
Sbjct: 740 EGIVATSGSTLRILSVGESGGNDVVLGAEDD-EAFNSTRIGLRYTPRQMC 788
Score = 87.4 bits (215), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 75/300 (25%), Positives = 135/300 (45%), Gaps = 36/300 (12%)
Query: 751 ILVFIVEDGKLQLIAEKETK-GAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSE 809
+L +V +LQL+ G V SL F G+LL + + ++LY+ G R+L +
Sbjct: 924 VLYRVVNGERLQLLHRTTVDDGPVLSLVHFQGRLLVGVGKTVRLYEM-----GKRQLLKK 978
Query: 810 CGHHGH-ILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILD 868
C G + +Q GD VGD+M+S+ + Y + A+D N ++ E+LD
Sbjct: 979 CELRGMPTMVKTLQAAGDRAFVGDMMQSMQFIRYDSTANRLVLVAKDRNPRPITCQELLD 1038
Query: 869 -DDIYLGAE-NNFNLFTVRKNSEG--------------ATDEERGRLEVVGEYHLGEFVN 912
+ + +G + N + + + ++ A D+ +LE + YH+GE V
Sbjct: 1039 INTVAVGDKFGNVTILRLPRGADAGAIDVTGTRALWDSARDDATPKLETLCTYHVGEVVT 1098
Query: 913 RFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGV 971
SLV G ++I+ TV G +G + F L+ LR
Sbjct: 1099 SMTRASLV-------AGGAESLIYVTVTGRVGAFVPFTSRDDVEFYTSLEGFLRTETPRP 1151
Query: 972 GGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEE 1031
G + + +RS+ K+ +DGDL ++F L +I+++++ SV E+ K++E+
Sbjct: 1152 TGRDPQSYRSYY-----APMKHIVDGDLCDAFAQLPYETKQKIAESLDRSVGEVMKKLED 1206
>gi|307191845|gb|EFN75271.1| Cleavage and polyadenylation specificity factor subunit 1
[Harpegnathos saltator]
Length = 1214
Score = 97.4 bits (241), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 85/333 (25%), Positives = 160/333 (48%), Gaps = 42/333 (12%)
Query: 731 YYCVGTAYVLPEENEPTKGRILVF-IVE----------DGKLQLIAEKETKGAVYSLNAF 779
Y +GT Y E+ ++GRIL+F I+E + + I KE KG + ++
Sbjct: 890 YIVLGTNYNYGEDIT-SRGRILIFDIIEVVPEPGQPLTKNRFKQIYAKEQKGPITAITQV 948
Query: 780 NGKLLAAINQKIQLYKWMLRDD---GTRELQSECGHHGHILALYVQTRGDFIVVGDLMKS 836
+G L+ A+ QKI Y W L+D+ G + ++ H +L++ I++ D+ KS
Sbjct: 949 SGFLVTAVGQKI--YIWQLKDNDLVGIAFIDTQIYIH-QMLSI-----KSLILIADVYKS 1000
Query: 837 ISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG-----AENNFNLFTVRKNSEGA 891
ISLL ++ + + +RD+ + +E L D+ LG E+N LF + S +
Sbjct: 1001 ISLLRFQEKCRTLSLVSRDFRPAEVYTIEYLIDNTNLGFLIADGESNLALFMYQPESRES 1060
Query: 892 TDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPD--------SDVGQIPTVIFGTVNGVI 943
++ L ++HLG+ +N F + R+ D SD + ++ +++G +
Sbjct: 1061 LGGQK--LIRKADFHLGQKINTFFR--IKCRVTDVASDKKHFSDADKKHVTMYASLDGSL 1116
Query: 944 GVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNN--EKKTVDAKNFLDGDLIE 1001
G + +P + Y L LQ L I + GLN + +R++ + + A+ +DGDL+
Sbjct: 1117 GYVLPVPEKTYRRLLMLQNVLVTHICHIAGLNPKAYRTYKSYVRNQGNPARGIIDGDLVW 1176
Query: 1002 SFLDLSRTRMDEISKTMNVSVEELCKRVEELTR 1034
+L L +++K + V+E+ + + E+ R
Sbjct: 1177 RYLSLPNNEKADVAKKIGTRVQEIIEDITEIDR 1209
Score = 56.6 bits (135), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 86/417 (20%), Positives = 161/417 (38%), Gaps = 109/417 (26%)
Query: 111 VLDIKFLYGCAKPTIVVLYQDNKD------ARHVKTYEVALK---DKDFVEGPWSQNNLD 161
V+D++FL+G +PT+++LY+ + R VA+ + WS +NL
Sbjct: 4 VIDLQFLHGYYEPTLLILYEPVRTFSGRIAVRQDTCAMVAISLNIQQRVHPIIWSVSNLP 63
Query: 162 NGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAFKAIPIRPS----ITK 204
+PV PL G LI+ +++Y + A+ P+RP I+
Sbjct: 64 FDCYQAVPVKKPLGGTLIMAVNSLIYLNQSIPPYGVSLNSLADTSTNFPLRPQDGVKISL 123
Query: 205 AYGRVD-ADGSRYLLGDHAGLLHLL-VITHEKEKVTGLKIELLGETSIASTISYLDNAVV 262
+V R ++ +G L++L + V G + + + S + ++ +
Sbjct: 124 EGAQVAFLSADRLVISLKSGELYVLSLFADSMRSVRGFHFDKAAASVLTSCVCMCEDNYL 183
Query: 263 YIGSSYGDSQLIKL----------------------NLQPDAKG---------------- 284
++GS G+S L++ N +P AK
Sbjct: 184 FLGSRLGNSLLLRFTEKEPETIKSLDDGEINIEDNDNEEPPAKKAKQDFLGDWMASDVLD 243
Query: 285 ---------------------SYV-EVLERYVNLGPIVDFCVVDL---------ERQGQG 313
SY+ EV + +N+GP + + + +
Sbjct: 244 IKDPEELEVYGSETHTSIQITSYIFEVCDSLLNIGPCGNISMGEPAFLSEEFAHNQNPDV 303
Query: 314 QVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP---------FDTF 364
++VT SG K+G+L +++ I + EL G + MW++ S ++ F
Sbjct: 304 ELVTTSGYGKNGALCVLQRSIRPQVVTTFELPGCEDMWTVIGSLNNDEQVKSETEGSHAF 363
Query: 365 LVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQ-LVQVTSGSVRLV 420
L++S E + + E+ E + GF +Q T+F + N+ +VQVT VRL+
Sbjct: 364 LILS--QEDSTMVLQTGQEINEVDQSGFSTQGSTVFAGNLGANRYIVQVTQMGVRLL 418
>gi|195122290|ref|XP_002005645.1| GI18959 [Drosophila mojavensis]
gi|193910713|gb|EDW09580.1| GI18959 [Drosophila mojavensis]
Length = 1431
Score = 97.1 bits (240), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 107/435 (24%), Positives = 195/435 (44%), Gaps = 62/435 (14%)
Query: 654 IRSIPLGEHPRRICHQEQSRTFAICSLK-------------NQSCAEESE---------- 690
+R +PL PR++ + ++R + + + K ++ +EES
Sbjct: 1002 VRKVPLRCTPRQLVYHRENRVYCLITQKEEPMTKYYRFNGEDKELSEESRGERFIYPIGS 1061
Query: 691 MHFVRLLDDQTFEFI--STYPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEE 743
+ + L+ +T+E + ++ + +E+ + S + Y C+GT + E
Sbjct: 1062 LFEMVLISPETWEIVPDASIQFEPWEHVTAFKLVKLSYEGTRSGLKEYLCIGTNFNY-SE 1120
Query: 744 NEPTKGRILVF-----IVEDGK------LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQ 792
+ ++G I ++ + E GK L+ + +KE KG V +++ G L+ + QKI
Sbjct: 1121 DITSRGNIHIYDIIEVVPEPGKPMTKFKLKEVFKKEQKGPVSAISDVVGFLVTGLGQKI- 1179
Query: 793 LYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEER 852
Y W LRD +L +I + T I + D+ KSISLL ++ E +
Sbjct: 1180 -YIWQLRDG---DLIGVAFIDTNIYVHQIITVKSLIFIADVYKSISLLRFQEEYRTLSLA 1235
Query: 853 ARDYNANWMSAVEILDDDIYLG-----AENNFNLFTVRKNSEGATDEERGRLEVVGEYHL 907
+RD+N + +E + D+ LG AE N ++ + + + ++ L +YHL
Sbjct: 1236 SRDFNPLEVFGIEFMVDNSNLGFLVTDAERNIIVYMYQPEARESLGGQK--LLRKADYHL 1293
Query: 908 GEFVNR-FR----HGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQT 962
G+ VN FR L R P + VI+GT++G +G LP + Y LQ
Sbjct: 1294 GQVVNTMFRVQCHQRGLHQRQPFLYENK-HFVIYGTLDGALGYCLPLPEKVYRRFLMLQN 1352
Query: 963 NLRKVIKGVGGLNHEQWRSFNNEKKT--VDAKNFLDGDLIESFLDLSRTRMDEISKTMNV 1020
L + GLN +++R+ KK ++ +DGDLI S+ L+ + E++K +
Sbjct: 1353 VLLSYQDHLCGLNPKEYRTIKTVKKMGINPSRCIIDGDLIWSYRMLAHSERSEVAKKIGT 1412
Query: 1021 SVEELCKRVEELTRL 1035
EE+ + E+ RL
Sbjct: 1413 RTEEILADLLEIERL 1427
Score = 50.1 bits (118), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 67/323 (20%), Positives = 127/323 (39%), Gaps = 82/323 (25%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
+YG + +L+ G +D L ++ + K VLQ DA++ L T ++ + R
Sbjct: 76 LYGNVMSLQSVSLAGGMRDALLVSFKDAKLSVLQLDADTQTLKTLSLHYFEEDDIRGGWT 135
Query: 66 GQIGI----IDPDCRLIGLHLYDGLFKVIPFDNKGQLKE--------------------- 100
G+ + +DPD R + +Y V+PF L E
Sbjct: 136 GRYHVPVVRVDPDARCAIMLVYGKRLVVLPFRKDNSLDEIELADVKPIKKAPTALVSRTP 195
Query: 101 ---AFNIRLEEL-----QVLDIKFLYGCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVE 152
++ I L +L VLDI+FL+G +PT+++LY+ V+T +K
Sbjct: 196 IMASYLIALADLDEKLDNVLDIQFLHGYYEPTLLILYEP------VRTCAGRIK------ 243
Query: 153 GPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVY-------------CSANAFKAIPIR 199
+ P+ P+ G L++ ++Y SA+ + P++
Sbjct: 244 --------------VFPIQKPIGGCLVMTVNAVIYLNQSVPPYGVSLNSSADNSTSFPLK 289
Query: 200 P------SITKA-YGRVDADGSRYLLGDHAGLLHLLVITHEKEK-VTGLKIELLGETSIA 251
P S+ A + +D D + ++ G L++L + + + V + +
Sbjct: 290 PQDNVRLSLDCANFAFIDVD--KLVISLRTGDLYVLTLCVDSMRTVRNFHFHKAAASVLT 347
Query: 252 STISYLDNAVVYIGSSYGDSQLI 274
S I +++GS G+S L+
Sbjct: 348 SCICVCHTEYIFLGSRLGNSLLL 370
>gi|195455711|ref|XP_002074834.1| GK23274 [Drosophila willistoni]
gi|194170919|gb|EDW85820.1| GK23274 [Drosophila willistoni]
Length = 1463
Score = 96.7 bits (239), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 115/477 (24%), Positives = 212/477 (44%), Gaps = 67/477 (14%)
Query: 617 EVSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQ 671
EV FN+ P+ L EL I + +R +PL PR++ + +
Sbjct: 992 EVRSFAAFNNINIPNGFLFFDTTFELKISVLPSYLSYDSTWPVRKVPLRCTPRQLVYHRE 1051
Query: 672 SRTFAICSL-------------KNQSCAEESE-MHFVR---------LLDDQTFEFI--S 706
+R + + + +++ +EES F+ L+ +T+E + +
Sbjct: 1052 NRVYCLITQTEEPMTKFYRFNGEDKELSEESRGERFIYPIGSQFDMVLISPETWEIVPDA 1111
Query: 707 TYPLDTFEY--GCSILSCSFSDDSN---VYYCVGTAYVLPEENEPTKGRILVF-----IV 756
+ + +E+ I+ S+ + Y C+GT + E+ ++G I ++ +
Sbjct: 1112 SIRFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNY-SEDITSRGNIHIYDIIEVVP 1170
Query: 757 EDGK------LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSEC 810
E GK L+ + +KE KG V +++ G L+ + QKI Y W LRD +L
Sbjct: 1171 EPGKPMTKFKLKEVFKKEQKGPVSAISDVLGFLVTGLGQKI--YIWQLRDG---DLIGVA 1225
Query: 811 GHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDD 870
+I + T I + D+ KSISLL ++ E + +RD+N + +E + D+
Sbjct: 1226 FIDTNIYVHQIITVKSLIFIADVYKSISLLRFQEEYRTLSLASRDFNPLEVYGIEFMVDN 1285
Query: 871 IYLG-----AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR-FR----HGSLV 920
LG AE+N ++ + + + ++ L +YHLG+ VN FR L
Sbjct: 1286 TNLGFLVTDAESNLIVYMYQPEARESLGGQK--LLRKADYHLGQVVNTMFRVQCHQRGLH 1343
Query: 921 MRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWR 980
R P + V++GT++G +G LP + Y LQ L + GLN +++R
Sbjct: 1344 QRQPFLYENK-HFVVYGTLDGALGYCLPLPEKVYRRFLMLQNVLLSYQDHLCGLNPKEYR 1402
Query: 981 SFNNEKK--TVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRL 1035
+ + K+ ++ +DGDLI S+ L+ + +E++K + EE+ + E+ RL
Sbjct: 1403 TLKSSKRLGINPSRCIIDGDLIWSYRLLANSERNEVAKKIGTRTEEILADLLEIERL 1459
Score = 54.3 bits (129), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 72/334 (21%), Positives = 136/334 (40%), Gaps = 67/334 (20%)
Query: 6 IYGRIATLEL--FRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPT 63
+YG + +L+ G +D L ++ + K VLQ D ++ L T ++ + R
Sbjct: 76 LYGNVMSLQCVSLAGAGAMRDALLVSFKDAKLSVLQHDPDTYALKTLSLHYFEEEDIRGG 135
Query: 64 DNGQIGI----IDPDCRLIGLHLYDGLFKVIPFDNKGQLKE------------------- 100
G+ + +DPD R + +Y V+PF L E
Sbjct: 136 WTGRYYVPEVRVDPDARCAVMLIYGKRLVVLPFRKDNSLDEIELADVKPIKKAPTALVTR 195
Query: 101 -----AFNIRLEEL-----QVLDIKFLYGCAKPTIVVLYQDNKD-ARHVKTYE------- 142
++ I L +L VLDI+FL+G +PT+++LY+ + A +K
Sbjct: 196 TPIMASYLIALRDLDEKIDNVLDIQFLHGYYEPTLLILYEPVRTCAGRIKVRSDTCVLVA 255
Query: 143 VALKDKDFVEG-PWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVY-------------C 188
++L + V W+ NNL L+P+ P+ G L++ ++Y
Sbjct: 256 ISLNIQQRVHPIIWTVNNLPFDCLRLLPIQKPIGGCLVMTVNAVIYLNQSVPPYGVSLNS 315
Query: 189 SANAFKAIPIRP------SITKA-YGRVDADGSRYLLGDHAGLLHLLVITHEKEK-VTGL 240
SA+ + P++P S+ A + +D D + ++ G L++L + + + V
Sbjct: 316 SADNSTSFPLKPQDGVRISLDCANFAFIDVD--KLVVSLRTGDLYVLTLCVDSMRTVRNF 373
Query: 241 KIELLGETSIASTISYLDNAVVYIGSSYGDSQLI 274
+ + S I +++GS G+S L+
Sbjct: 374 HFHKAASSVLTSCICVCHMEYIFLGSRLGNSLLL 407
Score = 45.8 bits (107), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 44/159 (27%), Positives = 72/159 (45%), Gaps = 26/159 (16%)
Query: 288 EVLERYVNLGPIVDFCV---VDLERQGQG-------------QVVTCSGAYKDGSLRIVR 331
EV + +N+ PI C V+ E G ++V +G K+G+L +
Sbjct: 490 EVCDSLINVAPINYMCAGERVEFEEDGTTLRPHAESLQDVKIELVAATGHSKNGALSVFV 549
Query: 332 NGIGINEQASVELQGIKGMWSL------RSSTDDPFDTFLVVSFISETRILAMNLEDELE 385
N I S EL+G +W++ ++S D D F+++S + T L + E+
Sbjct: 550 NCINPQIITSFELEGCLDVWTVFDDATKKTSRQDQHD-FMLLSQKNST--LVLQTGQEIN 606
Query: 386 ETEIEGFCSQTQTLFCHDAIYNQ-LVQVTSGSVRLVSST 423
E E GF T+F + N+ +VQVT+ VRL+ T
Sbjct: 607 EIENTGFTVNQATIFVGNLGQNRFIVQVTTRHVRLLQGT 645
>gi|68476233|ref|XP_717766.1| potential spliceosomal U2 snRNP complex SF3b component [Candida
albicans SC5314]
gi|68476422|ref|XP_717672.1| potential spliceosomal U2 snRNP complex SF3b component [Candida
albicans SC5314]
gi|74586274|sp|Q5A7S5.1|RSE1_CANAL RecName: Full=Pre-mRNA-splicing factor RSE1
gi|46439394|gb|EAK98712.1| potential spliceosomal U2 snRNP complex SF3b component [Candida
albicans SC5314]
gi|46439495|gb|EAK98812.1| potential spliceosomal U2 snRNP complex SF3b component [Candida
albicans SC5314]
Length = 1219
Score = 96.7 bits (239), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 234/1145 (20%), Positives = 457/1145 (39%), Gaps = 185/1145 (16%)
Query: 24 DFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDNGQIGIIDPD--CRLIGLH 81
D + I ++ +LQ+D ++ + I++ ++ G+ IDP+ C L+
Sbjct: 118 DGVVITSDSGNLSILQYDNKTKKFISKIQEPMTKNGWGRNYVGENLAIDPENRCILVAAM 177
Query: 82 LYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKF----------LYGC-----AKPTIV 126
+ LF I ++ G + + + QVL +K L+G K I+
Sbjct: 178 EKNKLFYKIESNSSGSKELSSPLEAHSKQVLCLKIVALNTDHNNPLFGALELTPEKKCII 237
Query: 127 VLYQDNKDARHVKTYEVALKDKDFVEGPWSQNN--LDNGADLLIPVPPPLCGVLIIGEET 184
Y+ ++ HV +K K P S N+ L N + LIP+P + G+++ G
Sbjct: 238 NYYELDQGLNHV------VKKK-----PNSSNSDPLPNDVNYLIPLPGHIGGMVVCGTNW 286
Query: 185 IVYCSANA---FKAIPIRPSITKAYGRVD-----ADGSRY--LLGDHAGLLHLLVITHE- 233
Y + + +P R T+ V+ ++ LL + G L L + ++
Sbjct: 287 CFYDKLDGPRIYLPLPRRNGQTQDSIIVNHVTHVLKKKKFFILLQNALGDLFKLTVDYDF 346
Query: 234 -KEKVTGLKIELLGETSIASTISYLDNAVVY---------------IGSSYGDSQLI--- 274
KE + + I A +++ N ++ +G + +L+
Sbjct: 347 DKEIIKNISITYFDTIPPALSLNIFKNGFLFANVLNNDKLLYQFEKLGDDLTEGELVINS 406
Query: 275 ----KLNLQPDAKGSY----------VEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSG 320
LN ++ S+ ++VLE L PI D ++D ++VT S
Sbjct: 407 SDYESLNSVRESVTSFKLKGLDNLALIDVLE---TLSPITDSKIID------SKLVTLSS 457
Query: 321 AYKDGSLRIVRNGIGINEQASVELQ-GIKGMWSLRSSTDDPFDTFLVVSFISETRILAMN 379
++ + +G+ L +++ + S + D +LV+S ++ L ++
Sbjct: 458 H---SYVKSITHGVPTTTLVESPLPITPTDIFTTKLSLESANDEYLVISSSLSSKTLVLS 514
Query: 380 LEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRN-EWKSPPGYS 438
+ + +E+ E F T+ +VQ+ S ++ V + + + +W P G +
Sbjct: 515 IGEVVEDVEDSEFVLDQPTIAVQQVGIASVVQIYSNGIKHVRTVNGNKKTTDWFPPAGIT 574
Query: 439 VNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEI-SCLDINPIGENPSYSQI 497
+ AT N QVL+A +VY EI D ++ Q EI + + I EN S
Sbjct: 575 ITHATTNNQQVLIALSNLSVVYFEI-DATDDQLIEYQDRLEIATTITAMAIQENISEKSP 633
Query: 498 AAVGMWTDISVRIFSLPDLNLITKEHL----GGEIIPRSVLLCAFEGISYLLCALGDGHL 553
A+ +D ++++ SL + N + + L + + E +++ + +G
Sbjct: 634 FAIIGCSDETIQVVSLQEHNCLEIKSLQALSANSSSLKMLKSSGKE--THVHIGMENGVY 691
Query: 554 LNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTH-VFAASDRPTVIYSSNKKLLYSN 612
++ G L++ + +G++P++L N + A S P + Y +
Sbjct: 692 ARIKIDTINGNLSNSRVKYIGSKPVSLSVIKFSNEIEGILAISSAPWISYLYRDSFKITP 751
Query: 613 VNLKEVSHMCPFNSAAFP-DSLAIAKEGELTIGTI-------DDIQKLHIRSIPLGEHPR 664
+ ++++ F S + + K+ L I ++ D Q L I + L PR
Sbjct: 752 LLEIDITNGSSFISEDIGGEGIVGIKDNNLIIFSVGKEDSVFDPSQDLTIATTKLRYTPR 811
Query: 665 RICHQEQSRTFAI-----------CSLKNQSCAEESEMHF---------------VRLLD 698
++ +R F C++ E ++ ++++D
Sbjct: 812 KMI-TNGNRLFISESEYNVQGPFKCNINGDVKENVDEDYYEAFGYEWKQNSWASCIQVVD 870
Query: 699 DQTFEFISTYPLDTFEYGCSILSCSFSDDSN-----VYYCVGTAY---VLPEENEPTKGR 750
++ + I + LD E S+ + SF+ S + VG +LP N K
Sbjct: 871 SKSNQVIQSLQLDGNESIVSMSAVSFNKTSTPSVPASHLVVGVCTNQTILP--NSYDKSY 928
Query: 751 ILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSEC 810
+ F + LQL+ + E L F KLL A I+LY D G ++L +
Sbjct: 929 LYTFKIGKKHLQLVHKTELDHIPQVLENFQDKLLVASGNHIRLY-----DIGQKQLLKKS 983
Query: 811 ----GHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIE--ERARDYNANWMSAV 864
+I + QT + I++ D KS S++ K +E + A D ++++
Sbjct: 984 TTIIDFSTNINKIIPQT--NRIIICDSHKS-SIVFAKFDESQNQFVPFADDVMKRQITSI 1040
Query: 865 EILDDDIYLGAENNFNLFTVR----------------KNSEGATDEERGRLEVVGEYHLG 908
LD D +G + N+F R K +G + +L+ + E+H+G
Sbjct: 1041 MNLDIDTLIGGDKFGNIFVTRIDEDISKQADDDWTILKTQDGILNSCPYKLQNLIEFHIG 1100
Query: 909 EFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYL-FLEKLQTNLRKV 967
+ + F G L ++ +VI+ + G IG++ L + + L LQ +++
Sbjct: 1101 DIITSFNLGCL-------NLAGTESVIYTGLQGTIGLLIPLVSKSEVELLFNLQLYMQQS 1153
Query: 968 IKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCK 1027
+ G +H + RS+ N KN +DGDL+E FL+ + EIS+ +N SV ++ K
Sbjct: 1154 QNNLVGKDHLKLRSYYNP-----IKNVIDGDLLERFLEFDISLKIEISRKLNKSVNDIEK 1208
Query: 1028 RVEEL 1032
++ +L
Sbjct: 1209 KLIDL 1213
>gi|189208368|ref|XP_001940517.1| pre-mRNA-splicing factor rse1 [Pyrenophora tritici-repentis
Pt-1C-BFP]
gi|187976610|gb|EDU43236.1| pre-mRNA-splicing factor rse1 [Pyrenophora tritici-repentis
Pt-1C-BFP]
Length = 1247
Score = 96.3 bits (238), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 138/614 (22%), Positives = 238/614 (38%), Gaps = 80/614 (13%)
Query: 155 WSQNNLDNGADLLIPVP---PPLCGVLIIGEETIVY-----CSANAFK-AIPIRPSITKA 205
WS+ +D A+ L VP GVL GE++I Y +N + AIP R T+
Sbjct: 224 WSEP-VDRTANTLFRVPGGPNAPSGVLCCGEDSITYRRIFNNKSNVHRLAIPRREGATED 282
Query: 206 YGR-----------VDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTI 254
R + YLL G + L + V +KI+ I ++I
Sbjct: 283 PNRKRMIIAGTLYSLKGGDFFYLLQTEDGDVFKLTVDAPSGTVEKIKIKYFDTIPIVTSI 342
Query: 255 SYLDNAVVYIGSSYGDSQLIKLN----------------------------LQPDAKGSY 286
L VY GD L +L +P A ++
Sbjct: 343 CILRAGFVYAACESGDRILYELESLGDETDDPVFESDQFPVDPDTSFAPPFFKPRALVNF 402
Query: 287 VEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVEL-Q 345
V E +L PI+ V + + QV T +G S R RN + + + L Q
Sbjct: 403 TPV-ESMPSLNPIMGMEVANPALEDAPQVYTINGTSGRSSFRTTRNALEVLDLIESPLPQ 461
Query: 346 GIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAI 405
+W+ + ++ D DT +V+ +R L + + D++EE GF T TL
Sbjct: 462 NASDVWTTKLTSGDEADTLIVLCL--HSRTLVLKIGDDVEEASNTGFLPDTNTLGVQQFG 519
Query: 406 YNQLVQVTSGSVRLV---------SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGG 456
+ ++Q+ +R + + + +W P ++ N QV +A G
Sbjct: 520 EDCIIQIHPKGIRHIQGIQFPNDDADATHANLMDWHPPAHRTIVACATNNRQVAIALSSG 579
Query: 457 HLVYLEI-GDGILTEVKHA-QLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL- 513
++Y E DG L + L+ I+CL + + E + AVG +D +VRIF+L
Sbjct: 580 QILYFECDSDGSLAMAEEEISLDSTINCLAMPDVPEGSVRAFFLAVGC-SDQTVRIFNLS 638
Query: 514 PDL--NLITKEHLGGEIIPRSVLLCAF------EGIS-YLLCALGDGHLLNFLLNMKTGE 564
PD+ N++ + P S L F G S +L L G + +L+ TG+
Sbjct: 639 PDMDGNILRSISVQALTSPPSDLTINFMKDRSPRGESQFLHIGLRSGVYIRSVLDEMTGD 698
Query: 565 LTDRKKVSLGTQPITLRTFSS-KNTTHVFAASDRPTVIYSSNKK--LLYSNVNLKEVSHM 621
+ D ++ LG +PI + + + A + RP + Y+ + L + +N
Sbjct: 699 IGDTRRRFLGPEPIKFAKVNDVAGSPGILAMTSRPWLGYTHPRTGVLQLTPLNYIPFKSA 758
Query: 622 CPFNSAAFPDSLAIAKEGELTIGTIDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAICSL 680
F+ + F + ++ EL I T +D+ +I L PR++ + F +
Sbjct: 759 WNFDGSQFKGIICVSTN-ELRIFTFNDLTDNTTYENISLKYTPRKMVGYHEQGVFYVIQS 817
Query: 681 KNQSCAEESEMHFV 694
N + + + +
Sbjct: 818 DNNTISADRRQQLI 831
Score = 68.2 bits (165), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 71/287 (24%), Positives = 130/287 (45%), Gaps = 33/287 (11%)
Query: 755 IVEDGK-LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTREL--QSECG 811
I DG+ ++L E +L AF GKL+A + + + LY D G + L +++
Sbjct: 954 ISPDGREIELFNETAVSEPPLALLAFKGKLIAGVGRHLCLY-----DCGMKSLLRKAQAP 1008
Query: 812 HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHE--EGAIEERARDYNANWMSAVEILDD 869
+ + ++T+G +VV D +S++ ++K + + D A SA E+LD
Sbjct: 1009 NSVPTRIVDIKTQGSRLVVSDQAQSVTYFVHKDQVHPNRLIPFVDDSVARHTSASEMLDY 1068
Query: 870 DIYLGAENNFNLFTVR---KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDS 926
D +G + N++ VR + SE + + G +V + +LG NR L+ +
Sbjct: 1069 DTTIGGDKFGNIWLVRCPKEISESSDESPDGSDLLVDKSYLGGTPNRL---DLIAHYFTN 1125
Query: 927 DV-----------GQIPTVIFGTVNGVIGVIASLPHEQ-YLFLEKLQTNLRKVIKGVGGL 974
D+ G + + + G +G + + + ++L+ LR K + G
Sbjct: 1126 DIPVSIQKANLISGGDKVIFWAGLQGTLGALIPFTSRRTHKLFQQLELTLRSEDKPLSGR 1185
Query: 975 NHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
+H +RS+ K+V +DGDLIE +L LS + + I+ M S
Sbjct: 1186 DHLTYRSYYAPVKSV-----IDGDLIERYLVLSSDKRESIAGQMTGS 1227
>gi|159486547|ref|XP_001701300.1| nuclear pre-mRNA splicing factor, component of splicing factor 3b
[Chlamydomonas reinhardtii]
gi|158271783|gb|EDO97595.1| nuclear pre-mRNA splicing factor, component of splicing factor 3b
[Chlamydomonas reinhardtii]
Length = 1078
Score = 96.3 bits (238), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 119/518 (22%), Positives = 209/518 (40%), Gaps = 89/518 (17%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
++G+I +L R G D L I ++ + +++++ + + + + + R G R
Sbjct: 57 VFGQIRSLAAVRLTGSTTDHLAIGSDSGRIVLIKFNKDKNVWV-KVHQETFGRSGCRRIV 115
Query: 65 NGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLY---- 118
GQ +DP R ++G V+ D L + LE + I F
Sbjct: 116 PGQFLAVDPKGRACMVGAVEKQKFVYVLNRDAAANL--TISSPLEAHKSHHITFSICGMD 173
Query: 119 -GCAKPTIVVLYQDNKDA-------------RHVKTYEVALKDKDFVEGPWSQNNLDNGA 164
G P + D +A +H+ YE+ L + V W++ +DNGA
Sbjct: 174 CGFDNPIFAAIELDYSEADQDPTGEAASLAQKHLTFYEMDLGLNNVVR-KWTEP-VDNGA 231
Query: 165 DLLIPVPPPL---CGVL--------------IIGEETIVYCSANAFKAIPIRPSITKAYG 207
+LLI VP GVL + G+ ++ S K KAY
Sbjct: 232 NLLIAVPGGADGPGGVLDHEEVRAVIPRRSDLPGDRGVLIVSYATHK--------KKAY- 282
Query: 208 RVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSS 267
S +L+ G ++ + +++E E VT +K++ + ++I+ L ++ S
Sbjct: 283 ------SFFLVQSEYGDIYKVTLSYEGEAVTEVKVKYFDTIPVTASIAVLKTGFLFAASE 336
Query: 268 YGDSQLIKL----NLQPDAKGSYVEV----------------------LERYVNLGPIVD 301
YG+ L + D + S ++ ++ +L PI D
Sbjct: 337 YGNHALYQFVGTGEDDDDVESSSAQLVATEEGFQPVFFEPRPLKNLLLIDEMASLMPITD 396
Query: 302 FCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIK-GMWSLRSSTDDP 360
V +L + Q+ G SL ++R G+ + E A L G +W++R + D
Sbjct: 397 MKVANLLGEEIPQIYALCGRGPRASLSVLRPGLAVTELAVSPLPGAPTAVWTVRRAASDE 456
Query: 361 FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLV 420
+D ++VVSF + T L ++ +E++ET GF TL N ++QV G +R +
Sbjct: 457 YDAYIVVSFANAT--LVFSIGEEVKETNESGFLGTVPTLHTQLLADNSMLQVYPGGLRHI 514
Query: 421 SSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHL 458
R NEWK P + A +N QV +A GG +
Sbjct: 515 RPDRR--INEWKVPGRRVIKAAASNDKQVAIALQGGEV 550
Score = 67.0 bits (162), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 70/303 (23%), Positives = 131/303 (43%), Gaps = 38/303 (12%)
Query: 751 ILVFIVEDG--KLQLIAEKETKGAVY-SLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQ 807
I V+ + DG +L L+ + + G V +L F G+LLA + ++LY D G +++
Sbjct: 788 IRVYRLGDGGRRLDLLHKTQVDGGVPGALAGFKGRLLAGVGPTLRLY-----DMGKKKML 842
Query: 808 SECGHHG------HILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWM 861
+C ++ H+ R +S+ ++ YK + A A D ++
Sbjct: 843 RKCEYNRWTNIFLHVFFYRPYFRSS-------QESVHMMRYKKADNAFYIFADDVAPRYL 895
Query: 862 SAVEILDDDIYLGAENNFNLFTVRKNSEGAT----DEERGRLEVV-----GEYHLGEFVN 912
SA+ LD D + NL +R E + D G++ G H E +
Sbjct: 896 SALLPLDYDTIATGDKFGNLVILRLPQEASQQVEDDPTGGKMAAASGKLNGAPHKLEELV 955
Query: 913 RFRHGSLVMRLPDSDV--GQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIK 969
+F G + L +++ G +++ TV G IGV+ E F L+ +LR+
Sbjct: 956 KFHVGDTITALQRAEMQAGGQEVLVYSTVMGAIGVVYPFTSREDVDFFSHLEMHLRQENP 1015
Query: 970 GVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRV 1029
+ G +H +RS +N +DGDL + + + I++ M+ + E+ K++
Sbjct: 1016 PLAGRDHLAYRS-----AYFPVRNCVDGDLCSQYASIPMKKQQMIAEAMDRTTGEMLKKL 1070
Query: 1030 EEL 1032
E++
Sbjct: 1071 EDI 1073
>gi|449661926|ref|XP_002167992.2| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Hydra magnipapillata]
Length = 1122
Score = 96.3 bits (238), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 119/564 (21%), Positives = 244/564 (43%), Gaps = 111/564 (19%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAM---------GDVS 56
++G I +++ R +D L +A + K ++++D S +L T +M G +S
Sbjct: 76 LFGNIINMQVVRLGSNVRDSLLLAFKHAKLSIVEFDPLSHDLKTDSMHYFENDEFKGGLS 135
Query: 57 DRIGRPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEA--------------- 101
I P + +DP+ R + +Y+ V+PF + +L E+
Sbjct: 136 HNIYLP-----LVRVDPEQRCACMLIYNRHLVVLPFKHDIKLDESEELSDGEHIKSVLPS 190
Query: 102 FNIRLEELQ-----VLDIKFLYGCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGP-- 154
+ I L L+ + +++FL+G +PT++ L++ + + T VA++ F
Sbjct: 191 YMIDLHSLEQPLLNITELQFLHGYHQPTLMFLFEPVQTS----TGRVAVRQDTFCVSAIS 246
Query: 155 -----------WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVY---------CSANAFK 194
WS NL +L P+ P+ GVL+ +++Y S N+
Sbjct: 247 LNMTEKVHPVIWSVTNLPFDCHMLRPIEKPIGGVLVFASNSLIYLNQSIPPYGVSLNSIT 306
Query: 195 ----AIPIRPS----ITKAYGRVDADGS-RYLLGDHAGLLHLL-VITHEKEKVTGLKIEL 244
P++ IT A DA + +++L G +++L +++ V E
Sbjct: 307 EGSTMFPLKIQEDVVITLAESSCDAIATDQFILSLKGGEIYVLSLLSDGLRTVRSFHFEK 366
Query: 245 LGETSIASTISYLDNAVVYIGSSYGDSQLIKL------NLQPDAKGSYVEVL-------- 290
+ +AS + ++++ V++GS G+S L++ ++ +K + VE +
Sbjct: 367 AAGSVLASCVCWIEHGFVFLGSRLGNSLLLRYTEKDSASIAEKSKEAKVEKMYGGGVGGG 426
Query: 291 ----ERYVNLGPIVDFCVVDL---------ERQGQGQVVTCSGAYKDGSLRIVRNGIGIN 337
+ +N+GPI + + RQ ++V CSG K+G+L +++ I
Sbjct: 427 IIVCDSLLNIGPITKAALGEPAFLSEEFFGSRQIDLEMVCCSGYGKNGTLTVLQRSIRPQ 486
Query: 338 EQASVELQGIKGMWSL--RSSTD--DPFDTFLVVSFISETRILAMNLEDELEETEIEGFC 393
+ EL G MW++ +SS + + + ++L++S T +L E+ E + GF
Sbjct: 487 VVTTFELPGCVNMWTVCGKSSKESVENYHSYLILSRDDSTMVLKTGA--EITELDNSGFN 544
Query: 394 SQTQTLFCHDAIYNQ-LVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLA 452
Q T+F + + N+ ++QV S+ L+ T + N + + + V++
Sbjct: 545 VQQPTIFACNHLSNKYILQVCPQSIHLLEDTVQ--INSISLQDTIKITQCSISDPYVVMV 602
Query: 453 TGGGHLVYLEI-----GDGILTEV 471
G L+YL++ G+G ++ +
Sbjct: 603 DSTGQLIYLQVIEDSEGNGKISSI 626
Score = 76.3 bits (186), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 80/324 (24%), Positives = 144/324 (44%), Gaps = 28/324 (8%)
Query: 731 YYCVGTAYVLPEENEPTKGRILVF-----IVEDG------KLQLIAEKETKGAVYSLNAF 779
Y VGT + E+ KGRIL+F + E G K + + +KE KG V ++ A
Sbjct: 802 YLVVGTTFNYGED-LACKGRILIFDVLEVVPEPGQPLTKTKCKCVYDKEQKGPVTAICAT 860
Query: 780 NGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISL 839
+G ++AA+ QKI +K+ +D+ +L + + + + IV D+ +SISL
Sbjct: 861 SGYIIAAVGQKIYAFKY--KDN---DLVGVAFVDSQVFTVNLMAIRNVIVAADISRSISL 915
Query: 840 LIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG-----AENNFNLFTVRKNSEGATDE 894
+ ++ E ++ +RD E D +G AE N +F+ + E
Sbjct: 916 VRFQVEHKSLALVSRDTKTLEAYTSEFFIDGSQVGFVVSDAERNIVIFSYQ--PEALESF 973
Query: 895 ERGRLEVVGEYHLGEFVNRFRHGSLVM--RLPDSDVGQIPTVIFGTVNGVIGVIASLPHE 952
RL + ++G VN L+ + Q +I T++G IG++ L +
Sbjct: 974 GGHRLLQKADINIGSHVNTMMRIKLIQDEQSLSKSSEQRQLIILPTLDGSIGILFPLSEK 1033
Query: 953 QYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVD--AKNFLDGDLIESFLDLSRTR 1010
+ L LQ L + GLN +R+ + +T+ +N LDG L++ + LS
Sbjct: 1034 PFRRLTMLQNKLVDCLPHKAGLNPRAFRALDVPLRTLTNPHRNILDGQLLDKYAQLSFQE 1093
Query: 1011 MDEISKTMNVSVEELCKRVEELTR 1034
+I+K M + ++ + ++ R
Sbjct: 1094 RFDIAKKMGTTSGQILDDMMDIER 1117
>gi|358372791|dbj|GAA89393.1| cleavage and polyadenylation specificity factor subunit A
[Aspergillus kawachii IFO 4308]
Length = 1372
Score = 96.3 bits (238), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 97/429 (22%), Positives = 179/429 (41%), Gaps = 52/429 (12%)
Query: 654 IRSIPLGEHPRRICHQEQSRTFAI--CSLKNQSCAEESEMH-----------------FV 694
++ + LGE + + S + + C + ++ E+H F+
Sbjct: 940 LKKVHLGEQVDHLAYSTSSGMYVLGTCHATDFKLPDDDELHPEWRNEAISFFPSARGSFI 999
Query: 695 RLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVY-----YCVGTAYVLPEENEPTKG 749
+L+ T+ I +Y L T EY +I + S N + VGTA+ E+ P++G
Sbjct: 1000 KLVSPNTWSIIDSYSLGTDEYVMAIKNISLEISENTHERKDLIVVGTAFAR-GEDIPSRG 1058
Query: 750 RILVFIV-----------EDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWML 798
I VF V D KL+LI ++ KGAV +L+ G+ + Q + L
Sbjct: 1059 CIYVFEVVQVVPDPDDPETDRKLKLIGKESVKGAVTALSEIGGQGFVLVAQGQKCMVRGL 1118
Query: 799 RDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNA 858
++DG+ + ++ + ++GD +K I Y E + A+D +
Sbjct: 1119 KEDGSLLPVAFMDMQCYVSVVKELKGTGMCILGDAVKGIWFAGYSEEPYKMSLFAKDLDY 1178
Query: 859 NWMSAVEILDDDIYLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRF- 914
+SA E L D L A+++ N+ ++ + E +L ++H G F +
Sbjct: 1179 LEVSAAEFLPDGRRLFIVVADSDCNIHVLQYDPEDPKSSNGDKLLSRSKFHTGNFASTLT 1238
Query: 915 -------RHGSLVMRLPDSDV---GQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNL 964
++ D D+ + V+ T NG +G+I +P E Y L LQ+ L
Sbjct: 1239 LLPRTMVSSEKMISNSDDMDIDNQSALHQVLMTTQNGSLGLITCMPEESYRRLSALQSQL 1298
Query: 965 RKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEE 1024
++ GLN +R+ E + LDG+L+ ++D+S+ R EI+ + E
Sbjct: 1299 TNTLEHPCGLNPRAFRAV--ESDGTAGRGMLDGNLLFKWIDMSKQRKTEIAGRVGAREWE 1356
Query: 1025 LCKRVEELT 1033
+ +E ++
Sbjct: 1357 IKADLEAIS 1365
>gi|328771718|gb|EGF81757.1| hypothetical protein BATDEDRAFT_34564 [Batrachochytrium dendrobatidis
JAM81]
Length = 1248
Score = 95.9 bits (237), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 161/664 (24%), Positives = 273/664 (41%), Gaps = 79/664 (11%)
Query: 295 NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGI-NEQASVELQGIKGMWSL 353
N GP +DFC+ D + +GA G ++ +R+G+G+ + + + G +W L
Sbjct: 439 NAGPTLDFCLADTLGNNTDTLYMTAGAAPLGYIQEIRHGVGVLIDDTTKQFDGAIKLWGL 498
Query: 354 RSSTDDPFDTFLVVSFISETRILAMNLEDELEE-TEIEGFCSQTQTL---FCHDAIYNQL 409
R+S +D D+ LV SF++ TRI+ M +DE E+ ++I GF TL C + Y
Sbjct: 499 RTSCEDTVDSLLVASFVASTRIMYMQ-DDEFEDISDISGFTIDVATLNTAACFVSGY--F 555
Query: 410 VQVTSGSVRLVSSTSRELRN------EWKSPPGYSVNVATANASQVLLA-TGGGHLVYLE 462
+QV + + + N W P + V + VLL TG L+ L+
Sbjct: 556 IQVHLHGIIIARPLLDKTINNNTCTANWSPPNDHKVGFSAFYQDCVLLTLTGENSLMLLK 615
Query: 463 IGDGILTEVKHAQLEYEIS-CLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITK 521
+ I+ E + Q+E S LDI P + S T I+V P I K
Sbjct: 616 V---IIQENSNVQIESISSISLDIEPSSQYYS----------TPINV-----PHSICIIK 657
Query: 522 EHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLR 581
EI RS L + +LL + DG LL+F L+ + ++ V L L
Sbjct: 658 ---SKEISSRSNALISKP---HLLIGIRDGALLDFTLDFENDQINFMPPVILQLGDCPLD 711
Query: 582 TFSSKNT----THVFAASDRP-TVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDS-LAI 635
S +T +V +++ + S + S + K + H F A D+
Sbjct: 712 LVYSHHTDPLDRYVLGVTNQTWQLSVSPIGGIRISQLLHKPILHAARFLYDADVDTGFLF 771
Query: 636 AKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVR 695
L+ +D HIR+I +G+ PRRI ++ + +Q A+E + V+
Sbjct: 772 LTSDTLSFVKVDPESGYHIRNINMGDTPRRILVDPVTKLLVVAG-SSQGDADEI-LTTVK 829
Query: 696 LLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFI 755
+++ T + +T L E S++ + Y C+GT T GR+LVF
Sbjct: 830 VMNPDTGQVYATERLAANETIHSLIVWHVK-PTKRYICMGTRI------HATSGRVLVFG 882
Query: 756 VEDG------KLQLIAEKETKGAVYSLNAF-NGKLLAAINQ-----KIQLYKWMLRDDGT 803
++ K L+ + G V +L F N LLA+ KI+ + +
Sbjct: 883 LKPATKNKHIKFTLMGQYTLNGPVLALCTFVNSYLLASAGSTLYQLKIEAVHRTITAGAS 942
Query: 804 RELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSA 863
++ S I AL Q I + + SIS+ + A D + S
Sbjct: 943 IDINSII---TRIHALKTQ-----IFIANTQDSISVYKFDIATKAFAFIKSDVTSRVGSE 994
Query: 864 VEILDDDIYLGAENNFNLFTVRKN----SEGATDEERGRLEVVGEYHLGEFVNRFRHGSL 919
LDD + +G + + N++ + N S+ + ++ E+H+ + V + + GS+
Sbjct: 995 CFPLDDSLVIGTDRHGNIYGLDTNQGEDSDTHESTDSQSMQTGFEFHILDIVLQLKPGSM 1054
Query: 920 VMRL 923
RL
Sbjct: 1055 KHRL 1058
>gi|49619065|gb|AAT68117.1| cleavage and polyadenylation specific factor 1 [Danio rerio]
Length = 1105
Score = 95.9 bits (237), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 112/475 (23%), Positives = 206/475 (43%), Gaps = 64/475 (13%)
Query: 618 VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQS 672
+ PF++ P L K+GEL I + +R IPL + + +S
Sbjct: 634 IESFSPFHNINCPKGFLYFNKQGELRISVLPTYLSYDAPWPVRKIPLRCTVHYVSYHVES 693
Query: 673 RTFAICSLKNQSC-------AEESEMHFV----------------RLLDDQTFEFI--ST 707
+ +A+C+ + C EE E + +L+ ++E I +
Sbjct: 694 KVYAVCTSVKEPCTRIPRMTGEEKEFETIERDERYIHPQQDKFSIQLISPVSWEAIPNTR 753
Query: 708 YPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-----IVE 757
L+ +E+ + + + V Y +GT + EE +GRIL+ + E
Sbjct: 754 VDLEEWEHVTCMKTVALKSQETVSGLKGYVALGTCLMQGEEVT-CRGRILILDVIEVVPE 812
Query: 758 DG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRD-DGTRELQSEC 810
G K +++ EKE KG V +L +G L++AI QKI L+ D G + ++
Sbjct: 813 PGQPLTKNKFKVLYEKEQKGPVTALCHCSGFLVSAIGQKIFLWSLKYNDLTGMAFIDTQL 872
Query: 811 GHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDD 870
+I +Y + +FI+ D+MKSISLL Y+ E + +RD + ++E + D+
Sbjct: 873 ----YIHQMY--SIKNFILAADVMKSISLLRYQPESKTLSLVSRDAKPLEVYSIEFMVDN 926
Query: 871 IYLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSD 927
LG ++ + NL E RL ++++G VN F L ++
Sbjct: 927 NQLGFLVSDRDKNLMVYMYLPEAKESFGGMRLLRRADFNVGSHVNAFWRMPCRGTLDTAN 986
Query: 928 VGQIP-----TVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSF 982
+ F T++G +G++ + + Y L LQ L ++ GLN + +R
Sbjct: 987 KKALTWDNKHITWFATLDGGVGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPKAFRML 1046
Query: 983 NNEKKTVD--AKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRL 1035
+ +++T+ KN LDG+L+ +L LS E++K + + + + + E+ R+
Sbjct: 1047 HCDRRTLQNAVKNILDGELLNKYLYLSTMERSELAKKIGTTPDIILDDLLEIERV 1101
>gi|342181720|emb|CCC91200.1| putative damage-specific DNA binding protein [Trypanosoma congolense
IL3000]
Length = 1274
Score = 95.5 bits (236), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 178/826 (21%), Positives = 324/826 (39%), Gaps = 131/826 (15%)
Query: 295 NLGPIVDFCV-VDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASV-ELQGIKGMWS 352
N GP+ D V +D R G V +G +G L VR + ++ + + Q ++ +
Sbjct: 426 NCGPVFDITVALDGPRTG---VFAGTGVDHNGGLSFVRAAVSVSRDVCISDFQDVRCVCV 482
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDE--LEETEIEGFCSQTQTLF-CHDAIYNQL 409
S D ++ SF +RI + + LEE F + +TL + +
Sbjct: 483 ---SHD-----IIIFSFPGYSRICRYCVGETTVLEELISSTFDTSRETLLLTYSDEHEAF 534
Query: 410 VQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILT 469
+QVT+ VR+V E G + A AN ++ + E+ +
Sbjct: 535 LQVTTAGVRIVKPEEGSYLFECA---GNGIEHAHANEGLIVFSNPA------ELNAFSVR 585
Query: 470 EVKH-AQLE--YEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGG 526
E H A L +E SCL + S+ G W +V +++L D + K
Sbjct: 586 EAAHIANLHQVHETSCLWVL---SGSSF----LTGEWGTCAVNLYALVDRAVQHKACFNC 638
Query: 527 EIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSK 586
P S+ + A + LL L +G++++ ++ + G T + + QP+ L S
Sbjct: 639 SATPCSMCVVARPEGNRLLVGLLNGYVVDVPVD-EVGSSTTVTETVVTMQPVRLFNLESH 697
Query: 587 NTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVS--HMCPFNSAAFPDSLAIAKEGELTIG 644
N V + P ++ + + + V+L +V+ + P ++ L E L G
Sbjct: 698 NA--VLCLGEVPLILIVTETRFQLTGVDLNDVALGAVIPNPNSPLRYILFSKTERALIFG 755
Query: 645 TIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAI-----------CSLKNQSCAEESEMH- 692
I ++QKL+I I L R+ S F + K ++ +S +
Sbjct: 756 NISNVQKLNIDFIGLKATVTRLKFMPWSGIFVVSIRRTDKDQLLAVTKQETLCPDSLLEN 815
Query: 693 --FVRLLDDQTFEFISTYPLD-TFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKG 749
+ LL+++ FI + L + E+ SC + + + +GT +V P E
Sbjct: 816 PTSLELLENERCVFIESTVLGGSNEWS----SCGDAGNKDSVVLIGTTFVFPHEQLSRSS 871
Query: 750 RILVFIVEDG-------KLQLIAEKETKGAVY---SLNAFNGKLLAAINQKIQLYKWMLR 799
R VE G L+L + +GA+ S+ + G++ I+ + L+ W
Sbjct: 872 RFTWCTVEAGGVAPERATLRLQGRTDVEGALQCCCSVPNYAGRIALGISGCVALFSWNPA 931
Query: 800 DDGTRELQSECGHHGHILALYVQT-RGD--FIVVGDLMKSISLLIYKHEEGAIEERARDY 856
D + + E G I+ V T +GD +IV D+ S + +G++ ARD
Sbjct: 932 D--STFVPEETIRVGTIVTRIVPTVQGDTSYIVACDVRHSCFFIRVDTIQGSLSIAARDP 989
Query: 857 NANWM---SAVEILDDDIYLGAENNFNLFTVRKNSEGATDEE-------RGRLEVVGEYH 906
+ + ++ A++ FN F+V + E RL +YH
Sbjct: 990 ELRGVMDGTVIQYPSHHDTCFADDCFNFFSVSHTTPSTDSSEPPPSTITTQRLRTTAQYH 1049
Query: 907 LGEFVNRFRHGSLV--------MRLPDSDVGQI--PTVIFGTVNGVIGVIASLPHEQYLF 956
+G+ + + GS + +P + V + P + +GT +G G I + +E +L
Sbjct: 1050 IGDLITAMQQGSFAPCSVINDFVPVPMALVPGVCGPQIAYGTSHGAFGTITPVTNETFLL 1109
Query: 957 LEKLQTNLRKVIKGVGGLNHEQWR---------------SF-----------NNEKKTVD 990
L+ ++ ++ V+ +GG H +R SF N ++
Sbjct: 1110 LKSIEISVAAVLPPLGGFCHSSYREVLCAGQERGMSRNTSFEVLNPKAAEVMNRRRQKYL 1169
Query: 991 AKNFLDGDLIESFLDLS--------RTRMDEI---SKTMNVSVEEL 1025
K GD++E FL +S + D+I S + N S+EE
Sbjct: 1170 PKCVCSGDVVEMFLRMSDSEKRSTIKLASDQIVRWSPSSNTSLEEF 1215
>gi|194756960|ref|XP_001960738.1| GF11349 [Drosophila ananassae]
gi|190622036|gb|EDV37560.1| GF11349 [Drosophila ananassae]
Length = 1455
Score = 95.5 bits (236), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 108/435 (24%), Positives = 199/435 (45%), Gaps = 62/435 (14%)
Query: 654 IRSIPLGEHPRRICHQEQSRTFAICSL-------------KNQSCAEESE-MHFVR---- 695
IR +PL PR++ + ++R + + + +++ +EES F+
Sbjct: 1026 IRKVPLRCTPRQLVYHRENRVYCLITQNEEPMTKFYRFNGEDKELSEESRGERFIYPIGS 1085
Query: 696 -----LLDDQTFEFI--STYPLDTFEY--GCSILSCSFSDDSN---VYYCVGTAYVLPEE 743
L+ +T+E + ++ + +E+ I+ S+ + Y C+GT + E
Sbjct: 1086 QFEMVLISPETWEIVPDASIRFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNY-SE 1144
Query: 744 NEPTKGRILVF-----IVEDGK------LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQ 792
+ ++G I ++ + E GK L+ + +KE KG V +++ G L+ + QKI
Sbjct: 1145 DITSRGNIHIYDIIEVVPEPGKPMTKFKLKEVFKKEQKGPVSAISDVLGFLVTGLGQKI- 1203
Query: 793 LYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEER 852
Y W LRD +L +I + T I + D+ KSISLL ++ E +
Sbjct: 1204 -YIWQLRDG---DLIGVAFIDTNIYVHQIITVKSLIFIADVYKSISLLRFQEEYRTLSLA 1259
Query: 853 ARDYNANWMSAVEILDDDIYLG-----AENNFNLFTVRKNSEGATDEERGRLEVVGEYHL 907
+RD+N + +E + D+ LG AE N ++ + + + ++ L +YHL
Sbjct: 1260 SRDFNPLEVYGIEFMVDNSNLGFLVTDAERNLIVYMYQPEARESLGGQK--LLRKADYHL 1317
Query: 908 GEFVNR-FR----HGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQT 962
G+ VN FR L R P + V++GT++G +G LP + Y LQ
Sbjct: 1318 GQVVNTMFRVQCHQKGLHQRQPFLYENK-HFVVYGTLDGALGYCLPLPEKLYRRFLMLQN 1376
Query: 963 NLRKVIKGVGGLNHEQWRSFNNEKK--TVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV 1020
L + + GLN +++R+ KK ++ +DGDLI S+ L+ + +E++K +
Sbjct: 1377 VLLSYQEHLCGLNPKEYRTIKAVKKQGINPSRCIIDGDLIWSYRLLANSERNEVAKKIGT 1436
Query: 1021 SVEELCKRVEELTRL 1035
EE+ + E+ RL
Sbjct: 1437 RTEEILSDLLEIERL 1451
Score = 57.0 bits (136), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 71/332 (21%), Positives = 136/332 (40%), Gaps = 65/332 (19%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
+YG + +L+ G +D L I+ + K VLQ D ++ L T ++ + R
Sbjct: 76 LYGNVMSLQCVSLAGAMRDALLISFKDAKLSVLQHDPDTYALKTLSLHYFEEEDIRGGWT 135
Query: 66 GQIGI----IDPDCRLIGLHLYDGLFKVIPFDNKGQLKE--------------------- 100
G+ + +DPD R + +Y V+PF L E
Sbjct: 136 GRYFVPVVRVDPDSRCAVMLVYGKRLVVLPFRKDNTLDEIELADVKPIKKAPTAMVTRTP 195
Query: 101 ---AFNIRLEEL-----QVLDIKFLYGCAKPTIVVLYQDNKDAR-HVKTYE-------VA 144
++ I L +L VLDI+FL+G +PT+++LY+ + +K ++
Sbjct: 196 IMASYLIALRDLDEKIDNVLDIQFLHGYYEPTLLILYEPVRTCPGRIKVRSDTCVLVAIS 255
Query: 145 LKDKDFVEG-PWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVY-------------CSA 190
L + V W+ N+L + P+ P+ G L++ ++Y SA
Sbjct: 256 LNIQQRVHPIIWTVNSLPFDCQQVYPIQKPIGGCLVMTVNAVIYLNQSVPPYGVSLNSSA 315
Query: 191 NAFKAIPIRP------SITKA-YGRVDADGSRYLLGDHAGLLHLLVITHEKEK-VTGLKI 242
+ + P++P S+ A + +D D + ++ G L++L + + + V
Sbjct: 316 DNSTSFPLKPQDGVRISLDCANFAFIDVD--KLVISLRTGDLYVLTLCVDSMRTVRNFHF 373
Query: 243 ELLGETSIASTISYLDNAVVYIGSSYGDSQLI 274
+ + S I L + +++GS G+S L+
Sbjct: 374 HKAAASVLTSCICVLHSEYIFLGSRLGNSLLL 405
Score = 42.4 bits (98), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 43/159 (27%), Positives = 70/159 (44%), Gaps = 26/159 (16%)
Query: 288 EVLERYVNLGPIVDFCV---VDLERQGQG-------------QVVTCSGAYKDGSLRIVR 331
EV + +N+ PI C V+ E G ++V +G K+G+L +
Sbjct: 485 EVCDSLINVAPINYMCAGERVEFEEDGTTLRPHAENLNDLKIELVAATGHSKNGALSVFV 544
Query: 332 NGIGINEQASVELQGIKGMWSL------RSSTDDPFDTFLVVSFISETRILAMNLEDELE 385
N I S EL G +W++ ++S D D F+++S + T +L E+
Sbjct: 545 NCINPQIITSFELDGCLDVWTVFDDATKKTSRHDQHD-FMLLSQRNSTLVLQTG--QEIN 601
Query: 386 ETEIEGFCSQTQTLFCHDAIYNQ-LVQVTSGSVRLVSST 423
E E GF T+F + + +VQVT+ VRL+ T
Sbjct: 602 EIENTGFTVNQPTIFVGNLGQQRFIVQVTTRHVRLLQGT 640
>gi|332018184|gb|EGI58789.1| Cleavage and polyadenylation specificity factor subunit 1 [Acromyrmex
echinatior]
Length = 1412
Score = 95.5 bits (236), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 122/515 (23%), Positives = 221/515 (42%), Gaps = 81/515 (15%)
Query: 575 TQPITLRTFSS-KNTTHVFAASDRPTVIYSSNKKLLYSNVNLKE--VSHMCPFNSAAFPD 631
T+ +R FS+ VF SD P I+ + + L ++ + V+ PFN+ P
Sbjct: 897 TRICVMRYFSNIAGYNGVFICSDYPHWIFLTGRGELRTHPMGIDGPVTSFAPFNNINCPQ 956
Query: 632 S-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCA 686
L ++ EL I + +R +PL P + + +S+T+ + + S A
Sbjct: 957 GFLYFNRKEELRICVLPTHLSYDAPWPVRKVPLRCTPHFVTYHLESKTYCVIT----STA 1012
Query: 687 EESE------------------MHFVRLLDDQTFEFIS--TYP-----LDTFEYGCSILS 721
E + F R+ + F +S T P LD +E+ + +
Sbjct: 1013 EPLKSYYRFNGEDKVLTKLYYLFQFSRIFMNLLFSPVSWETIPNTKIELDQWEHVTCLKN 1072
Query: 722 CSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-IVE----------DGKLQLIA 765
S + + Y +GT Y E+ ++GRIL+F I+E + + I
Sbjct: 1073 VSLAYEGTRSGLKGYIVLGTNYNYGEDIT-SRGRILIFDIIEVVPEPGQPLTKNRFKQIY 1131
Query: 766 EKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDD---GTRELQSECGHHGHILALYVQ 822
KE KG + ++ +G L++A+ QKI Y W L+D+ G + ++ H +L++
Sbjct: 1132 AKEQKGPITAITQVSGFLVSAVGQKI--YIWQLKDNDLVGVAFIDTQIYIH-QMLSI--- 1185
Query: 823 TRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG-----AEN 877
I++ D+ KSISLL ++ E + +RD+ + +E L D+ LG E+
Sbjct: 1186 --KSLILIADVYKSISLLRFQEEYRTLSLVSRDFRPAEVYTIEYLIDNSNLGFIVADGES 1243
Query: 878 NFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIP----- 932
N LF + S + + +L ++HLG+ +N F + P +D Q
Sbjct: 1244 NLALFMYQPESRESLGGQ--KLIRKADFHLGQKINTFFRIKCRITDPANDKKQFSGADKR 1301
Query: 933 -TVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNN---EKKT 988
++ +++G +G I +P + Y L LQ L I + GLN + +R +
Sbjct: 1302 HVTMYASLDGSLGYILPVPEKTYRRLLMLQNVLVTHICHIAGLNPKAYRHTYKSYVRNQG 1361
Query: 989 VDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVE 1023
A+ +DGDL+ +L L +++K + V+
Sbjct: 1362 NPARGIIDGDLVWRYLFLPNNEKADLAKKIGTRVQ 1396
Score = 50.8 bits (120), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 65/330 (19%), Positives = 140/330 (42%), Gaps = 62/330 (18%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRP--T 63
++G I +++ G +D L ++ K V+++D + +L T ++ + + T
Sbjct: 73 LHGNIMSMQAVHLIGSQRDSLLLSFRDAKLSVVEYDQDIHDLRTVSLHYFEEEEIKDGWT 132
Query: 64 DNGQIGII--DPDCRLIGLHLYDGLFKVIPF-------------------DNKGQLKEAF 102
++ I I+ DP+ R + ++ V+PF NK + ++
Sbjct: 133 NHHHIPIVRVDPEGRCAVMLIFGRKLVVLPFRKDPSLDDGDLLDSAKLTSTNKTPILSSY 192
Query: 103 NIRLEELQ-----VLDIKFLYGCAKPTIVVLYQDNKD------ARHVKTYEVALK---DK 148
I L+ L+ V+D++FL+G +PT+++LY+ + R VA+ +
Sbjct: 193 MIVLKTLEEKMDNVIDLQFLHGYYEPTLLILYEPVRTFSGRIAVRQDTCAMVAISLNIQQ 252
Query: 149 DFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAFKA 195
WS +NL +PV PL G LI+ +++Y + A++
Sbjct: 253 RVHPIIWSVSNLPFDCYQAVPVKKPLGGTLIMAVNSLIYLNQSIPPYGVSLNSLADSSTN 312
Query: 196 IPIRPSITKAYGRVDADGS--------RYLLGDHAGLLHLL-VITHEKEKVTGLKIELLG 246
P++P + ++ +GS R ++ +G L++L + V G +
Sbjct: 313 FPLKP---QEGVKMSLEGSQVAFISADRLVISLKSGELYVLSLFADSMRSVRGFHFDKAA 369
Query: 247 ETSIASTISYLDNAVVYIGSSYGDSQLIKL 276
+ + S + ++ +++GS G+S L++
Sbjct: 370 ASVLTSCVCMCEDNYLFLGSRLGNSLLLRF 399
Score = 44.7 bits (104), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 41/154 (26%), Positives = 75/154 (48%), Gaps = 20/154 (12%)
Query: 285 SYV-EVLERYVNLGPIVDFCVVD--------LERQGQG-QVVTCSGAYKDGSLRIVRNGI 334
SY+ EV + +N+GP + + + L+ Q ++VT SG K+G+L +++ I
Sbjct: 467 SYIFEVCDSLLNIGPCGNISMGEPAFLSEEFLQNQDPDVELVTTSGYGKNGALCVLQRSI 526
Query: 335 GINEQASVELQGIKGMWSL-------RSSTDDPFDTFLVVSFISETRILAMNLEDELEET 387
+ +L G + MW++ T++ FL++S E + + E+ E
Sbjct: 527 RPQVVTTFQLPGCEDMWTVIGIVNNDEIRTEEGSHAFLILS--QEDSTMVLQTGQEINEV 584
Query: 388 EIEGFCSQTQTLFCHDAIYNQ-LVQVTSGSVRLV 420
+ GF +Q T+F + N+ +VQVT VRL+
Sbjct: 585 DQSGFSTQGSTVFAGNLGANRYIVQVTQMGVRLL 618
>gi|157141630|ref|XP_001647735.1| spliceosomal protein sap [Aedes aegypti]
gi|108867854|gb|EAT32403.1| AAEL015441-PA, partial [Aedes aegypti]
Length = 645
Score = 95.1 bits (235), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 128/649 (19%), Positives = 253/649 (38%), Gaps = 112/649 (17%)
Query: 475 QLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSV- 533
++ ++ C+ + + S AVG+ D +VR+ SL + ++ + + +P +
Sbjct: 12 KMPSDVMCMALGSVPAGEQRSWFLAVGL-ADNTVRVISLDPSDCLSPRSM--QALPSAAE 68
Query: 534 LLCAFE---------------GISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPI 578
LC E G YL L +G LL +L+ +G+L D + LG++P+
Sbjct: 69 SLCIVEMGTGDTNEEGVASSAGCIYLNIGLTNGVLLRTVLDPVSGDLADTRTRYLGSRPV 128
Query: 579 TLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKE 638
L + + V A S R + Y + + ++ + + + F+S + +
Sbjct: 129 KLFRIKMQGSEAVLAMSSRTWLSYYFQNRFHLTPLSYETLEYASGFSSEQCSEGIVAIST 188
Query: 639 GELTIGTIDDIQKLHIR-SIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLL 697
L I ++ + + + + PL P+R ++ I + + EE++ + +
Sbjct: 189 NTLRILALEKLGAVFNQITFPLEYTPKRFLIHNETGKLVISETDHNAYTEETKNIRKKQM 248
Query: 698 DDQTFE----------------FIS-TYPLDTF--------------------------- 713
D+ E FI+ P D F
Sbjct: 249 ADEMKEAAGEDEQELANEMADAFINEVLPEDVFSAPKAGTGMWASQIRVMDPINGHTYSK 308
Query: 714 ------EYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKG---RILVFIVEDGKLQLI 764
E S+ F+ D Y G A L + G + + V +L+
Sbjct: 309 VQLAQNEAVLSMALVRFAVDQKWYVVAGVAKDLQMNPKIANGGFIDVYKYDVHTHQLEHY 368
Query: 765 AEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHI--LALYVQ 822
E A ++ F G++L + + +++Y D G ++L +C + HI + +Q
Sbjct: 369 HRTEIDDAPGAIAGFQGRVLVGVGRVLRIY-----DLGKKKLLRKC-ENKHIPNQIVNIQ 422
Query: 823 TRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLF 882
G + V D+ +SI + YK E + A D + W++ +LD D A+ N+
Sbjct: 423 AMGSRVFVSDVQESIYCIKYKRAENQLIIFADDTHPRWITTSTLLDYDTVATADKFGNIA 482
Query: 883 TVRKNSEGATD-----------EERGRL-------EVVGEYHLGEFVNRFRHGSLVMRLP 924
+R + D +RG L E + +HLGE + + +L+
Sbjct: 483 ILRLPHSVSDDVDEDPTGNKALWDRGLLNGASQKAENICTFHLGETIMSLQKATLI---- 538
Query: 925 DSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFN 983
G ++I+ T++G +G + E Y F + L+ ++R + G +H +RS+
Sbjct: 539 ---PGGSESLIYATMSGTVGALVPFTSREDYDFFQHLEMHMRNENTPLCGRDHLSYRSY- 594
Query: 984 NEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
K+ +DGDL E F + + I+ + + E+ K++E++
Sbjct: 595 ----YYPVKHVMDGDLCEQFTSMDPAKQKSIASDLGRTPNEVAKKLEDI 639
>gi|156086042|ref|XP_001610430.1| hypothetical protein [Babesia bovis T2Bo]
gi|154797683|gb|EDO06862.1| conserved hypothetical protein [Babesia bovis]
Length = 1450
Score = 95.1 bits (235), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 198/943 (20%), Positives = 339/943 (35%), Gaps = 216/943 (22%)
Query: 292 RYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMW 351
+ NLGPIVDF G ++ C G +G + + NG+ I+ AS G+
Sbjct: 523 KQTNLGPIVDFTFGPQSDSGAIPILACCGYGAEGRVCSITNGVAIDVMASSPASGVLYTT 582
Query: 352 SLRSSTDDPFDTFLVVSFISETR----ILAMNLEDELEETEIE----------------- 390
+L + + + S+ + TR ++ + + +T I+
Sbjct: 583 ALPLFNEANNNFMICTSYFNRTRFYKVVVPPAPKRDPSKTAIDFVKQPLWVVTSMDPSSH 642
Query: 391 GFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELR------NEWKSPPGYSVNVATA 444
F +TL ++QVT+ + LV+ S + N+ G +V
Sbjct: 643 RFVEHMRTLLMVPYGTQMVLQVTTNGISLVNYKSVAIPRVDHSLNDICKIAGIAVGSGVM 702
Query: 445 NAS---------QVLLATGGGHLVYLEIGDGILTEVKHAQ-LEYEISCLDINPIGE--NP 492
S V + + L++ +G+ ++ H L ++S G+
Sbjct: 703 PISIVSCHLCDSGVFIGLSNHVFLILDVSNGV--KILHKSVLPKQVSSTAYLSGGDFKTK 760
Query: 493 SYSQIAAVGMWTDISVRIFSLPDLNLI--TKEHLGGEIIPRSVLLCAFEGISYLLCALGD 550
+ + AV W D + + S L ++ TK G + R+V ++ + AL D
Sbjct: 761 RRTGLIAVSTWEDTEISLISPEGLQMLHTTKVPCGYGVAIRAVRFGVVGDVALVFAALSD 820
Query: 551 GHLLNFLLNM----KTGELTDRK---------KVSLGTQPITLRTFSSKNTT-------- 589
G L + L K G T+ + KVS G PI L S +
Sbjct: 821 GTLCVYRLKFGDCDKDGLGTNTQLSMVMENVIKVSNG--PIGLDAISMHASVPGSGGTNL 878
Query: 590 ---HVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSA-AFPD--SLAIAKEGELTI 643
+ D P +IY++ KL Y VN+ + + FN P SL + + I
Sbjct: 879 LKNRIVTTGDNPMLIYANRGKLEYVPVNVPRIDTVTSFNFVEKNPQGVSLVFTDKKSIHI 938
Query: 644 GTIDDIQKLHIRSIPLGEHPRRIC-HQEQSRTFAIC-----------SLKNQSCAEESEM 691
G +D +LH+ +I G IC H E C S+ + S
Sbjct: 939 GHMDTALQLHVETICSGRSFETICYHDESDLVVVGCNGELIADCDMPSISDSSVTHNDSS 998
Query: 692 HF--------------------VRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVY 731
F V+ + T E + T + S+ + F D
Sbjct: 999 VFRCMDVASCGTIPGVYVLKSCVKFIHLGTKEVVHTLNMPQRHVITSMCTVKF-DGPKTL 1057
Query: 732 YCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFN----------- 780
+G++ + ++ P++G I F+V+ +I K V L N
Sbjct: 1058 IALGSSLIQDKDGVPSQGYI--FLVD-----VIKSDAHKWNVVFLRTLNFIDKGVTQMTP 1110
Query: 781 --GKLLAAINQKIQLYKWMLR------DDGTRELQSECGHHG------HILALYVQTRG- 825
+L+ A+N+ + + ++R D R + E H + L + +
Sbjct: 1111 CINQLVVALNETVAVLS-LVRGADPEPSDTLRSYKMEVLHESDESYGQYTLVTRAEYKSC 1169
Query: 826 ----------DFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGA 875
D IV+GDLM S+ +L ++ E + E +D+N+ + +A +D + A
Sbjct: 1170 SYVVSLDAYQDVIVIGDLMNSMRMLQWQGTE--LREVCKDFNSVYCTAAAAIDQTSCVVA 1227
Query: 876 ENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHG-------SLVMRLPDSDV 928
+++ N + K D E + E VG +H GE +NR R +L+ D D+
Sbjct: 1228 DSSGNFYVFAKRQVVTNDAEAIKAEDVGLFHHGELINRIRRNPKVQSRVALLNGNSDHDM 1287
Query: 929 GQIPTVI-----FGTVNGVIGVIASLPHE----------------QYLF----------- 956
+ P F V G V A++ H +Y F
Sbjct: 1288 PEFPMKTYCNRPFCCVQG--DVSAAISHSSCNDLVRGLAKPNKFWKYGFKTILTCVTTSG 1345
Query: 957 ----------------LEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLI 1000
L ++ + ++ G +++E WRSF N AK F+DGD I
Sbjct: 1346 SLLQLCIFDDTKLFCRLAFVEEGINRIQPQAGNISNEHWRSFKNRWMMCAAKGFIDGDAI 1405
Query: 1001 ESFLDLSRTRMDE----ISKT----MNVSVEELCKRVEELTRL 1035
ES+ L + E +SKT + S E L VE + RL
Sbjct: 1406 ESYNRLDTSLKSEVYNMVSKTDAHGLFYSPELLSLEVEHIQRL 1448
>gi|351713968|gb|EHB16887.1| Cleavage and polyadenylation specificity factor subunit 1
[Heterocephalus glaber]
Length = 1440
Score = 95.1 bits (235), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 109/466 (23%), Positives = 192/466 (41%), Gaps = 74/466 (15%)
Query: 618 VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQS 672
+ PF++ P L ++GEL I + +R IPL + + +S
Sbjct: 969 IDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVES 1028
Query: 673 RTFAICSLKNQSCAE-----------------------ESEMHFVRLLDDQTFEFI--ST 707
+ +A+ + + C + E ++L+ ++E I +
Sbjct: 1029 KVYAVATSTSTPCTRIPRMTGEEKEFEAIERDDRYIHPQQEAFSIQLISPVSWEAIPNAR 1088
Query: 708 YPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRI-----LVFIVE 757
L+ +E+ + + S + V Y GT + EE +GR+ + + E
Sbjct: 1089 IELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVT-CRGRVRDWERIEVVPE 1147
Query: 758 DG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECG 811
G K +++ EKE KG V +L NG L++AI QKI L W LR EL
Sbjct: 1148 PGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFL--WSLR---ASELTGMAF 1202
Query: 812 HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI 871
+ + + +FI+ D+MKSISLL Y+ E + +RD + +V+ + D+
Sbjct: 1203 IDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNA 1262
Query: 872 YLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV 928
LG ++ + NL E RL ++H+G VN F R P
Sbjct: 1263 QLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF------WRTPCRGA 1316
Query: 929 GQIPT-----------VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHE 977
+ P+ F T++G IG++ + + Y L LQ L ++ GLN
Sbjct: 1317 SEGPSKKSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPR 1376
Query: 978 QWRSFNNEKKTVD--AKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
+R + +++ + +N LDG+L+ +L LS E++K + +
Sbjct: 1377 AFRMLHVDRRILQNAVRNVLDGELLNRYLYLSTMERGELAKKIGTT 1422
Score = 55.8 bits (133), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 107/565 (18%), Positives = 219/565 (38%), Gaps = 151/565 (26%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAM-----GDVSDRIG 60
+G + ++ + G +D L ++ + K V+++D + +L T ++ ++ D
Sbjct: 74 FFGNVMSMASVQLAGAKRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPELRDGFV 133
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPF-------DNKGQLKEA----------FN 103
+ ++ + DPD R + +Y V+PF +++G + E +
Sbjct: 134 QNVHTPRVRV-DPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLVGEGQRSSFLPSYIID 192
Query: 104 IR-LEE--LQVLDIKFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDFVEG-- 153
+R L+E L ++D++FL+G +PT+++L++ N+ A T + + +
Sbjct: 193 VRALDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKVH 252
Query: 154 P--WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAFKAIPI 198
P WS +L + VP P+ GV+I +++Y + A P+
Sbjct: 253 PVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFPL 312
Query: 199 RPSITKAYGRVDADGS--------RYLLGDHAGLLHLL-VITHEKEKVTGLKIELLGETS 249
R T+ R+ D + + ++ G +++L +IT V + +
Sbjct: 313 R---TQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASV 369
Query: 250 IASTISYLDNAVVYIGSSYGDSQLIKL--NLQ---------------PDAKGSYVE---- 288
+ +++ ++ +++GS G+S L+K LQ P +K V+
Sbjct: 370 LTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASTVREAADKEEPPSKKKRVDSAAG 429
Query: 289 ----------------------------------VLERYVNLGPIVDFCVVD---LERQG 311
V + +N+GP + V + L +
Sbjct: 430 WAGNKTVPQDEVDEIEVYGSEAQSGTQLATYSFEVCDSMLNIGPCANAAVGEPAFLSEEF 489
Query: 312 QG------QVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL------------ 353
Q ++V CSG K+G+L +++ I + EL G MW++
Sbjct: 490 QNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEEEETP 549
Query: 354 ------------RSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFC 401
+ D FL++S T IL E+ E + GF +Q T+F
Sbjct: 550 KAEGSEQEPSAPEAQDDGRRHGFLILSREDSTMILQTG--QEIMELDTSGFATQGPTVFA 607
Query: 402 HDAIYNQ-LVQVTSGSVRLVSSTSR 425
+ N+ +VQV+ +RL+ ++
Sbjct: 608 GNIGDNRYIVQVSPLGIRLLEGVNQ 632
>gi|356530945|ref|XP_003534039.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Glycine max]
Length = 1449
Score = 94.7 bits (234), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 96/356 (26%), Positives = 164/356 (46%), Gaps = 40/356 (11%)
Query: 706 STYPLDTFEYGCSI----LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFI---VED 758
+T P+ + E ++ L + S ++ +GTAYV E+ +GRIL+F + D
Sbjct: 1100 ATIPMQSSENALTVRMVTLLNTTSKENETLLAIGTAYV-QGEDVAARGRILLFSLGKITD 1158
Query: 759 GKLQLIAE---KETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHG- 814
L++E KE KGA+ +L + G LL A KI L+KW +GT EL
Sbjct: 1159 NPQTLVSEVYSKELKGAISALASLQGHLLIASGPKIILHKW----NGT-ELNGIAFFDAP 1213
Query: 815 --HILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIY 872
H+++L + +FI++GD+ KSI L +K + + A+D+ + A E L D
Sbjct: 1214 PLHVVSLNIVK--NFILIGDIHKSIYFLSWKEQGAQLSLLAKDFGSLDCFATEFLIDGST 1271
Query: 873 LG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRF----------RHGSL 919
L +++N N+ + + + +L E+H+G V +F R GS+
Sbjct: 1272 LSLMVSDDNRNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLSTSDRAGSV 1331
Query: 920 VMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQW 979
P SD ++FGT++G IG IA L + L+ LQ L + V GLN +
Sbjct: 1332 ----PGSDKTNRFALLFGTLDGSIGCIAPLDEITFRRLQSLQRKLVDAVPHVAGLNPRAF 1387
Query: 980 RSF--NNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELT 1033
R F N + + +D +L+ + L EI+ + + ++ + +L+
Sbjct: 1388 RLFRSNGKAHRPGPDSIVDCELLCHYEMLPLEEQLEIANQIGTTRSQILSNLSDLS 1443
Score = 74.7 bits (182), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 108/460 (23%), Positives = 176/460 (38%), Gaps = 128/460 (27%)
Query: 66 GQIGIIDPDCRLIGLHLYDGLFKVI--------------PFDNKG----QLKEAFNIRLE 107
G + IDP R G+ +YD ++ F + G +++ ++ I L
Sbjct: 178 GPVVKIDPQGRCGGVLIYDLQMIILKATQVGSGLVGDDDAFGSSGAVAARIESSYMINLR 237
Query: 108 EL---QVLDIKFLYGCAKPTIVVLYQDNKD-------ARHVKTYEVALKDKDFVEGP--W 155
+L V D F+YG +P +V+L++ H + P W
Sbjct: 238 DLDMRHVKDFTFVYGYIEPVMVILHERELTWAGRVSWTHHTCMISALSISTTLKQHPLIW 297
Query: 156 SQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIR---------PSITKAY 206
S NL + A L+ VP P+ GVL+IG TI Y S +A A+ + I ++
Sbjct: 298 SAVNLPHDAYKLLAVPSPIGGVLVIGANTIHYHSQSASCALALNNYAVTLDSSQEIPRSS 357
Query: 207 GRV--DADGSRYLLGDHA------GLLHLLVITHEKEKVTGLKIELLGETSIASTISYLD 258
V DA + +LL D A G L LL++ ++ V L + + ++S I+ +
Sbjct: 358 FNVELDAANATWLLSDVALLSTKTGELLLLMLVYDGRVVQRLDLSKSKASVLSSGITTIG 417
Query: 259 NAVVYIGSSYG-----------------------------DSQLIKLNLQPD-------- 281
N++ ++ S G D+ +L P
Sbjct: 418 NSLFFLASRLGDSMLVQFSCGSGVSMMSSNLKEEVGDIEVDAPSKRLRRSPSDALQDMVS 477
Query: 282 ----------------AKGSY-VEVLERYVNLGPIVDFC----------VVDLERQGQGQ 314
A+ S+ V + +N+GP+ DF + +Q +
Sbjct: 478 GEELSLYGSATNRTESAQKSFSFAVRDSLINVGPLKDFSYGLRINADANATGIAKQSNYE 537
Query: 315 VVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMW-----SLRSST---------DDP 360
+V CSG K+GSL ++R I VEL G KG+W S RS DD
Sbjct: 538 LVCCSGHGKNGSLCVLRQSIRPEVITEVELPGCKGIWTVYHKSTRSHNADSSKMADDDDE 597
Query: 361 FDTFLVVSFISETRILAMNLEDELEE-TEIEGFCSQTQTL 399
+ +L++S E R + + D L E TE + Q +TL
Sbjct: 598 YHAYLIISL--EARTMVLETADLLSEVTESVDYYVQGKTL 635
>gi|71407487|ref|XP_806209.1| damage-specific DNA binding protein [Trypanosoma cruzi strain CL
Brener]
gi|70869885|gb|EAN84358.1| damage-specific DNA binding protein, putative [Trypanosoma cruzi]
Length = 508
Score = 94.0 bits (232), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 86/340 (25%), Positives = 140/340 (41%), Gaps = 62/340 (18%)
Query: 735 GTAYVLPEENEPTKGRILVFIVEDGKL-------QLIAEKETKGAVYS---LNAFNGKLL 784
GT + P+E P R + + VE GKL + I K+ +GA+ + + G++
Sbjct: 92 GTTFAFPDEQLPRSSRFIWYCVEQGKLISERPQLRQIGSKDVEGALQCCCIVPNYVGRIA 151
Query: 785 AAINQKIQLYKWMLRDDGTRELQSECGHH--GHILALYVQTRGDFIVVGDLMKSISLLIY 842
IN I LY W D ++ C +L ++ Q ++V D S +
Sbjct: 152 LGINGCIALYSWNAADSVFVAEETICIGTIVTRVLPIF-QGDASYMVAFDARHSCFFIQV 210
Query: 843 KHEEGAIEERARDYNANW-MSAVEILDDDIY--LGAENNFNLFTVRK---------NSEG 890
+G++E ARD M +IY ++ +N F + + N+
Sbjct: 211 DTIQGSLEIVARDSEPRGVMDGAVFQLGNIYNICFGDDYYNFFCLSRGASTLPSVSNATS 270
Query: 891 ATDEERGRLEVVGEYHLGEFVNRFRHGS--------LVMRLPDSDVGQI--PTVIFGTVN 940
AT G+LE +YHLG+ V + GS + + +P + + I P V+FGT +
Sbjct: 271 ATVSS-GKLETSAQYHLGDMVTAMQLGSFAPCSVTNIAVPIPTTLIPGICGPQVVFGTSH 329
Query: 941 GVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRS------------------- 981
G G I + +E YLFL+ L+ + V+ +GG H +R
Sbjct: 330 GAFGTITPVSNETYLFLKALEVAVSSVVPALGGFEHVTYREVLRAGQERGYSRNASFENA 389
Query: 982 -------FNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEI 1014
F+ +K ++ GDLIESFL S+T I
Sbjct: 390 NVVSSGVFDKHRKRYLSRCVCSGDLIESFLTFSQTIQQRI 429
>gi|224120960|ref|XP_002318462.1| predicted protein [Populus trichocarpa]
gi|222859135|gb|EEE96682.1| predicted protein [Populus trichocarpa]
Length = 1455
Score = 94.0 bits (232), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 98/374 (26%), Positives = 170/374 (45%), Gaps = 49/374 (13%)
Query: 694 VRLLD--DQTFEFISTYPLDTFEYGCSILSCSF----SDDSNVYYCVGTAYVLPEENEPT 747
VR+L+ + ++ +T P+ T E ++ S + ++ VGTAYV E+
Sbjct: 1091 VRILEPSNGPWQVKATIPMQTSENALTVRMVSLFNTSTKENETLLAVGTAYV-QGEDVAA 1149
Query: 748 KGRILVFIV----EDGKLQLIAE---KETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRD 800
+GRIL+F V E+ ++ L++E KE KGA+ +L + G LL A KI L+KW
Sbjct: 1150 RGRILLFSVVKNPENSQI-LVSEVYSKELKGAISALASLQGHLLIASGPKIILHKWT--- 1205
Query: 801 DGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANW 860
GT + + + +FI++GD+ KSI L +K + + A+D+ +
Sbjct: 1206 -GTELTGVAFSDAPPLYVVSLNIVKNFILLGDIHKSIYFLSWKEQGAQLSLLAKDFASLD 1264
Query: 861 MSAVEILDDDIYLG-----AENNFNLFT-VRKNSEGATDEERGRLEVVGEYHLGEFVNRF 914
+ E L D L + N +F K SE + +L E+H+G V +F
Sbjct: 1265 CFSTEFLIDGSTLSLVVSDEQKNVQIFYYAPKMSESWKGQ---KLLSRAEFHVGALVTKF 1321
Query: 915 RHGSLVMRL-------------PDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQ 961
MRL P SD ++FGT++G IG IA L + L+ LQ
Sbjct: 1322 ------MRLQMLSPSLDRSGAAPVSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQ 1375
Query: 962 TNLRKVIKGVGGLNHEQWRSFNNEKKT--VDAKNFLDGDLIESFLDLSRTRMDEISKTMN 1019
L + V GLN + +R F ++ K ++ +D +++ + + EI++ +
Sbjct: 1376 KKLVDAVPHVAGLNPKSFRQFRSDGKAHRPGPESIVDCEMLSYYEMIPLEEQVEIAQQIG 1435
Query: 1020 VSVEELCKRVEELT 1033
+ ++ + +LT
Sbjct: 1436 TTRAQILSNLNDLT 1449
Score = 70.1 bits (170), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 122/517 (23%), Positives = 201/517 (38%), Gaps = 144/517 (27%)
Query: 23 QDFLFIATERYKFCVLQWDAESSELITRAM----GDVSDRIGRPTDN---GQIGIIDPDC 75
+D + +A + K VL++D L T +M G + R ++ G + +DP
Sbjct: 130 RDSIILAFKDAKISVLEFDDSIHGLRTSSMHCFEGPDWRHLKRGRESFARGPLVKVDPQG 189
Query: 76 RLIGLHLYDGLFKVIPFDNKG----QLKEAF----------------NIR-LEELQVLDI 114
R G+ +YD ++ G Q ++AF N+R L+ V D
Sbjct: 190 RCGGVLVYDLQMIILKAAQAGSALVQDEDAFGSGAAISAHIASSYIINLRDLDMKHVKDF 249
Query: 115 KFLYGCAKPTIVVLYQDNKDA------RHVKTYEVALKDKDFVEGP---WSQNNLDNGAD 165
F++ +P +VVL++ +H AL ++ P WS NL + A
Sbjct: 250 IFVHDYIEPVVVVLHERELTWAGRVVWKHHTCMISALSISTTLKQPTLIWSIGNLPHDAY 309
Query: 166 LLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIR------------PSITKAYGRVDADG 213
L+ VP P+ GVL+IG TI Y S +A A+ + P T + +DA
Sbjct: 310 KLLAVPSPIGGVLVIGVNTIHYHSESASCALALNSYAASVDSSQELPRATFSV-ELDAAN 368
Query: 214 SRYLLGDHA------GLLHLLVITHEKEKVTGLKIELLGETSIASTISYL---------- 257
+ +LL D A G L LL + ++ V L + + + S I+ L
Sbjct: 369 ATWLLKDVALLSTKTGELLLLTLVYDGRVVQRLDLSKSKASVLTSDITTLGNSFFFLGSR 428
Query: 258 --DNAVVYIGSSYGDSQL----------IKLNLQPDAK---------------------- 283
D+ +V S G S L I+ +L P AK
Sbjct: 429 LGDSLLVQFTSGLGSSMLSPGLKEEVGDIEGDL-PSAKRLKVSSSDALQDMVSGEELSLY 487
Query: 284 ---------GSYVEVLERY--------VNLGPIVDFC----------VVDLERQGQGQVV 316
V V++ + +N+GP+ DF + +Q ++V
Sbjct: 488 SSAPNNAESSQVVSVIKTFSFTVRDSLINVGPLKDFAYGLRINADANATGISKQSNYELV 547
Query: 317 TCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMW-------------SLRSSTDDPFDT 363
CSG K+G+L +++ I VEL G KG+W SL+ ++DD +
Sbjct: 548 CCSGHGKNGALCVLQQSIRPEMITEVELPGCKGIWTVYHKNARIHSVDSLKMASDDEYHA 607
Query: 364 FLVVSFISETRILAMNLEDELEE-TEIEGFCSQTQTL 399
+L++S E R + + D L E TE + Q +T+
Sbjct: 608 YLIISM--EARTMVLETADHLTEVTESVDYFVQGRTI 642
>gi|317036382|ref|XP_001398211.2| protein cft1 [Aspergillus niger CBS 513.88]
Length = 1393
Score = 94.0 bits (232), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 98/429 (22%), Positives = 179/429 (41%), Gaps = 52/429 (12%)
Query: 654 IRSIPLGEHPRRICHQEQSRTFAI--CSLKNQSCAEESEMH-----------------FV 694
++ + LGE + + S + + C + E+ E+H F+
Sbjct: 961 LKRVHLGEQVDHLAYSTSSGMYVLGTCHATDFKLPEDDELHPEWRNEAISFFPSARGSFI 1020
Query: 695 RLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVY-----YCVGTAYVLPEENEPTKG 749
+L+ T+ I ++ L EY +I + S N + VGTA+ E+ P++G
Sbjct: 1021 KLVSPNTWSIIDSFSLGADEYVMAIKNISLEVSENTHERKDMIVVGTAFAR-GEDIPSRG 1079
Query: 750 RILVFIV-----------EDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWML 798
I VF V D KL+LI ++ KGAV +L+ G+ + Q + L
Sbjct: 1080 CIYVFEVVQVVPDPDHPETDRKLKLIGKEPVKGAVTALSEIGGQGFVLVAQGQKCMVRGL 1139
Query: 799 RDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNA 858
++DG+ + ++ + ++GD +K + Y E + A+D +
Sbjct: 1140 KEDGSLLPVAFMDMQCYVSVVKELKGTGMCILGDAVKGVWFAGYSEEPYKMSLFAKDLDY 1199
Query: 859 NWMSAVEILDDDIYLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFR 915
+ A E L D L A+++ N+ ++ + E RL ++H+G F +
Sbjct: 1200 LEVCAAEFLPDGKRLFIVVADSDCNIHVLQYDPEDPKSSNGDRLLSRSKFHMGNFASTLT 1259
Query: 916 HGSLVM----RLPDSDVG-----QIP--TVIFGTVNGVIGVIASLPHEQYLFLEKLQTNL 964
M ++ S G Q P V+ T NG +G+I +P E Y L LQ+ L
Sbjct: 1260 LLPRTMVSSEKMVSSSDGMDIDNQSPLHQVLMTTQNGSLGLITCIPEESYRRLSALQSQL 1319
Query: 965 RKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEE 1024
++ GLN +R+ E + LDG+L+ ++D+S+ R EI+ + E
Sbjct: 1320 TNTLEHPCGLNPRAFRAV--ESDGTAGRGMLDGNLLFKWIDMSKQRKTEIAGRVGAREWE 1377
Query: 1025 LCKRVEELT 1033
+ +E ++
Sbjct: 1378 IKADLEAIS 1386
>gi|66812672|ref|XP_640515.1| CPSF domain-containing protein [Dictyostelium discoideum AX4]
gi|60468551|gb|EAL66554.1| CPSF domain-containing protein [Dictyostelium discoideum AX4]
Length = 1628
Score = 94.0 bits (232), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 115/467 (24%), Positives = 211/467 (45%), Gaps = 77/467 (16%)
Query: 618 VSHMCPFNSAAFPDS-LAIAKEGEL----TIGTIDDIQK-LHIRSIPLGEHPRRICHQEQ 671
V FN+ + D + +KE ++ T+ T+ + + + IR IP +I + +
Sbjct: 1180 VETFTSFNNISCQDGFIYFSKEKDVIKICTLSTLMNFENDIAIRRIPTKNSCHKIAYHSE 1239
Query: 672 SRTFA-ICSLKN--QSCAEESEMHF-------VRLLD---DQTFEFISTYPLDTFE--YG 716
++ + I S Q E+S+ ++L+D D ++FI ++ L E
Sbjct: 1240 AKCYVVIVSFPQVTQELQEDSKKPILTDDKFQIKLIDPTIDWNWKFIDSFSLQDRETVLA 1299
Query: 717 CSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-IVE-----------DG 759
I+S F++ + + +GTA+ E+ + KGR+LVF IV +
Sbjct: 1300 MKIVSLKFTEPDGITRARPFLVIGTAFTFGEDTQ-CKGRVLVFEIVSHKTQFESEELGEK 1358
Query: 760 KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILAL 819
+L L+ EKE KG V +L++ NG LL I K+ + ++ T L + + I
Sbjct: 1359 RLNLLYEKEQKGPVTALSSVNGLLLMTIGPKLTVNQFY-----TGSLVTLSFYDAQIYIC 1413
Query: 820 YVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGA---- 875
+ T ++IV+GD+ KS+ L +K + + ++DY A + + E + + L
Sbjct: 1414 SICTIKNYIVIGDMYKSVYFLQWK-DNKTLNLLSKDYQALNIFSTEFIVNQKTLSILVSD 1472
Query: 876 -ENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTV 934
+ N LF+ + + + E+ G N+ + RLP + V
Sbjct: 1473 LDKNILLFSFEPQDPSSRSGQINQ-EINGN-------NKNDN-----RLPKKE----QLV 1515
Query: 935 IFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVD---- 990
IFGT++G + V+ L + YL +Q+ L + GLN +Q+RSF + +
Sbjct: 1516 IFGTLDGGLNVLRPLDEKIYLLFYHIQSKLY-YLPQTAGLNPKQYRSFKSFSQNFHFSPS 1574
Query: 991 -----AKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
K LDGDLI FL LS++ IS ++N + +E+ + ++++
Sbjct: 1575 TFHQLPKFILDGDLISKFLSLSQSEKRLISNSINSTSDEIIESLKDV 1621
Score = 47.0 bits (110), Expect = 0.050, Method: Compositional matrix adjust.
Identities = 48/223 (21%), Positives = 90/223 (40%), Gaps = 49/223 (21%)
Query: 94 NKGQLKEAFNIRLEELQVLDIKFLYGCAKPTIVVLYQDNKDARHVKTYEVALKDKDFV-- 151
N Q KE NI +E ++ D FL+G +PTI+ L++ ++T+ + K F
Sbjct: 273 NNNQDKEKKNIEIENVK--DFCFLHGYYEPTILFLHEP------IQTWTSRIAVKKFTCQ 324
Query: 152 -------------EGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYC--------SA 190
W+ +N ++L+ VP PL G L+I + Y +
Sbjct: 325 MTAISLNLLTKAGSFIWNVSNFPYNCEMLVSVPEPLGGALVITANIMFYVNQTSRYGLAV 384
Query: 191 NAFKAI-------------PIRPSITKAY-----GRVDADGSRYLLGDHAGLLHLLVITH 232
N + +I PI ++ + V + +++ G L + +
Sbjct: 385 NEYASIDTSTIIGSQPFDFPIDDTLNLVFTLDRSNFVFLESDKFIGSLKGGELLIFHLIS 444
Query: 233 EKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIK 275
+ V + + G + + S I L N ++++GS GDS L++
Sbjct: 445 DGRSVQRIHVSKAGGSVLTSCICVLSNNLIFLGSRLGDSLLLQ 487
>gi|356559917|ref|XP_003548242.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Glycine max]
Length = 1447
Score = 93.6 bits (231), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 87/305 (28%), Positives = 144/305 (47%), Gaps = 30/305 (9%)
Query: 706 STYPLDTFEYGCSI----LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIV---ED 758
+T P+ + E ++ L + S ++ +GTAYV E+ +GRIL+F + D
Sbjct: 1098 ATIPMQSSENALTVRMVTLVNTTSKENETLLAIGTAYV-QGEDVAARGRILLFSLGKNTD 1156
Query: 759 GKLQLIAE---KETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHG- 814
L++E KE KGA+ +L + G LL A KI L+KW +GT EL
Sbjct: 1157 NPQTLVSEVYSKELKGAISALASLQGHLLIASGPKIILHKW----NGT-ELNGIAFFDAP 1211
Query: 815 --HILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIY 872
H+++L + +FI++GD+ KSI L +K + + A+D+ + A E L D
Sbjct: 1212 PLHVVSLNIVK--NFILIGDIHKSIYFLSWKEQGAQLSLLAKDFGSLDCFATEFLIDGST 1269
Query: 873 LG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMR------L 923
L +++N N+ + + + +L E+H+G V +F ++ +
Sbjct: 1270 LSLMVSDDNRNIQIFYYAPKMSESWKGQKLLSRAEFHVGAHVTKFLRLQMLSTSDRAGAV 1329
Query: 924 PDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFN 983
P SD ++FGT++G IG IA L + L+ LQ L + V GLN +R F
Sbjct: 1330 PGSDKTNRFALLFGTLDGSIGCIAPLDEITFRRLQSLQRKLVDAVPHVAGLNPRAFRLFR 1389
Query: 984 NEKKT 988
+ K
Sbjct: 1390 SNGKA 1394
Score = 78.2 bits (191), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 122/536 (22%), Positives = 208/536 (38%), Gaps = 140/536 (26%)
Query: 66 GQIGIIDPDCRLIGLHLYDGLFKVIPFDNKG------------------QLKEAFNIRLE 107
G + +DP R G+ +YD ++ G +++ ++ I L
Sbjct: 176 GPVVKVDPQGRCGGVLIYDLQMIILKATQAGSGLVGEDDALGSSGAVAARIESSYMINLR 235
Query: 108 EL---QVLDIKFLYGCAKPTIVVLYQ------DNKDARHVKTYEVALK-DKDFVEGP--W 155
+L V D F++G +P +V+L++ +H AL + P W
Sbjct: 236 DLDMRHVKDFTFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIW 295
Query: 156 SQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIRP---------SITKAY 206
S NL + A L+ VP P+ GVL+I TI Y S +A A+ + I ++
Sbjct: 296 SAVNLPHDAYKLLAVPSPIGGVLVISANTIHYHSQSASCALALNSYAVTLDSSQEIPRSS 355
Query: 207 GRV--DADGSRYLLGDHA------GLLHLLVITHEKEKVTGLKIELLGETSIASTISYLD 258
V DA + +LL D A G L LL + ++ V L + + ++S I+ +
Sbjct: 356 FNVELDAANATWLLSDVALLSTKTGELLLLTLVYDGRVVQRLDLSKSKASVLSSGITTIG 415
Query: 259 NAVVYIGSSYGDSQLIKL------------------NLQPDA------------------ 282
N++ ++ S GDS L++ +++ DA
Sbjct: 416 NSLFFLASRLGDSMLVQFSCGSGVSMLSSNLKEEVGDIEADAPSKRLRRSPSDALQDMVS 475
Query: 283 ------------------KGSYVEVLERYVNLGPIVDFC----------VVDLERQGQGQ 314
K V + +N+GP+ DF + +Q +
Sbjct: 476 GEELSLYGSAPNRTESAQKSFSFAVRDSLINVGPLKDFSYGLRINADANATGIAKQSNYE 535
Query: 315 VVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMW-----SLRSST---------DDP 360
+V CSG K+GSL ++R I VEL G KG+W S RS DD
Sbjct: 536 LVCCSGHGKNGSLCVLRQSIRPEVITEVELPGCKGIWTVYHKSTRSHNADSSKMADDDDE 595
Query: 361 FDTFLVVSFISETRILAMNLEDELEE-TEIEGFCSQTQTLFCHD--------AIYNQLVQ 411
+ +L++S E R + + D L E TE + Q +TL + +Y + +
Sbjct: 596 YHAYLIISL--EARTMVLETADLLSEVTESVDYYVQGKTLAAGNLFGRCRVIQVYERGAR 653
Query: 412 VTSGS--VRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD 465
+ GS + VS + L + S +++V+ A+ +L + G + L IGD
Sbjct: 654 ILDGSFMTQDVSFGASNLESGSASDSAIALSVSIADPFVLLRMSDGS--IRLLIGD 707
>gi|195150431|ref|XP_002016158.1| GL10645 [Drosophila persimilis]
gi|194110005|gb|EDW32048.1| GL10645 [Drosophila persimilis]
Length = 1459
Score = 93.6 bits (231), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 105/434 (24%), Positives = 191/434 (44%), Gaps = 60/434 (13%)
Query: 654 IRSIPLGEHPRRICHQEQSRTFAICSL-------------KNQSCAEESEMHFVRLLDDQ 700
+R +PL PR++ + ++R + + + +++ +EES +
Sbjct: 1030 VRKVPLRCTPRQLVYHRENRVYCLITQTEEPMTKYYRFNGEDKELSEESRGERFIYPNGS 1089
Query: 701 TFEFISTYP------------LDTFEY--GCSILSCSFSDDSN---VYYCVGTAYVLPEE 743
FE + P + +E+ I+ S+ + Y C+GT + E
Sbjct: 1090 QFEMVLISPETWEIVPDASIRFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNY-SE 1148
Query: 744 NEPTKGRILVF-----IVEDGK------LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQ 792
+ ++G I ++ + E GK L+ + +KE KG V +++ G L+ + QKI
Sbjct: 1149 DITSRGNIHIYDIIEVVPEPGKPMTKFKLKEVFKKEQKGPVSAISDVLGFLVTGLGQKI- 1207
Query: 793 LYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEER 852
Y W LRD +L +I + T I + D+ KSISLL ++ E +
Sbjct: 1208 -YIWQLRDG---DLIGVAFIDTNIYVHQIITVKSLIFIADVYKSISLLRFQEEHRTLSLA 1263
Query: 853 ARDYNANWMSAVEILDDDIYLG-----AENNFNLFTVRKNSEGATDEERGRLEVVGEYHL 907
+RD+N + +E + D+ LG AE N ++ + + + + +L +YHL
Sbjct: 1264 SRDFNPLEVYGIEFMVDNSNLGFLVTDAERNLIVYMYQPEARESLGGQ--KLIRKADYHL 1321
Query: 908 GEFVNR-FR---HGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTN 963
G+ VN FR H V + V++GT++G +G LP + Y LQ
Sbjct: 1322 GQVVNTMFRVQCHQRGVHQRQPFLYENKHFVVYGTLDGGLGYCLPLPEKVYRRFLMLQNV 1381
Query: 964 LRKVIKGVGGLNHEQWRSFNNEKKT--VDAKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
L + GLN +++R+ + KK ++ +DGDLI S+ L + +E++K +
Sbjct: 1382 LLSYQDHLCGLNPKEFRTLKSFKKQGLNPSRCIIDGDLIWSYRLLPNSDRNEVAKKIGTR 1441
Query: 1022 VEELCKRVEELTRL 1035
EE+ + E+ RL
Sbjct: 1442 TEEILSDLLEIERL 1455
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 71/332 (21%), Positives = 136/332 (40%), Gaps = 65/332 (19%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
+YG + +L+ G QD L ++ + K VLQ D ++ L T ++ + R
Sbjct: 76 LYGNVMSLQCVSLAGAMQDALLVSFKDAKLSVLQHDPDTYALKTLSLHYFEEEDIRGGWT 135
Query: 66 GQIGI----IDPDCRLIGLHLYDGLFKVIPFDNKGQLKE--------------------- 100
G+ + +DPD R + +Y V+PF L E
Sbjct: 136 GRYFVPVVRVDPDARCAVMLVYGKRLVVLPFRKDNSLDEIELTDVKPFKKAPTAMVSRTP 195
Query: 101 ---AFNIRLEEL-----QVLDIKFLYGCAKPTIVVLYQDNKDAR-HVKTYE-------VA 144
++ I L+EL VLDI+FL+G +PT+++LY+ + +K ++
Sbjct: 196 IMASYLITLKELDEKIDNVLDIQFLHGYYEPTLLILYEPVRTCSGRIKVRSDTCVLVAIS 255
Query: 145 LKDKDFVEG-PWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVY-------------CSA 190
L + V W+ N+L + P+ P+ G L++ ++Y SA
Sbjct: 256 LNIQQRVHPIIWTVNSLPFDCFQVYPIQKPIGGCLVMTVNAVIYLNQSVPPYGVSLNSSA 315
Query: 191 NAFKAIPIRP------SITKA-YGRVDADGSRYLLGDHAGLLHLLVITHEKEK-VTGLKI 242
+ + P++P S+ A + +D D + ++ G L++L + + + V
Sbjct: 316 DNSTSFPLKPQDGVRISLDCANFAFIDVD--KLVISLRTGELYVLTLCVDSMRTVRNFHF 373
Query: 243 ELLGETSIASTISYLDNAVVYIGSSYGDSQLI 274
+ + S I + +++GS G+S L+
Sbjct: 374 HKAAASVLTSCICVCHSEYIFLGSRLGNSLLL 405
Score = 42.7 bits (99), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 43/159 (27%), Positives = 70/159 (44%), Gaps = 26/159 (16%)
Query: 288 EVLERYVNLGPIVDFCV---VDLERQGQG-------------QVVTCSGAYKDGSLRIVR 331
EV + +N+ PI C V+ E G ++V +G K+G+L +
Sbjct: 488 EVCDSLINVAPINYMCAGERVEFEEDGTTLRPHAENLHDLKIELVAATGHSKNGALSVFV 547
Query: 332 NGIGINEQASVELQGIKGMWSL------RSSTDDPFDTFLVVSFISETRILAMNLEDELE 385
N I S EL G +W++ ++S D D F+++S + T L + E+
Sbjct: 548 NCINPQIITSFELDGCLDVWTVFDDATKKTSRHDQHD-FMLLSQSNST--LVLQTGQEIN 604
Query: 386 ETEIEGFCSQTQTLFCHDAIYNQ-LVQVTSGSVRLVSST 423
E E GF T+F + + +VQVT+ VRL+ T
Sbjct: 605 EIENTGFTVNQATIFVGNLGQQRFIVQVTTRHVRLLQGT 643
>gi|297820284|ref|XP_002878025.1| hypothetical protein ARALYDRAFT_906938 [Arabidopsis lyrata subsp.
lyrata]
gi|297323863|gb|EFH54284.1| hypothetical protein ARALYDRAFT_906938 [Arabidopsis lyrata subsp.
lyrata]
Length = 454
Score = 93.2 bits (230), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 95/371 (25%), Positives = 167/371 (45%), Gaps = 58/371 (15%)
Query: 694 VRLLDDQTFEFISTYPLDTFEYGCSILSCSFSD-DSNVYYCVGTAYVLPEENEPTKGRIL 752
+R+LD +T L E S+ + +F D + VGT + + P K +
Sbjct: 103 IRILDPKTATTTCLLELQDNEAAYSVCTVNFHDKEYGTLLAVGTVKGM--QFWPKKNLVA 160
Query: 753 VFI-----VEDGK-LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTREL 806
FI VE+GK L+L+ + + +G +L F G+LLA I ++LY D G + L
Sbjct: 161 GFIHIYRFVEEGKSLELLHKTQVEGVPLALCQFQGRLLAGIGPVLRLY-----DLGKKRL 215
Query: 807 QSECGHH---GHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSA 863
+C + I++ +QT D I VGD+ +S Y+ +E + A D W++A
Sbjct: 216 LRKCENKLFPNTIIS--IQTYRDRIYVGDIQESFHYCKYRRDENQLYIFADDCVPRWLTA 273
Query: 864 VEILDDDIYLGAENNFNLFTVR------------------KNSEGATDEERGRLEVVGEY 905
+D D GA+ N++ VR K +G + +++ + ++
Sbjct: 274 SHHVDFDTMAGADKFGNVYFVRLPQDLSEEIEEDPTGGKIKWEQGKLNGAPNKVDEIVQF 333
Query: 906 HLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVI-ASLPHEQYLFLEKLQTNL 964
H+G+ V + S++ G ++++GTV G IG + A + F L+ ++
Sbjct: 334 HVGDVVTCLQKASMI-------PGGSESIMYGTVMGSIGALHAFTSRDDVDFFSHLEMHM 386
Query: 965 RKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESF----LDLSRTRMDEISKTMNV 1020
R+ + G +H +RS K+ +DGDL E F +DL R DE+ +T
Sbjct: 387 RQEYPPLCGRDHMAYRS-----AYFPVKDVIDGDLCEQFPTLPMDLQRKIADELDRT--- 438
Query: 1021 SVEELCKRVEE 1031
E+ K++E+
Sbjct: 439 -PAEILKKLED 448
>gi|198457226|ref|XP_001360595.2| GA10080 [Drosophila pseudoobscura pseudoobscura]
gi|198135905|gb|EAL25170.2| GA10080 [Drosophila pseudoobscura pseudoobscura]
Length = 1459
Score = 93.2 bits (230), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 105/434 (24%), Positives = 191/434 (44%), Gaps = 60/434 (13%)
Query: 654 IRSIPLGEHPRRICHQEQSRTFAICSL-------------KNQSCAEESEMHFVRLLDDQ 700
+R +PL PR++ + ++R + + + +++ +EES +
Sbjct: 1030 VRKVPLRCTPRQLVYHRENRVYCLITQTEEPMTKYYRFNGEDKELSEESRGERFIYPNGS 1089
Query: 701 TFEFISTYP------------LDTFEY--GCSILSCSFSDDSN---VYYCVGTAYVLPEE 743
FE + P + +E+ I+ S+ + Y C+GT + E
Sbjct: 1090 QFEMVLISPETWEIVPDASIRFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNY-SE 1148
Query: 744 NEPTKGRILVF-----IVEDGK------LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQ 792
+ ++G I ++ + E GK L+ + +KE KG V +++ G L+ + QKI
Sbjct: 1149 DITSRGNIHIYDIIEVVPEPGKPMTKFKLKEVFKKEQKGPVSAISDVLGFLVTGLGQKI- 1207
Query: 793 LYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEER 852
Y W LRD +L +I + T I + D+ KSISLL ++ E +
Sbjct: 1208 -YIWQLRDG---DLIGVAFIDTNIYVHQIITVKSLIFIADVYKSISLLRFQEEHRTLSLA 1263
Query: 853 ARDYNANWMSAVEILDDDIYLG-----AENNFNLFTVRKNSEGATDEERGRLEVVGEYHL 907
+RD+N + +E + D+ LG AE N ++ + + + + +L +YHL
Sbjct: 1264 SRDFNPLEVYGIEFMVDNSNLGFLVTDAERNLIVYMYQPEARESLGGQ--KLIRKADYHL 1321
Query: 908 GEFVNR-FR---HGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTN 963
G+ VN FR H V + V++GT++G +G LP + Y LQ
Sbjct: 1322 GQVVNTMFRVQCHQRGVHQRQPFLYENKHFVVYGTLDGGLGYCLPLPEKVYRRFLMLQNV 1381
Query: 964 LRKVIKGVGGLNHEQWRSFNNEKKT--VDAKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
L + GLN +++R+ + KK ++ +DGDLI S+ L + +E++K +
Sbjct: 1382 LLSYQDHLCGLNPKEFRTLKSFKKQGLNPSRCIIDGDLIWSYRLLPNSDRNEVAKKIGTR 1441
Query: 1022 VEELCKRVEELTRL 1035
EE+ + E+ RL
Sbjct: 1442 TEEILSDLLEIERL 1455
Score = 57.8 bits (138), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 71/332 (21%), Positives = 136/332 (40%), Gaps = 65/332 (19%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
+YG + +L+ G QD L ++ + K VLQ D ++ L T ++ + R
Sbjct: 76 LYGNVMSLQCVSLAGAMQDALLVSFKDAKLSVLQHDPDTYALKTLSLHYFEEEDIRGGWT 135
Query: 66 GQIGI----IDPDCRLIGLHLYDGLFKVIPFDNKGQLKE--------------------- 100
G+ + +DPD R + +Y V+PF L E
Sbjct: 136 GRYFVPVVRVDPDARCAVMLVYGKRLVVLPFRKDNSLDEIELTDVKPFKKAPTAMVSRTP 195
Query: 101 ---AFNIRLEEL-----QVLDIKFLYGCAKPTIVVLYQDNKDAR-HVKTYE-------VA 144
++ I L+EL VLDI+FL+G +PT+++LY+ + +K ++
Sbjct: 196 IMASYLITLKELDEKIDNVLDIQFLHGYYEPTLLILYEPVRTCPGRIKVRSDTCVLVAIS 255
Query: 145 LKDKDFVEG-PWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVY-------------CSA 190
L + V W+ N+L + P+ P+ G L++ ++Y SA
Sbjct: 256 LNIQQRVHPIIWTVNSLPFDCFQVYPIQKPIGGCLVMTVNAVIYLNQSVPPYGVSLNSSA 315
Query: 191 NAFKAIPIRP------SITKA-YGRVDADGSRYLLGDHAGLLHLLVITHEKEK-VTGLKI 242
+ + P++P S+ A + +D D + ++ G L++L + + + V
Sbjct: 316 DNSTSFPLKPQDGVRISLDCANFAFIDVD--KLVISLRTGELYVLTLCVDSMRTVRNFHF 373
Query: 243 ELLGETSIASTISYLDNAVVYIGSSYGDSQLI 274
+ + S I + +++GS G+S L+
Sbjct: 374 HKAAASVLTSCICVCHSEYIFLGSRLGNSLLL 405
Score = 42.7 bits (99), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 43/159 (27%), Positives = 70/159 (44%), Gaps = 26/159 (16%)
Query: 288 EVLERYVNLGPIVDFCV---VDLERQGQG-------------QVVTCSGAYKDGSLRIVR 331
EV + +N+ PI C V+ E G ++V +G K+G+L +
Sbjct: 488 EVCDSLINVAPINYMCAGERVEFEEDGTTLRPHAENLHDLKIELVAATGHSKNGALSVFV 547
Query: 332 NGIGINEQASVELQGIKGMWSL------RSSTDDPFDTFLVVSFISETRILAMNLEDELE 385
N I S EL G +W++ ++S D D F+++S + T L + E+
Sbjct: 548 NCINPQIITSFELDGCLDVWTVFDDATKKTSRHDQHD-FMLLSQSNST--LVLQTGQEIN 604
Query: 386 ETEIEGFCSQTQTLFCHDAIYNQ-LVQVTSGSVRLVSST 423
E E GF T+F + + +VQVT+ VRL+ T
Sbjct: 605 EIENTGFTVNQATIFVGNLGQQRFIVQVTTRHVRLLQGT 643
>gi|449524573|ref|XP_004169296.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like, partial [Cucumis sativus]
Length = 741
Score = 92.4 bits (228), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 86/276 (31%), Positives = 129/276 (46%), Gaps = 32/276 (11%)
Query: 733 CVGTAYVLPEENEPTKGRILVFIV---EDGKLQLIAE---KETKGAVYSLNAFNGKLLAA 786
VGTAYV E+ +GR+L+F V D L++E KE KGA+ +L + G LL A
Sbjct: 423 AVGTAYV-QGEDVAARGRVLLFSVGKDADNSQTLVSEVYSKELKGAISALASLQGHLLIA 481
Query: 787 INQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRG---DFIVVGDLMKSISLLIYK 843
KI L+KW EL + + LYV + +FI++GD+ KSI L +K
Sbjct: 482 SGPKIILHKWT-----GAELNGIAFYD--VPPLYVVSLNIVKNFILLGDIHKSIYFLSWK 534
Query: 844 HEEGAIEERARDYNANWMSAVEILDDDIYLGA-----ENNFNLFTVRKNSEGATDEERG- 897
+ + A+D+ + A E L D L + N +F S T+ +G
Sbjct: 535 EQGAQLSLLAKDFGSLDCYATEFLIDGSTLSLTVSDDQKNIQIFYYAPKS---TESWKGQ 591
Query: 898 RLEVVGEYHLGEFVNRFRHGSLVMRLPD------SDVGQIPTVIFGTVNGVIGVIASLPH 951
+L E+H+G V +F ++ D SD ++FGT++G IG IA L
Sbjct: 592 KLLSRAEFHVGAHVTKFLRLQMLSTSSDKACSTVSDKTNRFALLFGTLDGSIGCIAPLDE 651
Query: 952 EQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKK 987
+ L+ LQ L + VGGLN +R F++ K
Sbjct: 652 LTFRRLQSLQKKLGDAVPHVGGLNPRSFRQFHSNGK 687
>gi|449470342|ref|XP_004152876.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Cucumis sativus]
Length = 1504
Score = 92.4 bits (228), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 86/276 (31%), Positives = 129/276 (46%), Gaps = 32/276 (11%)
Query: 733 CVGTAYVLPEENEPTKGRILVFIV---EDGKLQLIAE---KETKGAVYSLNAFNGKLLAA 786
VGTAYV E+ +GR+L+F V D L++E KE KGA+ +L + G LL A
Sbjct: 1186 AVGTAYV-QGEDVAARGRVLLFSVGKDADNSQTLVSEVYSKELKGAISALASLQGHLLIA 1244
Query: 787 INQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRG---DFIVVGDLMKSISLLIYK 843
KI L+KW EL + + LYV + +FI++GD+ KSI L +K
Sbjct: 1245 SGPKIILHKWT-----GAELNGIAFYD--VPPLYVVSLNIVKNFILLGDIHKSIYFLSWK 1297
Query: 844 HEEGAIEERARDYNANWMSAVEILDDDIYLGA-----ENNFNLFTVRKNSEGATDEERG- 897
+ + A+D+ + A E L D L + N +F S T+ +G
Sbjct: 1298 EQGAQLSLLAKDFGSLDCYATEFLIDGSTLSLTVSDDQKNIQIFYYAPKS---TESWKGQ 1354
Query: 898 RLEVVGEYHLGEFVNRFRHGSLVMRLPD------SDVGQIPTVIFGTVNGVIGVIASLPH 951
+L E+H+G V +F ++ D SD ++FGT++G IG IA L
Sbjct: 1355 KLLSRAEFHVGAHVTKFLRLQMLSTSSDKACSTVSDKTNRFALLFGTLDGSIGCIAPLDE 1414
Query: 952 EQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKK 987
+ L+ LQ L + VGGLN +R F++ K
Sbjct: 1415 LTFRRLQSLQKKLGDAVPHVGGLNPRSFRQFHSNGK 1450
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 77/338 (22%), Positives = 144/338 (42%), Gaps = 60/338 (17%)
Query: 6 IYGRIATLELFRPHG----EAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSD---- 57
++G + ++ + G + +D + + + K VL++D + L T +M
Sbjct: 108 LHGNVESMAILSSRGGDGSKKRDSIILVFQEAKISVLEFDDSTHSLRTSSMHCFDGPQWL 167
Query: 58 --RIGRPT-DNGQIGIIDPDCRLIGLHLYDGLFKVI---------------PFDNKG--- 96
+ GR + G + +DP R G+ +Y GL +I F N G
Sbjct: 168 HLKRGRESFARGPVVKVDPQGRCGGVLVY-GLQMIILKASQAGSGLVVDDEAFGNTGAIS 226
Query: 97 -QLKEAFNIRLEELQVLDIK---FLYGCAKPTIVVLYQD------NKDARHVKTYEVALK 146
+++ ++ I L +L V +K F++G +P +V+L++ +H AL
Sbjct: 227 ARVESSYLINLRDLDVKHVKDFVFVHGYIEPVMVILHEQELTWAGRVSWKHHTCMVSALS 286
Query: 147 -DKDFVEGP--WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIRPSIT 203
+ P WS +NL + A L+ VP P+ GVL+I +I Y S +A + +
Sbjct: 287 ISTTLKQHPLIWSASNLPHDAYKLLAVPSPIGGVLVISANSIHYNSQSASCMLALNNYAV 346
Query: 204 KAYG-----------RVDADGSRYLLGDHA------GLLHLLVITHEKEKVTGLKIELLG 246
A +DA + +L+ D A G L LL + ++ V L +
Sbjct: 347 SADSSQDMPRSNFNVELDAANATWLVNDVALLSTKTGELLLLALVYDGRVVQRLDLSKSK 406
Query: 247 ETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKG 284
+ + S I+ + N++ ++GS GDS L++ + + G
Sbjct: 407 ASVLTSGIASIGNSLFFLGSRLGDSLLVQFSCGVGSSG 444
Score = 47.4 bits (111), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 32/112 (28%), Positives = 52/112 (46%), Gaps = 24/112 (21%)
Query: 289 VLERYVNLGPIVDFC----------VVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINE 338
V + +N+GP+ DF + +Q ++V CSG K+G+L I+R I
Sbjct: 556 VRDSLINIGPLKDFSYGLRINADPNATGIAKQSNYELVCCSGHGKNGALCILRQSIRPEM 615
Query: 339 QASVELQGIKGMWSL-----RSST---------DDPFDTFLVVSFISETRIL 376
VEL G KG+W++ R S DD + +L++S + T +L
Sbjct: 616 ITEVELPGCKGIWTVYHKNTRGSIADSSRMVPDDDEYHAYLIISLEARTMVL 667
>gi|241560031|ref|XP_002400960.1| spliceosomal protein sap, putative [Ixodes scapularis]
gi|215501812|gb|EEC11306.1| spliceosomal protein sap, putative [Ixodes scapularis]
Length = 1019
Score = 92.4 bits (228), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 82/329 (24%), Positives = 150/329 (45%), Gaps = 50/329 (15%)
Query: 729 NVYYCVGTAYVL-----PEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKL 783
N C G +L P EP G +L+L+ + A +L F G+L
Sbjct: 710 NPRVCRGGGLLLTYRLAPNPEEPMAGPT--------QLELVHATPVEEAPTALCPFQGRL 761
Query: 784 LAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALY-VQTRGDFIVVGDLMKSISLLIY 842
LA + + ++LY D G ++L +C + A+ +Q G+ +VV D+ +S L Y
Sbjct: 762 LAGVGKCLRLY-----DLGRKKLLRKCENKYIPNAIVSIQAMGNRVVVSDVQESFFFLRY 816
Query: 843 KHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATD--------- 893
K +E + A D W++A +LD + GA+ N+ +R S + D
Sbjct: 817 KRQENQLVIFADDSVPRWITASCMLDYETVAGADKFGNVSIIRLPSSISDDVDEDPTGIK 876
Query: 894 --EERGRL-------EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIG 944
+RG L +V+ +H+GE V + +L+ G ++++ T++G +G
Sbjct: 877 SLWDRGWLGGSSQKADVISNFHIGETVLSLQKATLI-------PGGSESLVYVTLSGTVG 929
Query: 945 VIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESF 1003
V+ HE + F + L+ ++R + G +H +RS KN +DGDL E F
Sbjct: 930 VLVPFTAHEDHDFFQHLEMHMRYENPPLCGRDHLSFRS-----SYFPVKNVIDGDLCEQF 984
Query: 1004 LDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
L ++ I++ ++ + E+ K++E++
Sbjct: 985 NSLDPSKQKSIAEELDRNPSEVSKKLEDI 1013
Score = 49.3 bits (116), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 28/87 (32%), Positives = 48/87 (55%), Gaps = 3/87 (3%)
Query: 333 GIGINEQASVELQG-IKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEG 391
G+ ++E A EL G +W+++ D+ FD +++VSF++ T +L++ + +EE G
Sbjct: 428 GLQVSEMAVSELPGNPNAVWTVKRKADEDFDAYIIVSFVNATLVLSIG--ETVEEVTDSG 485
Query: 392 FCSQTQTLFCHDAIYNQLVQVTSGSVR 418
F T TL C + LVQ + S R
Sbjct: 486 FLGTTPTLSCAQIGDDALVQALAMSSR 512
Score = 48.1 bits (113), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 67/326 (20%), Positives = 126/326 (38%), Gaps = 80/326 (24%)
Query: 216 YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIK 275
+L G + + + +++ VT +K++ +A+++ L +++ + +G+ L +
Sbjct: 111 FLAQTEQGDIFKITLETDEDMVTEIKLKYFDTIPVAASMCVLKTGFLFVAAEFGNHCLYQ 170
Query: 276 LN------------------------LQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQG 311
+ P A + + V E +L P + + DL +
Sbjct: 171 IARLGEEDEEPEFSSAIPLEEGDTFFFAPRALRNLLPV-EELDSLSPAMGCTIADLANED 229
Query: 312 QGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDPFDTFLVVSFIS 371
Q+ G LR++R+G+ + A V L ++ W + + P
Sbjct: 230 TPQLYVACGRGPRSCLRVLRHGLEASVHAEVLLCVLE--WLIDCDQNVP----------- 276
Query: 372 ETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEW 431
ILA++ QV +R + + R NEW
Sbjct: 277 ---ILAVHW------------------------------QVYPEGIRHIRADKR--VNEW 301
Query: 432 KSPPGYSVNVATANASQVLLATGGGHLVYLEIGD-GILTE-VKHAQLEYEISCLDIN--P 487
++P ++ N QV++A GG LVY E+ G L E ++ +I C+ + P
Sbjct: 302 RTPGKKAIVKCAVNQRQVVIALTGGELVYFEMDPTGQLNEYTDRREMSADIICMSLASVP 361
Query: 488 IGENPSYSQIAAVGMWTDISVRIFSL 513
+GE S+ AVG+ D +VRI SL
Sbjct: 362 VGEQ--RSRFLAVGL-ADNTVRIISL 384
>gi|350633238|gb|EHA21604.1| hypothetical protein ASPNIDRAFT_51242 [Aspergillus niger ATCC 1015]
Length = 1406
Score = 92.0 bits (227), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 98/432 (22%), Positives = 179/432 (41%), Gaps = 55/432 (12%)
Query: 654 IRSIPLGEHPRRICHQEQSRTFAI--CSLKNQSCAEESEMH------------------- 692
++ + LGE + + S + + C + E+ E+H
Sbjct: 971 LKRVHLGEQVDHLAYSTSSGMYVLGTCHATDFKLPEDDELHPEWRNEDCLAISFFPSARG 1030
Query: 693 -FVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVY-----YCVGTAYVLPEENEP 746
F++L+ T+ I ++ L EY +I + S N + VGTA+ E+ P
Sbjct: 1031 SFIKLVSPNTWSIIDSFSLGADEYVMAIKNISLEVSENTHERKDMIVVGTAFAR-GEDIP 1089
Query: 747 TKGRILVFIV-----------EDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYK 795
++G I VF V D KL+LI ++ KGAV +L+ G+ + Q +
Sbjct: 1090 SRGCIYVFEVVQVVPDPDHPETDRKLKLIGKEPVKGAVTALSEIGGQGFVLVAQGQKCMV 1149
Query: 796 WMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARD 855
L++DG+ + ++ + ++GD +K + Y E + A+D
Sbjct: 1150 RGLKEDGSLLPVAFMDMQCYVSVVKELKGTGMCILGDAVKGVWFAGYSEEPYKMSLFAKD 1209
Query: 856 YNANWMSAVEILDDDIYLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVN 912
+ + A E L D L A+++ N+ ++ + E RL ++H+G F +
Sbjct: 1210 LDYLEVCAAEFLPDGKRLFIVVADSDCNIHVLQYDPEDPKSSNGDRLLSRSKFHMGNFAS 1269
Query: 913 RFRHGSLVM----RLPDSDVG-----QIP--TVIFGTVNGVIGVIASLPHEQYLFLEKLQ 961
M ++ S G Q P V+ T NG +G+I +P E Y L LQ
Sbjct: 1270 TLTLLPRTMVSSEKMVSSSDGMDIDNQSPLHQVLMTTQNGSLGLITCIPEESYRRLSALQ 1329
Query: 962 TNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
+ L ++ GLN +R+ E + LDG+L+ ++D+S+ R EI+ +
Sbjct: 1330 SQLTNTLEHPCGLNPRAFRAV--ESDGTAGRGMLDGNLLFKWIDMSKQRKTEIAGRVGAR 1387
Query: 1022 VEELCKRVEELT 1033
E+ +E ++
Sbjct: 1388 EWEIKADLEAIS 1399
>gi|218188544|gb|EEC70971.1| hypothetical protein OsI_02599 [Oryza sativa Indica Group]
Length = 274
Score = 92.0 bits (227), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 58/168 (34%), Positives = 78/168 (46%), Gaps = 57/168 (33%)
Query: 845 EEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGE 904
+G +E RDYN WM+AVE+LDD +Y+GA+N +NLFT K GRL V+G+
Sbjct: 148 NKGQTDELNRDYNITWMTAVEMLDDYVYIGADNCYNLFTELK-------RRVGRLLVIGQ 200
Query: 905 YHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNL 964
YHLG ++LQ L
Sbjct: 201 YHLG-------------------------------------------------DQLQLVL 211
Query: 965 RKVIKGVGGLNHEQWRSFNNEKKTVD-AKNFLDGDLIESFLDLSRTRM 1011
IK VG L H QWR+F + ++V A+ F+D DLIESFL L ++M
Sbjct: 212 ADYIKSVGNLTHAQWRAFYTDGRSVGVAQGFVDNDLIESFLSLEPSKM 259
Score = 55.1 bits (131), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 31/79 (39%), Positives = 45/79 (56%), Gaps = 5/79 (6%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQ-----DFLFIATERYKFCVLQWDAESSELITRAMGDV 55
MLD P+YGRIAT+ELFRP A D ++ ++ + + E+ +I R+ D
Sbjct: 80 MLDAPVYGRIATIELFRPCVLASIKFSPDSRAKSSCKFVNAFVYLNQENRYVICRSGRDA 139
Query: 56 SDRIGRPTDNGQIGIIDPD 74
SD IGRPT+ GQ ++ D
Sbjct: 140 SDHIGRPTNKGQTDELNRD 158
>gi|302814354|ref|XP_002988861.1| hypothetical protein SELMODRAFT_184138 [Selaginella moellendorffii]
gi|300143432|gb|EFJ10123.1| hypothetical protein SELMODRAFT_184138 [Selaginella moellendorffii]
Length = 1413
Score = 91.7 bits (226), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 86/315 (27%), Positives = 145/315 (46%), Gaps = 24/315 (7%)
Query: 733 CVGTAYVLPEENEPTKGRI-LVFIVEDGK-----LQLIAEKETKGAVYSLNAFNGKLLAA 786
+GT Y L E+ ++GRI LV + ED + + KE KGA+ +L A G LL A
Sbjct: 1103 AIGTGY-LQGEDVASRGRIILVSLGEDPSDPKVWAKELYSKELKGAISALAALQGHLLLA 1161
Query: 787 INQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEE 846
I KI L+ W +G+ EL + + + +F++ GD KSI L +K E
Sbjct: 1162 IGPKIILHTW----NGS-ELIGTAFFDAPLYVVSLNIVKNFVLFGDFHKSIYFLCWKEEG 1216
Query: 847 GAIEERARDYNANWMSAVEILDDDIYLG-----AENNFNLFT-VRKNSEGATDEERGRLE 900
+ A+D+ + A E L D L + N +F+ KN+E + +L
Sbjct: 1217 AQLVLLAKDFGSLDCYATEFLIDGSTLSLLVSDSRKNIQVFSYAPKNAESWKGQ---KLL 1273
Query: 901 VVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKL 960
E+HLG V +F + ++ P S + FGT++G IG I L + L+ L
Sbjct: 1274 PRVEFHLGSHVTKFLRLQM-LQTPGSSRTNRFALCFGTLDGGIGYITPLDELTFRRLQTL 1332
Query: 961 QTNLRKVIKGVGGLNHEQWRSF--NNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTM 1018
Q L ++ V GLN + +R F N E N +D + + + LS + I++ +
Sbjct: 1333 QRKLVDLVPHVAGLNPKAYRQFQANGEHHKHGPDNTVDSEQLREYESLSLDKQVAIARQI 1392
Query: 1019 NVSVEELCKRVEELT 1033
+ +++ + +++
Sbjct: 1393 GTTRQQIFANLRDIS 1407
Score = 58.9 bits (141), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 124/589 (21%), Positives = 220/589 (37%), Gaps = 136/589 (23%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAM----GDVSDRIGR 61
I +A L R G +D + +A KF VL +D + +L T +M G + R
Sbjct: 114 IVDSMAILHRGRDGGRHRDAIILAFPAAKFSVLFFDDATQQLKTSSMHYFEGPEWIHLKR 173
Query: 62 PTDN---GQIGIIDPDCRLIGLHLYDGLFKVI--------------PFDN--KGQLKEAF 102
+ G + D R G+ +Y ++ P N +++ ++
Sbjct: 174 GREKFPGGPLVRADSQGRCAGVLIYKSQLVMMKAAQEAYGLVEEDDPSGNIVSARIESSY 233
Query: 103 NIRLEELQVLDIK---FLYGCAKPTIVVLYQD----------NKDARHVKTYEVALKDKD 149
+ L+EL ++ +K FLYG +P + +L++ +D V + K
Sbjct: 234 VVNLQELGMMHVKDFVFLYGYIEPVVAILHERELTWAGRVTFRRDTCCVTALSINTNTKK 293
Query: 150 FVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRV 209
W Q L A L+ VP P+ GVL++ +I+Y S + + + T G +
Sbjct: 294 HPR-LWFQTGLPYDAYSLLAVPSPIGGVLVLCANSILYYSQVSTCIVAVNELATPPAGSL 352
Query: 210 DADGSRY-----------------LLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIAS 252
+ S++ LL G+L L + + V L++ + ++S
Sbjct: 353 EMPRSKFSIELDAAHATWLSYDAALLSTKTGMLVHLHLIFDGRNVQRLELSKSKGSVLSS 412
Query: 253 TISYLDNAVVYIGSSYGDSQLI---------------------------KLNLQPDAKGS 285
++ + + ++GS GDS L+ ++ L DA
Sbjct: 413 SLCTIGDMFFFVGSRLGDSLLVQFGSASTSNSLSQSYDGEDDIMVRPSKRMRLDDDANEQ 472
Query: 286 YV------------------EVLERYVNLGPIVDFCVVDL---ERQGQGQ-VVTCSGAYK 323
+ V + N+GPI D E+ G Q ++ C G K
Sbjct: 473 SLYQYKSAVSDSQKNMNFLFSVRDSLCNIGPIRDITGRSQNPSEQPGSAQDLIACCGHGK 532
Query: 324 DGSLRIVRNGIGIN--EQAS---------------VELQGIKGMWSL--RS----STDDP 360
+GSL I+ I + QA+ V+L G G+W++ RS + D
Sbjct: 533 NGSLNIISRSIRPDFITQANMSLLFFAVAYALFFQVKLPGCVGVWTVYHRSGQIPAEKDE 592
Query: 361 FDTFLVVSFISETRILAM--NLEDELEETE--IEGFCSQTQTLFCHDAIYNQLVQVTSGS 416
+ +L++S S T +L L + + E EG LF ++ QV
Sbjct: 593 YHAYLIISLESRTMVLETGETLGEVTDSVEYYTEGPSISAGNLFGR----RRIAQVYQKG 648
Query: 417 VRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD 465
VR++ +R+ ++ PG ++ A+ VLL G L +GD
Sbjct: 649 VRILDG-ARQTQDLQVGEPGNAIESASFADPYVLLRMQDGS-CQLVVGD 695
>gi|302761560|ref|XP_002964202.1| hypothetical protein SELMODRAFT_82277 [Selaginella moellendorffii]
gi|300167931|gb|EFJ34535.1| hypothetical protein SELMODRAFT_82277 [Selaginella moellendorffii]
Length = 1413
Score = 91.7 bits (226), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 86/315 (27%), Positives = 145/315 (46%), Gaps = 24/315 (7%)
Query: 733 CVGTAYVLPEENEPTKGRI-LVFIVEDGK-----LQLIAEKETKGAVYSLNAFNGKLLAA 786
+GT Y L E+ ++GRI LV + ED + + KE KGA+ +L A G LL A
Sbjct: 1103 AIGTGY-LQGEDVASRGRIILVSLGEDPSDPKVWAKELYSKELKGAISALAALQGHLLLA 1161
Query: 787 INQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEE 846
I KI L+ W +G+ EL + + + +F++ GD KSI L +K E
Sbjct: 1162 IGPKIILHTW----NGS-ELIGTAFFDAPLYVVSLNIVKNFVLFGDFHKSIYFLCWKEEG 1216
Query: 847 GAIEERARDYNANWMSAVEILDDDIYLG-----AENNFNLFT-VRKNSEGATDEERGRLE 900
+ A+D+ + A E L D L + N +F+ KN+E + +L
Sbjct: 1217 AQLVLLAKDFGSLDCYATEFLIDGSTLSLLVSDSRKNIQVFSYAPKNAESWKGQ---KLL 1273
Query: 901 VVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKL 960
E+HLG V +F + ++ P S + FGT++G IG I L + L+ L
Sbjct: 1274 PRVEFHLGSHVTKFLRLQM-LQTPGSSRTNRFALCFGTLDGGIGYITPLDELTFRRLQTL 1332
Query: 961 QTNLRKVIKGVGGLNHEQWRSF--NNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTM 1018
Q L ++ V GLN + +R F N E N +D + + + LS + I++ +
Sbjct: 1333 QRKLVDLVPHVAGLNPKAYRQFQANGEHHKHGPDNTVDSEQLREYESLSLDKQVAIARQI 1392
Query: 1019 NVSVEELCKRVEELT 1033
+ +++ + +++
Sbjct: 1393 GTTRQQIFANLRDIS 1407
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 122/589 (20%), Positives = 222/589 (37%), Gaps = 136/589 (23%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGD------VSDRI 59
I +A L R G +D + +A KF VL +D + +L T +M + +
Sbjct: 114 IVDSMAILHRGRDGGRHRDAIILAFPAAKFSVLFFDDATQQLKTSSMHYFEGPEWIHLKR 173
Query: 60 GRPTDNGQ-----------IGIIDPDCRLIGLHLYDGLFKVIPFDN------KGQLKEAF 102
GR G G++ C+L+ + + ++ D+ +++ ++
Sbjct: 174 GREKFPGGPLVRADSQGRCAGVLIYKCQLVMMKAAQEAYGLVEEDDPSGNIVSARIESSY 233
Query: 103 NIRLEELQVLDIK---FLYGCAKPTIVVLYQD----------NKDARHVKTYEVALKDKD 149
+ L+EL ++ +K FLYG +P + +L++ +D V + K
Sbjct: 234 VVNLQELGMMHVKDFVFLYGYIEPVVAILHERELTWAGRVTFRRDTCCVTALSINTNTKK 293
Query: 150 FVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRV 209
W Q L A L+ VP P+ GVL++ +I+Y S + + + T G +
Sbjct: 294 HPR-LWFQTGLPYDAYSLLAVPSPIGGVLVLCANSILYYSQVSTCIVAVNELATPPAGSL 352
Query: 210 DADGSRY-----------------LLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIAS 252
+ S++ LL G+L L + + V L++ + ++S
Sbjct: 353 EMPRSKFSIELDAAHATWLSYDAALLSTKTGMLVHLHLIFDGRNVQRLELSKSKGSVLSS 412
Query: 253 TISYLDNAVVYIGSSYGDSQLI---------------------------KLNLQPDAKGS 285
++ + + ++GS GDS L+ ++ L DA
Sbjct: 413 SLCTIGDKFFFVGSRLGDSLLVQFGSASTSNSLEHSYDGEDDIMVRPSKRMRLDDDASEQ 472
Query: 286 YV------------------EVLERYVNLGPIVDFCVVDL---ERQGQGQ-VVTCSGAYK 323
+ V + N+GPI D E+ G Q ++ C G K
Sbjct: 473 SLYQYKSGVSDSQKNMNFLFSVRDSLCNIGPIRDITCRSQNPSEQPGSAQDLIACCGHGK 532
Query: 324 DGSLRIVRNGIGIN--EQAS---------------VELQGIKGMWSL--RS----STDDP 360
+GSL I+ I + QA+ V+L G G+W++ RS + D
Sbjct: 533 NGSLNIISRSIRPDFITQANMSLLFFAVAYALFFQVKLPGCVGVWTVYHRSGQIPAEKDE 592
Query: 361 FDTFLVVSFISETRILAM--NLEDELEETE--IEGFCSQTQTLFCHDAIYNQLVQVTSGS 416
+ +L++S S T +L L + + E EG LF ++ QV
Sbjct: 593 YHAYLIISLESRTMVLETGETLGEVTDSVEYYTEGPSISAGNLFGR----RRIAQVYQKG 648
Query: 417 VRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD 465
VR++ +R+ ++ PG ++ A+ VLL G L +GD
Sbjct: 649 VRILDG-ARQTQDLQVGEPGNAIESASFADPYVLLRMQDGS-CQLVVGD 695
>gi|255539681|ref|XP_002510905.1| cleavage and polyadenylation specificity factor cpsf, putative
[Ricinus communis]
gi|223550020|gb|EEF51507.1| cleavage and polyadenylation specificity factor cpsf, putative
[Ricinus communis]
Length = 1461
Score = 91.3 bits (225), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 104/404 (25%), Positives = 185/404 (45%), Gaps = 50/404 (12%)
Query: 664 RRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQT----FEFISTYPLDTFEYGCSI 719
+ + HQ ++ + L EE E VR+L+ + ++ +T P+ + E ++
Sbjct: 1068 QEVGHQIENHNLSSDELLQTYSVEEFE---VRILESENGGGPWQTKATIPMQSSENALTV 1124
Query: 720 ----LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIV----EDGKLQLIAE---KE 768
L + + ++ +GTAYV E+ +GR+L+F V E+ ++ L++E KE
Sbjct: 1125 RVVTLFNATTKENETLLAIGTAYV-QGEDVAARGRVLLFSVVKSTENSQV-LVSEVYSKE 1182
Query: 769 TKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRG--- 825
KGA+ +L + G LL A KI L+KW GT EL + LYV +
Sbjct: 1183 LKGAISALASLQGHLLIASGPKIILHKWT----GT-ELNGVAFYDAP--PLYVASMNIVK 1235
Query: 826 DFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG-----AENNFN 880
+FI++GD+ KSI L +K + + A+D+ + A E L D L + N
Sbjct: 1236 NFILLGDIHKSIYFLSWKEQGAQLSLLAKDFGSLDCFATEFLIDGSTLSLVVSDEQKNIQ 1295
Query: 881 LFTVRKNSEGATDEERG-RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIP------- 932
+F + + +G +L E+H+G + +F S++ D G P
Sbjct: 1296 IFYY---APKMLESWKGQKLLSRAEFHVGAHITKFIRLSMLSTSSDRS-GAAPGPDKTNR 1351
Query: 933 -TVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKT--V 989
++FGT++G IG IA L + L+ LQ L + V GLN +R F ++ K
Sbjct: 1352 FALLFGTLDGSIGCIAPLDELTFRRLQSLQRKLVDAVPHVAGLNPRSFRQFRSDGKVHRP 1411
Query: 990 DAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELT 1033
++ +D +L+ F L EI++ + + ++ + +L+
Sbjct: 1412 GPESIVDCELLSHFEMLPLEEQLEIAQQVGTTRAQILSNLNDLS 1455
Score = 57.0 bits (136), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 73/309 (23%), Positives = 130/309 (42%), Gaps = 54/309 (17%)
Query: 23 QDFLFIATERYKFCVLQWDAESSELITRAMGDVSD------RIGRPT-DNGQIGIIDPDC 75
+D + +A + K VL++D L T +M + GR + G + +DP
Sbjct: 131 RDSIILAFKDAKISVLEFDDSIHGLRTSSMHCFEGPEWLHLKRGRESFARGPLLKVDPQG 190
Query: 76 RLIGLHLYDGLFKVIPFDN------------------KGQLKEAFNIRLEEL---QVLDI 114
R G+ +YD ++ +++ ++ I L ++ V D
Sbjct: 191 RCGGILVYDMQMIILRAAQASSGLVGDDDALSSGGSISARVQSSYVINLRDMDMKHVKDF 250
Query: 115 KFLYGCAKPTIVVLYQ------DNKDARHVKTYEVALKDKDFVEGP---WSQNNLDNGAD 165
FL+ +P +V+L++ +H AL ++ P WS NL + A
Sbjct: 251 IFLHDYIEPVVVILHERELTWAGRVSWKHHTCMISALSISTTLKQPTLIWSVVNLPHDAY 310
Query: 166 LLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIR---------PSITKAYGRVDADGSR- 215
L+ VPPP+ GVL+I TI Y S +A A+ + + +A V+ D +
Sbjct: 311 KLLAVPPPIGGVLVICANTIHYHSESATYALALNNYAVSIDSSQELPRASFSVELDAVKA 370
Query: 216 -YLLGDHA------GLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSY 268
+LL D A G L LL + ++ V L + + + S I+ + N++ ++GS
Sbjct: 371 AWLLNDVALLSAKNGELLLLSLVYDGRVVQRLDLSKSKASVLTSDITTIGNSLFFLGSRL 430
Query: 269 GDSQLIKLN 277
GDS L++
Sbjct: 431 GDSLLVQFT 439
Score = 45.1 bits (105), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 36/148 (24%), Positives = 63/148 (42%), Gaps = 27/148 (18%)
Query: 277 NLQPDAKGSYVEVLERYVNLGPIVDFC----------VVDLERQGQGQVVTCSGAYKDGS 326
N + K V + +N+GP+ DF + +Q +V CSG K+G+
Sbjct: 494 NTESAQKSFSFAVRDSLINVGPLKDFSYGLRSNYDASATGIAKQSNYDLVCCSGHGKNGT 553
Query: 327 LRIVRNGIGINEQASVELQGIKGMWSL--------------RSSTDDPFDTFLVVSFISE 372
L I+R I V+L G +G+W++ ++ D + +L++S E
Sbjct: 554 LCILRQSIRPEMITEVDLPGCRGIWTVYHKNARGHNVDLSKMAAAADEYHAYLIISM--E 611
Query: 373 TRILAMNLEDELEE-TEIEGFCSQTQTL 399
R + + D L E TE + Q +T+
Sbjct: 612 ARTMVLETADLLSEVTESVDYFVQGRTI 639
>gi|330799483|ref|XP_003287774.1| hypothetical protein DICPUDRAFT_32967 [Dictyostelium purpureum]
gi|325082229|gb|EGC35718.1| hypothetical protein DICPUDRAFT_32967 [Dictyostelium purpureum]
Length = 1453
Score = 91.3 bits (225), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 108/496 (21%), Positives = 210/496 (42%), Gaps = 64/496 (12%)
Query: 584 SSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLK-EVSHMCPFNSAAFPDSLAIAKEGE-- 640
SSKN VF +P I++ + +++ + + PF++A P+ E +
Sbjct: 954 SSKNNG-VFIGGQKPVWIFNEKGYIRLHSMDFDGAIVSLKPFHNADCPNGFLYYTEDKQH 1012
Query: 641 LTIGTIDDIQKLH----IRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRL 696
+ IG ++ + IR +P+ +I + + + + + Q E E +
Sbjct: 1013 IKIGYLNGLMNFENEYAIRRVPIKLSAHKIAYHNELKCYVVVVSFPQVTQELEEDSKKPI 1072
Query: 697 LDDQTFE-------------FISTYPLDTFE--YGCSILSCSFSDD-----SNVYYCVGT 736
L D+ F+ FI ++ L E I+S F + S + +GT
Sbjct: 1073 LTDEKFQIKIIDPTIDWSWRFIDSFSLQDRETVLAMKIVSLKFKESDETIKSKPFLVIGT 1132
Query: 737 AYVLPEENEPTKGRILVFIVEDGKLQ------------LIAEKETKGAVYSLNAFNGKLL 784
A+ E+ + KGR+LVF + K Q L+ EKE KG V +L++ +G LL
Sbjct: 1133 AFTFGEDTQ-CKGRVLVFEIVSHKTQFESDDLGTKRLNLLYEKEQKGPVTALSSVSGLLL 1191
Query: 785 AAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKH 844
I K+ + +++ T +L + H I + T +IV+GD+ KS+ L +
Sbjct: 1192 MTIGPKLTVNQFL-----TGQLVTLSFHDAQIYICSISTIKTYIVIGDMYKSVYFLQWNG 1246
Query: 845 EEGAIEERARDYNANWMSAVEILDDDIYLG---AENNFNLFTVRKNSEGATDEERGRLEV 901
++ + ++DY + + + E + + L ++ + N+ + T + L
Sbjct: 1247 KQ--LVPLSKDYQSLNIFSTEFIVNQQTLSILVSDLDKNILLFSFDPADPTSRQGQMLLC 1304
Query: 902 VGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPT---VIFGTVNGVIGVIASLPHEQYLFLE 958
++H+G + +F + + S G V FGT++G + V+ L Y
Sbjct: 1305 KADFHIGSNIEKFVRTPMKFNIQSSSNGNNNNDQLVFFGTLDGSLNVLRPLDERMYQLFY 1364
Query: 959 KLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVD---------AKNFLDGDLIESFLDLSRT 1009
LQ+ L + GLN +Q+R+F + + K LDGDL+ F+ L++
Sbjct: 1365 HLQSKLY-YLPQPAGLNAKQYRAFKSFSQNFHFSPSTIHQLPKYILDGDLLSKFVKLNQK 1423
Query: 1010 RMDEISKTMNVSVEEL 1025
++ ++ + +E+
Sbjct: 1424 ERRLLASSVGSNTDEI 1439
Score = 41.2 bits (95), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 63/337 (18%), Positives = 125/337 (37%), Gaps = 69/337 (20%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
++++ ++G I ++ R GE +D L + K VL ++ + + R++ +
Sbjct: 72 LIELKLFGTIESMASVRYPGENKDSLLLTFRDAKISVLDYNIDIMDFEIRSLHFYENDEF 131
Query: 61 RPTDNGQIGI-------IDPDCRLIGLHLYDGLFKVIPFDNKGQL--------------- 98
+ NG+I ID R + LYD V+PF +
Sbjct: 132 K---NGRIHFKHPPILKIDTQQRCATMLLYDRNIVVLPFKQISSILDDEDEEEKDEEDEK 188
Query: 99 --KEAFNIRLEELQVL----DIKFLYGCAKPTIVVLYQDNKDARHVKTYEVALK------ 146
A EE + FLYG +PTI+ L++ T +A+K
Sbjct: 189 ENDNANQDYTEEFDDDDDDNNFCFLYGYYEPTILFLHE----PSQTWTSRIAVKRLTSQL 244
Query: 147 -------DKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYC--------SAN 191
W +N+ D L+ VP PL G L+I + + + N
Sbjct: 245 TAISINFSTKLASIIWHTSNMPYNCDQLVSVPEPLSGALVITPNIMFHVNQTSKYGLAVN 304
Query: 192 AFKAI--------PIRPSITKAY-----GRVDADGSRYLLGDHAGLLHLLVITHEKEKVT 238
+ I P+ ++ + V + +++ G L + + + V
Sbjct: 305 EYANIDIGDKFEFPLDETLNLVFTLDRSNFVFLEADKFIGSLKGGELLIFHLISDGRTVQ 364
Query: 239 GLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIK 275
+ + G + +A+ + + + ++++GS GDS L++
Sbjct: 365 RIHVSKAGGSVLATCMCVVSDNLLFLGSRLGDSLLLQ 401
>gi|67472417|ref|XP_652013.1| splicing factor 3B subunit 3 [Entamoeba histolytica HM-1:IMSS]
gi|56468814|gb|EAL46625.1| splicing factor 3B subunit 3, putative [Entamoeba histolytica
HM-1:IMSS]
gi|449709507|gb|EMD48761.1| premRNA-splicing factor rse-1, putative [Entamoeba histolytica KU27]
Length = 1145
Score = 91.3 bits (225), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 174/883 (19%), Positives = 343/883 (38%), Gaps = 109/883 (12%)
Query: 217 LLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQL--- 273
L + L L T EK ++ + ++ A ++ + YGDS L
Sbjct: 299 LQSEFGDLFQLSFDTKEKGQLDNIHLKYFDSIPTALSLQFSKRGHFLCVGEYGDSILYNI 358
Query: 274 -----IKLNLQPDAKGSY--------VEVLERYVNLGPIVDFCVVD-LERQGQGQVVTCS 319
I + + D + + +E + R+ +L P++D V Q ++
Sbjct: 359 ISMENISIPFEKDGRLEFERHKEILNLEEIYRFKSLAPLIDLKVAPATSSQDTTKMYAFC 418
Query: 320 GAYKDGSLRIVRNGIGINEQASVELQGIK-GMWSLRSSTDDPFDTFLVVSFISETRILAM 378
G +++I++N + + +EL I +W L+ TD+ + +L +S+ + T +L +
Sbjct: 419 GKGNQSTIKILKNQLNTLDAVEIELPAIPISIWPLKKETDE-YHQYLAISYSNITTLLKI 477
Query: 379 NLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYS 438
EDE+ E T +L +QV + + + S ++ E +
Sbjct: 478 T-EDEMSECTTSPILLSTPSLLVSMLFDGTFLQVMTDRIIIYSEPIQQFITEDQK----- 531
Query: 439 VNVATANASQVLLATGGGH---LVYLEIGDGILTEVKHAQLEYEISCLDINPIGENPSYS 495
A+ N +++++ + L+Y + G L ++ +I+ L ++ +PS
Sbjct: 532 YVCASCNGCELIVSVEKNNQTSLIYFQYQSGHLLTMERKDNLSKITALALDQF--HPS-- 587
Query: 496 QIAAVGMWTDISVRIFSL-PD-----LNLITKEHLGGEIIPRSVLLCAFEGISYLLCALG 549
+ A+G D SV + SL P+ L+ ++ + I S+ + SYL L
Sbjct: 588 KHCAIGC-IDGSVHLLSLIPNETTKALSRVSLQTYECSI--NSLTFNIIDNTSYLFAGLS 644
Query: 550 DGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLL 609
+G L + + +GE+ + +G++P+TL V A S R + Y + K+
Sbjct: 645 NGLLGRSVYDPISGEINESSLNFVGSRPVTLSNVKDCGEDSVLAISGRSLLSYKNGTKIK 704
Query: 610 YSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDI-QKLHIRSIPLGEHPRRICH 668
+ +N++ + C +++AI E + I TI+ L ++I + PR++
Sbjct: 705 TTPLNIQNTTLACGILVPFVDNAIAIICEKVMKIITIESTSSSLTGKNIQISYTPRKVIT 764
Query: 669 QEQSRTFAICSLKNQSCA------EESEMHFV---RLLDDQTFEFISTYPLDTFEY---G 716
I N SC + +E ++V LD E I D ++ G
Sbjct: 765 HPTIPLLYILEGDNNSCKVGNEIIQTNEGNWVGGIHTLDASQDELIQFIDFDNNKHPTGG 824
Query: 717 CSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGR---ILVFIVEDGKLQLIAEKETKGAV 773
C + S S + Y +G + + P + + I V+ + + + + + V
Sbjct: 825 CVVRSIS---KNQTYLIIGV--IESYKTRPIQWKSSEIQVYSINERSINYCYSTKVEYPV 879
Query: 774 YSLNAFNGKLLAAINQKIQLY----KWMLRDDGTRELQSECGHHGHILALYVQTRGDFIV 829
+ F G +LA + ++LY K +LR R+ S+ + G+ I+
Sbjct: 880 RAFAEFKGMVLAGVGNILRLYDIGLKSLLRKAEKRQFASDIAQ--------LHVIGETIL 931
Query: 830 VGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGA-------------- 875
+ + +L+ Y + A D W+ L+ L +
Sbjct: 932 LTGVSDGFNLIRYNQINHKFDIYA-DSLPRWVVTAAPLNQSTVLASDKFGEIFMYQLPKE 990
Query: 876 --ENNFNLF-TVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIP 932
E N F T+ + + + ++ ++ +G+ F SL+ P
Sbjct: 991 IEEQALNPFSTLLQPHKTIYEGSSYKMVTATQFFVGDIATSFAQCSLIPGAPS------- 1043
Query: 933 TVIFGTVNGVIGVIASLP---HEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTV 989
IF N + G+ A +P F + L+ ++R + NH +RS V
Sbjct: 1044 --IFLYSNFMGGLSALIPLQSQNDIDFYQHLEMHMRVHWTNLTDRNHISFRS-----SIV 1096
Query: 990 DAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
K+ +DGDL E + L EI++ M V E+ K++ +L
Sbjct: 1097 PVKDTVDGDLCELYERLPYEIQQEIAEEMEKEVNEIIKKLHDL 1139
>gi|6671952|gb|AAF23212.1|AC016795_25 hypothetical protein [Arabidopsis thaliana]
gi|10998135|dbj|BAB03106.1| unnamed protein product [Arabidopsis thaliana]
Length = 1331
Score = 91.3 bits (225), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 57/209 (27%), Positives = 106/209 (50%), Gaps = 20/209 (9%)
Query: 295 NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINE--QASVELQGIKGMWS 352
N+ PI+DF V+D + + + Q+ C G +GSLRI+R+GI + + + + QGI G W+
Sbjct: 440 NIAPILDFSVMDDQNEKRDQIFACCGVTPEGSLRIIRSGINVEKLLKTAPVYQGITGTWT 499
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
++ D + +FLV+SF+ ETR+L++ L + + T+ GF S T C LVQ+
Sbjct: 500 VKMKLTDVYHSFLVLSFVEETRVLSVGLSFK-DVTDSVGFQSDVCTFACGLVADGLLVQI 558
Query: 413 TSGSVRLVSST----------SRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLE 462
++RL T S + W P S+++ + ++++T + +
Sbjct: 559 HQDAIRLCMPTMDAHSDGIPVSSPFFSSW-FPENVSISLGAVGQNLIVVSTSNPCFLSIL 617
Query: 463 IGDGI------LTEVKHAQLEYEISCLDI 485
+ + E++ L+YE+SC+ +
Sbjct: 618 GVKSVSSQCCEIYEIQRVTLQYEVSCISV 646
Score = 52.8 bits (125), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 36/126 (28%), Positives = 60/126 (47%), Gaps = 16/126 (12%)
Query: 905 YHLGEFVNRFRHGSLVMRLPDSDVGQI-----------PTVIFGTVNGVIGVIASLPHEQ 953
Y++GE + G + +LP DV + T+I GT+ G I V A + E+
Sbjct: 1172 YYMGEIAMSIKKGCNIYKLPADDVLRSYGLSKSIDTADDTIIAGTLLGSIFVFAPISSEE 1231
Query: 954 YLFLEKLQTNL--RKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRM 1011
Y LE +Q L + V G +H ++R N + A+ LDGD++ FL+L+ +
Sbjct: 1232 YELLEGVQAKLGIHPLTAPVLGNDHNEFRGRENPSQ---ARKILDGDMLAQFLELTNRQQ 1288
Query: 1012 DEISKT 1017
+ + T
Sbjct: 1289 ESVLST 1294
>gi|30681985|ref|NP_850565.1| Cleavage and polyadenylation specificity factor (CPSF) A subunit
protein [Arabidopsis thaliana]
gi|332641609|gb|AEE75130.1| Cleavage and polyadenylation specificity factor (CPSF) A subunit
protein [Arabidopsis thaliana]
Length = 1329
Score = 91.3 bits (225), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 57/209 (27%), Positives = 106/209 (50%), Gaps = 20/209 (9%)
Query: 295 NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINE--QASVELQGIKGMWS 352
N+ PI+DF V+D + + + Q+ C G +GSLRI+R+GI + + + + QGI G W+
Sbjct: 410 NIAPILDFSVMDDQNEKRDQIFACCGVTPEGSLRIIRSGINVEKLLKTAPVYQGITGTWT 469
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
++ D + +FLV+SF+ ETR+L++ L + + T+ GF S T C LVQ+
Sbjct: 470 VKMKLTDVYHSFLVLSFVEETRVLSVGLSFK-DVTDSVGFQSDVCTFACGLVADGLLVQI 528
Query: 413 TSGSVRLVSST----------SRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLE 462
++RL T S + W P S+++ + ++++T + +
Sbjct: 529 HQDAIRLCMPTMDAHSDGIPVSSPFFSSW-FPENVSISLGAVGQNLIVVSTSNPCFLSIL 587
Query: 463 IGDGI------LTEVKHAQLEYEISCLDI 485
+ + E++ L+YE+SC+ +
Sbjct: 588 GVKSVSSQCCEIYEIQRVTLQYEVSCISV 616
Score = 52.8 bits (125), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 36/126 (28%), Positives = 60/126 (47%), Gaps = 16/126 (12%)
Query: 905 YHLGEFVNRFRHGSLVMRLPDSDVGQI-----------PTVIFGTVNGVIGVIASLPHEQ 953
Y++GE + G + +LP DV + T+I GT+ G I V A + E+
Sbjct: 1170 YYMGEIAMSIKKGCNIYKLPADDVLRSYGLSKSIDTADDTIIAGTLLGSIFVFAPISSEE 1229
Query: 954 YLFLEKLQTNL--RKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRM 1011
Y LE +Q L + V G +H ++R N + A+ LDGD++ FL+L+ +
Sbjct: 1230 YELLEGVQAKLGIHPLTAPVLGNDHNEFRGRENPSQ---ARKILDGDMLAQFLELTNRQQ 1286
Query: 1012 DEISKT 1017
+ + T
Sbjct: 1287 ESVLST 1292
>gi|42564075|ref|NP_187802.2| Cleavage and polyadenylation specificity factor (CPSF) A subunit
protein [Arabidopsis thaliana]
gi|29824376|gb|AAP04148.1| unknown protein [Arabidopsis thaliana]
gi|110739103|dbj|BAF01468.1| hypothetical protein [Arabidopsis thaliana]
gi|332641608|gb|AEE75129.1| Cleavage and polyadenylation specificity factor (CPSF) A subunit
protein [Arabidopsis thaliana]
Length = 1379
Score = 91.3 bits (225), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 57/209 (27%), Positives = 106/209 (50%), Gaps = 20/209 (9%)
Query: 295 NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINE--QASVELQGIKGMWS 352
N+ PI+DF V+D + + + Q+ C G +GSLRI+R+GI + + + + QGI G W+
Sbjct: 468 NIAPILDFSVMDDQNEKRDQIFACCGVTPEGSLRIIRSGINVEKLLKTAPVYQGITGTWT 527
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
++ D + +FLV+SF+ ETR+L++ L + + T+ GF S T C LVQ+
Sbjct: 528 VKMKLTDVYHSFLVLSFVEETRVLSVGLSFK-DVTDSVGFQSDVCTFACGLVADGLLVQI 586
Query: 413 TSGSVRLVSST----------SRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLE 462
++RL T S + W P S+++ + ++++T + +
Sbjct: 587 HQDAIRLCMPTMDAHSDGIPVSSPFFSSW-FPENVSISLGAVGQNLIVVSTSNPCFLSIL 645
Query: 463 IGDGI------LTEVKHAQLEYEISCLDI 485
+ + E++ L+YE+SC+ +
Sbjct: 646 GVKSVSSQCCEIYEIQRVTLQYEVSCISV 674
Score = 52.4 bits (124), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 36/126 (28%), Positives = 60/126 (47%), Gaps = 16/126 (12%)
Query: 905 YHLGEFVNRFRHGSLVMRLPDSDVGQI-----------PTVIFGTVNGVIGVIASLPHEQ 953
Y++GE + G + +LP DV + T+I GT+ G I V A + E+
Sbjct: 1220 YYMGEIAMSIKKGCNIYKLPADDVLRSYGLSKSIDTADDTIIAGTLLGSIFVFAPISSEE 1279
Query: 954 YLFLEKLQTNL--RKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRM 1011
Y LE +Q L + V G +H ++R N + A+ LDGD++ FL+L+ +
Sbjct: 1280 YELLEGVQAKLGIHPLTAPVLGNDHNEFRGRENPSQ---ARKILDGDMLAQFLELTNRQQ 1336
Query: 1012 DEISKT 1017
+ + T
Sbjct: 1337 ESVLST 1342
>gi|168021793|ref|XP_001763425.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162685218|gb|EDQ71614.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 1452
Score = 90.9 bits (224), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 84/322 (26%), Positives = 144/322 (44%), Gaps = 22/322 (6%)
Query: 725 SDDSNVYYCVGTAYVLPEENEPTKGRI-LVFIVEDGK-----LQLIAEKETKGAVYSLNA 778
+D + +GT+YV E+ KGRI LV + +D + + + KE KG++ ++ +
Sbjct: 1134 TDQTQTLLAIGTSYV-QGEDVAAKGRIILVSVGKDPQDPGSWAREVYSKELKGSISAIAS 1192
Query: 779 FNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSIS 838
G LL AI KI L+ W +G+ EL + + + +FI+ GD+ KSI
Sbjct: 1193 LQGHLLIAIGPKIILHSW----NGS-ELNGAAFFDAPLYVVSLNIVKNFILFGDIHKSIY 1247
Query: 839 LLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG-----AENNFNLFTVRKNSEGATD 893
L +K + + A+D+ + A E L D L + N +F+ S +
Sbjct: 1248 FLCWKEDGAQLTLLAKDFGSLDCYATEFLIDGSTLSLLVSDSRKNLQIFSYAPKSMESWK 1307
Query: 894 EERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQ 953
++ L E+HLG VN+F H ++ P S V+FGT++G I +A L
Sbjct: 1308 GQK--LLSRAEFHLGAHVNKF-HRLQMLPTPGSARSNRYAVLFGTLDGAIDYLAPLDELT 1364
Query: 954 YLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKT--VDAKNFLDGDLIESFLDLSRTRM 1011
+ L LQ L + V G+N +R F + K N +D +L+ + L
Sbjct: 1365 FRRLHTLQRKLVDCVSHVAGVNPRAFRQFRCDGKAHRPGPDNIVDCELLSHYDMLPLDEQ 1424
Query: 1012 DEISKTMNVSVEELCKRVEELT 1033
EI++ + + + + +L
Sbjct: 1425 LEIARQIGTTRAHVLSNLRDLA 1446
Score = 53.1 bits (126), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 86/404 (21%), Positives = 147/404 (36%), Gaps = 107/404 (26%)
Query: 122 KPTIVVLYQDNKD-------ARHVKTYEVALKDKDFVEGP--WSQNNLDNGADLLIPVPP 172
+P +VVL++ + RH + + P WS NL A L+ VP
Sbjct: 273 EPVLVVLHEKDPTWAGRVAVRRHTCAITALSINTTLKQHPHIWSATNLPYDAYKLLAVPA 332
Query: 173 PLCGVLIIGEETIVYCSANAFKAI-----PIRPSITKAYGR----VDADGSRY------- 216
P+ GVL+ ++ Y S + A+ + P + Y R V+ D +
Sbjct: 333 PIGGVLVFCANSLHYHSQSGSCALGLNEFAVAPEGSAEYPRSKMSVELDCAHATWVANEV 392
Query: 217 -LLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIK 275
L+ G+L L + +E V L++ + + S + + ++GS DS L++
Sbjct: 393 ALISTKNGMLLFLNLVYEGRSVQRLELTKSKASVLTSCMCTIGENFFFLGSRLADSLLVQ 452
Query: 276 LNL-------------------QPDAKGSYVE---------------------------- 288
L P AK E
Sbjct: 453 HTLGSASGRTSSLMGDIEADLSAPAAKRLKREPSEEEEGVSAEEMSLYYSTPTASDISQK 512
Query: 289 ------VLERYVNLGPIVDFC----------VVDLERQGQGQVVTCSGAYKDGSLRIVRN 332
V + VN+ P+ DF L +Q ++V CSG K+GSL ++
Sbjct: 513 KTFTFTVRDSLVNICPLRDFAYGLRSNADQSATGLGKQSNYELVACSGHGKNGSLSVLHQ 572
Query: 333 GIGINEQASVELQGIKGMWSLR--------------SSTDDPFDTFLVVSFISETRILAM 378
I + V L G G+W++ +S DD F +L++S E+R + +
Sbjct: 573 SIRPDLINKVALPGCSGIWTVYHKTDRDDSNEFDFGTSEDDEFHAYLIISL--ESRTMVL 630
Query: 379 NLEDELEE-TEIEGFCSQTQTLFCHDAIYNQ-LVQVTSGSVRLV 420
D L E TE + ++ T+ + + +VQV +RL+
Sbjct: 631 ETGDTLGEVTENVEYYTEGNTIAAGNLFGRRFVVQVYQNGLRLL 674
>gi|407044064|gb|EKE42344.1| CPSF A subunit region protein, putative [Entamoeba nuttalli P19]
Length = 1145
Score = 90.9 bits (224), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 160/797 (20%), Positives = 315/797 (39%), Gaps = 93/797 (11%)
Query: 287 VEVLERYVNLGPIVDFCVVD-LERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQ 345
+E + R+ +L P++D V Q ++ G +++I++N + + +EL
Sbjct: 385 LEEIYRFKSLAPLIDLKVAPATSSQDTTKMYAFCGKGNQSTIKILKNQLNTLDAVEIELP 444
Query: 346 GIK-GMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDA 404
I +W L+ TD+ + +L +S+ + T +L + EDE+ E T +L
Sbjct: 445 AIPISIWPLKKETDE-YHQYLAISYSNITTLLKIT-EDEMSECTTSPILLSTPSLLVSML 502
Query: 405 IYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGH---LVYL 461
+QV + + + S ++ E + A+ N S+++++ + L+Y
Sbjct: 503 FDGTFLQVMTDRIIIYSEPIQQFITEDQK-----YVCASCNGSELIVSVEKNNQTSLIYF 557
Query: 462 EIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PD----- 515
+ G L ++ +I+ L ++ +PS + A+G D SV + SL P+
Sbjct: 558 QYQSGHLLTMERKDNLSKITALALDQF--HPS--KHCAIGC-IDGSVHLLSLIPNETTKA 612
Query: 516 LNLITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGT 575
L+ ++ + I S+ + SYL L +G L + + +GE+ + +G+
Sbjct: 613 LSRVSLQTYECSI--NSLTFNIIDNTSYLFAGLSNGLLGRSVYDPISGEINESSLNFVGS 670
Query: 576 QPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAI 635
+P+TL V A S R + Y + K+ + +N++ + C +++AI
Sbjct: 671 RPVTLSNVKDCGEDSVLAISGRSLLSYKNGTKIKTTPLNIQNTTLACGILVPFVDNAIAI 730
Query: 636 AKEGELTIGTIDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEM--- 691
E + I TI+ L ++I + PR++ I N SC +E+
Sbjct: 731 ICEKVMKIITIESTSSSLTGKNIQISYTPRKVITHPTIPLLYILEGDNNSCKVGNEIIQT 790
Query: 692 ------HFVRLLDDQTFEFISTYPLDTFEY---GCSILSCSFSDDSNVYYCVGTAYVLPE 742
+ LD E I D ++ GC + S S + Y VG +
Sbjct: 791 NEGNWVGGIHTLDASQDELIQFIDFDNNKHPTGGCVVRSIS---KNQTYLIVGV--IESY 845
Query: 743 ENEPTKGR---ILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLY----K 795
+ P + + I V+ + + + + + V + F G +LA + ++LY K
Sbjct: 846 KTRPIQWKSSEIQVYSINERSINYCYSTKVEYPVRAFAEFKGMVLAGVGNILRLYDIGLK 905
Query: 796 WMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARD 855
+L+ R+ S+ + G+ I++ + +L+ Y + A D
Sbjct: 906 SLLKKAEKRQFASDIAQ--------LHVIGETILLTGVSDGFNLIRYNQINHKFDIYA-D 956
Query: 856 YNANWMSAVEILDDDIYLGA----------------ENNFNLF-TVRKNSEGATDEERGR 898
W+ L+ L + E N F T+ + + + +
Sbjct: 957 SLPRWVVTAAPLNQSTVLASDKFGEIFMYQLPKEIEEQALNPFSTLLQPHKTIYEGSSYK 1016
Query: 899 LEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLP---HEQYL 955
+ ++ +G+ F SL+ P IF N + G+ A +P
Sbjct: 1017 MVTATQFFVGDIATSFAQCSLIPGAPS---------IFLYSNFMGGLSALIPLQSQNDID 1067
Query: 956 FLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEIS 1015
F + L+ ++R + NH +RS V K+ +DGDL E + L EI+
Sbjct: 1068 FYQHLEMHMRVHWTNLTDRNHISFRS-----SIVPVKDTVDGDLCELYERLPYEIQQEIA 1122
Query: 1016 KTMNVSVEELCKRVEEL 1032
+ M V E+ K++ +L
Sbjct: 1123 EEMEKEVNEIIKKLHDL 1139
>gi|308459872|ref|XP_003092248.1| CRE-CPSF-1 protein [Caenorhabditis remanei]
gi|308253976|gb|EFO97928.1| CRE-CPSF-1 protein [Caenorhabditis remanei]
Length = 1448
Score = 90.9 bits (224), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 84/312 (26%), Positives = 146/312 (46%), Gaps = 29/312 (9%)
Query: 739 VLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWML 798
V+PE ++PT R K++++ +KE KG V L A NG LL+ + QK+ + W
Sbjct: 1147 VVPEPDQPTSNR---------KIKVLFDKEQKGPVTGLCAINGLLLSGMGQKV--FIWQF 1195
Query: 799 RDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARD--Y 856
+D+ L S H ++ L+ + + D +S+SL+ ++ E A+ +RD
Sbjct: 1196 KDNDLMGL-SFLDMHYYVYQLH--SLRTIALACDARESMSLIRFQEENKAMSIASRDDRR 1252
Query: 857 NANWMSAVEILDDDIYLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
A A + + D +LG ++ N N+ + E RL V ++G VN
Sbjct: 1253 TAKPPMAAQFVVDGAHLGFLLSDENGNITLFNYSPEAPESNGGERLTVRAAMNIGTNVNA 1312
Query: 914 FR----HGSLVMRLPDSD---VGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRK 966
F H SL+ D + V Q + IFG+++G G + L + Y L LQT +
Sbjct: 1313 FLRVKGHTSLLNLQSDEEKESVEQRMSTIFGSLDGSFGFVRPLSEKSYRRLHFLQTFIGS 1372
Query: 967 VIKGVGGLNHEQWRSFNNEKKTV---DAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVE 1023
V + GL+ + RS + V +A+N +DGD++E +L LS ++++ + V
Sbjct: 1373 VTPQIAGLHIKGARSARPAQPIVNGRNARNLIDGDVVEQYLHLSLYDKTDLARRLGVGRY 1432
Query: 1024 ELCKRVEELTRL 1035
+ + L R+
Sbjct: 1433 HIIDDLMHLRRM 1444
Score = 70.5 bits (171), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 119/527 (22%), Positives = 207/527 (39%), Gaps = 103/527 (19%)
Query: 72 DPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEEL-----QVLDIKFLYGCAKPTIV 126
DP R +Y ++PF + ++ I L+++ V D+ FL G +PTI+
Sbjct: 145 DPSNRCAASLVYGKHIAILPFHENSKRILSYIIPLKQIDPRLDNVADMVFLDGYYEPTIL 204
Query: 127 VLYQD----------NKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCG 176
LY+ D + V + D+ F W NL L+P+P PL G
Sbjct: 205 FLYEPLQTTPGRACVRYDTMCIMGVSVNIVDRQFAV-VWQTANLPMDCTSLLPIPKPLGG 263
Query: 177 VLIIGEETIVYCSA-------------NAFKAIPIRPSITKAYGRVDADGSRYL------ 217
L+ G TIVY + + F P++ + +D S Y+
Sbjct: 264 ALVFGSNTIVYLNQAVPPCGVVLNSCYDGFTKFPLK-DMKHLKMTLDCATSVYMEDGRIA 322
Query: 218 LGDHAGLLHL--LVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIK 275
+G G+L+L LV + V ++ + ETSIA ++ ++IGS GDSQL++
Sbjct: 323 VGGRDGVLYLLRLVTSSGGATVKSMEFSRVWETSIAYCLTVCAPGHLFIGSRLGDSQLVE 382
Query: 276 LNLQPDAK-------------------------GSYVEV-----------------LERY 293
L K G +E+ L+R
Sbjct: 383 YTLLKMTKESAKRQKIEKDPGEIELDEDDMELYGGAIEMQLNDDEEQILESLEFRELDRL 442
Query: 294 VNLGPIVDFC----------VVDLERQGQ-GQVVTCSGAYKDGSLRIVRNGIGINEQASV 342
N+GP+ C + +++R+ +VT SG K+G+L + + + S
Sbjct: 443 RNVGPVKSMCFGRPNYMSSDLAEMKRRDPVFDLVTASGHGKNGALCVHQRSLRPEIITSS 502
Query: 343 ELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCH 402
L+G + +W++ ++ +L+VS + T +L + E E ++ F + T+
Sbjct: 503 ILEGAEQLWAVGRKENESH-KYLIVSRVRSTLVLELGEELVELEEQL--FVTNEPTVAAG 559
Query: 403 DAIYNQL-VQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYL 461
+ L VQVTS + LV T + E + V A+ V + T G +
Sbjct: 560 ELSQGALAVQVTSTCIALV--TDGQQMQEVHIDSNFPVVQASIQDPYVAVLTQNGRPLLY 617
Query: 462 EIGDGILTEVKHAQL-EYEISCLDINPIGENPSYSQIAAVGMWTDIS 507
E L + QL E +IS + + +Q+ ++ ++TD S
Sbjct: 618 E-----LVMAPYVQLREIDISQTPFATFIAHNAATQLTSISVYTDAS 659
>gi|28175094|gb|AAH31197.2| Sf3b3 protein, partial [Mus musculus]
Length = 494
Score = 90.5 bits (223), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 88/349 (25%), Positives = 161/349 (46%), Gaps = 45/349 (12%)
Query: 710 LDTFEYGCSILSCSFSDDSNVYYC-VGTAYVLPEENEPTKGRILVF--IVEDG-KLQLIA 765
L+ E S+ C FS+ +Y VG A L G + +V +G KL+ +
Sbjct: 159 LEQNEAAFSVAVCRFSNTGEDWYVLVGVAKDLILSPRSVAGGFVYTYKLVNNGEKLEFLH 218
Query: 766 EKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYV---Q 822
+ + ++ F G++L + + +++Y D G ++L +C + HI A Y+ Q
Sbjct: 219 KTPVEEVPAAIAPFQGRVLIGVGKLLRVY-----DLGKKKLLRKC-ENKHI-ANYISGIQ 271
Query: 823 TRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLF 882
T G ++V D+ +S + YK E + A D W++ +LD D GA+ N+
Sbjct: 272 TIGHRVIVSDVQESFIWVRYKRNENQLIIFADDTYPRWVTTASLLDYDTVAGADKFGNIC 331
Query: 883 TVR--KNSEGATDE---------ERGRL-------EVVGEYHLGEFVNRFRHGSLVMRLP 924
VR N+ DE +RG L EV+ YH+GE V + +L+
Sbjct: 332 VVRLPPNTNDEVDEDPTGNKALWDRGLLNGASQKAEVIMNYHVGETVLSLQKTTLIP--- 388
Query: 925 DSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFN 983
G ++++ T++G IG++ HE + F + ++ +LR + G +H +RS+
Sbjct: 389 ----GGSESLVYTTLSGGIGILVPFTSHEDHDFFQHVEMHLRSEHPPLCGRDHLSFRSY- 443
Query: 984 NEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
KN +DGDL E F + + +S+ ++ + E+ K++E++
Sbjct: 444 ----YFPVKNVIDGDLCEQFNSMEPNKQKNVSEELDRTPPEVSKKLEDI 488
>gi|193785523|dbj|BAG50889.1| unnamed protein product [Homo sapiens]
Length = 503
Score = 90.5 bits (223), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 88/349 (25%), Positives = 161/349 (46%), Gaps = 45/349 (12%)
Query: 710 LDTFEYGCSILSCSFSDDSNVYYC-VGTAYVLPEENEPTKGRILVF--IVEDG-KLQLIA 765
L+ E S+ C FS+ +Y VG A L G + +V +G KL+ +
Sbjct: 168 LEQNEAAFSVAVCRFSNTGEDWYVLVGVAKDLILNPRSVAGGFVYTYKLVNNGEKLEFLH 227
Query: 766 EKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYV---Q 822
+ + ++ F G++L + + +++Y D G ++L +C + HI A Y+ Q
Sbjct: 228 KTPVEEVPAAIAPFQGRVLIGVGKLLRVY-----DLGKKKLLRKC-ENKHI-ANYISGIQ 280
Query: 823 TRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLF 882
T G ++V D+ +S + YK E + A D W++ +LD D GA+ N+
Sbjct: 281 TIGHRVIVSDVQESFIWVRYKRNENQLIIFADDTYPRWVTTASLLDYDTVAGADKFGNIC 340
Query: 883 TVR--KNSEGATDE---------ERGRL-------EVVGEYHLGEFVNRFRHGSLVMRLP 924
VR N+ DE +RG L EV+ YH+GE V + +L+
Sbjct: 341 VVRLPPNTNDEVDEDPTGNKALWDRGLLNGASQKAEVIMNYHVGETVLSLQKTTLI---- 396
Query: 925 DSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFN 983
G ++++ T++G IG++ HE + F + ++ +LR + G +H +RS+
Sbjct: 397 ---PGGSESLVYTTLSGGIGILVPFTSHEDHDFFQHVEMHLRSEHPPLCGRDHLSFRSY- 452
Query: 984 NEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
KN +DGDL E F + + +S+ ++ + E+ K++E++
Sbjct: 453 ----YFPVKNVIDGDLCEQFNSMEPNKQKNVSEELDRTPPEVSKKLEDI 497
>gi|75145059|sp|Q7XWP1.2|CPSF1_ORYSJ RecName: Full=Probable cleavage and polyadenylation specificity
factor subunit 1; AltName: Full=Cleavage and
polyadenylation specificity factor 160 kDa subunit;
Short=CPSF 160 kDa subunit
gi|38345987|emb|CAD39979.2| OSJNBa0032B23.5 [Oryza sativa Japonica Group]
Length = 1441
Score = 90.5 bits (223), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 87/353 (24%), Positives = 160/353 (45%), Gaps = 33/353 (9%)
Query: 702 FEFISTYPLDTFEYGCSI----LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVF--I 755
+E ST P+ FE ++ L + + ++ +GTAYVL E+ +GR+L+F
Sbjct: 1095 WETKSTIPMQLFENALTVRIVTLHNTTTKENETLLAIGTAYVL-GEDVAARGRVLLFSFT 1153
Query: 756 VEDGKLQLIAE---KETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGH 812
+ L+ E KE+KGAV ++ + G LL A KI L KW EL + +
Sbjct: 1154 KSENSQNLVTEVYSKESKGAVSAVASLQGHLLIASGPKITLNKWT-----GAELTAVAFY 1208
Query: 813 HGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIY 872
+ + + +F++ GD+ KSI L +K + + A+D+ + A E L D
Sbjct: 1209 DAPLHVVSLNIVKNFVLFGDIHKSIYFLSWKEQGSQLSLLAKDFGSLDCFATEFLIDGST 1268
Query: 873 LG-----AENNFNLFTVRKNSEGATDEERG-RLEVVGEYHLGEFVNRFRHGSLVMRLPDS 926
L ++ N +F + + +G +L E+H+G + +F + LP
Sbjct: 1269 LSLVASDSDKNVQIFYY---APKMVESWKGQKLLSRAEFHVGAHITKFLR---LQMLPTQ 1322
Query: 927 DVGQIPT----VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSF 982
+ T ++FG ++G IG IA + + L+ LQ L + V GLN +R F
Sbjct: 1323 GLSSEKTNRFALLFGNLDGGIGCIAPIDELTFRRLQSLQRKLVDAVPHVCGLNPRSFRQF 1382
Query: 983 NNEKK--TVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELT 1033
++ K N +D +L+ S+ LS ++++ + + ++ +++
Sbjct: 1383 HSNGKGHRPGPDNIIDFELLCSYEMLSLDEQLDVAQQIGTTRSQILSNFSDIS 1435
Score = 66.2 bits (160), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 90/393 (22%), Positives = 160/393 (40%), Gaps = 109/393 (27%)
Query: 110 QVLDIKFLYGCAKPTIVVLYQDNKD------ARH----VKTYEVALKDKDFVEGP--WSQ 157
V D F++G +P +V+L++ ++H + + +++ K + P WS
Sbjct: 250 HVKDFAFVHGYIEPVLVILHEQEPTWAGRILSKHHTCMISAFSISMTLK---QHPVIWSA 306
Query: 158 NNLDNGADLLIPVPPPLCGVLIIGEETIVY------CSA--NAFKAIPI-RPSITKAYGR 208
NL + A L+ VPPP+ GVL+I +I Y CS N F + P P I+K+ +
Sbjct: 307 ANLPHDAYQLLAVPPPISGVLVICANSIHYHSQSTSCSLDLNNFSSHPDGSPEISKSNFQ 366
Query: 209 VDADGSR--------YLLGDHAGLLHLLVITHEKEKVTGL-----KIELLGE--TSIAST 253
V+ D ++ + AG + LL + ++ V L K +L TSI ++
Sbjct: 367 VELDAAKATWLSNDIVMFSTKAGEMLLLTVVYDGRVVQRLDLMKSKASVLSSAVTSIGNS 426
Query: 254 ISYL-----DNAVVYIGSSYGDSQLIKL-------------------------------- 276
+L D+ +V S L L
Sbjct: 427 FFFLGSRLGDSLLVQFSYCASKSVLQDLTNERSADIEGDLPFSKRLKRIPSDVLQDVTSV 486
Query: 277 ------------NLQPDAKGSYVEVLERYVNLGPIVDFC----------VVDLERQGQGQ 314
+L+ K SY+ V + +N+GP+ DF + +Q +
Sbjct: 487 EELSFQNIIAPNSLESAQKISYI-VRDALINVGPLKDFSYGLRANADPNAMGNAKQSNYE 545
Query: 315 VVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL-------RSSTDDPFDTFLVV 367
+V CSG K+GSL +++ I + VEL +G+W++ + + D+ + +L++
Sbjct: 546 LVCCSGHGKNGSLSVLQQSIRPDLITEVELPSCRGIWTVYYKSYRGQMAEDNEYHAYLII 605
Query: 368 SFISETRILAMNLEDELEE-TEIEGFCSQTQTL 399
S E R + + D+L E TE + Q T+
Sbjct: 606 SL--ENRTMVLETGDDLGEVTETVDYFVQASTI 636
>gi|26351753|dbj|BAC39513.1| unnamed protein product [Mus musculus]
Length = 463
Score = 90.5 bits (223), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 88/349 (25%), Positives = 161/349 (46%), Gaps = 45/349 (12%)
Query: 710 LDTFEYGCSILSCSFSDDSNVYYC-VGTAYVLPEENEPTKGRILVF--IVEDG-KLQLIA 765
L+ E S+ C FS+ +Y VG A L G + +V +G KL+ +
Sbjct: 128 LEQNEAAFSVAVCRFSNTGEDWYVLVGVAKDLILSPRSVAGGFVYTYKLVNNGEKLEFLH 187
Query: 766 EKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYV---Q 822
+ + ++ F G++L + + +++Y D G ++L +C + HI A Y+ Q
Sbjct: 188 KTPVEEVPAAIAPFQGRVLIGVGKLLRVY-----DLGKKKLLRKC-ENKHI-ANYISGIQ 240
Query: 823 TRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLF 882
T G ++V D+ +S + YK E + A D W++ +LD D GA+ N+
Sbjct: 241 TIGHRVIVSDVQESFIWVRYKRNENQLIIFADDTYPRWVTTASLLDYDTVAGADKFGNIC 300
Query: 883 TVR--KNSEGATDE---------ERGRL-------EVVGEYHLGEFVNRFRHGSLVMRLP 924
VR N+ DE +RG L EV+ YH+GE V + +L+
Sbjct: 301 VVRLPPNTNDEVDEDPTGNKALWDRGLLNGASQKAEVIMNYHVGETVLSLQKTTLIP--- 357
Query: 925 DSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFN 983
G ++++ T++G IG++ HE + F + ++ +LR + G +H +RS+
Sbjct: 358 ----GGSESLVYTTLSGGIGILVPFTSHEDHDFFQHVEMHLRSEHPPLCGRDHLSFRSY- 412
Query: 984 NEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
KN +DGDL E F + + +S+ ++ + E+ K++E++
Sbjct: 413 ----YFPVKNVIDGDLCEQFNSMEPNKQKNVSEELDRTPPEVSKKLEDI 457
>gi|391328522|ref|XP_003738737.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Metaseiulus occidentalis]
Length = 1500
Score = 90.1 bits (222), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 97/426 (22%), Positives = 178/426 (41%), Gaps = 60/426 (14%)
Query: 654 IRSIPLGEHPRRICHQEQSRTFAICSLKNQSCA----------------EESEMHFVRLL 697
+R IP+ E P + + SRT+ + + K ++ ES +
Sbjct: 1067 MRKIPIYETPHSVNYHVDSRTYCVATSKEETATCVPKLANEDKEFEPIERESSRFIPPTV 1126
Query: 698 DDQTFEFIST----------YPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPE 742
D E S P++ +E + + + + VGT + E
Sbjct: 1127 DKFALELWSPVSWEAIPNTRMPMEDWEKITCVKNVMIASEGTTSGEKGLIAVGTIHNFGE 1186
Query: 743 ENEPTKGRILVF-----IVEDG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKI 791
+ KGRIL+ + E G K++ I K V +L + G L+AA+ QK+
Sbjct: 1187 DIT-AKGRILLIDIIEVVPEPGQPLTRSKVKTILSKPQNAPVTALCSVKGHLMAAVGQKL 1245
Query: 792 QLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEE 851
L++ L+D+ +L I L + FI++GD+ KSI+LL Y+ E +
Sbjct: 1246 FLFQ--LKDN---DLVGMAFLDTQIYILSAISIKSFILIGDVHKSITLLRYQEESKTLAV 1300
Query: 852 RARDYNANWMSAVEILDDD---IYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLG 908
++D + ++E L D+ +L + N+ E RL G++++G
Sbjct: 1301 VSKDTKPVQIYSIEYLVDNSQMAFLATDAQCNILVYMYQPENRETFGGQRLIRRGDFNIG 1360
Query: 909 EFVNR-FRHGSLVMRLPDS------DVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQ 961
+N FR + +P S D+ ++ +++G G + + + Y L LQ
Sbjct: 1361 SRINTMFRIRCRLAEVPRSERRLLSDLEARHVTLYASLDGAFGYLLPISEKTYRRLLMLQ 1420
Query: 962 TNLRKVIKGVGGLNHEQWRSFNNEKKTVD--AKNFLDGDLIESFLDLSRTRMDEISKTMN 1019
L + VGGLN + +R + + + KN +DGDLI F+DL+ E+++ +
Sbjct: 1421 NVLNSYCQHVGGLNPKAFRIMQTDVRALSNPQKNIVDGDLINVFMDLNFNEKAEVARKIG 1480
Query: 1020 VSVEEL 1025
+V ++
Sbjct: 1481 TTVHQI 1486
Score = 56.6 bits (135), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 70/329 (21%), Positives = 134/329 (40%), Gaps = 63/329 (19%)
Query: 7 YGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDNG 66
+G I+ +E R + LF+ E K ++++D + EL T A+ + +
Sbjct: 66 FGNISGMEKIRLNASRDSLLFVFKET-KISLVEYDPATHELQTLAIRSLEKEEYKEGFYN 124
Query: 67 QIG----IIDPDCRLIGLHLYDGLFKVIPF-------------------DNKGQLKEAFN 103
+G +DP R + +Y +IPF N E +
Sbjct: 125 FVGNTLIKVDPLNRCAAVLIYGKHLAIIPFVKKDATDLSDPIASSKSTQTNTSGFLEYYT 184
Query: 104 IRLEEL-------QVLDIKFLYGCAKPTIVVLYQD----------NKDARHVKTYEVALK 146
IRL +L + D+ FL G +PT+++LY+ +D + + +
Sbjct: 185 IRLIDLDEEKGVNNIHDMTFLNGYYEPTLLLLYEPIRTWTGRVAIRQDTCSIMALSLNVY 244
Query: 147 DKDFVEGP-WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVY-------------CSANA 192
+ V P WS + L + ++PVP P+ GVLI+ ++Y C
Sbjct: 245 QR--VHPPVWSFSGLPFNSFKVLPVPKPIGGVLILSVNALLYLNQSVPAYGVSLNCFTEC 302
Query: 193 FKAIPIR----PSITKAYGRVD-ADGSRYLLGDHAGLLHLL-VITHEKEKVTGLKIELLG 246
+ P++ P +T R + ++ LL G L++L + T + + + +
Sbjct: 303 STSFPLKDQAGPPLTLDCCRCEFLSETKILLSVANGDLYVLSLFTDGMRSINQFEFKKIA 362
Query: 247 ETSIASTISYLDNAVVYIGSSYGDSQLIK 275
T++A+ IS + +++GS G+S L++
Sbjct: 363 TTTVATCISLCEPGYLFVGSRIGNSLLLR 391
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 41/162 (25%), Positives = 76/162 (46%), Gaps = 20/162 (12%)
Query: 277 NLQPDAKGSY---VEVLERYVNLGPIVDFCV--------VDLERQGQGQVVTCSGAYKDG 325
N + D G+ +EV +R LGP+ + + ++ + ++ G + G
Sbjct: 526 NCEADCLGTTCYKLEVTDRLQTLGPVTKVAMGEPAFQSELSMKTDTEVEIFAACGHERSG 585
Query: 326 SLRIVRNGIGINEQASVELQGIKGMWSLRSST------DDPFDTFLVVSFISETRILAMN 379
+L +++ + + EL G +W++RSS+ D+ FL++S T IL
Sbjct: 586 ALCVLQRTVRPQVITTFELPGCTDLWTVRSSSTRSPDVDEDSHQFLILSRPDSTMILQTG 645
Query: 380 LEDELEETEIEGFCSQTQTLFCHD-AIYNQLVQVTSGSVRLV 420
E+ E + GFC+Q+ T+F + A ++QV SVRL+
Sbjct: 646 --QEINELDHSGFCTQSPTIFAGNLADGRYIIQVCPNSVRLL 685
>gi|285999|dbj|BAA02805.1| KIAA0017 [Homo sapiens]
Length = 399
Score = 90.1 bits (222), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 88/349 (25%), Positives = 161/349 (46%), Gaps = 45/349 (12%)
Query: 710 LDTFEYGCSILSCSFSDDSNVYYC-VGTAYVLPEENEPTKGRILVF--IVEDG-KLQLIA 765
L+ E S+ C FS+ +Y VG A L G + +V +G KL+ +
Sbjct: 64 LEQNEAAFSVAVCRFSNTGEDWYVLVGVAKDLILNPRSVAGGFVYTYKLVNNGEKLEFLH 123
Query: 766 EKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYV---Q 822
+ + ++ F G++L + + +++Y D G ++L +C + HI A Y+ Q
Sbjct: 124 KTPVEEVPAAIAPFQGRVLIGVGKLLRVY-----DLGKKKLLRKC-ENKHI-ANYISGIQ 176
Query: 823 TRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLF 882
T G ++V D+ +S + YK E + A D W++ +LD D GA+ N+
Sbjct: 177 TIGHRVIVSDVQESFIWVRYKRNENQLIIFADDTYPRWVTTASLLDYDTVAGADKFGNIC 236
Query: 883 TVR--KNSEGATDE---------ERGRL-------EVVGEYHLGEFVNRFRHGSLVMRLP 924
VR N+ DE +RG L EV+ YH+GE V + +L+
Sbjct: 237 VVRLPPNTNDEVDEDPTGNKALWDRGLLNGASQKAEVIMNYHVGETVLSLQKTTLIP--- 293
Query: 925 DSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFN 983
G ++++ T++G IG++ HE + F + ++ +LR + G +H +RS+
Sbjct: 294 ----GGSESLVYTTLSGGIGILVPFTSHEDHDFFQHVEMHLRSEHPPLCGRDHLSFRSY- 348
Query: 984 NEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
KN +DGDL E F + + +S+ ++ + E+ K++E++
Sbjct: 349 ----YFPVKNVIDGDLCEQFNSMEPNKQKNVSEELDRTPPEVSKKLEDI 393
>gi|326511015|dbj|BAJ91855.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 706
Score = 90.1 bits (222), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 88/366 (24%), Positives = 159/366 (43%), Gaps = 46/366 (12%)
Query: 694 VRLLDDQTFEFISTYPLDTFEYGCSILSCSFSD-DSNVYYCVGTAYVLPEENEPTKG--- 749
+R+LD ++ + L E SI + +F D + VGT L + P +
Sbjct: 355 IRILDPRSRDTTCLLELQDNEAAVSICTVNFHDKEHGTLLAVGTTKGL--QFWPKRSLAS 412
Query: 750 ---RILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTREL 806
I F+ E L+L+ + + + SL F G+LLA + ++LY D G R+L
Sbjct: 413 GFIHIYKFVNEGKSLELLHKTQVEEVPLSLCQFQGRLLAGVGSVLRLY-----DLGKRKL 467
Query: 807 QSECGHHGHILALY-VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVE 865
+C + + + T D I VGD+ +S Y+ +E + A D W++A
Sbjct: 468 LRKCENKLFPRTIVSIHTYRDRIYVGDMQESFHYCKYRRDENQLYIFADDSVPRWLTAAN 527
Query: 866 ILDDDIYLGAENNFNLFTVR------------------KNSEGATDEERGRLEVVGEYHL 907
+D D GA+ N++ R K +G + ++E + ++H+
Sbjct: 528 HIDFDTMAGADKFGNIYFARLPQDLSDEIEEDPTGGKIKWEQGKLNGAPNKVEEIVQFHV 587
Query: 908 GEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIG-VIASLPHEQYLFLEKLQTNLRK 966
G+ V + SL+ G +I+GTV G +G ++A E F L+ +LR+
Sbjct: 588 GDVVTCLQKASLI-------PGGGECLIYGTVMGSVGALLAFTSREDVDFFSHLEMHLRQ 640
Query: 967 VIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELC 1026
+ G +H +RS K+ +DGDL E + L +I+ ++ + E+
Sbjct: 641 EHPPLCGRDHMAYRS-----AYFPVKDVIDGDLCEQYPSLPADMQRKIADELDRTPGEIL 695
Query: 1027 KRVEEL 1032
K++E++
Sbjct: 696 KKLEDI 701
Score = 83.6 bits (205), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 80/300 (26%), Positives = 131/300 (43%), Gaps = 19/300 (6%)
Query: 407 NQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-D 465
+ L+QV +R + R NEW++P ++ +N QV++A GG L+Y E+
Sbjct: 2 DSLMQVHPNGIRHIREDGR--VNEWRTPGKKTITKVGSNRLQVVIALSGGELIYFEMDMT 59
Query: 466 GILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHL 524
G L EV+ + +++CL I P+ E S+ AVG + D ++RI SL PD L
Sbjct: 60 GQLMEVEKQDMSGDVACLAIAPVPEGRQRSRFLAVGSY-DNTIRILSLDPDDCLQPLSVQ 118
Query: 525 GGEIIPRSVLLCAFE-------GISY-----LLCALGDGHLLNFLLNMKTGELTDRKKVS 572
P S+L + G Y L L +G L ++M TG+L+D +
Sbjct: 119 SVSSAPESLLFLEVQASVGGEDGADYPANLFLNAGLQNGVLFRTNVDMVTGQLSDTRSRF 178
Query: 573 LGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDS 632
LG +P L + + S RP + Y L + ++ + F+S +
Sbjct: 179 LGLRPPKLFPCIVSHRQAMLCLSSRPWLGYIHQGHFLLTPLSCDTLESAASFSSDQCSEG 238
Query: 633 LAIAKEGELTIGTIDDI-QKLHIRSIPLGEHPRRICHQEQSRTFAIC-SLKNQSCAEESE 690
+ L I TI+ + + + SIPL PR+ + + A+ S K AE+ E
Sbjct: 239 VVAVAGDALRIFTIERLGETFNETSIPLRYTPRKFVILPKKKYLAVIESDKGAFSAEQRE 298
>gi|449437538|ref|XP_004136549.1| PREDICTED: pre-mRNA-splicing factor RSE1-like [Cucumis sativus]
Length = 1376
Score = 89.7 bits (221), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 61/210 (29%), Positives = 107/210 (50%), Gaps = 22/210 (10%)
Query: 295 NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINE--QASVELQGIKGMWS 352
N+ PI+D VVD + Q Q+ C G +GSLRI+RNGI + + S QGI +W+
Sbjct: 458 NIAPILDMSVVDKHDEKQDQMFACCGMAPEGSLRIIRNGISVENLLRTSPIYQGITSIWT 517
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
++ D + ++LV+SF+ ETR+L++ L ++ T+ GF S T TL C ++Q+
Sbjct: 518 IKMKRSDTYHSYLVLSFVEETRVLSVGLS-FIDVTDSVGFQSDTCTLACGLLDDGLVIQI 576
Query: 413 TSGSVRLVSST----------SRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLE 462
+VRL T S W P +++ + ++++T +++
Sbjct: 577 HQNAVRLCLPTKIAHSEGIELSSPACTSW-FPDNIGISLGAVGHNVIVVSTSNPCFLFI- 634
Query: 463 IG-------DGILTEVKHAQLEYEISCLDI 485
+G D + E ++ +L+YE+SC+ I
Sbjct: 635 LGVRKVSGYDYEIYEKQYLRLQYELSCISI 664
Score = 52.8 bits (125), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 47/201 (23%), Positives = 85/201 (42%), Gaps = 20/201 (9%)
Query: 828 IVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAEN--NFNLFTVR 885
I VGD I Y+ + +E+ D + ++ +LD D + ++ + + +
Sbjct: 1139 IAVGDCRDGILFFSYQEDAKKLEQIYSDPSQRLVADCTLLDVDTAVVSDRKGSIAILSCS 1198
Query: 886 KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV---GQIP---------T 933
E E L + Y++GE R GS +LP D+ +P T
Sbjct: 1199 DRLEDNASPE-CNLTLNCAYYMGEIAMTLRKGSFSYKLPADDLLRGCAVPGSDFDSSHNT 1257
Query: 934 VIFGTVNGVIGVIASLPHEQYLFLEKLQTNL--RKVIKGVGGLNHEQWRSFNNEKKTVDA 991
+I T+ G I + L ++Y LE +Q L + + G +H ++RS N +
Sbjct: 1258 IIASTLLGSIVIFTPLSRDEYELLEAVQAKLAVHPLTSPILGNDHYEYRSRENP---IGV 1314
Query: 992 KNFLDGDLIESFLDLSRTRMD 1012
LDGD++ FL+L+ + +
Sbjct: 1315 PKILDGDILTQFLELTSMQQE 1335
>gi|449526686|ref|XP_004170344.1| PREDICTED: uncharacterized protein LOC101227016, partial [Cucumis
sativus]
Length = 997
Score = 89.7 bits (221), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 66/236 (27%), Positives = 118/236 (50%), Gaps = 28/236 (11%)
Query: 269 GDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLR 328
GD ++KL + + Y ++ N+ PI+D VVD + Q Q+ C G +GSLR
Sbjct: 438 GDGMVLKLE---NGRLIYANPIQ---NIAPILDMSVVDKHDEKQDQMFACCGMAPEGSLR 491
Query: 329 IVRNGIGINE--QASVELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEE 386
I+RNGI + + S QGI +W+++ D + ++LV+SF+ ETR+L++ L ++
Sbjct: 492 IIRNGISVENLLRTSPIYQGITSIWTIKMKRSDTYHSYLVLSFVEETRVLSVGLS-FIDV 550
Query: 387 TEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSST----------SRELRNEWKSPPG 436
T+ GF S T TL C ++Q+ +VRL T S W P
Sbjct: 551 TDSVGFQSDTCTLACGLLDDGLVIQIHQNAVRLCLPTKIAHSEGIELSSPACTSW-FPDN 609
Query: 437 YSVNVATANASQVLLATGGGHLVYLEIG-------DGILTEVKHAQLEYEISCLDI 485
+++ + ++++T +++ +G D + E ++ +L+YE+SC+ I
Sbjct: 610 IGISLGAVGHNVIVVSTSNPCFLFI-LGVRKVSGYDYEIYEKQYLRLQYELSCISI 664
>gi|53136520|emb|CAG32589.1| hypothetical protein RCJMB04_30d19 [Gallus gallus]
Length = 503
Score = 89.4 bits (220), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 85/349 (24%), Positives = 162/349 (46%), Gaps = 45/349 (12%)
Query: 710 LDTFEYGCSILSCSFSDDSNVYYC-VGTAY-VLPEENEPTKGRILVFIVEDG--KLQLIA 765
L+ E S+ C FS+ +Y VG A ++ G + + + +G KL+ +
Sbjct: 168 LEQNEAAFSVAVCRFSNTGEEWYVLVGVAKDLILNPRSVAGGFVYTYKLVNGGEKLEFLH 227
Query: 766 EKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYV---Q 822
+ + ++ F G++L + + +++Y D G ++L +C + HI A Y+ Q
Sbjct: 228 KTPVEEVPAAIAPFQGRVLIGVGKLLRVY-----DLGKKKLLRKC-ENKHI-ANYICGIQ 280
Query: 823 TRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLF 882
T G ++V D+ +S + YK E + A D W++ +LD D GA+ N+
Sbjct: 281 TIGHRVIVSDVQESFIWVRYKRNENQLIIFADDTYPRWVTTATLLDYDTVAGADKFGNIC 340
Query: 883 TVR--KNSEGATDE---------ERGRL-------EVVGEYHLGEFVNRFRHGSLVMRLP 924
VR N+ DE +RG L EV+ YH+GE V + +L+
Sbjct: 341 VVRLPPNTNDEVDEDPTGNKALWDRGLLNGASQKAEVIMNYHVGETVLSLQKTTLI---- 396
Query: 925 DSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFN 983
G ++++ T++G IG++ HE + F + ++ +LR + G +H +RS+
Sbjct: 397 ---PGGSESLVYTTLSGGIGILVPFTSHEDHDFFQHVEMHLRSEHPPLCGRDHLSFRSY- 452
Query: 984 NEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
KN +DGDL E F + + +++ ++ + E+ K++E++
Sbjct: 453 ----YFPVKNVIDGDLCEQFNSMEPNKQKNVAEELDRTPPEVSKKLEDI 497
>gi|222628488|gb|EEE60620.1| hypothetical protein OsJ_14038 [Oryza sativa Japonica Group]
Length = 1441
Score = 89.4 bits (220), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 86/353 (24%), Positives = 159/353 (45%), Gaps = 33/353 (9%)
Query: 702 FEFISTYPLDTFEYGCSI----LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVF--I 755
+E ST P+ FE ++ L + + ++ +GTAYVL E+ +GR+L+F
Sbjct: 1095 WETKSTIPMQLFENALTVRIVTLHNTTTKENETLLAIGTAYVL-GEDVAARGRVLLFSFT 1153
Query: 756 VEDGKLQLIAE---KETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGH 812
+ L+ E KE+KGAV ++ + G LL A KI L KW EL + +
Sbjct: 1154 KSENSQNLVTEVYSKESKGAVSAVASLQGHLLIASGPKITLNKWT-----GAELTAVAFY 1208
Query: 813 HGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIY 872
+ + + +F++ GD+ KSI L +K + + A+D+ + A E L D
Sbjct: 1209 DAPLHVVSLNIVKNFVLFGDIHKSIYFLSWKEQGSQLSLLAKDFGSLDCFATEFLIDGST 1268
Query: 873 LG-----AENNFNLFTVRKNSEGATDEERG-RLEVVGEYHLGEFVNRFRHGSLVMRLPDS 926
L ++ N +F + + +G +L E+H+G + +F + LP
Sbjct: 1269 LSLVASDSDKNVQIFYY---APKMVESWKGQKLLSRAEFHVGAHITKFLR---LQMLPTQ 1322
Query: 927 DVGQIPT----VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSF 982
+ T ++FG ++G IG IA + + L+ LQ L + V GLN +R F
Sbjct: 1323 GLSSEKTNRFALLFGNLDGGIGCIAPIDELTFRRLQSLQRKLVDAVPHVCGLNPRSFRQF 1382
Query: 983 NNEKK--TVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELT 1033
++ K N +D +L+ + LS ++++ + + ++ +++
Sbjct: 1383 HSNGKGHRPGPDNIIDFELLAHYEMLSLDEQLDVAQQIGTTRSQILSNFSDIS 1435
Score = 66.2 bits (160), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 90/393 (22%), Positives = 160/393 (40%), Gaps = 109/393 (27%)
Query: 110 QVLDIKFLYGCAKPTIVVLYQDNKD------ARH----VKTYEVALKDKDFVEGP--WSQ 157
V D F++G +P +V+L++ ++H + + +++ K + P WS
Sbjct: 250 HVKDFAFVHGYIEPVLVILHEQEPTWAGRILSKHHTCMISAFSISMTLK---QHPVIWSA 306
Query: 158 NNLDNGADLLIPVPPPLCGVLIIGEETIVY------CSA--NAFKAIPI-RPSITKAYGR 208
NL + A L+ VPPP+ GVL+I +I Y CS N F + P P I+K+ +
Sbjct: 307 ANLPHDAYQLLAVPPPISGVLVICANSIHYHSQSTSCSLDLNNFSSHPDGSPEISKSNFQ 366
Query: 209 VDADGSR--------YLLGDHAGLLHLLVITHEKEKVTGL-----KIELLGE--TSIAST 253
V+ D ++ + AG + LL + ++ V L K +L TSI ++
Sbjct: 367 VELDAAKATWLSNDIVMFSTKAGEMLLLTVVYDGRVVQRLDLMKSKASVLSSAVTSIGNS 426
Query: 254 ISYL-----DNAVVYIGSSYGDSQLIKL-------------------------------- 276
+L D+ +V S L L
Sbjct: 427 FFFLGSRLGDSLLVQFSYCASKSVLQDLTNERSADIEGDLPFSKRLKRIPSDVLQDVTSV 486
Query: 277 ------------NLQPDAKGSYVEVLERYVNLGPIVDFC----------VVDLERQGQGQ 314
+L+ K SY+ V + +N+GP+ DF + +Q +
Sbjct: 487 EELSFQNIIAPNSLESAQKISYI-VRDALINVGPLKDFSYGLRANADPNAMGNAKQSNYE 545
Query: 315 VVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL-------RSSTDDPFDTFLVV 367
+V CSG K+GSL +++ I + VEL +G+W++ + + D+ + +L++
Sbjct: 546 LVCCSGHGKNGSLSVLQQSIRPDLITEVELPSCRGIWTVYYKSYRGQMAEDNEYHAYLII 605
Query: 368 SFISETRILAMNLEDELEE-TEIEGFCSQTQTL 399
S E R + + D+L E TE + Q T+
Sbjct: 606 SL--ENRTMVLETGDDLGEVTETVDYFVQASTI 636
>gi|196012166|ref|XP_002115946.1| hypothetical protein TRIADDRAFT_59883 [Trichoplax adhaerens]
gi|190581722|gb|EDV21798.1| hypothetical protein TRIADDRAFT_59883 [Trichoplax adhaerens]
Length = 1187
Score = 89.4 bits (220), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 135/562 (24%), Positives = 234/562 (41%), Gaps = 98/562 (17%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRP--T 63
YG+I +E R +D LFI K ++++DA++S L T ++ D + +
Sbjct: 81 FYGKIYGIESVRFRHHHRDSLFICFADAKLSLVEYDADNSNLTTLSLHTFEDDELKNGFS 140
Query: 64 DNGQIGII--DPDCRLIGLHLYDGLFKVIPFDNKGQ-----------------LKEAFNI 104
N I II DPD R + + + ++PF ++G L ++ +
Sbjct: 141 RNLSIPIIRVDPDNRCAAMVVSNVHLAILPFRHRGPAEQQVQIDPKNTSGKYPLMPSYVV 200
Query: 105 RLEEL------QVLDIKFLYGCAKPTIVVL------YQDNKDARH--VKTYEVALKDKDF 150
+ +L +++DI+FL G +PTI++L + R V+L D
Sbjct: 201 DVRDLGNEKVSRLIDIRFLEGYYEPTILILCEILRTWSGRVAVRQDTCSILAVSLNTIDK 260
Query: 151 VEGP-WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRV 209
V WS NNL I VP P+ GVLI +++ + + +P ++ +
Sbjct: 261 VHPVIWSLNNLPFDCLGAITVPRPIGGVLIFAANCLLHLNQS-------KPPYAESLNSI 313
Query: 210 DADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIE-LLGETSIASTISYLDNAVVYIGSSY 268
+ + + + D A L T + ++ T K+ + Y + SY
Sbjct: 314 TDESTTFPMHD-ADLAPNTPETQDTDEPTSKKLRTDDEKEDEELEKLYSAHTSCTAKESY 372
Query: 269 GDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDL----ERQGQGQVVTCSGAYKD 324
+ EV +R +++GP + + + + +VV CSG K+
Sbjct: 373 -------------LRSYTFEVCDRILHVGPCASIAIGQISTFVQEESDVEVVICSGHDKN 419
Query: 325 GSLRIVRNGIGINEQASVELQGIKGMWS-----LRSSTDDPFDT-----FLVVSFISETR 374
G+L ++ GI AS +L G MW+ L D F+T FL++S + T
Sbjct: 420 GALSVLNKGIKPQVVASYDLPGCVDMWTVKDIRLNDENDGDFETENTHKFLIISRDNLTM 479
Query: 375 ILAMNLEDELEETEIEGFCSQTQTLFCHDAIY-NQLVQVTSGSVRLVSSTSR----ELRN 429
IL E+ E E GF +QT+T+F + N ++QVT V LVS + EL N
Sbjct: 480 ILRTG--KEITEVEQLGFLTQTKTVFAGNLDNGNCIIQVTPYEVILVSKGEKIQQLELEN 537
Query: 430 EWKSPPGYSVNVATANASQVLLATGGG-HLVYLEIGDGILTEVKHAQLEYEISCLDINPI 488
E SP V + + LL GG ++ E+ D E ++ ++ P+
Sbjct: 538 E--SP---IVFCSLQDPYISLLLEGGSIMMLAFELSDN---------GEKQVKLVNTTPL 583
Query: 489 GENPSYSQIAAVGMWTDISVRI 510
++S+IAA ++ D + R+
Sbjct: 584 ----NHSRIAACCLFQDNNGRM 601
Score = 63.9 bits (154), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 107/470 (22%), Positives = 185/470 (39%), Gaps = 111/470 (23%)
Query: 618 VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDIQKLH------IRSIPLGEHPRRICHQE 670
V PFN A P+ L E +L I +D Q+ + +PL I H
Sbjct: 772 VKCFAPFNIANCPNGFLYFNSEEDLRICVLD--QRFTYDCPWPVHKVPLRNTLHFITHHF 829
Query: 671 QSRTFAICSLKNQSC-------AEESEM-------HFV---------RLLDDQTFEFI-- 705
++T+ I S C E+ E F+ +L+ +T+E I
Sbjct: 830 VTKTYVIISSTMTVCEKMPHITTEDKEFIPVEKGDRFIHAPVEKFCLQLITSETWEIIPD 889
Query: 706 STYPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-----I 755
+ + +E+ + S + V + VGT V EE +GRI++F +
Sbjct: 890 AEIQMAEWEHVTCLKSVKLKSEETVSGLKEFIAVGTTNVCGEE-VACRGRIVIFDVIEVV 948
Query: 756 VEDGK------LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSE 809
E GK ++ +KE KG V ++ G L+ +I QKI Y W RD+
Sbjct: 949 PEPGKPLTKNKIKTYYDKEQKGPVTAITCVEGFLVTSIGQKI--YIWEFRDNKD------ 1000
Query: 810 CGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDD 869
++G M I LIY H + +N N+ L
Sbjct: 1001 -------------------LIG--MAFIDTLIYIHSLD--RHQLEIFNTNFYVNKNQLG- 1036
Query: 870 DIYLGAENNFNLFTVRKN--SEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSD 927
++ E++ F VR+ G+ R +V L + N +H +
Sbjct: 1037 --FVAPESHGGQFLVRRAEIQTGSNAHAFFRTKVRA---LNQRQNENKHITW-------- 1083
Query: 928 VGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKK 987
FGT++G IG++ + ++Y L LQ L ++ GLN + +R+F + +K
Sbjct: 1084 --------FGTLDGSIGLLLPVDEKEYRRLFSLQAKLSIYLEQNAGLNQKAFRTFRSHQK 1135
Query: 988 TV--DAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRL 1035
+ +N LDGDL++ + L +++K + + E++ + +LT+L
Sbjct: 1136 KLQNSMRNILDGDLLKRYFHLGFVERRDLAKQIMSTPEQI---INDLTKL 1182
>gi|322792443|gb|EFZ16427.1| hypothetical protein SINV_15375 [Solenopsis invicta]
Length = 1532
Score = 89.4 bits (220), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 84/321 (26%), Positives = 149/321 (46%), Gaps = 39/321 (12%)
Query: 731 YYCVGTAYVLPEENEPTKGRILVF-IVE----------DGKLQLIAEKETKGAVYSLNAF 779
Y +GT Y E+ ++GRIL+F I+E + + I KE KG + ++
Sbjct: 1060 YIVLGTNYNYGEDIT-SRGRILIFDIIEVVPEPGQPLTKNRFKQIYAKEQKGPITAITQV 1118
Query: 780 NGKLLAAINQKIQLYKWMLRDD---GTRELQSECGHHGHILALYVQTRGDFIVVGDLMKS 836
+G L++A+ QKI Y W L+D+ G + ++ H +L++ I++ D+ KS
Sbjct: 1119 SGFLVSAVGQKI--YIWQLKDNDLVGVAFIDTQIYIH-QMLSI-----KSLILIADVYKS 1170
Query: 837 ISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG-----AENNFNLFTVRKNSEGA 891
ISLL ++ E + +RD+ + +E L D+ LG E+N LF + S +
Sbjct: 1171 ISLLRFQEEYRTLSLVSRDFRPAEVYTIEYLIDNTNLGFIVADGESNLALFMYQPESRES 1230
Query: 892 TDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIP------TVIFGTVNGVIGV 945
+ +L ++HLG+ VN F + P +D Q ++ +++G +G
Sbjct: 1231 LGGQ--KLIRKADFHLGQKVNTFFRIRCRVTDPANDKKQFSGADKRHVTMYASLDGSLGY 1288
Query: 946 IASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNN---EKKTVDAKNFLDGDLIES 1002
I +P + Y L LQ L I + GLN + +R + A+ +DGDL+
Sbjct: 1289 ILPVPEKTYRRLLMLQNVLVTHICHIAGLNPKSYRHTYKSYIRNQGNPARGIIDGDLVWR 1348
Query: 1003 FLDLSRTRMDEISKTMNVSVE 1023
+L L +++K + V+
Sbjct: 1349 YLFLPNNEKADLAKKIGTRVQ 1369
Score = 51.2 bits (121), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 67/329 (20%), Positives = 139/329 (42%), Gaps = 60/329 (18%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRP--T 63
++G I +++ G +D L ++ K V+++D + +L T ++ + + T
Sbjct: 41 LHGNIMSMQAVHLIGSQRDSLLLSFRDAKLSVVEYDQDIHDLRTVSLHYFEEEEIKDGWT 100
Query: 64 DNGQIGII--DPDCRLIGLHLYDGLFKVIPF-------------------DNKGQLKEAF 102
++ I I+ DP+ R + ++ V+PF NK + ++
Sbjct: 101 NHHHIPIVRVDPEGRCAVMLIFGRKLVVLPFRKDPSLDDGDLLDTAKLTSSNKAPILSSY 160
Query: 103 NIRLEELQ-----VLDIKFLYGCAKPTIVVLYQDNKD------ARHVKTYEVALK---DK 148
I L+ L+ V+D++FL+G +PT+++LY+ + R VA+ +
Sbjct: 161 MIVLKSLEEKMDNVIDLQFLHGYYEPTLLILYEPVRTFAGRIAVRQDTCAMVAISLNIQQ 220
Query: 149 DFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAFKA 195
WS +NL +PV PL G LI+ +++Y + A+
Sbjct: 221 RVHPIIWSVSNLPFDCYQAVPVKKPLGGTLIMAFNSLIYLNQSIPPYGVSLNSLADTSTN 280
Query: 196 IPIRP------SITKA-YGRVDADGSRYLLGDHAGLLHLL-VITHEKEKVTGLKIELLGE 247
P++P S+ A + AD R ++ +G L++L + V G +
Sbjct: 281 FPLKPQEGVKMSLEGAQVAFISAD--RLVISLKSGELYVLSLFADSMRSVRGFHFDKAAA 338
Query: 248 TSIASTISYLDNAVVYIGSSYGDSQLIKL 276
+ + S + ++ +++GS G+S L++
Sbjct: 339 SVLTSCVCMCEDNYLFLGSRLGNSLLLRF 367
Score = 44.7 bits (104), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 45/155 (29%), Positives = 76/155 (49%), Gaps = 21/155 (13%)
Query: 285 SYV-EVLERYVNLGPIVDFCVVD--------LERQGQG-QVVTCSGAYKDGSLRIVRNGI 334
SY+ EV + +N+GP + + + L+ Q ++VT SG K+G+L +++ I
Sbjct: 435 SYIFEVCDSLLNIGPCGNISMGEPAFLSEEFLQNQDPDVELVTTSGYGKNGALCVLQRSI 494
Query: 335 GINEQASVELQGIKGMWSLRSS-TDDPFDT-------FLVVSFISETRILAMNLEDELEE 386
+ EL G + MW++ + +D T FL++S T IL E+ E
Sbjct: 495 RPQVVTTFELPGCEDMWTVIGTLNNDEIKTEAEGSHAFLILSQEDSTMILQTG--QEINE 552
Query: 387 TEIEGFCSQTQTLFCHDAIYNQ-LVQVTSGSVRLV 420
+ GF +Q T+F + N+ +VQVT VRL+
Sbjct: 553 VDQSGFSTQGSTVFAGNLGANRYIVQVTQMGVRLL 587
>gi|12653387|gb|AAH00463.1| SF3B3 protein [Homo sapiens]
gi|13111947|gb|AAH03146.1| SF3B3 protein [Homo sapiens]
Length = 399
Score = 89.4 bits (220), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 88/349 (25%), Positives = 161/349 (46%), Gaps = 45/349 (12%)
Query: 710 LDTFEYGCSILSCSFSDDSNVYYC-VGTAYVLPEENEPTKGRILVF--IVEDG-KLQLIA 765
L+ E S+ C FS+ +Y VG A L G + +V +G KL+ +
Sbjct: 64 LEQNEAAFSVAVCRFSNIGEDWYVLVGVAKDLILNPRSVAGGFVYTYKLVNNGEKLEFLH 123
Query: 766 EKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYV---Q 822
+ + ++ F G++L + + +++Y D G ++L +C + HI A Y+ Q
Sbjct: 124 KTPVEEVPAAIAPFQGRVLIGVGKLLRVY-----DLGKKKLLRKC-ENKHI-ANYISGIQ 176
Query: 823 TRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLF 882
T G ++V D+ +S + YK E + A D W++ +LD D GA+ N+
Sbjct: 177 TIGHRVIVSDVQESFIWVRYKRNENQLIIFADDTYPRWVTTASLLDYDTVAGADKFGNIC 236
Query: 883 TVR--KNSEGATDE---------ERGRL-------EVVGEYHLGEFVNRFRHGSLVMRLP 924
VR N+ DE +RG L EV+ YH+GE V + +L+
Sbjct: 237 VVRLPPNTNDEVDEDPTGNKALWDRGLLNGASQKAEVIMNYHVGETVLSLQKTTLIP--- 293
Query: 925 DSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFN 983
G ++++ T++G IG++ HE + F + ++ +LR + G +H +RS+
Sbjct: 294 ----GGSESLVYTTLSGGIGILVPFTSHEDHDFFQHVEMHLRSEHPPLCGRDHLSFRSY- 348
Query: 984 NEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
KN +DGDL E F + + +S+ ++ + E+ K++E++
Sbjct: 349 ----YFPVKNVIDGDLCEQFNSMEPNKQKNVSEELDRTPPEVSKKLEDI 393
>gi|9794908|gb|AAF98388.1| cleavage and polyadenylation specificity factor [Drosophila
melanogaster]
Length = 813
Score = 89.4 bits (220), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 106/435 (24%), Positives = 201/435 (46%), Gaps = 62/435 (14%)
Query: 654 IRSIPLGEHPRRICHQEQSRTFAICSL-------------KNQSCAEES-EMHFVR---- 695
+R +PL PR++ + ++R + + + +++ +EES + F+
Sbjct: 384 VRKVPLRCTPRQLVYHRENRVYCLITQTEEPMTKYYRFNGEDKELSEESRDERFIYPIGS 443
Query: 696 -----LLDDQTFEFI--STYPLDTFEY--GCSILSCSFSDDSN---VYYCVGTAYVLPEE 743
L+ +T+E + ++ + +E+ I+ S+ + Y C+GT + E
Sbjct: 444 QFEMVLISPETWEIVPDASITFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNY-SE 502
Query: 744 NEPTKGRILVF-----IVEDGK------LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQ 792
+ ++G I ++ + E GK ++ I +KE KG V +++ G L+ + QKI
Sbjct: 503 DITSRGNIHIYDIIEVVPEPGKPMTKFKIKEIFKKEQKGPVSAISDVLGFLVTGLGQKI- 561
Query: 793 LYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEER 852
Y W LRD +L +I + T I + D+ KSISLL ++ E +
Sbjct: 562 -YIWQLRDG---DLIGVAFIDTNIYVHQIITVKSLIFIADVYKSISLLRFQEEYRTLSLA 617
Query: 853 ARDYNANWMSAVEILDDDIYLG-----AENNFNLFTVRKNSEGATDEERGRLEVVGEYHL 907
+RD+N + +E + D+ LG AE N ++ + + + ++ L +YHL
Sbjct: 618 SRDFNPLEVYGIEFMVDNSNLGFLVTDAERNIIVYMYQPEARESLGGQK--LLRKADYHL 675
Query: 908 GEFVNR-FR----HGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQT 962
G+ VN FR L R P + V++GT++G +G LP + Y LQ
Sbjct: 676 GQVVNTMFRVQCHQKGLHQRQPFLYENK-HFVVYGTLDGALGYCLPLPEKVYRRFLMLQN 734
Query: 963 NLRKVIKGVGGLNHEQWRSFNNEKKT--VDAKNFLDGDLIESFLDLSRTRMDEISKTMNV 1020
L + + GLN +++R+ + KK ++ +DGDLI S+ ++ + +E++K +
Sbjct: 735 VLLSYQEHLCGLNPKEYRTLKSSKKQGINPSRCIIDGDLIWSYRLMANSERNEVAKKIGT 794
Query: 1021 SVEELCKRVEELTRL 1035
EE+ + E+ RL
Sbjct: 795 RTEEILGDLLEIERL 809
>gi|390347522|ref|XP_003726804.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Strongylocentrotus purpuratus]
Length = 1439
Score = 89.4 bits (220), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 82/322 (25%), Positives = 144/322 (44%), Gaps = 38/322 (11%)
Query: 731 YYCVGTAYVLPEENEPTKGRILVF-----IVEDG------KLQLIAEKETKGAVYSLNAF 779
Y V T +V E+ + T+G + ++ + E G KL+ + EK KG V +L
Sbjct: 1115 YVVVATTHVYSEDLQ-TRGSVYIYDCIEVVPEPGQPLTKNKLKPLYEKRQKGPVSALCEV 1173
Query: 780 NGKLLAAINQKIQLYKWMLRDD---GTRELQSECGHHGHILALYVQTRGDFIVVGDLMKS 836
G LL I QK+ Y W +D+ G + ++ H + + FI++ D+MK
Sbjct: 1174 MGFLLTCIGQKV--YMWQFKDNDLIGLAFIDTQIYIHNAV------SVKQFILITDVMKG 1225
Query: 837 ISLLIYKHEEGAIEERARDYNANWMSAVEILDDD---IYLGAENNFNLFTVRKNSEGATD 893
L Y+ ++ + +RD + E + DD +L ++ + NL + E
Sbjct: 1226 AYFLQYQAQDRTLSLVSRDARPLEIFGCEFMVDDKQMAFLVSDADKNLIVFHYHPEAPES 1285
Query: 894 EERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQI--------PTVIFGTVNGVIGV 945
L G+ ++G VN F + RL D Q+ V F T++G +G+
Sbjct: 1286 HGGAYLLRRGDMNIGSAVNTFVR--VRCRLTDPSTEQVLSGPVLRRQVVFFATLDGSLGL 1343
Query: 946 IASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVD--AKNFLDGDLIESF 1003
+ + + Y L LQ L + VGGLN + +R + + ++ +N LDGDL+ +
Sbjct: 1344 LLPMVEKTYRRLLMLQNVLTNGLPHVGGLNPKSYRHVKSHMRNLNNPHRNILDGDLLLKY 1403
Query: 1004 LDLSRTRMDEISKTMNVSVEEL 1025
LS +E +K + SV+++
Sbjct: 1404 CHLSVVERNEFAKKIGTSVDQI 1425
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 70/332 (21%), Positives = 139/332 (41%), Gaps = 71/332 (21%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
I+G++ +++ + G +D L ++ K ++++D +L T +M + D
Sbjct: 70 IFGKVMSMQSAQVTGSGRDALLLSFMNAKVSIVEYDPNMHDLKTLSMHYFEE------DE 123
Query: 66 GQIGI----------IDPDCRLIGLHLYDGLFKVIPF----------------DNKGQLK 99
+ G+ +DPD R + Y V+PF +G L
Sbjct: 124 TKEGVYRNIFHPVVKVDPDHRCAIMLTYGSKLVVLPFRRDGLVEDLDKSMSASTRRGALM 183
Query: 100 EAFNIRLEEL-----QVLDIKFLYGCAKPTIVVLYQD-----NKDARHVKTYEVALKDKD 149
++ IRL E+ VLDI+FL+G +PT+++LY+ + A T + +
Sbjct: 184 PSYVIRLNEMDDPICNVLDIQFLHGYYEPTLLILYEPLRTWAGRVAVRQDTCSIVALSLN 243
Query: 150 FVEGP----WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVY---------CSANAF--- 193
+ WSQ++L + VP P+ GVLI+ +++Y S N+
Sbjct: 244 MAQKVHPIIWSQSSLPYDCMQVQAVPKPIGGVLILAVNSLLYLNQSIPPYGVSLNSLTDW 303
Query: 194 -KAIPIRPSITKAYGRVDADGS--------RYLLGDHAGLLHLL-VITHEKEKVTGLKIE 243
A P++ T+ ++ D + R L G +++L ++ V G ++
Sbjct: 304 STAFPLK---TQEGVKLSMDCTQATFISYDRLALSLKDGEIYVLTLLVDGMRSVRGFHLD 360
Query: 244 LLGETSIASTISYLDNAVVYIGSSYGDSQLIK 275
+ + + I + + +++GS G+S L+K
Sbjct: 361 KAAASVLTTCICPMGDGFLFLGSRLGNSLLLK 392
>gi|66361481|ref|XP_627314.1| possible spliceosome factor [Cryptosporidium parvum Iowa II]
gi|46228697|gb|EAK89567.1| possible spliceosome factor [Cryptosporidium parvum Iowa II]
Length = 1317
Score = 89.0 bits (219), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 110/522 (21%), Positives = 209/522 (40%), Gaps = 76/522 (14%)
Query: 176 GVLIIGEETIVYCS---ANAFKAIPIRPSITKAYGRVDADGSRY--------LLGDHAGL 224
GVL+ + I+YC +N F P R + YG + S + L+ G
Sbjct: 245 GVLVCDYKGIIYCKMGHSNIFCPYPFRFGDSSEYGTMIVASSLHKLKGFFLILVQTELGD 304
Query: 225 LHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKL-NLQPDAK 283
++ + + H + V ++I + +++ L + ++ +G+ ++ +L D
Sbjct: 305 IYRINLIHNEGIVKEMRIYYYDTIPVCNSLLLLRSGFLFASHEFGNHTNYQIVSLGDDKT 364
Query: 284 GSYVEVL----------------------ERYVNLGPIVDFCVVDLERQGQGQVVTCSGA 321
Y L E ++L PI D V+D G Q+V G
Sbjct: 365 DPYTSSLPDSNDLKRVYFRPRNCQCIRKSEEILSLSPITDIKVIDTNNDGTPQIVATCGR 424
Query: 322 YKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTD-------------DPFDTFLVV 367
+LR+ G + E A L G + +W+L++ D D ++++
Sbjct: 425 GPRSTLRVCSYGKNVEEIAENPLPGRPRCIWTLKNGIDPSLSGSQAEAAILDNIHHYIII 484
Query: 368 SFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAI-YNQLVQVTSGSVRLVSSTSRE 426
SFI R L + + + +EET F T++ I YN +QV V+L+ ++
Sbjct: 485 SFID--RSLVLTIGEHVEETNDTLFTLNETTMYAASMIFYNSFLQVLETHVKLI---IQD 539
Query: 427 LRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI-----------GDGILTEVKHAQ 475
+WK+P + A +N QV LA GG +V LE+ G G L EV +
Sbjct: 540 RIYDWKTPDSRKIIAADSNGRQVSLALEGGLIVILELNVNGVSGITNTGMGGLVEVCRRE 599
Query: 476 LEYEISCLDINPIGENPSY-SQIAAVGMWTDISVRIFSLPDLNLITKEHL-----GGEII 529
+ EI C+ I + + S VG T+ ++R++ + K+ I
Sbjct: 600 ITCEIICIGIQQLSYSGQMRSDYVVVGTSTENALRLYKIDSAEKRLKQTCTQILPNSNSI 659
Query: 530 PRSVLLCAFEGISYLLCALG--DGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKN 587
P +V L +L+ +G G +L+ ++ G ++D + LG + + + ++
Sbjct: 660 PENVQLYHSNKYGHLILFVGLTTGVILSCKVDASNGSISDPRSKYLGNRGVNICRIMRED 719
Query: 588 ---TTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNS 626
+ S RP ++ S + ++ + + + + P N+
Sbjct: 720 FGGEMSLVCMSSRPWLVDSQTSGVNFTPLQYRCIDSIAPLNT 761
Score = 82.8 bits (203), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 93/390 (23%), Positives = 164/390 (42%), Gaps = 68/390 (17%)
Query: 694 VRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLP------------ 741
VR+++ ++ E I PLDT E S C F + +GT Y +
Sbjct: 947 VRIVNLKSMETIQLIPLDTNEGCISACVCKFDELDLPCLVLGTVYGMKLNRGYNNNNSNN 1006
Query: 742 ---------EENEPTKGRILVFIVEDGK--LQLIAEKETKGAVYSLNAFNGKLLAAINQK 790
+EN + G + D +L+ + + +L + G+LL IN+
Sbjct: 1007 STDSAKNNDKENHDSFGAAIKIFKYDSNYNFELVHITPIENSATALTGWRGRLLVGINKT 1066
Query: 791 IQLY----KWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEE 846
+++Y K +LR R + +++ D I GD+ + + + +
Sbjct: 1067 LRVYSLGKKRLLRKSEYRNIPQGL--------TWIKVVNDRIFAGDISNGVLVFKFNNTS 1118
Query: 847 GAIEERARDYNANWM-SAVEILD----------DDIYLG-----AENNFNL---FTVRKN 887
A+D W+ SA E+LD D+I + A ++F+ FT N
Sbjct: 1119 NQFILVAKDPMPRWLTSACEVLDYHTIAVSDKFDNIIVSRVPAEASDDFSFVTSFTDNNN 1178
Query: 888 SEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIA 947
S+ + ++ V ++HLG+ V + L P S +I+GTV G IG ++
Sbjct: 1179 SQSSALMRTHQINTVAQFHLGDIVTCLQKSQLT---PTS----AEAIIYGTVLGSIGSLS 1231
Query: 948 S-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDL 1006
L +E L KL+ LRK + +H +RS+ + N +DGD ++F L
Sbjct: 1232 PILNNEDIELLSKLEILLRKQKSTLLSRDHLMFRSYYSP-----VHNVIDGDFCQTFTIL 1286
Query: 1007 SRTRMDEISKTMNVSVEELCKRVEEL-TRL 1035
EI+ ++V+VEE+ K++++ TRL
Sbjct: 1287 DSQIQSEIASKLDVTVEEIYKKLDDYKTRL 1316
>gi|390358535|ref|XP_789715.3| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Strongylocentrotus purpuratus]
Length = 1223
Score = 89.0 bits (219), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 82/322 (25%), Positives = 144/322 (44%), Gaps = 38/322 (11%)
Query: 731 YYCVGTAYVLPEENEPTKGRILVF-----IVEDG------KLQLIAEKETKGAVYSLNAF 779
Y V T +V E+ + T+G + ++ + E G KL+ + EK KG V +L
Sbjct: 899 YVVVATTHVYSEDLQ-TRGSVYIYDCIEVVPEPGQPLTKNKLKPLYEKRQKGPVSALCEV 957
Query: 780 NGKLLAAINQKIQLYKWMLRDD---GTRELQSECGHHGHILALYVQTRGDFIVVGDLMKS 836
G LL I QK+ Y W +D+ G + ++ H + + FI++ D+MK
Sbjct: 958 MGFLLTCIGQKV--YMWQFKDNDLIGLAFIDTQIYIHNAV------SVKQFILITDVMKG 1009
Query: 837 ISLLIYKHEEGAIEERARDYNANWMSAVEILDDD---IYLGAENNFNLFTVRKNSEGATD 893
L Y+ ++ + +RD + E + DD +L ++ + NL + E
Sbjct: 1010 AYFLQYQAQDRTLSLVSRDARPLEIFGCEFMVDDKQMAFLVSDADKNLIVFHYHPEAPES 1069
Query: 894 EERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQI--------PTVIFGTVNGVIGV 945
L G+ ++G VN F + RL D Q+ V F T++G +G+
Sbjct: 1070 HGGAYLLRRGDMNIGSAVNTFVR--VRCRLTDPSTEQVLSGPVLRRQVVFFATLDGSLGL 1127
Query: 946 IASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVD--AKNFLDGDLIESF 1003
+ + + Y L LQ L + VGGLN + +R + + ++ +N LDGDL+ +
Sbjct: 1128 LLPMVEKTYRRLLMLQNVLTNGLPHVGGLNPKSYRHVKSHMRNLNNPHRNILDGDLLLKY 1187
Query: 1004 LDLSRTRMDEISKTMNVSVEEL 1025
LS +E +K + SV+++
Sbjct: 1188 CHLSVVERNEFAKKIGTSVDQI 1209
>gi|238609081|ref|XP_002397396.1| hypothetical protein MPER_02189 [Moniliophthora perniciosa FA553]
gi|215471757|gb|EEB98326.1| hypothetical protein MPER_02189 [Moniliophthora perniciosa FA553]
Length = 180
Score = 89.0 bits (219), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 54/156 (34%), Positives = 85/156 (54%), Gaps = 6/156 (3%)
Query: 295 NLGPIVDFCVVD-LERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIK-GMWS 352
+L PI+D V++ L Q+ G SLR +R+G+ + E S EL GI +W+
Sbjct: 29 SLCPILDSKVLNILPNSDTPQIFAVCGRGARSSLRTLRHGLEVEESVSSELPGIPNAVWT 88
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
+ DDPFD+++++SF++ T +L++ + +EE + GF S TL + L+QV
Sbjct: 89 TKKREDDPFDSYIILSFVNGTLVLSIG--ETIEEVQDTGFLSSAPTLAVQQIGADALLQV 146
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQ 448
G +R V + SR NEW+ P G S+ AT N Q
Sbjct: 147 HPGGIRHVLADSRV--NEWRVPTGKSIVAATTNKRQ 180
>gi|402579535|gb|EJW73487.1| hypothetical protein WUBG_15606 [Wuchereria bancrofti]
Length = 254
Score = 88.6 bits (218), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 75/236 (31%), Positives = 118/236 (50%), Gaps = 19/236 (8%)
Query: 357 TDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGS 416
T D FD+ +VVSF++ T +L++ + +EE GF T TL C + L+QV
Sbjct: 9 TLDKFDSHIVVSFVNATLVLSIG--ETVEEVTDSGFLGTTPTLGCALIGDDALLQVYPDG 66
Query: 417 VRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTE-VKHA 474
+R + + R NEWK+P ++ N QV +A GG LVY E+ G L E +
Sbjct: 67 IRHIRADRRV--NEWKAPGKRTIMKCALNRRQVAIALAGGELVYFELDVTGQLNEYTERR 124
Query: 475 QLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPD---LNLITKEHLGGEIIPR 531
+L ++ C+ ++ I E S+ VG+ D +VRI SL L+ ++ + L E P
Sbjct: 125 ELPADVLCMSLSEIPEGELRSRFLTVGL-ADKTVRIISLDPQDCLSPLSMQALPSE--PE 181
Query: 532 SVLLCAFEGIS-------YLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITL 580
S+++ G +L L +G LL ++ TGELTD + LGT+ + L
Sbjct: 182 SIIVLEMFGTEIQSASTVHLNIGLQNGCLLRTTVDQVTGELTDNRTRYLGTKSVKL 237
>gi|297722899|ref|NP_001173813.1| Os04g0252200 [Oryza sativa Japonica Group]
gi|255675253|dbj|BAH92541.1| Os04g0252200, partial [Oryza sativa Japonica Group]
Length = 432
Score = 88.6 bits (218), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 86/353 (24%), Positives = 159/353 (45%), Gaps = 33/353 (9%)
Query: 702 FEFISTYPLDTFEYGCSI----LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVF--I 755
+E ST P+ FE ++ L + + ++ +GTAYVL E+ +GR+L+F
Sbjct: 86 WETKSTIPMQLFENALTVRIVTLHNTTTKENETLLAIGTAYVLGED-VAARGRVLLFSFT 144
Query: 756 VEDGKLQLIAE---KETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGH 812
+ L+ E KE+KGAV ++ + G LL A KI L KW EL + +
Sbjct: 145 KSENSQNLVTEVYSKESKGAVSAVASLQGHLLIASGPKITLNKWT-----GAELTAVAFY 199
Query: 813 HGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIY 872
+ + + +F++ GD+ KSI L +K + + A+D+ + A E L D
Sbjct: 200 DAPLHVVSLNIVKNFVLFGDIHKSIYFLSWKEQGSQLSLLAKDFGSLDCFATEFLIDGST 259
Query: 873 LG-----AENNFNLFTVRKNSEGATDEERG-RLEVVGEYHLGEFVNRFRHGSLVMRLPDS 926
L ++ N +F + + +G +L E+H+G + +F + LP
Sbjct: 260 LSLVASDSDKNVQIFYY---APKMVESWKGQKLLSRAEFHVGAHITKFLR---LQMLPTQ 313
Query: 927 DVGQIPT----VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSF 982
+ T ++FG ++G IG IA + + L+ LQ L + V GLN +R F
Sbjct: 314 GLSSEKTNRFALLFGNLDGGIGCIAPIDELTFRRLQSLQRKLVDAVPHVCGLNPRSFRQF 373
Query: 983 NNEKK--TVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELT 1033
++ K N +D +L+ + LS ++++ + + ++ +++
Sbjct: 374 HSNGKGHRPGPDNIIDFELLAHYEMLSLDEQLDVAQQIGTTRSQILSNFSDIS 426
>gi|260835071|ref|XP_002612533.1| hypothetical protein BRAFLDRAFT_120973 [Branchiostoma floridae]
gi|229297910|gb|EEN68542.1| hypothetical protein BRAFLDRAFT_120973 [Branchiostoma floridae]
Length = 1003
Score = 88.6 bits (218), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 110/470 (23%), Positives = 198/470 (42%), Gaps = 82/470 (17%)
Query: 618 VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQS 672
V+ PF++ P L + GEL I + +R +PL P + + +
Sbjct: 560 VTCFSPFHNVNCPKGFLYFNRGGELRISVLPTHLSYDAPWPVRKVPLRCTPHFVAYHMEC 619
Query: 673 RTFAICSLKNQSC---------------AEESEMHFVRLLDDQTFEFISTYPLD------ 711
+ +A+ + + C E+ + + +LD + +S +
Sbjct: 620 KVYAVAASTFEMCNRIPRMAGDEKEYDAVEKDDRYIYPMLDKFNIQLMSPVSWEIIPNTR 679
Query: 712 --TFEYGCSILSCSFSDDSN-VYYCVGTAYVL------PEENEPTKGRILVFIVEDGKLQ 762
E + +CSF N V + G +L PE +P + K++
Sbjct: 680 GMQLEENYAECTCSFLVGINFVLFVAGQIVILDVIEVVPEPGQP---------LTKNKIK 730
Query: 763 LIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDD---GTRELQSECGHHGHILAL 819
+ KE KG V +L NG LL+AI QKI L W R++ G + ++ H I
Sbjct: 731 ELYGKEQKGPVSALCGCNGYLLSAIGQKIFL--WEFRNNDLIGVAFIDTQVYIHTAI--- 785
Query: 820 YVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG----- 874
+ +++++ D+ KSISLL Y+ D VE D+ +G
Sbjct: 786 ---SIKNYVILADVFKSISLLRYQ-----------DMRPLETYCVEFFVDNAQIGFLVSD 831
Query: 875 AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRF-RHGSLVM-----RLPDSD- 927
A+ NF L++ + + + +R L ++++G VN F R +M R D+D
Sbjct: 832 AQKNFLLYSYQPEARESYGGQR--LVRRADFNVGSHVNTFFRVRCKIMDPSGERRRDADT 889
Query: 928 VGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSF-NNEK 986
V + +F T++G +G + + + Y L LQ L + GLN + +R +N +
Sbjct: 890 VAKRHVTMFATLDGGLGALLPMAEKTYRRLLMLQNTLMTHMPFPAGLNPKAFRMLKHNHR 949
Query: 987 KTVDA-KNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRL 1035
++A +N LDG+L+ FL LS E+++ + S E + + + ++ RL
Sbjct: 950 SLINACRNILDGELLWKFLHLSVVERSELARKIGTSPETITEDLMDIDRL 999
>gi|67600754|ref|XP_666354.1| CG13900 gene product [Cryptosporidium hominis TU502]
gi|54657334|gb|EAL36124.1| CG13900 gene product [Cryptosporidium hominis]
Length = 1318
Score = 88.6 bits (218), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 124/565 (21%), Positives = 232/565 (41%), Gaps = 81/565 (14%)
Query: 136 RHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVP---PPLCGVLIIGEETIVYCS--- 189
+H+ +EV L ++V SQ D+ + LI VP GVL+ + I+YC
Sbjct: 204 KHLIFWEVDL-GLNYVSRKSSQVITDS-SHTLISVPGGNDGPSGVLVCDYKGIIYCKVGH 261
Query: 190 ANAFKAIPIRPSITKAYGRVDADGSRY--------LLGDHAGLLHLLVITHEKEKVTGLK 241
+N F P R + YG + S + L+ G ++ + + H + V ++
Sbjct: 262 SNIFCPYPFRFGDSSEYGTMIVASSLHKLKGFFLILVQTELGDIYRINLIHNEGIVKEMR 321
Query: 242 IELLGETSIASTISYLDNAVVYIGSSYGDS---QLIKL---------NLQPDA---KGSY 286
I + +++ L + ++ +G+ Q++ L + PD+ K Y
Sbjct: 322 IYYYDTIPVCNSLLLLRSGFLFASHEFGNHTNYQIVSLGDDKTDPCTSSLPDSNDLKRVY 381
Query: 287 --------VEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINE 338
+ E ++L PI D V+D G Q+V G +LR+ G + E
Sbjct: 382 FRPRNCQCIRKSEEILSLSPITDIKVIDTNNDGTPQIVATCGRGPRSTLRVCSYGKNVEE 441
Query: 339 QASVELQG-IKGMWSLRSSTD-------------DPFDTFLVVSFISETRILAMNLEDEL 384
A L G + +W+L++ D D ++++SFI R L + + + +
Sbjct: 442 IAENPLPGRPRCIWTLKNGIDPSLSGSQAEAAILDNIHHYIIISFID--RSLVLTIGEHV 499
Query: 385 EETEIEGFCSQTQTLFCHDAI-YNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVAT 443
EET F T++ I YN +QV V+L+ ++ +WK+P + A
Sbjct: 500 EETNDTLFTLNETTMYAASMIFYNSFLQVLETHVKLI---IQDRIYDWKTPDSRKIIAAD 556
Query: 444 ANASQVLLATGGGHLVYLEI-----------GDGILTEVKHAQLEYEISCLDINPIGENP 492
+N QV LA GG +V LE+ G G L EV ++ EI C+ I + +
Sbjct: 557 SNGRQVSLALEGGLIVILELNVNGVSGITNTGIGGLVEVCRREITCEIICIGIQQLSYSG 616
Query: 493 SY-SQIAAVGMWTDISVRIFSLPDLNLITKEHL-----GGEIIPRSVLLCAFEGISYLLC 546
S VG T+ ++R++ + K+ IP +V L +L+
Sbjct: 617 QMRSDYVVVGTSTENALRLYKIDSAEKRLKQTCTQILPNSNSIPENVQLYHSNKYGHLIL 676
Query: 547 ALG--DGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKN---TTHVFAASDRPTVI 601
+G G +L+ ++ G ++D + LG + + + ++ + S RP ++
Sbjct: 677 FVGLTTGVILSCKVDASNGSISDPRSKYLGNRGVNICRIMREDFGGEMSLVCMSSRPWLV 736
Query: 602 YSSNKKLLYSNVNLKEVSHMCPFNS 626
S + ++ + + + + P N+
Sbjct: 737 DSQTSGVNFTPLQYRCIDSIAPLNT 761
Score = 82.8 bits (203), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 93/391 (23%), Positives = 164/391 (41%), Gaps = 69/391 (17%)
Query: 694 VRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLP------------ 741
VR+++ ++ E I PLDT E S C F + +GT Y +
Sbjct: 947 VRIVNLKSMETIQLIPLDTNEGCISACVCKFDELDLPCLVLGTVYGMKLNRGYNNNNNSN 1006
Query: 742 ----------EENEPTKGRILVFIVEDGK--LQLIAEKETKGAVYSLNAFNGKLLAAINQ 789
+EN + G + D +L+ + + +L + G+LL IN+
Sbjct: 1007 NNTDSAKNNDKENHDSFGAAIKIFKYDSNYNFELVHITPIENSATALTGWRGRLLVGINK 1066
Query: 790 KIQLY----KWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHE 845
+++Y K +LR R + +++ D I GD+ + + + +
Sbjct: 1067 TLRVYSLGKKRLLRKSEYRNIPQGL--------TWIKVVNDRIFAGDISNGVLVFKFNNT 1118
Query: 846 EGAIEERARDYNANWM-SAVEILD----------DDIYLG-----AENNFNL---FTVRK 886
A+D W+ SA E+LD D+I + A ++F+ FT
Sbjct: 1119 SNQFILVAKDPMPRWLTSACEVLDYHTIAVSDKFDNIIVSRVPVEASDDFSFVTSFTDNN 1178
Query: 887 NSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVI 946
NS+ + ++ V ++HLG+ V + L P S +I+GTV G IG +
Sbjct: 1179 NSQSSALMRTHQINTVAQFHLGDIVTCLQKSQLT---PTS----AEAIIYGTVLGSIGSL 1231
Query: 947 AS-LPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLD 1005
+ L +E L KL+ LRK + +H +RS+ + N +DGD ++F
Sbjct: 1232 SPILNNEDIELLSKLEILLRKQKSTLLSRDHLMFRSYYSP-----VHNVIDGDFCQTFTI 1286
Query: 1006 LSRTRMDEISKTMNVSVEELCKRVEEL-TRL 1035
L EI+ ++V+VEE+ K++++ TRL
Sbjct: 1287 LDSKIQSEIASKLDVTVEEIYKKLDDYKTRL 1317
>gi|242075246|ref|XP_002447559.1| hypothetical protein SORBIDRAFT_06g003570 [Sorghum bicolor]
gi|241938742|gb|EES11887.1| hypothetical protein SORBIDRAFT_06g003570 [Sorghum bicolor]
Length = 389
Score = 88.6 bits (218), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 87/347 (25%), Positives = 155/347 (44%), Gaps = 33/347 (9%)
Query: 707 TYPLDTFEYGCSI----LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILV--FIVEDGK 760
T P+ +FE ++ L + + ++ +GTAYV E+ +GR+L+ F +
Sbjct: 48 TIPMQSFENALTVRIVTLQNTSTKENETLMAIGTAYV-QGEDVAARGRVLLYSFSRSENS 106
Query: 761 LQLIAE---KETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHIL 817
L+ E KE+KGAV ++ + G LL A KI L KW EL + + +
Sbjct: 107 QNLVTEVYSKESKGAVSAVASLQGHLLIASGPKITLNKWT-----GSELTAVAFYDAPLH 161
Query: 818 ALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG--- 874
+ + +F++ GD+ KSI L +K + + A+D+ + A E L D L
Sbjct: 162 VVSLNIVKNFVLFGDIHKSIYFLSWKEQGSQLNLLAKDFGSLDCFATEFLIDGSTLSLVV 221
Query: 875 --AENNFNLFTVRKNSEGATDEERG-RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQI 931
++ N +F + + +G +L E+H+G V++F + LP +
Sbjct: 222 SDSDKNVQIFYY---APKMVESWKGQKLLSRAEFHVGAHVSKFLR---LQMLPTQGLASE 275
Query: 932 PT----VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSF--NNE 985
T ++FGT++G IG IA + + L+ LQ L + V GLN +R F N +
Sbjct: 276 KTNRFALVFGTLDGGIGCIAPVDELTFRRLQSLQRKLVDAVPHVCGLNPRSFRHFKSNGK 335
Query: 986 KKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
N +D +L+ + LS EI++ + + ++ +
Sbjct: 336 AHRPGPDNIIDFELLSHYEMLSLEEQLEIAQQIGTTRSQILSNFSDF 382
>gi|321475208|gb|EFX86171.1| hypothetical protein DAPPUDRAFT_313209 [Daphnia pulex]
Length = 1260
Score = 88.2 bits (217), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 130/611 (21%), Positives = 254/611 (41%), Gaps = 132/611 (21%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGD------VSDRI 59
++G++ ++ G +QD + ++ K ++++D S L T ++ + + + I
Sbjct: 74 LFGKVMSIAAVSLPGSSQDTILMSFAHAKLSLIEYDPVSDNLKTLSLHNFEVVSILDEGI 133
Query: 60 GRPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPF------DNKGQLKEAFNIRLEELQ--- 110
G +I + DP+ R L ++ ++PF D+ L ++ I+L +L+
Sbjct: 134 GSNHKIPEIRV-DPEGRCAALLIFRNTLAILPFRKDSAHDSNVTLS-SYIIKLTDLEERV 191
Query: 111 --VLDIKFLYGCAKPTIVVLYQD----------NKDARHVKTYEVALKDKDFVEGP-WSQ 157
V+D++FL+G +PT+++LY+ +D V+L + V WS
Sbjct: 192 DNVIDVQFLHGYYEPTLIILYEPVGTFPGRIAVRQDT--CNMVAVSLNTQQRVHPIIWSL 249
Query: 158 NNLDNGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAFKAIPIRPSITK 204
N+L L+PVP PL G LI+ +++Y + A+ + P++P
Sbjct: 250 NSLPFDCSQLLPVPKPLGGALIMAVNSVIYVNQSVPPYGVSVNSIADHCTSFPLKPYEGS 309
Query: 205 AYGRVDADGS-----RYLLGDHAGLLHLLVITHEKEK-VTGLKIELLGETSIASTISYLD 258
G A + R +L G L++L + + + V +E + + + + D
Sbjct: 310 RIGLDCARAAFLQYDRVVLSLKGGELYVLTLFADSMRSVRKFHLEKAAASVLTTCLCICD 369
Query: 259 NAVVYIGSSYGDSQLIKLNLQ--------------------------------------- 279
N +++GS G+S L+ +
Sbjct: 370 N-YLFLGSRLGNSLLLAFQTKDYNQYATPFAAKKPKMEQFSLLFDQELDHLDEEEIDNYL 428
Query: 280 -------PDAKG-SY-VEVLERYVNLGPIVDFCV-------VDLERQGQG---QVVTCSG 320
D+K SY EV + +N+GP V D +++ ++VT SG
Sbjct: 429 YGEDHESTDSKAISYQFEVCDSLLNIGPCGQMAVGEPASTCTDFDKKSPDPDVEIVTTSG 488
Query: 321 AYKDGSLRIVRNGIGINEQASVELQGIKGMWSL---RSSTDDPFDTFLVVSFISETRILA 377
K+G++ +++ + + EL + M+++ R++ D T+L++S T +L
Sbjct: 489 YGKNGAICVLQRTMKPQVVTTFELPEVSDMFTVFASRNNEDAIMHTYLLLSRADSTMVLQ 548
Query: 378 MNLEDELEETEIEGFCSQTQTLFCHDAIYNQ-LVQVTSGSVRLVSSTSRELRNEWKSPPG 436
E+ E + GF + T+ + N+ +VQV SVRL+ +T+ ++ E
Sbjct: 549 TG--QEINEMDQSGFSVTSPTILAANLGNNRFIVQVCPTSVRLLDATATVIQ-ELVMDSD 605
Query: 437 YSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQ 496
+ + A+A+ V + T G + G+LT V+ +QLE L N S
Sbjct: 606 FLITSASASDPYVAVLTENGRI-------GLLTFVEGSQLEMIFPVLSKN--------SP 650
Query: 497 IAAVGMWTDIS 507
+ V ++ DIS
Sbjct: 651 VVCVCLYRDIS 661
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 52/182 (28%), Positives = 88/182 (48%), Gaps = 27/182 (14%)
Query: 739 VLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWML 798
V+PE +P + K++ + K+ KG V ++++ G L+AAI QKI Y W L
Sbjct: 1092 VVPEPGQP---------LTKNKIKTLYAKDQKGPVAAISSVCGYLVAAIGQKI--YLWQL 1140
Query: 799 RDD---GTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARD 855
++D G + +E H +L + FI+ D+ KS+S+L ++ E + ARD
Sbjct: 1141 KNDDLVGIAFIDTEIYIH-QLLNI-----KSFILAADVYKSVSILRFQEEYRTLCIVARD 1194
Query: 856 YNANWMSAVEILDDDIYLG-----AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEF 910
Y + AV+ D+ LG AE N L+ + E + RL ++H+G+
Sbjct: 1195 YQPLEVMAVDYYIDNTQLGFLVSDAEKNLILYMYQ--PEARESQGGHRLIRKADFHVGQV 1252
Query: 911 VN 912
V+
Sbjct: 1253 VS 1254
>gi|195334368|ref|XP_002033855.1| GM20208 [Drosophila sechellia]
gi|194125825|gb|EDW47868.1| GM20208 [Drosophila sechellia]
Length = 1455
Score = 88.2 bits (217), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 108/436 (24%), Positives = 198/436 (45%), Gaps = 64/436 (14%)
Query: 654 IRSIPLGEHPRRICHQEQSRTFAICSL-------------KNQSCAEESE-MHFVR---- 695
+R +PL PR++ + ++R + + + +++ +EES F+
Sbjct: 1026 VRKVPLRCTPRQLVYHRENRVYCLITQTEEPMTKYYRFNGEDKELSEESRGERFIYPIGS 1085
Query: 696 -----LLDDQTFEFISTYPLDTFE-----YGCSILSCSFSDDSN---VYYCVGTAYVLPE 742
L+ +T+E + + TFE I+ S+ + Y C+GT +
Sbjct: 1086 QFEMVLISPETWEIVPDASI-TFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNY-S 1143
Query: 743 ENEPTKGRILVF-----IVEDGK------LQLIAEKETKGAVYSLNAFNGKLLAAINQKI 791
E+ ++G I ++ + E GK ++ I +KE KG V +++ G L+ + QKI
Sbjct: 1144 EDITSRGNIHIYDIIEVVPEPGKPMTKFKIKEIFKKEQKGPVSAISDVLGFLVTGLGQKI 1203
Query: 792 QLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEE 851
Y W LRD +L +I + T I + D+ KSISLL ++ E +
Sbjct: 1204 --YIWQLRDG---DLIGVAFIDTNIYVHQIITVKSLIFIADVYKSISLLRFQEEYRTLSL 1258
Query: 852 RARDYNANWMSAVEILDDDIYLG-----AENNFNLFTVRKNSEGATDEERGRLEVVGEYH 906
+RD+N + +E + D+ LG AE N ++ + + + ++ L +YH
Sbjct: 1259 ASRDFNPLEVYGIEFMVDNSNLGFLVTDAERNLIVYMYQPEARESLGGQK--LLRKADYH 1316
Query: 907 LGEFVNR-FR----HGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQ 961
LG+ VN FR L R P + V++GT++G +G LP + Y LQ
Sbjct: 1317 LGQVVNTMFRVQCHQKGLHQRQPFLYENK-HFVVYGTLDGALGYCLPLPEKVYRRFLMLQ 1375
Query: 962 TNLRKVIKGVGGLNHEQWRSFNNEKKT--VDAKNFLDGDLIESFLDLSRTRMDEISKTMN 1019
L + + GLN +++R+ + KK ++ +DGDLI S+ ++ + +E++K +
Sbjct: 1376 NVLLSYQEHLCGLNPKEYRTLKSSKKQGINPSRCIIDGDLIWSYRLMANSERNEVAKKIG 1435
Query: 1020 VSVEELCKRVEELTRL 1035
EE+ + E+ RL
Sbjct: 1436 TRTEEILGDLLEIERL 1451
Score = 56.6 bits (135), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 72/332 (21%), Positives = 136/332 (40%), Gaps = 65/332 (19%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
+YG + +L+ G +D L I+ + K VLQ D ++ L T ++ + R
Sbjct: 76 LYGNVMSLQCVSLAGAMRDALLISFKDAKLSVLQHDPDTFALKTLSLHYFEEDDIRGGWT 135
Query: 66 GQIGI----IDPDCRLIGLHLYDGLFKVIPFDNKGQLKE--------------------- 100
G+ + +DPD R + +Y V+PF L E
Sbjct: 136 GRYFVPTVRVDPDSRCAVMLVYGKRLVVLPFRKDNSLDEIELADVKPIKKAPTAMVSRTP 195
Query: 101 ---AFNIRLEEL-----QVLDIKFLYGCAKPTIVVLYQDNKDAR-HVKTYE-------VA 144
++ I L +L VLDI+FL+G +PT+++LY+ + +K ++
Sbjct: 196 IMASYLIALRDLDEKIDNVLDIQFLHGYYEPTLLILYEPVRTCPGRIKVRSDTCVLVAIS 255
Query: 145 LKDKDFVEG-PWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVY-------------CSA 190
L + V W+ N+L + P+ P+ G L++ ++Y SA
Sbjct: 256 LNIQQRVHPIIWTVNSLPFDCLQVYPIQKPIGGCLVMTVNAVIYLNQSVPPYGVSLNSSA 315
Query: 191 NAFKAIPIRP------SITKA-YGRVDADGSRYLLGDHAGLLHLLVITHEKEK-VTGLKI 242
+ A P++P S+ A + +D D + ++ G L++L + + + V
Sbjct: 316 DNSTAFPLKPQDGVRISLDCANFAFIDVD--KLVISLRTGDLYVLTLCVDSMRTVRNFHF 373
Query: 243 ELLGETSIASTISYLDNAVVYIGSSYGDSQLI 274
+ + S I L + +++GS G+S L+
Sbjct: 374 HKAAASVLTSCICVLHSEYIFLGSRLGNSLLL 405
Score = 42.0 bits (97), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 43/159 (27%), Positives = 71/159 (44%), Gaps = 26/159 (16%)
Query: 288 EVLERYVNLGPIVDFCV---VDLERQG-------------QGQVVTCSGAYKDGSLRIVR 331
EV + +N+ PI C V+ E G + ++V +G K+G+L +
Sbjct: 485 EVCDSLMNVAPINYMCAGERVEFEEDGVTLRPHAESLQDLKIELVAATGHSKNGALSVFV 544
Query: 332 NGIGINEQASVELQGIKGMWSL------RSSTDDPFDTFLVVSFISETRILAMNLEDELE 385
N + S EL G +W++ +SS +D D F+ +S + T +L E+
Sbjct: 545 NCLNPQIITSFELDGCLDVWTVFDDATKKSSRNDQHD-FMFLSQRNSTLVLQTG--QEIN 601
Query: 386 ETEIEGFCSQTQTLFCHDAIYNQ-LVQVTSGSVRLVSST 423
E E GF T+F + + +VQVT+ VRL+ T
Sbjct: 602 EIENTGFTVNQPTIFVGNLGQQRFIVQVTTRHVRLLQGT 640
>gi|45552619|ref|NP_995833.1| cleavage and polyadenylation specificity factor 160, isoform A
[Drosophila melanogaster]
gi|18203551|sp|Q9V726.1|CPSF1_DROME RecName: Full=Cleavage and polyadenylation specificity factor subunit
1; AltName: Full=Cleavage and polyadenylation specificity
factor 160 kDa subunit; Short=CPSF 160 kDa subunit;
Short=dCPSF 160
gi|7303176|gb|AAF58240.1| cleavage and polyadenylation specificity factor 160, isoform A
[Drosophila melanogaster]
Length = 1455
Score = 88.2 bits (217), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 108/436 (24%), Positives = 198/436 (45%), Gaps = 64/436 (14%)
Query: 654 IRSIPLGEHPRRICHQEQSRTFAICSL-------------KNQSCAEESE-MHFVR---- 695
+R +PL PR++ + ++R + + + +++ +EES F+
Sbjct: 1026 VRKVPLRCTPRQLVYHRENRVYCLITQTEEPMTKYYRFNGEDKELSEESRGERFIYPIGS 1085
Query: 696 -----LLDDQTFEFISTYPLDTFE-----YGCSILSCSFSDDSN---VYYCVGTAYVLPE 742
L+ +T+E + + TFE I+ S+ + Y C+GT +
Sbjct: 1086 QFEMVLISPETWEIVPDASI-TFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNY-S 1143
Query: 743 ENEPTKGRILVF-----IVEDGK------LQLIAEKETKGAVYSLNAFNGKLLAAINQKI 791
E+ ++G I ++ + E GK ++ I +KE KG V +++ G L+ + QKI
Sbjct: 1144 EDITSRGNIHIYDIIEVVPEPGKPMTKFKIKEIFKKEQKGPVSAISDVLGFLVTGLGQKI 1203
Query: 792 QLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEE 851
Y W LRD +L +I + T I + D+ KSISLL ++ E +
Sbjct: 1204 --YIWQLRDG---DLIGVAFIDTNIYVHQIITVKSLIFIADVYKSISLLRFQEEYRTLSL 1258
Query: 852 RARDYNANWMSAVEILDDDIYLG-----AENNFNLFTVRKNSEGATDEERGRLEVVGEYH 906
+RD+N + +E + D+ LG AE N ++ + + + ++ L +YH
Sbjct: 1259 ASRDFNPLEVYGIEFMVDNSNLGFLVTDAERNIIVYMYQPEARESLGGQK--LLRKADYH 1316
Query: 907 LGEFVNR-FR----HGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQ 961
LG+ VN FR L R P + V++GT++G +G LP + Y LQ
Sbjct: 1317 LGQVVNTMFRVQCHQKGLHQRQPFLYENK-HFVVYGTLDGALGYCLPLPEKVYRRFLMLQ 1375
Query: 962 TNLRKVIKGVGGLNHEQWRSFNNEKK--TVDAKNFLDGDLIESFLDLSRTRMDEISKTMN 1019
L + + GLN +++R+ + KK ++ +DGDLI S+ ++ + +E++K +
Sbjct: 1376 NVLLSYQEHLCGLNPKEYRTLKSSKKQGINPSRCIIDGDLIWSYRLMANSERNEVAKKIG 1435
Query: 1020 VSVEELCKRVEELTRL 1035
EE+ + E+ RL
Sbjct: 1436 TRTEEILGDLLEIERL 1451
Score = 56.6 bits (135), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 72/332 (21%), Positives = 136/332 (40%), Gaps = 65/332 (19%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
+YG + +L+ G +D L I+ + K VLQ D ++ L T ++ + R
Sbjct: 76 LYGNVMSLQCVSLAGAMRDALLISFKDAKLSVLQHDPDTFALKTLSLHYFEEDDIRGGWT 135
Query: 66 GQIGI----IDPDCRLIGLHLYDGLFKVIPFDNKGQLKE--------------------- 100
G+ + +DPD R + +Y V+PF L E
Sbjct: 136 GRYFVPTVRVDPDSRCAVMLVYGKRLVVLPFRKDNSLDEIELADVKPIKKAPTAMVSRTP 195
Query: 101 ---AFNIRLEEL-----QVLDIKFLYGCAKPTIVVLYQDNKDAR-HVKTYE-------VA 144
++ I L +L VLDI+FL+G +PT+++LY+ + +K ++
Sbjct: 196 IMASYLIALRDLDEKIDNVLDIQFLHGYYEPTLLILYEPVRTCPGRIKVRSDTCVLVAIS 255
Query: 145 LKDKDFVEG-PWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVY-------------CSA 190
L + V W+ N+L + P+ P+ G L++ ++Y SA
Sbjct: 256 LNIQQRVHPIIWTVNSLPFDCLQVYPIQKPIGGCLVMTVNAVIYLNQSVPPYGVSLNSSA 315
Query: 191 NAFKAIPIRP------SITKA-YGRVDADGSRYLLGDHAGLLHLLVITHEKEK-VTGLKI 242
+ A P++P S+ A + +D D + ++ G L++L + + + V
Sbjct: 316 DNSTAFPLKPQDGVRISLDCANFAFIDVD--KLVISLRTGDLYVLTLCVDSMRTVRNFHF 373
Query: 243 ELLGETSIASTISYLDNAVVYIGSSYGDSQLI 274
+ + S I L + +++GS G+S L+
Sbjct: 374 HKAAASVLTSCICVLHSEYIFLGSRLGNSLLL 405
Score = 43.5 bits (101), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 44/159 (27%), Positives = 72/159 (45%), Gaps = 26/159 (16%)
Query: 288 EVLERYVNLGPIVDFCV---VDLERQG-------------QGQVVTCSGAYKDGSLRIVR 331
EV + +N+ PI C V+ E G + ++V +G K+G+L +
Sbjct: 485 EVCDSLMNVAPINYMCAGERVEFEEDGVTLRPHAESLQDLKIELVAATGHSKNGALSVFV 544
Query: 332 NGIGINEQASVELQGIKGMWSL------RSSTDDPFDTFLVVSFISETRILAMNLEDELE 385
N I S EL G +W++ +SS +D D F+++S + T +L E+
Sbjct: 545 NCINPQIITSFELDGCLDVWTVFDDATKKSSRNDQHD-FMLLSQRNSTLVLQTG--QEIN 601
Query: 386 ETEIEGFCSQTQTLFCHDAIYNQ-LVQVTSGSVRLVSST 423
E E GF T+F + + +VQVT+ VRL+ T
Sbjct: 602 EIENTGFTVNQPTIFVGNLGQQRFIVQVTTRHVRLLQGT 640
>gi|397615212|gb|EJK63291.1| hypothetical protein THAOC_16062, partial [Thalassiosira oceanica]
Length = 322
Score = 88.2 bits (217), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 80/305 (26%), Positives = 134/305 (43%), Gaps = 41/305 (13%)
Query: 751 ILVFIVEDGKLQLIAEKETK------GAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTR 804
+L +V +LQL+ + G V +L F G+LL I + ++LY+ G R
Sbjct: 29 VLYRVVSGERLQLLHRTKVDDGSSGGGPVLALVHFQGRLLVGIGKSLRLYEM-----GKR 83
Query: 805 ELQSECGHHG-HILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSA 863
+L +C G + +Q GD VGD+M+S+ + Y + ARD +A ++
Sbjct: 84 QLLKKCELRGLPTMVKTLQAAGDRAFVGDMMQSMQFVRYDATANRLVLVARDRSARPITC 143
Query: 864 VEILDDDIYLGAENNFNLFTV---RKNSEGATD-------------EERGRLEVVGEYHL 907
E+LD + + N+ T+ R GA D + +LE + YH+
Sbjct: 144 QELLDVNTVAVGDKFGNVTTLRLPRGADTGAVDVSGTRALWDSSREDATPKLETLCTYHV 203
Query: 908 GEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRK 966
GE V SLV G ++I+ TV G IG + E F L++++R
Sbjct: 204 GEVVTSLTRASLV-------AGGAESLIYVTVTGRIGALVPFTSREDVEFYTSLESHVRS 256
Query: 967 VIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELC 1026
+ G + + +RSF K+ +DGDL E++ L + I+ M S E+
Sbjct: 257 EVPRPTGRDPQSYRSFY-----CPVKHVIDGDLCEAYGGLPYEARERIADQMERSTGEVM 311
Query: 1027 KRVEE 1031
K++E+
Sbjct: 312 KKLED 316
>gi|397627714|gb|EJK68584.1| hypothetical protein THAOC_10223, partial [Thalassiosira oceanica]
Length = 456
Score = 88.2 bits (217), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 80/305 (26%), Positives = 134/305 (43%), Gaps = 41/305 (13%)
Query: 751 ILVFIVEDGKLQLIAEKETK------GAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTR 804
+L +V +LQL+ + G V +L F G+LL I + ++LY+ G R
Sbjct: 163 VLYRVVSGERLQLLHRTKVDDGSSGGGPVLALVHFQGRLLVGIGKSLRLYEM-----GKR 217
Query: 805 ELQSECGHHG-HILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSA 863
+L +C G + +Q GD VGD+M+S+ + Y + ARD +A ++
Sbjct: 218 QLLKKCELRGLPTMVKTLQAAGDRAFVGDMMQSMQFVRYDATANRLVLVARDRSARPITC 277
Query: 864 VEILDDDIYLGAENNFNLFTV---RKNSEGATD-------------EERGRLEVVGEYHL 907
E+LD + + N+ T+ R GA D + +LE + YH+
Sbjct: 278 QELLDVNTVAVGDKFGNVTTLRLPRGADTGAVDVSGTRALWDSSREDATPKLETLCTYHV 337
Query: 908 GEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRK 966
GE V SLV G ++I+ TV G IG + E F L++++R
Sbjct: 338 GEVVTSLTRASLV-------AGGAESLIYVTVTGRIGALVPFTSREDVEFYTSLESHVRS 390
Query: 967 VIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELC 1026
+ G + + +RSF K+ +DGDL E++ L + I+ M S E+
Sbjct: 391 EVPRPTGRDPQSYRSFY-----CPVKHVIDGDLCEAYGGLPYEARERIADQMERSTGEVM 445
Query: 1027 KRVEE 1031
K++E+
Sbjct: 446 KKLED 450
>gi|422294117|gb|EKU21417.1| uv-damaged dna-binding protein [Nannochloropsis gaditana CCMP526]
Length = 192
Score = 88.2 bits (217), Expect = 2e-14, Method: Composition-based stats.
Identities = 48/106 (45%), Positives = 64/106 (60%), Gaps = 4/106 (3%)
Query: 915 RHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGL 974
R G V R D+G ++FGT G+IG I + E Y F L L KV+KGVGGL
Sbjct: 29 RSGGEVARGHVQDLG----LMFGTQQGMIGSILPISEEDYRFFVALTKCLNKVVKGVGGL 84
Query: 975 NHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV 1020
+HE++R F +K D + F+DGDLIESFL+L RM+E+ + M V
Sbjct: 85 SHEEYRRFLTDKAISDTQGFVDGDLIESFLELPTQRMEEVVELMRV 130
>gi|24653655|ref|NP_725397.1| cleavage and polyadenylation specificity factor 160, isoform B
[Drosophila melanogaster]
gi|15292103|gb|AAK93320.1| LD38533p [Drosophila melanogaster]
gi|21627189|gb|AAM68553.1| cleavage and polyadenylation specificity factor 160, isoform B
[Drosophila melanogaster]
Length = 1420
Score = 87.8 bits (216), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 108/436 (24%), Positives = 198/436 (45%), Gaps = 64/436 (14%)
Query: 654 IRSIPLGEHPRRICHQEQSRTFAICSL-------------KNQSCAEESE-MHFVR---- 695
+R +PL PR++ + ++R + + + +++ +EES F+
Sbjct: 991 VRKVPLRCTPRQLVYHRENRVYCLITQTEEPMTKYYRFNGEDKELSEESRGERFIYPIGS 1050
Query: 696 -----LLDDQTFEFISTYPLDTFE-----YGCSILSCSFSDDSN---VYYCVGTAYVLPE 742
L+ +T+E + + TFE I+ S+ + Y C+GT +
Sbjct: 1051 QFEMVLISPETWEIVPDASI-TFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNY-S 1108
Query: 743 ENEPTKGRILVF-----IVEDGK------LQLIAEKETKGAVYSLNAFNGKLLAAINQKI 791
E+ ++G I ++ + E GK ++ I +KE KG V +++ G L+ + QKI
Sbjct: 1109 EDITSRGNIHIYDIIEVVPEPGKPMTKFKIKEIFKKEQKGPVSAISDVLGFLVTGLGQKI 1168
Query: 792 QLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEE 851
Y W LRD +L +I + T I + D+ KSISLL ++ E +
Sbjct: 1169 --YIWQLRDG---DLIGVAFIDTNIYVHQIITVKSLIFIADVYKSISLLRFQEEYRTLSL 1223
Query: 852 RARDYNANWMSAVEILDDDIYLG-----AENNFNLFTVRKNSEGATDEERGRLEVVGEYH 906
+RD+N + +E + D+ LG AE N ++ + + + ++ L +YH
Sbjct: 1224 ASRDFNPLEVYGIEFMVDNSNLGFLVTDAERNIIVYMYQPEARESLGGQK--LLRKADYH 1281
Query: 907 LGEFVNR-FR----HGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQ 961
LG+ VN FR L R P + V++GT++G +G LP + Y LQ
Sbjct: 1282 LGQVVNTMFRVQCHQKGLHQRQPFLYENK-HFVVYGTLDGALGYCLPLPEKVYRRFLMLQ 1340
Query: 962 TNLRKVIKGVGGLNHEQWRSFNNEKKT--VDAKNFLDGDLIESFLDLSRTRMDEISKTMN 1019
L + + GLN +++R+ + KK ++ +DGDLI S+ ++ + +E++K +
Sbjct: 1341 NVLLSYQEHLCGLNPKEYRTLKSSKKQGINPSRCIIDGDLIWSYRLMANSERNEVAKKIG 1400
Query: 1020 VSVEELCKRVEELTRL 1035
EE+ + E+ RL
Sbjct: 1401 TRTEEILGDLLEIERL 1416
Score = 52.4 bits (124), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 68/324 (20%), Positives = 126/324 (38%), Gaps = 84/324 (25%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
+YG + +L+ G +D L I+ + K VLQ D ++ L T ++ + R
Sbjct: 76 LYGNVMSLQCVSLAGAMRDALLISFKDAKLSVLQHDPDTFALKTLSLHYFEEDDIRGGWT 135
Query: 66 GQIGI----IDPDCRLIGLHLYDGLFKVIPFDNKGQLKE--------------------- 100
G+ + +DPD R + +Y V+PF L E
Sbjct: 136 GRYFVPTVRVDPDSRCAVMLVYGKRLVVLPFRKDNSLDEIELADVKPIKKAPTAMVSRTP 195
Query: 101 ---AFNIRLEEL-----QVLDIKFLYGCAKPTIVVLYQDNKDAR-HVKTYEVALKDKDFV 151
++ I L +L VLDI+FL+G +PT+++LY+ + +K Y
Sbjct: 196 IMASYLIALRDLDEKIDNVLDIQFLHGYYEPTLLILYEPVRTCPGRIKVY---------- 245
Query: 152 EGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVY-------------CSANAFKAIPI 198
P+ P+ G L++ ++Y SA+ A P+
Sbjct: 246 -----------------PIQKPIGGCLVMTVNAVIYLNQSVPPYGVSLNSSADNSTAFPL 288
Query: 199 RP------SITKA-YGRVDADGSRYLLGDHAGLLHLLVITHEKEK-VTGLKIELLGETSI 250
+P S+ A + +D D + ++ G L++L + + + V + +
Sbjct: 289 KPQDGVRISLDCANFAFIDVD--KLVISLRTGDLYVLTLCVDSMRTVRNFHFHKAAASVL 346
Query: 251 ASTISYLDNAVVYIGSSYGDSQLI 274
S I L + +++GS G+S L+
Sbjct: 347 TSCICVLHSEYIFLGSRLGNSLLL 370
Score = 43.9 bits (102), Expect = 0.54, Method: Compositional matrix adjust.
Identities = 44/159 (27%), Positives = 72/159 (45%), Gaps = 26/159 (16%)
Query: 288 EVLERYVNLGPIVDFCV---VDLERQG-------------QGQVVTCSGAYKDGSLRIVR 331
EV + +N+ PI C V+ E G + ++V +G K+G+L +
Sbjct: 450 EVCDSLMNVAPINYMCAGERVEFEEDGVTLRPHAESLQDLKIELVAATGHSKNGALSVFV 509
Query: 332 NGIGINEQASVELQGIKGMWSL------RSSTDDPFDTFLVVSFISETRILAMNLEDELE 385
N I S EL G +W++ +SS +D D F+++S + T +L E+
Sbjct: 510 NCINPQIITSFELDGCLDVWTVFDDATKKSSRNDQHD-FMLLSQRNSTLVLQTG--QEIN 566
Query: 386 ETEIEGFCSQTQTLFCHDAIYNQ-LVQVTSGSVRLVSST 423
E E GF T+F + + +VQVT+ VRL+ T
Sbjct: 567 EIENTGFTVNQPTIFVGNLGQQRFIVQVTTRHVRLLQGT 605
>gi|428182571|gb|EKX51431.1| hypothetical protein GUITHDRAFT_134365 [Guillardia theta CCMP2712]
Length = 1295
Score = 87.8 bits (216), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 115/493 (23%), Positives = 205/493 (41%), Gaps = 80/493 (16%)
Query: 269 GDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQGQGQ---VVTCSGAYKDG 325
GDS ++ + L + +E LER LG DF V+ + G V+ CSG K
Sbjct: 439 GDSAVMMMRL--NGNEIEIETLERLTMLGVTTDFTVLSDKNVNGGDEFDVLVCSGTGKHA 496
Query: 326 SLRIVRNGIGINEQASVE---LQGIKGMWSLRSSTDDPFD-TFLVVSFISETRILAMNLE 381
S+R +R G+ + E GI G++++ + + D +FL ++F + R+L +
Sbjct: 497 SIRRLRYGLPVETHVRSEKGLCDGIVGLYTI--TFEQKHDQSFLFMAFTTGCRVLTVG-- 552
Query: 382 DELEE-TEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSS------------TSRELR 428
EL + T+ G TL L+QV+ V ++ S+ R
Sbjct: 553 SELSDVTDDLGLNPTALTLHAAANSDGHLIQVSDEEVIVIQQPVLDSMDLDVIPCSQPER 612
Query: 429 NEWKSPPGYSVNVATANASQVLLATGGGHLVYL-----EIGDG----ILTEVKHAQLEYE 479
W+ ++V+ + + V+ T GG +++ + G I E+ + E
Sbjct: 613 AVWRPAESTKISVSASLGNLVMCCTCGGTSIHILDTTRDQGSAGEFQIRGEITSFRSSEE 672
Query: 480 ISCLDINPIGENPSYS-QIAAVGMWTDI------SVRIFSLP------------DLNLIT 520
+SC+ + +G + S + +I+ +G + D ++I+SL DL + +
Sbjct: 673 VSCMAMTSLGAHGSETRKISIIGTYADALKARAGKIKIYSLGEEKREVELAQSIDLGIYS 732
Query: 521 KEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGEL----TDRKKVSLGTQ 576
L G + +L G +L+ L +G ++ F LN + L T R+ L
Sbjct: 733 GSDLSGSVPQSFGILPEPAGGVFLVVGLRNGKVIIFTLNALSLPLVGLGTTRR---LAES 789
Query: 577 PITLRTFSSKNTTHVFA-ASDRPTVIYSSNKKLLYSNVNLKE--VSHMCPFNSAAFPDSL 633
PIT + +S + A D ++ S L + + + +SH PF A P +
Sbjct: 790 PITFVSITSYAGPSLIALVEDVAFLVTPSRASLQFQRIGFQPGGISHAAPFICDACPLGM 849
Query: 634 AIAKEGELTIGTIDDIQKLHIRSIPLG---EH------------PRRIC-HQEQSRTFAI 677
AI +G+L + +ID +KL +++IPL EH P R+ H +
Sbjct: 850 AIVVDGQLKLVSIDRCEKLDVKTIPLALESEHTESESKLEDNFMPARVVWHPDAKLVIVG 909
Query: 678 CSLKNQSCAEESE 690
C+ + C E E
Sbjct: 910 CNYSRKVCGAEEE 922
>gi|395740218|ref|XP_002819588.2| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
[Pongo abelii]
Length = 1388
Score = 87.8 bits (216), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 95/368 (25%), Positives = 163/368 (44%), Gaps = 46/368 (12%)
Query: 688 ESEMHFVRLLDDQTFEFI--STYPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVL 740
+ E ++L+ ++E I + L +E+ + + S + V Y GT +
Sbjct: 1015 QQEAFSIQLISPVSWEAIPNARIELQEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQ 1074
Query: 741 PEENEPTKGRILVF-----IVEDG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQ 789
EE +GRIL+ + E G K +++ EKE KG V +L NG L++AI Q
Sbjct: 1075 GEEVT-CRGRILIMDVIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQ 1133
Query: 790 KIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAI 849
KI + W LR EL + + + +FI+ D+MKSISLL Y+ E +
Sbjct: 1134 KI--FLWSLR---ASELTGMAFIDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTL 1188
Query: 850 EERARDYNANWMSAVEILDDDIYLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYH 906
+RD + +V+ + D+ LG ++ + NL E RL ++H
Sbjct: 1189 SLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFH 1248
Query: 907 LGEFVNRFRHGSLVMRLPDSDVGQ---IPTVIFG--------TVNGVIGVIASLPHEQYL 955
+G VN F R P + +V++ +V G IG++ + + Y
Sbjct: 1249 VGAHVNTF------WRTPCRGAAEGLSKKSVVWENKHITWLVSVRGGIGLLLPMQEKTYR 1302
Query: 956 FLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVD--AKNFLDGDLIESFLDLSRTRMDE 1013
L LQ L ++ GLN +R + +++T+ +N LDG+L+ +L LS E
Sbjct: 1303 RLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRTLQNAVRNVLDGELLNRYLYLSTMERSE 1362
Query: 1014 ISKTMNVS 1021
++K + +
Sbjct: 1363 LAKKIGTT 1370
Score = 40.4 bits (93), Expect = 5.9, Method: Compositional matrix adjust.
Identities = 41/172 (23%), Positives = 74/172 (43%), Gaps = 36/172 (20%)
Query: 288 EVLERYVNLGPIVDFCVVD---LERQGQG------QVVTCSGAYKDGSLRIVRNGIGINE 338
EV + +N+GP + + + L + Q ++V CSG K+G+L +++ I
Sbjct: 412 EVCDSILNIGPCANAAMGEPAFLSEEFQNSPEPDLEIVVCSGHGKNGALSVLQKSIRPQV 471
Query: 339 QASVELQGIKGMWSL-----RSSTDDP-------------------FDTFLVVSFISETR 374
+ EL G MW++ + D+P FL++S T
Sbjct: 472 VTTFELPGCYDMWTVIAPLRKEEEDNPKGEGTEQEPSTPEADDDGRRHGFLILSREDSTM 531
Query: 375 ILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQ-LVQVTSGSVRLVSSTSR 425
IL E+ E + GF +Q T+F + N+ +VQV+ +RL+ ++
Sbjct: 532 ILQTG--QEIMELDTSGFATQGPTVFAGNIGDNRYIVQVSPLGIRLLEGVNQ 581
>gi|193702313|ref|XP_001945086.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Acyrthosiphon pisum]
Length = 1335
Score = 87.8 bits (216), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 89/334 (26%), Positives = 148/334 (44%), Gaps = 43/334 (12%)
Query: 731 YYCVGTAYVLPEENEPTKGRILVFIVED-----------GKLQLIAEKETKGAVYSLNAF 779
Y +GT Y E+ ++GRI +F + D K+++I KE KG V ++
Sbjct: 1010 YIAMGTNYSY-SEDITSRGRIFLFDIIDVVPEPGKPLTKNKIKMIYAKEQKGPVTAITHV 1068
Query: 780 NGKLLAAINQKIQLYKWMLRDD---GTRELQSECGHHGHILALYVQTRGDFIVVGDLMKS 836
G L+ A+ QKI Y W L+D+ G + +E H +L++ I+V DL KS
Sbjct: 1069 VGFLVTAVGQKI--YIWQLKDNDLIGIAFIDTEVYVH-QMLSI-----KSLILVADLFKS 1120
Query: 837 ISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI---YLGAENNFNLFTVRKNSEGATD 893
I+LL ++ E + RD + + L D+ +L ++ + NL
Sbjct: 1121 ITLLRFQEEYRTLSLVCRDSKPLEVFDINFLIDNTELGFLASDRDQNLLLYLYQPMARES 1180
Query: 894 EERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVG----------QIPTVIFGTVNGVI 943
L G++++G VN F +R S V + ++ T++G I
Sbjct: 1181 YGGQHLVRRGDFNIGSNVNSF----FRLRCKQSTVAPDRREAIGSDKRHVTMYTTLDGSI 1236
Query: 944 GVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSF--NNEKKTVDAKNFLDGDLIE 1001
G I + + Y L LQ L K I + GLN + +RSF ++ A+ +DG+L+
Sbjct: 1237 GYIVPIHEKNYRRLLTLQNMLVKNITHLAGLNPKAYRSFKATAPERMNQARRVIDGELVW 1296
Query: 1002 SFLDLSRTRM-DEISKTMNVSVEELCKRVEELTR 1034
F+ R +EI+ + V EL + + EL R
Sbjct: 1297 MFVTCMNARQRNEIANKVGVKTIELLQDIYELDR 1330
Score = 69.7 bits (169), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 114/553 (20%), Positives = 228/553 (41%), Gaps = 117/553 (21%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
++G I L+ + D L ++ KF ++++D + L T ++ D + N
Sbjct: 65 LFGNIMCLQSVTLCPSSPDALLLSFSEAKFSLVEYDRDMHSLRTLSLHYFEDDKFK---N 121
Query: 66 GQ-------IGIIDPDCRLIGLHLYDGLFKVIPF-----DN--KGQLKEAFNIRLEEL-- 109
G + +DPD R + +Y F V+PF DN Q+ ++ I + ++
Sbjct: 122 GHTQHWSPPLIRVDPDGRCVVGLVYGSYFVVLPFGRTIDDNAKSAQVMPSYTIPISKIDP 181
Query: 110 ---QVLDIKFLYGCAKPTIVVLYQDNKDARHVKTY--------------EVALKDKDFVE 152
++D FL+G +PT+++LY+ VKT+ ++L + V
Sbjct: 182 KMNNIMDFDFLHGYYEPTLLILYEP------VKTFAGRIAVRKDTCAMVAISLNIQQHVH 235
Query: 153 GP-WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGR--- 208
WS ++L +I V P+ GVLI+ +++Y + + SI K
Sbjct: 236 PVIWSLDSLPYDCQKVIAVSRPIGGVLIMAVNSLIYLNQSVPPFGVALNSIAKTLTNFPL 295
Query: 209 ---------VDADGSRYLLGDHA------GLLHLLVITHEKEK-VTGLKIELLGETSIAS 252
+D + ++ D G L+++ + + + V E + + +
Sbjct: 296 GQQEDINLVLDRATATFISSDKLVTSLCNGDLYVITLYADSMRAVRSFHFEKCASSVLTT 355
Query: 253 TISYLDNAVVYIGSSYGDSQLIKL-------NLQPDAK-----------------GSYV- 287
I+ ++ +++GS G+S L++ + +P K GS V
Sbjct: 356 CITVCLDSYLFLGSRLGNSLLLRYYARSQSNDDEPSIKRKKTDETDEDLVELEVYGSEVQ 415
Query: 288 ----------EVLERYVNLGPIVDFCVVDL--------ERQGQGQVVTCSGAYKDGSLRI 329
EV + +N+GP + + + +++ SG K+G+L +
Sbjct: 416 TSICLESYSFEVCDSIINIGPCSQASIGEPAYISDEFSSDEHDVELLCTSGHGKNGALSV 475
Query: 330 VRNGIGINEQASVELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEI 389
+ I + L G K MW++ D F TF++++ + T IL E+ E +
Sbjct: 476 LHRSIKPQLVTTFHLDGYKDMWTVHGEND--FHTFMILTNVDSTLILQTG--QEINELDS 531
Query: 390 EGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRN---EWKSPPGYSVNVATANA 446
G+ ++ T+F + + ++QV SVRL++ S +L++ ++ SP ++ ++ N
Sbjct: 532 SGYATREHTVFVCN-MNKFVIQVLRYSVRLLNG-SEQLQSVSLDFGSP---IIHGSSCNP 586
Query: 447 SQVLLATGGGHLV 459
VLL G +V
Sbjct: 587 YAVLLTEDGQVIV 599
>gi|308449821|ref|XP_003088088.1| hypothetical protein CRE_07177 [Caenorhabditis remanei]
gi|308249634|gb|EFO93586.1| hypothetical protein CRE_07177 [Caenorhabditis remanei]
Length = 236
Score = 87.4 bits (215), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 54/188 (28%), Positives = 99/188 (52%), Gaps = 6/188 (3%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRP 62
++PIYG++ T+ L R E + L + TE+++ VL + ++ITR G ++D+ GR
Sbjct: 54 EIPIYGQVLTMALVRCKREKRQSLVVVTEKWQMAVLTY--RDGKVITRTAGALADQSGRA 111
Query: 63 TDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL-YGCA 121
+DN + L+ + Y+G K+I ++ G +FN+R + V D KF+ G
Sbjct: 112 SDN-LFSLTIHRSGLVAIRAYEGSVKMIQWE-PGTDVRSFNVRFDYPNVSDFKFIDTGVD 169
Query: 122 KPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIG 181
+ D+ +H++ ++ + DK+ Q ++ A +LIPVP P+ GVL++
Sbjct: 170 DTYRIAFIYDDDHGKHLQFSDLNMHDKEL-HTFSRQASIAADASVLIPVPAPISGVLVLA 228
Query: 182 EETIVYCS 189
+I+Y S
Sbjct: 229 ANSILYKS 236
>gi|195485994|ref|XP_002091320.1| GE12310 [Drosophila yakuba]
gi|194177421|gb|EDW91032.1| GE12310 [Drosophila yakuba]
Length = 1455
Score = 87.4 bits (215), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 106/435 (24%), Positives = 200/435 (45%), Gaps = 62/435 (14%)
Query: 654 IRSIPLGEHPRRICHQEQSRTFAICSL-------------KNQSCAEESE-MHFVR---- 695
+R +PL PR++ + ++R + + + +++ +EES F+
Sbjct: 1026 VRKVPLRCTPRQLVYHRENRVYCLITQTEEPMTKYYRFNGEDKELSEESRGERFIYPIGS 1085
Query: 696 -----LLDDQTFEFI--STYPLDTFEY--GCSILSCSFSDDSN---VYYCVGTAYVLPEE 743
L+ +T+E + ++ + +E+ I+ S+ + Y C+GT + E
Sbjct: 1086 QFEMVLISPETWEIVPDASISFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNY-SE 1144
Query: 744 NEPTKGRILVF-----IVEDGK------LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQ 792
+ ++G I ++ + E GK ++ I +KE KG V +++ G L+ + QKI
Sbjct: 1145 DITSRGNIHIYDIIEVVPEPGKPMTKFKIKEIFKKEQKGPVSAISDVLGFLVTGLGQKI- 1203
Query: 793 LYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEER 852
Y W LRD +L +I + T I + D+ KSISLL ++ E +
Sbjct: 1204 -YIWQLRDG---DLIGVAFIDTNIYVHQIITVKSLIFIADVYKSISLLRFQEEYRTLSLA 1259
Query: 853 ARDYNANWMSAVEILDDDIYLG-----AENNFNLFTVRKNSEGATDEERGRLEVVGEYHL 907
+RD+N + +E + D+ LG AE N ++ + + + ++ L +YHL
Sbjct: 1260 SRDFNPLEVYGIEFMVDNSNLGFLVTDAERNLIVYMYQPEARESLGGQK--LLRKADYHL 1317
Query: 908 GEFVNR-FR----HGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQT 962
G+ VN FR L R P + V++GT++G +G LP + Y LQ
Sbjct: 1318 GQVVNTMFRVQCHQKGLHQRQPFLYENK-HFVVYGTLDGALGYCLPLPEKVYRRFLMLQN 1376
Query: 963 NLRKVIKGVGGLNHEQWRSFNNEKKT--VDAKNFLDGDLIESFLDLSRTRMDEISKTMNV 1020
L + + GLN +++R+ + KK ++ +DGDLI S+ ++ + +E++K +
Sbjct: 1377 VLLSYQEHLCGLNPKEYRTLKSFKKQGINPSRCVIDGDLIWSYRLMANSERNEVAKKIGT 1436
Query: 1021 SVEELCKRVEELTRL 1035
EE+ + E+ RL
Sbjct: 1437 RTEEILADLLEIERL 1451
Score = 56.2 bits (134), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 72/332 (21%), Positives = 136/332 (40%), Gaps = 65/332 (19%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
+YG + +L+ G +D L I+ + K VLQ D ++ L T ++ + R
Sbjct: 76 LYGNVMSLQCVSLAGAMRDALLISFKDAKLSVLQHDPDTFALKTLSLHYFEEDDIRGGWT 135
Query: 66 GQIGI----IDPDCRLIGLHLYDGLFKVIPFDNKGQLKE--------------------- 100
G+ + +DPD R + +Y V+PF L E
Sbjct: 136 GRYFVPTVRVDPDSRCAVMLVYGKRLVVLPFRKDNSLDEIELADVKPIKKAPTAMVSRTP 195
Query: 101 ---AFNIRLEEL-----QVLDIKFLYGCAKPTIVVLYQDNKDAR-HVKTYE-------VA 144
++ I L +L VLDI+FL+G +PT+++LY+ + +K ++
Sbjct: 196 IMASYLIALRDLDEKIDNVLDIQFLHGYYEPTLLILYEPVRTCPGRIKVRSDTCVLVAIS 255
Query: 145 LKDKDFVEG-PWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVY-------------CSA 190
L + V W+ N+L + P+ P+ G L++ ++Y SA
Sbjct: 256 LNIQQRVHPIIWTVNSLPFDCLQVYPIQKPIGGCLVMTVNAVIYLNQSVPPYGVSLNSSA 315
Query: 191 NAFKAIPIRP------SITKA-YGRVDADGSRYLLGDHAGLLHLLVITHEKEK-VTGLKI 242
+ A P++P S+ A + +D D + ++ G L++L + + + V
Sbjct: 316 DNSTAFPLKPQDGVRISLDCANFAFIDVD--KLVISLRTGDLYVLTLCVDSMRTVRNFHF 373
Query: 243 ELLGETSIASTISYLDNAVVYIGSSYGDSQLI 274
+ + S I L + +++GS G+S L+
Sbjct: 374 HKAAASVLTSCICVLHSEYIFLGSRLGNSLLL 405
Score = 43.5 bits (101), Expect = 0.55, Method: Compositional matrix adjust.
Identities = 44/159 (27%), Positives = 71/159 (44%), Gaps = 26/159 (16%)
Query: 288 EVLERYVNLGPIVDFCV---VDLERQGQG-------------QVVTCSGAYKDGSLRIVR 331
EV + +N+ PI C V+ E G ++V +G K+G+L +
Sbjct: 485 EVCDSLMNVAPINYMCAGERVEFEEDGATLRPHAESLQDLKIELVAATGHSKNGALSVFV 544
Query: 332 NGIGINEQASVELQGIKGMWSL------RSSTDDPFDTFLVVSFISETRILAMNLEDELE 385
N I S EL G +W++ +SS +D D F+++S + T +L E+
Sbjct: 545 NCINPQIITSFELDGCLDVWTVFDDATKKSSRNDQHD-FMLLSQRNSTLVLQTG--QEIN 601
Query: 386 ETEIEGFCSQTQTLFCHDAIYNQ-LVQVTSGSVRLVSST 423
E E GF T+F + + +VQVT+ VRL+ T
Sbjct: 602 EIENTGFTVNQPTIFVGNLGQQRFIVQVTTRHVRLLQGT 640
>gi|147860990|emb|CAN78747.1| hypothetical protein VITISV_022228 [Vitis vinifera]
Length = 1298
Score = 87.4 bits (215), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 58/209 (27%), Positives = 103/209 (49%), Gaps = 20/209 (9%)
Query: 295 NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINE--QASVELQGIKGMWS 352
N+ PI+D VVD + Q+ C G +GSLRI+R+GI + + + + QGI G W+
Sbjct: 503 NIAPILDMSVVDCHDEEHDQMFACCGVTPEGSLRIIRSGISVEKLLRTAPIYQGITGTWT 562
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
++ D + +FLV+SF+ ETR+L++ L + T+ GF TL C LVQ+
Sbjct: 563 VKMKVIDSYHSFLVLSFVEETRVLSVGLSFT-DVTDSVGFQPDVSTLACGVVDDGLLVQI 621
Query: 413 TSGSVRLVSSTS----------RELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLE 462
V+L T+ + W P S+++ + +++AT +++
Sbjct: 622 HKNGVKLCLPTTVAHPEGIPLASPICTSW-FPENISISLGAVGYNLIVVATSSPCFLFIL 680
Query: 463 IGDGI------LTEVKHAQLEYEISCLDI 485
+ + E++H +L+ E+SC+ I
Sbjct: 681 GVRSVSAYQYEIYEMQHVRLQNEVSCISI 709
Score = 56.2 bits (134), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 41/189 (21%), Positives = 85/189 (44%), Gaps = 7/189 (3%)
Query: 573 LGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDS 632
+G P+ L S + A SDRP ++ S+ L Y++++ + +H+ P S P
Sbjct: 882 IGITPVFLVPLSDSLEADIIALSDRPWLLQSARHSLSYTSISFQPSTHVTPVCSMECPMG 941
Query: 633 LAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMH 692
+ E L + + ++L+++ LG PR++ + +SR + ++++
Sbjct: 942 ILFVAENSLHLVEMVHSKRLNVQKFYLGGTPRKVLYHSESRLLLV---MRTELSQDTYSS 998
Query: 693 FVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDS----NVYYCVGTAYVLPEENEPTK 748
+ +D + +S++ L+ E G S+ ++ G A + E E TK
Sbjct: 999 DICCVDPLSGSVLSSFKLELGETGKSMELVRVVNEQVLVIGTSLSSGPAMMPSGEAESTK 1058
Query: 749 GRILVFIVE 757
GR++V +E
Sbjct: 1059 GRLIVLCLE 1067
>gi|320169222|gb|EFW46121.1| cleavage and polyadenylation specificity factor 1 [Capsaspora
owczarzaki ATCC 30864]
Length = 1725
Score = 87.4 bits (215), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 116/495 (23%), Positives = 206/495 (41%), Gaps = 70/495 (14%)
Query: 591 VFAASDRPT-VIYSSNKKLLYSNVNLKE--VSHMCPFNSAAFPDSLA-IAKEGELTI--- 643
VF RP ++ S +K L +++ L + VS FN+ A P +G L
Sbjct: 1227 VFVCGRRPLWLLMSPTRKALRAHLMLTDGSVSAFSAFNNNACPGGFVYFTTQGTLRFCQL 1286
Query: 644 -GTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICS----------------------- 679
T + +R +PL I + E RT+ + +
Sbjct: 1287 APTTNHDNPWPVRRVPLRATAHYIGYHEVFRTYVLVTSHPKPYFNLPRLTNDETYTPVPY 1346
Query: 680 -LKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNV-----YYC 733
K ++ + ++L+ T+E I ++ L FE S+ + + V Y
Sbjct: 1347 TPKPRAIPATFDTFSLQLISPVTWESIHSFDLPAFERVTSVDIAAITSQETVTGLKDYVV 1406
Query: 734 VGTAYVLPEENEPTKGRILVFIVED-----------GKLQLIAEKETKGAVYSLNAFNGK 782
+GT V+ E+ GRI+VF + D KL+ + E+E KGA+ +L+ G
Sbjct: 1407 IGTT-VIEGEDVTCHGRIIVFEIIDVVPEVNRPQTNRKLKYLMEREQKGAITALSHVCGH 1465
Query: 783 LLAAINQKIQLYKWMLRD--DGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLL 840
L++ I QKI ++++ D DG + ++ + V +FI+VGDL S+ LL
Sbjct: 1466 LVSCIGQKIIIWQFASDDTMDGVAFIDTQT------FVVSVSAIKNFILVGDLNNSVFLL 1519
Query: 841 IYKHEEGAIEERARDYNANWMSAVEILDDDIYLG---AENNFNLFTVRKNSEGATDEERG 897
+ + ARD++ +++ + L D LG +++ NL N
Sbjct: 1520 RFNETTKHLGFIARDFDHMSVASTQFLVDGSSLGFLATDSHQNLVVFAYNPLNRESNNGQ 1579
Query: 898 RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTV-----IFGTVNGVIGVIASLPHE 952
RL ++H+G V + +V R V + +V + T+ G + +A +
Sbjct: 1580 RLLRQLDFHVGSHVQQVLR--MVPRSLPVSVDRGASVKRHIDLLATLEGSLNALAPIGET 1637
Query: 953 QYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKK--TVDAKNFLDGDLIESFLDLSRTR 1010
+ LE LQ L ++ GLN +R++ +K T A N +DG+L+ FL L
Sbjct: 1638 TFRRLEWLQRQLVG-LQQRAGLNPIGYRAYRFPRKMTTTRAGNVIDGELLSRFLYLGLAE 1696
Query: 1011 MDEISKTMNVSVEEL 1025
E+++ + E+L
Sbjct: 1697 QRELARQRRNTPEDL 1711
Score = 47.4 bits (111), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 32/113 (28%), Positives = 61/113 (53%), Gaps = 15/113 (13%)
Query: 95 KGQLKEAFNIRLEELQ-----VLDIKFLYGCAKPTIVVLYQDNKDARHVKTYE------- 142
+ +L+ ++ I+L ELQ V+DI+FL G +PT+ +L++ N + +T +
Sbjct: 292 RQRLRPSYEIKLTELQRHIHHVIDIEFLTGYFEPTLALLFEPNAPSWTGRTVQRKDTCSM 351
Query: 143 VALK-DKDFVEGP--WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANA 192
VAL + P WS + L + ++ VP P+CG +I+ + I++ S ++
Sbjct: 352 VALSINTSSHSHPVVWSVDKLPFNSMRVMAVPRPVCGTVIVTPDAILHLSQSS 404
>gi|268580265|ref|XP_002645115.1| Hypothetical protein CBG16808 [Caenorhabditis briggsae]
gi|296439546|sp|A8XPU7.1|CPSF1_CAEBR RecName: Full=Probable cleavage and polyadenylation specificity
factor subunit 1; AltName: Full=Cleavage and
polyadenylation specificity factor 160 kDa subunit;
Short=CPSF 160 kDa subunit
Length = 1454
Score = 87.0 bits (214), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 84/315 (26%), Positives = 146/315 (46%), Gaps = 35/315 (11%)
Query: 739 VLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWML 798
V+PE +PT R K++++ +KE KG V L A NG LL+ + QK+ + W
Sbjct: 1153 VVPEPGQPTSNR---------KIKVLYDKEQKGPVTGLCAINGLLLSGMGQKV--FIWQF 1201
Query: 799 RDDGTRELQSECGHHGHILALY-VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYN 857
+D+ + S H ++ L+ ++T + D +S+SL+ ++ E A+ +RD
Sbjct: 1202 KDNDLMGI-SFLDMHYYVYQLHSIRT---IALALDARESMSLIRFQEENKAMSIASRDDR 1257
Query: 858 --ANWMSAVEILDDDIYLG-----AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEF 910
A A E L D +++G N LF+ + + E RL V ++G
Sbjct: 1258 KCAQAPMASEFLVDGMHIGFLLSDEHGNITLFSYSPEAPESNGGE--RLTVKAAINIGTN 1315
Query: 911 VNRFRHGSLVMRLPDS-------DVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTN 963
+N F L DS ++ Q IFG+++G G I L + Y L LQT
Sbjct: 1316 INAFLRVKGHTSLLDSSSPEERENIEQRMNTIFGSLDGSFGYIRPLTEKSYRRLHFLQTF 1375
Query: 964 LRKVIKGVGGLNHEQWRSFNNEKKTV---DAKNFLDGDLIESFLDLSRTRMDEISKTMNV 1020
+ V + GL+ + RS + V +A+N +DGD++E +L LS ++++ + V
Sbjct: 1376 IGSVTPQIAGLHIKGARSSKPSQPIVNGRNARNLIDGDVVEQYLHLSVYDKTDLARRLGV 1435
Query: 1021 SVEELCKRVEELTRL 1035
+ + +L R+
Sbjct: 1436 GRYHILDDLMQLRRM 1450
Score = 68.2 bits (165), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 90/399 (22%), Positives = 158/399 (39%), Gaps = 93/399 (23%)
Query: 65 NGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEEL-----QVLDIKFLYG 119
N I DP R +Y ++PF + ++ I L+++ V D+ FL G
Sbjct: 138 NPPIVRTDPANRCAASLVYGKHIAILPFHENSKRILSYIIPLKQIDPRLDNVADMVFLEG 197
Query: 120 CAKPTIVVLYQD----------NKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIP 169
+PTI+ LY+ D + V + D+ F W NL + L+
Sbjct: 198 YYEPTILFLYEPLQTTPGRACVRYDTMCIMGVSVNIVDRQFAVV-WQTANLPMDCNSLLS 256
Query: 170 VPPPLCGVLIIGEETIVYCSA-------------NAFKAIPIRPSITKAYGRVDADGSRY 216
+P PL G ++ G TIVY + + F P++ + +D S Y
Sbjct: 257 IPKPLGGAVVFGSNTIVYLNQAVPPCGIVLNSCYDGFTKFPLK-DMKHLKMTLDCSTSVY 315
Query: 217 L------LGDHAGLLHL--LVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSY 268
+ +G G L+L LV + V L+ + +TSIA T++ +++GS
Sbjct: 316 MEDGRIAVGSREGDLYLLRLVTSSGGATVKSLEFSKVCDTSIAFTLTVCAPGHLFVGSRL 375
Query: 269 GDSQLIKLNL----QPDAK----------------------GSYVEV------------- 289
GDSQL++ L + AK G +E+
Sbjct: 376 GDSQLLEYTLLKVTKESAKKQRLEQQNPSEIELDEDDIELYGGAIEMQQNDDDEQISESL 435
Query: 290 ----LERYVNLGPIVDFC----------VVDLERQGQ-GQVVTCSGAYKDGSLRIVRNGI 334
L+R +N+GP+ C ++D +R+ +VT SG K+G+L + + +
Sbjct: 436 QFRELDRLLNVGPVKSMCFGRPNYMSNDLIDAKRKDPVFDLVTASGHGKNGALCVHQRSM 495
Query: 335 GINEQASVELQGIKGMWSLRSSTDDPFDTFLVVSFISET 373
S L+G + +W++ ++ +L+VS + T
Sbjct: 496 RPEIITSSLLEGAEQLWAVGRKENESH-KYLIVSRVRST 533
>gi|194883064|ref|XP_001975624.1| GG22421 [Drosophila erecta]
gi|190658811|gb|EDV56024.1| GG22421 [Drosophila erecta]
Length = 1455
Score = 87.0 bits (214), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 106/435 (24%), Positives = 200/435 (45%), Gaps = 62/435 (14%)
Query: 654 IRSIPLGEHPRRICHQEQSRTFAICSL-------------KNQSCAEESE-MHFVR---- 695
+R +PL PR++ + ++R + + + +++ +EES F+
Sbjct: 1026 VRKVPLRCTPRQLVYHRENRVYCLITQTEEPMTKYYRFNGEDKELSEESRGERFIYPIGS 1085
Query: 696 -----LLDDQTFEFI--STYPLDTFEY--GCSILSCSFSDDSN---VYYCVGTAYVLPEE 743
L+ +T+E + ++ + +E+ I+ S+ + Y C+GT + E
Sbjct: 1086 QFEMVLISPETWEIVPDASISFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNY-SE 1144
Query: 744 NEPTKGRILVF-----IVEDGK------LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQ 792
+ ++G I ++ + E GK ++ I +KE KG V +++ G L+ + QKI
Sbjct: 1145 DITSRGNIHIYDIIEVVPEPGKPMTKFKIKEIFKKEQKGPVSAISDVLGFLVTGLGQKI- 1203
Query: 793 LYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEER 852
Y W LRD +L +I + T I + D+ KSISLL ++ E +
Sbjct: 1204 -YIWQLRDG---DLIGVAFIDTNIYVHQIITVKSLIFIADVYKSISLLRFQEEYRTLSLA 1259
Query: 853 ARDYNANWMSAVEILDDDIYLG-----AENNFNLFTVRKNSEGATDEERGRLEVVGEYHL 907
+RD+N + +E + D+ LG AE N ++ + + + ++ L +YHL
Sbjct: 1260 SRDFNPLEVYGIEFMVDNSNLGFLVTDAERNLIVYMYQPEARESLGGQK--LLRKADYHL 1317
Query: 908 GEFVNR-FR----HGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQT 962
G+ VN FR L R P + V++GT++G +G LP + Y LQ
Sbjct: 1318 GQVVNTMFRVQCHQKGLHQRQPFLYENK-HFVVYGTLDGALGYCLPLPEKVYRRFLMLQN 1376
Query: 963 NLRKVIKGVGGLNHEQWRSFNNEKKT--VDAKNFLDGDLIESFLDLSRTRMDEISKTMNV 1020
L + + GLN +++R+ + KK ++ +DGDLI S+ ++ + +E++K +
Sbjct: 1377 VLVSYQEHLCGLNPKEYRTLKSFKKQGINPSRCIIDGDLIWSYRLMANSERNEVAKKIGT 1436
Query: 1021 SVEELCKRVEELTRL 1035
EE+ + E+ RL
Sbjct: 1437 RTEEILGDLLEIERL 1451
Score = 56.2 bits (134), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 74/337 (21%), Positives = 136/337 (40%), Gaps = 75/337 (22%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAM---------GDVS 56
+YG + +L+ G +D L I+ + K VLQ D ++ L T ++ G +
Sbjct: 76 LYGNVMSLQCVSLAGAMRDALLISFKDAKLSVLQHDPDTFALKTLSLHYFEEDDIRGGWT 135
Query: 57 DRIGRPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKE---------------- 100
R PT +DPD R + +Y V+PF L E
Sbjct: 136 GRYFVPTVR-----VDPDSRCAVMLVYGKRLVVLPFRKDNSLDEIELADVKPIKKAPTAM 190
Query: 101 --------AFNIRLEEL-----QVLDIKFLYGCAKPTIVVLYQDNKDAR-HVKTYE---- 142
++ I L +L VLDI+FL+G +PT+++LY+ + +K
Sbjct: 191 VSRTPIMASYLIALRDLDEKIDNVLDIQFLHGYYEPTLLILYEPVRTCPGRIKVRSDTCV 250
Query: 143 ---VALKDKDFVEG-PWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVY----------- 187
++L + V W+ N+L + P+ P+ G L++ ++Y
Sbjct: 251 LVAISLNIQQRVHPIIWTVNSLPFDCLQVYPIQKPIGGCLVMTVNAVIYLNQSVPPYGVS 310
Query: 188 --CSANAFKAIPIRP------SITKA-YGRVDADGSRYLLGDHAGLLHLLVITHEKEK-V 237
SA+ A P++P S+ A + +D D + ++ G L++L + + + V
Sbjct: 311 LNSSADNSTAFPLKPQDGVRISLDCANFAFIDVD--KLVISLRTGDLYVLTLCVDSMRTV 368
Query: 238 TGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLI 274
+ + S I L + +++GS G+S L+
Sbjct: 369 RNFHFHKAAASVLTSCICVLHSEYIFLGSRLGNSLLL 405
Score = 42.7 bits (99), Expect = 0.97, Method: Compositional matrix adjust.
Identities = 43/159 (27%), Positives = 71/159 (44%), Gaps = 26/159 (16%)
Query: 288 EVLERYVNLGPIVDFCV---VDLERQGQG-------------QVVTCSGAYKDGSLRIVR 331
EV + +N+ P+ C V+ E G ++V +G K+G+L +
Sbjct: 485 EVCDSLMNVAPVNYMCAGERVEFEEDGATLRPHAESLQDVKIELVAATGHSKNGALSVFV 544
Query: 332 NGIGINEQASVELQGIKGMWSL------RSSTDDPFDTFLVVSFISETRILAMNLEDELE 385
N I S EL G +W++ +SS +D D F+++S + T +L E+
Sbjct: 545 NCINPQIITSFELDGCLDVWTVFDDATKKSSRNDQHD-FMLLSQRNSTLVLQTG--QEIN 601
Query: 386 ETEIEGFCSQTQTLFCHDAIYNQ-LVQVTSGSVRLVSST 423
E E GF T+F + + +VQVT+ VRL+ T
Sbjct: 602 EIENTGFTVNQPTIFVGNLGQQRFIVQVTTRHVRLLQGT 640
>gi|154419018|ref|XP_001582526.1| CPSF A subunit region family protein [Trichomonas vaginalis G3]
gi|121916762|gb|EAY21540.1| CPSF A subunit region family protein [Trichomonas vaginalis G3]
Length = 1142
Score = 87.0 bits (214), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 164/814 (20%), Positives = 328/814 (40%), Gaps = 81/814 (9%)
Query: 259 NAVVYIGSSYGDSQ--LIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQGQ-GQV 315
N+ YI S D + ++ P+ + + +E E + + R Q G +
Sbjct: 356 NSFYYITESCEDVEEGFVEKTFSPNTENTKIEYFEGHEIKNRLTKLITAPSFRGSQLGDI 415
Query: 316 VTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDP---FDTFLVVSFISE 372
++ G++ SL+I R G+ +E+ G G +++ DP +DTF+ VS S
Sbjct: 416 ISIHGSFNKSSLKITRKGMPTKVLKPIEIGG--GCTFVKAVKKDPLNEYDTFIFVSNEST 473
Query: 373 TRILAMNLED-ELEETEIEGFCSQTQTL--FCHDAIYN--QLVQVTSGSVRLVSSTSREL 427
TRI E EL++++ F + +T+ F A N L Q+T+ +R+++ +
Sbjct: 474 TRIFEFFEETRELKDSKESLFVTDKKTIDVFFLPAFNNTSSLAQITTEGMRIINDKEK-- 531
Query: 428 RNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGIL----TEVKHAQLEYEISCL 483
++WK + TAN SQ+ + +V E + L + K ++ +I+ +
Sbjct: 532 -HDWKRDESSKIICVTANPSQIAIVYDDNQIVLFETDETSLPKEVSSNKVMEIAGQITSI 590
Query: 484 DINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITK--EHLGGEIIPRSVLLCAF--- 538
+ Y + AV D + I + +++ T+ +I+ ++V+ F
Sbjct: 591 ALPQPQTGVRYVEWLAVSAVND-GLSIVYIVNISKTTEMWSVSSRQILDKTVISMMFLFV 649
Query: 539 EGISYLL-CALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDR 597
GI +L +G L L+ G L + LG P+T KN +
Sbjct: 650 PGIGNILHIGHNEGLLTRTNLDDSNGSLDNATLKFLGNAPVTFSRCEVKNQNSILINCAS 709
Query: 598 PTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDS--LAIAKEGELTIGTIDDIQKLHIR 655
P K L ++ C + + F D + + + T D + +
Sbjct: 710 PWY----TKGLSLVQLSSSPFVSCCQYKAPFFIDDSFIGLTTNNMVLFYTPDKNILIDTQ 765
Query: 656 SIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEY 715
++ L PR+I H ++ +C L + + + +L++ E IS+ PL +
Sbjct: 766 TLDLDMTPRQIVHIPNTK--FVCVLMSDLIKGKWKSQ-AKLVNYD--ELISSEPLQLDDN 820
Query: 716 GCSILSCSFSDDSNVYYCVGTAY-VLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVY 774
+C + + V CV P+ + G++L+ + G +++ +
Sbjct: 821 LLVSAACYLENYNLVAVCVAKNLSFFPKRCD--GGQVLLLDIGSGIPKVVQSTDFDDIPQ 878
Query: 775 SLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLM 834
++ F ++ + + + + L+ T+ L+ C Y++ G I+ GD M
Sbjct: 879 AIAPFGEYVILGVQETLMV----LKVGRTKLLKKCCSKAFPHCINYLKAFGTRIIAGDAM 934
Query: 835 KSISLLIYKHEEGAIEERARDY-NANWMSAVEILDDDIYLGAENNFNLFTVRK-----NS 888
+S + + EE + A D + +SA+ + + G + F F + + N
Sbjct: 935 ESFHFVKFDKEEDILSIFADDMVPRHPLSAIGLDRSTVCCG--DKFGSFCILRLPPDIND 992
Query: 889 EGATDEE--------------RGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTV 934
+ D + ++E+ YH+G + + L D ++
Sbjct: 993 DAEIDPSDVGHAFENEKFPGAKNKVELSNMYHIGSPITG-------LCLSQGDFENT-SI 1044
Query: 935 IFGTVNGVIGVIASLPHEQYLFLEKL-QTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKN 993
++GTV+G IG+ L + L +L + ++K+IK G EQ+RS+ K V+
Sbjct: 1045 VYGTVDGEIGLFIPLKTDTDARLFRLLEDEMKKLIKSPVGRFIEQFRSYYTPLKCVN--- 1101
Query: 994 FLDGDLIESFLDLSRTRMDEISKTMNVSVEELCK 1027
D +L+ S++DL EI++ + V +L +
Sbjct: 1102 --DSNLLLSYIDLPTNLQKEIAEKLKVKPFDLSR 1133
>gi|225455571|ref|XP_002268371.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Vitis vinifera]
Length = 1442
Score = 86.7 bits (213), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 93/369 (25%), Positives = 164/369 (44%), Gaps = 37/369 (10%)
Query: 694 VRLLDDQT----FEFISTYPLDTFEYGCSI----LSCSFSDDSNVYYCVGTAYVLPEENE 745
VR+L+ + ++ +T P+ + E ++ L + + ++ +GTAYV E+
Sbjct: 1076 VRVLEPEKSGAPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYV-QGEDV 1134
Query: 746 PTKGRILVFIV---EDGKLQLIAE---KETKGAVYSLNAFNGKLLAAINQKIQLYKWMLR 799
+GR+L+F V D L++E KE KGA+ ++ + G LL A KI L+KW
Sbjct: 1135 AARGRVLLFSVGKNTDNSQNLVSEIYSKELKGAISAVASLQGHLLIASGPKIILHKWT-- 1192
Query: 800 DDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNAN 859
GT + + + +FI++GD+ +SI L +K + + A+D+ +
Sbjct: 1193 --GTELNGVAFFDAPPLYVVSLNIVKNFILLGDIHRSIYFLSWKEQGAQLNLLAKDFGSL 1250
Query: 860 WMSAVEILDDDIYLGA-----ENNFNLFT-VRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
A E L D L + N +F K SE + +L E+H+G V +
Sbjct: 1251 DCFATEFLIDGSTLSLIVSDDQKNIQIFYYAPKMSESWKGQ---KLLSRAEFHVGAHVTK 1307
Query: 914 FRHGSLVMRLPD-------SDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRK 966
F ++ D SD ++FGT++G IG IA L + L+ LQ L
Sbjct: 1308 FLRLQMLPASSDRTSATQGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVD 1367
Query: 967 VIKGVGGLNHEQWRSF--NNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEE 1024
+ V GLN +R F N + N +D +L+ + L EI++ + + +
Sbjct: 1368 AVPHVAGLNPRSFRQFRSNGKAHRPGPDNIVDCELLCHYEMLPFEEQLEIAQQIGTTRMQ 1427
Query: 1025 LCKRVEELT 1033
+ + +L+
Sbjct: 1428 ILSNLNDLS 1436
Score = 76.3 bits (186), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 109/503 (21%), Positives = 194/503 (38%), Gaps = 132/503 (26%)
Query: 6 IYGRIATLELFRPHG----EAQDFLFIATERYKFCVLQWDAESSELITRAM----GDVSD 57
++G + T+ + G +D + +A + K VL++D L T +M G
Sbjct: 109 LHGNVETMTVLPSGGGDNSRRRDSIILAFQDAKISVLEFDDSIHGLRTSSMHCFEGPEWF 168
Query: 58 RIGRPTDN---GQIGIIDPDCRLIGLHLYDGLFKVIPFDNKG------------------ 96
+ R ++ G + +DP R G+ +Y ++ G
Sbjct: 169 HLKRGHESFARGPLVKVDPQGRCSGVLVYGLQMIILKASQAGYGLVGDEEALSSGSAVSA 228
Query: 97 QLKEAFNIRLEEL---QVLDIKFLYGCAKPTIVVLYQ------DNKDARHVKTYEVALK- 146
+++ ++ I L +L V D F++G +P +V+L++ +H AL
Sbjct: 229 RVESSYVISLRDLDMKHVKDFTFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSI 288
Query: 147 DKDFVEGP--WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIRPSITK 204
+ P WS NL + A L+PVP P+ GV++I +I Y S +A A+ +
Sbjct: 289 STTLKQHPLIWSAVNLPHDAYKLLPVPSPIGGVVVISANSIHYHSQSASCALALNNYAVS 348
Query: 205 AYG-----------RVDADGSRYLLGDHA------GLLHLLVITHEKEKVTGLKIELLGE 247
A +DA + +L D A G L LL + ++ V L +
Sbjct: 349 ADNSQEMPRSSFSVELDAANATWLSNDVAMLSTKTGELLLLTLAYDGRVVHRLDLSKSRA 408
Query: 248 TSIASTISYLDNAVVYIGSSYGDSQLIKLNL----------------QPDAK-------- 283
+ + S I+ + N++ ++GS GDS L++ P AK
Sbjct: 409 SVLTSGIAAIGNSLFFLGSRLGDSLLVQFTSILSSSVKEEVGDIEGDVPSAKRLRKSSSD 468
Query: 284 --------------GSY------------VEVLERYVNLGPIVDFC----------VVDL 307
GS V + ++N+GP+ DF +
Sbjct: 469 ALQDMVNGEELSLYGSAPNSTETSQKTFSFSVRDSFINVGPLKDFAYGLRINADPKATGI 528
Query: 308 ERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL-------------- 353
+Q ++V CSG K+G+L I++ I VEL G KG+W++
Sbjct: 529 AKQSNYELVCCSGHGKNGALCILQQSIRPEMITEVELPGCKGIWTVYHKNTRGHNADSTK 588
Query: 354 RSSTDDPFDTFLVVSFISETRIL 376
++ DD + +L++S S T +L
Sbjct: 589 MATKDDEYHAYLIISLESRTMVL 611
>gi|296084122|emb|CBI24510.3| unnamed protein product [Vitis vinifera]
Length = 1448
Score = 86.7 bits (213), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 93/369 (25%), Positives = 164/369 (44%), Gaps = 37/369 (10%)
Query: 694 VRLLDDQT----FEFISTYPLDTFEYGCSI----LSCSFSDDSNVYYCVGTAYVLPEENE 745
VR+L+ + ++ +T P+ + E ++ L + + ++ +GTAYV E+
Sbjct: 1082 VRVLEPEKSGAPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYV-QGEDV 1140
Query: 746 PTKGRILVFIV---EDGKLQLIAE---KETKGAVYSLNAFNGKLLAAINQKIQLYKWMLR 799
+GR+L+F V D L++E KE KGA+ ++ + G LL A KI L+KW
Sbjct: 1141 AARGRVLLFSVGKNTDNSQNLVSEIYSKELKGAISAVASLQGHLLIASGPKIILHKWT-- 1198
Query: 800 DDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNAN 859
GT + + + +FI++GD+ +SI L +K + + A+D+ +
Sbjct: 1199 --GTELNGVAFFDAPPLYVVSLNIVKNFILLGDIHRSIYFLSWKEQGAQLNLLAKDFGSL 1256
Query: 860 WMSAVEILDDDIYLGA-----ENNFNLFT-VRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
A E L D L + N +F K SE + +L E+H+G V +
Sbjct: 1257 DCFATEFLIDGSTLSLIVSDDQKNIQIFYYAPKMSESWKGQ---KLLSRAEFHVGAHVTK 1313
Query: 914 FRHGSLVMRLPD-------SDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRK 966
F ++ D SD ++FGT++G IG IA L + L+ LQ L
Sbjct: 1314 FLRLQMLPASSDRTSATQGSDKTNRFALLFGTLDGSIGCIAPLDELTFRRLQSLQKKLVD 1373
Query: 967 VIKGVGGLNHEQWRSF--NNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEE 1024
+ V GLN +R F N + N +D +L+ + L EI++ + + +
Sbjct: 1374 AVPHVAGLNPRSFRQFRSNGKAHRPGPDNIVDCELLCHYEMLPFEEQLEIAQQIGTTRMQ 1433
Query: 1025 LCKRVEELT 1033
+ + +L+
Sbjct: 1434 ILSNLNDLS 1442
Score = 70.5 bits (171), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 109/509 (21%), Positives = 193/509 (37%), Gaps = 138/509 (27%)
Query: 6 IYGRIATLELFRPHG----EAQDFLFIATERYKFCVLQWDAESSELITRAM----GDVSD 57
++G + T+ + G +D + +A + K VL++D L T +M G
Sbjct: 109 LHGNVETMTVLPSGGGDNSRRRDSIILAFQDAKISVLEFDDSIHGLRTSSMHCFEGPEWF 168
Query: 58 RIGRPTDN---GQIGIIDPDCRLIGLHLYDGLFKVIPFDNKG------------------ 96
+ R ++ G + +DP R G+ +Y ++ G
Sbjct: 169 HLKRGHESFARGPLVKVDPQGRCSGVLVYGLQMIILKASQAGYGLVGDEEALSSGSAVSA 228
Query: 97 QLKEAFNIRLEEL---QVLDIKFLYGCAKPTIVVLYQ------DNKDARHVKTYEVALK- 146
+++ ++ I L +L V D F++G +P +V+L++ +H AL
Sbjct: 229 RVESSYVISLRDLDMKHVKDFTFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSI 288
Query: 147 DKDFVEGP--WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIRPSITK 204
+ P WS NL + A L+PVP P+ GV++I +I Y S +A A+ +
Sbjct: 289 STTLKQHPLIWSAVNLPHDAYKLLPVPSPIGGVVVISANSIHYHSQSASCALALNNYAVS 348
Query: 205 AYG-----------RVDADGSRYLLGDHA------GLLHLLVITHEKEKVTGLKIELLGE 247
A +DA + +L D A G L LL + ++ V L +
Sbjct: 349 ADNSQEMPRSSFSVELDAANATWLSNDVAMLSTKTGELLLLTLAYDGRVVHRLDLSKSRA 408
Query: 248 TSIASTISYLDNAVVYIGSSYGDSQLIKLNL----------------QPDAK---GSYVE 288
+ + S I+ + N++ ++GS GDS L++ P AK S +
Sbjct: 409 SVLTSGIAAIGNSLFFLGSRLGDSLLVQFTSILSSSVKEEVGDIEGDVPSAKRLRKSSSD 468
Query: 289 VLERYVN-------------------------------------LGPIVDFC-------- 303
L+ VN +GP+ DF
Sbjct: 469 ALQDMVNGEELSLYGSAPNSTETSQVEAQVGKTFSFSVRDSFINVGPLKDFAYGLRINAD 528
Query: 304 --VVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL-------- 353
+ +Q ++V CSG K+G+L I++ I VEL G KG+W++
Sbjct: 529 PKATGIAKQSNYELVCCSGHGKNGALCILQQSIRPEMITEVELPGCKGIWTVYHKNTRGH 588
Query: 354 ------RSSTDDPFDTFLVVSFISETRIL 376
++ DD + +L++S S T +L
Sbjct: 589 NADSTKMATKDDEYHAYLIISLESRTMVL 617
>gi|357162146|ref|XP_003579318.1| PREDICTED: probable cleavage and polyadenylation specificity factor
subunit 1-like [Brachypodium distachyon]
Length = 1442
Score = 86.7 bits (213), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 84/350 (24%), Positives = 160/350 (45%), Gaps = 27/350 (7%)
Query: 702 FEFISTYPLDTFEYGCSI----LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVF--I 755
+E ST P+ +FE ++ L + + ++ +GTAYV E+ +GR+L+F
Sbjct: 1096 WETRSTIPMQSFENALTVRIVTLHNTTTKENETLMAIGTAYV-QGEDVAARGRVLLFSFT 1154
Query: 756 VEDGKLQLIAE---KETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGH 812
+ L+ E KE+KGAV ++ + G L+ A KI L KW +G+ EL + +
Sbjct: 1155 KSENSQNLVTEVYSKESKGAVSAVASLQGHLVIASGPKITLNKW----NGS-ELTAVAFY 1209
Query: 813 HGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIY 872
+ + + +F++ GD+ KS+ L +K + + A+D+ + A E L D
Sbjct: 1210 DAPLHVVSLNIVKNFVLFGDIHKSVYFLSWKEQGSQLTLLAKDFGSLDCFATEFLIDGST 1269
Query: 873 LG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVG 929
L ++++ NL + + +L E H+G + +F + LP +
Sbjct: 1270 LSLVVSDSDKNLQIFYYAPKMVESWKGQKLLSRAELHVGAHMTKFLR---LQMLPAQGLA 1326
Query: 930 QIPT----VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSF--N 983
T ++FGT++G IG IA + + L+ LQ L + V GLN +R F N
Sbjct: 1327 SEKTNRFALLFGTLDGSIGCIAPVDELTFRRLQSLQRKLVDAVSHVCGLNPRSFRQFKSN 1386
Query: 984 NEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELT 1033
+ N +D +L+ + LS ++++ + + ++ +++
Sbjct: 1387 GKAHRPGPDNIIDFELLTYYEILSLEEQLDMAQQIGTTRAQILSNFSDIS 1436
Score = 75.1 bits (183), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 90/419 (21%), Positives = 173/419 (41%), Gaps = 110/419 (26%)
Query: 106 LEELQVLDIKFLYGCAKPTIVVLYQDNK------DARH----VKTYEVALKDKDFVEGP- 154
L+ V D F++G +P +V+L++ ++H + + +++ K + P
Sbjct: 245 LDTNHVKDFTFVHGYIEPVLVILHEREPTWAGRISSKHHTCMISAFSISMTLK---QHPM 301
Query: 155 -WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVY------CS--ANAFKAIPI-RPSITK 204
WS N+ + A ++ VPPP+ GVL+I +I Y CS N F + P P I K
Sbjct: 302 IWSAANIPHDAYQILSVPPPISGVLVICANSIHYHSQSTSCSLALNNFASQPDGSPEIHK 361
Query: 205 AYGRVDADGSR--------YLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISY 256
V+ D ++ + G + LL + ++ V L + + I+S ++
Sbjct: 362 VNFHVELDAAKATWLSNDIVMFSAKTGEMLLLTVVYDGRTVQKLDLMKSKASVISSGVTT 421
Query: 257 LDNAVVYIGSSYGDSQLIKL---------------------------------------- 276
+ ++ ++GS GDS L++
Sbjct: 422 IGSSFFFLGSRVGDSLLVQFSCGVPTSVIPDIADERSADIEGDLPFSKRLKRVPSDILQD 481
Query: 277 -----------NLQPDA-----KGSYVEVLERYVNLGPIVDFC----------VVDLERQ 310
N+ P++ K SYV V + VN+GP+ DF +Q
Sbjct: 482 VTSVEELSFQNNMLPNSLESAQKISYV-VRDALVNVGPLKDFSYGLRVNADPNATGNAKQ 540
Query: 311 GQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL-------RSSTDDPFDT 363
++V CSG K+G+L +++ I + VEL +G+W++ ++ D+ +
Sbjct: 541 SNYELVCCSGHGKNGALSVLQQSIRPDLITEVELPSCRGIWTVYYKSSRGHTTEDNEYHA 600
Query: 364 FLVVSFISETRILAMNLEDELEE-TEIEGFCSQTQTLFCHDAI-YNQLVQVTSGSVRLV 420
+L++S E+R + + D+L E TE + Q T+ + +++QV + R++
Sbjct: 601 YLIISL--ESRTMVLETGDDLGEVTETVDYYVQGATITAGNLFGRRRVIQVYATGARVL 657
>gi|221055487|ref|XP_002258882.1| CPSF (cleavage and polyadenylation specific factor), subunit A
[Plasmodium knowlesi strain H]
gi|193808952|emb|CAQ39655.1| CPSF (cleavage and polyadenylation specific factor), subunit A,
putative [Plasmodium knowlesi strain H]
Length = 2478
Score = 86.3 bits (212), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 122/539 (22%), Positives = 220/539 (40%), Gaps = 96/539 (17%)
Query: 565 LTDRKKVSLGTQ-PITLRTFS----------------SKNTTHVFAASDRPTVIYSS-NK 606
LT R+K+S T+ P+ +TF SK +F D P +IYS+ K
Sbjct: 1907 LTKRRKISTCTKSPVKFKTFLKVNSEKNLIDINMSKLSKKCNFLFVCCDNPIIIYSTLKK 1966
Query: 607 KLLYSNVNLKEV---------SHMCPF-NSAAFP-----DSLAIAKEG-ELTIGTIDDIQ 650
K+ S ++++ V +++ PF N +F +S I +G +L I +++++
Sbjct: 1967 KISTSKLSIRNVLLVDMFSDFNYLNPFHNFLSFKKKNQNNSYFIFFDGHKLCISYLNEMK 2026
Query: 651 KLHIRSIPLGEHPRRIC-HQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYP 709
K + IP +I H + C ++ + + V D F TY
Sbjct: 2027 KTFMERIPFHRTVEKIAYHADTGLLITACPVEEKHKTNQMMKQIVCFFDPFQNSFKYTYI 2086
Query: 710 LDTFEYGCSI----LSCSFS-------DDSN-----------------VYYCVGTAYVLP 741
+ + SI L+ S S D N CVGTA
Sbjct: 2087 IPSKFSVSSICIYELASSHSLVADHSVDQMNQVNHPNNKDGALNPPVHTLICVGTANNNE 2146
Query: 742 EENEPTKGRILVFIVEDGKLQ-----LIAEKETKGAVYSLNAFNGKLLAAINQKIQLY-- 794
EP+ G I VF+ + Q + G + L F K++AA+N + +
Sbjct: 2147 RITEPSSGHIYVFVAKKKTNQFEIKHIYTYNVNCGGITHLKQFRDKIVAAVNNTVLILDI 2206
Query: 795 KWMLRDDGT------RELQSECGHHGHILALYVQTR--------GDFIVVGDLMKSISLL 840
+ L + GT + ++ E +A + + ++IVVGD+M S++LL
Sbjct: 2207 RNFLTNLGTYIYNASKAMKVESNDAFLEVASFTPSSWIMSLDVVKNYIVVGDIMTSVTLL 2266
Query: 841 IYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLE 900
Y E + E RDY W +A L +D +L ++ N ++K++ DEE +L
Sbjct: 2267 SYDFENAILNEVCRDYANIWCTA---LSEDHFLVSDMESNFLVLQKSNIKFNDEESFKLS 2323
Query: 901 VVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVI-----GVIASL-PHEQY 954
+V +++ G VN+ SL + + + + P I ++ G I++L P +
Sbjct: 2324 LVSQFNHGSVVNKMLSTSLRNLVDEYESEERPNEIVQKERSILCASSEGSISTLIPFSNF 2383
Query: 955 LFLEK---LQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTR 1010
+ ++ ++ + I +G L+H +R + + + K +DG+L + F L R
Sbjct: 2384 IQFKRALCIEIAINDNISSLGNLSHSSYREYKITLASKNCKGVVDGELFKMFFYLPFER 2442
Score = 48.9 bits (115), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 20/69 (28%), Positives = 43/69 (62%), Gaps = 1/69 (1%)
Query: 286 YVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQ 345
++E+L N+GPI+DFCVV + + +++TC+ + G + I+RNG+ ++ + + +
Sbjct: 829 FIEILSVIQNMGPILDFCVVK-NKNEEKEIITCNSYGRTGCISIIRNGMKVDIISKLNIG 887
Query: 346 GIKGMWSLR 354
I M+ ++
Sbjct: 888 KITNMFVVK 896
>gi|67521912|ref|XP_659017.1| hypothetical protein AN1413.2 [Aspergillus nidulans FGSC A4]
gi|74598221|sp|Q5BDG7.1|CFT1_EMENI RecName: Full=Protein cft1; AltName: Full=Cleavage factor two protein
1
gi|40745387|gb|EAA64543.1| hypothetical protein AN1413.2 [Aspergillus nidulans FGSC A4]
gi|259486722|tpe|CBF84808.1| TPA: Protein cft1 (Cleavage factor two protein 1)
[Source:UniProtKB/Swiss-Prot;Acc:Q5BDG7] [Aspergillus
nidulans FGSC A4]
Length = 1339
Score = 86.3 bits (212), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 96/438 (21%), Positives = 192/438 (43%), Gaps = 70/438 (15%)
Query: 654 IRSIPLGEHPRRICHQEQSRTFAI--CSLKNQSCAEESEMH-----------------FV 694
+R++P+G+ ++ + S T+ + C E+ E+H +
Sbjct: 907 MRTVPIGQQIDKLTYVSASDTYVLGTCQRCEFRLPEDDELHPEWRNEEISFLPEVNQSSL 966
Query: 695 RLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVY-----YCVGTAYVLPEENEPTKG 749
+++ +T+ I +YPL+ E+ + + S N + VGT+ E+ P++G
Sbjct: 967 KVVSPKTWSVIDSYPLEPAEHIMVMKTMSLEVSENTHERRDMIVVGTSLAR-GEDIPSRG 1025
Query: 750 RILVF----IVEDG-------KLQLIAEKETKGAVYSLNAFNGK--LLAAINQKIQLYKW 796
I VF +V D +L+LI ++ KGAV +L+ G+ L+AA QK +
Sbjct: 1026 CIYVFEVIEVVPDPEQPETNRRLKLIGKEPVKGAVTALSEIGGQGFLIAAQGQKSMVRG- 1084
Query: 797 MLRDDGT----RELQSECGHHGHILALYVQTRGD-FIVVGDLMKSISLLIYKHEEGAIEE 851
L++DG+ + +C +++ + +G + GD +K + Y E +
Sbjct: 1085 -LKEDGSLLPVAFMDMQC-----FVSVIKELKGTGMCIFGDAVKGLWFAGYSEEPYKMSL 1138
Query: 852 RARDYNANWMSAVEILDDD---IYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLG 908
A+D + + A + L D + A+++ NL+ ++ + E +L ++H G
Sbjct: 1139 FAKDLDYLEVLAADFLPDGNKLFIVVADSDCNLYVLQYDPEDPNSSNGDKLLNRSKFHTG 1198
Query: 909 EFVN-------------RFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYL 955
F + R GS M + + + V+ + NG IG++ +P E Y
Sbjct: 1199 NFASTVTLLPRTLVSSERAMSGSDKMDI--DNTAPLHQVLVTSHNGSIGLVTCVPEESYR 1256
Query: 956 FLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEIS 1015
L LQ+ L ++ GLN +R+ ++ + LD +L+ +LD+S+ R EI+
Sbjct: 1257 RLSALQSQLTNTLEHPCGLNPRAYRAVESDASA--GRGMLDSNLLLQYLDMSKQRKAEIA 1314
Query: 1016 KTMNVSVEELCKRVEELT 1033
+ + E+ +E ++
Sbjct: 1315 GRVGATEWEIRADLEAIS 1332
>gi|156097003|ref|XP_001614535.1| hypothetical protein [Plasmodium vivax Sal-1]
gi|148803409|gb|EDL44808.1| hypothetical protein, conserved [Plasmodium vivax]
Length = 2558
Score = 85.5 bits (210), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 119/565 (21%), Positives = 226/565 (40%), Gaps = 110/565 (19%)
Query: 552 HLLNFLLNMKTGELTDRKKVSLGTQ-PITLRTF----SSKNTTHV------------FAA 594
+L ++++ K LT R+K+S T P+ + F S +N V F
Sbjct: 1962 YLKSYVVKSKKIMLTKRRKISTCTNSPVKFKAFLKVHSERNQIDVNMSKLTKRCNFLFVC 2021
Query: 595 SDRPTVIYSS-NKKLLYSNVNLKEV---------SHMCPFNSAAF------PDSLAIAKE 638
D P +IYS+ KKL S ++++ V +++ PF++ +S I +
Sbjct: 2022 CDSPIIIYSTLKKKLSISKLSIRNVHLVDMFSDFNYLNPFHNFLLFKKKNQNNSYFIFFD 2081
Query: 639 G-ELTIGTIDDIQKLHIRSIPLGEHPRRIC-HQEQSRTFAICSLKNQSCAEESEMHFVRL 696
G +L I +++++K + IP +I H + C ++ + + V
Sbjct: 2082 GNQLCISYLNEMKKTFMERIPFHRTVEKIAYHADTGLLITACPVEEKHKTNQMMKQIVCF 2141
Query: 697 LD--DQTFEF------------ISTYPLDTFEYGCSI--------------------LSC 722
D +F++ I Y L G S+ L
Sbjct: 2142 FDPFQNSFKYTYIIPSKFSVSSICIYELAPSSGGASMGEMEQTSQMGQMEQTNQTNELKP 2201
Query: 723 SFSDDSN------VYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQ-----LIAEKETKG 771
S ++ CVGTA EP+ G I VF+ + Q + + G
Sbjct: 2202 SHPEERTDAPPVRTLICVGTANNNERITEPSSGHIYVFVAKKQTNQFEIKHVYTYNVSCG 2261
Query: 772 AVYSLNAFNGKLLAAINQKI----------QLYKWMLRDDGTRELQSECGH--------H 813
+ L F K++AA+N + L ++ +++S
Sbjct: 2262 GITHLKQFRDKIVAAVNNTVVILDIGNFLANLGAYIYNSSKAIKIESNDAFLEVASFTPS 2321
Query: 814 GHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYL 873
I++L V ++IVVGD+M S++LL Y E + E RDY W ++V L ++ +L
Sbjct: 2322 SWIMSLDVVE--NYIVVGDIMTSVTLLSYDFENAILNEVCRDYANIWCTSVSALSENHFL 2379
Query: 874 GAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSD-----V 928
++ N ++K++ DEE +L +V +++ G VN+ SL + D + +
Sbjct: 2380 VSDMESNFLVLQKSNIKFNDEESFKLSLVSQFNHGSVVNKMFSTSLRNLVDDEERRNEIL 2439
Query: 929 GQIPTVIFGTVNGVIGVIASLPHEQYLFLEK---LQTNLRKVIKGVGGLNHEQWRSFNNE 985
+ +++ + G I + +P +L ++ ++ + I +G L+H +R +
Sbjct: 2440 QKEQSILCASSEGSISAL--IPFSNFLQFKRALCIEIAINDNISSLGNLSHSSYREYKVS 2497
Query: 986 KKTVDAKNFLDGDLIESFLDLSRTR 1010
+ + K +DG+L + F L R
Sbjct: 2498 LASKNCKGVVDGELFKMFFYLPFER 2522
Score = 54.7 bits (130), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 23/69 (33%), Positives = 44/69 (63%), Gaps = 1/69 (1%)
Query: 286 YVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQ 345
++E+L N+GPI+DFCVV + G+ +++TC+ + G + I+RNG+ +N + + L
Sbjct: 822 FIEILAVIQNMGPILDFCVVK-NKNGEKEIITCNSYGRTGCISIIRNGMKVNIISKLSLG 880
Query: 346 GIKGMWSLR 354
I M+ ++
Sbjct: 881 KITNMFVVK 889
>gi|157872916|ref|XP_001684981.1| putative CPSF-domain protein [Leishmania major strain Friedlin]
gi|68128051|emb|CAJ06910.1| putative CPSF-domain protein [Leishmania major strain Friedlin]
Length = 1347
Score = 85.1 bits (209), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 206/987 (20%), Positives = 378/987 (38%), Gaps = 169/987 (17%)
Query: 157 QNNLDNGADLLIPVPPPLCGVLIIGEETI-----VYCSANAF-KAIPIRPSITK-AYGRV 209
Q N+D A ++P G+ ++G + V F + IP R +I + V
Sbjct: 320 QANVDPTASRIVPSRH---GLFVVGNHLVTLIQQVRPRKGVFSREIPSRQAILDVSCASV 376
Query: 210 DADGSRYLLGDHAGLLHLLVITHE----KEKVTGLKIELLGETSIASTISYLDNAVVYIG 265
AD S L+G G+L + + ++ + +I + + L + +
Sbjct: 377 SADASELLIGFSDGVLARVTVISRGMGGEDPELAFRFLSHPAPTIPTELVTLHADLYLLC 436
Query: 266 SSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDG 325
S + S ++L+ + S EVL N GP++D + + VV +G ++ G
Sbjct: 437 SRFDSSFTVRLD-----EASCEEVLH---NCGPVLD--MTTHKNGSHNSVVASTGIHRGG 486
Query: 326 SLRIVRNGIGINEQASVEL----QGIKGMWSLRS-STDDPFDTFLVVSFISETRILAMNL 380
+ ++R+ + + EQA + L Q I L +T + F +VS A+ L
Sbjct: 487 GISVLRSAVMLREQADIPLRCHPQRILCAGELMCITTPAAANRFFLVS------ARALTL 540
Query: 381 EDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVN 440
E+ E DA NQLV +++ + + + + + P
Sbjct: 541 EELAPHVFPEIVSRHLLVELGLDAALNQLVLISARGIAWLRLSGPRVEAVSQLP------ 594
Query: 441 VATANASQVLLATGGGHLVYLEIGDGILTEVKHAQ-LEYEISCLDINPIGENPSYS-QIA 498
A+ Q+L A L + DGI+ V Q L Y + L I S +
Sbjct: 595 -RQADDPQILFAVVRHGLTV--VCDGIVVTVYRGQDLVYTVPSLAGVAISSVSLVSPTLL 651
Query: 499 AVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEGIS-------YLLCALGDG 551
+G W D V ++ L L + +PR++ CA +S L + G
Sbjct: 652 VLGQW-DGVVALYELGHRQADVVGLLLLDSVPRTIA-CAVPPLSPSSVVGEAALPMIYIG 709
Query: 552 HLLNFLLN-----MKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNK 606
L FL+ ++ GE R V + P+ L + ++ + + V+ +
Sbjct: 710 TLHGFLVGTTVALLRRGE--KRLSVFVAPHPLELLELAGRHAGLLCLGNVPLVVLCVAGG 767
Query: 607 KLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRI 666
L + ++L VS ++ + +G L +G+++ ++K +PLGE ++
Sbjct: 768 GLQLTGLHLTGVSAAATVDADLQCYAFYSKHDGRLHVGSLEALEKTSRTFLPLGETITQL 827
Query: 667 CHQEQSRTFAICSLKNQSCAEESEMHF---------------------VRLLDDQTFEFI 705
H+ + S++ E E+ F V LL+++ F+
Sbjct: 828 -HETAAWGGYAASVRK---VEGDEVLFLPSAVIQFPWAADLALWRSAAVPLLENERCVFL 883
Query: 706 STYPLDTFEYGCSI------LSCSFSDDSNV-------YYCVGTAYVLPEENEPTKGRIL 752
T LD E G + + + SD + V +G+++ P+E GRI
Sbjct: 884 QTVRLDGGEGGGAGERGGEPIDVAASDSAGVSEEEWQHLLLIGSSFTFPDEQRARSGRIT 943
Query: 753 VFIV----EDGKLQLIAEKETKGAVYSLNA---FNGKLLAAINQKIQLYKWMLRDDGTRE 805
+ + +L LIA K+ GA+ A + G++ +N + LYKW +D T
Sbjct: 944 WCALREEHQQQRLHLIASKDIGGALQCCAAVPHYKGRIALGVNGCVCLYKWN-TEDQTFV 1002
Query: 806 LQSECG---HHGHILALYVQTRGDFIVVG-DLMKSISLLIYKHEEGAIEERARDYNANWM 861
+ C ++ LY + ++V D+ S + +G+++ RD
Sbjct: 1003 AEERCRVGLTVTKLIPLYHTSLAASVLVALDVRHSAFFIEVDTLQGSLKVLCRDAELRG- 1061
Query: 862 SAVEILDDDIYLGAEN--------NFNLFTV---------RKNSEGATDEERGRLEVVGE 904
++D I AEN NF V + A+ + R EV +
Sbjct: 1062 ----VMDGHIGSDAENLCLFDDSLNFTALRVVPLPVEAGDGDAAAAASVTAQYRFEVRAQ 1117
Query: 905 YHLGEFVNRFRHGSL----VMRLPDSDVGQI----------PTVIFGTVNGVIGVIASLP 950
HLG+ V R GS +M P S P ++F T +G GV+ +
Sbjct: 1118 CHLGDLVTCVRQGSFAATSLMEAPASCASAQNRLLLPGIAGPQLVFATAHGGFGVVTPVH 1177
Query: 951 HEQYLFLEKLQTNLRKVIKGVGGLNHEQWR--------------------SFNNEK-KTV 989
YL L L+ +L + ++ +GGL+H+ +R + E+ +
Sbjct: 1178 AATYLVLRTLEASLVRTLQPLGGLSHQAFREVLRSGQERGVSYLASKTGCALTRERLRRY 1237
Query: 990 DAKNFLDGDLIESFLDLSRTRMDEISK 1016
+ N +DGD++E F+ LS + ++ +
Sbjct: 1238 EPLNTVDGDMVEQFVQLSVSDKKQVCR 1264
>gi|291232722|ref|XP_002736302.1| PREDICTED: cleavage and polyadenylation specific factor 1-like
[Saccoglossus kowalevskii]
Length = 984
Score = 84.7 bits (208), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 123/500 (24%), Positives = 208/500 (41%), Gaps = 97/500 (19%)
Query: 591 VFAASDRPTVIYSSNKKLLYSNVNLKEVSHMC--PFNSAAFPDS-LAIAKEGELTIGTID 647
VF P ++ +++ L S+ + S C PF++ P L K GEL I +
Sbjct: 523 VFVCGPYPHWLFMTSRGALRSHPMHIDGSVTCFAPFHNINCPKGFLYFNKHGELRICVLP 582
Query: 648 DIQKLH----IRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEM------------ 691
+R +PL P I + +S+T+A+ + ++ C +M
Sbjct: 583 THLSYDALWPVRKVPLRCTPHFISYHIESKTYAVVTSVSEPCLRICKMTGDDKEFEDVER 642
Query: 692 --HFV---------RLLDDQTFEFISTYPLDT--FEYGCSILSCSFSDDSNV-----YYC 733
F+ +L ++E I +DT +E+ + + + V +
Sbjct: 643 DDRFIFPTIEKFSLQLFSPLSWEAIPNTKIDTEDWEHITGLKTVFLKSEGTVSGLKGFIA 702
Query: 734 VGTAYVLPEENEPTKGRILVF-----IVEDG------KLQLIAEKETKGAVYSLNAFNGK 782
V T V EE +GRIL+F + E G KL+L+ +KE KG V +L G
Sbjct: 703 VSTTIVYGEEVT-CRGRILIFDVIEVVPEPGQPLTKNKLKLLYDKEQKGPVTTLCDIEGL 761
Query: 783 LLAAINQKIQLYKWMLRDD---GTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISL 839
L AAI QKI L W R++ G + ++ H H L T +FI+ D+ KS+SL
Sbjct: 762 LAAAIGQKIFL--WAFRNNDLIGVAFIDTQI--HIHTLC----TIKNFILAADIRKSVSL 813
Query: 840 LIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR-KNSEGATDE-ERG 897
L + E+ ++ R+ +L + + + F +R K S+ AT++ G
Sbjct: 814 LRFSDEDRSLSLVTRES----FGGQRLLRRADFNAGSHVCSFFRMRSKLSDPATEKLLTG 869
Query: 898 RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFL 957
+E RH +F T++G IG + + + Y L
Sbjct: 870 PMER-------------RH----------------VTMFATLDGSIGYLIPMTEKTYRRL 900
Query: 958 EKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVD--AKNFLDGDLIESFLDLSRTRMDEIS 1015
LQ L GLN + +R ++ K+++ KN LDGDL+ + LS E++
Sbjct: 901 LMLQNALTTQTLHTAGLNPKGFRMVKHQTKSLENTHKNILDGDLLWKYTFLSVNERTELA 960
Query: 1016 KTMNVSVEELCKRVEELTRL 1035
K + SVE++ + ++ RL
Sbjct: 961 KKIGTSVEQILDDLMDVERL 980
>gi|294889072|ref|XP_002772683.1| spliceosomal protein sap, putative [Perkinsus marinus ATCC 50983]
gi|239877121|gb|EER04499.1| spliceosomal protein sap, putative [Perkinsus marinus ATCC 50983]
Length = 229
Score = 84.7 bits (208), Expect = 2e-13, Method: Composition-based stats.
Identities = 56/199 (28%), Positives = 97/199 (48%), Gaps = 16/199 (8%)
Query: 314 QVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLR---------SSTDDPFDT 363
Q+ SG +LRI+R+G+ + E + EL G +++++ + + D
Sbjct: 34 QIYALSGRGPRSALRILRHGLTLGEAGASELPGQPNALFTIKPFGASYAPVAEGEVESDR 93
Query: 364 FLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSST 423
++VVSF+ +T L + D + E GF + TLF +QV +R V++
Sbjct: 94 YIVVSFVDQTLTLLVT-SDNIHEVTDSGFAKEQPTLFAMRMQDKSAIQVMPTGIRHVAAG 152
Query: 424 SRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG---DGILTEVKHAQLEYEI 480
R EW++PPG V +A +N SQV++A GG + E+ +G L+EV + E+
Sbjct: 153 RRT--TEWRAPPGRQVTMAASNGSQVVIALSGGEIQLFELDADTNGHLSEVAKRDIGCEV 210
Query: 481 SCLDINPIGENPSYSQIAA 499
+ L + P+ + SQ A
Sbjct: 211 AALTVQPLSSGRTRSQFMA 229
>gi|313232279|emb|CBY09388.1| unnamed protein product [Oikopleura dioica]
Length = 1451
Score = 84.7 bits (208), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 89/373 (23%), Positives = 171/373 (45%), Gaps = 40/373 (10%)
Query: 693 FVRLLDDQTFEFI--STYPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENE 745
FV + ++E I S +D E+ S ++++ Y +GT+ + E+ +
Sbjct: 1083 FVEMFSPASWEVIPNSRIEMDAHEHILCCRSVYLKSEASMSGRKQYIAIGTSNICGEDFQ 1142
Query: 746 PTKGRILVFIVED-----------GKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLY 794
++GR+++ V D K + + + +G V ++++ +G L+AAI QK+ ++
Sbjct: 1143 -SRGRLILLEVIDVVPEPGKPLTRYKYKTVFDASQRGPVSAVDSLDGALIAAIGQKVFIH 1201
Query: 795 KWM---LRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEE 851
+ LR G + Q H H Y +VGD+ + I+LL ++ E I +
Sbjct: 1202 AFQDDNLRATGFVDTQLYT-HATHCFKNYA-------LVGDIQQGITLLRHQGERNCISQ 1253
Query: 852 RARDYNANWMSAVEILDDDIYLG-----AENNFNLFTVR---KNSEGATDEERGRLEVVG 903
+R A ++AV IL D +G + N ++ + K S G R +G
Sbjct: 1254 ISRARRAGEVTAVGILLDGNQVGLVSTDMQRNLQVYMYKPDQKESNGGKQLVRQADINLG 1313
Query: 904 EYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTN 963
+ + + + R ++ ++ + ++G IG I + + + LE LQT
Sbjct: 1314 KRVISIWNSLGRQNDTFTKVALTENDARHVTFYAGLDGSIGDIVPVSEKVFRRLEMLQTL 1373
Query: 964 LRKVIKGVGGLNHEQWRSFNNEKKTVD--AKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
++ + GGLN ++R NE + ++ AKN +DGDL+E F LS T ++S+ + V+
Sbjct: 1374 VQSHLPHYGGLNPREYRYCTNEYRDLENAAKNIIDGDLLERFNGLSFTEQTDLSRKIGVT 1433
Query: 1022 VEELCKRVEELTR 1034
E L + ++ R
Sbjct: 1434 REALLDDMMDVQR 1446
Score = 55.5 bits (132), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 71/339 (20%), Positives = 146/339 (43%), Gaps = 62/339 (18%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSD----RIGR 61
++G + + R G +D L ++ E K +++++ ++ L T +M D R R
Sbjct: 81 LWGIVVCMTRLRLAGSVRDSLLLSIEESKCVIVEYEPDTGSLSTISMHFFQDEDLRRGFR 140
Query: 62 PTDNGQIGIIDPDCRLIGLHLYDGLFKVIPF------DNKGQLKEA-------FNIRLEE 108
+ + +D R + +Y V+PF D GQ +A F + +
Sbjct: 141 KLSSMALARVDGFNRCAAVLVYGSYLAVLPFRRSTERDLSGQRHQAVFYENSGFIANMID 200
Query: 109 LQ--------VLDIKFLYGCAKPTIVVLYQ----------DNKDARHVKTYEVALKDKDF 150
LQ VLD +FL G PTI++LY+ + +D + + L D+
Sbjct: 201 LQSLPVKIASVLDFQFLEGYNDPTILLLYEALPTWTGRVTERQDTCGMVALSINLIDETH 260
Query: 151 VEGP--WSQNNL---DNGADLLIPVPPPLCGVLIIGEETIVYCSAN------AFKAIPIR 199
P W L + + L P+P PL G L+ +++Y + A ++P+
Sbjct: 261 ---PVIWQMAGLPFPNPYSSALFPIPKPLGGSLLFATNSLIYLDQSVPPYGVALNSLPLG 317
Query: 200 PS----ITKAYGRVDADGSRY-LLGDHAGLLHL-----LVITHEKEKVTGLK---IELLG 246
+ T+ ++ + +L D + + L +IT +K+ + ++ ++ +
Sbjct: 318 CTNFALKTQDVAPLNLQNCKACMLSDDSICVSLESGDVYIITLKKDSLNNVRRFYLDQVA 377
Query: 247 ETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGS 285
+ I +T+S L + ++++GS G+S L++ + ++K S
Sbjct: 378 SSVIPTTLSKLSDNLIFLGSRLGNSLLLRYKCKENSKKS 416
>gi|118346970|ref|XP_001006962.1| CPSF A subunit region family protein [Tetrahymena thermophila]
gi|89288729|gb|EAR86717.1| CPSF A subunit region family protein [Tetrahymena thermophila SB210]
Length = 1112
Score = 84.3 bits (207), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 193/935 (20%), Positives = 380/935 (40%), Gaps = 133/935 (14%)
Query: 176 GVLIIGEETIVYCSANAFKAIPIR---PSITKAYGRVDADGSRYLLGDHAGLLHLLVITH 232
G+++ + I Y S N I P + V D + ++ A L + +
Sbjct: 234 GLILFSDYKIFYYSYNDKSTIEFTTPFPDEVICHSHV-KDTTYFIATQDAAFQSSLYLVN 292
Query: 233 EKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLN--LQPDAKGSYVEVL 290
+ I+ LG S ++I L ++I S D ++IK++ + +G ++EV
Sbjct: 293 ILASSRSIDIKYLGRISPPTSIIQLPEQHIFITSVDDDCKVIKVHDKFIKNLEG-HLEVK 351
Query: 291 ERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKG 349
+ NL I + +++ + Q + S K+ S+ I + G+ + ++ ++ I+
Sbjct: 352 LKLSNLSSIYNMKLLESDF---NQRLIISSPRKEQSMNIFQKGMSLIQKGEIKFNSPIRE 408
Query: 350 MWSLRSSTDDPFDTFLVVSFISETRIL-----------AMNLEDELEETEIEGFCSQTQT 398
M + S F+VVSF +T IL MN+ L S++Q
Sbjct: 409 MLVVHS-------VFIVVSFDFQTLILKLDDKKGSFNVVMNINCGLLNVFALESSSESQK 461
Query: 399 LFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHL 458
L C V++ S++L+ T + +E + + ++T N + ++ A +L
Sbjct: 462 LVCL---------VSAQSIKLIDLTRANILSEI-AISNKRIILSTINKNILITADSTNNL 511
Query: 459 VYLEIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNL 518
+ +++V+ S IN SY AV +W + + + D
Sbjct: 512 QAFNFLNNQISQVETFSYNSRYSYSAIN------SYGNYIAVALWFQSYIILINASDPTK 565
Query: 519 ITKEHLGGEIIPRSVLLCAFEGISYLLCALGDG--HLLNFLLNMKTGELT--DRKKVSLG 574
+ + + + RS++ F + LL DG L+ + + + +T ++K + +G
Sbjct: 566 THQIDIPTQKVVRSLI---FHEENKLLIGYSDGIISLVYYSTDDQNNIVTSNNQKNMVIG 622
Query: 575 TQPIT---LRTFSSKNTTHVFAASDRPTVIY---------------SSNKKLLYSNVNLK 616
+P+ LR FSS + S P I S ++ + N++
Sbjct: 623 YKPVKFRQLRLFSS------YCQSSTPACISCISNEVSILSLRKVDESYQRYQKTYFNIQ 676
Query: 617 EVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSI--PLGEHPRRICHQE---- 670
+V+ + P L +AK+ L+IG I+ QK HI S + E C E
Sbjct: 677 DVNQIDEIEINGVP-YLIMAKKNCLSIGGIEVSQKFHIYSFGKDVKEQNEFRCQTELVDV 735
Query: 671 -QSRTFAICSLKNQS---CAEESEMHFVR---LLDDQTFEFISTYPL----DTFEYGCSI 719
++ + +I ++N+ +H ++ LL F Y L D F G
Sbjct: 736 IRNMSISISIVENEQDDVVCSNILLHSIKNQDLLQQLKFNNTCIYSLLVVGDYFFIGGK- 794
Query: 720 LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKET-------KGA 772
+ + Y Y++ EN+ + L F + ++Q +E
Sbjct: 795 -----QEQNQTYEGFLQVYLVDNENKMIQVDKLAFKIPVMQIQKYSENSIAIILGHENFK 849
Query: 773 VYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGD 832
+Y + + + + + Q Q +K++ +E+ S+ + L+L R I++ D
Sbjct: 850 IYQMPYTSAQEYSILKQNTQ-HKFL------QEIVSQSNLNITALSLDCNKRNQ-ILIAD 901
Query: 833 LMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGAT 892
+ I L ++ + + A+ + + + L D+ + +++N N+ R+N
Sbjct: 902 RTQGIQL--FQEQNKKVMLAAKSPFPVFSTHAKFLGDENIVLSDSNTNIVIFRQNDFAEN 959
Query: 893 DEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQ-------IPTVIFGTVNGVIGV 945
D ER +L V+ + G+ V SL ++ DV + VI+GT G IGV
Sbjct: 960 DFERMKLLVLSAINNGDIV----ITSLRRKVNTKDVKDKYITTEGLDQVIYGTQKGWIGV 1015
Query: 946 IASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNN-EKKTVDAKNFLDGDLIESFL 1004
I SL + Y L+ LQ ++ + K ++ +W+S N + D NF+DG++I L
Sbjct: 1016 IMSLNEQAYTVLQDLQESILQKFKCPLKFDYNKWKSIRNIPQSKSDKSNFIDGEIIFKVL 1075
Query: 1005 DLSRTRMDEISKTMNV----SVEELCKRVEELTRL 1035
+S++ + ++ M+ S+ E+ + +E L L
Sbjct: 1076 KMSQSELVQVLDGMSAMPKPSIAEMEQLIENLDLL 1110
>gi|119484094|ref|XP_001261950.1| cleavage and polyadenylation specificity factor subunit A, putative
[Neosartorya fischeri NRRL 181]
gi|148886830|sp|A1DB13.1|CFT1_NEOFI RecName: Full=Protein cft1; AltName: Full=Cleavage factor two protein
1
gi|119410106|gb|EAW20053.1| cleavage and polyadenylation specificity factor subunit A, putative
[Neosartorya fischeri NRRL 181]
Length = 1400
Score = 84.3 bits (207), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 88/371 (23%), Positives = 160/371 (43%), Gaps = 37/371 (9%)
Query: 694 VRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVY-----YCVGTAYVLPEENEPTK 748
++++ +T+ I +Y L EY ++ + N + VGTA+ E+ P++
Sbjct: 1027 LKVVSPRTWTVIDSYSLGPAEYVMAVKNMDLEVSENTHERRNMIVVGTAFAW-GEDIPSR 1085
Query: 749 GRILVFIV-----------EDGKLQLIAEKETKGAVYSLNAFNGK--LLAAINQKIQLYK 795
G I VF V D KL+LI ++ KGAV +L+ G+ L+AA QK +
Sbjct: 1086 GCIYVFEVIKVVPDPEKPETDRKLKLIGKELVKGAVTALSQIGGQGFLIAAQGQKCMVRG 1145
Query: 796 WMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARD 855
L++DG+ + ++ + ++GD +K + Y E + +D
Sbjct: 1146 --LKEDGSLLPVAFMDMQCYVNVVKELKGTGMCIMGDAVKGLWFAGYSEEPYKMSLFGKD 1203
Query: 856 YNANWMSAVEILDDD---IYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVN 912
+ A E L D L A+++ NL ++ + E RL ++H+G F
Sbjct: 1204 QGYLEVVAAEFLPDGDKLFILVADSDCNLHVLQYDPEDPKSSNGDRLLARSKFHMGHFAT 1263
Query: 913 RFRHGSLVM-----RLPDSDVGQIPT------VIFGTVNGVIGVIASLPHEQYLFLEKLQ 961
M + D D +I + V+ + +G +G++ S+P E Y L LQ
Sbjct: 1264 TMTLLPRTMVSSEKAMADPDSMEIDSQTISQQVLITSQSGSVGIVTSVPEESYRRLSALQ 1323
Query: 962 TNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
+ L ++ GLN +R+ E + LDG+L+ +LD+ + R EI+ +
Sbjct: 1324 SQLTNSLEHPCGLNPRAYRAV--ESDGTAGRGMLDGNLLYQWLDMGQHRKMEIAARVGAH 1381
Query: 1022 VEELCKRVEEL 1032
E+ +E +
Sbjct: 1382 EWEIKADLEAI 1392
>gi|405977622|gb|EKC42064.1| Cleavage and polyadenylation specificity factor subunit 1
[Crassostrea gigas]
Length = 1369
Score = 84.3 bits (207), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 90/329 (27%), Positives = 144/329 (43%), Gaps = 32/329 (9%)
Query: 731 YYCVGTAYVLPEENEPTKGRILV-----FIVEDG------KLQLIAEKETKGAVYSLNAF 779
Y +GT L EE ++GR+++ + E G K++ + EKE KG V +L
Sbjct: 1045 YIVMGTNLSLGEEVT-SRGRVIIADIIEVVPEPGMPLTKHKIKTLYEKEQKGPVTALADI 1103
Query: 780 NGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISL 839
NG L+ AI QK LY W L+D+ +L HI + T I+ GD++KS+S+
Sbjct: 1104 NGLLITAIGQK--LYIWQLKDN---DLMGVAFIDTHIYIHTLVTIKHIILAGDILKSVSV 1158
Query: 840 LIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG---AENNFNLFTVRKNSEGATDEER 896
Y+ E + +RD + + L D+ L ++ NL E
Sbjct: 1159 YQYQEEHKVLSIVSRDPRPLEVYTADFLIDNTQLCCLVSDRMKNLVVYSYQPEARESHGG 1218
Query: 897 GRLEVVGEYHLGEFVNRFRHGSLVMRLPDSD--VGQIP----TVIFGTVNGVIGVIASLP 950
RL +++ G V+ + P SD + P F T++G +G + L
Sbjct: 1219 QRLIRKADFNAGSNVSSMFRVRCKLYDPSSDKRMTGAPEKRHITYFATLDGSLGFVLPLS 1278
Query: 951 HEQYLFLEKLQTNLRKVIKGVGGLNHEQWR----SFNNEKKTVDAKNFLDGDLIESFLDL 1006
+ Y L LQ L I V GLN +R +F + KN LDG+L+ + +L
Sbjct: 1279 EKVYRRLFMLQNALVTHIPHVAGLNPRSYRHVIGTFPELRNP--QKNILDGELLWKYTNL 1336
Query: 1007 SRTRMDEISKTMNVSVEELCKRVEELTRL 1035
S EI+K + S +++ + E+ RL
Sbjct: 1337 SIMEKIEIAKRLGTSNDQIMDDLMEIDRL 1365
Score = 45.1 bits (105), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 63/250 (25%), Positives = 112/250 (44%), Gaps = 39/250 (15%)
Query: 241 KIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYV-EVLERYVNLGPI 299
K ++ G T IAS +S ++N +Y YG ++ N SY EV + N+GP
Sbjct: 359 KKKVEGSTEIASDVSQIEN--LYDLEVYGSAE----NPTSTTITSYTFEVCDNIWNIGPC 412
Query: 300 VDFCVVD---LERQGQG------QVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGM 350
+ + + L + ++V SG K+G+L +++ I + EL G M
Sbjct: 413 GNIVMGEPAFLSEEFSSCEDPDIEMVMTSGYGKNGALSVLQRSIRPQVVTTFELPGCLDM 472
Query: 351 WSLRS------------STDDPFD-------TFLVVSFISETRILAMNLEDELEETEIEG 391
W+++S S +D D +FL++S + IL E+ E + G
Sbjct: 473 WTVKSLVPKEKSEDKENSMEDDSDDNIEGGHSFLILSRSDSSMILETG--QEMNELDHSG 530
Query: 392 FCSQTQTLFCHDAIYNQ-LVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVL 450
F +QT T+F + ++ +VQV+ S+RL+ R++++ V + A+ VL
Sbjct: 531 FSTQTTTIFAGNIGGDRYIVQVSDTSLRLLEGV-RQIQHIPLDTGSPVVQCSLADPYIVL 589
Query: 451 LATGGGHLVY 460
L G L++
Sbjct: 590 LTQEGQILMF 599
Score = 44.7 bits (104), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 66/325 (20%), Positives = 131/325 (40%), Gaps = 59/325 (18%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
++G I +++ + G +D L ++ K V+++D + +L T ++ + P+
Sbjct: 13 LFGNIMSMKYVKLPGALRDSLLLSFSEAKLSVVEYDPGTHDLQTTSLHFFEE----PSMK 68
Query: 66 GQIGI--------IDPDCRLIGLHLYDGLFKVIPF---------DN-----KGQLKEAFN 103
G +DPD R + +Y ++PF DN K + ++
Sbjct: 69 GGFFTNYCIPEVRVDPDGRCAAMLVYGTHMVILPFRRDVMVEEGDNLAGTSKSPILSSYI 128
Query: 104 IRLEE-----LQVLDIKFLYGCAKPTIVVLYQD-----NKDARHVKTYEVALKDKDFVEG 153
I L + V D +FL+G +PT+ +L++ + A T + + E
Sbjct: 129 IDLRNFDEKIINVKDFQFLHGYYEPTVFILFEPLQTWAGRTAVRADTCSIVAISLNLQEK 188
Query: 154 P----WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVY---------CSANAFKA----I 196
WS +L ++ VP P+ GV+II +++Y S N+ A
Sbjct: 189 VHPVIWSLGSLPFDCCQVLAVPRPIGGVIIIAVNSLLYLNQSVPPYGVSLNSISAQSTLF 248
Query: 197 PIR--PSITKAYGRVDADGSRY---LLGDHAGLLHLLVITHEKEK-VTGLKIELLGETSI 250
P+R + A A Y +L G L++L + + + V + + +
Sbjct: 249 PLRVQEGVRIALDCCQAAFMSYDKIVLSLKGGELYVLTLVVDGMRSVRSFNFDKSAASVL 308
Query: 251 ASTISYLDNAVVYIGSSYGDSQLIK 275
S + ++ +++GS G+S L+K
Sbjct: 309 TSCMCICEDGFLFLGSRLGNSLLLK 333
>gi|154342093|ref|XP_001566998.1| putative CPSF-domain protein [Leishmania braziliensis
MHOM/BR/75/M2904]
gi|134064323|emb|CAM40524.1| putative CPSF-domain protein [Leishmania braziliensis
MHOM/BR/75/M2904]
Length = 1347
Score = 84.0 bits (206), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 108/465 (23%), Positives = 185/465 (39%), Gaps = 101/465 (21%)
Query: 637 KEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHF--- 693
++G L IG+++ ++K +PLGE ++ +A+ K E E+ F
Sbjct: 798 QDGRLHIGSLEALEKTSRTFLPLGETVTQLHETAAWGGYAVAVRK----VEGDEVLFLPS 853
Query: 694 ------------------VRLLDDQTFEFISTYPLDTFEYG-----C-------SILSCS 723
V LL+++ F+ T LD E G C +I S
Sbjct: 854 SVIQSPWTADLGLWRTAAVPLLENERCVFLQTVRLDGSEGGGPAERCGEQIDVPAITSAG 913
Query: 724 FSD-DSNVYYCVGTAYVLPEENEPTKGRILVFIVED----GKLQLIAEKETKGAVYS--- 775
S+ D VG+++ P+E GR++ F + + +L+LIA K+ GA+
Sbjct: 914 VSEEDWQHLLLVGSSFTFPDEQRARSGRVMWFALHEERQGQRLRLIASKDIGGALQCCAE 973
Query: 776 LNAFNGKLLAAINQKIQLYKWMLRDDG-------------TR--ELQSECGHHGHILALY 820
+ + G++ +N + LYKW D TR L + ++AL
Sbjct: 974 VPYYKGRIALGVNGCVCLYKWNTEDQTFVAEERCRVGLTVTRLIPLYNTALAASVLVALD 1033
Query: 821 VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFN 880
V+ FI V L S+ +L EG + Y + + + DD++ A
Sbjct: 1034 VRHSAFFIEVDLLQGSLKVLC---REGNLRGVMDGYVGSDAENLCLFDDNLNFTALKVVP 1090
Query: 881 LFT---VRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSL----VMRLPDSDVGQI-- 931
L + A+ + R EV +YH+G+ V R GS +M+ P
Sbjct: 1091 LPVEPGDGDAAAAASGTPQCRFEVRAQYHVGDLVTCVRPGSFAATSLMKAPTPSSSVPSP 1150
Query: 932 --------PTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWR--- 980
P ++F T +G GV+ L YL L L+ +L + + +GGL+H+ +R
Sbjct: 1151 LLLPGIAGPQLVFATAHGGFGVVTPLHAATYLVLRALEASLERTLPPLGGLSHQAFREVL 1210
Query: 981 -----------------SFNNEK-KTVDAKNFLDGDLIESFLDLS 1007
+ E+ + ++ N +DGD++E FL LS
Sbjct: 1211 RAGQERGVSYLASKVGCALTRERLRRYESLNTVDGDMVEQFLWLS 1255
>gi|341892673|gb|EGT48608.1| CBN-CPSF-1 protein [Caenorhabditis brenneri]
Length = 1440
Score = 84.0 bits (206), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 79/311 (25%), Positives = 142/311 (45%), Gaps = 28/311 (9%)
Query: 739 VLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWML 798
V+PE +PT R K++++ +KE KG V L A G LL+ + QK+ + W
Sbjct: 1140 VVPEPGQPTSNR---------KIKVLFDKEQKGPVTGLCAMEGLLLSGMGQKV--FIWQF 1188
Query: 799 RDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYN- 857
+D+ L S H ++ L+ + + D +S+SL+ ++ E A+ +RD
Sbjct: 1189 KDNDLMGL-SFLDMHYYVYQLH--SLRSIALACDARESMSLIRFQEENKAMSVASRDDRK 1245
Query: 858 -ANWMSAVEILDDDIYLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
A A + + D ++G ++ N N+ E RL V ++G +N
Sbjct: 1246 CAQAPMAAQFMVDGAHIGFLLSDENGNITLFNYAPEAPESNGGERLTVRAAINIGTNINA 1305
Query: 914 FR----HGSLV--MRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKV 967
F H +L+ Q + IF +++G G I L + Y L LQT + V
Sbjct: 1306 FLRVKGHTALLNLHEFEKEAAEQRMSTIFASLDGSFGFIRPLTEKSYRRLHFLQTFIGSV 1365
Query: 968 IKGVGGLNHEQWRSFNNEKKTVD---AKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEE 1024
+ + GL+ + RS + V+ A+N +DGD++E +L+LS ++++ + V
Sbjct: 1366 SQQIAGLHIKGARSAKPPQPIVNGRNARNLIDGDVVEQYLNLSTYDKTDLARRLGVGKYH 1425
Query: 1025 LCKRVEELTRL 1035
+ + EL R+
Sbjct: 1426 IIDDLMELRRM 1436
Score = 77.8 bits (190), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 123/532 (23%), Positives = 210/532 (39%), Gaps = 112/532 (21%)
Query: 72 DPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEEL-----QVLDIKFLYGCAKPTIV 126
DP+ R +Y ++PF + ++ I L+++ V DI FL G +PTI+
Sbjct: 144 DPENRCAASLVYGKHIAILPFHENSKRIHSYIIPLKQIDPRLDNVADIVFLDGYYEPTIL 203
Query: 127 VLYQD----------NKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCG 176
LY+ D + V + D+ F W NL L+P+P PL G
Sbjct: 204 FLYEPLQTTPGRACVRYDTMCIMGVSVNIVDRQFAV-VWQTANLPMDCATLLPIPKPLGG 262
Query: 177 VLIIGEETIVYCSA-------------NAFKAIPIRPSITKAYGRVDADGSRYL------ 217
++ G TIVY + + F P++ + +D S Y+
Sbjct: 263 AIVFGSNTIVYLNQAVPPCGIVLNSCYDGFTKFPLK-DMKSMKMTLDCSTSVYMEDGRIA 321
Query: 218 LGDHAGLLHL--LVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIK 275
+G G L L LV + V L+ + +TSIA T++ +++GS GDSQL++
Sbjct: 322 VGTRDGELFLLRLVTSSGGATVKSLEFSKVWDTSIAYTLTVCAPGHLFLGSRLGDSQLLE 381
Query: 276 LNL-----------QPDAKGSYVEV---------------------------------LE 291
+L + + + ++VE L+
Sbjct: 382 YSLIKTTRESVKRHKMEQEQNHVEAELDEDDLELYGGAIEEQQNDDEEQITESLQFSELD 441
Query: 292 RYVNLGPIVDFCV----------VDLERQGQ-GQVVTCSGAYKDGSLRIVRNGIGINEQA 340
R N+GP+ CV VD +R+ V+T SG K+GSL + + +
Sbjct: 442 RLRNIGPVKSMCVGRPNYMSNDLVDAKRRDPVFDVITASGHGKNGSLCVHQRSLRPEIVT 501
Query: 341 SVELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLF 400
S L+G + +W++ ++ +L+VS I T +L + E E + F + T+
Sbjct: 502 SSLLEGAEQLWAVGRKENESH-KYLIVSRIRSTLVLELGEELIELEEPL--FVTGQPTVA 558
Query: 401 CHDAIYNQL-VQVTSGSVRLVSSTSR--ELRNEWKSPPGYSVNVATANASQVLLATGGGH 457
+ VQVTS S+ LV+ + E++ + P V + + +L G
Sbjct: 559 AGELSQGAFAVQVTSTSIALVTDGQQLAEVKIDSNFP---VVQASIVDPYVAVLTQNGRL 615
Query: 458 LVYLEIGDGI--LTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDIS 507
L+Y + + L E+ AQ + S SQI ++ M+ D S
Sbjct: 616 LLYTLVSNPYMQLQEIDLAQTPFSTFIAQ--------SASQITSISMYADAS 659
>gi|358348136|ref|XP_003638105.1| Pre-mRNA-splicing factor rse1 [Medicago truncatula]
gi|355504040|gb|AES85243.1| Pre-mRNA-splicing factor rse1 [Medicago truncatula]
Length = 1370
Score = 84.0 bits (206), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 61/236 (25%), Positives = 116/236 (49%), Gaps = 28/236 (11%)
Query: 269 GDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLR 328
GDS ++KL D + + +++ N+ PI D D + Q+ C G +GSLR
Sbjct: 441 GDSVVLKLK---DGRLCFTNLIQ---NIAPIFDVTSGDYHDEKHDQMFACCGVTPEGSLR 494
Query: 329 IVRNGIGINE--QASVELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEE 386
++++GI + + + +G+ G W++R D + +FLV+SF+ ETRIL++ L +
Sbjct: 495 VIQSGINVEKLLRTPSTYEGVAGTWTVRMKISDQYHSFLVLSFLGETRILSVGLSFT-DV 553
Query: 387 TEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSST----------SRELRNEWKSPPG 436
T+ GF TL C LVQ+ +V+L T S + W P
Sbjct: 554 TDSVGFQPNVCTLACGLVSDGLLVQIYQSAVKLCLPTKDGHSEGIPLSSPICTSWY-PDN 612
Query: 437 YSVNVATANASQVLLATGGGHLVYLEIGDGILT-------EVKHAQLEYEISCLDI 485
++++ + ++++T +++ +G +L+ E++H +L+ E+SC+ I
Sbjct: 613 LNISLGAVGHNFIVVSTSNPCFLFI-LGVRMLSAYQYEIYEMQHLELQNEVSCISI 667
Score = 53.9 bits (128), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 52/205 (25%), Positives = 86/205 (41%), Gaps = 23/205 (11%)
Query: 828 IVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKN 887
I VGD I Y E +E+ D + ++ ++DD+ + ++ ++ + +
Sbjct: 1134 IAVGDNRDGILFFSYHEEARKLEQLYGDPSQRLVADCILMDDNTAIVSDRKGSIAVLCSD 1193
Query: 888 ----SEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV---GQIP-------- 932
A+ E RL Y + E R GS RLP D+ G P
Sbjct: 1194 HLEAPNNASTECNLRLSCA--YFMAEIAVSIRKGSYSYRLPADDLLSGGIGPKTNVDSLQ 1251
Query: 933 -TVIFGTVNGVIGVIASLPHEQYLFLEKLQTNL--RKVIKGVGGLNHEQWRSFNNEKKTV 989
T++ T+ G I + L E+Y LE +Q L + V G +H ++RS N V
Sbjct: 1252 NTILVSTLLGSIMIFIPLSREEYELLEAVQARLAVHHLTAPVLGNDHNEFRSRENP---V 1308
Query: 990 DAKNFLDGDLIESFLDLSRTRMDEI 1014
LDGD++ FL+L+ + + I
Sbjct: 1309 GTPKILDGDMLTQFLELTNMQQNNI 1333
Score = 46.2 bits (108), Expect = 0.085, Method: Compositional matrix adjust.
Identities = 42/193 (21%), Positives = 84/193 (43%), Gaps = 15/193 (7%)
Query: 573 LGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDS 632
+G P+ L + A SDRP +++S+ + Y++++ + SH P S P
Sbjct: 817 IGITPVFLVPLDDTLDADIIALSDRPWLLHSARHSISYTSISFQPSSHATPVCSIDCPKG 876
Query: 633 LAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAI--CSLKNQSCAEESE 690
+ E L + + ++L++R L PR++ + +S+ + L +C +
Sbjct: 877 ILFVAENSLHLVEMVYSKRLNMRKFHLKGTPRKVLYHNESQMLLVMRTELSIGTCLSD-- 934
Query: 691 MHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTA-YVLPE-----EN 744
+ +D + +S++ L+ E S+ + + VGT+ Y P E
Sbjct: 935 ---ICCVDPLSGSVLSSFRLELGETATSMELIRVGSEQVLV--VGTSLYSGPPAIPSGEA 989
Query: 745 EPTKGRILVFIVE 757
E KGR+LV ++
Sbjct: 990 ESAKGRLLVLCID 1002
>gi|146324727|ref|XP_747211.2| cleavage and polyadenylation specificity factor subunit A, putative
[Aspergillus fumigatus Af293]
gi|148886828|sp|Q4WCL1.2|CFT1_ASPFU RecName: Full=Protein cft1; AltName: Full=Cleavage factor two protein
1
gi|129556124|gb|EAL85173.2| cleavage and polyadenylation specificity factor subunit A, putative
[Aspergillus fumigatus Af293]
Length = 1401
Score = 83.6 bits (205), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 91/371 (24%), Positives = 159/371 (42%), Gaps = 37/371 (9%)
Query: 694 VRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVY-----YCVGTAYVLPEENEPTK 748
++++ +T+ I +Y L EY ++ + N + VGTA+ E+ P++
Sbjct: 1028 LKVVSPRTWTVIDSYSLGPDEYVMAVKNMDLEVSENTHERRNMIVVGTAFAR-GEDIPSR 1086
Query: 749 GRILVFIV-----------EDGKLQLIAEKETKGAVYSLNAFNGK--LLAAINQKIQLYK 795
G I VF V D KL+LI ++ KGAV +L+ G+ L+AA QK +
Sbjct: 1087 GCIYVFEVIKVVPDPEKPETDRKLKLIGKELVKGAVTALSQIGGQGFLIAAQGQKCMVRG 1146
Query: 796 WMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARD 855
L++DG+ + ++ L ++GD +K + Y E + +D
Sbjct: 1147 --LKEDGSLLPVAFMDMQCYVNVLKELKGTGMCIMGDAVKGLWFAGYSEEPYKMSLFGKD 1204
Query: 856 YNANWMSAVEILDDD---IYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVN 912
+ A E L D L A+++ NL ++ + E RL ++H+G F
Sbjct: 1205 QGYLEVVAAEFLPDGDKLFILVADSDCNLHVLQYDPEDPKSSNGDRLLARSKFHMGHFAT 1264
Query: 913 RFR-------HGSLVMRLPDS---DVGQIPT-VIFGTVNGVIGVIASLPHEQYLFLEKLQ 961
M PDS D I V+ + +G +G++ S+P E Y L LQ
Sbjct: 1265 TMTLLPRTMVSSEKAMANPDSMEIDSQTISQQVLITSQSGSVGIVTSVPEESYRRLSALQ 1324
Query: 962 TNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
+ L ++ GLN +R+ E + LDG+L+ +LD+ + R EI+ +
Sbjct: 1325 SQLANSLEHPCGLNPRAYRAV--ESDGTAGRGMLDGNLLYQWLDMGQHRKMEIAARVGAH 1382
Query: 1022 VEELCKRVEEL 1032
E+ +E +
Sbjct: 1383 EWEIKADLEAI 1393
>gi|159123784|gb|EDP48903.1| cleavage and polyadenylation specificity factor subunit A, putative
[Aspergillus fumigatus A1163]
Length = 1401
Score = 83.6 bits (205), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 91/371 (24%), Positives = 159/371 (42%), Gaps = 37/371 (9%)
Query: 694 VRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVY-----YCVGTAYVLPEENEPTK 748
++++ +T+ I +Y L EY ++ + N + VGTA+ E+ P++
Sbjct: 1028 LKVVSPRTWTVIDSYSLGPDEYVMAVKNMDLEVSENTHERRNMIVVGTAFAR-GEDIPSR 1086
Query: 749 GRILVFIV-----------EDGKLQLIAEKETKGAVYSLNAFNGK--LLAAINQKIQLYK 795
G I VF V D KL+LI ++ KGAV +L+ G+ L+AA QK +
Sbjct: 1087 GCIYVFEVIKVVPDPEKPETDRKLKLIGKELVKGAVTALSQIGGQGFLIAAQGQKCMVRG 1146
Query: 796 WMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARD 855
L++DG+ + ++ L ++GD +K + Y E + +D
Sbjct: 1147 --LKEDGSLLPVAFMDMQCYVNVLKELKGTGMCIMGDAVKGLWFAGYSEEPYKMSLFGKD 1204
Query: 856 YNANWMSAVEILDDD---IYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVN 912
+ A E L D L A+++ NL ++ + E RL ++H+G F
Sbjct: 1205 QGYLEVVAAEFLPDGDKLFILVADSDCNLHVLQYDPEDPKSSNGDRLLARSKFHMGHFAT 1264
Query: 913 RFR-------HGSLVMRLPDS---DVGQIPT-VIFGTVNGVIGVIASLPHEQYLFLEKLQ 961
M PDS D I V+ + +G +G++ S+P E Y L LQ
Sbjct: 1265 TMTLLPRTMVSSEKAMANPDSMEIDSQTISQQVLITSQSGSVGIVTSVPEESYRRLSALQ 1324
Query: 962 TNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
+ L ++ GLN +R+ E + LDG+L+ +LD+ + R EI+ +
Sbjct: 1325 SQLANSLEHPCGLNPRAYRAV--ESDGTAGRGMLDGNLLYQWLDMGQHRKMEIAARVGAH 1382
Query: 1022 VEELCKRVEEL 1032
E+ +E +
Sbjct: 1383 EWEIKADLEAI 1393
>gi|302504585|ref|XP_003014251.1| hypothetical protein ARB_07556 [Arthroderma benhamiae CBS 112371]
gi|291177819|gb|EFE33611.1| hypothetical protein ARB_07556 [Arthroderma benhamiae CBS 112371]
Length = 460
Score = 83.6 bits (205), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 89/353 (25%), Positives = 159/353 (45%), Gaps = 39/353 (11%)
Query: 700 QTFEFISTYPLDTFEYGCSILSCSF-SDDSNVYYCVGTAY-VLPEENEPTKGRILVF-IV 756
T +S L+ E SI + SF S + + VGT ++ T G I ++
Sbjct: 96 HTKSVLSNLELEDNEAAVSIAAVSFTSQEDETFLVVGTGKDMVVSPRTFTCGFIHIYRFQ 155
Query: 757 EDGK-LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSEC-GHHG 814
E+GK L+ I + + + +L F G+LLA I +++Y D G R+L +C
Sbjct: 156 EEGKELEFIHKTKVEQPPLALLGFQGRLLAGIGPDLRIY-----DLGMRQLLRKCQAQIT 210
Query: 815 HILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG 874
+ + +QT+G I+V D+ +S++ ++YK++E A+ A D W + ++D + G
Sbjct: 211 PRVIVGLQTQGSRIIVSDVQESVTYVVYKYQENALIPFADDIIPRWTTCTTMVDYETVAG 270
Query: 875 AENNFNLFTVR---KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQI 931
+ N++ +R K SE A ++ G + +L NR SLV+ D I
Sbjct: 271 GDKFGNIWLLRCPTKASEEADEDGSGAHLIHERQYLQGAPNRL---SLVIHFYSQD---I 324
Query: 932 PTVIFGT--------------VNGVIGV-IASLPHEQYLFLEKLQTNLRKVIKGVGGLNH 976
PT I T + G +G+ + + + F + L+ L + G +H
Sbjct: 325 PTSIQKTQLVAGGRDILVWTGLQGTVGMFVPFITRDDVDFFQTLEMQLASQNPPLAGRDH 384
Query: 977 EQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRV 1029
+R + K +DGDL E+FL L + I+ ++ SV E+ +++
Sbjct: 385 LIYRGY-----YAPCKGVIDGDLCETFLLLPNDKKQAIAGELDRSVREIERKI 432
>gi|302654423|ref|XP_003019019.1| hypothetical protein TRV_07032 [Trichophyton verrucosum HKI 0517]
gi|291182709|gb|EFE38374.1| hypothetical protein TRV_07032 [Trichophyton verrucosum HKI 0517]
Length = 460
Score = 83.6 bits (205), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 89/353 (25%), Positives = 159/353 (45%), Gaps = 39/353 (11%)
Query: 700 QTFEFISTYPLDTFEYGCSILSCSF-SDDSNVYYCVGTAY-VLPEENEPTKGRILVF-IV 756
T +S L+ E SI + SF S + + VGT ++ T G I ++
Sbjct: 96 HTKSVLSNLELEDNEAAVSIAAVSFTSQEDETFLVVGTGKDMVVSPRTFTCGFIHIYRFQ 155
Query: 757 EDGK-LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSEC-GHHG 814
E+GK L+ I + + + +L F G+LLA I +++Y D G R+L +C
Sbjct: 156 EEGKELEFIHKTKVEQPPLALLGFQGRLLAGIGPDLRIY-----DLGMRQLLRKCQAQIT 210
Query: 815 HILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG 874
+ + +QT+G I+V D+ +S++ ++YK++E A+ A D W + ++D + G
Sbjct: 211 PRVIVGLQTQGSRIIVSDVQESVTYVVYKYQENALIPFADDIIPRWTTCTTMVDYETVAG 270
Query: 875 AENNFNLFTVR---KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQI 931
+ N++ +R K SE A ++ G + +L NR SLV+ D I
Sbjct: 271 GDKFGNIWLLRCPTKASEEADEDGSGAHLIHERQYLQGAPNRL---SLVIHFYSQD---I 324
Query: 932 PTVIFGT--------------VNGVIGV-IASLPHEQYLFLEKLQTNLRKVIKGVGGLNH 976
PT I T + G +G+ + + + F + L+ L + G +H
Sbjct: 325 PTSIQKTQLVAGGRDILVWTGLQGTVGMFVPFITRDDVDFFQTLEMQLASQNPPLAGRDH 384
Query: 977 EQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRV 1029
+R + K +DGDL E+FL L + I+ ++ SV E+ +++
Sbjct: 385 LIYRGY-----YAPCKGVIDGDLCETFLLLPNDKKQAIAGELDRSVREIERKI 432
>gi|358348106|ref|XP_003638090.1| DNA damage-binding protein, partial [Medicago truncatula]
gi|355504025|gb|AES85228.1| DNA damage-binding protein, partial [Medicago truncatula]
Length = 642
Score = 83.6 bits (205), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 61/236 (25%), Positives = 116/236 (49%), Gaps = 28/236 (11%)
Query: 269 GDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLR 328
GDS ++KL D + + +++ N+ PI D D + Q+ C G +GSLR
Sbjct: 261 GDSVVLKLK---DGRLCFTNLIQ---NIAPIFDVTSGDYHDEKHDQMFACCGVTPEGSLR 314
Query: 329 IVRNGIGINE--QASVELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEE 386
++++GI + + + +G+ G W++R D + +FLV+SF+ ETRIL++ L +
Sbjct: 315 VIQSGINVEKLLRTPSTYEGVAGTWTVRMKISDQYHSFLVLSFLGETRILSVGLSFT-DV 373
Query: 387 TEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSST----------SRELRNEWKSPPG 436
T+ GF TL C LVQ+ +V+L T S + W P
Sbjct: 374 TDSVGFQPNVCTLACGLVSDGLLVQIYQSAVKLCLPTKDGHSEGIPLSSPICTSWY-PDN 432
Query: 437 YSVNVATANASQVLLATGGGHLVYLEIGDGILT-------EVKHAQLEYEISCLDI 485
++++ + ++++T +++ +G +L+ E++H +L+ E+SC+ I
Sbjct: 433 LNISLGAVGHNFIVVSTSNPCFLFI-LGVRMLSAYQYEIYEMQHLELQNEVSCISI 487
>gi|121719617|ref|XP_001276507.1| cleavage and polyadenylation specificity factor subunit A, putative
[Aspergillus clavatus NRRL 1]
gi|148886827|sp|A1C3U1.1|CFT1_ASPCL RecName: Full=Protein cft1; AltName: Full=Cleavage factor two protein
1
gi|119404719|gb|EAW15081.1| cleavage and polyadenylation specificity factor subunit A, putative
[Aspergillus clavatus NRRL 1]
Length = 1401
Score = 83.6 bits (205), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 96/413 (23%), Positives = 172/413 (41%), Gaps = 56/413 (13%)
Query: 654 IRSIPLGEHPRRICHQEQSRTFAICSLKNQ--SCAEESEMH-----------------FV 694
++ + +GEH + + S T+ + + + E+ E+H +
Sbjct: 969 LKKVAIGEHVDHLAYSISSETYVLGTSHSADFKLPEDDELHPEWRNEAISFLPELRQCCL 1028
Query: 695 RLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVY-----YCVGTAYVLPEENEPTKG 749
+++ +T+ I +Y L E ++ + + N + VGTA E+ P +G
Sbjct: 1029 KVVHPKTWTVIDSYTLGPDEEIMAVKNMNLEVSENTHERKNMIVVGTALAR-GEDIPARG 1087
Query: 750 RILVFIV-----------EDGKLQLIAEKETKGAVYSLNAFNGK--LLAAINQKIQLYKW 796
I VF V D KL+LI ++ KGAV +L+ G+ L+AA QK +
Sbjct: 1088 CIYVFEVIKVVPDPEKPETDRKLKLIGKELVKGAVTALSEIGGQGFLIAAQGQKCMVRG- 1146
Query: 797 MLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDY 856
L++DG+ + ++ L +VGD K I Y E + +D
Sbjct: 1147 -LKEDGSLLPVAFMDVQCYVNVLKELKGTGMCIVGDAFKGIWFAGYSEEPYKMSLFGKDL 1205
Query: 857 NANWMSAVEILDDD---IYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR 913
+ A + L D L A+++ NL ++ E +L V ++H+G F +
Sbjct: 1206 EYPEVVAADFLPDGDKLFILVADSDCNLHVLQYEPEDPMSSNGDKLLVRSKFHMGHFTST 1265
Query: 914 FR-----HGSLVMRLPDSDVGQIPT------VIFGTVNGVIGVIASLPHEQYLFLEKLQT 962
S + DSD ++ V+ + +G IG++ S+P E Y L LQ+
Sbjct: 1266 LTLLPRTTASYEIPSADSDSMEVDPRITPQQVLITSQSGSIGIVTSIPEESYRRLSALQS 1325
Query: 963 NLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEIS 1015
L ++ GLN +R+ E + LDG+L+ +L +S+ R EI+
Sbjct: 1326 QLANTVEHPCGLNPRAYRAI--ESDGTAGRGMLDGNLLYQWLSMSKQRRMEIA 1376
>gi|358348118|ref|XP_003638096.1| DNA damage-binding protein, partial [Medicago truncatula]
gi|355504031|gb|AES85234.1| DNA damage-binding protein, partial [Medicago truncatula]
Length = 822
Score = 83.6 bits (205), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 61/236 (25%), Positives = 116/236 (49%), Gaps = 28/236 (11%)
Query: 269 GDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLR 328
GDS ++KL D + + +++ N+ PI D D + Q+ C G +GSLR
Sbjct: 441 GDSVVLKLK---DGRLCFTNLIQ---NIAPIFDVTSGDYHDEKHDQMFACCGVTPEGSLR 494
Query: 329 IVRNGIGINE--QASVELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEE 386
++++GI + + + +G+ G W++R D + +FLV+SF+ ETRIL++ L +
Sbjct: 495 VIQSGINVEKLLRTPSTYEGVAGTWTVRMKISDQYHSFLVLSFLGETRILSVGLSFT-DV 553
Query: 387 TEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSST----------SRELRNEWKSPPG 436
T+ GF TL C LVQ+ +V+L T S + W P
Sbjct: 554 TDSVGFQPNVCTLACGLVSDGLLVQIYQSAVKLCLPTKDGHSEGIPLSSPICTSWY-PDN 612
Query: 437 YSVNVATANASQVLLATGGGHLVYLEIGDGILT-------EVKHAQLEYEISCLDI 485
++++ + ++++T +++ +G +L+ E++H +L+ E+SC+ I
Sbjct: 613 LNISLGAVGHNFIVVSTSNPCFLFI-LGVRMLSAYQYEIYEMQHLELQNEVSCISI 667
>gi|25148482|ref|NP_500157.2| Protein CPSF-1 [Caenorhabditis elegans]
gi|22096347|sp|Q9N4C2.2|CPSF1_CAEEL RecName: Full=Probable cleavage and polyadenylation specificity
factor subunit 1; AltName: Full=Cleavage and
polyadenylation specificity factor 160 kDa subunit;
Short=CPSF 160 kDa subunit
gi|373220398|emb|CCD73182.1| Protein CPSF-1 [Caenorhabditis elegans]
Length = 1454
Score = 83.6 bits (205), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 76/312 (24%), Positives = 146/312 (46%), Gaps = 29/312 (9%)
Query: 739 VLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWML 798
V+PE ++PT R K++++ +KE KG V L A NG LL + QK+ + W
Sbjct: 1153 VVPEPDQPTSNR---------KIKVLFDKEQKGPVTGLCAINGLLLCGMGQKV--FIWQF 1201
Query: 799 RDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYN- 857
+D+ + S H ++ L+ + + D +S+SL+ ++ + A+ +RD
Sbjct: 1202 KDNDLMGI-SFLDMHYYVYQLH--SLRTIAIACDARESMSLIRFQEDNKAMSIASRDDRK 1258
Query: 858 -ANWMSAVEILDDDIYLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVN- 912
A A +++ D ++G ++ N+ E RL V ++G +N
Sbjct: 1259 CAQPPMASQLVVDGAHVGFLLSDETGNITMFNYAPEAPESNGGERLTVRAAINIGTNINA 1318
Query: 913 --RFRHGSLVMRLPDSD----VGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRK 966
R R + +++L + D + Q T +F +++G G + L + Y L LQT +
Sbjct: 1319 FVRLRGHTSLLQLNNEDEKEAIEQRMTTVFASLDGSFGFVRPLTEKSYRRLHFLQTFIGS 1378
Query: 967 VIKGVGGLNHEQWRSFNNEKKTV---DAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVE 1023
V + GL+ + RS + V +A+N +DGD++E +L LS ++++ + V
Sbjct: 1379 VTPQIAGLHIKGSRSAKPSQPIVNGRNARNLIDGDVVEQYLHLSLYDKTDLARRLGVGRY 1438
Query: 1024 ELCKRVEELTRL 1035
+ + +L R+
Sbjct: 1439 HIIDDLMQLRRM 1450
Score = 70.5 bits (171), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 87/395 (22%), Positives = 153/395 (38%), Gaps = 96/395 (24%)
Query: 72 DPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEEL-----QVLDIKFLYGCAKPTIV 126
DP R +Y ++PF + ++ I L+++ + D+ FL G +PTI+
Sbjct: 145 DPSNRCAACLVYGKHIAILPFHENSKRIHSYVIPLKQIDPRLDNIADMVFLDGYYEPTIL 204
Query: 127 VLYQDNK----------DARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCG 176
LY+ + D + V + D+ F W NL L+P+P PL G
Sbjct: 205 FLYEPIQTTPGRACVRYDTMCIMGVSVNIVDRQFAV-VWQTANLPMDCSQLLPIPKPLGG 263
Query: 177 VLIIGEETIVYCSA-------------NAFKAIPIRPSITKAYGRVDADGSRYL------ 217
L+ G T+VY + + F P++ + +D S Y+
Sbjct: 264 ALVFGSNTVVYLNQAVPPCGLVLNSCYDGFTKFPLK-DLKHLKMTLDCSTSVYMEDGRIA 322
Query: 218 LGDHAGLLHL--LVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIK 275
+G G L L L+ + V L+ + ETSIA +++ +++GS GDSQL++
Sbjct: 323 VGSRDGDLFLLRLMTSSGGGTVKSLEFSKVYETSIAYSLTVCAPGHLFVGSRLGDSQLLE 382
Query: 276 LNL---------------QPDAKGSYVEV------------------------------- 289
L D + +E+
Sbjct: 383 YTLLKTTRDCAVKRLKIDNKDPAAAEIELDEDDMELYGGAIEEQQNDDDEQIDESLQFRE 442
Query: 290 LERYVNLGPIVDFCV----------VDLERQGQ-GQVVTCSGAYKDGSLRIVRNGIGINE 338
L+R N+GP+ CV VD +R+ +VT SG K+G+L + + +
Sbjct: 443 LDRLRNVGPVKSMCVGRPNYMSNDLVDAKRRDPVFDLVTASGHGKNGALCVHQRSLRPEI 502
Query: 339 QASVELQGIKGMWSLRSSTDDPFDTFLVVSFISET 373
S L+G + +W++ ++ +L+VS + T
Sbjct: 503 ITSSLLEGAEQLWAVGRKENESH-KYLIVSRVRST 536
>gi|148886829|sp|A2R919.1|CFT1_ASPNC RecName: Full=Protein cft1; AltName: Full=Cleavage factor two protein
1
gi|134083776|emb|CAK47110.1| unnamed protein product [Aspergillus niger]
Length = 1383
Score = 83.2 bits (204), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 97/438 (22%), Positives = 179/438 (40%), Gaps = 61/438 (13%)
Query: 654 IRSIPLGEHPRRICHQEQSRTFAI--CSLKNQSCAEESEMH-----------------FV 694
++ + LGE + + S + + C + E+ E+H F+
Sbjct: 942 LKRVHLGEQVDHLAYSTSSGMYVLGTCHATDFKLPEDDELHPEWRNEAISFFPSARGSFI 1001
Query: 695 RLLDDQTFE---------FISTYPLDTFEYGCSILSCSFSDDSNVY-----YCVGTAYVL 740
+L+ D + + ++ L EY +I + S N + VGTA+
Sbjct: 1002 KLVWDHHLQRQDSVILIFHLHSFSLGADEYVMAIKNISLEVSENTHERKDMIVVGTAFAR 1061
Query: 741 PEENEPTKGRILVFIV-----------EDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQ 789
E+ P++G I VF V D KL+LI ++ KGAV +L+ G+ + Q
Sbjct: 1062 -GEDIPSRGCIYVFEVVQVVPDPDHPETDRKLKLIGKEPVKGAVTALSEIGGQGFVLVAQ 1120
Query: 790 KIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAI 849
+ L++DG+ + ++ + ++GD +K + Y E +
Sbjct: 1121 GQKCMVRGLKEDGSLLPVAFMDMQCYVSVVKELKGTGMCILGDAVKGVWFAGYSEEPYKM 1180
Query: 850 EERARDYNANWMSAVEILDDDIYLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYH 906
A+D + + A E L D L A+++ N+ ++ + E RL ++H
Sbjct: 1181 SLFAKDLDYLEVCAAEFLPDGKRLFIVVADSDCNIHVLQYDPEDPKSSNGDRLLSRSKFH 1240
Query: 907 LGEFVNRFRHGSLVM----RLPDSDVG-----QIP--TVIFGTVNGVIGVIASLPHEQYL 955
+G F + M ++ S G Q P V+ T NG +G+I +P E Y
Sbjct: 1241 MGNFASTLTLLPRTMVSSEKMVSSSDGMDIDNQSPLHQVLMTTQNGSLGLITCIPEESYR 1300
Query: 956 FLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEIS 1015
L LQ+ L ++ GLN +R+ E + LDG+L+ ++D+S+ R EI+
Sbjct: 1301 RLSALQSQLTNTLEHPCGLNPRAFRAV--ESDGTAGRGMLDGNLLFKWIDMSKQRKTEIA 1358
Query: 1016 KTMNVSVEELCKRVEELT 1033
+ E+ +E ++
Sbjct: 1359 GRVGAREWEIKADLEAIS 1376
>gi|443684051|gb|ELT88095.1| hypothetical protein CAPTEDRAFT_161045 [Capitella teleta]
Length = 1410
Score = 82.8 bits (203), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 95/433 (21%), Positives = 180/433 (41%), Gaps = 58/433 (13%)
Query: 654 IRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAE--------ESEMHFVRLLDD------ 699
+R +PL P + + S+T+++ S + C + E E+ V+ D
Sbjct: 979 VRKVPLRCTPHFVVYHPDSKTYSVVSSQQVPCTQLVRVAGDGEKEIEAVQKDDRFVFPIM 1038
Query: 700 -----QTFEFISTYP-------LDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPE 742
Q F +S P L+ +E+ I + + + + Y VGT
Sbjct: 1039 NKFNIQLFSPVSWEPIPNTRFDLEEWEHVMCIKTINLKSEGTLSGLKGYVVVGTNLNY-N 1097
Query: 743 ENEPTKGRILVFIVED-----------GKLQLIAEKETKGAVYSLNAFNGKLLAAINQKI 791
E+ ++G++ ++ V D K++++ KE KG V +L+ G L+ AI QK+
Sbjct: 1098 EDVSSRGKLTIYDVIDVVPEPGQPLTKNKIKVVYNKEQKGPVTALDGVQGFLVTAIGQKV 1157
Query: 792 QLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEE 851
Y W L+D+ +L I ++ + I++GD+ KSIS+L Y+ + +
Sbjct: 1158 --YIWQLKDN---DLAGIAFIDTQIYIHKMEALKNLIIIGDVCKSISVLRYQEDMKVLSL 1212
Query: 852 RARDYNANWMSAVEILDDDIYLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLG 908
++D + V L D+ L A+ N + + RL + ++G
Sbjct: 1213 VSKDVRPLAVYGVAYLVDETSLAFIVADKLKNFLVYCYQPDLVQSQGGQRLIRKADINIG 1272
Query: 909 EFVNRFRHGSLVMRLP-----DSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTN 963
VN F + P D + + T++G IG + + Y L LQ
Sbjct: 1273 SLVNAFFRVKCRVSDPSTSKTDQSLAMKHITYYVTLDGSIGYLLPISESLYRRLYMLQKM 1332
Query: 964 LRKVIKGVGGLNHEQWRSFNNEKKTV--DAKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
L + ++ GLN + +R+ E + + +N +DGDL +L L+ E++K + +
Sbjct: 1333 LIQQVQQTAGLNPKAYRTCQTEFRQLINIQRNIIDGDLAWKYLALTSHDRAEMAKRIGTT 1392
Query: 1022 VEELCKRVEELTR 1034
++ + E+ R
Sbjct: 1393 SHQIEDDLLEIDR 1405
Score = 57.4 bits (137), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 73/343 (21%), Positives = 142/343 (41%), Gaps = 67/343 (19%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRP 62
D + G ++++E G A+D L + E K + +D ++ +L T ++ D
Sbjct: 71 DYELCGNVSSIESISLVGAARDALLLCFEEAKLSLCDYDPDTDDLKTISLHYFEDA---D 127
Query: 63 TDNG--QIGI------IDPDCRLIGLHLYDGLFKVIPFD--------------NKGQLKE 100
+NG Q G+ +DP+ R + +Y V+PF +K +
Sbjct: 128 LENGCCQRGLHHSEVRVDPEGRCAVMLIYGTHLIVLPFRKESPSDEIDATSCASKSPIMS 187
Query: 101 AFNIRLEEL-----QVLDIKFLYGCAKPTIVVLYQD----------NKDARHVKTYEVAL 145
+ I L L V+DI+FL+G +PT+++LY+ KD + + L
Sbjct: 188 TYIIDLRTLDERVTNVVDIQFLHGYYEPTVLILYEPLPTWTCRVAVRKDTCSIVAISLNL 247
Query: 146 KDKDFVEGP--WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVY---------CSANAFK 194
+DK P WS +NL PVP P+ GV++ +++Y S N+
Sbjct: 248 QDKTH---PIIWSHSNLPYDCLRTFPVPKPIGGVIVFAVNSLLYLNQSFPPYGVSLNSLT 304
Query: 195 AIP----------IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEK-VTGLKIE 243
+ +R S+ A D + ++ G L++L + + + V ++
Sbjct: 305 SFNTEFLLKPQEGVRMSLDCAQAEF-IDNDKLVISLKGGELYVLTLVIDSMRAVRSFHLD 363
Query: 244 LLGETSIASTISYLDNAVVYIGSSYGDSQLIKLN-LQPDAKGS 285
+ + + + + +++GS G+S L++ +P+A S
Sbjct: 364 KAAASVLTTCMCMCGDNYLFLGSRLGNSLLLRYQEKKPEASSS 406
Score = 39.7 bits (91), Expect = 8.5, Method: Compositional matrix adjust.
Identities = 69/278 (24%), Positives = 116/278 (41%), Gaps = 37/278 (13%)
Query: 232 HEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLE 291
KEK+T L I L+G + + S + LD VY DSQ ++ D EV +
Sbjct: 415 QRKEKMT-LAIGLVGSSDV-SKLDDLDELEVY----GRDSQAVE---SEDITQFMFEVCD 465
Query: 292 RYVNLGPI-------VDFCVVDLERQGQG--QVVTCSGAYKDGSLRIVRNGIGINEQASV 342
+N+GP F + Q ++VT SG K+G++ I++ I +
Sbjct: 466 SIINIGPCGQVEMGEPAFLSEEFSHQEDPDLELVTTSGYGKNGAISILQRQIRPQVVTTF 525
Query: 343 ELQGIKGMWSLRSSTDDPFDT---------FLVVSFISETRILAMNLEDELEETEIEGFC 393
EL G +W++ S D+ + FL++S + +L E+ E + GFC
Sbjct: 526 ELPGCTDVWTVLGSPDEQQGSDEKLAGSHAFLLLSRADSSMVLETG--QEIMELDHSGFC 583
Query: 394 SQTQTLFCHDAIYNQ-LVQVTSGSVRLVSSTSR--ELRNEWKSPPGYSVNVATANASQVL 450
+ T+ + + +VQV ++ L+ R L + SP V+ + A+ +L
Sbjct: 584 TDAPTVHAANIGNGRYIVQVGPNAIWLLKGVERIQHLALDVSSP---VVSCSLADPHVLL 640
Query: 451 LATGGG--HLVYLEIGDGILTEVKHAQLEYEISCLDIN 486
L G HLV GD + +L + + IN
Sbjct: 641 LCEDGQLLHLVLSVQGDDPTLSLLTTKLHQKSKVIAIN 678
>gi|301773406|ref|XP_002922132.1| PREDICTED: LOW QUALITY PROTEIN: cleavage and polyadenylation
specificity factor subunit 1-like [Ailuropoda
melanoleuca]
Length = 1469
Score = 82.4 bits (202), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 111/464 (23%), Positives = 186/464 (40%), Gaps = 87/464 (18%)
Query: 618 VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQS 672
+ PF++ P L ++GEL I + +R IPL + + +S
Sbjct: 1015 IDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVES 1074
Query: 673 RTFAICSLKNQSCAE-----------------------ESEMHFVRLLDDQTFEFI--ST 707
+ +A+ + N C + E ++L+ ++E I +
Sbjct: 1075 KVYAVATSTNMPCTRIPRMTGEEKEFETIERDDRYIHPQQEAFSIQLISPVSWEAIPNAR 1134
Query: 708 YPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-----IVE 757
L+ +E+ + + S + V Y GT + EE +GRIL+ + E
Sbjct: 1135 IELEEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVT-CRGRILIMDVIEVVPE 1193
Query: 758 DG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECG 811
G K +++ EKE KG V +L NG L++AI QKI L W LR EL
Sbjct: 1194 PGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFL--WSLR---ASELTGMAF 1248
Query: 812 HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI 871
+ + + +FI+ D+MKSISLL Y+ E + +RD + +V+ + D+
Sbjct: 1249 IDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNA 1308
Query: 872 YLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV 928
LG ++ + NL E RL ++H+G VN F R P
Sbjct: 1309 QLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF------WRTPCRGA 1362
Query: 929 GQIPT-----------VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHE 977
+ P+ F T++G IG++ LP ++ +TN + L H
Sbjct: 1363 AEGPSKKSVVWENKHITWFATLDGGIGLL--LPMQE-------KTNRLQPAXSPRML-HV 1412
Query: 978 QWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
R N +N LDG+L+ +L LS E++K + +
Sbjct: 1413 DRRILQNA-----VRNVLDGELLNRYLYLSTMERGELAKKIGTT 1451
Score = 46.2 bits (108), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 61/326 (18%), Positives = 141/326 (43%), Gaps = 60/326 (18%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAM-----GDVSDRIG 60
+G + ++ + G +D L ++ + K V+++D + +L T ++ ++ D
Sbjct: 112 FFGNVMSMASVQLAGAKRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPELRDGFV 171
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPF-------DNKGQLKEA----------FN 103
+ ++ + DPD R + +Y V+PF +++G + E +
Sbjct: 172 QNVHTPRVRV-DPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLMGEGQRSSFLPSYIID 230
Query: 104 IR-LEE--LQVLDIKFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDFVEG-- 153
+R L+E L ++D++FL+G +PT+++L++ N+ A T + + +
Sbjct: 231 VRGLDEKLLNIVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKVH 290
Query: 154 P--WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAFKAIPI 198
P WS +L + VP P+ GV++ +++Y + A P+
Sbjct: 291 PVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNGLTTGTTAFPL 350
Query: 199 RPSITKAYGRVDADGS--------RYLLGDHAGLLHLL-VITHEKEKVTGLKIELLGETS 249
R T+ R+ D + + ++ G +++L +IT V + +
Sbjct: 351 R---TQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASV 407
Query: 250 IASTISYLDNAVVYIGSSYGDSQLIK 275
+ +++ ++ +++GS G+S L+K
Sbjct: 408 LTTSMVTMEPGYLFLGSRLGNSLLLK 433
>gi|389614684|dbj|BAM20371.1| spliceosomal protein sap, partial [Papilio polytes]
Length = 541
Score = 82.4 bits (202), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 75/296 (25%), Positives = 131/296 (44%), Gaps = 43/296 (14%)
Query: 759 GKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSEC-GHHGHIL 817
GKL+L+ + +L AFNG+L+A + + +++Y D G R+L +C H L
Sbjct: 259 GKLELVHKTPIDDYPGALAAFNGRLMAGVGRMLRMY-----DIGRRKLLRKCENRHIPNL 313
Query: 818 ALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAEN 877
++T G I V D+ +S+ + YK E + A D N W++ +LD D + +
Sbjct: 314 IADIKTIGQRIFVADVQESVFCVKYKKRENQLIIFADDTNPRWITNTCVLDYDT-VAMSD 372
Query: 878 NFNLFTVRKNSEGATDE------------ERGRL-------EVVGEYHLGEFVNRFRHGS 918
F V + +D+ +RG L EVV +H+GE V + +
Sbjct: 373 KFGNVAVMRLPHSVSDDVDEDPTGNKALWDRGLLNGASQKGEVVVNFHVGETVTSLQRAT 432
Query: 919 LVMRLPDSDVGQIPTVIFGTVNGVIGVIASLP---HEQYLFLEKLQTNLRKVIKGVGGLN 975
L+ G +++ T G +G + LP E + + L+ ++R + G +
Sbjct: 433 LI-------PGGSEALLYATAGGALGAL--LPFTSREDHDXFQHLEMHMRSENAPLCGRD 483
Query: 976 HEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEE 1031
H +RS+ KN +DGDL E F L + I+ + + E K++E+
Sbjct: 484 HLSFRSY-----YYPVKNVIDGDLCEQFNSLEPAKQKAIAGDLERTPAEGSKKLED 534
>gi|255581562|ref|XP_002531586.1| spliceosomal protein sap, putative [Ricinus communis]
gi|223528782|gb|EEF30789.1| spliceosomal protein sap, putative [Ricinus communis]
Length = 1220
Score = 82.4 bits (202), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 47/131 (35%), Positives = 73/131 (55%), Gaps = 3/131 (2%)
Query: 295 NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINE--QASVELQGIKGMWS 352
N+ PI+D VVD + Q Q+ C G +GSLRI+R+GI + + + + QG+ G W+
Sbjct: 460 NIAPILDMSVVDHHDEKQEQMFGCCGVAPEGSLRIIRSGISVEKLLRTAPIYQGVTGTWA 519
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
LR D + +F+V+SF+ ETR+L++ + + T+ GF TL C LVQ+
Sbjct: 520 LRMKVTDLYHSFVVLSFVEETRVLSVGVSFA-DVTDSAGFLPNVCTLACGLVGDGLLVQI 578
Query: 413 TSGSVRLVSST 423
+VRL T
Sbjct: 579 HQTAVRLCLPT 589
Score = 58.9 bits (141), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 45/193 (23%), Positives = 90/193 (46%), Gaps = 15/193 (7%)
Query: 573 LGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDS 632
+G P+ L + + A SDRP ++ ++ L Y++++ + +H P S P
Sbjct: 725 IGVTPVFLVPLTDSLDADMIALSDRPWLLQTARHGLSYTSISFQPSTHSTPVCSVECPKG 784
Query: 633 LAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAI--CSLKNQSCAEESE 690
L E L + + ++L+++ LG PR++ + +SR + L N +C+ +
Sbjct: 785 LLFVAENSLHLVEMVHSKRLNVQKFHLGGTPRKVLYHSESRLLLVMRTELSNDTCSSD-- 842
Query: 691 MHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAY-----VLPE-EN 744
+ +D + +S++ L+ E G S+ + + VGT+ ++P E
Sbjct: 843 ---ICCVDPLSGSVVSSFKLEHGETGKSMELVRVGTEQVLV--VGTSLSSGPAIMPSGEA 897
Query: 745 EPTKGRILVFIVE 757
E TKGR++V +E
Sbjct: 898 ESTKGRLIVLCLE 910
>gi|297792471|ref|XP_002864120.1| hypothetical protein ARALYDRAFT_495232 [Arabidopsis lyrata subsp.
lyrata]
gi|297309955|gb|EFH40379.1| hypothetical protein ARALYDRAFT_495232 [Arabidopsis lyrata subsp.
lyrata]
Length = 1444
Score = 81.6 bits (200), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 78/300 (26%), Positives = 133/300 (44%), Gaps = 20/300 (6%)
Query: 702 FEFISTYPLDTFEYGCSI----LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVE 757
+E +T P+ + E+ ++ L + + ++ VGTAYV E+ +GR+L+F
Sbjct: 1099 WETKATIPMQSSEHALTVRVVTLLNASTGENETLLAVGTAYV-QGEDVAARGRVLLFSFG 1157
Query: 758 ---DGKLQLIAE---KETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECG 811
D ++ E +E KGA+ ++ + G LL + KI L+KW +GT
Sbjct: 1158 KNGDNSQNVVTEVYSRELKGAISAVASIQGHLLISSGPKIILHKW----NGTELNGVAFF 1213
Query: 812 HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI 871
+ + + FI++GD+ KSI L +K + + A+D+ + A E L D
Sbjct: 1214 DAPPLYVVSMNVVKTFILLGDVHKSIYFLSWKEQGSQLSLLAKDFGSLDCFATEFLIDGN 1273
Query: 872 YLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV 928
L ++ N+ + A + +L E+H+G V +F L M +D
Sbjct: 1274 TLSLAVSDEQKNIQVFYYAPKMAESWKGQKLLSRAEFHVGSHVTKFLR--LQMVTSGADK 1331
Query: 929 GQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKT 988
++FGT++G G IA L + L+ LQ L + V GLN +R F K
Sbjct: 1332 TNRFALLFGTLDGSFGCIAPLDEVTFRRLQSLQKKLVDAVPHVAGLNPHSFRQFRTSGKA 1391
Score = 75.5 bits (184), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 95/407 (23%), Positives = 159/407 (39%), Gaps = 116/407 (28%)
Query: 103 NIR-LEELQVLDIKFLYGCAKPTIVVLYQDNKDARHVKTYEVALKDKDFV---------- 151
N+R LE V D FL+G +P IV+L ++ H V+ K V
Sbjct: 240 NLRDLEMKHVKDFVFLHGYIEPVIVILQEE----EHTWAGRVSWKHHTCVLSALSINTTL 295
Query: 152 -EGP--WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGR 208
+ P WS NL + A L+ VP P+ GVL++ TI Y S +A A+ + + A
Sbjct: 296 KQHPVIWSAINLPHDAYKLLAVPSPIGGVLVLCANTIHYHSQSASCALALNNYASSADSS 355
Query: 209 VDADGSRY-----------------LLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIA 251
+ S + LL +G L LL + ++ V L + + +A
Sbjct: 356 QELPASNFSVELDAAHGTWISSDVALLSTKSGELLLLTLIYDGRAVQRLDLSKSKASVLA 415
Query: 252 STISYLDNAVVYIGSSYGDSQLI------------------------------KLNLQPD 281
S I+ + N++ ++GS GDS L+ +L + D
Sbjct: 416 SDITSVGNSLFFLGSRLGDSLLVQFSCRSGPAASLPGLRDEDEDIEGEGHQAKRLRISSD 475
Query: 282 -----------------------AKGSY-VEVLERYVNLGPIVDFC----------VVDL 307
A+ S+ V + VN+GP+ DF +
Sbjct: 476 TFQDTIGNEELSLFGSTPNNSDSAQKSFSFAVRDSLVNVGPVKDFAYGLRINADANATGV 535
Query: 308 ERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL-------------- 353
+Q ++V CSG K+G+L ++R + VEL G KG+W++
Sbjct: 536 SKQSNYELVCCSGHGKNGALCVLRQSVRPEMITEVELPGCKGIWTVYHKSSRGHNADSSK 595
Query: 354 RSSTDDPFDTFLVVSFISETRILAMNLEDELEE-TEIEGFCSQTQTL 399
++ +D + +L++S E R + + D L E TE + Q +T+
Sbjct: 596 MAADEDEYHAYLIISV--EARTMVLETADLLTEVTESVDYYVQGRTI 640
>gi|225679191|gb|EEH17475.1| conserved hypothetical protein [Paracoccidioides brasiliensis Pb03]
Length = 1377
Score = 81.6 bits (200), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 98/421 (23%), Positives = 182/421 (43%), Gaps = 46/421 (10%)
Query: 655 RSIPLGEHPRRICHQEQSRTFAICSLK--NQSCAEESEMH-------FVRLLDDQTFEFI 705
R I LGE + + S T+ + + + + E+ E+H V+LL+ +T+ I
Sbjct: 954 RKIGLGEQVDSVEYSSSSETYVLGTSQKVDFKLPEDDEIHPEWRNEESVKLLNPRTWSII 1013
Query: 706 STYPLDTFE----YGCSILSCS-FSDDSNVYYCVGTAYVLPEENEPTKGRILVFIV---- 756
+Y L T E C L S + + VGTA E+ +G I VF V
Sbjct: 1014 DSYQLRTAERVMCVKCLNLEASEITHERKEMIAVGTALTRGEDIA-ARGCIYVFEVIKVV 1072
Query: 757 -------EDGKLQLIAEKETKGAVYSLNAFNGK--LLAAINQKIQLYKWMLRDDGTRELQ 807
+ KL+LIA++E KGA+ SL+ G+ L+AA QK + L++DG+
Sbjct: 1073 PEVDRPETNRKLKLIAKEEVKGAITSLSGIGGQGFLIAAQGQKCIVRG--LKEDGSLLPV 1130
Query: 808 SECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEIL 867
+ ++ L ++GD +K + Y E + ++D + + A + L
Sbjct: 1131 AFMDMQCYVSVLKELKGTGMCIMGDALKGLWFAGYSEEPYKLSLFSKDDGSLQVMAADFL 1190
Query: 868 DDD---IYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFR---HGSLVM 921
+ A+++ N+ ++ + E + RL +H G+F + S++
Sbjct: 1191 PHGKRLFIMVADDDCNIHVLQYDPEDPGSAKGDRLLHRSTFHTGQFASTLTLLPRTSVLS 1250
Query: 922 RLPDSDV--------GQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGG 973
+ P+++ G + V+ + G I +I + Y L LQ+ + ++ G
Sbjct: 1251 QGPEAEANAMDLDSSGPLHQVLVTSETGSIALITPVSEMAYRRLSALQSQMINTLEHPCG 1310
Query: 974 LNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELT 1033
LN +R+ E + + +DGDL++ +LDL R EI+ + V E+ +E +
Sbjct: 1311 LNPRAFRAV--ESDGIGGRGMVDGDLVQKWLDLGTQRKAEIASRVGADVWEIRADLEAIG 1368
Query: 1034 R 1034
+
Sbjct: 1369 K 1369
>gi|83314897|ref|XP_730560.1| multisubunit cleavage/polyadenylation specificity factor subunit A
[Plasmodium yoelii yoelii 17XNL]
gi|23490318|gb|EAA22125.1| CPSF A subunit region, putative [Plasmodium yoelii yoelii]
Length = 863
Score = 81.3 bits (199), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 101/449 (22%), Positives = 179/449 (39%), Gaps = 86/449 (19%)
Query: 552 HLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSS-------------------KNTTHVF 592
+L +++N LT+R+K+++ PI + F+ K + +F
Sbjct: 287 YLKKYIINSDNVLLTNRRKINVCRSPIKFKQFNKVCSEKNKIDINNNNNNTCVKKSNFLF 346
Query: 593 AASDRPTVIYSS-NKKLLYSNVNLKEVSHMCPFNS----AAFPDSLAIAKEGE------- 640
SD P +IYS KK+ S V++K + + FN F + + K+ +
Sbjct: 347 ICSDNPIIIYSDIKKKISLSKVSIKNIFLVDIFNDFDYLNPFHNFNSFKKKNQNNLYFIF 406
Query: 641 -----LTIGTIDDIQKLHIRSIPLGEHPRRIC-HQEQSRTFAICSLKNQSCAEESEMHFV 694
L I +++I + +I+ IP +I H E C + + ++ +
Sbjct: 407 FDGLSLYISHLNEINETYIQKIPFYRTVEKIAYHNESGLLITSCPTEEKHKTNKNLKQII 466
Query: 695 RLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVY---------YCVGTAYVLPEENE 745
+ F +Y + + +Y S + C + + ++Y CVGTA + +E
Sbjct: 467 CFFNPHQNSFKYSYIIPS-KYNVSSI-CIYQINKDIYPNKSNINTLICVGTANINDRVSE 524
Query: 746 PTKGRILVFIVEDGKLQLIAEKE------TKGAVYSLNAFNGKLLAAINQK--------- 790
P+ G I +F + K L K G + L F KL++ IN
Sbjct: 525 PSSGHIYIFFAK-KKANLFEIKHIYTHNINVGGITHLKQFYDKLISTINNTVIYKCVNKK 583
Query: 791 ------------IQLYKWM--------LRDDGTRELQSECGHHGHILALYVQTRGDFIVV 830
I L K++ L +DGT + I++L V ++IVV
Sbjct: 584 LIVVILDISDFLINLDKYVDNTNKPIKLENDGTIVDVASFTPSSWIMSLDVI--ENYIVV 641
Query: 831 GDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEG 890
GD+M S+++L Y + E RDY+ W + V L +L ++ N +K+S
Sbjct: 642 GDIMTSVTILSYDFNNSTLTEVCRDYSNVWCTFVCALSKSHFLVSDMESNFLVFQKSSIR 701
Query: 891 ATDEERGRLEVVGEYHLGEFVNRFRHGSL 919
DE+ +L V ++ G VN+ SL
Sbjct: 702 YNDEDSFKLSRVALFNHGHVVNKMLPVSL 730
>gi|47217773|emb|CAG05995.1| unnamed protein product [Tetraodon nigroviridis]
Length = 1446
Score = 80.9 bits (198), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 112/498 (22%), Positives = 196/498 (39%), Gaps = 106/498 (21%)
Query: 618 VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQS 672
+ PF++ P L K+GEL I + +R IPL + + +S
Sbjct: 943 IESFSPFHNINCPKGFLYFNKQGELRISVLPTYLSYDAPWPVRKIPLRCTVHYVSYHVES 1002
Query: 673 RT-------FAICSLKNQSC-------AEESEMHFV----------------RLLDDQTF 702
+ +A+C+ + C EE E + +L+ ++
Sbjct: 1003 KASLSHCCVYAVCTSVKELCTRIPRMTGEEKEYETIERDERYINPQQDKFSIQLISPVSW 1062
Query: 703 EFISTYPLD--TFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF- 754
E I +D +EY + + + V Y GT + EE +GRIL+
Sbjct: 1063 EAIPNTRIDLEEWEYVTCMKTVALRSQETVSGLKGYIAAGTCLMQGEEVT-CRGRILILD 1121
Query: 755 ----IVEDG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTR 804
+ E G K +++ EKE KG V +L NG L++AI QKI L W+L+D+
Sbjct: 1122 VIEVVPEPGQPLTKNKFKVLYEKEQKGPVTALCHCNGYLVSAIGQKIFL--WVLKDN--- 1176
Query: 805 ELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAV 864
+L + + + +FI+ DLMKS+SLL Y+ E + +RD + ++
Sbjct: 1177 DLTGMAFIDTQLYIHQMMSIKNFILAADLMKSVSLLRYQEESKTLSLVSRDAKPLEVYSI 1236
Query: 865 EILDDDIYLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVM 921
E + D+ LG ++ + NL+ E RL +++ G +N F
Sbjct: 1237 EFMVDNSQLGFLVSDRDKNLYVYMYLPEAKESFGGMRLLRRADFNAGANINTF------W 1290
Query: 922 RLPDS---DVGQIPTVI--------FGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKG 970
R+P + G + F T++G +G++ + + Y L LQ L ++
Sbjct: 1291 RMPCRGALEAGSRKAMTWDNKHITWFATLDGGVGLLLPMQEKTYRRLLMLQNALTTMLSH 1350
Query: 971 VGGLNHEQWRSFNNEKKTV---------------------------DAKNFLDGDLIESF 1003
GLN + +R ++ + KN LDG+L+ +
Sbjct: 1351 HAGLNPKAFRCVGADRTSAAMLSGMLPDFATSVSRMLHCDRRSLQNPVKNILDGELLNKY 1410
Query: 1004 LDLSRTRMDEISKTMNVS 1021
L LS E++K + +
Sbjct: 1411 LYLSMMERSELAKKIGTT 1428
Score = 48.5 bits (114), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 61/301 (20%), Positives = 131/301 (43%), Gaps = 47/301 (15%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPT-- 63
++G + ++E + G +D L ++ + K V+++D + +L T ++ + R T
Sbjct: 74 LFGNVMSMESVQLVGANRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPELRDTLT 133
Query: 64 DNGQIGIID-PDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGCAK 122
D ++G+ + P + ++ D V D K L ++D+KFL+G +
Sbjct: 134 DEQELGVGEGPKSSFLPTYIID----VRELDEK------------LLNIIDMKFLHGYYE 177
Query: 123 PTIVVLYQDNKD------ARHVKTYEVALKDKDFVE-GP--WSQNNLDNGADLLIPVPPP 173
PT+++L++ N+ R + VA+ + P WS +NL ++ VP P
Sbjct: 178 PTLLILFEPNQTWPGRVAVRQAQCSIVAISLNIMQKVHPVIWSLSNLPFDCTQVMAVPKP 237
Query: 174 LCGVLIIGEETIVYCS-------------ANAFKAIPIR----PSITKAYGRVDADG-SR 215
+ GV++ +++Y + N A P+R IT + D +
Sbjct: 238 IGGVVVFAVNSLLYLNQSVPPYGVALNSLTNGTTAFPLRLQDEVKITLDCSQADFIAYDK 297
Query: 216 YLLGDHAGLLHLL-VITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLI 274
++ G +++L +IT V + + + + + ++ +++GS G+S L+
Sbjct: 298 MVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLTTCMVTMEPGYLFLGSRLGNSLLL 357
Query: 275 K 275
K
Sbjct: 358 K 358
Score = 42.7 bits (99), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 42/166 (25%), Positives = 72/166 (43%), Gaps = 35/166 (21%)
Query: 288 EVLERYVNLGPIVDFCVVD---LERQGQG------QVVTCSGAYKDGSLRIVRNGIGINE 338
EV + +N+GP + + + L + QG +VV CSG K+G+L +++ I
Sbjct: 430 EVCDSILNIGPCANASMGEPAFLSEEFQGNPEPDLEVVVCSGHGKNGALSVLQRSIRPQV 489
Query: 339 QASVELQGIKGMWSLRSS-----------------------TDDPFDTFLVVSFISETRI 375
+ EL G MW++ S+ D FL++S T I
Sbjct: 490 VTTFELPGCHDMWTVISNEVKEDKKVPQSPGSFTATHYSLEEDTKKHGFLILSREDSTMI 549
Query: 376 LAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQ-LVQVTSGSVRLV 420
L E+ E + GF +Q T+F + N+ ++QV+ +RL+
Sbjct: 550 LQTG--QEIMELDTSGFATQGPTVFAGNIGDNKYIIQVSPMGIRLL 593
>gi|384487281|gb|EIE79461.1| hypothetical protein RO3G_04166 [Rhizopus delemar RA 99-880]
Length = 1468
Score = 80.5 bits (197), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 134/583 (22%), Positives = 232/583 (39%), Gaps = 117/583 (20%)
Query: 6 IYGRIATLELFRPHG-EAQDFLFIATERYKFCVLQWDAESSELITRAMGDVS-DRIGRP- 62
I + T+ P G E D L + K +L+W + ++ +IT ++ D +
Sbjct: 111 IITTMGTVRTNSPRGREGCDSLLLGFSDAKMSLLEWSSSTNSIITVSIHYYERDEFKKEF 170
Query: 63 -TDNGQIGI-IDPDCRLIGLHLYDGLFKVIPFDNKGQLKE----------------AFNI 104
T+ I IDP R + YD V+PF +L E +F I
Sbjct: 171 LTNPYPSAIHIDPQQRCAVFNFYDNKLAVLPFRQSDKLDERQGEGEEDEEKWPYYPSFII 230
Query: 105 RLEEL-----QVLDIKFLYGCAKPTIVVLYQ----------DNKDARHVKTYEVALKDKD 149
L L V+D+ FL +PT+ +L+Q +NKD + + + K
Sbjct: 231 DLATLDSRIKNVIDMTFLSDYYEPTLAILFQPEQTWTGRLGNNKDTVSLVVISLDITAKI 290
Query: 150 FVEGP--WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANA--------------- 192
+ P +S + L L+ +P P+ G+L+I +I++ S +
Sbjct: 291 Y---PIIYSIDKLPYDCFKLVAMPKPVTGMLVIAANSILHVSQGSPGMGVAVNGYTKKTT 347
Query: 193 -FKAIPIRPSITK-----------AYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGL 240
F + PS+ + A+G G R L+ G L+ + + KV G+
Sbjct: 348 DFPGMIYEPSLIELGLSLEGAKALAFG-----GDRCLIFMQNGHWALVEVRRDGNKVVGM 402
Query: 241 KIELLGE---------------TSIASTISYLDNA----VVYIGSSYGDSQLIK-----L 276
I + +AS S + N ++GS GDS LIK +
Sbjct: 403 AISEIKHDLPVMEKKPPRFDTPPLLASVPSCVTNVKAGEYFFLGSRVGDSLLIKYDANRV 462
Query: 277 NLQPDAKGSYVEVLERYVNLGPIVDFCVVDL------ERQGQGQVVTCSGAYKDGSLRIV 330
N Q A + V + +N GPIVD V D+ E Q ++V+ SG K+G+L +
Sbjct: 463 NHQSVAPPVF-RVCDTMLNTGPIVDMAVGDVDTVEQQEDWPQLELVSSSGHGKNGALCVF 521
Query: 331 RNGIGINEQASVELQGIKGMWSLR-------SSTDDPFDTFLVVSFISETRILAMNLEDE 383
+ I + + +WS++ + DD FD L +S T +L+ DE
Sbjct: 522 QRHIYPQTSFAFHQFDSQAIWSIKCRKNDQQQNEDDDFDKLLFISKSKSTLVLSAG--DE 579
Query: 384 LEETEIEGFCSQTQTLFCHDAI-YNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVA 442
L+E + GF ++ T+ ++VQV + V +++ + ++ P G + A
Sbjct: 580 LQEVKT-GFYTRGSTIAVSTLFDATRIVQVYATGVMVLTPEGKRIQT-VPIPRGAKIVEA 637
Query: 443 TANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDI 485
+ + +LL ++ L+ GD ++ H QL I + I
Sbjct: 638 SIHDPYILLTLDNNKILALQ-GDASTKDIIHIQLPNHIKDVGI 679
Score = 76.6 bits (187), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 97/372 (26%), Positives = 156/372 (41%), Gaps = 50/372 (13%)
Query: 688 ESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPE 742
E E + L+ T+E + + FE C L C+ D YY + L
Sbjct: 1099 EMEQFSMILVSPVTWEIVDKVEFEEFEQ-CFSLECALLDSKQTSTGRKYYMIIGTGTLKG 1157
Query: 743 ENEPTKGRILVF-IVE----------DGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKI 791
E+ KG I ++ I+E + K + + ++ KGAV ++ +G L A I K+
Sbjct: 1158 EDTTMKGSIRMYDIIEVVPEPDNPQTNHKFKPVLTEDVKGAVTAMCTVSGHLAACIGSKV 1217
Query: 792 QLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEE 851
+ W L DD L I + + +FI++GD KSI L ++ E +
Sbjct: 1218 IV--WSLEDD--ERLVGVAFIDVQIYVTSMSSIKNFILIGDAQKSIWFLGFQLEPAKLTL 1273
Query: 852 RARDYNANWMSAVEILDDD--IYL--GAEN-NFNLFTVRK---NSEGATDEERGRLEVVG 903
+DY + + V+ + DD +YL G N N +L+ S G +L G
Sbjct: 1274 LGKDYQSFDVGCVDFIIDDKSLYLIVGDTNENIDLYQYAPFNLQSFGGQ-----KLMRRG 1328
Query: 904 EYHLGEFVNRFRHGSLVMRLPD-------SDVGQIPTVIFGTVNGVIGVIASLPHEQYLF 956
++H+G V ++RLP + + + GT NG I VI+S+ + +
Sbjct: 1329 DFHVGSQVQ------TMVRLPQIEKTEKGFEYSRRHFCLCGTFNGSIAVISSISEKTFKR 1382
Query: 957 LEKLQTNLRKVIKGVGGLNHEQWRSFNNEKK---TVDAKNFLDGDLIESFLDLSRTRMDE 1013
L L +L ++ V GLN +R K+ T K LDGDLI F LS E
Sbjct: 1383 LNTLYGHLVNNLQHVAGLNPRAFRLIKGPKQRMSTNRTKAVLDGDLIFEFAGLSIEEQKE 1442
Query: 1014 ISKTMNVSVEEL 1025
+K + +V +
Sbjct: 1443 TTKQIGTTVTRI 1454
>gi|240280498|gb|EER44002.1| pre-mRNA-splicing factor rse1 [Ajellomyces capsulatus H143]
Length = 305
Score = 80.5 bits (197), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 76/306 (24%), Positives = 145/306 (47%), Gaps = 35/306 (11%)
Query: 747 TKGRILVF-IVEDGK-LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTR 804
T G I ++ E+GK L+ I + + + +L F G+LLA I +++Y ++ R
Sbjct: 9 TAGFIHIYRFQEEGKELEFIHKTKVEQPPMALLGFQGRLLAGIGTDLRIYDLGMKQ-MLR 67
Query: 805 ELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAV 864
+ Q+ H L + +QT+G I+V D+ +S++ ++YK++E + D + W +
Sbjct: 68 KCQASVVPH---LVVGLQTQGSRIIVSDVQESLTYVVYKYQENRLIPFVDDVISRWTTCT 124
Query: 865 EILDDDIYLGAENNFNLFTVR---KNSEGATDEERG---------------RLEVVGEYH 906
++D + G + NL+ +R K SE A ++ G RL +V ++
Sbjct: 125 TMVDYETVAGGDKFGNLWLLRCPAKASEEADEDGSGAHLIHERQYLQGAPNRLNLVAHFY 184
Query: 907 LGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRK 966
+ + LV D V T + GTV+ +I I+ E+ F + L+ L
Sbjct: 185 PQDLPTSIQKAQLVTGGRDILVW---TGLQGTVSMLIPFIS---REEVDFFQSLEMQLAA 238
Query: 967 VIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELC 1026
+ G +H +RS+ AK +DGDL E++L L + +I+ ++ SV E+
Sbjct: 239 QNPPLAGRDHLIYRSY-----YAPAKGTIDGDLCETYLLLPNDKKQQIAGELDRSVREIE 293
Query: 1027 KRVEEL 1032
+++ ++
Sbjct: 294 RKIADM 299
>gi|295665178|ref|XP_002793140.1| cleavage and polyadenylation specificity factor subunit A
[Paracoccidioides sp. 'lutzii' Pb01]
gi|226278054|gb|EEH33620.1| cleavage and polyadenylation specificity factor subunit A
[Paracoccidioides sp. 'lutzii' Pb01]
Length = 1408
Score = 80.1 bits (196), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 100/431 (23%), Positives = 185/431 (42%), Gaps = 56/431 (12%)
Query: 655 RSIPLGEHPRRICHQEQSRTFAICSLK--NQSCAEESEMH-----------------FVR 695
R I LGE + + S T+ + + + + E+ E+H V+
Sbjct: 977 RKIGLGEQVDSVEYSSSSETYVLGTSQKVDFKLPEDDEIHPEWRNEVISFFPQIDKGSVK 1036
Query: 696 LLDDQTFEFISTYPLDTFE----YGCSILSCS-FSDDSNVYYCVGTAYVLPEENEPTKGR 750
LL+ +T+ I +Y L T E C L S + + VGTA E+ +G
Sbjct: 1037 LLNPRTWSIIDSYQLRTSERVMCVKCLNLEASEITHERKEMIAVGTALTRGEDIA-ARGC 1095
Query: 751 ILVFIV-----------EDGKLQLIAEKETKGAVYSLNAFNGK--LLAAINQKIQLYKWM 797
I VF V + KL+LIA++E KGA+ SL+ G+ L+AA QK +
Sbjct: 1096 IYVFEVIKVVPEVDRPETNRKLKLIAKEEVKGAITSLSGIGGQGFLIAAQGQKCIVRG-- 1153
Query: 798 LRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYN 857
L++DG+ + ++ L ++GD +K + Y E + ++D
Sbjct: 1154 LKEDGSLLPVAFMDMQCYVSVLKELKGTGMCIMGDALKGLWFAGYSEEPYKLSLFSKDDG 1213
Query: 858 ANWMSAVEILDDD--IY-LGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRF 914
+ + A + L D +Y + A+++ N+ ++ + E + RL +H G+F +
Sbjct: 1214 SLQVMAADFLPDGKRLYIMVADDDCNIHVLQYDPEDPGSAKGDRLLHRSTFHTGQFASTL 1273
Query: 915 R---HGSLVMRLPDSDV--------GQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTN 963
S++ + P+++ G + V+ + G I +I + Y L LQ+
Sbjct: 1274 TLLPRTSVLSQGPETEANAMDLDLSGPLHQVLVTSETGSIALITPVSEMAYRRLSALQSQ 1333
Query: 964 LRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVE 1023
+ ++ GLN +R+ E + + +DGDL++ +LDL R EI+ + V
Sbjct: 1334 MINTLEHPCGLNPRAFRAV--ESDGIGGRGMVDGDLVQKWLDLGTQRKAEIASRVGADVW 1391
Query: 1024 ELCKRVEELTR 1034
E+ +E + +
Sbjct: 1392 EIRADLEAIGK 1402
>gi|154285962|ref|XP_001543776.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
gi|150407417|gb|EDN02958.1| conserved hypothetical protein [Ajellomyces capsulatus NAm1]
Length = 1283
Score = 79.7 bits (195), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 106/434 (24%), Positives = 182/434 (41%), Gaps = 61/434 (14%)
Query: 655 RSIPLGEHPRRICHQEQSRTFAICSLKNQ----SCAEESEMH-----------------F 693
R I LGE + + S T+ I + NQ + E+ E+H
Sbjct: 851 RKIGLGEQVDAVEYSSSSETYVIGT--NQKVDFNLPEDDEIHPEWRNEVISFLPQIDKGS 908
Query: 694 VRLLDDQTFEFISTYPLDTFE----YGCSILSCS-FSDDSNVYYCVGTAYVLPEENEPTK 748
V+LL +T+ I +Y L T E C L S + + VGTA + E+ +
Sbjct: 909 VKLLTPRTWSIIDSYNLRTAERIMCVKCLNLEVSEITHERKDTIVVGTA-LTKGEDIAAR 967
Query: 749 GRILVFIV-----------EDGKLQLIAEKETKGAVYSLNAFNGK--LLAAINQKIQLYK 795
G I +F V + KL+LIA++E KGAV SL+ G+ L+AA QK +
Sbjct: 968 GCIYIFEVIEVVPEVDRPETNRKLKLIAKEEVKGAVTSLSGIGGQGSLIAAQGQKCIVRG 1027
Query: 796 WMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARD 855
L++DG+ + ++ L ++GD +K + Y E + ++D
Sbjct: 1028 --LKEDGSLLPVAFMDMQCYVNVLKELKGTGMCIMGDALKGLWFAGYSEEPYKLSLFSKD 1085
Query: 856 YNANWMSAVEILDDD--IY-LGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVN 912
+ A + L D +Y L A+++ N+ ++ + E + RL + G F +
Sbjct: 1086 DGTLQVMAADFLPDGNRLYILVADDDCNIHVLQYDPEDPGSSKGDRLLHRSTFQTGHFAS 1145
Query: 913 RF----RHGSLVMRLPDSD--------VGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKL 960
R + + PD+D G + V+ + G I +I + Y L L
Sbjct: 1146 TMTLLPRTATSSSQRPDADPDMMDLDSSGPLHHVLVTSETGSIALITPVSETSYRRLSAL 1205
Query: 961 QTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV 1020
Q+ L ++ GLN +R+ E + + +DGDL++ +LDL R EI+ +
Sbjct: 1206 QSQLTNTLEHPCGLNPRAFRAV--ESDGIGGRGMVDGDLVKRWLDLGTQRKAEIANRVGA 1263
Query: 1021 SVEELCKRVEELTR 1034
V E+ +E + +
Sbjct: 1264 DVWEIRADLEAIGK 1277
>gi|294654658|ref|XP_456718.2| DEHA2A08932p [Debaryomyces hansenii CBS767]
gi|218511767|sp|Q6BYK1.2|RSE1_DEBHA RecName: Full=Pre-mRNA-splicing factor RSE1
gi|199429049|emb|CAG84677.2| DEHA2A08932p [Debaryomyces hansenii CBS767]
Length = 1256
Score = 79.3 bits (194), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 83/306 (27%), Positives = 133/306 (43%), Gaps = 52/306 (16%)
Query: 761 LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTREL----QSECGHHGHI 816
L+ + + E ++ FNG+LL ++ ++LY D G R+L S + +I
Sbjct: 963 LEFVHKTELDYQPTAIIPFNGRLLVGMSNFLRLY-----DLGQRQLLRKASSNIEYLKNI 1017
Query: 817 LALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAE 876
+ L Q G IVVGD S + + Y E A D ++A+ LD D +G +
Sbjct: 1018 IRLTHQG-GSRIVVGDSSMSTTFVKYDSTENQFIPFADDIMKRQITALVTLDYDTIIGGD 1076
Query: 877 NNFNLFTVR----------------KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLV 920
N+F R + E + RL+ + E++L + F GSLV
Sbjct: 1077 KFGNIFVSRVPETISQQSDKDWSLLRYQESYLNGSGSRLKNICEFYLQDIPTSFTKGSLV 1136
Query: 921 MRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYL-FLEKLQTNLRKVI--------KGV 971
M G ++I+ + G +G++ L E + FL LQ LRK K
Sbjct: 1137 M-------GGKESIIYTGIQGTLGLLLPLSTENEVKFLGDLQLLLRKYFDYNFDDFDKDK 1189
Query: 972 GGLN-----HEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELC 1026
G N H ++RS+ N KN +DGDLIE F +LS++ I +N + E+
Sbjct: 1190 NGYNLLGKDHLKFRSYYNP-----VKNVMDGDLIERFYELSQSMKIRIGTELNRTPREIE 1244
Query: 1027 KRVEEL 1032
K++ E+
Sbjct: 1245 KKISEM 1250
Score = 63.5 bits (153), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 141/680 (20%), Positives = 264/680 (38%), Gaps = 90/680 (13%)
Query: 7 YGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDNG 66
+G I ++ R G +D++ I ++ K VL++D E + + S R T G
Sbjct: 67 FGIIQGIDKIRLVGTQKDYVVITSDSGKLVVLEFDIEKLQFVPLFQEPHSKNGLRRTSPG 126
Query: 67 QIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLK-----EAFNIRLEELQV--LDIKF- 116
+ +DP R LIG + L + +++G+L+ E F+ LQ+ +D F
Sbjct: 127 EYLCVDPHNRAILIGAIEKNKLVYKVQSNDEGKLELSSPLETFSKHTLTLQICAMDTGFE 186
Query: 117 --LYGCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEG------PWSQNNLDNGADLLI 168
++ + QDN + + L + +G S + + LI
Sbjct: 187 NPMFAAIECDYNARQQDNGEEEDAGEASLLLNYYELDQGLNHVVKHKSNEKIPGSSSHLI 246
Query: 169 PVPPPLCGVLIIGEETIVYCSANAFK---AIPIRPS------ITKAYGRVDADGSRYLLG 219
P+P + G+L+ + TI+Y + K IPIR + + R+ + L+
Sbjct: 247 PLPDFIGGLLVCSKSTIIYAHPSKDKLYLPIPIRSNTNETLIVNHVIHRLKKNNFFILVQ 306
Query: 220 DHAGLLHLLVITHEK--EKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDS---QLI 274
G + I H++ E + + I ++ +++ + ++ + + Q
Sbjct: 307 SQLGDCFKITIDHDEVNESIENINITYFDTIPLSQSLNIFKSGFLFANVATNNKLFYQFE 366
Query: 275 KL-------NLQPDAKGSYVEVLERYVN--------------------LGPIVDFCVVDL 307
KL LQ Y + E ++ L PI D +++
Sbjct: 367 KLGDDNNNTTLQSCNFSDYNSIFELDISKRSFKVAGLENLALVDIMETLNPITDGALIET 426
Query: 308 ERQGQGQVVTCSGAYKDGS-LRIVRNGIGINEQASVELQGIK--GMWSLRSSTDDPFDTF 364
R A S L+ + +GI N S L IK + + R + D +
Sbjct: 427 LRPEVPDPFKQLTALSSHSYLKTLTHGISTNTVVSSPLP-IKPTAIHTTRIFAESANDEY 485
Query: 365 LVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTS 424
LV+S ++ L +++ + +EE F + T+ + +VQ+ S +R + T
Sbjct: 486 LVISSTLSSQTLVLSIGEVVEEVNDSQFVTNEPTINVQQVGKSSVVQIYSNGIRHIKHTM 545
Query: 425 R-----ELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIG--DGILTEVKHAQLE 477
R + +W P G S+ A+ N QV++ + Y EI D L E + +LE
Sbjct: 546 RNDTIEKKYTDWYPPAGISIIQASTNNEQVIIGLSNREICYFEIDPHDDQLVEYQE-RLE 604
Query: 478 YE--------ISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPD---LNLITKEHLGG 526
IS I+ + SY A VG +D +++ SL L ++T + L
Sbjct: 605 MSGGSISALAISSSSISKLQRKSSY---AIVG-CSDETIQAISLKPHNCLEIVTLQALSA 660
Query: 527 EIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSK 586
S+ + + + + +G + ++ TG+L+D + LG++P+ L
Sbjct: 661 N--SSSIAMVPHGYSTSVHIGMENGLYVRVTIDEITGKLSDTRIQFLGSKPVQLSVIGLP 718
Query: 587 NTTH--VFAASDRPTVIYSS 604
+ A S RP + Y S
Sbjct: 719 QLQQNGLLAISSRPWIGYYS 738
>gi|156084934|ref|XP_001609950.1| splicing factor 3b, subunit 3, 130kD [Babesia bovis T2Bo]
gi|154797202|gb|EDO06382.1| splicing factor 3b, subunit 3, 130kD, putative [Babesia bovis]
Length = 1169
Score = 79.3 bits (194), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 61/277 (22%), Positives = 121/277 (43%), Gaps = 22/277 (7%)
Query: 760 KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILAL 819
+++L+ G V + + G+LLA++ ++I+LY + + C HG I
Sbjct: 906 EIRLLHVTRVGGVVRAFTGYEGRLLASVGKRIRLYALGKKQLLLKAEHRTCSDHGFI--- 962
Query: 820 YVQTRGDFIVVGDLMKSISLL---IYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAE 876
++ G I GD+ + I +L Y E E W+++ LD + +
Sbjct: 963 WLNAVGSRIFAGDIREGIQILRIKFYSEEAAEFEWVGGATGPRWLTSCAQLDYSTVIAGD 1022
Query: 877 NNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIF 936
++F R E +T +LE V ++HLG+ +L V++
Sbjct: 1023 KFDSIFVTRVPQEEST--RHIQLENVCQFHLGDLPTAMDKAALSQ--------STHVVLY 1072
Query: 937 GTVNGVIGVIASLPHEQYL-FLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFL 995
GTV G IG + + L FL+ L+ + + G H +RS+ V + +
Sbjct: 1073 GTVMGSIGALVPFQSKDELDFLQHLEMLMATEAPPLCGREHSFYRSY-----YVPVQQVV 1127
Query: 996 DGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
DGDL E F L+ + ++++ ++ +V + ++++++
Sbjct: 1128 DGDLCEQFRHLTEAQQRKVAQQLDTTVNNVLRKLDDI 1164
>gi|403224220|dbj|BAM42350.1| splicing factor 3b subunit [Theileria orientalis strain Shintoku]
Length = 1272
Score = 79.3 bits (194), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 146/670 (21%), Positives = 274/670 (40%), Gaps = 103/670 (15%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
++G + + FR G +D+L + ++ K +L++ E + R + + G R
Sbjct: 58 VFGIVRAISAFRLTGSQKDYLVVGSDSGKLVILEFSLEL-KTFKRVHCETYGKTGVRRIV 116
Query: 65 NGQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YG 119
GQ +DP R ++G ++ D+K L + + + + + G
Sbjct: 117 PGQYLGVDPKGRAVMVGAVERQKFVYIMNRDSKANLTISSPLEAHKSHSVCFDLVGLEVG 176
Query: 120 CAKPTIVVLYQ--DNKDARHVK-----TYEVALKDKDFVEGPWSQNNL--------DNGA 164
P + Q +N D+ + T E K F E N++ D A
Sbjct: 177 FENPIFASIEQSYENVDSLQIDLDEELTDEALKKGVSFWEMDLGLNHVVKKVTLPVDLTA 236
Query: 165 DLLIPVPPPLCGVLIIGEETIVYCS---ANAFKAIPIRPSITK-------AYG-RVDADG 213
+L+PVP GV++ E +VY + + + A P R +++ +Y D
Sbjct: 237 HMLVPVPGGPGGVIVCCENFLVYKNLDHGDVYCAYPRRLEVSEHAKLLITSYAVHKMKDF 296
Query: 214 SRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQL 273
LL G L+ + + ++ +V + + +A+++ L + +++ S +GD +
Sbjct: 297 FFILLQSEYGDLYKVDLNYDDAQVKEIVVRYFDTVELATSMCILRSGYLFVASEFGDHHV 356
Query: 274 IKL-------------NLQPDAKGSYVEVLERYV----------NLGPIVDFCVVDLERQ 310
+ +L P +K + + R + +L IVD VVD+
Sbjct: 357 YQFTDLGSNEKDPMCTSLHPHSKSAIIAFKPRVLQNLYETDKLPSLSSIVDMKVVDVMGT 416
Query: 311 GQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTDDPFDTFLVVSF 369
G + + G + + L+ +R G+ E A EL G + +++++S + +D F++VSF
Sbjct: 417 GDYEFIMGCGRWYNSRLKSLRYGLSTEELAFNELPGRPRAVFTIKS-LESNYDEFIIVSF 475
Query: 370 ISETRILAMNLEDELEETEIEGFCSQTQTLF-CHDAIYNQ----------------LVQV 412
T +L++ + +EE F + TL C+ + Y+ VQV
Sbjct: 476 QGNTLVLSIG--EAVEEVTDSFFLTSITTLHSCYMSNYHATESLEGRFEGGVSDGIFVQV 533
Query: 413 TSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVK 472
R + ++ EWK P V +A N +Q+LL GG +VY E+ D L EV
Sbjct: 534 HDSGFRY---SHGQVVKEWKVPSTKRVKLADNNLNQLLLVLSGGEVVYFELVDNDLEEVA 590
Query: 473 HAQLEYEISCLDI-----NPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGG 526
L EI+C+ + + GE + VG +I +R+ L L + + + L
Sbjct: 591 KRNLSTEITCVALQHTPASKAGERVRKGEFCCVGSIDNI-IRVLKLDKTLKMCSSQLLSN 649
Query: 527 EIIPRSVLLCAFEG---------------ISYLLCALGDGHLLNFLLNMKTGELTDRKKV 571
+P S+ L E SYL L +G L+ ++ G L+D++
Sbjct: 650 NALPESLALVTAETGSGSGGGSSGGLGESESYLYVGLNNGILIRNNVD-SLGNLSDQESR 708
Query: 572 SLGTQPITLR 581
+GT+ + L+
Sbjct: 709 FMGTKAVKLK 718
Score = 72.8 bits (177), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 70/282 (24%), Positives = 126/282 (44%), Gaps = 31/282 (10%)
Query: 761 LQLIAEKETKGAVYSLNAFNGKLL-AAINQKIQLYKWMLRDDGTRELQSECGHHGHILAL 819
++L+ TKG + N + KLL AI ++++Y G ++L + H L
Sbjct: 1007 IKLLHVTNTKGWIRCFNNYENKLLLCAIGTRLRMYAL-----GKKQLLLKGEHRS--LTN 1059
Query: 820 Y----VQTRGDFIVVGDLMKSISLL---IYKHEEGAIEERARDYNANWMSAVEILDDDIY 872
Y ++ G I GD+ +S+ LL Y E G E A W+S +E+LD
Sbjct: 1060 YGFMDIKVIGSRIYCGDIRESVQLLRIKFYGEETGEFEMTATSTGPRWLSTMELLDYSTV 1119
Query: 873 LGAENNFNLFTVR-KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQI 931
+ A+ ++F R N+E E ++HL + V L +S G++
Sbjct: 1120 MAADKFDSVFVARVPNNEDVV--RNNYFEYQNQFHLSDLVTSLSKVKL-----NSVYGEL 1172
Query: 932 PTVIFGTVNGVIGVIASLPHEQYL-FLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVD 990
V++ T+ G +G + + + FL+ L+ L + + G + RS+
Sbjct: 1173 --VVYSTILGSLGAFVTFTSKDEVDFLQHLEMLLANELDTLSGREAQMHRSY-----YFP 1225
Query: 991 AKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
+N +DGDL E + +LS +I+ +NV V E+ K+++ +
Sbjct: 1226 VQNVIDGDLCELYFNLSSDLKAKIANQLNVKVAEVVKKLKNI 1267
>gi|12805469|gb|AAH02210.1| Ddb1 protein, partial [Mus musculus]
Length = 86
Score = 79.3 bits (194), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 41/86 (47%), Positives = 53/86 (61%), Gaps = 10/86 (11%)
Query: 961 QTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN- 1019
Q L KVIK VG + H WRSF+ E+KT A F+DGDLIESFLD+SR +M E+ +
Sbjct: 1 QNRLNKVIKSVGKIEHSFWRSFHTERKTEPATGFIDGDLIESFLDISRPKMQEVVANLQY 60
Query: 1020 ---------VSVEELCKRVEELTRLH 1036
+ ++L K VEELTR+H
Sbjct: 61 DDGSGMKREATADDLIKVVEELTRIH 86
>gi|339253000|ref|XP_003371723.1| cleavage and polyadenylation specificity factor subunit 1
[Trichinella spiralis]
gi|316967988|gb|EFV52332.1| cleavage and polyadenylation specificity factor subunit 1
[Trichinella spiralis]
Length = 1376
Score = 79.0 bits (193), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 77/308 (25%), Positives = 128/308 (41%), Gaps = 29/308 (9%)
Query: 739 VLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWML 798
V+PE + P + KL+++ KE KG V SL + G LL + QK+ Y W
Sbjct: 1081 VVPEPDRP---------MTKFKLKVVYSKEQKGPVTSLCSLRGYLLTGMGQKV--YIWQY 1129
Query: 799 RDDGTRELQSECGHHGHILALYVQTRGD---FIVVGDLMKSISLLIYKHEEGAIEERARD 855
+D+ G L +YV + D +SLL Y+ E A+ +RD
Sbjct: 1130 KDNAL------VGISFLDLQVYVHQMASIRYLALTADAFFGVSLLRYQEEYKALSLVSRD 1183
Query: 856 YNANWMSAVEILDDDI---YLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVN 912
+ + AVE L D +L ++ T E RL +YH G VN
Sbjct: 1184 PRPDEVLAVEFLVDRTDLSFLMTSAAGDILTYVYLPESLDSFGGQRLVPQADYHFGSQVN 1243
Query: 913 RFR----HGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVI 968
F H + +V Q +IF + +G + + LP +Y L LQ+ L ++
Sbjct: 1244 AFVRMRCHAQEIAGRKRQEVLQRQGLIFASSDGSVNYLLPLPEREYRLLGMLQSLLIDML 1303
Query: 969 KGVGGLNHEQWRS--FNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELC 1026
GLN + +R+ F N KN +DG++ +L + + ++I + + S ++
Sbjct: 1304 PSFAGLNVDDYRTVRFPNSCLREPTKNIIDGNICMLYLYIDALQQEDIVRQIGSSHSQIM 1363
Query: 1027 KRVEELTR 1034
+ + R
Sbjct: 1364 LELAYMER 1371
Score = 63.2 bits (152), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 80/335 (23%), Positives = 140/335 (41%), Gaps = 76/335 (22%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
+YGR+A++ + R G D + +A + K V+ +D E+ L+T +M D D
Sbjct: 75 VYGRLASVAIARLTGFQLDVILLAIDDAKLSVVGYDIETHSLVTLSMHYYED------DL 128
Query: 66 GQIGI----------IDPDCRLIGLHLYDGLFKVIPF------------DNKGQLKEAFN 103
++G +DP+ R + +Y V+P D + F+
Sbjct: 129 FKLGFTRFEIPPMLRMDPERRCAAMTIYGAHLVVLPLVRESLYESMNIVDPSQRPGWPFS 188
Query: 104 IRLEEL------------QVLDIKFLYGCAKPTIVVLYQDNKDAR-----HVKTYE---V 143
+RL V D+ FL+G +PT+++LY+ + TY+ V
Sbjct: 189 LRLTSYTVAFNAIDAKMHNVTDMCFLHGFYEPTVLLLYEPTQTTAGRVVVRQDTYQILAV 248
Query: 144 ALKDKDFVEGP-WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYC-------------S 189
+L KD W+ NL A L+ +P PL GVL+ +I+Y +
Sbjct: 249 SLNPKDKTHAVIWTLGNLPFDAFALLALPKPLGGVLLFSVNSIIYLNQSVPCCGILINDN 308
Query: 190 ANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHLL------VITHEKEKVTGLKIE 243
F P+R ++ V DGS L D A +L V++ +++ +K
Sbjct: 309 GRGFTNYPLR---DRSELMVTLDGSHAALIDSANAALVLRSGLVFVVSLLFDRLNMVKEI 365
Query: 244 LLGETSI----ASTISY-LDNAVVYIGSSYGDSQL 273
LL +S+ ST+S + + +++GS+ G+S L
Sbjct: 366 LLTASSVRGAAPSTVSACVSSNCLFVGSAIGNSAL 400
>gi|324499955|gb|ADY39993.1| Cleavage and polyadenylation specificity factor subunit 1 [Ascaris
suum]
Length = 1434
Score = 78.6 bits (192), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 73/306 (23%), Positives = 146/306 (47%), Gaps = 25/306 (8%)
Query: 741 PEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRD 800
PE +PT +++ + +KE KG V SL + NG LLA + QK+ + W+ RD
Sbjct: 1139 PEPGQPTSKH---------RIKTLYDKEQKGPVTSLCSCNGYLLAGMGQKV--FIWLFRD 1187
Query: 801 DGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNA-- 858
+ + + S H +I L V R + + D+ +S++LL Y+ E A+ +RD A
Sbjct: 1188 NNLQGI-SFLDMHFYIHQL-VGVR-NLALACDIYRSVALLRYQEEYKALSLASRDMRAVV 1244
Query: 859 -NWMSAVEILDDD--IYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFR 915
M+A ++D+ ++ ++ N+ E RL + E ++G VN F
Sbjct: 1245 QPPMAAQFLIDNRQMAFIMSDEAANIAVFNYLPEALESSGGERLILRSEINIGTNVNSFM 1304
Query: 916 H--GSLVMRLPDSDVGQI--PTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGV 971
G + +++ + +V+F +++G G + L + + L LQ + ++
Sbjct: 1305 RVKGHISSGFVENEHYSLNRQSVLFCSLDGSFGFVRPLSEKVFRRLHMLQQLMSSLVAQA 1364
Query: 972 GGLNHEQWRSFNNEKKT--VDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRV 1029
GLN + R+ ++ ++ +N +DGD++ +L LS +++++ + S + +
Sbjct: 1365 AGLNVKGSRAARPQRPNHYLNTRNMVDGDVVFQYLHLSLADKNDLARKLGTSRYHIIDDL 1424
Query: 1030 EELTRL 1035
E++RL
Sbjct: 1425 TEISRL 1430
Score = 64.7 bits (156), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 107/511 (20%), Positives = 193/511 (37%), Gaps = 93/511 (18%)
Query: 1 MLDVPIYGRIATLELFR-PHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRI 59
++ V + + +L + R P + L + + K V+ + A L T ++ + +
Sbjct: 110 IIHVRLLAPVKSLAVARIPQNPSCSSLLLGFDTAKLSVVGFSAAERSLKTISLHCFEEEM 169
Query: 60 ---GRPTDNGQIGI-IDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEEL-----Q 110
G TD I +DP R + +Y V+PFD+ + + L +
Sbjct: 170 LKDGYVTDLPSPVIRVDPAQRCAVMLIYGRYLAVLPFDDTSPHLHTYTVALSSIDPRLVN 229
Query: 111 VLDIKFLYGCAKPTIVVLYQDNKD--ARHVKTYE------VALKDKDFVEGP-WSQNNLD 161
++DI FL G +PT++ LY+ + R Y+ V+L K+ V W NNL
Sbjct: 230 IIDIAFLDGYYEPTLLFLYEPAQTTAGRACVRYDTVCMLGVSLNTKEQVHASVWQLNNLP 289
Query: 162 NGADLLIPVPPPLCGVLIIGEETIVY-------CSA------NAFKAIPIRP------SI 202
+ ++ +P P+ G LIIG ++Y C + + F P++ ++
Sbjct: 290 MDCNQVLMIPRPIGGALIIGANELIYLNQSVPPCGSLLNSCMDGFTKFPLKSEKEMALTL 349
Query: 203 TKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVV 262
V + + L L ++ V ++ + + SI T++ +
Sbjct: 350 DGCAACVISTNKVVVCARSGALFILTLVVDSTNSVKSIEFKHEFDVSIPHTVTACSPGYL 409
Query: 263 YIGSSYGDSQLIKLNLQ------PDAKGSYVE---------------------------- 288
++GS GDS I+ + P K VE
Sbjct: 410 FVGSRVGDSLFIEYVSELVPVDDPIEKKLKVEVPQDDLEDEDLELYGKALPSVISQDVSV 469
Query: 289 ------VLERYVNLGP---IVDFCVVDLERQGQGQV--------VTCSGAYKDGSLRIVR 331
VL+R +N+ P + C L Q Q V G KD S+ I +
Sbjct: 470 EKMRFRVLDRMLNVAPCKKMTSGCSEGLNSYLQEQPRLDPVFDRVCACGHGKDSSICIFQ 529
Query: 332 NGIGINEQASVELQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEG 391
I + S ++G+ W++ D DT + + E LA+ +++L E E
Sbjct: 530 QSIRPDIITSSSIEGVIQYWAVGRRED---DTHMYIIASKELGSLALETDNDLVELEAPV 586
Query: 392 FCSQTQTLFCHDAIYNQL-VQVTSGSVRLVS 421
F + T+ + L VQVT+ S+ +V+
Sbjct: 587 FITSESTIAAGELADGGLSVQVTTSSIVVVA 617
>gi|24415580|gb|AAN41460.1| putative cleavage and polyadenylation specificity factor 160 kDa
subunit [Arabidopsis thaliana]
Length = 1442
Score = 78.2 bits (191), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 95/403 (23%), Positives = 160/403 (39%), Gaps = 108/403 (26%)
Query: 103 NIR-LEELQVLDIKFLYGCAKPTIVVLYQDNK------DARHVKTYEVALK-DKDFVEGP 154
N+R LE V D FL+G +P IV+L ++ +H AL + + P
Sbjct: 240 NLRDLEMKHVKDFVFLHGYIEPVIVILQEEEHTWAGRVSWKHHTCVLSALSINSTLKQHP 299
Query: 155 --WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDAD 212
WS NL + A L+ VP P+ GVL++ TI Y S +A A+ + + A +
Sbjct: 300 VIWSAINLPHDAYKLLAVPSPIGGVLVLCANTIHYHSQSASCALALNNYASSADSSQELP 359
Query: 213 GSRY-----------------LLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTIS 255
S + LL +G L LL + ++ V L + + +AS I+
Sbjct: 360 ASNFSVELDAAHGTWISNDVALLSTKSGELLLLTLIYDGRAVQRLDLSKSKASVLASDIT 419
Query: 256 YLDNAVVYIGSSYGDSQLIKLNLQ------------------------------------ 279
+ N++ ++GS GDS L++ + +
Sbjct: 420 SVGNSLFFLGSRLGDSLLVQFSCRSGPAASLPGLRDEDEDIEGEGHQAKRLRMTSDTFQD 479
Query: 280 -------------PD----AKGSY-VEVLERYVNLGPIVDFC----------VVDLERQG 311
PD A+ S+ V + VN+GP+ DF + +Q
Sbjct: 480 TIGNEELSLFGSTPDNSDSAQKSFSFAVRDSLVNVGPVKDFAYGLRINADANATGVSKQS 539
Query: 312 QGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL--------------RSST 357
++V CSG K+G+L ++R I VEL G KG+W++ ++
Sbjct: 540 NYELVCCSGHGKNGALCVLRQSIRPEMITEVELPGCKGIWTVYHKSSRGHNADSSKMAAD 599
Query: 358 DDPFDTFLVVSFISETRILAMNLEDELEE-TEIEGFCSQTQTL 399
+D + +L++S E R + + D L E TE + Q +T+
Sbjct: 600 EDEYHAYLIISL--EARTMVLETADLLTEVTESVDYYVQGRTI 640
Score = 78.2 bits (191), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 78/302 (25%), Positives = 137/302 (45%), Gaps = 24/302 (7%)
Query: 702 FEFISTYPLDTFEYGCSI----LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVE 757
+E + P+ T E+ ++ L + + ++ VGTAYV E+ +GR+L+F
Sbjct: 1097 WETKAKIPMQTSEHALTVRVVTLLNASTGENETLLAVGTAYV-QGEDVAARGRVLLFSFG 1155
Query: 758 ---DGKLQLIAE---KETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECG 811
D ++ E +E KGA+ ++ + G LL + KI L+KW +GT
Sbjct: 1156 KNGDNSQNVVTEVYSRELKGAISAVASIQGHLLISSGPKIILHKW----NGTELNGVAFF 1211
Query: 812 HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDD-- 869
+ + + FI++GD+ KSI L +K + + A+D+ + A E L D
Sbjct: 1212 DAPPLYVVSMNVVKSFILLGDVHKSIYFLSWKEQGSQLSLLAKDFESLDCFATEFLIDGS 1271
Query: 870 DIYLGAENNFNLFTVRKNSEGATDEERG-RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV 928
+ L + V + + +G +L E+H+G V++F L +++ S
Sbjct: 1272 TLSLAVSDEQKNIQVFYYAPKMIESWKGLKLLSRAEFHVGAHVSKF----LRLQMVSSGA 1327
Query: 929 GQIP--TVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEK 986
+I ++FGT++G G IA L + L+ LQ L + V GLN +R F +
Sbjct: 1328 DKINRFALLFGTLDGSFGCIAPLDEVTFRRLQSLQKKLVDAVPHVAGLNPLAFRQFRSSG 1387
Query: 987 KT 988
K
Sbjct: 1388 KA 1389
>gi|10257491|dbj|BAB11613.1| cleavage and polyadenylation specificity factor subunit [Arabidopsis
thaliana]
Length = 1448
Score = 78.2 bits (191), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 78/302 (25%), Positives = 137/302 (45%), Gaps = 24/302 (7%)
Query: 702 FEFISTYPLDTFEYGCSI----LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVE 757
+E + P+ T E+ ++ L + + ++ VGTAYV E+ +GR+L+F
Sbjct: 1103 WETKAKIPMQTSEHALTVRVVTLLNASTGENETLLAVGTAYV-QGEDVAARGRVLLFSFG 1161
Query: 758 ---DGKLQLIAE---KETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECG 811
D ++ E +E KGA+ ++ + G LL + KI L+KW +GT
Sbjct: 1162 KNGDNSQNVVTEVYSRELKGAISAVASIQGHLLISSGPKIILHKW----NGTELNGVAFF 1217
Query: 812 HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDD-- 869
+ + + FI++GD+ KSI L +K + + A+D+ + A E L D
Sbjct: 1218 DAPPLYVVSMNVVKSFILLGDVHKSIYFLSWKEQGSQLSLLAKDFESLDCFATEFLIDGS 1277
Query: 870 DIYLGAENNFNLFTVRKNSEGATDEERG-RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV 928
+ L + V + + +G +L E+H+G V++F L +++ S
Sbjct: 1278 TLSLAVSDEQKNIQVFYYAPKMIESWKGLKLLSRAEFHVGAHVSKF----LRLQMVSSGA 1333
Query: 929 GQIP--TVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEK 986
+I ++FGT++G G IA L + L+ LQ L + V GLN +R F +
Sbjct: 1334 DKINRFALLFGTLDGSFGCIAPLDEVTFRRLQSLQKKLVDAVPHVAGLNPLAFRQFRSSG 1393
Query: 987 KT 988
K
Sbjct: 1394 KA 1395
Score = 59.3 bits (142), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 56/210 (26%), Positives = 93/210 (44%), Gaps = 27/210 (12%)
Query: 103 NIR-LEELQVLDIKFLYGCAKPTIVVLYQDNK------DARHVKTYEVALK-DKDFVEGP 154
N+R LE V D FL+G +P IV+L ++ +H AL + + P
Sbjct: 240 NLRDLEMKHVKDFVFLHGYIEPVIVILQEEEHTWAGRVSWKHHTCVLSALSINSTLKQHP 299
Query: 155 --WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDAD 212
WS NL + A L+ VP P+ GVL++ TI Y S +A A+ + + A +
Sbjct: 300 VIWSAINLPHDAYKLLAVPSPIGGVLVLCANTIHYHSQSASCALALNNYASSADSSQELP 359
Query: 213 GSRY-----------------LLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTIS 255
S + LL +G L LL + ++ V L + + +AS I+
Sbjct: 360 ASNFSVELDAAHGTWISNDVALLSTKSGELLLLTLIYDGRAVQRLDLSKSKASVLASDIT 419
Query: 256 YLDNAVVYIGSSYGDSQLIKLNLQPDAKGS 285
+ N++ ++GS GDS L++ + + S
Sbjct: 420 SVGNSLFFLGSRLGDSLLVQFSCRSGPAAS 449
Score = 48.5 bits (114), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 36/136 (26%), Positives = 62/136 (45%), Gaps = 27/136 (19%)
Query: 289 VLERYVNLGPIVDFC----------VVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINE 338
V + VN+GP+ DF + +Q ++V CSG K+G+L ++R I
Sbjct: 513 VRDSLVNVGPVKDFAYGLRINADANATGVSKQSNYELVCCSGHGKNGALCVLRQSIRPEM 572
Query: 339 QASVELQGIKGMWSL--------------RSSTDDPFDTFLVVSFISETRILAMNLEDEL 384
VEL G KG+W++ ++ +D + +L++S E R + + D L
Sbjct: 573 ITEVELPGCKGIWTVYHKSSRGHNADSSKMAADEDEYHAYLIISL--EARTMVLETADLL 630
Query: 385 EE-TEIEGFCSQTQTL 399
E TE + Q +T+
Sbjct: 631 TEVTESVDYYVQGRTI 646
>gi|30696088|ref|NP_199979.2| cleavage and polyadenylation specificity factor subunit 1
[Arabidopsis thaliana]
gi|290457637|sp|Q9FGR0.2|CPSF1_ARATH RecName: Full=Cleavage and polyadenylation specificity factor subunit
1; AltName: Full=Cleavage and polyadenylation specificity
factor 160 kDa subunit; Short=AtCPSF160; Short=CPSF 160
kDa subunit
gi|332008729|gb|AED96112.1| cleavage and polyadenylation specificity factor subunit 1
[Arabidopsis thaliana]
Length = 1442
Score = 78.2 bits (191), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 78/302 (25%), Positives = 137/302 (45%), Gaps = 24/302 (7%)
Query: 702 FEFISTYPLDTFEYGCSI----LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVE 757
+E + P+ T E+ ++ L + + ++ VGTAYV E+ +GR+L+F
Sbjct: 1097 WETKAKIPMQTSEHALTVRVVTLLNASTGENETLLAVGTAYV-QGEDVAARGRVLLFSFG 1155
Query: 758 ---DGKLQLIAE---KETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECG 811
D ++ E +E KGA+ ++ + G LL + KI L+KW +GT
Sbjct: 1156 KNGDNSQNVVTEVYSRELKGAISAVASIQGHLLISSGPKIILHKW----NGTELNGVAFF 1211
Query: 812 HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDD-- 869
+ + + FI++GD+ KSI L +K + + A+D+ + A E L D
Sbjct: 1212 DAPPLYVVSMNVVKSFILLGDVHKSIYFLSWKEQGSQLSLLAKDFESLDCFATEFLIDGS 1271
Query: 870 DIYLGAENNFNLFTVRKNSEGATDEERG-RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV 928
+ L + V + + +G +L E+H+G V++F L +++ S
Sbjct: 1272 TLSLAVSDEQKNIQVFYYAPKMIESWKGLKLLSRAEFHVGAHVSKF----LRLQMVSSGA 1327
Query: 929 GQIP--TVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEK 986
+I ++FGT++G G IA L + L+ LQ L + V GLN +R F +
Sbjct: 1328 DKINRFALLFGTLDGSFGCIAPLDEVTFRRLQSLQKKLVDAVPHVAGLNPLAFRQFRSSG 1387
Query: 987 KT 988
K
Sbjct: 1388 KA 1389
Score = 77.0 bits (188), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 95/403 (23%), Positives = 159/403 (39%), Gaps = 108/403 (26%)
Query: 103 NIR-LEELQVLDIKFLYGCAKPTIVVLYQDNK------DARHVKTYEVALK-DKDFVEGP 154
N+R LE V D FL+G +P IV+L ++ +H AL + + P
Sbjct: 240 NLRDLEMKHVKDFVFLHGYIEPVIVILQEEEHTWAGRVSWKHHTCVLSALSINSTLKQHP 299
Query: 155 --WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDAD 212
WS NL + A L+ VP P+ GVL++ TI Y S +A A+ + + A +
Sbjct: 300 VIWSAINLPHDAYKLLAVPSPIGGVLVLCANTIHYHSQSASCALALNNYASSADSSQELP 359
Query: 213 GSRY-----------------LLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTIS 255
S + LL +G L LL + ++ V L + + +AS I+
Sbjct: 360 ASNFSVELDAAHGTWISNDVALLSTKSGELLLLTLIYDGRAVQRLDLSKSKASVLASDIT 419
Query: 256 YLDNAVVYIGSSYGDSQLI------------------------------KLNLQPD---- 281
+ N++ ++GS GDS L+ +L + D
Sbjct: 420 SVGNSLFFLGSRLGDSLLVQFSCRSGPAASLPGLRDEDEDIEGEGHQAKRLRMTSDTFQD 479
Query: 282 -------------------AKGSY-VEVLERYVNLGPIVDFC----------VVDLERQG 311
A+ S+ V + VN+GP+ DF + +Q
Sbjct: 480 TIGNEELSLFGSTPNNSDSAQKSFSFAVRDSLVNVGPVKDFAYGLRINADANATGVSKQS 539
Query: 312 QGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL--------------RSST 357
++V CSG K+G+L ++R I VEL G KG+W++ ++
Sbjct: 540 NYELVCCSGHGKNGALCVLRQSIRPEMITEVELPGCKGIWTVYHKSSRGHNADSSKMAAD 599
Query: 358 DDPFDTFLVVSFISETRILAMNLEDELEE-TEIEGFCSQTQTL 399
+D + +L++S E R + + D L E TE + Q +T+
Sbjct: 600 EDEYHAYLIISL--EARTMVLETADLLTEVTESVDYYVQGRTI 640
>gi|320165260|gb|EFW42159.1| predicted protein [Capsaspora owczarzaki ATCC 30864]
Length = 2631
Score = 78.2 bits (191), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 56/205 (27%), Positives = 95/205 (46%), Gaps = 32/205 (15%)
Query: 492 PSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPRSVLLCAFEGISYLLCALGDG 551
P + + A+ +W + V +F LP + L+ + G+++ RS+ L A EG L L DG
Sbjct: 1661 PKATTLVAISLWKTMDVLVFQLPGMELVAQHRFAGDVVVRSLALHAIEGRIKLFVGLADG 1720
Query: 552 HLLNFLLNMKTG--------ELTDRKKVSLGTQPITLRTF---------------SSKNT 588
HL+ F L ++ E +R+ V +G+ P+TL F + +
Sbjct: 1721 HLVQFTLVPRSVVAGTKCELEFAERRAVLVGSSPLTLEVFPLAGPADNDHGESVEAGRPP 1780
Query: 589 THVFAASDRPTVIYS----SNKKLLYSNVNLKEVSHMCPFNSAAFPDS-LAIAKEGELTI 643
+ D+P ++ S ++L N+ L E S C F+SA F S L +A +G +++
Sbjct: 1781 VGLLVCCDKPALVLPGKTDSQTQVLPLNIALAEKS--CCFSSALFRQSTLLLAAQGRISV 1838
Query: 644 GTID--DIQKLHIRSIPLGEHPRRI 666
T D ++LH + LG P I
Sbjct: 1839 ATFAQLDGKQLHAEKLLLGATPLAI 1863
>gi|124505011|ref|XP_001351247.1| CPSF (cleavage and polyadenylation specific factor), subunit A,
putative [Plasmodium falciparum 3D7]
gi|7768292|emb|CAB11136.2| CPSF (cleavage and polyadenylation specific factor), subunit A,
putative [Plasmodium falciparum 3D7]
Length = 2870
Score = 78.2 bits (191), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 108/505 (21%), Positives = 209/505 (41%), Gaps = 79/505 (15%)
Query: 565 LTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSS-NKKLLYSNVNLKEVSHMCP 623
+++KK+ + I K +F + P +IYS KK+ S ++LK + +
Sbjct: 2344 FSEKKKIDINQSNII------KKYNFLFVCCESPIIIYSDLKKKINVSKLSLKNIYIVDI 2397
Query: 624 FNS----AAFPDSLAIAKEGE------------LTIGTIDDIQKLHIRSIPLGEHPRRIC 667
FN F + L+ K+ + + I ++ I+K ++ IP +I
Sbjct: 2398 FNDFNYLNPFHNFLSFKKKNQNNFYFIFYDGSNIHISPLNQIKKTFLKKIPFHRTVEKIA 2457
Query: 668 -HQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCS-------- 718
H + A C + + E + D TY + + +Y S
Sbjct: 2458 YHSDTGLLIAACPSEEKHKTNEMMKQIICFFDPYHDSIKYTYIIPS-KYTVSTIIIYDNE 2516
Query: 719 -ILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKE------TKG 771
++ +F S ++ VGT + EPT G I +FI + K + K G
Sbjct: 2517 KLMKSNFDVTSFIF--VGTCNSNEKYTEPTSGHIHIFIAK-KKANIFEIKHIYTHNINYG 2573
Query: 772 AVYSLNAFNGKLLAAINQKIQL---------YKWMLRDDGTRELQSECGHHGHILALY-- 820
V +L ++ K++A IN + + Y+ + + LQ + + I+ L
Sbjct: 2574 GVTNLVPYDDKIVATINNMVVILDINNLIIKYEAFM---DPQNLQPKIEGNNAIVELVSF 2630
Query: 821 --------VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIY 872
V GD+IVVGD+M S+++L Y +E + E RDY+ W +++ L
Sbjct: 2631 TPSSWIMTVDVYGDYIVVGDIMTSVTILQYDYENSQLFEVCRDYSNIWCTSLCALSKSHI 2690
Query: 873 LGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNR---FRHGSLVMRLPDSDVG 929
+ ++ + N ++K+ DE+ +L V ++ G +N+ + +L+ D D
Sbjct: 2691 VVSDMDANFIILQKSKFKYNDEDSYKLSSVSLFNHGSIINKMLPLSNTNLIEE--DYDKR 2748
Query: 930 QIPT----VIFGTVNGVIGVIASLPHEQYLFLEK---LQTNLRKVIKGVGGLNHEQWRSF 982
I T ++ + G I V+ +P + +K ++ + I +G L+H +R +
Sbjct: 2749 NILTKNDGILCASSEGSISVL--IPFSSFANFKKALCIEIAITDNISSIGNLSHNAYREY 2806
Query: 983 NNEKKTVDAKNFLDGDLIESFLDLS 1007
++ K +DG+L++ F +S
Sbjct: 2807 KVNFRSKHCKGIVDGELLKMFFHMS 2831
Score = 43.5 bits (101), Expect = 0.64, Method: Compositional matrix adjust.
Identities = 27/108 (25%), Positives = 55/108 (50%), Gaps = 6/108 (5%)
Query: 21 EAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDNGQIGIIDPDCRLIGL 80
+ + +LF+ T +Y +L++D + ++ IT + ++ + G + I ++D + I
Sbjct: 83 KVKSYLFVLTRKYVLLLLEYDVKENDFITLSKINLCELNGLHLEEDVIFLLDERHKTILF 142
Query: 81 HLYDGLFKVI--PFDNKGQLKEAFNIRLEELQVLDIKFL----YGCAK 122
+ Y + K I +DN L + +R++E ++DI FL GC K
Sbjct: 143 YGYKNILKYIYLDYDNFLNLNNVYTMRIDESLIIDIAFLGTHTMGCNK 190
Score = 43.1 bits (100), Expect = 0.85, Method: Compositional matrix adjust.
Identities = 17/69 (24%), Positives = 38/69 (55%), Gaps = 1/69 (1%)
Query: 286 YVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQ 345
Y+E+L N+GPI+D CVV + + +++TC+ + G + I+++G+ N +
Sbjct: 970 YIEILSVIQNMGPILDMCVVK-NKNNEYEIITCNSYGRTGCVSIIQSGLKTNITCDLNFN 1028
Query: 346 GIKGMWSLR 354
+ + ++
Sbjct: 1029 KLNNFFVVK 1037
>gi|290981010|ref|XP_002673224.1| CPSF A subunit [Naegleria gruberi]
gi|284086806|gb|EFC40480.1| CPSF A subunit [Naegleria gruberi]
Length = 1373
Score = 77.8 bits (190), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 103/528 (19%), Positives = 218/528 (41%), Gaps = 92/528 (17%)
Query: 591 VFAASDRPTVIYSSNKKL-LYSNVNLKEVSHMCPFNSAAFPDSLAI--------AKEGEL 641
+F ++P +++ + L ++ + V+ P++ P K+ +L
Sbjct: 849 LFKTGEKPFWLFTEHSNLRVHPTQSRDPVTTFTPYHHENCPHGFIYLTDKEQDNKKQSKL 908
Query: 642 TIGTIDDIQKLHI----RSIPLGEHPRRIC-HQEQSRTFAICSLKNQSCAEES------- 689
I +++ K + R I L P I HQ+ + A S+ ++ +S
Sbjct: 909 HISSLNANVKFNAYWPQRKILLKSTPNVITFHQDTNTCLAFTSVPVKAILPDSIPFPEGK 968
Query: 690 ------EMHFVRLLDDQTFEFISTYPLDTFEYGCS-----ILSCSFSDDSNVYY------ 732
+ H V+L ++ + + D E + + ++DD+++ +
Sbjct: 969 CPPPAEQKHTVKLFSGHNWQEMDKFEFDLHESAVAAKVVYLSKEEYNDDTDISFEEPLNS 1028
Query: 733 ---------CVGTAYVLPEENEPTKGRILVFIVE-------DGKLQLIAEKETKGAVYSL 776
VGTAYV E E +GR+L+F ++ + KL LI+ KG + +L
Sbjct: 1029 RKQDLVSVVAVGTAYV-QSERELCRGRLLLFDLDPILGRENEYKLNLISSTSVKGPITTL 1087
Query: 777 NAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKS 836
+ ++ ++ +I Y + D + + + + T +FI+ GD+ KS
Sbjct: 1088 EQVDRYIICSVGNRIYTYYF---DWEEKRMHITSFYDTQFYTASLNTVRNFIMFGDIYKS 1144
Query: 837 ISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG-----AENNFNLFTVRKNSEGA 891
+S L +K + + A+D + + E L ++ LG N +F+ + +
Sbjct: 1145 VSFLRWKEKGHRLILLAKDNRPLQVVSSEFLVNNDLLGLAVIDTSKNLQIFSYLPQHQES 1204
Query: 892 TDEERGR-LEVVGEYHLGEFVNRFRHGSLVMRLPDSD------VGQIPT----------- 933
D GR L V ++H+G +N V LPD + V + P
Sbjct: 1205 ND---GRNLVPVCDFHIGTLINSLIRMK-VRELPDDNTIRLGNVNEKPKQSGKKDITKTN 1260
Query: 934 -----VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKT 988
++FG+V+G IG +A + + L LQ + ++ GL+ + +R + ++T
Sbjct: 1261 PNHQFILFGSVDGAIGYVAPINEVTHRRLFALQLKMYTQLEQAAGLHPKSFRLYKPLERT 1320
Query: 989 V--DAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTR 1034
KN +DG LI ++ +++ ++++ + + + + + ++EL +
Sbjct: 1321 EYNYKKNIIDGQLIWNYANINTILQRDLARQIGTNSDNILRSIQELNQ 1368
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 126/668 (18%), Positives = 237/668 (35%), Gaps = 164/668 (24%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATE-RYKFCVLQWDAESSELITRAMGDVSDRIGRPTD 64
++G I ++ G ++ L I E K V+++D + S+L T ++ + D R
Sbjct: 58 LFGVIDSIIAVCLQGMKKEMLLINFEDEAKVSVVEFDEKRSDLKTLSLHYLEDDFLREGK 117
Query: 65 ----NGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQ----------------------- 97
+ Q I+DP R + + D ++PF G+
Sbjct: 118 ARFFHNQPIILDPQNRFATVIICDSKLVILPFRQSGEDVSLSTEDNFLFALSGDQEEANE 177
Query: 98 ------------LKEAFNIRLEELQVLDIK---FLYGCAKPTIVVLYQDNKD-----ARH 137
++ I L +L + ++K FL G +PTI+ L+++ + A
Sbjct: 178 NVGDQKKHHQPEVQRQVIIDLNDLGIKNVKDYCFLNGYNEPTILFLHENEQTWSGRLAAK 237
Query: 138 VKTYEVALKDKD----FVEGPWSQNNLDNGADLLIPVPPPLCG-VLIIGEETIVYCSANA 192
T V D + WS +L + + LIP+ + G L+IG +I++ + A
Sbjct: 238 SNTSTVTAVSFDLFRKYYPKIWSVGSLPHDCNKLIPLQEDVAGGALVIGMNSIIHINQCA 297
Query: 193 FKAIPIR------PSITKAYGRVDADG-------------SRYLLGDHAGLLHLLVITHE 233
+ P+++ + D + L+ G L+ + +
Sbjct: 298 TYGLSFNDFAVSNPNLSINFNTFDGPALFFDTVAYTFIARDKLLVSLKDGELYTMYLESG 357
Query: 234 KEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKL----------------- 276
++ + I+ T+ AS + L ++++GS GDS L +
Sbjct: 358 GSRINNINIKKTSNTTPASCMCTLKGNLIFLGSKIGDSVLYEYQEKVEVETSSLDTDEEM 417
Query: 277 --------NLQPDAKG-------SYVEVLER----------------------------Y 293
N +P+ K + LE+ +
Sbjct: 418 SSVFAAGENFEPEKKKRKLADDDDFFAALEKDEEPTVIESFSKVSKKETTKVELKIKHVF 477
Query: 294 VNLGPIVDFCVV-----DLE------RQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASV 342
N+GPI D+ Q + CSG + G L ++ + + Q+
Sbjct: 478 TNIGPISHLTAAVTSSFDMSGFKSKTNDNQLSAIACSGIGRHGCLTVLNRSLQPDIQSEA 537
Query: 343 ELQG-IKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFC 401
L +K +W++ T+ D +L++S +T++ + L E + +T
Sbjct: 538 TLPFLVKQVWTISQKTEH--DLYLILSLEDKTKVFES--KATLAEVTSKSMFVTNETTLN 593
Query: 402 HDAIYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYL 461
I +VQVT SV L+ S +++ + K + + VLL G LV L
Sbjct: 594 IGKIRESIVQVTRKSVMLIGSEPKQVHHSKKE-----IRSSIILDPYVLLHFYDGSLVLL 648
Query: 462 EIGDGILT------EVKHAQLEYEISCLDINP----IGENPSYSQIAAVGMWTDISVRIF 511
+G +T E H ++ + NP G N + WTD + I
Sbjct: 649 THDNGRVTSKQLDIESNHGKIT-AVCLYKTNPEFEFFGINEKEGKYLCCVYWTDGAFEIL 707
Query: 512 SLPDLNLI 519
S+PD+ +
Sbjct: 708 SVPDMTCV 715
>gi|238508528|ref|XP_002385456.1| cleavage and polyadenylation specificity factor subunit A, putative
[Aspergillus flavus NRRL3357]
gi|220688975|gb|EED45327.1| cleavage and polyadenylation specificity factor subunit A, putative
[Aspergillus flavus NRRL3357]
Length = 1204
Score = 77.4 bits (189), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 93/435 (21%), Positives = 176/435 (40%), Gaps = 64/435 (14%)
Query: 654 IRSIPLGEHPRRICHQEQSRTFAICSLKNQ--SCAEESEMH-----------------FV 694
I+ IP+GE + + S + I + E+ E+H +
Sbjct: 772 IQQIPIGEQVDHLAYSSSSGMYVIGTSHRTEFKLPEDDELHPEWRNEMTSFFPEVQRSSL 831
Query: 695 RLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVY-----YCVGTAYVLPEENEPTKG 749
+++ +T+ I +Y L E+ ++ + S N + VGTA+ E+ ++G
Sbjct: 832 KVVSPKTWTVIDSYLLSPAEHVMAVKNMSLEISENTHERKDMIVVGTAFARGEDIA-SRG 890
Query: 750 RILVFIV-----------EDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWML 798
+ VF V D KL+L+ ++ KGAV +L+ G+ + Q + L
Sbjct: 891 CVYVFEVIKVVPDPKRPEMDRKLRLVGKEPVKGAVTALSEIGGQGFLIVAQGQKCIVRGL 950
Query: 799 RDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNA 858
++DG+ + H+ + ++ D +K + Y E + A+D +
Sbjct: 951 KEDGSLLPVAFMDVQCHVSVVKELKGTGMCIIADAVKGLWFAGYSEEPYKMSLFAKDLDY 1010
Query: 859 NWMSAVEILDDD---IYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFR 915
+ A + L D L A+++ NL ++ + E RL ++H G F+
Sbjct: 1011 LEVLAADFLPDGNKLFILVADSDCNLHVLQYDPEDPKSSNGDRLLSRSKFHTGNFI---- 1066
Query: 916 HGSLVMRLPDSDVG---------------QIP--TVIFGTVNGVIGVIASLPHEQYLFLE 958
S + LP + V +IP ++ + NG +G++ + E Y L
Sbjct: 1067 --STLTLLPRTSVSSEQMISDVDAMDVDIKIPRHQMLITSQNGSVGLVTCVSEESYRRLS 1124
Query: 959 KLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTM 1018
LQ+ L I+ GLN +R+ E + LDG L+ +LD+S+ R EI+ +
Sbjct: 1125 ALQSQLTNTIEHPCGLNPRAFRAV--ESDGTAGRGMLDGKLLFQWLDMSKQRKVEIASRV 1182
Query: 1019 NVSVEELCKRVEELT 1033
+ E+ E ++
Sbjct: 1183 GANEWEIKADFEAIS 1197
>gi|317157892|ref|XP_001826637.2| protein cft1 [Aspergillus oryzae RIB40]
gi|391864317|gb|EIT73613.1| mRNA cleavage and polyadenylation factor II complex, subunit CFT1
[Aspergillus oryzae 3.042]
Length = 1389
Score = 77.0 bits (188), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 93/435 (21%), Positives = 176/435 (40%), Gaps = 64/435 (14%)
Query: 654 IRSIPLGEHPRRICHQEQSRTFAICSLKNQ--SCAEESEMH-----------------FV 694
I+ IP+GE + + S + I + E+ E+H +
Sbjct: 957 IQQIPIGEQVDHLAYSSSSGMYVIGTSHRTEFKLPEDDELHPEWRNEMTSFFPEVQRSSL 1016
Query: 695 RLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVY-----YCVGTAYVLPEENEPTKG 749
+++ +T+ I +Y L E+ ++ + S N + VGTA+ E+ ++G
Sbjct: 1017 KVVSPKTWTVIDSYLLSPAEHVMAVKNMSLEISENTHERKDMIVVGTAFARGEDIA-SRG 1075
Query: 750 RILVFIV-----------EDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWML 798
+ VF V D KL+L+ ++ KGAV +L+ G+ + Q + L
Sbjct: 1076 CVYVFEVIKVVPDPKRPEMDRKLRLVGKEPVKGAVTALSEIGGQGFLIVAQGQKCIVRGL 1135
Query: 799 RDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNA 858
++DG+ + H+ + ++ D +K + Y E + A+D +
Sbjct: 1136 KEDGSLLPVAFMDVQCHVSVVKELKGTGMCIIADAVKGLWFAGYSEEPYKMSLFAKDLDY 1195
Query: 859 NWMSAVEILDDD---IYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFR 915
+ A + L D L A+++ NL ++ + E RL ++H G F+
Sbjct: 1196 LEVLAADFLPDGNKLFILVADSDCNLHVLQYDPEDPKSSNGDRLLSRSKFHTGNFI---- 1251
Query: 916 HGSLVMRLPDSDVG---------------QIP--TVIFGTVNGVIGVIASLPHEQYLFLE 958
S + LP + V +IP ++ + NG +G++ + E Y L
Sbjct: 1252 --STLTLLPRTSVSSEQMISDVDAMDVDIKIPRHQMLITSQNGSVGLVTCVSEESYRRLS 1309
Query: 959 KLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTM 1018
LQ+ L I+ GLN +R+ E + LDG L+ +LD+S+ R EI+ +
Sbjct: 1310 ALQSQLTNTIEHPCGLNPRAFRAV--ESDGTAGRGMLDGKLLFQWLDMSKQRKVEIASRV 1367
Query: 1019 NVSVEELCKRVEELT 1033
+ E+ E ++
Sbjct: 1368 GANEWEIKADFEAIS 1382
>gi|195583398|ref|XP_002081509.1| GD25678 [Drosophila simulans]
gi|194193518|gb|EDX07094.1| GD25678 [Drosophila simulans]
Length = 1450
Score = 77.0 bits (188), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 99/403 (24%), Positives = 183/403 (45%), Gaps = 62/403 (15%)
Query: 654 IRSIPLGEHPRRICHQEQSRTFAICSL-------------KNQSCAEESE-MHFVR---- 695
+R +PL PR++ + ++R + + + +++ +EES F+
Sbjct: 1026 VRKVPLRCTPRQLVYHRENRVYCLITQTEEPMTKYYRFNGEDKELSEESRGERFIYPIGS 1085
Query: 696 -----LLDDQTFEFI--STYPLDTFEY--GCSILSCSFSDDSN---VYYCVGTAYVLPEE 743
L+ +T+E + ++ + +E+ I+ S+ + Y C+GT + E
Sbjct: 1086 QFEMVLISPETWEIVPDASITFEPWEHVTAFKIVKLSYEGTRSGLKEYLCIGTNFNY-SE 1144
Query: 744 NEPTKGRILVF-----IVEDGK------LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQ 792
+ ++G I ++ + E GK ++ I +KE KG V +++ G L+ + QKI
Sbjct: 1145 DITSRGNIHIYDIIEVVPEPGKPMTKFKIKEIFKKEQKGPVSAISDVLGFLVTGLGQKI- 1203
Query: 793 LYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEER 852
Y W LRD +L +I + T I + D+ KSISLL ++ E +
Sbjct: 1204 -YIWQLRDG---DLIGVAFIDTNIYVHQIITVKSLIFIADVYKSISLLRFQEEYRTLSLA 1259
Query: 853 ARDYNANWMSAVEILDDDIYLG-----AENNFNLFTVRKNSEGATDEERGRLEVVGEYHL 907
+RD+N + +E + D+ LG AE N ++ + + + ++ L +YHL
Sbjct: 1260 SRDFNPLEVYGIEFMVDNSNLGFLVTDAERNLIVYMYQPEARESLGGQK--LLRKADYHL 1317
Query: 908 GEFVNR-FR----HGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQT 962
G+ VN FR L R P + V++GT++G +G LP + Y LQ
Sbjct: 1318 GQVVNTMFRVQCHQKGLHQRQPFLYENK-HFVVYGTLDGALGYCLPLPEKVYRRFLMLQN 1376
Query: 963 NLRKVIKGVGGLNHEQWRSFNNEKKT--VDAKNFLDGDLIESF 1003
L + + GLN +++R+ + KK ++ +DGDLI S+
Sbjct: 1377 VLLSYQEHLCGLNPKEYRTLKSSKKQGINPSRCIIDGDLIWSY 1419
Score = 56.6 bits (135), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 72/332 (21%), Positives = 136/332 (40%), Gaps = 65/332 (19%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
+YG + +L+ G +D L I+ + K VLQ D ++ L T ++ + R
Sbjct: 76 LYGNVMSLQCVSLAGAMRDALLISFKDAKLSVLQHDPDTFALKTLSLHYFEEDDIRGGWT 135
Query: 66 GQIGI----IDPDCRLIGLHLYDGLFKVIPFDNKGQLKE--------------------- 100
G+ + +DPD R + +Y V+PF L E
Sbjct: 136 GRYFVPTVRVDPDSRCAVMLVYGKRLVVLPFRKDNSLDEIELADVKPIKKAPTAMVSRTP 195
Query: 101 ---AFNIRLEEL-----QVLDIKFLYGCAKPTIVVLYQDNKDAR-HVKTYE-------VA 144
++ I L +L VLDI+FL+G +PT+++LY+ + +K ++
Sbjct: 196 IMASYLIALRDLDEKIDNVLDIQFLHGYYEPTLLILYEPVRTCPGRIKVRSDTCVLVAIS 255
Query: 145 LKDKDFVEG-PWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVY-------------CSA 190
L + V W+ N+L + P+ P+ G L++ ++Y SA
Sbjct: 256 LNIQQRVHPIIWTVNSLPFDCLQVYPIQKPIGGCLVMTVNAVIYLNQSVPPYGVSLNSSA 315
Query: 191 NAFKAIPIRP------SITKA-YGRVDADGSRYLLGDHAGLLHLLVITHEKEK-VTGLKI 242
+ A P++P S+ A + +D D + ++ G L++L + + + V
Sbjct: 316 DNSTAFPLKPQDGVRISLDCANFAFIDVD--KLVISLRTGDLYVLTLCVDSMRTVRNFHF 373
Query: 243 ELLGETSIASTISYLDNAVVYIGSSYGDSQLI 274
+ + S I L + +++GS G+S L+
Sbjct: 374 HKAAASVLTSCICVLHSEYIFLGSRLGNSLLL 405
Score = 42.7 bits (99), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 43/159 (27%), Positives = 72/159 (45%), Gaps = 26/159 (16%)
Query: 288 EVLERYVNLGPIVDFCV---VDLERQG-------------QGQVVTCSGAYKDGSLRIVR 331
EV + +N+ PI C V+ E G + ++V +G K+G+L +
Sbjct: 485 EVCDSLMNVAPINYMCAGERVEFEEDGVTLRPHAESLQDLKIELVAATGHSKNGALSVFV 544
Query: 332 NGIGINEQASVELQGIKGMWSL------RSSTDDPFDTFLVVSFISETRILAMNLEDELE 385
N + S EL G +W++ +SS +D D F+++S + T L + E+
Sbjct: 545 NCLNPQIITSFELDGCLDVWTVFDDATKKSSRNDQHD-FMLLSQRNST--LVLQTGQEIN 601
Query: 386 ETEIEGFCSQTQTLFCHDAIYNQ-LVQVTSGSVRLVSST 423
E E GF T+F + + +VQVT+ VRL+ T
Sbjct: 602 EIENTGFTVNQPTIFVGNLGQQRFIVQVTTRHVRLLQGT 640
>gi|239611898|gb|EEQ88885.1| protein CFT1 [Ajellomyces dermatitidis ER-3]
gi|327352847|gb|EGE81704.1| CFT1 [Ajellomyces dermatitidis ATCC 18188]
Length = 1402
Score = 77.0 bits (188), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 106/435 (24%), Positives = 183/435 (42%), Gaps = 63/435 (14%)
Query: 655 RSIPLGEHPRRICHQEQSRTFAICSLK--NQSCAEESEMH-----------------FVR 695
R I LGE + + S T+ I + + + + E+ E+H V+
Sbjct: 970 RKIGLGEQVDIVEYSSSSETYVIGTSQKVDFNLPEDDEIHPEWRNEVISFLPQIDQGSVK 1029
Query: 696 LLDDQTFEFISTYPLDTFE----YGCSILSCS-FSDDSNVYYCVGTAYVLPEENEPTKGR 750
LL +T+ I ++ L T E C L S + + VGTA V E+ +G
Sbjct: 1030 LLSPRTWSIIDSHTLRTAERIMCVKCLDLEVSEITHERRDMIAVGTA-VTRGEDIAARGC 1088
Query: 751 ILVFIV-----------EDGKLQLIAEKETKGAVYSLNAFNGK--LLAAINQKIQLYKWM 797
I +F V + KL+LIA++E KGAV SL+ G+ L+AA QK +
Sbjct: 1089 IYIFEVIEVVPEVDRPETNRKLKLIAKEEVKGAVTSLSGIGGQGFLIAAQGQKCIVRG-- 1146
Query: 798 LRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYN 857
L++DG+ + ++ L ++GD +K I Y E + ++D
Sbjct: 1147 LKEDGSLLPVAFMDMQCYVNVLKELKGTGMCIMGDALKGIWFAGYSEEPYKLSLFSKDDG 1206
Query: 858 ANWMSAVEILDDD--IY-LGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRF 914
+ A + L D +Y L A+++ N+ ++ + E + RL +H G F +
Sbjct: 1207 TLQVMAADFLPDGKRLYILVADDDCNIHVLQYDPEDPGSSKGDRLLHRSTFHTGHFASTM 1266
Query: 915 RHGSLVMRL-------PDSD--------VGQIPTVIFGTVNGVIGVIASLPHEQYLFLEK 959
+L+ R PD++ G + V+ + G I +I L Y L
Sbjct: 1267 ---TLLPRTIIPSAQGPDANPDMMELDSSGPLYHVLVTSETGSIALITPLSETAYRRLSA 1323
Query: 960 LQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN 1019
LQ+ L ++ GLN +R+ E + + +DGDL+ +LDL R EI+ +
Sbjct: 1324 LQSQLINTLEHPCGLNPRAFRAI--ESDGIGGRGMVDGDLLHRWLDLGTQRKAEIAHRVG 1381
Query: 1020 VSVEELCKRVEELTR 1034
+ E+ +E + +
Sbjct: 1382 ADIWEIRADLEAIGK 1396
>gi|261201748|ref|XP_002628088.1| protein CFT1 [Ajellomyces dermatitidis SLH14081]
gi|239590185|gb|EEQ72766.1| protein CFT1 [Ajellomyces dermatitidis SLH14081]
Length = 1403
Score = 77.0 bits (188), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 106/435 (24%), Positives = 183/435 (42%), Gaps = 63/435 (14%)
Query: 655 RSIPLGEHPRRICHQEQSRTFAICSLK--NQSCAEESEMH-----------------FVR 695
R I LGE + + S T+ I + + + + E+ E+H V+
Sbjct: 971 RKIGLGEQVDIVEYSSSSETYVIGTSQKVDFNLPEDDEIHPEWRNEVISFLPQIDQGSVK 1030
Query: 696 LLDDQTFEFISTYPLDTFE----YGCSILSCS-FSDDSNVYYCVGTAYVLPEENEPTKGR 750
LL +T+ I ++ L T E C L S + + VGTA V E+ +G
Sbjct: 1031 LLSPRTWSIIDSHTLRTAERIMCVKCLDLEVSEITHERRDMIAVGTA-VTRGEDIAARGC 1089
Query: 751 ILVFIV-----------EDGKLQLIAEKETKGAVYSLNAFNGK--LLAAINQKIQLYKWM 797
I +F V + KL+LIA++E KGAV SL+ G+ L+AA QK +
Sbjct: 1090 IYIFEVIEVVPEVDRPETNRKLKLIAKEEVKGAVTSLSGIGGQGFLIAAQGQKCIVRG-- 1147
Query: 798 LRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYN 857
L++DG+ + ++ L ++GD +K I Y E + ++D
Sbjct: 1148 LKEDGSLLPVAFMDMQCYVNVLKELKGTGMCIMGDALKGIWFAGYSEEPYKLSLFSKDDG 1207
Query: 858 ANWMSAVEILDDD--IY-LGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRF 914
+ A + L D +Y L A+++ N+ ++ + E + RL +H G F +
Sbjct: 1208 TLQVMAADFLPDGKRLYILVADDDCNIHVLQYDPEDPGSSKGDRLLHRSTFHTGHFASTM 1267
Query: 915 RHGSLVMRL-------PDSD--------VGQIPTVIFGTVNGVIGVIASLPHEQYLFLEK 959
+L+ R PD++ G + V+ + G I +I L Y L
Sbjct: 1268 ---TLLPRTIIPSAQGPDANPDMMELDSSGPLYHVLVTSETGSIALITPLSETAYRRLSA 1324
Query: 960 LQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN 1019
LQ+ L ++ GLN +R+ E + + +DGDL+ +LDL R EI+ +
Sbjct: 1325 LQSQLINTLEHPCGLNPRAFRAI--ESDGIGGRGMVDGDLLHRWLDLGTQRKAEIAHRVG 1382
Query: 1020 VSVEELCKRVEELTR 1034
+ E+ +E + +
Sbjct: 1383 ADIWEIRADLEAIGK 1397
>gi|225558298|gb|EEH06582.1| conserved hypothetical protein [Ajellomyces capsulatus G186AR]
Length = 1408
Score = 77.0 bits (188), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 105/434 (24%), Positives = 181/434 (41%), Gaps = 61/434 (14%)
Query: 655 RSIPLGEHPRRICHQEQSRTFAICSLKNQ----SCAEESEMH-----------------F 693
R I LGE + + S T+ I + NQ + E+ E+H
Sbjct: 976 RKIGLGEQVDAVEYSSSSETYVIGT--NQKVDFNLPEDDEIHPEWRNEVISFLPQIDKGS 1033
Query: 694 VRLLDDQTFEFISTYPLDTFE----YGCSILSCS-FSDDSNVYYCVGTAYVLPEENEPTK 748
V+LL +T+ I +Y L E C L S + + VGTA + E+ +
Sbjct: 1034 VKLLTPRTWSIIDSYNLRNAERIMCVKCLNLEVSEITHERKDTIVVGTA-LTKGEDIAAR 1092
Query: 749 GRILVFIV-----------EDGKLQLIAEKETKGAVYSLNAFNGK--LLAAINQKIQLYK 795
G I +F V + KL+LIA++E KGAV SL+ G+ L+AA QK +
Sbjct: 1093 GCIYIFEVIEVVPEVDRPETNRKLKLIAKEEVKGAVTSLSGIGGQGFLIAAQGQKCIVRG 1152
Query: 796 WMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARD 855
L++DG+ + ++ L ++GD +K + Y E + ++D
Sbjct: 1153 --LKEDGSLLPVAFMDMQCYVNVLKELKGTGMCIMGDALKGLWFAGYSEEPYKLSLFSKD 1210
Query: 856 YNANWMSAVEILDDD--IY-LGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVN 912
+ A + L D +Y L A+++ N+ ++ + E + RL + G F +
Sbjct: 1211 DGTLQVMAADFLPDGNRLYILVADDDCNIHVLQYDPEDPGSSKGDRLLHRSTFQTGHFAS 1270
Query: 913 RF----RHGSLVMRLPDSD--------VGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKL 960
R + + PD+D G + V+ + G I +I + Y L L
Sbjct: 1271 TMTLLPRTATSSSQGPDADPDMMDLDSSGPLHHVLVTSETGSIALITPVSETSYRRLSAL 1330
Query: 961 QTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV 1020
Q+ L ++ GLN +R+ E + + +DGDL++ +LDL R EI+ +
Sbjct: 1331 QSQLTNTLEHPCGLNPRAFRAV--ESDGIGGRGMVDGDLVKRWLDLGTQRKAEIANRVGA 1388
Query: 1021 SVEELCKRVEELTR 1034
V E+ +E + +
Sbjct: 1389 DVWEIRADLEAIGK 1402
>gi|240277254|gb|EER40763.1| cleavage factor two protein 1 [Ajellomyces capsulatus H143]
Length = 1408
Score = 76.6 bits (187), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 105/434 (24%), Positives = 181/434 (41%), Gaps = 61/434 (14%)
Query: 655 RSIPLGEHPRRICHQEQSRTFAICSLKNQ----SCAEESEMH-----------------F 693
R I LGE + + S T+ I + NQ + E+ E+H
Sbjct: 976 RKIGLGEQVDAVEYSSSSETYVIGT--NQKVDFNLPEDDEIHPEWRNEVISFLPQIDKGS 1033
Query: 694 VRLLDDQTFEFISTYPLDTFE----YGCSILSCS-FSDDSNVYYCVGTAYVLPEENEPTK 748
V+LL +T+ I +Y L E C L S + + VGTA + E+ +
Sbjct: 1034 VKLLTPRTWSIIDSYNLRNAERIMCVKCLNLEVSEITHERKDTIVVGTA-LTKGEDIAAR 1092
Query: 749 GRILVFIV-----------EDGKLQLIAEKETKGAVYSLNAFNGK--LLAAINQKIQLYK 795
G I +F V + KL+LIA++E KGAV SL+ G+ L+AA QK +
Sbjct: 1093 GCIYIFEVIKVVPEVDRPETNRKLKLIAKEEVKGAVTSLSGIGGQGFLIAAQGQKCIVRG 1152
Query: 796 WMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARD 855
L++DG+ + ++ L ++GD +K + Y E + ++D
Sbjct: 1153 --LKEDGSLLPVAFMDMQCYVNVLKELKGTGMCIMGDALKGLWFAGYSEEPYKLSLFSKD 1210
Query: 856 YNANWMSAVEILDDD--IY-LGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVN 912
+ A + L D +Y L A+++ N+ ++ + E + RL + G F +
Sbjct: 1211 DGTLQVMAADFLPDGNRLYILVADDDCNIHVLQYDPEDPGSSKGDRLLHRSTFQTGHFAS 1270
Query: 913 RF----RHGSLVMRLPDSD--------VGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKL 960
R + + PD+D G + V+ + G I +I + Y L L
Sbjct: 1271 TMTLLPRTATSSSQGPDADPDMMDLDSSGPLHHVLVTSETGSIALITPVSETSYRRLSAL 1330
Query: 961 QTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV 1020
Q+ L ++ GLN +R+ E + + +DGDL++ +LDL R EI+ +
Sbjct: 1331 QSQLANTLEHPCGLNPRAFRAV--ESDGIGGRGMVDGDLVKRWLDLGTQRKAEIANRVGA 1388
Query: 1021 SVEELCKRVEELTR 1034
V E+ +E + +
Sbjct: 1389 DVWEIRADLEAIGK 1402
>gi|212541400|ref|XP_002150855.1| cleavage and polyadenylation specificity factor subunit A, putative
[Talaromyces marneffei ATCC 18224]
gi|210068154|gb|EEA22246.1| cleavage and polyadenylation specificity factor subunit A, putative
[Talaromyces marneffei ATCC 18224]
Length = 1383
Score = 76.6 bits (187), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 95/429 (22%), Positives = 176/429 (41%), Gaps = 52/429 (12%)
Query: 654 IRSIPLGEHPRRICHQEQSRTFAICSLKNQ--SCAEESEMH-----------------FV 694
IR IPL E + + S T+ + + Q ++ E+H +
Sbjct: 951 IRRIPLNEQVDHLAYSTASGTYVVGTTHEQDFKLPDDDELHPEWATEEISLLPKVAYGSI 1010
Query: 695 RLLDDQTFEFISTYPLDTFEYGCSI--LSCSFSDDSNV---YYCVGTAYVLPEENEPTKG 749
+L++ +T++ I +Y E ++ ++ S+ + VGT Y E+ +G
Sbjct: 1011 KLINPKTWKVIDSYTFSPAERITAVENINLEISEKTGKRKDMIVVGTTYA-KGEDIAARG 1069
Query: 750 RILVFIVEDG-----------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWML 798
+ VF V D KL+LI E+ +GAV +++ G+ + Q + L
Sbjct: 1070 NVYVFDVIDVVPDPDEPGTNLKLKLIGEESIRGAVTAVSGIGGQGFMIVAQGQKCMVRGL 1129
Query: 799 RDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNA 858
+DDG+ + ++ + ++GD K + Y E + +D +
Sbjct: 1130 KDDGSLLPVAFIDVQCYVSVIKELKGTGMCLIGDAFKGLWFTGYSEEPYKMTLFGKDLDE 1189
Query: 859 NWMSAVEILDD--DIY-LGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFR 915
+ + L D +Y L A+ + NL+ ++ + E RL ++H+G F +
Sbjct: 1190 LEVVTADFLPDGKKLYILVADGDCNLYVLQYDPEDPKSSNGDRLLNRCKFHMGHFASTLT 1249
Query: 916 -------HGSLVMRLPDS-DVGQ---IPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNL 964
L + DS D+ + + T +G + +I SL E Y L LQ+ L
Sbjct: 1250 LLPRTAVSSELAVMSSDSMDIDSYTPLYQALITTQSGSMALITSLSEESYRRLTALQSQL 1309
Query: 965 RKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEE 1024
++ GLN +RS E V + +DG L+ +LDLSR+R EI+ + E
Sbjct: 1310 SNTLEHPCGLNPRAYRSV--ESDGVVGRGMIDGKLLMRWLDLSRSRKLEIAGRVGADEWE 1367
Query: 1025 LCKRVEELT 1033
+ +E ++
Sbjct: 1368 IRADLEAVS 1376
>gi|440793679|gb|ELR14857.1| CPSF A subunit region protein [Acanthamoeba castellanii str. Neff]
Length = 1477
Score = 76.3 bits (186), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 94/424 (22%), Positives = 174/424 (41%), Gaps = 66/424 (15%)
Query: 654 IRSIPLGEHPRRICHQEQSRTFAIC----------------------------------- 678
+R +PLG PR I + SRT+ +
Sbjct: 1053 VRKVPLGRTPREIAYHPPSRTYVVALATPVTTVVPTPPETDMERQEREREEEESREMGIE 1112
Query: 679 ------SLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDD-SNVY 731
+ + A E H + L+ +T++ + L+ E+ ++ D+ S V
Sbjct: 1113 PEEKQRDMGPREIAMMEERHELHLISPRTWQILHHVELEPKEHVLTLSVLKLGDNYSQVN 1172
Query: 732 YCVGTAYVLPEENEPTKGRILVFIVEDGKLQL-----IAEKETKGAVYSLNAFNGKLLAA 786
+ ++L E I V E KL + + EK KG V + + G L+ A
Sbjct: 1173 RELRPPHLLIYE-------IDVTGEEQCKLTMAYQKPMKEKPMKGPVSAAASLQGYLIIA 1225
Query: 787 INQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEE 846
+ KI ++ + D G+ E + +I++ ++T +F++ GD+ KSI L +K
Sbjct: 1226 VGPKIWVFNF---DGGSTEAVAFYDAPHYIVS--IKTLKNFVLCGDIYKSIFFLRWKDSA 1280
Query: 847 GAIEERARDYNANWMSAVEILDD--DIYLGAENNFNLFTVRKNSEGATDEERGRLEVV-G 903
+ A+D + A E + D ++ L + V + + G+L V G
Sbjct: 1281 SQLALLAKDVGRVSVFATEYVVDKQNLALLMSDERQNLQVTAYAPHTAESRGGQLLVPRG 1340
Query: 904 EYHLGEFVNRFRHGSLVMRLPDSDVG-QIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQT 962
++++G+ +N+F L M LP Q + FGT++G +G +A + + L LQ+
Sbjct: 1341 DFNVGQSINKFVR--LPMTLPSGTTSLQRHALWFGTLSGGVGYLAPMDESVFRRLGMLQS 1398
Query: 963 NLRKVIKGVGGLNHEQWRSFNNEKKTVDAKN-FLDGDLIESFLDLSRTRMDEISKTMNVS 1021
L I GL+ + +R+ E+ + K+ LDG L+ +L L +I+ + S
Sbjct: 1399 ALLSAIPHTAGLHPQAYRALQRERLLRNRKHTILDGLLLSRYLALDSATQQQIALKLGTS 1458
Query: 1022 VEEL 1025
E +
Sbjct: 1459 RERI 1462
Score = 56.6 bits (135), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 38/121 (31%), Positives = 65/121 (53%), Gaps = 18/121 (14%)
Query: 288 EVLERYVNLGPIVDFCVVD---------LERQGQG--QVVTCSGAYKDGSLRIVRNGIGI 336
+V + VN+GPI DF + + E++GQ ++VTCSG K+GSL ++++G+
Sbjct: 530 KVCDSLVNVGPITDFAIGESFDPASVSMAEQEGQRSVEIVTCSGQGKNGSLCVLQHGVRP 589
Query: 337 N-EQASVELQGIKGMWSL------RSSTDDPFDTFLVVSFISETRILAMNLEDELEETEI 389
AS +L G K W+L R + + +L++S +TR++A + DEL E
Sbjct: 590 ELVHASADLAGCKAFWTLYHRSEERQGEEAEYHAYLLLSEEEQTRVIAGDGLDELSNEET 649
Query: 390 E 390
+
Sbjct: 650 D 650
Score = 52.0 bits (123), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 53/239 (22%), Positives = 101/239 (42%), Gaps = 42/239 (17%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG 60
+L ++G I ++ R G ++D L ++ K VL++D +++L T ++ D
Sbjct: 90 VLSSSLFGNIESMAAVRFPGTSKDALLLSFRDAKISVLEFDIATNDLRTISLHYFEDYKV 149
Query: 61 RPTDNGQIGI----IDPDCRLIGLHLYDGLFKVIPF---------DNKGQ----LKEAFN 103
+ + I + +DP R + +D V+PF +N GQ +K +F
Sbjct: 150 KEGHDHYIHVPELRVDPQQRCAAMLAFDRKLVVLPFRQHASLMEIENGGQEDQPVKPSFL 209
Query: 104 IRLEELQVLDIK---FLYGCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGP------ 154
+ L + ++++K FL G +PT+++LY+ + + VA+ V
Sbjct: 210 LDLRAMGIINVKDFVFLQGYYEPTLLILYEPTQ----TWSGRVAVNRNTCVAAAVSLNLW 265
Query: 155 ------------WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIRPS 201
WS L LI VP P+ G L + +++Y + +F I P+
Sbjct: 266 QHRGQTSAHPVVWSAEFLPYDTQRLIAVPGPIGGALALSTNSLLYLNQVSFPYRLILPA 324
>gi|218194461|gb|EEC76888.1| hypothetical protein OsI_15095 [Oryza sativa Indica Group]
Length = 1503
Score = 76.3 bits (186), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 92/409 (22%), Positives = 175/409 (42%), Gaps = 112/409 (27%)
Query: 97 QLKEAFNIRLEEL---QVLDIKFLYGCAKPTIVVLYQDNKD------ARH----VKTYEV 143
+++ ++ I L L V D F++G +P +V+L++ ++H + + +
Sbjct: 234 RIESSYLIDLRALDMNHVKDFAFVHGYIEPVLVILHEQEPTWAGRILSKHHTCMISAFSI 293
Query: 144 ALKDKDFVEGP--WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVY------CSA--NAF 193
++ K + P WS NL + A L+ VPPP+ GVL+I +I Y CS N F
Sbjct: 294 SMTLK---QHPVIWSAANLPHDAYQLLAVPPPISGVLVICANSIHYHSQSTSCSLDLNNF 350
Query: 194 KAIPI-RPSITKAYGRVDADGSR--------YLLGDHAGLLHLLVITHEKEKVTGLKIEL 244
+ P P I+K+ +V+ D ++ + AG + LL + ++ V L +
Sbjct: 351 SSHPDGSPEISKSNFQVELDAAKATWFSNDIVMFSSKAGEMLLLTVVYDGRVVQRLDLMK 410
Query: 245 LGETSIASTISYLDNAVVYIGS----------SYGDSQLI-------------------- 274
+ ++S ++ + N+ ++GS SYG S+ +
Sbjct: 411 SKASVLSSAVTSIGNSFFFLGSRLGDSLLVQFSYGASKSVLQDLTNERSADIEGDLPFSK 470
Query: 275 -----------------KLNLQ----PDA-----KGSYVEVLERYVNLGPIVDFC----- 303
+L+ Q P++ K SY+ V + +N+GP+ DF
Sbjct: 471 RLKRIPSDVLQDVTSVEELSFQNIIAPNSLESAQKISYI-VRDALINVGPLKDFSYGLRA 529
Query: 304 -----VVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL----- 353
+ +Q ++V CSG K+GSL +++ I + VEL +G+W++
Sbjct: 530 NADPNAMGNAKQSNYELVCCSGHGKNGSLSVLQQSIRPDLITEVELPSCRGIWTVYYKSY 589
Query: 354 --RSSTDDPFDTFLVVSFISETRILAMNLEDELEE-TEIEGFCSQTQTL 399
+ + D+ + +L++S E R + + D+L E TE + Q T+
Sbjct: 590 RGQMAEDNEYHAYLIISL--ENRTMVLETGDDLGEVTETVDYFVQASTI 636
Score = 66.2 bits (160), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 90/411 (21%), Positives = 168/411 (40%), Gaps = 87/411 (21%)
Query: 702 FEFISTYPLDTFEYGCSI----LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVF--I 755
+E ST P+ FE ++ L + + ++ +GTAYVL E+ +GR+L+F +
Sbjct: 1095 WETKSTIPMQLFENALTVRIVTLHNTTTKENETLLAIGTAYVL-GEDVAARGRVLLFSFM 1153
Query: 756 VEDGKLQLIAE---KETKGAVYSLNAFNGKLLAA-------------------------- 786
+ L+ E KE+KGAV ++ + G LL A
Sbjct: 1154 KSENSQNLVTEVYSKESKGAVSAVASLQGHLLIASGPKITLNKWTGAELTAVAFYDAPLH 1213
Query: 787 ---------------INQKIQLYKW---------MLRDDGTRE-LQSECGHHGHILALY- 820
I++ I W + +D G+ + +E G L+L
Sbjct: 1214 VVSLNIVKNFVLFGDIHKSIYFLSWKEQGSQLSLLAKDFGSLDCFATEFLIDGSTLSLVA 1273
Query: 821 ------VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG 874
VQ + +F++ GD+ KSI L +K + + A+D+ + A E L D L
Sbjct: 1274 SDSDKNVQVK-NFVLFGDIHKSIYFLSWKEQGSQLSLLAKDFGSLDCFATEFLIDGSTLS 1332
Query: 875 -----AENNFNLFTVRKNSEGATDEERG-RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDV 928
++ N +F + + +G +L E+H+G + +F + LP +
Sbjct: 1333 LVASDSDKNVQIFYY---APKMVESWKGQKLLSRAEFHVGAHITKFLR---LQMLPTQGL 1386
Query: 929 GQIPT----VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNN 984
T ++FG ++G IG IA + + L+ LQ L + V GLN +R F++
Sbjct: 1387 SSEKTNRFALLFGNLDGGIGCIAPIDELTFRRLQSLQRKLVDAVPHVCGLNPRSFRQFHS 1446
Query: 985 EKK--TVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELT 1033
K N +D +L+ + LS ++++ + + ++ +++
Sbjct: 1447 NGKGHRPGPDNIIDFELLAHYEMLSLDEQLDVAQQIGTTRSQILSNFSDIS 1497
>gi|389609725|dbj|BAM18474.1| spliceosomal protein sap [Papilio xuthus]
Length = 367
Score = 75.9 bits (185), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 69/268 (25%), Positives = 124/268 (46%), Gaps = 40/268 (14%)
Query: 747 TKGRILVFIV--EDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTR 804
T+G + V+ + GKL+L+ + +L AFNGKL+A + + +++Y D G R
Sbjct: 112 TEGSLHVYKIFSNTGKLELVHKTPIDDYPGALAAFNGKLMAGVGRMLRMY-----DIGRR 166
Query: 805 ELQSEC-GHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSA 863
+L +C H L ++T G I V D+ +S+ + YK E + A D N W++
Sbjct: 167 KLLRKCENRHIPNLIADIKTIGQRIFVADVQESVFCVKYKKRENQLIIFADDTNPRWITN 226
Query: 864 VEILDDDIYLGAENNFNLFTVRKNSEGATDE------------ERGRL-------EVVGE 904
+LD D + + F +V + +D+ +RG L E+V
Sbjct: 227 TCVLDYDT-VAMSDKFGNVSVMRLPHSVSDDVDEDPTGNKALWDRGLLNGASQKGEIVVN 285
Query: 905 YHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLP---HEQYLFLEKLQ 961
+H+GE V + +L+ G +++ T++G +GV+ LP E + F + L+
Sbjct: 286 FHVGETVASLQRATLI-------PGGSEALLYATISGALGVL--LPFTSREDHDFFQHLE 336
Query: 962 TNLRKVIKGVGGLNHEQWRSFNNEKKTV 989
++R + G +H +RS+ K V
Sbjct: 337 MHMRSENSPLCGRDHLSFRSYYYPVKNV 364
>gi|255948500|ref|XP_002565017.1| Pc22g10080 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211592034|emb|CAP98296.1| Pc22g10080 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 1392
Score = 75.5 bits (184), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 84/375 (22%), Positives = 158/375 (42%), Gaps = 45/375 (12%)
Query: 694 VRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVY-----YCVGTAYVLPEENEPTK 748
++++ +T+ I +YPLD E ++ + + N + VGTA ++ E+ P +
Sbjct: 1019 IKVVSPKTWTIIDSYPLDPDEQVTAVKNVNIEVSENTHERRDLIVVGTA-IVKGEDMPAR 1077
Query: 749 GRILVFIV----------EDG-KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWM 797
G I VF V E G KL+LI ++ KGAV +L+ G+ + Q +
Sbjct: 1078 GTIYVFDVIKVAPDPEKPETGHKLKLIGKESVKGAVTALSGIGGQGFVIVAQGQKCMVRG 1137
Query: 798 LRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYN 857
L++DG+ + ++ +++GD +K + Y E + +D
Sbjct: 1138 LKEDGSLLPVAFMDMQCYVTVAKELKGTGLVILGDAVKGLWFAGYSEEPYRMTLFGKDPE 1197
Query: 858 ANWMSAVEILDD--DIY-LGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRF 914
+ A + L D +Y L A+++ NL ++ + E RL +++ G F
Sbjct: 1198 YLEVVAADFLPDGNKLYMLVADSDCNLHVLQYDPEDPKSSNGDRLLSRSKFYTGNF---- 1253
Query: 915 RHGSLVMRLPDSDVGQIPT-----------------VIFGTVNGVIGVIASLPHEQYLFL 957
S V LP + V T V+ + NG + ++ S+ E Y L
Sbjct: 1254 --ASSVTLLPRTAVSSERTESSEEGMDLDETFARHQVLIASQNGSLALVTSVAEESYRRL 1311
Query: 958 EKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKT 1017
LQ+ L + GLN +R+ E + +DG+L+ +L++ + R EI+
Sbjct: 1312 SALQSQLINTVDHPAGLNPRAFRAI--ESDGAAGRGMVDGNLLRLWLNMGKQRQTEIAGR 1369
Query: 1018 MNVSVEELCKRVEEL 1032
+ + E+ +E +
Sbjct: 1370 VGATEWEIKADLETI 1384
>gi|384946686|gb|AFI36948.1| cleavage and polyadenylation specificity factor subunit 1 [Macaca
mulatta]
Length = 1428
Score = 75.5 bits (184), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 87/346 (25%), Positives = 142/346 (41%), Gaps = 55/346 (15%)
Query: 618 VSHMCPFNSAAFPDS-LAIAKEGELTIGTIDDI----QKLHIRSIPLGEHPRRICHQEQS 672
V PF++ P L ++GEL I + +R IPL + + +S
Sbjct: 971 VDSFAPFHNVNCPRGFLYFNRQGELRISVLPAYLSYDAPWPVRKIPLRCTAHYVAYHVES 1030
Query: 673 RTFAICSLKNQSCAE-----------------------ESEMHFVRLLDDQTFEFI--ST 707
+ +A+ + N CA + E ++L+ ++E I +
Sbjct: 1031 KVYAVATSTNTPCARIPRMTGEEKEFETIERDERYIHPQQEAFSIQLISPVSWEAIPNAR 1090
Query: 708 YPLDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-----IVE 757
L +E+ + + S + V Y GT + EE +GRIL+ + E
Sbjct: 1091 IELQEWEHVTCMKTVSLRSEETVSGLKGYVAAGTCLMQGEEVT-CRGRILIMDVIEVVPE 1149
Query: 758 DG------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECG 811
G K +++ EKE KG V +L NG L++AI QKI L W LR EL
Sbjct: 1150 PGQPLTKNKFKVLYEKEQKGPVTALCHCNGHLVSAIGQKIFL--WSLR---ASELTGMAF 1204
Query: 812 HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI 871
+ + + +FI+ D+MKSISLL Y+ E + +RD + +V+ + D+
Sbjct: 1205 IDTQLYIHQMISVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNA 1264
Query: 872 YLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRF 914
LG ++ + NL E RL ++H+G VN F
Sbjct: 1265 QLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTF 1310
Score = 56.6 bits (135), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 107/566 (18%), Positives = 221/566 (39%), Gaps = 152/566 (26%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAM-----GDVSDRIG 60
+G + ++ + G +D L ++ + K V+++D + +L T ++ ++ D
Sbjct: 74 FFGNVMSMASVQLAGAKRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPELRDGFV 133
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPF-------DNKGQLKEA----------FN 103
+ ++ + DPD R + +Y V+PF +++G + E +
Sbjct: 134 QNVHTPRVRV-DPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGEGQRSSFLPSYIID 192
Query: 104 IR-LEE--LQVLDIKFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDFVEG-- 153
+R L+E L ++D++FL+G +PT+++L++ N+ A T + + +
Sbjct: 193 VRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKVH 252
Query: 154 P--WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAFKAIPI 198
P WS +L + VP P+ GV++ +++Y + A P+
Sbjct: 253 PVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTGTTAFPL 312
Query: 199 RPSITKAYGRVDADGS--------RYLLGDHAGLLHLL-VITHEKEKVTGLKIELLGETS 249
R T+ R+ D + + ++ G +++L +IT V + +
Sbjct: 313 R---TQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASV 369
Query: 250 IASTISYLDNAVVYIGSSYGDSQLI----KLNLQP--------------------DAKGS 285
+ +++ ++ +++GS G+S L+ KL P DA S
Sbjct: 370 LTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASAVREAADKEEPPSKKKRVDATAS 429
Query: 286 Y--------------------------------VEVLERYVNLGPIVDFCVVD---LERQ 310
+ EV + +N+GP + + + L +
Sbjct: 430 WSAGGKSVPQDEVDEIEVYGSEAQSGTQLATYSFEVCDSILNIGPCANAAMGEPAFLSEE 489
Query: 311 GQG------QVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL-----RSSTDD 359
Q ++V CSG K+G+L +++ I + EL G MW++ + D+
Sbjct: 490 FQNSPEPDLEIVVCSGHGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEEEDN 549
Query: 360 P-------------------FDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLF 400
P FL++S T IL E+ E + GF +Q T+F
Sbjct: 550 PKGEGTEQEARSPEADDDGRRHGFLILSREDSTMILQTG--QEIMELDTSGFATQGPTVF 607
Query: 401 CHDAIYNQ-LVQVTSGSVRLVSSTSR 425
+ N+ +VQV+ +RL+ ++
Sbjct: 608 AGNIGDNRYIVQVSPLGIRLLEGVNQ 633
>gi|145348791|ref|XP_001418827.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144579057|gb|ABO97120.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 1386
Score = 74.7 bits (182), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 90/390 (23%), Positives = 165/390 (42%), Gaps = 41/390 (10%)
Query: 665 RICHQEQSRTFAICSLKNQSCAEESEMHF---VRLLDDQTFEFISTYPLDTFEYGCSILS 721
R Q+ A+ ++ + +M VRLL + + + L+ E+ + +
Sbjct: 985 REIEQDNVHGIALAKVRRERAKANDDMELQYSVRLLVPGSLDSAWQHALEPGEHVQCVRN 1044
Query: 722 CSFSDDSN----VYYCVGTAYVLPE-ENEPTKGRILVFIV--------EDG---KLQLIA 765
D + VGTA +P E+ P +GR+++F + DG K Q+
Sbjct: 1045 VQLRDINTGALLSLLAVGTA--MPGGEDTPCRGRVILFQMVWERDAESMDGYRWKGQVCC 1102
Query: 766 EKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRG 825
+E K A +L+A +G L+ A+ K+ ++ W DG EL S I + +
Sbjct: 1103 VREAKMACTALSALDGHLIVAVGTKLTVHTW----DGV-ELNSVAFFDTPIHTVSINVVK 1157
Query: 826 DFIVVGDLMKSISLLIYKHE--EGAIEERARDYNANWMSAVEILDDDI---YLGAENNFN 880
+FI+VGDL K + +K E +I + ++D++ + + E L D LG++ + N
Sbjct: 1158 NFILVGDLEKGLHFFRWKANGFEKSIIQLSKDFDRMDVVSTEFLIDGATLSLLGSDMSGN 1217
Query: 881 LFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIP------TV 934
+ + + +L V YH+G ++R ++ + G+ P V
Sbjct: 1218 ARIFGYDPKSLESWKGQKLLVRSAYHVGSPISRMVRFNVEGTTAKAAPGERPKGTNRHAV 1277
Query: 935 IFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRS---FNNEKKTVDA 991
FGT++G +G+ Y L LQ L ++ G N +R+ F + + A
Sbjct: 1278 FFGTLDGALGIFMPTDEPTYAKLHALQRELNTTVRSPIGCNPRTFRTPKVFEGKHVQLLA 1337
Query: 992 K-NFLDGDLIESFLDLSRTRMDEISKTMNV 1020
+ LDG L+ F L+ T +++ V
Sbjct: 1338 PLDVLDGGLLSKFETLTFTEQRAVAERSGV 1367
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 69/331 (20%), Positives = 138/331 (41%), Gaps = 66/331 (19%)
Query: 8 GRIATLELFR-----PHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-- 60
G I ++ + R P + +D L IA K V+++DA + ++ +M +G
Sbjct: 67 GAIGSMSVLRRRFGAPRNQ-RDALLIAIRERKLSVVEYDAATGDVCCSSMHSFESALGCN 125
Query: 61 ------RPTDNGQIGIIDPDCRLIGLHL--------------YDGLFKVIPFDNKGQLK- 99
R + + + DP+ R + L DG ++ D++G+++
Sbjct: 126 PLGTTLRMSREAPLVVSDPEGRCAAVVLREDGVAGKVRVLPSVDGGLGLVANDDEGRVRG 185
Query: 100 ------EAFNIRLEELQVL-DIKFLYGCAKPTIVVLYQD----------NKDARHVKTYE 142
E+F + L ++++ D FL+G +P + VLY+ +KD +
Sbjct: 186 PAASVRESFPLHLPGVRLIRDACFLHGYGEPALAVLYEKTPTWAGRYNLSKDTCEIVALS 245
Query: 143 VALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIRPSI 202
V + DK W + NL + + L + PPL G L+ ++ +++ S + + +
Sbjct: 246 VDV-DKQKGTVIWRRQNLPSSSYKLTALLPPLGGALVFSQDFLLHESQESSSVLGLN--- 301
Query: 203 TKAYG--------RVDADGS--------RYLLGDHAGLLHLLVITHEKEKVTGLKIELLG 246
T +G + DG+ R L+ G L LL + + + + ++ G
Sbjct: 302 TFGHGGPQEGNDAEITLDGAQASVVSEDRVLVTTKTGALLLLALHTDGRSLRRMMLQRAG 361
Query: 247 ETSIASTISYLDNAVVYIGSSYGDSQLIKLN 277
++S + L ++++GS GDS L+K
Sbjct: 362 GAVLSSGMCLLSRDLLFLGSRIGDSLLVKFT 392
>gi|399216612|emb|CCF73299.1| unnamed protein product [Babesia microti strain RI]
Length = 1195
Score = 74.3 bits (181), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 162/767 (21%), Positives = 306/767 (39%), Gaps = 97/767 (12%)
Query: 202 ITKAYGRVDADGSR-YLLGDHAGLLHLLVIT--HEKEK-VTGLKIELLGETSIASTIS-- 255
+ ++ R+ D + Y D G L++ IT HEK+K + K+ L TIS
Sbjct: 350 LQRSLSRIGDDETHLYYACDVTGQLYMFKITIKHEKDKYLINDKMRNLDTIIPLYTISPF 409
Query: 256 ----YLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQG 311
L + +++ + GDS++++L + G E+ + +L PI+D V+
Sbjct: 410 SLLLALSDDCIFVATRVGDSKILQL-----SNGKVSEIWSK-PSLSPIIDATVIG--GNC 461
Query: 312 QGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTDDPFDTFLVVSFIS 371
Q++ SG K ++ I+ G+ + ++ L + ++ + S P FLV I+
Sbjct: 462 TDQLIAVSGIGKSANISILTIGLFFHNICNISLAKVSKLFIISSQ---PEMIFLVA--IT 516
Query: 372 ETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVS-STSRELRNE 430
T+ + D+ E S+ + C NQL Q+T S+R+ + R +
Sbjct: 517 STQSYYFKMVDQSIE-----LISRGDKIICAFQYNNQLYQLTPNSIRISTIDNPRITKVL 571
Query: 431 WKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDINPIGE 490
W S V +L+ +GG +L L I ++ + ++ S PI
Sbjct: 572 WTSDEPI---VHYLKCENMLIISGGYNLSALVIDKSDPIKITYNHIQSFTS-----PI-- 621
Query: 491 NPSYSQIAAVGMWTDISVRIFSLPDLNLIT---KEHLGG--EIIPRS-------VLLCAF 538
S IA V + + + + ++ LI + +G EI R+ ++
Sbjct: 622 ----SAIALVENMNKLHILVATFTNVLLIVDCETKSIGTICEIPTRTSCVGVSDLVYVVK 677
Query: 539 EGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKN----TTHVFAA 594
I Y+ L DG+LL + +R+ V + F SKN +
Sbjct: 678 NSIYYIFIGLTDGNLLVYT--------NEREDVRHFLECKGWEFFMSKNFGVYPLYFKHN 729
Query: 595 SDRPTVIYSSNKKLLYSNVNLKEVSHMC----PFNSAAF-PDSLAIAK-EGELTIGTIDD 648
S R T+I + +S + + V + P F D L IA L +G + D
Sbjct: 730 SQRLTLIGQHVHFVTFSPITGRIVDFLSLEVPPVRDLEFISDDLLIASLTDHLELGKLSD 789
Query: 649 IQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEFISTY 708
L +R IP+ I + + S + + S + ++ + T ++TY
Sbjct: 790 CNSLDVRRIPVSGALDLIEYHQPSNCVIVAFPQLTSESIFPSNAYIAVYSFVTGLLVTTY 849
Query: 709 PLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDG--------K 760
L+ G SI +F+ S+ +G + + P++GRI+ I + K
Sbjct: 850 YLEQSFEGSSI--STFTRGSDKTLLIGISST-DSDYLPSQGRIVACIYNNSTFTFKDSYK 906
Query: 761 LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTR------ELQSECGHHG 814
+ G V + + A +N + + +L + SEC
Sbjct: 907 FDKDLPEVPSGGVLQICTSKNFVFAGVNHSLVVLTPILHGRSWEGELSPIYIYSECN--- 963
Query: 815 HILALYVQTRGDFIVVGDLMKSISLL-IYKHEEG-AIEERARDYNANWMSAVEILDDDIY 872
+ + + D +++ D++ + LL I+ G ++EE A+D + +W S +++D +
Sbjct: 964 --MIIGIDVCNDLVLLTDIITGVKLLKIFSDTNGDSVEEIAKDTDTSWCSCAKLIDQNHI 1021
Query: 873 LGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSL 919
L ++ N+ ++ E DE+ L + ++HLGE VN+ G
Sbjct: 1022 LVSDTKGNITLLKLIDEPINDEQCYILTKIAKFHLGECVNKIVIGKF 1068
>gi|242798830|ref|XP_002483249.1| cleavage and polyadenylation specificity factor subunit A, putative
[Talaromyces stipitatus ATCC 10500]
gi|218716594|gb|EED16015.1| cleavage and polyadenylation specificity factor subunit A, putative
[Talaromyces stipitatus ATCC 10500]
Length = 1382
Score = 74.3 bits (181), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 95/435 (21%), Positives = 177/435 (40%), Gaps = 64/435 (14%)
Query: 654 IRSIPLGEHPRRICHQEQSRTFAICSLKNQ--SCAEESEMH-----------------FV 694
IR IPL E + + S T+ + + Q + E+H +
Sbjct: 950 IRKIPLNEEVDYLAYSTVSGTYVVGTTHEQDFKLPDNDELHPEWANEDISLRPKVAQGSI 1009
Query: 695 RLLDDQTFEFISTYPLDTFEYGCSILSCSF-----SDDSNVYYCVGTAYVLPEENEPTKG 749
+LL+ +T++ I +Y + E +I + + + + VGT + E+ +G
Sbjct: 1010 KLLNPKTWKVIDSYTFNAAERITAIENINLEISEKTSERKDMIVVGTTFA-KGEDIAARG 1068
Query: 750 RILVFIV--------EDG---KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWML 798
+ VF V E G KL+LI E+ +GA+ +++ G+ + Q + L
Sbjct: 1069 NVYVFDVINVVPDPDEPGTNLKLKLIGEESVRGALTAVSGIGGQGFLIVAQGQKCMVRGL 1128
Query: 799 RDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNA 858
+DDG+ + ++ + ++GD +K + Y E + +D +
Sbjct: 1129 KDDGSLLPVAFIDVQCYVSVIKELKGTGMCLIGDALKGLWFTGYSEEPYKMTLFGKDLDE 1188
Query: 859 NWMSAVEILDD--DIY-LGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFR 915
+ + L D +Y L A+++ NL ++ + E RL ++H+G F
Sbjct: 1189 LEVVTADFLPDGKKLYILVADSDCNLHVLQYDPEDPKSSNGDRLLNRCKFHMGHF----- 1243
Query: 916 HGSLVMRLPDSDVGQ---------------IP--TVIFGTVNGVIGVIASLPHEQYLFLE 958
S + LP + V IP + T +G++ ++ SL E Y L
Sbjct: 1244 -ASTITLLPRTAVSSELAVMNSDSMDIDSYIPLHQALITTQSGLMALVTSLSEESYRRLS 1302
Query: 959 KLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTM 1018
LQ+ L ++ GLN +R+ E V + +DG L+ +LDLSR R EI+ +
Sbjct: 1303 ALQSQLSNTLEHPCGLNPRAYRAV--ESDGVVGRGMIDGKLLMRWLDLSRPRKLEIAGRV 1360
Query: 1019 NVSVEELCKRVEELT 1033
E+ +E ++
Sbjct: 1361 GADEWEIRADLEAVS 1375
>gi|425765419|gb|EKV04111.1| Cleavage and polyadenylation specificity factor subunit A, putative
[Penicillium digitatum Pd1]
gi|425767100|gb|EKV05682.1| Cleavage and polyadenylation specificity factor subunit A, putative
[Penicillium digitatum PHI26]
Length = 1271
Score = 73.9 bits (180), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 92/434 (21%), Positives = 176/434 (40%), Gaps = 64/434 (14%)
Query: 654 IRSIPLGEHPRRICHQEQSRTFAICSLKNQ--SCAEESEMH-----------------FV 694
+R +P+ E + + S T+ + + + E E+H +
Sbjct: 839 LRKVPIEEQVNFLAYSTSSETYVLGTSRQGDFKLPEGDELHPEWRNEELSFCPKIPESSI 898
Query: 695 RLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVY-----YCVGTAYVLPEENEPTKG 749
+++ +T+ I +YPLD E ++ + + N + VGTA + E+ P +G
Sbjct: 899 KVVSPKTWTIIDSYPLDPDEQVTAVKNVNIEVSENTHERMDLIVVGTA-IAKGEDMPARG 957
Query: 750 RILVFIV----------EDG-KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWML 798
I VF V E G KL+LI ++ KGAV +L+ G+ + Q + L
Sbjct: 958 TIYVFDVIKVAPDPERPETGRKLKLIGKETVKGAVTALSGIGGQGFIIVAQGQKCMVRGL 1017
Query: 799 RDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNA 858
++DG+ + ++ + +++GD +K + Y E + +D
Sbjct: 1018 KEDGSLLPVAFMDMQCYVNVVKELKGTGMVILGDAVKGLWFAGYSEEPYRMTLFGKDPEY 1077
Query: 859 NWMSAVEILDD--DIY-LGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFR 915
+ A + L D +Y L A+++ NL ++ + E RL +++ G F
Sbjct: 1078 LEVVAADFLPDGNKLYMLVADSDCNLHVLQYDPEDPKSSNGDRLLSRSKFYTGNF----- 1132
Query: 916 HGSLVMRLPDSDVGQIPT-----------------VIFGTVNGVIGVIASLPHEQYLFLE 958
S V LP + V T V+ + NG + ++ S+ E Y L
Sbjct: 1133 -ASSVTLLPRTAVSSELTESSEEAMDVDETFAKYQVLIASQNGSLALVTSVAEESYRRLS 1191
Query: 959 KLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTM 1018
LQ+ L + GLN +R+ E + +DG+L+ +L++ + R EI+ +
Sbjct: 1192 GLQSQLINTVDHPAGLNARAFRA--TESDGAAGRGMVDGNLLRLWLNMGKQRQAEIAGRV 1249
Query: 1019 NVSVEELCKRVEEL 1032
+ E+ +E +
Sbjct: 1250 GATEWEIKADLETI 1263
>gi|84999522|ref|XP_954482.1| damage-specific DNA binding protein 1 [Theileria annulata]
gi|65305480|emb|CAI73805.1| damage-specific DNA binding protein 1, putative [Theileria annulata]
Length = 1501
Score = 73.9 bits (180), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 62/273 (22%), Positives = 117/273 (42%), Gaps = 39/273 (14%)
Query: 800 DDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNAN 859
D+ T L + + + + T+ D I VGDLM S+ +L K + ++ E RD+N
Sbjct: 1218 DENTPFLDVVASYDSNTFVVSLDTKDDVIFVGDLMTSVKML--KFRDNSLHETCRDFNTL 1275
Query: 860 WMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRF--RHG 917
W +++ +D+ L ++++ N ++K TD++ + + +G +H GE VN+ R
Sbjct: 1276 WTTSLVAVDNSSCLVSDDSGNFMLLKKVKHPVTDQQSIKFDKIGLFHHGEVVNKILKRTE 1335
Query: 918 SLVMRLPDSDVGQIPTVIFGTVNGVI--------GVIASLPHEQYLF------------- 956
+ D+ + + F + VI +SL LF
Sbjct: 1336 MPIHHEADTTITRNNPREFMVSSRVICESETNSPSETSSLNEHNNLFKGFFTCATTSGSL 1395
Query: 957 --------------LEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIES 1002
L L+ ++ V K +G + R+F + + + F+DGDL+ES
Sbjct: 1396 FQVCFFEDLKMFLKLSLLEHTMQLVQKDLGNIPSRNQRNFEDLHSNIPTRGFVDGDLVES 1455
Query: 1003 FLDLSRTRMDEISKTMNVSVEELCKRVEELTRL 1035
FL L + + TM ++ ++L ++ L L
Sbjct: 1456 FLKLPDSLKKWVFDTMLINSKQLGIKLNSLESL 1488
Score = 41.2 bits (95), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 33/146 (22%), Positives = 62/146 (42%), Gaps = 10/146 (6%)
Query: 245 LGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLER---------YVN 295
LG + S++ LD ++ S GDS ++ +N ++ Y + E+ N
Sbjct: 433 LGNFPVPSSLIKLDEEHIFYTSKMGDSSILSINSILNSSNRYEQNSEQSDRNSTEWAQTN 492
Query: 296 LGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL-R 354
LGPI DF + G+ ++ C G GS + G+ ++ G+ ++S+
Sbjct: 493 LGPITDFAYREETSSGETTILACCGMGNSGSFCEIYLGLSSEIIHESDVPGVLNLFSVPM 552
Query: 355 SSTDDPFDTFLVVSFISETRILAMNL 380
S + + L +SF T+ +L
Sbjct: 553 KSLYNSTSSLLCISFFRFTKFYNFSL 578
>gi|164655043|ref|XP_001728653.1| hypothetical protein MGL_4214 [Malassezia globosa CBS 7966]
gi|159102535|gb|EDP41439.1| hypothetical protein MGL_4214 [Malassezia globosa CBS 7966]
Length = 1212
Score = 73.6 bits (179), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 83/378 (21%), Positives = 152/378 (40%), Gaps = 36/378 (9%)
Query: 661 EHPRRICHQEQSRTFAIC-SLKNQSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSI 719
E P R Q+ +RT+A C +L+ E +H +T + P+D + G
Sbjct: 830 EQPVRDPKQDPTRTYAACGALELLVRVGEPPVHGYEFSACETVSALHMAPMDCLDRGSGR 889
Query: 720 LSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDG------------KLQLIAEK 767
+ + Y E+ +KG + VF V + +LQL+ +
Sbjct: 890 RTFVVVGTTVTY----------GEDRSSKGHMYVFDVVECVPSEGMAASDALRLQLLCTE 939
Query: 768 ETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDF 827
E + V +L+ NG L+AA+ QK+ + W + L + + +Q +F
Sbjct: 940 EMRAPVTALHDLNGFLVAAVGQKLLIRSW----EYCEWLVTVAFLDMGMYTTSIQRVKNF 995
Query: 828 IVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG---AENNFNLFTV 884
+++ D +S + ++ + + RDY ++ L D L + N L +
Sbjct: 996 LLLTDYYQSAYFVAFQEDPARLVLLGRDYIPTSVTCGAFLIDRARLSIVTCDMNGCLRLM 1055
Query: 885 RKNSEGATDEERGRLEVVGEYHL-GEFVN-RFRHGSLVMRLPDSDVGQIPTVIFGTVNGV 942
+ T RL EYH GE V R HG L S ++ NG
Sbjct: 1056 DYHPSNPTSLGGQRLLARCEYHAPGEVVRARMLHGPY---LATSGECLTSEIVLAKRNGA 1112
Query: 943 IGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRS-FNNEKKTVDAKNFLDGDLIE 1001
+ V+ + + + L+ Q+ L ++++ GLN +R+ FN AK LDG L+
Sbjct: 1113 VDVLVPVTEKIFPTLQLFQSQLVRMVRHTAGLNPRGFRAVFNQHISRPLAKGILDGTLLH 1172
Query: 1002 SFLDLSRTRMDEISKTMN 1019
+ +SR ++ + + ++
Sbjct: 1173 TAESMSRPKLTSLVRDLS 1190
>gi|170576536|ref|XP_001893668.1| CPSF A subunit region family protein [Brugia malayi]
gi|158600196|gb|EDP37499.1| CPSF A subunit region family protein [Brugia malayi]
Length = 1323
Score = 73.2 bits (178), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 104/468 (22%), Positives = 200/468 (42%), Gaps = 62/468 (13%)
Query: 591 VFAASDRPTVIYSSNKKLLYSNVNLK-EVSHMCPFNSAAFPDSLAIAKE-------GELT 642
VF S P + + + VNL + FN+A P+ E +L
Sbjct: 891 VFICSSYPCIFFLESGVPRLHPVNLDGPILSFTTFNNAVCPNGFIYLTERDRFMRVAKLP 950
Query: 643 IGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTF 702
I D ++ I +G + + S T+A+ + + + + V + DD+TF
Sbjct: 951 SDMILD-ASYPVKRINVGATVHSVVYLLHSNTYAVLTSEKRKVTKMC----VLINDDKTF 1005
Query: 703 EFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGKLQ 762
E DTF Y + + Y V+PE +PT +++
Sbjct: 1006 E--EHEKPDTFVY----------PEMDQY----KLQVVPEPGQPTSKH---------RIK 1040
Query: 763 LIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQ 822
+ +KE KG V SL + NG LL + QK+ + W+ +D+ + + S H +I L +
Sbjct: 1041 TLYDKEQKGPVTSLCSCNGYLLTGMGQKV--FIWLFKDNNLQGI-SFLDMHFYIHQL-IG 1096
Query: 823 TRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMS--AVEILDDDIYLG-----A 875
R + + D+ +S++LL Y+ E A+ +RD ++ A + L D+ +G
Sbjct: 1097 VR-NLALACDMYRSLALLRYQEEYKALSLASRDMRSDVQPPMAAQFLIDNKQMGFIMSDE 1155
Query: 876 ENNFNLFTVRKNSEGATDEERG--RLEVVGEYHLGEFVNRF--RHGSLVMRLPDSDVGQI 931
N +F N T E G +L + E ++G VN F G + ++++ +
Sbjct: 1156 AANIAIF----NYLPETLESLGGEKLTLRAEINIGTVVNSFIRVKGHISSGFVENELFSL 1211
Query: 932 --PTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKT- 988
+V+F +++G G + L + + L LQ + ++ GLN + R+ ++
Sbjct: 1212 ERQSVLFASLDGSFGYLRPLTEKVFRRLHMLQQLMSSMVLQPAGLNAKGARAARPQRPNH 1271
Query: 989 -VDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRL 1035
++ +N +DGD+ +L LS +++++ + S + + E+ R+
Sbjct: 1272 YLNTRNLVDGDVAMQYLHLSLPEKNDLARKLGTSRYHIIDDLIEICRV 1319
Score = 63.5 bits (153), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 92/439 (20%), Positives = 177/439 (40%), Gaps = 94/439 (21%)
Query: 71 IDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEEL-----QVLDIKFLYGCAKPTI 125
+DP R + ++ V+PF++ ++ ++L ++ V+D+ FL G +PT+
Sbjct: 140 VDPGQRCASMLVFGRYLAVLPFNDSSTQLHSYTVQLSQIDSRLVNVVDMVFLDGYYEPTL 199
Query: 126 VVLYQDNKDA--RHVKTYE------VALKDKDFV-EGPWSQNNLDNGADLLIPVPPPLCG 176
+ LY+ + R Y+ V+L K+ V W NL + ++ +P P+ G
Sbjct: 200 LFLYEPVQTTCGRACVRYDTMCVLGVSLNVKEQVLASVWQLTNLPMDCNQILAIPRPVGG 259
Query: 177 VLIIGEETIVYCS-------------ANAFKAIPIRPSITKAYGRVDADGS--------R 215
+L++ ++Y + + F P++ A + DG+ +
Sbjct: 260 ILLVATNELIYLNQSVPPCGISLNSCMDGFTKFPLKDFKHMA---LTLDGAVVTVVSTNK 316
Query: 216 YLLGDHAG-LLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDS--- 271
LL D G L L+++T V L+++ ET I T++ ++IGS DS
Sbjct: 317 ILLCDRNGRLFTLILVTDATNSVKSLELKFQFETVIPCTMTSCAPGYLFIGSRLCDSVFL 376
Query: 272 --------------QLIKLNLQPDAKGS------------------------YVEVLERY 293
+ +KL+ +P+A + VL++
Sbjct: 377 HCIFEQSTLEESATKKMKLSTEPNANEEDEDFELYGEVLPKVAKPDVTEELLNIRVLDKL 436
Query: 294 VNLGPI---------VDFCVVDLERQGQGQVVTCS-GAYKDGSLRIVRNGIGINEQASVE 343
+N+GP V ++ R+ + C+ G K GS+ I++ I S
Sbjct: 437 LNVGPCKKITGGCPSVSAYFQEITRKDPLFDLVCACGHGKFGSICILQRSIRPEIITSSS 496
Query: 344 LQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHD 403
++G+ W++ DD F+ E LA+ +++L E E F + T+ +
Sbjct: 497 IEGVVQYWAVGRREDDTHMYFIAS---RELGTLALETDNDLVELEAPIFSTSESTIAAGE 553
Query: 404 AIYNQL-VQVTSGSVRLVS 421
L VQVT+ S+ +V+
Sbjct: 554 LADGGLAVQVTTSSLVMVA 572
>gi|70954357|ref|XP_746229.1| hypothetical protein [Plasmodium chabaudi chabaudi]
gi|56526771|emb|CAH77136.1| hypothetical protein PC000016.02.0 [Plasmodium chabaudi chabaudi]
Length = 372
Score = 72.4 bits (176), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 76/363 (20%), Positives = 154/363 (42%), Gaps = 45/363 (12%)
Query: 694 VRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKG-RIL 752
+++++ + I L+ E S+ +C + T L N P+ R+
Sbjct: 26 IKIINPINLQVIDKVSLELEEAALSVCACELEALHCLIVGTTTNMTLKNRNVPSASLRVY 85
Query: 753 VFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGH 812
+ + + KL L+ + Y FNG+++ ++ K+++Y G ++L +C +
Sbjct: 86 TYDI-NYKLNLLHITPIEDQPYCFCPFNGRVIVSVGNKLRIYAL-----GKKKLLKKCEY 139
Query: 813 HGHILALY-VQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI 871
A+ ++ GD I D+ +S+ + Y + I + D W++ EILD
Sbjct: 140 KDIPEAIVSIKVSGDRIFASDIRESVLIFFYDSNQNVIRLISDDIIPRWITCSEILDHHT 199
Query: 872 YLGAENNFNLFTVRKNSEGATDEERG-------------------RLEVVGEYHLGEFVN 912
+ A+ ++F +R E A EE G ++E + +H+GE V
Sbjct: 200 IMAADKFDSVFILRVPEE-AKQEEYGIANKCWYGGEVISSSTKNRKMEHIMSFHIGEIVT 258
Query: 913 RFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPH---EQYLFLEKLQTNLRKVIK 969
+ L P S +I+ T+ G IG A +P+ E+ + L+ LR
Sbjct: 259 SLQKVKLS---PASS----ECIIYSTIMGTIG--AFIPYDNKEELELTQHLEIILRTEKH 309
Query: 970 GVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRV 1029
+ G H +RS+ + ++ +DGDL E F L +++ + + +E+ +++
Sbjct: 310 ALCGREHIFFRSYYHP-----VQHVIDGDLCEQFSSLPFDVQRKVASDLEKTPDEILRKL 364
Query: 1030 EEL 1032
E++
Sbjct: 365 EDI 367
>gi|428186188|gb|EKX55039.1| hypothetical protein GUITHDRAFT_160593 [Guillardia theta CCMP2712]
Length = 2290
Score = 72.4 bits (176), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 93/376 (24%), Positives = 149/376 (39%), Gaps = 52/376 (13%)
Query: 690 EMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNV--------YYCVGTAYVLP 741
E + V+L + E + +Y D F G L N + VGT +
Sbjct: 1209 EKYEVQLRAPYSMELLDSYEFD-FANGEKALCLQVVHLKNTRVKDSLLPFVAVGTGFQNG 1267
Query: 742 E-ENEPTKGRILVFIVE--------DGKLQLIAEK--------ETKGAVYSLNAFNGKLL 784
E E GRI VF V +G+ +K + K V +L G LL
Sbjct: 1268 ESETSRATGRIYVFEVTTVVGEEGYEGRTSFKIKKIFTSADIQDIKAPVSALCQLEGYLL 1327
Query: 785 AAIN--------QKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKS 836
A K+ +Y+W+ +L H+ ++T FIV GD+ S
Sbjct: 1328 VAQGPNPGMIGGSKLYVYEWV-----DEKLVGRAFFDAHLYITTLKTVKFFIVFGDIRHS 1382
Query: 837 ISLLIYKHEEGAIEERARDYNANWMSAVE--ILDDDIYLGA---ENNFNLFTVRKNSEGA 891
+ LL ++ + ++ A+D + A E ++ + L A + N +F NS
Sbjct: 1383 VHLLRWREDIRMLQLLAKDALPLSVYAAEFVVMGSNFGLLASDEQKNVQVFVFNPNS--- 1439
Query: 892 TDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPH 951
+ R +L + H+G +N+F L R +G + T++G IG I +P
Sbjct: 1440 PEYRRQQLICRADLHVGSHINKFIRWPLPFR---PTLGVRTAAHYTTLDGGIGAIIPIPE 1496
Query: 952 EQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNE--KKTVDAKNFLDGDLIESFLDLSRT 1009
+ Y L LQ L + GLN WR + K AKNFLDG+L+ +L L
Sbjct: 1497 QSYRRLLALQNLLVTAMPHYAGLNPRSWRLYKPAMCMKRRYAKNFLDGNLLGRYLHLDLA 1556
Query: 1010 RMDEISKTMNVSVEEL 1025
++S +N + E +
Sbjct: 1557 LQMQLSSALNQTREAI 1572
>gi|146094112|ref|XP_001467167.1| putative CPSF-domain protein [Leishmania infantum JPCM5]
gi|134071531|emb|CAM70220.1| putative CPSF-domain protein [Leishmania infantum JPCM5]
Length = 1347
Score = 71.6 bits (174), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 105/474 (22%), Positives = 188/474 (39%), Gaps = 101/474 (21%)
Query: 637 KEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAE-----ESEM 691
++G L IG+++ ++K +PLGE ++ H+ + + S++ E S +
Sbjct: 798 QDGRLHIGSLEALEKTSRTFLPLGETITQL-HEAAAWGGYVASVRKVEGDEVLFLPSSAI 856
Query: 692 HF-------------VRLLDDQTFEFISTYPLDTFEYGCS------ILSCSFSDDSNV-- 730
F V LL+++ F+ T LD E G ++ SD + V
Sbjct: 857 QFPWAADLSLWRSAAVPLLENERCVFLETVRLDGAEGGGVGERGGDLIDVPASDGAAVSE 916
Query: 731 -----YYCVGTAYVLPEENEPTKGRILVFIVED----GKLQLIAEKETKGAVYSLNA--- 778
+G+++ P+E GRI + + +L LIA K+ GA+ A
Sbjct: 917 EEWQHLLLIGSSFTFPDEQRARSGRITWCTLREERQRQRLHLIASKDIGGALQCCAAVPH 976
Query: 779 FNGKLLAAINQKIQLYKWMLRDDGTRELQSECG---HHGHILALYVQTRGDFIVVG-DLM 834
+ G++ +N + LY+W +D T + C ++ LY + ++V D+
Sbjct: 977 YKGRIALGVNGCVCLYQWN-TEDQTFVAEERCRVGLTVTKLIPLYHTSLAASVLVALDVR 1035
Query: 835 KSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAEN--------NFNLFTV-- 884
S + +G ++ RD + I+D + AEN NF V
Sbjct: 1036 HSAFFIEVDTLQGNLKVLCRDADLRG-----IMDGHVGSDAENLCLFDDSLNFTALKVVP 1090
Query: 885 -------RKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSL----VMRLP---DSDVGQ 930
+ A + R EV + HLG+ V R GS +M P S Q
Sbjct: 1091 LPVEARDGDAAAAARATAQYRFEVRAQCHLGDLVTCVRQGSFAATSLMEAPAHCSSAQNQ 1150
Query: 931 I-------PTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWR--- 980
+ P ++F T +G GV+ + YL L L+ +L + + VGGL+H+ +R
Sbjct: 1151 LLLPGIAGPQLVFATAHGGFGVVTPVHAATYLVLRALEASLVRTVPPVGGLSHQAFREVL 1210
Query: 981 -----------------SFNNEK-KTVDAKNFLDGDLIESFLDLSRTRMDEISK 1016
+ E+ + + N +DGD++E F+ LS + + +
Sbjct: 1211 RAGQERGVSYLASKTGCALTRERLRRYEPLNTVDGDVVEQFVLLSPSDKKHVCR 1264
>gi|440294229|gb|ELP87246.1| splicing factor 3B subunit, putative [Entamoeba invadens IP1]
Length = 1168
Score = 71.6 bits (174), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 159/778 (20%), Positives = 313/778 (40%), Gaps = 91/778 (11%)
Query: 308 ERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIK-GMWSLRSSTDDPFDTFLV 366
+++ + + +G ++++++ + E A + L I G++ L+ S D +LV
Sbjct: 431 DKKTANKYILYTGRGPSSCVQVLKHQMKTLEVAEITLPAIAMGVYPLQRSKD--VSEYLV 488
Query: 367 VSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRE 426
V F + T IL ++ EDE++ET+ + ++ VQV + + +S +E
Sbjct: 489 VPFANATSILKIS-EDEVKETDETPVILSSYSVRIGMMPDGTFVQVLKEKIVVYTSKIKE 547
Query: 427 LRNEWKSPPGYSVNVATANASQVLLATGGGH--LVYLEIGDGILTEVKHAQLEYEISCLD 484
+ S G V A + V+ + G ++ + DG L E++ +I +
Sbjct: 548 V-----SVVGDIVCAAIVESFVVVGVSKGDDNVIMLFKYEDGGLHEIESKGSFGKIRSVT 602
Query: 485 INPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGG------EIIPRSVLLCAF 538
I EN + AV D +RI +L N T + L E V
Sbjct: 603 I----ENKEVPKFCAVA-CEDGGIRILTL--FNTETTKALTRVSVISLESAINDVKFGVL 655
Query: 539 EGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITL-RTFSSKNTTH--VFAAS 595
++ L DG L + +GE+ + +G Q + L R +++ + + +
Sbjct: 656 NSKQIIVAGLDDGKLAWCFFDSNSGEIEETNFEFVGVQNVNLGRVYTTSESGEEWMVGYN 715
Query: 596 DRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDI-QKLHI 654
++ +SN ++ + ++ + + F ++ + +A+ L I ++ L
Sbjct: 716 GNGILLSTSNGQIKSTPLDQPNIYGVEGFKASFVDNPIALVSGNILKILVFENTTSSLSG 775
Query: 655 RSIPLGEHPRRICHQEQSR-TFAICSLKNQSCAEESEMH---------FVRLLDDQT-FE 703
+PL PR+I E TF +C+ A + + R+ + T E
Sbjct: 776 VKVPLCMTPRKIVESETVEDTFVLCADNFSKYANDEIIRGDDGEWVSSIYRMDKEATKIE 835
Query: 704 FISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEEN-EPTKGRILVFIVEDGKLQ 762
++ +P + + ++L+ S S +Y VG A + T I V+ V +Q
Sbjct: 836 LVTKFPQNVYALCGTLLTVSDS----IYLVVGLAEKYTSRPIKFTNSSIAVYKVNVDNIQ 891
Query: 763 LIAEKETKGAVYSLNAFNGKLLAAINQKIQLY----KWMLRDDGTRELQSECGHHGHILA 818
+ +T+ V S+ + G++L IN+++++Y K +LR R+L SE H++
Sbjct: 892 YLYTTQTEYPVRSMAVYGGRVLCGINRQLRMYEIGKKQLLRKAELRQLASEISDI-HVIN 950
Query: 819 LYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGA--- 875
+Q G + IS + + + + A D W L Y+G+
Sbjct: 951 NKIQLVG-------ISDGISFVRFNQTKQEFDIYA-DTLPRWTVKSVALTPTSYIGSDKF 1002
Query: 876 -------------ENNFNLFT-VRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVM 921
EN N+F+ + + + + + + E++LG+ F + + +
Sbjct: 1003 GQIFVEGLDKETEENASNIFSSILSGEKTIYNGAKYKAVSLNEFYLGDIATGFVNCCVRI 1062
Query: 922 RLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQY---LFLEKLQTNLRKVIKGVGGLNHEQ 978
P IF + + G+ A +P + F E+L+ ++R + V G +H
Sbjct: 1063 GAPQ---------IFIVSHLLGGISALIPLNGFGEIEFFEQLEMHMRVRHQNVNGRDHIA 1113
Query: 979 WRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRLH 1036
+RS V K DGDL E F L+ EI+ M V E+ K++ +L +
Sbjct: 1114 YRS-----SVVPVKGIFDGDLCELFEKLTEEEKKEIAGEMEREVGEIVKKLHDLRNMR 1166
>gi|395324102|gb|EJF56549.1| hypothetical protein DICSQDRAFT_93527 [Dichomitus squalens LYAD-421
SS1]
Length = 1433
Score = 71.6 bits (174), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 112/511 (21%), Positives = 215/511 (42%), Gaps = 57/511 (11%)
Query: 565 LTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNK---KLLYSNVNLKEVSHM 621
L ++K++S P + + VF DRP+ I S++K ++ S N+
Sbjct: 923 LAEQKRISRLLLPFVTSPAPGQTYSGVFFTGDRPSWIVSTDKGGVRVFPSGHNVVHAFTT 982
Query: 622 CPFNSAAFPDSLAIAKEGELTIGTIDDI---QKLHIRSIPLGE-HPRRICHQEQSRTFAI 677
C + D L ++EG + + DI L RS+P + + S A
Sbjct: 983 CSLWESRG-DFLLYSEEGPSLVEWMPDIILDAHLPARSVPRSRPYSHVVFDASSSLIVAA 1041
Query: 678 CSLKNQSCAEESEMHFVRLLDD----------QTFEFIST---YPLDTFEYGCS-ILSC- 722
S N+ + + + + V D T E IS +D +E+ + +SC
Sbjct: 1042 SSFMNRFASYDEDGNIVWEPDSPNISFPHCETSTLELISPDGWITMDGYEFAANEFVSCV 1101
Query: 723 ------SFSDDSNV--YYCVGTAYVLPEENEPTKGRILVF----IVEDG--------KLQ 762
+ S +S + + VGT + E+ KG + +F +V D +L+
Sbjct: 1102 VSVPLETVSTESGMKDFIAVGTT-INRGEDLAVKGAVYIFEIVEVVPDASLNIKRWWRLK 1160
Query: 763 LIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQ 822
L+ + KG V L NG L++++ QKI + + L + + G + ++
Sbjct: 1161 LLCRDDAKGPVSFLCGMNGYLVSSMGQKIFVRAFDLDERLVGVAFLDVG----VYVTSLR 1216
Query: 823 TRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLF 882
+ +V+GD +KS+ + ++ + + +D + ++ ++ D +L
Sbjct: 1217 AVKNLLVIGDAVKSVWFVAFQEDPYKLVILGKDPHHCCVTRADLFFADGHLSIVTCDEEG 1276
Query: 883 TVRKNSEGATD-EERGRLEVV--GEYHLGEFVNRFRHGSLVMRLPDSDVGQIPT--VIFG 937
VR + D E +G ++ E+H G+ +R LV R P + +IP +I G
Sbjct: 1277 VVRLYAYDPHDPESKGGQHLLRRTEFH-GQ--TEYRSSLLVARRPKAGDPEIPQARLICG 1333
Query: 938 TVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVD-AKNFLD 996
+V+G + + + + L LQ L + ++ V LN + +R NE + +K LD
Sbjct: 1334 SVDGSLTTLTYVDENAFKRLHLLQGQLIRTVQHVAALNPKAFRMVRNEYVSRPLSKGVLD 1393
Query: 997 GDLIESFLDLSRTRMDEISKTMNVSVEELCK 1027
G+L+ +F DL R +E+++ + + K
Sbjct: 1394 GNLLATFEDLPIGRQNEVTRQIGTDRATVLK 1424
>gi|209881602|ref|XP_002142239.1| CPSF A subunit domain-containing protein [Cryptosporidium muris RN66]
gi|209557845|gb|EEA07890.1| CPSF A subunit domain-containing protein [Cryptosporidium muris RN66]
Length = 1509
Score = 71.6 bits (174), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 108/536 (20%), Positives = 204/536 (38%), Gaps = 145/536 (27%)
Query: 641 LTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICS------LKNQSCAEE------ 688
L +ID +Q+ H RS+ L P RIC+ + + +KN +
Sbjct: 971 LRASSIDILQQYHCRSLQLNFTPERICYISNLNVYVVVGIWEKGLMKNSKSENKFLIQNN 1030
Query: 689 ----SEMHFVR------------LLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYY 732
S + FV+ +L +T E S YP EY ++ + + S +
Sbjct: 1031 STLKSSLSFVQALMCVISANNLEILHFKTLES-SVYP-TCLEYIELNINQTPENISESFI 1088
Query: 733 CVGTAYV---------LPEENEPTKGRILVFIVEDGKLQLIAEKET----KGAVYSLNAF 779
+GT V P + P G++ +F + + I ++ET +G Y++ F
Sbjct: 1089 VLGTNKVGHINDYSNKSPNIDLPI-GKLTLFSIRKLHSKFIVDEETSINAEGCPYAITRF 1147
Query: 780 N-GKLLAAINQKIQLYKWMLRDDGTRELQSEC---------------------------- 810
G +L A+ + Y+ + + L+
Sbjct: 1148 QLGTILVAVENSVVTYEIIQYSTSSNYLKDTSILPSSSSSSSSCLLLDTLPENEKTPKFK 1207
Query: 811 -----GHHGHILALYVQTRGD-FIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAV 864
+ H + ++++ D +++VGDLM+S+ L Y EE RD + W+
Sbjct: 1208 FNKINSYSTHTMIVFLKLWKDEYLLVGDLMRSVGLWKYDKVAKQFEEICRDPSLAWVMDG 1267
Query: 865 EILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLP 924
++ ++Y+ ++ N N+ + K DE L+ + +H GE V+ F+ G L+M P
Sbjct: 1268 IFINKNLYMVSDENKNVRILTKPENPINDEMDTVLQSIAHFHSGEIVSTFQKGKLMMPYP 1327
Query: 925 D--SDVGQI----------PTVIFGTVNGVIGVIASLPHEQYLFLE----------KLQT 962
++ I ++FGT G + VI SL + +FL+ ++
Sbjct: 1328 RCLQEINNIGLDFCHSLYSEQIVFGTAQGSMSVIFSLNADYKMFLQLIMFEEAIIAAIKK 1387
Query: 963 NLRKVIKG--------------------------VGGLNHE-----QW------RSFNNE 985
+LR + K +G + E QW + +N E
Sbjct: 1388 SLRTIYKDCKVCNLSIKQIKQLYIKESSSSFSGVIGTIGGEDVTPFQWNLSKNYQDYNYE 1447
Query: 986 KKTVDA-KNFLDGDLIESFLDLSRTRMDEISKTM------NVSVEELCKRVEELTR 1034
+ A + F+ GD+IE FL S + + + + N+++ E K++E + +
Sbjct: 1448 YECSGASRGFISGDIIELFLKFSNELQEMVIQELKAIQIDNITIPETSKQLEHIIQ 1503
Score = 56.6 bits (135), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 88/406 (21%), Positives = 169/406 (41%), Gaps = 78/406 (19%)
Query: 3 DVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQW---------DAESSELITRAMG 53
++ Y I + +RP G+ D + I TE+Y+ +++ +SS L +
Sbjct: 72 EISTYKNIIQIAPWRPKGQEFDDILIYTEQYELILIRAYKLQNIEDNSTDSSVLYLNVIQ 131
Query: 54 DVSDRIGRPT---DNGQIGIIDPDCRLIGLHLYDGLFKVIPFDNKGQLKEA-----FNIR 105
S + RP N IID + I L +Y+G +V+ LK + +R
Sbjct: 132 --SKDLFRPNLRKSNNIKMIIDSYIQRIILLIYEGCLQVVDCYTDLSLKMSVFTPPIMLR 189
Query: 106 LEELQVLDIKFLYGCAKPTIV-VLYQD--NKDARHVKTYEVALKDKDF-------VEGPW 155
L EL +LD+ + + ++ +LY ++D R +K ++ K + ++ P
Sbjct: 190 LSELNILDLCLINSSSNKCLLGILYDSGYSEDPRLLKLIDLPSDLKSWTYNMRIGMKIPS 249
Query: 156 SQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS------------------------AN 191
+ + N D P G+L+ G+ ++ Y + +N
Sbjct: 250 NIMKIVNYEDF--NNDPLSNGLLLFGDGSVFYLTIGEILYNNNENLISQPLFIQCNFPSN 307
Query: 192 AFKAIP---IRPSITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKE-KVTGLKIELLGE 247
+ + P+I A +D R+L+ D+ G L ++V+ ++ K+ +K+++LG+
Sbjct: 308 YLEKVESGLTLPNIVDAIP-IDGKLKRWLVLDNTGGLFIMVVYKDQNLKIENIKVDILGQ 366
Query: 248 TSIASTISYLDNAVVYIGSSYGDSQLIKL------------NLQPDAKGSYV---EVLER 292
S S + L N + +IGS DS ++ + N+ P ++ E E
Sbjct: 367 YSPFSVLLSLKNDIYFIGSKICDSLVLYIYHNKPLIMQIIHNIGPIRDLLFIRDNEYRED 426
Query: 293 YVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINE 338
VN P+ + + + +V G GS +I+ NGIG+N+
Sbjct: 427 IVNENPLSN---TQISKNLSLPLVAACGFGNSGSFKIICNGIGLNQ 469
>gi|312069702|ref|XP_003137805.1| hypothetical protein LOAG_02219 [Loa loa]
Length = 1065
Score = 71.2 bits (173), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 71/310 (22%), Positives = 146/310 (47%), Gaps = 33/310 (10%)
Query: 741 PEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRD 800
PE +PT +++ + +KE KG V SL + NG LL + QK+ + W+ +D
Sbjct: 770 PEPGQPTSKH---------RIKTLYDKEQKGPVTSLCSCNGYLLTGMGQKV--FIWLFKD 818
Query: 801 DGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANW 860
+ + + S H ++ L + R + + D+ +S++LL Y+ E A+ +RD ++
Sbjct: 819 NNLQGI-SFLDMHFYVHQL-IGVR-NLALACDMYRSVALLRYQEEYKALSLASRDMRSDV 875
Query: 861 ---MSAVEILDDD----IYLGAENNFNLFTVRKNSEGATDEERG--RLEVVGEYHLGEFV 911
M+A I+D+ + N +F N T E G +L + E ++G V
Sbjct: 876 QPPMAAQFIIDNKQMGFVMSDEAANIAIF----NYLPETLESLGGEKLTLRAEINIGTVV 931
Query: 912 NRF--RHGSLVMRLPDSDVGQI--PTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKV 967
N F G + ++++ + +V+F +++G G + L + + L LQ + +
Sbjct: 932 NSFIRVKGHISSGFVENELFSLERQSVLFASLDGSFGFLRPLTEKVFRRLHMLQQLMSSM 991
Query: 968 IKGVGGLNHEQWRSFNNEKKT--VDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEEL 1025
+ GLN + R+ + ++ +N +DGD++ +L LS +++++ + S +
Sbjct: 992 VPQPAGLNAKGARAARPPRPNHYLNTRNLVDGDMVMQYLHLSLPEKNDLARKLGTSRYHI 1051
Query: 1026 CKRVEELTRL 1035
+ E+ R+
Sbjct: 1052 IDDLIEICRV 1061
>gi|398019848|ref|XP_003863088.1| CPSF-domain protein, putative [Leishmania donovani]
gi|322501319|emb|CBZ36398.1| CPSF-domain protein, putative [Leishmania donovani]
Length = 1347
Score = 71.2 bits (173), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 105/474 (22%), Positives = 187/474 (39%), Gaps = 101/474 (21%)
Query: 637 KEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAE-----ESEM 691
++G L IG+++ ++K +PLGE ++ H+ + + S++ E S +
Sbjct: 798 QDGRLHIGSLEALEKTSRTFLPLGETITQL-HEAAAWGGYVASVRKVEGDEVLFLPSSAI 856
Query: 692 HF-------------VRLLDDQTFEFISTYPLDTFEYGCSI------LSCSFSDDSNV-- 730
F V LL+++ F+ T LD E G + SD + V
Sbjct: 857 QFPWAADLSLWRSAAVPLLENERCVFLETVRLDGAEGGGVGERGGDPIDVPASDGAGVSE 916
Query: 731 -----YYCVGTAYVLPEENEPTKGRILVFIVED----GKLQLIAEKETKGAVYSLNA--- 778
+G+++ P+E GRI + + +L LIA K+ GA+ A
Sbjct: 917 EEWQHLLLIGSSFTFPDEQRARSGRITWCTLREERQRQRLHLIASKDIGGALQCCAAVPH 976
Query: 779 FNGKLLAAINQKIQLYKWMLRDDGTRELQSECG---HHGHILALYVQTRGDFIVVG-DLM 834
+ G++ +N + LY+W +D T + C ++ LY + ++V D+
Sbjct: 977 YKGRIALGVNGCVCLYQWN-TEDQTFVAEERCRVGLTVTKLIPLYHTSLAASVLVALDVR 1035
Query: 835 KSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAEN--------NFNLFTV-- 884
S + +G ++ RD + I+D + AEN NF V
Sbjct: 1036 HSAFFIEVDTLQGNLKVLCRDADLRG-----IMDGHVGSDAENLCLFDDSLNFTALKVVP 1090
Query: 885 -------RKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSL----VMRLP---DSDVGQ 930
+ A + R EV + HLG+ V R GS +M P S Q
Sbjct: 1091 LPVEARDGDAAAAARATAQYRFEVRAQCHLGDLVTCVRQGSFAATSLMEAPAHCSSAQNQ 1150
Query: 931 I-------PTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWR--- 980
+ P ++F T +G GV+ + YL L L+ +L + + VGGL+H+ +R
Sbjct: 1151 LLLPGIAGPQLVFATAHGGFGVVTPVHAATYLVLRALEASLVRTVPPVGGLSHQAFREVL 1210
Query: 981 -----------------SFNNEK-KTVDAKNFLDGDLIESFLDLSRTRMDEISK 1016
+ E+ + + N +DGD++E F+ LS + + +
Sbjct: 1211 RAGQERGVSYLASKTGCALTRERLRRYEPLNTVDGDVVEQFVLLSPSDKKHVCR 1264
>gi|384253955|gb|EIE27429.1| hypothetical protein COCSUDRAFT_64224 [Coccomyxa subellipsoidea
C-169]
Length = 1137
Score = 70.9 bits (172), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 120/527 (22%), Positives = 210/527 (39%), Gaps = 104/527 (19%)
Query: 591 VFAASDRPTVIYSSNKKLLYSNVNLK-EVSHMCPFNSAAFPDSLAIA----KEGE----- 640
+F + +RP + +S L+ ++++ VS M PF++ P A +GE
Sbjct: 623 MFVSGERPLWLVASRGTLVAHAMDVEGRVSGMTPFHNINCPLGFITACMAENDGETLKIC 682
Query: 641 -LTIGTIDD----IQKLHIRSIPLGEHPRRICHQEQSRTFAICSLK------NQSCAEES 689
L + T D +QK+ +R+ P R+ + ++R + + + +Q A +
Sbjct: 683 QLPMRTRLDTPWPLQKIAVRATP-----HRLAYYAEARLYVLLVSRPVPYREHQEEASDG 737
Query: 690 EMHF---------------------VRLLDDQTFEFISTYPLDTFEYGCSI----LSCSF 724
+ H VRLL+ ++ ++ + LD E CS+ L +
Sbjct: 738 DPHASYSYICADAAAKASGTELGGEVRLLEPGRYQTVARHALDPGEEPCSVAADWLRNAQ 797
Query: 725 SDDSNVYYCVGTAYVLPEENEPTKGRILVFIVE--------------DGKLQLIAEKETK 770
+ Y VGTA E+ P GRIL+F +L L+
Sbjct: 798 TGALEPYITVGTALNY-GEDYPCSGRILLFKATRTSTSGAEQADPTISWQLTLVHASGFS 856
Query: 771 GAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVV 830
V L +G+L+AA+ +Q+ + LR L H + V T FI++
Sbjct: 857 RPVQGLAVMDGRLVAAVGNNMQVME--LRGS---SLHMISFFHAQLFITSVATIKTFILL 911
Query: 831 GDLMKSISLLIYKHEEG---AIEERARDYNANWMSAVEILDDDIYL-----GAENNFNLF 882
GD+ K ++ +Y ++ A+ + ++DYN + A E L + L A N LF
Sbjct: 912 GDVHKGLT-FVYADKKANYTALTQLSKDYNDVDVEAAEFLVNGKKLFLLACDAAQNLRLF 970
Query: 883 TVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRL-PDSDVG-QIPTVIFGTVN 940
E + +L +G H+G+ + SL R+ P S G Q+ +FG+
Sbjct: 971 AYDGGKEQQATWQGKKLLPLGAIHVGQNIC----SSLSHRITPASATGVQLRAAVFGSAA 1026
Query: 941 GVIGVIA----SLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSF--NNEKKTVDAKNF 994
G I +A LP E+ L LQ + + V GLN +R + K ++F
Sbjct: 1027 GSIASLAPTWDGLPAEELL---ALQREMVLAVPQVAGLNPVSFRRRYKHGVKALAGGQSF 1083
Query: 995 ---------LDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
LD D + F L T ++ N+S +++ + E+
Sbjct: 1084 EAPVSDDRVLDLDQLNRFQWLPLTEQVALAAKCNLSRQQVLHALREM 1130
>gi|213408090|ref|XP_002174816.1| predicted protein [Schizosaccharomyces japonicus yFS275]
gi|212002863|gb|EEB08523.1| predicted protein [Schizosaccharomyces japonicus yFS275]
Length = 1053
Score = 70.9 bits (172), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 165/773 (21%), Positives = 310/773 (40%), Gaps = 81/773 (10%)
Query: 4 VPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSEL-ITRAMGDVSDRIGRP 62
+P++ L +RP QD+L I T+ + + WD ++ + + + + P
Sbjct: 53 IPLHANALHLLAYRPLDLEQDYLLIVTQDGYYFSVYWDDNFGKVALDSPLQMCTTQPSYP 112
Query: 63 -TDNGQIGIIDPDCRLIGLHLYDGLFKVIP---FDNKGQLKEAFNI------RLEELQVL 112
T +++ + +L+ + + KV+ F + + ++ F+I L V+
Sbjct: 113 NTSWRSFCLVNQEAQLVLIRTSE--MKVLCLSIFSEEQRKQQGFSIGDGAELLLPGYLVM 170
Query: 113 DIKFLYGCAKPTIVVLYQDNKDARHVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPP 172
D+ FL P + VL Q D+ V +++ L+ F G N + L+ V
Sbjct: 171 DMCFLQHGNPPQVAVLSQ-TVDSTKVTVHKLNLQSDSFDSGSVVLKN--SLYYKLMAVGK 227
Query: 173 PLCGVLIIGEETIVYCSANAFKAIPIR-PSITKA-YGRVDADGSRYLLGDHAGLLHLLVI 230
G++ +GE+ I Y N I IR PS+ + Y + +++L G ++ L
Sbjct: 228 K--GLIALGEDNISY--FNRGTVISIRAPSVRFSHYCALRGLKRQFVLVSAEGNVYTLSC 283
Query: 231 THEKEKVTGLKIELLGETSI--ASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVE 288
K+ L +E L + + S++ +D ++ + G ++ L SY
Sbjct: 284 NPLFHKLYTLMVEELPFSPVFNVSSVMAIDEERLFASNINGSIHILSL-------PSYT- 335
Query: 289 VLERYVNLGPIVDFCVVDLERQGQGQ-VVTCSGAYKDGSLRIVRNGIGINEQASVELQGI 347
V E G I D + G G ++ C+G+ GSL + N + + +L G+
Sbjct: 336 VSEITAGHGAIHD---MQFFESGSGNTLLLCAGSLNSGSLIYLCNSLSAEPISLFDLSGV 392
Query: 348 KGMWSLRSSTDDPFDTFL--VVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAI 405
+W + F + L V +ET++ M L E E + S ++ A
Sbjct: 393 LHVWPVH------FQSVLKLFVGMANETKVFTMV---SLYEAEFDPELSLDESTLALSAC 443
Query: 406 YNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGD 465
N VQ+T+ S+R + TS++ P +V++ A ++ ++Y
Sbjct: 444 ENGYVQITNKSLRYCTETSKD-----AIPLEGITSVSSLGAYAAIVIDNTSLVIY----- 493
Query: 466 GILTEVKHAQLEYEISCLDI-NPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHL 524
+ E+ E+S +D +P+ VG W ++ + ++ D + I
Sbjct: 494 NMTKEIARTTFSKEVSAVDFASPLR--------IVVGFWNK-TMEMLAIRDNSFICVLRK 544
Query: 525 GGEIIPRSVLLCAFEGISYLLCALG--DGHLLNFLLNMKTGELTDRKKVSLGTQPITLRT 582
IPR VLL F L +G +G+L + ++ +T+ LG+ PI
Sbjct: 545 SLNFIPRDVLL-QFSNDDRLHLFMGSDEGNLC--VATVEKSLITNMDYRVLGSTPIHFTR 601
Query: 583 FSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELT 642
+V +D+P ++Y + KL + +N+ + F L A ++
Sbjct: 602 LLISTGAYVICTNDKPHMVYGLHGKLAFMPLNVPSSIDLNAFLDEKLRFCLVSAHSEGVS 661
Query: 643 IGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMH--FVRLLDDQ 700
I +D+ KL + + P++ +Q+ F+ + E S + + DQ
Sbjct: 662 ILHLDNQPKLSVTRLKTKLLPQKAAIIDQTVLFSGLKQQPNGLGELSAQYSLVCSKVYDQ 721
Query: 701 TFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILV 753
FE S + LD FE S+++ S S + V TA P GR+LV
Sbjct: 722 -FELCS-FKLDEFEECTSVIASSSS-----LFVVTTAKKYPYNYFSVSGRLLV 767
>gi|441648592|ref|XP_004093268.1| PREDICTED: LOW QUALITY PROTEIN: cleavage and polyadenylation
specificity factor subunit 1 [Nomascus leucogenys]
Length = 1177
Score = 70.9 bits (172), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 59/220 (26%), Positives = 106/220 (48%), Gaps = 12/220 (5%)
Query: 814 GHILALYVQTR--GDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI 871
H +A +V+++ +FI+ D+MKSISLL Y+ E + +RD + +V+ + D+
Sbjct: 940 AHYVAYHVESKVCPNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNA 999
Query: 872 YLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFR----HGSLVMRLP 924
LG ++ + NL E RL ++H+G VN F G+
Sbjct: 1000 QLGFLVSDRDRNLMVYMYLPEAKESFGGMRLLRRADFHVGAHVNTFWRTPCRGATEGLSK 1059
Query: 925 DSDVGQIPTVI-FGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFN 983
S V + + F T++G IG++ + + Y L LQ L ++ GLN +R +
Sbjct: 1060 KSVVWENKHITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLH 1119
Query: 984 NEKKTVD--AKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
+++T+ +N LDG+L+ +L LS E++K + +
Sbjct: 1120 VDRRTLQNAVRNVLDGELLNRYLYLSTMERSELAKKIGTT 1159
Score = 50.8 bits (120), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 101/489 (20%), Positives = 206/489 (42%), Gaps = 79/489 (16%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAM-----GDVSDRIG 60
+G + ++ + G +D L ++ + K V+++D + +L T ++ ++ D
Sbjct: 74 FFGNVMSMASVQLAGAKRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPELRDGFV 133
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPF-------DNKGQLKEA----------FN 103
+ ++ + DPD R + +Y V+PF +++G + E +
Sbjct: 134 QNVHTPRVRV-DPDGRCAAMLVYGTRLVVLPFRRESLAEEHEGLVGEGQRSSFLPSYIID 192
Query: 104 IR-LEE--LQVLDIKFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDFVEG-- 153
+R L+E L ++D++FL+G +PT+++L++ N+ A T + + +
Sbjct: 193 VRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKVH 252
Query: 154 P--WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAFKAIPI 198
P WS +L + VP P+ GV++ +++Y + A P+
Sbjct: 253 PVIWSLTSLPFDCTQALAVPKPIGGVVVFAVNSLLYLNQSVPPYGVALNSLTTGTTAFPL 312
Query: 199 RPSITKAYGRVDADGS--------RYLLGDHAGLLHLL-VITHEKEKVTGLKIELLGETS 249
R T+ R+ D + + ++ G +++L +IT V + +
Sbjct: 313 R---TQEGVRITLDCAQATFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASV 369
Query: 250 IASTISYLDNAVVYIGSSYGDSQLIKL--NLQPDAKGSYVEVL--ERYVNLGPIVDFCVV 305
+ +++ ++ +++GS G+S L+K LQ + E E + VD V
Sbjct: 370 LTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASAVREAADKEEPPSKKKRVD-ATV 428
Query: 306 DLERQGQG----QVVTC---SGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTD 358
+GQ QVVT G Y + V + E+ + + +G + S + D
Sbjct: 429 GWSGEGQKSIRPQVVTTFELPGCY---DMWTVIAPVRKEEEDNPKGEGTEQEPSTPEADD 485
Query: 359 D-PFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQ-LVQVTSGS 416
D FL++S T IL E+ E + GF +Q T+F + N+ +VQV+
Sbjct: 486 DCRRHGFLILSREDSTMILQTG--QEIMELDTSGFATQGPTVFAGNIGDNRYIVQVSPLG 543
Query: 417 VRLVSSTSR 425
+RL+ ++
Sbjct: 544 IRLLEGVNQ 552
>gi|315045910|ref|XP_003172330.1| serine/threonine protein kinase [Arthroderma gypseum CBS 118893]
gi|311342716|gb|EFR01919.1| serine/threonine protein kinase [Arthroderma gypseum CBS 118893]
Length = 1397
Score = 70.5 bits (171), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 99/428 (23%), Positives = 177/428 (41%), Gaps = 52/428 (12%)
Query: 655 RSIPLGEHPRRICHQEQSRTFAICSLKNQ--SCAEESEMHF-----------------VR 695
R IPLGE I + S ++ I + + E+ E H V+
Sbjct: 965 RKIPLGEQVDCIVYSSASESYVIGTSTKEDFKLPEDDESHTEWRNEFITFLPQLDRGTVK 1024
Query: 696 LLDDQTFEFISTYPLDTFE-YGC-SILSCSFSDDSNV---YYCVGTAYVLPEENEPTKGR 750
LL+ + + I Y ++ E C I+ S+ ++ VG+A E+ P KG
Sbjct: 1025 LLEPKNWSAIDIYEVEPAERITCIKIIRLEISETTHERKDMVVVGSAVAKGEDIVP-KGC 1083
Query: 751 ILVFIVED-----------GKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLR 799
I VF + D KL+L A +E KGAV +++ G+ + Q + L+
Sbjct: 1084 IRVFEIIDVVPDPDHPEKNKKLKLFAREEVKGAVTAVSGIGGQGFLIVAQGQKCMVRGLK 1143
Query: 800 DDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNAN 859
+DG+ + ++ L ++GD K + Y E ++ ++
Sbjct: 1144 EDGSLLPIAFKDTQCYVNVLKELKGTGMCIIGDAFKGLWFTGYSEEPYKLDLFGKENENL 1203
Query: 860 WMSAVEILDD--DIY-LGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFR- 915
+ + L D +Y L A+++ NL ++ + E + + RL +H G F +
Sbjct: 1204 AVVDADFLPDGNKLYILVADDDCNLHVLQYDPEDPSSSKGDRLLHRSVFHTGHFASTMTL 1263
Query: 916 --HGSLVMRLP------DSDVGQIPT---VIFGTVNGVIGVIASLPHEQYLFLEKLQTNL 964
HGS + P D+D+ P+ V+ G IGVI+ L + Y L LQ+ L
Sbjct: 1264 LPHGSHTLSSPVDEDAMDTDLPPPPSKYQVLITFQTGSIGVISPLNEDSYRRLLALQSQL 1323
Query: 965 RKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEE 1024
++ GLN +R+ ++ + +DG+L+ +LD+ R EI+ + V
Sbjct: 1324 VNALEHPCGLNPRGYRAVESDGMG-GQRGMIDGNLLLRWLDMGAQRKAEIAGRVGADVGA 1382
Query: 1025 LCKRVEEL 1032
+ +E+L
Sbjct: 1383 IRIDLEKL 1390
>gi|68071595|ref|XP_677711.1| hypothetical protein [Plasmodium berghei strain ANKA]
gi|56497932|emb|CAI04454.1| conserved hypothetical protein [Plasmodium berghei]
Length = 493
Score = 70.5 bits (171), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 80/318 (25%), Positives = 132/318 (41%), Gaps = 49/318 (15%)
Query: 641 LTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQ 700
L I +++I + +I+ IP +I ++S L SC E E H Q
Sbjct: 53 LYISHLNEINETYIQKIPFYRTVEKIAFHKES------GLLITSCPPE-EKHKTNKNLKQ 105
Query: 701 TFEFISTYPLDTFEYGCSILS-------CSFSDDSNVY---------YCVGTAYVLPEEN 744
F + Y ++F+Y I S C + + ++Y CVGTA + +
Sbjct: 106 IICFFNPYQ-NSFKYSYIIPSKYNVSSICIYQINKDIYPNKSNINTLICVGTANINDRVS 164
Query: 745 EPTKGRILVFIVE--DGKLQL---IAEKETKGAVYSLNAFNGKLLAAINQK--------- 790
EP+ G I +F + D ++ G + L F KL++ IN
Sbjct: 165 EPSSGNIYIFFAKKKDNLFEIKHIYTHNVNVGGITHLKQFYDKLISTINNTVVILDISDF 224
Query: 791 -IQLYKWM--------LRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLI 841
I L K++ L +DGT + I++L V ++IVVGD+M S+++L
Sbjct: 225 LINLDKYVDNTNKPIKLENDGTIVDVASFTPSSWIMSLDVIE--NYIVVGDIMTSVTILS 282
Query: 842 YKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEV 901
Y + E RDY+ W + V L +L ++ N +K+S DE+ +L
Sbjct: 283 YDFNNSTLTEVCRDYSNVWCTFVCALSKSHFLVSDMESNFLVFQKSSIRYNDEDSFKLSR 342
Query: 902 VGEYHLGEFVNRFRHGSL 919
V ++ G VN+ SL
Sbjct: 343 VAFFNHGHVVNKMLPVSL 360
>gi|402590016|gb|EJW83947.1| hypothetical protein WUBG_05142 [Wuchereria bancrofti]
Length = 374
Score = 70.5 bits (171), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 72/310 (23%), Positives = 146/310 (47%), Gaps = 33/310 (10%)
Query: 741 PEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRD 800
PE +PT +++ + +KE KG V SL + NG LL + QK+ + W+ +D
Sbjct: 79 PEPGQPTSKH---------RIKTLYDKEQKGPVTSLCSCNGYLLTGMGQKV--FIWLFKD 127
Query: 801 DGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANW 860
+ + + S H +I L + R + + D+ +S++LL Y+ E A+ +RD ++
Sbjct: 128 NNLQGI-SFLDMHFYIHQL-IGVR-NLALACDMYRSLALLRYQEEYKALSLASRDMRSDV 184
Query: 861 MS--AVEILDDDIYLG-----AENNFNLFTVRKNSEGATDEERG--RLEVVGEYHLGEFV 911
A + L D+ +G N +F N T E G +L + E ++G V
Sbjct: 185 QPPMAAQFLIDNKQMGFIMSDEAANIAIF----NYLPETLESLGGEKLTLRAEINIGTVV 240
Query: 912 NRFRH--GSLVMRLPDSDVGQI--PTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKV 967
N F G + ++++ + +V+F +++G G + L + + L LQ + +
Sbjct: 241 NSFIRVKGHISSGFVENELFSLERQSVLFASLDGSFGYLRPLTEKVFRRLHMLQQLMSSM 300
Query: 968 IKGVGGLNHEQWRSFNNEKKT--VDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEEL 1025
+ GLN + R+ ++ ++ +N +DGD++ +L LS +++++ + S +
Sbjct: 301 VLQPAGLNAKGARAARPQRPNHYLNTRNLVDGDVVMQYLHLSLPEKNDLARKLGTSRYHI 360
Query: 1026 CKRVEELTRL 1035
+ E+ R+
Sbjct: 361 IDDLNEICRV 370
>gi|403373826|gb|EJY86839.1| hypothetical protein OXYTRI_09860 [Oxytricha trifallax]
Length = 1380
Score = 70.5 bits (171), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 86/379 (22%), Positives = 154/379 (40%), Gaps = 58/379 (15%)
Query: 19 HGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDNGQIGIIDPDCRLI 78
+ + QD++F+ T + C+L ++ + G++SD G+ + + D R I
Sbjct: 112 NKKVQDYIFVLTADFN-CLLFAHDTKQGIVCLSTGNISDVTGQKREPPYSVFLGSDGRYI 170
Query: 79 GLHLYDGLFKVIPF----DNKGQLKEAFNIRL---EELQVLDIKFLYGCAKPTIVVLYQD 131
GL LY+ + K+IP + K L AFN+R+ E Q++ + I V YQ
Sbjct: 171 GLMLYENIIKIIPLVKGQNGKLALSNAFNVRIKHPEAHQIIPLLDDEQNGHSAIAVFYQQ 230
Query: 132 NKDAR----------HVKTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIG 181
+ +K Y + + +KD +E ++ LD +I V G I
Sbjct: 231 QPNRTINPNNTYFQLKLKKYYLNILEKD-IEPAQNEQQLDFENMHIIRVKELEFGGFIAF 289
Query: 182 EETIVYCSANAFKA--------IPIRPSITKAYGRVDA---------DGSRYLLGDHAGL 224
+ ++C + P++ + A + D RYL G
Sbjct: 290 DPENIHCYKKRINSKIISKKLRKPMKVTSITAIDQYDPLTKRTKQGLQLMRYLFATECGD 349
Query: 225 LHLLVITHE-------------KEKVTGLKIELLGET-SIASTISYLDNAVVYIGSSYGD 270
L+LL E E + + IE L + +T++YLDN + + GS D
Sbjct: 350 LYLLAFNLEYLHLVTSIGNINPHEATSFMMIEFLAQKLPSCTTLNYLDNGLFFYGSKLAD 409
Query: 271 SQLIKLNLQPDAKGS----YVEVLERYVNLGPIVDFCVVDLE--RQGQGQVVTCSGAYKD 324
S L L L+ + G Y+ + +RY +LG I D + + + R Q +V G +
Sbjct: 410 S-LTAL-LETENTGHKDYLYISIRKRYESLGCISDLAIREQQNIRGQQKDLVVVGGQGHN 467
Query: 325 GSLRIVRNGIGINEQASVE 343
+ ++R G+ + A+ +
Sbjct: 468 SHISVLRKGVSVKTLAATD 486
Score = 64.3 bits (155), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 71/360 (19%), Positives = 151/360 (41%), Gaps = 29/360 (8%)
Query: 590 HVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIA---------KEGE 640
H + D P + S+K+ +YS H N ++ I + +
Sbjct: 817 HKLKSDDGPIIYVQSSKQAIYSLTRQTLQMHFINENDIKQARTMMIEDKCIFVYENTQNQ 876
Query: 641 LTIGTIDDIQKLHIRSIPLGEHPRRICH-QEQSRTFAICSLKNQSCAE--ESEMHFVRLL 697
+ G +D+ ++ I + +I ++ A+ + N + ++ + LL
Sbjct: 877 IIFGQLDNFNSTQLKQIKFDKEVCKISLVPNENLMIALMEVPNYEYLDLNSPAVNQIALL 936
Query: 698 DDQTFEFISTYPLDT---------FEYGCSILSCSFSDDSNVYYCVGTAYVLPEENEPTK 748
D T E + + L FE S + S + S Y +GTA + +E P+K
Sbjct: 937 DSSTMEMLELFSLQKEEIASCLYLFEKKPSSATLSIRNQSR-YVAIGTAIIKTDEYLPSK 995
Query: 749 GRILVFIVEDGKLQLIAEK--ETKGAVYSLNAF--NGKLLAAINQKIQLYKWMLRDDGTR 804
GR+++F ++ + +LI + +T G+V S+ + L+ +N +QL+ ++ +
Sbjct: 996 GRLMIFEIDTRQRKLIIKHSFQTNGSVQSIAMLRDDKYLVLGVNNDLQLFSMQIKPQEIK 1055
Query: 805 ELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEE--GAIEERARDYNANWMS 862
+ G + + + +VVGD+M+++ +K ++ + E A ++ W++
Sbjct: 1056 LQLLQTLQVGTAIQ-QIFAVNNTVVVGDIMRALYTFDFKEQKQNKLLNESAHTNHSIWVN 1114
Query: 863 AVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMR 922
+ L + + + N+F KN D E+ +L +V +GE + GSL ++
Sbjct: 1115 TMLPLSESNTMVFDKECNMFVFNKNLLPTNDHEKFKLNMVACMKIGEEATKAIFGSLAIQ 1174
>gi|121797760|sp|Q2TZ19.1|CFT1_ASPOR RecName: Full=Protein cft1; AltName: Full=Cleavage factor two protein
1
gi|83775384|dbj|BAE65504.1| unnamed protein product [Aspergillus oryzae RIB40]
Length = 1393
Score = 70.5 bits (171), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 76/331 (22%), Positives = 139/331 (41%), Gaps = 40/331 (12%)
Query: 734 VGTAYVLPEENEPTKGRILVFIV-----------EDGKLQLIAEKETKGAVYSLNAFNGK 782
VGTA+ E+ ++G + VF V D KL+L+ ++ KGAV +L+ G+
Sbjct: 1065 VGTAFARGEDIA-SRGCVYVFEVIKVVPDPKRPEMDRKLRLVGKEPVKGAVTALSEIGGQ 1123
Query: 783 LLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIY 842
+ Q + L++DG+ + H+ + ++ D +K + Y
Sbjct: 1124 GFLIVAQGQKCIVRGLKEDGSLLPVAFMDVQCHVSVVKELKGTGMCIIADAVKGLWFAGY 1183
Query: 843 KHEEGAIEERARDYNANWMSAVEILDDD---IYLGAENNFNLFTVRKNSEGATDEERGRL 899
E + A+D + + A + L D L A+++ NL ++ + E RL
Sbjct: 1184 SEEPYKMSLFAKDLDYLEVLAADFLPDGNKLFILVADSDCNLHVLQYDPEDPKSSNGDRL 1243
Query: 900 EVVGEYHLGEFVNRFRHGSLVMRLPDSDVG---------------QIP--TVIFGTVNGV 942
++H G F+ S + LP + V +IP ++ + NG
Sbjct: 1244 LSRSKFHTGNFI------STLTLLPRTSVSSEQMISDVDAMDVDIKIPRHQMLITSQNGS 1297
Query: 943 IGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIES 1002
+G++ + E Y L LQ+ L I+ GLN +R+ E + LDG L+
Sbjct: 1298 VGLVTCVSEESYRRLSALQSQLTNTIEHPCGLNPRAFRAV--ESDGTAGRGMLDGKLLFQ 1355
Query: 1003 FLDLSRTRMDEISKTMNVSVEELCKRVEELT 1033
+LD+S+ R EI+ + + E+ E ++
Sbjct: 1356 WLDMSKQRKVEIASRVGANEWEIKADFEAIS 1386
>gi|125526594|gb|EAY74708.1| hypothetical protein OsI_02601 [Oryza sativa Indica Group]
Length = 314
Score = 70.5 bits (171), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 33/65 (50%), Positives = 45/65 (69%), Gaps = 7/65 (10%)
Query: 846 EGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEY 905
+G +E +RDYN WM+AVE+LDD +Y+GA+N +NLFTV K GRL V+G+Y
Sbjct: 153 KGQTDELSRDYNITWMTAVEMLDDYVYIGADNCYNLFTVLK-------RRVGRLLVIGQY 205
Query: 906 HLGEF 910
HLG+
Sbjct: 206 HLGDL 210
Score = 53.5 bits (127), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 30/72 (41%), Positives = 42/72 (58%), Gaps = 5/72 (6%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQ-----DFLFIATERYKFCVLQWDAESSELITRAMGDV 55
MLD P+YGRIAT+ELFRP A D ++ ++ + + E+ +I R+ D
Sbjct: 84 MLDAPVYGRIATIELFRPCVLASIKFSPDSRAKSSCKFVNAFVYLNQENRYVICRSGRDA 143
Query: 56 SDRIGRPTDNGQ 67
SD IGRPT+ GQ
Sbjct: 144 SDHIGRPTNKGQ 155
>gi|156339124|ref|XP_001620088.1| hypothetical protein NEMVEDRAFT_v1g149219 [Nematostella vectensis]
gi|156204443|gb|EDO27988.1| predicted protein [Nematostella vectensis]
Length = 69
Score = 70.5 bits (171), Expect = 5e-09, Method: Composition-based stats.
Identities = 34/67 (50%), Positives = 45/67 (67%), Gaps = 9/67 (13%)
Query: 979 WRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNV---------SVEELCKRV 1029
WRSF+NE+K A F+DGDLIESFLDL R RM+E+ + + +V++L K V
Sbjct: 3 WRSFSNERKMEPAHGFIDGDLIESFLDLPRARMEEVVTGLQIDDGGMKKECTVDDLVKTV 62
Query: 1030 EELTRLH 1036
EELTR+H
Sbjct: 63 EELTRIH 69
>gi|303321596|ref|XP_003070792.1| CPSF A subunit region family protein [Coccidioides posadasii C735
delta SOWgp]
gi|240110489|gb|EER28647.1| CPSF A subunit region family protein [Coccidioides posadasii C735
delta SOWgp]
Length = 1394
Score = 70.5 bits (171), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 87/372 (23%), Positives = 162/372 (43%), Gaps = 37/372 (9%)
Query: 694 VRLLDDQTFEFISTYPLDTFEYGCSILSCSF-----SDDSNVYYCVGTAYVLPEENEPTK 748
++LL +T+ + +Y L E + + + + + VGTA V E+ P +
Sbjct: 1022 IKLLSPRTWSVVDSYELGDAERVMCMKTINMEISEITHEMKDMLVVGTATVRGEDITP-R 1080
Query: 749 GRILVF-IVE----------DGKLQLIAEKETKGAVYSLNAFNGK--LLAAINQKIQLYK 795
G I VF I+E + KL++ A+ + KGAV +++ G+ L+ A QK +
Sbjct: 1081 GSIYVFEIIEVAPDPDRPETNRKLKIFAKDDVKGAVTAVSGIGGQGFLIMAQGQKCMVRG 1140
Query: 796 WMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARD 855
L++DG+ + ++ L ++GD +K I Y E + +D
Sbjct: 1141 --LKEDGSLLPVAFMDMQCYVKVLKELQGTGLCIMGDALKGIWFAGYSEEPYRLTLFGKD 1198
Query: 856 YNANWMSAVEILDDD--IY-LGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVN 912
+ A + L D +Y L A+++ + + + E T + RL +H+G F +
Sbjct: 1199 NEYLQVIAADFLPDGKRLYILVADDDCTIHVLEYDPEDPTSSKGDRLLHRSSFHMGHFTS 1258
Query: 913 RF----RHGSLVMR----LPDSDVGQIPT---VIFGTVNGVIGVIASLPHEQYLFLEKLQ 961
+H S D DV +P V+ + G IGV+ L + Y L LQ
Sbjct: 1259 TMTLLPQHSSSPSADDPGEDDMDVDYVPKSYQVLVTSQEGSIGVVTPLTEDSYRRLSALQ 1318
Query: 962 TNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
+ L ++ GLN + +R+ E + +DG+L+ +LD+ R EI+ +
Sbjct: 1319 SQLVTSMEHPCGLNPKAYRAV--ESDGFGGRGIVDGNLLLRWLDMGVQRKAEIAGRVGAD 1376
Query: 1022 VEELCKRVEELT 1033
+E + +E+++
Sbjct: 1377 IESIRVDLEKIS 1388
>gi|320040273|gb|EFW22206.1| hypothetical protein CPSG_00105 [Coccidioides posadasii str.
Silveira]
Length = 1387
Score = 70.1 bits (170), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 87/372 (23%), Positives = 162/372 (43%), Gaps = 37/372 (9%)
Query: 694 VRLLDDQTFEFISTYPLDTFEYGCSILSCSF-----SDDSNVYYCVGTAYVLPEENEPTK 748
++LL +T+ + +Y L E + + + + + VGTA V E+ P +
Sbjct: 1015 IKLLSPRTWSVVDSYELGDAERVMCMKTINMEISEITHEMKDMLVVGTATVRGEDITP-R 1073
Query: 749 GRILVF-IVE----------DGKLQLIAEKETKGAVYSLNAFNGK--LLAAINQKIQLYK 795
G I VF I+E + KL++ A+ + KGAV +++ G+ L+ A QK +
Sbjct: 1074 GSIYVFEIIEVAPDPDRPETNRKLKIFAKDDVKGAVTAVSGIGGQGFLIMAQGQKCMVRG 1133
Query: 796 WMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARD 855
L++DG+ + ++ L ++GD +K I Y E + +D
Sbjct: 1134 --LKEDGSLLPVAFMDMQCYVKVLKELQGTGLCIMGDALKGIWFAGYSEEPYRLTLFGKD 1191
Query: 856 YNANWMSAVEILDDD--IY-LGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVN 912
+ A + L D +Y L A+++ + + + E T + RL +H+G F +
Sbjct: 1192 NEYLQVIAADFLPDGKRLYILVADDDCTIHVLEYDPEDPTSSKGDRLLHRSSFHMGHFTS 1251
Query: 913 RF----RHGSLVMR----LPDSDVGQIPT---VIFGTVNGVIGVIASLPHEQYLFLEKLQ 961
+H S D DV +P V+ + G IGV+ L + Y L LQ
Sbjct: 1252 TMTLLPQHSSSPSADDPGEDDMDVDYVPKSYQVLVTSQEGSIGVVTPLTEDSYRRLSALQ 1311
Query: 962 TNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
+ L ++ GLN + +R+ E + +DG+L+ +LD+ R EI+ +
Sbjct: 1312 SQLVTSMEHPCGLNPKAYRAV--ESDGFGGRGIVDGNLLLRWLDMGVQRKAEIAGRVGAD 1369
Query: 1022 VEELCKRVEELT 1033
+E + +E+++
Sbjct: 1370 IESIRVDLEKIS 1381
>gi|238880502|gb|EEQ44140.1| conserved hypothetical protein [Candida albicans WO-1]
Length = 1168
Score = 70.1 bits (170), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 221/1100 (20%), Positives = 432/1100 (39%), Gaps = 186/1100 (16%)
Query: 24 DFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDNGQIGIIDPD--CRLIGLH 81
D + I ++ +LQ+D ++ + I++ ++ G+ IDP+ C L+
Sbjct: 114 DGVVITSDSGNLSILQYDNKTKKFISKIQEPMTKNGWGRNYVGENLAIDPENRCILVAAM 173
Query: 82 LYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKF----------LYGC-----AKPTIV 126
+ LF I ++ G + + + QVL +K L+G K I+
Sbjct: 174 EKNKLFYKIESNSSGSKELSSPLEAHSKQVLCLKIVALNTDHNNPLFGALELTPEKKCII 233
Query: 127 VLYQDNKDARHVKTYEVALKDKDFVEGPWSQNN--LDNGADLLIPVPPPLCGVLIIGEET 184
Y+ ++ HV +K K P S N+ L N + LIP+P + G+++ G
Sbjct: 234 NYYELDQGLNHV------VKKK-----PNSSNSDPLPNDVNYLIPLPGHIGGMVVCGTNW 282
Query: 185 IVYCSANA---FKAIPIRPSITKAYGRVD-----ADGSRY--LLGDHAGLLHLLVITHE- 233
Y + + +P R T+ V+ ++ LL + G L L + ++
Sbjct: 283 CFYDKLDGPRIYLPLPRRNGQTQDSIIVNHVTHVLKKKKFFILLQNALGDLFKLTVDYDF 342
Query: 234 -KEKVTGLKIELLGETSIASTISYLDNAVVY---------------IGSSYGDSQLI--- 274
KE + + I A +++ N ++ +G + +L+
Sbjct: 343 DKEIIKNISITYFDTIPPALSLNIFKNGFLFANVLNNDKLLYQFEKLGDDLTEGELVINS 402
Query: 275 ----KLNLQPDAKGSY----------VEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSG 320
LN ++ S+ ++VLE L PI D ++D ++VT S
Sbjct: 403 SDYESLNSVRESVTSFKLKGLDNLALIDVLE---TLSPITDSKIID------SKLVTLSS 453
Query: 321 AYKDGSLRIVRNGIGINEQASVELQ-GIKGMWSLRSSTDDPFDTFLVVSFISETRILAMN 379
++ + +G+ L +++ + S + D +LV+S ++ L ++
Sbjct: 454 H---SYVKSITHGVPTTTLVESPLPITPTDIFTTKLSLESANDEYLVISSSLSSKTLVLS 510
Query: 380 LEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGS---VRLVSSTSRELRNEWKSPPG 436
+ + +E+ E F T+ +VQ+ S VR+V+ + +W P G
Sbjct: 511 IGEVVEDVEDSEFVLDQPTIAVQQVGIASVVQIYSNGIKHVRMVNGNKK--TTDWFPPAG 568
Query: 437 YSVNVATANASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEI-SCLDINPIGENPSYS 495
++ AT N QVL+A +VY EI D ++ Q EI + + I EN S
Sbjct: 569 ITITHATTNNQQVLIALSNLSVVYFEI-DATDDQLIEYQDRLEIATTITAMAIQENISEK 627
Query: 496 QIAAVGMWTDISVRIFSLPDLNLITKEHL----GGEIIPRSVLLCAFEGISYLLCALGDG 551
A+ +D ++++ SL + N + + L + + E +++ + +G
Sbjct: 628 SPFAIIGCSDETIQVVSLQEHNCLEIKSLQALSANSSSLKMLKSSGKE--THVHIGMENG 685
Query: 552 HLLNFLLNMKTGELTDRKKVSLGTQPITLRT--FSSKNTTHVFAASDRPTVIYSSNKKLL 609
++ G L++ + +G++P++L FSS+ + A S P + Y
Sbjct: 686 VYARIKIDTINGNLSNSRVKYIGSKPVSLSVIKFSSE-IEGILAISSAPWISYLYRDSFK 744
Query: 610 YSNVNLKEVSHMCPFNSAAF-PDSLAIAKEGELTIGTI-------DDIQKLHIRSIPLGE 661
+ + ++++ F S + + K+ L I ++ D Q L I + L
Sbjct: 745 ITPLLEIDITNGSSFISEDIGGEGIVGIKDNNLIIFSVGKEDSVFDPSQDLTIVTTKLRY 804
Query: 662 HPRRICHQEQSRTFAI-----------CSLKNQSCAEESEMHF---------------VR 695
PR++ +R F C++ E ++ ++
Sbjct: 805 TPRKMI-TYGNRLFISESEYNVQGPFKCNINGDVKENVDEDYYGAFGYEWKQNSWASCIQ 863
Query: 696 LLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSN-----VYYCVGTA---YVLPEENEPT 747
++D ++ + I + LD E S+ + SF+ S + VG +LP N
Sbjct: 864 VVDSKSNQVIQSLQLDGNESIVSMSAVSFNKTSTPSVPASHLVVGVCTNQTILP--NSYD 921
Query: 748 KGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQ 807
K + F + LQL+ + E L F KLL A I+LY D G ++L
Sbjct: 922 KSYLYTFKIGKKHLQLVHKTELDHIPQVLENFQDKLLVASGNHIRLY-----DIGQKQLL 976
Query: 808 SEC----GHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIE--ERARDYNANWM 861
+ +I + QT + I++ D KS S++ K +E + A D +
Sbjct: 977 KKSTTIIDFSTNINKIIPQT--NRIIICDSHKS-SIVFAKFDESQNQFVPFADDVMKRQI 1033
Query: 862 SAVEILDDDIYLGAENNFNLFTVR----------------KNSEGATDEERGRLEVVGEY 905
+++ LD D +G + N+F R K +G + +L+ + E+
Sbjct: 1034 TSIMNLDIDTLIGGDKFGNIFVTRIDEDISKQADDDWTILKTQDGILNSCPYKLQNLIEF 1093
Query: 906 HLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYL-FLEKLQTNL 964
H+G+ + F G L ++ +VI+ + G IG++ L + + L LQ +
Sbjct: 1094 HIGDIITSFNLGCL-------NLAGTESVIYTGLQGTIGLLIPLVSKSEVELLFNLQLYM 1146
Query: 965 RKVIKGVGGLNHEQWRSFNN 984
++ + G +H + RS+ N
Sbjct: 1147 QQSQNNLVGKDHLKLRSYYN 1166
>gi|448114553|ref|XP_004202604.1| Piso0_001448 [Millerozyma farinosa CBS 7064]
gi|359383472|emb|CCE79388.1| Piso0_001448 [Millerozyma farinosa CBS 7064]
Length = 1248
Score = 70.1 bits (170), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 133/677 (19%), Positives = 267/677 (39%), Gaps = 97/677 (14%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
++ I +LE R G +D+L I ++ K +LQ+D + IT S R
Sbjct: 66 VFSVIQSLEKIRLMGSQKDYLVITSDSGKLAILQYDTGRNRFITMFQEPHSKTGFRRNTP 125
Query: 66 GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFL---YGC 120
G + DP R LIG + L + D+KG ++ + + + + + G
Sbjct: 126 GAYLLTDPQNRAILIGALERNKLIYKVHSDDKGGMQISSPLESQTRHTITLAMCALDTGY 185
Query: 121 AKPTIVVLYQDNK--DARH------------VKTYEVALKDKDFVEGPWSQNNLDNGADL 166
P V + D D++ + +YE+ + + V N L A
Sbjct: 186 ENPVFVAIEADYGALDSKEHSIDPQAHQSLLLTSYELD-QGLNHVVRRVVNNKLPMTATH 244
Query: 167 LIPVPPPLCGVLIIGEETIVYCS---ANAFKAIPIRP----SITKAYGRVDADGSRYL-- 217
LIP+P P GVLI E +Y + PIR ++ + S YL
Sbjct: 245 LIPLPSPAGGVLICCESYAIYEPFEYKRLYLPFPIRQDRENTVIINHVSQKFKKSNYLIL 304
Query: 218 ----LGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDS-- 271
LGD L L E+E++ + + ++++++ L + ++ +++G+
Sbjct: 305 AQTQLGD--CLKFSLHYNDEREQLDDITVSYFDTIPVSNSLNILRSGFLFANTAHGNKLF 362
Query: 272 -QLIKL--------------------------NLQPDAKG----SYVEVLERYVNLGPIV 300
Q KL + +P KG + V+V++ +L P+
Sbjct: 363 YQFEKLGEEGNDLTLRCSDHYEQVTTIDHSKRDFKP--KGLENLALVDVMD---SLNPVT 417
Query: 301 DFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGM-WSLRSSTDD 359
D +++ + ++I+ +G+ +N+ + L + ++ + +
Sbjct: 418 DATLLETRTSESPDPLINLFTLSSRFIKILTHGMPVNQLVTSPLPIHPSLIFATKKFNNS 477
Query: 360 PFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRL 419
D +LV+S ++ L +++ + +EE F + + + +VQV S +R
Sbjct: 478 VNDDYLVISSELSSQTLVLSIGEVVEEVSDSEFVTDQPAIHVQQTGNSSIVQVFSNGIRS 537
Query: 420 VSSTSRELRN---------EWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTE 470
+ R +RN +W P G SV A+ N++Q+++ + Y EI D + +
Sbjct: 538 I----RHIRNGDEEIKKTTDWYPPAGISVTKASGNSTQLIIGMSNREVCYFEI-DPVDDQ 592
Query: 471 VKHAQLEYEIS----CLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGG 526
+ Q E+S ++P A+ +D ++++ SL N ++ L
Sbjct: 593 LVEYQERLEVSGGSISSLAILSSKDPEQRSAHAIIGSSDETIQVVSLKRHNCLSVVALQA 652
Query: 527 -EIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTF-- 583
S+L+ E Y+ + +G + +++ +G+L+D + LG++ + L
Sbjct: 653 LSSKSTSILMLRHEDSIYVHIGMENGLYVCTMIDEVSGKLSDTRIKFLGSKSVRLSPIKV 712
Query: 584 --SSKNTTHVFAASDRP 598
S+ + A S RP
Sbjct: 713 PGSAGEEDGILALSSRP 729
Score = 53.9 bits (128), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 69/305 (22%), Positives = 129/305 (42%), Gaps = 50/305 (16%)
Query: 761 LQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLY----KWMLRDDGTRELQSECGHHGHI 816
L I + E++ A +++ FN +LL ++LY K +LR + S + +I
Sbjct: 955 LLFIHKTESEFAPFAMIEFNNRLLIGTKNLLRLYDIGHKQLLR-----KASSSIDYFENI 1009
Query: 817 LALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAE 876
+ L G+ I+ D S + + Y E A D ++++ LD D +G +
Sbjct: 1010 IKL-TYMGGNRIMAADANISSTFVKYDQVENQFFPLADDIMKRKITSMCSLDYDTIVGGD 1068
Query: 877 NNFNLFTVR----------------KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLV 920
N+F R ++ + + RL+ + E++ G+ F GSL+
Sbjct: 1069 KFGNIFVSRIPEVLSKQADQDWGLIRHQDSYLNGAVSRLKNLCEFYSGDIPTSFSKGSLI 1128
Query: 921 MRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIK----------- 969
+ S+ T + GTV +I ++ E F+E L+ LR
Sbjct: 1129 L---GSEESIFYTGLMGTVGALIPLVTK--SEVQFFIE-LEAELRGYFDYNFDNFDEQKN 1182
Query: 970 --GVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCK 1027
+ G +H + RS+ N KN +DGDLIE F ++S I+ ++ + +++ K
Sbjct: 1183 GYNLLGKDHLKHRSYYNP-----VKNVIDGDLIERFSEVSYNNKIRIANKLDRTPKDIDK 1237
Query: 1028 RVEEL 1032
++ E+
Sbjct: 1238 KISEI 1242
>gi|5817204|emb|CAB53699.1| hypothetical protein [Homo sapiens]
Length = 215
Score = 70.1 bits (170), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 56/220 (25%), Positives = 102/220 (46%), Gaps = 31/220 (14%)
Query: 832 DLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR--KNSE 889
D+ +S + YK E + A D W++ +LD D GA+ N+ VR N+
Sbjct: 2 DVQESFIWVRYKRNENQLIIFADDTYPRWVTTASLLDYDTVAGADKFGNICVVRLPPNTN 61
Query: 890 GATDEE---------RGRL-------EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPT 933
DE+ RG L EV+ YH+GE V + +L+ G +
Sbjct: 62 DEVDEDPTGNKALWDRGLLNGASQKAEVIMNYHVGETVLSLQKTTLI-------PGGSES 114
Query: 934 VIFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAK 992
+++ T++G IG++ HE + F + ++ +LR + G +H +RS+ K
Sbjct: 115 LVYTTLSGGIGILVPFTSHEDHDFFQHVEMHLRSEHPPLCGRDHLSFRSY-----YFPVK 169
Query: 993 NFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
N +DGDL E F + + +S+ ++ + E+ K++E++
Sbjct: 170 NVIDGDLCEQFNSMEPNKQKNVSEELDRTPPEVSKKLEDI 209
>gi|393907594|gb|EJD74706.1| hypothetical protein LOAG_18016 [Loa loa]
Length = 398
Score = 70.1 bits (170), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 67/306 (21%), Positives = 145/306 (47%), Gaps = 25/306 (8%)
Query: 741 PEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRD 800
PE +PT +++ + +KE KG V SL + NG LL + QK+ + W+ +D
Sbjct: 103 PEPGQPTSKH---------RIKTLYDKEQKGPVTSLCSCNGYLLTGMGQKV--FIWLFKD 151
Query: 801 DGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANW 860
+ + + S H ++ L + R + + D+ +S++LL Y+ E A+ +RD ++
Sbjct: 152 NNLQGI-SFLDMHFYVHQL-IGVR-NLALACDMYRSVALLRYQEEYKALSLASRDMRSDV 208
Query: 861 ---MSAVEILDDDI--YLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRF- 914
M+A I+D+ ++ ++ N+ E +L + E ++G VN F
Sbjct: 209 QPPMAAQFIIDNKQMGFVMSDEAANIAIFNYLPETLESLGGEKLTLRAEINIGTVVNSFI 268
Query: 915 -RHGSLVMRLPDSDVGQI--PTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGV 971
G + ++++ + +V+F +++G G + L + + L LQ + ++
Sbjct: 269 RVKGHISSGFVENELFSLERQSVLFASLDGSFGFLRPLTEKVFRRLHMLQQLMSSMVPQP 328
Query: 972 GGLNHEQWRSFNNEKKT--VDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRV 1029
GLN + R+ + ++ +N +DGD++ +L LS +++++ + S + +
Sbjct: 329 AGLNAKGARAARPPRPNHYLNTRNLVDGDMVMQYLHLSLPEKNDLARKLGTSRYHIIDDL 388
Query: 1030 EELTRL 1035
E+ R+
Sbjct: 389 IEICRV 394
>gi|448111975|ref|XP_004201977.1| Piso0_001448 [Millerozyma farinosa CBS 7064]
gi|359464966|emb|CCE88671.1| Piso0_001448 [Millerozyma farinosa CBS 7064]
Length = 1249
Score = 70.1 bits (170), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 136/678 (20%), Positives = 272/678 (40%), Gaps = 99/678 (14%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
++ I +LE R G +D+L + ++ K +LQ+D + L+T S R
Sbjct: 66 VFSVIQSLEKVRLTGSQKDYLVVTSDSGKLAILQYDTGRNRLVTVFQEPHSKTGFRRNTP 125
Query: 66 GQIGIIDPDCR--LIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEELQVLDIKFLYGCAKP 123
G + DP R LIG + L + D+KG ++ + + E Q+ L CA
Sbjct: 126 GPYLLTDPQNRAILIGALERNKLIYKVHSDDKGGMQISSPL---ESQIRHTITLAMCALD 182
Query: 124 T-----IVVLYQDNKDARHVKTYEV---ALKDKDFVEGPWSQ-----------NNLDNGA 164
T + V + A K Y + A + F Q N L A
Sbjct: 183 TGYENPVFVAIEAEYGALDSKEYSIDSQAHQTLLFTSYELDQGLNHVVRRVVNNKLPISA 242
Query: 165 DLLIPVPPPLCGVLIIGEETIVYCSAN---AFKAIPIRPS------ITKAYGRVDADGSR 215
LIP+P P+ GVLI E +Y + + PIR I + +
Sbjct: 243 THLIPLPSPVGGVLICCESYAIYERFDYKRLYLPFPIRQGTENTVIINHVFQKFKKSNYL 302
Query: 216 YLLGDHAG--LLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDS-- 271
L G L L E+E++ + + ++++++ L + ++ +++G+
Sbjct: 303 ILAQTQLGDCLKFSLHYNDEREQLDDITVSYFDTIPVSNSLNILRSGFLFANTAHGNKLF 362
Query: 272 -QLIKLNLQPD------------------------AKG----SYVEVLERYVNLGPIVDF 302
Q KL + + KG + V+V++ +L P+ D
Sbjct: 363 YQFEKLGEEDNDLTLRCSDHYEQMTTIDHSKREFKPKGLENLALVDVMD---SLNPVTDA 419
Query: 303 CVVDLERQGQG-----QVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGM-WSLRSS 356
+++ R + +VT S + ++I+ +G+ +N+ + L + ++ +
Sbjct: 420 TLLE-TRMSESPDPLINLVTLSSRF----IKILTHGMPVNQLVTSPLPIHPSLIFTTKKF 474
Query: 357 TDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSG- 415
+ D +LV+S ++ L +++ + +EE F + T+ + +VQV S
Sbjct: 475 NESVNDDYLVISSELSSQTLVLSIGEVVEEVSDSEFVTNQPTIHVQQTGKSSIVQVFSNG 534
Query: 416 --SVRLVSSTSRELRN--EWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTEV 471
S+R + + E++ +W P G SV A+ N +Q+++ + Y EI D ++
Sbjct: 535 IRSIRHIKNGDEEIKKTTDWYPPAGISVIKASGNNTQLIIGMSNREVCYFEI-DSADDQL 593
Query: 472 KHAQLEYEIS----CLDINPIGENPSYSQIAAVGMWTDISVRIFSLPD---LNLITKEHL 524
Q E+S ++P A+ +D ++++ SL L+++ + L
Sbjct: 594 IEYQERLEVSGGSISSLAILSSKDPEKRSSHAIIGSSDETIQVVSLKRHNCLSIVALQAL 653
Query: 525 GGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTF- 583
+ S+L+ Y+ + +G + +++ +G+L+D + LG++ + L
Sbjct: 654 SSK--STSILMINHNDSVYVHIGMENGLYVRTMIDEVSGKLSDTRIKFLGSKSVRLSPIK 711
Query: 584 ---SSKNTTHVFAASDRP 598
+S + A S RP
Sbjct: 712 VPGNSGEENGILALSSRP 729
Score = 55.8 bits (133), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 80/352 (22%), Positives = 145/352 (41%), Gaps = 63/352 (17%)
Query: 725 SDDSNVYYCVGTAYVLPEENEPTKGRILVFIVEDGK-----------LQLIAEKETKGAV 773
S DS+ Y VGT ++N K + F V + L I + E++ A
Sbjct: 911 SSDSSSYLIVGTTRNSFDKN--AKHYLYTFKVAGSRNSSHSQKGQKTLLFIHKTESEFAP 968
Query: 774 YSLNAFNGKLLAAINQKIQLY----KWMLRDDGTRELQSECGHHGHILALYVQTRGDFIV 829
++ FN +LL ++LY K +LR + S + +I+ L G+ I+
Sbjct: 969 LAMIEFNNRLLIGTKNLLRLYDIGHKQLLR-----KASSSIDYFENIIKL-AYMGGNRIM 1022
Query: 830 VGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---- 885
D S + + Y E A D ++++ LD D +G + N+F R
Sbjct: 1023 AADASMSSTFVKYDQVENQFFPLADDVMKRKITSMCPLDYDTIVGGDKFGNVFVSRIPEF 1082
Query: 886 ------------KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPT 933
++ + + RL+ + E++ G+ F GSL++ S+ T
Sbjct: 1083 LSKQVDQDWGLIRHQDSYLNGAASRLKNLCEFYSGDIPTSFSKGSLIL---GSEESIFYT 1139
Query: 934 VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVI-------------KGVGGLNHEQWR 980
+ GTV +I ++ +E F+E L+ LR + G +H + R
Sbjct: 1140 GLMGTVGALIPLVTK--NEVQFFIE-LEAELRSYFDYNFDNFDEQKNGHNLLGKDHLKHR 1196
Query: 981 SFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
S+ N KN +DGDLIE F ++S I+ ++ + +++ K++ E+
Sbjct: 1197 SYYNP-----VKNVIDGDLIERFSEVSYNNKIRIANKLDRTPKDIDKKISEI 1243
>gi|71032257|ref|XP_765770.1| hypothetical protein [Theileria parva strain Muguga]
gi|68352727|gb|EAN33487.1| hypothetical protein TP01_0243 [Theileria parva]
Length = 1504
Score = 69.7 bits (169), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 63/274 (22%), Positives = 113/274 (41%), Gaps = 39/274 (14%)
Query: 799 RDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNA 858
+D+ T + + + + + T+ D I VGDLM S+ +L K + + E RD+N
Sbjct: 1220 QDETTLFIDVVANYDSNTFVVSLDTKDDVIFVGDLMTSVKML--KFRDNRLLETCRDFNT 1277
Query: 859 NWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRF--RH 916
W +++ +D+ L +++ N +K TD++ R + G +H GE VN+ R
Sbjct: 1278 LWTTSLAAVDNSSCLVSDDLGNFLLFKKVQHPTTDQQSIRFDKQGLFHHGEVVNKILKRT 1337
Query: 917 GSLVMRLPDSDVGQIPTVIFGTVNGVIGVIAS-LPHE----------------------- 952
V + +S + + F N V+ S P E
Sbjct: 1338 QMSVQHVANSRMSRSNPREFMVSNRVVSESESNNPSETLNVNEYTNLFKSFFTCATTSGS 1397
Query: 953 -----------QYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIE 1001
+L L L+ + V K +G + R+F + + K F+DGDL+E
Sbjct: 1398 LLQVCFFDDLNMFLKLSLLEHTMHLVQKDLGNIPSRNQRNFEDLHSNIPTKGFVDGDLVE 1457
Query: 1002 SFLDLSRTRMDEISKTMNVSVEELCKRVEELTRL 1035
+FL L + + +TM ++ L ++ L L
Sbjct: 1458 AFLKLPDSLKKWVFETMLINSRHLGVKLSSLESL 1491
>gi|257215708|emb|CAX83006.1| Cleavage and polyadenylation specificity factor subunit 1
[Schistosoma japonicum]
Length = 462
Score = 69.7 bits (169), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 63/213 (29%), Positives = 97/213 (45%), Gaps = 36/213 (16%)
Query: 111 VLDIKFLYGCAKPTIVVLYQD------NKDARHVKTYEVALK---DKDFVEGPWSQNNLD 161
VLD++FL+G +PT++VLY+ AR VAL K W Q +L
Sbjct: 214 VLDMQFLHGFYEPTLLVLYEPIGTWAGRVSARRDTCCIVALSFNLQKRTNPVIWFQESLP 273
Query: 162 NGADLLIPVPPPLCGVLIIGEETIVY-------------CSANAFKAIPIRPSITKAYGR 208
+IPVP P+ GV+I+ +I+Y C A P+R + G
Sbjct: 274 FDCRSVIPVPQPIGGVVIMAANSILYLKQTLPSCSLPLNCYAQISTNFPMRQDVPSC-GP 332
Query: 209 VDADGSR--------YLLGDHAGLLHLLVITHEK--EKVTGLKIELLGETSIASTISYLD 258
+ DG R +L+G +G L+LL + E+ + VT L +G + L+
Sbjct: 333 LSIDGCRVVTLNETQFLIGTRSGNLYLLSLWLEQATQTVTSLLFHKVGHAVPPHCMVLLE 392
Query: 259 NAVVYIGSSYGDSQLIKLN---LQPDAKGSYVE 288
+ ++IGS + DS L+K+N L DA G V+
Sbjct: 393 SKYLFIGSRFCDSVLMKINHSLLCVDASGKEVD 425
>gi|148697643|gb|EDL29590.1| cleavage and polyadenylation specific factor 1, isoform CRA_b [Mus
musculus]
Length = 1311
Score = 69.3 bits (168), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 55/212 (25%), Positives = 96/212 (45%), Gaps = 22/212 (10%)
Query: 826 DFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG---AENNFNLF 882
+FI+ D+MKSISLL Y+ E + +RD + +V+ + D+ LG ++ + NL
Sbjct: 1088 NFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLM 1147
Query: 883 TVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPT--------- 933
E RL ++H+G VN F R P + P+
Sbjct: 1148 VYMYLPEAKESFGGMRLLRRADFHVGAHVNTF------WRTPCRGAAEGPSKKSVVWENK 1201
Query: 934 --VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVD- 990
F T++G IG++ + + Y L LQ L ++ GLN +R + +++ +
Sbjct: 1202 HITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQN 1261
Query: 991 -AKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
+N LDG+L+ +L LS E++K + +
Sbjct: 1262 AVRNVLDGELLNRYLYLSTMERSELAKKIGTT 1293
Score = 57.0 bits (136), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 107/551 (19%), Positives = 214/551 (38%), Gaps = 151/551 (27%)
Query: 20 GEAQDFLFIATERYKFCVLQWDAESSELITRAM-----GDVSDRIGRPTDNGQIGIIDPD 74
G +D L ++ + K V+++D + +L T ++ ++ D + ++ + DPD
Sbjct: 9 GAKRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPELRDGFVQNVHTPRVRV-DPD 67
Query: 75 CRLIGLHLYDGLFKVIPF-------DNKGQLKEA----------FNIR-LEE--LQVLDI 114
R + +Y V+PF +++G + E ++R L+E L ++D+
Sbjct: 68 GRCAAMLIYGTRLVVLPFRRESLAEEHEGLMGEGQRSSFLPSYIIDVRALDEKLLNIIDL 127
Query: 115 KFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDFVEG--P--WSQNNLDNGAD 165
+FL+G +PT+++L++ N+ A T + + + P WS +L
Sbjct: 128 QFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKVHPVIWSLTSLPFDCT 187
Query: 166 LLIPVPPPLCGVLIIGEETIVYCS-------------ANAFKAIPIRPSITKAYGRVDAD 212
+ VP P+ GV+I +++Y + A P+R T+ R+ D
Sbjct: 188 QALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFPLR---TQEGVRITLD 244
Query: 213 GS--------RYLLGDHAGLLHLL-VITHEKEKVTGLKIELLGETSIASTISYLDNAVVY 263
+ + ++ G +++L +IT V + + + +++ ++ ++
Sbjct: 245 CAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLTTSMVTMEPGYLF 304
Query: 264 IGSSYGDSQLIKL--NLQ---------------PDAKGSYVE------------------ 288
+GS G+S L+K LQ P +K VE
Sbjct: 305 LGSRLGNSLLLKYTEKLQEPPASSVREAADKEEPPSKKKRVEPAVGWTGGKTVPQDEVDE 364
Query: 289 --------------------VLERYVNLGPIVDFCVVD---LERQGQG------QVVTCS 319
V + +N+GP + V + L + Q ++V CS
Sbjct: 365 IEVYGSEAQSGTQLATYSFEVCDSMLNIGPCANAAVGEPAFLSEEFQNSPEPDLEIVVCS 424
Query: 320 GAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL------------------------RS 355
G K+G+L +++ I + EL G MW++ ++
Sbjct: 425 GYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEEEETPKAESTEQEPSAPKA 484
Query: 356 STDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQ-LVQVTS 414
D FL++S T IL E+ E + GF +Q T+F + N+ +VQV+
Sbjct: 485 EEDGRRHGFLILSREDSTMILQTG--QEIMELDTSGFATQGPTVFAGNIGDNRYIVQVSP 542
Query: 415 GSVRLVSSTSR 425
+RL+ ++
Sbjct: 543 LGIRLLEGVNQ 553
>gi|194474008|ref|NP_001124043.1| cleavage and polyadenylation specificity factor subunit 1 [Rattus
norvegicus]
gi|149066087|gb|EDM15960.1| cleavage and polyadenylation specific factor 1, 160kDa (predicted),
isoform CRA_a [Rattus norvegicus]
Length = 1386
Score = 69.3 bits (168), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 55/212 (25%), Positives = 96/212 (45%), Gaps = 22/212 (10%)
Query: 826 DFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG---AENNFNLF 882
+FI+ D+MKSISLL Y+ E + +RD + +V+ + D+ LG ++ + NL
Sbjct: 1163 NFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLM 1222
Query: 883 TVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPT--------- 933
E RL ++H+G VN F R P + P+
Sbjct: 1223 VYMYLPEAKESFGGMRLLRRADFHVGAHVNTF------WRTPCRGAAEGPSKKSVMWENK 1276
Query: 934 --VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVD- 990
F T++G IG++ + + Y L LQ L ++ GLN +R + +++ +
Sbjct: 1277 HITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQN 1336
Query: 991 -AKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
+N LDG+L+ +L LS E++K + +
Sbjct: 1337 AVRNVLDGELLNRYLYLSTMERSELAKKIGTT 1368
Score = 59.3 bits (142), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 106/561 (18%), Positives = 218/561 (38%), Gaps = 147/561 (26%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAM-----GDVSDRIG 60
+G + ++ + G +D L ++ + K V+++D + +L T ++ ++ D
Sbjct: 74 FFGNVMSMASVQLAGAKRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPELRDGFV 133
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPF-------DNKGQLKEA----------FN 103
+ ++ + DPD R + +Y V+PF +++G + E +
Sbjct: 134 QNVHTPRVRV-DPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLMGEGQRSSFLPSYIID 192
Query: 104 IR-LEE--LQVLDIKFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDFVEG-- 153
+R L+E L ++D++FL+G +PT+++L++ N+ A T + + +
Sbjct: 193 VRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKVH 252
Query: 154 P--WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAFKAIPI 198
P WS +L + VP P+ GV+I +++Y + A P+
Sbjct: 253 PVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFPL 312
Query: 199 RPSITKAYGRVDADGS--------RYLLGDHAGLLHLL-VITHEKEKVTGLKIELLGETS 249
R T+ R+ D + + ++ G +++L +IT V + +
Sbjct: 313 R---TQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASV 369
Query: 250 IASTISYLDNAVVYIGSSYGDSQLIKL--NLQ---------------PDAKGSYV----- 287
+ +++ ++ +++GS G+S L+K LQ P +K V
Sbjct: 370 LTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASSVREAADKEEPPSKKKRVDPTVG 429
Query: 288 -------------------------------EVLERYVNLGPIVD-------FCVVDLER 309
EV + +N+GP + F +
Sbjct: 430 WTGGKTQDEVDEIEVYGSEAQSGTQLATYSFEVCDSMLNIGPCANAAVGEPAFLSEENSP 489
Query: 310 QGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL---------------- 353
+ ++V CSG K+G+L +++ I + EL G MW++
Sbjct: 490 EPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEEEETPKAES 549
Query: 354 --------RSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAI 405
++ D FL++S T IL E+ E + GF +Q T+F +
Sbjct: 550 TEQEPSAPKAEEDGRRHGFLILSREDSTMILQTG--QEIMELDTSGFATQGPTVFAGNIG 607
Query: 406 YNQ-LVQVTSGSVRLVSSTSR 425
N+ +VQV+ +RL+ ++
Sbjct: 608 DNRYIVQVSPLGIRLLEGVNQ 628
>gi|148697642|gb|EDL29589.1| cleavage and polyadenylation specific factor 1, isoform CRA_a [Mus
musculus]
Length = 1417
Score = 69.3 bits (168), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 55/212 (25%), Positives = 96/212 (45%), Gaps = 22/212 (10%)
Query: 826 DFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG---AENNFNLF 882
+FI+ D+MKSISLL Y+ E + +RD + +V+ + D+ LG ++ + NL
Sbjct: 1194 NFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLM 1253
Query: 883 TVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPT--------- 933
E RL ++H+G VN F R P + P+
Sbjct: 1254 VYMYLPEAKESFGGMRLLRRADFHVGAHVNTF------WRTPCRGAAEGPSKKSVVWENK 1307
Query: 934 --VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVD- 990
F T++G IG++ + + Y L LQ L ++ GLN +R + +++ +
Sbjct: 1308 HITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQN 1367
Query: 991 -AKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
+N LDG+L+ +L LS E++K + +
Sbjct: 1368 AVRNVLDGELLNRYLYLSTMERSELAKKIGTT 1399
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 108/560 (19%), Positives = 218/560 (38%), Gaps = 151/560 (26%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAM-----GDVSDRIG 60
+G + ++ + G +D L ++ + K V+++D + +L T ++ ++ D
Sbjct: 101 FFGNVMSMASVQLAGAKRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPELRDGFV 160
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPF-------DNKGQLKEA----------FN 103
+ ++ + DPD R + +Y V+PF +++G + E +
Sbjct: 161 QNVHTPRVRV-DPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLMGEGQRSSFLPSYIID 219
Query: 104 IR-LEE--LQVLDIKFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDFVEG-- 153
+R L+E L ++D++FL+G +PT+++L++ N+ A T + + +
Sbjct: 220 VRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKVH 279
Query: 154 P--WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAFKAIPI 198
P WS +L + VP P+ GV+I +++Y + A P+
Sbjct: 280 PVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFPL 339
Query: 199 RPSITKAYGRVDADGS--------RYLLGDHAGLLHLL-VITHEKEKVTGLKIELLGETS 249
R T+ R+ D + + ++ G +++L +IT V + +
Sbjct: 340 R---TQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASV 396
Query: 250 IASTISYLDNAVVYIGSSYGDSQLIKL--NLQ---------------PDAKGSYVE---- 288
+ +++ ++ +++GS G+S L+K LQ P +K VE
Sbjct: 397 LTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASSVREAADKEEPPSKKKRVEPAVG 456
Query: 289 ----------------------------------VLERYVNLGPIVDFCVVD---LERQG 311
V + +N+GP + V + L +
Sbjct: 457 WTGGKTVPQDEVDEIEVYGSEAQSGTQLATYSFEVCDSMLNIGPCANAAVGEPAFLSEEF 516
Query: 312 QG------QVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL------------ 353
Q ++V CSG K+G+L +++ I + EL G MW++
Sbjct: 517 QNSPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEEEETP 576
Query: 354 ------------RSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFC 401
++ D FL++S T IL E+ E + GF +Q T+F
Sbjct: 577 KAESTEQEPSAPKAEEDGRRHGFLILSREDSTMILQTG--QEIMELDTSGFATQGPTVFA 634
Query: 402 HDAIYNQ-LVQVTSGSVRLV 420
+ N+ +VQV+ +RL+
Sbjct: 635 GNIGDNRYIVQVSPLGIRLL 654
>gi|221116994|ref|XP_002166173.1| PREDICTED: DNA damage-binding protein 1-like, partial [Hydra
magnipapillata]
Length = 107
Score = 69.3 bits (168), Expect = 1e-08, Method: Composition-based stats.
Identities = 28/54 (51%), Positives = 40/54 (74%)
Query: 2 LDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDV 55
D+ +YGR+A ++LFRP E QD LFI TERY+ +L + AE+ ++IT+A GDV
Sbjct: 53 FDLNVYGRVAVMQLFRPQNENQDLLFILTERYRTAILAYKAETGDIITKAYGDV 106
>gi|354491124|ref|XP_003507706.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
isoform 2 [Cricetulus griseus]
Length = 1388
Score = 69.3 bits (168), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 55/212 (25%), Positives = 96/212 (45%), Gaps = 22/212 (10%)
Query: 826 DFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG---AENNFNLF 882
+FI+ D+MKSISLL Y+ E + +RD + +V+ + D+ LG ++ + NL
Sbjct: 1165 NFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLM 1224
Query: 883 TVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPT--------- 933
E RL ++H+G VN F R P + P+
Sbjct: 1225 VYMYLPEAKESFGGMRLLRRADFHVGAHVNTF------WRTPCRGAAEGPSKKSVMWENK 1278
Query: 934 --VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVD- 990
F T++G IG++ + + Y L LQ L ++ GLN +R + +++ +
Sbjct: 1279 HITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQN 1338
Query: 991 -AKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
+N LDG+L+ +L LS E++K + +
Sbjct: 1339 AVRNVLDGELLNRYLYLSTMERSELAKKIGTT 1370
Score = 57.4 bits (137), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 106/563 (18%), Positives = 219/563 (38%), Gaps = 149/563 (26%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAM-----GDVSDRIG 60
+G + ++ + G +D L ++ + K V+++D + +L T ++ ++ D
Sbjct: 74 FFGNVMSMASVQLAGAKRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEESELRDGFV 133
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPF-------DNKGQLKEA----------FN 103
+ ++ + DPD R + +Y V+PF +++G + E +
Sbjct: 134 QNVHTPRVRV-DPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLMGEGQRSSFLPSYIID 192
Query: 104 IR-LEE--LQVLDIKFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDFVEG-- 153
+R L+E L ++D++FL+G +PT+++L++ N+ A T + + +
Sbjct: 193 VRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKVH 252
Query: 154 P--WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAFKAIPI 198
P WS +L + VP P+ GV+I +++Y + A P+
Sbjct: 253 PVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFPL 312
Query: 199 RPSITKAYGRVDADGS--------RYLLGDHAGLLHLL-VITHEKEKVTGLKIELLGETS 249
R T+ R+ D + + ++ G +++L +IT V + +
Sbjct: 313 R---TQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASV 369
Query: 250 IASTISYLDNAVVYIGSSYGDSQLIKLN--LQ---------------PDAKGSYVE---- 288
+ +++ ++ +++GS G+S L+K LQ P +K V+
Sbjct: 370 LTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASSVREAADKEEPPSKKKRVDPTAG 429
Query: 289 ----------------------------------VLERYVNLGPIVDFCVVD---LERQG 311
V + +N+GP + V + L +
Sbjct: 430 WTGGKTVPQDEVDEIEVYGSEAQSGTQLATYSFEVCDSMLNIGPCANAAVGEPAFLSEEN 489
Query: 312 QGQ----VVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL-------------- 353
+ +V CSG K+G+L +++ I + EL G MW++
Sbjct: 490 SPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEEEEAPRA 549
Query: 354 ----------RSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHD 403
++ D FL++S T IL E+ E + GF +Q T+F +
Sbjct: 550 ESTEQESTTPKAEEDGRRHGFLILSREDSTMILQTG--QEIMELDTSGFATQGPTVFAGN 607
Query: 404 AIYNQ-LVQVTSGSVRLVSSTSR 425
N+ +VQV+ +RL+ ++
Sbjct: 608 IGDNRYIVQVSPLGIRLLEGVNQ 630
>gi|148697644|gb|EDL29591.1| cleavage and polyadenylation specific factor 1, isoform CRA_c [Mus
musculus]
Length = 1388
Score = 69.3 bits (168), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 55/212 (25%), Positives = 96/212 (45%), Gaps = 22/212 (10%)
Query: 826 DFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG---AENNFNLF 882
+FI+ D+MKSISLL Y+ E + +RD + +V+ + D+ LG ++ + NL
Sbjct: 1165 NFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLM 1224
Query: 883 TVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPT--------- 933
E RL ++H+G VN F R P + P+
Sbjct: 1225 VYMYLPEAKESFGGMRLLRRADFHVGAHVNTF------WRTPCRGAAEGPSKKSVVWENK 1278
Query: 934 --VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVD- 990
F T++G IG++ + + Y L LQ L ++ GLN +R + +++ +
Sbjct: 1279 HITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQN 1338
Query: 991 -AKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
+N LDG+L+ +L LS E++K + +
Sbjct: 1339 AVRNVLDGELLNRYLYLSTMERSELAKKIGTT 1370
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 106/563 (18%), Positives = 218/563 (38%), Gaps = 149/563 (26%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAM-----GDVSDRIG 60
+G + ++ + G +D L ++ + K V+++D + +L T ++ ++ D
Sbjct: 74 FFGNVMSMASVQLAGAKRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPELRDGFV 133
Query: 61 RPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPF-------DNKGQLKEA----------FN 103
+ ++ + DPD R + +Y V+PF +++G + E +
Sbjct: 134 QNVHTPRVRV-DPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLMGEGQRSSFLPSYIID 192
Query: 104 IR-LEE--LQVLDIKFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDFVEG-- 153
+R L+E L ++D++FL+G +PT+++L++ N+ A T + + +
Sbjct: 193 VRALDEKLLNIIDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKVH 252
Query: 154 P--WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAFKAIPI 198
P WS +L + VP P+ GV+I +++Y + A P+
Sbjct: 253 PVIWSLTSLPFDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTTGTTAFPL 312
Query: 199 RPSITKAYGRVDADGS--------RYLLGDHAGLLHLL-VITHEKEKVTGLKIELLGETS 249
R T+ R+ D + + ++ G +++L +IT V + +
Sbjct: 313 R---TQEGVRITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASV 369
Query: 250 IASTISYLDNAVVYIGSSYGDSQLIKLN--LQ---------------PDAKGSYVE---- 288
+ +++ ++ +++GS G+S L+K LQ P +K VE
Sbjct: 370 LTTSMVTMEPGYLFLGSRLGNSLLLKYTEKLQEPPASSVREAADKEEPPSKKKRVEPAVG 429
Query: 289 ----------------------------------VLERYVNLGPIVD-------FCVVDL 307
V + +N+GP + F +
Sbjct: 430 WTGGKTVPQDEVDEIEVYGSEAQSGTQLATYSFEVCDSMLNIGPCANAAVGEPAFLSEEN 489
Query: 308 ERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL-------------- 353
+ ++V CSG K+G+L +++ I + EL G MW++
Sbjct: 490 SPEPDLEIVVCSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEEEETPKA 549
Query: 354 ----------RSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHD 403
++ D FL++S T IL E+ E + GF +Q T+F +
Sbjct: 550 ESTEQEPSAPKAEEDGRRHGFLILSREDSTMILQTG--QEIMELDTSGFATQGPTVFAGN 607
Query: 404 AIYNQ-LVQVTSGSVRLVSSTSR 425
N+ +VQV+ +RL+ ++
Sbjct: 608 IGDNRYIVQVSPLGIRLLEGVNQ 630
>gi|412986884|emb|CCO15310.1| predicted protein [Bathycoccus prasinos]
Length = 1595
Score = 68.9 bits (167), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 84/346 (24%), Positives = 145/346 (41%), Gaps = 53/346 (15%)
Query: 694 VRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSN----VYYCVGTAYVLPE-ENEPTK 748
+RLL T E ++ E+ + + D S +GTA +P E+ P +
Sbjct: 1230 IRLLVPGTLETAWQKAVEPGEHILCVKNVQIRDQSTGALLSMLAIGTA--MPGGEDTPCR 1287
Query: 749 GRILVFIV-----EDGKL----QLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLR 799
GRIL+F + DG + +L EK +K A ++ + +G + AI K+ + W
Sbjct: 1288 GRILLFAIMWERARDGGVRWRGELKCEKPSKMACSAIESVDGTFMVAIGTKLTAHSW--- 1344
Query: 800 DDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEG--AIEERARDYN 857
DG + L + + + +F++ GDL KSI + +K +G + + +DY
Sbjct: 1345 -DG-KHLNPIAFYDTPLYTTTLCCVKNFLLCGDLHKSIRFVRWKDSQGEKTLSQLGKDYE 1402
Query: 858 ANWMSAVEILDDD---IYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVN-- 912
A E + D L A+ N N + + A + +L YH G +
Sbjct: 1403 VLDCIASEFMIDGGTLSLLAADANGNAHVFQYAPKLAESWKGDKLLPKSAYHAGSLIRKM 1462
Query: 913 -RFRHGSLVMRLPDSDVGQIP----TVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKV 967
RF+ G VG+ V FG+ +G +G+ + + +L LEKLQ +R
Sbjct: 1463 VRFQIG----------VGEQKQNRHAVFFGSSDGGLGIFSPVDEHTFLNLEKLQDAMRSN 1512
Query: 968 IKGVG------GLNHEQWRSFNNEKKTV----DAKNFLDGDLIESF 1003
I GLN + +R+ + + +V + +DG L+ F
Sbjct: 1513 IVASSNSINPLGLNSKTYRALKSSEGSVARQTPPRTIVDGGLLSKF 1558
Score = 42.7 bits (99), Expect = 0.98, Method: Compositional matrix adjust.
Identities = 39/147 (26%), Positives = 64/147 (43%), Gaps = 23/147 (15%)
Query: 296 LGPIVDFCV-----VDLERQGQGQVVTCSGAYKDGSLRIVRNGIG---INEQASVELQGI 347
+ P+VD V V + + ++V G K+G+L I+ GI + E S L G+
Sbjct: 679 ISPVVDLTVGASAPVGTDLDPRTELVAACGHGKNGALAILTRGITPELVTEVESGALPGL 738
Query: 348 KGMWSLRSSTDDP-----------FDTFLVVSFISETRILAMNLEDELEETEIE-GFCSQ 395
+ W+ R+ D+ FD L++S S + + +EL E E F
Sbjct: 739 RACWATRTEDDNDGTVRPKRKDELFDEHLILSLSSTKTTMVLETGEELREVSKEVDFIVD 798
Query: 396 TQTLFCHDAIYN--QLVQVTSGSVRLV 420
+TL C + I+N + QVT +R
Sbjct: 799 EETLAC-ERIFNGRAIAQVTKTKIRFT 824
Score = 39.7 bits (91), Expect = 8.3, Method: Compositional matrix adjust.
Identities = 26/92 (28%), Positives = 46/92 (50%), Gaps = 10/92 (10%)
Query: 111 VLDIKFLYGCAKPTIVVLYQDN------KDARHVKTYEVALKDKDFVEGP----WSQNNL 160
V D FL+G +P +++LY+ N + + + T ++ D + W++ L
Sbjct: 292 VRDCCFLHGYGEPVLMILYESNPPTWSGRLSLRMDTCKLVAVSIDCTKKKYTIVWTREKL 351
Query: 161 DNGADLLIPVPPPLCGVLIIGEETIVYCSANA 192
+ A L PVP PL GVL++ I+Y S ++
Sbjct: 352 PSAAYSLFPVPNPLGGVLVLSSGHILYESQSS 383
>gi|50292811|ref|XP_448838.1| hypothetical protein [Candida glabrata CBS 138]
gi|74608786|sp|Q6FLQ6.1|RSE1_CANGA RecName: Full=Pre-mRNA-splicing factor RSE1
gi|49528151|emb|CAG61808.1| unnamed protein product [Candida glabrata]
Length = 1296
Score = 68.9 bits (167), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 80/349 (22%), Positives = 155/349 (44%), Gaps = 58/349 (16%)
Query: 249 SIASTISYLDNAVVYIGSSYGDSQLIKLNLQP-----------DAKGSYVEVLERYVNLG 297
S +S + +N +YI S + + LN + D + + V+ ++ N+
Sbjct: 340 SNSSKLHIFNNGSMYINSQFNYDH-VYLNFESIGDNDENYDKIDNENENISVISKHTNIN 398
Query: 298 PI-VDFCVVD------LERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVEL-QGIKG 349
PI ++ C+++ + QG + + I+RN I + E S L QG+
Sbjct: 399 PIALNLCLMENMPLTFMHFQGGNRTTDSE------KVNIIRNAIPLKEYVSSPLPQGVSN 452
Query: 350 MWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIY--- 406
++++++ + +F+ ++ I+ T ++ ++ + IE + + T D +
Sbjct: 453 IFTIKTQYQ-SYHSFIFLTMINFTTVIL-----KIADDSIEQYIPASDTFKLKDDMTIHV 506
Query: 407 -----NQLVQVTSGSVRLVSSTSRELRN-----EWKSPPGYSVNVATANASQVLLATGGG 456
N ++QV R + S++ N +W P G S+ A +N SQ++LA
Sbjct: 507 ATMGDNSIIQVCKDEFRQILLDSKDEENFKMNLKWYPPAGVSILSAVSNFSQLILALSNN 566
Query: 457 HLVYLEIGDGILTEVKH-AQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPD 515
+VYL++ + L E K+ +L I+ L + + +N S+I AVG +D V + SL
Sbjct: 567 EIVYLQLENNTLIEYKNRPELPDVITSLAL--LNDNTKKSEILAVGT-SDNMVNVLSLE- 622
Query: 516 LNLITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGH-LLNFLLNMKTG 563
I E + E +V+ A + I L L GH L+N + ++ G
Sbjct: 623 ---IVDEAISFE----TVVFQALDAIPSSLLILNQGHKLVNLHIGVEDG 664
>gi|70945139|ref|XP_742421.1| hypothetical protein [Plasmodium chabaudi chabaudi]
gi|56521397|emb|CAH76894.1| conserved hypothetical protein [Plasmodium chabaudi chabaudi]
Length = 435
Score = 68.6 bits (166), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 73/307 (23%), Positives = 128/307 (41%), Gaps = 38/307 (12%)
Query: 646 IDDIQKLHIRSIPLGEHPRRIC-HQEQSRTFAICSLKNQSCAEESEMHFVRLLDDQTFEF 704
+++I + +I+ IP +I H+E C + + ++ + + F
Sbjct: 1 LNEINETYIQKIPFYRTVEKIAYHKESGLLITSCPPEEKHKTNKNLKQIICFFNPHQNSF 60
Query: 705 ISTYPLDTFEYGCSILSCSFSDDSNVY---------YCVGTAYVLPEENEPTKGRILVFI 755
+Y + + +Y S + C + + ++Y CVGTA + +EP+ G I +F
Sbjct: 61 KYSYIIPS-KYNVSSI-CVYQINKDIYPNKSSINTLICVGTANINDRVSEPSSGHIYIFF 118
Query: 756 VEDGKLQLIAEKE------TKGAVYSLNAFNGKLLAAINQK----------IQLYKWM-- 797
+ K L K G + L F KL+ IN I L K++
Sbjct: 119 AK-KKANLFEIKHIYTHNVNVGGITHLKQFYDKLITTINNTVVILDISEFLINLDKYVDN 177
Query: 798 -----LRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEER 852
L +DGT + I++L V ++IVVGD+M S+++L Y + E
Sbjct: 178 TNKPKLENDGTIVDVASFTPSSWIMSLDV--IENYIVVGDIMTSVTILSYDFNNSILTEV 235
Query: 853 ARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVN 912
RDY+ W + V L +L ++ N +K+S DE+ +L V ++ G VN
Sbjct: 236 CRDYSNVWCTFVCALSKSHFLVSDMESNFLVFQKSSIKYNDEDSFKLSRVALFNHGHVVN 295
Query: 913 RFRHGSL 919
+ SL
Sbjct: 296 KMLPVSL 302
>gi|302694047|ref|XP_003036702.1| hypothetical protein SCHCODRAFT_63425 [Schizophyllum commune H4-8]
gi|300110399|gb|EFJ01800.1| hypothetical protein SCHCODRAFT_63425 [Schizophyllum commune H4-8]
Length = 1396
Score = 68.6 bits (166), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 116/508 (22%), Positives = 206/508 (40%), Gaps = 82/508 (16%)
Query: 573 LGTQPITLRTF----SSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAA 628
L Q LRTF + VF DRP I +++K V + H + +A
Sbjct: 891 LAEQKKALRTFVPFACAGAPAGVFFTGDRPHWIVATDK----GGVQMYPSGHAAVYAFSA 946
Query: 629 FP------DSLAIAKEGELTIGTIDDIQ---KLHIRSIPLGEHPRRICHQEQSRTF-AIC 678
+ L ++EG+ I + + L +R IP G I ++ S A
Sbjct: 947 CTLWERSTEFLIYSEEGQTLCEWITEYEIGRPLPMRHIPRGRAYSNIVYEPASSMIVAAA 1006
Query: 679 SLK----------NQSCAEES--------EMHFVRLLDDQTFEFISTYPLDTFEYGCSIL 720
SL+ NQ A + E + L+ + + + Y T E+ + +
Sbjct: 1007 SLRARFASFDEDGNQIWAPDGPGITEPTVECSTLELISPEVWATVDGYEFATNEF-VNTM 1065
Query: 721 SC----SFSDDSNV--YYCVGTAYVLPEENEPTKGRILVF----IVEDG--------KLQ 762
C + S ++ V + VGT+ V E+ KG +F +V D +L+
Sbjct: 1066 ECVPLETVSTEAGVKHFIAVGTSIVR-GEDLAVKGATYIFEVVEVVPDQSNGPKRWYRLK 1124
Query: 763 LIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQ 822
L + KG V +L N L++++ QKI + + L + + G + ++
Sbjct: 1125 LRCRDDAKGPVTALCGINNYLVSSMGQKIFVRAFDLDERLVGVAFMDVG----VYVTSLR 1180
Query: 823 TRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLF 882
+ +++GD+++ I + ++ + + RD + + V+ + AE +
Sbjct: 1181 ALKNLLLIGDVVRGIQFVAFQEDPYKLVTLGRDVSRMCATTVD------FFFAEEALAIV 1234
Query: 883 TVRKNS--------EGATDEERGRLEVV-GEYHLGEFVNRFRHGSLVMRLPDSDVGQIPT 933
T +N A D GRL + E++L FR +L+ R D IP
Sbjct: 1235 TTDENGVMSMYNYDPEAPDSHDGRLLLKQTEFNLH---TDFRTSTLIARRTKDD-PIIPQ 1290
Query: 934 --VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVD- 990
+IFG +G + + +P + L+ LQ L + ++ V GLN + R NE +
Sbjct: 1291 GILIFGGTDGTLSCLTPVPDDAAKRLQPLQLQLTRNMQHVAGLNPKALRIVRNEHVSRPL 1350
Query: 991 AKNFLDGDLIESFLDLSRTRMDEISKTM 1018
+K LDG+LI F L TR DE+++ +
Sbjct: 1351 SKGILDGNLIAYFEHLPITRQDEMTRQI 1378
Score = 45.1 bits (105), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 63/303 (20%), Positives = 116/303 (38%), Gaps = 55/303 (18%)
Query: 24 DFLFIATERYKFCVLQWDAESSELIT-------RAMGDVSDRIGRPTDNGQIGIIDPDCR 76
D L ++ + K +L+W + +L+T RA+ V+ I T ++ DP R
Sbjct: 139 DRLLVSFKDAKIALLEWSTATQDLLTVSIHTYERAIQMVATDISAFTSELRV---DPQSR 195
Query: 77 LIGLHLYDGLFKV-------------IPFDNKGQLKEAFNIRLEELQVLDIKFLYGCAKP 123
L L F + +P+ L + V+D FL G + P
Sbjct: 196 CAALSLPKDAFAILPPCQVSDSVCRDVPYSPSFILNLPSEVESGIRNVIDFTFLPGFSNP 255
Query: 124 TIVVLYQDNKD-----ARHVKTYEVALKDKDFVEGPWSQNNLDNGAD---LLIPVPPPLC 175
T+ VLY+ + T ++A D V + L G L + P
Sbjct: 256 TVAVLYETYQTWTGRLNEQKDTVKMAFFTLDIVNRRYPVIGLATGLPCDCLSVLACPSTG 315
Query: 176 GVLIIGEETIVYCSANAFK---------------AIPIRPSITKAYGRVDADGSRYLLGD 220
GV+++ +I+Y + K A+P + + ++ +GSR + D
Sbjct: 316 GVMVVASNSIIYVDQSGRKVVLPVNAWIPRMSDIALPTNLTPEEQARTLELEGSRSIFVD 375
Query: 221 HA--------GLLHLLVITHEKEKVTGLKI-ELLGETSIASTISYLDNAVVYIGSSYGDS 271
G ++ + + V+ L + L +T+I S + ++N + +GS+ GDS
Sbjct: 376 DKTAFIILKDGTIYPVELVTAGRVVSKLALGTPLAKTTIPSILRRINNDYLLVGSASGDS 435
Query: 272 QLI 274
L+
Sbjct: 436 ALL 438
>gi|114318675|gb|ABI63336.1| putative splicing factor 3b [Hordeum vulgare subsp. vulgare]
Length = 379
Score = 68.6 bits (166), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 72/274 (26%), Positives = 118/274 (43%), Gaps = 17/274 (6%)
Query: 433 SPPGYSVNVATANASQVLLATGGGHLVYLEIG-DGILTEVKHAQLEYEISCLDINPIGEN 491
+P ++ +N QV++A GG L+Y E+ G L EV+ + +++CL I P+ E
Sbjct: 1 TPGKKTITKVGSNRLQVVIALSGGELIYFEMDMTGQLMEVEKQDMSGDVACLAIAPVPEG 60
Query: 492 PSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGEIIPRSVLLCAFE-------GISY 543
S+ AVG + D ++RI SL PD L P S+L + G Y
Sbjct: 61 RQRSRFLAVGSY-DNTIRILSLDPDDCLQPLSVQSVSSAPESLLFLEVQASVGGEDGADY 119
Query: 544 -----LLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRP 598
L L +G L ++M TG+L+D + LG +P L + + S RP
Sbjct: 120 PANLFLNAGLQNGVLFRTNVDMVTGQLSDTRSRFLGLRPPKLFPCIVSHRQAMLCLSSRP 179
Query: 599 TVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTIGTIDDI-QKLHIRSI 657
+ Y L + ++ + F+S + + L I TI+ + + + SI
Sbjct: 180 WLGYIHQGHFLLTPLSCDTLESAASFSSDQCSEGVVAVAGDALRIFTIERLGETFNETSI 239
Query: 658 PLGEHPRRICHQEQSRTFAIC-SLKNQSCAEESE 690
PL PR+ + + A+ S K AE+ E
Sbjct: 240 PLRYTPRKFVILPKKKYLAVIESDKGAFSAEQRE 273
>gi|348555856|ref|XP_003463739.1| PREDICTED: cleavage and polyadenylation specificity factor subunit 1
isoform 2 [Cavia porcellus]
Length = 1387
Score = 68.2 bits (165), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 55/212 (25%), Positives = 96/212 (45%), Gaps = 22/212 (10%)
Query: 826 DFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG---AENNFNLF 882
+FI+ D+MKSISLL Y+ E + +RD + +V+ + D+ LG ++ + NL
Sbjct: 1164 NFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLM 1223
Query: 883 TVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPT--------- 933
E RL ++H+G VN F R P + P+
Sbjct: 1224 VYMYLPEAKESFGGMRLLRRADFHVGAHVNTF------WRTPCRGATEGPSKKSVVWENK 1277
Query: 934 --VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVD- 990
F T++G IG++ + + Y L LQ L ++ GLN +R + +++ +
Sbjct: 1278 HITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQN 1337
Query: 991 -AKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
+N LDG+L+ +L LS E++K + +
Sbjct: 1338 AVRNVLDGELLNRYLYLSTMERGELAKKIGTT 1369
Score = 50.1 bits (118), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 96/493 (19%), Positives = 188/493 (38%), Gaps = 143/493 (29%)
Query: 71 IDPDCRLIGLHLYDGLFKVIPF-------DNKGQLKEA----------FNIR-LEE--LQ 110
+DPD R + +Y V+PF +++G + E ++R L+E L
Sbjct: 142 VDPDGRCAAMLIYGTRLVVLPFRRESLAEEHEGLVGEGQRSSFLPSYIIDVRALDEKLLN 201
Query: 111 VLDIKFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDFVEG--P--WSQNNLD 161
++D++FL+G +PT+++L++ N+ A T + + + P WS +L
Sbjct: 202 IVDLQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNITQKVHPVIWSLTSLP 261
Query: 162 NGADLLIPVPPPLCGVLIIGEETIVYCSAN-------------AFKAIPIRPSITKAYGR 208
+ VP P+ GV+I +++Y + + A P+R T+ R
Sbjct: 262 FDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVALNSLTLGTTAFPLR---TQEGVR 318
Query: 209 VDADGS--------RYLLGDHAGLLHLL-VITHEKEKVTGLKIELLGETSIASTISYLDN 259
+ D + + ++ G +++L +IT V + + + +++ ++
Sbjct: 319 ITLDCAQAAFISYDKMVISLKGGEIYVLTLITDGMRSVRAFHFDKAAASVLTTSMVTMEP 378
Query: 260 AVVYIGSSYGDSQLIKL--NLQ---------------PDAKGSYVE-------------- 288
+++GS G+S L+K LQ P +K V+
Sbjct: 379 GYLFLGSRLGNSLLLKYTEKLQEPPASTVREAADKEEPPSKKKRVDSTAGWAGSKTVPQD 438
Query: 289 ------------------------VLERYVNLGPIVDFCVVD---LERQGQGQ----VVT 317
V + +N+GP + V + L + + +V
Sbjct: 439 EVDEIEVYGSEAQSGTQLATYSFEVCDSMLNIGPCANAAVGEPAFLSEENSPEPDLEIVV 498
Query: 318 CSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSL------------------------ 353
CSG K+G+L +++ I + EL G MW++
Sbjct: 499 CSGYGKNGALSVLQKSIRPQVVTTFELPGCYDMWTVIAPVRKEEEETPKAEGSEQEPSAP 558
Query: 354 RSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQ-LVQV 412
+ D FL++S T IL E+ E + GF +Q T+F + N+ +VQV
Sbjct: 559 EAEDDGRRHGFLILSREDSTMILQTG--QEIMELDTSGFATQGPTVFAGNIGDNRYIVQV 616
Query: 413 TSGSVRLVSSTSR 425
+ +RL+ ++
Sbjct: 617 SPLGIRLLEGVNQ 629
>gi|119195757|ref|XP_001248482.1| hypothetical protein CIMG_02253 [Coccidioides immitis RS]
gi|121769680|sp|Q1E5B0.1|CFT1_COCIM RecName: Full=Protein CFT1; AltName: Full=Cleavage factor two protein
1
gi|392862316|gb|EAS37050.2| protein CFT1 [Coccidioides immitis RS]
Length = 1387
Score = 68.2 bits (165), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 87/372 (23%), Positives = 159/372 (42%), Gaps = 37/372 (9%)
Query: 694 VRLLDDQTFEFISTYPLDTFEYGCSILSCSF-----SDDSNVYYCVGTAYVLPEENEPTK 748
++LL +T+ + +Y L E + + + + + VGTA V E+ P +
Sbjct: 1015 IKLLSPRTWSVVDSYELGDAERVMCMKTINMEISEITHEMKDMLVVGTATVRGEDITP-R 1073
Query: 749 GRILVF-IVE----------DGKLQLIAEKETKGAVYSLNAFNGK--LLAAINQKIQLYK 795
G I VF I+E + KL++ A+ + KGAV +++ G+ L+ A QK +
Sbjct: 1074 GSIYVFEIIEVAPDPDRPETNRKLKIFAKDDVKGAVTAVSGIGGQGFLIMAQGQKCMVRG 1133
Query: 796 WMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARD 855
L++DG+ + ++ L ++GD +K I Y E + +D
Sbjct: 1134 --LKEDGSLLPVAFMDMQCYVKVLKELQGTGLCIMGDALKGIWFAGYSEEPYRLTLFGKD 1191
Query: 856 YNANWMSAVEILDDD--IY-LGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVN 912
+ A + L D +Y L A+++ + + + E T + RL +H G F +
Sbjct: 1192 NEYLQVIAADFLPDGKRLYILVADDDCTIHVLEYDPEDPTSSKGDRLLHRSSFHTGHFTS 1251
Query: 913 RF-----RHGSLVMRLP---DSDVGQIPT---VIFGTVNGVIGVIASLPHEQYLFLEKLQ 961
S P D DV +P V+ + G IGV+ L + Y L LQ
Sbjct: 1252 TMTLLPEHSSSPSADDPEEDDMDVDYVPKSYQVLVTSQEGSIGVVTPLTEDSYRRLSALQ 1311
Query: 962 TNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
+ L ++ GLN + +R+ E + +DG+L+ +LD+ R EI+ +
Sbjct: 1312 SQLVTSMEHPCGLNPKAYRAV--ESDGFGGRGIVDGNLLLRWLDMGVQRKAEIAGRVGAD 1369
Query: 1022 VEELCKRVEELT 1033
+E + +E ++
Sbjct: 1370 IESIRVDLETIS 1381
>gi|167386195|ref|XP_001737658.1| spliceosomal protein sap [Entamoeba dispar SAW760]
gi|165899468|gb|EDR26056.1| spliceosomal protein sap, putative [Entamoeba dispar SAW760]
Length = 1079
Score = 67.8 bits (164), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 148/691 (21%), Positives = 280/691 (40%), Gaps = 82/691 (11%)
Query: 249 SIASTISYLDNAVVYIGSSYGDSQLI------KLNLQPDA----KGSYVEVLERYVNLGP 298
+I+ ++S+ N + +S + L+ + +QP + + Y + + P
Sbjct: 262 TISPSVSWSINGDIIFSASESQNHLVLKVPSLTIPIQPSSTYKPQSLYFNISTSISSHHP 321
Query: 299 IVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSSTD 358
+ VVD Q + G+ SL+I+ +G+ +E S+ LQ + +W+L+ +
Sbjct: 322 LTSVDVVDSHEVFQIRAFVGKGS--RSSLKILTHGLNPDECMSLSLQHPRNIWTLKPYNE 379
Query: 359 DPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVR 418
++V+ ++T +L L D L++ G QTL I LVQ+ + +
Sbjct: 380 QTHQ-YVVIGLENQTYVLKT-LPDALKQCPECGIRPNVQTLHAGMFIDGTLVQIHAHGII 437
Query: 419 LVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGH----LVYLEIGDGILTEVKH- 473
+ + E +E ++ V T+++ QV++A +VY E TE K
Sbjct: 438 TILN---EKLSEQDPGAQRTILVGTSSSCQVVIALKNAQSKCEIVYFEYN----TETKKL 490
Query: 474 AQLEYEISCLDINPIG----ENPSYSQIAAV---GMWTDISVRIFSLPDLNLITKEHLGG 526
A++E I +I + EN ++A + GM +++ S L T + G
Sbjct: 491 AEVERRIIRSNIIAMALGDFENGKAKKVAYICDNGMANILTLEKESKFLQILETNITVEG 550
Query: 527 EIIPRSVLLCAFEGIS--YLLCALG--DGHLLNFLLNMKTGELTDRKKVSLGTQPITLRT 582
E S++L F IS +L +G DG+ + +N K EL K + G P+
Sbjct: 551 E--AESIILTQFNEISEPFLFLGIGMKDGYCYGYKINGKNSELMWCKLI--GNDPVVFGQ 606
Query: 583 FSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELT 642
+ V +S +YS ++ ++ ++ + + + PF + +S+A ELT
Sbjct: 607 YEFGGNIGVIMSSSTMIFMYSYFSRIEFTPISYQPLKALAPFITDFSGESIAGITNDELT 666
Query: 643 ---IGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLDD 699
IG++DD + PL PR+I T +CS + + ++D+
Sbjct: 667 VIAIGSLDD--RFTYDEQPLLYTPRKIIKSNYFPTTVLCS-------DNKSNLYNPIIDN 717
Query: 700 QTFEFIS-TYPLDTFEYG----------CSILSCSFSDDSNVY----------YCVGTAY 738
+T EF +D E G SI F D + C ++
Sbjct: 718 ETNEFNERKIGVDLQEDGKWASEVGIINNSIFKSYFKTDEAIICGAHVSFKKKQCFISSQ 777
Query: 739 VLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWML 798
V + + KG + + +LI + + +L + +LLA I ++LY
Sbjct: 778 V--KHYKKMKGECWINVYNLKTFKLIHKTPVEDICRALTSCGERLLAGIGTTLRLY---- 831
Query: 799 RDDGTRELQSECGHHGHILAL-YVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYN 857
D G + L +C G + ++ GD IVVG + + Y + + +D
Sbjct: 832 -DLGKQILIRKCELDGFPSIINSLEVIGDKIVVGTVATGFIYVNYDSDANILSITEKDRI 890
Query: 858 ANWMSAVEILDDDIYLGAENNFNLFTVRKNS 888
+ ++A ILD+ +G + ++F N+
Sbjct: 891 WHSLTASTILDETSTIGFDKLGSVFITETNT 921
>gi|326501484|dbj|BAK02531.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 101
Score = 67.0 bits (162), Expect = 5e-08, Method: Composition-based stats.
Identities = 36/67 (53%), Positives = 44/67 (65%)
Query: 632 SLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEM 691
SLAIA+EGEL+IG+IDD+QKLHIR+I L E R I HQEQS T + A ES
Sbjct: 4 SLAIAEEGELSIGSIDDVQKLHIRTILLIEQARSIFHQEQSMTRCFAFSSTPTIAGESFA 63
Query: 692 HFVRLLD 698
RL++
Sbjct: 64 KIHRLME 70
>gi|327287424|ref|XP_003228429.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1-like [Anolis carolinensis]
Length = 1294
Score = 67.0 bits (162), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 67/227 (29%), Positives = 104/227 (45%), Gaps = 31/227 (13%)
Query: 710 LDTFEYGCSILSCSFSDDSNV-----YYCVGTAYVLPEENEPTKGRILVF-----IVEDG 759
L+ +E+ + + S + V Y VGT + EE +GRIL+ + E G
Sbjct: 1048 LEEWEHVTCMKTVSLKSEETVSGLKGYIAVGTCLMQGEEVT-CRGRILIMDIIEVVPEPG 1106
Query: 760 ------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDD---GTRELQSEC 810
K +++ EKE KG V +L NG L++AI QKI L W L+D+ G + ++
Sbjct: 1107 QPLTKNKFKVLYEKEQKGPVTALCHCNGYLVSAIGQKIFL--WSLKDNDLTGMAFIDTQL 1164
Query: 811 GHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDD 870
H I + +FI+ D+MKSISLL Y+ E + +RD + V+ + D
Sbjct: 1165 YIHQMI------SVKNFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYCVDFMVDS 1218
Query: 871 IYLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRF 914
LG ++ + NL E RL ++H+G VN F
Sbjct: 1219 CQLGFLVSDRDRNLLVYMYLPEAKESFGGMRLLRRADFHVGAHVNAF 1265
Score = 47.4 bits (111), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 48/215 (22%), Positives = 98/215 (45%), Gaps = 33/215 (15%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
+G + ++ + G +D L ++ + K V+++D + +L T ++ + R
Sbjct: 74 FFGNVMSMASVQLAGAKRDALLLSFKDAKLSVVEYDPGTHDLKTLSLHYFEEPELRDGFV 133
Query: 66 GQIGI----IDPDCRLIGLHLYDGLFKVIPF-------DNKGQLKEA----------FNI 104
+ I +DPD R + +Y V+PF +++G + E +I
Sbjct: 134 QNVHIPKVRVDPDGRCAVMLIYGTRLVVLPFRRDTLTDEHEGVVGEGQKSSFLPSYIIDI 193
Query: 105 R-LEE--LQVLDIKFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDFVEGP-- 154
R L+E L ++D++FLYG +PT+++L++ N+ A T + + ++
Sbjct: 194 RELDEKLLNIIDMQFLYGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNIMQKVHP 253
Query: 155 --WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVY 187
WS +NL + VP P+ GV+I +++Y
Sbjct: 254 VIWSLSNLPFDCTQALAVPKPIGGVVIFAVNSLLY 288
>gi|70952260|ref|XP_745310.1| splicing factor 3b, subunit 3, 130kD [Plasmodium chabaudi chabaudi]
gi|56525593|emb|CAH80831.1| splicing factor 3b, subunit 3, 130kD, putative [Plasmodium chabaudi
chabaudi]
Length = 530
Score = 66.6 bits (161), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 96/475 (20%), Positives = 193/475 (40%), Gaps = 71/475 (14%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIG-RPTD 64
I+G I +LE+FR G +D++ I ++ + +L++D E ++ I R + + G R
Sbjct: 58 IFGIIRSLEIFRLTGSNKDYIAIGSDSGRLVILKYDDEKNDFI-RVHCETYGKSGIRRIV 116
Query: 65 NGQIGIIDPDCRLIGLHLYDG--LFKVIPFDNKGQLKEAFNIRLEELQVL---DIKFLYG 119
G+ +DP R + L + ++ DNK L + + + + + G
Sbjct: 117 PGEYIAVDPKGRALMLCAIEKQKFVYILNRDNKENLTISSPLEAHKSHSICHAVVGLNVG 176
Query: 120 CAKPTIVVLYQDNKDA-RHVKTYEVAL-----KDKDFVEGP------WSQN--------- 158
P V + Q+ + + V + L +D+ +E P W +
Sbjct: 177 FENPMFVSIEQNYETLDKEVTNGNIQLSSNPNEDQQIMEYPKKGLCFWEMDLGLNHVIKK 236
Query: 159 ---NLDNGADLLIPVP---PPLCGVLIIGEETIVY---------CSANAFKAIPIRPSIT 203
+D A LLIP+P G+++ E +VY CS + +I+
Sbjct: 237 HTIPIDITAHLLIPLPGGQQGPSGLIVCCENYLVYKKIDHDDVYCSYPRRLEVGEEKNIS 296
Query: 204 KAYGRVDADGSRY--LLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAV 261
+ + + L+ G L+ + + HE V + + IA++I L +
Sbjct: 297 IVCWTIHRIKTFFFILIQSEYGDLYKIEVNHEDGIVKEIICKYFDTVPIANSICVLKSGA 356
Query: 262 VYIGSSYGDSQLIKLN-------------LQPDAKGSYVE----------VLERYVNLGP 298
+++ + +G+ + + P K + + ++++ +L P
Sbjct: 357 LFVAAEFGNHFFYQFSGIGNDSNESMCTSNHPSGKNAIIAFKTQKLKNLYLVDQIYSLSP 416
Query: 299 IVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSST 357
I+D ++D + Q+ G SLRI+++G+ I E A+ EL G K +W+++
Sbjct: 417 IIDMKILDAKNSSLPQIYALCGRGPRSSLRILQHGLSIEELANNELPGKPKYIWTIKKDN 476
Query: 358 DDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
+D +++VSF T IL + + +EE + T+ + N +QV
Sbjct: 477 SSEYDGYIIVSFEGNTLILEIG--ETVEEVYDSLLLTNVTTIHINLLYDNSFIQV 529
>gi|149066088|gb|EDM15961.1| cleavage and polyadenylation specific factor 1, 160kDa (predicted),
isoform CRA_b [Rattus norvegicus]
Length = 241
Score = 66.6 bits (161), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 55/212 (25%), Positives = 96/212 (45%), Gaps = 22/212 (10%)
Query: 826 DFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG---AENNFNLF 882
+FI+ D+MKSISLL Y+ E + +RD + +V+ + D+ LG ++ + NL
Sbjct: 18 NFILAADVMKSISLLRYQEESKTLSLVSRDAKPLEVYSVDFMVDNAQLGFLVSDRDRNLM 77
Query: 883 TVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPT--------- 933
E RL ++H+G VN F R P + P+
Sbjct: 78 VYMYLPEAKESFGGMRLLRRADFHVGAHVNTF------WRTPCRGAAEGPSKKSVMWENK 131
Query: 934 --VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVD- 990
F T++G IG++ + + Y L LQ L ++ GLN +R + +++ +
Sbjct: 132 HITWFATLDGGIGLLLPMQEKTYRRLLMLQNALTTMLPHHAGLNPRAFRMLHVDRRILQN 191
Query: 991 -AKNFLDGDLIESFLDLSRTRMDEISKTMNVS 1021
+N LDG+L+ +L LS E++K + +
Sbjct: 192 AVRNVLDGELLNRYLYLSTMERSELAKKIGTT 223
>gi|85682925|gb|ABC73438.1| CG13900 [Drosophila miranda]
Length = 341
Score = 66.6 bits (161), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 64/288 (22%), Positives = 120/288 (41%), Gaps = 37/288 (12%)
Query: 718 SILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVFIVED---GKLQLIAEKETKGAVY 774
+++ S + D Y VG A L ++G L D L+ + E
Sbjct: 67 AMVKFSVAADGRYYLAVGIARDLQLNPRISQGGYLDIYKIDPTCSSLEFMHRTEIDEIPG 126
Query: 775 SLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSEC-GHHGHILALYVQTRGDFIVVGDL 833
+L F G+LLA + +++Y D G +++ +C H + +Q G + V D+
Sbjct: 127 ALCGFQGRLLAGCGRMLRIY-----DLGKKKMLRKCENKHIPYQIVNIQAMGHRVYVSDV 181
Query: 834 MKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATD 893
+S+ L Y+ E + A D + W++A +LD D + + F ++++ TD
Sbjct: 182 QESVFFLRYRRAENQLIIFADDTHPRWVTATTLLDYDT-IAIADKFGNLSIQRLPHSVTD 240
Query: 894 E------------ERGRL-------EVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTV 934
+ +RG L E + +H+GE + + +L+ G +
Sbjct: 241 DVDEDPTGTKSLWDRGLLSGASQKSENICSFHVGEIIMSLQKATLI-------PGGSEAL 293
Query: 935 IFGTVNGVIGVIASL-PHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRS 981
I+ T+NG +G E Y F + L+ ++R + G +H +RS
Sbjct: 294 IYSTLNGTVGAFVPFTSREDYDFFQHLEMHMRNENPPLCGRDHLSYRS 341
>gi|308805673|ref|XP_003080148.1| cleavage and polyadenylation specificity factor (ISS) [Ostreococcus
tauri]
gi|116058608|emb|CAL54315.1| cleavage and polyadenylation specificity factor (ISS), partial
[Ostreococcus tauri]
Length = 1473
Score = 66.2 bits (160), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 87/360 (24%), Positives = 155/360 (43%), Gaps = 41/360 (11%)
Query: 694 VRLLDDQTFEFISTYPLDTFEYGCSILSCSFSD----DSNVYYCVGTAYVLPE-ENEPTK 748
VRLL + + + + L+ E+ + + D S Y VGTA +P E+ P +
Sbjct: 845 VRLLVPGSLDCVWQHTLEPGEHVQCVRNVQLKDINTGHSLSYLAVGTA--MPGGEDTPCR 902
Query: 749 GRILVFIV--------EDG---KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWM 797
GR+ +F + DG K Q+ +E K A +L G L+ A+ K+ ++ W
Sbjct: 903 GRVYLFNMVWERDSESADGYRWKGQVCCVREAKMACTALEGLGGHLIVAVGTKLTVHTW- 961
Query: 798 LRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHE--EGAIEERARD 855
DG REL S I + + +FI+VGDL K + +K E ++ + ++D
Sbjct: 962 ---DG-RELNSVAFFDTPIHTVSINVVKNFILVGDLEKGLHFFRWKDTGFEKSLIQLSKD 1017
Query: 856 YNANWMSAVEILDDDI---YLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVN 912
+ + + E L D LG++ + N T + + + +L YH+G ++
Sbjct: 1018 FERMDVVSSEFLIDGTTLSLLGSDMSGNARTFGYDPKSIESWKGQKLLPRAAYHVGSPIS 1077
Query: 913 R---FRHGSLVMRLPDSDVGQIP-----TVIFGTVNGVIGVIASLPHEQYLFLEKLQTNL 964
R F ++ +D G+ V FGT++G +G+ Y L +Q L
Sbjct: 1078 RMVRFNVEGSKSKMASTD-GKPKGANRFAVFFGTLDGALGIFMPTDPVTYEKLLAIQREL 1136
Query: 965 RKVIKGVGGLNHEQWRS---FNNEKKTVDAK-NFLDGDLIESFLDLSRTRMDEISKTMNV 1020
++ G N +R+ F + + A + LDG L+ F L+ + +I+ + V
Sbjct: 1137 TTAVRSPIGCNPRTFRTPKVFEGKHVQLRAPLDVLDGGLLSKFETLTFSEQVKIASSAQV 1196
>gi|196476797|gb|ACG76262.1| damaged-DNA binding protein DDB p127 subunit [Amblyomma americanum]
Length = 74
Score = 66.2 bits (160), Expect = 9e-08, Method: Composition-based stats.
Identities = 32/73 (43%), Positives = 48/73 (65%), Gaps = 10/73 (13%)
Query: 974 LNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTM----------NVSVE 1023
++H WRSF+ E+KT F+DGDLIESFLDLSR +M E+ + + + +V+
Sbjct: 2 IDHAFWRSFSTERKTEQPVGFIDGDLIESFLDLSRDKMQEVVQGIQMDDGSGMKKDATVD 61
Query: 1024 ELCKRVEELTRLH 1036
+L K +EEL+R+H
Sbjct: 62 DLIKIIEELSRIH 74
>gi|407851810|gb|EKG05523.1| damage-specific DNA binding protein, putative [Trypanosoma cruzi]
Length = 313
Score = 66.2 bits (160), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 43/154 (27%), Positives = 67/154 (43%), Gaps = 36/154 (23%)
Query: 897 GRLEVVGEYHLGEFVNRFRHGS--------LVMRLPDSDVGQI--PTVIFGTVNGVIGVI 946
G+LE +YHLG+ V + GS + + +P + + I P V+FGT +G G I
Sbjct: 81 GKLETSAQYHLGDMVTAMQLGSFAPCSVTNIAVPIPTTLIPGICGPQVVFGTSHGAFGTI 140
Query: 947 ASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRS------------------------- 981
+ +E YL L+ L+ + V+ +GG H +R
Sbjct: 141 TPVSNETYLLLKALEVAVSSVVPALGGFEHVTYREVLRAGQERGYSRNASFENANVVSSG 200
Query: 982 -FNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEI 1014
F+ +K ++ GDLIESFL S+T I
Sbjct: 201 VFDKRRKRYLSRCVCSGDLIESFLTFSQTIQQRI 234
>gi|198415711|ref|XP_002123169.1| PREDICTED: similar to cleavage and polyadenylation specificity
factor 1, partial [Ciona intestinalis]
Length = 1370
Score = 65.9 bits (159), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 112/592 (18%), Positives = 224/592 (37%), Gaps = 135/592 (22%)
Query: 6 IYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDN 65
++G + + R G D + ++ K ++++D + ++ T ++ D + +
Sbjct: 74 LFGNVVRMRSVRLAGAKLDSVLLSFAEAKLSIIEFDQATHDIKTTSLHYFEDALYKDGSY 133
Query: 66 GQIGI----IDPDCRLIGLHLYDGLFKVIPFD-NKGQLK--------------------- 99
+I + +DP+ R + L L V+P N L
Sbjct: 134 QRITLPKIAVDPESRCVALQLTTKSVAVVPLRANTAALATDDGAAPQDNVSLQNKRSTTS 193
Query: 100 -----EAFNIRLEELQVLDIKFLYGCAKPTIVVLYQD-----NKDARHVKTYEVALKDKD 149
A + RL+ +++DI+FL+G +PT++VL++ + A T + +
Sbjct: 194 YTIDLHAVDARLQ--RIIDIQFLHGYNEPTLLVLFESLRTWAGRVAMRQDTCNIVAISLN 251
Query: 150 FVEG----PWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANA------------- 192
E WS N L PVP P+ GVLI +I+Y + +
Sbjct: 252 MAEQLHPVVWSLNGLPFDCKYAYPVPKPIGGVLIFAVNSILYLNQSVPPYGTSLNSTTEN 311
Query: 193 FKAIPIRP------SITKAYGRVDADGSRYLLGDHAGLLHLLVITHEKEKVTGLKIELLG 246
+ P++P ++ ++ + S + + L L ++ V +
Sbjct: 312 STSFPLKPQEDVCMTLDCSHAMFISPESLVISLKNGELYVLTLLVDSMRSVRNFHFDKSA 371
Query: 247 ETSIASTISYLDNAVVYIGSSYGDSQLIK-----------------------LNLQPDAK 283
+ + S ++ LD+ +++GS G+S L+K LN D
Sbjct: 372 SSVLTSCLTVLDDGFLFLGSRLGNSLLLKYTEARPVFRNCYHTEEPAAKRKRLNTAADWA 431
Query: 284 GS-----------------------YVEVLERYVNLGPI-------VDFCVVDL--ERQG 311
S EV + VN+GP F + +R+
Sbjct: 432 ASDTNDIDLQMYGKDTVTSEPLSSYKFEVCDSLVNIGPCGAAELGEPAFLSEEFVSQRES 491
Query: 312 QGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRS---STDDPFDT----- 363
++ SG K+G++ +++ + + EL G MW+++S T+ P T
Sbjct: 492 DLELAILSGHGKNGAISVLQRSVKPQVVTTFELPGCIDMWTVKSVCEKTELPTKTQQQQH 551
Query: 364 -FLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQ--LVQVTSGSVRLV 420
+L++S E L + E+ E E GF ++ Q++F + ++ ++QV + V L+
Sbjct: 552 SYLILS--REESTLILETGKEIMEVENSGFNTREQSVFVGNIGGDKELILQVCASGVWLL 609
Query: 421 SSTS--RELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEIGDGILTE 470
+ + + E SP + + LL T G L+ L + + + +E
Sbjct: 610 AGVKLLQHIPLELGSP----ITQCSICDPYALLLTSDGDLIMLTLTNDLDSE 657
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 103/459 (22%), Positives = 191/459 (41%), Gaps = 63/459 (13%)
Query: 580 LRTFSS-KNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMC--PFNSAAFPDS-LAI 635
LRTFS+ + VF P I+ + + L + + S C PF++ P+ L
Sbjct: 917 LRTFSNIGGYSGVFLCGPYPYWIFVTIRGALCCHPMSVDGSVSCFVPFHNVNCPNGFLYF 976
Query: 636 AKEGELTIGTIDDIQKLH----IRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAE---- 687
+GEL I + K +R I L + + + + +A+ + ++ C
Sbjct: 977 NSQGELRICMLPPHMKYDTAWPMRKITLRCSVHFLAYSIEHKVYALVTSVSEPCTRLPYL 1036
Query: 688 --ESEMHF------------------VRLLDDQTFEFISTYPLDT--FEYGCSI----LS 721
E+E F V+L+ +++ + LD FE+ + LS
Sbjct: 1037 TFENEREFEDLEKGDRFIYPHIDKFSVQLISPASWDLVPNARLDMGEFEHITCMKNVWLS 1096
Query: 722 CSFSDDSNVYYCV-GTAYVLPEENEPTKGRILVF-----IVEDG------KLQLIAEKET 769
C + + V GT V EE ++G+I++ + E G KL+ I +E
Sbjct: 1097 CGQDSSARQNFLVLGTVNVFGEEMS-SRGKIIILEVIEVVPEPGQPLTKNKLKQIYSEEQ 1155
Query: 770 KGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIV 829
KG V ++ G LL AI QKI ++++ D + L+ ++ + + F +
Sbjct: 1156 KGPVTAVCGLEGNLLTAIGQKIFIWRF----DENQSLRGLAFVDTNVYIHHALSFRSFAL 1211
Query: 830 VGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDI---YLGAENNFNLFTVRK 886
VGD+ +SI+LL Y+ + + +RD + +++ D +L +++ NL
Sbjct: 1212 VGDIQRSITLLRYQTDFKTLSVTSRDVRPLEVYTADLVVDGTGINFLVSDHEKNLVLFAY 1271
Query: 887 NSEGATDEERGRLEVVGEYHLGEFVN-RFRHGSL-VMR---LPDSDVGQIPTVIFGTVNG 941
+ E RL + H+G N +R + V R LP+ + + GT++G
Sbjct: 1272 DPEDHESHGGSRLTKRADMHIGSRANCMWRVAACGVDRSTGLPNQPYAGVHITMMGTLDG 1331
Query: 942 VIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWR 980
I + + + Y L LQ + ++ + GLN + +R
Sbjct: 1332 SICHVLPVAEKVYRRLLMLQNIMITGLQHIAGLNPKAFR 1370
>gi|401426063|ref|XP_003877516.1| putative CPSF-domain protein [Leishmania mexicana MHOM/GT/2001/U1103]
gi|322493761|emb|CBZ29051.1| putative CPSF-domain protein [Leishmania mexicana MHOM/GT/2001/U1103]
Length = 1347
Score = 65.9 bits (159), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 75/346 (21%), Positives = 138/346 (39%), Gaps = 69/346 (19%)
Query: 734 VGTAYVLPEENEPTKGRILVFIVED----GKLQLIAEKETKGAVYSLNA---FNGKLLAA 786
+G+++ P+E GRI + + +L LIA K+ GA+ A + G++
Sbjct: 925 IGSSFTFPDEQRARSGRITWCALREEHQRQRLHLIASKDIGGALQCCAAVPHYKGRIALG 984
Query: 787 INQKIQLYKWMLRDDGTRELQSECG---HHGHILALYVQTRGDFIVVG-DLMKSISLLIY 842
+N + LY W +D T + C ++ LY + ++V D+ S +
Sbjct: 985 VNGCVCLYNWNT-EDQTFVAEERCRIGLTVTKLIPLYDTSLAASVLVALDVRHSAFFIEV 1043
Query: 843 KHEEGAIEERARDYNANWMSAVEILDDDIYLGAEN--------NFNLFTV---------R 885
+G+++ RD + ++D I AEN NF V
Sbjct: 1044 DTLQGSLKVLCRDADLRG-----VMDGHIGSDAENLCLFDDSLNFTAMKVVPLPVEAGDG 1098
Query: 886 KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSL----VMRLPDS---DVGQI------- 931
+ + + R EV + HLG+ V R G +M P + ++
Sbjct: 1099 DAAAAGSVTAQYRFEVRAQCHLGDLVTCVRPGCFAATSLMEAPAACSLSRNRLLLPGIAG 1158
Query: 932 PTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWR----------- 980
P ++F T +G GV+ + YL L L+ +L + ++ VGGL+H+ +R
Sbjct: 1159 PQLVFATAHGGFGVVTPVQAATYLVLRALEASLVRTLQPVGGLSHQAFREVLRAGQERGV 1218
Query: 981 ---------SFNNEK-KTVDAKNFLDGDLIESFLDLSRTRMDEISK 1016
+ E+ + + N +DGD++E F+ L + I +
Sbjct: 1219 SYLASKTGCALTRERLRRYEPLNTVDGDMVELFVQLPPSDKKHICR 1264
>gi|313215162|emb|CBY42850.1| unnamed protein product [Oikopleura dioica]
Length = 228
Score = 65.5 bits (158), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 56/219 (25%), Positives = 106/219 (48%), Gaps = 10/219 (4%)
Query: 826 DFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLG-----AENNFN 880
++ +VGD+ + I+LL ++ E I + +R A ++AV IL D +G + N
Sbjct: 5 NYALVGDIQQGITLLRHQGERNCISQISRARRAGEVTAVGILLDGNQVGLVSTDMQRNLQ 64
Query: 881 LFTVR---KNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFG 937
++ + K S G R +G+ + + + R ++ ++ +
Sbjct: 65 VYMYKPDQKESNGGKQLVRQADINLGKRVISIWNSLGRQNDTFTKVALTENDARHVTFYA 124
Query: 938 TVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVD--AKNFL 995
++G IG I + + + LE LQT ++ + GGLN ++R NE + ++ AKN +
Sbjct: 125 GLDGSIGDIVPVSEKVFRRLEMLQTLVQSHLPHYGGLNPREYRYCTNEYRDLENAAKNII 184
Query: 996 DGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTR 1034
DGDL+E F LS T ++S+ + V+ E L + ++ R
Sbjct: 185 DGDLLERFNGLSFTEQTDLSRKIGVTREALLDDMMDVQR 223
>gi|392558419|gb|EIW51607.1| hypothetical protein TRAVEDRAFT_176174 [Trametes versicolor FP-101664
SS1]
Length = 1431
Score = 65.5 bits (158), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 111/515 (21%), Positives = 215/515 (41%), Gaps = 65/515 (12%)
Query: 565 LTDRKKVSLGTQPITLRTFSSKNTTHVFAASDRPTVIYSSNK---KLLYSNVNLKEVSHM 621
L ++K++S P + + VF DRP+ I S++K K+ S ++ +
Sbjct: 921 LAEQKRISRLLVPFVTSPAPGQTFSGVFFTGDRPSWILSTDKGGVKVFPSGHSVVQA--- 977
Query: 622 CPFNSAAFPDS----LAIAKEGELTIGTIDDIQ---KLHIRSIPLGE-HPRRICHQEQSR 673
F +++ +S L ++EG + + D+Q L RS+P + + S
Sbjct: 978 --FTTSSLWESRGDFLLYSEEGPSLVEWMPDVQLDGHLPARSVPRSRPYSNVVFDASTSL 1035
Query: 674 TFAICSLKNQSCAEESEMHFV----------RLLDDQTFEFIST---YPLDTFEYG---- 716
A S +N+ + + + + V L + T E IS +D +E+
Sbjct: 1036 IVAASSFQNRFASYDEDGNVVWEPDSPNISSPLCECSTLELISPDGWITMDGYEFAPNEF 1095
Query: 717 ----CSILSCSFSDDSNV--YYCVGTAYVLPEENEPTKGRILVF----IVEDG------- 759
SI + S +S + + VGT + E+ KG + +F +V D
Sbjct: 1096 VNCIVSIPLETMSTESGMKDFIAVGTT-INRGEDLAVKGAVYIFEIVEVVPDPSTHVKRW 1154
Query: 760 -KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILA 818
+L+L+ + KG V L NG L++++ QKI + + L + + G +
Sbjct: 1155 WRLKLLCRDDAKGPVSFLCGINGYLVSSMGQKIFVRAFDLDERLVGVAFLDVG----VYV 1210
Query: 819 LYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEILDDDIYLGAENN 878
++ + +V+GD +KS+ + ++ + + +D ++ ++ D L
Sbjct: 1211 TSLRAVKNLLVIGDAVKSVWFVAFQEDPYKLVVLGKDPQLCCITRADLFFADGQLSIVTC 1270
Query: 879 FNLFTVRKNSEGATDEERG---RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPT-- 933
VR + D E L E+H G+ + +R LV R P + +IP
Sbjct: 1271 DEEGIVRLYAYDPHDPESKSGQHLLRRTEFH-GQ--SEYRSSMLVARRPKNGDPEIPQAR 1327
Query: 934 VIFGTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVD-AK 992
++ G+V+G + + + L LQ L + ++ V LN + +R NE + +K
Sbjct: 1328 LVCGSVDGSLSTLTYVDEAASKRLHLLQGQLIRTVQHVAALNPKAFRMVRNEYVSRPLSK 1387
Query: 993 NFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCK 1027
LDG+L+ +F DL R +E+++ + + K
Sbjct: 1388 GILDGNLLATFEDLPIARQNEVTRQIGTDRATVLK 1422
>gi|345566738|gb|EGX49680.1| hypothetical protein AOL_s00078g169 [Arthrobotrys oligospora ATCC
24927]
Length = 1407
Score = 65.1 bits (157), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 106/488 (21%), Positives = 187/488 (38%), Gaps = 60/488 (12%)
Query: 591 VFAASDRPTVIYSSNKKL--LYSNVNLKEVSHMCPFNSAAFPDS-LAIAKEGELTIGTID 647
F A D PT I S+K L LY + V + PFN+ L I +G +
Sbjct: 913 AFMAGDTPTFITKSSKTLPKLY-KLQGGMVRSLSPFNTKETERGFLYIDSKGTARVCHFP 971
Query: 648 DIQKLHI---RSIPLGEHPRRICHQEQSRTFAICSLK------------------NQSCA 686
++ H + IPL P + + + + + L +++
Sbjct: 972 EVSMEHTWLSQRIPLERTPTSLTYYDPKNVYVVSVLSTSKPEVDDEDFQMEEGLVDETLL 1031
Query: 687 EESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSF-----SDDSNVYYCVGTAYVLP 741
E E + ++ T+ Y E + + + + V VGT +L
Sbjct: 1032 PELETGHLVMISPVTWTTTDRYEFPVHEVPFVVKAVELEISEVTKERKVLIAVGTG-LLR 1090
Query: 742 EENEPTKGRILVFIVED-----------GKLQLIAEKETKGAVYSLNAFNGKLLAAINQK 790
EN P +G + VF V D K +LI+ +E KG V +L +G LL QK
Sbjct: 1091 GENSPARGAVYVFDVIDVVPEIGKPETGKKFKLISREEVKGVVSTLAGMDGYLLITHGQK 1150
Query: 791 IQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIE 850
+ L++DG+ + + H +T ++ GD++K +S + + E +
Sbjct: 1151 CMIRG--LKEDGSLLPVAFMDMNTHTTV--AKTLEKMVMFGDVLKGVSFVGFSEEPYKMI 1206
Query: 851 ERARDYNANWMSAVEILDDDI---YLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHL 907
+D ++A + L ++ A+ N+ ++ + E RL GE +
Sbjct: 1207 LFGKDPRQLSITAGDFLPAGTACYFVVADAQSNIHVLQYDPENPKSIHGNRLLPKGEIYC 1266
Query: 908 GEFVNRF----RHGSLVMRLPDSDVGQIPT----VIFGTVNGVIGVIASLPHEQYLFLEK 959
G V + SL + D+ + +F T+ GV G ++S+ Y L
Sbjct: 1267 GHEVKSICILPKKKSLFTEPDEDDMDEDEDEEFLCMFSTMTGVFGTVSSITESMYRRLNV 1326
Query: 960 LQTNLRKVIKGVGGLNHEQWRS--FNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKT 1017
+Q + + + GLN +R+ F N + + LDG L+ +L L R E++
Sbjct: 1327 IQGQITNTGEHIAGLNPRAYRAAKFRN-TSSEPMRAILDGKLLVRWLMLGAGRRKELAGR 1385
Query: 1018 MNVSVEEL 1025
S E L
Sbjct: 1386 AGTSEEML 1393
>gi|50549733|ref|XP_502337.1| YALI0D02717p [Yarrowia lipolytica]
gi|74634908|sp|Q6CAH5.1|RSE1_YARLI RecName: Full=Pre-mRNA-splicing factor RSE1
gi|49648205|emb|CAG80525.1| YALI0D02717p [Yarrowia lipolytica CLIB122]
Length = 1143
Score = 64.7 bits (156), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 155/735 (21%), Positives = 283/735 (38%), Gaps = 102/735 (13%)
Query: 350 MWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQL 409
+W++R D ++V+S+ + T L + + D + ET G TL C ++ +
Sbjct: 459 LWTMRDGAGS--DKYIVLSYANAT--LVLEIGDSVVETTSSGLTLDKPTLHC-GSVGSSY 513
Query: 410 VQVTSGSVRLVSSTSRELRNE------WKSPPGYSVNVATANASQVLLATGGGHLVYLEI 463
VQV + + ++ SRE +E W +P G V A++++ QV+L L Y E
Sbjct: 514 VQVMTDGMNVIP-MSREGSSESLPATKWTAPSG-QVICASSSSHQVVLGLTSS-LFYFED 570
Query: 464 GDGILTEVKHAQLEYEIS----CLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNL 518
G +E+ YE+S + + P+ S AV D +VRI S+ P+
Sbjct: 571 TPG--SELSAYDGAYELSSPPTAVAVAPVPAGRVRSPFVAVAT-DDETVRIVSVDPESMF 627
Query: 519 ITKEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDR--KKVSLGTQ 576
T G S+ L + + YL L +G + L+ TGE+ K V LG
Sbjct: 628 ETVAVQGLMATASSLALLSVGQVLYLHMGLANGVYVRVELDPLTGEIVGSWSKFVGLGRL 687
Query: 577 PITLRTFSSKNTT------------HVFAASDRPTVIYSSNKKLLYS--NVNLKEVSHMC 622
+ T + + HV A SD V N ++ ++ + +
Sbjct: 688 SVVPVTCGGEESILVSSRGVKTCLGHVNATSDT-WVPTGGNSAPFFALDAISGEPLDLAH 746
Query: 623 PFNSAAFPDSLAIAKEGELTIGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKN 682
F++ P + L I T++ QK + L +R+ Q + T I +
Sbjct: 747 SFHTQDCPHGVIGVAGSTLKIFTVNTAQKWTENEVKLEGTAKRLI-QHDATTLTITQNPD 805
Query: 683 QSCAEESEMHFVRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYCVGTAYVLPE 742
+ + +D+ D SI F D Y+ VG +
Sbjct: 806 RLVS----------VDNGAVGITK----DLGGPPTSICEVMFGDGKR-YFAVGGSRDGSP 850
Query: 743 ENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDG 802
T G I +F L + E + +L A+NG L+A I +++LY L+
Sbjct: 851 GTSGTSGYISIF--SSSSLGHVHTTEVEAPPLALCAYNGLLVAGIGSQVRLYALGLKQV- 907
Query: 803 TRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGA--IEERARDYNANW 860
R+ Q E LA + + + + VGD+ +S+++ + E+ I D +
Sbjct: 908 LRKAQIELSKRVTCLAHFAGS--NRVAVGDIRQSVTVCVVLEEDSGHVIYPLVCDKISRQ 965
Query: 861 MSAVEILD-DDIYLGAE-NNFNLFTVRKNSEGATDEERGRLEVV-------GEYHLGEFV 911
++ + +D + + LG F + + + DE+ + + G H
Sbjct: 966 VTCLFFVDYETVALGDRFGGFTMLRIPSEASKLADEDHNAVHLRQLEPTLNGPAHF---- 1021
Query: 912 NRFRHGSLVMRLPDSDVGQIPTVI------------FGTVNGVIGVIASLPHEQYLFLEK 959
RF H + + +P I GTV+ + V++ +Q L+
Sbjct: 1022 -RFDH------VASFHIEDVPVAIHMYNDYLVVCGLLGTVSAFVPVVSP---KQSRDLKT 1071
Query: 960 LQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMN 1019
++ + G+ G +H ++R + V K +DGD++ L + R +E+ +
Sbjct: 1072 IEKFVCASDPGLMGRDHGRFRGYY-----VPVKEVVDGDMLREVLVMDEKRREEVGEKTG 1126
Query: 1020 VSVEELCKRVEELTR 1034
+ VE RV + +
Sbjct: 1127 LGVEGAVGRVVNVMK 1141
>gi|407039323|gb|EKE39583.1| CPSF A subunit region protein, putative [Entamoeba nuttalli P19]
Length = 1108
Score = 64.7 bits (156), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 167/790 (21%), Positives = 316/790 (40%), Gaps = 92/790 (11%)
Query: 298 PIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSST 357
P+ VVD Q + G+ SL+I+ +G+ +E S+ LQ + +W+L+
Sbjct: 350 PLTSIDVVDSHEVLQIRAFVGKGS--RSSLKILTHGLNPDECMSLSLQHPRNIWTLKPYN 407
Query: 358 DDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSV 417
+ ++V+ ++T +L L D L++ G QTL I LVQ+ + +
Sbjct: 408 EQTHQ-YVVIGLENQTYVLK-TLPDALKQCPECGIRPNVQTLHAGMFIDGTLVQIHAHGI 465
Query: 418 RLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGH----LVYLEIGDGILTEVKH 473
+ + E +E ++ V T+++ QV++A +VY E TE K
Sbjct: 466 ITILN---EKLSEQDPGAQRTILVGTSSSCQVVIALKNAQSKCEIVYFEYN----TETKK 518
Query: 474 -AQLEYEISCLDINPIG----ENPSYSQIAAV---GMWTDISVRIFSLPDLNLITKEHLG 525
A++E I +I + EN ++A + GM +++ S L T +
Sbjct: 519 LAEVERRIIRSNIIAMALGDFENGKAKKVAYICDNGMANILTLEKESKFLQILETNITVE 578
Query: 526 GEIIPRSVLLCAFEGIS--YLLCALG--DGHLLNFLLNMKTGELTDRKKVSLGTQPITLR 581
GE S++L F IS +L +G DG + + K EL K + GT P+
Sbjct: 579 GE--AESIVLTQFNEISEPFLFLGIGMKDGCCYGYKVIGKNSELMWCKLI--GTDPVVFG 634
Query: 582 TFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGEL 641
+ V +S +YS ++ ++ ++ + + + PF + +S+A EL
Sbjct: 635 QYEFGGNIGVIMSSSTMIFMYSYFSRIEFTPISYQPLKALAPFITDFSGESIAGITNDEL 694
Query: 642 T---IGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFVRLLD 698
T IG++DD + PL PR++ T +CS + + +D
Sbjct: 695 TVIAIGSLDD--RFTYDEQPLLYTPRKVIKSNYFPTTVLCS-------DNKSNLYNPTID 745
Query: 699 DQTFEFIS-TYPLDTFEYG----------CSILSCSFSDDSNVY----------YCVGTA 737
++T EF +D E G SI F D + C ++
Sbjct: 746 NETNEFNERKIGIDLQEDGKWASEVGIIDNSIFKSYFKTDEAIICGAHVSFKKKQCFISS 805
Query: 738 YVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWM 797
V + + KG + + +LI + + ++L + +LLA I ++LY
Sbjct: 806 QV--KHYKKMKGECWINVYNLKTFKLIHKTPVEDICHALTSCGERLLAGIGTTLRLY--- 860
Query: 798 LRDDGTRELQSECGHHGHILAL-YVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDY 856
D G + L +C G + ++ GD IVVG + + Y + + +D
Sbjct: 861 --DLGKQILIRKCELDGFPSIINSLEVIGDKIVVGTVATGFIYVNYDSDANILSITEKDR 918
Query: 857 NANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGA----TDEERGRLEVVGEYHLGEFVN 912
+ ++A ILD+ +G + ++F N+ TD E+V ++++G+ V
Sbjct: 919 IWHSLTASTILDETSTIGFDKLGSVFITETNTNSNQLNLTDIIPLSNEIV-QWYVGDVVT 977
Query: 913 RFRHGSLVMRLPDSD-----VGQIP----TVIFGTVNGVIGVIASLP-HEQYLFLEKLQT 962
+ + D+ QI +I+ + G IGV+ E F KL+
Sbjct: 978 SVSVNEIWKGITDNTNEYTFEEQIKENKNVIIYSCLMGRIGVLIPFNFREDVEFFSKLEM 1037
Query: 963 NLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSV 1022
++ + + + +R N V +DGDL + F + +I+ + ++
Sbjct: 1038 EIKNNYSPLLSNSFDSYRGTNYPGIGV-----IDGDLCDYFNQMDPKLQLQIANNLEMTP 1092
Query: 1023 EELCKRVEEL 1032
++ + E+
Sbjct: 1093 VQIQLKCEQF 1102
>gi|402591342|gb|EJW85272.1| hypothetical protein WUBG_03818, partial [Wuchereria bancrofti]
Length = 1025
Score = 64.7 bits (156), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 94/439 (21%), Positives = 178/439 (40%), Gaps = 94/439 (21%)
Query: 71 IDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEEL-----QVLDIKFLYGCAKPTI 125
+DP R + ++ V+PF++ ++ ++L ++ V+D+ FL G +PT+
Sbjct: 140 VDPGQRCASMLVFGRYLAVLPFNDSSAQLHSYTVQLSQIDSRLVNVVDMVFLDGYYEPTL 199
Query: 126 VVLYQDNKDA--RHVKTYE------VALKDKDFV-EGPWSQNNLDNGADLLIPVPPPLCG 176
+ LY+ + R Y+ V+L K+ V W NL + ++ +P P+ G
Sbjct: 200 LFLYEPVQTTCGRACVRYDTMCVLGVSLNVKEQVLASVWQLTNLPMDCNQILAIPRPVGG 259
Query: 177 VLIIGEETIVYCS-------------ANAFKAIPIRPSITKAYGRVDADGS--------R 215
+L++ ++Y + + F P++ A + DG+ +
Sbjct: 260 ILLVATNELIYLNQSVPPCGISLNSCMDGFTKFPLKDFKHMA---LTLDGAVVTVVSTNK 316
Query: 216 YLLGDHAG-LLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDS--- 271
LL D G L L+++T V L+++ ET I T++ ++IGS DS
Sbjct: 317 ILLCDRNGRLFTLILVTDATNSVKSLELKFQFETVIPCTMTSCAPGYLFIGSRLCDSVFL 376
Query: 272 --------------QLIKLNLQPDAKGS------------------------YVEVLERY 293
+ IKL+ +P+A + VL++
Sbjct: 377 HCIFEQSTLEESATKKIKLSTEPNANEEDEDFELYGEMLPKVAKPDITEELLNIRVLDKL 436
Query: 294 VNLGP---IVDFC------VVDLERQGQGQVVTCS-GAYKDGSLRIVRNGIGINEQASVE 343
+N+GP I C ++ R+ + C+ G K GS+ I++ I S
Sbjct: 437 LNVGPCKKITGGCPSISAYFQEITRKDPLFDLVCACGHGKFGSICILQRSIRPEIITSSS 496
Query: 344 LQGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHD 403
++G+ W++ DD F+ E LA+ +++L E E F + T+ +
Sbjct: 497 IEGVVQYWAIGRREDDTHMYFIAS---RELGTLALETDNDLVELEAPIFSTSESTIAAGE 553
Query: 404 AIYNQL-VQVTSGSVRLVS 421
L VQVT+ S+ +V+
Sbjct: 554 LADGGLAVQVTTSSLVMVA 572
>gi|296806499|ref|XP_002844059.1| conserved hypothetical protein [Arthroderma otae CBS 113480]
gi|238845361|gb|EEQ35023.1| conserved hypothetical protein [Arthroderma otae CBS 113480]
Length = 1348
Score = 64.3 bits (155), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 100/434 (23%), Positives = 174/434 (40%), Gaps = 60/434 (13%)
Query: 655 RSIPLGEHPRRICHQEQSRTFAI-CSLKNQ-SCAEESEMHF-----------------VR 695
R IPLGE I + S ++ I S+K E+ E H V+
Sbjct: 912 RKIPLGEQVDCIVYSSASESYVIGTSVKEDFKLPEDDESHTEWQNEFITFLPQLERGTVK 971
Query: 696 LLDDQTFEFISTYPLDTFEYGCSILSC---------SFSDDSNVYYCVGTAYVLPEENEP 746
LLD + + P ++C + + VG+A V E+ P
Sbjct: 972 LLDPKNWSIADIAPSSHELEPAERITCIEVIRLEISEITHERKDMVVVGSAIVKGEDIVP 1031
Query: 747 TKGRILVFIVED-----------GKLQLIAEKETKGAVYSLNAF--NGKLLAAINQKIQL 793
KG I VF + D +L+L A +E KGAV +L+ G L+ A QK +
Sbjct: 1032 -KGCIRVFEIIDVVPDPDHSEMNKRLKLFAREEVKGAVTALSGIGSQGFLIVAQGQKCMV 1090
Query: 794 YKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERA 853
L++DG+ + ++ L +VGD +K + Y E ++
Sbjct: 1091 RG--LKEDGSLLPVAFKDAQCYVSVLKELKGTGMCIVGDAIKGLWFTGYSEEPYKLDLFG 1148
Query: 854 RDYNANWMSAVEILDDD--IY-LGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEF 910
++ + A + L D +Y L A+++ NL ++ + E + + RL +H+G F
Sbjct: 1149 KENENIAVIAADFLPDGNRLYVLVADDDCNLHVLQYDPEDPSSSKGDRLLHRNVFHVGHF 1208
Query: 911 VNRFR---HGSLVMRLP------DSDVGQIPT---VIFGTVNGVIGVIASLPHEQYLFLE 958
+ GS P D+D P+ ++ G +G+I L + Y L
Sbjct: 1209 ASTMTLLPQGSHTPHSPADRDAMDTDAPLPPSKYQILMTFQTGSVGIITPLNEDSYRRLL 1268
Query: 959 KLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTM 1018
LQ+ L ++ GLN +R+ ++ + +DG+L+ +LD+ R EI+ +
Sbjct: 1269 ALQSQLVNALEHPCGLNPRGYRAVESD-GIGGQRGMIDGNLLLRWLDMGAQRKAEIAGRV 1327
Query: 1019 NVSVEELCKRVEEL 1032
V + +E+L
Sbjct: 1328 GADVGAIRMDLEKL 1341
>gi|237839083|ref|XP_002368839.1| hypothetical protein TGME49_067710 [Toxoplasma gondii ME49]
gi|211966503|gb|EEB01699.1| hypothetical protein TGME49_067710 [Toxoplasma gondii ME49]
Length = 2136
Score = 63.9 bits (154), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 59/224 (26%), Positives = 93/224 (41%), Gaps = 62/224 (27%)
Query: 733 CVGTAYVLPEENEPTKGRILVFIVE---DGKL--QLIAE-KETKGAVYSLNAFNGKLLAA 786
CVGTA V+ ++EP +G I++ V DG+L QL A E + V L F G LLAA
Sbjct: 1674 CVGTAEVMASDSEPKEGLIILLDVRRGTDGRLVVQLAAHIHELRSGVQQLRPFQGLLLAA 1733
Query: 787 INQKIQLYKWMLRDDGTRELQSE------------------------------------- 809
N +++L+ +G L++E
Sbjct: 1734 CNHRVRLFGLRRELEGAAGLEAEMEGDSPQASEVHAGDRGRSSAGRRQGARARDRTEEKG 1793
Query: 810 --------------CGHHGHILALYVQTRGDFIV-VGDLMKSISLLIYKHEEGAIEERAR 854
C H + + T D IV +GD++ S LL + ++ A +E AR
Sbjct: 1794 RELCEMLKARLELVCSHSSNAFVSSMDTWHDEIVCLGDMVASAVLLRFSPKDNAFKEIAR 1853
Query: 855 DYNANWMSAVEILDDDIYLGAENNFNLFTVRK----NSEGATDE 894
D NA W AV L +++ A+ + N++ + + SEG +E
Sbjct: 1854 DTNACWTLAVSCLSPNLHYLADADRNIWLLERASLLGSEGRREE 1897
Score = 47.0 bits (110), Expect = 0.054, Method: Compositional matrix adjust.
Identities = 25/73 (34%), Positives = 40/73 (54%), Gaps = 1/73 (1%)
Query: 935 IFGTVNGVIGVIASLPHEQ-YLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKN 993
++ + G IG + +P EQ + L LQ + KV K +G L+ + S TV +K
Sbjct: 2050 LWASSEGAIGHLLQIPDEQTFARLAVLQDAVTKVTKSIGKLSAVAFHSVKVGTATVPSKG 2109
Query: 994 FLDGDLIESFLDL 1006
F+DGD++E FL+
Sbjct: 2110 FIDGDILERFLEF 2122
Score = 47.0 bits (110), Expect = 0.063, Method: Compositional matrix adjust.
Identities = 38/137 (27%), Positives = 64/137 (46%), Gaps = 32/137 (23%)
Query: 287 VEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQ-ASVELQ 345
+++L+ + NLGPIVD C+ ++E Q ++ G + SLR +R+G+ ++ ASV L
Sbjct: 751 LQLLQVFPNLGPIVDCCLAEVEGLDQRLLIVACGHGRSTSLRFIRSGLALHASCASVALH 810
Query: 346 G-IKGMWSL-------------------RSSTDDPF-----DTFL------VVSFISETR 374
+ +W+L RS T F D L V++F ETR
Sbjct: 811 APVHRLWTLDLSPYAIAASATGMRISGRRSHTRGEFSKTRADALLARGPIAVLAFPHETR 870
Query: 375 ILAMNLEDELEETEIEG 391
+LA L+ + + +G
Sbjct: 871 VLAWTLKPRGDRSNPKG 887
Score = 40.0 bits (92), Expect = 6.2, Method: Compositional matrix adjust.
Identities = 30/107 (28%), Positives = 48/107 (44%), Gaps = 15/107 (14%)
Query: 26 LFIATERYKFCVLQWDAESSELITRAMGDVSDRIGRPTDNGQIGIIDPDCRLIGLHLYDG 85
L + T+R +L++D E ++T D+ + R D G + + P R I ++ Y+
Sbjct: 186 LLVLTDRQILLLLRYDEEKQTVVTVESIDLKEVSTRFVDGGPLMAVCPTTRCIVVYQYES 245
Query: 86 LFKVIPF--DNKGQLKEAFN-------------IRLEELQVLDIKFL 117
L + +PF + L A N RL+E VLDI FL
Sbjct: 246 LLQYLPFGPEASSSLSPARNHEAGSSLFSGVRLFRLDEQNVLDICFL 292
>gi|393907593|gb|EJD74705.1| CPSF A subunit region family protein [Loa loa]
Length = 990
Score = 63.9 bits (154), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 94/438 (21%), Positives = 178/438 (40%), Gaps = 93/438 (21%)
Query: 71 IDPDCRLIGLHLYDGLFKVIPFDNKGQLKEAFNIRLEEL-----QVLDIKFLYGCAKPTI 125
+DP R + ++ V+PF++ G ++ ++L ++ V+D+ FL G +PT+
Sbjct: 140 VDPGQRCAAMLVFGRYLAVLPFNDSGAQLHSYTVQLSQIDSRLVNVVDMVFLDGYYEPTL 199
Query: 126 VVLYQDNKDA--RHVKTYE------VALKDKDFV-EGPWSQNNLDNGADLLIPVPPPLCG 176
+ LY+ + R Y+ V+L K+ V W NL + ++ +P P+ G
Sbjct: 200 LFLYEPVQTTCGRACVRYDTMCVLGVSLNVKEQVLASVWQLTNLPMDCNQILAIPRPVGG 259
Query: 177 VLIIGEETIVYCS-------------ANAFKAIPIRPSITKAYGRVDADG--------SR 215
+L++ ++Y + + F P+R + + DG ++
Sbjct: 260 ILLVATNELIYLNQSVPPCGISLNSCMDGFTKFPLR---DFKHMVLTLDGCVVTVISTNK 316
Query: 216 YLLGDHAG-LLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDS--- 271
LL D G L L+++T V L+++ +T I T++ ++IGS DS
Sbjct: 317 ILLCDRNGRLFTLVLVTDATNSVKSLELKFQFKTVIPCTMTSCAPGYLFIGSRLCDSVFL 376
Query: 272 --------------QLIKLNLQ---------------------PDAKGSY--VEVLERYV 294
+ IKLN + PD+ + VL++ +
Sbjct: 377 HCIFEQSTLDESAPKKIKLNTELNANEDEDFELYGEVLPKVAKPDSAEELLNIRVLDKLL 436
Query: 295 NLGP---IVDFC------VVDLERQGQGQVVTCS-GAYKDGSLRIVRNGIGINEQASVEL 344
N+GP I C ++ R+ + C+ G K GS+ I + + S +
Sbjct: 437 NVGPCKKITGGCPSISAYFQEVTRKDPLFDLVCACGHGKFGSICIFQRSVRPEIVTSSSI 496
Query: 345 QGIKGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDA 404
+G+ W++ DD F+ E LA+ +++L E E F + T+ +
Sbjct: 497 EGVVQYWAVGRREDDTHMYFIAS---KELGTLALETDNDLVELEAPIFATSEPTIAAGEL 553
Query: 405 IYNQL-VQVTSGSVRLVS 421
L VQVT+ S+ +V+
Sbjct: 554 ADGGLAVQVTTSSLVMVA 571
>gi|68060004|ref|XP_671977.1| hypothetical protein [Plasmodium berghei strain ANKA]
gi|56488645|emb|CAI04030.1| hypothetical protein PB301494.00.0 [Plasmodium berghei]
Length = 346
Score = 63.9 bits (154), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 52/251 (20%), Positives = 108/251 (43%), Gaps = 29/251 (11%)
Query: 217 LLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKL 276
L+ G L+ + + HE V + + IA++I L + +++ + +G+ +
Sbjct: 90 LIQSEYGDLYKIEVNHEDGIVKEIICKYFDTVPIANSICVLKSGALFVAAEFGNHFFYQF 149
Query: 277 N-------------LQPDAKGSYVE----------VLERYVNLGPIVDFCVVDLERQGQG 313
+ P K + + ++++ +L PIVD ++D +
Sbjct: 150 SGIGNDSNESMCTSNHPSGKNAIIAFKTQKLKNLYLVDQIYSLSPIVDMKILDAKNSNIP 209
Query: 314 QVVTCSGAYKDGSLRIVRNGIGINEQASVELQG-IKGMWSLRSSTDDPFDTFLVVSFISE 372
Q+ G SLRI+++G+ I E A+ EL G + +W+++ +D +++VSF
Sbjct: 210 QIYALCGRGPRSSLRILQHGLSIEELANNELPGKPRYIWTIKKDNSSEYDGYIIVSFEGN 269
Query: 373 TRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELRNEWK 432
T IL + + +EE + T+ + N +QV +R ++ ++ EW
Sbjct: 270 TLILEIG--ETVEEVYDSLLLTNVTTIHINLLYDNSFIQVYDTGIRHING---KIVQEWV 324
Query: 433 SPPGYSVNVAT 443
P ++ AT
Sbjct: 325 PPKNKQIDAAT 335
>gi|374109159|gb|AEY98065.1| FAFR382Wp [Ashbya gossypii FDAG1]
Length = 1288
Score = 63.9 bits (154), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 94/389 (24%), Positives = 161/389 (41%), Gaps = 31/389 (7%)
Query: 217 LLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKL 276
L +H L + ++ EK + I A + + ++ S YG+S L +
Sbjct: 329 LQSNHGDLYKVKILPDEKTASPVVTISYFDTIPQAQNLHIFKHGYMFANSEYGNSYLYQF 388
Query: 277 -NLQPDAKGSYVEVLE-RYVNLGP---IVDFCVVD--------LERQGQGQVVTCSGAYK 323
NL + + V+ R + + P + + V D L Q +V
Sbjct: 389 ENLDDEEESMLTSVMPGRRLIIEPRTVLKNLLVADKLALVNPILSSQLTERVPLTIATST 448
Query: 324 DGSLRIVRNGIGINEQASVELQGIK-GMWSLRSSTDDPFDTFLVVSFISETRIL--AMNL 380
G +R+ G+ + S L +W++ ++ F L ++ T IL A
Sbjct: 449 LGDVRLFTAGVNFMDIISSPLPAAPLDIWTV-ATNGSRFHKLLFIALQESTMILKIAAGT 507
Query: 381 EDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGS-VRLVSST--SRELRNEWKSPPGY 437
+ELE +Q +T+ ++QVT V ++ + S E + EW P G
Sbjct: 508 VEELELPHNPFVIAQDKTVLIAHMGGQSIIQVTENKMVHIIENRDESYESKLEWFPPAGI 567
Query: 438 SVNVATANASQVLLATGGGHLVYLEIGDG-ILTEVK-HAQLEYEISCLDINPIGENPSYS 495
+ A++N++Q++LA +VY EIG L E++ ++E I+ L I G Y
Sbjct: 568 CILKASSNSTQLILALSNNEVVYFEIGSNESLNELQDRIEVEERITALAIGN-GNRSDYM 626
Query: 496 QIAAVGMWTDISVRIFSLP---DLNLITKEHLGGEIIPRSVLLCAFEGISYLL-CALGDG 551
IA+V D +V+++SL N + + + P S L A G S L L G
Sbjct: 627 IIASV----DSTVKVYSLKVQDQANFLEVVSMQVLVSPASSLQLASSGGSLCLHIGLDSG 682
Query: 552 HLLNFLLNMKTGELTDRKKVSLGTQPITL 580
+ L+ TGEL D + LGT+P+ +
Sbjct: 683 VYVRSKLDRNTGELFDVRTKYLGTKPVEI 711
>gi|221502136|gb|EEE27880.1| conserved hypothetical protein [Toxoplasma gondii VEG]
Length = 2131
Score = 63.9 bits (154), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 59/224 (26%), Positives = 93/224 (41%), Gaps = 62/224 (27%)
Query: 733 CVGTAYVLPEENEPTKGRILVFIVE---DGKL--QLIAE-KETKGAVYSLNAFNGKLLAA 786
CVGTA V+ ++EP +G I++ V DG+L QL A E + V L F G LLAA
Sbjct: 1669 CVGTAEVIASDSEPKEGLIILLDVRRGTDGRLVVQLAAHIHELRSGVQQLRPFQGLLLAA 1728
Query: 787 INQKIQLYKWMLRDDGTRELQSE------------------------------------- 809
N +++L+ +G L++E
Sbjct: 1729 CNHRVRLFGLRRELEGAAGLEAEMEGDSPQASEVHAGDRGRSSAGRRQGARARDRTEEKG 1788
Query: 810 --------------CGHHGHILALYVQTRGDFIV-VGDLMKSISLLIYKHEEGAIEERAR 854
C H + + T D IV +GD++ S LL + ++ A +E AR
Sbjct: 1789 RELCEMLKARLELVCSHSSNAFVSSMDTWHDEIVCLGDMVASAVLLRFSPKDNAFKEIAR 1848
Query: 855 DYNANWMSAVEILDDDIYLGAENNFNLFTVRK----NSEGATDE 894
D NA W AV L +++ A+ + N++ + + SEG +E
Sbjct: 1849 DTNACWTLAVSCLSPNLHYLADADRNIWLLERASLLGSEGRREE 1892
Score = 47.0 bits (110), Expect = 0.052, Method: Compositional matrix adjust.
Identities = 25/73 (34%), Positives = 40/73 (54%), Gaps = 1/73 (1%)
Query: 935 IFGTVNGVIGVIASLPHEQ-YLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKN 993
++ + G IG + +P EQ + L LQ + KV K +G L+ + S TV +K
Sbjct: 2045 LWASSEGAIGHLLQIPDEQTFARLAVLQDAVTKVTKSIGKLSAVAFHSVKVGTATVPSKG 2104
Query: 994 FLDGDLIESFLDL 1006
F+DGD++E FL+
Sbjct: 2105 FIDGDILERFLEF 2117
Score = 44.3 bits (103), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 23/69 (33%), Positives = 42/69 (60%), Gaps = 2/69 (2%)
Query: 287 VEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQ-ASVELQ 345
+++L+ + NLGPIVD C+ ++E Q ++ G + SLR +R+G+ ++ ASV L
Sbjct: 751 LQLLQVFPNLGPIVDCCLAEVEGLDQRLLIVACGHGRSTSLRFIRSGLALHASCASVALH 810
Query: 346 G-IKGMWSL 353
+ +W+L
Sbjct: 811 APVHRLWTL 819
>gi|326477251|gb|EGE01261.1| protein kinase subdomain-containing protein [Trichophyton equinum CBS
127.97]
Length = 1267
Score = 63.5 bits (153), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 68/308 (22%), Positives = 130/308 (42%), Gaps = 24/308 (7%)
Query: 739 VLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWML 798
V+PE ++P K + KL+L A++E KGAV +L+ G+ + Q + L
Sbjct: 963 VVPEPDQPEKSK---------KLKLFAKEEVKGAVTALSGIGGQGFLIVAQGQKCMVRGL 1013
Query: 799 RDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNA 858
++DG+ + ++ L ++GD K + + Y E ++ ++
Sbjct: 1014 KEDGSLLPVAFKDTQCYVNVLKELKGTGMCIIGDAFKGLWFIGYSEEPYKLDLFGKENEN 1073
Query: 859 NWMSAVEILDD--DIY-LGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFR 915
+ + L D +Y L A+++ NL ++ + E + + RL +H G F +
Sbjct: 1074 LAVVDADFLPDGNKLYILVADDDCNLHVLQYDPEDPSSSKGDRLLHRSVFHTGHFASTMT 1133
Query: 916 ---HGSLVMRLP----DSDVGQIPT----VIFGTVNGVIGVIASLPHEQYLFLEKLQTNL 964
HG+ P D +P ++ G I VI L + Y L LQ+ L
Sbjct: 1134 LLPHGAYTPSAPVDEDAMDTDSLPPSKYQILMTFQTGSIAVITPLSEDSYRRLLALQSQL 1193
Query: 965 RKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEE 1024
++ LN +R+ ++ + +DG+L+ +LD+ R EI+ + V
Sbjct: 1194 VNALEHPCSLNPRGYRAVESDGMG-GQRGMIDGNLLLRWLDMGAQRKAEIAGRVGADVGA 1252
Query: 1025 LCKRVEEL 1032
+ +E+L
Sbjct: 1253 IRTDLEKL 1260
>gi|67473926|ref|XP_652712.1| splicing factor 3b subunit 3 [Entamoeba histolytica HM-1:IMSS]
gi|56469590|gb|EAL47325.1| splicing factor 3b subunit 3, putative [Entamoeba histolytica
HM-1:IMSS]
Length = 1108
Score = 63.5 bits (153), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 164/794 (20%), Positives = 319/794 (40%), Gaps = 100/794 (12%)
Query: 298 PIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSST 357
P+ VVD Q + G+ SL+I+ +G+ +E S+ LQ + +W+L+
Sbjct: 350 PLTSIDVVDSHEVLQIRAFVGKGS--RSSLKILTHGLNPDECMSLSLQHPRNIWTLKPYN 407
Query: 358 DDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSV 417
+ ++V+ ++T +L L D L++ G QTL I LVQ+ + +
Sbjct: 408 EQTHQ-YVVIGLENQTYVLK-TLPDALKQCPECGIRPNVQTLHAGMFIDGTLVQIHAHGI 465
Query: 418 RLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGH----LVYLEIGDGILTEVKH 473
+ + E +E ++ V T+++ QV++A +VY E TE K
Sbjct: 466 ITILN---EKLSEQDPGAQRTILVGTSSSCQVVIALKNAQSKCEIVYFEYN----TETKK 518
Query: 474 -AQLEYEISCLDINPIG----ENPSYSQIAAV---GMWTDISV----RIFSLPDLNLITK 521
A++E I +I + EN ++A + GM +++ + + ++N+ +
Sbjct: 519 LAEVERRIIRSNIIAMALGDFENGKAKKVAYICDNGMANILTLEKESKFLQILEINITVE 578
Query: 522 EHLGGEIIPRSVLLCAFEGIS--YLLCALG--DGHLLNFLLNMKTGELTDRKKVSLGTQP 577
GE S++L F IS +L +G DG + + K EL K + GT P
Sbjct: 579 ----GE--AESIILTQFNEISEPFLFLGIGMKDGCCYGYKVIGKNSELMWCKLI--GTDP 630
Query: 578 ITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAK 637
+ + V +S +YS ++ ++ ++ + + + PF + +S+A
Sbjct: 631 VVFGQYEFGGNIGVIMSSSTMIFMYSYFSRIEFTPISYQPLKALAPFITDFSGESIAGIT 690
Query: 638 EGELT---IGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFV 694
ELT IG++DD + PL PR++ T +CS + +
Sbjct: 691 NDELTVIAIGSLDD--RFTYDEQPLLYTPRKVIKSNYFPTTVLCS-------DNKSNLYN 741
Query: 695 RLLDDQTFEFIS-TYPLDTFEYG----------CSILSCSFSDDSNVY----------YC 733
+D++T EF +D E G S+ F D + C
Sbjct: 742 PTIDNETNEFNERKIGIDLQEDGKWASEVGIIDNSLFKSYFKTDEAIICGAHVSFKKKQC 801
Query: 734 VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
++ V + + KG + + +LI + + ++L + +LLA I ++L
Sbjct: 802 FISSQV--KHYKKMKGECWINVYNLKTFKLIHKTPVEDICHALTSCGERLLAGIGTTLRL 859
Query: 794 YKWMLRDDGTRELQSECGHHGHILAL-YVQTRGDFIVVGDLMKSISLLIYKHEEGAIEER 852
Y D G + L +C G + ++ GD IVVG + + Y + +
Sbjct: 860 Y-----DLGKQILIRKCELDGFPSIINSLEVIGDKIVVGTVATGFIYVNYDSDANILSIT 914
Query: 853 ARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGA----TDEERGRLEVVGEYHLG 908
+D + ++A ILD+ +G + ++F N+ TD E+V ++++G
Sbjct: 915 EKDRIWHSLTASTILDETSTIGFDKLGSVFITETNTNSNQLNLTDIIPLSNEIV-QWYVG 973
Query: 909 EFVNRFRHGSLVMRLPDSD-----VGQIP----TVIFGTVNGVIGVIASLP-HEQYLFLE 958
+ V + + D+ QI +I+ + G IGV+ E F
Sbjct: 974 DVVTSVSVNEIWKGITDNTNEYTFEEQIKENKNVIIYSCLMGRIGVLIPFNFREDVEFFS 1033
Query: 959 KLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTM 1018
KL+ ++ + + + +R N V +DGDL + F + +I+ +
Sbjct: 1034 KLEMEIKNNYSPLLSNSFDSYRGTNYPGIGV-----IDGDLCDYFNQMDPKLQLQIANNL 1088
Query: 1019 NVSVEELCKRVEEL 1032
++ ++ + E+
Sbjct: 1089 EMTPVQIQLKCEQF 1102
>gi|302308830|ref|NP_985929.2| AFR382Wp [Ashbya gossypii ATCC 10895]
gi|442570014|sp|Q753D4.2|RSE1_ASHGO RecName: Full=Pre-mRNA-splicing factor RSE1
gi|299790815|gb|AAS53753.2| AFR382Wp [Ashbya gossypii ATCC 10895]
Length = 1288
Score = 63.5 bits (153), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 94/389 (24%), Positives = 161/389 (41%), Gaps = 31/389 (7%)
Query: 217 LLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTISYLDNAVVYIGSSYGDSQLIKL 276
L +H L + ++ EK + I A + + ++ S YG+S L +
Sbjct: 329 LQSNHGDLYKVKILPDEKTASPVVTISYFDTIPQAQNLHIFKHGYMFANSEYGNSYLYQF 388
Query: 277 -NLQPDAKGSYVEVLE-RYVNLGP---IVDFCVVD--------LERQGQGQVVTCSGAYK 323
NL + + V+ R + + P + + V D L Q +V
Sbjct: 389 ENLDDEEESMLTSVMPGRRLIIEPRTVLKNLLVADKLALVNPILSSQLTERVPLTIATST 448
Query: 324 DGSLRIVRNGIGINEQASVELQGIK-GMWSLRSSTDDPFDTFLVVSFISETRIL--AMNL 380
G +R+ G+ + S L +W++ ++ F L ++ T IL A
Sbjct: 449 LGDVRLFTAGVNFMDIISSPLPAAPLDIWTV-ATNGSRFHKLLFIALQESTMILKIAAGT 507
Query: 381 EDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGS-VRLVSST--SRELRNEWKSPPGY 437
+ELE +Q +T+ ++QVT V ++ + S E + EW P G
Sbjct: 508 VEELELPHNPFVIAQDKTVLIAHMGGQSIIQVTENKMVHIIENRDESYESKLEWFPPAGI 567
Query: 438 SVNVATANASQVLLATGGGHLVYLEIGDG-ILTEVK-HAQLEYEISCLDINPIGENPSYS 495
+ A++N++Q++LA +VY EIG L E++ ++E I+ L I G Y
Sbjct: 568 CILKASSNSTQLILALSNNEVVYFEIGSNESLNELQDRIEVEERITALAIGN-GNRSDYM 626
Query: 496 QIAAVGMWTDISVRIFSLP---DLNLITKEHLGGEIIPRSVLLCAFEGISYLL-CALGDG 551
IA+V D +V+++SL N + + + P S L A G S L L G
Sbjct: 627 IIASV----DSTVKVYSLKVQDQANFLEVVSMQVLVSPASSLQLASSGGSLCLHIGLDSG 682
Query: 552 HLLNFLLNMKTGELTDRKKVSLGTQPITL 580
+ L+ TGEL D + LGT+P+ +
Sbjct: 683 VYVRSKLDRNTGELFDVRTKYLGTKPVEI 711
>gi|328773280|gb|EGF83317.1| hypothetical protein BATDEDRAFT_21894 [Batrachochytrium dendrobatidis
JAM81]
Length = 1673
Score = 63.5 bits (153), Expect = 5e-07, Method: Compositional matrix adjust.
Identities = 68/314 (21%), Positives = 132/314 (42%), Gaps = 40/314 (12%)
Query: 739 VLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWML 798
V+P+ N P R K + + ++ + ++ N LLAAI KI +Y+
Sbjct: 1373 VVPDPNNPQTNR---------KFKHVDSEDDRSPFSAICTVNDYLLAAIGPKIIMYQL-- 1421
Query: 799 RDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNA 858
+DG E+ ++ + + + I + D+ KS+ + ++ E + RD +
Sbjct: 1422 -EDG--EITGVAFLDVNVFVTSLSSVKNLIQICDIQKSVWFVAFQEEPAKLAVLGRDVHP 1478
Query: 859 NWMSAVEILDDDIYLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFR 915
A +L DD L A+ + NL T+ + RL GE HLG+ V++F
Sbjct: 1479 LQGYAANMLIDDNQLALLVADGDKNLHTMIYAPDNVQSLGGERLIRKGEIHLGQHVSKF- 1537
Query: 916 HGSLVMRLPDSDVGQIPTVIF--------GTVNGVIGVIASLPHEQYLFLEKLQTNLRKV 967
+R+ + + ++F T++G + +I + + L L + +
Sbjct: 1538 -----IRMRRKPLLRNDAIVFSKQYLNVAATLDGALEIITPVSERIFKRLYGLYSRMVTS 1592
Query: 968 IKGVGGLNHEQWRSFNNEKKTVDAKNF---------LDGDLIESFLDLSRTRMDEISKTM 1018
I+ + GLN +R + + + F LDGDL+ ++ LSRT+ ++K +
Sbjct: 1593 IEHIAGLNPRGFRQAQHRVRPITLSGFIGPPGPRGILDGDLLYEYVRLSRTQQRGLAKAI 1652
Query: 1019 NVSVEELCKRVEEL 1032
+ L + E+
Sbjct: 1653 GSKDDRLMDDLLEV 1666
>gi|449706784|gb|EMD46555.1| splicing factor 3b subunit 3, putative [Entamoeba histolytica KU27]
Length = 1079
Score = 63.5 bits (153), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 164/794 (20%), Positives = 319/794 (40%), Gaps = 100/794 (12%)
Query: 298 PIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSST 357
P+ VVD Q + G+ SL+I+ +G+ +E S+ LQ + +W+L+
Sbjct: 321 PLTSIDVVDSHEVLQIRAFVGKGS--RSSLKILTHGLNPDECMSLSLQHPRNIWTLKPYN 378
Query: 358 DDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSV 417
+ ++V+ ++T +L L D L++ G QTL I LVQ+ + +
Sbjct: 379 EQTHQ-YVVIGLENQTYVLK-TLPDALKQCPECGIRPNVQTLHAGMFIDGTLVQIHAHGI 436
Query: 418 RLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGH----LVYLEIGDGILTEVKH 473
+ + E +E ++ V T+++ QV++A +VY E TE K
Sbjct: 437 ITILN---EKLSEQDPGAQRTILVGTSSSCQVVIALKNAQSKCEIVYFEYN----TETKK 489
Query: 474 -AQLEYEISCLDINPIG----ENPSYSQIAAV---GMWTDISV----RIFSLPDLNLITK 521
A++E I +I + EN ++A + GM +++ + + ++N+ +
Sbjct: 490 LAEVERRIIRSNIIAMALGDFENGKAKKVAYICDNGMANILTLEKESKFLQILEINITVE 549
Query: 522 EHLGGEIIPRSVLLCAFEGIS--YLLCALG--DGHLLNFLLNMKTGELTDRKKVSLGTQP 577
GE S++L F IS +L +G DG + + K EL K + GT P
Sbjct: 550 ----GE--AESIILTQFNEISEPFLFLGIGMKDGCCYGYKVIGKNSELMWCKLI--GTDP 601
Query: 578 ITLRTFSSKNTTHVFAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAK 637
+ + V +S +YS ++ ++ ++ + + + PF + +S+A
Sbjct: 602 VVFGQYEFGGNIGVIMSSSTMIFMYSYFSRIEFTPISYQPLKALAPFITDFSGESIAGIT 661
Query: 638 EGELT---IGTIDDIQKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCAEESEMHFV 694
ELT IG++DD + PL PR++ T +CS + +
Sbjct: 662 NDELTVIAIGSLDD--RFTYDEQPLLYTPRKVIKSNYFPTTVLCS-------DNKSNLYN 712
Query: 695 RLLDDQTFEFIS-TYPLDTFEYG----------CSILSCSFSDDSNVY----------YC 733
+D++T EF +D E G S+ F D + C
Sbjct: 713 PTIDNETNEFNERKIGIDLQEDGKWASEVGIIDNSLFKSYFKTDEAIICGAHVSFKKKQC 772
Query: 734 VGTAYVLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQL 793
++ V + + KG + + +LI + + ++L + +LLA I ++L
Sbjct: 773 FISSQV--KHYKKMKGECWINVYNLKTFKLIHKTPVEDICHALTSCGERLLAGIGTTLRL 830
Query: 794 YKWMLRDDGTRELQSECGHHGHILAL-YVQTRGDFIVVGDLMKSISLLIYKHEEGAIEER 852
Y D G + L +C G + ++ GD IVVG + + Y + +
Sbjct: 831 Y-----DLGKQILIRKCELDGFPSIINSLEVIGDKIVVGTVATGFIYVNYDSDANILSIT 885
Query: 853 ARDYNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGA----TDEERGRLEVVGEYHLG 908
+D + ++A ILD+ +G + ++F N+ TD E+V ++++G
Sbjct: 886 EKDRIWHSLTASTILDETSTIGFDKLGSVFITETNTNSNQLNLTDIIPLSNEIV-QWYVG 944
Query: 909 EFVNRFRHGSLVMRLPDSD-----VGQIP----TVIFGTVNGVIGVIASLP-HEQYLFLE 958
+ V + + D+ QI +I+ + G IGV+ E F
Sbjct: 945 DVVTSVSVNEIWKGITDNTNEYTFEEQIKENKNVIIYSCLMGRIGVLIPFNFREDVEFFS 1004
Query: 959 KLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTM 1018
KL+ ++ + + + +R N V +DGDL + F + +I+ +
Sbjct: 1005 KLEMEIKNNYSPLLSNSFDSYRGTNYPGIGV-----IDGDLCDYFNQMDPKLQLQIANNL 1059
Query: 1019 NVSVEELCKRVEEL 1032
++ ++ + E+
Sbjct: 1060 EMTPVQIQLKCEQF 1073
>gi|326471884|gb|EGD95893.1| protein kinase subdomain-containing protein [Trichophyton tonsurans
CBS 112818]
Length = 1398
Score = 63.5 bits (153), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 68/308 (22%), Positives = 130/308 (42%), Gaps = 24/308 (7%)
Query: 739 VLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWML 798
V+PE ++P K + KL+L A++E KGAV +L+ G+ + Q + L
Sbjct: 1094 VVPEPDQPEKSK---------KLKLFAKEEVKGAVTALSGIGGQGFLIVAQGQKCMVRGL 1144
Query: 799 RDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNA 858
++DG+ + ++ L ++GD K + + Y E ++ ++
Sbjct: 1145 KEDGSLLPVAFKDTQCYVNVLKELKGTGMCIIGDAFKGLWFIGYSEEPYKLDLFGKENEN 1204
Query: 859 NWMSAVEILDD--DIY-LGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFR 915
+ + L D +Y L A+++ NL ++ + E + + RL +H G F +
Sbjct: 1205 LAVVDADFLPDGNKLYILVADDDCNLHVLQYDPEDPSSSKGDRLLHRSVFHTGHFASTMT 1264
Query: 916 ---HGSLVMRLP----DSDVGQIPT----VIFGTVNGVIGVIASLPHEQYLFLEKLQTNL 964
HG+ P D +P ++ G I VI L + Y L LQ+ L
Sbjct: 1265 LLPHGAYTPSAPVDEDAMDTDSLPPSKYQILMTFQTGSIAVITPLSEDSYRRLLALQSQL 1324
Query: 965 RKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEE 1024
++ LN +R+ ++ + +DG+L+ +LD+ R EI+ + V
Sbjct: 1325 VNALEHPCSLNPRGYRAVESDGMG-GQRGMIDGNLLLRWLDMGAQRKAEIAGRVGADVGA 1383
Query: 1025 LCKRVEEL 1032
+ +E+L
Sbjct: 1384 IRTDLEKL 1391
>gi|255075065|ref|XP_002501207.1| predicted protein [Micromonas sp. RCC299]
gi|226516471|gb|ACO62465.1| predicted protein [Micromonas sp. RCC299]
Length = 1423
Score = 63.5 bits (153), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 86/353 (24%), Positives = 143/353 (40%), Gaps = 59/353 (16%)
Query: 694 VRLLDDQTFEFISTYPLDTFEYGCSI----LSCSFSDDSNVYYCVGTAYVLPE-ENEPTK 748
+RLL T E Y +D E+ SI L + + VGTA LP E+ P +
Sbjct: 1051 LRLLVPGTLECAWQYTVDPGEHVQSIRNVQLRNTMTGALQSMLVVGTA--LPGGEDAPCR 1108
Query: 749 GRILVFIV--------EDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRD 800
GR+L+F V + QL+ ++ K A +L G L AI K+ ++ W
Sbjct: 1109 GRVLIFEVVWQMTDRGTKWQGQLVCVRDAKMACTALEGVGGHLAVAIGTKLIVHSW---- 1164
Query: 801 DGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKH--EEGAIEERARDYNA 858
DG L + + + +FI++GD+ K +K +E + + A+D+
Sbjct: 1165 DG-HSLMPVAFFDTPLHTVTMNVVKNFILLGDIQKGAFFFRWKDTPDEKLLVQMAKDFEG 1223
Query: 859 NWMSAVEILDDDIYLG---AENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVN--- 912
+ A E L D L + N F + + + +L G +H+G V+
Sbjct: 1224 MDILATEFLVDGSTLSMLTTDMTGNAFIFSYDPKSLESWKGQKLLTKGAFHVGSPVHRMV 1283
Query: 913 RFRHGSLVMRLPDSDVGQIPT------------VIFGTVNGVIGVIASLPHEQYLFLEKL 960
RFR ++ P + GQ + V FGT++G +G++ + + L+ L
Sbjct: 1284 RFR-----LKAPTAAPGQTISPAEQKAQANRHAVFFGTLDGSLGILVPIEEAAHASLQSL 1338
Query: 961 QTNLRKVIK--GVGGLNHEQWRSFNNEKKTVDAK--------NFLDGDLIESF 1003
Q L + GLN R KTV+ + + LDG L+ +
Sbjct: 1339 QRYLTYATPHAALAGLNARTHR----HPKTVEGRPMRQPAPHSLLDGGLLAVY 1387
>gi|353231025|emb|CCD77443.1| putative cleavage and polyadenylation specificity factor cpsf
[Schistosoma mansoni]
Length = 1825
Score = 63.2 bits (152), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 61/214 (28%), Positives = 96/214 (44%), Gaps = 36/214 (16%)
Query: 110 QVLDIKFLYGCAKPTIVVLYQD------NKDARHVKTYEVALK---DKDFVEGPWSQNNL 160
VLD++FLYG +PT++VLY+ AR VAL K W Q +L
Sbjct: 213 NVLDMQFLYGFYEPTLLVLYEPIGTWAGRVSARRDTCCIVALSFNLQKRTNPVIWFQESL 272
Query: 161 DNGADLLIPVPPPLCGVLIIGEETIVY-------------CSANAFKAIPIRPSITKAYG 207
+I VP P+ GV+++ +I+Y C A P+R + G
Sbjct: 273 PFDCRSVISVPQPIGGVVVMAANSILYLKQTLPSCGLPLNCYAQISTNFPMRQDVPSC-G 331
Query: 208 RVDADGSR--------YLLGDHAGLLHLLVITHEK--EKVTGLKIELLGETSIASTISYL 257
+ DG R +L+G +G L+LL + E+ + VT L +G + L
Sbjct: 332 PLSIDGCRVVTLNETQFLIGTRSGNLYLLSLWLEQATQTVTSLLFHKVGHAVPPHCMVLL 391
Query: 258 DNAVVYIGSSYGDSQLIKLN---LQPDAKGSYVE 288
++ ++IGS + DS L+K++ L DA G V+
Sbjct: 392 ESKYLFIGSRFCDSVLMKIDYSLLCVDANGKEVD 425
>gi|344300987|gb|EGW31299.1| hypothetical protein SPAPADRAFT_140061 [Spathaspora passalidarum
NRRL Y-27907]
Length = 1107
Score = 63.2 bits (152), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 88/381 (23%), Positives = 166/381 (43%), Gaps = 36/381 (9%)
Query: 295 NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQGI-KGMWSL 353
+L PI+D +Q ++ S Y L+ V GI E S L +++
Sbjct: 374 SLSPILD------AKQTPSNMLVLSSHY----LKQVTRGIPTTEIVSTPLPFTPTSVYTT 423
Query: 354 RSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVT 413
+ + D++LVV+ ++ L +++ + +EE + TL + LVQV
Sbjct: 424 KLTQASVHDSYLVVTSSLASQTLVLSIGEVVEEVPDSKLVTDQHTLSIQQMGKSSLVQVY 483
Query: 414 SGSVRLVSSTSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLEI--GDGILTEV 471
+ +R + +W SP G +V A+ NA+QV++A +VY EI D ++
Sbjct: 484 TNGIRQIGHKV----TDWFSPAGITVTHASTNANQVIIAMSNCEVVYFEIDVDDQLIEYQ 539
Query: 472 KHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLITKEHLGGEIIPR 531
+ +L I+ L I + + S +A +G D ++++ SL + + + + L +
Sbjct: 540 QRLELTSSITSLAI-----SDTRSDVAVIGC-ADETIQVVSLAESDCLDVKSLQA-LSAN 592
Query: 532 SVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITLRTFSSKNTTHV 591
+ L E ++ + +G L++K +L D + LG++P+++ S +T V
Sbjct: 593 ASSLVMIENTVHI--GMENGVYARTKLDIK--QLKDTRVQYLGSRPVSMSRISVGDTAGV 648
Query: 592 FAASDRPTVIYSSNKKLLYSNVNLKEVSHMCPFNSAAFPDSLAIAKEGELTI---GTIDD 648
A S + Y+ + + +++ ++S FNS + I EL I GT D
Sbjct: 649 LAISSSAWIGYTHDTWRIMPLLDI-DISSAASFNSEDIEGIVGI-HGNELVIFSLGTEDG 706
Query: 649 IQKLH---IRSIPLGEHPRRI 666
H I+ L PRR+
Sbjct: 707 FDPSHEWTIKKSRLRYTPRRM 727
>gi|256079900|ref|XP_002576222.1| cleavage and polyadenylation specificity factor cpsf [Schistosoma
mansoni]
Length = 1958
Score = 63.2 bits (152), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 61/214 (28%), Positives = 96/214 (44%), Gaps = 36/214 (16%)
Query: 110 QVLDIKFLYGCAKPTIVVLYQD------NKDARHVKTYEVALK---DKDFVEGPWSQNNL 160
VLD++FLYG +PT++VLY+ AR VAL K W Q +L
Sbjct: 230 NVLDMQFLYGFYEPTLLVLYEPIGTWAGRVSARRDTCCIVALSFNLQKRTNPVIWFQESL 289
Query: 161 DNGADLLIPVPPPLCGVLIIGEETIVY-------------CSANAFKAIPIRPSITKAYG 207
+I VP P+ GV+++ +I+Y C A P+R + G
Sbjct: 290 PFDCRSVISVPQPIGGVVVMAANSILYLKQTLPSCGLPLNCYAQISTNFPMRQDVPSC-G 348
Query: 208 RVDADGSR--------YLLGDHAGLLHLLVITHEK--EKVTGLKIELLGETSIASTISYL 257
+ DG R +L+G +G L+LL + E+ + VT L +G + L
Sbjct: 349 PLSIDGCRVVTLNETQFLIGTRSGNLYLLSLWLEQATQTVTSLLFHKVGHAVPPHCMVLL 408
Query: 258 DNAVVYIGSSYGDSQLIKLN---LQPDAKGSYVE 288
++ ++IGS + DS L+K++ L DA G V+
Sbjct: 409 ESKYLFIGSRFCDSVLMKIDYSLLCVDANGKEVD 442
>gi|66362282|ref|XP_628105.1| contains a UVDDB domain that is present in CPSF_A and damage specific
DNA binding protein 1 [Cryptosporidium parvum Iowa II]
gi|46227624|gb|EAK88559.1| contains a UVDDB domain that is present in CPSF_A and damage specific
DNA binding protein 1 [Cryptosporidium parvum Iowa II]
Length = 1495
Score = 62.8 bits (151), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 33/123 (26%), Positives = 60/123 (48%), Gaps = 1/123 (0%)
Query: 805 ELQSECGHHGHILALYVQT-RGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSA 863
EL+ + + H + ++++ G++I+VGDLM+S+ L + G E RD + W+
Sbjct: 1188 ELKRKETYCTHTMIVFIKILNGEYILVGDLMRSVGLWEFDRYTGKFHEVCRDNSIAWVVE 1247
Query: 864 VEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRL 923
L D+YL ++ N NL + + DE L + H+GE + F+ G
Sbjct: 1248 GIFLSKDMYLISDENRNLRVLMRTLNPENDETYTSLSCIAHLHVGESITTFQQGKFTQAY 1307
Query: 924 PDS 926
P++
Sbjct: 1308 PET 1310
Score = 51.2 bits (121), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 82/405 (20%), Positives = 162/405 (40%), Gaps = 83/405 (20%)
Query: 4 VPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQ---WDAESSELITRAMGDVSDRIG 60
V IY I ++ FRP + D + I T Y+ +L+ +++S +++ ++ D
Sbjct: 75 VEIYKDIIEIDRFRPKNQDFDDVLIFTREYELILLRAHLLNSDSEKILCMSILDKVSLYR 134
Query: 61 RPTDNGQI--GIIDPDCRLIGLHLYDGLFKVIPFD-------NKGQLKEAFNIRLEELQV 111
Q+ ++ + I + Y+G +V+ + +K IRL EL V
Sbjct: 135 ENLRKSQLIKMMVHSEKNRIVILAYEGCLQVVGCEISSKDEEDKAVFTSPLIIRLSELSV 194
Query: 112 LDIKFLYGCAKPTIV-VLYQD--NKDARHVKTYEVALKDKDFVEGPWSQN-NLDNGADLL 167
DI + +++ +LY +KD R +K E+ + + WS N N+ +
Sbjct: 195 TDICLANTTNERSLLGILYDSGYSKDPRLMKIIELPIDLR-----KWSYNTNIGHVMQQN 249
Query: 168 IPVPPPLC----------GVLIIGEETIVYCSANAFKAIPIRPSIT-------------- 203
+ PL G + G+ + Y S + S T
Sbjct: 250 VFKIAPLYYRSKKQKMVKGFFLFGDGIVEYRSIEEIMPNKTKNSETISSTQKIVTSRFNS 309
Query: 204 KAYGRVDA--------------DGSRYLLGDHAGLLHLLVITHEK---EKVTGLKIELLG 246
A+ +D DGSR+L+ D+ G L ++++ +++ +KV +++ +L
Sbjct: 310 SAFSGMDIGLSPLSITDILNLDDGSRWLVLDNLGRLFIMLVEYDQDDIDKVVDIRLNILN 369
Query: 247 ETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVD 306
S S I L + + ++ S DS L+ +K + ++ N+GPI D +
Sbjct: 370 RYSPFSRIVNLGDDLFFLASKLSDSLLLY------SKNNNFYIISSLPNIGPIKDLIFTN 423
Query: 307 LERQGQGQ---------------VVTCSGAYKDGSLRIVRNGIGI 336
+R+ + + ++ G G+L+ + NGIG+
Sbjct: 424 FDRKEETKYEDIPSQISKRDPLPLIAACGFGSGGALKSICNGIGL 468
>gi|388581811|gb|EIM22118.1| hypothetical protein WALSEDRAFT_28358 [Wallemia sebi CBS 633.66]
Length = 1259
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 63/280 (22%), Positives = 121/280 (43%), Gaps = 14/280 (5%)
Query: 760 KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECGHHGHILAL 819
KL+++ +ETKGAV ++ + +G + A+ QK+ + + + L S + +
Sbjct: 989 KLKMLMREETKGAVSAITSCSGYFVVAVGQKVLIRALEINE----RLISVAFYDAGTYIV 1044
Query: 820 YVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEIL--DDDI-YLGAE 876
++ +FI+VGD +KSI+ L ++ + + +RD L +D I ++ +
Sbjct: 1045 SLEVLKNFILVGDQVKSITFLAFQESPYKLVQLSRDARQIETCVSNFLAHEDQISFVSND 1104
Query: 877 NNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIF 936
+L + N T E +L E+H G + + LP V ++
Sbjct: 1105 IQGDLRLIDYNPFDPTAEGGEKLIRTTEFHKGS------EATCSLLLPKPSVRPSSELLL 1158
Query: 937 GTVNGVIGVIASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVD-AKNFL 995
G V+G + ++ + + L LQ L + I + LN R N+ + +K L
Sbjct: 1159 GCVDGSLSCLSPVDEITFKALWLLQGALVRQIPHIAALNPRAHRHVRNDYVSRSLSKGIL 1218
Query: 996 DGDLIESFLDLSRTRMDEISKTMNVSVEELCKRVEELTRL 1035
DG L+ ++ + EI+K + S EL + + L
Sbjct: 1219 DGLLLSAYQTIDHATQVEIAKRIGYSKAELLGYLRNFSWL 1258
>gi|350584885|ref|XP_003126937.3| PREDICTED: splicing factor 3B subunit 3-like [Sus scrofa]
Length = 425
Score = 62.4 bits (150), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 68/288 (23%), Positives = 128/288 (44%), Gaps = 39/288 (13%)
Query: 694 VRLLDDQTFEFISTYPLDTFEYGCSILSCSFSDDSNVYYC-VGTAYVLPEENEPTKGRIL 752
+R+++ + L+ E S+ C FS+ + +Y VG A L G +
Sbjct: 150 IRVMNPIQGNTLDLVQLEQNEAAFSVAVCRFSNTGDDWYVLVGVAKDLILNPRSVAGGFV 209
Query: 753 VF--IVEDG-KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSE 809
+V +G KL+ + + + ++ F G++L + + +++Y D G ++L +
Sbjct: 210 YTYKLVNNGEKLEFLHKTPVEEVPAAIAPFQGRVLIGVGKLLRVY-----DLGKKKLLRK 264
Query: 810 CGHHGHILALYV---QTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEI 866
C + HI A Y+ QT G ++V D+ +S + YK E + A D W++ +
Sbjct: 265 C-ENKHI-ANYISGIQTIGHRVIVSDVQESFIWVRYKRNENQLIIFADDTYPRWVTTASL 322
Query: 867 LDDDIYLGAENNFNLFTVR--KNSEGATDE---------ERGRL-------EVVGEYHLG 908
LD D GA+ N+ VR N+ DE +RG L EV+ YH+G
Sbjct: 323 LDYDTVAGADKFGNICVVRLPPNTNDEVDEDPTGNKALWDRGLLNGASQKAEVIMNYHVG 382
Query: 909 EFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIASLPHEQYLF 956
E V + +L+ G ++++ T++G IG++ + ++
Sbjct: 383 ETVLSLQKTTLIP-------GGSESLVYTTLSGGIGILVPFTSHEVIY 423
>gi|327304811|ref|XP_003237097.1| hypothetical protein TERG_01819 [Trichophyton rubrum CBS 118892]
gi|326460095|gb|EGD85548.1| hypothetical protein TERG_01819 [Trichophyton rubrum CBS 118892]
Length = 1398
Score = 62.0 bits (149), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 69/308 (22%), Positives = 128/308 (41%), Gaps = 24/308 (7%)
Query: 739 VLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWML 798
V+PE ++P K + KL+L A++E KGAV +L+ G+ + Q + L
Sbjct: 1094 VVPEPDQPEKSK---------KLKLFAKEEVKGAVTALSGIGGQGFLIVAQGQKCMVRGL 1144
Query: 799 RDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNA 858
++DG+ + ++ L ++GD K + Y E ++ ++
Sbjct: 1145 KEDGSLLPVAFKDTQCYVNVLKELKGTGMCIIGDAFKGLWFTGYSEEPYKLDLFGKENEN 1204
Query: 859 NWMSAVEILDD--DIY-LGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEF---VN 912
+ + L D +Y L A+++ NL ++ + E + + RL +H G F V
Sbjct: 1205 LAVVDADFLPDGNKLYILVADDDCNLHVLQYDPEDPSSSKGDRLLRRSVFHTGHFASTVT 1264
Query: 913 RFRHGSLVMRLP--------DSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNL 964
HG+ P DS ++ G I VI L + Y L LQ+ L
Sbjct: 1265 LLPHGAHTTSSPVDEDAMDTDSPPPSKYQILMTFQTGSIAVITPLSEDSYRRLLALQSQL 1324
Query: 965 RKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEE 1024
++ LN +R+ ++ + +DG+L+ +LD+ R EI+ + V
Sbjct: 1325 VNALEHPCSLNPRGYRAVESDGMG-GQRGMIDGNLLLRWLDMGAQRKAEIAGRVGADVGA 1383
Query: 1025 LCKRVEEL 1032
+ +E+L
Sbjct: 1384 IRIDLEKL 1391
>gi|301628217|ref|XP_002943254.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
1 [Xenopus (Silurana) tropicalis]
Length = 628
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 106/562 (18%), Positives = 216/562 (38%), Gaps = 148/562 (26%)
Query: 1 MLDVPIYGRIATLELFRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAM-----GDV 55
M +G + ++ + G +D L ++ + K V+++D + +L T ++ ++
Sbjct: 69 MASFSFFGNVMSMASVQLAGAKRDALLLSFKEAKLSVVEYDPGTHDLKTLSLHYFEEPEL 128
Query: 56 SDRIGRPTDNGQIGIIDPDCRLIGLHLYDGLFKVIPF---------------DNKGQLKE 100
D + N ++ + DP R + +Y V+PF K
Sbjct: 129 RDGFVQNVHNPKVRV-DPSGRCAVMLIYGTQLVVLPFRRDTLAEEHDGLVGEGQKSSFLP 187
Query: 101 AFNIRLEEL-----QVLDIKFLYGCAKPTIVVLYQDNKD-----ARHVKTYEVALKDKDF 150
++ I + EL ++D++FL+G +PT+++L++ N+ A T + +
Sbjct: 188 SYIIDVRELDEKLLNIIDMQFLHGYYEPTLLILFEPNQTWPGRVAVRQDTCSIVAISLNI 247
Query: 151 VEG--P--WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCS-------------ANAF 193
++ P WS NL + VP P+ GV+I +++Y + N
Sbjct: 248 MQKVHPVIWSLTNLPYDCTQALAVPKPIGGVVIFAVNSLLYLNQSVPPYGVSLNSLTNGT 307
Query: 194 KAIPIRPSITKAYGRVDADGS--------RYLLGDHAGLLHLL-VITHEKEKVTGLKIEL 244
+ P++P + RV D S + ++ G +++L +IT V +
Sbjct: 308 TSFPLKP---QEGLRVTLDCSQATFISYDKMVISLKGGEIYVLTLITDGMRSVRSFHFDK 364
Query: 245 LGETSIASTISYLDNAVVYIGSSYGDSQLIKLNLQ-----------PD------------ 281
+ + ++++ ++ +++GS G+S L++ + PD
Sbjct: 365 AAASVLTTSMTPMEPGYLFLGSRLGNSLLLRYTEKVQDSPAGPSKDPDKQDEPPNKKKRV 424
Query: 282 ---------AKGSYV------------------------EVLERYVNLGPIVD------- 301
+KG+ V EV + +N+GP
Sbjct: 425 DSSLARPGGSKGNMVDEIDEIEVYGSEMQSGTQLSTYSFEVCDSILNIGPCATASMGEPA 484
Query: 302 FCVVDLERQGQG--QVVTCSGAYKDGSLRIVRNGIGINEQASVELQGIKGMWSLRSS--- 356
F + + + ++V CSG K+G+L +++ I + EL G MW++ S+
Sbjct: 485 FLSEEFQESPEPDLEIVLCSGYGKNGALSVLQKSIRPQVVTTFELPGCHDMWTVISNHKK 544
Query: 357 -----------------TDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTL 399
D FL++S T IL E+ E + GF +Q T+
Sbjct: 545 EEQEGEKEGETPPVEAEEDTNRHGFLILSRDDSTMILQTG--QEIMELDTSGFATQDPTV 602
Query: 400 FCHDAIYNQ-LVQVTSGSVRLV 420
+ + N+ +VQV+ +RL+
Sbjct: 603 YAGNIGDNKYIVQVSPRGIRLL 624
>gi|326432241|gb|EGD77811.1| hypothetical protein PTSG_08901 [Salpingoeca sp. ATCC 50818]
Length = 1506
Score = 61.6 bits (148), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 79/334 (23%), Positives = 135/334 (40%), Gaps = 61/334 (18%)
Query: 17 RPHGEAQDFLFIATERYKFCVLQWDAESSELIT---RAMGDVSDRIGRPTDNGQIGI-ID 72
RP G ++D L + + ++++D + +L T A D ++G ++ + +D
Sbjct: 75 RPRGNSRDLLIFTFKDARVAIVRFDPKMRDLETVSLHAFEDTDTKLGGWHSEQRLRVCVD 134
Query: 73 PDCRLIGLHLYDGLFKVIPF-----------DNKGQLKEAFNIRLEEL--------QVLD 113
P R L +Y VI F D + +++F R+ +L +V D
Sbjct: 135 PLHRCAALMVYGCKLIVISFSSGTATAAPEADTQEDTEQSFTSRVIDLLSLPSTIGRVDD 194
Query: 114 IKFLYGCAKPTIVVLYQ----------DNKDARHVKTYEVALKDKDFVEGP--------- 154
+ FL G P + +L+Q KD HV +AL + P
Sbjct: 195 MAFLDGYDVPCLAILHQPRPAWVGHMAKTKDTAHVTALSLALDEMTARRAPTAPPPPPPP 254
Query: 155 --WSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSA--------NAFKAIPIRPSITK 204
W Q NL + L PVP PL GV++IG + Y + N + I +
Sbjct: 255 VVWHQENLPSDTFALQPVPAPLGGVVVIGVNVLFYVTQSLVRSLALNGYSRASTNAPIQE 314
Query: 205 AYG-RVDADGSRY--------LLGDHAGLLHLLVITHEKEKVTGLKIELLGETSIASTIS 255
G +D DG+ + L +G +HLL I V GL+++ L + I S I
Sbjct: 315 QTGISLDLDGAHHALLTPTQILFALPSGDIHLLTIVCTDVTVDGLRMDKLATSVIGSDIC 374
Query: 256 YLDNAVVYIGSSYGDSQLIKLNLQPDAKGSYVEV 289
L ++I S + S L++ P + ++++V
Sbjct: 375 TLGRRHIFIASRHATSLLLEWAPIPLSATTHIDV 408
>gi|296424589|ref|XP_002841830.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295638079|emb|CAZ86021.1| unnamed protein product [Tuber melanosporum]
Length = 595
Score = 61.2 bits (147), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 88/476 (18%), Positives = 189/476 (39%), Gaps = 58/476 (12%)
Query: 247 ETSIASTISYLD-----NAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNLGPIVD 301
+ SI + + LD N ++ G + L + P + ++ E N P+ D
Sbjct: 76 DCSIGTAFASLDFGLNKNDMLVAGGEMSSGGVYLLKIGPHPAETEPKLEESVANWSPVFD 135
Query: 302 FCVVDLERQGQG-------QVVTCSGAYKDGSLRIVRNGIGINEQASV--ELQGIKGMWS 352
F +V+L QG +++ C+G G++ +R GI Q S +++G++ +W
Sbjct: 136 FELVNLPAQGHASGKVERDRILACTGRGDHGAITELRYGIQARIQGSADHQMRGVRRLWV 195
Query: 353 LRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQV 412
L +D + FL S ++ + + + E E+ +T A+ VQV
Sbjct: 196 LPDCSDVGY--FLFSSLPDQSNLCFLKPDGEWEDASDLELLDMNETTLAAGAVGKHSVQV 253
Query: 413 TSGSVRLVSSTSRELRN--------EWKSPPGYSVNVATANASQVLLATG-GGHLVYLEI 463
T ++ + L N E P S+ V++A G+ + +
Sbjct: 254 TPSTINIAQLKHASLANHREVSRDVEMADEPQLSILRCRCGGGDVIVAAALKGNYIVVAT 313
Query: 464 GDGILTEVKHAQLEYEISCLD-INPIG------ENPSYSQI--------AAVG------- 501
+G L ++ A + + + D +NPIG E P++ + AAVG
Sbjct: 314 RNGFLVKLTLATIVVDENIEDFLNPIGIPTSLAEEPTFISVLEINGRILAAVGTRQATIQ 373
Query: 502 -MWTDISVRIFSLPDLNLITKEHLGGE--IIPRSVLLCAFEGISYLLCALGDGHLLNFLL 558
D S + L + ++++ G + I ++ G + LLC L G ++ +
Sbjct: 374 LFLFDFSNGLVPLLEKSMMSDSCQGEQDLFICECAVIIRANGTARLLCGLRGGTVVVMDI 433
Query: 559 NMKTGELTDR-KKVSLGTQPITLRTFSSKNTTHVFAAS------DRPTVIYSSNKKLLYS 611
+ G +R + G P+ + ++ + A D P+ + +++ +
Sbjct: 434 QWQRGLSLERIDGIKFGPTPVQIYPDVGRSDSAFILAGPELFRFDLPSGNFRASQVVFEE 493
Query: 612 NVNLKEVSHMCPFNSAAFPDSLAIA-KEGELTIGTIDDIQKLHIRSIPLGEHPRRI 666
+ + +++ P+++ + + ++ + + +K+ +R + L E PRR+
Sbjct: 494 SEVEPSLVAFVQIDTSPDPENIVVGVTKDQIFFADVGESEKVCVRRLRLYETPRRL 549
>gi|149245793|ref|XP_001527373.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
YB-4239]
gi|146449767|gb|EDK44023.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
YB-4239]
Length = 1223
Score = 60.5 bits (145), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 100/473 (21%), Positives = 188/473 (39%), Gaps = 70/473 (14%)
Query: 160 LDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPIRPS--------ITKAYGRVDA 211
L N A+ LIPV + GVL+ GE I + AN + P + +
Sbjct: 229 LPNDANYLIPVLGAIGGVLLCGENWIRFVKANKDNIVLPLPRRGNQNTIIVNHVSHFLKK 288
Query: 212 DGSRYLLGDHAGLLHLLVITHE--KEKVTGLKIELLGETSIASTISYL-----------D 258
G + G + LV ++ + KV + I + +++ +
Sbjct: 289 KGLFIFMQSTLGDILTLVFDYDSDRSKVKDITITYFDTIPPSQSLNIFKSGFLFANSLNN 348
Query: 259 NAVVY----IGSSYGDSQLIKLNLQPDAKG-------------------SYVEVLERYVN 295
N V+Y +G+ D+ +KL+ DA S ++ LE +
Sbjct: 349 NQVLYQFMKLGADLSDATSVKLSTVTDANSIDIAKKLHYKFEVRSLQNLSEIDTLE---S 405
Query: 296 LGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIVRNGIGINEQASVELQ-GIKGMWSLR 354
L PI D ++ + T S +K ++ V GI + L +++ +
Sbjct: 406 LSPITDSKLLP-----NSHLATLSSNHK---MKTVMAGIPTSVLVESPLTLTPTNVFTTK 457
Query: 355 SSTDDPFDTFLVVSFISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTS 414
S D P D +LV + + L ++ + LE+ F T+ +VQ+ +
Sbjct: 458 LSEDSPNDEYLVFTSTLSSETLVFSIGESLEDVTDSKFVLDQPTIAVQQVGKASVVQIYA 517
Query: 415 GSVRLVSS---TSRELRNEWKSPPGYSVNVATANASQVLLATGGGHLVYLE---IGDGIL 468
+R +S+ S+++ +W P G S+ A N QVL+A +VY E D +L
Sbjct: 518 YGLRHISTKIGNSKKV-TDWYPPAGISIVHAATNNKQVLIALSNSEVVYFETDPTDDQLL 576
Query: 469 TEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSL-PDLNLITKEHLGGE 527
+ ++ ++ + I+P E S++ I +D ++++ SL P L K
Sbjct: 577 EYQEKLEVTSPVTSMTISP--ERSSFAIIGC----SDETIQVISLQPQSCLEVKSLQALS 630
Query: 528 IIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRKKVSLGTQPITL 580
S+ + E +Y+ + +G ++ +G+L+D + LG++PI L
Sbjct: 631 SKANSLAMLTNERTTYVHIGMENGVYARTKIDKFSGKLSDTRVKYLGSKPIAL 683
Score = 40.0 bits (92), Expect = 6.7, Method: Compositional matrix adjust.
Identities = 37/123 (30%), Positives = 59/123 (47%), Gaps = 24/123 (19%)
Query: 925 DSDVGQIPTVIFGTVNGVIGVIASL--PHEQYLFLEKLQTNLRKV-------IKGVG--- 972
DSD G +++ + G IGV+ L HE L LQ LRK+ + GV
Sbjct: 1104 DSDDG---CIVYTGIQGTIGVLILLLTKHEVELLFN-LQLELRKLCNHDDDAVSGVASWI 1159
Query: 973 ---GLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEELCKRV 1029
G +H + RS+ N KN +DGD +E +L L + +I + ++ S E+ KR+
Sbjct: 1160 SPLGKDHLKHRSYFNP-----VKNIIDGDFVECYLQLLQNWKVKIGEAIDKSPNEIEKRL 1214
Query: 1030 EEL 1032
++
Sbjct: 1215 NDI 1217
>gi|428170663|gb|EKX39586.1| hypothetical protein GUITHDRAFT_143370 [Guillardia theta CCMP2712]
Length = 1162
Score = 60.5 bits (145), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 129/649 (19%), Positives = 262/649 (40%), Gaps = 132/649 (20%)
Query: 123 PTIVVLYQDNKDARHVKTYEVALKDKD-----FVEGPWS----------QNNLDNGADLL 167
P + +L++D K ++T + ++ + ++GPWS NL+ GA L
Sbjct: 216 PLMALLFEDEKGIVDLETCSIKYEESNKLNFNLLQGPWSFSTLHPETSLGRNLEVGAFLY 275
Query: 168 IPVPPPLCGVLIIGEETIVYCSANAFKAIPIRPSITKAYGRVDADGSRYLLGDHAGLLHL 227
+P + I T + A++ A+P P + + ++LG + G +H+
Sbjct: 276 PHLPSSQWAIRI----TSRFAVASSL-AVPAGPGSNRKKRSM-----HWILGSNDGQMHI 325
Query: 228 LVITHEKEKVTGLKIEL--------LGETSIASTISYLDNAVVYIGSSYGDSQLIKL--- 276
L ++ E K + I + L +A+ + + + + G + ++ +
Sbjct: 326 LSVSGES-KSANVTINVKRIDTNIPLSNPRVATILRSQERTLQLWCTKTGYTSILDITKL 384
Query: 277 ------NLQPDAKGSYVEVLERYV--NLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLR 328
N Q D + VEV+ +LG I D VD G+ Q++ G D LR
Sbjct: 385 SHHIENNSQTDQEIPGVEVMCSPFEESLGSIKDAIAVDFLGDGEMQLLLACGEGSDSCLR 444
Query: 329 IVRNGIGINE--------QASVELQGIKGMWSLRSSTDD---------PFDTFLVVSF-- 369
+ R+G+ +++ ++ ++G+ L D FD+ LV SF
Sbjct: 445 LCRSGLEVSKIIEEGPEMPECSDIFALRGLHILHVQPHDNGSNIQRLRAFDSHLVFSFAS 504
Query: 370 ISETRILAMNLEDELEETEIEGFCSQTQTLFCHDAIYNQLVQVTSGSVRLV-------SS 422
I++T++L ++ E + G C T+F + LVQVT +RL+ +S
Sbjct: 505 INQTKVLELD-GHEFVPVTLPGLCEDANTVFITSLPHGHLVQVTEMEIRLINMRKSVMNS 563
Query: 423 TSRELRNEWKSPPGYSVNVATANASQVLLATGGG------HLVYLEIG-DGILT------ 469
+ + + W +P ++N+A+ + T G L L + D +L+
Sbjct: 564 SQEDYLHVW-TPKIGNINMASVVKYDTSIKTDGNQSLTPHQLASLVVAVDSVLSLFEILT 622
Query: 470 --------EVKHA-QLEYEISCLDINPIGENPSYSQIAAVGMWTDISVRIFSLPDLNLIT 520
++KH+ Q E++IS + N V M + S+ + LPD L
Sbjct: 623 HSNKKSEVKLKHSRQFEHQISAVWPN------------EVHMLKNDSMEV-KLPD-KLSY 668
Query: 521 KEHLGGEIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTGELTDRK------KVSLG 574
+++ C + ++ + + G H+ F + KT + D + +G
Sbjct: 669 QQYNQH---------CVCDQVAVVGSSTGIAHI--FPIPSKTSGIVDVMTLIEVCSLRVG 717
Query: 575 TQPITLRTFSSKNTTH-VFAASDRPTVIYSSNKKLLYSNVN-LKEVSHMCPFNSAAFPDS 632
+++ ++ H +F S+ I+ + +L +N +++V +C ++ P S
Sbjct: 718 ETAVSIHVEETEQAVHGIFLHSNYNAWIHLRSGELEVKRINGIEKVKAVCSLHTELMPTS 777
Query: 633 LAIAKEGE-LTIGTIDDIQK---LHIRSIPLGEHPRRICHQEQSRTFAI 677
L E + L G + + ++ ++ RS+ L E+ + H + R +
Sbjct: 778 LVWINEKDCLQFGKLAEDRQTSTIYQRSVKLAENVVSMTHSSRHRCLLL 826
>gi|303285993|ref|XP_003062286.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226455803|gb|EEH53105.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 1469
Score = 60.1 bits (144), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 75/335 (22%), Positives = 141/335 (42%), Gaps = 44/335 (13%)
Query: 733 CVGTAYVLPE-ENEPTKGRILVFIVE----DGKLQLI--AEKETKGAVYSLNAFNGKLLA 785
VGTA +P E+ P +GR+++F + DG+ + + A+ +L+ G L+
Sbjct: 1139 AVGTA--MPGGEDTPCRGRVILFEISWQMVDGETRRVPLLLLFFDDALAALSGLEGHLVV 1196
Query: 786 AINQKIQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHE 845
AI K+ ++ W DG EL + + + +F+ +GD+ K +K +
Sbjct: 1197 AIGTKLIVHAW----DGA-ELIPVAFFDTPVHTVTINVVKNFVCIGDVQKGAYFFRWKDD 1251
Query: 846 ----EGAIEERARDYNANWMSAVEILDDDI---YLGAENNFNLFTVRKNSEGATDEERGR 898
E + + A+D+ + + + E L D L A+ N + + + + + +
Sbjct: 1252 PRTGEKNLIQLAKDFESMDVLSTEFLVDGSTLSLLAADTAGNAYVFAYDPKSSESWKGQK 1311
Query: 899 LEVVGEYHLGEFVNR---FRHGSLVMRLPDSDVGQIPT----------VIFGTVNGVIGV 945
L +H+G V+R F+ + D P V FGT++G +G+
Sbjct: 1312 LLTKASFHVGSPVHRMVRFKLKTPTGAGNDGRAAPTPAEIKANANRHAVFFGTLDGSLGI 1371
Query: 946 IASLPHEQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEK----KTVDAKNFLDGDLIE 1001
+ + + LE LQ L GLN +R+ + ++ N LDG++++
Sbjct: 1372 LVPMESSTHAKLEVLQRWLNYNTAQNAGLNGRSYRAPKTTEGRAMRSPAPHNLLDGEMLQ 1431
Query: 1002 SFLDLSRTRMDEISKTMNVSVEELCKRVEELTRLH 1036
F L+ T+ E + ++ R E LT LH
Sbjct: 1432 GFESLAWTKQAEAADAAGMT------REEALTYLH 1460
Score = 59.7 bits (143), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 132/661 (19%), Positives = 245/661 (37%), Gaps = 179/661 (27%)
Query: 8 GRIATLEL-FRPHGEAQDFLFIATERYKFCVLQWDAESSELITRAM---------GDVSD 57
G IATL F E +D L +A K V+++D + L+ ++ G V
Sbjct: 113 GSIATLRRRFGAPREQRDALLLAVRESKLSVVEFDPSTLSLVCSSLHSWETPPGAGGVPS 172
Query: 58 RIGRPTDNGQIGIIDPD--CRLIGLHLYDGL-FKVIPFDN-------------KGQ---- 97
+ R + + DP+ C + L G ++P DN KG+
Sbjct: 173 AL-RLAPTPPVVVADPEGRCAAVLLRAEGGTRLALLPTDNDAMDVDGGDGSEGKGRRTLR 231
Query: 98 -----LKEAFNIRL-EELQVL---DIKFLYGCAKPTIVVLYQDN----------KDARHV 138
+K+++ + L E+ V D+ FL+G +P ++VL+++ KD +
Sbjct: 232 GTAAAVKKSYVVDLVREMGVRYVRDVCFLHGYGEPVLLVLHEERLTWAARATLVKDTMRL 291
Query: 139 KTYEVALKDKDFVEGPWSQNNLDNGADLLIPVPPPLCGVLIIGEETIVYCSANAFKAIPI 198
+ + + W ++ L + L +P PL G +++ + +++ S + A+ +
Sbjct: 292 SAISLNVDARKHTV-IWRRSALPHSCYRLTAMPAPLGGAIVLSQNFLLHESQESSAALAL 350
Query: 199 RPSITKAYGRVDADG----------------------SRYLLGDHAGLLHLLVITHEKEK 236
P GR D + L+ AG L+LL + E +
Sbjct: 351 NP--LAGGGRGDDPAAKAAAAASAAALDGAYAAVISEKQALVTTKAGALYLLSLRIEGRR 408
Query: 237 VT---GLKIELLGETSIASTISYLDNAVVYIGSSYGDS---------------------- 271
+ G+ ++ G ++S + + ++++GS GDS
Sbjct: 409 LATRGGMHLKRAGGAVLSSGMCLVTRRLLFLGSRVGDSLLVSRCSTARASTAAPGRRPRA 468
Query: 272 ------------QLIKLNLQ---------------------PDAKGSYVEVLERYVNLGP 298
+L+ + Q PD G V + + + P
Sbjct: 469 AAAAATTAAAEVRLLPIRPQIDGVGGVSAASLRAAAAAHRAPDHPGYTFTVRDSVLGISP 528
Query: 299 IVDFCV-----VDLERQGQGQVVTCSGAYKDGSLRIVRNGIG---INEQASVELQGIKGM 350
++D V V + + +++ G K+G+L +++ GI + E S L G+KG
Sbjct: 529 VIDLTVGASASVSGDTIERTELIAACGHGKNGALAVLQRGIQPELVTEVESGTLPGLKGT 588
Query: 351 WS----------LRSSTD-------DPFDTFLVVSFISETRILAMNLEDELEE-TEIEGF 392
W+ LR S DP+ +LV+S S T IL +EL+E +E
Sbjct: 589 WTVHHDSADNERLRGSAAAAAAQAVDPYHAYLVISLASSTMILETG--EELKEVSEHVEL 646
Query: 393 CSQTQTLFCHDA--------IYNQLVQVTSGSVRLVSSTSRELRNEWKSPPGYSVNVATA 444
+ TL +A +Y++ V+V +G V++ S EL + G + A
Sbjct: 647 VTDAATLCAGNAFGRERIVQVYDKGVRVAAGPVKVQDIASTELVADAGDGEGIEIVAAEI 706
Query: 445 NASQVLLATGGGHLVYLEIGDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWT 504
+ VL + DG L +K + + LD++ + P IA +
Sbjct: 707 SFPYVL----------CRLSDGSLAVLKGDEESKTLVKLDVDALARLPPGGGIACATLVD 756
Query: 505 D 505
D
Sbjct: 757 D 757
>gi|325094074|gb|EGC47384.1| cleavage factor two protein 1 [Ajellomyces capsulatus H88]
Length = 1377
Score = 60.1 bits (144), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 104/431 (24%), Positives = 172/431 (39%), Gaps = 86/431 (19%)
Query: 655 RSIPLGEHPRRICHQEQSRTFAICSLKNQ----SCAEESEMH-----------------F 693
R I LGE + + S T+ I + NQ + E+ E+H
Sbjct: 976 RKIGLGEQVDAVEYSSSSETYVIGT--NQKVDFNLPEDDEIHPEWRNEVISFLPQIDKGS 1033
Query: 694 VRLLDDQTFEFISTYPLDTFE----YGCSILSCS-FSDDSNVYYCVGTAYVLPEENEPTK 748
V+LL +T+ I +Y L E C L S + + VGTA + E+ +
Sbjct: 1034 VKLLTPRTWSIIDSYNLRNAERIMCVKCLNLEVSEITHERKDTIVVGTA-LTKGEDIAAR 1092
Query: 749 GRILVFIV-----------EDGKLQLIAEKETKGAVYSLNAFNGK--LLAAINQKIQLYK 795
G I +F V + KL+LIA++E KGAV SL+ G+ L+AA QK +
Sbjct: 1093 GCIYIFEVIKVVPEVDRPETNRKLKLIAKEEVKGAVTSLSGIGGQGFLIAAQGQKCIVRG 1152
Query: 796 WMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARD 855
L++DG+ +L + ++ V +K GAI
Sbjct: 1153 --LKEDGS------------LLPVAFMDMQCYVNVLKELKG----------GAITNCL-- 1186
Query: 856 YNANWMSAVEILDDDIYLGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRF- 914
++A V DD +L EN + + G++ +R L + G F +
Sbjct: 1187 FSARM--TVPSSDDADFLPDENRLYILVADDDYPGSSKGDR--LLHRSTFQTGHFASTMT 1242
Query: 915 ---RHGSLVMRLPDSD--------VGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTN 963
R + + PD+D G + V+ + G I +I + Y L LQ+
Sbjct: 1243 LLPRTATSSSQGPDADPDMMDLDSSGPLHHVLVTSETGSIALITPVSETSYRRLSALQSQ 1302
Query: 964 LRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVE 1023
L ++ GLN +R+ E + + +DGDL++ +LDL R EI+ + V
Sbjct: 1303 LANTLEHPCGLNPRAFRAV--ESDGIGGRGMVDGDLVKRWLDLGTQRKAEIANRVGADVW 1360
Query: 1024 ELCKRVEELTR 1034
E+ +E + +
Sbjct: 1361 EIRADLEAIGK 1371
>gi|302652143|ref|XP_003017931.1| hypothetical protein TRV_08063 [Trichophyton verrucosum HKI 0517]
gi|291181517|gb|EFE37286.1| hypothetical protein TRV_08063 [Trichophyton verrucosum HKI 0517]
Length = 429
Score = 59.7 bits (143), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 70/312 (22%), Positives = 132/312 (42%), Gaps = 27/312 (8%)
Query: 739 VLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWML 798
V+PE ++P K + KL+L A++E KGAV +L+ G+ + Q + L
Sbjct: 125 VVPEPDQPEKSK---------KLKLFAKEEVKGAVTALSGIGGQGFLIVAQGQKCMVRGL 175
Query: 799 RDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNA 858
++DG+ + ++ L ++GD K + + Y E ++ ++
Sbjct: 176 KEDGSLLPVAFKDTQCYVNVLKELKGTGMCIIGDAFKGLWFIGYSEEPYKLDLFGKENEN 235
Query: 859 NWMSAVEILDD--DIY-LGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFR 915
+ + L D +Y L A+++ NL ++ + E + + RL +H G F +
Sbjct: 236 LAVVDADFLPDGNKLYILVADDDCNLHVLQYDPEDPSSSKGDRLLHRSVFHTGHFASTMT 295
Query: 916 ---HGSLVMRLP--------DSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNL 964
HG+ P DS ++ G + VI L + Y L LQ+ L
Sbjct: 296 LLPHGARTPSSPVDEDAMDTDSPPPSKYQILMTFQTGSVAVITPLGEDSYRRLLALQSQL 355
Query: 965 RKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEE 1024
++ LN +R+ ++ + +DG+L+ +LD+ R EI+ + V
Sbjct: 356 VNALEHPCSLNPRGYRAVESDGMG-GQRGMIDGNLLLRWLDMGAQRKAEIAGRVGADVGA 414
Query: 1025 LCKRVEELTRLH 1036
+ RV +L +LH
Sbjct: 415 I--RV-DLEKLH 423
>gi|308504990|ref|XP_003114678.1| hypothetical protein CRE_28194 [Caenorhabditis remanei]
gi|308258860|gb|EFP02813.1| hypothetical protein CRE_28194 [Caenorhabditis remanei]
Length = 270
Score = 59.7 bits (143), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 59/274 (21%), Positives = 119/274 (43%), Gaps = 46/274 (16%)
Query: 791 IQLYKWMLRDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIE 850
+++ W G G + + + +Q+ G I+V D +S+ L Y+ + +
Sbjct: 5 FEIFPW-----GNHLFDDRMGKNFPVSIVNIQSTGQRIIVSDSQESVHFLRYRKGDNQLV 59
Query: 851 ERARDYNANWMSAVEILDDDIYLGAENNFNLFTVR---KNSEGATDE--------ERG-- 897
A D ++S V +LD A+ NL VR + +E D+ +RG
Sbjct: 60 VFADDTTPRYVSCVCVLDYHTVAVADKFGNLAVVRLPERVNEDVQDDPTVSKSVWDRGWL 119
Query: 898 -----RLEVVGEYHLGEFVNRFRHGSLVMRLPDSDVGQIPTVIFGTVNGVIGVIAS-LPH 951
++E+V + +G+ + + SL +P G +++ T+ G IG + S +
Sbjct: 120 NGASQKVELVANFFIGDTITSLQKTSL---MP----GANEALVYTTIGGAIGCLVSFMSK 172
Query: 952 EQYLFLEKLQTNLRKVIKGVGGLNHEQWRSFNNEKKTVDAKNFL-------------DGD 998
++ F L+ ++R + G +H +RS+ K NFL DGD
Sbjct: 173 DEVDFFTNLEMHVRSEYPPLCGRDHLAYRSYYAPCKV--CFNFLLFRSIVSLFQSVIDGD 230
Query: 999 LIESFLDLSRTRMDEISKTMNVSVEELCKRVEEL 1032
+ E F + ++ E+++ + +V E+ K++E++
Sbjct: 231 ICEQFSLMDLSKQKEVAEELGKTVSEISKKLEDI 264
>gi|255725990|ref|XP_002547921.1| hypothetical protein CTRG_02218 [Candida tropicalis MYA-3404]
gi|240133845|gb|EER33400.1| hypothetical protein CTRG_02218 [Candida tropicalis MYA-3404]
Length = 1053
Score = 59.7 bits (143), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 74/340 (21%), Positives = 146/340 (42%), Gaps = 21/340 (6%)
Query: 271 SQLIKLNLQPDAKGSYVEVLERYVNLGPIVDFCVVDLERQGQGQVVTCSGAYKDGSLRIV 330
+++I +L+ + ++ LE L PI+D +D ++VT S L+ V
Sbjct: 387 TKVITFDLKSLTNLALIDTLE---TLSPILDAKSID------SKLVTLSSH---SYLKTV 434
Query: 331 RNGIGINEQASVELQGI-KGMWSLRSSTDDPFDTFLVVSFISETRILAMNLEDELEETEI 389
+GI + L I +++ + S + D +LV+S ++ L +++ + +E+ E
Sbjct: 435 THGIPTSTMVESPLPVIPTDIFTTKLSFESENDEYLVISSSLSSKTLVLSIGEVVEDVED 494
Query: 390 EGFCSQTQTLFCHDAIYNQLVQVTSGSVRLVSSTSRELR--NEWKSPPGYSVNVATANAS 447
F T+ N +VQ+ + ++ V + E + +W P G ++ A N
Sbjct: 495 SEFVLDQPTIAVQQVGKNSVVQIYTNGIKHVRQINGEKKKITDWLPPAGITITHAATNNQ 554
Query: 448 QVLLATGGGHLVYLEI---GDGILTEVKHAQLEYEISCLDINPIGENPSYSQIAAVGMWT 504
QVL+A +VY EI D ++ + +L + L I N S A +G +
Sbjct: 555 QVLIALSNLEVVYFEIDHLDDQLIEYQERLELSSSTTALAIEEHTNNHHQSPFAIIG-CS 613
Query: 505 DISVRIFSLPDLNLITKEHLGG-EIIPRSVLLCAFEGISYLLCALGDGHLLNFLLNMKTG 563
D ++++ SL N + + + SV + + + +G ++ G
Sbjct: 614 DETIQVVSLKQQNCLEIQSIQALSSNCSSVKMIKLGKDLMVHIGMNNGVYARIKIDPING 673
Query: 564 ELTDRKKVSLGTQPITLRTFS-SKNTTHVFAASDRPTVIY 602
+L+D + LG++P+ L +K T V A S + + Y
Sbjct: 674 KLSDSRIKYLGSKPVKLNIVKINKEITGVLAVSSKSWIGY 713
>gi|302506529|ref|XP_003015221.1| hypothetical protein ARB_06344 [Arthroderma benhamiae CBS 112371]
gi|291178793|gb|EFE34581.1| hypothetical protein ARB_06344 [Arthroderma benhamiae CBS 112371]
Length = 1370
Score = 59.3 bits (142), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 70/312 (22%), Positives = 131/312 (41%), Gaps = 27/312 (8%)
Query: 739 VLPEENEPTKGRILVFIVEDGKLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWML 798
V+PE ++P K + KL+L A++E KGAV +L+ G+ + Q + L
Sbjct: 1036 VVPEPDQPEKNK---------KLKLFAKEEVKGAVTALSGIGGQGFLIVAQGQKCMVRGL 1086
Query: 799 RDDGTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNA 858
++DG+ + ++ L ++GD K + + Y E ++ ++
Sbjct: 1087 KEDGSLLPVAFKDTQCYVNVLKELKGTGMCIIGDAFKGLWFIGYSEEPYKLDLFGKENEN 1146
Query: 859 NWMSAVEILDD--DIY-LGAENNFNLFTVRKNSEGATDEERGRLEVVGEYHLGEFVNRFR 915
+ + L D +Y L A+++ NL ++ + E + + RL +H G F +
Sbjct: 1147 LAVVDADFLPDGNKLYILVADDDCNLHVLQYDPEDPSSSKGDRLLHRSVFHTGHFASTMT 1206
Query: 916 ---HGSLVMRLP--------DSDVGQIPTVIFGTVNGVIGVIASLPHEQYLFLEKLQTNL 964
HG P DS ++ G I +I L + Y L LQ+ L
Sbjct: 1207 LLPHGGHTPSSPVDEDAMDTDSPPPSKYQILMTFQTGSIAIITPLGEDSYRRLLALQSQL 1266
Query: 965 RKVIKGVGGLNHEQWRSFNNEKKTVDAKNFLDGDLIESFLDLSRTRMDEISKTMNVSVEE 1024
++ LN +R+ ++ + +DG+L+ +LD+ R EI+ + V
Sbjct: 1267 VNALEHPCSLNPRGYRAVESDGMG-GQRGMIDGNLLLRWLDMGAQRKAEIAGRVGADVGA 1325
Query: 1025 LCKRVEELTRLH 1036
+ RV +L +LH
Sbjct: 1326 I--RV-DLEKLH 1334
>gi|169864473|ref|XP_001838845.1| cleavage factor protein [Coprinopsis cinerea okayama7#130]
gi|116500065|gb|EAU82960.1| cleavage factor protein [Coprinopsis cinerea okayama7#130]
Length = 1458
Score = 59.3 bits (142), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 111/517 (21%), Positives = 207/517 (40%), Gaps = 74/517 (14%)
Query: 562 TGELTDRKKVSLGTQPITLRTFSSKNT-----THVFAASDRPTVIYSSNK---KLLYSNV 613
+G L + KK+ P T++ +S T + VF D+P I ++K ++ S
Sbjct: 940 SGILAEHKKIQRLFVPFTVKPKTSDGTPSPTYSGVFFTGDKPNWIIGTDKGGVQIYPSGH 999
Query: 614 NLKEVSHMCPF--NSAAFPDSLAIAKEGELTIGTIDDIQKLH---IRSIPLGEHPRRICH 668
N+ C F L ++G I + D H RSIP G +
Sbjct: 1000 NVVHSFSACSLWEERGEF---LVYTEDGPCLIEWLPDFTYSHPLPARSIPRGRGYSNVVF 1056
Query: 669 QEQSRTFAICSLKNQSCAEESEMHFVR------------LLDDQTFEFIST---YPLDTF 713
+ S + + Q+ + VR + D E IS +D F
Sbjct: 1057 -DPSTCLIVAASSMQARFASYDEDGVRVWEKDGPGVDDPITDTSALELISPNSWITMDGF 1115
Query: 714 EYGC----------SILSCSFSDDSNVYYCVGTAYVLPEENEPTKGRILVF----IVEDG 759
E+ ++ + + S + VGT + E+ KG +F +V D
Sbjct: 1116 EFATNEYINDISIVTLETAATETGSKDFIAVGTT-IDRGEDLAAKGAAYIFEIVEVVPDP 1174
Query: 760 --------KLQLIAEKETKGAVYSLNAFNGKLLAAINQKIQLYKWMLRDDGTRELQSECG 811
KL+L + KG V ++ F G L++++ QKI + + D L
Sbjct: 1175 AISPTRWYKLRLRCRDDAKGPVTAVCGFQGYLVSSMGQKIFVRAF----DSDERLVGVAF 1230
Query: 812 HHGHILALYVQTRGDFIVVGDLMKSISLLIYKHEEGAIEERARDYNANWMSAVEIL---D 868
I ++ + +++GD +KS+ + ++ + + A+D + + ++ + D
Sbjct: 1231 MDVGIYVTSLRVLKNLLLIGDAVKSVMFVAFQEDPYKLVLLAKDVHLHSVTRADFFFNAD 1290
Query: 869 DDIYLGAENNFNLFTVRKNSEGATDEERGR-LEVVGEYHLGEFVNRFRHGSLVMRLPDSD 927
D+ L + + + + + D GR L + EYH G+ H S + D +
Sbjct: 1291 GDLALIVGDEEGIMRIYEYNPNDPDSRDGRYLLLRTEYH-GQVP---YHTSTTIARRDKE 1346
Query: 928 VGQIPT--VIFGTVNGVIGVIASLPHEQYLF--LEKLQTNLRKVIKGVGGLNHEQWRSFN 983
IP ++ G+ +G + + +P ++Y F L+ LQ L + I+ V GLN + +R
Sbjct: 1347 DPSIPQSHLLIGSADGSLSSL--VPVDEYAFKRLQLLQGQLTRNIQHVAGLNPKAFRIVK 1404
Query: 984 NEKKTVD-AKNFLDGDLIESFLDLSRTRMDEISKTMN 1019
N+ + +K LDG L+ + L R +E++K +
Sbjct: 1405 NDYVSKPLSKGILDGQLLAQYESLPIPRQNEMTKQIG 1441
>gi|358338426|dbj|GAA28838.2| cleavage and polyadenylation specificity factor subunit 1
[Clonorchis sinensis]
Length = 1741
Score = 59.3 bits (142), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 60/218 (27%), Positives = 93/218 (42%), Gaps = 37/218 (16%)
Query: 111 VLDIKFLYGCAKPTIVVLYQD------NKDARHVKTYEVALK---DKDFVEGPWSQNNLD 161
VLD++FL G +PT++VLY+ AR VAL K W Q +L
Sbjct: 158 VLDMQFLNGFYEPTLLVLYEPIGTWAGRVSARRDTCCIVALSFNLQKRTNPVIWFQESLP 217
Query: 162 NGADLLIPVPPPLCGVLIIGEETIVY-------------CSANAFKAIPIRPSITKAYGR 208
+ VP P+ GVLI+ +I+Y C A P+R + + G
Sbjct: 218 YDCTYVHSVPEPIGGVLILATNSIIYMKQTLPSCGLPLNCYAQVTTNFPMRQDVPQC-GP 276
Query: 209 VDADGSR--------YLLGDHAGLLHLLV--ITHEKEKVTGLKIELLGETSIASTISYLD 258
+ DG R +L+ G + LL + H + V+ L + +G + +++ LD
Sbjct: 277 LTLDGCRIVTMTDSQFLIVTRTGKMCLLSLWVEHTTQTVSSLLLHEIGCSVPPYSVALLD 336
Query: 259 NAVVYIGSSYGDSQLIKLNLQPDAKGSYVEVLERYVNL 296
V++GS DS L+ L A +V L R V+L
Sbjct: 337 KGYVFVGSRLCDSVLLHLT----ASTMFVNTLGRIVDL 370
Score = 46.2 bits (108), Expect = 0.087, Method: Compositional matrix adjust.
Identities = 56/258 (21%), Positives = 105/258 (40%), Gaps = 61/258 (23%)
Query: 650 QKLHIRSIPLGEHPRRICHQEQSRTFAICSLKNQSCA----------EESEM-------- 691
+ L +R +PL P + + +S+T+A+ + +SC+ +E E+
Sbjct: 1245 EHLGMRWVPLELTPYFLQYHIESKTYALVGTRVKSCSSVYHLNAEGNKEEEVLLRPPTCV 1304
Query: 692 -----HFVRLLDDQTFEFISTYPLDTFEYGC------SILSCSFSDD---------SNVY 731
++V + + P + C +++C + + Y
Sbjct: 1305 LPSLDYYVLQMYAPSTSLAEATPWQAIPHACIDFEPWEVVTCMITAQLSSEQTFHGTKDY 1364
Query: 732 YCVGTAYVLPEENEPTKGRILVFIVED-----------GKLQLIAEKETKGAVYSLNAFN 780
+G EE P +GRI++ V D KL+ I + E KG V +L++
Sbjct: 1365 LALGANLSYGEEI-PVRGRIIILDVIDVVPEPGQPLTRHKLKTIYDGEQKGPVTALSSCQ 1423
Query: 781 GKLLAAINQKIQLYKWMLRDD---GTRELQSECGHHGHILALYVQTRGDFIVVGDLMKSI 837
G L++AI QK+ Y W L++ G + SE H + + I+ D++KSI
Sbjct: 1424 GHLVSAIGQKV--YIWTLKNADLVGVAFVDSELYIHSLLCV------KNLILAADVLKSI 1475
Query: 838 SLLIYKHEEGAIEERARD 855
LL ++ + + +RD
Sbjct: 1476 QLLRFQSDLRVLSVVSRD 1493
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.319 0.138 0.404
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 16,396,485,688
Number of Sequences: 23463169
Number of extensions: 723575118
Number of successful extensions: 1575804
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 588
Number of HSP's successfully gapped in prelim test: 596
Number of HSP's that attempted gapping in prelim test: 1567224
Number of HSP's gapped (non-prelim): 3132
length of query: 1036
length of database: 8,064,228,071
effective HSP length: 153
effective length of query: 883
effective length of database: 8,769,330,510
effective search space: 7743318840330
effective search space used: 7743318840330
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 82 (36.2 bits)